Project

General

Profile

Statistics
| Revision:
Name Size Revision Age Author Comment
  _archive 1598 over 12 years Aaron Marcuse-Kubitza Moved _archive/tapir2flatClient/trunk/client/ t...
  analysis 3076 over 12 years Aaron Marcuse-Kubitza Added top-level analysis dir for range modeling
  backups 4200 over 12 years Aaron Marcuse-Kubitza backups/: svn:ignore: Also ignore .*, which inc...
  bin 4506 about 12 years Aaron Marcuse-Kubitza join: Documented that it's case- and punctuatio...
  config 272 almost 13 years Aaron Marcuse-Kubitza Moved bien_password to new config dir
  inputs 4504 about 12 years Aaron Marcuse-Kubitza intersect, union: Made case- and punctuation-in...
  lib 4499 about 12 years Aaron Marcuse-Kubitza maps.py: Added simplify()
  mappings 4504 about 12 years Aaron Marcuse-Kubitza intersect, union: Made case- and punctuation-in...
  schemas 4501 about 12 years Aaron Marcuse-Kubitza schemas/functions.sql: _alt(): Added extra numb...
  to_do 4091 over 12 years Aaron Marcuse-Kubitza to_do/: svn:ignore OpenOffice lock files
Makefile 10.1 KB 3764 over 12 years Aaron Marcuse-Kubitza root Makefile, input.Makefile: Maps validation:...
README.TXT 10.4 KB 4487 about 12 years Aaron Marcuse-Kubitza README.TXT: Data import: Added testing steps to...
map 1.22 KB 3475 over 12 years Aaron Marcuse-Kubitza root map: Run bin/map with a nice increment of ...

Latest revisions

# Date Author Comment
4506 09/07/2012 09:16 AM Aaron Marcuse-Kubitza

join: Documented that it's case- and punctuation-insensitive.

4505 09/07/2012 09:16 AM Aaron Marcuse-Kubitza

bin/map: map_table(): Refactored to map simplified to original column names first and then determine column index for each original name, in order to avoid trying to recover the original name from a simplified name where multiple original names might collide onto the same simplified name. Documented that it's case- and punctuation-insensitive.

4504 09/07/2012 09:11 AM Aaron Marcuse-Kubitza

intersect, union: Made case- and punctuation-insensitive. mappings/Veg+-VegBIEN.csv: Removed no longer needed duplicate entries for each first letter case, which must now be removed for case- and punctuation-insensitive intersect/union to work. Note that the SpeciesLink `svn diff` hides _alt entry 0, which contains one of the removed duplicate columns that appears in the diff.

4503 09/07/2012 08:42 AM Aaron Marcuse-Kubitza

bin/map: map_table(): Resolve all mappings and prefixes after applying maps.simplify()

4502 09/07/2012 08:37 AM Aaron Marcuse-Kubitza

inputs/SpeciesLink/Specimen/map.csv: _alt all scientificNameAuthorship synonyms together in one _alt

4501 09/07/2012 08:27 AM Aaron Marcuse-Kubitza

schemas/functions.sql: _alt(): Added extra numbered parameters. Eventually these will need to be converted to variadic args, but this will require special support from column-based import.

4500 09/07/2012 07:26 AM Aaron Marcuse-Kubitza

join: Use new maps.simplify()

4499 09/07/2012 07:26 AM Aaron Marcuse-Kubitza

maps.py: Added simplify()

4498 09/07/2012 07:23 AM Aaron Marcuse-Kubitza

join: Match terms with non-alphanumeric chars removed

4497 09/07/2012 07:15 AM Aaron Marcuse-Kubitza

join: Match terms case-insensitively

View all revisions | View revisions

Also available in: Atom