Name | Size | Revision | Age | Author | Comment
logs | | 8801 | over 11 years | Aaron Marcuse-Kubitza | inputs/input.Makefile: SVN: add, %/add: */logs:...
.map.csv.last_cleanup | 0 Bytes | 5491 | about 12 years | Aaron Marcuse-Kubitza | Added inputs/.NCBI/. This uses many of the new ...
VegBIEN.csv | 42 Bytes | 10174 | over 11 years | Aaron Marcuse-Kubitza | bugfix: inputs/input.Makefile: %/VegBIEN.csv: f...
_no_import | 0 Bytes | 14036 | over 10 years | Aaron Marcuse-Kubitza | bugfix: added nodes/_no_import, because the NCB...
create.sql | 1.17 KB | 11863 | about 11 years | Aaron Marcuse-Kubitza | bugfix: inputs/.NCBI/import_order.txt: added no...
header.csv | 143 Bytes | 11863 | about 11 years | Aaron Marcuse-Kubitza | bugfix: inputs/.NCBI/import_order.txt: added no...
map.csv | 345 Bytes | 11863 | about 11 years | Aaron Marcuse-Kubitza | bugfix: inputs/.NCBI/import_order.txt: added no...
run | 63 Bytes | 10098 | over 11 years | Aaron Marcuse-Kubitza | inputs/.NCBI/: added new-style import runscript...
test.xml.ref | 2.82 KB | 11396 | about 11 years | Aaron Marcuse-Kubitza | fix: bin/map: put template: comment out the "Pu...
unmapped_terms.csv | 103 Bytes | 5722 | about 12 years | Aaron Marcuse-Kubitza | inputs/.NCBI/nodes/header.csv: Updated for new ...
  • svn:ignore: *

Latest revisions

# Date Author Comment
14036 07/14/2014 08:24 PM Aaron Marcuse-Kubitza

bugfix: added nodes/_no_import, because the NCBI import doesn't work due to a Postgres 9.3 bug (#859). _no_import used to not be needed because we didn't import the hidden sources.

11970 01/20/2014 11:33 AM Aaron Marcuse-Kubitza

moved everything into /trunk/ to create the standard svn layout, for use with tools that require this (e.g. git-svn). IMPORTANT: do NOT do an `svn up`. instead, re-use your working copy's existing files with `svn switch` (http://svnbook.red-bean.com/en/1.6/svn.ref.svn.c.switch.html).

11863 12/06/2013 06:12 AM Aaron Marcuse-Kubitza

bugfix: inputs/.NCBI/import_order.txt: added nodes.src, names.src so that these would be installed under new-style import as well. this means that their columns will now be automapped, requiring the names to be renamed to VegCore names in nodes/create.sql.

11396 10/21/2013 07:14 PM Aaron Marcuse-Kubitza

fix: bin/map: put template: comment out the "Put template:" label so that the output is valid XML, and displays properly in a browser rather than showing a syntax error

11107 09/29/2013 08:58 PM Aaron Marcuse-Kubitza

bugfix: mappings/VegCore-VegBIEN.csv: nest all taxonoccurrences inside a stratum event, so that the parent locationevent is always fully populated before child locationevents point to it. (previously, a stub parent event was created when the child event was imported first, which blocked the fully-populated parent event from being inserted later on.) this uses auto-folding (for VegBank/CVS) and auto-forwarding (for other datasources) to prune empty stratum events for taxonoccurrences that don't have strata. (see wiki.vegpath.org/Auto-folding, wiki.vegpath.org/Auto-forwarding for more info about these normalization techniques.) note that the inserted row counts stay exactly the same for all datasources except VegBank (which was being fixed), indicating that this significant change to the mappings did not change the semantics of the import of taxonoccurrences.

10866 09/04/2013 11:06 PM Aaron Marcuse-Kubitza

inputs/*/*/test.xml.ref: updated source.shortname for new datasource name, which now starts out with .new suffix

10174 07/06/2013 03:55 PM Aaron Marcuse-Kubitza

bugfix: inputs/input.Makefile: %/VegBIEN.csv: for new-style datasources, use a symlink to mappings/VegCore-VegBIEN.csv directly instead of prefiltering VegCore-VegBIEN.csv to include only the columns in map.csv. prefiltering used to be performed as part of mapping the map.csv VegCore output terms to VegBIEN using bin/join, but is no longer needed because the staging table columns are now VegCore terms. instead, the full VegCore-VegBIEN.csv is needed so that derived columns added in stage I or II validations are detected by bin/map (rather than just the original source columns in map.csv).
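
The reasoning in revision 10174 can be illustrated with a small Python sketch. It is not the project's bin/join or bin/map code: the `prefilter` function, the two-column row layout, and all term names and paths below are hypothetical, chosen only to show why a mapping prefiltered to the map.csv terms would miss a column that a later validation step derives.

```python
def prefilter(mapping_rows, source_terms):
    """Keep only the mapping rows whose input term is one of source_terms."""
    return [row for row in mapping_rows if row[0] in source_terms]


# Hypothetical mapping rows: (input term, output path). Not real VegCore terms.
mapping = [
    ("taxonName", "/taxon/name"),
    ("parentTaxonID", "/taxon/parent"),
    ("derivedRank", "/taxon/rank"),  # imagined column added by a validation step
]

# Imagined output terms listed in a datasource's map.csv
source_terms = {"taxonName", "parentTaxonID"}

filtered = prefilter(mapping, source_terms)

# The derived column survives only in the full (unfiltered) mapping
dropped = [row for row in mapping if row not in filtered]
```

Under these assumptions, `dropped` holds the row for the derived column, which is the case the commit avoids by pointing at the full mapping file instead of a filtered copy.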

10098 06/27/2013 09:54 PM Aaron Marcuse-Kubitza

inputs/.NCBI/: added new-style import runscripts, which renamed the staging table columns to VegCore

8801 05/02/2013 08:53 PM Aaron Marcuse-Kubitza

inputs/input.Makefile: SVN: add, %/add: */logs: also svn:ignore *.gz, used for compressed log files

8176 03/25/2013 09:01 PM Aaron Marcuse-Kubitza

inputs/input.Makefile: %/.map.csv.last_cleanup: Run fix_line_endings after canon/translate to standardize Python's \r\n line endings back to \n. This prevents issues with mixed line endings because LibreOffice (and probably Excel) treat all cell-internal line endings as \n but row line endings as whatever the file had, while text editors like jEdit translate all line endings to whatever the autodetected line ending is. (This creates spurious line ending diffs when a map spreadsheet containing multiline cells is edited in a text editor.)
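
The normalization described in revision 8176 can be sketched roughly as follows. This is a minimal Python sketch, not the project's actual fix_line_endings script: the function name `normalize_line_endings` and the sample CSV content are hypothetical, and it assumes the file is handled as raw bytes so that no newline translation happens on read or write.

```python
def normalize_line_endings(data: bytes) -> bytes:
    """Replace every CRLF pair with a single LF; lone CR bytes are left untouched."""
    return data.replace(b"\r\n", b"\n")


# Hypothetical CSV content: the multiline cell already uses \n, while rows end in \r\n
mixed = b'term,comment\r\nheight,"line one\nline two"\r\n'
normalized = normalize_line_endings(mixed)
```

After this step every line ending in the sample is \n, so an editor that autodetects a single line-ending style has nothing to rewrite.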
