Project

General

Profile

Statistics
| Revision:
Name Size Revision Age Author Comment
  logs 8801 over 11 years Aaron Marcuse-Kubitza inputs/input.Makefile: SVN: add, %/add: */logs:...
+header.dat 2.26 KB 7218 almost 12 years Aaron Marcuse-Kubitza inputs/Madidi/: Refreshed. Note that new export...
.map.csv.last_cleanup 0 Bytes 7218 almost 12 years Aaron Marcuse-Kubitza inputs/Madidi/: Refreshed. Note that new export...
VegBIEN.csv 42 Bytes 10301 over 11 years Aaron Marcuse-Kubitza inputs/Madidi/: switched to new-style import
header.csv 0 Bytes 7218 almost 12 years Aaron Marcuse-Kubitza inputs/Madidi/: Refreshed. Note that new export...
map.csv 6.16 KB 10344 over 11 years Aaron Marcuse-Kubitza bugfix: inputs/Madidi/IndividualObservation/map...
new_terms.csv 1.08 KB 10344 over 11 years Aaron Marcuse-Kubitza bugfix: inputs/Madidi/IndividualObservation/map...
postprocess.sql 108 Bytes 10301 over 11 years Aaron Marcuse-Kubitza inputs/Madidi/: switched to new-style import
run 63 Bytes 10301 over 11 years Aaron Marcuse-Kubitza inputs/Madidi/: switched to new-style import
test.xml.ref 11.7 KB 11396 about 11 years Aaron Marcuse-Kubitza fix: bin/map: put template: comment out the "Pu...
unmapped_terms.csv 435 Bytes 10209 over 11 years Aaron Marcuse-Kubitza inputs/*/*/map.csv for CSV tables with a row_nu...
  • svn:ignore: *

Latest revisions

# Date Author Comment
11970 01/20/2014 11:33 AM Aaron Marcuse-Kubitza

moved everything into /trunk/ to create the standard svn layout, for use with tools that require this (eg. git-svn). IMPORTANT: do NOT do an `svn up`. instead, re-use your working copy's existing files with `svn switch` (http://svnbook.red-bean.com/en/1.6/svn.ref.svn.c.switch.html).

11396 10/21/2013 07:14 PM Aaron Marcuse-Kubitza

fix: bin/map: put template: comment out the "Put template:" label so that the output is valid XML, and displays properly in a browser rather than showing a syntax error

11107 09/29/2013 08:58 PM Aaron Marcuse-Kubitza

bugfix: mappings/VegCore-VegBIEN.csv: nest all taxonoccurrences inside a stratum event, so that the parent locationevent is always fully populated before child locationevents point to it. (previously, a stub parent event was created when the child event was imported first, which blocked the fully-populated parent event from being inserted later on.) this uses auto-folding (for VegBank/CVS) and auto-forwarding (for other datasources) to prune empty stratum events for taxonoccurrences that don't have strata. (see wiki.vegpath.org/Auto-folding, wiki.vegpath.org/Auto-forwarding for more info about these normalization techniques.) note that the inserted row counts stay exactly the same for all datasources except VegBank (which was being fixed), indicating that this signficant change to the mappings did not change the semantics of the import of taxonoccurrences.

10866 09/04/2013 11:06 PM Aaron Marcuse-Kubitza

inputs/*/*/test.xml.ref: updated source.shortname for new datasource name, which now starts out with .new suffix

10344 07/19/2013 09:30 AM Aaron Marcuse-Kubitza

bugfix: inputs/Madidi/IndividualObservation/map.csv: removed derived column FieldFamilyFullName#originalFamily, which should not be in the map table because it can contain only columns that are initially in the table before running postprocess.sql

10301 07/17/2013 11:40 AM Aaron Marcuse-Kubitza

inputs/Madidi/: switched to new-style import

10298 07/17/2013 09:27 AM Aaron Marcuse-Kubitza

inputs/Madidi/IndividualObservation/map.csv: translated 1:many mappings ( FieldFamilyFullName->{family,originalFamily} ) to derived columns (in postprocess.sql) to work with new-style import, which must have a 1:1 relationship between input and output columns

10292 07/17/2013 04:24 AM Aaron Marcuse-Kubitza

inputs/*/*/map.csv: replaced /_first filter with mapping to DUPLICATE special term (VegCore.vegpath.org?DUPLICATE). this removes collisions that don't need a postprocessing formula to combine the columns.

10290 07/17/2013 03:15 AM Aaron Marcuse-Kubitza

inputs/Madidi/IndividualObservation/map.csv: SeniorCollector: don't prepend it to the CollectorString because the CollectorString already contains it. this may be a change between the BIEN2 and refreshed Madidi data (which uses a significantly different schema).

10257 07/11/2013 12:09 PM Aaron Marcuse-Kubitza

inputs/*/*/map.csv: added distinguishing #... suffix (e.g. UNUSED#institutionID) to the special terms OMIT, PRIVATE, UNUSED (VegCore.vegpath.org#Special-terms) to avoid creating a collision in the staging table renaming

View revisions

Also available in: Atom