Project

General

Profile

Statistics
| Revision:
Name Size Revision Age Author Comment
  logs 8801 over 11 years Aaron Marcuse-Kubitza inputs/input.Makefile: SVN: add, %/add: */logs:...
.map.csv.last_cleanup 0 Bytes 4118 over 12 years Aaron Marcuse-Kubitza inputs: Moved maps into subfolders, using the s...
VegBIEN.csv 42 Bytes 10530 over 11 years Aaron Marcuse-Kubitza inputs/SpeciesLink/: switched to new-style impo...
header.csv 3.91 KB 10091 over 11 years Aaron Marcuse-Kubitza added inputs/*/*/header.csv for CSV inputs, whi...
map.csv 12.3 KB 10530 over 11 years Aaron Marcuse-Kubitza inputs/SpeciesLink/: switched to new-style impo...
new_terms.csv 12.1 KB 10529 over 11 years Aaron Marcuse-Kubitza inputs/SpeciesLink/Specimen/map.csv: renamed DU...
postprocess.sql 718 Bytes 12516 almost 11 years Aaron Marcuse-Kubitza bugfix: *.sql: public.source_by_shortname(): ne...
run 63 Bytes 10530 over 11 years Aaron Marcuse-Kubitza inputs/SpeciesLink/: switched to new-style impo...
specieslink.0.header.csv 3.91 KB 4101 over 12 years Aaron Marcuse-Kubitza inputs: Moved src files into VegCSV subfolders ...
test.xml.ref 24.4 KB 11396 about 11 years Aaron Marcuse-Kubitza fix: bin/map: put template: comment out the "Pu...
unmapped_terms.csv 205 Bytes 10528 over 11 years Aaron Marcuse-Kubitza inputs/SpeciesLink/Specimen/: translated multi-...
  • svn:ignore: *

Latest revisions

# Date Author Comment
12516 02/27/2014 01:27 PM Aaron Marcuse-Kubitza

bugfix: *.sql: public.source_by_shortname(): need to wrap it in a nested SELECT because Postgres incorrectly does not constant-fold (inline) it, leading to a slowdown when it is therefore run many times. this is done using the steps at wiki.vegpath.org/Postgres_queries#wrap-function-call-in-nested-SELECT .

11970 01/20/2014 11:33 AM Aaron Marcuse-Kubitza

moved everything into /trunk/ to create the standard svn layout, for use with tools that require this (eg. git-svn). IMPORTANT: do NOT do an `svn up`. instead, re-use your working copy's existing files with `svn switch` (http://svnbook.red-bean.com/en/1.6/svn.ref.svn.c.switch.html).

11396 10/21/2013 07:14 PM Aaron Marcuse-Kubitza

fix: bin/map: put template: comment out the "Put template:" label so that the output is valid XML, and displays properly in a browser rather than showing a syntax error

11107 09/29/2013 08:58 PM Aaron Marcuse-Kubitza

bugfix: mappings/VegCore-VegBIEN.csv: nest all taxonoccurrences inside a stratum event, so that the parent locationevent is always fully populated before child locationevents point to it. (previously, a stub parent event was created when the child event was imported first, which blocked the fully-populated parent event from being inserted later on.) this uses auto-folding (for VegBank/CVS) and auto-forwarding (for other datasources) to prune empty stratum events for taxonoccurrences that don't have strata. (see wiki.vegpath.org/Auto-folding, wiki.vegpath.org/Auto-forwarding for more info about these normalization techniques.) note that the inserted row counts stay exactly the same for all datasources except VegBank (which was being fixed), indicating that this signficant change to the mappings did not change the semantics of the import of taxonoccurrences.

10866 09/04/2013 11:06 PM Aaron Marcuse-Kubitza

inputs/*/*/test.xml.ref: updated source.shortname for new datasource name, which now starts out with .new suffix

10530 07/31/2013 04:04 PM Aaron Marcuse-Kubitza

inputs/SpeciesLink/: switched to new-style import, using the steps at wiki.vegpath.org/Adding_new-style_import_to_a_datasource

10529 07/31/2013 03:59 PM Aaron Marcuse-Kubitza

inputs/SpeciesLink/Specimen/map.csv: renamed DUPLICATE#of:... output columns to be <= 63 chars long, in order to be valid PostgreSQL columns without collisions

10528 07/31/2013 03:37 PM Aaron Marcuse-Kubitza

inputs/SpeciesLink/Specimen/: translated multi-column filters to postprocessing derived columns, using the steps at wiki.vegpath.org/Adding_new-style_import_to_a_datasource#Translating-filters-to-postprocessing-derived-columns

10494 07/30/2013 04:51 PM Aaron Marcuse-Kubitza

inputs/SpeciesLink/Specimen/map.csv: conceptual_darwin_2003_1_0_BoundingBox: remapped to UNUSED

10436 07/26/2013 12:26 PM Aaron Marcuse-Kubitza

inputs/SpeciesLink/Specimen/map.csv: remapped Darwin Core synonyms to DUPLICATE. this avoids the need to translate these to postprocessing derived columns for new-style import, and also speeds up column-based import because there are less automatic alts to perform to resolve filter-less collisions. the svn diff was verified by replacing DUPLICATE#of:dwc_terms<term>#... with <term>, removing the comment, and checking that this removes the diff (except where VegCore has renamed a DwC term).

View revisions

Also available in: Atom