Name                   Size       Revision  Age              Author                 Comment
logs                              8801      about 11 years   Aaron Marcuse-Kubitza  inputs/input.Makefile: SVN: add, %/add: */logs:...
.map.csv.last_cleanup  0 Bytes    8261      over 11 years    Aaron Marcuse-Kubitza  Added inputs/GBIF/raw_occurrence_record/ from r...
.rsync_ignore          11 Bytes   10008     about 11 years   Aaron Marcuse-Kubitza  added inputs/GBIF/raw_occurrence_record_plants/...
VegBIEN.csv            42 Bytes   10174     almost 11 years  Aaron Marcuse-Kubitza  bugfix: inputs/input.Makefile: %/VegBIEN.csv: f...
header.csv             508 Bytes  9411      about 11 years   Aaron Marcuse-Kubitza  inputs/GBIF/raw_occurrence_record/header.csv: r...
map.csv                1.24 KB    11888     over 10 years    Aaron Marcuse-Kubitza  inputs/GBIF/raw_occurrence_record_plants/map.cs...
new_terms.csv          413 Bytes  11888     over 10 years    Aaron Marcuse-Kubitza  inputs/GBIF/raw_occurrence_record_plants/map.cs...
postprocess.sql        1.26 KB    11887     over 10 years    Aaron Marcuse-Kubitza  inputs/GBIF/raw_occurrence_record_plants/postpr...
run                    4.22 KB    11869     over 10 years    Aaron Marcuse-Kubitza  inputs/GBIF/raw_occurrence_record_plants/run: u...
table.tsv.md5          33 Bytes   9882      about 11 years   Aaron Marcuse-Kubitza  mappings/VegCore-VegBIEN.csv: genus->taxonlabel...
test.xml.ref           20.6 KB    11881     over 10 years    Aaron Marcuse-Kubitza  inputs/GBIF/raw_occurrence_record_plants/test.x...
unmapped_terms.csv     201 Bytes  11888     over 10 years    Aaron Marcuse-Kubitza  inputs/GBIF/raw_occurrence_record_plants/map.cs...
  • svn:ignore: *

Latest revisions

# Date Author Comment
11970 01/20/2014 11:33 AM Aaron Marcuse-Kubitza

moved everything into /trunk/ to create the standard svn layout, for use with tools that require this (eg. git-svn). IMPORTANT: do NOT do an `svn up`. instead, re-use your working copy's existing files with `svn switch` (http://svnbook.red-bean.com/en/1.6/svn.ref.svn.c.switch.html).

11888 12/10/2013 06:35 AM Aaron Marcuse-Kubitza

inputs/GBIF/raw_occurrence_record_plants/map.csv: row_num: remapped to plain *row_num, like the other datasources that have this field

11887 12/10/2013 06:31 AM Aaron Marcuse-Kubitza

inputs/GBIF/raw_occurrence_record_plants/postprocess.sql: Remove institutions that we have direct data for: rerun time: noted that this is only fast after manual vacuuming of the table (to remove the deleted rows from the index). autovacuum apparently does not run, although it should.

11881 12/09/2013 07:24 PM Aaron Marcuse-Kubitza

inputs/GBIF/raw_occurrence_record_plants/test.xml.ref: reran test, which added yearCollected/monthCollected/dayCollected

11869 12/09/2013 02:43 PM Aaron Marcuse-Kubitza

inputs/GBIF/raw_occurrence_record_plants/run: updated import() runtime (same), documented table cleanup runtime (1.5 h)

11868 12/09/2013 02:38 PM Aaron Marcuse-Kubitza

inputs/GBIF/raw_occurrence_record_plants/postprocess.sql: CREATE INDEX ... specimenHolderInstitutions: documented runtime (45 min)

11867 12/09/2013 02:28 PM Aaron Marcuse-Kubitza

inputs/GBIF/raw_occurrence_record_plants/postprocess.sql: Remove institutions that we have direct data for: documented runtime (3.5 min)

11396 10/21/2013 07:14 PM Aaron Marcuse-Kubitza

fix: bin/map: put template: comment out the "Put template:" label so that the output is valid XML, and displays properly in a browser rather than showing a syntax error

11107 09/29/2013 08:58 PM Aaron Marcuse-Kubitza

bugfix: mappings/VegCore-VegBIEN.csv: nest all taxonoccurrences inside a stratum event, so that the parent locationevent is always fully populated before child locationevents point to it. (previously, a stub parent event was created when the child event was imported first, which blocked the fully-populated parent event from being inserted later on.) this uses auto-folding (for VegBank/CVS) and auto-forwarding (for other datasources) to prune empty stratum events for taxonoccurrences that don't have strata. (see wiki.vegpath.org/Auto-folding, wiki.vegpath.org/Auto-forwarding for more info about these normalization techniques.) note that the inserted row counts stay exactly the same for all datasources except VegBank (which was being fixed), indicating that this significant change to the mappings did not change the semantics of the import of taxonoccurrences.

10866 09/04/2013 11:06 PM Aaron Marcuse-Kubitza

inputs/*/*/test.xml.ref: updated source.shortname for new datasource name, which now starts out with .new suffix
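
Revision 11396 above comments out the "Put template:" label so that the output is valid XML. The Python sketch below illustrates why that matters; it is not the code in bin/map, which is not shown on this page. The helper names `with_commented_label` and `is_well_formed` and the sample document are hypothetical, chosen only for illustration. Plain text in front of the root element makes a document ill-formed, while the same text inside an XML comment is allowed there.

```python
import xml.etree.ElementTree as ET

LABEL = "Put template:"  # label text quoted in the r11396 commit message


def with_commented_label(xml_text: str, label: str = LABEL) -> str:
    """Hypothetical helper: prepend the label as an XML comment."""
    return f"<!-- {label} -->\n{xml_text}"


def is_well_formed(text: str) -> bool:
    """Return True if the text parses as a single XML document."""
    try:
        ET.fromstring(text)
        return True
    except ET.ParseError:
        return False


# Made-up document, for illustration only (not real bin/map output).
sample = "<plots><plot id='1'/></plots>"

bare = f"{LABEL}\n{sample}"               # raw label before the root element
commented = with_commented_label(sample)  # label wrapped in <!-- ... -->

print(is_well_formed(bare))       # False
print(is_well_formed(commented))  # True
```

A browser applies the same well-formedness rule, which would explain the syntax error the commit message mentions for the uncommented label.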
