Project

General

Profile

Statistics
| Revision:
Name Size Revision Age Author Comment
  Source 11396 about 11 years Aaron Marcuse-Kubitza fix: bin/map: put template: comment out the "Pu...
  _MySQL 11705 about 11 years Aaron Marcuse-Kubitza copyright scrub: inputs/: removed data provider...
  _archive 10089 over 11 years Aaron Marcuse-Kubitza added inputs/GBIF/_archive/
  _src 11705 about 11 years Aaron Marcuse-Kubitza copyright scrub: inputs/: removed data provider...
  logs 8801 over 11 years Aaron Marcuse-Kubitza inputs/input.Makefile: SVN: add, %/add: */logs:...
  raw_occurrence_record_plants 12516 almost 11 years Aaron Marcuse-Kubitza bugfix: *.sql: public.source_by_shortname(): ne...
  verify 12018 almost 11 years Aaron Marcuse-Kubitza inputs/input.Makefile: add!: verify/: also svn:...
Makefile 27 Bytes 9385 over 11 years Aaron Marcuse-Kubitza added inputs/GBIF/Makefile, which links to ../i...
gbif_extract.tar.gz.md5 33 Bytes 7571 almost 12 years Aaron Marcuse-Kubitza inputs/: Added .md5 files for all .zip, .gz
gbif_extract.tar.gz.url 71 Bytes 7586 almost 12 years Aaron Marcuse-Kubitza Added inputs/GBIF/gbif_extract.tar.gz.url
import_order.txt 36 Bytes 10088 over 11 years Aaron Marcuse-Kubitza removed inputs/GBIF/Specimen/, which has been r...
map.csv 1.24 KB 10425 over 11 years Aaron Marcuse-Kubitza bugfix: inputs/*/*/map.csv for specimen tables:...
new_terms.csv 454 Bytes 11788 about 11 years Aaron Marcuse-Kubitza **/new_terms.csv, unmapped_terms.csv updated (u...
run 153 Bytes 10051 over 11 years Aaron Marcuse-Kubitza inputs/GBIF/run: inherit from lib/runscripts/da...
table.run 87 Bytes 10268 over 11 years Aaron Marcuse-Kubitza bugfix: inputs/GBIF/table.run: need to include ...
unmapped_terms.csv 232 Bytes 11788 about 11 years Aaron Marcuse-Kubitza **/new_terms.csv, unmapped_terms.csv updated (u...
  • svn:ignore: *

Latest revisions

# Date Author Comment
12516 02/27/2014 01:27 PM Aaron Marcuse-Kubitza

bugfix: *.sql: public.source_by_shortname(): need to wrap it in a nested SELECT because Postgres incorrectly does not constant-fold (inline) it, leading to a slowdown when it is therefore run many times. this is done using the steps at wiki.vegpath.org/Postgres_queries#wrap-function-call-in-nested-SELECT .

12018 02/02/2014 12:49 AM Aaron Marcuse-Kubitza

inputs/input.Makefile: add!: verify/: also svn:ignore *.tsv, *.txt

11970 01/20/2014 11:33 AM Aaron Marcuse-Kubitza

moved everything into /trunk/ to create the standard svn layout, for use with tools that require this (eg. git-svn). IMPORTANT: do NOT do an `svn up`. instead, re-use your working copy's existing files with `svn switch` (http://svnbook.red-bean.com/en/1.6/svn.ref.svn.c.switch.html).

11888 12/10/2013 06:35 AM Aaron Marcuse-Kubitza

inputs/GBIF/raw_occurrence_record_plants/map.csv: row_num: remapped to plain *row_num, like the other datasources that have this field

11887 12/10/2013 06:31 AM Aaron Marcuse-Kubitza

inputs/GBIF/raw_occurrence_record_plants/postprocess.sql: Remove institutions that we have direct data for: rerun time: noted that this is only fast after manual vacuuming of the table (to remove the deleted rows from the index). autovacuum apparently does not run, although it should.

11881 12/09/2013 07:24 PM Aaron Marcuse-Kubitza

inputs/GBIF/raw_occurrence_record_plants/test.xml.ref: reran test, which added yearCollected/monthCollected/dayCollected

11869 12/09/2013 02:43 PM Aaron Marcuse-Kubitza

inputs/GBIF/raw_occurrence_record_plants/run: updated import() runtime (same), documented table cleanup runtime (1.5 h)

11868 12/09/2013 02:38 PM Aaron Marcuse-Kubitza

inputs/GBIF/raw_occurrence_record_plants/postprocess.sql: CREATE INDEX ... specimenHolderInstitutions: documented runtime (45 min)

11867 12/09/2013 02:28 PM Aaron Marcuse-Kubitza

inputs/GBIF/raw_occurrence_record_plants/postprocess.sql: Remove institutions that we have direct data for: documented runtime (3.5 min)

11788 11/26/2013 11:11 PM Aaron Marcuse-Kubitza

**/new_terms.csv, unmapped_terms.csv updated (using `make missing_mappings`)

View revisions

Also available in: Atom