Project

General

Profile

Statistics
| Revision:
Name Size Revision Age Author Comment
  _archive 1598 over 12 years Aaron Marcuse-Kubitza Moved _archive/tapir2flatClient/trunk/client/ t...
  analysis 3076 over 12 years Aaron Marcuse-Kubitza Added top-level analysis dir for range modeling
  backups 10267 over 11 years Aaron Marcuse-Kubitza backups/Makefile: %.backup/restore: documented ...
  bin 10348 over 11 years Aaron Marcuse-Kubitza bugfix: bin/repl: text mode: repurpose this to ...
  config 7801 over 11 years Aaron Marcuse-Kubitza root Makefile: VegBIEN DB: mk_db: Added command...
  exports 9928 over 11 years Aaron Marcuse-Kubitza added exports/_archive/
  inputs 10434 over 11 years Aaron Marcuse-Kubitza bugfix: inputs/SpeciesLink/Specimen/map.csv: dw...
  lib 10402 over 11 years Aaron Marcuse-Kubitza lib/sh/db.sh: mk_select(): support passing $col...
  mappings 10420 over 11 years Aaron Marcuse-Kubitza mappings/VegCore.htm: regenerated from wiki. ad...
  planning 10429 over 11 years Aaron Marcuse-Kubitza planning/timeline/timeline.2013.xls: Adding der...
  schemas 10392 over 11 years Aaron Marcuse-Kubitza schemas/vegbien.sql: geoscrub_input_new: update...
  web 10306 over 11 years Aaron Marcuse-Kubitza web/links/index.htm: updated to Firefox bookmar...
.htaccess 326 Bytes 8771 over 11 years Aaron Marcuse-Kubitza /.htaccess: use canonical URL without symlinks
.rsync_filter.upload 33 Bytes 10042 over 11 years Aaron Marcuse-Kubitza /.rsync_ignore: temp files: hide them on upload...
.rsync_ignore 12 Bytes 10042 over 11 years Aaron Marcuse-Kubitza /.rsync_ignore: temp files: hide them on upload...
Makefile 12.6 KB 10223 over 11 years Aaron Marcuse-Kubitza /Makefile: mysql-Linux: also install mysql-work...
README.TXT 23.3 KB 10379 over 11 years Aaron Marcuse-Kubitza /README.TXT: Maintenance: added instructions fo...
fix_perms 97 Bytes 7560 over 11 years Aaron Marcuse-Kubitza Added root fix_perms
map 1001 Bytes 6949 almost 12 years Aaron Marcuse-Kubitza vegbien_dest: Changed default $prefix to "", so...
new_terms.csv 38.1 KB 7222 almost 12 years Aaron Marcuse-Kubitza new_terms.csv: Regenerated
run 433 Bytes 9916 over 11 years Aaron Marcuse-Kubitza /run: geoscrub_input/make(): documented runtime...
unmapped_terms.csv 13.1 KB 7201 almost 12 years Aaron Marcuse-Kubitza **/new_terms.csv, **/unmapped_terms.csv: Regene...

Latest revisions

# Date Author Comment
10434 07/26/2013 11:56 AM Aaron Marcuse-Kubitza

bugfix: inputs/SpeciesLink/Specimen/map.csv: dwc_terms_scientificName: remapped to scientificName instead of taxonName to match that DwC term name, as well as the mappings of other *scientificName terms

10433 07/26/2013 11:06 AM Aaron Marcuse-Kubitza

inputs/SpeciesLink/Specimen/map.csv: marked dwc_geospatial_VerbatimLatitude,Longitude as exact duplicates of dwc_terms_*

10432 07/26/2013 10:52 AM Aaron Marcuse-Kubitza

inputs/SpeciesLink/Specimen/map.csv: remapped identical _alt-ed fields to DUPLICATE. this avoids the need to translate these to postprocessing derived columns for new-style import, and also speeds up column-based import because there are less automatic _alts to perform to resolve filter-less collisions.

10431 07/26/2013 10:06 AM Aaron Marcuse-Kubitza

bugfix: inputs/SpeciesLink/Specimen/map.csv: *CollectorNumber: moved these to the same _alt group as recordNumber, because they are actually duplicates

10430 07/26/2013 09:43 AM Aaron Marcuse-Kubitza

correction: inputs/SpeciesLink/Specimen/map.csv: FieldNumber: fixed incorrect comment that these fields are identical to recordNumber, when instead they have the same *meaning but not the same values. instead, values are stored under either of the two terms. the previous conclusion had been based on an incorrect query, which used != instead of the NULL-sensitive IS NOT DISTINCT FROM.

10429 07/25/2013 08:14 PM Aaron Marcuse-Kubitza

planning/timeline/timeline.2013.xls: Adding derived columns: extended to overlap with all subtasks

10428 07/25/2013 08:12 PM Aaron Marcuse-Kubitza

planning/timeline/timeline.2013.xls: Geoscrubbing: split into separate re-run and automated pipeline tasks

10427 07/25/2013 08:09 PM Aaron Marcuse-Kubitza

planning/timeline/timeline.2013.xls: moved Data provider validations before Adding derived columns because ensuring that the source data is in the database is more important than the derived data, which can always be added later

10426 07/25/2013 08:00 PM Aaron Marcuse-Kubitza

planning/timeline/timeline.2013.xls: Data provider validations: added dot in July because some amount of datasource-level validation happens when mappings issues are discovered during the refactoring

10425 07/25/2013 07:34 PM Aaron Marcuse-Kubitza

bugfix: inputs/*/*/map.csv for specimen tables: remapped eventDate,day,month,year to *Collected, because a general date always applies to the observation itself rather than to any parent event (specimens don't have a parent event)

View all revisions | View revisions

Also available in: Atom