Project

General

Profile

Statistics
| Revision:
Name Size Revision Age Author Comment
  _archive 1598 almost 13 years Aaron Marcuse-Kubitza Moved _archive/tapir2flatClient/trunk/client/ t...
  analysis 3076 over 12 years Aaron Marcuse-Kubitza Added top-level analysis dir for range modeling
  backups 4751 over 12 years Aaron Marcuse-Kubitza backups/Makefile: Backups: Full DB: Specify the...
  bin 4788 over 12 years Aaron Marcuse-Kubitza review: Don't remove XML functions that are uni...
  config 272 about 13 years Aaron Marcuse-Kubitza Moved bien_password to new config dir
  inputs 4836 over 12 years Aaron Marcuse-Kubitza inputs/import.stats.xls: Updated import times
  lib 4758 over 12 years Aaron Marcuse-Kubitza xml_dom.py: replace_with_text(): Support ints a...
  mappings 4840 over 12 years Aaron Marcuse-Kubitza mappings/Veg+.terms.csv: Removed terms that are...
  schemas 4830 over 12 years Aaron Marcuse-Kubitza schemas/vegbien.sql: stemobservation: stemobser...
  to_do 4524 over 12 years Aaron Marcuse-Kubitza to_do/timeline.doc: Updated to reflect addition...
  validation 4523 over 12 years Aaron Marcuse-Kubitza Added validation/
Makefile 9.99 KB 4752 over 12 years Aaron Marcuse-Kubitza root Makefile: PostgreSQL: postgres-Linux: Adde...
README.TXT 11.1 KB 4793 over 12 years Aaron Marcuse-Kubitza README.TXT: Data import: Added note that `make ...
map 1.22 KB 3475 over 12 years Aaron Marcuse-Kubitza root map: Run bin/map with a nice increment of ...
new_terms.csv 8.09 KB 4714 over 12 years Aaron Marcuse-Kubitza Updated aggregated unmapped_terms.csv, new_term...
unmapped_terms.csv 5.91 KB 4714 over 12 years Aaron Marcuse-Kubitza Updated aggregated unmapped_terms.csv, new_term...

Latest revisions

# Date Author Comment
4840 09/19/2012 06:36 PM Aaron Marcuse-Kubitza

mappings/Veg+.terms.csv: Removed terms that are in mappings/Veg+-VegCore.csv

4839 09/19/2012 06:31 PM Aaron Marcuse-Kubitza

mappings/Veg+-VegCore.csv: Added sources where missing

4838 09/19/2012 06:20 PM Aaron Marcuse-Kubitza

mappings/Veg+-VegCore.csv: Added Source and Comments columns from mappings/Veg+.terms.csv. Reordered columns to put Comments first.

4837 09/19/2012 06:17 PM Aaron Marcuse-Kubitza

mappings/Veg+.terms.csv: Removed duplicate entries for stem_id/stemID, collector

4836 09/19/2012 05:56 PM Aaron Marcuse-Kubitza

inputs/import.stats.xls: Updated import times

4835 09/19/2012 05:24 PM Aaron Marcuse-Kubitza

inputs/REMIB/Specimen/: Filter out invalid, frameshifted rows so they don't produce errors in the import or anomalies like thousands of taxondeterminations for one taxonoccurrence. This involves moving the CSVs to Specimen.src and using a create.sql to create the filtered table.

4834 09/19/2012 04:47 PM Aaron Marcuse-Kubitza

mappings/VegCore-VegBIEN.csv: Forward occurrenceID to taxonoccurrence.sourceaccessioncode when there is no other taxonoccurrence.sourceaccessioncode, to ensure that taxonoccurrence is uniquely identified so that there is one taxonoccurrence per organism

4833 09/19/2012 04:16 PM Aaron Marcuse-Kubitza

mappings/VegCore-VegBIEN.csv: taxonoccurrence.authortaxoncode alternatives: Use _first instead of _alt because when one of these fields is present, it can be used directly even if it's sometimes NULL, without needing to spend a lot of time _alting together fields that won't be used. Datasources where the authortaxoncode is sometimes NULL usually have a separate sourceaccessioncode for the taxonoccurrence. (In the rare case that they don't, they should map a non-NULL field to recordNumber or tag to ensure that taxonoccurrences can be uniquely identified.)

4832 09/19/2012 04:07 PM Aaron Marcuse-Kubitza

mappings/VegCore-VegBIEN.csv: Mapped tag to taxonoccurrence.authortaxoncode when the record is an organism, in case there is no other ID for the taxonoccurrence. This fixes a bug in FIA and TEAM data where all organisms in a plot used the same taxonoccurrence because taxonoccurrence was not properly constrained, causing the loss of individual taxondeterminations on each organism.

4831 09/19/2012 03:36 PM Aaron Marcuse-Kubitza

input.Makefile: Testing: %/test.by_col.xml: Do abort tester if by-column test fails. There are no longer small rowcount differences between row-based and column-based import on some datasources, so this is now possible.

View all revisions | View revisions

Also available in: Atom