Project

General

Profile

Statistics
| Revision:
Name Size Revision Age Author Comment
  _archive 1598 over 12 years Aaron Marcuse-Kubitza Moved _archive/tapir2flatClient/trunk/client/ t...
  analysis 3076 over 12 years Aaron Marcuse-Kubitza Added top-level analysis dir for range modeling
  backups 3701 about 12 years Aaron Marcuse-Kubitza backups/Makefile: Added synchronization of back...
  bin 4068 about 12 years Aaron Marcuse-Kubitza bin/map: collision_suffix: Setting back to _alt...
  config 272 almost 13 years Aaron Marcuse-Kubitza Moved bien_password to new config dir
  inputs 4068 about 12 years Aaron Marcuse-Kubitza bin/map: collision_suffix: Setting back to _alt...
  lib 4041 about 12 years Aaron Marcuse-Kubitza xml_func.py: Added simplify()
  mappings 4066 about 12 years Aaron Marcuse-Kubitza mappings/DwC2-VegBIEN.specimens.csv, VegCSV-Veg...
  schemas 4061 about 12 years Aaron Marcuse-Kubitza schemas/vegbien.sql: location: location_unique_...
  to_do 2547 over 12 years Aaron Marcuse-Kubitza to_do/timeline.doc: Updated to reflect the mont...
Makefile 10.1 KB 3764 about 12 years Aaron Marcuse-Kubitza root Makefile, input.Makefile: Maps validation:...
README.TXT 9.03 KB 3845 about 12 years Aaron Marcuse-Kubitza README.TXT: After a new import: Added steps to ...
map 1.22 KB 3475 about 12 years Aaron Marcuse-Kubitza root map: Run bin/map with a nice increment of ...

Latest revisions

# Date Author Comment
4068 08/16/2012 12:39 PM Aaron Marcuse-Kubitza

bin/map: collision_suffix: Setting back to _alt to test if _merge caused the SpeciesLink slowdown. SpeciesLink contains a huge number of equivalent columns due to each DwC term being present with namespaces for all versions of the DwC schema, and these columns can be combined either using _alt or _merge. _merge is only useful if the values in different versions of the same DwC field are different, which is not likely the case.

4067 08/16/2012 12:29 PM Aaron Marcuse-Kubitza

inputs/import.stats.xls: Updated with stats from latest import. The import time for SpeciesLink (the slowest datasource) doubled, to 16 hours, most likely due to replacing _alt with the slower _merge, which preserves more input data.

4066 08/15/2012 11:30 AM Aaron Marcuse-Kubitza

mappings/DwC2-VegBIEN.specimens.csv, VegCSV-VegBIEN.specimens.csv: occurrenceID: Mapped to location.authorlocationcode instead of sourceaccessioncode so that it would not override any location- or event-related IDs in location.authorlocationcode merely by being mapped to the sourceaccessioncode field (which takes precedence over the authorlocationcode when specified)

4065 08/15/2012 10:43 AM Aaron Marcuse-Kubitza

mappings/VegCSV-VegBIEN.specimens.csv: occurrenceID: Mapped to specimenreplicate.sourceaccessioncode for mergability with DwC

4064 08/15/2012 09:14 AM Aaron Marcuse-Kubitza

mappings/VegCSV-VegBIEN.specimens.csv: Mapped voucherType to indirect voucher _if statements' conditions

4063 08/15/2012 09:02 AM Aaron Marcuse-Kubitza

mappings/VegCSV-VegBIEN.specimens.csv: locationID: location.sourceaccessioncode mapping: Added /_alt suffix for mergability with DwC

4062 08/15/2012 08:53 AM Aaron Marcuse-Kubitza

mappings/DwC2-VegBIEN.specimens.csv: collectionID: Mapped to location.authorlocationcode as merge with collectionCode, the same way as it is for specimenreplicate.collectioncode_dwc

4061 08/15/2012 08:23 AM Aaron Marcuse-Kubitza

schemas/vegbien.sql: location: location_unique_within_datasource_by_authorlocationcode unique index: Added `parent_id IS NULL` condition so that an authorlocationcode is not unintentionally treated as globally unique when a parent location is available (which implies that the authorlocationcode is a subplot code)

4060 08/15/2012 08:20 AM Aaron Marcuse-Kubitza

mappings/VegCSV-VegBIEN.specimens.csv: catalogNumber: Added location.authorlocationcode mapping for mergability with DwC

4059 08/15/2012 08:13 AM Aaron Marcuse-Kubitza

mappings/DwC2-VegBIEN.specimens.csv: location.authorlocationcode mappings: Added /_alt/3 for mergability with VegCSV mappings to same field

View all revisions | View revisions

Also available in: Atom