Project

General

Profile

Statistics
| Revision:
Name Size Revision Age Author Comment
  _archive 1598 over 12 years Aaron Marcuse-Kubitza Moved _archive/tapir2flatClient/trunk/client/ t...
  analysis 3076 over 12 years Aaron Marcuse-Kubitza Added top-level analysis dir for range modeling
  backups 3701 about 12 years Aaron Marcuse-Kubitza backups/Makefile: Added synchronization of back...
  bin 4049 about 12 years Aaron Marcuse-Kubitza bin/map: collision_suffix: Changed to use _merg...
  config 272 almost 13 years Aaron Marcuse-Kubitza Moved bien_password to new config dir
  inputs 4049 about 12 years Aaron Marcuse-Kubitza bin/map: collision_suffix: Changed to use _merg...
  lib 4041 about 12 years Aaron Marcuse-Kubitza xml_func.py: Added simplify()
  mappings 4055 about 12 years Aaron Marcuse-Kubitza mappings/VegCSV-VegBIEN.specimens.csv: Cleaned ...
  schemas 4054 about 12 years Aaron Marcuse-Kubitza schemas/functions.sql: join_strs_transform(): U...
  to_do 2547 over 12 years Aaron Marcuse-Kubitza to_do/timeline.doc: Updated to reflect the mont...
Makefile 10.1 KB 3764 about 12 years Aaron Marcuse-Kubitza root Makefile, input.Makefile: Maps validation:...
README.TXT 9.03 KB 3845 about 12 years Aaron Marcuse-Kubitza README.TXT: After a new import: Added steps to ...
map 1.22 KB 3475 about 12 years Aaron Marcuse-Kubitza root map: Run bin/map with a nice increment of ...

Latest revisions

# Date Author Comment
4055 08/15/2012 07:46 AM Aaron Marcuse-Kubitza

mappings/VegCSV-VegBIEN.specimens.csv: Cleaned up using `make mappings/`

4054 08/15/2012 07:18 AM Aaron Marcuse-Kubitza

schemas/functions.sql: join_strs_transform(): Use STRICT optimization to avoid needing to manually check if the state value or input value is NULL (http://www.postgresql.org/docs/8.3/static/sql-createaggregate.html#AEN51596)

4053 08/15/2012 07:15 AM Aaron Marcuse-Kubitza

schemas/functions.sql: join_strs(), join_strs_transform(): Reversed order of params to enable strict optimization, which replaces the state value with the first parameter, which used to be the delimiter (http://www.postgresql.org/docs/8.3/static/sql-createaggregate.html#AEN51596)

4052 08/15/2012 07:07 AM Aaron Marcuse-Kubitza

Renamed join_strs_transform_preserve_empty() to join_strs_transform() now that there are no other join_strs_transform_...() functions

4051 08/15/2012 07:06 AM Aaron Marcuse-Kubitza

schemas/functions.sql: Removed no longer used join_strs_transform_fold_empty()

4050 08/15/2012 07:06 AM Aaron Marcuse-Kubitza

schemas/functions.sql: join_strs() aggregate: Use join_strs_transform_preserve_empty() as an optimization because all our data has already had '' replaced with NULL by sql_io.cleanup_table() in csv2db. This will help speed up _merges now that they are performed on a large scale in the slowest datasource, SpeciesLink.

4049 08/15/2012 07:02 AM Aaron Marcuse-Kubitza

bin/map: collision_suffix: Changed to use _merge instead of _alt to avoid losing source data on import when multiple fields collide

4048 08/15/2012 06:58 AM Aaron Marcuse-Kubitza

bin/map: Preventing collisions if multiple inputs mapping to same output: Made collision suffix configurable so it can easily be changed

4047 08/15/2012 06:56 AM Aaron Marcuse-Kubitza

bin/map: Preventing collisions if multiple inputs mapping to same output: Made collision suffix configurable so it can easily be changed

4046 08/15/2012 06:52 AM Aaron Marcuse-Kubitza

mappings/DwC2-VegBIEN.specimens.csv, VegCSV-VegBIEN.specimens.csv: taxonoccurrence.sourceaccessioncode mappings: Added catalogNumber mapping, which takes precendence over recordNumber and is applicable to specimens data and direct vouchers. recordNumber should only be used as a last resort (before the taxon name) because this is collector-assigned and often not unique within anything.

View all revisions | View revisions

Also available in: Atom