Project

General

Profile

Statistics
| Revision:
Name Size Revision Age Author Comment
  _archive 1598 over 12 years Aaron Marcuse-Kubitza Moved _archive/tapir2flatClient/trunk/client/ t...
  analysis 3076 over 12 years Aaron Marcuse-Kubitza Added top-level analysis dir for range modeling
  backups 6725 almost 12 years Aaron Marcuse-Kubitza Added backups/fix_perms
  bin 6785 almost 12 years Aaron Marcuse-Kubitza my2pg: Translate blob to bytea
  config 6321 about 12 years Aaron Marcuse-Kubitza Added config/bien_read_password
  inputs 6808 almost 12 years Aaron Marcuse-Kubitza inputs/CVS/taxonObservation_/map.csv: Use denor...
  lib 6801 almost 12 years Aaron Marcuse-Kubitza sql_io.py: put_table(): is_function: Fixed bug ...
  mappings 6795 almost 12 years Aaron Marcuse-Kubitza mappings/VegCore-VegBIEN.csv: institutionCode: ...
  schemas 6794 almost 12 years Aaron Marcuse-Kubitza schemas/vegbien.sql: sourcename: Added sourcena...
  to_do 4524 about 12 years Aaron Marcuse-Kubitza to_do/timeline.doc: Updated to reflect addition...
  validation 5971 about 12 years Aaron Marcuse-Kubitza Updated validation/BIEN2_Analytical_DB_overview...
Makefile 12.9 KB 6544 almost 12 years Aaron Marcuse-Kubitza root Makefile: apt-get: Use --yes to allow unat...
README.TXT 15.9 KB 6804 almost 12 years Aaron Marcuse-Kubitza README.TXT: Data import: Creating enough disk s...
map 989 Bytes 5158 about 12 years Aaron Marcuse-Kubitza root map: Removed no longer needed public schem...
new_terms.csv 30.4 KB 4887 about 12 years Aaron Marcuse-Kubitza Regenerated root unmapped_terms.csv, new_terms.csv
unmapped_terms.csv 5.8 KB 4887 about 12 years Aaron Marcuse-Kubitza Regenerated root unmapped_terms.csv, new_terms.csv

Latest revisions

# Date Author Comment
6808 12/12/2012 06:27 PM Aaron Marcuse-Kubitza

inputs/CVS/taxonObservation_/map.csv: Use denorm_* denormalized taxonomic ranks in place of the normalized ranks when both are provided

6807 12/12/2012 06:25 PM Aaron Marcuse-Kubitza

input.Makefile: Maps validation: %/new_terms.csv: Fixed bug where need to filter unmapped_terms.csv's terms out of the output column, not the input column, because that's what the unmapped terms are generated from. Usually these columns are the same for unmapped terms, but sometimes an output term is changed from the original column's name but still doesn't match a VegCore term in mappings/VegCore-VegBIEN.csv.

6806 12/12/2012 06:08 PM Aaron Marcuse-Kubitza

input.Makefile: SVN: add: Added comment with instructions to update all inputs with these settings, using `make inputs/add`

6805 12/12/2012 06:07 PM Aaron Marcuse-Kubitza

input.Makefile: SVN: add: verify: Also ignore *.xlsx

6804 12/12/2012 06:00 PM Aaron Marcuse-Kubitza

README.TXT: Data import: Creating enough disk space: Added instructions for removing archived backups to free up space

6803 12/12/2012 05:15 PM Aaron Marcuse-Kubitza

inputs/CVS/taxonObservation_/map.csv: Fixed bug where taxonLevel, not taxonRank, needs to be mapped to taxonRank, because CVS's taxonRank is actually a number, while taxonLevel contains the corresponding text string

6802 12/12/2012 05:12 PM Aaron Marcuse-Kubitza

README.TXT: Data import: Before import, added step to make sure there is at least 100GB of disk space

6801 12/12/2012 04:41 PM Aaron Marcuse-Kubitza

sql_io.py: put_table(): is_function: Fixed bug where need to add the pkeys table's test pkey constraint after the data is added rather than when the empty table is created, to avoid adding a pkey constraint that will later be violated by data which returns multiple output rows for an input row (such as calls to _split())

6800 12/12/2012 04:36 PM Aaron Marcuse-Kubitza

sql_io.py: put_table(): insert_into_pkeys(): Allow callers to override run_query_into()'s add_pkey_ param in case the initial version of the pkeys table should not yet have the test pkey constraint (e.g. because data is added after the table is created)

6799 12/12/2012 04:24 PM Aaron Marcuse-Kubitza

README.TXT: Data import: Checking for errors: Search for "Command exited with non-zero status" to find errors, which is faster than checking that each input's log ends in "Encountered 0 error(s)"

View all revisions | View revisions

Also available in: Atom