Project

General

Profile

Statistics
| Revision:
Name Size Revision Age Author Comment
  _archive 1598 over 12 years Aaron Marcuse-Kubitza Moved _archive/tapir2flatClient/trunk/client/ t...
  analysis 3076 over 12 years Aaron Marcuse-Kubitza Added top-level analysis dir for range modeling
  backups 4751 about 12 years Aaron Marcuse-Kubitza backups/Makefile: Backups: Full DB: Specify the...
  bin 5126 about 12 years Aaron Marcuse-Kubitza tnrs_db: Exclude taxonomic names which have alr...
  config 272 almost 13 years Aaron Marcuse-Kubitza Moved bien_password to new config dir
  inputs 5130 about 12 years Aaron Marcuse-Kubitza mappings/VegCore-VegBIEN.csv: Mapped accepted* ...
  lib 5129 about 12 years Aaron Marcuse-Kubitza sql_io.py: cleanup_table(): Don't clean up the ...
  mappings 5130 about 12 years Aaron Marcuse-Kubitza mappings/VegCore-VegBIEN.csv: Mapped accepted* ...
  schemas 5078 about 12 years Aaron Marcuse-Kubitza Regenerated vegbien.ERD exports
  to_do 4524 about 12 years Aaron Marcuse-Kubitza to_do/timeline.doc: Updated to reflect addition...
  validation 4523 about 12 years Aaron Marcuse-Kubitza Added validation/
Makefile 9.99 KB 4752 about 12 years Aaron Marcuse-Kubitza root Makefile: PostgreSQL: postgres-Linux: Adde...
README.TXT 11.3 KB 5040 about 12 years Aaron Marcuse-Kubitza README.TXT: Data import: Starting column-based ...
map 1.28 KB 4981 about 12 years Aaron Marcuse-Kubitza root map: Fixed custom public schema override t...
new_terms.csv 30.4 KB 4887 about 12 years Aaron Marcuse-Kubitza Regenerated root unmapped_terms.csv, new_terms.csv
unmapped_terms.csv 5.8 KB 4887 about 12 years Aaron Marcuse-Kubitza Regenerated root unmapped_terms.csv, new_terms.csv

Latest revisions

# Date Author Comment
5130 09/28/2012 03:23 PM Aaron Marcuse-Kubitza

mappings/VegCore-VegBIEN.csv: Mapped accepted* taxonomic terms

5129 09/28/2012 03:00 PM Aaron Marcuse-Kubitza

sql_io.py: cleanup_table(): Don't clean up the pkey, because the canonicalization involved may produce collisions (as it does for TNRS.tnrs)

5128 09/28/2012 02:58 PM Aaron Marcuse-Kubitza

sql.py: Added pkey_col_()

5127 09/28/2012 02:31 PM Aaron Marcuse-Kubitza

tnrs.py: tnrs_request(): Added comment that names containing only whitespace characters are ignored by TNRS and do not receive a response row. Our tnrs_db and reimport pipeline handles the necessary re-matching-up by just creating taxonpaths for each Name_submitted, and then letting the data import process on the following import attach to the prepopulated taxonpaths.

5126 09/28/2012 02:17 PM Aaron Marcuse-Kubitza

tnrs_db: Exclude taxonomic names which have already been scrubbed, by using a filter-out LEFT JOIN on TNRS.tnrs

5125 09/28/2012 02:02 PM Aaron Marcuse-Kubitza

tnrs.py: max_pause: Changed to 30 min because TNRS sometimes freezes for ~10 min. The freezing usually happens while the data is being uploaded rather than when it's being retrieved, so that the max_pause would not apply, but to be on the safe side, requests should not time out unnecessarily.

5124 09/28/2012 01:27 PM Aaron Marcuse-Kubitza

tnrs_db: tnrs_profiler: Use iter_text='name' for consistency with tnrs.tnrs_request()'s own profiler's iter_text

5123 09/28/2012 01:25 PM Aaron Marcuse-Kubitza

tnrs_db: Print cumulative profiling information after every TNRS request, rather than just at the end

5122 09/28/2012 01:22 PM Aaron Marcuse-Kubitza

inputs/.TNRS/tnrs/tnrs.make: Append to the log file instead of overwriting it, so that the TNRS scrubbing of each import's new taxonomic names can be included in one log file. Echo the command to the log file to identify separate runs.

5121 09/28/2012 01:15 PM Aaron Marcuse-Kubitza

TNRS-related programs: Use "names" instead of "taxons" for variable names because what's being submitted are actually verbatim taxonomic names, not official references to specific taxa

View all revisions | View revisions

Also available in: Atom