| Name | Size | Revision | Age | Author | Comment |
|---|---|---|---|---|---|
| _archive | | 1598 | over 12 years | Aaron Marcuse-Kubitza | Moved _archive/tapir2flatClient/trunk/client/ t... |
| analysis | | 3076 | over 12 years | Aaron Marcuse-Kubitza | Added top-level analysis dir for range modeling |
| backups | | 4751 | about 12 years | Aaron Marcuse-Kubitza | backups/Makefile: Backups: Full DB: Specify the... |
| bin | | 4920 | about 12 years | Aaron Marcuse-Kubitza | tnrs_client: Support parsing multiple taxons at... |
| config | | 272 | almost 13 years | Aaron Marcuse-Kubitza | Moved bien_password to new config dir |
| inputs | | 4919 | about 12 years | Aaron Marcuse-Kubitza | inputs/import.stats.xls: Updated import times |
| lib | | 4917 | about 12 years | Aaron Marcuse-Kubitza | streams.py: Line iteration: Added read_all() |
| mappings | | 4922 | about 12 years | Aaron Marcuse-Kubitza | mappings/VegCore.csv: Added and mapped aggregat... |
| schemas | | 4863 | about 12 years | Aaron Marcuse-Kubitza | schemas/functions.sql: Added _in_to_m() |
| to_do | | 4524 | about 12 years | Aaron Marcuse-Kubitza | to_do/timeline.doc: Updated to reflect addition... |
| validation | | 4523 | about 12 years | Aaron Marcuse-Kubitza | Added validation/ |
| Makefile | 9.99 KB | 4752 | about 12 years | Aaron Marcuse-Kubitza | root Makefile: PostgreSQL: postgres-Linux: Adde... |
| README.TXT | 11.1 KB | 4793 | about 12 years | Aaron Marcuse-Kubitza | README.TXT: Data import: Added note that `make ... |
| map | 1.22 KB | 3475 | over 12 years | Aaron Marcuse-Kubitza | root map: Run bin/map with a nice increment of ... |
| new_terms.csv | 30.4 KB | 4887 | about 12 years | Aaron Marcuse-Kubitza | Regenerated root unmapped_terms.csv, new_terms.csv |
| unmapped_terms.csv | 5.8 KB | 4887 | about 12 years | Aaron Marcuse-Kubitza | Regenerated root unmapped_terms.csv, new_terms.csv |

Latest revisions

# Date Author Comment
4922 09/21/2012 03:20 PM Aaron Marcuse-Kubitza

mappings/VegCore.csv: Added and mapped aggregateOccurrenceID

4921 09/21/2012 03:12 PM Aaron Marcuse-Kubitza

mappings/VegCore.csv: taxonOccurrenceID: Re-sourced to VegBank taxonobservation and DwC occurrenceID, because this is where the VegBIEN table name came from

4920 09/21/2012 02:57 PM Aaron Marcuse-Kubitza

tnrs_client: Support parsing multiple taxons at once, by specifying each as a command-line argument. Increased the max_pause to 10 min to support large batches. Limited the batch size to 5000 names, using the limit at <http://tnrs.iplantcollaborative.org/TNRSapp.html>. Note that when using xargs to pass many names, xargs will by default split its arguments into chunks of 5000. You can change this using the -n option.

4919 09/21/2012 02:29 PM Aaron Marcuse-Kubitza

inputs/import.stats.xls: Updated import times

4918 09/21/2012 01:20 PM Aaron Marcuse-Kubitza

Added tnrs_client. Note that obtaining an actual CSV requires four (!) steps: submit, retrieve, prepare download, and download. The output of the retrieve step is unusable because the array has different lengths depending on the taxonomic ranks present in the provided taxon name. This initial version runs one name at a time, but could later be expanded to batch process because TNRS can run multiple names at once.

4917 09/21/2012 12:36 PM Aaron Marcuse-Kubitza

streams.py: Line iteration: Added read_all()

4916 09/21/2012 08:24 AM Aaron Marcuse-Kubitza

inputs/Madidi/Plot/map.csv: Soil component measurements: Documented that units are assumed to be % based on the range of values

4915 09/21/2012 08:18 AM Aaron Marcuse-Kubitza

sql_io.py: null_strs: Added '-'

4914 09/21/2012 08:18 AM Aaron Marcuse-Kubitza

sql_io.py: cleanup_table(): Fixed bug where each column name needed to be converted to Unicode before being concatenated with other strings, to support non-ASCII characters

4913 09/21/2012 07:57 AM Aaron Marcuse-Kubitza

inputs/SALVIAS/plotMetadata/map.csv, inputs/SALVIAS-CSV/Plot/map.csv: Soil component measurements: Documented that units are assumed to be % based on the range of values
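
The 5000-name batch limit described in revision 4920 can be illustrated with a minimal Python sketch. This is not the actual tnrs_client code: the helper name `batches` and the constant `MAX_BATCH_SIZE` are hypothetical, chosen only to show how a long list of taxon names might be split into chunks no larger than the stated limit before each chunk is passed on as command-line arguments.

```python
from itertools import islice
from typing import Iterable, Iterator, List

# Assumption: mirrors the 5000-name limit mentioned in the r4920 comment.
MAX_BATCH_SIZE = 5000


def batches(names: Iterable[str], size: int = MAX_BATCH_SIZE) -> Iterator[List[str]]:
    """Yield successive lists of at most `size` names (hypothetical helper)."""
    if size < 1:
        raise ValueError("size must be at least 1")
    it = iter(names)
    while True:
        # Take up to `size` items; an empty list means the input is exhausted.
        batch = list(islice(it, size))
        if not batch:
            return
        yield batch


if __name__ == "__main__":
    # Placeholder names, used only to show the resulting batch sizes.
    names = ["Taxon %d" % i for i in range(12000)]
    print([len(b) for b in batches(names)])
```

Each yielded batch could then be handed to a single invocation of the client, which is roughly what `xargs -n 5000` would arrange from the shell.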
