2012-04-09 conference call¶

To do¶

~~reload all datasources~~ except those excluded by Brad
- ~~investigate host vs. VM performance first?~~: VM is actually 9% faster
~~import CTFS VegX~~ raw data instead
~~parallelize Python import scripts~~ using column-based import instead
- split time between this and importing CTFS
- will allow SpeciesLink to be imported much faster by using all cores at once
automate validation of new data sources
- replace column names for each datasource in a SQL validation script that uses DwC2 names
~~serialization of VegX~~
- CSV? ~~JSON?~~
find John Donoghue's geo-validation scripts

TNRS: PHP/MySQL from Brad on nimoy
Geoscrub: Python/PostgreSQL/R from John on eos, and PHP/MySQL from Brad on nimoy

Files (1)