2013-10-31 conference call¶
Martha's notes¶
Upcoming¶
- call next week at the usual time (Thu 9am PT and Tucson / 12pm ET)
Availability¶
- Brad is getting set up with iPlant again
- Paul will only be available to do geoscrubbing for a short while longer
- see the *Google spreadsheet* (and please add your availability for future weeks once it's known)
Decisions made¶
usability testing¶
- usability testing should take place after datasource validations
- Brad thinks this is faster than doing usability testing first
- Brad says the goal is to create the normalized DB, but Brian says we absolutely do need to get data to the scientists
- Brad thinks he would be able to produce the data request extracts, too (although he hasn't tried yet)
- to fully test the schema, we should find someone other than me or Brad to try extracting data from the schema (lower priority)
datasource validations¶
CVS¶
- do CVS to completion before moving on to other datasources
- change format to match VegBank as closely as possible, to facilitate making the VegBank-related fixes
Madidi¶
- either Peter Jorgensen or Brad can validate this, but send to Peter Jorgensen in any case
VegBank¶
- Bob planning to talk to Mike Lee about which columns to include in the validation
inter-datasource deduplication¶
- this should be part of the core DB (Brad)
adding new data¶
- not until the database is done
- consider adding more data from Canada, since this is a perpetual data hole (Brian)
To do for Brian¶
- let Naia and John know the status of getting the data to them (i.e. that we will be sending them test extracts to evaluate for completeness)
To do for Paul¶
- change the owner of everything in the geoscrub DB to bien
- do a final test run of the scripts (geoscrub.sh and update_validation_data.sh)
To do for Aaron¶
geoscrubbing¶
- test-run Paul's scripts once he's ready
- full-DB reload to get updated input data for the geoscrubbing re-run
  - takes 5.5 h on full input file
CVS validation¶
- change format to match VegBank
- VegBank-related fixes
- full-DB reload to get VegBank-related schema changes
- reload DB extract from Mike Lee
- send validation extract to Bob and Mike Lee
other validation¶
- Madidi (Peter Jorgensen/Brad)
- FIA
- MO (Peter Jorgensen)
- ARIZ, U, TEX (Brad)
- GBIF, UNCC