2013-12-05 conference call¶
Martha's notes¶
to-dos:
Brian, Brad, Bob, Mark
Review the outstanding issues and features in the Data Sources table and flag them as "egregious" or "not egregious" so Aaron knows which to fix by December 20.
For any "egregious" items, followup with an email to the bien-db mailing list to inform everyone in case raging debate should ensue.
Brad
Send Aaron the final set of queries for the quantitative (aggregating) validations (using the SALVIAS, NYBG queries Aaron reminded you about).Aaron
CVS: Work with Bob and Mike to complete the "record spot check" validations using the current approach (input and output tabs).
GBIF, FIA, BIEN2 traits: Send Brad just the "output" information for record spot checking.
Run the aggregating queries on each source (dependent on Brad providing the queries)
Fix any problems deemed "egregious" in data sources (dependent on 3B's and Mark identifying the egregious ones).Brian, Brad, Bob, Mike
Document the information on data for which access is restricted (say, by end of January)
Martha
Clarify if there are schema changes to support terms of use that need to be made before Dec. 20 (even though we won't yet load that information).[no]Clarify whether the decision was to put the attribution information on the website for Dec 20.[yes, as part of each row]
upcoming¶
- call next week at usual time (Th 9am PT/10am Tucson/12pm ET)
- there will be a separate planning call on Tu. 12/17 at the usual time
- this is for just Martha, Mark, Bob, Brad, and Brian
- this may replace the usual Thursday call that week?
availability¶
- see the *Google spreadsheet* (and please add your availability for future weeks once it's known):
to do for Mark and Brian¶
indicate whether you are available on Tu. 12/17 at 9am PT/10am Tucson/12pm ET for the planning conference callthe call is happening then, regardless of their availability
to do for Bob¶
provide list of embargoed CVS plotssee CVS embargoes
to do for Aaron¶
spot-checking validation¶
CVS, FIA, GBIF
CVS validation¶
remove embargoed CVS plots
to do for Brad¶
aggregating validations¶
provide list of aggregating queries- use the SALVIAS and NYBG summarizing queries as a starting point?
spot-checking validation¶
- validate FIA using the downloads from their website
release timeline¶
for 12/20 release¶
- spot-checking validation: CVS,
FIA, GBIF - aggregating validations: output queries
attribution: top-level and primary data provider indicated in each recordseedatasource
,specimenHolderInstitutions
columns inanalytical_stem
.conditions of use: web page with conditions for each top-level datasourcesee Datasource conditions of use
for January¶
- materialized views: CSV export of
viewFullOccurrence
-style view- Martha says we won't get to this by the end of the year
for February/March¶
- have wider BIEN group look at DB
decisions made¶
spot-checking validation¶
- just create output tab in order to speed up creating the extract
- for 12/20 release, just focus on "egregious errors" (Mark)
- Brad will check FIA using the downloads from their website
aggregating validations (counts)¶
- need set of generic queries on plots, specimens, traits
- we will create just the output side of these queries; the data providers will create the input side
- Brian wants these by the end of the year
materialized views¶
needviewFullOccurrence
-style view (i.e.analytical_stem
)analytical_stem
just needs to be exported to CSV- don't need a web-based querying interface (Martha)
- but include just a human-readable subset of the
analytical_stem
columns (Bob)
attribution¶
for 12/20 release, just on web page, not in a separate DB schema
conditions of use¶
CVS export may contain embargoed records (for private lands and endangered species) which we need to remove