2013-12-05 conference call

Martha's notes

to-dos:

Brian, Brad, Bob, Mark

Review the outstanding issues and features in the Data Sources table and flag them as "egregious" or "not egregious" so Aaron knows which to fix by December 20.
For any "egregious" items, follow up with an email to the bien-db mailing list to inform everyone, in case raging debate should ensue.

Brad

Send Aaron the final set of queries for the quantitative (aggregating) validations (using the SALVIAS and NYBG queries Aaron reminded you about).

Aaron

CVS: Work with Bob and Mike to complete the "record spot check" validations using the current approach (input and output tabs).
GBIF, FIA, BIEN2 traits: Send Brad just the "output" information for record spot checking.
Run the aggregating queries on each source (dependent on Brad providing the queries).
Fix any problems deemed "egregious" in data sources (dependent on the 3 B's [Brian, Brad, Bob] and Mark identifying the egregious ones).

Brian, Brad, Bob, Mike

Document which data have restricted access (say, by the end of January).

Martha

Clarify if there are schema changes to support terms of use that need to be made before Dec. 20 (even though we won't yet load that information). [no]
Clarify whether the decision was to put the attribution information on the website for Dec 20. [yes, as part of each row]

upcoming

  • call next week at usual time (Th 9am PT/10am Tucson/12pm ET)
  • there will be a separate planning call on Tu. 12/17 at the usual time
    • this is for just Martha, Mark, Bob, Brad, and Brian
    • this may replace the usual Thursday call that week?

availability


to do for Mark and Brian

  • indicate whether you are available on Tu. 12/17 at 9am PT/10am Tucson/12pm ET for the planning conference call
    • the call is happening then, regardless of their availability

to do for Bob

to do for Aaron

spot-checking validation

  • CVS, FIA, GBIF

CVS validation

  • remove embargoed CVS plots

to do for Brad

aggregating validations

spot-checking validation

  • validate FIA using the downloads from their website

release timeline

for 12/20 release

  • spot-checking validation: CVS, FIA, GBIF
  • aggregating validations: output queries
  • attribution: top-level and primary data provider indicated in each record
    • see datasource, specimenHolderInstitutions columns in analytical_stem
  • conditions of use: web page with conditions for each top-level datasource
    • see Datasource conditions of use

for January

  • materialized views: CSV export of viewFullOccurrence-style view
    • Martha says we won't get to this by the end of the year

for February/March

  • have wider BIEN group look at DB

decisions made

spot-checking validation

  • just create output tab in order to speed up creating the extract
  • for 12/20 release, just focus on "egregious errors" (Mark)
  • Brad will check FIA using the downloads from their website

aggregating validations (counts)

  • need set of generic queries on plots, specimens, traits
  • we will create just the output side of these queries; the data providers will create the input side
  • Brian wants these by the end of the year
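As a rough illustration of what an output-side count query and its comparison against provider-supplied numbers could look like, here is a minimal sketch. It uses Python's built-in sqlite3 as a stand-in database; the table name output_specimens, its columns, and the provider counts are all made up for the example and are not the actual BIEN schema or the queries Brad is preparing.

```python
import sqlite3

def count_by_group(conn, table, group_col):
    """Return {group value: row count} for one table."""
    # table and group_col are interpolated directly, so pass trusted identifiers only
    sql = f'SELECT "{group_col}", COUNT(*) FROM "{table}" GROUP BY "{group_col}"'
    return dict(conn.execute(sql).fetchall())

def compare_counts(expected, actual):
    """Return {group: (expected, actual)} for every group whose counts differ."""
    keys = sorted(set(expected) | set(actual), key=str)
    return {k: (expected.get(k, 0), actual.get(k, 0))
            for k in keys if expected.get(k, 0) != actual.get(k, 0)}

# Demo data: hypothetical table and values, for illustration only
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE output_specimens (datasource TEXT, specimen_id TEXT)")
conn.executemany("INSERT INTO output_specimens VALUES (?, ?)",
                 [("NYBG", "s1"), ("NYBG", "s2"), ("SALVIAS", "s3")])

output_counts = count_by_group(conn, "output_specimens", "datasource")
provider_counts = {"NYBG": 2, "SALVIAS": 2}  # made-up input-side numbers
mismatches = compare_counts(provider_counts, output_counts)
```

Any entry in mismatches would be a candidate to review for the "egregious" list; an empty result means the counts agree for every group.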

materialized views

  • need viewFullOccurrence-style view (i.e. analytical_stem)
  • analytical_stem just needs to be exported to CSV
    • don't need a web-based querying interface (Martha)
  • but include just a human-readable subset of the analytical_stem columns (Bob)
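The CSV export of a column subset could be sketched roughly as follows. This is only an illustration under assumptions: sqlite3 stands in for the real database, the internal_id column and the sample row are invented, and the actual human-readable subset of analytical_stem columns is still to be chosen.

```python
import csv
import io
import sqlite3

def export_columns_to_csv(conn, table, columns, out):
    """Write the chosen columns of a table to a CSV stream, header row first."""
    # Identifiers are interpolated directly, so pass trusted names only
    col_list = ", ".join(f'"{c}"' for c in columns)
    cursor = conn.execute(f'SELECT {col_list} FROM "{table}"')
    writer = csv.writer(out, lineterminator="\n")
    writer.writerow(columns)
    writer.writerows(cursor)

# Demo data: a tiny stand-in table; internal_id is a hypothetical column
conn = sqlite3.connect(":memory:")
conn.execute('CREATE TABLE analytical_stem '
             '(datasource TEXT, "specimenHolderInstitutions" TEXT, internal_id INTEGER)')
conn.execute("INSERT INTO analytical_stem VALUES ('NYBG', 'NY', 17)")

buffer = io.StringIO()
export_columns_to_csv(conn, "analytical_stem",
                      ["datasource", "specimenHolderInstitutions"], buffer)
csv_text = buffer.getvalue()
```

Passing an explicit column list keeps the export independent of the full table layout, so columns left out of the list (here internal_id) never reach the file.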

attribution

  • for 12/20 release, just on web page, not in a separate DB schema

conditions of use

  • CVS export may contain embargoed records (for private lands and endangered species) which we need to remove
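One way to picture the embargo filtering is a simple partition of the export into releasable and withheld records. The sketch below is an assumption-laden illustration: the field names private_land and endangered_species are hypothetical, and how the CVS export actually marks embargoed records is not stated in these notes.

```python
def split_embargoed(records, is_embargoed):
    """Partition records into (releasable, withheld) using a caller-supplied test."""
    releasable, withheld = [], []
    for record in records:
        (withheld if is_embargoed(record) else releasable).append(record)
    return releasable, withheld

# Demo data: hypothetical field names and plots, for illustration only
plots = [
    {"plot": "A", "private_land": False, "endangered_species": False},
    {"plot": "B", "private_land": True, "endangered_species": False},
    {"plot": "C", "private_land": False, "endangered_species": True},
]
releasable, withheld = split_embargoed(
    plots, lambda r: r["private_land"] or r["endangered_species"])
```

Keeping the withheld list, rather than silently dropping rows, makes it easy to report how many records were removed and why.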