Project

General

Profile

2012-11-28 breakout groups

IUCN

  • time element
  • population change 10-100 yrs
  • abundance
  • inputs: range size, estimates, land cover/change layers
  • range fragmentation
  • growth forms
  • range size estimates: convex hull, maxent
  • decision tree: outcomes -> threatened, vulnerable
  • contact IUCN contacts
  • global classification of threatened status
  • conservation value of species
  • species within country boundaries
  • workflow
  • continental/global scale
  • outside feedback on decision tree
  • who to best contact
  • database of IUCN info
    • transparency: trace back how done
  • herbarium specimen data->deduce status
  • how many species to detect
  • applying redlist criteria to specimens
  • range modeling w/ convex hull->range size
  • rasterized species distrib
  • use ranges as part of tool
  • select area->list on the fly
  • list, status for any area
  • classification of species
  • aggregate by vegetation class?

Geo/Eco/Phylo

  • 10km grid cells
  • convert ~40,000 polygon range models to rasters
  • aggregate to 100km cells to create easier to work with
  • matrix where rows are unique
  • presence/absence at 10km resolution, per cell in matrix
  • script to unpack matrix to a map
  • actual observations
  • potential ranges from maxent
  • maxent->convex hull->Gentry
  • store as compactly as possible: not 12GB R file
  • 100s of MB
  • probabilistic estimate of presence/absence
  • probability estimate
  • occurrence data
  • need enough precision for that level
  • matrix tabulating presence/absence
  • matrix interpretation
  • shapefile for each species, which follows political boundaries
  • raster overlay
  • more vs. less inclusive for area
    • most conservative estimate: making sure that you get it if it's there
    • i.e. smaller
    • saying something is rare when it isn't is more problematic
  • already is a raster with the sampling intensity John has created
  • ultrametracize
  • increase coverage in tree: data w/o sequences
  • randomly placing species in genera
    • base vs. tip
  • matching unplaced taxa
    • non-trivial coding
    • Naim? after 3 mos
  • having to do something repetitively
  • phylogenetic conservatism of range size
  • phylogenetic signals
  • range size relationships
  • phylogenetic correction
  • grid data, plots
  • beta diversity
    • descriptive taxonomic/phylogenetic/functional diversity
  • what trait maps look like
  • community assembly
  • relationship between community-weighted traits
  • species lists, height data
  • remote data
  • how tall trees should be
  • talk to Jenz
  • what species are they? Panama+Peru
  • BIEN data, plot data
  • signal characters
  • trait descriptions
  • phylogenetic long-range length
  • relate traits to spatial variability and productivity estimates
  • cophonetic matrix of tips
  • traits, habit data
  • conservatism, species measures
  • with TRY data
  • geographic range, habit
  • important part of algorithms
  • measures of phylogenetic signals
  • algorithm for k
  • improved k->accessible framework

DB

Validations

  • candidate datasets
  • NYBG data in DwC, look at this afternoon
  • plots validations
  • refine list
  • programming standard part of import
  • does extraction conform to what contributed
  • data providers are dynamic databases, data changes
  • determinations changed->no match
  • detect gross discrepancies
  • analyze things at given points in time
  • outcomes with major economic impacts
  • what data working with
  • compare data ingested
  • compare raw CSV to analytical DB output
  • if output = input
  • details of validations
  • herbarium and plot source
  • try everything, the more the better
  • 2-part validations
  • check that aggregation of individuals is correct
  • area in which results will be stored, pointers to source data
  • std summarizations, visualizations
  • concentrate on papers

4 papers

  • TREE paper
    • descriptive version
    • popularizing it
  • white paper
    • what BIEN is about
    • wider than subgroup
    • more generic than BIEN so would not use BIEN in title
  • 1st paper focuses on BIEN 3.0
    • flexible schema for confederating biodiversity occurrences
    • standardization to TDWG standard
  • BIEN 4.0
    • what would we do
    • where you would want to go in the future
    • concurrently with BIEN 3.0 paper
    • IEEE/ACM paper: conceptual modeling, DB challenges
    • EER
  • next generation of biodiversity occurrence systems
    • build on iPlant proposal
    • pull together data of various types
  • generic challenges of where we want to go
  • where to go with BIEN
  • TREE limitation: authorship number
  • bioscience
  • concepts changing
  • opinion piece in Science or Nature
  • paper-related
  • feedback to data providers
  • what came into BIEN 3
  • range checks
    • aspect < 360deg
  • 300 data providers
  • dbSourceName: verbatim institutionCode
  • DwC herbarium codes
  • concatenated sources (duplicates)
  • critical juncture->send e-mail that data used in project, product is being launched
  • refresh data
  • send std e-mail, web form show to data provider if don't want to provide data
  • send errors to people instead of going someplace to get them
  • herbarium acronyms -> contact information
  • index by subprovider: include top-level and subsource
  • with DwC, data just goes in
  • compile from TNRS, geoscrub the list of errors
  • add validations
  • summaries on Plone site
  • draft or outline of project
  • WordPress maintain own page
  • Redmine site w/ notes
  • presentation on TDWG extension in Florence conference
  • afternoon subgroups
  • habit data
  • Barbara's habit scripts
  • meet back at 1:30pm

Stephanie on NCEAS

  • core funding for working groups from NSF
  • Packard foundation ecosystems
  • topical and core funding
  • NSF core funds renewed 5 years ago
  • no-cost extension on NSF money
  • 11 new working groups next year
  • Moore foundation covers core expenses over next 3 years
  • conservation, resource mgmt
  • press release in Jan/Feb for # new projects
  • smaller amounts to fund: scatter of research
  • what is on leading edge
  • DataONE online
    • big repositories
    • education: summer training workshop
  • data discovery and integration
  • synthesis
  • support from Packard foundation: scholarship model
  • summer training
  • 4-5 people working on it
  • continue beyond 3 years?
  • obtain more core NSF funds?
  • more working groups, sabbaticals, etc.
  • infrastructure for groups with own funding
    • very piecemeal
  • convene RCNs
  • not a convention center
  • collaboration tools to provide to a working group
  • suitably themed

Afternoon

  • 5:15pm Elsies
  • 6:30pm group dinner Blue Agave
  • whitepaper
  • focal points:
    • growth forms
    • ultrametric tree
    • IUCN decision tree
  • taxonomy issue
  • phenology
  • range size issue
  • 20 min to validate plots
  • validation scripts
  • traits
  • first groups:
    • IUCN: 2-3
    • range methodology: 3-4
    • validations: rest of afternoon, in main room
    • phenology and traits: next to main room
  • later groups:
    • BIEN whitepaper: Bob
  • rarity paper
  • metadata
  • BIEN2 flavors
    • analyses on different flavors

Regroup

Traits

  • phenology paper
  • when no info specimen description
  • geographic patterns in plant traits
  • mean traits, not mean variance
  • Plone site has presentation
  • papers on herbaria
  • parse habit groups
  • assess species level phenology for tropical species
  • extract tropical info
  • ready for comments
  • changing functional diversity of plots
  • volume niche
  • need to write paper in 2 weeks
  • functional volume temperate vs tropical
  • temperate sites in red, green is tropical, blue is overlap
  • functional beta diversity
  • mean trait value, overlap of 3D objects
  • latitudinal separation
  • functional data diversity
  • data is from plots

DB

  • decision tree decisions
  • extent of occurrence
  • critically endangered categories
  • 1 of 6 categories
  • use observations in cells
  • add pixels to correspond to IUCN criteria
  • model grain vs. observations
  • occupancy of pixels
  • < 5 occurrences
  • current, future threat
  • clipping model range by land use layer
  • for each pixel, desc how far each cell moves and whether acceptible pixel
  • under climate change, allow movement into adjacent pixels
  • possibilities for local
  • not future maxent based on climate change
  • assign confidence based on sampling intensity
  • attach confidence to category
  • start including abundance
  • in practice, abundance-weighted measures not used
  • project page to WordPress
  • use Plone as document and notes repository
  • Redmine wiki has running record
  • DB enhancements
  • code versioning repository
  • WordPress site is nascent public interface
    • based on PHP
  • technical things in Redmine

Whitepaper

  • bits and pieces
  • test of NY data: one mapping problem, but other than that looks great
  • CTFS, Madidi extracts
  • many more plots
  • will send export of data (different schema)
  • target schema
  • exchange schema: VegX?
  • technological questions
  • generic validations
  • is pipeline working
  • what stage in workflow process where doc takes place
  • clear wording
  • decide mechanistically how to make this work
  • Barbara Dobrin sending scripts to Aaron
    • sent one, adding doct to rest
    • keyword search
    • mining natural language
    • isolated keywords

To do

  • create workflow to infer taxon habit
  • put habit analysis algorithm into BIEN