2014-06-06 separate conference call on data dictionary
Martha's notes
the order in which Aaron should proceed with defining terms is:
1) analytical_stem_view
2) viewFullOccurrence [mapping to analytical_stem_view done]
3) the tables from the normalized VegBIEN schema that are necessary to create the analytical_stem_view and viewFullOccurrence tables [see analytical_stem_view tables]
Regarding what we are referring to as the "viewFullOccurrence" table: Aaron, you have the DDL from Brad for the BIEN3 viewFullOccurrence table. For guidance on which VegBIEN tables are used to create it, look at the scripts for the BIEN2 bien_web.observation table.
For priority #3 in the previous message (below), Brian McGill provided the list of the VegBIEN tables necessary to create analytical_stem_table:
- plotsourcelist
- specimenreplicate
- collector[party]
- aggregateoccurrence
- taxonverbatim
- party identifiedby[party]
- taxondetermination
- taxon_scrub
- taxonoccurrence
- plantobservation
- stemobservation
- party_collector[party]
- taxonlabel
- family_higher_plant_group
- cultivated_family_locations
- threatened_taxonlabel

He said to first document each table in terms scientists can understand. After that, proceed with documenting the columns in the tables.
decisions
- OK to do a first pass filling in the definitions from memory (Mark)
  - "for right now, we just need that list of definitions for attributes" (Mark)
- the primary purpose of the data dictionary is to help the scientists understand the terms
  - the fields that are most useful to the scientists are the highest priority (Mark)
- "it is also necessary to document the database in toto, so that someone can intelligently use and extend it" (Mark)
- put data dictionary in Google spreadsheet for easier editing by scientists
to do for Aaron
- create Google spreadsheet for the analytical_stem_view data dictionary (see VegBIEN data dictionary spreadsheet)
  - include the following columns: column, type, definition or formula, comment, provenance (part of col name), normalized-VegBIEN equivalent (same as formula), approved by (for scientists' initials)
- fill in definitions using your understanding of the terms
  - start with fields that are most useful to the scientists
- indicate provenance for every attribute
- make column name links work in Chrome
  - explicit hyperlinks have been added in the Google spreadsheet
- switch to definitions from the source data dictionaries
- add table prefix to every attribute
- "Define the attributes for analytical_stem_view and viewFullOccurrence, and then move on to describing the tables from VegBIEN that are necessary to create those tables, followed by defining the terms in those VegBIEN tables." (Martha)
- "the tables from VegBIEN that Brian McGill listed also need to be documented" (Martha)
- "we ultimately need descriptions/definitions of ALL schema objects, including e.g. Tables as Brian McGill has already requested" (Mark)
- in phpPgAdmin, make embedded links clickable
- for renamed terms, indicate what type of renaming was performed (e.g. camelcased)
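
As an illustration of the spreadsheet layout described in the to-do list, here is a minimal Python sketch that builds one data dictionary row as CSV. The column headers come from the list above. Everything else is an assumption made for the example: the helper names (`make_row`, `rename_type`, `to_csv`), the rename categories other than "camelcased", and the sample column `exampleColumn`, which is hypothetical and not taken from the actual analytical_stem_view.

```python
import csv
import io

# Column headers copied from the to-do list above.
COLUMNS = [
    "column",
    "type",
    "definition or formula",
    "comment",
    "provenance (part of col name)",
    "normalized-VegBIEN equivalent (same as formula)",
    "approved by",
]


def rename_type(source_name, new_name):
    """Classify how new_name was derived from source_name.

    The categories are hypothetical; only "camelcased" is named in the notes.
    """
    if source_name == new_name:
        return "unchanged"
    if source_name.lower() == new_name.lower():
        return "case change only"
    # camelcased: underscores dropped, each following word capitalized
    parts = source_name.split("_")
    camel = parts[0] + "".join(p[:1].upper() + p[1:] for p in parts[1:])
    if camel == new_name:
        return "camelcased"
    return "other"


def make_row(table, column, col_type, definition="", comment="",
             provenance="", equivalent="", approved_by=""):
    """Build one dictionary row, adding the table prefix to the attribute."""
    return {
        "column": f"{table}.{column}",
        "type": col_type,
        "definition or formula": definition,
        "comment": comment,
        "provenance (part of col name)": provenance,
        "normalized-VegBIEN equivalent (same as formula)": equivalent,
        "approved by": approved_by,
    }


def to_csv(rows):
    """Serialize rows to CSV text that can be imported into a spreadsheet."""
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=COLUMNS)
    writer.writeheader()
    writer.writerows(rows)
    return buffer.getvalue()


if __name__ == "__main__":
    # Hypothetical attribute, used only to show the row shape.
    kind = rename_type("example_column", "exampleColumn")
    row = make_row("analytical_stem_view", "exampleColumn", "text",
                   comment=f"renamed from example_column ({kind})")
    print(to_csv([row]))
```

Leaving "approved by" empty by default matches its purpose in the notes: the scientists fill in their initials once they have reviewed a definition.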