/ - Changes - BIEN 3 - NCEAS Projects

root @ 4750

#	Date	Author	Comment
4750	09/17/2012 11:41 AM	Aaron Marcuse-Kubitza	inputs/*/*/map.csv: Mapped variants of subspecies directly to new subspecies term
4749	09/17/2012 11:31 AM	Aaron Marcuse-Kubitza	mappings/VegCore-VegBIEN.csv: subspecies, infraspecificEpithet: Added _alts for datasources that specify both
4748	09/17/2012 11:27 AM	Aaron Marcuse-Kubitza	input.Makefile: Mapping: $(map2db): Inline $(map) because this is the only place it's used
4747	09/17/2012 11:26 AM	Aaron Marcuse-Kubitza	input.Makefile: Mapping: $(map): Don't require flat files because they don't need to be used directly anymore (staging tables are used instead)
4746	09/17/2012 11:24 AM	Aaron Marcuse-Kubitza	input.Makefile: Mapping: $(map2db): Always use staging tables, because the flat files don't need to be used directly anymore
4745	09/17/2012 11:02 AM	Aaron Marcuse-Kubitza	mappings/Veg+-VegCore.csv: Remapped subspecies, subSpeciesName to new subspecies term
4744	09/17/2012 10:52 AM	Aaron Marcuse-Kubitza	mappings/VegCore-VegBIEN.csv: Mapped subspecies, variety, forma, cultivar
4743	09/17/2012 10:47 AM	Aaron Marcuse-Kubitza	mappings/Veg+.terms.csv: Added subspecies, variety, forma, cultivar
4742	09/17/2012 10:33 AM	Aaron Marcuse-Kubitza	Regenerated vegbien.ERD exports
4741	09/17/2012 10:30 AM	Aaron Marcuse-Kubitza	schemas/vegbien.sql: taxon.authority_id: Added descriptive comment that this is the authority which defines the taxon name (as opposed to the author of the taxon name)
4740	09/17/2012 10:29 AM	Aaron Marcuse-Kubitza	schemas/vegbien.sql: taxon: Added author_id for the author of the taxon name. This is distinct from authority_id, which is the authority used to determine which taxon name to apply.
4739	09/17/2012 10:14 AM	Aaron Marcuse-Kubitza	schemas/vegbien.sql: analytical_db_view: Use new denormalized placepath table instead of place, which significantly reduces the number of joins
4738	09/17/2012 10:11 AM	Aaron Marcuse-Kubitza	schemas/vegbien.sql: location: Removed stateprovince, country because these are now in placepath (as well as in place.rank)
4737	09/17/2012 10:01 AM	Aaron Marcuse-Kubitza	schemas/vegbien.sql: analytical_db_view: LEFT JOIN locationcoords and locationplace so that locations will be included even if they don't have one of these two determinations
4736	09/17/2012 10:00 AM	Aaron Marcuse-Kubitza	schemas/vegbien.sql: analytical_db_view: Fixed bug where method was being joined instead of left-joined, causing only rows with a method to be included
4735	09/17/2012 09:44 AM	Aaron Marcuse-Kubitza	Regenerated vegbien.ERD exports
4734	09/17/2012 09:41 AM	Aaron Marcuse-Kubitza	schemas/vegbien.sql: locationplace: Added identifier_id, so that different identifiers (e.g. the data provider and GNRS) can provide separate locationplaces even if the standardized name happens to be the same as the original name
4733	09/17/2012 09:31 AM	Aaron Marcuse-Kubitza	mappings/VegBank-VegBIEN.csv: Added place->locationplace renaming
4732	09/17/2012 09:30 AM	Aaron Marcuse-Kubitza	mappings/VegBIEN-VegBank.csv: Reversed the order of the columns so it's a more natural forward renaming, and renamed the file to VegBank-VegBIEN.csv to reflect the new column order
4731	09/17/2012 09:27 AM	Aaron Marcuse-Kubitza	mappings/VegBIEN-VegBank.csv: Fixed order of plantconcept->taxon renaming because the VegBIEN column is on the right
4730	09/17/2012 09:26 AM	Aaron Marcuse-Kubitza	schemas/vegbien.sql: Renamed namedplace to place for simplicity and consistency with placepath and locationplace
4729	09/17/2012 09:09 AM	Aaron Marcuse-Kubitza	schemas/vegbien.sql: taxon: Made authority an fkey to reference instead of a text field
4728	09/17/2012 09:03 AM	Aaron Marcuse-Kubitza	schemas/vegbien.sql: Moved steps to include a taxon name at a rank with no explicit column from taxon's comment to taxonpath's comment, because that is the table the steps apply to
4727	09/17/2012 09:00 AM	Aaron Marcuse-Kubitza	schemas/vegbien.sql: Added placepath (analogous to taxonpath), and point locationplace to it instead of directly to namedplace
4726	09/17/2012 08:11 AM	Aaron Marcuse-Kubitza	schemas/vegbien.sql: Split locationdetermination into locationcoords and locationplace, so that coordinate determinations can be made separately from place determinations
4725	09/17/2012 07:22 AM	Aaron Marcuse-Kubitza	schemas/vegbien.sql: location: Removed authore, authorn because this information is now in locationdetermination as verbatimlongitude, verbatimlatitude
4724	09/17/2012 07:20 AM	Aaron Marcuse-Kubitza	schemas/vegbien.sql: location: Removed centerlatitude/longitude, publiclatitude/longitude because this information is now in locationdetermination
4723	09/17/2012 06:49 AM	Aaron Marcuse-Kubitza	schemas/vegbien.ERD.mwb: Fixed lines
4722	09/17/2012 06:48 AM	Aaron Marcuse-Kubitza	mappings/VegBIEN-VegBank.csv: Added table rename plantconcept->taxon
4721	09/17/2012 06:47 AM	Aaron Marcuse-Kubitza	schemas/vegbien.sql: taxonpath.scientificnamewithauthor: Added comment that it's equivalent to "Name sec. x"
4720	09/17/2012 06:43 AM	Aaron Marcuse-Kubitza	schemas/vegbien.sql: taxon: Added comment that it's VegBank's plantConcept table
4719	09/14/2012 11:21 PM	Aaron Marcuse-Kubitza	Regenerated vegbien.ERD exports
4718	09/14/2012 11:18 PM	Aaron Marcuse-Kubitza	schemas/vegbien.sql: Renamed plantconcept to taxonpath for consistency with DwC's Taxon category and to emphasize that the table stores taxonomic paths
4717	09/14/2012 11:11 PM	Aaron Marcuse-Kubitza	schemas/vegbien.sql: Renamed plantname to taxon for consistency with DwC's Taxon category
4716	09/14/2012 11:02 PM	Aaron Marcuse-Kubitza	schemas/vegbien.sql: plantname: Renamed plantname field to taxonname for consistency with DwC's Taxon category
4715	09/14/2012 10:55 PM	Aaron Marcuse-Kubitza	Regenerated vegbien.ERD exports
4714	09/14/2012 10:49 PM	Aaron Marcuse-Kubitza	Updated aggregated unmapped_terms.csv, new_terms.csv. This removes terms that contained a filter (which is now in a separate column) and moves new terms that are unmapped from new_terms.csv to unmapped_terms.csv. Note that the majority of unmapped terms are from VegBank's huge tables, and are not part of the core fields needed for the analytical DB.
4713	09/14/2012 10:41 PM	Aaron Marcuse-Kubitza	schemas/vegbien.sql: taxonrank: Switched to using extended taxonomic ranks list derived from VegX at <https://projects.nceas.ucsb.edu/nceas/projects/bien/wiki/VegBIEN_taxonomic_schema#Extended>. This renames division to phylum and splits up 'cultivar/forma'.
4712	09/14/2012 10:39 PM	Aaron Marcuse-Kubitza	schemas/vegbien.sql: taxonrank: Removed 'authority', which doesn't belong as a taxonomic rank
4711	09/14/2012 10:38 PM	Aaron Marcuse-Kubitza	schemas/vegbien.sql: plantname: Added authority so each taxonomic level can have its own authority (author). Include it in the plantname_unique unique index because plantname is a globally scoped table.
4710	09/14/2012 10:25 PM	Aaron Marcuse-Kubitza	schemas/vegbien.sql: taxonrank: Removed 'binomial', which doesn't belong as a taxonomic rank
4709	09/14/2012 10:24 PM	Aaron Marcuse-Kubitza	schemas/vegbien.sql: Changed analytical_db_view to use new denormalized taxonomic names in plantconcept, which significantly reduces the number of joins. Note that changing the tables used by a view which depends on other tables will cause those tables to be reordered in dependency order to appear before the view, causing things to be moved around in the svn diff.
4708	09/14/2012 10:01 PM	Aaron Marcuse-Kubitza	inputs/Madidi/Organism/map.csv: Remapped Specie+autor to new scientificNameWithAuthorship. Mapped Species and morphotypes to now-available scientificName.
4707	09/14/2012 09:59 PM	Aaron Marcuse-Kubitza	mappings/VegCore-VegBIEN.csv: Moved scientificNameWithAuthorship before scientificName in taxonoccurrence.authortaxoncode's _alts
4706	09/14/2012 09:55 PM	Aaron Marcuse-Kubitza	mappings/VegCore-VegBIEN.csv: Mapped scientificNameWithAuthorship as an _alt of taxonoccurrence.authortaxoncode
4705	09/14/2012 09:53 PM	Aaron Marcuse-Kubitza	mappings/VegCore-VegBIEN.csv: Mapped scientificNameWithAuthorship
4704	09/14/2012 09:51 PM	Aaron Marcuse-Kubitza	mappings/Veg+.terms.csv: Added scientificNameWithAuthorship
4703	09/14/2012 09:47 PM	Aaron Marcuse-Kubitza	mappings/VegCore-VegBIEN.csv: Taxonomic names: Remapped to new denormalized fields in plantconcept
4702	09/14/2012 09:08 PM	Aaron Marcuse-Kubitza	schemas/vegbien.sql: plantname: Added comment documenting how to include a taxon name at a rank with no explicit column, by using the plantname table as an ordered linked list linked together using parent_id. (This method of using a linked list is one way of storing an ordered list of user-defined data. It is similar to using locationevent.previous_id to link successive reobservations of the same location together.) Note that plantname can store both the official tree of life and the data provider's own custom tree of life (or a subset thereof), with the two being distinguished by whether the data provider's or TNRS's taxondeterminations point to them.
4701	09/14/2012 08:53 PM	Aaron Marcuse-Kubitza	schemas/vegbien.sql: plantname: Added verbatimrank to store ranks of custom taxonomic levels, such as rosids. Note that even if you specify a custom verbatimrank, you must also specify a closest-match rank from the taxonrank closed list. This ensures that every taxonomic name is placed in the correct relative order in the taxonomic hierarchy.
4700	09/14/2012 08:38 PM	Aaron Marcuse-Kubitza	schemas/vegbien.sql: plantconcept: Made plantname_id optional because the datasource's plantconcepts do not need to be placed in the recursive plantname hierarchy
4699	09/14/2012 08:35 PM	Aaron Marcuse-Kubitza	schemas/vegbien.sql: plantconcept: Added datasource_id and appropriate unique indexes to enable scoping by datasource. Moved plantcode right after datasource_id because it will be used for the sourceaccessioncode (if any).
4698	09/14/2012 08:21 PM	Aaron Marcuse-Kubitza	schemas/vegbien.sql: Moved plantconcept.plantdescription to plantname and renamed it to description, so that a taxon of any rank can have a description
4697	09/14/2012 08:02 PM	Aaron Marcuse-Kubitza	schemas/vegbien.sql: plantconcept: Added denormalized taxonomic ranks from <https://projects.nceas.ucsb.edu/nceas/projects/bien/wiki/VegBIEN_taxonomic_schema#Primary> and concatenated scientific name fields
4696	09/14/2012 07:25 PM	Aaron Marcuse-Kubitza	Removed no longer used ucase_first
4695	09/14/2012 07:23 PM	Aaron Marcuse-Kubitza	Removed no longer used bin/union
4694	09/14/2012 07:22 PM	Aaron Marcuse-Kubitza	Removed no longer used join_union_sort
4693	09/14/2012 07:21 PM	Aaron Marcuse-Kubitza	Removed no longer used ci_map, because all relevant mapping scripts are now case-insensitive
4692	09/14/2012 07:19 PM	Aaron Marcuse-Kubitza	mappings/Makefile: Inline $(review_) because it's only used once
4691	09/14/2012 07:18 PM	Aaron Marcuse-Kubitza	mappings/Makefile: Removed no longer used $(review)
4690	09/14/2012 07:17 PM	Aaron Marcuse-Kubitza	mappings/Makefile: Don't set $(SHELL) to /bin/bash because this is no longer needed
4689	09/14/2012 07:16 PM	Aaron Marcuse-Kubitza	mappings/Makefile: Removed empty VegCSV section. mappings/Makefile's only functionality is now to clean up (sort) the core maps whenever they change and create human-readable maps from them.
4688	09/14/2012 07:13 PM	Aaron Marcuse-Kubitza	mappings/Makefile: Removed no longer used self maps, because the new automapping mechanism does not use them
4687	09/14/2012 07:09 PM	Aaron Marcuse-Kubitza	input.Makefile: Existing maps discovery: Substituted Veg+ for $(via) because it's now only used once
4686	09/14/2012 07:05 PM	Aaron Marcuse-Kubitza	mappings/VegCore-VegBIEN.csv: Changed input column header from VegCore[Veg+] to VegCore because this is more accurate. This is possible now that we're using new automapping scripts that do not require a particular column header.
4685	09/14/2012 06:39 PM	Aaron Marcuse-Kubitza	inputs/*/*/map.csv: Changed _merge to _join everywhere because _merge's (slower) duplicate elimination functionality is not needed (the combined columns do not both contain the same value, so they can simply be concatenated)
4684	09/14/2012 06:38 PM	Aaron Marcuse-Kubitza	inputs/*/*/map.csv: Changed _merge to _join everywhere because _merge's (slower) duplicate elimination functionality is not needed (the combined columns do not both contain the same value, so they can simply be concatenated)
4683	09/14/2012 06:21 PM	Aaron Marcuse-Kubitza	schemas/functions.sql: _label(): Accept params of any type, in order to support types other than text (which come from staging tables that are imported directly from a SQL export). This fixes a bug in SALVIAS.plotMetadata's column-based import.
4682	09/14/2012 06:17 PM	Aaron Marcuse-Kubitza	schemas/functions.sql: _label(): Support NULL labels by not prepending a label
4681	09/14/2012 06:04 PM	Aaron Marcuse-Kubitza	mappings/Veg+-VegCore.csv: Changed output column header from Veg+ to VegCore because this is more accurate. This is possible now that we're using new automapping scripts that do not require a particular column header. Note that this change now requires the map.csvs to use VegCore as their output column header, because otherwise the Veg+ header will get automapped to VegCore. (The header replacing is a feature to support changing the header when the schema of the column's terms changes.)
4680	09/14/2012 06:03 PM	Aaron Marcuse-Kubitza	mappings/root.sh: Changed output column header from Veg+ to VegCore because this is more accurate following the initial automapping
4679	09/14/2012 05:59 PM	Aaron Marcuse-Kubitza	inputs/*/*/map.csv: Changed output column header from Veg+ to VegCore because the names will be VegCore names after automapping. This is possible now that we're using new automapping scripts that do not require a particular column header.
4678	09/14/2012 05:53 PM	Aaron Marcuse-Kubitza	inputs/import.stats.xls: Copied the Change factor formula to all rows (it displays an empty string for rows that don't have both a row-based and a column-based import)
4677	09/14/2012 05:49 PM	Aaron Marcuse-Kubitza	README.TXT: Data import: Added steps to record the import times in inputs/import.stats.xls
4676	09/14/2012 05:42 PM	Aaron Marcuse-Kubitza	inputs/import.stats.xls: Updated with stats from latest import
4675	09/14/2012 05:40 PM	Aaron Marcuse-Kubitza	Added import_times
4674	09/13/2012 02:40 PM	Aaron Marcuse-Kubitza	mappings/root.sh: Removed no longer needed $in_root_suffix
4673	09/13/2012 02:39 PM	Aaron Marcuse-Kubitza	src_map: Upgraded to match new map format by adding Filter column
4672	09/13/2012 02:38 PM	Aaron Marcuse-Kubitza	input.Makefile: $(viaMaps): Fixed bug: it cannot be wrapped in $(wildcard), because that would prevent map.csv from being created when a new datasource or new subdir is added
4671	09/12/2012 05:36 PM	Aaron Marcuse-Kubitza	input.Makefile: $(viaMaps): Removed extra addition of */map.csv, which is already included because all $(tables) have or will get a map.csv
4670	09/12/2012 05:34 PM	Aaron Marcuse-Kubitza	mappings/: Removed no longer used derived file Veg+.vocab.csv
4669	09/12/2012 05:33 PM	Aaron Marcuse-Kubitza	input.Makefile: Removed no longer used $(vocab)
4668	09/12/2012 05:32 PM	Aaron Marcuse-Kubitza	input.Makefile: Maps validation: %/new_terms.csv: Filter out $(coreMap) and $(dict) successively instead of $(vocab), to avoid requiring intermediate mapping files not edited by the user
4667	09/12/2012 05:28 PM	Aaron Marcuse-Kubitza	input.Makefile: Maps validation: $(newTerms): Don't hardcode the caller's first filter_out_ci by prerequisite position; instead allow them to specify the command (including the var name) themselves
4666	09/12/2012 05:24 PM	Aaron Marcuse-Kubitza	input.Makefile: Maps validation: $(newTerms): For simplicity, subset the columns before running filter_out_ci
4665	09/12/2012 05:20 PM	Aaron Marcuse-Kubitza	mappings/: Removed no longer used Veg+-VegBIEN.csv and derived autogen Veg+.self.csv
4664	09/12/2012 05:16 PM	Aaron Marcuse-Kubitza	input.Makefile: Maps building: %/unmapped_terms.csv: Use $(coreMap) instead of $(vocab) because the terms should already be translated to VegCore terms, rather than still being Veg+
4663	09/12/2012 05:13 PM	Aaron Marcuse-Kubitza	input.Makefile: Maps validation: $(newTerms): Fixed bug where header needed to be removed before running filter_out_ci because filter_out_ci only removes the header if it matches the vocabulary's header. Removing the header afterward can cause the first row to be removed instead if the header was already removed.
4662	09/12/2012 05:11 PM	Aaron Marcuse-Kubitza	cols: Support CSVs without a header, such as intermediates that become unmapped_terms.csv, new_terms.csv
4661	09/12/2012 04:37 PM	Aaron Marcuse-Kubitza	inputs/: Regenerated unmapped_terms.csv, new_terms.csv
4660	09/12/2012 04:25 PM	Aaron Marcuse-Kubitza	input.Makefile: %/.map.csv.last_cleanup: Removed no longer used prerequisite $(vocab)
4659	09/12/2012 04:24 PM	Aaron Marcuse-Kubitza	input.Makefile: %/.map.csv.last_cleanup: Canonicalize separately on $(coreMap) and $(dict), instead of requiring them to be combined in $(vocab)
4658	09/12/2012 04:20 PM	Aaron Marcuse-Kubitza	input.Makefile: Use mappings/VegCore-VegBIEN.csv instead of mappings/Veg+-VegBIEN.csv as the core map, because the automapper now takes care of Veg+ -> VegCore translation
4657	09/12/2012 04:14 PM	Aaron Marcuse-Kubitza	inputs/*/*/map.csv: Moved filter suffixes to separate filter column to enable automapping to work on those mappings' terms, using the steps at <https://projects.nceas.ucsb.edu/nceas/projects/bien/wiki/Map_refactoring#Move-filter-suffixes-to-separate-filter-column>. Note that the only changes to VegBIEN.csvs are the (now automapped) names of terms in "No join mapping" comments.
4656	09/12/2012 03:37 PM	Aaron Marcuse-Kubitza	inputs/*/*/map.csv: Added Filter column to contain any suffix added after the term, so that the automapping mechanism does not have to deal with the filter expressions
4655	09/12/2012 03:35 PM	Aaron Marcuse-Kubitza	Added cat_cols
4654	09/12/2012 03:34 PM	Aaron Marcuse-Kubitza	Added ins_col
4653	09/12/2012 03:13 PM	Aaron Marcuse-Kubitza	input.Makefile: Maps building: %/.map.csv.last_cleanup: Reference fixed prerequisites by name instead of by position in the prerequisites list
4652	09/12/2012 02:28 PM	Aaron Marcuse-Kubitza	Removed no longer used intersect
4651	09/12/2012 02:18 PM	Aaron Marcuse-Kubitza	inputs/*/*/map.csv: Removed no longer needed [Veg+] suffix in root, because the input column is no longer used by old-style map utilities such as union that needed this
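
Note on r4736 and r4737: both revisions switch a plain join in analytical_db_view to a LEFT JOIN so that rows without a matching related record are still returned. The sketch below illustrates that difference in general terms only. It uses Python's built-in sqlite3 module and a hypothetical two-table schema (observation, method); these names and columns are assumptions for illustration and are not taken from schemas/vegbien.sql.

```python
import sqlite3

# Hypothetical miniature schema: the table and column names are illustrative
# only and do not reproduce the actual vegbien.sql definitions.
conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE method (method_id INTEGER PRIMARY KEY, name TEXT);
    CREATE TABLE observation (
        observation_id INTEGER PRIMARY KEY,
        method_id INTEGER REFERENCES method (method_id)  -- may be NULL
    );
    INSERT INTO method VALUES (1, 'plot census');
    INSERT INTO observation VALUES (10, 1), (11, NULL);
""")

# Inner join: an observation with no method is silently dropped.
inner_rows = conn.execute("""
    SELECT o.observation_id, m.name
    FROM observation o
    JOIN method m ON m.method_id = o.method_id
    ORDER BY o.observation_id
""").fetchall()

# Left join: every observation is kept; missing methods come back as NULL.
left_rows = conn.execute("""
    SELECT o.observation_id, m.name
    FROM observation o
    LEFT JOIN method m ON m.method_id = o.method_id
    ORDER BY o.observation_id
""").fetchall()

print(inner_rows)  # [(10, 'plot census')]
print(left_rows)   # [(10, 'plot census'), (11, None)]
```

The inner join returns only the observation that has a method, which is the symptom described in r4736; the left join returns both observations.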