/ - Changes - BIEN 3 - NCEAS Projects

root @ 7055

#	Date	Author	Comment
7055	01/04/2013 09:09 PM	Aaron Marcuse-Kubitza	schemas/vegbien.sql: Added covering indexes where needed, as described at <https://projects.nceas.ucsb.edu/nceas/issues/549>
7054	01/04/2013 06:59 PM	Aaron Marcuse-Kubitza	schemas/vegbien.sql: fkeys to source: Added covering indexes where needed, as described at <https://projects.nceas.ucsb.edu/nceas/issues/549>
7053	01/04/2013 06:22 PM	Aaron Marcuse-Kubitza	schemas/vegbien.sql: commconcept: Renamed source_id back to reference_id (it was previously renamed to source_id in a bulk rename)
7052	01/04/2013 06:20 PM	Aaron Marcuse-Kubitza	schemas/vegbien.sql: taxondetermination: Added back reference_id, which is different than the scoping source_id (reference_id was previously renamed to source_id in a bulk rename)
7051	01/04/2013 06:04 PM	Aaron Marcuse-Kubitza	schemas/vegbien.sql: Renamed taxonconcept_concept_source_id_fkey back to taxonconcept_concept_reference_id_fkey
7050	01/04/2013 06:02 PM	Aaron Marcuse-Kubitza	schemas/vegbien.sql: Renamed _reference_id_fkey fkeys to _source_id_fkey
7049	01/04/2013 05:32 PM	Aaron Marcuse-Kubitza	inputs/CTFS/_no_import: Temporarily remove CTFS from the public DB per Rick Condit's request (due to validation issues)
7048	01/04/2013 05:25 PM	Aaron Marcuse-Kubitza	import_all: Run import with $public_import set in order to exclude excluded datasources
7047	01/04/2013 05:23 PM	Aaron Marcuse-Kubitza	input.Makefile: Import to VegBIEN: %/import: Don't run the import if $public_import flag is set and the datasource contains a _no_import file. This allows just excluding a datasource from the public DB, without also removing it from automated testing.
7046	01/04/2013 05:17 PM	Aaron Marcuse-Kubitza	lib/common.Makefile: Added $(and), $(or), $(not)
7045	01/04/2013 04:30 PM	Aaron Marcuse-Kubitza	mappings/VegCore.csv: Regenerated from wiki
7044	01/04/2013 04:12 PM	Aaron Marcuse-Kubitza	schemas/vegbien.sql: taxondetermination: Added scoping source_id field to allow other datasources (e.g. TNRS) to make taxondeterminations. (Repurposed existing non-scoping source_id.)
7043	01/03/2013 08:25 AM	Aaron Marcuse-Kubitza	make_analytical_db: Fixed bug where can't give public_ select access to all analytical_db views because this apparently adds access rather than passing through the underlying table's permissions
7042	01/03/2013 08:18 AM	Aaron Marcuse-Kubitza	make_analytical_db: Give public_ select access to analytical_db views. This causes the actual access to depend on the underlying table's permissions.
7041	01/03/2013 07:43 AM	Aaron Marcuse-Kubitza	mappings/VegCore.csv: Regenerated from wiki
7040	01/03/2013 07:39 AM	Aaron Marcuse-Kubitza	mappings/Makefile: VegCore.csv: Include only terms that start with a lowercase letter or are all caps. This also avoids the need to filter out VegCore.tables.csv.
7039	01/03/2013 07:31 AM	Aaron Marcuse-Kubitza	mappings/VegCore.csv: Changed line endings to \n in preparation for not running filter_out_cs on the file (which changes line endings to \r\n)
7038	01/03/2013 02:31 AM	Aaron Marcuse-Kubitza	import_all: `make backups/vegbien.$version.backup/test`: Documented that this uses $dump_opts. $dump_opts must be manually set when running this command outside of import_all.
7037	01/03/2013 02:21 AM	Aaron Marcuse-Kubitza	backups/Makefile: Synchronization: %/download: Download the .md5 file first, so that the user is prompted right away for their password rather than after the main file has finished downloading, by which time the password prompt has timed out
7036	01/03/2013 12:02 AM	Aaron Marcuse-Kubitza	mappings/VegCore.csv: Regenerated from wiki
7035	01/02/2013 11:34 PM	Aaron Marcuse-Kubitza	mappings/VegCore.csv: Regenerated from wiki
7034	01/02/2013 11:32 PM	Aaron Marcuse-Kubitza	mappings/Makefile: VegCore.csv: Fixed bug where need to filter out VegCore.tables.csv case-sensitively so that field names which are the same as a table name don't get filtered out
7033	01/02/2013 11:23 PM	Aaron Marcuse-Kubitza	Added filter_out_cs
7032	01/02/2013 09:21 PM	Aaron Marcuse-Kubitza	README.TXT: Data import: Added step to ensure there are no local modifications using `svn st`
7031	01/02/2013 08:56 PM	Aaron Marcuse-Kubitza	make_analytical_db: Also grant USAGE on the analytical_db schema itself to bien_read, public_
7030	01/02/2013 07:14 PM	Aaron Marcuse-Kubitza	README.TXT: Data import: After import: Also check that the provider_count table contains entries for all inputs
7029	01/02/2013 07:03 PM	Aaron Marcuse-Kubitza	Added inputs/.geoscrub/_src/geovalidity-table.txt, which was attached to Jim's geovalidation e-mail (provided in README.TXT)
7028	01/02/2013 06:43 PM	Aaron Marcuse-Kubitza	inputs/import.stats.xls: Updated import times
7027	01/02/2013 06:39 PM	Aaron Marcuse-Kubitza	README.TXT: Data import: recording the import times in inputs/import.stats.xls: Updated column group header to "By column"
7026	01/02/2013 06:36 PM	Aaron Marcuse-Kubitza	backups/Makefile: Removed no longer used $(psqlVerbose)
7025	01/02/2013 06:36 PM	Aaron Marcuse-Kubitza	backups/Makefile: Removed %.backup/rm_indexes, which is no longer needed because archived imports are now backed up instead of being stored without indexes in the live DB
7024	01/02/2013 06:31 PM	Aaron Marcuse-Kubitza	backups/Makefile: %.backup/remove: Fixed bug where need to use $no_search_path option to psql_script_vegbien
7023	12/21/2012 03:34 PM	Aaron Marcuse-Kubitza	import_all: Allow caller to override $dump_opts
7022	12/21/2012 03:33 PM	Aaron Marcuse-Kubitza	pg_dump_vegbien: Renamed $opts env var to $dump_opts to avoid conflicting with other commands' vars of the same name
7021	12/21/2012 03:22 PM	Aaron Marcuse-Kubitza	mappings/VegCore.csv: Regenerated from wiki
7020	12/21/2012 03:20 PM	Aaron Marcuse-Kubitza	mappings/VegCore.csv: Regenerated from wiki
7019	12/21/2012 02:14 PM	Aaron Marcuse-Kubitza	schemas/vegbien.sql: location: Removed location_unique_within_parent_by_sourceaccessioncode, which duplicates location_unique_within_creator_by_sourceaccessioncode because the sourceaccessioncode is globally unique
7018	12/21/2012 02:10 PM	Aaron Marcuse-Kubitza	schemas/vegbien.sql: analytical_stem_view: projectID: Use project.projectname if project.sourceaccessioncode isn't provided
7017	12/21/2012 02:02 PM	Aaron Marcuse-Kubitza	schemas/vegbien.sql: location: location_unique_within_parent: Split into _by_sourceaccessioncode and _by_authorlocationcode_position, with each ID being matched separately. This way, if the initial import of a subplot's location provides both fields, but fkey references use only one field, the fkey references will still match the existing location because only one of the fields needs to match.
7016	12/21/2012 01:26 PM	Aaron Marcuse-Kubitza	schemas/vegbien.sql: analytical_stem_view: elevationInMeters: Use parent_location.elevation_m when location.elevation_m not provided
7015	12/21/2012 01:17 PM	Aaron Marcuse-Kubitza	schemas/vegbien.sql: analytical_stem_view: scientificName: Fixed bug where need to use accepted_taxonlabel.taxonomicname instead of accepted_taxonverbatim.taxonomicname, because taxonverbatim's name component fields aren't populated if the name doesn't match a scrubbed name. The datasource's own taxonverbatim can't be used for this because the canon_label_id refers to the concatenated taxonomic name owned by the TNRS datasource.
7014	12/21/2012 01:00 PM	Aaron Marcuse-Kubitza	inputs/NVS/Plot/map.csv: Corrected Plot ID mapping to go to subplotID instead of locationID, because each subplot gets its own ID in this field
7013	12/21/2012 12:50 PM	Aaron Marcuse-Kubitza	schemas/vegbien.sql: location: location_unique_within_parent: Also apply this constraint when sourceaccessioncode is provided, because it may be a concatenated value populated for use by the analytical DB but which is not used as an fkey by the datasource itself
7012	12/21/2012 12:30 PM	Aaron Marcuse-Kubitza	schemas/vegbien.sql: analytical_stem_view: locationID: Concatenate parent location's and subplot's IDs using '; ' instead of ' '
7011	12/21/2012 12:22 PM	Aaron Marcuse-Kubitza	schemas/vegbien.sql: analytical_*: Renamed locationName to locationID because it's now globally unique (within the datasource) and can be used as a sourceaccessioncode
7010	12/21/2012 12:19 PM	Aaron Marcuse-Kubitza	schemas/vegbien.sql: analytical_stem_view: locationName: For subplots without their own sourceaccessioncode (globally unique ID), prepend the parent location's unique ID so that locationName is globally unique
7009	12/21/2012 12:07 PM	Aaron Marcuse-Kubitza	mappings/VegCore-VegBIEN.csv: locationID/locationName + subplot -> location.sourceaccessioncode mapping: Fixed bug where subplot was incorrectly being mapped to this field even when there was no location. (This field can only be populated if both location and subplot are specified.) Also only map locationID for this, to avoid inconsistencies where one table supplies locationID+subplot, while another table supplies locationName+subplot, but they both get mapped to the same field, preventing plots from being matched up with their observations when creating the analytical_stem.
7008	12/21/2012 11:31 AM	Aaron Marcuse-Kubitza	xml_func.py: Simplifying functions: Logic: _and(), _or(): Evaluate an expression of only constant values
7007	12/21/2012 11:30 AM	Aaron Marcuse-Kubitza	lists.py: Added and_(), or_()
7006	12/21/2012 11:28 AM	Aaron Marcuse-Kubitza	xml_func.py: is_scalar(): Fixed bug where need to check if value is a string before calling is_var_name()
7005	12/21/2012 10:15 AM	Aaron Marcuse-Kubitza	inputs/NVS/StemObservation/map.csv: Remapped Verbatim Code to authorTaxonCode, because as it's used this is actually an identifier for the taxon, not the stem, despite Nick Spencer's revised mapping
7004	12/20/2012 05:21 PM	Aaron Marcuse-Kubitza	schemas/vegbien.ERD.mwb: Regenerated exports
7003	12/20/2012 05:21 PM	Aaron Marcuse-Kubitza	README.TXT: Schema changes: Update graphical ERD exports: Added step to commit changes
7002	12/20/2012 05:02 PM	Aaron Marcuse-Kubitza	inputs/NVS/*/map.csv: Remapped with Nick Spencer's suggested changes
7001	12/20/2012 04:41 PM	Aaron Marcuse-Kubitza	xml_func.py: _first(): Fixed bug where need to choose the first non-empty param, by first pruning empty child nodes
7000	12/20/2012 04:38 PM	Aaron Marcuse-Kubitza	mappings/VegCore-VegBIEN.csv: authortaxoncode mappings: Only using authorTaxonCode if there is no plant ID: Added individualID, stemID to the terms that cause authorTaxonCode not to be mapped to VegBIEN authortaxoncode
6999	12/20/2012 04:03 PM	Aaron Marcuse-Kubitza	mappings/VegCore-VegBIEN.csv: authortaxoncode mappings: Only using authorTaxonCode if there is no plant ID: Added individualID, stemID to the terms that cause authorTaxonCode not to be mapped to VegBIEN authortaxoncode
6998	12/20/2012 03:59 PM	Aaron Marcuse-Kubitza	schemas/vegbien.sql: analytical_*: Renamed individualID to individualObservationID because this actually corresponds to plantobservation.sourceaccessioncode, which is an observation of an individual
6997	12/20/2012 03:56 PM	Aaron Marcuse-Kubitza	mappings/VegCore.csv: Regenerated from wiki
6996	12/20/2012 03:53 PM	Aaron Marcuse-Kubitza	README.TXT: Data import: Recording the import times: Changed <version> back to $version because these commands are actually run on vegbiendev, where $version is set. (Modifications to import.stats.xls would be made on your local machine.)
6995	12/20/2012 03:50 PM	Aaron Marcuse-Kubitza	README.TXT: Data import: Added step to unset $version before starting the import, to avoid importing on top of the last import's data
6994	12/20/2012 02:47 PM	Aaron Marcuse-Kubitza	README.TXT: Data import: Replaced $version with <version> where it needs to be manually filled in
6993	12/20/2012 02:40 PM	Aaron Marcuse-Kubitza	README.TXT: Data import: On nimoy: Added command to set $version
6992	12/20/2012 02:26 PM	Aaron Marcuse-Kubitza	mappings/VegCore-VegBIEN.csv: authortaxoncode mappings: Only use authorTaxonCode if there is no plant ID, because an individual plant gets its own taxonoccurrence and thus needs the taxonoccurrence's IDs to be unique to the plant, regardless of what the author designates as the taxonoccurrence code
6991	12/20/2012 01:47 PM	Aaron Marcuse-Kubitza	Generated inputs/NVS/new_terms.csv
6990	12/20/2012 01:47 PM	Aaron Marcuse-Kubitza	input.Makefile: SVN: $(svnFilesGlob): Also match *terms.csv in top-level dir
6989	12/20/2012 01:23 PM	Aaron Marcuse-Kubitza	mappings/VegCore-VegBIEN.csv: Mapped authorTaxonCode
6988	12/20/2012 01:12 PM	Aaron Marcuse-Kubitza	mappings/VegCore.csv: Regenerated from wiki
6987	12/20/2012 01:12 PM	Aaron Marcuse-Kubitza	README.TXT: Maintenance: VegCore data dictionary: Added step to commit updated mappings/VegCore.csv
6986	12/20/2012 12:13 PM	Aaron Marcuse-Kubitza	schemas/Makefile: %/publish: Fixed bug where commands were not being run transactionally, because --single-transaction requires `--file -` to work properly
6985	12/20/2012 11:36 AM	Aaron Marcuse-Kubitza	input.Makefile: Editing import: Removed rotate because appending the current svn revision doesn't make sense, since this is not related to the revision used to import the datasource
6984	12/20/2012 11:34 AM	Aaron Marcuse-Kubitza	input.Makefile: Editing import: Added rename/% and use it in rotate
6983	12/20/2012 11:21 AM	Aaron Marcuse-Kubitza	inputs/import.stats.xls: Updated import times
6982	12/20/2012 11:21 AM	Aaron Marcuse-Kubitza	schemas/Makefile: Use $* instead of $(@D) for clarity. $(@D) is only needed when the dir part of the target includes a prefix in addition to the % stem.
6981	12/20/2012 10:45 AM	Aaron Marcuse-Kubitza	make_analytical_db: Automatically call export_analytical_db when finished
6980	12/20/2012 10:35 AM	Aaron Marcuse-Kubitza	schemas/vegbien.sql: make_family_higher_plant_group(): Added `taxonepithet IS NOT NULL` filter, to allow make_analytical_db to proceed even when the NCBI import fails (leaving some nodes with rank = 'family' but no associated taxonepithet). The most recent NCBI import failed due to the search_path/DuplicateException bug resulting from the import schema and public being in the search_path together.
6979	12/20/2012 10:14 AM	Aaron Marcuse-Kubitza	schemas/Makefile: Fixed bug where need `SHELL := /bin/bash` for $(confirmRmPublicSchema) to work correctly
6978	12/20/2012 10:12 AM	Aaron Marcuse-Kubitza	lib/common.Makefile: $(confirm): Added comment that this requires `SHELL := /bin/bash` to work correctly
6977	12/20/2012 10:09 AM	Aaron Marcuse-Kubitza	import_all: after_import(): Added `make backups/vegbien.$version.backup/test`
6976	12/20/2012 10:05 AM	Aaron Marcuse-Kubitza	sql.py: DbConn._db(): search_path: Don't append the existing search_path, because it usually includes the public schema, which is now different from the schema being imported into. This fixes a bug where sql.function_exists() would find public-schema functions in both the public schema and the import's schema because both were in the search_path, causing a DuplicateException "more than one function named ...". Note that the elements of the existing search_path are no longer needed now that vegbien_dest's $schemas includes $public. Also note that if an instance of DbConn does not specify the schemas param, the existing search_path will be left as-is rather than overwritten with an empty list.
6975	12/20/2012 09:54 AM	Aaron Marcuse-Kubitza	README.TXT: Data import: recording the import times in inputs/import.stats.xls: Added step to determine the import date using import_date
6974	12/20/2012 09:52 AM	Aaron Marcuse-Kubitza	import_date: Added note that Mac and Linux differ in the order they sort the logs in
6973	12/20/2012 09:50 AM	Aaron Marcuse-Kubitza	README.TXT: Data import: recording the import times in inputs/import.stats.xls: Updated pattern for new log filename format
6972	12/20/2012 09:47 AM	Aaron Marcuse-Kubitza	README.TXT: Data import: recording the import times in inputs/import.stats.xls: Removed extra ./ before bin/import_times
6971	12/20/2012 09:46 AM	Aaron Marcuse-Kubitza	import_date: Added note that the time this outputs is the time the first special input finished importing. The import itself generally starts a few minutes before that, and the exact time is in that import's public schema comment.
6970	12/20/2012 09:41 AM	Aaron Marcuse-Kubitza	import_date: Removed duplicate Usage message at top of file, which is repeated in the Usage message provided when the program is run with no arguments
6969	12/20/2012 09:40 AM	Aaron Marcuse-Kubitza	Added import_date
6968	12/20/2012 09:38 AM	Aaron Marcuse-Kubitza	Added mtime
6967	12/20/2012 09:29 AM	Aaron Marcuse-Kubitza	lib/common.Makefile: System: Added $(mtime)
6966	12/20/2012 09:27 AM	Aaron Marcuse-Kubitza	lib/common.Makefile: $(date): Factored date format out into $(dateFmt)
6965	12/20/2012 09:25 AM	Aaron Marcuse-Kubitza	backups/Makefile: Factored $(isMac) out into lib/common.Makefile
6964	12/20/2012 08:30 AM	Aaron Marcuse-Kubitza	README.TXT: Data import: tailing logs: Updated pattern for new log filename format
6963	12/19/2012 02:02 PM	Aaron Marcuse-Kubitza	schemas/Makefile: Installation: %/publish: Fixed bug where need quotes around source schema name
6962	12/19/2012 01:57 PM	Aaron Marcuse-Kubitza	README.TXT: Data import: Moved deletion of previous imports before the import, so that full DB backup can be automated
6961	12/19/2012 01:55 PM	Aaron Marcuse-Kubitza	README.TXT: Data import: `make backups/vegbien.$version.backup/test`: Added --exclude-schema=public to leave out the previous (now published) import so it doesn't bloat the backup. Note that public is included in the vegbien.$version.backup for the previous import, named according to its version.
6960	12/19/2012 01:49 PM	Aaron Marcuse-Kubitza	import_all: after_import(): Added `make backups/TNRS.backup-remake`
6959	12/19/2012 01:46 PM	Aaron Marcuse-Kubitza	README.TXT: Data import: Added step to publish the import to the public schema
6958	12/19/2012 01:42 PM	Aaron Marcuse-Kubitza	import_all: after_import(): Added export_analytical_db
6957	12/19/2012 01:36 PM	Aaron Marcuse-Kubitza	README.TXT: Data import: bin/export_analytical_db: Removed `env public=$version` because export_analytical_db now uses $version as $public when provided
6956	12/19/2012 01:35 PM	Aaron Marcuse-Kubitza	README.TXT: Data import: To remake analytical DB: Removed `env public=...` because $version (which replaces $public) is now set automatically by import_all
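
Revision 7040 describes keeping only VegCore terms that start with a lowercase letter or are all caps. The log does not show the actual rule in mappings/Makefile, so the following is only a minimal Python sketch of that filter under stated assumptions: the function names is_vegcore_term() and filter_terms() and the sample term list are hypothetical and are not taken from the repository.

```python
def is_vegcore_term(name):
    """Return True if name starts with a lowercase letter or is all caps.

    Hypothetical helper: mirrors the rule stated in r7040, not the
    repository's actual implementation.
    """
    # name[:1] avoids an IndexError on the empty string
    return name[:1].islower() or name.isupper()


def filter_terms(names):
    """Keep only the names that pass is_vegcore_term(), preserving order."""
    return [n for n in names if is_vegcore_term(n)]


if __name__ == "__main__":
    # Illustrative input only; capitalized names stand in for table names.
    sample = ["locationID", "Location", "TNRS", "Plot", "subplot"]
    print(filter_terms(sample))
```

Under this rule, capitalized names such as table names drop out without a separate case-sensitive filtering pass, which is the side effect r7040 mentions.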
