Project

General

Profile

Statistics
| Revision:

# Date Author Comment
7033 01/02/2013 11:23 PM Aaron Marcuse-Kubitza

Added filter_out_cs

7032 01/02/2013 09:21 PM Aaron Marcuse-Kubitza

README.TXT: Data import: Added step to ensure there are no local modifications using `svn st`

7031 01/02/2013 08:56 PM Aaron Marcuse-Kubitza

make_analytical_db: Also grant USAGE on the analytical_db schema itself to bien_read, public_

7030 01/02/2013 07:14 PM Aaron Marcuse-Kubitza

README.TXT: Data import: After import: Also check that the provider_count table contains entries for all inputs

7029 01/02/2013 07:03 PM Aaron Marcuse-Kubitza

Added inputs/.geoscrub/_src/geovalidity-table.txt, which was attached to Jim's geovalidation e-mail (provided in README.TXT)

7028 01/02/2013 06:43 PM Aaron Marcuse-Kubitza

inputs/import.stats.xls: Updated import times

7027 01/02/2013 06:39 PM Aaron Marcuse-Kubitza

README.TXT: Data import: recording the import times in inputs/import.stats.xls: Updated column group header to "By column"

7026 01/02/2013 06:36 PM Aaron Marcuse-Kubitza

backups/Makefile: Removed no longer used $(psqlVerbose)

7025 01/02/2013 06:36 PM Aaron Marcuse-Kubitza

backups/Makefile: Removed %.backup/rm_indexes, which is no longer needed because archived imports are now backed up instead of being stored without indexes in the live DB

7024 01/02/2013 06:31 PM Aaron Marcuse-Kubitza

backups/Makefile: %.backup/remove: Fixed bug where need to use $no_search_path option to psql_script_vegbien

7023 12/21/2012 03:34 PM Aaron Marcuse-Kubitza

import_all: Allow caller to override $dump_opts

7022 12/21/2012 03:33 PM Aaron Marcuse-Kubitza

pg_dump_vegbien: Renamed $opts env var to $dump_opts to avoid conflicting with other commands' vars of the same name

7021 12/21/2012 03:22 PM Aaron Marcuse-Kubitza

mappings/VegCore.csv: Regenerated from wiki

7020 12/21/2012 03:20 PM Aaron Marcuse-Kubitza

mappings/VegCore.csv: Regenerated from wiki

7019 12/21/2012 02:14 PM Aaron Marcuse-Kubitza

schemas/vegbien.sql: location: Removed location_unique_within_parent_by_sourceaccessioncode, which duplicates location_unique_within_creator_by_sourceaccessioncode because the sourceaccessioncode is globally unique

7018 12/21/2012 02:10 PM Aaron Marcuse-Kubitza

schemas/vegbien.sql: analytical_stem_view: projectID: Use project.projectname if project.sourceaccessioncode isn't provided

7017 12/21/2012 02:02 PM Aaron Marcuse-Kubitza

schemas/vegbien.sql: location: location_unique_within_parent: Split into *_by_sourceaccessioncode and *_by_authorlocationcode_position, with each ID being matched separately. This way, if the initial import of a subplot's location provides both fields, but fkey references use only one field, the fkey references will still match the existing location because only one of the fields needs to match.

7016 12/21/2012 01:26 PM Aaron Marcuse-Kubitza

schemas/vegbien.sql: analytical_stem_view: elevationInMeters: Use parent_location.elevation_m when location.elevation_m not provided

7015 12/21/2012 01:17 PM Aaron Marcuse-Kubitza

schemas/vegbien.sql: analytical_stem_view: scientificName: Fixed bug where need to use accepted_taxon*label*.taxonomicname instead of accepted_taxonverbatim.taxonomicname, because taxonverbatim's name component fields aren't populated if the name doesn't match a scrubbed name. The datasource's own taxonverbatim can't be used for this because the canon_label_id refers to the concatenated taxonomic name owned by the TNRS datasource.

7014 12/21/2012 01:00 PM Aaron Marcuse-Kubitza

inputs/NVS/Plot/map.csv: Corrected Plot ID mapping to go to subplotID instead of locationID, because each subplot gets its own ID in this field

7013 12/21/2012 12:50 PM Aaron Marcuse-Kubitza

schemas/vegbien.sql: location: location_unique_within_parent: Also apply this constraint when sourceaccessioncode is provided, because it may be a concatenated value populated for use by the analytical DB but which is not used as an fkey by the datasource itself

7012 12/21/2012 12:30 PM Aaron Marcuse-Kubitza

schemas/vegbien.sql: analytical_stem_view: locationID: Concatenate parent location's and subplot's IDs using '; ' instead of ' '

7011 12/21/2012 12:22 PM Aaron Marcuse-Kubitza

schemas/vegbien.sql: analytical_*: Renamed locationName to locationID because it's now globally unique (within the datasource) and can be used as a sourceaccessioncode

7010 12/21/2012 12:19 PM Aaron Marcuse-Kubitza

schemas/vegbien.sql: analytical_stem_view: locationName: For subplots without their own sourceaccessioncode (globally unique ID), prepend the parent location's unique ID so that locationName is globally unique

7009 12/21/2012 12:07 PM Aaron Marcuse-Kubitza

mappings/VegCore-VegBIEN.csv: locationID/locationName + subplot -> location.sourceaccessioncode mapping: Fixed bug where subplot was incorrectly being mapped to this field even when there was no location*. (This field can only be populated if both location* and subplot are specified.) Also only map locationID for this, to avoid inconsistencies where one table supplies locationID+subplot, while another table supplies locationName+subplot, but they both get mapped to the same field, preventing plots from being matched up with their observations when creating the analytical_stem.

7008 12/21/2012 11:31 AM Aaron Marcuse-Kubitza

xml_func.py: Simplifying functions: Logic: _and(), _or(): Evaluate an expression of only constant values

7007 12/21/2012 11:30 AM Aaron Marcuse-Kubitza

lists.py: Added and_(), or_()

7006 12/21/2012 11:28 AM Aaron Marcuse-Kubitza

xml_func.py: is_scalar(): Fixed bug where need to check if value is a string before calling is_var_name()

7005 12/21/2012 10:15 AM Aaron Marcuse-Kubitza

inputs/NVS/StemObservation/map.csv: Remapped Verbatim Code to authorTaxonCode, because as it's used this is actually an identifier for the taxon, not the stem, despite Nick Spencer's revised mapping

7004 12/20/2012 05:21 PM Aaron Marcuse-Kubitza

schemas/vegbien.ERD.mwb: Regenerated exports

7003 12/20/2012 05:21 PM Aaron Marcuse-Kubitza

README.TXT: Schema changes: Update graphical ERD exports: Added step to commit changes

7002 12/20/2012 05:02 PM Aaron Marcuse-Kubitza

inputs/NVS/*/map.csv: Remapped with Nick Spencer's suggested changes

7001 12/20/2012 04:41 PM Aaron Marcuse-Kubitza

xml_func.py: _first(): Fixed bug where need to choose the first non-empty param, by first pruning empty child nodes

7000 12/20/2012 04:38 PM Aaron Marcuse-Kubitza

mappings/VegCore-VegBIEN.csv: authortaxoncode mappings: Only using authorTaxonCode if there is no plant ID: Added individualID, stemID to the terms that cause authorTaxonCode not to be mapped to VegBIEN authortaxoncode

6999 12/20/2012 04:03 PM Aaron Marcuse-Kubitza

mappings/VegCore-VegBIEN.csv: authortaxoncode mappings: Only using authorTaxonCode if there is no plant ID: Added individualID, stemID to the terms that cause authorTaxonCode not to be mapped to VegBIEN authortaxoncode

6998 12/20/2012 03:59 PM Aaron Marcuse-Kubitza

schemas/vegbien.sql: analytical_*: Renamed individualID to individualObservationID because this actually corresponds to plantobservation.sourceaccessioncode, which is an observation of an individual

6997 12/20/2012 03:56 PM Aaron Marcuse-Kubitza

mappings/VegCore.csv: Regenerated from wiki

6996 12/20/2012 03:53 PM Aaron Marcuse-Kubitza

README.TXT: Data import: Recording the import times: Changed <version> back to $version because these commands are actually run on vegbiendev, where $version is set. (Modifications to import.stats.xls would be made on your local machine.)

6995 12/20/2012 03:50 PM Aaron Marcuse-Kubitza

README.TXT: Data import: Added step to unset $version before starting the import, to avoid importing on top of the last import's data

6994 12/20/2012 02:47 PM Aaron Marcuse-Kubitza

README.TXT: Data import: Replaced $version with <version> where it needs to be manually filled in

6993 12/20/2012 02:40 PM Aaron Marcuse-Kubitza

README.TXT: Data import: On nimoy: Added command to set $version

6992 12/20/2012 02:26 PM Aaron Marcuse-Kubitza

mappings/VegCore-VegBIEN.csv: authortaxoncode mappings: Only use authorTaxonCode if there is no plant ID, because an individual plant gets its own taxonoccurrence and thus needs the taxonoccurrence's IDs to be unique to the plant, regardless of what the author designates as the taxonoccurrence code

6991 12/20/2012 01:47 PM Aaron Marcuse-Kubitza

Generated inputs/NVS/new_terms.csv

6990 12/20/2012 01:47 PM Aaron Marcuse-Kubitza

input.Makefile: SVN: $(svnFilesGlob): Also match *terms.csv in top-level dir

6989 12/20/2012 01:23 PM Aaron Marcuse-Kubitza

mappings/VegCore-VegBIEN.csv: Mapped authorTaxonCode

6988 12/20/2012 01:12 PM Aaron Marcuse-Kubitza

mappings/VegCore.csv: Regenerated from wiki

6987 12/20/2012 01:12 PM Aaron Marcuse-Kubitza

README.TXT: Maintenance: VegCore data dictionary: Added step to commit updated mappings/VegCore.csv

6986 12/20/2012 12:13 PM Aaron Marcuse-Kubitza

schemas/Makefile: %/publish: Fixed bug where commands were not being run transactionally, because --single-transaction requires `--file -` to work properly

6985 12/20/2012 11:36 AM Aaron Marcuse-Kubitza

input.Makefile: Editing import: Removed rotate because appending the current svn revision doesn't make sense, since this is not related to the revision used to import the datasource

6984 12/20/2012 11:34 AM Aaron Marcuse-Kubitza

input.Makefile: Editing import: Added rename/% and use it in rotate

6983 12/20/2012 11:21 AM Aaron Marcuse-Kubitza

inputs/import.stats.xls: Updated import times

6982 12/20/2012 11:21 AM Aaron Marcuse-Kubitza

schemas/Makefile: Use $* instead of $(@D) for clarity. $(@D) is only needed when the dir part of the target includes a prefix in addition to the % stem.

6981 12/20/2012 10:45 AM Aaron Marcuse-Kubitza

make_analytical_db: Automatically call export_analytical_db when finished

6980 12/20/2012 10:35 AM Aaron Marcuse-Kubitza

schemas/vegbien.sql: make_family_higher_plant_group(): Added `taxonepithet IS NOT NULL` filter, to allow make_analytical_db to proceed even when the NCBI import fails (leaving some nodes with rank = 'family' but no associated taxonepithet). The most recent NCBI import failed due to the search_path/DuplicateException bug resulting from the import schema and public being in the search_path together.

6979 12/20/2012 10:14 AM Aaron Marcuse-Kubitza

schemas/Makefile: Fixed bug where need `SHELL := /bin/bash` for \$(confirmRmPublicSchema) to work correctly

6978 12/20/2012 10:12 AM Aaron Marcuse-Kubitza

lib/common.Makefile: $(confirm): Added comment that this requires `SHELL := /bin/bash` to work correctly

6977 12/20/2012 10:09 AM Aaron Marcuse-Kubitza

import_all: after_import(): Added `make backups/vegbien.$version.backup/test`

6976 12/20/2012 10:05 AM Aaron Marcuse-Kubitza

sql.py: DbConn._db(): search_path: Don't append the existing search_path, because it usually includes the public schema, which is now different from the schema being imported into. This fixes a bug where sql.function_exists() would find public-schema functions in both the public schema and the import's schema because both were in the search_path, causing a DuplicateException "more than one function named ...". Note that the elements of the existing search_path are no longer needed now that vegbien_dest's $schemas includes $public. Also note that if an instance of DbConn does not specify the schemas param, the existing search_path will be left as-is rather than overwritten with an empty list.

6975 12/20/2012 09:54 AM Aaron Marcuse-Kubitza

README.TXT: Data import: recording the import times in inputs/import.stats.xls: Added step to determine the import date using import_date

6974 12/20/2012 09:52 AM Aaron Marcuse-Kubitza

import_date: Added note that Mac and Linux differ in the order they sort the logs in

6973 12/20/2012 09:50 AM Aaron Marcuse-Kubitza

README.TXT: Data import: recording the import times in inputs/import.stats.xls: Updated pattern for new log filename format

6972 12/20/2012 09:47 AM Aaron Marcuse-Kubitza

README.TXT: Data import: recording the import times in inputs/import.stats.xls: Removed extra ./ before bin/import_times

6971 12/20/2012 09:46 AM Aaron Marcuse-Kubitza

import_date: Added note that the time this outputs is the time the first special input finished importing. The import itself generally starts a few minutes before that, and the exact time is in that import's public schema comment.

6970 12/20/2012 09:41 AM Aaron Marcuse-Kubitza

import_date: Removed duplicate Usage message at top of file, which is repeated in the Usage message provided when the program is run with no arguments

6969 12/20/2012 09:40 AM Aaron Marcuse-Kubitza

Added import_date

6968 12/20/2012 09:38 AM Aaron Marcuse-Kubitza

Added mtime

6967 12/20/2012 09:29 AM Aaron Marcuse-Kubitza

lib/common.Makefile: System: Added $(mtime)

6966 12/20/2012 09:27 AM Aaron Marcuse-Kubitza

lib/common.Makefile: $(date): Factored date format out into $(dateFmt)

6965 12/20/2012 09:25 AM Aaron Marcuse-Kubitza

backups/Makefile: Factored $(isMac) out into lib/common.Makefile

6964 12/20/2012 08:30 AM Aaron Marcuse-Kubitza

README.TXT: Data import: tailing logs: Updated pattern for new log filename format

6963 12/19/2012 02:02 PM Aaron Marcuse-Kubitza

schemas/Makefile: Installation: %/publish: Fixed bug where need quotes around source schema name

6962 12/19/2012 01:57 PM Aaron Marcuse-Kubitza

README.TXT: Data import: Moved deletion previous imports before the import, so that full DB backup can be automated

6961 12/19/2012 01:55 PM Aaron Marcuse-Kubitza

README.TXT: Data import: `make backups/vegbien.$version.backup/test`: Added --exclude-schema=public to leave out the previous (now published) import so it doesn't bloat the backup. Note that public is included in the vegbien.$version.backup for the previous import, named according to its version.

6960 12/19/2012 01:49 PM Aaron Marcuse-Kubitza

import_all: after_import(): Added `make backups/TNRS.backup-remake`

6959 12/19/2012 01:46 PM Aaron Marcuse-Kubitza

README.TXT: Data import: Added step to publish the import to the public schema

6958 12/19/2012 01:42 PM Aaron Marcuse-Kubitza

import_all: after_import(): Added export_analytical_db

6957 12/19/2012 01:36 PM Aaron Marcuse-Kubitza

README.TXT: Data import: bin/export_analytical_db: Removed `env public=$version` because export_analytical_db now uses $version as $public when provided

6956 12/19/2012 01:35 PM Aaron Marcuse-Kubitza

README.TXT: Data import: To remake analytical DB: Removed `env public=...` because $version (which replaces $public) is now set automatically by import_all

6955 12/19/2012 01:32 PM Aaron Marcuse-Kubitza

schemas/Makefile: Installation: py_functions/install: Removed `env public=`, which is not needed since $(psqlAsAdminVegbien) does not use psql_script_vegbien (which uses $public)

6954 12/19/2012 01:28 PM Aaron Marcuse-Kubitza

export_analytical_db: Use vegbien_dest to set the default value for $public

6953 12/19/2012 01:21 PM Aaron Marcuse-Kubitza

README.TXT: Data import: If many inputs have errors: Updated command to `make schemas/$version/uninstall` because the current import's schema is now named $version

6952 12/19/2012 01:15 PM Aaron Marcuse-Kubitza

schemas/Makefile: Installation: $(schemas), $(schemasReversed) (used e.g. by `make schemas/reinstall`): Removed public so that when `make schemas/reinstall` is run before an import, it will not remove any active (published) import which resides in the public schema

6951 12/19/2012 01:10 PM Aaron Marcuse-Kubitza

README.TXT: Schema changes: Reinstall public separately from the other schemas so that it will still be reinstalled when schemas/reinstall excludes the public schema to avoid removing any active (published) import

6950 12/19/2012 01:01 PM Aaron Marcuse-Kubitza

vegbien_dest callers: Removed no longer needed explicit setting $prefix to "", because this is now the default value

6949 12/19/2012 01:00 PM Aaron Marcuse-Kubitza

vegbien_dest: Changed default $prefix to "", so that the majority of callers don't need to manually set $prefix to "" to avoid it defaulting to out_

6948 12/19/2012 12:45 PM Aaron Marcuse-Kubitza

README.TXT: Data import: Use env var $version, which is now set by import_all, instead of manually inserting the version for <version>

6947 12/19/2012 12:40 PM Aaron Marcuse-Kubitza

vegbien_dest: Also export $version

6946 12/19/2012 12:30 PM Aaron Marcuse-Kubitza

import_all: Run the import directly into a new, already-versioned public schema. This removes the need to manually rename the schema after import, and allows the backup commands to use the stored $version shell variable to refer to the last import.

6945 12/19/2012 12:25 PM Aaron Marcuse-Kubitza

schemas/Makefile: %/publish: Added instruction to run `unset version` after the command, to clear the $version shell variable which will be set by import_all

6944 12/19/2012 12:12 PM Aaron Marcuse-Kubitza

README.TXT: Data import: Replaced <import_name> with <version> because the import name is now just the version

6943 12/19/2012 12:10 PM Aaron Marcuse-Kubitza

README.TXT: Data import: Replaced r<revision> with <version> because the version string is now equal to r<revision>

6942 12/19/2012 12:09 PM Aaron Marcuse-Kubitza

README.TXT: Backups: Replaced <date> with <version> because the date is no longer included in the version string

6941 12/19/2012 12:08 PM Aaron Marcuse-Kubitza

README.TXT: Name archived imports without the "public." prefix so that their backups will work with the new `make backups/%.backup/remove` command, which does not add back the prefix

6940 12/19/2012 11:56 AM Aaron Marcuse-Kubitza

backups/Makefile; $(public*): Don't add a "public." prefix to get the name of the public schema

6939 12/19/2012 11:40 AM Aaron Marcuse-Kubitza

backups/Makefile: Removed no longer used $(rmSchema)

6938 12/19/2012 11:39 AM Aaron Marcuse-Kubitza

backups/Makefile: Use \$(rmSchemaCmd) from lib/common.Makefile instead of \$(rmSchema)

6937 12/19/2012 11:20 AM Aaron Marcuse-Kubitza

vegbien_dest: Use $version as $public when $public not provided. When neither is provided, continue to use "public" and also set $version to that.

6936 12/19/2012 11:11 AM Aaron Marcuse-Kubitza

schemas/Makefile: Installation: rotate: Use just the version, without the "public." prefix

6935 12/19/2012 11:04 AM Aaron Marcuse-Kubitza

schemas/Makefile: Installation: `public/install public%/install`: Generalized to /install to allow public schema versions with any name. This requires moving `/install: %.sql` before it to override it.

6934 12/19/2012 11:00 AM Aaron Marcuse-Kubitza

schemas/Makefile: Installation: Merged public/install and public%/install