/ - Changes - BIEN 3 - NCEAS Projects

root @ 6133

#	Date	Author	Comment
6133	11/13/2012 04:28 PM	Aaron Marcuse-Kubitza	schemas/vegbien.sql: Removed VegBank-internal fields starting with d_
6132	11/13/2012 04:19 PM	Aaron Marcuse-Kubitza	schemas/vegbien.ERD.mwb: Moved tables so commclass would have more room. Moved revision back to original spot.
6131	11/13/2012 04:07 PM	Aaron Marcuse-Kubitza	schemas/filter_ERD.csv: Display referencecontributor->party connection in ERD
6130	11/13/2012 03:56 PM	Aaron Marcuse-Kubitza	schemas/vegbien.sql: Removed no longer used table referenceparty
6129	11/13/2012 03:54 PM	Aaron Marcuse-Kubitza	schemas/vegbien.sql: referencecontributor: Point to party instead of referenceparty, which duplicates party
6128	11/13/2012 03:51 PM	Aaron Marcuse-Kubitza	schemas/vegbien.sql: party: Added new suffix field to party_unique unique index
6127	11/13/2012 03:49 PM	Aaron Marcuse-Kubitza	schemas/vegbien.sql: party: Added fields from referenceparty. Note that referenceparty.type is named partytype.
6126	11/13/2012 03:25 PM	Aaron Marcuse-Kubitza	inputs/SALVIAS/salvias_plots.~.clean_up.sql: Rename lookup_MethodCode.Description to lookup_MethodCode_Description to make it globally unique when joined with plotMetadata
6125	11/13/2012 03:24 PM	Aaron Marcuse-Kubitza	input.Makefile: SVN: $(svnFilesGlob): Added root-level .sql files containing ~, which run additional commands after the original data is imported
6124	11/13/2012 03:22 PM	Aaron Marcuse-Kubitza	inputs/SALVIAS/_MySQL/: Updated svn:ignore from running `make inputs/SALVIAS/add`
6123	11/13/2012 02:30 PM	Aaron Marcuse-Kubitza	mappings/VegCore-VegBIEN.csv: matched place's coordinates: Fixed bug where coordinates entry itself needed to have its datasource (reference) set to geoscrub, in addition to the place entry that uses it, in order to match up properly with geoscrub's corresponding input place (whose coordinates as well as place are owned by the geoscrub datasource)
6122	11/13/2012 02:22 PM	Aaron Marcuse-Kubitza	mappings/VegCore-VegBIEN.csv: matched place's coordinates: Fixed bug where coordinates mappings with and without matched_place_id=0 need to sort together in order to be merged, by prepending ".," to the place attrs list
6121	11/13/2012 02:22 PM	Aaron Marcuse-Kubitza	inputs/VegBank/plot_/test.xml.ref: Updated inserted row count
6120	11/13/2012 12:00 PM	Aaron Marcuse-Kubitza	inputs/import.stats.xls: Updated import times
6119	11/09/2012 08:30 PM	Aaron Marcuse-Kubitza	Regenerated vegbien.ERD exports
6118	11/09/2012 08:20 PM	Aaron Marcuse-Kubitza	inputs/Makefile: Input data: $(rsyncLogs): Also include logs from the datasource's top-level logs/ dir, which contains make_analytical_db.log.sql
6117	11/09/2012 08:09 PM	Aaron Marcuse-Kubitza	inputs/VegBank/vegbank.~.clean_up.sql: Remove still-embargoed plots
6116	11/09/2012 08:07 PM	Aaron Marcuse-Kubitza	inputs/VegBank/vegbank.~.clean_up.sql: Enable cascading deletes by setting all foreign keys to ON DELETE CASCADE
6115	11/09/2012 07:49 PM	Aaron Marcuse-Kubitza	Added inputs/VegBank/_src/vegbank.schema.sql.make and vegbank.schema.sql
6114	11/09/2012 07:48 PM	Aaron Marcuse-Kubitza	input.Makefile: Staging tables installation: sql/install: Use new pg_dump_limit to remove security and schema-setting commands
6113	11/09/2012 07:46 PM	Aaron Marcuse-Kubitza	Added pg_dump_limit to filter a PostgreSQL DB dump to remove security and schema-setting commands
6112	11/09/2012 06:37 PM	Aaron Marcuse-Kubitza	inputs/.geoscrub/geoscrub_cleaned_unique/create.sql: Removed no longer needed index on latitudeDecimalVerbatim, longitudeDecimalVerbatim, which is now on geoscrub_cultivated instead
6111	11/09/2012 06:32 PM	Aaron Marcuse-Kubitza	schemas/vegbien.sql: analytical_stem_view: Fixed bug where needed to join on new geoscrub_cultivated, not geoscrub, for all geoscrub-related information. geoscrub contains many duplicate records, causing one input row to match many rows in geoscrub, when there should only be one entry for each coordinate pair.
6110	11/09/2012 06:26 PM	Aaron Marcuse-Kubitza	Added inputs/.geoscrub/geoscrub_cultivated/
6109	11/09/2012 06:04 PM	Aaron Marcuse-Kubitza	inputs/.geoscrub/geoscrub_cleaned_unique/create.sql: Added index on latitudeDecimalVerbatim, longitudeDecimalVerbatim for use by analytical_stem_view
6108	11/09/2012 05:34 PM	Aaron Marcuse-Kubitza	inputs/newWorld/geoscrub.schema.~.changes.sql: Change countryNameStd type to text to allow merge-joining with place.country in analytical_stem_view
6107	11/09/2012 05:28 PM	Aaron Marcuse-Kubitza	inputs/newWorld/geoscrub.schema.~.changes.sql: ALTER TABLE ... ALTER COLUMN statement: Reformatted to allow adding additional ALTER COLUMN clauses
6106	11/09/2012 05:25 PM	Aaron Marcuse-Kubitza	inputs/.geoscrub/geoscrub_cleaned_unique/create.sql: Change latitudeDecimalVerbatim, longitudeDecimalVerbatim types to double precision to allow merge-joining with coordinates.latitude_deg, longitude_deg in analytical_stem_view
6105	11/09/2012 05:12 PM	Aaron Marcuse-Kubitza	README.TXT: Data import: Instead of using `make schemas/rotate` and then renaming the public schema to the correct name, just rename directly to the correct name using `make schemas/rename/...`. Use new import_name to determine the import name instead of manually finding the date in the first datasource's log file name.
6104	11/09/2012 05:06 PM	Aaron Marcuse-Kubitza	Added import_name, which gets the name of an import based on its log file names
6103	11/09/2012 04:50 PM	Aaron Marcuse-Kubitza	README.TXT: Data import: Moved checking that imports were successful before running make_analytical_db
6102	11/09/2012 04:41 PM	Aaron Marcuse-Kubitza	root Makefile: Installation: Fixed bug where schemas/install needed to happen after inputs/install because some of the public schema's views now depend on inputs
6101	11/09/2012 04:07 PM	Aaron Marcuse-Kubitza	schemas/vegbien.sql: analytical_stem_view: cultivatedBasis: Concatenate ''::text to geoscrub.isCultivatedReason so it will be cast to a text field both on PostgreSQL 9.1.1 (local machine), which removes any explicit cast to text when creating the view, and 9.1.6 (vegbiendev), which requires an explicit cast to text
6100	11/09/2012 03:49 PM	Aaron Marcuse-Kubitza	schemas/vegbien.sql: analytical_stem_view: cultivatedBasis: Use geoscrub.isCultivatedReason instead when geoscrub.isCultivated is used as the source for cultivated
6099	11/09/2012 12:53 PM	Aaron Marcuse-Kubitza	schemas/vegbien.sql: analytical_stem_view: Use geoscrub.isCultivated when taxonoccurrence.iscultivated is not provided (joining to geoscrub on the coordinates)
6098	11/08/2012 06:38 PM	Aaron Marcuse-Kubitza	root Makefile: VegBIEN DB: Schemas: Run all schema installs and uninstalls using no_search_path=1, so that the schemas in the automatic search_path are not required for the command to run
6097	11/08/2012 06:37 PM	Aaron Marcuse-Kubitza	psql_vegbien: Added $no_search_path option to turn off the automatic SET search_path directive
6096	11/08/2012 06:11 PM	Aaron Marcuse-Kubitza	schemas/vegbien.sql: taxonverbatim: Added growthform for growthform based on the taxon name rather than provided with the input data's taxonoccurrence
6095	11/08/2012 06:00 PM	Aaron Marcuse-Kubitza	schemas/vegbien.ERD.mwb: Fixed lines
6094	11/08/2012 05:47 PM	Aaron Marcuse-Kubitza	inputs/SALVIAS/plotMetadata/: LEFT JOINed with lookup_MethodCode to create plotMetadata_
6093	11/08/2012 04:52 PM	Aaron Marcuse-Kubitza	schemas/vegbien.sql: threatened_taxonlabel_view: Fixed bug where needed DISTINCT on resulting taxonlabel_id because some descendants apparently appear in multiple threatened taxonlabels' subtrees
6092	11/08/2012 04:42 PM	Aaron Marcuse-Kubitza	schemas/vegbien.sql: analytical_*: Added threatened, using new threatened_taxonlabel lookup table
6091	11/08/2012 04:12 PM	Aaron Marcuse-Kubitza	schemas/vegbien.sql: reference_by_shortname(): Fixed bug where need to use $-syntax to reference params in sql-language functions
6090	11/08/2012 04:07 PM	Aaron Marcuse-Kubitza	schemas/vegbien.sql: threatened_taxonlabel_view: Use new reference_by_shortname()
6089	11/08/2012 03:45 PM	Aaron Marcuse-Kubitza	root Makefile: VegBIEN DB: Schemas: public: schemas/public/uninstall: Fixed bug where need to run psql_vegbien without public in the search_path because it may have already been deleted
6088	11/08/2012 03:44 PM	Aaron Marcuse-Kubitza	root Makefile: VegBIEN DB: Schemas: public: schemas/public/install: Fixed bug where need to run psql_vegbien without public in the search_path because it doesn't exist, by setting public to the empty string (deleting it)
6087	11/08/2012 03:42 PM	Aaron Marcuse-Kubitza	vegbien_dest: $schemas: Don't include the , before $public if it has been set to the empty string (deleted)
6086	11/08/2012 03:27 PM	Aaron Marcuse-Kubitza	schemas/vegbien.sql: Added reference_by_shortname(). Using this function instead of the manual query should force the query planner to evaluate this expression first, rather than complexly reordering joins to place this nested select as a filter condition.
6085	11/08/2012 03:00 PM	Aaron Marcuse-Kubitza	schemas/vegbien.sql: Added threatened_taxonlabel derived table with generating view threatened_taxonlabel_view
6084	11/08/2012 02:48 PM	Aaron Marcuse-Kubitza	Updated inputs/UNCC/Specimen/test.xml.ref inserted row count
6083	11/08/2012 01:38 PM	Aaron Marcuse-Kubitza	mappings/VegCore.csv: Added threatened
6082	11/08/2012 01:21 PM	Aaron Marcuse-Kubitza	inputs/VegBank/vegbank.~.clean_up.sql: Remove private columns (plot.reallatitude, reallongitude) that should not be publicly visible
6081	11/08/2012 01:13 PM	Aaron Marcuse-Kubitza	inputs/CVS/Organism/map.csv: Removed now-dropped realLatitude, realLongitude
6080	11/08/2012 01:12 PM	Aaron Marcuse-Kubitza	inputs/CVS/Organism/map.csv: Removed now-dropped realLatitude, realLongitude
6079	11/08/2012 01:12 PM	Aaron Marcuse-Kubitza	Added inputs/CVS/Organism/postprocess.sql to drop private realLatitude, realLongitude columns
6078	11/08/2012 01:10 PM	Aaron Marcuse-Kubitza	input.Makefile: Staging tables installation: Added back postprocess.sql, which is now used for one-time dropping of private columns that should not be publicly visible
6077	11/08/2012 12:47 PM	Aaron Marcuse-Kubitza	input.Makefile: Maps building: %/.map.csv.last_cleanup: $(dict) canon/translate: Use new $(translate?)
6076	11/08/2012 12:45 PM	Aaron Marcuse-Kubitza	input.Makefile: Maps building: %/.map.csv.last_cleanup: Added $(srcDict) as a prerequisite, so that .last_cleanup will be re-run if it changes. Added empty $(srcDict) target in case it doesn't exist.
6075	11/08/2012 12:39 PM	Aaron Marcuse-Kubitza	inputs/bien_web/observation/map.csv: Omit *_index because they are placeholder columns created by the MySQL to PostgreSQL translation
6074	11/08/2012 12:37 PM	Aaron Marcuse-Kubitza	input.Makefile: Maps building: %/.map.csv.last_cleanup: Fixed bug where can only canon/translate using $(srcDict) if it exists for the datasource
6073	11/08/2012 12:26 PM	Aaron Marcuse-Kubitza	inputs/bien_web/observation/: Regenerated from actual bien_web.observation schema on nimoy, which has additional columns
6072	11/08/2012 12:24 PM	Aaron Marcuse-Kubitza	input.Makefile: SVN: $(svnFilesGlob): Added top-level map.csv, which can be used to apply a datasource-global data dictionary to all tables
6071	11/08/2012 12:18 PM	Aaron Marcuse-Kubitza	input.Makefile: Maps building: %/.map.csv.last_cleanup: Also apply any map.csv at the top level of the datasource directory. This can be used to apply a datasource-global data dictionary to all tables.
6070	11/08/2012 12:01 PM	Aaron Marcuse-Kubitza	my2pg: Also remove column comments. Note that these cannot be translated by sed, because PostgreSQL only allows setting column comments in a separate statement, not inline with the column's entry in the CREATE TABLE statement, and sed can only make replacements contiguous with the input line.
6069	11/08/2012 11:28 AM	Aaron Marcuse-Kubitza	mappings/VegCore.csv: Removed incorrect duplicate entry for verbatimSubgenus
6068	11/08/2012 10:58 AM	Aaron Marcuse-Kubitza	schemas/vegbien.sql: _taxon_family_require_std(): Fixed bug where name needed to be lowercased before checking if it ended in -aceae, to support family names that are uppercase. Note that this resulted in the family not being prepended to the TNRS input name for datasources with uppercase family names, so the next DB import will likely produce a number of unscrubbed TNRS input names which now have the uppercase family prepended.
6067	11/08/2012 10:17 AM	Aaron Marcuse-Kubitza	inputs/.TNRS/tnrs/tnrs.make: Fixed bug where need to reference the log file path relative to the make script itself, because otherwise the log file would go in inputs/.TNRS/logs/tnrs.make.log.sql
6066	11/08/2012 10:07 AM	Aaron Marcuse-Kubitza	inputs/.TNRS/tnrs/tnrs.make: Fixed bug where need to use just the basename of $0 as the log file name
6065	11/08/2012 09:51 AM	Aaron Marcuse-Kubitza	Added inputs/IUCN/
6064	11/08/2012 09:51 AM	Aaron Marcuse-Kubitza	input.Makefile: SVN: add: Added _src/ (when it exists). $(_svnFilesGlob): Added .url, .pdf files in _src/.
6063	11/08/2012 07:47 AM	Aaron Marcuse-Kubitza	psql_vegbien: Use bash because it supports substitutions
6062	11/08/2012 07:46 AM	Aaron Marcuse-Kubitza	psql_vegbien: Set the search_path to $out_schemas set by vegbien_dest. This will enable running any psql_vegbien script on a schema other than public.
6061	11/08/2012 07:07 AM	Aaron Marcuse-Kubitza	schemas/vegbien.sql: analytical_stem_view: Changed inner joins on non-datasource taxonlabels to LEFT JOINs, to ensure that an entry is included in the analytical DB even if there was no matched taxonlabel. In theory, this shouldn't be necessary, because every taxonlabel should have a canonical taxonlabel since canon_label_id is auto-populated to the taxonlabel_id if there is no matched_label_id; there should be a taxonverbatim for every datasource and accepted taxonlabel because datasources link to taxonlabel via taxonverbatim and TNRS populates a taxonverbatim for every accepted taxonlabel; and there should be a parsed taxonlabel for every datasource taxonlabel because the mappings populate it.
6060	11/08/2012 06:56 AM	Aaron Marcuse-Kubitza	schemas/vegbien.sql: analytical_stem_view: Removed join on specimenreplicate, because it isn't used in the analytical DB. Each specimen will still get an entry in analytical_*, because it gets its own location.
6059	11/08/2012 06:45 AM	Aaron Marcuse-Kubitza	README.TXT: Data import: Before starting import, added step to run `make inputs/upload` and reinstall newly-uploaded datasources
6058	11/08/2012 03:22 AM	Aaron Marcuse-Kubitza	README.TXT: Schema changes: Remember to update the following files with any renamings: Added mappings/verify.*.sql
6057	11/08/2012 02:21 AM	Aaron Marcuse-Kubitza	README.TXT: Data import: make_analytical_db: Documented how to view progress in log file
6056	11/08/2012 02:18 AM	Aaron Marcuse-Kubitza	make_analytical_db: Run all commands synchronously so the log file output doesn't become jumbled
6055	11/08/2012 02:16 AM	Aaron Marcuse-Kubitza	make_analytical_db: Fixed bug where log file needed to be appended to instead of overwritten
6054	11/08/2012 02:15 AM	Aaron Marcuse-Kubitza	make_analytical_db: Wrap each individual command in `set -x` to avoid echoing low-level commands such as sleep, wait
6053	11/08/2012 02:02 AM	Aaron Marcuse-Kubitza	make_analytical_db: Moved log file to inputs/analytical_db/logs/make_analytical_db.log.sql so it would be synced along with the other import logs
6052	11/08/2012 01:57 AM	Aaron Marcuse-Kubitza	inputs/.TNRS/tnrs/tnrs.make: Output the time at which it's run, so this is included in the log file
6051	11/08/2012 01:53 AM	Aaron Marcuse-Kubitza	inputs/.TNRS/tnrs/tnrs.make: Moved log file to logs/tnrs.make.log.sql so it would automatically be synced along with the other import logs
6050	11/08/2012 01:49 AM	Aaron Marcuse-Kubitza	make_analytical_db: Moved log file to inputs/analytical_db/logs/make_analytical_db.log.sql so it would be synced along with the other import logs
6049	11/08/2012 01:40 AM	Aaron Marcuse-Kubitza	inputs/Makefile: Import logs: $(rsyncLogs): Always download the TNRS daemon log, rather than requiring tnrs_log=1 to be specified to download it
6048	11/08/2012 01:37 AM	Aaron Marcuse-Kubitza	make_analytical_db: Output the time at which it's run, so this is included in the log file
6047	11/08/2012 01:36 AM	Aaron Marcuse-Kubitza	make_analytical_db: Store output in schemas/make_analytical_db.log
6046	11/08/2012 01:24 AM	Aaron Marcuse-Kubitza	schemas/vegbien.sql: Removed no longer used make_analytical_db(). Use bin/make_analytical_db instead.
6045	11/08/2012 01:23 AM	Aaron Marcuse-Kubitza	make_analytical_db: Use new psql_verbose_vegbien
6044	11/08/2012 01:22 AM	Aaron Marcuse-Kubitza	Added psql_verbose_vegbien
6043	11/08/2012 01:18 AM	Aaron Marcuse-Kubitza	make_analytical_db: Use psql_script_vegbien, which contains the necessary psq options, instead of setting those options manually
6042	11/08/2012 01:15 AM	Aaron Marcuse-Kubitza	make_analytical_db: Run the SQL commands directly with psql instead of using the make_analytical_db() function. This provides incremental results and avoids running all commands in one transaction, thus preventing pgAdmin from freezing when the user attempts to access a table used in analytical DB creation (because the TRUNCATE statement fully locks the table until the entire analytical DB is built).
6041	11/08/2012 12:46 AM	Aaron Marcuse-Kubitza	schemas/vegbien.sql: make_analytical_db(): Added make_family_higher_plant_group()
6040	11/08/2012 12:17 AM	Aaron Marcuse-Kubitza	inputs/import.stats.xls: Updated import times. Fixed input row counts and import times to include derived data, such as TNRS and geoscrub, which adds to the import time and therefore should be considered in the import's speed. (TNRS was already being included in the import time for some, but not all, imports.)
6039	11/06/2012 05:02 PM	Aaron Marcuse-Kubitza	schemas/vegbien.sql: analytical_*: Added higherPlantGroup
6038	11/06/2012 04:06 PM	Aaron Marcuse-Kubitza	pg_dump_vegbien: When not in data mode, use --inserts so the INSERT statements generated can be run directly in pgAdmin, they can be directly translated to MySQL, and an empty COPY FROM statement isn't generated for every table
6037	11/06/2012 04:05 PM	Aaron Marcuse-Kubitza	schemas/vegbien.sql: Added family_higher_plant_group lookup table, generated with make_family_higher_plant_group() using the contents of higher_plant_group_nodes
6036	11/06/2012 03:40 PM	Aaron Marcuse-Kubitza	schemas/vegbien.sql: Added family_higher_plant_group lookup table, generated with make_family_higher_plant_group() using the contents of higher_plant_group_nodes
6035	11/06/2012 03:23 PM	Aaron Marcuse-Kubitza	mappings/VegCore-VegBIEN.csv: Always map taxonNameOrEpithet to taxonomicname, now that it's globally unique at all ranks in the datasource that provides it (NCBI)
6034	11/06/2012 03:18 PM	Aaron Marcuse-Kubitza	inputs/.NCBI/nodes/create.sql: Make name_txt completely globally unique by removing all duplicates, not just duplicate genera

Project

General

Profile