/ - Changes - BIEN 3 - NCEAS Projects

root @ 6536

#	Date	Author	Comment
6536	12/03/2012 02:47 PM	Aaron Marcuse-Kubitza	backups/Makefile: %.md5: Fixed bug where md5sum does not have a -q option like md5
6535	12/03/2012 02:43 PM	Aaron Marcuse-Kubitza	backups/Makefile: %.md5: Fixed bug where need to use md5sum instead of md5 on Linux
6534	12/03/2012 02:39 PM	Aaron Marcuse-Kubitza	schemas/vegbien.sql: analytical_stem_view: Filter out non-current taxondeterminations (occurrences with no taxondetermination are preserved)
6533	12/03/2012 02:10 PM	Aaron Marcuse-Kubitza	schemas/vegbien.sql: Removed no longer needed darwin_core table. Use analytical_stem instead, which is now identical.
6532	12/03/2012 02:02 PM	Aaron Marcuse-Kubitza	schemas/vegbien.sql: sync_analytical__to_view(): Creating analytical_ table: Fixed bug where need LIMIT 0 so that it can be used on a full DB, which will have data in the tables used by analytical_stem_view
6531	12/03/2012 01:40 PM	Aaron Marcuse-Kubitza	schemas/vegbien.sql: Merged darwin_core into analytical_stem
6530	12/03/2012 01:21 PM	Aaron Marcuse-Kubitza	schemas/vegbien.sql: darwin_core_view, analytical_stem_view: Updated now that newWorldCountries.isoCode is a text field
6529	12/03/2012 12:35 PM	Aaron Marcuse-Kubitza	README.TXT: Data import: backups: Step to copy backups to jupiter: Added full path to aaronmk/ (/data/dev/aaronmk)
6528	12/03/2012 12:00 PM	Aaron Marcuse-Kubitza	inputs/newWorld/geoscrub.schema.~.changes.sql: Reversed order of adding unique constraints and changing types
6527	12/03/2012 11:57 AM	Aaron Marcuse-Kubitza	inputs/newWorld/geoscrub.schema.~.changes.sql: Changed isoCode type to text. Added unique constraint on isoCode.
6526	12/03/2012 11:06 AM	Aaron Marcuse-Kubitza	backups/Makefile: Added md5s target to generate .md5 files for all backups
6525	12/03/2012 11:05 AM	Aaron Marcuse-Kubitza	inputs/import.stats.xls: Updated import times
6524	12/03/2012 10:48 AM	Aaron Marcuse-Kubitza	backups/Makefile: %.md5: Run with `nice -n +5` to avoid slowing down the UI
6523	12/03/2012 10:46 AM	Aaron Marcuse-Kubitza	backups/: svn:ignore: Added .md5. Removed no longer applicable .log.
6522	12/03/2012 10:42 AM	Aaron Marcuse-Kubitza	backups/Makefile: Changed paths to be relative to the Makefile rather than the current directory, so this Makefile can be used in other directories as well (such as jupiter:/aaronmk/VegBIEN.backups/)
6521	12/03/2012 10:34 AM	Aaron Marcuse-Kubitza	backups/Makefile: %.backup: Also create MD5 of backup
6520	12/03/2012 10:31 AM	Aaron Marcuse-Kubitza	backups/Makefile: Added %.md5 target to create checksums of each backup
6519	12/03/2012 10:17 AM	Aaron Marcuse-Kubitza	README.TXT: Data import: backups: Added step to copy backups to jupiter in /aaronmk/VegBIEN.backups/ . The jupiter folder, which has several TB of space available, will replace local backup drives as the location for archived backups.
6518	12/03/2012 10:00 AM	Aaron Marcuse-Kubitza	README.TXT: Data import: Removed additional backup of just the public schema, which is not needed because the public schema is included in the full DB backup. The additional public schema backup increased the total backup size by 60-70%, so this will help conserve limited disk space on vegbiendev as well as on local archives of the backups.
6517	12/03/2012 09:52 AM	Aaron Marcuse-Kubitza	README.TXT: Backups: Full DB: Updated steps to match Data import steps, which add the date to the backup filename when it's created rather than afterwards
6516	12/03/2012 09:42 AM	Aaron Marcuse-Kubitza	README.TXT: Backups: Archived imports: Back up: Added instructions for archiving the last import before backing it up
6515	12/03/2012 09:10 AM	Aaron Marcuse-Kubitza	Regenerated vegbien.ERD exports
6514	12/03/2012 09:08 AM	Aaron Marcuse-Kubitza	schemas/vegbien.sql: analytical_*: Removed NOT NULL constraint on dateCollected
6513	12/03/2012 09:07 AM	Aaron Marcuse-Kubitza	schemas/vegbien.sql: source: Added sampletype field to indicate a plot or specimen datasource
6512	12/03/2012 09:00 AM	Aaron Marcuse-Kubitza	schemas/vegbien.sql: analytical_*: Removed NOT NULL constraint on dateCollected
6511	12/03/2012 08:55 AM	Aaron Marcuse-Kubitza	schemas/vegbien.sql: sync_analytical_*_to_view(): Added NOT NULL constraints
6510	11/30/2012 05:20 PM	Aaron Marcuse-Kubitza	make_analytical_db: Added step to create darwin_core materialized view
6509	11/30/2012 05:09 PM	Aaron Marcuse-Kubitza	inputs/*/Source/map.csv for non-herbaria: Mapped sampleType
6508	11/30/2012 05:02 PM	Aaron Marcuse-Kubitza	inputs/.herbaria/herbaria/map.csv: Set sampleType to "specimen"
6507	11/30/2012 05:02 PM	Aaron Marcuse-Kubitza	mappings/VegCore-VegBIEN.csv: Mapped sampleType
6506	11/30/2012 05:00 PM	Aaron Marcuse-Kubitza	mappings/VegCore.csv: Added sampleType
6505	11/30/2012 04:57 PM	Aaron Marcuse-Kubitza	schemas/vegbien.sql: source: Added sampletype field to indicate a plot or specimen datasource
6504	11/30/2012 04:55 PM	Aaron Marcuse-Kubitza	schemas/vegbien.sql: Added sampletype enum
6503	11/30/2012 04:46 PM	Aaron Marcuse-Kubitza	root Makefile: $(postgresReload-*): Confirm the operation before continuing, since it involves changing PostgreSQL config files in nontrivial ways. Added instructions for setting kernel.shmmax to at least 4GB minus 1 byte on Linux, to work with the shared_buffers setting in postgresql.conf.
6502	11/30/2012 04:03 PM	Aaron Marcuse-Kubitza	schemas/postgresql.conf: shared_buffers: Documented that it must be less than ~95% of SHMMAX
6501	11/30/2012 03:58 PM	Aaron Marcuse-Kubitza	schemas/vegbien.sql: analytical_stem_view: identifiedBy: Fixed bug where need to use party.fullname instead of name components because the name is now mapped to fullname
6500	11/30/2012 03:28 PM	Aaron Marcuse-Kubitza	schemas/vegbien.sql: analytical_stem_view, darwin_core_view: dateCollected: Use the parent plot event's obsstartdate when the subplot event does not have its own obsstartdate
6499	11/30/2012 01:56 PM	Aaron Marcuse-Kubitza	schemas/vegbien.sql: analytical_stem_view: Don't filter out rows without a date or non-current taxondeterminations
6498	11/30/2012 01:54 PM	Aaron Marcuse-Kubitza	schemas/vegbien.sql: analytical_stem_view: Don't filter out rows without a date
6497	11/30/2012 01:28 PM	Aaron Marcuse-Kubitza	schemas/vegbien.sql: Added darwin_core_view
6496	11/30/2012 12:56 PM	Aaron Marcuse-Kubitza	schemas/vegbien.sql: analytical_stem_view: identifiedBy: Fixed bug where need to use party.fullname instead of name components because the name is now mapped to fullname
6495	11/30/2012 12:40 PM	Aaron Marcuse-Kubitza	schemas/vegbien.sql: sync_analytical_*_to_view(): Added CREATE INDEX statements
6494	11/30/2012 12:31 PM	Aaron Marcuse-Kubitza	README.TXT: Data import: Added steps to publish analytical DB on nimoy.bien_web
6493	11/30/2012 10:46 AM	Aaron Marcuse-Kubitza	schemas/vegbien.sql: analytical_stem_view: Changed JOINs to LEFT JOINs to include occurrences without taxondeterminations
6492	11/30/2012 10:21 AM	Aaron Marcuse-Kubitza	export_analytical_db: Use 'NULL' as the NULL value instead of \N, because MySQL has problems with \N
6491	11/30/2012 09:57 AM	Aaron Marcuse-Kubitza	publish_analytical_db: Load to bien3_adb instead of bien_web
6490	11/29/2012 05:41 PM	Aaron Marcuse-Kubitza	README.TXT: Data import: Added step to export analytical DB
6489	11/29/2012 01:11 PM	Aaron Marcuse-Kubitza	root Makefile: $(postgres-Linux): Fixed bug where need $(asAdmin) before commands to rename existing *.conf
6488	11/29/2012 01:01 PM	Aaron Marcuse-Kubitza	root Makefile: $(postgres-Linux): Also install postgresql-contrib, which contains the hstore extension
6487	11/28/2012 06:18 PM	Aaron Marcuse-Kubitza	Added inputs/NVS/
6486	11/28/2012 06:04 PM	Aaron Marcuse-Kubitza	inputs/CVS/Organism/map.csv: Mapped accordingTo to "Weakley 2006"
6485	11/28/2012 06:02 PM	Aaron Marcuse-Kubitza	inputs/NY/Specimen/map.csv: Omit UniqueNYInternalRecordNumber to avoid confusion since this is an internal-only ID. This makes InstitutionCode+CollectionCode+CatalogNumber the globally unique identifier instead.
6484	11/28/2012 06:00 PM	Aaron Marcuse-Kubitza	README.TXT: Added Datasource refreshing section with instructions for refreshing VegBank
6483	11/28/2012 05:57 PM	Aaron Marcuse-Kubitza	schemas/vegbien.sql: Renamed taxonconcept.concept_source_id back to concept_reference_id
6482	11/28/2012 05:52 PM	Aaron Marcuse-Kubitza	schemas/vegbien.sql: Renamed soilobs to soilsample per working group discussion
6481	11/28/2012 05:27 PM	Aaron Marcuse-Kubitza	input.Makefile: SVN: add: verify: Fixed bug where need to use $ prefix before string to parse newline
6480	11/28/2012 05:27 PM	Aaron Marcuse-Kubitza	input.Makefile: SVN: add: verify: Fixed bug where need to use $ prefix before string to parse newline
6479	11/28/2012 05:25 PM	Aaron Marcuse-Kubitza	inputs/NY/verify/: svn:ignore .csv files
6478	11/28/2012 05:25 PM	Aaron Marcuse-Kubitza	input.Makefile: SVN: add: Also svn:ignore .csv files
6477	11/28/2012 02:47 PM	Aaron Marcuse-Kubitza	export_analytical_db: Export NULL as \N to work with MySQL
6476	11/28/2012 01:22 PM	Aaron Marcuse-Kubitza	schemas/vegbien.sql: analytical_*: Added index on NOT NULL columns, starting with institutionCode
6475	11/28/2012 01:19 PM	Aaron Marcuse-Kubitza	schemas/vegbien.sql: analytical_*: Removed primary keys and NOT NULL constraints on columns that sometimes have NULL values
6474	11/28/2012 01:08 PM	Aaron Marcuse-Kubitza	publish_analytical_db: Added CSV dialect information
6473	11/28/2012 12:42 PM	Aaron Marcuse-Kubitza	root Makefile: PostgreSQL: $(postgresReload-): Rename existing .conf to *.conf.old
6472	11/27/2012 06:44 PM	Aaron Marcuse-Kubitza	publish_analytical_db: Use LOAD DATA LOCAL INFILE instead of LOAD DATA INFILE to avoid needing FILE permissions on bien_web
6471	11/27/2012 01:17 PM	Aaron Marcuse-Kubitza	Added publish_analytical_db
6470	11/27/2012 12:43 PM	Aaron Marcuse-Kubitza	export_analytical_db: Append the public schema version to the CSV filename
6469	11/27/2012 12:27 PM	Aaron Marcuse-Kubitza	backups/Makefile: $(rsyncBackups): Added *.csv
6468	11/26/2012 06:12 PM	Aaron Marcuse-Kubitza	Added export_analytical_db
6467	11/26/2012 06:10 PM	Aaron Marcuse-Kubitza	backups/: Ignore _* and *.csv
6466	11/26/2012 01:35 PM	Aaron Marcuse-Kubitza	make_analytical_db: mk_analytical_table(): Use explicit schema references everywhere. This fixes a bug where the TRUNCATE/INSERT steps on the public schema's table would reference the analytical_db view instead because they were not schema-scoped.
6465	11/26/2012 01:33 PM	Aaron Marcuse-Kubitza	make_analytical_db: mk_analytical_table(): Factored table references in different schemas out into vars
6464	11/25/2012 09:31 PM	Aaron Marcuse-Kubitza	schemas/vegbien.sql: analytical_stem_view: recordNumber: Combine identifying fields in taxonoccurrence, plantobservation, and stemobservation to ensure that this field is unique within the plot and not NULL
6463	11/25/2012 09:13 PM	Aaron Marcuse-Kubitza	Regenerated vegbien.ERD exports
6462	11/25/2012 08:52 PM	Aaron Marcuse-Kubitza	make_analytical_db: Moved set -x () around just psql_verbose_vegbien so embedded $() expressions wouldn't also be in set -x (verbose) mode
6461	11/25/2012 08:49 PM	Aaron Marcuse-Kubitza	make_analytical_db: Fixed bug where need to use bash instead of sh because vegbien_dest requires it
6460	11/25/2012 08:37 PM	Aaron Marcuse-Kubitza	make_analytical_db: Factored analytical_* table creation code out into mk_analytical_table() function
6459	11/25/2012 08:28 PM	Aaron Marcuse-Kubitza	make_analytical_db: Create analytical_db views pointing to the analytical_* versions in the public schema
6458	11/25/2012 08:21 PM	Aaron Marcuse-Kubitza	vegbien_dest: $schemas: Removed analytical_db because views that will be added to it were shadowing public schema tables with the same names during population of those tables in make_analytical_db
6457	11/25/2012 07:47 PM	Aaron Marcuse-Kubitza	vegbien_dest: Export $public, to make sure it's available to any invoked scripts as an env var
6456	11/25/2012 07:45 PM	Aaron Marcuse-Kubitza	vegbien_dest: $schemas: Added analytical_db
6455	11/25/2012 07:38 PM	Aaron Marcuse-Kubitza	inputs/import.stats.xls: Added separate tab with stats for 2012-6~9. The Excel format apparently only supports 255 columns, so previous imports had been silently truncated off. Note that once the 2012-10 imports reach column 255, a new tab will need to be created with the 2012-10+ imports.
6454	11/25/2012 07:20 PM	Aaron Marcuse-Kubitza	bin/map: in_is_db: by_col: Clearing errors table: Skip this if the table has been set to None because it didn't exist (and thus was a metadata-only map spreadsheet)
6453	11/25/2012 06:54 PM	Aaron Marcuse-Kubitza	schemas/vegbien.sql: analytical_stem_view: scientificNameWithMorphospecies: Fixed bug where need to use the specific_epithet from the accepted_taxonverbatim rather than the parsed_taxonverbatim
6452	11/25/2012 06:45 PM	Aaron Marcuse-Kubitza	schemas/vegbien.sql: analytical_stem_view: scientificNameWithMorphospecies: Include the family any time the genus is not specified, instead of just when accepted_taxonlabel.rank = 'family'. These should have the same effect since TNRS includes the rank, but using COALESCE is clearer.
6451	11/25/2012 06:41 PM	Aaron Marcuse-Kubitza	schemas/vegbien.sql: analytical_stem_view: scientificNameWithMorphospecies: Changed to also include morphospecies when just the family is specified
6450	11/25/2012 06:35 PM	Aaron Marcuse-Kubitza	schemas/vegbien.sql: analytical_stem_view: Fixed bug where location.authorlocationcode needed to be used as the plotName when location.sourceaccessioncode was not provided, to ensure that plotName would be NOT NULL
6449	11/25/2012 06:20 PM	Aaron Marcuse-Kubitza	inputs/FIA/import_order.txt: Fixed bug where FIA_COND_unique needed to be explicitly included in import_order.txt now that we're using import_order.txt to import the Source metadata table before the data tables
6448	11/25/2012 06:15 PM	Aaron Marcuse-Kubitza	inputs/import.stats.xls: Updated import times
6447	11/24/2012 03:07 PM	Aaron Marcuse-Kubitza	root Makefile: PostgreSQL: $(postgresReload-Linux): Try chmoding both as your user and as the bien user
6446	11/24/2012 02:46 PM	Aaron Marcuse-Kubitza	input.Makefile: Testing: $(runTest): Ignore failed diffs when the test is compared to another test's output (e.g. in by_col mode)
6445	11/24/2012 02:41 PM	Aaron Marcuse-Kubitza	bin/map: in_is_db: If table does not exist, set table to None so that db_xml.put_table() doesn't try to access it. This fixes a bug in metadata-only map spreadsheets under column-based import.
6444	11/24/2012 02:40 PM	Aaron Marcuse-Kubitza	db_xml.py: put_table(): Support None in_table by calling put() directly
6443	11/24/2012 02:29 PM	Aaron Marcuse-Kubitza	Removed no longer used geoscrub.*.sql. Use geoscrub_output instead.
6442	11/24/2012 02:27 PM	Aaron Marcuse-Kubitza	Removed no longer used geoscrub_cleaned_unique. Use geoscrub_output instead.
6441	11/24/2012 02:25 PM	Aaron Marcuse-Kubitza	Removed no longer used geoscrub_cultivated. Use analytical_stem_view.cultivated instead.
6440	11/24/2012 02:25 PM	Aaron Marcuse-Kubitza	Removed no longer used geoscrub_cultivated. Use analytical_stem_view.cultivated instead.
6439	11/24/2012 02:23 PM	Aaron Marcuse-Kubitza	schemas/vegbien.sql: analytical_stem_view: cultivated: Removed BIEN2's geoscrub_cultivated, which has now been replaced by the primary corresponding scripts (and never had particularly many matches to the locations in any case)
6438	11/24/2012 02:14 PM	Aaron Marcuse-Kubitza	schemas/vegbien.sql: analytical_stem_view: cultivated: Use OR instead of _or() to combine cultivated_family_locations.country IS NOT NULL with the other values, because this field's false value should not be used in place of NULL if all the other values are NULL, as it would be with _or(). (cultivated_family_locations.country IS NOT NULL can indicate presence, but not absence, of cultivated status.)
6437	11/24/2012 02:06 PM	Aaron Marcuse-Kubitza	schemas/functions.sql, vegbien.sql: _and(), _or(): Added comment comparing the function and the corresponding logical operator

Project

General

Profile