/trunk - Changes - BIEN 3 - NCEAS Projects

root/trunk @ 12965

svn:ignore: extern

#	Date	Author	Comment
12965	03/28/2014 07:10 AM	Aaron Marcuse-Kubitza	validation/aggregating/specimens/qualitative_validations_specimens.sql: use taxonoccurrence instead of location as the table that all specimens should have, as decided in the 2014-03-27 conference call (wiki.vegpath.org/2014-03-27_conference_call#aggregating-validations)
12964	03/28/2014 07:03 AM	Aaron Marcuse-Kubitza	lib/runscripts/util.run: support conventional main() method as well as `all` target
12963	03/28/2014 02:39 AM	Aaron Marcuse-Kubitza	fix: inputs///map.csv: remapped occurrenceID-mapped fields to dataProviderRecordID when these were not globally unique DwC occurrenceIDs (http://rs.tdwg.org/dwc/terms/#occurrenceID)
12962	03/28/2014 02:34 AM	Aaron Marcuse-Kubitza	fix: inputs/CTFS/AggregateObservation/map.csv: field mapped to occurrenceID: remapped to aggregateOrganismObservationID because these are not specimen occurrences
12961	03/28/2014 02:32 AM	Aaron Marcuse-Kubitza	fix: mappings/VegCore-VegBIEN.csv: taxonoccurrence.sourceaccessioncode: need to populate from aggregateOrganismObservationID when only that is available
12960	03/28/2014 02:03 AM	Aaron Marcuse-Kubitza	bugfix: inputs/NY/Ecatalog_all/map.csv: can't use CatalogNumber as pkey because it's not unique and not always populated. this fixes the NY NULL accessionNumbers bug (wiki.vegpath.org/Aggregating_validations_status#bugs).
12959	03/28/2014 01:31 AM	Aaron Marcuse-Kubitza	/README.TXT: moved "to back up e-mails" and "to back up the version history" before settings backup so that the local backup of these is up to date when everything gets backed up
12958	03/28/2014 01:29 AM	Aaron Marcuse-Kubitza	inputs/XAL/Specimen/header.csv: updated
12957	03/28/2014 12:45 AM	Aaron Marcuse-Kubitza	/README.TXT: to synchronize vegbiendev, jupiter, and your local machine: backups/TNRS.backup: do this before the general sync so that any reverse sync that's needed won't include it
12956	03/28/2014 12:44 AM	Aaron Marcuse-Kubitza	/README.TXT: to synchronize vegbiendev, jupiter, and your local machine: backups/TNRS.backup: use bin/sync_upload now that this works for rsync-ignored files
12955	03/28/2014 12:36 AM	Aaron Marcuse-Kubitza	bugfix: lib/sh/sync.sh: don't unintentionally rsync-ignore explicitly-specified files
12954	03/28/2014 12:32 AM	Aaron Marcuse-Kubitza	lib/sh/util.sh: filesystem: added is_(), could_be_()
12953	03/28/2014 12:31 AM	Aaron Marcuse-Kubitza	lib/sh/util.sh: added contains_match()
12952	03/28/2014 12:31 AM	Aaron Marcuse-Kubitza	lib/sh/util.sh: added ends_with()
12951	03/27/2014 11:13 PM	Aaron Marcuse-Kubitza	fix: /README.TXT: to synchronize vegbiendev, jupiter, and your local machine: run `up` on all machines, not just jupiter, because all must be up-to-date to avoid extraneous diffs
12950	03/27/2014 11:11 PM	Aaron Marcuse-Kubitza	bugfix: /README.TXT: to synchronize vegbiendev, jupiter, and your local machine: `svn up` on jupiter: need to use up alias because that adds --force
12949	03/27/2014 11:10 PM	Aaron Marcuse-Kubitza	bugfix: /README.TXT: to synchronize vegbiendev, jupiter, and your local machine: added `svn up` on jupiter: needs to be in main dir (~/bien), not ~/Dropbox/svn/
12948	03/27/2014 11:08 PM	Aaron Marcuse-Kubitza	/README.TXT: to synchronize vegbiendev, jupiter, and your local machine: added `svn up` on jupiter to avoid extraneous diffs when rsyncing
12947	03/27/2014 10:41 AM	Aaron Marcuse-Kubitza	planning/workflow/bien3_architecture/stage_I.png, stages.png: synced to bien3_architecture.pptx
12946	03/27/2014 10:32 AM	Aaron Marcuse-Kubitza	planning/workflow/bien3_architecture.pptx: stage I: clarified that the database input is intended to be a normalized input, and its corresonding output is intended to be denormalized
12945	03/27/2014 10:29 AM	Aaron Marcuse-Kubitza	validation/aggregating/specimens/NY/qualitative_validations_source_db_NYBG.VegCore.sql: stage I: clarified that the database input is intended to be a normalized input, and its corresonding output is intended to be denormalized
12944	03/27/2014 09:02 AM	Aaron Marcuse-Kubitza	bugfix: validation/aggregating/specimens/NY/qualitative_validations_source_db_NYBG.VegCore.sql: _specimens_16_list_distinct_specimen_descriptions: should use DISTINCT
12943	03/27/2014 09:01 AM	Aaron Marcuse-Kubitza	validation/aggregating/specimens/qualitative_validations_specimens.sql: implemented _specimens_16_list_distinct_specimen_descriptions
12942	03/27/2014 09:00 AM	Aaron Marcuse-Kubitza	validation/aggregating/specimens/qualitative_validations_specimens.sql: implemented _specimens_16_list_distinct_specimen_descriptions
12941	03/27/2014 08:53 AM	Aaron Marcuse-Kubitza	validation/aggregating/specimens/qualitative_validations_specimens.sql: implemented _specimens_15_list_distinct_locality_descriptions
12940	03/27/2014 08:48 AM	Aaron Marcuse-Kubitza	validation/aggregating/specimens/qualitative_validations_specimens.sql: implemented _specimens_09_list_of_unique_verbatim_author_taxa_with_genus
12939	03/27/2014 08:47 AM	Aaron Marcuse-Kubitza	validation/aggregating/specimens/qualitative_validations_specimens.sql: implemented _specimens_08_count_of_unique_verbatim_author_taxa_with_genus
12938	03/27/2014 08:36 AM	Aaron Marcuse-Kubitza	validation/aggregating/specimens/qualitative_validations_specimens.sql: implemented _specimens_05_list_of_verbatim_species_excluding_author
12937	03/27/2014 08:35 AM	Aaron Marcuse-Kubitza	validation/aggregating/specimens/qualitative_validations_specimens.sql: implemented _specimens_04_count_of_unique_verbatim_species_without_author
12936	03/27/2014 08:23 AM	Aaron Marcuse-Kubitza	validation/aggregating/specimens/qualitative_validations_specimens.sql: implemented _specimens_03_list_of_verbatim_families
12935	03/27/2014 08:18 AM	Aaron Marcuse-Kubitza	validation/aggregating/specimens/qualitative_validations_specimens.sql: implemented _specimens_02_count_of_unique_verbatim_families
12934	03/27/2014 08:06 AM	Aaron Marcuse-Kubitza	schemas/vegbien.ERD.mwb: regenerated exports
12933	03/27/2014 08:04 AM	Aaron Marcuse-Kubitza	schemas/vegbien.sql: public_validations: added _specimens_01_count_of_total_records_specimens_in_source_db
12932	03/27/2014 07:35 AM	Aaron Marcuse-Kubitza	validation/aggregating/specimens/qualitative_validations_specimens.sql: implemented _specimens_01_count_of_total_records_specimens_in_source_db
12931	03/27/2014 07:34 AM	Aaron Marcuse-Kubitza	validation/aggregating/specimens/qualitative_validations_specimens.sql: added config statements for datasource and query planner
12930	03/27/2014 05:06 AM	Aaron Marcuse-Kubitza	web/links/index.htm: updated to Firefox bookmarks: Firefox: added instructions for enabling security.password_lifetime and making all tabs load when the browser is opened
12929	03/27/2014 04:43 AM	Aaron Marcuse-Kubitza	/README.TXT: Schema changes: manually apply schema changes to the live public schema: moved under "update mappings and staging table column names" because this is a necessary part of that step
12928	03/27/2014 04:43 AM	Aaron Marcuse-Kubitza	/README.TXT: Schema changes: manually apply schema changes to the live public schema: moved under "update mappings and staging table column names" because this is a necessary part of that step
12927	03/27/2014 04:40 AM	Aaron Marcuse-Kubitza	/README.TXT: Schema changes: changed "update staging table column names" to "update mappings and staging table column names"
12926	03/27/2014 04:13 AM	Aaron Marcuse-Kubitza	fix: validation/aggregating/specimens/qualitative_validations_specimens.sql: use pg_dump's formatting for COMMENT ON to facilitate diffing against a pg_dump export of the DDL statements
12925	03/27/2014 04:07 AM	Aaron Marcuse-Kubitza	validation/aggregating/specimens/NY/qualitative_validations_source_db_NYBG.VegCore.sql: removed DDL statements so that running the query file does not alter the database, using the steps at wiki.vegpath.org/Aggregating_validations_refactoring#remove-DDL-statements
12924	03/27/2014 04:01 AM	Aaron Marcuse-Kubitza	validation/aggregating/specimens/NY/qualitative_validations_source_db_NYBG.VegCore.sql: updated to DB, which pg_dump-formats the views
12923	03/27/2014 03:57 AM	Aaron Marcuse-Kubitza	validation/**.sql: replaced CREATE OR REPLACE VIEW with CREATE VIEW to match pg_dump output for diffing
12922	03/27/2014 03:36 AM	Aaron Marcuse-Kubitza	added inputs/NY/validations.sql
12921	03/27/2014 03:34 AM	Aaron Marcuse-Kubitza	fix: validation/aggregating/specimens/NY/qualitative_validations_source_db_NYBG.VegCore.sql: use pg_dump's formatting for COMMENT ON to facilitate diffing against a pg_dump export of the DDL statements
12920	03/27/2014 03:31 AM	Aaron Marcuse-Kubitza	bugfix: lib/common.Makefile: $(add*): need to wrap w/ $(wildcard) to prevent "targets don't exist" error, because svn 1.7 does not suppress this error even with --force
12919	03/27/2014 03:27 AM	Aaron Marcuse-Kubitza	bugfix: inputs/input.Makefile: add!: add* of $(svnFiles): need to ignore errors because svn 1.7 does not suppress the "targets don't exist" error even with --force
12918	03/26/2014 09:34 PM	Aaron Marcuse-Kubitza	fix: validation/aggregating/specimens/NY/qualitative_validations_source_db_NYBG.VegCore.sql: decimalLatitude/decimalLongitude: need to cast to double precision for numeric comparisons
12917	03/26/2014 09:33 PM	Aaron Marcuse-Kubitza	fix: validation/aggregating/specimens/NY/qualitative_validations_source_db_NYBG.VegCore.sql: CollectedDate: updated for refreshed NY data
12916	03/26/2014 09:30 PM	Aaron Marcuse-Kubitza	validation/aggregating/specimens/NY/qualitative_validations_source_db_NYBG.VegCore.sql: fixed typos in column aliases
12915	03/26/2014 09:23 PM	Aaron Marcuse-Kubitza	validation/aggregating/specimens/NY/qualitative_validations_source_db_NYBG.VegCore.sql: translated column names to VegCore, using `bin/in_place validation/aggregating/specimens/NY/qualitative_validations_source_db_NYBG.VegCore.sql env text=1 bin/repl inputs/NY/Ecatalog_all/map.csv` from the steps at wiki.vegpath.org/Aggregating_validations_refactoring#translate-to-Postgres
12914	03/26/2014 09:23 PM	Aaron Marcuse-Kubitza	fix: bin/repl: text mode (whether all patterns are plain text) should default to on, not off, if matching entire cells in a spreadsheet
12913	03/26/2014 07:16 PM	Aaron Marcuse-Kubitza	bugfix: validation/aggregating/specimens/NY/qualitative_validations_source_db_NYBG.VegCore.sql: need to enclose additional mixed-case identifiers in "", using the steps at wiki.vegpath.org/Aggregating_validations_refactoring#translate-to-Postgres
12912	03/26/2014 07:15 PM	Aaron Marcuse-Kubitza	bugfix: validation/aggregating/specimens/NY/qualitative_validations_source_db_NYBG.VegCore.sql: need to enclose additional mixed-case identifiers in "", using the steps at wiki.vegpath.org/Aggregating_validations_refactoring#translate-to-Postgres
12911	03/26/2014 06:09 PM	Aaron Marcuse-Kubitza	validation/aggregating/specimens/qualitative_validations_specimens.sql, NY/qualitative_validations_source_db_NYBG.VegCore.sql: abbreviated view names longer than 63 chars to prevent them from being truncated
12910	03/26/2014 06:07 PM	Aaron Marcuse-Kubitza	validation/aggregating/specimens/NY/qualitative_validations_source_db_NYBG.VegCore.sql: escape any ' inside '...' by doubling them
12909	03/26/2014 06:04 PM	Aaron Marcuse-Kubitza	validation/aggregating/specimens/NY/qualitative_validations_source_db_NYBG.VegCore.sql: translated SQL to Postgres
12908	03/26/2014 05:32 PM	Aaron Marcuse-Kubitza	validation/aggregating/specimens/qualitative_validations_specimens.sql, NY/qualitative_validations_source_db_NYBG.VegCore.sql: changed /* */ comments to COMMENT ON comments, using the steps at wiki.vegpath.org/Aggregating_validations_refactoring#prepend-CREATE-VIEW
12907	03/26/2014 04:58 PM	Aaron Marcuse-Kubitza	validation/aggregating/specimens/qualitative_validations_specimens.sql, NY/qualitative_validations_source_db_NYBG.VegCore.sql: removed no longer needed -- comments containing the query name, using the steps at wiki.vegpath.org/Aggregating_validations_refactoring#prepend-CREATE-VIEW
12906	03/26/2014 03:47 PM	Aaron Marcuse-Kubitza	validation/aggregating/specimens/qualitative_validations_specimens.sql: moved notes to comments to after the query
12905	03/26/2014 03:46 PM	Aaron Marcuse-Kubitza	validation/aggregating/specimens/NY/qualitative_validations_source_db_NYBG.VegCore.sql: moved notes to comments to after the query
12904	03/26/2014 03:44 PM	Aaron Marcuse-Kubitza	validation/aggregating/specimens/NY/qualitative_validations_source_db_NYBG.VegCore.sql: moved "Check" comments to after the query, using the steps at wiki.vegpath.org/Aggregating_validations_refactoring#translate-to-Postgres
12903	03/26/2014 03:22 PM	Aaron Marcuse-Kubitza	validation/aggregating/specimens/qualitative_validations_specimens.sql: removed "Check: should return [#] rows" comments because these only apply to the NY results, not to all specimens datasources
12902	03/26/2014 03:16 PM	Aaron Marcuse-Kubitza	validation/aggregating/specimens/NY/qualitative_validations_source_db_NYBG.VegCore.sql: prepended CREATE VIEW, using the steps at wiki.vegpath.org/Aggregating_validations_refactoring#prepend-CREATE-VIEW and the same abbreviations as the output queries (validation/aggregating/specimens/qualitative_validations_specimens.sql)
12901	03/26/2014 03:01 PM	Aaron Marcuse-Kubitza	validation/aggregating/specimens/NY/qualitative_validations_source_db_NYBG.VegCore.sql: synced "Check" comments to output queries validation/aggregating/specimens/qualitative_validations_specimens.sql
12900	03/26/2014 02:49 PM	Aaron Marcuse-Kubitza	validation/aggregating/specimens/NY/qualitative_validations_source_db_NYBG.VegCore.sql: enclosed mixed-case identifiers in "" using the steps at wiki.vegpath.org/Aggregating_validations_refactoring#translate-to-Postgres
12899	03/26/2014 02:37 PM	Aaron Marcuse-Kubitza	validation/aggregating/specimens/NY/qualitative_validations_source_db_NYBG.VegCore.sql: translated column names to VegCore, using `bin/in_place validation/aggregating/specimens/NY/qualitative_validations_source_db_NYBG.VegCore.sql env text=1 bin/repl inputs/NY/Ecatalog_all/map.csv` from the steps at wiki.vegpath.org/Aggregating_validations_refactoring#translate-to-Postgres
12898	03/26/2014 02:29 PM	Aaron Marcuse-Kubitza	validation/aggregating/specimens/NY/qualitative_validations_source_db_NYBG.VegCore.sql: updated to use column names for refreshed NY data
12897	03/26/2014 02:17 PM	Aaron Marcuse-Kubitza	fix: bin/repl: don't consider uppercase SQL keywords to indicate that a word is in a sentence
12896	03/26/2014 12:02 AM	Aaron Marcuse-Kubitza	validation/aggregating/specimens/NY/qualitative_validations_source_db_NYBG.VegCore.sql: use our staging tables instead of the BIEN2 MySQL staging tables
12895	03/25/2014 11:52 PM	Aaron Marcuse-Kubitza	validation/aggregating/specimens/**.sql: removed trailing whitespace, using the steps at wiki.vegpath.org/Aggregating_validations_refactoring#translate-to-Postgres
12894	03/25/2014 11:39 PM	Aaron Marcuse-Kubitza	archived validation/aggregating/specimens/NY/qualitative_validations_source_db_NYBG.sql
12893	03/25/2014 11:39 PM	Aaron Marcuse-Kubitza	added validation/aggregating/specimens/NY/qualitative_validations_source_db_NYBG.VegCore.sql, copied from qualitative_validations_source_db_NYBG.sql
12892	03/25/2014 11:33 PM	Aaron Marcuse-Kubitza	validation/aggregating/specimens/qualitative_validations_specimens.sql: added ; at end of `CREATE OR REPLACE VIEW` statements
12891	03/25/2014 04:18 AM	Aaron Marcuse-Kubitza	inputs/run: postprocess(): documented runtime on vegbiendev (1 h)
12890	03/24/2014 06:22 PM	Aaron Marcuse-Kubitza	validation/aggregating/specimens/qualitative_validations_specimens.sql: removed input-query-specific comments
12889	03/24/2014 06:21 PM	Aaron Marcuse-Kubitza	validation/aggregating/specimens/qualitative_validations_specimens.sql: reworded rowcount check comments to apply to the output queries
12888	03/24/2014 06:18 PM	Aaron Marcuse-Kubitza	validation/aggregating/specimens/qualitative_validations_specimens.sql: shortened view names to fit within the 63-char limit without truncation
12887	03/24/2014 05:45 PM	Aaron Marcuse-Kubitza	/README.TXT: `make inputs/{NVS,SALVIAS,TEAM}/test`: updated runtime (1 min)
12886	03/24/2014 05:35 PM	Aaron Marcuse-Kubitza	schemas/vegbien.sql: specimenreplicate.institution_id: renamed to duplicate_institutions_sourcelist_id, as decided in the conference calls (wiki.vegpath.org/2014-03-13_conference_call#schema-changes-2)
12885	03/24/2014 05:32 PM	Aaron Marcuse-Kubitza	inputs/run: postprocess(): updated runtime (25 min)
12884	03/24/2014 05:22 PM	Aaron Marcuse-Kubitza	fix: validation/aggregating/specimens/qualitative_validations_specimens.sql: changed "Full inner join" to "Full outer join" because a FULL JOIN is a type of outer join, not inner join
12883	03/24/2014 05:04 PM	Aaron Marcuse-Kubitza	/README.TXT: calls to `inputs/run postprocess`: direct user to refer to inputs/run for this, so the runtime doesn't have to be updated in multiple places
12882	03/24/2014 05:02 PM	Aaron Marcuse-Kubitza	inputs/run: postprocess(): updated runtime (20 min)
12881	03/24/2014 05:01 PM	Aaron Marcuse-Kubitza	/README.TXT: Schema changes: added steps to update staging table column names on the local machine and vegbiendev
12880	03/24/2014 04:50 PM	Aaron Marcuse-Kubitza	fix: schemas/VegCore/mk_derived: added `EOF` at end to avoid (benign) "here-document delimited by end-of-file" warnings on Linux
12879	03/24/2014 01:49 AM	Aaron Marcuse-Kubitza	mappings/VegCore.htm: regenerated from wiki: rename specimenHolderInstitutions to specimen_duplicate_institutions, as decided in the 2014-03-13 conference call (wiki.vegpath.org/2014-03-13_conference_call#schema-changes-2). note that most schema changes (such as this one) involve mappings changes, which are handled automatically by `inputs/run postprocess; yes\|make inputs/{NVS,SALVIAS,TEAM}/test`.
12878	03/24/2014 01:43 AM	Aaron Marcuse-Kubitza	bugfix: lib/runscripts/table.run: schema/make calls: need to use `make schema` instead because old-style datasources don't have a top-level runscript (the absence of this identifies them as old-style so inputs/input.Makefile works correctly)
12877	03/24/2014 01:21 AM	Aaron Marcuse-Kubitza	/README.TXT: Maintenance: VegCore data dictionary: `make inputs/{NVS,SALVIAS,TEAM}/test`: recorded runtime (30 s)
12876	03/24/2014 01:17 AM	Aaron Marcuse-Kubitza	/README.TXT: Maintenance: VegCore data dictionary: `make inputs/{NVS,SALVIAS,TEAM}/test`: prepended `time` to enable obtaining the runtime
12875	03/24/2014 01:11 AM	Aaron Marcuse-Kubitza	/README.TXT: Maintenance: VegCore data dictionary: `inputs/run postprocess`: updated runtime (20 min)
12874	03/24/2014 12:45 AM	Aaron Marcuse-Kubitza	fix: schemas/util.sql: trim(): by default, cascadingly drop dependent columns so that they don't prevent trim() from succeeding. note that this requires the dependent columns to then be manually re-created.
12873	03/23/2014 11:43 PM	Aaron Marcuse-Kubitza	bugfix: inputs/GBIF/table.run: switched to using lib/runscripts/table.run instead of mysql.table.run because some subdirs (Source/) need the regular table.run to work properly. mysql.table.run should instead be used directly by subdirs that use the MySQL install.
12872	03/22/2014 06:20 AM	Aaron Marcuse-Kubitza	bugfix: lib/sh/util.sh: DON'T do `shopt -s lastpipe` because this causes a segfault on Linux in stderr_matches(). (it also isn't supported on Mac.) use @PIPESTATUS instead. note that we do not currently need lastpipe, since we use @PIPESTATUS (which actually provides more functionality for our purposes).
12871	03/22/2014 06:02 AM	Aaron Marcuse-Kubitza	fix: lib/sh/util.sh: echo_func(): file/line #: display with regular color because the lighter color actually draws attention to rather than away from the faded text
12870	03/22/2014 05:59 AM	Aaron Marcuse-Kubitza	lib/sh/util.sh: added plain()
12869	03/22/2014 05:56 AM	Aaron Marcuse-Kubitza	inputs/XAL/Specimen/test.xml.ref: updated for sample data.csv, which contains the columns as a CSV. this fixes a bug where a map.csv must be used on a table that contains the same set of columns (ie. not one with no columns if there are any mappings).
12868	03/22/2014 05:50 AM	Aaron Marcuse-Kubitza	bugfix: lib/sql_io.py: put_table(): is_literals: `return sql.value(cur): need to use sql.value_or_none() instead to support multi-row functions, such as _split() used in specimens data`
12867	03/22/2014 05:06 AM	Aaron Marcuse-Kubitza	fix: inputs/input.Makefile: don't treat *.xml as data files since these are not currently supported
12866	03/22/2014 04:55 AM	Aaron Marcuse-Kubitza	lib/runscripts/util.run: on_exit(): documented that users can also override gateway()/fallback() to perform other commands (or no commands) after the script is read

Project

General

Profile