/trunk/inputs - Changes - BIEN 3 - NCEAS Projects

root/trunk/inputs @ 14845

svn:ignore: .~*

#	Date	Author	Comment
14845	10/18/2014 07:32 PM	Aaron Marcuse-Kubitza	added inputs/XAL/Specimen/data.csv (empty template)
14844	10/18/2014 05:03 PM	Aaron Marcuse-Kubitza	fix: inputs/input.Makefile: add!: verify/: also svn:ignore .zip
14843	10/18/2014 05:03 PM	Aaron Marcuse-Kubitza	fix: inputs/input.Makefile: add!: verify/: also svn:ignore .zip
14840	10/17/2014 02:52 PM	Aaron Marcuse-Kubitza	inputs/publishable datasources.xlsx: updated: replaced BIEN2 traits with Cyrille traits
14838	10/16/2014 01:51 PM	Aaron Marcuse-Kubitza	fix: inputs/publishable datasources.xlsx: updated: conditions of use: Canadensys sources: these should actually be marked as no restrictions, in contrast to Brian E's earlier annotations, because they are public domain: the citation is requested, but not legally required
14837	10/16/2014 01:46 PM	Aaron Marcuse-Kubitza	inputs/publishable datasources.xlsx: updated
14836	10/16/2014 01:40 PM	Aaron Marcuse-Kubitza	inputs/publishable datasources.xlsx: updated: conditions of use: indicated which datasources have no restrictions
14833	10/16/2014 10:49 AM	Aaron Marcuse-Kubitza	inputs/CVS/^taxon_observation.**.sample/test.xml.ref: updated
14832	10/16/2014 10:49 AM	Aaron Marcuse-Kubitza	fix: inputs/CVS/plantConcept_/header.csv: regenerated after fixing the postprocess/cleanup ordering bug (r14827), which had caused header.csv to be incorrectly generated after renames in postprocess.sql were applied
14827	10/14/2014 02:50 PM	Aaron Marcuse-Kubitza	bugfix: inputs/input.Makefile: postprocess must be run after cleanup rather than before because it depends on the cleanup having been performed. this bug was not previously detected because this is only a problem when refreshing a datasource to data in the same format: this would attempt to run an existing postprocess.sql, out of order, instead of starting with no postprocess.sql as we usually do....
14825	10/14/2014 11:05 AM	Aaron Marcuse-Kubitza	bugfix: inputs/input.Makefile: $(dbExports): also need to put data.sql before clean_up.sql, etc. previously, this ordering had to be done by naming clean_up.sql, etc so they would sort after data.sql alphabetically, but it can be confusing to have to remember to do this. this fixes a bug in the CVS refresh where cvs.~.clean_up.sql was being run before data.sql, causing some private columns to have been deleted before the data was imported into the tables, creating a column mismatch error.
14819	10/14/2014 09:54 AM	Aaron Marcuse-Kubitza	inputs/Cyrille_traits/Makefile: set custom $(null_strs) which handles "NA"
14818	10/14/2014 09:49 AM	Aaron Marcuse-Kubitza	inputs/input.Makefile: pass make var $(null_strs) to invoked commands so it can be used by lib/sql_io.py
14817	10/14/2014 09:39 AM	Aaron Marcuse-Kubitza	fix: *Makefile: changed line endings to \n so that `patch` can work with pasted input. use `svn di --extensions --ignore-eol-style` to verify no diff.
14812	10/13/2014 06:31 AM	Aaron Marcuse-Kubitza	added inputs/CVS/_src/cvs-eep-archive-2014-10-07-correctedCVSData.{data,schema}.sql.ini
14811	10/13/2014 06:31 AM	Aaron Marcuse-Kubitza	bugfix: inputs/CVS/_src/{data,schema}.sql.ini: sourcefilename: this needs to be on the VM's own HD to avoid crashing MSAccess to PostgreSQL. destinationdatabase: added this back since it is fine to leave this blank.
14810	10/13/2014 06:23 AM	Aaron Marcuse-Kubitza	added inputs/CVS/_src/cvs-eep-archive-2013-10-22-VegBIEN.{data,schema}.sql.ini
14809	10/13/2014 06:17 AM	Aaron Marcuse-Kubitza	added inputs/CVS/_src/{data,schema}.sql.ini
14806	10/12/2014 03:58 PM	Aaron Marcuse-Kubitza	inputs/publishable datasources.xlsx: updated: use white text on dark backgrounds for better visibility, and to create more visual contrast for the unredistributable indicators
14805	10/12/2014 03:52 PM	Aaron Marcuse-Kubitza	inputs/publishable datasources.xlsx: updated: consolidated Brian E's new columns into a single "conditions?" column. fix: "what is needed to publish it": renamed to "conditions of use/remaining tasks". "conditions of use": changed color scheme to match "publishable?" columns.
14804	10/09/2014 12:34 AM	Aaron Marcuse-Kubitza	added inputs/CVS/verify/Review of CVS data in BIEN3-RKP2014Sept7-Revised.docx from Bob
14799	10/03/2014 05:36 PM	Aaron Marcuse-Kubitza	bugfix: inputs/CVS/plot_/postprocess.sql: locality: site_name should come before directions_to_place because it is at a higher level of granularity
14795	10/03/2014 12:37 PM	Aaron Marcuse-Kubitza	added inputs/CVS/verify/Review of CVS data in BIEN3-RKP2014Sept7.docx from Bob
14789	10/01/2014 09:35 PM	Aaron Marcuse-Kubitza	added inputs/bien2_traits/_no_import since bien2_traits has been replaced by Cyrille_traits
14786	09/30/2014 07:45 AM	Aaron Marcuse-Kubitza	added inputs/Cyrille_traits/
14784	09/30/2014 05:04 AM	Aaron Marcuse-Kubitza	mappings/VegCore.htm: regenerated from wiki. made verbatimLocality a synonym of locality since they are used to store the same data.
14777	09/26/2014 10:57 PM	Aaron Marcuse-Kubitza	fix: inputs/input.Makefile: $(nonHeaderSrcs): updated to exclude new header.txt
14766	09/26/2014 02:30 AM	Aaron Marcuse-Kubitza	inputs/input.Makefile: added %/list_srcs
14762	09/26/2014 12:36 AM	Aaron Marcuse-Kubitza	fix: lib/sh/util.sh: already_exists_msg(): changed calling convention to avoid it seeming like `return 0` is run if already_exists_msg() throws an error, when in fact already_exists_msg() is just a command that should be run before returning/errexiting
14761	09/25/2014 04:28 PM	Aaron Marcuse-Kubitza	fix: inputs/input.Makefile: need to escape $ in commands, including inside comments
14759	09/25/2014 03:15 PM	Aaron Marcuse-Kubitza	bugfix: inputs/input.Makefile: `$(call add*,$(svnFiles))` must be invoked externally to clear the $(wildcard) cache before expanding $(svnFiles)
14745	09/24/2014 11:34 PM	Aaron Marcuse-Kubitza	inputs/VegBank/run*.log: updated. this adds the function call context in addition to the function location.
14697	09/18/2014 04:57 PM	Aaron Marcuse-Kubitza	fix: inputs/.geoscrub/geoscrub_output/geoscrub.csv.run: make(): added warning that this will truncate the geoscrub database tables
14684	09/10/2014 11:00 PM	Aaron Marcuse-Kubitza	added inputs/VegBank/run.call_graph.log
14683	09/10/2014 05:03 PM	Aaron Marcuse-Kubitza	inputs/VegBank/run.log: updated for echo_vars() changes. the PG* vars, which contain important information, will now not need to be filtered out.
14677	09/10/2014 12:51 PM	Aaron Marcuse-Kubitza	added inputs/VegBank/run.log
14676	09/10/2014 12:49 PM	Aaron Marcuse-Kubitza	fix: inputs/input.Makefile: $(svnFilesGlob): *.log should be in both the subdirs and the main dir
14675	09/10/2014 12:48 PM	Aaron Marcuse-Kubitza	inputs/input.Makefile: $(svnFilesGlob): *.log
14674	09/09/2014 06:43 PM	Aaron Marcuse-Kubitza	inputs/Makefile: install: install an empty VegBIEN schema instead of all the datasources, at Mark's request. this enables loading just a single datasource.
14633	08/29/2014 03:55 PM	Aaron Marcuse-Kubitza	added inputs/CVS/verify/Review of CVS data in BIEN3.docx
14588	08/26/2014 05:02 PM	Aaron Marcuse-Kubitza	bugfix: inputs/input.Makefile: %/install: $(exportHeader) must come before postprocess because postprocess renames columns
14582	08/26/2014 04:07 PM	Aaron Marcuse-Kubitza	bugfix: inputs/input.Makefile: $(import_install_): need `set -o pipefail` to enable errexit
14579	08/26/2014 02:52 AM	Aaron Marcuse-Kubitza	inputs/.geoscrub/geoscrub_output/run: documented postprocess() rm=1 runtime (6 min)
14556	08/21/2014 07:31 PM	Aaron Marcuse-Kubitza	fix: inputs/.geoscrub/geoscrub_output/postprocess.sql: map_geovalidity(): unscrubbable names should actually be geoinvalid, not geovalid=NULL, according to Brad
14532	08/20/2014 04:26 PM	Aaron Marcuse-Kubitza	bugfix: inputs/input.Makefile: sql/install: ";" for commands inside $(if) blocks need to be inside the $(if) block, too, because otherwise there will be dangling ";" without a statement (bash does not support empty statements containing just ";")
14512	08/19/2014 11:47 AM	Aaron Marcuse-Kubitza	inputs/publishable datasources.xlsx: updated
14492	08/18/2014 01:58 PM	Aaron Marcuse-Kubitza	inputs/.TNRS/schema.sql: taxon_match: added taxon_scrub_best_match_jerry_lu index to facilitate finding names affected by the match-picking bug (#943)
14490	08/17/2014 07:49 AM	Aaron Marcuse-Kubitza	fix: inputs/HVAA/Specimen/postprocess.sql, map.csv: monthCollected/dayCollected: fix indefinite dates (which aren't supported by Postgres), as decided by Bob (https://docs.google.com/spreadsheets/d/1PI8n0CRttN7ttsXs5qfh5OFFzSoAfJj0gSbylgX6vj4/edit#gid=0)
14468	08/11/2014 07:48 AM	Aaron Marcuse-Kubitza	inputs/.TNRS/schema.sql: taxon_match: added taxon_scrub_by_name index
14467	08/11/2014 07:46 AM	Aaron Marcuse-Kubitza	inputs/.TNRS/schema.sql: taxon_match: added taxon_scrub_by_family index
14466	08/11/2014 07:10 AM	Aaron Marcuse-Kubitza	inputs/.TNRS/schema.sql: taxon_match: added taxon_scrub_by_species_binomial index
14454	08/11/2014 03:40 AM	Aaron Marcuse-Kubitza	bugfix: mappings/VegCore-VegBIEN.csv: prefixed taxonomic ranks: use _concat_nullify() so that the prefix is only added if the epithet is non-NULL
14451	08/11/2014 02:16 AM	Aaron Marcuse-Kubitza	bugfix: inputs/FIA/REF_RESEARCH_STATION/map.csv: mapped country, which is not provided in the FIA data
14448	08/10/2014 05:53 AM	Aaron Marcuse-Kubitza	inputs/.TNRS/schema.sql: taxon_match: removed no longer used scrubbed_unique_taxon_name. the scrubbed name ranks are now generated from the other TNRS columns instead.
14436	08/09/2014 08:46 PM	Aaron Marcuse-Kubitza	inputs/.TNRS/schema.sql: removed no longer used view ValidMatchedTaxon. use taxon_scrub instead.
14434	08/09/2014 08:42 PM	Aaron Marcuse-Kubitza	inputs/.TNRS/schema.sql: taxon_scrub: use taxon_best_match directly, to avoid the need for a separate ValidMatchedTaxon view
14433	08/09/2014 08:25 PM	Aaron Marcuse-Kubitza	fix: inputs/.TNRS/schema.sql: taxon_scrub: merged synonymous columns
14432	08/09/2014 08:11 PM	Aaron Marcuse-Kubitza	schemas/vegbien.sql: taxon_scrub: documented steps to merge synonymous columns
14431	08/09/2014 07:45 PM	Aaron Marcuse-Kubitza	inputs/.TNRS/schema.sql: removed no longer used view MatchedTaxon. use taxon_best_match instead.
14430	08/09/2014 07:43 PM	Aaron Marcuse-Kubitza	inputs/.TNRS/schema.sql: ValidMatchedTaxon: use taxon_best_match now that it's equivalent to MatchedTaxon
14426	08/09/2014 07:19 PM	Aaron Marcuse-Kubitza	fix: inputs/.TNRS/schema.sql: MatchedTaxon: merged synonymous columns
14424	08/09/2014 06:52 PM	Aaron Marcuse-Kubitza	inputs/.TNRS/schema.sql: removed no longer used taxon_scrub.scrubbed_unique_taxon_name.* . use taxon_scrub instead.
14423	08/09/2014 06:50 PM	Aaron Marcuse-Kubitza	inputs/.TNRS/schema.sql: taxon_scrub: use taxon_match derived columns instead of the incorrect values in taxon_scrub.scrubbed_unique_taxon_name.* (which does not work with the multi-match strategy)
14422	08/09/2014 05:54 AM	Aaron Marcuse-Kubitza	inputs/.TNRS/schema.sql: MatchedTaxon: use derived columns from taxon_match. this also incorporates the fixes in the new derived columns.
14421	08/09/2014 05:27 AM	Aaron Marcuse-Kubitza	inputs/.TNRS/schema.sql: taxon_scrub: use derived columns from taxon_match. this also incorporates the fixes in the new derived columns.
14417	08/05/2014 05:31 PM	Aaron Marcuse-Kubitza	inputs/.TNRS/schema.sql: taxon_match: to port derived column changes to vegbiendev: derived_cols_export() code: documented runtime (6 h)
14416	08/04/2014 06:25 AM	Aaron Marcuse-Kubitza	bugfix: inputs/.TNRS/schema.sql: removed no longer used derived column __accepted_infraspecific_label, which had a buggy formula that broke derived_cols_populate()
14414	08/04/2014 06:17 AM	Aaron Marcuse-Kubitza	bugfix: inputs/.TNRS/schema.sql: removed no longer used derived column __accepted_infraspecific_label, which had a buggy formula that broke derived_cols_populate()
14413	08/04/2014 06:10 AM	Aaron Marcuse-Kubitza	fix: inputs/.TNRS/schema.sql: taxon_match: to remove a column: updated instructions
14408	08/02/2014 07:58 PM	Aaron Marcuse-Kubitza	**: updated to use the local machine's new hostname, frenzy
14388	07/28/2014 11:58 PM	Aaron Marcuse-Kubitza	inputs/.TNRS/schema.sql: added new derived columns to derived views
14379	07/28/2014 04:45 PM	Aaron Marcuse-Kubitza	fix: schemas/util.sql: derived_col_update(): also need steps to drop column, because DROP __ CASCADE doesn't work when there are dependent views
14378	07/28/2014 04:33 PM	Aaron Marcuse-Kubitza	inputs/.TNRS/schema.sql: _accepted_infraspecific{rank,epithet}: use array slice of new _accepted{genus,specific_epithet,infra_{rank,epithet}}, which is simpler than using remove_prefix() in __accepted_infraspecific_label
14377	07/28/2014 04:21 PM	Aaron Marcuse-Kubitza	inputs/.TNRS/schema.sql: "[accepted_]genus__@DwC__@vegpath.org": don't need to use Accepted_name anymore because _accepted{genus,specific_epithet,infra_{rank,epithet}} is now generated from Accepted_name
14376	07/28/2014 04:02 PM	Aaron Marcuse-Kubitza	inputs/.TNRS/schema.sql: taxon_match."__accepted_{genus,specific_epithet}": renamed to "__accepted_{genus,specific_epithet,infra_{rank,epithet}}" since this now includes these other ranks as well
14375	07/28/2014 03:48 PM	Aaron Marcuse-Kubitza	bugfix: inputs/.TNRS/schema.sql: taxon_match."__accepted_{genus,specific_epithet}": use "Accepted_name" instead of "Accepted_species[_binomial]__@TNRS__@vegpath.org" (from "Accepted_name_species") because Accepted_name_species apparently sometimes does not match the Accepted_name and uses malformed Unicode characters
14368	07/26/2014 10:16 PM	Aaron Marcuse-Kubitza	inputs/.TNRS/schema.sql: taxon_match: `inputs/.TNRS/data.sql.run refresh`: documented runtime (1 min)
14367	07/26/2014 10:15 PM	Aaron Marcuse-Kubitza	bugfix: inputs/.TNRS/schema.sql: taxon_match: use "Accepted_species[_binomial]__@TNRS__@vegpath.org" instead of "*Accepted_name_species". this fixes a bug in __accepted_infraspecific_label where Accepted_name_species with trailing whitespace could not be prefix-removed from names that contained just a species binomial.
14366	07/26/2014 09:57 PM	Aaron Marcuse-Kubitza	fix: inputs/.TNRS/schema.sql: taxon_match: added derived column "Accepted_species[_binomial]__@TNRS__@vegpath.org", which removes trailing whitespace
14365	07/26/2014 08:58 PM	Aaron Marcuse-Kubitza	inputs/.TNRS/schema.sql: added steps to remove a column and to add a non-derived column
14364	07/26/2014 08:48 PM	Aaron Marcuse-Kubitza	inputs/.TNRS/schema.sql: taxon_match: to remove columns or add columns at the end: merged into "to add a new derived column"
14363	07/26/2014 08:44 PM	Aaron Marcuse-Kubitza	inputs/.TNRS/schema.sql: to add columns in the middle: renamed to "to move a column to the middle" for clarity
14362	07/26/2014 08:38 PM	Aaron Marcuse-Kubitza	inputs/.TNRS/schema.sql: to populate a new column: updated to use util.derived_col_update()
14361	07/26/2014 08:24 PM	Aaron Marcuse-Kubitza	fix: inputs/.TNRS/schema.sql: taxon_match: to remove columns or add columns: also need to run util.recreate_view()
14360	07/26/2014 08:19 PM	Aaron Marcuse-Kubitza	inputs/.TNRS/schema.sql: taxon_match: to remove columns or add columns at the end: don't need to run `rm=1 inputs/.TNRS/data.sql.run` because this is now run by `make schemas/remake`
14359	07/26/2014 08:13 PM	Aaron Marcuse-Kubitza	inputs/.TNRS/schema.sql: added new derived columns to derived views
14358	07/26/2014 07:52 PM	Aaron Marcuse-Kubitza	schemas/util.sql: remove_prefix(), remove_suffix(): support case-insensitive matching
14357	07/26/2014 07:43 PM	Aaron Marcuse-Kubitza	bugfix: inputs/.TNRS/schema.sql: taxon_match.__accepted_infraspecific_label: need to use case-insensitive matching of the removed prefix because TNRS lowercases part of the Accepted_name
14356	07/26/2014 07:40 PM	Aaron Marcuse-Kubitza	schemas/util.sql: remove_prefix(), remove_suffix(): support case-insensitive matching
14347	07/26/2014 05:16 PM	Aaron Marcuse-Kubitza	bugfix: inputs/.TNRS/schema.sql: taxon_match: use wrapper for util.remove_prefix() so CHECK constraints that use it don't get dropped when the util schema is reinstalled
14332	07/25/2014 02:21 PM	Aaron Marcuse-Kubitza	inputs/.TNRS/schema.sql: taxon_match: COMMENT: added steps to port derived column changes to vegbiendev
14330	07/25/2014 12:41 PM	Aaron Marcuse-Kubitza	bugfix: inputs/.TNRS/schema.sql: taxon_match: derived columns: use new "matched~Name[_no_author]___@TNRS__@vegpath.org" instead of "*Name_matched" so that "No suitable matches found." is removed before concatenating with other fields
14329	07/25/2014 12:14 PM	Aaron Marcuse-Kubitza	inputs/.TNRS/schema.sql: taxon_match: added derived column "matched~Name[_no_author]___@TNRS__@vegpath.org", which removes the "No suitable matches found." string
14328	07/22/2014 06:50 AM	Aaron Marcuse-Kubitza	inputs/.TNRS/schema.sql: reordered derived columns in dependency order
14327	07/22/2014 06:46 AM	Aaron Marcuse-Kubitza	bugfix: inputs/.TNRS/schema.sql: "[accepted_]morphospecies[_binomial]__@Brad__.TNRS@vegpath.org": need to use "[accepted_]genus__@DwC__@vegpath.org" rather than "*Accepted_name" for this for rank = genus
14326	07/22/2014 06:44 AM	Aaron Marcuse-Kubitza	inputs/.TNRS/schema.sql: taxon_match: added derived column "[scrubbed_]morphospecies[_binomial]__@Brad__.TNRS@vegpath.org"
14325	07/22/2014 06:41 AM	Aaron Marcuse-Kubitza	bugfix: inputs/.TNRS/schema.sql: "[accepted_]genus__@DwC__@vegpath.org": need to populate this for rank = genus
14324	07/22/2014 05:47 AM	Aaron Marcuse-Kubitza	inputs/.TNRS/schema.sql: taxon_match: added derived column "[scrubbed_]taxonomicStatus__@DwC__@vegpath.org"
14323	07/22/2014 05:30 AM	Aaron Marcuse-Kubitza	bugfix: inputs/.TNRS/schema.sql: derived columns: use "Accepted_family__@TNRS__@vegpath.org" instead of "Accepted_name_family" because "Accepted_name_family" is sometimes missing
14318	07/22/2014 04:12 AM	Aaron Marcuse-Kubitza	fix: inputs/.TNRS/schema.sql: taxon_match: added derived column "Accepted_family__@TNRS__@vegpath.org", which is needed because "*Accepted_name_family" isn't always populated
