  • svn:ignore: .~*

# Date Author Comment
14843 10/18/2014 05:03 PM Aaron Marcuse-Kubitza

fix: inputs/input.Makefile: add!: verify/: also svn:ignore .zip

14840 10/17/2014 02:52 PM Aaron Marcuse-Kubitza

inputs/publishable datasources.xlsx: updated: replaced BIEN2 traits with Cyrille traits

14838 10/16/2014 01:51 PM Aaron Marcuse-Kubitza

fix: inputs/publishable datasources.xlsx: updated: conditions of use: Canadensys sources: these should actually be marked as no restrictions, in contrast to Brian E's earlier annotations, because they are public domain: the citation is requested, but not legally required

14837 10/16/2014 01:46 PM Aaron Marcuse-Kubitza

inputs/publishable datasources.xlsx: updated

14836 10/16/2014 01:40 PM Aaron Marcuse-Kubitza

inputs/publishable datasources.xlsx: updated: conditions of use: indicated which datasources have no restrictions

14833 10/16/2014 10:49 AM Aaron Marcuse-Kubitza

inputs/CVS/^taxon_observation.**.sample/test.xml.ref: updated

14832 10/16/2014 10:49 AM Aaron Marcuse-Kubitza

fix: inputs/CVS/plantConcept_/header.csv: regenerated after fixing the postprocess/cleanup ordering bug (r14827), which had caused header.csv to be incorrectly generated after renames in postprocess.sql were applied

14827 10/14/2014 02:50 PM Aaron Marcuse-Kubitza

bugfix: inputs/input.Makefile: postprocess must be run after cleanup rather than before because it depends on the cleanup having been performed.

this bug was not previously detected because this is only a problem when refreshing a datasource to data in the same format: this would attempt to run an existing postprocess.sql, out of order, instead of starting with no postprocess.sql as we usually do....

14825 10/14/2014 11:05 AM Aaron Marcuse-Kubitza

bugfix: inputs/input.Makefile: $(dbExports): also need to put data.sql before clean_up.sql, etc. previously, this ordering had to be done by naming clean_up.sql, etc so they would sort after data.sql alphabetically, but it can be confusing to have to remember to do this. this fixes a bug in the CVS refresh where cvs.~.clean_up.sql was being run before data.sql, causing some private columns to have been deleted before the data was imported into the tables, creating a column mismatch error.
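
The entry above replaces name-based alphabetical ordering with explicit ordering. A minimal shell sketch of that idea follows; `order_exports` is a hypothetical helper, not something taken from input.Makefile, and it only illustrates putting data.sql ahead of the other scripts regardless of how the names sort.

```shell
#!/usr/bin/env bash
# Hypothetical helper (not from input.Makefile): print any data.sql entries
# first, then the remaining files in their original order, so the result
# no longer depends on how the file names happen to sort alphabetically.
order_exports() {
    local f
    for f in "$@"; do
        case "$f" in
            data.sql|*.data.sql) printf '%s\n' "$f" ;;
        esac
    done
    for f in "$@"; do
        case "$f" in
            data.sql|*.data.sql) ;;
            *) printf '%s\n' "$f" ;;
        esac
    done
}

# cvs.~.clean_up.sql sorts before data.sql alphabetically, but is listed after it here.
order_exports 'cvs.~.clean_up.sql' data.sql
```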

14819 10/14/2014 09:54 AM Aaron Marcuse-Kubitza

inputs/Cyrille_traits/Makefile: set custom $(null_strs) which handles "NA"

14818 10/14/2014 09:49 AM Aaron Marcuse-Kubitza

inputs/input.Makefile: pass make var $(null_strs) to invoked commands so it can be used by lib/sql_io.py

14817 10/14/2014 09:39 AM Aaron Marcuse-Kubitza

fix: *Makefile: changed line endings to \n so that `patch` can work with pasted input. use `svn di --extensions --ignore-eol-style` to verify no diff.
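
A small sketch of normalizing line endings to \n, assuming the old endings were \r\n (the log entry does not say what they were). `to_lf` is a hypothetical filter name, not part of the repository.

```shell
#!/usr/bin/env bash
# Assumption: the old line endings were \r\n (the log entry does not say).
# to_lf is a hypothetical filter that strips every carriage return from stdin.
to_lf() {
    tr -d '\r'
}

# Count the lines that still contain a carriage return after conversion.
# grep -c exits non-zero when the count is 0, hence the "|| true".
printf 'all:\r\n\techo hi\r\n' | to_lf | grep -c "$(printf '\r')" || true
```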

14812 10/13/2014 06:31 AM Aaron Marcuse-Kubitza

added inputs/CVS/_src/cvs-eep-archive-2014-10-07-correctedCVSData.{data,schema}.sql.ini

14811 10/13/2014 06:31 AM Aaron Marcuse-Kubitza

bugfix: inputs/CVS/_src/{data,schema}.sql.ini: sourcefilename: this needs to be on the VM's own HD to avoid crashing MSAccess to PostgreSQL. destinationdatabase: added this back since it is fine to leave this blank.

14810 10/13/2014 06:23 AM Aaron Marcuse-Kubitza

added inputs/CVS/_src/cvs-eep-archive-2013-10-22-VegBIEN.{data,schema}.sql.ini

14809 10/13/2014 06:17 AM Aaron Marcuse-Kubitza

added inputs/CVS/_src/{data,schema}.sql.ini

14806 10/12/2014 03:58 PM Aaron Marcuse-Kubitza

inputs/publishable datasources.xlsx: updated: use white text on dark backgrounds for better visibility, and to create more visual contrast for the unredistributable indicators

14805 10/12/2014 03:52 PM Aaron Marcuse-Kubitza

inputs/publishable datasources.xlsx: updated: consolidated Brian E's new columns into a single "conditions?" column. fix: "what is needed to publish it": renamed to "conditions of use/remaining tasks". "conditions of use": changed color scheme to match "publishable?" columns.

14804 10/09/2014 12:34 AM Aaron Marcuse-Kubitza

added inputs/CVS/verify/Review of CVS data in BIEN3-RKP2014Sept7-Revised.docx from Bob

14799 10/03/2014 05:36 PM Aaron Marcuse-Kubitza

bugfix: inputs/CVS/plot_/postprocess.sql: locality: site_name should come before directions_to_place because it is at a higher level of granularity

14795 10/03/2014 12:37 PM Aaron Marcuse-Kubitza

added inputs/CVS/verify/Review of CVS data in BIEN3-RKP2014Sept7.docx from Bob

14789 10/01/2014 09:35 PM Aaron Marcuse-Kubitza

added inputs/bien2_traits/_no_import since bien2_traits has been replaced by Cyrille_traits

14786 09/30/2014 07:45 AM Aaron Marcuse-Kubitza

added inputs/Cyrille_traits/

14784 09/30/2014 05:04 AM Aaron Marcuse-Kubitza

mappings/VegCore.htm: regenerated from wiki. made verbatimLocality a synonym of locality since they are used to store the same data.

14777 09/26/2014 10:57 PM Aaron Marcuse-Kubitza

fix: inputs/input.Makefile: $(nonHeaderSrcs): updated to exclude new header.txt

14766 09/26/2014 02:30 AM Aaron Marcuse-Kubitza

inputs/input.Makefile: added %/list_srcs

14762 09/26/2014 12:36 AM Aaron Marcuse-Kubitza

fix: lib/sh/util.sh: already_exists_msg(): changed calling convention to avoid it seeming like `return 0` is run if already_exists_msg() throws an error, when in fact already_exists_msg() is just a command that should be run before returning/errexiting

14761 09/25/2014 04:28 PM Aaron Marcuse-Kubitza

fix: inputs/input.Makefile: need to escape $ in commands, including inside comments

14759 09/25/2014 03:15 PM Aaron Marcuse-Kubitza

bugfix: inputs/input.Makefile: `$(call add*,$(svnFiles))` must be invoked externally to clear the $(wildcard) cache before expanding $(svnFiles)

14745 09/24/2014 11:34 PM Aaron Marcuse-Kubitza

inputs/VegBank/run*.log: updated. this adds the function call context in addition to the function location.

14697 09/18/2014 04:57 PM Aaron Marcuse-Kubitza

fix: inputs/.geoscrub/geoscrub_output/geoscrub.csv.run: make(): added warning that this will truncate the geoscrub database tables

14684 09/10/2014 11:00 PM Aaron Marcuse-Kubitza

added inputs/VegBank/run.call_graph.log

14683 09/10/2014 05:03 PM Aaron Marcuse-Kubitza

inputs/VegBank/run.log: updated for echo_vars() changes. the PG* vars, which contain important information, will now not need to be filtered out.

14677 09/10/2014 12:51 PM Aaron Marcuse-Kubitza

added inputs/VegBank/run.log

14676 09/10/2014 12:49 PM Aaron Marcuse-Kubitza

fix: inputs/input.Makefile: $(svnFilesGlob): *.log should be in both the subdirs and the main dir

14675 09/10/2014 12:48 PM Aaron Marcuse-Kubitza

inputs/input.Makefile: $(svnFilesGlob): *.log

14674 09/09/2014 06:43 PM Aaron Marcuse-Kubitza

inputs/Makefile: install: install an empty VegBIEN schema instead of all the datasources, at Mark's request. this enables loading just a single datasource.

14633 08/29/2014 03:55 PM Aaron Marcuse-Kubitza

added inputs/CVS/verify/Review of CVS data in BIEN3.docx

14588 08/26/2014 05:02 PM Aaron Marcuse-Kubitza

bugfix: inputs/input.Makefile: %/install: $(exportHeader) must come before postprocess because postprocess renames columns

14582 08/26/2014 04:07 PM Aaron Marcuse-Kubitza

bugfix: inputs/input.Makefile: $(import_install_): need `set -o pipefail` to enable errexit
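
To illustrate why `set -o pipefail` matters for errexit, here is a minimal sketch. The pipelines are placeholders and are not taken from $(import_install_).

```shell
#!/usr/bin/env bash
# Each case runs in its own bash process so the options do not leak.
# The pipelines are illustrative; they are not taken from input.Makefile.

# Without pipefail, the pipeline's status is that of the last command (cat),
# so errexit does not fire and the echo still runs.
without=$(bash -c 'set -o errexit; false | cat; echo reached')

# With pipefail, the failing `false` makes the whole pipeline fail,
# so errexit stops the script before the echo.
with=$(bash -c 'set -o errexit -o pipefail; false | cat; echo reached' || true)

echo "without pipefail: '$without'"
echo "with pipefail:    '$with'"
```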

14579 08/26/2014 02:52 AM Aaron Marcuse-Kubitza

inputs/.geoscrub/geoscrub_output/run: documented postprocess() rm=1 runtime (6 min)

14556 08/21/2014 07:31 PM Aaron Marcuse-Kubitza

fix: inputs/.geoscrub/geoscrub_output/postprocess.sql: map_geovalidity(): unscrubbable names should actually be geo*in*valid, not geovalid=NULL, according to Brad

14532 08/20/2014 04:26 PM Aaron Marcuse-Kubitza

bugfix: inputs/input.Makefile: sql/install: ";" for commands inside $(if) blocks need to be inside the $(if) block, too, because otherwise there will be dangling ";" without a statement (bash does not support empty statements containing just ";")
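
The dangling ";" problem can be reproduced outside make. The sketch below uses placeholder commands, not the actual sql/install recipe, and compares a command line with a stray ";" against one without it.

```shell
#!/usr/bin/env bash
# Illustrative only: the commands are placeholders, not the Makefile's recipe.
# A conditional that expands to nothing, followed by a ";" outside it, leaves
# a bare ";" in the generated command line, which bash rejects.
bad='true; ; true'      # ";" left behind after an empty expansion
good='true; true'       # ";" kept inside the conditional part

# Run each string in a child bash and record whether it was accepted.
if bash -c "$bad" 2>/dev/null; then bad_ok=yes; else bad_ok=no; fi
if bash -c "$good" 2>/dev/null; then good_ok=yes; else good_ok=no; fi

echo "bad accepted: $bad_ok, good accepted: $good_ok"
```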

14512 08/19/2014 11:47 AM Aaron Marcuse-Kubitza

inputs/publishable datasources.xlsx: updated

14492 08/18/2014 01:58 PM Aaron Marcuse-Kubitza

inputs/.TNRS/schema.sql: taxon_match: added taxon_scrub_best_match_jerry_lu index to facilitate finding names affected by the match-picking bug (#943)

14490 08/17/2014 07:49 AM Aaron Marcuse-Kubitza

fix: inputs/HVAA/Specimen/postprocess.sql, map.csv: monthCollected/dayCollected: fix indefinite dates (which aren't supported by Postgres), as decided by Bob (https://docs.google.com/spreadsheets/d/1PI8n0CRttN7ttsXs5qfh5OFFzSoAfJj0gSbylgX6vj4/edit#gid=0)

14468 08/11/2014 07:48 AM Aaron Marcuse-Kubitza

inputs/.TNRS/schema.sql: taxon_match: added taxon_scrub_by_name index

14467 08/11/2014 07:46 AM Aaron Marcuse-Kubitza

inputs/.TNRS/schema.sql: taxon_match: added taxon_scrub_by_family index

14466 08/11/2014 07:10 AM Aaron Marcuse-Kubitza

inputs/.TNRS/schema.sql: taxon_match: added taxon_scrub_by_species_binomial index

14454 08/11/2014 03:40 AM Aaron Marcuse-Kubitza

bugfix: mappings/VegCore-VegBIEN.csv: prefixed taxonomic ranks: use _concat_nullify() so that the prefix is only added if the epithet is non-NULL

14451 08/11/2014 02:16 AM Aaron Marcuse-Kubitza

bugfix: inputs/FIA/REF_RESEARCH_STATION/map.csv: mapped country, which is not provided in the FIA data

14448 08/10/2014 05:53 AM Aaron Marcuse-Kubitza

inputs/.TNRS/schema.sql: taxon_match: removed no longer used scrubbed_unique_taxon_name. the scrubbed name ranks are now generated from the other TNRS columns instead.

14436 08/09/2014 08:46 PM Aaron Marcuse-Kubitza

inputs/.TNRS/schema.sql: removed no longer used view ValidMatchedTaxon. use taxon_scrub instead.

14434 08/09/2014 08:42 PM Aaron Marcuse-Kubitza

inputs/.TNRS/schema.sql: taxon_scrub: use taxon_best_match directly, to avoid the need for a separate ValidMatchedTaxon view

14433 08/09/2014 08:25 PM Aaron Marcuse-Kubitza

fix: inputs/.TNRS/schema.sql: taxon_scrub: merged synonymous columns

14432 08/09/2014 08:11 PM Aaron Marcuse-Kubitza

schemas/vegbien.sql: taxon_scrub: documented steps to merge synonymous columns

14431 08/09/2014 07:45 PM Aaron Marcuse-Kubitza

inputs/.TNRS/schema.sql: removed no longer used view MatchedTaxon. use taxon_best_match instead.

14430 08/09/2014 07:43 PM Aaron Marcuse-Kubitza

inputs/.TNRS/schema.sql: ValidMatchedTaxon: use taxon_best_match now that it's equivalent to MatchedTaxon

14426 08/09/2014 07:19 PM Aaron Marcuse-Kubitza

fix: inputs/.TNRS/schema.sql: MatchedTaxon: merged synonymous columns

14424 08/09/2014 06:52 PM Aaron Marcuse-Kubitza

inputs/.TNRS/schema.sql: removed no longer used taxon_scrub.scrubbed_unique_taxon_name.* . use taxon_scrub instead.

14423 08/09/2014 06:50 PM Aaron Marcuse-Kubitza

inputs/.TNRS/schema.sql: taxon_scrub: use taxon_match derived columns instead of the incorrect values in taxon_scrub.scrubbed_unique_taxon_name.* (which does not work with the multi-match strategy)

14422 08/09/2014 05:54 AM Aaron Marcuse-Kubitza

inputs/.TNRS/schema.sql: MatchedTaxon: use derived columns from taxon_match. this also incorporates the fixes in the new derived columns.

14421 08/09/2014 05:27 AM Aaron Marcuse-Kubitza

inputs/.TNRS/schema.sql: taxon_scrub: use derived columns from taxon_match. this also incorporates the fixes in the new derived columns.

14417 08/05/2014 05:31 PM Aaron Marcuse-Kubitza

inputs/.TNRS/schema.sql: taxon_match: to port derived column changes to vegbiendev: derived_cols_export() code: documented runtime (6 h)

14416 08/04/2014 06:25 AM Aaron Marcuse-Kubitza

bugfix: inputs/.TNRS/schema.sql: removed no longer used derived column __accepted_infraspecific_label, which had a buggy formula that broke derived_cols_populate()

14414 08/04/2014 06:17 AM Aaron Marcuse-Kubitza

bugfix: inputs/.TNRS/schema.sql: removed no longer used derived column __accepted_infraspecific_label, which had a buggy formula that broke derived_cols_populate()

14413 08/04/2014 06:10 AM Aaron Marcuse-Kubitza

fix: inputs/.TNRS/schema.sql: taxon_match: to remove a column: updated instructions

14408 08/02/2014 07:58 PM Aaron Marcuse-Kubitza

**: updated to use the local machine's new hostname, frenzy

14388 07/28/2014 11:58 PM Aaron Marcuse-Kubitza

inputs/.TNRS/schema.sql: added new derived columns to derived views

14379 07/28/2014 04:45 PM Aaron Marcuse-Kubitza

fix: schemas/util.sql: derived_col_update(): also need steps to drop column, because DROP __ CASCADE doesn't work when there are dependent views

14378 07/28/2014 04:33 PM Aaron Marcuse-Kubitza

inputs/.TNRS/schema.sql: _accepted_infraspecific{rank,epithet}: use array slice of new _accepted{genus,specific_epithet,infra_{rank,epithet}}, which is simpler than using remove_prefix() in __accepted_infraspecific_label

14377 07/28/2014 04:21 PM Aaron Marcuse-Kubitza

inputs/.TNRS/schema.sql: "[accepted_]genus__@DwC__@vegpath.org": don't need to use *Accepted_name anymore because _accepted{genus,specific_epithet,infra_{rank,epithet}} is now generated from *Accepted_name

14376 07/28/2014 04:02 PM Aaron Marcuse-Kubitza

inputs/.TNRS/schema.sql: taxon_match."__accepted_{genus,specific_epithet}": renamed to "__accepted_{genus,specific_epithet,infra_{rank,epithet}}" since this now includes these other ranks as well

14375 07/28/2014 03:48 PM Aaron Marcuse-Kubitza

bugfix: inputs/.TNRS/schema.sql: taxon_match."__accepted_{genus,specific_epithet}": use "*Accepted_name" instead of "Accepted_species[_binomial]__@TNRS__@vegpath.org" (from "*Accepted_name_species") because Accepted_name_species apparently sometimes does not match the Accepted_name and uses malformed Unicode characters

14368 07/26/2014 10:16 PM Aaron Marcuse-Kubitza

inputs/.TNRS/schema.sql: taxon_match: `inputs/.TNRS/data.sql.run refresh`: documented runtime (1 min)

14367 07/26/2014 10:15 PM Aaron Marcuse-Kubitza

bugfix: inputs/.TNRS/schema.sql: taxon_match: use "Accepted_species[_binomial]__@TNRS__@vegpath.org" instead of "*Accepted_name_species". this fixes a bug in __accepted_infraspecific_label where Accepted_name_species with trailing whitespace could not be prefix-removed from names that contained just a species binomial.

14366 07/26/2014 09:57 PM Aaron Marcuse-Kubitza

fix: inputs/.TNRS/schema.sql: taxon_match: added derived column "Accepted_species[_binomial]__@TNRS__@vegpath.org", which removes trailing whitespace

14365 07/26/2014 08:58 PM Aaron Marcuse-Kubitza

inputs/.TNRS/schema.sql: added steps to remove a column and to add a non-derived column

14364 07/26/2014 08:48 PM Aaron Marcuse-Kubitza

inputs/.TNRS/schema.sql: taxon_match: to remove columns or add columns at the end: merged into "to add a new derived column"

14363 07/26/2014 08:44 PM Aaron Marcuse-Kubitza

inputs/.TNRS/schema.sql: to add columns in the middle: renamed to "to move a column to the middle" for clarity

14362 07/26/2014 08:38 PM Aaron Marcuse-Kubitza

inputs/.TNRS/schema.sql: to populate a new column: updated to use util.derived_col_update()

14361 07/26/2014 08:24 PM Aaron Marcuse-Kubitza

fix: inputs/.TNRS/schema.sql: taxon_match: to remove columns or add columns: also need to run util.recreate_view()

14360 07/26/2014 08:19 PM Aaron Marcuse-Kubitza

inputs/.TNRS/schema.sql: taxon_match: to remove columns or add columns at the end: don't need to run `rm=1 inputs/.TNRS/data.sql.run` because this is now run by `make schemas/remake`

14359 07/26/2014 08:13 PM Aaron Marcuse-Kubitza

inputs/.TNRS/schema.sql: added new derived columns to derived views

14358 07/26/2014 07:52 PM Aaron Marcuse-Kubitza

schemas/util.sql: remove_prefix(), remove_suffix(): support case-insensitive matching

14357 07/26/2014 07:43 PM Aaron Marcuse-Kubitza

bugfix: inputs/.TNRS/schema.sql: taxon_match.__accepted_infraspecific_label: need to use case-insensitive matching of the removed prefix because TNRS lowercases part of the Accepted_name
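
A rough shell analogue of case-insensitive prefix removal, to show the kind of mismatch described above. The real util.remove_prefix() is a SQL function whose signature is not shown in this log; `remove_prefix_ci` and `lower` are hypothetical names, the strings are made up, and returning the input unchanged on a mismatch is an assumption.

```shell
#!/usr/bin/env bash
# Hypothetical shell analogue of a case-insensitive prefix removal; the real
# util.remove_prefix() is a SQL function whose signature is not shown here.
# Assumption: if the prefix does not match, the string is returned unchanged.
lower() {
    printf '%s' "$1" | tr '[:upper:]' '[:lower:]'
}

remove_prefix_ci() {
    local prefix=$1 str=$2
    local head=${str:0:${#prefix}}
    if [ "$(lower "$head")" = "$(lower "$prefix")" ]; then
        printf '%s\n' "${str:${#prefix}}"
    else
        printf '%s\n' "$str"
    fi
}

# Made-up strings: the prefix differs from the start of the name only in case.
remove_prefix_ci 'Genus Species' 'Genus species var. example'
```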

14356 07/26/2014 07:40 PM Aaron Marcuse-Kubitza

schemas/util.sql: remove_prefix(), remove_suffix(): support case-insensitive matching

14347 07/26/2014 05:16 PM Aaron Marcuse-Kubitza

bugfix: inputs/.TNRS/schema.sql: taxon_match: use wrapper for util.remove_prefix() so CHECK constraints that use it don't get dropped when the util schema is reinstalled

14332 07/25/2014 02:21 PM Aaron Marcuse-Kubitza

inputs/.TNRS/schema.sql: taxon_match: COMMENT: added steps to port derived column changes to vegbiendev

14330 07/25/2014 12:41 PM Aaron Marcuse-Kubitza

bugfix: inputs/.TNRS/schema.sql: taxon_match: derived columns: use new "matched~Name[_no_author]___@TNRS__@vegpath.org" instead of "*Name_matched" so that "No suitable matches found." is removed before concatenating with other fields

14329 07/25/2014 12:14 PM Aaron Marcuse-Kubitza

inputs/.TNRS/schema.sql: taxon_match: added derived column "matched~Name[_no_author]___@TNRS__@vegpath.org", which removes the "No suitable matches found." string

14328 07/22/2014 06:50 AM Aaron Marcuse-Kubitza

inputs/.TNRS/schema.sql: reordered derived columns in dependency order

14327 07/22/2014 06:46 AM Aaron Marcuse-Kubitza

bugfix: inputs/.TNRS/schema.sql: "[accepted_]morphospecies[_binomial]__@Brad__.TNRS@vegpath.org": need to use "[accepted_]genus__@DwC__@vegpath.org" rather than "*Accepted_name" for this for rank = genus

14326 07/22/2014 06:44 AM Aaron Marcuse-Kubitza

inputs/.TNRS/schema.sql: taxon_match: added derived column "[scrubbed_]morphospecies[_binomial]__@Brad__.TNRS@vegpath.org"

14325 07/22/2014 06:41 AM Aaron Marcuse-Kubitza

bugfix: inputs/.TNRS/schema.sql: "[accepted_]genus__@DwC__@vegpath.org": need to populate this for rank = genus

14324 07/22/2014 05:47 AM Aaron Marcuse-Kubitza

inputs/.TNRS/schema.sql: taxon_match: added derived column "[scrubbed_]taxonomicStatus__@DwC__@vegpath.org"

14323 07/22/2014 05:30 AM Aaron Marcuse-Kubitza

bugfix: inputs/.TNRS/schema.sql: derived columns: use "Accepted_family__@TNRS__@vegpath.org" instead of "*Accepted_name_family" because "*Accepted_name_family" is sometimes missing

14318 07/22/2014 04:12 AM Aaron Marcuse-Kubitza

fix: inputs/.TNRS/schema.sql: taxon_match: added derived column "Accepted_family__@TNRS__@vegpath.org", which is needed because "*Accepted_name_family" isn't always populated

14317 07/22/2014 04:01 AM Aaron Marcuse-Kubitza

bugfix: inputs/.TNRS/schema.sql: taxon_match: to add columns in the middle: also need to run util.derived_cols_repopulate() since the dependency order has changed

14313 07/22/2014 03:25 AM Aaron Marcuse-Kubitza

fix: inputs/.TNRS/schema.sql: taxon_match: COMMENT: to add columns in the middle: also need to run util.derived_cols_update()