Project

General

Profile

Statistics
| Revision:
  • svn:ignore: .~*

# Date Author Comment
13591 06/02/2014 04:50 AM Aaron Marcuse-Kubitza

lib/tnrs.py: switched to downloading all matches per name, as is needed to implement #917. note that this will break the parts of the schema that use the tnrs table, until Brad's match-picking algorithm can be implemented, but this tradeoff is necessary to be able to begin scrubbing sooner (Martha; wiki.vegpath.org/2014-05-29_conference_call#TNRS)

13590 06/02/2014 04:35 AM Aaron Marcuse-Kubitza

schemas/vegbien.sql: tnrs_input_name: don't scrub accepted names, as using multiple matches per name no longer provides a single accepted name to scrub. instead, the Accepted_* fields can be whitespace-split to generate the same columns that would have been generated by the scrubbing (and without the overhead of the extra TNRS call).

13589 06/02/2014 04:27 AM Aaron Marcuse-Kubitza

fix: inputs/.TNRS/schema.sql: added back index on Name_submitted, which is needed for tnrs_input_name to work properly (now that there is no automatic index created by a unique constraint)

13587 06/02/2014 03:43 AM Aaron Marcuse-Kubitza

fix: inputs/.TNRS/schema.sql: tnrs: removed unique constraint on Name_submitted, Name_matched because there can be more than one match with the same Name_matched (but different accepted names, etc.)

13586 06/01/2014 09:00 PM Aaron Marcuse-Kubitza

fix: inputs/.TNRS/schema.sql: tnrs.tnrs__valid_match index: made it non-unique to allow multiple matches per name, as is needed to implement #917

13585 06/01/2014 05:00 AM Aaron Marcuse-Kubitza

bugfix: inputs/.TNRS/schema.sql: tnrs__match_num__fill(): only fill if not set, to support case where tnrs is being restored from a .sql file (where match_num is already set)

13584 06/01/2014 04:36 AM Aaron Marcuse-Kubitza

inputs/.TNRS/schema.sql: tnrs: documented runtime to add a constraint (3 min)

13583 06/01/2014 04:35 AM Aaron Marcuse-Kubitza

inputs/.TNRS/schema.sql: unique constraint on Name_submitted: added Name_matched to allow multiple matches per name, as is needed to implement #917

13582 06/01/2014 03:44 AM Aaron Marcuse-Kubitza

inputs/.TNRS/schema.sql: tnrs: documented how to populate a new column

13581 06/01/2014 03:41 AM Aaron Marcuse-Kubitza

inputs/.TNRS/schema.sql: tnrs: pkey: use match_num instead of Name_number to allow multiple matches per name, as is needed to implement #917

13580 05/31/2014 10:31 PM Aaron Marcuse-Kubitza

inputs/.TNRS/schema.sql: tnrs.match_num: made it NOT NULL now that it's populated

13579 05/31/2014 10:28 PM Aaron Marcuse-Kubitza

inputs/.TNRS/schema.sql: tnrs: populate match_num

13578 05/31/2014 10:25 PM Aaron Marcuse-Kubitza

inputs/.TNRS/schema.sql: tnrs: populate match_num

13577 05/31/2014 09:50 PM Aaron Marcuse-Kubitza

inputs/.TNRS/schema.sql: tnrs: documented how to add and remove columns

13575 05/31/2014 08:58 PM Aaron Marcuse-Kubitza

inputs/.TNRS/schema.sql: made COMMENTs start on their own line, using the steps at wiki.vegpath.org/Postgres_queries#make-COMMENTs-start-on-their-own-line

13574 05/31/2014 08:11 PM Aaron Marcuse-Kubitza

inputs/test_taxonomic_names/_scrub/*: updated to TNRS schema

13573 05/31/2014 08:10 PM Aaron Marcuse-Kubitza

inputs/.TNRS/schema.sql: tnrs: added match_num

13572 05/31/2014 08:06 PM Aaron Marcuse-Kubitza

inputs/.TNRS/data.sql.run: refresh(): documented runtime (1 min)

13570 05/31/2014 06:44 PM Aaron Marcuse-Kubitza

inputs/.TNRS/schema.sql: added tnrs__match_num__next()

13567 05/30/2014 06:34 PM Aaron Marcuse-Kubitza

inputs/.TNRS/schema.sql: added tnrs__batch_begin() trigger to populate the match_num (match sort order)

13540 05/27/2014 10:13 PM Aaron Marcuse-Kubitza

inputs/.TNRS/schema.sql: taxon_scrub.scrubbed_unique_taxon_name.*: added scrubbed_taxon_name_with_author, needed by Jeff Ott's analysis (wiki.vegpath.org/Data_requests)

13533 05/27/2014 12:28 AM Aaron Marcuse-Kubitza

inputs/.TNRS/schema.sql: taxon_scrub: added scrubbed_morphospecies_binomial, analogous to accepted_morphospecies_binomial for scrubbed_*

13532 05/27/2014 12:13 AM Aaron Marcuse-Kubitza

inputs/.TNRS/schema.sql: taxon_scrub: added scrubbed_morphospecies_binomial, analogous to accepted_morphospecies_binomial for scrubbed_*

13531 05/26/2014 11:54 PM Aaron Marcuse-Kubitza

inputs/.TNRS/schema.sql: taxon_scrub: documented how to modify it

13528 05/26/2014 11:20 PM Aaron Marcuse-Kubitza

inputs/.TNRS/schema.sql: added taxon_scrub_modify()

13527 05/23/2014 06:17 PM Aaron Marcuse-Kubitza

inputs/.TNRS/schema.sql: MatchedTaxon_modify(): use simpler util.recreate_view()

13526 05/23/2014 06:15 PM Aaron Marcuse-Kubitza

inputs/.TNRS/schema.sql: MatchedTaxon_modify(): documented usage

13518 05/21/2014 07:30 PM Aaron Marcuse-Kubitza

inputs/.TNRS/schema.sql: MatchedTaxon_modify(): removed no longer needed DROP VIEW statement

13512 05/21/2014 05:50 PM Aaron Marcuse-Kubitza

fix: schemas/util.sql: force_recreate(): renamed to just recreate(), because "force" normally implies that things will be deleted, which this function does not do

13508 05/21/2014 04:25 PM Aaron Marcuse-Kubitza

fix: inputs/.TNRS/schema.sql: MatchedTaxon.taxonomicStatus: filter using map_taxonomic_status() so that the corrected value is available in the normalized DB, not just analytical_stem

13507 05/21/2014 04:05 PM Aaron Marcuse-Kubitza

inputs/.TNRS/schema.sql: MatchedTaxon: to modify: use new MatchedTaxon_modify(), which eliminates the work of putting together the dependent views

13506 05/21/2014 03:53 PM Aaron Marcuse-Kubitza

inputs/.TNRS/schema.sql: added MatchedTaxon_modify()

13503 05/21/2014 04:13 AM Aaron Marcuse-Kubitza

bugfix: inputs/.TNRS/schema.sql: map_taxonomic_status(): need to use accepted name instead of scrubbed name (which also includes no-opinion names), as described at http://wiki.vegpath.org/2013-11-14_conference_call#taxonomic-fields. this used to be the accepted name, but got switched when the concatenated name was also used to store the matched name for no-opinion names.

13501 05/21/2014 01:27 AM Aaron Marcuse-Kubitza

inputs/.TNRS/schema.sql: MatchedTaxon: documented how to modify it (using util.force_recreate())

13498 05/20/2014 05:46 PM Aaron Marcuse-Kubitza

inputs/.TNRS/schema.sql: MatchedTaxon, etc.: added accepted_morphospecies_binomial derived field

13444 05/13/2014 04:50 AM Aaron Marcuse-Kubitza

inputs/.TNRS/schema.sql: MatchedTaxon.Accepted_name_species: mapped to accepted_species_binomial

13443 05/13/2014 04:09 AM Aaron Marcuse-Kubitza

fix: inputs/.TNRS/schema.sql: COMMENTs: always include newline before and after

13441 05/13/2014 03:46 AM Aaron Marcuse-Kubitza

bugfix: inputs/.TNRS/schema.sql: taxon_scrub, etc.: undid rename of accepted name columns to scrubbed_* (r13435), because these are actually not the same (scrubbed_* is the combination of accepted and no-opinion names). the accepted name columns will now be named accepted_*, following the standard naming scheme.

13439 05/13/2014 03:13 AM Aaron Marcuse-Kubitza

fix: inputs/.TNRS/schema.sql: taxon_scrub, etc.: scrubbed_*: use columns from MatchedTaxon whenever possible, to as much as possible avoid the need to join to taxon_scrub.scrubbed_unique_taxon_name.*

13437 05/13/2014 02:29 AM Aaron Marcuse-Kubitza

bugfix: inputs/.TNRS/grants.sql: added GRANT statements from schema.sql because these aren't run by `make inputs/.TNRS/reinstall`

13418 05/07/2014 07:17 PM Aaron Marcuse-Kubitza

bugfix: inputs/input.Makefile: $(datasrc_schema_exists): need to use $(datasrc), not $(schema), as $schema is only what this var is called in the runscripts

13417 05/07/2014 06:48 PM Aaron Marcuse-Kubitza

bugfix: inputs/analytical_db/: need dummy table.run file to cause a schema to be created for this datasource

13416 05/07/2014 06:44 PM Aaron Marcuse-Kubitza

fix: inputs/input.Makefile: $(sortFile): don't print the "add any missing tables to $(sortFile)" message every time the Makefile is run

13415 05/07/2014 06:44 PM Aaron Marcuse-Kubitza

bugfix: inputs/input.Makefile: install: only run this for datasource dirs

13414 05/07/2014 05:18 PM Aaron Marcuse-Kubitza

inputs/input.Makefile: install: use ./run's install target for clarity

13412 05/07/2014 04:56 PM Aaron Marcuse-Kubitza

bugfix: inputs/input.Makefile: install: made it idempotent (using new $(datasrc_schema_exists)) so that it could be run by `make install` on an existing system

13411 05/07/2014 04:02 PM Aaron Marcuse-Kubitza

bugfix: inputs/input.Makefile: $(datasrc_schema_exists): need to use $(shell ...)

13410 05/07/2014 03:31 PM Aaron Marcuse-Kubitza

inputs/input.Makefile: added $(datasrc_schema_exists)

13402 05/03/2014 02:03 PM Aaron Marcuse-Kubitza

added inputs/VegBank/verify/outputBien.log.url

13401 05/03/2014 02:03 PM Aaron Marcuse-Kubitza

inputs/input.Makefile: add: verify/: also svn:ignore *.log

13375 05/01/2014 01:58 PM Aaron Marcuse-Kubitza

bugfix: inputs/input.Makefile: %/postprocess: invoke runscript if it exists

13374 05/01/2014 01:37 PM Aaron Marcuse-Kubitza

lib/runscripts/validations.pg.sql.run: export_(): make the export idempotent for easier re-runnability

13372 05/01/2014 01:29 PM Aaron Marcuse-Kubitza

fix: lib/runscripts/file.pg.sql.run: removed include of in_datasrc_dir.run, because this location does not apply to all .sql export scripts

13371 05/01/2014 01:15 PM Aaron Marcuse-Kubitza

bugfix: inputs/input.Makefile: validations.sql must be in a subdir so it won't get run by sql/install

13370 05/01/2014 01:11 PM Aaron Marcuse-Kubitza

bugfix: inputs/input.Makefile: validations.sql must be in a subdir so it won't get run by sql/install

13369 05/01/2014 05:20 AM Aaron Marcuse-Kubitza

inputs/input.Makefile: install: also run validate/install

13368 05/01/2014 04:44 AM Aaron Marcuse-Kubitza

inputs/input.Makefile: added validate/install

13367 05/01/2014 04:09 AM Aaron Marcuse-Kubitza

lib/runscripts/validations.pg.sql.run: export_(): make the export idempotent for easier re-runnability

13366 05/01/2014 03:22 AM Aaron Marcuse-Kubitza

bugfix: inputs/SALVIAS/validations.sql: need to cast character varying to text so that the types of each side of if() match

13357 04/30/2014 05:46 PM Aaron Marcuse-Kubitza

bugfix: **/postprocess.sql: don't use the public schema, because this creates an unsatisfied dependency while the database is being installed, and breaks `make install`

13316 04/24/2014 05:29 PM Aaron Marcuse-Kubitza

inputs/GBIF/_MySQL/.rsync_ignore: added GBIFPortalDB-*.data.sql.gz, because these are intermediate files

13195 04/19/2014 10:14 PM Aaron Marcuse-Kubitza

inputs/Madidi/_src/: set svn:ignore

13164 04/17/2014 08:21 PM Aaron Marcuse-Kubitza

fix: inputs/SALVIAS/projects/postprocess.sql: remove private data that should not be publicly visible: preserve datasets with ipr_specific = '', because they are actually redistributable, according to Brad (http://wiki.vegpath.org/2014-04-17_conference_call#conditions-of-use)

13152 04/16/2014 10:49 PM Aaron Marcuse-Kubitza

bugfix: inputs/NY/validations.sql: _specimens_07_list_of_verbatim_subspecific_taxa_with_author: updated filter condition to match output query

13151 04/16/2014 10:48 PM Aaron Marcuse-Kubitza

inputs/NY/run: `make inputs/NY/validate`: updated runtime (8 min, with added queries)

13150 04/16/2014 10:24 PM Aaron Marcuse-Kubitza

fix: inputs/NY/Ecatalog_all/map.csv, postprocess.sql: remapped substrate, vegetation to locationRemarks

13149 04/16/2014 06:41 PM Aaron Marcuse-Kubitza

fix: inputs/NY/Ecatalog_all/map.csv, postprocess.sql: remapped substrate, vegetation to locationRemarks

13147 04/16/2014 04:24 PM Aaron Marcuse-Kubitza

bugfix: inputs/NY/validations.sql, schemas/vegbien.sql: _specimens_13*: also need to include coordinate pairs which have one of their coordinates NULL, by using OR instead of AND

13146 04/16/2014 04:15 PM Aaron Marcuse-Kubitza

bugfix: inputs/NY/validations.sql: _specimens_13b_list_of_all_decimal_lat_long: matched column types to output query

13145 04/16/2014 04:14 PM Aaron Marcuse-Kubitza

bugfix: inputs/NY/validations.sql: _specimens_13a_list_of_all_verbatim_lat_long: matched column types to output query

13144 04/16/2014 03:13 PM Aaron Marcuse-Kubitza

inputs/NY/validations.sql, schemas/vegbien.sql: _specimens_13_count_of_all_verbatim_and_decimal_lat_long: added breakdowns _specimens_13a_list_of_all_verbatim_lat_long, _specimens_13b_list_of_all_decimal_lat_long to help troubleshoot the diff

13143 04/16/2014 02:04 PM Aaron Marcuse-Kubitza

fix: inputs/NY/validations.sql, schemas/vegbien.sql: _specimens_13_count_of_all_verbatim_and_decimal_lat_long: count lat/longs together instead of separately, because the DISTINCT is by coordinate pair, not individual coordinate value (which wouldn't make much sense)

13138 04/15/2014 06:52 PM Aaron Marcuse-Kubitza

fix: inputs/NY/validations.sql: _specimens_13_count_of_all_verbatim_and_decimal_lat_long: use new is_castable(), which is much more accurate than Brad's custom regexp for determining if something is numeric

13137 04/15/2014 06:29 PM Aaron Marcuse-Kubitza

inputs/NY/validations.-.util.sql: added util.is_castable() wrapper

13130 04/14/2014 04:51 PM Aaron Marcuse-Kubitza

bugfix: inputs/NY/validations.sql: _specimens_13_count_of_all_verbatim_and_decimal_lat_long: need to include both lat and long in the value to DISTINCT on

13129 04/14/2014 04:48 PM Aaron Marcuse-Kubitza

fix: inputs/NY/validations.sql: _specimens_13_count_of_all_verbatim_and_decimal_lat_long: need to DISTINCT the values that are being counted, because they are merged by the coordinates_unique unique constraint in the import

13126 04/14/2014 03:58 PM Aaron Marcuse-Kubitza

inputs/NY/run: `make inputs/NY/validate`: documented slow queries: _specimens_12_distinct_collector_name_collect_num_date_w_count

13125 04/14/2014 03:23 PM Aaron Marcuse-Kubitza

inputs/SALVIAS/run_: `make inputs/SALVIAS/validate`: documented slow queries (_plots_06a_list_of_stems). these may need to have their query plans rechecked.

13124 04/14/2014 03:22 PM Aaron Marcuse-Kubitza

inputs/NY/run, inputs/SALVIAS/run_: `make inputs/.../validate`: updated runtime (+2 min)

13123 04/10/2014 04:06 PM Aaron Marcuse-Kubitza

fix: inputs/NY/validations.sql: specimens*_of_unique_verbatim_author_taxa_with_genus: use scientificName rather than the concatenated ranks, because that is what is imported to taxonlabel.taxonomicname

13115 04/10/2014 02:24 PM Aaron Marcuse-Kubitza

mappings/VegCore-VegBIEN.csv: mapped subspecies to new taxonverbatim.subspecies for easier access by validations queries

13113 04/10/2014 01:25 PM Aaron Marcuse-Kubitza

fix: inputs/test_taxonomic_names/Taxon/map.csv: scientificName: remapped to scientificName instead of taxonName as this does include the author for some names

13112 04/10/2014 01:25 PM Aaron Marcuse-Kubitza

fix: inputs/NY/Ecatalog_all/map.csv: ScientificName: remapped to scientificName instead of taxonName as this does include the author

13111 04/10/2014 01:17 PM Aaron Marcuse-Kubitza

fix: inputs/NY/validations.sql: specimens*_of_unique_verb_subsp_taxa_with_author: use taxonName instead of concatenating the ranks, as that corresponds to what we use as the concatenated taxonomic name

13110 04/10/2014 12:59 PM Aaron Marcuse-Kubitza

bugfix: inputs/NY/validations.sql: specimens*_of_verbatim_subspecific_taxa_with_author: need `subspecies IS NOT NULL` filter

13109 04/10/2014 12:57 PM Aaron Marcuse-Kubitza

bugfix: inputs/NY/validations.sql: _specimens_07_list_of_verbatim_subspecific_taxa_with_author: need to include subspecies (as _specimens_06_count_of_unique_verb_subsp_taxa_with_author does)

13107 04/10/2014 12:03 PM Aaron Marcuse-Kubitza

bugfix: inputs/NY/validations.sql: specimens*_of_species_binomials: removed incorrect `subspecies IS NOT NULL` filter (this should be on *_of_unique_verb_subsp_taxa_with_author instead)

13095 04/10/2014 03:45 AM Aaron Marcuse-Kubitza

fix: inputs/NY/validations.sql: _specimens_16_list_distinct_specimen_descriptions: removed duplicated rows using DISTINCT

13089 04/10/2014 02:34 AM Aaron Marcuse-Kubitza

bugfix: inputs/NY/validations.sql: _specimens_03_list_of_verbatim_families: use family as specified in query description, not as implemented

13087 04/10/2014 02:07 AM Aaron Marcuse-Kubitza

bugfix: schemas/vegbien.sql, inputs/NY/validations.sql, validation/aggregating/specimens/qualitative_validations_specimens.sql: _specimens_12_distinct_collector_name_collect_num_date_w_count: dateCollected: cast this to text rather than date because some values for this field are not valid dates and will throw an error if cast to date

13086 04/09/2014 08:19 PM Aaron Marcuse-Kubitza

fix: inputs/NY/validations.sql: _specimens_12_distinct_collector_name_collect_num_date_w_count: dateCollected: matched type to output query

13075 04/08/2014 03:49 PM Aaron Marcuse-Kubitza

fix: inputs/U/Specimen/map.csv: Genus: remapped to taxonName because this field is actually mislabeled in the original column names

13070 04/08/2014 01:40 PM Aaron Marcuse-Kubitza

inputs/NY/run: `make inputs/NY/validate`: updated runtime (6.5 min). this increases as more queries are able to run successfully.

13068 04/08/2014 01:19 PM Aaron Marcuse-Kubitza

inputs/SALVIAS/run_: `make inputs/SALVIAS/validate`: documented runtime (5 min)

13067 04/08/2014 12:49 PM Aaron Marcuse-Kubitza

inputs/bien2_traits/run: documented `make inputs/bien2_traits/validate` runtime (9 min)

13065 04/07/2014 06:19 PM Aaron Marcuse-Kubitza

inputs/NY/run: `make inputs/NY/validate`: updated runtime (5 min)

13056 04/07/2014 09:47 AM Aaron Marcuse-Kubitza

inputs/NY/run: documented `make inputs/NY/validate` runtime (2 min, currently for the input queries)

13055 04/04/2014 06:13 PM Aaron Marcuse-Kubitza

added inputs/Madidi/_src/ to match wiki steps in wiki.vegpath.org/Adding_a_flat-file_datasource

13042 04/02/2014 05:21 PM Aaron Marcuse-Kubitza

validation/aggregating/specimens/qualitative_validations_specimens.sql, NY/qualitative_validations_source_db_NYBG.VegCore.sql, inputs/NY/validations.sql: *_of_species_binomials: renamed columns to species_binomial to reflect reverted query name

13041 04/02/2014 05:16 PM Aaron Marcuse-Kubitza

validation/aggregating/specimens/qualitative_validations_specimens.sql, NY/qualitative_validations_source_db_NYBG.VegCore.sql, inputs/NY/validations.sql: *_of_verbatim_species_excluding_author: renamed to *_species_binomials for clarity