Project

General

Profile

# Date Author Comment
14448 08/10/2014 05:53 AM Aaron Marcuse-Kubitza

inputs/.TNRS/schema.sql: taxon_match: removed no longer used scrubbed_unique_taxon_name. the scrubbed name ranks are now generated from the other TNRS columns instead.

14414 08/04/2014 06:17 AM Aaron Marcuse-Kubitza

bugfix: inputs/.TNRS/schema.sql: removed no longer used derived column __accepted_infraspecific_label, which had a buggy formula that broke derived_cols_populate()

14375 07/28/2014 03:48 PM Aaron Marcuse-Kubitza

bugfix: inputs/.TNRS/schema.sql: taxon_match."__accepted_{genus,specific_epithet}": use "*Accepted_name" instead of "Accepted_species[_binomial]__@TNRS__@vegpath.org" (from "*Accepted_name_species") because Accepted_name_species apparently sometimes does not match the Accepted_name and uses malformed Unicode characters

14366 07/26/2014 09:57 PM Aaron Marcuse-Kubitza

fix: inputs/.TNRS/schema.sql: taxon_match: added derived column "Accepted_species[_binomial]__@TNRS__@vegpath.org", which removes trailing whitespace

14330 07/25/2014 12:41 PM Aaron Marcuse-Kubitza

bugfix: inputs/.TNRS/schema.sql: taxon_match: derived columns: use new "matched~Name[_no_author]___@TNRS__@vegpath.org" instead of "*Name_matched" so that "No suitable matches found." is removed before concatenating with other fields

14329 07/25/2014 12:14 PM Aaron Marcuse-Kubitza

inputs/.TNRS/schema.sql: taxon_match: added derived column "matched~Name[_no_author]___@TNRS__@vegpath.org", which removes the "No suitable matches found." string

14328 07/22/2014 06:50 AM Aaron Marcuse-Kubitza

inputs/.TNRS/schema.sql: reordered derived columns in dependency order

14326 07/22/2014 06:44 AM Aaron Marcuse-Kubitza

inputs/.TNRS/schema.sql: taxon_match: added derived column "[scrubbed_]morphospecies[_binomial]__@Brad__.TNRS@vegpath.org"

14325 07/22/2014 06:41 AM Aaron Marcuse-Kubitza

bugfix: inputs/.TNRS/schema.sql: "[accepted_]genus__@DwC__@vegpath.org": need to populate this for rank = genus

14324 07/22/2014 05:47 AM Aaron Marcuse-Kubitza

inputs/.TNRS/schema.sql: taxon_match: added derived column "[scrubbed_]taxonomicStatus__@DwC__@vegpath.org"

14323 07/22/2014 05:30 AM Aaron Marcuse-Kubitza

bugfix: inputs/.TNRS/schema.sql: derived columns: use "Accepted_family__@TNRS__@vegpath.org" instead of "*Accepted_name_family" because "*Accepted_name_family" is sometimes missing

14318 07/22/2014 04:12 AM Aaron Marcuse-Kubitza

fix: inputs/.TNRS/schema.sql: taxon_match: added derived column "Accepted_family__@TNRS__@vegpath.org", which is needed because "*Accepted_name_family" isn't always populated

14310 07/22/2014 03:12 AM Aaron Marcuse-Kubitza

inputs/.TNRS/schema.sql: reordered derived columns in dependency order

14304 07/22/2014 01:48 AM Aaron Marcuse-Kubitza

inputs/.TNRS/schema.sql: taxon_match: added derived column "[accepted_]morphospecies[_binomial]__@Brad__.TNRS@vegpath.org"

14303 07/22/2014 01:37 AM Aaron Marcuse-Kubitza

inputs/.TNRS/schema.sql: taxon_match: added derived column "[parsed_]morphospecies[_suffix]__@Brad__.morphosp@vegpath.org"

14281 07/21/2014 05:13 PM Aaron Marcuse-Kubitza

inputs/.TNRS/schema.sql: taxon_match: added derived columns scrubbed_*

14279 07/21/2014 04:34 PM Aaron Marcuse-Kubitza

inputs/.TNRS/schema.sql: taxon_match: put matched-taxon derived columns before accepted-taxon derived columns

14278 07/21/2014 04:25 PM Aaron Marcuse-Kubitza

inputs/.TNRS/schema.sql: taxon_match: added derived column "[matched_]species[_binomial]~(Accepted_)__@TNRS__@vegpath.org"

14269 07/21/2014 09:02 AM Aaron Marcuse-Kubitza

inputs/.TNRS/schema.sql: taxon_match: added derived column _matched_has_accepted

14267 07/21/2014 08:44 AM Aaron Marcuse-Kubitza

inputs/.TNRS/schema.sql: taxon_match: added derived columns "[matched_]scientificName[_with_author]__@DwC__@vegpath.org", "[accepted_]scientificName[_with_author]__@DwC__@vegpath.org"

14266 07/21/2014 08:29 AM Aaron Marcuse-Kubitza

inputs/.TNRS/schema.sql: taxon_match: added derived columns "[accepted_]Infraspecific_rank[_abbr]__@TNRS__@vegpath.org", "[accepted_]infraspecificEpithet__@DwC__@vegpath.org"

14265 07/21/2014 08:26 AM Aaron Marcuse-Kubitza

inputs/.TNRS/schema.sql: taxon_match: added derived column "__accepted_infraspecific_{rank,epithet}"

14264 07/21/2014 08:19 AM Aaron Marcuse-Kubitza

inputs/.TNRS/schema.sql: taxon_match: added derived column __accepted_infraspecific_label

14257 07/21/2014 06:54 AM Aaron Marcuse-Kubitza

inputs/.TNRS/schema.sql: taxon_match: added derived columns "[accepted_]genus__@DwC__@vegpath.org", "[accepted_]specificEpithet__@DwC__@vegpath.org"

14125 07/18/2014 11:40 PM Aaron Marcuse-Kubitza

fix: inputs/.TNRS/schema.sql: taxon_match: cluster the table so the rows are always exported in the same order, even when an UPDATE statement is used to run triggers

14122 07/18/2014 03:18 PM Aaron Marcuse-Kubitza

inputs/.TNRS/schema.sql: added _accepted{genus,specific_epithet} derived column, using the steps at wiki.vegpath.org/Postgres_queries#maintaining-derived-column-formulas-across-column-renames

13870 06/26/2014 04:58 AM Aaron Marcuse-Kubitza

inputs/.TNRS/data.sql: refreshed

13867 06/26/2014 04:28 AM Aaron Marcuse-Kubitza

inputs/.TNRS/data.sql: refreshed

13863 06/26/2014 02:39 AM Aaron Marcuse-Kubitza

inputs/.TNRS/schema.sql: tnrs_match: renamed to taxon_match to use the normalized VegCore name for this, and to avoid repeating the schema name

13853 06/25/2014 04:28 PM Aaron Marcuse-Kubitza

inputs/.TNRS/data.sql: refreshed

13850 06/25/2014 03:33 PM Aaron Marcuse-Kubitza

inputs/.TNRS/schema.sql: tnrs: renamed to tnrs_match to distinguish it from other TNRS-related tables

13591 06/02/2014 04:50 AM Aaron Marcuse-Kubitza

lib/tnrs.py: switched to downloading all matches per name, as is needed to implement #917. note that this will break the parts of the schema that use the tnrs table, until Brad's match-picking algorithm can be implemented, but this tradeoff is necessary to be able to begin scrubbing sooner (Martha; wiki.vegpath.org/2014-05-29_conference_call#TNRS)

13590 06/02/2014 04:35 AM Aaron Marcuse-Kubitza

schemas/vegbien.sql: tnrs_input_name: don't scrub accepted names, as using multiple matches per name no longer provides a single accepted name to scrub. instead, the Accepted_* fields can be whitespace-split to generate the same columns that would have been generated by the scrubbing (and without the overhead of the extra TNRS call).

13579 05/31/2014 10:28 PM Aaron Marcuse-Kubitza

inputs/.TNRS/schema.sql: tnrs: populate match_num

13573 05/31/2014 08:10 PM Aaron Marcuse-Kubitza

inputs/.TNRS/schema.sql: tnrs: added match_num

11970 01/20/2014 11:33 AM Aaron Marcuse-Kubitza

moved everything into /trunk/ to create the standard svn layout, for use with tools that require this (eg. git-svn). IMPORTANT: do NOT do an `svn up`. instead, re-use your working copy's existing files with `svn switch` (http://svnbook.red-bean.com/en/1.6/svn.ref.svn.c.switch.html).

11708 11/21/2013 08:48 AM Aaron Marcuse-Kubitza

inputs/.TNRS/schema.sql, data.sql: updated for PostgreSQL 9.3

11643 11/10/2013 07:02 PM Aaron Marcuse-Kubitza

inputs/.TNRS/schema.sql: tnrs: removed no longer used Accepted_scientific_name. use scrubbed_unique_taxon_name instead.

11630 11/10/2013 01:23 PM Aaron Marcuse-Kubitza

inputs/.TNRS/schema.sql: tnrs: removed Is_homonym, since this did not take into account the never_homonym status (when the author disambiguates) or the ability of a non-homonym at a lower rank to override a homonym at a higher rank. taking these into account just produces the value of is_valid_match.

11629 11/10/2013 01:19 PM Aaron Marcuse-Kubitza

inputs/.TNRS/schema.sql: tnrs: removed Is_plant, since this functionality is now provided by is_valid_match. note that whether a name is a plant is not meaningful for TNRS, because it can match only plant names (thus a "non-plant" is actually a non-match).

11628 11/10/2013 01:06 PM Aaron Marcuse-Kubitza

inputs/.TNRS/schema.sql: tnrs: added scrubbed_taxon_name_with_author derived column, which uses the matched name when an accepted name is not available

11627 11/10/2013 09:44 AM Aaron Marcuse-Kubitza

inputs/.TNRS/schema.sql: tnrs: removed no longer used Max_score. use is_valid_match to determine validity instead.

11626 11/10/2013 12:09 AM Aaron Marcuse-Kubitza

bugfix: lib/runscripts/file.pg.sql.run: export_(): exclude Source and related tables so that these will be re-created by the staging tables installation instead, ensuring that they are always in sync with the Source/ subdir

11625 11/10/2013 12:08 AM Aaron Marcuse-Kubitza

inputs/.TNRS/data.sql: updated for new derived columns

10786 08/27/2013 10:14 PM Aaron Marcuse-Kubitza

inputs/.TNRS/data.sql: re-ran TNRS using `inputs/test_taxonomic_names/test_scrub; rm=1 inputs/.TNRS/data.sql.run export_`

10783 08/27/2013 09:53 PM Aaron Marcuse-Kubitza

inputs/.TNRS/data.sql: generate from the DB using `rm=1 inputs/.TNRS/data.sql.run export_` instead of being a hand-edited file

10746 08/27/2013 12:25 PM Aaron Marcuse-Kubitza

inputs/.TNRS/schema.sql: tnrs.Time_submitted: renamed to batch and added fkey to batch.id. this requires including the batch table in inputs/.TNRS/data.sql, so that the fkey is satisfied (batch entries are already added by bin/tnrs_db.

9972 06/20/2013 08:03 AM Aaron Marcuse-Kubitza

inputs/.TNRS/schema.sql: tnrs: added Is_homonym derived col (uses IRMNG.family_homonym_epithet, genus_homonym_epithet)

9762 06/06/2013 02:42 PM Aaron Marcuse-Kubitza

inputs/.TNRS/schema.sql: updated for current TSV schema: renamed Accepted_species->Accepted_name_species, Accepted_family->Accepted_name_family

9529 05/23/2013 03:33 PM Aaron Marcuse-Kubitza

inputs/.TNRS/schema.sql, data.sql: updated TNRS CSV columns to preserve Name_matched_accepted_family even though it isn't present in the current TNRS CSVs. this way, Name_matched_accepted_family can still be used for previously-scrubbed names, and family_matched can be added back to analytical_stem_view. (now that bin/tnrs_db uses an explicit columns list in COPY TO, the absence of a column in the CSV is no longer a problem.)

9493 05/21/2013 07:37 PM Aaron Marcuse-Kubitza

inputs/.TNRS/schema.sql, data.sql: updated for new TNRS CSV columns (see bug at https://pods.iplantcollaborative.org/jira/browse/TNRS-183). note that these columns may eventually change back (comment by Naim at https://pods.iplantcollaborative.org/jira/browse/TNRS-183#comment-34444).

7855 03/06/2013 02:18 PM Aaron Marcuse-Kubitza

inputs/.TNRS/data.sql: Re-ran TNRS on the test_taxonomic_names so that the sample data would contain all the accepted names under the current acceptedScientificName formula, and would not produce any entries for unscrubbed names in tnrs_input_name

7293 01/18/2013 07:38 AM Aaron Marcuse-Kubitza

inputs/.TNRS/schema.sql: tnrs: Added Max_score column for use in filtering out names that will be rejected by taxondetermination's constraints

7133 01/09/2013 09:57 AM Aaron Marcuse-Kubitza

inputs/.TNRS/schema.sql: tnrs: Added Accepted_scientific_name field which will contain the joined-together accepted name that gets re-parsed by TNRS

5811 10/26/2012 04:34 PM Aaron Marcuse-Kubitza

inputs/.TNRS/: Added data.sql containing the test_taxonomic_names TNRS results, so that a new installation of VegBIEN will contain the necessary data to make the tests pass, including the TNRS import test