/trunk/inputs/.TNRS/data.sql - Changes - BIEN 3 - NCEAS Projects

root/trunk/inputs/.TNRS/data.sql @ 14909

#	Date	Author	Comment
14448	08/10/2014 05:53 AM	Aaron Marcuse-Kubitza	inputs/.TNRS/schema.sql: taxon_match: removed no longer used scrubbed_unique_taxon_name. the scrubbed name ranks are now generated from the other TNRS columns instead.
14414	08/04/2014 06:17 AM	Aaron Marcuse-Kubitza	bugfix: inputs/.TNRS/schema.sql: removed no longer used derived column __accepted_infraspecific_label, which had a buggy formula that broke derived_cols_populate()
14375	07/28/2014 03:48 PM	Aaron Marcuse-Kubitza	bugfix: inputs/.TNRS/schema.sql: taxon_match."__accepted_{genus,specific_epithet}": use "Accepted_name" instead of "Accepted_species[_binomial]__@TNRS__@vegpath.org" (from "Accepted_name_species") because Accepted_name_species apparently sometimes does not match the Accepted_name and uses malformed Unicode characters
14366	07/26/2014 09:57 PM	Aaron Marcuse-Kubitza	fix: inputs/.TNRS/schema.sql: taxon_match: added derived column "Accepted_species[_binomial]__@TNRS__@vegpath.org", which removes trailing whitespace
14330	07/25/2014 12:41 PM	Aaron Marcuse-Kubitza	bugfix: inputs/.TNRS/schema.sql: taxon_match: derived columns: use new "matched~Name[_no_author]___@TNRS__@vegpath.org" instead of "*Name_matched" so that "No suitable matches found." is removed before concatenating with other fields
14329	07/25/2014 12:14 PM	Aaron Marcuse-Kubitza	inputs/.TNRS/schema.sql: taxon_match: added derived column "matched~Name[_no_author]___@TNRS__@vegpath.org", which removes the "No suitable matches found." string
14328	07/22/2014 06:50 AM	Aaron Marcuse-Kubitza	inputs/.TNRS/schema.sql: reordered derived columns in dependency order
14326	07/22/2014 06:44 AM	Aaron Marcuse-Kubitza	inputs/.TNRS/schema.sql: taxon_match: added derived column "[scrubbed_]morphospecies[_binomial]__@Brad__.TNRS@vegpath.org"
14325	07/22/2014 06:41 AM	Aaron Marcuse-Kubitza	bugfix: inputs/.TNRS/schema.sql: "[accepted_]genus__@DwC__@vegpath.org": need to populate this for rank = genus
14324	07/22/2014 05:47 AM	Aaron Marcuse-Kubitza	inputs/.TNRS/schema.sql: taxon_match: added derived column "[scrubbed_]taxonomicStatus__@DwC__@vegpath.org"
14323	07/22/2014 05:30 AM	Aaron Marcuse-Kubitza	bugfix: inputs/.TNRS/schema.sql: derived columns: use "Accepted_family__@TNRS__@vegpath.org" instead of "Accepted_name_family" because "Accepted_name_family" is sometimes missing
14318	07/22/2014 04:12 AM	Aaron Marcuse-Kubitza	fix: inputs/.TNRS/schema.sql: taxon_match: added derived column "Accepted_family__@TNRS__@vegpath.org", which is needed because "*Accepted_name_family" isn't always populated
14310	07/22/2014 03:12 AM	Aaron Marcuse-Kubitza	inputs/.TNRS/schema.sql: reordered derived columns in dependency order
14304	07/22/2014 01:48 AM	Aaron Marcuse-Kubitza	inputs/.TNRS/schema.sql: taxon_match: added derived column "[accepted_]morphospecies[_binomial]__@Brad__.TNRS@vegpath.org"
14303	07/22/2014 01:37 AM	Aaron Marcuse-Kubitza	inputs/.TNRS/schema.sql: taxon_match: added derived column "[parsed_]morphospecies[_suffix]__@Brad__.morphosp@vegpath.org"
14281	07/21/2014 05:13 PM	Aaron Marcuse-Kubitza	inputs/.TNRS/schema.sql: taxon_match: added derived columns scrubbed_*
14279	07/21/2014 04:34 PM	Aaron Marcuse-Kubitza	inputs/.TNRS/schema.sql: taxon_match: put matched-taxon derived columns before accepted-taxon derived columns
14278	07/21/2014 04:25 PM	Aaron Marcuse-Kubitza	inputs/.TNRS/schema.sql: taxon_match: added derived column "[matched_]species[_binomial]~(~~Accepted_~~)__@TNRS__@vegpath.org"
14269	07/21/2014 09:02 AM	Aaron Marcuse-Kubitza	inputs/.TNRS/schema.sql: taxon_match: added derived column _matched_has_accepted
14267	07/21/2014 08:44 AM	Aaron Marcuse-Kubitza	inputs/.TNRS/schema.sql: taxon_match: added derived columns "[matched_]scientificName[_with_author]__@DwC__@vegpath.org", "[accepted_]scientificName[_with_author]__@DwC__@vegpath.org"
14266	07/21/2014 08:29 AM	Aaron Marcuse-Kubitza	inputs/.TNRS/schema.sql: taxon_match: added derived columns "[accepted_]Infraspecific_rank[_abbr]__@TNRS__@vegpath.org", "[accepted_]infraspecificEpithet__@DwC__@vegpath.org"
14265	07/21/2014 08:26 AM	Aaron Marcuse-Kubitza	inputs/.TNRS/schema.sql: taxon_match: added derived column "__accepted_infraspecific_{rank,epithet}"
14264	07/21/2014 08:19 AM	Aaron Marcuse-Kubitza	inputs/.TNRS/schema.sql: taxon_match: added derived column __accepted_infraspecific_label
14257	07/21/2014 06:54 AM	Aaron Marcuse-Kubitza	inputs/.TNRS/schema.sql: taxon_match: added derived columns "[accepted_]genus__@DwC__@vegpath.org", "[accepted_]specificEpithet__@DwC__@vegpath.org"
14125	07/18/2014 11:40 PM	Aaron Marcuse-Kubitza	fix: inputs/.TNRS/schema.sql: taxon_match: cluster the table so the rows are always exported in the same order, even when an UPDATE statement is used to run triggers
14122	07/18/2014 03:18 PM	Aaron Marcuse-Kubitza	inputs/.TNRS/schema.sql: added _accepted{genus,specific_epithet} derived column, using the steps at wiki.vegpath.org/Postgres_queries#maintaining-derived-column-formulas-across-column-renames
13870	06/26/2014 04:58 AM	Aaron Marcuse-Kubitza	inputs/.TNRS/data.sql: refreshed
13867	06/26/2014 04:28 AM	Aaron Marcuse-Kubitza	inputs/.TNRS/data.sql: refreshed
13863	06/26/2014 02:39 AM	Aaron Marcuse-Kubitza	inputs/.TNRS/schema.sql: tnrs_match: renamed to taxon_match to use the normalized VegCore name for this, and to avoid repeating the schema name
13853	06/25/2014 04:28 PM	Aaron Marcuse-Kubitza	inputs/.TNRS/data.sql: refreshed
13850	06/25/2014 03:33 PM	Aaron Marcuse-Kubitza	inputs/.TNRS/schema.sql: tnrs: renamed to tnrs_match to distinguish it from other TNRS-related tables
13591	06/02/2014 04:50 AM	Aaron Marcuse-Kubitza	lib/tnrs.py: switched to downloading all matches per name, as is needed to implement #917. note that this will break the parts of the schema that use the tnrs table, until Brad's match-picking algorithm can be implemented, but this tradeoff is necessary to be able to begin scrubbing sooner (Martha; wiki.vegpath.org/2014-05-29_conference_call#TNRS)
13590	06/02/2014 04:35 AM	Aaron Marcuse-Kubitza	schemas/vegbien.sql: tnrs_input_name: don't scrub accepted names, as using multiple matches per name no longer provides a single accepted name to scrub. instead, the Accepted_* fields can be whitespace-split to generate the same columns that would have been generated by the scrubbing (and without the overhead of the extra TNRS call).
13579	05/31/2014 10:28 PM	Aaron Marcuse-Kubitza	inputs/.TNRS/schema.sql: tnrs: populate match_num
13573	05/31/2014 08:10 PM	Aaron Marcuse-Kubitza	inputs/.TNRS/schema.sql: tnrs: added match_num
11970	01/20/2014 11:33 AM	Aaron Marcuse-Kubitza	moved everything into /trunk/ to create the standard svn layout, for use with tools that require this (eg. git-svn). IMPORTANT: do NOT do an `svn up`. instead, re-use your working copy's existing files with `svn switch` (http://svnbook.red-bean.com/en/1.6/svn.ref.svn.c.switch.html).
11708	11/21/2013 08:48 AM	Aaron Marcuse-Kubitza	inputs/.TNRS/schema.sql, data.sql: updated for PostgreSQL 9.3
11643	11/10/2013 07:02 PM	Aaron Marcuse-Kubitza	inputs/.TNRS/schema.sql: tnrs: removed no longer used Accepted_scientific_name. use scrubbed_unique_taxon_name instead.
11630	11/10/2013 01:23 PM	Aaron Marcuse-Kubitza	inputs/.TNRS/schema.sql: tnrs: removed Is_homonym, since this did not take into account the never_homonym status (when the author disambiguates) or the ability of a non-homonym at a lower rank to override a homonym at a higher rank. taking these into account just produces the value of is_valid_match.
11629	11/10/2013 01:19 PM	Aaron Marcuse-Kubitza	inputs/.TNRS/schema.sql: tnrs: removed Is_plant, since this functionality is now provided by is_valid_match. note that whether a name is a plant is not meaningful for TNRS, because it can match only plant names (thus a "non-plant" is actually a non-match).
11628	11/10/2013 01:06 PM	Aaron Marcuse-Kubitza	inputs/.TNRS/schema.sql: tnrs: added scrubbed_taxon_name_with_author derived column, which uses the matched name when an accepted name is not available
11627	11/10/2013 09:44 AM	Aaron Marcuse-Kubitza	inputs/.TNRS/schema.sql: tnrs: removed no longer used Max_score. use is_valid_match to determine validity instead.
11626	11/10/2013 12:09 AM	Aaron Marcuse-Kubitza	bugfix: lib/runscripts/file.pg.sql.run: export_(): exclude Source and related tables so that these will be re-created by the staging tables installation instead, ensuring that they are always in sync with the Source/ subdir
11625	11/10/2013 12:08 AM	Aaron Marcuse-Kubitza	inputs/.TNRS/data.sql: updated for new derived columns
10786	08/27/2013 10:14 PM	Aaron Marcuse-Kubitza	inputs/.TNRS/data.sql: re-ran TNRS using `inputs/test_taxonomic_names/test_scrub; rm=1 inputs/.TNRS/data.sql.run export_`
10783	08/27/2013 09:53 PM	Aaron Marcuse-Kubitza	inputs/.TNRS/data.sql: generate from the DB using `rm=1 inputs/.TNRS/data.sql.run export_` instead of being a hand-edited file
10746	08/27/2013 12:25 PM	Aaron Marcuse-Kubitza	inputs/.TNRS/schema.sql: tnrs.Time_submitted: renamed to batch and added fkey to batch.id. this requires including the batch table in inputs/.TNRS/data.sql, so that the fkey is satisfied (batch entries are already added by bin/tnrs_db).
9972	06/20/2013 08:03 AM	Aaron Marcuse-Kubitza	inputs/.TNRS/schema.sql: tnrs: added Is_homonym derived col (uses IRMNG.family_homonym_epithet, genus_homonym_epithet)
9762	06/06/2013 02:42 PM	Aaron Marcuse-Kubitza	inputs/.TNRS/schema.sql: updated for current TSV schema: renamed Accepted_species->Accepted_name_species, Accepted_family->Accepted_name_family
9529	05/23/2013 03:33 PM	Aaron Marcuse-Kubitza	inputs/.TNRS/schema.sql, data.sql: updated TNRS CSV columns to preserve Name_matched_accepted_family even though it isn't present in the current TNRS CSVs. this way, Name_matched_accepted_family can still be used for previously-scrubbed names, and family_matched can be added back to analytical_stem_view. (now that bin/tnrs_db uses an explicit columns list in COPY TO, the absence of a column in the CSV is no longer a problem.)
9493	05/21/2013 07:37 PM	Aaron Marcuse-Kubitza	inputs/.TNRS/schema.sql, data.sql: updated for new TNRS CSV columns (see bug at https://pods.iplantcollaborative.org/jira/browse/TNRS-183). note that these columns may eventually change back (comment by Naim at https://pods.iplantcollaborative.org/jira/browse/TNRS-183#comment-34444).
7855	03/06/2013 02:18 PM	Aaron Marcuse-Kubitza	inputs/.TNRS/data.sql: Re-ran TNRS on the test_taxonomic_names so that the sample data would contain all the accepted names under the current acceptedScientificName formula, and would not produce any entries for unscrubbed names in tnrs_input_name
7293	01/18/2013 07:38 AM	Aaron Marcuse-Kubitza	inputs/.TNRS/schema.sql: tnrs: Added Max_score column for use in filtering out names that will be rejected by taxondetermination's constraints
7133	01/09/2013 09:57 AM	Aaron Marcuse-Kubitza	inputs/.TNRS/schema.sql: tnrs: Added Accepted_scientific_name field which will contain the joined-together accepted name that gets re-parsed by TNRS
5811	10/26/2012 04:34 PM	Aaron Marcuse-Kubitza	inputs/.TNRS/: Added data.sql containing the test_taxonomic_names TNRS results, so that a new installation of VegBIEN will contain the necessary data to make the tests pass, including the TNRS import test
