Task #917

open

Task #928: switch to new TNRS setup

TNRS: Instructions for new version with TPL

Added by Martha Narro over 10 years ago. Updated over 7 years ago.

Status:
New
Priority:
Normal
Start date:
05/13/2014
Due date:
% Done:

80%

Estimated time:
Activity type:
Coding/analysis

Description

From Brad:

I’ve given some thought to the TPL matter. The algorithm isn’t hard, but Aaron will have to do the sorting himself.

1. Make sure sources are selected in the following order: GCC, TPL, Tropicos, USDA
2. When downloading names, do NOT sort by source (i.e., don't limit results to just the best match when sorted by source)
3. Download all results (not just best matches)
3a. fix anomaly where there were multiple Selected names for some input names (to avoid breaking constraints)
3b. reimplement the parsed-rank columns for the all-matches strategy, which does not have a single scrubbed name per input name to parse (see taxon_match derived columns).
3c. create table and algorithm to store a selected best match for each input name
4. Apply the usual TNRS sort order (see the README; note 1 below) to the matches for a given name.
Aaron, Brad said there are no additional steps to apply here (in step 4). Just proceed to step 5. (--Martha)
Brad now says that the TNRS sort order is actually incorrect because of the Constrain by Source bug.
5. If the best match (indicated by Selected=TRUE) has source=Tropicos and acceptance=accepted AND another match is available where source is not equal to Tropicos and acceptance=synonym, use the latter name. (We don't need this until after the names are scrubbed. --Martha)
6. All other cases, use the best match as flagged

That should filter out most Tropicos nomenclatural synonyms incorrectly labeled accepted. I can unpack #4 for Aaron when the time arrives.

Brad
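
Steps 5 and 6 above can be sketched roughly as follows. This is only an illustration, not the code used in the project: the `Match` record, its field names, and the `choose_match` function are hypothetical, each match is assumed to carry a single source, and the list is assumed to already be in TNRS sort order with at most one match flagged as Selected.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Match:
    # Hypothetical record for one TNRS match of an input name.
    name: str
    source: str       # assumed to be a single source, e.g. "TPL" or "Tropicos"
    acceptance: str   # assumed values: "accepted" or "synonym"
    selected: bool    # True for the match TNRS flagged as best (Selected=TRUE)


def choose_match(matches: List[Match]) -> Optional[Match]:
    """Pick the match to use for one input name (steps 5 and 6).

    `matches` holds all matches for the name, already in TNRS sort order.
    """
    best = next((m for m in matches if m.selected), None)
    if best is None:
        return None  # nothing was flagged as Selected

    # Step 5: a Tropicos "accepted" best match is overridden by a
    # non-Tropicos match that is labeled a synonym, if one exists.
    if best.source.lower() == "tropicos" and best.acceptance.lower() == "accepted":
        for m in matches:
            if m.source.lower() != "tropicos" and m.acceptance.lower() == "synonym":
                return m

    # Step 6: in all other cases, use the best match as flagged.
    return best
```

When several non-Tropicos synonyms are available, this sketch takes the first one in the existing order; the description does not say which to prefer, so that choice is an assumption.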

1. Note that edit_distance = (1 - specific_epithet_score) * greatest_length
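
As a small worked example of the formula in this note, the function below recovers an edit distance from a score. It is a sketch under assumptions: `greatest_length` is taken to be the length of the longer of the two epithets being compared, and the result is rounded to the nearest integer to absorb floating-point error. Neither detail is stated in the note.

```python
def edit_distance(specific_epithet_score: float, greatest_length: int) -> int:
    # edit_distance = (1 - specific_epithet_score) * greatest_length
    # Rounding is an assumption: edit distances are whole numbers, and the
    # floating-point product can land slightly off an integer.
    return round((1 - specific_epithet_score) * greatest_length)


# A score of 0.75 on an 8-character epithet corresponds to 2 edits.
print(edit_distance(0.75, 8))
```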


Files

48_test_names.txt (1.22 KB), Martha Narro, 06/04/2014 05:58 PM
48_test_name_tnrs_results.xlsx (42.2 KB), Martha Narro, 06/04/2014 05:58 PM
Actions #1

Updated by Martha Narro over 10 years ago

Aaron, for now (the next couple of months), this must be done on the development TNRS web app, since TPL is only on the dev app. Martha will send you the URL.

Actions #2

Updated by Aaron Marcuse-Kubitza over 10 years ago

  • Description updated (diff)
Actions #3

Updated by Aaron Marcuse-Kubitza over 10 years ago

  • Parent task set to #928
Actions #4

Updated by Aaron Marcuse-Kubitza over 10 years ago

  • Description updated (diff)
  • % Done changed from 0 to 20
Actions #5

Updated by Martha Narro over 10 years ago

  • Description updated (diff)
Actions #6

Updated by Aaron Marcuse-Kubitza over 10 years ago

  • Description updated (diff)
  • % Done changed from 20 to 40
Actions #7

Updated by Aaron Marcuse-Kubitza over 10 years ago

  • Description updated (diff)
Actions #8

Updated by Aaron Marcuse-Kubitza over 10 years ago

  • Description updated (diff)
Actions #9

Updated by Aaron Marcuse-Kubitza over 10 years ago

  • Description updated (diff)
Actions #10

Updated by Aaron Marcuse-Kubitza over 10 years ago

  • Description updated (diff)
Actions #11

Updated by Martha Narro over 10 years ago

The taxon names Brad sent for testing the rescrubbing are now attached.

Actions #12

Updated by Martha Narro over 10 years ago

  • Description updated (diff)
Actions #13

Updated by Aaron Marcuse-Kubitza over 10 years ago

  • % Done changed from 40 to 60

all the taxon names have now been rescrubbed

Actions #14

Updated by Aaron Marcuse-Kubitza over 10 years ago

  • Description updated (diff)
Actions #15

Updated by Aaron Marcuse-Kubitza over 10 years ago

  • Description updated (diff)
Actions #16

Updated by Aaron Marcuse-Kubitza over 10 years ago

  • Description updated (diff)
Actions #17

Updated by Aaron Marcuse-Kubitza over 10 years ago

  • Description updated (diff)
Actions #18

Updated by Aaron Marcuse-Kubitza over 10 years ago

  • Description updated (diff)
Actions #19

Updated by Aaron Marcuse-Kubitza over 10 years ago

  • Description updated (diff)
Actions #20

Updated by Aaron Marcuse-Kubitza over 10 years ago

  • Description updated (diff)
Actions #21

Updated by Aaron Marcuse-Kubitza over 10 years ago

  • Description updated (diff)
Actions #22

Updated by Aaron Marcuse-Kubitza over 10 years ago

  • Description updated (diff)

multiple Selected names bug fixed in r13855

Actions #23

Updated by Aaron Marcuse-Kubitza over 10 years ago

  • Description updated (diff)
Actions #24

Updated by Aaron Marcuse-Kubitza over 10 years ago

testing Martha's watcher notifications...

Actions #25

Updated by Aaron Marcuse-Kubitza over 10 years ago

  • Description updated (diff)
Actions #26

Updated by Aaron Marcuse-Kubitza over 10 years ago

testing Martha's watcher notifications again after e-mail address fix...

Actions #27

Updated by Aaron Marcuse-Kubitza over 10 years ago

  • Description updated (diff)
Actions #28

Updated by Aaron Marcuse-Kubitza over 10 years ago

  • Description updated (diff)
Actions #29

Updated by Aaron Marcuse-Kubitza over 10 years ago

  • Description updated (diff)
  • % Done changed from 60 to 70
Actions #30

Updated by Aaron Marcuse-Kubitza over 10 years ago

test update now that watcher notifications have been fixed

Actions #31

Updated by Aaron Marcuse-Kubitza over 10 years ago

  • % Done changed from 70 to 80
  • Description updated (diff)
Actions #32

Updated by Aaron Marcuse-Kubitza over 10 years ago

  • Description updated (diff)

a single "-" creates strikethrough formatting, so you have to use "--" instead

Actions #33

Updated by Aaron Marcuse-Kubitza over 10 years ago

  • Description updated (diff)
  • % Done changed from 80 to 70
Actions #34

Updated by Aaron Marcuse-Kubitza over 10 years ago

  • Description updated (diff)
Actions #35

Updated by Aaron Marcuse-Kubitza over 10 years ago

  • Description updated (diff)
Actions #36

Updated by Aaron Marcuse-Kubitza over 10 years ago

  • Description updated (diff)
  • % Done changed from 70 to 80
Actions #37

Updated by Aaron Marcuse-Kubitza over 10 years ago

  • Description updated (diff)
Actions #38

Updated by Aaron Marcuse-Kubitza over 10 years ago

  • Description updated (diff)
  • % Done changed from 80 to 90
Actions #39

Updated by Aaron Marcuse-Kubitza over 10 years ago

  • Description updated (diff)
  • % Done changed from 90 to 80
Actions #40

Updated by Aaron Marcuse-Kubitza over 7 years ago

  • Description updated (diff)