Task #917
Updated by Aaron Marcuse-Kubitza over 10 years ago
From Brad: I’ve given some thought to the TPL matter. The algorithm isn’t hard, but Aaron will have to do the sorting himself. 1. -Make sure sources are selected in the following order: GCC, TPL, Tropicos, USDA- 2. -When downloading names, do NOT sort by source _(ie. don't limit results to just the best match when sorted by source)_- 3. -Download all results (not just best matches)- 3a. reimplement the scrubbed-name columns for the all-matches strategy, which does not have a single scrubbed name per input name to parse 4. -Apply the usual TNRS sort order to the matches for a give name.- Aaron, Brad said there are no additional steps to apply here (in step 4). Just proceed to step 5. (-Martha) 5. If the best match (indicated by Selected=TRUE) has source=Tropicos and acceptance=accepted AND another match is available where source<is not equal to>Tropicos and acceptance=synonym, use the latter name (we don't need this until after the names are scrubbed (Martha)) _ @Selected=true@ _@Selected=true@ does not always indicate the best match, because sometimes multiple names or no names are marked as @Selected@. in cases where no names are marked as @Selected@, these are often perfectly valid matches. *because of issues with @Selected@, it would be better to use the match scores to determine the best match.*_ match.* 6. All other cases, use the best match as flagged That should filter out most Tropicos nomenclatural synonyms incorrectly labeled accepted. I can unpack #4 for Aaron when the time arrives. Brad