Task #916

Updated by Aaron Marcuse-Kubitza almost 10 years ago

Hi Aaron, 

 Bob only had time to get partway through the VegBank taxon validation file you sent, but there are some errors to correct. It'll be best for him if you fix these, rescrub the TNRS names as described in #917, and send a new extract before he invests more time, so I'm going to go ahead and create issues for them. Please fix these issues and then send Bob a new file in the format described in issue #915. 

 Line numbers refer to the csv file you sent him. 

 Line 617: TNRS gives a synonym for Aronia prunifolia in a different genus, but this is missed here 

 Likely cause: The BIEN scripts may be using only Tropicos as the taxonomic source, not USDA. Tropicos matches only the genus.  

 Fix: Use all sources for the next round of scrubbing. Use them in the order: TPL, Tropicos, GCC, USDA. 
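
 To illustrate what honoring the source order could look like, here is a minimal Python sketch. It is not the BIEN or TNRS code: the function name `pick_match`, the lowercase source keys, and the shape of the match records are all hypothetical, and preferring a species-level match over a genus-only one is an assumption about the desired behavior.

```python
# Hypothetical sketch: choose a match using the source order from this ticket.
# The record layout ({"rank": ..., "name": ...}) is an assumption for illustration.
SOURCE_ORDER = ["tpl", "tropicos", "gcc", "usda"]


def pick_match(matches_by_source, order=SOURCE_ORDER):
    """Return (source, match) for the first source in `order` that has a
    species-level match; otherwise the first source with any match; else None."""
    fallback = None
    for source in order:
        match = matches_by_source.get(source)
        if match is None:
            continue
        if match["rank"] == "species":
            return source, match
        if fallback is None:
            # Remember the best genus-only match in case nothing better turns up.
            fallback = (source, match)
    return fallback


# Illustrative data only: Tropicos matches just the genus, USDA matches the species.
example = {
    "tropicos": {"rank": "genus", "name": "Aronia"},
    "usda": {"rank": "species", "name": "Aronia prunifolia"},
}
result = pick_match(example)
```

 With only Tropicos consulted, the genus-only match would be the best available, which is consistent with the behavior seen on line 617.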


 Line 897:    Diacritical marks in author names are often garbled 

 Likely cause: Character set problem. The same names were checked in the online version of TNRS, which renders the diacritics correctly.  

 Fix: Find where character set problems need to be handled in the BIEN scripts. (TNRS code works so look at what's done there.) 
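
 One common form of this problem is UTF-8 text being decoded as Latin-1 somewhere in a pipeline. The Python sketch below reproduces that failure and reverses it. Whether this is the actual cause in the BIEN scripts is an assumption that needs checking; the helper name `repair_mojibake` is hypothetical, and the author abbreviation is just a sample string.

```python
# Hypothetical sketch: reproduce and undo UTF-8-read-as-Latin-1 garbling.
# This assumes one specific mismatch; the real cause in the scripts may differ.


def repair_mojibake(text):
    """Undo a UTF-8 -> Latin-1 mis-decoding if the text looks like one.
    Text that is already correct (or plain ASCII) is returned unchanged."""
    try:
        return text.encode("latin-1").decode("utf-8")
    except UnicodeError:
        # Not representable as Latin-1, or the bytes are not valid UTF-8:
        # assume the text was never garbled this way.
        return text


original = "M\u00fcll. Arg."  # author abbreviation containing u-umlaut
# Simulate the bug: the UTF-8 bytes are read back with the wrong character set.
garbled = original.encode("utf-8").decode("latin-1")
repaired = repair_mojibake(garbled)
```

 A repair step like this only masks the symptom. The more durable fix is to state the encoding explicitly wherever the scripts read or write files and database connections.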


 Line 1049:    There are two spellings of Erechtites hieraciifolia and TNRS knows both of them, so why is one rejected here? 
 Same issue as Line 617. The two spellings are in USDA and GCC, but not in Tropicos.  


 Please fix these problems and then send Bob a new file in the format described in issue #915. 
