NYBG issues
- Table of contents
- NYBG issues
- Some fields appear to be imported incorrectly from a CSV file
- Some rows have a MonthCollected but YearCollected is 0
- CoordinatePrecision sometimes uses CV percents (e.g. 'CV 5') instead of decimal degree values
- Some rows have an identifying party but no taxon determination
- MinimumElevation is sometimes '.': FIXED
- Input data is sometimes longer than VegBank allowed a field to be: FIXED
Error rate: 190 error(s)/328,040 row(s) = 0.058%
Some fields appear to be imported incorrectly from a CSV file
Sample row:
key | CatalogNumber | Habitat | UniqueNYInternalRecordNumber |
3096 | 401109 | Boggy edge of Acer forest",,,,309412\n3097,"2008-4-2T13:13:38.000EST | Parinari excelsa Sabine |
Error: the Habitat field contains raw CSV text (a closing quote, delimiters, and the start of the next record)
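Rows like this one could be flagged before import with a simple pattern check. The following is a minimal sketch, not the importer's actual logic: the function name `has_csv_residue` and the pattern itself are assumptions chosen to match the sample row, and a real check might need to be stricter or looser.

```python
import re

# Hypothetical heuristic: a quote next to a comma, an escaped "\n", or a real
# line break inside a single field suggests several CSV fields were merged.
CSV_RESIDUE = re.compile(r'",|,"|\\n|\n')

def has_csv_residue(value):
    """Return True if a field value looks like it contains raw CSV text."""
    return value is not None and CSV_RESIDUE.search(value) is not None

# Habitat value from the sample row (key 3096); "\\n" is a literal backslash-n.
habitat = 'Boggy edge of Acer forest",,,,309412\\n3097,"2008-4-2T13:13:38.000EST'
print(has_csv_residue(habitat))
```

A plain locality string such as 'Guánica Forest.' does not match the pattern, so it would not be flagged.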
Some rows have a MonthCollected but YearCollected is 0
Rows affected: 30
Sample row:
key | YearIdentified | Collector | YearCollected | MonthCollected | DayCollected | CollectedDate | StateProvince | County | UniqueNYInternalRecordNumber |
15265 | 1980 | W. M. Canby | 0 | 8 | 0 | NULL | Maryland | Wicomico Co. | 1107281 |
Error: YearCollected is 0, which indicates that the year is missing
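A check for this condition might look like the sketch below. It assumes that 0 and None both mark a missing date part, as the sample row suggests; the helper names `is_missing` and `month_without_year` are hypothetical.

```python
def is_missing(part):
    """Assumption: 0 and None both mark a missing date part."""
    return part is None or part == 0

def month_without_year(year, month):
    """Return True if a month is given but the year is missing."""
    return is_missing(year) and not is_missing(month)

# YearCollected and MonthCollected from the sample row (key 15265).
print(month_without_year(0, 8))
```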
CoordinatePrecision sometimes uses CV percents (e.g. 'CV 5') instead of decimal degree values
Rows affected: 165
Sample row:
key | CatalogNumber | Locality | Longitude | Latitude | CoordinatePrecision | UniqueNYInternalRecordNumber |
680 | 1071785 | Guánica Forest. | -66.9 | 17.9 | CV 5 | 1216017 |
Error:
SyntaxException: Invalid XML function syntax: CvException: CV (coefficient of variation) values are only allowed for ratio scale data (see <http://en.wikipedia.org/wiki/Coefficient_of_variation>)
function: <_noCV><value>CV 5</value></_noCV>
row #: 619
input row: (680L,)
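One way to separate the two kinds of value before mapping is sketched below. It assumes that CV values always have the form 'CV' followed by a number, as in the sample row, and it makes no attempt to convert a CV percentage into decimal degrees. The function name `classify_coordinate_precision` and the returned labels are hypothetical.

```python
import re

# Assumed format: "CV" followed by a non-negative number, e.g. "CV 5".
CV_PATTERN = re.compile(r"^CV\s*(\d+(?:\.\d+)?)$")

def classify_coordinate_precision(raw):
    """Return a (kind, value) pair for a CoordinatePrecision string."""
    if raw is None or raw.strip() in ("", "NULL"):
        return ("missing", None)
    text = raw.strip()
    match = CV_PATTERN.match(text)
    if match:
        return ("cv_percent", float(match.group(1)))
    # Anything else is expected to be a decimal degree value;
    # float() raises ValueError if it is not numeric.
    return ("decimal_degrees", float(text))

print(classify_coordinate_precision("CV 5"))
```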
Some rows have an identifying party but no taxon determination
Rows affected: fewer than 7 (some of the rows that produce this error do so because of a mappings bug, not because of the input data)
Sample row:
key | Kingdom | Phylum | Class | Order | Family | Genus | Species | Subspecies | ScientificNameAuthor | IdentifiedBy | UniqueNYInternalRecordNumber |
131 | Plantae | NULL | NULL | NULL | Unknown | NULL | NULL | NULL | NULL | C. L. Hitchcock | 670505 |
Error:
NullValueException: columns: plantconcept_id
cause: IntegrityError: null value in column "plantconcept_id" violates not-null constraint
query: INSERT INTO taxondetermination (determinationtype, taxonoccurrence_id, determinationdate, role_id, party_id, iscurrent) VALUES (E'Author', 309, E'1939-03-30', 1697, 174, E'true')
node:
<taxondetermination>
  <role_id><role><rolecode>identifier</rolecode></role></role_id>
  <determinationdate>1939-03-30</determinationdate>
  <determinationtype>Author</determinationtype>
  <iscurrent>true</iscurrent>
  <party_id>
    <party>
      <givenname>C.</givenname>
      <middlename>L.</middlename>
      <surname>Hitchcock</surname>
    </party>
  </party_id>
</taxondetermination>
row #: 118
input row: (131L,)
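Rows with an identifier but no usable taxon name could be found with a check like the sketch below. Two things here are assumptions rather than facts from the import mappings: which rank columns count toward a taxon name (here, every rank below Kingdom), and treating 'Unknown' as a placeholder in the same way as NULL. The function names are hypothetical.

```python
# Assumption: these columns together make up the taxon name.
RANK_COLUMNS = ("Phylum", "Class", "Order", "Family",
                "Genus", "Species", "Subspecies")
# Assumption: these values all mean "no value given".
PLACEHOLDERS = (None, "", "NULL", "Unknown")

def has_taxon_name(row):
    """Return True if at least one rank column holds a real value."""
    return any(row.get(col) not in PLACEHOLDERS for col in RANK_COLUMNS)

def identifier_without_taxon(row):
    """Return True if IdentifiedBy is set but no taxon name is available."""
    return row.get("IdentifiedBy") not in PLACEHOLDERS and not has_taxon_name(row)

# Sample row (key 131); columns not listed here are absent and count as missing.
sample = {"Kingdom": "Plantae", "Family": "Unknown", "Genus": "NULL",
          "IdentifiedBy": "C. L. Hitchcock"}
print(identifier_without_taxon(sample))
```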
MinimumElevation is sometimes '.': FIXED
Rows affected: 2
Sample row:
key | ScientificName | MinimumElevation | MaximumElevation | UniqueNYInternalRecordNumber |
192604 | Aciotis annua (Mart. ex DC.) Triana | . | 0 | 980514 |
Error:
SyntaxException: Invalid XML function syntax: ValueError: could not convert string to float: .
function: <_avg> <max>0</max> <min>.</min> </_avg>
row #: 187102
input row: (192604L,)
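The error comes from averaging the minimum and maximum elevation when one of them is not numeric. The sketch below shows one tolerant way to compute such an average, treating '.' as a missing value. It is not necessarily how the issue was actually fixed, and the helper names are hypothetical.

```python
def to_float_or_none(raw):
    """Convert a raw field to float; '.', '', 'NULL' and None count as missing (assumption)."""
    if raw is None:
        return None
    text = str(raw).strip()
    if text in ("", ".", "NULL"):
        return None
    return float(text)

def average_elevation(minimum, maximum):
    """Average whichever of the two elevations are present, or return None."""
    values = [v for v in (to_float_or_none(minimum), to_float_or_none(maximum))
              if v is not None]
    return sum(values) / len(values) if values else None

# MinimumElevation and MaximumElevation from the sample row (key 192604).
print(average_elevation(".", "0"))
```

With only one usable value, the average falls back to that value. Whether that is appropriate, or whether the elevation should instead be left empty, is a design choice for the mapping.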
Input data is sometimes longer than VegBank allowed a field to be: FIXED
Sample row:
key | CatalogNumber | ScientificName | UniqueNYInternalRecordNumber |
3096 | 401109 | Plagiomnium cuspidatum (Hedw.) T. J. Kop. | Parinari excelsa Sabine |
Error:
DataError: value too long for type character varying(20)
query: INSERT INTO plantobservation (authorplantcode) VALUES (E'Parinari excelsa Sabine')
node: <plantobservation> <!-- ... --> <authorplantcode>Parinari excelsa Sabine</authorplantcode> </plantobservation>
input row: (3096L,)
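Overlong values can be caught before the INSERT with a length check against the column limits. The sketch below takes the limit of 20 for `authorplantcode` from the error message above. The `FIELD_LIMITS` table and the function name `overlong_fields` are hypothetical, and other columns would need their own limits from the schema.

```python
# Limit taken from the error message: character varying(20).
FIELD_LIMITS = {"authorplantcode": 20}

def overlong_fields(record, limits=FIELD_LIMITS):
    """Return {field: actual length} for values that exceed their limit."""
    return {
        name: len(value)
        for name, value in record.items()
        if name in limits and value is not None and len(value) > limits[name]
    }

# Value from the failing INSERT (input row 3096).
print(overlong_fields({"authorplantcode": "Parinari excelsa Sabine"}))
```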