Project

General

Profile

« Previous | Next » 

Revision 4835

inputs/REMIB/Specimen/: Filter out invalid, frameshifted rows so they don't produce errors in the import or anomalies like thousands of taxondeterminations for one taxonoccurrence. This involves moving the CSVs to Specimen.src and using a create.sql to create the filtered table.

View differences:

inputs/REMIB/Specimen/node.0.header.csv
1
"acronym","accession_number","family","genus","specificEpithet","country","state","county","locality","long_deg","long_min","long_sec","lat_deg","lat_min","lat_sec","coll_day","coll_month","coll_year","collector","habitat","preparation"
inputs/REMIB/Specimen/create.sql
1
SELECT *
2
FROM "Specimen.src"
3
WHERE coll_year ~ E'^(?:1[7-9]|20)\\d{2}$' AND country !~ E'\\d'
inputs/REMIB/Specimen/header.csv
1
acronym,accession_number,family,genus,specificEpithet,country,state,county,locality,long_deg,long_min,long_sec,lat_deg,lat_min,lat_sec,coll_day,coll_month,coll_year,collector,habitat,preparation,row_num
inputs/REMIB/import_order.txt
1
Specimen
inputs/REMIB/Specimen.src/node.0.header.csv
1
"acronym","accession_number","family","genus","specificEpithet","country","state","county","locality","long_deg","long_min","long_sec","lat_deg","lat_min","lat_sec","coll_day","coll_month","coll_year","collector","habitat","preparation"

Also available in: Unified diff