/ - Diff - BIEN 3 - NCEAS Projects


Revision 4835

Added by Aaron Marcuse-Kubitza about 12 years ago

inputs/REMIB/Specimen/: Filter out invalid, frameshifted rows so they don't produce errors in the import or anomalies like thousands of taxondeterminations for one taxonoccurrence. This involves moving the CSVs to Specimen.src and using a create.sql to create the filtered table.

inputs/REMIB/Specimen/node.0.header.csv

"acronym","accession_number","family","genus","specificEpithet","country","state","county","locality","long_deg","long_min","long_sec","lat_deg","lat_min","lat_sec","coll_day","coll_month","coll_year","collector","habitat","preparation"

inputs/REMIB/Specimen/create.sql
SELECT *
FROM "Specimen.src"
WHERE coll_year ~ E'^(?:1[7-9]|20)\\d{2}$' AND country !~ E'\\d'
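
To make the effect of this filter concrete, here is a minimal Python sketch that approximates the WHERE clause: it keeps a row only when coll_year looks like a four-digit year from 1700 to 2099 and country contains no digits. The function name `keep_row` and the sample rows are hypothetical and are not taken from the REMIB data. Python's `re` module is assumed to behave like PostgreSQL's regex engine for these simple patterns.

```python
import re

# Patterns mirroring the WHERE clause in create.sql. PostgreSQL's ~ operator
# is an unanchored match, so re.search is the closest Python equivalent.
COLL_YEAR_OK = re.compile(r'^(?:1[7-9]|20)\d{2}$')
COUNTRY_HAS_DIGIT = re.compile(r'\d')


def keep_row(row):
    """Return True if the row would pass the filter (hypothetical helper)."""
    coll_year = row.get('coll_year')
    country = row.get('country')
    # In SQL, a NULL operand makes the whole condition NULL, so the row is
    # not selected; None is treated the same way here.
    if coll_year is None or country is None:
        return False
    year_ok = COLL_YEAR_OK.search(coll_year) is not None
    country_ok = COUNTRY_HAS_DIGIT.search(country) is None
    return year_ok and country_ok


# Hypothetical sample rows, for illustration only.
rows = [
    {'coll_year': '1987', 'country': 'Mexico'},  # well-formed row
    {'coll_year': 'Mexico', 'country': '12'},    # frameshifted: values in wrong columns
    {'coll_year': '1650', 'country': 'Mexico'},  # year outside 1700-2099
]
kept = [r for r in rows if keep_row(r)]
```

A frameshifted row typically puts text in coll_year or a number in country, so either check alone would reject many such rows; combining them narrows the filter further.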

inputs/REMIB/Specimen/header.csv

acronym,accession_number,family,genus,specificEpithet,country,state,county,locality,long_deg,long_min,long_sec,lat_deg,lat_min,lat_sec,coll_day,coll_month,coll_year,collector,habitat,preparation,row_num

inputs/REMIB/import_order.txt
Specimen

inputs/REMIB/Specimen.src/node.0.header.csv

"acronym","accession_number","family","genus","specificEpithet","country","state","county","locality","long_deg","long_min","long_sec","lat_deg","lat_min","lat_sec","coll_day","coll_month","coll_year","collector","habitat","preparation"
