Import process
- Backup benchmarks
- Denormalizing a datasource
- Import steps
- General import steps
- Specific import steps
- Adding ACAD--a Darwin Core datasource
- Adding Cyrille traits--a traits datasource
- Adding Madidi--a flat-file plots datasource
- Mapping a new table in VegBank--a SQL plots datasource
- Refreshing ACAD--a Darwin Core datasource
- Refreshing CVS--an MS Access plots datasource
- Refreshing VegBank--a SQL plots datasource
- Individual datasource refresh
- New-style import
- Normalization techniques
- Result filtering
- Row-based import benchmarks
- TNRS workflow
Steps
See Import steps
Instructions
See README.TXT > Datasource setup, Single datasource import, and Full database import
Proposed refactoring
The proposed refactoring of the import process (below) would reduce the import time by as much as 14 hours and remove the need to maintain large amounts of Python and Makefile code:
Staging table preprocessing