Revision 9530
Added by Aaron Marcuse-Kubitza over 11 years ago
| Old | New | bin/tnrs_db |
|---|---|---|
| 1 | 1 | `#!/usr/bin/env python` |
| 2 | 2 | `# Scrubs the taxonlabels in VegBIEN using TNRS.` |
| 3 | 3 | |
| | 4 | `# to estimate total runtime:` |
| | 5 | `# bin/psql_vegbien <<<'SELECT COUNT(*) FROM tnrs_input_name'` |
| | 6 | `# # names from above * 1.5 multiplier for scrubbing accepted names` |
| | 7 | `# (the test_taxonomic_names sample from Brad produces 8 accepted names for` |
| | 8 | `# 15 input names)` |
| | 9 | `# * ((# ms/name from log file * 1 sec/1000 ms) + (# sec to run` |
| | 10 | `` # `SELECT * FROM "tnrs_input_name"` in log file / tnrs.max_names names/batch)) `` |
| | 11 | `# * 1 hr / 3600 sec * 1 day / 24 hr = # days` |
| | 12 | |
| 4 | 13 | `import os.path` |
| 5 | 14 | `import sys` |
| 6 | 15 | |
bin/tnrs_db: Documented how to estimate total runtime. Note that our tnrs_db wrapper in inputs/.TNRS/tnrs/tnrs.make uses inputs/.TNRS/tnrs/logs/tnrs.make.log.sql as the log file.
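
The runtime estimate described in the added comment can be sketched as a small helper. This is only an illustration of the arithmetic, not code from bin/tnrs_db: the function name, the parameter names, and every numeric input in the usage example are hypothetical. The 1.5 multiplier is the one given in the comment, and `max_names` stands in for whatever value `tnrs.max_names` has.

```python
def estimate_runtime_days(n_input_names, ms_per_name, select_secs, max_names,
                          accepted_multiplier=1.5):
    """Estimate total scrubbing runtime in days (hypothetical helper).

    n_input_names: result of SELECT COUNT(*) FROM tnrs_input_name
    ms_per_name: ms/name taken from the log file
    select_secs: seconds the log file reports for the
        SELECT * FROM "tnrs_input_name" query (run once per batch)
    max_names: names per batch (stand-in for tnrs.max_names)
    accepted_multiplier: extra work for scrubbing accepted names
    """
    total_names = n_input_names * accepted_multiplier
    # Per-name cost: scrubbing time plus the per-batch SELECT spread over the batch
    secs_per_name = ms_per_name / 1000.0 + select_secs / float(max_names)
    total_secs = total_names * secs_per_name
    return total_secs / 3600.0 / 24.0


# Hypothetical inputs, chosen only to show the calculation
days = estimate_runtime_days(n_input_names=1000000, ms_per_name=100,
                             select_secs=5, max_names=5000)
print("estimated runtime: %.2f days" % days)
```

With real values, `n_input_names` would come from the `bin/psql_vegbien` query and the two timing figures from the log file named in the commit message.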