Revision 5514
Added by Aaron Marcuse-Kubitza about 12 years ago
lib/db_xml.py | ||
---|---|---|
192 | 192 |
return sql.select(db, table, fields, conds, limit, start) |
193 | 193 |
|
194 | 194 |
# Controls when and how put_table() will partition the input table |
195 |
partition_size = 500000 # rows
|
|
195 |
partition_size = 1000000 # rows; must be >= NCBI.higher_taxa size
|
|
196 | 196 |
|
197 | 197 |
def put_table(db, node, in_table, in_row_ct_ref=None, row_ins_ct_ref=None, |
198 | 198 |
limit=None, start=0, on_error=exc.reraise, col_defaults={}, |
Also available in: Unified diff
db_xml.py: partition_size: Increased to 1,000,000 (>= NCBI.higher_taxa's size) so NCBI.higher_taxa can be imported completely in one partition. This is necessary because NCBI's taxonconcepts are not in dependency order (parents first), so a later partition cannot rely on the parents of its taxonconcepts having already been imported. Instead, all taxonconcepts must be imported at once and then separately, the parents of all taxonconcepts must be set.