Project

General

Profile

« Previous | Next » 

Revision 5514

db_xml.py: partition_size: Increased to 1,000,000 (>= NCBI.higher_taxa's size) so NCBI.higher_taxa can be imported completely in one partition. This is necessary because NCBI's taxonconcepts are not in dependency order (parents first), so a later partition cannot rely on the parents of its taxonconcepts having already been imported. Instead, all taxonconcepts must be imported at once and then separately, the parents of all taxonconcepts must be set.

View differences:

lib/db_xml.py
192 192
    return sql.select(db, table, fields, conds, limit, start)
193 193

  
194 194
# Controls when and how put_table() will partition the input table
195
partition_size = 500000 # rows
195
partition_size = 1000000 # rows; must be >= NCBI.higher_taxa size
196 196

  
197 197
def put_table(db, node, in_table, in_row_ct_ref=None, row_ins_ct_ref=None,
198 198
    limit=None, start=0, on_error=exc.reraise, col_defaults={},

Also available in: Unified diff