/ - Repository - BIEN 3 - NCEAS Projects

Name	Size	Revision	Age	Author	Comment
_archive		1598	almost 13 years	Aaron Marcuse-Kubitza	Moved _archive/tapir2flatClient/trunk/client/ t...
analysis		3076	over 12 years	Aaron Marcuse-Kubitza	Added top-level analysis dir for range modeling
bin		3271	over 12 years	Aaron Marcuse-Kubitza	csv2db: verbosity defaults to 3 so that detaile...
config		272	about 13 years	Aaron Marcuse-Kubitza	Moved bien_password to new config dir
inputs		3261	over 12 years	Aaron Marcuse-Kubitza	inputs/import.stats.xls: Updated with stats fro...
lib		3281	over 12 years	Aaron Marcuse-Kubitza	sql.py: DbConn.run_query(): Put the data source...
mappings		3229	over 12 years	Aaron Marcuse-Kubitza	mappings/VegX-VegBIEN.stems.csv: Sort the plant...
schemas		3280	over 12 years	Aaron Marcuse-Kubitza	schemas/vegbien.ERD.mwb: Synced with schema
to_do		2547	over 12 years	Aaron Marcuse-Kubitza	to_do/timeline.doc: Updated to reflect the mont...
Makefile	10.5 KB	3249	over 12 years	Aaron Marcuse-Kubitza	root Makefile: VegBIEN DB: Schemas: Added schem...
README.TXT	2.96 KB	3205	over 12 years	Aaron Marcuse-Kubitza	README.TXT: Data import: Import data into VegBI...
map	1.21 KB	3140	over 12 years	Aaron Marcuse-Kubitza	top-level map: Added support for custom public ...

#	Date	Author	Comment
3281	07/09/2012 05:26 PM	Aaron Marcuse-Kubitza	sql.py: DbConn.run_query(): Put the data source comment on a separate line in the log file instead of using a carriage return, which sometimes had the desired effect of overwriting the src comment with the first line of the query but sometimes the line lengths weren't right and there wasn't enough overlap
3280	07/09/2012 04:53 PM	Aaron Marcuse-Kubitza	schemas/vegbien.ERD.mwb: Synced with schema
3279	07/09/2012 04:42 PM	Aaron Marcuse-Kubitza	schemas/vegbien.sql: Removed per-column indexes, which are no longer needed by either row-based or column-based import because they are able to do a merge join or lookup using the table's UNIQUE INDEX. Instead of forcing the database to build and maintain large indexes (15+ GB!) that are not used, optimization-only (non-UNIQUE) indexes should be added as needed only once the database is actually used for queries. In most cases it will not even be necessary to add additional indexes then, because most UNIQUE indexes can be reused for broad lookups (rather than just duplicate elimination). Even the foreign key covering indexes (fki_*) are not needed because we virtually never delete rows in the DB, and even if we were to start doing that regularly, the cost of maintaining the indexes on import is most likely not worth the speed improvements for cascading deletes.
3278	07/09/2012 04:32 PM	Aaron Marcuse-Kubitza	schemas/py_functions.sql: Removed per-column indexes on relational functions, which are no longer needed by row-based import because it is able to do a merge join-style lookup using the table's UNIQUE INDEX. (Note that column-based import doesn't use the (slower) relational functions at all anymore, and instead calls the corresponding SQL function directly using named arguments.)
3277	07/09/2012 04:31 PM	Aaron Marcuse-Kubitza	schemas/functions.sql: Removed per-column indexes on relational functions, which are no longer needed by row-based import because it is able to do a merge join-style lookup using the table's UNIQUE INDEX. (Note that column-based import doesn't use the (slower) relational functions at all anymore, and instead calls the corresponding SQL function directly using named arguments.)
3276	07/09/2012 04:26 PM	Aaron Marcuse-Kubitza	schemas/vegbien.sql: plantname: plantname_unique UNIQUE INDEX: Moved scope_id to the back so that the index can easily be used for lookup queries (not just column-based import) without having to explicitly specify NULL for that field. This takes advantage of a btree sorting feature where a broader lookup can be done using just the first n columns of the index.
3275	07/09/2012 04:15 PM	Aaron Marcuse-Kubitza	schemas/vegbien.sql: locationevent, specimenreplicate: Turned UNIQUE CONSTRAINTs and UNIQUE INDEXes with nullable fields into partial UNIQUE INDEXes with IS NOT NULL filter conditions, in order to work automatically with sql_gen without requiring a separate covering lookup index. Removed no longer needed covering lookup indexes.
3274	07/09/2012 03:07 PM	Aaron Marcuse-Kubitza	sql_io.py: put_table(): DuplicateKeyException: Fixed bug where combining multiple unique constraints was incorrectly allowed, when in fact the constraints need to be separately applied to the different rows that violate them, which is not currently supported
3273	07/09/2012 03:02 PM	Aaron Marcuse-Kubitza	sql.py: DbConn.with_savepoint(): Log transaction profiling info with level=4 like the rest of the transaction commands, so that it isn't output when the transaction itself should be hidden (e.g. for name versioning or internal commands)
3272	07/09/2012 02:16 PM	Aaron Marcuse-Kubitza	sql.py: DbConn.with_savepoint(): Profile (nested) transactions so that the run time for groups of commands (e.g. csv2db INSERTs) is known
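
Revisions 3272 and 3273 describe timing nested transactions inside DbConn.with_savepoint(). The repository's actual code is not shown on this page, so the following is only a minimal sketch of the general idea, not the project's implementation. It assumes a hypothetical helper name (profiled_savepoint) and uses Python's built-in sqlite3 module in place of the project's PostgreSQL connection so that it can run standalone.

```python
import sqlite3
import time
from contextlib import contextmanager


@contextmanager
def profiled_savepoint(conn, name, timings):
    """Run the enclosed statements in a named SAVEPOINT and record the run time.

    Hypothetical helper for illustration; not the repository's DbConn API.
    """
    conn.execute(f'SAVEPOINT "{name}"')
    start = time.perf_counter()
    try:
        yield
    except Exception:
        # Undo only the work done since this savepoint, then discard it.
        conn.execute(f'ROLLBACK TO SAVEPOINT "{name}"')
        conn.execute(f'RELEASE SAVEPOINT "{name}"')
        raise
    else:
        conn.execute(f'RELEASE SAVEPOINT "{name}"')
    finally:
        # Record the elapsed time whether the group succeeded or failed.
        timings.append((name, time.perf_counter() - start))


# isolation_level=None disables the module's implicit transaction handling,
# so the SAVEPOINT statements above are the only transaction control.
conn = sqlite3.connect(":memory:", isolation_level=None)
conn.execute("CREATE TABLE plot (id INTEGER PRIMARY KEY, name TEXT UNIQUE)")

timings = []
with profiled_savepoint(conn, "outer", timings):
    conn.execute("INSERT INTO plot (name) VALUES ('a')")
    try:
        with profiled_savepoint(conn, "inner", timings):
            conn.execute("INSERT INTO plot (name) VALUES ('b')")
            # Duplicate key: fails, so the whole inner group is rolled back.
            conn.execute("INSERT INTO plot (name) VALUES ('a')")
    except sqlite3.IntegrityError:
        pass

rows = [r[0] for r in conn.execute("SELECT name FROM plot ORDER BY name")]
for name, seconds in timings:
    print(f"{name}: {seconds:.6f} s")
```

Because the inner savepoint is rolled back as a unit, only the row inserted by the outer group survives, and each group's run time is recorded separately. A real logger could emit these timings at a higher verbosity level, as revision 3273 describes.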

Latest revisions

root @ 3281
