Changes - BIEN 3 - NCEAS Projects

root @ 2257

#	Date	Author	Comment
2257	05/21/2012 06:59 PM	Aaron Marcuse-Kubitza	sql.py: put_table(): Moved getting pkeys of already existing rows from DuplicateKeyException to try clause, so that it always runs if there are join_cols. DuplicateKeyException: Add new duplicate key cols to join_cols instead of replacing join_cols so that multiple unique constraints being violated causes the union of their columns to be used for join_cols.
2256	05/21/2012 06:23 PM	Aaron Marcuse-Kubitza	sql_gen.py: Added CustomCode
2255	05/21/2012 06:05 PM	Aaron Marcuse-Kubitza	sql.py: mk_select(): joins: Fixed bug where joins dict was being modified without first being copied, causing the input value to be modified
2254	05/21/2012 05:52 PM	Aaron Marcuse-Kubitza	Compare object()-based sentinel values using is. Where sentinel values must be compared using ==, use rand.rand_int() instead.
2253	05/21/2012 05:13 PM	Aaron Marcuse-Kubitza	sql.py: put_table(): Added debug messages for every action performed
2252	05/21/2012 04:45 PM	Aaron Marcuse-Kubitza	sql.py: put_table(): Moved assignment of in_pkeys_ref outside loop so it wouldn't need to be re-versioned every iteration
2251	05/21/2012 04:42 PM	Aaron Marcuse-Kubitza	sql.py: put_table(): Changed temp_suffix to temp_prefix so all temp tables for a given out_table would have the same prefix. (Existing name collisions due to truncated names are not a problem because version prefixes are automatically added.)
2250	05/21/2012 04:23 PM	Aaron Marcuse-Kubitza	mappings/DwC2-VegBIEN.specimens.csv: Filter dates through _toTimestamp
2249	05/21/2012 04:20 PM	Aaron Marcuse-Kubitza	schemas/functions.sql: Added _toTimestamp
2248	05/21/2012 04:15 PM	Aaron Marcuse-Kubitza	mappings/DwC2-VegBIEN.specimens.csv: Filter coordsaccuracy through _toDouble
2247	05/21/2012 04:12 PM	Aaron Marcuse-Kubitza	sql.py: FunctionValueException parsing: Support values containing non-word and non-ASCII characters
2246	05/21/2012 04:11 PM	Aaron Marcuse-Kubitza	exc.py: Support exception messages containing non-ASCII characters
2245	05/18/2012 07:10 PM	Aaron Marcuse-Kubitza	sql.py: put_table(): Print debug messages about how exceptions are being handled
2244	05/18/2012 06:45 PM	Aaron Marcuse-Kubitza	sql.py: put_table(): After getting pkeys of already existing rows, insert new rows
2243	05/18/2012 06:42 PM	Aaron Marcuse-Kubitza	sql.py: put_table(): Handle FunctionValueExceptions by excluding rows with the invalid value in their "value" column
2242	05/18/2012 06:41 PM	Aaron Marcuse-Kubitza	sql.py: run_query(): Also parse "invalid input syntax at assignment" errors as FunctionValueExceptions
2241	05/18/2012 06:39 PM	Aaron Marcuse-Kubitza	sql_gen.py: Col: Convert string table names to Table objects
2240	05/18/2012 06:09 PM	Aaron Marcuse-Kubitza	sql.py: run_query(): Parse "invalid input value at assignment" errors' values as well
2239	05/18/2012 05:55 PM	Aaron Marcuse-Kubitza	sql.py: run_query(): Parse "invalid input value at assignment" errors as FunctionValueExceptions
2238	05/18/2012 05:27 PM	Aaron Marcuse-Kubitza	sql.py: mk_select(): joins: filter_out: Pass NULLs through. Use sql_gen.*2sql_gen() to add the left and right table names to the columns.
2237	05/18/2012 05:26 PM	Aaron Marcuse-Kubitza	sql_gen.py: cond2sql_gen(): Take assume_col param and pass it to value2sql_gen()
2236	05/18/2012 04:45 PM	Aaron Marcuse-Kubitza	sql.py: put_table(): Use table-qualified pkey col names whenever possible, to avoid ambiguous column references
2235	05/18/2012 04:12 PM	Aaron Marcuse-Kubitza	mappings/DwC2-VegBIEN.specimens.csv: placenames: Convert ranks using _toPlacerank to work with multi-inserts
2234	05/18/2012 04:11 PM	Aaron Marcuse-Kubitza	sql.py: DbConn._db(): Fixed bug where the isolation level was not set to "SERIALIZABLE" in a portable way
2233	05/18/2012 04:04 PM	Aaron Marcuse-Kubitza	sql.py: mk_select(): distinct_on is turned off when distinct_on == [] rather than when it's None
2232	05/18/2012 03:48 PM	Aaron Marcuse-Kubitza	schemas/vegbien.sql: Added _toPlacerank
2231	05/18/2012 03:43 PM	Aaron Marcuse-Kubitza	schemas/vegbien.sql: Added _toTaxonrank
2230	05/18/2012 03:35 PM	Aaron Marcuse-Kubitza	sql.py: put_table(): Handle NullValueExceptions by removing invalid rows
2229	05/18/2012 03:31 PM	Aaron Marcuse-Kubitza	sql_gen.py: Added NamedCode
2228	05/18/2012 03:30 PM	Aaron Marcuse-Kubitza	sql_gen.py: Added str() to base classes for debugging
2227	05/18/2012 02:46 PM	Aaron Marcuse-Kubitza	sql.py: mk_select() (and sql_gen.py): Fixed bugs where literal strings were treated as literal values when they should have been treated as column names. Take default_table param to determine default table to use if a column doesn't have an explicit table. put_table(): mk_main_select(): Pass in_tables0 as mk_select()'s default_table.
2226	05/18/2012 12:54 PM	Aaron Marcuse-Kubitza	sql.py: mk_select(): cond(): Run additional sql_gen translation functions cond2sql_gen() and col2sql_gen() on the left and right sides of the comparison
2225	05/18/2012 12:50 PM	Aaron Marcuse-Kubitza	sql_gen.py: ValueCond: Fixed bug where values which are Code objects were being converted to Literals. Added cond2sql_gen().
2224	05/17/2012 08:01 PM	Aaron Marcuse-Kubitza	sql.py: mk_select(): join(): Use cond() now that it supports sql_gen format
2223	05/17/2012 07:50 PM	Aaron Marcuse-Kubitza	sql_gen.py: Added col2sql_gen() and use it in value2sql_gen()
2222	05/17/2012 07:25 PM	Aaron Marcuse-Kubitza	sql_gen.py: CompareCond: By default, compare NULL values literally. Support operator values to pass NULLs through.
2221	05/17/2012 07:23 PM	Aaron Marcuse-Kubitza	strings.py: remove_prefix(), remove_suffix(): Added removed_ref param
2220	05/17/2012 06:28 PM	Aaron Marcuse-Kubitza	sql.py: mk_select(): parse_col(): Use sql_gen.value2sql_gen().to_str()
2219	05/17/2012 06:22 PM	Aaron Marcuse-Kubitza	sql_gen.py: Added as_Table(), unescape_table(), value2sql_gen()
2218	05/17/2012 03:37 PM	Aaron Marcuse-Kubitza	sql.py: mk_select(): Documented conds param
2217	05/17/2012 03:32 PM	Aaron Marcuse-Kubitza	sql.py: mk_select(): cond(): Switched to using sql_gen so that custom conds would be supported
2216	05/17/2012 03:19 PM	Aaron Marcuse-Kubitza	sql_gen.py: ValueCond.to_str(): Made value_code a Code object instead of a string, and renamed it to left_value to reflect where it goes. Added as_ValueCond().
2215	05/17/2012 03:11 PM	Aaron Marcuse-Kubitza	sql.py: esc_value(): Fixed bug where db needed to be referenced through self
2214	05/17/2012 02:22 PM	Aaron Marcuse-Kubitza	sql_gen.py: ValueCond.to_str(): Added value_code param
2213	05/17/2012 02:16 PM	Aaron Marcuse-Kubitza	sql_gen.py: Literal, CompareCond: Implemented to_str(). ValueCond: Autoconvert literal values to Literals.
2212	05/17/2012 02:14 PM	Aaron Marcuse-Kubitza	sql.py: DbConn: Added esc_value()
2211	05/17/2012 01:52 PM	Aaron Marcuse-Kubitza	Moved SQL code generation classes from sql.py to new sql_gen.py. sql_gen.py: Added Code, Literal, ValueCond, and CompareCond. sql.py: Removed Query because we will use a different approach.
2210	05/17/2012 12:43 PM	Aaron Marcuse-Kubitza	sql.py: Added Query, Table, Col
2209	05/17/2012 11:28 AM	Aaron Marcuse-Kubitza	sql.py: get(): Fixed bug where limit=1 needs to be passed to select() as a keyword arg now that the distinct_on param comes before it
2208	05/17/2012 11:01 AM	Aaron Marcuse-Kubitza	sql.py: put_table(): mk_main_select(): Pass outer var conds to mk_select()
2207	05/17/2012 10:57 AM	Aaron Marcuse-Kubitza	sql.py: put_table(): mk_select_(): Fixed bug where it was sometimes being called without distinct_on, causing it to return a different # of rows. Renamed mk_select_() to mk_main_select() for clarity.
2206	05/17/2012 10:48 AM	Aaron Marcuse-Kubitza	sql.py: put_table(): Do inserts and selects in a loop so that it will keep retrying the operation with additional constraints until it succeeds
2205	05/15/2012 03:56 PM	Aaron Marcuse-Kubitza	sql.py: put_table(): mk_select_(): Fixed bug where order_by needed to be None because otherwise it wouldn't match the distinct_on cols if they were specified
2204	05/15/2012 03:55 PM	Aaron Marcuse-Kubitza	sql.py: put_table(): insert_(): Fixed bug where distinct_on was not passed to mk_select_()
2203	05/15/2012 03:30 PM	Aaron Marcuse-Kubitza	sql.py: put_table(): mk_select_(): Fixed bug where distinct_on needed to be passed as a keyword param
2202	05/15/2012 03:21 PM	Aaron Marcuse-Kubitza	sql.py: put_table(): insert_() and mk_select_() take distinct_on param
2201	05/15/2012 03:10 PM	Aaron Marcuse-Kubitza	sql.py: put_table(): Factored out code that inserts into pkeys table into run_query_into_pkeys() helper function
2200	05/15/2012 02:55 PM	Aaron Marcuse-Kubitza	sql.py: mk_select(): Implemented DISTINCT ON according to the distinct_on param
2199	05/15/2012 02:48 PM	Aaron Marcuse-Kubitza	sql.py: mk_select(): Added distinct_on param to set the columns to SELECT DISTINCT ON
2198	05/15/2012 02:31 PM	Aaron Marcuse-Kubitza	sql.py: clean_name(): Convert names to lowercase so that PostgreSQL will behave the same whether the name is escaped with "" or not. This will help avoid bugs in code that uses temp tables created by the sql module.
2197	05/15/2012 02:29 PM	Aaron Marcuse-Kubitza	sql.py: put_table(): Added order_by=None wherever rows were not supposed to be re-ordered. On DuplicateKeyException: Save existing pkeys in temp table for joining on.
2196	05/15/2012 01:31 PM	Aaron Marcuse-Kubitza	db_xml.py: put_table(): Pass limit and start to sql.put_table()
2195	05/15/2012 01:09 PM	Aaron Marcuse-Kubitza	db_xml.py: put_table(): Added limit and start options
2194	05/15/2012 11:54 AM	Aaron Marcuse-Kubitza	sql.py: When creating a temporary entity (table, function, etc.), instead create it as a permanent entity in debug mode so it can be viewed after the program is run
2193	05/15/2012 11:40 AM	Aaron Marcuse-Kubitza	sql.py: DbConn: Store whether in debug mode (log_debug != log_debug_none) for easy use by methods
2192	05/15/2012 11:31 AM	Aaron Marcuse-Kubitza	bin/map: connect_db(): Turn on autocommit mode in debug mode if commit is on, so that incremental results can be seen in the DB
2191	05/15/2012 11:30 AM	Aaron Marcuse-Kubitza	sql.py: DbConn: Use internal autocommit handling instead of DB connection autocommit attr to avoid autocommits inside a savepoint
2190	05/15/2012 11:15 AM	Aaron Marcuse-Kubitza	sql.py: DbConn: Added autocommit option to turn on autocommit mode. Use set_session() instead of SQL command to set isolation level.
2189	05/14/2012 05:50 PM	Aaron Marcuse-Kubitza	sql.py: mk_insert_select(): embeddable: Fixed bug where the function could behave differently when run, because the function (and other statements whose cached strings depend on the function name) could be run after the function definition had changed. The fix versions the function name and uses CREATE FUNCTION instead of CREATE OR REPLACE FUNCTION, so that the definition under a given name never changes.
2188	05/14/2012 05:28 PM	Aaron Marcuse-Kubitza	sql.py: Parse "function already exists" errors as DuplicateFunctionException
2187	05/14/2012 05:13 PM	Aaron Marcuse-Kubitza	sql.py: mk_select(): joins: Fixed bug where join_not_equal did not do what it was designed for, which is filtering out matches of the join condition (before the bug fix, it effectively did a cross join with matching rows excluded, causing duplication of rows). Renamed join_not_equal to filter_out to reflect its intended use. Support table-scoped column names in the WHERE conds list.
2186	05/14/2012 04:22 PM	Aaron Marcuse-Kubitza	sql.py: put_table(): Fixed bug where ORDER BY column needed to have table0 name prefixed (if it didn't already have a table name), to avoid ambiguous column references
2185	05/14/2012 04:11 PM	Aaron Marcuse-Kubitza	sql.py: mk_select(): Fixed bug in joins where right_col had the table name prepended before it was copied for use with a different table name in join_using and join_not_equal
2184	05/14/2012 03:42 PM	Aaron Marcuse-Kubitza	Mapped some unmapped fields in DwC inputs
2183	05/14/2012 02:19 PM	Aaron Marcuse-Kubitza	Added mappings/for_review/DwC2-VegBIEN.specimens.fields.csv
2182	05/14/2012 01:21 PM	Aaron Marcuse-Kubitza	db_xml.py: put_table(): Fixed bug where it didn't commit right after inserting the node, but instead waited until children with fkeys to the parent (independent of the node itself) were inserted
2181	05/14/2012 01:16 PM	Aaron Marcuse-Kubitza	sql.py: put_table(): insert_(): Use insert_select() instead of run_query_into() if new option pkeys_table_exists is on
2180	05/14/2012 12:51 PM	Aaron Marcuse-Kubitza	sql.py: mk_select(): Support joins with !=
2179	05/14/2012 12:45 PM	Aaron Marcuse-Kubitza	sql.py: mk_select(): Support only some join columns being join_using
2178	05/14/2012 12:40 PM	Aaron Marcuse-Kubitza	sql.py: put_table(): Renamed in_joins to insert_joins and joins to select_joins for clarity
2177	05/14/2012 12:21 PM	Aaron Marcuse-Kubitza	db_xml.py: put_table(): Support children with fkeys to parent
2176	05/14/2012 12:11 PM	Aaron Marcuse-Kubitza	sql.py: mk_select(): Make tuple optional for None literal values
2175	05/13/2012 02:05 PM	Aaron Marcuse-Kubitza	sql.py: put_table(): Removed "SELECT statement missing a WHERE, LIMIT, or OFFSET clause" warnings
2174	05/13/2012 02:02 PM	Aaron Marcuse-Kubitza	bin/map: by_col: row_ct = 0 because it's unknown for now
2173	05/13/2012 02:00 PM	Aaron Marcuse-Kubitza	mk_select(): Support join conditions with literal values
2172	05/13/2012 01:42 PM	Aaron Marcuse-Kubitza	sql.py: mk_insert_select(): embeddable: Don't cache function_query because function def could change and then change back
2171	05/13/2012 01:35 PM	Aaron Marcuse-Kubitza	sql.py: with_savepoint(): Renamed savepoints to have "level" prefix, since the # indicates the level #
2170	05/13/2012 01:32 PM	Aaron Marcuse-Kubitza	sql.py: get_cur_query(): Also accept input params to combine with input_query, and pass input params when get_cur_query() is called
2169	05/13/2012 01:26 PM	Aaron Marcuse-Kubitza	sql.py: DbConn.run_query(): Pass input query to get_cur_query()
2168	05/13/2012 01:19 PM	Aaron Marcuse-Kubitza	sql.py: get_cur_query() and _add_cursor_info(): Support input_query param that will be used if the raw query is None. Pass input_query in DbConn.execute().
2167	05/13/2012 01:09 PM	Aaron Marcuse-Kubitza	sql.py: DbConn.run_query(): Check that query != None
2166	05/13/2012 01:05 PM	Aaron Marcuse-Kubitza	bin/map: out_is_db: Only rollback() and close() out_db if it was connected
2165	05/13/2012 01:04 PM	Aaron Marcuse-Kubitza	sql.py: DbConn: Added connected()
2164	05/13/2012 01:01 PM	Aaron Marcuse-Kubitza	sql.py: Wrapped calls to get_cur_query() that are used as strings in str(), because get_cur_query() can return None
2163	05/13/2012 12:57 PM	Aaron Marcuse-Kubitza	sql.py: next_version(): Versions start from 1, because first existing name was version 0
2162	05/13/2012 12:55 PM	Aaron Marcuse-Kubitza	put_table(): Use short name for temp_suffix now that version # will be added if needed
2161	05/13/2012 12:51 PM	Aaron Marcuse-Kubitza	sql.py: mk_select(): Parse join columns for literal values and table-scoped names as well
2160	05/13/2012 11:54 AM	Aaron Marcuse-Kubitza	mappings/DwC2-VegBIEN.specimens.csv: establishmentMeans: Call _toGrowthform on growthform
2159	05/13/2012 11:53 AM	Aaron Marcuse-Kubitza	schemas/vegbien.sql: Added _toGrowthform
2158	05/13/2012 11:19 AM	Aaron Marcuse-Kubitza	sql.py: put_table(): Changed temp_prefix to a suffix so main name won't be removed if name is truncated
