/ - Changes - BIEN 3 - NCEAS Projects

root @ 2491

#	Date	Author	Comment
2491	05/30/2012 07:38 PM	Aaron Marcuse-Kubitza	db_xml.py: put_table(): sql.put_table(): Name the into table ...literal instead of ...value if the value column is a literal value
2490	05/30/2012 07:08 PM	Aaron Marcuse-Kubitza	bin/map: Logging: log(): Remove extra debug info from DB query messages and format level 1.5 (summary) messages as Redmine list items
2489	05/30/2012 06:50 PM	Aaron Marcuse-Kubitza	sql.py: put_table(): Renamed temp_prefix param to into and allow it to be a sql_gen.Table object. Use into directly as the pkeys table, and make its default value be `out_table.name+'_pkeys'`.
2488	05/30/2012 06:31 PM	Aaron Marcuse-Kubitza	db_xml.py: put_table(): Pass custom temp_prefix to sql.put_table() for relational funcs, so that their value param's input column name is included in the temp table name
2487	05/30/2012 06:19 PM	Aaron Marcuse-Kubitza	sql.py: put_table(): Added optional param temp_prefix for the prefix of generated temp tables
2486	05/30/2012 06:13 PM	Aaron Marcuse-Kubitza	sql.py: put_table(): Made debug messages more self-documenting
2485	05/30/2012 05:44 PM	Aaron Marcuse-Kubitza	sql.py: put_table(): Changed "Setting missing rows' pkeys to NULL" to "Setting pkeys of missing rows to NULL" to avoid having single quote in debug output, which messes up text editor SQL syntax highlighting
2484	05/30/2012 05:40 PM	Aaron Marcuse-Kubitza	sql.py: Parsed exceptions: Use strings.as_tt() to format Python values
2483	05/30/2012 05:37 PM	Aaron Marcuse-Kubitza	strings.py: Split as_table() into as_table() and as_inline_table() depending on whether the table needs to be inlined in an ordered list item or not
2482	05/30/2012 05:36 PM	Aaron Marcuse-Kubitza	strings.py: Split as_table() into as_table() and as_inline_table() depending on whether the table needs to be inlined in an ordered list item or not
2481	05/30/2012 05:03 PM	Aaron Marcuse-Kubitza	strings.py: as_table(): Changed to use formatting because Redmine tables can't be embedded in ordered lists without restarting the numbering
2480	05/30/2012 03:58 PM	Aaron Marcuse-Kubitza	strings.py: as_table(): Fixed bug where table was not ended properly, by adding a space after the last \n and having rstrip() string only newlines
2479	05/29/2012 09:19 PM	Aaron Marcuse-Kubitza	sql.py: mk_select(): Columns: Separate columns with newlines
2478	05/29/2012 09:10 PM	Aaron Marcuse-Kubitza	sql.py: put_table(): Use new strings.as_table() to format mappings as tables
2477	05/29/2012 09:09 PM	Aaron Marcuse-Kubitza	strings.py: Added as_tt() and as_table()
2476	05/29/2012 09:09 PM	Aaron Marcuse-Kubitza	bin/map: Logging: log(): Strip trailing newlines from msg
2475	05/29/2012 08:40 PM	Aaron Marcuse-Kubitza	strings.py: as_code(): Added multiline param to disable multiline formatted output
2474	05/29/2012 08:33 PM	Aaron Marcuse-Kubitza	sql.py: put_table(): "Ignoring existing rows, comparing on" debug message: Wrap the mapping in strings.as_code() so it will have Redmine syntax-highlighting
2473	05/29/2012 08:26 PM	Aaron Marcuse-Kubitza	sql.py: put_table(): "Putting columns" debug message: Wrap the mapping in strings.as_code() so it will have Redmine syntax-highlighting
2472	05/29/2012 08:22 PM	Aaron Marcuse-Kubitza	sql.py: DbConn.run_query(): Query debug message: Wrap the query in strings.as_code() so it will have Redmine syntax-highlighting
2471	05/29/2012 08:20 PM	Aaron Marcuse-Kubitza	strings.py: Added as_code()
2470	05/29/2012 08:04 PM	Aaron Marcuse-Kubitza	sql.py: DbConn.run_query(): Prepend "DB query" before the query debug message so it can be identified as a DB query
2469	05/29/2012 07:43 PM	Aaron Marcuse-Kubitza	db_xml.py: put_table(): Subset in_table: Document that in_table will be shadowed (hidden) by the created temp table, rather than versioned, now that the table is (almost) always created as a temp table
2468	05/29/2012 07:40 PM	Aaron Marcuse-Kubitza	sql.py: Create temp items as permanent in autocommit mode rather than in debug mode so that temp items are only permanent if actually committing result. This ensures that the generated SQL in test mode matches what would actually get run in regular commit mode, and the SQL is only altered to make the temp items visible if actually debugging (autocommit mode).
2467	05/29/2012 07:30 PM	Aaron Marcuse-Kubitza	sql.py, sql_gen.py: Reformatted generated SQL for presentability by adding newlines
2466	05/29/2012 07:14 PM	Aaron Marcuse-Kubitza	sql.py: DbConn.run_query(): Put a newline before the query in the debug message so that multiline queries have all rows at the left edge rather than the first row prefixed by other text
2465	05/29/2012 07:09 PM	Aaron Marcuse-Kubitza	sql.py: DbConn.run_query(): Don't put generated query debug message all on one line, so that embedded newlines are preserved
2464	05/29/2012 06:59 PM	Aaron Marcuse-Kubitza	sql.py: Fixed bug where queries with versioned identifiers which threw an exception (not related to name collisions) were being output with a too-high log_level, because all exceptions were output with the higher exc_log_level, by making the following changes: DbConn.run_query(): Changed exc_log_level param to log_ignore_excs param so that only certain exceptions would cause the query to be output with a higher log_level. Moved the code that actual emits the query debug message from DbConn.run_query() to module-level run_query() so it would apply the log_ignore_excs filter after the exception had already been parsed into specific types.
2463	05/29/2012 03:16 PM	Aaron Marcuse-Kubitza	Moved "Putting columns" debug message from db_xml.py put_table() to sql.py put_table() to put it in the same place as the other debug messages
2462	05/29/2012 03:12 PM	Aaron Marcuse-Kubitza	sql_gen.py: Added remove_col_rename() and use it where `if isinstance(value, NamedCol): value = value.code` was used
2461	05/29/2012 03:10 PM	Aaron Marcuse-Kubitza	sql_gen.py: CompareCond.to_str(): If left_value has been renamed as a NamedCol, unwrap it
2460	05/29/2012 02:53 PM	Aaron Marcuse-Kubitza	sql_gen.py: Join.to_str(): Fixed bug where USING should be used if all columns are join_same_not_null, rather than join_same, because USING uses plain = for comparison. sql.py: put_table(): input_joins now can use sql_gen.join_same_not_null in order to use USING syntax.
2459	05/25/2012 07:14 PM	Aaron Marcuse-Kubitza	db_xml.py: put_table(): Output debug messages with a level of 1.5 to match sql.put_table()'s level for summary messages
2458	05/25/2012 07:01 PM	Aaron Marcuse-Kubitza	bin/map: Fixed bug where verbosity needed to be 1 outside of test mode so that profiling and errors stats would be printed at end of import. Verbosity defaults to 0.5 rather than 1 in test mode so profiling and errors stats do not clutter up the test output when running automated tests.
2457	05/25/2012 06:55 PM	Aaron Marcuse-Kubitza	bin/map: Only display verbose_errors in test mode, but with any nonzero verbosity. They should not be displayed outside of test mode because verbose errors make the log files huge.
2456	05/25/2012 06:52 PM	Aaron Marcuse-Kubitza	bin/map: Renamed verbose param to verbosity because it's now a number, not a boolean
2455	05/25/2012 06:51 PM	Aaron Marcuse-Kubitza	bin/map: Removed no longer used debug param (verbose=2 is used instead)
2454	05/25/2012 06:48 PM	Aaron Marcuse-Kubitza	bin/map: Fixed bug where verbose_errors' default value depended on debug var, which was not yet set. Removed verbose_errors param and instead turn verbose_errors on whenever verbosity >= 1. Verbosity defaults to 1 in test mode.
2453	05/25/2012 06:33 PM	Aaron Marcuse-Kubitza	bin/map: Logging: Don't set sql.run_raw_query.debug, because it is not used anymore (sql.connect(log_debug=...) is used instead)
2452	05/25/2012 06:29 PM	Aaron Marcuse-Kubitza	bin/map: Logging: Print debug messages (level > 1) prefixed with their level, to distinguish higher- and lower-level debug messages
2451	05/25/2012 06:22 PM	Aaron Marcuse-Kubitza	sql.py: put_table(): Only display warning for exceptions with no handler (which are unexpected), not missing mappings for NOT NULL columns (which are normal in datasources without those columns)
2450	05/25/2012 06:15 PM	Aaron Marcuse-Kubitza	sql.py: put_table(): Log summarizing debug messages with a level of 1.5 so they will be displayed even when the major SQL queries (which have a level of 2) are not shown
2449	05/25/2012 06:08 PM	Aaron Marcuse-Kubitza	bin/map: Provide a log_debug() function to sql.connect() if verbosity > 1 rather than >= 2, to support fractional verbosities
2448	05/25/2012 06:04 PM	Aaron Marcuse-Kubitza	sql.py: log_debug_none: Fixed bug where needed to take kw arg level to work with verbosity-based logging
2447	05/25/2012 05:57 PM	Aaron Marcuse-Kubitza	bin/map: Allow fractional verbosity values
2446	05/25/2012 05:56 PM	Aaron Marcuse-Kubitza	sql.py: Functions that version created tables, functions, etc. if they already exist: Use (default) exc_log_level=4 to hide the unsuccessful attempts to create items that already exist and show only the successful attempt
2445	05/25/2012 05:43 PM	Aaron Marcuse-Kubitza	sql.py: DbConn.run_query(): Added exc_log_level param to specify a different log_level if the query throws an exception. This will useful for functions that version created tables, functions, etc. if they already exist.
2444	05/25/2012 05:34 PM	Aaron Marcuse-Kubitza	sql.py: DbConn.run_query(): Removed no longer accurate doc comment, because that functionality is now in module-level run_query()
2443	05/25/2012 05:31 PM	Aaron Marcuse-Kubitza	sql.py: Specify log_levels for minor queries so they can be excluded from the debug output
2442	05/25/2012 05:16 PM	Aaron Marcuse-Kubitza	sql.py: select(): Pass log_level to run_query()
2441	05/25/2012 05:13 PM	Aaron Marcuse-Kubitza	sql.py: DbConn.run_query(): Added log_level param and pass it to self.log_debug(). run_query(): Pass extra kw_args to DbConn.run_query() (via run_raw_query()) so that caller can specify log_level.
2440	05/25/2012 04:54 PM	Aaron Marcuse-Kubitza	sql.py: run_query_into(): Fixed bug where "temporary tables cannot specify a schema name"
2439	05/25/2012 04:42 PM	Aaron Marcuse-Kubitza	bin/map: Switched to verbosity-level-based system of logging. verbose is now an integer, and debug sets the minimum verbosity to 2.
2438	05/25/2012 04:37 PM	Aaron Marcuse-Kubitza	input.Makefile: Configuration: Removed debug var since it's not used in the Makefile
2437	05/25/2012 04:09 PM	Aaron Marcuse-Kubitza	db_xml.py: put_table(): put_table_(): Fixed bug where row_ins_ct_ref needed to be passed recursively to put_table() as keyword arg, because the in_row_ct_ref is not passed recursively
2436	05/25/2012 04:07 PM	Aaron Marcuse-Kubitza	db_xml.py: put_table(): _simplifyPath: Parse "next" XPath param to extract col name of next level's pkey
2435	05/25/2012 03:26 PM	Aaron Marcuse-Kubitza	bin/map: by_col: xml_func.strip(): Don't remove _simplifyPath because it is now handled by db_xml.put_table()
2434	05/25/2012 03:25 PM	Aaron Marcuse-Kubitza	db_xml.py: put_table(): Added basic special handling for structural XML functions, which for now just skips the function
2433	05/25/2012 03:21 PM	Aaron Marcuse-Kubitza	xml_func.py: strip(): Added preserve param for XML functions not to remove
2432	05/25/2012 02:49 PM	Aaron Marcuse-Kubitza	db_xml.py: put_table(): Handle forward pointers in translation-to-sql_gen step instead of in XML-tree-parsing step, so that special handling for structural XML functions can use the parsed tree before any sql.put_table() processing takes place
2431	05/25/2012 02:44 PM	Aaron Marcuse-Kubitza	xml_dom.py: Added is_node()
2430	05/25/2012 02:22 PM	Aaron Marcuse-Kubitza	sql.py: table_row_count(): Pass start=0 to mk_select() to avoid "SELECT statement missing a WHERE, LIMIT, or OFFSET clause" warnings
2429	05/25/2012 02:12 PM	Aaron Marcuse-Kubitza	sql.py: put_table(): Handle unknown exceptions by returning NULL for all rows. Refactored Missing mapping for NOT NULL column handling to use new helper function remove_all_rows().
2428	05/25/2012 01:54 PM	Aaron Marcuse-Kubitza	sql.py: put_table(): Assert that insert_out_pkeys and insert_in_pkeys have same row count. Assert that pkeys and in_table have same row count.
2427	05/25/2012 12:57 PM	Aaron Marcuse-Kubitza	db_xml.py: put_table(): Use new sql.table_row_count()
2426	05/25/2012 12:56 PM	Aaron Marcuse-Kubitza	sql.py: Added table_row_count()
2425	05/25/2012 12:52 PM	Aaron Marcuse-Kubitza	db_xml.py: put_table(): Use new sql_gen.row_count
2424	05/25/2012 12:47 PM	Aaron Marcuse-Kubitza	sql_gen.py: Added row_count
2423	05/25/2012 12:41 PM	Aaron Marcuse-Kubitza	db_xml.py: put_table(): Count # rows and update in_row_ct_ref once all columns have been processed. Don't pass in_row_ct_ref to recursive calls because it should only be increased once.
2422	05/25/2012 12:28 PM	Aaron Marcuse-Kubitza	db_xml.py: put_table(): Added in_row_ct_ref param to store the # of input rows processed. Renamed row_ct_ref param to row_ins_ct_ref to distinguish it from new in_row_ct_ref param.
2421	05/24/2012 09:26 PM	Aaron Marcuse-Kubitza	sql_gen.py: MockDb.esc_name(): Don't use sql.esc_name_by_module() to avoid circular dependency on sql module
2420	05/24/2012 09:20 PM	Aaron Marcuse-Kubitza	sql.py: put_table(): Factored out mk_select() calls in calls to run_query_into_pkeys() into new helper function insert_into_pkeys()
2419	05/24/2012 09:09 PM	Aaron Marcuse-Kubitza	sql.py: put_table(): run_query_into_pkeys() calls use order_by=None in their select statements because there is a pkey, so order (row #) does not matter
2418	05/24/2012 09:05 PM	Aaron Marcuse-Kubitza	db_xml.py: put_table(): Subset in_table if limit != None or start != 0. start param defaults to 0 again to avoid subsetting the table when starting from row 0 (with no limit).
2417	05/24/2012 08:46 PM	Aaron Marcuse-Kubitza	db_xml.py: put_table(): Don't pass limit, start recursively, because the table subsetting will happen only once in the first invocation of the function. Moved limit, start params to end since they are not passed recursively. start param no longer defaults to 0 because this is not needed since sql.put_table() now sets start to 0 where needed.
2416	05/24/2012 08:38 PM	Aaron Marcuse-Kubitza	sql.py: put_table(): Removed limit and start params because they were never fully implemented, and because it's simpler to just have the caller subset their input table
2415	05/24/2012 08:27 PM	Aaron Marcuse-Kubitza	lists.py: Added uniqify()
2414	05/24/2012 08:08 PM	Aaron Marcuse-Kubitza	sql.py: Moved mk_flatten_mapping(), flatten() to Basic queries section since they don't involve database structure info
2413	05/24/2012 08:06 PM	Aaron Marcuse-Kubitza	sql.py: put_table(): Use single quotes rather than double quotes around strings where possible
2412	05/24/2012 07:59 PM	Aaron Marcuse-Kubitza	schemas/functions.sql, vegbien.sql: Changed CAST-related relational functions to return NULL on data exceptions and convert the exceptions to warnings. This helps column-based import by mapping invalid values to NULL instead of aborting the whole query on the first invalid value.
2411	05/24/2012 07:33 PM	Aaron Marcuse-Kubitza	sql.py: index_col(): Cache the query so it doesn't try to add an index on the same column multiple times
2410	05/24/2012 07:18 PM	Aaron Marcuse-Kubitza	sql.py mk_select(), sql_gen.py Join.to_str(): Fixed bug where conditions needed to be wrapped in () before being AND-ed together to ensure the proper operator precedence
2409	05/24/2012 06:49 PM	Aaron Marcuse-Kubitza	sql.py: put_table(): Add index on columns with invalid values to enable fast filtering
2408	05/24/2012 06:47 PM	Aaron Marcuse-Kubitza	sql.py: Added index_col()
2407	05/24/2012 06:18 PM	Aaron Marcuse-Kubitza	sql.py: put_table(): Add pkey on returned pkeys table to enable fast joins
2406	05/24/2012 06:17 PM	Aaron Marcuse-Kubitza	sql.py: Added index_pkey()
2405	05/24/2012 05:41 PM	Aaron Marcuse-Kubitza	sql.py: mk_update(): When running sql_gen.to_name_only_col(), check that the col's table is table
2404	05/24/2012 05:38 PM	Aaron Marcuse-Kubitza	sql.py: put_table(): Renamed *pkeys to insert*_pkeys to distinguish them from the full set of pkeys on the input table
2403	05/24/2012 05:27 PM	Aaron Marcuse-Kubitza	sql.py: put_table(): FunctionValueException: Change invalid values to NULL using UPDATE instead of filtering them out using WHERE, to avoid adding lots of conditions to the SELECT statement
2402	05/24/2012 05:11 PM	Aaron Marcuse-Kubitza	sql.py: Added mk_update() and update()
2401	05/24/2012 05:10 PM	Aaron Marcuse-Kubitza	sql_gen.py: Added to_name_only_col()
2400	05/24/2012 04:56 PM	Aaron Marcuse-Kubitza	sql_gen.py: Added as_Value()
2399	05/24/2012 04:29 PM	Aaron Marcuse-Kubitza	sql.py: mk_select(): conds: Use new sql_gen.ColValueCond instead of sql_gen.as_ValueCond(). Documented that Code and ValueCond are sql_gen objects.
2398	05/24/2012 04:28 PM	Aaron Marcuse-Kubitza	sql_gen.py: Added ColValueCond
2397	05/24/2012 03:59 PM	Aaron Marcuse-Kubitza	sql.py: mk_flatten_mapping(): Filter str(col) through clean_name() to remove quotes, etc.
2396	05/24/2012 03:58 PM	Aaron Marcuse-Kubitza	sql.py: Added clean_name()
2395	05/24/2012 03:43 PM	Aaron Marcuse-Kubitza	sql.py: put_table(): Join together input tables into new table for speed and so don't modify input if values edited
2394	05/24/2012 03:37 PM	Aaron Marcuse-Kubitza	sql.py: mk_flatten_mapping(): Take as_items param to return a list of dict items instead of a dict. Sort preserve cols before other cols. flatten(): Turn on as_items so that cols list is sorted in input order, with preserve cols first. This ensures that if a pkey is provided in preserve, it will be the first col in the generated table.
2393	05/24/2012 03:24 PM	Aaron Marcuse-Kubitza	sql.py: mk_flatten_mapping(), flatten(): Take list of cols to select instead of using all cols in all tables to join
2392	05/24/2012 02:58 PM	Aaron Marcuse-Kubitza	sql.py: mk_flatten_mapping(), flatten(): Renamed flat_table param to into to be consistent with run_query_into() and put it first because it is the output param

Project

General

Profile