/ - Changes - BIEN 3 - NCEAS Projects

root @ 4327

#	Date	Author	Comment
4327	08/29/2012 06:33 PM	Aaron Marcuse-Kubitza	mappings/VegCore-VegBIEN.csv: Changed _merge to _join wherever the duplicate-eliminating functionality of _merge is not needed and a simple concatenation of non-NULL values is sufficient
4326	08/29/2012 06:24 PM	Aaron Marcuse-Kubitza	xml_func.py: Added _join() simplifying function
4325	08/29/2012 06:22 PM	Aaron Marcuse-Kubitza	schemas/functions.sql: Added _join()
4324	08/29/2012 06:18 PM	Aaron Marcuse-Kubitza	mappings/VegCore-VegBIEN.csv: Moved "if subplot" _if statement around /location/parent_id and /location/sourceaccessioncode themselves, so that only one _if cond mapping for subplot is needed. Note that this is only possible because this _if statement uses _exists, allowing it to be fully evaluated by the XML template simplifying mechanism, which supports subtrees as arguments to _if.
4323	08/29/2012 06:06 PM	Aaron Marcuse-Kubitza	mappings/VegCore-VegBIEN.csv: Removed no longer used parentLocationID, parentPlotName (locationID and plotName now automatically map to the correct location). mappings/Veg+-VegCore.csv: Removed no longer used parentPlotID.
4322	08/29/2012 05:57 PM	Aaron Marcuse-Kubitza	xml_func.py: passthru(): Use xml_dom.prune() so that after empty children are removed, the node itself is also removed if it's empty. This enables further pruning of any node that contains the pruned node.
4321	08/29/2012 05:55 PM	Aaron Marcuse-Kubitza	xml_dom.py: Added prune()
4320	08/29/2012 05:52 PM	Aaron Marcuse-Kubitza	xml_func.py: Removed no longer used prune() (use xml_dom.prune_children() instead)
4319	08/29/2012 05:51 PM	Aaron Marcuse-Kubitza	xml_func.py: Use new xml_dom.prune_children()
4318	08/29/2012 05:51 PM	Aaron Marcuse-Kubitza	xml_dom.py: Added prune_empty() and prune_children()
4317	08/29/2012 05:29 PM	Aaron Marcuse-Kubitza	inputs/CTFS/: Moved VegX export subdir to _archive and renamed it to remove ".disabled" suffix and have a VegCSV-like name
4316	08/29/2012 05:24 PM	Aaron Marcuse-Kubitza	inputs/CTFS/: Renamed README.TXT to DFtemp.analysis_query.txt because it relates only to a particular query from Shash, and moved it to the _archive/ subdir
4315	08/29/2012 05:21 PM	Aaron Marcuse-Kubitza	inputs/CTFS/: Moved source files into new _src/ subdir to avoid cluttering up the main dir
4314	08/29/2012 05:16 PM	Aaron Marcuse-Kubitza	Added inputs/CTFS/_src/
4313	08/29/2012 05:02 PM	Aaron Marcuse-Kubitza	inputs/CTFS/: Added non-data files that weren't under version control
4312	08/29/2012 04:59 PM	Aaron Marcuse-Kubitza	inputs/CTFS/: Moved _scripts_to_drop_extra_tables to _archive because they are for a different version of the CTFS database than the extract we received (bci.sql)
4311	08/29/2012 04:57 PM	Aaron Marcuse-Kubitza	inputs/CTFS/: Moved DBv5.txt to _archive because it's for a different version of the CTFS database than the extract we received (bci.sql)
4310	08/29/2012 04:49 PM	Aaron Marcuse-Kubitza	inputs/CTFS/: Moved CTFS_conversion_bci.php to _archive since it's just for the DFtemp (aggregated) mapping
4309	08/29/2012 04:48 PM	Aaron Marcuse-Kubitza	Added inputs/CTFS/_archive
4308	08/29/2012 04:39 PM	Aaron Marcuse-Kubitza	inputs/import.stats.xls: Updated with stats from latest import
4307	08/28/2012 07:56 PM	Aaron Marcuse-Kubitza	Added inputs/CTFS/PlotObservation/
4306	08/28/2012 07:54 PM	Aaron Marcuse-Kubitza	mappings/VegCore-VegBIEN.csv: fieldNumber (authoreventcode): Don't copy to location.authorlocationcode if an actual locationID was specified
4305	08/28/2012 07:51 PM	Aaron Marcuse-Kubitza	xml_func.py: simplify(): Removed no longer needed pass-through optimizations for XML functions, which are now handled by each function's own simplifying function
4304	08/28/2012 07:50 PM	Aaron Marcuse-Kubitza	xml_func.py: Added _name simplifying function
4303	08/28/2012 07:48 PM	Aaron Marcuse-Kubitza	xml_func.py: Added _alt, _merge simplifying functions
4302	08/28/2012 07:45 PM	Aaron Marcuse-Kubitza	xml_func.py: passthru(): First prune the node
4301	08/28/2012 07:43 PM	Aaron Marcuse-Kubitza	xml_func.py: simplify(): Use new passthru()
4300	08/28/2012 07:43 PM	Aaron Marcuse-Kubitza	xml_func.py: Added passthru()
4299	08/28/2012 07:36 PM	Aaron Marcuse-Kubitza	xml_func.py: simplify(): Use new prune()
4298	08/28/2012 07:36 PM	Aaron Marcuse-Kubitza	xml_func.py: Added prune()
4297	08/28/2012 07:26 PM	Aaron Marcuse-Kubitza	mappings/VegCore-VegBIEN.csv: Mapped eventID
4296	08/28/2012 07:24 PM	Aaron Marcuse-Kubitza	mappings/Veg+-VegCore.csv: Mapped CTFS Census terms
4295	08/28/2012 07:20 PM	Aaron Marcuse-Kubitza	mappings/Veg+.terms.csv: Added CTFS Census terms
4294	08/28/2012 07:17 PM	Aaron Marcuse-Kubitza	mappings/VegCore-VegBIEN.csv: Changed plotEventStartDate, plotEventEndDate to startDate, endDate because a date range always applies to the event
4293	08/28/2012 07:13 PM	Aaron Marcuse-Kubitza	mappings/Veg+.terms.csv: Added startDate, endDate
4292	08/28/2012 06:59 PM	Aaron Marcuse-Kubitza	README.TXT: Testing: Mapping process: Added command to include column-based import tests
4291	08/28/2012 06:49 PM	Aaron Marcuse-Kubitza	README.TXT: Datasource setup: Update vegbiendev: Added step to run the tests, to make sure the staging tables were installed properly
4290	08/28/2012 06:45 PM	Aaron Marcuse-Kubitza	inputs/CTFS/Plot/: Added create.sql
4289	08/28/2012 06:44 PM	Aaron Marcuse-Kubitza	inputs/CTFS/: Added import_order.txt
4288	08/28/2012 06:40 PM	Aaron Marcuse-Kubitza	Added inputs/CTFS/Subplot/
4287	08/28/2012 06:36 PM	Aaron Marcuse-Kubitza	mappings/Veg+-VegCore.csv: Mapped CTFS QuadratID
4286	08/28/2012 06:26 PM	Aaron Marcuse-Kubitza	mappings/VegCore-VegBIEN.csv: Mapped subplotID
4285	08/28/2012 06:24 PM	Aaron Marcuse-Kubitza	mappings/Veg+.terms.csv: Added subplotID
4284	08/28/2012 06:22 PM	Aaron Marcuse-Kubitza	mappings/Veg+-VegCore.csv: Mapped CTFS Quadrat columns
4283	08/28/2012 06:18 PM	Aaron Marcuse-Kubitza	mappings/VegCore-VegBIEN.csv: Mapped subplotX, subplotY
4282	08/28/2012 06:14 PM	Aaron Marcuse-Kubitza	mappings/VegCore-VegBIEN.csv: Removed empty mappings for unmapped DwC terms because these terms are now listed and maintained in mappings/Veg+.terms.csv
4281	08/28/2012 06:12 PM	Aaron Marcuse-Kubitza	mappings/Veg+.terms.csv: Added Brad's descriptive comments for several VegCore terms
4280	08/28/2012 06:07 PM	Aaron Marcuse-Kubitza	mappings/Veg+.terms.csv: Added subplotX, subplotY
4279	08/28/2012 06:03 PM	Aaron Marcuse-Kubitza	mappings/VegCore-VegBIEN.csv: Made organismX, organismY the official VegCore terms and mapped relativePlotX, relativePlotY to them in mappings/Veg+-VegCore.csv
4278	08/28/2012 06:00 PM	Aaron Marcuse-Kubitza	mappings/Veg+.terms.csv: Added organismX, organismY as clearer alternatives to relativePlotX, relativePlotY
4277	08/28/2012 05:48 PM	Aaron Marcuse-Kubitza	mappings/Veg+.terms.csv: Added CTFS Quadrat columns
4276	08/28/2012 05:38 PM	Aaron Marcuse-Kubitza	Added inputs/CTFS/
4275	08/28/2012 05:36 PM	Aaron Marcuse-Kubitza	input.Makefile: Testing: Only run column-based tests if column-based mode enabled, because these tests are much slower than the row-based tests for small numbers of rows. Note that this involves explicitly turning off column-based mode in the row-based test, to prevent propagation of the by_col env var which both enables these extra tests and sets bin/map to run in column-based mode.
4274	08/28/2012 05:28 PM	Aaron Marcuse-Kubitza	input.Makefile: Testing: Added by-column test, which is compared to the row-based test's accepted output
4273	08/28/2012 05:20 PM	Aaron Marcuse-Kubitza	input.Makefile: Testing: Merged $(runTest) and $(test2Db) because all tests go to the database
4272	08/28/2012 05:19 PM	Aaron Marcuse-Kubitza	input.Makefile: Testing: Moved `$(foreach use_staged,1,...)` from $(test2Db) to $(runTest) because all tests now use the staging tables
4271	08/28/2012 05:15 PM	Aaron Marcuse-Kubitza	input.Makefile: Testing: Merged $(test2Db) and $(testStaged2Db) because all tests now use the staging tables
4270	08/28/2012 05:14 PM	Aaron Marcuse-Kubitza	input.Makefile: Testing: $(runTest): Always use $(map2db) because no tests use other programs (and none have in a while)
4269	08/28/2012 05:09 PM	Aaron Marcuse-Kubitza	input.Makefile: Testing: Run the core test from the staging table, because derived tables only have a staging table and the flat-file test would produce inconsistent results
4268	08/28/2012 05:00 PM	Aaron Marcuse-Kubitza	mappings/Makefile: Fixed bug where rules needed to generate Veg+.self.csv ($(viaSelfMap)) were still using a pattern match that required a table (`.%.`, `.*.`), even though we are no longer using separate maps for separate tables
4267	08/28/2012 04:44 PM	Aaron Marcuse-Kubitza	mappings/Veg+-VegCore.csv: Mapped CTFS Country and Site columns
4266	08/28/2012 04:25 PM	Aaron Marcuse-Kubitza	mappings/Veg+.terms.csv: Added CTFS Country and Site columns
4265	08/28/2012 04:14 PM	Aaron Marcuse-Kubitza	README.TXT: Datasource setup: Adding input data: svn adding the generated map spreadsheets and related files: Added header.csv to the list of files added (for derived tables)
4264	08/28/2012 04:07 PM	Aaron Marcuse-Kubitza	README.TXT: Datasource setup: Adding input data: Documented how to create tables that will be joined together with another table, and how to create tables that are joins of other tables
4263	08/28/2012 04:01 PM	Aaron Marcuse-Kubitza	input.Makefile: Staging tables installation: %/install: Also create header.csv so that there is a CSV header that the map spreadsheets can be autogenerated from
4262	08/28/2012 02:22 PM	Aaron Marcuse-Kubitza	input.Makefile: Staging tables installation: %/install: Add row_num column to derived staging tables so they will have a pkey
4261	08/28/2012 02:21 PM	Aaron Marcuse-Kubitza	sql.py: pkey(): Use pkey_col constant if this column exists, to allow using a row_num column as the pkey even when it is placed at the end of the table (due to being added after the table was created)
4260	08/28/2012 01:59 PM	Aaron Marcuse-Kubitza	input.Makefile: Staging tables installation: %/install: Support alternative generation of a staging table by joining together other staging tables in a create.sql file
4259	08/28/2012 01:57 PM	Aaron Marcuse-Kubitza	input.Makefile: Staging tables installation: %/install: Don't create a row_num column when the table is a joined table because it collides during joins
4258	08/28/2012 01:49 PM	Aaron Marcuse-Kubitza	csv2db: Made input_cmd optional when errors_table_only is on, because the CSV header is not needed to create the errors table
4257	08/28/2012 01:47 PM	Aaron Marcuse-Kubitza	csv2db: Added has_row_num param to disable creating a row_num column
4256	08/28/2012 12:44 PM	Aaron Marcuse-Kubitza	input.Makefile: Existing maps discovery: $(allTables): When prepending unsorted (joined) tables, save them in $(joinedTables) for later use in determining which tables should have a row_num column
4255	08/28/2012 12:27 PM	Aaron Marcuse-Kubitza	README.TXT: Fixed indent
4254	08/28/2012 12:04 PM	Aaron Marcuse-Kubitza	input.Makefile: Staging tables installation: Install all tables, not just those present in import_order.txt. This will later allow staging tables to be derived by joining together other staging tables, which themselves are not imported but still need to be installed.
4253	08/28/2012 11:53 AM	Aaron Marcuse-Kubitza	input.Makefile: Existing maps discovery: $(tables): Prepend unsorted tables (those that are not present in import_order.txt)
4252	08/28/2012 11:04 AM	Aaron Marcuse-Kubitza	input.Makefile: Renamed "...-%" targets to "%/..." so they are more logically associated with a specific subdir
4251	08/28/2012 10:54 AM	Aaron Marcuse-Kubitza	mappings/Veg+.terms.csv: Added Madidi terms that don't exist in other datasources
4250	08/28/2012 10:47 AM	Aaron Marcuse-Kubitza	inputs/Madidi/0.plots/map.csv: Added [Veg+] to root to enable auto-mapping
4249	08/28/2012 10:35 AM	Aaron Marcuse-Kubitza	inputs/import.stats.xls: Updated with stats from latest import
4248	08/27/2012 10:47 PM	Aaron Marcuse-Kubitza	inputs/SALVIAS*/1.organisms/map.csv: Map directly to locationID, plotName instead of parentLocationID, parentPlotName because these terms now map correctly to the parent location when a subplot column exists
4247	08/27/2012 10:43 PM	Aaron Marcuse-Kubitza	mappings/VegCore-VegBIEN.csv: plotName -> /location/authorlocationcode mapping: When subplot is provided, remove this mapping using _if ... _exists instead of _alt so that a NULL subplot value will not cause the parent plot's name to be used for the subplot name
4246	08/27/2012 10:34 PM	Aaron Marcuse-Kubitza	input.Makefile: Testing: $(runTest): Remove outputs of successful tests to reduce clutter
4245	08/27/2012 10:32 PM	Aaron Marcuse-Kubitza	input.Makefile: Testing: %/test.staging.xml: Don't create test.staging.xml at all for non-flat-file inputs, because it is not needed (diff does not run in this case)
4244	08/27/2012 10:23 PM	Aaron Marcuse-Kubitza	mappings/VegCore-VegBIEN.csv: Fixed bug where "if subplot" conditions would evaluate to true only if the subplot was NOT NULL, when they should actually evaluate to true if the datasource specified any subplot column, nullable or not
4243	08/27/2012 10:14 PM	Aaron Marcuse-Kubitza	xml_func.py: simplify(): Removed no longer needed hardcoded _if simplifying code now that there is an _if() simplifying function
4242	08/27/2012 10:10 PM	Aaron Marcuse-Kubitza	db_xml.py: input_col_prefix: Use value of xml_func.var_name_prefix, which is now the place where this value is configured
4241	08/27/2012 10:09 PM	Aaron Marcuse-Kubitza	db_xml.py: Moved input_col_prefix above the put() function that uses it
4240	08/27/2012 10:09 PM	Aaron Marcuse-Kubitza	xml_func.py: Added _if() simplifying function
4239	08/27/2012 10:07 PM	Aaron Marcuse-Kubitza	xml_func.py: Added is_var_name() and is_var()
4238	08/27/2012 10:06 PM	Aaron Marcuse-Kubitza	xml_dom.py: Added NodeEntryIter
4237	08/27/2012 09:33 PM	Aaron Marcuse-Kubitza	xml_func.py: Added _exists()
4236	08/27/2012 09:30 PM	Aaron Marcuse-Kubitza	xml_func.py: simplify(): Added support for custom simplifying functions, which are not hard-coded in simplify()
4235	08/27/2012 09:19 PM	Aaron Marcuse-Kubitza	xml_dom.py: replace_with_text(): Use new bool2str() so that False causes the node to be removed instead of replaced with the empty string
4234	08/27/2012 09:18 PM	Aaron Marcuse-Kubitza	xml_dom.py: Added bool2str()
4233	08/27/2012 08:56 PM	Aaron Marcuse-Kubitza	inputs/SALVIAS*/1.organisms/map.csv: Mapped subplot, Line to new subplot VegCore term
4232	08/27/2012 08:54 PM	Aaron Marcuse-Kubitza	mappings/VegCore-VegBIEN.csv: Mapped subplot, which involved replacing an _if with _alt to both remove plotName as the authorlocationcode and use subplot instead when subplot is specified
4231	08/27/2012 08:47 PM	Aaron Marcuse-Kubitza	mappings/VegCore-VegBIEN.csv: locationID, plotName: Redirect to /location/parent_id/location/* if subplot field is specified
4230	08/27/2012 08:42 PM	Aaron Marcuse-Kubitza	xml_func.py: simplify(): Also remove _if statements with only a condition. This is a required transformation, because such _if statements can't be handled by functions._if() due to there being no argument to provide the anyelement type.
4229	08/27/2012 08:06 PM	Aaron Marcuse-Kubitza	xml_func.py: simplify(): Added pruning optimization that removes empty children. Empty children are created when some mappings don't apply to the current datasource.
4228	08/27/2012 07:58 PM	Aaron Marcuse-Kubitza	xml_func.py: simplify(): Only generate children list if node is a function
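
Revisions 4325 to 4327 distinguish _join from _merge: _merge eliminates duplicates, while _join is a simple concatenation of non-NULL values. The following is a minimal Python sketch of that distinction only, not the project's implementation. The function names join_values and merge_values, the "; " separator, and the choice to return None when no values remain are all assumptions made for illustration.

```python
def join_values(values, sep="; "):
    """Concatenate the non-NULL (non-None) values in order, keeping duplicates.

    Hypothetical stand-in for the behavior the log attributes to _join.
    The separator is an assumption; the log does not state one.
    """
    joined = sep.join(str(v) for v in values if v is not None)
    # Assumption: an input with no usable values yields NULL (None).
    return joined or None


def merge_values(values, sep="; "):
    """Like join_values(), but keep only the first occurrence of each value.

    Hypothetical stand-in for the duplicate-eliminating behavior the log
    attributes to _merge.
    """
    seen = set()
    unique = []
    for v in values:
        if v is None or v in seen:
            continue
        seen.add(v)
        unique.append(v)
    return join_values(unique, sep)


if __name__ == "__main__":
    row = ["plot 1", None, "plot 1", "subplot A"]
    print(join_values(row))   # duplicates kept
    print(merge_values(row))  # duplicates dropped
```

Under these assumptions, a mapping whose inputs can never repeat gives the same result with either function, which is the case where revision 4327 switches from _merge to _join.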
