/ - Changes - BIEN 3 - NCEAS Projects

root @ 4343

#	Date	Author	Comment
4343	08/29/2012 10:04 PM	Aaron Marcuse-Kubitza	inputs/CTFS/: Renamed Site.src to Plot.src to use a VegCSV name for the table
4342	08/29/2012 10:01 PM	Aaron Marcuse-Kubitza	README.TXT: Datasource setup: Adding input data for each table: `make inputs/<datasrc>/<table>/add`: Added note explaining why you need to use this command instead of just creating an empty directory of the desired name
4341	08/29/2012 08:44 PM	Aaron Marcuse-Kubitza	inputs/CTFS/: Added SubplotObservation/
4340	08/29/2012 08:38 PM	Aaron Marcuse-Kubitza	mappings/VegCore-VegBIEN.csv: Redirect eventID, fieldNumber (authoreventcode) to parent locationevent when subplot columns exist
4339	08/29/2012 08:23 PM	Aaron Marcuse-Kubitza	inputs/CTFS/import_order.txt: Added PlotObservation
4338	08/29/2012 08:23 PM	Aaron Marcuse-Kubitza	inputs/CTFS/PlotObservation/: Remade (hadn't been automatically remade because it wasn't part of import_order.txt)
4337	08/29/2012 08:13 PM	Aaron Marcuse-Kubitza	mappings/VegCore-VegBIEN.csv: Also redirect locationID/plotName to parent location if subplotID column was provided
4336	08/29/2012 08:08 PM	Aaron Marcuse-Kubitza	mappings/VegCore-VegBIEN.csv: location.authorlocationcode mappings: Use _first to remove specimens-related alternatives for this field from consideration when plots-related alternatives exist. This avoids unintentionally using specimens-related columns for this field in plots data.
4335	08/29/2012 08:06 PM	Aaron Marcuse-Kubitza	xml_func.py: Added _first() simplifying function
4334	08/29/2012 08:05 PM	Aaron Marcuse-Kubitza	xml_func.py: Added helper functions variadic_args() and map_names()
4333	08/29/2012 07:38 PM	Aaron Marcuse-Kubitza	mappings/VegCore-VegBIEN.csv: location.authorlocationcode mappings: Placed inside "if subplot" _if statement along with sourceaccessioncode to reduce the number of separate _if statements needing a condition mapping
4332	08/29/2012 07:32 PM	Aaron Marcuse-Kubitza	xml_dom.py: NodeEntryIter: Support entries with multiple children
4331	08/29/2012 07:20 PM	Aaron Marcuse-Kubitza	xml_dom.py: replace(): Support a list of new nodes to replace the old node with
4330	08/29/2012 07:01 PM	Aaron Marcuse-Kubitza	xml_dom.py: Moved only_child() near related method has_one_child()
4329	08/29/2012 07:00 PM	Aaron Marcuse-Kubitza	xml_dom.py: only_child(): Raise exception instead of failing assertion. Include invalid node in exception message for easier debugging.
4328	08/29/2012 06:57 PM	Aaron Marcuse-Kubitza	xml_dom.py: Added only_child() and use it where its definition was used
4327	08/29/2012 06:33 PM	Aaron Marcuse-Kubitza	mappings/VegCore-VegBIEN.csv: Changed _merge to _join wherever the duplicate-eliminating functionality of _merge is not needed and a simple concatenation of non-NULL values is sufficient
4326	08/29/2012 06:24 PM	Aaron Marcuse-Kubitza	xml_func.py: Added _join() simplifying function
4325	08/29/2012 06:22 PM	Aaron Marcuse-Kubitza	schemas/functions.sql: Added _join()
4324	08/29/2012 06:18 PM	Aaron Marcuse-Kubitza	mappings/VegCore-VegBIEN.csv: Moved "if subplot" _if statement around /location/parent_id and /location/sourceaccessioncode themselves, so that only one _if cond mapping for subplot is needed. Note that this is only possible because this _if statement uses _exists, allowing it to be fully evaluated by the XML template simplifying mechanism, which supports subtrees as arguments to _if.
4323	08/29/2012 06:06 PM	Aaron Marcuse-Kubitza	mappings/VegCore-VegBIEN.csv: Removed no longer used parentLocationID, parentPlotName (locationID and plotName now automatically map to the correct location). mappings/Veg+-VegCore.csv: Removed no longer used parentPlotID.
4322	08/29/2012 05:57 PM	Aaron Marcuse-Kubitza	xml_func.py: passthru(): Use xml_dom.prune() so that after empty children are removed, the node itself is also removed if it's empty. This enables further pruning of any node that contains the pruned node.
4321	08/29/2012 05:55 PM	Aaron Marcuse-Kubitza	xml_dom.py: Added prune()
4320	08/29/2012 05:52 PM	Aaron Marcuse-Kubitza	xml_func.py: Removed no longer used prune() (use xml_dom.prune_children() instead)
4319	08/29/2012 05:51 PM	Aaron Marcuse-Kubitza	xml_func.py: Use new xml_dom.prune_children()
4318	08/29/2012 05:51 PM	Aaron Marcuse-Kubitza	xml_dom.py: Added prune_empty() and prune_children()
4317	08/29/2012 05:29 PM	Aaron Marcuse-Kubitza	inputs/CTFS/: Moved VegX export subdir to _archive and renamed it to remove ".disabled" suffix and have a VegCSV-like name
4316	08/29/2012 05:24 PM	Aaron Marcuse-Kubitza	inputs/CTFS/: Renamed README.TXT to DFtemp.analysis_query.txt because it relates only to a particular query from Shash, and moved it to the _archive/ subdir
4315	08/29/2012 05:21 PM	Aaron Marcuse-Kubitza	inputs/CTFS/: Moved source files into new _src/ subdir to avoid cluttering up the main dir
4314	08/29/2012 05:16 PM	Aaron Marcuse-Kubitza	Added inputs/CTFS/_src/
4313	08/29/2012 05:02 PM	Aaron Marcuse-Kubitza	inputs/CTFS/: Added non-data files that weren't under version control
4312	08/29/2012 04:59 PM	Aaron Marcuse-Kubitza	inputs/CTFS/: Moved _scripts_to_drop_extra_tables to _archive because they are for a different version of the CTFS database than the extract we received (bci.sql)
4311	08/29/2012 04:57 PM	Aaron Marcuse-Kubitza	inputs/CTFS/: Moved DBv5.txt to _archive because it's for a different version of the CTFS database than the extract we received (bci.sql)
4310	08/29/2012 04:49 PM	Aaron Marcuse-Kubitza	inputs/CTFS/: Moved CTFS_conversion_bci.php to _archive since it's just for the DFtemp (aggregated) mapping
4309	08/29/2012 04:48 PM	Aaron Marcuse-Kubitza	Added inputs/CTFS/_archive
4308	08/29/2012 04:39 PM	Aaron Marcuse-Kubitza	inputs/import.stats.xls: Updated with stats from latest import
4307	08/28/2012 07:56 PM	Aaron Marcuse-Kubitza	Added inputs/CTFS/PlotObservation/
4306	08/28/2012 07:54 PM	Aaron Marcuse-Kubitza	mappings/VegCore-VegBIEN.csv: fieldNumber (authoreventcode): Don't copy to location.authorlocationcode if an actual locationID was specified
4305	08/28/2012 07:51 PM	Aaron Marcuse-Kubitza	xml_func.py: simplify(): Removed no longer needed pass-through optimizations for XML functions, which are now handled by each function's own simplifying function
4304	08/28/2012 07:50 PM	Aaron Marcuse-Kubitza	xml_func.py: Added _name simplifying function
4303	08/28/2012 07:48 PM	Aaron Marcuse-Kubitza	xml_func.py: Added _alt, _merge simplifying functions
4302	08/28/2012 07:45 PM	Aaron Marcuse-Kubitza	xml_func.py: passthru(): First prune the node
4301	08/28/2012 07:43 PM	Aaron Marcuse-Kubitza	xml_func.py: simplify(): Use new passthru()
4300	08/28/2012 07:43 PM	Aaron Marcuse-Kubitza	xml_func.py: Added passthru()
4299	08/28/2012 07:36 PM	Aaron Marcuse-Kubitza	xml_func.py: simplify(): Use new prune()
4298	08/28/2012 07:36 PM	Aaron Marcuse-Kubitza	xml_func.py: Added prune()
4297	08/28/2012 07:26 PM	Aaron Marcuse-Kubitza	mappings/VegCore-VegBIEN.csv: Mapped eventID
4296	08/28/2012 07:24 PM	Aaron Marcuse-Kubitza	mappings/Veg+-VegCore.csv: Mapped CTFS Census terms
4295	08/28/2012 07:20 PM	Aaron Marcuse-Kubitza	mappings/Veg+.terms.csv: Added CTFS Census terms
4294	08/28/2012 07:17 PM	Aaron Marcuse-Kubitza	mappings/VegCore-VegBIEN.csv: Changed plotEventStartDate, plotEventEndDate to startDate, endDate because a date range always applies to the event
4293	08/28/2012 07:13 PM	Aaron Marcuse-Kubitza	mappings/Veg+.terms.csv: Added startDate, endDate
4292	08/28/2012 06:59 PM	Aaron Marcuse-Kubitza	README.TXT: Testing: Mapping process: Added command to include column-based import tests
4291	08/28/2012 06:49 PM	Aaron Marcuse-Kubitza	README.TXT: Datasource setup: Update vegbiendev: Added step to run the tests, to make sure the staging tables were installed properly
4290	08/28/2012 06:45 PM	Aaron Marcuse-Kubitza	inputs/CTFS/Plot/: Added create.sql
4289	08/28/2012 06:44 PM	Aaron Marcuse-Kubitza	inputs/CTFS/: Added import_order.txt
4288	08/28/2012 06:40 PM	Aaron Marcuse-Kubitza	Added inputs/CTFS/Subplot/
4287	08/28/2012 06:36 PM	Aaron Marcuse-Kubitza	mappings/Veg+-VegCore.csv: Mapped CTFS QuadratID
4286	08/28/2012 06:26 PM	Aaron Marcuse-Kubitza	mappings/VegCore-VegBIEN.csv: Mapped subplotID
4285	08/28/2012 06:24 PM	Aaron Marcuse-Kubitza	mappings/Veg+.terms.csv: Added subplotID
4284	08/28/2012 06:22 PM	Aaron Marcuse-Kubitza	mappings/Veg+-VegCore.csv: Mapped CTFS Quadrat columns
4283	08/28/2012 06:18 PM	Aaron Marcuse-Kubitza	mappings/VegCore-VegBIEN.csv: Mapped subplotX, subplotY
4282	08/28/2012 06:14 PM	Aaron Marcuse-Kubitza	mappings/VegCore-VegBIEN.csv: Removed empty mappings for unmapped DwC terms because these terms are now listed and maintained in mappings/Veg+.terms.csv
4281	08/28/2012 06:12 PM	Aaron Marcuse-Kubitza	mappings/Veg+.terms.csv: Added Brad's descriptive comments for several VegCore terms
4280	08/28/2012 06:07 PM	Aaron Marcuse-Kubitza	mappings/Veg+.terms.csv: Added subplotX, subplotY
4279	08/28/2012 06:03 PM	Aaron Marcuse-Kubitza	mappings/VegCore-VegBIEN.csv: Made organismX, organismY the official VegCore terms and map relativePlotX, relativePlotY to them in mappings/Veg+-VegCore.csv
4278	08/28/2012 06:00 PM	Aaron Marcuse-Kubitza	mappings/Veg+.terms.csv: Added organismX, organismY as clearer alternatives to relativePlotX, relativePlotY
4277	08/28/2012 05:48 PM	Aaron Marcuse-Kubitza	mappings/Veg+.terms.csv: Added CTFS Quadrat columns
4276	08/28/2012 05:38 PM	Aaron Marcuse-Kubitza	Added inputs/CTFS/
4275	08/28/2012 05:36 PM	Aaron Marcuse-Kubitza	input.Makefile: Testing: Only run column-based tests if column-based mode enabled, because these tests are much slower than the row-based tests for small numbers of rows. Note that this involves explicitly turning off column-based mode in the row-based test, to prevent propagation of the by_col env var which both enables these extra tests and sets bin/map to run in column-based mode.
4274	08/28/2012 05:28 PM	Aaron Marcuse-Kubitza	input.Makefile: Testing: Added by-column test, which is compared to the row-based test's accepted output
4273	08/28/2012 05:20 PM	Aaron Marcuse-Kubitza	input.Makefile: Testing: Merged $(runTest) and $(test2Db) because all tests go to the database
4272	08/28/2012 05:19 PM	Aaron Marcuse-Kubitza	input.Makefile: Testing: Moved `$(foreach use_staged,1,...)` from $(test2Db) to $(runTest) because all tests now use the staging tables
4271	08/28/2012 05:15 PM	Aaron Marcuse-Kubitza	input.Makefile: Testing: Merged $(test2Db) and $(testStaged2Db) because all tests now use the staging tables
4270	08/28/2012 05:14 PM	Aaron Marcuse-Kubitza	input.Makefile: Testing: $(runTest): Always use $(map2db) because there are no tests that use other programs (and haven't been in awhile)
4269	08/28/2012 05:09 PM	Aaron Marcuse-Kubitza	input.Makefile: Testing: Run the core test from the staging table, because derived tables only have a staging table and the flat-file test would produce inconsistent results
4268	08/28/2012 05:00 PM	Aaron Marcuse-Kubitza	mappings/Makefile: Fixed bug where rules needed to generate Veg+.self.csv ($(viaSelfMap)) were still using a pattern match that required a table (`.%.`, `.*.`), even though we are no longer using separate maps for separate tables
4267	08/28/2012 04:44 PM	Aaron Marcuse-Kubitza	mappings/Veg+-VegCore.csv: Mapped CTFS Country and Site columns
4266	08/28/2012 04:25 PM	Aaron Marcuse-Kubitza	mappings/Veg+.terms.csv: Added CTFS Country and Site columns
4265	08/28/2012 04:14 PM	Aaron Marcuse-Kubitza	README.TXT: Datasource setup: Adding input data: svn adding the generated map spreadsheets and related files: Added header.csv to the list of files added (for derived tables)
4264	08/28/2012 04:07 PM	Aaron Marcuse-Kubitza	README.TXT: Datasource setup: Adding input data: Documented how to create tables that will be joined together with another table, and how to create tables that are joins of other tables
4263	08/28/2012 04:01 PM	Aaron Marcuse-Kubitza	input.Makefile: Staging tables installation: %/install: Also create header.csv so that there is a CSV header that the map spreadsheets can be autogenerated from
4262	08/28/2012 02:22 PM	Aaron Marcuse-Kubitza	input.Makefile: Staging tables installation: %/install: Add row_num column to derived staging tables so they will have a pkey
4261	08/28/2012 02:21 PM	Aaron Marcuse-Kubitza	sql.py: pkey(): Use pkey_col constant if this column exists, to allow using a row_num column as the pkey even when it is placed at the end of the table (due to being added after the table was created)
4260	08/28/2012 01:59 PM	Aaron Marcuse-Kubitza	input.Makefile: Staging tables installation: %/install: Support alternative generation of a staging table by joining together other staging tables in a create.sql file
4259	08/28/2012 01:57 PM	Aaron Marcuse-Kubitza	input.Makefile: Staging tables installation: %/install: Don't create a row_num column when the table is a joined table because it collides during joins
4258	08/28/2012 01:49 PM	Aaron Marcuse-Kubitza	csv2db: Made input_cmd optional when errors_table_only is on, because the CSV header is not needed to create the errors table
4257	08/28/2012 01:47 PM	Aaron Marcuse-Kubitza	csv2db: Added has_row_num param to disable creating a row_num column
4256	08/28/2012 12:44 PM	Aaron Marcuse-Kubitza	input.Makefile: Existing maps discovery: $(allTables): When prepending unsorted (joined) tables, save them in $(joinedTables) for later use in determining which tables should have a row_num column
4255	08/28/2012 12:27 PM	Aaron Marcuse-Kubitza	README.TXT: Fixed indent
4254	08/28/2012 12:04 PM	Aaron Marcuse-Kubitza	input.Makefile: Staging tables installation: Install all tables, not just those present in import_order.txt. This will later allow staging tables to be derived by joining together other staging tables, which themselves are not imported but still need to be installed.
4253	08/28/2012 11:53 AM	Aaron Marcuse-Kubitza	input.Makefile: Existing maps discovery: $(tables): Prepend unsorted tables (those that are not present in import_order.txt)
4252	08/28/2012 11:04 AM	Aaron Marcuse-Kubitza	input.Makefile: Renamed "...-%" targets to "%/..." so they are more logically associated with a specific subdir
4251	08/28/2012 10:54 AM	Aaron Marcuse-Kubitza	mappings/Veg+.terms.csv: Added Madidi terms that don't exist in other datasources
4250	08/28/2012 10:47 AM	Aaron Marcuse-Kubitza	inputs/Madidi/0.plots/map.csv: Added [Veg+] to root to enable auto-mapping
4249	08/28/2012 10:35 AM	Aaron Marcuse-Kubitza	inputs/import.stats.xls: Updated with stats from latest import
4248	08/27/2012 10:47 PM	Aaron Marcuse-Kubitza	inputs/SALVIAS*/1.organisms/map.csv: Map directly to locationID, plotName instead of parentLocationID, parentPlotName because these terms now map correctly to the parent location when a subplot column exists
4247	08/27/2012 10:43 PM	Aaron Marcuse-Kubitza	mappings/VegCore-VegBIEN.csv: plotName -> /location/authorlocationcode mapping: When subplot is provided, remove this mapping using _if ... _exists instead of _alt so that a NULL subplot value will not cause the parent plot's name to be used for the subplot name
4246	08/27/2012 10:34 PM	Aaron Marcuse-Kubitza	input.Makefile: Testing: $(runTest): Remove outputs of successful tests to reduce clutter
4245	08/27/2012 10:32 PM	Aaron Marcuse-Kubitza	input.Makefile: Testing: %/test.staging.xml: Don't create test.staging.xml at all for non-flat-file inputs, because it is not needed (diff does not run in this case)
4244	08/27/2012 10:23 PM	Aaron Marcuse-Kubitza	mappings/VegCore-VegBIEN.csv: Fixed bug where "if subplot" conditions would evaluate to true only if the subplot was NOT NULL, when they should actually evaluate to true if the datasource specified any subplot column, nullable or not

Project

General

Profile