/trunk/inputs/.archive/SALVIAS-CSV/Plot/test.xml.ref - Changes - BIEN 3 - NCEAS Projects

root/trunk/inputs/.archive/SALVIAS-CSV/Plot/test.xml.ref

#	Date	Author	Comment
11970	01/20/2014 11:33 AM	Aaron Marcuse-Kubitza	moved everything into /trunk/ to create the standard svn layout, for use with tools that require this (eg. git-svn). IMPORTANT: do NOT do an `svn up`. instead, re-use your working copy's existing files with `svn switch` (http://svnbook.red-bean.com/en/1.6/svn.ref.svn.c.switch.html).
11717	11/21/2013 11:59 AM	Aaron Marcuse-Kubitza	inputs/: removed SALVIAS-CSV, because this is a sample datasource which was only there to test the mapping process. it should not be adding records that duplicate SALVIAS, nor should it take up maintenance effort (switching to new-style import, updating to match SALVIAS, etc.).
11396	10/21/2013 07:14 PM	Aaron Marcuse-Kubitza	fix: bin/map: put template: comment out the "Put template:" label so that the output is valid XML, and displays properly in a browser rather than showing a syntax error
10866	09/04/2013 11:06 PM	Aaron Marcuse-Kubitza	inputs///test.xml.ref: updated source.shortname for new datasource name, which now starts out with .new suffix
7703	02/26/2013 04:27 PM	Aaron Marcuse-Kubitza	mappings/VegCore-VegBIEN.csv: _avg(): Use numeric param names to work with SQL functions
7469	02/05/2013 04:32 PM	Aaron Marcuse-Kubitza	mappings/VegCore-VegBIEN.csv, inputs///map.csv: Applied term renamings from the new dynamically generated Veg+-VegCore.csv, which reflects the current state of the data dictionary. (Permanently switching to the new Veg+-VegCore.csv will be a separate change.) Updates to VegCore term names that have occurred since the data dictionary was created are now able to take effect, which involves remapping and inferring units on several fields.
6773	12/11/2012 07:57 AM	Aaron Marcuse-Kubitza	inputs/SALVIAS-CSV/Plot/map.csv: Mapped observation_type
6482	11/28/2012 05:52 PM	Aaron Marcuse-Kubitza	schemas/vegbien.sql: Renamed soilobs to soilsample per working group discussion
6432	11/24/2012 01:24 PM	Aaron Marcuse-Kubitza	mappings/VegCore-VegBIEN.csv: Mapped locality description fields to location.iscultivated using _locationnarrative_is_cultivated()
6406	11/24/2012 07:50 AM	Aaron Marcuse-Kubitza	db_xml.py: put(): _setDefault(): Support setting multiple col_defaults at once by using the param names themselves as the column names
6403	11/24/2012 07:29 AM	Aaron Marcuse-Kubitza	mappings/VegCore-VegBIEN.csv: Set the source_id col_default to the datasource name using the new _setDefault() built-in function and _env()
6294	11/19/2012 04:09 PM	Aaron Marcuse-Kubitza	mappings/VegCore-VegBIEN.csv: Mapped acceptedCounty, county to the matched place
6217	11/15/2012 08:26 PM	Aaron Marcuse-Kubitza	mappings/VegCore-VegBIEN.csv: Removed _date/date, because _date using a string date argument is no longer supported under plpython3u (dateutil is missing). Note that PostgreSQL's own date parsing is sufficient for most dates, so this use of _date is not strictly necessary and removing it will improve import times.
6179	11/14/2012 06:30 PM	Aaron Marcuse-Kubitza	schemas/vegbien.sql: Renamed reference -> source to make this table more broadly applicable, and because this now stores the datasource metadata
6123	11/13/2012 02:30 PM	Aaron Marcuse-Kubitza	mappings/VegCore-VegBIEN.csv: matched place's coordinates: Fixed bug where coordinates entry itself needed to have its datasource (reference) set to geoscrub, in addition to the place entry that uses it, in order to match up properly with geoscrub's corresponding input place (whose coordinates as well as place are owned by the geoscrub datasource)
5969	11/02/2012 03:42 PM	Aaron Marcuse-Kubitza	mappings/VegCore-VegBIEN.csv: decimalLatitude/Longitude->geoscrub input coordinates: Also set to NULL if 0 here, not just for the coordinates linked to the datasource's place instance
5958	11/01/2012 10:39 AM	Aaron Marcuse-Kubitza	mappings/VegCore-VegBIEN.csv: matched place: Also map verbatim place's geoscrub-related fields to the matched place, to link up with geoscrub's corresponding input place
5861	10/30/2012 11:23 AM	Aaron Marcuse-Kubitza	mappings/VegCore-VegBIEN.csv: Remapped latitude/longitude to new coordinates table
5855	10/30/2012 09:54 AM	Aaron Marcuse-Kubitza	schemas/vegbien.sql: Renamed placepath to place since this contains primary information about the place, including the reference to the canonical place
4912	09/21/2012 07:52 AM	Aaron Marcuse-Kubitza	inputs/SALVIAS/plotMetadata/map.csv, inputs/SALVIAS-CSV/Plot/map.csv: Soil component measurements: Removed no longer needed old-style _units filter, now that unit conversion is handled by mappings/VegCore-VegBIEN.csv using _percent_to_fraction
4908	09/21/2012 07:03 AM	Aaron Marcuse-Kubitza	mappings/VegCore.csv: Soil component measurements: Added default units of percent (cmol_kg for cationExchangeCapacity). This involves translating the names everywhere and adding a _percent_to_fraction conversion in mappings/VegCore-VegBIEN.csv.
4822	09/18/2012 10:18 PM	Aaron Marcuse-Kubitza	mappings/VegCore-VegBIEN.csv: Removed _rangeStart and _rangeEnd filters from fields which should contain decimal values. These filters should be added on a per-datasource basis instead.
4820	09/18/2012 10:09 PM	Aaron Marcuse-Kubitza	mappings/VegCore-VegBIEN.csv: Removed /_units:[default=m,to=m,to=]/value filter from fields. It should be added on a per-datasource basis instead.
4819	09/18/2012 10:05 PM	Aaron Marcuse-Kubitza	mappings/VegCore-VegBIEN.csv: Removed /_replace:["\bca\.?"=]/value filter from fields. It should be added on a per-datasource basis instead.
4809	09/18/2012 08:14 PM	Aaron Marcuse-Kubitza	inputs/SALVIAS/plotMetadata/map.csv, inputs/SALVIAS-CSV/Plot/map.csv: Remapped elev_m from verbatimElevation to elevationInMeters, since the values are all decimals. Note that the units of SALVIAS Elev were provided by a comment from Brad (and can also be assumed to be the same as SALVIAS-CSV elev_m).
4786	09/18/2012 03:58 PM	Aaron Marcuse-Kubitza	schemas/vegbien.sql: Changed _frac units suffix to _fraction for clarity and for consistency with _percent (which is spelled out), as used by SALVIAS (http://salvias.net/Documents/salvias_data_dictionary.html) and elsewhere
4767	09/17/2012 04:35 PM	Aaron Marcuse-Kubitza	inputs///map.csv: Remapped applicable plotArea fields to plotAreaHa
4754	09/17/2012 02:29 PM	Aaron Marcuse-Kubitza	schemas/vegbien.sql: Added units suffix to additional VegBIEN fields that have units
4753	09/17/2012 02:01 PM	Aaron Marcuse-Kubitza	schemas/vegbien.sql: Added units suffix to all core VegBIEN fields that have units. It is the responsibility of the mappings to ensure that all units are properly translated.
4727	09/17/2012 09:00 AM	Aaron Marcuse-Kubitza	schemas/vegbien.sql: Added placepath (analogous to taxonpath), and point locationplace to it instead of directly to namedplace
4726	09/17/2012 08:11 AM	Aaron Marcuse-Kubitza	schemas/vegbien.sql: Split locationdetermination into locationcoords and locationplace, so that coordinate determinations can be made separately from place determinations
4622	09/12/2012 08:04 AM	Aaron Marcuse-Kubitza	mappings/VegCore-VegBIEN.csv: Removed no longer needed /_simplifyPath:[next=parent_id]/path expressions in specific paths because parent_id forwarding is now set globally for all paths in the map root
4621	09/12/2012 07:56 AM	Aaron Marcuse-Kubitza	mappings/VegCore-VegBIEN.csv: Added /_simplifyPath:[next=parent_id]/path to root so the returned subplot location will be its parent location if there is no subplot name or ID (indicating that that particular plot did not have subplots). Note that this also causes the parent_id forwarding effect to occur for all other tables containing parent_id, which will help prevent similar issues with subplot events, etc. This will hopefully fix the SALVIAS.plotObservations bug where some organisms did not have a subplot #, causing the subplot location to become NULL and causing the corresponding locationevent rows not to match the locationevent_unique_within_location index filter condition (which requires a parent_id), which caused multiple output table pkeys to be returned for those rows, violating the locationevent_pkeys temp table's primary key.
4620	09/12/2012 07:25 AM	Aaron Marcuse-Kubitza	mappings/VegCore-VegBIEN.csv: namedplace elements: _simplifyPath() calls: Removed no longer needed `require` arg, and removed no longer needed table suffix from `next` arg
4563	09/11/2012 01:23 AM	Aaron Marcuse-Kubitza	mappings/VegCore-VegBIEN.csv: startDate, endDate mappings: Removed _dateRangeStart/_dateRangeEnd filters because these are assumed to already be start and end dates of a range. (eventDate should be used for concatenated date ranges.)
4386	08/30/2012 12:45 PM	Aaron Marcuse-Kubitza	inputs/: Renamed subfolders to VegCSV names, using the steps at <https://projects.nceas.ucsb.edu/nceas/projects/bien/wiki/VegCSV_subfolders#Rename-subfolders-to-VegCSV-names>
4333	08/29/2012 07:38 PM	Aaron Marcuse-Kubitza	mappings/VegCore-VegBIEN.csv: location.authorlocationcode mappings: Placed inside "if subplot" _if statement along with sourceaccessioncode to reduce the number of separate _if statements needing a condition mapping
4324	08/29/2012 06:18 PM	Aaron Marcuse-Kubitza	mappings/VegCore-VegBIEN.csv: Moved "if subplot" _if statement around /location/parent_id and /location/sourceaccessioncode themselves, so that only one _if cond mapping for subplot is needed. Note that this is only possible because this _if statement uses _exists, allowing it to be fully evaluated by the XML template simplifying mechanism, which supports subtrees as arguments to _if.
4182	08/22/2012 03:23 PM	Aaron Marcuse-Kubitza	inputs: Move src subdir into main dir, using the steps at <https://projects.nceas.ucsb.edu/nceas/projects/bien/wiki/VegCSV_subfolders#Move-src-subdir-into-main-dir>
4120	08/20/2012 10:20 PM	Aaron Marcuse-Kubitza	inputs: Moved test outputs into subfolders, using the steps at <https://projects.nceas.ucsb.edu/nceas/projects/bien/wiki/VegCSV_subfolders#Move-test-outputs-into-subfolders>
4107	08/17/2012 07:30 PM	Aaron Marcuse-Kubitza	inputs: Renamed plots table to 0.plots so import order would be inherent in the dir name, using steps at <https://projects.nceas.ucsb.edu/nceas/projects/bien/wiki/VegCSV_subfolders#Rename-subfolders-with-import-order>
4043	08/15/2012 06:15 AM	Aaron Marcuse-Kubitza	mappings/DwC2-VegBIEN.specimens.csv, DwC1-DwC2.specimens.csv: Split eventDate into eventDate and dateCollected, where eventDate refers only to the date of the sampling event, but dateCollected also refers to the date the particular specimen was collected. (This distinction is important in merging with VegCSV, because in plots data, these two fields are distinct.) Remapped datasources with dateCollected-related fields to new dateCollected.
4016	08/15/2012 12:45 AM	Aaron Marcuse-Kubitza	input via maps: Removed _date/date filter from date fields because the main mappings now have _date around all dates, so this filter is redundant
4001	08/14/2012 09:31 PM	Aaron Marcuse-Kubitza	mappings/VegCSV-VegBIEN.specimens.csv: locationID: Removed mapping to locationevent.sourceaccessioncode, because locationID relates to the plot, not the plot event. (The locationevent is scoped by the location when the sourceaccessioncode and authoreventcode are not specified, so duplicate elimination will still occur correctly.)
3999	08/14/2012 09:04 PM	Aaron Marcuse-Kubitza	mappings/VegCSV-VegBIEN.specimens.csv: plotName: Removed authoreventcode mapping because plotName relates to the plot, not the plot event. (The locationevent is scoped by the location when the authoreventcode is not specified, so duplicate elimination will still occur correctly.) Instead map only authoreventcode-related fields (currently CVS's authorObsCode) to authoreventcode, via DwC's (confusingly-named) fieldNumber ("An identifier given to the event in the field").
3994	08/13/2012 06:12 PM	Aaron Marcuse-Kubitza	mappings/VegCSV-VegBIEN.specimens.csv: location unique keys: Map to a new parent location for the location, instead of a parent locationevent for the locationevent. This much simpler mapping (which does not require _alt or _merge) is possible now that the necessary unique indexes have been set up.
3990	08/13/2012 03:39 PM	Aaron Marcuse-Kubitza	mappings/VegCSV-VegBIEN.specimens.csv: Map to new location.authorlocationcode
3985	08/13/2012 02:07 PM	Aaron Marcuse-Kubitza	mappings/VegCSV-VegBIEN.specimens.csv: maximumElevationInMeters: Fixed bug where _rangeEnd filter needed to be removed because this only works on a field which can be either a range or the start of a range, such as minimumElevationInMeters (on an end-of-range field, a single value will be removed completely). Added _alt for mergeability with DwC. minimumElevationInMeters: Added elevationrange-to mapping using _rangeEnd for mergeability with DwC.
3984	08/13/2012 01:53 PM	Aaron Marcuse-Kubitza	mappings/DwC2-VegBIEN.specimens.csv, VegCSV-VegBIEN.specimens.csv: minimum/maximumElevationInMeters, minimum/maximumDepthInMeters: Remove any "ca." prefix from value. Doing this on all elevation/depth fields will make the DwC and VegCSV mappings mergeable.
3983	08/13/2012 01:04 PM	Aaron Marcuse-Kubitza	mappings/VegCSV-VegBIEN.specimens.csv: locality: Mapped using same XPath as DwC, to enable merging
3953	08/10/2012 09:37 PM	Aaron Marcuse-Kubitza	schemas/vegbien.sql: locationevent: Renamed authorlocationcode to authoreventcode to be consistent with the table name. Note that for our current datasources, the plot = the plot event, so the authoreventcode and authorlocationcode/authorPlotCode will be the same.
3948	08/10/2012 06:18 PM	Aaron Marcuse-Kubitza	mappings/VegCSV-VegBIEN.specimens.csv: decimalLatitude/decimalLongitude: Added _nullIf suffix for mergability with VegCSV-VegBIEN.specimens.csv
3919	08/09/2012 01:29 PM	Aaron Marcuse-Kubitza	VegBIEN mappings: Wrapped dates in _date() and _dateRangeStart()/_dateRangeEnd(), to assist in importing date and date range values that PostgreSQL cannot parse. This will increase the import time, but hopefully also decrease the # of invalid values in the errors tables. (These functions can later be optimized to reduce the impact on import time.)
3795	08/06/2012 07:39 PM	Aaron Marcuse-Kubitza	mappings/VegX-VegBIEN.stems.csv: Reversed XPaths so that they start with location instead of plantobservation
3782	08/03/2012 06:32 PM	Aaron Marcuse-Kubitza	mappings/VegX-VegBIEN.stems.csv: Expanded {} expressions using expand_braces, so that each distinct output for the same input is on its own line, improving readability. This will also help enable search-and-replace reversing of XPaths for the re-rooting to location.
3770	08/02/2012 09:12 PM	Aaron Marcuse-Kubitza	mappings: location: Removed centerlatitude/centerlongitude mappings because the lat/long should be in only one place: the locationdetermination. It is up to the database querier to decide which locationdetermination(s) to use as the coordinates for a plot/specimen.
3696	07/31/2012 08:04 PM	Aaron Marcuse-Kubitza	bin/map: Don't create unneeded /_ignore/inLabel element containing the datasource name because sql_io.put_table() now autopopulates the datasource_id
3693	07/30/2012 08:28 PM	Aaron Marcuse-Kubitza	mappings/VegX-VegBIEN.stems.csv: elevation, elevationrange: Added _rangeStart/_rangeEnd filter
3678	07/30/2012 01:31 PM	Aaron Marcuse-Kubitza	mappings/DwC2-VegBIEN.specimens.csv, VegX-VegBIEN.stems.csv: Removed all manual mappings to datasource_id now that datasource_id is auto-populated, both on the VegBIEN output side and the DwC/VegX input side. This should greatly simplify many of the mappings!
3642	07/27/2012 06:31 PM	Aaron Marcuse-Kubitza	input.Makefile: Testing: Renamed import.*.out tests to end in .xml because they now contain XML import trees for validation, and this extension turns on XML syntax highlighting in a text editor
3641	07/27/2012 06:03 PM	Aaron Marcuse-Kubitza	bin/map: out_is_db: Output the put template to stdout so it will be validated in the automated testing
3221	07/05/2012 11:22 AM	Aaron Marcuse-Kubitza	mappings/VegX-VegBIEN.stems.csv: Map collectiondate to specimenreplicate via voucher when the voucher is indirect, rather than always directly to the taxonoccurrence, because the collectiondate relates to the specimenreplicate, not the taxonoccurrence, and is not necessarily 1:1 with it
3163	06/29/2012 03:30 AM	Aaron Marcuse-Kubitza	schemas/vegbien.sql: location: Dropped unique constraint on lat/long because it covered only some rows, which interfered with column-based import's selection of different insert methods based on the presence or absence of duplicate keys. (With the constraint, locations with coordinates would have duplicates eliminated, but locations without coordinates would not be able to find which row was added for a particular location because there was no lookup key to join on, and would all just use the first inserted row.) The previous behavior didn't make much sense anyway, because it would assert that two locationevents occurred in the same place just because they had the same coordinates, which may not have been precise enough to make this determination. Asserting that two locationevents occurred in the same place is really part of the secondary validation, not the import process.
2015	04/30/2012 04:15 AM	Aaron Marcuse-Kubitza	bin/map: If outputting to a DB, also create output XML elements for NULL input values. This will help with the transition to using the same XML tree for all rows.
1323	03/09/2012 04:00 PM	Aaron Marcuse-Kubitza	VegBIEN mappings: lat/long locationdetermination: Removed [!namedplace_id] key so that it's merged into the namedplace locationdetermination
1322	03/09/2012 03:54 PM	Aaron Marcuse-Kubitza	VegBIEN mappings: Changed namedplace mappings to use new nested format for storing place containment relationships
1316	03/09/2012 02:30 PM	Aaron Marcuse-Kubitza	VegBIEN mappings: Map namedplaces using new rank field
1315	03/09/2012 02:25 PM	Aaron Marcuse-Kubitza	vegbien.sql: namedplace: Added rank. Do duplicate elimination using rank and parent_id instead of placesystem
1210	03/02/2012 12:17 PM	Aaron Marcuse-Kubitza	vegbien.sql: stemobservation: Added diameteraccuracy. Reordered fields.
1206	03/01/2012 05:33 PM	Aaron Marcuse-Kubitza	vegbien.sql: Changed specimenreplicate_unique_collectionnumber constraint to include verbatimcollectorname because collection number is assigned by collector
1087	02/21/2012 06:21 PM	Aaron Marcuse-Kubitza	SALVIAS tests: Fixed invalid accepted test outputs due to not running `make empty_db` before running tests when using the no-redo optimization shortcut
1086	02/21/2012 06:14 PM	Aaron Marcuse-Kubitza	SALVIAS mappings: Fixed plot key mappings to map the correct values to subplot and parent plot
1073	02/21/2012 03:31 PM	Aaron Marcuse-Kubitza	VegBIEN mappings: Mapped datasource name to new project.datasource. Fixes project duplicate elimination.
1071	02/21/2012 03:02 PM	Aaron Marcuse-Kubitza	VegBIEN mappings: Mapped current lat/long to centerlat/long as well so location duplicate elimination will work properly
1069	02/21/2012 01:30 PM	Aaron Marcuse-Kubitza	sql.py: put(): When encountering a DuplicateKeyException, use dict_subset_right_join to fill in explicit NULL values for columns which don't have data. This causes the database to use the UNIQUE constraint's index to look up the record, instead of relying on individual column indexes for the columns that did have data, which may or may not be available.
1063	02/21/2012 12:15 PM	Aaron Marcuse-Kubitza	VegX-VegBIEN mapping: Mapped locationevent.methodnarrative to new plotmethod table
1056	02/18/2012 04:56 PM	Aaron Marcuse-Kubitza	VegX-VegBIEN mapping: Map datasource name (/_ignore/inLabel) to new locationevent.datasource instead of location.reference
1025	02/17/2012 03:59 PM	Aaron Marcuse-Kubitza	VegBIEN mappings: Remap to new locationdetermination fields
1015	02/16/2012 05:05 PM	Aaron Marcuse-Kubitza	VegX-VegBIEN mapping: Use the input data source's label (e.g. SALVIAS) everywhere a reference is needed
878	02/07/2012 01:38 PM	Aaron Marcuse-Kubitza	Added SALVIAS-CSV tests

Project

General

Profile