/ - Changes - BIEN 3 - NCEAS Projects

root @ 4015

#	Date	Author	Comment
4015	08/15/2012 12:39 AM	Aaron Marcuse-Kubitza	inputs/SALVIAS-CSV/maps/VegCSV.organisms.csv: census_date: Don't map directly to the year, as this field is allowed to be a full date even though our data sample contains only years. Note that _date/date will automatically detect plain years and treat them as years, and so will casts to timestamp.
4014	08/15/2012 12:33 AM	Aaron Marcuse-Kubitza	inputs/SALVIAS*/maps/VegCSV.organisms.csv: census_date: Documented that this is for the subplot, not the organism, as all organisms in a subplot have the same value for it
4013	08/15/2012 12:09 AM	Aaron Marcuse-Kubitza	mappings/DwC2-VegBIEN.specimens.csv: verbatimLatitude/verbatimLongitude: Fixed mappings to use _alt/2 instead of _alt/1 to avoid collisions with decimalLatitude/decimalLongitude
4012	08/14/2012 11:54 PM	Aaron Marcuse-Kubitza	schemas/functions.sql: _merge(): Changed sort_orders to match the $-variable name instead of the function parameter name, so each line of the VALUES clause would use the same number for both
4011	08/14/2012 11:52 PM	Aaron Marcuse-Kubitza	schemas/functions.sql: _merge(): Filter out NULL values as optimization so DISTINCT ON only has to consider non-NULL values
4010	08/14/2012 11:48 PM	Aaron Marcuse-Kubitza	schemas/functions.sql: join_strs(): Return NULL if all strings were NULL or ''. This fixes unexpected behavior in _merge() where all elements are NULL but the return value is non-NULL.
4009	08/14/2012 11:32 PM	Aaron Marcuse-Kubitza	schemas/functions.sql: Added join_strs_transform_preserve_empty() and use it in join_strs_transform_fold_empty()
4008	08/14/2012 11:25 PM	Aaron Marcuse-Kubitza	schemas/functions.sql: Renamed join_strs_() to join_strs_transform_fold_empty() for clarity and to indicate that it's for use by the join_strs() aggregate
4007	08/14/2012 11:11 PM	Aaron Marcuse-Kubitza	mappings/DwC2-VegBIEN.specimens.csv: recordNumber: Added VegCSV mappings for it
4006	08/14/2012 10:51 PM	Aaron Marcuse-Kubitza	mappings/DwC2-VegBIEN.specimens.csv: occurrenceID: Added VegCSV mappings for it
4005	08/14/2012 10:44 PM	Aaron Marcuse-Kubitza	mappings/DwC2-VegBIEN.specimens.csv: mappings to /location/sourceaccessioncode: Added _alt to prioritize them properly
4004	08/14/2012 10:39 PM	Aaron Marcuse-Kubitza	inputs/UNCC/maps/DwC.specimens.csv: herbarium: Fixed mapping to go to institutionCode instead of collectionCode
4003	08/14/2012 10:36 PM	Aaron Marcuse-Kubitza	mappings/DwC2-VegBIEN.specimens.csv: Remapped institutionCode/collectionCode/catalogNumber location mappings to location.authorlocationcode
4002	08/14/2012 09:50 PM	Aaron Marcuse-Kubitza	schemas/vegbien.ERD.mwb: Reset methodtaxonclass lines so that only one needs to be repositioned after syncing with the schema
4001	08/14/2012 09:31 PM	Aaron Marcuse-Kubitza	mappings/VegCSV-VegBIEN.specimens.csv: locationID: Removed mapping to locationevent.sourceaccessioncode, because locationID relates to the plot, not the plot event. (The locationevent is scoped by the location when the sourceaccessioncode and authoreventcode are not specified, so duplicate elimination will still occur correctly.)
4000	08/14/2012 09:27 PM	Aaron Marcuse-Kubitza	mappings/DwC2-VegBIEN.specimens.csv: Mapped locationID, for mergability with VegCSV
3999	08/14/2012 09:04 PM	Aaron Marcuse-Kubitza	mappings/VegCSV-VegBIEN.specimens.csv: plotName: Removed authoreventcode mapping because plotName relates to the plot, not the plot event. (The locationevent is scoped by the location when the authoreventcode is not specified, so duplicate elimination will still occur correctly.) Instead map only authoreventcode-related fields (currently CVS's authorObsCode) to authoreventcode, via DwC's (confusingly-named) fieldNumber ("An identifier given to the event in the field").
3998	08/14/2012 08:40 PM	Aaron Marcuse-Kubitza	schemas/vegbien.sql: locationevent: locationevent_unique_within_location: Added authoreventcode to index. It was already in the locationevent_unique_within_parent_by_authoreventcode index, but also needed to be in the no-parent (non-subplot) index. This fixes locationevent duplicate elimination when a locationevent sourceaccessioncode is not specified.
3997	08/14/2012 08:27 PM	Aaron Marcuse-Kubitza	schemas/vegbien.sql: location: location_unique_within_datasource unique index: Added COALESCE and `WHERE sourceaccessioncode IS NOT NULL` now that sourceaccessioncode is nullable. Renamed location_unique_within_datasource and location_unique_authorlocationcode to location_unique_within_datasource_by_... to show that both are alternatives for globally unique keys. schemas/vegbien.ERD.mwb: Moved elements slightly to reduce the number of lines that need to be repositioned after syncing with the schema.
3996	08/14/2012 07:35 PM	Aaron Marcuse-Kubitza	mappings/DwC2-VegBIEN.specimens.csv: Mapped verbatimElevation and samplingProtocol, for mergability with VegCSV
3995	08/14/2012 07:12 PM	Aaron Marcuse-Kubitza	inputs/import.stats.xls: Updated with stats from latest import
3994	08/13/2012 06:12 PM	Aaron Marcuse-Kubitza	mappings/VegCSV-VegBIEN.specimens.csv: location unique keys: Map to a new parent location for the location, instead of a parent locationevent for the locationevent. This much simpler mapping (which does not require _alt or _merge) is possible now that the necessary unique indexes have been set up.
3993	08/13/2012 05:52 PM	Aaron Marcuse-Kubitza	Regenerated vegbien.ERD exports, now including both pages in vegbien.ERD.core.pdf. Renamed schemas/vegbien.ERD.core.pdf to vegbien.ERD.pdf because it now includes the full schema.
3992	08/13/2012 05:48 PM	Aaron Marcuse-Kubitza	schemas/filter_ERD.csv: Removed extraneous lines to improve readability. schemas/vegbien.ERD.mwb: Reconfigured elements to put only the most important ones in the core subset (the top page).
3991	08/13/2012 03:59 PM	Aaron Marcuse-Kubitza	schemas/vegbien.sql: location: Made sourceaccessioncode optional if authorlocationcode is specified, since either of these fields can now serve as the unique key
3990	08/13/2012 03:39 PM	Aaron Marcuse-Kubitza	mappings/VegCSV-VegBIEN.specimens.csv: Map to new location.authorlocationcode
3989	08/13/2012 03:23 PM	Aaron Marcuse-Kubitza	schemas/vegbien.sql: location: Support uniquely specifying a location by its authorlocationcode
3988	08/13/2012 03:13 PM	Aaron Marcuse-Kubitza	schemas/vegbien.sql: location: Added authorlocationcode to unique indexes
3987	08/13/2012 02:58 PM	Aaron Marcuse-Kubitza	schemas/vegbien.sql: location: Added authorlocationcode
3986	08/13/2012 02:45 PM	Aaron Marcuse-Kubitza	schemas/vegbien.sql: location: Added location_unique_within_parent_by_coords unique index that uses COALESCE, replacing location_unique_subplot_coords unique constraint
3985	08/13/2012 02:07 PM	Aaron Marcuse-Kubitza	mappings/VegCSV-VegBIEN.specimens.csv: maximumElevationInMeters: Fixed bug where _rangeEnd filter needed to be removed because this only works on a field which can be either a range or the start of a range, such as minimumElevationInMeters (on an end-of-range field, a single value will be removed completely). Added _alt for mergeability with DwC. minimumElevationInMeters: Added elevationrange-to mapping using _rangeEnd for mergeability with DwC.
3984	08/13/2012 01:53 PM	Aaron Marcuse-Kubitza	mappings/DwC2-VegBIEN.specimens.csv, VegCSV-VegBIEN.specimens.csv: minimum/maximumElevationInMeters, minimum/maximumDepthInMeters: Remove any "ca." prefix from value. Doing this on all elevation/depth fields will make the DwC and VegCSV mappings mergeable.
3983	08/13/2012 01:04 PM	Aaron Marcuse-Kubitza	mappings/VegCSV-VegBIEN.specimens.csv: locality: Mapped using same XPath as DwC, to enable merging
3982	08/13/2012 01:01 PM	Aaron Marcuse-Kubitza	mappings/DwC2-VegBIEN.specimens.csv: Mapped individualCount. This will enable merging with VegCSV.
3981	08/13/2012 12:51 PM	Aaron Marcuse-Kubitza	mappings/VegCSV-VegBIEN.specimens.csv: Cleaned up. This still needs to be run manually with `make mappings/` because the derived maps are symlinks rather than make targets, so make never touches the non-derived map and doesn't run its recipe in the automated tests
3980	08/13/2012 12:48 PM	Aaron Marcuse-Kubitza	mappings/DwC2-VegBIEN.specimens.csv, VegCSV-VegBIEN.specimens.csv: taxondetermination mappings: Removed iscurrent=true because it is not the role of the mappings to specify which taxondetermination is the current one. Eventually, the order of the determinations will need to be specified using a sort # or similar, and the DB will select the current one for queries to use. Ensure all mappings have :[isoriginal=true] so that they match up between DwC and VegCSV.
3979	08/13/2012 12:35 PM	Aaron Marcuse-Kubitza	mappings/DwC2-VegBIEN.specimens.csv, VegCSV-VegBIEN.specimens.csv: taxondetermination mappings: Ensure all mappings have :[iscurrent=true] or equivalent so that they sort together, and match up between DwC and VegCSV
3978	08/13/2012 12:19 PM	Aaron Marcuse-Kubitza	mappings/VegCSV-VegBIEN.specimens.csv: individualCount: Disambiguated alternate meaning as stem count by changing stem count fields to map to new stemCount term, which maps to plantobservation.stemcount
3977	08/13/2012 12:12 PM	Aaron Marcuse-Kubitza	mappings/Veg+.terms.csv: Added stemCount
3976	08/13/2012 12:10 PM	Aaron Marcuse-Kubitza	mappings/VegCSV-VegBIEN.specimens.csv: Cleaned up
3975	08/13/2012 12:01 PM	Aaron Marcuse-Kubitza	mappings/DwC2-VegBIEN.specimens.csv: Mapped identificationQualifier. This will enable merging with VegCSV.
3974	08/13/2012 11:59 AM	Aaron Marcuse-Kubitza	mappings/VegCSV-VegBIEN.specimens.csv: identificationQualifier (taxon fit): Removed mapping to prefix of binomial field, since that field should just contain what the datasource said was the binomial. It's TNRS's job to concatenate the taxon fit, etc. with the binomial and other name parts for name resolution.
3973	08/13/2012 11:27 AM	Aaron Marcuse-Kubitza	mappings/DwC2-VegBIEN.specimens.csv: fieldNumber: Remapped to authoreventcode because this is (confusingly) the author code for the event, according to the DwC definition
3972	08/13/2012 11:22 AM	Aaron Marcuse-Kubitza	inputs/NY, ARIZ: FieldNumber: Remapped to recordNumber because term usage was inconsistent with DwC definition. Datasources sometimes confuse this term, because it seems like the collection number, but is actually the author code for the event (VegBank's authorObsCode).
3971	08/13/2012 11:20 AM	Aaron Marcuse-Kubitza	schemas/vegbank.ERD.pdf: Restored to VegBank ERD, which had gotten overwritten when the vegbien.ERD exports were regenerated
3970	08/13/2012 10:58 AM	Aaron Marcuse-Kubitza	mappings/DwC1-DwC2.specimens.csv: Removed Source column and source-related comments because this information is now maintained in mappings/Veg+.terms.csv
3969	08/13/2012 10:55 AM	Aaron Marcuse-Kubitza	mappings/DwC2-VegBIEN.specimens.csv: Removed Source column because this information is now maintained in mappings/Veg+.terms.csv
3968	08/13/2012 10:49 AM	Aaron Marcuse-Kubitza	mappings/VegCSV-VegBIEN.specimens.csv: Removed Source column and source-related comments because this information is now maintained in mappings/Veg+.terms.csv
3967	08/13/2012 10:44 AM	Aaron Marcuse-Kubitza	Added mappings/Veg+.terms.csv, which will serve the purpose of listing all available terms with their source. This will remove the need to store the sources in the mappings, where they are out of place and difficult to maintain during refactoring.
3966	08/13/2012 10:37 AM	Aaron Marcuse-Kubitza	Added mappings/Veg+.terms.csv, which will serve the purpose of listing all available terms with their source. This will remove the need to store the sources in the mappings, where they are out of place and difficult to maintain during refactoring.
3965	08/13/2012 10:19 AM	Aaron Marcuse-Kubitza	mappings/VegX-VegCSV.stems.csv: Removed Comments and Source columns because this information is now maintained in mappings/VegCSV-VegBIEN.specimens.csv. This will simplify later VegCSV refactoring, because the Comments and Source columns will not need to be changed along with the VegCSV column.
3964	08/13/2012 10:02 AM	Aaron Marcuse-Kubitza	mappings/VegCSV-VegBIEN.specimens.csv: Removed Comments and Source columns because this information is now maintained in mappings/VegCSV-VegBIEN.specimens.csv. This will simplify later VegCSV refactoring, because the Comments and Source columns will not need to be changed along with the VegCSV column.
3963	08/13/2012 10:00 AM	Aaron Marcuse-Kubitza	mappings/VegCSV-VegBIEN.specimens.csv: Changed plotID to locationID and parentPlotID to parentLocationID to use DwC-related terms
3962	08/13/2012 09:31 AM	Aaron Marcuse-Kubitza	mappings/DwC2-VegBIEN.specimens.csv: collectionID: Fixed mapping to point to collectioncode_dwc instead of collectionnumber, as this is an ID of the collection rather than within it
3961	08/13/2012 09:15 AM	Aaron Marcuse-Kubitza	inputs/import.stats.xls: Updated with stats from latest import
3960	08/10/2012 10:29 PM	Aaron Marcuse-Kubitza	schemas: Renamed vegbien.ERD.pdf to vegbien.ERD.1_pg.pdf since it's not the primary PDF that should be used, due to its slow load time
3959	08/10/2012 10:26 PM	Aaron Marcuse-Kubitza	Regenerated vegbien.ERD exports
3958	08/10/2012 10:23 PM	Aaron Marcuse-Kubitza	schemas/vegbien.sql: specimenreplicate: specimenreplicate_plantobservation_1_to_1: Only apply when sourceaccessioncode and catalognumber_dwc are NULL, in order to support multiple specimenreplicates for one plant in plots data. specimenreplicate_unique_catalognumber: Added plantobservation_id, so that catalognumber_dwc (a sort of authorSpecimenCode for plots data) only needs to be unique within a plant. Eventually, we will want to migrate the mappings so that collectionnumber is used for this purpose instead.
3957	08/10/2012 10:16 PM	Aaron Marcuse-Kubitza	schemas/vegbien.sql: specimenreplicate: Made plantobservation_id optional again, since indirect vouchers do create specimenreplicates without a parent plantobservation. schemas/vegbien.ERD.mwb: Fixed lines.
3956	08/10/2012 10:02 PM	Aaron Marcuse-Kubitza	schemas/vegbien.sql: specimenreplicate: Made plantobservation_id required, since that is now the parent table fkey
3955	08/10/2012 10:00 PM	Aaron Marcuse-Kubitza	schemas/vegbien.ERD.mwb: Fixed lines
3954	08/10/2012 09:51 PM	Aaron Marcuse-Kubitza	schemas/vegbien.ERD.mwb: Adjusted lines. Adjusted position of locationdetermination to put location directly next to locationevent. Expanded location to fill newly-available space.
3953	08/10/2012 09:37 PM	Aaron Marcuse-Kubitza	schemas/vegbien.sql: locationevent: Renamed authorlocationcode to authoreventcode to be consistent with the table name. Note that for our current datasources, the plot = the plot event, so the authoreventcode and authorlocationcode/authorPlotCode will be the same.
3952	08/10/2012 09:22 PM	Aaron Marcuse-Kubitza	mappings/VegCSV-VegBIEN.specimens.csv: Changed VegCSV term fieldNumber (from DwC) to recordNumber to be consistent with the TDWG meaning of fieldNumber, which defines it as the author code for the event, not the organism (what VegBIEN calls the authorlocationcode and VegBank calls the authorObsCode)
3951	08/10/2012 08:47 PM	Aaron Marcuse-Kubitza	mappings/VegCSV-VegBIEN.specimens.csv: Comments: Removed no longer applicable comments about XPath syntax added to affect sort order
3950	08/10/2012 08:35 PM	Aaron Marcuse-Kubitza	mappings/VegCSV-VegBIEN.specimens.csv: height: Removed mapping to plantobservation.overallheight, since the height is a stem field rather than a plant field. Note that a height in the organisms table will be mapped to the height in a single stemobservation for that plant, with NULL sourceaccessioncode and authorstemcode. Note also that this change is possible because no mapped datasource yet provides a valid overallheight with multiple stems or that differs from its single stem's height. (Although SALVIAS sometimes provides both a stem height and an organism height, that height is always either the same, or the organism height is invalid. See <https://projects.nceas.ucsb.edu/nceas/projects/bien/wiki/SALVIAS_issues#Some-organisms-have-one-stem-but-different-heights-in-the-organisms-and-stems-tables>.)
3949	08/10/2012 06:56 PM	Aaron Marcuse-Kubitza	mappings/DwC2-VegBIEN.specimens.csv: establishmentMeans: Removed obsolete mapping to growthform, since growthforms and cultivated/native information are no longer merged into one field in VegBIEN (which they were when this mapping was created)
3948	08/10/2012 06:18 PM	Aaron Marcuse-Kubitza	mappings/VegCSV-VegBIEN.specimens.csv: decimalLatitude/decimalLongitude: Added _nullIf suffix for mergability with VegCSV-VegBIEN.specimens.csv
3947	08/10/2012 06:10 PM	Aaron Marcuse-Kubitza	mappings/VegCSV-VegBIEN.specimens.csv: coordinateUncertaintyInMeters: Added _noCV suffix for mergability with VegCSV-VegBIEN.specimens.csv
3946	08/10/2012 06:00 PM	Aaron Marcuse-Kubitza	mappings/DwC2-VegBIEN.specimens.csv: catalogNumber: Added _if wrapper for mergability with VegCSV-VegBIEN.specimens.csv
3945	08/10/2012 05:52 PM	Aaron Marcuse-Kubitza	mappings/VegCSV-VegBIEN.specimens.csv: catalogNumber direct voucher _if statement: Changed @name to "if indirect voucher", so that it's logical consistent with the else branch following it. It was previously "if direct voucher" because the _if statement only contained a case for direct vouchers, and the else branch was being used in place of a _not() function.
3944	08/10/2012 05:38 PM	Aaron Marcuse-Kubitza	mappings/roots: plots roots: Default to using VegCSV instead of VegX for new plots datasources
3943	08/10/2012 05:35 PM	Aaron Marcuse-Kubitza	mappings/VegCSV-VegBIEN.specimens.csv: catalogNumber _if statements: Changed @names to more descriptive comments. This also prevents the @name from looking confusingly like the condition of the _if statement, which is actually supplied through the cond param and is usually located in a separate mapping.
3942	08/10/2012 05:20 PM	Aaron Marcuse-Kubitza	mappings/VegCSV-VegBIEN.specimens.csv: catalogNumber: Split _if apart into separate _ifs for the indirect and direct voucher cases. Moved direct voucher _if inwards so it is just wrapping catalognumber_dwc itself. This will enable this mapping to be used for specimens data, which is always considered a direct voucher and will always have this _if return true. Also moved indirect voucher _if inwards in the same way, so that a future SQL function implementation of _if only needs to concern itself with returning one value or another, not with handling entire XML subtrees. Note that if the indirect voucher _if returns false, NOT NULL and CHECK constraint violations will cause the intervening voucher and specimenreplicate elements to be deleted, thus having the same effect. Use new @name syntax for distinguishing _if statements.
3941	08/10/2012 05:02 PM	Aaron Marcuse-Kubitza	mappings: Removed no longer used for_review/VegBIEN-DwC2.specimens.csv
3940	08/10/2012 04:49 PM	Aaron Marcuse-Kubitza	xml_func.py: _if(): Changed documentation about name param for distinguishing separate _if statements to use @name attribute instead, so that the XML/SQL function mechanism doesn't have to deal with code that's solely for XPath merging
3939	08/10/2012 04:09 PM	Aaron Marcuse-Kubitza	Regenerated vegbien.ERD exports
3938	08/10/2012 04:08 PM	Aaron Marcuse-Kubitza	schemas/vegbien.ERD.mwb: Fixed lines
3937	08/10/2012 03:57 PM	Aaron Marcuse-Kubitza	schemas/filter_ERD.csv: Removed no longer applicable specimenreplicate inheritance filters
3936	08/10/2012 03:50 PM	Aaron Marcuse-Kubitza	inputs/import.stats.xls: Updated with stats from latest import. Note that the import now includes additional date parsing on all date fields, which adds 1/2-1 hour to the import time. Eventually, we will want to translate _date() to PL/pgSQL and only use extra date processing if PostgreSQL's cast to timestamp doesn't work, which should greatly reduce this time.
3935	08/09/2012 05:37 PM	Aaron Marcuse-Kubitza	Regenerated vegbien.ERD exports
3934	08/09/2012 05:35 PM	Aaron Marcuse-Kubitza	schemas/vegbien.sql: Removed inheritance link between specimenreplicate and taxonoccurrence, which is not needed now that specimenreplicate is mapped via plantobservation. mappings/DwC2-VegBIEN.specimens.csv: As part of this change, moved mappings to specimenreplicate fields inherited from taxonoccurrence to go directly to taxonoccurrence.
3933	08/09/2012 05:15 PM	Aaron Marcuse-Kubitza	Regenerated vegbien.ERD exports
3932	08/09/2012 05:14 PM	Aaron Marcuse-Kubitza	schemas/vegbien.ERD.mwb: Synced with schema
3931	08/09/2012 05:13 PM	Aaron Marcuse-Kubitza	mappings/VegCSV-VegBIEN.specimens.csv: catalogNumber: Default to mapping via plantobservation rather than via voucher when no voucherType is specified, in order to be consistent with the specimens data mapping for catalogNumber
3930	08/09/2012 03:31 PM	Aaron Marcuse-Kubitza	Regenerated mappings/for_review/VegX-VegCSV.stems.csv. Note that running `make mappings/` did not change mappings/VegX-VegCSV.stems.csv, because all changes were deletions of lines.
3929	08/09/2012 03:29 PM	Aaron Marcuse-Kubitza	mappings/VegX-VegCSV.stems.csv: Removed no longer used user-defined terms (simpleUserdefined). Note that CTFS does use user-defined terms, but these are all defined in its own map spreadsheet.
3928	08/09/2012 03:24 PM	Aaron Marcuse-Kubitza	mappings: Removed no longer needed VegX-VegBIEN mappings
3927	08/09/2012 03:23 PM	Aaron Marcuse-Kubitza	mappings/Makefile: Made VegCSV-VegBIEN.specimens.csv a non-derived map, since the VegX-VegCSV mapping is no longer used. This causes automatic creation of a for_review file.
3926	08/09/2012 03:21 PM	Aaron Marcuse-Kubitza	plots inputs: Removed maps/.VegX.*.csv.last_cleanup
3925	08/09/2012 03:13 PM	Aaron Marcuse-Kubitza	plots inputs: Remapped all VegX via maps to VegCSV. See steps at <https://projects.nceas.ucsb.edu/nceas/projects/bien/wiki/VegX-%3EVegCSV>.
3924	08/09/2012 02:45 PM	Aaron Marcuse-Kubitza	join: Added map_1_core_only option that uses only columns 0 and 1 of map_1. This is useful for one-time refactoring joins where the Source column, mappings comments, etc. shouldn't be part of the datasource's via map (although they will be part of the autogenerated VegBIEN map)
3923	08/09/2012 02:33 PM	Aaron Marcuse-Kubitza	join: Use opts.env_usage() for usage message
3922	08/09/2012 02:04 PM	Aaron Marcuse-Kubitza	mappings: Made VegCSV-VegBIEN.{plots,organisms,stems}.csv symlinks to VegCSV-VegBIEN.specimens.csv
3921	08/09/2012 01:46 PM	Aaron Marcuse-Kubitza	mappings/Makefile: VegCSV-VegBIEN.specimens.csv: Commented out combining with DwC2-VegBIEN mappings, because merging DwC and VegX/VegCSV into one map is a lower priority than replacing all datasource VegX mappings with VegCSV (which does not require the merging but does require XPaths that don't collide, which is not yet the case)
3920	08/09/2012 01:40 PM	Aaron Marcuse-Kubitza	lib/xml_func.py: _if(): Made then param optional, so that user can just map to the else branch as a shortcut for logically inverting the condition. (Note that a _not() XML function does not exist yet, so this is also a workaround.)
3919	08/09/2012 01:29 PM	Aaron Marcuse-Kubitza	VegBIEN mappings: Wrapped dates in _date() and _dateRangeStart()/_dateRangeEnd(), to assist in importing date and date range values that PostgreSQL cannot parse. This will increase the import time, but hopefully also decrease the # of invalid values in the errors tables. (These functions can later be optimized to reduce the impact on import time.)
3918	08/09/2012 01:25 PM	Aaron Marcuse-Kubitza	sql_io.py: put_table(): is_literals: is_function: Fixed bug where function call needed to be recreated in each iteration of the main loop, because the arguments to the function, which are based on mapping, may change as the result of error handling replacing invalid values with NULL
3917	08/09/2012 01:13 PM	Aaron Marcuse-Kubitza	sql_io.py: put_table(): is_literals: Fixed bug where sql.select() that calls the function needed to be run recoverably, to auto-rollback errors. Made sql.select() cacheable because SQL functions are immutable, so it should be idempotent.
3916	08/09/2012 01:03 PM	Aaron Marcuse-Kubitza	mappings/DwC2-VegBIEN.specimens.csv: Remapped taxonRemarks to taxondetermination.notes because http://rs.tdwg.org/dwc/terms/#taxonRemarks indicates that these notes are "about the taxon", not the specimen/plant in general

Project

General

Profile