Project

General

Profile

Statistics
| Revision:
  • svn:ignore: .~*

# Date Author Comment
12000 01/23/2014 01:04 AM Aaron Marcuse-Kubitza

mappings/VegCore-VegBIEN.csv: event__participant: allow multiple event__participant columns

11996 01/22/2014 10:59 PM Aaron Marcuse-Kubitza

mappings/VegCore-VegBIEN.csv: project_participant: use [!...] negative lookahead assertion so that multiple project_participant columns will properly map to separate projectcontributor rows

11970 01/20/2014 11:33 AM Aaron Marcuse-Kubitza

moved everything into /trunk/ to create the standard svn layout, for use with tools that require this (eg. git-svn). IMPORTANT: do NOT do an `svn up`. instead, re-use your working copy's existing files with `svn switch` (http://svnbook.red-bean.com/en/1.6/svn.ref.svn.c.switch.html).

11959 01/15/2014 10:08 AM Aaron Marcuse-Kubitza

mappings/VegCore-VegBIEN.csv: mapped project_participant

11958 01/15/2014 10:07 AM Aaron Marcuse-Kubitza

mappings/VegCore.htm: regenerated from wiki. merged confusingly-named TaxonObservation into TaxonOccurrence. added project_participant.

11513 10/30/2013 09:49 PM Aaron Marcuse-Kubitza

mappings/VegCore-VegBIEN.csv: mapped taxon_determination__is_current, taxon_determination__is_original

11512 10/30/2013 09:46 PM Aaron Marcuse-Kubitza

bugfix: mappings/VegCore-VegBIEN.csv: main taxondetermination: use [!isoriginal=true] instead of [!isoriginal] so that adding a manual isoriginal field does not prevent this selector from matching

11509 10/30/2013 09:02 PM Aaron Marcuse-Kubitza

mappings/VegCore.htm: regenerated from wiki. added taxon_determination__is_current, taxon_determination__is_original.

11107 09/29/2013 08:58 PM Aaron Marcuse-Kubitza

bugfix: mappings/VegCore-VegBIEN.csv: nest all taxonoccurrences inside a stratum event, so that the parent locationevent is always fully populated before child locationevents point to it. (previously, a stub parent event was created when the child event was imported first, which blocked the fully-populated parent event from being inserted later on.) this uses auto-folding (for VegBank/CVS) and auto-forwarding (for other datasources) to prune empty stratum events for taxonoccurrences that don't have strata. (see wiki.vegpath.org/Auto-folding, wiki.vegpath.org/Auto-forwarding for more info about these normalization techniques.) note that the inserted row counts stay exactly the same for all datasources except VegBank (which was being fixed), indicating that this signficant change to the mappings did not change the semantics of the import of taxonoccurrences.

11105 09/28/2013 10:40 PM Aaron Marcuse-Kubitza

bugfix: mappings/VegCore-VegBIEN.csv: stratum's locationevent: link this to the parent locationevent, so that the parent locationevent's information (such as locationeventcontributors) is accessible to the stratum's locationevent

11097 09/28/2013 06:51 AM Aaron Marcuse-Kubitza

mappings/VegCore-VegBIEN.csv: mapped event__participant

11096 09/28/2013 05:51 AM Aaron Marcuse-Kubitza

mappings/VegCore.htm: regenerated from wiki. added EventParticipant table containing event__participant.

11082 09/24/2013 02:14 PM Aaron Marcuse-Kubitza

mappings/VegCore-VegBIEN.csv: mapped stratum__name

11081 09/24/2013 02:09 PM Aaron Marcuse-Kubitza

schemas/vegbien.sql: added source_id to allow different datasources to have their own strata

11078 09/24/2013 01:45 PM Aaron Marcuse-Kubitza

schemas/vegbien.sql: added source_id to allow different datasources to have their own strata

11071 09/22/2013 08:25 PM Aaron Marcuse-Kubitza

mappings/VegCore.htm: regenerated from wiki. added Stratum table.

10848 08/31/2013 07:36 PM Aaron Marcuse-Kubitza

bugfix: mappings/VegCore-VegBIEN.csv: don't map datasetURL to source.url for taxa-only data (this mapping should only occur for Source tables)

10722 08/23/2013 11:41 PM Aaron Marcuse-Kubitza

mappings/VegCore-VegBIEN.csv: mapped datasetURL

10721 08/23/2013 11:38 PM Aaron Marcuse-Kubitza

mappings/VegCore-VegBIEN.csv: mapped datasetURL

10720 08/22/2013 06:12 PM Aaron Marcuse-Kubitza

fix: mappings/VegCore-VegBIEN.csv: source__modified_date: remapped to pubdate instead of datelastmodified because this is actually metadata for the source itself, rather than for the VegBIEN record of the source

10717 08/22/2013 05:38 PM Aaron Marcuse-Kubitza

mappings/VegCore-VegBIEN.csv: mapped source__modified_date (different from vegcore.vegpath.org?modified, which is for the data record)

10716 08/22/2013 05:36 PM Aaron Marcuse-Kubitza

mappings/VegCore.htm: regenerated from wiki. added source__version (= edition), source__modified_date.

10714 08/22/2013 04:38 PM Aaron Marcuse-Kubitza

mappings/VegCore-VegBIEN.csv: mapped edition

10616 08/07/2013 09:07 AM Aaron Marcuse-Kubitza

mappings/VegCore.htm: regenerated from wiki. added EQUIV (also mapped in mappings/VegCore-VegBIEN.csv).

10565 08/01/2013 02:56 PM Aaron Marcuse-Kubitza

mappings/VegCore-VegBIEN.csv: mapped municipality

10420 07/25/2013 04:43 PM Aaron Marcuse-Kubitza

mappings/VegCore.htm: regenerated from wiki. added dayOfYear (=julianDay), which is different from startDayOfYear/endDayOfYear.

10289 07/17/2013 02:37 AM Aaron Marcuse-Kubitza

mappings/VegCore.htm: regenerated from wiki. Special terms: added instructions for adding a distinguishing suffix to each special term in the format special_term#suffix. this is needed for new-style import to make the resulting column name unique within the staging table.

10288 07/17/2013 02:35 AM Aaron Marcuse-Kubitza

mappings/VegCore-VegBIEN.csv: mapped DUPLICATE to nothing so that it would not be treated as an unmapped term

10287 07/17/2013 02:33 AM Aaron Marcuse-Kubitza

mappings/VegCore.htm: regenerated from wiki. Special terms: added DUPLICATE.

10285 07/17/2013 01:51 AM Aaron Marcuse-Kubitza

mappings/VegCore.htm: regenerated from wiki. moved UNUSED, PRIVATE underneath OMIT as subterms.

10284 07/14/2013 06:02 AM Aaron Marcuse-Kubitza

mappings/VegCore.htm: Regenerated from wiki

10173 07/06/2013 03:37 PM Aaron Marcuse-Kubitza

mappings/VegCore-VegBIEN.csv: cultivated, oldGrowth: use just cultivated if it's provided, rather than /_alt-ing it back with oldGrowth (which it was generated from)

10172 07/06/2013 03:30 PM Aaron Marcuse-Kubitza

bugfix: mappings/VegCore-VegBIEN.csv: fixed priority of cultivated and oldGrowth so cultivated is used first if it's available

9882 06/12/2013 10:49 AM Aaron Marcuse-Kubitza

mappings/VegCore-VegBIEN.csv: genus->taxonlabel.taxonomicname: filter out genera that contain numbers (using new _filter_genus()), which break TNRS and prevent it from matching any other parts of the name. later, these genera can instead be moved to the end of the name, where TNRS will correctly match them as Unmatched_terms.

9459 05/17/2013 06:00 PM Aaron Marcuse-Kubitza

bugfix: mappings/VegCore-VegBIEN.csv: place.geovalid: added missing /1 after _alt

9415 05/16/2013 04:15 PM Aaron Marcuse-Kubitza

schemas/vegbien.sql: place.geovalid: added latLongDomainValid to the values to _and together

9413 05/16/2013 04:06 PM Aaron Marcuse-Kubitza

mappings/VegCore-VegBIEN.csv: place.geovalid: use false instead of NULL

8235 03/28/2013 04:25 PM Aaron Marcuse-Kubitza

mappings/VegCore.htm: Renamed salvias_plots table plotMetadata to PlotMetadata because of SALVIAS refresh on nimoy

8234 03/28/2013 04:18 PM Aaron Marcuse-Kubitza

mappings/VegCore.htm: Regenerated from wiki. Added flower, fruit, commonName.

8233 03/28/2013 03:37 PM Aaron Marcuse-Kubitza

mappings/Makefile: $(vocab); bin/redmine_synonyms: Support crossed out (deprecated) terms

8206 03/27/2013 08:23 PM Aaron Marcuse-Kubitza

mappings/VegCore-VegBIEN.csv: subplotID,subplot -> location.sourceaccessioncode: Fixed bug where need /_first to handle the case where both subplotID and subplot are provided

8078 03/19/2013 02:28 AM Aaron Marcuse-Kubitza

mappings/VegCore-VegBIEN.csv: locationRemarks: Remapped to locationnarrative because location.notespublic is a boolean field

7866 03/06/2013 08:36 PM Aaron Marcuse-Kubitza

mappings/VegCore.htm: Regenerated from wiki. Renamed specimenDonor to specimenDonorInstitution to clarify that this is an institution, not a collector.

7841 03/05/2013 09:07 PM Aaron Marcuse-Kubitza

mappings/VegCore.htm: Regenerated from wiki. Renamed specimenHolders to specimenHolderInstitutions to make it obvious that this is a list of institutions, such as would be in institutionCode in a DwC export.

7832 03/05/2013 07:19 PM Aaron Marcuse-Kubitza

mappings/VegCore.htm: Regenerated from wiki. Taxon: Moved scrubbing-related terms to MatchedTaxon.

7821 03/05/2013 05:14 PM Aaron Marcuse-Kubitza

mappings/VegCore.htm: Regenerated from wiki. Renamed MatchedTaxonDetermination/AcceptedTaxonDetermination to MatchedTaxon/AcceptedTaxon because they relate to a taxon name rather than to an application of that name to a TaxonOccurrence

7816 03/05/2013 01:56 PM Aaron Marcuse-Kubitza

mappings/VegCore.htm: Regenerated from wiki. Split specimenIndexer into specimenStampingInstitution and specimenDonor (which are often the same).

7815 03/05/2013 12:25 PM Aaron Marcuse-Kubitza

mappings/VegCore-VegBIEN.csv: Removed specimenIndexer->institutionCode mappings because the institutionCode should refer only to the specimenHolder

7809 03/05/2013 03:02 AM Aaron Marcuse-Kubitza

mappings/VegCore.htm: Regenerated from wiki. Editing: Changed ambiguous terms templates to list the name of the alternative itself in the alternatives list, instead of the ambiguous term + a source suffix. This avoids the need to choose a source suffix for each alternative of an ambiguous term, as the ambiguous term itself (without the suffix) can be listed as the synonym. Coordinates: Disambiguated verbatimCoordinateSystem.

7808 03/05/2013 02:30 AM Aaron Marcuse-Kubitza

mappings/Makefile: VegCore.thesaurus.csv: `uniq`ify the map after applying initial translations, to support mappings that appear twice (once as the result of an alternative's synonym entry and again as the result of an ambiguous term's alternatives listing)

7806 03/01/2013 08:17 PM Aaron Marcuse-Kubitza

mappings/VegCore.htm: Regenerated from wiki. Added specimenDescription.

7805 03/01/2013 08:00 PM Aaron Marcuse-Kubitza

mappings/VegCore.htm: Regenerated from wiki. Split apart identificationQualifier and taxonFit. Split apart taxonConfidence and identificationVerificationStatus. Moved identificationQualifier to new QualifiedTaxonName table.

7716 02/26/2013 05:47 PM Aaron Marcuse-Kubitza

mappings/VegCore.htm: Regenerated from wiki. Remapped organismNotes to be a synonym of occurrenceRemarks, since notes on an organism are more generally notes on an occurrence.

7713 02/26/2013 05:19 PM Aaron Marcuse-Kubitza

mappings/VegCore-VegBIEN.csv: Mapped occurrenceRemarks

7712 02/26/2013 04:53 PM Aaron Marcuse-Kubitza

mappings/VegCore-VegBIEN.csv: Only use one of collection and collectionURL instead of concatenating them when both are provided (as requested by Brad <https://projects.nceas.ucsb.edu/nceas/projects/bien/wiki/Spot-checking#ACAD&gt;)

7703 02/26/2013 04:27 PM Aaron Marcuse-Kubitza

mappings/VegCore-VegBIEN.csv: _avg(): Use numeric param names to work with SQL functions

7680 02/26/2013 01:34 PM Aaron Marcuse-Kubitza

mappings/VegCore-VegBIEN.csv: Mapped latitude/longitude_DMS to coordinates.latitude_deg using new _dms_to_dd(text)

7668 02/26/2013 10:39 AM Aaron Marcuse-Kubitza

mappings/VegCore-VegBIEN.csv: latitude/longitude_deg,min,sec: Also mapped to the geoscrub coordinates entry

7667 02/26/2013 10:34 AM Aaron Marcuse-Kubitza

mappings/VegCore-VegBIEN.csv: latitude/longitude_sec: Fixed name, which had been incorrectly automapped to verbatim*

7666 02/26/2013 10:31 AM Aaron Marcuse-Kubitza

mappings/VegCore.htm: Regenerated from wiki. latitude/longitude: Moved _deg,min,sec components after unit alternatives so that unit alternatives following them wouldn't incorrectly be interpreted as alternatives for the *components instead of for the overall, unitless term.

7665 02/26/2013 10:22 AM Aaron Marcuse-Kubitza

mappings/VegCore-VegBIEN.csv: Mapped latitude/longitude_deg,min,sec

7664 02/26/2013 10:15 AM Aaron Marcuse-Kubitza

mappings/VegCore.htm: Regenerated from wiki. Merged coordinateUncertainty and coordinatePrecision, since datasources often use these interchangeably (the only difference is the DwC-recommended units).

7663 02/26/2013 10:08 AM Aaron Marcuse-Kubitza

mappings/VegCore.htm: Regenerated from wiki. Source: Added authors. Renamed creator to firstPublisher. Moved dataOwner to Source as list. Coordinates: latitude/longitude, coordinatePrecision: Added units disambiguation. latitude/longitude_DMS: Added components. Global IDs: Changed "globally unique" tag to "global ID" and hyperlinked it to <https://projects.nceas.ucsb.edu/nceas/projects/bien/wiki/Global_IDs> (which now includes detailed examples of the various types of global IDs).

7651 02/22/2013 11:15 PM Aaron Marcuse-Kubitza

mappings/VegCore.htm: Regenerated from wiki. Moved Forming IDs section to separate Global IDs wiki page (https://projects.nceas.ucsb.edu/nceas/projects/bien/wiki/Global_IDs).

7649 02/22/2013 10:29 PM Aaron Marcuse-Kubitza

mappings/VegCore.htm: Regenerated from wiki. Moved preferred ID order to Forming IDs section as examples list. Made examples globally unique by prefixing the outer ID that scopes the value.

7648 02/22/2013 09:29 PM Aaron Marcuse-Kubitza

mappings/VegCore.htm: Regenerated from wiki. Forming IDs: Added instructions for making a local ID global. Added note that these globally unique IDs are not the same as UUIDs. Forming preferred IDs: Reworded as a set of instructions rather than a definition. Clarified that this becomes the database primary key.

7647 02/22/2013 08:47 PM Aaron Marcuse-Kubitza

mappings/VegCore.htm: Regenerated from wiki. Added section on forming preferred IDs, which are the ID which is most directly associated with the identified object.

7646 02/22/2013 08:17 PM Aaron Marcuse-Kubitza

mappings/VegCore.htm: Regenerated from wiki. Documentation has been added on how to choose term names (https://projects.nceas.ucsb.edu/nceas/projects/bien/wiki/VegCore#Naming) and how to form globally unique ID values (https://projects.nceas.ucsb.edu/nceas/projects/bien/wiki/VegCore#Forming-IDs). Source and Specimen terms have been renamed to be self-explanatory and unambiguous (the DwC equivalents remain as synonyms). Short definitions of Source terms have been added to explain the differences between them. Source, Specimen, and Collection terms have been shortened according to the new instructions for choosing preferred term names (https://projects.nceas.ucsb.edu/nceas/projects/bien/wiki/VegCore#Naming).

7628 02/20/2013 01:10 PM Aaron Marcuse-Kubitza

mappings/VegCore.htm: Regenerated from wiki. verbatimScientificName, verbatimTaxonName have been merged into concatenatedScientificName, and ID has been disambiguated.

7610 02/20/2013 07:33 AM Aaron Marcuse-Kubitza

mappings/VegCore-VegBIEN.csv: Mapped reproductiveCondition

7608 02/20/2013 05:33 AM Aaron Marcuse-Kubitza

mappings/VegCore.htm: Regenerated from wiki. matched*Fit_fraction has been renamed to matched*Confidence_fraction.

7605 02/20/2013 05:08 AM Aaron Marcuse-Kubitza

mappings/VegCore.htm: Regenerated from wiki. Data owner terms and taxon synonyms have been added, and morphospecies has been disambiguated.

7527 02/12/2013 01:08 PM Aaron Marcuse-Kubitza

mappings/VegCore.htm: Regenerated from wiki. Brad's new DwC ID terms spreadsheet has now been added, and a number of the ID terms clarified, disambiguated, and recategorized. In particular, institutionCode has now been split into the custodialInstitutions and collectingInstitution, to differentiate between which institution has the specimen vs. stamped the specimen. This distinction is important because the catalogNumber, stamped on the specimen, is only unique within the collectingInstitution. Most datasources don't unambiguously specify which institution their institutionCode is referring to, so it has been assumed to be custodialInstitutions unless a data dictionary says otherwise (as is the case for UNCC). In addition, a MatchedTaxonDetermination table has been added with the *_matched fields from TNRS.

7525 02/12/2013 12:09 PM Aaron Marcuse-Kubitza

mappings/Makefile: VegCore.vocab.csv: Ignore leading ? when sorting so that ambiguous terms sort alphabetically with other terms. This prevents terms from moving from their previous location when they become ambiguous.

7523 02/12/2013 12:05 PM Aaron Marcuse-Kubitza

mappings/VegCore.vocab.csv: Changed line endings to \r\n in preparation for having a Python script run on it (which changes the line endings)

7522 02/12/2013 11:47 AM Aaron Marcuse-Kubitza

mappings/Makefile: VegCore.vocab.csv: Added back ambiguous terms, so that the vocabulary contains all terms defined by VegCore, regardless of whether they are ambiguous or unambiguous terms

7521 02/12/2013 11:44 AM Aaron Marcuse-Kubitza

mappings/Makefile: VegCore.vocab.csv: Added back synonyms, so that the vocabulary contains all terms defined by VegCore, regardless of whether they are synonyms or primary terms. This also prevents VegCore.vocab.csv from losing entries when terms are renamed, which made it difficult to verify that no terms were lost when refactoring.

7516 02/09/2013 07:45 AM Aaron Marcuse-Kubitza

mappings/VegCore.htm: Regenerated from wiki. The original*, accepted*, and verbatim* Taxon fields have now been moved to separate OriginalTaxonDetermination, AcceptedTaxonDetermination, and TaxonVerbatim tables.

7515 02/09/2013 06:52 AM Aaron Marcuse-Kubitza

mappings/VegCore.htm: Regenerated from wiki

7514 02/09/2013 06:34 AM Aaron Marcuse-Kubitza

mappings/VegCore.htm: Regenerated from wiki

7512 02/09/2013 04:04 AM Aaron Marcuse-Kubitza

mappings/VegCore.*.csv: Regenerated from wiki. A plain text label is now used for Replace with, which fixes a bug where the PRIVATE permalink pointed to its Replace with in realLatitude instead of its definition.

7510 02/09/2013 03:13 AM Aaron Marcuse-Kubitza

mappings/VegCore.*.csv: Regenerated from wiki. Alternatives now contain the "Alternative" label as plain text rather than as an image title, thus avoiding an HTML anchor conflict with the definition and allowing ambiguous terms to be placed before their alternatives as well as after.

7505 02/09/2013 02:47 AM Aaron Marcuse-Kubitza

mappings/VegCore.csv: Regenerated from wiki. Taxon terms with prefixes for other TaxonDeterminations now indicate the analogous term in an "analogous to" label next to the term

7504 02/07/2013 01:57 PM Aaron Marcuse-Kubitza

mappings/VegCore-VegBIEN.csv: datasourceRecordID: Fixed bug where also need to add datasourceRecordID next to occurrenceID for an institutionCode remap switch

7499 02/07/2013 11:55 AM Aaron Marcuse-Kubitza

mappings/VegCore-VegBIEN.csv: Mapped datasourceRecordID

7492 02/07/2013 07:22 AM Aaron Marcuse-Kubitza

mappings/VegCore.csv: Regenerated from wiki. Alternatives are now able to use h3 instead of h4 (which had display problems). realLatitude/Longitude is now no longer needs the ? prefix to have its replacement (PRIVATE) interpreted as an alternative, and thus is properly able to be included in the vocabulary.

7491 02/07/2013 07:16 AM Aaron Marcuse-Kubitza

mappings/Makefile: VegCore.vocab.csv: Use the term's type label instead of its header level to determine if it's a synonym or alternative. This allows header levels to be chosen for presentational reasons rather than being constrained by being parsable.

7488 02/07/2013 06:26 AM Aaron Marcuse-Kubitza

mappings/VegCore.csv: Regenerated from wiki. The data dictionary has been reformatted to be much more vertically compact, by placing the term type (Synonym, Alternative, etc.) and sources (From:) on the same line as the term. Note that globalUniqueIdentifier_SpeciesLink has been removed from the vocabulary because a definition entry has been added for it (when this entry is missing, the term is incorrectly identified as a primary term).

7487 02/07/2013 06:21 AM Aaron Marcuse-Kubitza

mappings/Makefile, redmine_synonyms: Updated for new VegCore data dictionary format, which prefixes the term type (Synonym, Alternative, etc.) to the term instead of including it as a section label. This ensures that the term type of a non-primary term is shown next to the term when it is visited via a permalink, which causes the term header to appear at the top of the screen and obscures the section header containing the type.

7486 02/07/2013 06:00 AM Aaron Marcuse-Kubitza

mappings/Makefile: VegCore.thesaurus.csv: removal of tables: ignore errors if grep found no match

7485 02/07/2013 02:06 AM Aaron Marcuse-Kubitza

Renamed mappings/VegCore.csv to VegCore.vocab.csv and Veg+-VegCore.csv to VegCore.thesaurus.csv for clarity

7484 02/07/2013 02:03 AM Aaron Marcuse-Kubitza

mappings/Makefile, input.Makefile: Renamed $(dict) to $(thesaurus) because Veg+-VegCore.csv is actually a thesaurus, not a dictionary

7483 02/07/2013 01:57 AM Aaron Marcuse-Kubitza

mappings/Makefile: Replaced occurrences of VegCore.csv with $(vocab) and Veg+-VegCore.csv with $(dict)

7481 02/06/2013 07:33 PM Aaron Marcuse-Kubitza

mappings/Makefile: Veg+-VegCore.csv: Sort terms by name so that reordering terms in the VegCore data dictionary does not cause Veg+-VegCore.csv to change. This makes it much easier to identify synonyms and ambiguous terms that were accidentally deleted during a data dictionary refactoring. (Note that these are no longer included in VegCore.csv, so this is required in addition to sorting VegCore.csv by name.)

7480 02/06/2013 07:26 PM Aaron Marcuse-Kubitza

mappings/Makefile: VegCore.csv: Sort terms by name so that reordering terms in the VegCore data dictionary does not cause VegCore.csv to change. This makes it much easier to identify terms that were accidentally deleted during a data dictionary refactoring.

7479 02/05/2013 06:19 PM Aaron Marcuse-Kubitza

mappings/VegCore.csv: Regenerated from wiki. This adds cf_aff.

7478 02/05/2013 06:18 PM Aaron Marcuse-Kubitza

mappings/Makefile: VegCore.csv: Filter out namespaces by matching only terms whose header links within the data dictionary

7477 02/05/2013 06:08 PM Aaron Marcuse-Kubitza

mappings/VegCore.csv: Regenerated from wiki. This causes TNRS's Annotations (cf/aff) to be mapped into VegBIEN.

7476 02/05/2013 06:05 PM Aaron Marcuse-Kubitza

mappings/VegCore-VegBIEN.csv: matched*Fit_fraction: Remapped to taxonconfidence instead of taxonfit

7475 02/05/2013 05:56 PM Aaron Marcuse-Kubitza

mappings/Makefile: VegCore.csv: Fixed bug where need to remove duplicates, which are no longer supported by canon, by removing alternatives of ambiguous terms when these occur separately from their definitions