Project

General

Profile

Statistics
| Revision:

# Date Author Comment
4845 09/19/2012 07:44 PM Aaron Marcuse-Kubitza

mappings/VegCore.csv: Removed duplicates

4844 09/19/2012 07:22 PM Aaron Marcuse-Kubitza

input.Makefile: Maps building: Use new mappings/VegCore.csv as the VegCore vocabulary to canonicalize on, in order to also canonicalize VegCore terms which are not yet mapped to VegBIEN. This results in several DwC terms getting their case standardized according to http://rs.tdwg.org/dwc/terms/. Continue to determine unmapped terms using mappings/VegCore-VegBIEN.csv, because a term should not be considered mapped until it has been mapped all the way through to VegBIEN.

4843 09/19/2012 07:12 PM Aaron Marcuse-Kubitza

mappings/VegCore.csv: Removed trailing spaces from terms

4842 09/19/2012 07:05 PM Aaron Marcuse-Kubitza

mappings/Veg+.unmapped_terms.csv: Removed duplicates of VegCore terms

4841 09/19/2012 07:02 PM Aaron Marcuse-Kubitza

mappings/: Split Veg+.terms.csv into VegCore.csv and Veg+.unmapped_terms.csv

4840 09/19/2012 06:36 PM Aaron Marcuse-Kubitza

mappings/Veg+.terms.csv: Removed terms that are in mappings/Veg+-VegCore.csv

4839 09/19/2012 06:31 PM Aaron Marcuse-Kubitza

mappings/Veg+-VegCore.csv: Added sources where missing

4838 09/19/2012 06:20 PM Aaron Marcuse-Kubitza

mappings/Veg+-VegCore.csv: Added Source and Comments columns from mappings/Veg+.terms.csv. Reordered columns to put Comments first.

4837 09/19/2012 06:17 PM Aaron Marcuse-Kubitza

mappings/Veg+.terms.csv: Removed duplicate entries for stem_id/stemID, collector

4836 09/19/2012 05:56 PM Aaron Marcuse-Kubitza

inputs/import.stats.xls: Updated import times

4835 09/19/2012 05:24 PM Aaron Marcuse-Kubitza

inputs/REMIB/Specimen/: Filter out invalid, frameshifted rows so they don't produce errors in the import or anomalies like thousands of taxondeterminations for one taxonoccurrence. This involves moving the CSVs to Specimen.src and using a create.sql to create the filtered table.

4834 09/19/2012 04:47 PM Aaron Marcuse-Kubitza

mappings/VegCore-VegBIEN.csv: Forward occurrenceID to taxonoccurrence.sourceaccessioncode when there is no other taxonoccurrence.sourceaccessioncode, to ensure that taxonoccurrence is uniquely identified so that there is one taxonoccurrence per organism

4833 09/19/2012 04:16 PM Aaron Marcuse-Kubitza

mappings/VegCore-VegBIEN.csv: taxonoccurrence.authortaxoncode alternatives: Use _first instead of _alt because when one of these fields is present, it can be used directly even if it's sometimes NULL, without needing to spend a lot of time _alting together fields that won't be used. Datasources where the authortaxoncode is sometimes NULL usually have a separate sourceaccessioncode for the taxonoccurrence. (In the rare case that they don't, they should map a non-NULL field to recordNumber or tag to ensure that taxonoccurrences can be uniquely identified.)

4832 09/19/2012 04:07 PM Aaron Marcuse-Kubitza

mappings/VegCore-VegBIEN.csv: Mapped tag to taxonoccurrence.authortaxoncode when the record is an organism, in case there is no other ID for the taxonoccurrence. This fixes a bug in FIA and TEAM data where all organisms in a plot used the same taxonoccurrence because taxonoccurrence was not properly constrained, causing the loss of individual taxondeterminations on each organism.

4831 09/19/2012 03:36 PM Aaron Marcuse-Kubitza

input.Makefile: Testing: %/test.by_col.xml: Do abort tester if by-column test fails. There are no longer small rowcount differences between row-based and column-based import on some datasources, so this is now possible.

4830 09/18/2012 11:13 PM Aaron Marcuse-Kubitza

schemas/vegbien.sql: stemobservation: stemobservation_unique_within_plantobservation unique index: Added tag so that a stemobservation can be scoped by its tag when no other ID is specified

4829 09/18/2012 11:11 PM Aaron Marcuse-Kubitza

schemas/vegbien.sql: stemobservation: stemobservation_unique_within_plantobservation unique index: Fixed bug where filter condition underconstrained stemobservation when neither sourceaccessioncode nor authorstemcode was specified, by making sure that at least one *_unique index always applies

4828 09/18/2012 11:08 PM Aaron Marcuse-Kubitza

mappings/VegCore-VegBIEN.csv: Remapped tag to new stemobservation.tag

4827 09/18/2012 11:06 PM Aaron Marcuse-Kubitza

schemas/vegbien.sql: stemobservation: Added tag, tags

4826 09/18/2012 10:53 PM Aaron Marcuse-Kubitza

mappings/VegCore-VegBIEN.csv: tag: Removed no longer applicable comment

4825 09/18/2012 10:49 PM Aaron Marcuse-Kubitza

mappings/VegCore-VegBIEN.csv: Removed no longer used previousTag and the complex mapping logic that attempts to place both tags in VegBIEN in the correct order but does not work for column-based import. tag: Removed iscurrent=true because there is now only one tag field.

4824 09/18/2012 10:41 PM Aaron Marcuse-Kubitza

inputs/SALVIAS/*/map.csv: Remapped all versions of stem and tree tags to tag, with the second tag superceding the first, to avoid the complex VegCore-VegBIEN mapping logic that attempts to place both tags in VegBIEN in the correct order but does not work for column-based import. inputs/SALVIAS-CSV/Organism/map.csv: stem and tree tags: Made the stem tag supercede the tree tag instead of vice versa, to have as specific of a tag as possible.

4823 09/18/2012 10:30 PM Aaron Marcuse-Kubitza

inputs/SALVIAS/stems/map.csv: Copied Brad's comments on plotObservations.tag1, tag2 to stem_tag1, stem_tag2

4822 09/18/2012 10:18 PM Aaron Marcuse-Kubitza

mappings/VegCore-VegBIEN.csv: Removed _rangeStart and _rangeEnd filters from fields which should contain decimal values. These filters should be added on a per-datasource basis instead.

4821 09/18/2012 10:12 PM Aaron Marcuse-Kubitza

inputs/ARIZ/Specimen/map.csv: Documented that MinimumElevationInMeters, MinimumElevationInMeters contain some verbatim values, including ranges and units

4820 09/18/2012 10:09 PM Aaron Marcuse-Kubitza

mappings/VegCore-VegBIEN.csv: Removed /_units:[default=m,to=m,to=]/value filter from fields. It should be added on a per-datasource basis instead.

4819 09/18/2012 10:05 PM Aaron Marcuse-Kubitza

mappings/VegCore-VegBIEN.csv: Removed /_replace:["\bca\.?"=]/value filter from fields. It should be added on a per-datasource basis instead.

4818 09/18/2012 09:36 PM Aaron Marcuse-Kubitza

mappings/VegCore-VegBIEN.csv: verbatimElevation->elevation_m mapping: Translate units automatically (currently only works in row-based mode). Don't remove any "ca." prefix because this is a datasource-specific filter that does not apply to current datasources with verbatimElevation. Also map verbatimElevation to location.verbatimelevation.

4817 09/18/2012 09:21 PM Aaron Marcuse-Kubitza

inputs/NCU-NCSC/Specimen/map.csv: Elevation: Removed comment that it includes units, because this is now part of the definition of verbatimElevation

4816 09/18/2012 09:20 PM Aaron Marcuse-Kubitza

mappings/Veg+.terms.csv: Documented that verbatimElevation must include units

4815 09/18/2012 09:14 PM Aaron Marcuse-Kubitza

inputs/ARIZ/Specimen/map.csv: Remapped VerbatimElevation to UNUSED

4814 09/18/2012 09:11 PM Aaron Marcuse-Kubitza

inputs/*/*/map.csv: Remapped all unused terms to special value UNUSED. Remapped all private terms to special value PRIVATE. Remapped all deliberately unmapped terms to special value OMIT.

4813 09/18/2012 08:53 PM Aaron Marcuse-Kubitza

mappings/Veg+-VegCore.csv: Remapped realLatitude, realLongitude to new special value PRIVATE, which is more specific than OMIT

4812 09/18/2012 08:51 PM Aaron Marcuse-Kubitza

mappings/Veg+.terms.csv: Added special value PRIVATE

4811 09/18/2012 08:44 PM Aaron Marcuse-Kubitza

mappings/Veg+.terms.csv: Added special values OMIT, UNUSED

4810 09/18/2012 08:20 PM Aaron Marcuse-Kubitza

inputs/VegBank/plot_/map.csv: Remapped elevation from verbatimElevation to elevationInMeters, since the values are all decimals. The units come from the data dictionary.

4809 09/18/2012 08:14 PM Aaron Marcuse-Kubitza

inputs/SALVIAS/plotMetadata/map.csv, inputs/SALVIAS-CSV/Plot/map.csv: Remapped elev_m from verbatimElevation to elevationInMeters, since the values are all decimals. Note that the units of SALVIAS Elev were provided by a comment from Brad (and can also be assumed to be the same as SALVIAS-CSV elev_m).

4808 09/18/2012 08:02 PM Aaron Marcuse-Kubitza

inputs/NCU-NCSC/Specimen/map.csv: Documented that Elevation includes units

4807 09/18/2012 07:50 PM Aaron Marcuse-Kubitza

inputs/Madidi/Plot/map.csv: Remapped Minimum altitude from minimumElevationInMeters to verbatimElevation_m, since it is a range, not a minimum. Note that the units are assumed based on the range of values present and the region the data is from (Madidi National Park).

4806 09/18/2012 07:46 PM Aaron Marcuse-Kubitza

mappings/VegCore-VegBIEN.csv: Also mapped verbatimElevation_m to verbatimelevation

4805 09/18/2012 07:44 PM Aaron Marcuse-Kubitza

mappings/VegCore-VegBIEN.csv: Also mapped verbatimElevation_m to elevationrange_m

4804 09/18/2012 07:38 PM Aaron Marcuse-Kubitza

mappings/VegCore-VegBIEN.csv: Mapped verbatimElevation_m

4803 09/18/2012 07:31 PM Aaron Marcuse-Kubitza

mappings/Veg+.terms.csv: Added verbatimElevation_m

4802 09/18/2012 07:28 PM Aaron Marcuse-Kubitza

mappings/Veg+-VegCore.csv: Mapped realLatitude, realLongitude to OMIT because private data should not be placed in a public database

4801 09/18/2012 07:26 PM Aaron Marcuse-Kubitza

mappings/Veg+.terms.csv: Added realLatitude, realLongitude

4800 09/18/2012 07:23 PM Aaron Marcuse-Kubitza

inputs/VegBank/plot_/map.csv: Documented that elevationrange is unused

4799 09/18/2012 07:13 PM Aaron Marcuse-Kubitza

inputs/Madidi/Plot/map.csv: Fixed comments on Direction and OrientaciĆ³n/exposicion so each comment refers to the other field that is equivalent

4798 09/18/2012 07:10 PM Aaron Marcuse-Kubitza

inputs/Madidi/Plot/map.csv: Remapped Altitude from verbatimElevation to elevationInMeters, since the values are all decimals. Note that the units are assumed based on the range of values present and the region the data is from (Madidi National Park).

4797 09/18/2012 06:50 PM Aaron Marcuse-Kubitza

inputs/CTFS/Plot/map.csv: Remapped Elevation from verbatimElevation to elevationInMeters, since it is a float in the original bci.sql database. Note that the units are assumed based on the range of values present and the country the data is from (Panama).

4796 09/18/2012 06:33 PM Aaron Marcuse-Kubitza

mappings/VegCore-VegBIEN.csv: Mapped elevationInMeters

4795 09/18/2012 06:30 PM Aaron Marcuse-Kubitza

mappings/Veg+.terms.csv: Added elevationInMeters

4794 09/18/2012 05:43 PM Aaron Marcuse-Kubitza

schemas/vegbien.sql: location: Added verbatimelevation

4793 09/18/2012 05:21 PM Aaron Marcuse-Kubitza

README.TXT: Data import: Added note that `make schemas/reinstall` must be done after running make_analytical_db on a previous import

4792 09/18/2012 05:16 PM Aaron Marcuse-Kubitza

schemas/vegbien.sql: Added indexes for additional analytical_db_view joins, as described at <https://projects.nceas.ucsb.edu/nceas/issues/494>

4791 09/18/2012 04:51 PM Aaron Marcuse-Kubitza

schemas/vegbien.sql: Added indexes for the analytical_db_view joins, as described at <https://projects.nceas.ucsb.edu/nceas/issues/494>

4790 09/18/2012 04:28 PM Aaron Marcuse-Kubitza

README.TXT: Data import: Added note that `make schemas/rotate` must be done after running make_analytical_db

4789 09/18/2012 04:17 PM Aaron Marcuse-Kubitza

schemas/functions.sql: Renamed _pct_to_frac() to _percent_to_fraction() and _frac_to_pct() to _fraction_to_percent(), for clarity and for consistency with _percent (which is spelled out), as used by SALVIAS (http://salvias.net/Documents/salvias_data_dictionary.html) and elsewhere

4788 09/18/2012 04:06 PM Aaron Marcuse-Kubitza

review: Don't remove XML functions that are unit conversions

4787 09/18/2012 04:00 PM Aaron Marcuse-Kubitza

schemas/vegbien.sql: Changed _frac units suffix to _fraction for clarity and for consistency with _percent (which is spelled out), as used by SALVIAS (http://salvias.net/Documents/salvias_data_dictionary.html) and elsewhere

4786 09/18/2012 03:58 PM Aaron Marcuse-Kubitza

schemas/vegbien.sql: Changed _frac units suffix to _fraction for clarity and for consistency with _percent (which is spelled out), as used by SALVIAS (http://salvias.net/Documents/salvias_data_dictionary.html) and elsewhere

4785 09/18/2012 03:47 PM Aaron Marcuse-Kubitza

inputs/*/*/map.csv: Remapped intercept_cm to new intercept_cm so that units match

4784 09/18/2012 03:45 PM Aaron Marcuse-Kubitza

mappings/VegCore-VegBIEN.csv: Mapped intercept_cm

4783 09/18/2012 03:41 PM Aaron Marcuse-Kubitza

schemas/functions.sql: Added _cm_to_m()

4782 09/18/2012 03:39 PM Aaron Marcuse-Kubitza

mappings/Veg+.terms.csv: Added intercept_cm

4781 09/18/2012 03:35 PM Aaron Marcuse-Kubitza

mappings/VegCore-VegBIEN.csv: Changed volumeCanopy to the more accurate intercept_m. volumeCanopy was the closest equivalent VegX term, but did not really fit line-intercept information, nor did it include units.

4780 09/18/2012 03:28 PM Aaron Marcuse-Kubitza

mappings/Veg+.terms.csv: Added intercept_m

4779 09/18/2012 02:46 PM Aaron Marcuse-Kubitza

schemas/vegbien.sql: taxonscope: Added comment that it stores the scope of a morphospecies name

4778 09/18/2012 02:32 PM Aaron Marcuse-Kubitza

inputs/import.stats.xls: Updated import times

4777 09/18/2012 02:31 PM Aaron Marcuse-Kubitza

README.TXT: Data import: Commit: Shortened import message to fit on one line in the README, to avoid issues when copying and pasting

4776 09/17/2012 05:02 PM Aaron Marcuse-Kubitza

schemas/functions.sql: Added _ha_to_m2(text), _pct_to_frac(text)

4775 09/17/2012 04:55 PM Aaron Marcuse-Kubitza

schemas/vegbien.sql: analytical_db_view: Use _m2_to_ha() on location.area_m2 to get plotAreaHa

4774 09/17/2012 04:50 PM Aaron Marcuse-Kubitza

schemas/vegbien.sql: analytical_db_view: Use _m2_to_ha() on location.area_m2 to get plotAreaHa

4773 09/17/2012 04:49 PM Aaron Marcuse-Kubitza

schemas/functions.sql: Added _m2_to_ha()

4772 09/17/2012 04:46 PM Aaron Marcuse-Kubitza

mappings/VegCore-VegBIEN.csv, Veg+.terms.csv: Removed imprecise and no longer used plotArea and area. Use plotArea_<units> instead.

4771 09/17/2012 04:44 PM Aaron Marcuse-Kubitza

inputs/*/*/map.csv: Remapped applicable plotArea fields to plotArea_m2

4770 09/17/2012 04:41 PM Aaron Marcuse-Kubitza

mappings/VegCore-VegBIEN.csv: Mapped plotArea_m2

4769 09/17/2012 04:40 PM Aaron Marcuse-Kubitza

mappings/Veg+.terms.csv: Added plotArea_m2

4768 09/17/2012 04:39 PM Aaron Marcuse-Kubitza

mappings/VegCore-VegBIEN.csv: Renamed plotAreaHa to plotArea_ha for consistency with VegBIEN units suffixing convention, which includes an "_"

4767 09/17/2012 04:35 PM Aaron Marcuse-Kubitza

inputs/*/*/map.csv: Remapped applicable plotArea fields to plotAreaHa

4766 09/17/2012 04:19 PM Aaron Marcuse-Kubitza

mappings/Veg+-VegCore.csv: Removed inaccurate SizeOfSite->plotArea mapping, which does not match units

4765 09/17/2012 04:16 PM Aaron Marcuse-Kubitza

mappings/VegCore-VegBIEN.csv: Mapped plotAreaHa

4764 09/17/2012 04:16 PM Aaron Marcuse-Kubitza

schemas/functions.sql: Added _ha_to_m2()

4763 09/17/2012 04:11 PM Aaron Marcuse-Kubitza

mappings/Veg+.terms.csv: Added plotAreaHa

4762 09/17/2012 04:08 PM Aaron Marcuse-Kubitza

mappings/Veg+.terms.csv: Standardize area using VegX /plots/plot/area instead of Madidi Inventory+description.Area

4761 09/17/2012 04:01 PM Aaron Marcuse-Kubitza

schemas/vegbien.sql: analytical_db_view: Use _frac_to_pct() on aggregateoccurrence.cover_frac to get pctCover

4760 09/17/2012 03:43 PM Aaron Marcuse-Kubitza

schemas/functions.sql: Added _pct_to_frac()

4759 09/17/2012 03:37 PM Aaron Marcuse-Kubitza

mappings/VegCore-VegBIEN.csv: coverPercent: Convert to fraction using _pct_to_frac()

4758 09/17/2012 03:37 PM Aaron Marcuse-Kubitza

xml_dom.py: replace_with_text(): Support ints and floats

4757 09/17/2012 03:36 PM Aaron Marcuse-Kubitza

xml_dom.py: replace_with_text(): Support ints and floats

4756 09/17/2012 03:31 PM Aaron Marcuse-Kubitza

xml_func.py: simplify(): Run xml_dom.prune_empty() on function nodes that don't have an explicit simplifying function. This allows single-arg functions with no arg to be pruned rather than called with no args (causing errors if the single param does not have a default value).

4755 09/17/2012 02:31 PM Aaron Marcuse-Kubitza

Regenerated vegbien.ERD exports

4754 09/17/2012 02:29 PM Aaron Marcuse-Kubitza

schemas/vegbien.sql: Added units suffix to additional VegBIEN fields that have units

4753 09/17/2012 02:01 PM Aaron Marcuse-Kubitza

schemas/vegbien.sql: Added units suffix to all core VegBIEN fields that have units. It is the responsibility of the mappings to ensure that all units are properly translated.

4752 09/17/2012 12:18 PM Aaron Marcuse-Kubitza

root Makefile: PostgreSQL: postgres-Linux: Added postgresql-postgis apt-get

4751 09/17/2012 11:58 AM Aaron Marcuse-Kubitza

backups/Makefile: Backups: Full DB: Specify the date suffix of the backup when it's created rather than adding it afterwards. This allows the user to specify a suffix that matches the corresponding public-schema backup.

4750 09/17/2012 11:41 AM Aaron Marcuse-Kubitza

inputs/*/*/map.csv: Mapped variants of subspecies directly to new subspecies term

4749 09/17/2012 11:31 AM Aaron Marcuse-Kubitza

mappings/VegCore-VegBIEN.csv: subspecies, infraspecificEpithet: Added _alts for datasources that specify both

4748 09/17/2012 11:27 AM Aaron Marcuse-Kubitza

input.Makefile: Mapping: $(map2db): Inline $(map) because this is the only place it's used

4747 09/17/2012 11:26 AM Aaron Marcuse-Kubitza

input.Makefile: Mapping: $(map): Don't require flat files because they don't need to be used directly anymore (staging tables are used instead)

4746 09/17/2012 11:24 AM Aaron Marcuse-Kubitza

input.Makefile: Mapping: $(map2db): Always use staging tables, because the flat files don't need to be used directly anymore