Project

General

Profile

Statistics
| Revision:

# Date Author Comment
4778 09/18/2012 02:32 PM Aaron Marcuse-Kubitza

inputs/import.stats.xls: Updated import times

4777 09/18/2012 02:31 PM Aaron Marcuse-Kubitza

README.TXT: Data import: Commit: Shortened import message to fit on one line in the README, to avoid issues when copying and pasting

4776 09/17/2012 05:02 PM Aaron Marcuse-Kubitza

schemas/functions.sql: Added _ha_to_m2(text), _pct_to_frac(text)

4775 09/17/2012 04:55 PM Aaron Marcuse-Kubitza

schemas/vegbien.sql: analytical_db_view: Use _m2_to_ha() on location.area_m2 to get plotAreaHa

4774 09/17/2012 04:50 PM Aaron Marcuse-Kubitza

schemas/vegbien.sql: analytical_db_view: Use _m2_to_ha() on location.area_m2 to get plotAreaHa

4773 09/17/2012 04:49 PM Aaron Marcuse-Kubitza

schemas/functions.sql: Added _m2_to_ha()

4772 09/17/2012 04:46 PM Aaron Marcuse-Kubitza

mappings/VegCore-VegBIEN.csv, Veg+.terms.csv: Removed imprecise and no longer used plotArea and area. Use plotArea_<units> instead.

4771 09/17/2012 04:44 PM Aaron Marcuse-Kubitza

inputs/*/*/map.csv: Remapped applicable plotArea fields to plotArea_m2

4770 09/17/2012 04:41 PM Aaron Marcuse-Kubitza

mappings/VegCore-VegBIEN.csv: Mapped plotArea_m2

4769 09/17/2012 04:40 PM Aaron Marcuse-Kubitza

mappings/Veg+.terms.csv: Added plotArea_m2

4768 09/17/2012 04:39 PM Aaron Marcuse-Kubitza

mappings/VegCore-VegBIEN.csv: Renamed plotAreaHa to plotArea_ha for consistency with VegBIEN units suffixing convention, which includes an "_"

4767 09/17/2012 04:35 PM Aaron Marcuse-Kubitza

inputs/*/*/map.csv: Remapped applicable plotArea fields to plotAreaHa

4766 09/17/2012 04:19 PM Aaron Marcuse-Kubitza

mappings/Veg+-VegCore.csv: Removed inaccurate SizeOfSite->plotArea mapping, which does not match units

4765 09/17/2012 04:16 PM Aaron Marcuse-Kubitza

mappings/VegCore-VegBIEN.csv: Mapped plotAreaHa

4764 09/17/2012 04:16 PM Aaron Marcuse-Kubitza

schemas/functions.sql: Added _ha_to_m2()

4763 09/17/2012 04:11 PM Aaron Marcuse-Kubitza

mappings/Veg+.terms.csv: Added plotAreaHa

4762 09/17/2012 04:08 PM Aaron Marcuse-Kubitza

mappings/Veg+.terms.csv: Standardize area using VegX /plots/plot/area instead of Madidi Inventory+description.Area

4761 09/17/2012 04:01 PM Aaron Marcuse-Kubitza

schemas/vegbien.sql: analytical_db_view: Use _frac_to_pct() on aggregateoccurrence.cover_frac to get pctCover

4760 09/17/2012 03:43 PM Aaron Marcuse-Kubitza

schemas/functions.sql: Added _pct_to_frac()

4759 09/17/2012 03:37 PM Aaron Marcuse-Kubitza

mappings/VegCore-VegBIEN.csv: coverPercent: Convert to fraction using _pct_to_frac()

4758 09/17/2012 03:37 PM Aaron Marcuse-Kubitza

xml_dom.py: replace_with_text(): Support ints and floats

4757 09/17/2012 03:36 PM Aaron Marcuse-Kubitza

xml_dom.py: replace_with_text(): Support ints and floats

4756 09/17/2012 03:31 PM Aaron Marcuse-Kubitza

xml_func.py: simplify(): Run xml_dom.prune_empty() on function nodes that don't have an explicit simplifying function. This allows single-arg functions with no arg to be pruned rather than called with no args (causing errors if the single param does not have a default value).

4755 09/17/2012 02:31 PM Aaron Marcuse-Kubitza

Regenerated vegbien.ERD exports

4754 09/17/2012 02:29 PM Aaron Marcuse-Kubitza

schemas/vegbien.sql: Added units suffix to additional VegBIEN fields that have units

4753 09/17/2012 02:01 PM Aaron Marcuse-Kubitza

schemas/vegbien.sql: Added units suffix to all core VegBIEN fields that have units. It is the responsibility of the mappings to ensure that all units are properly translated.

4752 09/17/2012 12:18 PM Aaron Marcuse-Kubitza

root Makefile: PostgreSQL: postgres-Linux: Added postgresql-postgis apt-get

4751 09/17/2012 11:58 AM Aaron Marcuse-Kubitza

backups/Makefile: Backups: Full DB: Specify the date suffix of the backup when it's created rather than adding it afterwards. This allows the user to specify a suffix that matches the corresponding public-schema backup.

4750 09/17/2012 11:41 AM Aaron Marcuse-Kubitza

inputs/*/*/map.csv: Mapped variants of subspecies directly to new subspecies term

4749 09/17/2012 11:31 AM Aaron Marcuse-Kubitza

mappings/VegCore-VegBIEN.csv: subspecies, infraspecificEpithet: Added _alts for datasources that specify both

4748 09/17/2012 11:27 AM Aaron Marcuse-Kubitza

input.Makefile: Mapping: $(map2db): Inline $(map) because this is the only place it's used

4747 09/17/2012 11:26 AM Aaron Marcuse-Kubitza

input.Makefile: Mapping: $(map): Don't require flat files because they don't need to be used directly anymore (staging tables are used instead)

4746 09/17/2012 11:24 AM Aaron Marcuse-Kubitza

input.Makefile: Mapping: $(map2db): Always use staging tables, because the flat files don't need to be used directly anymore

4745 09/17/2012 11:02 AM Aaron Marcuse-Kubitza

mappings/Veg+-VegCore.csv: Remapped subspecies, subSpeciesName to new subspecies term

4744 09/17/2012 10:52 AM Aaron Marcuse-Kubitza

mappings/VegCore-VegBIEN.csv: Mapped subspecies, variety, forma, cultivar

4743 09/17/2012 10:47 AM Aaron Marcuse-Kubitza

mappings/Veg+.terms.csv: Added subspecies, variety, forma, cultivar

4742 09/17/2012 10:33 AM Aaron Marcuse-Kubitza

Regenerated vegbien.ERD exports

4741 09/17/2012 10:30 AM Aaron Marcuse-Kubitza

schemas/vegbien.sql: taxon.authority_id: Added descriptive comment that this is the authority which defines the taxon name (as opposed to the author of the taxon name)

4740 09/17/2012 10:29 AM Aaron Marcuse-Kubitza

schemas/vegbien.sql: taxon: Added author_id for the author of the taxon name. This is distinct from authority_id, which is the authority used to determine which taxon name to apply.

4739 09/17/2012 10:14 AM Aaron Marcuse-Kubitza

schemas/vegbien.sql: analytical_db_view: Use new denormalized placepath table instead of place, which significantly reduces the number of joins

4738 09/17/2012 10:11 AM Aaron Marcuse-Kubitza

schemas/vegbien.sql: location: Removed stateprovince, country because these are now in placepath (as well as in place.rank)

4737 09/17/2012 10:01 AM Aaron Marcuse-Kubitza

schemas/vegbien.sql: analytical_db_view: LEFT JOIN locationcoords and locationplace so that locations will be included even if they don't have one of these two determinations

4736 09/17/2012 10:00 AM Aaron Marcuse-Kubitza

schemas/vegbien.sql: analytical_db_view: Fixed bug where method was being joined instead of left-joined, causing only rows with a method to be included

4735 09/17/2012 09:44 AM Aaron Marcuse-Kubitza

Regenerated vegbien.ERD exports

4734 09/17/2012 09:41 AM Aaron Marcuse-Kubitza

schemas/vegbien.sql: locationplace: Added identifier_id, so that different identifiers (e.g. the data provider and GNRS) can provide separate locationplaces even if the standardized name happens to be the same as the original name

4733 09/17/2012 09:31 AM Aaron Marcuse-Kubitza

mappings/VegBank-VegBIEN.csv: Added place->locationplace renaming

4732 09/17/2012 09:30 AM Aaron Marcuse-Kubitza

mappings/VegBIEN-VegBank.csv: Reversed the order of the columns so it's a more natural forward renaming, and renamed the file to VegBank-VegBIEN.csv to reflect the new column order

4731 09/17/2012 09:27 AM Aaron Marcuse-Kubitza

mappings/VegBIEN-VegBank.csv: Fixed order of plantconcept->taxon renaming because the VegBIEN column is on the right

4730 09/17/2012 09:26 AM Aaron Marcuse-Kubitza

schemas/vegbien.sql: Renamed namedplace to place for simplicity and consistency with placepath and locationplace

4729 09/17/2012 09:09 AM Aaron Marcuse-Kubitza

schemas/vegbien.sql: taxon: Made authority an fkey to reference instead of a text field

4728 09/17/2012 09:03 AM Aaron Marcuse-Kubitza

schemas/vegbien.sql: Moved steps to include a taxon name at a rank with no explicit column from taxon's comment to taxonpath's comment, because that is the table the steps apply to

4727 09/17/2012 09:00 AM Aaron Marcuse-Kubitza

schemas/vegbien.sql: Added placepath (analogous to taxonpath), and point locationplace to it instead of directly to namedplace

4726 09/17/2012 08:11 AM Aaron Marcuse-Kubitza

schemas/vegbien.sql: Split locationdetermination into locationcoords and locationplace, so that coordinate determinations can be made separately from place determinations

4725 09/17/2012 07:22 AM Aaron Marcuse-Kubitza

schemas/vegbien.sql: location: Removed authore, authorn because this information is now in locationdetermination as verbatimlongitude, verbatimlatitude

4724 09/17/2012 07:20 AM Aaron Marcuse-Kubitza

schemas/vegbien.sql: location: Removed centerlatitude/longitude, publiclatitude/longitude because this information is now in locationdetermination

4723 09/17/2012 06:49 AM Aaron Marcuse-Kubitza

schemas/vegbien.ERD.mwb: Fixed lines

4722 09/17/2012 06:48 AM Aaron Marcuse-Kubitza

mappings/VegBIEN-VegBank.csv: Added table rename plantconcept->taxon

4721 09/17/2012 06:47 AM Aaron Marcuse-Kubitza

schemas/vegbien.sql: taxonpath.scientificnamewithauthor: Added comment that it's equivalent to "Name sec. x"

4720 09/17/2012 06:43 AM Aaron Marcuse-Kubitza

schemas/vegbien.sql: taxon: Added comment that it's VegBank's plantConcept table

4719 09/14/2012 11:21 PM Aaron Marcuse-Kubitza

Regenerated vegbien.ERD exports

4718 09/14/2012 11:18 PM Aaron Marcuse-Kubitza

schemas/vegbien.sql: Renamed plantconcept to taxonpath for consistency with DwC's Taxon category and to emphasize that the table stores taxonomic paths

4717 09/14/2012 11:11 PM Aaron Marcuse-Kubitza

schemas/vegbien.sql: Renamed plantname to taxon for consistency with DwC's Taxon category

4716 09/14/2012 11:02 PM Aaron Marcuse-Kubitza

schemas/vegbien.sql: plantname: Renamed plantname field to taxonname for consistency with DwC's Taxon category

4715 09/14/2012 10:55 PM Aaron Marcuse-Kubitza

Regenerated vegbien.ERD exports

4714 09/14/2012 10:49 PM Aaron Marcuse-Kubitza

Updated aggregated unmapped_terms.csv, new_terms.csv. This removes terms that contained a filter (which is now in a separate column) and moves new terms that are unmapped from new_terms.csv to unmapped_terms.csv. Note that the majority of unmapped terms are from VegBank's huge tables, and are not part of the core fields needed for the analytical DB.

4713 09/14/2012 10:41 PM Aaron Marcuse-Kubitza

schemas/vegbien.sql: taxonrank: Switched to using extended taxonomic ranks list derived from VegX at <https://projects.nceas.ucsb.edu/nceas/projects/bien/wiki/VegBIEN_taxonomic_schema#Extended&gt;. This renames *division to *phylum and splits up 'cultivar/forma'.

4712 09/14/2012 10:39 PM Aaron Marcuse-Kubitza

schemas/vegbien.sql: taxonrank: Removed 'authority', which doesn't belong as a taxonomic rank

4711 09/14/2012 10:38 PM Aaron Marcuse-Kubitza

schemas/vegbien.sql: plantname: Added authority so each taxonomic level can have its own authority (author). Include it in the plantname_unique unique index because plantname is a globally scoped table.

4710 09/14/2012 10:25 PM Aaron Marcuse-Kubitza

schemas/vegbien.sql: taxonrank: Removed 'binomial', which doesn't belong as a taxonomic rank

4709 09/14/2012 10:24 PM Aaron Marcuse-Kubitza

schemas/vegbien.sql: Changed analytical_db_view to use new denormalized taxonomic names in plantconcept, which significantly reduces the number of joins. Note that changing the tables used by a view which depends on other tables will cause those tables to be reordered in dependency order to appear before the view, causing things to be moved around in the svn diff.

4708 09/14/2012 10:01 PM Aaron Marcuse-Kubitza

inputs/Madidi/Organism/map.csv: Remapped Specie+autor to new scientificNameWithAuthorship. Mapped Species and morphotypes to now-available scientificName.

4707 09/14/2012 09:59 PM Aaron Marcuse-Kubitza

mappings/VegCore-VegBIEN.csv: Moved scientificNameWithAuthorship before scientificName in taxonoccurrence.authortaxoncode's _alts

4706 09/14/2012 09:55 PM Aaron Marcuse-Kubitza

mappings/VegCore-VegBIEN.csv: Mapped scientificNameWithAuthorship as an _alt of taxonoccurrence.authortaxoncode

4705 09/14/2012 09:53 PM Aaron Marcuse-Kubitza

mappings/VegCore-VegBIEN.csv: Mapped scientificNameWithAuthorship

4704 09/14/2012 09:51 PM Aaron Marcuse-Kubitza

mappings/Veg+.terms.csv: Added scientificNameWithAuthorship

4703 09/14/2012 09:47 PM Aaron Marcuse-Kubitza

mappings/VegCore-VegBIEN.csv: Taxonomic names: Remapped to new denormalized fields in plantconcept

4702 09/14/2012 09:08 PM Aaron Marcuse-Kubitza

schemas/vegbien.sql: plantname: Added comment documenting how to include a taxon name at a rank with no explicit column, by using the plantname table as an ordered linked list linked together using parent_id. (This method of using a linked list is one way of storing an ordered list of user-defined data. It is similar to using locationevent.previous_id to link successive reobservations of the same location together.) Note that plantname can store both the official tree of life and the data provider's own custom tree of life (or a subset thereof), with the two being distinguished by whether the data provider's or TNRS's taxondeterminations point to them.

4701 09/14/2012 08:53 PM Aaron Marcuse-Kubitza

schemas/vegbien.sql: plantname: Added verbatimrank to store ranks of custom taxonomic levels, such as rosids. Note that even if you specify a custom verbatimrank, you must also specify a closest-match rank from the taxonrank closed list. This ensures that every taxonomic name is placed in the correct relative order in the taxonomic hierarchy.

4700 09/14/2012 08:38 PM Aaron Marcuse-Kubitza

schemas/vegbien.sql: plantconcept: Made plantname_id optional because the datasource's plantconcepts do not need to be placed in the recursive plantname hierarchy

4699 09/14/2012 08:35 PM Aaron Marcuse-Kubitza

schemas/vegbien.sql: plantconcept: Added datasource_id and appropriate unique indexes to enable scoping by datasource. Moved plantcode right after datasource_id because it will be used for the sourceaccessioncode (if any).

4698 09/14/2012 08:21 PM Aaron Marcuse-Kubitza

schemas/vegbien.sql: Moved plantconcept.plantdescription to plantname and renamed it to description, so that a taxon of any rank can have a description

4697 09/14/2012 08:02 PM Aaron Marcuse-Kubitza

schemas/vegbien.sql: plantconcept: Added denormalized taxonomic ranks from <https://projects.nceas.ucsb.edu/nceas/projects/bien/wiki/VegBIEN_taxonomic_schema#Primary> and concatenated scientific name fields

4696 09/14/2012 07:25 PM Aaron Marcuse-Kubitza

Removed no longer used ucase_first

4695 09/14/2012 07:23 PM Aaron Marcuse-Kubitza

Removed no longer used bin/union

4694 09/14/2012 07:22 PM Aaron Marcuse-Kubitza

Removed no longer used join_union_sort

4693 09/14/2012 07:21 PM Aaron Marcuse-Kubitza

Removed no longer used ci_map, because all relevant mapping scripts are now case-insensitive

4692 09/14/2012 07:19 PM Aaron Marcuse-Kubitza

mappings/Makefile: Inline $(review_) because it's only used once

4691 09/14/2012 07:18 PM Aaron Marcuse-Kubitza

mappings/Makefile: Removed no longer used $(review)

4690 09/14/2012 07:17 PM Aaron Marcuse-Kubitza

mappings/Makefile: Don't set $(SHELL) to /bin/bash because this is no longer needed

4689 09/14/2012 07:16 PM Aaron Marcuse-Kubitza

mappings/Makefile: Removed empty VegCSV section. mappings/Makefile's only functionality is now to clean up (sort) the core maps whenever they change and create human-readable maps from them.

4688 09/14/2012 07:13 PM Aaron Marcuse-Kubitza

mappings/Makefile: Removed no longer used self maps, because the new automapping mechanism does not use them

4687 09/14/2012 07:09 PM Aaron Marcuse-Kubitza

input.Makefile: Existing maps discovery: Substituted Veg+ for $(via) because it's now only used once

4686 09/14/2012 07:05 PM Aaron Marcuse-Kubitza

mappings/VegCore-VegBIEN.csv: Changed input column header from VegCore[Veg+] to VegCore because this is more accurate. This is possible now that we're using new automapping scripts that do not require a particular column header.

4685 09/14/2012 06:39 PM Aaron Marcuse-Kubitza

inputs/*/*/map.csv: Changed _merge to _join everywhere because _merge's (slower) duplicate elimination functionality is not needed (the combined columns do not both contain the same value, so they can simply be concatenated)

4684 09/14/2012 06:38 PM Aaron Marcuse-Kubitza

inputs/*/*/map.csv: Changed _merge to _join everywhere because _merge's (slower) duplicate elimination functionality is not needed (the combined columns do not both contain the same value, so they can simply be concatenated)

4683 09/14/2012 06:21 PM Aaron Marcuse-Kubitza

schemas/functions.sql: _label(): Accept params of any type, in order to support types other than text (which come from staging tables that are imported directly from a SQL export). This fixes a bug in SALVIAS.plotMetadata's column-based import.

4682 09/14/2012 06:17 PM Aaron Marcuse-Kubitza

schemas/functions.sql: _label(): Support NULL labels by not prepending a label

4681 09/14/2012 06:04 PM Aaron Marcuse-Kubitza

mappings/Veg+-VegCore.csv: Changed output column header from Veg+ to VegCore because this is more accurate. This is possible now that we're using new automapping scripts that do not require a particular column header. Note that this change now requires the map.csvs to use VegCore as their output column header, because otherwise the Veg+ header will get automapped to VegCore. (The header replacing is a feature to support changing the header when the schema of the column's terms changes.)

4680 09/14/2012 06:03 PM Aaron Marcuse-Kubitza

mappings/root.sh: Changed output column header from Veg+ to VegCore because this is more accurate following the initial automapping

4679 09/14/2012 05:59 PM Aaron Marcuse-Kubitza

inputs/*/*/map.csv: Changed output column header from Veg+ to VegCore because the names will be VegCore names after automapping. This is possible now that we're using new automapping scripts that do not require a particular column header.