# Date Author Comment
6159 11/14/2012 02:25 PM Aaron Marcuse-Kubitza

inputs/.geoscrub/import_order.txt: Fixed bug where geoscrub_cultivated needs to be installed after geoscrub_cleaned_unique, not before as it would be with the default alphabetical sort order

6158 11/14/2012 02:24 PM Aaron Marcuse-Kubitza

inputs/.geoscrub/geoscrub_cultivated/: Use _no_import file to exclude geoscrub_cultivated from the import, because it's used directly as a lookup table by analytical_stem rather than being imported. This ensures that there is no import log or input row count for geoscrub_cultivated in the import times, which would skew the import row count because the row count would be included even though no columns are mapped.

6157 11/14/2012 02:18 PM Aaron Marcuse-Kubitza

input.Makefile: $(tables): Fixed bug where $(importTables) needs to be used instead of $(tables) in all places that should use only imported tables, rather than just in the import process itself

6156 11/14/2012 02:13 PM Aaron Marcuse-Kubitza

input.Makefile: Import to VegBIEN: Added support for tables which should be installed but not imported, but which must be installed after tables which are imported rather than before. This currently applies to geoscrub.geoscrub_cultivated, which depends on geoscrub_cleaned_unique (and therefore must be installed after it), but which should not be imported because it's used directly as a lookup table by analytical_stem.

6155 11/14/2012 10:02 AM Aaron Marcuse-Kubitza

inputs/VegBank/vegbank.~.clean_up.sql: Documented that plots with confidentialitystatus >= 4 are not deleted if their embargoes have already expired. This applies to the Shenandoah NP data, which has confidentialitystatus = 5 but is no longer embargoed according to the embargo table.

6154 11/13/2012 08:10 PM Aaron Marcuse-Kubitza

inputs/SALVIAS/: Mapped unmapped fields with a VegCore/VegBIEN equivalent. plotMetadata_/: Remapped life_zone to communityID because it is now alt-ed together with vegetation*, and thus not just a description with life_zone_code as its globally unique name.

6153 11/13/2012 07:35 PM Aaron Marcuse-Kubitza

Regenerated vegbien.ERD exports

6152 11/13/2012 07:10 PM Aaron Marcuse-Kubitza

schemas/vegbien.sql: referencetype: Added terms from reference.referencetype closed list in VegBank data dictionary. Cited sources in comment.

6151 11/13/2012 06:39 PM Aaron Marcuse-Kubitza

schemas/vegbien.sql: reference.referencetype: Changed type to referencetype enum

6150 11/13/2012 06:38 PM Aaron Marcuse-Kubitza

schemas/vegbien.sql: Added referencetype enum, containing VegBank's values in reference.referencetype as well as values for bien_web.datasource.aggregatorOrPrimary and bien_web.dataSourceNormalized.isHerbarium,isAggregator

6149 11/13/2012 06:23 PM Aaron Marcuse-Kubitza

specimenreplicate: Made institution_id an fkey to referencename instead of party, to later be matched up with reference entries for each aggregator's subprovider

6148 11/13/2012 06:15 PM Aaron Marcuse-Kubitza

schemas/vegbien.sql: referencename: Added referencename_unique unique index on name

6147 11/13/2012 06:00 PM Aaron Marcuse-Kubitza

schemas/vegbien.sql: referencename: Made reference_id optional so it can be populated later when referencenames are scrubbed

6146 11/13/2012 05:58 PM Aaron Marcuse-Kubitza

schemas/vegbien.sql: referencename: Renamed identifier to name because it is specifically any name for the reference, not necessarily an ID

6145 11/13/2012 05:53 PM Aaron Marcuse-Kubitza

schemas/vegbien.sql: Renamed referencealtident to referencename to allow any verbatim reference name to go here, with reference containing the corresponding accepted reference name

6144 11/13/2012 05:50 PM Aaron Marcuse-Kubitza

schemas/vegbien.sql: reference: Added accesslevel, accessconditions from bien_web.datasource

6143 11/13/2012 05:41 PM Aaron Marcuse-Kubitza

schemas/vegbien.sql: address: Added street2 from bien_web.party.address2

6142 11/13/2012 05:38 PM Aaron Marcuse-Kubitza

schemas/vegbien.sql: address: Renamed fields to bien_web.party names

6141 11/13/2012 05:12 PM Aaron Marcuse-Kubitza

schemas/vegbien.sql: party: Added department from bien_web.party

6140 11/13/2012 05:06 PM Aaron Marcuse-Kubitza

inputs/SALVIAS/plotMetadata_/map.csv: Mapped lookup_MethodCode_Description to new observationMeasure

6139 11/13/2012 05:06 PM Aaron Marcuse-Kubitza

schemas/vegbien.sql: method: Made name optional when description or observationmeasure is specified

6138 11/13/2012 05:03 PM Aaron Marcuse-Kubitza

schemas/vegbien.sql: method: method_unique: Include observationmeasure since the method name sometimes is not globally unique (e.g. in SALVIAS)
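
The effect of widening a unique index can be sketched as follows. This is a minimal illustration using Python's sqlite3 module rather than the project's PostgreSQL schema; the two-column method table and the sample values are assumptions made for the example, and only the names method_unique, name, and observationmeasure come from the log entry.

```python
import sqlite3

def second_method_accepted(index_columns):
    """Insert two methods sharing a name; report whether the second insert succeeds.

    index_columns is the column list for the unique index, e.g. "name" or
    "name, observationmeasure". The table layout is hypothetical.
    """
    conn = sqlite3.connect(":memory:")
    try:
        conn.execute("CREATE TABLE method (name TEXT, observationmeasure TEXT)")
        conn.execute(
            f"CREATE UNIQUE INDEX method_unique ON method ({index_columns})"
        )
        conn.execute("INSERT INTO method VALUES ('1', 'stem diameter')")
        try:
            # Same name, different observation measure.
            conn.execute("INSERT INTO method VALUES ('1', 'cover')")
            return True
        except sqlite3.IntegrityError:
            return False
    finally:
        conn.close()
```

With the index on name alone the second row is rejected; with observationmeasure included, both rows can coexist.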

6137 11/13/2012 04:58 PM Aaron Marcuse-Kubitza

mappings/VegCore-VegBIEN.csv: Mapped observationMeasure

6136 11/13/2012 04:57 PM Aaron Marcuse-Kubitza

mappings/VegCore.csv: observationMeasure: Added source to DwC samplingProtocol

6135 11/13/2012 04:54 PM Aaron Marcuse-Kubitza

mappings/VegCore.csv: Added observationMeasure

6134 11/13/2012 04:40 PM Aaron Marcuse-Kubitza

schemas/vegbien.ERD.mwb: Added family_higher_plant_group

6133 11/13/2012 04:28 PM Aaron Marcuse-Kubitza

schemas/vegbien.sql: Removed VegBank-internal fields starting with d_

6132 11/13/2012 04:19 PM Aaron Marcuse-Kubitza

schemas/vegbien.ERD.mwb: Moved tables so commclass would have more room. Moved revision back to original spot.

6131 11/13/2012 04:07 PM Aaron Marcuse-Kubitza

schemas/filter_ERD.csv: Display referencecontributor->party connection in ERD

6130 11/13/2012 03:56 PM Aaron Marcuse-Kubitza

schemas/vegbien.sql: Removed no longer used table referenceparty

6129 11/13/2012 03:54 PM Aaron Marcuse-Kubitza

schemas/vegbien.sql: referencecontributor: Point to party instead of referenceparty, which duplicates party

6128 11/13/2012 03:51 PM Aaron Marcuse-Kubitza

schemas/vegbien.sql: party: Added new suffix field to party_unique unique index

6127 11/13/2012 03:49 PM Aaron Marcuse-Kubitza

schemas/vegbien.sql: party: Added fields from referenceparty. Note that referenceparty.type is named partytype.

6126 11/13/2012 03:25 PM Aaron Marcuse-Kubitza

inputs/SALVIAS/salvias_plots.~.clean_up.sql: Rename lookup_MethodCode.Description to lookup_MethodCode_Description to make it globally unique when joined with plotMetadata

6125 11/13/2012 03:24 PM Aaron Marcuse-Kubitza

input.Makefile: SVN: $(svnFilesGlob): Added root-level .sql files containing ~, which run additional commands after the original data is imported

6124 11/13/2012 03:22 PM Aaron Marcuse-Kubitza

inputs/SALVIAS/_MySQL/: Updated svn:ignore from running `make inputs/SALVIAS/add`

6123 11/13/2012 02:30 PM Aaron Marcuse-Kubitza

mappings/VegCore-VegBIEN.csv: matched place's coordinates: Fixed bug where coordinates entry itself needed to have its datasource (reference) set to geoscrub, in addition to the place entry that uses it, in order to match up properly with geoscrub's corresponding input place (whose coordinates as well as place are owned by the geoscrub datasource)

6122 11/13/2012 02:22 PM Aaron Marcuse-Kubitza

mappings/VegCore-VegBIEN.csv: matched place's coordinates: Fixed bug where coordinates mappings with and without matched_place_id=0 need to sort together in order to be merged, by prepending ".," to the place attrs list

6121 11/13/2012 02:22 PM Aaron Marcuse-Kubitza

inputs/VegBank/plot_/test.xml.ref: Updated inserted row count

6120 11/13/2012 12:00 PM Aaron Marcuse-Kubitza

inputs/import.stats.xls: Updated import times

6119 11/09/2012 08:30 PM Aaron Marcuse-Kubitza

Regenerated vegbien.ERD exports

6118 11/09/2012 08:20 PM Aaron Marcuse-Kubitza

inputs/Makefile: Input data: $(rsyncLogs): Also include logs from the datasource's top-level logs/ dir, which contains make_analytical_db.log.sql

6117 11/09/2012 08:09 PM Aaron Marcuse-Kubitza

inputs/VegBank/vegbank.~.clean_up.sql: Remove still-embargoed plots

6116 11/09/2012 08:07 PM Aaron Marcuse-Kubitza

inputs/VegBank/vegbank.~.clean_up.sql: Enable cascading deletes by setting all foreign keys to ON DELETE CASCADE
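
As a rough illustration of why cascading deletes help when removing embargoed plots, the sketch below uses Python's sqlite3 module. The plot and observation tables are simplified stand-ins, not the VegBank schema, and SQLite needs the foreign_keys pragma where PostgreSQL enforces foreign keys by default.

```python
import sqlite3

def observations_left_after_plot_delete():
    """Delete one plot and return the observation rows that survive."""
    conn = sqlite3.connect(":memory:")
    # SQLite only enforces foreign keys when this pragma is switched on.
    conn.execute("PRAGMA foreign_keys = ON")
    conn.executescript("""
        CREATE TABLE plot (plot_id INTEGER PRIMARY KEY, name TEXT);
        CREATE TABLE observation (
            observation_id INTEGER PRIMARY KEY,
            plot_id INTEGER NOT NULL
                REFERENCES plot (plot_id) ON DELETE CASCADE
        );
        INSERT INTO plot VALUES (1, 'embargoed'), (2, 'public');
        INSERT INTO observation VALUES (10, 1), (11, 1), (12, 2);
    """)
    # Deleting the parent row also removes its dependent observations.
    conn.execute("DELETE FROM plot WHERE plot_id = 1")
    rows = conn.execute(
        "SELECT observation_id, plot_id FROM observation ORDER BY observation_id"
    ).fetchall()
    conn.close()
    return rows
```

Only the observation belonging to the remaining plot is left, without any explicit DELETE against the child table.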

6115 11/09/2012 07:49 PM Aaron Marcuse-Kubitza

Added inputs/VegBank/_src/vegbank.schema.sql.make and vegbank.schema.sql

6114 11/09/2012 07:48 PM Aaron Marcuse-Kubitza

input.Makefile: Staging tables installation: sql/install: Use new pg_dump_limit to remove security and schema-setting commands

6113 11/09/2012 07:46 PM Aaron Marcuse-Kubitza

Added pg_dump_limit to filter a PostgreSQL DB dump to remove security and schema-setting commands
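
The kind of filtering described here might look like the sketch below. It is not the pg_dump_limit script itself: the patterns (SET search_path, GRANT, REVOKE, and ALTER ... OWNER TO) are assumptions about what "security and schema-setting commands" covers, and the function name filter_dump is hypothetical.

```python
import re

# Assumed patterns; the real script's rules are not shown in this log.
DROP_PATTERNS = [
    re.compile(r"^SET search_path\b"),    # schema-setting
    re.compile(r"^(GRANT|REVOKE)\b"),     # privileges
    re.compile(r"^ALTER .* OWNER TO "),   # ownership
]

def filter_dump(lines):
    """Return the dump lines that match none of the drop patterns."""
    return [
        line for line in lines
        if not any(pattern.match(line) for pattern in DROP_PATTERNS)
    ]
```

This line-based approach assumes each such command sits on a single line of the dump.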

6112 11/09/2012 06:37 PM Aaron Marcuse-Kubitza

inputs/.geoscrub/geoscrub_cleaned_unique/create.sql: Removed no longer needed index on latitudeDecimalVerbatim, longitudeDecimalVerbatim, which is now on geoscrub_cultivated instead

6111 11/09/2012 06:32 PM Aaron Marcuse-Kubitza

schemas/vegbien.sql: analytical_stem_view: Fixed bug where the view needed to join on the new geoscrub_cultivated, not geoscrub, for all geoscrub-related information. geoscrub contains many duplicate records, causing one input row to match many rows in geoscrub, when there should be only one entry for each coordinate pair.
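
The row multiplication caused by joining against a table with duplicate coordinate pairs can be reproduced in miniature. The sketch below uses Python's sqlite3 module with hypothetical tables (input_row, geoscrub_raw, geoscrub_dedup); it illustrates the general problem and does not reproduce how geoscrub_cultivated is actually built.

```python
import sqlite3

def join_row_counts():
    """Return (rows joined to the raw table, rows joined to the deduplicated table)."""
    conn = sqlite3.connect(":memory:")
    conn.executescript("""
        CREATE TABLE input_row (id INTEGER, lat REAL, lon REAL);
        CREATE TABLE geoscrub_raw (lat REAL, lon REAL, is_cultivated INTEGER);
        INSERT INTO input_row VALUES (1, 10.5, -70.25);
        -- The same coordinate pair appears three times.
        INSERT INTO geoscrub_raw VALUES
            (10.5, -70.25, 1), (10.5, -70.25, 1), (10.5, -70.25, 1);
        -- One entry per coordinate pair.
        CREATE TABLE geoscrub_dedup AS
            SELECT DISTINCT lat, lon, is_cultivated FROM geoscrub_raw;
    """)
    query = (
        "SELECT count(*) FROM input_row i "
        "JOIN {table} g ON g.lat = i.lat AND g.lon = i.lon"
    )
    raw = conn.execute(query.format(table="geoscrub_raw")).fetchone()[0]
    dedup = conn.execute(query.format(table="geoscrub_dedup")).fetchone()[0]
    conn.close()
    return raw, dedup
```

The single input row yields three output rows against the raw table and one against the deduplicated table.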

6110 11/09/2012 06:26 PM Aaron Marcuse-Kubitza

Added inputs/.geoscrub/geoscrub_cultivated/

6109 11/09/2012 06:04 PM Aaron Marcuse-Kubitza

inputs/.geoscrub/geoscrub_cleaned_unique/create.sql: Added index on latitudeDecimalVerbatim, longitudeDecimalVerbatim for use by analytical_stem_view

6108 11/09/2012 05:34 PM Aaron Marcuse-Kubitza

inputs/newWorld/geoscrub.schema.~.changes.sql: Change countryNameStd type to text to allow merge-joining with place.country in analytical_stem_view

6107 11/09/2012 05:28 PM Aaron Marcuse-Kubitza

inputs/newWorld/geoscrub.schema.~.changes.sql: ALTER TABLE ... ALTER COLUMN statement: Reformatted to allow adding additional ALTER COLUMN clauses

6106 11/09/2012 05:25 PM Aaron Marcuse-Kubitza

inputs/.geoscrub/geoscrub_cleaned_unique/create.sql: Change latitudeDecimalVerbatim, longitudeDecimalVerbatim types to double precision to allow merge-joining with coordinates.latitude_deg, longitude_deg in analytical_stem_view

6105 11/09/2012 05:12 PM Aaron Marcuse-Kubitza

README.TXT: Data import: Instead of using `make schemas/rotate` and then renaming the public schema to the correct name, just rename directly to the correct name using `make schemas/rename/...`. Use new import_name to determine the import name instead of manually finding the date in the first datasource's log file name.

6104 11/09/2012 05:06 PM Aaron Marcuse-Kubitza

Added import_name, which gets the name of an import based on its log file names

6103 11/09/2012 04:50 PM Aaron Marcuse-Kubitza

README.TXT: Data import: Moved checking that imports were successful to before running make_analytical_db

6102 11/09/2012 04:41 PM Aaron Marcuse-Kubitza

root Makefile: Installation: Fixed bug where schemas/install needed to happen after inputs/install because some of the public schema's views now depend on inputs

6101 11/09/2012 04:07 PM Aaron Marcuse-Kubitza

schemas/vegbien.sql: analytical_stem_view: cultivatedBasis: Concatenate ''::text to geoscrub.isCultivatedReason so it will be cast to a text field both on PostgreSQL 9.1.1 (local machine), which removes any explicit cast to text when creating the view, and 9.1.6 (vegbiendev), which requires an explicit cast to text

6100 11/09/2012 03:49 PM Aaron Marcuse-Kubitza

schemas/vegbien.sql: analytical_stem_view: cultivatedBasis: Use geoscrub.isCultivatedReason instead when geoscrub.isCultivated is used as the source for cultivated

6099 11/09/2012 12:53 PM Aaron Marcuse-Kubitza

schemas/vegbien.sql: analytical_stem_view: Use geoscrub.isCultivated when taxonoccurrence.iscultivated is not provided (joining to geoscrub on the coordinates)
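
One common way to express "use the second source when the first is not provided" is a LEFT JOIN combined with COALESCE. The sketch below shows that pattern with Python's sqlite3 module; the occurrence and geo tables are hypothetical, and the actual view definition may be written differently.

```python
import sqlite3

def cultivated_flags():
    """Return (id, cultivated) pairs, preferring the occurrence's own flag."""
    conn = sqlite3.connect(":memory:")
    conn.executescript("""
        CREATE TABLE occurrence (id INTEGER, iscultivated INTEGER, lat REAL, lon REAL);
        CREATE TABLE geo (lat REAL, lon REAL, is_cultivated INTEGER);
        INSERT INTO occurrence VALUES
            (1, 0,    1.5, 2.5),   -- flag provided by the datasource
            (2, NULL, 1.5, 2.5),   -- no flag, coordinates match geo
            (3, NULL, 9.0, 9.0);   -- no flag, no geo match
        INSERT INTO geo VALUES (1.5, 2.5, 1);
    """)
    rows = conn.execute("""
        SELECT o.id, COALESCE(o.iscultivated, g.is_cultivated)
        FROM occurrence o
        LEFT JOIN geo g ON g.lat = o.lat AND g.lon = o.lon
        ORDER BY o.id
    """).fetchall()
    conn.close()
    return rows
```

Row 1 keeps its own value, row 2 falls back to the value matched on coordinates, and row 3 stays NULL because neither source has a value.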

6098 11/08/2012 06:38 PM Aaron Marcuse-Kubitza

root Makefile: VegBIEN DB: Schemas: Run all schema installs and uninstalls using no_search_path=1, so that the schemas in the automatic search_path are not required for the command to run

6097 11/08/2012 06:37 PM Aaron Marcuse-Kubitza

psql_vegbien: Added $no_search_path option to turn off the automatic SET search_path directive

6096 11/08/2012 06:11 PM Aaron Marcuse-Kubitza

schemas/vegbien.sql: taxonverbatim: Added growthform, for a growth form based on the taxon name rather than provided with the input data's taxonoccurrence

6095 11/08/2012 06:00 PM Aaron Marcuse-Kubitza

schemas/vegbien.ERD.mwb: Fixed lines

6094 11/08/2012 05:47 PM Aaron Marcuse-Kubitza

inputs/SALVIAS/plotMetadata/: LEFT JOINed with lookup_MethodCode to create plotMetadata_

6093 11/08/2012 04:52 PM Aaron Marcuse-Kubitza

schemas/vegbien.sql: threatened_taxonlabel_view: Fixed bug where DISTINCT was needed on the resulting taxonlabel_id because some descendants apparently appear in multiple threatened taxonlabels' subtrees
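
One way a descendant can land in more than one subtree is when one threatened label is itself nested under another. The sketch below shows that case in plain Python, with a hypothetical children mapping standing in for the taxonlabel hierarchy; whether this is the cause in the real data is not stated in the log.

```python
def descendants(children, root):
    """Return root plus everything below it, depth-first."""
    found = [root]
    for child in children.get(root, []):
        found.extend(descendants(children, child))
    return found

def threatened_ids(children, threatened_roots):
    """Return (all hits including repeats, sorted unique hits)."""
    all_hits = []
    for root in threatened_roots:
        all_hits.extend(descendants(children, root))
    # Equivalent in spirit to SELECT DISTINCT on the collected ids.
    return all_hits, sorted(set(all_hits))
```

For example, with `children = {1: [2, 3], 2: [4]}` and threatened roots `[1, 2]`, labels 2 and 4 are collected twice, and deduplicating leaves `[1, 2, 3, 4]`.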

6092 11/08/2012 04:42 PM Aaron Marcuse-Kubitza

schemas/vegbien.sql: analytical_*: Added threatened, using new threatened_taxonlabel lookup table

6091 11/08/2012 04:12 PM Aaron Marcuse-Kubitza

schemas/vegbien.sql: reference_by_shortname(): Fixed bug where params in sql-language functions need to be referenced using $-syntax

6090 11/08/2012 04:07 PM Aaron Marcuse-Kubitza

schemas/vegbien.sql: threatened_taxonlabel_view: Use new reference_by_shortname()

6089 11/08/2012 03:45 PM Aaron Marcuse-Kubitza

root Makefile: VegBIEN DB: Schemas: public: schemas/public/uninstall: Fixed bug where psql_vegbien needs to be run without public in the search_path because it may have already been deleted

6088 11/08/2012 03:44 PM Aaron Marcuse-Kubitza

root Makefile: VegBIEN DB: Schemas: public: schemas/public/install: Fixed bug where psql_vegbien needs to be run without public in the search_path because it doesn't exist, by setting public to the empty string (deleting it)

6087 11/08/2012 03:42 PM Aaron Marcuse-Kubitza

vegbien_dest: $schemas: Don't include the , before $public if it has been set to the empty string (deleted)
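
The separator handling can be sketched in a few lines. This Python version is illustrative only: vegbien_dest is a shell script, and the function name build_search_path and the sample schema names are assumptions.

```python
def build_search_path(schemas):
    """Join schema names with commas, skipping any that are empty.

    Skipping empty entries avoids a dangling separator such as "VegBank,"
    when the public schema has been set to the empty string.
    """
    return ",".join(schema for schema in schemas if schema)
```

`build_search_path(["VegBank", "public"])` gives `"VegBank,public"`, while `build_search_path(["VegBank", ""])` gives `"VegBank"` with no trailing comma.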

6086 11/08/2012 03:27 PM Aaron Marcuse-Kubitza

schemas/vegbien.sql: Added reference_by_shortname(). Using this function instead of the manual query should force the query planner to evaluate this expression first, rather than complexly reordering joins to place this nested select as a filter condition.

6085 11/08/2012 03:00 PM Aaron Marcuse-Kubitza

schemas/vegbien.sql: Added threatened_taxonlabel derived table with generating view threatened_taxonlabel_view

6084 11/08/2012 02:48 PM Aaron Marcuse-Kubitza

Updated inputs/UNCC/Specimen/test.xml.ref inserted row count

6083 11/08/2012 01:38 PM Aaron Marcuse-Kubitza

mappings/VegCore.csv: Added threatened

6082 11/08/2012 01:21 PM Aaron Marcuse-Kubitza

inputs/VegBank/vegbank.~.clean_up.sql: Remove private columns (plot.reallatitude, reallongitude) that should not be publicly visible

6081 11/08/2012 01:13 PM Aaron Marcuse-Kubitza

inputs/CVS/Organism/map.csv: Removed now-dropped realLatitude, realLongitude

6080 11/08/2012 01:12 PM Aaron Marcuse-Kubitza

inputs/CVS/Organism/map.csv: Removed now-dropped realLatitude, realLongitude

6079 11/08/2012 01:12 PM Aaron Marcuse-Kubitza

Added inputs/CVS/Organism/postprocess.sql to drop private realLatitude, realLongitude columns

6078 11/08/2012 01:10 PM Aaron Marcuse-Kubitza

input.Makefile: Staging tables installation: Added back postprocess.sql, which is now used for one-time dropping of private columns that should not be publicly visible

6077 11/08/2012 12:47 PM Aaron Marcuse-Kubitza

input.Makefile: Maps building: %/.map.csv.last_cleanup: $(dict) canon/translate: Use new $(translate?)

6076 11/08/2012 12:45 PM Aaron Marcuse-Kubitza

input.Makefile: Maps building: %/.map.csv.last_cleanup: Added $(srcDict) as a prerequisite, so that .last_cleanup will be re-run if it changes. Added empty $(srcDict) target in case it doesn't exist.

6075 11/08/2012 12:39 PM Aaron Marcuse-Kubitza

inputs/bien_web/observation/map.csv: Omit *_index because they are placeholder columns created by the MySQL to PostgreSQL translation

6074 11/08/2012 12:37 PM Aaron Marcuse-Kubitza

input.Makefile: Maps building: %/.map.csv.last_cleanup: Fixed bug where canon/translate can only use $(srcDict) if it exists for the datasource

6073 11/08/2012 12:26 PM Aaron Marcuse-Kubitza

inputs/bien_web/observation/: Regenerated from actual bien_web.observation schema on nimoy, which has additional columns

6072 11/08/2012 12:24 PM Aaron Marcuse-Kubitza

input.Makefile: SVN: $(svnFilesGlob): Added top-level map.csv, which can be used to apply a datasource-global data dictionary to all tables

6071 11/08/2012 12:18 PM Aaron Marcuse-Kubitza

input.Makefile: Maps building: %/.map.csv.last_cleanup: Also apply any map.csv at the top level of the datasource directory. This can be used to apply a datasource-global data dictionary to all tables.

6070 11/08/2012 12:01 PM Aaron Marcuse-Kubitza

my2pg: Also remove column comments. Note that these cannot be translated by sed, because PostgreSQL only allows setting column comments in a separate statement, not inline with the column's entry in the CREATE TABLE statement, and sed can only make replacements contiguous with the input line.

6069 11/08/2012 11:28 AM Aaron Marcuse-Kubitza

mappings/VegCore.csv: Removed incorrect duplicate entry for verbatimSubgenus

6068 11/08/2012 10:58 AM Aaron Marcuse-Kubitza

schemas/vegbien.sql: _taxon_family_require_std(): Fixed bug where name needed to be lowercased before checking if it ended in -aceae, to support family names that are uppercase. Note that this resulted in the family not being prepended to the TNRS input name for datasources with uppercase family names, so the next DB import will likely produce a number of unscrubbed TNRS input names which now have the uppercase family prepended.
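
The case-sensitivity issue can be shown with a small check. The Python functions below are hypothetical stand-ins for the suffix test, not the body of _taxon_family_require_std(), which is defined in the SQL schema.

```python
def ends_in_aceae_case_sensitive(name):
    """Suffix test without lowercasing; misses uppercase family names."""
    return name.endswith("aceae")

def ends_in_aceae(name):
    """Suffix test after lowercasing, so POACEAE and Poaceae both match."""
    return name.lower().endswith("aceae")
```

"POACEAE" fails the case-sensitive test but passes once the name is lowercased first, while a non-family name such as "Poa annua" fails both.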

6067 11/08/2012 10:17 AM Aaron Marcuse-Kubitza

inputs/.TNRS/tnrs/tnrs.make: Fixed bug where the log file path needs to be referenced relative to the make script itself, because otherwise the log file would go in inputs/.TNRS/logs/tnrs.make.log.sql

6066 11/08/2012 10:07 AM Aaron Marcuse-Kubitza

inputs/.TNRS/tnrs/tnrs.make: Fixed bug where just the basename of $0 needs to be used as the log file name
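
Taken together, this fix and the one in r6067 amount to deriving the log location from the script's own path. The sketch below shows that idea in Python using posixpath; the logs/ subdirectory and the .log.sql suffix are inferred from the path quoted in r6067, and the function name log_path_for is hypothetical. The real script does this in shell with $0.

```python
import posixpath

def log_path_for(script_path):
    """Build a log path next to the script, named after the script's basename."""
    script_dir = posixpath.dirname(script_path)
    # Use only the basename, so directory components of the invocation
    # path do not end up inside the log file name.
    log_name = posixpath.basename(script_path) + ".log.sql"
    return posixpath.join(script_dir, "logs", log_name)
```

Under these assumptions, `log_path_for("inputs/.TNRS/tnrs/tnrs.make")` resolves inside the script's own directory rather than one level up.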

6065 11/08/2012 09:51 AM Aaron Marcuse-Kubitza

Added inputs/IUCN/

6064 11/08/2012 09:51 AM Aaron Marcuse-Kubitza

input.Makefile: SVN: add: Added _src/ (when it exists). $(_svnFilesGlob): Added .url, .pdf files in _src/.

6063 11/08/2012 07:47 AM Aaron Marcuse-Kubitza

psql_vegbien: Use bash because it supports substitutions

6062 11/08/2012 07:46 AM Aaron Marcuse-Kubitza

psql_vegbien: Set the search_path to $out_schemas set by vegbien_dest. This will enable running any psql_vegbien script on a schema other than public.

6061 11/08/2012 07:07 AM Aaron Marcuse-Kubitza

schemas/vegbien.sql: analytical_stem_view: Changed inner joins on non-datasource taxonlabels to LEFT JOINs, to ensure that an entry is included in the analytical DB even if there was no matched taxonlabel. In theory, this shouldn't be necessary, because every taxonlabel should have a canonical taxonlabel since canon_label_id is auto-populated to the taxonlabel_id if there is no matched_label_id; there should be a taxonverbatim for every datasource and accepted taxonlabel because datasources link to taxonlabel via taxonverbatim and TNRS populates a taxonverbatim for every accepted taxonlabel; and there should be a parsed taxonlabel for every datasource taxonlabel because the mappings populate it.
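
The difference between the two join types can be seen on a toy dataset. The sketch below uses Python's sqlite3 module; the stem and matched_label tables are simplified assumptions, not the analytical_stem_view definition.

```python
import sqlite3

def stems_with(join_kind):
    """Return (stem_id, accepted_name) rows using either JOIN or LEFT JOIN."""
    if join_kind not in ("JOIN", "LEFT JOIN"):
        raise ValueError("join_kind must be 'JOIN' or 'LEFT JOIN'")
    conn = sqlite3.connect(":memory:")
    conn.executescript("""
        CREATE TABLE stem (stem_id INTEGER, label_id INTEGER);
        CREATE TABLE matched_label (label_id INTEGER, accepted_name TEXT);
        INSERT INTO stem VALUES (1, 100), (2, 200);
        -- Stem 2 has no matched label.
        INSERT INTO matched_label VALUES (100, 'Poa annua');
    """)
    rows = conn.execute(
        "SELECT s.stem_id, m.accepted_name FROM stem s "
        f"{join_kind} matched_label m ON m.label_id = s.label_id "
        "ORDER BY s.stem_id"
    ).fetchall()
    conn.close()
    return rows
```

The inner join drops stem 2 entirely, whereas the LEFT JOIN keeps it with a NULL name, which is the behavior wanted for entries without a matched taxonlabel.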

6060 11/08/2012 06:56 AM Aaron Marcuse-Kubitza

schemas/vegbien.sql: analytical_stem_view: Removed join on specimenreplicate, because it isn't used in the analytical DB. Each specimen will still get an entry in analytical_*, because it gets its own location.