Project

General

Profile

Statistics
| Revision:

# Date Author Comment
6234 11/16/2012 12:24 PM Aaron Marcuse-Kubitza

Added inputs/.geoscrub/_src/ to store Jim's geoscrub CSV

6233 11/16/2012 12:21 PM Aaron Marcuse-Kubitza

schemas/functions.sql: _map(): Changed error message for an unmapped value to "Value not in map" rather than "Invalid map value", because an unmapped value is not necessarily explicitly invalid

6232 11/16/2012 12:16 PM Aaron Marcuse-Kubitza

inputs/VegBank/plot_/map.csv: confidentialitystatus filter: Merged mappings for 0 with other public-equivalent fields. Note that fuzzed plots are still public, because the private columns have been removed.

6231 11/15/2012 11:16 PM Aaron Marcuse-Kubitza

inputs/VegBank/plot_/map.csv: Mapped confidentialitystatus to dcterms:accessRights with an appropriate _map filter

6230 11/15/2012 11:16 PM Aaron Marcuse-Kubitza

mappings/VegCore-VegBIEN.csv: Mapped dcterms:accessRights

6229 11/15/2012 11:14 PM Aaron Marcuse-Kubitza

schemas/functions.sql: _map(): Raise data_exception if value not in map and no default provided (not the same as a NULL default value)

6228 11/15/2012 10:54 PM Aaron Marcuse-Kubitza

mappings/VegCore-VegBIEN.csv: verbatimGrowthForm: Removed _map filter, which applied only to SALVIAS and has now been moved to the applicable SALVIAS tables

6227 11/15/2012 10:51 PM Aaron Marcuse-Kubitza

inputs/SALVIAS*/plotObservations/map.csv: Remapped Habit to growthForm with _map filter applied

6226 11/15/2012 10:43 PM Aaron Marcuse-Kubitza

sql_io.py: put_table(): Special handling for functions with hstore params: Fixed bug where need to unwrap literal values of mapping, which might be sql_gen.Literal objects

6225 11/15/2012 10:43 PM Aaron Marcuse-Kubitza

sql_gen.py: Added get_value()

6224 11/15/2012 10:42 PM Aaron Marcuse-Kubitza

dicts.py: join(): Added support for unhashable types, which are passed through. This adds support for SQL literal values which are dicts (hstores).

6223 11/15/2012 10:25 PM Aaron Marcuse-Kubitza

xml_func.py: Removed no longer used _map(), which has been replaced by a corresponding DB function

6222 11/15/2012 10:22 PM Aaron Marcuse-Kubitza

schemas/functions.sql: Added _map(), which uses the new hstore functionality. This expands _map() functionality to column-based import.

6221 11/15/2012 10:20 PM Aaron Marcuse-Kubitza

root Makefile: VegBIEN DB: DB and bien user: mk_db: hstore extension: Fixed bug where need to use `CREATE EXTENSION hstore SCHEMA pg_catalog` instead of createlang, because hstore must be explicitly created in pg_catalog or else it will be created in the public schema instead, causing it to get deleted every time the public schema is reinstalled and cascading the delete to everything (including in other schemas) that uses hstore

6220 11/15/2012 10:04 PM Aaron Marcuse-Kubitza

sql_io.py: put_table(): Added special handling for functions with hstore params. Note that although _map() doesn't exist yet as a DB function, this code must be in place before _map() is created to avoid param type mismatch errors.

6219 11/15/2012 08:57 PM Aaron Marcuse-Kubitza

root Makefile: PostgreSQL: postgres-Linux: Changed plpython to plpython3 in order to install plpython3u

6218 11/15/2012 08:30 PM Aaron Marcuse-Kubitza

schemas/py_functions.sql: _date(): Removed features that require dateutil, which is not available under plpython3u. This includes removing the now-unused date string parameter.

6217 11/15/2012 08:26 PM Aaron Marcuse-Kubitza

mappings/VegCore-VegBIEN.csv: Removed _date/date, because _date using a string date argument is no longer supported under plpython3u (dateutil is missing). Note that PostgreSQL's own date parsing is sufficient for most dates, so this use of _date is not strictly necessary and removing it will improve import times.

6216 11/15/2012 08:12 PM Aaron Marcuse-Kubitza

schemas/py_functions.sql: Replaced xrange() with range() for plpython3u

6215 11/15/2012 08:05 PM Aaron Marcuse-Kubitza

root Makefile: Python: python-Linux: Also install python3, needed by plpython3u

6214 11/15/2012 08:04 PM Aaron Marcuse-Kubitza

schemas/py_functions.sql: Updated except clause syntax for PostgreSQL 9.1.6

6213 11/15/2012 08:03 PM Aaron Marcuse-Kubitza

schemas/*.sql: Updated for PostgreSQL 9.1.6, which has standard_conforming_strings = on (which affects \-escapes in string literals), escape_string_warning not explicitly set, and uses ALTER TABLE ONLY instead of ALTER TABLE

6212 11/15/2012 07:49 PM Aaron Marcuse-Kubitza

README.TXT: Removed step to manually run make_analytical_db, now that this is done automatically by import_all. Added separate instructions to remake the analytical DB.

6211 11/15/2012 07:45 PM Aaron Marcuse-Kubitza

import_all: Change to main directory make targets are run from. Use relative paths to bin/ commands, which is possible now that the current dir is set.

6210 11/15/2012 07:41 PM Aaron Marcuse-Kubitza

import_all: Create a background process that waits until the import is done and then runs make_analytical_db

6209 11/15/2012 07:36 PM Aaron Marcuse-Kubitza

Added waitpid

6208 11/15/2012 06:52 PM Aaron Marcuse-Kubitza

import_all: Documented that `wait %1` waits for asynchronous commands

6207 11/15/2012 06:40 PM Aaron Marcuse-Kubitza

root Makefile: VegBIEN DB: DB and bien user: mk_db: Also install hstore extension. Note that this is only supported by PostgreSQL 9.1+.

6206 11/15/2012 06:33 PM Aaron Marcuse-Kubitza

input.Makefile: Editing import: Updated queries for current schema

6205 11/15/2012 06:27 PM Aaron Marcuse-Kubitza

inputs/.geoscrub/geoscrub_cultivated/create.sql: Fixed bug where need to filter out NULL lat/longs because primary keys can't contain NULL values

6204 11/15/2012 06:17 PM Aaron Marcuse-Kubitza

schemas/py_functions.sql: Changed function languages to plpython3u to match the new installed version. Note that plpythonu is not available on Mac under PostgreSQL 9.1.6.

6203 11/15/2012 05:59 PM Aaron Marcuse-Kubitza

reinstall_all: Fixed bug where also need to include datasources starting with . such as .TNRS/, by using with_all's new $all option

6202 11/15/2012 05:58 PM Aaron Marcuse-Kubitza

with_all: Added $all option to also include datasources starting with . such as .TNRS/. This is necessary for reinstall_all, which needs to install all datasources.

6201 11/15/2012 05:18 PM Aaron Marcuse-Kubitza

root Makefile: PostgreSQL: $(pg_ctl-*): Fixed bug where need to pause for a few seconds after restarting PostgreSQL, to wait for the server to be ready to accept connections

6200 11/15/2012 05:12 PM Aaron Marcuse-Kubitza

root Makefile: Installation: uninstall: Removed inputs/uninstall because the DB will be uninstalled anyway, so the inputs don't need to be individually removed first

6199 11/15/2012 05:11 PM Aaron Marcuse-Kubitza

schemas/postgresql.Mac.conf: Added back unix_socket_directory setting, which is apparently still needed in PostgreSQL 9.1.6

6198 11/15/2012 05:06 PM Aaron Marcuse-Kubitza

root Makefile: PostgreSQL: postgres-Linux: Also install postgresql.conf

6197 11/15/2012 04:54 PM Aaron Marcuse-Kubitza

root Makefile: PostgreSQL: postgres-Darwin: Also install postgresql.Mac.conf

6196 11/15/2012 04:40 PM Aaron Marcuse-Kubitza

root Makefile: PostgreSQL: $(macUsePostgresLib): Factored out PostgreSQL dir to $(macPostgresDir)

6195 11/15/2012 04:38 PM Aaron Marcuse-Kubitza

schemas/postgresql.Mac.conf: Updated to PostgreSQL 9.1.6's postgresql.conf

6194 11/15/2012 04:29 PM Aaron Marcuse-Kubitza

root Makefile: Datasources: inputs/install: Fixed bug where need to `wait` after `. bin/reinstall_all` to wait for inputs to finish installing before installing the public schema. This is necessary because views in the public schema now have dependencies on some datasources, such as TNRS.

6193 11/15/2012 04:25 PM Aaron Marcuse-Kubitza

root Makefile: PostgreSQL: $(psqlAsAdmin): Use new $(asAdmin)

6192 11/15/2012 04:25 PM Aaron Marcuse-Kubitza

root Makefile: VegBIEN DB: Schemas: schemas/public/install: Use $(psqlNoSearchPath) instead of $(psqlAsBien) because the search_path is set by vegbien.sql

6191 11/15/2012 04:16 PM Aaron Marcuse-Kubitza

root Makefile: Datasources: Added inputs/install override which runs `. bin/reinstall_all` instead, in order to install all datasources simultaneously

6190 11/15/2012 04:03 PM Aaron Marcuse-Kubitza

root Makefile: Python: python-Darwin: Added instructions to install Python 3.2 (Python 2 comes with Mac OS X, but Python 3.2 is needed for plpython3u)

6189 11/15/2012 03:55 PM Aaron Marcuse-Kubitza

root Makefile: VegBIEN DB: DB and bien user: mk_db: Updated for PostgreSQL 9.1.6 on the Mac, which only provides plpython3u (Python 3)

6188 11/15/2012 03:54 PM Aaron Marcuse-Kubitza

root Makefile: VegBIEN DB: DB and bien user: mk_db: Updated for PostgreSQL 9.1.6, which requires the DB name to be specified on the command line instead of in the $PGDATABASE env var set by postgres_vegbien. Fixed bug where need to run createlang as postgres superuser, because plpythonu is an untrusted language (with unrestricted access to the entire DB).

6187 11/15/2012 03:51 PM Aaron Marcuse-Kubitza

root Makefile: PostgreSQL: postgres-Darwin: Updated for PostgreSQL 9.1.6, which requires some /usr/lib/ symlinks to be changed to newer versions installed in the PostgreSQL lib/ dir

6186 11/15/2012 03:49 PM Aaron Marcuse-Kubitza

input.Makefile: $(psqlAsBien), csv2db: Turn off the automatic search_path where needed, because when the input is installed, the schemas in it may not exist yet

6185 11/15/2012 02:16 PM Aaron Marcuse-Kubitza

schemas/vegbien.sql: place: Renamed geosource_valid to geovalid. (It had gotten renamed in the reference -> source rename.)

6184 11/15/2012 02:12 PM Aaron Marcuse-Kubitza

schemas/vegbien.sql: location: Renamed confidentialitystatus->accesslevel, confidentialityreason->accessconditions to match the corresponding fields in source. Note that accessconditions stores more than confidentialityreason did, because it can contain details about the accesslevel in addition to the reason for it.

6183 11/15/2012 02:07 PM Aaron Marcuse-Kubitza

schemas/vegbien.sql: source.accesslevel, location.confidentialitystatus: Changed type to accesslevel

6182 11/15/2012 02:03 PM Aaron Marcuse-Kubitza

schemas/vegbien.sql: Added accesslevel enum

6181 11/15/2012 01:51 PM Aaron Marcuse-Kubitza

inputs/import.stats.xls: Updated import times

6180 11/14/2012 06:37 PM Aaron Marcuse-Kubitza

Regenerated vegbien.ERD exports

6179 11/14/2012 06:30 PM Aaron Marcuse-Kubitza

schemas/vegbien.sql: Renamed reference -> source to make this table more broadly applicable, and because this now stores the datasource metadata

6178 11/14/2012 06:19 PM Aaron Marcuse-Kubitza

schemas/vegbien.sql: referencename: Scope it by top-level datasource, because institutionCodes (which map to this field) are not globally unique. This involves renaming the previous reference_id field, which was for the matched reference, to matched_reference_id, to allow a scoping reference_id field.

6177 11/14/2012 06:16 PM Aaron Marcuse-Kubitza

mappings/VegCore-VegBIEN.csv: Made taxonoccurrence.verbatimcollectorname an fkey to party, and renamed it to collector_id

6176 11/14/2012 05:57 PM Aaron Marcuse-Kubitza

inputs/VegBank/taxonobservation_/map.csv: Mapped new givenname, surname (from collector_id's party) to recordedBy

6175 11/14/2012 05:54 PM Aaron Marcuse-Kubitza

inputs/VegBank/taxonobservation_/create.sql: Also join to collector_id's party to include collector name

6174 11/14/2012 05:53 PM Aaron Marcuse-Kubitza

inputs/VegBank/vegbank.~.clean_up.sql: Rename taxoninterpretation.party_id to taxoninterpretation_party_id to make it globally unique when joining taxoninterpretation to other tables

6173 11/14/2012 05:48 PM Aaron Marcuse-Kubitza

inputs/VegBank/vegbank.~.clean_up.sql: Rename party.d_obscount to party_d_obscount to make it globally unique when joining with other tables

6172 11/14/2012 05:43 PM Aaron Marcuse-Kubitza

inputs/VegBank/vegbank.~.clean_up.sql: Rename taxoninterpretation.party_id to taxoninterpretation_party_id to make it globally unique when joining taxoninterpretation to other tables

6171 11/14/2012 05:35 PM Aaron Marcuse-Kubitza

mappings/VegCore-VegBIEN.csv: Made taxonoccurrence.verbatimcollectorname an fkey to party, and renamed it to collector_id

6170 11/14/2012 05:32 PM Aaron Marcuse-Kubitza

input.Makefile: Existing maps discovery: $(allTables): Fixed bug where need to remove extra whitespace before $(tables) when there are no $(joinedTables)

6169 11/14/2012 05:32 PM Aaron Marcuse-Kubitza

lib/mappings.Makefile: Checking if $(termsSubdirs) defined: Fixed bug where can't use ifndef because that checks if the variable is empty, not undefined. Need to use `ifeq ($(origin var),undefined)` instead.

6168 11/14/2012 05:11 PM Aaron Marcuse-Kubitza

inputs/TEAM/V*/map.csv: Omit *Method, because it just contains "Derived" for a small fraction of the rows

6167 11/14/2012 04:47 PM Aaron Marcuse-Kubitza

inputs/SALVIAS/: Updated to new salvias_plots export on nimoy, which has a different schema

6166 11/14/2012 04:03 PM Aaron Marcuse-Kubitza

inputs/SALVIAS/salvias_plots.~.clean_up.sql: Moved Ensure globally unique column names to end to match VegBank order

6165 11/14/2012 03:54 PM Aaron Marcuse-Kubitza

my2pg: *int types: Added mediumint

6164 11/14/2012 03:30 PM Aaron Marcuse-Kubitza

Placed inputs/SALVIAS/_archive/ under version control

6163 11/14/2012 03:18 PM Aaron Marcuse-Kubitza

inputs/SALVIAS/salvias_plots.~.clean_up.sql: Remove private data that should not be publicly visible, indicated by plotMetadata.AccessCode = 1

6162 11/14/2012 03:17 PM Aaron Marcuse-Kubitza

inputs/SALVIAS/salvias_plots.~.clean_up.sql: Enable cascading deletes by adding the necessary fkeys

6161 11/14/2012 03:17 PM Aaron Marcuse-Kubitza

Added inputs/SALVIAS/_src/salvias_data_access_controls.txt

6160 11/14/2012 02:26 PM Aaron Marcuse-Kubitza

inputs/import.stats.xls: Updated import times

6159 11/14/2012 02:25 PM Aaron Marcuse-Kubitza

inputs/.geoscrub/import_order.txt: Fixed bug where geoscrub_cultivated needs to be installed after geoscrub_cleaned_unique, not before as it would be with the default alphabetical sort order

6158 11/14/2012 02:24 PM Aaron Marcuse-Kubitza

inputs/.geoscrub/geoscrub_cultivated/: Use _no_import file to exclude geoscrub_cultivated from the import, because it's used directly as a lookup table by analytical_stem rather than being imported. This ensures that there is no import log or input row count for geoscrub_cultivated in the import times, which would skew the import row count because the row count would be included even though no columns are mapped.

6157 11/14/2012 02:18 PM Aaron Marcuse-Kubitza

input.Makefile: $(tables): Fixed bug where need to use $(importTables) instead of $(tables) in all places that should use only imported tables, rather than just in the import process itself

6156 11/14/2012 02:13 PM Aaron Marcuse-Kubitza

input.Makefile: Import to VegBIEN: Added support for tables which should be installed but not imported, but which must be installed after tables which are imported rather than before. This currently applies to geoscrub.geoscrub_cultivated, which depends on geoscrub_cleaned_unique (and therefore must be installed after it), but which should not be imported because it's used directly as a lookup table by analytical_stem.

6155 11/14/2012 10:02 AM Aaron Marcuse-Kubitza

inputs/VegBank/vegbank.~.clean_up.sql: Documented that plots with confidentialitystatus >= 4 are not deleted if their embargos have already expired. This applies to the Shenandoah NP data, which has confidentialitystatus = 5 but is no longer embargoed according to the embargo table

6154 11/13/2012 08:10 PM Aaron Marcuse-Kubitza

inputs/SALVIAS/: Mapped unmapped fields with a VegCore/VegBIEN equivalent. plotMetadata_/: Remapped life_zone to communityID because it is now alt-ed together with vegetation*, and thus not just a description with life_zone_code as its globally unique name.

6153 11/13/2012 07:35 PM Aaron Marcuse-Kubitza

Regenerated vegbien.ERD exports

6152 11/13/2012 07:10 PM Aaron Marcuse-Kubitza

schemas/vegbien.sql: referencetype: Added terms from reference.referencetype closed list in VegBank data dictionary. Cited sources in comment.

6151 11/13/2012 06:39 PM Aaron Marcuse-Kubitza

schemas/vegbien.sql: reference.referencetype: Changed type to referencetype enum

6150 11/13/2012 06:38 PM Aaron Marcuse-Kubitza

schemas/vegbien.sql: Added referencetype enum, containing VegBank's values in reference.referencetype as well as values for bien_web.datasource.aggregatorOrPrimary and bien_web.dataSourceNormalized.isHerbarium,isAggregator

6149 11/13/2012 06:23 PM Aaron Marcuse-Kubitza

specimenreplicate: Made institution_id an fkey to referencename instead of party, to later be matched up with reference entries for each aggregator's subprovider

6148 11/13/2012 06:15 PM Aaron Marcuse-Kubitza

schemas/vegbien.sql: referencename: Added referencename_unique unique index on name

6147 11/13/2012 06:00 PM Aaron Marcuse-Kubitza

schemas/vegbien.sql: referencename: Made reference_id optional so it can be populated later when referencenames are scrubbed

6146 11/13/2012 05:58 PM Aaron Marcuse-Kubitza

schemas/vegbien.sql: referencename: Renamed identifier to name because it is specifically any name for the reference, not necessarily an ID

6145 11/13/2012 05:53 PM Aaron Marcuse-Kubitza

schemas/vegbien.sql: Renamed referencealtident to referencename to allow any verbatim reference name to go here, with reference containing the corresponding accepted reference name

6144 11/13/2012 05:50 PM Aaron Marcuse-Kubitza

schemas/vegbien.sql: reference: Added accesslevel, accessconditions from bien_web.datasource

6143 11/13/2012 05:41 PM Aaron Marcuse-Kubitza

schemas/vegbien.sql: address: Added street2 from bien_web.party.address2

6142 11/13/2012 05:38 PM Aaron Marcuse-Kubitza

schemas/vegbien.sql: address: Renamed fields to bien_web.party names

6141 11/13/2012 05:12 PM Aaron Marcuse-Kubitza

schemas/vegbien.sql: party: Added department from bien_web.party

6140 11/13/2012 05:06 PM Aaron Marcuse-Kubitza

inputs/SALVIAS/plotMetadata_/map.csv: Mapped lookup_MethodCode_Description to new observationMeasure

6139 11/13/2012 05:06 PM Aaron Marcuse-Kubitza

schemas/vegbien.sql: method: Made name optional when description or observationmeasure is specified

6138 11/13/2012 05:03 PM Aaron Marcuse-Kubitza

schemas/vegbien.sql: method: method_unique: Include observationmeasure since the method name sometimes is not globally unique (e.g. in SALVIAS)

6137 11/13/2012 04:58 PM Aaron Marcuse-Kubitza

mappings/VegCore-VegBIEN.csv: Mapped observationMeasure

6136 11/13/2012 04:57 PM Aaron Marcuse-Kubitza

mappings/VegCore.csv: observationMeasure: Added source to DwC samplingProtocol

6135 11/13/2012 04:54 PM Aaron Marcuse-Kubitza

mappings/VegCore.csv: Added observationMeasure