/ - Changes - BIEN 3 - NCEAS Projects

root @ 6234

#	Date	Author	Comment
6234	11/16/2012 12:24 PM	Aaron Marcuse-Kubitza	Added inputs/.geoscrub/_src/ to store Jim's geoscrub CSV
6233	11/16/2012 12:21 PM	Aaron Marcuse-Kubitza	schemas/functions.sql: _map(): Changed error message for an unmapped value to "Value not in map" rather than "Invalid map value", because an unmapped value is not necessarily explicitly invalid
6232	11/16/2012 12:16 PM	Aaron Marcuse-Kubitza	inputs/VegBank/plot_/map.csv: confidentialitystatus filter: Merged mappings for 0 with other public-equivalent fields. Note that fuzzed plots are still public, because the private columns have been removed.
6231	11/15/2012 11:16 PM	Aaron Marcuse-Kubitza	inputs/VegBank/plot_/map.csv: Mapped confidentialitystatus to dcterms:accessRights with an appropriate _map filter
6230	11/15/2012 11:16 PM	Aaron Marcuse-Kubitza	mappings/VegCore-VegBIEN.csv: Mapped dcterms:accessRights
6229	11/15/2012 11:14 PM	Aaron Marcuse-Kubitza	schemas/functions.sql: _map(): Raise data_exception if value not in map and no default provided (not the same as a NULL default value)
6228	11/15/2012 10:54 PM	Aaron Marcuse-Kubitza	mappings/VegCore-VegBIEN.csv: verbatimGrowthForm: Removed _map filter, which applied only to SALVIAS and has now been moved to the applicable SALVIAS tables
6227	11/15/2012 10:51 PM	Aaron Marcuse-Kubitza	inputs/SALVIAS*/plotObservations/map.csv: Remapped Habit to growthForm with _map filter applied
6226	11/15/2012 10:43 PM	Aaron Marcuse-Kubitza	sql_io.py: put_table(): Special handling for functions with hstore params: Fixed bug where need to unwrap literal values of mapping, which might be sql_gen.Literal objects
6225	11/15/2012 10:43 PM	Aaron Marcuse-Kubitza	sql_gen.py: Added get_value()
6224	11/15/2012 10:42 PM	Aaron Marcuse-Kubitza	dicts.py: join(): Added support for unhashable types, which are passed through. This adds support for SQL literal values which are dicts (hstores).
6223	11/15/2012 10:25 PM	Aaron Marcuse-Kubitza	xml_func.py: Removed no longer used _map(), which has been replaced by a corresponding DB function
6222	11/15/2012 10:22 PM	Aaron Marcuse-Kubitza	schemas/functions.sql: Added _map(), which uses the new hstore functionality. This expands _map() functionality to column-based import.
6221	11/15/2012 10:20 PM	Aaron Marcuse-Kubitza	root Makefile: VegBIEN DB: DB and bien user: mk_db: hstore extension: Fixed bug where `CREATE EXTENSION hstore SCHEMA pg_catalog` needs to be used instead of createlang. hstore must be explicitly created in pg_catalog; otherwise it is created in the public schema, which causes it to be deleted every time the public schema is reinstalled, with the delete cascading to everything (including in other schemas) that uses hstore
6220	11/15/2012 10:04 PM	Aaron Marcuse-Kubitza	sql_io.py: put_table(): Added special handling for functions with hstore params. Note that although _map() doesn't exist yet as a DB function, this code must be in place before _map() is created to avoid param type mismatch errors.
6219	11/15/2012 08:57 PM	Aaron Marcuse-Kubitza	root Makefile: PostgreSQL: postgres-Linux: Changed plpython to plpython3 in order to install plpython3u
6218	11/15/2012 08:30 PM	Aaron Marcuse-Kubitza	schemas/py_functions.sql: _date(): Removed features that require dateutil, which is not available under plpython3u. This includes removing the now-unused date string parameter.
6217	11/15/2012 08:26 PM	Aaron Marcuse-Kubitza	mappings/VegCore-VegBIEN.csv: Removed _date/date, because _date using a string date argument is no longer supported under plpython3u (dateutil is missing). Note that PostgreSQL's own date parsing is sufficient for most dates, so this use of _date is not strictly necessary and removing it will improve import times.
6216	11/15/2012 08:12 PM	Aaron Marcuse-Kubitza	schemas/py_functions.sql: Replaced xrange() with range() for plpython3u
6215	11/15/2012 08:05 PM	Aaron Marcuse-Kubitza	root Makefile: Python: python-Linux: Also install python3, needed by plpython3u
6214	11/15/2012 08:04 PM	Aaron Marcuse-Kubitza	schemas/py_functions.sql: Updated except clause syntax for PostgreSQL 9.1.6
6213	11/15/2012 08:03 PM	Aaron Marcuse-Kubitza	schemas/*.sql: Updated for PostgreSQL 9.1.6, which has standard_conforming_strings = on (which affects \-escapes in string literals), escape_string_warning not explicitly set, and uses ALTER TABLE ONLY instead of ALTER TABLE
6212	11/15/2012 07:49 PM	Aaron Marcuse-Kubitza	README.TXT: Removed step to manually run make_analytical_db, now that this is done automatically by import_all. Added separate instructions to remake the analytical DB.
6211	11/15/2012 07:45 PM	Aaron Marcuse-Kubitza	import_all: Change to the main directory that make targets are run from. Use relative paths to bin/ commands, which is possible now that the current dir is set.
6210	11/15/2012 07:41 PM	Aaron Marcuse-Kubitza	import_all: Create a background process that waits until the import is done and then runs make_analytical_db
6209	11/15/2012 07:36 PM	Aaron Marcuse-Kubitza	Added waitpid
6208	11/15/2012 06:52 PM	Aaron Marcuse-Kubitza	import_all: Documented that `wait %1` waits for asynchronous commands
6207	11/15/2012 06:40 PM	Aaron Marcuse-Kubitza	root Makefile: VegBIEN DB: DB and bien user: mk_db: Also install hstore extension. Note that this is only supported by PostgreSQL 9.1+.
6206	11/15/2012 06:33 PM	Aaron Marcuse-Kubitza	input.Makefile: Editing import: Updated queries for current schema
6205	11/15/2012 06:27 PM	Aaron Marcuse-Kubitza	inputs/.geoscrub/geoscrub_cultivated/create.sql: Fixed bug where need to filter out NULL lat/longs because primary keys can't contain NULL values
6204	11/15/2012 06:17 PM	Aaron Marcuse-Kubitza	schemas/py_functions.sql: Changed function languages to plpython3u to match the new installed version. Note that plpythonu is not available on Mac under PostgreSQL 9.1.6.
6203	11/15/2012 05:59 PM	Aaron Marcuse-Kubitza	reinstall_all: Fixed bug where also need to include datasources starting with . such as .TNRS/, by using with_all's new $all option
6202	11/15/2012 05:58 PM	Aaron Marcuse-Kubitza	with_all: Added $all option to also include datasources starting with . such as .TNRS/. This is necessary for reinstall_all, which needs to install all datasources.
6201	11/15/2012 05:18 PM	Aaron Marcuse-Kubitza	root Makefile: PostgreSQL: $(pg_ctl-*): Fixed bug where need to pause for a few seconds after restarting PostgreSQL, to wait for the server to be ready to accept connections
6200	11/15/2012 05:12 PM	Aaron Marcuse-Kubitza	root Makefile: Installation: uninstall: Removed inputs/uninstall because the DB will be uninstalled anyway, so the inputs don't need to be individually removed first
6199	11/15/2012 05:11 PM	Aaron Marcuse-Kubitza	schemas/postgresql.Mac.conf: Added back unix_socket_directory setting, which is apparently still needed in PostgreSQL 9.1.6
6198	11/15/2012 05:06 PM	Aaron Marcuse-Kubitza	root Makefile: PostgreSQL: postgres-Linux: Also install postgresql.conf
6197	11/15/2012 04:54 PM	Aaron Marcuse-Kubitza	root Makefile: PostgreSQL: postgres-Darwin: Also install postgresql.Mac.conf
6196	11/15/2012 04:40 PM	Aaron Marcuse-Kubitza	root Makefile: PostgreSQL: $(macUsePostgresLib): Factored out PostgreSQL dir to $(macPostgresDir)
6195	11/15/2012 04:38 PM	Aaron Marcuse-Kubitza	schemas/postgresql.Mac.conf: Updated to PostgreSQL 9.1.6's postgresql.conf
6194	11/15/2012 04:29 PM	Aaron Marcuse-Kubitza	root Makefile: Datasources: inputs/install: Fixed bug where need to `wait` after `. bin/reinstall_all` to wait for inputs to finish installing before installing the public schema. This is necessary because views in the public schema now have dependencies on some datasources, such as TNRS.
6193	11/15/2012 04:25 PM	Aaron Marcuse-Kubitza	root Makefile: PostgreSQL: $(psqlAsAdmin): Use new $(asAdmin)
6192	11/15/2012 04:25 PM	Aaron Marcuse-Kubitza	root Makefile: VegBIEN DB: Schemas: schemas/public/install: Use $(psqlNoSearchPath) instead of $(psqlAsBien) because the search_path is set by vegbien.sql
6191	11/15/2012 04:16 PM	Aaron Marcuse-Kubitza	root Makefile: Datasources: Added inputs/install override which runs `. bin/reinstall_all` instead, in order to install all datasources simultaneously
6190	11/15/2012 04:03 PM	Aaron Marcuse-Kubitza	root Makefile: Python: python-Darwin: Added instructions to install Python 3.2 (Python 2 comes with Mac OS X, but Python 3.2 is needed for plpython3u)
6189	11/15/2012 03:55 PM	Aaron Marcuse-Kubitza	root Makefile: VegBIEN DB: DB and bien user: mk_db: Updated for PostgreSQL 9.1.6 on the Mac, which only provides plpython3u (Python 3)
6188	11/15/2012 03:54 PM	Aaron Marcuse-Kubitza	root Makefile: VegBIEN DB: DB and bien user: mk_db: Updated for PostgreSQL 9.1.6, which requires the DB name to be specified on the command line instead of in the $PGDATABASE env var set by postgres_vegbien. Fixed bug where need to run createlang as postgres superuser, because plpythonu is an untrusted language (with unrestricted access to the entire DB).
6187	11/15/2012 03:51 PM	Aaron Marcuse-Kubitza	root Makefile: PostgreSQL: postgres-Darwin: Updated for PostgreSQL 9.1.6, which requires some /usr/lib/ symlinks to be changed to newer versions installed in the PostgreSQL lib/ dir
6186	11/15/2012 03:49 PM	Aaron Marcuse-Kubitza	input.Makefile: $(psqlAsBien), csv2db: Turn off the automatic search_path where needed, because when the input is installed, the schemas in it may not exist yet
6185	11/15/2012 02:16 PM	Aaron Marcuse-Kubitza	schemas/vegbien.sql: place: Renamed geosource_valid to geovalid. (It had gotten renamed in the reference -> source rename.)
6184	11/15/2012 02:12 PM	Aaron Marcuse-Kubitza	schemas/vegbien.sql: location: Renamed confidentialitystatus->accesslevel, confidentialityreason->accessconditions to match the corresponding fields in source. Note that accessconditions stores more than confidentialityreason did, because it can contain details about the accesslevel in addition to the reason for it.
6183	11/15/2012 02:07 PM	Aaron Marcuse-Kubitza	schemas/vegbien.sql: source.accesslevel, location.confidentialitystatus: Changed type to accesslevel
6182	11/15/2012 02:03 PM	Aaron Marcuse-Kubitza	schemas/vegbien.sql: Added accesslevel enum
6181	11/15/2012 01:51 PM	Aaron Marcuse-Kubitza	inputs/import.stats.xls: Updated import times
6180	11/14/2012 06:37 PM	Aaron Marcuse-Kubitza	Regenerated vegbien.ERD exports
6179	11/14/2012 06:30 PM	Aaron Marcuse-Kubitza	schemas/vegbien.sql: Renamed reference -> source to make this table more broadly applicable, and because this now stores the datasource metadata
6178	11/14/2012 06:19 PM	Aaron Marcuse-Kubitza	schemas/vegbien.sql: referencename: Scope it by top-level datasource, because institutionCodes (which map to this field) are not globally unique. This involves renaming the previous reference_id field, which was for the matched reference, to matched_reference_id, to allow a scoping reference_id field.
6177	11/14/2012 06:16 PM	Aaron Marcuse-Kubitza	mappings/VegCore-VegBIEN.csv: Made taxonoccurrence.verbatimcollectorname an fkey to party, and renamed it to collector_id
6176	11/14/2012 05:57 PM	Aaron Marcuse-Kubitza	inputs/VegBank/taxonobservation_/map.csv: Mapped new givenname, surname (from collector_id's party) to recordedBy
6175	11/14/2012 05:54 PM	Aaron Marcuse-Kubitza	inputs/VegBank/taxonobservation_/create.sql: Also join to collector_id's party to include collector name
6174	11/14/2012 05:53 PM	Aaron Marcuse-Kubitza	inputs/VegBank/vegbank.~.clean_up.sql: Rename taxoninterpretation.party_id to taxoninterpretation_party_id to make it globally unique when joining taxoninterpretation to other tables
6173	11/14/2012 05:48 PM	Aaron Marcuse-Kubitza	inputs/VegBank/vegbank.~.clean_up.sql: Rename party.d_obscount to party_d_obscount to make it globally unique when joining with other tables
6172	11/14/2012 05:43 PM	Aaron Marcuse-Kubitza	inputs/VegBank/vegbank.~.clean_up.sql: Rename taxoninterpretation.party_id to taxoninterpretation_party_id to make it globally unique when joining taxoninterpretation to other tables
6171	11/14/2012 05:35 PM	Aaron Marcuse-Kubitza	mappings/VegCore-VegBIEN.csv: Made taxonoccurrence.verbatimcollectorname an fkey to party, and renamed it to collector_id
6170	11/14/2012 05:32 PM	Aaron Marcuse-Kubitza	input.Makefile: Existing maps discovery: $(allTables): Fixed bug where need to remove extra whitespace before $(tables) when there are no $(joinedTables)
6169	11/14/2012 05:32 PM	Aaron Marcuse-Kubitza	lib/mappings.Makefile: Checking if $(termsSubdirs) defined: Fixed bug where ifndef can't be used, because it checks whether the variable is empty, not whether it is undefined. `ifeq ($(origin var),undefined)` needs to be used instead.
6168	11/14/2012 05:11 PM	Aaron Marcuse-Kubitza	inputs/TEAM/V/map.csv: Omit Method, because it just contains "Derived" for a small fraction of the rows
6167	11/14/2012 04:47 PM	Aaron Marcuse-Kubitza	inputs/SALVIAS/: Updated to new salvias_plots export on nimoy, which has a different schema
6166	11/14/2012 04:03 PM	Aaron Marcuse-Kubitza	inputs/SALVIAS/salvias_plots.~.clean_up.sql: Moved "Ensure globally unique column names" to end to match VegBank order
6165	11/14/2012 03:54 PM	Aaron Marcuse-Kubitza	my2pg: *int types: Added mediumint
6164	11/14/2012 03:30 PM	Aaron Marcuse-Kubitza	Placed inputs/SALVIAS/_archive/ under version control
6163	11/14/2012 03:18 PM	Aaron Marcuse-Kubitza	inputs/SALVIAS/salvias_plots.~.clean_up.sql: Remove private data that should not be publicly visible, indicated by plotMetadata.AccessCode = 1
6162	11/14/2012 03:17 PM	Aaron Marcuse-Kubitza	inputs/SALVIAS/salvias_plots.~.clean_up.sql: Enable cascading deletes by adding the necessary fkeys
6161	11/14/2012 03:17 PM	Aaron Marcuse-Kubitza	Added inputs/SALVIAS/_src/salvias_data_access_controls.txt
6160	11/14/2012 02:26 PM	Aaron Marcuse-Kubitza	inputs/import.stats.xls: Updated import times
6159	11/14/2012 02:25 PM	Aaron Marcuse-Kubitza	inputs/.geoscrub/import_order.txt: Fixed bug where geoscrub_cultivated needs to be installed after geoscrub_cleaned_unique, not before as it would be with the default alphabetical sort order
6158	11/14/2012 02:24 PM	Aaron Marcuse-Kubitza	inputs/.geoscrub/geoscrub_cultivated/: Use _no_import file to exclude geoscrub_cultivated from the import, because it's used directly as a lookup table by analytical_stem rather than being imported. This ensures that there is no import log or input row count for geoscrub_cultivated in the import times, which would skew the import row count because the row count would be included even though no columns are mapped.
6157	11/14/2012 02:18 PM	Aaron Marcuse-Kubitza	input.Makefile: $(tables): Fixed bug where need to use $(importTables) instead of $(tables) in all places that should use only imported tables, rather than just in the import process itself
6156	11/14/2012 02:13 PM	Aaron Marcuse-Kubitza	input.Makefile: Import to VegBIEN: Added support for tables which should be installed but not imported, but which must be installed after tables which are imported rather than before. This currently applies to geoscrub.geoscrub_cultivated, which depends on geoscrub_cleaned_unique (and therefore must be installed after it), but which should not be imported because it's used directly as a lookup table by analytical_stem.
6155	11/14/2012 10:02 AM	Aaron Marcuse-Kubitza	inputs/VegBank/vegbank.~.clean_up.sql: Documented that plots with confidentialitystatus >= 4 are not deleted if their embargos have already expired. This applies to the Shenandoah NP data, which has confidentialitystatus = 5 but is no longer embargoed according to the embargo table
6154	11/13/2012 08:10 PM	Aaron Marcuse-Kubitza	inputs/SALVIAS/: Mapped unmapped fields with a VegCore/VegBIEN equivalent. plotMetadata_/: Remapped life_zone to communityID because it is now alt-ed together with vegetation*, and thus not just a description with life_zone_code as its globally unique name.
6153	11/13/2012 07:35 PM	Aaron Marcuse-Kubitza	Regenerated vegbien.ERD exports
6152	11/13/2012 07:10 PM	Aaron Marcuse-Kubitza	schemas/vegbien.sql: referencetype: Added terms from reference.referencetype closed list in VegBank data dictionary. Cited sources in comment.
6151	11/13/2012 06:39 PM	Aaron Marcuse-Kubitza	schemas/vegbien.sql: reference.referencetype: Changed type to referencetype enum
6150	11/13/2012 06:38 PM	Aaron Marcuse-Kubitza	schemas/vegbien.sql: Added referencetype enum, containing VegBank's values in reference.referencetype as well as values for bien_web.datasource.aggregatorOrPrimary and bien_web.dataSourceNormalized.isHerbarium,isAggregator
6149	11/13/2012 06:23 PM	Aaron Marcuse-Kubitza	specimenreplicate: Made institution_id an fkey to referencename instead of party, to later be matched up with reference entries for each aggregator's subprovider
6148	11/13/2012 06:15 PM	Aaron Marcuse-Kubitza	schemas/vegbien.sql: referencename: Added referencename_unique unique index on name
6147	11/13/2012 06:00 PM	Aaron Marcuse-Kubitza	schemas/vegbien.sql: referencename: Made reference_id optional so it can be populated later when referencenames are scrubbed
6146	11/13/2012 05:58 PM	Aaron Marcuse-Kubitza	schemas/vegbien.sql: referencename: Renamed identifier to name because it is specifically any name for the reference, not necessarily an ID
6145	11/13/2012 05:53 PM	Aaron Marcuse-Kubitza	schemas/vegbien.sql: Renamed referencealtident to referencename to allow any verbatim reference name to go here, with reference containing the corresponding accepted reference name
6144	11/13/2012 05:50 PM	Aaron Marcuse-Kubitza	schemas/vegbien.sql: reference: Added accesslevel, accessconditions from bien_web.datasource
6143	11/13/2012 05:41 PM	Aaron Marcuse-Kubitza	schemas/vegbien.sql: address: Added street2 from bien_web.party.address2
6142	11/13/2012 05:38 PM	Aaron Marcuse-Kubitza	schemas/vegbien.sql: address: Renamed fields to bien_web.party names
6141	11/13/2012 05:12 PM	Aaron Marcuse-Kubitza	schemas/vegbien.sql: party: Added department from bien_web.party
6140	11/13/2012 05:06 PM	Aaron Marcuse-Kubitza	inputs/SALVIAS/plotMetadata_/map.csv: Mapped lookup_MethodCode_Description to new observationMeasure
6139	11/13/2012 05:06 PM	Aaron Marcuse-Kubitza	schemas/vegbien.sql: method: Made name optional when description or observationmeasure is specified
6138	11/13/2012 05:03 PM	Aaron Marcuse-Kubitza	schemas/vegbien.sql: method: method_unique: Include observationmeasure since the method name sometimes is not globally unique (e.g. in SALVIAS)
6137	11/13/2012 04:58 PM	Aaron Marcuse-Kubitza	mappings/VegCore-VegBIEN.csv: Mapped observationMeasure
6136	11/13/2012 04:57 PM	Aaron Marcuse-Kubitza	mappings/VegCore.csv: observationMeasure: Added source to DwC samplingProtocol
6135	11/13/2012 04:54 PM	Aaron Marcuse-Kubitza	mappings/VegCore.csv: Added observationMeasure
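
Note on _map() (revisions 6222, 6229, 6233): the log says that _map() raises data_exception with the message "Value not in map" when a value is not in the map and no default is provided, and that this is not the same as a NULL default value. The actual DB function is not shown in this log, so the following is only a minimal Python sketch of that behaviour. The names map_value, ValueNotInMap and _NO_DEFAULT are hypothetical, and how the real _map() receives its default is an assumption.

```python
# Illustrative sketch only: not the code in schemas/functions.sql.
# All names here (map_value, ValueNotInMap, _NO_DEFAULT) are hypothetical.

_NO_DEFAULT = object()  # sentinel meaning "no default provided"


class ValueNotInMap(ValueError):
    """Stands in for PostgreSQL's data_exception in this sketch."""


def map_value(mapping, value, default=_NO_DEFAULT):
    """Translate value through mapping.

    A default of None (the analogue of a NULL default) is a valid default
    and is returned for unmapped values. Only the complete absence of a
    default makes an unmapped value an error.
    """
    if value in mapping:
        return mapping[value]
    if default is _NO_DEFAULT:
        raise ValueNotInMap("Value not in map: %r" % (value,))
    return default
```

For example, map_value({"0": "public"}, "0") returns "public", map_value({}, "x", default=None) returns None, and map_value({}, "x") raises ValueNotInMap.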
