Project

General

Profile

Statistics
| Revision:

# Date Author Comment
6435 11/24/2012 01:48 PM Aaron Marcuse-Kubitza

schemas/vegbien.sql: analytical_stem_view: cultivated: Also set if family/country combination found in cultivated_family_locations

6434 11/24/2012 01:39 PM Aaron Marcuse-Kubitza

schemas/vegbien.sql: cultivated_family_locations: Added data from nimoy:/home/boyle/bien2/geoscrub/cultivated/cult_by_taxon/flag_by_taxa.inc

6433 11/24/2012 01:33 PM Aaron Marcuse-Kubitza

schemas/vegbien.sql: Added cultivated_family_locations to store locations where various taxon families are considered cultivated

6432 11/24/2012 01:24 PM Aaron Marcuse-Kubitza

mappings/VegCore-VegBIEN.csv: Mapped locality description fields to location.iscultivated using _locationnarrative_is_cultivated()

6431 11/24/2012 01:23 PM Aaron Marcuse-Kubitza

xml_func.py: Simplifying functions: Added passthru entries for _and, _or

6430 11/24/2012 01:06 PM Aaron Marcuse-Kubitza

schemas/vegbien.sql: Added _locationnarrative_is_cultivated()

6429 11/24/2012 12:57 PM Aaron Marcuse-Kubitza

lib/PostgreSQL-MySQL.csv: Change text to varchar(255) because text columns can't be used in indexes in MySQL

6428 11/24/2012 12:51 PM Aaron Marcuse-Kubitza

lib/PostgreSQL-MySQL.csv: Resaved in Excel, which removed unnecessary quotes around fields

6427 11/24/2012 12:22 PM Aaron Marcuse-Kubitza

schemas/vegbien.sql: analytical_aggregate: Added identifiedBy, which is no longer a scoping field (which would prevent scientificNameWithMorphospecies from being unique) now that there is only one taxondetermination for each taxonoccurrence

6426 11/24/2012 12:05 PM Aaron Marcuse-Kubitza

schemas/vegbien.sql: analytical_stem_view: dateCollected: For plots data, use the locationevent obsstartdate instead of the collectiondate in order to group taxonoccurrences/stems from the same locationevent together

6425 11/24/2012 11:59 AM Aaron Marcuse-Kubitza

schemas/vegbien.sql: analytical_* pkeys: Added dateCollected because the records are actually unique within the location*event*, not the location

6424 11/24/2012 11:57 AM Aaron Marcuse-Kubitza

schemas/vegbien.sql: analytical_stem_view: Exclude records with no collectiondate or obsstartdate, which is required to uniquely identify a record

6423 11/24/2012 11:54 AM Aaron Marcuse-Kubitza

analytical_stem_view: dateCollected: Use locationevent.obsstartdate when aggregateoccurrence.collectiondate is not provided

6422 11/24/2012 11:37 AM Aaron Marcuse-Kubitza

schemas/vegbien.sql: analytical_stem_view: Include only the current taxondetermination for each taxonoccurrence, to avoid cross-joining taxondeterminations with stems and thus multiplying the number of rows for datasources that have multiple taxondeterminations per taxonoccurrence

6421 11/24/2012 11:33 AM Aaron Marcuse-Kubitza

schemas/vegbien.sql: taxondetermination: Added AFTER trigger to set the current taxondetermination for the taxonoccurrence

6420 11/24/2012 11:11 AM Aaron Marcuse-Kubitza

lib/PostgreSQL-MySQL.csv: Statements ending in ";": When matching any character, use .*? (with the (?s) flag) instead of [^;]* in order to allow embedded ; to be matched. This fixes a bug where a CREATE VIEW statement was not removed because it contained an embedded ; .

6419 11/24/2012 11:06 AM Aaron Marcuse-Kubitza

schemas/vegbien.sql: taxondetermination: Added unique index to ensure that there is only one current determination for each taxonoccurrence

6418 11/24/2012 11:05 AM Aaron Marcuse-Kubitza

lib/PostgreSQL-MySQL.csv: Remove indexes with WHERE clauses

6417 11/24/2012 10:34 AM Aaron Marcuse-Kubitza

schemas/vegbien.sql: analytical_aggregate: Added primary key on institutionCode, plotName, scientificNameWithMorphospecies, recordNumber. Note that this makes these fields NOT NULL, which should not be a problem because there are inner joins instead of LEFT JOINs on most of the tables which provide them, and LEFT JOINed tables have their identifying fields combined to create a NOT NULL value.

6416 11/24/2012 10:27 AM Aaron Marcuse-Kubitza

schemas/vegbien.sql: analytical_stem_view: recordNumber: Combine identifying fields in taxonoccurrence, plantobservation, and stemobservation to ensure that this field is unique within the plot and not NULL

6415 11/24/2012 10:23 AM Aaron Marcuse-Kubitza

lib/PostgreSQL-MySQL.csv: Only match a statement-terminating ; when it's at the end of a line

6414 11/24/2012 10:02 AM Aaron Marcuse-Kubitza

schemas/vegbien.sql: analytical_aggregate: Added primary key on institutionCode, plotName, scientificNameWithMorphospecies. Note that this makes these fields NOT NULL, which should not be a problem because there are inner joins instead of LEFT JOINs on the tables which provide them.

6413 11/24/2012 09:21 AM Aaron Marcuse-Kubitza

db_xml.py: put(): _setDefault(): Delay the evaluation of each col_default's value until the col_default is actually retrieved. This fixes a bug in the source table mappings where the explicit source entry was being created after the col_default source entry, causing the initial entry, which did not have the additional fields populated, to be used instead.

6412 11/24/2012 09:14 AM Aaron Marcuse-Kubitza

dicts.py: Added WrapDict, a dict that runs a function on each value retrieved

6411 11/24/2012 08:59 AM Aaron Marcuse-Kubitza

db_xml.py: put(): _setDefault(): Fixed bug where need to copy col_defaults before calling update() on it, to avoid modifying the input value (which may be reused by the caller, expecting it to be unmodified)

6410 11/24/2012 08:54 AM Aaron Marcuse-Kubitza

db_xml.py: put(): col_defaults param: Fixed bug where need to use None as default value, because col_defaults will be modified by put() and the {} default value is a global instance

6409 11/24/2012 08:29 AM Aaron Marcuse-Kubitza

mappings/VegCore-VegBIEN.csv: source table mappings: Set shortname to env var $source when it's not explicitly specified, because shortname is a required field of source

6408 11/24/2012 08:16 AM Aaron Marcuse-Kubitza

db_xml.py: put(): Pass through the values of nodes which are text nodes

6407 11/24/2012 08:15 AM Aaron Marcuse-Kubitza

db_xml.py: put(): put_(): Support setDefault() values which are text nodes, by passing text strings through when put() is run on all col_defaults entries

6406 11/24/2012 07:50 AM Aaron Marcuse-Kubitza

db_xml.py: put(): _setDefault(): Support setting multiple col_defaults at once by using the param names themselves as the column names

6405 11/24/2012 07:47 AM Aaron Marcuse-Kubitza

dicts.py: DictProxy: Implemented delitem()

6404 11/24/2012 07:32 AM Aaron Marcuse-Kubitza

bin/map: update_in_label(): Removed hardcoded source_id col_default, which is now set in mappings/VegCore-VegBIEN.csv's output root

6403 11/24/2012 07:29 AM Aaron Marcuse-Kubitza

mappings/VegCore-VegBIEN.csv: Set the source_id col_default to the datasource name using the new _setDefault() built-in function and _env()

6402 11/24/2012 07:25 AM Aaron Marcuse-Kubitza

db_xml.py: put(): Added _setDefault() built-in function, which adds an entry to col_defaults

6401 11/24/2012 07:23 AM Aaron Marcuse-Kubitza

xml_func.py: _env(): Fixed bug where need to retrieve actual string value of name param using xml_dom.NodeTextEntryIter instead of NodeEntryIter

6400 11/24/2012 07:20 AM Aaron Marcuse-Kubitza

xml_func.py: _env(): Fixed bug where need to use xml_dom.replace_with_text() instead of xml_dom.replace() because replace() requires a DOM node

6399 11/24/2012 06:44 AM Aaron Marcuse-Kubitza

bin/map: update_in_label(): Set $source env var to the in_label (datasource name), to make it available to _env()

6398 11/24/2012 06:43 AM Aaron Marcuse-Kubitza

xml_func.py: Simplifying functions: Added _env()

6397 11/24/2012 06:05 AM Aaron Marcuse-Kubitza

Added inputs/VegBank/Source/, containing referenceType metadata

6396 11/24/2012 06:00 AM Aaron Marcuse-Kubitza

Added inputs/SpeciesLink/Source/, containing referenceType metadata

6395 11/24/2012 05:55 AM Aaron Marcuse-Kubitza

Added inputs/SALVIAS*/Source/, containing referenceType metadata

6394 11/24/2012 05:47 AM Aaron Marcuse-Kubitza

Added inputs/REMIB/Source/, containing referenceType metadata

6393 11/24/2012 05:41 AM Aaron Marcuse-Kubitza

Added inputs/GBIF/Source/, containing referenceType metadata

6392 11/24/2012 05:34 AM Aaron Marcuse-Kubitza

Added inputs/TEAM/Source/, containing referenceType metadata

6391 11/24/2012 05:33 AM Aaron Marcuse-Kubitza

Placed inputs/TEAM/_src/Vegetation-Tree-and-Liana-Metadata-1.5.pdf under version control

6390 11/24/2012 05:27 AM Aaron Marcuse-Kubitza

inputs/FIA/import_order.txt: Added Source, which needs to come before Organism

6389 11/24/2012 05:22 AM Aaron Marcuse-Kubitza

Added inputs/Madidi/Source/, containing referenceType metadata

6388 11/24/2012 05:19 AM Aaron Marcuse-Kubitza

Added inputs/FIA/Source/, containing referenceType metadata

6387 11/24/2012 05:14 AM Aaron Marcuse-Kubitza

Added inputs/CVS/Source/, containing referenceType metadata

6386 11/24/2012 05:07 AM Aaron Marcuse-Kubitza

Added inputs/CTFS/Source/, containing referenceType metadata

6385 11/24/2012 05:05 AM Aaron Marcuse-Kubitza

bin/map: Support map spreadsheets containing only metadata mappings (with no corresponding staging table), by falling back to an empty table when the named table does not exist

6384 11/24/2012 04:19 AM Aaron Marcuse-Kubitza

mappings/VegCore-VegBIEN.csv: institutionCode: Also map to the sourcename's matched source, which identifies whether the source is a herbarium

6383 11/24/2012 04:08 AM Aaron Marcuse-Kubitza

schemas/vegbien.sql: source: Made shortname NOT NULL to ensure that all datasources have a globally-unique short name

6382 11/24/2012 03:33 AM Aaron Marcuse-Kubitza

import_all: Added import of inputs/.herbaria/ before the main import

6381 11/24/2012 03:28 AM Aaron Marcuse-Kubitza

Added inputs/.herbaria/

6380 11/24/2012 03:25 AM Aaron Marcuse-Kubitza

input.Makefile: SVN: add: Also run %/add on all data subdirs

6379 11/24/2012 03:21 AM Aaron Marcuse-Kubitza

input.Makefile: Existing maps discovery: Moved tables discovery to its own section, above SVN so it can be used by SVN

6378 11/24/2012 03:11 AM Aaron Marcuse-Kubitza

mappings/VegCore.csv: referenceType: Fixed sort order

6377 11/24/2012 03:09 AM Aaron Marcuse-Kubitza

mappings/VegCore-VegBIEN.csv: Mapped referenceType

6376 11/24/2012 03:06 AM Aaron Marcuse-Kubitza

mappings/VegCore.csv: Added referenceType

6375 11/24/2012 02:10 AM Aaron Marcuse-Kubitza

mappings/VegCore-VegBIEN.csv: institutionCode: Remap to source.shortname when specimen information is not provided, as is the case for geoscrub.herbaria on nimoy

6374 11/24/2012 01:47 AM Aaron Marcuse-Kubitza

inputs/bien_web/observation/map.csv: Mapped observationID->occurrenceID

6373 11/24/2012 01:20 AM Aaron Marcuse-Kubitza

README.TXT: Datasource setup: Add input data for each table present in the datasource: Added step to run `make inputs/<datasrc>/<table>/install` if the table is in a .sql export

6372 11/24/2012 01:17 AM Aaron Marcuse-Kubitza

README.TXT: Datasource setup: MySQL inputs: Added step to install the export, which needs to happen before mapping individual tables

6371 11/24/2012 01:13 AM Aaron Marcuse-Kubitza

README.TXT: Datasource setup: Add input data for each table present in the datasource: Replaced "CSV" with "CSV" because there can be multiple CSV part files for one table

6370 11/24/2012 01:11 AM Aaron Marcuse-Kubitza

README.TXT: Datasource setup: Add input data for each table present in the datasource: Don't add a CSV or create.sql file for tables that are in a .sql export

6369 11/24/2012 01:06 AM Aaron Marcuse-Kubitza

README.TXT: Schema changes: Sync ERD with vegbien.sql schema: Changed instructions to just select tables with arrows next to them rather than all tables, because each table that's updated will have its lines reset and the number of lines that need to be fixed should be minimized

6368 11/24/2012 01:02 AM Aaron Marcuse-Kubitza

README.TXT: Datasource setup: Accept the test cases: `make inputs/<datasrc>/test by_col=1`: Clarified that errors could indicate bugs in the VegBIEN unique constraints

6367 11/24/2012 12:59 AM Aaron Marcuse-Kubitza

README.TXT: Data import: To remake analytical DB: Added explicit public schema setting since the analytical DB is often manually remade after the public schema has been renamed. Removed warnings that certain commands must be run after running make_analytical_db, because the "remake analytical DB" instructions no longer require this.

6366 11/24/2012 12:48 AM Aaron Marcuse-Kubitza

README.TXT: Datasource setup: MySQL inputs: Added steps to export the database to a PostgreSQL-compatible .sql file, which can be directly used by the install process without the need to export each table as CSV

6365 11/24/2012 12:36 AM Aaron Marcuse-Kubitza

README.TXT: Datasource setup: Choosing a table name: Documented that for .sql exports, you must use the name of the table in the DB export, not a suggested or custom name

6364 11/24/2012 12:34 AM Aaron Marcuse-Kubitza

input.Makefile: Staging tables installation: $(dbExports): Also include the files that would be generated by running _MySQL/*.make and creating the corresponding PostgreSQL translations

6363 11/24/2012 12:18 AM Aaron Marcuse-Kubitza

input.Makefile: Staging tables installation: Moved .sql export downloading and translation to separate Input data retrieval section

6362 11/23/2012 11:41 PM Aaron Marcuse-Kubitza

Added lib/MySQL.{data,schema}.sql.make templates to use in datasources' _MySQL/ dirs

6361 11/23/2012 10:38 PM Aaron Marcuse-Kubitza

inputs/import.stats.xls: Updated import times

6360 11/21/2012 11:13 PM Aaron Marcuse-Kubitza

schemas/vegbien.sql: analytical_stem_view: scientificNameWithMorphospecies: Changed to use Brad's formula, which concatenates genus and specific_epithet/morphospecies, and uses family if just the family is present, rather than using the full taxonomic name

6359 11/21/2012 11:05 PM Aaron Marcuse-Kubitza

mappings/VegCore-VegBIEN.csv: Concatenated taxonlabel: Don't prepend family if the taxonName/scientificName itself is the family, so that the family is not duplicated in the concatenated taxonomic name

6358 11/21/2012 10:19 PM Aaron Marcuse-Kubitza

schemas/functions.sql: _nullIf(): Removed NOT NULL constraint on null param, to support use a (nullable) column rather than a literal as the null-equivalent value

6357 11/21/2012 09:08 PM Aaron Marcuse-Kubitza

xml_func.py: Simplifying functions: Added _nullIf(), to remove calls with no null value

6356 11/21/2012 09:00 PM Aaron Marcuse-Kubitza

xml_dom.py: Added prune_parent()

6355 11/21/2012 08:51 PM Aaron Marcuse-Kubitza

schemas/functions.sql: Added _or()

6354 11/21/2012 08:20 PM Aaron Marcuse-Kubitza

schemas/functions.sql: Added _merge_words()

6353 11/21/2012 08:04 PM Aaron Marcuse-Kubitza

schemas/vegbien.sql: analytical_*: Renamed geosourceValid to geovalid. (It had gotten renamed in the reference -> source rename.)

6352 11/21/2012 08:00 PM Aaron Marcuse-Kubitza

mappings/VegCore.csv: Renamed georeferenceValid to geovalid

6351 11/21/2012 07:48 PM Aaron Marcuse-Kubitza

inputs/import.stats.xls: Updated import times. This now includes the Canadensys plants-related datasources HIBG, JBM, QFA, TRT, TRTE, UBC, VASCAN, and WIN.

6350 11/20/2012 09:59 PM Aaron Marcuse-Kubitza

inputs/import.stats.xls: Updated import times

6349 11/20/2012 09:42 PM Aaron Marcuse-Kubitza

Added inputs/HIBG/

6348 11/20/2012 09:33 PM Aaron Marcuse-Kubitza

Added inputs/JBM/

6347 11/20/2012 09:29 PM Aaron Marcuse-Kubitza

Added inputs/VASCAN/

6346 11/20/2012 09:22 PM Aaron Marcuse-Kubitza

Added inputs/WIN/

6345 11/20/2012 09:18 PM Aaron Marcuse-Kubitza

Added inputs/UBC/

6344 11/20/2012 09:14 PM Aaron Marcuse-Kubitza

Added inputs/TRTE/Specimen/

6343 11/20/2012 09:11 PM Aaron Marcuse-Kubitza

Added inputs/QFA/

6342 11/20/2012 09:06 PM Aaron Marcuse-Kubitza

Added inputs/TRT/

6341 11/20/2012 08:21 PM Aaron Marcuse-Kubitza

schemas/vegbien.sql: Allow bien_read to SELECT from all tables in the public schema

6340 11/20/2012 08:10 PM Aaron Marcuse-Kubitza

schemas/vegbien.sql: Allow bien_read to SELECT from analytical_aggregate, analytical_stem

6339 11/20/2012 08:09 PM Aaron Marcuse-Kubitza

lib/PostgreSQL-MySQL.csv: Removed GRANT/REVOKE because SCHEMA GRANTs are not supported in MySQL

6338 11/20/2012 07:57 PM Aaron Marcuse-Kubitza

pg_dump_vegbien: non-$owners mode: Removed --no-privileges in order to include GRANTs to other users

6337 11/20/2012 07:49 PM Aaron Marcuse-Kubitza

root Makefile: PostgreSQL: $(postgresReload-Linux): Making schemas/*.conf world-readable: Fixed bug where need to do this as the bien user, which owns the files

6336 11/20/2012 07:46 PM Aaron Marcuse-Kubitza

root Makefile: PostgreSQL: $(postgresReload-*): Make schemas/*.conf world-readable so it's readable by the postgres user, which the .conf installation is run as