# Date Author Comment
7418 02/02/2013 11:18 AM Aaron Marcuse-Kubitza

import_all: Moved `waitpid $jobs` into after_import()

7417 02/01/2013 04:57 PM Aaron Marcuse-Kubitza

schemas/vegbien.ERD.mwb: Fixed table sizes

7416 02/01/2013 04:51 PM Aaron Marcuse-Kubitza

schemas/vegbien.ERD.mwb: Regenerated exports

7415 02/01/2013 04:34 PM Aaron Marcuse-Kubitza

schemas/vegbien.sql: removed all accessioncode fields, as VegBIEN does not use them

7414 02/01/2013 03:10 PM Aaron Marcuse-Kubitza

Added inputs/FIA/_src/FIADB_version4.accdb and FIADB_version4.sql (created from it using Access To PostgreSQL and the additional transformations at <https://projects.nceas.ucsb.edu/nceas/projects/bien/wiki/Tools#MS-Access-database-MDB>)

7413 01/31/2013 08:20 PM Aaron Marcuse-Kubitza

Added inputs/FIA/COND_unique/, generated from new FIA data

7412 01/31/2013 08:05 PM Aaron Marcuse-Kubitza

inputs/FIA/FIA_COND_unique/create.sql: Fixed bug where need to remove `CREATE TABLE :table AS` at beginning because that is added by the make target

7411 01/31/2013 08:03 PM Aaron Marcuse-Kubitza

inputs/FIA/geoscrub.~.clean_up.sql: Moved creation of FIA_COND_unique to FIA_COND_unique/create.sql

7410 01/31/2013 07:40 PM Aaron Marcuse-Kubitza

README.TXT: Full database import: Updated time until import_all returns control to the shell to account for the TNRS names now being imported concurrently with the inputs rather than before them

7409 01/31/2013 07:31 PM Aaron Marcuse-Kubitza

mappings/VegCore-VegBIEN.csv: Also include morphospecies in the accepted taxondetermination's taxonverbatim, so that it can easily be retrieved by the analytical DB views

7408 01/31/2013 07:15 PM Aaron Marcuse-Kubitza

schemas/vegbien.sql: analytical_stem_view: scientificNameWithMorphospecies: Fixed bug where need to use the taxonName or scientificName when the name components are not provided, as is the case when there is no scrubbed taxondetermination (because TNRS returns no match)

7407 01/31/2013 06:08 PM Aaron Marcuse-Kubitza

mappings/VegCore.csv: Regenerated from wiki. This adds Brad's DwC ID terms and their definitions in <https://projects.nceas.ucsb.edu/nceas/attachments/download/621/vegbien_identifier_examples.xlsx>.

7406 01/31/2013 05:06 PM Aaron Marcuse-Kubitza

schemas/vegbien.ERD.mwb: Regenerated exports

7405 01/31/2013 04:04 PM Aaron Marcuse-Kubitza

join: Added support for direct mappings to VegBIEN by passing through outputs that start with / (indicating an XPath rather than a term)

7404 01/31/2013 04:01 PM Aaron Marcuse-Kubitza

mappings/VegCore.csv: Regenerated from wiki

7403 01/31/2013 11:38 AM Aaron Marcuse-Kubitza

schemas/vegbien.sql: analytical_stem_view: Added family_matched, taxonName_matched, scientificNameAuthorship_matched

7402 01/31/2013 11:02 AM Aaron Marcuse-Kubitza

schemas/vegbien.sql: analytical_stem_view: Added family_verbatim, scientificName_verbatim, scientificNameAuthorship_verbatim from datasource taxondetermination

7401 01/31/2013 10:57 AM Aaron Marcuse-Kubitza

mappings/VegCore.csv: Regenerated from wiki

7400 01/31/2013 10:30 AM Aaron Marcuse-Kubitza

schemas/vegbien.sql: analytical_stem_view: Fixed bug where need to use identifiedBy and dateIdentified from the datasource taxondetermination rather than the canonical taxondetermination (whichever taxondetermination is most scrubbed)

7399 01/31/2013 10:23 AM Aaron Marcuse-Kubitza

schemas/vegbien.sql: taxondetermination: taxondetermination_set_iscurrent(): is_datasource_current: Fixed bug where need to filter out determinationtypes for matched/accepted determinations, which are not datasource determinations

7398 01/31/2013 10:19 AM Aaron Marcuse-Kubitza

schemas/vegbien.sql: taxondetermination: taxondetermination_set_iscurrent(): Fixed bug where need to also set existing datasource_current taxondetermination's is_datasource_current to false

7397 01/31/2013 08:52 AM Aaron Marcuse-Kubitza

xml_dom.py: replace_with_text(): Added support for all scalar (non-Node) types, which will be stringified using strings.ustr()

7396 01/31/2013 03:52 AM Aaron Marcuse-Kubitza

schemas/functions.sql: Added _fix_date()

7395 01/31/2013 02:49 AM Aaron Marcuse-Kubitza

sql_io.py: put_table(): Documented that much of the complexity of the normalizing algorithm is due to PostgreSQL not having a native command for insert/on duplicate select

7394 01/31/2013 02:24 AM Aaron Marcuse-Kubitza

sql_io.py: put_table(): Corrected "insert/if not exists get" to "insert/on duplicate select"

7393 01/31/2013 01:52 AM Aaron Marcuse-Kubitza

sql_io.py: put_table(): Removed no longer applicable requirement that it be run at the beginning of a transaction, which was only required when the output table was locked during the function call

7392 01/31/2013 01:48 AM Aaron Marcuse-Kubitza

sql_io.py: put_table(): Documented that the function's insert/if not exists get algorithm does not support database triggers that populate fields covered by a unique constraint

7391 01/31/2013 01:42 AM Aaron Marcuse-Kubitza

inputs/FIA/_src/_README.TXT: Documented that FIA does not provide data for some states, e.g. HI

7390 01/30/2013 10:48 PM Aaron Marcuse-Kubitza

config/: Set svn:ignore to exclude *password files

7389 01/30/2013 10:41 PM Aaron Marcuse-Kubitza

Removing config/bien_read_password from version control

7388 01/30/2013 10:30 PM Aaron Marcuse-Kubitza

Removing config/bien_password from version control

7387 01/29/2013 03:26 PM Aaron Marcuse-Kubitza

inputs/FIA/: Added refreshed data (not yet mapped)

7386 01/29/2013 03:15 PM Aaron Marcuse-Kubitza

input.Makefile: Existing maps discovery: $(exts): Also match uppercase versions of extensions

7385 01/29/2013 03:12 PM Aaron Marcuse-Kubitza

lib/common.Makefile: Added $(ucase) and $(ci)

7384 01/29/2013 01:56 PM Aaron Marcuse-Kubitza

inputs/FIA/_src/Makefile: Table bundling: $(tableCsvs): Fixed bug where need to replace % with $* in $(csvPattern)

7383 01/29/2013 01:15 PM Aaron Marcuse-Kubitza

inputs/FIA/_src/Makefile: Table bundling: Fixed bug where need to remove trailing slashes from dirs that will match a target pattern

7382 01/29/2013 01:09 PM Aaron Marcuse-Kubitza

inputs/FIA/_src/Makefile: Added Table bundling targets to regroup CSVs by tables

7381 01/29/2013 01:09 PM Aaron Marcuse-Kubitza

lib/common.Makefile: Added $(mkdir)

7380 01/29/2013 11:02 AM Aaron Marcuse-Kubitza

Added inputs/FIA/_src/_README.TXT with Bob's comments

7379 01/29/2013 11:02 AM Aaron Marcuse-Kubitza

input.Makefile: SVN: $(_svnFilesGlob): Added README.TXT

7378 01/29/2013 10:33 AM Aaron Marcuse-Kubitza

mappings/VegCore.csv: Regenerated from wiki. Synonym lists have now been translated to sections to create a web page anchor for each synonym, using the steps at <https://projects.nceas.ucsb.edu/nceas/projects/bien/wiki/VegCore_refactoring#Index-synonyms-as-web-page-anchors>. This enables searching for VegCore synonyms in the data dictionary as well as terms, and makes it possible to swap a term and a synonym while still keeping both as indexed anchors.

7377 01/29/2013 06:19 AM Aaron Marcuse-Kubitza

mappings/VegCore.csv: Regenerated from wiki. All uncategorized terms have now been moved to tables.

7376 01/29/2013 06:19 AM Aaron Marcuse-Kubitza

README.TXT: Maintenance: VegCore data dictionary: Added steps to check that no terms were lost when moving terms

7375 01/28/2013 05:13 PM Aaron Marcuse-Kubitza

inputs/import.stats.xls: Updated import times

7374 01/28/2013 05:12 PM Aaron Marcuse-Kubitza

mappings/VegCore.csv: Regenerated from wiki

7373 01/25/2013 03:54 PM Aaron Marcuse-Kubitza

schemas/vegbien.sql: analytical_stem_view: coordinates: Only use county_centroids coordinates when datasource coordinates are not provided, not also when datasource coordinates aren't geovalid. This also fixes a bug where (NULL) county_centroids coordinates were used for non-geovalid coordinates even when there was no county_centroids match, rather than including the non-geovalid coordinates.

7372 01/25/2013 03:34 PM Aaron Marcuse-Kubitza

mappings/VegCore.csv: Regenerated from wiki

7371 01/25/2013 11:27 AM Aaron Marcuse-Kubitza

schemas/vegbien.sql: taxondetermination: Added is_datasource_current, which is autopopulated to the most recent datasource-provided taxondetermination

7370 01/25/2013 11:07 AM Aaron Marcuse-Kubitza

schemas/vegbien.sql: taxondetermination: Added taxondetermination_single_accepted_determination unique index to facilitate joining on the accepted determination

7369 01/25/2013 11:05 AM Aaron Marcuse-Kubitza

schemas/vegbien.sql: taxondetermination: Added taxondetermination_single_matched_determination unique index to facilitate joining on the matched determination

7368 01/25/2013 10:32 AM Aaron Marcuse-Kubitza

schemas/vegbien.sql: taxondetermination: Removed notespublic, notesmgt, which are not used by VegBIEN

7367 01/25/2013 09:30 AM Aaron Marcuse-Kubitza

schemas/vegbien.sql: taxon_trait_view: scientificName: Use taxonverbatim.taxonname when taxonlabel/taxonverbatim.taxonomicname are not provided, to accommodate TNRS names. This is part of the workaround for the bug where the taxonlabel's taxonomicname (concatenated taxonomicname) is occasionally not populated.

7366 01/25/2013 09:10 AM Aaron Marcuse-Kubitza

schemas/vegbien.sql: taxon_trait_view: Added workaround for bug where the taxonlabel's taxonomicname (concatenated taxonomicname) is occasionally not populated due to a taxonlabel constraint violation, by using the taxonverbatim's taxonomicname instead in these cases. This bug, which appeared in the r7317 import, is so far not reproducible (tested on Mac OS X), so its cause is unknown, but may be caused by a bug in functions._merge_prefix(), which is run on the taxonlabel's taxonomicname but not the taxonverbatim's taxonomicname.

7365 01/24/2013 09:51 PM Aaron Marcuse-Kubitza

schemas/vegbien.sql: analytical_stem_view: Added dateIdentified, identificationRemarks per Brad's request (https://projects.nceas.ucsb.edu/nceas/projects/bien/wiki/Spot-checking#E-mail-on-2013-1-16)

7364 01/24/2013 09:40 PM Aaron Marcuse-Kubitza

inputs/FIA/_src/Makefile: Added extraction targets to extract zip archives

7363 01/24/2013 09:07 PM Aaron Marcuse-Kubitza

inputs/FIA/_src/download: Use new Makefile, which uses make logic to determine if a file needs to be downloaded

7362 01/24/2013 09:05 PM Aaron Marcuse-Kubitza

Added inputs/FIA/_src/Makefile, with targets to download each zip archive

7361 01/24/2013 08:00 PM Aaron Marcuse-Kubitza

schemas/vegbien.sql: analytical_stem_view: derived terms: Added _bien suffix per Brad's request (https://projects.nceas.ucsb.edu/nceas/projects/bien/wiki/Spot-checking#Brad-Boyles-comments)

7360 01/24/2013 03:22 PM Aaron Marcuse-Kubitza

Added inputs/FIA/_src/FIADB_version4.accdb.url

7359 01/24/2013 03:18 PM Aaron Marcuse-Kubitza

inputs/FIA/_src/download: Only run wget on files that don't yet exist

7358 01/24/2013 03:16 PM Aaron Marcuse-Kubitza

inputs/FIA/_src/download: Run wget in same directory as script to ensure files get downloaded there

7357 01/24/2013 03:06 PM Aaron Marcuse-Kubitza

inputs/FIA/_src/download: Set svn:executable

7356 01/24/2013 03:04 PM Aaron Marcuse-Kubitza

Added inputs/FIA/_src/download to download archives of CSVs for each state

7355 01/24/2013 03:03 PM Aaron Marcuse-Kubitza

to_do/timeline.2013.xls: Updated with changes during conference call

7354 01/24/2013 09:46 AM Aaron Marcuse-Kubitza

schemas/vegbien.sql: taxon_trait_view: Renamed datasource_taxonverbatim to taxonverbatim because there is now only one taxonverbatim

7353 01/24/2013 09:31 AM Aaron Marcuse-Kubitza

schemas/vegbien.sql: taxon_trait_view: Moved the taxondetermination.iscurrent filter to the join condition to allow using the taxondetermination_single_current_determination index

7352 01/24/2013 09:24 AM Aaron Marcuse-Kubitza

schemas/vegbien.sql: taxon_trait_view: Join only on the primary taxonlabel, not the accepted taxonlabel, because the scrubbed name is now available directly via the taxonlabel attached to the scrubbed taxondetermination

7351 01/24/2013 09:11 AM Aaron Marcuse-Kubitza

schemas/vegbien.sql: analytical_stem_view: Added locality

7350 01/24/2013 08:18 AM Aaron Marcuse-Kubitza

inputs/UNCC/Specimen/map.csv: accession: Remapped to catalogNumber per Bob's corrections

7349 01/23/2013 10:31 PM Aaron Marcuse-Kubitza

schemas/vegbien.ERD.mwb: Regenerated exports

7348 01/23/2013 10:25 PM Aaron Marcuse-Kubitza

mappings/VegCore.csv: Regenerated from wiki

7347 01/23/2013 10:01 PM Aaron Marcuse-Kubitza

README.TXT: Schema changes: Added instructions to run the appropriate sync function when changing the analytical views

7346 01/23/2013 09:56 PM Aaron Marcuse-Kubitza

schemas/vegbien.sql: analytical_stem_view: Added georeferenceProtocol, which is set to 'county centroid' when county centroid coordinates are used

7345 01/23/2013 08:12 PM Aaron Marcuse-Kubitza

make_analytical_db: Don't run export_analytical_db if the SQL script exits with an error

7344 01/23/2013 08:04 PM Aaron Marcuse-Kubitza

README.TXT: Full database import: record the import times in inputs/import.stats.xls: Added `export version=<version>` because import_times may be run in a shell different from the one that the import was run in

7343 01/23/2013 08:03 PM Aaron Marcuse-Kubitza

inputs/import.stats.xls: Updated import times

7342 01/22/2013 07:43 PM Aaron Marcuse-Kubitza

schemas/vegbien.sql: taxonverbatim: taxonverbatim_unique: Added morphoname for cases when there is just a morphoname, and to distinguish taxonverbatims with the same taxonlabel but different morphonames

7341 01/22/2013 07:43 PM Aaron Marcuse-Kubitza

schemas/vegbien.sql: stemobservation: Added stemobservation_non_empty CHECK constraint to prevent creating an empty stemobservation for plantobservation rows without stem data but with stem mappings

7340 01/22/2013 07:36 PM Aaron Marcuse-Kubitza

schemas/vegbien.sql: stemobservation: Added stemobservation_non_empty CHECK constraint to prevent creating an empty stemobservation for plantobservation rows without stem data but with stem mappings

7339 01/22/2013 07:34 PM Aaron Marcuse-Kubitza

schemas/vegbien.sql: stemobservation: Added stemobservation_non_empty CHECK constraint to prevent creating an empty stemobservation for plantobservation rows without stem data but with stem mappings

7338 01/22/2013 07:16 PM Aaron Marcuse-Kubitza

schemas/vegbien.sql: taxonverbatim: taxonverbatim_unique: Added morphoname for cases when there is just a morphoname, and to distinguish taxonverbatims with the same taxonlabel but different morphonames

7337 01/22/2013 07:11 PM Aaron Marcuse-Kubitza

schemas/vegbien.sql: taxonverbatim: Allow taxonlabel_id to be NULL when morphoname is provided

7336 01/22/2013 07:09 PM Aaron Marcuse-Kubitza

schemas/vegbien.sql: taxonverbatim: Allow taxonlabel_id to be NULL when morphoname is provided

7335 01/22/2013 07:04 PM Aaron Marcuse-Kubitza

schemas/vegbien.sql: taxonverbatim: Added source_id to allow creating taxonverbatims without a (scoping) taxonlabel

7334 01/22/2013 05:34 PM Aaron Marcuse-Kubitza

schemas/vegbien.sql: analytical_stem_view: Removed speciesBinomialWithMorphospecies now that it's duplicated by scientificNameWithMorphospecies

7333 01/22/2013 05:28 PM Aaron Marcuse-Kubitza

schemas/vegbien.sql: analytical_stem_view: scientificNameWithMorphospecies: Create it using the speciesBinomialWithMorphospecies formula, per Brad's request at <https://projects.nceas.ucsb.edu/nceas/projects/bien/wiki/Spot-checking#2013-1-18>

7332 01/22/2013 05:05 PM Aaron Marcuse-Kubitza

schemas/vegbien.sql: analytical_stem_view: Added coordinateSource to indicate whether coordinates are from county_centroids (georeferencing) or the source data

7331 01/22/2013 05:00 PM Aaron Marcuse-Kubitza

schemas/vegbien.sql: Added coordinatesource enum

7330 01/22/2013 04:50 PM Aaron Marcuse-Kubitza

mappings/VegCore.csv: Regenerated from wiki

7329 01/22/2013 04:34 PM Aaron Marcuse-Kubitza

schemas/vegbien.sql: analytical_stem_view: coordinates: Also use the county_centroids coordinates when the datasource coordinates are not geovalid. (Note that canon_place.geovalid will be NULL, i.e. not true, when the datasource coordinates are NULL.)

7328 01/22/2013 04:28 PM Aaron Marcuse-Kubitza

schemas/vegbien.sql: scientificName: Set to taxonverbatim.taxonname instead per Brad's changes at <https://projects.nceas.ucsb.edu/nceas/projects/bien/wiki/Spot-checking#2013-1-18>. Renamed to taxonName since this now doesn't include the author, which is part of DwC's scientificName field.

7327 01/22/2013 03:55 PM Aaron Marcuse-Kubitza

schemas/vegbien.sql: sync_analytical_stem_to_view(): Support running the function when dependent views do not exist. This allows using the sync function when changing column names of the analytical_stem_view, which sometimes requires manually dropping and re-creating the analytical_aggregate_view.

7326 01/22/2013 02:49 PM Aaron Marcuse-Kubitza

backups/Makefile: %.md5/test: Added comment to run with `make -s` to avoid echoing make commands

7325 01/22/2013 02:42 PM Aaron Marcuse-Kubitza

README.TXT: Full database import: Added steps to scrub unscrubbed taxondeterminations (if they are not scrubbed automatically)

7324 01/22/2013 02:06 PM Aaron Marcuse-Kubitza

inputs/.geoscrub/_src/README.TXT: Added e-mails from Jim about how the county_centroids data was generated

7323 01/22/2013 01:18 PM Aaron Marcuse-Kubitza

schemas/vegbien.sql: analytical_stem_view: coordinates: Use new county_centroids coordinates and uncertainty when the datasource's coordinates are not available

7322 01/22/2013 01:10 PM Aaron Marcuse-Kubitza

Added inputs/.geoscrub/county_centroids/ from Jim

7321 01/22/2013 01:09 PM Aaron Marcuse-Kubitza

inputs/.geoscrub/import_order.txt: Added geoscrub_output

7320 01/22/2013 12:24 PM Aaron Marcuse-Kubitza

inputs/import.stats.xls: Updated import times

7319 01/22/2013 12:19 PM Aaron Marcuse-Kubitza

README.TXT: Full database import: In PostgreSQL: Added step to check that there are TNRS taxondeterminations
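
Note on r7394/r7395: the "insert/on duplicate select" operation those messages refer to can be illustrated with a minimal sketch. This is not the project's put_table() code, which the log describes as considerably more complex. The table name `example_taxon` and the helper `insert_or_select()` are hypothetical, and SQLite (from the Python standard library) is used here only so that the example is self-contained; the project itself targets PostgreSQL.

```python
import sqlite3


def insert_or_select(conn, name):
    """Insert a row for `name`; if it already exists, return the existing id.

    Hypothetical helper illustrating "insert/on duplicate select".
    It relies on a unique constraint on the `name` column.
    """
    try:
        # `with conn` commits on success and rolls back on an exception.
        with conn:
            cur = conn.execute(
                "INSERT INTO example_taxon (name) VALUES (?)", (name,)
            )
        return cur.lastrowid
    except sqlite3.IntegrityError:
        # Duplicate key: select the row that is already there instead.
        row = conn.execute(
            "SELECT id FROM example_taxon WHERE name = ?", (name,)
        ).fetchone()
        return row[0]


conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE example_taxon ("
    "id INTEGER PRIMARY KEY, name TEXT NOT NULL UNIQUE)"
)

first = insert_or_select(conn, "Quercus alba")
again = insert_or_select(conn, "Quercus alba")  # duplicate: same id returned
other = insert_or_select(conn, "Acer rubrum")   # new row: different id
```

The sketch also suggests why r7392 warns about triggers: the fallback SELECT looks up the row by the values that were passed in, so if a trigger rewrote a field covered by the unique constraint, that lookup would presumably no longer find the conflicting row.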