Project

General

Profile

Statistics
| Revision:

# Date Author Comment
6673 12/06/2012 11:18 PM Aaron Marcuse-Kubitza

dict2redmine: Generate an outline instead of a table so each term will be indexed in the page's table of contents

6672 12/06/2012 11:13 PM Aaron Marcuse-Kubitza

schemas/vegbien.sql: coordinates: coordinates_unique: Removed md5() around verbatimcoordinates because functions within unique indexes (other than the standard COALESCE) are not yet supported by the import algorithm

6671 12/06/2012 11:10 PM Aaron Marcuse-Kubitza

exc.py: e_msg(): Emit a warning instead of an AssertionError if e.args0 isn't a string, to assist in debugging malformed exceptions

6670 12/06/2012 11:02 PM Aaron Marcuse-Kubitza

mappings/VegCore.csv: sampleType: Re-sourced to bien_web.observationType

6669 12/06/2012 10:34 PM Aaron Marcuse-Kubitza

schemas/vegbien.sql: analytical_stem_view: scientificNameWithMorphospecies: Fixed bug where need to use the taxonomicname in accepted_taxonlabel instead of accepted_taxonverbatim, because taxonverbatim only contains fields provided by the data provider (in this case, TNRS), but TNRS does not provide the taxonomic name (taxon name+author), only the taxon name and author components separately

6668 12/06/2012 10:09 PM Aaron Marcuse-Kubitza

schemas/vegbien.sql: coordinates: coordinates_unique: Use md5() on verbatimcoordinates so that it doesn't cause the index row size to be exceeded. This should fix a bug in the HIBG import where long verbatimcoordinates values were causing the error 'OperationalError: index row size 2784 exceeds maximum 2712 for index "coordinates_unique"'.

6667 12/06/2012 09:56 PM Aaron Marcuse-Kubitza

backups/Makefile: Synchronization: Replaced download target, which downloads all backups, with %/download, which downloads just a specific backup, because you would generally only want to extract a single backup from the archive for reinstallation

6666 12/06/2012 09:47 PM Aaron Marcuse-Kubitza

backups/Makefile: Synchronization: Sync with jupiter instead of vegbiendev. This requires running `make backups/upload` on vegbiendev to archive the files, instead of `make backups/download` to download them to your local machine.

6665 12/06/2012 08:58 PM Aaron Marcuse-Kubitza

inputs/.geoscrub/geoscrub_output/map.csv: Removed no longer accurate comment that county is not yet used by VegBIEN

6664 12/06/2012 08:56 PM Aaron Marcuse-Kubitza

inputs/.geoscrub/geoscrub_output/map.csv: *validity: Remapped 2 ("Point is <=5km from putative GADM polygon, but still outside it") to true instead of false, because 5km is close enough to the polygon that the mismatch could result from shapefile simplifying, boundary changes, or other factors that don't affect geovalidity

6663 12/06/2012 08:52 PM Aaron Marcuse-Kubitza

inputs/.geoscrub/geoscrub_output/map.csv: *validity: Remapped 0 ("Complete name provided, but couldn't be scrubbed to GADM") to NULL instead of false, because the absence of a name match does not mean the coordinates are invalid

6662 12/06/2012 08:51 PM Aaron Marcuse-Kubitza

inputs/.{NCBI,TNRS}/import_order.txt: Added Source

6661 12/06/2012 08:50 PM Aaron Marcuse-Kubitza

input.Makefile: SVN: add: Add a Source table to store datasource metadata. This adds a Source table to all herbaria which are listed in .herbaria, and therefore didn't previously need a Source table to indicate their referenceType and sampleType.

6660 12/06/2012 08:44 PM Aaron Marcuse-Kubitza

input.Makefile: SVN: add: Add a Source table to store datasource metadata. This adds a Source table to all herbaria which are listed in .herbaria, and therefore didn't previously need a Source table to indicate their referenceType and sampleType.

6659 12/06/2012 08:43 PM Aaron Marcuse-Kubitza

inputs/input.Makefile: SVN: add: verify/: Added *.xls to svn:ignore

6658 12/06/2012 08:33 PM Aaron Marcuse-Kubitza

inputs/.geoscrub/geoscrub_output/postprocess.sql: Added index on decimallatitude, decimallongitude

6657 12/06/2012 08:30 PM Aaron Marcuse-Kubitza

Added inputs/.geoscrub/geoscrub_output/postprocess.sql, which adds NOT NULL constraints on decimallatitude, decimallongitude

6656 12/06/2012 06:55 PM Aaron Marcuse-Kubitza

schemas/vegbien.sql: analytical_*: Changed type of boolean columns to integer so that they will be exported as 1/0 instead of t/f by export_analytical_db. This will enable MySQL's LOAD DATA INFILE to import the values correctly.

6655 12/06/2012 06:07 PM Aaron Marcuse-Kubitza

backups/Makefile: Checksums: %.md5/test: Only use md5sum's -v option on Mac, because it's not supported on Linux (there, verbose mode is the default)

6654 12/06/2012 05:57 PM Aaron Marcuse-Kubitza

mappings/VegCore.csv: cultivated* source: Added picklist value to URL

6653 12/06/2012 05:46 PM Aaron Marcuse-Kubitza

README.TXT: Data import: On nimoy: Creating analytical_aggregate table: publish_analytical_db: Rewrapped line

6652 12/06/2012 05:45 PM Aaron Marcuse-Kubitza

README.TXT: Data import: On nimoy: Creating analytical_aggregate table: Changed name to analytical_aggregate_r<revision> to allow storing different versions simultaneously

6651 12/06/2012 05:26 PM Aaron Marcuse-Kubitza

publish_analytical_db: Require caller to specify the name of the table to load data into. This allows appending a revision to analytical_aggregate, or publishing a table other than analytical_aggregate.

6650 12/06/2012 05:24 PM Aaron Marcuse-Kubitza

publish_analytical_db: Require caller to specify the name of the table to load data into. This allows appending a revision to analytical_aggregate, or publishing a table other than analytical_aggregate.

6649 12/06/2012 05:23 PM Aaron Marcuse-Kubitza

inputs/input.Makefile: SVN: add: verify/: Added *.xls to svn:ignore

6648 12/06/2012 04:33 PM Aaron Marcuse-Kubitza

backups/Makefile: SQL: Full DB: vegbien.%.backup: Also generate MD5 sum

6647 12/06/2012 04:18 PM Aaron Marcuse-Kubitza

inputs/import.stats.xls: Updated import times

6646 12/05/2012 10:57 AM Aaron Marcuse-Kubitza

README.TXT: Data import: Delete previous imports based on the full DB backup file

6645 12/05/2012 10:56 AM Aaron Marcuse-Kubitza

backups/Makefile: Support removing public schema versions based on the version of a full DB backup

6644 12/05/2012 10:52 AM Aaron Marcuse-Kubitza

mappings/VegCore.csv, Veg+-VegCore.csv: Removed the additional dict namespace for the SALVIAS sources. This removes the extra "dict:" namespace on the generate Redmine source term names.

6643 12/05/2012 10:49 AM Aaron Marcuse-Kubitza

mappings/VegCore.csv, Veg+-VegCore.csv: Added TNRS provider namespace, inserting it before BIEN in the sort order

6642 12/05/2012 10:43 AM Aaron Marcuse-Kubitza

mappings/VegCore.csv: Changed + to _ in URL fragments

6641 12/05/2012 10:41 AM Aaron Marcuse-Kubitza

mappings/VegCore.csv, Veg+-VegCore.csv: Removed the additional BIEN namespace for the BIEN sources, and use just BIEN2 and VegBIEN as the sub-namespaces. This removes the extra "BIEN:" namespace on the generate Redmine source term names.

6640 12/05/2012 10:37 AM Aaron Marcuse-Kubitza

mappings/VegCore.csv, Veg+-VegCore.csv: Removed the "terms" text in the current DwC terms' provider, and leave just the sort order. This removes the extra "terms:" namespace on the generate Redmine source term names.

6639 12/05/2012 10:33 AM Aaron Marcuse-Kubitza

dict2redmine: url_term(): Remove empty URL comments

6638 12/05/2012 10:32 AM Aaron Marcuse-Kubitza

dict2redmine: url_comment_text(): Interpret a URL comment containing just a number as a sort order without text

6637 12/05/2012 10:29 AM Aaron Marcuse-Kubitza

dict2redmine: url_term(): Prefix any provider in the URL to the term name, to create a namespace. Each hierarchical component of the provider is stored in a URL comment.

6636 12/05/2012 10:27 AM Aaron Marcuse-Kubitza

dict2redmine: Added url_comment_re

6635 12/05/2012 10:27 AM Aaron Marcuse-Kubitza

dict2redmine: Added url_comment_text()

6634 12/05/2012 10:26 AM Aaron Marcuse-Kubitza

dict2redmine: Call simplify_url() just on the first source so that source2redmine_url() can use the raw URL (to extract comments, etc.)

6633 12/05/2012 09:09 AM Aaron Marcuse-Kubitza

dict2redmine: Removed no longer used explicit Definition column #

6632 12/05/2012 09:06 AM Aaron Marcuse-Kubitza

dict2redmine: Use the input spreadsheet's column names and order, and pass through columns other than the term and sources columns

6631 12/05/2012 09:05 AM Aaron Marcuse-Kubitza

mappingsf/VegCore.csv, Veg+-VegCore.csv: Renamed Comments to Definition to match Redmine table

6630 12/05/2012 09:04 AM Aaron Marcuse-Kubitza

mappings/VegCore.csv, Veg+-VegCore.csv: Reversed order of Comments, Sources columns to match Redmine table order

6629 12/05/2012 08:58 AM Aaron Marcuse-Kubitza

mappings/VegCore.csv, Veg+-VegCore.csv: Reversed order of Comments, Sources columns to match Redmine table order

6628 12/05/2012 08:56 AM Aaron Marcuse-Kubitza

dict2redmine: Store term_str in a var before using it, like sources_str

6627 12/05/2012 08:43 AM Aaron Marcuse-Kubitza

dict2redmine: Added Definition column

6626 12/05/2012 08:32 AM Aaron Marcuse-Kubitza

dict2redmine: Take term and sources col #s as args instead of hardcoding them by column name or position

6625 12/05/2012 08:25 AM Aaron Marcuse-Kubitza

dict2redmine: url_term(): Also match any namespace that's part of the term

6624 12/05/2012 08:21 AM Aaron Marcuse-Kubitza

dict2redmine: Sources: Use source2redmine_url() to extract the term from each source URL

6623 12/05/2012 08:20 AM Aaron Marcuse-Kubitza

dict2redmine: source2redmine_url(): Support empty URLs

6622 12/05/2012 08:15 AM Aaron Marcuse-Kubitza

dict2redmine: url_term(): Fixed bug where need to use match.group() instead of match.groups()

6621 12/05/2012 08:02 AM Aaron Marcuse-Kubitza

mappings/Makefile: Create VegCore.redmine from VegCore.csv

6620 12/05/2012 08:01 AM Aaron Marcuse-Kubitza

Added dict2redmine

6619 12/05/2012 07:26 AM Aaron Marcuse-Kubitza

mappings/VegCore.csv, Veg+-VegCore.csv: Renamed Source column to Sources because it can contain multiple sources

6618 12/05/2012 07:12 AM Aaron Marcuse-Kubitza

mappings/VegCore.csv, Veg+-VegCore.csv: Source: DwC terms: Scoped sort order by category, using the steps at <https://projects.nceas.ucsb.edu/nceas/projects/bien/wiki/VegCore_refactoring#Scope-DwC-sort-order-by-category>

6617 12/05/2012 06:35 AM Aaron Marcuse-Kubitza

mappings/VegCore.csv, Veg+-VegCore.csv: Source: VegX terms: Split combined field group/field sort order into separate sort orders for field and field group

6616 12/05/2012 06:22 AM Aaron Marcuse-Kubitza

mappings/VegCore.csv, Veg+-VegCore.csv: Source: VegX terms: Added top-level table sort order

6615 12/05/2012 06:07 AM Aaron Marcuse-Kubitza

mappings/VegCore.csv: taxonName: Reordered sources so it would sort with *TaxonName and scientificName

6614 12/05/2012 06:04 AM Aaron Marcuse-Kubitza

mappings/VegCore.csv: Source: DwC Taxon: Added sort order so it would sort together with its fields

6613 12/05/2012 05:58 AM Aaron Marcuse-Kubitza

mappings/VegCore.csv, Veg+-VegCore.csv: Source: DwC occurrenceID: Corrected sort order to 019 instead of 000

6612 12/05/2012 05:55 AM Aaron Marcuse-Kubitza

mappings/VegCore.csv, Veg+-VegCore.csv: Source: DwC terms: Added category, with category sort order, as URL comment. This will allow terms to be sorted just within their category rather than globally for DwC.

6611 12/05/2012 05:49 AM Aaron Marcuse-Kubitza

mappings/Veg+-VegCore.csv: Source: DwC: dcterms: Added back "dcterms:" prefix to URL fragment

6610 12/05/2012 05:31 AM Aaron Marcuse-Kubitza

mappings/VegCore.csv: Source: TNRS terms: Added sort order to web page fragment (simple_download, detailed_download)

6609 12/05/2012 05:25 AM Aaron Marcuse-Kubitza

mappings/VegCore.csv, Veg+-VegCore.csv: Removed no longer used Order within table column. Instead, embed the sort order in the URL using a () comment.

6608 12/05/2012 05:23 AM Aaron Marcuse-Kubitza

mappings/VegCore.csv, Veg+-VegCore.csv: Merged the Order within table column with the Source URL, using the steps at <https://projects.nceas.ucsb.edu/nceas/projects/bien/wiki/VegCore_refactoring#Merging-the-Order-within-table-column-with-the-Source-URL&gt;. Sorting on the Source column now groups related terms together according to their sort order in the source they came from.

6607 12/05/2012 05:11 AM Aaron Marcuse-Kubitza

mappings/Veg+-VegCore.csv: Order within table: Filled in missing sort orders

6606 12/05/2012 04:51 AM Aaron Marcuse-Kubitza

mappings/VegCore.csv, Veg+-VegCore.csv: Source: Web pages: Use / instead of . to separate nested elements of URL fragment. Use _ instead of + to represent space.

6605 12/05/2012 04:19 AM Aaron Marcuse-Kubitza

mappings/VegCore.csv: Order within table: Filled in missing sort orders

6604 12/05/2012 03:58 AM Aaron Marcuse-Kubitza

mappings/VegCore.csv: Source: Removed trailing whitespace

6603 12/05/2012 03:43 AM Aaron Marcuse-Kubitza

mappings/VegCore.csv: Order within table: Fixed to include one entry for every URL, including when the Order field is empty and there are multiple URLs

6602 12/05/2012 03:33 AM Aaron Marcuse-Kubitza

mappings/VegCore.csv: Order within table: Fixed to include one entry for every URL

6601 12/05/2012 02:03 AM Aaron Marcuse-Kubitza

mappings/VegCore.csv: Source: "dcterms:" terms: Fixed URL fragments to use : instead of # after dcterms

6600 12/05/2012 01:42 AM Aaron Marcuse-Kubitza

mappings/VegCore.csv, Veg+-VegCore.csv: Sources: BIEN2: Moved DB sort order right before the DB name in the URL to avoid duplicating the DB name in the comment

6599 12/05/2012 01:35 AM Aaron Marcuse-Kubitza

mappings/VegCore.csv, Veg+-VegCore.csv: Sources: Added sort order comments to URLs so they sort in the order indicated at <https://projects.nceas.ucsb.edu/nceas/projects/bien/wiki/VegCore#Sources&gt;. URL comments are enclosed in (), and the sort order element of a comment is a number right after the ( .

6598 12/05/2012 12:37 AM Aaron Marcuse-Kubitza

mappings/Makefile: .Veg+-VegCore.csv.last_cleanup: Sort by the source URL instead of the VegCore term

6597 12/05/2012 12:35 AM Aaron Marcuse-Kubitza

mappings/Makefile: Split .Veg+-VegCore.csv.last_cleanup and .VegX-VegCore.csv.last_cleanup into separate targets so their recipes can be different

6596 12/05/2012 12:17 AM Aaron Marcuse-Kubitza

mappings/VegCore-VegBIEN.csv: Mapped dcterms:rights

6595 12/04/2012 11:52 PM Aaron Marcuse-Kubitza

backups/Makefile: Synchronization: Also sync *.md5

6594 12/04/2012 09:52 PM Aaron Marcuse-Kubitza

import_all: Fixed bug where need to wait for all asynchronous commands started before the main import, not just the first

6593 12/04/2012 09:51 PM Aaron Marcuse-Kubitza

import_all: Import all Source tables before the herbaria list, so that any custom metadata will override the info in the herbaria list

6592 12/04/2012 09:43 PM Aaron Marcuse-Kubitza

input.Makefile: Tables discovery: $(dontImport): Don't import the Source table when $import_source env var is set to ""

6591 12/04/2012 09:33 PM Aaron Marcuse-Kubitza

input.Makefile: SVN: add: Add a Source table to store datasource metadata. This adds a Source table to all herbaria which are listed in .herbaria, and therefore didn't previously need a Source table to indicate their referenceType and sampleType.

6590 12/04/2012 09:22 PM Aaron Marcuse-Kubitza

Added inputs/VASCAN/Source/

6589 12/04/2012 09:18 PM Aaron Marcuse-Kubitza

csvs.py: stream_info(): Use the Excel dialect and an empty header if the CSV file is empty

6588 12/04/2012 08:29 PM Aaron Marcuse-Kubitza

pg_dump_limit: Also remove CREATE DATABASE statements

6587 12/04/2012 08:09 PM Aaron Marcuse-Kubitza

Added inputs/JBM/Source/

6586 12/04/2012 08:07 PM Aaron Marcuse-Kubitza

mappings/Veg+-VegCore.csv: Removed type->dcterms:type automapping because this term can have many different meanings

6585 12/04/2012 08:06 PM Aaron Marcuse-Kubitza

mappings/Veg+-VegCore.csv: Removed type->dcterms:type automapping because this term can have many different meanings

6584 12/04/2012 08:03 PM Aaron Marcuse-Kubitza

Added inputs/NVS/Source/

6583 12/04/2012 08:02 PM Aaron Marcuse-Kubitza

Added inputs/IUCN/European_Red_List_Plants/header.csv

6582 12/04/2012 08:02 PM Aaron Marcuse-Kubitza

Added inputs/CVS/_src/

6581 12/04/2012 08:01 PM Aaron Marcuse-Kubitza

input.Makefile: SVN: $(svnFilesGlob): Include test.xml.ref instead of all test*.xml* to avoid including test outputs

6580 12/04/2012 07:57 PM Aaron Marcuse-Kubitza

inputs/*/verify/: Updated svn:ignore

6579 12/04/2012 07:55 PM Aaron Marcuse-Kubitza

mappings/VegCore-VegBIEN.csv: Mapped verbatimCoordinates

6578 12/04/2012 07:54 PM Aaron Marcuse-Kubitza

Updated inputs/HIBG/Specimen/new_terms.csv

6577 12/04/2012 07:50 PM Aaron Marcuse-Kubitza

Added inputs/HIBG/Source/

6576 12/04/2012 07:49 PM Aaron Marcuse-Kubitza

inputs/HIBG/verify/: Updated svn:ignore

6575 12/04/2012 07:47 PM Aaron Marcuse-Kubitza

Added inputs/NCU-NCSC/Source/

6574 12/04/2012 07:47 PM Aaron Marcuse-Kubitza

inputs/NCU-NCSC/verify/: Updated svn:ignore