/ - Changes - BIEN 3 - NCEAS Projects

root @ 6673

#	Date	Author	Comment
6673	12/06/2012 11:18 PM	Aaron Marcuse-Kubitza	dict2redmine: Generate an outline instead of a table so each term will be indexed in the page's table of contents
6672	12/06/2012 11:13 PM	Aaron Marcuse-Kubitza	schemas/vegbien.sql: coordinates: coordinates_unique: Removed md5() around verbatimcoordinates because functions within unique indexes (other than the standard COALESCE) are not yet supported by the import algorithm
6671	12/06/2012 11:10 PM	Aaron Marcuse-Kubitza	exc.py: e_msg(): Emit a warning instead of an AssertionError if e.args[0] isn't a string, to assist in debugging malformed exceptions
6670	12/06/2012 11:02 PM	Aaron Marcuse-Kubitza	mappings/VegCore.csv: sampleType: Re-sourced to bien_web.observationType
6669	12/06/2012 10:34 PM	Aaron Marcuse-Kubitza	schemas/vegbien.sql: analytical_stem_view: scientificNameWithMorphospecies: Fixed bug where need to use the taxonomicname in accepted_taxonlabel instead of accepted_taxonverbatim, because taxonverbatim only contains fields provided by the data provider (in this case, TNRS), but TNRS does not provide the taxonomic name (taxon name+author), only the taxon name and author components separately
6668	12/06/2012 10:09 PM	Aaron Marcuse-Kubitza	schemas/vegbien.sql: coordinates: coordinates_unique: Use md5() on verbatimcoordinates so that it doesn't cause the index row size to be exceeded. This should fix a bug in the HIBG import where long verbatimcoordinates values were causing the error 'OperationalError: index row size 2784 exceeds maximum 2712 for index "coordinates_unique"'.
6667	12/06/2012 09:56 PM	Aaron Marcuse-Kubitza	backups/Makefile: Synchronization: Replaced download target, which downloads all backups, with %/download, which downloads just a specific backup, because you would generally only want to extract a single backup from the archive for reinstallation
6666	12/06/2012 09:47 PM	Aaron Marcuse-Kubitza	backups/Makefile: Synchronization: Sync with jupiter instead of vegbiendev. This requires running `make backups/upload` on vegbiendev to archive the files, instead of `make backups/download` to download them to your local machine.
6665	12/06/2012 08:58 PM	Aaron Marcuse-Kubitza	inputs/.geoscrub/geoscrub_output/map.csv: Removed no longer accurate comment that county is not yet used by VegBIEN
6664	12/06/2012 08:56 PM	Aaron Marcuse-Kubitza	inputs/.geoscrub/geoscrub_output/map.csv: *validity: Remapped 2 ("Point is <=5km from putative GADM polygon, but still outside it") to true instead of false, because 5km is close enough to the polygon that the mismatch could result from shapefile simplifying, boundary changes, or other factors that don't affect geovalidity
6663	12/06/2012 08:52 PM	Aaron Marcuse-Kubitza	inputs/.geoscrub/geoscrub_output/map.csv: *validity: Remapped 0 ("Complete name provided, but couldn't be scrubbed to GADM") to NULL instead of false, because the absence of a name match does not mean the coordinates are invalid
6662	12/06/2012 08:51 PM	Aaron Marcuse-Kubitza	inputs/.{NCBI,TNRS}/import_order.txt: Added Source
6661	12/06/2012 08:50 PM	Aaron Marcuse-Kubitza	input.Makefile: SVN: add: Add a Source table to store datasource metadata. This adds a Source table to all herbaria which are listed in .herbaria, and therefore didn't previously need a Source table to indicate their referenceType and sampleType.
6660	12/06/2012 08:44 PM	Aaron Marcuse-Kubitza	input.Makefile: SVN: add: Add a Source table to store datasource metadata. This adds a Source table to all herbaria which are listed in .herbaria, and therefore didn't previously need a Source table to indicate their referenceType and sampleType.
6659	12/06/2012 08:43 PM	Aaron Marcuse-Kubitza	inputs/input.Makefile: SVN: add: verify/: Added *.xls to svn:ignore
6658	12/06/2012 08:33 PM	Aaron Marcuse-Kubitza	inputs/.geoscrub/geoscrub_output/postprocess.sql: Added index on decimallatitude, decimallongitude
6657	12/06/2012 08:30 PM	Aaron Marcuse-Kubitza	Added inputs/.geoscrub/geoscrub_output/postprocess.sql, which adds NOT NULL constraints on decimallatitude, decimallongitude
6656	12/06/2012 06:55 PM	Aaron Marcuse-Kubitza	schemas/vegbien.sql: analytical_*: Changed type of boolean columns to integer so that they will be exported as 1/0 instead of t/f by export_analytical_db. This will enable MySQL's LOAD DATA INFILE to import the values correctly.
6655	12/06/2012 06:07 PM	Aaron Marcuse-Kubitza	backups/Makefile: Checksums: %.md5/test: Only use md5sum's -v option on Mac, because it's not supported on Linux (there, verbose mode is the default)
6654	12/06/2012 05:57 PM	Aaron Marcuse-Kubitza	mappings/VegCore.csv: cultivated* source: Added picklist value to URL
6653	12/06/2012 05:46 PM	Aaron Marcuse-Kubitza	README.TXT: Data import: On nimoy: Creating analytical_aggregate table: publish_analytical_db: Rewrapped line
6652	12/06/2012 05:45 PM	Aaron Marcuse-Kubitza	README.TXT: Data import: On nimoy: Creating analytical_aggregate table: Changed name to analytical_aggregate_r<revision> to allow storing different versions simultaneously
6651	12/06/2012 05:26 PM	Aaron Marcuse-Kubitza	publish_analytical_db: Require caller to specify the name of the table to load data into. This allows appending a revision to analytical_aggregate, or publishing a table other than analytical_aggregate.
6650	12/06/2012 05:24 PM	Aaron Marcuse-Kubitza	publish_analytical_db: Require caller to specify the name of the table to load data into. This allows appending a revision to analytical_aggregate, or publishing a table other than analytical_aggregate.
6649	12/06/2012 05:23 PM	Aaron Marcuse-Kubitza	inputs/input.Makefile: SVN: add: verify/: Added *.xls to svn:ignore
6648	12/06/2012 04:33 PM	Aaron Marcuse-Kubitza	backups/Makefile: SQL: Full DB: vegbien.%.backup: Also generate MD5 sum
6647	12/06/2012 04:18 PM	Aaron Marcuse-Kubitza	inputs/import.stats.xls: Updated import times
6646	12/05/2012 10:57 AM	Aaron Marcuse-Kubitza	README.TXT: Data import: Delete previous imports based on the full DB backup file
6645	12/05/2012 10:56 AM	Aaron Marcuse-Kubitza	backups/Makefile: Support removing public schema versions based on the version of a full DB backup
6644	12/05/2012 10:52 AM	Aaron Marcuse-Kubitza	mappings/VegCore.csv, Veg+-VegCore.csv: Removed the additional dict namespace for the SALVIAS sources. This removes the extra "dict:" namespace on the generated Redmine source term names.
6643	12/05/2012 10:49 AM	Aaron Marcuse-Kubitza	mappings/VegCore.csv, Veg+-VegCore.csv: Added TNRS provider namespace, inserting it before BIEN in the sort order
6642	12/05/2012 10:43 AM	Aaron Marcuse-Kubitza	mappings/VegCore.csv: Changed + to _ in URL fragments
6641	12/05/2012 10:41 AM	Aaron Marcuse-Kubitza	mappings/VegCore.csv, Veg+-VegCore.csv: Removed the additional BIEN namespace for the BIEN sources, and use just BIEN2 and VegBIEN as the sub-namespaces. This removes the extra "BIEN:" namespace on the generated Redmine source term names.
6640	12/05/2012 10:37 AM	Aaron Marcuse-Kubitza	mappings/VegCore.csv, Veg+-VegCore.csv: Removed the "terms" text in the current DwC terms' provider, leaving just the sort order. This removes the extra "terms:" namespace on the generated Redmine source term names.
6639	12/05/2012 10:33 AM	Aaron Marcuse-Kubitza	dict2redmine: url_term(): Remove empty URL comments
6638	12/05/2012 10:32 AM	Aaron Marcuse-Kubitza	dict2redmine: url_comment_text(): Interpret a URL comment containing just a number as a sort order without text
6637	12/05/2012 10:29 AM	Aaron Marcuse-Kubitza	dict2redmine: url_term(): Prefix any provider in the URL to the term name, to create a namespace. Each hierarchical component of the provider is stored in a URL comment.
6636	12/05/2012 10:27 AM	Aaron Marcuse-Kubitza	dict2redmine: Added url_comment_re
6635	12/05/2012 10:27 AM	Aaron Marcuse-Kubitza	dict2redmine: Added url_comment_text()
6634	12/05/2012 10:26 AM	Aaron Marcuse-Kubitza	dict2redmine: Call simplify_url() just on the first source so that source2redmine_url() can use the raw URL (to extract comments, etc.)
6633	12/05/2012 09:09 AM	Aaron Marcuse-Kubitza	dict2redmine: Removed no longer used explicit Definition column #
6632	12/05/2012 09:06 AM	Aaron Marcuse-Kubitza	dict2redmine: Use the input spreadsheet's column names and order, and pass through columns other than the term and sources columns
6631	12/05/2012 09:05 AM	Aaron Marcuse-Kubitza	mappings/VegCore.csv, Veg+-VegCore.csv: Renamed Comments to Definition to match Redmine table
6630	12/05/2012 09:04 AM	Aaron Marcuse-Kubitza	mappings/VegCore.csv, Veg+-VegCore.csv: Reversed order of Comments, Sources columns to match Redmine table order
6629	12/05/2012 08:58 AM	Aaron Marcuse-Kubitza	mappings/VegCore.csv, Veg+-VegCore.csv: Reversed order of Comments, Sources columns to match Redmine table order
6628	12/05/2012 08:56 AM	Aaron Marcuse-Kubitza	dict2redmine: Store term_str in a var before using it, like sources_str
6627	12/05/2012 08:43 AM	Aaron Marcuse-Kubitza	dict2redmine: Added Definition column
6626	12/05/2012 08:32 AM	Aaron Marcuse-Kubitza	dict2redmine: Take term and sources col #s as args instead of hardcoding them by column name or position
6625	12/05/2012 08:25 AM	Aaron Marcuse-Kubitza	dict2redmine: url_term(): Also match any namespace that's part of the term
6624	12/05/2012 08:21 AM	Aaron Marcuse-Kubitza	dict2redmine: Sources: Use source2redmine_url() to extract the term from each source URL
6623	12/05/2012 08:20 AM	Aaron Marcuse-Kubitza	dict2redmine: source2redmine_url(): Support empty URLs
6622	12/05/2012 08:15 AM	Aaron Marcuse-Kubitza	dict2redmine: url_term(): Fixed bug where need to use match.group() instead of match.groups()
6621	12/05/2012 08:02 AM	Aaron Marcuse-Kubitza	mappings/Makefile: Create VegCore.redmine from VegCore.csv
6620	12/05/2012 08:01 AM	Aaron Marcuse-Kubitza	Added dict2redmine
6619	12/05/2012 07:26 AM	Aaron Marcuse-Kubitza	mappings/VegCore.csv, Veg+-VegCore.csv: Renamed Source column to Sources because it can contain multiple sources
6618	12/05/2012 07:12 AM	Aaron Marcuse-Kubitza	mappings/VegCore.csv, Veg+-VegCore.csv: Source: DwC terms: Scoped sort order by category, using the steps at <https://projects.nceas.ucsb.edu/nceas/projects/bien/wiki/VegCore_refactoring#Scope-DwC-sort-order-by-category>
6617	12/05/2012 06:35 AM	Aaron Marcuse-Kubitza	mappings/VegCore.csv, Veg+-VegCore.csv: Source: VegX terms: Split combined field group/field sort order into separate sort orders for field and field group
6616	12/05/2012 06:22 AM	Aaron Marcuse-Kubitza	mappings/VegCore.csv, Veg+-VegCore.csv: Source: VegX terms: Added top-level table sort order
6615	12/05/2012 06:07 AM	Aaron Marcuse-Kubitza	mappings/VegCore.csv: taxonName: Reordered sources so it would sort with *TaxonName and scientificName
6614	12/05/2012 06:04 AM	Aaron Marcuse-Kubitza	mappings/VegCore.csv: Source: DwC Taxon: Added sort order so it would sort together with its fields
6613	12/05/2012 05:58 AM	Aaron Marcuse-Kubitza	mappings/VegCore.csv, Veg+-VegCore.csv: Source: DwC occurrenceID: Corrected sort order to 019 instead of 000
6612	12/05/2012 05:55 AM	Aaron Marcuse-Kubitza	mappings/VegCore.csv, Veg+-VegCore.csv: Source: DwC terms: Added category, with category sort order, as URL comment. This will allow terms to be sorted just within their category rather than globally for DwC.
6611	12/05/2012 05:49 AM	Aaron Marcuse-Kubitza	mappings/Veg+-VegCore.csv: Source: DwC: dcterms: Added back "dcterms:" prefix to URL fragment
6610	12/05/2012 05:31 AM	Aaron Marcuse-Kubitza	mappings/VegCore.csv: Source: TNRS terms: Added sort order to web page fragment (simple_download, detailed_download)
6609	12/05/2012 05:25 AM	Aaron Marcuse-Kubitza	mappings/VegCore.csv, Veg+-VegCore.csv: Removed no longer used Order within table column. Instead, embed the sort order in the URL using a () comment.
6608	12/05/2012 05:23 AM	Aaron Marcuse-Kubitza	mappings/VegCore.csv, Veg+-VegCore.csv: Merged the Order within table column with the Source URL, using the steps at <https://projects.nceas.ucsb.edu/nceas/projects/bien/wiki/VegCore_refactoring#Merging-the-Order-within-table-column-with-the-Source-URL>. Sorting on the Source column now groups related terms together according to their sort order in the source they came from.
6607	12/05/2012 05:11 AM	Aaron Marcuse-Kubitza	mappings/Veg+-VegCore.csv: Order within table: Filled in missing sort orders
6606	12/05/2012 04:51 AM	Aaron Marcuse-Kubitza	mappings/VegCore.csv, Veg+-VegCore.csv: Source: Web pages: Use / instead of . to separate nested elements of URL fragment. Use _ instead of + to represent space.
6605	12/05/2012 04:19 AM	Aaron Marcuse-Kubitza	mappings/VegCore.csv: Order within table: Filled in missing sort orders
6604	12/05/2012 03:58 AM	Aaron Marcuse-Kubitza	mappings/VegCore.csv: Source: Removed trailing whitespace
6603	12/05/2012 03:43 AM	Aaron Marcuse-Kubitza	mappings/VegCore.csv: Order within table: Fixed to include one entry for every URL, including when the Order field is empty and there are multiple URLs
6602	12/05/2012 03:33 AM	Aaron Marcuse-Kubitza	mappings/VegCore.csv: Order within table: Fixed to include one entry for every URL
6601	12/05/2012 02:03 AM	Aaron Marcuse-Kubitza	mappings/VegCore.csv: Source: "dcterms:" terms: Fixed URL fragments to use : instead of # after dcterms
6600	12/05/2012 01:42 AM	Aaron Marcuse-Kubitza	mappings/VegCore.csv, Veg+-VegCore.csv: Sources: BIEN2: Moved DB sort order right before the DB name in the URL to avoid duplicating the DB name in the comment
6599	12/05/2012 01:35 AM	Aaron Marcuse-Kubitza	mappings/VegCore.csv, Veg+-VegCore.csv: Sources: Added sort order comments to URLs so they sort in the order indicated at <https://projects.nceas.ucsb.edu/nceas/projects/bien/wiki/VegCore#Sources>. URL comments are enclosed in (), and the sort order element of a comment is a number right after the ( .
6598	12/05/2012 12:37 AM	Aaron Marcuse-Kubitza	mappings/Makefile: .Veg+-VegCore.csv.last_cleanup: Sort by the source URL instead of the VegCore term
6597	12/05/2012 12:35 AM	Aaron Marcuse-Kubitza	mappings/Makefile: Split .Veg+-VegCore.csv.last_cleanup and .VegX-VegCore.csv.last_cleanup into separate targets so their recipes can be different
6596	12/05/2012 12:17 AM	Aaron Marcuse-Kubitza	mappings/VegCore-VegBIEN.csv: Mapped dcterms:rights
6595	12/04/2012 11:52 PM	Aaron Marcuse-Kubitza	backups/Makefile: Synchronization: Also sync *.md5
6594	12/04/2012 09:52 PM	Aaron Marcuse-Kubitza	import_all: Fixed bug where need to wait for all asynchronous commands started before the main import, not just the first
6593	12/04/2012 09:51 PM	Aaron Marcuse-Kubitza	import_all: Import all Source tables before the herbaria list, so that any custom metadata will override the info in the herbaria list
6592	12/04/2012 09:43 PM	Aaron Marcuse-Kubitza	input.Makefile: Tables discovery: $(dontImport): Don't import the Source table when $import_source env var is set to ""
6591	12/04/2012 09:33 PM	Aaron Marcuse-Kubitza	input.Makefile: SVN: add: Add a Source table to store datasource metadata. This adds a Source table to all herbaria which are listed in .herbaria, and therefore didn't previously need a Source table to indicate their referenceType and sampleType.
6590	12/04/2012 09:22 PM	Aaron Marcuse-Kubitza	Added inputs/VASCAN/Source/
6589	12/04/2012 09:18 PM	Aaron Marcuse-Kubitza	csvs.py: stream_info(): Use the Excel dialect and an empty header if the CSV file is empty
6588	12/04/2012 08:29 PM	Aaron Marcuse-Kubitza	pg_dump_limit: Also remove CREATE DATABASE statements
6587	12/04/2012 08:09 PM	Aaron Marcuse-Kubitza	Added inputs/JBM/Source/
6586	12/04/2012 08:07 PM	Aaron Marcuse-Kubitza	mappings/Veg+-VegCore.csv: Removed type->dcterms:type automapping because this term can have many different meanings
6585	12/04/2012 08:06 PM	Aaron Marcuse-Kubitza	mappings/Veg+-VegCore.csv: Removed type->dcterms:type automapping because this term can have many different meanings
6584	12/04/2012 08:03 PM	Aaron Marcuse-Kubitza	Added inputs/NVS/Source/
6583	12/04/2012 08:02 PM	Aaron Marcuse-Kubitza	Added inputs/IUCN/European_Red_List_Plants/header.csv
6582	12/04/2012 08:02 PM	Aaron Marcuse-Kubitza	Added inputs/CVS/_src/
6581	12/04/2012 08:01 PM	Aaron Marcuse-Kubitza	input.Makefile: SVN: $(svnFilesGlob): Include test.xml.ref instead of all test.xml to avoid including test outputs
6580	12/04/2012 07:57 PM	Aaron Marcuse-Kubitza	inputs/*/verify/: Updated svn:ignore
6579	12/04/2012 07:55 PM	Aaron Marcuse-Kubitza	mappings/VegCore-VegBIEN.csv: Mapped verbatimCoordinates
6578	12/04/2012 07:54 PM	Aaron Marcuse-Kubitza	Updated inputs/HIBG/Specimen/new_terms.csv
6577	12/04/2012 07:50 PM	Aaron Marcuse-Kubitza	Added inputs/HIBG/Source/
6576	12/04/2012 07:49 PM	Aaron Marcuse-Kubitza	inputs/HIBG/verify/: Updated svn:ignore
6575	12/04/2012 07:47 PM	Aaron Marcuse-Kubitza	Added inputs/NCU-NCSC/Source/
6574	12/04/2012 07:47 PM	Aaron Marcuse-Kubitza	inputs/NCU-NCSC/verify/: Updated svn:ignore
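
Several of the revisions above (6599, 6637, 6638) describe URL comments: text enclosed in () inside a source URL, optionally starting with a number that acts as a sort order, with each provider component prefixed to the term name as a namespace. The following is a minimal sketch of that idea, not the actual dict2redmine code. The names URL_COMMENT_RE, parse_url_comments() and namespaced_term(), the example URL, and the assumption that the term is the part after the last # are all hypothetical.

```python
import re

# Hypothetical pattern (not taken from dict2redmine): a URL comment is "(...)".
# An optional leading number is the sort order; the rest is the comment text.
URL_COMMENT_RE = re.compile(r"\((\d+)?\s*([^()]*)\)")


def parse_url_comments(url):
    """Return (url_without_comments, [(sort_order_or_None, text), ...])."""
    comments = []
    for match in URL_COMMENT_RE.finditer(url):
        order, text = match.group(1), match.group(2).strip()
        # A comment containing just a number is a sort order without text.
        comments.append((int(order) if order is not None else None, text))
    return URL_COMMENT_RE.sub("", url), comments


def namespaced_term(url):
    """Prefix each non-empty comment text to the term, separated by ':'.

    Assumes the term is whatever follows the last '#' in the URL.
    """
    stripped, comments = parse_url_comments(url)
    term = stripped.rsplit("#", 1)[-1]
    parts = [text for _, text in comments if text] + [term]
    return ":".join(parts)


if __name__ == "__main__":
    example = "http://example.org/terms#(2 BIEN)(019)occurrenceID"
    print(parse_url_comments(example))
    print(namespaced_term(example))
```

Keeping the sort order inside the URL means that a plain text sort on the Sources column groups related terms together, which is the effect described in revision 6608.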
