Project

General

Profile

Statistics
| Revision:

# Date Author Comment
6745 12/11/2012 02:53 AM Aaron Marcuse-Kubitza

Added inputs/VegBank/_archive

6744 12/11/2012 02:50 AM Aaron Marcuse-Kubitza

input.Makefile: Testing: Added `%/test: %/test.xml` to allow testing just a subdir

6743 12/11/2012 02:42 AM Aaron Marcuse-Kubitza

input.Makefile: General targets: Added `%/: %/map.csv` to allow remaking just a subdirectory

6742 12/11/2012 01:53 AM Aaron Marcuse-Kubitza

inputs/CVS/: Refreshed data with new export from Bob

6741 12/11/2012 01:52 AM Aaron Marcuse-Kubitza

inputs/CVS/cvs-archive-2012-12-04.schema.sql: Fixed types using the steps at <https://projects.nceas.ucsb.edu/nceas/projects/bien/wiki/Tools#MS-Access-database-MDB>

6740 12/11/2012 01:48 AM Aaron Marcuse-Kubitza

bin/map: Removed column names simplification, which was causing columns with the same alphanumeric characters but different punctuation to be simplified to the same name. Name simplification is now performed by the mapping mechanism itself, and can be overridden in the mappings.

6739 12/11/2012 01:24 AM Aaron Marcuse-Kubitza

Regenerated inputs/VegBank/new_terms.csv

6738 12/11/2012 12:08 AM Aaron Marcuse-Kubitza

Added inputs/NCU/_src/NCU_specimens_public_2012-12-10.zip.url

6737 12/11/2012 12:04 AM Aaron Marcuse-Kubitza

inputs/NCU/: Refreshed data with new export from Bob

6736 12/10/2012 09:33 PM Aaron Marcuse-Kubitza

Renamed inputs/NCU-NCSC/ to NCU because this is the primary herbarium contained in the data

6735 12/10/2012 09:31 PM Aaron Marcuse-Kubitza

Renamed inputs/NCU-NCSC/ to NCU because this is the primary herbarium contained in the data

6734 12/10/2012 09:21 PM Aaron Marcuse-Kubitza

Added inputs/NCU-NCSC/_archive

6733 12/10/2012 09:21 PM Aaron Marcuse-Kubitza

input.Makefile: SVN: add: Also add _archive/ subdir

6732 12/10/2012 08:23 PM Aaron Marcuse-Kubitza

publish_analytical_db: Time the import of the data

6731 12/10/2012 08:17 PM Aaron Marcuse-Kubitza

export_analytical_db: Also create a .md5 for the export

6730 12/10/2012 08:16 PM Aaron Marcuse-Kubitza

export_analytical_db: Run commands in the root svn dir

6729 12/10/2012 08:05 PM Aaron Marcuse-Kubitza

mappings/VegCore.csv: soil composition terms: Removed ppm units from the definition, since units are actually fraction or percent

6728 12/10/2012 08:03 PM Aaron Marcuse-Kubitza

README.TXT: Data import: Moved On local machine steps after On nimoy steps, because the On nimoy steps are more important

6727 12/10/2012 07:59 PM Aaron Marcuse-Kubitza

mappings/VegCore.csv: Comments: Added quotes around quotations from other sources

6726 12/10/2012 07:56 PM Aaron Marcuse-Kubitza

mappings/VegCore.csv: Definitions: Added quotes around quotations from other sources

6725 12/10/2012 07:52 PM Aaron Marcuse-Kubitza

Added backups/fix_perms

6724 12/10/2012 07:45 PM Aaron Marcuse-Kubitza

backups/Makefile: Synchronization: %/download: Also download any .md5 file for the file

6723 12/10/2012 07:24 PM Aaron Marcuse-Kubitza

README.TXT: Data import: On nimoy: Added instructions to verify the export's MD5 sum

6722 12/10/2012 07:23 PM Aaron Marcuse-Kubitza

README.TXT: Data import: On nimoy: Replaced step to manually upload the analytical_aggregate export with the command to download it from jupiter

6721 12/10/2012 07:18 PM Aaron Marcuse-Kubitza

README.TXT: Data import: On nimoy: Removed step to rename any existing analytical_aggregate table, since the import is now done directly into the versioned table

6720 12/10/2012 07:11 PM Aaron Marcuse-Kubitza

mappings/VegCore.csv: VegX terms without definitions in VegX: Added definitions from non-VegX sources, etc.

6719 12/10/2012 06:28 PM Aaron Marcuse-Kubitza

README.TXT: Data import: Added instructions to verify the backups' MD5 sums on jupiter

6718 12/10/2012 06:23 PM Aaron Marcuse-Kubitza

README.TXT: Data import: Removed step to copy backups to jupiter, because this now done by `make backups/upload`

6717 12/10/2012 06:11 PM Aaron Marcuse-Kubitza

schemas/vegbien.sql: sync_*_to_view(): Also add `GRANT SELECT TO bien_read` on the view used to generate the table, in case the permission was lost when the view was modified

6716 12/10/2012 06:08 PM Aaron Marcuse-Kubitza

schemas/vegbien.sql: sync_*_to_view(): Added `GRANT SELECT TO bien_read`

6715 12/10/2012 06:04 PM Aaron Marcuse-Kubitza

schemas/vegbien.sql: analytical_*: Added back bien_read's SELECT permissions, which had gotten removed when the tables were re-synced to their views

6714 12/10/2012 06:03 PM Aaron Marcuse-Kubitza

schemas/vegbien.my.sql: Regenerated with expanded repl word matching

6713 12/10/2012 06:00 PM Aaron Marcuse-Kubitza

repl: :-prefixing of words to form vars: Fixed bug where : must be matched as a lookbehind assertion, not a capturing group, because the provided regexp itself or its replacement may reference capturing groups, which it expects to be numbered starting with 1

6712 12/10/2012 05:47 PM Aaron Marcuse-Kubitza

inputs/import.stats.xls: Updated import times

6711 12/10/2012 05:47 PM Aaron Marcuse-Kubitza

Regenerated inputs/NY/Specimen/new_terms.csv

6710 12/07/2012 06:49 PM Aaron Marcuse-Kubitza

inputs/JBM/Specimen/test.xml.ref: Updated inserted row count, which had gotten changed when a test was run on a non-empty database

6709 12/07/2012 06:34 PM Aaron Marcuse-Kubitza

mappings/VegCore.csv: height_ft: Added source to VegBank:stemHeight, which includes a description of the term

6708 12/07/2012 06:30 PM Aaron Marcuse-Kubitza

mappings/VegCore.csv: height_m: Added source to VegBank:stemHeight, which includes a description of the term

6707 12/07/2012 06:27 PM Aaron Marcuse-Kubitza

mappings/VegCore.csv: projectName: Added definition from VegX schema

6706 12/07/2012 06:25 PM Aaron Marcuse-Kubitza

mappings/VegCore.csv: project*Date: Re-sourced to VegBank:project.*Date, since VegX does not have an equivalent term

6705 12/07/2012 06:16 PM Aaron Marcuse-Kubitza

mappings/VegCore.csv: VegX terms: Added definitions from VegX schema, where provided

6704 12/07/2012 05:55 PM Aaron Marcuse-Kubitza

mappings/VegCore.csv: projectName: Added source to VegX:project.title

6703 12/07/2012 05:50 PM Aaron Marcuse-Kubitza

mappings/Makefile: .VegCore.csv.last_cleanup, .Veg+-VegCore.csv.last_cleanup: Also replace Veg+ terms in sources list, which are references to VegCore terms that have since been renamed

6702 12/07/2012 05:47 PM Aaron Marcuse-Kubitza

repl: text mode: Also match "vars" with the term prefixed by ":". Consider .- to be word characters. Only match a word when preceeded by whitespace or CSV field start characters.

6701 12/07/2012 05:41 PM Aaron Marcuse-Kubitza

repl: column mode: Removed parsing and checking of column name, which prevents using repl for general-purpose regexp/word replacement

6700 12/07/2012 04:41 PM Aaron Marcuse-Kubitza

mappings/VegCore.csv: Definition: Moved closed list values to new Values column

6699 12/07/2012 04:39 PM Aaron Marcuse-Kubitza

mappings/VegCore.csv: Added Values column to store closed list values

6698 12/07/2012 04:35 PM Aaron Marcuse-Kubitza

mappings/VegCore.csv: geovalidation terms: Removed source to DwC:georeferenceVerificationStatus, because that is for georeferencing, not geovalidation

6697 12/07/2012 04:30 PM Aaron Marcuse-Kubitza

mappings/VegCore.csv, Veg+-VegCore.csv: obs*Date: Re-sourced to VegX:obs*Date

6696 12/07/2012 04:23 PM Aaron Marcuse-Kubitza

mappings/VegCore.csv: projectID: Re-sourced to plotObservation.projectID

6695 12/07/2012 04:17 PM Aaron Marcuse-Kubitza

dict2redmine: RedmineTableWriter: Fixed bug where need to escape embedded | , using new redmine_table_esc()

6694 12/07/2012 04:16 PM Aaron Marcuse-Kubitza

dict2redmine: Added redmine_table_esc()

6693 12/07/2012 04:13 PM Aaron Marcuse-Kubitza

dict2redmine: Added redmine_esc()

6692 12/07/2012 04:06 PM Aaron Marcuse-Kubitza

mappings/VegCore.csv: TCS terms: Added TCS comments from <https://projects.nceas.ucsb.edu/nceas/projects/bien/wiki/VegBIEN_taxonomic_schema#TCS>

6691 12/07/2012 03:58 PM Aaron Marcuse-Kubitza

dict2redmine: redmine_add_links(): Include the [] in the link text, to avoid the need for redmine_pad(), etc.

6690 12/07/2012 03:55 PM Aaron Marcuse-Kubitza

dict2redmine: redmine_add_links(): Make the link bold so it stands out as a link

6689 12/07/2012 03:53 PM Aaron Marcuse-Kubitza

dict2redmine: redmine_add_links(): Use new redmine_pad()

6688 12/07/2012 03:53 PM Aaron Marcuse-Kubitza

dict2redmine: Added redmine_pad()

6687 12/07/2012 03:51 PM Aaron Marcuse-Kubitza

dict2redmine: redmine_add_links(): Use redmine_url() to create the internal link

6686 12/07/2012 03:51 PM Aaron Marcuse-Kubitza

dict2redmine: redmine_url(): Support internal links

6685 12/07/2012 03:47 PM Aaron Marcuse-Kubitza

dict2redmine: redmine_add_links(): Fixed bug where need to explicitly specify the source name as the link text

6684 12/07/2012 03:44 PM Aaron Marcuse-Kubitza

dict2redmine: RedmineDictWriter: Link citations to entry in sources list

6683 12/07/2012 03:18 PM Aaron Marcuse-Kubitza

mappings/VegCore.csv: Restored name of latLongDomainValid term, which had gotten replaced with coordinatePrecision

6682 12/07/2012 03:16 PM Aaron Marcuse-Kubitza

mappings/VegCore.csv: startDate, endDate: Changed comment to "a date range usually applies to the event"

6681 12/07/2012 03:14 PM Aaron Marcuse-Kubitza

mappings/VegCore.csv: Added Examples column to store data in TCS Examples column at <https://projects.nceas.ucsb.edu/nceas/projects/bien/wiki/VegBIEN_taxonomic_schema#TCS>

6680 12/07/2012 03:10 PM Aaron Marcuse-Kubitza

mappings/VegCore.csv: non-phylogenetic taxonomic terms: Added definitions from TCS schema

6679 12/07/2012 03:07 PM Aaron Marcuse-Kubitza

mappings/VegCore.csv: *forma, *variety: Fixed sources, which had been swapped between the two sets of terms

6678 12/07/2012 02:57 PM Aaron Marcuse-Kubitza

mappings/VegCore.csv: Special values: Moved comments to Comments column

6677 12/07/2012 01:11 PM Aaron Marcuse-Kubitza

dict2redmine: Fixed bug where all header fields need to be preserved because columns are now filtered out instead of removed in each row

6676 12/07/2012 01:05 PM Aaron Marcuse-Kubitza

dict2redmine: Put the definition before and outside of the fields table

6675 12/07/2012 12:53 PM Aaron Marcuse-Kubitza

mappings/VegCore.csv: Moved Definition values that are actually comments into separate Comments column

6674 12/07/2012 12:46 PM Aaron Marcuse-Kubitza

dict2redmine: RedmineDictWriter: Omit empty columns from the fields table

6673 12/06/2012 11:18 PM Aaron Marcuse-Kubitza

dict2redmine: Generate an outline instead of a table so each term will be indexed in the page's table of contents

6672 12/06/2012 11:13 PM Aaron Marcuse-Kubitza

schemas/vegbien.sql: coordinates: coordinates_unique: Removed md5() around verbatimcoordinates because functions within unique indexes (other than the standard COALESCE) are not yet supported by the import algorithm

6671 12/06/2012 11:10 PM Aaron Marcuse-Kubitza

exc.py: e_msg(): Emit a warning instead of an AssertionError if e.args0 isn't a string, to assist in debugging malformed exceptions

6670 12/06/2012 11:02 PM Aaron Marcuse-Kubitza

mappings/VegCore.csv: sampleType: Re-sourced to bien_web.observationType

6669 12/06/2012 10:34 PM Aaron Marcuse-Kubitza

schemas/vegbien.sql: analytical_stem_view: scientificNameWithMorphospecies: Fixed bug where need to use the taxonomicname in accepted_taxonlabel instead of accepted_taxonverbatim, because taxonverbatim only contains fields provided by the data provider (in this case, TNRS), but TNRS does not provide the taxonomic name (taxon name+author), only the taxon name and author components separately

6668 12/06/2012 10:09 PM Aaron Marcuse-Kubitza

schemas/vegbien.sql: coordinates: coordinates_unique: Use md5() on verbatimcoordinates so that it doesn't cause the index row size to be exceeded. This should fix a bug in the HIBG import where long verbatimcoordinates values were causing the error 'OperationalError: index row size 2784 exceeds maximum 2712 for index "coordinates_unique"'.

6667 12/06/2012 09:56 PM Aaron Marcuse-Kubitza

backups/Makefile: Synchronization: Replaced download target, which downloads all backups, with %/download, which downloads just a specific backup, because you would generally only want to extract a single backup from the archive for reinstallation

6666 12/06/2012 09:47 PM Aaron Marcuse-Kubitza

backups/Makefile: Synchronization: Sync with jupiter instead of vegbiendev. This requires running `make backups/upload` on vegbiendev to archive the files, instead of `make backups/download` to download them to your local machine.

6665 12/06/2012 08:58 PM Aaron Marcuse-Kubitza

inputs/.geoscrub/geoscrub_output/map.csv: Removed no longer accurate comment that county is not yet used by VegBIEN

6664 12/06/2012 08:56 PM Aaron Marcuse-Kubitza

inputs/.geoscrub/geoscrub_output/map.csv: *validity: Remapped 2 ("Point is <=5km from putative GADM polygon, but still outside it") to true instead of false, because 5km is close enough to the polygon that the mismatch could result from shapefile simplifying, boundary changes, or other factors that don't affect geovalidity

6663 12/06/2012 08:52 PM Aaron Marcuse-Kubitza

inputs/.geoscrub/geoscrub_output/map.csv: *validity: Remapped 0 ("Complete name provided, but couldn't be scrubbed to GADM") to NULL instead of false, because the absence of a name match does not mean the coordinates are invalid

6662 12/06/2012 08:51 PM Aaron Marcuse-Kubitza

inputs/.{NCBI,TNRS}/import_order.txt: Added Source

6661 12/06/2012 08:50 PM Aaron Marcuse-Kubitza

input.Makefile: SVN: add: Add a Source table to store datasource metadata. This adds a Source table to all herbaria which are listed in .herbaria, and therefore didn't previously need a Source table to indicate their referenceType and sampleType.

6660 12/06/2012 08:44 PM Aaron Marcuse-Kubitza

input.Makefile: SVN: add: Add a Source table to store datasource metadata. This adds a Source table to all herbaria which are listed in .herbaria, and therefore didn't previously need a Source table to indicate their referenceType and sampleType.

6659 12/06/2012 08:43 PM Aaron Marcuse-Kubitza

inputs/input.Makefile: SVN: add: verify/: Added *.xls to svn:ignore

6658 12/06/2012 08:33 PM Aaron Marcuse-Kubitza

inputs/.geoscrub/geoscrub_output/postprocess.sql: Added index on decimallatitude, decimallongitude

6657 12/06/2012 08:30 PM Aaron Marcuse-Kubitza

Added inputs/.geoscrub/geoscrub_output/postprocess.sql, which adds NOT NULL constraints on decimallatitude, decimallongitude

6656 12/06/2012 06:55 PM Aaron Marcuse-Kubitza

schemas/vegbien.sql: analytical_*: Changed type of boolean columns to integer so that they will be exported as 1/0 instead of t/f by export_analytical_db. This will enable MySQL's LOAD DATA INFILE to import the values correctly.

6655 12/06/2012 06:07 PM Aaron Marcuse-Kubitza

backups/Makefile: Checksums: %.md5/test: Only use md5sum's -v option on Mac, because it's not supported on Linux (there, verbose mode is the default)

6654 12/06/2012 05:57 PM Aaron Marcuse-Kubitza

mappings/VegCore.csv: cultivated* source: Added picklist value to URL

6653 12/06/2012 05:46 PM Aaron Marcuse-Kubitza

README.TXT: Data import: On nimoy: Creating analytical_aggregate table: publish_analytical_db: Rewrapped line

6652 12/06/2012 05:45 PM Aaron Marcuse-Kubitza

README.TXT: Data import: On nimoy: Creating analytical_aggregate table: Changed name to analytical_aggregate_r<revision> to allow storing different versions simultaneously

6651 12/06/2012 05:26 PM Aaron Marcuse-Kubitza

publish_analytical_db: Require caller to specify the name of the table to load data into. This allows appending a revision to analytical_aggregate, or publishing a table other than analytical_aggregate.

6650 12/06/2012 05:24 PM Aaron Marcuse-Kubitza

publish_analytical_db: Require caller to specify the name of the table to load data into. This allows appending a revision to analytical_aggregate, or publishing a table other than analytical_aggregate.

6649 12/06/2012 05:23 PM Aaron Marcuse-Kubitza

inputs/input.Makefile: SVN: add: verify/: Added *.xls to svn:ignore

6648 12/06/2012 04:33 PM Aaron Marcuse-Kubitza

backups/Makefile: SQL: Full DB: vegbien.%.backup: Also generate MD5 sum

6647 12/06/2012 04:18 PM Aaron Marcuse-Kubitza

inputs/import.stats.xls: Updated import times

6646 12/05/2012 10:57 AM Aaron Marcuse-Kubitza

README.TXT: Data import: Delete previous imports based on the full DB backup file