Project

General

Profile

Activity

From 02/08/2013 to 03/09/2013

03/09/2013

10:50 PM Revision 7877: web/VegRef/.htaccess: Turned off RewriteEngine temporarily because it was preventing the internal VegRef redirection from working
Aaron Marcuse-Kubitza
10:48 PM Revision 7876: Added web/VegRef/, which provides a URL-shortening service for vegetation references. VegRef URLs are short and human-readable enough to be used as database column names. This also causes Google spreadsheets to display a clickable link for each column name, which takes the user to the data dictionary definition for the term. String row IDs (such as global IDs <https://projects.nceas.ucsb.edu/nceas/projects/bien/wiki/Global_IDs>) can also be clickable VegRef URLs.
Aaron Marcuse-Kubitza

03/08/2013

10:26 PM Revision 7875: inputs/NCU/Specimen/map.csv: cultivated: Map false (0) to NULL because cultivated can only be true or NULL, not false
Aaron Marcuse-Kubitza
09:14 PM Revision 7874: inputs/CTFS/_src/_README.TXT: Added e-mails from Brad Boyle about issues with species aggregation methods
Aaron Marcuse-Kubitza

03/07/2013

06:27 PM Revision 7873: inputs/UNCC/Specimen/map.csv: Populated specimenHolderInstitutions
Aaron Marcuse-Kubitza
03:21 PM Revision 7872: Added planning/goals/BIEN3_derived_data_products.docx.url
Aaron Marcuse-Kubitza
03:19 PM Revision 7871: Added planning/goals/BIEN3_derived_data_products.docx
Aaron Marcuse-Kubitza

03/06/2013

11:00 PM Revision 7870: inputs/ARIZ/omoccurrences/map.csv: cultivationStatus: Map to NULL if it's false, because cultivated can only be true or NULL
Aaron Marcuse-Kubitza
10:01 PM Revision 7869: schemas/vegbien.sql: analytical_stem_view: Merged taxonName_verbatim and scientificNameAuthorship_verbatim into scientificName_verbatim
Aaron Marcuse-Kubitza
09:29 PM Revision 7868: schemas/vegbien.sql: analytical_plot: Added specimens fields at the end, for use by plots data that also tracks specimens (e.g. Madidi, SALVIAS)
Aaron Marcuse-Kubitza
09:21 PM Revision 7867: README.TXT: Full database import: In PostgreSQL: Removed tests related to unscrubbed_taxondetermination_view and TNRS taxondeterminations, which are no longer needed because TNRS is joined directly to analytical_stem_view
Aaron Marcuse-Kubitza
08:36 PM Revision 7866: mappings/VegCore.htm: Regenerated from wiki. Renamed specimenDonor to specimenDonorInstitution to clarify that this is an institution, not a collector.
Aaron Marcuse-Kubitza
08:30 PM Revision 7865: inputs/UNCC/Specimen/map.csv: Mapped flower, fruit -> reproductiveCondition using translation is from e-mail conv between Jim Matthews at Habitat Assessment and Bob Peet on 2010-12-2~3
Aaron Marcuse-Kubitza
08:17 PM Revision 7864: make_analytical_db: Support the $log flag, which toggles output to the terminal or a log file
Aaron Marcuse-Kubitza
07:50 PM Revision 7863: inputs/import.stats.xls: Updated import times. The core import time has dropped by more than half (!) to ~12 hours, now that the TNRS scrubbing is added using a simple LEFT JOIN, instead of being pushed through the normalized schema. Not since October has the import been this fast!
Aaron Marcuse-Kubitza
07:27 PM Revision 7862: import_times: When there are no matching logs, exit with status 0 instead of 1 because this is a normal condition
Aaron Marcuse-Kubitza
07:26 PM Revision 7861: import_times: Fixed bug where sed would wait for user input from stdin when there were no matching logs
Aaron Marcuse-Kubitza
07:22 PM Revision 7860: README.TXT: Full database import: record the import times in inputs/import.stats.xls: If the row counts do not match up: Added step to get the datasource names from the log file footers
Aaron Marcuse-Kubitza
07:12 PM Revision 7859: inputs/UNCC/Specimen/map.csv: cultivated: assume anything non-NULL to be cultivated, so that we err on the side of excluding possibly-cultivated specimens
Aaron Marcuse-Kubitza
06:49 PM Revision 7858: inputs/UNCC/_src/_README.TXT: Added e-mail conv between Jim Matthews at Habitat Assessment and Bob Peet about the meanings of the Leaf, Flower, Fruit, and Root codes
Aaron Marcuse-Kubitza
05:38 PM Revision 7857: README.TXT: Full database import: Added steps to manually back up the DB. (These steps are specific to the import process, and include filters to exclude previous imports. There are also general backup steps in the Backups > Full DB section.)
Aaron Marcuse-Kubitza
02:56 PM Revision 7856: inputs/ACAD/Specimen/map.csv: institutionCode: Replace "Acadia University" with the IH code ACAD
Aaron Marcuse-Kubitza
02:18 PM Revision 7855: inputs/.TNRS/data.sql: Re-ran TNRS on the test_taxonomic_names so that the sample data would contain all the accepted names under the current acceptedScientificName formula, and would not produce any entries for unscrubbed names in tnrs_input_name
Aaron Marcuse-Kubitza
01:59 PM Revision 7854: schemas/vegbien.sql: tnrs_input_name: taxonlabel.taxonomicname subquery: Don't exclude taxonlabels with a matched_label_id, because matched_label_id is no longer used to indicate that a name has a scrubbed name (this was used when all of TNRS was imported at the beginning of the import)
Aaron Marcuse-Kubitza
01:56 PM Revision 7853: schemas/vegbien.sql: tnrs_input_name: accepted_name subquery: Renamed to the VegCore table name AcceptedTaxon
Aaron Marcuse-Kubitza
01:54 PM Revision 7852: schemas/vegbien.sql: tnrs_input_name: taxonlabel.taxonomicname subquery: Use new MatchedTaxon view, which use VegCore column names where possible
Aaron Marcuse-Kubitza
01:46 PM Revision 7851: schemas/vegbien.sql: tnrs_input_name: accepted_name subquery: Use new MatchedTaxon, AcceptedTaxon views, which use VegCore column names where possible
Aaron Marcuse-Kubitza
01:22 PM Revision 7850: README.TXT: Full database import: Added instructions to run TNRS and view the TNRS log
Aaron Marcuse-Kubitza
10:21 AM Revision 7849: README.TXT: Full database import: Removed no longer necessary command to scrub unscrubbed taxondeterminations
Aaron Marcuse-Kubitza

03/05/2013

11:07 PM Revision 7848: *.sql: concat_ws() calls: Fixed bug where need to surround it with NULLIF(..., '') because concat_ws() returns '' instead of the required NULL when supplied with only NULL args
Aaron Marcuse-Kubitza
10:22 PM Revision 7847: schemas/vegbien.sql: analytical_stem_view: concat_ws() calls: Fixed bug where arguments were still enclosed in ARRAY[], because the name had been changed from concat_delim() but PostgreSQL removed the VARIADIC keyword without also removing the ARRAY[] wrapper
Aaron Marcuse-Kubitza
10:11 PM Revision 7846: schemas/vegbien.sql: Removed no longer needed unscrubbed_taxondetermination_view. Use taxonlabel joined to ScrubbedTaxon instead.
Aaron Marcuse-Kubitza
10:05 PM Revision 7845: inputs/.TNRS/schema.sql: ScrubbedTaxon: Include only matches with a match score of at least 0.8 (score_ok())
Aaron Marcuse-Kubitza
09:59 PM Revision 7844: inputs/.TNRS/schema.sql: Added back score_ok() and tnrs.tnrs_score_ok index
Aaron Marcuse-Kubitza
09:53 PM Revision 7843: *.sql: Replaced concat_delim() with concat_ws(), which is built-in as of PostgreSQL 9.1 (http://stackoverflow.com/questions/1943433/postgresql-concat-ws-like-function)
Aaron Marcuse-Kubitza
09:08 PM Revision 7842: inputs/U/Specimen/map.csv: Populated specimenHolderInstitutions
Aaron Marcuse-Kubitza
09:07 PM Revision 7841: mappings/VegCore.htm: Regenerated from wiki. Renamed specimenHolders to specimenHolderInstitutions to make it obvious that this is a list of institutions, such as would be in institutionCode in a DwC export.
Aaron Marcuse-Kubitza
09:06 PM Revision 7840: inputs/SpeciesLink/Specimen/map.csv: Fixed embedded line endings
Aaron Marcuse-Kubitza
08:28 PM Revision 7839: inputs/NY/Ecatalog_all/map.csv: PlantFungDescription->occurrenceRemarks: Removed "plant fungus" label because this field actually contains the specimenDescription
Aaron Marcuse-Kubitza
08:06 PM Revision 7838: inputs/.TNRS/import_order.txt: Removed public.unscrubbed_taxondetermination_view
Aaron Marcuse-Kubitza
08:01 PM Revision 7837: inputs/.TNRS/: Moved no longer needed public.unscrubbed_taxondetermination_view/ to _archive/
Aaron Marcuse-Kubitza
07:52 PM Revision 7836: root Makefile: Datasources: scrub: Removed no longer needed scrub-remake, which has been replaced by direct use of TNRS.ScrubbedTaxon
Aaron Marcuse-Kubitza
07:46 PM Revision 7835: schemas/vegbien.sql: analytical_stem_view: taxondetermination: Fixed bug where need to include only the is_datasource_current taxondetermination
Aaron Marcuse-Kubitza
07:43 PM Revision 7834: schemas/vegbien.sql: analytical_stem_view: Use new TNRS.ScrubbedTaxon directly instead of the scrubbed taxondeterminations added to the normalized DB
Aaron Marcuse-Kubitza
07:20 PM Revision 7833: inputs/.TNRS/schema.sql: MatchedTaxon: Renamed matchedTaxonomicStatus to preferred name taxonomicStatus
Aaron Marcuse-Kubitza
07:19 PM Revision 7832: mappings/VegCore.htm: Regenerated from wiki. Taxon: Moved scrubbing-related terms to MatchedTaxon.
Aaron Marcuse-Kubitza
07:16 PM Revision 7831: inputs/.TNRS/schema.sql: Added matched* prefix where needed
Aaron Marcuse-Kubitza
07:02 PM Revision 7830: inputs/.TNRS/schema.sql: MatchedTaxon: Added missing Name_submitted column
Aaron Marcuse-Kubitza
06:59 PM Revision 7829: inputs/.TNRS/schema.sql: MatchedTaxon: Name_matched: Fixed mapping to matchedTaxonName
Aaron Marcuse-Kubitza
06:56 PM Revision 7828: inputs/.TNRS/schema.sql: MatchedTaxon: Fixed bug where can't map Source to source because it refers just to the source of the matched name, rather than of the record (which would be TNRS itself)
Aaron Marcuse-Kubitza
06:55 PM Revision 7827: inputs/.TNRS/schema.sql: tnrs_populate_derived_fields(): Use new concat_delim()
Aaron Marcuse-Kubitza
06:43 PM Revision 7826: schemas/functions.sql: _join*(): Use new concat_delim()
Aaron Marcuse-Kubitza
06:41 PM Revision 7825: schemas/functions.sql: Added concat_delim() (also in public, but this copy is for use by the functions schema)
Aaron Marcuse-Kubitza
06:37 PM Revision 7824: inputs/.TNRS/schema.sql: Removed no longer used tnrs+accepted. Use ScrubbedTaxon instead.
Aaron Marcuse-Kubitza
06:22 PM Revision 7823: inputs/.TNRS/schema.sql: Added MatchedTaxon, AcceptedTaxon, ScrubbedTaxon views, which rename the columns to VegCore names
Aaron Marcuse-Kubitza
05:17 PM Revision 7822: Removed inputs/GBIF/_src/GBIFPortalDB-2012-12-11.dump*, which has been replaced by GBIFPortalDB-2013-02-20.dump
Aaron Marcuse-Kubitza
05:14 PM Revision 7821: mappings/VegCore.htm: Regenerated from wiki. Renamed MatchedTaxonDetermination/AcceptedTaxonDetermination to MatchedTaxon/AcceptedTaxon because they relate to a taxon *name* rather than to an application of that name to a TaxonOccurrence
Aaron Marcuse-Kubitza
02:43 PM Revision 7820: schemas/functions.sql: functions._map(): Made function STRICT because it always returns NULL if either argument is NULL
Aaron Marcuse-Kubitza
02:39 PM Revision 7819: inputs/UNCC/Specimen/map.csv: campus: Mapped to cultivated, interpreting non-NULL values as true
Aaron Marcuse-Kubitza
02:29 PM Revision 7818: inputs/UNCC/Specimen/map.csv: campus: Removed mapping to occurrenceRemarks because this is just a code, not a description. Added definition from Bob's validation comments.
Aaron Marcuse-Kubitza
02:19 PM Revision 7817: inputs/UNCC/Specimen/map.csv: accession: Removed comment that this is actually the catalogNumber, because they are now synonyms
Aaron Marcuse-Kubitza
01:56 PM Revision 7816: mappings/VegCore.htm: Regenerated from wiki. Split specimenIndexer into specimenStampingInstitution and specimenDonor (which are often the same).
Aaron Marcuse-Kubitza
12:25 PM Revision 7815: mappings/VegCore-VegBIEN.csv: Removed specimenIndexer->institutionCode mappings because the institutionCode should refer only to the specimenHolder
Aaron Marcuse-Kubitza
12:18 PM Revision 7814: Added inputs/GBIF/_src/GBIFPortalDB-2013-02-20.dump.gz (a new download they posted to the FTP site)
Aaron Marcuse-Kubitza
12:13 PM Revision 7813: schemas/vegbien.sql: analytical_specimen: Removed growthForm, threatened_bien at Brad's request (https://projects.nceas.ucsb.edu/nceas/projects/bien/wiki/Spot-checking#UNCC > Brad Boyle's comments > growthForm, threatened_bien)
Aaron Marcuse-Kubitza
11:24 AM Revision 7812: inputs/UNCC/Specimen/map.csv: Removed mappings to reproductiveCondition because we don't know what the single-letter codes mean
Aaron Marcuse-Kubitza
11:16 AM Revision 7811: inputs/UNCC/Specimen/map.csv: Fixed embedded line endings
Aaron Marcuse-Kubitza
11:05 AM Revision 7810: schemas/vegbien.sql: _locationnarrative_is_cultivated(): Fixed bug where need to use NULL instead of false (and bool_or() instead of OR) for a non-match, because the absence of a match doesn't mean it's not cultivated
Aaron Marcuse-Kubitza
03:02 AM Revision 7809: mappings/VegCore.htm: Regenerated from wiki. Editing: Changed ambiguous terms templates to list the name of the alternative itself in the alternatives list, instead of the ambiguous term + a source suffix. This avoids the need to choose a source suffix for each alternative of an ambiguous term, as the ambiguous term itself (without the suffix) can be listed as the synonym. Coordinates: Disambiguated verbatimCoordinateSystem.
Aaron Marcuse-Kubitza
02:30 AM Revision 7808: mappings/Makefile: VegCore.thesaurus.csv: `uniq`ify the map after applying initial translations, to support mappings that appear twice (once as the result of an alternative's synonym entry and again as the result of an ambiguous term's alternatives listing)
Aaron Marcuse-Kubitza

03/04/2013

11:57 PM Revision 7807: schemas/vegbien.sql: analytical_stem_view: Added temperature_C, precipitation_m for possible use in John Wiens' plant climatic niche evolution project ("how fast is the rate of climatic niche evolution among species compared to projected changes in climate over the next 100 years? How do these rates compare for temperature and precipitation variables?")
Aaron Marcuse-Kubitza

03/01/2013

08:17 PM Revision 7806: mappings/VegCore.htm: Regenerated from wiki. Added specimenDescription.
Aaron Marcuse-Kubitza
08:00 PM Revision 7805: mappings/VegCore.htm: Regenerated from wiki. Split apart identificationQualifier and taxonFit. Split apart taxonConfidence and identificationVerificationStatus. Moved identificationQualifier to new QualifiedTaxonName table.
Aaron Marcuse-Kubitza
07:48 PM Revision 7804: README.TXT: Full database import: To view progress: Use tail -100 to show more of the log files, and not tail -f to avoid blocking when the user just wants to check the status
Aaron Marcuse-Kubitza
04:56 PM Revision 7803: inputs/BRIT/specimen_flat/map.csv: Only map Notes_Plant to occurrenceRemarks if there is a separate Locality_Description, in order to filter out Notes_Plant values that concatenate everything together (often including a herbarium name that would mess up the cultivated flag parsing)
Aaron Marcuse-Kubitza
04:49 PM Revision 7802: my2pg.data: Replace \0 (with the backslash) with 0 because \0 isn't allowed in UTF-8
Aaron Marcuse-Kubitza
03:53 PM Revision 7801: root Makefile: VegBIEN DB: mk_db: Added command to import arbitrary DB users from a config file
Aaron Marcuse-Kubitza

02/28/2013

03:31 PM Revision 7800: inputs/import.stats.xls: Updated import times
Aaron Marcuse-Kubitza
03:29 PM Revision 7799: inputs/TEX/Specimen2/map.csv: _join() PLANT DATA and MISC NOTES to form occurrenceRemarks instead of _alt()ing them
Aaron Marcuse-Kubitza
03:27 PM Revision 7798: inputs/TEX/Specimen2/test.xml.ref: Accepted test
Aaron Marcuse-Kubitza
02:58 PM Revision 7797: inputs/BRIT/_MySQL/MySQL.*.sql.make: Include all tables, not just specimen_flat
Aaron Marcuse-Kubitza
02:57 PM Revision 7796: my2pg: Replace tinytext, mediumtext with text
Aaron Marcuse-Kubitza
11:48 AM Revision 7795: Added inputs/GBIF/_src/_README.TXT with Brad's GBIF filtering rules
Aaron Marcuse-Kubitza
11:30 AM Revision 7794: inputs/FIA/TREE/postprocess.sql: Use :table instead of hardcoding the table name
Aaron Marcuse-Kubitza
11:26 AM Revision 7793: inputs/FIA/*/postprocess.sql: UNIQUE constraint: Added UNITCD because although it may not be needed to scope PLOT, it is present in all the core tables and thus must be part of the joins' USING columns
Aaron Marcuse-Kubitza
10:58 AM Revision 7792: inputs/FIA/*/postprocess.sql for core tables: Added unique constraint on the datasource's primary key fields
Aaron Marcuse-Kubitza
02:38 AM Revision 7791: Added inputs/CTFS/AggregateObservation/ from BIEN2 data
Aaron Marcuse-Kubitza
02:16 AM Revision 7790: inputs/CTFS/: Switched global _no_import to table-specific _no_imports to allow adding new tables that *are* imported
Aaron Marcuse-Kubitza

02/27/2013

09:54 PM Revision 7789: Added schemas/TAPIR/SpeciesLink.tapir_request.txt symlink
Aaron Marcuse-Kubitza
09:53 PM Revision 7788: Added inputs/SpeciesLink/_src/tapir_request.txt
Aaron Marcuse-Kubitza
09:52 PM Revision 7787: Added schemas/TAPIR/
Aaron Marcuse-Kubitza
09:48 PM Revision 7786: Added planning/use_cases/rare_species.png
Aaron Marcuse-Kubitza
09:47 PM Revision 7785: Added planning/goals/BIEN deliverables.png
Aaron Marcuse-Kubitza
09:31 PM Revision 7784: Added planning/resources/iPlant_Overview_BIEN_2011.pdf.url
Aaron Marcuse-Kubitza
09:29 PM Revision 7783: planning/goals/Enquist_BIEN_Overview_2011.pptx: Replaced with .url file. Note that the Plone site <http://groups.nceas.ucsb.edu/bien/meetings> is not searchable by document filename.
Aaron Marcuse-Kubitza
09:26 PM Revision 7782: Added planning/workflow/Boyle_BIEN_data_quality.pdf.url
Aaron Marcuse-Kubitza
09:21 PM Revision 7781: Added planning/goals/BIEN_Science_2011.pptx.url, BIEN-Tech-Oct2011.ppt.url, Enquist_BIEN_Overview_2011.pptx
Aaron Marcuse-Kubitza
09:00 PM Revision 7780: Added schemas/VegX/run.py
Aaron Marcuse-Kubitza
08:53 PM Revision 7779: Added schemas/VegBIEN/growth_forms/Barbara_Dobrin.parsing.scripts/
Aaron Marcuse-Kubitza
08:35 PM Revision 7778: Added schemas/VegX/Veg-X_paper_in_press_Applied_Vegetation_Science.pdf.url
Aaron Marcuse-Kubitza
08:25 PM Revision 7777: Added schemas/VegX/JVS-VegX.pdf.url
Aaron Marcuse-Kubitza
08:22 PM Revision 7776: Added schemas/VegBank/VegBank_Data_Model.with_Collections_intro.ppt
Aaron Marcuse-Kubitza
08:13 PM Revision 7775: Added planning/goals/BIEN_White_Paper.pdf.url
Aaron Marcuse-Kubitza
08:11 PM Revision 7774: Added planning/meetings/2010-12-08.working_group/BIEN3-plan.docx
Aaron Marcuse-Kubitza
08:04 PM Revision 7773: Added schemas/VegBIEN/planning/taxonomy/On_the_Use_of_Taxonomic_Concepts_in_Support_of_Biodiversity_Research_and_Taxonomy.pdf.url
Aaron Marcuse-Kubitza
07:59 PM Revision 7772: Added schemas/BIEN2/ERD/
Aaron Marcuse-Kubitza
07:54 PM Revision 7771: Added schemas/BIEN2/bien2_core_schema.png
Aaron Marcuse-Kubitza
07:54 PM Revision 7770: Added schemas/BIEN2/mappings/BIEN2.viewFullOccurrence-DwC.xls
Aaron Marcuse-Kubitza
07:51 PM Revision 7769: Added schemas/VegBank/mappings/VegBank.observation-VegX.plotObservation.xls
Aaron Marcuse-Kubitza
07:49 PM Revision 7768: schemas/vegbien.ERD.mwb: Fixed table positions due to plantobservation field addition
Aaron Marcuse-Kubitza
07:35 PM Revision 7767: Added schemas/DiGIR/gbif-occurrencestore.digir.client.url
Aaron Marcuse-Kubitza
07:33 PM Revision 7766: Added schemas/DiGIR/gbif-crawler.digir.*.url
Aaron Marcuse-Kubitza
07:27 PM Revision 7765: inputs/.geoscrub/import_order.txt: Added Source
Aaron Marcuse-Kubitza
07:09 PM Revision 7764: Added schemas/VegX/
Aaron Marcuse-Kubitza
07:00 PM Revision 7763: Added schemas/VegBIEN/taxonomy/family_higher_plant_group.sql
Aaron Marcuse-Kubitza
06:57 PM Revision 7762: Added planning/use_cases/Trait_BIEN_data_use_case.docx
Aaron Marcuse-Kubitza
06:56 PM Revision 7761: Added planning/workflow/2010.working_group.white_board/
Aaron Marcuse-Kubitza
06:54 PM Revision 7760: Added planning/milestones/_archive/BIEN3 Milestones and Development Timeline_MLN2.*
Aaron Marcuse-Kubitza
06:51 PM Revision 7759: Added planning/workflow/BIEN-modArch-Dec2010 NS-SBD 1.4.ppt
Aaron Marcuse-Kubitza
06:50 PM Revision 7758: Added planning/milestones/_archive/BIEN3_milestones.2011-12-13.doc
Aaron Marcuse-Kubitza
06:48 PM Revision 7757: Added planning/milestones/BIEN Milestone – feedback.docx
Aaron Marcuse-Kubitza
06:46 PM Revision 7756: Added schemas/VegBIEN/planning/taxonomy/Nomenclature_excerpt.ppt
Aaron Marcuse-Kubitza
06:44 PM Revision 7755: Added planning/use_cases/
Aaron Marcuse-Kubitza
06:42 PM Revision 7754: Added planning/meetings/2012-11-26~30.working_group/
Aaron Marcuse-Kubitza
06:41 PM Revision 7753: Added schemas/BIEN2/Guide_to_the_BIEN2_database.pdf
Aaron Marcuse-Kubitza
06:39 PM Revision 7752: Added planning/meetings/2011-10-24~28.working_group/
Aaron Marcuse-Kubitza
06:36 PM Revision 7751: Added schemas/VegBIEN/planning/BIEN_OctWG-Whtbrd*.jpg
Aaron Marcuse-Kubitza
06:32 PM Revision 7750: Added schemas/BIEN2/
Aaron Marcuse-Kubitza
06:29 PM Revision 7749: Added schemas/VegBIEN/
Aaron Marcuse-Kubitza
06:20 PM Revision 7748: Moved schemas/vegbank.* to schemas/VegBank/
Aaron Marcuse-Kubitza
06:15 PM Revision 7747: Added planning/milestones/BIEN3 - Progress since the 2011 working group.docx
Aaron Marcuse-Kubitza
06:13 PM Revision 7746: planning/meetings/: Added Martha's conference call notes
Aaron Marcuse-Kubitza
06:06 PM Revision 7745: Moved planning/milestones.doc to planning/milestones/
Aaron Marcuse-Kubitza
06:04 PM Revision 7744: Moved planning/timeline.* to timeline/
Aaron Marcuse-Kubitza
06:01 PM Revision 7743: Renamed to_do/ to planning/
Aaron Marcuse-Kubitza
05:53 PM Revision 7742: Added schemas/VegCore/
Aaron Marcuse-Kubitza
05:53 PM Revision 7741: Added schemas/DwC/
Aaron Marcuse-Kubitza
05:35 PM Revision 7740: Added schemas/BIEN_logo.png from the bien.nceas.ucsb.edu website
Aaron Marcuse-Kubitza
05:29 PM Revision 7739: Renamed schemas/BIEN_logo.png to BIEN_logo.small.png
Aaron Marcuse-Kubitza
05:10 PM Revision 7738: Added inputs/SALVIAS/_archive/salvias_bien_19nov2009.tar.gz.url, .md5
Aaron Marcuse-Kubitza
04:57 PM Revision 7737: Added inputs/REMIB/_archive/remib_raw.0.header.specimens.txt
Aaron Marcuse-Kubitza
04:53 PM Revision 7736: Added inputs/CTFS/_src/ctfs_extract.tab.url, .md5
Aaron Marcuse-Kubitza
04:48 PM Revision 7735: Added inputs/NVS/_src/NVS_VegX.XSLT, _src/VegX/
Aaron Marcuse-Kubitza
04:43 PM Revision 7734: Added inputs/NVS/_src/_README.TXT
Aaron Marcuse-Kubitza
04:41 PM Revision 7733: Added inputs/TurboVeg/_src/DBASEDIC_rkp2011*.xlsx
Aaron Marcuse-Kubitza
04:40 PM Revision 7732: Added inputs/TurboVeg/_src/_README.TXT
Aaron Marcuse-Kubitza
03:45 PM Revision 7731: inputs/import.stats.xls: Updated import times
Aaron Marcuse-Kubitza
03:36 PM Revision 7730: README.TXT: Full database import: Don't run backups/fix_perms with sudo because owners are not rsynced to jupiter, so fixing them first is unnecessary. (It's not necessary to change the file owner to bien at any other time, either, because commands are not run as the bien user. It's in fact better to leave the file owner as yourself, to allow rsync to set the modification times.)
Aaron Marcuse-Kubitza
03:24 PM Revision 7729: README.TXT: Full database import: Added steps to set $version in all vegbiendev terminals
Aaron Marcuse-Kubitza
03:00 PM Revision 7728: lib/common.Makefile: rsync: $(rsync): Also exclude .lk* lock files
Aaron Marcuse-Kubitza
02:57 PM Revision 7727: README.TXT: Full database import: Updated log file sync steps for syncing via jupiter
Aaron Marcuse-Kubitza
02:56 PM Revision 7726: inputs/BRIT/specimen_flat/test.xml.ref: Updated inserted row count
Aaron Marcuse-Kubitza

02/26/2013

08:27 PM Revision 7725: schemas/py_functions.sql: Added _date(year integer, month integer, day integer)
Aaron Marcuse-Kubitza
08:08 PM Revision 7724: inputs/BRIT/specimen_flat/postprocess.sql: Added primary key
Aaron Marcuse-Kubitza
07:40 PM Revision 7723: schemas/functions.sql: Added _dms_to_dd() that takes text arguments and casts them to the appropriate type
Aaron Marcuse-Kubitza
07:11 PM Revision 7722: inputs/HVAA/Specimen/map.csv: Remapped fieldNotes to UNUSED. This also fixes a bug where fieldNotes was colliding with other occurrenceRemarks inputs because it was missing a filter.
Aaron Marcuse-Kubitza
06:46 PM Revision 7721: schemas/vegbien.sql: analytical_specimen/analytical_plot: Added occurrenceRemarks
Aaron Marcuse-Kubitza
06:32 PM Revision 7720: inputs/REMIB/Specimen/map.csv: Mapping NULL-equivalent lat/long to NULL: Fixed bug where need special *=* mapping to pass through values not in the map
Aaron Marcuse-Kubitza
06:30 PM Revision 7719: inputs/REMIB/Specimen/map.csv: Map NULL-equivalent text fields (country, state, etc.) to NULL
Aaron Marcuse-Kubitza
06:23 PM Revision 7718: inputs/REMIB/Specimen/map.csv: Map NULL-equivalent lat/long to NULL
Aaron Marcuse-Kubitza
05:59 PM Revision 7717: inputs/SpeciesLink/Specimen/map.csv: dwc_dwcore_Remarks: Remapped to occurrenceRemarks (it had been incorrectly automapped to taxonRemarks). Combined occurrenceRemarks inputs using /_alt.
Aaron Marcuse-Kubitza
05:47 PM Revision 7716: mappings/VegCore.htm: Regenerated from wiki. Remapped organismNotes to be a synonym of occurrenceRemarks, since notes on an organism are more generally notes on an occurrence.
Aaron Marcuse-Kubitza
05:28 PM Revision 7715: inputs/HVAA/Specimen/map.csv: Remapped dynamicProperties to occurrenceRemarks because it contains occurrenceRemarks data
Aaron Marcuse-Kubitza
05:23 PM Revision 7714: schemas/vegbien.sql: analytical_stem_view: Added occurrenceRemarks
Aaron Marcuse-Kubitza
05:19 PM Revision 7713: mappings/VegCore-VegBIEN.csv: Mapped occurrenceRemarks
Aaron Marcuse-Kubitza
04:53 PM Revision 7712: mappings/VegCore-VegBIEN.csv: Only use one of collection and collectionURL instead of concatenating them when both are provided (as requested by Brad <https://projects.nceas.ucsb.edu/nceas/projects/bien/wiki/Spot-checking#ACAD>)
Aaron Marcuse-Kubitza
04:53 PM Revision 7711: mappings/VegCore-VegBIEN.csv: _avg(): Use numeric param names to work with SQL functions
Aaron Marcuse-Kubitza
04:52 PM Revision 7710: mappings/VegCore-VegBIEN.csv: _avg(): Use numeric param names to work with SQL functions
Aaron Marcuse-Kubitza
04:44 PM Revision 7709: inputs/FIA/_src/_README.TXT: Added e-mail from Brad Boyle on 2013-2-8
Aaron Marcuse-Kubitza
04:42 PM Revision 7708: inputs/FIA/_src/: Added attachments from Brad's e-mail on 2013-2-8
Aaron Marcuse-Kubitza
04:37 PM Revision 7707: xml_func.py: Removed no longer used _range(). Use SQL functions._range() instead.
Aaron Marcuse-Kubitza
04:34 PM Revision 7706: schemas/functions.sql: Added _range()
Aaron Marcuse-Kubitza
04:30 PM Revision 7705: xml_func.py: Removed no longer used _avg(). Use SQL functions._avg() instead.
Aaron Marcuse-Kubitza
04:29 PM Revision 7704: schemas/functions.sql: Added _avg()
Aaron Marcuse-Kubitza
04:27 PM Revision 7703: mappings/VegCore-VegBIEN.csv: _avg(): Use numeric param names to work with SQL functions
Aaron Marcuse-Kubitza
04:15 PM Revision 7702: schemas/functions.sql: _dms_to_dd(text): Corrected concatenated form to interpret last 5 digits as MMmmm = MM.mmm instead of MMSSS, per Tom Wendt's explanation that this form is "degrees/minutes/decimalminutes"
Aaron Marcuse-Kubitza
04:05 PM Revision 7701: inputs/MO/Specimen/postprocess.sql: Use :table instead of hardcoding "MO"."Specimen"
Aaron Marcuse-Kubitza
04:04 PM Revision 7700: inputs/MO/Specimen/postprocess.sql: Also remove frameshifted rows where InstitutionCode is NULL
Aaron Marcuse-Kubitza
03:44 PM Revision 7699: inputs/UNCC/Specimen/map.csv: herbarium->specimenIndexer: Replace UNCCD (UNCC database?) with UNCC
Aaron Marcuse-Kubitza
03:27 PM Revision 7698: schemas/functions.sql: functions._dms_to_dd(): Added support for DMS values without separators
Aaron Marcuse-Kubitza
03:09 PM Revision 7697: README.TXT: Datasource setup: Update vegbiendev: Removed command to run the tests on vegbiendev because vegbiendev no longer has an empty public schema for running tests on (the public schema instead contains the latest published import)
Aaron Marcuse-Kubitza
03:04 PM Revision 7696: Mapped inputs/BRIT/
Aaron Marcuse-Kubitza
02:36 PM Revision 7695: my2pg: Remove (0) after char
Aaron Marcuse-Kubitza
02:35 PM Revision 7694: my2pg: Remove any (#) after bytea
Aaron Marcuse-Kubitza
02:33 PM Revision 7693: my2pg: Translate binary to bytea
Aaron Marcuse-Kubitza
02:30 PM Revision 7692: README.TXT: Datasource setup: MySQL inputs: Editing _MySQL/*.make: Set $server to vegbiendev instead so the export command will work from any machine (though it will be faster from vegbiendev itself)
Aaron Marcuse-Kubitza
02:28 PM Revision 7691: inputs/BRIT/_MySQL/MySQL.*.sql.make: Fixed bug where needs to be set to just localhost and *--user* to bien instead
Aaron Marcuse-Kubitza
02:27 PM Revision 7690: README.TXT: Datasource setup: MySQL inputs: Editing _MySQL/*.make: Fixed bug where $server needs to be set to just localhost and *--user* to bien instead.
Aaron Marcuse-Kubitza
02:25 PM Revision 7689: lib/MySQL.*.sql.make: Added --user option to demonstrate how to set the DB user vs. the ssh user
Aaron Marcuse-Kubitza
02:19 PM Revision 7688: inputs/BRIT/_MySQL/MySQL.*.sql.make: Fixed bug where $server needs to be set to bien@localhost
Aaron Marcuse-Kubitza
02:18 PM Revision 7687: README.TXT: Datasource setup: MySQL inputs: Editing _MySQL/*.make: Specified that $server needs to be set to bien@localhost. Install the staging tables: Specified that for a MySQL .sql export, this must be done on vegbiendev.
Aaron Marcuse-Kubitza
02:15 PM Revision 7686: inputs/BRIT/: Added _MySQL/MySQL.*.sql.make
Aaron Marcuse-Kubitza
02:08 PM Revision 7685: README.TXT: Datasource setup: MySQL inputs: .sql exports: Run mysql_bien in the background
Aaron Marcuse-Kubitza
02:03 PM Revision 7684: README.TXT: Datasource setup: MySQL inputs: .sql exports: Added steps to sync the export to vegbiendev. Specified that the database for the MySQL export should be created on vegbiendev.
Aaron Marcuse-Kubitza
01:58 PM Revision 7683: README.TXT: Datasource setup: Update vegbiendev: Updated steps for syncing via jupiter
Aaron Marcuse-Kubitza
01:56 PM Revision 7682: Added inputs/BRIT/ (not yet translated to PostgreSQL)
Aaron Marcuse-Kubitza
01:35 PM Revision 7681: mappings/VegCore-VegBIEN.csv: Mapped latitude/longitude_DMS to coordinates.latitude_deg using new _dms_to_dd(text)
Aaron Marcuse-Kubitza
01:34 PM Revision 7680: mappings/VegCore-VegBIEN.csv: Mapped latitude/longitude_DMS to coordinates.latitude_deg using new _dms_to_dd(text)
Aaron Marcuse-Kubitza
01:29 PM Revision 7679: schemas/functions.sql: Added _dms_to_dd(text)
Aaron Marcuse-Kubitza
01:28 PM Revision 7678: schemas/functions.sql: _dms_to_dd(): Fixed bug where need to use 1 as the multiplier when dir is NULL
Aaron Marcuse-Kubitza
01:19 PM Revision 7677: schemas/functions.sql: _dms_to_dd(): Fixed bug where need to apply the sign multiplier *after* the components are added, because it applies to the DMS as a whole rather than just to the degree component
Aaron Marcuse-Kubitza
12:58 PM Revision 7676: README.TXT: Full database import: Before running `make inputs/upload`, added step to do ./fix_perms on local machine so that files are not uploaded to jupiter with incorrect permissions
Aaron Marcuse-Kubitza
12:57 PM Revision 7675: README.TXT: Full database import: Before running `make inputs/upload`, added step to do svn up on jupiter so that unversioned files are not uploaded before their versioned equivalents are checked out from svn (requiring each file to be manually svn-added before the svn up can proceed)
Aaron Marcuse-Kubitza
12:30 PM Revision 7674: schemas/functions.sql: _dms_to_dd(): Added dir param to specify compass dir suffix, in order to calculate the sign correctly
Aaron Marcuse-Kubitza
11:05 AM Revision 7673: Moved compass_dir from schemas/vegbien.sql to schemas/functions.sql so it can be used by _dms_to_dd()
Aaron Marcuse-Kubitza
10:56 AM Revision 7672: Moved _dms_to_dd() from schemas/vegbien.sql to schemas/functions.sql so it can use functions schema functions
Aaron Marcuse-Kubitza
10:51 AM Revision 7671: schemas/vegbien.sql: Added compass_dir enum
Aaron Marcuse-Kubitza
10:42 AM Revision 7670: inputs/TEX/Specimen/map.csv: Remapped LAT/LONG_DEGREES,MINUTES,SECONDS to new latitude/longitude_deg,min,sec, allowing the DMS coordinates to be translated
Aaron Marcuse-Kubitza
10:40 AM Revision 7669: inputs/REMIB/Specimen/map.csv: Remapped lat/long_deg,min,sec to new latitude/longitude_deg,min,sec, allowing the DMS coordinates to be translated
Aaron Marcuse-Kubitza
10:39 AM Revision 7668: mappings/VegCore-VegBIEN.csv: latitude/longitude_deg,min,sec: Also mapped to the geoscrub coordinates entry
Aaron Marcuse-Kubitza
10:34 AM Revision 7667: mappings/VegCore-VegBIEN.csv: latitude/longitude_sec: Fixed name, which had been incorrectly automapped to verbatim*
Aaron Marcuse-Kubitza
10:31 AM Revision 7666: mappings/VegCore.htm: Regenerated from wiki. latitude/longitude: Moved *_deg,min,sec components after unit alternatives so that unit alternatives following them wouldn't incorrectly be interpreted as alternatives for the *components* instead of for the overall, unitless term.
Aaron Marcuse-Kubitza
10:22 AM Revision 7665: mappings/VegCore-VegBIEN.csv: Mapped latitude/longitude_deg,min,sec
Aaron Marcuse-Kubitza
10:15 AM Revision 7664: mappings/VegCore.htm: Regenerated from wiki. Merged coordinateUncertainty and coordinatePrecision, since datasources often use these interchangeably (the only difference is the DwC-recommended units).
Aaron Marcuse-Kubitza
10:08 AM Revision 7663: mappings/VegCore.htm: Regenerated from wiki. Source: Added authors. Renamed creator to firstPublisher. Moved dataOwner to Source as list. Coordinates: latitude/longitude, coordinatePrecision: Added units disambiguation. latitude/longitude_DMS: Added components. Global IDs: Changed "globally unique" tag to "global ID" and hyperlinked it to <https://projects.nceas.ucsb.edu/nceas/projects/bien/wiki/Global_IDs> (which now includes detailed examples of the various types of global IDs).
Aaron Marcuse-Kubitza
09:58 AM Revision 7662: inputs/SpeciesLink/Specimen/map.csv: conceptual_darwin_2003_1_0_CoordinatePrecision: Remapped to coordinateUncertaintyInMeters, assuming units based on the range and precision of values
Aaron Marcuse-Kubitza
09:53 AM Revision 7661: inputs/HVAA/Specimen/map.csv: Remapped coordinatePrecision to UNUSED
Aaron Marcuse-Kubitza
09:50 AM Revision 7660: inputs/ARIZ/omoccurrences/map.csv: Remapped coordinatePrecision to UNUSED
Aaron Marcuse-Kubitza
09:05 AM Revision 7659: schemas/vegbien.sql: Added _dms_to_dd()
Aaron Marcuse-Kubitza
08:49 AM Revision 7658: inputs/bien2_traits/TraitObservation/map.csv: Remapped Source to sourceType
Aaron Marcuse-Kubitza
08:43 AM Revision 7657: Added inputs/TEX/
Aaron Marcuse-Kubitza
08:42 AM Revision 7656: units.py: convert(): Support ranges
Aaron Marcuse-Kubitza
08:35 AM Revision 7655: Moved parse_range() from xml_func.py to units.py
Aaron Marcuse-Kubitza
08:02 AM Revision 7654: units.py: conversions: Added ft->m conversion
Aaron Marcuse-Kubitza
07:12 AM Revision 7653: src_map: Reopen stdin (/dev/stdin) in universal newlines mode to support files with \r as the line ending
Aaron Marcuse-Kubitza
06:51 AM Revision 7652: csv2db: Open input stream in universal newlines mode to support files with \r as the line ending
Aaron Marcuse-Kubitza

02/22/2013

11:15 PM Revision 7651: mappings/VegCore.htm: Regenerated from wiki. Moved Forming IDs section to separate Global IDs wiki page (https://projects.nceas.ucsb.edu/nceas/projects/bien/wiki/Global_IDs).
Aaron Marcuse-Kubitza
11:09 PM Revision 7650: input.Makefile: $(exts): Added .tab
Aaron Marcuse-Kubitza
10:29 PM Revision 7649: mappings/VegCore.htm: Regenerated from wiki. Moved preferred ID order to Forming IDs section as examples list. Made examples globally unique by prefixing the outer ID that scopes the value.
Aaron Marcuse-Kubitza
09:29 PM Revision 7648: mappings/VegCore.htm: Regenerated from wiki. Forming IDs: Added instructions for making a local ID global. Added note that these globally unique IDs are not the same as UUIDs. Forming preferred IDs: Reworded as a set of instructions rather than a definition. Clarified that this becomes the database primary key.
Aaron Marcuse-Kubitza
08:47 PM Revision 7647: mappings/VegCore.htm: Regenerated from wiki. Added section on forming preferred IDs, which are the ID which is most directly associated with the identified object.
Aaron Marcuse-Kubitza
08:17 PM Revision 7646: mappings/VegCore.htm: Regenerated from wiki. Documentation has been added on how to choose term names (https://projects.nceas.ucsb.edu/nceas/projects/bien/wiki/VegCore#Naming) and how to form globally unique ID values (https://projects.nceas.ucsb.edu/nceas/projects/bien/wiki/VegCore#Forming-IDs). Source and Specimen terms have been renamed to be self-explanatory and unambiguous (the DwC equivalents remain as synonyms). Short definitions of Source terms have been added to explain the differences between them. Source, Specimen, and Collection terms have been shortened according to the new instructions for choosing preferred term names (https://projects.nceas.ucsb.edu/nceas/projects/bien/wiki/VegCore#Naming).
Aaron Marcuse-Kubitza
08:00 PM Revision 7645: inputs/NY/Ecatalog_all/test.xml.ref: Updated inserted row count
Aaron Marcuse-Kubitza

02/21/2013

05:13 PM Revision 7644: schemas/vegbien.sql: analytical_stem_view: scientificName_verbatim: Use the taxonname (which omits the family and author) instead when available, at Brad's request
Aaron Marcuse-Kubitza
04:02 PM Revision 7643: schemas/vegbien.sql: Disambiguated taxonlabel.taxonomicname, taxonverbatim.taxonomicname, and taxonverbatim.taxonname
Aaron Marcuse-Kubitza
03:14 PM Revision 7642: schemas/vegbien.sql: taxon_trait_view: Use taxonverbatim.taxonname (the binomial) in preference to taxonverbatim.taxonomicname (the binomial+author) or taxonlabel.taxonomicname (the family+binomial+author), at Mark's request
Aaron Marcuse-Kubitza
02:05 PM Revision 7641: to_do/timeline.2013.xls: Obtain and load any additional new data: Extended beyond April because it's ongoing
Aaron Marcuse-Kubitza
02:02 PM Revision 7640: README.TXT: Full database import: On nimoy: Moved analytical_aggregate table creation right after analytical_aggregate..csv download because it can happen while the file is downloaded
Aaron Marcuse-Kubitza
11:13 AM Revision 7639: to_do/timeline.2013.xls: Reload existing data in need of refresh: Added additional cell because this is still in progress
Aaron Marcuse-Kubitza
11:11 AM Revision 7638: to_do/timeline.2013.xls: Placed check mark in completed cells. Multi-cell tasks that are partially complete have check marks in part of their cells to indicate this.
Aaron Marcuse-Kubitza

02/20/2013

05:35 PM Revision 7637: Added inputs/FIA/REF_*/
Aaron Marcuse-Kubitza
05:33 PM Revision 7636: inputs/FIA/_src/Makefile: Table bundling: Also unpack FIADB_REFERENCE/
Aaron Marcuse-Kubitza
05:13 PM Revision 7635: Added inputs/FIA/_src/FIADB_REFERENCE.ZIP.md5, .url
Aaron Marcuse-Kubitza
04:11 PM Revision 7634: schemas/vegbien.sql: provider_count_view: Group data by dataprovider and dataset, rather than by each separately, to satisfy the GBIF attribution requirements as described by Brad
Aaron Marcuse-Kubitza
02:00 PM Revision 7633: schemas/vegbien.sql: analytical_plot/specimen: Moved threatened_bien after growthForm as requested by Brad <https://projects.nceas.ucsb.edu/nceas/projects/bien/wiki/Spot-checking#SpeciesLink>
Aaron Marcuse-Kubitza
01:54 PM Revision 7632: schemas/vegbien.sql: analytical_stem_view: Moved threatened_bien after growthForm as requested by Brad <https://projects.nceas.ucsb.edu/nceas/projects/bien/wiki/Spot-checking#SpeciesLink>
Aaron Marcuse-Kubitza
01:53 PM Revision 7631: schemas/vegbien.sql: sync_analytical_stem_to_view(): Also drop and re-create new dependent views analytical_plot, analytical_specimen
Aaron Marcuse-Kubitza
01:20 PM Revision 7630: inputs/TEAM/V*/map.csv: Mapped plotArea_ha based on "1ha Plot Number" field name
Aaron Marcuse-Kubitza
01:12 PM Revision 7629: inputs/TEAM/V*/map.csv: Id: Remapped to individualObservationID
Aaron Marcuse-Kubitza
01:10 PM Revision 7628: mappings/VegCore.htm: Regenerated from wiki. verbatimScientificName, verbatimTaxonName have been merged into concatenatedScientificName, and ID has been disambiguated.
Aaron Marcuse-Kubitza
01:09 PM Revision 7627: inputs/*/*/map.csv: Automapped taxonConfidence
Aaron Marcuse-Kubitza
12:28 PM Revision 7626: schemas/vegbien.sql: analytical_stem_view: plotArea_ha: Fixed bug where need to use parent plot's area when subplot's area is not provided
Aaron Marcuse-Kubitza
12:18 PM Revision 7625: inputs/SALVIAS/plotMetadata_/create.sql: Added primary key
Aaron Marcuse-Kubitza
11:42 AM Revision 7624: inputs/NY/NY_DATA_20130212.~.utils.sql: pkey: Made table an enum instead of a text field so that Ecatalog_LatLong could be sorted before Ecatalog
Aaron Marcuse-Kubitza
11:32 AM Revision 7623: inputs/NY/Ecatalog_all/create.sql: primary key: Combine table, key into one value so that the primary key has a single column. This is necessary because multi-column primary keys are not supported by the import algorithm.
Aaron Marcuse-Kubitza
11:28 AM Revision 7622: inputs/NY/NY_DATA_20130212.~.utils.sql: Added pkey record type for use by Ecatalog_all's generated pkey
Aaron Marcuse-Kubitza
10:47 AM Revision 7621: inputs/NY/Ecatalog_all/create.sql: Add table column which scopes the key field. Add primary key to allow sorting by it. inputs/NY/Ecatalog_all/map.csv: Documented that UniqueNYInternalRecordNumber is sometimes duplicated.
Aaron Marcuse-Kubitza
09:20 AM Revision 7620: make_analytical_db: Wait for tnrs.make to acquire the tnrs.make lock if being run right after import
Aaron Marcuse-Kubitza
09:13 AM Revision 7619: schemas/vegbien.sql: analytical_stem_view: taxonName_matched: Fixed bug where need to use the matched_taxonverbatim instead of the datasource_taxonverbatim
Aaron Marcuse-Kubitza
08:58 AM Revision 7618: Moved wait on tnrs.make lock from import_all to make_analytical_db, so that running make_analytical_db for a one-time import also waits on the lock
Aaron Marcuse-Kubitza
08:56 AM Revision 7617: schemas/vegbien.sql: taxondetermination: taxondetermination_unique: Added determinationtype so that when the matched and accepted determinations are the same, they still both get created rather than the second one being removed due to the unique constraint
Aaron Marcuse-Kubitza
07:59 AM Revision 7616: schemas/vegbien.sql: analytical_specimen: Removed speciesBinomialWithMorphospecies because it doesn't apply to specimens
Aaron Marcuse-Kubitza
07:58 AM Revision 7615: schemas/vegbien.sql: Added analytical_plot view
Aaron Marcuse-Kubitza
07:55 AM Revision 7614: schemas/vegbien.sql: Added analytical_specimen view
Aaron Marcuse-Kubitza
07:44 AM Revision 7613: schemas/vegbien.sql: analytical_stem_view: Moved recordedBy, recordNumber before dateCollected as requested by Brad <https://projects.nceas.ucsb.edu/nceas/projects/bien/wiki/Spot-checking#ACAD>
Aaron Marcuse-Kubitza
07:40 AM Revision 7612: schemas/vegbien.ERD.mwb: Synced with schema
Aaron Marcuse-Kubitza
07:38 AM Revision 7611: schemas/vegbien.sql: Added reproductiveCondition
Aaron Marcuse-Kubitza
07:33 AM Revision 7610: mappings/VegCore-VegBIEN.csv: Mapped reproductiveCondition
Aaron Marcuse-Kubitza
07:28 AM Revision 7609: schemas/vegbien.sql: plantobservation: Added reproductivecondition
Aaron Marcuse-Kubitza
05:33 AM Revision 7608: mappings/VegCore.htm: Regenerated from wiki. matched*Fit_fraction has been renamed to matched*Confidence_fraction.
Aaron Marcuse-Kubitza
05:32 AM Revision 7607: inputs/.TNRS/public.unscrubbed_taxondetermination_view/map.csv: Updated for new mappings/VegCore.htm
Aaron Marcuse-Kubitza
05:10 AM Revision 7606: inputs/bien_web/observation/map.csv: Re-automapped taxonMorphospecies
Aaron Marcuse-Kubitza
05:08 AM Revision 7605: mappings/VegCore.htm: Regenerated from wiki. Data owner terms and taxon synonyms have been added, and morphospecies has been disambiguated.
Aaron Marcuse-Kubitza
04:51 AM Revision 7604: schemas/vegbien.sql: analytical_stem_view: Moved identifiedBy, dateIdentified, identificationRemarks right after the *_verbatim terms that they relate to, as requested by Brad <https://projects.nceas.ucsb.edu/nceas/projects/bien/wiki/Spot-checking#ACAD>
Aaron Marcuse-Kubitza
02:25 AM Revision 7603: schemas/vegbien.sql: analytical_stem_view: Use new concat_delim() instead of array_to_string() surrounded by NULLIF(..., '')
Aaron Marcuse-Kubitza
02:19 AM Revision 7602: schemas/vegbien.sql: Added concat_delim()
Aaron Marcuse-Kubitza
01:43 AM Revision 7601: schemas/vegbien.sql: analytical_stem_view: Removed deprecated taxonNameWithMorphospecies now that we have speciesBinomialWithMorphospecies
Aaron Marcuse-Kubitza
01:17 AM Revision 7600: schemas/vegbien.sql: analytical_stem_view: speciesBinomial: Added morphospecies suffix to create speciesBinomialWithMorphospecies
Aaron Marcuse-Kubitza
12:15 AM Revision 7599: inputs/import.stats.xls: Updated import times
Aaron Marcuse-Kubitza
12:06 AM Revision 7598: README.TXT: Full database import: Check that unscrubbed_taxondetermination_view returns no rows: Documented that this takes 90 s with LIMIT 1
Aaron Marcuse-Kubitza

02/19/2013

11:16 PM Revision 7597: schemas/vegbien.sql: _taxon_family_require_std(): Also allow non-aceae families accepted by TNRS
Aaron Marcuse-Kubitza
09:38 PM Revision 7596: Added inputs/SALVIAS/_archive/salvias_plots.*.sql.zip.md5
Aaron Marcuse-Kubitza
09:35 PM Revision 7595: Added inputs/VegBank/_archive/vegbank_for_bien.tar.gz.url
Aaron Marcuse-Kubitza
09:29 PM Revision 7594: Added inputs/U/UtrechtHerbarium.csv.tar.gz.url
Aaron Marcuse-Kubitza
09:28 PM Revision 7593: Added inputs/TEAM/_archive/ci-team_extract.tar.gz.url
Aaron Marcuse-Kubitza
09:27 PM Revision 7592: Added inputs/SpeciesLink/_archive/specieslink*.txt.gz.url
Aaron Marcuse-Kubitza
09:22 PM Revision 7591: Added inputs/REMIB/_archive/remib_raw.csv.tar.gz.url
Aaron Marcuse-Kubitza
09:19 PM Revision 7590: Added inputs/NY/NYSpecimenDataAmericas.csv.tar.gz.url
Aaron Marcuse-Kubitza
09:17 PM Revision 7589: Added inputs/NCU/_archive/NCU-NCSC_2010-02-12.csv.tar.gz.url
Aaron Marcuse-Kubitza
09:14 PM Revision 7588: Added inputs/MO/mo_digirexport.tar.gz.url
Aaron Marcuse-Kubitza
09:13 PM Revision 7587: Added inputs/Madidi/_archive/2010-1-2/madidi_plots_original_12jan2010.zip.url
Aaron Marcuse-Kubitza
09:11 PM Revision 7586: Added inputs/GBIF/gbif_extract.tar.gz.url
Aaron Marcuse-Kubitza
09:10 PM Revision 7585: Added inputs/FIA/fia_extract.tar.gz.url
Aaron Marcuse-Kubitza
09:08 PM Revision 7584: Added inputs/CVS/_archive/CVS-allTaxonOccurrences_2010-01-12.txt.tar.gz.url
Aaron Marcuse-Kubitza
09:04 PM Revision 7583: Added inputs/ARIZ/ARIZ_DiGIR_21012010.csv.tar.gz.url
Aaron Marcuse-Kubitza
08:55 PM Revision 7582: Added inputs/UNCC/Specimen/UNCC.csv.url, UNCC.csv.md5
Aaron Marcuse-Kubitza
08:45 PM Revision 7581: Added inputs/XAL/_src/digir.xml.gz.md5
Aaron Marcuse-Kubitza
08:39 PM Revision 7580: Added inputs/UNCC/_src/ with UNCC.csv.zip.md5
Aaron Marcuse-Kubitza
08:23 PM Revision 7579: Added inputs/SpeciesLink/_src
Aaron Marcuse-Kubitza

02/16/2013

08:24 AM Revision 7578: README.TXT: Datasource setup: MySQL inputs: .sql exports: Use new mysql_bien to connect to the MySQL DB created for the datasource
Aaron Marcuse-Kubitza
08:22 AM Revision 7577: Added mysql_bien, which runs a MySQL command on the local MySQL server
Aaron Marcuse-Kubitza
08:06 AM Revision 7576: Added inputs/GBIF/_src/GBIFPortalDB-2012-12-11.dump.md5 (md5sum of the expanded file)
Aaron Marcuse-Kubitza
08:02 AM Revision 7575: root Makefile: MySQL: mysql-Linux: Also install phpMyAdmin
Aaron Marcuse-Kubitza
08:01 AM Revision 7574: root Makefile: MySQL: mysql-Linux: Split apt-get dependencies into separate commands, like for other apt-get commands, to avoid having one failed dependency prevent the following dependencies from being installed
Aaron Marcuse-Kubitza
07:57 AM Revision 7573: root Makefile: MySQL: *mysql_users: Also add bien_read user
Aaron Marcuse-Kubitza
07:49 AM Revision 7572: root Makefile: MySQL: Renamed *mysql_user to *mysql_users because there can be multiple users
Aaron Marcuse-Kubitza
06:51 AM Revision 7571: inputs/: Added .md5 files for all .zip, .gz
Aaron Marcuse-Kubitza
06:47 AM Revision 7570: Added inputs/HVAA/Specimen/Herbario_occur_1360871068.csv.url
Aaron Marcuse-Kubitza
06:39 AM Revision 7569: lib/common.Makefile: rsync: $(rsync*): Use --no-group because the file group is different depending on the machine
Aaron Marcuse-Kubitza
06:10 AM Revision 7568: input.Makefile: SVN: $(_svnFilesGlob): Also add .md5 files. This allows svn to track where unversioned files should be in the directory tree.
Aaron Marcuse-Kubitza
06:07 AM Revision 7567: input.Makefile: SVN: $(_svnFilesGlob): .url, .pdf, and README.TXT in the top-level dir: Fixed bug where had extra / after brace expr
Aaron Marcuse-Kubitza
06:00 AM Revision 7566: input.Makefile: SVN: $(_svnFilesGlob): Also add .url, .pdf, and README.TXT in the top-level dir
Aaron Marcuse-Kubitza
05:53 AM Revision 7565: input.Makefile: SVN: $(_svnFilesGlob): Add .url, .pdf, and README.TXT files in all subdirs, not just _src
Aaron Marcuse-Kubitza
05:25 AM Revision 7564: lib/common.Makefile: remote server: Use jupiter instead of vegbiendev, to ensure that all files get uploaded there rather than only to vegbiendev. This involves adding an extra database import step to download the uploaded files from jupiter onto vegbiendev.
Aaron Marcuse-Kubitza
02:50 AM Revision 7563: inputs/FIA/_src/Makefile: all: Extract zip files before running tables target, because it requires the created dirs
Aaron Marcuse-Kubitza
02:40 AM Revision 7562: schemas/vegbien.ERD.mwb: Fixed table sizes
Aaron Marcuse-Kubitza
01:17 AM Revision 7561: Removed no longer used fix_permissions. Use root fix_perms instead.
Aaron Marcuse-Kubitza
01:16 AM Revision 7560: Added root fix_perms
Aaron Marcuse-Kubitza

02/15/2013

11:58 PM Revision 7559: Moved Checksums from backups/Makefile to lib/common.Makefile so all dirs (including inputs/) can use md5sum testing
Aaron Marcuse-Kubitza
11:08 PM Revision 7558: lib/common.Makefile: $(remote): Made remote basepath configurable in $(remote_basepath)
Aaron Marcuse-Kubitza
11:04 PM Revision 7557: lib/common.Makefile: Renamed $(src_server) to $(remote_host) and $(src_user) to $(remote_user) for clarity
Aaron Marcuse-Kubitza
10:16 PM Revision 7556: inputs/GBIF/: Added refresh metadata
Aaron Marcuse-Kubitza

02/14/2013

11:49 AM Revision 7555: Added inputs/HVAA/
Aaron Marcuse-Kubitza
11:14 AM Revision 7554: Added inputs/ARIZ/_archive
Aaron Marcuse-Kubitza
11:13 AM Revision 7553: inputs/ARIZ/: Removed previous data now that it has been refreshed
Aaron Marcuse-Kubitza
11:08 AM Revision 7552: inputs/ARIZ/: Mapped refresh
Aaron Marcuse-Kubitza
09:48 AM Revision 7551: Added inputs/ARIZ/import_order.txt
Aaron Marcuse-Kubitza
09:22 AM Revision 7550: Added inputs/NY/_archive/
Aaron Marcuse-Kubitza
09:20 AM Revision 7549: inputs/NY/: Removed tables from previous extract
Aaron Marcuse-Kubitza
08:59 AM Revision 7548: inputs/NY/: Mapped refresh
Aaron Marcuse-Kubitza
08:58 AM Revision 7547: inputs/*/*/VegBIEN.csv: Regenerated from mappings/VegCore-VegBIEN.csv
Aaron Marcuse-Kubitza
08:52 AM Revision 7546: Added inputs/NY/import_order.txt
Aaron Marcuse-Kubitza
02:51 AM Revision 7545: inputs/ARIZ/: Added SQL export for refresh
Aaron Marcuse-Kubitza
02:33 AM Revision 7544: my2pg.data: Translate indefinite (zero) months which have a definite day. This is unusual, but does appear in some data such as the ARIZ DB.
Aaron Marcuse-Kubitza
02:28 AM Revision 7543: my2pg.data: Translate indefinite dates (dates with 0 as the month or day)
Aaron Marcuse-Kubitza
02:23 AM Revision 7542: my2pg: Use my2pg.data to perform data-only replacements, instead of duplicating them in both my2pg and my2pg.data
Aaron Marcuse-Kubitza
02:01 AM Revision 7541: my2pg: named UNIQUE KEYs: Comment out the name because PostgreSQL requires it to be globally unique, but MySQL only requires it to be unique within the table
Aaron Marcuse-Kubitza
01:53 AM Revision 7540: my2pg: Translate UNIQUE KEYs instead of removing them
Aaron Marcuse-Kubitza
01:49 AM Revision 7539: my2pg*: Removed KEYs: Comment out the definition rather than removing it
Aaron Marcuse-Kubitza
01:45 AM Revision 7538: my2pg*: Remove FOREIGN KEYs because MySQL does not dump tables in dependency order, which prevents PostgreSQL from creating tables whose fkeys refer to a later table
Aaron Marcuse-Kubitza
01:33 AM Revision 7537: my2pg*: Replacing invalid table elements to remove them: Use a dummy CHECK constraint instead of a boolean field to avoid adding fields to the table. The elements can't always simply be removed because sed can't remove the trailing comma of the previous element, and removing the following comma doesn't work for the last element in the table.
Aaron Marcuse-Kubitza
12:11 AM Revision 7536: my2pg*: Replace '0000-00-00 00:00:00' with '-infinity'
Aaron Marcuse-Kubitza
12:04 AM Revision 7535: my2pg: Replace datetime with timestamp
Aaron Marcuse-Kubitza

02/13/2013

11:59 PM Revision 7534: my2pg: Remove COLLATE field attribute
Aaron Marcuse-Kubitza
11:56 PM Revision 7533: lib/MySQL.*.sql.make: Documented that $server user/host are for ssh, not the DB
Aaron Marcuse-Kubitza
11:55 PM Revision 7532: lib/MySQL.*.sql.make: Documented that $server can also contain a username (which will be used by ssh)
Aaron Marcuse-Kubitza
11:51 PM Revision 7531: my2pg_export: Use the --quick option to facilitate exporting large tables (it avoids retrieving all rows before outputting any of them)
Aaron Marcuse-Kubitza
11:00 PM Revision 7530: README.TXT: Datasource setup: Added instructions for MS Access databases
Aaron Marcuse-Kubitza
10:43 PM Revision 7529: README.TXT: Datasource setup: MySQL inputs: Added instruction to skip the Add input data for each table section
Aaron Marcuse-Kubitza
10:40 PM Revision 7528: inputs/NY/: Added SQL export for refresh
Aaron Marcuse-Kubitza

02/12/2013

01:08 PM Revision 7527: mappings/VegCore.htm: Regenerated from wiki. Brad's new DwC ID terms spreadsheet has now been added, and a number of the ID terms clarified, disambiguated, and recategorized. In particular, institutionCode has now been split into the custodialInstitutions and collectingInstitution, to differentiate between which institution has the specimen vs. stamped the specimen. This distinction is important because the catalogNumber, stamped on the specimen, is only unique within the collectingInstitution. Most datasources don't unambiguously specify which institution their institutionCode is referring to, so it has been assumed to be custodialInstitutions unless a data dictionary says otherwise (as is the case for UNCC). In addition, a MatchedTaxonDetermination table has been added with the *_matched fields from TNRS.
Aaron Marcuse-Kubitza
12:15 PM Revision 7526: inputs/CVS/observation_/map.csv: baseSaturation: Resolved ambiguous term
Aaron Marcuse-Kubitza
12:09 PM Revision 7525: mappings/Makefile: VegCore.vocab.csv: Ignore leading ? when sorting so that ambiguous terms sort alphabetically with other terms. This prevents terms from moving from their previous location when they become ambiguous.
Aaron Marcuse-Kubitza
12:07 PM Revision 7524: Added sort_ci to sort a spreadsheet, ignoring leading punctuation
Aaron Marcuse-Kubitza
12:05 PM Revision 7523: mappings/VegCore.vocab.csv: Changed line endings to \r\n in preparation for having a Python script run on it (which changes the line endings)
Aaron Marcuse-Kubitza
11:47 AM Revision 7522: mappings/Makefile: VegCore.vocab.csv: Added back ambiguous terms, so that the vocabulary contains all terms defined by VegCore, regardless of whether they are ambiguous or unambiguous terms
Aaron Marcuse-Kubitza
11:44 AM Revision 7521: mappings/Makefile: VegCore.vocab.csv: Added back synonyms, so that the vocabulary contains all terms defined by VegCore, regardless of whether they are synonyms or primary terms. This also prevents VegCore.vocab.csv from losing entries when terms are renamed, which made it difficult to verify that no terms were lost when refactoring.
Aaron Marcuse-Kubitza
05:50 AM Revision 7520: inputs/MO/Specimen/postprocess.sql: Remove frameshifted rows by detecting InstitutionCodes without any letters
Aaron Marcuse-Kubitza
04:59 AM Revision 7519: inputs/ARIZ/Specimen/map.csv: CollectorNumber/FieldNumber: Use /_first to map these identical fields to the same location
Aaron Marcuse-Kubitza
04:54 AM Revision 7518: inputs/ARIZ/Specimen/map.csv: Fixed bug where the column names for InstitutionCode and CollectionCode were reversed in the source data
Aaron Marcuse-Kubitza
04:14 AM Revision 7517: inputs/*/Specimen/map.csv for Canadensys sources: Remapped institutionID to UNUSED
Aaron Marcuse-Kubitza

02/09/2013

07:45 AM Revision 7516: mappings/VegCore.htm: Regenerated from wiki. The original*, accepted*, and verbatim* Taxon fields have now been moved to separate OriginalTaxonDetermination, AcceptedTaxonDetermination, and TaxonVerbatim tables.
Aaron Marcuse-Kubitza
06:52 AM Revision 7515: mappings/VegCore.htm: Regenerated from wiki
Aaron Marcuse-Kubitza
06:34 AM Revision 7514: mappings/VegCore.htm: Regenerated from wiki
Aaron Marcuse-Kubitza
04:08 AM Revision 7513: README.TXT: Maintenance: VegCore data dictionary: Replaced VegCore.*.csv with VegCore.htm because now that VegCore.*.csv are sorted alphabetically, they generally don't change when VegCore.htm changes
Aaron Marcuse-Kubitza
04:04 AM Revision 7512: mappings/VegCore.*.csv: Regenerated from wiki. A plain text label is now used for Replace with, which fixes a bug where the PRIVATE permalink pointed to its Replace with in realLatitude instead of its definition.
Aaron Marcuse-Kubitza
03:55 AM Revision 7511: redmine_synonyms: Support plain text labels other than Alternative, such as Replace with
Aaron Marcuse-Kubitza
03:13 AM Revision 7510: mappings/VegCore.*.csv: Regenerated from wiki. Alternatives now contain the "Alternative" label as plain text rather than as an image title, thus avoiding an HTML anchor conflict with the definition and allowing ambiguous terms to be placed before their alternatives as well as after.
Aaron Marcuse-Kubitza
03:11 AM Revision 7509: README.TXT: Maintenance: VegCore data dictionary: Updated VegCore.csv filename to VegCore.*.csv
Aaron Marcuse-Kubitza
02:57 AM Revision 7508: redmine_synonyms: Support alternatives which contain the "Alternative" label as plain text rather than as an image title. This is done to include the "Alternative" label in the HTML anchor and thus prevent the anchor from conflicting with the actual definition of the alternative (which would otherwise have the same anchor text). This allows ambiguous terms to be placed before their alternatives as well as after, because there won't be anchor conflicts that need to be resolved with careful ordering.
Aaron Marcuse-Kubitza
02:48 AM Revision 7507: mappings/VegCore.csv: Regenerated from wiki. Taxon terms with prefixes for other TaxonDeterminations now indicate the analogous term in an "analogous to" label next to the term
Aaron Marcuse-Kubitza
02:47 AM Revision 7506: mappings/VegCore.csv: Regenerated from wiki. Taxon terms with prefixes for other TaxonDeterminations now indicate the analogous term in an "analogous to" label next to the term
Aaron Marcuse-Kubitza
02:47 AM Revision 7505: mappings/VegCore.csv: Regenerated from wiki. Taxon terms with prefixes for other TaxonDeterminations now indicate the analogous term in an "analogous to" label next to the term
Aaron Marcuse-Kubitza
 

Also available in: Atom