/ - Changes - BIEN 3 - NCEAS Projects

root @ 11951

#	Date	Author	Comment
11951	01/15/2014 08:11 AM	Aaron Marcuse-Kubitza	bin/boldify: made it idempotent
11950	01/15/2014 08:08 AM	Aaron Marcuse-Kubitza	bugfix: bin/boldify: fixed extended regular expression syntax, which doesn't support a \] inside [] (you instead have to put the ] right after the opening [^ )
11949	01/15/2014 07:59 AM	Aaron Marcuse-Kubitza	added bin/boldify, which makes Redmine links bold
11948	01/14/2014 09:30 PM	Aaron Marcuse-Kubitza	web/links/index.htm: updated to Firefox bookmarks. Quick links: removed page descriptions, which clutter up the quick links list
11947	01/14/2014 09:28 PM	Aaron Marcuse-Kubitza	web/links/index.htm: updated to Firefox bookmarks. Quick links: added link to meetings
11946	01/14/2014 09:14 PM	Aaron Marcuse-Kubitza	web/links/index.htm: updated to Firefox bookmarks. Quick links: reordered in the order they should apper to visitors to the BIEN links page, rather than to me in the bookmarks toolbar.
11945	01/14/2014 09:04 PM	Aaron Marcuse-Kubitza	web/links/index.htm: updated to Firefox bookmarks. updated favicons.
11944	01/14/2014 08:56 PM	Aaron Marcuse-Kubitza	web/links/index.htm: updated to Firefox bookmarks. added links for sPlot and iPlant open-sourcing.
11943	01/14/2014 08:34 PM	Aaron Marcuse-Kubitza	inputs/publishable datasources.xlsx: updated
11942	01/14/2014 08:31 PM	Aaron Marcuse-Kubitza	inputs/datasource_release_status.xlsx: renamed to `publishable datasources.xlsx` to match the spreadsheet title
11941	01/09/2014 01:38 AM	Aaron Marcuse-Kubitza	planning/meetings/BIEN conference call availability.xlsx: updated
11940	01/09/2014 12:31 AM	Aaron Marcuse-Kubitza	/README.TXT: to synchronize a Mac's settings with my testing machine's: added step to remove the downloaded Spam folder, because spam e-mails often contain viruses that would trigger clamscan
11939	12/20/2013 10:45 PM	Aaron Marcuse-Kubitza	fix: planning/timeline/timeline.xls: realigned legend
11938	12/20/2013 10:44 PM	Aaron Marcuse-Kubitza	planning/timeline/timeline.xls: hid previous weeks
11937	12/20/2013 10:43 PM	Aaron Marcuse-Kubitza	planning/timeline/timeline.xls: crossed out and hid completed tasks
11936	12/20/2013 10:41 PM	Aaron Marcuse-Kubitza	planning/timeline/timeline.xls: updated for progress
11935	12/20/2013 05:41 PM	Aaron Marcuse-Kubitza	web/links/index.htm: updated to Firefox bookmarks. open-sourcing BIEN: added links for VegBank's license (GPLv2), and how these terms apply to us (a diff of our changes is not GPL-ed under GPLv2, although it is claimed to be GPL-ed under GPLv3)
11934	12/20/2013 04:41 PM	Aaron Marcuse-Kubitza	inputs/VegBank/^taxon_observation.**.sample/create.sql, map.csv: added new project columns
11933	12/20/2013 04:31 PM	Aaron Marcuse-Kubitza	inputs/VegBank/taxon_observation.**/postprocess.sql: added the project table
11932	12/20/2013 04:25 PM	Aaron Marcuse-Kubitza	mapped inputs/VegBank/project/, which includes the projectName for attribution
11931	12/20/2013 02:56 PM	Aaron Marcuse-Kubitza	inputs/CVS/^taxon_observation.**.sample/create.sql, map.csv: added new project columns
11930	12/20/2013 02:44 PM	Aaron Marcuse-Kubitza	inputs/CVS/taxon_observation.**/postprocess.sql: added the project table
11929	12/20/2013 02:42 PM	Aaron Marcuse-Kubitza	inputs/CVS/project/map.csv: mapped stopDate->projectEndDate
11928	12/20/2013 02:35 PM	Aaron Marcuse-Kubitza	mapped inputs/CVS/project/, which includes the projectName for attribution
11927	12/20/2013 01:25 AM	Aaron Marcuse-Kubitza	inputs/VegBIEN/Redmine/svn/.htaccess: updated to use much faster direct repository URL rather than Redmine web interface, now that the repository itself is publicly accessible in addition to the Redmine view of it
11926	12/20/2013 01:18 AM	Aaron Marcuse-Kubitza	planning/timeline/timeline.xls: updated for progress
11925	12/20/2013 01:13 AM	Aaron Marcuse-Kubitza	planning/timeline/timeline.2013.xls: renamed to timeline.xls so that this can continue to be used for 2014 (leaving a symlink from the old filename to preserve permalinks)
11924	12/20/2013 12:28 AM	Aaron Marcuse-Kubitza	fix: inputs/TEX/Specimen*/map.csv, postprocess.sql: habitat: also placed in occurrenceRemarks so that this field gets parsed for growth form information, as requested by Brad (wiki.vegpath.org/TEX_validation#2013-2-26)
11923	12/19/2013 11:49 PM	Aaron Marcuse-Kubitza	fix: inputs/TEX/Specimen*/map.csv: mapped constant values for specimenHolderInstitutions, country. these have to be added with `rm=1 ./inputs/TEX/Specimen.../run postprocess`.
11922	12/19/2013 11:42 PM	Aaron Marcuse-Kubitza	bugfix: inputs/TEX/Specimen2/map.csv: mapped BARCODE to accessionNumber so that we have a unique ID for each row
11921	12/19/2013 11:11 PM	Aaron Marcuse-Kubitza	schemas/vegbien.sql: analytical_stem_view: scientificName_verbatim: don't use taxonverbatim.taxonname+author as the scientificName_verbatim if only the author is provided. (this lead to weird scientificName_verbatims that contain just the author.)
11920	12/17/2013 08:06 AM	Aaron Marcuse-Kubitza	inputs/datasource_release_status.xlsx: updated
11919	12/17/2013 07:28 AM	Aaron Marcuse-Kubitza	web/links/index.htm: updated to Firefox bookmarks. added links for fixing the "App Store is temporarily unavailable" error (turn on Spotlight) and modifying a running shell script (unlink it first).
11918	12/17/2013 05:47 AM	Aaron Marcuse-Kubitza	bugfix: bin/map: in_is_db: don't ignore errors when the table does not exist, because these prevent an errexit and allow an import to continue when a staging table is missing. suppressing this error had previously been necessary because metadata-only tables (Source/) used to not have installed staging tables, and the program had to react accordingly.
11917	12/16/2013 07:05 PM	Aaron Marcuse-Kubitza	inputs/CVS/^taxon_observation.**.sample/create.sql: added Mike Lee's additional plots used to validate confidentiality-related fields (wiki.vegpath.org/CVS_validation#plots-to-include)
11916	12/16/2013 06:00 PM	Aaron Marcuse-Kubitza	bugfix: inputs/CVS/^taxon_observation.**.sample/create.sql: include taxonName in the subset of columns that's imported for the validation, because it is _alt-ed with scientificName for forming the TNRS input name. this is unique to CVS, which is why it was not part of the validation subset copied from the VegBank subset.
11915	12/16/2013 05:46 PM	Aaron Marcuse-Kubitza	/README.TXT: Full database import: documented that you should always start with a clean shell, which does not have changes to the env vars. (there have been inexplicable bugs that went away after closing and reopening the terminal window.) note that running `exec bash` is not sufficient to reset the env vars.
11914	12/16/2013 04:58 PM	Aaron Marcuse-Kubitza	fix: lib/sh/util.sh: verbosity_min(): usage: clarified that '' is a special value that causes $verbosity to be overwritten to ''
11913	12/16/2013 04:45 PM	Aaron Marcuse-Kubitza	lib/runscripts/table.run: added test_() target and use it in remake_VegBIEN_mappings() (it would not be clear that remake_VegBIEN_mappings() runs the tests)
11912	12/16/2013 01:43 PM	Aaron Marcuse-Kubitza	bugfix: inputs/.TNRS/schema.sql: granted bien_read SELECT access to derived views as well as the core tnrs table
11911	12/15/2013 05:30 PM	Aaron Marcuse-Kubitza	updated inputs/datasource_release_status.xlsx
11910	12/15/2013 05:27 PM	Aaron Marcuse-Kubitza	added inputs/datasource_release_status.xlsx, export of Google spreadsheet at https://docs.google.com/spreadsheet/ccc?key=0ArZXrTAXd-TYdDRRb2RxYi11TWZrQVh5bVdKOURCeFE
11909	12/12/2013 08:57 AM	Aaron Marcuse-Kubitza	planning/timeline/timeline.2013.xls: updated for progress
11908	12/12/2013 08:35 AM	Aaron Marcuse-Kubitza	bugfix: schemas/vegbien.sql: location: use the place_id from the parent location when no place_id is specified. this fixes a bug in analytical_stem_view where the parent location's place_id was used because it was sometimes missing from the sublocation, but the parent place_id itself was sometimes missing instead if sublocations each had their own place information. this way, it is always available directly in the sublocation, populated from the parent location if needed.
11907	12/12/2013 08:27 AM	Aaron Marcuse-Kubitza	bugfix: schemas/vegbien.sql: location: added place_id which is autopopulated from the current locationplace. join on this in plot.**, to avoid a 1:many join when a location has multiple locationplaces.
11906	12/11/2013 11:10 PM	Aaron Marcuse-Kubitza	bugfix: schemas/vegbien.sql: locationevent_unique_within_parent_by_location unique index: need COALESCE around location_id since it's nullable
11905	12/11/2013 10:54 PM	Aaron Marcuse-Kubitza	fix: inputs/CVS/^taxon_observation..sample/: added _no_import because this table duplicates part of what's imported from taxon_observation.
11904	12/11/2013 10:42 PM	Aaron Marcuse-Kubitza	bugfix: inputs/VegBank/plot/: added _no_import because this table is left-joined and should not be imported separately
11903	12/11/2013 10:40 PM	Aaron Marcuse-Kubitza	bugfix: inputs/{.NCBI,CTFS}/*.src/: added _no_import because these tables are left-joined and should not be imported separately
11902	12/11/2013 09:56 PM	Aaron Marcuse-Kubitza	inputs/import.stats.xls: removed table names from datasources where only one table is imported
11901	12/11/2013 09:52 PM	Aaron Marcuse-Kubitza	fix: inputs/import.stats.xls: removed deleted tables from current import
11900	12/11/2013 09:51 PM	Aaron Marcuse-Kubitza	inputs/import.stats.xls: updated import times
11899	12/11/2013 07:56 PM	Aaron Marcuse-Kubitza	updated backups/TNRS.backup.md5
11898	12/11/2013 07:56 PM	Aaron Marcuse-Kubitza	added backups/vegbien.r11786.backup.md5
11897	12/11/2013 07:53 PM	Aaron Marcuse-Kubitza	/README.TXT: Full database import: backups: added step to download backup to local machine
11896	12/11/2013 07:45 PM	Aaron Marcuse-Kubitza	bugfix: /Makefile: install: need to run inputs/download in live mode so that the flat files are actually downloaded
11895	12/11/2013 07:43 PM	Aaron Marcuse-Kubitza	lib/common.Makefile: added %/live, for use with `make inputs/download`
11894	12/10/2013 07:44 AM	Aaron Marcuse-Kubitza	planning/timeline/timeline.2013.xls: rescheduled tasks
11893	12/10/2013 07:40 AM	Aaron Marcuse-Kubitza	planning/timeline/timeline.2013.xls: updated for progress
11892	12/10/2013 07:36 AM	Aaron Marcuse-Kubitza	/README.TXT: Full database import: In PostgreSQL: documented that the tables to check are located in the r# schema, not public
11891	12/10/2013 07:32 AM	Aaron Marcuse-Kubitza	planning/timeline/timeline.2013.xls: updated for progress
11890	12/10/2013 07:32 AM	Aaron Marcuse-Kubitza	planning/timeline/timeline.2013.xls: datasource validations: reordered datasources according to Brian Enquist's new validation order (wiki.vegpath.org/Spot-checking_validation_order)
11889	12/10/2013 07:10 AM	Aaron Marcuse-Kubitza	fix: schemas/vegbien.sql: analytical_specimen: added specimens-related columns that are in analytical_plot
11888	12/10/2013 06:35 AM	Aaron Marcuse-Kubitza	inputs/GBIF/raw_occurrence_record_plants/map.csv: row_num: remapped to plain *row_num, like the other datasources that have this field
11887	12/10/2013 06:31 AM	Aaron Marcuse-Kubitza	inputs/GBIF/raw_occurrence_record_plants/postprocess.sql: Remove institutions that we have direct data for: rerun time: noted that this is only fast after manual vacuuming of the table (to remove the deleted rows from the index). autovacuum apparently does not run, although it should.
11886	12/10/2013 05:18 AM	Aaron Marcuse-Kubitza	planning/timeline/timeline.2013.xls: hid previous weeks
11885	12/10/2013 05:18 AM	Aaron Marcuse-Kubitza	planning/timeline/timeline.2013.xls: added timespan dots ◦ for supertasks
11884	12/10/2013 05:15 AM	Aaron Marcuse-Kubitza	planning/timeline/timeline.2013.xls: legend: changed to movable text box to avoid needing to erase and repopulate the header columns with the legend cells
11883	12/10/2013 05:03 AM	Aaron Marcuse-Kubitza	planning/timeline/timeline.2013.xls: crossed out and hid completed tasks
11882	12/10/2013 04:58 AM	Aaron Marcuse-Kubitza	planning/timeline/timeline.2013.xls: updated for progress
11881	12/09/2013 07:24 PM	Aaron Marcuse-Kubitza	inputs/GBIF/raw_occurrence_record_plants/test.xml.ref: reran test, which added yearCollected/monthCollected/dayCollected
11880	12/09/2013 07:23 PM	Aaron Marcuse-Kubitza	inputs/CVS/plantConcept_/create.sql: documented runtime (3 min)
11879	12/09/2013 06:59 PM	Aaron Marcuse-Kubitza	inputs/CTFS/*.src/: added test.xml.ref
11878	12/09/2013 06:58 PM	Aaron Marcuse-Kubitza	inputs/CTFS/*.src/: added VegBIEN.csv
11877	12/09/2013 06:56 PM	Aaron Marcuse-Kubitza	bugfix: inputs/CTFS/TaxonOccurrence*/map.csv: things mapped to taxonObservationID: remapped to taxonOccurrenceID since taxonObservationID is not mapped to anything in VegBIEN (denormalized VegCore doesn't distinguish between taxon occurrences and taxon observations of them)
11876	12/09/2013 05:46 PM	Aaron Marcuse-Kubitza	bugfix: inputs/ARIZ/~.clean_up.sql: prevent "column already exists" errors when there is an input column of the same name as an output column
11875	12/09/2013 05:44 PM	Aaron Marcuse-Kubitza	bugfix: lib/runscripts/datasrc_dir.run: import(): don't run `sql/install` if the schema already exists, because this will try to rerun all the schema-creation queries. note that this idempotent functionality was not provided by the `make .../install` target that was previously used (idempotency is new with new-style import).
11874	12/09/2013 05:26 PM	Aaron Marcuse-Kubitza	bugfix: schemas/vegbien.sql: updated for renamed county_centroids column names
11873	12/09/2013 04:16 PM	Aaron Marcuse-Kubitza	inputs/.geoscrub/import_order.txt: added county_centroids so that it would be installed by new-style import
11872	12/09/2013 03:54 PM	Aaron Marcuse-Kubitza	bugfix: lib/runscripts/datasrc_dir.run: import(): can't run `datasrc_make reinstall` anymore because this now defers to the runscript for new-style import datasources (which was done so that `make .../install` properly reinstalls all the datasources). instead, call the applicable make targets manually (there are just 2 of them).
11871	12/09/2013 03:37 PM	Aaron Marcuse-Kubitza	inputs/FIA/TREE/run: documented import() runtime (1.5 h), which includes table cleanup runtime (1 h)
11870	12/09/2013 03:09 PM	Aaron Marcuse-Kubitza	bugfix: bin/pg_dump_limit: support errexit by ignoring the nonzero exit status that grep returns when it doesn't match anything
11869	12/09/2013 02:43 PM	Aaron Marcuse-Kubitza	inputs/GBIF/raw_occurrence_record_plants/run: updated import() runtime (same), documented table cleanup runtime (1.5 h)
11868	12/09/2013 02:38 PM	Aaron Marcuse-Kubitza	inputs/GBIF/raw_occurrence_record_plants/postprocess.sql: CREATE INDEX ... specimenHolderInstitutions: documented runtime (45 min)
11867	12/09/2013 02:28 PM	Aaron Marcuse-Kubitza	inputs/GBIF/raw_occurrence_record_plants/postprocess.sql: Remove institutions that we have direct data for: documented runtime (3.5 min)
11866	12/09/2013 02:27 PM	Aaron Marcuse-Kubitza	/README.TXT: Datasource setup: added steps to backup e-mails
11865	12/06/2013 07:46 AM	Aaron Marcuse-Kubitza	bugfix: inputs/CTFS/import_order.txt: added .src so that these would be installed under new-style import as well. this means that their columns will now be automapped, requiring the names to be renamed to VegCore names in /create.sql. note that VegCore taxonOccurrenceID has been renamed to taxonObservationID since this was last run.
11864	12/06/2013 06:56 AM	Aaron Marcuse-Kubitza	inputs/.geoscrub/run: documented import() runtime (20 min)
11863	12/06/2013 06:12 AM	Aaron Marcuse-Kubitza	bugfix: inputs/.NCBI/import_order.txt: added nodes.src, names.src so that these would be installed under new-style import as well. this means that their columns will now be automapped, requiring the names to be renamed to VegCore names in nodes/create.sql.
11862	12/06/2013 06:01 AM	Aaron Marcuse-Kubitza	fix: /Makefile: inputs/reinstall: commented out to avoid a cascade of "overriding commands for target" warnings. this will revert to the default uninstall, install sequence for this target rather than the simultaneous-reinstall optimization (which can still be invoked manually).
11861	12/06/2013 05:52 AM	Aaron Marcuse-Kubitza	lib/sh/local.sh: public_schema_exists(): use a higher log_level for pg_schema_exists, to avoid all the verbose output involved in running the query
11860	12/06/2013 05:44 AM	Aaron Marcuse-Kubitza	bugfix: lib/sh/local.sh: public_schema_exists(): can no longer use psql_script_vegbien for this, because using `SET search_path` (called by psql_script_vegbien) with a schema that does not exist no longer produces an error. instead, use new pg_schema_exists(), which uses a different command that does produce an error if the schema does not exist.
11859	12/06/2013 05:38 AM	Aaron Marcuse-Kubitza	lib/sh/db.sh: added pg_require_schema()
11858	12/06/2013 05:37 AM	Aaron Marcuse-Kubitza	lib/sh/util.sh: stderr2stdout(): documented that this redirects fd 2->1 and log_fd (but not back to 2)
11857	12/06/2013 05:34 AM	Aaron Marcuse-Kubitza	bugfix: lib/sh/util.sh: stderr2stdout() use `command` before tee, which re-filters log_fd so that stderr itself is also filtered. this allows log-filtering out an otherwise-confusing benign error when using e.g. stderr_matches().
11856	12/06/2013 04:31 AM	Aaron Marcuse-Kubitza	lib/sh/util.sh: added not(), for use in prefixing wrapped commands
11855	12/06/2013 04:14 AM	Aaron Marcuse-Kubitza	lib/sh/db.sh: added pg_schema_exists()
11854	12/06/2013 04:10 AM	Aaron Marcuse-Kubitza	lib/sh/util.sh: added stderr_matches()
11853	12/06/2013 03:59 AM	Aaron Marcuse-Kubitza	lib/sh/util.sh: documented that fds 2x/3x should not be used because we use these, as opposed to 1x which is used by the shell internally
11852	12/06/2013 03:57 AM	Aaron Marcuse-Kubitza	lib/sh/util.sh: added stdout_contains()

Project

General

Profile