/trunk - Changes - BIEN 3 - NCEAS Projects

root/trunk @ 14608

svn:ignore: extern

#	Date	Author	Comment
14608	08/28/2014 08:54 AM	Aaron Marcuse-Kubitza	fix: /README.TXT: to back up the local machine's hard drive: updated location of `screen` for added commands
14607	08/28/2014 08:53 AM	Aaron Marcuse-Kubitza	/README.TXT: added trailing / on dirs to make clear that they're dirs
14606	08/28/2014 08:40 AM	Aaron Marcuse-Kubitza	config/VirtualBox_VMs/vegbiendev/README.TXT: added instructions to configure the VM to support VirtualBox
14605	08/28/2014 08:22 AM	Aaron Marcuse-Kubitza	config/VirtualBox_VMs/vegbiendev/README.TXT: added instructions to retrieve the contents of the VM, with the VirtualBox changes added
14604	08/28/2014 07:47 AM	Aaron Marcuse-Kubitza	config/VirtualBox_VMs/vegbiendev/README.TXT: to retrieve the original contents of the backup from the VM: added steps to restore the correct VM snapshot
14603	08/28/2014 07:40 AM	Aaron Marcuse-Kubitza	config/VirtualBox_VMs/vegbiendev/README.TXT: also generate list of all the files whose permissions were changed since the backup, but which are extracted with their changed permissions instead of their original ones in the backup
14602	08/28/2014 07:05 AM	Aaron Marcuse-Kubitza	config/VirtualBox_VMs/vegbiendev/README.TXT: added instructions to retrieve the original contents of the backup from the VM
14601	08/28/2014 05:47 AM	Aaron Marcuse-Kubitza	fix: /README.TXT: to back up vegbiendev: also back up /home/aaronmk/bien/ (instead of just symlinking to the local copy), since this can be done space-efficiently with hardlinks. this ensures that the vegbiendev backup will not be modified when the local copy of bien/ is.
14600	08/28/2014 03:10 AM	Aaron Marcuse-Kubitza	lib/csvs.py: JsonReader: factored out row-dict-to-list into new row_dict_to_list_reader so that JSON-specific preprocessing is kept separate from the row format translation
14599	08/27/2014 03:17 PM	Aaron Marcuse-Kubitza	lib/csvs.py: added MultiFilter, which enables applying multiple filters by nesting
14598	08/26/2014 07:57 PM	Aaron Marcuse-Kubitza	lib/tnrs.py: single_tnrs_request(): JSON mode: implemented output of JSON data
14597	08/26/2014 07:53 PM	Aaron Marcuse-Kubitza	lib/tnrs.py: single_tnrs_request(): factored out wrapping in TnrsOutputStream, since this is done for both modes
14596	08/26/2014 07:47 PM	Aaron Marcuse-Kubitza	fix: lib/tnrs.py: JSON mode: TSV export columns: need to translate these to JSON column names before they can be used with the JSON data
14595	08/26/2014 07:44 PM	Aaron Marcuse-Kubitza	lib/csvs.py: added JsonReader, which reads parsed JSON data as row tuples
14594	08/26/2014 07:43 PM	Aaron Marcuse-Kubitza	lib/csvs.py: added row_dict_to_list(), which translates a CSV dict-based row to a list-based one
14593	08/26/2014 07:43 PM	Aaron Marcuse-Kubitza	lib/csvs.py: RowNumFilter: added support for filtering the header row as well
14592	08/26/2014 07:42 PM	Aaron Marcuse-Kubitza	lib/csvs.py: ColInsertFilter: added support for filtering the header row as well
14591	08/26/2014 05:12 PM	Aaron Marcuse-Kubitza	lib/csvs.py: InputRewriter: documented that this is also a stream (in addition to inheriting from StreamFilter)
14590	08/26/2014 05:11 PM	Aaron Marcuse-Kubitza	bugfix: lib/csvs.py: InputRewriter: accept a reader, as would be expected, instead of a custom stream whose lines are tuples
14589	08/26/2014 05:08 PM	Aaron Marcuse-Kubitza	fix: lib/sql_io.py: append_csv(): use new csvs.ProgressInputFilter instead of streams.ProgressInputStream(csvs.StreamFilter(__)), so that the input to csvs.InputRewriter is a reader, not a stream. this avoids the need for csvs.InputRewriter to accept a stream whose lines are tuples, instead of the expected reader.
14588	08/26/2014 05:02 PM	Aaron Marcuse-Kubitza	bugfix: inputs/input.Makefile: %/install: $(exportHeader) must come before postprocess because postprocess renames columns
14587	08/26/2014 04:50 PM	Aaron Marcuse-Kubitza	exports/: svn:ignore: added *.gz
14586	08/26/2014 04:49 PM	Aaron Marcuse-Kubitza	lib/csvs.py: added ProgressInputFilter, analogous to streams.ProgressInputStream
14585	08/26/2014 04:46 PM	Aaron Marcuse-Kubitza	lib/sql_io.py: added commented-out debug statement used to troubleshoot copy_expert() errors
14584	08/26/2014 04:45 PM	Aaron Marcuse-Kubitza	lib/dicts.py: added pair_keys(), pair_values()
14583	08/26/2014 04:15 PM	Aaron Marcuse-Kubitza	bugfix: lib/streams.py: CaptureStream: end_idx must also be > start_idx
14582	08/26/2014 04:07 PM	Aaron Marcuse-Kubitza	bugfix: inputs/input.Makefile: $(import_install_): need `set -o pipefail` to enable errexit
14581	08/26/2014 03:47 AM	Aaron Marcuse-Kubitza	/README.TXT: to back up files not in Time Machine: don't need to review diff because command is unidirectional
14580	08/26/2014 02:59 AM	Aaron Marcuse-Kubitza	fix: /README.TXT: to back up the local machine's hard drive: "repeat until only minimal changes" should refer to the first sync command
14579	08/26/2014 02:52 AM	Aaron Marcuse-Kubitza	inputs/.geoscrub/geoscrub_output/run: documented postprocess() rm=1 runtime (6 min)
14578	08/25/2014 10:17 PM	Aaron Marcuse-Kubitza	lib/tnrs.py: single_tnrs_request(): use_tnrs_export=False: need to obtain export columns
14577	08/25/2014 10:16 PM	Aaron Marcuse-Kubitza	lib/csvs.py: added header(stream)
14576	08/25/2014 10:16 PM	Aaron Marcuse-Kubitza	fix: lib/tnrs.py: single_tnrs_request(): need to `assert name_ct >= 1`, because with no names, TNRS hangs indefinitely
14575	08/25/2014 09:13 PM	Aaron Marcuse-Kubitza	bin/tnrs_client: added env var to configure use_tnrs_export
14574	08/25/2014 08:18 PM	Aaron Marcuse-Kubitza	/README.TXT: to back up vegbiendev: use inplace=1 to speed stopping and resuming transfer
14573	08/25/2014 07:54 PM	Aaron Marcuse-Kubitza	fix: /README.TXT: to back up the local machine's hard drive: removed --extended-attributes (after initial sync) because rsync apparently has to visit every file for this
14572	08/25/2014 07:35 PM	Aaron Marcuse-Kubitza	fix: /README.TXT: to back up the local machine's hard drive: also need --extended-attributes
14571	08/25/2014 07:34 PM	Aaron Marcuse-Kubitza	/README.TXT: to back up the local machine's hard drive: removed --delete-before now that that partition has been expanded
14570	08/25/2014 07:16 PM	Aaron Marcuse-Kubitza	fix: /README.TXT: to back up vegbiendev: exclude /var/lib/mysql.bak,postgresql.bak because the local machine doesn't need 2 copies of this information
14569	08/25/2014 07:05 PM	Aaron Marcuse-Kubitza	/README.TXT: to back up vegbiendev: removed no longer needed exclude of Dropbox subdir backup
14568	08/25/2014 06:58 PM	Aaron Marcuse-Kubitza	fix: /README.TXT: to back up vegbiendev: also need to do steps under Maintenance > "to synchronize vegbiendev, jupiter, and your local machine" because /home/aaronmk/bien is not synced here
14567	08/25/2014 06:52 PM	Aaron Marcuse-Kubitza	bugfix: /README.TXT: to back up vegbiendev: need `overwrite=1`
14566	08/25/2014 06:47 PM	Aaron Marcuse-Kubitza	/README.TXT: to back up vegbiendev: removed no longer needed exclude of Dropbox subdir backup
14565	08/25/2014 06:46 PM	Aaron Marcuse-Kubitza	/README.TXT: to back up the version history: don't also need this on vegbiendev because it's already on jupiter and the local machine
14564	08/25/2014 06:43 PM	Aaron Marcuse-Kubitza	bugfix: /README.TXT: to back up vegbiendev: need to include Postgres config files
14563	08/25/2014 06:24 PM	Aaron Marcuse-Kubitza	/README.TXT: to back up the local machine's hard drive: don't back up temp files: added /.fseventsd/
14562	08/25/2014 05:54 PM	Aaron Marcuse-Kubitza	fix: /README.TXT: to back up the local machine's hard drive: initial runtime: use range instead because some of the later runtime might have been from the same files
14561	08/25/2014 05:52 PM	Aaron Marcuse-Kubitza	/README.TXT: to back up the local machine's hard drive: updated initial runtime to include additional transferred files (17 h)
14560	08/25/2014 05:36 PM	Aaron Marcuse-Kubitza	fix: /README.TXT: to back up the local machine's hard drive: need to use --delete-before because the backup partition is near capacity
14559	08/25/2014 05:34 PM	Aaron Marcuse-Kubitza	/README.TXT: to back up the local machine's hard drive: don't back up temp files such as /private/var/vm/*
14558	08/25/2014 05:30 PM	Aaron Marcuse-Kubitza	fix: /README.TXT: to back up the local machine's hard drive: back up most Dropbox/Postgres files before stopping processes, to minimize downtime
14557	08/21/2014 07:35 PM	Aaron Marcuse-Kubitza	bugfix: /README.TXT: to back up the local machine's hard drive: can't use ~ with --exclude
14556	08/21/2014 07:31 PM	Aaron Marcuse-Kubitza	fix: inputs/.geoscrub/geoscrub_output/postprocess.sql: map_geovalidity(): unscrubbable names should actually be geoinvalid, not geovalid=NULL, according to Brad
14555	08/21/2014 07:24 PM	Aaron Marcuse-Kubitza	/README.TXT: to back up the local machine's hard drive: back up the non-Dropbox, non-Postgres files separately to minimize the Dropbox and Postgres downtime
14554	08/21/2014 06:03 PM	Aaron Marcuse-Kubitza	/README.TXT: to back up the vegbiendev databases: don't need to review diff for these as it's always unidirectional
14553	08/21/2014 05:55 PM	Aaron Marcuse-Kubitza	/README.TXT: added instructions to back up vegbiendev
14552	08/21/2014 05:12 PM	Aaron Marcuse-Kubitza	fix: /README.TXT: to back up the local machine's hard drive: also need to repeat backup command until only minimal changes
14551	08/21/2014 05:11 PM	Aaron Marcuse-Kubitza	/README.TXT: to back up the local machine's hard drive: added step to stop Postgres
14550	08/21/2014 05:10 PM	Aaron Marcuse-Kubitza	bugfix: /README.TXT: to back up the local machine's hard drive: also need to stop Dropbox
14549	08/21/2014 05:06 PM	Aaron Marcuse-Kubitza	/README.TXT: to back up the local machine's settings: added step to remove .DS_Store
14548	08/21/2014 04:47 PM	Aaron Marcuse-Kubitza	fix: /README.TXT: to back up the local machine's settings: Dropbox: should not run with `del=`, because the backup should be an exact replica
14547	08/21/2014 04:25 PM	Aaron Marcuse-Kubitza	backups/TNRS.*: removed no longer needed old TNRS backups, which are part of the respective full-database backups in any case
14546	08/21/2014 02:57 PM	Aaron Marcuse-Kubitza	added config/phpMyAdmin/ symlink to schemas/VegCore/phpMyAdmin/
14545	08/21/2014 12:40 PM	Aaron Marcuse-Kubitza	bugfix: lib/sh/archives.sh: compress(): don't include dir prefix in zip archive
14544	08/21/2014 12:40 PM	Aaron Marcuse-Kubitza	lib/sh/util.sh: cd(): use echo_run instead of a manual echo_cmd call
14543	08/21/2014 12:35 PM	Aaron Marcuse-Kubitza	fix: lib/sh/util.sh: cd(): indent after running cd rather than before
14542	08/21/2014 12:32 PM	Aaron Marcuse-Kubitza	lib/sh/util.sh: cd(): support rebasing path vars for the new dir
14541	08/21/2014 11:51 AM	Aaron Marcuse-Kubitza	bugfix: lib/sh/archives.sh: compress(): need to use zip's path syntax to avoid the file in the archive being named "-"
14540	08/21/2014 08:56 AM	Aaron Marcuse-Kubitza	lib/tnrs.py: added option to avoid using TNRS's TSV export feature, which currently returns incorrect selected matches (vegpath.org/issues/943). this has been implemented up through the GWT/JSON decoding.
14539	08/21/2014 08:50 AM	Aaron Marcuse-Kubitza	lib/tnrs.py: added gwt_decode()
14538	08/21/2014 08:49 AM	Aaron Marcuse-Kubitza	lib/strings.py: added unesc_quotes() and helper functions
14537	08/21/2014 08:49 AM	Aaron Marcuse-Kubitza	lib/strings.py: added json_decode()
14536	08/21/2014 08:38 AM	Aaron Marcuse-Kubitza	/README.TXT: To re-run geoscrubbing: updated runtimes
14535	08/21/2014 08:25 AM	Aaron Marcuse-Kubitza	exports/*_GBIF.csv.run: documented compress_() runtime (20 min-1 h)
14534	08/20/2014 11:12 PM	Aaron Marcuse-Kubitza	lib/runscripts/extract.run: export_(): also compress created file
14533	08/20/2014 11:11 PM	Aaron Marcuse-Kubitza	lib/sh/archives.sh: added compress(), expand(), which handle compression of individual files
14532	08/20/2014 04:26 PM	Aaron Marcuse-Kubitza	bugfix: inputs/input.Makefile: sql/install: ";" for commands inside $(if) blocks need to be inside the $(if) block, too, because otherwise there will be dangling ";" without a statement (bash does not support empty statements containing just ";")
14531	08/20/2014 11:29 AM	Aaron Marcuse-Kubitza	/README.TXT: Full database import: converted database commands to command-line commands to make them easier to run
14530	08/20/2014 11:28 AM	Aaron Marcuse-Kubitza	/README.TXT: Full database import: converted database commands to command-line commands to make them easier to run
14529	08/20/2014 10:55 AM	Aaron Marcuse-Kubitza	web/links/index.htm: updated to Firefox bookmarks: added instructions for how to enable automatic restart on power loss for the UPS (which isn't accessible in the GUI)
14528	08/19/2014 05:13 PM	Aaron Marcuse-Kubitza	fix: schemas/util.sql: contained_within_approx(point geocoord, region postgis.geography): use util.geography() instead of implicit cast to suppress "Coordinate values were coerced into range [-180 -90, 180 90] for GEOGRAPHY" NOTICEs
14527	08/19/2014 05:10 PM	Aaron Marcuse-Kubitza	schemas/util.sql: added geography(util.geocoord), which suppresses "Coordinate values were coerced into range [-180 -90, 180 90] for GEOGRAPHY" NOTICEs
14526	08/19/2014 04:48 PM	Aaron Marcuse-Kubitza	exports/native_status_resolver.csv.run: updated export_() runtime (5 min, now that we're using the narrower New World criterion)
14525	08/19/2014 04:39 PM	Aaron Marcuse-Kubitza	fix: schemas/public_.sql: native_status_resolver: don't include rows with New World coordinates that don't also have New World country names, since the NSR only uses the country name
14524	08/19/2014 04:26 PM	Aaron Marcuse-Kubitza	schemas/public_.sql: native_status_resolver: removed rows with is_geovalid NULL, at Brad's request. note that this removes valid rows with standardized country names.
14523	08/19/2014 04:01 PM	Aaron Marcuse-Kubitza	exports/native_status_resolver.csv.run: updated export_() runtime (30 min)
14522	08/19/2014 03:28 PM	Aaron Marcuse-Kubitza	fix: schemas/public_.sql: native_status_resolver: added country IS NOT NULL filter requested by Brad
14521	08/19/2014 02:35 PM	Aaron Marcuse-Kubitza	fix: schemas/public_.sql: native_status_resolver: remove the id because this prevents SELECT DISTINCT from having the desired effect. instead, the results will be joined back using the other columns.
14520	08/19/2014 02:25 PM	Aaron Marcuse-Kubitza	exports/native_status_resolver.csv.run: upload_(): documented runtime (2.5 min)
14519	08/19/2014 02:18 PM	Aaron Marcuse-Kubitza	bugfix: exports/native_status_resolver.csv.run: upload_(): $live must be exported
14518	08/19/2014 02:15 PM	Aaron Marcuse-Kubitza	exports/native_status_resolver.csv.run: upload_(): use `live=1` instead for consistency with other invocations of put
14517	08/19/2014 02:14 PM	Aaron Marcuse-Kubitza	fix: exports/native_status_resolver.csv.run: upload_(): need `l=1`
14516	08/19/2014 02:12 PM	Aaron Marcuse-Kubitza	exports/native_status_resolver.csv.run: documented export_() runtime (45 min)
14515	08/19/2014 01:48 PM	Aaron Marcuse-Kubitza	exports/native_status_resolver.csv.run: added upload_() to get the file onto nimoy
14514	08/19/2014 01:25 PM	Aaron Marcuse-Kubitza	added exports/native_status_resolver.csv.run
14513	08/19/2014 01:23 PM	Aaron Marcuse-Kubitza	schemas/public_.sql: added native_status_resolver view, requested by Brad (wiki.vegpath.org/Data_requests)
14512	08/19/2014 11:47 AM	Aaron Marcuse-Kubitza	inputs/publishable datasources.xlsx: updated
14511	08/19/2014 08:37 AM	Aaron Marcuse-Kubitza	lib/tnrs.py: documentation about output of the retrieve step: added that this is also unusable because the array does not contain all the columns and contains no column names
14510	08/19/2014 07:54 AM	Aaron Marcuse-Kubitza	removed no longer used web/BIEN3/Redmine/main/. use Redmine/!__ instead.
14509	08/19/2014 07:52 AM	Aaron Marcuse-Kubitza	web/BIEN3/Redmine/issues/.htaccess: perform .. redirects using new ! prefix
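
Revisions 14594, 14595 and 14600 describe translating dict-based rows (such as parsed JSON records) into list-based rows, with the row format translation kept separate from any JSON-specific preprocessing. The following is only a minimal sketch of that idea, not the actual code in lib/csvs.py: the function signatures, the explicit `cols` argument, and the sample records are all assumptions made for illustration.

```python
def row_dict_to_list(row, cols):
    """Translate one dict-based row into a list ordered by cols.

    Hypothetical signature; the real lib/csvs.py helper may differ.
    Columns missing from the row become None.
    """
    return [row.get(col) for col in cols]


def row_dict_to_list_reader(dict_rows, cols):
    """Yield a header row, then each dict row translated to a list.

    Illustrative stand-in for a reader that wraps an iterable of dicts.
    """
    yield list(cols)
    for row in dict_rows:
        yield row_dict_to_list(row, cols)


# Made-up sample records standing in for parsed JSON data
parsed = [
    {'name': 'Quercus alba', 'score': 0.98},
    {'name': 'Acer rubrum'},  # 'score' is absent in this record
]
cols = ['name', 'score']
rows = list(row_dict_to_list_reader(parsed, cols))
```

Keeping the dict-to-list step in its own function means a JSON-specific reader only has to produce dicts, and the same translation can be reused for any other dict-based source.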
