# Date Author Comment
13020 04/01/2014 01:36 PM Aaron Marcuse-Kubitza

/README.TXT: Full database import: documented that CPU load warning e-mails can safely be ignored. they happen because the parallel imports use all the available cores.

13019 04/01/2014 01:31 PM Aaron Marcuse-Kubitza

fix: lib/common.Makefile: $(nice): use an increment of +10 instead of +5 because +5 still leaves the shell sluggish

13018 04/01/2014 01:29 PM Aaron Marcuse-Kubitza

lib/common.Makefile: added $(nice) and use it everywhere its definition is used

13017 04/01/2014 01:14 PM Aaron Marcuse-Kubitza

/README.TXT: Full database import: exiting `screen`: clarify that you must use `exit`, as Ctrl+D gets disabled to prevent accidental exits

13016 04/01/2014 12:47 PM Aaron Marcuse-Kubitza

/README.TXT: Full database import: added step to restart Postgres to free up any disk space used by temp tables from the last import (this is apparently not automatically reclaimed)

13015 04/01/2014 12:45 PM Aaron Marcuse-Kubitza

/Makefile: postgres_restart-Linux: documented that the manual running of the command is needed because for some reason, pg_ctl does not work when run inside make

13014 04/01/2014 12:43 PM Aaron Marcuse-Kubitza

fix: /Makefile: postgres_restart-Linux: added pause after telling the user the command to run

13013 04/01/2014 12:42 PM Aaron Marcuse-Kubitza

/Makefile: $(postgresReload-*): use postgres_restart for the postgres-restarting step

13012 04/01/2014 12:30 PM Aaron Marcuse-Kubitza

bugfix: /Makefile: postgres_restart: added separate Linux version that deals with Linux-specific issues (as in $(postgresReload-Linux))

13011 04/01/2014 12:15 PM Aaron Marcuse-Kubitza

/Makefile: added postgres_restart, since this is often invoked separately from the entire postgres_reload target

13010 04/01/2014 11:40 AM Aaron Marcuse-Kubitza

/README.TXT: Full database import: disk space: increased minimum requirement to 500GB (~200GB extra), as the import may use significant additional space for temp tables

13009 04/01/2014 11:37 AM Aaron Marcuse-Kubitza

/README.TXT: Full database import: documented that env vars set before invoking `screen` will be inherited by it, so these steps will work even if they come before `screen`

13008 04/01/2014 11:26 AM Aaron Marcuse-Kubitza

backups/TNRS.backup.md5: updated

13007 04/01/2014 11:23 AM Aaron Marcuse-Kubitza

/README.TXT: Full database import: added steps to set a custom version, if the auto-assigned one would cause a collision with the last import

13006 04/01/2014 11:08 AM Aaron Marcuse-Kubitza

/README.TXT: Full database import: `unset version`: documented that this is needed because it may have been set in the outer shell

13005 03/30/2014 07:54 PM Aaron Marcuse-Kubitza

fix: lib/sql_io.py: put_table(): don't warn if can't create pkey, because this just indicates that a set-returning function was used. this should get rid of the last of the confusing benign warnings in the test output.

13004 03/30/2014 07:53 PM Aaron Marcuse-Kubitza

fix: lib/sql.py: flatten(): don't warn if can't create pkey, because this just indicates that a set-returning function was used

13003 03/30/2014 07:52 PM Aaron Marcuse-Kubitza

lib/sql.py: run_query_into(): added add_pkey_warn param to support turning off "could not create unique index" warnings, which are sometimes benign (e.g. when using set-returning functions with column-based import)

13002 03/30/2014 06:52 PM Aaron Marcuse-Kubitza

/README.TXT: Full database import: disk space: updated schema size (315GB)

13001 03/30/2014 06:45 PM Aaron Marcuse-Kubitza

/README.TXT: Full database import: removed `up` on jupiter because this is done as part of "do steps under Maintenance > "to synchronize vegbiendev, ...""

13000 03/30/2014 06:44 PM Aaron Marcuse-Kubitza

/README.TXT: Full database import: moved "do steps under Maintenance > "to synchronize vegbiendev, ..."" outside of "On local machine" because these steps don't only take place on the local machine

12999 03/30/2014 06:41 PM Aaron Marcuse-Kubitza

/README.TXT: use `up` instead of `svn up --force` for consistency

12998 03/30/2014 06:40 PM Aaron Marcuse-Kubitza

fix: /README.TXT: always use `up` instead of `svn up` since this includes --force

12997 03/30/2014 06:39 PM Aaron Marcuse-Kubitza

/README.TXT: Full database import: removed unneeded `ssh -t vegbiendev.nceas.ucsb.edu exec sudo su - aaronmk` at beginning since this is performed again the first time it's needed

12996 03/30/2014 06:38 PM Aaron Marcuse-Kubitza

fix: /README.TXT: Full database import: removed erroneous line that resulted from a search-and-replace of connection commands in r12396. (it used to read "Follow the steps under Connecting to vegbiendev above, using jupiter instead". this step is now performed on the line below it.)

12995 03/30/2014 06:31 PM Aaron Marcuse-Kubitza

bin/make_analytical_db: removed remake_diff_tables() because this is now done for each datasource in inputs/input.Makefile

12994 03/30/2014 06:28 PM Aaron Marcuse-Kubitza

bugfix: schemas/vegbien.sql: schemas/vegbien.sql(): need to util.use_schema(schema_anchor) before initializing vars that use own-schema functions

12993 03/30/2014 06:12 PM Aaron Marcuse-Kubitza

inputs/input.Makefile: validate: redirect the output to the log, as for other import-related operations

12992 03/30/2014 06:08 PM Aaron Marcuse-Kubitza

inputs/input.Makefile: import: validate at the end of the import

12991 03/30/2014 06:02 PM Aaron Marcuse-Kubitza

inputs/input.Makefile: added new-style aggregating validations (`validate` target)

12990 03/30/2014 06:02 PM Aaron Marcuse-Kubitza

bin/make_analytical_db: removed no longer needed "${public}_validations" schema qualifier, now that it is in the search_path

12989 03/30/2014 06:00 PM Aaron Marcuse-Kubitza

fix: bin/vegbien_dest: added public_validations

12988 03/30/2014 05:41 PM Aaron Marcuse-Kubitza

added inputs/GBIF/_src/0001000-131106143450413.zip.header.txt, which is useful to see what fields will be available when we switch to the new GBIF export format

12987 03/30/2014 05:39 PM Aaron Marcuse-Kubitza

lib/sh/util.sh: removed end_try_subshell, which now does the same thing as end_try

12986 03/30/2014 05:38 PM Aaron Marcuse-Kubitza

fix: lib/sh/archives.sh: unzip(): support -p option, which pipes extracted data to stdout

12985 03/30/2014 05:11 PM Aaron Marcuse-Kubitza

added inputs/GBIF/_src/0001000-131106143450413.zip.header.txt.run

12984 03/30/2014 05:11 PM Aaron Marcuse-Kubitza

added lib/runscripts/extract_header.run

12983 03/30/2014 05:09 PM Aaron Marcuse-Kubitza

fix: lib/sh/make.sh: direct the user to use begin_target instead of set_make_vars (set_make_vars is now used by begin_target)

12982 03/30/2014 05:06 PM Aaron Marcuse-Kubitza

fix: lib/runscripts/util.run: to_top_file(): handle $_remake properly, without requiring deferred_check_target_exists to set to_file()'s flags

12981 03/30/2014 05:03 PM Aaron Marcuse-Kubitza

bugfix: lib/sh/util.sh: die(): usage: documented that if msg uses $(...), save_e is needed

12980 03/30/2014 04:59 PM Aaron Marcuse-Kubitza

bugfix: lib/sh/util.sh: already_exists_msg(): need to save_e, because new $(mk_hint) call resets $?

12979 03/30/2014 04:55 PM Aaron Marcuse-Kubitza

lib/sh/util.sh: die(): always errexit even if $e = 0, because die always indicates an error

12978 03/30/2014 04:53 PM Aaron Marcuse-Kubitza

lib/sh/util.sh: added rethrow!(), which always errexits, even if $e = 0

12977 03/30/2014 04:53 PM Aaron Marcuse-Kubitza

lib/sh/util.sh: rethrow(): also work in situations where $e is not set

12976 03/30/2014 04:50 PM Aaron Marcuse-Kubitza

lib/sh/util.sh: rethrow: made it a function since there is now no need for it to be an alias

12975 03/30/2014 04:47 PM Aaron Marcuse-Kubitza

lib/sh/util.sh: rethrow: removed `test "$e" != 0` since errexit only does anything if $e != 0

12974 03/30/2014 04:45 PM Aaron Marcuse-Kubitza

lib/sh/util.sh: removed separate rethrow_exit*, rethrow_subshell*, since they now do the same thing as rethrow*

12973 03/30/2014 04:42 PM Aaron Marcuse-Kubitza

lib/sh/util.sh: rethrow*!: use new errexit, which works in functions and subshells

12972 03/30/2014 04:38 PM Aaron Marcuse-Kubitza

lib/sh/util.sh: added errexit(), used in place of (exit "$1") because a bug in bash prevents subshells from triggering errexit

12971 03/30/2014 04:18 PM Aaron Marcuse-Kubitza

lib/sh/util.sh: added bool!()

12970 03/30/2014 03:08 PM Aaron Marcuse-Kubitza

fix: lib/sh/util.sh: redir(): need to indent before invoking an external command (not just in command__exec(), but for all redir() calls)

12969 03/29/2014 04:10 AM Aaron Marcuse-Kubitza

lib/sh/make.sh: with_rm(): documented that it only works inside a runscript target that starts w/ begin_target

12968 03/29/2014 04:06 AM Aaron Marcuse-Kubitza

*{.sh,run}: runscript targets: use begin_target instead of echo_func so the target name is properly echoed. note that this requires using with_rm so that $rm is properly propagated to applicable invoked targets. (previously, $rm was propagated to all invoked targets. note that with_rm only works inside a runscript target that starts with begin_target.)

12967 03/29/2014 03:58 AM Aaron Marcuse-Kubitza

lib/sh/make.sh: self_make(): renamed to with_rm() for clarity, since this is used only to propagate $rm, and does not also invoke a command with the same name as the current function, as the name might suggest

12966 03/28/2014 07:17 AM Aaron Marcuse-Kubitza

schemas/vegbien.sql: updated _specimens_01_count_of_total_records_specimens_in_source_db

12965 03/28/2014 07:10 AM Aaron Marcuse-Kubitza

validation/aggregating/specimens/qualitative_validations_specimens.sql: use taxonoccurrence instead of location as the table that all specimens should have, as decided in the 2014-03-27 conference call (wiki.vegpath.org/2014-03-27_conference_call#aggregating-validations)

12964 03/28/2014 07:03 AM Aaron Marcuse-Kubitza

lib/runscripts/util.run: support conventional main() method as well as `all` target

12963 03/28/2014 02:39 AM Aaron Marcuse-Kubitza

fix: inputs/*/*/map.csv: remapped occurrenceID-mapped fields to dataProviderRecordID when these were not globally unique DwC occurrenceIDs (http://rs.tdwg.org/dwc/terms/#occurrenceID)

12962 03/28/2014 02:34 AM Aaron Marcuse-Kubitza

fix: inputs/CTFS/AggregateObservation/map.csv: field mapped to occurrenceID: remapped to aggregateOrganismObservationID because these are not specimen occurrences

12961 03/28/2014 02:32 AM Aaron Marcuse-Kubitza

fix: mappings/VegCore-VegBIEN.csv: taxonoccurrence.sourceaccessioncode: need to populate from aggregateOrganismObservationID when only that is available

12960 03/28/2014 02:03 AM Aaron Marcuse-Kubitza

bugfix: inputs/NY/Ecatalog_all/map.csv: can't use CatalogNumber as pkey because it's not unique and not always populated. this fixes the NY NULL accessionNumbers bug (wiki.vegpath.org/Aggregating_validations_status#bugs).

12959 03/28/2014 01:31 AM Aaron Marcuse-Kubitza

/README.TXT: moved "to back up e-mails" and "to back up the version history" before settings backup so that the local backup of these is up to date when everything gets backed up

12958 03/28/2014 01:29 AM Aaron Marcuse-Kubitza

inputs/XAL/Specimen/header.csv: updated

12957 03/28/2014 12:45 AM Aaron Marcuse-Kubitza

/README.TXT: to synchronize vegbiendev, jupiter, and your local machine: backups/TNRS.backup: do this before the general sync so that any reverse sync that's needed won't include it

12956 03/28/2014 12:44 AM Aaron Marcuse-Kubitza

/README.TXT: to synchronize vegbiendev, jupiter, and your local machine: backups/TNRS.backup: use bin/sync_upload now that this works for rsync-ignored files

12955 03/28/2014 12:36 AM Aaron Marcuse-Kubitza

bugfix: lib/sh/sync.sh: don't unintentionally rsync-ignore explicitly-specified files

12954 03/28/2014 12:32 AM Aaron Marcuse-Kubitza

lib/sh/util.sh: filesystem: added is_*(), could_be_*()

12953 03/28/2014 12:31 AM Aaron Marcuse-Kubitza

lib/sh/util.sh: added contains_match()

12952 03/28/2014 12:31 AM Aaron Marcuse-Kubitza

lib/sh/util.sh: added ends_with()

12951 03/27/2014 11:13 PM Aaron Marcuse-Kubitza

fix: /README.TXT: to synchronize vegbiendev, jupiter, and your local machine: run `up` on all machines, not just jupiter, because all must be up-to-date to avoid extraneous diffs

12950 03/27/2014 11:11 PM Aaron Marcuse-Kubitza

bugfix: /README.TXT: to synchronize vegbiendev, jupiter, and your local machine: `svn up` on jupiter: need to use up alias because that adds --force

12949 03/27/2014 11:10 PM Aaron Marcuse-Kubitza

bugfix: /README.TXT: to synchronize vegbiendev, jupiter, and your local machine: added `svn up` on jupiter: needs to be in main dir (~/bien), not ~/Dropbox/svn/

12948 03/27/2014 11:08 PM Aaron Marcuse-Kubitza

/README.TXT: to synchronize vegbiendev, jupiter, and your local machine: added `svn up` on jupiter to avoid extraneous diffs when rsyncing

12947 03/27/2014 10:41 AM Aaron Marcuse-Kubitza

planning/workflow/bien3_architecture/stage_I.png, stages.png: synced to bien3_architecture.pptx

12946 03/27/2014 10:32 AM Aaron Marcuse-Kubitza

planning/workflow/bien3_architecture.pptx: stage I: clarified that the database input is intended to be a normalized input, and its corresponding output is intended to be denormalized

12945 03/27/2014 10:29 AM Aaron Marcuse-Kubitza

validation/aggregating/specimens/NY/qualitative_validations_source_db_NYBG.VegCore.sql: stage I: clarified that the database input is intended to be a normalized input, and its corresponding output is intended to be denormalized

12944 03/27/2014 09:02 AM Aaron Marcuse-Kubitza

bugfix: validation/aggregating/specimens/NY/qualitative_validations_source_db_NYBG.VegCore.sql: _specimens_16_list_distinct_specimen_descriptions: should use DISTINCT

12943 03/27/2014 09:01 AM Aaron Marcuse-Kubitza

validation/aggregating/specimens/qualitative_validations_specimens.sql: implemented _specimens_16_list_distinct_specimen_descriptions

12942 03/27/2014 09:00 AM Aaron Marcuse-Kubitza

validation/aggregating/specimens/qualitative_validations_specimens.sql: implemented _specimens_16_list_distinct_specimen_descriptions

12941 03/27/2014 08:53 AM Aaron Marcuse-Kubitza

validation/aggregating/specimens/qualitative_validations_specimens.sql: implemented _specimens_15_list_distinct_locality_descriptions

12940 03/27/2014 08:48 AM Aaron Marcuse-Kubitza

validation/aggregating/specimens/qualitative_validations_specimens.sql: implemented _specimens_09_list_of_unique_verbatim_author_taxa_with_genus

12939 03/27/2014 08:47 AM Aaron Marcuse-Kubitza

validation/aggregating/specimens/qualitative_validations_specimens.sql: implemented _specimens_08_count_of_unique_verbatim_author_taxa_with_genus

12938 03/27/2014 08:36 AM Aaron Marcuse-Kubitza

validation/aggregating/specimens/qualitative_validations_specimens.sql: implemented _specimens_05_list_of_verbatim_species_excluding_author

12937 03/27/2014 08:35 AM Aaron Marcuse-Kubitza

validation/aggregating/specimens/qualitative_validations_specimens.sql: implemented _specimens_04_count_of_unique_verbatim_species_without_author

12936 03/27/2014 08:23 AM Aaron Marcuse-Kubitza

validation/aggregating/specimens/qualitative_validations_specimens.sql: implemented _specimens_03_list_of_verbatim_families

12935 03/27/2014 08:18 AM Aaron Marcuse-Kubitza

validation/aggregating/specimens/qualitative_validations_specimens.sql: implemented _specimens_02_count_of_unique_verbatim_families

12934 03/27/2014 08:06 AM Aaron Marcuse-Kubitza

schemas/vegbien.ERD.mwb: regenerated exports

12933 03/27/2014 08:04 AM Aaron Marcuse-Kubitza

schemas/vegbien.sql: public_validations: added _specimens_01_count_of_total_records_specimens_in_source_db

12932 03/27/2014 07:35 AM Aaron Marcuse-Kubitza

validation/aggregating/specimens/qualitative_validations_specimens.sql: implemented _specimens_01_count_of_total_records_specimens_in_source_db

12931 03/27/2014 07:34 AM Aaron Marcuse-Kubitza

validation/aggregating/specimens/qualitative_validations_specimens.sql: added config statements for datasource and query planner

12930 03/27/2014 05:06 AM Aaron Marcuse-Kubitza

web/links/index.htm: updated to Firefox bookmarks: Firefox: added instructions for enabling security.password_lifetime and making all tabs load when the browser is opened

12929 03/27/2014 04:43 AM Aaron Marcuse-Kubitza

/README.TXT: Schema changes: manually apply schema changes to the live public schema: moved under "update mappings and staging table column names" because this is a necessary part of that step

12928 03/27/2014 04:43 AM Aaron Marcuse-Kubitza

/README.TXT: Schema changes: manually apply schema changes to the live public schema: moved under "update mappings and staging table column names" because this is a necessary part of that step

12927 03/27/2014 04:40 AM Aaron Marcuse-Kubitza

/README.TXT: Schema changes: changed "update staging table column names" to "update mappings and staging table column names"

12926 03/27/2014 04:13 AM Aaron Marcuse-Kubitza

fix: validation/aggregating/specimens/qualitative_validations_specimens.sql: use pg_dump's formatting for COMMENT ON to facilitate diffing against a pg_dump export of the DDL statements

12925 03/27/2014 04:07 AM Aaron Marcuse-Kubitza

validation/aggregating/specimens/NY/qualitative_validations_source_db_NYBG.VegCore.sql: removed DDL statements so that running the query file does not alter the database, using the steps at wiki.vegpath.org/Aggregating_validations_refactoring#remove-DDL-statements

12924 03/27/2014 04:01 AM Aaron Marcuse-Kubitza

validation/aggregating/specimens/NY/qualitative_validations_source_db_NYBG.VegCore.sql: updated to DB, which pg_dump-formats the views

12923 03/27/2014 03:57 AM Aaron Marcuse-Kubitza

validation/**.sql: replaced CREATE OR REPLACE VIEW with CREATE VIEW to match pg_dump output for diffing

12922 03/27/2014 03:36 AM Aaron Marcuse-Kubitza

added inputs/NY/validations*.sql*

12921 03/27/2014 03:34 AM Aaron Marcuse-Kubitza

fix: validation/aggregating/specimens/NY/qualitative_validations_source_db_NYBG.VegCore.sql: use pg_dump's formatting for COMMENT ON to facilitate diffing against a pg_dump export of the DDL statements