# Date Author Comment
13988 07/11/2014 10:03 AM Aaron Marcuse-Kubitza

bugfix: bin/stop_imports: also need to include `bin/after_import`

13987 07/11/2014 09:56 AM Aaron Marcuse-Kubitza

inputs/.TNRS/Source/map.csv: datasetURL: updated to indicate that this uses the TNRS dev server (the actual URL is private)

13986 07/11/2014 09:46 AM Aaron Marcuse-Kubitza

fix: inputs/VASCAN/Source/map.csv: observationGranularity should not be specimen because this datasource contains only taxa

13985 07/11/2014 09:13 AM Aaron Marcuse-Kubitza

bugfix: bin/import_all: now that always using log files to fix output clutter, need to delete created logs if logging is turned off

13984 07/11/2014 08:45 AM Aaron Marcuse-Kubitza

bugfix: bin/import_all: don't errexit if a background process is Ctrl-C'd

13983 07/11/2014 08:41 AM Aaron Marcuse-Kubitza

bugfix: bin/import_all: was run without initial "." test: don't exit nonzero because this will close the subshell

13982 07/11/2014 08:38 AM Aaron Marcuse-Kubitza

bugfix: bin/import_all: ensure that this is run in a subshell, which is needed so errexits don't close the terminal window

13981 07/11/2014 08:32 AM Aaron Marcuse-Kubitza

bin/import_all: documented that this must be run in a subshell (obtained by running `$0`)

13980 07/11/2014 08:25 AM Aaron Marcuse-Kubitza

bugfix: bin/import_all: need to always use log files for background processes

13979 07/11/2014 08:12 AM Aaron Marcuse-Kubitza

fix: bin/import_all: Source/import: don't use by_col=1 for this because it's slower for small #s of rows. by_col mode is no longer needed for metadata-only tables because these tables now have a single empty row so that they also work in row-based mode.

13978 07/11/2014 08:06 AM Aaron Marcuse-Kubitza

fix: bin/import_all: hidden srcs: use with_all for this to avoid needing to list every source, and to display the backgrounded command with the variables substituted

13977 07/11/2014 07:40 AM Aaron Marcuse-Kubitza

bin/import_all: TNRS, geoscrub: integrated into the list of metadata sources

13976 07/11/2014 07:39 AM Aaron Marcuse-Kubitza

bin/import_all: TNRS, geoscrub: use import rather than publish because the non-imported tables have now been excluded

13975 07/11/2014 07:34 AM Aaron Marcuse-Kubitza

fix: inputs/.geoscrub/geoscrub_output/: added _no_import because these tables are metadata that is used in the analytical DB. this is better than relying on bin/import_all not to import these.

13974 07/10/2014 07:25 PM Aaron Marcuse-Kubitza

fix: bin/import_all: updated for new metadata datasource names (see issue #940)

13973 07/10/2014 07:25 PM Aaron Marcuse-Kubitza

fix: config/VirtualBox_VMs/vegbiendev/README.TXT: updated for new metadata datasource names (see issue #940)

13972 07/10/2014 06:32 PM Aaron Marcuse-Kubitza

fix: inputs/IUCN/: renamed to inputs/.IUCN/ (see issue #940)

13971 07/10/2014 06:04 PM Aaron Marcuse-Kubitza

fix: inputs/newWorld/: renamed to inputs/.newWorld/ (see issue #940)

13970 07/10/2014 04:59 PM Aaron Marcuse-Kubitza

bugfix: inputs/IRMNG/: renamed to inputs/.IRMNG/ so that this comes before TNRS, which depends on it (this is a metadata datasource, so it can start with "."). part of issue #940.

13969 07/10/2014 03:52 PM Aaron Marcuse-Kubitza

config/VirtualBox_VMs/vegbiendev/README.TXT: to test the import: set up the VM: added steps to create a VM snapshot when done

13968 07/10/2014 03:37 PM Aaron Marcuse-Kubitza

config/VirtualBox_VMs/vegbiendev/README.TXT: `make`: documented runtime (15 min)

13967 07/10/2014 03:33 PM Aaron Marcuse-Kubitza

fix: config/VirtualBox_VMs/vegbiendev/README.TXT: removed non-applicable step about the MySQL root password, which is not used for this procedure

13966 07/10/2014 03:32 PM Aaron Marcuse-Kubitza

added config/VirtualBox_VMs/vegbiendev/README.TXT with instructions to test the import

13965 07/10/2014 12:17 PM Aaron Marcuse-Kubitza

inputs/GBIF/_MySQL/.rsync_ignore: don't exclude GBIFPortalDB-*.data.sql.gz, even though this is an intermediate file, because it's better to have a backup of it locally. this was excluded in r13316 (2014-4-24) to free up disk space on the local machine.

13964 07/10/2014 10:57 AM Aaron Marcuse-Kubitza

fix: inputs/Madidi/LocationObservation/postprocess.sql: parse Spanish dates

13963 07/10/2014 09:45 AM Aaron Marcuse-Kubitza

validation/completeness/BIENDatesPopulated.xlsx: updated: Martha added Priority column

13962 07/10/2014 08:41 AM Aaron Marcuse-Kubitza

schemas/util.sql: replace_words(): use map_words() to enable inlining the concatenated hstore, so that it doesn't need to be concatenated on each function call
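
Note on r13956-r13962: the word-mapping approach these commits describe (split text into words, translate each word through a lookup, then parse the result as a date) might look roughly like the Python sketch below. This is an illustration only, not the code in schemas/util.sql. The dictionary contents, the function bodies, and the assumed input format ("day de month de year") are hypothetical; only the function names are taken from the log.

```python
import re
from datetime import date

# Hypothetical lookup: Spanish month names to month numbers. "de" maps to
# an empty string so that it is dropped. The real spanish_date_words()
# contents are not shown in this log.
SPANISH_DATE_WORDS = {
    "enero": "1", "febrero": "2", "marzo": "3", "abril": "4",
    "mayo": "5", "junio": "6", "julio": "7", "agosto": "8",
    "septiembre": "9", "octubre": "10", "noviembre": "11",
    "diciembre": "12", "de": "",
}


def words(text):
    """Split text into its word tokens."""
    return re.findall(r"\w+", text)


def map_words(mapping, text):
    """Translate each word through mapping; unmapped words pass through."""
    mapped = (mapping.get(word.lower(), word) for word in words(text))
    return " ".join(word for word in mapped if word)


def date_from_spanish(text):
    """Parse a date written as 'day de month de year' (assumed format)."""
    day, month, year = (int(w) for w in map_words(SPANISH_DATE_WORDS, text).split())
    return date(year, month, day)
```

For example, `date_from_spanish("10 de julio de 2014")` first becomes the word list `10 7 2014` and is then assembled into a date. Keeping the lookup as a single constant mirrors the stated goal of not rebuilding the mapping on each call.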

13961 07/10/2014 08:39 AM Aaron Marcuse-Kubitza

schemas/util.sql: added map_words()

13960 07/10/2014 08:16 AM Aaron Marcuse-Kubitza

schemas/util.sql: added date_from_spanish()

13959 07/10/2014 08:13 AM Aaron Marcuse-Kubitza

schemas/util.sql: added replace_words()

13958 07/10/2014 08:09 AM Aaron Marcuse-Kubitza

schemas/util.sql: added words(text)

13957 07/10/2014 08:02 AM Aaron Marcuse-Kubitza

schemas/util.sql: spanish_date_words(): switched to using hstore because this is better for the algorithm that will translate these

13956 07/10/2014 07:20 AM Aaron Marcuse-Kubitza

schemas/util.sql: added spanish_date_words()

13955 07/10/2014 07:19 AM Aaron Marcuse-Kubitza

schemas/util.sql: added `replacement` type

13954 07/10/2014 04:09 AM Aaron Marcuse-Kubitza

bugfix: mk_set_relation_metadata(): need to include col comments as well so that these are re-created properly by recreate()

13953 07/10/2014 04:07 AM Aaron Marcuse-Kubitza

schemas/util.sql: added mk_set_comments(regclass)

13952 07/10/2014 04:06 AM Aaron Marcuse-Kubitza

schemas/util.sql: show_set_comment(regclass): renamed to mk_set_comment() to match other mk_set_comment() functions

13951 07/10/2014 04:01 AM Aaron Marcuse-Kubitza

schemas/util.sql: added mk_set_col_comments(regclass)

13950 07/10/2014 03:58 AM Aaron Marcuse-Kubitza

schemas/util.sql: added cols(regclass)

13949 07/10/2014 03:56 AM Aaron Marcuse-Kubitza

schemas/util.sql: added mk_set_comment(col_ref)

13948 07/10/2014 03:18 AM Aaron Marcuse-Kubitza

added validation/completeness/BIENDatesPopulated.xlsx from Google spreadsheets

13947 07/09/2014 02:50 PM Aaron Marcuse-Kubitza

fix: mappings/VegCore-VegBIEN.csv: mapped DwC year/month/day, which are split-date alternatives to eventDate. this fixes the missing eventDate in FIA.
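
Note on r13947: DwC year/month/day are separate fields that together stand in for eventDate. A minimal Python sketch of combining them is shown below. How the actual mapping treats partial dates is not stated in this log, so the function name `event_date` and the partial-date handling (year only, or year and month) are assumptions.

```python
def event_date(year=None, month=None, day=None):
    """Join split date parts into an ISO 8601 string.

    Returns None when there is no year. A missing month or day yields a
    shorter, partial date (an assumption, not behavior taken from the log).
    """
    if year is None:
        return None
    parts = [f"{int(year):04d}"]
    if month is not None:
        parts.append(f"{int(month):02d}")
        # A day is only meaningful when the month is also known.
        if day is not None:
            parts.append(f"{int(day):02d}")
    return "-".join(parts)
```

Under these assumptions, `event_date(2014, 7, 9)` gives a full date string, while `event_date(2014, 7)` gives a year-month string.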

13946 07/09/2014 02:45 PM Aaron Marcuse-Kubitza

bugfix: inputs/FIA/*/VegBIEN.csv: regenerated, replacing these with symlinks to the file used by new-style import

13945 07/09/2014 02:34 PM Aaron Marcuse-Kubitza

inputs/FIA/taxon_observation.**/test.xml.ref: updated

13944 07/09/2014 02:34 PM Aaron Marcuse-Kubitza

inputs/FIA/TREE/test.xml.ref: updated

13943 07/09/2014 02:29 PM Aaron Marcuse-Kubitza

inputs/FIA/REF_RESEARCH_STATION/test.xml.ref: updated

13942 07/09/2014 02:13 PM Aaron Marcuse-Kubitza

bugfix: /README.TXT: to synchronize vegbiendev, jupiter, and your local machine: updated to new TWiki/ location

13941 07/09/2014 02:12 PM Aaron Marcuse-Kubitza

bugfix: /README.TXT: to synchronize vegbiendev, jupiter, and your local machine: updated to new VegBIEN/ location

13940 07/09/2014 12:14 PM Aaron Marcuse-Kubitza

fix: inputs/bien2_traits/TraitObservation/map.csv: mapped VisitingDate to eventDate

13939 07/09/2014 12:05 PM Aaron Marcuse-Kubitza

fix: inputs/Madidi/LocationObservation/postprocess.sql: populated missing eventDate from PlotInventoryName (authorEventCode)

13938 07/09/2014 09:40 AM Aaron Marcuse-Kubitza

schemas/util.sql: mk_set_comment(table_ regclass, comment text): use new util.mk_set_comment(text, text)

13937 07/09/2014 09:38 AM Aaron Marcuse-Kubitza

schemas/util.sql: added mk_set_comment(col col_ref, comment text)

13936 07/09/2014 09:37 AM Aaron Marcuse-Kubitza

schemas/util.sql: added sql(col_ref)

13935 07/09/2014 09:35 AM Aaron Marcuse-Kubitza

schemas/util.sql: added mk_set_comment(on_ text, comment text)

13934 07/09/2014 07:53 AM Aaron Marcuse-Kubitza

schemas/util.sql: added comment(col_ref)

13933 07/09/2014 07:50 AM Aaron Marcuse-Kubitza

schemas/util.sql: added col_num(col_ref)

13932 07/09/2014 07:03 AM Aaron Marcuse-Kubitza

removed no longer needed inputs/.TNRS/grants.sql, since the grants in schema.sql are now being run

13931 07/09/2014 07:03 AM Aaron Marcuse-Kubitza

bugfix: inputs/input.Makefile: sql/install: schema.sql should not be passed through pg_dump_limit because it contains GRANT statements that need to be run

13930 07/09/2014 06:22 AM Aaron Marcuse-Kubitza

schemas/public_.sql: iucn_red_list_view: documented how to regenerate iucn_red_list from this

13929 07/08/2014 04:17 PM Aaron Marcuse-Kubitza

exports/2014-6-4.Iara_Lacher.reserve_prioritization.csv.run: documented runtime (7.5 min) and rowcount (3.5 million)

13928 07/08/2014 03:23 PM Aaron Marcuse-Kubitza

bugfix: schemas/public_.sql: 2014-6-4.Iara_Lacher.reserve_prioritization: fixed slowdown in materializing the view, which was caused by source__observation_type(), by replacing this with a LEFT JOIN as was done for is_threatened_iucn

13927 07/07/2014 07:33 AM Aaron Marcuse-Kubitza

schemas/util.sql: in_south_america(): optimized by using BETWEEN instead of util.contained_within__no_dateline(). this reduces the filter time for 2014-6-4.Iara_Lacher.reserve_prioritization by about 10%.
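
Note on r13927: a BETWEEN-style test reduces the containment check to two range comparisons on latitude and longitude. The Python sketch below shows the idea. The bounding-box numbers are rough, hypothetical values and are not taken from schemas/util.sql; the `Geocoord` field names and `in_box` are likewise assumptions.

```python
from typing import NamedTuple


class Geocoord(NamedTuple):
    latitude_deg: float
    longitude_deg: float


# Hypothetical rough bounding box, for illustration only.
SOUTH_AMERICA_BOX = {"south": -56.0, "north": 13.0, "west": -82.0, "east": -34.0}


def in_box(coord, box):
    """Range-comparison containment test (the equivalent of SQL BETWEEN).

    Only valid for boxes that do not cross the dateline (west <= east).
    """
    return (box["south"] <= coord.latitude_deg <= box["north"]
            and box["west"] <= coord.longitude_deg <= box["east"])
```

A point near La Paz, `Geocoord(-16.5, -68.15)`, falls inside this box, and a point near New York, `Geocoord(40.7, -74.0)`, does not. The simple comparison works here because the box does not span the dateline, which matches the restriction implied by the `__no_dateline` function it replaced.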

13926 07/07/2014 07:26 AM Aaron Marcuse-Kubitza

fix: schemas/public_.sql: 2014-6-4.Iara_Lacher.reserve_prioritization: is_threatened_iucn: generate this much faster by doing a LEFT JOIN, which can reuse the same in-memory hash index for every row
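
Note on r13926: the speedup described here comes from building one lookup structure and probing it for every row, rather than running a separate lookup per row. A loose Python analogy is sketched below. The column name `accepted_species`, the function name `flag_threatened`, and the sample data are hypothetical, and a Python set only approximates what the database's hash join does.

```python
def flag_threatened(observations, red_list_names):
    """Add is_threatened_iucn to each row using a single shared lookup."""
    # Built once, like the in-memory hash table of a hash join.
    red_list = set(red_list_names)
    return [
        {**obs, "is_threatened_iucn": obs["accepted_species"] in red_list}
        for obs in observations
    ]
```

Each row then costs one constant-time membership test, which is the property the commit message attributes to reusing the same hash index for every row.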

13925 07/07/2014 07:04 AM Aaron Marcuse-Kubitza

bugfix: schemas/public_.sql: wrapper functions can't use util-schema types because these will cause the wrapper function to be cascadingly dropped when the util schema is reinstalled

13924 07/07/2014 06:55 AM Aaron Marcuse-Kubitza

bugfix: in_south_america(): must use util.contained_within__no_dateline() instead of util.contained_within_approx() to ensure that the more accurate geometry logic is used

13923 07/07/2014 06:51 AM Aaron Marcuse-Kubitza

bugfix: schemas/public_.sql: source__observation_type(): don't display NOTICEs about the search_path, because this function will be called millions of times

13922 07/07/2014 06:48 AM Aaron Marcuse-Kubitza

bugfix: schemas/public_.sql: in_iucn_red_list(): don't display NOTICEs about the search_path, because this function will be called millions of times

13921 07/07/2014 06:43 AM Aaron Marcuse-Kubitza

schemas/public_.sql: 2014-6-4.Iara_Lacher.reserve_prioritization: added is_threatened_iucn

13920 07/07/2014 06:37 AM Aaron Marcuse-Kubitza

schemas/public_.sql: added in_iucn_red_list()

13919 07/07/2014 06:36 AM Aaron Marcuse-Kubitza

schemas/public_.sql: added iucn_red_list, materialized from iucn_red_list_view

13918 07/07/2014 06:22 AM Aaron Marcuse-Kubitza

bugfix: schemas/vegbien.sql: iucn_red_list_view: need to include only names with an accepted name, and distinctify on the accepted names

13917 07/07/2014 06:17 AM Aaron Marcuse-Kubitza

schemas/public_.sql: added iucn_red_list

13916 07/07/2014 06:14 AM Aaron Marcuse-Kubitza

schemas/util.sql: materialize_view(): use util.copy() instead of util.materialize_query() so that all view metadata is transferred

13915 07/07/2014 06:02 AM Aaron Marcuse-Kubitza

schemas/public_.sql: iucn_red_list: renamed to iucn_red_list_view since this will be materialized

13914 07/07/2014 06:01 AM Aaron Marcuse-Kubitza

schemas/public_.sql: added iucn_red_list

13913 07/07/2014 05:44 AM Aaron Marcuse-Kubitza

schemas/public_.sql: 2014-6-4.Iara_Lacher.reserve_prioritization: added collection_type (using source__observation_type())

13912 07/07/2014 05:36 AM Aaron Marcuse-Kubitza

schemas/public_.sql: added source__observation_type()

13911 07/07/2014 05:04 AM Aaron Marcuse-Kubitza

schemas/public_.sql: 2014-6-4.Iara_Lacher.reserve_prioritization: added accepted_subspecies, and removed accepted_taxon_name_with_author which had been substituting for it

13910 07/07/2014 04:50 AM Aaron Marcuse-Kubitza

schemas/public_.sql: added subspecies() wrapper

13909 07/07/2014 04:38 AM Aaron Marcuse-Kubitza

schemas/util.sql: added subspecies()

13908 07/07/2014 04:10 AM Aaron Marcuse-Kubitza

schemas/public_.sql: 2014-6-4.Iara_Lacher.reserve_prioritization: include only coordinates in South America

13907 07/07/2014 03:55 AM Aaron Marcuse-Kubitza

schemas/util.sql: added south_america(), in_south_america()

13906 07/07/2014 03:37 AM Aaron Marcuse-Kubitza

fix: schemas/util.sql: OPERATOR ~(geocoord, postgis.geometry): renamed to because this is not an approximate comparison for geometry

13905 07/07/2014 03:35 AM Aaron Marcuse-Kubitza

fix: schemas/util.sql: contained_within_approx(geocoord, postgis.geometry): renamed to contained_within__no_dateline(__) because this is not an approximate comparison for geometry

13904 07/07/2014 03:30 AM Aaron Marcuse-Kubitza

schemas/util.sql: lat_long_in_new_world(): renamed to just in_new_world() because the lat/long is implied by the param type

13903 07/07/2014 03:08 AM Aaron Marcuse-Kubitza

schemas/util.sql: lat_long_in_new_world(): take a geocoord param instead of separate lat/long params

13902 07/07/2014 03:00 AM Aaron Marcuse-Kubitza

backups/TNRS.backup.md5: updated

13901 07/06/2014 11:08 PM Aaron Marcuse-Kubitza

schemas/util.sql: added contained_within_approx(geocoord, geometry) and corresponding OPERATOR ~@(geocoord, geometry)

13900 07/06/2014 10:39 PM Aaron Marcuse-Kubitza

schemas/util.sql: added OPERATOR ~@(geocoord, geography)

13899 07/06/2014 10:23 PM Aaron Marcuse-Kubitza

schemas/util.sql: lat_long_in_new_world(): use new contained_within_approx(geocoord, geography)

13898 07/06/2014 10:17 PM Aaron Marcuse-Kubitza

schemas/util.sql: added contained_within_approx(geocoord, postgis.geography), which enables specifying just `(lat, long)` without the ::util.geocoord type specifier

13897 07/06/2014 04:04 PM Aaron Marcuse-Kubitza

schemas/util.sql: OPERATOR (postgis.geography, postgis.geography): renamed to ~ because it's approximate

13896 07/06/2014 03:52 PM Aaron Marcuse-Kubitza

schemas/util.sql: contained_within(): renamed to contained_within_approx() because the latitude lines of geography type bounding boxes bulge outward, creating false positives above and below the bounding box

13895 07/06/2014 03:50 PM Aaron Marcuse-Kubitza

schemas/util.sql: added contained_within__no_dateline(geometry, geometry) and corresponding operator @

13894 07/06/2014 02:51 PM Aaron Marcuse-Kubitza

schemas/util.sql: contained_within(): renamed to contained_within_approx() because the latitude lines of geography type bounding boxes bulge outward, creating false positives above and below the bounding box

13893 07/06/2014 05:49 AM Aaron Marcuse-Kubitza

schemas/util.sql: geometry(geocoord): documented that it is not possible to create a cast for this, as a bug in pg_dump prevents the cast from being exported, even when no export filters are applied

13892 07/05/2014 12:40 PM Aaron Marcuse-Kubitza

schemas/util.sql: point(geocoord): renamed to geometry(geocoord) since this is now a cast

13891 07/05/2014 12:03 PM Aaron Marcuse-Kubitza

schemas/util.sql: point(): return geometry instead of geography to support using points with geometry arithmetic

13890 07/05/2014 11:24 AM Aaron Marcuse-Kubitza

schemas/util.sql: point(): take a single util.geocoord param instead of separate lat/long

13889 07/05/2014 10:45 AM Aaron Marcuse-Kubitza

schemas/util.sql: added geocoord type