/ - Changes - BIEN 3 - NCEAS Projects

root @ 8257

#	Date	Author	Comment
8257	03/28/2013 07:42 PM	Aaron Marcuse-Kubitza	Added inputs/GBIF/_MySQL/MySQL.*.sql.make
8256	03/28/2013 07:36 PM	Aaron Marcuse-Kubitza	inputs/FIA/: Archived no longer used subdirs from BIEN2 export
8255	03/28/2013 07:29 PM	Aaron Marcuse-Kubitza	inputs/FIA/: Archived no longer used subdirs from BIEN2 export
8254	03/28/2013 07:22 PM	Aaron Marcuse-Kubitza	inputs/input.Makefile: SVN: add: Removed Source/map.csv prerequisite because it is not related to adding unversioned files in the dir. It was originally a prerequisite in order to auto-create it when the datasource dir is first created, but the map.csv recipe does not currently create metadata-only map.csvs. In the future, metadata-only map.csvs will be replaced with constant columns added to the applicable tables.
8253	03/28/2013 07:19 PM	Aaron Marcuse-Kubitza	Added inputs/FIA/_archive
8252	03/28/2013 07:19 PM	Aaron Marcuse-Kubitza	inputs/input.Makefile: %/map.csv: Fixed bug where can only make header.csv if map.csv does not exist, because some subdirs are metadata-only and don't have a corresponding DB table
8251	03/28/2013 07:02 PM	Aaron Marcuse-Kubitza	README.TXT: Datasource setup: Install the staging tables: For a MySQL .sql export: Documented which password to use at each of the two password prompts my2pg_export will give you. You could also embed the value of the 2nd prompt in the _MySQL/*.make file using `--password="$(cat path/to/config/bien_password)"`.
8250	03/28/2013 06:56 PM	Aaron Marcuse-Kubitza	README.TXT: Datasource setup: Install the staging tables: Removed requirement that `make inputs/<datasrc>/reinstall quiet=1 &` be run on vegbiendev for MySQL .sql exports, because the hostname is now set to vegbiendev instead of localhost
8249	03/28/2013 06:38 PM	Aaron Marcuse-Kubitza	inputs/input.Makefile: sql/install: Use psql_script_vegbien instead of $(psqlNoSearchPath) (which uses psql_verbose_vegbien) because the insert statement for each data row should not be echoed
8248	03/28/2013 06:14 PM	Aaron Marcuse-Kubitza	inputs/FIA/occurrence_all/import: Run remake_VegBIEN_mappings at end to keep mappings to next stage of import process up to date
8247	03/28/2013 06:14 PM	Aaron Marcuse-Kubitza	inputs/FIA/occurrence_all/: Accepted new test output
8246	03/28/2013 06:13 PM	Aaron Marcuse-Kubitza	lib/import.sh: remake_VegBIEN_mappings(): Also remake VegBIEN.csv and test.xml.ref use `make test`
8245	03/28/2013 06:11 PM	Aaron Marcuse-Kubitza	lib/import.sh: Added remake_VegBIEN_mappings()
8244	03/28/2013 06:10 PM	Aaron Marcuse-Kubitza	inputs/input.Makefile: %/map.csv: make $*/header.csv first in case it doesn't exist (e.g. if it has been deleted so that it will be remade)
8243	03/28/2013 06:07 PM	Aaron Marcuse-Kubitza	inputs/FIA/occurrence_all/map.csv: Regenerated using new input table mappings
8242	03/28/2013 05:47 PM	Aaron Marcuse-Kubitza	lib/import.sh: Added make() and use it instead of the full make command
8241	03/28/2013 05:23 PM	Aaron Marcuse-Kubitza	inputs/input.Makefile: postprocess: Use %/postprocess instead of %/postprocess.sql/run so $*/import is also run
8240	03/28/2013 05:21 PM	Aaron Marcuse-Kubitza	inputs/FIA/: Ran inputs/FIA/import. This maps to VegCore's commonName.
8239	03/28/2013 05:19 PM	Aaron Marcuse-Kubitza	inputs/input.Makefile: %/postprocess: Also run the $*/import script, if it exists. Note that this is not the same as the %/import make target.
8238	03/28/2013 05:12 PM	Aaron Marcuse-Kubitza	inputs/input.Makefile: %/postprocess.sql/run: Factored out into separate %/postprocess command, which can eventually also perform other actions
8237	03/28/2013 04:59 PM	Aaron Marcuse-Kubitza	inputs/FIA/PLOT/map.csv: ELEV: Remapped to elevation_ft, assuming units based on the actual elevation of the region for a sample plot record
8236	03/28/2013 04:27 PM	Aaron Marcuse-Kubitza	inputs/VegBank/taxonobservation_/map.csv: Mapped int_currplantcommon to vernacularName
8235	03/28/2013 04:25 PM	Aaron Marcuse-Kubitza	mappings/VegCore.htm: Renamed salvias_plots table plotMetadata to PlotMetadata because of SALVIAS refresh on nimoy
8234	03/28/2013 04:18 PM	Aaron Marcuse-Kubitza	mappings/VegCore.htm: Regenerated from wiki. Added flower, fruit, commonName.
8233	03/28/2013 03:37 PM	Aaron Marcuse-Kubitza	mappings/Makefile: $(vocab); bin/redmine_synonyms: Support crossed out (deprecated) terms
8232	03/28/2013 03:24 PM	Aaron Marcuse-Kubitza	README.TXT: Maintenance: VegCore data dictionary: Added steps to update the data dictionary's Tables section if necessary
8231	03/28/2013 02:14 PM	Aaron Marcuse-Kubitza	inputs/GBIF/_MySQL/Makefile: %.data.sql: Added agent table
8230	03/28/2013 01:18 PM	Aaron Marcuse-Kubitza	Added inputs/GBIF/_MySQL/GBIFPortalDB-2013-02-20.data.sql.md5
8229	03/28/2013 01:11 PM	Aaron Marcuse-Kubitza	Added inputs/GBIF/_MySQL/GBIFPortalDB-2013-02-20.schema.sql
8228	03/28/2013 11:02 AM	Aaron Marcuse-Kubitza	Added web/main/svn/, now using .htaccess to forward to Redmine/
8227	03/28/2013 10:55 AM	Aaron Marcuse-Kubitza	Removed web/main/svn, svn-web symlinks because they need to be .htaccess-es in order for the relative mod_rewrite commands to work correctly
8226	03/28/2013 10:50 AM	Aaron Marcuse-Kubitza	Added web/main/svn, svn-web symlinks to Redmine/* for shorter URLs
8225	03/28/2013 10:49 AM	Aaron Marcuse-Kubitza	Added web/main/Redmine/svn-web/
8224	03/28/2013 08:28 AM	Aaron Marcuse-Kubitza	inputs/GBIF/: Added scripts for subsetting refresh
8223	03/28/2013 12:24 AM	Aaron Marcuse-Kubitza	lib/sql.py: table_order_by(): Documented that it returns None if table is a view, because table_cluster_on() would return None. This is necessary for inputs/FIA/occurrence_all/ sorting to work correctly, because specifying a manual sort order would prevent the query planner from just using fast nested loop joins and instead cause it to perform a slow sort. (This appears to be a bug in the query planner, because when the column list specified matches the joined-on indexes, there should be no need for post-nested loop re-sorting.)
8222	03/28/2013 12:20 AM	Aaron Marcuse-Kubitza	inputs/FIA/occurrence_all/test.xml.ref: Updated inserted row count for new row sort order
8221	03/28/2013 12:19 AM	Aaron Marcuse-Kubitza	lib/db_xml.py: put_table(): Fixed bug where also need to advance start to fetch next set when table is a view, because the views that are now being used with the import (inputs/FIA/occurrence_all/) are static rather than dynamic and do not return different rows after the previous set of rows has been imported
8220	03/27/2013 11:43 PM	Aaron Marcuse-Kubitza	inputs/FIA/occurrence_all/import: Removed no longer applicable comment that directional joins are needed for PostgreSQL query planner to avoid slow sorts
8219	03/27/2013 11:40 PM	Aaron Marcuse-Kubitza	inputs/FIA/TREE/import: Reclustered table by TREE.parent path index, to facilitate path-order joins
8218	03/27/2013 11:39 PM	Aaron Marcuse-Kubitza	inputs/FIA/occurrence_all/import: Changed all RIGHT JOINs to inner joins so that tables would be joined in path order (i.e. general->specific). This optimizes the incremental joins so that the small tables are joined to each other before being joined to the large tables, rather than each row of the large tables being looked up in the small tables. This effect may not be noticeable for small LIMIT values, but would become apparent for large LIMIT values, such as the 1-million-row partitions used by db_xml.put_table() for column-based import. Note that inner joins used to cause the query planner to produce incorrect results containing slow sorts, but now this appears to no longer be an issue, perhaps because the result is not sorted by the TREE.ID index (which is not in the same order as the path indexes .unique, .parent).
8217	03/27/2013 10:46 PM	Aaron Marcuse-Kubitza	inputs/FIA/occurrence_all/import: Removed trailing whitespace
8216	03/27/2013 10:30 PM	Aaron Marcuse-Kubitza	Removed unused inputs/FIA/COND_unique/. Use COND instead.
8215	03/27/2013 09:52 PM	Aaron Marcuse-Kubitza	inputs/FIA/import: Use `set -o errexit` instead of putting ` \|\| exit` after each command
8214	03/27/2013 09:52 PM	Aaron Marcuse-Kubitza	lib/import.sh: map_table(): Removed unneeded () around psql. This also fixes a bug where an error exit status from psql would not have aborted the script because `set -o errexit` does not apply to commands enclosed in (). For () you need to use ` \|\| exit` instead (or ` \|\| return` inside a function).
8213	03/27/2013 09:42 PM	Aaron Marcuse-Kubitza	lib/import.sh: Use `set -o errexit` so any command that exits with an error aborts the script. Note that a command's exit status can still be ignored using ` \|\| true`. Removed no longer needed ` \|\| return` in functions.
8212	03/27/2013 09:40 PM	Aaron Marcuse-Kubitza	schemas/util.sql: Renamed rename_if_exists() to try_create() because it can be used to create a column in any way, not just by renaming another column
8211	03/27/2013 09:33 PM	Aaron Marcuse-Kubitza	lib/import.sh: functions: abort if a command encounters an error
8210	03/27/2013 09:17 PM	Aaron Marcuse-Kubitza	schemas/VegCore/mk_derived: Added cultivated from oldGrowth
8209	03/27/2013 09:16 PM	Aaron Marcuse-Kubitza	schemas/util.sql: Added try_mk_derived_col()
8208	03/27/2013 08:35 PM	Aaron Marcuse-Kubitza	inputs/FIA/*/import: Run mk_derived after postprocessing commands
8207	03/27/2013 08:28 PM	Aaron Marcuse-Kubitza	inputs/FIA/import_order.txt: Added occurrence_all/
8206	03/27/2013 08:23 PM	Aaron Marcuse-Kubitza	mappings/VegCore-VegBIEN.csv: subplotID,subplot -> location.sourceaccessioncode: Fixed bug where need /_first to handle the case where both subplotID and subplot are provided
8205	03/27/2013 08:15 PM	Aaron Marcuse-Kubitza	Added inputs/FIA/map.csv, which maps shared columns to VegCore
8204	03/27/2013 08:12 PM	Aaron Marcuse-Kubitza	inputs/FIA/FIA_COND_unique/test.xml.ref: Updated now that PLOT, CONDID have been mapped
8203	03/27/2013 08:12 PM	Aaron Marcuse-Kubitza	inputs/FIA//map.csv for pre-refresh tables: Added back before unmapped column names
8202	03/27/2013 08:03 PM	Aaron Marcuse-Kubitza	lib/csvs.py: stream_info(): Fixed bug where headers with multiline columns were not supported because only the first line (not the first multiline row) is sniffed for the dialect
8201	03/27/2013 06:56 PM	Aaron Marcuse-Kubitza	inputs/input.Makefile: %/header.csv: Fixed bug where newlines inside column names were incorrectly formatted by psql's table header formatting, by using COPY TO STDOUT instead
8200	03/27/2013 05:28 PM	Aaron Marcuse-Kubitza	schemas/util.sql: Added do_optionally_ignore()
8199	03/27/2013 04:28 PM	Aaron Marcuse-Kubitza	schemas/util.sql: Renamed rename_if_exists() to try_create() because it can be used to create a column in any way, not just by renaming another column
8198	03/27/2013 04:12 PM	Aaron Marcuse-Kubitza	lib/import.sh: Added mk_derived(). Added mk_derived to usage template.
8197	03/27/2013 04:11 PM	Aaron Marcuse-Kubitza	Added schemas/VegCore/mk_derived, which will be run in the import scripts
8196	03/27/2013 04:09 PM	Aaron Marcuse-Kubitza	lib/import.sh: psql(): Set psql vars :schema, :table, :table_str for use by the psql commands
8195	03/27/2013 03:22 PM	Aaron Marcuse-Kubitza	lib/import.sh: Export $schema, $table so they are available to programs invoked within an import script, which should not reset these vars if they include import.sh
8194	03/27/2013 03:20 PM	Aaron Marcuse-Kubitza	lib/import.sh: Only set $table, $schema if they don't already exist
8193	03/27/2013 03:11 PM	Aaron Marcuse-Kubitza	lib/import.sh: Added $root_dir and use it in $bin_dir
8192	03/27/2013 03:11 PM	Aaron Marcuse-Kubitza	inputs/FIA//import: Use new mk__col()
8191	03/27/2013 02:50 PM	Aaron Marcuse-Kubitza	schemas/functions.sql: Renamed to util.sql because now that these schemas are used by the new-style import scripts, there can be more than just functions in them
8190	03/27/2013 02:43 PM	Aaron Marcuse-Kubitza	schemas/util.sql: Added mk_const_col()
8189	03/27/2013 02:37 PM	Aaron Marcuse-Kubitza	schemas/util.sql: Added type_qual()
8188	03/27/2013 02:34 PM	Aaron Marcuse-Kubitza	schemas/util.sql: mk_derived_col(): Added "idempotent" comment
8187	03/27/2013 02:23 PM	Aaron Marcuse-Kubitza	schemas/util.sql: Added mk_derived_col()
8186	03/27/2013 02:22 PM	Aaron Marcuse-Kubitza	inputs/FIA/COND/import: oldGrowth: Updated expr column names
8185	03/27/2013 01:49 PM	Aaron Marcuse-Kubitza	schemas/util.sql: Added typeof(text, regtype)
8184	03/27/2013 12:54 PM	Aaron Marcuse-Kubitza	inputs/FIA/*/import: Removed util. before function names because util is in the search_path
8183	03/27/2013 12:43 PM	Aaron Marcuse-Kubitza	schemas/functions.sql: Renamed to util.sql because now that these schemas are used by the new-style import scripts, there can be more than just functions in them
8182	03/25/2013 11:19 PM	Aaron Marcuse-Kubitza	schemas/functions.sql: Added existing_cols()
8181	03/25/2013 11:12 PM	Aaron Marcuse-Kubitza	schemas/functions.sql: col_type(): Fixed bug where a NULL col name crashed the undefined_column throw, because MESSAGE can't be NULL and the NULL name was nulling out the entire message
8180	03/25/2013 11:08 PM	Aaron Marcuse-Kubitza	schemas/functions.sql: Added col_exists()
8179	03/25/2013 10:31 PM	Aaron Marcuse-Kubitza	inputs/FIA/COND/map.csv: Mapped SLOPE, ASPECT
8178	03/25/2013 10:23 PM	Aaron Marcuse-Kubitza	web/main/.htaccess: remove linewraps (of the form table.path.vg/_-term) used to create a newline for Google spreadsheets
8177	03/25/2013 09:45 PM	Aaron Marcuse-Kubitza	inputs/FIA//map.csv: Replaced . between table and column name with newline, so that table viewers like pgAdmin will display both the table and column name at the left edge of the header cell, rather than displaying only the table name because the column name doesn't fit. This fixes the problem of seeing a bunch of columns whose names all start with a table name, and not knowing what each of them is. It also preserves the ability to see at a glance which table a column is in, which helps in navigating wide tables. Removed before unmapped terms, because whether a term is mapped is generally obvious from the table name itself.
8176	03/25/2013 09:01 PM	Aaron Marcuse-Kubitza	inputs/input.Makefile: %/.map.csv.last_cleanup: Run fix_line_endings after canon/translate to standardize Python's \r\n line endings back to \n. This prevents issues with mixed line endings because LibreOffice (and probably Excel) treat all cell-internal line endings as \n but row line endings as whatever the file had, while text editors like jEdit translate all line endings to whatever the autodetected line ending is. (This creates spurious line ending diffs when a map spreadsheet containing multiline cells is edited in a text editor.)
8175	03/25/2013 08:45 PM	Aaron Marcuse-Kubitza	Added bin/fix_line_endings to standardize \r\n line endings to \n
8174	03/25/2013 08:12 PM	Aaron Marcuse-Kubitza	inputs/FIA/COND/import: Renamed COND.oldgrowth to VegCore name oldGrowth
8173	03/25/2013 07:52 PM	Aaron Marcuse-Kubitza	inputs/FIA/*/map.csv: Ensured that joined columns are globally unique, so they don't map to an ambiguous VegCore term in the future
8172	03/25/2013 07:38 PM	Aaron Marcuse-Kubitza	inputs/FIA/*/map.csv: Mapped terms to VegCore
8171	03/25/2013 07:22 PM	Aaron Marcuse-Kubitza	schemas/functions.sql: col_type(): Include column name in error message
8170	03/25/2013 06:57 PM	Aaron Marcuse-Kubitza	inputs/FIA/*/import: Updated column names to match map.csv
8169	03/25/2013 06:47 PM	Aaron Marcuse-Kubitza	schemas/functions.sql: col_type(): Raise undefined_column exception if column does not exist, instead of silently returning NULL
8168	03/25/2013 06:34 PM	Aaron Marcuse-Kubitza	inputs/FIA/import: Abort if any invoked script encounters an error
8167	03/25/2013 05:44 PM	Aaron Marcuse-Kubitza	planning/timeline/timeline.2013.xls: Updated for current progress
8166	03/25/2013 04:55 PM	Aaron Marcuse-Kubitza	inputs/FIA/*/map.csv: Removed no longer needed leading . from joined fields (globally-unique terms), because functions.to_global_col_names() is not used anymore
8165	03/25/2013 04:46 PM	Aaron Marcuse-Kubitza	Added inputs/FIA/occurrence_all/, which combines all the core tables in a denormalized view. Note that it is not necessary to materialize this view into a (large) denormalized table, because the unique indexes and left/right joins allow the rows to be denormalized on the fly.
8164	03/25/2013 04:36 PM	Aaron Marcuse-Kubitza	inputs/FIA/*/import: Use map_table to set column names based on the contents of map.csv, instead of using functions.to_global_col_names() and functions.rename_if_exists(). Added map.csv for all tables.
8163	03/25/2013 03:19 PM	Aaron Marcuse-Kubitza	inputs/FIA/: Changed postprocess.sql scripts to import scripts that can be run directly. Added top-level inputs/FIA/import to run all of them together.
8162	03/25/2013 03:05 PM	Aaron Marcuse-Kubitza	inputs/FIA/COND/postprocess.sql: Removed trailing whitespace
8161	03/25/2013 02:25 PM	Aaron Marcuse-Kubitza	Added lib/import.sh, for use by new, simpler import scripts used by FIA. Note that for now, input.Makefile is still used to create map.csv.
8160	03/22/2013 11:13 PM	Aaron Marcuse-Kubitza	inputs/input.Makefile: Moved postprocess.sql from $(exportHeader) to %/install because that is not part of the $(exportHeader) functionality. Added %/header.csv and use it in $(exportHeader).
8159	03/22/2013 11:05 PM	Aaron Marcuse-Kubitza	inputs/input.Makefile: $(catSrcs): Fixed bug where need to use $(nonHeaderSrcs) instead of $(srcs) to exclude header.csv
8158	03/22/2013 08:07 PM	Aaron Marcuse-Kubitza	schemas/functions.sql: map: Added additional columns that are present in the standard map spreadsheet format (filter, notes). These columns are necessary to make COPY FROM work, because it requires the # of columns to be the same in the input data and the output table.

Project

General

Profile