/ - Changes - BIEN 3 - NCEAS Projects

root @ 1156

#	Date	Author	Comment
1156	02/27/2012 11:25 AM	Aaron Marcuse-Kubitza	inputs/SALVIAS/verify.sql: Updated for schema changes
1155	02/27/2012 10:24 AM	Aaron Marcuse-Kubitza	Regenerated vegbien.ERD exports
1154	02/27/2012 10:22 AM	Aaron Marcuse-Kubitza	vegbien.ERD.mwb: Re-marked aggregateoccurrence:plantobservation relationship as 1:1 in the ERD. (I think this will need to be manually re-marked whenever either of those tables is updated.)
1153	02/27/2012 10:18 AM	Aaron Marcuse-Kubitza	vegbien.sql: Removed methodgrowthform and growthform, since growthforms can be accommodated by plantconcept in a similar way as higher-order taxonomic ranks
1152	02/27/2012 10:09 AM	Aaron Marcuse-Kubitza	vegbien.sql: methodgrowthform, methodtaxonclass: Removed "included" default value so it's always obvious whether the author intended the classes to be inclusions or exclusions
1151	02/27/2012 10:04 AM	Aaron Marcuse-Kubitza	vegbien.sql: aggregateoccurrence: Removed unneeded fields. Added aggregateoccurrence->coverindex fkey.
1150	02/27/2012 09:54 AM	Aaron Marcuse-Kubitza	vegbien.sql: Added constraint to enforce 1:1 aggregateoccurrence:plantobservation relationship
1149	02/25/2012 08:16 PM	Aaron Marcuse-Kubitza	vegbien.sql: Added plantname unique constraint
1148	02/25/2012 08:01 PM	Aaron Marcuse-Kubitza	bin/map: Use new util.ListDict and util.WrapIter to simplify getting rows by column name instead of index, and to enable a row to be printed with its column names in error messages
1147	02/25/2012 08:00 PM	Aaron Marcuse-Kubitza	util.py: Added WrapIter to wrap an iterator and ListDict to view a list as a dict
1146	02/25/2012 07:38 PM	Aaron Marcuse-Kubitza	bin/map: Use new util.list_flip()
1145	02/25/2012 07:37 PM	Aaron Marcuse-Kubitza	util.py: Added list_flip()
1144	02/25/2012 07:02 PM	Aaron Marcuse-Kubitza	env_password: Fixed to set the environment variable in the calling shell. Do this by cc-ing the tty only on messages before the "Enter password" prompt, because the redirect creates a subshell which causes the env var to only be set within that subshell.
1143	02/25/2012 06:18 PM	Aaron Marcuse-Kubitza	inputs/NYBG-CSV/maps/DwC.specimens.csv: Removed mappings that are already present in mappings/DwC1-DwC2.specimens.csv. This map now contains only the mappings where NYBG-CSV differs from standard DwC1.
1142	02/25/2012 06:14 PM	Aaron Marcuse-Kubitza	inputs/NYBG/maps/DwC.specimens.csv: Removed mappings that are already present in mappings/DwC1-DwC2.specimens.csv. This map now contains only the mappings where NYBG differs from standard DwC1.
1141	02/25/2012 05:58 PM	Aaron Marcuse-Kubitza	Remove accidentally-committed temp file inputs/NYBG/DwC.specimens2.csv
1140	02/25/2012 05:56 PM	Aaron Marcuse-Kubitza	mappings/Makefile: Generate DwC.self.specimens.csv from DwC-VegBIEN.specimens.csv for use in creating full via maps for inputs
1139	02/25/2012 05:40 PM	Aaron Marcuse-Kubitza	input.Makefile: Generate full via maps from input via maps by appending mappings from the via format to itself when available
1138	02/25/2012 04:30 PM	Aaron Marcuse-Kubitza	inputs/NYBG/maps/DwC.specimens.csv: Changed label to "NYBG-DwC" to take advantage of automatic filling in of DwC mappings not specified in the NYBG map
1137	02/25/2012 04:28 PM	Aaron Marcuse-Kubitza	subtract: Support custom column numbers to compare on (instead of just input col). Added ignore option to continue even if input columns don't match.
1136	02/25/2012 04:26 PM	Aaron Marcuse-Kubitza	bin/map: DB inputs: Get all rows in one query (hopefully a significant optimization). Allow maps to contain entries for columns that are not in the DB table.
1135	02/25/2012 04:22 PM	Aaron Marcuse-Kubitza	sql.py: select(): Select all fields if fields == None. Replaced col(cur, idx) with col_names(cur) because an iterator is easier to use than getting by index.
1134	02/25/2012 03:57 PM	Aaron Marcuse-Kubitza	bin/map: Fixed bug in previous implementation of allowing maps for CSV inputs to contain entries for columns that are not in the CSV file
1133	02/25/2012 03:45 PM	Aaron Marcuse-Kubitza	bin/map: Allow maps for CSV inputs to contain entries for columns that are not in the CSV file
1132	02/25/2012 02:54 PM	Aaron Marcuse-Kubitza	Use new sort_map instead of manually specifying the sort order
1131	02/25/2012 02:54 PM	Aaron Marcuse-Kubitza	Added sort_map to sort a map spreadsheet in the standard order
1130	02/25/2012 02:43 PM	Aaron Marcuse-Kubitza	Removed no longer needed join_passthru, because join_union_sort now serves its purpose
1129	02/25/2012 02:42 PM	Aaron Marcuse-Kubitza	Don't generate mappings/for_review/DwC-VegBIEN.specimens.csv because it's a derived map with lots of duplicated mappings for the various DwC versions
1128	02/25/2012 02:41 PM	Aaron Marcuse-Kubitza	mappings/Makefile: Generate DwC-VegBIEN.specimens.csv directly from DwC1-DwC2 and DwC2-VegBIEN mappings by using join_union_sort with header_num=1, rather than via intermediate DwC1-VegBIEN.specimens.csv
1127	02/25/2012 02:37 PM	Aaron Marcuse-Kubitza	union: Added header_num option to select which map's header to use as the output header
1126	02/25/2012 02:28 PM	Aaron Marcuse-Kubitza	Rename join_sort to join_union_sort and have it run union in ignore mode. This will automatically append the joined map when the input map is a derivative of the joined map, such as for NYBG-DwC.
1125	02/25/2012 02:25 PM	Aaron Marcuse-Kubitza	union: Pass through map 0, so that if ignore is set, the input map will still be output. Allow either map's input label to contain the other's input label to enable e.g. appending mappings for an older input version to those for a newer input version.
1124	02/25/2012 01:43 PM	Aaron Marcuse-Kubitza	DwC1-DwC2 mapping: Changed input label to DwC1, which is allowed by the now relaxed label constraints imposed by union
1123	02/25/2012 01:42 PM	Aaron Marcuse-Kubitza	union: Check if two maps can be combined based on whether map 0 column 0 label contains map 1 column 0 label instead of being equal. This allows map 0's input 0 root to contain the datasource name as well as a format that allows it to be combined with a more general map. Added ignore flag to not print an error if column labels don't match.
1122	02/25/2012 01:39 PM	Aaron Marcuse-Kubitza	bin/map: Support optional data format tag in map spreadsheet labels, used by union to check if two maps can be combined
1121	02/25/2012 01:01 PM	Aaron Marcuse-Kubitza	mappings: Added DwC1-DwC2.specimens.csv to core maps so it gets cleaned up
1120	02/25/2012 12:57 PM	Aaron Marcuse-Kubitza	Only generate for_review mappings of core maps and end products
1119	02/25/2012 12:56 PM	Aaron Marcuse-Kubitza	Generate DwC-VegBIEN mapping as union of DwC1 and DwC2 mappings
1118	02/24/2012 08:00 PM	Aaron Marcuse-Kubitza	Generate DwC-VegBIEN mapping as union of DwC1 and DwC2 mappings
1117	02/24/2012 07:40 PM	Aaron Marcuse-Kubitza	NYBG DB mapping: Removed IdentifiedDate and CollectedDate mappings because they are generated from the year/month/day
1116	02/24/2012 07:39 PM	Aaron Marcuse-Kubitza	Added mappings/for_review/DwC1-VegBIEN.specimens.csv
1115	02/24/2012 07:35 PM	Aaron Marcuse-Kubitza	Added DwC1-DwC mapping. Generate DwC1-VegBIEN mapping automatically.
1114	02/24/2012 07:11 PM	Aaron Marcuse-Kubitza	Regenerated vegbien.ERD exports
1113	02/24/2012 07:08 PM	Aaron Marcuse-Kubitza	vegbien.sql: Renamed _keys unique constraints/unique indexes to _unique to better reflect their purpose
1112	02/24/2012 06:54 PM	Aaron Marcuse-Kubitza	vegbien.sql: Added method.diameterheight to store DBH height
1111	02/24/2012 06:44 PM	Aaron Marcuse-Kubitza	VegBIEN: Moved plantstatus.plantlevel to plantname.rank because the taxonomic rank is a property of the name itself
1110	02/24/2012 06:43 PM	Aaron Marcuse-Kubitza	PostgreSQL-MySQL.csv: Fixed custom types translation to match shorter type names
1109	02/24/2012 06:09 PM	Aaron Marcuse-Kubitza	vegbien.sql: Added plantstatus unique constraint
1108	02/24/2012 06:07 PM	Aaron Marcuse-Kubitza	DwC-VegBIEN mapping: Map datasource name via DwC institutionCode
1107	02/24/2012 05:42 PM	Aaron Marcuse-Kubitza	Regenerated vegbien.ERD exports
1106	02/24/2012 05:40 PM	Aaron Marcuse-Kubitza	vegbien.ERD.mwb: Lined up logo and legend with other ERD elements
1105	02/24/2012 05:35 PM	Aaron Marcuse-Kubitza	vegbien.sql: Renamed methodgrowthform.growthformmethod_id to submethod_id. Added methodtaxonclass.submethod_id (similar to methodgrowthform.submethod_id).
1104	02/24/2012 05:27 PM	Aaron Marcuse-Kubitza	vegbien.sql: Added methodgrowthform.growthformmethod_id for specifying a method used by just the growthform
1103	02/24/2012 05:14 PM	Aaron Marcuse-Kubitza	vegbien.ERD.mwb: Rearranged legend to more closely match layout of ERD
1102	02/24/2012 04:51 PM	Aaron Marcuse-Kubitza	vegbien.sql: Reordered plantstatus fields to put the most important fields at the top, which will be visible in the ERD
1101	02/24/2012 04:42 PM	Aaron Marcuse-Kubitza	vegbien.sql: Replaced method.taxonclassincluded,taxonclassexcluded with new many:many methodtaxonclass table. Added methodgrowthform, growthform tables to do the same thing as methodtaxonclass for growth forms.
1100	02/24/2012 03:53 PM	Aaron Marcuse-Kubitza	vegbien.sql: method: Added comment on reference_id
1099	02/24/2012 03:44 PM	Aaron Marcuse-Kubitza	VegBIEN: Moved plotmethod fields to method because they can also apply to strata. Removed no longer used plotmethod table.
1098	02/24/2012 03:13 PM	Aaron Marcuse-Kubitza	input.Makefile: input DB creation: Removed "IF NOT EXISTS" because that check is handled by $(dbExists)
1097	02/24/2012 03:02 PM	Aaron Marcuse-Kubitza	input.Makefile: Don't try to recreate an input DB if it already exists
1096	02/24/2012 03:01 PM	Aaron Marcuse-Kubitza	Added UArizona DB input
1095	02/24/2012 02:42 PM	Aaron Marcuse-Kubitza	Renaming UArizona to UArizona-CSV because there is also a DB input in bien2_staging.ariz_raw on nimoy
1094	02/24/2012 02:31 PM	Aaron Marcuse-Kubitza	Added UArizona input
1093	02/24/2012 12:46 PM	Aaron Marcuse-Kubitza	env_password: Fixed bug where exit command would not cause it to exit, because pipefail shell option was not set. Moved automatic exiting of the calling script into env_password itself.
1092	02/24/2012 12:26 PM	Aaron Marcuse-Kubitza	map: Exit if password not set
1091	02/24/2012 12:18 PM	Aaron Marcuse-Kubitza	env_password: cc stderr if it's a log file
1090	02/23/2012 06:49 PM	Aaron Marcuse-Kubitza	env_password: Print all messages to /dev/tty so the user sees them even if stderr is redirected to a log file. Exit if password not already set, because e.g. scripts run in the background will not be able to prompt for it.
1089	02/23/2012 05:32 PM	Aaron Marcuse-Kubitza	input.Makefile: Don't have make import call verify, because the user often runs import as a test and will not want the output cluttered with verification information. Also, the full imports for which this was intended are often run asynchronously, so that the user will not see the output anyway.
1088	02/23/2012 05:28 PM	Aaron Marcuse-Kubitza	input.Makefile: Don't abort on verification errors, which are expected during development
1087	02/21/2012 06:21 PM	Aaron Marcuse-Kubitza	SALVIAS tests: Fixed invalid accepted test outputs due to not running `make empty_db` before running tests when using the no-redo optimization shortcut
1086	02/21/2012 06:14 PM	Aaron Marcuse-Kubitza	SALVIAS mappings: Fixed plot key mappings to map the correct values to subplot and parent plot
1085	02/21/2012 05:36 PM	Aaron Marcuse-Kubitza	vegbien.sql: locationevent: Added unique constraint for subplots based on subplot location
1084	02/21/2012 05:02 PM	Aaron Marcuse-Kubitza	SALVIAS-db VegX mapping: Map subplots correctly the way SALVIAS-CSV does
1083	02/21/2012 04:54 PM	Aaron Marcuse-Kubitza	SALVIAS verification: Updated to schema changes
1082	02/21/2012 04:42 PM	Aaron Marcuse-Kubitza	input.Makefile: Fixed syntax error in verify %.ref target (outdated variable name)
1081	02/21/2012 04:33 PM	Aaron Marcuse-Kubitza	input.Makefile: Halt psql commands on first error
1080	02/21/2012 04:27 PM	Aaron Marcuse-Kubitza	vegbien.sql: Removed location.authorlocationcode because it's now stored in locationevent as an author-specific setting
1079	02/21/2012 04:24 PM	Aaron Marcuse-Kubitza	vegbien.sql: locationevent: Redid unique constraints to avoid applying authorlocationcode-only duplicate elimination to subplots
1078	02/21/2012 04:16 PM	Aaron Marcuse-Kubitza	SALVIAS mappings: Map SiteCode/plot_code to locationevent.authorlocationcode because locationevent is now the place to store author-specific plot information
1077	02/21/2012 04:10 PM	Aaron Marcuse-Kubitza	SALVIAS mappings: Fixed PlotID mapping to go to locationevent.sourceaccessioncode
1076	02/21/2012 04:06 PM	Aaron Marcuse-Kubitza	VegBIEN: Renamed locationevent.authoreventcode to authorlocationcode to reflect that datasources usually use an author-defined code for a plot rather than a plot event
1075	02/21/2012 04:03 PM	Aaron Marcuse-Kubitza	vegbien.sql: locationevent: Redid unique constraints to handle datasources that treat the authoreventcode as an authorlocationcode. Eventually, authoreventcode will be renamed to authorlocationcode.
1074	02/21/2012 03:51 PM	Aaron Marcuse-Kubitza	vegbien.sql: locationevent: Redid unique constraints to work properly for all fully-specified combinations of keys
1073	02/21/2012 03:31 PM	Aaron Marcuse-Kubitza	VegBIEN mappings: Mapped datasource name to new project.datasource. Fixes project duplicate elimination.
1072	02/21/2012 03:16 PM	Aaron Marcuse-Kubitza	vegbien.sql: Renamed project.reference_id to datasource_id and pointed it to party, to match locationevent, etc.
1071	02/21/2012 03:02 PM	Aaron Marcuse-Kubitza	VegBIEN mappings: Mapped current lat/long to centerlat/long as well so location duplicate elimination will work properly
1070	02/21/2012 03:01 PM	Aaron Marcuse-Kubitza	xpath.py: Added support for common subpath after split path's {}
1069	02/21/2012 01:30 PM	Aaron Marcuse-Kubitza	sql.py: put(): When encountering a DuplicateKeyException, use dict_subset_right_join to fill in explicit NULL values for columns which don't have data. This causes the database to use the UNIQUE constraint's index to look up the record, instead of relying on individual column indexes for the columns that did have data, which may or may not be available.
1068	02/21/2012 01:27 PM	Aaron Marcuse-Kubitza	util.py: Added DefaultDict to wrap collections.defaultdict with a simple value passed in the constructor, defaulting to None. Added dict_subset_right_join() to fill in None for subset keys that don't exist.
1067	02/21/2012 01:06 PM	Aaron Marcuse-Kubitza	vegbien.sql: Added method and plotmethod UNIQUE indexes
1066	02/21/2012 01:04 PM	Aaron Marcuse-Kubitza	vegbien.ERD.mwb: Removed embargo table from ERD because its functionality is provided in location.confidentialitystatus,confidentialityreason
1065	02/21/2012 12:36 PM	Aaron Marcuse-Kubitza	Regenerated vegbien.ERD exports
1064	02/21/2012 12:34 PM	Aaron Marcuse-Kubitza	vegbien.sql: Moved locationevent method fields to plotmethod and method. Added comments to method/plotmethod fields, as provided by Michael Lee.
1063	02/21/2012 12:15 PM	Aaron Marcuse-Kubitza	VegX-VegBIEN mapping: Mapped locationevent.methodnarrative to new plotmethod table
1062	02/18/2012 07:18 PM	Aaron Marcuse-Kubitza	Regenerated vegbien.ERD exports
1061	02/18/2012 07:15 PM	Aaron Marcuse-Kubitza	VegX-VegBIEN mapping: Map sourceaccessioncode and voucher (catalognumber_dwc) to correct place. SALVIAS mappings: Map SourceVoucher as an alternative to coll_number.
1060	02/18/2012 06:46 PM	Aaron Marcuse-Kubitza	vegbien.sql: Removed VegBank-internal tables (including user account tables) because they do not belong in the ecological database. Any web interface should store user account information, cached queries, etc. in a separate interface-specific database.
1059	02/18/2012 06:34 PM	Aaron Marcuse-Kubitza	VegX-VegBIEN mapping: Mapped stem tags to new stemtag table
1058	02/18/2012 05:47 PM	Aaron Marcuse-Kubitza	vegbien.sql: Renamed planttag to stemtag and made it a child of stemobservation. Removed trait table from ERD because it's not used for the purpose we want to use traits for.
1057	02/18/2012 05:05 PM	Aaron Marcuse-Kubitza	vegbien.sql: Removed no longer used location.reference_id. Datasource scoping is now done on locationevent instead, so that locations can be shared across datasources that refer to the same plot or point.
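
Note on revisions 1068 and 1069: those entries describe a DefaultDict wrapper and a dict_subset_right_join() helper, which put() uses to fill in explicit NULL values for unique-key columns that have no data. The helper source is not shown in this log, so the following is only a minimal sketch of what they might look like. The signatures, argument order, and the example column names are assumptions, and the sketch uses Python 3 syntax rather than whatever the repository used at the time.

```python
import collections


class DefaultDict(collections.defaultdict):
    """A defaultdict whose default is a plain value instead of a factory.

    Assumed signature: the log only says the value is passed in the
    constructor and defaults to None.
    """

    def __init__(self, dict_, default=None):
        # defaultdict expects a zero-argument factory, so wrap the value.
        collections.defaultdict.__init__(self, lambda: default, dict_)


def dict_subset_right_join(dict_, keys):
    """Return a dict containing exactly `keys`.

    Keys missing from dict_ are filled in with None, which a SQL layer
    could then translate to an explicit NULL. Hypothetical reconstruction.
    """
    return {key: dict_.get(key) for key in keys}


# Hypothetical example: a row that has data for only one of the
# columns in a UNIQUE constraint.
row = {'authorlocationcode': 'P1'}
unique_cols = ['datasource_id', 'authorlocationcode']
filled = dict_subset_right_join(row, unique_cols)
```

Here `filled` holds an entry for every column in `unique_cols`, with None for `datasource_id`. According to revision 1069, looking a record up on the full column set lets the database use the UNIQUE constraint's index rather than per-column indexes that may not exist.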

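Note on revisions 1145, 1147 and 1148: those entries add list_flip(), ListDict and WrapIter so that bin/map can read rows by column name instead of by index. Their implementations are not part of this log. The sketch below shows one way such helpers could work; all signatures and the sample data are assumptions, written in Python 3.

```python
def list_flip(list_):
    """Map each value in a list to its index (assumed behavior)."""
    return {value: idx for idx, value in enumerate(list_)}


class ListDict:
    """View a list as a dict, using a parallel list of key names."""

    def __init__(self, list_, keys):
        self.list = list_
        self.key_idxs = list_flip(keys)

    def __getitem__(self, key):
        return self.list[self.key_idxs[key]]

    def __repr__(self):
        # Show values with their column names, e.g. for error messages.
        return repr({k: self.list[i] for k, i in self.key_idxs.items()})


class WrapIter:
    """Wrap an iterator, applying a function to each item it yields."""

    def __init__(self, wrap_func, iterable):
        self.wrap_func = wrap_func
        self.iter = iter(iterable)

    def __iter__(self):
        return self

    def __next__(self):
        return self.wrap_func(next(self.iter))


# Hypothetical input: column names plus rows as plain lists.
col_names = ['plot', 'species']
rows = [['P1', 'Quercus alba'], ['P2', 'Acer rubrum']]

# Each row is wrapped lazily, so it can be indexed by column name.
wrapped = WrapIter(lambda row: ListDict(row, col_names), rows)
species = [row['species'] for row in wrapped]
```

Wrapping each row as it is read avoids building a separate dict per row, and the `__repr__` method gives the kind of name-labelled row output that revision 1148 mentions for error messages.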