Project

General

Profile

Statistics
| Revision:

# Date Author Comment
1156 02/27/2012 11:25 AM Aaron Marcuse-Kubitza

inputs/SALVIAS/verify.sql: Updated for schema changes

1155 02/27/2012 10:24 AM Aaron Marcuse-Kubitza

Regenerated vegbien.ERD exports

1154 02/27/2012 10:22 AM Aaron Marcuse-Kubitza

vegbien.ERD.mwb: Re-marked aggregateoccurrence:plantobservation relationship as 1:1 in the ERD. (I think this will need to be manually re-marked whenever either of those tables is updated.)

1153 02/27/2012 10:18 AM Aaron Marcuse-Kubitza

vegbien.sql: Removed methodgrowthform and growthform, since growthforms can be accommodated by plantconcept in a similar way as higher-order taxonomic ranks

1152 02/27/2012 10:09 AM Aaron Marcuse-Kubitza

vegbien.sql: methodgrowthform, methodtaxonclass: Removed "included" default value so it's always obvious whether the author intended the classes to be inclusions or exclusions

1151 02/27/2012 10:04 AM Aaron Marcuse-Kubitza

vegbien.sql: aggregateoccurrence: Removed unneeded fields. Added aggregateoccurrence->coverindex fkey.

1150 02/27/2012 09:54 AM Aaron Marcuse-Kubitza

vegbien.sql: Added constraint to enforce 1:1 aggregateoccurrence:plantobservation relationship

1149 02/25/2012 08:16 PM Aaron Marcuse-Kubitza

vegbien.sql: Added plantname unique constraint

1148 02/25/2012 08:01 PM Aaron Marcuse-Kubitza

bin/map: Use new util.ListDict and util.WrapIter to simplify getting rows by column name instead of index, and to enable a row to be printed with its column names in error messages

1147 02/25/2012 08:00 PM Aaron Marcuse-Kubitza

util.py: Added WrapIter to wrap an iterator and ListDict to view a list as a dict

1146 02/25/2012 07:38 PM Aaron Marcuse-Kubitza

bin/map: Use new util.list_flip()

1145 02/25/2012 07:37 PM Aaron Marcuse-Kubitza

util.py: Added list_flip()

1144 02/25/2012 07:02 PM Aaron Marcuse-Kubitza

env_password: Fixed to set the environment variable in the calling shell. Do this by cc-ing the tty only on messages before the "Enter password" prompt, because the redirect creates a subshell which causes the env var to only be set within that subshell.

1143 02/25/2012 06:18 PM Aaron Marcuse-Kubitza

inputs/NYBG-CSV/maps/DwC.specimens.csv: Removed mappings that are already present in mappings/DwC1-DwC2.specimens.csv. This map now contains only the mappings where NYBG-CSV differs from standard DwC1.

1142 02/25/2012 06:14 PM Aaron Marcuse-Kubitza

inputs/NYBG/maps/DwC.specimens.csv: Removed mappings that are already present in mappings/DwC1-DwC2.specimens.csv. This map now contains only the mappings where NYBG differs from standard DwC1.

1141 02/25/2012 05:58 PM Aaron Marcuse-Kubitza

Remove accidentally-committed temp file inputs/NYBG/DwC.specimens2.csv

1140 02/25/2012 05:56 PM Aaron Marcuse-Kubitza

mappings/Makefile: Generate DwC.self.specimens.csv from DwC-VegBIEN.specimens.csv for use in creating full via maps for inputs

1139 02/25/2012 05:40 PM Aaron Marcuse-Kubitza

input.Makefile: Generate full via maps from input via maps by appending mappings from the via format to itself when available

1138 02/25/2012 04:30 PM Aaron Marcuse-Kubitza

inputs/NYBG/maps/DwC.specimens.csv: Changed label to "NYBG-DwC" to take advantage of automatic filling in of DwC mappings not specified in the NYBG map

1137 02/25/2012 04:28 PM Aaron Marcuse-Kubitza

subtract: Support custom column numbers to compare on (instead of just input col). Added ignore option to continue even if input columns don't match.

1136 02/25/2012 04:26 PM Aaron Marcuse-Kubitza

bin/map: DB inputs: Get all rows in one query (hopefully a significant optimization). Allow maps to contain entries for columns that are not in the DB table.

1135 02/25/2012 04:22 PM Aaron Marcuse-Kubitza

sql.py: select(): Select all fields if fields == None. Replaced col(cur, idx) with col_names(cur) because an iterator is easier to use than getting by index.

1134 02/25/2012 03:57 PM Aaron Marcuse-Kubitza

bin/map: Fixed bug in previous implementation of allowing maps for CSV inputs to contain entries for columns that are not in the CSV file

1133 02/25/2012 03:45 PM Aaron Marcuse-Kubitza

bin/map: Allow maps for CSV inputs to contain entries for columns that are not in the CSV file

1132 02/25/2012 02:54 PM Aaron Marcuse-Kubitza

Use new sort_map instead of manually specifying the sort order

1131 02/25/2012 02:54 PM Aaron Marcuse-Kubitza

Added sort_map to sort a map spreadsheet in the standard order

1130 02/25/2012 02:43 PM Aaron Marcuse-Kubitza

Removed no longer needed join_passthru, because join_union_sort now serves its purpose

1129 02/25/2012 02:42 PM Aaron Marcuse-Kubitza

Don't generate mappings/for_review/DwC-VegBIEN.specimens.csv because it's a derived map with lots of duplicated mappings for the various DwC versions

1128 02/25/2012 02:41 PM Aaron Marcuse-Kubitza

mappings/Makefile: Generate DwC-VegBIEN.specimens.csv directly from DwC1-DwC2 and DwC2-VegBIEN mappings by using join_union_sort with header_num=1, rather than via intermediate DwC1-VegBIEN.specimens.csv

1127 02/25/2012 02:37 PM Aaron Marcuse-Kubitza

union: Added header_num option to select which map's header to use as the output header

1126 02/25/2012 02:28 PM Aaron Marcuse-Kubitza

Rename join_sort to join_union_sort and have it run union in ignore mode. This will automatically append the joined map when the input map is a derivative of the joined map, such as for NYBG-DwC.

1125 02/25/2012 02:25 PM Aaron Marcuse-Kubitza

union: Pass through map 0, so that if ignore is set, the input map will still be output. Allow either map's input label to contain the other's input label to enable e.g. appending mappings for an older input version to those for a newer input version.

1124 02/25/2012 01:43 PM Aaron Marcuse-Kubitza

DwC1-DwC2 mapping: Changed input label to DwC1, which is allowed by the now relaxed label constraints imposed by union

1123 02/25/2012 01:42 PM Aaron Marcuse-Kubitza

union: Check if two maps can be combined based on whether map 0 column 0 label contains map 1 column 0 label instead of being equal. This allows map 0's input 0 root to contain the datasource name as well as a format that allows it to be combined with a more general map. Added ignore flag to not print an error if column labels don't match.

1122 02/25/2012 01:39 PM Aaron Marcuse-Kubitza

bin/map: Support optional data format tag in map spreadsheet labels, used by union to check if two maps can be combined

1121 02/25/2012 01:01 PM Aaron Marcuse-Kubitza

mappings: Added DwC1-DwC2.specimens.csv to core maps so it gets cleaned up

1120 02/25/2012 12:57 PM Aaron Marcuse-Kubitza

Only generate for_review mappings of core maps and end products

1119 02/25/2012 12:56 PM Aaron Marcuse-Kubitza

Generate DwC-VegBIEN mapping as union of DwC1 and DwC2 mappings

1118 02/24/2012 08:00 PM Aaron Marcuse-Kubitza

Generate DwC-VegBIEN mapping as union of DwC1 and DwC2 mappings

1117 02/24/2012 07:40 PM Aaron Marcuse-Kubitza

NYBG DB mapping: Removed IdentifiedDate and CollectedDate mappings because they are generated from the year/month/day

1116 02/24/2012 07:39 PM Aaron Marcuse-Kubitza

Added mappings/for_review/DwC1-VegBIEN.specimens.csv

1115 02/24/2012 07:35 PM Aaron Marcuse-Kubitza

Added DwC1-DwC mapping. Generate DwC1-VegBIEN mapping automatically.

1114 02/24/2012 07:11 PM Aaron Marcuse-Kubitza

Regenerated vegbien.ERD exports

1113 02/24/2012 07:08 PM Aaron Marcuse-Kubitza

vegbien.sql: Renamed _keys unique constraints/unique indexes to _unique to better reflect their purpose

1112 02/24/2012 06:54 PM Aaron Marcuse-Kubitza

vegbien.sql: Added method.diameterheight to store DBH height

1111 02/24/2012 06:44 PM Aaron Marcuse-Kubitza

VegBIEN: Moved plantstatus.plantlevel to plantname.rank because the taxonomic rank is a property of the name itself

1110 02/24/2012 06:43 PM Aaron Marcuse-Kubitza

PostgreSQL-MySQL.csv: Fixed custom types translation to match shorter type names

1109 02/24/2012 06:09 PM Aaron Marcuse-Kubitza

vegbien.sql: Added plantstatus unique constraint

1108 02/24/2012 06:07 PM Aaron Marcuse-Kubitza

DwC-VegBIEN mapping: Map datasource name via DwC institutionCode

1107 02/24/2012 05:42 PM Aaron Marcuse-Kubitza

Regenerated vegbien.ERD exports

1106 02/24/2012 05:40 PM Aaron Marcuse-Kubitza

vegbien.ERD.mwb: Lined up logo and legend with other ERD elements

1105 02/24/2012 05:35 PM Aaron Marcuse-Kubitza

vegbien.sql: Renamed methodgrowthform.growthformmethod_id to submethod_id. Added methodtaxonclass.submethod_id (similar to methodgrowthform.submethod_id).

1104 02/24/2012 05:27 PM Aaron Marcuse-Kubitza

vegbien.sql: Added methodgrowthform.growthformmethod_id for specifying a method used by just the growthform

1103 02/24/2012 05:14 PM Aaron Marcuse-Kubitza

vegbien.ERD.mwb: Rearranged legend to more closely match layout of ERD

1102 02/24/2012 04:51 PM Aaron Marcuse-Kubitza

vegbien.sql: Reordered plantstatus fields to put the most important fields at the top, which will be visible in the ERD

1101 02/24/2012 04:42 PM Aaron Marcuse-Kubitza

vegbien.sql: Replaced method.taxonclassincluded,taxonclassexcluded with new many:many methodtaxonclass table. Added methodgrowthform, growthform tables to do the same thing as methodtaxonclass for growth forms.

1100 02/24/2012 03:53 PM Aaron Marcuse-Kubitza

vegbien.sql: method: Added comment on reference_id

1099 02/24/2012 03:44 PM Aaron Marcuse-Kubitza

VegBIEN: Moved plotmethod fields to method because they can also apply to strata. Removed no longer used plotmethod table.

1098 02/24/2012 03:13 PM Aaron Marcuse-Kubitza

input.Makefile: input DB creation: Removed "IF NOT EXISTS" because that check is handled by $(dbExists)

1097 02/24/2012 03:02 PM Aaron Marcuse-Kubitza

input.Makefile: Don't try to recreate an input DB if it already exists

1096 02/24/2012 03:01 PM Aaron Marcuse-Kubitza

Added UArizona DB input

1095 02/24/2012 02:42 PM Aaron Marcuse-Kubitza

Renaming UArizona to UArizona-CSV because there is also a DB input in bien2_staging.ariz_raw on nimoy

1094 02/24/2012 02:31 PM Aaron Marcuse-Kubitza

Added UArizona input

1093 02/24/2012 12:46 PM Aaron Marcuse-Kubitza

env_password: Fixed bug where exit command would not cause it to exit, because pipefail shell option was not set. Moved automatic exiting of the calling script into env_password itself.

1092 02/24/2012 12:26 PM Aaron Marcuse-Kubitza

map: Exit if password not set

1091 02/24/2012 12:18 PM Aaron Marcuse-Kubitza

env_password: cc stderr if it's a log file

1090 02/23/2012 06:49 PM Aaron Marcuse-Kubitza

env_password: Print all messages to /dev/tty so the user sees them even if stderr is redirected to a log file. Exit if password not already set, because e.g. scripts run in the background will not be able to prompt for it.

1089 02/23/2012 05:32 PM Aaron Marcuse-Kubitza

input.Makefile: Don't have make import call verify, because the user often runs import as a test and will not want the output cluttered with verification information. Also, the full imports for which this was intended are often run asynchronously, so that the user will not see the output anyway.

1088 02/23/2012 05:28 PM Aaron Marcuse-Kubitza

input.Makefile: Don't abort on verification errors, which are expected during development

1087 02/21/2012 06:21 PM Aaron Marcuse-Kubitza

SALVIAS tests: Fixed invalid accepted test outputs due to not running `make empty_db` before running tests when using the no-redo optimization shortcut

1086 02/21/2012 06:14 PM Aaron Marcuse-Kubitza

SALVIAS mappings: Fixed plot key mappings to map the correct values to subplot and parent plot

1085 02/21/2012 05:36 PM Aaron Marcuse-Kubitza

vegbien.sql: locationevent: Added unique constraint for subplots based on subplot location

1084 02/21/2012 05:02 PM Aaron Marcuse-Kubitza

SALVIAS-db VegX mapping: Map subplots correctly the way SALVIAS-CSV does

1083 02/21/2012 04:54 PM Aaron Marcuse-Kubitza

SALVIAS verification: Updated to schema changes

1082 02/21/2012 04:42 PM Aaron Marcuse-Kubitza

input.Makefile: Fixed syntax error in verify %.ref target (outdated variable name)

1081 02/21/2012 04:33 PM Aaron Marcuse-Kubitza

input.Makefile: Halt psql commands on first error

1080 02/21/2012 04:27 PM Aaron Marcuse-Kubitza

vegbien.sql: Removed location.authorlocationcode because it's now stored in locationevent as an author-specific setting

1079 02/21/2012 04:24 PM Aaron Marcuse-Kubitza

vegbien.sql: locationevent: Redid unique constraints to avoid applying authorlocationcode-only duplicate elimination to subplots

1078 02/21/2012 04:16 PM Aaron Marcuse-Kubitza

SALVIAS mappings: Map SiteCode/plot_code to locationevent.authorlocationcode because locationevent is now the place to store author-specific plot information

1077 02/21/2012 04:10 PM Aaron Marcuse-Kubitza

SALVIAS mappings: Fixed PlotID mapping to go to locationevent.sourceaccessioncode

1076 02/21/2012 04:06 PM Aaron Marcuse-Kubitza

VegBIEN: Renamed locationevent.authoreventcode to authorlocationcode to reflect that datasources usually use an author-defined code for a plot rather than a plot event

1075 02/21/2012 04:03 PM Aaron Marcuse-Kubitza

vegbien.sql: locationevent: Redid unique constraints to handle datasources that treat the authoreventcode as an authorlocationcode. Eventually, authoreventcode will be renamed to authorlocationcode.

1074 02/21/2012 03:51 PM Aaron Marcuse-Kubitza

vegbien.sql: locationevent: Redid unique constraints to work properly for all fully-specified combinations of keys

1073 02/21/2012 03:31 PM Aaron Marcuse-Kubitza

VegBIEN mappings: Mapped datasource name to new project.datasource. Fixes project duplicate elimination.

1072 02/21/2012 03:16 PM Aaron Marcuse-Kubitza

vegbien.sql: Renamed project.reference_id to datasource_id and pointed it to party, to match locationevent, etc.

1071 02/21/2012 03:02 PM Aaron Marcuse-Kubitza

VegBIEN mappings: Mapped current lat/long to centerlat/long as well so location duplicate elimination will work properly

1070 02/21/2012 03:01 PM Aaron Marcuse-Kubitza

xpath.py: Added support for common subpath after split path's {}

1069 02/21/2012 01:30 PM Aaron Marcuse-Kubitza

sql.py: put(): When encountering a DuplicateKeyException, use dict_subset_right_join to fill in explicit NULL values for columns which don't have data. This causes the database to use the UNIQUE constraint's index to look up the record, instead of relying on individual column indexes for the columns that did have data, which may or may not be available.

1068 02/21/2012 01:27 PM Aaron Marcuse-Kubitza

util.py: Added DefaultDict to wrap collections.defaultdict with a simple value passed in the constructor, defaulting to None. Added dict_subset_right_join() to fill in None for subset keys that don't exist.

1067 02/21/2012 01:06 PM Aaron Marcuse-Kubitza

vegbien.sql: Added method and plotmethod UNIQUE indexes

1066 02/21/2012 01:04 PM Aaron Marcuse-Kubitza

vegbien.ERD.mwb: Removed embargo table from ERD because its functionality is provided in location.confidentialitystatus,confidentialityreason

1065 02/21/2012 12:36 PM Aaron Marcuse-Kubitza

Regenerated vegbien.ERD exports

1064 02/21/2012 12:34 PM Aaron Marcuse-Kubitza

vegbien.sql: Moved locationevent method fields to plotmethod and method. Added comments to method/plotmethod fields, as provided by Michael Lee.

1063 02/21/2012 12:15 PM Aaron Marcuse-Kubitza

VegX-VegBIEN mapping: Mapped locationevent.methodnarrative to new plotmethod table

1062 02/18/2012 07:18 PM Aaron Marcuse-Kubitza

Regenerated vegbien.ERD exports

1061 02/18/2012 07:15 PM Aaron Marcuse-Kubitza

VegX-VegBIEN mapping: Map sourceaccessioncode and voucher (catalognumber_dwc) to correct place. SALVIAS mappings: Map SourceVoucher as an alternative to coll_number.

1060 02/18/2012 06:46 PM Aaron Marcuse-Kubitza

vegbien.sql: Removed VegBank-internal tables (including user account tables) because they do not belong in the ecological database. Any web interface should store user account information, cached queries, etc. in a separate interface-specific database.

1059 02/18/2012 06:34 PM Aaron Marcuse-Kubitza

VegX-VegBIEN mapping: Mapped stem tags to new stemtag table

1058 02/18/2012 05:47 PM Aaron Marcuse-Kubitza

vegbien.sql: Renamed planttag to stemtag and made it a child of stemobservation. Removed trait table from ERD because it's not used for the purpose we want to use traits for.

1057 02/18/2012 05:05 PM Aaron Marcuse-Kubitza

vegbien.sql: Removed no longer used location.reference_id. Datasource scoping is now done on locationevent instead, so that locations can be shared across datasources that refer to the same plot or point.