# Date Author Comment
932 02/11/2012 05:41 PM Aaron Marcuse-Kubitza

opts.py: Call an error handler if an env var isn't set

931 02/11/2012 05:40 PM Aaron Marcuse-Kubitza

util.py: Added function wrappers for statements noop() and and_()
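
The log does not show the code added in this revision. As a rough sketch, assuming only that noop() stands in for the `pass` statement and and_() for the `and` operator, such wrappers might look like the following (the signatures are guesses, not the actual util.py definitions):

```python
def noop(*args, **kwargs):
    """Do nothing, like the `pass` statement, but usable as a callback."""
    return None


def and_(a, b):
    """Function form of the `and` operator.

    Unlike the operator, both operands are evaluated before the call,
    so there is no short-circuiting.
    """
    return a and b
```

Wrappers like these are useful wherever a callable is required, for example as a default handler argument or as the function passed to functools.reduce().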

930 02/11/2012 05:07 PM Aaron Marcuse-Kubitza

inputs/NYBG-CSV: Map via DwC

929 02/11/2012 04:50 PM Aaron Marcuse-Kubitza

Added subtract to subtract map spreadsheets

928 02/11/2012 04:49 PM Aaron Marcuse-Kubitza

ch_root: Ignore empty lines

927 02/11/2012 04:06 PM Aaron Marcuse-Kubitza

Added intersect to intersect two map spreadsheets

926 02/11/2012 03:48 PM Aaron Marcuse-Kubitza

union: Clarified overwrite order of inputs in description

925 02/11/2012 03:31 PM Aaron Marcuse-Kubitza

Removed no longer needed mappings/review

924 02/11/2012 03:31 PM Aaron Marcuse-Kubitza

mappings/Makefile: Regenerate for_review maps automatically when a map changes

923 02/11/2012 03:17 PM Aaron Marcuse-Kubitza

mappings/review: Generalized to convert all mappings to VegBIEN, not just a specific listed set (which was out of date)

922 02/11/2012 03:16 PM Aaron Marcuse-Kubitza

mappings/for_review/DwC-VegBIEN.specimens.csv: Regenerated

921 02/11/2012 03:12 PM Aaron Marcuse-Kubitza

inputs/NYBG/maps/DwC.specimens.csv: Fixed CollectedDate mapping to use the _date XML func

920 02/11/2012 03:07 PM Aaron Marcuse-Kubitza

DwC mappings: Mapped Substrate and Vegetation

919 02/11/2012 02:54 PM Aaron Marcuse-Kubitza

DwC mappings: Mapped BoundingBox, footprintWKT to location.dsgpoly

918 02/11/2012 02:44 PM Aaron Marcuse-Kubitza

DwC mappings: Mapped Notes and PlantFungusDescription to bien.specimenDescription, merged together

917 02/11/2012 02:41 PM Aaron Marcuse-Kubitza

xml_func.py: Added _merge and _label XML funcs

916 02/11/2012 02:12 PM Aaron Marcuse-Kubitza

vegbien.sql: Added specimenreplicate.description

915 02/11/2012 02:01 PM Aaron Marcuse-Kubitza

DwC mappings: Corrected mappings for CollectorNumber, individualID, recordNumber, and fieldNumber

914 02/11/2012 01:31 PM Aaron Marcuse-Kubitza

DwC mappings: Mapped infraspecificEpithet to/from Subspecies

913 02/11/2012 01:22 PM Aaron Marcuse-Kubitza

DwC-VegBIEN mapping: merge decimalLatitude/Longitude with verbatimLatitude/Longitude

912 02/10/2012 05:55 PM Aaron Marcuse-Kubitza

Regenerated vegbien.ERD exports

911 02/10/2012 05:53 PM Aaron Marcuse-Kubitza

vegbien.ERD.mwb: Fixed lines. Repositioned elements.

910 02/10/2012 05:41 PM Aaron Marcuse-Kubitza

vegbien.sql: Added locationevent.parent_id for subplot events to point to their parent plot event

909 02/10/2012 05:21 PM Aaron Marcuse-Kubitza

vegbien.sql: Added plantconceptscope.namedplace_id

908 02/10/2012 05:14 PM Aaron Marcuse-Kubitza

vegbien.sql: Added taxonoccurrence.method_id

907 02/10/2012 05:00 PM Aaron Marcuse-Kubitza

vegbien.sql: Renamed plotmethod to method

906 02/10/2012 04:47 PM Aaron Marcuse-Kubitza

vegbien.sql: Added locationdetermination.identifier_id

905 02/10/2012 04:39 PM Aaron Marcuse-Kubitza

vegbien.ERD.mwb: Fixed lines

904 02/10/2012 04:35 PM Aaron Marcuse-Kubitza

VegBIEN: Renamed locationplace to locationdetermination

903 02/10/2012 04:18 PM Aaron Marcuse-Kubitza

vegbien.sql: Removed morphospecies, instead using plantconcept and new table plantconceptscope to represent scoped morphospecies. Reorganized plant* tables in ERD.

902 02/10/2012 03:41 PM Aaron Marcuse-Kubitza

vegbien.ERD.mwb: Added plantcorrelation table to ERD

901 02/09/2012 07:08 PM Aaron Marcuse-Kubitza

mappings/VegX-VegBIEN.organisms.csv: Fixed TaxonomicRankBelowSubspeciesEnum mapping to use plantname instead of placing the name directly inside plantconcept. This bug was probably left over from the switch from linking directly to plantname rather than to a plantconcept which has a plantname.

900 02/09/2012 06:54 PM Aaron Marcuse-Kubitza

Renamed get_errors to error_stats and added counts of each error message

899 02/09/2012 06:24 PM Aaron Marcuse-Kubitza

Regenerated vegbien.ERD exports

898 02/09/2012 06:22 PM Aaron Marcuse-Kubitza

vegbien.sql: Added plotmethod table

897 02/07/2012 05:22 PM Aaron Marcuse-Kubitza

input.Makefile: Renamed $(test) variable (containing the command to run a test) so it wouldn't conflict with test environment variable

896 02/07/2012 05:10 PM Aaron Marcuse-Kubitza

mappings to VegBIEN: Don't map notes to taxondetermination because some organisms have notes but no species name

895 02/07/2012 05:08 PM Aaron Marcuse-Kubitza

mappings/Makefile: Fixed bug in rules for cleaning up core maps whenever they change, which had the target and prerequisite reversed

894 02/07/2012 04:39 PM Aaron Marcuse-Kubitza

input.Makefile: nolog option defaults to on when test is on

893 02/07/2012 04:36 PM Aaron Marcuse-Kubitza

input.Makefile: Fixed bug where no log file was being created, even when nolog was off

892 02/07/2012 04:11 PM Aaron Marcuse-Kubitza

Replaced all type(...) == str with util.is_str(...) to properly treat Unicode objects as strings

891 02/07/2012 04:08 PM Aaron Marcuse-Kubitza

xml_dom.py: minidom.Element.write_opening(): Use new Attr.__str__() method for printing attributes

890 02/07/2012 04:06 PM Aaron Marcuse-Kubitza

bin/map: For XML inputs, use xpath.get()'s new multiple result support to iterate over elements matching the root, rather than just iterating over the first matching element's siblings. This fixes the broken 2-step tests for stems, which didn't filter by the root's attributes.

889 02/07/2012 04:04 PM Aaron Marcuse-Kubitza

xpath.py: get(): Fixed bug where limit was not passed to recursive get() calls

888 02/07/2012 04:02 PM Aaron Marcuse-Kubitza

xml_dom.py: by_tag_name(): Iterate forwards over children unless last_only optimization turned on. Added Attr.__str__() and repr() for debug-printing Attrs.

887 02/07/2012 03:10 PM Aaron Marcuse-Kubitza

xpath.py: get(): Turn off last_only optimization when limit > 1

886 02/07/2012 03:07 PM Aaron Marcuse-Kubitza

xpath.py: get(): Added full support for returning multiple matches

885 02/07/2012 02:33 PM Aaron Marcuse-Kubitza

xpath.py: get(): Added basic structure for returning multiple matches. Added limit parameter to select one or many matches.

884 02/07/2012 02:13 PM Aaron Marcuse-Kubitza

input.Makefile: test/VegBIEN.%.2-step.xml: Use the core map specific to the test's table instead of the main core map

883 02/07/2012 02:10 PM Aaron Marcuse-Kubitza

bin/map: Print error if map root not found in XML input

882 02/07/2012 01:52 PM Aaron Marcuse-Kubitza

mappings/: Removed mappings used by old tester

881 02/07/2012 01:48 PM Aaron Marcuse-Kubitza

inputs/SALVIAS/maps/VegX.*.csv: Replaced symlinks with actual files

880 02/07/2012 01:45 PM Aaron Marcuse-Kubitza

Removed old tester

879 02/07/2012 01:44 PM Aaron Marcuse-Kubitza

Switched to using inputs/test as main test target

878 02/07/2012 01:38 PM Aaron Marcuse-Kubitza

Added SALVIAS-CSV tests

877 02/07/2012 01:31 PM Aaron Marcuse-Kubitza

Added NYBG-CSV tests

876 02/07/2012 01:28 PM Aaron Marcuse-Kubitza

input.Makefile: Run separate tests for each map spreadsheet (input table) rather than all tables at once. This will make it possible to test CSV inputs, which have one CSV per table.

875 02/07/2012 12:54 PM Aaron Marcuse-Kubitza

Added NYBG-CSV input

874 02/07/2012 12:47 PM Aaron Marcuse-Kubitza

inputs/Makefile: Fixed forwarding of empty targets to subdirs

873 02/07/2012 12:20 PM Aaron Marcuse-Kubitza

Regenerated vegbien.ERD exports

872 02/07/2012 12:17 PM Aaron Marcuse-Kubitza

vegbien.sql: Added morphospecies table

871 02/07/2012 11:53 AM Aaron Marcuse-Kubitza

vegbien.ERD.mwb: Fixed lines

870 02/07/2012 11:45 AM Aaron Marcuse-Kubitza

vegbien.sql: Removed the taxonOccurrence:aggregateOccurrence 1:1 constraint

869 02/06/2012 05:22 PM Aaron Marcuse-Kubitza

sql.py: truncate(): Use run_raw_query() instead of run_query() because truncate() does not use the recover functionality of run_query(). Also, in the profiling output, this separates the "normal" SQL statements (which use run_query()) from the "core" SQL statements (which use run_raw_query()).

868 02/06/2012 05:09 PM Aaron Marcuse-Kubitza

vegbien.sql: Added indexes for each field in party used in duplicate elimination (for use by sql.put()'s DuplicateKeyException handler)

867 02/06/2012 04:58 PM Aaron Marcuse-Kubitza

sql.py: run_raw_query(): In debug mode, print query after params have been substituted in

866 02/06/2012 04:55 PM Aaron Marcuse-Kubitza

sql.py: Fixed index_cols() to handle UNIQUE indexes with expressions, whose column names are stored in a different format

865 02/06/2012 04:04 PM Aaron Marcuse-Kubitza

sql.py: Print warning if SELECT statement missing a WHERE, LIMIT, or OFFSET clause. Changed bin/map DB input get-all-rows statement to pass start=0 to suppress this warning for that statement.

864 02/06/2012 03:51 PM Aaron Marcuse-Kubitza

db_xml.py: Added start option to get() that passes through to sql.select()

863 02/06/2012 03:50 PM Aaron Marcuse-Kubitza

sql.py: Added start option to select() to set the OFFSET

862 02/06/2012 03:27 PM Aaron Marcuse-Kubitza

sql.py: If run_raw_query.debug flag is set, print each query executed (on a single line)

861 02/06/2012 03:25 PM Aaron Marcuse-Kubitza

strings.py: Added one_line() function to make a string all on one line

860 02/06/2012 03:20 PM Aaron Marcuse-Kubitza

strings.py: Renamed one_line() to remove_extra_newl() to better reflect what it does

859 02/06/2012 03:19 PM Aaron Marcuse-Kubitza

bin/map: Don't print Done after an action in debug logging mode because it messes up newlines when more debugging info is printed right after it

858 02/06/2012 02:45 PM Aaron Marcuse-Kubitza

input.Makefile: Added nolog option to disable creating a log file, e.g. for debugging runs

857 02/06/2012 01:57 PM Aaron Marcuse-Kubitza

xml_dom.py: Remove extra newlines from single-line strings (bin/map doesn't need to do this itself anymore)

856 02/06/2012 01:56 PM Aaron Marcuse-Kubitza

strings.py: Added is_multiline() and one_line() for removing extra newlines from single-line strings
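
One possible reading of this entry, sketched below, treats a string as single-line when its only newlines are trailing ones. That interpretation and both function bodies are assumptions; the real strings.py may behave differently.

```python
def is_multiline(text):
    """Return True if text has a newline anywhere other than at its end."""
    return "\n" in text.rstrip("\n")


def one_line(text):
    """Strip trailing newlines from a single-line string.

    Multi-line strings are returned unchanged.
    """
    if is_multiline(text):
        return text
    return text.rstrip("\n")
```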

855 02/06/2012 01:48 PM Aaron Marcuse-Kubitza

bin/map: In debug mode, print input XPath's XML tree all on one line

854 02/06/2012 01:22 PM Aaron Marcuse-Kubitza

sql.py: Switched try_insert() to use index_cols() instead of constraint_cols() for "duplicate key value violates unique constraint" errors because they can also be generated by UNIQUE indexes (and there is a UNIQUE index for every UNIQUE constraint)

853 02/06/2012 01:14 PM Aaron Marcuse-Kubitza

sql.py: Added index_cols() to get cols used by an index (similar to constraint_cols())

852 02/06/2012 01:00 PM Aaron Marcuse-Kubitza

vegbien.sql: Fixed duplicate elimination for party to use a UNIQUE index with COALESCE for nullable fields
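
The reason for COALESCE here is that a unique index treats NULL values as distinct from one another, so rows that match except for NULLs in the same columns are not rejected as duplicates. The sketch below illustrates the idea using Python's built-in sqlite3 module as a stand-in for PostgreSQL; the column names givenname and surname, the index name, and the insert_party() helper are hypothetical and not taken from vegbien.sql.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE party (givenname TEXT, surname TEXT)")

# Without COALESCE, two otherwise identical rows with a NULL in the same
# column would both be accepted, because NULLs never compare equal.
# Mapping NULL to '' inside the index makes such rows collide.
conn.execute(
    "CREATE UNIQUE INDEX party_unique "
    "ON party (COALESCE(givenname, ''), COALESCE(surname, ''))"
)


def insert_party(givenname, surname):
    """Return True if the row was inserted, False if it was a duplicate."""
    try:
        conn.execute("INSERT INTO party VALUES (?, ?)", (givenname, surname))
        return True
    except sqlite3.IntegrityError:
        return False
```

One trade-off of this approach is that a NULL and an empty string are treated as the same value for duplicate detection.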

851 02/06/2012 01:00 PM Aaron Marcuse-Kubitza

sql.py: Fixed bug in try_insert() where DuplicateKeyException was passed only cols0 instead of cols array

850 02/06/2012 12:49 PM Aaron Marcuse-Kubitza

vegbien.ERD.mwb: Fixed lines

849 02/06/2012 12:30 PM Aaron Marcuse-Kubitza

Added get_errors to select just the error messages from `map` output

848 02/06/2012 11:59 AM Aaron Marcuse-Kubitza

Added profile_stats to analyze a profiling statistics file

847 02/06/2012 11:59 AM Aaron Marcuse-Kubitza

bin/map: Added profile_to option which turns on profiling to the specified file
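
How bin/map implements this option is not shown in the log. A minimal sketch of the general technique, using the standard cProfile and pstats modules, could look like this; main(), run(), and print_stats() are placeholder names, and the workload is a stand-in for the real mapping run.

```python
import cProfile
import os
import pstats
import tempfile


def main():
    # Placeholder workload standing in for the real mapping run
    return sum(i * i for i in range(10000))


def run(profile_to=None):
    """Run main(), optionally writing profiling statistics to profile_to."""
    if profile_to is None:
        return main()
    profiler = cProfile.Profile()
    try:
        return profiler.runcall(main)
    finally:
        # Write the statistics even if main() raised an exception
        profiler.dump_stats(profile_to)


def print_stats(path, limit=10):
    """Print the most expensive calls recorded in a statistics file."""
    stats = pstats.Stats(path)
    stats.sort_stats("cumulative").print_stats(limit)
    return stats


if __name__ == "__main__":
    stats_file = os.path.join(tempfile.mkdtemp(), "map.prof")
    run(profile_to=stats_file)
    print_stats(stats_file)
```

Keeping the entry point in a main() function behind the `__main__` guard is what lets a profiler call it directly instead of launching the script as a subprocess.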

846 02/06/2012 11:13 AM Aaron Marcuse-Kubitza

bin/map: Added "if __name__ == '__main__': main()" idiom so file can be included as well as run. This will be useful for profiling.

845 02/06/2012 11:12 AM Aaron Marcuse-Kubitza

dates.py: Fixed strftime() to pad years and days with leading zeros as datetime.strftime() does

844 02/06/2012 11:01 AM Aaron Marcuse-Kubitza

dates.py: Work around strftime() bug that can't deal with 2/29 on a leap year

843 02/06/2012 10:44 AM Aaron Marcuse-Kubitza

xml_func.py: Added FormatException for SyntaxExceptions generated by strftime() (which are often Python bugs)

842 02/06/2012 09:46 AM Aaron Marcuse-Kubitza

Added schemas/vegbank.revised.sql. Initial version has all "character varying" types replaced with text.

841 02/06/2012 09:45 AM Aaron Marcuse-Kubitza

vegbien.sql: Replaced all "character varying" types with text, removing the length limits. Note that in PostgreSQL, text and "character varying" are stored the same way internally, so this does not affect performance or indexes.

840 02/04/2012 05:28 PM Aaron Marcuse-Kubitza

xml_dom.py: Added documentation labels to each section

839 02/04/2012 05:21 PM Aaron Marcuse-Kubitza

xml_dom.py: Fixed bug in NodeTextEntryIter where an entry containing an element instead of a text node would be returned as the whole entry, instead of the value of the entry

838 02/04/2012 04:56 PM Aaron Marcuse-Kubitza

bin/map: Added support for starting import at a specific row. Refactored row-processing code with and without a map to use a common process_rows() function (with the previous process_rows() being renamed to map_rows()).

837 02/04/2012 04:14 PM Aaron Marcuse-Kubitza

bin/map: Use new util.cast()

836 02/04/2012 04:14 PM Aaron Marcuse-Kubitza

util.py: Added cast() to cast a value while passing None through
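
As an illustration of the behavior described, a cast helper that passes None through might be written as follows. The argument order is an assumption; the actual util.cast() signature is not shown in the log.

```python
def cast(type_, val):
    """Convert val to type_, but return None unchanged."""
    if val is None:
        return None
    return type_(val)
```

This avoids errors such as int(None) raising TypeError when an optional value, for example an unset command-line option, has not been supplied.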

835 02/04/2012 04:09 PM Aaron Marcuse-Kubitza

bin/map: Print row # of rows with errors

834 02/04/2012 03:22 PM Aaron Marcuse-Kubitza

sql.py: Fixed error in pkey() where recover was not passed as a named parameter to run_query()

833 02/04/2012 03:12 PM Aaron Marcuse-Kubitza

sql.py: Added documentation labels to each section