# Date Author Comment
1019 02/17/2012 03:08 PM Aaron Marcuse-Kubitza

VegX-VegBIEN mapping: Include the datasource name (now provided by map in /_ignore/inLabel) in the appropriate places in both VegX and VegBIEN

1018 02/17/2012 02:47 PM Aaron Marcuse-Kubitza

bin/map: Removed metadata values feature since the syntax used was causing problems with mappings starting with a ":", and metadata can instead be stored as attributes of the primary key's mapping

1017 02/16/2012 06:07 PM Aaron Marcuse-Kubitza

xml_dom.py: Fixed bug in parent() where it didn't account for NodeParentIter's first element returned being the current node, not its parent. Refactored parent() to use parentNode directly, and NodeParentIter to use parent(), instead of the other way around.

1016 02/16/2012 05:38 PM Aaron Marcuse-Kubitza

xml_dom.py: Fixed bug in parent() where incorrect variable name was used

1015 02/16/2012 05:05 PM Aaron Marcuse-Kubitza

VegX-VegBIEN mapping: Use the input data source's label (e.g. SALVIAS) everywhere a reference is needed

1014 02/16/2012 05:04 PM Aaron Marcuse-Kubitza

bin/map: Store the input data source's label (e.g. SALVIAS) in the output XML tree for use by references in the mappings

1013 02/14/2012 06:57 PM Aaron Marcuse-Kubitza

xpath.py: get(): Fixed bug where it would try to create a node named . or .. if . or .. didn't have matching attributes. Now it will just reuse the current or parent node, but create any needed attrs if create is True.

1012 02/14/2012 06:52 PM Aaron Marcuse-Kubitza

util.py: Added list_eq_is() to compare two lists using is
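
The log does not show the body of list_eq_is(), so the following is only a minimal sketch of what comparing two lists "using is" could look like. The parameter names are assumptions. It treats two lists as equal only when they have the same length and each pair of elements is the same object.

```python
def list_eq_is(list0, list1):
    """Compare two lists element-wise by identity (`is`) rather than equality (`==`).

    Sketch only: the actual util.py implementation is not shown in this log.
    """
    if len(list0) != len(list1):
        return False
    return all(a is b for a, b in zip(list0, list1))


# Two distinct objects that compare equal with == but are not the same object
x = [1]
y = [1]
same = list_eq_is([x, y], [x, y])       # every element is the identical object
different = list_eq_is([x, y], [x, [1]])  # last element is equal but not identical
```

Identity comparison is useful when the elements are mutable nodes or other objects for which two "equal" instances must still be told apart.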

1011 02/14/2012 06:23 PM Aaron Marcuse-Kubitza

xpath.py: Don't allow rooted attributes (doesn't make sense), in case someone tries to do elem[/rooted_attr]

1010 02/14/2012 05:52 PM Aaron Marcuse-Kubitza

bin/map: Moved root.clear() into separate function prep_root() that can be called whenever needed

1009 02/14/2012 05:43 PM Aaron Marcuse-Kubitza

xpath.py: Added get() support for references (different from pointers) to dynamically set the value of an attribute

1008 02/14/2012 05:21 PM Aaron Marcuse-Kubitza

util.py: Added list_get()

1007 02/14/2012 05:10 PM Aaron Marcuse-Kubitza

util.py: Added is_list()

1006 02/14/2012 05:00 PM Aaron Marcuse-Kubitza

bin/map: Use var doc0_root for quick reference to doc0's root

1005 02/14/2012 04:57 PM Aaron Marcuse-Kubitza

xpath.py: get(): Go to root when empty element is encountered at the beginning of an XPath. Added allow_rooted parameter to turn off this functionality when XPaths with a leading slash should not be considered rooted.

1004 02/14/2012 04:32 PM Aaron Marcuse-Kubitza

xpath.py: Don't consider a path starting with "." to be rooted. Do this by not automatically translating an empty path name to ".".

1003 02/14/2012 04:18 PM Aaron Marcuse-Kubitza

xpath.py: Added is_rooted()

1002 02/14/2012 04:16 PM Aaron Marcuse-Kubitza

xpath.py: Added elem_is_empty()

1001 02/14/2012 04:10 PM Aaron Marcuse-Kubitza

xpath.py: Added documentation labels to each section

1000 02/14/2012 04:04 PM Aaron Marcuse-Kubitza

xpath.py: Added support for getting the parent node when encountering ".."

999 02/14/2012 04:02 PM Aaron Marcuse-Kubitza

xml_dom.py: Added parent() to get parent node without recursing past the root node to the document object. Documented that NodeParentIter incorporates this sanity check.
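
As an illustration of the sanity check described here, the sketch below uses the standard library's xml.dom.minidom. It is an assumption about how such a parent() might behave, not the code from xml_dom.py: it returns None instead of the document object when called on the root element.

```python
from xml.dom import Node, minidom


def parent(node):
    """Return node's parent element, or None if node is the root element.

    Sketch only: stops at the root instead of returning the document object.
    """
    parent_ = node.parentNode
    if parent_ is not None and parent_.nodeType == Node.DOCUMENT_NODE:
        return None
    return parent_


doc = minidom.parseString('<a><b/></a>')
root = doc.documentElement
child = root.firstChild
```

With this document, parent(child) is the root element, while parent(root) is None rather than the enclosing document object.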

998 02/14/2012 03:44 PM Aaron Marcuse-Kubitza

xpath.py: get(): Renamed parent to root to better reflect that it's the starting point for the search. Calling it parent will later be confusing when we want to get the parent node using "..".

997 02/14/2012 03:31 PM Aaron Marcuse-Kubitza

xpath.py: Added parser support for attribute values that are references to another part of the XML tree

996 02/14/2012 03:21 PM Aaron Marcuse-Kubitza

xml_func.py: Fixed module description comment to reflect that not all XML funcs generate text

995 02/14/2012 03:20 PM Aaron Marcuse-Kubitza

xml_func.py: Refactored to add funcs to the module funcs variable as they are defined. Renamed defined functions to the name of the corresponding XML function.

994 02/14/2012 03:01 PM Aaron Marcuse-Kubitza

xml_func.py: Added _ignore func to "comment out" an XML subtree

993 02/14/2012 02:55 PM Aaron Marcuse-Kubitza

input.Makefile: Fixed error message when no DB file found so that it doesn't incorrectly imply that PostgreSQL inputs are supported

992 02/14/2012 02:54 PM Aaron Marcuse-Kubitza

input.Makefile: Fixed error message when no DB file found so that it doesn't incorrectly imply that PostgreSQL inputs are supported

991 02/14/2012 02:42 PM Aaron Marcuse-Kubitza

input.Makefile: Don't run tests in verbose mode because the run time stats, etc. are not relevant

990 02/14/2012 02:41 PM Aaron Marcuse-Kubitza

bin/map: Only print error/run time stats in verbose mode. input.Makefile: Run import in verbose mode so that error/run time stats are still printed.

989 02/14/2012 02:36 PM Aaron Marcuse-Kubitza

Moved value to string conversion functions from util.py to new module format.py

988 02/14/2012 02:32 PM Aaron Marcuse-Kubitza

exc.py, profiling.py: Use util.int2str() to print # iters with thousands separators

987 02/14/2012 02:32 PM Aaron Marcuse-Kubitza

util.py: Added int2str()
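
A later revision notes that int2str() is used to print iteration counts with thousands separators. The log does not show how it is implemented; one possible sketch, assuming a fixed comma separator rather than locale-specific formatting, is:

```python
def int2str(value):
    """Format an integer with comma thousands separators.

    Sketch only: the real util.py function may be locale-aware instead.
    """
    return '{:,}'.format(value)


formatted = int2str(1234567)  # '1,234,567'
```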

986 02/14/2012 02:26 PM Aaron Marcuse-Kubitza

bin/map: Document that the exit status is the # of errors in the import, up to the maximum exit status
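
The capping rule described here can be sketched as follows. The log does not state the maximum value, so MAX_EXIT_STATUS = 255 is an assumption (the usual limit for 8-bit exit statuses on POSIX systems), and the function name is hypothetical.

```python
MAX_EXIT_STATUS = 255  # assumption: 8-bit exit status, as on POSIX systems


def errors_to_exit_status(n_errors):
    """Map an error count to an exit status, capped at the maximum (sketch only)."""
    return min(n_errors, MAX_EXIT_STATUS)


few = errors_to_exit_status(3)      # 3
many = errors_to_exit_status(1000)  # capped at 255
```

A caller would pass the result to sys.exit(); capping avoids the count wrapping around to a misleading value such as 0 when there are more errors than the status can represent.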

985 02/14/2012 02:24 PM Aaron Marcuse-Kubitza

exc.py: Generalize ExTracker to not just print the # of errors at exit. Instead, provide an exit() method that the ExTracker creator can call at exit to set the exit status to the # of errors. This fixes the Python bug where a benign error message was printed if SystemExit was raised in an atexit function.

984 02/14/2012 02:15 PM Aaron Marcuse-Kubitza

bin/map: Set ExPercentTracker's iter_text. Start ExPercentTracker after input processing, because errors in command line options should just end the program and don't need to be tracked.

983 02/14/2012 02:13 PM Aaron Marcuse-Kubitza

exc.py: ExPercentTracker: Added ability to set custom iter_text, similar to ItersProfiler

982 02/14/2012 02:07 PM Aaron Marcuse-Kubitza

bin/map: Use profiling.ItersProfiler. Refactored input row count calculation to have each function aggregate and return the row count, and then display the row count and statistics that depend on it at the end of the program.

981 02/14/2012 02:06 PM Aaron Marcuse-Kubitza

Added profiling.py to time operations and provide the user with statistical information

980 02/14/2012 02:03 PM Aaron Marcuse-Kubitza

util.py: Added basic to_si() to add SI prefix to value

979 02/14/2012 01:02 PM Aaron Marcuse-Kubitza

util.py: Added format_str() to use locale-specific formatting settings, including thousands separator. Use it in to_percent().

978 02/14/2012 12:48 PM Aaron Marcuse-Kubitza

bin/map: Use new ExPercentTracker to print error rate (% of # rows) when program exits

977 02/14/2012 12:47 PM Aaron Marcuse-Kubitza

exc.py: Added ExPercentTracker to track errors as % of iterations

976 02/14/2012 12:47 PM Aaron Marcuse-Kubitza

util.py: Added to_percent()

975 02/14/2012 12:07 PM Aaron Marcuse-Kubitza
974 02/14/2012 12:02 PM Aaron Marcuse-Kubitza

exc.py: print_ex(): Declare emph param as a keyword param instead of popping it from **format

973 02/14/2012 11:54 AM Aaron Marcuse-Kubitza

exc.py: print_ex(): Declare emph param as a keyword param instead of popping it from **format

972 02/13/2012 04:38 PM Aaron Marcuse-Kubitza

inputs/SALVIAS/maps/VegX.organisms.csv: Mapped OrigSpecies and OrigGenus combined to new plantlevel Binomial

971 02/13/2012 04:35 PM Aaron Marcuse-Kubitza

xpath.py: Fixed bug where value of XPath (used for copying to other branches) is retrieved after first XPath element is popped rather than before, which can sometimes leave an empty XPath for value() to run on

970 02/13/2012 03:33 PM Aaron Marcuse-Kubitza

mappings/DwC-VegBIEN.specimens.csv: Fixed bien.vegetation mapping to point to commconcept->commname. Fixed bien.substrate mapping to point to locationevent.landscapenarrative.

969 02/13/2012 03:25 PM Aaron Marcuse-Kubitza

inputs/NYBG/maps/DwC.specimens.csv: Mapped CoordinatePrecision using _noCV

968 02/13/2012 03:24 PM Aaron Marcuse-Kubitza

xml_func.py: Added _noCV func to check that non-ratio-scale data does not contain CV values

967 02/13/2012 02:34 PM Aaron Marcuse-Kubitza

mappings/DwC-VegBIEN.specimens.csv: Fixed locality fields mapping to go to location.locationnarrative

966 02/13/2012 02:05 PM Aaron Marcuse-Kubitza

input.Makefile: For all input types, including DB, import each table in a separate map invocation

965 02/13/2012 01:58 PM Aaron Marcuse-Kubitza

xml_func.py: _range: Treat a None from or to value as an unknown (a la SQL NULL) and return None instead of raising a SyntaxException

964 02/13/2012 01:57 PM Aaron Marcuse-Kubitza

xml_dom.py: NodeTextEntryIter: Convert empty entries (including entries containing error comments) to None

963 02/13/2012 01:48 PM Aaron Marcuse-Kubitza

xml_dom.py: replace(): Added support for new node that's None (deletes existing node)

962 02/13/2012 12:59 PM Aaron Marcuse-Kubitza

xml_func.py: Put SyntaxException's cause on same line as error message so that the whole error is treated as distinct by error_stats

961 02/13/2012 12:47 PM Aaron Marcuse-Kubitza

Added errors_filter_before and errors_filter_after to prepare `map` error messages for easy filtering and then restore line breaks

960 02/13/2012 12:46 PM Aaron Marcuse-Kubitza

error_stats: Fixed to work on Mac

959 02/13/2012 11:26 AM Aaron Marcuse-Kubitza

error_stats: Simplified to use uniq --count option

958 02/13/2012 11:21 AM Aaron Marcuse-Kubitza

input.Makefile: Print error message if no input file found (for file input type). This fixes a bug where map would just take input from stdin when no input file redirect or input DB env vars were specified.

957 02/13/2012 11:10 AM Aaron Marcuse-Kubitza

map: Map standard DB names to original DB names on nimoy

956 02/13/2012 10:29 AM Aaron Marcuse-Kubitza

Regenerated vegbien.ERD exports

955 02/13/2012 10:26 AM Aaron Marcuse-Kubitza

vegbien.sql: Added methodtrait and methodtraitname tables

954 02/13/2012 10:24 AM Aaron Marcuse-Kubitza

PostgreSQL-MySQL.csv: Handle array types

953 02/13/2012 09:54 AM Aaron Marcuse-Kubitza

vegbien.ERD.mwb: Recolored plant tables to all have the same color, distinct from the occurrence color

952 02/13/2012 09:32 AM Aaron Marcuse-Kubitza

mappings/VegX-VegBIEN.organisms.csv: Added mappings for SALVIAS fields with no join mapping. This fixes the last of the "no join mapping" errors.

951 02/13/2012 09:27 AM Aaron Marcuse-Kubitza

input.Makefile: svn_props: Set svn:ignore on maps subdirs

950 02/13/2012 09:26 AM Aaron Marcuse-Kubitza

inputs/SALVIAS-CSV/maps/VegX.plots.csv: Fixed mappings without a join mapping in VegX-VegBIEN.*.csv

949 02/13/2012 09:20 AM Aaron Marcuse-Kubitza

VegX mappings: Gentry DBH mapping: Use VegX's attribute and method tables

948 02/13/2012 09:05 AM Aaron Marcuse-Kubitza

mappings/VegX-VegBIEN.organisms.csv: Removed no longer used mapping to taxondetermination.determinationdate. This also prevents ever creating a taxondetermination without a plantconcept.

947 02/11/2012 07:31 PM Aaron Marcuse-Kubitza

bin/map: Added redo option to control whether the database is emptied before inserting new data. Can be used to turn off emptying the DB in test mode, because this is often slow and is not needed if you are running tests on an empty testing database.

946 02/11/2012 07:25 PM Aaron Marcuse-Kubitza

opts.py: env_flag(): Added support for default value if unset

945 02/11/2012 07:25 PM Aaron Marcuse-Kubitza

opts.py: env_flag(): Added support for default value if unset

944 02/11/2012 07:16 PM Aaron Marcuse-Kubitza

bin/map: Use env_flag()'s new env_names usage support to print flags usage

943 02/11/2012 07:16 PM Aaron Marcuse-Kubitza

opts.py: Added env_names usage support to env_flag()

942 02/11/2012 07:05 PM Aaron Marcuse-Kubitza

mappings/VegX-VegBIEN.organisms.csv: Removed no longer needed mapping for taxonDetermination/note

941 02/11/2012 07:03 PM Aaron Marcuse-Kubitza

inputs/SALVIAS-CSV/maps/VegX.organisms.csv: Map cfaff to taxonConcept/fit, which maps to taxondetermination.taxonFit

940 02/11/2012 06:59 PM Aaron Marcuse-Kubitza

inputs/SALVIAS/maps/VegX.organisms.csv: Map cfaff to taxonConcept/fit, which maps to taxondetermination.taxonFit

939 02/11/2012 06:39 PM Aaron Marcuse-Kubitza

join: Print a warning if no join mapping found (in addition to adding this warning to the comments column)

938 02/11/2012 06:27 PM Aaron Marcuse-Kubitza

Removed no longer needed inputs/NYBG/maps/VegX.organisms.csv because NYBG is now mapped via DwC

937 02/11/2012 06:26 PM Aaron Marcuse-Kubitza

mappings/VegX-VegBIEN.organisms.csv: Removed mappings used only by NYBG, because NYBG now maps via DwC

936 02/11/2012 06:11 PM Aaron Marcuse-Kubitza

Added ch_root_via to transform a map spreadsheet to use a different root, using a connecting root that links the input and output roots together

935 02/11/2012 06:02 PM Aaron Marcuse-Kubitza

Added cols to select columns from a spreadsheet

934 02/11/2012 06:02 PM Aaron Marcuse-Kubitza

util.py: Added list_subset()

933 02/11/2012 05:42 PM Aaron Marcuse-Kubitza

ch_root: Fixed detection of unset env vars so that usage message is printed when any option is missing

932 02/11/2012 05:41 PM Aaron Marcuse-Kubitza

opts.py: Call an error handler if an env var isn't set

931 02/11/2012 05:40 PM Aaron Marcuse-Kubitza

util.py: Added noop() and and_(), function wrappers for statements

930 02/11/2012 05:07 PM Aaron Marcuse-Kubitza

inputs/NYBG-CSV: Map via DwC

929 02/11/2012 04:50 PM Aaron Marcuse-Kubitza

Added subtract to subtract map spreadsheets

928 02/11/2012 04:49 PM Aaron Marcuse-Kubitza

ch_root: Ignore empty lines

927 02/11/2012 04:06 PM Aaron Marcuse-Kubitza

Added intersect to intersect two map spreadsheets

926 02/11/2012 03:48 PM Aaron Marcuse-Kubitza

union: Clarified overwrite order of inputs in description

925 02/11/2012 03:31 PM Aaron Marcuse-Kubitza

Removed no longer needed mappings/review

924 02/11/2012 03:31 PM Aaron Marcuse-Kubitza

mappings/Makefile: Regenerate for_review maps automatically when a map changes

923 02/11/2012 03:17 PM Aaron Marcuse-Kubitza

mappings/review: Generalized to convert all mappings to VegBIEN, not just a specific listed set (which was out of date)

922 02/11/2012 03:16 PM Aaron Marcuse-Kubitza

mappings/for_review/DwC-VegBIEN.specimens.csv: Regenerated

921 02/11/2012 03:12 PM Aaron Marcuse-Kubitza

inputs/NYBG/maps/DwC.specimens.csv: Fixed CollectedDate mapping to use the _date XML func

920 02/11/2012 03:07 PM Aaron Marcuse-Kubitza

DwC mappings: Mapped Substrate and Vegetation