  • svn:executable: *

# Date Author Comment
11970 01/20/2014 11:33 AM Aaron Marcuse-Kubitza

moved everything into /trunk/ to create the standard svn layout, for use with tools that require this (e.g. git-svn). IMPORTANT: do NOT do an `svn up`. instead, re-use your working copy's existing files with `svn switch` (http://svnbook.red-bean.com/en/1.6/svn.ref.svn.c.switch.html).

10283 07/14/2013 05:52 AM Aaron Marcuse-Kubitza

bugfix: bin/*: spell out [:alnum:] as [a-zA-Z0-9] because Python's re module unfortunately doesn't support POSIX character classes
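
The behavior behind this bugfix can be illustrated with a minimal sketch. It assumes Python 3 and a made-up sample string; the actual patterns used in bin/* are not shown in this log. Python's `re` module treats `[:alnum:]` as a plain set of the characters `:`, `a`, `l`, `n`, `u`, `m`, so only the explicit range matches whole alphanumeric runs.

```python
import re

# Python's re module has no POSIX bracket classes: "[:alnum:]" is parsed as an
# ordinary set containing the characters ':', 'a', 'l', 'n', 'u' and 'm'.
posix_style = re.compile(r"[:alnum:]+")

# The explicit range is what actually matches runs of ASCII letters and digits.
explicit = re.compile(r"[a-zA-Z0-9]+")

sample = "Quercus_alba 42"  # hypothetical input, not taken from the project

posix_matches = posix_style.findall(sample)
explicit_matches = explicit.findall(sample)

print(posix_matches)     # only fragments made of the letters a, l, n, u, m
print(explicit_matches)  # whole alphanumeric runs
```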

10278 07/14/2013 02:44 AM Aaron Marcuse-Kubitza

bin/*: replaced confusing regexp constructs involving \W inside [] with the much clearer explicit character class [:alnum:]. this avoids adding or subtracting from an inverted class in order to reach a subset of the corresponding positive class, because the subset can just be named explicitly instead.

7449 02/05/2013 06:12 AM Aaron Marcuse-Kubitza

canon: Raise an error if two input terms map to the same simplified string
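
One way such a collision check might look is sketched below. The `simplify` rule (lowercase, drop non-alphanumerics) and both function names are hypothetical stand-ins; canon's real simplification logic is not shown in this log.

```python
import re


def simplify(term):
    """Hypothetical simplification: lowercase and drop non-alphanumerics."""
    return re.sub(r"[^a-z0-9]", "", term.lower())


def build_canon_map(terms):
    """Map each simplified string to its term, refusing ambiguous inputs."""
    seen = {}
    for term in terms:
        key = simplify(term)
        # Two different input terms collapsing to one key would make the
        # mapping ambiguous, so fail loudly instead of overwriting.
        if key in seen and seen[key] != term:
            raise ValueError(
                "terms %r and %r both simplify to %r" % (seen[key], term, key)
            )
        seen[key] = term
    return seen
```

With this sketch, `build_canon_map(["Plot ID", "plot_id"])` raises `ValueError`, while inputs with distinct simplified forms produce an ordinary dictionary.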

4649 09/12/2012 02:04 PM Aaron Marcuse-Kubitza

canon: Canonicalize the column header instead of passing it through, in order to properly support CSVs without a header

4626 09/12/2012 08:59 AM Aaron Marcuse-Kubitza

canon, translate, filter_out_ci: Support vocabularies/dictionaries with extra columns beyond the functional column(s) used by the program. These columns can contain comments, etc. This was not originally supported because Python 2's iterable unpacking only supports "an iterable with the same number of items as there are targets in the target list" (http://docs.python.org/reference/simple_stmts.html#assignment-statements). We now use numeric array indexes instead to get around this limitation, and for consistency with other map-manipulation scripts.
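
The difference between unpacking and indexing can be sketched as follows. The function name `load_vocab`, its parameters, and the sample rows are assumptions for illustration only; the scripts' actual code is not shown in this log.

```python
import csv
import io


def load_vocab(stream, key_col=0, value_col=1):
    """Read a two-column mapping, ignoring any extra (comment) columns."""
    mapping = {}
    for row in csv.reader(stream):
        if len(row) <= max(key_col, value_col):
            continue  # skip blank or too-short rows
        # Indexing picks out only the functional columns, so a row may carry
        # any number of trailing columns. By contrast, "key, value = row"
        # raises ValueError as soon as a row has a third column.
        mapping[row[key_col]] = row[value_col]
    return mapping


# Hypothetical vocabulary: the first row has an extra comment column.
sample = io.StringIO(
    "plot id,plotID,legacy spelling\n"
    "date collected,dateCollected\n"
)
vocab = load_vocab(sample)
print(vocab)
```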

4586 09/11/2012 06:40 AM Aaron Marcuse-Kubitza

Added canon