  • svn:executable: *

# Date Author Comment
11970 01/20/2014 11:33 AM Aaron Marcuse-Kubitza

Moved everything into /trunk/ to create the standard svn layout, for use with tools that require this (e.g. git-svn). IMPORTANT: do NOT do an `svn up`. Instead, re-use your working copy's existing files with `svn switch` (http://svnbook.red-bean.com/en/1.6/svn.ref.svn.c.switch.html).

7405 01/31/2013 04:04 PM Aaron Marcuse-Kubitza

join: Added support for direct mappings to VegBIEN by passing through outputs that start with / (indicating an XPath rather than a term)

4637 09/12/2012 12:31 PM Aaron Marcuse-Kubitza

join: passthru mode: Fixed bug where empty join mappings needed to have the output field of the right-hand row manually set to the output field of the left-hand row for maps.merge_mappings() to work properly

4634 09/12/2012 12:00 PM Aaron Marcuse-Kubitza

join: When a join mapping exists but is empty, still include any additional columns from that mapping in the combined row

4628 09/12/2012 10:14 AM Aaron Marcuse-Kubitza

join: Added new passthru mode which passes through terms with no input mapping or no join mapping

4506 09/07/2012 09:16 AM Aaron Marcuse-Kubitza

join: Documented that it's case- and punctuation-insensitive.

4500 09/07/2012 07:26 AM Aaron Marcuse-Kubitza

join: Use new maps.simplify()

4498 09/07/2012 07:23 AM Aaron Marcuse-Kubitza

join: Match terms with non-alphanumeric chars removed

4497 09/07/2012 07:15 AM Aaron Marcuse-Kubitza

join: Match terms case-insensitively

4156 08/21/2012 07:50 AM Aaron Marcuse-Kubitza

join: quiet mode: Turn off all warnings, not just "No input mapping" warnings. This is useful when join-unioning a synonymy to a primary map, which may have "No non-empty join mapping" for some terms but this should not be stored in the resulting map's Comments column.

3924 08/09/2012 02:45 PM Aaron Marcuse-Kubitza

join: Added map_1_core_only option that uses only columns 0 and 1 of map_1. This is useful for one-time refactoring joins where the Source column, mappings comments, etc. shouldn't be part of the datasource's via map (although they will be part of the autogenerated VegBIEN map)

3923 08/09/2012 02:33 PM Aaron Marcuse-Kubitza

join: Use opts.env_usage() for usage message

3822 08/07/2012 05:05 AM Aaron Marcuse-Kubitza

join: Support discarding multiple outputs if they should be considered ambiguous

3777 08/03/2012 04:59 PM Aaron Marcuse-Kubitza

join: Fixed bug where, when an input mapped to multiple outputs, the joined row for each output needed to be output separately using writer.writerow()

3766 08/02/2012 08:32 PM Aaron Marcuse-Kubitza

join: Made it multi-safe (supports an input appearing multiple times)

1800 04/03/2012 08:08 PM Aaron Marcuse-Kubitza

join: Also print "No join mapping" warning if a join mapping was found but it was empty. The warning in that case is actually "No non-empty join mapping" to distinguish it from a mapping that's missing entirely. input.Makefile: missing_mappings: Support new "No join mapping" error message.

1766 04/03/2012 02:12 PM Aaron Marcuse-Kubitza

join: Use new maps.join_combinable() to check if column names match

1739 04/02/2012 12:21 PM Aaron Marcuse-Kubitza

join: Also produce warnings for no input mapping (if no comment explaining why no input mapping), in addition to no join mapping

1387 03/13/2012 04:07 PM Aaron Marcuse-Kubitza

join: Don't append suffix to empty output mappings, so that they stay empty ("NULL")

1355 03/11/2012 04:18 PM Aaron Marcuse-Kubitza

join: In error message when map column names don't match, include the actual column names

1283 03/05/2012 04:35 PM Aaron Marcuse-Kubitza

join: Don't raise "No join mapping" error for empty mappings because you only want the error for empty mappings for your particular dataset, which requires more information (namely, the subset of the mappings used by your dataset, some of which will not be in the mappings if standard fields have been subtracted out)

1282 03/05/2012 04:10 PM Aaron Marcuse-Kubitza

join: Fixed bug in "No join mapping" error generation where rows with no existing comments column would cause an IndexError

1170 02/27/2012 02:37 PM Aaron Marcuse-Kubitza

join: Added heuristic search for a match on a parent path, so that every XML func suffix of a path doesn't need its own mapping

939 02/11/2012 06:39 PM Aaron Marcuse-Kubitza

join: Print a warning if no join mapping found (in addition to adding this warning to the comments column)

737 01/30/2012 02:51 PM Aaron Marcuse-Kubitza

join: Merge the column labels as well

735 01/30/2012 02:06 PM Aaron Marcuse-Kubitza

join: Moved mappings-specific merge functionality into maps.merge_mappings()

734 01/30/2012 01:55 PM Aaron Marcuse-Kubitza

join: Use merge_rows() from new maps.py

732 01/30/2012 01:38 PM Aaron Marcuse-Kubitza

join: Merge comments of input map and join map

730 01/30/2012 01:03 PM Aaron Marcuse-Kubitza

join: Report which input mappings are missing a mapping in the join map

725 01/30/2012 11:21 AM Aaron Marcuse-Kubitza

join: For input mappings with no match in the join map, include them in the output map with an empty mapping

274 12/22/2011 07:02 PM Aaron Marcuse-Kubitza

Renamed util to bin

268 12/22/2011 06:39 PM Aaron Marcuse-Kubitza

Moved everything in scripts to root. inputs_Makefile: Don't run "all" when installing.

180 12/15/2011 04:16 PM Aaron Marcuse-Kubitza

join: Added usage item for repeated joins

179 12/15/2011 04:13 PM Aaron Marcuse-Kubitza

join: Changed order of args and redirects to be more intuitive

177 12/15/2011 04:08 PM Aaron Marcuse-Kubitza

Renamed join_maps to join

120 12/08/2011 01:48 PM Aaron Marcuse-Kubitza

Added Comments column with Brad's and Aaron's comments to mapping spreadsheets

67 11/30/2011 04:56 PM Aaron Marcuse-Kubitza

Moved env usage string creation to opts.py. Changed db config var names to use in/out instead of from/to.

62 11/30/2011 03:55 PM Aaron Marcuse-Kubitza

Fixed xml_xpath.get() last_only optimization to handle attrs correctly. Turned off stack traces for errors intended for the user to see.

59 11/30/2011 01:25 PM Aaron Marcuse-Kubitza

map: Added support for custom fkeys to parent in db XML trees. Removed extraneous csv reader/writer config because Excel format is default. Improved documentation.

51 11/29/2011 02:59 PM Aaron Marcuse-Kubitza

Created join_maps to join two 2-column map spreadsheets
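
The join behavior described across these revisions (case- and punctuation-insensitive matching, one output row per matching output, and an empty mapping for unmatched inputs) can be illustrated with a minimal sketch. This is not the script's actual code: the `simplify()` and `join_maps()` functions below are hypothetical stand-ins written for illustration, and the real `maps.simplify()` and `maps.merge_mappings()` may behave differently.

```python
import re

def simplify(term):
    """Hypothetical normalizer: lowercase, then drop non-alphanumeric chars."""
    return re.sub(r'[^a-z0-9]', '', term.lower())

def join_maps(map_0, map_1):
    """Join two 2-column maps given as lists of (input, output) pairs.

    Each output of map_0 is looked up among the inputs of map_1.
    Assumed behavior, based only on the revision comments above:
    - matching ignores case and punctuation
    - an input that matches several outputs yields one row per output
    - an input with no match is kept with an empty mapping
    """
    lookup = {}
    for in_, out in map_1:
        lookup.setdefault(simplify(in_), []).append(out)

    joined = []
    for in_, mid in map_0:
        outs = lookup.get(simplify(mid))
        if not outs:
            joined.append((in_, ''))  # no join mapping: keep row, empty output
            continue
        for out in outs:
            joined.append((in_, out))  # one row per matching output
    return joined

# Illustrative data only; these column names are made up.
map_0 = [('plot_id', 'Plot Name'), ('elev', 'elevation_m'), ('x', 'unknown')]
map_1 = [('plotName', '/location/name'),
         ('Elevation-M', '/location/elevation'),
         ('plotname', '/plot/code')]
result = join_maps(map_0, map_1)
```

Building the lookup once keeps each join to a single pass over both maps. Storing a list per normalized key is what lets a term that appears several times in the join map produce several output rows.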