Project

General

Profile

Statistics
| Revision:

# Date Author Comment
120 12/08/2011 01:48 PM Aaron Marcuse-Kubitza

Added Comments column with Brad's and Aaron's comments to mapping spreadsheets

119 12/07/2011 05:01 PM Aaron Marcuse-Kubitza

Added stub for SALVIAS database test

118 12/07/2011 05:00 PM Aaron Marcuse-Kubitza

test/map: Added support for database input

117 12/07/2011 04:14 PM Aaron Marcuse-Kubitza

Preparing map to input from DB

116 12/07/2011 03:32 PM Aaron Marcuse-Kubitza

Started preparing map to input from DB

115 12/07/2011 01:27 PM Aaron Marcuse-Kubitza

xml_func.py: Added optimization to first check if function name starts with _ before looking it up in the table

114 12/07/2011 12:24 PM Aaron Marcuse-Kubitza

Added _alt functions for mappings to VegBank authorPlotCode

113 12/07/2011 12:17 PM Aaron Marcuse-Kubitza

xml_func.py: Added _alt function to choose between alternative values and used it for the collector plantName mapping

112 12/07/2011 11:54 AM Aaron Marcuse-Kubitza

VegX-VegBank mapping: Added mapping from taxonName/Simple (NYBG ScientificName) to collector plantName so that collector plantName will always have a value

111 12/07/2011 11:27 AM Aaron Marcuse-Kubitza

xml_func.py: Added support for decimal years (with day as the fraction)

110 12/07/2011 11:16 AM Aaron Marcuse-Kubitza

test/map: Added echoing of commands run

109 12/06/2011 04:19 PM Aaron Marcuse-Kubitza

Added psql_vegbank to easily access vegbank db from the command line

108 12/06/2011 04:07 PM Aaron Marcuse-Kubitza

Ignore OpenOffice lock files in mappings

107 12/06/2011 04:05 PM Aaron Marcuse-Kubitza

Added SALVIAS data CSVs and accepted test output

106 12/06/2011 03:52 PM Aaron Marcuse-Kubitza

test/map: Expanded to include all input CSVs in test/input

105 12/06/2011 03:31 PM Aaron Marcuse-Kubitza

Removed unneeded joins dir

104 12/06/2011 03:30 PM Aaron Marcuse-Kubitza

Moved VegBank mapping joins to main mappings dir so they would have similar paths for the upcoming all-sources tester

103 12/06/2011 03:11 PM Aaron Marcuse-Kubitza

Moved test scripts and files from util to test

102 12/06/2011 02:50 PM Aaron Marcuse-Kubitza

xml_func.py: Added _namepart function for extracting parts of names

101 12/06/2011 02:11 PM Aaron Marcuse-Kubitza

Finished NYBG mapping to VegBank\!

100 12/06/2011 02:04 PM Aaron Marcuse-Kubitza

test_map: Added debug option to print VegBank XML instead of importing it into the database

99 12/06/2011 01:34 PM Aaron Marcuse-Kubitza

xpath.py: Created is_positive() function

98 12/06/2011 01:28 PM Aaron Marcuse-Kubitza

Further refinements to mappings to support database constraints

97 12/06/2011 01:27 PM Aaron Marcuse-Kubitza

xpath.py: Added support for negative attribute assertions with !

96 12/06/2011 10:54 AM Aaron Marcuse-Kubitza

Changed mappings to use keys vs. attrs

95 12/06/2011 10:53 AM Aaron Marcuse-Kubitza

xpath.py: Fixed creation of attrs so it happens even when node already exists

94 12/06/2011 09:59 AM Aaron Marcuse-Kubitza

xpath.py: Added concept of keys vs attrs in XPath elem

93 12/05/2011 05:25 PM Aaron Marcuse-Kubitza

Started filling in required values for VegBank fields in mappings. Will need to refactor to move these to metadata for the datasources.

92 12/05/2011 05:24 PM Aaron Marcuse-Kubitza

Now allow empty rows. Added support for select statement limit.

91 12/05/2011 04:17 PM Aaron Marcuse-Kubitza

Added support for quoted values in XPaths

90 12/05/2011 04:02 PM Aaron Marcuse-Kubitza

Fixed name XML function. Fixed accept_test_output.

89 12/05/2011 03:59 PM Aaron Marcuse-Kubitza

Added support for name XML function. Added error handling for empty rows.

88 12/05/2011 03:28 PM Aaron Marcuse-Kubitza

Made it easier to accept test output

87 12/05/2011 03:18 PM Aaron Marcuse-Kubitza

Added NYBG stemCount metadata

86 12/05/2011 03:11 PM Aaron Marcuse-Kubitza

Added xml_func.py to process mappings whose output needs postprocessing

85 12/05/2011 01:53 PM Aaron Marcuse-Kubitza

Changed VegBank mappings to use XML functions (not implemented yet) to calculate averages and ranges

84 12/05/2011 01:25 PM Aaron Marcuse-Kubitza

Added support for mapping datasource metadata

83 12/05/2011 12:53 PM Aaron Marcuse-Kubitza

Changed for loops to use enumerate() where the index is also needed

82 12/05/2011 12:50 PM Aaron Marcuse-Kubitza

Moved XPath prep code (setting ID, value) to xpath.py

81 12/02/2011 05:27 PM Aaron Marcuse-Kubitza

xpath.py: Added deepcopy() before setting value of other branches to traverse

80 12/02/2011 05:12 PM Aaron Marcuse-Kubitza

NYSpecimenDataAmericas.test.xml: Updated for new NYBG-VegX.organisms.csv

79 12/02/2011 05:11 PM Aaron Marcuse-Kubitza

NYBG-VegX.organisms.csv: Changed voucher (primary key) column to be UniqueNYInternalRecordNumber because CatalogNumber contained an empty value

78 12/02/2011 05:10 PM Aaron Marcuse-Kubitza

xpath.py: Added basic support for split paths

77 12/02/2011 04:30 PM Aaron Marcuse-Kubitza

Merged xml_xpath.py into xpath.py in preparation for changing the XPath parse tree to be the XML DOM tree itself

76 12/02/2011 03:58 PM Aaron Marcuse-Kubitza

Refactored xpath.parse() to use a nested function instead of a class extending Parser

75 12/02/2011 03:04 PM Aaron Marcuse-Kubitza

map: Fixed mislocated import for Parser.SyntaxException

74 12/02/2011 02:21 PM Aaron Marcuse-Kubitza

Removed SALVIAS voucher_string mapping per conference call discussion

73 12/02/2011 02:16 PM Aaron Marcuse-Kubitza

map: Fixed bugs to enable mapping straight from CSV to a database. Still need a way to set plot.authorPlotCode for specimens data.

72 12/02/2011 12:05 PM Aaron Marcuse-Kubitza

Fixed ch_map_root to support subpaths which follow the root by -> rather than /. Changed spreadsheet syntax to have : between label and root.

71 12/01/2011 01:55 PM Aaron Marcuse-Kubitza

Updated extract_plot_map to use new name for VegX-VegBank mapping and re-ran it and join_all_vegbank

70 12/01/2011 01:51 PM Aaron Marcuse-Kubitza

Finished VegX-VegBank mapping and created VegBank joins of mappings to VegX

69 12/01/2011 11:53 AM Aaron Marcuse-Kubitza

Finished ch_map_root (renamed from submap)

68 11/30/2011 05:36 PM Aaron Marcuse-Kubitza

Added submap and extract_plot_map to extract plot subpaths from VegX-VegBank.csv

67 11/30/2011 04:56 PM Aaron Marcuse-Kubitza

Moved env usage string creation to opts.py. Changed db config var names to use in/out instead of from/to.

66 11/30/2011 04:24 PM Aaron Marcuse-Kubitza

Keep *.test.xml out of version control

65 11/30/2011 04:22 PM Aaron Marcuse-Kubitza

Moved options-processing code to opts.py: Added opts.py

64 11/30/2011 04:21 PM Aaron Marcuse-Kubitza

Moved options-processing code to opts.py

63 11/30/2011 04:04 PM Aaron Marcuse-Kubitza

test_map: Compares generated XML to correct version

62 11/30/2011 03:55 PM Aaron Marcuse-Kubitza

Fixed xml_xpath.get() last_only optimization to handle attrs correctly. Turned off stack traces for errors intended for the user to see.

61 11/30/2011 02:32 PM Aaron Marcuse-Kubitza

Changed mappings to place prefix common to all XPaths in the column header

60 11/30/2011 01:31 PM Aaron Marcuse-Kubitza

simplify_xpath: Made it case-insensitive

59 11/30/2011 01:25 PM Aaron Marcuse-Kubitza

map: Added support for custom fkeys to parent in db XML trees. Removed extraneous csv reader/writer config because Excel format is default. Improved documentation.

58 11/29/2011 05:36 PM Aaron Marcuse-Kubitza

map: Added stub for database input

57 11/29/2011 05:33 PM Aaron Marcuse-Kubitza

map: Added more stubs for XML-XML mapping

56 11/29/2011 05:15 PM Aaron Marcuse-Kubitza

Started adding XML-XML mapping support to map

55 11/29/2011 04:43 PM Aaron Marcuse-Kubitza

Split off xpath.py XML functionality into xml_xpath.py

54 11/29/2011 04:28 PM Aaron Marcuse-Kubitza

map: Using SystemExit for usage errors to avoid stack trace

53 11/29/2011 04:22 PM Aaron Marcuse-Kubitza

Merged data2xml and xml2db into map

52 11/29/2011 03:03 PM Aaron Marcuse-Kubitza

Removed trailing whitespace from VegX-VegBank.csv map

51 11/29/2011 02:59 PM Aaron Marcuse-Kubitza

Created join_maps to join two 2-column map spreadsheets

50 11/29/2011 02:11 PM Aaron Marcuse-Kubitza

Renamed mappings to be compatible with Redmine allowed characters in attachment filenames

49 11/29/2011 01:59 PM Aaron Marcuse-Kubitza

Added refactored mappings and changed data2xml to use the new 2-column format

48 11/29/2011 01:25 PM Aaron Marcuse-Kubitza

Refactored db_xml.py's db insertion function to avoid extra nested functions

47 11/29/2011 01:06 PM Aaron Marcuse-Kubitza

Added README.TXT

46 11/29/2011 01:02 PM Aaron Marcuse-Kubitza

Renamed modules to remove _util

45 11/29/2011 12:47 PM Aaron Marcuse-Kubitza

Added svn:ignore for *.pyc

44 11/29/2011 12:42 PM Aaron Marcuse-Kubitza

Renamed xml2db_ and data2xml_ to remove _

43 11/29/2011 12:42 PM Aaron Marcuse-Kubitza

Moved scripts to main directory and associated files to util

42 11/29/2011 12:31 PM Aaron Marcuse-Kubitza

Moved Python modules to shared lib folder

41 11/28/2011 05:32 PM Aaron Marcuse-Kubitza

xml2db: Started refactoring xml2db() to support getting as well as inserting data

40 11/28/2011 05:29 PM Aaron Marcuse-Kubitza

xml2db: Started refactoring xml2db() to support getting as well as inserting data

39 11/28/2011 05:05 PM Aaron Marcuse-Kubitza

xml2db: Changed to return ID (pkey) of inserted record and use this returned value as parent_id instead of getting the parent_id from the parent XML node

38 11/28/2011 03:16 PM Aaron Marcuse-Kubitza

data2xml: Added syntax for split paths, which map to multiple leaves

37 11/28/2011 01:52 PM Aaron Marcuse-Kubitza

xml2db: Improved empty_db to use TRUNCATE instead of DROP DATABASE. Added xml2vegbank to automatically set db env vars.

36 11/28/2011 01:51 PM Aaron Marcuse-Kubitza

data2xml: Improved syntax for XPath lookahead assertions. Changed XML printing to print multiple text nodes on separate lines.

35 11/28/2011 12:15 PM Aaron Marcuse-Kubitza

Moved vegbank_example_ver1.0.2.xml to xml2db, where it should have been

34 11/23/2011 05:22 PM Aaron Marcuse-Kubitza

data2xml: Small correction to NYBG mapping

33 11/23/2011 04:58 PM Aaron Marcuse-Kubitza

data2xml: Created simplify_xpath script to remove duplication from XPath expressions

32 11/23/2011 04:15 PM Aaron Marcuse-Kubitza

data2xml: Added support for * abbrs for backward (child-to-parent) pointers

31 11/23/2011 02:52 PM Aaron Marcuse-Kubitza

In data2xml, fixed determination of which nesting level to put IDs on

30 11/23/2011 02:45 PM Aaron Marcuse-Kubitza

Simplified expansion of * abbrs

29 11/23/2011 02:23 PM Aaron Marcuse-Kubitza

Removed no longer necessary strip() from node value getter

28 11/23/2011 02:22 PM Aaron Marcuse-Kubitza

Added patch for xml.dom.minidom.Element.writexml to avoid adding extra whitespace around text nodes

27 11/23/2011 12:45 PM Aaron Marcuse-Kubitza

Added pointer field name abbreviations to data2xml and NYBG mappings

26 11/22/2011 04:35 PM Aaron Marcuse-Kubitza

In data2xml, fixed pointer handling to deal with pointer targets that are themselves pointers

25 11/22/2011 04:01 PM Aaron Marcuse-Kubitza

In data2xml, added shortcut for lookahead assertion using ! symbol

24 11/22/2011 02:32 PM Aaron Marcuse-Kubitza

In data2xml, fixed backward (child-to-parent) pointer handling to get and set attribute values properly

23 11/22/2011 01:52 PM Aaron Marcuse-Kubitza

In data2xml, fixed xpath.get() to do last_only optimization properly for pointer targets

22 11/22/2011 01:32 PM Aaron Marcuse-Kubitza

In data2xml, added support for XPath pointers

21 11/21/2011 05:48 PM Aaron Marcuse-Kubitza

Merged data2xml XPath functionality into xpath.py. Merged data2xml xml_dom.py and xml2db xml_util.py into identical xml_util.py for each script.