# Date Author Comment
109 12/06/2011 04:19 PM Aaron Marcuse-Kubitza

Added psql_vegbank to easily access vegbank db from the command line

108 12/06/2011 04:07 PM Aaron Marcuse-Kubitza

Ignore OpenOffice lock files in mappings

107 12/06/2011 04:05 PM Aaron Marcuse-Kubitza

Added SALVIAS data CSVs and accepted test output

106 12/06/2011 03:52 PM Aaron Marcuse-Kubitza

test/map: Expanded to include all input CSVs in test/input

105 12/06/2011 03:31 PM Aaron Marcuse-Kubitza

Removed unneeded joins dir

104 12/06/2011 03:30 PM Aaron Marcuse-Kubitza

Moved VegBank mapping joins to main mappings dir so they would have similar paths for the upcoming all-sources tester

103 12/06/2011 03:11 PM Aaron Marcuse-Kubitza

Moved test scripts and files from util to test

102 12/06/2011 02:50 PM Aaron Marcuse-Kubitza

xml_func.py: Added _namepart function for extracting parts of names

101 12/06/2011 02:11 PM Aaron Marcuse-Kubitza

Finished NYBG mapping to VegBank!

100 12/06/2011 02:04 PM Aaron Marcuse-Kubitza

test_map: Added debug option to print VegBank XML instead of importing it into the database

99 12/06/2011 01:34 PM Aaron Marcuse-Kubitza

xpath.py: Created is_positive() function

98 12/06/2011 01:28 PM Aaron Marcuse-Kubitza

Further refinements to mappings to support database constraints

97 12/06/2011 01:27 PM Aaron Marcuse-Kubitza

xpath.py: Added support for negative attribute assertions with !

96 12/06/2011 10:54 AM Aaron Marcuse-Kubitza

Changed mappings to use keys vs. attrs

95 12/06/2011 10:53 AM Aaron Marcuse-Kubitza

xpath.py: Fixed creation of attrs so it happens even when node already exists

94 12/06/2011 09:59 AM Aaron Marcuse-Kubitza

xpath.py: Added concept of keys vs attrs in XPath elem

93 12/05/2011 05:25 PM Aaron Marcuse-Kubitza

Started filling in required values for VegBank fields in mappings. Will need to refactor to move these to metadata for the datasources.

92 12/05/2011 05:24 PM Aaron Marcuse-Kubitza

Now allow empty rows. Added support for select statement limit.

91 12/05/2011 04:17 PM Aaron Marcuse-Kubitza

Added support for quoted values in XPaths

90 12/05/2011 04:02 PM Aaron Marcuse-Kubitza

Fixed name XML function. Fixed accept_test_output.

89 12/05/2011 03:59 PM Aaron Marcuse-Kubitza

Added support for name XML function. Added error handling for empty rows.

88 12/05/2011 03:28 PM Aaron Marcuse-Kubitza

Made it easier to accept test output

87 12/05/2011 03:18 PM Aaron Marcuse-Kubitza

Added NYBG stemCount metadata

86 12/05/2011 03:11 PM Aaron Marcuse-Kubitza

Added xml_func.py to process mappings whose output needs postprocessing

85 12/05/2011 01:53 PM Aaron Marcuse-Kubitza

Changed VegBank mappings to use XML functions (not implemented yet) to calculate averages and ranges

84 12/05/2011 01:25 PM Aaron Marcuse-Kubitza

Added support for mapping datasource metadata

83 12/05/2011 12:53 PM Aaron Marcuse-Kubitza

Changed for loops to use enumerate() where the index is also needed

82 12/05/2011 12:50 PM Aaron Marcuse-Kubitza

Moved XPath prep code (setting ID, value) to xpath.py

81 12/02/2011 05:27 PM Aaron Marcuse-Kubitza

xpath.py: Added deepcopy() before setting value of other branches to traverse

80 12/02/2011 05:12 PM Aaron Marcuse-Kubitza

NYSpecimenDataAmericas.test.xml: Updated for new NYBG-VegX.organisms.csv

79 12/02/2011 05:11 PM Aaron Marcuse-Kubitza

NYBG-VegX.organisms.csv: Changed voucher (primary key) column to be UniqueNYInternalRecordNumber because CatalogNumber contained an empty value

78 12/02/2011 05:10 PM Aaron Marcuse-Kubitza

xpath.py: Added basic support for split paths

77 12/02/2011 04:30 PM Aaron Marcuse-Kubitza

Merged xml_xpath.py into xpath.py in preparation for changing the XPath parse tree to be the XML DOM tree itself

76 12/02/2011 03:58 PM Aaron Marcuse-Kubitza

Refactored xpath.parse() to use a nested function instead of a class extending Parser

75 12/02/2011 03:04 PM Aaron Marcuse-Kubitza

map: Fixed mislocated import for Parser.SyntaxException

74 12/02/2011 02:21 PM Aaron Marcuse-Kubitza

Removed SALVIAS voucher_string mapping per conference call discussion

73 12/02/2011 02:16 PM Aaron Marcuse-Kubitza

map: Fixed bugs to enable mapping straight from CSV to a database. Still need a way to set plot.authorPlotCode for specimens data.

72 12/02/2011 12:05 PM Aaron Marcuse-Kubitza

Fixed ch_map_root to support subpaths which follow the root by -> rather than /. Changed spreadsheet syntax to have : between label and root.

71 12/01/2011 01:55 PM Aaron Marcuse-Kubitza

Updated extract_plot_map to use new name for VegX-VegBank mapping and re-ran it and join_all_vegbank

70 12/01/2011 01:51 PM Aaron Marcuse-Kubitza

Finished VegX-VegBank mapping and created VegBank joins of mappings to VegX

69 12/01/2011 11:53 AM Aaron Marcuse-Kubitza

Finished ch_map_root (renamed from submap)

68 11/30/2011 05:36 PM Aaron Marcuse-Kubitza

Added submap and extract_plot_map to extract plot subpaths from VegX-VegBank.csv

67 11/30/2011 04:56 PM Aaron Marcuse-Kubitza

Moved env usage string creation to opts.py. Changed db config var names to use in/out instead of from/to.

66 11/30/2011 04:24 PM Aaron Marcuse-Kubitza

Keep *.test.xml out of version control

65 11/30/2011 04:22 PM Aaron Marcuse-Kubitza

Moved options-processing code to opts.py: Added opts.py

64 11/30/2011 04:21 PM Aaron Marcuse-Kubitza

Moved options-processing code to opts.py

63 11/30/2011 04:04 PM Aaron Marcuse-Kubitza

test_map: Compares generated XML to correct version

62 11/30/2011 03:55 PM Aaron Marcuse-Kubitza

Fixed xml_xpath.get() last_only optimization to handle attrs correctly. Turned off stack traces for errors intended for the user to see.

61 11/30/2011 02:32 PM Aaron Marcuse-Kubitza

Changed mappings to place prefix common to all XPaths in the column header

60 11/30/2011 01:31 PM Aaron Marcuse-Kubitza

simplify_xpath: Made it case-insensitive

59 11/30/2011 01:25 PM Aaron Marcuse-Kubitza

map: Added support for custom fkeys to parent in db XML trees. Removed extraneous csv reader/writer config because Excel format is default. Improved documentation.

58 11/29/2011 05:36 PM Aaron Marcuse-Kubitza

map: Added stub for database input

57 11/29/2011 05:33 PM Aaron Marcuse-Kubitza

map: Added more stubs for XML-XML mapping

56 11/29/2011 05:15 PM Aaron Marcuse-Kubitza

Started adding XML-XML mapping support to map

55 11/29/2011 04:43 PM Aaron Marcuse-Kubitza

Split off xpath.py XML functionality into xml_xpath.py

54 11/29/2011 04:28 PM Aaron Marcuse-Kubitza

map: Using SystemExit for usage errors to avoid stack trace

53 11/29/2011 04:22 PM Aaron Marcuse-Kubitza

Merged data2xml and xml2db into map

52 11/29/2011 03:03 PM Aaron Marcuse-Kubitza

Removed trailing whitespace from VegX-VegBank.csv map

51 11/29/2011 02:59 PM Aaron Marcuse-Kubitza

Created join_maps to join two 2-column map spreadsheets

50 11/29/2011 02:11 PM Aaron Marcuse-Kubitza

Renamed mappings to be compatible with Redmine allowed characters in attachment filenames

49 11/29/2011 01:59 PM Aaron Marcuse-Kubitza

Added refactored mappings and changed data2xml to use the new 2-column format

48 11/29/2011 01:25 PM Aaron Marcuse-Kubitza

Refactored db_xml.py's db insertion function to avoid extra nested functions

47 11/29/2011 01:06 PM Aaron Marcuse-Kubitza

Added README.TXT

46 11/29/2011 01:02 PM Aaron Marcuse-Kubitza

Renamed modules to remove _util

45 11/29/2011 12:47 PM Aaron Marcuse-Kubitza

Added svn:ignore for *.pyc

44 11/29/2011 12:42 PM Aaron Marcuse-Kubitza

Renamed xml2db_ and data2xml_ to remove _

43 11/29/2011 12:42 PM Aaron Marcuse-Kubitza

Moved scripts to main directory and associated files to util

42 11/29/2011 12:31 PM Aaron Marcuse-Kubitza

Moved Python modules to shared lib folder

41 11/28/2011 05:32 PM Aaron Marcuse-Kubitza

xml2db: Started refactoring xml2db() to support getting as well as inserting data

40 11/28/2011 05:29 PM Aaron Marcuse-Kubitza

xml2db: Started refactoring xml2db() to support getting as well as inserting data

39 11/28/2011 05:05 PM Aaron Marcuse-Kubitza

xml2db: Changed to return ID (pkey) of inserted record and use this returned value as parent_id instead of getting the parent_id from the parent XML node

38 11/28/2011 03:16 PM Aaron Marcuse-Kubitza

data2xml: Added syntax for split paths, which map to multiple leaves

37 11/28/2011 01:52 PM Aaron Marcuse-Kubitza

xml2db: Improved empty_db to use TRUNCATE instead of DROP DATABASE. Added xml2vegbank to automatically set db env vars.

36 11/28/2011 01:51 PM Aaron Marcuse-Kubitza

data2xml: Improved syntax for XPath lookahead assertions. Changed XML printing to print multiple text nodes on separate lines.

35 11/28/2011 12:15 PM Aaron Marcuse-Kubitza

Moved vegbank_example_ver1.0.2.xml to xml2db, where it should have been

34 11/23/2011 05:22 PM Aaron Marcuse-Kubitza

data2xml: Small correction to NYBG mapping

33 11/23/2011 04:58 PM Aaron Marcuse-Kubitza

data2xml: Created simplify_xpath script to remove duplication from XPath expressions

32 11/23/2011 04:15 PM Aaron Marcuse-Kubitza

data2xml: Added support for * abbrs for backward (child-to-parent) pointers

31 11/23/2011 02:52 PM Aaron Marcuse-Kubitza

In data2xml, fixed determination of which nesting level to put IDs on

30 11/23/2011 02:45 PM Aaron Marcuse-Kubitza

Simplified expansion of * abbrs

29 11/23/2011 02:23 PM Aaron Marcuse-Kubitza

Removed no longer necessary strip() from node value getter

28 11/23/2011 02:22 PM Aaron Marcuse-Kubitza

Added patch for xml.dom.minidom.Element.writexml to avoid adding extra whitespace around text nodes

27 11/23/2011 12:45 PM Aaron Marcuse-Kubitza

Added pointer field name abbreviations to data2xml and NYBG mappings

26 11/22/2011 04:35 PM Aaron Marcuse-Kubitza

In data2xml, fixed pointer handling to deal with pointer targets that are themselves pointers

25 11/22/2011 04:01 PM Aaron Marcuse-Kubitza

In data2xml, added shortcut for lookahead assertion using ! symbol

24 11/22/2011 02:32 PM Aaron Marcuse-Kubitza

In data2xml, fixed backward (child-to-parent) pointer handling to get and set attribute values properly

23 11/22/2011 01:52 PM Aaron Marcuse-Kubitza

In data2xml, fixed xpath.get() to do last_only optimization properly for pointer targets

22 11/22/2011 01:32 PM Aaron Marcuse-Kubitza

In data2xml, added support for XPath pointers

21 11/21/2011 05:48 PM Aaron Marcuse-Kubitza

Merged data2xml XPath functionality into xpath.py. Merged data2xml xml_dom.py and xml2db xml_util.py into identical xml_util.py for each script.

20 11/21/2011 04:50 PM Aaron Marcuse-Kubitza

Added empty_db script to reset the vegbank database after running xml2db/test in commit mode

19 11/21/2011 02:01 PM Aaron Marcuse-Kubitza

Changed xml2db and vegbank db to be owned by new user vegbank

18 11/18/2011 05:38 PM Aaron Marcuse-Kubitza

Changed xml2db and data2xml to help standardize mapping to different XML formats

17 11/18/2011 02:48 PM Aaron Marcuse-Kubitza

Added DROP DATABASE and CREATE DATABASE to vegbank.sql

16 11/18/2011 12:52 PM Aaron Marcuse-Kubitza

Changed xml2db to use primarily node contents to determine whether a node is a field or a child table

15 11/17/2011 04:42 PM Aaron Marcuse-Kubitza

Changed xml2db to use the first column in a table as its primary key

14 11/17/2011 04:08 PM Aaron Marcuse-Kubitza

Changed xml2db to avoid inserting duplicate rows

13 11/17/2011 03:31 PM Aaron Marcuse-Kubitza

Initial version of xml2db. Doesn't yet handle all duplicate rows correctly.

12 11/17/2011 11:29 AM Aaron Marcuse-Kubitza

Removed .pyc files

11 11/17/2011 10:29 AM Aaron Marcuse-Kubitza

Added BIEN 3 scripts

10 12/09/2010 01:35 PM Matt Wheeler

Added ability to change the vegx node names to be different from the postgres table names.
This was the easiest way to change the postgres table names when the vegx names are
not useable for some reason. This requires that the node names be altered in the xsd...