Project

General

Profile

Statistics
| Revision:

# Date Author Comment
1820 04/09/2012 06:41 PM Aaron Marcuse-Kubitza

inputs/CTFS/test: Accepted initial test outputs. A lot of leaves are still unmapped with the default mappings.

1819 04/09/2012 06:40 PM Aaron Marcuse-Kubitza

inputs/CTFS/maps: Added initial maps

1818 04/09/2012 06:39 PM Aaron Marcuse-Kubitza

VegX mappings: taxonConcept mappings: Added "tcs:" namespace prefix to appropriate elements. This will make the taxonConcept XPaths compatible with CTFS VegX.

1817 04/09/2012 06:13 PM Aaron Marcuse-Kubitza

input.Makefile: Maps building: full via maps (maps/$(via).%.full.csv): $(makeFullCsv): Sort all maps so that rows are re-ordered whether or not a core self map exists. This way, if a core self map is created, it will not cause the sort order of the generated via-format XMLs to change. This makes it easier to accept any changes to test outputs that result from adding a core self map.

1816 04/09/2012 05:53 PM Aaron Marcuse-Kubitza

mappings/Makefile: VegX: Added VegX.self.organisms.csv. Added root attrs to chRoot maps, commented out since it's not ready to be checked in yet.

1815 04/09/2012 05:34 PM Aaron Marcuse-Kubitza

xpath.py: get(): Run xml_dom.by_tag_name() with ignore_namespace=False (possibly later set to True)

1814 04/09/2012 05:32 PM Aaron Marcuse-Kubitza

xml_dom.py: Comments: Added clean_comment() and mk_comment(). Searching child nodes: by_tag_name(): Added ignore_namespace option to ignore namespace of node name.

1813 04/09/2012 05:26 PM Aaron Marcuse-Kubitza

root Makefile: Added %-remake target

1812 04/09/2012 04:53 PM Aaron Marcuse-Kubitza

mappings/Makefile: Renamed joinMaps to dwcMaps and chrootMaps to vegxMaps. Added commented-out code to create VegX.self.organisms.csv (not ready to check in yet because it affects many dependent maps).

1811 04/09/2012 02:52 PM Aaron Marcuse-Kubitza

input.Makefile: Removed no longer needed $(noEmptyMap)

1810 04/09/2012 12:40 PM Aaron Marcuse-Kubitza

xml_func.py: process(): Use new xml_dom.mk_comment()

1809 04/09/2012 12:40 PM Aaron Marcuse-Kubitza

xml_dom.py: Added clean_comment() and mk_comment() to properly sanitize comment contents (comments can't contain '--')

1808 04/09/2012 12:14 PM Aaron Marcuse-Kubitza

Added inputs/TRTE

1807 04/03/2012 08:26 PM Aaron Marcuse-Kubitza

inputs/QMOR/test: Added initial accepted test outputs

1806 04/03/2012 08:26 PM Aaron Marcuse-Kubitza

inputs/QMOR/maps: Added maps

1805 04/03/2012 08:20 PM Aaron Marcuse-Kubitza

Added inputs/QMOR

1804 04/03/2012 08:14 PM Aaron Marcuse-Kubitza

inputs/MT/test: Added initial accepted test outputs

1803 04/03/2012 08:14 PM Aaron Marcuse-Kubitza

inputs/MT/maps: Added maps

1802 04/03/2012 08:13 PM Aaron Marcuse-Kubitza

mappings/Makefile: DwC-VegBIEN.specimens.csv: Don't call remove_empty to produce it, because join now deals with empty mappings correctly by still raising a warning. Removed no longer needed intermediate DwC.ci-VegBIEN.specimens.csv.

1801 04/03/2012 08:09 PM Aaron Marcuse-Kubitza

join: Also print "No join mapping" warning if a join mapping was found but it was empty. The warning in that case is actually "No non-empty join mapping" to distinguish it from a mapping that's missing entirely. input.Makefile: missing_mappings: Support new "No join mapping" error message.

1800 04/03/2012 08:08 PM Aaron Marcuse-Kubitza

join: Also print "No join mapping" warning if a join mapping was found but it was empty. The warning in that case is actually "No non-empty join mapping" to distinguish it from a mapping that's missing entirely. input.Makefile: missing_mappings: Support new "No join mapping" error message.

1799 04/03/2012 07:33 PM Aaron Marcuse-Kubitza

Added inputs/MT

1798 04/03/2012 07:26 PM Aaron Marcuse-Kubitza

Added disown_all to disown all running jobs

1797 04/03/2012 07:26 PM Aaron Marcuse-Kubitza

stop_imports: Call jobspecs relative to $selfDir, rather than assuming it will be run from the svn root dir

1796 04/03/2012 07:18 PM Aaron Marcuse-Kubitza

union: Call maps.merge_headers() using **dict(prefer=header_num) instead of just prefer=header_num in order to work on Python 2.5.2 (which nimoy is running)

1795 04/03/2012 07:00 PM Aaron Marcuse-Kubitza

inputs/ACAD/test: Accepted initial test outputs

1794 04/03/2012 07:00 PM Aaron Marcuse-Kubitza

Added inputs/ACAD/maps/ maps

1793 04/03/2012 06:59 PM Aaron Marcuse-Kubitza

Accepted new test outputs resulting from the addition of the id -> occurrenceID mapping in mappings/DwC1-DwC2.specimens.csv

1792 04/03/2012 06:57 PM Aaron Marcuse-Kubitza

inputs/SALVIAS*/maps: Cleaned up maps for the first time since all via maps became subject to cleanup

1791 04/03/2012 06:55 PM Aaron Marcuse-Kubitza

input.Makefile: Removed no longer needed default "maps/.$(via).%.csv.last_cleanup" rule

1790 04/03/2012 06:54 PM Aaron Marcuse-Kubitza

input.Makefile: Maps building: Via maps cleanup: Added `env ignore=1` since with the switch to subtracting $(coreMap), all inputs will attempt to subtract some map, even if it's not subtractable

1789 04/03/2012 06:47 PM Aaron Marcuse-Kubitza

input.Makefile: Don't clean src maps, only build them

1788 04/03/2012 06:45 PM Aaron Marcuse-Kubitza

inputs/ARIZ/maps/DwC.specimens.csv: Re-cleaned up to take advantage of additional entries now removed by subtract

1787 04/03/2012 06:36 PM Aaron Marcuse-Kubitza

input.Makefile: Maps building: Via maps cleanup: Subtract $(coreMap) instead of $(coreSelfMap) so that entries whose input and output maps to the same place are subtracted as well

1786 04/03/2012 06:35 PM Aaron Marcuse-Kubitza

subtract: Also remove mappings whose input and output maps to the same non-empty value in map_1

1785 04/03/2012 06:32 PM Aaron Marcuse-Kubitza

util.py: Added all_equal(), all_equal_ignore_none(), have_same_value()

1784 04/03/2012 05:45 PM Aaron Marcuse-Kubitza

mappings/DwC1-DwC2.specimens.csv: Added id -> occurrenceID mapping

1783 04/03/2012 05:43 PM Aaron Marcuse-Kubitza

inputs/SALVIAS-CSV/maps/VegX.%.full.csv: Regenerated using new src maps

1782 04/03/2012 05:41 PM Aaron Marcuse-Kubitza

mappings/DwC1-DwC2.specimens.csv: Added mappings from dcterms elements without namespace to with namespace

1781 04/03/2012 05:40 PM Aaron Marcuse-Kubitza

inputs/SALVIAS-CSV: Built maps/src.%.csv

1780 04/03/2012 05:24 PM Aaron Marcuse-Kubitza

Added inputs/ACAD/maps/src.specimens.csv

1779 04/03/2012 05:23 PM Aaron Marcuse-Kubitza

input.Makefile: Maps building: Autogen src maps with known table names. Sources: $(withCatSrcs): Fixed bug where substitution pattern did not contain %.

1778 04/03/2012 05:22 PM Aaron Marcuse-Kubitza

Added src_map to make a source map spreadsheet from a CSV header

1777 04/03/2012 04:32 PM Aaron Marcuse-Kubitza

input.Makefile: Split Maps section into "Existing maps discovery" and "Maps building" sections. Sources: Added cat, cat-% to cat out sources.

1776 04/03/2012 04:17 PM Aaron Marcuse-Kubitza

input.Makefile: Factored out sources-related code to new Sources section

1775 04/03/2012 04:08 PM Aaron Marcuse-Kubitza

input.Makefile: $(srcMaps): Removed `$(filter-out maps/src.join.%.csv,...)` because maps/src.join.%.csv are no longer created

1774 04/03/2012 03:47 PM Aaron Marcuse-Kubitza

README.TXT: Schema changes: Split updating graphical ERD exports into separate section. Update graphical ERD exports: Added schemas/vegbien.ERD.core.pdf .

1773 04/03/2012 03:42 PM Aaron Marcuse-Kubitza

README.TXT: Added Datasource setup section with instructions to add a new datasource

1772 04/03/2012 03:38 PM Aaron Marcuse-Kubitza

Added inputs/ACAD

1771 04/03/2012 03:37 PM Aaron Marcuse-Kubitza

input.Makefile: Only setSvnIgnore the input dir, since it already exists and doesn't need to be added (inputs/Makefile adds it)

1770 04/03/2012 03:23 PM Aaron Marcuse-Kubitza

inputs/*/maps/DwC.specimens.csv: Removed extranenous XML meta info from DwC column root, since it now just needs to be present in the core via map mappings/DwC-VegBIEN.specimens.csv

1769 04/03/2012 03:22 PM Aaron Marcuse-Kubitza

union: Use new maps.merge_headers() to write properly combined header

1768 04/03/2012 03:21 PM Aaron Marcuse-Kubitza

maps.py: join_combinable(): Fixed roots_combinable() to run on col names instead of roots, which were passed in. merge_mappings(): Factored out mapping column combining into merge_mapping_cols(), which handles an optional prefer param as well to take the header_num env var. Added merge_headers().

1767 04/03/2012 03:17 PM Aaron Marcuse-Kubitza

util.py: Added sort_by_len(), shortest(), longest()

1766 04/03/2012 02:12 PM Aaron Marcuse-Kubitza

join: Use new maps.join_combinable() to check if column names match

1765 04/03/2012 02:11 PM Aaron Marcuse-Kubitza

maps.py: Added cols_combinable() and use it in combinable(). Added join_combinable() and associates helper functions. Added documentation labels to each section.

1764 04/03/2012 01:13 PM Aaron Marcuse-Kubitza

xml_parse.py: ConsecXmlInputStream: Removed read() because that's now defined in streams.FilterStream

1763 04/03/2012 01:11 PM Aaron Marcuse-Kubitza

xml_parse.py: parse_next(): Strip control characters from input stream because they mess up the parser

1762 04/03/2012 01:10 PM Aaron Marcuse-Kubitza

streams.py: FilterStream: Forward all reads to readline()

1761 04/03/2012 01:08 PM Aaron Marcuse-Kubitza

strings.py: Added is_ctrl() and strip_ctrl()

1760 04/03/2012 08:34 AM Aaron Marcuse-Kubitza

xml_parse.py: parse_next(): On parser error, advance to next XML document since the rest of the current document is corrupted

1759 04/03/2012 08:33 AM Aaron Marcuse-Kubitza

streams.py: Added consume(). Added documentation labels to each section.

1758 04/03/2012 08:23 AM Aaron Marcuse-Kubitza

bin/map: For XML inputs, wrap sys.stdin in a LineCountStream and use new xml_parse.docs_iter() on_error() to add input line # to XML parsing exceptions

1757 04/03/2012 08:21 AM Aaron Marcuse-Kubitza

xml_parse.py: Added on_error() handler to parse_next() (passed through by docs_iter()), so that the caller can add useful info like the input line # to the exception message, and decide not to suppress rather than re-raising the exception

1756 04/03/2012 07:19 AM Aaron Marcuse-Kubitza

VegX-VegBIEN.organisms.csv: Renamed individualOrganismObservation user-defined field identificationLabel2 to identificationLabel. Distinguish what are now two identificationLabel fields of the same name by tagging each one with [@id=2] or [@id=1]. inputs/SALVIAS-CSV/maps/VegX.organisms.csv: Merge tag1/stem_tag1 and tag2/stem_tag2 using _alt, since they are never set to different values when both are not NULL (although sometimes just one or just the other is not NULL).

1755 04/02/2012 05:37 PM Aaron Marcuse-Kubitza

VegX-VegBIEN.organisms.csv: Renamed individualOrganismObservation user-defined field tag2 to identificationLabel2 to reflect that it will become a second instance of identificationLabel

1754 04/02/2012 05:31 PM Aaron Marcuse-Kubitza

VegX-VegBIEN.organisms.csv: Re-mapped individualOrganismObservation user-defined field lineCover to already existing volumeCanopy

1753 04/02/2012 05:29 PM Aaron Marcuse-Kubitza

VegX-VegBIEN.organisms.csv: Re-mapped individualOrganismObservation user-defined field cover to already existing attribute.coverPercent

1752 04/02/2012 05:13 PM Aaron Marcuse-Kubitza

VegX-VegBIEN.organisms.csv: Re-mapped individualOrganismObservation user-defined field count to already existing aggregateOrganismObservation.aggregateValue

1751 04/02/2012 04:44 PM Aaron Marcuse-Kubitza

vegbien.ERD.mwb: Fixed lines

1750 04/02/2012 01:50 PM Aaron Marcuse-Kubitza

README.TXT: Documented that `make reinstall_db` will delete your VegBIEN DB

1749 04/02/2012 01:48 PM Aaron Marcuse-Kubitza

README.TXT: Documented that `make empty_db` will delete your VegBIEN DB

1748 04/02/2012 01:44 PM Aaron Marcuse-Kubitza

root Makefile: empty_db: Confirm deletion just like for rm_db. rm_db: put $(confirmRmDb) on a separate line and move the $(error) call to the main $(confirm) macro since you always want to abort make if the user cancels (not just not run that command).

1747 04/02/2012 01:34 PM Aaron Marcuse-Kubitza

root Makefile: rm_db: If user cancels, abort in case target was reinstall_db to prevent installing

1746 04/02/2012 01:28 PM Aaron Marcuse-Kubitza

root Makefile: core, rm_core: Fixed bug where no longer existing prerequisites postgres_user, rm_postgres_user were not removed

1745 04/02/2012 01:25 PM Aaron Marcuse-Kubitza

root Makefile: rm_db: Confirm deletion with user. Merged postgres_user, rm_postgres_user into db, rm_db so that deletion confirmation applies to user deletion as well (which would indirectly cause the DB to be deleted).

1744 04/02/2012 01:04 PM Aaron Marcuse-Kubitza

README.TXT: Testing: Updated to add missing mappings

1743 04/02/2012 01:03 PM Aaron Marcuse-Kubitza

root Makefile: test-all: Added missing_mappings

1742 04/02/2012 01:00 PM Aaron Marcuse-Kubitza

Moved maps validation targets from main Makefile to input.Makefile. main Makefile: maps validation: Summarize the output of the inputs' maps validations.

1741 04/02/2012 12:22 PM Aaron Marcuse-Kubitza

Makefile: Also find missing input mappings, in addition to missing join mappings

1740 04/02/2012 12:21 PM Aaron Marcuse-Kubitza

join: Also produce warnings for no input mapping (if no comment explaining why no input mapping), in addition to no join mapping

1739 04/02/2012 12:21 PM Aaron Marcuse-Kubitza

join: Also produce warnings for no input mapping (if no comment explaining why no input mapping), in addition to no join mapping

1738 04/02/2012 12:20 PM Aaron Marcuse-Kubitza

inputs/NY/maps/DwC.specimens.csv: Documented why there is no input mapping for key

1737 04/02/2012 11:29 AM Aaron Marcuse-Kubitza

VegX-VegBIEN.organisms.csv: Renamed individualOrganismObservation user-defined fields stem* to remove the stem* prefix to be consistent with VegBIEN

1736 04/02/2012 11:23 AM Aaron Marcuse-Kubitza

VegX-VegBIEN.organisms.csv: Renamed individualOrganismObservation/plotObservation user-defined fields sourceaccessioncode to sourceAccessionCode to be consistent with VegX case sensitivity

1735 04/02/2012 11:19 AM Aaron Marcuse-Kubitza

VegX-VegBIEN.organisms.csv: Renamed individualOrganismObservation user-defined field interceptCm to lineCover to be consistent with VegBIEN

1734 04/02/2012 11:18 AM Aaron Marcuse-Kubitza

VegX-VegBIEN.organisms.csv: Renamed individualOrganismObservation user-defined field individualCode to authorPlantCode to be consistent with VegBIEN

1733 04/02/2012 11:17 AM Aaron Marcuse-Kubitza

VegX-VegBIEN.organisms.csv: Renamed individualOrganismObservation user-defined field htFirstBranchM to heightFirstBranch to be consistent with VegBIEN

1732 04/02/2012 11:15 AM Aaron Marcuse-Kubitza

VegX-VegBIEN.organisms.csv: Renamed individualOrganismObservation user-defined field coverPercent to cover to be consistent with VegBIEN

1731 04/02/2012 11:12 AM Aaron Marcuse-Kubitza

VegX-VegBIEN.organisms.csv: Renamed abioticObservation user-defined field siltPercent to silt to be consistent with VegBIEN

1730 04/02/2012 11:11 AM Aaron Marcuse-Kubitza

VegX-VegBIEN.organisms.csv: Renamed abioticObservation user-defined field sandPercent to sand to be consistent with VegBIEN

1729 04/02/2012 11:10 AM Aaron Marcuse-Kubitza

VegX-VegBIEN.organisms.csv: Renamed abioticObservation user-defined field pottasium to potassium to be consistent with VegBIEN

1728 04/02/2012 11:08 AM Aaron Marcuse-Kubitza

VegX-VegBIEN.organisms.csv: Renamed abioticObservation user-defined field organicPercent to organic to be consistent with VegBIEN

1727 04/02/2012 11:07 AM Aaron Marcuse-Kubitza

VegX-VegBIEN.organisms.csv: Renamed abioticObservation user-defined field clayPercent to clay to be consistent with VegBIEN

1726 04/02/2012 11:06 AM Aaron Marcuse-Kubitza

VegX-VegBIEN.organisms.csv: Renamed abioticObservation user-defined field cationCap to cationExchangeCapacity to be consistent with VegBIEN

1725 04/02/2012 11:02 AM Aaron Marcuse-Kubitza

VegX-VegBIEN.organisms.csv: Renamed plotObservation user-defined field precipMm to precipitation to be consistent with VegBIEN

1724 04/02/2012 10:56 AM Aaron Marcuse-Kubitza

VegX-VegBIEN.organisms.csv: Changed plotObservation user-defined field plotMethodology to /simpleUserdefined[name=method]/*ID/method/name

1723 04/02/2012 09:47 AM Aaron Marcuse-Kubitza

schemas/postgresql.nimoy.conf: Increased default_statistics_target to 8.4 default value to improve execution query plans

1722 04/02/2012 09:43 AM Aaron Marcuse-Kubitza

Added schemas/postgresql.Mac.conf (for tuning developers' local testing DBs)

1721 04/02/2012 09:42 AM Aaron Marcuse-Kubitza

schemas/postgresql*.conf: Increased checkpoint_segments and checkpoint_completion_target so that checkpoints (performance intensive) are written less often and load-balanced better