Project

General

Profile

# Date Author Comment
11970 01/20/2014 11:33 AM Aaron Marcuse-Kubitza

moved everything into /trunk/ to create the standard svn layout, for use with tools that require this (eg. git-svn). IMPORTANT: do NOT do an `svn up`. instead, re-use your working copy's existing files with `svn switch` (http://svnbook.red-bean.com/en/1.6/svn.ref.svn.c.switch.html).

8233 03/28/2013 03:37 PM Aaron Marcuse-Kubitza

mappings/Makefile: $(vocab); bin/redmine_synonyms: Support crossed out (deprecated) terms

7808 03/05/2013 02:30 AM Aaron Marcuse-Kubitza

mappings/Makefile: VegCore.thesaurus.csv: `uniq`ify the map after applying initial translations, to support mappings that appear twice (once as the result of an alternative's synonym entry and again as the result of an ambiguous term's alternatives listing)

7525 02/12/2013 12:09 PM Aaron Marcuse-Kubitza

mappings/Makefile: VegCore.vocab.csv: Ignore leading ? when sorting so that ambiguous terms sort alphabetically with other terms. This prevents terms from moving from their previous location when they become ambiguous.

7522 02/12/2013 11:47 AM Aaron Marcuse-Kubitza

mappings/Makefile: VegCore.vocab.csv: Added back ambiguous terms, so that the vocabulary contains all terms defined by VegCore, regardless of whether they are ambiguous or unambiguous terms

7521 02/12/2013 11:44 AM Aaron Marcuse-Kubitza

mappings/Makefile: VegCore.vocab.csv: Added back synonyms, so that the vocabulary contains all terms defined by VegCore, regardless of whether they are synonyms or primary terms. This also prevents VegCore.vocab.csv from losing entries when terms are renamed, which made it difficult to verify that no terms were lost when refactoring.

7491 02/07/2013 07:16 AM Aaron Marcuse-Kubitza

mappings/Makefile: VegCore.vocab.csv: Use the term's type label instead of its header level to determine if it's a synonym or alternative. This allows header levels to be chosen for presentational reasons rather than being constrained by being parsable.

7487 02/07/2013 06:21 AM Aaron Marcuse-Kubitza

mappings/Makefile, redmine_synonyms: Updated for new VegCore data dictionary format, which prefixes the term type (Synonym, Alternative, etc.) to the term instead of including it as a section label. This ensures that the term type of a non-primary term is shown next to the term when it is visited via a permalink, which causes the term header to appear at the top of the screen and obscures the section header containing the type.

7486 02/07/2013 06:00 AM Aaron Marcuse-Kubitza

mappings/Makefile: VegCore.thesaurus.csv: removal of tables: ignore errors if grep found no match

7485 02/07/2013 02:06 AM Aaron Marcuse-Kubitza

Renamed mappings/VegCore.csv to VegCore.vocab.csv and Veg+-VegCore.csv to VegCore.thesaurus.csv for clarity

7484 02/07/2013 02:03 AM Aaron Marcuse-Kubitza

mappings/Makefile, input.Makefile: Renamed $(dict) to $(thesaurus) because Veg+-VegCore.csv is actually a thesaurus, not a dictionary

7483 02/07/2013 01:57 AM Aaron Marcuse-Kubitza

mappings/Makefile: Replaced occurrences of VegCore.csv with $(vocab) and Veg+-VegCore.csv with $(dict)

7481 02/06/2013 07:33 PM Aaron Marcuse-Kubitza

mappings/Makefile: Veg+-VegCore.csv: Sort terms by name so that reordering terms in the VegCore data dictionary does not cause Veg+-VegCore.csv to change. This makes it much easier to identify synonyms and ambiguous terms that were accidentally deleted during a data dictionary refactoring. (Note that these are no longer included in VegCore.csv, so this is required in addition to sorting VegCore.csv by name.)

7480 02/06/2013 07:26 PM Aaron Marcuse-Kubitza

mappings/Makefile: VegCore.csv: Sort terms by name so that reordering terms in the VegCore data dictionary does not cause VegCore.csv to change. This makes it much easier to identify terms that were accidentally deleted during a data dictionary refactoring.

7478 02/05/2013 06:18 PM Aaron Marcuse-Kubitza

mappings/Makefile: VegCore.csv: Filter out namespaces by matching only terms whose header links within the data dictionary

7475 02/05/2013 05:56 PM Aaron Marcuse-Kubitza

mappings/Makefile: VegCore.csv: Fixed bug where need to remove duplicates, which are no longer supported by canon, by removing alternatives of ambiguous terms when these occur separately from their definitions

7474 02/05/2013 05:29 PM Aaron Marcuse-Kubitza

mappings/Makefile: VegCore.csv: Removed synonyms and ambiguous terms, since the canonicalization of them is handled by Veg+-VegCore.csv. This also reduces the time it takes canon to build the in-memory Python dict of replacements, which scales to all inputs and should speed up the build/test command.

7473 02/05/2013 05:22 PM Aaron Marcuse-Kubitza

mappings/Makefile: VegCore.csv: Removed synonyms, since the canonicalization of them is handled by Veg+-VegCore.csv

7472 02/05/2013 05:10 PM Aaron Marcuse-Kubitza

mappings/Makefile: VegCore.csv: Match terms by header # instead of matching all anchors, in order to include the leading ? before an ambiguous term

7471 02/05/2013 04:42 PM Aaron Marcuse-Kubitza

mappings/Makefile: Veg+-VegCore.csv: Generate dynamically from VegCore.htm, which allows the VegCore thesaurus to be automatically kept up to date. More importantly, it allows terms in all map spreadsheets to be updated simultaneously when a term is renamed (e.g. by replacing a term with one of its synonyms).

7465 02/05/2013 03:56 PM Aaron Marcuse-Kubitza

mappings/Makefile: Veg+-VegCore.csv: Fixed bugs where also need to filter out ambiguous tables, but shouldn't filter out acronyms (which are regular fields)

7462 02/05/2013 03:19 PM Aaron Marcuse-Kubitza

mappings/Makefile: Veg+-VegCore.csv: Fixed bug where need to filter out table names to avoid applying table replacements to fields which have the same name as a table

7460 02/05/2013 01:59 PM Aaron Marcuse-Kubitza

mappings/Makefile: VegX-VegCore.csv: Sort by the input column instead of the output column to keep the sort order stable across VegCore term renames

7459 02/05/2013 01:46 PM Aaron Marcuse-Kubitza

mappings/Makefile: Veg+-VegCore.csv: Before running collapse_multimap, canonicalize alternatives of ambiguous terms using unambiguous mappings. This ensures that the alternatives lists contain only canonical VegCore terms rather than synonyms.

7457 02/05/2013 01:31 PM Aaron Marcuse-Kubitza

mappings/Veg+-VegCore.csv: Removed Sources, Definition columns because source information is now in the VegCore data dictionary

7453 02/05/2013 12:50 PM Aaron Marcuse-Kubitza

mappings/Makefile: Veg+-VegCore.csv: translate: Fixed bug where need to run on $@ instead of $<

7447 02/05/2013 04:31 AM Aaron Marcuse-Kubitza

mappings/Makefile: Veg+-VegCore.csv: Translate the thesaurus's output terms using itself in order to map a synonym of an ambiguous term directly to its alternatives list rather than only to the ambiguous term itself

7446 02/05/2013 04:26 AM Aaron Marcuse-Kubitza

mappings/Makefile: Veg+-VegCore.csv: Run collapse_multimap on the generated map so that all alternatives are included, rather than just the first alternative, when translate maps an ambiguous term

7445 02/05/2013 04:25 AM Aaron Marcuse-Kubitza

redmine_synonyms: Fixed bug where need to output a CSV rather than TSV to be usable by other programs that use map spreadsheets

7438 02/05/2013 01:56 AM Aaron Marcuse-Kubitza

mappings/Makefile: VegCore.tables.csv: Exclude ambiguous table names, which should not be part of the tables summary (as neither are table synonyms)

7435 02/05/2013 01:47 AM Aaron Marcuse-Kubitza

mappings/Makefile: Use new translate_ci instead of translate

7432 02/02/2013 05:39 PM Aaron Marcuse-Kubitza

mappings/Makefile: Added target to create Veg+-VegCore.csv from VegCore.htm, initially commented out until all the synonyms in the existing Veg+-VegCore.csv are added to the VegCore data dictionary <https://projects.nceas.ucsb.edu/nceas/projects/bien/wiki/VegCore_data_dictionary>

7209 01/14/2013 08:39 AM Aaron Marcuse-Kubitza

mappings/Makefile: VegCore.tables.csv: Removed no longer needed removal of Namespaces table, which is now marked as just a section, not a table

7040 01/03/2013 07:39 AM Aaron Marcuse-Kubitza

mappings/Makefile: VegCore.csv: Include only terms that start with a lowercase letter or are all caps. This also avoids the need to filter out VegCore.tables.csv.

7034 01/02/2013 11:32 PM Aaron Marcuse-Kubitza

mappings/Makefile: VegCore.csv: Fixed bug where need to filter out VegCore.tables.csv case-sensitively so that field names which are the same as a table name don't get filtered out

6893 12/17/2012 08:06 PM Aaron Marcuse-Kubitza

mappings/Makefile: VegCore.csv: Filter out the VegCore tables so they are not matched as terms. This is necessary because some terms have the same name as a table, but the term should be the match rather than the table.

6851 12/14/2012 08:29 AM Aaron Marcuse-Kubitza

mappings/Makefile: Added VegCore.tables.redmine, which contains the Redmine-formatted list of VegCore tables to paste into <https://projects.nceas.ucsb.edu/nceas/projects/bien/wiki/VegCore#Tables>

6849 12/14/2012 08:12 AM Aaron Marcuse-Kubitza

mappings/Makefile: Added VegCore.tables.csv, which contains all the tables in the VegCore data dictionary

6845 12/14/2012 05:41 AM Aaron Marcuse-Kubitza

mappings/Makefile: Generate VegCore.csv from the VegCore data dictionary page by extracting all HTML anchors (in Redmine, each section heading, and therefore each VegCore term, gets its own anchor)

6844 12/14/2012 05:34 AM Aaron Marcuse-Kubitza

mappings/VegCore.csv: Changed line endings to \n to match what sed generates from the VegCore data dictionary page

6843 12/14/2012 05:31 AM Aaron Marcuse-Kubitza

mappings/VegCore.csv: Removed informational columns, because this information is now maintained on the VegCore data dictionary page at <https://projects.nceas.ucsb.edu/nceas/projects/bien/wiki/VegCore>

6830 12/14/2012 02:42 AM Aaron Marcuse-Kubitza

mappings/Makefile: .Veg+-VegCore.csv.last_cleanup: Also canon the output (VegCore) column to the VegCore.csv vocabulary. ? prefixes are not a problem because there are always at least two alternatives listed for these terms, so canon will not modify the output field.

6703 12/07/2012 05:50 PM Aaron Marcuse-Kubitza

mappings/Makefile: .VegCore.csv.last_cleanup, .Veg+-VegCore.csv.last_cleanup: Also replace Veg+ terms in sources list, which are references to VegCore terms that have since been renamed

6699 12/07/2012 04:39 PM Aaron Marcuse-Kubitza

mappings/VegCore.csv: Added Values column to store closed list values

6676 12/07/2012 01:05 PM Aaron Marcuse-Kubitza

dict2redmine: Put the definition before and outside of the fields table

6675 12/07/2012 12:53 PM Aaron Marcuse-Kubitza

mappings/VegCore.csv: Moved Definition values that are actually comments into separate Comments column

6633 12/05/2012 09:09 AM Aaron Marcuse-Kubitza

dict2redmine: Removed no longer used explicit Definition column #

6629 12/05/2012 08:58 AM Aaron Marcuse-Kubitza

mappings/VegCore.csv, Veg+-VegCore.csv: Reversed order of Comments, Sources columns to match Redmine table order

6627 12/05/2012 08:43 AM Aaron Marcuse-Kubitza

dict2redmine: Added Definition column

6626 12/05/2012 08:32 AM Aaron Marcuse-Kubitza

dict2redmine: Take term and sources col #s as args instead of hardcoding them by column name or position

6621 12/05/2012 08:02 AM Aaron Marcuse-Kubitza

mappings/Makefile: Create VegCore.redmine from VegCore.csv

6598 12/05/2012 12:37 AM Aaron Marcuse-Kubitza

mappings/Makefile: .Veg+-VegCore.csv.last_cleanup: Sort by the source URL instead of the VegCore term

6597 12/05/2012 12:35 AM Aaron Marcuse-Kubitza

mappings/Makefile: Split .Veg+-VegCore.csv.last_cleanup and .VegX-VegCore.csv.last_cleanup into separate targets so their recipes can be different

5635 10/18/2012 12:10 PM Aaron Marcuse-Kubitza

mappings/Makefile: .VegCore.csv.last_cleanup: Remove duplicate entries using uniq

5254 10/05/2012 05:07 PM Aaron Marcuse-Kubitza

Makefiles: .last_cleanup targets: Also make the file that's being cleaned up .PRECIOUS so it doesn't get deleted if the .last_cleanup target has an error

5253 10/05/2012 05:04 PM Aaron Marcuse-Kubitza

Makefiles: .last_cleanup targets: Make each individual target .PRECIOUS (don't delete on error) because just making %.last_cleanup precious doesn't seem to prevent deletion

5250 10/04/2012 09:59 PM Aaron Marcuse-Kubitza

Makefiles: Don't delete %.last_cleanup on error because it's a mod time record rather than a generated file, and so that it's left at the last successful cleanup time when a cleanup operation is cancelled

5067 09/27/2012 09:06 AM Aaron Marcuse-Kubitza

mappings/Makefile: .Veg+-VegCore.csv.last_cleanup: Translate VegCore terms using itself so that any mapping to another Veg+ term automatically becomes a mapping to a VegCore term. .VegX-VegCore.csv.last_cleanup: Translate VegCore terms using Veg+-VegCore.csv to keep the terms up to date.

5065 09/27/2012 09:00 AM Aaron Marcuse-Kubitza

mappings/Makefile: .VegCore.csv.last_cleanup, .VegCore-VegBIEN.csv.last_cleanup: Apply Veg+-VegCore.csv so that terms can easily be renamed just by adding a mapping in Veg+-VegCore.csv, which will auto-translate all places that use the term. .VegCore-VegBIEN.csv.last_cleanup: Canonicalize to VegCore.csv so case changes in VegCore terms will automatically propagate to VegCore-VegBIEN.csv.

5046 09/27/2012 05:04 AM Aaron Marcuse-Kubitza

mappings/Makefile: .VegCore.csv.last_cleanup: Also remake VegCore-VegBIEN.unsourced_terms.csv here, not just in .VegCore-VegBIEN.csv.last_cleanup, so that the unsourced_terms.csv will be remade if the user adds the missing sources to VegCore.csv

5045 09/27/2012 05:03 AM Aaron Marcuse-Kubitza

mappings/Makefile: VegCore-VegBIEN.unsourced_terms.csv: Factored remake code into its own make target

4893 09/20/2012 09:56 PM Aaron Marcuse-Kubitza

mappings/Makefile: .VegCore-VegBIEN.csv.last_cleanup: Generate VegCore-VegBIEN.unsourced_terms.csv whenever VegCore-VegBIEN.csv changes, to track VegCore terms that are mapped to VegBIEN but not documented in VegCore.csv. Note that this file is not svn:ignored, so it will show up with a ? when the user runs `svn st` if there are any unsourced terms.

4892 09/20/2012 09:47 PM Aaron Marcuse-Kubitza

mappings/Makefile: Changed catch-all `.%.last_cleanup: %` target to a specific target for VegCore-VegBIEN.csv, because it's the only file that uses this target

4891 09/20/2012 09:45 PM Aaron Marcuse-Kubitza

mappings/: Don't generate a for_review version of Veg+-VegCore.csv, because it is identical to the machine-readable Veg+-VegCore.csv (there are no output XPaths to simplify)

4890 09/20/2012 09:41 PM Aaron Marcuse-Kubitza

mappings/: Don't generate a for_review version of VegX-VegCore.csv, because it is identical to the machine-readable VegX-VegCore.csv (there are no output XPaths to simplify)

4889 09/20/2012 09:37 PM Aaron Marcuse-Kubitza

mappings/: Removed Veg+.unmapped_terms.csv because these terms are found in each datasource's new_terms.csv, which are updated regularly, while this file isn't, and which exist for every datasource, while this file only contained terms from a few datasources

4850 09/19/2012 08:14 PM Aaron Marcuse-Kubitza

mappings/Makefile: .VegCore.csv.last_cleanup: Fixed bug where needed to change sorting columns to match new column order

4841 09/19/2012 07:02 PM Aaron Marcuse-Kubitza

mappings/: Split Veg+.terms.csv into VegCore.csv and Veg+.unmapped_terms.csv

4692 09/14/2012 07:19 PM Aaron Marcuse-Kubitza

mappings/Makefile: Inline $(review_) because it's only used once

4691 09/14/2012 07:18 PM Aaron Marcuse-Kubitza

mappings/Makefile: Removed no longer used $(review)

4690 09/14/2012 07:17 PM Aaron Marcuse-Kubitza

mappings/Makefile: Don't set $(SHELL) to /bin/bash because this is no longer needed

4689 09/14/2012 07:16 PM Aaron Marcuse-Kubitza

mappings/Makefile: Removed empty VegCSV section. mappings/Makefile's only functionality is now to clean up (sort) the core maps whenever they change and create human-readable maps from them.

4688 09/14/2012 07:13 PM Aaron Marcuse-Kubitza

mappings/Makefile: Removed no longer used self maps, because the new automapping mechanism does not use them

4670 09/12/2012 05:34 PM Aaron Marcuse-Kubitza

mappings/: Removed no longer used derived file Veg+.vocab.csv

4665 09/12/2012 05:20 PM Aaron Marcuse-Kubitza

mappings/: Removed no longer used Veg+-VegBIEN.csv and derived autogen Veg+.self.csv

4584 09/11/2012 05:49 AM Aaron Marcuse-Kubitza

Added mappings/Veg+.vocab.csv

4570 09/11/2012 02:09 AM Aaron Marcuse-Kubitza

mappings/: Removed no longer needed Veg+-VegCore.to_self.csv, because multiple levels of mappings are no longer needed to get to the VegCore term

4507 09/07/2012 09:18 AM Aaron Marcuse-Kubitza

mappings/: Removed no longer needed Veg+.cs-VegBIEN.csv, which is now the same as Veg+-VegBIEN.csv which was derived from it

4504 09/07/2012 09:11 AM Aaron Marcuse-Kubitza

intersect, union: Made case- and punctuation-insensitive. mappings/Veg+-VegBIEN.csv: Removed no longer needed duplicate entries for each first letter case, which must now be removed for case- and punctuation-insensitive intersect/union to work. Note that the SpeciesLink `svn diff` hides _alt entry 0, which contains one of the removed duplicate columns that appears in the diff.

4268 08/28/2012 05:00 PM Aaron Marcuse-Kubitza

mappings/Makefile: Fixed bug where rules needed to generate Veg+.self.csv ($(viaSelfMap)) were still using a pattern match that required a table (`.%.`, `.*.`), even though we are no longer using separate maps for separate tables

4174 08/22/2012 09:42 AM Aaron Marcuse-Kubitza

mappings/Makefile: Veg+.cs-VegBIEN.csv: Join new Veg+-VegCore.to_self.csv (self-join), instead of Veg+-VegCore.csv, to VegCore-VegBIEN.csv, to support two-level chains of mappings in Veg+-VegCore.csv

4168 08/21/2012 09:34 AM Aaron Marcuse-Kubitza

mappings: Added autogen Veg+-VegCore.to_self.csv, which is Veg+-VegCore.csv joined to itself, and use it as an intermediate map to join to VegCore-VegBIEN.csv. This provides support for two-level chains of mappings in Veg+-VegCore.csv.

4157 08/21/2012 07:51 AM Aaron Marcuse-Kubitza

mappings/Makefile: Veg+.cs-VegBIEN.csv: Join Veg+-VegCore.csv to VegCore-VegBIEN.csv in quiet mode, to avoid adding "No non-empty join mapping" to the Comments column

4155 08/21/2012 07:30 AM Aaron Marcuse-Kubitza

mappings/Makefile: Rewrapped lines

4099 08/17/2012 04:02 PM Aaron Marcuse-Kubitza

mappings/: Removed now-inaccurate ".stems" suffix from VegX-VegCore.stems.csv, which actually applied to all tables

4098 08/17/2012 03:59 PM Aaron Marcuse-Kubitza

mappings/: Removed no longer used ".specimens" suffix from maps, which is now the same for all maps

4088 08/16/2012 08:30 PM Aaron Marcuse-Kubitza

Merged DwC (including DwC1) and VegCSV mappings into new Veg+ schema. This involves replacing occurrences of DwC and VegCSV with Veg+ (or sometimes VegCore) everywhere, as described in <https://projects.nceas.ucsb.edu/nceas/projects/bien/wiki/VegCSV-DwC_merging&gt;.

3967 08/13/2012 10:44 AM Aaron Marcuse-Kubitza

Added mappings/Veg+.terms.csv, which will serve the purpose of listing all available terms with their source. This will remove the need to store the sources in the mappings, where they are out of place and difficult to maintain during refactoring.

3941 08/10/2012 05:02 PM Aaron Marcuse-Kubitza

mappings: Removed no longer used for_review/VegBIEN-DwC2.specimens.csv

3927 08/09/2012 03:23 PM Aaron Marcuse-Kubitza

mappings/Makefile: Made VegCSV-VegBIEN.specimens.csv a non-derived map, since the VegX-VegCSV mapping is no longer used. This causes automatic creation of a for_review file.

3921 08/09/2012 01:46 PM Aaron Marcuse-Kubitza

mappings/Makefile: VegCSV-VegBIEN.specimens.csv: Commented out combining with DwC2-VegBIEN mappings, because merging DwC and VegX/VegCSV into one map is a lower priority than replacing all datasource VegX mappings with VegCSV (which does not require the merging but does require XPaths that don't collide, which is not yet the case)

3904 08/09/2012 10:09 AM Aaron Marcuse-Kubitza

mappings/Makefile: VegCSV-VegBIEN.specimens.csv: Temporarily sort by input column rather than output column, to assist in finding terms that map to different places in the DwC- and VegX-VegBIEN mappings

3903 08/09/2012 10:02 AM Aaron Marcuse-Kubitza

mappings/Makefile: VegCSV-VegBIEN.specimens.csv: Use new all option to union, in order to manually review inputs which appear in both maps but map to different places

3901 08/09/2012 09:57 AM Aaron Marcuse-Kubitza

mappings/Makefile: Merged .VegX-VegCSV.stems.csv.last_cleanup into .%.last_cleanup, since VegX-VegCSV.stems.csv now uses the same cleanup operations as the other non-derived maps. Note that this automatically creates a file in for_review for VegX-VegCSV.stems.csv, which is currently identical to it.

3900 08/09/2012 09:52 AM Aaron Marcuse-Kubitza

mappings/Makefile: .%.last_cleanup: Removed simplify_xpath because non-derived maps will now have VegX XPaths in their Source column URLs, which should not be modified

3899 08/09/2012 09:50 AM Aaron Marcuse-Kubitza

mappings/Makefile: VegX-VegCSV.stems.csv: Removed autogeneration command because once file has been generated, regeneration is no longer needed

3898 08/09/2012 09:42 AM Aaron Marcuse-Kubitza

mappings/Makefile: Fixed bug where VegX-VegCSV.stems.csv needed to be removed from $(vegcsvMaps) so it wouldn't be deleted on `make clean`

3826 08/07/2012 05:38 AM Aaron Marcuse-Kubitza

mappings/Makefile: VegX-VegCSV.stems.csv: Clean up when edited using sort_map

3825 08/07/2012 05:27 AM Aaron Marcuse-Kubitza

Added mappings/VegCSV-VegBIEN.specimens.csv, which is generated from VegX-VegCSV.stems.csv

3823 08/07/2012 05:14 AM Aaron Marcuse-Kubitza

Added mappings/VegX-VegCSV.stems.csv. The initial version is autogenerated by joining the simplified VegBIEN XPaths of related maps.