/ - Diff - BIEN 3 - NCEAS Projects

« Previous | Next »

Revision 4626

Added by Aaron Marcuse-Kubitza over 12 years ago

canon, translate, filter_out_ci: Support vocabularies/dictionaries with additional columns in addition to the functional column(s) used by the program. These columns can contain comments, etc. This was not originally supported because Python 2's iterable unpacking only supports "an iterable with the same number of items as there are targets in the target list" (http://docs.python.org/reference/simple_stmts.html#assignment-statements). We now use numeric array indexes instead to get around this limitation, and for consistency with other map-manipulation scripts.

         stream = open(vocab_path, 'rb')
         reader = csv.reader(stream)
         reader.next() # skip header
         for term, in reader: vocab.add(simplify(term))
         for row in reader: vocab.add(simplify(row[0]))
         stream.close()
         # Filter input

         stream = open(dict_path, 'rb')
         reader = csv.reader(stream)
         reader.next() # skip header
         for from_, to in reader: dict_[from_] = to
         for row in reader: dict_[row[0]] = row[1]
         stream.close()
         # Translate input

         stream = open(vocab_path, 'rb')
         reader = csv.reader(stream)
         reader.next() # skip header
         for term, in reader: dict_[simplify(term)] = term
         for row in reader: dict_[simplify(row[0])] = row[0]
         stream.close()
         # Canonicalize input

Also available in: Unified diff

Project

General

Profile

Revision 4626

Added by Aaron Marcuse-Kubitza over 12 years ago