uima-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From William Colen <william.co...@gmail.com>
Subject Sorting overlapping annotation of same type using UIMAFIT
Date Mon, 21 Nov 2016 02:52:51 GMT
Hi,

In Portuguese we have contractions, that are words composed by, for
example, a preposition + article, pronoun or an adverb.

Example:

Nós acreditávamos nele. (We believed him.)

Where "nele" can be divided into "em" + "ele". (in + him)

To properly analyze this, I created two token annotation with the same
begin and end, but the first I associated with the POS Tag preposition, and
the second pronoun.

This is especially important when we are doing chunking, because the first
token will be part of a prepositional phrase, while the second of a nominal
phrase.

How can I guarantee that when I call UIMAFit JCasUtil.select I will get the
tokens ordered, first the preposition, second the pronoun?

Thank you,
William

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message