opennlp-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Manoj B. Narayanan" <manojb.narayanan2...@gmail.com>
Subject Re: Adding features to Document Categorizer
Date Mon, 21 Aug 2017 04:37:10 GMT
Hi Cohan,

I was also thinking of the same solution. :)

Thanks.

Manoj.

On Fri, Aug 18, 2017 at 7:43 PM, Cohan Sujay Carlos <cohan@aiaioo.com>
wrote:

> One way to do it is to encode the tags into your input text file.
>
> Say one line in the input file is:
>
> CLASS1 Pierre Vinken , 61 years old , will join the board as a
> nonexecutive director Nov. 29 .
>
>
> If you wanted to use POS tags along with the tokens, you could do something
> like this:
>
> CLASS1 Pierre_NNP Vinken_NNP ,_, 61_CD years_NNS old_JJ ,_, will_MD
> join_VB the_DT board_NN as_IN
>     a_DT nonexecutive_JJ director_NN Nov._NNP 29_CD ._.
>
>
> Cohan
>
>
> On Fri, Aug 18, 2017 at 7:22 PM, Manoj B. Narayanan <
> manojb.narayanan2011@gmail.com> wrote:
>
> > Hi,
> >
> > Is there any way by which I can add features to a Document Categorizer.
> For
> > example, the POS tags of the words.
> >
> > Thanks,
> > Manoj
> >
>

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message