opennlp-users mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Lance Norskog <goks...@gmail.com>
Subject Re: Document Classification
Date Mon, 23 Apr 2012 21:37:36 GMT
Classification is more of a machine learning problem, not a
linguistics problem. There are a few large, sophisticated (complex)
implementations for text analysis: Weka, R, and Mahout.

On Mon, Apr 23, 2012 at 10:41 AM, Alex Kudlick <adkudlick@gmail.com> wrote:
> Hi,
>
> I've just started using open nlp for a project to classify scientific
> articles in to subjects.  I have a few questions:
>
> 1. How do I configure logging for the model? I'm using sf4j-log4j for the
> rest of my application, but the training output from the model just goes to
> stdout.
>
> 2. Is there any support for classifying documents with multiple classes?
> For instance, a given article may be classified as Computational Biology,
> Cell Biology, and Molecular Biology.
>
> Thanks,
>
> Alex Kudlick



-- 
Lance Norskog
goksron@gmail.com

Mime
View raw message