opennlp-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Olivier Grisel <olivier.gri...@ensta.org>
Subject Re: OpenNLP Annotations Proposal
Date Thu, 23 Jun 2011 23:10:32 GMT
2011/6/24 James Kosin <james.kosin@gmail.com>:
> Olivier,
>
> No main() in the classes.  So, how does one get the collection of
> articles started?

It's meant to be used as a library. For instance, it is used by the
following custom pig Loader:

  https://github.com/ogrisel/pignlproc/blob/master/src/main/java/pignlproc/storage/ParsingWikipediaLoader.java

which is in turn called in pig scripts such as:

 https://github.com/ogrisel/pignlproc/blob/master/examples/extract_links.pig

Apache Pig is scripting language and runtime environment to perform
distributed data analysis on an Apache Hadoop (HDFS + MapReduce)
cluster.

  http://pig.apache.org/

-- 
Olivier
http://twitter.com/ogrisel - http://github.com/ogrisel

Mime
View raw message