lucene-solr-commits mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Apache Wiki <wikidi...@apache.org>
Subject [Solr Wiki] Update of "OpenNLP" by KojiSekiguchi
Date Mon, 24 Sep 2012 13:33:56 GMT
Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Solr Wiki" for change notification.

The "OpenNLP" page has been changed by KojiSekiguchi:
http://wiki.apache.org/solr/OpenNLP?action=diff&rev1=10&rev2=11

  NLP is a large field of inquiry. Unless you are familiar with it you may find this patch
confusing. The [[http://opennlp.apache.org/|Apache OpenNLP project]] is the best place to
learn what this package can do.
  
  == Introduction ==
- OpenNLP is a toolkit for Natural Language Processing (NLP). It is an Apache top-level project
located [[here|http://opennlp.apache.org/]]. It includes implementations of many popular NLP
algorithms. This project integrates some of its features into Lucene and Solr. This first
effort incorporates Analyzer chain tools for sentence detection, tokenization, Parts-of-Speech
tagging (nouns, verbs, ejaculations, etc.), Chunking (noun phrases, verb phrases) and Named
Entity Recognition.  See the OpenNLP project page for information on the implementations.
 Here are some use cases:
+ OpenNLP is a toolkit for Natural Language Processing (NLP). It is an Apache top-level project
located [[http://opennlp.apache.org/|here]]. It includes implementations of many popular NLP
algorithms. This project integrates some of its features into Lucene and Solr. This first
effort incorporates Analyzer chain tools for sentence detection, tokenization, Parts-of-Speech
tagging (nouns, verbs, ejaculations, etc.), Chunking (noun phrases, verb phrases) and Named
Entity Recognition.  See the OpenNLP project page for information on the implementations.
 Here are some use cases:
  
  === Indexing interesting words ===
  NLP lets you create a field with only the nouns in a document. This would be useful for
many free text applications. The FilterPayloadsFilter and StripPayloadsFilter below are required
for this. See "Full Example" below.

Mime
View raw message