lucene-pylucene-dev mailing list archives

From Brian Golbère <br...@orcatec.com>
Subject Re: n-gram word support
Date Fri, 19 Jun 2009 01:35:37 GMT
Andi Vajda wrote:
> On Thu, 18 Jun 2009, Neha Gupta wrote:
>
>> I was wondering if there is a way to read the index and generate
>> n-grams of words for a document using pylucene?
>
> PyLucene just wraps Java Lucene. If there is a way to do this in Java 
> Lucene, then use the same way with PyLucene.
> To find out how to do this in Java Lucene, ask the 
> java-user@lucene.apache.org mailing list. To subscribe, see [1].
>
> Andi..
>
> [1] java-user-subscribe@lucene.apache.org 
There is an n-gram tokenizer, EdgeNGramTokenizer, that may be what 
you're looking for.
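
For reference, here is a minimal plain-Python sketch of the two notions of "n-gram" that come up here. It does not call PyLucene or Lucene at all; the function names word_ngrams and edge_ngrams are made up for illustration. As far as I know, EdgeNGramTokenizer emits character n-grams anchored at the edge of its input, whereas the original question asks about n-grams of words, so it is worth checking which of the two outputs below is the one actually wanted.

```python
def word_ngrams(text, n):
    """Return all runs of n consecutive whitespace-separated words in text.

    Hypothetical helper for illustration only; not part of Lucene or PyLucene.
    """
    if n < 1:
        raise ValueError("n must be at least 1")
    words = text.split()
    # Slide a window of n words over the text; the range is empty when
    # the text has fewer than n words.
    return [" ".join(words[i:i + n]) for i in range(len(words) - n + 1)]


def edge_ngrams(token, min_gram, max_gram):
    """Return the character prefixes of token with lengths min_gram..max_gram.

    Hypothetical helper that mimics front-edge character n-grams; it is an
    assumption about the general idea, not Lucene's implementation.
    """
    if min_gram < 1 or max_gram < min_gram:
        raise ValueError("need 1 <= min_gram <= max_gram")
    longest = min(max_gram, len(token))
    return [token[:size] for size in range(min_gram, longest + 1)]


if __name__ == "__main__":
    print(word_ngrams("the quick brown fox", 2))
    print(edge_ngrams("lucene", 1, 3))
```

If word n-grams are what is needed, the Java list should be able to say which analysis class produces them on the Lucene side.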

- Brian

