lucene-solr-user mailing list archives

From Sandeep B A <>
Subject Re: Is there any sentence tokenizers in sold 4.9.0?
Date Mon, 08 Sep 2014 07:24:17 GMT
Hi Susheel,
Thanks for the information.
I have crawled a few websites, and all I need is a sentence tokenizer for the
data I have collected.
These websites are English only.

I don't have any experience writing custom sentence tokenizers for
Solr. Is there a tutorial that explains how to do it?

Is it possible to integrate NLTK with Solr? If yes, how? I ask because I
found sentence tokenizers for English in NLTK.
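
For illustration, one route that avoids a custom Solr tokenizer is to split the
crawled text into sentences in Python before indexing, so that each sentence
becomes its own document. The sketch below is only a rough outline under
stated assumptions: split_sentences and to_solr_docs are hypothetical helpers,
the field names page_id and sentence_txt are made up, and the regex rule is a
naive stand-in for a trained splitter such as NLTK's sent_tokenize.

```python
import re

# Naive rule (an assumption, not a Solr or NLTK feature): a sentence ends at
# '.', '!' or '?' followed by whitespace and then an uppercase letter or digit.
# Abbreviations such as "Dr. Smith" are split incorrectly by this rule.
_SENTENCE_END = re.compile(r'(?<=[.!?])\s+(?=[A-Z0-9])')


def split_sentences(text):
    """Return the sentences found in text, without surrounding whitespace."""
    parts = _SENTENCE_END.split(text.strip())
    return [p.strip() for p in parts if p.strip()]


def to_solr_docs(page_id, text):
    """Build one dict per sentence; the field names are hypothetical."""
    return [
        {"id": f"{page_id}_{i}", "page_id": page_id, "sentence_txt": s}
        for i, s in enumerate(split_sentences(text))
    ]
```

The resulting list of dicts could then be sent to Solr with whatever indexing
client is already in use; swapping the regex for a trained English sentence
splitter would only change the body of split_sentences.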

On Sep 5, 2014 8:10 PM, "Sandeep B A" <> wrote:

> Sorry for the typo: it is Solr 4.9.0, not sold 4.9.0.
>  On Sep 5, 2014 7:48 PM, "Sandeep B A" <> wrote:
>> Hi,
>> I was looking for a sentence tokenizer that ships with Solr by default but
>> could not find one. Has anyone used one, or integrated a tokenizer from
>> another language (for example Python) into Solr? Please let me know.
>> Thanks and regards,
>> Sandeep
