lucene-solr-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Upayavira ...@odoko.co.uk>
Subject Re: Stemmer and stopword Development
Date Thu, 10 Sep 2015 07:23:42 GMT


On Thu, Sep 10, 2015, at 04:45 AM, Imtiaz Shakil Siddique wrote:
> Hi,
> 
> I am trying to develop stemmer and stopword for Bengaly language which is
> not shipped with solr.
> 
> I am trying to make this with machine learning approach but I couldn't
> find
> any good documents to study. It would be very helpful if you could shed
> some lights into this matter.

How are you going to do this with machine learning? What corpus are you
going to use to learn from? Do you have some documents that have been
manually stemmed for which you also have the originals?

Upayavira

Mime
View raw message