lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Shashi Kant <sk...@sloan.mit.edu>
Subject Re: Lucene - Text Classification.
Date Mon, 09 Nov 2009 15:40:54 GMT
Take a look at Bayesian text classification, which might be more
efficient for your needs. Google it.
There are several other text classification methods - depending your
needs, you can dig into them.



On Mon, Nov 9, 2009 at 10:33 AM, lucenenew <mitesh.jesani@yahoo.com> wrote:
>
> i want to classify sentences stored as strings to a bunch of keywords related
> to a certain category.
>
> so i will have 10 strings which will be a sentence long. and i will want to
> compare each string to a set of 30 keywords stored somewhere, and then
> compare with another set of 30 keywords, so on.
>
> i want to rank each string based on the number of times it matches a set of
> keywords. so basically i want to categorize each sentence.
>
> is this possible with lucene, or would any other approach be more efficient.
>
> will this process take long? in terms of speed of program.
>
> and what tools would i need?
>
> any help would be great.
>
> thanks.
> --
> View this message in context: http://old.nabble.com/Lucene---Text-Classification.-tp26268359p26268359.html
> Sent from the Lucene - Java Users mailing list archive at Nabble.com.
>
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
> For additional commands, e-mail: java-user-help@lucene.apache.org
>
>

---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-user-help@lucene.apache.org


Mime
View raw message