lucene-java-user mailing list archives

From Stefan Groschupf ...@media-style.com>
Subject Re: Document Clustering
Date Tue, 11 Nov 2003 16:41:03 GMT
Hi,
>How is document clustering different/related to text categorization?

Clustering: you have no predefined categories. The algorithm finds its own
groups by putting documents that are a small distance from each other into the same cluster.
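
As a rough illustration of the grouping step, here is a minimal sketch in Python. Everything in it is an assumption made for the example and not something from this thread: documents are plain bag-of-words term counts, the distance is cosine distance, and the grouping is single-link agglomerative merging down to a fixed number of clusters k. The names cosine_distance and cluster are hypothetical and not part of Lucene.

```python
import math
from collections import Counter


def cosine_distance(a, b):
    """Cosine distance between two term-count vectors (Counter objects)."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(sum(v * v for v in b.values()))
    return 1.0 - dot / norm if norm else 1.0


def cluster(docs, k):
    """Hypothetical helper: merge the two closest clusters until only k remain."""
    vectors = [Counter(d.lower().split()) for d in docs]
    clusters = [[i] for i in range(len(docs))]  # start with one cluster per document
    while len(clusters) > k:
        best = None
        for x in range(len(clusters)):
            for y in range(x + 1, len(clusters)):
                # Single link: cluster distance is the distance of the closest pair.
                d = min(cosine_distance(vectors[i], vectors[j])
                        for i in clusters[x] for j in clusters[y])
                if best is None or d < best[0]:
                    best = (d, x, y)
        _, x, y = best
        clusters[x] += clusters.pop(y)  # y > x, so index x stays valid
    return clusters


docs = [
    "lucene index search query",
    "search query lucene analyzer",
    "football match goal score",
    "goal score football league",
]
groups = cluster(docs, 2)  # lists of document indices, one list per cluster
```

No category names go in; the groups come only from the pairwise distances. A real system would likely use weighted terms (for example tf-idf) and a faster algorithm, since this sketch compares every pair on every merge.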

Classification: you already have categories and sample documents for each of them, which
help you assign other documents.
You calculate the distance from a new document to each existing category and put it in the
category with the smallest distance.
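
The classification case can be sketched the same way. Again the details are assumptions for illustration only: each category is represented by the summed term counts of its sample documents, the distance is cosine distance, and a new document goes to the category with the smallest distance. The names train and classify and the sample texts are hypothetical.

```python
import math
from collections import Counter


def cosine_distance(a, b):
    """Cosine distance between two term-count vectors (Counter objects)."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(sum(v * v for v in b.values()))
    return 1.0 - dot / norm if norm else 1.0


def train(samples):
    """Build one summed term-count vector per category from (label, text) samples."""
    centroids = {}
    for label, text in samples:
        centroids.setdefault(label, Counter()).update(text.lower().split())
    return centroids


def classify(text, centroids):
    """Return the label of the category closest to the document."""
    vec = Counter(text.lower().split())
    return min(centroids, key=lambda label: cosine_distance(vec, centroids[label]))


samples = [
    ("search", "lucene index search query"),
    ("search", "query parser analyzer index"),
    ("sports", "football match goal score"),
    ("sports", "league goal football team"),
]
centroids = train(samples)
label = classify("how to index and search documents", centroids)
```

Because cosine distance ignores vector length, summing the sample vectors gives the same ranking as averaging them. The difference from the clustering sketch is that the labels and samples are given up front, so only the assignment step remains.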

Cheers
Stefan

-- 
day time: www.media-style.com
spare time: www.text-mining.org | www.weta-group.net



