lucene-java-user mailing list archives

From Otis Gospodnetic <otis_gospodne...@yahoo.com>
Subject Re: Document Clustering
Date Tue, 11 Nov 2003 15:41:33 GMT

--- Leo Galambos <Leo.G@seznam.cz> wrote:
> Marcel Stör wrote:
> 
> >Hi
> >
> >As everybody seems to be so excited about it, would someone please be
> >so kind as to explain what "document based clustering" is?

AFAIK, "document clustering" means detecting groups of documents with
similar content (similar subjects/topics).
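
As a rough illustration of that definition, here is a minimal sketch in
Python. It is not anything Lucene ships: the names term_vector, cosine and
cluster, the 0.5 threshold, and the sample documents are all assumptions
made up for this example. It groups documents by comparing bag-of-words
vectors with cosine similarity:

```python
import math
from collections import Counter


def term_vector(text):
    """Lowercased bag-of-words term counts for one document."""
    return Counter(text.lower().split())


def cosine(a, b):
    """Cosine similarity between two term-count vectors."""
    dot = sum(count * b[term] for term, count in a.items())
    norm_a = math.sqrt(sum(c * c for c in a.values()))
    norm_b = math.sqrt(sum(c * c for c in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def cluster(docs, threshold=0.5):
    """Greedy single-pass clustering (hypothetical helper).

    A document joins the first cluster whose seed (first member) is at
    least `threshold` similar to it; otherwise it starts a new cluster.
    Returns a list of clusters, each a list of indexes into docs.
    """
    vectors = [term_vector(d) for d in docs]
    clusters = []
    for i, vec in enumerate(vectors):
        for members in clusters:
            if cosine(vec, vectors[members[0]]) >= threshold:
                members.append(i)
                break
        else:
            # No existing cluster was similar enough.
            clusters.append([i])
    return clusters


# Made-up sample documents, two about proteins and two about search.
docs = [
    "protein folding and protein structure",
    "lucene index search query",
    "protein structure prediction",
    "search query parsing in lucene",
]
print(cluster(docs))  # prints [[0, 2], [1, 3]]
```

Note that no topic labels are given up front; the groups fall out of the
similarity measure alone. Real systems usually weight terms (e.g. tf-idf)
and use a less order-dependent algorithm than this single pass.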
 
> Hi
> 
> they are trying to implement what you can see in the right panel
> here:
> http://www.egothor.dundee.ac.uk/egothor/q2c.jsp?q=protein
> They may also analyze identical pages (hits #9 and #10) - this could
> also be taken as "clustering" AFAIK.

Interesting.

> For instance, Doug wrote some papers about clustering (if I remember
> correctly) - see his bibliography.


How is document clustering different from, or related to, text
categorization?

Thanks,
Otis




