lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Elshaimaa Ali <>
Subject Document Term matrix
Date Tue, 11 Nov 2014 20:36:13 GMT
Hi All,
I have a Lucene index built with Lucene 4.9 for 584 text documents, I need to extract a Document-term
matrix, and Document Document similarity matrix in-order to use it to cluster the documents.
My questions:1- How can I extract the matrix and compute the similarity between documents
in Lucene.2- Is there any java based code that can cluster the documents from Lucene index.

  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message