lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Erik Hatcher <>
Subject Re: Input File Format
Date Thu, 10 Nov 2005 17:54:21 GMT
You must be using the demo program that comes with Lucene.  That is  
merely an example, and a barely decent one at that.  Have a look  
under the covers of that code or the code that ships with Lucene in  
Action at

You can slice and dice "documents" in whatever granularity you like -  
but you have to write code to parse the TREC files into those pieces,  
making appropriate fields and so on.


On 10 Nov 2005, at 12:38, Satyanarayana Ashwin wrote:

> Hello,
> I am new to Lucene. I was trying to use Lucene with TREC-6 Data. The
> question is that each input file given by TREC have multiple documents
> (some files contain over 200 documents) tagged by DOCID. The result  
> given
> by Lucene to a query is a list of files and not documents.
> Q1) Is there a way of getting the query results in terms of documents
> within the files rather than files ( without modifying the code)?
> Q2) If the above is not posssible, what would be the best way to  
> modify
> the code?
> Thanks and Regards,
> Ashwin
> Phd Student,
> Department of Computer Science,
> SUNY,Albany.
> ---------------------------------------------------------------------
> To unsubscribe, e-mail:
> For additional commands, e-mail:

To unsubscribe, e-mail:
For additional commands, e-mail:

View raw message