lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Otis Gospodnetic <otis_gospodne...@yahoo.com>
Subject Re: a few basic questions
Date Wed, 11 May 2005 19:35:41 GMT
Hello,

It sounds like you missed the Index Format page:
  http://lucene.apache.org/java/docs/fileformats.html

That's the best index format documentation currently available.

Otis


--- Sujatha Das <gsdas@cse.iitb.ac.in> wrote:
> 
> Hi,
> I couldn't find documentation on these issues,
> so a url as response should be just fine.
> 
> The inverted index must look like
> FIELD-1
> term -> (doc,offset)pairs
> 
> Is this correct?
> 
> Say I am trying to index the documents in a corpus under two
> different fields. For instance, I want to store with
> every word, the term text and its stem, what does the inverted index
> look
> like now?
> 
> FIELD1	  FIELD2 
> term_text [term_stem] -> (doc,offset)pairs
> 
> 
> Or somehow a mapping between term_text and term_stem is stored 
> separately is it w/o changing much in the inverted index?
> 
> This is probably a very basic question, but any explanation would be
> of much use to me. Thanks.
> 
> Sujatha Das
> 
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
> For additional commands, e-mail: java-user-help@lucene.apache.org
> 
> 

---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-user-help@lucene.apache.org


Mime
View raw message