Mailing-List: contact lucene-user-help@jakarta.apache.org; run by ezmlm
Precedence: bulk
Reply-To: "Lucene Users List" <lucene-user@jakarta.apache.org>
Content-return: allowed
Date: Mon, 16 Feb 2004 12:02:38 +0100
From: "Viparthi, Kiran (AFIS)" <Kiran.Viparthi@fao.org>
Subject: RE: Did you mean...
To: 'Lucene Users List' <lucene-user@jakarta.apache.org>
Message-id: <DD0BDCCADFDA4349AA5300F8B33F2B7FE6D4C3@afexch3.fao.org>
MIME-version: 1.0
Content-type: text/plain
Content-transfer-encoding: 7BIT

Hi Timo,

What I meant, regarding your previous code, is that you can collect the text
of every term in the index:

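IndexReader reader = IndexReader.open(ram);
TermEnum te = reader.terms();
StringBuffer sb = new StringBuffer();
// Walk every term in the index and collect its text.
while (te.next())
{
    Term t = te.term();
    // Append a separator so StringTokenizer can split the terms later.
    sb.append(t.text()).append(' ');
}
te.close();
reader.close();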

You can then split sb.toString() with a StringTokenizer and put the tokens
into a Map, counting the occurrences of each.
As mentioned, I didn't use any information from the index, so I didn't use a
TokenStream, but let me check it out.
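
The counting step could look roughly like the sketch below. It uses only
java.util, so it runs without Lucene; the class name TermCounter, the method
name countOccurrences and the sample string are made up for illustration, and
the input is assumed to be whitespace-separated text such as sb.toString().

```java
import java.util.LinkedHashMap;
import java.util.Map;
import java.util.StringTokenizer;

class TermCounter {
    // Counts how often each whitespace-separated token occurs in the text.
    // (TermCounter and countOccurrences are hypothetical names.)
    static Map<String, Integer> countOccurrences(String text) {
        // LinkedHashMap keeps the tokens in first-seen order.
        Map<String, Integer> counts = new LinkedHashMap<>();
        StringTokenizer st = new StringTokenizer(text);
        while (st.hasMoreTokens()) {
            // Insert 1 for a new token, otherwise add 1 to the stored count.
            counts.merge(st.nextToken(), 1, Integer::sum);
        }
        return counts;
    }

    public static void main(String[] args) {
        // Stand-in for sb.toString(); the sample text is invented.
        String text = "lucene index lucene search index lucene";
        System.out.println(countOccurrences(text));
    }
}
```

With the sample text above, "lucene" is counted 3 times, "index" twice and
"search" once.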

Kiran

-----Original Message-----
From: lucene@nitwit.de [mailto:lucene@nitwit.de] 
Sent: 16 February 2004 11:38
To: Lucene Users List
Subject: Re: Did you mean...


On Thursday 12 February 2004 18:35, Viparthi, Kiran (AFIS) wrote:
> As mentioned the only way I can see is to get the output of the 
> analyzer directly as a TokenStream iterate through it and insert it 
> into a Map.

Could you provide or point me to some example code on how to get and use 
TokenStream. The API docs are somewhat unclear to me...

---------------------------------------------------------------------
To unsubscribe, e-mail: lucene-user-unsubscribe@jakarta.apache.org
For additional commands, e-mail: lucene-user-help@jakarta.apache.org
