Mailing-List: contact java-user-help@lucene.apache.org; run by ezmlm
Precedence: bulk
Reply-To: java-user@lucene.apache.org
Received-SPF: pass (athena.apache.org: domain of lists@nabble.com designates
 216.139.236.158 as permitted sender)
Message-ID: <27785693.post@talk.nabble.com>
Date: Thu, 4 Mar 2010 11:49:00 -0800 (PST)
From: PlusPlus <r.shahidinejad@gmail.com>
To: java-user@lucene.apache.org
Subject: Re: Why is frequency a float number
In-Reply-To: <Pine.LNX.4.64.1003040948300.23212@radix.cryptio.net>
MIME-Version: 1.0
Content-Type: text/plain; charset=us-ascii
Content-Transfer-Encoding: 7bit
References: <27714523.post@talk.nabble.com>
 <Pine.LNX.4.64.1003040948300.23212@radix.cryptio.net>


Thanks for the reply. 
Actually what I'm looking for is to have a kind of fuzzy memberships for the
terms of a document. That is, for each term of a document, I will have a
membership value for that term and each term will be in each document, at
most once.

For that, I will need float TF and IDF values. It seems that Lucene does not
support what I need and I should change Lucene's code which is not an easy
task. Do you have any suggestions for me?

Best,
Reza


hossman wrote:
> 
> 
> :    I was wondering why TF method gets a float parameter. Isn't frequency
> : always considered to be integer? 
> : 
> :    public abstract float tf(float freq)
> 
> Take a look at how PhraseQuery and SPanNearQuery use tf(float).
> 
> For simple terms (and TermQuery) tf is always an integer, but when dealing 
> with phrases the concept of a "sloppy match" (ie: a phrase with a gap in 
> the middle) results in a fractional "frequency" value because it is not as 
> good as an "exact" match on the phrase (which does result in an integer tf 
> value)
> 
> 
> 
> 
> -Hoss
> 
> 
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
> For additional commands, e-mail: java-user-help@lucene.apache.org
> 
> 
> 

-- 
View this message in context: http://old.nabble.com/Why-is-frequency-a-float-number-tp27714523p27785693.html
Sent from the Lucene - Java Users mailing list archive at Nabble.com.


---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-user-help@lucene.apache.org