lucene-java-user mailing list archives

From Ilya Zavorin <izavo...@caci.com>
Subject RE: Efficient string lookup using Lucene
Date Sun, 26 Aug 2012 01:06:36 GMT
Does it mean that the resulting index will be very large?

Thanks,

Ilya

-----Original Message-----
From: Ahmet Arslan [mailto:iorixxx@yahoo.com] 
Sent: Friday, August 24, 2012 4:59 PM
To: java-user@lucene.apache.org
Subject: Re: Efficient string lookup using Lucene

> search for a string "run", I do not need to find "ran" but I do want 
> to find it in all of these strings below:
> 
> Fox is running fast
> !%#^&$run!$!%@&$#
> run,run


With NGramFilter you can do that, but it creates a lot of tokens. For example, "Fox is running
fast" becomes:

F o x Fo ox Fox
i s is
r u n n i n g ru un nn ni in ng *run* unn nni nin ing runn unni nnin ning runni unnin nning runnin unning running
f a s t fa as st fas ast fast
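
To put a number on "a lot of tokens", here is a minimal plain-Java sketch of the same expansion. It does not use Lucene; the class and method names (NGramSketch, ngrams, ngramsPerWord) are made up for illustration, and it assumes the text is split on whitespace and each word is n-grammed separately, as in the list above. The minGram/maxGram parameters are meant to mirror the min and max gram sizes that Lucene's NGramTokenFilter takes, but the filter's actual token order and defaults may differ between versions.

```java
import java.util.ArrayList;
import java.util.List;

public class NGramSketch {

    // All substrings of `word` whose length is between minGram and maxGram,
    // shortest grams first, left to right within each length.
    static List<String> ngrams(String word, int minGram, int maxGram) {
        List<String> grams = new ArrayList<>();
        int longest = Math.min(maxGram, word.length());
        for (int size = minGram; size <= longest; size++) {
            for (int start = 0; start + size <= word.length(); start++) {
                grams.add(word.substring(start, start + size));
            }
        }
        return grams;
    }

    // Assumption: words are separated by whitespace and n-grammed one by one.
    static List<String> ngramsPerWord(String text, int minGram, int maxGram) {
        List<String> all = new ArrayList<>();
        for (String word : text.trim().split("\\s+")) {
            all.addAll(ngrams(word, minGram, maxGram));
        }
        return all;
    }

    public static void main(String[] args) {
        String text = "Fox is running fast";

        // No upper bound: a word of length n yields n * (n + 1) / 2 grams.
        List<String> unbounded = ngramsPerWord(text, 1, Integer.MAX_VALUE);
        System.out.println(unbounded.size() + " grams with no upper bound");
        // prints "47 grams with no upper bound"

        // Gram size fixed at 3: still enough to look up the string "run".
        List<String> trigrams = ngramsPerWord(text, 3, 3);
        System.out.println(trigrams.size() + " grams of size 3: " + trigrams);
        // prints "8 grams of size 3: [Fox, run, unn, nni, nin, ing, fas, ast]"
    }
}
```

With no upper bound the token count grows quadratically with word length, which is where the index size comes from. Capping both sizes at the length of the shortest string you need to look up (3 for "run") keeps the count roughly linear, at the cost that longer query strings then have to be split into grams of that size at search time as well.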


---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-user-help@lucene.apache.org

