lucene-java-user mailing list archives

From Mck <m...@semb.wever.org>
Subject Re: Replacing FAST functionality at sesam.no - ShingleFilter+ exact matching
Date Tue, 09 Sep 2008 16:58:45 GMT
> *ShortVersion*
>  is there a way to make the ShingleFilter perform exact matching via
> inserting ^ $ begin/end markers?

Reading through the mailing list, I see how exact matching can be done
(a case of STFW on my part)...

So the ShortVersion now stands:

For my query "abcd efgh ijkl",
why does a (perfect-looking) MultiPhraseQuery with
	termArrays[0] = { list_entry_shingles:abcd
			  list_entry_shingles:abcd efgh
			  list_entry_shingles:abcd efgh ijkl 
			}
	termArrays[1] = { list_entry_shingles:efgh
			  list_entry_shingles:efgh ijkl 
			}
	termArrays[2] = { list_entry_shingles:ijkl }

return only "abcd efgh ijkl"?

(When the field is indexed as TextField, this is the only hit I get.)
(When the field is indexed as StrField, I get zero hits!)

The index contains 9 entries:
 "abcd efgh ijkl", "abcd efgh", "efgh ijkl", "abcd", "efgh", "ijkl", "ijkl efgh", "efgh abcd",
and "ijkl efgh abcd".

Does this MultiPhraseQuery actually require a match against *every* item
in each termArray on any document?
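
A rough way to reason about the question is to model the matching rule outside
Lucene. The sketch below is plain Java and does not call any Lucene API. It
assumes that a slop-0 MultiPhraseQuery needs *one* term from each termArray,
with the termArrays matched at consecutive positions, and that the shingle
analysis puts every shingle starting at word i on position i. The class and
method names (MultiPhraseSketch, shingle, matches) are made up for
illustration.

```java
import java.util.ArrayList;
import java.util.LinkedHashSet;
import java.util.List;
import java.util.Set;

public class MultiPhraseSketch {

    // Hypothetical stand-in for shingle analysis (assumption): position i
    // holds every shingle of 1..maxSize words that starts at word i.
    public static List<Set<String>> shingle(String text, int maxSize) {
        String[] words = text.trim().split("\\s+");
        List<Set<String>> positions = new ArrayList<>();
        for (int i = 0; i < words.length; i++) {
            Set<String> atPosition = new LinkedHashSet<>();
            StringBuilder shingle = new StringBuilder();
            for (int n = 0; n < maxSize && i + n < words.length; n++) {
                if (n > 0) {
                    shingle.append(' ');
                }
                shingle.append(words[i + n]);
                atPosition.add(shingle.toString());
            }
            positions.add(atPosition);
        }
        return positions;
    }

    // Assumed slop-0 phrase rule: there is a start offset such that each
    // consecutive document position contains at least ONE term from the
    // corresponding termArray.
    public static boolean matches(List<Set<String>> doc, List<Set<String>> termArrays) {
        for (int start = 0; start + termArrays.size() <= doc.size(); start++) {
            boolean allPositionsMatch = true;
            for (int k = 0; k < termArrays.size() && allPositionsMatch; k++) {
                boolean anyTermMatches = false;
                for (String term : termArrays.get(k)) {
                    if (doc.get(start + k).contains(term)) {
                        anyTermMatches = true;
                        break;
                    }
                }
                allPositionsMatch = anyTermMatches;
            }
            if (allPositionsMatch) {
                return true;
            }
        }
        return false;
    }

    public static void main(String[] args) {
        String[] entries = {
            "abcd efgh ijkl", "abcd efgh", "efgh ijkl", "abcd", "efgh",
            "ijkl", "ijkl efgh", "efgh abcd", "ijkl efgh abcd"
        };
        // Same three termArrays as in the query above.
        List<Set<String>> query = shingle("abcd efgh ijkl", 3);
        for (String entry : entries) {
            if (matches(shingle(entry, 3), query)) {
                System.out.println(entry);
            }
        }
    }
}
```

Under those assumptions only "abcd efgh ijkl" is printed: the shorter entries
have fewer than three positions, and "ijkl efgh abcd" has no termArrays[0]
term on its first position. An untokenized field would hold a single term on a
single position, so a three-position phrase could never match it, which would
be consistent with the zero hits on StrField.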

~mck

-- 
"Everyone thinks of changing the world, but no one thinks of changing
himself." Leo Tolstoy 
| semb.wever.org | sesat.no | sesam.no |
