lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Jack Krupansky <jack.krupan...@gmail.com>
Subject Re: Searching for "iso surface", and looking for "isosurface"
Date Thu, 17 Dec 2015 19:36:08 GMT
Unfortunately there is no general support for this feature. But you can
approximate it if you have a relatively small set of compound terms using
the DictionaryCompoundWordTokenFilterFactory:
http://lucene.apache.org/core/5_4_0/analyzers-common/org/apache/lucene/analysis/compound/DictionaryCompoundWordTokenFilterFactory.html

The doc is weak. I do have some examples in my old Solr 4.x Deep Dive
e-book:
http://www.lulu.com/shop/jack-krupansky/solr-4x-deep-dive-early-access-release-7/ebook/product-21203548.html

You might also be able to achieve a similar effect with synonyms, but again
only if the set of terms is reasonably small.


-- Jack Krupansky

On Thu, Dec 17, 2015 at 1:45 PM, Xiaolong Zheng <zhengxiaolong@gmail.com>
wrote:

> Hi All,
>
> I want to know what's the common way to implement the searching with
> whitespace removal. For example, if I searching "iso surface" in google, it
> not only search for "iso" or "surface", but also have a search for
> "isosurface".
>
> Is that just simply add another search clause by removing the white space?
>
> Does the "Proximity Searches" can solve this kind of problem? It seems that
> "iso surface"~0 still require at least have a whitespace in between. Does
> it support for "iso surface"~-1, with a negative 1 means minimal distance
> can be remove the whitespace?  (maybe this is a day dreaming)
>
>
>
>
>
> Sincerely,
>
> --Xiaolong
>

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message