lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Xi Shen <>
Subject Which token filter can combine 2 terms into 1?
Date Fri, 21 Dec 2012 07:50:17 GMT

I am looking for a token filter that can combine 2 terms into 1? E.g.

the input has been tokenized by white space:

t1 t2 t2a t3

I want a filter that output:

t1 t2t2a t3

I know it is a very special case, and I am thinking about develop a filter
of my own. But I cannot figure out which API I should use to look for terms
in a Token Stream.

David Shen!/davidshen84

  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message