lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From su ha <>
Subject Range queries in successive positions
Date Fri, 02 Mar 2012 07:22:14 GMT
I'm new to Lucene. I'm indexed some documents with Lucene and need to sanitize it to ensure
that they do not have any social security numbers (3-digits 2-digits 4-digits). 

(How) Can I write a query (with the QueryParser) that searches for this pattern?

e.g. I can do [000 to 999] or [00 to 99] or [0000 to 9999], but this causes hits with any
2, 3 or 4 digit number.
Something like "[000 to 999] [00 TO 99] [0000 TO 9999]", I get no hits at all.

Is this possible with the default QueryParser?
Or is there some other programmatic way to do it?
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message