lucene-solr-user mailing list archives

From Furkan KAMACI <furkankam...@gmail.com>
Subject Re: Regex For *|* at hl.regex.pattern
Date Sat, 12 Apr 2014 22:09:29 GMT
Hi;

I found a way to achieve this by debugging the source code. The first step
is to define a delimiter and index it as an individual token. The next step
is to write a regex that matches that delimiter. The last step is to set the
slop size (hl.regex.slop): with a large slop size you get the whole
sentence.
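
The regex and slop steps can be sketched roughly as follows. This is only an illustration in Python, not the Solr source: split_sentences and fragment_bounds are hypothetical helpers, and the bounds formula is an assumption based on the documented meaning of hl.regex.slop (how far a fragment may stray from hl.fragsize), so the exact rounding inside Solr may differ.

```python
import re

DELIMITER = "*|*"  # the sentence delimiter discussed in this thread

# re.escape backslash-escapes the regex metacharacters '*' and '|',
# so the pattern matches the three characters literally.
delimiter_pattern = re.compile(re.escape(DELIMITER))


def split_sentences(text):
    """Split text on the literal delimiter and drop empty pieces (hypothetical helper)."""
    return [part.strip() for part in delimiter_pattern.split(text) if part.strip()]


def fragment_bounds(fragsize, slop):
    """Approximate (min, max) fragment length for hl.fragsize and hl.regex.slop.

    Assumption: the fragment may stray from fragsize by the factor `slop`
    in either direction.
    """
    return round(fragsize * (1 - slop)), round(fragsize * (1 + slop))


if __name__ == "__main__":
    print(delimiter_pattern.pattern)
    print(split_sentences("First sentence *|* Second sentence *|*"))
    print(fragment_bounds(120, 0.2))
```

Under that assumption, a larger slop widens the allowed range around hl.fragsize, which would explain why a big slop size lets a fragment grow to cover the whole sentence.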

Thanks;
Furkan KAMACI


2014-04-08 13:30 GMT+03:00 Furkan KAMACI <furkankamaci@gmail.com>:

> Hi Jack;
>
> My sentence delimiter is not one character; it is *|*. How do I write a
> regex for it?
>
>
> 2014-04-08 8:06 GMT+03:00 Jack Krupansky <jack@basetechnology.com>:
>
>> The regex pattern should match the text of the fragment. IOW, exclude
>> whatever delimiters are not allowed in the fragment.
>>
>> The default is:
>>
>> [-\w ,\n"']{20,200}
>>
>> -- Jack Krupansky
>>
>> -----Original Message----- From: Furkan KAMACI
>> Sent: Monday, April 7, 2014 10:21 AM
>> To: solr-user@lucene.apache.org
>> Subject: Regex For *|* at hl.regex.pattern
>>
>>
>> Hi;
>>
>> I tried the following, but it does not work. Am I missing anything?
>>
>> q=portu&hl.regex.pattern=.*\*\|\*.*&hl.fragsize=120&hl.regex.slop=0.2
>>
>> My aim is to check whether the text includes *|* or not (that's why I put
>> .* at the beginning and end of the regex, to match whatever surrounds it).
>>
>> How can I fix it?
>>
>> Thanks;
>> Furkan KAMACI
>>
>
>
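
The advice quoted above (the pattern should match the fragment text and leave out the delimiter characters) can be illustrated with a small sketch. It uses Python's re module rather than Java's regex engine, and the sample text is made up for illustration, so it only shows how the default character class behaves, not how Solr builds fragments internally.

```python
import re

# The default hl.regex.pattern quoted in this thread. The character class
# contains neither '*' nor '|', so a match can never run across "*|*".
DEFAULT_PATTERN = re.compile(r"""[-\w ,\n"']{20,200}""")

# Made-up sample text with two sentences separated by the delimiter.
text = "Lisbon is the capital of Portugal *|* Porto is known for its port wine"

# Each match is a run of 20 to 200 allowed characters; the delimiter ends a run.
fragments = [m.group(0).strip() for m in DEFAULT_PATTERN.finditer(text)]

if __name__ == "__main__":
    for fragment in fragments:
        print(fragment)
```

Each match stops at the delimiter because `*` and `|` are outside the character class, which is why a pattern such as `.*\*\|\*.*` pulls the delimiter into the fragment instead of splitting on it.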
