manifoldcf-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Karl Wright <daddy...@gmail.com>
Subject Re: "File(s) matching" syntax
Date Tue, 01 Dec 2015 20:05:07 GMT
That's not been the experience of others.

You were trying to use regular expressions; this is not supported on the
Path tab.  Only "*" and "?" are recognized as special match characters.

Thanks,
Karl


On Tue, Dec 1, 2015 at 2:54 PM, <igorfk@ig.com.br> wrote:

> Thank you Karl.
>
> For the record: To achieve my demand I had to create an "Allowed
> documents" Transformation connection and insert that connection into a job
> pipeline.
>
>> Hi Igor,
>>
>> The documentation for the JCIFS connector is pretty
>> good:
>> https://manifoldcf.apache.org/release/trunk/en_US/end-user-documentation.html#jcifsrepository
>> [1]
>>
>> It describes how to construct matches for the Paths tab, using * and ?
>> as wildcards.  They are not regular expressions.
>>
>> Karl
>>
>> On Mon, Nov 30, 2015 at 7:25 PM, <igorfk@ig.com.br> wrote:
>>
>> How can I write an expression to match only certain file types to
>>> index?
>>> For example, I just want to index .pdf .doc and .odt files
>>>
>>> Creating a crawl Job I'd tried to insert a rule "Include indexable
>>> file(s) matching", in "Paths" filters, with this regular expression
>>> /(?i:.(pdf|doc|odt))$ but everything was indexed, seems the
>>> expression was not validated, the same with /.(pdf|doc|odt)$/i and
>>> ".(pdf|doc|odt)$"i
>>> What is the proper way to write filters? Some example?
>>>
>>> Thanks!
>>> Igor Kattar
>>>
>>
>>
>>
>> Links:
>> ------
>> [1]
>>
>> https://manifoldcf.apache.org/release/trunk/en_US/end-user-documentation.html#jcifsrepository
>>
>

Mime
View raw message