lucene-solr-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Charlie Hull <char...@flax.co.uk>
Subject Re: Quick Query about
Date Thu, 09 Nov 2017 09:28:01 GMT
On 09/11/2017 09:13, Karan Saini wrote:
> Hi there,

Hi Karan,

Have you tried the syntax baseDir="//servername/sharedfoldername" ? I 
believe this should work on a Windows network.

Regards

Charlie

> 
> I am new to the Apache Solr and currently exploring how to use this
> technology to search in the PDF files.
> 
> <https://lucene.apache.org/solr/guide/6_6/uploading-structured-data-store-data-with-the-data-import-handler.html#the-tikaentityprocessor>
> https://lucene.apache.org/solr/guide/6_6/uploading-structured-data-store-data-with-the-data-import-handler.html#the-tikaentityprocessor
> 
> <https://lucene.apache.org/solr/guide/6_6/uploading-structured-data-store-data-with-the-data-import-handler.html#the-tikaentityprocessor>
> 
> <https://lucene.apache.org/solr/guide/6_6/uploading-structured-data-store-data-with-the-data-import-handler.html#the-tikaentityprocessor>
> I am able to index the PDF files using the "BinFileDataSource" for the PDF
> files within the same server as shown in the below example.
> 
> Now i want to know if there is a way to change the baseDir pointing to the
> folder present under a different server.
> 
> Please suggest an example to access the PDF files from another server.
> 
> 
> <dataConfig>
>    *<dataSource type="BinFileDataSource"/> <!--Local filesystem-->*
>    <document>
>      <entity name="K2FileEntity" processor="FileListEntityProcessor"
> dataSource="null"
>              recursive = "true"
>              *baseDir="C:/solr-6.6.1/server/solr/core_K2_Depot/Depot"*
> fileName=".*pdf" rootEntity="false">
> 
>              <field column="file" name="id"/>
>              <field column="fileSize" name="size" />-->
>              <field column="fileLastModified" name="lastmodified" />
> 
>                <entity name="pdf" processor="TikaEntityProcessor"
> onError="skip"
>                        url="${K2FileEntity.fileAbsolutePath}" format="text">
> 
>                  <field column="title" name="title" meta="true"/>
>                  <field column="dc:format" name="format" meta="true"/>
>                  <field column="text" name="text"/>
> 
>                </entity>
>      </entity>
>    </document>
> </dataConfig>
> 
> 
> Kind regards,
> Karan
> 


-- 
Charlie Hull
Flax - Open Source Enterprise Search

tel/fax: +44 (0)8700 118334
mobile:  +44 (0)7767 825828
web: www.flax.co.uk

Mime
View raw message