lucene-solr-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Anatharaman, Srinatha (Contractor)" <Srinatha_Ananthara...@comcast.com>
Subject RE: Need help in Tika on SolrCloud
Date Wed, 01 Feb 2017 20:32:18 GMT
Is there anyone to help me with my issue?

Your help is much appreciated



I figured out the problem but need solution

In my below data-config file tikaConfig.xml is not recognized by zookeeper (       processor="TikaEntityProcessor"
tikaConfig="tikaConfig.xml" )



<?xml version="1.0" encoding="UTF-8"?>

<dataConfig>

    <dataSource  name="bin" type="BinFileDataSource" />

        <document>

            <entity name="f" dataSource="fileSource" rootEntity="false"

            processor="FileListEntityProcessor"

            baseDir="/app/home/source/"

            fileName=".*\.(com)|(txt)|(docx)"

            onError="skip"

            recursive="true">

                <field column="fileAbsolutePath" name="path" />

                <field column="fileSize" name="size" />

                <field column="fileLastModified" name="lastModified" />

                <field column="link" name="link"/>



                <entity

                    name="documentImport" dataSource="bin"

                    processor="TikaEntityProcessor" tikaConfig="tikaConfig.xml"

                    url="${f.fileAbsolutePath}"

                    format="text">

                    <field column="file" name="fileName"/>

                    <field column="content" name="content"/>

                    <field column="Author" name="author" meta="true"/>

                    <field column="title" name="title" meta="true"/>

                    <field column="text" name="text"/>



                </entity>

        </entity>

        </document>

</dataConfig>









-----Original Message-----
From: Anatharaman, Srinatha (Contractor) [mailto:Srinatha_Anantharaman@comcast.com]
Sent: Wednesday, February 01, 2017 11:51 AM
To: solr-user@lucene.apache.org
Subject: RE: Need help in Tika on SolrCloud



Hi All,



I see below code which is causing my code not to work in SolrCloud

  @Override





  public String getConfigDir() {





    throw new ZooKeeperException(





        ErrorCode.SERVER_ERROR,





        "ZkSolrResourceLoader does not support getConfigDir() - likely, what you are trying
to do is not supported in ZooKeeper mode");





  }







https://github.com/apache/lucene-solr/blob/branch_6_3/solr/core/src/java/org/apache/solr/cloud/ZkSolrResourceLoader.java



Can someone help me with work around



ERROR :

2017-02-01 16:39:55.932 ERROR (Thread-20) [c:dsearch s:shard2 r:core_node3 x:dsearch_shard2_replica2]
o.a.s.h.d.DataImporter Full Import failed:java.lang.RuntimeException: java.lang.RuntimeException:
org.apache.solr.handler.dataimport.DataImportHandlerException: Unable to load Tika Config
Processing Document # 1

        at org.apache.solr.handler.dataimport.DocBuilder.execute(DocBuilder.java:270)

        at org.apache.solr.handler.dataimport.DataImporter.doFullImport(DataImporter.java:416)

        at org.apache.solr.handler.dataimport.DataImporter.runCmd(DataImporter.java:475)

        at org.apache.solr.handler.dataimport.DataImporter.lambda$runAsync$0(DataImporter.java:458)

        at java.lang.Thread.run(Thread.java:745)

Caused by: java.lang.RuntimeException: org.apache.solr.handler.dataimport.DataImportHandlerException:
Unable to load Tika Config Processing Document # 1

        at org.apache.solr.handler.dataimport.DocBuilder.buildDocument(DocBuilder.java:416)

        at org.apache.solr.handler.dataimport.DocBuilder.doFullDump(DocBuilder.java:329)

        at org.apache.solr.handler.dataimport.DocBuilder.execute(DocBuilder.java:232)

        ... 4 more

Caused by: org.apache.solr.handler.dataimport.DataImportHandlerException: Unable to load Tika
Config Processing Document # 1

        at org.apache.solr.handler.dataimport.DataImportHandlerException.wrapAndThrow(DataImportHandlerException.java:69)

        at org.apache.solr.handler.dataimport.TikaEntityProcessor.firstInit(TikaEntityProcessor.java:96)

        at org.apache.solr.handler.dataimport.EntityProcessorBase.init(EntityProcessorBase.java:60)

        at org.apache.solr.handler.dataimport.TikaEntityProcessor.init(TikaEntityProcessor.java:76)

        at org.apache.solr.handler.dataimport.EntityProcessorWrapper.init(EntityProcessorWrapper.java:75)

        at org.apache.solr.handler.dataimport.DocBuilder.buildDocument(DocBuilder.java:433)

        at org.apache.solr.handler.dataimport.DocBuilder.buildDocument(DocBuilder.java:516)

        at org.apache.solr.handler.dataimport.DocBuilder.buildDocument(DocBuilder.java:414)

        ... 6 more

Caused by: org.apache.solr.common.cloud.ZooKeeperException: ZkSolrResourceLoader does not
support getConfigDir() - likely, what you are trying to do is not supported in ZooKeeper mode

        at org.apache.solr.cloud.ZkSolrResourceLoader.getConfigDir(ZkSolrResourceLoader.java:151)

        at org.apache.solr.handler.dataimport.TikaEntityProcessor.firstInit(TikaEntityProcessor.java:91)

        ... 12 more





Thanks,

~Sri



From: Anatharaman, Srinatha (Contractor)

Sent: Wednesday, February 01, 2017 10:04 AM

To: 'solr-user@lucene.apache.org' <solr-user@lucene.apache.org<mailto:solr-user@lucene.apache.org>>

Subject: Need help in Tika on SolrCloud



Hi,



I am new to Solr, I have implemented Solr on single node and my code is working good When
I move the same code to SolrCloud it fails (I made few changes for SolrCloud)



I am trying to load data using Dataimporthandler but it throws error as below



2017-02-01 03:23:07.727 ERROR (Thread-18) [c:dsearch s:shard2 r:core_node1 x:dsearch_shard2_replica1]
o.a.s.h.d.DataImporter Full Import failed:java.lang.RuntimeException: java.lang.RuntimeException:
org.apache.solr.handler.dataimport.DataImportHandlerException: Unable to load Tika Config
Processing Document # 1

        at org.apache.solr.handler.dataimport.DocBuilder.execute(DocBuilder.java:270)

        at org.apache.solr.handler.dataimport.DataImporter.doFullImport(DataImporter.java:416)

        at org.apache.solr.handler.dataimport.DataImporter.runCmd(DataImporter.java:475)

        at org.apache.solr.handler.dataimport.DataImporter.lambda$runAsync$0(DataImporter.java:458)

        at java.lang.Thread.run(Thread.java:745)

Caused by: java.lang.RuntimeException: org.apache.solr.handler.dataimport.DataImportHandlerException:
Unable to load Tika Config Processing Document # 1

        at org.apache.solr.handler.dataimport.DocBuilder.buildDocument(DocBuilder.java:416)

        at org.apache.solr.handler.dataimport.DocBuilder.doFullDump(DocBuilder.java:329)

        at org.apache.solr.handler.dataimport.DocBuilder.execute(DocBuilder.java:232)

        ... 4 more

Caused by: org.apache.solr.handler.dataimport.DataImportHandlerException: Unable to load Tika
Config Processing Document # 1

        at org.apache.solr.handler.dataimport.DataImportHandlerException.wrapAndThrow(DataImportHandlerException.java:69)

        at org.apache.solr.handler.dataimport.TikaEntityProcessor.firstInit(TikaEntityProcessor.java:96)

        at org.apache.solr.handler.dataimport.EntityProcessorBase.init(EntityProcessorBase.java:60)

        at org.apache.solr.handler.dataimport.TikaEntityProcessor.init(TikaEntityProcessor.java:76)

        at org.apache.solr.handler.dataimport.EntityProcessorWrapper.init(EntityProcessorWrapper.java:75)

        at org.apache.solr.handler.dataimport.DocBuilder.buildDocument(DocBuilder.java:433)

        at org.apache.solr.handler.dataimport.DocBuilder.buildDocument(DocBuilder.java:516)

        at org.apache.solr.handler.dataimport.DocBuilder.buildDocument(DocBuilder.java:414)

        ... 6 more

Caused by: org.apache.solr.common.cloud.ZooKeeperException: ZkSolrResourceLoader does not
support getConfigDir() - likely, what you are trying to do is not supported in ZooKeeper mode

        at org.apache.solr.cloud.ZkSolrResourceLoader.getConfigDir(ZkSolrResourceLoader.java:151)

        at org.apache.solr.handler.dataimport.TikaEntityProcessor.firstInit(TikaEntityProcessor.java:91)

        ... 12 more





I have attached the code for your reference Could you please help me with the solution



Regards,

~Sri







Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message