lucene-solr-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Furkan KAMACI <furkankam...@gmail.com>
Subject Re: What is the difference between SolrCell based Tika and Tika in Nuch?
Date Sat, 21 Mar 2015 17:36:53 GMT
Hi,

Which versions of Solr and Nutch do you use? Nutch and Solr supports Tika
1.7 at their recent versions.

Kind Regards,
Furkan KAMACI

On Sat, Mar 21, 2015 at 6:46 PM, Erick Erickson <erickerickson@gmail.com>
wrote:

> Well, they could be different versions of Tika, don't know. You can
> tell this from the respective jars in the two projects.
>
> But more importantly, _how_ the fields from Nutch-based Tika maps into
> Solr fields and how they're mapped in SolrCel may be different, but
> this would be because your configurations are different. What I'm
> saying is that _you_ have to insure that your configs do the same
> mapping of extracted meta-data to Solr fields.
>
> Best,
> Erick
>
> On Fri, Mar 20, 2015 at 9:11 PM, zhangxin0804 <zhangxin0804@gmail.com>
> wrote:
> > Hi All,
> >
> >      I am new to Solr. I have a question as follows:
> >      Is there any difference between extract metadata using Tika in Nutch
> > and extract metadata using SolrCell based Tika? I used these two ways to
> > extract metada from PDF files and PNG files, but they almost same. Can
> > anyone tell me about this ?
> >     Thank you so much.
> >
> >
> >
> >
> > --
> > View this message in context:
> http://lucene.472066.n3.nabble.com/What-is-the-difference-between-SolrCell-based-Tika-and-Tika-in-Nuch-tp4194372.html
> > Sent from the Solr - User mailing list archive at Nabble.com.
>

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message