nutch-user mailing list archives

From Gabriele Kahlout <gabri...@mysimpatico.com>
Subject Re: Solr 4.0
Date Mon, 09 May 2011 09:31:59 GMT
I think you are talking about this dependency:

<dependency org="org.apache.solr" name="solr-solrj" rev="1.4.1"
conf="*->default" />

I've checked out Solr 4 at svn revision 1099940 [1]. What value should I use for
rev?

[1]
http://lucene.472066.n3.nabble.com/Is-it-possible-to-build-Solr-as-a-maven-project-tp2898068p2905051.html
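
For illustration only, the edited line in ivy/ivy.xml might look like the sketch below. The rev value 4.0-SNAPSHOT is an assumption and is not confirmed anywhere in this thread. Ivy can only resolve a revision that is actually published in a repository it is configured to use, so a jar built from an svn checkout would have to be published under that revision first, or copied directly into the lib directory instead.

```xml
<!-- ivy/ivy.xml (sketch): only the rev attribute changes.
     "4.0-SNAPSHOT" is an assumed value; use whatever revision
     your configured repository actually publishes. -->
<dependency org="org.apache.solr" name="solr-solrj" rev="4.0-SNAPSHOT"
            conf="*->default" />
```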

On Tue, Apr 19, 2011 at 2:48 PM, Julien Nioche <
lists.digitalpebble@gmail.com> wrote:

> You need to change the version of Solr in ivy/ivy.xml and then rebuild, unless
> you replace the jars directly in nutch-1.3/runtime/local/lib, assuming
> that you're running Nutch locally only.
>
> On 19 April 2011 07:09, Haspadar <haspadar@gmail.com> wrote:
>
> > Yes, it occurred after removing the SolrJ 1.4 jar and copying in the 4.0 version.
> > Before that I upgraded Nutch for Solr 3.1 the same way and everything worked fine.
> >
> > Thanks
> >
> > 2011/4/19 Markus Jelsma <markus.jelsma@openindex.io>
> >
> > > Hi,
> > >
> > > > Hello.
> > > > I'm using Nutch 1.3. I decided to upgrade Solr to version 4.0 and
> > > > replaced the Nutch libs (Snapshot and SolrJ) with those from the Solr dist.
> > > > After that I got this error in SolrIndexer at the reduce stage:
> > > >
> > > > 11/04/19 01:47:19 INFO mapred.JobClient:  map 100% reduce 27%
> > > > 11/04/19 01:47:21 INFO mapred.JobClient: Task Id : attempt_201104190142_0009_r_000000_0, Status : FAILED
> > > > org.apache.solr.common.SolrException: ERROR: [doc=http://www.site.net/]
> > > > Error adding field 'tstamp'='2011-04-18T22:45:17.404Z'
> > > >
> > > > ERROR: [doc=http://www.site.net/] Error adding field
> > > > 'tstamp'='2011-04-18T22:45:17.404Z'
> > > >
> > > > request: http://127.0.0.1:8983/solr/update?wt=javabin&version=2
> > > >     at org.apache.solr.client.solrj.impl.CommonsHttpSolrServer.request(CommonsHttpSolrServer.java:436)
> > > >     at org.apache.solr.client.solrj.impl.CommonsHttpSolrServer.request(CommonsHttpSolrServer.java:245)
> > > >     at org.apache.solr.client.solrj.request.AbstractUpdateRequest.process(AbstractUpdateRequest.java:105)
> > > >     at org.apache.solr.client.solrj.SolrServer.add(SolrServer.java:50)
> > > >     at org.apache.nutch.indexer.solr.SolrWriter.close(SolrWriter.java:75)
> > > >     at org.apache.nutch.indexer.IndexerOutputFormat$1.close(IndexerOutputFormat.java:48)
> > > >     at org.apache.hadoop.mapred.ReduceTask.runOldReducer(ReduceTask.java:474)
> > > >     at org.apache.hadoop.mapred.ReduceTask.run(ReduceTask.java:411)
> > > >     at org.apache.hadoop.mapred.Child.main(Child.java:170)
> > >
> > > If you are using a Solr version later than 1.4.x then you must upgrade the
> > > SolrJ jars in Nutch. Solr 1.4.x and later versions are not compatible with
> > > each other. Just remove the 1.4.x jars and copy over the new ones.
> > >
> > > >
> > > > I tried removing tstamp from solrindex-mapping.xml and from Solr's
> > > > schema.xml, but the field is still required in schema.xml and I got this error:
> > > >
> > > > 11/04/19 01:58:03 INFO mapred.JobClient: Task Id : attempt_201104190142_0010_r_000000_0, Status : FAILED
> > > > org.apache.solr.common.SolrException: ERROR: [doc=http://www.site.net/]
> > > > unknown field 'tstamp'
> > > >
> > > > ERROR: [doc=http://www.site.net/] unknown field 'tstamp'
> > >
> > > Removing a mapping doesn't mean the field isn't copied over: all unmapped
> > > fields are copied as is. The example mapping is of little use because it maps
> > > each field to an identical name. A mapping only helps if your source and
> > > destination field names actually differ, which is usually not the case if
> > > you dedicate a Solr core to a Nutch crawl.
> > >
> > > You must either stop whichever plugin creates the field from doing so, or
> > > add the field to your Solr index.
> > >
> > > I'm surprised this error actually showed up, considering the incompatible
> > > Javabin versions. Perhaps you already upgraded the SolrJ API?
> > >
> > > >
> > > > How can I upgrade Solr to version 4?
> > > >
> > > > Thank you.
> > >
> >
>
>
>
> --
> Open Source Solutions for Text Engineering
>
> http://digitalpebble.blogspot.com/
> http://www.digitalpebble.com
>
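
On the quoted "Error adding field 'tstamp'" message: one possible reading, not confirmed in this thread, is that the Solr schema declares tstamp with a type that cannot parse an ISO 8601 timestamp such as 2011-04-18T22:45:17.404Z. A minimal sketch of a field declaration that accepts such values follows. It assumes a fieldType named "date" is already defined in the same schema.xml, and the stored/indexed settings are illustrative only.

```xml
<!-- Solr schema.xml (sketch): assumes <fieldType name="date" .../> exists
     elsewhere in this schema; stored/indexed values are illustrative. -->
<field name="tstamp" type="date" stored="true" indexed="false"/>
```

Solr has to be restarted (or the core reloaded) for a schema change to take effect, and documents indexed under the old definition may need to be reindexed.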



-- 
Regards,
K. Gabriele

--- unchanged since 20/9/10 ---
P.S. If the subject contains "[LON]" or the addressee acknowledges the
receipt within 48 hours then I don't resend the email.
subject(this) ∈ L(LON*) ∨ ∃x. (x ∈ MyInbox ∧ Acknowledges(x, this) ∧ time(x)
< Now + 48h) ⇒ ¬resend(I, this).

If an email is sent by a sender that is not a trusted contact or the email
does not contain a valid code then the email is not received. A valid code
starts with a hyphen and ends with "X".
∀x. x ∈ MyInbox ⇒ from(x) ∈ MySafeSenderList ∨ (∃y. y ∈ subject(x) ∧ y ∈
L(-[a-z]+[0-9]X)).
