lucene-solr-user mailing list archives

From Otis Gospodnetic <otis_gospodne...@yahoo.com>
Subject Re: Please help me integrate Nutch with Solr
Date Mon, 29 Dec 2008 05:18:39 GMT
Tony,

I think you should ignore the advice/code from the foofactory blog and just go with NUTCH-442,
as that's most likely going to result in "the official" Nutch-Solr integration.

Otis
--
Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch



----- Original Message ----
> From: Tony Wang <ivytony@gmail.com>
> To: solr-user@lucene.apache.org
> Sent: Saturday, December 27, 2008 2:46:12 PM
> Subject: Re: Please help me integrate Nutch with Solr
> 
> Hi Dingding,
> 
> Thanks for the message. I followed this tutorial here
> http://blog.foofactory.fi/2007/02/online-indexing-integrating-nutch-with.html,
> trying to integrate Nutch with Solr. Right now, my Nutch installation is
> successful (see http://208.64.71.46:8080, use keywords like DNA or protein,
> etc)
> 
> I am not sure if I understand you right. Should I apply that patch from the
> link you gave me also? I use the latest nightly builds of Nutch and Solr.
> The jsp servlet container is Tomcat6 on my CentOS 5.2 box.
> 
> Thanks again.
> 
> tony
> 
> On Sat, Dec 27, 2008 at 12:38 PM, Dingding Ye wrote:
> 
> > Hi,  Wang.
> >
> > Solr should act as the backend indexing server for Nutch instead of the
> > Lucene index.
> >
> > You have to patch Nutch first.
> >
> > Check this list:
> >
> >
> > https://issues.apache.org/jira/browse/NUTCH-442?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12607005#action_12607005
> >
> > On Sun, Dec 28, 2008 at 3:31 AM, Tony Wang wrote:
> >
> > > Yes, I kind of jumped into the middle of the learning curve, which seems
> > > rather steep to me.
> > >
> > > I've read many tutorials so far, and it seems that data is added to Solr
> > > via XML documents. I wonder how to convert Nutch's indexing data to XML
> > > that could be added to Solr. Or are there other, simpler ways of
> > > achieving the conversion?
> > >
> > > I just watched this slide
> > > http://www.slideshare.net/pittaya/using-apache-solr
> > >
> > > Thanks!
> > >
> > > Tony
> > >
> > > On Fri, Dec 26, 2008 at 11:25 AM, Otis Gospodnetic <
> > > otis_gospodnetic@yahoo.com> wrote:
> > >
> > > > Tony,
> > > >
> > > > You started an example Solr application.  This application has no
> > > > knowledge of your Nutch setup and the Lucene index that you've
> > > > presumably built with Nutch.  The simplest thing you can do to get
> > > > something to "work" is to locate your Lucene/Nutch index and copy all
> > > > of its contents to the solr/example/solr/data/index/ directory.  You
> > > > will need to adjust the schema to match the Lucene/Nutch index fields,
> > > > too.
> > > >
> > > > But honestly, it looks like you are "starting from the middle" without
> > > > really following things step-by-step and without really understanding
> > > > either Nutch or Solr.  My suggestion is to first play only with Nutch
> > > > and learn how to run various Nutch steps, all the way to generating an
> > > > index.  Then play with Solr (and forget about Nutch) by following the
> > > > Solr tutorial.  Once you get Solr by itself working, you will
> > > > understand how the Nutch and Solr combination can work together.
> > > >
> > > > Otis
> > > > --
> > > > Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch
> > > >
> > > >
> > > >
> > > > ----- Original Message ----
> > > > > From: Tony Wang 
> > > > > To: solr-user@lucene.apache.org
> > > > > Sent: Friday, December 26, 2008 12:10:39 PM
> > > > > Subject: Re: Please help me integrate Nutch with Solr
> > > > >
> > > > > Thanks for the reply, I appreciate it.
> > > > >
> > > > > Basically, I downloaded the latest nightly build of Solr --> unzipped
> > > > > it --> shut down Tomcat --> copied the solr.war file to the Tomcat
> > > > > webapps container (/opt/tomcat6/webapps) --> started Tomcat --> set up
> > > > > the schema.xml file located in
> > > > > /opt/tomcat6/webapps/solr/example/solr/conf.
> > > > >
> > > > > After all of the above operations, I went to the directory
> > > > > /opt/tomcat6/webapps/solr/example and ran 'java -jar start.jar', and
> > > > > port 8983 was opened. See my Solr in action at
> > > > > http://208.64.71.46:8983/solr/ (you can do a search from there or in
> > > > > the admin panel).
> > > > >
> > > > > If what I did was wrong, could you please give me some instructions? I
> > > > > appreciate it!
> > > > >
> > > > > Happy holidays!
> > > > >
> > > > > Tony
> > > > >
> > > > >
> > > > > On Fri, Dec 26, 2008 at 9:45 AM, Otis Gospodnetic <
> > > > > otis_gospodnetic@yahoo.com> wrote:
> > > > >
> > > > > > Hi Tony,
> > > > > >
> > > > > > It looks like you already asked on the Nutch list.  Maybe just stick
> > > > > > to one list?
> > > > > > It's impossible to tell what went wrong from the information you
> > > > > > provided below.  Are there any errors in the log?  Are you sure your
> > > > > > solr home is set correctly?
> > > > > >
> > > > > > Otis
> > > > > > --
> > > > > > Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch
> > > > > >
> > > > > >
> > > > > >
> > > > > > ----- Original Message ----
> > > > > > > From: Tony Wang
> > > > > > > To: solr-user@lucene.apache.org
> > > > > > > Sent: Friday, December 26, 2008 11:20:06 AM
> > > > > > > Subject: Please help me integrate Nutch with Solr
> > > > > > >
> > > > > > > I got the web interface to work here:
> > > > > > > http://208.64.71.46:8080/search.jsp?lang=en&query=tony
> > > > > > > but my search query in Solr doesn't seem to work correctly. No
> > > > > > > results are retrieved from
> > > > > > > http://208.64.71.46:8983/solr/select?q=tony&start=0&rows=10&fl=title%2Curl%2Cscore&qt=standard&wt=standard&hl=on&hl.fl=content
> > > > > > >
> > > > > > > I followed this guide to integrate Nutch with Solr:
> > > > > > > http://blog.foofactory.fi/2007/02/online-indexing-integrating-nutch-with.html
> > > > > > > I wonder what could be wrong with my integration.
> > > > > > >
> > > > > > > I use CentOS 5.2, Tomcat6 and the latest nightly builds of Nutch & Solr.
> > > > > > >
> > > > > > > Thanks!
> > > > > > >
> > > > > > > Tony
> > > > > > >
> > > > > > > --
> > > > > > > Signature: Success is a journey that never ends.
> > > > > >
> > > > > >
> > > > >
> > > > >
> > > > > --
> > > > > Signature: Success is a journey that never ends.
> > > >
> > > >
> > >
> > >
> > > --
> > > Signature: Success is a journey that never ends.
> > >
> >
> 
> 
> 
> -- 
> Signature: Success is a journey that never ends.
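
On the question quoted above about converting index data to XML for Solr: the
sketch below builds a Solr XML "add" message (<add><doc><field name="...">)
from plain dictionaries. It is only an illustration and not part of NUTCH-442.
The field names (id, url, title, content) and the sample values are
assumptions, loosely based on the fields in the query URL quoted in this
thread; they would have to match whatever your schema.xml actually defines.

```python
import xml.etree.ElementTree as ET


def build_add_xml(docs):
    """Serialize a list of dicts into a Solr XML <add> update message.

    Each dict becomes one <doc>; each key/value pair becomes one
    <field name="key">value</field>. Special characters are escaped
    by ElementTree.
    """
    add = ET.Element("add")
    for doc in docs:
        doc_el = ET.SubElement(add, "doc")
        for name, value in doc.items():
            field = ET.SubElement(doc_el, "field", name=name)
            field.text = str(value)
    return ET.tostring(add, encoding="unicode")


if __name__ == "__main__":
    # Hypothetical document; field names are assumptions and must
    # exist in your schema.xml.
    sample = [{
        "id": "http://example.com/page.html",
        "url": "http://example.com/page.html",
        "title": "DNA & protein",
        "content": "Example page text.",
    }]
    print(build_add_xml(sample))
```

The resulting string would then be POSTed to Solr's update handler and followed
by a commit; that step is left out here because it depends on how your Solr
instance is deployed.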

