Return-Path: Delivered-To: apmail-nutch-user-archive@www.apache.org Received: (qmail 37131 invoked from network); 16 Jun 2010 19:51:34 -0000 Received: from unknown (HELO mail.apache.org) (140.211.11.3) by 140.211.11.9 with SMTP; 16 Jun 2010 19:51:34 -0000 Received: (qmail 16439 invoked by uid 500); 16 Jun 2010 19:51:33 -0000 Delivered-To: apmail-nutch-user-archive@nutch.apache.org Received: (qmail 16396 invoked by uid 500); 16 Jun 2010 19:51:32 -0000 Mailing-List: contact user-help@nutch.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@nutch.apache.org Delivered-To: mailing list user@nutch.apache.org Received: (qmail 16388 invoked by uid 99); 16 Jun 2010 19:51:32 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 16 Jun 2010 19:51:32 +0000 X-ASF-Spam-Status: No, hits=2.2 required=10.0 tests=FREEMAIL_FROM,HTML_MESSAGE,SPF_PASS,T_TO_NO_BRKTS_FREEMAIL X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of dean.delponte@gmail.com designates 209.85.213.182 as permitted sender) Received: from [209.85.213.182] (HELO mail-yx0-f182.google.com) (209.85.213.182) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 16 Jun 2010 19:51:26 +0000 Received: by yxm34 with SMTP id 34so2830375yxm.27 for ; Wed, 16 Jun 2010 12:51:04 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=domainkey-signature:mime-version:received:received:in-reply-to :references:date:message-id:subject:from:to:content-type; bh=7JLHGX0BqMMT5Thmq0M8ZFFbPNN+amcLqJ9Wcfq4EdE=; b=w0X8jRIng8tftbsko2BnCDMgq5ntuJ3qcIryOu2uD6UXK6GMydSFJEHHIwYNQnZzeq Z8A8eDN2uO6L2FDRQ389n7IulH0rsoQNo2g4NnbBZENcGpF8UeGgVyIoSe9nDWAH9KHb TOF7PkIk0KkfW4AnN4rKJja7y/3v2jPpgl2kc= DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=gamma; h=mime-version:in-reply-to:references:date:message-id:subject:from:to :content-type; b=t5XKbszuBzuZSbsTotXMoK6BI8eV367nP2ZghWd2Eu9UUoBD4vUGOGNRa7wxYmOC2H Ah3EsTGHXJwiM50k7Xw0w02ZMN19BaJSnu3zj55rylfBssnGkxr6dbMEH5h6CHPh3H82 zoBukomKy1cmdqeIxsceRwsG8hJ31i1dTfd1U= MIME-Version: 1.0 Received: by 10.229.248.2 with SMTP id me2mr4258795qcb.44.1276717864417; Wed, 16 Jun 2010 12:51:04 -0700 (PDT) Received: by 10.229.241.19 with HTTP; Wed, 16 Jun 2010 12:51:04 -0700 (PDT) In-Reply-To: References: Date: Wed, 16 Jun 2010 14:51:04 -0500 Message-ID: Subject: Re: Solr 1.4 and Nutch 1.0 Integration From: Dean Del Ponte To: user@nutch.apache.org Content-Type: multipart/alternative; boundary=0016e64653fac97d4104892b0aaa --0016e64653fac97d4104892b0aaa Content-Type: text/plain; charset=ISO-8859-1 Thanks for the offer, but no funding to hire consultants! The google search appliance, http://www.google.com/enterprise/search/mini.html, crawls your site, indexes it and makes the content available for search queries. I thought this could be done with solr and nutch as well. I'm under the impression the solr/nutch integration can be done, but it's not easy. On Wed, Jun 16, 2010 at 2:13 PM, Christopher Bader wrote: > Dean, > > I'm not sure what you mean by a "Google search appliance type experience", > but you don't need Solr to create a site-specific search engine. > > Nutch and Lucene are enough. > > Contact us if you need a Nutch/Lucene consultant. > > CB > > > On Wed, Jun 16, 2010 at 1:17 PM, Dean Del Ponte >wrote: > > > I'm new to Solr, but I'm interested in setting it up to act like a google > > search appliance to crawl and index my website. > > > > It's my understanding that nutch provides the web crawling but needs to > be > > integrated with Solr in order to get a google search appliance type > > experience. > > > > Two questions: > > > > 1. Is the scenario I'm outlining above possible? > > 2. If it is possible, where may I found documentation describing how to > > set > > up a Solr/Nutch instance? > > > > Thanks for your help, > > > > Dean Del Ponte > > > --0016e64653fac97d4104892b0aaa--