Return-Path: X-Original-To: apmail-lucene-java-user-archive@www.apache.org Delivered-To: apmail-lucene-java-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 9BFD6415C for ; Sun, 15 May 2011 20:55:57 +0000 (UTC) Received: (qmail 74037 invoked by uid 500); 15 May 2011 20:55:55 -0000 Delivered-To: apmail-lucene-java-user-archive@lucene.apache.org Received: (qmail 73991 invoked by uid 500); 15 May 2011 20:55:55 -0000 Mailing-List: contact java-user-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: java-user@lucene.apache.org Delivered-To: mailing list java-user@lucene.apache.org Received: (qmail 73983 invoked by uid 99); 15 May 2011 20:55:55 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Sun, 15 May 2011 20:55:55 +0000 X-ASF-Spam-Status: No, hits=1.3 required=5.0 tests=RCVD_IN_DNSWL_NONE,SPF_PASS,URI_HEX X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of karl.wright@nokia.com designates 147.243.1.48 as permitted sender) Received: from [147.243.1.48] (HELO mgw-sa02.nokia.com) (147.243.1.48) by apache.org (qpsmtpd/0.29) with ESMTP; Sun, 15 May 2011 20:55:47 +0000 Received: from vaebh101.NOE.Nokia.com (vaebh101.europe.nokia.com [10.160.244.22]) by mgw-sa02.nokia.com (Switch-3.4.4/Switch-3.4.3) with ESMTP id p4FKtPSY024642 for ; Sun, 15 May 2011 23:55:25 +0300 Received: from smtp.mgd.nokia.com ([65.54.30.5]) by vaebh101.NOE.Nokia.com over TLS secured channel with Microsoft SMTPSVC(6.0.3790.4675); Sun, 15 May 2011 23:55:20 +0300 Received: from 008-AM1MMR1-006.mgdnok.nokia.com (65.54.30.61) by NOK-am1MHUB-01.mgdnok.nokia.com (65.54.30.5) with Microsoft SMTP Server (TLS) id 8.2.255.0; Sun, 15 May 2011 22:55:20 +0200 Received: from 008-AM1MPN1-037.mgdnok.nokia.com ([169.254.7.78]) by 008-AM1MMR1-006.mgdnok.nokia.com ([65.54.30.61]) with mapi id 14.01.0289.008; Sun, 15 May 2011 22:55:20 +0200 From: To: Subject: RE: [ANNOUNCE] Web Crawler Thread-Topic: [ANNOUNCE] Web Crawler Thread-Index: AQHMEz56cE/6yB0szky2z7ckVQvGApSOXkFg Date: Sun, 15 May 2011 20:55:19 +0000 Message-ID: <0C2ADA45C80B224FAFA38F5DEE16A16E0B5BA1@008-AM1MPN1-037.mgdnok.nokia.com> References: <4D6D8E6A.10004@eolya.fr> <1305379745166-2937762.post@n3.nabble.com> In-Reply-To: <1305379745166-2937762.post@n3.nabble.com> Accept-Language: en-US Content-Language: en-US X-MS-Has-Attach: X-MS-TNEF-Correlator: x-originating-ip: [173.76.203.71] Content-Type: text/plain; charset="us-ascii" Content-Transfer-Encoding: quoted-printable MIME-Version: 1.0 X-OriginalArrivalTime: 15 May 2011 20:55:20.0722 (UTC) FILETIME=[67355720:01CC1342] X-Nokia-AV: Clean You might want to look at ManifoldCF also. Karl -----Original Message----- From: ext abhayd [mailto:ajdabholkar@hotmail.com]=20 Sent: Saturday, May 14, 2011 9:29 AM To: java-user@lucene.apache.org Subject: Re: [ANNOUNCE] Web Crawler hi Dominique, I am looking for a crawler to feed solr index. After looking at various posts i have settled down on two Nutch and crawl anywhere. I dont see any activities on Nutch wiki so wondering if its not being developed anymore. But most forums say Nutch is standard for solr. Crawl Anywhere looks solid. Any way for users like me to decide which one w= e should go for Nutch or Crawl Anywehre? Concern with crawl anywhere is it supports solr 1.3 index not the latest version Any help on the is really appreciated=20 -- View this message in context: http://lucene.472066.n3.nabble.com/ANNOUNCE-W= eb-Crawler-tp2607833p2937762.html Sent from the Lucene - Java Users mailing list archive at Nabble.com. --------------------------------------------------------------------- To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org For additional commands, e-mail: java-user-help@lucene.apache.org --------------------------------------------------------------------- To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org For additional commands, e-mail: java-user-help@lucene.apache.org