Mailing list archives: November 2007

Site index · List index
Message list1 · 2 · 3 · Next »Thread · Author · Date
Ê©ÐË dfs.DataNode - Failed to transfer blk_xxxx to 192.168.140.244:50010 got java.net.SocketException: Connection reset Tue, 20 Nov, 03:18
Guido García Bernardo several requests with different headers to the same resource Fri, 23 Nov, 09:48
Doğacan Güney Re: java.io.IOException: Unknown format version:-3 Wed, 14 Nov, 10:53
Doğacan Güney Re: A record version mismatch occured. Expecting v5, found v69 Mon, 19 Nov, 21:00
Abdou RABBA trying to configure nutch-0.9 Wed, 21 Nov, 12:30
Alexis Votta NullPointerException with trunk Tue, 27 Nov, 14:11
Alvaro Cabrerizo Re: multiple crawl-urlfilter.txt files for different sites Wed, 07 Nov, 08:50
Ana Rodighiero can't find hadoop classes necessary to use Nutch API Wed, 28 Nov, 21:44
Anarus Is there any plugin for data extraction using Xpath, XQuery or regex for nutch Sat, 03 Nov, 09:13
Andrzej Bialecki Re: Higher depth, fewer urls? Thu, 15 Nov, 16:55
Andrzej Bialecki Re: dfs.DataNode - Failed to transfer blk_xxxx to 192.168.140.244:50010 got java.net.SocketException: Connection reset Tue, 20 Nov, 11:57
Andrzej Bialecki Re: fetch: An unexpected error has been detected by Java Runtime Environment Wed, 28 Nov, 09:08
Andrzej Bialecki Re: How to read crawldb Wed, 28 Nov, 09:10
Annona Keene Higher depth, fewer urls? Wed, 14 Nov, 16:45
Bolle, Jeffrey F. Crash in Parser Mon, 26 Nov, 20:08
Bolle, Jeffrey F. RE: Crash in Parser Mon, 26 Nov, 20:12
Bolle, Jeffrey F. RE: Crash in Parser Tue, 27 Nov, 23:16
Brehm, Robert P Error when using nutch Wed, 14 Nov, 23:34
Brehm, Robert P RE: Error when using nutch Tue, 27 Nov, 22:54
Carl Cerecke Re: help for a nutch beginner Tue, 06 Nov, 21:30
Chee Wu Re: I only need fetcher of Nutch,i need not index of Nutch.How to i input segments to my database's tables. Tue, 06 Nov, 09:19
Chee Wu Re: Template/Menu Detection Wed, 07 Nov, 03:06
Chee Wu Re: how can i get the document object in Nutch. Wed, 07 Nov, 08:29
Chee Wu Re: How can I know the Cached Web Charset Thu, 08 Nov, 08:59
Chris Mattmann Re: java.lang.NoClassDefFoundError Nutch 0.9 Thu, 08 Nov, 20:19
Christoph M. URL-Filter for ?indexing?? Tue, 27 Nov, 20:30
Christopher Condit PDF Indexing Problem Tue, 20 Nov, 20:00
Cool Coder Crawl API Help Wed, 21 Nov, 22:18
Cool Coder Re: crawl only option for Crawl.java and crawled content reader class Sat, 24 Nov, 01:51
Cool Coder Re: crawl only option for Crawl.java and crawled content reader class Tue, 27 Nov, 16:29
Cool Coder How to read crawldb Tue, 27 Nov, 22:20
Cool Coder Re: How to read crawldb Wed, 28 Nov, 02:14
Cool Coder Re: How to read crawldb Wed, 28 Nov, 19:20
Cool Coder Merge indexes using nutch v 0.9 Thu, 29 Nov, 21:05
Daniel Clark RE: Out of Memory Error While Crawling Mon, 05 Nov, 17:48
Daniel Clark Cluster hadoop-site.xml Settings Thu, 08 Nov, 18:46
Daniele Zuco graphExtractor.pl Fri, 23 Nov, 19:24
Daniele Zuco Usage readdb dump Tue, 27 Nov, 08:10
Dawid Weiss Re: Language not supported in Carrot2 Sat, 03 Nov, 21:05
Dennis Kubes Re: How to writes the results of successful fetcher to database. Mon, 12 Nov, 03:07
Dennis Kubes Re: How to writes the results of successful fetcher to database. Mon, 12 Nov, 17:19
Dennis Kubes Re: URI is not absolute... Wed, 14 Nov, 16:57
Dennis Kubes Re: URI is not absolute... Thu, 15 Nov, 18:13
Dennis Kubes Re: NullPointerException with trunk Tue, 27 Nov, 16:47
Dennis Kubes Re: NullPointerException with trunk Tue, 27 Nov, 20:16
DigitalPebble nutch-user@lucene.apache.org Wed, 07 Nov, 14:36
Emmanuel Template/Menu Detection Mon, 05 Nov, 15:11
Enis Soztutar Re: Multiple Domains Search Mon, 05 Nov, 07:59
Enis Soztutar Re: Hadoop .15 and eclipse on windows Fri, 09 Nov, 14:00
Enis Soztutar Re: Hadoop .15 and eclipse on windows Fri, 09 Nov, 16:20
Espen Amble Kolstad Re: Generate times Wed, 28 Nov, 11:14
Isabel Drost Re: crawl only option for Crawl.java and crawled content reader class Mon, 26 Nov, 20:31
Isabel Drost Re: crawl only option for Crawl.java and crawled content reader class Mon, 26 Nov, 20:57
Isabel Drost Re: crawl only option for Crawl.java and crawled content reader class Tue, 27 Nov, 21:31
Jasper Kamperman Re: search custom field with search.jsp Thu, 08 Nov, 19:35
Jasper Kamperman Re: very low fieldnorm leading to bad results Fri, 16 Nov, 18:44
Jose C. Lacal Newbie question: fetching specific files only. Mon, 26 Nov, 20:47
Jose C. Lacal Newbie question: fetching specific files only. Wed, 28 Nov, 05:46
Josh Attenberg help for a nutch beginner Tue, 06 Nov, 15:06
Josh Attenberg Re: help for a nutch beginner Thu, 08 Nov, 14:04
Josh Attenberg error using JobStream.py Thu, 08 Nov, 21:25
Josh Attenberg Re: error using JobStream.py Fri, 09 Nov, 01:47
Josh Attenberg Re: help for a nutch beginner Fri, 09 Nov, 21:58
Josh Attenberg Re: help for a nutch beginner Wed, 14 Nov, 13:59
Josh Attenberg A record version mismatch occured. Expecting v5, found v69 Sun, 18 Nov, 19:41
Josh Attenberg Re: A record version mismatch occured. Expecting v5, found v69 Mon, 19 Nov, 19:44
Josh Attenberg Re: A record version mismatch occured. Expecting v5, found v69 Mon, 19 Nov, 20:53
Josh Attenberg No space left on device Wed, 21 Nov, 03:24
Josh Attenberg Re: No space left on device Wed, 21 Nov, 04:58
Josh Attenberg Re: No space left on device Wed, 21 Nov, 13:25
Josh Attenberg Re: No space left on device Wed, 21 Nov, 22:01
Josh Attenberg Re: No space left on device Thu, 22 Nov, 23:02
Josh Attenberg fetch: An unexpected error has been detected by Java Runtime Environment Wed, 28 Nov, 01:13
Josh Attenberg very poor fetch performance with nutch .8 Wed, 28 Nov, 19:50
Josh Attenberg Re: very poor fetch performance with nutch .8 Thu, 29 Nov, 16:18
Karol Rybak Reduce copy slow ? Tue, 06 Nov, 13:23
Karol Rybak Problem with partititioning Tue, 06 Nov, 13:58
Karol Rybak Re: Problem with partititioning Tue, 06 Nov, 14:02
Karol Rybak Re: How i can read the index of Nutch by Lucene's IndexReader. Wed, 07 Nov, 09:35
Karol Rybak Re: Cluster hadoop-site.xml Settings Fri, 09 Nov, 14:07
Karol Rybak Re: help for a nutch beginner Thu, 15 Nov, 10:19
Karol Rybak Re: Crash in Parser Mon, 26 Nov, 22:26
Karol Rybak Generate times Mon, 26 Nov, 23:02
Ken Krugler RE: Hardware Planning Thu, 29 Nov, 16:12
Koe Black maintainability of nutch - building incremental index Fri, 30 Nov, 01:38
Kunal Wku Out of Memory Error While Crawling Mon, 05 Nov, 17:28
Lev Kantorovich nutch 0.9 and eclipse 3.3 - Mon, 19 Nov, 19:18
Lyndon Maydwell Re: No space left on device Wed, 21 Nov, 09:39
Mark Bennett Nutch-0.9 plugins, trouble with ant 1.6.5 and 1.7 Sat, 10 Nov, 01:29
Mark Bennett RE: Nutch-0.9 plugins, trouble with ant 1.6.5 and 1.7 Sat, 10 Nov, 17:36
Martin Xu Is Nutch Administration still active? Thu, 01 Nov, 02:52
Matei Zaharia Fetching many pages off LAN Sat, 10 Nov, 19:57
Matei Zaharia Re: Fetching many pages off LAN Sat, 10 Nov, 22:47
Matei Zaharia Re: Fetching many pages off LAN Sun, 11 Nov, 01:20
Matei Zaharia Re: Fetching many pages off LAN Mon, 12 Nov, 08:27
Matei Zaharia Reduce job in invertlinks and index tasks often fails Sun, 18 Nov, 04:07
Matt Kangas Re: URL-Filter for ?indexing?? Thu, 29 Nov, 05:22
Milan Krendzelak Re: How i can read the index of Nutch by Lucene's IndexReader. Wed, 07 Nov, 10:50
Milan Krendzelak SaveSearch or Adult Filter Wed, 07 Nov, 14:24
Milan Krendzelak Re: SaveSearch or Adult Wed, 07 Nov, 16:07
Message list1 · 2 · 3 · Next »Thread · Author · Date
Box list
Nov 2009269
Oct 2009258
Sep 2009184
Aug 2009199
Jul 2009312
Jun 2009196
May 2009163
Apr 2009247
Mar 2009408
Feb 2009214
Jan 2009204
Dec 2008229
Nov 2008193
Oct 2008171
Sep 2008269
Aug 2008165
Jul 2008122
Jun 2008243
May 2008220
Apr 2008294
Mar 2008209
Feb 2008191
Jan 2008272
Dec 2007145
Nov 2007228
Oct 2007261
Sep 2007273
Aug 2007292
Jul 2007339
Jun 2007392
May 2007242
Apr 2007309
Mar 2007283
Feb 2007188
Jan 2007370
Dec 2006225
Nov 2006160
Oct 2006251
Sep 2006412
Aug 2006450
Jul 2006315
Jun 2006380
May 2006232
Apr 2006458
Mar 2006659
Feb 2006581
Jan 2006592
Dec 2005430
Nov 2005398
Oct 2005304
Sep 2005404
Aug 2005278
Jul 2005342
Jun 2005216
May 2005151
Apr 2005220
Mar 2005167