Mailing list archives: April 2009

Site index · List index
Message list1 · 2 · 3 · Next »Thread · Author · Date
³Âè¡ Two urls cannot fetch Wed, 01 Apr, 08:40
³Âè¡ Re: Two urls cannot fetch Wed, 01 Apr, 09:00
³Âè¡ only fetch home page Wed, 01 Apr, 09:48
³Âè¡ Re: only fetch home page Wed, 01 Apr, 11:43
³Âè¡ Re: only fetch home page Wed, 01 Apr, 12:03
³Âè¡ Re: only fetch home page Wed, 01 Apr, 12:22
³Âè¡ Re: Two urls cannot fetch Wed, 01 Apr, 12:22
³Âè¡ Re: only fetch home page Wed, 01 Apr, 14:16
³Âè¡ Re: only fetch home page Wed, 01 Apr, 14:38
³Âè¡ Re: only fetch home page Wed, 01 Apr, 14:47
³Âè¡ Re: only fetch home page Wed, 01 Apr, 14:54
³Âè¡ Re: only fetch home page Wed, 01 Apr, 15:17
³Âè¡ Re: only fetch home page Wed, 01 Apr, 15:23
³Âè¡ Re: only fetch home page Wed, 01 Apr, 15:30
Jaime Martín nutch 1.0 Tue, 21 Apr, 21:45
Raymond Balmès Problems with custom field query Wed, 15 Apr, 14:47
Raymond Balmès Re: Problems with custom field query Wed, 15 Apr, 16:38
Raymond Balmès Re: Problems with custom field query Sat, 18 Apr, 15:58
Raymond Balmès Query-more problem Sun, 19 Apr, 16:09
Raymond Balmès Re: Query-more problem Sun, 19 Apr, 16:54
Raymond Balmès Re: Query-more problem Mon, 20 Apr, 17:09
Raymond Balmès Re: Problems with custom field query Mon, 20 Apr, 17:16
Raymond Balmès Re: nutch 1.0 Wed, 22 Apr, 08:38
Raymond Balmès Re: run nutch on eclipse problem? Thu, 23 Apr, 08:18
Raymond Balmès Re: Hadoop thread seems to remain alive Thu, 23 Apr, 12:22
Raymond Balmès Re: Hadoop thread seems to remain alive Fri, 24 Apr, 06:51
Raymond Balmès Re: Hadoop thread seems to remain alive Sat, 25 Apr, 09:27
Raymond Balmès Re: Unable to register IndexingFilter extesion plugin - N 0.9 Mon, 27 Apr, 20:58
Raymond Balmès Re: Problem in generating the war file Mon, 27 Apr, 21:03
Raymond Balmès Re: How to get the html that i crawled Mon, 27 Apr, 21:11
Raymond Balmès dual core and crawling Mon, 27 Apr, 21:17
Raymond Balmès Re: Problem in generating the war file Mon, 27 Apr, 22:08
Raymond Balmès Re: dual core and crawling Tue, 28 Apr, 07:24
Raymond Balmès Re: dual core and crawling Tue, 28 Apr, 15:54
Raymond Balmès Re: dual core and crawling Tue, 28 Apr, 16:44
Raymond Balmès Re: dual core and crawling Tue, 28 Apr, 21:57
Raymond Balmès Re: dual core and crawling Wed, 29 Apr, 11:33
Doğacan Güney Re: crawl_parse keeps growing after re-crawling and segment merging Wed, 01 Apr, 09:21
Doğacan Güney Re: Nutch 1.0 experience Wed, 01 Apr, 19:54
Doğacan Güney Re: The Future of Nutch Thu, 02 Apr, 13:06
Alejandro Gonzalez Re: only fetch home page Wed, 01 Apr, 11:17
Alejandro Gonzalez Re: only fetch home page Wed, 01 Apr, 11:58
Alejandro Gonzalez Re: only fetch home page Wed, 01 Apr, 12:01
Alejandro Gonzalez Re: only fetch home page Wed, 01 Apr, 12:09
Alejandro Gonzalez Re: only fetch home page Wed, 01 Apr, 14:05
Alejandro Gonzalez Re: only fetch home page Wed, 01 Apr, 14:35
Alejandro Gonzalez Re: only fetch home page Wed, 01 Apr, 14:40
Alejandro Gonzalez Re: only fetch home page Wed, 01 Apr, 15:02
Alejandro Gonzalez Re: only fetch home page Wed, 01 Apr, 15:26
Alejandro Gonzalez Re: only fetch home page Wed, 01 Apr, 15:47
Alejandro Gonzalez Re: Problem with Crawler and Parent Directories Thu, 02 Apr, 15:35
Alejandro Gonzalez Re: java heap space error Thu, 09 Apr, 15:45
Alejandro Gonzalez Re: run nutch on eclipse problem? Thu, 23 Apr, 10:09
Alex Basa number of fetcher threads per host? Thu, 09 Apr, 14:16
Alex Basa Re: number of fetcher threads per host? Thu, 09 Apr, 15:26
Alex Basa Re: running two crawlers at the same time Tue, 21 Apr, 14:04
Alex Basa Re: dual core and crawling Tue, 28 Apr, 16:00
Alexander Aristov running two crawlers at the same time Tue, 21 Apr, 12:21
Alexander Aristov Re: hi Kubes:the question about develop environment! Wed, 22 Apr, 06:12
Alexander Aristov Re: hi Kubes:the question about develop environment! Wed, 22 Apr, 17:50
Amin Mohammed-Coleman Re: Seattle / PNW Hadoop + Lucene User Group? Sat, 18 Apr, 06:57
Andrzej Bialecki Re: lukeall-0.9.1 to manually add indexes Wed, 01 Apr, 10:19
Andrzej Bialecki Re: lukeall-0.9.1 to manually add indexes Wed, 01 Apr, 21:41
Andrzej Bialecki Re: Nutch can't find all files Wed, 08 Apr, 06:54
Andrzej Bialecki Re: number of fetcher threads per host? Thu, 09 Apr, 17:16
Andrzej Bialecki Re: Hadoop thread seems to remain alive Thu, 23 Apr, 14:35
Andrzej Bialecki Re: Using nutchBean Thu, 23 Apr, 21:32
Andrzej Bialecki Re: Possible bug in when fetching relative links after a redirect - N 1.0 Wed, 29 Apr, 10:15
Ankur Garg Problem crawling BBC Hindi Site Mon, 06 Apr, 06:12
Anshum Re: ebook resources - including lucene in action Tue, 21 Apr, 12:03
Bradford Stephens Seattle / PNW Hadoop + Lucene User Group? Thu, 16 Apr, 22:27
Bradford Stephens Re: Seattle / PNW Hadoop + Lucene User Group? Sat, 18 Apr, 00:08
Bradford Stephens Re: Seattle / PNW Hadoop + Lucene User Group? Sat, 18 Apr, 18:11
Bradford Stephens Re: Seattle / PNW Hadoop + Lucene User Group? Mon, 20 Apr, 23:28
DS jha nutch/hadoop performance and optimal configuration Thu, 02 Apr, 22:39
DS jha Re: nutch/hadoop performance and optimal configuration Fri, 03 Apr, 15:05
DS jha Re: nutch/hadoop performance and optimal configuration Sat, 04 Apr, 14:19
DS jha resubmitting failed reduce task Wed, 08 Apr, 11:11
David M. Cole Re: Can't build Nutch Mon, 20 Apr, 16:31
David M. Cole Re: Nutch Crawling Questions Tue, 21 Apr, 01:05
David M. Cole Re: nutch 1.0 Tue, 21 Apr, 22:25
Dennis Kubes Re: Crawler Output Flat file or Database? Wed, 01 Apr, 00:59
Dennis Kubes Re: What means "Ignoring position" using ArcSegmentCreator? Sat, 04 Apr, 12:01
Dennis Kubes Re: fetcher issues Mon, 13 Apr, 03:44
Dennis Kubes Re: How to index segments after converted from Heritrix ARC-files. Thu, 16 Apr, 21:29
Dennis Kubes Re: Odd results and broken docs when indexing converted ARC-files. Sat, 18 Apr, 04:45
Dennis Kubes Re: fetcher questions Sat, 18 Apr, 04:56
Dennis Kubes Re: Odd results and broken docs when indexing converted ARC-files (-> link to gif). Sat, 18 Apr, 04:58
Dennis Kubes Re: running two crawlers at the same time Tue, 21 Apr, 14:20
Dennis Kubes Re: hi Kubes:the question about develop environment! Wed, 22 Apr, 14:04
Dennis Kubes Re: hi Kubes:the question about develop environment! Wed, 22 Apr, 14:04
Dennis Kubes Re: Hadoop thread seems to remain alive Thu, 23 Apr, 12:55
Dennis Kubes Re: hi Kubes:the question about develop environment! Thu, 23 Apr, 12:59
Dennis Kubes Re: how to restrict search result in defined domains? Thu, 23 Apr, 13:02
Dennis Kubes Re: How to resume crawler after crash Fri, 24 Apr, 04:08
Dennis Kubes Re: URL Scoring Fri, 24 Apr, 12:42
Dennis Kubes Re: dual core and crawling Mon, 27 Apr, 22:24
Dennis Kubes Re: Nutch fetch creates too many http sessions Mon, 27 Apr, 22:28
Dennis Kubes Re: dual core and crawling Tue, 28 Apr, 15:37
Dennis Kubes Re: dual core and crawling Wed, 29 Apr, 03:00
Message list1 · 2 · 3 · Next »Thread · Author · Date
Box list
Nov 200989
Oct 2009258
Sep 2009184
Aug 2009199
Jul 2009312
Jun 2009196
May 2009163
Apr 2009247
Mar 2009408
Feb 2009214
Jan 2009204
Dec 2008229
Nov 2008193
Oct 2008171
Sep 2008269
Aug 2008165
Jul 2008122
Jun 2008243
May 2008220
Apr 2008294
Mar 2008209
Feb 2008191
Jan 2008272
Dec 2007145
Nov 2007228
Oct 2007261
Sep 2007273
Aug 2007292
Jul 2007339
Jun 2007392
May 2007242
Apr 2007309
Mar 2007283
Feb 2007188
Jan 2007370
Dec 2006225
Nov 2006160
Oct 2006251
Sep 2006412
Aug 2006450
Jul 2006315
Jun 2006380
May 2006232
Apr 2006458
Mar 2006659
Feb 2006581
Jan 2006592
Dec 2005430
Nov 2005398
Oct 2005304
Sep 2005404
Aug 2005278
Jul 2005342
Jun 2005216
May 2005151
Apr 2005220
Mar 2005167