Mailing list archives: September 2008

Edward Quick search Tue, 16 Sep, 16:30
Kevin MacDonald Possible Crawling bug Tue, 16 Sep, 21:10
Andrzej Bialecki   Re: Possible Crawling bug Thu, 18 Sep, 21:33
Kevin MacDonald     Re: Possible Crawling bug Thu, 18 Sep, 22:13
Andrzej Bialecki       Re: Possible Crawling bug Thu, 18 Sep, 23:01
Kevin MacDonald         Re: Possible Crawling bug Fri, 19 Sep, 03:44
Andrzej Bialecki           Re: Possible Crawling bug Fri, 19 Sep, 09:27
Kevin MacDonald             Re: Possible Crawling bug Fri, 19 Sep, 16:00
salah Elabidi Recrawling Wed, 17 Sep, 09:23
salah Elabidi Recrawling script Wed, 17 Sep, 10:32
salah Elabidi Recrawl script Wed, 17 Sep, 10:39
Edward Quick how much space required? Wed, 17 Sep, 13:30
Kevin MacDonald   Re: how much space required? Wed, 17 Sep, 16:13
Edward Quick     RE: how much space required? Thu, 18 Sep, 07:47
Fwd: Fw: Very Urgent..
Srinivas Gokavarapu   Fwd: Fw: Very Urgent.. Thu, 18 Sep, 05:59
David Jashi Dedup Thu, 18 Sep, 11:41
Andrzej Bialecki   Re: Dedup Thu, 18 Sep, 15:18
r...@vshift.com   Re: Dedup Thu, 18 Sep, 15:43
Tristan Buckner     Re: Dedup Thu, 18 Sep, 21:33
Andrzej Bialecki     Re: Dedup Thu, 18 Sep, 21:35
David Jashi       Re: Dedup Fri, 19 Sep, 06:40
Andrzej Bialecki         Re: Dedup Fri, 19 Sep, 09:30
Edward Quick java.lang.OutOfMemoryError: Java heap space Thu, 18 Sep, 13:19
Doğacan Güney   Re: java.lang.OutOfMemoryError: Java heap space Thu, 18 Sep, 13:30
Edward Quick     RE: java.lang.OutOfMemoryError: Java heap space Thu, 18 Sep, 14:21
Doğacan Güney       Re: java.lang.OutOfMemoryError: Java heap space Thu, 18 Sep, 15:35
Edward Quick running fetches in hadoop Thu, 18 Sep, 14:23
Doğacan Güney   Re: running fetches in hadoop Thu, 18 Sep, 15:34
Edward Quick     RE: running fetches in hadoop Thu, 18 Sep, 16:37
Doğacan Güney       Re: running fetches in hadoop Thu, 18 Sep, 17:13
Edward Quick         RE: running fetches in hadoop Thu, 18 Sep, 19:36
Edward Quick         RE: running fetches in hadoop Fri, 19 Sep, 10:32
Doğacan Güney           Re: running fetches in hadoop Fri, 19 Sep, 10:50
Edward Quick             RE: running fetches in hadoop Fri, 19 Sep, 11:05
Andrzej Bialecki             Re: running fetches in hadoop Fri, 19 Sep, 11:42
Edward Quick               RE: running fetches in hadoop Fri, 19 Sep, 12:47
Edward Quick               RE: running fetches in hadoop Fri, 19 Sep, 19:12
Andrzej Bialecki                 Re: running fetches in hadoop Fri, 19 Sep, 21:06
Edward Quick                   RE: running fetches in hadoop Sat, 20 Sep, 11:11
Edward Quick RegexURLNormalizer warnings Thu, 18 Sep, 14:35
Doğacan Güney   Re: RegexURLNormalizer warnings Thu, 18 Sep, 15:33
Arun Kamal where to find the location of rss feed Sat, 20 Sep, 04:37
David Jashi   Re: where to find the location of rss feed Sat, 20 Sep, 06:04
Alexander Dick Re: Re: Display the description Sat, 20 Sep, 11:38
vishal vachhani Duplicate pages in result of queries Sun, 21 Sep, 16:54
nutch_newbie Nutch and its Growing Capabilities Sun, 21 Sep, 19:05
Kevin MacDonald   Re: Nutch and its Growing Capabilities Mon, 22 Sep, 00:21
toabhishek16 Error in hadoop crawling Mon, 22 Sep, 08:13
Alexander Dick   AW: Error in hadoop crawling Mon, 22 Sep, 08:37
Venkateshprasanna Recreating crawled documents out of Nutch indexes/segments Mon, 22 Sep, 10:54
Kevin MacDonald Possible bug involving redirects Mon, 22 Sep, 21:38
Kevin MacDonald   Re: Possible bug involving redirects Mon, 22 Sep, 22:44
Sjaiful Bahri crawl web content without tag Tue, 23 Sep, 02:37
Julien Nioche Access external resource in plugin Tue, 23 Sep, 11:31
Julien Nioche   Re: Access external resource in plugin Tue, 23 Sep, 13:41
Andrzej Bialecki     Re: Access external resource in plugin Tue, 23 Sep, 14:37
Julien Nioche       Re: Access external resource in plugin Tue, 23 Sep, 15:05
Edward Quick benchmarking Tue, 23 Sep, 11:54
Kevin MacDonald   Re: benchmarking Tue, 23 Sep, 17:14
Kevin MacDonald     Re: benchmarking Tue, 23 Sep, 17:51
Doğacan Güney       Re: benchmarking Tue, 23 Sep, 19:54
Kevin MacDonald         Re: benchmarking Tue, 23 Sep, 20:57
Edward Quick           RE: benchmarking Wed, 24 Sep, 15:35
kevin chen             RE: benchmarking Fri, 26 Sep, 01:01
Edward Quick               RE: benchmarking Fri, 26 Sep, 07:55
Kevin MacDonald De-activating Normalizers Tue, 23 Sep, 19:02
Doğacan Güney   Re: De-activating Normalizers Tue, 23 Sep, 19:48
Kevin MacDonald BasicURLNormalizer problem Tue, 23 Sep, 19:25
Guilherme Menezes Cluster size question Tue, 23 Sep, 21:33
Guilherme Menezes   Re: Cluster size question Tue, 23 Sep, 21:39
Henrik Jönsson Problem with fetcher Wed, 24 Sep, 12:00
Kevin MacDonald   Re: Problem with fetcher Wed, 24 Sep, 16:23
Edward Quick did you mean? Wed, 24 Sep, 13:25
Otis Gospodnetic   Re: did you mean? Wed, 24 Sep, 18:19
Edward Quick keyword match Wed, 24 Sep, 13:36
Otis Gospodnetic   Re: keyword match Wed, 24 Sep, 18:18
Doğacan Güney   Re: keyword match Wed, 24 Sep, 19:40
Edward Quick     RE: keyword match Wed, 24 Sep, 21:05
Nutch How to add a field on nutch database Wed, 24 Sep, 16:25
Wilson Melo Searching error Wed, 24 Sep, 19:24
Koch Martina IOException when Crawling Thu, 25 Sep, 09:30
Edward Quick   RE: IOException when Crawling Thu, 25 Sep, 11:30
Dennis Kubes   Re: IOException when Crawling Thu, 25 Sep, 14:03
Edward Quick pages with duplicate content in search results Thu, 25 Sep, 11:29
Dennis Kubes   Re: pages with duplicate content in search results Thu, 25 Sep, 12:42
vishal vachhani     Re: pages with duplicate content in search results Thu, 25 Sep, 15:40
Dennis Kubes       Re: pages with duplicate content in search results Thu, 25 Sep, 15:56
vishal vachhani         Re: pages with duplicate content in search results Thu, 25 Sep, 16:25
Edward Quick       RE: pages with duplicate content in search results Thu, 25 Sep, 16:35
Edward Quick         RE: pages with duplicate content in search results Thu, 25 Sep, 16:57
Andrzej Bialecki     Re: pages with duplicate content in search results Thu, 25 Sep, 20:10
Edward Quick       RE: pages with duplicate content in search results Thu, 25 Sep, 21:45
Andrzej Bialecki         Re: pages with duplicate content in search results Thu, 25 Sep, 21:53
David Jashi     Re: pages with duplicate content in search results Fri, 26 Sep, 05:53
Manu Warikoo FW: Indexing Files on Local File System Thu, 25 Sep, 18:12
Srinivas Gokavarapu   Re: FW: Indexing Files on Local File System Thu, 25 Sep, 19:49
Manu Warikoo     RE: Indexing Files on Local File System Thu, 25 Sep, 20:53
Kevin MacDonald       Re: Indexing Files on Local File System Thu, 25 Sep, 21:54
Srinivas Gokavarapu         Re: Indexing Files on Local File System Fri, 26 Sep, 05:18
Sjaiful Bahri www.zipclue.com (News Search Engine) Fri, 26 Sep, 07:33
Edward Quick indexing url without parsed content Fri, 26 Sep, 14:00
Edward Quick updatedb says URL normalizing and filtering are set to false Fri, 26 Sep, 14:04
Doğacan Güney   Re: updatedb says URL normalizing and filtering are set to false Sun, 28 Sep, 20:06
Edward Quick     RE: updatedb says URL normalizing and filtering are set to false Sun, 28 Sep, 20:34
Chris Hostetter ANNOUNCE: Application Period Opens for Travel Assistance to ApacheCon US 2008 Fri, 26 Sep, 17:25
Martin Xu Who can share the "nutch admin gui" file Sat, 27 Sep, 01:54
Chetan Patel crawl xml url using nutch-0.9 Sat, 27 Sep, 08:30
Edward Quick   RE: crawl xml url using nutch-0.9 Sat, 27 Sep, 08:55
Chetan Patel     RE: crawl xml url using nutch-0.9 Sat, 27 Sep, 09:41
Chetan Patel       RE: crawl xml url using nutch-0.9 Sat, 27 Sep, 10:44
Edward Quick         RE: crawl xml url using nutch-0.9 Sat, 27 Sep, 11:59
Webmaster           RE: crawl xml url using nutch-0.9 Sat, 27 Sep, 23:05
Webmaster             Stable versions Sun, 28 Sep, 03:04
David Grandinetti         Re: crawl xml url using nutch-0.9 Sun, 28 Sep, 00:06
Chetan Patel           Re: crawl xml url using nutch-0.9 Mon, 29 Sep, 10:09
Javier Puerto Dublin Core parser Mon, 29 Sep, 08:11
daut encoding Mon, 29 Sep, 09:04
David Jashi   Re: encoding Mon, 29 Sep, 09:11
daut     Re: encoding Mon, 29 Sep, 10:27
David Jashi       Re: encoding Mon, 29 Sep, 10:48