nutch-user mailing list archives: January 2006

Site index · List index
Message list1 · 2 · 3 · 4 · 5 · Next »Thread · Author · Date
Nalin Kumar OutOfMemoryError while crawling Fri, 30 Dec, 12:51
Pine Cone Java problem Sun, 01 Jan, 17:14
Gal Nitzan   Re: Java problem Sun, 01 Jan, 18:34
Re: uses of 'io.sort.mb' and ' io.sort.factor' in nutch-default.xml
Piotr Kosiorowski   Re: uses of 'io.sort.mb' and ' io.sort.factor' in nutch-default.xml Mon, 02 Jan, 11:53
K.A.Hussain Ali     Re: uses of 'io.sort.mb' and ' io.sort.factor' in nutch-default.xml Mon, 02 Jan, 13:28
Doug Cutting       Re: uses of 'io.sort.mb' and ' io.sort.factor' in nutch-default.xml Mon, 02 Jan, 17:59
Thomas Sondergaard is the nutch shell script only used for initial crawling Mon, 02 Jan, 13:41
Piotr Kosiorowski   Re: is the nutch shell script only used for initial crawling Mon, 02 Jan, 21:02
Thomas Sondergaard     Re: is the nutch shell script only used for initial crawling Tue, 03 Jan, 08:33
Re: Is any one able to successfully run Distributed Crawl?
Doug Cutting   Re: Is any one able to successfully run Distributed Crawl? Mon, 02 Jan, 19:10
Earl Cahill   Re: Is any one able to successfully run Distributed Crawl? Mon, 02 Jan, 21:39
Gal Nitzan     Re: Is any one able to successfully run Distributed Crawl? Tue, 03 Jan, 08:11
Doug Cutting     Re: Is any one able to successfully run Distributed Crawl? Wed, 04 Jan, 23:40
Pushpesh Kr. Rajwanshi       Re: Is any one able to successfully run Distributed Crawl? Sun, 08 Jan, 15:03
Doug Cutting         Re: Is any one able to successfully run Distributed Crawl? Mon, 09 Jan, 20:14
Pushpesh Kr. Rajwanshi           Re: Is any one able to successfully run Distributed Crawl? Sat, 14 Jan, 20:40
Chetan Sahasrabudhe New index representation in search results Mon, 02 Jan, 19:13
Stefan Groschupf   Re: New index representation in search results Tue, 03 Jan, 16:19
Re: New Tutorial Needed
Doug Cutting   Re: New Tutorial Needed Mon, 02 Jan, 19:45
Lukas Vlcek   Re: New Tutorial Needed Wed, 04 Jan, 06:39
Raghavendra Prabhu     Re: New Tutorial Needed Wed, 04 Jan, 06:54
Re: Can we search based on two fileds?
Nguyen Ngoc Giang   Re: Can we search based on two fileds? Tue, 03 Jan, 02:09
Chih How Bong     Re: Can we search based on two fileds? Wed, 04 Jan, 02:44
Sergio Localization bug in web interface?? Tue, 03 Jan, 05:31
Sergio   Re: Localization bug in web interface?? Tue, 03 Jan, 05:54
Teruhiko Kurosaka   RE: Localization bug in web interface?? Wed, 04 Jan, 18:55
Insurance Squared Inc.     upgrade to version 0.8 Wed, 04 Jan, 13:53
Andrzej Bialecki       Re: upgrade to version 0.8 Thu, 05 Jan, 07:36
Byron Miller Limiting search/crawl to specific language Wed, 04 Jan, 04:46
Byron Miller   Re: Limiting search/crawl to specific language Wed, 04 Jan, 05:24
Ken Krugler     Re: Limiting search/crawl to specific language Wed, 04 Jan, 17:53
ogjunk-nu...@yahoo.com Re: [Nutch-general] Limiting search/crawl to specific language Wed, 04 Jan, 07:11
Gal Nitzan   Fetching in multiple machines setup. Wed, 04 Jan, 10:42
Aled Jones Remove links from index Wed, 04 Jan, 11:50
Andrzej Bialecki   Re: Remove links from index Wed, 04 Jan, 12:09
Aled Jones ATB: Remove links from index Wed, 04 Jan, 13:31
Nguyen Ngoc Giang java.io.IOException: already exists Wed, 04 Jan, 14:58
Piotr Kosiorowski   Re: java.io.IOException: already exists Wed, 04 Jan, 15:24
Byron Miller   Re: java.io.IOException: already exists Wed, 04 Jan, 15:54
Nguyen Ngoc Giang     Re: java.io.IOException: already exists Wed, 04 Jan, 16:33
David Wallace   Re: java.io.IOException: already exists Wed, 04 Jan, 20:30
Goldschmidt, Dave RE: upgrade to version 0.8 Wed, 04 Jan, 19:24
Goldschmidt, Dave Scaling Nutch 0.8 via Map/Reduce Wed, 04 Jan, 20:39
Chirag Chaman   RE: Scaling Nutch 0.8 via Map/Reduce Wed, 04 Jan, 21:00
Goldschmidt, Dave   RE: Scaling Nutch 0.8 via Map/Reduce Wed, 04 Jan, 21:10
Chirag Chaman   RE: Scaling Nutch 0.8 via Map/Reduce Thu, 05 Jan, 02:09
Goldschmidt, Dave   RE: Scaling Nutch 0.8 via Map/Reduce Fri, 06 Jan, 16:08
Chirag Chaman   RE: Scaling Nutch 0.8 via Map/Reduce Sat, 07 Jan, 20:04
Bryan Woliner port :8080 no longer brings up Nutch search page! Wed, 04 Jan, 21:29
Bryan Woliner   Re: port :8080 no longer brings up Nutch search page! Wed, 04 Jan, 21:49
RJ   Re: port :8080 no longer brings up Nutch search page! Wed, 04 Jan, 22:32
Albert Chern 0.7, Trunk, Compatibility Question Wed, 04 Jan, 23:33
Stefan Groschupf   Re: 0.7, Trunk, Compatibility Question Sun, 08 Jan, 14:56
Otis Gospodnetic LanguageIdentifierPlugin and CJK Wed, 04 Jan, 23:41
Jérôme Charron   Re: LanguageIdentifierPlugin and CJK Wed, 04 Jan, 23:54
ogjunk-nu...@yahoo.com     Re: [Nutch-general] Re: LanguageIdentifierPlugin and CJK Thu, 05 Jan, 00:34
Cheolgoo Kang       Re: [Nutch-general] Re: LanguageIdentifierPlugin and CJK Thu, 05 Jan, 08:04
Jérôme Charron         Re: [Nutch-general] Re: LanguageIdentifierPlugin and CJK Wed, 11 Jan, 22:46
Jérôme Charron       Re: [Nutch-general] Re: LanguageIdentifierPlugin and CJK Wed, 11 Jan, 22:34
Sunnyvale Fl impossible situation error: score-edit Wed, 04 Jan, 23:43
Sunnyvale Fl   Re: impossible situation error: score-edit Thu, 05 Jan, 22:25
Arun Kumar Sharma Getting java.io.IOException: Couldn't rename \tmp\nutch\mapred\local\map_n68li2\part-0.out with Nutch 0.8 Thu, 05 Jan, 07:08
Stefan Groschupf   Re: Getting java.io.IOException: Couldn't rename \tmp\nutch\mapred\local\map_n68li2\part-0.out with Nutch 0.8 Thu, 05 Jan, 09:50
Arun Kaundal     Re: Getting java.io.IOException: Couldn't rename \tmp\nutch\mapred\local\map_n68li2\part-0.out with Nutch 0.8 Fri, 06 Jan, 07:30
Arun Kaundal       Re: Getting java.io.IOException: Couldn't rename \tmp\nutch\mapred\local\map_n68li2\part-0.out with Nutch 0.8 Fri, 06 Jan, 13:22
Piotr Kosiorowski         Re: Getting java.io.IOException: Couldn't rename \tmp\nutch\mapred\local\map_n68li2\part-0.out with Nutch 0.8 Fri, 06 Jan, 13:40
Goldschmidt, Dave   RE: Getting java.io.IOException: Couldn't rename \tmp\nutch\mapred\local\map_n68li2\part-0.out with Nutch 0.8 Fri, 06 Jan, 15:34
Boštjan Categories Thu, 05 Jan, 07:38
Stefan Groschupf   Re: Categories Thu, 05 Jan, 09:53
Neal Whitley Clustering with clustering-carrot2 Thu, 05 Jan, 10:32
Stefan Groschupf   Re: Clustering with clustering-carrot2 Thu, 05 Jan, 10:35
Neal Whitley     Re: Clustering with clustering-carrot2 Thu, 05 Jan, 18:43
Dan Segel tech... Thu, 05 Jan, 20:40
Dan Segel please disregard last post.... Thu, 05 Jan, 20:41
Byron Miller   Re: please disregard last post.... Thu, 05 Jan, 21:20
Dan Segel     Re: please disregard last post.... Thu, 05 Jan, 21:22
Stefan Groschupf       Re: please disregard last post.... Fri, 06 Jan, 15:47
Re: Multiple anchors on same site - what's better than making these unique?
Doug Cutting   Re: Multiple anchors on same site - what's better than making these unique? Thu, 05 Jan, 20:44
Raghavendra Prabhu pooling for nutch bean Thu, 05 Jan, 20:56
Byron Miller   Re: pooling for nutch bean Thu, 05 Jan, 21:31
Raghavendra Prabhu     Re: pooling for nutch bean Thu, 05 Jan, 21:38
Stefan Groschupf       Re: pooling for nutch bean Fri, 06 Jan, 15:50
Raghavendra Prabhu         Re: pooling for nutch bean Sun, 08 Jan, 06:35
Stefan Groschupf           Re: pooling for nutch bean Sun, 08 Jan, 14:44
Raghavendra Prabhu             Re: pooling for nutch bean Sun, 08 Jan, 15:30
Stefan Groschupf               Re: pooling for nutch bean Sun, 08 Jan, 15:45
Raghavendra Prabhu                 Re: pooling for nutch bean Sun, 08 Jan, 16:00
Howie Wang   Re: pooling for nutch bean Sun, 08 Jan, 18:27
Re: Does Search Result Show Similar Pages Like Google?
Doug Cutting   Re: Does Search Result Show Similar Pages Like Google? Thu, 05 Jan, 21:01
Raghavendra Prabhu resource pool for nutchbean Thu, 05 Jan, 21:43
Chris Mattmann   Re: resource pool for nutchbean Thu, 05 Jan, 21:51
Raghavendra Prabhu     Re: resource pool for nutchbean Thu, 05 Jan, 21:56
Chris Mattmann   Re: resource pool for nutchbean Fri, 06 Jan, 00:04
Dan Segel Google.com Search Thu, 05 Jan, 22:30
Raghavendra Prabhu   Re: Google.com Search Thu, 05 Jan, 22:37
Dan Segel     Re: Google.com Search Thu, 05 Jan, 22:52
Raghavendra Prabhu       Re: Google.com Search Thu, 05 Jan, 22:55
Matt Zytaruk mapred system dir Fri, 06 Jan, 00:59
Stefan Groschupf   Re: mapred system dir Fri, 06 Jan, 15:55
Matt Zytaruk     Re: mapred system dir Fri, 06 Jan, 16:11
K.A.Hussain Ali Dedup - works on single file Fri, 06 Jan, 14:13
Andrzej Bialecki   Re: Dedup - works on single file Fri, 06 Jan, 14:22
Teruhiko Kurosaka app server requirement Fri, 06 Jan, 18:55
Stefan Groschupf   Re: app server requirement Sun, 08 Jan, 14:54
Ed Whittaker Urgent help requested regarding Nutch obeying <META CONTENT="NOARCHIVE"> instructions Fri, 06 Jan, 23:19
Jérôme Charron   Re: Urgent help requested regarding Nutch obeying <META CONTENT="NOARCHIVE"> instructions Fri, 06 Jan, 23:37
Ed Whittaker     Re: Urgent help requested regarding Nutch obeying <META CONTENT="NOARCHIVE"> instructions Fri, 06 Jan, 23:43
Jérôme Charron       Re: Urgent help requested regarding Nutch obeying <META CONTENT="NOARCHIVE"> instructions Fri, 06 Jan, 23:57
Chris Schneider Appropriate MapReduce Hardware Fri, 06 Jan, 23:37
Stefan Groschupf   Re: Appropriate MapReduce Hardware Sun, 08 Jan, 14:53
Doug Cutting   Re: Appropriate MapReduce Hardware Mon, 09 Jan, 19:38
Raghavendra Prabhu nutch task tracker help Sat, 07 Jan, 17:22
Stefan Groschupf   Re: nutch task tracker help Sun, 08 Jan, 14:47
Raghavendra Prabhu mapred setup Sat, 07 Jan, 17:30
Stefan Groschupf   Re: mapred setup Sat, 07 Jan, 17:33
Raghavendra Prabhu     Re: mapred setup Sat, 07 Jan, 17:37
Raghavendra Prabhu       mapred setup Sat, 07 Jan, 17:43
Stefan Groschupf       Re: mapred setup Sat, 07 Jan, 18:36
Raghavendra Prabhu file sytem content is also saved Sat, 07 Jan, 18:28
Stefan Groschupf   Re: file sytem content is also saved Sun, 08 Jan, 14:46
Raghavendra Prabhu     Re: file sytem content is also saved Sun, 08 Jan, 15:23
Thomas Delnoij MD5Hash Sat, 07 Jan, 21:14
Stefan Groschupf   Re: MD5Hash Sat, 07 Jan, 21:23
Gal Nitzan     small problem? Sat, 07 Jan, 22:42
Gal Nitzan       Re: small problem? IGNORE Sun, 08 Jan, 00:19
Thomas Delnoij     Re: MD5Hash Sun, 15 Jan, 19:20
Thomas Delnoij       Re: MD5Hash Wed, 18 Jan, 12:21
Stefan Groschupf         Re: MD5Hash Mon, 23 Jan, 20:43
Jack Tang   Re: MD5Hash Thu, 19 Jan, 06:24
Thomas Delnoij     Re: MD5Hash Fri, 20 Jan, 13:52
Byron Miller JSON output Sun, 08 Jan, 01:22
Message list1 · 2 · 3 · 4 · 5 · Next »Thread · Author · Date
Box list
Jul 2014109
Jun 2014123
May 2014188
Apr 2014127
Mar 2014228
Feb 2014149
Jan 2014109
Dec 2013193
Nov 2013164
Oct 2013207
Sep 201383
Aug 2013251
Jul 2013362
Jun 2013481
May 2013215
Apr 2013219
Mar 2013305
Feb 2013350
Jan 2013279
Dec 2012174
Nov 2012309
Oct 2012314
Sep 2012206
Aug 2012387
Jul 2012336
Jun 2012309
May 2012348
Apr 2012208
Mar 2012235
Feb 2012349
Jan 2012319
Dec 2011319
Nov 2011322
Oct 2011291
Sep 2011305
Aug 2011305
Jul 2011606
Jun 2011283
May 2011159
Apr 2011178
Mar 2011222
Feb 2011241
Jan 2011236
Dec 2010184
Nov 2010266
Oct 2010240
Sep 2010279
Aug 2010230
Jul 2010204
Jun 2010151
May 2010173
Apr 2010194
Mar 2010148
Feb 2010136
Jan 2010193
Dec 2009259
Nov 2009308
Oct 2009258
Sep 2009184
Aug 2009199
Jul 2009312
Jun 2009196
May 2009163
Apr 2009247
Mar 2009408
Feb 2009214
Jan 2009204
Dec 2008249
Nov 2008194
Oct 2008171
Sep 2008269
Aug 2008165
Jul 2008122
Jun 2008243
May 2008220
Apr 2008294
Mar 2008209
Feb 2008194
Jan 2008284
Dec 2007146
Nov 2007233
Oct 2007268
Sep 2007273
Aug 2007301
Jul 2007339
Jun 2007392
May 2007242
Apr 2007309
Mar 2007283
Feb 2007188
Jan 2007370
Dec 2006225
Nov 2006160
Oct 2006251
Sep 2006412
Aug 2006450
Jul 2006315
Jun 2006380
May 2006232
Apr 2006458
Mar 2006659
Feb 2006581
Jan 2006592
Dec 2005430
Nov 2005398
Oct 2005304
Sep 2005404
Aug 2005278
Jul 2005342
Jun 2005216
May 2005151
Apr 2005220
Mar 2005167