nutch-user mailing list archives: March 2013

Site index · List index
Message list1 · 2 · Next »Thread · Author · Date
Re: a lot of threads spinwaiting
feng lu   Re: a lot of threads spinwaiting Fri, 01 Mar, 02:32
Roland     Re: a lot of threads spinwaiting Fri, 01 Mar, 08:48
jc       Re: a lot of threads spinwaiting Fri, 01 Mar, 14:08
Markus Jelsma         RE: a lot of threads spinwaiting Fri, 01 Mar, 14:26
Roland         Re: a lot of threads spinwaiting Fri, 01 Mar, 15:46
jc           Re: a lot of threads spinwaiting Fri, 01 Mar, 19:05
Re: Something for the weekend
feng lu   Re: Something for the weekend Fri, 01 Mar, 04:47
Re: Problem compiling FeedParser plugin with Nutch 2.1 source
Anand Bhagwat   Re: Problem compiling FeedParser plugin with Nutch 2.1 source Fri, 01 Mar, 04:51
Lewis John Mcgibbney     Re: Problem compiling FeedParser plugin with Nutch 2.1 source Fri, 01 Mar, 19:41
Julien Nioche     Re: Problem compiling FeedParser plugin with Nutch 2.1 source Sat, 02 Mar, 08:27
Jorge Luis Betancourt Gonzalez       Re: Problem compiling FeedParser plugin with Nutch 2.1 source Sun, 03 Mar, 04:00
Lewis John Mcgibbney         Re: Problem compiling FeedParser plugin with Nutch 2.1 source Sun, 03 Mar, 04:16
Jorge Luis Betancourt Gonzalez   Re: Problem compiling FeedParser plugin with Nutch 2.1 source Sun, 03 Mar, 04:58
Re: Fetching of URLs from seed list ends up with only a small portion of them indexed by Solr
Amit Sela   Re: Fetching of URLs from seed list ends up with only a small portion of them indexed by Solr Sat, 02 Mar, 00:01
Stefan Scheffler     Re: Fetching of URLs from seed list ends up with only a small portion of them indexed by Solr Sat, 02 Mar, 07:00
Amit Sela       Re: Fetching of URLs from seed list ends up with only a small portion of them indexed by Solr Sat, 02 Mar, 19:32
kiran chitturi Nutch 1.6 : java.lang.OutOfMemoryError: unable to create new native thread Sat, 02 Mar, 19:12
kiran chitturi   Re: Nutch 1.6 : java.lang.OutOfMemoryError: unable to create new native thread Sat, 02 Mar, 19:13
neeraj     Re: Nutch 1.6 : java.lang.OutOfMemoryError: unable to create new native thread Sun, 03 Mar, 20:25
Sebastian Nagel     Re: Nutch 1.6 : java.lang.OutOfMemoryError: unable to create new native thread Sun, 03 Mar, 20:41
kiran chitturi       Re: Nutch 1.6 : java.lang.OutOfMemoryError: unable to create new native thread Sun, 03 Mar, 20:45
Sebastian Nagel         Re: Nutch 1.6 : java.lang.OutOfMemoryError: unable to create new native thread Sun, 03 Mar, 20:56
kiran chitturi           Re: Nutch 1.6 : java.lang.OutOfMemoryError: unable to create new native thread Sun, 03 Mar, 21:04
Tejas Patil             Re: Nutch 1.6 : java.lang.OutOfMemoryError: unable to create new native thread Sun, 03 Mar, 21:19
Markus Jelsma               RE: Nutch 1.6 : java.lang.OutOfMemoryError: unable to create new native thread Sun, 03 Mar, 21:55
kiran chitturi                 Re: Nutch 1.6 : java.lang.OutOfMemoryError: unable to create new native thread Mon, 04 Mar, 05:03
Sebastian Nagel                   Re: Nutch 1.6 : java.lang.OutOfMemoryError: unable to create new native thread Mon, 04 Mar, 20:33
kiran chitturi                     Re: Nutch 1.6 : java.lang.OutOfMemoryError: unable to create new native thread Mon, 04 Mar, 20:45
Tejas Patil                       Re: Nutch 1.6 : java.lang.OutOfMemoryError: unable to create new native thread Tue, 05 Mar, 04:18
kiran chitturi                         Re: Nutch 1.6 : java.lang.OutOfMemoryError: unable to create new native thread Tue, 05 Mar, 16:43
kiran chitturi Nutch 1.6 : Fetcher taking long time to finish after the files are fetched Sat, 02 Mar, 20:16
Amit Sela help with nutch-site configuration Sun, 03 Mar, 17:22
kiran chitturi   Re: help with nutch-site configuration Sun, 03 Mar, 18:21
Re: nutch with cassandra internal network usage
Roland   Re: nutch with cassandra internal network usage Mon, 04 Mar, 07:26
Julien Nioche     Re: nutch with cassandra internal network usage Mon, 04 Mar, 10:05
Roland       Re: nutch with cassandra internal network usage Mon, 04 Mar, 12:42
Re: DiskChecker$DiskErrorException
Alexei Korolev   Re: DiskChecker$DiskErrorException Mon, 04 Mar, 08:48
Sebastian Nagel     Re: DiskChecker$DiskErrorException Mon, 04 Mar, 20:53
Adriana Farina Nutch 2.1 crawling step by step and crawling command differences Mon, 04 Mar, 16:23
kiran chitturi   Re: Nutch 2.1 crawling step by step and crawling command differences Mon, 04 Mar, 17:13
Lewis John Mcgibbney     Re: Nutch 2.1 crawling step by step and crawling command differences Mon, 04 Mar, 17:35
Adriana Farina       Re: Nutch 2.1 crawling step by step and crawling command differences Tue, 05 Mar, 09:06
mar...@Automationdirect.com Parsing error for video wmv files Mon, 04 Mar, 21:29
Tejas Patil   Re: Parsing error for video wmv files Tue, 05 Mar, 04:04
mar...@Automationdirect.com     Re: Parsing error for video wmv files Wed, 06 Mar, 15:51
Tejas Patil       Re: Parsing error for video wmv files Wed, 06 Mar, 17:27
kiran chitturi Nutch 1.6 : How to reparse Nutch segments ? Mon, 04 Mar, 21:33
Lewis John Mcgibbney   Re: Nutch 1.6 : How to reparse Nutch segments ? Mon, 04 Mar, 21:51
kiran chitturi     Re: Nutch 1.6 : How to reparse Nutch segments ? Mon, 04 Mar, 22:25
Lewis John Mcgibbney       Re: Nutch 1.6 : How to reparse Nutch segments ? Mon, 04 Mar, 22:54
kiran chitturi         Re: Nutch 1.6 : How to reparse Nutch segments ? Tue, 05 Mar, 00:20
Tejas Patil           Re: Nutch 1.6 : How to reparse Nutch segments ? Tue, 05 Mar, 03:49
kiran chitturi             Re: Nutch 1.6 : How to reparse Nutch segments ? Tue, 05 Mar, 04:07
Tejas Patil               Re: Nutch 1.6 : How to reparse Nutch segments ? Tue, 05 Mar, 04:15
kiran chitturi                 Re: Nutch 1.6 : How to reparse Nutch segments ? Tue, 05 Mar, 16:27
Re: Found interface org.apache.hadoop.mapreduce.TaskAttemptContext, but class was expected
Tejas Patil   Re: Found interface org.apache.hadoop.mapreduce.TaskAttemptContext, but class was expected Tue, 05 Mar, 04:51
Re: Nutch Incremental Crawl
David Philip   Re: Nutch Incremental Crawl Tue, 05 Mar, 05:28
feng lu     Re: Nutch Incremental Crawl Tue, 05 Mar, 05:55
David Philip       Re: Nutch Incremental Crawl Tue, 05 Mar, 06:49
feng lu         Re: Nutch Incremental Crawl Tue, 05 Mar, 07:02
feng lu           Re: Nutch Incremental Crawl Tue, 05 Mar, 07:24
David Philip             Re: Nutch Incremental Crawl Tue, 05 Mar, 11:48
feng lu               Re: Nutch Incremental Crawl Wed, 06 Mar, 01:59
Raja Kulasekaran Robots.db instead of robots.txt Tue, 05 Mar, 09:29
Tejas Patil   Re: Robots.db instead of robots.txt Tue, 05 Mar, 14:57
Raja Kulasekaran     Re: Robots.db instead of robots.txt Tue, 05 Mar, 15:15
Tejas Patil       Re: Robots.db instead of robots.txt Tue, 05 Mar, 15:47
Amit Sela Understanding fetch MapReduce job counters and logs Tue, 05 Mar, 11:16
Lewis John Mcgibbney   Re: Understanding fetch MapReduce job counters and logs Sat, 16 Mar, 00:03
feng lu     Re: Understanding fetch MapReduce job counters and logs Sat, 16 Mar, 15:12
Amit Sela       Re: Understanding fetch MapReduce job counters and logs Sun, 17 Mar, 11:47
feng lu         Re: Understanding fetch MapReduce job counters and logs Sun, 17 Mar, 15:07
Amit Sela           Re: Understanding fetch MapReduce job counters and logs Sun, 17 Mar, 16:58
raviksingh Continue Nutch Crawling After Exception Tue, 05 Mar, 15:22
Lewis John Mcgibbney   Re: Continue Nutch Crawling After Exception Tue, 05 Mar, 18:59
Anand Bhagwat Rest API for Nutch 2.x Tue, 05 Mar, 15:33
Lewis John Mcgibbney   Re: Rest API for Nutch 2.x Tue, 05 Mar, 19:00
raviksingh Find which URL created exception Tue, 05 Mar, 16:38
kiran chitturi   Re: Find which URL created exception Tue, 05 Mar, 16:45
raviksingh     Re: Find which URL created exception Tue, 05 Mar, 16:52
kiran chitturi       Re: Find which URL created exception Tue, 05 Mar, 16:54
kiran chitturi Parse statistics in Nutch Tue, 05 Mar, 17:37
Lewis John Mcgibbney   Re: Parse statistics in Nutch Tue, 05 Mar, 17:59
kiran chitturi     Re: Parse statistics in Nutch Tue, 05 Mar, 18:03
David Philip recrawl - will it re-fetch and parse all the URLS again? Tue, 05 Mar, 18:33
David Philip   Re: recrawl - will it re-fetch and parse all the URLS again? Tue, 05 Mar, 18:37
Jason S keep all pages from a domain in one slice Tue, 05 Mar, 21:17
Markus Jelsma   RE: keep all pages from a domain in one slice Tue, 05 Mar, 22:02
Stubblefield Jason     Re: keep all pages from a domain in one slice Tue, 05 Mar, 22:28
Lewis John Mcgibbney       Re: keep all pages from a domain in one slice Wed, 06 Mar, 05:18
Stubblefield Jason         Re: keep all pages from a domain in one slice Wed, 06 Mar, 09:34
Lewis John Mcgibbney           Re: keep all pages from a domain in one slice Sat, 09 Mar, 02:37
SUJIT PAL           Re: keep all pages from a domain in one slice Sat, 09 Mar, 15:59
feng lu     Re: keep all pages from a domain in one slice Wed, 06 Mar, 03:09
Ahmet A. Akin How stable is Nutch 2.x as of March 2013? Wed, 06 Mar, 08:36
Tejas Patil   Re: How stable is Nutch 2.x as of March 2013? Wed, 06 Mar, 17:07
Anand Bhagwat How to do a force fetch Wed, 06 Mar, 10:22
Eyeris Rodriguez Rueda   image crawling with nutch Wed, 06 Mar, 15:58
Tejas Patil     Re: image crawling with nutch Wed, 06 Mar, 17:35
Tejas Patil   Re: How to do a force fetch Wed, 06 Mar, 17:18
Anand Bhagwat     Re: How to do a force fetch Thu, 07 Mar, 03:54
Tejas Patil       Re: How to do a force fetch Thu, 07 Mar, 04:33
Anand Bhagwat         Re: How to do a force fetch Thu, 07 Mar, 04:37
Tejas Patil           Re: How to do a force fetch Thu, 07 Mar, 04:51
Anand Bhagwat             Re: How to do a force fetch Thu, 07 Mar, 06:04
mma mapred.FileOutputCommitter - Output path is null in cleanup Wed, 06 Mar, 14:56
Lewis John Mcgibbney   Re: mapred.FileOutputCommitter - Output path is null in cleanup Sat, 09 Mar, 20:22
imehesz Nutch 1.6 from Java via HttpServlet Wed, 06 Mar, 23:18
Lewis John Mcgibbney   Re: Nutch 1.6 from Java via HttpServlet Thu, 07 Mar, 00:15
Re: image crawling with nutch
Eyeris Rodriguez Rueda   Re: image crawling with nutch Thu, 07 Mar, 14:31
Eyeris Rodriguez Rueda   Re: image crawling with nutch Fri, 08 Mar, 16:22
Walter Tietze     Re: image crawling with nutch Fri, 08 Mar, 18:22
Eyeris Rodriguez Rueda   Re: image crawling with nutch Fri, 08 Mar, 19:23
Walter Tietze     Re: image crawling with nutch Fri, 08 Mar, 20:48
mar...@Automationdirect.com Re: Parsing error for video wmv files Fri, 08 Mar, 14:37
Ye T Thet Parse benchmark/performance Fri, 08 Mar, 16:12
kiran chitturi   Re: Parse benchmark/performance Fri, 08 Mar, 16:27
Ye T Thet     Re: Parse benchmark/performance Sat, 09 Mar, 03:09
feng lu       Re: Parse benchmark/performance Sun, 10 Mar, 03:17
Ye T Thet         Re: Parse benchmark/performance Sun, 10 Mar, 08:53
Roland von Herget           Re: Parse benchmark/performance Sun, 10 Mar, 10:31
feng lu             Re: Parse benchmark/performance Sun, 10 Mar, 15:27
Ye T Thet               Re: Parse benchmark/performance Mon, 11 Mar, 13:57
Ye T Thet             Re: Parse benchmark/performance Mon, 11 Mar, 14:03
Roland von Herget               Re: Parse benchmark/performance Mon, 11 Mar, 14:57
kiran chitturi                 Re: Parse benchmark/performance Mon, 11 Mar, 15:11
Ye T Thet                   Re: Parse benchmark/performance Mon, 11 Mar, 16:49
ytthet                     Re: Parse benchmark/performance Sun, 17 Mar, 07:48
kiran chitturi                       Re: Parse benchmark/performance Sun, 17 Mar, 09:40
Julien Nioche           Re: Parse benchmark/performance Mon, 11 Mar, 08:56
Ye T Thet             Re: Parse benchmark/performance Mon, 11 Mar, 14:10
lewis john mcgibbney [ANNOUNCEMENT] Welcome Kiran Chitturi as Apache Nutch PMC and Committer Sat, 09 Mar, 20:56
Tejas Patil   Re: [ANNOUNCEMENT] Welcome Kiran Chitturi as Apache Nutch PMC and Committer Sat, 09 Mar, 21:06
kiran chitturi     Re: [ANNOUNCEMENT] Welcome Kiran Chitturi as Apache Nutch PMC and Committer Sun, 10 Mar, 01:27
Kristopher Kane Session failed during parsing: IOException because of OOM Sun, 10 Mar, 04:22
kiran chitturi   Re: Session failed during parsing: IOException because of OOM Sun, 10 Mar, 04:36
Kristopher Kane     Re: Session failed during parsing: IOException because of OOM Mon, 11 Mar, 02:24
kiran chitturi       Re: Session failed during parsing: IOException because of OOM Mon, 11 Mar, 03:13
Kristopher Kane         Re: Session failed during parsing: IOException because of OOM Mon, 11 Mar, 03:30
kiran chitturi           Re: Session failed during parsing: IOException because of OOM Mon, 11 Mar, 04:20
高睿 How to prevent re-crawling? Sun, 10 Mar, 13:29
feng lu   Re: How to prevent re-crawling? Sun, 10 Mar, 14:36
高睿     Re:Re: How to prevent re-crawling? Sun, 10 Mar, 16:26
feng lu       Re: Re: How to prevent re-crawling? Mon, 11 Mar, 02:13
Rohan Thakur does nutch take care of any format change in the websites that is been crawled Mon, 11 Mar, 09:34
Gora Mohanty   Re: does nutch take care of any format change in the websites that is been crawled Mon, 11 Mar, 11:41
Anand Bhagwat How to identify seed URL for a given record from Webpage Mon, 11 Mar, 10:53
Lewis John Mcgibbney   Re: How to identify seed URL for a given record from Webpage Mon, 11 Mar, 16:20
Anand Bhagwat     Re: How to identify seed URL for a given record from Webpage Tue, 12 Mar, 02:39
Lewis John Mcgibbney       Re: How to identify seed URL for a given record from Webpage Tue, 12 Mar, 03:44
Anand Bhagwat         Re: How to identify seed URL for a given record from Webpage Tue, 12 Mar, 04:44
Lewis John Mcgibbney           Re: How to identify seed URL for a given record from Webpage Wed, 13 Mar, 17:10
Anand Bhagwat             Re: How to identify seed URL for a given record from Webpage Thu, 14 Mar, 04:54
Lewis John Mcgibbney               How to identify seed URL for a given record from Webpage Thu, 14 Mar, 18:22
Ye T Thet Nutch 1.x crawler deployment configuration Mon, 11 Mar, 16:45
Dat Tran Iterative Crawling Tue, 12 Mar, 00:03
kiran chitturi   Re: Iterative Crawling Wed, 13 Mar, 17:21
Dat Tran     Re: Iterative Crawling Thu, 14 Mar, 01:13
Lewis John Mcgibbney       Re: Iterative Crawling Thu, 14 Mar, 04:30
kiran chitturi         Re: Iterative Crawling Thu, 14 Mar, 04:32
Dat Tran   Re: Iterative Crawling Fri, 15 Mar, 02:20
Dat Tran     Re: Iterative Crawling Fri, 15 Mar, 02:36
Tejas Patil       Re: Iterative Crawling Fri, 15 Mar, 04:44
Dat Tran         Re: Iterative Crawling Fri, 15 Mar, 10:08
How to Continue to Crawl with Nutch Even An Error Occurs?
kamaci   How to Continue to Crawl with Nutch Even An Error Occurs? Tue, 12 Mar, 17:44
kiran chitturi     Re: How to Continue to Crawl with Nutch Even An Error Occurs? Thu, 14 Mar, 06:27
kamaci   How to Continue to Crawl with Nutch Even An Error Occurs? Wed, 20 Mar, 22:47
Markus Jelsma     RE: How to Continue to Crawl with Nutch Even An Error Occurs? Wed, 20 Mar, 22:53
kamaci       Re: How to Continue to Crawl with Nutch Even An Error Occurs? Wed, 20 Mar, 22:59
Tejas Patil         Re: How to Continue to Crawl with Nutch Even An Error Occurs? Wed, 20 Mar, 23:32
kamaci           Re: How to Continue to Crawl with Nutch Even An Error Occurs? Wed, 20 Mar, 23:35
Tejas Patil             Re: How to Continue to Crawl with Nutch Even An Error Occurs? Wed, 20 Mar, 23:39
kamaci       Re: How to Continue to Crawl with Nutch Even An Error Occurs? Wed, 20 Mar, 23:32
lewis john mcgibbney [WELCOME] Feng Lu as Apache Nutch PMC and Committer Tue, 12 Mar, 22:43
kiran chitturi   Re: [WELCOME] Feng Lu as Apache Nutch PMC and Committer Sun, 17 Mar, 07:01
feng lu   Re: [WELCOME] Feng Lu as Apache Nutch PMC and Committer Sun, 17 Mar, 14:02
Julien Nioche     Re: [WELCOME] Feng Lu as Apache Nutch PMC and Committer Mon, 18 Mar, 12:22
Markus Jelsma       RE: [WELCOME] Feng Lu as Apache Nutch PMC and Committer Mon, 18 Mar, 22:07
kiran chitturi       Re: [WELCOME] Feng Lu as Apache Nutch PMC and Committer Wed, 20 Mar, 14:40
kiran chitturi Mapping nested json objects to map data type Thu, 14 Mar, 03:36
kiran chitturi   Re: Mapping nested json objects to map data type Thu, 14 Mar, 03:37
David Philip Continue to Crawl even when an Error Occured Thu, 14 Mar, 04:17
feng lu   Re: Continue to Crawl even when an Error Occured Thu, 14 Mar, 04:31
David Philip     Re: Continue to Crawl even when an Error Occured Thu, 14 Mar, 04:49
Message list1 · 2 · Next »Thread · Author · Date
Box list
Aug 201819
Jul 201823
Jun 201835
May 201823
Apr 201825
Mar 2018117
Feb 201845
Jan 201825
Dec 201744
Nov 201779
Oct 201744
Sep 201770
Aug 201787
Jul 201752
Jun 201757
May 201776
Apr 201759
Mar 201752
Feb 201736
Jan 201773
Dec 201660
Nov 201678
Oct 2016144
Sep 201672
Aug 201669
Jul 201692
Jun 201696
May 201683
Apr 201677
Mar 201687
Feb 2016137
Jan 2016106
Dec 201579
Nov 201584
Oct 201583
Sep 201590
Aug 201527
Jul 201568
Jun 201572
May 201593
Apr 2015127
Mar 2015137
Feb 2015158
Jan 2015126
Dec 201487
Nov 201473
Oct 201474
Sep 2014177
Aug 2014108
Jul 2014145
Jun 2014123
May 2014188
Apr 2014127
Mar 2014228
Feb 2014149
Jan 2014109
Dec 2013193
Nov 2013164
Oct 2013207
Sep 201383
Aug 2013251
Jul 2013362
Jun 2013481
May 2013215
Apr 2013219
Mar 2013305
Feb 2013350
Jan 2013279
Dec 2012174
Nov 2012309
Oct 2012314
Sep 2012206
Aug 2012387
Jul 2012336
Jun 2012309
May 2012348
Apr 2012208
Mar 2012235
Feb 2012349
Jan 2012319
Dec 2011319
Nov 2011322
Oct 2011291
Sep 2011305
Aug 2011305
Jul 2011606
Jun 2011283
May 2011159
Apr 2011178
Mar 2011222
Feb 2011241
Jan 2011236
Dec 2010184
Nov 2010266
Oct 2010240
Sep 2010279
Aug 2010230
Jul 2010204
Jun 2010151
May 2010173
Apr 2010194
Mar 2010148
Feb 2010136
Jan 2010193
Dec 2009259
Nov 2009308
Oct 2009258
Sep 2009184
Aug 2009199
Jul 2009312
Jun 2009196
May 2009163
Apr 2009247
Mar 2009408
Feb 2009214
Jan 2009204
Dec 2008249
Nov 2008194
Oct 2008171
Sep 2008269
Aug 2008165
Jul 2008122
Jun 2008243
May 2008220
Apr 2008294
Mar 2008209
Feb 2008194
Jan 2008284
Dec 2007146
Nov 2007233
Oct 2007268
Sep 2007273
Aug 2007301
Jul 2007339
Jun 2007392
May 2007242
Apr 2007309
Mar 2007283
Feb 2007188
Jan 2007370
Dec 2006225
Nov 2006160
Oct 2006251
Sep 2006412
Aug 2006450
Jul 2006315
Jun 2006380
May 2006232
Apr 2006458
Mar 2006659
Feb 2006581
Jan 2006592
Dec 2005430
Nov 2005398
Oct 2005304
Sep 2005404
Aug 2005278
Jul 2005342
Jun 2005216
May 2005151
Apr 2005220
Mar 2005167