nutch-dev mailing list archives: June 2007

Site index · List index
Message list1 · 2 · 3 · Next »Thread · Author · Date
Nicolás Lichtmaier [PATCH] Moving HitDetails construction to a HitDetails constructor (v2). Fri, 01 Jun, 20:38
Nicolás Lichtmaier Re: [PATCH] Moving HitDetails construction to a HitDetails constructor (v2). Sun, 03 Jun, 19:36
Doğacan Güney Re: Plugins and Thread Safety Mon, 04 Jun, 15:43
Doğacan Güney Re: [Fwd: Nutch 0.9 and Crawl-Delay] Tue, 05 Jun, 05:59
Doğacan Güney Re: [jira] Commented: (NUTCH-496) ConcurrentModificationException can be thrown when getSorted() is called. Tue, 05 Jun, 07:29
Doğacan Güney Re: [jira] Commented: (NUTCH-496) ConcurrentModificationException can be thrown when getSorted() is called. Tue, 05 Jun, 11:54
Doğacan Güney Re: Plugins initialized all the time! Fri, 08 Jun, 15:30
Doğacan Güney Re: Welcome Doğacan as Nutch committer Tue, 12 Jun, 08:04
Doğacan Güney upgrade to hadoop-0.13? Mon, 18 Jun, 08:20
Doğacan Güney Re: upgrade to hadoop-0.13? Mon, 18 Jun, 12:07
Doğacan Güney Re: Build failed in Hudson: Nutch-Nightly #123 Wed, 20 Jun, 07:07
Doğacan Güney Re: Build failed in Hudson: Nutch-Nightly #123 Wed, 20 Jun, 13:04
Doğacan Güney Re: Build failed in Hudson: Nutch-Nightly #123 Wed, 20 Jun, 14:17
Doğacan Güney Re: Build failed in Hudson: Nutch-Nightly #123 Wed, 20 Jun, 14:19
Doğacan Güney Re: Build failed in Hudson: Nutch-Nightly #123 Wed, 20 Jun, 15:17
Doğacan Güney Re: Found the bug in Generator when number of URLs is small Thu, 21 Jun, 07:03
Doğacan Güney Re: NUTCH-119 :: how hard to fix Wed, 27 Jun, 05:56
Doğacan Güney JIRA email question Wed, 27 Jun, 07:02
Doğacan Güney Re: NUTCH-119 :: how hard to fix Thu, 28 Jun, 06:51
Doğacan Güney Re: [jira] Commented: (NUTCH-474) Fetcher2 sets server-delay and blocking checks incorrectly Thu, 28 Jun, 07:35
Doğacan Güney (JIRA) [jira] Commented: (NUTCH-392) OutputFormat implementations should pass on Progressable Fri, 01 Jun, 07:53
Doğacan Güney (JIRA) [jira] Commented: (NUTCH-392) OutputFormat implementations should pass on Progressable Fri, 01 Jun, 10:54
Doğacan Güney (JIRA) [jira] Commented: (NUTCH-392) OutputFormat implementations should pass on Progressable Sat, 02 Jun, 10:09
Doğacan Güney (JIRA) [jira] Commented: (NUTCH-466) Flexible segment format Wed, 06 Jun, 12:30
Doğacan Güney (JIRA) [jira] Issue Comment Edited: (NUTCH-466) Flexible segment format Wed, 06 Jun, 13:08
Doğacan Güney (JIRA) [jira] Commented: (NUTCH-356) Plugin repository cache can lead to memory leak Fri, 08 Jun, 15:37
Doğacan Güney (JIRA) [jira] Commented: (NUTCH-498) Use Combiner in LinkDb to increase speed of linkdb generation Fri, 15 Jun, 12:26
Doğacan Güney (JIRA) [jira] Commented: (NUTCH-498) Use Combiner in LinkDb to increase speed of linkdb generation Fri, 15 Jun, 14:24
Doğacan Güney (JIRA) [jira] Commented: (NUTCH-443) allow parsers to return multiple Parse object, this will speed up the rss parser Sat, 16 Jun, 09:36
Doğacan Güney (JIRA) [jira] Resolved: (NUTCH-495) Unnecessary delays in Fetcher2 Sat, 16 Jun, 10:36
Doğacan Güney (JIRA) [jira] Created: (NUTCH-499) Refactor LinkDb and LinkDbMerger to reuse code Sat, 16 Jun, 11:01
Doğacan Güney (JIRA) [jira] Commented: (NUTCH-498) Use Combiner in LinkDb to increase speed of linkdb generation Sat, 16 Jun, 11:03
Doğacan Güney (JIRA) [jira] Updated: (NUTCH-499) Refactor LinkDb and LinkDbMerger to reuse code Sat, 16 Jun, 11:07
Doğacan Güney (JIRA) [jira] Assigned: (NUTCH-485) Change HtmlParseFilter 's to return ParseResult object instead of Parse object Sat, 16 Jun, 11:15
Doğacan Güney (JIRA) [jira] Commented: (NUTCH-485) Change HtmlParseFilter 's to return ParseResult object instead of Parse object Sat, 16 Jun, 11:15
Doğacan Güney (JIRA) [jira] Updated: (NUTCH-443) allow parsers to return multiple Parse object, this will speed up the rss parser Sun, 17 Jun, 09:04
Doğacan Güney (JIRA) [jira] Closed: (NUTCH-270) Apply just the applicable portions of the patch to protocol.httpclient.Http.java Sun, 17 Jun, 09:09
Doğacan Güney (JIRA) [jira] Closed: (NUTCH-476) Would like to add a field to the document class for its MD5 signature Sun, 17 Jun, 09:18
Doğacan Güney (JIRA) [jira] Resolved: (NUTCH-485) Change HtmlParseFilter 's to return ParseResult object instead of Parse object Sun, 17 Jun, 20:29
Doğacan Güney (JIRA) [jira] Commented: (NUTCH-444) Possibly use a different library to parse RSS feed for improved performance and compatibility Mon, 18 Jun, 06:50
Doğacan Güney (JIRA) [jira] Closed: (NUTCH-492) java.lang.OutOfMemoryError while indexing. Mon, 18 Jun, 08:57
Doğacan Güney (JIRA) [jira] Closed: (NUTCH-493) contentType parse not correctly,,,,got empty content using readseg -get Mon, 18 Jun, 09:01
Doğacan Güney (JIRA) [jira] Created: (NUTCH-501) implementing a different caching mechanism for objects Mon, 18 Jun, 12:04
Doğacan Güney (JIRA) [jira] Updated: (NUTCH-501) implementing a different caching mechanism for objects Mon, 18 Jun, 12:07
Doğacan Güney (JIRA) [jira] Updated: (NUTCH-501) Implement a different caching mechanism for objects cached in configuration Mon, 18 Jun, 13:35
Doğacan Güney (JIRA) [jira] Commented: (NUTCH-501) Implement a different caching mechanism for objects cached in configuration Mon, 18 Jun, 13:52
Doğacan Güney (JIRA) [jira] Resolved: (NUTCH-489) URLFilter-suffix management of the url path when the url contains some query parameters Mon, 18 Jun, 18:15
Doğacan Güney (JIRA) [jira] Created: (NUTCH-502) Bug in SegmentReader causes infinite loop Tue, 19 Jun, 06:01
Doğacan Güney (JIRA) [jira] Updated: (NUTCH-502) Bug in SegmentReader causes infinite loop Tue, 19 Jun, 06:03
Doğacan Güney (JIRA) [jira] Commented: (NUTCH-444) Possibly use a different library to parse RSS feed for improved performance and compatibility Tue, 19 Jun, 07:22
Doğacan Güney (JIRA) [jira] Resolved: (NUTCH-502) Bug in SegmentReader causes infinite loop Tue, 19 Jun, 09:22
Doğacan Güney (JIRA) [jira] Commented: (NUTCH-444) Possibly use a different library to parse RSS feed for improved performance and compatibility Tue, 19 Jun, 14:27
Doğacan Güney (JIRA) [jira] Commented: (NUTCH-497) Extreme Nested Tags causes StackOverflowException in DomContentUtils...Spider Trap Wed, 20 Jun, 18:09
Doğacan Güney (JIRA) [jira] Commented: (NUTCH-444) Possibly use a different library to parse RSS feed for improved performance and compatibility Wed, 20 Jun, 18:45
Doğacan Güney (JIRA) [jira] Commented: (NUTCH-471) Fix synchronization in NutchBean creation Thu, 21 Jun, 12:29
Doğacan Güney (JIRA) [jira] Resolved: (NUTCH-471) Fix synchronization in NutchBean creation Thu, 21 Jun, 15:18
Doğacan Güney (JIRA) [jira] Created: (NUTCH-504) NUTCH-443 broke parsing during fetching Fri, 22 Jun, 08:30
Doğacan Güney (JIRA) [jira] Updated: (NUTCH-504) NUTCH-443 broke parsing during fetching Fri, 22 Jun, 08:32
Doğacan Güney (JIRA) [jira] Commented: (NUTCH-504) NUTCH-443 broke parsing during fetching Fri, 22 Jun, 08:34
Doğacan Güney (JIRA) [jira] Commented: (NUTCH-465) I download nutch 0.9 used tar zxvf nutch-0.9.tar.gz at last A lone zero block Fri, 22 Jun, 08:44
Doğacan Güney (JIRA) [jira] Commented: (NUTCH-503) Generator exits incorrectly for small fetchlists Fri, 22 Jun, 08:51
Doğacan Güney (JIRA) [jira] Issue Comment Edited: (NUTCH-503) Generator exits incorrectly for small fetchlists Fri, 22 Jun, 08:59
Doğacan Güney (JIRA) [jira] Updated: (NUTCH-504) NUTCH-443 broke parsing during fetching Fri, 22 Jun, 12:24
Doğacan Güney (JIRA) [jira] Commented: (NUTCH-468) Scoring filter should distribute score to all outlinks at once Fri, 22 Jun, 14:30
Doğacan Güney (JIRA) [jira] Commented: (NUTCH-503) Generator exits incorrectly for small fetchlists Fri, 22 Jun, 22:39
Doğacan Güney (JIRA) [jira] Commented: (NUTCH-25) needs 'character encoding' detector Sat, 23 Jun, 11:06
Doğacan Güney (JIRA) [jira] Updated: (NUTCH-501) Implement a different caching mechanism for objects cached in configuration Sat, 23 Jun, 13:08
Doğacan Güney (JIRA) [jira] Commented: (NUTCH-501) Implement a different caching mechanism for objects cached in configuration Sat, 23 Jun, 19:12
Doğacan Güney (JIRA) [jira] Created: (NUTCH-505) Outlink urls should be validated Sat, 23 Jun, 20:15
Doğacan Güney (JIRA) [jira] Updated: (NUTCH-505) Outlink urls should be validated Sat, 23 Jun, 20:21
Doğacan Güney (JIRA) [jira] Updated: (NUTCH-501) Implement a different caching mechanism for objects cached in configuration Sat, 23 Jun, 20:45
Doğacan Güney (JIRA) [jira] Resolved: (NUTCH-468) Scoring filter should distribute score to all outlinks at once Sun, 24 Jun, 09:30
Doğacan Güney (JIRA) [jira] Resolved: (NUTCH-504) NUTCH-443 broke parsing during fetching Sun, 24 Jun, 10:05
Doğacan Güney (JIRA) [jira] Updated: (NUTCH-505) Outlink urls should be validated Sun, 24 Jun, 13:40
Doğacan Güney (JIRA) [jira] Updated: (NUTCH-356) Plugin repository cache can lead to memory leak Sun, 24 Jun, 19:05
Doğacan Güney (JIRA) [jira] Commented: (NUTCH-505) Outlink urls should be validated Mon, 25 Jun, 08:09
Doğacan Güney (JIRA) [jira] Commented: (NUTCH-499) Refactor LinkDb and LinkDbMerger to reuse code Tue, 26 Jun, 12:48
Doğacan Güney (JIRA) [jira] Updated: (NUTCH-434) Replace usage of ObjectWritable with something based on GenericWritable Tue, 26 Jun, 13:26
Doğacan Güney (JIRA) [jira] Commented: (NUTCH-434) Replace usage of ObjectWritable with something based on GenericWritable Tue, 26 Jun, 16:44
Doğacan Güney (JIRA) [jira] Updated: (NUTCH-434) Replace usage of ObjectWritable with something based on GenericWritable Tue, 26 Jun, 17:25
Doğacan Güney (JIRA) [jira] Commented: (NUTCH-289) CrawlDatum should store IP address Wed, 27 Jun, 06:39
Doğacan Güney (JIRA) [jira] Closed: (NUTCH-434) Replace usage of ObjectWritable with something based on GenericWritable Wed, 27 Jun, 07:07
Doğacan Güney (JIRA) [jira] Resolved: (NUTCH-434) Replace usage of ObjectWritable with something based on GenericWritable Wed, 27 Jun, 07:07
Doğacan Güney (JIRA) [jira] Resolved: (NUTCH-499) Refactor LinkDb and LinkDbMerger to reuse code Wed, 27 Jun, 08:40
Doğacan Güney (JIRA) [jira] Closed: (NUTCH-499) Refactor LinkDb and LinkDbMerger to reuse code Wed, 27 Jun, 08:40
Doğacan Güney (JIRA) [jira] Commented: (NUTCH-498) Use Combiner in LinkDb to increase speed of linkdb generation Wed, 27 Jun, 11:00
Doğacan Güney (JIRA) [jira] Resolved: (NUTCH-498) Use Combiner in LinkDb to increase speed of linkdb generation Wed, 27 Jun, 12:47
Doğacan Güney (JIRA) [jira] Closed: (NUTCH-498) Use Combiner in LinkDb to increase speed of linkdb generation Wed, 27 Jun, 12:47
Doğacan Güney (JIRA) [jira] Commented: (NUTCH-392) OutputFormat implementations should pass on Progressable Thu, 28 Jun, 12:18
Doğacan Güney (JIRA) [jira] Commented: (NUTCH-392) OutputFormat implementations should pass on Progressable Thu, 28 Jun, 12:46
Doğacan Güney (JIRA) [jira] Commented: (NUTCH-392) OutputFormat implementations should pass on Progressable Thu, 28 Jun, 13:04
Doğacan Güney (JIRA) [jira] Updated: (NUTCH-392) OutputFormat implementations should pass on Progressable Thu, 28 Jun, 15:59
Doğacan Güney (JIRA) [jira] Commented: (NUTCH-392) OutputFormat implementations should pass on Progressable Thu, 28 Jun, 15:59
Doğacan Güney (JIRA) [jira] Commented: (NUTCH-503) Generator exits incorrectly for small fetchlists Fri, 29 Jun, 08:49
Doğacan Güney (JIRA) [jira] Created: (NUTCH-506) Nutch should delegate compression to Hadoop Fri, 29 Jun, 12:46
Doğacan Güney (JIRA) [jira] Updated: (NUTCH-506) Nutch should delegate compression to Hadoop Fri, 29 Jun, 12:48
Doğacan Güney (JIRA) [jira] Issue Comment Edited: (NUTCH-506) Nutch should delegate compression to Hadoop Fri, 29 Jun, 12:51
Nicolás Lichtmaier (JIRA) [jira] Commented: (NUTCH-501) Implement a different caching mechanism for objects cached in configuration Sat, 23 Jun, 16:35
Andrzej Bialecki Re: Plugins and Thread Safety Fri, 01 Jun, 16:46
Andrzej Bialecki Re: Plugins and Thread Safety Fri, 01 Jun, 19:11
Message list1 · 2 · 3 · Next »Thread · Author · Date
Box list
Jul 2015137
Jun 2015446
May 2015319
Apr 2015463
Mar 2015384
Feb 2015530
Jan 2015258
Dec 2014162
Nov 2014165
Oct 2014249
Sep 2014376
Aug 2014136
Jul 2014219
Jun 2014355
May 2014378
Apr 2014332
Mar 2014248
Feb 2014168
Jan 2014471
Dec 2013186
Nov 2013177
Oct 2013182
Sep 2013158
Aug 2013182
Jul 2013240
Jun 2013321
May 2013288
Apr 2013437
Mar 2013521
Feb 2013201
Jan 2013560
Dec 2012176
Nov 2012251
Oct 2012200
Sep 2012219
Aug 2012230
Jul 2012301
Jun 2012391
May 2012317
Apr 2012352
Mar 2012297
Feb 2012395
Jan 2012298
Dec 2011318
Nov 2011524
Oct 2011483
Sep 2011605
Aug 2011528
Jul 2011635
Jun 2011418
May 2011176
Apr 2011453
Mar 2011139
Feb 201162
Jan 2011150
Dec 2010100
Nov 201096
Oct 2010177
Sep 2010143
Aug 2010289
Jul 2010364
Jun 2010246
May 201075
Apr 2010124
Mar 2010183
Feb 2010134
Jan 2010106
Dec 200998
Nov 2009154
Oct 200988
Sep 200932
Aug 200982
Jul 200977
Jun 200994
May 2009104
Apr 200985
Mar 2009255
Feb 2009250
Jan 2009197
Dec 2008158
Nov 2008117
Oct 200884
Sep 2008101
Aug 200858
Jul 200832
Jun 200893
May 200857
Apr 200878
Mar 2008152
Feb 2008190
Jan 2008155
Dec 200768
Nov 2007188
Oct 2007179
Sep 2007189
Aug 2007135
Jul 2007283
Jun 2007241
May 2007188
Apr 2007144
Mar 2007282
Feb 2007241
Jan 2007266
Dec 2006103
Nov 2006222
Oct 2006187
Sep 2006166
Aug 2006281
Jul 2006180
Jun 2006262
May 2006282
Apr 2006247
Mar 2006304
Feb 2006349
Jan 2006558
Dec 2005412
Nov 2005288
Oct 2005313
Sep 2005339
Aug 2005426
Jul 2005228
Jun 2005178
May 2005140
Apr 2005497
Mar 2005398
Feb 200510