| Andrzej Bialecki |
Re: [PATCH] Moving HitDetails construction to a HitDetails constructor (v2). |
Fri, 01 Jun, 20:58 |
| Andrzej Bialecki |
Welcome Doğacan as Nutch committer |
Mon, 11 Jun, 20:33 |
| Andrzej Bialecki |
Re: upgrade to hadoop-0.13? |
Mon, 18 Jun, 09:22 |
| Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-392) OutputFormat implementations should pass on Progressable |
Fri, 01 Jun, 09:14 |
| Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-392) OutputFormat implementations should pass on Progressable |
Fri, 01 Jun, 14:42 |
| Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-392) OutputFormat implementations should pass on Progressable |
Sat, 02 Jun, 15:17 |
| Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-498) Use Combiner in LinkDb to increase speed of linkdb generation |
Fri, 15 Jun, 16:53 |
| Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-485) Change HtmlParseFilter 's to return ParseResult object instead of Parse object |
Sun, 17 Jun, 15:41 |
| Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-501) implementing a different caching mechanism for objects |
Mon, 18 Jun, 13:16 |
| Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-497) Extreme Nested Tags causes StackOverflowException in DomContentUtils...Spider Trap |
Thu, 21 Jun, 06:22 |
| Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-504) NUTCH-443 broke parsing during fetching |
Fri, 22 Jun, 08:49 |
| Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-501) Implement a different caching mechanism for objects cached in configuration |
Sat, 23 Jun, 13:28 |
| Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-498) Use Combiner in LinkDb to increase speed of linkdb generation |
Wed, 27 Jun, 11:08 |
| Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-392) OutputFormat implementations should pass on Progressable |
Thu, 28 Jun, 12:33 |
| Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-392) OutputFormat implementations should pass on Progressable |
Thu, 28 Jun, 18:46 |
| Briggs |
Plugins and Thread Safety |
Fri, 01 Jun, 16:16 |
| Briggs |
Re: Plugins and Thread Safety |
Fri, 01 Jun, 17:59 |
| Briggs |
Re: Plugins and Thread Safety |
Fri, 01 Jun, 19:30 |
| Briggs |
Re: Plugins and Thread Safety |
Mon, 04 Jun, 15:08 |
| Briggs |
Re: Plugins and Thread Safety |
Mon, 04 Jun, 16:07 |
| Briggs |
Re: [jira] Commented: (NUTCH-496) ConcurrentModificationException can be thrown when getSorted() is called. |
Mon, 04 Jun, 17:46 |
| Briggs |
Re: [jira] Commented: (NUTCH-496) ConcurrentModificationException can be thrown when getSorted() is called. |
Tue, 05 Jun, 16:16 |
| Briggs |
Lock file problems... |
Thu, 07 Jun, 15:20 |
| Briggs |
Re: Plugins initialized all the time! |
Fri, 08 Jun, 21:43 |
| Briggs |
Re: Plugins initialized all the time! |
Fri, 08 Jun, 21:43 |
| Chris A. Mattmann (JIRA) |
[jira] Commented: (NUTCH-443) allow parsers to return multiple Parse object, this will speed up the rss parser |
Sat, 16 Jun, 17:43 |
| Chris A. Mattmann (JIRA) |
[jira] Commented: (NUTCH-485) Change HtmlParseFilter 's to return ParseResult object instead of Parse object |
Sat, 16 Jun, 17:45 |
| Chris A. Mattmann (JIRA) |
[jira] Closed: (NUTCH-443) allow parsers to return multiple Parse object, this will speed up the rss parser |
Sun, 17 Jun, 17:21 |
| Chris A. Mattmann (JIRA) |
[jira] Resolved: (NUTCH-443) allow parsers to return multiple Parse object, this will speed up the rss parser |
Sun, 17 Jun, 17:21 |
| Chris A. Mattmann (JIRA) |
[jira] Commented: (NUTCH-444) Possibly use a different library to parse RSS feed for improved performance and compatibility |
Sun, 17 Jun, 17:24 |
| Chris A. Mattmann (JIRA) |
[jira] Work started: (NUTCH-444) Possibly use a different library to parse RSS feed for improved performance and compatibility |
Sun, 17 Jun, 19:15 |
| Chris A. Mattmann (JIRA) |
[jira] Updated: (NUTCH-444) Possibly use a different library to parse RSS feed for improved performance and compatibility |
Sun, 17 Jun, 20:07 |
| Chris A. Mattmann (JIRA) |
[jira] Commented: (NUTCH-444) Possibly use a different library to parse RSS feed for improved performance and compatibility |
Tue, 19 Jun, 13:16 |
| Chris A. Mattmann (JIRA) |
[jira] Resolved: (NUTCH-444) Possibly use a different library to parse RSS feed for improved performance and compatibility |
Tue, 19 Jun, 14:03 |
| Chris A. Mattmann (JIRA) |
[jira] Closed: (NUTCH-444) Possibly use a different library to parse RSS feed for improved performance and compatibility |
Tue, 19 Jun, 14:03 |
| Chris A. Mattmann (JIRA) |
[jira] Commented: (NUTCH-444) Possibly use a different library to parse RSS feed for improved performance and compatibility |
Tue, 19 Jun, 14:35 |
| Chris Mattmann |
Re: Welcome Doğacan as Nutch committer |
Tue, 12 Jun, 14:15 |
| Chris Mattmann |
Re: Build failed in Hudson: Nutch-Nightly #123 |
Wed, 20 Jun, 13:25 |
| Chris Mattmann |
Re: Build failed in Hudson: Nutch-Nightly #123 |
Wed, 20 Jun, 15:08 |
| Chris Mattmann |
Re: Build failed in Hudson: Nutch-Nightly #123 |
Wed, 20 Jun, 15:32 |
| Chris Mattmann |
Re: [jira] Updated: (NUTCH-497) Extreme Nested Tags causes StackOverflowException in DomContentUtils...Spider Trap |
Tue, 26 Jun, 00:09 |
| Chris Mattmann |
Re: svn commit: r550669 - in /lucene/nutch/trunk/src: java/org/apache/nutch/util/ plugin/languageidentifier/src/java/org/apache/nutch/analysis/lang/ plugin/parse-html/src/java/org/apache/nutch/parse/html/ test/org/apache/nutch/fetcher/ testresources/fetch-... |
Tue, 26 Jun, 04:40 |
| Chris Mattmann |
Re: svn commit: r550669 - in /lucene/nutch/trunk/src: java/org/apache/nutch/util/ plugin/languageidentifier/src/java/org/apache/nutch/analysis/lang/ plugin/parse-html/src/java/org/apache/nutch/parse/html/ test/org/apache/nutch/fetcher/ testresources/fetch-... |
Tue, 26 Jun, 04:49 |
| Dennis Kubes |
Re: How to create patch? |
Fri, 01 Jun, 07:11 |
| Dennis Kubes |
Re: Welcome Doğacan as Nutch committer |
Tue, 12 Jun, 04:06 |
| Dennis Kubes |
Re: Build failed in Hudson: Nutch-Nightly #123 |
Wed, 20 Jun, 13:59 |
| Dennis Kubes |
Re: [jira] Updated: (NUTCH-497) Extreme Nested Tags causes StackOverflowException in DomContentUtils...Spider Trap |
Mon, 25 Jun, 23:42 |
| Dennis Kubes |
Re: svn commit: r550669 - in /lucene/nutch/trunk/src: java/org/apache/nutch/util/ plugin/languageidentifier/src/java/org/apache/nutch/analysis/lang/ plugin/parse-html/src/java/org/apache/nutch/parse/html/ test/org/apache/nutch/fetcher/ testresources/fetch-... |
Tue, 26 Jun, 04:45 |
| Dennis Kubes (JIRA) |
[jira] Created: (NUTCH-497) Extreme Nested Tags causes StackOverflowException in DomContentUtils...Spider Trap |
Wed, 06 Jun, 23:34 |
| Dennis Kubes (JIRA) |
[jira] Updated: (NUTCH-497) Extreme Nested Tags causes StackOverflowException in DomContentUtils...Spider Trap |
Wed, 06 Jun, 23:36 |
| Dennis Kubes (JIRA) |
[jira] Updated: (NUTCH-497) Extreme Nested Tags causes StackOverflowException in DomContentUtils...Spider Trap |
Wed, 20 Jun, 16:41 |
| Dennis Kubes (JIRA) |
[jira] Commented: (NUTCH-497) Extreme Nested Tags causes StackOverflowException in DomContentUtils...Spider Trap |
Wed, 20 Jun, 16:45 |
| Dennis Kubes (JIRA) |
[jira] Updated: (NUTCH-497) Extreme Nested Tags causes StackOverflowException in DomContentUtils...Spider Trap |
Wed, 20 Jun, 23:46 |
| Dennis Kubes (JIRA) |
[jira] Commented: (NUTCH-497) Extreme Nested Tags causes StackOverflowException in DomContentUtils...Spider Trap |
Wed, 20 Jun, 23:50 |
| Dennis Kubes (JIRA) |
[jira] Commented: (NUTCH-497) Extreme Nested Tags causes StackOverflowException in DomContentUtils...Spider Trap |
Thu, 21 Jun, 13:21 |
| Dennis Kubes (JIRA) |
[jira] Updated: (NUTCH-497) Extreme Nested Tags causes StackOverflowException in DomContentUtils...Spider Trap |
Sun, 24 Jun, 15:25 |
| Dennis Kubes (JIRA) |
[jira] Updated: (NUTCH-497) Extreme Nested Tags causes StackOverflowException in DomContentUtils...Spider Trap |
Sun, 24 Jun, 15:27 |
| Dennis Kubes (JIRA) |
[jira] Updated: (NUTCH-497) Extreme Nested Tags causes StackOverflowException in DomContentUtils...Spider Trap |
Sun, 24 Jun, 15:27 |
| Dennis Kubes (JIRA) |
[jira] Updated: (NUTCH-497) Extreme Nested Tags causes StackOverflowException in DomContentUtils...Spider Trap |
Sun, 24 Jun, 15:27 |
| Dennis Kubes (JIRA) |
[jira] Updated: (NUTCH-497) Extreme Nested Tags causes StackOverflowException in DomContentUtils...Spider Trap |
Sun, 24 Jun, 15:27 |
| Dennis Kubes (JIRA) |
[jira] Closed: (NUTCH-497) Extreme Nested Tags causes StackOverflowException in DomContentUtils...Spider Trap |
Tue, 26 Jun, 03:35 |
| Dennis Kubes (JIRA) |
[jira] Resolved: (NUTCH-497) Extreme Nested Tags causes StackOverflowException in DomContentUtils...Spider Trap |
Tue, 26 Jun, 03:35 |
| Doug Cutting |
[Fwd: Nutch 0.9 and Crawl-Delay] |
Mon, 04 Jun, 20:25 |
| Doug Cutting |
Re: JIRA email question |
Wed, 27 Jun, 17:43 |
| Doug Cutting (JIRA) |
[jira] Commented: (NUTCH-392) OutputFormat implementations should pass on Progressable |
Fri, 01 Jun, 19:56 |
| Doug Cutting (JIRA) |
[jira] Commented: (NUTCH-479) Support for OR queries |
Fri, 22 Jun, 18:31 |
| Emmanuel Joke (JIRA) |
[jira] Created: (NUTCH-500) Add hadoop masters configuration file into conf folder |
Mon, 18 Jun, 06:50 |
| Emmanuel Joke (JIRA) |
[jira] Commented: (NUTCH-503) Generator exits incorrectly for small fetchlists |
Thu, 21 Jun, 14:44 |
| Emmanuel Joke (JIRA) |
[jira] Commented: (NUTCH-503) Generator exits incorrectly for small fetchlists |
Fri, 22 Jun, 18:12 |
| Emmanuel Joke (JIRA) |
[jira] Commented: (NUTCH-503) Generator exits incorrectly for small fetchlists |
Fri, 29 Jun, 07:31 |
| Enis Soztutar (JIRA) |
[jira] Commented: (NUTCH-498) Use Combiner in LinkDb to increase speed of linkdb generation |
Fri, 15 Jun, 07:58 |
| Enis Soztutar (JIRA) |
[jira] Commented: (NUTCH-501) implementing a different caching mechanism for objects |
Mon, 18 Jun, 12:21 |
| Enzo Michelangeli |
Re: Loading mechanism of plugin classes and singleton objects |
Sat, 09 Jun, 07:53 |
| Enzo Michelangeli (JIRA) |
[jira] Commented: (NUTCH-356) Plugin repository cache can lead to memory leak |
Mon, 11 Jun, 03:29 |
| Enzo Michelangeli (JIRA) |
[jira] Commented: (NUTCH-501) Implement a different caching mechanism for objects cached in configuration |
Sun, 24 Jun, 13:23 |
| Espen Amble Kolstad (JIRA) |
[jira] Created: (NUTCH-498) Use Combiner in LinkDb to increase speed of linkdb generation |
Thu, 14 Jun, 08:07 |
| Espen Amble Kolstad (JIRA) |
[jira] Updated: (NUTCH-498) Use Combiner in LinkDb to increase speed of linkdb generation |
Thu, 14 Jun, 08:28 |
| Espen Amble Kolstad (JIRA) |
[jira] Updated: (NUTCH-498) Use Combiner in LinkDb to increase speed of linkdb generation |
Fri, 15 Jun, 09:21 |
| Espen Amble Kolstad (JIRA) |
[jira] Commented: (NUTCH-498) Use Combiner in LinkDb to increase speed of linkdb generation |
Fri, 15 Jun, 14:06 |
| Espen Amble Kolstad (JIRA) |
[jira] Updated: (NUTCH-498) Use Combiner in LinkDb to increase speed of linkdb generation |
Fri, 15 Jun, 14:08 |
| Gal Nitzan |
RE: Lock file problems... |
Thu, 07 Jun, 16:52 |
| Gal Nitzan |
RE: [jira] Resolved: (NUTCH-485) Change HtmlParseFilter 's to return ParseResult object instead of Parse object |
Sun, 17 Jun, 21:10 |
| Gal Nitzan (JIRA) |
[jira] Commented: (NUTCH-485) Change HtmlParseFilter 's to return ParseResult object instead of Parse object |
Wed, 06 Jun, 12:04 |
| Hudson (JIRA) |
[jira] Commented: (NUTCH-471) Fix synchronization in NutchBean creation |
Fri, 22 Jun, 07:03 |
| Hudson (JIRA) |
[jira] Commented: (NUTCH-504) NUTCH-443 broke parsing during fetching |
Mon, 25 Jun, 07:02 |
| Hudson (JIRA) |
[jira] Commented: (NUTCH-468) Scoring filter should distribute score to all outlinks at once |
Mon, 25 Jun, 07:02 |
| Hudson (JIRA) |
[jira] Commented: (NUTCH-497) Extreme Nested Tags causes StackOverflowException in DomContentUtils...Spider Trap |
Tue, 26 Jun, 07:07 |
| Hudson (JIRA) |
[jira] Commented: (NUTCH-474) Fetcher2 sets server-delay and blocking checks incorrectly |
Thu, 28 Jun, 07:03 |
| Hudson (JIRA) |
[jira] Commented: (NUTCH-498) Use Combiner in LinkDb to increase speed of linkdb generation |
Thu, 28 Jun, 07:04 |
| Hudson (JIRA) |
[jira] Commented: (NUTCH-499) Refactor LinkDb and LinkDbMerger to reuse code |
Thu, 28 Jun, 07:04 |
| Kai_testing Middleton |
Re: [jira] Commented: (NUTCH-505) Outlink urls should be validated |
Tue, 26 Jun, 18:04 |
| Kai_testing Middleton |
NUTCH-119 :: how hard to fix |
Wed, 27 Jun, 00:49 |
| Kai_testing Middleton |
Re: NUTCH-119 :: how hard to fix |
Wed, 27 Jun, 20:00 |
| Luca Rondanini |
Re-crawling Problem |
Tue, 26 Jun, 15:37 |
| Marc Miller (JIRA) |
[jira] Created: (NUTCH-496) ConcurrentModificationException can be thrown when getSorted() is called. |
Mon, 04 Jun, 16:33 |
| Marc Miller (JIRA) |
[jira] Updated: (NUTCH-496) ConcurrentModificationException can be thrown when getSorted() is called. |
Mon, 04 Jun, 16:37 |
| Marc Miller (JIRA) |
[jira] Updated: (NUTCH-496) ConcurrentModificationException can be thrown when getSorted() is called. |
Mon, 04 Jun, 16:37 |
| Marcin Okraszewski |
Re: How to create patch? |
Fri, 01 Jun, 06:34 |
| Nigel Daley |
Re: Build failed in Hudson: Nutch-Nightly #123 |
Wed, 20 Jun, 21:53 |
| Oscar |
Fwd: failed to subscribe 'nutch-user' maillist |
Sat, 30 Jun, 10:58 |