| nutch.newbie (JIRA) |
[jira] Commented: (NUTCH-444) Possibly use a different library to parse RSS feed for improved performance and compatibility |
Mon, 18 Jun, 21:59 |
| Doğacan Güney (JIRA) |
[jira] Created: (NUTCH-502) Bug in SegmentReader causes infinite loop |
Tue, 19 Jun, 06:01 |
| Doğacan Güney (JIRA) |
[jira] Updated: (NUTCH-502) Bug in SegmentReader causes infinite loop |
Tue, 19 Jun, 06:03 |
| hud...@lucene.zones.apache.org |
Build failed in Hudson: Nutch-Nightly #122 |
Tue, 19 Jun, 07:00 |
| Doğacan Güney (JIRA) |
[jira] Commented: (NUTCH-444) Possibly use a different library to parse RSS feed for improved performance and compatibility |
Tue, 19 Jun, 07:22 |
| Doğacan Güney (JIRA) |
[jira] Resolved: (NUTCH-502) Bug in SegmentReader causes infinite loop |
Tue, 19 Jun, 09:22 |
| Chris A. Mattmann (JIRA) |
[jira] Commented: (NUTCH-444) Possibly use a different library to parse RSS feed for improved performance and compatibility |
Tue, 19 Jun, 13:16 |
| Chris A. Mattmann (JIRA) |
[jira] Resolved: (NUTCH-444) Possibly use a different library to parse RSS feed for improved performance and compatibility |
Tue, 19 Jun, 14:03 |
| Chris A. Mattmann (JIRA) |
[jira] Closed: (NUTCH-444) Possibly use a different library to parse RSS feed for improved performance and compatibility |
Tue, 19 Jun, 14:03 |
| Doğacan Güney (JIRA) |
[jira] Commented: (NUTCH-444) Possibly use a different library to parse RSS feed for improved performance and compatibility |
Tue, 19 Jun, 14:27 |
| Chris A. Mattmann (JIRA) |
[jira] Commented: (NUTCH-444) Possibly use a different library to parse RSS feed for improved performance and compatibility |
Tue, 19 Jun, 14:35 |
| hud...@lucene.zones.apache.org |
Build failed in Hudson: Nutch-Nightly #123 |
Wed, 20 Jun, 07:00 |
| Doğacan Güney |
Re: Build failed in Hudson: Nutch-Nightly #123 |
Wed, 20 Jun, 07:07 |
| Doğacan Güney |
Re: Build failed in Hudson: Nutch-Nightly #123 |
Wed, 20 Jun, 13:04 |
| Chris Mattmann |
Re: Build failed in Hudson: Nutch-Nightly #123 |
Wed, 20 Jun, 13:25 |
| Dennis Kubes |
Re: Build failed in Hudson: Nutch-Nightly #123 |
Wed, 20 Jun, 13:59 |
| Doğacan Güney |
Re: Build failed in Hudson: Nutch-Nightly #123 |
Wed, 20 Jun, 14:17 |
| Doğacan Güney |
Re: Build failed in Hudson: Nutch-Nightly #123 |
Wed, 20 Jun, 14:19 |
| Chris Mattmann |
Re: Build failed in Hudson: Nutch-Nightly #123 |
Wed, 20 Jun, 15:08 |
| Doğacan Güney |
Re: Build failed in Hudson: Nutch-Nightly #123 |
Wed, 20 Jun, 15:17 |
| Chris Mattmann |
Re: Build failed in Hudson: Nutch-Nightly #123 |
Wed, 20 Jun, 15:32 |
| Dennis Kubes (JIRA) |
[jira] Updated: (NUTCH-497) Extreme Nested Tags causes StackOverflowException in DomContentUtils...Spider Trap |
Wed, 20 Jun, 16:41 |
| Dennis Kubes (JIRA) |
[jira] Commented: (NUTCH-497) Extreme Nested Tags causes StackOverflowException in DomContentUtils...Spider Trap |
Wed, 20 Jun, 16:45 |
| Doğacan Güney (JIRA) |
[jira] Commented: (NUTCH-497) Extreme Nested Tags causes StackOverflowException in DomContentUtils...Spider Trap |
Wed, 20 Jun, 18:09 |
| Doğacan Güney (JIRA) |
[jira] Commented: (NUTCH-444) Possibly use a different library to parse RSS feed for improved performance and compatibility |
Wed, 20 Jun, 18:45 |
| Nigel Daley |
Re: Build failed in Hudson: Nutch-Nightly #123 |
Wed, 20 Jun, 21:53 |
| Dennis Kubes (JIRA) |
[jira] Updated: (NUTCH-497) Extreme Nested Tags causes StackOverflowException in DomContentUtils...Spider Trap |
Wed, 20 Jun, 23:46 |
| Dennis Kubes (JIRA) |
[jira] Commented: (NUTCH-497) Extreme Nested Tags causes StackOverflowException in DomContentUtils...Spider Trap |
Wed, 20 Jun, 23:50 |
| Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-497) Extreme Nested Tags causes StackOverflowException in DomContentUtils...Spider Trap |
Thu, 21 Jun, 06:22 |
| Vishal Shah |
Found the bug in Generator when number of URLs is small |
Thu, 21 Jun, 06:43 |
| Doğacan Güney |
Re: Found the bug in Generator when number of URLs is small |
Thu, 21 Jun, 07:03 |
| hud...@lucene.zones.apache.org |
Hudson build is back to normal: Nutch-Nightly #124 |
Thu, 21 Jun, 07:07 |
| Vishal Shah (JIRA) |
[jira] Created: (NUTCH-503) Generator exits incorrectly for small fetchlists |
Thu, 21 Jun, 07:39 |
| Vishal Shah (JIRA) |
[jira] Updated: (NUTCH-503) Generator exits incorrectly for small fetchlists |
Thu, 21 Jun, 08:07 |
| Vishal Shah (JIRA) |
[jira] Updated: (NUTCH-503) Generator exits incorrectly for small fetchlists |
Thu, 21 Jun, 09:53 |
| Vishal Shah |
http.content.limit not respected when the Content-Type header has charset attributes |
Thu, 21 Jun, 10:06 |
| Vishal Shah |
RE: Found the bug in Generator when number of URLs is small |
Thu, 21 Jun, 10:43 |
| Doğacan Güney (JIRA) |
[jira] Commented: (NUTCH-471) Fix synchronization in NutchBean creation |
Thu, 21 Jun, 12:29 |
| Dennis Kubes (JIRA) |
[jira] Commented: (NUTCH-497) Extreme Nested Tags causes StackOverflowException in DomContentUtils...Spider Trap |
Thu, 21 Jun, 13:21 |
| Emmanuel Joke (JIRA) |
[jira] Commented: (NUTCH-503) Generator exits incorrectly for small fetchlists |
Thu, 21 Jun, 14:44 |
| Doğacan Güney (JIRA) |
[jira] Resolved: (NUTCH-471) Fix synchronization in NutchBean creation |
Thu, 21 Jun, 15:18 |
| Vishal Shah (JIRA) |
[jira] Commented: (NUTCH-503) Generator exits incorrectly for small fetchlists |
Fri, 22 Jun, 06:49 |
| Hudson (JIRA) |
[jira] Commented: (NUTCH-471) Fix synchronization in NutchBean creation |
Fri, 22 Jun, 07:03 |
| qi wu |
where to put hadoop native lib in tomcat? |
Fri, 22 Jun, 08:11 |
| Doğacan Güney (JIRA) |
[jira] Created: (NUTCH-504) NUTCH-443 broke parsing during fetching |
Fri, 22 Jun, 08:30 |
| Doğacan Güney (JIRA) |
[jira] Updated: (NUTCH-504) NUTCH-443 broke parsing during fetching |
Fri, 22 Jun, 08:32 |
| Doğacan Güney (JIRA) |
[jira] Commented: (NUTCH-504) NUTCH-443 broke parsing during fetching |
Fri, 22 Jun, 08:34 |
| Doğacan Güney (JIRA) |
[jira] Commented: (NUTCH-465) I download nutch 0.9 used tar zxvf nutch-0.9.tar.gz at last A lone zero block |
Fri, 22 Jun, 08:44 |
| Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-504) NUTCH-443 broke parsing during fetching |
Fri, 22 Jun, 08:49 |
| Doğacan Güney (JIRA) |
[jira] Commented: (NUTCH-503) Generator exits incorrectly for small fetchlists |
Fri, 22 Jun, 08:51 |
| Doğacan Güney (JIRA) |
[jira] Issue Comment Edited: (NUTCH-503) Generator exits incorrectly for small fetchlists |
Fri, 22 Jun, 08:59 |
| Rob Young (JIRA) |
[jira] Commented: (NUTCH-479) Support for OR queries |
Fri, 22 Jun, 12:22 |
| Doğacan Güney (JIRA) |
[jira] Updated: (NUTCH-504) NUTCH-443 broke parsing during fetching |
Fri, 22 Jun, 12:24 |
| Doğacan Güney (JIRA) |
[jira] Commented: (NUTCH-468) Scoring filter should distribute score to all outlinks at once |
Fri, 22 Jun, 14:30 |
| Emmanuel Joke (JIRA) |
[jira] Commented: (NUTCH-503) Generator exits incorrectly for small fetchlists |
Fri, 22 Jun, 18:12 |
| Doug Cutting (JIRA) |
[jira] Commented: (NUTCH-479) Support for OR queries |
Fri, 22 Jun, 18:31 |
| Doğacan Güney (JIRA) |
[jira] Commented: (NUTCH-503) Generator exits incorrectly for small fetchlists |
Fri, 22 Jun, 22:39 |
| hud...@lucene.zones.apache.org |
Build failed in Hudson: Nutch-Nightly #126 |
Sat, 23 Jun, 07:00 |
| Doğacan Güney (JIRA) |
[jira] Commented: (NUTCH-25) needs 'character encoding' detector |
Sat, 23 Jun, 11:06 |
| Doğacan Güney (JIRA) |
[jira] Updated: (NUTCH-501) Implement a different caching mechanism for objects cached in configuration |
Sat, 23 Jun, 13:08 |
| Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-501) Implement a different caching mechanism for objects cached in configuration |
Sat, 23 Jun, 13:28 |
| Nicolás Lichtmaier (JIRA) |
[jira] Commented: (NUTCH-501) Implement a different caching mechanism for objects cached in configuration |
Sat, 23 Jun, 16:35 |
| Doğacan Güney (JIRA) |
[jira] Commented: (NUTCH-501) Implement a different caching mechanism for objects cached in configuration |
Sat, 23 Jun, 19:12 |
| Doğacan Güney (JIRA) |
[jira] Created: (NUTCH-505) Outlink urls should be validated |
Sat, 23 Jun, 20:15 |
| Doğacan Güney (JIRA) |
[jira] Updated: (NUTCH-505) Outlink urls should be validated |
Sat, 23 Jun, 20:21 |
| Doğacan Güney (JIRA) |
[jira] Updated: (NUTCH-501) Implement a different caching mechanism for objects cached in configuration |
Sat, 23 Jun, 20:45 |
| hud...@lucene.zones.apache.org |
Hudson build is back to normal: Nutch-Nightly #127 |
Sun, 24 Jun, 07:03 |
| Doğacan Güney (JIRA) |
[jira] Resolved: (NUTCH-468) Scoring filter should distribute score to all outlinks at once |
Sun, 24 Jun, 09:30 |
| Doğacan Güney (JIRA) |
[jira] Resolved: (NUTCH-504) NUTCH-443 broke parsing during fetching |
Sun, 24 Jun, 10:05 |
| Enzo Michelangeli (JIRA) |
[jira] Commented: (NUTCH-501) Implement a different caching mechanism for objects cached in configuration |
Sun, 24 Jun, 13:23 |
| Doğacan Güney (JIRA) |
[jira] Updated: (NUTCH-505) Outlink urls should be validated |
Sun, 24 Jun, 13:40 |
| Dennis Kubes (JIRA) |
[jira] Updated: (NUTCH-497) Extreme Nested Tags causes StackOverflowException in DomContentUtils...Spider Trap |
Sun, 24 Jun, 15:25 |
| Dennis Kubes (JIRA) |
[jira] Updated: (NUTCH-497) Extreme Nested Tags causes StackOverflowException in DomContentUtils...Spider Trap |
Sun, 24 Jun, 15:27 |
| Dennis Kubes (JIRA) |
[jira] Updated: (NUTCH-497) Extreme Nested Tags causes StackOverflowException in DomContentUtils...Spider Trap |
Sun, 24 Jun, 15:27 |
| Dennis Kubes (JIRA) |
[jira] Updated: (NUTCH-497) Extreme Nested Tags causes StackOverflowException in DomContentUtils...Spider Trap |
Sun, 24 Jun, 15:27 |
| Dennis Kubes (JIRA) |
[jira] Updated: (NUTCH-497) Extreme Nested Tags causes StackOverflowException in DomContentUtils...Spider Trap |
Sun, 24 Jun, 15:27 |
| Doğacan Güney (JIRA) |
[jira] Updated: (NUTCH-356) Plugin repository cache can lead to memory leak |
Sun, 24 Jun, 19:05 |
| Hudson (JIRA) |
[jira] Commented: (NUTCH-504) NUTCH-443 broke parsing during fetching |
Mon, 25 Jun, 07:02 |
| Hudson (JIRA) |
[jira] Commented: (NUTCH-468) Scoring filter should distribute score to all outlinks at once |
Mon, 25 Jun, 07:02 |
| Doğacan Güney (JIRA) |
[jira] Commented: (NUTCH-505) Outlink urls should be validated |
Mon, 25 Jun, 08:09 |
| Dennis Kubes |
Re: [jira] Updated: (NUTCH-497) Extreme Nested Tags causes StackOverflowException in DomContentUtils...Spider Trap |
Mon, 25 Jun, 23:42 |
| Chris Mattmann |
Re: [jira] Updated: (NUTCH-497) Extreme Nested Tags causes StackOverflowException in DomContentUtils...Spider Trap |
Tue, 26 Jun, 00:09 |
| Dennis Kubes (JIRA) |
[jira] Closed: (NUTCH-497) Extreme Nested Tags causes StackOverflowException in DomContentUtils...Spider Trap |
Tue, 26 Jun, 03:35 |
| Dennis Kubes (JIRA) |
[jira] Resolved: (NUTCH-497) Extreme Nested Tags causes StackOverflowException in DomContentUtils...Spider Trap |
Tue, 26 Jun, 03:35 |
| Chris Mattmann |
Re: svn commit: r550669 - in /lucene/nutch/trunk/src: java/org/apache/nutch/util/ plugin/languageidentifier/src/java/org/apache/nutch/analysis/lang/ plugin/parse-html/src/java/org/apache/nutch/parse/html/ test/org/apache/nutch/fetcher/ testresources/fetch-... |
Tue, 26 Jun, 04:40 |
| Dennis Kubes |
Re: svn commit: r550669 - in /lucene/nutch/trunk/src: java/org/apache/nutch/util/ plugin/languageidentifier/src/java/org/apache/nutch/analysis/lang/ plugin/parse-html/src/java/org/apache/nutch/parse/html/ test/org/apache/nutch/fetcher/ testresources/fetch-... |
Tue, 26 Jun, 04:45 |
| Chris Mattmann |
Re: svn commit: r550669 - in /lucene/nutch/trunk/src: java/org/apache/nutch/util/ plugin/languageidentifier/src/java/org/apache/nutch/analysis/lang/ plugin/parse-html/src/java/org/apache/nutch/parse/html/ test/org/apache/nutch/fetcher/ testresources/fetch-... |
Tue, 26 Jun, 04:49 |
| Hudson (JIRA) |
[jira] Commented: (NUTCH-497) Extreme Nested Tags causes StackOverflowException in DomContentUtils...Spider Trap |
Tue, 26 Jun, 07:07 |
| Doğacan Güney (JIRA) |
[jira] Commented: (NUTCH-499) Refactor LinkDb and LinkDbMerger to reuse code |
Tue, 26 Jun, 12:48 |
| Doğacan Güney (JIRA) |
[jira] Updated: (NUTCH-434) Replace usage of ObjectWritable with something based on GenericWritable |
Tue, 26 Jun, 13:26 |
| Luca Rondanini |
Re-crawling Problem |
Tue, 26 Jun, 15:37 |
| Sami Siren (JIRA) |
[jira] Commented: (NUTCH-434) Replace usage of ObjectWritable with something based on GenericWritable |
Tue, 26 Jun, 15:51 |
| Doğacan Güney (JIRA) |
[jira] Commented: (NUTCH-434) Replace usage of ObjectWritable with something based on GenericWritable |
Tue, 26 Jun, 16:44 |
| Sami Siren (JIRA) |
[jira] Commented: (NUTCH-434) Replace usage of ObjectWritable with something based on GenericWritable |
Tue, 26 Jun, 16:56 |
| Doğacan Güney (JIRA) |
[jira] Updated: (NUTCH-434) Replace usage of ObjectWritable with something based on GenericWritable |
Tue, 26 Jun, 17:25 |
| Kai_testing Middleton |
Re: [jira] Commented: (NUTCH-505) Outlink urls should be validated |
Tue, 26 Jun, 18:04 |
| Kai_testing Middleton |
NUTCH-119 :: how hard to fix |
Wed, 27 Jun, 00:49 |
| Doğacan Güney |
Re: NUTCH-119 :: how hard to fix |
Wed, 27 Jun, 05:56 |
| Doğacan Güney (JIRA) |
[jira] Commented: (NUTCH-289) CrawlDatum should store IP address |
Wed, 27 Jun, 06:39 |
| Sami Siren (JIRA) |
[jira] Commented: (NUTCH-499) Refactor LinkDb and LinkDbMerger to reuse code |
Wed, 27 Jun, 07:01 |