| Doğacan Güney | Re: First Plugin | Fri, 05 Oct, 13:12 |
| Doğacan Güney | Re: First Plugin | Fri, 05 Oct, 13:25 |
| Doğacan Güney | Re: First Plugin | Fri, 05 Oct, 13:47 |
| Doğacan Güney | Re: Adding new class to nutch | Mon, 29 Oct, 19:12 |
| Doğacan Güney (JIRA) | [jira] Commented: (NUTCH-559) NTLM, Basic and Digest Authentication schemes for web/proxy server | Wed, 03 Oct, 07:42 |
| Doğacan Güney (JIRA) | [jira] Commented: (NUTCH-563) Include custom fields in BasicQueryFilter | Wed, 03 Oct, 07:46 |
| Doğacan Güney (JIRA) | [jira] Commented: (NUTCH-508) ${hadoop.log.dir} and ${hadoop.log.file} are not propagated to the tasktracker | Thu, 04 Oct, 14:59 |
| Doğacan Güney (JIRA) | [jira] Resolved: (NUTCH-508) ${hadoop.log.dir} and ${hadoop.log.file} are not propagated to the tasktracker | Mon, 08 Oct, 10:58 |
| Doğacan Güney (JIRA) | [jira] Closed: (NUTCH-508) ${hadoop.log.dir} and ${hadoop.log.file} are not propagated to the tasktracker | Mon, 08 Oct, 11:00 |
| Doğacan Güney (JIRA) | [jira] Commented: (NUTCH-488) Avoid parsing uneccessary links and get a more relevant outlink list | Thu, 18 Oct, 07:17 |
| Doğacan Güney (JIRA) | [jira] Commented: (NUTCH-501) Implement a different caching mechanism for objects cached in configuration | Fri, 26 Oct, 12:21 |
| Doğacan Güney (JIRA) | [jira] Commented: (NUTCH-552) Upgrade Nutch to Hadoop 0.15.x | Sun, 28 Oct, 14:58 |
| Doğacan Güney (JIRA) | [jira] Assigned: (NUTCH-501) Implement a different caching mechanism for objects cached in configuration | Sun, 28 Oct, 19:27 |
| Doğacan Güney (JIRA) | [jira] Updated: (NUTCH-501) Implement a different caching mechanism for objects cached in configuration | Sun, 28 Oct, 19:27 |
| Doğacan Güney (JIRA) | [jira] Issue Comment Edited: (NUTCH-501) Implement a different caching mechanism for objects cached in configuration | Sun, 28 Oct, 19:29 |
| Doğacan Güney (JIRA) | [jira] Resolved: (NUTCH-501) Implement a different caching mechanism for objects cached in configuration | Mon, 29 Oct, 14:59 |
| Doğacan Güney (JIRA) | [jira] Commented: (NUTCH-552) Upgrade Nutch to Hadoop 0.15.x | Mon, 29 Oct, 19:23 |
| Doğacan Güney (JIRA) | [jira] Commented: (NUTCH-567) Proper (?) handling of URIs in TagSoup. | Tue, 30 Oct, 15:15 |
| Doğacan Güney (JIRA) | [jira] Commented: (NUTCH-552) Upgrade Nutch to Hadoop 0.15.x | Tue, 30 Oct, 19:56 |
| Andrzej Bialecki | Re: [jira] Closed: (NUTCH-562) Port mime type framework to use Tika mime detection framework | Tue, 09 Oct, 20:57 |
| Andrzej Bialecki | Re: Selective/Configurable HTML Parsing? | Tue, 16 Oct, 19:35 |
| Andrzej Bialecki | Re: Scoring API issues (LONG) | Thu, 18 Oct, 16:40 |
| Andrzej Bialecki | Re: Update to URL ordering from Generator.java | Wed, 24 Oct, 13:06 |
| Andrzej Bialecki | Re: Upgrading Nutch to Hadoop 0.14 or 0.15 | Thu, 25 Oct, 19:36 |
| Andrzej Bialecki | Re: What are the side effects of running crawl multiple times? | Mon, 29 Oct, 14:55 |
| Andrzej Bialecki (JIRA) | [jira] Commented: (NUTCH-488) Avoid parsing uneccessary links and get a more relevant outlink list | Thu, 18 Oct, 10:10 |
| Andrzej Bialecki (JIRA) | [jira] Commented: (NUTCH-565) Arc File to Nutch Segments Converter | Thu, 18 Oct, 15:33 |
| Andrzej Bialecki (JIRA) | [jira] Created: (NUTCH-569) Protocol plugins should report progress to the fetcher | Tue, 23 Oct, 12:26 |
| Andrzej Bialecki (JIRA) | [jira] Commented: (NUTCH-501) Implement a different caching mechanism for objects cached in configuration | Mon, 29 Oct, 14:54 |
| Antony Bowesman (JIRA) | [jira] Created: (NUTCH-564) External parser supports encoding attribute | Wed, 03 Oct, 21:42 |
| Antony Bowesman (JIRA) | [jira] Updated: (NUTCH-564) External parser supports encoding attribute | Wed, 03 Oct, 21:50 |
| Antony Bowesman (JIRA) | [jira] Updated: (NUTCH-564) External parser supports encoding attribute | Wed, 03 Oct, 21:53 |
| Antony Bowesman (JIRA) | [jira] Updated: (NUTCH-564) External parser supports encoding attribute | Wed, 03 Oct, 21:55 |
| Chris A. Mattmann (JIRA) | [jira] Updated: (NUTCH-562) Port mime type framework to use Tika mime detection framework | Sun, 07 Oct, 15:32 |
| Chris A. Mattmann (JIRA) | [jira] Updated: (NUTCH-562) Port mime type framework to use Tika mime detection framework | Sun, 07 Oct, 15:34 |
| Chris A. Mattmann (JIRA) | [jira] Issue Comment Edited: (NUTCH-562) Port mime type framework to use Tika mime detection framework | Sun, 07 Oct, 15:34 |
| Chris A. Mattmann (JIRA) | [jira] Resolved: (NUTCH-562) Port mime type framework to use Tika mime detection framework | Tue, 09 Oct, 00:24 |
| Chris A. Mattmann (JIRA) | [jira] Closed: (NUTCH-562) Port mime type framework to use Tika mime detection framework | Tue, 09 Oct, 00:26 |
| Chris Mattmann | Re: [jira] Closed: (NUTCH-562) Port mime type framework to use Tika mime detection framework | Tue, 09 Oct, 21:55 |
| Chris Mattmann | Re: [jira] Closed: (NUTCH-562) Port mime type framework to use Tika mime detection framework | Wed, 10 Oct, 18:48 |
| Chris Mattmann | Re: writing a new parse-exe plugin | Wed, 17 Oct, 20:50 |
| Chris Mattmann | Re: JIRA, Resolving and Closing Issues | Thu, 18 Oct, 17:08 |
| Christopher Bader | Choices in Nutch Web interface? | Wed, 10 Oct, 18:16 |
| Christopher Bader | RE: Choices in Nutch Web interface? | Wed, 10 Oct, 18:48 |
| Dawid Weiss | Re: Anyone looked for a better HTML parser? | Wed, 17 Oct, 12:12 |
| Dawid Weiss (JIRA) | [jira] Created: (NUTCH-567) Proper (?) handling of URIs in TagSoup. | Wed, 17 Oct, 12:07 |
| Dawid Weiss (JIRA) | [jira] Updated: (NUTCH-567) Proper (?) handling of URIs in TagSoup. | Wed, 17 Oct, 12:07 |
| Dawid Weiss (JIRA) | [jira] Updated: (NUTCH-567) Proper (?) handling of URIs in TagSoup. | Wed, 17 Oct, 12:09 |
| Dawid Weiss (JIRA) | [jira] Commented: (NUTCH-567) Proper (?) handling of URIs in TagSoup. | Thu, 18 Oct, 07:06 |
| Dennis Kubes | Re: Java Packages (missing) | Mon, 08 Oct, 05:26 |
| Dennis Kubes | Re: Strange RemoteException thrown while doing a parse of ~64m documents | Mon, 08 Oct, 05:30 |
| Dennis Kubes | Re: [jira] Closed: (NUTCH-562) Port mime type framework to use Tika mime detection framework | Tue, 09 Oct, 21:22 |
| Dennis Kubes | Re: [jira] Commented: (NUTCH-565) Arc File to Nutch Segments Converter | Wed, 10 Oct, 17:59 |
| Dennis Kubes | JIRA, Resolving and Closing Issues | Thu, 18 Oct, 16:58 |
| Dennis Kubes | Upgrading Nutch to Hadoop 0.14 or 0.15 | Thu, 25 Oct, 19:11 |
| Dennis Kubes | Re: Upgrading Nutch to Hadoop 0.14 or 0.15 | Thu, 25 Oct, 19:51 |
| Dennis Kubes (JIRA) | [jira] Created: (NUTCH-565) Arc File to Nutch Segments Converter | Tue, 09 Oct, 05:08 |
| Dennis Kubes (JIRA) | [jira] Updated: (NUTCH-565) Arc File to Nutch Segments Converter | Tue, 09 Oct, 05:11 |
| Dennis Kubes (JIRA) | [jira] Updated: (NUTCH-565) Arc File to Nutch Segments Converter | Tue, 09 Oct, 05:18 |
| Dennis Kubes (JIRA) | [jira] Updated: (NUTCH-565) Arc File to Nutch Segments Converter | Tue, 09 Oct, 18:59 |
| Dennis Kubes (JIRA) | [jira] Commented: (NUTCH-565) Arc File to Nutch Segments Converter | Tue, 09 Oct, 20:45 |
| Dennis Kubes (JIRA) | [jira] Commented: (NUTCH-565) Arc File to Nutch Segments Converter | Tue, 09 Oct, 20:45 |
| Dennis Kubes (JIRA) | [jira] Updated: (NUTCH-565) Arc File to Nutch Segments Converter | Thu, 11 Oct, 21:30 |
| Dennis Kubes (JIRA) | [jira] Updated: (NUTCH-565) Arc File to Nutch Segments Converter | Thu, 11 Oct, 21:32 |
| Dennis Kubes (JIRA) | [jira] Updated: (NUTCH-565) Arc File to Nutch Segments Converter | Thu, 11 Oct, 21:32 |
| Dennis Kubes (JIRA) | [jira] Commented: (NUTCH-488) Avoid parsing uneccessary links and get a more relevant outlink list | Tue, 16 Oct, 00:23 |
| Dennis Kubes (JIRA) | [jira] Commented: (NUTCH-565) Arc File to Nutch Segments Converter | Wed, 17 Oct, 22:43 |
| Dennis Kubes (JIRA) | [jira] Commented: (NUTCH-488) Avoid parsing uneccessary links and get a more relevant outlink list | Wed, 17 Oct, 22:43 |
| Dennis Kubes (JIRA) | [jira] Commented: (NUTCH-565) Arc File to Nutch Segments Converter | Thu, 18 Oct, 16:14 |
| Dennis Kubes (JIRA) | [jira] Resolved: (NUTCH-488) Avoid parsing uneccessary links and get a more relevant outlink list | Thu, 18 Oct, 16:55 |
| Dennis Kubes (JIRA) | [jira] Closed: (NUTCH-488) Avoid parsing uneccessary links and get a more relevant outlink list | Thu, 18 Oct, 16:55 |
| Dennis Kubes (JIRA) | [jira] Updated: (NUTCH-552) Upgrade Nutch to Hadoop 0.15.x | Thu, 25 Oct, 20:30 |
| Dennis Kubes (JIRA) | [jira] Updated: (NUTCH-552) Upgrade Nutch to Hadoop 0.15.x | Thu, 25 Oct, 21:22 |
| Dennis Kubes (JIRA) | [jira] Commented: (NUTCH-501) Implement a different caching mechanism for objects cached in configuration | Thu, 25 Oct, 21:26 |
| Dennis Kubes (JIRA) | [jira] Updated: (NUTCH-565) Arc File to Nutch Segments Converter | Fri, 26 Oct, 19:16 |
| Dennis Kubes (JIRA) | [jira] Updated: (NUTCH-565) Arc File to Nutch Segments Converter | Fri, 26 Oct, 19:16 |
| Dennis Kubes (JIRA) | [jira] Updated: (NUTCH-565) Arc File to Nutch Segments Converter | Fri, 26 Oct, 19:16 |
| Dennis Kubes (JIRA) | [jira] Commented: (NUTCH-552) Upgrade Nutch to Hadoop 0.15.x | Mon, 29 Oct, 14:54 |
| Dennis Kubes (JIRA) | [jira] Commented: (NUTCH-552) Upgrade Nutch to Hadoop 0.15.x | Tue, 30 Oct, 18:21 |
| Dennis Kubes (JIRA) | [jira] Updated: (NUTCH-552) Upgrade Nutch to Hadoop 0.15.x | Tue, 30 Oct, 18:27 |
| Doug Cook | Anyone looked for a better HTML parser? | Mon, 15 Oct, 20:44 |
| Doug Cook | Re: Anyone looked for a better HTML parser? | Tue, 16 Oct, 14:54 |
| Doug Cook (JIRA) | [jira] Created: (NUTCH-566) Sun's URL class has bug in creation of relative query URLs | Wed, 10 Oct, 15:56 |
| Doug Cook (JIRA) | [jira] Updated: (NUTCH-566) Sun's URL class has bug in creation of relative query URLs | Wed, 10 Oct, 15:58 |
| Doug Cook (JIRA) | [jira] Commented: (NUTCH-436) Incorrect handling of relative paths when the embedded URL path is empty | Tue, 16 Oct, 15:39 |
| Doug Cook (JIRA) | [jira] Commented: (NUTCH-567) Proper (?) handling of URIs in TagSoup. | Wed, 17 Oct, 15:39 |
| Emmanuel Joke (JIRA) | [jira] Commented: (NUTCH-508) ${hadoop.log.dir} and ${hadoop.log.file} are not propagated to the tasktracker | Fri, 05 Oct, 00:03 |
| Emmanuel Joke (JIRA) | [jira] Updated: (NUTCH-548) Move URLNormalizer from Outlink to ParseOutputFormat | Sat, 27 Oct, 17:32 |
| Enis Soztutar (JIRA) | [jira] Commented: (NUTCH-442) Integrate Solr/Nutch | Mon, 15 Oct, 15:33 |
| Enis Soztutar (JIRA) | [jira] Commented: (NUTCH-442) Integrate Solr/Nutch | Fri, 26 Oct, 13:21 |
| Hal Fulton | Hits estimation? | Mon, 01 Oct, 16:36 |
| Hal Fulton | Re: Hits estimation? | Mon, 01 Oct, 17:48 |
| Hudson (JIRA) | [jira] Commented: (NUTCH-508) ${hadoop.log.dir} and ${hadoop.log.file} are not propagated to the tasktracker | Tue, 09 Oct, 04:30 |
| Hudson (JIRA) | [jira] Commented: (NUTCH-562) Port mime type framework to use Tika mime detection framework | Tue, 09 Oct, 04:30 |
| Hudson (JIRA) | [jira] Commented: (NUTCH-488) Avoid parsing uneccessary links and get a more relevant outlink list | Fri, 19 Oct, 04:12 |
| Hudson (JIRA) | [jira] Commented: (NUTCH-501) Implement a different caching mechanism for objects cached in configuration | Tue, 30 Oct, 04:17 |
| James Phillips | Quote Please? | Fri, 26 Oct, 05:18 |
| Ken Krugler | Re: Update to URL ordering from Generator.java | Wed, 24 Oct, 15:29 |
| Ken Krugler | Re: Update to URL ordering from Generator.java | Wed, 24 Oct, 19:56 |
| Marcin Okraszewski (JIRA) | [jira] Updated: (NUTCH-488) Avoid parsing uneccessary links and get a more relevant outlink list | Mon, 15 Oct, 20:26 |