| Johannes Zillmann (JIRA) |
[jira] Commented: (NUTCH-273) When a page is redirected, the original url is NOT updated. |
Sun, 19 Nov, 16:13 |
| Sami Siren (JIRA) |
[jira] Resolved: (NUTCH-403) Make URL filtering optional in Generator |
Sun, 19 Nov, 18:51 |
| TKDD |
Can I rewrite org.apache.nutch.parse.msword.extractText(InputStream input) like this |
Mon, 20 Nov, 03:00 |
| scott green |
Errors in RegexURLFilter |
Mon, 20 Nov, 15:28 |
| Sami Siren |
Re: Errors in RegexURLFilter |
Mon, 20 Nov, 16:38 |
| Doğacan Güney (JIRA) |
[jira] Updated: (NUTCH-92) DistributedSearch incorrectly scores results |
Mon, 20 Nov, 17:00 |
| scott green |
What's the status of Nutch-GUI? |
Mon, 20 Nov, 17:12 |
| scott green |
Re: Errors in RegexURLFilter |
Mon, 20 Nov, 17:12 |
| Sami Siren |
Re: What's the status of Nutch-GUI? |
Mon, 20 Nov, 17:24 |
| scott green |
Re: What's the status of Nutch-GUI? |
Mon, 20 Nov, 17:27 |
| Chris Mattmann |
Re: What's the status of Nutch-GUI? |
Mon, 20 Nov, 18:39 |
| nutch.newbie (JIRA) |
[jira] Commented: (NUTCH-251) Administration GUI |
Mon, 20 Nov, 21:14 |
| Armel T. Nene |
RE: What's the status of Nutch-GUI? |
Mon, 20 Nov, 21:44 |
| Rida Benjelloun (JIRA) |
[jira] Commented: (NUTCH-185) XMLParser is configurable xml parser plugin. |
Mon, 20 Nov, 22:16 |
| Armel T. Nene |
RE: [jira] Commented: (NUTCH-185) XMLParser is configurable xml parser plugin. |
Mon, 20 Nov, 22:26 |
| Chris Mattmann |
Re: What's the status of Nutch-GUI? |
Mon, 20 Nov, 23:29 |
| Armel T. Nene |
RE: What's the status of Nutch-GUI? |
Tue, 21 Nov, 00:04 |
| Sami Siren (JIRA) |
[jira] Commented: (NUTCH-251) Administration GUI |
Tue, 21 Nov, 05:15 |
| Gavino Marras |
Nutch HTTPS & Sessions |
Tue, 21 Nov, 08:24 |
| Gavino Marras |
Nutch crawl a Application Server Authentication |
Tue, 21 Nov, 08:57 |
| Enis Soztutar |
Re: What's the status of Nutch-GUI? |
Tue, 21 Nov, 12:17 |
| Sami Siren (JIRA) |
[jira] Created: (NUTCH-405) Content object is not properly initialized in map method of ParseSegment |
Tue, 21 Nov, 17:18 |
| Sami Siren (JIRA) |
[jira] Resolved: (NUTCH-405) Content object is not properly initialized in map method of ParseSegment |
Tue, 21 Nov, 17:21 |
| Sami Siren (JIRA) |
[jira] Closed: (NUTCH-380) Nutch does not run/build against Hadoop 0.6 |
Tue, 21 Nov, 17:33 |
| Sami Siren (JIRA) |
[jira] Closed: (NUTCH-349) Port Nutch to use Hadoop Text instead of UTF8 |
Tue, 21 Nov, 17:39 |
| Sami Siren (JIRA) |
[jira] Resolved: (NUTCH-362) Remove parse-text from unsupported filetypes in parse-plugins.xml |
Tue, 21 Nov, 17:53 |
| Gavino Marras |
Nutch sessions cookies https |
Tue, 21 Nov, 18:00 |
| scott green |
Re: What's the status of Nutch-GUI? |
Tue, 21 Nov, 18:34 |
| Sami Siren (JIRA) |
[jira] Resolved: (NUTCH-305) Update crawl and url filter lists to exclude jpeg|JPEG|bmp|BMP |
Tue, 21 Nov, 18:41 |
| Sami Siren |
Re: What's the status of Nutch-GUI? |
Tue, 21 Nov, 20:03 |
| Stefan Groschupf |
Re: What's the status of Nutch-GUI? |
Tue, 21 Nov, 20:08 |
| nutch.newbie (JIRA) |
[jira] Commented: (NUTCH-251) Administration GUI |
Tue, 21 Nov, 20:50 |
| Armel T. Nene |
Nutch folder configuration |
Tue, 21 Nov, 21:55 |
| Armel T. Nene |
RE: Nutch folder configuration |
Tue, 21 Nov, 22:45 |
| scott green |
Re: What's the status of Nutch-GUI? |
Wed, 22 Nov, 02:22 |
| scott green |
Re: What's the status of Nutch-GUI? |
Wed, 22 Nov, 02:26 |
| Sami Siren |
Re: What's the status of Nutch-GUI? |
Wed, 22 Nov, 04:29 |
| scott green |
Re: More fetcher speed increases |
Wed, 22 Nov, 04:40 |
| Stefan Groschupf |
Re: What's the status of Nutch-GUI? |
Wed, 22 Nov, 05:12 |
| AJ Chen |
Re: [jira] Commented: (NUTCH-395) Increase fetching speed |
Wed, 22 Nov, 17:09 |
| Armel T. Nene |
Nutch - Hadoop error |
Wed, 22 Nov, 17:49 |
| Sami Siren |
Re: [jira] Commented: (NUTCH-395) Increase fetching speed |
Wed, 22 Nov, 18:20 |
| AJ Chen |
Re: [jira] Commented: (NUTCH-395) Increase fetching speed |
Wed, 22 Nov, 23:14 |
| Scott Green |
Question on adaptive re-fetch plugin |
Thu, 23 Nov, 06:37 |
| Zaheed Haque |
Re: What's the status of Nutch-GUI? |
Thu, 23 Nov, 07:20 |
| Scott Green |
Re: What's the status of Nutch-GUI? |
Thu, 23 Nov, 08:16 |
| Doğacan Güney (JIRA) |
[jira] Commented: (NUTCH-331) Fetcher incorrectly reports task progress to tasktracker resulting in skipped URLs |
Thu, 23 Nov, 10:27 |
| Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-331) Fetcher incorrectly reports task progress to tasktracker resulting in skipped URLs |
Thu, 23 Nov, 10:56 |
| Andrzej Bialecki (JIRA) |
[jira] Closed: (NUTCH-331) Fetcher incorrectly reports task progress to tasktracker resulting in skipped URLs |
Thu, 23 Nov, 10:58 |
| Andrzej Bialecki |
Welcome Chris Mattmann as Nutch committer |
Thu, 23 Nov, 12:10 |
| Doğacan Güney (JIRA) |
[jira] Created: (NUTCH-406) Metadata tries to write null values |
Thu, 23 Nov, 13:27 |
| Doğacan Güney (JIRA) |
[jira] Updated: (NUTCH-406) Metadata tries to write null values |
Thu, 23 Nov, 13:29 |
| Enis Soztutar (JIRA) |
[jira] Updated: (NUTCH-251) Administration GUI |
Thu, 23 Nov, 14:35 |
| Zaheed Haque |
Re: [jira] Updated: (NUTCH-251) Administration GUI |
Thu, 23 Nov, 14:54 |
| Chris A. Mattmann (JIRA) |
[jira] Updated: (NUTCH-406) Metadata tries to write null values |
Thu, 23 Nov, 15:45 |
| Chris A. Mattmann (JIRA) |
[jira] Work started: (NUTCH-406) Metadata tries to write null values |
Thu, 23 Nov, 15:45 |
| Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-406) Metadata tries to write null values |
Thu, 23 Nov, 15:59 |
| Doğacan Güney (JIRA) |
[jira] Updated: (NUTCH-406) Metadata tries to write null values |
Thu, 23 Nov, 16:18 |
| Chris A. Mattmann (JIRA) |
[jira] Commented: (NUTCH-406) Metadata tries to write null values |
Thu, 23 Nov, 16:26 |
| Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-406) Metadata tries to write null values |
Thu, 23 Nov, 16:44 |
| Chris A. Mattmann (JIRA) |
[jira] Commented: (NUTCH-406) Metadata tries to write null values |
Thu, 23 Nov, 16:48 |
| Chris A. Mattmann (JIRA) |
[jira] Commented: (NUTCH-406) Metadata tries to write null values |
Thu, 23 Nov, 16:50 |
| Chris A. Mattmann (JIRA) |
[jira] Resolved: (NUTCH-406) Metadata tries to write null values |
Thu, 23 Nov, 17:18 |
| Chris A. Mattmann (JIRA) |
[jira] Closed: (NUTCH-406) Metadata tries to write null values |
Thu, 23 Nov, 17:20 |
| Sami Siren |
Re: [jira] Closed: (NUTCH-406) Metadata tries to write null values |
Thu, 23 Nov, 17:45 |
| Chris Mattmann |
Re: [jira] Closed: (NUTCH-406) Metadata tries to write null values |
Thu, 23 Nov, 18:08 |
| Sami Siren |
Re: What's the status of Nutch-GUI? |
Thu, 23 Nov, 19:28 |
| Chris Mattmann |
Re: Welcome Chris Mattmann as Nutch committer |
Thu, 23 Nov, 19:28 |
| Sami Siren |
Re: [jira] Closed: (NUTCH-406) Metadata tries to write null values |
Thu, 23 Nov, 20:01 |
| Sami Siren (JIRA) |
[jira] Commented: (NUTCH-251) Administration GUI |
Thu, 23 Nov, 20:09 |
| kauu |
Re: Question on adaptive re-fetch plugin |
Fri, 24 Nov, 01:38 |
| Piotr Kosiorowski |
Re: 0.7.3 version |
Fri, 24 Nov, 07:29 |
| Andrzej Bialecki |
Re: [jira] Closed: (NUTCH-406) Metadata tries to write null values |
Fri, 24 Nov, 07:54 |
| Thorsten Scherler (JIRA) |
[jira] Created: (NUTCH-407) Make Nutch crawling parent directories for file protocol configurable |
Fri, 24 Nov, 13:24 |
| Thorsten Scherler (JIRA) |
[jira] Updated: (NUTCH-407) Make Nutch crawling parent directories for file protocol configurable |
Fri, 24 Nov, 13:34 |
| Chris A. Mattmann (JIRA) |
[jira] Assigned: (NUTCH-390) Javadoc warnings |
Fri, 24 Nov, 18:28 |
| Chris A. Mattmann (JIRA) |
[jira] Assigned: (NUTCH-185) XMLParser is configurable xml parser plugin. |
Fri, 24 Nov, 18:30 |
| Andrzej Bialecki (JIRA) |
[jira] Updated: (NUTCH-339) Refactor nutch to allow fetcher improvements |
Fri, 24 Nov, 18:55 |
| Andrzej Bialecki (JIRA) |
[jira] Assigned: (NUTCH-339) Refactor nutch to allow fetcher improvements |
Fri, 24 Nov, 19:06 |
| Sami Siren (JIRA) |
[jira] Commented: (NUTCH-339) Refactor nutch to allow fetcher improvements |
Fri, 24 Nov, 21:52 |
| nutch.newbie (JIRA) |
[jira] Commented: (NUTCH-390) Javadoc warnings |
Sat, 25 Nov, 03:29 |
| nutch.newbie (JIRA) |
[jira] Created: (NUTCH-408) Plugin development documentation |
Sat, 25 Nov, 03:45 |
| Andrzej Bialecki (JIRA) |
[jira] Updated: (NUTCH-339) Refactor nutch to allow fetcher improvements |
Sat, 25 Nov, 09:42 |
| Stefan Groschupf (JIRA) |
[jira] Updated: (NUTCH-273) When a page is redirected, the original url is NOT updated. |
Sat, 25 Nov, 10:40 |
| Armel Nene (JIRA) |
[jira] Commented: (NUTCH-185) XMLParser is configurable xml parser plugin. |
Sat, 25 Nov, 13:51 |
| Armel T. Nene |
RE: [jira] Created: (NUTCH-408) Plugin development documentation |
Sat, 25 Nov, 14:32 |
| Stefan Groschupf |
Re: [jira] Created: (NUTCH-408) Plugin development documentation |
Sat, 25 Nov, 19:43 |
| nutch.newbie (JIRA) |
[jira] Commented: (NUTCH-408) Plugin development documentation |
Sat, 25 Nov, 23:04 |
| Doug Cook (JIRA) |
[jira] Created: (NUTCH-409) Add "short circuit" notion to filters to speedup mixed site/subsite crawling |
Sun, 26 Nov, 00:18 |
| Doug Cook (JIRA) |
[jira] Updated: (NUTCH-409) Add "short circuit" notion to filters to speedup mixed site/subsite crawling |
Sun, 26 Nov, 00:20 |
| Doug Cook |
Re: More fetcher speed increases |
Sun, 26 Nov, 00:20 |
| Doug Cook (JIRA) |
[jira] Commented: (NUTCH-409) Add "short circuit" notion to filters to speedup mixed site/subsite crawling |
Sun, 26 Nov, 01:03 |
| sanjeev |
implement thai lanaguage analyzer during nutch crawl process |
Mon, 27 Nov, 04:46 |
| sanjeev |
implement thai lanaguage analyzer during nutch crawl process |
Mon, 27 Nov, 04:47 |
| Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-407) Make Nutch crawling parent directories for file protocol configurable |
Mon, 27 Nov, 08:42 |
| Thorsten Scherler (JIRA) |
[jira] Commented: (NUTCH-407) Make Nutch crawling parent directories for file protocol configurable |
Mon, 27 Nov, 09:16 |
| Andrzej Bialecki (JIRA) |
[jira] Closed: (NUTCH-407) Make Nutch crawling parent directories for file protocol configurable |
Mon, 27 Nov, 09:40 |
| Doğacan Güney (JIRA) |
[jira] Commented: (NUTCH-92) DistributedSearch incorrectly scores results |
Mon, 27 Nov, 19:24 |
| Doğacan Güney (JIRA) |
[jira] Updated: (NUTCH-92) DistributedSearch incorrectly scores results |
Mon, 27 Nov, 19:24 |