Mailing list archives: November 2006

Site index · List index
Message list« Previous · 1 · 2 · 3 · Next »Thread · Author · Date
Johannes Zillmann (JIRA) [jira] Commented: (NUTCH-273) When a page is redirected, the original url is NOT updated. Sun, 19 Nov, 16:13
Sami Siren (JIRA) [jira] Resolved: (NUTCH-403) Make URL filtering optional in Generator Sun, 19 Nov, 18:51
TKDD Can I rewrite org.apache.nutch.parse.msword.extractText(InputStream input) like this Mon, 20 Nov, 03:00
scott green Errors in RegexURLFilter Mon, 20 Nov, 15:28
Sami Siren Re: Errors in RegexURLFilter Mon, 20 Nov, 16:38
Doğacan Güney (JIRA) [jira] Updated: (NUTCH-92) DistributedSearch incorrectly scores results Mon, 20 Nov, 17:00
scott green What's the status of Nutch-GUI? Mon, 20 Nov, 17:12
scott green Re: Errors in RegexURLFilter Mon, 20 Nov, 17:12
Sami Siren Re: What's the status of Nutch-GUI? Mon, 20 Nov, 17:24
scott green Re: What's the status of Nutch-GUI? Mon, 20 Nov, 17:27
Chris Mattmann Re: What's the status of Nutch-GUI? Mon, 20 Nov, 18:39
nutch.newbie (JIRA) [jira] Commented: (NUTCH-251) Administration GUI Mon, 20 Nov, 21:14
Armel T. Nene RE: What's the status of Nutch-GUI? Mon, 20 Nov, 21:44
Rida Benjelloun (JIRA) [jira] Commented: (NUTCH-185) XMLParser is configurable xml parser plugin. Mon, 20 Nov, 22:16
Armel T. Nene RE: [jira] Commented: (NUTCH-185) XMLParser is configurable xml parser plugin. Mon, 20 Nov, 22:26
Chris Mattmann Re: What's the status of Nutch-GUI? Mon, 20 Nov, 23:29
Armel T. Nene RE: What's the status of Nutch-GUI? Tue, 21 Nov, 00:04
Sami Siren (JIRA) [jira] Commented: (NUTCH-251) Administration GUI Tue, 21 Nov, 05:15
Gavino Marras Nutch HTTPS & Sessions Tue, 21 Nov, 08:24
Gavino Marras Nutch crawl a Application Server Authentication Tue, 21 Nov, 08:57
Enis Soztutar Re: What's the status of Nutch-GUI? Tue, 21 Nov, 12:17
Sami Siren (JIRA) [jira] Created: (NUTCH-405) Content object is not properly initialized in map method of ParseSegment Tue, 21 Nov, 17:18
Sami Siren (JIRA) [jira] Resolved: (NUTCH-405) Content object is not properly initialized in map method of ParseSegment Tue, 21 Nov, 17:21
Sami Siren (JIRA) [jira] Closed: (NUTCH-380) Nutch does not run/build against Hadoop 0.6 Tue, 21 Nov, 17:33
Sami Siren (JIRA) [jira] Closed: (NUTCH-349) Port Nutch to use Hadoop Text instead of UTF8 Tue, 21 Nov, 17:39
Sami Siren (JIRA) [jira] Resolved: (NUTCH-362) Remove parse-text from unsupported filetypes in parse-plugins.xml Tue, 21 Nov, 17:53
Gavino Marras Nutch sessions cookies https Tue, 21 Nov, 18:00
scott green Re: What's the status of Nutch-GUI? Tue, 21 Nov, 18:34
Sami Siren (JIRA) [jira] Resolved: (NUTCH-305) Update crawl and url filter lists to exclude jpeg|JPEG|bmp|BMP Tue, 21 Nov, 18:41
Sami Siren Re: What's the status of Nutch-GUI? Tue, 21 Nov, 20:03
Stefan Groschupf Re: What's the status of Nutch-GUI? Tue, 21 Nov, 20:08
nutch.newbie (JIRA) [jira] Commented: (NUTCH-251) Administration GUI Tue, 21 Nov, 20:50
Armel T. Nene Nutch folder configuration Tue, 21 Nov, 21:55
Armel T. Nene RE: Nutch folder configuration Tue, 21 Nov, 22:45
scott green Re: What's the status of Nutch-GUI? Wed, 22 Nov, 02:22
scott green Re: What's the status of Nutch-GUI? Wed, 22 Nov, 02:26
Sami Siren Re: What's the status of Nutch-GUI? Wed, 22 Nov, 04:29
scott green Re: More fetcher speed increases Wed, 22 Nov, 04:40
Stefan Groschupf Re: What's the status of Nutch-GUI? Wed, 22 Nov, 05:12
AJ Chen Re: [jira] Commented: (NUTCH-395) Increase fetching speed Wed, 22 Nov, 17:09
Armel T. Nene Nutch - Hadoop error Wed, 22 Nov, 17:49
Sami Siren Re: [jira] Commented: (NUTCH-395) Increase fetching speed Wed, 22 Nov, 18:20
AJ Chen Re: [jira] Commented: (NUTCH-395) Increase fetching speed Wed, 22 Nov, 23:14
Scott Green Question on adaptive re-fetch plugin Thu, 23 Nov, 06:37
Zaheed Haque Re: What's the status of Nutch-GUI? Thu, 23 Nov, 07:20
Scott Green Re: What's the status of Nutch-GUI? Thu, 23 Nov, 08:16
Doğacan Güney (JIRA) [jira] Commented: (NUTCH-331) Fetcher incorrectly reports task progress to tasktracker resulting in skipped URLs Thu, 23 Nov, 10:27
Andrzej Bialecki (JIRA) [jira] Commented: (NUTCH-331) Fetcher incorrectly reports task progress to tasktracker resulting in skipped URLs Thu, 23 Nov, 10:56
Andrzej Bialecki (JIRA) [jira] Closed: (NUTCH-331) Fetcher incorrectly reports task progress to tasktracker resulting in skipped URLs Thu, 23 Nov, 10:58
Andrzej Bialecki Welcome Chris Mattmann as Nutch committer Thu, 23 Nov, 12:10
Doğacan Güney (JIRA) [jira] Created: (NUTCH-406) Metadata tries to write null values Thu, 23 Nov, 13:27
Doğacan Güney (JIRA) [jira] Updated: (NUTCH-406) Metadata tries to write null values Thu, 23 Nov, 13:29
Enis Soztutar (JIRA) [jira] Updated: (NUTCH-251) Administration GUI Thu, 23 Nov, 14:35
Zaheed Haque Re: [jira] Updated: (NUTCH-251) Administration GUI Thu, 23 Nov, 14:54
Chris A. Mattmann (JIRA) [jira] Updated: (NUTCH-406) Metadata tries to write null values Thu, 23 Nov, 15:45
Chris A. Mattmann (JIRA) [jira] Work started: (NUTCH-406) Metadata tries to write null values Thu, 23 Nov, 15:45
Andrzej Bialecki (JIRA) [jira] Commented: (NUTCH-406) Metadata tries to write null values Thu, 23 Nov, 15:59
Doğacan Güney (JIRA) [jira] Updated: (NUTCH-406) Metadata tries to write null values Thu, 23 Nov, 16:18
Chris A. Mattmann (JIRA) [jira] Commented: (NUTCH-406) Metadata tries to write null values Thu, 23 Nov, 16:26
Andrzej Bialecki (JIRA) [jira] Commented: (NUTCH-406) Metadata tries to write null values Thu, 23 Nov, 16:44
Chris A. Mattmann (JIRA) [jira] Commented: (NUTCH-406) Metadata tries to write null values Thu, 23 Nov, 16:48
Chris A. Mattmann (JIRA) [jira] Commented: (NUTCH-406) Metadata tries to write null values Thu, 23 Nov, 16:50
Chris A. Mattmann (JIRA) [jira] Resolved: (NUTCH-406) Metadata tries to write null values Thu, 23 Nov, 17:18
Chris A. Mattmann (JIRA) [jira] Closed: (NUTCH-406) Metadata tries to write null values Thu, 23 Nov, 17:20
Sami Siren Re: [jira] Closed: (NUTCH-406) Metadata tries to write null values Thu, 23 Nov, 17:45
Chris Mattmann Re: [jira] Closed: (NUTCH-406) Metadata tries to write null values Thu, 23 Nov, 18:08
Sami Siren Re: What's the status of Nutch-GUI? Thu, 23 Nov, 19:28
Chris Mattmann Re: Welcome Chris Mattmann as Nutch committer Thu, 23 Nov, 19:28
Sami Siren Re: [jira] Closed: (NUTCH-406) Metadata tries to write null values Thu, 23 Nov, 20:01
Sami Siren (JIRA) [jira] Commented: (NUTCH-251) Administration GUI Thu, 23 Nov, 20:09
kauu Re: Question on adaptive re-fetch plugin Fri, 24 Nov, 01:38
Piotr Kosiorowski Re: 0.7.3 version Fri, 24 Nov, 07:29
Andrzej Bialecki Re: [jira] Closed: (NUTCH-406) Metadata tries to write null values Fri, 24 Nov, 07:54
Thorsten Scherler (JIRA) [jira] Created: (NUTCH-407) Make Nutch crawling parent directories for file protocol configurable Fri, 24 Nov, 13:24
Thorsten Scherler (JIRA) [jira] Updated: (NUTCH-407) Make Nutch crawling parent directories for file protocol configurable Fri, 24 Nov, 13:34
Chris A. Mattmann (JIRA) [jira] Assigned: (NUTCH-390) Javadoc warnings Fri, 24 Nov, 18:28
Chris A. Mattmann (JIRA) [jira] Assigned: (NUTCH-185) XMLParser is configurable xml parser plugin. Fri, 24 Nov, 18:30
Andrzej Bialecki (JIRA) [jira] Updated: (NUTCH-339) Refactor nutch to allow fetcher improvements Fri, 24 Nov, 18:55
Andrzej Bialecki (JIRA) [jira] Assigned: (NUTCH-339) Refactor nutch to allow fetcher improvements Fri, 24 Nov, 19:06
Sami Siren (JIRA) [jira] Commented: (NUTCH-339) Refactor nutch to allow fetcher improvements Fri, 24 Nov, 21:52
nutch.newbie (JIRA) [jira] Commented: (NUTCH-390) Javadoc warnings Sat, 25 Nov, 03:29
nutch.newbie (JIRA) [jira] Created: (NUTCH-408) Plugin development documentation Sat, 25 Nov, 03:45
Andrzej Bialecki (JIRA) [jira] Updated: (NUTCH-339) Refactor nutch to allow fetcher improvements Sat, 25 Nov, 09:42
Stefan Groschupf (JIRA) [jira] Updated: (NUTCH-273) When a page is redirected, the original url is NOT updated. Sat, 25 Nov, 10:40
Armel Nene (JIRA) [jira] Commented: (NUTCH-185) XMLParser is configurable xml parser plugin. Sat, 25 Nov, 13:51
Armel T. Nene RE: [jira] Created: (NUTCH-408) Plugin development documentation Sat, 25 Nov, 14:32
Stefan Groschupf Re: [jira] Created: (NUTCH-408) Plugin development documentation Sat, 25 Nov, 19:43
nutch.newbie (JIRA) [jira] Commented: (NUTCH-408) Plugin development documentation Sat, 25 Nov, 23:04
Doug Cook (JIRA) [jira] Created: (NUTCH-409) Add "short circuit" notion to filters to speedup mixed site/subsite crawling Sun, 26 Nov, 00:18
Doug Cook (JIRA) [jira] Updated: (NUTCH-409) Add "short circuit" notion to filters to speedup mixed site/subsite crawling Sun, 26 Nov, 00:20
Doug Cook Re: More fetcher speed increases Sun, 26 Nov, 00:20
Doug Cook (JIRA) [jira] Commented: (NUTCH-409) Add "short circuit" notion to filters to speedup mixed site/subsite crawling Sun, 26 Nov, 01:03
sanjeev implement thai lanaguage analyzer during nutch crawl process Mon, 27 Nov, 04:46
sanjeev implement thai lanaguage analyzer during nutch crawl process Mon, 27 Nov, 04:46
sanjeev implement thai lanaguage analyzer during nutch crawl process Mon, 27 Nov, 04:47
Andrzej Bialecki (JIRA) [jira] Commented: (NUTCH-407) Make Nutch crawling parent directories for file protocol configurable Mon, 27 Nov, 08:42
Thorsten Scherler (JIRA) [jira] Commented: (NUTCH-407) Make Nutch crawling parent directories for file protocol configurable Mon, 27 Nov, 09:16
Andrzej Bialecki (JIRA) [jira] Closed: (NUTCH-407) Make Nutch crawling parent directories for file protocol configurable Mon, 27 Nov, 09:40
Dogacan Güney (JIRA) [jira] Commented: (NUTCH-92) DistributedSearch incorrectly scores results Mon, 27 Nov, 19:24
Dogacan Güney (JIRA) [jira] Updated: (NUTCH-92) DistributedSearch incorrectly scores results Mon, 27 Nov, 19:24
Message list« Previous · 1 · 2 · 3 · Next »Thread · Author · Date
Box list
Dec 200933
Nov 2009154
Oct 200988
Sep 200932
Aug 200982
Jul 200977
Jun 200994
May 2009104
Apr 200985
Mar 2009255
Feb 2009250
Jan 2009197
Dec 2008130
Nov 2008117
Oct 200884
Sep 2008101
Aug 200858
Jul 200832
Jun 200893
May 200857
Apr 200878
Mar 2008152
Feb 2008189
Jan 2008151
Dec 200768
Nov 2007186
Oct 2007162
Sep 2007189
Aug 2007135
Jul 2007283
Jun 2007241
May 2007188
Apr 2007144
Mar 2007282
Feb 2007241
Jan 2007266
Dec 2006103
Nov 2006222
Oct 2006187
Sep 2006166
Aug 2006281
Jul 2006180
Jun 2006262
May 2006282
Apr 2006247
Mar 2006304
Feb 2006349
Jan 2006558
Dec 2005412
Nov 2005288
Oct 2005313
Sep 2005339
Aug 2005426
Jul 2005228
Jun 2005178
May 2005140
Apr 2005497
Mar 2005398
Feb 200510