Mailing list archives: November 2006

Site index · List index
Message list« Previous · 1 · 2 · 3 · Next »Thread · Author · Date
Gavino Marras Nutch crawl a Application Server Authentication Tue, 21 Nov, 08:57
Gavino Marras Nutch sessions cookies https Tue, 21 Nov, 18:00
Javier P. L. Modifiying Nutch Indexer Tue, 07 Nov, 10:23
Javier P. L. Re: Modifiying Nutch Indexer Thu, 09 Nov, 10:29
Javier P. L. Re: Last-modified http field Mon, 13 Nov, 15:14
Javier Parapar Lopez Last-modified http field Mon, 13 Nov, 12:24
Jayant Kumar Gandhi (JIRA) [jira] Commented: (NUTCH-185) XMLParser is configurable xml parser plugin. Sun, 12 Nov, 07:36
Johannes Zillmann (JIRA) [jira] Commented: (NUTCH-273) When a page is redirected, the original url is NOT updated. Sun, 19 Nov, 16:13
Peter Landolt Brochure for Nutch Thu, 30 Nov, 16:29
Piotr Kosiorowski Re: why can't build in the Linux with ant Sat, 11 Nov, 17:08
Piotr Kosiorowski Re: How to start working with MapReduce? Sat, 11 Nov, 17:10
Piotr Kosiorowski 0.7.3 version Thu, 16 Nov, 21:09
Piotr Kosiorowski Re: 0.7.3 version Fri, 24 Nov, 07:29
Rida Benjelloun (JIRA) [jira] Commented: (NUTCH-185) XMLParser is configurable xml parser plugin. Mon, 20 Nov, 22:16
Rod Taylor (JIRA) [jira] Created: (NUTCH-401) Hardcoded /tmp directory in SegmentReader Mon, 13 Nov, 19:35
Sami Siren Re: [jira] Resolved: (NUTCH-395) Increase fetching speed Tue, 14 Nov, 14:55
Sami Siren Re: Errors in RegexURLFilter Mon, 20 Nov, 16:38
Sami Siren Re: What's the status of Nutch-GUI? Mon, 20 Nov, 17:24
Sami Siren Re: What's the status of Nutch-GUI? Tue, 21 Nov, 20:03
Sami Siren Re: What's the status of Nutch-GUI? Wed, 22 Nov, 04:29
Sami Siren Re: [jira] Commented: (NUTCH-395) Increase fetching speed Wed, 22 Nov, 18:20
Sami Siren Re: [jira] Closed: (NUTCH-406) Metadata tries to write null values Thu, 23 Nov, 17:45
Sami Siren Re: What's the status of Nutch-GUI? Thu, 23 Nov, 19:28
Sami Siren Re: [jira] Closed: (NUTCH-406) Metadata tries to write null values Thu, 23 Nov, 20:01
Sami Siren (JIRA) [jira] Commented: (NUTCH-395) Increase fetching speed Fri, 10 Nov, 16:44
Sami Siren (JIRA) [jira] Updated: (NUTCH-395) Increase fetching speed Sat, 11 Nov, 08:57
Sami Siren (JIRA) [jira] Updated: (NUTCH-395) Increase fetching speed Sat, 11 Nov, 08:57
Sami Siren (JIRA) [jira] Commented: (NUTCH-398) map-reduce very slow when crawling on single server Sat, 11 Nov, 09:05
Sami Siren (JIRA) [jira] Created: (NUTCH-399) Change CommandRunner to use concurrent api from jdk Sat, 11 Nov, 15:24
Sami Siren (JIRA) [jira] Resolved: (NUTCH-399) Change CommandRunner to use concurrent api from jdk Sat, 11 Nov, 15:29
Sami Siren (JIRA) [jira] Created: (NUTCH-400) Update & add missing license headers Sun, 12 Nov, 00:11
Sami Siren (JIRA) [jira] Updated: (NUTCH-400) Update & add missing license headers Sun, 12 Nov, 00:11
Sami Siren (JIRA) [jira] Updated: (NUTCH-395) Increase fetching speed Sun, 12 Nov, 20:32
Sami Siren (JIRA) [jira] Commented: (NUTCH-400) Update & add missing license headers Mon, 13 Nov, 18:38
Sami Siren (JIRA) [jira] Resolved: (NUTCH-395) Increase fetching speed Mon, 13 Nov, 19:50
Sami Siren (JIRA) [jira] Commented: (NUTCH-401) Hardcoded /tmp directory in SegmentReader Mon, 13 Nov, 20:31
Sami Siren (JIRA) [jira] Created: (NUTCH-403) Make URL filtering optional in Generator Sat, 18 Nov, 21:36
Sami Siren (JIRA) [jira] Updated: (NUTCH-403) Make URL filtering optional in Generator Sat, 18 Nov, 21:40
Sami Siren (JIRA) [jira] Updated: (NUTCH-403) Make URL filtering optional in Generator Sat, 18 Nov, 21:40
Sami Siren (JIRA) [jira] Resolved: (NUTCH-388) nutch-default.xml has outdated example for urlfilter.order Sat, 18 Nov, 21:59
Sami Siren (JIRA) [jira] Created: (NUTCH-404) Fix LinkDB Usage - implementation mismatch Sun, 19 Nov, 12:54
Sami Siren (JIRA) [jira] Resolved: (NUTCH-404) Fix LinkDB Usage - implementation mismatch Sun, 19 Nov, 12:58
Sami Siren (JIRA) [jira] Resolved: (NUTCH-403) Make URL filtering optional in Generator Sun, 19 Nov, 18:51
Sami Siren (JIRA) [jira] Commented: (NUTCH-251) Administration GUI Tue, 21 Nov, 05:15
Sami Siren (JIRA) [jira] Created: (NUTCH-405) Content object is not properly initialized in map method of ParseSegment Tue, 21 Nov, 17:18
Sami Siren (JIRA) [jira] Resolved: (NUTCH-405) Content object is not properly initialized in map method of ParseSegment Tue, 21 Nov, 17:21
Sami Siren (JIRA) [jira] Closed: (NUTCH-380) Nutch does not run/build against Hadoop 0.6 Tue, 21 Nov, 17:33
Sami Siren (JIRA) [jira] Closed: (NUTCH-349) Port Nutch to use Hadoop Text instead of UTF8 Tue, 21 Nov, 17:39
Sami Siren (JIRA) [jira] Resolved: (NUTCH-362) Remove parse-text from unsupported filetypes in parse-plugins.xml Tue, 21 Nov, 17:53
Sami Siren (JIRA) [jira] Resolved: (NUTCH-305) Update crawl and url filter lists to exclude jpeg|JPEG|bmp|BMP Tue, 21 Nov, 18:41
Sami Siren (JIRA) [jira] Commented: (NUTCH-251) Administration GUI Thu, 23 Nov, 20:09
Sami Siren (JIRA) [jira] Commented: (NUTCH-339) Refactor nutch to allow fetcher improvements Fri, 24 Nov, 21:52
Sami Siren (JIRA) [jira] Commented: (NUTCH-339) Refactor nutch to allow fetcher improvements Tue, 28 Nov, 05:16
Sami Siren (JIRA) [jira] Commented: (NUTCH-339) Refactor nutch to allow fetcher improvements Tue, 28 Nov, 15:54
Sami Siren (JIRA) [jira] Commented: (NUTCH-339) Refactor nutch to allow fetcher improvements Tue, 28 Nov, 17:53
Scott Green Question on adaptive re-fetch plugin Thu, 23 Nov, 06:37
Scott Green Re: What's the status of Nutch-GUI? Thu, 23 Nov, 08:16
Scott Green Multi-NutchBean Thu, 30 Nov, 05:34
Sean Dean (JIRA) [jira] Commented: (NUTCH-233) wrong regular expression hang reduce process for ever Tue, 28 Nov, 13:37
Stanislaw Osinski (JIRA) [jira] Commented: (NUTCH-397) porting clustering-carrot2 plugin to carrot2 v2.0 Sun, 12 Nov, 13:47
Stefan Groschupf Re: Fetcher freezes Fri, 03 Nov, 14:56
Stefan Groschupf Re: What's the status of Nutch-GUI? Tue, 21 Nov, 20:08
Stefan Groschupf Re: What's the status of Nutch-GUI? Wed, 22 Nov, 05:12
Stefan Groschupf Re: [jira] Created: (NUTCH-408) Plugin development documentation Sat, 25 Nov, 19:43
Stefan Groschupf (JIRA) [jira] Updated: (NUTCH-273) When a page is redirected, the original url is NOT updated. Sat, 25 Nov, 10:40
Stefan Neufeind Re: Brochure for Nutch Thu, 30 Nov, 18:04
TKDD Can I rewrite org.apache.nutch.parse.msword.extractText(InputStream input) like this Mon, 20 Nov, 03:00
Teruhiko Kurosaka RE: implement thai lanaguage analyzer in nutch Wed, 08 Nov, 19:16
Teruhiko Kurosaka RE: implement thai lanaguage analyzer in nutch Fri, 10 Nov, 17:57
Thorsten Scherler (JIRA) [jira] Created: (NUTCH-407) Make Nutch crawling parent directories for file protocol configurable Fri, 24 Nov, 13:24
Thorsten Scherler (JIRA) [jira] Updated: (NUTCH-407) Make Nutch crawling parent directories for file protocol configurable Fri, 24 Nov, 13:34
Thorsten Scherler (JIRA) [jira] Commented: (NUTCH-407) Make Nutch crawling parent directories for file protocol configurable Mon, 27 Nov, 09:16
Uros Gruber (JIRA) [jira] Commented: (NUTCH-398) map-reduce very slow when crawling on single server Wed, 08 Nov, 06:34
Uros Gruber (JIRA) [jira] Commented: (NUTCH-289) CrawlDatum should store IP address Thu, 16 Nov, 08:59
Zaheed Haque Re: What's the status of Nutch-GUI? Thu, 23 Nov, 07:20
Zaheed Haque Re: [jira] Updated: (NUTCH-251) Administration GUI Thu, 23 Nov, 14:54
an...@orbita1.ru deep limitation Mon, 06 Nov, 08:31
hzhong Nutch and Lucene Fri, 10 Nov, 08:08
juwen (JIRA) [jira] Commented: (NUTCH-36) Chinese in Nutch Tue, 07 Nov, 06:23
kauu Re: implement thai lanaguage analyzer in nutch Tue, 07 Nov, 11:59
kauu why can't build in the Linux with ant Thu, 09 Nov, 02:52
kauu How to start working with MapReduce? Thu, 09 Nov, 08:46
kauu Re: How to start working with MapReduce? Thu, 09 Nov, 08:49
kauu Re: Question on adaptive re-fetch plugin Fri, 24 Nov, 01:38
nutch.newbie (JIRA) [jira] Commented: (NUTCH-398) map-reduce very slow when crawling on single server Wed, 08 Nov, 05:20
nutch.newbie (JIRA) [jira] Commented: (NUTCH-261) Multi Language Support Thu, 16 Nov, 08:59
nutch.newbie (JIRA) [jira] Commented: (NUTCH-251) Administration GUI Mon, 20 Nov, 21:14
nutch.newbie (JIRA) [jira] Commented: (NUTCH-251) Administration GUI Tue, 21 Nov, 20:50
nutch.newbie (JIRA) [jira] Commented: (NUTCH-390) Javadoc warnings Sat, 25 Nov, 03:29
nutch.newbie (JIRA) [jira] Created: (NUTCH-408) Plugin development documentation Sat, 25 Nov, 03:45
nutch.newbie (JIRA) [jira] Commented: (NUTCH-408) Plugin development documentation Sat, 25 Nov, 23:04
ogjunk-nu...@yahoo.com Re: implement thai lanaguage analyzer in nutch Wed, 08 Nov, 22:43
sanjeev implement thai lanaguage analyzer in nutch Tue, 07 Nov, 08:06
sanjeev Re: implement thai lanaguage analyzer in nutch Wed, 08 Nov, 03:57
sanjeev Re: implement thai lanaguage analyzer in nutch Wed, 08 Nov, 04:02
sanjeev Re: implement thai lanaguage analyzer in nutch Wed, 08 Nov, 07:25
sanjeev implement thai language in nutch Wed, 08 Nov, 10:24
sanjeev Re: implement thai lanaguage analyzer in nutch Wed, 08 Nov, 10:46
sanjeev Re: implement thai lanaguage analyzer in nutch Thu, 09 Nov, 03:28
sanjeev Re: implement thai lanaguage analyzer in nutch Thu, 09 Nov, 05:48
Message list« Previous · 1 · 2 · 3 · Next »Thread · Author · Date
Box list
Dec 200932
Nov 2009154
Oct 200988
Sep 200932
Aug 200982
Jul 200977
Jun 200994
May 2009104
Apr 200985
Mar 2009255
Feb 2009250
Jan 2009197
Dec 2008130
Nov 2008117
Oct 200884
Sep 2008101
Aug 200858
Jul 200832
Jun 200893
May 200857
Apr 200878
Mar 2008152
Feb 2008189
Jan 2008151
Dec 200768
Nov 2007186
Oct 2007162
Sep 2007189
Aug 2007135
Jul 2007283
Jun 2007241
May 2007188
Apr 2007144
Mar 2007282
Feb 2007241
Jan 2007266
Dec 2006103
Nov 2006222
Oct 2006187
Sep 2006166
Aug 2006281
Jul 2006180
Jun 2006262
May 2006282
Apr 2006247
Mar 2006304
Feb 2006349
Jan 2006558
Dec 2005412
Nov 2005288
Oct 2005313
Sep 2005339
Aug 2005426
Jul 2005228
Jun 2005178
May 2005140
Apr 2005497
Mar 2005398
Feb 200510