Mailing list archives: July 2005

Site index · List index
Message list1 · 2 · 3 · Next »Thread · Author · Date
yours...@freemail.hu Bug in index-more plugin? Fri, 01 Jul, 08:59
Stefan Groschupf Re: Bug in index-more plugin? Fri, 01 Jul, 09:07
yours...@freemail.hu Re: Bug in index-more plugin? Fri, 01 Jul, 09:42
Stefan Groschupf Re: Bug in index-more plugin? Fri, 01 Jul, 09:48
yours...@freemail.hu Re: Bug in index-more plugin? Fri, 01 Jul, 09:55
Lutisch谩n Ferenc (JIRA) [jira] Created: (NUTCH-65) index-more plugin can't parse large set of modification-date Fri, 01 Jul, 09:55
Jerome Charron (JIRA) [jira] Commented: (NUTCH-65) index-more plugin can't parse large set of modification-date Fri, 01 Jul, 10:50
Andrzej Bialecki (JIRA) [jira] Closed: (NUTCH-60) Bad language identifier plugin performances Sat, 02 Jul, 19:32
Andrzej Bialecki (JIRA) [jira] Closed: (NUTCH-57) text and html files unrecognized Sat, 02 Jul, 19:43
Andrzej Bialecki (JIRA) [jira] Closed: (NUTCH-27) Patch to get a status of running Fetcher Sat, 02 Jul, 19:54
Andrzej Bialecki (JIRA) [jira] Closed: (NUTCH-32) Nutch Webapp could only be deployed on root namespace Sat, 02 Jul, 20:26
CC Chaman (JIRA) [jira] Created: (NUTCH-66) Cookies are not being read properly Sat, 02 Jul, 20:37
Andrzej Bialecki (JIRA) [jira] Closed: (NUTCH-56) Crawling sites with 403 Forbidden robots.txt Sat, 02 Jul, 20:48
Ilia S. Yatsenko both html parser have bug with javascript Sun, 03 Jul, 15:05
Ilia S. Yatsenko RE: both html parser have bug with javascript Sun, 03 Jul, 16:09
Chirag Chaman RE: both html parser have bug with javascript Mon, 04 Jul, 00:17
Nick Lothian (JIRA) [jira] Commented: (NUTCH-65) index-more plugin can't parse large set of modification-date Mon, 04 Jul, 01:57
Nutch开发邮件 Re: Why Crawl failed to fetch so many pages? Mon, 04 Jul, 03:18
Ilia S. Yatsenko RE: both html parser have bug with javascript Mon, 04 Jul, 03:33
zhangjin (JIRA) [jira] Created: (NUTCH-67) I want crawl the websites including news.yahoo.com,game.yahoo.com,blog.yahoo.com,etc! Mon, 04 Jul, 03:42
Ilia S. Yatsenko RE: both html parser have bug with javascript Mon, 04 Jul, 03:43
Ilia S. Yatsenko RE: [jira] Created: (NUTCH-67) I want crawl the websites including news.yahoo.com,game.yahoo.com,blog.yahoo.com,etc! Mon, 04 Jul, 03:55
Jerome Charron (JIRA) [jira] Commented: (NUTCH-65) index-more plugin can't parse large set of modification-date Mon, 04 Jul, 09:22
Ilia S. Yatsenko hits.getTotal() Mon, 04 Jul, 09:54
Andrzej Bialecki Re: both html parser have bug with javascript Mon, 04 Jul, 10:04
Jakob Heidebrecht =?ISO-8859-1?Q?Problems_with_Fetcher_threads=3F?= Mon, 04 Jul, 11:36
Chirag Chaman RE: both html parser have bug with javascript Mon, 04 Jul, 13:14
Lutisch谩n Ferenc (JIRA) [jira] Commented: (NUTCH-65) index-more plugin can't parse large set of modification-date Mon, 04 Jul, 13:30
Andrzej Bialecki Re: both html parser have bug with javascript Mon, 04 Jul, 15:54
Nutch开发邮件 Re: [jira] Created: (NUTCH-67) I want crawl the websites including news.yahoo.com,game.yahoo.com,blog.yahoo.com,etc! Mon, 04 Jul, 16:00
Ilia S. Yatsenko RE: [jira] Created: (NUTCH-67) I want crawl the websites including news.yahoo.com,game.yahoo.com,blog.yahoo.com,etc! Mon, 04 Jul, 16:08
Andrzej Bialecki (JIRA) [jira] Commented: (NUTCH-66) Cookies are not being read properly Mon, 04 Jul, 16:57
Andrzej Bialecki (JIRA) [jira] Updated: (NUTCH-68) A tool to generate arbitrary fetchlists Tue, 05 Jul, 08:07
Andrzej Bialecki (JIRA) [jira] Created: (NUTCH-68) A tool to generate arbitrary fetchlists Tue, 05 Jul, 08:07
Fredrik Andersson Iterating spidered pages Tue, 05 Jul, 08:58
Andrzej Bialecki Re: LanguageIdentifier refactoring Tue, 05 Jul, 13:02
J閞鬽e Charron Re: LanguageIdentifier refactoring Tue, 05 Jul, 13:52
Andy Liu Re: Iterating spidered pages Tue, 05 Jul, 15:19
Andrzej Bialecki Re: LanguageIdentifier refactoring Tue, 05 Jul, 17:33
Andrzej Bialecki Re: Iterating spidered pages Tue, 05 Jul, 17:38
Chirag Chaman RE: both html parser have bug with javascript Tue, 05 Jul, 20:38
Chirag Chaman RE: [jira] Commented: (NUTCH-66) Cookies are not being read properly Tue, 05 Jul, 20:38
Chirag Chaman RE: [jira] Commented: (NUTCH-66) Cookies are not being read properly Tue, 05 Jul, 20:38
Chirag Chaman Bad URLs causing SEVERE exception Tue, 05 Jul, 20:47
Chirag Chaman Bad URLs causing SEVERE exception Tue, 05 Jul, 20:52
J閞鬽e Charron Re: LanguageIdentifier refactoring Thu, 07 Jul, 13:38
Doug Cutting Re: hits.getTotal() Thu, 07 Jul, 18:20
Doug Cutting Re: Problems with Fetcher threads? Thu, 07 Jul, 18:24
Emilijan Mirceski max fetcher threads per host, buggy behaviour. Thu, 07 Jul, 22:52
Andrzej Bialecki (JIRA) [jira] Closed: (NUTCH-58) NullPointerException while coping NDFS file Fri, 08 Jul, 10:38
Michael Nebel nutch server performance Fri, 08 Jul, 12:55
Jay Pound Re: [jira] Closed: (NUTCH-58) NullPointerException while coping NDFS file Fri, 08 Jul, 13:38
Matthias Jaekle (JIRA) [jira] Created: (NUTCH-69) fetcher.threads.per.host ignored Fri, 08 Jul, 14:28
Andrzej Bialecki (JIRA) [jira] Resolved: (NUTCH-69) fetcher.threads.per.host ignored Fri, 08 Jul, 14:39
Andrzej Bialecki (JIRA) [jira] Closed: (NUTCH-63) the distributed search client generate too much logging statements Fri, 08 Jul, 15:45
Stefan Groschupf Re: [jira] Closed: (NUTCH-63) the distributed search client generate too much logging statements Fri, 08 Jul, 16:58
Bernhard Fastenrath ESP - Ethics search protocol for internet search engines. Sat, 09 Jul, 12:22
Erik Hatcher Re: ESP - Ethics search protocol for internet search engines. Sun, 10 Jul, 10:57
Bernhard Fastenrath Re: ESP - Ethics search protocol for internet search engines. Sun, 10 Jul, 13:30
Erik Hatcher Re: ESP - Ethics search protocol for internet search engines. Sun, 10 Jul, 15:26
Bernhard Fastenrath Re: ESP - Ethics search protocol for internet search engines. Sun, 10 Jul, 19:58
Erik Hatcher Re: [Nutch-dev] Re: ESP - Ethics search protocol for internet search engines. Mon, 11 Jul, 00:43
Lutisch谩n Ferenc (JIRA) [jira] Created: (NUTCH-70) duplicate pages - virtual hosts in db. Mon, 11 Jul, 09:13
Diego Basch Possible race condition while loading plugins Mon, 11 Jul, 13:18
Nils Hoeller Website Visualization Questions Mon, 11 Jul, 14:36
Fredrik Andersson Re: Website Visualization Questions Mon, 11 Jul, 14:50
Nils H鰈ler Re: Website Visualization Questions Mon, 11 Jul, 15:26
Piotr Kosiorowski Re: [jira] Created: (NUTCH-70) duplicate pages - virtual hosts in db. Mon, 11 Jul, 18:12
Fredrik Andersson Re: Website Visualization Questions Mon, 11 Jul, 20:33
Bin Shi hi all Mon, 11 Jul, 22:56
Jack Tang Re: hi all Tue, 12 Jul, 01:12
Orkunt Sabuncu Fwd: links in db and pagerank calculation Tue, 12 Jul, 11:43
yours...@freemail.hu Re: [jira] Created: (NUTCH-70) duplicate pages - virtual hosts in db. Tue, 12 Jul, 12:18
Christophe Noel (JIRA) [jira] Created: (NUTCH-71) Search web page doesn't not focus on query input Tue, 12 Jul, 12:19
Christophe Noel (JIRA) [jira] Updated: (NUTCH-71) Search web page doesn't not focus on query input Tue, 12 Jul, 12:19
Christophe Noel (JIRA) [jira] Commented: (NUTCH-71) Search web page doesn't not focus on query input Tue, 12 Jul, 12:30
yours...@freemail.hu Re: [Nutch-dev] Exception "Could not obtain new output block" Wed, 13 Jul, 06:52
Ami...@invitation.sms.ac Amin GH's invitation Thu, 14 Jul, 13:57
Andrzej Bialecki (JIRA) [jira] Closed: (NUTCH-46) the NDFS problem(Could not obtain new output block for file) Thu, 14 Jul, 21:01
Jack Tang NutchAnalysis and CJK Fri, 15 Jul, 02:49
Transbuerg Tian Re: NutchAnalysis and CJK Fri, 15 Jul, 04:34
Christophe Noel (JIRA) [jira] Created: (NUTCH-72) Query basic filter with correction feature Fri, 15 Jul, 11:27
Christophe Noel (JIRA) [jira] Updated: (NUTCH-72) Query basic filter with correction feature Fri, 15 Jul, 12:00
Christophe Noel (JIRA) [jira] Created: (NUTCH-73) A page for CSV results Fri, 15 Jul, 12:11
Christophe Noel (JIRA) [jira] Updated: (NUTCH-73) A page for CSV results Fri, 15 Jul, 12:11
Feng \(Michael\) Ji a silly question Sat, 16 Jul, 03:27
Fredrik Andersson Re: a silly question Sat, 16 Jul, 07:46
Feng \(Michael\) Ji Re: a silly question Sat, 16 Jul, 12:54
Howie Wang Re: a silly question Sat, 16 Jul, 15:47
Howie Wang Re: a silly question Sat, 16 Jul, 15:55
Feng \(Michael\) Ji Re: a silly question Sat, 16 Jul, 16:53
Howie Wang Re: a silly question Sat, 16 Jul, 17:08
yoursoft Re: [Nutch-dev] Re: a silly question Sat, 16 Jul, 18:08
Feng \(Michael\) Ji Re: [Nutch-dev] Re: a silly question Sat, 16 Jul, 19:12
Feng \(Michael\) Ji Nutch Compiling Sat, 16 Jul, 19:22
Piotr Kosiorowski Re: [Nutch-dev] Re: a silly question Sat, 16 Jul, 20:09
Piotr Kosiorowski Re: Nutch Compiling Sat, 16 Jul, 20:10
Feng \(Michael\) Ji Re: [Nutch-dev] Re: a silly question Sat, 16 Jul, 21:06
Jack Tang Nutch and cluster search result Sun, 17 Jul, 06:19
yoursoft indexed records in segments Sun, 17 Jul, 07:09
Message list1 · 2 · 3 · Next »Thread · Author · Date
Box list
Dec 200933
Nov 2009154
Oct 200988
Sep 200932
Aug 200982
Jul 200977
Jun 200994
May 2009104
Apr 200985
Mar 2009255
Feb 2009250
Jan 2009197
Dec 2008130
Nov 2008117
Oct 200884
Sep 2008101
Aug 200858
Jul 200832
Jun 200893
May 200857
Apr 200878
Mar 2008152
Feb 2008189
Jan 2008151
Dec 200768
Nov 2007186
Oct 2007162
Sep 2007189
Aug 2007135
Jul 2007283
Jun 2007241
May 2007188
Apr 2007144
Mar 2007282
Feb 2007241
Jan 2007266
Dec 2006103
Nov 2006222
Oct 2006187
Sep 2006166
Aug 2006281
Jul 2006180
Jun 2006262
May 2006282
Apr 2006247
Mar 2006304
Feb 2006349
Jan 2006558
Dec 2005412
Nov 2005288
Oct 2005313
Sep 2005339
Aug 2005426
Jul 2005228
Jun 2005178
May 2005140
Apr 2005497
Mar 2005398
Feb 200510