Mailing list archives: November 2007

Site index · List index
Message list« Previous · 1 · 2 · 3 · Next »Thread · Author · Date
payo Re: run the crawl Tue, 13 Nov, 23:24
Doğacan Güney Re: java.io.IOException: Unknown format version:-3 Wed, 14 Nov, 10:53
Josh Attenberg Re: help for a nutch beginner Wed, 14 Nov, 13:59
eyal edri java.net.SocketException: Connection reset when using too many threads Wed, 14 Nov, 14:37
karthik085 Re: Different Analyzers Wed, 14 Nov, 15:11
Annona Keene Higher depth, fewer urls? Wed, 14 Nov, 16:45
Dennis Kubes Re: URI is not absolute... Wed, 14 Nov, 16:57
charlie w results display for languages other than English Wed, 14 Nov, 17:28
payo configuration Nutch Wed, 14 Nov, 22:14
Brehm, Robert P Error when using nutch Wed, 14 Nov, 23:34
Karol Rybak Re: help for a nutch beginner Thu, 15 Nov, 10:19
Andrzej Bialecki Re: Higher depth, fewer urls? Thu, 15 Nov, 16:55
Dennis Kubes Re: URI is not absolute... Thu, 15 Nov, 18:13
Yari M Mobile web sites Thu, 15 Nov, 20:26
crazy indexing word file Fri, 16 Nov, 08:15
paradise Exception in thread "main" java.lang.IllegalArgumentException: URI is not absolute Fri, 16 Nov, 08:24
Susam Pal Re: indexing word file Fri, 16 Nov, 08:29
crazy Re: indexing word file Fri, 16 Nov, 09:48
Susam Pal Re: indexing word file Fri, 16 Nov, 10:11
crazy Re: indexing word file Fri, 16 Nov, 10:37
Susam Pal Re: indexing word file Fri, 16 Nov, 10:57
crazy Re: indexing word file Fri, 16 Nov, 11:29
Susam Pal Re: indexing word file Fri, 16 Nov, 11:59
payo =?UTF-8?Q?word_cach=C3=A9?= Fri, 16 Nov, 15:25
crazy Re: indexing word file Fri, 16 Nov, 16:25
carmme...@globo.com Nightly version - no results? Fri, 16 Nov, 18:00
Sathyam Y very low fieldnorm leading to bad results Fri, 16 Nov, 18:26
Jasper Kamperman Re: very low fieldnorm leading to bad results Fri, 16 Nov, 18:44
Matei Zaharia Reduce job in invertlinks and index tasks often fails Sun, 18 Nov, 04:07
Josh Attenberg A record version mismatch occured. Expecting v5, found v69 Sun, 18 Nov, 19:41
Yari M Adddays & topN Mon, 19 Nov, 08:32
crazy Re: indexing word file Mon, 19 Nov, 08:59
crazy Re: indexing excel file Mon, 19 Nov, 14:40
P.Nguy...@Deutschepost.de AW: indexing excel file Mon, 19 Nov, 15:46
crazy Re: AW: indexing excel file Mon, 19 Nov, 16:35
Lev Kantorovich nutch 0.9 and eclipse 3.3 - Mon, 19 Nov, 19:18
Josh Attenberg Re: A record version mismatch occured. Expecting v5, found v69 Mon, 19 Nov, 19:44
Moore, Lee C http://www.mail-archive.com/nutch-user@lucene.apache.org/msg09096.html Mon, 19 Nov, 20:41
Josh Attenberg Re: A record version mismatch occured. Expecting v5, found v69 Mon, 19 Nov, 20:53
Doğacan Güney Re: A record version mismatch occured. Expecting v5, found v69 Mon, 19 Nov, 21:00
Tim Gautier Re: Hadoop .15 and eclipse on windows Mon, 19 Nov, 21:45
Ê©ÐË dfs.DataNode - Failed to transfer blk_xxxx to 192.168.140.244:50010 got java.net.SocketException: Connection reset Tue, 20 Nov, 03:18
|^| /-\\ |\\| |) /-\\ |2 Handling authentication Tue, 20 Nov, 04:57
Susam Pal Re: http://www.mail-archive.com/nutch-user@lucene.apache.org/msg09096.html Tue, 20 Nov, 06:39
eyal edri Re: nutch 0.9 and eclipse 3.3 - Tue, 20 Nov, 06:39
kumarlimbu Is storing 20 fields in a lucene document desirable? Tue, 20 Nov, 11:44
Andrzej Bialecki Re: dfs.DataNode - Failed to transfer blk_xxxx to 192.168.140.244:50010 got java.net.SocketException: Connection reset Tue, 20 Nov, 11:57
P.Nguy...@Deutschepost.de AW: AW: indexing excel file Tue, 20 Nov, 12:12
Christopher Condit PDF Indexing Problem Tue, 20 Nov, 20:00
Josh Attenberg No space left on device Wed, 21 Nov, 03:24
Susam Pal Re: No space left on device Wed, 21 Nov, 04:33
|^| /-\\ |\\| |) /-\\ |2 Re: Handling authentication Wed, 21 Nov, 04:52
Josh Attenberg Re: No space left on device Wed, 21 Nov, 04:58
Susam Pal Re: No space left on device Wed, 21 Nov, 05:09
Sagar Naik Re: Is storing 20 fields in a lucene document desirable? Wed, 21 Nov, 06:48
Lyndon Maydwell Re: No space left on device Wed, 21 Nov, 09:39
Abdou RABBA trying to configure nutch-0.9 Wed, 21 Nov, 12:30
Tomislav Poljak Re: dfs.DataNode - Failed to transfer blk_xxxx to 192.168.140.244:50010 got java.net.SocketException: Connection reset Wed, 21 Nov, 13:13
Josh Attenberg Re: No space left on device Wed, 21 Nov, 13:25
Josh Attenberg Re: No space left on device Wed, 21 Nov, 22:01
Cool Coder Crawl API Help Wed, 21 Nov, 22:18
Josh Attenberg Re: No space left on device Thu, 22 Nov, 23:02
Susam Pal Re: No space left on device Fri, 23 Nov, 05:55
Guido García Bernardo several requests with different headers to the same resource Fri, 23 Nov, 09:48
Tim Gautier Re: No space left on device Fri, 23 Nov, 16:32
Daniele Zuco graphExtractor.pl Fri, 23 Nov, 19:24
obradoa using trunk, urls disappearing when using 4 nodes Fri, 23 Nov, 19:54
obradoa Re: using trunk, urls disappearing when using 4 nodes Fri, 23 Nov, 22:35
jian chen crawl only option for Crawl.java and crawled content reader class Sat, 24 Nov, 01:19
Cool Coder Re: crawl only option for Crawl.java and crawled content reader class Sat, 24 Nov, 01:51
jian chen Re: crawl only option for Crawl.java and crawled content reader class Sat, 24 Nov, 07:35
eyal edri Re: nutch 0.9 and eclipse 3.3 - Sun, 25 Nov, 14:28
josky Relevant feedback Mon, 26 Nov, 13:13
Moore, Lee C RE: http://www.mail-archive.com/nutch-user@lucene.apache.org/msg09096.html Mon, 26 Nov, 19:16
payo process crawl Mon, 26 Nov, 19:16
payo =?UTF-8?Q?Re:_word_cach=C3=A9?= Mon, 26 Nov, 19:24
Bolle, Jeffrey F. Crash in Parser Mon, 26 Nov, 20:08
Bolle, Jeffrey F. RE: Crash in Parser Mon, 26 Nov, 20:12
Isabel Drost Re: crawl only option for Crawl.java and crawled content reader class Mon, 26 Nov, 20:31
Jose C. Lacal Newbie question: fetching specific files only. Mon, 26 Nov, 20:47
Isabel Drost Re: crawl only option for Crawl.java and crawled content reader class Mon, 26 Nov, 20:57
payo Re: trying to configure nutch-0.9 Mon, 26 Nov, 21:34
jian chen Re: crawl only option for Crawl.java and crawled content reader class Mon, 26 Nov, 21:36
Karol Rybak Re: Crash in Parser Mon, 26 Nov, 22:26
Karol Rybak Generate times Mon, 26 Nov, 23:02
charlie w Problems with mixed English/Russian page Tue, 27 Nov, 00:04
Daniele Zuco Usage readdb dump Tue, 27 Nov, 08:10
Alexis Votta NullPointerException with trunk Tue, 27 Nov, 14:11
Cool Coder Re: crawl only option for Crawl.java and crawled content reader class Tue, 27 Nov, 16:29
Dennis Kubes Re: NullPointerException with trunk Tue, 27 Nov, 16:47
Susam Pal Re: NullPointerException with trunk Tue, 27 Nov, 18:54
Ned Rockson Re: Crash in Parser Tue, 27 Nov, 19:24
Dennis Kubes Re: NullPointerException with trunk Tue, 27 Nov, 20:16
Christoph M. URL-Filter for ?indexing?? Tue, 27 Nov, 20:30
Isabel Drost Re: crawl only option for Crawl.java and crawled content reader class Tue, 27 Nov, 21:31
misc Re: Generate times Tue, 27 Nov, 22:15
Cool Coder How to read crawldb Tue, 27 Nov, 22:20
jian chen Re: How to read crawldb Tue, 27 Nov, 22:32
Brehm, Robert P RE: Error when using nutch Tue, 27 Nov, 22:54
Bolle, Jeffrey F. RE: Crash in Parser Tue, 27 Nov, 23:16
Message list« Previous · 1 · 2 · 3 · Next »Thread · Author · Date
Box list
Dec 200981
Nov 2009308
Oct 2009258
Sep 2009184
Aug 2009199
Jul 2009312
Jun 2009196
May 2009163
Apr 2009247
Mar 2009408
Feb 2009214
Jan 2009204
Dec 2008229
Nov 2008193
Oct 2008171
Sep 2008269
Aug 2008165
Jul 2008122
Jun 2008243
May 2008220
Apr 2008294
Mar 2008209
Feb 2008191
Jan 2008272
Dec 2007145
Nov 2007228
Oct 2007261
Sep 2007273
Aug 2007292
Jul 2007339
Jun 2007392
May 2007242
Apr 2007309
Mar 2007283
Feb 2007188
Jan 2007370
Dec 2006225
Nov 2006160
Oct 2006251
Sep 2006412
Aug 2006450
Jul 2006315
Jun 2006380
May 2006232
Apr 2006458
Mar 2006659
Feb 2006581
Jan 2006592
Dec 2005430
Nov 2005398
Oct 2005304
Sep 2005404
Aug 2005278
Jul 2005342
Jun 2005216
May 2005151
Apr 2005220
Mar 2005167