Mailing list archives: April 2009

Site index · List index
Message list« Previous · 1 · 2 · 3Thread · Author · Date
consultas Re: Nutch 1.0 experience Thu, 02 Apr, 00:18
dealmaker How to find out the encoding and format of the content stored in the index? Sun, 05 Apr, 05:54
dealmaker Re: How to find out the encoding and format of the content stored in the index? Sun, 05 Apr, 16:19
dealmaker Re: Hadoop java.io.IOException: Job failed! at org.apache.hadoop.mapred.JobClient.runJob(JobClient.java:1232) while indexing. Fri, 10 Apr, 05:53
dealmaker How come getContent returns HTML Entities? Sun, 12 Apr, 05:05
dealmaker How does Nutch Fetch Files in Relative Path? Tue, 14 Apr, 20:35
fa...@butterflycluster.net Re: How to get the html that i crawled Tue, 28 Apr, 07:40
fishg Re: Hadoop java.io.IOException: Job failed! at org.apache.hadoop.mapred.JobClient.runJob(JobClient.java:1232) while indexing. Fri, 17 Apr, 03:24
fmccown Re: type is incompatible in 1.0! Thu, 09 Apr, 14:49
ianwong what is subcollection plugin? Thu, 02 Apr, 11:11
ianwong how to restrict search result in defined domains? Mon, 20 Apr, 12:56
ianwong Re: Multiple "site:" in query Mon, 20 Apr, 13:22
jet2...@trashmail.net nutch: java.nio.charset.IllegalCharsetNameException: Fri, 10 Apr, 00:39
jet2...@trashmail.net java.nio.charset.IllegalCharsetNameException Fri, 10 Apr, 00:41
jqq Searching multiple indexes with Nutch-2 servers,0 segments Mon, 27 Apr, 12:58
kazam Nutch fetch creates too many http sessions Mon, 27 Apr, 16:25
kazam Re: Nutch fetch creates too many http sessions Tue, 28 Apr, 22:09
pranesh Re: Dedup: Job Failed and crawl stopped at depth 1 Fri, 03 Apr, 04:45
ram_sj Re: Crawler Output Flat file or Database? Wed, 01 Apr, 17:49
sgirao How to get the html that i crawled Mon, 27 Apr, 11:28
sgirao Re: How to get the html that i crawled Tue, 28 Apr, 07:36
srinivas jaini java heap space error Thu, 09 Apr, 06:37
v...@free.fr Is it possible to avoid Nutch 1.0 from indexing local directories ? Thu, 30 Apr, 09:14
v...@free.fr Re: Is it possible to avoid Nutch 1.0 from indexing local directories ? Thu, 30 Apr, 14:56
wku_kunal Re: Language Identifier plugin Tue, 14 Apr, 15:17
wu fuheng ebook resources - including lucene in action Mon, 20 Apr, 03:58
yanky young Re: How to find out the encoding and format of the content stored in the index? Sun, 05 Apr, 06:28
yanky young Re: How to find out the encoding and format of the content stored in the index? Sun, 05 Apr, 17:50
yanky young nutch 0.9 protocol-file plugin break with windows file name that contains space Mon, 06 Apr, 07:50
yanky young Re: Problem crawling BBC Hindi Site Mon, 06 Apr, 14:58
yanky young Re: Nutch can't find all files Mon, 06 Apr, 15:18
yanky young Re: nutch-1.0 distribution config problem Mon, 06 Apr, 15:32
yanky young Re: Crawler Output Flat file or Database? Mon, 06 Apr, 16:50
yanky young why nutch repeat fetching some pages Wed, 08 Apr, 05:32
yanky young Re: why nutch repeat fetching some pages Wed, 08 Apr, 12:48
yanky young Re: Nutch can't find all files Thu, 09 Apr, 04:59
yanky young Re: java heap space error Thu, 09 Apr, 14:48
yanky young Re: number of fetcher threads per host? Thu, 09 Apr, 15:10
yanky young Re: app question.... Fri, 10 Apr, 02:33
yanky young Re: fetcher issues Mon, 13 Apr, 03:17
yanky young Re: fetcher issues Mon, 13 Apr, 03:52
yanky young Re: fetcher issues Mon, 13 Apr, 04:47
yanky young Re: Can't build Nutch Mon, 20 Apr, 10:11
zxh116116 nutch-1.0 distribution config problem Fri, 03 Apr, 09:01
zxh116116 Re: nutch-1.0 distribution config problem Fri, 03 Apr, 12:39
zxh116116 nutch-1.0 datanode exception when fetching Mon, 06 Apr, 01:45
zxh116116 in nutch1.0 incread summary problem Tue, 28 Apr, 14:18
Message list« Previous · 1 · 2 · 3Thread · Author · Date
Box list
Dec 200981
Nov 2009308
Oct 2009258
Sep 2009184
Aug 2009199
Jul 2009312
Jun 2009196
May 2009163
Apr 2009247
Mar 2009408
Feb 2009214
Jan 2009204
Dec 2008229
Nov 2008193
Oct 2008171
Sep 2008269
Aug 2008165
Jul 2008122
Jun 2008243
May 2008220
Apr 2008294
Mar 2008209
Feb 2008191
Jan 2008272
Dec 2007145
Nov 2007228
Oct 2007261
Sep 2007273
Aug 2007292
Jul 2007339
Jun 2007392
May 2007242
Apr 2007309
Mar 2007283
Feb 2007188
Jan 2007370
Dec 2006225
Nov 2006160
Oct 2006251
Sep 2006412
Aug 2006450
Jul 2006315
Jun 2006380
May 2006232
Apr 2006458
Mar 2006659
Feb 2006581
Jan 2006592
Dec 2005430
Nov 2005398
Oct 2005304
Sep 2005404
Aug 2005278
Jul 2005342
Jun 2005216
May 2005151
Apr 2005220
Mar 2005167