Mailing list archives: June 2007

Kai_testing Middleton Possibly use a different library to parse RSS feed for improved performance and compatibility Wed, 20 Jun, 23:42
Kai_testing Middleton fetching http://www.variety.com/ Thu, 21 Jun, 22:24
Kai_testing Middleton Re: fetching http://www.variety.com/ Thu, 21 Jun, 23:02
Kai_testing Middleton Re: Using nutch just for the crawler/fetcher Sat, 23 Jun, 02:15
Kai_testing Middleton Re: fetching http://www.variety.com/ Sat, 23 Jun, 19:28
Kai_testing Middleton how to apply a patch to nutch Mon, 25 Jun, 19:51
Kai_testing Middleton how to apply a patch to nutch Mon, 25 Jun, 22:18
Kai_testing Middleton Re: how to apply a patch to nutch Mon, 25 Jun, 22:53
Kai_testing Middleton Re: how to apply a patch to nutch Mon, 25 Jun, 23:32
Kai_testing Middleton NUTCH-505 - cannot find symbol: variable URL_VALIDATOR Tue, 26 Jun, 04:43
Kai_testing Middleton Re: how to apply a patch to nutch Tue, 26 Jun, 17:57
Kai_testing Middleton Re: not crawling relative URLs Tue, 26 Jun, 19:18
Kai_testing Middleton Re: Possibly use a different library to parse RSS feed for improved performance and compatibility Thu, 28 Jun, 01:59
Kai_testing Middleton Re: not crawling relative URLs Thu, 28 Jun, 18:30
Kai_testing Middleton IOException using feed plugin - NUTCH-444 Thu, 28 Jun, 23:21
Kai_testing Middleton Re: IOException using feed plugin - NUTCH-444 Fri, 29 Jun, 00:02
Kai_testing Middleton Re: IOException using feed plugin - NUTCH-444 Fri, 29 Jun, 18:36
Kai_testing Middleton Re: IOException using feed plugin - NUTCH-444 Sat, 30 Jun, 00:24
Kai_testing Middleton Re: integrate Nutch into my php front page Sat, 30 Jun, 00:27
Kai_testing Middleton Interrupting a nutch crawl -- or use topN? Sat, 30 Jun, 02:10
Karol Rybak Distributed index Thu, 21 Jun, 10:46
Karol Rybak Re: Distributed index Fri, 22 Jun, 12:57
Karol Rybak Re: Distributed index Fri, 22 Jun, 18:25
Karol Rybak Re: Distributed index Fri, 22 Jun, 20:58
Karol Rybak Weird encoding problem Tue, 26 Jun, 07:34
Karol Rybak Problem with ooParser Thu, 28 Jun, 09:33
Ken Krugler Re: Nutch 0.9 and Crawl-Delay Mon, 04 Jun, 19:32
Li Zheng wei How to add parsed metadata to Parse.getData? Sun, 10 Jun, 21:55
Manoharam Reddy How to enable followRedirects? Mon, 04 Jun, 04:30
Manoharam Reddy Complex problem of recrawling economically Tue, 05 Jun, 04:31
Manoharam Reddy is it possible to set different addDays for different sites? Mon, 11 Jun, 05:36
Manoharam Reddy Why Nutch is indexing HTTP 302 pages Mon, 11 Jun, 05:37
Manoharam Reddy Re: What is parse-oo and why doesn't parsed PDF content show up in cached.jsp ? Tue, 12 Jun, 04:42
Manoharam Reddy meaning of depth value - tutorial wrong? Wed, 13 Jun, 05:49
Manoharam Reddy why number of results is more than topN x depth? Wed, 13 Jun, 06:04
Mark_Fletcher Is there a plugin that allows modification of the hit url before it's added to the index? Fri, 29 Jun, 20:03
Mark_Fletcher Re: Is there a plugin that allows modification of the hit url before it's added to the index? Fri, 29 Jun, 23:11
Martin Kammerlander urls/nutch in local is invalid Wed, 06 Jun, 15:02
Martin Kammerlander Re: urls/nutch in local is invalid Wed, 06 Jun, 16:02
Martin Kammerlander indexing only special documents Wed, 06 Jun, 18:29
Martin Kammerlander Re: indexing only special documents Wed, 06 Jun, 22:52
Martin Kammerlander Re: indexing only special documents Fri, 08 Jun, 13:51
Martin Kammerlander Re: Re: indexing only special documents Thu, 14 Jun, 12:47
Mathijs Homminga Checking existence of index segments Sat, 02 Jun, 20:10
Mathijs Homminga Cleaning up segments after indexing Sat, 02 Jun, 20:15
Mathijs Homminga Re: Error with the inject command Sun, 03 Jun, 19:31
Mathijs Homminga Re: Hadoop Log4j ? Thu, 14 Jun, 18:55
Mathijs Homminga Re: Crawl error with hadoop Sat, 30 Jun, 08:38
Matthew A. Bockol Re: integrate Nutch into my php front page Fri, 29 Jun, 23:51
Matthias Jaekle Re: Is fetcher.throttle.bandwidth known to work? Wed, 06 Jun, 12:57
Micah Vivion Having problems getting the field of "content" to be stored Mon, 18 Jun, 23:36
Milan Krendzelak Searching Filter Tue, 19 Jun, 14:14
Milan Krendzelak Re: Searching Filter Tue, 19 Jun, 14:46
Milan Krendzelak Re: Searching Filter Tue, 19 Jun, 16:29
Milan Krendzelak RE: 0.9 document boost inflated Fri, 22 Jun, 08:59
Milan Krendzelak RE: How to score a paticular page higher than the other pages Fri, 22 Jun, 16:21
Milan Krendzelak RE: How to score a paticular page higher than the other pages Mon, 25 Jun, 10:46
Naess, Ronny Re: indexing only special documents Thu, 07 Jun, 08:18
Naess, Ronny Reload index Mon, 18 Jun, 13:22
Naess, Ronny Re: Reload index Tue, 19 Jun, 05:04
Naess, Ronny Re: Problems stemming Tue, 19 Jun, 05:07
Naess, Ronny Re: Re[2]: Problems stemming Tue, 19 Jun, 10:38
Naess, Ronny SV: doubt about indexing Tue, 19 Jun, 10:42
Naess, Ronny Re: doubt about indexing Tue, 19 Jun, 12:22
Naess, Ronny SV: doubt about indexing Tue, 19 Jun, 16:36
Naess, Ronny Lucene client and nutch index Tue, 19 Jun, 17:39
Naess, Ronny Re: Lucene client and nutch index Tue, 19 Jun, 18:08
Naess, Ronny Re: SV: doubt about indexing Wed, 20 Jun, 05:47
Naess, Ronny Re: Reload index Wed, 20 Jun, 05:59
Naess, Ronny Re: Lucene client and nutch index Wed, 20 Jun, 06:07
Naess, Ronny Re: Lucene client and nutch index Wed, 20 Jun, 07:20
Naess, Ronny SV: Lucene client and nutch index Wed, 20 Jun, 08:01
Naess, Ronny Re: meta data plugin needed Wed, 20 Jun, 14:24
Naess, Ronny Re: doubt about indexing Wed, 20 Jun, 14:36
Naess, Ronny The ranking is wrong Tue, 26 Jun, 13:36
Naess, Ronny Re: The ranking is wrong Wed, 27 Jun, 10:30
Naess, Ronny Re: The ranking is wrong Fri, 29 Jun, 13:53
Nick Pisarro Changing Initial number of hits/page Searcher shows. Tue, 05 Jun, 22:43
Nick Pisarro RE(2): Changing Initial number of hits/page Searcher shows. Wed, 06 Jun, 23:45
Pike Re: Nutch and faceted search Sat, 02 Jun, 15:17
Robert Young OR searches possible? Fri, 22 Jun, 09:26
Robert Young Merging Nutch Hits objects Fri, 22 Jun, 11:32
Robert Young Case insensitive searching Tue, 26 Jun, 10:25
Robert Young Stemming with Nutch Thu, 28 Jun, 11:00
Robeyns Bart RE: how fast can nutch fetch urls ? Wed, 20 Jun, 07:20
Robeyns Bart RE: How to score a paticular page higher than the other pages Fri, 22 Jun, 16:56
Roger Dunk Re: integrate Nutch into my php front page Sat, 30 Jun, 01:18
Sami Siren Re: Enabling Spell-Check plugin in contrib Wed, 13 Jun, 19:03
Sami Siren Re: Enabling Spell-Check plugin in contrib Fri, 15 Jun, 15:07
Sami Siren Re: Lucene client and nutch index Wed, 20 Jun, 07:50
Sami Siren Re: [Nutch-general] Integrate nutch crawler with Solr index server Tue, 26 Jun, 14:15
Sami Siren Re: [Nutch-general] Integrate nutch crawler with Solr index server Tue, 26 Jun, 16:05
Sami Siren Re: [Nutch-general] Integrate nutch crawler with Solr index server Tue, 26 Jun, 16:30
Scam Re: Any URL filter available for search.jsp? Thu, 14 Jun, 21:04
Scam Re[2]: Any URL filter available for search.jsp? Thu, 14 Jun, 22:33
Scam Re[2]: Enabling Spell-Check plugin in contrib Thu, 14 Jun, 23:47
Scam Re[2]: Enabling Spell-Check plugin in contrib Fri, 15 Jun, 20:24
Scam Re[3]: Enabling Spell-Check plugin in contrib Sun, 17 Jun, 18:39
Scam Re: Problems stemming Mon, 18 Jun, 16:04
Scam Re[2]: Problems stemming Tue, 19 Jun, 09:53