Mailing list archives: January 2007

Site index · List index
Message list« Previous · 1 · 2 · 3 · 4 · Next »Thread · Author · Date
Shailendra Mudgal Re: NameNode throws FileNotFoundException: Parent path does not exist on startup Wed, 17 Jan, 11:37
Alvaro Cabrerizo Re: Searcher doesn't find what expected Wed, 17 Jan, 12:25
yo_keller Re: search or Tomcat ill response Wed, 17 Jan, 14:28
Lukas Vlcek Re: Problem finding out the number of crawled pages per domain Wed, 17 Jan, 15:30
Brian Whitman out of memory error at end of indexing Wed, 17 Jan, 16:57
Albert Chern Re: NameNode throws FileNotFoundException: Parent path does not exist on startup Wed, 17 Jan, 17:15
Brian Whitman Re: out of memory error at end of indexing Wed, 17 Jan, 18:23
cesar voulgaris Re: DB_unfetched status Thu, 18 Jan, 01:02
Shailendra Mudgal How to stop a slow fetch? Thu, 18 Jan, 05:26
Sean Dean Re: How to stop a slow fetch? Thu, 18 Jan, 06:46
Shailendra Mudgal Re: How to stop a slow fetch? Thu, 18 Jan, 06:54
Sean Dean Re: How to stop a slow fetch? Thu, 18 Jan, 07:07
jian chen Re: nutch-0.8 bundle for eclipse Thu, 18 Jan, 07:19
Andrzej Bialecki Re: DB_unfetched status Thu, 18 Jan, 08:09
termo...@gmail.com Nutch 0.8 cannot find all the links on a page Thu, 18 Jan, 08:30
Andrzej Bialecki Re: Nutch 0.8 cannot find all the links on a page Thu, 18 Jan, 13:44
Sami Siren Re: How to stop a slow fetch? Thu, 18 Jan, 20:16
Ledio Ago Reduce segment size Fri, 19 Jan, 01:57
Renaud Richardet Re: nutch-0.8 bundle for eclipse Fri, 19 Jan, 04:15
Sean Dean Re: Reduce segment size Fri, 19 Jan, 07:04
Vlador Re: Nutch 0.8 cannot find all the links on a page Fri, 19 Jan, 09:12
Gal Nitzan notch 0.9 + hadoop 0.10.1 problem Fri, 19 Jan, 09:44
Sean Dean Re: notch 0.9 + hadoop 0.10.1 problem Fri, 19 Jan, 10:03
yl...@ifrance.com Re: Re: problems to exclude subdirectories in a web site Fri, 19 Jan, 14:05
Gal Nitzan java.lang.OutOfMemoryError - trunk Fri, 19 Jan, 15:57
DS jha how to use PorterStemFilter with NutchDocumentAnalyzer Fri, 19 Jan, 17:14
Ledio Ago Reduce segment size Fri, 19 Jan, 17:53
Ledio Ago RE: Reduce segment size Fri, 19 Jan, 17:56
yl...@ifrance.com Input directory urls/url-fr.txt in localhost:9000 is invalid with Hadoop 0.4.0patched and Nutch 0.8.1 Fri, 19 Jan, 18:05
Sean Dean Re: java.lang.OutOfMemoryError - trunk Fri, 19 Jan, 18:24
Ledio Ago RE: Reduce segment size Fri, 19 Jan, 18:36
Gal Nitzan RE: java.lang.OutOfMemoryError - trunk Fri, 19 Jan, 18:38
Gal Nitzan RE: java.lang.OutOfMemoryError - trunk Fri, 19 Jan, 18:41
Sean Dean Re: Reduce segment size Fri, 19 Jan, 19:19
Ledio Ago RE: Reduce segment size Fri, 19 Jan, 19:34
Sean Dean Re: Reduce segment size Fri, 19 Jan, 20:00
Andrzej Bialecki Re: Input directory urls/url-fr.txt in localhost:9000 is invalid with Hadoop 0.4.0patched and Nutch 0.8.1 Fri, 19 Jan, 20:19
Andrzej Bialecki Re: Reduce segment size Fri, 19 Jan, 20:22
Gal Nitzan Does nutch segments from hadoop .7.1 different from hadoop .10.1 Fri, 19 Jan, 21:28
Ledio Ago RE: Reduce segment size Fri, 19 Jan, 21:35
Bharat Beedu Unique out of memory exception while fetching.. Sat, 20 Jan, 08:58
Espen Amble Kolstad Re: java.lang.OutOfMemoryError - trunk Sat, 20 Jan, 12:04
Vlador Limiting the total number of urls to crawl on a single website Sun, 21 Jan, 17:10
Tobias Zahn Indexing only some filetypes with Nutch Sun, 21 Jan, 17:50
Vlador Re: Indexing only some filetypes with Nutch Sun, 21 Jan, 20:29
Jonathan Hunter Compiling PruneIndexTool trouble Mon, 22 Jan, 05:56
Libor Štefek Re: Searcher doesn't find what expected Mon, 22 Jan, 11:33
Sami Siren Re: Compiling PruneIndexTool trouble Mon, 22 Jan, 15:07
Nicolás Lichtmaier "Or" searches in nutch Mon, 22 Jan, 20:51
Dennis Kubes Re: Indexing only some filetypes with Nutch Mon, 22 Jan, 21:07
Alvaro Cabrerizo Re: how to use PorterStemFilter with NutchDocumentAnalyzer Tue, 23 Jan, 08:34
DS jha Re: how to use PorterStemFilter with NutchDocumentAnalyzer Tue, 23 Jan, 15:21
Scott Green Can I generate nutch index without crawling? Tue, 23 Jan, 17:08
Nicolás Lichtmaier Boolean searches, again Tue, 23 Jan, 19:08
Sean Dean Re: Can I generate nutch index without crawling? Tue, 23 Jan, 22:51
Jonathan Hunter Re: Compiling PruneIndexTool trouble Tue, 23 Jan, 23:44
Renaud Richardet Re: Compiling PruneIndexTool trouble Wed, 24 Jan, 00:06
The Golden Condor ! Re: Can I generate nutch index without crawling? Wed, 24 Jan, 00:31
Renaud Richardet cannot search by url (url:) with Nutch 0.8 Wed, 24 Jan, 00:34
Scott Green Re: Can I generate nutch index without crawling? Wed, 24 Jan, 02:53
Enis Soztutar Re: Boolean searches, again Wed, 24 Jan, 09:08
Denis Pimenov nutch scrawls only relative links Wed, 24 Jan, 15:16
Denis Pimenov Re: nutch scrawls only relative links Wed, 24 Jan, 15:35
Aďcha exact matches and stemming Wed, 24 Jan, 17:13
Briggs Merging large sets of segments, help. Wed, 24 Jan, 17:48
Andrzej Bialecki Re: Merging large sets of segments, help. Wed, 24 Jan, 17:58
Briggs Re: Merging large sets of segments, help. Wed, 24 Jan, 18:07
Andrzej Bialecki Re: Merging large sets of segments, help. Wed, 24 Jan, 18:30
Alan Tanaman RE: nutch scrawls only relative links Wed, 24 Jan, 18:34
Briggs Re: Merging large sets of segments, help. Wed, 24 Jan, 18:35
Andrzej Bialecki Re: Merging large sets of segments, help. Wed, 24 Jan, 19:00
Tobias Zahn Re: Indexing only some filetypes with Nutch Wed, 24 Jan, 20:04
Sami Siren Re: Indexing only some filetypes with Nutch Wed, 24 Jan, 20:09
Tobias Zahn Re: Indexing only some filetypes with Nutch Wed, 24 Jan, 20:18
Nicolás Lichtmaier Re: Boolean searches, again Wed, 24 Jan, 22:15
Michael Wechner Problem crawling/fetching using https Wed, 24 Jan, 22:29
Chris Mattmann Re: Problem crawling/fetching using https Wed, 24 Jan, 22:33
Michael Wechner Re: Problem crawling/fetching using https Wed, 24 Jan, 22:44
Chris Mattmann Re: Problem crawling/fetching using https Wed, 24 Jan, 22:49
Michael Wechner Re: Problem crawling/fetching using https Wed, 24 Jan, 22:56
Andrzej Bialecki Re: Problem crawling/fetching using https Wed, 24 Jan, 23:10
Steve W. Partial Success installing Nutch 0.8.1 under Debian Etch: Procedure and Question(s) Wed, 24 Jan, 23:17
Chris Mattmann Re: Problem crawling/fetching using https Wed, 24 Jan, 23:29
Nathan Ter Bogt Multiple collections Thu, 25 Jan, 01:02
Alan Tanaman RE: Multiple collections Thu, 25 Jan, 09:39
Deepa Devanathan Crawling JSPs Thu, 25 Jan, 11:50
Denis Pimenov Re: Crawling JSPs Thu, 25 Jan, 12:06
Jeroen Verhagen Re: Crawling JSPs Thu, 25 Jan, 12:25
Enis Soztutar Re: Can I generate nutch index without crawling? Thu, 25 Jan, 14:13
Chris Mattmann Re: Problem crawling/fetching using https Fri, 26 Jan, 02:05
Boemio, Neil \(FGIC\) http://jakarta.apache.org/taglibs/i18n cannot be resolved Fri, 26 Jan, 03:58
Will Scheidegger Re: http://jakarta.apache.org/taglibs/i18n cannot be resolved Fri, 26 Jan, 07:15
Alvaro Cabrerizo Re: exact matches and stemming Fri, 26 Jan, 08:10
Michael Wechner Re: Problem crawling/fetching using https Fri, 26 Jan, 08:39
Boemio, Neil \(FGIC\) RE: http://jakarta.apache.org/taglibs/i18n cannot be resolved Fri, 26 Jan, 13:55
ma...@jcademy.com Linking url metadata to nutch search results Fri, 26 Jan, 13:57
Andrzej Bialecki Re: Linking url metadata to nutch search results Fri, 26 Jan, 14:05
Erik Höschler Problems Searching an Index with Nutch Fri, 26 Jan, 15:04
Gal Nitzan RE: Problems Searching an Index with Nutch Fri, 26 Jan, 15:52
Erik Höschler Re: Problems Searching an Index with Nutch Fri, 26 Jan, 15:58
Message list« Previous · 1 · 2 · 3 · 4 · Next »Thread · Author · Date
Box list
Dec 200979
Nov 2009308
Oct 2009258
Sep 2009184
Aug 2009199
Jul 2009312
Jun 2009196
May 2009163
Apr 2009247
Mar 2009408
Feb 2009214
Jan 2009204
Dec 2008229
Nov 2008193
Oct 2008171
Sep 2008269
Aug 2008165
Jul 2008122
Jun 2008243
May 2008220
Apr 2008294
Mar 2008209
Feb 2008191
Jan 2008272
Dec 2007145
Nov 2007228
Oct 2007261
Sep 2007273
Aug 2007292
Jul 2007339
Jun 2007392
May 2007242
Apr 2007309
Mar 2007283
Feb 2007188
Jan 2007370
Dec 2006225
Nov 2006160
Oct 2006251
Sep 2006412
Aug 2006450
Jul 2006315
Jun 2006380
May 2006232
Apr 2006458
Mar 2006659
Feb 2006581
Jan 2006592
Dec 2005430
Nov 2005398
Oct 2005304
Sep 2005404
Aug 2005278
Jul 2005342
Jun 2005216
May 2005151
Apr 2005220
Mar 2005167