Mailing list archives: February 2008

Site index · List index
Message list1 · 2 · Next »Thread · Author · Date
Mario Méndez Villegas Help to understand the crawl filter Wed, 20 Feb, 00:44
Mario Méndez Villegas Re: How are the Regex URL Filters Supposed to Work? Fri, 22 Feb, 01:11
Oleg Mürk Some questions about Nutch Mon, 11 Feb, 18:31
Oleg Mürk Re: Some questions about Nutch Mon, 18 Feb, 09:54
Oleg Mürrk Re: db.ignore.external.links Sat, 16 Feb, 13:27
Aled Rhys Jones QueryFilter runtime exception - incorrect plugins setup? Sun, 03 Feb, 17:29
Aled Rhys Jones RE: QueryFilter runtime exception - incorrect plugins setup? Tue, 05 Feb, 21:12
Aled Rhys Jones RE: QueryFilter runtime exception - incorrect plugins setup? Sun, 10 Feb, 15:13
Aled Rhys Jones RE: QueryFilter runtime exception - incorrect plugins setup? Sun, 10 Feb, 16:28
Andrzej Bialecki Re: Nutch vs. Heritrix Threading Sat, 02 Feb, 10:00
Andrzej Bialecki Re: Generate Normalizations, Resolving IP addresses, and Duplicates Wed, 06 Feb, 23:46
Andrzej Bialecki Re: Generate Normalizations, Resolving IP addresses, and Duplicates Thu, 07 Feb, 13:37
Andrzej Bialecki Re: nutch vs hadoop versions Sat, 09 Feb, 01:14
Andrzej Bialecki Re: jobtracker is local Thu, 21 Feb, 07:04
Andrzej Bialecki Re: setRetriesSinceFetch ? Tue, 26 Feb, 15:59
Andrzej Bialecki Re: java exception Wed, 27 Feb, 01:28
Andrzej Bialecki Re: Weighting on words Thu, 28 Feb, 18:46
Antonin Slezacek Nutch and Google sitemap protocol Wed, 06 Feb, 13:23
Arkadi.Kosmy...@csiro.au RE: fetcher failing with outofmemory exception Fri, 08 Feb, 23:12
Barry Haddow Re: Nutch and Hadoop Thu, 07 Feb, 21:58
Barry Haddow crawl stops at depth 1 Thu, 14 Feb, 16:31
Barry Haddow Re: crawl stops at depth 1 Thu, 14 Feb, 18:41
Barry Haddow Re: crawl stops at depth 1 Mon, 18 Feb, 18:18
Boris Lau parse-xml with large number of fields Thu, 28 Feb, 21:08
Brian Ulicny Re: Dublin core metadata fields Thu, 28 Feb, 14:34
Brian Ulicny Re: Dublin core metadata fields Thu, 28 Feb, 19:51
Brian Whitman Re: Solr/Nutch Integration Patch Error Tue, 12 Feb, 17:02
Chris Mattmann Re: java.lang.NoClassDefFoundError: org/apache/tika/mime/MimeTypeException in cached.jsp Mon, 04 Feb, 19:14
Chris Mattmann Re: Tika Error ? Thu, 14 Feb, 14:32
DS jha fetcher failing with outofmemory exception Fri, 08 Feb, 05:16
DS jha Re: fetcher failing with outofmemory exception Sun, 10 Feb, 00:15
DS jha Re: fetcher failing with outofmemory exception Mon, 11 Feb, 14:09
Daniel Clark Nutch vs. Heritrix Threading Fri, 01 Feb, 16:56
Dennis Kubes Default normalization of URLs Sat, 02 Feb, 22:22
Dennis Kubes Re: OutofMemory Error with updatedb Sat, 02 Feb, 22:59
Dennis Kubes Re: Urgent help reqd.....plz Tue, 05 Feb, 16:04
Dennis Kubes Re: Urgent help reqd.....plz Tue, 05 Feb, 17:44
Dennis Kubes Re: QueryFilter runtime exception - incorrect plugins setup? Tue, 05 Feb, 17:45
Dennis Kubes Re: Urgent help reqd.....plz Tue, 05 Feb, 20:00
Dennis Kubes Re: Questions on normalizer and filter related code in Crawl, Injector and Generator Tue, 05 Feb, 20:17
Dennis Kubes Re: Urgent help reqd.....plz Tue, 05 Feb, 20:21
Dennis Kubes Re: No urls to fetch Wed, 06 Feb, 14:09
Dennis Kubes Re: Questions on normalizer and filter related code in Crawl, Injector and Generator Wed, 06 Feb, 18:41
Dennis Kubes Generate Normalizations, Resolving IP addresses, and Duplicates Wed, 06 Feb, 18:59
Dennis Kubes Re: Questions on normalizer and filter related code in Crawl, Injector and Generator Wed, 06 Feb, 19:10
Dennis Kubes Re: Generate Normalizations, Resolving IP addresses, and Duplicates Thu, 07 Feb, 00:42
Dennis Kubes Re: Urgent help reqd.....plz Thu, 07 Feb, 16:52
Dennis Kubes Re: strange page rank Thu, 07 Feb, 16:57
Dennis Kubes Deleteing an index document in nutch Thu, 07 Feb, 23:37
Dennis Kubes Re: Deleteing an index document in nutch Fri, 08 Feb, 03:50
Dennis Kubes Re: nutch vs hadoop versions Sun, 10 Feb, 04:44
Dennis Kubes Re: QueryFilter runtime exception - incorrect plugins setup? Sun, 10 Feb, 18:01
Dennis Kubes Re: QueryFilter runtime exception - incorrect plugins setup? Sun, 10 Feb, 18:02
Dennis Kubes Re: strange page rank Mon, 11 Feb, 04:52
Dennis Kubes Re: strange page rank Mon, 11 Feb, 15:47
Dennis Kubes Re: nutch vs hadoop versions Tue, 19 Feb, 19:51
Dennis Kubes Re: NPE in org.apache.hadoop.fs.BufferedFSInputStream.getPos Wed, 20 Feb, 16:56
Dennis Kubes Re: crawl errors over pdf files Fri, 22 Feb, 22:50
Dennis Kubes Re: cat: /home/user/nutch/search/bin/../conf/masters: No such file or directory Mon, 25 Feb, 17:45
Dennis Kubes Re: Cannot delete /home/user/nutch/filesystem/mapreduce/system. Name node is in safe mode. - Error Mon, 25 Feb, 19:49
Dennis Kubes Re: How to update search.dir with least interruption of service? Wed, 27 Feb, 22:18
Dennis Kubes Re: Weighting on words Thu, 28 Feb, 18:42
Developer Developer Installing nutch over existing Hadoop cluster Thu, 14 Feb, 13:19
Developer Developer cat: /home/user/nutch/search/bin/../conf/masters: No such file or directory Mon, 25 Feb, 16:32
Developer Developer Re: cat: /home/user/nutch/search/bin/../conf/masters: No such file or directory Mon, 25 Feb, 18:50
Developer Developer Cannot delete /home/user/nutch/filesystem/mapreduce/system. Name node is in safe mode. - Error Mon, 25 Feb, 18:53
Duan, Nick Nutch and Lucene Wed, 27 Feb, 01:11
Emmanuel Tika Error ? Thu, 14 Feb, 14:07
Emmanuel Re: Tika Error ? Thu, 14 Feb, 14:42
Emmanuel Re: Tika Error ? Sun, 17 Feb, 06:39
Emmanuel Query Searc Sun, 17 Feb, 07:04
Emmanuel setRetriesSinceFetch ? Tue, 26 Feb, 14:54
Emmanuel Ask for expertise and advice Thu, 28 Feb, 14:07
Euan Clark java exception Wed, 27 Feb, 00:43
Euan Clark Re: java exception Wed, 27 Feb, 02:22
Fred Gilmore crawl errors over pdf files Fri, 22 Feb, 22:29
Garnier Garnier Not able to crawl local file system: need help Thu, 28 Feb, 06:40
Guanyu plugin and classloader question Wed, 20 Feb, 02:47
Hilkiah Lavinier apache and nutch ONLY Fri, 08 Feb, 12:14
Hilkiah Lavinier nutch/hadoop parameters for optimal performance Mon, 11 Feb, 01:56
Howie Wang RE: Solr Integration/Stemming? Mon, 11 Feb, 20:37
Howie Wang RE: How to update search.dir with least interruption of service? Wed, 27 Feb, 22:01
Ismael Re: Not able to crawl local file system: need help Thu, 28 Feb, 11:37
Ivannie Re: Re: Nutch 0.9 mysterious failure to crawl sites (stopping at depth=0) Sat, 23 Feb, 03:56
Ivannie Re: Nutch and Lucene Wed, 27 Feb, 07:50
Jake Re: apache and nutch ONLY Fri, 08 Feb, 16:20
Jasper Kamperman Re: Controlling indexing and scoring Thu, 07 Feb, 17:52
Jasper Kamperman Re: Weighting on words Wed, 27 Feb, 18:51
Jasper Kamperman Re: help with boost Thu, 28 Feb, 18:01
Jaya Ghosh Java Error Wed, 20 Feb, 11:21
Jaya Ghosh Java Error Wed, 20 Feb, 11:28
Jaya Ghosh RE: Java Error Thu, 21 Feb, 04:07
Jaya Ghosh Not able to load Nutch Search page Thu, 21 Feb, 08:52
Jaya Ghosh RE: Not able to load Nutch Search page Thu, 21 Feb, 10:21
Jean-Christophe Alleman Weighting on words Wed, 27 Feb, 15:46
Jean-Christophe Alleman help with boost Thu, 28 Feb, 15:25
Jeffrey Koch GettingNutchRunningWithDebian Wiki page: questions and suggestions Thu, 14 Feb, 01:40
Jiaqi Tan Nutch 0.9 mysterious failure to crawl sites (stopping at depth=0) Wed, 20 Feb, 20:52
Jiaqi Tan Re: Nutch 0.9 mysterious failure to crawl sites (stopping at depth=0) Wed, 20 Feb, 22:26
Jiaqi Tan Re: Nutch 0.9 mysterious failure to crawl sites (stopping at depth=0) Wed, 20 Feb, 22:46
Message list1 · 2 · Next »Thread · Author · Date
Box list
Nov 2009277
Oct 2009258
Sep 2009184
Aug 2009199
Jul 2009312
Jun 2009196
May 2009163
Apr 2009247
Mar 2009408
Feb 2009214
Jan 2009204
Dec 2008229
Nov 2008193
Oct 2008171
Sep 2008269
Aug 2008165
Jul 2008122
Jun 2008243
May 2008220
Apr 2008294
Mar 2008209
Feb 2008191
Jan 2008272
Dec 2007145
Nov 2007228
Oct 2007261
Sep 2007273
Aug 2007292
Jul 2007339
Jun 2007392
May 2007242
Apr 2007309
Mar 2007283
Feb 2007188
Jan 2007370
Dec 2006225
Nov 2006160
Oct 2006251
Sep 2006412
Aug 2006450
Jul 2006315
Jun 2006380
May 2006232
Apr 2006458
Mar 2006659
Feb 2006581
Jan 2006592
Dec 2005430
Nov 2005398
Oct 2005304
Sep 2005404
Aug 2005278
Jul 2005342
Jun 2005216
May 2005151
Apr 2005220
Mar 2005167