Mailing list archives: June 2008

Site index · List index
Message list« Previous · 1 · 2 · 3 · Next »Thread · Author · Date
Hemant Bist problem running nutch from eclipse 3.2 in ubuntu hardy. Sat, 14 Jun, 05:47
Otis Gospodnetic   Re: problem running nutch from eclipse 3.2 in ubuntu hardy. Sat, 14 Jun, 05:55
Hemant Bist     Re: problem running nutch from eclipse 3.2 in ubuntu hardy. Sat, 14 Jun, 06:03
Otis Gospodnetic   Re: problem running nutch from eclipse 3.2 in ubuntu hardy. Sat, 14 Jun, 19:43
Marcus Herou Anti-spam Sat, 14 Jun, 10:30
Marcus Herou Nutch anti spam Sat, 14 Jun, 10:51
nutch_newbie customize nutch? Sat, 14 Jun, 14:36
nutch_newbie Something very, very strange....about how my nutch runs... please help! Sat, 14 Jun, 15:29
nutch_newbie Question on re-crawling. Sat, 14 Jun, 21:51
nutch_newbie Crawl parameters/settings Sun, 15 Jun, 19:38
Felix Zimmermann infinite loop-problem Mon, 16 Jun, 12:46
Otis Gospodnetic   Re: infinite loop-problem Tue, 17 Jun, 03:32
beansproud where nutch store crawled data Mon, 16 Jun, 14:41
POIRIER David   RE: where nutch store crawled data Mon, 16 Jun, 14:59
beansproud     RE: where nutch store crawled data Tue, 17 Jun, 13:57
Winton Davies       Re: where nutch store crawled data Tue, 17 Jun, 17:02
beansproud         Re: where nutch store crawled data Fri, 20 Jun, 02:33
beansproud         Re: where nutch store crawled data Fri, 20 Jun, 02:40
Marcus Herou         Re: where nutch store crawled data Sat, 21 Jun, 12:17
Marcus Herou       Re: where nutch store crawled data Tue, 17 Jun, 17:57
Marcus Herou         Re: where nutch store crawled data Tue, 17 Jun, 18:00
Chris Anderson         Re: where nutch store crawled data Tue, 17 Jun, 18:01
Marcus Herou           Re: where nutch store crawled data Tue, 17 Jun, 18:03
Otis Gospodnetic   Re: where nutch store crawled data Wed, 18 Jun, 04:49
Del Rio, Ann how does nutch connect to urls internally? Mon, 16 Jun, 16:22
Susam Pal   Re: how does nutch connect to urls internally? Mon, 16 Jun, 16:47
Del Rio, Ann     RE: how does nutch connect to urls internally? Mon, 16 Jun, 17:17
Del Rio, Ann     RE: how does nutch connect to urls internally? Thu, 19 Jun, 22:54
Otis Gospodnetic   Re: how does nutch connect to urls internally? Fri, 20 Jun, 05:53
Winton Davies     GNUgcj problem? Fri, 20 Jun, 19:38
kevin chen       Re: GNUgcj problem? Sat, 21 Jun, 14:16
Winton Davies         Re: GNUgcj problem? Sat, 21 Jun, 22:01
Del Rio, Ann     RE: how does nutch connect to urls internally? Sat, 21 Jun, 01:53
Otis Gospodnetic   Re: how does nutch connect to urls internally? Sat, 21 Jun, 05:55
Del Rio, Ann     RE: how does nutch connect to urls internally? Mon, 23 Jun, 16:30
Otis Gospodnetic   Re: how does nutch connect to urls internally? Mon, 23 Jun, 17:24
Drew Hite db.ignore.external.links=true and redirects Mon, 16 Jun, 17:09
Drew Hite   Re: db.ignore.external.links=true and redirects Mon, 16 Jun, 17:11
Otis Gospodnetic   Re: db.ignore.external.links=true and redirects Tue, 17 Jun, 03:29
John Thompson ClassNotFoundException: org.apache.nutch.analysis.CommonGrams Mon, 16 Jun, 19:48
Otis Gospodnetic   Re: ClassNotFoundException: org.apache.nutch.analysis.CommonGrams Tue, 17 Jun, 03:28
John Thompson     Re: ClassNotFoundException: org.apache.nutch.analysis.CommonGrams Thu, 19 Jun, 07:10
DS jha getting seed list for vertical search engine Tue, 17 Jun, 03:04
Otis Gospodnetic   Re: getting seed list for vertical search engine Tue, 17 Jun, 03:15
DS jha     Re: getting seed list for vertical search engine Tue, 17 Jun, 18:11
Otis Gospodnetic   Re: getting seed list for vertical search engine Wed, 18 Jun, 04:47
m.harig Nutch is not indexing Tue, 17 Jun, 07:15
Marcus Herou Nutch + HBase Tue, 17 Jun, 17:39
Andrzej Bialecki   Re: Nutch + HBase Tue, 17 Jun, 19:07
Marcus Herou     Re: Nutch + HBase Tue, 17 Jun, 20:00
Andrzej Bialecki       Re: Nutch + HBase Tue, 17 Jun, 20:12
Ruslan Sivak Simple site search Tue, 17 Jun, 18:09
idr...@htwm.de Hadoop get together @ Berlin Tue, 17 Jun, 18:50
wynz lo problems with link limits Tue, 17 Jun, 22:18
Otis Gospodnetic   Re: problems with link limits Wed, 18 Jun, 04:45
wynz lo     Re: problems with link limits Wed, 18 Jun, 13:28
Chris Kline updating retry inteval Tue, 17 Jun, 22:19
Otis Gospodnetic   Re: updating retry inteval Thu, 19 Jun, 13:01
John Martyniak     Re: updating retry inteval Thu, 19 Jun, 14:43
Garnier Garnier Has anybody implemented NUTCH in a C or C++ Application? Wed, 18 Jun, 04:57
Otis Gospodnetic   Re: Has anybody implemented NUTCH in a C or C++ Application? Thu, 19 Jun, 12:58
beansproud two questions about nutch url filter when inject Wed, 18 Jun, 14:38
Eric J. Christeson   Re: two questions about nutch url filter when inject Wed, 18 Jun, 15:33
beansproud     Re: two questions about nutch url filter when inject Thu, 19 Jun, 06:29
Martin Xu All administration gui links in wiki are broken Thu, 19 Jun, 08:14
Martin Xu   Re: All administration gui links in wiki are broken Thu, 19 Jun, 08:37
Otis Gospodnetic   Re: All administration gui links in wiki are broken Thu, 19 Jun, 12:55
John Thompson Can I update my search engine without restarting tomcat? Thu, 19 Jun, 09:32
Wynz Lo   Re: Can I update my search engine without restarting tomcat? Thu, 19 Jun, 11:19
John Thompson     Re: Can I update my search engine without restarting tomcat? Thu, 19 Jun, 18:20
Howie Wang       RE: Can I update my search engine without restarting tomcat? Thu, 19 Jun, 18:31
John Thompson         Re: Can I update my search engine without restarting tomcat? Thu, 19 Jun, 19:17
Eric J. Christeson       Re: Can I update my search engine without restarting tomcat? Thu, 19 Jun, 19:16
Ricardo Ramirez No results when searching via the web Fri, 20 Jun, 22:02
Jason Boss   Re: No results when searching via the web Sat, 21 Jun, 03:00
Howie Wang     RE: No results when searching via the web Sat, 21 Jun, 03:18
John Thompson       Re: No results when searching via the web Sat, 21 Jun, 21:46
Ricardo Ramirez     Re: No results when searching via the web Sun, 22 Jun, 01:54
Howie Wang       RE: No results when searching via the web Sun, 22 Jun, 05:40
Jason Boss         Re: No results when searching via the web Sun, 22 Jun, 08:04
Ricardo Ramirez         Re: No results when searching via the web Mon, 23 Jun, 00:57
Otis Gospodnetic Re: GNUgcj problem? Sat, 21 Jun, 05:49
idr...@htwm.de   Re: GNUgcj problem? Tue, 24 Jun, 05:58
kevin chen Why do I need segment directory when not using cache? Sat, 21 Jun, 14:31
wuqi   Re: Why do I need segment directory when not using cache? Sat, 21 Jun, 17:21
nutch_newbie Re-crawl frequency/memory problem- please help Sat, 21 Jun, 21:43
Viksit Gaur Querying linkdb for a URL with special characters Sun, 22 Jun, 02:33
Otis Gospodnetic   Re: Querying linkdb for a URL with special characters Sun, 22 Jun, 20:00
Otis Gospodnetic Fetching only unfetched URLs Sun, 22 Jun, 20:13
Winton Davies   Error starting Nutch-0.9 in Tomcat 5 Mon, 23 Jun, 04:01
inet-fan No search results - Nutch 0.9 on FreeBSD Sun, 22 Jun, 22:44
inet-fan   Re: No search results - Nutch 0.9 on FreeBSD Mon, 23 Jun, 11:23
inet-fan     Re: No search results - Nutch 0.9 on FreeBSD Mon, 23 Jun, 12:15
Winton Davies default hadoop goes to / Mon, 23 Jun, 04:04
Otis Gospodnetic   Re: default hadoop goes to / Mon, 23 Jun, 04:52
¹ý¼Ñ Does nutch-0.9 support multi-client's host control? Tue, 24 Jun, 06:25
Winton Davies Wiki Index Wed, 25 Jun, 00:03
Winton Davies   Re: Wiki Index Wed, 25 Jun, 23:38
Mathias Conradt URLs not crawled in order (referring to URL list) Wed, 25 Jun, 01:14
Winton Davies   Re: URLs not crawled in order (referring to URL list) Wed, 25 Jun, 01:28
Mathias Conradt     Re: URLs not crawled in order (referring to URL list) Wed, 25 Jun, 02:08
Benny Lipsicas Nutch index vs Lucene index Wed, 25 Jun, 13:54
Lyndon Maydwell   Re: Nutch index vs Lucene index Wed, 25 Jun, 14:58
kranthi reddy Crawling SLASHDOT.ORG Wed, 25 Jun, 17:30
Howie Wang   RE: Crawling SLASHDOT.ORG Wed, 25 Jun, 17:45
kranthi reddy     Re: Crawling SLASHDOT.ORG Wed, 25 Jun, 17:48
Howie Wang       RE: Crawling SLASHDOT.ORG Wed, 25 Jun, 18:15
kranthi reddy         Re: Crawling SLASHDOT.ORG Wed, 25 Jun, 18:23
Howie Wang           RE: Crawling SLASHDOT.ORG Wed, 25 Jun, 18:58
kranthi reddy             Re: Crawling SLASHDOT.ORG Wed, 25 Jun, 19:38
John Thompson Understanding Lucene Document Fields Wed, 25 Jun, 21:58
John Thompson   Re: Understanding Lucene Document Fields Wed, 25 Jun, 22:56
Hector Toll Scoring Formula Thu, 26 Jun, 11:47
Felix Zimmermann individual crawl-urlfilter.txt and nutch-site.xml for different crawls? Thu, 26 Jun, 11:49
Devang Shah   RE: individual crawl-urlfilter.txt and nutch-site.xml for different crawls? Thu, 26 Jun, 13:28
Joe Malcolm     RE: individual crawl-urlfilter.txt and nutch-site.xml for different crawls? Mon, 30 Jun, 19:45
Kursun, Mahmut Funny thing that I realized today by accident Thu, 26 Jun, 15:08
Message list« Previous · 1 · 2 · 3 · Next »Thread · Author · Date
Box list
Dec 200993
Nov 2009308
Oct 2009258
Sep 2009184
Aug 2009199
Jul 2009312
Jun 2009196
May 2009163
Apr 2009247
Mar 2009408
Feb 2009214
Jan 2009204
Dec 2008229
Nov 2008193
Oct 2008171
Sep 2008269
Aug 2008165
Jul 2008122
Jun 2008243
May 2008220
Apr 2008294
Mar 2008209
Feb 2008191
Jan 2008272
Dec 2007145
Nov 2007228
Oct 2007261
Sep 2007273
Aug 2007292
Jul 2007339
Jun 2007392
May 2007242
Apr 2007309
Mar 2007283
Feb 2007188
Jan 2007370
Dec 2006225
Nov 2006160
Oct 2006251
Sep 2006412
Aug 2006450
Jul 2006315
Jun 2006380
May 2006232
Apr 2006458
Mar 2006659
Feb 2006581
Jan 2006592
Dec 2005430
Nov 2005398
Oct 2005304
Sep 2005404
Aug 2005278
Jul 2005342
Jun 2005216
May 2005151
Apr 2005220
Mar 2005167