Mailing list archives: March 2009

Site index · List index
Message list« Previous · 1 · 2 · 3 · 4 · 5 · Next »Thread · Author · Date
Yves Yu Re: About search inner links information Thu, 05 Mar, 15:00
Yves Yu Re: About search inner links information Thu, 05 Mar, 16:32
Yves Yu Re: About search inner links information Thu, 05 Mar, 16:53
Yves Yu Re: About search inner links information Thu, 05 Mar, 18:05
Yves Yu Re: About search inner links information Fri, 06 Mar, 03:58
ahammad Problem with crawling using the latest 1.0 trunk Mon, 02 Mar, 19:09
ahammad Re: Problem with crawling using the latest 1.0 trunk Mon, 02 Mar, 19:18
ahammad Re: Problem with crawling using the latest 1.0 trunk Mon, 02 Mar, 19:24
ahammad Re: Problem with crawling using the latest 1.0 trunk Mon, 02 Mar, 19:40
ahammad Re: how to make Nutch work for Solr? Fri, 06 Mar, 21:12
alx...@aim.com Re: urls with ? and & symbols Mon, 02 Mar, 23:36
alx...@aim.com Re: urls with ? and & symbols Tue, 03 Mar, 01:07
alx...@aim.com what is needed to index for about 10000 domains Tue, 03 Mar, 20:44
alx...@aim.com Re: what is needed to index for about 10000 domains Tue, 03 Mar, 22:10
alx...@aim.com Re: what is needed to index for about 10000 domains Wed, 04 Mar, 00:14
alx...@aim.com Re: what is needed to index for about 10000 domains Wed, 04 Mar, 04:27
alx...@aim.com Re: what is needed to index for about 10000 domains Wed, 04 Mar, 04:48
alx...@aim.com Re: what is needed to index for about 10000 domains Wed, 04 Mar, 07:22
alx...@aim.com Re: what is needed to index for about 10000 domains Thu, 05 Mar, 21:56
alx...@aim.com error after adding indexes manually Fri, 13 Mar, 23:41
alx...@aim.com Re: error after adding indexes manually Sat, 14 Mar, 00:21
alx...@aim.com Re: error after adding indexes manually Sat, 14 Mar, 01:33
alx...@aim.com Re: error after adding indexes manually Sat, 14 Mar, 04:18
alx...@aim.com Re: error after adding indexes manually Sat, 14 Mar, 04:19
alx...@aim.com Re: error after adding indexes manually Sat, 14 Mar, 23:06
alx...@aim.com Re: Nutch doesn't find all urls.. Any suggestion? Thu, 19 Mar, 17:32
alx...@aim.com Re: Limiting crawls to subwebs Thu, 26 Mar, 21:08
alx...@aim.com lukeall-0.9.1 to manually add indexes Mon, 30 Mar, 05:16
askNutch type is incompatible in 1.0! Mon, 30 Mar, 08:29
bruce Parsing/Crawler Questions.. Wed, 04 Mar, 21:53
bruce RE: Parsing/Crawler Questions.. Thu, 05 Mar, 03:59
bruce app question.... Mon, 30 Mar, 19:47
buddha1021 Re: The Future of Nutch Sat, 14 Mar, 02:42
buddha1021 Re: The Future of Nutch Sat, 14 Mar, 12:45
buddha1021 Re: type is incompatible in 1.0! Tue, 31 Mar, 03:19
consultas Re: The Future of Nutch Sat, 14 Mar, 15:44
dayz...@gmail.com Re: Re: The numFetchers option Sun, 08 Mar, 13:58
dayz...@gmail.com Running multiple processes on a single machine Wed, 11 Mar, 12:28
dealmaker How do you setup your svn for your nutch code? Mon, 02 Mar, 00:22
dealmaker Re: How do you setup your svn for your nutch code? Mon, 02 Mar, 01:27
dealmaker Re: How do you setup your svn for your nutch code? Mon, 02 Mar, 03:54
dealmaker Re: How do you setup your svn for your nutch code? Mon, 02 Mar, 03:55
dealmaker Re: How do you setup your svn for your nutch code? Mon, 02 Mar, 04:10
dealmaker getIndexDocNo ( ) doesn't exist in Nutch nightly build anymore? Tue, 03 Mar, 03:55
dealmaker Does MoreLikeThis work with Nutch 1.0 / nightly build? Tue, 03 Mar, 06:49
dealmaker Hadoop java.io.IOException: Job failed! at org.apache.hadoop.mapred.JobClient.runJob(JobClient.java:1232) while indexing. Wed, 04 Mar, 22:59
dealmaker Re: Exception when crawling Wed, 04 Mar, 23:54
dealmaker Re: Problem with crawling using the latest 1.0 trunk Thu, 05 Mar, 00:03
dealmaker Where can I download old carrot2 2.1 code & binary? Thu, 05 Mar, 22:18
dealmaker How to ignore search results that don't have related keywords in main body? Mon, 23 Mar, 05:53
dealmaker Template Detection? Mon, 23 Mar, 08:10
dealmaker Re: Template Detection? Mon, 23 Mar, 14:45
dealmaker How to save additional data into crawl db or segment? Tue, 24 Mar, 21:54
dealmaker How to Boost Keywords in Search Query? Thu, 26 Mar, 19:35
ianwong how to recreate index Wed, 25 Mar, 13:09
ianwong how to set timeout to queryserver Thu, 26 Mar, 05:19
jackyu webapps Sat, 07 Mar, 19:28
jackyu Re: webapps Sun, 08 Mar, 06:34
jackyu Re: The numFetchers option Mon, 09 Mar, 02:45
jackyu Re: URLFilter Plugin ClassNotFoundException Tue, 10 Mar, 03:50
jackyu wiki article not exist Sat, 14 Mar, 12:31
jackyu 1.0 mp3 plugin test not pass Mon, 16 Mar, 16:28
kazam common-terms.utf8 not being found Wed, 04 Mar, 00:16
kazam Re: common-terms.utf8 location Fri, 06 Mar, 17:08
kranthi reddy Crawling Using RSS Feeds Fri, 20 Mar, 11:49
marcel richter external links in cached pages Mon, 02 Mar, 16:11
n_developer Query the user defined field Wed, 04 Mar, 11:33
n_developer Re: Query the user defined field Mon, 09 Mar, 07:29
n_developer embed nutch crawl in an application Wed, 18 Mar, 05:10
n_developer Re: Nutch 1.0 trunk Fetch Schedule Wed, 18 Mar, 13:01
n_developer Re: Nutch 1.0 trunk Fetch Schedule Wed, 18 Mar, 13:50
norton Error with Nutch 1.0 crawling Sun, 29 Mar, 14:46
nutchu...@sycona.com Could not find the main class: admin. Mon, 02 Mar, 07:28
nutchu...@sycona.com Re: Could not find the main class: admin. Mon, 02 Mar, 07:43
nutchu...@sycona.com Input path doesnt exist : XYZ/crawl/segments/20090302092003/parse_data Mon, 02 Mar, 08:27
ram_sj Crawler Output Flat file or Database? Mon, 30 Mar, 00:30
schroedi Re: Nutch Trunk Java requirement Wed, 25 Mar, 14:29
tigertail Re: Problem with crawling using the latest 1.0 trunk Wed, 04 Mar, 17:01
vishal vachhani Re: Pulling out URLs Thu, 12 Mar, 10:14
vishal vachhani Re: Too many open files Nutch 0.8 Mon, 16 Mar, 17:37
vishal vachhani Re: Original tags, attribute defs, multiword tokens, how is this done. Tue, 17 Mar, 14:35
vishal vachhani Re: MergeSegments Error. Thu, 19 Mar, 10:35
yanky young Re: how to crawl multiple websites in each run? Tue, 03 Mar, 04:07
yanky young Re: Exception when crawling Tue, 03 Mar, 04:20
yanky young Re: why I cannot find this link? Tue, 03 Mar, 15:46
yanky young Re: how to crawl multiple websites in each run? Tue, 03 Mar, 16:09
yanky young Re: Keeping content fresh Tue, 03 Mar, 17:15
yanky young Re: why I cannot find this link? Tue, 03 Mar, 17:28
yanky young Re: why I cannot find this link? Wed, 04 Mar, 02:20
yanky young Re: what is needed to index for about 10000 domains Wed, 04 Mar, 02:41
yanky young Re: why I cannot find this link? Wed, 04 Mar, 04:44
yanky young Re: Parsing/Crawler Questions.. Thu, 05 Mar, 01:48
yanky young Re: Hadoop java.io.IOException: Job failed! at org.apache.hadoop.mapred.JobClient.runJob(JobClient.java:1232) while indexing. Thu, 05 Mar, 01:50
yanky young Re: A General suggestion: To improve effectiveness of the forums Thu, 05 Mar, 03:41
yanky young Re: Parsing/Crawler Questions.. Thu, 05 Mar, 03:46
yanky young Re: Parsing/Crawler Questions.. Thu, 05 Mar, 04:41
yanky young Re: URLFilter Plugin ClassNotFoundExpections Mon, 09 Mar, 16:15
yanky young Re: Limit Nutch Crawl to Seed URLs Sat, 14 Mar, 06:28
yanky young Re: The Future of Nutch Sat, 14 Mar, 07:03
yanky young Re: synchronized File Writer Mon, 16 Mar, 06:21
Message list« Previous · 1 · 2 · 3 · 4 · 5 · Next »Thread · Author · Date
Box list
Dec 200982
Nov 2009308
Oct 2009258
Sep 2009184
Aug 2009199
Jul 2009312
Jun 2009196
May 2009163
Apr 2009247
Mar 2009408
Feb 2009214
Jan 2009204
Dec 2008229
Nov 2008193
Oct 2008171
Sep 2008269
Aug 2008165
Jul 2008122
Jun 2008243
May 2008220
Apr 2008294
Mar 2008209
Feb 2008191
Jan 2008272
Dec 2007145
Nov 2007228
Oct 2007261
Sep 2007273
Aug 2007292
Jul 2007339
Jun 2007392
May 2007242
Apr 2007309
Mar 2007283
Feb 2007188
Jan 2007370
Dec 2006225
Nov 2006160
Oct 2006251
Sep 2006412
Aug 2006450
Jul 2006315
Jun 2006380
May 2006232
Apr 2006458
Mar 2006659
Feb 2006581
Jan 2006592
Dec 2005430
Nov 2005398
Oct 2005304
Sep 2005404
Aug 2005278
Jul 2005342
Jun 2005216
May 2005151
Apr 2005220
Mar 2005167