Mailing list archives: August 2009

Ken Krugler Web Crawler MeetUp info on wiki Mon, 03 Aug, 00:19
Kirby Bohling OSGi progress Mon, 03 Aug, 04:00
Andrzej Bialecki Re: Web Crawler MeetUp info on wiki Mon, 03 Aug, 10:42
Apache Wiki [Nutch Wiki] Trivial Update of "FrontPage" by KenKrugler Mon, 03 Aug, 16:31
Apache Wiki [Nutch Wiki] Trivial Update of "FrontPage" by KenKrugler Mon, 03 Aug, 16:33
Apache Wiki [Nutch Wiki] Trivial Update of "FrontPage" by KenKrugler Mon, 03 Aug, 16:35
Apache Wiki [Nutch Wiki] Update of "ApacheConUs2009MeetUp" by KenKrugler Mon, 03 Aug, 16:43
Apache Wiki [Nutch Wiki] Trivial Update of "ApacheConUs2009MeetUp" by KenKrugler Mon, 03 Aug, 16:43
Apache Wiki [Nutch Wiki] Trivial Update of "ApacheConUs2009MeetUp" by KenKrugler Mon, 03 Aug, 16:45
Ken Krugler MeetUp topic list posted Mon, 03 Aug, 16:51
Andrzej Bialecki Re: MeetUp topic list posted Mon, 03 Aug, 17:27
Ken Krugler Re: MeetUp topic list posted Mon, 03 Aug, 19:02
Ken Krugler Re: MeetUp topic list posted Mon, 03 Aug, 19:08
Andrzej Bialecki Re: OSGi progress Tue, 04 Aug, 13:42
Kirby Bohling Re: OSGi progress Tue, 04 Aug, 14:30
Otis Gospodnetic (JIRA) [jira] Updated: (NUTCH-746) NutchBeanConstructor does not close NutchBean upon contextDestroyed, causing resource leak in the container. Tue, 04 Aug, 15:04
Otis Gospodnetic (JIRA) [jira] Updated: (NUTCH-738) Close SegmentUpdater when FetchedSegments is closed Tue, 04 Aug, 15:04
ilayaraja serializing and deserializing lucene query Wed, 05 Aug, 05:39
Paul Tomblin Can I add a url to be crawled without putting it in a file and feeding it to "Inject"? Wed, 05 Aug, 16:57
Doğacan Güney About NUTCH-650 (hbase integration) Thu, 06 Aug, 07:53
Andrzej Bialecki Re: About NUTCH-650 (hbase integration) Thu, 06 Aug, 08:07
Marko Bauhardt Re: Can I add a url to be crawled without putting it in a file and feeding it to "Inject"? Thu, 06 Aug, 10:06
Marko Bauhardt (JIRA) [jira] Created: (NUTCH-747) inject&Index metadatas and inherit these metadatas to all matching suburls Thu, 06 Aug, 10:36
Marko Bauhardt (JIRA) [jira] Updated: (NUTCH-747) inject&Index metadatas and inherit these metadatas to all matching suburls Thu, 06 Aug, 10:38
Marko Bauhardt (JIRA) [jira] Commented: (NUTCH-747) inject&Index metadatas and inherit these metadatas to all matching suburls Thu, 06 Aug, 10:46
Sailaja Dhiviti How to enter data in to the Crawldb Fri, 07 Aug, 04:59
Marko Bauhardt Re: How to enter data in to the Crawldb Fri, 07 Aug, 08:28
ranjeet98 How to see System.out.println() values Featcher.java Fri, 07 Aug, 19:18
Marko Bauhardt Re: How to see System.out.println() values Featcher.java Sat, 08 Aug, 11:11
Marko Bauhardt codeformatting Sat, 08 Aug, 11:49
Andrzej Bialecki Re: codeformatting Sat, 08 Aug, 12:05
Marko Bauhardt Re: codeformatting Sat, 08 Aug, 12:15
Apache Wiki [Nutch Wiki] Update of "PublicServers" by ReinierBattenberg Sat, 08 Aug, 12:45
Julien Nioche (JIRA) [jira] Commented: (NUTCH-721) Fetcher2 Slow Sun, 09 Aug, 13:52
Andrzej Bialecki (JIRA) [jira] Commented: (NUTCH-721) Fetcher2 Slow Sun, 09 Aug, 15:14
Marko Bauhardt nutch gui on github Sun, 09 Aug, 18:32
Marko Bauhardt (JIRA) [jira] Commented: (NUTCH-251) Administration GUI Sun, 09 Aug, 18:36
Doğacan Güney (JIRA) [jira] Commented: (NUTCH-721) Fetcher2 Slow Mon, 10 Aug, 08:14
Julien Nioche (JIRA) [jira] Updated: (NUTCH-721) Fetcher2 Slow Mon, 10 Aug, 12:18
ranjeet98 Re: How to see System.out.println() values Featcher.java Mon, 10 Aug, 17:30
Paul Tomblin Is this a bug? Mon, 10 Aug, 20:27
Paul Tomblin Found a second problem in the same code Mon, 10 Aug, 20:58
Paul Tomblin Why isn't this working? Mon, 10 Aug, 22:05
宫照 fetch failed error 500 Tue, 11 Aug, 02:25
Alex McLintock Re: Why isn't this working? Tue, 11 Aug, 09:35
Alex McLintock Re: fetch failed error 500 Tue, 11 Aug, 09:37
Paul Tomblin Re: Why isn't this working? Tue, 11 Aug, 11:58
宫照 Re: fetch failed error 500 Wed, 12 Aug, 01:44
Julien Nioche (JIRA) [jira] Updated: (NUTCH-679) Fetcher2 implementing Tool Thu, 13 Aug, 14:22
Paul Tomblin My mistake Thu, 13 Aug, 15:26
Doğacan Güney (JIRA) [jira] Commented: (NUTCH-650) Hbase Integration Sun, 16 Aug, 22:28
Ankit Dangi SegmentReader: How to write content to separate multiple files.. Mon, 17 Aug, 09:35
hussam hamdan RE-Crawling Mon, 17 Aug, 09:54
mawanqiang (JIRA) [jira] Created: (NUTCH-748) DiskChecker Could not find Tue, 18 Aug, 06:28
Ankit Dangi SegmentReader: Why Multiple CrawlDatum section for a record.. Tue, 18 Aug, 07:10
Artem Barger Indegree link analysis algorithm. Wed, 19 Aug, 19:34
salima abdulsalam (JIRA) [jira] Created: (NUTCH-749) Fetching the url from crawldb Fri, 21 Aug, 13:38
Doğacan Güney (JIRA) [jira] Closed: (NUTCH-749) Fetching the url from crawldb Fri, 21 Aug, 15:38
ilayar...@rediff.co.in How to use Hbase with Nutch Sun, 23 Aug, 07:09
Doğacan Güney (JIRA) [jira] Closed: (NUTCH-721) Fetcher2 Slow Tue, 25 Aug, 05:47
Fuad Efendi Nutch Performance Improvements Tue, 25 Aug, 16:42
Fuad Efendi RE: Nutch Performance Improvements Tue, 25 Aug, 16:50
Ken Krugler Re: Nutch Performance Improvements Tue, 25 Aug, 17:12
Julien Nioche (JIRA) [jira] Commented: (NUTCH-696) Timeout for Parser Fri, 28 Aug, 13:28
Julien Nioche (JIRA) [jira] Closed: (NUTCH-696) Timeout for Parser Fri, 28 Aug, 13:28
Alexey Torochkov Title inside body Fri, 28 Aug, 14:39
Julien Nioche (JIRA) [jira] Commented: (NUTCH-702) Lazy Instanciation of Metadata in CrawlDatum Fri, 28 Aug, 14:58
Julien Nioche (JIRA) [jira] Commented: (NUTCH-702) Lazy Instanciation of Metadata in CrawlDatum Fri, 28 Aug, 15:01
Fuad Efendi RE: Title inside body Fri, 28 Aug, 15:39
Alexey Torochkov Re: Title inside body Fri, 28 Aug, 18:07
Magnús Skúlason Re: Title inside body Fri, 28 Aug, 19:42
Fuad Efendi RE: Title inside body Fri, 28 Aug, 20:01
Fuad Efendi RE: Title inside body Fri, 28 Aug, 20:09
Magnús Skúlason Re: Title inside body Fri, 28 Aug, 20:44
Fuad Efendi RE: Title inside body Fri, 28 Aug, 21:34
Alexey Torochkov Re: Title inside body Fri, 28 Aug, 21:49
Fuad Efendi RE: Title inside body Fri, 28 Aug, 22:54
Alexey Torochkov (JIRA) [jira] Created: (NUTCH-750) HtmlParser plugin - page title extraction Sat, 29 Aug, 09:21
Alexey Torochkov (JIRA) [jira] Updated: (NUTCH-750) HtmlParser plugin - page title extraction Sat, 29 Aug, 09:23
Alexey Torochkov Re: Title inside body Sat, 29 Aug, 09:34
Marko Bauhardt graphical user interface v0.1 for nutch Mon, 31 Aug, 08:29
Marko Bauhardt (JIRA) [jira] Issue Comment Edited: (NUTCH-251) Administration GUI Mon, 31 Aug, 12:17