Mailing list archives: January 2009

Doğacan Güney (JIRA) [jira] Closed: (NUTCH-572) Scoring and redirected Urls Tue, 20 Jan, 15:58
Pau Nutch ScoringFilter plugin problems Tue, 20 Jan, 17:18
Otis Gospodnetic (JIRA) [jira] Commented: (NUTCH-679) Fetcher2 implementing Tool Tue, 20 Jan, 17:44
Otis Gospodnetic Re: [jira] Created: (NUTCH-680) Update external jars to latest versions Tue, 20 Jan, 17:48
Doğacan Güney Re: [jira] Created: (NUTCH-680) Update external jars to latest versions Tue, 20 Jan, 18:13
Otis Gospodnetic Re: [jira] Created: (NUTCH-680) Update external jars to latest versions Tue, 20 Jan, 20:35
Doğacan Güney Re: [jira] Created: (NUTCH-680) Update external jars to latest versions Tue, 20 Jan, 20:40
Doğacan Güney (JIRA) [jira] Closed: (NUTCH-661) errors when the uri contains space characters Tue, 20 Jan, 20:48
Doğacan Güney (JIRA) [jira] Commented: (NUTCH-669) Consolidate code for Fetcher and Fetcher2 Tue, 20 Jan, 21:08
Doğacan Güney (JIRA) [jira] Updated: (NUTCH-676) MapWritable is written inefficiently and confusingly Tue, 20 Jan, 21:34
Piotr Kosiorowski Re: [jira] Created: (NUTCH-680) Update external jars to latest versions Tue, 20 Jan, 21:35
Otis Gospodnetic Re: [jira] Created: (NUTCH-680) Update external jars to latest versions Tue, 20 Jan, 21:39
Piotr Kosiorowski Re: [jira] Created: (NUTCH-680) Update external jars to latest versions Tue, 20 Jan, 22:01
Wildan Maulana (JIRA) [jira] Resolved: (NUTCH-681) parse-mp3 compilation problem Wed, 21 Jan, 08:30
Doğacan Güney Re: Nutch ScoringFilter plugin problems Wed, 21 Jan, 08:47
Pau Re: Nutch ScoringFilter plugin problems Wed, 21 Jan, 09:16
julien nioche (JIRA) [jira] Commented: (NUTCH-679) Fetcher2 implementing Tool Wed, 21 Jan, 10:52
Doğacan Güney (JIRA) [jira] Reopened: (NUTCH-681) parse-mp3 compilation problem Wed, 21 Jan, 13:00
Doğacan Güney (JIRA) [jira] Closed: (NUTCH-681) parse-mp3 compilation problem Wed, 21 Jan, 13:12
Doğacan Güney (JIRA) [jira] Updated: (NUTCH-677) Segment merge filtering based on segment content Wed, 21 Jan, 14:59
Doğacan Güney (JIRA) [jira] Updated: (NUTCH-664) Possibility to update already stored documents. Wed, 21 Jan, 15:02
Doğacan Güney (JIRA) [jira] Commented: (NUTCH-655) Injecting Crawl metadata Wed, 21 Jan, 15:03
Doğacan Güney (JIRA) [jira] Updated: (NUTCH-650) Hbase Integration Wed, 21 Jan, 15:05
Doğacan Güney (JIRA) [jira] Updated: (NUTCH-676) MapWritable is written inefficiently and confusingly Wed, 21 Jan, 15:13
Doğacan Güney (JIRA) [jira] Commented: (NUTCH-644) RTF parser doesn't compile anymore Wed, 21 Jan, 15:21
Todd Lipcon (JIRA) [jira] Commented: (NUTCH-676) MapWritable is written inefficiently and confusingly Wed, 21 Jan, 15:21
Doğacan Güney (JIRA) [jira] Updated: (NUTCH-628) Host database to keep track of host-level information Wed, 21 Jan, 15:25
Todd Lipcon (JIRA) [jira] Commented: (NUTCH-676) MapWritable is written inefficiently and confusingly Wed, 21 Jan, 17:45
Doğacan Güney (JIRA) [jira] Commented: (NUTCH-676) MapWritable is written inefficiently and confusingly Wed, 21 Jan, 19:17
Doğacan Güney (JIRA) [jira] Closed: (NUTCH-676) MapWritable is written inefficiently and confusingly Wed, 21 Jan, 19:27
Doğacan Güney (JIRA) [jira] Closed: (NUTCH-579) Feed plugin only indexes one post per feed due to identical digest Wed, 21 Jan, 19:43
Wildan Maulana (JIRA) [jira] Commented: (NUTCH-681) parse-mp3 compilation problem Thu, 22 Jan, 03:31
Hudson (JIRA) [jira] Commented: (NUTCH-676) MapWritable is written inefficiently and confusingly Thu, 22 Jan, 04:15
Hudson (JIRA) [jira] Commented: (NUTCH-681) parse-mp3 compilation problem Thu, 22 Jan, 04:15
Hudson (JIRA) [jira] Commented: (NUTCH-579) Feed plugin only indexes one post per feed due to identical digest Thu, 22 Jan, 04:15
Vimal Varghese Re: login failed exception Thu, 22 Jan, 04:50
Stefano Tauriello (JIRA) [jira] Commented: (NUTCH-386) Plugin to index categories by url rules Thu, 22 Jan, 10:42
Beaucarnea (JIRA) [jira] Commented: (NUTCH-386) Plugin to index categories by url rules Thu, 22 Jan, 11:31
Stefano Tauriello (JIRA) [jira] Commented: (NUTCH-386) Plugin to index categories by url rules Thu, 22 Jan, 12:01
Otis Gospodnetic (JIRA) [jira] Commented: (NUTCH-655) Injecting Crawl metadata Thu, 22 Jan, 20:37
Otis Gospodnetic (JIRA) [jira] Commented: (NUTCH-628) Host database to keep track of host-level information Thu, 22 Jan, 20:51
Doğacan Güney Re: [jira] Created: (NUTCH-680) Update external jars to latest versions Fri, 23 Jan, 10:01
Doğacan Güney (JIRA) [jira] Updated: (NUTCH-655) Injecting Crawl metadata Fri, 23 Jan, 10:54
Doğacan Güney (JIRA) [jira] Commented: (NUTCH-666) Analysis plugins for multiple language and new Language Identifier Tool Fri, 23 Jan, 10:54
Doğacan Güney (JIRA) [jira] Commented: (NUTCH-628) Host database to keep track of host-level information Fri, 23 Jan, 10:57
Dennis Kubes (JIRA) [jira] Updated: (NUTCH-666) Analysis plugins for multiple language and new Language Identifier Tool Fri, 23 Jan, 11:18
Dennis Kubes (JIRA) [jira] Commented: (NUTCH-666) Analysis plugins for multiple language and new Language Identifier Tool Fri, 23 Jan, 11:18
Doğacan Güney (JIRA) [jira] Commented: (NUTCH-673) Upgrade the Carrot2 plug-in to release 3.0 Fri, 23 Jan, 11:28
Stefano Tauriello (JIRA) [jira] Commented: (NUTCH-386) Plugin to index categories by url rules Fri, 23 Jan, 16:03
Otis Gospodnetic (JIRA) [jira] Commented: (NUTCH-666) Analysis plugins for multiple language and new Language Identifier Tool Fri, 23 Jan, 22:47
Otis Gospodnetic (JIRA) [jira] Commented: (NUTCH-628) Host database to keep track of host-level information Fri, 23 Jan, 22:49
Doğacan Güney (JIRA) [jira] Issue Comment Edited: (NUTCH-680) Update external jars to latest versions Sat, 24 Jan, 10:29
Doğacan Güney (JIRA) [jira] Commented: (NUTCH-680) Update external jars to latest versions Sat, 24 Jan, 10:29
Doğacan Güney (JIRA) [jira] Updated: (NUTCH-628) Host database to keep track of host-level information Sat, 24 Jan, 10:47
Hudson (JIRA) [jira] Commented: (NUTCH-680) Update external jars to latest versions Sun, 25 Jan, 04:15
Doğacan Güney (JIRA) [jira] Closed: (NUTCH-675) Reduce tasks do not report their status and are killed by jobtracker Sun, 25 Jan, 11:39
Doğacan Güney (JIRA) [jira] Closed: (NUTCH-660) Does anybody know how to let nutch crawl this kind of website? Sun, 25 Jan, 11:41
Doğacan Güney (JIRA) [jira] Closed: (NUTCH-627) Minimize host address lookup Sun, 25 Jan, 11:41
Doğacan Güney (JIRA) [jira] Closed: (NUTCH-588) Help Need Sun, 25 Jan, 11:41
Doğacan Güney (JIRA) [jira] Closed: (NUTCH-611) Upgrade Nutch to use Hadoop 0.16 Sun, 25 Jan, 11:41
Doğacan Güney (JIRA) [jira] Closed: (NUTCH-574) Including inlink anchor text in index can create irrelevant search results. Sun, 25 Jan, 11:43
Doğacan Güney (JIRA) [jira] Closed: (NUTCH-567) Proper (?) handling of URIs in TagSoup. Sun, 25 Jan, 11:45
Pau Re: Nutch ScoringFilter plugin problems Mon, 26 Jan, 12:17
Doğacan Güney Re: Nutch ScoringFilter plugin problems Mon, 26 Jan, 15:58
Apache Wiki [Nutch Wiki] Update of "Mailing" by GrantIngersoll Mon, 26 Jan, 16:32
Apache Wiki [Nutch Wiki] Update of "Mailing" by GrantIngersoll Mon, 26 Jan, 16:33
Doğacan Güney (JIRA) [jira] Commented: (NUTCH-650) Hbase Integration Mon, 26 Jan, 20:29
Doğacan Güney (JIRA) [jira] Commented: (NUTCH-680) Update external jars to latest versions Tue, 27 Jan, 10:22
Doğacan Güney (JIRA) [jira] Commented: (NUTCH-628) Host database to keep track of host-level information Tue, 27 Jan, 18:02
Hudson (JIRA) [jira] Commented: (NUTCH-628) Host database to keep track of host-level information Wed, 28 Jan, 04:17
Hudson (JIRA) [jira] Commented: (NUTCH-680) Update external jars to latest versions Wed, 28 Jan, 04:17
Marko Bauhardt Release 1.0? Wed, 28 Jan, 08:45
Doğacan Güney (JIRA) [jira] Updated: (NUTCH-626) fetcher2 breaks out the domain with db.ignore.external.links set at cross domain redirects Wed, 28 Jan, 11:01
Doğacan Güney (JIRA) [jira] Closed: (NUTCH-571) parse-mp3 plugin doesn't always index album of mp3 Wed, 28 Jan, 11:35
Doğacan Güney (JIRA) [jira] Commented: (NUTCH-643) ClassCastException in PdfParser on encrypted PDF with empty password Wed, 28 Jan, 11:38
Guillaume Smet (JIRA) [jira] Commented: (NUTCH-643) ClassCastException in PdfParser on encrypted PDF with empty password Wed, 28 Jan, 12:13
Andrzej Bialecki (JIRA) [jira] Commented: (NUTCH-643) ClassCastException in PdfParser on encrypted PDF with empty password Wed, 28 Jan, 12:40
Andrzej Bialecki (JIRA) [jira] Commented: (NUTCH-643) ClassCastException in PdfParser on encrypted PDF with empty password Wed, 28 Jan, 12:59
Doğacan Güney (JIRA) [jira] Commented: (NUTCH-643) ClassCastException in PdfParser on encrypted PDF with empty password Wed, 28 Jan, 13:11
Doğacan Güney (JIRA) [jira] Closed: (NUTCH-680) Update external jars to latest versions Wed, 28 Jan, 14:13
Otis Gospodnetic (JIRA) [jira] Commented: (NUTCH-628) Host database to keep track of host-level information Wed, 28 Jan, 20:10
Doğacan Güney (JIRA) [jira] Commented: (NUTCH-628) Host database to keep track of host-level information Wed, 28 Jan, 20:16
Andrzej Bialecki (JIRA) [jira] Commented: (NUTCH-628) Host database to keep track of host-level information Wed, 28 Jan, 21:10
Doğacan Güney (JIRA) [jira] Commented: (NUTCH-628) Host database to keep track of host-level information Wed, 28 Jan, 21:24
Hudson (JIRA) [jira] Commented: (NUTCH-571) parse-mp3 plugin doesn't always index album of mp3 Thu, 29 Jan, 04:17
Sami Siren Registration for ApacheCon Europe 2009 is now open! Thu, 29 Jan, 10:18
julien nioche (JIRA) [jira] Created: (NUTCH-682) SOLR indexer does not set boost on the document Thu, 29 Jan, 18:53
Doğacan Güney (JIRA) [jira] Closed: (NUTCH-682) SOLR indexer does not set boost on the document Thu, 29 Jan, 19:13
Doğacan Güney (JIRA) [jira] Created: (NUTCH-683) NUTCH-676 broke CrawlDbMerger Thu, 29 Jan, 19:45
Doğacan Güney (JIRA) [jira] Updated: (NUTCH-683) NUTCH-676 broke CrawlDbMerger Thu, 29 Jan, 19:45
Raghavendra Neelekani Re: [jira] Updated: (NUTCH-683) NUTCH-676 broke CrawlDbMerger Thu, 29 Jan, 19:58
Hudson (JIRA) [jira] Commented: (NUTCH-682) SOLR indexer does not set boost on the document Fri, 30 Jan, 04:20
Doğacan Güney (JIRA) [jira] Created: (NUTCH-684) Dedup support for Solr Fri, 30 Jan, 16:35
Doğacan Güney (JIRA) [jira] Updated: (NUTCH-684) Dedup support for Solr Fri, 30 Jan, 16:36
Raghavendra Neelekani Re: [jira] Created: (NUTCH-683) NUTCH-676 broke CrawlDbMerger Fri, 30 Jan, 18:17
Grease Re: [jira] Created: (NUTCH-633) ParseSegment no longer allow reparsing Sat, 31 Jan, 05:44
Raagu writing plugin Sat, 31 Jan, 09:18