Mailing list archives: February 2009

Dmitry Lihachev (JIRA) [jira] Updated: (NUTCH-691) Update jakarta poi jars to the most relevant version Wed, 18 Feb, 06:05
Dmitry Lihachev (JIRA) [jira] Created: (NUTCH-695) incorrect mime type detection by MoreIndexingFilter plugin Thu, 19 Feb, 10:05
Dmitry Lihachev (JIRA) [jira] Updated: (NUTCH-695) incorrect mime type detection by MoreIndexingFilter plugin Thu, 19 Feb, 10:11
Dmitry Lihachev (JIRA) [jira] Updated: (NUTCH-695) incorrect mime type detection by MoreIndexingFilter plugin Thu, 19 Feb, 10:13
Dmitry Lihachev (JIRA) [jira] Updated: (NUTCH-695) incorrect mime type detection by MoreIndexingFilter plugin Thu, 19 Feb, 10:15
Dmitry Lihachev (JIRA) [jira] Updated: (NUTCH-695) incorrect mime type detection by MoreIndexingFilter plugin Thu, 19 Feb, 10:15
Dmitry Lihachev (JIRA) [jira] Issue Comment Edited: (NUTCH-695) incorrect mime type detection by MoreIndexingFilter plugin Thu, 19 Feb, 10:17
Dmitry Lihachev (JIRA) [jira] Issue Comment Edited: (NUTCH-695) incorrect mime type detection by MoreIndexingFilter plugin Thu, 19 Feb, 10:17
Dmitry Lihachev (JIRA) [jira] Commented: (NUTCH-695) incorrect mime type detection by MoreIndexingFilter plugin Thu, 19 Feb, 10:30
Dmitry Lihachev (JIRA) [jira] Commented: (NUTCH-684) Dedup support for Solr Fri, 20 Feb, 04:10
Dmitry Lihachev (JIRA) [jira] Updated: (NUTCH-684) Dedup support for Solr Fri, 20 Feb, 04:10
Dmitry Lihachev (JIRA) [jira] Updated: (NUTCH-684) Dedup support for Solr Fri, 20 Feb, 06:41
Dmitry Lihachev (JIRA) [jira] Issue Comment Edited: (NUTCH-684) Dedup support for Solr Fri, 20 Feb, 06:41
Dmitry Lihachev (JIRA) [jira] Updated: (NUTCH-684) Dedup support for Solr Fri, 20 Feb, 06:51
Dmitry Lihachev (JIRA) [jira] Updated: (NUTCH-684) Dedup support for Solr Fri, 20 Feb, 06:51
Dmitry Lihachev (JIRA) [jira] Updated: (NUTCH-697) Generate log output for solr indexer and dedup Fri, 20 Feb, 08:11
Dmitry Lihachev (JIRA) [jira] Created: (NUTCH-697) Generate log output for solr indexer and dedup Fri, 20 Feb, 08:11
Dmitry Lihachev (JIRA) [jira] Commented: (NUTCH-684) Dedup support for Solr Fri, 20 Feb, 10:11
Dmitry Lihachev (JIRA) [jira] Issue Comment Edited: (NUTCH-684) Dedup support for Solr Fri, 20 Feb, 10:11
Dmitry Lihachev (JIRA) [jira] Commented: (NUTCH-699) Add an "official" solr schema for solr integration Fri, 20 Feb, 10:49
Dmitry Lihachev (JIRA) [jira] Commented: (NUTCH-644) RTF parser doesn't compile anymore Tue, 24 Feb, 09:34
Dmitry Lihachev (JIRA) [jira] Updated: (NUTCH-644) RTF parser doesn't compile anymore Tue, 24 Feb, 09:38
Dmitry Lihachev (JIRA) [jira] Updated: (NUTCH-644) RTF parser doesn't compile anymore Tue, 24 Feb, 10:36
Dmitry Lihachev (JIRA) [jira] Created: (NUTCH-705) parse-rtf plugin Fri, 27 Feb, 04:18
Dmitry Lihachev (JIRA) [jira] Commented: (NUTCH-705) parse-rtf plugin Fri, 27 Feb, 04:18
Dmitry Lihachev (JIRA) [jira] Updated: (NUTCH-705) parse-rtf plugin Fri, 27 Feb, 04:30
Dmitry Lihachev (JIRA) [jira] Commented: (NUTCH-644) RTF parser doesn't compile anymore Fri, 27 Feb, 04:32
Doug Cook (JIRA) [jira] Commented: (NUTCH-419) unavailable robots.txt kills fetch Sat, 28 Feb, 19:06
Doug Cook (JIRA) [jira] Updated: (NUTCH-419) unavailable robots.txt kills fetch Sat, 28 Feb, 19:20
Dr. Nadine Hochstotter (JIRA) [jira] Created: (NUTCH-694) Distributed Search Server fails Thu, 19 Feb, 08:39
Dr. Nadine Hochstotter (JIRA) [jira] Commented: (NUTCH-694) Distributed Search Server fails Thu, 19 Feb, 10:52
Dr. Nadine Hochstotter (JIRA) [jira] Commented: (NUTCH-694) Distributed Search Server fails Thu, 19 Feb, 17:32
Dr. Nadine Hochstotter (JIRA) [jira] Commented: (NUTCH-694) Distributed Search Server fails Fri, 20 Feb, 14:51
Eric J. Christeson NTCH-635 LinkAnalysis Tool for Nutch Fri, 13 Feb, 00:05
Frank McCown Support for Sitemap Protocol and Canonical URLs Mon, 16 Feb, 17:28
Gopikrishnan (JIRA) [jira] Commented: (NUTCH-185) XMLParser is configurable xml parser plugin. Fri, 27 Feb, 06:12
Hudson (JIRA) [jira] Commented: (NUTCH-671) JSP errors in Nutch searcher webapp running with Tomcat 6 Wed, 04 Feb, 04:11
Hudson (JIRA) [jira] Commented: (NUTCH-279) Additions for regex-normalize Wed, 04 Feb, 04:11
Hudson (JIRA) [jira] Commented: (NUTCH-636) Http client plug-in https doesn't work on IBM JRE Sat, 07 Feb, 04:12
Hudson (JIRA) [jira] Commented: (NUTCH-643) ClassCastException in PdfParser on encrypted PDF with empty password Sat, 07 Feb, 04:12
Hudson (JIRA) [jira] Commented: (NUTCH-683) NUTCH-676 broke CrawlDbMerger Thu, 12 Feb, 04:13
Hudson (JIRA) [jira] Commented: (NUTCH-676) MapWritable is written inefficiently and confusingly Thu, 12 Feb, 04:13
Hudson (JIRA) [jira] Commented: (NUTCH-691) Update jakarta poi jars to the most relevant version Thu, 19 Feb, 04:17
Hudson (JIRA) [jira] Commented: (NUTCH-563) Include custom fields in BasicQueryFilter Thu, 19 Feb, 04:17
Hudson (JIRA) [jira] Commented: (NUTCH-688) Fix missing/wrong headers in source files Thu, 19 Feb, 04:17
Hudson (JIRA) [jira] Commented: (NUTCH-687) Add RAT Thu, 19 Feb, 04:17
Hudson (JIRA) [jira] Commented: (NUTCH-695) incorrect mime type detection by MoreIndexingFilter plugin Fri, 20 Feb, 04:22
Hudson (JIRA) [jira] Commented: (NUTCH-694) Distributed Search Server fails Tue, 24 Feb, 04:16
Hudson (JIRA) [jira] Commented: (NUTCH-698) CrawlDb is corrupted after a few crawl cycles Wed, 25 Feb, 04:17
Hudson (JIRA) [jira] Commented: (NUTCH-247) robot parser to restrict. Wed, 25 Feb, 04:17
Hudson (JIRA) [jira] Commented: (NUTCH-626) fetcher2 breaks out the domain with db.ignore.external.links set at cross domain redirects Wed, 25 Feb, 04:17
Hudson (JIRA) [jira] Commented: (NUTCH-703) Upgrade to Hadoop 0.19.1 Sat, 28 Feb, 04:17
Hudson (JIRA) [jira] Commented: (NUTCH-699) Add an "official" solr schema for solr integration Sat, 28 Feb, 04:17
Isabel Drost Hadoop Get Together @ Berlin Mon, 02 Feb, 06:51
Justin Yao would someone help confirm a patch (fix incorrect encoding detection in cached.jsp) Wed, 18 Feb, 18:55
Marko Bauhardt Re: Release 1.0? Mon, 02 Feb, 14:25
Marko Bauhardt Re: Release 1.0? Tue, 03 Feb, 08:24
Meghna Kukreja Url regex normalizer Fri, 27 Feb, 16:32
Meghna Kukreja Re: Url regex normalizer Fri, 27 Feb, 18:50
Meghna Kukreja (JIRA) [jira] Created: (NUTCH-706) Url regex normalizer Fri, 27 Feb, 18:47
Meghna Kukreja (JIRA) [jira] Commented: (NUTCH-706) Url regex normalizer Fri, 27 Feb, 18:49
Michael Chan (JIRA) [jira] Created: (NUTCH-707) Generation of multiple segments in multiple runs returns only 1 segment Sat, 28 Feb, 17:42
Michael Chan (JIRA) [jira] Updated: (NUTCH-707) Generation of multiple segments in multiple runs returns only 1 segment Sat, 28 Feb, 17:44
OpenTeam.ru (JIRA) [jira] Created: (NUTCH-686) Russian Analysis Plugin Tue, 10 Feb, 05:20
OpenTeam.ru (JIRA) [jira] Updated: (NUTCH-686) Russian Analysis Plugin Tue, 10 Feb, 05:20
OpenTeam.ru (JIRA) [jira] Closed: (NUTCH-686) Russian Analysis Plugin Tue, 10 Feb, 05:30
Otis Gospodnetic Re: NutchAnalysis.java STOP_WORDS not configurable? Fri, 27 Feb, 18:21
Otis Gospodnetic (JIRA) [jira] Updated: (NUTCH-707) Generation of multiple segments in multiple runs returns only 1 segment Sat, 28 Feb, 22:44
Peter Sparks (JIRA) [jira] Created: (NUTCH-689) Swf parser doesn't seem to handle relative links Tue, 17 Feb, 20:54
Peter Sparks (JIRA) [jira] Updated: (NUTCH-689) Swf parser doesn't seem to handle relative links Tue, 17 Feb, 20:58
Peter Sparks (JIRA) [jira] Created: (NUTCH-690) bug in DomContentUtils.shouldThrowAwayLink? Tue, 17 Feb, 21:08
Peter Sparks (JIRA) [jira] Updated: (NUTCH-689) Swf parser doesn't seem to handle relative links Tue, 17 Feb, 22:01
Peter Sparks (JIRA) [jira] Updated: (NUTCH-689) Swf parser doesn't seem to handle relative links Tue, 17 Feb, 22:01
Peter Sparks (JIRA) [jira] Commented: (NUTCH-689) Swf parser doesn't seem to handle relative links Wed, 18 Feb, 14:05
Pradeep Pujari Re: NTCH-635 LinkAnalysis Tool for Nutch Fri, 13 Feb, 01:07
Sami Siren dump Fetcher? Wed, 18 Feb, 13:58
Sami Siren Re: would someone help confirm a patch (fix incorrect encoding detection in cached.jsp) Wed, 18 Feb, 20:13
Sami Siren Re: [Nutch Wiki] Update of "InstallingWeb2" by SamiSiren Fri, 20 Feb, 10:10
Sami Siren Re: Url regex normalizer Fri, 27 Feb, 20:18
Sami Siren Re: Release 1.0? Sat, 28 Feb, 08:00
Sami Siren Re: Release 1.0? Sat, 28 Feb, 08:04
Sami Siren planning for nutch-1.0-rc1 Sat, 28 Feb, 08:26
Sami Siren (JIRA) [jira] Updated: (NUTCH-631) MoreIndexingFilter fails with NoSuchElementException Tue, 17 Feb, 13:05
Sami Siren (JIRA) [jira] Created: (NUTCH-687) Add RAT Tue, 17 Feb, 14:01
Sami Siren (JIRA) [jira] Updated: (NUTCH-687) Add RAT Tue, 17 Feb, 14:01
Sami Siren (JIRA) [jira] Created: (NUTCH-688) Fix missing/wrong headers in source files Tue, 17 Feb, 14:05
Sami Siren (JIRA) [jira] Commented: (NUTCH-688) Fix missing/wrong headers in source files Tue, 17 Feb, 14:05
Sami Siren (JIRA) [jira] Resolved: (NUTCH-631) MoreIndexingFilter fails with NoSuchElementException Tue, 17 Feb, 14:31
Sami Siren (JIRA) [jira] Commented: (NUTCH-609) Allow Plugins to be Loaded from Jar File(s) Tue, 17 Feb, 14:37
Sami Siren (JIRA) [jira] Resolved: (NUTCH-582) Add missing type parameters Tue, 17 Feb, 18:45
Sami Siren (JIRA) [jira] Updated: (NUTCH-86) LanguageIdentifier API enhancements Tue, 17 Feb, 19:03
Sami Siren (JIRA) [jira] Updated: (NUTCH-609) Allow Plugins to be Loaded from Jar File(s) Tue, 17 Feb, 19:04
Sami Siren (JIRA) [jira] Updated: (NUTCH-469) changes to geoPosition plugin to make it work on nutch 0.9 Tue, 17 Feb, 19:06
Sami Siren (JIRA) [jira] Updated: (NUTCH-309) Uses commons logging Code Guards Tue, 17 Feb, 19:06
Sami Siren (JIRA) [jira] Updated: (NUTCH-310) Review Log Levels Tue, 17 Feb, 19:40
Sami Siren (JIRA) [jira] Updated: (NUTCH-249) black- white list url filtering Tue, 17 Feb, 19:40
Sami Siren (JIRA) [jira] Commented: (NUTCH-689) Swf parser doesn't seem to handle relative links Tue, 17 Feb, 21:14
Sami Siren (JIRA) [jira] Resolved: (NUTCH-687) Add RAT Wed, 18 Feb, 08:13
Sami Siren (JIRA) [jira] Commented: (NUTCH-689) Swf parser doesn't seem to handle relative links Wed, 18 Feb, 08:33
Sami Siren (JIRA) [jira] Resolved: (NUTCH-591) StringIndexOutOfBoundsException when extracting text from a Word document. Wed, 18 Feb, 08:39