| Isabel Drost |
Hadoop Get Together @ Berlin |
Mon, 02 Feb, 06:51 |
| Marko Bauhardt |
Re: Release 1.0? |
Mon, 02 Feb, 14:25 |
| Andrzej Bialecki |
Re: Release 1.0? |
Mon, 02 Feb, 16:36 |
| Chris A. Mattmann (JIRA) |
[jira] Assigned: (NUTCH-631) MoreIndexingFilter fails with NoSuchElementException |
Mon, 02 Feb, 17:08 |
| Chris A. Mattmann (JIRA) |
[jira] Work started: (NUTCH-631) MoreIndexingFilter fails with NoSuchElementException |
Mon, 02 Feb, 17:09 |
| Marko Bauhardt |
Re: Release 1.0? |
Tue, 03 Feb, 08:24 |
| julien nioche (JIRA) |
[jira] Closed: (NUTCH-656) DeleteDuplicates based on crawlDB only |
Tue, 03 Feb, 10:39 |
| Andrzej Bialecki (JIRA) |
[jira] Closed: (NUTCH-518) Fix OpicScoringFilter to respect scoring filter chaining |
Tue, 03 Feb, 13:18 |
| Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-518) Fix OpicScoringFilter to respect scoring filter chaining |
Tue, 03 Feb, 13:18 |
| Andrzej Bialecki (JIRA) |
[jira] Closed: (NUTCH-353) pages that serverside forwards will be refetched every time |
Tue, 03 Feb, 13:19 |
| Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-353) pages that serverside forwards will be refetched every time |
Tue, 03 Feb, 13:19 |
| Andrzej Bialecki (JIRA) |
[jira] Updated: (NUTCH-558) Need tool to retrieve domain statistics |
Tue, 03 Feb, 13:25 |
| Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-558) Need tool to retrieve domain statistics |
Tue, 03 Feb, 13:25 |
| Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-279) Additions for regex-normalize |
Tue, 03 Feb, 15:17 |
| Andrzej Bialecki (JIRA) |
[jira] Closed: (NUTCH-279) Additions for regex-normalize |
Tue, 03 Feb, 15:17 |
| Andrzej Bialecki (JIRA) |
[jira] Updated: (NUTCH-92) DistributedSearch incorrectly scores results |
Tue, 03 Feb, 15:31 |
| Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-92) DistributedSearch incorrectly scores results |
Tue, 03 Feb, 15:31 |
| Andrzej Bialecki (JIRA) |
[jira] Closed: (NUTCH-671) JSP errors in Nutch searcher webapp running with Tomcat 6 |
Tue, 03 Feb, 15:46 |
| Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-671) JSP errors in Nutch searcher webapp running with Tomcat 6 |
Tue, 03 Feb, 15:46 |
| Hudson (JIRA) |
[jira] Commented: (NUTCH-671) JSP errors in Nutch searcher webapp running with Tomcat 6 |
Wed, 04 Feb, 04:11 |
| Hudson (JIRA) |
[jira] Commented: (NUTCH-279) Additions for regex-normalize |
Wed, 04 Feb, 04:11 |
| Andrzej Bialecki (JIRA) |
[jira] Created: (NUTCH-685) Content-level redirect status lost in ParseSegment |
Fri, 06 Feb, 10:11 |
| Andrzej Bialecki (JIRA) |
[jira] Closed: (NUTCH-643) ClassCastException in PdfParser on encrypted PDF with empty password |
Fri, 06 Feb, 13:13 |
| Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-643) ClassCastException in PdfParser on encrypted PDF with empty password |
Fri, 06 Feb, 13:13 |
| Andrzej Bialecki (JIRA) |
[jira] Closed: (NUTCH-636) Http client plug-in https doesn't work on IBM JRE |
Fri, 06 Feb, 13:17 |
| Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-636) Http client plug-in https doesn't work on IBM JRE |
Fri, 06 Feb, 13:20 |
| Andrzej Bialecki (JIRA) |
[jira] Updated: (NUTCH-251) Administration GUI |
Fri, 06 Feb, 13:21 |
| Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-251) Administration GUI |
Fri, 06 Feb, 13:21 |
| Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-563) Include custom fields in BasicQueryFilter |
Fri, 06 Feb, 13:29 |
| Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-469) changes to geoPosition plugin to make it work on nutch 0.9 |
Fri, 06 Feb, 13:31 |
| Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-673) Upgrade the Carrot2 plug-in to release 3.0 |
Fri, 06 Feb, 13:35 |
| Andrzej Bialecki (JIRA) |
[jira] Updated: (NUTCH-673) Upgrade the Carrot2 plug-in to release 3.0 |
Fri, 06 Feb, 13:35 |
| Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-683) NUTCH-676 broke CrawlDbMerger |
Fri, 06 Feb, 13:35 |
| Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-261) Multi Language Support |
Fri, 06 Feb, 13:43 |
| Andrzej Bialecki (JIRA) |
[jira] Closed: (NUTCH-261) Multi Language Support |
Fri, 06 Feb, 13:43 |
| Andrzej Bialecki (JIRA) |
[jira] Closed: (NUTCH-357) crawling simulation |
Fri, 06 Feb, 13:45 |
| Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-357) crawling simulation |
Fri, 06 Feb, 13:45 |
| Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-455) dedup on tokenized fields is faulty |
Fri, 06 Feb, 13:52 |
| Andrzej Bialecki (JIRA) |
[jira] Updated: (NUTCH-455) dedup on tokenized fields is faulty |
Fri, 06 Feb, 13:52 |
| Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-262) Summary excerpts and highlights problems |
Fri, 06 Feb, 14:13 |
| Andrzej Bialecki (JIRA) |
[jira] Closed: (NUTCH-262) Summary excerpts and highlights problems |
Fri, 06 Feb, 14:13 |
| Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-479) Support for OR queries |
Fri, 06 Feb, 14:14 |
| Andrzej Bialecki (JIRA) |
[jira] Updated: (NUTCH-479) Support for OR queries |
Fri, 06 Feb, 14:14 |
| Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-74) French Analyzer Plugin |
Fri, 06 Feb, 14:17 |
| Andrzej Bialecki (JIRA) |
[jira] Closed: (NUTCH-74) French Analyzer Plugin |
Fri, 06 Feb, 14:17 |
| Hudson (JIRA) |
[jira] Commented: (NUTCH-636) Http client plug-in https doesn't work on IBM JRE |
Sat, 07 Feb, 04:12 |
| Hudson (JIRA) |
[jira] Commented: (NUTCH-643) ClassCastException in PdfParser on encrypted PDF with empty password |
Sat, 07 Feb, 04:12 |
| Techie |
Re: writing plugin |
Sun, 08 Feb, 12:28 |
| 程越强 |
RPC timeout |
Tue, 10 Feb, 02:53 |
| OpenTeam.ru (JIRA) |
[jira] Created: (NUTCH-686) Russian Analysis Plugin |
Tue, 10 Feb, 05:20 |
| OpenTeam.ru (JIRA) |
[jira] Updated: (NUTCH-686) Russian Analysis Plugin |
Tue, 10 Feb, 05:20 |
| OpenTeam.ru (JIRA) |
[jira] Closed: (NUTCH-686) Russian Analysis Plugin |
Tue, 10 Feb, 05:30 |
| julien nioche (JIRA) |
[jira] Updated: (NUTCH-563) Include custom fields in BasicQueryFilter |
Tue, 10 Feb, 10:33 |
| Apache Wiki |
[Nutch Wiki] Update of "RunNutchInEclipse0.9" by FrankMcCown |
Tue, 10 Feb, 17:16 |
| Doğacan Güney (JIRA) |
[jira] Closed: (NUTCH-683) NUTCH-676 broke CrawlDbMerger |
Wed, 11 Feb, 09:14 |
| Apache Wiki |
[Nutch Wiki] Trivial Update of "RunNutchInEclipse0.9" by FrankMcCown |
Wed, 11 Feb, 18:04 |
| Apache Wiki |
[Nutch Wiki] Update of "GettingNutchRunningWithWindows" by FrankMcCown |
Wed, 11 Feb, 18:25 |
| Hudson (JIRA) |
[jira] Commented: (NUTCH-683) NUTCH-676 broke CrawlDbMerger |
Thu, 12 Feb, 04:13 |
| Hudson (JIRA) |
[jira] Commented: (NUTCH-676) MapWritable is written inefficiently and confusingly |
Thu, 12 Feb, 04:13 |
| Apache Wiki |
[Nutch Wiki] Update of "IntranetRecrawl" by SAnand |
Thu, 12 Feb, 12:39 |
| julien nioche (JIRA) |
[jira] Commented: (NUTCH-668) Domain URL Filter |
Thu, 12 Feb, 16:23 |
| Eric J. Christeson |
NTCH-635 LinkAnalysis Tool for Nutch |
Fri, 13 Feb, 00:05 |
| Pradeep Pujari |
Re: NTCH-635 LinkAnalysis Tool for Nutch |
Fri, 13 Feb, 01:07 |
| Frank McCown |
Support for Sitemap Protocol and Canonical URLs |
Mon, 16 Feb, 17:28 |
| Apache Hudson Server |
Build failed in Hudson: Nutch-trunk #727 |
Tue, 17 Feb, 04:02 |
| Andrzej Bialecki |
Re: Support for Sitemap Protocol and Canonical URLs |
Tue, 17 Feb, 07:58 |
| Sami Siren (JIRA) |
[jira] Updated: (NUTCH-631) MoreIndexingFilter fails with NoSuchElementException |
Tue, 17 Feb, 13:05 |
| Sami Siren (JIRA) |
[jira] Created: (NUTCH-687) Add RAT |
Tue, 17 Feb, 14:01 |
| Sami Siren (JIRA) |
[jira] Updated: (NUTCH-687) Add RAT |
Tue, 17 Feb, 14:01 |
| Sami Siren (JIRA) |
[jira] Created: (NUTCH-688) Fix missing/wrong headers in source files |
Tue, 17 Feb, 14:05 |
| Sami Siren (JIRA) |
[jira] Commented: (NUTCH-688) Fix missing/wrong headers in source files |
Tue, 17 Feb, 14:05 |
| Chris A. Mattmann (JIRA) |
[jira] Commented: (NUTCH-631) MoreIndexingFilter fails with NoSuchElementException |
Tue, 17 Feb, 14:16 |
| Sami Siren (JIRA) |
[jira] Resolved: (NUTCH-631) MoreIndexingFilter fails with NoSuchElementException |
Tue, 17 Feb, 14:31 |
| Sami Siren (JIRA) |
[jira] Commented: (NUTCH-609) Allow Plugins to be Loaded from Jar File(s) |
Tue, 17 Feb, 14:37 |
| hasan (JIRA) |
[jira] Commented: (NUTCH-631) MoreIndexingFilter fails with NoSuchElementException |
Tue, 17 Feb, 15:49 |
| Sami Siren (JIRA) |
[jira] Resolved: (NUTCH-582) Add missing type parameters |
Tue, 17 Feb, 18:45 |
| Sami Siren (JIRA) |
[jira] Updated: (NUTCH-86) LanguageIdentifier API enhancements |
Tue, 17 Feb, 19:03 |
| Sami Siren (JIRA) |
[jira] Updated: (NUTCH-609) Allow Plugins to be Loaded from Jar File(s) |
Tue, 17 Feb, 19:04 |
| Sami Siren (JIRA) |
[jira] Updated: (NUTCH-469) changes to geoPosition plugin to make it work on nutch 0.9 |
Tue, 17 Feb, 19:06 |
| Sami Siren (JIRA) |
[jira] Updated: (NUTCH-309) Uses commons logging Code Guards |
Tue, 17 Feb, 19:06 |
| Sami Siren (JIRA) |
[jira] Updated: (NUTCH-310) Review Log Levels |
Tue, 17 Feb, 19:40 |
| Sami Siren (JIRA) |
[jira] Updated: (NUTCH-249) black- white list url filtering |
Tue, 17 Feb, 19:40 |
| Peter Sparks (JIRA) |
[jira] Created: (NUTCH-689) Swf parser doesn't seem to handle relative links |
Tue, 17 Feb, 20:54 |
| Peter Sparks (JIRA) |
[jira] Updated: (NUTCH-689) Swf parser doesn't seem to handle relative links |
Tue, 17 Feb, 20:58 |
| Peter Sparks (JIRA) |
[jira] Created: (NUTCH-690) bug in DomContentUtils.shouldThrowAwayLink? |
Tue, 17 Feb, 21:08 |
| Sami Siren (JIRA) |
[jira] Commented: (NUTCH-689) Swf parser doesn't seem to handle relative links |
Tue, 17 Feb, 21:14 |
| Peter Sparks (JIRA) |
[jira] Updated: (NUTCH-689) Swf parser doesn't seem to handle relative links |
Tue, 17 Feb, 22:01 |
| Peter Sparks (JIRA) |
[jira] Updated: (NUTCH-689) Swf parser doesn't seem to handle relative links |
Tue, 17 Feb, 22:01 |
| Apache Hudson Server |
Hudson build is back to normal: Nutch-trunk #728 |
Wed, 18 Feb, 04:14 |
| Dmitry Lihachev (JIRA) |
[jira] Created: (NUTCH-691) Update jakarta poi jars to the most relevant version |
Wed, 18 Feb, 04:32 |
| Dmitry Lihachev (JIRA) |
[jira] Updated: (NUTCH-691) Update jakarta poi jars to the most relevant version |
Wed, 18 Feb, 04:36 |
| Dmitry Lihachev (JIRA) |
[jira] Updated: (NUTCH-691) Update jakarta poi jars to the most relevant version |
Wed, 18 Feb, 04:36 |
| Dmitry Lihachev (JIRA) |
[jira] Updated: (NUTCH-691) Update jakarta poi jars to the most relevant version |
Wed, 18 Feb, 04:38 |
| Dmitry Lihachev (JIRA) |
[jira] Updated: (NUTCH-691) Update jakarta poi jars to the most relevant version |
Wed, 18 Feb, 05:02 |
| Dmitry Lihachev (JIRA) |
[jira] Updated: (NUTCH-691) Update jakarta poi jars to the most relevant version |
Wed, 18 Feb, 05:02 |
| Dmitry Lihachev (JIRA) |
[jira] Commented: (NUTCH-691) Update jakarta poi jars to the most relevant version |
Wed, 18 Feb, 05:35 |
| Dmitry Lihachev (JIRA) |
[jira] Issue Comment Edited: (NUTCH-691) Update jakarta poi jars to the most relevant version |
Wed, 18 Feb, 05:41 |
| Dmitry Lihachev (JIRA) |
[jira] Commented: (NUTCH-591) StringIndexOutOfBoundsException when extracting text from a Word document. |
Wed, 18 Feb, 05:59 |
| Dmitry Lihachev (JIRA) |
[jira] Updated: (NUTCH-691) Update jakarta poi jars to the most relevant version |
Wed, 18 Feb, 06:05 |
| Sami Siren (JIRA) |
[jira] Resolved: (NUTCH-687) Add RAT |
Wed, 18 Feb, 08:13 |