| 程越强 |
RPC timeout |
Tue, 10 Feb, 02:53 |
| Doğacan Güney |
Re: Release 1.0? |
Sat, 28 Feb, 15:05 |
| Doğacan Güney (JIRA) |
[jira] Closed: (NUTCH-683) NUTCH-676 broke CrawlDbMerger |
Wed, 11 Feb, 09:14 |
| Doğacan Güney (JIRA) |
[jira] Commented: (NUTCH-696) Timeout for Parser |
Thu, 19 Feb, 23:22 |
| Doğacan Güney (JIRA) |
[jira] Created: (NUTCH-698) CrawlDb is corrupted after a few crawl cycles |
Fri, 20 Feb, 08:55 |
| Doğacan Güney (JIRA) |
[jira] Updated: (NUTCH-698) CrawlDb is corrupted after a few crawl cycles |
Fri, 20 Feb, 08:55 |
| Doğacan Güney (JIRA) |
[jira] Updated: (NUTCH-698) CrawlDb is corrupted after a few crawl cycles |
Fri, 20 Feb, 08:55 |
| Doğacan Güney (JIRA) |
[jira] Commented: (NUTCH-684) Dedup support for Solr |
Fri, 20 Feb, 10:31 |
| Doğacan Güney (JIRA) |
[jira] Commented: (NUTCH-684) Dedup support for Solr |
Fri, 20 Feb, 10:31 |
| Doğacan Güney (JIRA) |
[jira] Created: (NUTCH-699) Add an "official" solr schema for solr integration |
Fri, 20 Feb, 10:33 |
| Doğacan Güney (JIRA) |
[jira] Commented: (NUTCH-699) Add an "official" solr schema for solr integration |
Fri, 20 Feb, 10:35 |
| Andrew McCall (JIRA) |
[jira] Created: (NUTCH-693) Add configurable option for treating nofollow behaviour. |
Wed, 18 Feb, 21:00 |
| Andrew McCall (JIRA) |
[jira] Updated: (NUTCH-693) Add configurable option for treating nofollow behaviour. |
Wed, 18 Feb, 21:00 |
| Andrew McCall (JIRA) |
[jira] Commented: (NUTCH-650) Hbase Integration |
Thu, 19 Feb, 13:54 |
| Andrew McCall (JIRA) |
[jira] Updated: (NUTCH-650) Hbase Integration |
Thu, 19 Feb, 13:54 |
| Andrew McCall (JIRA) |
[jira] Updated: (NUTCH-650) Hbase Integration |
Sat, 21 Feb, 15:16 |
| Andrew McCall (JIRA) |
[jira] Updated: (NUTCH-650) Hbase Integration |
Mon, 23 Feb, 17:24 |
| Andrzej Bialecki |
Re: Release 1.0? |
Mon, 02 Feb, 16:36 |
| Andrzej Bialecki |
Re: Support for Sitemap Protocol and Canonical URLs |
Tue, 17 Feb, 07:58 |
| Andrzej Bialecki |
Re: [Nutch Wiki] Update of "InstallingWeb2" by SamiSiren |
Fri, 20 Feb, 10:07 |
| Andrzej Bialecki |
Re: [jira] Commented: (NUTCH-703) Upgrade to Hadoop 0.19.1 |
Fri, 27 Feb, 07:28 |
| Andrzej Bialecki |
Re: Url regex normalizer |
Fri, 27 Feb, 17:10 |
| Andrzej Bialecki |
Re: Release 1.0? |
Sat, 28 Feb, 18:43 |
| Andrzej Bialecki |
Re: planning for nutch-1.0-rc1 |
Sat, 28 Feb, 18:48 |
| Andrzej Bialecki |
Re: Release 1.0? |
Sat, 28 Feb, 18:51 |
| Andrzej Bialecki |
Re: planning for nutch-1.0-rc1 |
Sat, 28 Feb, 18:57 |
| Andrzej Bialecki (JIRA) |
[jira] Closed: (NUTCH-518) Fix OpicScoringFilter to respect scoring filter chaining |
Tue, 03 Feb, 13:18 |
| Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-518) Fix OpicScoringFilter to respect scoring filter chaining |
Tue, 03 Feb, 13:18 |
| Andrzej Bialecki (JIRA) |
[jira] Closed: (NUTCH-353) pages that serverside forwards will be refetched every time |
Tue, 03 Feb, 13:19 |
| Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-353) pages that serverside forwards will be refetched every time |
Tue, 03 Feb, 13:19 |
| Andrzej Bialecki (JIRA) |
[jira] Updated: (NUTCH-558) Need tool to retrieve domain statistics |
Tue, 03 Feb, 13:25 |
| Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-558) Need tool to retrieve domain statistics |
Tue, 03 Feb, 13:25 |
| Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-279) Additions for regex-normalize |
Tue, 03 Feb, 15:17 |
| Andrzej Bialecki (JIRA) |
[jira] Closed: (NUTCH-279) Additions for regex-normalize |
Tue, 03 Feb, 15:17 |
| Andrzej Bialecki (JIRA) |
[jira] Updated: (NUTCH-92) DistributedSearch incorrectly scores results |
Tue, 03 Feb, 15:31 |
| Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-92) DistributedSearch incorrectly scores results |
Tue, 03 Feb, 15:31 |
| Andrzej Bialecki (JIRA) |
[jira] Closed: (NUTCH-671) JSP errors in Nutch searcher webapp running with Tomcat 6 |
Tue, 03 Feb, 15:46 |
| Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-671) JSP errors in Nutch searcher webapp running with Tomcat 6 |
Tue, 03 Feb, 15:46 |
| Andrzej Bialecki (JIRA) |
[jira] Created: (NUTCH-685) Content-level redirect status lost in ParseSegment |
Fri, 06 Feb, 10:11 |
| Andrzej Bialecki (JIRA) |
[jira] Closed: (NUTCH-643) ClassCastException in PdfParser on encrypted PDF with empty password |
Fri, 06 Feb, 13:13 |
| Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-643) ClassCastException in PdfParser on encrypted PDF with empty password |
Fri, 06 Feb, 13:13 |
| Andrzej Bialecki (JIRA) |
[jira] Closed: (NUTCH-636) Http client plug-in https doesn't work on IBM JRE |
Fri, 06 Feb, 13:17 |
| Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-636) Http client plug-in https doesn't work on IBM JRE |
Fri, 06 Feb, 13:20 |
| Andrzej Bialecki (JIRA) |
[jira] Updated: (NUTCH-251) Administration GUI |
Fri, 06 Feb, 13:21 |
| Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-251) Administration GUI |
Fri, 06 Feb, 13:21 |
| Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-563) Include custom fields in BasicQueryFilter |
Fri, 06 Feb, 13:29 |
| Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-469) changes to geoPosition plugin to make it work on nutch 0.9 |
Fri, 06 Feb, 13:31 |
| Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-673) Upgrade the Carrot2 plug-in to release 3.0 |
Fri, 06 Feb, 13:35 |
| Andrzej Bialecki (JIRA) |
[jira] Updated: (NUTCH-673) Upgrade the Carrot2 plug-in to release 3.0 |
Fri, 06 Feb, 13:35 |
| Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-683) NUTCH-676 broke CrawlDbMerger |
Fri, 06 Feb, 13:35 |
| Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-261) Multi Language Support |
Fri, 06 Feb, 13:43 |
| Andrzej Bialecki (JIRA) |
[jira] Closed: (NUTCH-261) Multi Language Support |
Fri, 06 Feb, 13:43 |
| Andrzej Bialecki (JIRA) |
[jira] Closed: (NUTCH-357) crawling simulation |
Fri, 06 Feb, 13:45 |
| Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-357) crawling simulation |
Fri, 06 Feb, 13:45 |
| Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-455) dedup on tokenized fields is faulty |
Fri, 06 Feb, 13:52 |
| Andrzej Bialecki (JIRA) |
[jira] Updated: (NUTCH-455) dedup on tokenized fields is faulty |
Fri, 06 Feb, 13:52 |
| Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-262) Summary excerpts and highlights problems |
Fri, 06 Feb, 14:13 |
| Andrzej Bialecki (JIRA) |
[jira] Closed: (NUTCH-262) Summary excerpts and highlights problems |
Fri, 06 Feb, 14:13 |
| Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-479) Support for OR queries |
Fri, 06 Feb, 14:14 |
| Andrzej Bialecki (JIRA) |
[jira] Updated: (NUTCH-479) Support for OR queries |
Fri, 06 Feb, 14:14 |
| Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-74) French Analyzer Plugin |
Fri, 06 Feb, 14:17 |
| Andrzej Bialecki (JIRA) |
[jira] Closed: (NUTCH-74) French Analyzer Plugin |
Fri, 06 Feb, 14:17 |
| Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-694) Distributed Search Server fails |
Fri, 20 Feb, 07:31 |
| Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-684) Dedup support for Solr |
Fri, 20 Feb, 10:03 |
| Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-477) Extend URLFilters to support different filtering chains |
Fri, 20 Feb, 10:17 |
| Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-684) Dedup support for Solr |
Fri, 20 Feb, 10:47 |
| Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-701) Replace Fetcher with Fetcher2 |
Tue, 24 Feb, 10:48 |
| Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-699) Add an "official" solr schema for solr integration |
Tue, 24 Feb, 11:12 |
| Andrzej Bialecki (JIRA) |
[jira] Created: (NUTCH-703) Upgrade to Hadoop 0.19.1 |
Wed, 25 Feb, 16:47 |
| Andrzej Bialecki (JIRA) |
[jira] Closed: (NUTCH-704) ensure that more important pages are crawled first |
Thu, 26 Feb, 08:29 |
| Andrzej Bialecki (JIRA) |
[jira] Closed: (NUTCH-703) Upgrade to Hadoop 0.19.1 |
Fri, 27 Feb, 18:55 |
| Andrzej Bialecki (JIRA) |
[jira] Updated: (NUTCH-705) parse-rtf plugin |
Sat, 28 Feb, 15:46 |
| Andrzej Bialecki (JIRA) |
[jira] Updated: (NUTCH-477) Extend URLFilters to support different filtering chains |
Sat, 28 Feb, 18:48 |
| Apache Hudson Server |
Build failed in Hudson: Nutch-trunk #727 |
Tue, 17 Feb, 04:02 |
| Apache Hudson Server |
Hudson build is back to normal: Nutch-trunk #728 |
Wed, 18 Feb, 04:14 |
| Apache Wiki |
[Nutch Wiki] Update of "RunNutchInEclipse0.9" by FrankMcCown |
Tue, 10 Feb, 17:16 |
| Apache Wiki |
[Nutch Wiki] Trivial Update of "RunNutchInEclipse0.9" by FrankMcCown |
Wed, 11 Feb, 18:04 |
| Apache Wiki |
[Nutch Wiki] Update of "GettingNutchRunningWithWindows" by FrankMcCown |
Wed, 11 Feb, 18:25 |
| Apache Wiki |
[Nutch Wiki] Update of "IntranetRecrawl" by SAnand |
Thu, 12 Feb, 12:39 |
| Apache Wiki |
[Nutch Wiki] Update of "RunNutchInEclipse0.9" by FrankMcCown |
Thu, 19 Feb, 19:28 |
| Apache Wiki |
[Nutch Wiki] Update of "RunningNutchAndSolr" by SamiSiren |
Fri, 20 Feb, 08:56 |
| Apache Wiki |
[Nutch Wiki] Update of "InstallingWeb2" by SamiSiren |
Fri, 20 Feb, 09:01 |
| Apache Wiki |
[Nutch Wiki] Update of "DownloadingNutch" by BartoszGadzimski |
Thu, 26 Feb, 17:46 |
| Apache Wiki |
[Nutch Wiki] Update of "SimpleMapReduceTutorial" by BartoszGadzimski |
Thu, 26 Feb, 17:57 |
| Apache Wiki |
[Nutch Wiki] Trivial Update of "FrontPage" by BartoszGadzimski |
Thu, 26 Feb, 18:03 |
| Bartosz Gadzimski |
NutchAnalysis.java STOP_WORDS not configurable? |
Tue, 24 Feb, 13:28 |
| Chris A. Mattmann (JIRA) |
[jira] Assigned: (NUTCH-631) MoreIndexingFilter fails with NoSuchElementException |
Mon, 02 Feb, 17:08 |
| Chris A. Mattmann (JIRA) |
[jira] Work started: (NUTCH-631) MoreIndexingFilter fails with NoSuchElementException |
Mon, 02 Feb, 17:09 |
| Chris A. Mattmann (JIRA) |
[jira] Commented: (NUTCH-631) MoreIndexingFilter fails with NoSuchElementException |
Tue, 17 Feb, 14:16 |
| Dennis Kubes |
Re: Release 1.0? |
Sat, 28 Feb, 18:44 |
| Dennis Kubes (JIRA) |
[jira] Commented: (NUTCH-477) Extend URLFilters to support different filtering chains |
Mon, 23 Feb, 14:27 |
| Dmitry Lihachev (JIRA) |
[jira] Created: (NUTCH-691) Update jakarta poi jars to the most relevant version |
Wed, 18 Feb, 04:32 |
| Dmitry Lihachev (JIRA) |
[jira] Updated: (NUTCH-691) Update jakarta poi jars to the most relevant version |
Wed, 18 Feb, 04:36 |
| Dmitry Lihachev (JIRA) |
[jira] Updated: (NUTCH-691) Update jakarta poi jars to the most relevant version |
Wed, 18 Feb, 04:36 |
| Dmitry Lihachev (JIRA) |
[jira] Updated: (NUTCH-691) Update jakarta poi jars to the most relevant version |
Wed, 18 Feb, 04:38 |
| Dmitry Lihachev (JIRA) |
[jira] Updated: (NUTCH-691) Update jakarta poi jars to the most relevant version |
Wed, 18 Feb, 05:02 |
| Dmitry Lihachev (JIRA) |
[jira] Updated: (NUTCH-691) Update jakarta poi jars to the most relevant version |
Wed, 18 Feb, 05:02 |
| Dmitry Lihachev (JIRA) |
[jira] Commented: (NUTCH-691) Update jakarta poi jars to the most relevant version |
Wed, 18 Feb, 05:35 |
| Dmitry Lihachev (JIRA) |
[jira] Issue Comment Edited: (NUTCH-691) Update jakarta poi jars to the most relevant version |
Wed, 18 Feb, 05:41 |
| Dmitry Lihachev (JIRA) |
[jira] Commented: (NUTCH-591) StringIndexOutOfBoundsException when extracting text from a Word document. |
Wed, 18 Feb, 05:59 |