| Doğacan Güney (JIRA) |
[jira] Closed: (NUTCH-572) Scoring and redirected Urls |
Tue, 20 Jan, 15:58 |
| Pau |
Nutch ScoringFilter plugin problems |
Tue, 20 Jan, 17:18 |
| Otis Gospodnetic (JIRA) |
[jira] Commented: (NUTCH-679) Fetcher2 implementing Tool |
Tue, 20 Jan, 17:44 |
| Otis Gospodnetic |
Re: [jira] Created: (NUTCH-680) Update external jars to latest versions |
Tue, 20 Jan, 17:48 |
| Doğacan Güney |
Re: [jira] Created: (NUTCH-680) Update external jars to latest versions |
Tue, 20 Jan, 18:13 |
| Otis Gospodnetic |
Re: [jira] Created: (NUTCH-680) Update external jars to latest versions |
Tue, 20 Jan, 20:35 |
| Doğacan Güney |
Re: [jira] Created: (NUTCH-680) Update external jars to latest versions |
Tue, 20 Jan, 20:40 |
| Doğacan Güney (JIRA) |
[jira] Closed: (NUTCH-661) errors when the uri contains space characters |
Tue, 20 Jan, 20:48 |
| Doğacan Güney (JIRA) |
[jira] Commented: (NUTCH-669) Consolidate code for Fetcher and Fetcher2 |
Tue, 20 Jan, 21:08 |
| Doğacan Güney (JIRA) |
[jira] Updated: (NUTCH-676) MapWritable is written inefficiently and confusingly |
Tue, 20 Jan, 21:34 |
| Piotr Kosiorowski |
Re: [jira] Created: (NUTCH-680) Update external jars to latest versions |
Tue, 20 Jan, 21:35 |
| Otis Gospodnetic |
Re: [jira] Created: (NUTCH-680) Update external jars to latest versions |
Tue, 20 Jan, 21:39 |
| Piotr Kosiorowski |
Re: [jira] Created: (NUTCH-680) Update external jars to latest versions |
Tue, 20 Jan, 22:01 |
| Wildan Maulana (JIRA) |
[jira] Resolved: (NUTCH-681) parse-mp3 compilation problem |
Wed, 21 Jan, 08:30 |
| Doğacan Güney |
Re: Nutch ScoringFilter plugin problems |
Wed, 21 Jan, 08:47 |
| Pau |
Re: Nutch ScoringFilter plugin problems |
Wed, 21 Jan, 09:16 |
| julien nioche (JIRA) |
[jira] Commented: (NUTCH-679) Fetcher2 implementing Tool |
Wed, 21 Jan, 10:52 |
| Doğacan Güney (JIRA) |
[jira] Reopened: (NUTCH-681) parse-mp3 compilation problem |
Wed, 21 Jan, 13:00 |
| Doğacan Güney (JIRA) |
[jira] Closed: (NUTCH-681) parse-mp3 compilation problem |
Wed, 21 Jan, 13:12 |
| Doğacan Güney (JIRA) |
[jira] Updated: (NUTCH-677) Segment merge filering based on segment content |
Wed, 21 Jan, 14:59 |
| Doğacan Güney (JIRA) |
[jira] Updated: (NUTCH-664) Possibility to update already stored documents. |
Wed, 21 Jan, 15:02 |
| Doğacan Güney (JIRA) |
[jira] Commented: (NUTCH-655) Injecting Crawl metadata |
Wed, 21 Jan, 15:03 |
| Doğacan Güney (JIRA) |
[jira] Updated: (NUTCH-650) Hbase Integration |
Wed, 21 Jan, 15:05 |
| Doğacan Güney (JIRA) |
[jira] Updated: (NUTCH-676) MapWritable is written inefficiently and confusingly |
Wed, 21 Jan, 15:13 |
| Doğacan Güney (JIRA) |
[jira] Commented: (NUTCH-644) RTF parser doesn't compile anymore |
Wed, 21 Jan, 15:21 |
| Todd Lipcon (JIRA) |
[jira] Commented: (NUTCH-676) MapWritable is written inefficiently and confusingly |
Wed, 21 Jan, 15:21 |
| Doğacan Güney (JIRA) |
[jira] Updated: (NUTCH-628) Host database to keep track of host-level information |
Wed, 21 Jan, 15:25 |
| Todd Lipcon (JIRA) |
[jira] Commented: (NUTCH-676) MapWritable is written inefficiently and confusingly |
Wed, 21 Jan, 17:45 |
| Doğacan Güney (JIRA) |
[jira] Commented: (NUTCH-676) MapWritable is written inefficiently and confusingly |
Wed, 21 Jan, 19:17 |
| Doğacan Güney (JIRA) |
[jira] Closed: (NUTCH-676) MapWritable is written inefficiently and confusingly |
Wed, 21 Jan, 19:27 |
| Doğacan Güney (JIRA) |
[jira] Closed: (NUTCH-579) Feed plugin only indexes one post per feed due to identical digest |
Wed, 21 Jan, 19:43 |
| Wildan Maulana (JIRA) |
[jira] Commented: (NUTCH-681) parse-mp3 compilation problem |
Thu, 22 Jan, 03:31 |
| Hudson (JIRA) |
[jira] Commented: (NUTCH-676) MapWritable is written inefficiently and confusingly |
Thu, 22 Jan, 04:15 |
| Hudson (JIRA) |
[jira] Commented: (NUTCH-681) parse-mp3 compilation problem |
Thu, 22 Jan, 04:15 |
| Hudson (JIRA) |
[jira] Commented: (NUTCH-579) Feed plugin only indexes one post per feed due to identical digest |
Thu, 22 Jan, 04:15 |
| Vimal Varghese |
Re: login failed exception |
Thu, 22 Jan, 04:50 |
| Stefano Tauriello (JIRA) |
[jira] Commented: (NUTCH-386) Plugin to index categories by url rules |
Thu, 22 Jan, 10:42 |
| Beaucarnea (JIRA) |
[jira] Commented: (NUTCH-386) Plugin to index categories by url rules |
Thu, 22 Jan, 11:31 |
| Stefano Tauriello (JIRA) |
[jira] Commented: (NUTCH-386) Plugin to index categories by url rules |
Thu, 22 Jan, 12:01 |
| Otis Gospodnetic (JIRA) |
[jira] Commented: (NUTCH-655) Injecting Crawl metadata |
Thu, 22 Jan, 20:37 |
| Otis Gospodnetic (JIRA) |
[jira] Commented: (NUTCH-628) Host database to keep track of host-level information |
Thu, 22 Jan, 20:51 |
| Doğacan Güney |
Re: [jira] Created: (NUTCH-680) Update external jars to latest versions |
Fri, 23 Jan, 10:01 |
| Doğacan Güney (JIRA) |
[jira] Updated: (NUTCH-655) Injecting Crawl metadata |
Fri, 23 Jan, 10:54 |
| Doğacan Güney (JIRA) |
[jira] Commented: (NUTCH-666) Analysis plugins for multiple language and new Language Identifier Tool |
Fri, 23 Jan, 10:54 |
| Doğacan Güney (JIRA) |
[jira] Commented: (NUTCH-628) Host database to keep track of host-level information |
Fri, 23 Jan, 10:57 |
| Dennis Kubes (JIRA) |
[jira] Updated: (NUTCH-666) Analysis plugins for multiple language and new Language Identifier Tool |
Fri, 23 Jan, 11:18 |
| Dennis Kubes (JIRA) |
[jira] Commented: (NUTCH-666) Analysis plugins for multiple language and new Language Identifier Tool |
Fri, 23 Jan, 11:18 |
| Doğacan Güney (JIRA) |
[jira] Commented: (NUTCH-673) Upgrade the Carrot2 plug-in to release 3.0 |
Fri, 23 Jan, 11:28 |
| Stefano Tauriello (JIRA) |
[jira] Commented: (NUTCH-386) Plugin to index categories by url rules |
Fri, 23 Jan, 16:03 |
| Otis Gospodnetic (JIRA) |
[jira] Commented: (NUTCH-666) Analysis plugins for multiple language and new Language Identifier Tool |
Fri, 23 Jan, 22:47 |
| Otis Gospodnetic (JIRA) |
[jira] Commented: (NUTCH-628) Host database to keep track of host-level information |
Fri, 23 Jan, 22:49 |
| Doğacan Güney (JIRA) |
[jira] Issue Comment Edited: (NUTCH-680) Update external jars to latest versions |
Sat, 24 Jan, 10:29 |
| Doğacan Güney (JIRA) |
[jira] Commented: (NUTCH-680) Update external jars to latest versions |
Sat, 24 Jan, 10:29 |
| Doğacan Güney (JIRA) |
[jira] Updated: (NUTCH-628) Host database to keep track of host-level information |
Sat, 24 Jan, 10:47 |
| Hudson (JIRA) |
[jira] Commented: (NUTCH-680) Update external jars to latest versions |
Sun, 25 Jan, 04:15 |
| Doğacan Güney (JIRA) |
[jira] Closed: (NUTCH-675) Reduce tasks do not report their status and are killed by jobtracker |
Sun, 25 Jan, 11:39 |
| Doğacan Güney (JIRA) |
[jira] Closed: (NUTCH-660) Does anybody know how to let nutch crawl this kind of website? |
Sun, 25 Jan, 11:41 |
| Doğacan Güney (JIRA) |
[jira] Closed: (NUTCH-627) Minimize host address lookup |
Sun, 25 Jan, 11:41 |
| Doğacan Güney (JIRA) |
[jira] Closed: (NUTCH-588) Help Need |
Sun, 25 Jan, 11:41 |
| Doğacan Güney (JIRA) |
[jira] Closed: (NUTCH-611) Upgrade Nutch to use Hadoop 0.16 |
Sun, 25 Jan, 11:41 |
| Doğacan Güney (JIRA) |
[jira] Closed: (NUTCH-574) Including inlink anchor text in index can create irrelevant search results. |
Sun, 25 Jan, 11:43 |
| Doğacan Güney (JIRA) |
[jira] Closed: (NUTCH-567) Proper (?) handling of URIs in TagSoup. |
Sun, 25 Jan, 11:45 |
| Pau |
Re: Nutch ScoringFilter plugin problems |
Mon, 26 Jan, 12:17 |
| Doğacan Güney |
Re: Nutch ScoringFilter plugin problems |
Mon, 26 Jan, 15:58 |
| Apache Wiki |
[Nutch Wiki] Update of "Mailing" by GrantIngersoll |
Mon, 26 Jan, 16:32 |
| Apache Wiki |
[Nutch Wiki] Update of "Mailing" by GrantIngersoll |
Mon, 26 Jan, 16:33 |
| Doğacan Güney (JIRA) |
[jira] Commented: (NUTCH-650) Hbase Integration |
Mon, 26 Jan, 20:29 |
| Doğacan Güney (JIRA) |
[jira] Commented: (NUTCH-680) Update external jars to latest versions |
Tue, 27 Jan, 10:22 |
| Doğacan Güney (JIRA) |
[jira] Commented: (NUTCH-628) Host database to keep track of host-level information |
Tue, 27 Jan, 18:02 |
| Hudson (JIRA) |
[jira] Commented: (NUTCH-628) Host database to keep track of host-level information |
Wed, 28 Jan, 04:17 |
| Hudson (JIRA) |
[jira] Commented: (NUTCH-680) Update external jars to latest versions |
Wed, 28 Jan, 04:17 |
| Marko Bauhardt |
Release 1.0? |
Wed, 28 Jan, 08:45 |
| Doğacan Güney (JIRA) |
[jira] Updated: (NUTCH-626) fetcher2 breaks out the domain with db.ignore.external.links set at cross domain redirects |
Wed, 28 Jan, 11:01 |
| Doğacan Güney (JIRA) |
[jira] Closed: (NUTCH-571) parse-mp3 plugin doesn't always index album of mp3 |
Wed, 28 Jan, 11:35 |
| Doğacan Güney (JIRA) |
[jira] Commented: (NUTCH-643) ClassCastException in PdfParser on encrypted PDF with empty password |
Wed, 28 Jan, 11:38 |
| Guillaume Smet (JIRA) |
[jira] Commented: (NUTCH-643) ClassCastException in PdfParser on encrypted PDF with empty password |
Wed, 28 Jan, 12:13 |
| Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-643) ClassCastException in PdfParser on encrypted PDF with empty password |
Wed, 28 Jan, 12:40 |
| Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-643) ClassCastException in PdfParser on encrypted PDF with empty password |
Wed, 28 Jan, 12:59 |
| Doğacan Güney (JIRA) |
[jira] Commented: (NUTCH-643) ClassCastException in PdfParser on encrypted PDF with empty password |
Wed, 28 Jan, 13:11 |
| Doğacan Güney (JIRA) |
[jira] Closed: (NUTCH-680) Update external jars to latest versions |
Wed, 28 Jan, 14:13 |
| Otis Gospodnetic (JIRA) |
[jira] Commented: (NUTCH-628) Host database to keep track of host-level information |
Wed, 28 Jan, 20:10 |
| Doğacan Güney (JIRA) |
[jira] Commented: (NUTCH-628) Host database to keep track of host-level information |
Wed, 28 Jan, 20:16 |
| Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-628) Host database to keep track of host-level information |
Wed, 28 Jan, 21:10 |
| Doğacan Güney (JIRA) |
[jira] Commented: (NUTCH-628) Host database to keep track of host-level information |
Wed, 28 Jan, 21:24 |
| Hudson (JIRA) |
[jira] Commented: (NUTCH-571) parse-mp3 plugin doesn't always index album of mp3 |
Thu, 29 Jan, 04:17 |
| Sami Siren |
Registration for ApacheCon Europe 2009 is now open! |
Thu, 29 Jan, 10:18 |
| julien nioche (JIRA) |
[jira] Created: (NUTCH-682) SOLR indexer does not set boost on the document |
Thu, 29 Jan, 18:53 |
| Doğacan Güney (JIRA) |
[jira] Closed: (NUTCH-682) SOLR indexer does not set boost on the document |
Thu, 29 Jan, 19:13 |
| Doğacan Güney (JIRA) |
[jira] Created: (NUTCH-683) NUTCH-676 broke CrawlDbMerger |
Thu, 29 Jan, 19:45 |
| Doğacan Güney (JIRA) |
[jira] Updated: (NUTCH-683) NUTCH-676 broke CrawlDbMerger |
Thu, 29 Jan, 19:45 |
| Raghavendra Neelekani |
Re: [jira] Updated: (NUTCH-683) NUTCH-676 broke CrawlDbMerger |
Thu, 29 Jan, 19:58 |
| Hudson (JIRA) |
[jira] Commented: (NUTCH-682) SOLR indexer does not set boost on the document |
Fri, 30 Jan, 04:20 |
| Doğacan Güney (JIRA) |
[jira] Created: (NUTCH-684) Dedup support for Solr |
Fri, 30 Jan, 16:35 |
| Doğacan Güney (JIRA) |
[jira] Updated: (NUTCH-684) Dedup support for Solr |
Fri, 30 Jan, 16:36 |
| Raghavendra Neelekani |
Re: [jira] Created: (NUTCH-683) NUTCH-676 broke CrawlDbMerger |
Fri, 30 Jan, 18:17 |
| Grease |
Re: [jira] Created: (NUTCH-633) ParseSegment no longer allow reparsing |
Sat, 31 Jan, 05:44 |
| Raagu |
writing plugin |
Sat, 31 Jan, 09:18 |