| Doğacan Güney |
Re: Nutch ScoringFilter plugin problems |
Wed, 21 Jan, 08:47 |
| Doğacan Güney |
Re: [jira] Created: (NUTCH-680) Update external jars to latest versions |
Fri, 23 Jan, 10:01 |
| Doğacan Güney |
Re: Nutch ScoringFilter plugin problems |
Mon, 26 Jan, 15:58 |
| Doğacan Güney |
Re: RSS-fecter and index individul-how can i realize this function |
Mon, 05 Jan, 10:32 |
| Doğacan Güney |
Re: [jira] Created: (NUTCH-680) Update external jars to latest versions |
Tue, 20 Jan, 15:49 |
| Doğacan Güney |
Re: [jira] Created: (NUTCH-680) Update external jars to latest versions |
Tue, 20 Jan, 18:13 |
| Doğacan Güney |
Re: [jira] Created: (NUTCH-680) Update external jars to latest versions |
Tue, 20 Jan, 20:40 |
| Doğacan Güney (JIRA) |
[jira] Commented: (NUTCH-442) Integrate Solr/Nutch |
Mon, 12 Jan, 13:28 |
| Doğacan Güney (JIRA) |
[jira] Resolved: (NUTCH-442) Integrate Solr/Nutch |
Mon, 12 Jan, 13:28 |
| Doğacan Güney (JIRA) |
[jira] Commented: (NUTCH-673) Upgrade the Carrot2 plug-in to release 3.0 |
Mon, 12 Jan, 13:33 |
| Doğacan Güney (JIRA) |
[jira] Commented: (NUTCH-670) feed plugin does not parse RSS2 enclosures |
Mon, 12 Jan, 13:36 |
| Doğacan Güney (JIRA) |
[jira] Closed: (NUTCH-652) AdaptiveFetchSchedule#setFetchSchedule doesn't calculate fetch interval correctly |
Mon, 12 Jan, 13:38 |
| Doğacan Güney (JIRA) |
[jira] Updated: (NUTCH-579) Feed plugin only indexes one post per feed due to identical digest |
Mon, 12 Jan, 13:44 |
| Doğacan Güney (JIRA) |
[jira] Commented: (NUTCH-442) Integrate Solr/Nutch |
Mon, 12 Jan, 17:24 |
| Doğacan Güney (JIRA) |
[jira] Commented: (NUTCH-442) Integrate Solr/Nutch |
Mon, 12 Jan, 17:32 |
| Doğacan Güney (JIRA) |
[jira] Commented: (NUTCH-678) Hadoop 0.19 requires an update of jets3t |
Mon, 19 Jan, 13:49 |
| Doğacan Güney (JIRA) |
[jira] Commented: (NUTCH-679) Fetcher2 implementing Tool |
Mon, 19 Jan, 13:51 |
| Doğacan Güney (JIRA) |
[jira] Created: (NUTCH-680) Update external jars to latest versions |
Mon, 19 Jan, 14:02 |
| Doğacan Güney (JIRA) |
[jira] Closed: (NUTCH-678) Hadoop 0.19 requires an update of jets3t |
Mon, 19 Jan, 17:11 |
| Doğacan Güney (JIRA) |
[jira] Closed: (NUTCH-572) Scoring and redirected Urls |
Tue, 20 Jan, 15:58 |
| Doğacan Güney (JIRA) |
[jira] Closed: (NUTCH-661) errors when the uri contains space characters |
Tue, 20 Jan, 20:48 |
| Doğacan Güney (JIRA) |
[jira] Commented: (NUTCH-669) Consolidate code for Fetcher and Fetcher2 |
Tue, 20 Jan, 21:08 |
| Doğacan Güney (JIRA) |
[jira] Updated: (NUTCH-676) MapWritable is written inefficiently and confusingly |
Tue, 20 Jan, 21:34 |
| Doğacan Güney (JIRA) |
[jira] Reopened: (NUTCH-681) parse-mp3 compilation problem |
Wed, 21 Jan, 13:00 |
| Doğacan Güney (JIRA) |
[jira] Closed: (NUTCH-681) parse-mp3 compilation problem |
Wed, 21 Jan, 13:12 |
| Doğacan Güney (JIRA) |
[jira] Updated: (NUTCH-677) Segment merge filering based on segment content |
Wed, 21 Jan, 14:59 |
| Doğacan Güney (JIRA) |
[jira] Updated: (NUTCH-664) Possibility to update already stored documents. |
Wed, 21 Jan, 15:02 |
| Doğacan Güney (JIRA) |
[jira] Commented: (NUTCH-655) Injecting Crawl metadata |
Wed, 21 Jan, 15:03 |
| Doğacan Güney (JIRA) |
[jira] Updated: (NUTCH-650) Hbase Integration |
Wed, 21 Jan, 15:05 |
| Doğacan Güney (JIRA) |
[jira] Updated: (NUTCH-676) MapWritable is written inefficiently and confusingly |
Wed, 21 Jan, 15:13 |
| Doğacan Güney (JIRA) |
[jira] Commented: (NUTCH-644) RTF parser doesn't compile anymore |
Wed, 21 Jan, 15:21 |
| Doğacan Güney (JIRA) |
[jira] Updated: (NUTCH-628) Host database to keep track of host-level information |
Wed, 21 Jan, 15:25 |
| Doğacan Güney (JIRA) |
[jira] Commented: (NUTCH-676) MapWritable is written inefficiently and confusingly |
Wed, 21 Jan, 19:17 |
| Doğacan Güney (JIRA) |
[jira] Closed: (NUTCH-676) MapWritable is written inefficiently and confusingly |
Wed, 21 Jan, 19:27 |
| Doğacan Güney (JIRA) |
[jira] Closed: (NUTCH-579) Feed plugin only indexes one post per feed due to identical digest |
Wed, 21 Jan, 19:43 |
| Doğacan Güney (JIRA) |
[jira] Updated: (NUTCH-655) Injecting Crawl metadata |
Fri, 23 Jan, 10:54 |
| Doğacan Güney (JIRA) |
[jira] Commented: (NUTCH-666) Analysis plugins for multiple language and new Language Identifier Tool |
Fri, 23 Jan, 10:54 |
| Doğacan Güney (JIRA) |
[jira] Commented: (NUTCH-628) Host database to keep track of host-level information |
Fri, 23 Jan, 10:57 |
| Doğacan Güney (JIRA) |
[jira] Commented: (NUTCH-673) Upgrade the Carrot2 plug-in to release 3.0 |
Fri, 23 Jan, 11:28 |
| Doğacan Güney (JIRA) |
[jira] Issue Comment Edited: (NUTCH-680) Update external jars to latest versions |
Sat, 24 Jan, 10:29 |
| Doğacan Güney (JIRA) |
[jira] Commented: (NUTCH-680) Update external jars to latest versions |
Sat, 24 Jan, 10:29 |
| Doğacan Güney (JIRA) |
[jira] Updated: (NUTCH-628) Host database to keep track of host-level information |
Sat, 24 Jan, 10:47 |
| Doğacan Güney (JIRA) |
[jira] Closed: (NUTCH-675) Reduce tasks do not report their status and are killed by jobtracker |
Sun, 25 Jan, 11:39 |
| Doğacan Güney (JIRA) |
[jira] Closed: (NUTCH-660) Does anybody know how to let nutch crawl this kind of website? |
Sun, 25 Jan, 11:41 |
| Doğacan Güney (JIRA) |
[jira] Closed: (NUTCH-627) Minimize host address lookup |
Sun, 25 Jan, 11:41 |
| Doğacan Güney (JIRA) |
[jira] Closed: (NUTCH-588) Help Need |
Sun, 25 Jan, 11:41 |
| Doğacan Güney (JIRA) |
[jira] Closed: (NUTCH-611) Upgrade Nutch to use Hadoop 0.16 |
Sun, 25 Jan, 11:41 |
| Doğacan Güney (JIRA) |
[jira] Closed: (NUTCH-574) Including inlink anchor text in index can create irrelevant search results. |
Sun, 25 Jan, 11:43 |
| Doğacan Güney (JIRA) |
[jira] Closed: (NUTCH-567) Proper (?) handling of URIs in TagSoup. |
Sun, 25 Jan, 11:45 |
| Doğacan Güney (JIRA) |
[jira] Commented: (NUTCH-650) Hbase Integration |
Mon, 26 Jan, 20:29 |
| Doğacan Güney (JIRA) |
[jira] Commented: (NUTCH-680) Update external jars to latest versions |
Tue, 27 Jan, 10:22 |
| Doğacan Güney (JIRA) |
[jira] Commented: (NUTCH-628) Host database to keep track of host-level information |
Tue, 27 Jan, 18:02 |
| Doğacan Güney (JIRA) |
[jira] Updated: (NUTCH-626) fetcher2 breaks out the domain with db.ignore.external.links set at cross domain redirects |
Wed, 28 Jan, 11:01 |
| Doğacan Güney (JIRA) |
[jira] Closed: (NUTCH-571) parse-mp3 plugin doesn't always index album of mp3 |
Wed, 28 Jan, 11:35 |
| Doğacan Güney (JIRA) |
[jira] Commented: (NUTCH-643) ClassCastException in PdfParser on encrypted PDF with empty password |
Wed, 28 Jan, 11:38 |
| Doğacan Güney (JIRA) |
[jira] Commented: (NUTCH-643) ClassCastException in PdfParser on encrypted PDF with empty password |
Wed, 28 Jan, 13:11 |
| Doğacan Güney (JIRA) |
[jira] Closed: (NUTCH-680) Update external jars to latest versions |
Wed, 28 Jan, 14:13 |
| Doğacan Güney (JIRA) |
[jira] Commented: (NUTCH-628) Host database to keep track of host-level information |
Wed, 28 Jan, 20:16 |
| Doğacan Güney (JIRA) |
[jira] Commented: (NUTCH-628) Host database to keep track of host-level information |
Wed, 28 Jan, 21:24 |
| Doğacan Güney (JIRA) |
[jira] Closed: (NUTCH-682) SOLR indexer does not set boost on the document |
Thu, 29 Jan, 19:13 |
| Doğacan Güney (JIRA) |
[jira] Created: (NUTCH-683) NUTCH-676 broke CrawlDbMerger |
Thu, 29 Jan, 19:45 |
| Doğacan Güney (JIRA) |
[jira] Updated: (NUTCH-683) NUTCH-676 broke CrawlDbMerger |
Thu, 29 Jan, 19:45 |
| Doğacan Güney (JIRA) |
[jira] Created: (NUTCH-684) Dedup support for Solr |
Fri, 30 Jan, 16:35 |
| Doğacan Güney (JIRA) |
[jira] Updated: (NUTCH-684) Dedup support for Solr |
Fri, 30 Jan, 16:36 |
| Aaron Hammond (JIRA) |
[jira] Commented: (NUTCH-442) Integrate Solr/Nutch |
Sat, 10 Jan, 03:46 |
| Andrzej Bialecki |
Re: Site update |
Mon, 05 Jan, 22:28 |
| Andrzej Bialecki |
Re: Site update |
Mon, 05 Jan, 22:49 |
| Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-594) Serve Nutch search results in multiple formats including XML and JSON |
Thu, 01 Jan, 20:45 |
| Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-624) Better parsed text by default parser |
Fri, 09 Jan, 18:59 |
| Andrzej Bialecki (JIRA) |
[jira] Closed: (NUTCH-624) Better parsed text by default parser |
Fri, 09 Jan, 18:59 |
| Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-627) Minimize host address lookup |
Fri, 09 Jan, 19:03 |
| Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-643) ClassCastException in PdfParser on encrypted PDF with empty password |
Wed, 28 Jan, 12:40 |
| Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-643) ClassCastException in PdfParser on encrypted PDF with empty password |
Wed, 28 Jan, 12:59 |
| Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-628) Host database to keep track of host-level information |
Wed, 28 Jan, 21:10 |
| Apache Hudson Server |
Build failed in Hudson: Nutch-trunk #678 |
Thu, 01 Jan, 01:06 |
| Apache Hudson Server |
Build failed in Hudson: Nutch-trunk #679 |
Thu, 01 Jan, 04:11 |
| Apache Hudson Server |
Build failed in Hudson: Nutch-trunk #680 |
Fri, 02 Jan, 04:12 |
| Apache Hudson Server |
Build failed in Hudson: Nutch-trunk #681 |
Sat, 03 Jan, 04:12 |
| Apache Hudson Server |
Build failed in Hudson: Nutch-trunk #682 |
Sun, 04 Jan, 04:11 |
| Apache Hudson Server |
Build failed in Hudson: Nutch-trunk #683 |
Mon, 05 Jan, 04:11 |
| Apache Hudson Server |
Build failed in Hudson: Nutch-trunk #684 |
Tue, 06 Jan, 04:11 |
| Apache Hudson Server |
Build failed in Hudson: Nutch-trunk #685 |
Wed, 07 Jan, 04:10 |
| Apache Hudson Server |
Build failed in Hudson: Nutch-trunk #686 |
Thu, 08 Jan, 04:13 |
| Apache Hudson Server |
Build failed in Hudson: Nutch-trunk #687 |
Fri, 09 Jan, 04:14 |
| Apache Hudson Server |
Build failed in Hudson: Nutch-trunk #688 |
Sat, 10 Jan, 04:13 |
| Apache Hudson Server |
Build failed in Hudson: Nutch-trunk #689 |
Sun, 11 Jan, 04:15 |
| Apache Hudson Server |
Build failed in Hudson: Nutch-trunk #690 |
Mon, 12 Jan, 04:15 |
| Apache Hudson Server |
Hudson build is back to normal: Nutch-trunk #691 |
Tue, 13 Jan, 04:16 |
| Apache Hudson Server |
Build failed in Hudson: Nutch-trunk #693 |
Thu, 15 Jan, 04:29 |
| Apache Hudson Server |
Hudson build is back to normal: Nutch-trunk #694 |
Fri, 16 Jan, 04:16 |
| Apache Hudson Server |
Build failed in Hudson: Nutch-trunk #695 |
Sat, 17 Jan, 04:57 |
| Apache Hudson Server |
Hudson build is back to normal: Nutch-trunk #696 |
Sat, 17 Jan, 09:29 |
| Apache Wiki |
[Nutch Wiki] Trivial Update of "Release HOWTO" by SamiSiren |
Fri, 09 Jan, 17:50 |
| Apache Wiki |
[Nutch Wiki] Update of "NewPage" by DennisKubes |
Mon, 12 Jan, 17:32 |
| Apache Wiki |
[Nutch Wiki] Update of "NewScoring" by DennisKubes |
Mon, 12 Jan, 17:33 |
| Apache Wiki |
[Nutch Wiki] Update of "NewPage" by DennisKubes |
Mon, 12 Jan, 17:34 |
| Apache Wiki |
[Nutch Wiki] Update of "FrontPage" by DennisKubes |
Mon, 12 Jan, 17:36 |
| Apache Wiki |
[Nutch Wiki] Update of "NewScoring" by OtisGospodnetic |
Tue, 13 Jan, 18:22 |
| Apache Wiki |
[Nutch Wiki] Update of "Mailing" by GrantIngersoll |
Mon, 26 Jan, 16:32 |
| Apache Wiki |
[Nutch Wiki] Update of "Mailing" by GrantIngersoll |
Mon, 26 Jan, 16:33 |