|
Re: ant test failures |
|
| Doğacan Güney |
Re: ant test failures |
Sat, 01 Sep, 12:48 |
| Doğacan Güney (JIRA) |
[jira] Created: (NUTCH-547) Redirection handling: YahooSlurp's algorithm |
Mon, 03 Sep, 07:47 |
|
[jira] Updated: (NUTCH-547) Redirection handling: YahooSlurp's algorithm |
|
| Doğacan Güney (JIRA) |
[jira] Updated: (NUTCH-547) Redirection handling: YahooSlurp's algorithm |
Mon, 03 Sep, 07:49 |
| Doğacan Güney (JIRA) |
[jira] Updated: (NUTCH-547) Redirection handling: YahooSlurp's algorithm |
Thu, 20 Sep, 14:06 |
|
[jira] Commented: (NUTCH-546) file URL are filtered out by the crawler |
|
| Doğacan Güney (JIRA) |
[jira] Commented: (NUTCH-546) file URL are filtered out by the crawler |
Mon, 03 Sep, 07:53 |
| Hudson (JIRA) |
[jira] Commented: (NUTCH-546) file URL are filtered out by the crawler |
Tue, 11 Sep, 06:39 |
| Hudson (JIRA) |
[jira] Commented: (NUTCH-546) file URL are filtered out by the crawler |
Wed, 12 Sep, 04:22 |
| Emmanuel Joke (JIRA) |
[jira] Updated: (NUTCH-532) CrawlDbMerger: wrong computation of last fetch time |
Mon, 03 Sep, 08:27 |
| Doğacan Güney (JIRA) |
[jira] Resolved: (NUTCH-532) CrawlDbMerger: wrong computation of last fetch time |
Mon, 03 Sep, 13:38 |
|
[jira] Commented: (NUTCH-547) Redirection handling: YahooSlurp's algorithm |
|
| Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-547) Redirection handling: YahooSlurp's algorithm |
Mon, 03 Sep, 18:14 |
| Andrzej Bialecki |
Re: [jira] Commented: (NUTCH-547) Redirection handling: YahooSlurp's algorithm |
Mon, 03 Sep, 18:20 |
| Doğacan Güney (JIRA) |
[jira] Commented: (NUTCH-547) Redirection handling: YahooSlurp's algorithm |
Tue, 04 Sep, 11:42 |
| Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-547) Redirection handling: YahooSlurp's algorithm |
Mon, 10 Sep, 20:25 |
| Doğacan Güney (JIRA) |
[jira] Commented: (NUTCH-547) Redirection handling: YahooSlurp's algorithm |
Mon, 10 Sep, 20:44 |
| Emmanuel Joke (JIRA) |
[jira] Closed: (NUTCH-526) Use a combiner in LinDbMerger to improve the performance as in LinkDb |
Tue, 04 Sep, 03:36 |
| Emmanuel Joke (JIRA) |
[jira] Updated: (NUTCH-528) CrawlDbReader: add some new stats + dump into a csv format |
Tue, 04 Sep, 07:16 |
|
[jira] Updated: (NUTCH-529) NodeWalker.skipChildren doesn't work for more than 1 child. |
|
| Emmanuel Joke (JIRA) |
[jira] Updated: (NUTCH-529) NodeWalker.skipChildren doesn't work for more than 1 child. |
Tue, 04 Sep, 08:46 |
| Emmanuel Joke (JIRA) |
[jira] Updated: (NUTCH-529) NodeWalker.skipChildren doesn't work for more than 1 child. |
Tue, 11 Sep, 11:30 |
| Emmanuel Joke (JIRA) |
[jira] Updated: (NUTCH-529) NodeWalker.skipChildren doesn't work for more than 1 child. |
Fri, 21 Sep, 16:03 |
| Emmanuel Joke (JIRA) |
[jira] Updated: (NUTCH-529) NodeWalker.skipChildren doesn't work for more than 1 child. |
Fri, 21 Sep, 16:05 |
| Emmanuel Joke (JIRA) |
[jira] Updated: (NUTCH-529) NodeWalker.skipChildren doesn't work for more than 1 child. |
Fri, 21 Sep, 16:05 |
| Emmanuel Joke (JIRA) |
[jira] Created: (NUTCH-548) Move URLNormalizer from Outlink to ParseOutputFormat |
Tue, 04 Sep, 10:34 |
| Emmanuel Joke (JIRA) |
[jira] Updated: (NUTCH-548) Move URLNormalizer from Outlink to ParseOutputFormat |
Tue, 04 Sep, 10:34 |
|
[jira] Commented: (NUTCH-548) Move URLNormalizer from Outlink to ParseOutputFormat |
|
| Emmanuel Joke (JIRA) |
[jira] Commented: (NUTCH-548) Move URLNormalizer from Outlink to ParseOutputFormat |
Tue, 04 Sep, 10:38 |
| Doğacan Güney (JIRA) |
[jira] Commented: (NUTCH-548) Move URLNormalizer from Outlink to ParseOutputFormat |
Tue, 04 Sep, 11:42 |
| Emmanuel Joke (JIRA) |
[jira] Commented: (NUTCH-548) Move URLNormalizer from Outlink to ParseOutputFormat |
Tue, 04 Sep, 15:30 |
| Doğacan Güney (JIRA) |
[jira] Commented: (NUTCH-548) Move URLNormalizer from Outlink to ParseOutputFormat |
Wed, 05 Sep, 15:07 |
| Emmanuel Joke (JIRA) |
[jira] Commented: (NUTCH-548) Move URLNormalizer from Outlink to ParseOutputFormat |
Thu, 06 Sep, 16:32 |
| Doğacan Güney (JIRA) |
[jira] Closed: (NUTCH-532) CrawlDbMerger: wrong computation of last fetch time |
Tue, 04 Sep, 12:32 |
| Hudson (JIRA) |
[jira] Commented: (NUTCH-532) CrawlDbMerger: wrong computation of last fetch time |
Tue, 04 Sep, 17:00 |
|
[jira] Commented: (NUTCH-251) Administration GUI |
|
| Marc Brette (JIRA) |
[jira] Commented: (NUTCH-251) Administration GUI |
Wed, 05 Sep, 16:33 |
| Doğacan Güney (JIRA) |
[jira] Commented: (NUTCH-251) Administration GUI |
Wed, 05 Sep, 18:09 |
| Doğacan Güney (JIRA) |
[jira] Updated: (NUTCH-546) file URL are filtered out by the crawler |
Thu, 06 Sep, 12:56 |
|
[jira] Commented: (NUTCH-530) Add a combiner to improve performance on updatedb |
|
| Doğacan Güney (JIRA) |
[jira] Commented: (NUTCH-530) Add a combiner to improve performance on updatedb |
Thu, 06 Sep, 13:24 |
| Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-530) Add a combiner to improve performance on updatedb |
Thu, 06 Sep, 17:38 |
| Doğacan Güney (JIRA) |
[jira] Commented: (NUTCH-524) Generate Problem with Single Node |
Thu, 06 Sep, 13:26 |
| Jeff Maki |
Meta Tags and Indexing |
Thu, 06 Sep, 14:45 |
| Jeff Maki |
Labeling URLs a-la Google |
Thu, 06 Sep, 20:04 |
| ogjunk-nu...@yahoo.com |
Re: Labeling URLs a-la Google |
Fri, 07 Sep, 20:36 |
| Marcin Okraszewski |
=?UTF-8?Q?Limiting_outlink_tags.?= |
Thu, 06 Sep, 21:09 |
| Doğacan Güney |
Re: Limiting outlink tags. |
Fri, 07 Sep, 07:55 |
| Marcin Okraszewski |
Re: Limiting outlink tags. |
Thu, 20 Sep, 20:24 |
| crossany (JIRA) |
[jira] Created: (NUTCH-549) Bug |
Fri, 07 Sep, 02:35 |
|
Re: bug with generate performance |
|
| Doğacan Güney |
Re: bug with generate performance |
Fri, 07 Sep, 07:37 |
| Andrzej Bialecki |
Re: bug with generate performance |
Fri, 07 Sep, 10:50 |
| misc |
Re: bug with generate performance |
Fri, 07 Sep, 23:47 |
| Doğacan Güney (JIRA) |
[jira] Created: (NUTCH-550) Parse fails if db.max.outlinks.per.page is -1 |
Fri, 07 Sep, 08:29 |
| Doğacan Güney (JIRA) |
[jira] Updated: (NUTCH-550) Parse fails if db.max.outlinks.per.page is -1 |
Fri, 07 Sep, 08:29 |
| Jim (JIRA) |
[jira] Created: (NUTCH-551) performance for generate is often really bad |
Fri, 07 Sep, 23:43 |
|
[jira] Commented: (NUTCH-551) performance for generate is often really bad |
|
| Jim (JIRA) |
[jira] Commented: (NUTCH-551) performance for generate is often really bad |
Sat, 08 Sep, 02:14 |
| Doğacan Güney (JIRA) |
[jira] Commented: (NUTCH-551) performance for generate is often really bad |
Mon, 10 Sep, 20:00 |
| Jim (JIRA) |
[jira] Commented: (NUTCH-551) performance for generate is often really bad |
Tue, 11 Sep, 01:59 |
| Jim (JIRA) |
[jira] Commented: (NUTCH-551) performance for generate is often really bad |
Tue, 11 Sep, 22:08 |
| Doğacan Güney (JIRA) |
[jira] Commented: (NUTCH-551) performance for generate is often really bad |
Wed, 12 Sep, 06:24 |
| Jim (JIRA) |
[jira] Commented: (NUTCH-551) performance for generate is often really bad |
Fri, 14 Sep, 20:16 |
| m.harig |
Pl...Give me example |
Sat, 08 Sep, 04:23 |
| r...@rosa.com |
Daniel Udatny is out of the office. |
Sat, 08 Sep, 08:09 |
|
[jira] Updated: (NUTCH-44) too many search results |
|
| Susam Pal (JIRA) |
[jira] Updated: (NUTCH-44) too many search results |
Sat, 08 Sep, 09:55 |
| Susam Pal (JIRA) |
[jira] Updated: (NUTCH-44) too many search results |
Sat, 08 Sep, 11:08 |
| Susam Pal (JIRA) |
[jira] Updated: (NUTCH-44) too many search results |
Sat, 08 Sep, 11:25 |
| Susam Pal (JIRA) |
[jira] Updated: (NUTCH-281) cached.jsp: base-href needs to be outside comments |
Sun, 09 Sep, 10:57 |
| Doğacan Güney (JIRA) |
[jira] Resolved: (NUTCH-550) Parse fails if db.max.outlinks.per.page is -1 |
Mon, 10 Sep, 19:41 |
| Doğacan Güney (JIRA) |
[jira] Closed: (NUTCH-550) Parse fails if db.max.outlinks.per.page is -1 |
Mon, 10 Sep, 19:41 |
| Doğacan Güney (JIRA) |
[jira] Closed: (NUTCH-549) Bug |
Mon, 10 Sep, 19:41 |
| Doğacan Güney (JIRA) |
[jira] Resolved: (NUTCH-546) file URL are filtered out by the crawler |
Mon, 10 Sep, 19:47 |
| Doğacan Güney (JIRA) |
[jira] Closed: (NUTCH-491) dedup fails with ArrayIndexOutOfBoundsException |
Mon, 10 Sep, 19:49 |
|
[jira] Commented: (NUTCH-529) NodeWalker.skipChildren doesn't work for more than 1 child. |
|
| Doğacan Güney (JIRA) |
[jira] Commented: (NUTCH-529) NodeWalker.skipChildren doesn't work for more than 1 child. |
Mon, 10 Sep, 19:53 |
| Doğacan Güney (JIRA) |
[jira] Commented: (NUTCH-529) NodeWalker.skipChildren doesn't work for more than 1 child. |
Fri, 21 Sep, 11:38 |
| Hudson (JIRA) |
[jira] Commented: (NUTCH-529) NodeWalker.skipChildren doesn't work for more than 1 child. |
Tue, 25 Sep, 04:18 |
| hud...@lucene.zones.apache.org |
Build failed in Hudson: Nutch-Nightly #203 |
Tue, 11 Sep, 06:37 |
| Doğacan Güney |
Re: Build failed in Hudson: Nutch-Nightly #203 |
Tue, 11 Sep, 07:43 |
| Susam Pal |
Re: Build failed in Hudson: Nutch-Nightly #203 |
Tue, 11 Sep, 09:30 |
| Doğacan Güney |
Re: Build failed in Hudson: Nutch-Nightly #203 |
Tue, 11 Sep, 10:43 |
| Hudson (JIRA) |
[jira] Commented: (NUTCH-550) Parse fails if db.max.outlinks.per.page is -1 |
Tue, 11 Sep, 06:39 |