| Nicolás Lichtmaier |
[PATCH] Moving HitDetails construction to a HitDetails constructor (v2). |
Fri, 01 Jun, 20:38 |
| Nicolás Lichtmaier |
Re: [PATCH] Moving HitDetails construction to a HitDetails constructor (v2). |
Sun, 03 Jun, 19:36 |
| Doğacan Güney |
Re: Plugins and Thread Safety |
Mon, 04 Jun, 15:43 |
| Doğacan Güney |
Re: [Fwd: Nutch 0.9 and Crawl-Delay] |
Tue, 05 Jun, 05:59 |
| Doğacan Güney |
Re: [jira] Commented: (NUTCH-496) ConcurrentModificationException can be thrown when getSorted() is called. |
Tue, 05 Jun, 07:29 |
| Doğacan Güney |
Re: [jira] Commented: (NUTCH-496) ConcurrentModificationException can be thrown when getSorted() is called. |
Tue, 05 Jun, 11:54 |
| Doğacan Güney |
Re: Plugins initialized all the time! |
Fri, 08 Jun, 15:30 |
| Doğacan Güney |
=?UTF-8?Q?Re:_Welcome_Do=C4=9Facan_as_Nutch_committer?= |
Tue, 12 Jun, 08:04 |
| Doğacan Güney |
upgrade to hadoop-0.13? |
Mon, 18 Jun, 08:20 |
| Doğacan Güney |
Re: upgrade to hadoop-0.13? |
Mon, 18 Jun, 12:07 |
| Doğacan Güney |
Re: Build failed in Hudson: Nutch-Nightly #123 |
Wed, 20 Jun, 07:07 |
| Doğacan Güney |
Re: Build failed in Hudson: Nutch-Nightly #123 |
Wed, 20 Jun, 13:04 |
| Doğacan Güney |
Re: Build failed in Hudson: Nutch-Nightly #123 |
Wed, 20 Jun, 14:17 |
| Doğacan Güney |
Re: Build failed in Hudson: Nutch-Nightly #123 |
Wed, 20 Jun, 14:19 |
| Doğacan Güney |
Re: Build failed in Hudson: Nutch-Nightly #123 |
Wed, 20 Jun, 15:17 |
| Doğacan Güney |
Re: Found the bug in Generator when number of URLs is small |
Thu, 21 Jun, 07:03 |
| Doğacan Güney |
Re: NUTCH-119 :: how hard to fix |
Wed, 27 Jun, 05:56 |
| Doğacan Güney |
JIRA email question |
Wed, 27 Jun, 07:02 |
| Doğacan Güney |
Re: NUTCH-119 :: how hard to fix |
Thu, 28 Jun, 06:51 |
| Doğacan Güney |
Re: [jira] Commented: (NUTCH-474) Fetcher2 sets server-delay and blocking checks incorrectly |
Thu, 28 Jun, 07:35 |
| Doğacan Güney (JIRA) |
[jira] Commented: (NUTCH-392) OutputFormat implementations should pass on Progressable |
Fri, 01 Jun, 07:53 |
| Doğacan Güney (JIRA) |
[jira] Commented: (NUTCH-392) OutputFormat implementations should pass on Progressable |
Fri, 01 Jun, 10:54 |
| Doğacan Güney (JIRA) |
[jira] Commented: (NUTCH-392) OutputFormat implementations should pass on Progressable |
Sat, 02 Jun, 10:09 |
| Doğacan Güney (JIRA) |
[jira] Commented: (NUTCH-466) Flexible segment format |
Wed, 06 Jun, 12:30 |
| Doğacan Güney (JIRA) |
[jira] Issue Comment Edited: (NUTCH-466) Flexible segment format |
Wed, 06 Jun, 13:08 |
| Doğacan Güney (JIRA) |
[jira] Commented: (NUTCH-356) Plugin repository cache can lead to memory leak |
Fri, 08 Jun, 15:37 |
| Doğacan Güney (JIRA) |
[jira] Commented: (NUTCH-498) Use Combiner in LinkDb to increase speed of linkdb generation |
Fri, 15 Jun, 12:26 |
| Doğacan Güney (JIRA) |
[jira] Commented: (NUTCH-498) Use Combiner in LinkDb to increase speed of linkdb generation |
Fri, 15 Jun, 14:24 |
| Doğacan Güney (JIRA) |
[jira] Commented: (NUTCH-443) allow parsers to return multiple Parse object, this will speed up the rss parser |
Sat, 16 Jun, 09:36 |
| Doğacan Güney (JIRA) |
[jira] Resolved: (NUTCH-495) Unnecessary delays in Fetcher2 |
Sat, 16 Jun, 10:36 |
| Doğacan Güney (JIRA) |
[jira] Created: (NUTCH-499) Refactor LinkDb and LinkDbMerger to reuse code |
Sat, 16 Jun, 11:01 |
| Doğacan Güney (JIRA) |
[jira] Commented: (NUTCH-498) Use Combiner in LinkDb to increase speed of linkdb generation |
Sat, 16 Jun, 11:03 |
| Doğacan Güney (JIRA) |
[jira] Updated: (NUTCH-499) Refactor LinkDb and LinkDbMerger to reuse code |
Sat, 16 Jun, 11:07 |
| Doğacan Güney (JIRA) |
[jira] Assigned: (NUTCH-485) Change HtmlParseFilter 's to return ParseResult object instead of Parse object |
Sat, 16 Jun, 11:15 |
| Doğacan Güney (JIRA) |
[jira] Commented: (NUTCH-485) Change HtmlParseFilter 's to return ParseResult object instead of Parse object |
Sat, 16 Jun, 11:15 |
| Doğacan Güney (JIRA) |
[jira] Updated: (NUTCH-443) allow parsers to return multiple Parse object, this will speed up the rss parser |
Sun, 17 Jun, 09:04 |
| Doğacan Güney (JIRA) |
[jira] Closed: (NUTCH-270) Apply just the applicable portions of the patch to protocol.httpclient.Http.java |
Sun, 17 Jun, 09:09 |
| Doğacan Güney (JIRA) |
[jira] Closed: (NUTCH-476) Would like to add a field to the document class for its MD5 signature |
Sun, 17 Jun, 09:18 |
| Doğacan Güney (JIRA) |
[jira] Resolved: (NUTCH-485) Change HtmlParseFilter 's to return ParseResult object instead of Parse object |
Sun, 17 Jun, 20:29 |
| Doğacan Güney (JIRA) |
[jira] Commented: (NUTCH-444) Possibly use a different library to parse RSS feed for improved performance and compatibility |
Mon, 18 Jun, 06:50 |
| Doğacan Güney (JIRA) |
[jira] Closed: (NUTCH-492) java.lang.OutOfMemoryError while indexing. |
Mon, 18 Jun, 08:57 |
| Doğacan Güney (JIRA) |
[jira] Closed: (NUTCH-493) contentType parse not correctly,,,,got empty content using readseg -get |
Mon, 18 Jun, 09:01 |
| Doğacan Güney (JIRA) |
[jira] Created: (NUTCH-501) implementing a different caching mechanism for objects |
Mon, 18 Jun, 12:04 |
| Doğacan Güney (JIRA) |
[jira] Updated: (NUTCH-501) implementing a different caching mechanism for objects |
Mon, 18 Jun, 12:07 |
| Doğacan Güney (JIRA) |
[jira] Updated: (NUTCH-501) Implement a different caching mechanism for objects cached in configuration |
Mon, 18 Jun, 13:35 |
| Doğacan Güney (JIRA) |
[jira] Commented: (NUTCH-501) Implement a different caching mechanism for objects cached in configuration |
Mon, 18 Jun, 13:52 |
| Doğacan Güney (JIRA) |
[jira] Resolved: (NUTCH-489) URLFilter-suffix management of the url path when the url contains some query parameters |
Mon, 18 Jun, 18:15 |
| Doğacan Güney (JIRA) |
[jira] Created: (NUTCH-502) Bug in SegmentReader causes infinite loop |
Tue, 19 Jun, 06:01 |
| Doğacan Güney (JIRA) |
[jira] Updated: (NUTCH-502) Bug in SegmentReader causes infinite loop |
Tue, 19 Jun, 06:03 |
| Doğacan Güney (JIRA) |
[jira] Commented: (NUTCH-444) Possibly use a different library to parse RSS feed for improved performance and compatibility |
Tue, 19 Jun, 07:22 |
| Doğacan Güney (JIRA) |
[jira] Resolved: (NUTCH-502) Bug in SegmentReader causes infinite loop |
Tue, 19 Jun, 09:22 |
| Doğacan Güney (JIRA) |
[jira] Commented: (NUTCH-444) Possibly use a different library to parse RSS feed for improved performance and compatibility |
Tue, 19 Jun, 14:27 |
| Doğacan Güney (JIRA) |
[jira] Commented: (NUTCH-497) Extreme Nested Tags causes StackOverflowException in DomContentUtils...Spider Trap |
Wed, 20 Jun, 18:09 |
| Doğacan Güney (JIRA) |
[jira] Commented: (NUTCH-444) Possibly use a different library to parse RSS feed for improved performance and compatibility |
Wed, 20 Jun, 18:45 |
| Doğacan Güney (JIRA) |
[jira] Commented: (NUTCH-471) Fix synchronization in NutchBean creation |
Thu, 21 Jun, 12:29 |
| Doğacan Güney (JIRA) |
[jira] Resolved: (NUTCH-471) Fix synchronization in NutchBean creation |
Thu, 21 Jun, 15:18 |
| Doğacan Güney (JIRA) |
[jira] Created: (NUTCH-504) NUTCH-443 broke parsing during fetching |
Fri, 22 Jun, 08:30 |
| Doğacan Güney (JIRA) |
[jira] Updated: (NUTCH-504) NUTCH-443 broke parsing during fetching |
Fri, 22 Jun, 08:32 |
| Doğacan Güney (JIRA) |
[jira] Commented: (NUTCH-504) NUTCH-443 broke parsing during fetching |
Fri, 22 Jun, 08:34 |
| Doğacan Güney (JIRA) |
[jira] Commented: (NUTCH-465) I download nutch 0.9 used tar zxvf nutch-0.9.tar.gz at last A lone zero block |
Fri, 22 Jun, 08:44 |
| Doğacan Güney (JIRA) |
[jira] Commented: (NUTCH-503) Generator exits incorrectly for small fetchlists |
Fri, 22 Jun, 08:51 |
| Doğacan Güney (JIRA) |
[jira] Issue Comment Edited: (NUTCH-503) Generator exits incorrectly for small fetchlists |
Fri, 22 Jun, 08:59 |
| Doğacan Güney (JIRA) |
[jira] Updated: (NUTCH-504) NUTCH-443 broke parsing during fetching |
Fri, 22 Jun, 12:24 |
| Doğacan Güney (JIRA) |
[jira] Commented: (NUTCH-468) Scoring filter should distribute score to all outlinks at once |
Fri, 22 Jun, 14:30 |
| Doğacan Güney (JIRA) |
[jira] Commented: (NUTCH-503) Generator exits incorrectly for small fetchlists |
Fri, 22 Jun, 22:39 |
| Doğacan Güney (JIRA) |
[jira] Commented: (NUTCH-25) needs 'character encoding' detector |
Sat, 23 Jun, 11:06 |
| Doğacan Güney (JIRA) |
[jira] Updated: (NUTCH-501) Implement a different caching mechanism for objects cached in configuration |
Sat, 23 Jun, 13:08 |
| Doğacan Güney (JIRA) |
[jira] Commented: (NUTCH-501) Implement a different caching mechanism for objects cached in configuration |
Sat, 23 Jun, 19:12 |
| Doğacan Güney (JIRA) |
[jira] Created: (NUTCH-505) Outlink urls should be validated |
Sat, 23 Jun, 20:15 |
| Doğacan Güney (JIRA) |
[jira] Updated: (NUTCH-505) Outlink urls should be validated |
Sat, 23 Jun, 20:21 |
| Doğacan Güney (JIRA) |
[jira] Updated: (NUTCH-501) Implement a different caching mechanism for objects cached in configuration |
Sat, 23 Jun, 20:45 |
| Doğacan Güney (JIRA) |
[jira] Resolved: (NUTCH-468) Scoring filter should distribute score to all outlinks at once |
Sun, 24 Jun, 09:30 |
| Doğacan Güney (JIRA) |
[jira] Resolved: (NUTCH-504) NUTCH-443 broke parsing during fetching |
Sun, 24 Jun, 10:05 |
| Doğacan Güney (JIRA) |
[jira] Updated: (NUTCH-505) Outlink urls should be validated |
Sun, 24 Jun, 13:40 |
| Doğacan Güney (JIRA) |
[jira] Updated: (NUTCH-356) Plugin repository cache can lead to memory leak |
Sun, 24 Jun, 19:05 |
| Doğacan Güney (JIRA) |
[jira] Commented: (NUTCH-505) Outlink urls should be validated |
Mon, 25 Jun, 08:09 |
| Doğacan Güney (JIRA) |
[jira] Commented: (NUTCH-499) Refactor LinkDb and LinkDbMerger to reuse code |
Tue, 26 Jun, 12:48 |
| Doğacan Güney (JIRA) |
[jira] Updated: (NUTCH-434) Replace usage of ObjectWritable with something based on GenericWritable |
Tue, 26 Jun, 13:26 |
| Doğacan Güney (JIRA) |
[jira] Commented: (NUTCH-434) Replace usage of ObjectWritable with something based on GenericWritable |
Tue, 26 Jun, 16:44 |
| Doğacan Güney (JIRA) |
[jira] Updated: (NUTCH-434) Replace usage of ObjectWritable with something based on GenericWritable |
Tue, 26 Jun, 17:25 |
| Doğacan Güney (JIRA) |
[jira] Commented: (NUTCH-289) CrawlDatum should store IP address |
Wed, 27 Jun, 06:39 |
| Doğacan Güney (JIRA) |
[jira] Closed: (NUTCH-434) Replace usage of ObjectWritable with something based on GenericWritable |
Wed, 27 Jun, 07:07 |
| Doğacan Güney (JIRA) |
[jira] Resolved: (NUTCH-434) Replace usage of ObjectWritable with something based on GenericWritable |
Wed, 27 Jun, 07:07 |
| Doğacan Güney (JIRA) |
[jira] Resolved: (NUTCH-499) Refactor LinkDb and LinkDbMerger to reuse code |
Wed, 27 Jun, 08:40 |
| Doğacan Güney (JIRA) |
[jira] Closed: (NUTCH-499) Refactor LinkDb and LinkDbMerger to reuse code |
Wed, 27 Jun, 08:40 |
| Doğacan Güney (JIRA) |
[jira] Commented: (NUTCH-498) Use Combiner in LinkDb to increase speed of linkdb generation |
Wed, 27 Jun, 11:00 |
| Doğacan Güney (JIRA) |
[jira] Resolved: (NUTCH-498) Use Combiner in LinkDb to increase speed of linkdb generation |
Wed, 27 Jun, 12:47 |
| Doğacan Güney (JIRA) |
[jira] Closed: (NUTCH-498) Use Combiner in LinkDb to increase speed of linkdb generation |
Wed, 27 Jun, 12:47 |
| Doğacan Güney (JIRA) |
[jira] Commented: (NUTCH-392) OutputFormat implementations should pass on Progressable |
Thu, 28 Jun, 12:18 |
| Doğacan Güney (JIRA) |
[jira] Commented: (NUTCH-392) OutputFormat implementations should pass on Progressable |
Thu, 28 Jun, 12:46 |
| Doğacan Güney (JIRA) |
[jira] Commented: (NUTCH-392) OutputFormat implementations should pass on Progressable |
Thu, 28 Jun, 13:04 |
| Doğacan Güney (JIRA) |
[jira] Updated: (NUTCH-392) OutputFormat implementations should pass on Progressable |
Thu, 28 Jun, 15:59 |
| Doğacan Güney (JIRA) |
[jira] Commented: (NUTCH-392) OutputFormat implementations should pass on Progressable |
Thu, 28 Jun, 15:59 |
| Doğacan Güney (JIRA) |
[jira] Commented: (NUTCH-503) Generator exits incorrectly for small fetchlists |
Fri, 29 Jun, 08:49 |
| Doğacan Güney (JIRA) |
[jira] Created: (NUTCH-506) Nutch should delegate compression to Hadoop |
Fri, 29 Jun, 12:46 |
| Doğacan Güney (JIRA) |
[jira] Updated: (NUTCH-506) Nutch should delegate compression to Hadoop |
Fri, 29 Jun, 12:48 |
| Doğacan Güney (JIRA) |
[jira] Issue Comment Edited: (NUTCH-506) Nutch should delegate compression to Hadoop |
Fri, 29 Jun, 12:51 |
| Nicolás Lichtmaier (JIRA) |
[jira] Commented: (NUTCH-501) Implement a different caching mechanism for objects cached in configuration |
Sat, 23 Jun, 16:35 |
| Andrzej Bialecki |
Re: Plugins and Thread Safety |
Fri, 01 Jun, 16:46 |
| Andrzej Bialecki |
Re: Plugins and Thread Safety |
Fri, 01 Jun, 19:11 |