|
[jira] [Commented] (NUTCH-1476) SegmentReader getStats should set parsed = -1 if no parsing took place |
|
| Hudson (JIRA) |
[jira] [Commented] (NUTCH-1476) SegmentReader getStats should set parsed = -1 if no parsing took place |
Thu, 11 Oct, 21:33 |
| Hudson (JIRA) |
[jira] [Commented] (NUTCH-1476) SegmentReader getStats should set parsed = -1 if no parsing took place |
Fri, 12 Oct, 04:41 |
|
[jira] [Commented] (NUTCH-1383) IndexingFiltersChecker to show error message instead of null pointer exception |
|
| Hudson (JIRA) |
[jira] [Commented] (NUTCH-1383) IndexingFiltersChecker to show error message instead of null pointer exception |
Thu, 11 Oct, 21:33 |
| Hudson (JIRA) |
[jira] [Commented] (NUTCH-1383) IndexingFiltersChecker to show error message instead of null pointer exception |
Fri, 12 Oct, 04:41 |
|
[jira] [Commented] (NUTCH-1252) SegmentReader -get shows wrong data |
|
| Hudson (JIRA) |
[jira] [Commented] (NUTCH-1252) SegmentReader -get shows wrong data |
Thu, 11 Oct, 21:33 |
| Hudson (JIRA) |
[jira] [Commented] (NUTCH-1252) SegmentReader -get shows wrong data |
Fri, 12 Oct, 04:41 |
|
[jira] [Commented] (NUTCH-1433) Upgrade to Tika 1.2 |
|
| kiran (JIRA) |
[jira] [Commented] (NUTCH-1433) Upgrade to Tika 1.2 |
Mon, 15 Oct, 23:17 |
| Markus Jelsma (JIRA) |
[jira] [Commented] (NUTCH-1433) Upgrade to Tika 1.2 |
Tue, 16 Oct, 07:03 |
| Hudson (JIRA) |
[jira] [Commented] (NUTCH-1433) Upgrade to Tika 1.2 |
Sun, 21 Oct, 04:26 |
|
[jira] [Commented] (NUTCH-710) Support for rel="canonical" attribute |
|
| Iwan Luijks (JIRA) |
[jira] [Commented] (NUTCH-710) Support for rel="canonical" attribute |
Wed, 17 Oct, 07:44 |
| Julien Nioche (JIRA) |
[jira] [Commented] (NUTCH-710) Support for rel="canonical" attribute |
Wed, 17 Oct, 08:28 |
| Iwan Luijks (JIRA) |
[jira] [Commented] (NUTCH-710) Support for rel="canonical" attribute |
Wed, 17 Oct, 11:54 |
| Iwan Luijks (JIRA) |
[jira] [Comment Edited] (NUTCH-710) Support for rel="canonical" attribute |
Wed, 17 Oct, 11:54 |
| Apache Jenkins Server |
Build failed in Jenkins: Nutch-nutchgora #382 |
Thu, 18 Oct, 09:55 |
| Apache Jenkins Server |
Jenkins build is back to normal : Nutch-nutchgora #383 |
Fri, 19 Oct, 08:52 |
| kiran (JIRA) |
[jira] [Created] (NUTCH-1478) Parse-metatags and index-metadata plugin for Nutch 2.x series |
Thu, 18 Oct, 21:10 |
|
[jira] [Updated] (NUTCH-1478) Parse-metatags and index-metadata plugin for Nutch 2.x series |
|
| kiran (JIRA) |
[jira] [Updated] (NUTCH-1478) Parse-metatags and index-metadata plugin for Nutch 2.x series |
Thu, 18 Oct, 21:12 |
| kiran (JIRA) |
[jira] [Updated] (NUTCH-1478) Parse-metatags and index-metadata plugin for Nutch 2.x series |
Thu, 18 Oct, 21:42 |
| kiran (JIRA) |
[jira] [Updated] (NUTCH-1478) Parse-metatags and index-metadata plugin for Nutch 2.x series |
Thu, 18 Oct, 22:06 |
| kiran (JIRA) |
[jira] [Updated] (NUTCH-1478) Parse-metatags and index-metadata plugin for Nutch 2.x series |
Thu, 18 Oct, 22:06 |
| kiran (JIRA) |
[jira] [Updated] (NUTCH-1478) Parse-metatags and index-metadata plugin for Nutch 2.x series |
Fri, 19 Oct, 15:32 |
| kiran (JIRA) |
[jira] [Updated] (NUTCH-1478) Parse-metatags and index-metadata plugin for Nutch 2.x series |
Fri, 19 Oct, 15:32 |
| James Sullivan (JIRA) |
[jira] [Created] (NUTCH-1479) nutch readhostdb and updatehostdb do not work with MySQL |
Fri, 19 Oct, 22:18 |
| Julien Nioche (JIRA) |
[jira] [Resolved] (NUTCH-1087) Deprecate crawl command and replace with example script |
Sat, 20 Oct, 08:52 |
| Julien Nioche (JIRA) |
[jira] [Resolved] (NUTCH-1433) Upgrade to Tika 1.2 |
Sat, 20 Oct, 09:16 |
| Hudson (JIRA) |
[jira] [Commented] (NUTCH-1087) Deprecate crawl command and replace with example script |
Sun, 21 Oct, 04:26 |
| James Sullivan (JIRA) |
[jira] [Closed] (NUTCH-1479) nutch readhostdb and updatehostdb do not work with MySQL |
Mon, 22 Oct, 00:46 |
| Markus Jelsma (JIRA) |
[jira] [Commented] (NUTCH-1377) Add option to index via CloudSolrServer instead |
Mon, 22 Oct, 14:46 |
| Markus Jelsma (JIRA) |
[jira] [Created] (NUTCH-1480) SolrIndexer to write to multiple servers. |
Mon, 22 Oct, 14:50 |
|
[jira] [Commented] (NUTCH-1422) reset signature for redirects |
|
| Markus Jelsma (JIRA) |
[jira] [Commented] (NUTCH-1422) reset signature for redirects |
Tue, 23 Oct, 09:35 |
| Lewis John McGibbney (JIRA) |
[jira] [Commented] (NUTCH-1422) reset signature for redirects |
Tue, 23 Oct, 13:17 |
| Markus Jelsma (JIRA) |
[jira] [Resolved] (NUTCH-1215) UpdateDB should not require segment as input |
Tue, 23 Oct, 09:47 |
|
[jira] [Commented] (NUTCH-1341) NotModified time set to now but page not modified |
|
| Markus Jelsma (JIRA) |
[jira] [Commented] (NUTCH-1341) NotModified time set to now but page not modified |
Tue, 23 Oct, 09:51 |
| Lewis John McGibbney (JIRA) |
[jira] [Commented] (NUTCH-1341) NotModified time set to now but page not modified |
Tue, 23 Oct, 13:25 |
| Hudson (JIRA) |
[jira] [Commented] (NUTCH-1341) NotModified time set to now but page not modified |
Tue, 23 Oct, 14:05 |
| Hudson (JIRA) |
[jira] [Commented] (NUTCH-1341) NotModified time set to now but page not modified |
Thu, 25 Oct, 03:48 |
|
[jira] [Commented] (NUTCH-1215) UpdateDB should not require segment as input |
|
| Hudson (JIRA) |
[jira] [Commented] (NUTCH-1215) UpdateDB should not require segment as input |
Tue, 23 Oct, 10:43 |
| Hudson (JIRA) |
[jira] [Commented] (NUTCH-1215) UpdateDB should not require segment as input |
Thu, 25 Oct, 03:48 |
| Markus Jelsma (JIRA) |
[jira] [Resolved] (NUTCH-1341) NotModified time set to now but page not modified |
Tue, 23 Oct, 13:29 |
| Sebastian Nagel (JIRA) |
[jira] [Resolved] (NUTCH-1421) RegexURLNormalizer to only skip rules with invalid patterns |
Tue, 23 Oct, 20:55 |
|
[jira] [Commented] (NUTCH-1421) RegexURLNormalizer to only skip rules with invalid patterns |
|
| Hudson (JIRA) |
[jira] [Commented] (NUTCH-1421) RegexURLNormalizer to only skip rules with invalid patterns |
Tue, 23 Oct, 21:20 |
| Hudson (JIRA) |
[jira] [Commented] (NUTCH-1421) RegexURLNormalizer to only skip rules with invalid patterns |
Wed, 24 Oct, 23:28 |
| Hudson (JIRA) |
[jira] [Commented] (NUTCH-1421) RegexURLNormalizer to only skip rules with invalid patterns |
Thu, 25 Oct, 03:48 |
| Markus Jelsma (JIRA) |
[jira] [Assigned] (NUTCH-1480) SolrIndexer to write to multiple servers. |
Wed, 24 Oct, 10:54 |
| Markus Jelsma (JIRA) |
[jira] [Work stopped] (NUTCH-1480) SolrIndexer to write to multiple servers. |
Wed, 24 Oct, 10:54 |
| Markus Jelsma (JIRA) |
[jira] [Work started] (NUTCH-1480) SolrIndexer to write to multiple servers. |
Wed, 24 Oct, 10:54 |
| Markus Jelsma (JIRA) |
[jira] [Updated] (NUTCH-1480) SolrIndexer to write to multiple servers. |
Wed, 24 Oct, 11:00 |
| Apache Jenkins Server |
Build failed in Jenkins: Nutch-nutchgora #387 |
Wed, 24 Oct, 23:26 |
| Apache Jenkins Server |
Jenkins build is back to normal : Nutch-nutchgora #388 |
Fri, 26 Oct, 04:23 |
|
[jira] [Updated] (NUTCH-1477) NPE when injecting with DataFileAvroStore |
|
| Julien Nioche (JIRA) |
[jira] [Updated] (NUTCH-1477) NPE when injecting with DataFileAvroStore |
Thu, 25 Oct, 13:49 |
| Julien Nioche (JIRA) |
[jira] [Updated] (NUTCH-1477) NPE when injecting with DataFileAvroStore |
Thu, 25 Oct, 13:51 |
| Julien Nioche (JIRA) |
[jira] [Updated] (NUTCH-1477) NPE when injecting with DataFileAvroStore |
Thu, 25 Oct, 14:13 |
| Alex diNorcia |
misbehaving crawler |
Thu, 25 Oct, 15:59 |
| Lewis John Mcgibbney |
Re: misbehaving crawler |
Fri, 26 Oct, 13:37 |
| Markus Jelsma |
RE: misbehaving crawler |
Fri, 26 Oct, 13:56 |
| Arni Sumarlidason (JIRA) |
[jira] [Created] (NUTCH-1481) When using MySQL as storage unicode characters within URLS cause nutch to fail |
Fri, 26 Oct, 01:31 |
| Nishikawa Muñumer, Alfonso |
Unsubscription |
Fri, 26 Oct, 06:18 |
|
[jira] [Commented] (NUTCH-1481) When using MySQL as storage unicode characters within URLS cause nutch to fail |
|
| Lewis John McGibbney (JIRA) |
[jira] [Commented] (NUTCH-1481) When using MySQL as storage unicode characters within URLS cause nutch to fail |
Fri, 26 Oct, 13:31 |
| Arni Sumarlidason (JIRA) |
[jira] [Commented] (NUTCH-1481) When using MySQL as storage unicode characters within URLS cause nutch to fail |
Fri, 26 Oct, 13:49 |
| Arni Sumarlidason (JIRA) |
[jira] [Commented] (NUTCH-1481) When using MySQL as storage unicode characters within URLS cause nutch to fail |
Fri, 26 Oct, 19:35 |
| Roberto Gardenier (JIRA) |
[jira] [Updated] (NUTCH-585) [PARSE-HTML plugin] Block certain parts of HTML code from being indexed |
Mon, 29 Oct, 14:20 |
| Julien Nioche (JIRA) |
[jira] [Created] (NUTCH-1482) Rename HTMLParseFilter |
Mon, 29 Oct, 16:00 |
|
[jira] [Commented] (NUTCH-1482) Rename HTMLParseFilter |
|
| Lewis John McGibbney (JIRA) |
[jira] [Commented] (NUTCH-1482) Rename HTMLParseFilter |
Mon, 29 Oct, 16:04 |
| Sebastian Nagel (JIRA) |
[jira] [Commented] (NUTCH-1482) Rename HTMLParseFilter |
Mon, 29 Oct, 16:56 |
| Markus Jelsma (JIRA) |
[jira] [Commented] (NUTCH-1482) Rename HTMLParseFilter |
Mon, 29 Oct, 17:06 |
| Sebastian Nagel (JIRA) |
[jira] [Commented] (NUTCH-1482) Rename HTMLParseFilter |
Mon, 29 Oct, 19:50 |
| Julien Nioche (JIRA) |
[jira] [Commented] (NUTCH-1482) Rename HTMLParseFilter |
Wed, 31 Oct, 08:59 |
|
[jira] [Updated] (NUTCH-1245) URL gone with 404 after db.fetch.interval.max stays db_unfetched in CrawlDb and is generated over and over again |
|
| Sebastian Nagel (JIRA) |
[jira] [Updated] (NUTCH-1245) URL gone with 404 after db.fetch.interval.max stays db_unfetched in CrawlDb and is generated over and over again |
Mon, 29 Oct, 16:16 |
| Sebastian Nagel (JIRA) |
[jira] [Updated] (NUTCH-1245) URL gone with 404 after db.fetch.interval.max stays db_unfetched in CrawlDb and is generated over and over again |
Mon, 29 Oct, 17:02 |
| Sebastian Nagel (JIRA) |
[jira] [Updated] (NUTCH-1245) URL gone with 404 after db.fetch.interval.max stays db_unfetched in CrawlDb and is generated over and over again |
Mon, 29 Oct, 22:49 |
| Lewis John McGibbney (JIRA) |
[jira] [Assigned] (NUTCH-1370) Expose exact number of urls injected @runtime |
Mon, 29 Oct, 16:20 |
| Lewis John Mcgibbney |
NUTCH-1370 |
Mon, 29 Oct, 16:22 |
| Lewis John Mcgibbney |
Re: NUTCH-1370 |
Mon, 29 Oct, 16:38 |
| Julien Nioche |
Re: NUTCH-1370 |
Mon, 29 Oct, 16:52 |
| Lewis John Mcgibbney |
Re: NUTCH-1370 |
Mon, 29 Oct, 16:57 |
| Julien Nioche |
Re: NUTCH-1370 |
Tue, 30 Oct, 08:27 |
| Lewis John Mcgibbney |
Re: NUTCH-1370 |
Tue, 30 Oct, 22:26 |
| Sebastian Nagel (JIRA) |
[jira] [Updated] (NUTCH-578) URL fetched with 403 is generated over and over again |
Mon, 29 Oct, 23:20 |