| Sami Siren (JIRA) |
[jira] Commented: (NUTCH-418) Fixes parsing of XHTML (e.g. title) |
Thu, 21 Dec, 14:49 |
| Eelco Lempsink (JIRA) |
[jira] Updated: (NUTCH-273) When a page is redirected, the original url is NOT updated. |
Fri, 22 Dec, 09:39 |
| lukai |
Re: [jira] Updated: (NUTCH-273) When a page is redirected, the original url is NOT updated. |
Sun, 24 Dec, 07:34 |
| Carsten Lehmann (JIRA) |
[jira] Created: (NUTCH-419) unavailable robots.txt kills fetch |
Sun, 24 Dec, 12:45 |
|
[jira] Updated: (NUTCH-419) unavailable robots.txt kills fetch |
|
| Carsten Lehmann (JIRA) |
[jira] Updated: (NUTCH-419) unavailable robots.txt kills fetch |
Sun, 24 Dec, 13:01 |
| Carsten Lehmann (JIRA) |
[jira] Updated: (NUTCH-419) unavailable robots.txt kills fetch |
Sun, 24 Dec, 13:10 |
| Carsten Lehmann (JIRA) |
[jira] Updated: (NUTCH-419) unavailable robots.txt kills fetch |
Sun, 24 Dec, 13:10 |
| Carsten Lehmann (JIRA) |
[jira] Commented: (NUTCH-419) unavailable robots.txt kills fetch |
Sun, 24 Dec, 13:26 |
| Dogacan Güney (JIRA) |
[jira] Created: (NUTCH-420) DeleteDuplicates.HashPartitioner depends on the order of IndexDocs |
Tue, 26 Dec, 11:30 |
| Dogacan Güney (JIRA) |
[jira] Updated: (NUTCH-420) DeleteDuplicates.HashPartitioner depends on the order of IndexDocs |
Tue, 26 Dec, 11:32 |
| Alan Tanaman (JIRA) |
[jira] Created: (NUTCH-421) Allow predeterminate running order of index filters |
Wed, 27 Dec, 13:57 |
|
[jira] Updated: (NUTCH-421) Allow predeterminate running order of index filters |
|
| Alan Tanaman (JIRA) |
[jira] Updated: (NUTCH-421) Allow predeterminate running order of index filters |
Wed, 27 Dec, 14:01 |
| Alan Tanaman (JIRA) |
[jira] Updated: (NUTCH-421) Allow predeterminate running order of index filters |
Wed, 27 Dec, 15:11 |
| Alan Tanaman (JIRA) |
[jira] Updated: (NUTCH-421) Allow predeterminate running order of index filters |
Wed, 27 Dec, 15:11 |
| Andrzej Bialecki (JIRA) |
[jira] Closed: (NUTCH-415) Generate should mark selected records in crawlDB |
Thu, 28 Dec, 00:10 |
| Andrzej Bialecki (JIRA) |
[jira] Closed: (NUTCH-416) CrawlDatum status and CrawlDbReducer refactoring |
Thu, 28 Dec, 00:14 |
| Andrzej Bialecki (JIRA) |
[jira] Closed: (NUTCH-322) Fetcher discards ProtocolStatus, doesn't store redirected pages |
Thu, 28 Dec, 00:18 |
| Andrzej Bialecki (JIRA) |
[jira] Closed: (NUTCH-273) When a page is redirected, the original url is NOT updated. |
Thu, 28 Dec, 00:18 |
| Andrzej Bialecki (JIRA) |
[jira] Closed: (NUTCH-274) Empty row in/at end of URL-list results in error |
Thu, 28 Dec, 00:22 |
| Alan Tanaman |
RE: Issue with Boosting Fields |
Thu, 28 Dec, 13:39 |
| Doğacan Güney |
linkdb bug |
Thu, 28 Dec, 16:15 |
| Andrzej Bialecki |
Re: linkdb bug |
Thu, 28 Dec, 19:04 |
| Doğacan Güney |
Re: linkdb bug |
Fri, 29 Dec, 10:01 |
| Andrzej Bialecki |
Re: linkdb bug |
Sat, 30 Dec, 19:19 |
| Alan Tanaman (JIRA) |
[jira] Created: (NUTCH-422) index-extra plugin creates additional fields in the index, based on configurable logic |
Thu, 28 Dec, 19:23 |
| Alan Tanaman (JIRA) |
[jira] Updated: (NUTCH-422) index-extra plugin creates additional fields in the index, based on configurable logic |
Thu, 28 Dec, 19:25 |
| st...@archive.org (JIRA) |
[jira] Created: (NUTCH-423) Add other index-basic fields as query plugins |
Fri, 29 Dec, 00:46 |
| st...@archive.org (JIRA) |
[jira] Updated: (NUTCH-423) Add other index-basic fields as query plugins |
Fri, 29 Dec, 00:48 |