| Enis Soztutar (JIRA) |
[jira] Created: (NUTCH-518) Fix OpicScoringFilter to respect scoring filter chaining |
Wed, 18 Jul, 08:16 |
| Enis Soztutar (JIRA) |
[jira] Updated: (NUTCH-518) Fix OpicScoringFilter to respect scoring filter chaining |
Wed, 18 Jul, 08:18 |
| Enis Soztutar (JIRA) |
[jira] Commented: (NUTCH-439) Top Level Domains Indexing / Scoring |
Wed, 18 Jul, 08:33 |
| Enis Soztutar (JIRA) |
[jira] Updated: (NUTCH-439) Top Level Domains Indexing / Scoring |
Wed, 18 Jul, 08:35 |
| Enis Soztutar (JIRA) |
[jira] Commented: (NUTCH-518) Fix OpicScoringFilter to respect scoring filter chaining |
Thu, 19 Jul, 06:12 |
| Enis Soztutar (JIRA) |
[jira] Commented: (NUTCH-518) Fix OpicScoringFilter to respect scoring filter chaining |
Thu, 19 Jul, 06:46 |
| Enis Soztutar (JIRA) |
[jira] Updated: (NUTCH-439) Top Level Domains Indexing / Scoring |
Fri, 27 Jul, 08:06 |
| Enis Soztutar (JIRA) |
[jira] Commented: (NUTCH-439) Top Level Domains Indexing / Scoring |
Fri, 27 Jul, 08:12 |
| Epo Jemba |
URL Injection with another source than text files |
Wed, 04 Jul, 10:44 |
| Epo Jemba |
Re: URL Injection with another source than text files |
Mon, 09 Jul, 15:15 |
| Erik Hatcher |
Fwd: [Collex] application#index (ActionController::RoutingError) "no route found to match \"/nines/ escape(document.title) u,\" with {:method=>:get}" |
Tue, 10 Jul, 12:46 |
| Espen Amble Kolstad (JIRA) |
[jira] Commented: (NUTCH-505) Outlink urls should be validated |
Thu, 12 Jul, 12:34 |
| Hal Finkel (JIRA) |
[jira] Updated: (NUTCH-523) web2 searchform problems with patch |
Sat, 21 Jul, 23:58 |
| Hal Finkel (JIRA) |
[jira] Created: (NUTCH-523) web2 searchform problems with patch |
Sat, 21 Jul, 23:58 |
| Hudson (JIRA) |
[jira] Commented: (NUTCH-503) Generator exits incorrectly for small fetchlists |
Tue, 10 Jul, 04:21 |
| Hudson (JIRA) |
[jira] Commented: (NUTCH-507) lib-lucene-analyzers jar defintion is wrong in plugin.xml |
Tue, 10 Jul, 04:21 |
| Hudson (JIRA) |
[jira] Commented: (NUTCH-505) Outlink urls should be validated |
Thu, 12 Jul, 06:50 |
| Hudson (JIRA) |
[jira] Commented: (NUTCH-510) IndexMerger delete working dir |
Thu, 12 Jul, 06:50 |
| Hudson (JIRA) |
[jira] Commented: (NUTCH-506) Nutch should delegate compression to Hadoop |
Wed, 18 Jul, 04:20 |
| Hudson (JIRA) |
[jira] Commented: (NUTCH-515) Next fetch time is set incorrectly |
Wed, 18 Jul, 04:20 |
| Hudson (JIRA) |
[jira] Commented: (NUTCH-517) build encoding should be UTF-8 |
Thu, 19 Jul, 04:27 |
| Hudson (JIRA) |
[jira] Commented: (NUTCH-518) Fix OpicScoringFilter to respect scoring filter chaining |
Thu, 19 Jul, 04:27 |
| Hudson (JIRA) |
[jira] Commented: (NUTCH-516) Next fetch time is not set when it is a CrawlDatum.STATUS_FETCH_GONE |
Fri, 27 Jul, 04:25 |
| Hudson (JIRA) |
[jira] Commented: (NUTCH-525) DeleteDuplicates generates ArrayIndexOutOfBoundsException when trying to rerun dedup on a segment |
Fri, 27 Jul, 04:25 |
| Hudson (JIRA) |
[jira] Commented: (NUTCH-514) Indexer should only index pages with fetch status SUCCESS |
Tue, 31 Jul, 04:19 |
| Hudson (JIRA) |
[jira] Commented: (NUTCH-533) LinkDbMerger: url normalized is not updated in the key and inlinks list |
Wed, 01 Aug, 05:23 |
| Ian Holsman |
Re: Plans on releasing another bug fix release? |
Thu, 05 Jul, 22:30 |
| Ian Holsman (JIRA) |
[jira] Commented: (NUTCH-524) Generate Problem with Single Node |
Tue, 24 Jul, 17:35 |
| Kai_testing Middleton |
Nutch nightly build and NUTCH-505 draft patch |
Mon, 02 Jul, 06:59 |
| Kai_testing Middleton |
Re: OOM error during parsing with nekohtml |
Mon, 16 Jul, 15:43 |
| Kai_testing Middleton |
Re: no nutch script file under bin directory |
Wed, 18 Jul, 16:41 |
| Kai_testing Middleton |
Re: no nutch script file under bin directory |
Wed, 18 Jul, 18:34 |
| Kai_testing Middleton |
Re: no nutch script file under bin directory |
Wed, 18 Jul, 19:39 |
| Le Quoc Anh |
Error indexer |
Sun, 29 Jul, 09:13 |
| Luca Rondanini |
NUTCH CONSULTANT NEEDED |
Fri, 13 Jul, 15:18 |
| Nathan Wilkinson |
searchserver failover problem |
Tue, 24 Jul, 02:44 |
| Nuther |
Re[2]: Plans on releasing another bug fix release? |
Wed, 04 Jul, 06:12 |
| Rob Young (JIRA) |
[jira] Created: (NUTCH-521) Modified injector to allow newly injected CrawlDatum to overwrite original |
Thu, 19 Jul, 09:51 |
| Rob Young (JIRA) |
[jira] Updated: (NUTCH-521) Modified injector to allow newly injected CrawlDatum to overwrite original |
Thu, 19 Jul, 09:51 |
| Rob Young (JIRA) |
[jira] Created: (NUTCH-527) MapWritable doesn't support all hadoops writable types |
Wed, 25 Jul, 11:03 |
| Rob Young (JIRA) |
[jira] Updated: (NUTCH-527) MapWritable doesn't support all hadoops writable types |
Wed, 25 Jul, 11:07 |
| Rob Young (JIRA) |
[jira] Updated: (NUTCH-527) MapWritable doesn't support all hadoops writable types |
Wed, 25 Jul, 11:45 |
| Robert Young |
Not renewing CrawlDatum on Inject |
Mon, 09 Jul, 17:27 |
| Robert Young |
Re: Not renewing CrawlDatum on Inject |
Tue, 10 Jul, 08:19 |
| Robert Young |
Re: Not renewing CrawlDatum on Inject |
Tue, 10 Jul, 11:32 |
| Robert Young |
Looking to fix relative path issue in linkdb |
Thu, 19 Jul, 10:06 |
| Robert Young |
Re: Looking to fix relative path issue in linkdb |
Thu, 19 Jul, 11:53 |
| Robert Young |
Re: Looking to fix relative path issue in linkdb |
Thu, 19 Jul, 17:16 |
| Robert Young |
Re: [jira] Commented: (NUTCH-527) MapWritable doesn't support all hadoops writable types |
Wed, 25 Jul, 17:39 |
| Robert Young |
Re: [jira] Commented: (NUTCH-527) MapWritable doesn't support all hadoops writable types |
Thu, 26 Jul, 10:51 |
| Shailendra Mudgal |
OOM error during parsing with nekohtml |
Mon, 16 Jul, 10:04 |
| Shailendra Mudgal |
Re: OOM error during parsing with nekohtml |
Tue, 17 Jul, 04:53 |
| Shailendra Mudgal |
Re: OOM error during parsing with nekohtml |
Thu, 19 Jul, 13:26 |
| Tsengtan A Shuy |
mozdex as a backend search engine. |
Sat, 07 Jul, 17:42 |
| Tsengtan A Shuy |
inject command fail on whole-web run |
Sat, 14 Jul, 19:10 |
| Tsengtan A Shuy |
RE: inject command fail on whole-web run |
Sun, 15 Jul, 00:17 |
| Tsengtan A Shuy |
RE: OOM error during parsing with nekohtml |
Mon, 16 Jul, 10:45 |
| Tsengtan A Shuy |
RE: OOM error during parsing with nekohtml |
Mon, 16 Jul, 16:37 |
| Tsengtan A Shuy |
no nutch script file under bin directory |
Tue, 17 Jul, 19:22 |
| Tsengtan A Shuy |
RE: no nutch script file under bin directory |
Tue, 17 Jul, 19:32 |
| Tsengtan A Shuy |
RE: no nutch script file under bin directory |
Wed, 18 Jul, 00:30 |
| Tsengtan A Shuy |
RE: no nutch script file under bin directory |
Wed, 18 Jul, 17:21 |
| Tsengtan A Shuy |
RE: no nutch script file under bin directory |
Wed, 18 Jul, 18:59 |
| Tsengtan A Shuy |
ready for the first assignment |
Wed, 18 Jul, 22:12 |
| Vishal Shah (JIRA) |
[jira] Updated: (NUTCH-525) DeleteDuplicates generates ArrayIndexOutOfBoundsException when trying to rerun dedup on a segment |
Tue, 24 Jul, 07:43 |
| Vishal Shah (JIRA) |
[jira] Created: (NUTCH-525) DeleteDuplicates generates ArrayIndexOutOfBoundsException when trying to rerun dedup on a segment |
Tue, 24 Jul, 07:43 |
| Vishal Shah (JIRA) |
[jira] Commented: (NUTCH-525) DeleteDuplicates generates ArrayIndexOutOfBoundsException when trying to rerun dedup on a segment |
Tue, 24 Jul, 08:20 |
| Vishal Shah (JIRA) |
[jira] Updated: (NUTCH-525) DeleteDuplicates generates ArrayIndexOutOfBoundsException when trying to rerun dedup on a segment |
Tue, 24 Jul, 09:41 |
| anton |
spam detect |
Mon, 09 Jul, 09:33 |
| anuradha (JIRA) |
[jira] Created: (NUTCH-511) Recrawling |
Thu, 12 Jul, 11:40 |
| anuradha (JIRA) |
[jira] Created: (NUTCH-512) Search on date range |
Thu, 12 Jul, 11:40 |
| hud...@lucene.zones.apache.org |
Hudson build is back to normal: Nutch-Nightly #135 |
Mon, 02 Jul, 04:50 |
| hud...@lucene.zones.apache.org |
Build failed in Hudson: Nutch-Nightly #136 |
Mon, 02 Jul, 07:00 |
| hud...@lucene.zones.apache.org |
Build failed in Hudson: Nutch-Nightly #137 |
Tue, 03 Jul, 07:00 |
| hud...@lucene.zones.apache.org |
Hudson build is back to normal: Nutch-Nightly #138 |
Tue, 03 Jul, 18:07 |
| hud...@lucene.zones.apache.org |
Build failed in Hudson: Nutch-Nightly #139 |
Wed, 04 Jul, 07:00 |
| hud...@lucene.zones.apache.org |
Hudson build is back to normal: Nutch-Nightly #140 |
Wed, 04 Jul, 07:29 |
| hud...@lucene.zones.apache.org |
Build failed in Hudson: Nutch-Nightly #149 |
Sat, 14 Jul, 04:08 |
| hud...@lucene.zones.apache.org |
Build failed in Hudson: Nutch-Nightly #150 |
Sun, 15 Jul, 04:05 |
| hud...@lucene.zones.apache.org |
Hudson build is back to normal: Nutch-Nightly #151 |
Mon, 16 Jul, 04:17 |
| prem kumar |
running nutch of nfs |
Fri, 13 Jul, 16:04 |
| prem kumar |
resending this query on running nutch on nfs |
Thu, 19 Jul, 07:31 |
| rubdabadub |
Re: Plans on releasing another bug fix release? |
Thu, 05 Jul, 11:10 |