| hud...@lucene.zones.apache.org |
Hudson build is back to normal: Nutch-Nightly #135 |
Mon, 02 Jul, 04:50 |
| Kai_testing Middleton |
Nutch nightly build and NUTCH-505 draft patch |
Mon, 02 Jul, 06:59 |
| hud...@lucene.zones.apache.org |
Build failed in Hudson: Nutch-Nightly #136 |
Mon, 02 Jul, 07:00 |
| hud...@lucene.zones.apache.org |
Build failed in Hudson: Nutch-Nightly #137 |
Tue, 03 Jul, 07:00 |
| Briggs |
Plans on releasing another bug fix release? |
Tue, 03 Jul, 14:12 |
| David Fuhry |
Patch to skip hidden plugin directories |
Tue, 03 Jul, 17:33 |
| hud...@lucene.zones.apache.org |
Hudson build is back to normal: Nutch-Nightly #138 |
Tue, 03 Jul, 18:07 |
| Andrzej Bialecki |
Re: Plans on releasing another bug fix release? |
Tue, 03 Jul, 19:53 |
| Doug Cutting |
Re: Plans on releasing another bug fix release? |
Tue, 03 Jul, 23:29 |
| Nuther |
Re[2]: Plans on releasing another bug fix release? |
Wed, 04 Jul, 06:12 |
| Andrzej Bialecki |
Re: Plans on releasing another bug fix release? |
Wed, 04 Jul, 06:56 |
| hud...@lucene.zones.apache.org |
Build failed in Hudson: Nutch-Nightly #139 |
Wed, 04 Jul, 07:00 |
| hud...@lucene.zones.apache.org |
Hudson build is back to normal: Nutch-Nightly #140 |
Wed, 04 Jul, 07:29 |
| Andrzej Bialecki |
Re: Plans on releasing another bug fix release? |
Wed, 04 Jul, 09:35 |
| Epo Jemba |
URL Injection with another source than text files |
Wed, 04 Jul, 10:44 |
| Briggs |
Re: Plans on releasing another bug fix release? |
Wed, 04 Jul, 19:04 |
| rubdabadub |
Re: Plans on releasing another bug fix release? |
Thu, 05 Jul, 11:10 |
| Ian Holsman |
Re: Plans on releasing another bug fix release? |
Thu, 05 Jul, 22:30 |
| Briggs |
Re: Plans on releasing another bug fix release? |
Fri, 06 Jul, 16:45 |
| Emmanuel Joke (JIRA) |
[jira] Created: (NUTCH-507) lib-lucene-analyzers jar defintion is wrong in plugin.xml |
Sat, 07 Jul, 17:18 |
| Emmanuel Joke (JIRA) |
[jira] Created: (NUTCH-508) ${hadoop.log.dir} and ${hadoop.log.file} are not propagated to the tasktracker |
Sat, 07 Jul, 17:28 |
| Tsengtan A Shuy |
mozdex as a backend search engine. |
Sat, 07 Jul, 17:42 |
| Emmanuel Joke (JIRA) |
[jira] Updated: (NUTCH-509) Update Crawldb: avoid to start a job if there is no valid segment |
Sun, 08 Jul, 08:04 |
| Emmanuel Joke (JIRA) |
[jira] Created: (NUTCH-509) Update Crawldb: avoid to start a job if there is no valid segment |
Sun, 08 Jul, 08:04 |
| Carl Cerecke |
OPIC scoring differences |
Sun, 08 Jul, 22:38 |
| Doğacan Güney |
Re: OPIC scoring differences |
Mon, 09 Jul, 06:00 |
| Doğacan Güney (JIRA) |
[jira] Commented: (NUTCH-509) Update Crawldb: avoid to start a job if there is no valid segment |
Mon, 09 Jul, 06:05 |
| Emmanuel Joke (JIRA) |
[jira] Commented: (NUTCH-509) Update Crawldb: avoid to start a job if there is no valid segment |
Mon, 09 Jul, 06:16 |
| Doğacan Güney (JIRA) |
[jira] Resolved: (NUTCH-507) lib-lucene-analyzers jar defintion is wrong in plugin.xml |
Mon, 09 Jul, 06:18 |
| Emmanuel Joke (JIRA) |
[jira] Closed: (NUTCH-509) Update Crawldb: avoid to start a job if there is no valid segment |
Mon, 09 Jul, 06:18 |
| Doğacan Güney (JIRA) |
[jira] Closed: (NUTCH-507) lib-lucene-analyzers jar defintion is wrong in plugin.xml |
Mon, 09 Jul, 06:18 |
| Enis Soztutar (JIRA) |
[jira] Created: (NUTCH-510) IndexMerger delete working dir |
Mon, 09 Jul, 06:37 |
| Doğacan Güney (JIRA) |
[jira] Resolved: (NUTCH-503) Generator exits incorrectly for small fetchlists |
Mon, 09 Jul, 06:48 |
| Enis Soztutar (JIRA) |
[jira] Updated: (NUTCH-510) IndexMerger delete working dir |
Mon, 09 Jul, 06:52 |
| anton |
spam detect |
Mon, 09 Jul, 09:33 |
| Andrzej Bialecki |
Re: OPIC scoring differences |
Mon, 09 Jul, 12:28 |
| Enis Soztutar (JIRA) |
[jira] Issue Comment Edited: (NUTCH-510) IndexMerger delete working dir |
Mon, 09 Jul, 12:34 |
| Enis Soztutar (JIRA) |
[jira] Commented: (NUTCH-508) ${hadoop.log.dir} and ${hadoop.log.file} are not propagated to the tasktracker |
Mon, 09 Jul, 13:48 |
| Epo Jemba |
Re: URL Injection with another source than text files |
Mon, 09 Jul, 15:15 |
| Robert Young |
Not renewing CrawlDatum on Inject |
Mon, 09 Jul, 17:27 |
| Andrzej Bialecki |
Re: Not renewing CrawlDatum on Inject |
Mon, 09 Jul, 19:17 |
| Hudson (JIRA) |
[jira] Commented: (NUTCH-503) Generator exits incorrectly for small fetchlists |
Tue, 10 Jul, 04:21 |
| Hudson (JIRA) |
[jira] Commented: (NUTCH-507) lib-lucene-analyzers jar defintion is wrong in plugin.xml |
Tue, 10 Jul, 04:21 |
| Enis Soztutar (JIRA) |
[jira] Updated: (NUTCH-439) Top Level Domains Indexing / Scoring |
Tue, 10 Jul, 07:51 |
| Robert Young |
Re: Not renewing CrawlDatum on Inject |
Tue, 10 Jul, 08:19 |
| Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-439) Top Level Domains Indexing / Scoring |
Tue, 10 Jul, 09:24 |
| Robert Young |
Re: Not renewing CrawlDatum on Inject |
Tue, 10 Jul, 11:32 |
| Doğacan Güney (JIRA) |
[jira] Updated: (NUTCH-505) Outlink urls should be validated |
Tue, 10 Jul, 12:42 |
| Erik Hatcher |
Fwd: [Collex] application#index (ActionController::RoutingError) "no route found to match \"/nines/ escape(document.title) u,\" with {:method=>:get}" |
Tue, 10 Jul, 12:46 |
| Andrzej Bialecki |
Re: Fwd: [Collex] application#index (ActionController::RoutingError) "no route found to match \"/nines/ escape(document.title) u,\" with {:method=>:get}" |
Tue, 10 Jul, 13:36 |
| Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-505) Outlink urls should be validated |
Tue, 10 Jul, 13:51 |
| Enis Soztutar (JIRA) |
[jira] Updated: (NUTCH-439) Top Level Domains Indexing / Scoring |
Tue, 10 Jul, 14:57 |
| Doğacan Güney (JIRA) |
[jira] Updated: (NUTCH-505) Outlink urls should be validated |
Tue, 10 Jul, 19:12 |
| Enis Soztutar (JIRA) |
[jira] Updated: (NUTCH-439) Top Level Domains Indexing / Scoring |
Wed, 11 Jul, 05:57 |
| Enis Soztutar (JIRA) |
[jira] Updated: (NUTCH-439) Top Level Domains Indexing / Scoring |
Wed, 11 Jul, 05:59 |
| Doğacan Güney (JIRA) |
[jira] Issue Comment Edited: (NUTCH-505) Outlink urls should be validated |
Wed, 11 Jul, 06:30 |
| Doğacan Güney |
Re: Nutch nightly build and NUTCH-505 draft patch |
Wed, 11 Jul, 06:55 |
| Doğacan Güney (JIRA) |
[jira] Resolved: (NUTCH-505) Outlink urls should be validated |
Wed, 11 Jul, 10:56 |
| Doğacan Güney (JIRA) |
[jira] Updated: (NUTCH-506) Nutch should delegate compression to Hadoop |
Wed, 11 Jul, 12:04 |
| Doğacan Güney |
Re: OPIC scoring differences |
Wed, 11 Jul, 14:41 |
| Doğacan Güney (JIRA) |
[jira] Resolved: (NUTCH-510) IndexMerger delete working dir |
Wed, 11 Jul, 15:32 |
| Doğacan Güney (JIRA) |
[jira] Closed: (NUTCH-510) IndexMerger delete working dir |
Wed, 11 Jul, 15:32 |
| Andrzej Bialecki |
Re: OPIC scoring differences |
Wed, 11 Jul, 18:14 |
| Hudson (JIRA) |
[jira] Commented: (NUTCH-505) Outlink urls should be validated |
Thu, 12 Jul, 06:50 |
| Hudson (JIRA) |
[jira] Commented: (NUTCH-510) IndexMerger delete working dir |
Thu, 12 Jul, 06:50 |
| Cuongnhc |
how can i fetch a site manual |
Thu, 12 Jul, 06:56 |
| Doğacan Güney (JIRA) |
[jira] Commented: (NUTCH-506) Nutch should delegate compression to Hadoop |
Thu, 12 Jul, 08:49 |
| anuradha (JIRA) |
[jira] Created: (NUTCH-511) Recrawling |
Thu, 12 Jul, 11:40 |
| anuradha (JIRA) |
[jira] Created: (NUTCH-512) Search on date range |
Thu, 12 Jul, 11:40 |
| Doğacan Güney (JIRA) |
[jira] Updated: (NUTCH-505) Outlink urls should be validated |
Thu, 12 Jul, 12:17 |
| Espen Amble Kolstad (JIRA) |
[jira] Commented: (NUTCH-505) Outlink urls should be validated |
Thu, 12 Jul, 12:34 |
| Doğacan Güney (JIRA) |
[jira] Commented: (NUTCH-505) Outlink urls should be validated |
Thu, 12 Jul, 12:40 |
| Doğacan Güney (JIRA) |
[jira] Updated: (NUTCH-505) Outlink urls should be validated |
Thu, 12 Jul, 15:09 |
| Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-505) Outlink urls should be validated |
Thu, 12 Jul, 15:18 |
| Andrzej Bialecki (JIRA) |
[jira] Closed: (NUTCH-511) Recrawling |
Thu, 12 Jul, 15:25 |
| Andrzej Bialecki (JIRA) |
[jira] Closed: (NUTCH-512) Search on date range |
Thu, 12 Jul, 15:33 |
| Doğacan Güney (JIRA) |
[jira] Created: (NUTCH-513) suffix-urlfilter.txt does not have a template |
Thu, 12 Jul, 17:13 |
| Doğacan Güney (JIRA) |
[jira] Commented: (NUTCH-505) Outlink urls should be validated |
Thu, 12 Jul, 18:21 |
| Doğacan Güney (JIRA) |
[jira] Closed: (NUTCH-505) Outlink urls should be validated |
Fri, 13 Jul, 12:28 |
| Doğacan Güney (JIRA) |
[jira] Commented: (NUTCH-513) suffix-urlfilter.txt does not have a template |
Fri, 13 Jul, 12:35 |
| Luca Rondanini |
NUTCH CONSULTANT NEEDED |
Fri, 13 Jul, 15:18 |
| prem kumar |
running nutch of nfs |
Fri, 13 Jul, 16:04 |
| Doğacan Güney (JIRA) |
[jira] Resolved: (NUTCH-513) suffix-urlfilter.txt does not have a template |
Fri, 13 Jul, 17:21 |
| Doğacan Güney (JIRA) |
[jira] Closed: (NUTCH-513) suffix-urlfilter.txt does not have a template |
Fri, 13 Jul, 17:23 |
| Dennis Kubes (JIRA) |
[jira] Reopened: (NUTCH-471) Fix synchronization in NutchBean creation |
Fri, 13 Jul, 20:58 |
| hud...@lucene.zones.apache.org |
Build failed in Hudson: Nutch-Nightly #149 |
Sat, 14 Jul, 04:08 |
| Doğacan Güney (JIRA) |
[jira] Commented: (NUTCH-471) Fix synchronization in NutchBean creation |
Sat, 14 Jul, 09:32 |
| Doğacan Güney (JIRA) |
[jira] Created: (NUTCH-514) Indexer should only index pages with fetch status SUCCESS |
Sat, 14 Jul, 12:10 |
| Doğacan Güney (JIRA) |
[jira] Updated: (NUTCH-514) Indexer should only index pages with fetch status SUCCESS |
Sat, 14 Jul, 12:12 |
| Dennis Kubes (JIRA) |
[jira] Commented: (NUTCH-471) Fix synchronization in NutchBean creation |
Sat, 14 Jul, 13:03 |
| Dennis Kubes (JIRA) |
[jira] Closed: (NUTCH-471) Fix synchronization in NutchBean creation |
Sat, 14 Jul, 13:05 |
| Tsengtan A Shuy |
inject command fail on whole-web run |
Sat, 14 Jul, 19:10 |
| Tsengtan A Shuy |
RE: inject command fail on whole-web run |
Sun, 15 Jul, 00:17 |
| hud...@lucene.zones.apache.org |
Build failed in Hudson: Nutch-Nightly #150 |
Sun, 15 Jul, 04:05 |
| hud...@lucene.zones.apache.org |
Hudson build is back to normal: Nutch-Nightly #151 |
Mon, 16 Jul, 04:17 |
| Shailendra Mudgal |
OOM error during parsing with nekohtml |
Mon, 16 Jul, 10:04 |
| Tsengtan A Shuy |
RE: OOM error during parsing with nekohtml |
Mon, 16 Jul, 10:45 |
| Doğacan Güney (JIRA) |
[jira] Created: (NUTCH-515) Next fetch time is set incorrectly |
Mon, 16 Jul, 12:15 |
| Doğacan Güney (JIRA) |
[jira] Updated: (NUTCH-515) Next fetch time is set incorrectly |
Mon, 16 Jul, 12:17 |
| Doğacan Güney (JIRA) |
[jira] Commented: (NUTCH-439) Top Level Domains Indexing / Scoring |
Mon, 16 Jul, 12:28 |