| nutch.newbie (JIRA) |
[jira] Commented: (NUTCH-444) Possibly use a different library to parse RSS feed for improved performance and compatibility |
Tue, 13 Feb, 09:44 |
| Gal Nitzan |
NPE in org.apache.hadoop.io.SequenceFile$Sorter$MergeQueue |
Tue, 13 Feb, 11:41 |
| nutch.newbie (JIRA) |
[jira] Commented: (NUTCH-444) Possibly use a different library to parse RSS feed for improved performance and compatibility |
Tue, 13 Feb, 14:04 |
| nutch.newbie (JIRA) |
[jira] Commented: (NUTCH-443) allow parsers to return multiple Parse object, this will speed up the rss parser |
Tue, 13 Feb, 14:12 |
| Chris A. Mattmann (JIRA) |
[jira] Commented: (NUTCH-443) allow parsers to return multiple Parse object, this will speed up the rss parser |
Tue, 13 Feb, 14:57 |
| Chris A. Mattmann (JIRA) |
[jira] Assigned: (NUTCH-444) Possibly use a different library to parse RSS feed for improved performance and compatibility |
Tue, 13 Feb, 15:01 |
| Chris A. Mattmann (JIRA) |
[jira] Resolved: (NUTCH-258) Once Nutch logs a SEVERE log item, Nutch fails forevermore |
Tue, 13 Feb, 15:03 |
| Chris A. Mattmann (JIRA) |
[jira] Closed: (NUTCH-258) Once Nutch logs a SEVERE log item, Nutch fails forevermore |
Tue, 13 Feb, 15:03 |
| nutch.newbie (JIRA) |
[jira] Commented: (NUTCH-443) allow parsers to return multiple Parse object, this will speed up the rss parser |
Tue, 13 Feb, 15:05 |
| Renaud Richardet (JIRA) |
[jira] Commented: (NUTCH-443) allow parsers to return multiple Parse object, this will speed up the rss parser |
Tue, 13 Feb, 16:00 |
| Doug Cutting (JIRA) |
[jira] Commented: (NUTCH-443) allow parsers to return multiple Parse object, this will speed up the rss parser |
Tue, 13 Feb, 18:40 |
| Doug Cutting |
log guards |
Tue, 13 Feb, 18:47 |
| Jérôme Charron |
Re: log guards |
Tue, 13 Feb, 19:17 |
| st...@archive.org (JIRA) |
[jira] Commented: (NUTCH-437) MapFile in Hadoop 0.10.2 has changed, must update references |
Tue, 13 Feb, 19:33 |
| Chris Mattmann |
Re: log guards |
Tue, 13 Feb, 19:36 |
| Dennis Kubes (JIRA) |
[jira] Updated: (NUTCH-437) MapFile in Hadoop Trunk has changed, must update references |
Tue, 13 Feb, 19:51 |
| Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-437) MapFile in Hadoop Trunk has changed, must update references |
Tue, 13 Feb, 19:53 |
| Andrzej Bialecki (JIRA) |
[jira] Assigned: (NUTCH-437) MapFile in Hadoop Trunk has changed, must update references |
Tue, 13 Feb, 19:53 |
| Dennis Kubes |
Re: NPE in org.apache.hadoop.io.SequenceFile$Sorter$MergeQueue |
Tue, 13 Feb, 19:53 |
| Rakesh Reddy |
Personalization of Search Results |
Tue, 13 Feb, 19:58 |
| Jérôme Charron |
Re: log guards |
Tue, 13 Feb, 20:17 |
| Chris A. Mattmann (JIRA) |
[jira] Assigned: (NUTCH-309) Uses commons logging Code Guards |
Tue, 13 Feb, 20:27 |
| Dennis Kubes |
Re: NPE in org.apache.hadoop.io.SequenceFile$Sorter$MergeQueue |
Tue, 13 Feb, 21:09 |
| Gal Nitzan |
RE: NPE in org.apache.hadoop.io.SequenceFile$Sorter$MergeQueue |
Tue, 13 Feb, 21:32 |
| Nick Lothian (JIRA) |
[jira] Commented: (NUTCH-444) Possibly use a different library to parse RSS feed for improved performance and compatibility |
Tue, 13 Feb, 22:35 |
| HUYLEBROECK Jeremy RD-ILAB-SSF |
RE: [jira] Commented: (NUTCH-444) Possibly use a different library toparse RSS feed for improved performance and compatibility |
Tue, 13 Feb, 22:45 |
| Armel T. Nene |
RE: NPE in org.apache.hadoop.io.SequenceFile$Sorter$MergeQueue |
Tue, 13 Feb, 23:45 |
| ????? ??????? |
How to get score in search.jsp |
Wed, 14 Feb, 07:00 |
| Anton Potekhin |
How to get score in search.jsp |
Wed, 14 Feb, 07:47 |
| Armel Nene (JIRA) |
[jira] Commented: (NUTCH-437) MapFile in Hadoop Trunk has changed, must update references |
Wed, 14 Feb, 12:17 |
| Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-443) allow parsers to return multiple Parse object, this will speed up the rss parser |
Wed, 14 Feb, 15:15 |
| Doğacan Güney (JIRA) |
[jira] Commented: (NUTCH-443) allow parsers to return multiple Parse object, this will speed up the rss parser |
Wed, 14 Feb, 15:53 |
| Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-443) allow parsers to return multiple Parse object, this will speed up the rss parser |
Wed, 14 Feb, 17:04 |
| Doğacan Güney (JIRA) |
[jira] Commented: (NUTCH-443) allow parsers to return multiple Parse object, this will speed up the rss parser |
Wed, 14 Feb, 17:34 |
| Sami Siren (JIRA) |
[jira] Commented: (NUTCH-443) allow parsers to return multiple Parse object, this will speed up the rss parser |
Wed, 14 Feb, 17:41 |
| Dennis Kubes |
Re: NPE in org.apache.hadoop.io.SequenceFile$Sorter$MergeQueue |
Wed, 14 Feb, 20:03 |
| Doğacan Güney (JIRA) |
[jira] Commented: (NUTCH-443) allow parsers to return multiple Parse object, this will speed up the rss parser |
Wed, 14 Feb, 20:33 |
| Doğacan Güney (JIRA) |
[jira] Updated: (NUTCH-443) allow parsers to return multiple Parse object, this will speed up the rss parser |
Wed, 14 Feb, 20:36 |
| Dennis Kubes (JIRA) |
[jira] Commented: (NUTCH-247) robot parser to restrict. |
Thu, 15 Feb, 03:18 |
| nutch-...@dragonflymc.com |
Injector checking for other than STATUS_INJECTED |
Thu, 15 Feb, 03:43 |
| Andrzej Bialecki |
Re: Injector checking for other than STATUS_INJECTED |
Thu, 15 Feb, 06:46 |
| Gal Nitzan |
RE: Injector checking for other than STATUS_INJECTED |
Thu, 15 Feb, 07:31 |
| Anton Potekhin |
RE: How to get score in search.jsp |
Thu, 15 Feb, 07:55 |
| Andrzej Bialecki |
Re: Injector checking for other than STATUS_INJECTED |
Thu, 15 Feb, 07:55 |
| Enis Soztutar (JIRA) |
=?utf-8?Q?[jira]_Created:_(NUTCH-445)_Domain_=C4=B0ndexing_/_Query_Filter?= |
Thu, 15 Feb, 10:35 |
| Enis Soztutar (JIRA) |
=?utf-8?Q?[jira]_Updated:_(NUTCH-445)_Domain_=C4=B0ndexing_/_Query_Filter?= |
Thu, 15 Feb, 10:37 |
| Enis Soztutar (JIRA) |
=?utf-8?Q?[jira]_Updated:_(NUTCH-445)_Domain_=C4=B0ndexing_/_Query_Filter?= |
Thu, 15 Feb, 10:39 |
| Doğacan Güney |
lib-http crawl-delay problem |
Thu, 15 Feb, 11:07 |
| rubdabadub |
Re: lib-http crawl-delay problem |
Thu, 15 Feb, 11:45 |
| Doðacan Güney |
Re: lib-http crawl-delay problem |
Thu, 15 Feb, 13:12 |
| rubdabadub |
Re: lib-http crawl-delay problem |
Thu, 15 Feb, 13:21 |
| Enis Soztutar (JIRA) |
=?utf-8?Q?[jira]_Updated:_(NUTCH-445)_Domain_=C4=B0ndexing_/_Query_Filter?= |
Thu, 15 Feb, 13:45 |
| Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-443) allow parsers to return multiple Parse object, this will speed up the rss parser |
Thu, 15 Feb, 13:55 |
| Doğacan Güney (JIRA) |
[jira] Commented: (NUTCH-443) allow parsers to return multiple Parse object, this will speed up the rss parser |
Thu, 15 Feb, 14:13 |
| ogjunk-nu...@yahoo.com |
Re: lib-http crawl-delay problem |
Thu, 15 Feb, 15:25 |
| Dennis Kubes |
Re: Injector checking for other than STATUS_INJECTED |
Thu, 15 Feb, 15:30 |
| Doğacan Güney (JIRA) |
[jira] Updated: (NUTCH-446) RobotRulesParser should ignore Crawl-delay values of other bots in robots.txt |
Thu, 15 Feb, 15:46 |
| Doğacan Güney (JIRA) |
[jira] Created: (NUTCH-446) RobotRulesParser should ignore Crawl-delay values of other bots in robots.txt |
Thu, 15 Feb, 15:46 |
| Andrzej Bialecki |
Re: Injector checking for other than STATUS_INJECTED |
Thu, 15 Feb, 16:08 |
| Dennis Kubes |
Re: Injector checking for other than STATUS_INJECTED |
Thu, 15 Feb, 23:34 |
| Chris A. Mattmann (JIRA) |
[jira] Work started: (NUTCH-443) allow parsers to return multiple Parse object, this will speed up the rss parser |
Fri, 16 Feb, 15:27 |
| Brian Whitman (JIRA) |
[jira] Commented: (NUTCH-432) JAVA_PLATFORM with spaces (i.e. Mac OS X-ppc-32) breaks bin/nutch script |
Fri, 16 Feb, 17:13 |
| Doğacan Güney (JIRA) |
[jira] Commented: (NUTCH-247) robot parser to restrict. |
Sat, 17 Feb, 06:49 |
| Dennis Kubes (JIRA) |
[jira] Assigned: (NUTCH-247) robot parser to restrict. |
Sun, 18 Feb, 07:53 |
| Dennis Kubes (JIRA) |
[jira] Updated: (NUTCH-247) robot parser to restrict. |
Sun, 18 Feb, 07:57 |
| Sami Siren (JIRA) |
[jira] Commented: (NUTCH-247) robot parser to restrict. |
Sun, 18 Feb, 12:28 |
| Dennis Kubes (JIRA) |
[jira] Commented: (NUTCH-247) robot parser to restrict. |
Mon, 19 Feb, 03:20 |
| Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-247) robot parser to restrict. |
Mon, 19 Feb, 07:58 |
| Dennis Kubes (JIRA) |
[jira] Updated: (NUTCH-247) robot parser to restrict. |
Mon, 19 Feb, 16:51 |
| Sami Siren (JIRA) |
[jira] Commented: (NUTCH-247) robot parser to restrict. |
Mon, 19 Feb, 19:14 |
| Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-247) robot parser to restrict. |
Mon, 19 Feb, 19:41 |
| Sami Siren (JIRA) |
[jira] Commented: (NUTCH-247) robot parser to restrict. |
Mon, 19 Feb, 20:34 |
| Dennis Kubes (JIRA) |
[jira] Commented: (NUTCH-247) robot parser to restrict. |
Tue, 20 Feb, 05:41 |
| Sami Siren (JIRA) |
[jira] Commented: (NUTCH-247) robot parser to restrict. |
Tue, 20 Feb, 15:12 |
| Thorsten Scherler |
Apache Droids - standalone crawl framework |
Tue, 20 Feb, 16:26 |
| Dennis Kubes (JIRA) |
[jira] Created: (NUTCH-447) Dmoz Structure Parser Tool |
Tue, 20 Feb, 21:03 |
| Dennis Kubes (JIRA) |
[jira] Updated: (NUTCH-447) Dmoz Structure Parser Tool |
Tue, 20 Feb, 21:05 |
| Renaud Richardet |
Re: Apache Droids - standalone crawl framework |
Tue, 20 Feb, 21:10 |
| rubdabadub |
Re: Apache Droids - standalone crawl framework |
Tue, 20 Feb, 22:04 |
| Renaud Richardet |
Re: Apache Droids - standalone crawl framework |
Tue, 20 Feb, 22:53 |
| Thorsten Scherler |
[Fwd: Re: Apache Droids - standalone crawl framework] |
Wed, 21 Feb, 00:24 |
| Otis Gospodnetic (JIRA) |
[jira] Commented: (NUTCH-447) Dmoz Structure Parser Tool |
Wed, 21 Feb, 09:26 |
| Dennis Kubes (JIRA) |
[jira] Commented: (NUTCH-447) Dmoz Structure Parser Tool |
Wed, 21 Feb, 14:29 |
| Dennis Kubes (JIRA) |
[jira] Created: (NUTCH-448) Allow Plugin Includes and Excludes from File |
Thu, 22 Feb, 07:19 |
| Dennis Kubes (JIRA) |
[jira] Updated: (NUTCH-448) Allow Plugin Includes and Excludes from File |
Thu, 22 Feb, 07:21 |
| Andrzej Bialecki |
Re: Creating a new scoring filter. |
Thu, 22 Feb, 16:31 |
| Andrzej Bialecki |
Performance optimization for Nutch index / query |
Fri, 23 Feb, 00:43 |
| Steve Severance |
RE: Performance optimization for Nutch index / query |
Fri, 23 Feb, 01:21 |
| Gal Nitzan |
Why not make SOLR the Nutch SE |
Fri, 23 Feb, 07:49 |
| Andrzej Bialecki |
Re: Performance optimization for Nutch index / query |
Fri, 23 Feb, 09:12 |
| Gal Nitzan |
SOLR |
Fri, 23 Feb, 15:13 |
| Brian Whitman |
Re: SOLR |
Fri, 23 Feb, 15:27 |
| Enis Soztutar |
Re: Performance optimization for Nutch index / query |
Fri, 23 Feb, 15:31 |
| rubdabadub |
Re: SOLR |
Fri, 23 Feb, 16:09 |
| cybercouf |
How to add data into segment with my own plugin ? |
Fri, 23 Feb, 16:22 |
| HUYLEBROECK Jeremy RD-ILAB-SSF |
RE: How to add data into segment with my own plugin ? |
Fri, 23 Feb, 17:58 |
| Doug Cutting |
Re: Performance optimization for Nutch index / query |
Fri, 23 Feb, 18:46 |
| Nicolás Lichtmaier |
Re: Re: Creating a new scoring filter. |
Fri, 23 Feb, 22:33 |
| Nigel Daley (JIRA) |
[jira] Created: (NUTCH-449) Format of junit output should be configurable |
Fri, 23 Feb, 22:49 |
| Nigel Daley (JIRA) |
[jira] Updated: (NUTCH-449) Format of junit output should be configurable |
Fri, 23 Feb, 22:49 |