| Doğacan Güney (JIRA) |
[jira] Updated: (NUTCH-446) RobotRulesParser should ignore Crawl-delay values of other bots in robots.txt |
Tue, 01 May, 08:42 |
|
[jira] Commented: (NUTCH-446) RobotRulesParser should ignore Crawl-delay values of other bots in robots.txt |
|
| Sami Siren (JIRA) |
[jira] Commented: (NUTCH-446) RobotRulesParser should ignore Crawl-delay values of other bots in robots.txt |
Tue, 01 May, 09:03 |
| Doğacan Güney (JIRA) |
[jira] Commented: (NUTCH-446) RobotRulesParser should ignore Crawl-delay values of other bots in robots.txt |
Thu, 10 May, 12:47 |
| hud...@lucene.zones.apache.org |
Build failed in Hudson: Nutch-Nightly #74 |
Thu, 03 May, 07:00 |
| simon_ece |
Nutch - Filtering (REGEX) |
Thu, 03 May, 07:36 |
| Andrzej Bialecki (JIRA) |
[jira] Created: (NUTCH-477) Extend URLFilters to support different filtering chains |
Thu, 03 May, 21:53 |
| Andrzej Bialecki (JIRA) |
[jira] Updated: (NUTCH-477) Extend URLFilters to support different filtering chains |
Thu, 03 May, 21:55 |
| hud...@lucene.zones.apache.org |
Hudson build is back to normal: Nutch-Nightly #75 |
Fri, 04 May, 07:05 |
| chee.wu (JIRA) |
[jira] Created: (NUTCH-478) Add function for stopping FetherThread gracefully |
Sat, 05 May, 06:27 |
| Brian Whitman |
SIGSEGV |
Sat, 05 May, 21:59 |
| Dennis Kubes |
Re: SIGSEGV |
Sat, 05 May, 22:39 |
| Andrzej Bialecki |
Re: SIGSEGV |
Sun, 06 May, 15:00 |
| Brian Whitman |
Re: SIGSEGV |
Sun, 06 May, 17:47 |
| Dennis Kubes |
Re: SIGSEGV |
Mon, 07 May, 13:07 |
| Brian Whitman |
Re: SIGSEGV |
Mon, 07 May, 22:34 |
| Brian Whitman |
Re: SIGSEGV |
Wed, 09 May, 18:10 |
|
[jira] Commented: (NUTCH-443) allow parsers to return multiple Parse object, this will speed up the rss parser |
|
| Antonio Eggberg (JIRA) |
[jira] Commented: (NUTCH-443) allow parsers to return multiple Parse object, this will speed up the rss parser |
Mon, 07 May, 06:06 |
| Doğacan Güney (JIRA) |
[jira] Commented: (NUTCH-443) allow parsers to return multiple Parse object, this will speed up the rss parser |
Sun, 13 May, 11:06 |
| Doğacan Güney (JIRA) |
[jira] Commented: (NUTCH-443) allow parsers to return multiple Parse object, this will speed up the rss parser |
Mon, 14 May, 17:52 |
| Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-443) allow parsers to return multiple Parse object, this will speed up the rss parser |
Mon, 14 May, 21:58 |
| mr_max |
How to install Nutch on Freebsd? |
Mon, 07 May, 07:51 |
| Nuther |
Re: How to install Nutch on Freebsd? |
Mon, 07 May, 06:59 |
| mr_max |
Re: How to install Nutch on Freebsd? |
Mon, 07 May, 08:11 |
| mr_max |
Who of most pages indexed by means of it nutch and how many? |
Mon, 07 May, 08:17 |
| mr_max |
And where it is possible to esteem about all opportunities nutch? |
Mon, 07 May, 08:20 |
| mr_max |
And if nutch it would be written on With С++ worked more quickly? |
Mon, 07 May, 08:21 |
| Vikas |
Scope-based crawling and indexing |
Mon, 07 May, 12:47 |
| Andrzej Bialecki (JIRA) |
[jira] Created: (NUTCH-479) Support for OR queries |
Mon, 07 May, 19:15 |
|
[jira] Updated: (NUTCH-479) Support for OR queries |
|
| Andrzej Bialecki (JIRA) |
[jira] Updated: (NUTCH-479) Support for OR queries |
Mon, 07 May, 19:18 |
| Nicolás Lichtmaier (JIRA) |
[jira] Updated: (NUTCH-479) Support for OR queries |
Wed, 09 May, 20:32 |
| Ravi Chintakunta (JIRA) |
[jira] Created: (NUTCH-480) Searching multiple indexes with a single nutch instance |
Tue, 08 May, 01:11 |
| Ravi Chintakunta (JIRA) |
[jira] Updated: (NUTCH-480) Searching multiple indexes with a single nutch instance |
Tue, 08 May, 01:13 |
| Bastian Preindl |
Document Classification - indexing question |
Tue, 08 May, 10:30 |
| Armel T. Nene |
RE: Document Classification - indexing question |
Tue, 08 May, 10:51 |
| Bastian Preindl |
Re: Document Classification - indexing question |
Tue, 08 May, 12:37 |
| Armel T. Nene |
RE: Document Classification - indexing question |
Tue, 08 May, 13:03 |
| hud...@lucene.zones.apache.org |
Build failed in Hudson: Nutch-Nightly #80 |
Wed, 09 May, 07:00 |
|
[jira] Updated: (NUTCH-443) allow parsers to return multiple Parse object, this will speed up the rss parser |
|
| Doğacan Güney (JIRA) |
[jira] Updated: (NUTCH-443) allow parsers to return multiple Parse object, this will speed up the rss parser |
Wed, 09 May, 08:47 |
| Doğacan Güney (JIRA) |
[jira] Updated: (NUTCH-443) allow parsers to return multiple Parse object, this will speed up the rss parser |
Sun, 13 May, 11:15 |
| Andrzej Bialecki (JIRA) |
[jira] Updated: (NUTCH-443) allow parsers to return multiple Parse object, this will speed up the rss parser |
Mon, 14 May, 14:52 |
| Doğacan Güney (JIRA) |
[jira] Updated: (NUTCH-443) allow parsers to return multiple Parse object, this will speed up the rss parser |
Mon, 14 May, 17:54 |
|
[jira] Commented: (NUTCH-470) Adding optional terms to a query |
|
| Ronny Næss (JIRA) |
[jira] Commented: (NUTCH-470) Adding optional terms to a query |
Wed, 09 May, 13:34 |
| Trond Andersen (JIRA) |
[jira] Commented: (NUTCH-470) Adding optional terms to a query |
Wed, 09 May, 13:49 |
|
Re: [jira] Updated: (NUTCH-469) changes to geoPosition plugin to make it work on nutch 0.9 |
|
| Mike Schwartz |
Re: [jira] Updated: (NUTCH-469) changes to geoPosition plugin to make it work on nutch 0.9 |
Wed, 09 May, 13:36 |
| Sami Siren (JIRA) |
[jira] Updated: (NUTCH-469) changes to geoPosition plugin to make it work on nutch 0.9 |
Wed, 09 May, 16:39 |
| Mike Schwartz |
Re: [jira] Updated: (NUTCH-469) changes to geoPosition plugin to make it work on nutch 0.9 |
Thu, 10 May, 14:47 |
| Manoharam Reddy |
how is crawl-urlfilter.txt taken care of? |
Wed, 09 May, 15:00 |
| Sami Siren |
Re: how is crawl-urlfilter.txt taken care of? |
Wed, 09 May, 17:58 |
| Sami Siren (JIRA) |
[jira] Updated: (NUTCH-469) changes to geoPosition plugin to make it work on nutch 0.9 |
Wed, 09 May, 16:55 |
| Sami Siren (JIRA) |
[jira] Commented: (NUTCH-477) Extend URLFilters to support different filtering chains |
Wed, 09 May, 17:16 |
|
[jira] Commented: (NUTCH-472) NullPointerException in ZipTextExtractor if no MIME type for zipped file |
|
| Sami Siren (JIRA) |
[jira] Commented: (NUTCH-472) NullPointerException in ZipTextExtractor if no MIME type for zipped file |
Wed, 09 May, 17:20 |
| Antony Bowesman (JIRA) |
[jira] Commented: (NUTCH-472) NullPointerException in ZipTextExtractor if no MIME type for zipped file |
Thu, 10 May, 06:50 |
| Sami Siren (JIRA) |
[jira] Commented: (NUTCH-472) NullPointerException in ZipTextExtractor if no MIME type for zipped file |
Sat, 12 May, 05:28 |
| Andrzej Bialecki (JIRA) |
[jira] Assigned: (NUTCH-443) allow parsers to return multiple Parse object, this will speed up the rss parser |
Wed, 09 May, 17:24 |
| Sami Siren (JIRA) |
[jira] Commented: (NUTCH-476) Would like to add a field to the document class for its MD5 signature |
Wed, 09 May, 17:42 |
| Andrzej Bialecki (JIRA) |
[jira] Resolved: (NUTCH-443) allow parsers to return multiple Parse object, this will speed up the rss parser |
Wed, 09 May, 18:03 |
| Andrzej Bialecki (JIRA) |
[jira] Resolved: (NUTCH-467) DeleteDuplicate fails if Segment index directory has 0 documents |
Wed, 09 May, 18:05 |
|
Re: svn commit: r536606 - in /lucene/nutch/trunk: ./ src/java/org/apache/nutch/fetcher/ src/java/org/apache/nutch/metadata/ src/java/org/apache/nutch/parse/ src/java/org/apache/nutch/util/ src/plugin/creativecommons/src/test/org/creativecommons/nutch/ src/... |
|
| Sami Siren |
Re: svn commit: r536606 - in /lucene/nutch/trunk: ./ src/java/org/apache/nutch/fetcher/ src/java/org/apache/nutch/metadata/ src/java/org/apache/nutch/parse/ src/java/org/apache/nutch/util/ src/plugin/creativecommons/src/test/org/creativecommons/nutch/ src/... |
Wed, 09 May, 18:21 |
| Andrzej Bialecki |
Re: svn commit: r536606 - in /lucene/nutch/trunk: ./ src/java/org/apache/nutch/fetcher/ src/java/org/apache/nutch/metadata/ src/java/org/apache/nutch/parse/ src/java/org/apache/nutch/util/ src/plugin/creativecommons/src/test/org/creativecommons/nutch/ src/... |
Wed, 09 May, 18:54 |
| Andrzej Bialecki (JIRA) |
[jira] Closed: (NUTCH-418) Fixes parsing of XHTML (e.g. title) |
Wed, 09 May, 18:40 |
| Andrzej Bialecki (JIRA) |
[jira] Closed: (NUTCH-417) After upgrade to hadoop-0.9.1, parsing and indexing doesn't work. |
Wed, 09 May, 18:44 |
| Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-393) Indexer doesn't handle null documents returned by filters |
Wed, 09 May, 18:51 |
| Andrzej Bialecki (JIRA) |
[jira] Resolved: (NUTCH-393) Indexer doesn't handle null documents returned by filters |
Wed, 09 May, 19:38 |
| karthik085 |
Recrawl help |
Wed, 09 May, 19:41 |
| Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-479) Support for OR queries |
Wed, 09 May, 21:48 |
| hud...@lucene.zones.apache.org |
Hudson build is back to normal: Nutch-Nightly #81 |
Thu, 10 May, 07:07 |