|
Re: refetching interval |
|
| YourSoft |
Re: refetching interval |
Thu, 01 Jun, 10:02 |
| YourSoft |
webgraph |
Thu, 01 Jun, 10:02 |
| Stefan Groschupf (JIRA) |
[jira] Created: (NUTCH-293) support for Crawl-delay in Robots.txt |
Thu, 01 Jun, 17:24 |
| Stefan Groschupf (JIRA) |
[jira] Updated: (NUTCH-293) support for Crawl-delay in Robots.txt |
Thu, 01 Jun, 17:26 |
| Stefan Groschupf (JIRA) |
[jira] Commented: (NUTCH-289) CrawlDatum should store IP address |
Thu, 01 Jun, 18:41 |
| Teruhiko Kurosaka |
how to turn on logging, excersize analyzer, tips on debugging plugins? |
Thu, 01 Jun, 20:01 |
| Teruhiko Kurosaka |
i18n in nutch home page is misnomor |
Thu, 01 Jun, 21:54 |
| Stefan Neufeind (JIRA) |
[jira] Created: (NUTCH-294) Topic-maps of related searchwords |
Fri, 02 Jun, 06:56 |
|
[jira] Commented: (NUTCH-282) Showing too few results on a page (Paging not correct) |
|
| Stefan Groschupf (JIRA) |
[jira] Commented: (NUTCH-282) Showing too few results on a page (Paging not correct) |
Fri, 02 Jun, 15:08 |
| Stefan Neufeind (JIRA) |
[jira] Commented: (NUTCH-282) Showing too few results on a page (Paging not correct) |
Fri, 02 Jun, 16:19 |
|
[jira] Commented: (NUTCH-286) Handling common error-pages as 404 |
|
| Stefan Groschupf (JIRA) |
[jira] Commented: (NUTCH-286) Handling common error-pages as 404 |
Fri, 02 Jun, 15:13 |
| Stefan Neufeind (JIRA) |
[jira] Commented: (NUTCH-286) Handling common error-pages as 404 |
Fri, 02 Jun, 16:23 |
| Stefan Groschupf (JIRA) |
[jira] Commented: (NUTCH-288) hitsPerSite-functionality "flawed": problems writing a page-navigation |
Fri, 02 Jun, 15:20 |
| Stefan Groschupf (JIRA) |
[jira] Commented: (NUTCH-292) OpenSearchServlet: OutOfMemoryError: Java heap space |
Fri, 02 Jun, 15:31 |
|
[jira] Commented: (NUTCH-291) OpenSearchServlet should return "date" as well as "lastModified" |
|
| Stefan Groschupf (JIRA) |
[jira] Commented: (NUTCH-291) OpenSearchServlet should return "date" as well as "lastModified" |
Fri, 02 Jun, 15:39 |
| Stefan Neufeind (JIRA) |
[jira] Commented: (NUTCH-291) OpenSearchServlet should return "date" as well as "lastModified" |
Fri, 02 Jun, 16:25 |
|
[jira] Commented: (NUTCH-290) parse-pdf: Garbage indexed when text-extraction not allowed |
|
| Stefan Groschupf (JIRA) |
[jira] Commented: (NUTCH-290) parse-pdf: Garbage indexed when text-extraction not allowed |
Fri, 02 Jun, 15:45 |
| Stefan Neufeind (JIRA) |
[jira] Commented: (NUTCH-290) parse-pdf: Garbage indexed when text-extraction not allowed |
Fri, 02 Jun, 16:13 |
| Stefan Groschupf (JIRA) |
[jira] Commented: (NUTCH-290) parse-pdf: Garbage indexed when text-extraction not allowed |
Fri, 02 Jun, 16:32 |
| Stefan Neufeind (JIRA) |
[jira] Commented: (NUTCH-290) parse-pdf: Garbage indexed when text-extraction not allowed |
Fri, 02 Jun, 16:54 |
| Stefan Neufeind (JIRA) |
[jira] Updated: (NUTCH-292) OpenSearchServlet: OutOfMemoryError: Java heap space |
Fri, 02 Jun, 15:51 |
| Stefan Groschupf (JIRA) |
[jira] Closed: (NUTCH-287) Exception when searching with sort |
Fri, 02 Jun, 15:55 |
| Stefan Groschupf (JIRA) |
[jira] Closed: (NUTCH-284) NullPointerException during index |
Fri, 02 Jun, 15:57 |
| Stefan Groschupf (JIRA) |
[jira] Commented: (NUTCH-284) NullPointerException during index |
Fri, 02 Jun, 15:59 |
| Stefan Groschupf (JIRA) |
[jira] Commented: (NUTCH-281) cached.jsp: base-href needs to be outside comments |
Fri, 02 Jun, 15:59 |
|
[jira] Commented: (NUTCH-275) Fetcher not parsing XHTML-pages at all |
|
| Stefan Groschupf (JIRA) |
[jira] Commented: (NUTCH-275) Fetcher not parsing XHTML-pages at all |
Fri, 02 Jun, 16:07 |
| Stefan Neufeind (JIRA) |
[jira] Commented: (NUTCH-275) Fetcher not parsing XHTML-pages at all |
Fri, 02 Jun, 16:46 |
| Jerome Charron (JIRA) |
[jira] Commented: (NUTCH-275) Fetcher not parsing XHTML-pages at all |
Wed, 07 Jun, 11:49 |
| Stefan Groschupf (JIRA) |
[jira] Commented: (NUTCH-274) Empty row in/at end of URL-list results in error |
Fri, 02 Jun, 16:13 |
| Stefan Groschupf (JIRA) |
[jira] Updated: (NUTCH-274) Empty row in/at end of URL-list results in error |
Fri, 02 Jun, 16:25 |
| Stefan Groschupf (JIRA) |
[jira] Resolved: (NUTCH-282) Showing too few results on a page (Paging not correct) |
Fri, 02 Jun, 16:34 |
| Stefan Groschupf (JIRA) |
[jira] Closed: (NUTCH-286) Handling common error-pages as 404 |
Fri, 02 Jun, 16:36 |
| Dennis Kubes (JIRA) |
[jira] Created: (NUTCH-295) More description for fetcher.threads.fetch property |
Fri, 02 Jun, 16:58 |
| Dennis Kubes (JIRA) |
[jira] Updated: (NUTCH-295) More description for fetcher.threads.fetch property |
Fri, 02 Jun, 17:00 |
| Thomas Delnoij (JIRA) |
[jira] Created: (NUTCH-296) Image Search |
Sat, 03 Jun, 16:53 |
| Thomas Delnoij (JIRA) |
[jira] Updated: (NUTCH-296) Image Search |
Sat, 03 Jun, 17:05 |
| Stefan Groschupf (JIRA) |
[jira] Created: (NUTCH-297) sandbox svn folder |
Sat, 03 Jun, 17:13 |
|
[jira] Commented: (NUTCH-294) Topic-maps of related searchwords |
|
| Chris A. Mattmann (JIRA) |
[jira] Commented: (NUTCH-294) Topic-maps of related searchwords |
Sat, 03 Jun, 17:59 |
| Stefan Neufeind (JIRA) |
[jira] Commented: (NUTCH-294) Topic-maps of related searchwords |
Sun, 04 Jun, 17:09 |
| Dawid Weiss (JIRA) |
[jira] Commented: (NUTCH-294) Topic-maps of related searchwords |
Tue, 06 Jun, 14:25 |
| Stefan Neufeind (JIRA) |
[jira] Commented: (NUTCH-294) Topic-maps of related searchwords |
Tue, 06 Jun, 14:32 |
| Dawid Weiss (JIRA) |
[jira] Commented: (NUTCH-294) Topic-maps of related searchwords |
Wed, 07 Jun, 07:35 |
|
[jira] Commented: (NUTCH-258) Once Nutch logs a SEVERE log item, Nutch fails forevermore |
|
| Chris A. Mattmann (JIRA) |
[jira] Commented: (NUTCH-258) Once Nutch logs a SEVERE log item, Nutch fails forevermore |
Sat, 03 Jun, 18:10 |
| Stefan Neufeind (JIRA) |
[jira] Commented: (NUTCH-258) Once Nutch logs a SEVERE log item, Nutch fails forevermore |
Sun, 04 Jun, 15:53 |
| Scott Ganyo (JIRA) |
[jira] Commented: (NUTCH-258) Once Nutch logs a SEVERE log item, Nutch fails forevermore |
Mon, 05 Jun, 14:00 |
| Chris Mattmann |
Re: [jira] Commented: (NUTCH-258) Once Nutch logs a SEVERE log item, Nutch fails forevermore |
Mon, 05 Jun, 17:01 |
| Andrzej Bialecki |
Re: [jira] Commented: (NUTCH-258) Once Nutch logs a SEVERE log item, Nutch fails forevermore |
Mon, 05 Jun, 17:34 |
| Scott Ganyo |
Re: [jira] Commented: (NUTCH-258) Once Nutch logs a SEVERE log item, Nutch fails forevermore |
Mon, 05 Jun, 17:53 |
| Stefan Groschupf (JIRA) |
[jira] Commented: (NUTCH-258) Once Nutch logs a SEVERE log item, Nutch fails forevermore |
Mon, 05 Jun, 14:11 |
| Chris Mattmann |
Re: [jira] Commented: (NUTCH-258) Once Nutch logs a SEVERE log item, Nutch fails forevermore |
Mon, 05 Jun, 15:20 |
| Andrzej Bialecki |
Re: [jira] Commented: (NUTCH-258) Once Nutch logs a SEVERE log item, Nutch fails forevermore |
Mon, 05 Jun, 16:50 |
| Stefan Groschupf |
Re: [jira] Commented: (NUTCH-258) Once Nutch logs a SEVERE log item, Nutch fails forevermore |
Mon, 05 Jun, 17:47 |
| Jerome Charron (JIRA) |
[jira] Commented: (NUTCH-258) Once Nutch logs a SEVERE log item, Nutch fails forevermore |
Tue, 13 Jun, 10:42 |
| Chris A. Mattmann (JIRA) |
[jira] Commented: (NUTCH-258) Once Nutch logs a SEVERE log item, Nutch fails forevermore |
Thu, 15 Jun, 19:09 |
| Jerome Charron (JIRA) |
[jira] Commented: (NUTCH-258) Once Nutch logs a SEVERE log item, Nutch fails forevermore |
Fri, 16 Jun, 14:32 |
| Chris A. Mattmann (JIRA) |
[jira] Assigned: (NUTCH-236) PdfParser and RSSParser Log4j appender redirection |
Sat, 03 Jun, 18:16 |
| Chris A. Mattmann (JIRA) |
[jira] Commented: (NUTCH-236) PdfParser and RSSParser Log4j appender redirection |
Sat, 03 Jun, 18:18 |
|
[jira] Updated: (NUTCH-236) PdfParser and RSSParser Log4j appender redirection |
|
| Chris A. Mattmann (JIRA) |
[jira] Updated: (NUTCH-236) PdfParser and RSSParser Log4j appender redirection |
Sat, 03 Jun, 18:18 |
| Chris A. Mattmann (JIRA) |
[jira] Updated: (NUTCH-236) PdfParser and RSSParser Log4j appender redirection |
Fri, 09 Jun, 04:08 |
| Chris A. Mattmann (JIRA) |
[jira] Updated: (NUTCH-187) Cannot start Nutch datanodes on Windows outside of a cygwin environment because of DF |
Sat, 03 Jun, 18:44 |
| Stefan Groschupf (JIRA) |
[jira] Created: (NUTCH-298) if a 404 for a robots.txt is returned no page is fetched at all from the host |
Sat, 03 Jun, 19:44 |
| Stefan Groschupf (JIRA) |
[jira] Updated: (NUTCH-298) if a 404 for a robots.txt is returned no page is fetched at all from the host |
Sat, 03 Jun, 19:53 |
| Stefan Groschupf |
RobotRuleSet |
Sat, 03 Jun, 19:58 |
| Hasan Diwan (JIRA) |
[jira] Created: (NUTCH-299) Bittorrent Parser |
Sat, 03 Jun, 23:04 |
| Hasan Diwan (JIRA) |
[jira] Updated: (NUTCH-299) Bittorrent Parser |
Sat, 03 Jun, 23:07 |
|
[jira] Commented: (NUTCH-299) Bittorrent Parser |
|
| Stefan Neufeind (JIRA) |
[jira] Commented: (NUTCH-299) Bittorrent Parser |
Sun, 04 Jun, 14:15 |
| Hasan Diwan (JIRA) |
[jira] Commented: (NUTCH-299) Bittorrent Parser |
Sun, 04 Jun, 16:04 |