| All day coders |
Compilation errors at revision 638548 |
Tue, 18 Mar, 21:41 |
| All day coders |
Re: Why is Nutch not involved in Google Summer of Code - 2008? |
Sun, 23 Mar, 18:39 |
| All day coders |
Re: Why is Nutch not involved in Google Summer of Code - 2008? |
Mon, 24 Mar, 20:26 |
| Andrzej Bialecki |
Re: [jira] Commented: (NUTCH-575) NPE in OpenSearchServlet when summary is null |
Fri, 14 Mar, 17:24 |
| Andrzej Bialecki |
Retire the original Fetcher before the release? |
Mon, 17 Mar, 13:05 |
| Andrzej Bialecki |
Re: Retire the original Fetcher before the release? |
Mon, 17 Mar, 14:20 |
| Andrzej Bialecki |
Re: Retire the original Fetcher before the release? |
Mon, 17 Mar, 15:17 |
| Andrzej Bialecki |
Re: Current OPIC implementation |
Tue, 18 Mar, 09:18 |
| Andrzej Bialecki |
Re: Why is Nutch not involved in Google Summer of Code - 2008? |
Sun, 30 Mar, 15:56 |
| Andrzej Bialecki (JIRA) |
[jira] Closed: (NUTCH-617) Cached Text Only |
Tue, 04 Mar, 19:23 |
| Andrzej Bialecki (JIRA) |
[jira] Created: (NUTCH-618) Tika error "Media type alias already exists" |
Thu, 06 Mar, 07:17 |
| Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-618) Tika error "Media type alias already exists" |
Fri, 07 Mar, 01:30 |
| Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-616) Reset Fetch Retry counter when fetch is successful |
Fri, 14 Mar, 12:13 |
| Andrzej Bialecki (JIRA) |
[jira] Updated: (NUTCH-616) Reset Fetch Retry counter when fetch is successful |
Fri, 14 Mar, 13:27 |
| Andrzej Bialecki (JIRA) |
[jira] Assigned: (NUTCH-616) Reset Fetch Retry counter when fetch is successful |
Fri, 14 Mar, 13:29 |
| Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-615) Redirected URL are fetched wihtout setting any FetchInterval |
Fri, 14 Mar, 14:02 |
| Andrzej Bialecki (JIRA) |
[jira] Closed: (NUTCH-613) Empty Summaries and Cached Pages |
Fri, 14 Mar, 14:24 |
| Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-613) Empty Summaries and Cached Pages |
Fri, 14 Mar, 14:24 |
| Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-612) URL filtering is always disabled in Generator when invoked by Crawl |
Fri, 14 Mar, 14:38 |
| Andrzej Bialecki (JIRA) |
[jira] Closed: (NUTCH-612) URL filtering is always disabled in Generator when invoked by Crawl |
Fri, 14 Mar, 14:38 |
| Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-610) Can't Update or modify an index while web gui is running |
Fri, 14 Mar, 14:44 |
| Andrzej Bialecki (JIRA) |
[jira] Closed: (NUTCH-601) Recrawling on existing crawl directory using force option |
Fri, 14 Mar, 14:54 |
| Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-601) Recrawling on existing crawl directory using force option |
Fri, 14 Mar, 14:54 |
| Andrzej Bialecki (JIRA) |
[jira] Closed: (NUTCH-590) Index multiple docs per call using IndexingFilter extension point |
Fri, 14 Mar, 15:00 |
| Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-590) Index multiple docs per call using IndexingFilter extension point |
Fri, 14 Mar, 15:00 |
| Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-592) Fetcher2 : NPE for page with status ProtocolStatus.TEMP_MOVED |
Fri, 14 Mar, 15:00 |
| Andrzej Bialecki (JIRA) |
[jira] Closed: (NUTCH-592) Fetcher2 : NPE for page with status ProtocolStatus.TEMP_MOVED |
Fri, 14 Mar, 15:00 |
| Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-575) NPE in OpenSearchServlet when summary is null |
Fri, 14 Mar, 15:10 |
| Andrzej Bialecki (JIRA) |
[jira] Closed: (NUTCH-575) NPE in OpenSearchServlet when summary is null |
Fri, 14 Mar, 15:10 |
| Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-566) Sun's URL class has bug in creation of relative query URLs |
Fri, 14 Mar, 23:34 |
| Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-556) automatic adjust the CrawlDatum.fetchInterval according to the number of newly outlinks |
Fri, 14 Mar, 23:38 |
| Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-530) Add a combiner to improve performance on updatedb |
Fri, 14 Mar, 23:42 |
| Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-70) duplicate pages - virtual hosts in db. |
Fri, 14 Mar, 23:58 |
| Andrzej Bialecki (JIRA) |
[jira] Closed: (NUTCH-70) duplicate pages - virtual hosts in db. |
Fri, 14 Mar, 23:58 |
| Andrzej Bialecki (JIRA) |
[jira] Closed: (NUTCH-126) Fetching via https does not work with a proxy (patch) |
Sat, 15 Mar, 00:18 |
| Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-126) Fetching via https does not work with a proxy (patch) |
Sat, 15 Mar, 00:18 |
| Andrzej Bialecki (JIRA) |
[jira] Closed: (NUTCH-157) Problem during parsing msword document . It fetching properly but parsing is not working. Please show me the way how can i parse it |
Sat, 15 Mar, 00:20 |
| Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-157) Problem during parsing msword document . It fetching properly but parsing is not working. Please show me the way how can i parse it |
Sat, 15 Mar, 00:20 |
| Andrzej Bialecki (JIRA) |
[jira] Closed: (NUTCH-168) setting http.content.limit to -1 seems to break text parsing on some files |
Sat, 15 Mar, 00:22 |
| Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-168) setting http.content.limit to -1 seems to break text parsing on some files |
Sat, 15 Mar, 00:22 |
| Andrzej Bialecki (JIRA) |
[jira] Closed: (NUTCH-189) Injection infinite loop |
Sat, 15 Mar, 00:24 |
| Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-189) Injection infinite loop |
Sat, 15 Mar, 00:24 |
| Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-620) BasicURLNormalizer should collapse runs of slashes with a single slash |
Sun, 16 Mar, 20:06 |
| Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-615) Redirected URL are fetched wihtout setting any FetchInterval |
Mon, 17 Mar, 10:01 |
| Andrzej Bialecki (JIRA) |
[jira] Closed: (NUTCH-615) Redirected URL are fetched wihtout setting any FetchInterval |
Mon, 17 Mar, 12:35 |
| Andrzej Bialecki (JIRA) |
[jira] Closed: (NUTCH-616) Reset Fetch Retry counter when fetch is successful |
Mon, 17 Mar, 12:43 |
| Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-620) BasicURLNormalizer should collapse runs of slashes with a single slash |
Mon, 17 Mar, 13:23 |
| Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-220) PDF Box can't parse document: java.lang.NullPointerException |
Mon, 17 Mar, 16:23 |
| Andrzej Bialecki (JIRA) |
[jira] Closed: (NUTCH-220) PDF Box can't parse document: java.lang.NullPointerException |
Mon, 17 Mar, 16:23 |
| Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-223) Crawl.java uses Integer.MAX_VALUE for -topN where Generator.java uses Long.MAX_VALUE for -topN |
Mon, 17 Mar, 16:44 |
| Andrzej Bialecki (JIRA) |
[jira] Closed: (NUTCH-223) Crawl.java uses Integer.MAX_VALUE for -topN where Generator.java uses Long.MAX_VALUE for -topN |
Mon, 17 Mar, 16:44 |
| Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-243) Some meta-refresh urls get ignored due to matching regular expression |
Mon, 17 Mar, 16:50 |
| Andrzej Bialecki (JIRA) |
[jira] Closed: (NUTCH-243) Some meta-refresh urls get ignored due to matching regular expression |
Mon, 17 Mar, 16:50 |
| Andrzej Bialecki (JIRA) |
[jira] Closed: (NUTCH-610) Can't Update or modify an index while web gui is running |
Mon, 17 Mar, 16:52 |
| Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-598) Remove deprecated use of ToolBase, Migration to the new implementation |
Tue, 18 Mar, 10:05 |
| Andrzej Bialecki (JIRA) |
[jira] Updated: (NUTCH-598) Remove deprecated use of ToolBase, Migration to the new implementation |
Tue, 18 Mar, 10:05 |
| Andrzej Bialecki (JIRA) |
[jira] Updated: (NUTCH-598) Remove deprecated use of ToolBase, Migration to the new implementation |
Tue, 18 Mar, 10:05 |
| Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-609) Allow Plugins to be Loaded from Jar File(s) |
Tue, 18 Mar, 14:51 |
| Andrzej Bialecki (JIRA) |
[jira] Closed: (NUTCH-598) Remove deprecated use of ToolBase, Migration to the new implementation |
Wed, 19 Mar, 10:34 |
| Andrzej Bialecki (JIRA) |
[jira] Closed: (NUTCH-620) BasicURLNormalizer should collapse runs of slashes with a single slash |
Wed, 19 Mar, 10:46 |
| Apache Hudson Server |
Build failed in Hudson: Nutch-trunk #393 |
Tue, 18 Mar, 05:34 |
| Apache Hudson Server |
Hudson build is back to normal: Nutch-trunk #394 |
Wed, 19 Mar, 06:33 |
| Apache Hudson Server |
Build failed in Hudson: Nutch-trunk #396 |
Fri, 21 Mar, 16:46 |
| Apache Hudson Server |
Hudson build is back to normal: Nutch-trunk #397 |
Sat, 22 Mar, 06:10 |
| Apache Hudson Server |
Build failed in Hudson: Nutch-trunk #398 |
Sun, 23 Mar, 04:53 |
| Apache Hudson Server |
Build failed in Hudson: Nutch-trunk #399 |
Mon, 24 Mar, 09:57 |
| Apache Hudson Server |
Build failed in Hudson: Nutch-trunk #400 |
Tue, 25 Mar, 07:56 |
| Apache Hudson Server |
Hudson build is back to normal: Nutch-trunk #401 |
Wed, 26 Mar, 05:09 |
| Apache Hudson Server |
Build failed in Hudson: Nutch-trunk #404 |
Sat, 29 Mar, 07:09 |
| Apache Hudson Server |
Build failed in Hudson: Nutch-trunk #405 |
Sun, 30 Mar, 05:05 |
| Apache Hudson Server |
Hudson build is back to normal: Nutch-trunk #406 |
Mon, 31 Mar, 08:59 |
| Bobby Hubbard (JIRA) |
[jira] Created: (NUTCH-622) Support for application/x-suggestions+json |
Wed, 26 Mar, 15:11 |
| Chen, Tao |
siteinfo.xml |
Sun, 30 Mar, 02:52 |
| Chris A. Mattmann (JIRA) |
[jira] Assigned: (NUTCH-618) Tika error "Media type alias already exists" |
Fri, 07 Mar, 06:32 |
| Chris A. Mattmann (JIRA) |
[jira] Commented: (NUTCH-618) Tika error "Media type alias already exists" |
Fri, 07 Mar, 06:34 |
| Chris A. Mattmann (JIRA) |
[jira] Work started: (NUTCH-618) Tika error "Media type alias already exists" |
Fri, 07 Mar, 06:34 |
| Dennis Kubes |
Re: Retire the original Fetcher before the release? |
Mon, 17 Mar, 14:01 |
| Dennis Kubes |
Re: Retire the original Fetcher before the release? |
Mon, 17 Mar, 14:36 |
| Dennis Kubes |
Re: Why is Nutch not involved in Google Summer of Code - 2008? |
Sun, 30 Mar, 15:25 |
| Dennis Kubes |
Re: Why is Nutch not involved in Google Summer of Code - 2008? |
Mon, 31 Mar, 00:04 |
| Dennis Kubes (JIRA) |
[jira] Updated: (NUTCH-609) Allow Plugins to be Loaded from Jar File(s) |
Mon, 31 Mar, 05:04 |
| Dennis Kubes (JIRA) |
[jira] Resolved: (NUTCH-555) StackOverflowError in DomContentUtils |
Mon, 31 Mar, 05:14 |
| Dennis Kubes (JIRA) |
[jira] Assigned: (NUTCH-500) Add hadoop masters configuration file into conf folder |
Mon, 31 Mar, 05:16 |
| Dennis Kubes (JIRA) |
[jira] Closed: (NUTCH-555) StackOverflowError in DomContentUtils |
Mon, 31 Mar, 05:16 |
| Dennis Kubes (JIRA) |
[jira] Resolved: (NUTCH-447) Dmoz Structure Parser Tool |
Mon, 31 Mar, 05:18 |
| Dennis Kubes (JIRA) |
[jira] Closed: (NUTCH-447) Dmoz Structure Parser Tool |
Mon, 31 Mar, 05:18 |
| Dennis Kubes (JIRA) |
[jira] Assigned: (NUTCH-249) black- white list url filtering |
Mon, 31 Mar, 05:22 |
| Dennis Kubes (JIRA) |
[jira] Assigned: (NUTCH-295) More description for fetcher.threads.fetch property |
Mon, 31 Mar, 05:24 |
| Dennis Kubes (JIRA) |
[jira] Assigned: (NUTCH-291) OpenSearchServlet should return "date" as well as "lastModified" |
Mon, 31 Mar, 05:24 |
| Dennis Kubes (JIRA) |
[jira] Assigned: (NUTCH-213) checkstyle |
Mon, 31 Mar, 05:24 |
| Dennis Kubes (JIRA) |
[jira] Closed: (NUTCH-75) Patch for WebDBReader to get more detailed information about WebDBs |
Mon, 31 Mar, 05:26 |
| Dennis Kubes (JIRA) |
[jira] Assigned: (NUTCH-48) "Did you mean" query enhancement/refignment feature request |
Mon, 31 Mar, 05:28 |
| Dennis Kubes (JIRA) |
[jira] Assigned: (NUTCH-16) boost documents matching a url pattern |
Mon, 31 Mar, 05:30 |
| Dennis Kubes (JIRA) |
[jira] Updated: (NUTCH-500) Add hadoop masters configuration file into conf folder |
Mon, 31 Mar, 19:52 |
| Dennis Kubes (JIRA) |
[jira] Commented: (NUTCH-500) Add hadoop masters configuration file into conf folder |
Mon, 31 Mar, 20:00 |
| Emmanuel Joke (JIRA) |
[jira] Commented: (NUTCH-615) Redirected URL are fetched wihtout setting any FetchInterval |
Mon, 17 Mar, 02:45 |
| Emmanuel Joke (JIRA) |
[jira] Commented: (NUTCH-530) Add a combiner to improve performance on updatedb |
Mon, 17 Mar, 02:59 |
| Euan Clark |
Confine nutch to one NIC? |
Sun, 09 Mar, 20:24 |
| Frederic Wenzel |
Nightly builds unavailable |
Wed, 05 Mar, 10:11 |
| Gordon Mohr (JIRA) |
[jira] Commented: (NUTCH-296) Image Search |
Mon, 31 Mar, 21:22 |