| Siddharth Jha (JIRA) |
[jira] Commented: (NUTCH-585) [PARSE-HTML plugin] Block certain parts of HTML code from being indexed |
Mon, 03 Mar, 17:14 |
| Siddharth Jha (JIRA) |
[jira] Created: (NUTCH-617) Cached Text Only |
Tue, 04 Mar, 08:45 |
| Andrzej Bialecki (JIRA) |
[jira] Closed: (NUTCH-617) Cached Text Only |
Tue, 04 Mar, 19:23 |
| Frederic Wenzel |
Nightly builds unavailable |
Wed, 05 Mar, 10:11 |
| Sami Siren |
Re: Nightly builds unavailable |
Wed, 05 Mar, 18:27 |
| Andrzej Bialecki (JIRA) |
[jira] Created: (NUTCH-618) Tika error "Media type alias already exists" |
Thu, 06 Mar, 07:17 |
|
[jira] Commented: (NUTCH-618) Tika error "Media type alias already exists" |
|
| Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-618) Tika error "Media type alias already exists" |
Fri, 07 Mar, 01:30 |
| Chris A. Mattmann (JIRA) |
[jira] Commented: (NUTCH-618) Tika error "Media type alias already exists" |
Fri, 07 Mar, 06:34 |
| Chris A. Mattmann (JIRA) |
[jira] Assigned: (NUTCH-618) Tika error "Media type alias already exists" |
Fri, 07 Mar, 06:32 |
| Chris A. Mattmann (JIRA) |
[jira] Work started: (NUTCH-618) Tika error "Media type alias already exists" |
Fri, 07 Mar, 06:34 |
| Euan Clark |
Confine nutch to one NIC? |
Sun, 09 Mar, 20:24 |
| ogjunk-nu...@yahoo.com |
Re: Confine nutch to one NIC? |
Tue, 11 Mar, 20:21 |
| dong chen |
I have some problem with nutch result |
Tue, 11 Mar, 05:34 |
|
[jira] Commented: (NUTCH-296) Image Search |
|
| Otis Gospodnetic (JIRA) |
[jira] Commented: (NUTCH-296) Image Search |
Wed, 12 Mar, 01:48 |
| Gordon Mohr (JIRA) |
[jira] Commented: (NUTCH-296) Image Search |
Mon, 31 Mar, 21:22 |
|
Problem in running Nutch where proxy authentication is required. |
|
| naveen.gosw...@wipro.com |
Problem in running Nutch where proxy authentication is required. |
Wed, 12 Mar, 16:09 |
| Susam Pal |
Re: Problem in running Nutch where proxy authentication is required. |
Fri, 14 Mar, 17:41 |
| naveen.gosw...@wipro.com |
Problem in running Nutch where proxy authentication is required. |
Wed, 12 Mar, 16:20 |
|
[jira] Commented: (NUTCH-616) Reset Fetch Retry counter when fetch is successful |
|
| Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-616) Reset Fetch Retry counter when fetch is successful |
Fri, 14 Mar, 12:13 |
| Hudson (JIRA) |
[jira] Commented: (NUTCH-616) Reset Fetch Retry counter when fetch is successful |
Tue, 18 Mar, 05:33 |
| Andrzej Bialecki (JIRA) |
[jira] Updated: (NUTCH-616) Reset Fetch Retry counter when fetch is successful |
Fri, 14 Mar, 13:27 |
| Andrzej Bialecki (JIRA) |
[jira] Assigned: (NUTCH-616) Reset Fetch Retry counter when fetch is successful |
Fri, 14 Mar, 13:29 |
|
[jira] Commented: (NUTCH-615) Redirected URL are fetched without setting any FetchInterval |
|
| Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-615) Redirected URL are fetched without setting any FetchInterval |
Fri, 14 Mar, 14:02 |
| Emmanuel Joke (JIRA) |
[jira] Commented: (NUTCH-615) Redirected URL are fetched without setting any FetchInterval |
Mon, 17 Mar, 02:45 |
| Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-615) Redirected URL are fetched without setting any FetchInterval |
Mon, 17 Mar, 10:01 |
| Hudson (JIRA) |
[jira] Commented: (NUTCH-615) Redirected URL are fetched without setting any FetchInterval |
Tue, 18 Mar, 05:33 |
| Andrzej Bialecki (JIRA) |
[jira] Closed: (NUTCH-613) Empty Summaries and Cached Pages |
Fri, 14 Mar, 14:24 |
|
[jira] Commented: (NUTCH-613) Empty Summaries and Cached Pages |
|
| Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-613) Empty Summaries and Cached Pages |
Fri, 14 Mar, 14:24 |
| Hudson (JIRA) |
[jira] Commented: (NUTCH-613) Empty Summaries and Cached Pages |
Sat, 15 Mar, 04:15 |
| Andrzej Bialecki (JIRA) |
[jira] Closed: (NUTCH-612) URL filtering is always disabled in Generator when invoked by Crawl |
Fri, 14 Mar, 14:38 |
|
[jira] Commented: (NUTCH-612) URL filtering is always disabled in Generator when invoked by Crawl |
|
| Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-612) URL filtering is always disabled in Generator when invoked by Crawl |
Fri, 14 Mar, 14:38 |
| Hudson (JIRA) |
[jira] Commented: (NUTCH-612) URL filtering is always disabled in Generator when invoked by Crawl |
Sat, 15 Mar, 04:15 |
| Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-610) Can't Update or modify an index while web gui is running |
Fri, 14 Mar, 14:44 |
| Andrzej Bialecki (JIRA) |
[jira] Closed: (NUTCH-601) Recrawling on existing crawl directory using force option |
Fri, 14 Mar, 14:54 |
|
[jira] Commented: (NUTCH-601) Recrawling on existing crawl directory using force option |
|
| Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-601) Recrawling on existing crawl directory using force option |
Fri, 14 Mar, 14:54 |
| Hudson (JIRA) |
[jira] Commented: (NUTCH-601) Recrawling on existing crawl directory using force option |
Sat, 15 Mar, 04:15 |
| Andrzej Bialecki (JIRA) |
[jira] Closed: (NUTCH-590) Index multiple docs per call using IndexingFilter extension point |
Fri, 14 Mar, 15:00 |
| Andrzej Bialecki (JIRA) |
[jira] Closed: (NUTCH-592) Fetcher2 : NPE for page with status ProtocolStatus.TEMP_MOVED |
Fri, 14 Mar, 15:00 |
| Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-590) Index multiple docs per call using IndexingFilter extension point |
Fri, 14 Mar, 15:00 |
| Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-592) Fetcher2 : NPE for page with status ProtocolStatus.TEMP_MOVED |
Fri, 14 Mar, 15:00 |
|
[jira] Commented: (NUTCH-575) NPE in OpenSearchServlet when summary is null |
|
| Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-575) NPE in OpenSearchServlet when summary is null |
Fri, 14 Mar, 15:10 |
| Jesiel Trevisan |
Re: [jira] Commented: (NUTCH-575) NPE in OpenSearchServlet when summary is null |
Fri, 14 Mar, 16:16 |
| Andrzej Bialecki |
Re: [jira] Commented: (NUTCH-575) NPE in OpenSearchServlet when summary is null |
Fri, 14 Mar, 17:24 |
| Hudson (JIRA) |
[jira] Commented: (NUTCH-575) NPE in OpenSearchServlet when summary is null |
Sat, 15 Mar, 04:15 |
| Andrzej Bialecki (JIRA) |
[jira] Closed: (NUTCH-575) NPE in OpenSearchServlet when summary is null |
Fri, 14 Mar, 15:10 |
| Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-566) Sun's URL class has bug in creation of relative query URLs |
Fri, 14 Mar, 23:34 |
| Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-556) automatic adjust the CrawlDatum.fetchInterval according to the number of newly outlinks |
Fri, 14 Mar, 23:38 |
|
[jira] Commented: (NUTCH-530) Add a combiner to improve performance on updatedb |
|
| Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-530) Add a combiner to improve performance on updatedb |
Fri, 14 Mar, 23:42 |
| Emmanuel Joke (JIRA) |
[jira] Commented: (NUTCH-530) Add a combiner to improve performance on updatedb |
Mon, 17 Mar, 02:59 |
| Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-70) duplicate pages - virtual hosts in db. |
Fri, 14 Mar, 23:58 |
| Andrzej Bialecki (JIRA) |
[jira] Closed: (NUTCH-70) duplicate pages - virtual hosts in db. |
Fri, 14 Mar, 23:58 |
|
[jira] Commented: (NUTCH-126) Fetching via https does not work with a proxy (patch) |
|
| Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-126) Fetching via https does not work with a proxy (patch) |
Sat, 15 Mar, 00:18 |
| Hudson (JIRA) |
[jira] Commented: (NUTCH-126) Fetching via https does not work with a proxy (patch) |
Sat, 15 Mar, 04:15 |
| Andrzej Bialecki (JIRA) |
[jira] Closed: (NUTCH-126) Fetching via https does not work with a proxy (patch) |
Sat, 15 Mar, 00:18 |
| Andrzej Bialecki (JIRA) |
[jira] Closed: (NUTCH-157) Problem during parsing msword document . It fetching properly but parsing is not working. Please show me the way how can i parse it |
Sat, 15 Mar, 00:20 |
| Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-157) Problem during parsing msword document . It fetching properly but parsing is not working. Please show me the way how can i parse it |
Sat, 15 Mar, 00:20 |
| Andrzej Bialecki (JIRA) |
[jira] Closed: (NUTCH-168) setting http.content.limit to -1 seems to break text parsing on some files |
Sat, 15 Mar, 00:22 |
| Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-168) setting http.content.limit to -1 seems to break text parsing on some files |
Sat, 15 Mar, 00:22 |