|
[jira] Updated: (NUTCH-559) NTLM, Basic and Digest Authentication schemes for web/proxy server |
|
| Susam Pal (JIRA) |
[jira] Updated: (NUTCH-559) NTLM, Basic and Digest Authentication schemes for web/proxy server |
Thu, 01 Nov, 13:12 |
| Susam Pal (JIRA) |
[jira] Updated: (NUTCH-559) NTLM, Basic and Digest Authentication schemes for web/proxy server |
Wed, 28 Nov, 18:29 |
|
[jira] Commented: (NUTCH-442) Integrate Solr/Nutch |
|
| Doğacan Güney (JIRA) |
[jira] Commented: (NUTCH-442) Integrate Solr/Nutch |
Thu, 01 Nov, 13:45 |
| Otis Gospodnetic (JIRA) |
[jira] Commented: (NUTCH-442) Integrate Solr/Nutch |
Sun, 18 Nov, 22:30 |
| Doğacan Güney (JIRA) |
[jira] Commented: (NUTCH-442) Integrate Solr/Nutch |
Mon, 19 Nov, 21:04 |
| Tomislav Poljak (JIRA) |
[jira] Commented: (NUTCH-442) Integrate Solr/Nutch |
Wed, 28 Nov, 10:14 |
| Doğacan Güney (JIRA) |
[jira] Commented: (NUTCH-442) Integrate Solr/Nutch |
Thu, 29 Nov, 08:07 |
| Ned Rockson |
When is the Clause.getQuery().getBoost == 0? |
Thu, 01 Nov, 21:32 |
| Andrzej Bialecki |
Re: When is the Clause.getQuery().getBoost == 0? |
Thu, 01 Nov, 21:47 |
|
Re: plugin analyzer |
|
| karthik085 |
Re: plugin analyzer |
Fri, 02 Nov, 03:08 |
| karthik085 |
Nutch automatically deleting sites from search results |
Fri, 02 Nov, 03:27 |
| Joseph Chen (JIRA) |
[jira] Created: (NUTCH-571) parse-mp3 plugin doesn't always index album of mp3 |
Sat, 03 Nov, 02:25 |
|
Re: How to extract specified information from html? |
|
| qi wu |
Re: How to extract specified information from html? |
Sat, 03 Nov, 13:56 |
| jqq |
Re: How to extract specified information from html? |
Sat, 03 Nov, 14:06 |
| Xin Zhang |
How does the Nutch-0.9 read the configuration file? |
Sun, 04 Nov, 11:30 |
| eyal edri |
Re: How does the Nutch-0.9 read the configuration file? |
Sun, 04 Nov, 12:23 |
| Dennis Kubes |
JIRA emails and Nutch |
Sun, 04 Nov, 15:48 |
| Andrzej Bialecki |
Re: JIRA emails and Nutch |
Sun, 04 Nov, 18:36 |
| Doğacan Güney |
Re: JIRA emails and Nutch |
Mon, 05 Nov, 13:19 |
| Dennis Kubes |
Re: JIRA emails and Nutch |
Mon, 05 Nov, 16:31 |
| Dennis Kubes |
Re: JIRA emails and Nutch |
Wed, 07 Nov, 16:38 |
|
[jira] Issue Comment Edited: (NUTCH-356) Plugin repository cache can lead to memory leak |
|
| Sam Xia (JIRA) |
[jira] Issue Comment Edited: (NUTCH-356) Plugin repository cache can lead to memory leak |
Tue, 06 Nov, 18:55 |
| Sam Xia (JIRA) |
[jira] Issue Comment Edited: (NUTCH-356) Plugin repository cache can lead to memory leak |
Tue, 06 Nov, 18:57 |
| Sam Xia (JIRA) |
[jira] Issue Comment Edited: (NUTCH-356) Plugin repository cache can lead to memory leak |
Tue, 06 Nov, 18:57 |
| Sam Xia (JIRA) |
[jira] Issue Comment Edited: (NUTCH-356) Plugin repository cache can lead to memory leak |
Tue, 06 Nov, 18:57 |
| Sam Xia (JIRA) |
[jira] Issue Comment Edited: (NUTCH-356) Plugin repository cache can lead to memory leak |
Tue, 06 Nov, 18:59 |
| n..@bcit |
adding dmoz meta data to index. |
Tue, 06 Nov, 19:29 |
| Sebastian Steinmetz |
Re: adding dmoz meta data to index. |
Wed, 07 Nov, 14:10 |
| Ned Rockson |
Tika API |
Tue, 06 Nov, 22:47 |
| Chris Mattmann |
Re: Tika API |
Wed, 07 Nov, 01:18 |
| Ned Rockson |
Re: Tika API |
Wed, 07 Nov, 01:56 |
| Dennis Kubes |
Re: Tika API |
Wed, 07 Nov, 03:25 |
| Chris Mattmann |
Re: Tika API |
Wed, 07 Nov, 03:05 |
| Ned Rockson |
Re: Tika API |
Wed, 07 Nov, 19:13 |
| karthik085 |
MD5 vs TextProfile Signature |
Wed, 07 Nov, 00:27 |
| Rajasekar Karthik (JIRA) |
[jira] Created: (NUTCH-573) Multiple Domains - Query Search |
Wed, 07 Nov, 18:59 |
|
[jira] Commented: (NUTCH-547) Redirection handling: YahooSlurp's algorithm |
|
| Dennis Kubes (JIRA) |
[jira] Commented: (NUTCH-547) Redirection handling: YahooSlurp's algorithm |
Wed, 07 Nov, 19:22 |
| Doğacan Güney (JIRA) |
[jira] Commented: (NUTCH-547) Redirection handling: YahooSlurp's algorithm |
Wed, 07 Nov, 20:05 |
| Dennis Kubes (JIRA) |
[jira] Commented: (NUTCH-547) Redirection handling: YahooSlurp's algorithm |
Wed, 07 Nov, 20:30 |
| Chris A. Mattmann (JIRA) |
[jira] Commented: (NUTCH-547) Redirection handling: YahooSlurp's algorithm |
Wed, 07 Nov, 20:36 |
| Hudson (JIRA) |
[jira] Commented: (NUTCH-547) Redirection handling: YahooSlurp's algorithm |
Fri, 09 Nov, 05:38 |
| Dennis Kubes (JIRA) |
[jira] Updated: (NUTCH-547) Redirection handling: YahooSlurp's algorithm |
Wed, 07 Nov, 19:24 |
|
[jira] Commented: (NUTCH-572) Scoring and redirected Urls |
|
| Dennis Kubes (JIRA) |
[jira] Commented: (NUTCH-572) Scoring and redirected Urls |
Wed, 07 Nov, 19:36 |
| Doğacan Güney (JIRA) |
[jira] Commented: (NUTCH-572) Scoring and redirected Urls |
Wed, 07 Nov, 20:26 |
| Dennis Kubes (JIRA) |
[jira] Created: (NUTCH-574) Including inlink anchor text in index can create irrelevant search results. |
Wed, 07 Nov, 19:45 |
| Doğacan Güney (JIRA) |
[jira] Commented: (NUTCH-356) Plugin repository cache can lead to memory leak |
Wed, 07 Nov, 19:58 |
|
[jira] Updated: (NUTCH-574) Including inlink anchor text in index can create irrelevant search results. |
|
| Dennis Kubes (JIRA) |
[jira] Updated: (NUTCH-574) Including inlink anchor text in index can create irrelevant search results. |
Wed, 07 Nov, 20:13 |
| Dennis Kubes (JIRA) |
[jira] Updated: (NUTCH-574) Including inlink anchor text in index can create irrelevant search results. |
Sat, 10 Nov, 16:29 |
| Dennis Kubes (JIRA) |
[jira] Updated: (NUTCH-574) Including inlink anchor text in index can create irrelevant search results. |
Sat, 10 Nov, 18:10 |
| Dennis Kubes (JIRA) |
[jira] Updated: (NUTCH-574) Including inlink anchor text in index can create irrelevant search results. |
Mon, 12 Nov, 23:01 |
|
[jira] Commented: (NUTCH-574) Including inlink anchor text in index can create irrelevant search results. |
|
| Doğacan Güney (JIRA) |
[jira] Commented: (NUTCH-574) Including inlink anchor text in index can create irrelevant search results. |
Wed, 07 Nov, 20:13 |
| Dennis Kubes (JIRA) |
[jira] Commented: (NUTCH-574) Including inlink anchor text in index can create irrelevant search results. |
Wed, 07 Nov, 20:20 |
| Chris A. Mattmann (JIRA) |
[jira] Commented: (NUTCH-574) Including inlink anchor text in index can create irrelevant search results. |
Wed, 07 Nov, 20:24 |
| Dennis Kubes (JIRA) |
[jira] Commented: (NUTCH-574) Including inlink anchor text in index can create irrelevant search results. |
Wed, 07 Nov, 20:51 |
| Enis Soztutar (JIRA) |
[jira] Commented: (NUTCH-574) Including inlink anchor text in index can create irrelevant search results. |
Fri, 09 Nov, 14:19 |
| Dennis Kubes (JIRA) |
[jira] Commented: (NUTCH-574) Including inlink anchor text in index can create irrelevant search results. |
Fri, 09 Nov, 15:26 |
| Enis Soztutar (JIRA) |
[jira] Commented: (NUTCH-574) Including inlink anchor text in index can create irrelevant search results. |
Fri, 09 Nov, 15:54 |
| Dennis Kubes (JIRA) |
[jira] Commented: (NUTCH-574) Including inlink anchor text in index can create irrelevant search results. |
Fri, 09 Nov, 17:05 |
| Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-574) Including inlink anchor text in index can create irrelevant search results. |
Fri, 09 Nov, 20:25 |
| Matt Kangas |
Re: [jira] Commented: (NUTCH-574) Including inlink anchor text in index can create irrelevant search results. |
Fri, 09 Nov, 20:45 |
| Dennis Kubes (JIRA) |
[jira] Commented: (NUTCH-574) Including inlink anchor text in index can create irrelevant search results. |
Sat, 10 Nov, 04:39 |
| Doğacan Güney (JIRA) |
[jira] Commented: (NUTCH-574) Including inlink anchor text in index can create irrelevant search results. |
Sat, 10 Nov, 17:51 |
| Doğacan Güney (JIRA) |
[jira] Commented: (NUTCH-574) Including inlink anchor text in index can create irrelevant search results. |
Sat, 10 Nov, 20:08 |
| Enis Soztutar (JIRA) |
[jira] Commented: (NUTCH-574) Including inlink anchor text in index can create irrelevant search results. |
Sun, 11 Nov, 14:36 |
| Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-574) Including inlink anchor text in index can create irrelevant search results. |
Sun, 11 Nov, 19:33 |
| Dennis Kubes (JIRA) |
[jira] Commented: (NUTCH-574) Including inlink anchor text in index can create irrelevant search results. |
Mon, 12 Nov, 23:03 |
| Doğacan Güney (JIRA) |
[jira] Commented: (NUTCH-574) Including inlink anchor text in index can create irrelevant search results. |
Tue, 13 Nov, 10:50 |
| Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-574) Including inlink anchor text in index can create irrelevant search results. |
Tue, 13 Nov, 13:11 |
| Hudson (JIRA) |
[jira] Commented: (NUTCH-574) Including inlink anchor text in index can create irrelevant search results. |
Tue, 13 Nov, 18:35 |
| Hudson (JIRA) |
[jira] Commented: (NUTCH-574) Including inlink anchor text in index can create irrelevant search results. |
Thu, 15 Nov, 08:54 |
| karthik085 |
db.ignore.internal.links and ranking algorithms |
Wed, 07 Nov, 20:32 |
| Dennis Kubes |
Re: db.ignore.internal.links and ranking algorithms |
Wed, 07 Nov, 20:57 |
| karthik085 |
Re: db.ignore.internal.links and ranking algorithms |
Wed, 07 Nov, 21:18 |
| Dennis Kubes |
Re: db.ignore.internal.links and ranking algorithms |
Wed, 07 Nov, 22:53 |
| karthik085 |
Re: db.ignore.internal.links and ranking algorithms |
Thu, 08 Nov, 04:08 |
| John Doe |
NullPointerException in FetchedSegments.getSummary() |
Thu, 08 Nov, 00:27 |
| Doğacan Güney (JIRA) |
[jira] Resolved: (NUTCH-547) Redirection handling: YahooSlurp's algorithm |
Thu, 08 Nov, 13:20 |
| Doğacan Güney (JIRA) |
[jira] Closed: (NUTCH-411) Parse ignores meta refresh redirection |
Thu, 08 Nov, 15:04 |
| Doğacan Güney (JIRA) |
[jira] Closed: (NUTCH-547) Redirection handling: YahooSlurp's algorithm |
Thu, 08 Nov, 15:04 |
| Doğacan Güney (JIRA) |
[jira] Resolved: (NUTCH-548) Move URLNormalizer from Outlink to ParseOutputFormat |
Thu, 08 Nov, 15:10 |
|
[jira] Updated: (NUTCH-567) Proper (?) handling of URIs in TagSoup. |
|
| Doğacan Güney (JIRA) |
[jira] Updated: (NUTCH-567) Proper (?) handling of URIs in TagSoup. |
Thu, 08 Nov, 15:14 |
| Dawid Weiss (JIRA) |
[jira] Updated: (NUTCH-567) Proper (?) handling of URIs in TagSoup. |
Fri, 09 Nov, 07:46 |
| Dawid Weiss (JIRA) |
[jira] Updated: (NUTCH-567) Proper (?) handling of URIs in TagSoup. |
Fri, 09 Nov, 07:46 |
| Dawid Weiss (JIRA) |
[jira] Updated: (NUTCH-567) Proper (?) handling of URIs in TagSoup. |
Fri, 09 Nov, 07:48 |
| Doğacan Güney (JIRA) |
[jira] Closed: (NUTCH-548) Move URLNormalizer from Outlink to ParseOutputFormat |
Thu, 08 Nov, 15:14 |
|
[jira] Commented: (NUTCH-567) Proper (?) handling of URIs in TagSoup. |
|
| Doğacan Güney (JIRA) |
[jira] Commented: (NUTCH-567) Proper (?) handling of URIs in TagSoup. |
Thu, 08 Nov, 15:21 |
| Dawid Weiss (JIRA) |
[jira] Commented: (NUTCH-567) Proper (?) handling of URIs in TagSoup. |
Thu, 08 Nov, 16:28 |
| Sami Siren (JIRA) |
[jira] Commented: (NUTCH-567) Proper (?) handling of URIs in TagSoup. |
Fri, 09 Nov, 15:37 |
| Doğacan Güney (JIRA) |
[jira] Closed: (NUTCH-538) Delete unused classes under o.a.n.util |
Thu, 08 Nov, 19:09 |
| Doğacan Güney (JIRA) |
[jira] Resolved: (NUTCH-538) Delete unused classes under o.a.n.util |
Thu, 08 Nov, 19:09 |
| Doğacan Güney (JIRA) |
[jira] Closed: (NUTCH-465) I download nutch 0.9 used tar zxvf nutch-0.9.tar.gz at last A lone zero block |
Thu, 08 Nov, 19:11 |
| Doğacan Güney (JIRA) |
[jira] Closed: (NUTCH-494) FindBugs: CrawlDbReader and DeleteDuplicates |
Thu, 08 Nov, 19:15 |
| Doğacan Güney (JIRA) |
[jira] Resolved: (NUTCH-494) FindBugs: CrawlDbReader and DeleteDuplicates |
Thu, 08 Nov, 19:15 |
| Ned Rockson |
Usage of mapred-default.xml is deprecated in hadoop0.15.0 |
Thu, 08 Nov, 22:20 |
| John H. Lee (JIRA) |
[jira] Created: (NUTCH-575) NPE in OpenSearchServlet when summary is null |
Thu, 08 Nov, 22:43 |
|
[jira] Updated: (NUTCH-575) NPE in OpenSearchServlet when summary is null |
|
| John H. Lee (JIRA) |
[jira] Updated: (NUTCH-575) NPE in OpenSearchServlet when summary is null |
Thu, 08 Nov, 22:45 |
| Dennis Kubes (JIRA) |
[jira] Updated: (NUTCH-575) NPE in OpenSearchServlet when summary is null |
Tue, 27 Nov, 17:25 |
| hud...@lucene.zones.apache.org |
Build failed in Hudson: Nutch-Nightly #261 |
Fri, 09 Nov, 05:36 |
| Doğacan Güney |
Re: Build failed in Hudson: Nutch-Nightly #261 |
Fri, 09 Nov, 13:25 |
|
[jira] Commented: (NUTCH-548) Move URLNormalizer from Outlink to ParseOutputFormat |
|
| Hudson (JIRA) |
[jira] Commented: (NUTCH-548) Move URLNormalizer from Outlink to ParseOutputFormat |
Fri, 09 Nov, 05:38 |
| Hudson (JIRA) |
[jira] Commented: (NUTCH-548) Move URLNormalizer from Outlink to ParseOutputFormat |
Sat, 10 Nov, 04:42 |