| Dennis Kubes (JIRA) |
[jira] Updated: (NUTCH-574) Including inlink anchor text in index can create irrelevant search results. |
Mon, 12 Nov, 23:01 |
| Dennis Kubes (JIRA) |
[jira] Commented: (NUTCH-574) Including inlink anchor text in index can create irrelevant search results. |
Mon, 12 Nov, 23:03 |
| xingjian |
takes the URI info, Content, headers, ect into a MYSQL database. |
Tue, 13 Nov, 05:37 |
| Sagar Naik |
Re: takes the URI info, Content, headers, ect into a MYSQL database. |
Tue, 13 Nov, 05:51 |
| xingjian |
Re: takes the URI info, Content, headers, ect into a MYSQL database. |
Tue, 13 Nov, 06:41 |
| Doğacan Güney (JIRA) |
[jira] Commented: (NUTCH-574) Including inlink anchor text in index can create irrelevant search results. |
Tue, 13 Nov, 10:50 |
| Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-574) Including inlink anchor text in index can create irrelevant search results. |
Tue, 13 Nov, 13:11 |
| david euler (JIRA) |
[jira] Commented: (NUTCH-540) some problem about the Nutch cache |
Tue, 13 Nov, 14:12 |
| david euler (JIRA) |
[jira] Commented: (NUTCH-540) some problem about the Nutch cache |
Tue, 13 Nov, 14:19 |
| eyal edri |
Need help in updating url in runtime in [Fetcher.java] |
Tue, 13 Nov, 15:30 |
| Dennis Kubes (JIRA) |
[jira] Resolved: (NUTCH-574) Including inlink anchor text in index can create irrelevant search results. |
Tue, 13 Nov, 17:46 |
| Hudson (JIRA) |
[jira] Commented: (NUTCH-574) Including inlink anchor text in index can create irrelevant search results. |
Tue, 13 Nov, 18:35 |
| Enis Soztutar (JIRA) |
[jira] Assigned: (NUTCH-573) Multiple Domains - Query Search |
Wed, 14 Nov, 07:20 |
| Enis Soztutar (JIRA) |
[jira] Updated: (NUTCH-573) Multiple Domains - Query Search |
Wed, 14 Nov, 07:58 |
| Enis Soztutar (JIRA) |
[jira] Updated: (NUTCH-573) Multiple Domains - Query Search |
Wed, 14 Nov, 08:01 |
| Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-573) Multiple Domains - Query Search |
Wed, 14 Nov, 09:48 |
| Enis Soztutar (JIRA) |
[jira] Commented: (NUTCH-573) Multiple Domains - Query Search |
Wed, 14 Nov, 10:09 |
| Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-573) Multiple Domains - Query Search |
Wed, 14 Nov, 11:05 |
| Doğacan Güney (JIRA) |
[jira] Commented: (NUTCH-573) Multiple Domains - Query Search |
Wed, 14 Nov, 11:36 |
| Enis Soztutar (JIRA) |
[jira] Commented: (NUTCH-573) Multiple Domains - Query Search |
Wed, 14 Nov, 13:07 |
| Rajasekar Karthik (JIRA) |
[jira] Created: (NUTCH-576) Different Analyzers Support |
Wed, 14 Nov, 15:10 |
| w00_008 |
Re: Commented: (NUTCH-422) index-extra plugin creates additional fields in the index, based on configurable logic |
Wed, 14 Nov, 18:14 |
| Dennis Kubes (JIRA) |
[jira] Reopened: (NUTCH-552) Upgrade Nutch to Hadoop 0.15.x |
Wed, 14 Nov, 23:33 |
| Dennis Kubes (JIRA) |
[jira] Updated: (NUTCH-552) Upgrade Nutch to Hadoop 0.15.x |
Wed, 14 Nov, 23:40 |
| Hudson (JIRA) |
[jira] Commented: (NUTCH-574) Including inlink anchor text in index can create irrelevant search results. |
Thu, 15 Nov, 08:54 |
| Dennis Kubes (JIRA) |
[jira] Updated: (NUTCH-444) Possibly use a different library to parse RSS feed for improved performance and compatibility |
Thu, 15 Nov, 16:47 |
| Dennis Kubes (JIRA) |
[jira] Issue Comment Edited: (NUTCH-444) Possibly use a different library to parse RSS feed for improved performance and compatibility |
Thu, 15 Nov, 17:09 |
| Renaud Richardet (JIRA) |
[jira] Commented: (NUTCH-444) Possibly use a different library to parse RSS feed for improved performance and compatibility |
Thu, 15 Nov, 17:13 |
| Dennis Kubes (JIRA) |
[jira] Closed: (NUTCH-552) Upgrade Nutch to Hadoop 0.15.x |
Thu, 15 Nov, 18:12 |
| Dennis Kubes |
Commit Times for Issues |
Thu, 15 Nov, 21:37 |
| Andrzej Bialecki |
Re: Commit Times for Issues |
Thu, 15 Nov, 21:56 |
| Ned Rockson |
Nutch trunk js-parser problem with extremely long and meaningless Elements |
Fri, 16 Nov, 02:18 |
| xingjian |
about heritrix crawl,Who will tell me in this Nutch forum?thanks |
Fri, 16 Nov, 05:00 |
| Enis Soztutar (JIRA) |
[jira] Commented: (NUTCH-573) Multiple Domains - Query Search |
Fri, 16 Nov, 13:14 |
| Chris Mattmann |
Re: Commit Times for Issues |
Fri, 16 Nov, 15:46 |
| Dennis Kubes |
Re: Commit Times for Issues |
Fri, 16 Nov, 17:45 |
| Marcin Okraszewski |
=?UTF-8?Q?Re:_Commit_Times_for_Issues?= |
Fri, 16 Nov, 19:33 |
| Dennis Kubes |
Re: Commit Times for Issues |
Fri, 16 Nov, 20:15 |
| Hudson (JIRA) |
[jira] Commented: (NUTCH-444) Possibly use a different library to parse RSS feed for improved performance and compatibility |
Fri, 16 Nov, 20:26 |
| Hudson (JIRA) |
[jira] Commented: (NUTCH-552) Upgrade Nutch to Hadoop 0.15.x |
Fri, 16 Nov, 20:26 |
| Andrzej Bialecki |
Re: Commit Times for Issues |
Fri, 16 Nov, 20:57 |
| Chris A. Mattmann (JIRA) |
[jira] Created: (NUTCH-577) Use explicit tika-config.xml file to enable mime magic detection to be turned on and off |
Sat, 17 Nov, 23:29 |
| Doğacan Güney (JIRA) |
[jira] Commented: (NUTCH-577) Use explicit tika-config.xml file to enable mime magic detection to be turned on and off |
Sun, 18 Nov, 08:48 |
| Otis Gospodnetic (JIRA) |
[jira] Commented: (NUTCH-442) Integrate Solr/Nutch |
Sun, 18 Nov, 22:30 |
| Doğacan Güney (JIRA) |
[jira] Commented: (NUTCH-442) Integrate Solr/Nutch |
Mon, 19 Nov, 21:04 |
| Doğacan Güney (JIRA) |
[jira] Updated: (NUTCH-442) Integrate Solr/Nutch |
Mon, 19 Nov, 21:12 |
| Doğacan Güney (JIRA) |
[jira] Issue Comment Edited: (NUTCH-442) Integrate Solr/Nutch |
Mon, 19 Nov, 21:27 |
| Nathaniel Powell (JIRA) |
[jira] Created: (NUTCH-578) URL fetched with 403 is generated over and over again |
Tue, 20 Nov, 21:39 |
| Nathaniel Powell (JIRA) |
[jira] Updated: (NUTCH-578) URL fetched with 403 is generated over and over again |
Tue, 20 Nov, 21:41 |
| Nathaniel Powell (JIRA) |
[jira] Updated: (NUTCH-578) URL fetched with 403 is generated over and over again |
Tue, 20 Nov, 21:41 |
| Nathaniel Powell (JIRA) |
[jira] Updated: (NUTCH-578) URL fetched with 403 is generated over and over again |
Tue, 20 Nov, 21:43 |
| Nathaniel Powell (JIRA) |
[jira] Updated: (NUTCH-578) URL fetched with 403 is generated over and over again |
Tue, 20 Nov, 21:43 |
| Nathaniel Powell (JIRA) |
[jira] Updated: (NUTCH-578) URL fetched with 403 is generated over and over again |
Tue, 20 Nov, 21:47 |
| Nathaniel Powell (JIRA) |
[jira] Updated: (NUTCH-578) URL fetched with 403 is generated over and over again |
Tue, 20 Nov, 21:49 |
| Nathaniel Powell (JIRA) |
[jira] Updated: (NUTCH-578) URL fetched with 403 is generated over and over again |
Tue, 20 Nov, 21:51 |
| Nathaniel Powell (JIRA) |
[jira] Updated: (NUTCH-578) URL fetched with 403 is generated over and over again |
Tue, 20 Nov, 21:51 |
| Joseph Chen (JIRA) |
[jira] Created: (NUTCH-579) Feed plugin only indexes one post per feed due to identical digest |
Wed, 21 Nov, 07:41 |
| Doğacan Güney (JIRA) |
[jira] Commented: (NUTCH-579) Feed plugin only indexes one post per feed due to identical digest |
Wed, 21 Nov, 08:50 |
| Sami Siren (JIRA) |
[jira] Updated: (NUTCH-580) Remove deprecated hadoop api calls (FS) |
Wed, 21 Nov, 16:48 |
| Sami Siren (JIRA) |
[jira] Created: (NUTCH-580) Remove deprecated hadoop api calls (FS) |
Wed, 21 Nov, 16:48 |
| Rohan Mehta (JIRA) |
[jira] Created: (NUTCH-581) DistributedSearch does not update search servers added to search-servers.txt on the fly |
Wed, 21 Nov, 16:58 |
| Rohan Mehta (JIRA) |
[jira] Updated: (NUTCH-581) DistributedSearch does not update search servers added to search-servers.txt on the fly |
Wed, 21 Nov, 17:00 |
| Sami Siren (JIRA) |
[jira] Created: (NUTCH-582) Add missing type parameters |
Wed, 21 Nov, 18:47 |
| Sami Siren (JIRA) |
[jira] Updated: (NUTCH-582) Add missing type parameters |
Wed, 21 Nov, 18:49 |
| Sami Siren |
Backwards compatibility strategy |
Thu, 22 Nov, 17:45 |
| shaowen yu |
Applicant for Nutch Project |
Fri, 23 Nov, 06:13 |
| Doğacan Güney |
Re: Backwards compatibility strategy |
Fri, 23 Nov, 11:40 |
| Grant Ingersoll |
Re: Applicant for Nutch Project |
Fri, 23 Nov, 12:07 |
| eyal edri |
Maintaining source url data (father) during runtime |
Sun, 25 Nov, 11:34 |
| eyal edri |
Re: Maintaining source url data (father) during runtime |
Mon, 26 Nov, 09:48 |
| jian chen |
Re: Maintaining source url data (father) during runtime |
Mon, 26 Nov, 18:12 |
| Dennis Kubes |
Re: Maintaining source url data (father) during runtime |
Mon, 26 Nov, 18:20 |
| eyal edri |
Re: Maintaining source url data (father) during runtime |
Tue, 27 Nov, 08:01 |
| Enis Soztutar (JIRA) |
[jira] Created: (NUTCH-583) FeedParser empty links for items |
Tue, 27 Nov, 15:01 |
| Frederic Ciminera |
Issue with IndexSearcher initialization in NuchBean |
Tue, 27 Nov, 17:10 |
| Dennis Kubes (JIRA) |
[jira] Updated: (NUTCH-575) NPE in OpenSearchServlet when summary is null |
Tue, 27 Nov, 17:25 |
| Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-575) NPE in OpenSearchServlet when summary is null |
Tue, 27 Nov, 19:06 |
| Tomislav Poljak (JIRA) |
[jira] Commented: (NUTCH-442) Integrate Solr/Nutch |
Wed, 28 Nov, 10:14 |
| ÑÕÔÏÐý |
some question about development |
Wed, 28 Nov, 14:39 |
| Ruslan Ermilov (JIRA) |
[jira] Created: (NUTCH-584) urls missing from fetchlist |
Wed, 28 Nov, 15:57 |
| Tim Gautier |
Re: some question about development |
Wed, 28 Nov, 18:16 |
| Susam Pal (JIRA) |
[jira] Updated: (NUTCH-559) NTLM, Basic and Digest Authentication schemes for web/proxy server |
Wed, 28 Nov, 18:29 |
| Doğacan Güney (JIRA) |
[jira] Commented: (NUTCH-442) Integrate Solr/Nutch |
Thu, 29 Nov, 08:07 |
| Andrea Spinelli (JIRA) |
[jira] Created: (NUTCH-585) [PARSE-HTML plugin] Block certain parts of HTML code from being indexed |
Thu, 29 Nov, 11:13 |
| pavan kumar donepudi |
Parsing ppt with mimetype application/x-mspowerpoint |
Thu, 29 Nov, 15:38 |
| Matt Kangas |
Re: [jira] Created: (NUTCH-585) [PARSE-HTML plugin] Block certain parts of HTML code from being indexed |
Thu, 29 Nov, 21:07 |