| julien nioche (JIRA) |
[jira] Created: (NUTCH-702) Lazy Instanciation of Metadata in CrawlDatum |
Wed, 25 Feb, 13:07 |
| julien nioche (JIRA) |
[jira] Updated: (NUTCH-702) Lazy Instanciation of Metadata in CrawlDatum |
Wed, 25 Feb, 13:27 |
| julien nioche (JIRA) |
[jira] Commented: (NUTCH-696) Timeout for Parser |
Wed, 25 Feb, 14:29 |
| Andrzej Bialecki (JIRA) |
[jira] Created: (NUTCH-703) Upgrade to Hadoop 0.19.1 |
Wed, 25 Feb, 16:47 |
| julien nioche (JIRA) |
[jira] Updated: (NUTCH-702) Lazy Instanciation of Metadata in CrawlDatum |
Wed, 25 Feb, 22:19 |
| julien nioche (JIRA) |
[jira] Updated: (NUTCH-702) Lazy Instanciation of Metadata in CrawlDatum |
Wed, 25 Feb, 22:19 |
| kr (JIRA) |
[jira] Created: (NUTCH-704) ensure that more important pages are crawled first |
Thu, 26 Feb, 06:51 |
| Andrzej Bialecki (JIRA) |
[jira] Closed: (NUTCH-704) ensure that more important pages are crawled first |
Thu, 26 Feb, 08:29 |
| Apache Wiki |
[Nutch Wiki] Update of "DownloadingNutch" by BartoszGadzimski |
Thu, 26 Feb, 17:46 |
| Apache Wiki |
[Nutch Wiki] Update of "SimpleMapReduceTutorial" by BartoszGadzimski |
Thu, 26 Feb, 17:57 |
| Apache Wiki |
[Nutch Wiki] Trivial Update of "FrontPage" by BartoszGadzimski |
Thu, 26 Feb, 18:03 |
| Dmitry Lihachev (JIRA) |
[jira] Created: (NUTCH-705) parse-rtf plugin |
Fri, 27 Feb, 04:18 |
| Dmitry Lihachev (JIRA) |
[jira] Commented: (NUTCH-705) parse-rtf plugin |
Fri, 27 Feb, 04:18 |
| Dmitry Lihachev (JIRA) |
[jira] Updated: (NUTCH-705) parse-rtf plugin |
Fri, 27 Feb, 04:30 |
| Dmitry Lihachev (JIRA) |
[jira] Commented: (NUTCH-644) RTF parser doesn't compile anymore |
Fri, 27 Feb, 04:32 |
| Gopikrishnan (JIRA) |
[jira] Commented: (NUTCH-185) XMLParser is configurable xml parser plugin. |
Fri, 27 Feb, 06:12 |
| Sami Siren (JIRA) |
[jira] Resolved: (NUTCH-699) Add an "official" solr schema for solr integration |
Fri, 27 Feb, 06:22 |
| Sami Siren (JIRA) |
[jira] Commented: (NUTCH-703) Upgrade to Hadoop 0.19.1 |
Fri, 27 Feb, 06:24 |
| Sami Siren (JIRA) |
[jira] Assigned: (NUTCH-669) Consolidate code for Fetcher and Fetcher2 |
Fri, 27 Feb, 06:24 |
| Andrzej Bialecki |
Re: [jira] Commented: (NUTCH-703) Upgrade to Hadoop 0.19.1 |
Fri, 27 Feb, 07:28 |
| Meghna Kukreja |
Url regex normalizer |
Fri, 27 Feb, 16:32 |
| Andrzej Bialecki |
Re: Url regex normalizer |
Fri, 27 Feb, 17:10 |
| Otis Gospodnetic |
Re: NutchAnalysis.java STOP_WORDS not configurable? |
Fri, 27 Feb, 18:21 |
| Meghna Kukreja (JIRA) |
[jira] Created: (NUTCH-706) Url regex normalizer |
Fri, 27 Feb, 18:47 |
| Meghna Kukreja (JIRA) |
[jira] Commented: (NUTCH-706) Url regex normalizer |
Fri, 27 Feb, 18:49 |
| Meghna Kukreja |
Re: Url regex normalizer |
Fri, 27 Feb, 18:50 |
| Andrzej Bialecki (JIRA) |
[jira] Closed: (NUTCH-703) Upgrade to Hadoop 0.19.1 |
Fri, 27 Feb, 18:55 |
| Sami Siren (JIRA) |
[jira] Commented: (NUTCH-705) parse-rtf plugin |
Fri, 27 Feb, 20:13 |
| Sami Siren |
Re: Url regex normalizer |
Fri, 27 Feb, 20:18 |
| Hudson (JIRA) |
[jira] Commented: (NUTCH-703) Upgrade to Hadoop 0.19.1 |
Sat, 28 Feb, 04:17 |
| Hudson (JIRA) |
[jira] Commented: (NUTCH-699) Add an "official" solr schema for solr integration |
Sat, 28 Feb, 04:17 |
| dealmaker |
Re: Release 1.0? |
Sat, 28 Feb, 07:10 |
| Sami Siren |
Re: Release 1.0? |
Sat, 28 Feb, 08:00 |
| Sami Siren |
Re: Release 1.0? |
Sat, 28 Feb, 08:04 |
| Sami Siren |
planning for nutch-1.0-rc1 |
Sat, 28 Feb, 08:26 |
| dealmaker |
Re: Release 1.0? |
Sat, 28 Feb, 08:50 |
| Doğacan Güney |
Re: Release 1.0? |
Sat, 28 Feb, 15:05 |
| dealmaker |
Re: Release 1.0? |
Sat, 28 Feb, 15:22 |
| Andrzej Bialecki (JIRA) |
[jira] Updated: (NUTCH-705) parse-rtf plugin |
Sat, 28 Feb, 15:46 |
| Michael Chan (JIRA) |
[jira] Created: (NUTCH-707) Generation of multiple segments in multiple runs returns only 1 segment |
Sat, 28 Feb, 17:42 |
| Michael Chan (JIRA) |
[jira] Updated: (NUTCH-707) Generation of multiple segments in multiple runs returns only 1 segment |
Sat, 28 Feb, 17:44 |
| Andrzej Bialecki |
Re: Release 1.0? |
Sat, 28 Feb, 18:43 |
| Dennis Kubes |
Re: Release 1.0? |
Sat, 28 Feb, 18:44 |
| Andrzej Bialecki (JIRA) |
[jira] Updated: (NUTCH-477) Extend URLFilters to support different filtering chains |
Sat, 28 Feb, 18:48 |
| Andrzej Bialecki |
Re: planning for nutch-1.0-rc1 |
Sat, 28 Feb, 18:48 |
| Andrzej Bialecki |
Re: Release 1.0? |
Sat, 28 Feb, 18:51 |
| Andrzej Bialecki |
Re: planning for nutch-1.0-rc1 |
Sat, 28 Feb, 18:57 |
| Doug Cook (JIRA) |
[jira] Commented: (NUTCH-419) unavailable robots.txt kills fetch |
Sat, 28 Feb, 19:06 |
| Doug Cook (JIRA) |
[jira] Updated: (NUTCH-419) unavailable robots.txt kills fetch |
Sat, 28 Feb, 19:20 |
| Otis Gospodnetic (JIRA) |
[jira] Updated: (NUTCH-707) Generation of multiple segments in multiple runs returns only 1 segment |
Sat, 28 Feb, 22:44 |