| Sami Siren (JIRA) | [jira] Commented: (NUTCH-689) Swf parser doesn't seem to handle relative links | Wed, 18 Feb, 08:33 |
| Sami Siren (JIRA) | [jira] Resolved: (NUTCH-591) StringIndexOutOfBoundsException when extracting text from a Word document. | Wed, 18 Feb, 08:39 |
| Sami Siren (JIRA) | [jira] Resolved: (NUTCH-688) Fix missing/wrong headers in source files | Wed, 18 Feb, 09:17 |
| julien nioche (JIRA) | [jira] Created: (NUTCH-692) AlreadyBeingCreatedException with Hadoop 0.19 | Wed, 18 Feb, 12:31 |
| Sami Siren (JIRA) | [jira] Resolved: (NUTCH-691) Update jakarta poi jars to the most relevant version | Wed, 18 Feb, 12:45 |
| Sami Siren (JIRA) | [jira] Resolved: (NUTCH-563) Include custom fields in BasicQueryFilter | Wed, 18 Feb, 12:55 |
| Sami Siren (JIRA) | [jira] Commented: (NUTCH-692) AlreadyBeingCreatedException with Hadoop 0.19 | Wed, 18 Feb, 13:07 |
| julien nioche (JIRA) | [jira] Commented: (NUTCH-692) AlreadyBeingCreatedException with Hadoop 0.19 | Wed, 18 Feb, 13:21 |
| Sami Siren (JIRA) | [jira] Updated: (NUTCH-583) FeedParser empty links for items | Wed, 18 Feb, 13:47 |
| Sami Siren | dump Fetcher? | Wed, 18 Feb, 13:58 |
| Peter Sparks (JIRA) | [jira] Commented: (NUTCH-689) Swf parser doesn't seem to handle relative links | Wed, 18 Feb, 14:05 |
| Justin Yao | would someone help confirm a patch (fix incorrect encoding detection in cached.jsp) | Wed, 18 Feb, 18:55 |
| Sami Siren | Re: would someone help confirm a patch (fix incorrect encoding detection in cached.jsp) | Wed, 18 Feb, 20:13 |
| Andrew McCall (JIRA) | [jira] Created: (NUTCH-693) Add configurable option for treating nofollow behaviour. | Wed, 18 Feb, 21:00 |
| Andrew McCall (JIRA) | [jira] Updated: (NUTCH-693) Add configurable option for treating nofollow behaviour. | Wed, 18 Feb, 21:00 |
| Hudson (JIRA) | [jira] Commented: (NUTCH-691) Update jakarta poi jars to the most relevant version | Thu, 19 Feb, 04:17 |
| Hudson (JIRA) | [jira] Commented: (NUTCH-563) Include custom fields in BasicQueryFilter | Thu, 19 Feb, 04:17 |
| Hudson (JIRA) | [jira] Commented: (NUTCH-688) Fix missing/wrong headers in source files | Thu, 19 Feb, 04:17 |
| Hudson (JIRA) | [jira] Commented: (NUTCH-687) Add RAT | Thu, 19 Feb, 04:17 |
| Dr. Nadine Hochstotter (JIRA) | [jira] Created: (NUTCH-694) Distributed Search Server fails | Thu, 19 Feb, 08:39 |
| Sami Siren (JIRA) | [jira] Updated: (NUTCH-694) Distributed Search Server fails | Thu, 19 Feb, 08:45 |
| Dmitry Lihachev (JIRA) | [jira] Created: (NUTCH-695) incorrect mime type detection by MoreIndexingFilter plugin | Thu, 19 Feb, 10:05 |
| Dmitry Lihachev (JIRA) | [jira] Updated: (NUTCH-695) incorrect mime type detection by MoreIndexingFilter plugin | Thu, 19 Feb, 10:11 |
| Dmitry Lihachev (JIRA) | [jira] Updated: (NUTCH-695) incorrect mime type detection by MoreIndexingFilter plugin | Thu, 19 Feb, 10:13 |
| Dmitry Lihachev (JIRA) | [jira] Updated: (NUTCH-695) incorrect mime type detection by MoreIndexingFilter plugin | Thu, 19 Feb, 10:15 |
| Dmitry Lihachev (JIRA) | [jira] Updated: (NUTCH-695) incorrect mime type detection by MoreIndexingFilter plugin | Thu, 19 Feb, 10:15 |
| Dmitry Lihachev (JIRA) | [jira] Issue Comment Edited: (NUTCH-695) incorrect mime type detection by MoreIndexingFilter plugin | Thu, 19 Feb, 10:17 |
| Dmitry Lihachev (JIRA) | [jira] Issue Comment Edited: (NUTCH-695) incorrect mime type detection by MoreIndexingFilter plugin | Thu, 19 Feb, 10:17 |
| Sami Siren (JIRA) | [jira] Resolved: (NUTCH-695) incorrect mime type detection by MoreIndexingFilter plugin | Thu, 19 Feb, 10:28 |
| Dmitry Lihachev (JIRA) | [jira] Commented: (NUTCH-695) incorrect mime type detection by MoreIndexingFilter plugin | Thu, 19 Feb, 10:30 |
| Dr. Nadine Hochstotter (JIRA) | [jira] Commented: (NUTCH-694) Distributed Search Server fails | Thu, 19 Feb, 10:52 |
| Sami Siren (JIRA) | [jira] Commented: (NUTCH-694) Distributed Search Server fails | Thu, 19 Feb, 11:10 |
| Andrew McCall (JIRA) | [jira] Commented: (NUTCH-650) Hbase Integration | Thu, 19 Feb, 13:54 |
| Andrew McCall (JIRA) | [jira] Updated: (NUTCH-650) Hbase Integration | Thu, 19 Feb, 13:54 |
| julien nioche (JIRA) | [jira] Created: (NUTCH-696) Timeout for Parser | Thu, 19 Feb, 16:58 |
| Dr. Nadine Hochstotter (JIRA) | [jira] Commented: (NUTCH-694) Distributed Search Server fails | Thu, 19 Feb, 17:32 |
| Apache Wiki | [Nutch Wiki] Update of "RunNutchInEclipse0.9" by FrankMcCown | Thu, 19 Feb, 19:28 |
| Doğacan Güney (JIRA) | [jira] Commented: (NUTCH-696) Timeout for Parser | Thu, 19 Feb, 23:22 |
| Dmitry Lihachev (JIRA) | [jira] Commented: (NUTCH-684) Dedup support for Solr | Fri, 20 Feb, 04:10 |
| Dmitry Lihachev (JIRA) | [jira] Updated: (NUTCH-684) Dedup support for Solr | Fri, 20 Feb, 04:10 |
| Hudson (JIRA) | [jira] Commented: (NUTCH-695) incorrect mime type detection by MoreIndexingFilter plugin | Fri, 20 Feb, 04:22 |
| Dmitry Lihachev (JIRA) | [jira] Updated: (NUTCH-684) Dedup support for Solr | Fri, 20 Feb, 06:41 |
| Dmitry Lihachev (JIRA) | [jira] Issue Comment Edited: (NUTCH-684) Dedup support for Solr | Fri, 20 Feb, 06:41 |
| Dmitry Lihachev (JIRA) | [jira] Updated: (NUTCH-684) Dedup support for Solr | Fri, 20 Feb, 06:51 |
| Dmitry Lihachev (JIRA) | [jira] Updated: (NUTCH-684) Dedup support for Solr | Fri, 20 Feb, 06:51 |
| Andrzej Bialecki (JIRA) | [jira] Commented: (NUTCH-694) Distributed Search Server fails | Fri, 20 Feb, 07:31 |
| Dmitry Lihachev (JIRA) | [jira] Updated: (NUTCH-697) Generate log output for solr indexer and dedup | Fri, 20 Feb, 08:11 |
| Dmitry Lihachev (JIRA) | [jira] Created: (NUTCH-697) Generate log output for solr indexer and dedup | Fri, 20 Feb, 08:11 |
| Sami Siren (JIRA) | [jira] Updated: (NUTCH-694) Distributed Search Server fails | Fri, 20 Feb, 08:47 |
| Doğacan Güney (JIRA) | [jira] Created: (NUTCH-698) CrawlDb is corrupted after a few crawl cycles | Fri, 20 Feb, 08:55 |
| Doğacan Güney (JIRA) | [jira] Updated: (NUTCH-698) CrawlDb is corrupted after a few crawl cycles | Fri, 20 Feb, 08:55 |
| Doğacan Güney (JIRA) | [jira] Updated: (NUTCH-698) CrawlDb is corrupted after a few crawl cycles | Fri, 20 Feb, 08:55 |
| Apache Wiki | [Nutch Wiki] Update of "RunningNutchAndSolr" by SamiSiren | Fri, 20 Feb, 08:56 |
| Apache Wiki | [Nutch Wiki] Update of "InstallingWeb2" by SamiSiren | Fri, 20 Feb, 09:01 |
| Sami Siren (JIRA) | [jira] Updated: (NUTCH-694) Distributed Search Server fails | Fri, 20 Feb, 09:37 |
| Sami Siren (JIRA) | [jira] Updated: (NUTCH-698) CrawlDb is corrupted after a few crawl cycles | Fri, 20 Feb, 09:39 |
| Sami Siren (JIRA) | [jira] Updated: (NUTCH-573) Multiple Domains - Query Search | Fri, 20 Feb, 09:39 |
| Sami Siren (JIRA) | [jira] Updated: (NUTCH-578) URL fetched with 403 is generated over and over again | Fri, 20 Feb, 09:41 |
| Sami Siren (JIRA) | [jira] Updated: (NUTCH-247) robot parser to restrict. | Fri, 20 Feb, 09:43 |
| Sami Siren (JIRA) | [jira] Updated: (NUTCH-477) Extend URLFilters to support different filtering chains | Fri, 20 Feb, 09:43 |
| Andrzej Bialecki (JIRA) | [jira] Commented: (NUTCH-684) Dedup support for Solr | Fri, 20 Feb, 10:03 |
| Andrzej Bialecki | Re: [Nutch Wiki] Update of "InstallingWeb2" by SamiSiren | Fri, 20 Feb, 10:07 |
| Sami Siren | Re: [Nutch Wiki] Update of "InstallingWeb2" by SamiSiren | Fri, 20 Feb, 10:10 |
| Dmitry Lihachev (JIRA) | [jira] Commented: (NUTCH-684) Dedup support for Solr | Fri, 20 Feb, 10:11 |
| Dmitry Lihachev (JIRA) | [jira] Issue Comment Edited: (NUTCH-684) Dedup support for Solr | Fri, 20 Feb, 10:11 |
| Andrzej Bialecki (JIRA) | [jira] Commented: (NUTCH-477) Extend URLFilters to support different filtering chains | Fri, 20 Feb, 10:17 |
| Doğacan Güney (JIRA) | [jira] Commented: (NUTCH-684) Dedup support for Solr | Fri, 20 Feb, 10:31 |
| Doğacan Güney (JIRA) | [jira] Commented: (NUTCH-684) Dedup support for Solr | Fri, 20 Feb, 10:31 |
| Doğacan Güney (JIRA) | [jira] Created: (NUTCH-699) Add an "official" solr schema for solr integration | Fri, 20 Feb, 10:33 |
| Doğacan Güney (JIRA) | [jira] Commented: (NUTCH-699) Add an "official" solr schema for solr integration | Fri, 20 Feb, 10:35 |
| julien nioche (JIRA) | [jira] Created: (NUTCH-700) Neko1.9.11 goes into a loop | Fri, 20 Feb, 10:45 |
| Andrzej Bialecki (JIRA) | [jira] Commented: (NUTCH-684) Dedup support for Solr | Fri, 20 Feb, 10:47 |
| Dmitry Lihachev (JIRA) | [jira] Commented: (NUTCH-699) Add an "official" solr schema for solr integration | Fri, 20 Feb, 10:49 |
| julien nioche (JIRA) | [jira] Commented: (NUTCH-700) Neko1.9.11 goes into a loop | Fri, 20 Feb, 11:35 |
| Dr. Nadine Hochstotter (JIRA) | [jira] Commented: (NUTCH-694) Distributed Search Server fails | Fri, 20 Feb, 14:51 |
| julien nioche (JIRA) | [jira] Commented: (NUTCH-692) AlreadyBeingCreatedException with Hadoop 0.19 | Sat, 21 Feb, 01:08 |
| Andrew McCall (JIRA) | [jira] Updated: (NUTCH-650) Hbase Integration | Sat, 21 Feb, 15:16 |
| Sami Siren (JIRA) | [jira] Resolved: (NUTCH-694) Distributed Search Server fails | Mon, 23 Feb, 07:05 |
| Sami Siren (JIRA) | [jira] Commented: (NUTCH-477) Extend URLFilters to support different filtering chains | Mon, 23 Feb, 07:23 |
| Dennis Kubes (JIRA) | [jira] Commented: (NUTCH-477) Extend URLFilters to support different filtering chains | Mon, 23 Feb, 14:27 |
| Andrew McCall (JIRA) | [jira] Updated: (NUTCH-650) Hbase Integration | Mon, 23 Feb, 17:24 |
| Hudson (JIRA) | [jira] Commented: (NUTCH-694) Distributed Search Server fails | Tue, 24 Feb, 04:16 |
| Sami Siren (JIRA) | [jira] Resolved: (NUTCH-626) fetcher2 breaks out the domain with db.ignore.external.links set at cross domain redirects | Tue, 24 Feb, 09:20 |
| Dmitry Lihachev (JIRA) | [jira] Commented: (NUTCH-644) RTF parser doesn't compile anymore | Tue, 24 Feb, 09:34 |
| Dmitry Lihachev (JIRA) | [jira] Updated: (NUTCH-644) RTF parser doesn't compile anymore | Tue, 24 Feb, 09:38 |
| Sami Siren (JIRA) | [jira] Resolved: (NUTCH-247) robot parser to restrict. | Tue, 24 Feb, 09:56 |
| Sami Siren (JIRA) | [jira] Created: (NUTCH-701) replace Fetcher with Fetcher2 | Tue, 24 Feb, 10:08 |
| Sami Siren (JIRA) | [jira] Updated: (NUTCH-701) Replace Fetcher with Fetcher2 | Tue, 24 Feb, 10:08 |
| Sami Siren (JIRA) | [jira] Resolved: (NUTCH-698) CrawlDb is corrupted after a few crawl cycles | Tue, 24 Feb, 10:12 |
| Sami Siren (JIRA) | [jira] Commented: (NUTCH-699) Add an "official" solr schema for solr integration | Tue, 24 Feb, 10:14 |
| Dmitry Lihachev (JIRA) | [jira] Updated: (NUTCH-644) RTF parser doesn't compile anymore | Tue, 24 Feb, 10:36 |
| Andrzej Bialecki (JIRA) | [jira] Commented: (NUTCH-701) Replace Fetcher with Fetcher2 | Tue, 24 Feb, 10:48 |
| Sami Siren (JIRA) | [jira] Resolved: (NUTCH-701) Replace Fetcher with Fetcher2 | Tue, 24 Feb, 10:56 |
| Sami Siren (JIRA) | [jira] Updated: (NUTCH-669) Consolidate code for Fetcher and Fetcher2 | Tue, 24 Feb, 11:04 |
| Andrzej Bialecki (JIRA) | [jira] Commented: (NUTCH-699) Add an "official" solr schema for solr integration | Tue, 24 Feb, 11:12 |
| Bartosz Gadzimski | NutchAnalysis.java STOP_WORDS not configurable? | Tue, 24 Feb, 13:28 |
| Hudson (JIRA) | [jira] Commented: (NUTCH-698) CrawlDb is corrupted after a few crawl cycles | Wed, 25 Feb, 04:17 |
| Hudson (JIRA) | [jira] Commented: (NUTCH-247) robot parser to restrict. | Wed, 25 Feb, 04:17 |
| Hudson (JIRA) | [jira] Commented: (NUTCH-626) fetcher2 breaks out the domain with db.ignore.external.links set at cross domain redirects | Wed, 25 Feb, 04:17 |
| buddha1021 | Is there the functions of "More Like This" and "Spell Checking"? | Wed, 25 Feb, 07:18 |