| Sami Siren (JIRA) |
[jira] Resolved: (NUTCH-688) Fix missing/wrong headers in source files |
Wed, 18 Feb, 09:17 |
| Sami Siren (JIRA) |
[jira] Resolved: (NUTCH-691) Update jakarta poi jars to the most relevant version |
Wed, 18 Feb, 12:45 |
| Sami Siren (JIRA) |
[jira] Resolved: (NUTCH-563) Include custom fields in BasicQueryFilter |
Wed, 18 Feb, 12:55 |
| Sami Siren (JIRA) |
[jira] Commented: (NUTCH-692) AlreadyBeingCreatedException with Hadoop 0.19 |
Wed, 18 Feb, 13:07 |
| Sami Siren (JIRA) |
[jira] Updated: (NUTCH-583) FeedParser empty links for items |
Wed, 18 Feb, 13:47 |
| Sami Siren (JIRA) |
[jira] Updated: (NUTCH-694) Distributed Search Server fails |
Thu, 19 Feb, 08:45 |
| Sami Siren (JIRA) |
[jira] Resolved: (NUTCH-695) incorrect mime type detection by MoreIndexingFilter plugin |
Thu, 19 Feb, 10:28 |
| Sami Siren (JIRA) |
[jira] Commented: (NUTCH-694) Distributed Search Server fails |
Thu, 19 Feb, 11:10 |
| Sami Siren (JIRA) |
[jira] Updated: (NUTCH-694) Distributed Search Server fails |
Fri, 20 Feb, 08:47 |
| Sami Siren (JIRA) |
[jira] Updated: (NUTCH-694) Distributed Search Server fails |
Fri, 20 Feb, 09:37 |
| Sami Siren (JIRA) |
[jira] Updated: (NUTCH-698) CrawlDb is corrupted after a few crawl cycles |
Fri, 20 Feb, 09:39 |
| Sami Siren (JIRA) |
[jira] Updated: (NUTCH-573) Multiple Domains - Query Search |
Fri, 20 Feb, 09:39 |
| Sami Siren (JIRA) |
[jira] Updated: (NUTCH-578) URL fetched with 403 is generated over and over again |
Fri, 20 Feb, 09:41 |
| Sami Siren (JIRA) |
[jira] Updated: (NUTCH-247) robot parser to restrict. |
Fri, 20 Feb, 09:43 |
| Sami Siren (JIRA) |
[jira] Updated: (NUTCH-477) Extend URLFilters to support different filtering chains |
Fri, 20 Feb, 09:43 |
| Sami Siren (JIRA) |
[jira] Resolved: (NUTCH-694) Distributed Search Server fails |
Mon, 23 Feb, 07:05 |
| Sami Siren (JIRA) |
[jira] Commented: (NUTCH-477) Extend URLFilters to support different filtering chains |
Mon, 23 Feb, 07:23 |
| Sami Siren (JIRA) |
[jira] Resolved: (NUTCH-626) fetcher2 breaks out the domain with db.ignore.external.links set at cross domain redirects |
Tue, 24 Feb, 09:20 |
| Sami Siren (JIRA) |
[jira] Resolved: (NUTCH-247) robot parser to restrict. |
Tue, 24 Feb, 09:56 |
| Sami Siren (JIRA) |
[jira] Created: (NUTCH-701) replace Fetcher with Fetcher2 |
Tue, 24 Feb, 10:08 |
| Sami Siren (JIRA) |
[jira] Updated: (NUTCH-701) Replace Fetcher with Fetcher2 |
Tue, 24 Feb, 10:08 |
| Sami Siren (JIRA) |
[jira] Resolved: (NUTCH-698) CrawlDb is corrupted after a few crawl cycles |
Tue, 24 Feb, 10:12 |
| Sami Siren (JIRA) |
[jira] Commented: (NUTCH-699) Add an "official" solr schema for solr integration |
Tue, 24 Feb, 10:14 |
| Sami Siren (JIRA) |
[jira] Resolved: (NUTCH-701) Replace Fetcher with Fetcher2 |
Tue, 24 Feb, 10:56 |
| Sami Siren (JIRA) |
[jira] Updated: (NUTCH-669) Consolidate code for Fetcher and Fetcher2 |
Tue, 24 Feb, 11:04 |
| Sami Siren (JIRA) |
[jira] Resolved: (NUTCH-699) Add an "official" solr schema for solr integration |
Fri, 27 Feb, 06:22 |
| Sami Siren (JIRA) |
[jira] Commented: (NUTCH-703) Upgrade to Hadoop 0.19.1 |
Fri, 27 Feb, 06:24 |
| Sami Siren (JIRA) |
[jira] Assigned: (NUTCH-669) Consolidate code for Fetcher and Fetcher2 |
Fri, 27 Feb, 06:24 |
| Sami Siren (JIRA) |
[jira] Commented: (NUTCH-705) parse-rtf plugin |
Fri, 27 Feb, 20:13 |
| Techie |
Re: writing plugin |
Sun, 08 Feb, 12:28 |
| buddha1021 |
Is there the functions of "More Like This" and "Spell Checking"? |
Wed, 25 Feb, 07:18 |
| dealmaker |
Re: Release 1.0? |
Sat, 28 Feb, 07:10 |
| dealmaker |
Re: Release 1.0? |
Sat, 28 Feb, 08:50 |
| dealmaker |
Re: Release 1.0? |
Sat, 28 Feb, 15:22 |
| hasan (JIRA) |
[jira] Commented: (NUTCH-631) MoreIndexingFilter fails with NoSuchElementException |
Tue, 17 Feb, 15:49 |
| julien nioche (JIRA) |
[jira] Closed: (NUTCH-656) DeleteDuplicates based on crawlDB only |
Tue, 03 Feb, 10:39 |
| julien nioche (JIRA) |
[jira] Updated: (NUTCH-563) Include custom fields in BasicQueryFilter |
Tue, 10 Feb, 10:33 |
| julien nioche (JIRA) |
[jira] Commented: (NUTCH-668) Domain URL Filter |
Thu, 12 Feb, 16:23 |
| julien nioche (JIRA) |
[jira] Created: (NUTCH-692) AlreadyBeingCreatedException with Hadoop 0.19 |
Wed, 18 Feb, 12:31 |
| julien nioche (JIRA) |
[jira] Commented: (NUTCH-692) AlreadyBeingCreatedException with Hadoop 0.19 |
Wed, 18 Feb, 13:21 |
| julien nioche (JIRA) |
[jira] Created: (NUTCH-696) Timeout for Parser |
Thu, 19 Feb, 16:58 |
| julien nioche (JIRA) |
[jira] Created: (NUTCH-700) Neko1.9.11 goes into a loop |
Fri, 20 Feb, 10:45 |
| julien nioche (JIRA) |
[jira] Commented: (NUTCH-700) Neko1.9.11 goes into a loop |
Fri, 20 Feb, 11:35 |
| julien nioche (JIRA) |
[jira] Commented: (NUTCH-692) AlreadyBeingCreatedException with Hadoop 0.19 |
Sat, 21 Feb, 01:08 |
| julien nioche (JIRA) |
[jira] Created: (NUTCH-702) Lazy Instanciation of Metadata in CrawlDatum |
Wed, 25 Feb, 13:07 |
| julien nioche (JIRA) |
[jira] Updated: (NUTCH-702) Lazy Instanciation of Metadata in CrawlDatum |
Wed, 25 Feb, 13:27 |
| julien nioche (JIRA) |
[jira] Commented: (NUTCH-696) Timeout for Parser |
Wed, 25 Feb, 14:29 |
| julien nioche (JIRA) |
[jira] Updated: (NUTCH-702) Lazy Instanciation of Metadata in CrawlDatum |
Wed, 25 Feb, 22:19 |
| julien nioche (JIRA) |
[jira] Updated: (NUTCH-702) Lazy Instanciation of Metadata in CrawlDatum |
Wed, 25 Feb, 22:19 |
| kr (JIRA) |
[jira] Created: (NUTCH-704) ensure that more important pages are crawled first |
Thu, 26 Feb, 06:51 |