| Dennis Kubes (JIRA) |
[jira] Created: (NUTCH-607) Update build.xml to include tika jar |
Fri, 08 Feb, 22:24 |
| Dennis Kubes (JIRA) |
[jira] Assigned: (NUTCH-606) Refactoring of Generator, run all urls through checks |
Fri, 08 Feb, 22:58 |
| Dennis Kubes (JIRA) |
[jira] Updated: (NUTCH-606) Refactoring of Generator, run all urls through checks |
Fri, 08 Feb, 23:00 |
| Dennis Kubes (JIRA) |
[jira] Updated: (NUTCH-607) Update build.xml to include tika jar in war file |
Fri, 08 Feb, 23:02 |
| Dennis Kubes (JIRA) |
[jira] Updated: (NUTCH-606) Refactoring of Generator, run all urls through checks |
Fri, 08 Feb, 23:28 |
| Dennis Kubes (JIRA) |
[jira] Updated: (NUTCH-606) Refactoring of Generator, run all urls through checks |
Sat, 09 Feb, 05:09 |
| Dennis Kubes (JIRA) |
[jira] Updated: (NUTCH-606) Refactoring of Generator, run all urls through checks |
Sat, 09 Feb, 18:40 |
| Dennis Kubes (JIRA) |
[jira] Resolved: (NUTCH-607) Update build.xml to include tika jar in war file |
Sat, 09 Feb, 18:43 |
| Dennis Kubes (JIRA) |
[jira] Created: (NUTCH-609) Allow Plugins to be Loaded from Jar File(s) |
Sat, 09 Feb, 18:56 |
| Dennis Kubes (JIRA) |
[jira] Commented: (NUTCH-609) Allow Plugins to be Loaded from Jar File(s) |
Sat, 09 Feb, 19:55 |
| Dennis Kubes (JIRA) |
[jira] Commented: (NUTCH-609) Allow Plugins to be Loaded from Jar File(s) |
Sun, 10 Feb, 04:42 |
| Dennis Kubes (JIRA) |
[jira] Commented: (NUTCH-606) Refactoring of Generator, run all urls through checks |
Mon, 11 Feb, 20:14 |
| Dennis Kubes (JIRA) |
[jira] Commented: (NUTCH-605) Change deprecated configuration methods for Hadoop |
Mon, 11 Feb, 20:14 |
| Dennis Kubes (JIRA) |
[jira] Commented: (NUTCH-603) Add more default url normalizations |
Mon, 11 Feb, 20:16 |
| Dennis Kubes (JIRA) |
[jira] Commented: (NUTCH-608) Upgrade nutch to use released apache-tika-0.1-incubating |
Mon, 11 Feb, 21:38 |
| Dennis Kubes (JIRA) |
[jira] Commented: (NUTCH-603) Add more default url normalizations |
Tue, 12 Feb, 05:02 |
| Dennis Kubes (JIRA) |
[jira] Resolved: (NUTCH-606) Refactoring of Generator, run all urls through checks |
Tue, 12 Feb, 14:53 |
| Dennis Kubes (JIRA) |
[jira] Resolved: (NUTCH-605) Change deprecated configuration methods for Hadoop |
Tue, 12 Feb, 14:55 |
| Dennis Kubes (JIRA) |
[jira] Updated: (NUTCH-603) Add more default url normalizations |
Tue, 12 Feb, 15:03 |
| Dennis Kubes (JIRA) |
[jira] Commented: (NUTCH-609) Allow Plugins to be Loaded from Jar File(s) |
Tue, 12 Feb, 16:53 |
| Dennis Kubes (JIRA) |
[jira] Updated: (NUTCH-609) Allow Plugins to be Loaded from Jar File(s) |
Tue, 12 Feb, 22:24 |
| Dennis Kubes (JIRA) |
[jira] Created: (NUTCH-611) Upgrade Nutch to use Hadoop 0.16 |
Wed, 13 Feb, 02:03 |
| Dennis Kubes (JIRA) |
[jira] Updated: (NUTCH-611) Upgrade Nutch to use Hadoop 0.16 |
Wed, 13 Feb, 02:07 |
| Dennis Kubes (JIRA) |
[jira] Resolved: (NUTCH-603) Add more default url normalizations |
Thu, 14 Feb, 22:19 |
| Dennis Kubes (JIRA) |
[jira] Resolved: (NUTCH-611) Upgrade Nutch to use Hadoop 0.16 |
Thu, 14 Feb, 22:23 |
| Dennis Kubes (JIRA) |
[jira] Assigned: (NUTCH-44) too many search results |
Fri, 15 Feb, 21:20 |
| Dennis Kubes (JIRA) |
[jira] Commented: (NUTCH-44) too many search results |
Fri, 15 Feb, 21:28 |
| Dennis Kubes (JIRA) |
[jira] Commented: (NUTCH-44) too many search results |
Sat, 16 Feb, 00:26 |
| Dennis Kubes (JIRA) |
[jira] Updated: (NUTCH-44) too many search results |
Sat, 16 Feb, 00:28 |
| Dennis Kubes (JIRA) |
[jira] Resolved: (NUTCH-44) too many search results |
Mon, 18 Feb, 06:40 |
| Dennis Kubes (JIRA) |
[jira] Created: (NUTCH-613) Empty Summaries and Cached Pages |
Tue, 19 Feb, 06:28 |
| Dennis Kubes (JIRA) |
[jira] Updated: (NUTCH-613) Empty Summaries and Cached Pages |
Tue, 19 Feb, 06:32 |
| Dennis Kubes (JIRA) |
[jira] Commented: (NUTCH-613) Empty Summaries and Cached Pages |
Tue, 19 Feb, 06:58 |
| Dennis Kubes (JIRA) |
[jira] Updated: (NUTCH-614) Order Inlinks by OPIC score of parent page |
Wed, 20 Feb, 04:38 |
| Dennis Kubes (JIRA) |
[jira] Work started: (NUTCH-614) Order Inlinks by OPIC score of parent page |
Wed, 20 Feb, 04:38 |
| Dennis Kubes (JIRA) |
[jira] Created: (NUTCH-614) Order Inlinks by OPIC score of parent page |
Wed, 20 Feb, 04:38 |
| Dennis Kubes (JIRA) |
[jira] Updated: (NUTCH-614) Order Inlinks by OPIC score of parent page |
Wed, 20 Feb, 04:40 |
| Dennis Kubes (JIRA) |
[jira] Work started: (NUTCH-578) URL fetched with 403 is generated over and over again |
Mon, 25 Feb, 14:16 |
| Dennis Kubes (JIRA) |
[jira] Assigned: (NUTCH-578) URL fetched with 403 is generated over and over again |
Mon, 25 Feb, 14:16 |
| Dennis Kubes (JIRA) |
[jira] Commented: (NUTCH-578) URL fetched with 403 is generated over and over again |
Mon, 25 Feb, 15:40 |
| Dennis Kubes (JIRA) |
[jira] Updated: (NUTCH-614) Order Inlinks by OPIC score of parent page |
Tue, 26 Feb, 22:14 |
| Emmanuel Joke (JIRA) |
[jira] Commented: (NUTCH-567) Proper (?) handling of URIs in TagSoup. |
Fri, 08 Feb, 08:21 |
| Emmanuel Joke (JIRA) |
[jira] Commented: (NUTCH-596) ParseSegments parse content even if its not CrawlDatum.STATUS_FETCH_SUCCESS |
Mon, 11 Feb, 16:36 |
| Emmanuel Joke (JIRA) |
[jira] Commented: (NUTCH-567) Proper (?) handling of URIs in TagSoup. |
Sun, 24 Feb, 07:02 |
| Emmanuel Joke (JIRA) |
[jira] Commented: (NUTCH-613) Empty Summaries and Cached Pages |
Sun, 24 Feb, 07:06 |
| Emmanuel Joke (JIRA) |
[jira] Commented: (NUTCH-598) Remove deprecated use of ToolBase, Migration to the new implementation |
Sun, 24 Feb, 07:08 |
| Emmanuel Joke (JIRA) |
[jira] Updated: (NUTCH-578) URL fetched with 403 is generated over and over again |
Sun, 24 Feb, 14:55 |
| Emmanuel Joke (JIRA) |
[jira] Updated: (NUTCH-578) URL fetched with 403 is generated over and over again |
Sun, 24 Feb, 15:19 |
| Emmanuel Joke (JIRA) |
[jira] Created: (NUTCH-615) Redirected URL are fetched without setting any FetchInterval |
Tue, 26 Feb, 14:30 |
| Emmanuel Joke (JIRA) |
[jira] Updated: (NUTCH-615) Redirected URL are fetched without setting any FetchInterval |
Tue, 26 Feb, 14:30 |
| Emmanuel Joke (JIRA) |
[jira] Updated: (NUTCH-616) Reset Fetch Retry counter when fetch is successful |
Tue, 26 Feb, 17:26 |
| Emmanuel Joke (JIRA) |
[jira] Created: (NUTCH-616) Reset Fetch Retry counter when fetch is successful |
Tue, 26 Feb, 17:26 |
| Emmanuel Joke (JIRA) |
[jira] Updated: (NUTCH-615) Redirected URL are fetched without setting any FetchInterval |
Thu, 28 Feb, 14:25 |
| Erol (JIRA) |
[jira] Commented: (NUTCH-601) Recrawling on existing crawl directory using force option |
Thu, 28 Feb, 19:47 |
| Grant Ingersoll |
ApacheCon Europe BoF for Lucene/Nutch/Solr |
Sun, 03 Feb, 15:41 |
| Hudson (JIRA) |
[jira] Commented: (NUTCH-604) Upgrade Nutch to Lucene 2.3.0 |
Thu, 07 Feb, 04:15 |
| Hudson (JIRA) |
[jira] Commented: (NUTCH-602) Allow configurable number of handlers for search servers |
Fri, 08 Feb, 04:15 |
| Hudson (JIRA) |
[jira] Commented: (NUTCH-607) Update build.xml to include tika jar in war file |
Sun, 10 Feb, 05:36 |
| Hudson (JIRA) |
[jira] Commented: (NUTCH-608) Upgrade nutch to use released apache-tika-0.1-incubating |
Wed, 13 Feb, 04:14 |
| Hudson (JIRA) |
[jira] Commented: (NUTCH-605) Change deprecated configuration methods for Hadoop |
Wed, 13 Feb, 04:14 |
| Hudson (JIRA) |
[jira] Commented: (NUTCH-606) Refactoring of Generator, run all urls through checks |
Wed, 13 Feb, 04:14 |
| Hudson (JIRA) |
[jira] Commented: (NUTCH-611) Upgrade Nutch to use Hadoop 0.16 |
Fri, 15 Feb, 04:13 |
| Hudson (JIRA) |
[jira] Commented: (NUTCH-603) Add more default url normalizations |
Fri, 15 Feb, 04:13 |
| Hudson (JIRA) |
[jira] Commented: (NUTCH-44) too many search results |
Tue, 19 Feb, 16:44 |
| Hudson (JIRA) |
[jira] Commented: (NUTCH-567) Proper (?) handling of URIs in TagSoup. |
Tue, 26 Feb, 04:11 |
| Nadav Hashimshony |
problem with reading more then one urls from the DB |
Tue, 05 Feb, 15:59 |
| Nadav Hashimshony |
Cant run twice get in SegmentReader |
Wed, 06 Feb, 08:56 |
| Nigel Daley |
Re: Build failed in Hudson: Nutch-trunk #371 |
Wed, 27 Feb, 01:48 |
| Nigel Daley |
Re: Failing Hudson Builds |
Wed, 27 Feb, 08:13 |
| Nigel Daley |
Re: Failing Hudson Builds |
Wed, 27 Feb, 17:22 |
| Nynodata Development Team |
Filter fetching by mime type |
Tue, 26 Feb, 13:55 |
| Sami Siren |
Re: JIRAClient |
Thu, 07 Feb, 14:50 |
| Sami Siren |
Re: Failing Hudson Builds |
Sun, 24 Feb, 19:35 |
| Sami Siren (JIRA) |
[jira] Commented: (NUTCH-602) Allow configurable number of handlers for search servers |
Thu, 07 Feb, 20:25 |
| Sebastian Steinmetz |
Re: JIRAClient |
Thu, 07 Feb, 11:16 |
| Susam Pal |
Re: nutch latest build - inject operation failing |
Thu, 14 Feb, 14:43 |
| Susam Pal |
Re: nutch latest build - inject operation failing |
Thu, 14 Feb, 16:37 |
| Susam Pal |
Re: nutch latest build - inject operation failing |
Fri, 15 Feb, 16:37 |
| Susam Pal (JIRA) |
[jira] Created: (NUTCH-601) Recrawling on existing crawl directory using force option |
Mon, 04 Feb, 18:09 |
| Susam Pal (JIRA) |
[jira] Updated: (NUTCH-601) Recrawling on existing crawl directory using force option |
Mon, 04 Feb, 18:11 |
| Susam Pal (JIRA) |
[jira] Updated: (NUTCH-601) Recrawling on existing crawl directory using force option |
Mon, 04 Feb, 19:21 |
| Susam Pal (JIRA) |
[jira] Commented: (NUTCH-601) Recrawling on existing crawl directory using force option |
Tue, 05 Feb, 18:50 |
| Susam Pal (JIRA) |
[jira] Created: (NUTCH-612) URL filtering is always disabled in Generator when invoked by Crawl |
Fri, 15 Feb, 19:50 |
| Susam Pal (JIRA) |
[jira] Updated: (NUTCH-612) URL filtering is always disabled in Generator when invoked by Crawl |
Fri, 15 Feb, 19:54 |
| Susam Pal (JIRA) |
[jira] Updated: (NUTCH-601) Recrawling on existing crawl directory using force option |
Fri, 15 Feb, 20:32 |
| Susam Pal (JIRA) |
[jira] Updated: (NUTCH-601) Recrawling on existing crawl directory using force option |
Fri, 15 Feb, 20:58 |
| esmithers |
Re: nutch latest build - inject operation failing |
Wed, 27 Feb, 23:46 |
| lupin1979 |
Java error crawling |
Thu, 21 Feb, 10:26 |
| nadav hashimshony |
Re: read crawldb. |
Sun, 03 Feb, 08:43 |