| nadav hashimshony |
Re: read crawldb. |
Sun, 03 Feb, 08:43 |
| Grant Ingersoll |
ApacheCon Europe BoF for Lucene/Nutch/Solr |
Sun, 03 Feb, 15:41 |
| Susam Pal (JIRA) |
[jira] Created: (NUTCH-601) Recrawling on existing crawl directory using force option |
Mon, 04 Feb, 18:09 |
| Susam Pal (JIRA) |
[jira] Updated: (NUTCH-601) Recrawling on existing crawl directory using force option |
Mon, 04 Feb, 18:11 |
| Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-601) Recrawling on existing crawl directory using force option |
Mon, 04 Feb, 18:19 |
| Susam Pal (JIRA) |
[jira] Updated: (NUTCH-601) Recrawling on existing crawl directory using force option |
Mon, 04 Feb, 19:21 |
| Nadav Hashimshony |
problem with reading more then one urls from the DB |
Tue, 05 Feb, 15:59 |
| Dennis Kubes (JIRA) |
[jira] Created: (NUTCH-602) Allow configurable number of handlers for search servers |
Tue, 05 Feb, 16:50 |
| Dennis Kubes (JIRA) |
[jira] Updated: (NUTCH-602) Allow configurable number of handlers for search servers |
Tue, 05 Feb, 16:54 |
| Dennis Kubes (JIRA) |
[jira] Created: (NUTCH-603) Add more default url normalizations |
Tue, 05 Feb, 16:58 |
| Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-602) Allow configurable number of handlers for search servers |
Tue, 05 Feb, 16:58 |
| Dennis Kubes (JIRA) |
[jira] Updated: (NUTCH-603) Add more default url normalizations |
Tue, 05 Feb, 17:04 |
| Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-601) Recrawling on existing crawl directory using force option |
Tue, 05 Feb, 17:20 |
| Susam Pal (JIRA) |
[jira] Commented: (NUTCH-601) Recrawling on existing crawl directory using force option |
Tue, 05 Feb, 18:50 |
| Andrzej Bialecki (JIRA) |
[jira] Created: (NUTCH-604) Upgrade Nutch to Lucene 2.3.0 |
Tue, 05 Feb, 22:33 |
| Andrzej Bialecki (JIRA) |
[jira] Updated: (NUTCH-604) Upgrade Nutch to Lucene 2.3.0 |
Tue, 05 Feb, 22:35 |
| Nadav Hashimshony |
Cant run twice get in SegmentReader |
Wed, 06 Feb, 08:56 |
| Andrzej Bialecki (JIRA) |
[jira] Closed: (NUTCH-604) Upgrade Nutch to Lucene 2.3.0 |
Wed, 06 Feb, 12:09 |
| Andrzej Bialecki |
JIRAClient |
Wed, 06 Feb, 12:19 |
| Andrzej Bialecki (JIRA) |
[jira] Closed: (NUTCH-553) Add more normalization rules to regex-normalize file. |
Wed, 06 Feb, 12:29 |
| Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-553) Add more normalization rules to regex-normalize file. |
Wed, 06 Feb, 12:29 |
| Andrzej Bialecki (JIRA) |
[jira] Closed: (NUTCH-339) Refactor nutch to allow fetcher improvements |
Wed, 06 Feb, 12:31 |
| Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-339) Refactor nutch to allow fetcher improvements |
Wed, 06 Feb, 12:31 |
| Andrzej Bialecki (JIRA) |
[jira] Closed: (NUTCH-382) Fix for NUTCH-365 introduced a bug if generate.max.per.host.by.ip is enabled |
Wed, 06 Feb, 12:55 |
| Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-382) Fix for NUTCH-365 introduced a bug if generate.max.per.host.by.ip is enabled |
Wed, 06 Feb, 12:55 |
| Dennis Kubes |
Re: JIRAClient |
Wed, 06 Feb, 14:05 |
| Andrzej Bialecki (JIRA) |
[jira] Closed: (NUTCH-551) performance for generate is often really bad |
Wed, 06 Feb, 16:35 |
| Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-551) performance for generate is often really bad |
Wed, 06 Feb, 16:35 |
| Andrzej Bialecki (JIRA) |
[jira] Closed: (NUTCH-593) Nutch crawl problem |
Wed, 06 Feb, 16:39 |
| Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-593) Nutch crawl problem |
Wed, 06 Feb, 16:39 |
| Hudson (JIRA) |
[jira] Commented: (NUTCH-604) Upgrade Nutch to Lucene 2.3.0 |
Thu, 07 Feb, 04:15 |
| DS jha |
nutch latest build - inject operation failing |
Thu, 07 Feb, 06:23 |
| Sebastian Steinmetz |
Re: JIRAClient |
Thu, 07 Feb, 11:16 |
| Andrzej Bialecki |
Re: nutch latest build - inject operation failing |
Thu, 07 Feb, 13:11 |
| DS jha |
Re: nutch latest build - inject operation failing |
Thu, 07 Feb, 14:49 |
| Sami Siren |
Re: JIRAClient |
Thu, 07 Feb, 14:50 |
| Andrzej Bialecki |
Re: nutch latest build - inject operation failing |
Thu, 07 Feb, 15:03 |
| DS jha |
Re: nutch latest build - inject operation failing |
Thu, 07 Feb, 15:20 |
| Dennis Kubes |
Re: nutch latest build - inject operation failing |
Thu, 07 Feb, 15:37 |
| DS jha |
Re: nutch latest build - inject operation failing |
Thu, 07 Feb, 15:48 |
| Dennis Kubes |
Re: nutch latest build - inject operation failing |
Thu, 07 Feb, 15:54 |
| DS jha |
Re: nutch latest build - inject operation failing |
Thu, 07 Feb, 16:02 |
| Dennis Kubes |
Maybe doing a 0.9.1 release |
Thu, 07 Feb, 17:50 |
| Andrzej Bialecki |
Re: Maybe doing a 0.9.1 release |
Thu, 07 Feb, 18:44 |
| Dennis Kubes |
Re: Maybe doing a 0.9.1 release |
Thu, 07 Feb, 19:05 |
| Dennis Kubes (JIRA) |
[jira] Commented: (NUTCH-602) Allow configurable number of handlers for search servers |
Thu, 07 Feb, 20:13 |
| Sami Siren (JIRA) |
[jira] Commented: (NUTCH-602) Allow configurable number of handlers for search servers |
Thu, 07 Feb, 20:25 |
| Dennis Kubes (JIRA) |
[jira] Resolved: (NUTCH-602) Allow configurable number of handlers for search servers |
Thu, 07 Feb, 22:27 |
| Dennis Kubes (JIRA) |
[jira] Created: (NUTCH-605) Change deprecated configuration methods for Hadoop |
Fri, 08 Feb, 01:09 |
| Dennis Kubes (JIRA) |
[jira] Updated: (NUTCH-605) Change deprecated configuration methods for Hadoop |
Fri, 08 Feb, 01:11 |
| Hudson (JIRA) |
[jira] Commented: (NUTCH-602) Allow configurable number of handlers for search servers |
Fri, 08 Feb, 04:15 |
| Emmanuel Joke (JIRA) |
[jira] Commented: (NUTCH-567) Proper (?) handling of URIs in TagSoup. |
Fri, 08 Feb, 08:21 |
| Dennis Kubes (JIRA) |
[jira] Created: (NUTCH-606) Refactoring of Generator, run all urls through checks |
Fri, 08 Feb, 22:10 |
| Dennis Kubes (JIRA) |
[jira] Updated: (NUTCH-607) Update build.xml to include tika jar in war file |
Fri, 08 Feb, 22:24 |
| Dennis Kubes (JIRA) |
[jira] Created: (NUTCH-607) Update build.xml to include tika jar |
Fri, 08 Feb, 22:24 |
| Dennis Kubes (JIRA) |
[jira] Assigned: (NUTCH-606) Refactoring of Generator, run all urls through checks |
Fri, 08 Feb, 22:58 |
| Dennis Kubes (JIRA) |
[jira] Updated: (NUTCH-606) Refactoring of Generator, run all urls through checks |
Fri, 08 Feb, 23:00 |
| Dennis Kubes (JIRA) |
[jira] Updated: (NUTCH-607) Update build.xml to include tika jar in war file |
Fri, 08 Feb, 23:02 |
| Dennis Kubes (JIRA) |
[jira] Updated: (NUTCH-606) Refactoring of Generator, run all urls through checks |
Fri, 08 Feb, 23:28 |
| Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-606) Refactoring of Generator, run all urls through checks |
Sat, 09 Feb, 00:13 |
| Chris A. Mattmann (JIRA) |
[jira] Commented: (NUTCH-607) Update build.xml to include tika jar in war file |
Sat, 09 Feb, 01:19 |
| Chris A. Mattmann (JIRA) |
[jira] Created: (NUTCH-608) Upgrade nutch to use released apache-tika-0.1-incubating |
Sat, 09 Feb, 01:48 |
| Dennis Kubes (JIRA) |
[jira] Updated: (NUTCH-606) Refactoring of Generator, run all urls through checks |
Sat, 09 Feb, 05:09 |
| Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-606) Refactoring of Generator, run all urls through checks |
Sat, 09 Feb, 08:45 |
| Dennis Kubes (JIRA) |
[jira] Updated: (NUTCH-606) Refactoring of Generator, run all urls through checks |
Sat, 09 Feb, 18:40 |
| Dennis Kubes (JIRA) |
[jira] Resolved: (NUTCH-607) Update build.xml to include tika jar in war file |
Sat, 09 Feb, 18:43 |
| Dennis Kubes (JIRA) |
[jira] Created: (NUTCH-609) Allow Plugins to be Loaded from Jar File(s) |
Sat, 09 Feb, 18:56 |
| Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-606) Refactoring of Generator, run all urls through checks |
Sat, 09 Feb, 19:20 |
| Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-609) Allow Plugins to be Loaded from Jar File(s) |
Sat, 09 Feb, 19:23 |
| Dennis Kubes (JIRA) |
[jira] Commented: (NUTCH-609) Allow Plugins to be Loaded from Jar File(s) |
Sat, 09 Feb, 19:55 |
| Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-609) Allow Plugins to be Loaded from Jar File(s) |
Sat, 09 Feb, 20:05 |
| Chris A. Mattmann (JIRA) |
[jira] Work started: (NUTCH-608) Upgrade nutch to use released apache-tika-0.1-incubating |
Sun, 10 Feb, 00:04 |
| Chris A. Mattmann (JIRA) |
[jira] Commented: (NUTCH-608) Upgrade nutch to use released apache-tika-0.1-incubating |
Sun, 10 Feb, 00:04 |
| Chris A. Mattmann (JIRA) |
[jira] Commented: (NUTCH-608) Upgrade nutch to use released apache-tika-0.1-incubating |
Sun, 10 Feb, 02:38 |
| Dennis Kubes (JIRA) |
[jira] Commented: (NUTCH-609) Allow Plugins to be Loaded from Jar File(s) |
Sun, 10 Feb, 04:42 |
| Chris A. Mattmann (JIRA) |
[jira] Commented: (NUTCH-609) Allow Plugins to be Loaded from Jar File(s) |
Sun, 10 Feb, 04:58 |
| Hudson (JIRA) |
[jira] Commented: (NUTCH-607) Update build.xml to include tika jar in war file |
Sun, 10 Feb, 05:36 |
| Chris A. Mattmann (JIRA) |
[jira] Updated: (NUTCH-608) Upgrade nutch to use released apache-tika-0.1-incubating |
Sun, 10 Feb, 18:09 |
| Chris A. Mattmann (JIRA) |
[jira] Updated: (NUTCH-608) Upgrade nutch to use released apache-tika-0.1-incubating |
Sun, 10 Feb, 18:09 |
| Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-608) Upgrade nutch to use released apache-tika-0.1-incubating |
Mon, 11 Feb, 15:36 |
| Emmanuel Joke (JIRA) |
[jira] Commented: (NUTCH-596) ParseSegments parse content even if its not CrawlDatum.STATUS_FETCH_SUCCESS |
Mon, 11 Feb, 16:36 |
| Chris A. Mattmann (JIRA) |
[jira] Updated: (NUTCH-608) Upgrade nutch to use released apache-tika-0.1-incubating |
Mon, 11 Feb, 16:48 |
| Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-596) ParseSegments parse content even if its not CrawlDatum.STATUS_FETCH_SUCCESS |
Mon, 11 Feb, 16:52 |
| Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-608) Upgrade nutch to use released apache-tika-0.1-incubating |
Mon, 11 Feb, 17:06 |
| Chris A. Mattmann (JIRA) |
[jira] Updated: (NUTCH-608) Upgrade nutch to use released apache-tika-0.1-incubating |
Mon, 11 Feb, 17:40 |
| Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-608) Upgrade nutch to use released apache-tika-0.1-incubating |
Mon, 11 Feb, 17:58 |
| Ciminera Frederic (JIRA) |
[jira] Created: (NUTCH-610) Can't Update or modify an index while web gui is running |
Mon, 11 Feb, 18:12 |
| Ciminera Frederic (JIRA) |
[jira] Updated: (NUTCH-610) Can't Update or modify an index while web gui is running |
Mon, 11 Feb, 18:20 |
| Ciminera Frederic (JIRA) |
[jira] Updated: (NUTCH-610) Can't Update or modify an index while web gui is running |
Mon, 11 Feb, 18:34 |
| Ciminera Frederic (JIRA) |
[jira] Updated: (NUTCH-610) Can't Update or modify an index while web gui is running |
Mon, 11 Feb, 18:36 |
| Chris A. Mattmann (JIRA) |
[jira] Updated: (NUTCH-608) Upgrade nutch to use released apache-tika-0.1-incubating |
Mon, 11 Feb, 18:40 |
| Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-608) Upgrade nutch to use released apache-tika-0.1-incubating |
Mon, 11 Feb, 18:56 |
| Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-610) Can't Update or modify an index while web gui is running |
Mon, 11 Feb, 19:01 |
| Chris A. Mattmann (JIRA) |
[jira] Updated: (NUTCH-608) Upgrade nutch to use released apache-tika-0.1-incubating |
Mon, 11 Feb, 19:07 |
| Dennis Kubes (JIRA) |
[jira] Commented: (NUTCH-606) Refactoring of Generator, run all urls through checks |
Mon, 11 Feb, 20:14 |
| Dennis Kubes (JIRA) |
[jira] Commented: (NUTCH-605) Change deprecated configuration methods for Hadoop |
Mon, 11 Feb, 20:14 |
| Dennis Kubes (JIRA) |
[jira] Commented: (NUTCH-603) Add more default url normalizations |
Mon, 11 Feb, 20:16 |
| Dennis Kubes (JIRA) |
[jira] Commented: (NUTCH-608) Upgrade nutch to use released apache-tika-0.1-incubating |
Mon, 11 Feb, 21:38 |
| Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-605) Change deprecated configuration methods for Hadoop |
Mon, 11 Feb, 22:49 |
| Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-603) Add more default url normalizations |
Mon, 11 Feb, 22:57 |