Mailing list archives: February 2008

Re: read crawldb.
nadav hashimshony   Re: read crawldb. Sun, 03 Feb, 08:43
Grant Ingersoll ApacheCon Europe BoF for Lucene/Nutch/Solr Sun, 03 Feb, 15:41
Susam Pal (JIRA) [jira] Created: (NUTCH-601) Recrawling on existing crawl directory using force option Mon, 04 Feb, 18:09
[jira] Updated: (NUTCH-601) Recrawling on existing crawl directory using force option
Susam Pal (JIRA)   [jira] Updated: (NUTCH-601) Recrawling on existing crawl directory using force option Mon, 04 Feb, 18:11
Susam Pal (JIRA)   [jira] Updated: (NUTCH-601) Recrawling on existing crawl directory using force option Mon, 04 Feb, 19:21
Susam Pal (JIRA)   [jira] Updated: (NUTCH-601) Recrawling on existing crawl directory using force option Fri, 15 Feb, 20:32
Susam Pal (JIRA)   [jira] Updated: (NUTCH-601) Recrawling on existing crawl directory using force option Fri, 15 Feb, 20:58
[jira] Commented: (NUTCH-601) Recrawling on existing crawl directory using force option
Andrzej Bialecki (JIRA)   [jira] Commented: (NUTCH-601) Recrawling on existing crawl directory using force option Mon, 04 Feb, 18:19
Andrzej Bialecki (JIRA)   [jira] Commented: (NUTCH-601) Recrawling on existing crawl directory using force option Tue, 05 Feb, 17:20
Susam Pal (JIRA)   [jira] Commented: (NUTCH-601) Recrawling on existing crawl directory using force option Tue, 05 Feb, 18:50
Erol (JIRA)   [jira] Commented: (NUTCH-601) Recrawling on existing crawl directory using force option Thu, 28 Feb, 19:47
Nadav Hashimshony problem with reading more then one urls from the DB Tue, 05 Feb, 15:59
Dennis Kubes (JIRA) [jira] Created: (NUTCH-602) Allow configurable number of handlers for search servers Tue, 05 Feb, 16:50
Dennis Kubes (JIRA) [jira] Updated: (NUTCH-602) Allow configurable number of handlers for search servers Tue, 05 Feb, 16:54
[jira] Commented: (NUTCH-602) Allow configurable number of handlers for search servers
Andrzej Bialecki (JIRA)   [jira] Commented: (NUTCH-602) Allow configurable number of handlers for search servers Tue, 05 Feb, 16:58
Dennis Kubes (JIRA)   [jira] Commented: (NUTCH-602) Allow configurable number of handlers for search servers Thu, 07 Feb, 20:13
Sami Siren (JIRA)   [jira] Commented: (NUTCH-602) Allow configurable number of handlers for search servers Thu, 07 Feb, 20:25
Hudson (JIRA)   [jira] Commented: (NUTCH-602) Allow configurable number of handlers for search servers Fri, 08 Feb, 04:15
Dennis Kubes (JIRA) [jira] Created: (NUTCH-603) Add more default url normalizations Tue, 05 Feb, 16:58
[jira] Updated: (NUTCH-603) Add more default url normalizations
Dennis Kubes (JIRA)   [jira] Updated: (NUTCH-603) Add more default url normalizations Tue, 05 Feb, 17:04
Dennis Kubes (JIRA)   [jira] Updated: (NUTCH-603) Add more default url normalizations Tue, 12 Feb, 15:03
Andrzej Bialecki (JIRA) [jira] Created: (NUTCH-604) Upgrade Nutch to Lucene 2.3.0 Tue, 05 Feb, 22:33
Andrzej Bialecki (JIRA) [jira] Updated: (NUTCH-604) Upgrade Nutch to Lucene 2.3.0 Tue, 05 Feb, 22:35
Nadav Hashimshony Cant run twice get in SegmentReader Wed, 06 Feb, 08:56
Andrzej Bialecki (JIRA) [jira] Closed: (NUTCH-604) Upgrade Nutch to Lucene 2.3.0 Wed, 06 Feb, 12:09
Andrzej Bialecki JIRAClient Wed, 06 Feb, 12:19
Dennis Kubes   Re: JIRAClient Wed, 06 Feb, 14:05
Sebastian Steinmetz   Re: JIRAClient Thu, 07 Feb, 11:16
Sami Siren     Re: JIRAClient Thu, 07 Feb, 14:50
Andrzej Bialecki (JIRA) [jira] Closed: (NUTCH-553) Add more normalization rules to regex-normalize file. Wed, 06 Feb, 12:29
Andrzej Bialecki (JIRA) [jira] Commented: (NUTCH-553) Add more normalization rules to regex-normalize file. Wed, 06 Feb, 12:29
Andrzej Bialecki (JIRA) [jira] Closed: (NUTCH-339) Refactor nutch to allow fetcher improvements Wed, 06 Feb, 12:31
Andrzej Bialecki (JIRA) [jira] Commented: (NUTCH-339) Refactor nutch to allow fetcher improvements Wed, 06 Feb, 12:31
Andrzej Bialecki (JIRA) [jira] Closed: (NUTCH-382) Fix for NUTCH-365 introduced a bug if generate.max.per.host.by.ip is enabled Wed, 06 Feb, 12:55
Andrzej Bialecki (JIRA) [jira] Commented: (NUTCH-382) Fix for NUTCH-365 introduced a bug if generate.max.per.host.by.ip is enabled Wed, 06 Feb, 12:55
Andrzej Bialecki (JIRA) [jira] Closed: (NUTCH-551) performance for generate is often really bad Wed, 06 Feb, 16:35
Andrzej Bialecki (JIRA) [jira] Commented: (NUTCH-551) performance for generate is often really bad Wed, 06 Feb, 16:35
Andrzej Bialecki (JIRA) [jira] Closed: (NUTCH-593) Nutch crawl problem Wed, 06 Feb, 16:39
Andrzej Bialecki (JIRA) [jira] Commented: (NUTCH-593) Nutch crawl problem Wed, 06 Feb, 16:39
Hudson (JIRA) [jira] Commented: (NUTCH-604) Upgrade Nutch to Lucene 2.3.0 Thu, 07 Feb, 04:15
DS jha nutch latest build - inject operation failing Thu, 07 Feb, 06:23
Andrzej Bialecki   Re: nutch latest build - inject operation failing Thu, 07 Feb, 13:11
DS jha     Re: nutch latest build - inject operation failing Thu, 07 Feb, 14:49
Andrzej Bialecki       Re: nutch latest build - inject operation failing Thu, 07 Feb, 15:03
DS jha         Re: nutch latest build - inject operation failing Thu, 07 Feb, 15:20
Dennis Kubes           Re: nutch latest build - inject operation failing Thu, 07 Feb, 15:37
DS jha             Re: nutch latest build - inject operation failing Thu, 07 Feb, 15:48
Dennis Kubes               Re: nutch latest build - inject operation failing Thu, 07 Feb, 15:54
DS jha                 Re: nutch latest build - inject operation failing Thu, 07 Feb, 16:02
Susam Pal   Re: nutch latest build - inject operation failing Thu, 14 Feb, 14:43
Dennis Kubes     Re: nutch latest build - inject operation failing Thu, 14 Feb, 16:11
Susam Pal       Re: nutch latest build - inject operation failing Thu, 14 Feb, 16:37
Susam Pal         Re: nutch latest build - inject operation failing Fri, 15 Feb, 16:37
esmithers           Re: nutch latest build - inject operation failing Wed, 27 Feb, 23:46
Dennis Kubes Maybe doing a 0.9.1 release Thu, 07 Feb, 17:50
Andrzej Bialecki   Re: Maybe doing a 0.9.1 release Thu, 07 Feb, 18:44
Dennis Kubes     Re: Maybe doing a 0.9.1 release Thu, 07 Feb, 19:05
Dennis Kubes (JIRA) [jira] Resolved: (NUTCH-602) Allow configurable number of handlers for search servers Thu, 07 Feb, 22:27
Dennis Kubes (JIRA) [jira] Created: (NUTCH-605) Change deprecated configuration methods for Hadoop Fri, 08 Feb, 01:09
Dennis Kubes (JIRA) [jira] Updated: (NUTCH-605) Change deprecated configuration methods for Hadoop Fri, 08 Feb, 01:11
[jira] Commented: (NUTCH-567) Proper (?) handling of URIs in TagSoup.
Emmanuel Joke (JIRA)   [jira] Commented: (NUTCH-567) Proper (?) handling of URIs in TagSoup. Fri, 08 Feb, 08:21
Emmanuel Joke (JIRA)   [jira] Commented: (NUTCH-567) Proper (?) handling of URIs in TagSoup. Sun, 24 Feb, 07:02
Hudson (JIRA)   [jira] Commented: (NUTCH-567) Proper (?) handling of URIs in TagSoup. Tue, 26 Feb, 04:11
Dennis Kubes (JIRA) [jira] Created: (NUTCH-606) Refactoring of Generator, run all urls through checks Fri, 08 Feb, 22:10
[jira] Updated: (NUTCH-607) Update build.xml to include tika jar in war file
Dennis Kubes (JIRA)   [jira] Updated: (NUTCH-607) Update build.xml to include tika jar in war file Fri, 08 Feb, 22:24
Dennis Kubes (JIRA)   [jira] Updated: (NUTCH-607) Update build.xml to include tika jar in war file Fri, 08 Feb, 23:02
Dennis Kubes (JIRA) [jira] Created: (NUTCH-607) Update build.xml to include tika jar Fri, 08 Feb, 22:24
Dennis Kubes (JIRA) [jira] Assigned: (NUTCH-606) Refactoring of Generator, run all urls through checks Fri, 08 Feb, 22:58
[jira] Updated: (NUTCH-606) Refactoring of Generator, run all urls through checks
Dennis Kubes (JIRA)   [jira] Updated: (NUTCH-606) Refactoring of Generator, run all urls through checks Fri, 08 Feb, 23:00
Dennis Kubes (JIRA)   [jira] Updated: (NUTCH-606) Refactoring of Generator, run all urls through checks Fri, 08 Feb, 23:28
Dennis Kubes (JIRA)   [jira] Updated: (NUTCH-606) Refactoring of Generator, run all urls through checks Sat, 09 Feb, 05:09
Dennis Kubes (JIRA)   [jira] Updated: (NUTCH-606) Refactoring of Generator, run all urls through checks Sat, 09 Feb, 18:40
[jira] Commented: (NUTCH-606) Refactoring of Generator, run all urls through checks
Andrzej Bialecki (JIRA)   [jira] Commented: (NUTCH-606) Refactoring of Generator, run all urls through checks Sat, 09 Feb, 00:13
Andrzej Bialecki (JIRA)   [jira] Commented: (NUTCH-606) Refactoring of Generator, run all urls through checks Sat, 09 Feb, 08:45
Andrzej Bialecki (JIRA)   [jira] Commented: (NUTCH-606) Refactoring of Generator, run all urls through checks Sat, 09 Feb, 19:20
Dennis Kubes (JIRA)   [jira] Commented: (NUTCH-606) Refactoring of Generator, run all urls through checks Mon, 11 Feb, 20:14
Hudson (JIRA)   [jira] Commented: (NUTCH-606) Refactoring of Generator, run all urls through checks Wed, 13 Feb, 04:14
[jira] Commented: (NUTCH-607) Update build.xml to include tika jar in war file
Chris A. Mattmann (JIRA)   [jira] Commented: (NUTCH-607) Update build.xml to include tika jar in war file Sat, 09 Feb, 01:19
Hudson (JIRA)   [jira] Commented: (NUTCH-607) Update build.xml to include tika jar in war file Sun, 10 Feb, 05:36