Return-Path: X-Original-To: archive-asf-public-internal@cust-asf2.ponee.io Delivered-To: archive-asf-public-internal@cust-asf2.ponee.io Received: from cust-asf.ponee.io (cust-asf.ponee.io [163.172.22.183]) by cust-asf2.ponee.io (Postfix) with ESMTP id C9EA2200B84 for ; Tue, 20 Sep 2016 21:00:35 +0200 (CEST) Received: by cust-asf.ponee.io (Postfix) id C8561160AC5; Tue, 20 Sep 2016 19:00:35 +0000 (UTC) Delivered-To: archive-asf-public@cust-asf.ponee.io Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by cust-asf.ponee.io (Postfix) with SMTP id E738A160AC0 for ; Tue, 20 Sep 2016 21:00:34 +0200 (CEST) Received: (qmail 74988 invoked by uid 500); 20 Sep 2016 19:00:34 -0000 Mailing-List: contact user-help@kudu.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@kudu.apache.org Delivered-To: mailing list user@kudu.apache.org Received: (qmail 74980 invoked by uid 99); 20 Sep 2016 19:00:34 -0000 Received: from pnap-us-west-generic-nat.apache.org (HELO spamd4-us-west.apache.org) (209.188.14.142) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 20 Sep 2016 19:00:34 +0000 Received: from localhost (localhost [127.0.0.1]) by spamd4-us-west.apache.org (ASF Mail Server at spamd4-us-west.apache.org) with ESMTP id BCE50C05C0 for ; Tue, 20 Sep 2016 19:00:33 +0000 (UTC) X-Virus-Scanned: Debian amavisd-new at spamd4-us-west.apache.org X-Spam-Flag: NO X-Spam-Score: 1.279 X-Spam-Level: * X-Spam-Status: No, score=1.279 tagged_above=-999 required=6.31 tests=[DKIM_SIGNED=0.1, DKIM_VALID=-0.1, HTML_MESSAGE=2, RCVD_IN_DNSWL_LOW=-0.7, RCVD_IN_MSPIKE_H3=-0.01, RCVD_IN_MSPIKE_WL=-0.01, SPF_PASS=-0.001] autolearn=disabled Authentication-Results: spamd4-us-west.apache.org (amavisd-new); dkim=pass (2048-bit key) header.d=cloudera-com.20150623.gappssmtp.com Received: from mx1-lw-us.apache.org ([10.40.0.8]) by localhost (spamd4-us-west.apache.org [10.40.0.11]) (amavisd-new, port 10024) with ESMTP id 1lUzrN053rKK for ; Tue, 20 Sep 2016 19:00:31 +0000 (UTC) Received: from mail-wm0-f41.google.com (mail-wm0-f41.google.com [74.125.82.41]) by mx1-lw-us.apache.org (ASF Mail Server at mx1-lw-us.apache.org) with ESMTPS id 73DCA5FCE5 for ; Tue, 20 Sep 2016 19:00:31 +0000 (UTC) Received: by mail-wm0-f41.google.com with SMTP id w84so151133047wmg.1 for ; Tue, 20 Sep 2016 12:00:31 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=cloudera-com.20150623.gappssmtp.com; s=20150623; h=mime-version:in-reply-to:references:from:date:message-id:subject:to :cc; bh=aQ5b5pbapRvcq1vqF6oCJcnc/9h3KB75b5CXjHiyxw8=; b=pFlZcCCto2YvAqOsx97oAj1VIhuaU7CcFM7hAzIKIJVkVRHzBXODthEO0A5OA1KP6c jwjXx30BOOFkmwGO+UIVh9iT7OVvOtXGxEBqR6QEsgykKW3E+fspOsPmrK8oaHw3gFd1 OGgsi9QwI/rdCm7qVCfpC7GGK2fXyQGaw6byprnRnoWjnhTL4YMLHHfmA6lWLYlJcLeK TdLzkBbblxNy0MLLf+Y9v1oahBMDjlopGLyUOXa0OTqep3JDYqz0v1yf1CaZizOLPCas CpRGvMghNMcZ9TXy2CLREyh6rJ/5vKPOnZrhL/v7ET6bKX9g4ki3x3a9VOYXFQ3kSqGV sWrg== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20130820; h=x-gm-message-state:mime-version:in-reply-to:references:from:date :message-id:subject:to:cc; bh=aQ5b5pbapRvcq1vqF6oCJcnc/9h3KB75b5CXjHiyxw8=; b=Jm9Z3v0KYam8SEH2fa+0+HhnJWnmXmfXGDKI3jLQoXxaLMn37h+rqpwOOsyfd33FE7 YJgmSNkhneSfrUwGq76pwXf+GYTEte1Xk3a0Q80mMbxj6YGwMbTSGYXND7fmIjoQlsZO iJSG17xGLPHs1Dw/X7vFZoVMrel5DKfmqRlC4uXI1iCEGI5LkqrlyZaaCwD6tKATyAc8 6lcPQpwqbwmIw0/gKDb3gTilhZhkEnNUiGato6yHYkz4HHVz6t9vKEN1FE20KgmfPKVg 3P+tglilsScw7ekJF4JpNSKjyDWpWTWNInm/7Gp7DVBrDlktLkSZ6OftHn0W0zDX3wJc F3Lg== X-Gm-Message-State: AE9vXwNHWfVoOLof4lBjZRmM3Wq/r3sd4pXyLjm730XC6peXGi6tyRaFnDOruecMCGcE0AZDKu3YnMabLGwUSqr1 X-Received: by 10.28.5.133 with SMTP id 127mr4947748wmf.129.1474398024555; Tue, 20 Sep 2016 12:00:24 -0700 (PDT) MIME-Version: 1.0 Received: by 10.28.170.141 with HTTP; Tue, 20 Sep 2016 12:00:03 -0700 (PDT) In-Reply-To: <64BEE379-017C-42D6-AE4D-B9E6F5FC56C0@gmail.com> References: <64BEE379-017C-42D6-AE4D-B9E6F5FC56C0@gmail.com> From: Todd Lipcon Date: Tue, 20 Sep 2016 12:00:03 -0700 Message-ID: Subject: Re: [ANNOUNCE] Apache Kudu 1.0.0 release To: user@kudu.apache.org Cc: dev Content-Type: multipart/alternative; boundary=001a114435cc83029e053cf50a57 archived-at: Tue, 20 Sep 2016 19:00:36 -0000 --001a114435cc83029e053cf50a57 Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: quoted-printable -announce On Tue, Sep 20, 2016 at 11:34 AM, Benjamin Kim wrote: > This is awesome!!! Great!!! > > Do you know if any improvements were also made to the Spark plugin jar? > Looks like a few changes based on the git log: https://gist.github.com/4fa3ccc3b9be787227fed89c1bd42837 as well as a number of changes to the Java client (which gets pulled into the Spark jar): https://gist.github.com/e2a8ca78e51773fabb70aae34207199f In particular, I think the partition pruning work in the Java client should reduce the number of Spark partitions if you have predicates on your data frames. (though I haven't personally verified it) -Todd > On Sep 20, 2016, at 12:11 AM, Todd Lipcon wrote: > > The Apache Kudu team is happy to announce the release of Kudu 1.0.0! > > Kudu is an open source storage engine for structured data which supports > low-latency random access together with efficient analytical access > patterns. It is designed within the context of the Apache Hadoop ecosyste= m > and supports many integrations with other data analytics projects both > inside and outside of the Apache Software Foundation. > > This latest version adds several new features, including: > > - Removal of multiversion concurrency control (MVCC) history is now > supported. This allows Kudu to reclaim disk space, where previously Kudu > would keep a full history of all changes made to a given table since the > beginning of time. > > - Most of Kudu=E2=80=99s command line tools have been consolidated under = a new > top-level "kudu" tool. This reduces the number of large binaries > distributed with Kudu and also includes much-improved help output. > > - Administrative tools including "kudu cluster ksck" now support running > against multi-master Kudu clusters. > > - The C++ client API now supports writing data in AUTO_FLUSH_BACKGROUND > mode. This can provide higher throughput for ingest workloads. > > This release also includes many bug fixes, optimizations, and other > improvements, detailed in the release notes available at: > http://kudu.apache.org/releases/1.0.0/docs/release_notes.html > > Download the source release here: > http://kudu.apache.org/releases/1.0.0/ > > Convenience binary artifacts for the Java client and various Java > integrations (eg Spark, Flume) are also now available via the ASF Maven > repository. > > Enjoy the new release! > > - The Apache Kudu team > > > --=20 Todd Lipcon Software Engineer, Cloudera --001a114435cc83029e053cf50a57 Content-Type: text/html; charset=UTF-8 Content-Transfer-Encoding: quoted-printable
-announce


On Tue, Sep 20, 2016 at 11:34 AM, Benjamin Kim <b= build11@gmail.com> wrote:
This is awesome!!! Great!!!

Do you = know if any improvements were also made to the Spark plugin jar?

Looks like a few changes based on the git= log:
https://gist.github.com/4fa3ccc3b9be787227fed89c1bd42837

as well as a number of changes to the Java client (whi= ch gets pulled into the Spark jar):=C2=A0


=
In particular, I think the partition pruning work in the Java client s= hould reduce the number of Spark partitions if you have predicates on your = data frames. (though I haven't personally verified it)

-Todd



On Sep 20, 2016, at 12:11 AM, Todd = Lipcon <todd@apache= .org> wrote:

The Apache Kudu team is = happy to announce the release of Kudu 1.0.0!

Kudu is an open source = storage engine for structured data which supports low-latency random access= together with efficient analytical access patterns. It is designed within = the context of the Apache Hadoop ecosystem and supports many integrations w= ith other data analytics projects both inside and outside of the Apache Sof= tware Foundation.

This latest version adds several new features, inc= luding:

- Removal of multiversion concurrency control (MVCC) history= is now supported. This allows Kudu to reclaim disk space, where previously= Kudu would keep a full history of all changes made to a given table since = the beginning of time.

- Most of Kudu=E2=80=99s command line tools h= ave been consolidated under a new top-level "kudu" tool. This red= uces the number of large binaries distributed with Kudu and also includes m= uch-improved help output.

- Administrative tools including "kud= u cluster ksck" now support running against multi-master Kudu clusters= .

- The C++ client API now supports writing data in AUTO_FLUSH_BACKG= ROUND mode. This can provide higher throughput for ingest workloads.
This release also includes many bug fixes, optimizations, and other improv= ements, detailed in the release notes available at:
htt= p://kudu.apache.org/releases/1.0.0/docs/release_notes.html
Download the source release here:
http://kudu.apache.org/releases/1.= 0.0/

Convenience binary artifacts for the Java clien= t and various Java integrations (eg Spark, Flume) are also now available vi= a the ASF Maven repository.

Enjoy the new release!
<= div>
- The Apache Kudu team



--
Tod= d Lipcon
Software Engineer, Cloudera
--001a114435cc83029e053cf50a57--