Return-Path: X-Original-To: apmail-hive-user-archive@www.apache.org Delivered-To: apmail-hive-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 7EF8FD58B for ; Sat, 14 Jul 2012 15:19:02 +0000 (UTC) Received: (qmail 80034 invoked by uid 500); 14 Jul 2012 15:19:01 -0000 Delivered-To: apmail-hive-user-archive@hive.apache.org Received: (qmail 79405 invoked by uid 500); 14 Jul 2012 15:18:54 -0000 Mailing-List: contact user-help@hive.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@hive.apache.org Delivered-To: mailing list user@hive.apache.org Received: (qmail 79353 invoked by uid 99); 14 Jul 2012 15:18:52 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Sat, 14 Jul 2012 15:18:52 +0000 X-ASF-Spam-Status: No, hits=-0.7 required=5.0 tests=FSL_RCVD_USER,RCVD_IN_DNSWL_LOW,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: domain of edlinuxguru@gmail.com designates 209.85.213.176 as permitted sender) Received: from [209.85.213.176] (HELO mail-yx0-f176.google.com) (209.85.213.176) by apache.org (qpsmtpd/0.29) with ESMTP; Sat, 14 Jul 2012 15:18:46 +0000 Received: by yenl5 with SMTP id l5so5056028yen.35 for ; Sat, 14 Jul 2012 08:18:25 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:date:message-id:subject:from:to:content-type; bh=Dm0I1uH4ZsJxXfvj0gZ76BhAhHJN20jHVLwPI41mxMQ=; b=ad6AZeXA5v6DSpGo73D6OT2l+1Pku/OtOG64PGFm5QCXniIJOQY3RdcTvOT/T2gEZH 2LYgjyJCnnEk/pfNadZmRNuA4+/EIz7oSCCKord0zJCKSfdpE/B6nhIgoSjJwmaUHx+w aXK3ATsTOAesk4OwWZk4RMrhbGkGSY02fLcwJsfIvPfcY7BNpFbw9Lm4wwH0s2EB9dji vdEhbcAT2o+NwePyLTybxUrWwtHJUjs/r9MBtvAGpPH36jQf8vDJxt0gUfCM0lJWHxyU Rh457jx99NVLDK+5x9n2BFOGZI3VpjRpGyc88jT5MHR2RWRtQcz+QvU4hhIPE6pA0OcB 9YnQ== MIME-Version: 1.0 Received: by 10.50.188.131 with SMTP id ga3mr1673754igc.54.1342279105235; Sat, 14 Jul 2012 08:18:25 -0700 (PDT) Received: by 10.64.25.162 with HTTP; Sat, 14 Jul 2012 08:18:25 -0700 (PDT) Date: Sat, 14 Jul 2012 11:18:25 -0400 Message-ID: Subject: [ANN] Hive-protobuf support From: Edward Capriolo To: user@hive.apache.org Content-Type: text/plain; charset=ISO-8859-1 Hello all, My employer, m6d.com, has given the thumbs up to open source our latest hive tool, hive-protobuf. We created this because we work with protobuf formats often and wanted to be able to directly log an query this types without writing one-off User Defined Functions or Input Formats. https://github.com/edwardcapriolo/hive-protobuf Hive-protobuf is much like the new avro support and the already existing thrift support. Here is how it works: if you have a sequence file with a serialized protobuf in the key and a serialized protobuf in the value, a table can be created that describes the data to hive. The table needs only be configured with the protobuf generated class name for the key and value and it turns the nested classes into nested structs. We eventually will migrate the project into core hive but we want to let it incubate in github for a time. (For example there is no support for union types at the moment, maybe other kinks or tunes). Please checkout the project and send pull requests if you have patches. Thank you, Edward