Return-Path: X-Original-To: apmail-hive-dev-archive@www.apache.org Delivered-To: apmail-hive-dev-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id B315818A33 for ; Tue, 17 Nov 2015 21:58:12 +0000 (UTC) Received: (qmail 72257 invoked by uid 500); 17 Nov 2015 21:58:12 -0000 Delivered-To: apmail-hive-dev-archive@hive.apache.org Received: (qmail 72185 invoked by uid 500); 17 Nov 2015 21:58:12 -0000 Mailing-List: contact dev-help@hive.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@hive.apache.org Delivered-To: mailing list dev@hive.apache.org Received: (qmail 72169 invoked by uid 99); 17 Nov 2015 21:58:12 -0000 Received: from arcas.apache.org (HELO arcas) (140.211.11.28) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 17 Nov 2015 21:58:12 +0000 Received: from arcas.apache.org (localhost [127.0.0.1]) by arcas (Postfix) with ESMTP id 021EF2C0453 for ; Tue, 17 Nov 2015 21:58:11 +0000 (UTC) Date: Tue, 17 Nov 2015 21:58:11 +0000 (UTC) From: "Alan Gates (JIRA)" To: dev@hive.apache.org Message-ID: In-Reply-To: References: Subject: [jira] [Created] (HIVE-12443) Hive Streaming should expose encoding and serdes for testing MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 Alan Gates created HIVE-12443: --------------------------------- Summary: Hive Streaming should expose encoding and serdes for testing Key: HIVE-12443 URL: https://issues.apache.org/jira/browse/HIVE-12443 Project: Hive Issue Type: Improvement Components: Testing Infrastructure, Transactions Affects Versions: 2.0.0 Reporter: Alan Gates Assignee: Alan Gates Currently how records are passed into the hive streaming RecordWriter are converted from the inbound format to Hive format is opaque. The encoding and writing are done in a single call to RecordWriter.write(). This is problematic for test tools that want to intercept the record stream and write it to a benchmark in addition to Hive. All existing RecordWriters have an encode and getSerDe methods. I propose to expose these by making them public in AbstractRecordWriter, and making AbstractRecordWriter a public class (it is currently package private). This keeps the RecordWriter interface clean (stream writers will not need to directly call these methods) and avoids any backwards incompatible changes. Having AbstractRecordWriter public is also desirable for anyone who wants to write their own RecordWriter. -- This message was sent by Atlassian JIRA (v6.3.4#6332)