Return-Path: Delivered-To: apmail-hadoop-hive-dev-archive@minotaur.apache.org Received: (qmail 16375 invoked from network); 31 Dec 2009 19:03:56 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.3) by minotaur.apache.org with SMTP; 31 Dec 2009 19:03:56 -0000 Received: (qmail 11062 invoked by uid 500); 31 Dec 2009 19:03:56 -0000 Delivered-To: apmail-hadoop-hive-dev-archive@hadoop.apache.org Received: (qmail 11000 invoked by uid 500); 31 Dec 2009 19:03:56 -0000 Mailing-List: contact hive-dev-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: hive-dev@hadoop.apache.org Delivered-To: mailing list hive-dev@hadoop.apache.org Received: (qmail 10990 invoked by uid 99); 31 Dec 2009 19:03:56 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 31 Dec 2009 19:03:56 +0000 X-ASF-Spam-Status: No, hits=-10.5 required=5.0 tests=AWL,BAYES_00,RCVD_IN_DNSWL_HI X-Spam-Check-By: apache.org Received: from [140.211.11.140] (HELO brutus.apache.org) (140.211.11.140) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 31 Dec 2009 19:03:49 +0000 Received: from brutus.apache.org (localhost [127.0.0.1]) by brutus.apache.org (Postfix) with ESMTP id 63906234C045 for ; Thu, 31 Dec 2009 11:03:29 -0800 (PST) Message-ID: <1609387958.1262286209392.JavaMail.jira@brutus.apache.org> Date: Thu, 31 Dec 2009 19:03:29 +0000 (UTC) From: "Namit Jain (JIRA)" To: hive-dev@hadoop.apache.org Subject: [jira] Created: (HIVE-1023) typedbytes: datatypes should be derived from data MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 typedbytes: datatypes should be derived from data ------------------------------------------------- Key: HIVE-1023 URL: https://issues.apache.org/jira/browse/HIVE-1023 Project: Hadoop Hive Issue Type: Improvement Components: Query Processor Reporter: Namit Jain Assignee: Namit Jain FROM ( FROM src SELECT TRANSFORM(src.key, src.value) ROW FORMAT SERDE 'org.apache.hadoop.hive.contrib.serde2.TypedBytesSerDe' RECORDWRITER 'org.apache.hadoop.hive.contrib.util.typedbytes.TypedBytesRecordWriter' USING '/bin/cat' AS (tkey, tvalue) ROW FORMAT SERDE 'org.apache.hadoop.hive.contrib.serde2.TypedBytesSerDe' RECORDREADER 'org.apache.hadoop.hive.contrib.util.typedbytes.TypedBytesRecordReader' ) tmap INSERT OVERWRITE TABLE dest1 SELECT tkey, tvalue; The output is interpreted as a string - however, it is assumed that the script is retuning string data. It would be useful if the reader and the deserializer can be decoupled. The record reader (TypedBytesRecordReader) will read the typed data (independent of the output schema) and then convert it according to the output schema. -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.