From: Bejoy Ks
To: user@hive.apache.org
Date: Mon, 21 Feb 2011 06:53:58 -0800 (PST)
Subject: Database/Schema, INTERVAL and SQL IN usages in Hive

Hi Experts,
     I'm using Hive for a few projects, and I have found it a great tool in Hadoop for processing structured data end to end. Unfortunately, I'm facing a few challenges, as follows.

Availability of database/schemas in Hive
I have multiple projects running in Hive, each with a fairly large number of tables. With this many tables all together, things are looking a bit messy. Is there any option for creating a database/schema in Hive, so that I can keep the tables in separate databases/schemas, one per project?

Using INTERVAL
    I need to replicate in Hive a job that runs in a Teradata EDW, and I'm facing a challenge here: I am not able to identify a Hive equivalent of Teradata's INTERVAL. Here is the snippet where I'm facing the issue:
 *** where r1.seq_id = r4.seq_id and r4.mc_datetime >= (r1.rc_datetime + INTERVAL '05' HOUR)
In this query, how do I replicate the last part in Hive, i.e. (r1.rc_datetime + INTERVAL '05' HOUR), which adds 5 hours to the timestamp rc_datetime?
*The where condition is part of a very large query involving multiple table joins.


Using IN
    How do we replicate the SQL IN operator in Hive? For example:
 *** where R1.seq_id = r4.seq_id and r1.PROCCESS_PHASE IN ( 'Production', 'Stage' , 'QA', 'Development')
The last part of the query is where I'm facing the challenge: r1.PROCCESS_PHASE IN ( 'Production', 'Stage' , 'QA', 'Development')
*The where condition is part of a very large query involving multiple table joins.

Please advise.

Regards
Bejoy KS