Return-Path: X-Original-To: apmail-tajo-dev-archive@minotaur.apache.org Delivered-To: apmail-tajo-dev-archive@minotaur.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id C377517A33 for ; Sun, 22 Mar 2015 04:51:26 +0000 (UTC) Received: (qmail 30221 invoked by uid 500); 22 Mar 2015 04:51:26 -0000 Delivered-To: apmail-tajo-dev-archive@tajo.apache.org Received: (qmail 30181 invoked by uid 500); 22 Mar 2015 04:51:26 -0000 Mailing-List: contact dev-help@tajo.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@tajo.apache.org Delivered-To: mailing list dev@tajo.apache.org Received: (qmail 30171 invoked by uid 99); 22 Mar 2015 04:51:26 -0000 Received: from mail-relay.apache.org (HELO mail-relay.apache.org) (140.211.11.15) by apache.org (qpsmtpd/0.29) with ESMTP; Sun, 22 Mar 2015 04:51:26 +0000 Received: from mail-ig0-f176.google.com (mail-ig0-f176.google.com [209.85.213.176]) by mail-relay.apache.org (ASF Mail Server at mail-relay.apache.org) with ESMTPSA id 64FBD1A0031 for ; Sun, 22 Mar 2015 04:51:26 +0000 (UTC) Received: by igcau2 with SMTP id au2so18183369igc.0 for ; Sat, 21 Mar 2015 21:51:25 -0700 (PDT) MIME-Version: 1.0 X-Received: by 10.107.18.38 with SMTP id a38mr21337255ioj.67.1426999885451; Sat, 21 Mar 2015 21:51:25 -0700 (PDT) Received: by 10.43.101.70 with HTTP; Sat, 21 Mar 2015 21:51:25 -0700 (PDT) In-Reply-To: References: Date: Sat, 21 Mar 2015 21:51:25 -0700 Message-ID: Subject: Re: [DISCUSSION] Hi-Speed Tajo: Fastest Hadoop DW From: Hyunsik Choi To: "dev@tajo.apache.org" Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: quoted-printable TPC-DS is just a benchmark, and it is not test sets. For just test cases, why should we aim at TPC-DS? They are not proper as well as are overkill for this purpose. > For the TPC-DS, Tajo PMC doesn't make any committment in Roadmap already. > TPC-DS does not mean anything to TAJO roadmap. They are NOT TRUE. As you already know, WITH, IN/EXIST subquery, and scalar subquery are already on our roadmap. If they are supported, most of TPC-DS queries will work naturally. I'm concerning with your intension, saying we do not consider TPC-DS. On Sat, Mar 21, 2015 at 4:38 PM, Dongjoon Hyun wrote: > Hyunsik, TPC-DS is not important. It's just a way to see Tajo Maturity as= a > SQL engine. TPC-DS does not mean anything to TAJO roadmap. > > More specifically, the importance is the same with TPC-H, however Tajo > handled TPC-H well now, doesn't it? So, we need just another testsuite to > see how Tajo handle SQL(including WITH/IN/EXISTS) clause well. I think yo= u > agree that in TAJO-1410. > > TPC defines > - TPC-H: TPC-H is an ad-hoc, decision support benchmark. > - TPC-DS: TPC Benchmark=E2=84=A2DS (TPC-DS): The New Decision Support Ben= chmark > Standard > > In these days, TPC-DS is replacing the ancient TPC-H from real site > customers. > The main reason is based on the lack of TPC-H. TPC-H became a basic level= . > > Could you tell me if you have some other testcases in mind, except TPC-DS= ? > > Dongjoon. > > On Sun, Mar 22, 2015 at 3:58 AM, Hyunsik Choi wrote: > >> Could you tell me why you think TPC-DS is important? >> >> On Saturday, March 21, 2015, Dongjoon Hyun wrote: >> >> > Oh, that was an expression of my regret about my weak pull-requests. >> There >> > exists no offence. It's a duty. I hope you didn't waste your time much >> due >> > to my stupid learning cost. >> > >> > After Hyunsik's direct advice, I have studied Tajo Jira more and >> understand >> > what you mean slightly more. Now I understand many speedup issues, e.g= . >> > offheap-based scanner, zero-copy physical layer processing. For newbie= s >> > like me, we need more men like you. >> > >> > For the TPC-DS, Tajo PMC doesn't make any committment in Roadmap >> already. I >> > respect the current roadmap. You need not worry about that. It's my >> > personal hope as I mentioned before in this mailing list. Here is more >> > recent another report about TPC-DS. (It's consistent with the report >> shared >> > by Hyunsik.) >> > >> > >> > >> https://developer.ibm.com/hadoop/blog/2014/12/02/big-sql-3-0-hadoop-ds-b= enchmark-performance-isnt-everything/ >> > >> > >> > Thank you all. >> > >> > Warmly, >> > Dongjoon. >> > >>