Mailing-List: contact dev-help@hawq.incubator.apache.org; run by ezmlm
Precedence: bulk
Reply-To: dev@hawq.incubator.apache.org
MIME-Version: 1.0
References: <038744F5-CBEB-41FC-9E5B-2CFEC360498E@me.com>
 <96f0db480b8a4916bd1de784ef5e7ff7@BMPRDEXC142.IGI.IG.LOCAL>
 <895de143c82348318ed349d06962b6a0@BMPRDEXC142.IGI.IG.LOCAL>
 <68ce1a339f65492fa0a766b080414e18@BMPRDEXC142.IGI.IG.LOCAL>
 <672AB305-4ADD-428B-936A-AC654C2C9F29@ig.com>
From: Jon Roberts
Date: Thu, 27 Apr 2017 09:01:19 -0500
Subject: Re: PXF JDBC plugin
To: dev@hawq.incubator.apache.org
Content-Type: multipart/alternative; boundary=94eb2c055f4e3211e3054e2664e9
archived-at: Thu, 27 Apr 2017 14:01:28 -0000

--94eb2c055f4e3211e3054e2664e9
Content-Type: text/plain; charset=UTF-8
Content-Transfer-Encoding: quoted-printable

https://issues.apache.org/jira/browse/HAWQ-1445

Jon Roberts

On Tue, Apr 25, 2017 at 6:31 AM, Jon Roberts wrote:

> No I did not. My post was intended to start a discussion about the plugin
> and I thought the original contributor would chime in. But I can make the
> tickets now if you think that is the best way to move forward.
>
> Jon Roberts
> Principal Engineer | jroberts@pivotal.io | 615-426-8661
>
> On Tue, Apr 25, 2017 at 2:58 AM, Michael Pearce wrote:
>
>> Hi Jon,
>>
>> Did you make JIRA issues for the two improvements you suggested to be
>> made?
>>
>> Cheers
>> Mike
>>
>> On 04/04/2017, 04:15, "Michael Pearce" wrote:
>>
>> Hi Jon
>>
>> I think issues 1 and 2 are valid improvements that can be made; add
>> some JIRAs for these. Looking at issue 1, it seems quite trivial for
>> someone to contribute the solution for.
>>
>> On the question front:
>>
>> 1) From my understanding and from using this plugin so far, this is
>> what partitioning by interval is for: it lets you control the number of
>> partitions that would be generated. This is why the sample in the
>> document has a 1 year interval, so it only creates 2 fragments. You can
>> also partition by an enum field; here the number of partitions is the
>> number of enums.
>>
>> 2) I haven't checked specifically; that said, I haven't run into an
>> issue when trialling this myself.
>> Obviously this is contributed by Devin, so he will be the definitive
>> source here. Though I guess we can always specifically test these
>> scenarios.
>>
>> Sent using OWA for iPhone
>> ________________________________________
>> From: Jon Roberts
>> Sent: Monday, April 3, 2017 11:57:59 PM
>> To: dev@hawq.incubator.apache.org
>> Subject: Re: PXF JDBC plugin
>>
>> https://github.com/apache/incubator-hawq/tree/master/pxf/pxf-jdbc
>>
>> Issue 1: Security
>> The example has the username and password in the connection string.
>>
>> LOCATION ('pxf://localhost:51200/demodb.myclass'
>> '?PROFILE=JDBC'
>> '&JDBC_DRIVER=com.mysql.jdbc.Driver'
>> '&DB_URL=jdbc:mysql://192.168.200.6:3306/demodb&USER=root&PASS=root'
>> )
>>
>> Any chance we can get this changed to a connection profile that points
>> to a file outside of the database? For Greenplum database and S3, the
>> LOCATION syntax includes "config=/path/to/config_file". The config_file
>> contains the S3 credentials. This seems like a good pattern to follow.
>>
>> As it is right now, anyone that can connect to the database will be
>> able to see the username and password of the JDBC connection.
>>
>> Issue 2: Extra Properties
>>
>> Some JDBC drivers will need many additional properties beyond the URL,
>> and this requires setting them with a put to a Properties variable. An
>> example of this is Oracle's defaultRowPrefetch property, which needs to
>> be updated from the default of 10, which is designed for OLTP, to
>> something larger like 2000, which is more ideal for data extracts.
>>
>> Additionally, you will need the ability to set the isolation level,
>> which is done with setTransactionIsolation on the Connection. I don't
>> believe you can set this on the connection URL either. Many SQL Server
>> and DB2 databases still don't use snapshot isolation and use dirty
>> reads instead.
>> Without a dirty read here, your query will block modifications to the
>> table with a "blocking lock".
>>
>> So the configuration file will need both an extra properties variable
>> that is delimited, so you can set multiple properties, and an isolation
>> level indicator.
>>
>> Questions.
>> 1. How do you manage the maximum number of pxf instances? For example,
>> if you partition like this:
>> "PARTITION_BY=cdate:date&RANGE=2008-01-01:2010-01-01&INTERVAL=1:day",
>> will you create 730 pxf instances and thus 730 concurrent queries to
>> the source database?
>> 2. Have you tested special characters like null, escape, carriage
>> return, newline, and your delimiter? Maybe that is handled
>> automatically by pxfwritable_import so this isn't an issue.
>>
>>
>>
>> Jon Roberts
>> Principal Engineer | jroberts@pivotal.io | 615-426-8661
>>
>> On Mon, Apr 3, 2017 at 3:18 PM, Michael Pearce wrote:
>>
>> > I mean the readme file :) at that location.
>> > ________________________________________
>> > From: Michael Pearce
>> > Sent: Monday, April 3, 2017 9:15:48 PM
>> > To: dev@hawq.incubator.apache.org
>> > Subject: Re: PXF JDBC plugin
>> >
>> > On master there is the document that was added.
>> >
>> > https://github.com/apache/incubator-hawq/tree/master/pxf/pxf-jdbc
>> >
>> > Reading this I think answers some of the questions.
>> >
>> >
>> > Sent using OWA for iPhone
>> > ________________________________________
>> > From: Vineet Goel
>> > Sent: Monday, April 3, 2017 6:33:07 PM
>> > To: dev@hawq.incubator.apache.org
>> > Subject: Re: PXF JDBC plugin
>> >
>> > Devin and others,
>> >
>> > It would be great if you can expand a bit more on the implementation
>> > or point to a document?
>> >
>> > Thanks!
>> >
>> >
>> >
>> > On Mon, Apr 3, 2017 at 7:53 AM Jon Roberts wrote:
>> >
>> > > JDBC PXF is pretty exciting. Are there details on how this works?
>> > >
>> > > Does PXF randomly pick a segment to connect to the JDBC source or
>> > > do you specify a particular node to execute the query? I'm assuming
>> > > it isn't a parallel query so you don't flood the JDBC source with x
>> > > number of "select *" queries but I could be wrong.
>> > >
>> > > How do you register the JDBC connection string with PXF?
>> > >
>> > > How do you manage credentials?
>> > >
>> > > How do you register the JDBC jar files?
>> > >
>> > >
>> > > Jon Roberts
>> > >
>> > > On Sat, Apr 1, 2017 at 5:05 PM, Ed Espino wrote:
>> > >
>> > > > Thank you Michael, this is a well-deserved report highlight. I
>> > > > have added the following to the report:
>> > > >
>> > > > 3. Community contribution highlight(s):
>> > > >
>> > > > * Leveraging the extensible PXF design, a JDBC PXF plugin
>> > > >   was contributed by Devin Jia (github id: jiadexin). This
>> > > >   contribution came from the community and not from the
>> > > >   company which originally donated HAWQ to the ASF.
>> > > >
>> > > > Warm regards,
>> > > > -=e
>> > > >
>> > > > On Sat, Apr 1, 2017 at 1:14 PM, Michael André Pearce <
>> > > > michael.andre.pearce@me.com> wrote:
>> > > >
>> > > > >
>> > > > > I would add a note about the new JDBC PXF plugin being
>> > > > > contributed by Devin Jia. (Kudos to him)
>> > > > >
>> > > > > It's a good show of the community contributing and growing
>> > > > > outside of just Pivotal staff (I think for incubator review
>> > > > > this is important) and also the benefit of the extensible PXF
>> > > > > design the project has done.
>> > >
>> > The information contained in this email is strictly confidential and
>> > for the use of the addressee only, unless otherwise indicated. If you
>> > are not the intended recipient, please do not read, copy, use or
>> > disclose to others this message or any attachment.
>> > Please also notify the sender by replying to this email or by
>> > telephone (+44(020 7896 0011) and then delete the email and any
>> > copies of it. Opinions, conclusion (etc) that do not relate to the
>> > official business of this company shall be understood as neither
>> > given nor endorsed by it. IG is a trading name of IG Markets Limited
>> > (a company registered in England and Wales, company number 04008957)
>> > and IG Index Limited (a company registered in England and Wales,
>> > company number 01190902). Registered address at Cannon Bridge House,
>> > 25 Dowgate Hill, London EC4R 2YA. Both IG Markets Limited (register
>> > number 195355) and IG Index Limited (register number 114059) are
>> > authorised and regulated by the Financial Conduct Authority.
>> >
>>
>

--94eb2c055f4e3211e3054e2664e9--
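
On question 1 in the quoted thread (how many fragments a given RANGE and INTERVAL imply), a rough back-of-the-envelope sketch follows. It assumes, without having checked the plugin source, that each interval-sized slice of the range becomes one fragment; the function name count_day_fragments is hypothetical and is not part of PXF. Whether each fragment then becomes a concurrent query against the source database is not established anywhere in this thread.

```python
from datetime import date


def count_day_fragments(start: date, end: date, interval_days: int = 1) -> int:
    """Count interval_days-wide slices needed to cover the range [start, end).

    Assumption (not confirmed by the thread): one slice maps to one fragment.
    """
    if interval_days <= 0:
        raise ValueError("interval_days must be positive")
    total_days = (end - start).days
    # Ceiling division, so a trailing partial slice still counts as a fragment.
    return -(-total_days // interval_days)


# RANGE=2008-01-01:2010-01-01 with INTERVAL=1:day
print(count_day_fragments(date(2008, 1, 1), date(2010, 1, 1)))  # 731
```

The result is 731 rather than 730 because 2008 is a leap year; either way, a 1:day interval over two years is several hundred slices, which is the scale the question is worried about.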