Return-Path: X-Original-To: archive-asf-public-internal@cust-asf2.ponee.io Delivered-To: archive-asf-public-internal@cust-asf2.ponee.io Received: from cust-asf.ponee.io (cust-asf.ponee.io [163.172.22.183]) by cust-asf2.ponee.io (Postfix) with ESMTP id 8C004200C61 for ; Tue, 25 Apr 2017 13:31:33 +0200 (CEST) Received: by cust-asf.ponee.io (Postfix) id 8A676160BB3; Tue, 25 Apr 2017 11:31:33 +0000 (UTC) Delivered-To: archive-asf-public@cust-asf.ponee.io Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by cust-asf.ponee.io (Postfix) with SMTP id A8AE4160B9E for ; Tue, 25 Apr 2017 13:31:32 +0200 (CEST) Received: (qmail 21208 invoked by uid 500); 25 Apr 2017 11:31:31 -0000 Mailing-List: contact dev-help@hawq.incubator.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@hawq.incubator.apache.org Delivered-To: mailing list dev@hawq.incubator.apache.org Received: (qmail 21190 invoked by uid 99); 25 Apr 2017 11:31:31 -0000 Received: from pnap-us-west-generic-nat.apache.org (HELO spamd1-us-west.apache.org) (209.188.14.142) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 25 Apr 2017 11:31:31 +0000 Received: from localhost (localhost [127.0.0.1]) by spamd1-us-west.apache.org (ASF Mail Server at spamd1-us-west.apache.org) with ESMTP id 15843D0E81 for ; Tue, 25 Apr 2017 11:31:31 +0000 (UTC) X-Virus-Scanned: Debian amavisd-new at spamd1-us-west.apache.org X-Spam-Flag: NO X-Spam-Score: 1.782 X-Spam-Level: * X-Spam-Status: No, score=1.782 tagged_above=-999 required=6.31 tests=[DKIM_SIGNED=0.1, DKIM_VALID=-0.1, HTML_MESSAGE=2, RCVD_IN_DNSWL_LOW=-0.7, RCVD_IN_MSPIKE_H3=-0.01, RCVD_IN_MSPIKE_WL=-0.01, RCVD_IN_SORBS_SPAM=0.5, URIBL_BLOCKED=0.001, WEIRD_PORT=0.001] autolearn=disabled Authentication-Results: spamd1-us-west.apache.org (amavisd-new); dkim=pass (2048-bit key) header.d=pivotal-io.20150623.gappssmtp.com Received: from mx1-lw-us.apache.org ([10.40.0.8]) by localhost (spamd1-us-west.apache.org [10.40.0.7]) (amavisd-new, port 10024) with ESMTP id BZaXZU0HbVWc for ; Tue, 25 Apr 2017 11:31:28 +0000 (UTC) Received: from mail-it0-f50.google.com (mail-it0-f50.google.com [209.85.214.50]) by mx1-lw-us.apache.org (ASF Mail Server at mx1-lw-us.apache.org) with ESMTPS id 4FBEE5FC84 for ; Tue, 25 Apr 2017 11:31:28 +0000 (UTC) Received: by mail-it0-f50.google.com with SMTP id f187so11614327ite.1 for ; Tue, 25 Apr 2017 04:31:28 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=pivotal-io.20150623.gappssmtp.com; s=20150623; h=mime-version:in-reply-to:references:from:date:message-id:subject:to; bh=jhLnaU85Z5DbIQESVam63YL4XXioMka3CuixhNkTcWw=; b=LuTlsAAwr1LaC42v0VIPOfzFu16khl3yS2FomoLnH4nQGBVHhU4nT2izcIqvDeKgLS //dTOAm8OtZg2e+ERDgAV5zzgS6CAi9qfIuxUnIPl8JmL2b03qOERCyAPzVT/kj69ASI dhVG4b1B3wHLrXXZFWtagEBJcZdzLvWTld5Fh8xEF5wbPQpd6fMKuLSmTlvEDWoYWarJ 50i/vp8pIyRfhDO6IPhNbGMfPE3G3Gte90cE60kFQeXv9gsee5fx1Ay4TGGWpzCBGOGk jbaQqcfM/w274fNnYOdjmOK2toLn/HunHXzTLaPpUwA8Q2iD2G/N+cm0ltMISudwyfUv q2PA== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:mime-version:in-reply-to:references:from:date :message-id:subject:to; bh=jhLnaU85Z5DbIQESVam63YL4XXioMka3CuixhNkTcWw=; b=O/oX6fx1vjnrqvH0ZMwBWur4cSwbTVXs+ASvpn8CgfYNxOGag+Yh1A5U8140dq8a+U fD4m1nSIhAa6YHIajJjbAZ+tjwErGlEHcpWHf9tOtsYWW4RVccCA+7478tzbG0lEc4Fq eMPBveHr+ke5oyG4JgFwzQwE2Nb1Y4AuEJuAM4d6arCdtVFCXXa3qtoTiLnrJ1y/HgCs kQwL4znmM6r5U5VPbTiU+hI1dsfs2FYWC3WUokXUMmejNVDzdVqEyv7Rmei7il89bQgV XcaS0Mpull4H5sG97o+NES1R+TIRWh2z9zFiku/sYy0it8uoPW2AaE7AqTLYrnaCj5ve 2NZA== X-Gm-Message-State: AN3rC/4gfqkj+8A4qVaJ2lHsFRTlTlu6lqbkpPKVWGtUALswVvRs01os guMxIdMm8qgwWV/BzwOBM9Pj8hgl0heg X-Received: by 10.36.230.5 with SMTP id e5mr3902356ith.0.1493119876658; Tue, 25 Apr 2017 04:31:16 -0700 (PDT) MIME-Version: 1.0 Received: by 10.79.38.77 with HTTP; Tue, 25 Apr 2017 04:31:16 -0700 (PDT) In-Reply-To: <672AB305-4ADD-428B-936A-AC654C2C9F29@ig.com> References: <038744F5-CBEB-41FC-9E5B-2CFEC360498E@me.com> <96f0db480b8a4916bd1de784ef5e7ff7@BMPRDEXC142.IGI.IG.LOCAL> <895de143c82348318ed349d06962b6a0@BMPRDEXC142.IGI.IG.LOCAL> <68ce1a339f65492fa0a766b080414e18@BMPRDEXC142.IGI.IG.LOCAL> <672AB305-4ADD-428B-936A-AC654C2C9F29@ig.com> From: Jon Roberts Date: Tue, 25 Apr 2017 06:31:16 -0500 Message-ID: Subject: Re: PXF JDBC plugin To: dev@hawq.incubator.apache.org Content-Type: multipart/alternative; boundary=94eb2c11c82edb1246054dfc0fbe archived-at: Tue, 25 Apr 2017 11:31:33 -0000 --94eb2c11c82edb1246054dfc0fbe Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: quoted-printable No I did not. My post was intended to start a discussion about the plugin and I thought the original contributor would chime in. But I can make the tickets now if you think that is the best way to move forward. Jon Roberts Principal Engineer | jroberts@pivotal.io | 615-426-8661 On Tue, Apr 25, 2017 at 2:58 AM, Michael Pearce wrote: > Hi Jon, > > Did you make JIRA issues for the two improvements you suggested to be mad= e? > > Cheers > Mike > > On 04/04/2017, 04:15, "Michael Pearce" wrote: > > Hi Jon > > I think on issue 1 and 2 are valid improvements that can be made, add > some Jira's for these. Looking at issue 1 it seems quite trivial for > someone to contribute the solution for. > > On the question front: > > 1) from my understanding and using this plugin so far this is what th= e > partion by interval allows you to control the number of partitions that > would be generated, e.g. This is why the sample in document has 1 year so > it only creates 2 fragments. You can also partition by an enum field her= e > the number of partitions is the number of enums. > > 2) I haven't checked specifically, saying that I haven't run into an > issue when trialling this myself. Obviously this is contributed by Devin = he > will be the definitive source here. Though I guess can always specially > test these scenarios. > > Sent using OWA for iPhone > ________________________________________ > From: Jon Roberts > Sent: Monday, April 3, 2017 11:57:59 PM > To: dev@hawq.incubator.apache.org > Subject: Re: PXF JDBC plugin > > https://github.com/apache/incubator-hawq/tree/master/pxf/pxf-jdbc > > Issue 1: Security > The example has the username and password in the connection string. > > LOCATION ('pxf://localhost:51200/demodb.myclass' > '?PROFILE=3DJDBC' > '&JDBC_DRIVER=3Dcom.mysql.jdbc.Driver' > '&DB_URL=3Djdbc:mysql:// > 192.168.200.6:3306/demodb&USER=3Droot&PASS=3Droot' > ) > > Any chance we can get this changed to a connection profile that point= s > to a > file outside of the database? For Greenplum database and S3, the > LOCATION > syntax includes "config=3D/path/to/config_file". The config_file > contains > the S3 credentials. This seems like a good pattern to follow. > > As it is right now, anyone that can connect to the database will be > able to > see the username and password of the JDBC connection. > > Issue 2: Extra Properties > > Some JDBC drivers will need many additional properties beyond the URL > and > this requires setting it with a put to a Properties variable. An > example > of this is Oracle's defaultRowPrefetch property that needs to be > updated > from the default of 10 which is designed for OLTP to something larger > like > 2000 which is more ideal for data extracts. > > Additionally, you will need the ability to set the isolation level > which is > done with setTransactionIsolation on the Connection. I don't believe > you > can set this on the connection URL either. Many SQL Server and DB2 > database still don't use snapshot isolation and use dirty reads > instead. > Without a dirty read here, your query will block modifications to the > table > with a "blocking lock". > > So for the configuration file will need both an extra properties > variable > that is delimited so you can multiple and a isolation level indicator= . > > Questions. > 1. How do you manage the maximum number of pxf instances? For > example, if > you partition like this: > "PARTITION_BY=3Dcdate:date&RANGE=3D2008-01-01:2010-01-01&INTERVAL=3D1= :day", > will > you create 730 pxf instances and thus 730 concurrent queries to the > source > database? > 2. Have you tested special characters like null, escape, carriage > return, > newline, and your delimiter? Maybe that is handled automatically by > pxfwritable_import so this isn't an issue. > > > > Jon Roberts > Principal Engineer | jroberts@pivotal.io | 615-426-8661 > > On Mon, Apr 3, 2017 at 3:18 PM, Michael Pearce > wrote: > > > I mean the readme file :) at that location. > > ________________________________________ > > From: Michael Pearce > > Sent: Monday, April 3, 2017 9:15:48 PM > > To: dev@hawq.incubator.apache.org > > Subject: Re: PXF JDBC plugin > > > > On master there is the document that was added. > > > > https://github.com/apache/incubator-hawq/tree/master/pxf/pxf-jdbc > > > > Reading this I think answers some of the questions. > > > > > > > > > > > > Sent using OWA for iPhone > > ________________________________________ > > From: Vineet Goel > > Sent: Monday, April 3, 2017 6:33:07 PM > > To: dev@hawq.incubator.apache.org > > Subject: Re: PXF JDBC plugin > > > > Devin and others, > > > > It would be great if you can expand a bit more on the implementatio= n > or > > point to a document? > > > > Thanks! > > > > > > > > On Mon, Apr 3, 2017 at 7:53 AM Jon Roberts > wrote: > > > > > JDBC PXF is pretty exciting. Are there details on how this works= ? > > > > > > Does PXF randomly pick a segment to connect to the JDBC source or > do you > > > specify a particular node to execute the query? I'm assuming it > isn't a > > > parallel query so you don't flood the JDBC source with x number o= f > > "select > > > *" queries but I could be wrong. > > > > > > How do you register the JDBC connection string with PXF? > > > > > > How do you manage credentials? > > > > > > How do you register the JDBC jar files? > > > > > > > > > Jon Roberts > > > > > > On Sat, Apr 1, 2017 at 5:05 PM, Ed Espino > wrote: > > > > > > > Thank you Michael, this is a well-deserved report highlight. I > have > > added > > > > the following to the report: > > > > > > > > 3. Community contribution highlight(s): > > > > > > > > * Leveraging the extensible PXF design, a JDBC PXF plugin > > > > was contributed by Devin Jia (github id: jiadexin). This > > > > contribution came from the community and not from the > > > > company which originally donated HAWQ to the ASF. > > > > > > > > Warm regrds, > > > > -=3De > > > > > > > > On Sat, Apr 1, 2017 at 1:14 PM, Michael Andr=C3=A9 Pearce < > > > > michael.andre.pearce@me.com> wrote: > > > > > > > > > > > > > > I would add note about the new JDBC PXF plugin being > contributed by > > > Devin > > > > > Jia. (Kudos to him) > > > > > > > > > > It's a good show of the community contributing and growing > outside of > > > > just > > > > > Pivotal staff (I think for incubator review this is important= ) > and > > also > > > > the > > > > > benefit of the extensible PXF design the project has done. > > > > > The information contained in this email is strictly confidential an= d > for > > the use of the addressee only, unless otherwise indicated. If you > are not > > the intended recipient, please do not read, copy, use or disclose t= o > others > > this message or any attachment. Please also notify the sender by > replying > > to this email or by telephone (+44(020 7896 0011) and then delete > the email > > and any copies of it. Opinions, conclusion (etc) that do not relate > to the > > official business of this company shall be understood as neither > given nor > > endorsed by it. IG is a trading name of IG Markets Limited (a compa= ny > > registered in England and Wales, company number 04008957) and IG > Index > > Limited (a company registered in England and Wales, company number > > 01190902). Registered address at Cannon Bridge House, 25 Dowgate > Hill, > > London EC4R 2YA. Both IG Markets Limited (register number 195355) > and IG > > Index Limited (register number 114059) are authorised and regulated > by the > > Financial Conduct Authority. > > > > > The information contained in this email is strictly confidential and for > the use of the addressee only, unless otherwise indicated. If you are not > the intended recipient, please do not read, copy, use or disclose to othe= rs > this message or any attachment. Please also notify the sender by replying > to this email or by telephone (+44(020 7896 0011) and then delete the ema= il > and any copies of it. Opinions, conclusion (etc) that do not relate to th= e > official business of this company shall be understood as neither given no= r > endorsed by it. IG is a trading name of IG Markets Limited (a company > registered in England and Wales, company number 04008957) and IG Index > Limited (a company registered in England and Wales, company number > 01190902). Registered address at Cannon Bridge House, 25 Dowgate Hill, > London EC4R 2YA. Both IG Markets Limited (register number 195355) and IG > Index Limited (register number 114059) are authorised and regulated by th= e > Financial Conduct Authority. > --94eb2c11c82edb1246054dfc0fbe--