Return-Path: X-Original-To: archive-asf-public-internal@cust-asf2.ponee.io Delivered-To: archive-asf-public-internal@cust-asf2.ponee.io Received: from cust-asf.ponee.io (cust-asf.ponee.io [163.172.22.183]) by cust-asf2.ponee.io (Postfix) with ESMTP id 4BEAC200BF6 for ; Tue, 10 Jan 2017 17:43:10 +0100 (CET) Received: by cust-asf.ponee.io (Postfix) id 4A798160B3D; Tue, 10 Jan 2017 16:43:10 +0000 (UTC) Delivered-To: archive-asf-public@cust-asf.ponee.io Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by cust-asf.ponee.io (Postfix) with SMTP id 912FA160B2C for ; Tue, 10 Jan 2017 17:43:09 +0100 (CET) Received: (qmail 62223 invoked by uid 500); 10 Jan 2017 16:43:08 -0000 Mailing-List: contact dev-help@sqoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@sqoop.apache.org Delivered-To: mailing list dev@sqoop.apache.org Received: (qmail 62210 invoked by uid 99); 10 Jan 2017 16:43:08 -0000 Received: from pnap-us-west-generic-nat.apache.org (HELO spamd4-us-west.apache.org) (209.188.14.142) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 10 Jan 2017 16:43:08 +0000 Received: from localhost (localhost [127.0.0.1]) by spamd4-us-west.apache.org (ASF Mail Server at spamd4-us-west.apache.org) with ESMTP id 09FEFC043B for ; Tue, 10 Jan 2017 16:43:08 +0000 (UTC) X-Virus-Scanned: Debian amavisd-new at spamd4-us-west.apache.org X-Spam-Flag: NO X-Spam-Score: 2.509 X-Spam-Level: ** X-Spam-Status: No, score=2.509 tagged_above=-999 required=6.31 tests=[DKIM_SIGNED=0.1, DKIM_VALID=-0.1, HTML_MESSAGE=2, RCVD_IN_DNSWL_NONE=-0.0001, RCVD_IN_MSPIKE_H2=-0.001, RCVD_IN_SORBS_SPAM=0.5, SPF_PASS=-0.001, T_REMOTE_IMAGE=0.01, URIBL_BLOCKED=0.001] autolearn=disabled Authentication-Results: spamd4-us-west.apache.org (amavisd-new); dkim=pass (2048-bit key) header.d=cloudera-com.20150623.gappssmtp.com Received: from mx1-lw-us.apache.org ([10.40.0.8]) by localhost (spamd4-us-west.apache.org [10.40.0.11]) (amavisd-new, port 10024) with ESMTP id blNUm_t4RGgJ for ; Tue, 10 Jan 2017 16:43:05 +0000 (UTC) Received: from mail-oi0-f44.google.com (mail-oi0-f44.google.com [209.85.218.44]) by mx1-lw-us.apache.org (ASF Mail Server at mx1-lw-us.apache.org) with ESMTPS id 4CEA25F1EE for ; Tue, 10 Jan 2017 16:43:05 +0000 (UTC) Received: by mail-oi0-f44.google.com with SMTP id 3so552907207oih.1 for ; Tue, 10 Jan 2017 08:43:05 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=cloudera-com.20150623.gappssmtp.com; s=20150623; h=mime-version:in-reply-to:references:from:date:message-id:subject:to; bh=1t0mYRTzcAuIVQGQmAcflX8uBi63iQ/EJWav42bDXoU=; b=Npgv7pBVcElAVBr2uBONHmeqqRmpmSs1gg1ju8WGGmEuWG19bv6OOlEe6JGGgiQVL2 +pY07jf8fA9P+uM7jR2OBsipzh59im4Uz+MmTuDT6xRu8oTHs2aDxW+v6Rzai3DV3eYk ZNARzWt/j6K4RE5/xRX7ukmjDs2M8Xwgk7BPip+JMzXA7meyFO0MFYkdVQxj3cFPP1Hr gNusHag5ZqvCRQXmfPwi/6eB6YUB7ufmarg1F+2CWVoLnHBf/Yy6gpeFtpvU4o3QWdo5 Yu7jbEUCV8MYrul2Yami8r+O9vdI4F48F6I0eOm4flmYyOLE9gmcOpddlcbMWHjAGfH0 TKIw== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:mime-version:in-reply-to:references:from:date :message-id:subject:to; bh=1t0mYRTzcAuIVQGQmAcflX8uBi63iQ/EJWav42bDXoU=; b=kGciTdxGAS33IDW9erNK/8ReYVa1c0asOo6nd8cDXP1gpGgSnfwyHpch4JdYyAiHPj Hd/CGa2507n9a+6Prj0p962ILaapEZNRnz91rxHW8OYAhllSIJGhuB2Yr1jlDXX1RTkR wQ8anibZBJisPHe6F6AQ7MvT+P6p2ghdSCRg/P2cvqGXX7EYVtxmjEGtOlVOQC6CoKvG 62ie/KxphwGIFQSF4lKRZKWBzSp/4OTSEcWnl9A+sOzZhrBJgKO5d6qrDl8NcX2W83rf xQRTZgBYezpVaQAMsY82h2RRsccvNueZR5uvz7rj0qfvXY8w77L/3TsG+x4DlmZNpMO3 dz/w== X-Gm-Message-State: AIkVDXIWaUY8LChQyFDmgoZObCGdCsbRL3dhZ4YC9pAOq/eWHZtuARRloyd6M146WcGnn1xOP8bfwI8W9rUI7Pj9 X-Received: by 10.157.3.209 with SMTP id f75mr1804605otf.261.1484066583980; Tue, 10 Jan 2017 08:43:03 -0800 (PST) MIME-Version: 1.0 Received: by 10.202.241.69 with HTTP; Tue, 10 Jan 2017 08:42:23 -0800 (PST) In-Reply-To: References: From: Szabolcs Vasas Date: Tue, 10 Jan 2017 17:42:23 +0100 Message-ID: Subject: Re: Import more than 10 million records from MySQL to HDFS To: dev@sqoop.apache.org Content-Type: multipart/alternative; boundary=94eb2c03c8ce8fd12e0545c02d4e archived-at: Tue, 10 Jan 2017 16:43:10 -0000 --94eb2c03c8ce8fd12e0545c02d4e Content-Type: text/plain; charset=UTF-8 Hi Wenxing, I have created a table based on the column information you sent but I won't be able to do this testing in the next couple of days. Btw have you tried the import with smaller data sets? I mean have you tried to test what is the biggest data set you can import successfully? Szabolcs On Wed, Jan 4, 2017 at 10:55 AM, wenxing zheng wrote: > Hi Szabolcs, > > I am testing this scenario with our client's slave database. And I am > sorry that I can not share the table definition and the sample data here. > But attached is a sample of table definition with the column types. > > It's quite complex. > > Thanks, Wenxing > > On Wed, Jan 4, 2017 at 4:24 PM, Szabolcs Vasas wrote: > >> Hi Wenxing, >> >> I haven't tried this scenario yet but I would be happy to test it on my >> side. Can you please send me the DDL statement for creating the MySQL >> table >> and some sample data? >> Also it would be very helpful to send the details of the job you would >> like >> to run. >> >> Regards, >> Szabolcs >> >> On Wed, Jan 4, 2017 at 2:54 AM, wenxing zheng >> wrote: >> >> > can anyone help to advice? >> > >> > And I met with a problem when I set the checkColumn with updated_time, >> but >> > currently all the updated_time are in NULL. Under this case, the Sqoop >> will >> > fail to start the job. I think we need to support such kind of case. >> > >> > On Thu, Dec 29, 2016 at 9:18 AM, wenxing zheng > > >> > wrote: >> > >> > > Dear all, >> > > >> > > Did anyone already try to import more than 10 million data from MySQL >> to >> > > HDFS by using the Sqoop2? >> > > >> > > I always failed at the very beginning with various throttling >> settings, >> > > but never made it. >> > > >> > > Appreciated for any advice. >> > > Thanks, Wenxing >> > > >> > >> >> >> >> -- >> Szabolcs Vasas >> Software Engineer >> >> > > -- Szabolcs Vasas Software Engineer --94eb2c03c8ce8fd12e0545c02d4e--