Date: Mon, 29 Jul 2013 21:51:35 -0400
Subject: Tablesample doubling
From: "j.barrett Strausser"
To: user@hive.apache.org

Hello All,

Why does TABLESAMPLE(N ROWS) produce output with 2*N rows?

I have the following script:

DROP TABLE IF EXISTS sparse_features_small;

CREATE TABLE sparse_features_small
ROW FORMAT DELIMITED FIELDS TERMINATED BY ',' LINES TERMINATED BY '\n'
AS
SELECT
        *
FROM
        sparse_features
TABLESAMPLE(50000 ROWS)

After I execute this by sourcing the file, I can then execute :

--
https://github.com/bearrito
@deepbearrito