From: Fran O
Date: Thu, 16 Mar 2017 12:20:58 -0400
Subject: HFILE creation to use a different committer
To: user@hbase.apache.org
Reply-To: user@hbase.apache.org
Mailing-List: contact user-help@hbase.apache.org; run by ezmlm
archived-at: Thu, 16 Mar 2017
16:54:18 -0000

Hi folks,

I would like to hear some thoughts on the following use case:

I use a custom MR job to create HFiles. The job writes the HFiles to S3. I am trying to change the OutputCommitter so that the reducers write the HFiles directly to their final destination on S3.

In my tests, even after setting the OutputCommitter to DirectOutputCommitter, the tasks always use the FileOutputCommitter. The job is configured as follows:

>> HFileOutputFormat2.configureIncrementalLoad(job, hTable);
>> FileOutputFormat.setOutputPath(job, outputPath);
>> FileOutputFormat.setCompressOutput(job, true);
>> FileOutputFormat.setOutputCompressorClass(job, SnappyCodec.class);

Looking at the code of FileOutputFormat, I see a getOutputCommitter method but no set method for the OutputCommitter.

Could someone shed some light on how to change the OutputCommitter for the tasks?

Thank you,
Fran