Return-Path: X-Original-To: apmail-accumulo-notifications-archive@minotaur.apache.org Delivered-To: apmail-accumulo-notifications-archive@minotaur.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 0AB1E1882F for ; Mon, 14 Mar 2016 18:03:34 +0000 (UTC) Received: (qmail 8848 invoked by uid 500); 14 Mar 2016 18:03:33 -0000 Delivered-To: apmail-accumulo-notifications-archive@accumulo.apache.org Received: (qmail 8808 invoked by uid 500); 14 Mar 2016 18:03:33 -0000 Mailing-List: contact notifications-help@accumulo.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: jira@apache.org Delivered-To: mailing list notifications@accumulo.apache.org Received: (qmail 8631 invoked by uid 99); 14 Mar 2016 18:03:33 -0000 Received: from arcas.apache.org (HELO arcas) (140.211.11.28) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 14 Mar 2016 18:03:33 +0000 Received: from arcas.apache.org (localhost [127.0.0.1]) by arcas (Postfix) with ESMTP id A7D1F2C1F60 for ; Mon, 14 Mar 2016 18:03:33 +0000 (UTC) Date: Mon, 14 Mar 2016 18:03:33 +0000 (UTC) From: "Keith Turner (JIRA)" To: notifications@accumulo.apache.org Message-ID: In-Reply-To: References: Subject: [jira] [Created] (ACCUMULO-4165) Create a user level API for RFile MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 Keith Turner created ACCUMULO-4165: -------------------------------------- Summary: Create a user level API for RFile Key: ACCUMULO-4165 URL: https://issues.apache.org/jira/browse/ACCUMULO-4165 Project: Accumulo Issue Type: Improvement Reporter: Keith Turner Fix For: 1.8.0 Users can bulk import RFiles. Currently the only way users can create RFiles using Accumulo's public API is via AccumuloFileOutputFormat. There is no way to read RFiles in the public API. Also, the internal APIs for reading and writing RFiles are cumbersome to use. I am experimenting with a simple RFile API like the following. Below is an example of writing data. {code:java} LocalFileSystem localFs = FileSystem.getLocal(new Configuration()); RFileWriter writer = RFileFactory.newWriter() .withFileName("/tmp/test100M.rf") .withFileSystem(localFs).build(); writer.startDefaultLocalityGroup(); for (int r = 0; r < 10000000; r++) { for (int cq = 0; cq < 10; cq++) { writer.append(genKey(r, cq), genVal(r, cq)); } } writer.close(); {code} Below is an example of reading data. {code:java} LocalFileSystem localFs = FileSystem.getLocal(new Configuration()); Scanner scanner = RFileFactory.newScanner() .withFileName("/tmp/test100M.rf") .withFileSystem(localFs) .withDataCache(250000000) .withIndexCache(1000000).build(); {code} -- This message was sent by Atlassian JIRA (v6.3.4#6332)