From: Josh Elser
To: user@accumulo.apache.org
Subject: Re: Delete All Data In Table
Date: Mon, 12 May 2014 09:47:27 -0400
In-Reply-To: <1399901021696-9748.post@n5.nabble.com>

Not really, you enumerated the options pretty thoroughly :)

BatchDeleter is slow, like you said, because it pulls all of the data back to the client and issues the deletes from there.

You could save the splits for your table (in memory, or written to disk if they won't fit), run deleteRows, and re-add the splits afterwards.
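
A minimal sketch of that save-and-restore sequence, runnable without a cluster. Everything named here is an assumption for illustration: `SplitAwareTable` is a hypothetical stand-in for Accumulo's `TableOperations` (its methods mirror `listSplits`, `deleteRows` with null start and end rows, and `addSplits`), and `InMemoryTable` is a toy fake that only models what this thread states, namely that deleting the rows also drops the splits.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.SortedSet;
import java.util.TreeSet;

public class TruncateKeepingSplits {

    /** Hypothetical stand-in for the three TableOperations calls this approach needs. */
    interface SplitAwareTable {
        SortedSet<String> listSplits();           // cf. TableOperations.listSplits(table)
        void deleteAllRows();                     // cf. TableOperations.deleteRows(table, null, null)
        void addSplits(SortedSet<String> splits); // cf. TableOperations.addSplits(table, splits)
    }

    /** Toy fake (not Accumulo): deleting all rows also discards the splits. */
    static class InMemoryTable implements SplitAwareTable {
        final SortedSet<String> splits = new TreeSet<>();
        final List<String> rows = new ArrayList<>();

        public SortedSet<String> listSplits() { return new TreeSet<>(splits); }
        public void deleteAllRows() { rows.clear(); splits.clear(); }
        public void addSplits(SortedSet<String> s) { splits.addAll(s); }
    }

    /** Snapshot the splits, delete everything, then put the splits back. */
    static void truncate(SplitAwareTable table) {
        // Copy the split points first; the delete below throws them away.
        SortedSet<String> saved = new TreeSet<>(table.listSplits());
        table.deleteAllRows();
        if (!saved.isEmpty()) {
            table.addSplits(saved);
        }
    }

    public static void main(String[] args) {
        InMemoryTable t = new InMemoryTable();
        t.splits.add("g");
        t.splits.add("p");
        t.rows.add("apple");
        t.rows.add("kiwi");
        t.rows.add("zebra");

        truncate(t);
        System.out.println("rows=" + t.rows.size() + " splits=" + t.splits);
        // prints: rows=0 splits=[g, p]
    }
}
```

If the split set is too large to hold in memory, the same ordering applies with the snapshot written to a file before the delete and read back afterwards.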

You could also try setting the DevNullIterator on the table for major compaction and then compacting it. That's just a little roundabout.
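
A rough model of the compaction route, again runnable without a cluster. The names are assumptions: `CompactableTable` is a hypothetical stand-in for `TableOperations`, whose real calls would be `attachIterator`, `compact` and `removeIterator` scoped to `IteratorScope.majc`, and the iterator class is assumed to be `org.apache.accumulo.core.iterators.DevNull`, so check the name in your version. The fake treats a major compaction as "rewrite the data through the attached iterators and leave the splits alone", which is the behaviour the suggestion relies on.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;
import java.util.SortedSet;
import java.util.TreeSet;
import java.util.function.Predicate;

public class DropAllViaCompaction {

    /** Hypothetical stand-in for the TableOperations calls involved. */
    interface CompactableTable {
        void attachMajcIterator(String name, Predicate<String> keep); // cf. attachIterator(..., EnumSet.of(IteratorScope.majc))
        void compactAndWait();                                        // cf. compact(table, null, null, true, true)
        void removeMajcIterator(String name);                         // cf. removeIterator(table, name, EnumSet.of(IteratorScope.majc))
    }

    /** Toy fake (not Accumulo): compaction filters rows, splits are never touched. */
    static class InMemoryTable implements CompactableTable {
        final SortedSet<String> splits = new TreeSet<>();
        final List<String> rows = new ArrayList<>();
        final Map<String, Predicate<String>> majcIterators = new LinkedHashMap<>();

        public void attachMajcIterator(String name, Predicate<String> keep) {
            majcIterators.put(name, keep);
        }

        public void compactAndWait() {
            // Rewrite the data through every attached iterator, dropping what it rejects.
            for (Predicate<String> keep : majcIterators.values()) {
                rows.removeIf(keep.negate());
            }
        }

        public void removeMajcIterator(String name) {
            majcIterators.remove(name);
        }
    }

    /** Attach a keep-nothing iterator, compact, then always detach it again. */
    static void dropAll(CompactableTable table) {
        final String name = "devnull"; // hypothetical iterator name
        table.attachMajcIterator(name, row -> false); // keeps nothing
        try {
            table.compactAndWait();
        } finally {
            // Left in place, the iterator would keep discarding data in later compactions.
            table.removeMajcIterator(name);
        }
    }

    public static void main(String[] args) {
        InMemoryTable t = new InMemoryTable();
        t.splits.add("g");
        t.splits.add("p");
        t.rows.add("apple");
        t.rows.add("kiwi");

        dropAll(t);
        System.out.println("rows=" + t.rows.size() + " splits=" + t.splits);
    }
}
```

The point of the `finally` block is the cleanup: the table is only safe to write to again once the keep-nothing iterator has been removed.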

On May 12, 2014 9:30 AM, "BlackJack76" <justin.loy@gmail.com> wrote:
> Besides using the tableOperations to deleteRows or delete the table entirely,
> what is the fastest way to delete all data in a table?  I am currently using
> a BatchDeleter but it is extremely slow when I have a large amount of data.
> Any better options?
>
> I don't want to use the tableOperations because both the deleteRows and
> delete blow away the splits.  I would like to keep the splits in place.
>
> Appreciate your thoughts!
>
> --
> View this message in context: http://apache-accumulo.1065345.n5.nabble.com/Delete-All-Data-In-Table-tp9748.html
> Sent from the Users mailing list archive at Nabble.com.