From: "Till Westmann"
To: dev@asterixdb.apache.org
Subject: Re: Temporary Datasets
Date: Mon, 20 Nov 2017 13:13:50 -0800
Message-ID: <2702DF13-3B8F-4DA9-8F4D-CAD175B48FFA@apache.org>

Good question - we might. If we take it out now, we could break future
applications if we reintroduce it. Now I’m torn between not keeping
unused stuff and compatibility.
In this case I’m leaning slightly towards compatibility …

Cheers,
Till

On 20 Nov 2017, at 12:41, Ian Maxon wrote:

> Are we going to preserve the keyword?
>
> On Mon, Nov 20, 2017 at 12:06 PM, Michael Blow wrote:
>
>>>> removing temporary datasets?
>>
>> +1
>>
>> On Mon, Nov 20, 2017 at 1:52 PM Murtadha Hubail wrote:
>>
>>> Hi all,
>>>
>>> As you might be aware, we have a feature in AsterixDB to create
>>> temporary datasets that differ from regular datasets in some ways
>>> such as:
>>> - Their existence is not persisted in metadata, but only in the CC
>>>   metadata cache.
>>> - They don’t generate any transaction logs.
>>> - Their files are deleted on NC restart.
>>> - If they are not accessed for some period of time, their metadata
>>>   records are removed from the CC metadata cache.
>>>
>>> Temporary datasets were originally introduced to serve as a staging
>>> area between AsterixDB and external systems such as Pregelix.
>>> However, as the system evolved over the years, the assumptions they
>>> were built on don’t hold anymore and they could lead to undesired
>>> consequences such as leaking files after a CC restart or inability
>>> to access the dataset files on a restarted NC. Therefore, I’m
>>> proposing to remove the support for the current temporary datasets
>>> and we may add the feature back with a careful design at a later
>>> stage.
>>>
>>> Any thoughts or concerns on removing temporary datasets?
>>>
>>> Cheers,
>>> Murtadha