Date: Tue, 26 Jan 2016 00:41:24 +0400
Subject: Re: Weekly status
From: abdullah alamoudi
To: dev@asterixdb.incubator.apache.org

Answers inlined.

Cheers,
Abdullah.

Amoudi, Abdullah.

On Tue, Jan 26, 2016 at 12:15 AM, Mike Carey wrote:

> So my remaining Q's (before we e-mail anyone) would be:
> 1. Do we possibly drop some data? (Still trying to
> make sure I understand the implications of your first
> point below.)
>
No, but we don't know when a record was actually persisted, so there is a
small window in which we could record a file as completely read and then
crash before its records are persisted.

> 2. Do we (silently) ignore faulty records for now?
>
Yes.

> 3. Do we (silently) ignore duplicate records for now?
>
Yes.

> Thx!
> Mike
>
> On 1/24/16 5:48 AM, abdullah alamoudi wrote:
>
>> Yes, those two are fixed at this stage.
>> I am not sure I fully understand what you mean by semantics, but I will
>> explain how the filesystem feed will behave.
>> The user will have to provide the following: the directories' locations,
>> an expression to match file names against, and the format of the records.
>> Once the adapter is connected, it will start reading from the specified
>> directories until the directories are deleted or the feed is disconnected.
>> Duplicate records will be skipped.
>>
>> When there are no more files in the specified directories, the feed
>> adapter will push the buffered records to storage, then wait for more
>> events from the file system.
>>
>> Things that are not yet done properly:
>> 1. at-least-once semantics.
>> 2. logging of faulty records.
>> 3. logging of duplicate records.
>>
>> 2 & 3 are easy to implement correctly, while 1 will take about a week to
>> get right with a good design and implementation.
>> Should I draft a new email?
>>
>>
>> Amoudi, Abdullah.
>>
>> On Sun, Jan 24, 2016 at 3:14 AM, Mike Carey wrote:
>>
>>> Q: So are both of those (1 & 2) fixed? And could you clarify the
>>> semantics that file feeds will have at this stage? :-)
>>> Thx!
>>>
>>> On 1/22/16 11:40 PM, abdullah alamoudi wrote:
>>>
>>>> I think we are ready if they don't care much about at-least-once
>>>> semantics.
>>>>
>>>> Other than that, everything is ready.
>>>> The last communication we had with them was a promise to fix:
>>>> 1. Duplicate key exceptions
>>>> 2. Pushing last few records to storage.
>>>>
>>>> Cheers
>>>> Abdullah
>>>> Cool. Are we ready to ping Wisconsin (Condor) again soon...?
>>>> (When did you last interact w/them - and where were things left...?)
>>>>
>>>> On 1/22/16 8:11 AM, abdullah alamoudi wrote:
>>>>
>>>>> It looks like I will not be able to attend the weekly meeting today so I
>>>>> am sending my status here:
>>>>> 1. Addressed Young-Seok's comments on the Upsert change.
>>>>> 2. Completed streaming of Couchbase inserts into AsterixDB.
>>>>> 3. Completed the implementation of the flush() operation on all
>>>>> IFrameWriters.
>>>>> 4. Removed the ExternalLookup operator and merged it with
>>>>> UnnestMapOperator.
>>>>> 5. Created a proposal for Feed changes.
>>>>>
>>>>> Cheers,
>>>>> Abdullah
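
For readers of the archive, here is a minimal sketch of the window discussed in the answer to question 1 above, and of one way at-least-once behavior could be obtained. It is not AsterixDB code and not necessarily the design the thread had in mind: Python is used only for brevity, and every name in it (Storage, ingest, demo, crash_after) is hypothetical. The assumption is that a file is recorded as completely read only after its records have been persisted, so a crash in between causes the file to be replayed on restart, and the replayed records are then skipped as duplicates.

```python
# Hypothetical sketch; none of these names are AsterixDB APIs.

class Storage:
    """Toy keyed store that silently skips records whose key already exists."""

    def __init__(self):
        self.records = {}
        self.skipped_duplicates = 0

    def persist(self, batch):
        for key, value in batch:
            if key in self.records:
                self.skipped_duplicates += 1  # duplicate record is skipped
            else:
                self.records[key] = value


def ingest(files, storage, completed, crash_after=None):
    """Persist each unread file's records, then record the file as read.

    files:       maps a file name to a list of (key, value) records
    completed:   set of file names already recorded as completely read
    crash_after: if set, simulate a crash right after this file's records
                 are persisted but before the file is recorded as read
    """
    for name in sorted(files):
        if name in completed:
            continue
        storage.persist(files[name])
        if name == crash_after:
            raise RuntimeError("simulated crash before marking " + name)
        completed.add(name)  # recorded only after persistence succeeded


def demo():
    files = {"a.txt": [(1, "x"), (2, "y")], "b.txt": [(3, "z")]}
    storage, completed = Storage(), set()
    try:
        ingest(files, storage, completed, crash_after="a.txt")
    except RuntimeError:
        pass  # the feed went down; "a.txt" was never recorded as read
    ingest(files, storage, completed)  # restart: "a.txt" is replayed
    return storage, completed
```

With this ordering a crash can cause records to be read twice but not lost, which is why skipping duplicates matters. The reverse ordering (recording the file as read first) is the case described in the thread, where a crash before persistence could leave a file marked as read without its records being stored.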