Return-Path: X-Original-To: apmail-ignite-dev-archive@minotaur.apache.org Delivered-To: apmail-ignite-dev-archive@minotaur.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 2A9B81806C for ; Tue, 14 Jul 2015 14:54:45 +0000 (UTC) Received: (qmail 32383 invoked by uid 500); 14 Jul 2015 14:54:45 -0000 Delivered-To: apmail-ignite-dev-archive@ignite.apache.org Received: (qmail 32343 invoked by uid 500); 14 Jul 2015 14:54:45 -0000 Mailing-List: contact dev-help@ignite.incubator.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@ignite.incubator.apache.org Delivered-To: mailing list dev@ignite.incubator.apache.org Received: (qmail 32331 invoked by uid 99); 14 Jul 2015 14:54:44 -0000 Received: from Unknown (HELO spamd1-us-west.apache.org) (209.188.14.142) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 14 Jul 2015 14:54:44 +0000 Received: from localhost (localhost [127.0.0.1]) by spamd1-us-west.apache.org (ASF Mail Server at spamd1-us-west.apache.org) with ESMTP id 5CB0DD4881 for ; Tue, 14 Jul 2015 14:54:44 +0000 (UTC) X-Virus-Scanned: Debian amavisd-new at spamd1-us-west.apache.org X-Spam-Flag: NO X-Spam-Score: 2.9 X-Spam-Level: ** X-Spam-Status: No, score=2.9 tagged_above=-999 required=6.31 tests=[DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, HTML_MESSAGE=3, SPF_PASS=-0.001, URIBL_BLOCKED=0.001] autolearn=disabled Authentication-Results: spamd1-us-west.apache.org (amavisd-new); dkim=pass (2048-bit key) header.d=gmail.com Received: from mx1-us-east.apache.org ([10.40.0.8]) by localhost (spamd1-us-west.apache.org [10.40.0.7]) (amavisd-new, port 10024) with ESMTP id 5NUyYZZ9VUFi for ; Tue, 14 Jul 2015 14:54:36 +0000 (UTC) Received: from mail-wi0-f177.google.com (mail-wi0-f177.google.com [209.85.212.177]) by mx1-us-east.apache.org (ASF Mail Server at mx1-us-east.apache.org) with ESMTPS id 283C24C0CE for ; Tue, 14 Jul 2015 14:54:36 +0000 (UTC) Received: by wibud3 with SMTP id ud3so16699624wib.0 for ; Tue, 14 Jul 2015 07:53:50 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:in-reply-to:references:date:message-id:subject:from:to :content-type; bh=nXIFiMdNHmr9gDG7aH41223CJUjHqeSZcInIYOP+qvs=; b=wqmKVZCQ8zjVrq5hlCpwKZilYpNUEHDRvG/IporyuyYbK1yQnNzFGMmCXO5lIVN8Mh yPo/ofIh2/aRGf0H7wePakdaZBoOwN+uEMkcWZ9hIqm5dx2Iydd4CPUGLBapeL3KSOu7 4Bm3RBhwwwK/TRWln/WrfYofLs9rTwWYYpuQFWVzOVA/kDDyO0/9xJVSA7EVakOE0rzl dJpYihF0O4w77xyR8Fy/vOfb/CLmy84YYG1ZVxEUP+WYImCY6BvReiDSmzM0Mo+o4Hfv M8jrjUSnxi0BgyX4c2LdCq+PMXkZDZO2nAOBQNjrenbuKp57afgrokKvp3jyhe6zkWdP Oe/A== MIME-Version: 1.0 X-Received: by 10.180.215.101 with SMTP id oh5mr6419941wic.6.1436885630252; Tue, 14 Jul 2015 07:53:50 -0700 (PDT) Received: by 10.194.56.199 with HTTP; Tue, 14 Jul 2015 07:53:50 -0700 (PDT) Received: by 10.194.56.199 with HTTP; Tue, 14 Jul 2015 07:53:50 -0700 (PDT) In-Reply-To: References: <20150709195849.GL28615@boudnik.org> Date: Tue, 14 Jul 2015 20:23:50 +0530 Message-ID: Subject: Re: PRAM Distributed Sorting From: Atri Sharma To: dev@ignite.incubator.apache.org Content-Type: multipart/alternative; boundary=001a1135f95292fe5b051ad701a9 --001a1135f95292fe5b051ad701a9 Content-Type: text/plain; charset=UTF-8 While I do agree with you in principle, I am not sure about the startup costs and node transfer costs. This is pretty experimental so I might be re inventing the wheel :) On 14 Jul 2015 19:18, "Gianfranco Murador" wrote: > I believe that an "reduce" function is is appropriate for this type of task > and is generic enough to sort by any criteria. > Maybe I'm wrong, but that's just my opinion. > Regards, > Gianfranco > > 2015-07-14 15:11 GMT+02:00 Atri Sharma : > > > So, consider a relational database, like postgres. A major component of > > sorting performance comes from the in memory sorting that happens for > this > > case. Normally, something like an external sort would be used in > > conjugation with the disk files. However, a big data analytical > production > > use case has this requirement that the available memory to postgres for > > sorting is pretty huge *but* so is the data and the response time has to > be > > really fast and oh, the data has to be streamed from the database given > > certain events. > > > > So what I was thinking was on these lines: > > > > 1) Add a sorting module to the engine. > > 2) Allow the sorting module to get the data streamed through data > > streamers. > > 3) Give sorting module access to the cache. > > 4) Make a sort API which can be used by an external engine to chunk sort > > into ignite, using streamers to stream data and distribute sort across > > multiple threads, and give sorted results back. > > > > Note : This is actually more of a use case for Ignite. The reasons I > > proposed adding it to core were: 1) Since direct interaction with data > > streamer and cache is needed. 2) It would be a good use case demo. 3) It > > might allow Ignite to be used as a pure play sorting engine thus allowing > > existing databases to work with it. > > > > Thoughts? > > > > On Tue, Jul 14, 2015 at 4:49 PM, Gianfranco Murador < > > murador.gianfranco@gmail.com> wrote: > > > > > I would say that in case of a distributed algorithm complexity lies > not > > > only in the number of input data, but also and, more, in the number of > > > messages exchanged between nodes to achieve the result. > > > I agree to maintain a certain principle of locality for related data, > or > > > leave this task to a system that already has a data model suitable to > > > scale sorting ( RDBMS ? ). > > > Regards, > > > Gianfranco > > > > > > > > > 2015-07-14 12:14 GMT+02:00 Atri Sharma : > > > > > > > Hi Roman, > > > > > > > > On Tue, Jul 14, 2015 at 12:32 AM, Roman Shaposhnik < > > roman@shaposhnik.org > > > > > > > > wrote: > > > > > > > > > On Sun, Jul 12, 2015 at 11:41 PM, Atri Sharma > > > > > wrote: > > > > > > > > > > > > > > > What's the interconnect for this system? > > > > > > > > > > > > > Not sure I got what you meant here. > > > > > > > > > > > > -- > > > > Regards, > > > > > > > > Atri > > > > *l'apprenant* > > > > > > > > > > > > > > > -- > > Regards, > > > > Atri > > *l'apprenant* > > > --001a1135f95292fe5b051ad701a9--