Return-Path: X-Original-To: archive-asf-public-internal@cust-asf2.ponee.io Delivered-To: archive-asf-public-internal@cust-asf2.ponee.io Received: from cust-asf.ponee.io (cust-asf.ponee.io [163.172.22.183]) by cust-asf2.ponee.io (Postfix) with ESMTP id 45148200BAE for ; Fri, 28 Oct 2016 17:30:45 +0200 (CEST) Received: by cust-asf.ponee.io (Postfix) id 43BAE160AE4; Fri, 28 Oct 2016 15:30:45 +0000 (UTC) Delivered-To: archive-asf-public@cust-asf.ponee.io Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by cust-asf.ponee.io (Postfix) with SMTP id 8C49E160ACA for ; Fri, 28 Oct 2016 17:30:44 +0200 (CEST) Received: (qmail 72413 invoked by uid 500); 28 Oct 2016 15:30:43 -0000 Mailing-List: contact user-help@predictionio.incubator.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@predictionio.incubator.apache.org Delivered-To: mailing list user@predictionio.incubator.apache.org Received: (qmail 72403 invoked by uid 99); 28 Oct 2016 15:30:43 -0000 Received: from pnap-us-west-generic-nat.apache.org (HELO spamd1-us-west.apache.org) (209.188.14.142) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 28 Oct 2016 15:30:43 +0000 Received: from localhost (localhost [127.0.0.1]) by spamd1-us-west.apache.org (ASF Mail Server at spamd1-us-west.apache.org) with ESMTP id 55172C9FAE for ; Fri, 28 Oct 2016 15:30:43 +0000 (UTC) X-Virus-Scanned: Debian amavisd-new at spamd1-us-west.apache.org X-Spam-Flag: NO X-Spam-Score: 0.499 X-Spam-Level: X-Spam-Status: No, score=0.499 tagged_above=-999 required=6.31 tests=[DKIM_SIGNED=0.1, DKIM_VALID=-0.1, RCVD_IN_DNSWL_NONE=-0.0001, RCVD_IN_MSPIKE_H2=-0.001, RCVD_IN_SORBS_SPAM=0.5] autolearn=disabled Authentication-Results: spamd1-us-west.apache.org (amavisd-new); dkim=pass (2048-bit key) header.d=occamsmachete-com.20150623.gappssmtp.com Received: from mx1-lw-us.apache.org ([10.40.0.8]) by localhost (spamd1-us-west.apache.org [10.40.0.7]) (amavisd-new, port 10024) with ESMTP id 2vmw0QXfWHld for ; Fri, 28 Oct 2016 15:30:42 +0000 (UTC) Received: from mail-pf0-f179.google.com (mail-pf0-f179.google.com [209.85.192.179]) by mx1-lw-us.apache.org (ASF Mail Server at mx1-lw-us.apache.org) with ESMTPS id E50505FB5B for ; Fri, 28 Oct 2016 15:30:41 +0000 (UTC) Received: by mail-pf0-f179.google.com with SMTP id 197so39336918pfu.0 for ; Fri, 28 Oct 2016 08:30:41 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=occamsmachete-com.20150623.gappssmtp.com; s=20150623; h=from:content-transfer-encoding:mime-version:subject:message-id:date :cc:to; bh=pyNIaXf/MKDNloo58Z3+OeUe8HvmFAvHUbpU6USJczI=; b=rw8xnVjRK1zWSLCqI1ssKunZXfB321FogeWB1ExPDX82f7r7f+73M4mf2NfjpL8IBt KOgSqSzckg5c/8M/9CcHiec/EH0HCwSvXB7CJWSZceYfSaDJ5j8UmnjN6MlqQKi7cvyJ n4Wty4N54GXe5js/OVjv8rX0Xo9pqxBHlvITYm7/XtJytS/2fe/YG+V+T8WCUGfeSjkc XDXTv/++Qrhs10mvk5mYUEF2Y/etqkESJuc3ZJ8/+7FZwVc4b2rwlMUTOMgXhUZ8wmsG ap5I8qW6e425jpuICBsan+lgMf9jeuz7nFeME5QEqH/ffvJxnWfIMLkGhhrZJCgDKeoT yt7Q== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20130820; h=x-gm-message-state:from:content-transfer-encoding:mime-version :subject:message-id:date:cc:to; bh=pyNIaXf/MKDNloo58Z3+OeUe8HvmFAvHUbpU6USJczI=; b=H6zLva86OfbLjDcBUc92N4T1q4d/sf+7hdXgT3VOojs0zItyI+Xc7Wkb23VZ6b3Rb9 hGh7d7NTeiLM5zpj/49TBTELDitNKjInp7NOGOEAZZ41MoDs2NN+roRfwQ0Y87hmPfg3 Gf4U33fgEQmUQ+vg/4H23ePO2mejgWH4561R41C4M1UNHfwb1FNOp3NJJ6pw0mkG1f7p td35QcU5aOyOodM8eqeJpgocXR2WG6e2jhzPbVBXDWKWziytkVBhzIeLHOQOvrVdRcMm vsmPSvvPFrT6T4mgLtauN61/X9AqjvgG4cE7ixN0LYc7wMAzm7Q3VcP5TuKmF2993cfX ftBg== X-Gm-Message-State: ABUngve6Jh11elG5ehdgeBJTfa85Y/XAYdBWNrw99GtJL7BGkB+8HUt5mwECXwV78KaCLA== X-Received: by 10.98.157.148 with SMTP id a20mr627540pfk.1.1477668641046; Fri, 28 Oct 2016 08:30:41 -0700 (PDT) Received: from [192.168.223.2] (ec2-52-59-250-132.eu-central-1.compute.amazonaws.com. [52.59.250.132]) by smtp.gmail.com with ESMTPSA id c15sm19763260pfd.53.2016.10.28.08.30.38 (version=TLS1_2 cipher=ECDHE-RSA-AES128-GCM-SHA256 bits=128/128); Fri, 28 Oct 2016 08:30:39 -0700 (PDT) From: Pat Ferrel Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: quoted-printable Mime-Version: 1.0 (Mac OS X Mail 10.0 \(3226\)) Subject: Trouble with HBase and PEventStore.aggregateProperties Message-Id: Date: Fri, 28 Oct 2016 08:30:45 -0700 Cc: user@predictionio.incubator.apache.org To: dev@predictionio.incubator.apache.org X-Mailer: Apple Mail (2.3226) archived-at: Fri, 28 Oct 2016 15:30:45 -0000 The use of PEventStore.aggregateProperties as shown below causes errors = when a large cluster is making the query. It seems to cause a full DB = scan, which results in timeouts. This may be because nearly 400 = (parallelism of the cluster) threads are making requests. But should = this result in a full scan? Is there a better want to get all "item" = properties? Is it possible to index a column that would make this mor = efficient? Any ideas would be appreciated. This makes frequent or fast = training on near Tb data impossible. val fieldsRDD: RDD[(ItemID, PropertyMap)] =3D = PEventStore.aggregateProperties( appName =3D dsp.appName, entityType =3D "item")(sc) BTW: If I reduce parallelism in Spark it slows other parts of the = algorithm unacceptably. I have also experimented with very large = RPC/Scanner timeouts of many minutes=E2=80=94to no avail. Job aborted due to stage failure: Task 44 in stage 147.0 failed 4 times, = most recent failure: Lost task 44.3 in stage 147.0 (TID 24833, = ip-172-16-3-9.eu-central-1.compute.internal): = org.apache.hadoop.hbase.DoNotRetryIOException: Failed after retry of = OutOfOrderScannerNextException: was there a rpc timeout?+details Job aborted due to stage failure: Task 44 in stage 147.0 failed 4 times, = most recent failure: Lost task 44.3 in stage 147.0 (TID 24833, = ip-172-16-3-9.eu-central-1.compute.internal): = org.apache.hadoop.hbase.DoNotRetryIOException: Failed after retry of = OutOfOrderScannerNextException: was there a rpc timeout? at = org.apache.hadoop.hbase.client.ClientScanner.next(ClientScanner.java:403) = at = org.apache.hadoop.hbase.mapreduce.TableRecordReaderImpl.nextKeyValue(Table= RecordReaderImpl.java:232) at = org.apache.hadoop.hbase.mapreduce.TableRecordReader.nextKeyValue(TableReco= rdReader.java:138) at=20