From: sblackmon
To: dev@community.apache.org
Cc: dev@streams.incubator.apache.org
Reply-To: dev@streams.incubator.apache.org
Date: Mon, 21 Nov 2016 11:48:26 -0600
Subject: Social Media Metrics using Apache stack

Hello ComDev,

The Streams podling has been brainstorming ways to increase awareness of the project and its capabilities.
We've also been working to make it easier to get started as a user, without starting the journey by downloading the JDK, Maven, and friends. Using the software to provide benefit to the Foundation seems like a good thing to try.

One use case for Streams is to build personal or organizational datasets of social media profiles and content for internal development and analysis, using the technologies and tools you and your organization prefer, rather than those provided by the upstream system.

I took the liberty of creating a few Zeppelin notebooks which collect Apache project profiles and posts, normalize them to Activity Streams format, and interact with them using Spark DataFrames.

The notebooks are currently hosted in my zeppelinhub account, which anyone with the links below can access.

https://www.zeppelinhub.com/viewer/notebooks/bm90ZTovL3N0ZXZlYmxhY2ttb24vYXBhY2hlLXplcHBlbGluLWRhc2hib2FyZC84YjQ5YmY3MWIxYTU0ZTE2YjlkMDQyMTliMzNlMjQzYS9ub3RlLmpzb24

https://www.zeppelinhub.com/viewer/notebooks/bm90ZTovL3N0ZXZlYmxhY2ttb24vYXBhY2hlLXplcHBlbGluLWRhc2hib2FyZC9lNzQzZjRkZGVkMGY0YjA3YTkzZTQ2NWFkYjU2ZTQxOS9ub3RlLmpzb24

https://www.zeppelinhub.com/viewer/notebooks/bm90ZTovL3N0ZXZlYmxhY2ttb24vYXBhY2hlLXplcHBlbGluLWRhc2hib2FyZC8zZmQ3M2Y1OWEzOGE0YmM2YjFkMGM4MzBkNTczZDU0Mi9ub3RlLmpzb24

If this group sees potential benefit, I'd be happy to work to set them up for use by anyone at Apache in a dedicated Zeppelin deployment and take the lead on maintaining them going forward.

In any case, we'd appreciate any feedback on what would make this prototype more valuable.

Background on Streams:

Apache Streams (incubating) unifies a diverse world of digital profiles and online activities into common formats and vocabularies, and makes these datasets accessible across a variety of databases, devices, and platforms for streaming, browsing, search, sharing, and analytics use-cases.
Streams contains libraries and patterns for specifying, publishing, and inter-linking schemas, and assists with conversion of activities (posts, shares, likes, follows, etc.) and objects (profiles, pages, photos, videos, etc.) between the representation, format, and encoding preferred by supported data providers (Twitter, Instagram, etc.) and storage services (Cassandra, Elasticsearch, HBase, HDFS, Neo4J, etc.).

In theory, pretty much any JSON or XML API which uses a "look-up by ID and type" model can be coerced into collections of activity-streams normalized profiles and posts. Systems such as GitHub, JIRA, and MeetUp could be added to the roadmap and have notebooks created once those providers are built.
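
To make the "look-up by ID and type" idea concrete, here is a rough sketch in Python of how a raw record from such an API might be mapped onto an Activity Streams-style document. This is not the Streams API: the in-memory `FAKE_API`, the payload fields (`login`, `name`, `author_id`, `text`, `created_at`), the `id:<provider>:<type>:<id>` identifier scheme, and all function names are assumptions made up for illustration. Only the output property names (`actor`, `verb`, `object`, `published`, `provider`, `objectType`, `displayName`, `content`) follow the Activity Streams 1.0 JSON vocabulary.

```python
from typing import Any, Dict, Tuple

Record = Dict[str, Any]

# Stand-in for a remote JSON API that answers "look-up by ID and type".
# The payload shapes below are invented for illustration only.
FAKE_API: Dict[Tuple[str, str], Record] = {
    ("user", "42"): {"login": "exampleuser", "name": "Example User"},
    ("post", "1001"): {
        "author_id": "42",
        "text": "Hello from an Apache project",
        "created_at": "2016-11-21T17:48:26Z",
    },
}


def lookup(type_: str, id_: str) -> Record:
    """Fetch one raw record by type and ID (here, from the in-memory stub)."""
    return FAKE_API[(type_, id_)]


def normalize_profile(provider: str, id_: str, raw: Record) -> Record:
    """Map a raw user record onto an Activity Streams-style actor object."""
    return {
        "objectType": "person",
        "id": f"id:{provider}:user:{id_}",  # hypothetical identifier scheme
        "displayName": raw["name"],
    }


def normalize_post(provider: str, id_: str, raw: Record) -> Record:
    """Map a raw post record onto an Activity Streams-style activity."""
    author_id = raw["author_id"]
    # Resolve the author with a second look-up by type and ID.
    actor = normalize_profile(provider, author_id, lookup("user", author_id))
    return {
        "id": f"id:{provider}:post:{id_}",
        "verb": "post",
        "published": raw["created_at"],
        "provider": {"id": f"id:providers:{provider}"},
        "actor": actor,
        "object": {
            "objectType": "note",
            "id": f"id:{provider}:note:{id_}",
            "content": raw["text"],
        },
    }


if __name__ == "__main__":
    activity = normalize_post("example", "1001", lookup("post", "1001"))
    print(activity["actor"]["displayName"], activity["verb"], activity["object"]["content"])
```

Once every provider's records are reduced to this common shape, the same downstream queries (for example in a notebook) can run over posts from any source without provider-specific handling.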