Return-Path: X-Original-To: archive-asf-public-internal@cust-asf2.ponee.io Delivered-To: archive-asf-public-internal@cust-asf2.ponee.io Received: from cust-asf.ponee.io (cust-asf.ponee.io [163.172.22.183]) by cust-asf2.ponee.io (Postfix) with ESMTP id A6F11200BA3 for ; Thu, 20 Oct 2016 20:37:59 +0200 (CEST) Received: by cust-asf.ponee.io (Postfix) id A59B6160AF5; Thu, 20 Oct 2016 18:37:59 +0000 (UTC) Delivered-To: archive-asf-public@cust-asf.ponee.io Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by cust-asf.ponee.io (Postfix) with SMTP id C4AC0160ACC for ; Thu, 20 Oct 2016 20:37:58 +0200 (CEST) Received: (qmail 54510 invoked by uid 500); 20 Oct 2016 18:37:58 -0000 Mailing-List: contact dev-help@streams.incubator.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@streams.incubator.apache.org Delivered-To: mailing list dev@streams.incubator.apache.org Received: (qmail 54495 invoked by uid 99); 20 Oct 2016 18:37:57 -0000 Received: from pnap-us-west-generic-nat.apache.org (HELO spamd1-us-west.apache.org) (209.188.14.142) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 20 Oct 2016 18:37:57 +0000 Received: from localhost (localhost [127.0.0.1]) by spamd1-us-west.apache.org (ASF Mail Server at spamd1-us-west.apache.org) with ESMTP id 3A1A9C0F48 for ; Thu, 20 Oct 2016 18:37:57 +0000 (UTC) X-Virus-Scanned: Debian amavisd-new at spamd1-us-west.apache.org X-Spam-Flag: NO X-Spam-Score: 1.997 X-Spam-Level: * X-Spam-Status: No, score=1.997 tagged_above=-999 required=6.31 tests=[HTML_MESSAGE=2, RCVD_IN_DNSWL_NONE=-0.0001, RCVD_IN_MSPIKE_H2=-0.001, SPF_HELO_PASS=-0.001, SPF_PASS=-0.001] autolearn=disabled Received: from mx1-lw-us.apache.org ([10.40.0.8]) by localhost (spamd1-us-west.apache.org [10.40.0.7]) (amavisd-new, port 10024) with ESMTP id pOUyiA7YLCkt for ; Thu, 20 Oct 2016 18:37:55 +0000 (UTC) Received: from NAM02-BL2-obe.outbound.protection.outlook.com (mail-bl2nam02on0125.outbound.protection.outlook.com [104.47.38.125]) by mx1-lw-us.apache.org (ASF Mail Server at mx1-lw-us.apache.org) with ESMTPS id 9DB5C5F479 for ; Thu, 20 Oct 2016 18:37:54 +0000 (UTC) Received: from CY1PR0601MB1488.namprd06.prod.outlook.com (10.163.22.139) by CY1PR0601MB1486.namprd06.prod.outlook.com (10.163.22.12) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_256_CBC_SHA384_P384) id 15.1.659.11; Thu, 20 Oct 2016 18:32:32 +0000 Received: from CY1PR0601MB1488.namprd06.prod.outlook.com ([10.163.22.139]) by CY1PR0601MB1488.namprd06.prod.outlook.com ([10.163.22.139]) with mapi id 15.01.0659.025; Thu, 20 Oct 2016 18:32:32 +0000 From: Benjamin Young To: sblackmon , "dev@streams.incubator.apache.org" CC: Matt Franklin Subject: Re: Granary & SocketHub Thread-Topic: Granary & SocketHub Thread-Index: AQHSKU1krrRQMxxt7kO2c+heMbg3EKCxm6oBgAAHMSs= Date: Thu, 20 Oct 2016 18:32:32 +0000 Message-ID: References: , In-Reply-To: Accept-Language: en-US Content-Language: en-US X-MS-Has-Attach: X-MS-TNEF-Correlator: authentication-results: spf=none (sender IP is ) smtp.mailfrom=byoung@bigbluehat.com; x-originating-ip: [24.196.190.162] x-ms-office365-filtering-correlation-id: 58f0e1ef-2d76-4de1-e6f2-08d3f91774bd x-microsoft-exchange-diagnostics: 1;CY1PR0601MB1486;7:79ORx7tyCY2FFL0iGvPy1F/5LXxYcQnfZcLdM5u57LEludylwVfddwu/3r3IKyf2MEHlllf7sYcVcrA69UZeEzp84KJCHZ/MaGZ4+nBGo96YEa7KfD8Mm9kifuPecBWrCvXo7AKl/NNh4bFP6Yqn/mus9EXbd+gsgnlrIx740+mwO6Hr1sakUBQKuEtNi2gkT/kH9Jqhv4iM6NLnl0JDdLCakkc1k+qwSXEk94Fp3ECTKrHFUvodUz3xF0p709o9DQ3POJlNKOuNhXi/DNNui7q29BkzH5fkWRfYwBa5B79Tc5n9w3j5m7M8o3RL1Zqf814eEHOQ+DJugtrbwmxirO2tEqtSzijbLiyAODOB82E=;20:fBRvx//lnwxlj4eEFcZ1malMNSSyyYfkL5JomZLzdavlnh1X0XWNI6yGvkpy47pTdaNzBLpAQ386MuV1d8bsQMsn4Uk53Q0cKFzB2fDByI4HwyguHzeveLOO6nTyaga5202kZZKVhTgppOzbbjlBbnwDiUnFIw6s4mTg2FOdPAQ= x-microsoft-antispam: UriScan:;BCL:0;PCL:0;RULEID:;SRVR:CY1PR0601MB1486; x-microsoft-antispam-prvs: x-exchange-antispam-report-test: UriScan:(166708455590820)(81160342030619)(5213294742642); x-exchange-antispam-report-cfa-test: BCL:0;PCL:0;RULEID:(6040176)(601004)(2401047)(5005006)(8121501046)(10201501046)(3002001)(6043046)(6042046);SRVR:CY1PR0601MB1486;BCL:0;PCL:0;RULEID:;SRVR:CY1PR0601MB1486; x-forefront-prvs: 01018CB5B3 x-forefront-antispam-report: SFV:NSPM;SFS:(10019020)(7916002)(199003)(377454003)(189002)(24454002)(92566002)(19580405001)(19580395003)(19625215002)(76176999)(5002640100001)(54356999)(1720100001)(2906002)(11100500001)(76576001)(3660700001)(15975445007)(77096005)(2501003)(2900100001)(99286002)(74316002)(50986999)(33656002)(68736007)(2950100002)(5660300001)(105586002)(86362001)(19627405001)(106356001)(106116001)(16236675004)(4326007)(81156014)(81166006)(101416001)(7846002)(9686002)(189998001)(102836003)(6116002)(10400500002)(3280700002)(586003)(87936001)(3846002)(7696004)(7736002)(7906003)(122556002)(8936002)(19617315012)(5001770100001)(66066001)(8676002)(15395725005)(97736004);DIR:OUT;SFP:1102;SCL:1;SRVR:CY1PR0601MB1486;H:CY1PR0601MB1488.namprd06.prod.outlook.com;FPR:;SPF:None;PTR:InfoNoRecords;A:3;MX:1;LANG:en; received-spf: None (protection.outlook.com: bigbluehat.com does not designate permitted sender hosts) spamdiagnosticoutput: 1:99 spamdiagnosticmetadata: NSPM Content-Type: multipart/alternative; boundary="_000_CY1PR0601MB14882FB21A908A2493FE6713B2D50CY1PR0601MB1488_" MIME-Version: 1.0 X-OriginatorOrg: bigbluehat.com X-MS-Exchange-CrossTenant-originalarrivaltime: 20 Oct 2016 18:32:32.0281 (UTC) X-MS-Exchange-CrossTenant-fromentityheader: Hosted X-MS-Exchange-CrossTenant-id: 98d05955-de73-4052-a2d4-cbaabc15b584 X-MS-Exchange-Transport-CrossTenantHeadersStamped: CY1PR0601MB1486 archived-at: Thu, 20 Oct 2016 18:37:59 -0000 --_000_CY1PR0601MB14882FB21A908A2493FE6713B2D50CY1PR0601MB1488_ Content-Type: text/plain; charset="us-ascii" Content-Transfer-Encoding: quoted-printable Great points, Steven. What's always attracted me to Apache Streams is it's descriptiveness (via J= SON Schemas documents) vs. prescriptive-ness. Granary's approach is (curren= tly? ;) ) more prescriptive: https://github.com/snarfed/granary/blob/master/granary/twitter.py vs. https://github.com/apache/streams/tree/STREAMS-26/streams-contrib/streams-p= rovider-twitter ...which is mostly (though not all) a collection of .json and .conf files w= ith a handful of .java files needed (afaict) for last-mile integration with= one's tool. The future I dream about is one where I can pick my tool for my idiosyncrat= ic language, operating system, license reasons, but they'll all work off sh= ared, descriptive "knowledge" documents. Otherwise, we're all pulling separately, and end up with snowflake systems = to process snowflake APIs. However, I also know it's unlikely everyone will= come "under one roof" to work on things. My hope, though, is that the outp= ut of this group (and Granary and Sockethub and...) will be re-usable by as= wide an audience as possible--hence the value of description over prescrip= tion (at least in my book ;) ). Granted, if I'm barking up the wrong tree (again), I'm happy to wander off.= .. Is anything in the above sane? ;) Cheers! Benjamin -- http://bigbluehat.com/ http://linkedin.com/in/benjaminyoung ________________________________ From: sblackmon Sent: Thursday, October 20, 2016 1:26:38 PM To: dev@streams.incubator.apache.org Cc: Matt Franklin; Benjamin Young Subject: Re: Granary & SocketHub On October 18, 2016 at 6:09:49 PM, Matt Franklin (m.ben.franklin@gmail.com<= mailto:m.ben.franklin@gmail.com>) wrote: On Tue, Oct 18, 2016 at 10:39 AM Benjamin Young wrote: > (resending from the correct account...likely the other got spammed...) > > Granary is a project with similar ideas and intents as Apache Streams > (which also needs AS2 support ;) ): > https://github.com/snarfed/granary > Ryan from Granary is on the list I think. Hey Ryan! Cool stuff, too bad i= t's python :) > In fact Apache Streams gets a mention in their "Related Work" section: > https://github.com/snarfed/granary#related-work > > Also mentioned in the Granary related work section is SocketHub: > https://github.com/sockethub/sockethub > Cool stuff, too bad it's LGPL :) > It's aims are similar, but it's reaching way beyond Web-based social APIs > and "back" to including things like IRC, Email, etc. Non-SNS data sources are important for sure. I've posted some work on my pe= rsonal github using the streams framework to parse MBOX files - https://git= hub.com/steveblackmon/streams-apache - and to collect quantified self data = - https://github.com/steveblackmon/humanapi-streams IRC is interesting as well. > What's significant about both these projects (and others they link to) ar= e > the stories they're telling developers-which we can crib from as we think > about the Streams "pitch." They also have relatively minimal setup > docs-which Streams is also heading toward (go Steve!). > Agreed this is key The existence of other open-source projects with similar themes suggests we= 're onto an important problem. We should pay attention to these projects a= nd what is working for them WRT user growth, community growth, tech media c= overage, etc... > > Again, my key objective is to understand the Apache Streams vision along > side projects like these and within the wider space of consolidating soci= al > data. What market does it serve? Is it "personal" (as these projects seem > to be)? Or commercial? Or developer-only (library/framework for wiring up > your own idiosyncratic stuff)? > I think the overall objective of streams remains very similar to what it started as: A way to easily and flexibly ingest multiple different sources of 'activity' data in a normalized ActivityStreams format. For me personally, my interest is in ingesting this data at scale and with as little internally-maintained code as possible. While most of the development so far has been geared toward enabling back-e= nd / commercial-scale data collection and management, I think the future sh= ould be more about enabling individuals and businesses to transcend data si= los using computing resources and code entirely under their own control. Th= is might mean supporting regular users with a full-featured SaaS applicatio= n in addition to continued work on data interoperability. > > Thanks for reading, pondering, and helping me help. :) > > Cheers! > Benjamin > > -- > http://bigbluehat.com/ > http://linkedin.com/in/benjaminyoung > > --_000_CY1PR0601MB14882FB21A908A2493FE6713B2D50CY1PR0601MB1488_--