From general-return-36837-apmail-incubator-general-archive=incubator.apache.org@incubator.apache.org Mon Jul 30 14:51:06 2012 Return-Path: X-Original-To: apmail-incubator-general-archive@www.apache.org Delivered-To: apmail-incubator-general-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 56FFBD4C6 for ; Mon, 30 Jul 2012 14:51:06 +0000 (UTC) Received: (qmail 78763 invoked by uid 500); 30 Jul 2012 14:51:05 -0000 Delivered-To: apmail-incubator-general-archive@incubator.apache.org Received: (qmail 78459 invoked by uid 500); 30 Jul 2012 14:51:05 -0000 Mailing-List: contact general-help@incubator.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: general@incubator.apache.org Delivered-To: mailing list general@incubator.apache.org Received: (qmail 78451 invoked by uid 99); 30 Jul 2012 14:51:05 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 30 Jul 2012 14:51:05 +0000 X-ASF-Spam-Status: No, hits=1.5 required=5.0 tests=HTML_MESSAGE,RCVD_IN_DNSWL_LOW,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: domain of alexei.fedotov@gmail.com designates 209.85.215.47 as permitted sender) Received: from [209.85.215.47] (HELO mail-lpp01m010-f47.google.com) (209.85.215.47) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 30 Jul 2012 14:50:59 +0000 Received: by lags15 with SMTP id s15so2956682lag.6 for ; Mon, 30 Jul 2012 07:50:38 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:in-reply-to:references:date:message-id:subject:from:to :content-type; bh=0vKrxcwvNkjb0cQsydLNw5T2xxm1kPtVGdjHK+e6Ung=; b=a2sN/qaIszwWo2KCztKbMyr6HMlS5tbmMCri/Gh2/ZKgUvOpWKiE0ToNVq89qXHlbK hn56OGzIFKe/vsaNr1MlAVBXPMmc7V7/TxU8r66K7yw5R7AgG3X9+hU/Ugef+8LYoHOt gi7ezk/G0vHJt6UZibBN8s/6RAXgPitrH8xGGc1LcyLZ8d1USJCVy2E5EpxvObO7JPT/ 9zS4C8nhCueGQVL86qdreW+P+iiEw1mx8A4fklKcXq9Jhi7XYD98WhupCWEf1ydz5v7T /5yFbB8rlzChztgh4RELbF5kdf/y9cpRvIlWRrn4KihFCrdEQk0fG2WXVkySTyFgLpP3 zfXA== MIME-Version: 1.0 Received: by 10.152.148.169 with SMTP id tt9mr11559761lab.49.1343659838590; Mon, 30 Jul 2012 07:50:38 -0700 (PDT) Received: by 10.112.82.4 with HTTP; Mon, 30 Jul 2012 07:50:37 -0700 (PDT) Received: by 10.112.82.4 with HTTP; Mon, 30 Jul 2012 07:50:37 -0700 (PDT) In-Reply-To: References: Date: Mon, 30 Jul 2012 18:50:37 +0400 Message-ID: Subject: Re: [VOTE] Accept Blur into the Apache Incubator From: Alexei Fedotov To: general@incubator.apache.org Content-Type: multipart/alternative; boundary=e89a8f23455d60cd2704c60d2fb4 --e89a8f23455d60cd2704c60d2fb4 Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: quoted-printable +1 (non-binding) 20.07.2012 20:43 =D0=BF=D0=BE=D0=BB=D1=8C=D0=B7=D0=BE=D0=B2=D0=B0=D1=82=D0= =B5=D0=BB=D1=8C "Aaron McCurry" =D0=BD=D0=B0=D0=BF=D0= =B8=D1=81=D0=B0=D0=BB: > I would like to call a vote for accepting Blur for incubation in the > Apache Incubator. The full proposal is available below. > > Please cast your vote: > > [ ] +1, bring Blur into Incubator > [ ] +0, I don't care either way, > [ ] -1, do not bring Blur into Incubator, because... > > This vote will be open for 72 hours and only votes from the Incubator > PMC are binding. > > Thank you for your consideration! > > Aaron > > http://wiki.apache.org/incubator/BlurProposal > > =3D Blur Proposal =3D > > =3D=3D Abstract =3D=3D > Blur is a search platform capable of searching massive amounts of data > in a cloud computing environment. Blur leverages several existing > Apache projects, including Apache Lucene, Apache Hadoop, Apache > !ZooKeeper and Apache Thrift. Both bulk and near real time (NRT) > updates are possible with Blur. Bulk updates are accomplished using > Hadoop Map/Reduce and NRT are performed through direct Thrift calls. > > =3D=3D Proposal =3D=3D > Blur is an open source search platform capable of querying massive > amounts of data at incredible speeds. Rather than using the flat, > document-like data model used by most search solutions, Blur allows > you to build rich data models and search them in a semi-relational > manner similar to joins while querying a relational database. Using > Blur, you can get precise search results against terabytes of data at > Google-like speeds. Blur leverages multiple open source projects > including Hadoop, Lucene, Thrift and !ZooKeeper to create an > environment where structured data can be transformed into an index > that runs on a Hadoop cluster. Blur uses the power of Map/Reduce for > bulk indexing into Blur. Server failures are handled automatically by > using !ZooKeeper for cluster state and HDFS for index storage. > > =3D=3D Background =3D=3D > Blur was created by Aaron !McCurry in 2010. Blur was developed to > solve the challenges in dealing with searching huge quantities of data > that the traditional RDBMS solutions could not cope with while still > providing JOIN-like capabilities to query the data. Several other > open source projects have implemented aspects of this design including > elasticsearch, Katta and Apache Solr. > > =3D=3D Rationale =3D=3D > There is a need for a distributed search capability within the Hadoop > ecosystem. Currently, there are no other search solutions that > natively leverage HDFS and the failover features of Hadoop in the same > manner as the Blur project. The communities we expect to be most > interested in such a project are government, health care, and other > industries where scalability is a concern. We have made much progress > in developing this project over the past 2 years and believe both the > project and the interested communities would benefit from this work > being openly available and having open development. In future > versions of Blur the API will more closely follow the API=E2=80=99s provi= ded > in Lucene so that systems that already use Lucene can more easily > scale with Blur. Blur can be viewed as a query execution engine that > Lucene based solutions can utilize when scale becomes an issue. > > =3D=3D Initial Goals =3D=3D > The initial goals of the project are: > * To migrate the Blur codebase, issue tracking and wiki from > github.com and integrate the project with the ASF infrastructure. > * Add new committers to the project and grow the community in "The Apach= e > Way". > > =3D=3D Current Status =3D=3D > > =3D=3D=3D Meritocracy =3D=3D=3D > Blur was initially developed by Aaron !McCurry in June 2010. Since > then Blur has continued to evolve with the support of a small > development team at Near Infinity. As a part of the Apache Software > Foundation, the Apache Blur team intends to strongly encourage the > community to help with and contribute to the project. Apache Blur > will actively seek potential committers and help them become familiar > with the codebase. > > =3D=3D=3D Community =3D=3D=3D > A small community has developed around Blur and several project teams > are currently using Blur for their big data search capability. The > source code is currently available on GitHub and there is a dedicated > website (blur.io) that provides an overview of the project. Blur has > been shared with several members of the Apache community and has been > presented at the Bay Area HUG (see > http://www.meetup.com/hadoop/events/20109471/). > > =3D=3D=3D Core Developers =3D=3D=3D > The current developers are employed by Near Infinity Corporation, but > we anticipate interest developing among other companies. > > =3D=3D=3D Alignment =3D=3D=3D > Blur is built on top of a number of Apache projects; Hadoop, Lucene, > !ZooKeeper, and Thrift. It builds with Maven. During the course of > Blur development, a couple of patches have been committed back to the > Lucene project, including LUCENE-2205 and LUCENE-2215. Due to the > strong relationship with the before mentioned Apache projects, the > incubator is a good match for Blur. > > =3D=3D Known Risks =3D=3D > > =3D=3D=3D Orphaned Products =3D=3D=3D > There is only a small risk of being orphaned. The customers that > currently use Blur are committed to improving the codebase of the > project due to its fulfilling needs not addressed by any other > software. In addition, one customer is providing financial support to > further develop Blur given its importance on mission-critical > projects. > > =3D=3D=3D Inexperience with Open Source =3D=3D=3D > The codebase has been treated internally as an open source project > since its beginning, and Near Infinity has extensive experience > developing and releasing open source projects > (http://www.nearinfinity.com/products/open_source). We do not > anticipate difficulty in operating under the Apache Way. > > =3D=3D=3D Homogeneous Developers =3D=3D=3D > Current developers are all employed by Near Infinity but we are > actively seeking contributors from different companies and would > welcome their participation. > > =3D=3D=3D Reliance on Salaried Developers =3D=3D=3D > Blur was originally created by Aaron !McCurry as a personal project > and he remains the primary contributor. Currently, Aaron=E2=80=99s emplo= yer > (Near Infinity) fully supports his continued participation with paid, > dedicated time to work on Blur. All other current developers are paid > by Near Infinity to work on Blur as well. > > =3D=3D=3D Relationships with Other Apache Products =3D=3D=3D > Blur dependencies: > > * Apache Hadoop > * Apache Lucene > * Apache !ZooKeeper > * Apache Thrift > * Apache log4j > > =3D=3D=3D Apache Brand =3D=3D=3D > Our interest in releasing this code as an Apache project is due to its > strong relationship with other Apache projects, i.e. Blur has > dependencies on Hadoop, Lucene, !ZooKeeper, and Thrift and its > uniqueness within the Hadoop ecosystem. > > =3D=3D Documentation =3D=3D > Current documentation can be found at http://blur.io and > https://github.com/nearinfinity/blur. > > =3D=3D Initial Source =3D=3D > Blur has been in development since summer 2010. The core codebase > consists of about ~29,000 (~10,000 if the generated RPC code is not > included) lines of code mainly Java. > > =3D=3D Source and Intellectual Property Submission Plan =3D=3D > Blur core code, examples, documentation, and training materials will > be submitted by Near Infinity Corporation. > > =3D=3D External Dependencies =3D=3D > * concurrentlinkedhashmap - Apache 2.0 License - > http://code.google.com/p/concurrentlinkedhashmap/ > > =3D=3D Cryptography =3D=3D > none > > =3D=3D Required Resources =3D=3D > * Mailing Lists > * blur-private > * blur-dev > * blur-commits > * blur-user > * Subversion Directory > * https://git-wip-us.apache.org/repos/asf/blur.git > * Issue Tracking > * JIRA > * Continuous Integration > * Jenkins > * Web > * http://incubator.apache.org/blur/wiki at http://wiki.apache.org > or http://cwiki.apache.org > > =3D=3D Initial Committers =3D=3D > * Aaron !McCurry (aaron.mccurry at nearinfinity dot com) > * Scott Leberknight (scott.leberknight at nearinfinity dot com) > * Ryan Gimmy (ryan.gimmy at nearinfinity dot com) > * Tim Williams (twilliams at apache dot org) > * Patrick Hunt (phunt at apache dot org) > * Doug Cutting (cutting at apache dot org) > > =3D=3D Affiliations =3D=3D > * Aaron !McCurry, Near Infinity > * Scott Leberknight, Near Infinity > * Ryan Gimmy, Near Infinity > * Patrick Hunt, Cloudera > * Doug Cutting, Cloudera > > =3D=3D Sponsors =3D=3D > * Champion: Patrick Hunt > > =3D=3D Nominated Mentors =3D=3D > * Tim Williams (twilliams at apache dot org) > * Doug Cutting (cutting at apache dot org) > * Patrick Hunt (phunt at apache dot org) > > =3D=3D Sponsoring Entity =3D=3D > * Apache Incubator > > --------------------------------------------------------------------- > To unsubscribe, e-mail: general-unsubscribe@incubator.apache.org > For additional commands, e-mail: general-help@incubator.apache.org > > --e89a8f23455d60cd2704c60d2fb4--