Date: Thu, 24 Jul 2014 19:00:08 +0430
Subject: Re: integrating Accumulo with solr
From: Ali Nazemian
To: "solr-user@lucene.apache.org"

Thank you very much. That is a nice idea, but how can Solr and Accumulo be kept synchronized this way? I know that Solr can be integrated with HDFS, and that Accumulo also runs on top of HDFS. So can I use HDFS as the integration point? I mean, can I set up Solr to use HDFS as the source of documents as well as the destination of documents?

Regards.

On Thu, Jul 24, 2014 at 4:33 PM, Joe Gresock wrote:

> Ali,
>
> Sounds like a good choice. It's pretty standard to store the primary storage id as a field in Solr so that you can search the full text in Solr and then retrieve the full document elsewhere.
>
> I would recommend creating a document structure in Solr with whatever fields you want indexed (most likely as text_en, etc.), and then store a "string" field named "content_id", which would be the Accumulo row id that you look up with a scan.
>
> One caveat -- Accumulo will be protected at the cell level, but if you need your Solr search results to be protected by complex authorization strings similar to Accumulo, you will need to write your own QParserPlugin and use post filtering:
> http://java.dzone.com/articles/custom-security-filtering-solr
>
> The code you see in that article is written for an earlier version of Solr, but it's not too difficult to adjust it for the latest (we've done so in our project).
> Once you've implemented this, you would store an "authorizations" string field in each Solr document, and pass in the authorizations that the user has access to in the fq parameter of every query. It's also not too bad to write something that parses the Accumulo authorizations string (like A&B&(C|D|E|F)) and interprets it accordingly in the QParserPlugin.
>
> This will give you true row level security in Solr and Accumulo, and it performs quite well in Solr.
>
> Let me know if you have any other questions.
>
> Joe
>
> On Thu, Jul 24, 2014 at 4:07 AM, Ali Nazemian wrote:
>
> > Dear Joe,
> > Hi,
> > I am going to store the crawled web pages in Accumulo as the main storage part of my project, and I need to give this data to Solr for indexing and user searches. I need to do some social and web analysis on my data as well as having some security features. Therefore Accumulo is my choice for the database part, and for indexing and search I am going to use Solr. Would you please guide me through that?
> >
> > On Thu, Jul 24, 2014 at 1:28 AM, Joe Gresock wrote:
> >
> > > We store data in both Solr and Accumulo -- do you have more details about what kind of data and indexing you want? Is there a reason you're thinking of using both databases in particular?
> > >
> > > On Wed, Jul 23, 2014 at 5:17 AM, Ali Nazemian wrote:
> > >
> > > > Dear All,
> > > > Hi,
> > > > I was wondering whether anybody out there has tried to integrate Solr with Accumulo. I was thinking about using Accumulo on top of HDFS and using Solr to index the data inside Accumulo. Do you have any idea how I can do such an integration?
> > > >
> > > > Best regards.
> > > >
> > > > --
> > > > A.Nazemian
> > >
> > > --
> > > I know what it is to be in need, and I know what it is to have plenty. I have learned the secret of being content in any and every situation, whether well fed or hungry, whether living in plenty or in want. I can do all this through him who gives me strength. *-Philippians 4:12-13*
> >
> > --
> > A.Nazemian

--
A.Nazemian
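
The quoted suggestion to parse an Accumulo-style authorizations string such as A&B&(C|D|E|F) and match it against the authorizations a user holds can be illustrated outside Solr. What follows is only a minimal Python sketch under stated assumptions: it is not Accumulo's own parser and not a QParserPlugin, the names `tokenize` and `is_visible` are hypothetical, and the allowed label characters, the rule that mixing & and | needs parentheses, and the treatment of an empty expression as "visible to everyone" are simplifying assumptions.

```python
import re

# Assumption: labels are made of letters, digits and a few punctuation marks.
TOKEN = re.compile(r"\s*([A-Za-z0-9_.:/-]+|[&|()])")


def tokenize(expr):
    """Split an expression such as 'A&B&(C|D)' into labels and operators."""
    expr = expr.strip()
    tokens, pos = [], 0
    while pos < len(expr):
        match = TOKEN.match(expr, pos)
        if match is None:
            raise ValueError(f"cannot tokenize expression at position {pos}")
        tokens.append(match.group(1))
        pos = match.end()
    return tokens


def is_visible(expr, auths):
    """Return True if the set of authorizations `auths` satisfies `expr`."""
    tokens = tokenize(expr)
    if not tokens:
        return True  # assumption: an empty expression restricts nothing
    pos = 0

    def parse_term():
        # A term is either a single label or a parenthesized sub-expression.
        nonlocal pos
        if pos >= len(tokens):
            raise ValueError("unexpected end of expression")
        tok = tokens[pos]
        pos += 1
        if tok == "(":
            value = parse_expr()
            if pos >= len(tokens) or tokens[pos] != ")":
                raise ValueError("missing closing parenthesis")
            pos += 1
            return value
        if tok in ("&", "|", ")"):
            raise ValueError(f"unexpected token {tok!r}")
        return tok in auths

    def parse_expr():
        # A run of terms joined by one operator; mixing needs parentheses.
        nonlocal pos
        value = parse_term()
        op = None
        while pos < len(tokens) and tokens[pos] in ("&", "|"):
            if op is None:
                op = tokens[pos]
            elif tokens[pos] != op:
                raise ValueError("mixing & and | requires parentheses")
            pos += 1
            rhs = parse_term()  # always parse the right side, then combine
            value = (value and rhs) if op == "&" else (value or rhs)
        return value

    result = parse_expr()
    if pos != len(tokens):
        raise ValueError(f"unexpected token {tokens[pos]!r}")
    return result


if __name__ == "__main__":
    user_auths = {"A", "B", "D"}
    print(is_visible("A&B&(C|D|E|F)", user_auths))
```

In a setup like the one described in the quoted reply, a check of this kind would run per candidate document inside the post filter, with the expression read from the document's "authorizations" field and the user's authorizations taken from the query. How those values are passed through fq depends on the custom plugin and is not shown here.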