Return-Path: X-Original-To: apmail-hbase-user-archive@www.apache.org Delivered-To: apmail-hbase-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id E1A23D970 for ; Mon, 24 Sep 2012 11:51:55 +0000 (UTC) Received: (qmail 64141 invoked by uid 500); 24 Sep 2012 11:51:54 -0000 Delivered-To: apmail-hbase-user-archive@hbase.apache.org Received: (qmail 64035 invoked by uid 500); 24 Sep 2012 11:51:53 -0000 Mailing-List: contact user-help@hbase.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@hbase.apache.org Delivered-To: mailing list user@hbase.apache.org Received: (qmail 64010 invoked by uid 99); 24 Sep 2012 11:51:53 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 24 Sep 2012 11:51:53 +0000 X-ASF-Spam-Status: No, hits=1.7 required=5.0 tests=FSL_RCVD_USER,HTML_FONT_SIZE_LARGE,HTML_MESSAGE,MARKETING_PARTNERS,RCVD_IN_DNSWL_LOW,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: domain of dwivedishashwat@gmail.com designates 209.85.212.41 as permitted sender) Received: from [209.85.212.41] (HELO mail-vb0-f41.google.com) (209.85.212.41) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 24 Sep 2012 11:51:47 +0000 Received: by vbkv13 with SMTP id v13so7287362vbk.14 for ; Mon, 24 Sep 2012 04:51:26 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:in-reply-to:references:date:message-id:subject:from:to :content-type; bh=4OkE8EvMdZZTBoKbF55XFuRYUdSB0jgFPW32pSKEvto=; b=GR8+wkHxYd3s9o4T9J0ymCu9q8+YK9F6Xk9KnLpNn91pc3rqyRNr69XQqJamIllhS4 /sY3xGszoVzvnyOWZEDIUq7YzQi9K+l4AHG9ks/ntgicgR847Ed+1+LXjwbhl/HQipDI o3LOm+dTs6kQl27JtHi+FLy9G49nUMx1LzNKp3EbRZhtpuOzX4AJ50eNo/Nv7me0FnvD 5BJK787Y6rkiEylbrw273xYL1ORvc5v9a2lcZYWSR54seU/EWD6nAQIaRMjYECadZOqv BXaD4TTs/I1An47NzrcOt5Dt5DNAVCetJZl2x87rbp2IwjNwv3rnGye/hQq+NeUv3oFx /gGg== MIME-Version: 1.0 Received: by 10.52.34.162 with SMTP id a2mr5930875vdj.19.1348487486026; Mon, 24 Sep 2012 04:51:26 -0700 (PDT) Received: by 10.58.56.5 with HTTP; Mon, 24 Sep 2012 04:51:25 -0700 (PDT) In-Reply-To: References: <4969F4E54725764F9C5FBB0A0E4FC3ED03C227EB8E@extxmb35.nam.nsroot.net> <4969F4E54725764F9C5FBB0A0E4FC3ED03C227EC77@extxmb35.nam.nsroot.net> <4969F4E54725764F9C5FBB0A0E4FC3ED03C227EE71@extxmb35.nam.nsroot.net> Date: Mon, 24 Sep 2012 17:21:25 +0530 Message-ID: Subject: Re: Hbase and Search Integration From: shashwat shriparv To: user@hbase.apache.org Content-Type: multipart/alternative; boundary=20cf3079b89896b2ad04ca71352b --20cf3079b89896b2ad04ca71352b Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: quoted-printable Can you try this? http://bizosyshsearch.sourceforge.net/ Regards =E2=88=9E Shashwat Shriparv On Mon, Sep 24, 2012 at 5:01 PM, Jignesh Patel wro= te: > I didn't see any respond. Saurabh do you happen to find the solution for > integrating search with HBase. > > > On Tue, Mar 20, 2012 at 11:12 AM, Agarwal, Saurabh < > saurabh.agarwal@citi.com > > wrote: > > > Thanks Ryan. Have you created that Solr secondary index as one of Hbase > > Table? > > > > > > -----Original Message----- > > From: Ryan Tabora [mailto:ratabora@gmail.com] > > Sent: Tuesday, March 20, 2012 10:59 AM > > To: user@hbase.apache.org > > Subject: Re: Hbase and Search Integration > > > > I would suggest when you are loading in the log data you should create = a > > secondary index in Solr that creates indices based on HBase row key. Th= is > > is the way we are implementing Solr search on Hbase in my current > project. > > > > Thank you, > > Ryan > > > > On Mar 20, 2012, at 7:27 AM, Imran M Yousuf wrote: > > > > > Hi Saurabh, > > > > > > On Tue, Mar 20, 2012 at 8:10 PM, Agarwal, Saurabh > > > wrote: > > >> Thanks Imran. Ton of good functionality in Smart CMS. > > >> > > > > > > There are a lot of functionality, but not that they all need to be > > > used at once :). Let me know if you want to give your use a try with > > > Smart CMS, I would be glad to help you out. > > > > > > Thank you, > > > > > > Imran > > > > > >> For our search use case, CMS might be overkill. Lily looks good at > > first glance. Do anyone has experience? > > >> > > >> Thanks, > > >> Saurabh. > > >> > > >> -----Original Message----- > > >> From: Imran M Yousuf [mailto:imyousuf@gmail.com] > > >> Sent: Tuesday, March 20, 2012 9:46 AM > > >> To: user@hbase.apache.org > > >> Subject: Re: Hbase and Search Integration > > >> > > >> Hi Saurabh, > > >> > > >> For integrating HBase and Apache Solr (or any other indexing/search > > >> platform) we came up with Smart CMS [1][2] and there is the Lily > > >> Project [4] too. > > >> > > >> We are on the verge of releasing its 0.1 version which we have been > > >> testing for an extensive period of time and will be used in producti= on > > >> straight away. Smart CMS was designed and developed with a goal > > >> of uniting concepts of Objects with (HBase +Solr). IOW, we want to > > >> design objects, and Smart CMS will take care of persisting it and > > >> making it available for search. Though initially we have chosen > > >> Apache Solr as the search engine but it is very easy to plugin any > > >> other search engine of our choice, since we expose the integration o= f > > >> search functionality through SPI. > > >> > > >> A little bit of history of how we came into developing it and what i= t > > >> is currently being used for. We started the development of it as we > > >> needed a flexible content management system for an e-Commerce Platfo= rm > > >> as a Service of ours. As we engrossed ourselves into it we found > > >> 'content' to be synonymous to 'Object' in OOP paradigm and we starte= d > > >> development around it. As a result now we have a system that can be > > >> used both as a traditional Content Management System and as a Conten= t > > >> Repository. > > >> > > >> We used it in as a traditional CMS capacity to manage Pages for the > > >> partner websites for our e-Commerce PaaS; i.e. customers can create > > >> pages for - product, promotion, store, etc. manage page contents for > > >> front page, category page; links associated products, related produc= ts > > >> etc from UI where the UI is dynamically generated using the content > > >> definitions. We also used the CMS for extensive search functionaliti= es > > >> such as, full text search, facet search, range search, auto completi= on > > >> etc. For this we access the CMS using its Web Service library, we us= e > > >> Solr directly for advance searches and to access both of them we use= a > > >> tag library. The flexibility Smart CMS provided us in fact helped us > > >> win 2 big customers. > > >> > > >> We used CMS as a content repository where Smart CMS is actually bein= g > > >> used to generate domain/dto, data access layer codes for API/Service > > >> layers to use them to persist Java POJOs; i.e.users of it defines a > > >> XML we call 'Content Type Definition'. A content type definition is > > >> synonymous to an Object Diagram; where we define objects, their > > >> inheritance and compositions. This code generation is an approach we > > >> took to bypass Java Reflection API and it is done by a Maven Plugin = we > > >> have written. We have another plugin which helps us start all CMS > > >> related applications within Maven so that we can write integration > > >> tests on the fly. An example of repository mode is available in our > > >> Application Smart Email Queue [3], which is designed to send emails > > >> from our PaaS. After proving sustainable performance in this mode, > > >> Smart CMS has also been chosen for 4G Telecom Application Server > > >> project's database. > > >> > > >> [1] Smart CMS - http://smart-cms.org > > >> [2] Smart CMS Source - > https://github.com/SmartITEngineering/smart-cms > > >> [3] Smart Email Queue - > > https://github.com/SmartITEngineering/smart-email-queue > > >> [4] http://www.lilyproject.org/lily/index.html > > >> > > >> We would welcome any feedback, criticism, involvement in Smart CMS. = If > > >> you have any further queries please feel free to ask them. > > >> > > >> Thank you, > > >> > > >> Imran > > >> > > >> On Tue, Mar 20, 2012 at 7:38 PM, Agarwal, Saurabh > > >> wrote: > > >>> Hi, > > >>> > > >>> Has anyone integrated search ( Luence, Solr or Elastic) with HBase? > > >>> > > >>> We are implementing log search functionality using HBase. Through > > Flume, the logs from multiple apps are getting streamed into HBase > directly. > > >>> > > >>> A very basic use case is to search a keyword for an application for= a > > certain timeframe ( for example - last hour). > > >>> > > >>> Our row key is app_id:timestamp and all log contents are stored in > > columns. We started with Regex filter. It worked but do not provide the > > consistent result. > > >>> > > >>> Now, we are exploring the index search capability in HBase. Our > > thought process is that first create an inverted index table with row > key - > > search documents and column - the row key of the content table. The > search > > will return all the row keys. > > >>> > > >>> Additional requirement - We would like to limit the results for > > certain time frame. Second, we would like to display only limited recor= ds > > in descending time order and come back for more if user want to see mor= e > > records. > > >>> > > >>> Let me know if someone has integrated the search with HBase. > > >>> > > >>> Thanks, > > >>> Saurabh. > > >>> > > >>> -----Original Message----- > > >>> From: Ted Yu [mailto:yuzhihong@gmail.com] > > >>> Sent: Monday, March 19, 2012 12:33 PM > > >>> To: user@hbase.apache.org > > >>> Subject: Re: There is no data value information in HLog? > > >>> > > >>> Hi, > > >>> Have you noticed this in HLogPrettyPrinter ? > > >>> options.addOption("p", "printvals", false, "Print values"); > > >>> > > >>> Looks like you should have specified the above option. > > >>> > > >>> On Mon, Mar 19, 2012 at 7:31 AM, yonghu > wrote: > > >>> > > >>>> Hello, > > >>>> > > >>>> I used the $ ./bin/hbase > org.apache.hadoop.hbase.regionserver.wal.HLog > > >>>> --dump command to check the HLog information. But I can not find a= ny > > >>>> data information. The output of my HLog file is looks like follows= : > > >>>> > > >>>> Sequence 933 from region 85986149309dff24ecf7be4873136f15 in table > > test > > >>>> Action: > > >>>> row: Udo > > >>>> column: Course:Computer > > >>>> at time: Mon Mar 19 14:09:29 CET 2012 > > >>>> > > >>>> Sequence 935 from region 85986149309dff24ecf7be4873136f15 in table > > test > > >>>> Action: > > >>>> row: Udo > > >>>> column: Course:Math > > >>>> at time: Mon Mar 19 14:09:29 CET 2012 > > >>>> > > >>>> The functionality of HLog is for recovery. But without data value > > >>>> information, how can hbase use the information in HLog to do > recovery. > > >>>> My hbase version is 0.92.0. > > >>>> > > >>>> Regards! > > >>>> > > >>>> Yong > > >>>> > > >> > > >> > > >> > > >> -- > > >> Imran M Yousuf > > >> Entrepreneur & CEO > > >> Smart IT Engineering Ltd. > > >> Dhaka, Bangladesh > > >> Twitter: @imyousuf - http://twitter.com/imyousuf > > >> Blog: http://imyousuf-tech.blogs.smartitengineering.com/ > > >> Mobile: +880-1711402557 > > > > > > > > > > > > -- > > > Imran M Yousuf > > > Entrepreneur & CEO > > > Smart IT Engineering Ltd. > > > Dhaka, Bangladesh > > > Twitter: @imyousuf - http://twitter.com/imyousuf > > > Blog: http://imyousuf-tech.blogs.smartitengineering.com/ > > > Mobile: +880-1711402557 > > > > > --=20 =E2=88=9E Shashwat Shriparv --20cf3079b89896b2ad04ca71352b--