Return-Path: X-Original-To: apmail-hadoop-user-archive@minotaur.apache.org Delivered-To: apmail-hadoop-user-archive@minotaur.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 6FF3310B79 for ; Tue, 28 Jan 2014 17:28:29 +0000 (UTC) Received: (qmail 15704 invoked by uid 500); 28 Jan 2014 17:28:18 -0000 Delivered-To: apmail-hadoop-user-archive@hadoop.apache.org Received: (qmail 15575 invoked by uid 500); 28 Jan 2014 17:28:18 -0000 Mailing-List: contact user-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@hadoop.apache.org Delivered-To: mailing list user@hadoop.apache.org Received: (qmail 15567 invoked by uid 99); 28 Jan 2014 17:28:18 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 28 Jan 2014 17:28:18 +0000 X-ASF-Spam-Status: No, hits=1.5 required=5.0 tests=HTML_MESSAGE,RCVD_IN_DNSWL_LOW,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of mohajeri@gmail.com designates 209.85.219.46 as permitted sender) Received: from [209.85.219.46] (HELO mail-oa0-f46.google.com) (209.85.219.46) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 28 Jan 2014 17:28:12 +0000 Received: by mail-oa0-f46.google.com with SMTP id n16so756560oag.19 for ; Tue, 28 Jan 2014 09:27:52 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:in-reply-to:references:date:message-id:subject:from:to :content-type; bh=gVQDpYBIebeiqkSL+O+FK8HKDfXbykwKTE0dhfZAo+A=; b=jg2YM8yaERiQEpWzUj+72tKB8gt1VH2jKiqbRcjZxGq6f7yVx4mGM6Zyjdsi2r7Xh2 RM2ve89VpN0N0zt0U+BjA3iRAWxalekLovtPnUoN7Yh/lgxTTePWpDa7w4M/1c3ZSggR dEBlv9ByhRRJPVGh3ilK83Va5GRxUi2PQ5RdXa4oAA0prXp3MJMbNOlH3IdXl3u//2wK LkEUAaxkbUoqwbHVjg9Br3xjAI5bzdO3nCvnBJ+B1Nr8KbcQ/ykKULhKwCJFVH+Y3jxD zsyp8esLN/4hyvekqTAqW5pPNbM+OkVcAslg2DbgQLFPnzCE/OmtspAwlWuZLnYKgumP O7tw== MIME-Version: 1.0 X-Received: by 10.182.142.229 with SMTP id rz5mr2033936obb.12.1390930072110; Tue, 28 Jan 2014 09:27:52 -0800 (PST) Received: by 10.182.248.165 with HTTP; Tue, 28 Jan 2014 09:27:52 -0800 (PST) In-Reply-To: References: Date: Tue, 28 Jan 2014 09:27:52 -0800 Message-ID: Subject: Re: Suggestion technology/design on this usecase From: Peyman Mohajerian To: user@hadoop.apache.org Content-Type: multipart/alternative; boundary=001a11c362e6db0cf004f10b2425 X-Virus-Checked: Checked by ClamAV on apache.org --001a11c362e6db0cf004f10b2425 Content-Type: text/plain; charset=ISO-8859-1 No-sql solution with real-time counters would work, e.g. Cassandra or hbase. But I think elastic search or Solr would be simpler and can do the counting on access. There are solutions that are the combination of both these approaches. On Tue, Jan 28, 2014 at 8:51 AM, Naresh Yadav wrote: > please give suggestions on this... > > > On Tue, Jan 28, 2014 at 3:18 PM, Naresh Yadav wrote: > >> Hi all, >> >> I am new to big data technologies and design so looking for help from >> java world. >> >> I have concept of tags and tagcombinations. >> For example U.S.A and Pen are two tags AND if they come together in some >> definition then register a tagcombination(U.S.A-Pen) for that.. >> >> *tags *(U.S.A, Pen, Pencil, India, Shampoo) >> *tagcombinations*(U.S.A-Pen, India-pencil, U.S.A-Pencil, India-Pen, >> India-Pen-Shampoo) >> >> - millions of tags >> - billions of tagcombinations >> - one tagcombination generally have 2-8 tags.... >> - Every day we get lakhs of new tagcombinations to write >> - daily crores of queries to find matching combination by set of tags >> >> Query need to support : >> one tag or set of tags appears in how many tagcombinationids ???? >> If i query for Pen,India then it should return two tagcombinaions >> (India-Pen, India-Pen-Shampoo))..Query will be fired by application in >> realtime. >> >> Please suggest a solution which is distributed with java client and can >> handle scale of data i am looking for.. >> >> Thanks >> Naresh >> > > > > --001a11c362e6db0cf004f10b2425 Content-Type: text/html; charset=ISO-8859-1 Content-Transfer-Encoding: quoted-printable
No-sql solution with real-time counters would work, e.g. C= assandra or hbase. But I think elastic search or Solr would be simpler and = can do the counting on access. There are solutions that are the combination= of both these approaches.
--001a11c362e6db0cf004f10b2425--