hadoop-common-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Naresh Yadav <nyadav....@gmail.com>
Subject Re: Suggestion technology/design on this usecase
Date Tue, 28 Jan 2014 16:51:42 GMT
please give suggestions on this...


On Tue, Jan 28, 2014 at 3:18 PM, Naresh Yadav <nyadav.ait@gmail.com> wrote:

> Hi all,
>
> I am new to big data technologies and design so looking for help from java
> world.
>
> I have concept of tags and tagcombinations.
> For example U.S.A and Pen are two tags AND if they come together in some
> definition then register a tagcombination(U.S.A-Pen) for that..
>
> *tags *(U.S.A, Pen, Pencil, India, Shampoo)
> *tagcombinations*(U.S.A-Pen, India-pencil, U.S.A-Pencil, India-Pen,
> India-Pen-Shampoo)
>
> - millions of tags
> - billions of tagcombinations
> - one tagcombination generally have 2-8 tags....
> - Every day we get lakhs of new tagcombinations to write
> - daily crores of queries to find matching combination by set of tags
>
> Query need to support :
> one tag or set of tags appears in how many tagcombinationids ????
> If i query for Pen,India then it should return two tagcombinaions
> (India-Pen, India-Pen-Shampoo))..Query will be fired by application in
> realtime.
>
> Please suggest a solution which is distributed with java client and can
> handle scale of data  i am looking for..
>
> Thanks
> Naresh
>

Mime
View raw message