hadoop-hdfs-user mailing list archives

From Naresh Yadav <nyadav....@gmail.com>
Subject Suggestion technology/design on this usecase
Date Tue, 28 Jan 2014 09:48:51 GMT
Hi all,

I am new to big data technologies and design, so I am looking for help from java

I have a concept of tags and tagcombinations.
For example, U.S.A and Pen are two tags, and if they appear together in some
definition, then a tagcombination (U.S.A-Pen) is registered for them.

*tags* (U.S.A, Pen, Pencil, India, Shampoo)
*tagcombinations* (U.S.A-Pen, India-Pencil, U.S.A-Pencil, India-Pen,

- millions of tags
- billions of tagcombinations
- one tagcombination generally has 2-8 tags
- every day we get lakhs (hundreds of thousands) of new tagcombinations to write
- daily crores (tens of millions) of queries to find matching combinations by a set of tags

Queries need to support:
How many tagcombination IDs contain a given tag or set of tags?
If I query for Pen,India then it should return two tagcombinations
(India-Pen, India-Pen-Shampoo). Query will be fired by application in
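
To make the query semantics concrete, here is a minimal sketch of an
inverted index (tag -> tagcombinations containing it) that answers a query
by intersecting the per-tag sets. It is only an illustration under
assumptions: the class name TagIndex and its methods register and query are
hypothetical, everything lives in memory in a single JVM, a tagcombination
is named by joining its tags with '-' in the order given, and tags
themselves contain no '-'. It is not a distributed solution and says
nothing about which storage system to use at the scale described.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;
import java.util.Set;
import java.util.TreeSet;

// Hypothetical single-JVM sketch of the lookup; not a distributed store.
class TagIndex {
    // tag -> names of all tagcombinations that contain that tag
    private final Map<String, Set<String>> postings = new HashMap<>();

    /** Registers a tagcombination; its name is the tags joined by '-' (assumed convention). */
    String register(String... tags) {
        String name = String.join("-", tags);
        for (String tag : tags) {
            postings.computeIfAbsent(tag, t -> new TreeSet<>()).add(name);
        }
        return name;
    }

    /** Returns every registered tagcombination that contains all of the given tags. */
    Set<String> query(String... tags) {
        List<Set<String>> lists = new ArrayList<>();
        for (String tag : tags) {
            Set<String> p = postings.get(tag);
            if (p == null) {
                return new TreeSet<>(); // an unknown tag matches nothing
            }
            lists.add(p);
        }
        if (lists.isEmpty()) {
            return new TreeSet<>();
        }
        // Start from the smallest set so the intersection shrinks quickly.
        lists.sort((a, b) -> Integer.compare(a.size(), b.size()));
        Set<String> result = new TreeSet<>(lists.get(0));
        for (int i = 1; i < lists.size(); i++) {
            result.retainAll(lists.get(i));
        }
        return result;
    }

    public static void main(String[] args) {
        TagIndex index = new TagIndex();
        index.register("U.S.A", "Pen");
        index.register("India", "Pen");
        index.register("India", "Pen", "Shampoo");
        index.register("India", "Pencil");
        Set<String> matches = index.query("Pen", "India");
        System.out.println(matches.size() + " matches: " + matches);
    }
}
```

With the four combinations registered in main, the query for Pen,India
yields India-Pen and India-Pen-Shampoo, which matches the example above. In
a distributed setting the same tag -> tagcombination mapping would have to
be partitioned across machines; how to do that is exactly the open question.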

Please suggest a distributed solution with a Java client that can
handle the scale of data I am looking for.

