hbase-user mailing list archives

From tsuna <tsuna...@gmail.com>
Subject Re: habse schema design and retrieving values through REST interface
Date Tue, 15 Mar 2011 23:51:57 GMT
On Tue, Mar 15, 2011 at 10:19 AM, sreejith P. K. <sreejithpk@nesote.com> wrote:
> I need to maintain a huge table for a 'web crawler' project in HBase.
> Basically, it contains thousands of keywords, and for each keyword I need to
> maintain a list of URLs (again numbering in the thousands). For each URL
> I need to store a number that represents the priority the keyword holds
> for that URL.
> To explain a bit: suppose I have the keyword 'united states'. I need
> to store about ten thousand URLs for that keyword, each with an integer
> priority value. And I have thousands of keywords like that. The unusual
> thing about this is that I need to do the project in PHP.

Have you looked at ElasticSearch?  Seems like it would do what you
want out of the box.  In your PHP app you simply need to make REST
calls with a bit of JSON here and there, and that would be all.
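
As a rough illustration of what "REST calls with a bit of JSON" could look like, here is a minimal sketch in Python (the same HTTP calls can be made from PHP with curl). Everything specific in it is an assumption for illustration only: the node address `localhost:9200`, the index name `crawler`, the field names `keyword`, `url` and `priority`, and the `_doc` endpoint used by recent ElasticSearch versions (older releases expect a type name in that position). The functions only build the requests; nothing is sent.

```python
import json
import urllib.request

ES_URL = "http://localhost:9200"  # assumed address of a local node
INDEX = "crawler"                 # hypothetical index name


def index_request(keyword, url, priority):
    """Build (without sending) a request storing one keyword/URL/priority triple."""
    body = json.dumps({"keyword": keyword, "url": url, "priority": priority})
    return urllib.request.Request(
        f"{ES_URL}/{INDEX}/_doc",
        data=body.encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def search_request(keyword, size=10):
    """Build (without sending) a search for a keyword's URLs, highest priority first."""
    body = json.dumps({
        # match_phrase is used so the sketch does not depend on a custom mapping
        "query": {"match_phrase": {"keyword": keyword}},
        "sort": [{"priority": {"order": "desc"}}],
        "size": size,
    })
    return urllib.request.Request(
        f"{ES_URL}/{INDEX}/_search",
        data=body.encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )
```

Passing either request to `urllib.request.urlopen()` would send it to a running node. Storing one small document per keyword/URL pair is only one possible layout; whether it suits ten thousand URLs per keyword would need testing.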

Benoit "tsuna" Sigoure
Software Engineer @ www.StumbleUpon.com
