hadoop-mapreduce-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Mohammad Tariq <donta...@gmail.com>
Subject Re: HBase or Cassandra
Date Thu, 21 Mar 2013 10:02:25 GMT
Harsh has got a point. You should consider it. If you really need random
real time read/write, only then you should go for a DB.

Warm Regards,
Tariq
https://mtariq.jux.com/
cloudfront.blogspot.com


On Thu, Mar 21, 2013 at 3:29 PM, Nitin Pawar <nitinpawar432@gmail.com>wrote:

> Oozie is a workflow scheduling and processing engine.
>
> so suppose you have similar kind of incoming data and you want to do a
> bunch of data processing steps on this data as and when it arrives, oozie
> will give you the framework for same
>
>
> On Thu, Mar 21, 2013 at 3:27 PM, oualid ait wafli <
> oualid.aitwafli@gmail.com> wrote:
>
>> Thanks Mohammad,
>> but how can I use Oozie !
>>
>>
>> 2013/3/21 Mohammad Tariq <dontariq@gmail.com>
>>
>>> Hello there,
>>>
>>>   For your use case, Hbase seems to be a better choice. And you workflow
>>> looks good to me.
>>>
>>> Just one suggestion(in case you find it useful). Since, you are going to
>>> do a lot of operations,
>>> you might find it useful to schedule the jobs using Oozie.
>>>
>>> Warm Regards,
>>> Tariq
>>> https://mtariq.jux.com/
>>> cloudfront.blogspot.com
>>>
>>>
>>> On Thu, Mar 21, 2013 at 2:27 PM, oualid ait wafli <
>>> oualid.aitwafli@gmail.com> wrote:
>>>
>>>> I have the CDR files (call details record) as my data and I want read
>>>> from those files the data using Pig.
>>>>
>>>> firstly, I will import the data from sources using Flume, then use Pig
>>>> as an ETL and as a tool to run MapReduce jobs into HDFS. so now I want
>>>> store my data but I have to do a benchmark between HBase and Cassandra.
>>>>
>>>>  My questions:
>>>> - How do you find my idea to analyze, process my data ? Am I in the
>>>> best way ?
>>>> - which one is the best HBase or Cassandra ?
>>>>
>>>>
>>>> Thanks
>>>>
>>>>
>>>>
>>>>
>>>> 2013/3/20 Ted Yu <yuzhihong@gmail.com>
>>>>
>>>>> Can you give us more information about your use case ?
>>>>> e.g. approximate ratio between write vs. read load, amount of log, etc.
>>>>>
>>>>> Cheers
>>>>>
>>>>> On Wed, Mar 20, 2013 at 9:22 AM, oualid ait wafli <
>>>>> oualid.aitwafli@gmail.com> wrote:
>>>>>
>>>>>> Yes I have a data source which contains log files, I want to analyze
>>>>>> those files and store them
>>>>>> any idea ?
>>>>>> thanks
>>>>>>
>>>>>>
>>>>>> 2013/3/20 Ted Yu <yuzhihong@gmail.com>
>>>>>>
>>>>>>> The answer to second question would be subjective.
>>>>>>>
>>>>>>> Do you have specific use case in mind ?
>>>>>>>
>>>>>>> Thanks
>>>>>>>
>>>>>>>
>>>>>>> On Wed, Mar 20, 2013 at 9:07 AM, oualid ait wafli <
>>>>>>> oualid.aitwafli@gmail.com> wrote:
>>>>>>>
>>>>>>>> Hi,
>>>>>>>>
>>>>>>>> Which is the best HBase or Cassandra ?
>>>>>>>> Which are the criteria to compare those tools( HBase and
Cassandra)
>>>>>>>>
>>>>>>>> Thanks
>>>>>>>>
>>>>>>>
>>>>>>>
>>>>>>
>>>>>
>>>>
>>>
>>
>
>
> --
> Nitin Pawar
>

Mime
View raw message