hadoop-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Nitin Pawar <nitinpawar...@gmail.com>
Subject Re: HBase or Cassandra
Date Thu, 21 Mar 2013 09:59:40 GMT
Oozie is a workflow scheduling and processing engine.

so suppose you have similar kind of incoming data and you want to do a
bunch of data processing steps on this data as and when it arrives, oozie
will give you the framework for same


On Thu, Mar 21, 2013 at 3:27 PM, oualid ait wafli <oualid.aitwafli@gmail.com
> wrote:

> Thanks Mohammad,
> but how can I use Oozie !
>
>
> 2013/3/21 Mohammad Tariq <dontariq@gmail.com>
>
>> Hello there,
>>
>>   For your use case, Hbase seems to be a better choice. And you workflow
>> looks good to me.
>>
>> Just one suggestion(in case you find it useful). Since, you are going to
>> do a lot of operations,
>> you might find it useful to schedule the jobs using Oozie.
>>
>> Warm Regards,
>> Tariq
>> https://mtariq.jux.com/
>> cloudfront.blogspot.com
>>
>>
>> On Thu, Mar 21, 2013 at 2:27 PM, oualid ait wafli <
>> oualid.aitwafli@gmail.com> wrote:
>>
>>> I have the CDR files (call details record) as my data and I want read
>>> from those files the data using Pig.
>>>
>>> firstly, I will import the data from sources using Flume, then use Pig
>>> as an ETL and as a tool to run MapReduce jobs into HDFS. so now I want
>>> store my data but I have to do a benchmark between HBase and Cassandra.
>>>
>>>  My questions:
>>> - How do you find my idea to analyze, process my data ? Am I in the best
>>> way ?
>>> - which one is the best HBase or Cassandra ?
>>>
>>>
>>> Thanks
>>>
>>>
>>>
>>>
>>> 2013/3/20 Ted Yu <yuzhihong@gmail.com>
>>>
>>>> Can you give us more information about your use case ?
>>>> e.g. approximate ratio between write vs. read load, amount of log, etc.
>>>>
>>>> Cheers
>>>>
>>>> On Wed, Mar 20, 2013 at 9:22 AM, oualid ait wafli <
>>>> oualid.aitwafli@gmail.com> wrote:
>>>>
>>>>> Yes I have a data source which contains log files, I want to analyze
>>>>> those files and store them
>>>>> any idea ?
>>>>> thanks
>>>>>
>>>>>
>>>>> 2013/3/20 Ted Yu <yuzhihong@gmail.com>
>>>>>
>>>>>> The answer to second question would be subjective.
>>>>>>
>>>>>> Do you have specific use case in mind ?
>>>>>>
>>>>>> Thanks
>>>>>>
>>>>>>
>>>>>> On Wed, Mar 20, 2013 at 9:07 AM, oualid ait wafli <
>>>>>> oualid.aitwafli@gmail.com> wrote:
>>>>>>
>>>>>>> Hi,
>>>>>>>
>>>>>>> Which is the best HBase or Cassandra ?
>>>>>>> Which are the criteria to compare those tools( HBase and Cassandra)
>>>>>>>
>>>>>>> Thanks
>>>>>>>
>>>>>>
>>>>>>
>>>>>
>>>>
>>>
>>
>


-- 
Nitin Pawar

Mime
View raw message