hive-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Carl Steinbach <cwsteinb...@gmail.com>
Subject Re: Interesting claims that seem untrue
Date Mon, 16 Sep 2013 05:34:35 GMT
It doesn't sound like others have seen this yet so I thought I'd point out
that
Carter Shanklin explains his methodology for computing these statistics in
this comment:

http://hortonworks.com/blog/stinger-phase-2-the-journey-to-100x-faster-hive/#comment-160943


On Thu, Sep 12, 2013 at 6:09 PM, Navis류승우 <navis.ryu@nexr.com> wrote:

> It looks like counting codes from company of committer, not of the author.
>
> Considering the huge amount of works by Ashutosh, it's not strange.
>
>
> 2013/9/13 Sanjay Subramanian <Sanjay.Subramanian@wizecommerce.com>
>
>>  I have not read the full blogs but in the year 2013 , IMHO , LOC is a
>> very old metric that defines good software any more...
>>
>>   From: Edward Capriolo <edlinuxguru@gmail.com>
>> Reply-To: "user@hive.apache.org" <user@hive.apache.org>
>> Date: Thursday, September 12, 2013 7:19 AM
>> To: "hive-user@hadoop.apache.org" <hive-user@hadoop.apache.org>, "<
>> hive-dev@hadoop.apache.org>" <hive-dev@hadoop.apache.org>
>> Subject: Interesting claims that seem untrue
>>
>>   I was reading the horton-works blog and found an interesting article.
>>
>> http://hortonworks.com/blog/stinger-phase-2-the-journey-to-100x-faster-hive/#comment-160753
>>
>>  There is a very interesting graphic which attempts to demonstrate lines
>> of code in the 12 release.
>> http://hortonworks.com/wp-content/uploads/2013/09/hive4.png
>>
>>  Although I do not know how they are calculated, they are probably
>> counting code generated by tests output, but besides that they are wrong.
>>
>>  One claim is that Cloudera contributed 4,244 lines of code.
>>
>>  So to debunk that claim:
>>
>>  In https://issues.apache.org/jira/browse/HIVE-4675 Brock Noland from
>> cloudera, created the ptest2 testing framework. He did all the work for
>> ptest2 in hive 12, and it is clearly more then 4,244
>>
>>  This consists of 84 java files
>> [edward@desksandra ptest2]$ find . -name "*.java" | wc -l
>> 84
>>  and by itself is 8001 lines of code.
>> [edward@desksandra ptest2]$ find . -name "*.java" | xargs cat | wc -l
>> 8001
>>
>>  [edward@desksandra hive-trunk]$ wc -l HIVE-4675.patch
>> 7902 HIVE-4675.patch
>>
>>  This is not the only feature from cloudera in hive 12.
>>
>>  There is also a section of the article that talks of a "ROAD MAP" for
>> hive features. I did not know we (hive) had a road map. I have advocated
>> switching to feature based release and having a road map before, but it was
>> suggested that might limit people from itch-scratching.
>>
>>
>>
>>
>>
>> CONFIDENTIALITY NOTICE
>> ======================
>> This email message and any attachments are for the exclusive use of the
>> intended recipient(s) and may contain confidential and privileged
>> information. Any unauthorized review, use, disclosure or distribution is
>> prohibited. If you are not the intended recipient, please contact the
>> sender by reply email and destroy all copies of the original message along
>> with any attachments, from your computer system. If you are the intended
>> recipient, please be advised that the content of this message is subject to
>> access, review and disclosure by the sender's Email System Administrator.
>>
>
>

Mime
View raw message