hadoop-common-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "stack (JIRA)" <j...@apache.org>
Subject [jira] Commented: (HADOOP-1608) [Hbase Shell] Relational Algrebra Operators
Date Fri, 02 Nov 2007 00:46:56 GMT

    [ https://issues.apache.org/jira/browse/HADOOP-1608?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12539494
] 

stack commented on HADOOP-1608:
-------------------------------

Edward:

I'm having some difficulty following the model Edward particularly when there is a dearth
of javadoc (You should at a minumum provide documentation of interfaces).

For example, the Operation interface has a getRelationalOperation method; RelationalOperation
implements Operation.  I would think the getRelationalOperation belongs in RelationalOperation
or calling since RelationalOperation implements Operation, calling getOperation when the implemenation
is a RelationalOperation, then you'd get back a RelationalOperation?

Do these mapreduce jobs work in a cluster of multiple machines?



> [Hbase Shell] Relational Algrebra Operators
> -------------------------------------------
>
>                 Key: HADOOP-1608
>                 URL: https://issues.apache.org/jira/browse/HADOOP-1608
>             Project: Hadoop
>          Issue Type: Improvement
>          Components: contrib/hbase
>    Affects Versions: 0.14.1
>         Environment: All environments 
>            Reporter: Edward Yoon
>            Priority: Minor
>             Fix For: 0.16.0
>
>         Attachments: patch_v01.txt, patch_v02.txt, patch_v03.txt, patch_v04.txt, patch_v05.txt,
patch_v06.txt, patch_v07.txt, patch_v08.txt, patch_v09.txt, patch_v10.txt, patch_v11.txt,
patch_v12.txt, patch_v13.txt, patch_v14.txt
>
>
> Development of relational algebra operators has begun.
>  * Projection 
>   ** selects a subset of the columnfamilies of a relation  
>   ** Result = π ~column_list~ (Relation) 
> {code}
> Hbase > Result = Relation.Projection('year','legnth');
> Hbase > save Result into table('result');
> {code}
>  * Selection
>   ** See : HADOOP-2003 issue's description
> {code}
> Hbase > Result = Relation.Selection(length > 100 and studioName = 'Fox'); 
> Hbase > save Result into table('result');
> {code}
>  * Group
>   ** more details about 'GROUP' operation will be handled in HADOOP-1658 issue. 
>  * θ Join
>  ** The join of two relations R1(A ~1~,A ~2~,...,A ~n~) and R2(B ~1~,B ~2~,...,B ~m~)
is a relation with degree k=n+m and attributes (A ~1~,A ~2~,...,A ~n~, B ~1~,B ~2~,...,B ~m~)
that satisfy the join condition
> {code}
> Hbase > R1 = table('movieLog_table');
> Hbase > R2 = table('personInfo_table');
> Hbase > Result = R1.join(R1.producer: = R2.ROW) and R2; 
>      or Result = R1.join(R1.actor:hero = R2.Row) and R2;
>      or Result = R1.join(R1.actor:hero = R2.Row and R1.studioName = 'Fox' and R2.occupation
= 'singer');
> {code}

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


Mime
View raw message