hadoop-common-commits mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Apache Wiki <wikidi...@apache.org>
Subject [Lucene-hadoop Wiki] Trivial Update of "Hbase/HbaseShell/ShellPlans" by udanax
Date Fri, 26 Oct 2007 07:31:52 GMT
Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Lucene-hadoop Wiki" for change notification.

The following page has been changed by udanax:
http://wiki.apache.org/lucene-hadoop/Hbase/HbaseShell/ShellPlans

------------------------------------------------------------------------------
  = Hbase Shell Commands Plans =
  
  == Hbase Shell Algebraic Commands ==
+ I propose a new way to formulate complex queries from a series of simple queries for the
perform statement of a variety of complex algebra manipulations. 
  
  === Relational Algebra Operations ===
  
@@ -114, +115 @@

  ----
  
  = Some Ideas Note =
- Isn't this query impossible?
  
  {{{
  select column_qualifier1, column_qualifier2 from 2d_table(table_name, columnfamily_name)
where row='row key';
@@ -175, +175 @@

  
  }}}
  
- 
- Algebraic Query Language 
- 
- A new approach of building enterprise distributed computing applications for layer of business
logic level is proposed in this paper. The benefit of the system is light-weightedness of
formal methods and their separate application to program features that does not need to know
full semantic model of Hadoop, Hbase programs.
- 
- I propose a new way to formulate complex queries from a series of simple queries for the
perform statement of a variety of complex algebra manipulations. 
- 
- ||Row Key ||<-12>Column Families ||
- ||<rowbgcolor="#ececec">title ||<-2> year ||<-2>length ||<-2>inColor
||<-2> studioName ||<-2> vote ||<-2> producer ||
- ||Star Wars ||year: || 1977 ||length: || 124 ||inColor: || true ||studioName: || Fox ||
vote:''user1'' || 5 || producer: || George Lucas ||
- || || || || || || || || || || vote:''user2'' || 2 || || ||
- ||Mighty Ducks ||year: || 1991 ||length: || 104 ||inColor: || true ||studioName: || Disney
|| vote:''user1'' || 2 || producer: || Blair Peters ||
- || || || || || || || || || || vote:''user3'' || 4 || || ||
- || || || || || || || || || || vote:''user4'' || 2 || || ||
- ||Wayne's World ||year: || 1992 ||length: || 95 ||inColor: || true ||studioName: || Paramount
|| vote:''user2'' || 3 || producer: || Penelope Spheeris ||
- || || || || || || || || || || vote:''user3'' || 4 || || ||
- ||A cold ||year: || 2006 ||length: || 39 ||inColor: || false ||studioName: || || vote:''user2''
|| 1 || producer: || Minjung Chu ||
- ||The Spiriting away of Sen and Chihiro ||year: || 2001 ||length: || 124 ||inColor: || true
||studioName: ||Studio Ghibli || vote:''user1'' || 5 || producer: || Miyazaki hayao ||
- 
- The goal is to find an unknown pattern between length and user. 
- 
- {{{
- Hbase > A = table('movieLog_table');
- Hbase > B = A.Selection(length > 60);
- Hbase > C = B.Projection('length', 'vote');
- Hbase > D = Sort C by ('length');
- Hbase > Save D into table('m_table');
- Successfully job submitted. 
- 
- Hbase > Show jobs;
- }}}
- 
- m_table : 
- 
- {{{
- Hbase > select * from m_table;
- 
-     Row                  Columnfamilies
- ------------    --------------------------------------  
-    length               vote                     ...
- ------------     -----------------------      ----------
-     104           vote:user1      2(t1)          ...
-                   vote:user3      4(t1)
-                   vote:user4      2(t1)
-     124           vote:user1      5(t1)
-                                   5(t2)
-                   vote:user2      2(t1)
-      95           vote:user2      3(t1)
-                   vote:user3      4(t1)
- 
- Successfully print out the selected data.(0.08 sec)
- Hbase > A = matrix('m_table','vote');
- }}}
- 
- The expected 3d-matrix A of (length by user, time dimension) would be as sampled below :

- 
- {{{
-            vote:user1    vote:user2    vote:user3 vote:user4
- ---------- ----------    ----------    ---------- ----------
-    104              2                           4          2 
- 
-    124              5             2
-                       5(t2)   
-     95                            3             4
- }}}
- 
- TODO: Aggregate average of time series.
- 

Mime
View raw message