hadoop-common-commits mailing list archives

From Apache Wiki <wikidi...@apache.org>
Subject [Hadoop Wiki] Update of "Hbase/HbaseShell/HQL" by BryanDuxbury
Date Sun, 10 Feb 2008 01:52:45 GMT
Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Hadoop Wiki" for change notification.

The following page has been changed by BryanDuxbury:

  ''This sets a 'parallel' property on whole table (could be just a column?)?  Whats this
do? When you do selects, does MR job?  St.Ack''
+ ''Why is the parallel option an attribute of the table? Isn't it an attribute of the query?
Setting the size of the m/r job for the whole table seems inefficient. bryanduxbury''
  === Data Loading ===
  HQL Loader utility loads data into Hbase tables from external files. If you have a large
amount of data to load, HQL Loader's parallel support can dramatically reduce the elapsed
time needed to perform that load.
@@ -66, +68 @@

  ''Why not do something like mysql where it loads and dumps a near-binary format?  Problem
w/ above is what to do if cell has tab or new-line in it?  St.Ack''
+ ''Is this best implemented as part of the shell? Why not have a bin/hbase loader that takes
the options on the command line? bryanduxbury''
  === Data Dumping ===
