hbase-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Jai Kumar Singh (Created) (JIRA)" <j...@apache.org>
Subject [jira] [Created] (HBASE-5166) MultiThreaded Table Mapper analogous to MultiThreaded Mapper in hadoop
Date Tue, 10 Jan 2012 10:24:38 GMT
MultiThreaded Table Mapper analogous to MultiThreaded Mapper in hadoop
----------------------------------------------------------------------

                 Key: HBASE-5166
                 URL: https://issues.apache.org/jira/browse/HBASE-5166
             Project: HBase
          Issue Type: Improvement
            Reporter: Jai Kumar Singh
            Priority: Minor


There is no MultiThreadedTableMapper in hbase currently just like we have a MultiThreadedMapper
in Hadoop for IO Bound Jobs. 
UseCase, webcrawler: take input (urls) from a hbase table and put the content (urls, content)
back into hbase. 
Running these kind of hbase mapreduce job with normal table mapper is quite slow as we are
not utilizing CPU fully (N/W IO Bound).

Moreover, I want to know whether It would be a good/bad idea to use HBase for these kind of
usecases ?. 


--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

Mime
View raw message