hbase-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "jaehong choi (JIRA)" <j...@apache.org>
Subject [jira] Issue Comment Edited: (HBASE-1548) give a time range in MapReduce
Date Mon, 22 Jun 2009 16:20:07 GMT

    [ https://issues.apache.org/jira/browse/HBASE-1548?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12722653#action_12722653
] 

jaehong choi edited comment on HBASE-1548 at 6/22/09 9:19 AM:
--------------------------------------------------------------

I attached a patch.
1. add another org.apache.hadoop.hbase.mapreduce.initTableMapJob() to give timestamps when
we prepare a mapreduce job.
2. change org.apache.hadoop.hbase.mapreduce.TableInputFormat.java and org.apache.hadoop.hbase.mapreduce.TableInputFormatBase.java
to support time range scan.
3. unit test for this is in org.apache.hadoop.hbase.mapreduce.TestTimeRangeMapRed.java

Is there any possible ways to give a Scan instance when we do mapreduce? I tried to find a
way that pass the instance through Configuration. But, it only supports primitive types. If
there is any ways, passing a Scan object would be the best. 

      was (Author: jaehong choi):
    I attached a patch.
1. add another org.apache.hadoop.hbase.mapreduce.initTableMapJob() to give timestamps when
we prepare a mapreduce job.
2. change org.apache.hadoop.hbase.mapreduce.TableInputFormat.java and org.apache.hadoop.hbase.mapreduce.TableInputFormatBase.java
to support time range scan.
3. unit test for this is in org.apache.hadoop.hbase.mapreduce.TestTimeRangeMapRed.java


  
> give a time range in MapReduce
> ------------------------------
>
>                 Key: HBASE-1548
>                 URL: https://issues.apache.org/jira/browse/HBASE-1548
>             Project: Hadoop HBase
>          Issue Type: Improvement
>          Components: mapred
>            Reporter: jaehong choi
>         Attachments: HBase-1548.patch
>
>
> I've seen a lot of changes in client APIs by HBASE-1304 patch.
> We can give a time range when we scan a table with assign TimeRange instance to Scan
class.
> I also see some code in org.apache.hadoop.hbase.mapreduce.TableInputForamtBase.java,
which uses a scan instance to get a Scanner instance.
> I think we can also give a time range to query when we do mapreduce on hbase. 
> I've already apply this one to my local copy for my purpose, which is about time series
analysis.
> Thanks.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


Mime
View raw message