hadoop-common-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Peter Dolan (JIRA)" <j...@apache.org>
Subject [jira] Created: (HADOOP-2554) Add a HTable get method that retrieves all versions of a particular column and row between two timestamps
Date Tue, 08 Jan 2008 22:54:34 GMT
Add a HTable get method that retrieves all versions of a particular column and row between
two timestamps
---------------------------------------------------------------------------------------------------------

                 Key: HADOOP-2554
                 URL: https://issues.apache.org/jira/browse/HADOOP-2554
             Project: Hadoop
          Issue Type: New Feature
          Components: contrib/hbase
            Reporter: Peter Dolan
            Priority: Minor


The use case:

* A weblog application for which rows are user ids and posts are stored in a single column,
with post date specified by the cell's timestamp.  The application would then need to be able
to display all posts for the last week or month.
* A feedfetcher for which rows are URLs and feed posts are stored in a single column with
the post publish date or fetch time stored in the cell's timestamp.  The application would
then need to be able to display all posts for the last week or month.

Proposed API:

/**
 * Get all versions of the specified row and column whose timestamps are in [minTimestamp,
maxTimestamp]
 */
Map<long, byte[]> getRowTimestamps(Text row, Text column, long minTimestamp, long maxTimestamp);

/**
 * Get all versions of the specified row and column whose timestamps are >= minTimestamp
 */
Map<long, byte[]> getRowTimestamps(Text row, Text column, long minTimestamp);

I'd be happy to take this on myself, as I need it for the above use cases before migrating
my application over to HBase.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


Mime
View raw message