hbase-issues mailing list archives

From "Nick Dimiduk (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (HBASE-8034) record on-disk data size for store file and make it available during writing
Date Fri, 08 Mar 2013 18:18:13 GMT

    [ https://issues.apache.org/jira/browse/HBASE-8034?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13597369#comment-13597369 ]

Nick Dimiduk commented on HBASE-8034:
-------------------------------------

Would it make more sense to expose the number of KeyValues in the HFile?

{code}
+  @Override
+  public long getCurrentSize() throws IOException {
+    if (this.outputStream == null) return -1;
+    return this.outputStream.getPos();
+  }
{code}

This strikes me as flaky. Can another thread be writing to the OutputStream when this
method is invoked? If so, should it be synchronized?
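
To make the concern concrete, here is a minimal sketch of one way the size could be read safely while another thread is writing. It is an illustration only, not the code in the patch: the class name SizeTrackingWriter and its methods are hypothetical, and it counts bytes itself under a lock instead of calling getPos() on the stream. It keeps the patch's convention of returning -1 when there is no open stream.

```java
import java.io.IOException;
import java.io.OutputStream;

/** Hypothetical writer used only for illustration; not an HBase class. */
class SizeTrackingWriter {
  private final Object lock = new Object();
  private OutputStream out;   // guarded by lock; null once closed
  private long bytesWritten;  // guarded by lock

  SizeTrackingWriter(OutputStream out) {
    this.out = out;
  }

  /** Writes the buffer and updates the byte count under the same lock. */
  void write(byte[] buf) throws IOException {
    synchronized (lock) {
      if (out == null) {
        throw new IOException("writer is closed");
      }
      out.write(buf);
      bytesWritten += buf.length;
    }
  }

  /** Returns the bytes written so far, or -1 if the stream is closed. */
  long getCurrentSize() {
    synchronized (lock) {
      return out == null ? -1 : bytesWritten;
    }
  }

  /** Closes the underlying stream; later size queries return -1. */
  void close() throws IOException {
    synchronized (lock) {
      if (out != null) {
        out.close();
        out = null;
      }
    }
  }
}
```

If the writer is only ever touched by a single thread, none of this locking is needed and the unsynchronized version is fine; the sketch only matters if the answer to the question above is yes.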
                
> record on-disk data size for store file and make it available during writing
> ----------------------------------------------------------------------------
>
>                 Key: HBASE-8034
>                 URL: https://issues.apache.org/jira/browse/HBASE-8034
>             Project: HBase
>          Issue Type: Task
>            Reporter: Sergey Shelukhin
>            Assignee: Sergey Shelukhin
>            Priority: Minor
>         Attachments: HBASE-8034-v0.patch
>
>
> To better estimate the size of data in the file, and to be able to split files intelligently during any multi-file compactor like stripe or level.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira
