hadoop-hdfs-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Hanisha Koneru (JIRA)" <j...@apache.org>
Subject [jira] [Created] (HDDS-1496) readChunkFromContainer() should only read the required part of chunk file
Date Tue, 07 May 2019 04:05:00 GMT
Hanisha Koneru created HDDS-1496:
------------------------------------

             Summary: readChunkFromContainer() should only read the required part of chunk
file
                 Key: HDDS-1496
                 URL: https://issues.apache.org/jira/browse/HDDS-1496
             Project: Hadoop Distributed Data Store
          Issue Type: Improvement
            Reporter: Hanisha Koneru
            Assignee: Hanisha Koneru


BlockInputStream#readChunkFromContainer() reads the whole chunk from disk even if we need
to read only a part of the chunk.
This Jira aims to improve readChunkFromContainer so that only that part of the chunk file
is read which is needed by client plus the part of chunk file which is required to verify
the checksum.



For example, lets say the client is reading from index 120 to 450 in the chunk. And let's
say checksum is stored for every 100 bytes in the chunk i.e. the first checksum is for bytes
from index 0 to 99, the next for bytes from index 100 to 199 and so on. To verify bytes from
120 to 450, we would need to read from bytes 100 to 499 so that checksum verification can
be done.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

---------------------------------------------------------------------
To unsubscribe, e-mail: hdfs-issues-unsubscribe@hadoop.apache.org
For additional commands, e-mail: hdfs-issues-help@hadoop.apache.org


Mime
View raw message