apex-dev mailing list archives

From "ASF GitHub Bot (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (APEXMALHAR-2314) Improper functioning in partitioning of sequentialFileRead property of FSRecordReader
Date Wed, 26 Oct 2016 07:40:58 GMT

    [ https://issues.apache.org/jira/browse/APEXMALHAR-2314?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15607748#comment-15607748 ]

ASF GitHub Bot commented on APEXMALHAR-2314:
--------------------------------------------

GitHub user deepak-narkhede reopened a pull request:

    https://github.com/apache/apex-malhar/pull/468

    APEXMALHAR-2314 Improper functioning in partitioning for sequentialFileRead for FSRecord

    Fix the StreamCodec for FSRecordReader. Initially it partitioned on the hash code of
    the blockId, which is almost always unique per block, so the sequentialFileRead
    property could not be satisfied. The StreamCodec now partitions on the hash code of
    the filePath, so all blocks belonging to a file are routed to the same operator.
    
    Tested with recordReader and verified that, with sequentialFileRead enabled, all
    blocks belonging to a file are partitioned to a single operator.
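
The effect of the change described above can be illustrated with a small, self-contained Java sketch. It does not use the Apex or Malhar APIs: `BlockInfo`, `FilePathPartitionSketch`, and the modulo mapping in `partitionFor` are hypothetical names and simplifications introduced only for illustration, and the sample path is made up. The actual patch changes the partition key returned by the operator's StreamCodec; how the platform maps that key to a partition is assumed here, not taken from the source.

```java
import java.util.List;
import java.util.ArrayList;
import java.util.Set;
import java.util.TreeSet;

// Hypothetical stand-in for a file block's metadata (not the Malhar class).
final class BlockInfo {
    final long blockId;
    final String filePath;

    BlockInfo(long blockId, String filePath) {
        this.blockId = blockId;
        this.filePath = filePath;
    }
}

final class FilePathPartitionSketch {
    // Behaviour described as the bug: the key depends on the block id,
    // so blocks of one file usually get different keys.
    static int keyByBlockId(BlockInfo block) {
        return Long.hashCode(block.blockId);
    }

    // Behaviour described as the fix: the key depends only on the file path,
    // so every block of a file gets the same key.
    static int keyByFilePath(BlockInfo block) {
        return block.filePath.hashCode();
    }

    // Assumption: a simple modulo mapping from key to partition index.
    // floorMod keeps the result non-negative for negative hash codes.
    static int partitionFor(int key, int partitionCount) {
        return Math.floorMod(key, partitionCount);
    }

    // Collects the distinct partitions that a list of blocks would land on.
    static Set<Integer> partitionsUsed(List<BlockInfo> blocks, boolean byFilePath, int partitionCount) {
        Set<Integer> used = new TreeSet<>();
        for (BlockInfo block : blocks) {
            int key = byFilePath ? keyByFilePath(block) : keyByBlockId(block);
            used.add(partitionFor(key, partitionCount));
        }
        return used;
    }

    public static void main(String[] args) {
        // Eight blocks that all belong to the same (made-up) file.
        List<BlockInfo> blocks = new ArrayList<>();
        for (long id = 0; id < 8; id++) {
            blocks.add(new BlockInfo(id, "/data/input/a.txt"));
        }
        System.out.println("partitions used, keyed by blockId:  " + partitionsUsed(blocks, false, 4));
        System.out.println("partitions used, keyed by filePath: " + partitionsUsed(blocks, true, 4));
    }
}
```

Keyed by block id, the eight blocks spread across the partitions; keyed by file path, they all land on one partition, which is the precondition for reading a file's blocks sequentially on a single operator.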

You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/deepak-narkhede/apex-malhar APEXMALHAR-2314

Alternatively you can review and apply these changes as the patch at:

    https://github.com/apache/apex-malhar/pull/468.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #468
    
----
commit 259cc5b80635207e8b0a4d7c0c9b5bc735021de2
Author: deepak-narkhede <mailtodeepakn@gmail.com>
Date:   2016-10-24T11:39:24Z

    APEXMALHAR-2314 Improper functioning in partitioning of sequentialFileRead property of
    FSRecordReaderModule.
    
    Modified the StreamCodec to work with the hash code of filePath rather than blockId.

----


> Improper functioning in partitioning of sequentialFileRead property of FSRecordReader
> --------------------------------------------------------------------------------------
>
>                 Key: APEXMALHAR-2314
>                 URL: https://issues.apache.org/jira/browse/APEXMALHAR-2314
>             Project: Apache Apex Malhar
>          Issue Type: Bug
>            Reporter: Deepak Narkhede
>            Assignee: Deepak Narkhede
>            Priority: Minor
>
> Fix the StreamCodec for FSRecordReader. Initially it partitioned on the hash code of
> the blockId, which is almost always unique per block, so the sequentialFileRead
> property could not be satisfied. The StreamCodec now partitions on the hash code of
> the filePath, so all blocks belonging to a file are routed to the same operator.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
