hadoop-pig-dev mailing list archives

From "Milind Bhandarkar (JIRA)" <j...@apache.org>
Subject [jira] Commented: (PIG-856) PERFORMANCE: reduce number of replicas
Date Tue, 23 Jun 2009 19:58:07 GMT

    [ https://issues.apache.org/jira/browse/PIG-856?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12723269#action_12723269 ]

Milind Bhandarkar commented on PIG-856:
---------------------------------------

A replication factor of 2 is 17% faster than a replication factor of 3 on the sort benchmark.
However, the sort benchmark does no computation in its mappers or reducers, so nearly all of
its time goes to I/O. Therefore, the percentage improvement for Pig will definitely be much less.
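
To put rough numbers on that reasoning, here is a back-of-the-envelope sketch. It assumes that
"17% faster" means a 17% reduction in running time, and that the saving applies only to the
share of a job that behaves like the sort benchmark (dominated by writing replicated output).
The function name and the example shares are hypothetical and are not measurements from Pig.

```python
def overall_time_saving(replication_bound_share: float, saving: float = 0.17) -> float:
    """Estimate the fraction of total job time saved.

    replication_bound_share: assumed fraction of job time that behaves like
        the sort benchmark (hypothetical input, not a measured value).
    saving: time saved on that fraction; 0.17 is the sort benchmark figure,
        read here as a 17% reduction in running time (an assumption).
    """
    if not 0.0 <= replication_bound_share <= 1.0:
        raise ValueError("replication_bound_share must be between 0 and 1")
    # Only the replication-bound part of the job gets faster; the
    # computation in mappers and reducers is unaffected.
    return replication_bound_share * saving


if __name__ == "__main__":
    # Illustrative shares only: 100% corresponds to the pure sort benchmark.
    for share in (1.0, 0.5, 0.25):
        print(f"share {share:.0%}: overall saving {overall_time_saving(share):.1%}")
```

Under these assumptions, the more time a Pig job spends computing, the smaller the share and
the smaller the overall gain from dropping to 2 replicas.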

> PERFORMANCE: reduce number of replicas
> --------------------------------------
>
>                 Key: PIG-856
>                 URL: https://issues.apache.org/jira/browse/PIG-856
>             Project: Pig
>          Issue Type: Improvement
>    Affects Versions: 0.3.0
>            Reporter: Olga Natkovich
>
> Pig currently uses the default number of replicas for data passed between MR jobs, which
> is 3. Given the temporary nature of this data, we should never need more than 2 and should
> explicitly set it to improve performance and to be nicer to the name node.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

