Date: Sun, 24 Apr 2011 10:02:00 -0700
Subject: SequenceFile.Sorter performance
From: Mark question
To: common-user

Hi guys,

I'm trying to sort a 2.5 GB sequence file in a single mapper using the built-in sort (SequenceFile.Sorter), but it takes so long that the map task is killed for not reporting progress.

I could increase the default timeout for progress reports from the mapper, but I'd only do that if sorting with SequenceFile.Sorter is known to be optimal.

Does anyone know? Or are there other suggested options?

Thanks,
Mark