Return-Path: X-Original-To: apmail-hadoop-common-user-archive@www.apache.org Delivered-To: apmail-hadoop-common-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 1D106D146 for ; Wed, 12 Dec 2012 01:01:39 +0000 (UTC) Received: (qmail 49455 invoked by uid 500); 12 Dec 2012 01:01:34 -0000 Delivered-To: apmail-hadoop-common-user-archive@hadoop.apache.org Received: (qmail 49353 invoked by uid 500); 12 Dec 2012 01:01:34 -0000 Mailing-List: contact user-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@hadoop.apache.org Delivered-To: mailing list user@hadoop.apache.org Received: (qmail 49341 invoked by uid 99); 12 Dec 2012 01:01:34 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 12 Dec 2012 01:01:34 +0000 X-ASF-Spam-Status: No, hits=-0.7 required=5.0 tests=RCVD_IN_DNSWL_LOW,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: domain of pat.ferrel@gmail.com designates 209.85.160.45 as permitted sender) Received: from [209.85.160.45] (HELO mail-pb0-f45.google.com) (209.85.160.45) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 12 Dec 2012 01:01:24 +0000 Received: by mail-pb0-f45.google.com with SMTP id mc8so65630pbc.18 for ; Tue, 11 Dec 2012 17:01:03 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=from:content-type:content-transfer-encoding:subject:message-id:date :to:mime-version:x-mailer; bh=hRffMBm7W7eFir7y7486xuZ/IcI0AXfQhna0AzAsaG4=; b=VAZgrZARgEw0+Y+aczSC53NKbHgnzFUxU8sKUbQDff5PTQHwnHoQKui/xKaJ1TnLsM CS4VZCkcFPL5Hd8ST5gaVQ5qRARV0vPAatpbzM4mJTIQkvujkHPpzsM7j302JaXz+I1q AFVCFLsAPgJJWELe+7SNqlPTNUscGb6vyKGQzB+n7y4NQeUS1VyEYH2Zz9xpSe5VZKOy HV6zYbngJU3GqHJwz52EPOEw+bChqNrN0BamPoyScwDD4hLuDBUa22SSayrhWfjIbaMR t+OaxFSmFodP4fDJWoK3p2UliX4YkrEjv3kyE9MxCmqH3wKm/VDxFyBqBsZgvsV3mJCr d0GQ== Received: by 10.66.88.198 with SMTP id bi6mr409073pab.54.1355274063461; Tue, 11 Dec 2012 17:01:03 -0800 (PST) Received: from [192.168.0.4] (c-98-232-27-186.hsd1.wa.comcast.net. [98.232.27.186]) by mx.google.com with ESMTPS id sz6sm14607570pbc.52.2012.12.11.17.01.01 (version=TLSv1/SSLv3 cipher=OTHER); Tue, 11 Dec 2012 17:01:02 -0800 (PST) From: Pat Ferrel Content-Type: text/plain; charset=windows-1252 Content-Transfer-Encoding: quoted-printable Subject: Hadoop 101 Message-Id: Date: Tue, 11 Dec 2012 16:49:45 -0800 To: "user@hadoop.apache.org" Mime-Version: 1.0 (Mac OS X Mail 6.2 \(1499\)) X-Mailer: Apple Mail (2.1499) X-Virus-Checked: Checked by ClamAV on apache.org Stupid question for the day=85 I have a file created by a mahout job of the form: 0 = [356:0.3481597,359:0.3481597,358:0.3481597,361:0.3481597,360:0.3481597] 8 = [356:0.34786037,359:0.34786037,358:0.34786037,361:0.34786037,360:0.3478603= 7] 25 = [284:0.34821576,286:0.34821576,287:0.34821576,288:0.34821576,289:0.3482157= 6] 28 = [452:0.34802154,454:0.34802154,453:0.34802154,456:0.34802154,455:0.3480215= 4] =85 If this were a SequenceFile I could read it and be merrily on my way but = it's a text file. The classes written are key, value pairs = but the file is tab delimited text.=20 I was hoping to do something like: SequenceFile.Reader reader =3D new SequenceFile.Reader(fs, inputFile, = conf); Writable userId =3D new LongWritable(); VectorWritable recommendations =3D new VectorWritable(); while (reader.next(userId, recommendations)) { //do something with each pair } But alas Google fails me. How do you read in key, values pairs from text = files outside of a map or reduce?=20