Date: Mon, 29 Nov 2010 17:13:38 -0500
Subject: Re: Image as input to M-R in Hadoop
From: Aravinth Bheemaraj
To: general@hadoop.apache.org

Michael, thanks a lot for your reply.

I need to compare the images based on their pixels. So is it possible to
process an image based on its pixel values rather than as XML records? I
have read somewhere that the class "InputFormat" can be customized to handle
images by extending "InputSplit" and "RecordReader", but I am unsure which
methods have to be overridden so that I can access the pixels of the image.
Is there any way you could help me with this?

Regarding the note, I am reading in a directory with multiple image files.

On Mon, Nov 29, 2010 at 4:08 PM, Michael Segel wrote:

> Hi,
> The short answer is yes you can process images in Hadoop.
> Think of the image as a multi-line byte stream.
>
> As to an existing class, I don't believe that it exists, but shouldn't be
> too difficult to cobble.
> (If you can read in XML records for processing you should be able to read
> in a file containing a series of images.)
>
> Note: I'm assuming that you're either reading in a directory w multiple
> image files, or an image file w multiple images. Otherwise you probably
> don't want to use Hadoop.
>
> > Date: Mon, 29 Nov 2010 14:56:35 -0500
> > Subject: Image as input to M-R in Hadoop
> > From: b.aravinth@gmail.com
> > To: general@hadoop.apache.org
> >
> > Hi,
> >
> > I am a beginner to Hadoop and I am looking for some help in implementing
> > the Mapper with an image as input. Is there any predefined Writable class
> > for processing image? If so, how do I use it?
> >
> > Also I have read somewhere that compressed formats cannot be processed in
> > Hadoop. If this is true, am I making any sense in saying that the JPEG
> > images (which are also compressed format) cannot be processed by Hadoop?
> > Please correct me if I have misunderstood this concept.
> >
> > Thanks,
> > --
> > Aravinth

--
Aravinth Bheemaraj
University of Florida
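
For reference, the pixel-access step asked about above can be sketched in
plain Java, independent of Hadoop. This is a minimal sketch under stated
assumptions, not an existing Hadoop class: the names ImagePixels,
decodePixels, countDifferingPixels and encodePng are hypothetical, and it
assumes the complete bytes of one image file are already in memory (in a
job, for example, delivered as a single record by a custom RecordReader
that reads each file whole, which is not shown here). It uses only
javax.imageio from the standard JDK. A compressed image such as a JPEG
generally has to be decoded from the complete file, so this approach
assumes each image is handled as one unsplit record.

```java
import java.awt.image.BufferedImage;
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.UncheckedIOException;
import javax.imageio.ImageIO;

// Hypothetical helper class; nothing here is part of the Hadoop API.
final class ImagePixels {

    static {
        // Keep ImageIO's stream cache in memory instead of temporary files.
        ImageIO.setUseCache(false);
    }

    /** Decodes the complete bytes of one image file into packed ARGB pixels, row by row. */
    static int[] decodePixels(byte[] imageBytes) {
        try {
            BufferedImage image = ImageIO.read(new ByteArrayInputStream(imageBytes));
            if (image == null) {
                // ImageIO.read returns null when no registered reader understands the data.
                throw new IllegalArgumentException("Unsupported or corrupt image data");
            }
            int width = image.getWidth();
            int height = image.getHeight();
            return image.getRGB(0, 0, width, height, null, 0, width);
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    /** Counts the positions at which two equally sized pixel arrays differ. */
    static int countDifferingPixels(int[] a, int[] b) {
        if (a.length != b.length) {
            throw new IllegalArgumentException("Images must have the same number of pixels");
        }
        int differing = 0;
        for (int i = 0; i < a.length; i++) {
            if (a[i] != b[i]) {
                differing++;
            }
        }
        return differing;
    }

    /** Encodes an image as PNG bytes; used here only to build sample input. */
    static byte[] encodePng(BufferedImage image) {
        try {
            ByteArrayOutputStream out = new ByteArrayOutputStream();
            if (!ImageIO.write(image, "png", out)) {
                throw new IllegalStateException("No PNG writer available");
            }
            return out.toByteArray();
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    public static void main(String[] args) {
        // Build a tiny 2x1 sample image in memory instead of reading a real file.
        BufferedImage sample = new BufferedImage(2, 1, BufferedImage.TYPE_INT_RGB);
        sample.setRGB(0, 0, 0xFF0000); // red
        sample.setRGB(1, 0, 0x0000FF); // blue

        int[] pixels = decodePixels(encodePng(sample));
        System.out.println("pixel count: " + pixels.length);
        System.out.println("first pixel: " + Integer.toHexString(pixels[0]));
        System.out.println("differences vs itself: " + countDifferingPixels(pixels, pixels));
    }
}
```

Inside a Mapper, the same decodePixels call could be applied to the byte
array carried by each input record; how those bytes reach the Mapper
depends on the InputFormat and RecordReader chosen, which this sketch
deliberately leaves open.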