Return-Path: Delivered-To: apmail-hadoop-general-archive@minotaur.apache.org Received: (qmail 94101 invoked from network); 29 Nov 2010 22:50:06 -0000 Received: from unknown (HELO mail.apache.org) (140.211.11.3) by 140.211.11.9 with SMTP; 29 Nov 2010 22:50:06 -0000 Received: (qmail 25337 invoked by uid 500); 29 Nov 2010 22:50:05 -0000 Delivered-To: apmail-hadoop-general-archive@hadoop.apache.org Received: (qmail 25114 invoked by uid 500); 29 Nov 2010 22:50:04 -0000 Mailing-List: contact general-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: general@hadoop.apache.org Delivered-To: mailing list general@hadoop.apache.org Received: (qmail 25102 invoked by uid 99); 29 Nov 2010 22:50:04 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 29 Nov 2010 22:50:04 +0000 X-ASF-Spam-Status: No, hits=-0.0 required=10.0 tests=RCVD_IN_DNSWL_NONE,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: domain of shrijeet@rocketfuelinc.com designates 209.85.161.48 as permitted sender) Received: from [209.85.161.48] (HELO mail-fx0-f48.google.com) (209.85.161.48) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 29 Nov 2010 22:49:58 +0000 Received: by fxm2 with SMTP id 2so3978538fxm.35 for ; Mon, 29 Nov 2010 14:49:37 -0800 (PST) MIME-Version: 1.0 Received: by 10.223.83.133 with SMTP id f5mr6019465fal.29.1291070976980; Mon, 29 Nov 2010 14:49:36 -0800 (PST) Received: by 10.223.96.131 with HTTP; Mon, 29 Nov 2010 14:49:36 -0800 (PST) In-Reply-To: References: Date: Mon, 29 Nov 2010 14:49:36 -0800 Message-ID: Subject: Re: Image as input to M-R in Hadoop From: Shrijeet Paliwal To: general@hadoop.apache.org Content-Type: text/plain; charset=ISO-8859-1 X-Virus-Checked: Checked by ClamAV on apache.org This gentleman here (see below) is doing a hadoop streaming magic and seems to be playing with the image features in map reducy way. Its not using hadoop's java api though, so no help there. Still you can check and see if the articles gives you some clues, http://techportal.ibuildings.com/2009/11/02/precision-color-searching-with-gmagick-and-amazon-elastic-mapreduce/ PS: Pardon if the motivation in the article is orthogonal to yours. -Shrijeet On Mon, Nov 29, 2010 at 2:13 PM, Aravinth Bheemaraj wrote: > Michael, thanks a lot for your reply. > > I got to compare the images based on pixels. So is it possible to process > the image based on Pixel values rather than XML records? > > I have read somewhere that the class "InputFormat" can be customized to > handle images by extending "InputSplit" and "RecordReader". But I am unsure > of the methods which are to be overridden so that I can access pixels of the > image. Is there anyway you can help me with this? > > Regarding the note, I am reading in a directory with multiple image files. > > On Mon, Nov 29, 2010 at 4:08 PM, Michael Segel wrote: > >> >> Hi, >> The short answer is yes you can process images in Hadoop. >> Think of the image as a multi-line byte stream. >> >> As to an existing class, I don't believe that it exists, but shouldn't be >> too difficult to cobble. >> (If you can read in XML records for processing you should be able to read >> in a file containing a series of images.) >> >> Note: I'm assuming that you're either reading in a directory w multiple >> image files, or an image file w multiple images. Otherwise you probably >> don't want to use Hadoop. >> >> >> > Date: Mon, 29 Nov 2010 14:56:35 -0500 >> > Subject: Image as input to M-R in Hadoop >> > From: b.aravinth@gmail.com >> > To: general@hadoop.apache.org >> > >> > Hi, >> > >> > I am a beginner to Hadoop and I am looking for some help in implementing >> the >> > Mapper with an image as input. Is there any predefined Writable class for >> > processing image? If so, how do I use it? >> > >> > Also I have read somewhere that compressed formats cannot be processed in >> > Hadoop. If this is true, am I making any sense in saying that the JPEG >> > images (which are also compressed format) cannot be processed by Hadoop? >> > Please correct me if I have misunderstood this concept. >> > >> > Thanks, >> > -- >> > Aravinth >> >> > > > > -- > Aravinth Bheemaraj > University of Florida >