Mailing-List: contact common-user-help@hadoop.apache.org; run by ezmlm
Precedence: bulk
Reply-To: common-user@hadoop.apache.org
Received-SPF: neutral (athena.apache.org: local policy)
MIME-Version: 1.0
In-Reply-To: <9C3ECE32-65E4-4250-99D6-427475581A74@gmail.com>
References: 
 <CANYdkkMHSXHZwwUZ7=PDTTEL2DSyWhUhboM8ruC+Y6CF4=K60Q@mail.gmail.com>
	<BADF391A-F4D8-4A5B-AD9E-EC07DAFD05C7@gmail.com>
	<CANYdkkNt3kgSV7XBVmb=HRJvri1WAK915i4uy1-aDtHUMQgV8Q@mail.gmail.com>
	<9C3ECE32-65E4-4250-99D6-427475581A74@gmail.com>
Date: Wed, 1 Feb 2012 22:13:15 -0600
Message-ID: 
 <CANYdkkPzgD9urExW3kvUrom8q6c_+2ypR82P=ELF3PeBnVb24g@mail.gmail.com>
Subject: Re: Can't achieve load distribution
From: Mark Kerzner <mark.kerzner@shmsoft.com>
To: common-user@hadoop.apache.org
Content-Type: multipart/alternative; boundary=f46d044786035c4d9c04b7f36a9c

--f46d044786035c4d9c04b7f36a9c
Content-Type: text/plain; charset=ISO-8859-1

Thanks!
Mark

On Wed, Feb 1, 2012 at 7:44 PM, Anil Gupta <anilgupta84@gmail.com> wrote:

> Yes, if ur block size is 64mb. Btw, block size is configurable in Hadoop.
>
> Best Regards,
> Anil
>
> On Feb 1, 2012, at 5:06 PM, Mark Kerzner <mark.kerzner@shmsoft.com> wrote:
>
> > Anil,
> >
> > do you mean one block of HDFS, like 64MB?
> >
> > Mark
> >
> > On Wed, Feb 1, 2012 at 7:03 PM, Anil Gupta <anilgupta84@gmail.com>
> wrote:
> >
> >> Do u have enough data to start more than one mapper?
> >> If entire data is less than a block size then only 1 mapper will run.
> >>
> >> Best Regards,
> >> Anil
> >>
> >> On Feb 1, 2012, at 4:21 PM, Mark Kerzner <mark.kerzner@shmsoft.com>
> wrote:
> >>
> >>> Hi,
> >>>
> >>> I have a simple MR job, and I want each Mapper to get one line from my
> >>> input file (which contains further instructions for lengthy
> processing).
> >>> Each line is 100 characters long, and I tell Hadoop to read only 100
> >> bytes,
> >>>
> >>>
> >>
> job.getConfiguration().setInt("mapreduce.input.linerecordreader.line.maxlength",
> >>> 100);
> >>>
> >>> I see that this part works - it reads only one line at a time, and if I
> >>> change this parameter, it listens.
> >>>
> >>> However, on a cluster only one node receives all the map tasks. Only
> one
> >>> map tasks is started. The others never get anything, they just wait.
> I've
> >>> added 100 seconds wait to the mapper - no change!
> >>>
> >>> Any advice?
> >>>
> >>> Thank you. Sincerely,
> >>> Mark
> >>
>

--f46d044786035c4d9c04b7f36a9c--