From: Steve Lewis <lordjoe2000@gmail.com>
To: mapreduce-user@hadoop.apache.org
Date: Mon, 7 May 2012 09:24:05 -0700
Subject: Re: Ant Colony Optimization for Travelling Salesman Problem in Hadoop

Fair enough - I write a lot of InputFormats, since for most of my problems a
line of text is not the proper unit. I read FASTA files (read lines until you
hit a line starting with ">") and XML fragments (read until you hit a closing
tag).

On Mon, May 7, 2012 at 9:03 AM, GUOJUN Zhu wrote:
>
> The default FileInputFormat splits the file according to its size. If you
> use line-oriented text data, TextInputFormat respects the line structure of
> the input. We got splits as small as a few KB. File splitting is a tricky
> business, especially when you want it to respect your logical boundaries.
> It is better to use the existing battle-tested code than to invent your own
> wheel.
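The "read lines until you hit a line starting with >" rule Steve describes is the core of a FASTA-style record reader. A minimal sketch of that grouping logic in plain Java follows; the class and method names are illustrative, and in a real Hadoop InputFormat this would live inside a custom RecordReader rather than operate on an in-memory list:

```java
import java.util.ArrayList;
import java.util.List;

// Group raw lines into FASTA-style records: each record starts at a line
// beginning with '>' and runs until the next such line (or end of input).
// This is the record-boundary logic only, stripped of Hadoop boilerplate.
public class FastaGrouper {
    public static List<String> groupRecords(List<String> lines) {
        List<String> records = new ArrayList<>();
        StringBuilder current = null;
        for (String line : lines) {
            if (line.startsWith(">")) {
                if (current != null) records.add(current.toString());
                current = new StringBuilder(line);   // start a new record
            } else if (current != null) {
                current.append('\n').append(line);   // continue current record
            }
        }
        if (current != null) records.add(current.toString());
        return records;
    }
}
```

The same shape works for the XML-fragment case: swap the "starts with >" test for a "saw the closing tag" test.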
>
> Zhu, Guojun
> Modeling Sr Graduate
> 571-3824370
> guojun_zhu@freddiemac.com
> Financial Engineering
> Freddie Mac
>
> On 05/07/2012 11:17 AM, Steve Lewis wrote:
>
> Yes, but it is the job of the InputFormat code to implement that behavior.
> It is not necessary to do so, and in other cases I choose to create more
> mappers when each mapper has a lot of work.
>
> On Mon, May 7, 2012 at 7:54 AM, GUOJUN Zhu <guojun_zhu@freddiemac.com>
> wrote:
>
> We are using the old API of 0.20. I think when you set
> "mapred.reduce.tasks" to a certain number N and use FileInputFormat, the
> default behavior is that any file will be split into that number, N, with
> each split smaller than the default block size. Of course, other
> restrictions, such as "mapred.min.split.size", cannot be set too large
> (the default is as small as possible, I think).
>
> Zhu, Guojun
> Modeling Sr Graduate
> 571-3824370
> guojun_zhu@freddiemac.com
> Financial Engineering
> Freddie Mac
>
> On 05/05/2012 11:37 AM, sharat attupurath <sharat_a@hotmail.com> wrote:
>
> Since the input files are very small, the default input formats in Hadoop
> all generate just a single InputSplit, so only a single map task is
> executed, and we won't have much parallelism.
>
> I was thinking of writing an InputFormat that would read the whole file as
> an InputSplit and replicate this input split n times (where n would be the
> number of ants in a single stage) so that we'll have n mappers.
> Also, I want the input format to return the value as the adjacency matrix
> of the graph (calculating it from the coordinates in the input file).
>
> But I can't find a way to do this. Is it possible? Or is it better to just
> have the input as Text and create the adjacency matrix in the mappers?
>
> ------------------------------
> Date: Sat, 5 May 2012 08:20:34 -0700
> Subject: Re: Ant Colony Optimization for Travelling Salesman Problem in
> Hadoop
> From: lordjoe2000@gmail.com
> To: mapreduce-user@hadoop.apache.org
>
> Yes - if you know how, you can put it in the distributed cache, or if it is
> small, put it in the config as a String, or have all InputFormats read it
> from somewhere.
>
> On Sat, May 5, 2012 at 8:08 AM, sharat attupurath <sharat_a@hotmail.com>
> wrote:
>
> I looked at both the files. In AbstractNShotInputFormat it is mentioned
> that this input format does not read from files. My input is in a text
> file. I want the whole file as a single record. So is it enough if I just
> copy the contents of the file and return it as a string from
> getValueFromIndex()?
>
> ------------------------------
> Date: Fri, 4 May 2012 13:15:46 -0700
> Subject: Re: Ant Colony Optimization for Travelling Salesman Problem in
> Hadoop
> From: lordjoe2000@gmail.com
> To: mapreduce-user@hadoop.apache.org
>
> Look at NShotInputFormat:
> call setNumberKeys to set the number of ants, then change the method
> getValueFromIndex to return the text representing the original problem to
> each mapper.
> What happens in the second and third jobs is an exercise, but the saved
> state needs to have the pheromones and the original problem.
>
> On Fri, May 4, 2012 at 9:54 AM, sharat attupurath <sharat_a@hotmail.com>
> wrote:
>
> Hi,
>
> Thanks a lot for the quick reply! We are new to Apache Hadoop and haven't
> understood it properly yet. Can you please elaborate on how we can have
> multiple stages of MapReduce jobs for combining the trails, as you have
> mentioned?
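The "N-shot" idea under discussion - handing the same small problem text to N mappers - amounts to emitting N (antIndex, problemText) records. A sketch of that replication logic in plain Java is below; in a real Hadoop InputFormat this would live in getSplits() and the RecordReader, and the class and method names here are illustrative, not the NShotInputFormat API:

```java
import java.util.AbstractMap.SimpleEntry;
import java.util.ArrayList;
import java.util.List;
import java.util.Map;

// Replicate one small problem description into N keyed records, one per
// ant, so that N mappers each receive the whole problem. Illustrative
// stand-in for the getSplits()/getValueFromIndex() logic in the thread.
public class NShotSplitter {
    public static List<Map.Entry<Integer, String>> replicate(String problemText, int numAnts) {
        List<Map.Entry<Integer, String>> records = new ArrayList<>();
        for (int ant = 0; ant < numAnts; ant++) {
            // key = ant index, value = the full (unsplit) problem text
            records.add(new SimpleEntry<>(ant, problemText));
        }
        return records;
    }
}
```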
>
> We have been trying to find out how to write a custom splitter, and almost
> all online resources say to subclass FileInputFormat and write only a
> custom RecordReader. Will it be possible to generate our splits in that
> way?
>
> Regards
>
> Sharat
>
> ------------------------------
> Date: Fri, 4 May 2012 09:22:51 -0700
> Subject: Re: Ant Colony Optimization for Travelling Salesman Problem in
> Hadoop
> From: lordjoe2000@gmail.com
> To: mapreduce-user@hadoop.apache.org
>
> On Fri, May 4, 2012 at 9:06 AM, sharat attupurath <sharat_a@hotmail.com>
> wrote:
>
> Hi,
>
> We are trying to parallelize the ant colony optimization algorithm for TSP
> over Hadoop and are facing some issues. We are using TSPLIB as input
> files. The input is a text file containing the Euclidean coordinates of
> the cities - the first column is the city number and the next two columns
> contain the x and y coordinates, respectively.
>
> What we intend to do is take input from this single file, send copies of
> it to multiple mappers (each mapper acts like an ant in the algorithm),
> have each mapper work on its input to find its own TSP solution that it
> outputs, and finally have the reducer output the smallest tour found by
> the mappers. Hope we are on the right track. Here are the issues:
>
> 1) Since the input file is small, we need to force Hadoop to fire up
> multiple map tasks by replicating the input. How can we make an InputSplit
> of the whole file and replicate it so that the input can be sent to
> multiple mappers?
>
> Write a custom splitter sending the same data to all mappers - the only
> critical criterion is the number of "ants".
>
> 2) The algorithm uses a shared pheromone array, and each mapper needs to
> read and write data from it. How can we share the pheromone data across
> the mappers?
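Turning the (x, y) city coordinates described above into the adjacency matrix sharat asks about is a direct pairwise Euclidean-distance computation. A minimal sketch, assuming the coordinates have already been parsed out of the TSPLIB file into an array (the class name is illustrative):

```java
// Build the symmetric Euclidean distance (adjacency) matrix from city
// coordinates, where xy[i] = {x_i, y_i} as parsed from a TSPLIB file.
public class TsplibMatrix {
    public static double[][] adjacency(double[][] xy) {
        int n = xy.length;
        double[][] d = new double[n][n];
        for (int i = 0; i < n; i++) {
            for (int j = 0; j < n; j++) {
                // hypot(dx, dy) = sqrt(dx^2 + dy^2), the Euclidean distance
                d[i][j] = Math.hypot(xy[i][0] - xy[j][0], xy[i][1] - xy[j][1]);
            }
        }
        return d;
    }
}
```

Whether this runs in the InputFormat or in each mapper, the computation is the same; doing it in the mappers keeps the input format a plain Text reader.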
> You cannot share data across mappers, and you should not attempt to do so.
> Better to use the reducer(s) to combine the first-pass trails, and then
> pass the combined trails to another MapReduce job, along with the original
> problem plus the current pheromone trails.
>
> Hope the questions are clear enough. Any help would be greatly
> appreciated.
>
> Thank you
>
> Regards
>
> Sharat
>
> --
> Steven M. Lewis PhD
> 4221 105th Ave NE
> Kirkland, WA 98033
> 206-384-1340 (cell)
> Skype lordjoe_com

--
Steven M. Lewis PhD
4221 105th Ave NE
Kirkland, WA 98033
206-384-1340 (cell)
Skype lordjoe_com
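The reducer-side combine Steve suggests - merging each ant's trail into one pheromone state to feed the next MapReduce round - can be sketched as an element-wise merge. The evaporation step and flat-array layout here are illustrative assumptions (a standard ACO update), not details from the thread:

```java
import java.util.List;

// Merge per-ant pheromone deposits into the next trail state:
// next[i] = (1 - evaporation) * current[i] + sum of all ants' deposits[i].
// This is the kind of combine a reducer could do between job stages;
// the evaporation rate and array layout are illustrative assumptions.
public class PheromoneCombiner {
    public static double[] combine(double[] current, List<double[]> deposits, double evaporation) {
        double[] next = new double[current.length];
        for (int i = 0; i < current.length; i++) {
            next[i] = (1.0 - evaporation) * current[i];  // evaporate old trail
            for (double[] d : deposits) {
                next[i] += d[i];                          // add each ant's deposit
            }
        }
        return next;
    }
}
```

The combined array, serialized alongside the original problem, becomes the input to the next job in the chain.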