Mailing-List: contact user-help@hadoop.apache.org; run by ezmlm
Precedence: bulk
Reply-To: user@hadoop.apache.org
Received-SPF: neutral (nike.apache.org: local policy)
MIME-Version: 1.0
In-Reply-To: 
 <14678817.328.1349921743999.JavaMail.lancenorskog@Lance-Norskogs-MacBook-Pro.local>
References: <B1D0172B-9F26-4EC6-AB93-4D15DF767597@maprtech.com>
	<14678817.328.1349921743999.JavaMail.lancenorskog@Lance-Norskogs-MacBook-Pro.local>
Date: Wed, 10 Oct 2012 21:26:47 -0500
Message-ID: 
 <CANYdkkO_TR94jXasbmrqBSmK0Yxby-W=y3qo+OLA5Q5Oyi8UCA@mail.gmail.com>
Subject: Re: Hadoop/Lucene + Solr architecture suggestions?
From: Mark Kerzner <mark.kerzner@shmsoft.com>
To: user@hadoop.apache.org
Content-Type: multipart/alternative; boundary=bcaec50165f793b72504cbbf4de9

--bcaec50165f793b72504cbbf4de9
Content-Type: text/plain; charset=ISO-8859-1

That is very interesting, Lance, thank you.

Mark

On Wed, Oct 10, 2012 at 9:15 PM, Lance Norskog <goksron@gmail.com> wrote:

> In the LucidWorks Big Data product, we handle this with a reducer that
> sends documents to a SolrCloud cluster. This way the index files are not
> managed by Hadoop.
>
> ----- Original Message -----
> | From: "Ted Dunning" <tdunning@maprtech.com>
> | To: user@hadoop.apache.org
> | Cc: "Hadoop User" <user@hadoop.apache.org>
> | Sent: Wednesday, October 10, 2012 7:58:57 AM
> | Subject: Re: Hadoop/Lucene + Solr architecture suggestions?
> |
> | I prefer to create indexes in the reducer personally.
> |
> | Also you can avoid the copies if you use an advanced hadoop-derived
> | distro. Email me off list for details.
> |
> | Sent from my iPhone
> |
> | On Oct 9, 2012, at 7:47 PM, Mark Kerzner <mark.kerzner@shmsoft.com>
> | wrote:
> |
> | > Hi,
> | >
> | > if I create a Lucene index in each mapper, locally, then copy them
> | > to under /jobid/mapid1, /jodid/mapid2, and then in the reducers
> | > copy them to some Solr machine (perhaps even merging), does such
> | > architecture makes sense, to create a searchable index with
> | > Hadoop?
> | >
> | > Are there links for similar architectures and questions?
> | >
> | > Thank you. Sincerely,
> | > Mark
> |
>

--bcaec50165f793b72504cbbf4de9
Content-Type: text/html; charset=ISO-8859-1
Content-Transfer-Encoding: quoted-printable

That is very interesting, Lance, thank you.<br><br>Mark<br><br><div class=
=3D"gmail_quote">On Wed, Oct 10, 2012 at 9:15 PM, Lance Norskog <span dir=
=3D"ltr">&lt;<a href=3D"mailto:goksron@gmail.com" target=3D"_blank">goksron=
@gmail.com</a>&gt;</span> wrote:<br>
<blockquote class=3D"gmail_quote" style=3D"margin:0 0 0 .8ex;border-left:1p=
x #ccc solid;padding-left:1ex">In the LucidWorks Big Data product, we handl=
e this with a reducer that sends documents to a SolrCloud cluster. This way=
 the index files are not managed by Hadoop.<br>

<div class=3D"im HOEnZb"><br>
----- Original Message -----<br>
| From: &quot;Ted Dunning&quot; &lt;<a href=3D"mailto:tdunning@maprtech.com=
">tdunning@maprtech.com</a>&gt;<br>
| To: <a href=3D"mailto:user@hadoop.apache.org">user@hadoop.apache.org</a><=
br>
| Cc: &quot;Hadoop User&quot; &lt;<a href=3D"mailto:user@hadoop.apache.org"=
>user@hadoop.apache.org</a>&gt;<br>
</div><div class=3D"im HOEnZb">| Sent: Wednesday, October 10, 2012 7:58:57 =
AM<br>
| Subject: Re: Hadoop/Lucene + Solr architecture suggestions?<br>
|<br>
</div><div class=3D"HOEnZb"><div class=3D"h5">| I prefer to create indexes =
in the reducer personally.<br>
|<br>
| Also you can avoid the copies if you use an advanced hadoop-derived<br>
| distro. Email me off list for details.<br>
|<br>
| Sent from my iPhone<br>
|<br>
| On Oct 9, 2012, at 7:47 PM, Mark Kerzner &lt;<a href=3D"mailto:mark.kerzn=
er@shmsoft.com">mark.kerzner@shmsoft.com</a>&gt;<br>
| wrote:<br>
|<br>
| &gt; Hi,<br>
| &gt;<br>
| &gt; if I create a Lucene index in each mapper, locally, then copy them<b=
r>
| &gt; to under /jobid/mapid1, /jodid/mapid2, and then in the reducers<br>
| &gt; copy them to some Solr machine (perhaps even merging), does such<br>
| &gt; architecture makes sense, to create a searchable index with<br>
| &gt; Hadoop?<br>
| &gt;<br>
| &gt; Are there links for similar architectures and questions?<br>
| &gt;<br>
| &gt; Thank you. Sincerely,<br>
| &gt; Mark<br>
|<br>
</div></div></blockquote></div><br>

--bcaec50165f793b72504cbbf4de9--