lucene-solr-user mailing list archives

From Mark Miller <markrmil...@gmail.com>
Subject Re: Inconsistent Results with ZooKeeper Ensemble and Four SOLR Cloud Nodes
Date Mon, 27 Feb 2012 19:47:30 GMT

On Feb 27, 2012, at 2:22 PM, Matthew Parker wrote:

> Thanks for your reply Mark.
> 
> I believe the build was towards the beginning of the month. The
> solr.spec.version is 4.0.0.2012.01.10.38.09
> 
> I cannot access the clusterstate.json contents. I clicked on it a couple of
> times, but nothing happens. Is that stored on disk somewhere?

Are you using the new admin UI? That has recently been updated to work better with cloud -
it had some troubles not too long ago. If you are, you should try using the old admin UI's
zookeeper page - that should show the cluster state.
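
If neither UI page will open the node, the cluster state can also be read straight
from ZooKeeper. A minimal sketch, assuming the third-party kazoo client is installed,
that the state lives at the /clusterstate.json znode, and that one ensemble member
listens on localhost:2181 (host, port and function names are placeholders, not taken
from this thread):

```python
import json


def format_cluster_state(raw_bytes):
    """Pretty-print the JSON payload read from the cluster state znode."""
    state = json.loads(raw_bytes.decode("utf-8"))
    return json.dumps(state, indent=2, sort_keys=True)


def fetch_cluster_state(zk_hosts="localhost:2181", path="/clusterstate.json"):
    """Read the znode from ZooKeeper; host, port and path are assumptions."""
    # kazoo is a third-party client (pip install kazoo). It is imported lazily
    # so the formatting helper above works without it.
    from kazoo.client import KazooClient

    zk = KazooClient(hosts=zk_hosts)
    zk.start()
    try:
        data, _stat = zk.get(path)
    finally:
        zk.stop()
    return format_cluster_state(data)
```

Calling print(fetch_cluster_state("localhost:2181")) against a running ensemble should
dump the same content the zookeeper page is meant to show.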

That being said, there have been a lot of bug fixes over the past month - so you may just want
to update to a recent version.
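
Either way, one way to narrow down inconsistent results is to ask each core for its
own document count with distrib=false, so the request is not fanned out to the other
shards. A rough sketch using only the standard library; the base URLs are
hypothetical and have to match whatever ports the four nodes actually use:

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen


def core_count_url(base_url, query="*:*"):
    """Build a select URL that asks one core for its own document count."""
    params = urlencode(
        {"q": query, "rows": 0, "wt": "json", "distrib": "false"}
    )
    return base_url.rstrip("/") + "/select?" + params


def core_count(base_url, query="*:*"):
    """Return numFound as reported by the single core at base_url."""
    with urlopen(core_count_url(base_url, query), timeout=10) as resp:
        body = json.loads(resp.read().decode("utf-8"))
    return body["response"]["numFound"]


# Hypothetical usage, one base URL per node:
# for url in ["http://localhost:8983/solr", "http://localhost:7574/solr"]:
#     print(url, core_count(url))
```

Replicas of the same shard should report the same count, and the counts of one
replica per shard should add up to the total a normal distributed query returns.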

> 
> I configured a custom request handler to calculate a unique document id
> based on the file's url.
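
For reference, the general idea of a URL-derived id can be sketched as below. This is
only an illustration and not the handler described above: the function name and the
choice of SHA-1 are assumptions. The point is that the same URL must always produce
the same id, so that re-indexing a file replaces the earlier document instead of
adding a second one.

```python
import hashlib


def doc_id_from_url(url):
    """Derive a stable document id from a file URL (hypothetical scheme)."""
    # Strip surrounding whitespace so the same URL always hashes identically.
    normalized = url.strip()
    return hashlib.sha1(normalized.encode("utf-8")).hexdigest()
```

If the id changes between runs for the same file, duplicates and uneven counts across
nodes would be an expected symptom.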
> 
> On Mon, Feb 27, 2012 at 1:13 PM, Mark Miller <markrmiller@gmail.com> wrote:
> 
>> Hey Matt - is your build recent?
>> 
>> Can you visit the cloud/zookeeper page in the admin and send the contents
>> of the clusterstate.json node?
>> 
>> Are you using a custom index chain or anything out of the ordinary?
>> 
>> 
>> - Mark
>> 
>> On Feb 27, 2012, at 12:26 PM, Matthew Parker wrote:
>> 
>>> TWIMC:
>>> 
>>> Environment
>>> =========
>>> Apache SOLR rev-1236154
>>> Apache Zookeeper 3.3.4
>>> Windows 7
>>> JDK 1.6.0_23.b05
>>> 
>>> I have built a SOLR Cloud instance with 4 nodes using the embedded Jetty
>>> servers.
>>> 
>>> I created a 3 node zookeeper ensemble to manage the solr configuration data.
>>> 
>>> All the instances run on one server so I've had to move ports around for
>>> the various applications.
>>> 
>>> I start the 3 zookeeper nodes.
>>> 
>>> I started the first instance of solr cloud with the parameter to have two
>>> shards.
>>> 
>>> Then I start the remaining 3 solr nodes.
>>> 
>>> The system comes up fine. No errors thrown.
>>> 
>>> I can view the solr cloud console and I can see the SOLR configuration
>>> files managed by ZooKeeper.
>>> 
>>> I published data into the SOLR Cloud instances from SharePoint using Apache
>>> Manifold 0.4-incubating. Manifold is set up to publish the data into
>>> collection1, which is the only collection defined in the cluster.
>>> 
>>> When I query the data from collection1 as per the solr wiki, the results
>>> are inconsistent. Sometimes all the results are there, other times nothing
>>> comes back at all.
>>> 
>>> It seems to be having an issue auto replicating the data across the cloud.
>>> 
>>> Is there some specific setting I might have missed? Based upon what I read,
>>> I thought that SOLR cloud would take care of distributing and replicating
>>> the data automatically. Do you have to tell it what shard to publish the
>>> data into as well?
>>> 
>>> Any help would be appreciated.
>>> 
>>> Thanks,
>>> 
>>> Matt
>>> 
>>> ------------------------------
>>> This e-mail and any files transmitted with it may be proprietary.
>>> Please note that any views or opinions presented in this e-mail are solely
>>> those of the author and do not necessarily represent those of Apogee
>>> Integration.
>> 
>> - Mark Miller
>> lucidimagination.com
> 
> 
> -- 
> Regards,
> 
> Matt Parker (CTR)
> Senior Software Architect
> Apogee Integration, LLC
> 5180 Parkstone Drive, Suite #160
> Chantilly, Virginia 20151
> 703.272.4797 (site)
> 703.474.1918 (cell)
> www.apogeeintegration.com
> 
> ------------------------------
> This e-mail and any files transmitted with it may be proprietary.  Please note that any
> views or opinions presented in this e-mail are solely those of the author and do not necessarily
> represent those of Apogee Integration.

- Mark Miller
lucidimagination.com