lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Tarr, Gregory" <Gregory.t...@detica.com>
Subject RE: Corrupt segments file full of zeros
Date Tue, 28 Jun 2011 11:55:24 GMT
The segments file containing lots of zeros means that the index has no
segments.

We could run the following to check this:

SegmentInfos sis = new SegmentInfos();
sis.read(indexDir);
int numSegments = sis.size();
if (numSegments < 1) { // index has no segments }

Greg 

-----Original Message-----
From: Uwe Schindler [mailto:uwe@thetaphi.de] 
Sent: 28 June 2011 12:33
To: java-user@lucene.apache.org
Subject: RE: Corrupt segments file full of zeros

So where is the problem at all? Why should a segments file not contain
lots of zeroes? If the index is not corrupt all is fine.

-----
Uwe Schindler
H.-H.-Meier-Allee 63, D-28213 Bremen
http://www.thetaphi.de
eMail: uwe@thetaphi.de


> -----Original Message-----
> From: Tarr, Gregory [mailto:Gregory.tarr@detica.com]
> Sent: Tuesday, June 28, 2011 11:56 AM
> To: java-user@lucene.apache.org
> Subject: RE: Corrupt segments file full of zeros
> 
> Yes I have done that, and you just get "No problems were detected with
this
> index"
> 
> Surely there is a major problem with this index?
> 
> Also the check() procedure takes a long time - is there any way you 
> can
just
> do a health check on the segments file?
> 
> Thanks
> 
> Greg
> 
> -----Original Message-----
> From: Shai Erera [mailto:serera@gmail.com]
> Sent: 28 June 2011 10:36
> To: java-user@lucene.apache.org
> Subject: Re: Corrupt segments file full of zeros
> 
> You can try the CheckIndex tool. You feed it a directory and call
> .check() and it reports the results.
> 
> Shai
> 
> On Tue, Jun 28, 2011 at 11:46 AM, Tarr, Gregory
> <Gregory.tarr@detica.com>wrote:
> 
> > We have a problem with our fileserver where our indexes are hosted 
> > remotely, using Lucene 2.9.3.
> >
> > This can mean that a segments file is written which is full of ASCII

> > zeros. Using the od -ah command, we get:
> >
> > 0000000 nul nul nul nul nul nul nul....etc
> >
> > If opened in Luke, the index opens successfully but has zero
> documents.
> >
> > Why does this open correctly in luke, and is there a procedure in 
> > the lucene code that can verify a segments file, e.g. check whether 
> > it refers to any segments?
> >
> > Thanks
> >
> > Greg
> >
> >
> > Please consider the environment before printing this email.
> >
> > This message should be regarded as confidential. If you have 
> > received this email in error please notify the sender and destroy it
> immediately.
> >
> > Statements of intent shall only become binding when confirmed in 
> > hard copy by an authorised signatory.  The contents of this email 
> > may relate to dealings with other companies under the control of 
> > Detica Limited, details of which can be found at
> http://www.detica.com/statutory-information.
> >
> > Detica Limited is registered in England under No: 1337451.
> > Registered offices: Surrey Research Park, Guildford, Surrey, GU2 
> > 7YP, England.
> >
> >
> Please consider the environment before printing this email.
> 
> This message should be regarded as confidential. If you have received 
> this email in error please notify the sender and destroy it
immediately.
> 
> Statements of intent shall only become binding when confirmed in hard 
> copy by an authorised signatory.  The contents of this email may 
> relate to
dealings
> with other companies under the control of Detica Limited, details of 
> which can be found at http://www.detica.com/statutory-information.
> 
> Detica Limited is registered in England under No: 1337451.
> Registered offices: Surrey Research Park, Guildford, Surrey, GU2 7YP, 
> England.
> 
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
> For additional commands, e-mail: java-user-help@lucene.apache.org



---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-user-help@lucene.apache.org

Please consider the environment before printing this email.

This message should be regarded as confidential. If you have received this email in error
please notify the sender and destroy it immediately.

Statements of intent shall only become binding when confirmed in hard copy by an authorised
signatory.  The contents of this email may relate to dealings with other companies under the
control of Detica Limited, details of which can be found at http://www.detica.com/statutory-information.

Detica Limited is registered in England under No: 1337451.
Registered offices: Surrey Research Park, Guildford, Surrey, GU2 7YP, England.


---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-user-help@lucene.apache.org


Mime
View raw message