nutch-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Zennet Colburn <zen...@gmail.com>
Subject Mergesegs Severe Errors
Date Mon, 02 May 2005 21:59:11 GMT
Hello Nutch Users, 

I've got about 25 segments and I am trying to merge them. No matter
which way I try I run into fatal errors. If you have an idea of what
is wrong any help is appreciated. I included some commands and the
errors they produce.


$nutch mergesegs my-segments/ -o output_segment_dir -m
my-segments/20050309012353
...
050502 164206 Segment 20050413210403: 199 entries.
050502 164206 Segment 20050413201837: 0 entries.
050502 164206 TOTAL 858045 input entries in 59 segments.
050502 164206 Looking for master index in segments-tripset/20050309012353
050502 164206 SEVERE No master index, and createMaster == false


$nutch mergesegs my-segments -o output_segment_dir -cm
(works initially but after 4 minutes this error occurs)
...
java.io.FileNotFoundException:
my-segments/20050118104221/index/_1rq.f5 (Too many open files)
        at java.io.RandomAccessFile.open(Native Method)
        at java.io.RandomAccessFile.<init>(RandomAccessFile.java:204)
        at org.apache.lucene.store.FSInputStream$Descriptor.<init>(FSDirectory.java:376)
        at org.apache.lucene.store.FSInputStream.<init>(FSDirectory.java:405)
        at org.apache.lucene.store.FSDirectory.openFile(FSDirectory.java:268)
        at org.apache.lucene.index.SegmentReader.openNorms(SegmentReader.java:369)
        at org.apache.lucene.index.SegmentReader.initialize(SegmentReader.java:122)
        at org.apache.lucene.index.SegmentReader.<init>(SegmentReader.java:94)
        at org.apache.lucene.index.IndexWriter.mergeSegments(IndexWriter.java:480)
        at org.apache.lucene.index.IndexWriter.maybeMergeSegments(IndexWriter.java:458)
        at org.apache.lucene.index.IndexWriter.addDocument(IndexWriter.java:310)
        at org.apache.lucene.index.IndexWriter.addDocument(IndexWriter.java:294)
        at net.nutch.indexer.IndexSegment.indexPages(IndexSegment.java:121)
        at net.nutch.indexer.IndexSegment.main(IndexSegment.java:223)
        at net.nutch.tools.SegmentMergeTool.run(SegmentMergeTool.java:202)
        at net.nutch.tools.SegmentMergeTool.main(SegmentMergeTool.java:358)
050502 165126 SEVERE my-segments/20050118104221/index/_1rq.f5 (Too
many open files)

zennet

Mime
View raw message