lucene-solr-user mailing list archives

From Marcello Lorenzi <>
Subject SolR vs large PDF
Date Wed, 27 Nov 2013 15:00:48 GMT
Hi All,
In our test environment we have implemented a new search engine based on
Solr 4.3, with 2 instances hosted on different servers and 1 shard
present in each servlet container.

During stress testing we noticed a bottleneck when crawling large
PDF files: it blocks the serving of query results from the collections.

Is it possible to speed up the crawling, or otherwise mitigate the
overhead created by PDFBox while large PDFs are being processed?

