lucene-solr-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Erick Erickson <erickerick...@gmail.com>
Subject Re: Performance issues when querying on large documents
Date Sat, 24 Jul 2010 22:53:14 GMT
What are you returning? I'd be quite surprised if it was the search, so
first I'd look elsewhere. In particular, are you returning all 1,000 pages?
What happens if you specify returning a small field (the fl= parameter).

Also, look at the debug output of the query, it breaks down the various
phases of the query processing and that might give you a hint.

If none of that does the trick, please post the query and the relevant parts
of your schema as well as debug output...

Best
Erick

On Fri, Jul 23, 2010 at 2:52 PM, ahammad <ahmed.hammad@gmail.com> wrote:

>
> Hello,
>
> I have an index with lots of different types of documents. One of those
> types basically contains extracts of PDF docs. Some of those PDFs can have
> 1000+ pages, so there would be a lot of stuff to search through.
>
> I am experiencing really terrible performance when querying. My whole index
> has about 270k documents, but less than 1000 of those are the PDF extracts.
> The slow querying occurs when I search only on those PDF extracts (by
> specifying filters), and return 100 results. The 100 results definitely
> adds
> to the issue, but even cutting that down can be slow.
>
> Is there a way to improve querying with such large results? To give an
> idea,
> querying for a single word can take a little over a minute, which isn't
> really viable for an application that revolves around searching. For now, I
> have limited the results to 20, which makes the query execute in roughly
> 10-15 seconds. However, I would like to have the option of returning 100
> results.
>
> Thanks a lot.
>
>
> --
> View this message in context:
> http://lucene.472066.n3.nabble.com/Performance-issues-when-querying-on-large-documents-tp990590p990590.html
> Sent from the Solr - User mailing list archive at Nabble.com.
>

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message