Date: Fri, 22 Jan 2016 09:18:36 +0800
Subject: One complex wildcard query lead solr OOM
From: Jian Mou
To: solr-user@lucene.apache.org

We are using Solr as our search engine, and we recently noticed that some user-entered wildcard queries can send Solr into what looks like an endless loop in org.apache.lucene.util.automaton.Operations.determinize(). The query also consumes memory until Solr finally runs out of memory (OOM).

The wildcard query looks like *?????????-???????o·???è??*.

Although we can validate the input parameter ourselves, I also wonder whether there is any configuration that can disable complex wildcard queries like this one, which lead to severe performance problems.

Related stack trace:

[image: Inline image 1]

Thanks,
Jian
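For the input validation mentioned above, here is a minimal sketch in Python of the kind of pre-check a client could run before forwarding a term to Solr. The function names and both limits are hypothetical values chosen for illustration; they are not Solr or Lucene settings, and the right thresholds would depend on the index and the hardware.

```python
# Hypothetical client-side guard; neither limit is a Solr/Lucene setting.
MAX_WILDCARDS = 5      # assumed cap on unescaped '?' and '*' per term
MAX_TERM_LENGTH = 64   # assumed cap on the length of a single term


def count_wildcards(term: str) -> int:
    """Count unescaped '?' and '*' characters in a query term."""
    count = 0
    escaped = False
    for ch in term:
        if escaped:
            # The previous character was a backslash, so this one is literal.
            escaped = False
        elif ch == "\\":
            escaped = True
        elif ch in "?*":
            count += 1
    return count


def is_wildcard_term_allowed(term: str,
                             max_wildcards: int = MAX_WILDCARDS,
                             max_length: int = MAX_TERM_LENGTH) -> bool:
    """Return True if the term looks cheap enough to forward to Solr."""
    if len(term) > max_length:
        return False
    return count_wildcards(term) <= max_wildcards


if __name__ == "__main__":
    # The query shape reported in this message, and an ordinary prefix query.
    for term in ["*?????????-???????o·???è??*", "solr*"]:
        verdict = "allow" if is_wildcard_term_allowed(term) else "reject"
        print(f"{verdict}: {term}")
```

A check like this only limits what reaches Solr from one client; it does not answer whether a server-side setting exists, which is still the open question here.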