Return-Path: X-Original-To: apmail-lucene-solr-user-archive@minotaur.apache.org Delivered-To: apmail-lucene-solr-user-archive@minotaur.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 49F2D18C17 for ; Thu, 20 Aug 2015 03:14:20 +0000 (UTC) Received: (qmail 99158 invoked by uid 500); 20 Aug 2015 03:14:17 -0000 Delivered-To: apmail-lucene-solr-user-archive@lucene.apache.org Received: (qmail 99080 invoked by uid 500); 20 Aug 2015 03:14:17 -0000 Mailing-List: contact solr-user-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: solr-user@lucene.apache.org Delivered-To: mailing list solr-user@lucene.apache.org Received: (qmail 99068 invoked by uid 99); 20 Aug 2015 03:14:16 -0000 Received: from Unknown (HELO spamd3-us-west.apache.org) (209.188.14.142) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 20 Aug 2015 03:14:16 +0000 Received: from localhost (localhost [127.0.0.1]) by spamd3-us-west.apache.org (ASF Mail Server at spamd3-us-west.apache.org) with ESMTP id 47FB0182107 for ; Thu, 20 Aug 2015 03:14:16 +0000 (UTC) X-Virus-Scanned: Debian amavisd-new at spamd3-us-west.apache.org X-Spam-Flag: NO X-Spam-Score: 2.879 X-Spam-Level: ** X-Spam-Status: No, score=2.879 tagged_above=-999 required=6.31 tests=[DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, HTML_MESSAGE=3, RCVD_IN_MSPIKE_H3=-0.01, RCVD_IN_MSPIKE_WL=-0.01, SPF_PASS=-0.001] autolearn=disabled Authentication-Results: spamd3-us-west.apache.org (amavisd-new); dkim=pass (2048-bit key) header.d=gmail.com Received: from mx1-eu-west.apache.org ([10.40.0.8]) by localhost (spamd3-us-west.apache.org [10.40.0.10]) (amavisd-new, port 10024) with ESMTP id fktkc8uDeL8y for ; Thu, 20 Aug 2015 03:14:15 +0000 (UTC) Received: from mail-ig0-f178.google.com (mail-ig0-f178.google.com [209.85.213.178]) by mx1-eu-west.apache.org (ASF Mail Server at mx1-eu-west.apache.org) with ESMTPS id B6E1D21165 for ; Thu, 20 Aug 2015 03:14:14 +0000 (UTC) Received: by igfj19 with SMTP id j19so21372502igf.1 for ; Wed, 19 Aug 2015 20:14:13 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:date:message-id:subject:from:to:content-type; bh=wychi+g7f3BaPnct/zBtI5A5Tjo2eI+Kd2+nvxx7nk0=; b=BgrYim1J47eqMSUQ9VuqAv8DacroXcuNt9jLFvfAfCxoioD5ykXxfbHcdnyUJgCDEq pvDvpHBULX6MXAfunRFmGWPjNB9zTnf8pxr/Fij70E+4z9esTWBCd9izNoIGsxsXLApE llrqeFHoPzH4t3Q446+Hw4eWfQzQ3HBopJg+vmj+Y3FcmLKaj3HfifH4UrK6oqJQZE8M iSpZkVcyfNkY5SQRYDx19SypXHg2Gxd3aurvD2RMjWLAtvLoAE2igoHJLKz7HXPX9KoF yUCkpaW95JBHAg099GdXEZigyZALqWvRFknhHdCe8lHMv0wraJkumSJWrVn1F7bpaloe aj3g== MIME-Version: 1.0 X-Received: by 10.50.178.133 with SMTP id cy5mr33889192igc.5.1440040453798; Wed, 19 Aug 2015 20:14:13 -0700 (PDT) Received: by 10.79.12.23 with HTTP; Wed, 19 Aug 2015 20:14:13 -0700 (PDT) Date: Thu, 20 Aug 2015 11:14:13 +0800 Message-ID: Subject: Solr having problems with highlighting when using Jieba anaylzer From: Zheng Lin Edwin Yeo To: solr-user@lucene.apache.org Content-Type: multipart/alternative; boundary=089e01538cbeb5ff7e051db58b09 --089e01538cbeb5ff7e051db58b09 Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: quoted-printable Hi, I'm using Jieba analyser to index Chinese characters in the Solr. It works fine with the segmentation when using the Anaylsis on the Solr Admin UI. However, when I tried to do highlighting in Solr, it is not highlighting in the correct place. For example, when I search for =E8=87=AA=E7=84=B6=E7=8E= =AF=E5=A2=83=E4=B8=8E=E4=BC=81=E4=B8=9A=E6=9C=AC=E8=BA=AB, it highlight =E8=AE=A4=E4=B8=BA=E8=87=AA=E7=84=B6=E7=8E=AF=E5=A2=83=E4=B8=8E=E4=BC=81=E4=B8=9A=E6=9C=AC=E8=BA=AB=E7=9A=84 Even when I search English character responsibility, it highlight *responsibilit*y. I'm using jieba-analysis-1.0.0, Solr 5.2.1 and Lucene 5.1.0 Regards, Edwin --089e01538cbeb5ff7e051db58b09--