From: KK
Date: Thu, 21 May 2009 17:51:13 +0530
Message-ID: <8db6d74a0905210521s35bb6675sc79745e10c387d93@mail.gmail.com>
Subject: hit highlighting in lucene ?
To: java-user@lucene.apache.org

Hi All,

I was looking into ways of implementing hit highlighting in Lucene and found the standard classes that support it:

lucene.apache.org/java/2_2_0/api/org/apache/lucene/search/highlight/package-summary.html

My impression is that this works only for English. Or does it support other languages as well? I want to support highlighting for some non-English languages, which I am already able to index and fetch using UTF-8 encoding.

If the standard highlighter does not handle them, then I would have to take the UTF-8 query, look for its terms in the result text, and add the appropriate tags wherever they match, which essentially boils down to reimplementing the standard highlighter. I think the standard highlighter does support other languages, though. Correct me if I'm wrong.

Due to the constraints of my requirements I'm using just SimpleAnalyzer, and we don't have tokenizers for these regional languages. Any other ideas for doing the same would be helpful as well.

Thanks,
KK.
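
The manual approach described in the message (take the query terms, look for them in the result text, wrap the matches in tags) could be sketched in plain Java roughly as below. This is only an illustration and uses no Lucene API: the class NaiveHighlighter and its methods are hypothetical names. It assumes that a token is a run of Unicode letters and combining marks, and that matching is a case-insensitive comparison of whole tokens. For highlighting to line up with search hits, those token boundaries would have to agree with whatever analyzer produced the index, which this sketch does not check.

```java
import java.util.HashSet;
import java.util.Locale;
import java.util.Set;

// Hypothetical helper for illustration only; not part of Lucene.
final class NaiveHighlighter {

    // Assumption: letters plus combining marks form a word, so that
    // scripts with vowel signs (e.g. Devanagari) are not cut mid-word.
    static boolean isWordChar(int cp) {
        int type = Character.getType(cp);
        return Character.isLetter(cp)
                || type == Character.NON_SPACING_MARK
                || type == Character.COMBINING_SPACING_MARK
                || type == Character.ENCLOSING_MARK;
    }

    // Split the query into lower-cased terms using the same word rule.
    static Set<String> queryTerms(String query) {
        Set<String> terms = new HashSet<>();
        int i = 0;
        while (i < query.length()) {
            int start = i;
            while (i < query.length() && isWordChar(query.codePointAt(i))) {
                i += Character.charCount(query.codePointAt(i));
            }
            if (i > start) {
                terms.add(query.substring(start, i).toLowerCase(Locale.ROOT));
            } else {
                // Skip one non-word code point.
                i += Character.charCount(query.codePointAt(i));
            }
        }
        return terms;
    }

    // Copy the text, wrapping every token that matches a query term
    // in the given pre/post tags. Non-word characters pass through.
    static String highlight(String text, String query, String pre, String post) {
        Set<String> terms = queryTerms(query);
        StringBuilder out = new StringBuilder();
        int i = 0;
        while (i < text.length()) {
            int start = i;
            while (i < text.length() && isWordChar(text.codePointAt(i))) {
                i += Character.charCount(text.codePointAt(i));
            }
            if (i > start) {
                String token = text.substring(start, i);
                if (terms.contains(token.toLowerCase(Locale.ROOT))) {
                    out.append(pre).append(token).append(post);
                } else {
                    out.append(token);
                }
            } else {
                int cp = text.codePointAt(i);
                out.appendCodePoint(cp);
                i += Character.charCount(cp);
            }
        }
        return out.toString();
    }
}
```

A call such as NaiveHighlighter.highlight(storedText, userQuery, "<b>", "</b>") returns the stored text with the matching words wrapped in bold tags. Because Java strings are Unicode internally, nothing here is specific to English; the only language-dependent part is the rule for where a word starts and ends.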