Return-Path: Delivered-To: apmail-lucene-java-user-archive@www.apache.org Received: (qmail 70279 invoked from network); 26 Dec 2007 19:37:31 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.2) by minotaur.apache.org with SMTP; 26 Dec 2007 19:37:31 -0000 Received: (qmail 26019 invoked by uid 500); 26 Dec 2007 19:37:15 -0000 Delivered-To: apmail-lucene-java-user-archive@lucene.apache.org Received: (qmail 25978 invoked by uid 500); 26 Dec 2007 19:37:15 -0000 Mailing-List: contact java-user-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: java-user@lucene.apache.org Delivered-To: mailing list java-user@lucene.apache.org Received: (qmail 25967 invoked by uid 99); 26 Dec 2007 19:37:15 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 26 Dec 2007 11:37:15 -0800 X-ASF-Spam-Status: No, hits=2.0 required=10.0 tests=RCVD_IN_BL_SPAMCOP_NET,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of liaqatalimian@gmail.com designates 64.233.166.180 as permitted sender) Received: from [64.233.166.180] (HELO py-out-1112.google.com) (64.233.166.180) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 26 Dec 2007 19:36:52 +0000 Received: by py-out-1112.google.com with SMTP id d32so8606610pye.12 for ; Wed, 26 Dec 2007 11:36:56 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=domainkey-signature:received:received:message-id:date:from:user-agent:mime-version:to:subject:content-type:content-transfer-encoding; bh=pw+VanNR57hovfyt9QPPmcXmQtr1yY67nXORuLrLZ80=; b=C195MTOWlEtcsGNIxPDOygpBDhjGx4EX0TdfZOKzUH5qyxVUO3Fo2hEv+Af5bueCEQDHhtal7hntSTbgiXfLQYV081rEfXZ+g4tefJPF/k3aR9FWaJ/6FLy4W0F4OY8S5lol0I50M8VWXz+3KEYx4mYrRI1RdWmhVr90+J1jJ/8= DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=gamma; h=message-id:date:from:user-agent:mime-version:to:subject:content-type:content-transfer-encoding; b=fvqyrRoqtBaUcAHqzCyyQBq2wQIW8SdvpYPzMjWK2oXw0qmCDlmEcETjsBhPlDNvgHD69e7SeiDYPBewQdGpMfZjrFz9RStZEm+VPBMPPUpFN9U36c2J0BzZh3r4BZe09O9l6vP3NUaUnacsjD1txJH5C3U18OBrTdaW5DH9PJQ= Received: by 10.35.131.13 with SMTP id i13mr8581391pyn.50.1198697815905; Wed, 26 Dec 2007 11:36:55 -0800 (PST) Received: from ?10.0.111.21? ( [202.125.143.65]) by mx.google.com with ESMTPS id w43sm12090205pyg.7.2007.12.26.11.36.52 (version=TLSv1/SSLv3 cipher=RC4-MD5); Wed, 26 Dec 2007 11:36:55 -0800 (PST) Message-ID: <4772AD46.9050709@gmail.com> Date: Thu, 27 Dec 2007 00:36:38 +0500 From: Liaqat Ali User-Agent: Thunderbird 2.0.0.9 (Windows/20071031) MIME-Version: 1.0 To: java-user@lucene.apache.org Subject: StopWords problem Content-Type: text/plain; charset=UTF-8; format=flowed Content-Transfer-Encoding: 8bit X-Virus-Checked: Checked by ClamAV on apache.org Hi, Doro Cohen Thanks for your reply, but I am facing a small problem over here. As I am using notepad for coding, then in which format the file should be saved. public static final String[] URDU_STOP_WORDS = { "کے" ,"کی" ,"سے" ,"کا" ,"کو" ,"ہے" }; Analyzer analyzer = new StandardAnalyzer(URDU_STOP_WORDS); If I save it in ANSI format it will lose the contents, I tried Unicode but it does not work and I also tried UTF-8, but it also generate two errors of identifying two illegal characters. What should be the solution. Kindly guide in this. Thanks .. --------------------------------------------------------------------- To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org For additional commands, e-mail: java-user-help@lucene.apache.org