To: java-user@lucene.apache.org
Subject: lucene 2.4.1 : document in index but not returned in search
From: Rathinapriya Nagalingam
Date: Fri, 2 Oct 2009 21:21:42 +0530

Hi,

I created an index of around 45000 documents, and I search on the Title and Abstract fields (using Lucene 2.4.1). When I inspect the index in Luke (lukeall), some titles are present in the index, but I don't get them back when I search using the title as the keyword.

I have copied code snippets below. We recently upgraded from Lucene 2.0 to 2.4.1, and I am fairly new to Lucene. Please let me know what the possible issue could be.

    // The writer's default analyzer is StandardAnalyzer
    IndexWriter writer = new IndexWriter(directory, new StandardAnalyzer(), pCreate,
            IndexWriter.MaxFieldLength.UNLIMITED);
    ........................
    SimpleAnalyzer cjkAnalyzer = new SimpleAnalyzer();
    doc.add(new Field(LuceneDocument.TITLE_FIELD, pTitle,
            Field.Store.YES, Field.Index.ANALYZED_NO_NORMS));
    doc.add(new Field(LuceneDocument.ABSTRACT_FIELD, pDescription,
            Field.Store.YES, Field.Index.ANALYZED_NO_NORMS));
    ...............
    // Chinese and Korean documents are indexed with SimpleAnalyzer,
    // everything else with the writer's default analyzer
    if (localeStr.equals("zh_CN") || localeStr.equals("ko_KR")) {
        writer.addDocument(doc, cjkAnalyzer);
    } else {
        writer.addDocument(doc);
    }

While searching the index, I use the analyzer as below.

    // construct the proper analyzer based on locale
    if (pLocale.equals("zh_CN") || pLocale.equals("ko_KR")) { /* NOI18N */
        analyzer = new SimpleAnalyzer();
    } else {
        analyzer = new StandardAnalyzer();
    }

    // analyze the keywords
    ts = analyzer.tokenStream("abstract", //$NON-NLS-1$
            new StringReader(sb.toString()));
    tokens = new ArrayList();
    try {
        while (true) {
            Token token = ts.next();
            if (token == null) {
                break;
            }
            tokens.add(token.termText());
        }
    } catch (IOException ex) {
        Logger.logException(Logger.TYPE_ERR, this, "analyzeKeywords", ex); //$NON-NLS-1$
    }

    // Loop through the keywords
    for (int i = 0; i < tokens.size(); i++) {
        String keyword = (String) tokens.get(i);

        // Each keyword must be queried against the title and abstract.
        BooleanQuery bQuery = new BooleanQuery();
        TermQuery titleTerm = new TermQuery(new Term("title", keyword)); //$NON-NLS-1$
        TermQuery abstractTerm = new TermQuery(new Term("abstract", keyword)); //$NON-NLS-1$

        if (keywordStatusListSize > i
                && ((Boolean) keywordStatusList.get(i)).booleanValue()) {
            bQuery.add(titleTerm, BooleanClause.Occur.MUST);
            bQuery.add(abstractTerm, BooleanClause.Occur.MUST);
        } else {
            bQuery.add(titleTerm, BooleanClause.Occur.SHOULD);
            bQuery.add(abstractTerm, BooleanClause.Occur.SHOULD);
        }

        if (flag || ((Boolean) keywordStatusList.get(i)).booleanValue()) {
            keyQuery.add(bQuery, BooleanClause.Occur.MUST);
        } else {
            keyQuery.add(bQuery, BooleanClause.Occur.SHOULD);
        }
    }
    ................................
    CachingWrapperFilter cf;
    Searcher searcher;
    HitCollector collector;
    ................. some assignments...............
    searcher.search(keyQuery, cf, collector);

Thanks & Regards,
Priya
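
For readers following the query-building loop above, the matching rule it asks for can be sketched in plain Java, without any Lucene classes. This is only an illustration under stated assumptions: the class `QuerySketch`, its methods, and the sample terms are hypothetical, each document is reduced to a set of already-analyzed terms per field, and scoring and the filter are ignored. It relies on standard `BooleanQuery` behavior: every MUST clause has to match, and when a query has no MUST clause at least one SHOULD clause has to match.

```java
import java.util.Arrays;
import java.util.HashSet;
import java.util.Set;

// Hypothetical plain-Java model of the nested BooleanQuery built in the loop above.
public class QuerySketch {

    // Inner query for one keyword: two MUST clauses require the term in BOTH
    // fields; two SHOULD clauses (and no MUST) require it in at least one.
    static boolean matchesKeyword(Set<String> titleTerms, Set<String> abstractTerms,
                                  String keyword, boolean required) {
        boolean inTitle = titleTerms.contains(keyword);
        boolean inAbstract = abstractTerms.contains(keyword);
        return required ? (inTitle && inAbstract) : (inTitle || inAbstract);
    }

    // Outer query: an inner query is added as MUST when flag or its status is
    // true, otherwise as SHOULD.
    static boolean matches(Set<String> titleTerms, Set<String> abstractTerms,
                           String[] keywords, boolean[] required, boolean flag) {
        boolean anyMust = false;
        boolean anyShouldMatched = false;
        for (int i = 0; i < keywords.length; i++) {
            boolean hit = matchesKeyword(titleTerms, abstractTerms, keywords[i], required[i]);
            if (flag || required[i]) {
                anyMust = true;
                if (!hit) {
                    return false; // a failed MUST clause rejects the document
                }
            } else if (hit) {
                anyShouldMatched = true;
            }
        }
        // With no MUST clause at all, at least one SHOULD clause has to match.
        return anyMust || anyShouldMatched;
    }

    public static void main(String[] args) {
        // Sample document: the keyword occurs in the title only.
        Set<String> title = new HashSet<String>(Arrays.asList("lucene", "search"));
        Set<String> abstractTerms = new HashSet<String>(Arrays.asList("index"));
        String[] keywords = {"lucene"};

        // Keyword status true: MUST on title and abstract.
        System.out.println(matches(title, abstractTerms, keywords, new boolean[] {true}, false));  // false
        // Keyword status false: SHOULD on title and abstract.
        System.out.println(matches(title, abstractTerms, keywords, new boolean[] {false}, false)); // true
    }
}
```

In this model, a keyword whose status is true only matches documents that contain it in both the title and the abstract, while a keyword whose status is false matches on either field.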