Return-Path: Delivered-To: apmail-lucene-java-user-archive@www.apache.org Received: (qmail 2251 invoked from network); 14 Jan 2011 16:07:08 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.3) by minotaur.apache.org with SMTP; 14 Jan 2011 16:07:08 -0000 Received: (qmail 51777 invoked by uid 500); 14 Jan 2011 16:07:06 -0000 Delivered-To: apmail-lucene-java-user-archive@lucene.apache.org Received: (qmail 51121 invoked by uid 500); 14 Jan 2011 16:07:02 -0000 Mailing-List: contact java-user-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: java-user@lucene.apache.org Delivered-To: mailing list java-user@lucene.apache.org Received: (qmail 51113 invoked by uid 99); 14 Jan 2011 16:07:01 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 14 Jan 2011 16:07:01 +0000 X-ASF-Spam-Status: No, hits=-0.0 required=10.0 tests=SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: domain of matthieu.huin@wallix.com designates 84.14.156.235 as permitted sender) Received: from [84.14.156.235] (HELO paris.office.wallix.com) (84.14.156.235) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 14 Jan 2011 16:06:54 +0000 Received: from zimbra.ifr.lan (zimbra.ifr.lan [10.10.1.211]) by paris.office.wallix.com (Postfix) with ESMTP id A53746A433E for ; Fri, 14 Jan 2011 17:06:34 +0100 (CET) Received: from localhost (localhost.localdomain [127.0.0.1]) by zimbra.ifr.lan (Postfix) with ESMTP id 5DFBD324245F for ; Fri, 14 Jan 2011 17:06:36 +0100 (CET) X-Virus-Scanned: amavisd-new at X-Spam-Score: -2.9 X-Spam-Level: Received: from zimbra.ifr.lan ([127.0.0.1]) by localhost (zimbra.ifr.lan [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id 4uqRTXshDcm6 for ; Fri, 14 Jan 2011 17:06:36 +0100 (CET) Received: from [10.10.4.7] (mhu.ifr.lan [10.10.4.7]) by zimbra.ifr.lan (Postfix) with ESMTP id 1F8803242441 for ; Fri, 14 Jan 2011 17:06:36 +0100 (CET) Message-ID: <4D30748D.8060703@wallix.com> Date: Fri, 14 Jan 2011 17:06:37 +0100 From: Matthieu Huin User-Agent: Mozilla/5.0 (X11; U; Linux x86_64; en-US; rv:1.9.2.13) Gecko/20101208 Thunderbird/3.1.7 MIME-Version: 1.0 To: java-user@lucene.apache.org Subject: Re: How to get the frequency of indexed words ? References: <4D306EEF.6080003@wallix.com> In-Reply-To: Content-Type: text/plain; charset=ISO-8859-1; format=flowed Content-Transfer-Encoding: 8bit X-Virus-Checked: Checked by ClamAV on apache.org X-Old-Spam-Flag: NO X-Old-Spam-Status: No, score=-2.9 tagged_above=-10 required=10 tests=[ALL_TRUSTED=-1, BAYES_00=-1.9] autolearn=ham Ian, Thanks for the quick answer. I see this is part of lucene's "contrib" modules, but I am using pyLucene. Is there a way to access this module through pyLucene ? Regards, Matthieu Le 14/01/2011 17:00, Ian Lea a �crit : > http://lucene.apache.org/java/3_0_3/api/contrib-misc/org/apache/lucene/misc/HighFreqTerms.html > > -- > Ian. > > On Fri, Jan 14, 2011 at 3:42 PM, Matthieu Huin wrote: >> Greetings, >> >> Is there an easy way to figure out the frequency of words in an index ? I'd >> like to get, say, the 1000 most often indexed words in order to create an >> auto-completion cache for my application. >> >> Thanks in advance, >> >> >> Matthieu Huin >> >> --------------------------------------------------------------------- >> To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org >> For additional commands, e-mail: java-user-help@lucene.apache.org >> >> > --------------------------------------------------------------------- > To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org > For additional commands, e-mail: java-user-help@lucene.apache.org > > --------------------------------------------------------------------- To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org For additional commands, e-mail: java-user-help@lucene.apache.org