Date: Tue, 10 May 2016 10:23:41 +0000 (UTC)
To: "solr-user@lucene.apache.org"
Message-ID: <1272040023.1847724.1462875821413.JavaMail.yahoo@mail.yahoo.com>
Subject: how to find out how many times a word appears in a collection of documents?

Hi everyone,

I need to "read" the Solr/Lucene index and see how many times each word appears across all documents. For example, I have a collection of 1 million documents and I want to see a list like this:

the - 100000 times
bread - 1000 times
spoon - 10 times
fork - 5 times
etc.

How do I do that?

Kind regards,
Christian