From: "Tang, Rebecca"
To: "solr-user@lucene.apache.org"
Subject: find documents based on specific term frequency
Date: Wed, 26 Aug 2015 21:45:30 +0000

Hi there,

We have an index built on Solr 5.0.

We received a user question:

"Is there a way to search for documents that have a word appearing more than a certain number of times? For example, I want to find documents that only have more than 10 instances of the word "genetics" …"

I'm not sure if it's possible to do this with Solr. Does anyone know?

Rebecca Tang
Applications Developer, UCSF CKM
Industry Documents Digital Libraries
E: rebecca.tang@ucsf.edu