Return-Path: X-Original-To: apmail-lucene-solr-user-archive@minotaur.apache.org Delivered-To: apmail-lucene-solr-user-archive@minotaur.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 3001C10C90 for ; Wed, 9 Oct 2013 16:01:30 +0000 (UTC) Received: (qmail 5159 invoked by uid 500); 9 Oct 2013 16:01:25 -0000 Delivered-To: apmail-lucene-solr-user-archive@lucene.apache.org Received: (qmail 5108 invoked by uid 500); 9 Oct 2013 16:01:25 -0000 Mailing-List: contact solr-user-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: solr-user@lucene.apache.org Delivered-To: mailing list solr-user@lucene.apache.org Delivered-To: moderator for solr-user@lucene.apache.org Received: (qmail 99310 invoked by uid 99); 9 Oct 2013 13:45:37 -0000 X-ASF-Spam-Status: No, hits=2.6 required=5.0 tests=FORGED_YAHOO_RCVD,FREEMAIL_ENVFROM_END_DIGIT,SPF_PASS,URI_HEX X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: local policy includes SPF record at spf.trusted-forwarder.org) Date: Wed, 9 Oct 2013 06:45:10 -0700 (PDT) From: shahzad73 To: solr-user@lucene.apache.org Message-ID: <1381326310401-4094369.post@n3.nabble.com> In-Reply-To: References: <1381289392680-4094264.post@n3.nabble.com> Subject: Re: Find documents that are composed of % words MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Transfer-Encoding: 7bit X-Virus-Checked: Checked by ClamAV on apache.org my client has a strange requirement, he will give a list of 500 words and then set a percentage like 80% now he want to find those pages or documents which consist of the only those 80% of 500 and only 20% unknown. like we have this document word1 word2 word3 word4 and he give the list word1 word2 word3 and set the accuracy to 75% the above doc will meet the criteria because no 1 it matches all words and only 25% words are unknow from the list of searches. here is another way to say this " if 500 words are provided in search then All 500 words words must exist in the document and unknow words should be only 20% if accracy is 80%" -- View this message in context: http://lucene.472066.n3.nabble.com/Find-documents-that-are-composed-of-words-tp4094264p4094369.html Sent from the Solr - User mailing list archive at Nabble.com.