Return-Path: Delivered-To: apmail-lucene-java-user-archive@www.apache.org Received: (qmail 59186 invoked from network); 6 Oct 2009 20:35:09 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.3) by minotaur.apache.org with SMTP; 6 Oct 2009 20:35:09 -0000 Received: (qmail 78101 invoked by uid 500); 6 Oct 2009 20:35:07 -0000 Delivered-To: apmail-lucene-java-user-archive@lucene.apache.org Received: (qmail 78043 invoked by uid 500); 6 Oct 2009 20:35:07 -0000 Mailing-List: contact java-user-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: java-user@lucene.apache.org Delivered-To: mailing list java-user@lucene.apache.org Received: (qmail 78033 invoked by uid 99); 6 Oct 2009 20:35:07 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 06 Oct 2009 20:35:07 +0000 X-ASF-Spam-Status: No, hits=-0.0 required=10.0 tests=SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: domain of mmasters@gmail.com designates 74.125.78.26 as permitted sender) Received: from [74.125.78.26] (HELO ey-out-2122.google.com) (74.125.78.26) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 06 Oct 2009 20:34:56 +0000 Received: by ey-out-2122.google.com with SMTP id 22so963025eye.3 for ; Tue, 06 Oct 2009 13:33:36 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=domainkey-signature:mime-version:received:in-reply-to:references :date:message-id:subject:from:to:content-type :content-transfer-encoding; bh=ZX47PTku+GZsKg9Y3KOO6Q/IHpi8jKeMg4LQm+MOSBA=; b=oHtHSYIK8eBnrZk1IgVuGsPVS9WWZ4qmLMXrJD5nJ3jyFNv8Jn+V30HYHSzMWxJhRg N6ql19eRU9WMR/enigZK/2FmpV+mvEJVDCU0Q1odVspIaRv1Re2u2P/ewZlwKLdlDZSx Q+gIXfh+D0XG11TqmLGItt8iv7YQEtMN+yrP4= DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=gamma; h=mime-version:in-reply-to:references:date:message-id:subject:from:to :content-type:content-transfer-encoding; b=ilHK0gpR+ABsEMmsV5pkGKb2gxL/eh5bpGA9jMFEJ4/alrYoU9wZJaMbPS8QEcHsTJ dLZRwXtQ6q3Ur8Jf/Fetog5Z34tjSnIgp9t7N5z1b1Mp/1wXjHlsBUStxSUeDzSUTxkm W4ytXr8svxqYD7Y+i0tzsiF1DrbeU4G/yqr0s= MIME-Version: 1.0 Received: by 10.216.2.19 with SMTP id 19mr420893wee.68.1254861216417; Tue, 06 Oct 2009 13:33:36 -0700 (PDT) In-Reply-To: <84F2545D-14D5-4CDC-A27A-F951ABAE2E74@apache.org> References: <5b8626350910010956n472f4c7ci21afaeb4501882ac@mail.gmail.com> <84F2545D-14D5-4CDC-A27A-F951ABAE2E74@apache.org> Date: Tue, 6 Oct 2009 14:33:36 -0600 Message-ID: <5b8626350910061333k57d9a545w7b2bed281ac4ae6c@mail.gmail.com> Subject: Re: document diversity From: Michael Masters To: java-user@lucene.apache.org Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: quoted-printable X-Virus-Checked: Checked by ClamAV on apache.org My initial description may have been a little abstract. Maybe I should explain exactly what I'm trying to do. My company has various revenue channels, one of which is per click. If a user does a search, we would like to show results with the greatest revenue, although we don't want people to be able to buy all the top results. Hence, we would like to have some way of mixing results. The mixing of results could be based of potential revenue, relevancy, which revenue stream the result is associated with, etc. The previously mentioned ideas are great btw. -Mike On Sat, Oct 3, 2009 at 4:25 PM, Grant Ingersoll wrote= : > I'm curious, can you elaborate more on the deeper use case for this? > > Perhaps just implementing faceting on doc type would be sufficient? =A0Th= at > way users can drill in on doc type. =A0Alternatively, I suppose you could > implement a hit collector that accesses a field cache on the doc type fie= ld > and promotes lesser seen doc types until they are evenly represented. =A0= Could > also likely write a Function query that does a similar thing. =A0I'd imag= ine > you need to be careful to control your memory. > > -Grant > > On Oct 1, 2009, at 12:56 PM, Michael Masters wrote: > >> I was wondering if there is any way to control what kind of documents >> are returned from a search. For example, lets say we have an index >> built from different types of documents (pdf, txt, html, etc.). Is >> there a way to have the first x results have a specified distribution >> of document types? It would be nice to have an even number of results >> that are from pdfs, txt files, and html files. >> >> >> Any help would greatly be appreciated. >> >> >> -Mike >> >> --------------------------------------------------------------------- >> To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org >> For additional commands, e-mail: java-user-help@lucene.apache.org >> > > -------------------------- > Grant Ingersoll > http://www.lucidimagination.com/ > > Search the Lucene ecosystem (Lucene/Solr/Nutch/Mahout/Tika/Droids) using > Solr/Lucene: > http://www.lucidimagination.com/search > > > --------------------------------------------------------------------- > To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org > For additional commands, e-mail: java-user-help@lucene.apache.org > > --------------------------------------------------------------------- To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org For additional commands, e-mail: java-user-help@lucene.apache.org