Return-Path: Delivered-To: apmail-lucene-java-user-archive@www.apache.org Received: (qmail 68606 invoked from network); 6 Oct 2009 21:00:39 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.3) by minotaur.apache.org with SMTP; 6 Oct 2009 21:00:39 -0000 Received: (qmail 29810 invoked by uid 500); 6 Oct 2009 21:00:36 -0000 Delivered-To: apmail-lucene-java-user-archive@lucene.apache.org Received: (qmail 29737 invoked by uid 500); 6 Oct 2009 21:00:36 -0000 Mailing-List: contact java-user-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: java-user@lucene.apache.org Delivered-To: mailing list java-user@lucene.apache.org Received: (qmail 29727 invoked by uid 99); 6 Oct 2009 21:00:36 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 06 Oct 2009 21:00:36 +0000 X-ASF-Spam-Status: No, hits=-0.0 required=10.0 tests=SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: local policy) Received: from [134.96.191.147] (HELO smtp.dfki.de) (134.96.191.147) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 06 Oct 2009 21:00:24 +0000 Received: from smtp.dfki.de (localhost [127.0.0.1]) by imss.7 (Postfix) with ESMTP id B43DF317DC for ; Tue, 6 Oct 2009 23:00:02 +0200 (CEST) Received: from mail.dfki.de (lnv-104.sb.dfki.de [134.96.191.146]) by smtp.dfki.de (Postfix) with ESMTP id A012731746 for ; Tue, 6 Oct 2009 23:00:02 +0200 (CEST) Received: from [192.168.178.47] (BAH7ec6.bah.pppool.de [77.135.126.198]) (using TLSv1 with cipher AES128-SHA (128/128 bits)) (No client certificate requested) by mail.dfki.de (Postfix) with ESMTPSA id 701B2310E9 for ; Tue, 6 Oct 2009 23:00:02 +0200 (CEST) Message-Id: From: Paul Libbrecht To: java-user@lucene.apache.org In-Reply-To: <5b8626350910061333k57d9a545w7b2bed281ac4ae6c@mail.gmail.com> Content-Type: multipart/signed; boundary=Apple-Mail-39--469788887; micalg=sha1; protocol="application/pkcs7-signature" Mime-Version: 1.0 (Apple Message framework v936) Subject: Re: document diversity Date: Tue, 6 Oct 2009 22:59:56 +0200 References: <5b8626350910010956n472f4c7ci21afaeb4501882ac@mail.gmail.com> <84F2545D-14D5-4CDC-A27A-F951ABAE2E74@apache.org> <5b8626350910061333k57d9a545w7b2bed281ac4ae6c@mail.gmail.com> X-Mailer: Apple Mail (2.936) X-Virus-Checked: Checked by ClamAV on apache.org --Apple-Mail-39--469788887 Content-Type: text/plain; charset=ISO-8859-1; format=flowed; delsp=yes Content-Transfer-Encoding: quoted-printable Just as you can add a query that will boost better things with a =20 higher quality, you can add a query for a higher revenue. Basically, the default operator "should" in boolean-clauses can be =20 used exactly for that: do not force this query to be matched but raise =20= boost if there's something that matches. The translation of the user-query, itself, is marked as "must" (and =20 inside this one is orred in all the various flavours, e.g. per =20 language, phonetic...). paul Le 06-oct.-09 =E0 22:33, Michael Masters a =E9crit : > My initial description may have been a little abstract. Maybe I should > explain exactly what I'm trying to do. My company has various revenue > channels, one of which is per click. If a user does a search, we would > like to show results with the greatest revenue, although we don't want > people to be able to buy all the top results. Hence, we would like to > have some way of mixing results. The mixing of results could be based > of potential revenue, relevancy, which revenue stream the result is > associated with, etc. > > The previously mentioned ideas are great btw. > > -Mike > > > On Sat, Oct 3, 2009 at 4:25 PM, Grant Ingersoll =20 > wrote: >> I'm curious, can you elaborate more on the deeper use case for this? >> >> Perhaps just implementing faceting on doc type would be =20 >> sufficient? That >> way users can drill in on doc type. Alternatively, I suppose you =20 >> could >> implement a hit collector that accesses a field cache on the doc =20 >> type field >> and promotes lesser seen doc types until they are evenly =20 >> represented. Could >> also likely write a Function query that does a similar thing. I'd =20= >> imagine >> you need to be careful to control your memory. >> >> -Grant >> >> On Oct 1, 2009, at 12:56 PM, Michael Masters wrote: >> >>> I was wondering if there is any way to control what kind of =20 >>> documents >>> are returned from a search. For example, lets say we have an index >>> built from different types of documents (pdf, txt, html, etc.). Is >>> there a way to have the first x results have a specified =20 >>> distribution >>> of document types? It would be nice to have an even number of =20 >>> results >>> that are from pdfs, txt files, and html files. >>> >>> >>> Any help would greatly be appreciated. >>> >>> >>> -Mike >>> >>> = --------------------------------------------------------------------- >>> To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org >>> For additional commands, e-mail: java-user-help@lucene.apache.org >>> >> >> -------------------------- >> Grant Ingersoll >> http://www.lucidimagination.com/ >> >> Search the Lucene ecosystem (Lucene/Solr/Nutch/Mahout/Tika/Droids) =20= >> using >> Solr/Lucene: >> http://www.lucidimagination.com/search >> >> >> --------------------------------------------------------------------- >> To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org >> For additional commands, e-mail: java-user-help@lucene.apache.org >> >> > > --------------------------------------------------------------------- > To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org > For additional commands, e-mail: java-user-help@lucene.apache.org > --Apple-Mail-39--469788887 Content-Disposition: attachment; filename=smime.p7s Content-Type: application/pkcs7-signature; name=smime.p7s Content-Transfer-Encoding: base64 MIAGCSqGSIb3DQEHAqCAMIACAQExCzAJBgUrDgMCGgUAMIAGCSqGSIb3DQEHAQAAoIIDDzCCAwsw ggJ0oAMCAQICECfgYPDm3JFN4mPGdyWyyVgwDQYJKoZIhvcNAQEFBQAwYjELMAkGA1UEBhMCWkEx JTAjBgNVBAoTHFRoYXd0ZSBDb25zdWx0aW5nIChQdHkpIEx0ZC4xLDAqBgNVBAMTI1RoYXd0ZSBQ ZXJzb25hbCBGcmVlbWFpbCBJc3N1aW5nIENBMB4XDTA5MDUyNjA3MzIzNloXDTEwMDUyNjA3MzIz NlowcDEaMBgGA1UEBBMRTGliYnJlY2h0IEdvdXJkZXQxDTALBgNVBCoTBFBhdWwxHzAdBgNVBAMT FlBhdWwgTGliYnJlY2h0IEdvdXJkZXQxIjAgBgkqhkiG9w0BCQEWE3BhdWxAYWN0aXZlbWF0aC5v cmcwggEiMA0GCSqGSIb3DQEBAQUAA4IBDwAwggEKAoIBAQDeZiQTdNavFBrVY7WMNWscajCzBRZF cWhrkQyh2V0CCrwx7ElHEM0asrD/FbmK+eor0aeav4u/3RKj3qOsV/HV8VRiM2zHaaErrp+/boPw FapmxhPk0TXLceFJAgL5ZA1RsfpcIxGzE/AQ5ic+MqZXp2tZkH6RNeOwoUTURZb2bOZWX5fA1G+L G+IVs20vAN6i/Kdud3JUe9ElfHWfJTl2eO42Hea5MVLnk2IkmCqzxC3CcEchOgOzIrnEjaj/JLPB ZZkDv6sTyEouG4qQHr4trSZCe5anTM2HoPVtmAoaz+Acyt2+0cwIUIPGDjk1K0mvrnCbVIOUNlUe FVJu0svnAgMBAAGjMDAuMB4GA1UdEQQXMBWBE3BhdWxAYWN0aXZlbWF0aC5vcmcwDAYDVR0TAQH/ BAIwADANBgkqhkiG9w0BAQUFAAOBgQDC0P6qlhvM4E2XepLgrDWLj7yp9obwKr2tZBvM3lXGCIyx edzPOMRQABPH3eBbvhKyMb64AoCttJ0vt+QcBLLtc9TkBxPyHyaRHmi4NUMlJJyy1z6pLNc0Ipec U+qv5ck7Cl2SUs7ZQps9ZSW++s4LKKrcdsEs9HD4myt1YusWKTGCAxAwggMMAgEBMHYwYjELMAkG A1UEBhMCWkExJTAjBgNVBAoTHFRoYXd0ZSBDb25zdWx0aW5nIChQdHkpIEx0ZC4xLDAqBgNVBAMT I1RoYXd0ZSBQZXJzb25hbCBGcmVlbWFpbCBJc3N1aW5nIENBAhAn4GDw5tyRTeJjxnclsslYMAkG BSsOAwIaBQCgggFvMBgGCSqGSIb3DQEJAzELBgkqhkiG9w0BBwEwHAYJKoZIhvcNAQkFMQ8XDTA5 MTAwNjIwNTk1NlowIwYJKoZIhvcNAQkEMRYEFC63CAvAcRl8fVsQnjh6inSYKNtDMIGFBgkrBgEE AYI3EAQxeDB2MGIxCzAJBgNVBAYTAlpBMSUwIwYDVQQKExxUaGF3dGUgQ29uc3VsdGluZyAoUHR5 KSBMdGQuMSwwKgYDVQQDEyNUaGF3dGUgUGVyc29uYWwgRnJlZW1haWwgSXNzdWluZyBDQQIQJ+Bg 8ObckU3iY8Z3JbLJWDCBhwYLKoZIhvcNAQkQAgsxeKB2MGIxCzAJBgNVBAYTAlpBMSUwIwYDVQQK ExxUaGF3dGUgQ29uc3VsdGluZyAoUHR5KSBMdGQuMSwwKgYDVQQDEyNUaGF3dGUgUGVyc29uYWwg RnJlZW1haWwgSXNzdWluZyBDQQIQJ+Bg8ObckU3iY8Z3JbLJWDANBgkqhkiG9w0BAQEFAASCAQCV yHRry4nlrY0OgFZc95wtzdmXoNkin0E7FYwUCI7nWv6B8FFYOSKzw/qxEQzyTQ4y8X70lXG6vtLD ab7fJZHA3FbucbOYqmhmDQmUumpuZqs+cWRPIeIpgf6akrxKXvfG8jYpCg2WdvQAflBgQ6/9C6o0 kQ6Y7kkNFq4uXuVMK7Nv22WKYllo+pPLQcqKXXTEijdtU5kukg+eMP782Yl4Emmnbd5eYBWR/jWO MBYY72NhoE7ZbujeOURQ7BdGjJh88thmTdT0OHUqLtw7EknCuyKMLaBRqRudtVfZnqaPHan0v1ac hi+kSHURMFU+Fq+HrvoS9mA5BJMZHfckotMyAAAAAAAA --Apple-Mail-39--469788887--