Return-Path: Delivered-To: apmail-lucene-general-archive@www.apache.org Received: (qmail 47439 invoked from network); 17 Jun 2009 23:29:25 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.3) by minotaur.apache.org with SMTP; 17 Jun 2009 23:29:25 -0000 Received: (qmail 98868 invoked by uid 500); 17 Jun 2009 23:29:36 -0000 Delivered-To: apmail-lucene-general-archive@lucene.apache.org Received: (qmail 98810 invoked by uid 500); 17 Jun 2009 23:29:35 -0000 Mailing-List: contact general-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: general@lucene.apache.org Delivered-To: mailing list general@lucene.apache.org Received: (qmail 98800 invoked by uid 99); 17 Jun 2009 23:29:35 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 17 Jun 2009 23:29:35 +0000 X-ASF-Spam-Status: No, hits=2.2 required=10.0 tests=HTML_MESSAGE,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of ted.dunning@gmail.com designates 209.85.217.215 as permitted sender) Received: from [209.85.217.215] (HELO mail-gx0-f215.google.com) (209.85.217.215) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 17 Jun 2009 23:29:27 +0000 Received: by gxk11 with SMTP id 11so1000347gxk.5 for ; Wed, 17 Jun 2009 16:29:07 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=domainkey-signature:mime-version:received:in-reply-to:references :from:date:message-id:subject:to:content-type; bh=Uu+ZOs7CIT4unu2v5cuDTCK9O53YRxz2Zb2ezDzZVnA=; b=UPASXIq2lBeCPAnlFR1cMCPZvLjLVlo+ZzfXLWul9l7EKzVXqz+OTcQ+hdckqwM36k cT7mkyd/exRKYRaTbsFaRgqqmhukmaUFhNBb2Y6ZD+/D2BhuGm9pLnYmJgVwjCVxVEVB w5XZzXVr9z+kNaiNNFfv7Nuxx8pW2MJcaJLmI= DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=gamma; h=mime-version:in-reply-to:references:from:date:message-id:subject:to :content-type; b=LOUfAV1qEpQfDdgVdJ0wldTsT0HDXN/hUIP01p3rCx3tlxD2fRaZaqUBMM/ERAgWSb CJKQ/aYKsNUNf97i35UcmKJqJ02EeCRYAbjLxdt4SXZClfhGLycJOk/xk7eZLOpEeKFj 4PrB3XKShHJO2kTIzaP7bOtr4sfHa5ioOXB3Q= MIME-Version: 1.0 Received: by 10.151.49.8 with SMTP id b8mr2200613ybk.342.1245281347056; Wed, 17 Jun 2009 16:29:07 -0700 (PDT) In-Reply-To: References: <24062253.post@talk.nabble.com> <24082683.post@talk.nabble.com> From: Ted Dunning Date: Wed, 17 Jun 2009 16:28:47 -0700 Message-ID: Subject: Re: Question for top term frequency To: general@lucene.apache.org Content-Type: multipart/alternative; boundary=0015174c0e8c56364d046c93a87d X-Virus-Checked: Checked by ClamAV on apache.org --0015174c0e8c56364d046c93a87d Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: 7bit It is indeed faceting. I misunderstood the original request as being against the entire corpus. For the very modest size result that he is talking about, SOLR faceting should work just fine. Zehua's loss of the word NOT in his latest message increased my confusion a bit. On Wed, Jun 17, 2009 at 3:42 PM, Grant Ingersoll wrote: > Isn't this just faceting on the author field and then making a query out of > the top ten authors? I think you could do this in Solr pretty easily. Or > maybe I don't understand the question. > > -Grant > > On Jun 17, 2009, at 5:45 PM, zehua wrote: > > >> One thing to add is that the top author is *[NOT]* based on all >> doucments. It is >> based on the returned results. >> For example, we have 10000 results match the query, the top authors are >> among the 10000 results. >> >> >> --0015174c0e8c56364d046c93a87d--