Return-Path: X-Original-To: apmail-lucene-solr-user-archive@minotaur.apache.org Delivered-To: apmail-lucene-solr-user-archive@minotaur.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 9C0EB17C27 for ; Fri, 13 Mar 2015 15:07:57 +0000 (UTC) Received: (qmail 18330 invoked by uid 500); 13 Mar 2015 15:07:53 -0000 Delivered-To: apmail-lucene-solr-user-archive@lucene.apache.org Received: (qmail 18243 invoked by uid 500); 13 Mar 2015 15:07:53 -0000 Mailing-List: contact solr-user-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: solr-user@lucene.apache.org Delivered-To: mailing list solr-user@lucene.apache.org Received: (qmail 18231 invoked by uid 99); 13 Mar 2015 15:07:53 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 13 Mar 2015 15:07:53 +0000 X-ASF-Spam-Status: No, hits=-0.7 required=5.0 tests=RCVD_IN_DNSWL_LOW,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of arafalov@gmail.com designates 209.85.218.54 as permitted sender) Received: from [209.85.218.54] (HELO mail-oi0-f54.google.com) (209.85.218.54) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 13 Mar 2015 15:07:48 +0000 Received: by oifu20 with SMTP id u20so20026005oif.12 for ; Fri, 13 Mar 2015 08:06:43 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:in-reply-to:references:from:date:message-id:subject:to :content-type:content-transfer-encoding; bh=GLtCIs+gO2LYyct+40zNqBrmqcxiId++jTbX5zMhsyQ=; b=GHy1dvc6HL2t380IL4cbpxKjf26jGLWJGOlAWt8fRHwRfHoA87SrX1j1GU3rIbU9st U6AhMQ4aTs3VV60X0rPHv27DTAYXREfAZ17g6hWFFWGn6jMgkP8rBS12XAUHY1QHOdUm WPegcOdSr6EyyPF1bmPSjv3mx9vc8uecVZKPEkn2ItgHLSdcNZghkxuGJ80Sy4O33bzq u+74FwpKwjX7jOVPDReI3WBV4OZff6sB3mY0kEdavJtTaAUuPOvCajCF1preR8SnBZTV FX949YqUSO0JwVLi+zENTyUSXsDNGKkUf9bDWK6i307Zl2A9qwBpagf7fG/o3DMoi+1a 2b2A== X-Received: by 10.202.219.215 with SMTP id s206mr36068670oig.114.1426259202845; Fri, 13 Mar 2015 08:06:42 -0700 (PDT) MIME-Version: 1.0 Received: by 10.76.35.10 with HTTP; Fri, 13 Mar 2015 08:06:02 -0700 (PDT) In-Reply-To: <1914812128.778592846.1426258454518.JavaMail.root@zimbra65-e11.priv.proxad.net> References: <1914812128.778592846.1426258454518.JavaMail.root@zimbra65-e11.priv.proxad.net> From: Alexandre Rafalovitch Date: Fri, 13 Mar 2015 11:06:02 -0400 Message-ID: Subject: Re: Word frequency To: solr-user Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: quoted-printable X-Virus-Checked: Checked by ClamAV on apache.org The usual recommendation is to use Solr as a database, internally with a separate user-facing app in a different container. Solr is not really easy to secure, so best is to use O/S level protection, e.g. listening on localhost only or only on a secure IP address. This separate client also gives you more flexibility with scaling/etc later= . You could look at something like Spring Data Solr if your search needs are simple and you want quick UI building support. Regards, Alex. ---- Solr Analyzers, Tokenizers, Filters, URPs and even a newsletter: http://www.solr-start.com/ On 13 March 2015 at 10:54, wrote: > > If you are asking whether users have access to /browse, then the answer i= s yes. > > Currently, they can type keywords in the q input field to do searches. > > I plan to turn q into a hidden field and add a 'keywords' input field who= se contents will be transferred to q when users press Search, using Javascr= ipt. > > I will also add date selects so that users don't have to type date querie= s. > > How do you secure the rest of SOLR (e.g., admin)? > > Would would recommend creating an alternative Search GUI with, say, Wicke= t, which queries SOLR using AJAX? > > Sounds hard, but I will try. Velocity is so much simpler. > > Cheers, > > Philippe > > > > > > > > ----- Mail original ----- > De: "Alexandre Rafalovitch" > =C3=80: "solr-user" > Envoy=C3=A9: Vendredi 13 Mars 2015 15:41:45 > Objet: Re: Word frequency > > On 13 March 2015 at 10:25, wrote: >> I would like to: >> >> - loop throught the documents in my core >> - extract the most-frequently-appearing words in each document's text fi= eld >> - generate a .vm which displays those words ranked number of occurrence= s, or, ideally, automatically generate that .vm whenever users use SOLR. > > That's what faceting does. You you can fine tune it further by telling > how many of top hits you want to get back. Have a look at those > parameters and play with them first in Web Admin UI before trying to > apply them to the browse handler. > > Regards, > Alex. > P.s. You are not planning to expose /browse handler directly to users, > do you? Because unless you REALLY know how to secure the rest of Solr, > you are asking for big troubles.