Date: Mon, 19 Mar 2007 12:24:17 +0200
From: "Alexandru Popescu"
To: users@jackrabbit.apache.org
Subject: Re: Impossible to manipulate lucene directly?

On 3/19/07, Marcel Reutegger wrote:
> alartin wrote:
> > What can I do if I need search-engine-like search functions rather
> > than XPath or SQL? For example, I store some information in nodes, say one
> > webpage's content per node, and I want to find "similar pages" according to
> > their contents. This is a simple task when I use Lucene directly. But what
> > can I do if I use Jackrabbit as the underlying layer?
>
> well, not much, because the API you are using is JCR and not Lucene. but see
> also below.
>
> > And what can I do if I need a short description of the hit with
> > the query terms highlighted?
>
> you have to create that manually.
>
> > It seems Lucene is only used as a search library to implement the JCR
> > specification, doesn't it?
>
> that's correct.
>
> > Is it possible to manipulate Lucene directly?
>
> no.
>
> But here's what you can do: create Jira issues and describe exactly what you
> would like to see as an enhancement.
> also try to think about how to integrate
> this into Jackrabbit or the query languages without affecting JCR compliance.
> e.g. I could imagine a pseudo property or function in XPath: jr:highlight()
> which returns an XML snippet with text extracts and highlighted search terms.
>
> regards
> marcel
>

Thanks for the answer, Marcel. At first glance, it looks like what you are
saying is that this cannot be done through the JCR API, which is something I
was already aware of. What I am really wondering is whether there is any way
I can get my hands on the Lucene classes so that I can use them to query the
index (so not the JCR API, but rather a "hack").

./alex
--
.w( the_mindstorm )p.