Date: Wed, 2 Oct 2013 18:40:10 +0200
Subject: Re: Faceting with Lucene in CRIS
From: Reto Bachmann-Gmür
To: dev@clerezza.apache.org

Hi Stephane,

Really cool that the faceted search is going to be scalable!

I created an issue for marking the respective properties and will look
into it ASAP: https://issues.apache.org/jira/browse/CLEREZZA-828.

Cheers,
Reto

On Wed, Oct 2, 2013 at 6:31 PM, Stephane Gamard wrote:

> Hi Team,
>
> As I dive deeper and deeper into my learning and updates on rdf.cris,
> I've come to realise that the faceting implementation is not optimal for
> Lucene. We're experiencing extremely slow faceting due to the post-search
> facet collector, which iterates through the document list until hits.length.
>
> It would be fairly trivial now to implement facets as per the Lucene
> specifications. The most "drastic" change, which I am as yet too ignorant
> to try myself, is to have the ability to know at indexing time when a
> VirtualProperty should be considered for faceting.
>
> This would require a small update of the DefinitionGraph, maybe by adding
> a property "facetable" for VirtualProperties?
>
> _Stephane
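
For readers of the archive, the difference between the two faceting approaches discussed in this thread can be sketched roughly as follows. This is a toy in-memory model in plain Java, not Lucene's or rdf.cris's actual API: the class FacetSketch, its method names, and the way the "facetable" flag is passed in are all assumptions made for illustration. It only shows why knowing the facetable properties at indexing time helps: values can be recorded as compact per-document ordinals up front, so counting no longer has to read each stored document after the search.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;
import java.util.Set;
import java.util.TreeMap;

// Hypothetical toy index; none of these names come from Lucene or Clerezza.
public class FacetSketch {
    // Properties flagged "facetable" (the flag proposed in the thread).
    private final Set<String> facetable;
    // Stored documents, addressed by doc id (position in the list).
    private final List<Map<String, String>> stored = new ArrayList<>();
    // Per facetable property: value -> ordinal, and ordinal -> value.
    private final Map<String, Map<String, Integer>> ordinals = new HashMap<>();
    private final Map<String, List<String>> labels = new HashMap<>();
    // Per facetable property: doc id -> ordinal of its value (-1 if absent).
    private final Map<String, List<Integer>> docOrdinals = new HashMap<>();

    public FacetSketch(Set<String> facetable) {
        this.facetable = facetable;
        for (String p : facetable) {
            ordinals.put(p, new HashMap<>());
            labels.put(p, new ArrayList<>());
            docOrdinals.put(p, new ArrayList<>());
        }
    }

    // Indexing: facet ordinals are recorded here, which is only possible
    // because the facetable properties are known at indexing time.
    public int index(Map<String, String> doc) {
        int docId = stored.size();
        stored.add(doc);
        for (String p : facetable) {
            String value = doc.get(p);
            int ord = -1;
            if (value != null) {
                Map<String, Integer> ords = ordinals.get(p);
                Integer existing = ords.get(value);
                if (existing == null) {
                    existing = ords.size();
                    ords.put(value, existing);
                    labels.get(p).add(value);
                }
                ord = existing;
            }
            docOrdinals.get(p).add(ord);
        }
        return docId;
    }

    // Post-search approach: read every hit's stored document and tally values.
    public Map<String, Integer> countPostSearch(String property, int[] hits) {
        Map<String, Integer> counts = new TreeMap<>();
        for (int docId : hits) {
            String value = stored.get(docId).get(property);
            if (value != null) {
                counts.merge(value, 1, Integer::sum);
            }
        }
        return counts;
    }

    // Index-time approach: tally precomputed ordinals, never touching stored documents.
    public Map<String, Integer> countFromOrdinals(String property, int[] hits) {
        List<Integer> perDoc = docOrdinals.get(property);
        if (perDoc == null) {
            throw new IllegalArgumentException("not facetable: " + property);
        }
        int[] tally = new int[labels.get(property).size()];
        for (int docId : hits) {
            int ord = perDoc.get(docId);
            if (ord >= 0) {
                tally[ord]++;
            }
        }
        Map<String, Integer> counts = new TreeMap<>();
        for (int ord = 0; ord < tally.length; ord++) {
            if (tally[ord] > 0) {
                counts.put(labels.get(property).get(ord), tally[ord]);
            }
        }
        return counts;
    }
}
```

Both methods return the same counts for a facetable property. In this toy model the stored documents are cheap in-memory maps, so the sketch shows the shape of the change rather than the speed-up; in a real index, loading each stored document is the expensive step.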