Return-Path: X-Original-To: apmail-lucene-solr-user-archive@minotaur.apache.org Delivered-To: apmail-lucene-solr-user-archive@minotaur.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 8824892DC for ; Tue, 3 Jul 2012 12:41:05 +0000 (UTC) Received: (qmail 57247 invoked by uid 500); 3 Jul 2012 12:41:01 -0000 Delivered-To: apmail-lucene-solr-user-archive@lucene.apache.org Received: (qmail 57061 invoked by uid 500); 3 Jul 2012 12:41:01 -0000 Mailing-List: contact solr-user-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: solr-user@lucene.apache.org Delivered-To: mailing list solr-user@lucene.apache.org Received: (qmail 57043 invoked by uid 99); 3 Jul 2012 12:41:01 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 03 Jul 2012 12:41:01 +0000 X-ASF-Spam-Status: No, hits=-0.0 required=5.0 tests=FSL_RCVD_USER,RCVD_IN_DNSWL_LOW,SPF_NEUTRAL X-Spam-Check-By: apache.org Received-SPF: neutral (athena.apache.org: local policy) Received: from [209.85.212.176] (HELO mail-wi0-f176.google.com) (209.85.212.176) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 03 Jul 2012 12:40:56 +0000 Received: by wibhn17 with SMTP id hn17so3109281wib.5 for ; Tue, 03 Jul 2012 05:40:35 -0700 (PDT) X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=20120113; h=mime-version:in-reply-to:references:from:date:message-id:subject:to :content-type:x-gm-message-state; bh=uKi9Q6JQG9+ceabF4uST03XS2WRTnk5prW8G/+2Lllw=; b=b+fbRtgzdLH8R+f+HlXGPCgDdNlBcpF0N2Ida7WUau1tlWrMBoq/zQNt8FkTPey+Te 2kmoh1BgmnaMY4EBBCRNPxGrYMtIfytnkS4S+i9mjwIsOIDhExXXnf5MYUMXgbOeHciJ 6sEK3d9elbCFAADiOH6gxItaA2GOXZDFmdNuTDdpQK8zDwbiYfGaM8zJkh0Qcx7sRdEQ 5vcydCqHbFIcLnzp5+7W8YiRFuRTmj4H1e+w7NQ1DxtEmqseKmP63fMlEHhXKuwOE8XI ivtChXwMNrPDV/IvddE8o/hd7ulM6IBvMKURW9NmXeJahxrMr1qQjD5xSdClM4t0s104 SYNw== Received: by 10.216.145.97 with SMTP id o75mr3887033wej.7.1341319234958; Tue, 03 Jul 2012 05:40:34 -0700 (PDT) MIME-Version: 1.0 Received: by 10.216.221.93 with HTTP; Tue, 3 Jul 2012 05:40:14 -0700 (PDT) In-Reply-To: <4FF2A732.30109@codemium.com> References: <4FF2A732.30109@codemium.com> From: Michael McCandless Date: Tue, 3 Jul 2012 08:40:14 -0400 Message-ID: Subject: Re: Near Real Time Indexing and Searching with solr 3.6 To: solr-user@lucene.apache.org Content-Type: text/plain; charset=ISO-8859-1 X-Gm-Message-State: ALoCoQn5eJVDDKbkn/WRxKpRhVPEL7Jh6SBKNkIT7VuNF5o8ZkBDAsVMnrA9wGWA/46Ga9hXw4Bh X-Virus-Checked: Checked by ClamAV on apache.org Hi, You might want to take a look at Solr's trunk (very soon to be 4.0.0 alpha release), which already has a near-real-time solution (using Lucene's near-real-time APIs). Lucene has NRTCachingDirectory (to use RAM for small / recently flushed segments), but I don't think Solr uses it yet. Mike McCandless http://blog.mikemccandless.com On Tue, Jul 3, 2012 at 4:02 AM, thomas wrote: > Hi, > > As part of my bachelor thesis I'm trying to archive NRT with Solr 3.6. I've > came up with a basic concept and would be trilled if I could get some > feedback. > > The main idea is to use two different Indexes. One persistent on disc and > one in RAM. The plan is to route every added and modified document to the > RAMIndex (http://imgur.com/kLfUN). After a certain period of time, this > index would get cleared and the documents get added to the persistent Index. > > Some major problems I still have with this idea is: > - deletions of documents from documents in the persistent index > - having the same unique IDs in both the RAM index and persitent Index, as a > result of an updated document > - Merging search results to filter out old versions of updated documents > > Would such an idea be viable to persuit? > > Thanks for you time >