Return-Path: X-Original-To: apmail-lucene-solr-user-archive@minotaur.apache.org Delivered-To: apmail-lucene-solr-user-archive@minotaur.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 9DED69263 for ; Tue, 3 Jul 2012 08:03:25 +0000 (UTC) Received: (qmail 60654 invoked by uid 500); 3 Jul 2012 08:03:22 -0000 Delivered-To: apmail-lucene-solr-user-archive@lucene.apache.org Received: (qmail 60425 invoked by uid 500); 3 Jul 2012 08:03:21 -0000 Mailing-List: contact solr-user-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: solr-user@lucene.apache.org Delivered-To: mailing list solr-user@lucene.apache.org Received: (qmail 60395 invoked by uid 99); 3 Jul 2012 08:03:20 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 03 Jul 2012 08:03:20 +0000 X-ASF-Spam-Status: No, hits=2.2 required=5.0 tests=FSL_RCVD_USER,HTML_MESSAGE,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: local policy) Received: from [50.30.46.226] (HELO codemium.com) (50.30.46.226) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 03 Jul 2012 08:03:12 +0000 Received: from [192.168.181.24] (p57AFF493.dip.t-dialin.net [87.175.244.147]) (Authenticated sender: thomas@codemium.com) by codemium.com (Postfix) with ESMTPA id 6830CC849AC3 for ; Tue, 3 Jul 2012 10:02:50 +0200 (CEST) Message-ID: <4FF2A732.30109@codemium.com> Date: Tue, 03 Jul 2012 10:02:58 +0200 From: thomas User-Agent: Mozilla/5.0 (Windows NT 6.1; rv:10.0.2) Gecko/20120216 Thunderbird/10.0.2 MIME-Version: 1.0 To: solr-user@lucene.apache.org Subject: Near Real Time Indexing and Searching with solr 3.6 Content-Type: multipart/alternative; boundary="------------060309040408020404000306" --------------060309040408020404000306 Content-Type: text/plain; charset=ISO-8859-15; format=flowed Content-Transfer-Encoding: 7bit Hi, As part of my bachelor thesis I'm trying to archive NRT with Solr 3.6. I've came up with a basic concept and would be trilled if I could get some feedback. The main idea is to use two different Indexes. One persistent on disc and one in RAM. The plan is to route every added and modified document to the RAMIndex (http://imgur.com/kLfUN). After a certain period of time, this index would get cleared and the documents get added to the persistent Index. Some major problems I still have with this idea is: - deletions of documents from documents in the persistent index - having the same unique IDs in both the RAM index and persitent Index, as a result of an updated document - Merging search results to filter out old versions of updated documents Would such an idea be viable to persuit? Thanks for you time --------------060309040408020404000306--