Return-Path: X-Original-To: apmail-lucene-java-user-archive@www.apache.org Delivered-To: apmail-lucene-java-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 8D120D103 for ; Fri, 9 Nov 2012 08:42:18 +0000 (UTC) Received: (qmail 56288 invoked by uid 500); 9 Nov 2012 08:42:16 -0000 Delivered-To: apmail-lucene-java-user-archive@lucene.apache.org Received: (qmail 54882 invoked by uid 500); 9 Nov 2012 08:42:13 -0000 Mailing-List: contact java-user-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: java-user@lucene.apache.org Delivered-To: mailing list java-user@lucene.apache.org Received: (qmail 54810 invoked by uid 99); 9 Nov 2012 08:42:11 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 09 Nov 2012 08:42:11 +0000 X-ASF-Spam-Status: No, hits=1.7 required=5.0 tests=FREEMAIL_ENVFROM_END_DIGIT,HTML_MESSAGE,RCVD_IN_DNSWL_LOW,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: domain of jakedsouza88@gmail.com designates 209.85.210.48 as permitted sender) Received: from [209.85.210.48] (HELO mail-da0-f48.google.com) (209.85.210.48) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 09 Nov 2012 08:42:02 +0000 Received: by mail-da0-f48.google.com with SMTP id z8so1601197dad.35 for ; Fri, 09 Nov 2012 00:41:41 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:from:date:message-id:subject:to:content-type; bh=pPK2/I4sFQDwTiMHO6aS87M+oKuQTdo2W86saIrA8pI=; b=ZVFMd9tpleZD5JKXnwqbrgx/vL3g7jHpjQsvT6ZeDFVmQymQw4i9Mmp+UC1v4uPewL IE3IgK3z4PWMF8G4uQa3wCZPIDjw+by5CuG5aRlPACIx4RMlnRz35afo/DqJKlt/E42R h09Ilu88zx7fn4p7HEVYPEuqufb6Oyq1J1Nqu4kLSmW0Olv2a9kt76ABAVpnAnUrvDvf kp2cj3yTBVzyEQ8rXEU/Q0gPbqCbAFy9du+ZuVhzOgI1X1OeM312PcYpuiPEbtTh6BEU HrZEgRvHqC76/8K4YPcfpV1HIjZs2QETppv5cA4pa6iL7jT3/q1w88UqYwm2ZqJR1tgT TKLw== Received: by 10.68.200.227 with SMTP id jv3mr31521831pbc.162.1352450501183; Fri, 09 Nov 2012 00:41:41 -0800 (PST) MIME-Version: 1.0 Received: by 10.68.119.71 with HTTP; Fri, 9 Nov 2012 00:41:00 -0800 (PST) From: jake dsouza Date: Fri, 9 Nov 2012 03:41:00 -0500 Message-ID: Subject: Indexing and searching across versioned document collections To: java-user@lucene.apache.org, dev@lucene.apache.org, general@lucene.apache.org Content-Type: multipart/alternative; boundary=047d7b15af19b308af04ce0bebf3 X-Virus-Checked: Checked by ClamAV on apache.org --047d7b15af19b308af04ce0bebf3 Content-Type: text/plain; charset=ISO-8859-1 Hello, Has any one worked on making Lucene index and search versioned document collections i.e any corpus with multiple versions of documents similar to wikipedia or source code. I am working on a project to index and search versioned collections while keeping the index size minimum by taking into consideration differences in the versions to minimize the size of the index . Could some one direct me to any existing efforts to make Lucene work with versions . Thanks Jake --047d7b15af19b308af04ce0bebf3--