Date: Sun, 31 Mar 2013 08:21:19 -0400
Subject: Indexing a long list
From: Paul Bell
To: java-user@lucene.apache.org

Hi All,

Suppose I need to index a property whose value is a long list of terms. For example,

someProperty = ["v1", "v2", .... , "v1000000"]

Please note that I could drop the leading "v" and index these as numbers instead of strings.

But the question is: what is the best practice in Lucene when dealing with a case like this?

I need to be able to retrieve the list, which makes me think that I need to store it. I suppose the list could be stored either in the index itself or in the "content" to which the index points.

So there are really two parts to this question:

1. Lucene "best practices" for a long list
2. Where to store such a list

Thanks for your help.

-Paul