Return-Path: Delivered-To: apmail-lucene-java-user-archive@www.apache.org Received: (qmail 91276 invoked from network); 10 Jul 2009 14:35:34 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.3) by minotaur.apache.org with SMTP; 10 Jul 2009 14:35:34 -0000 Received: (qmail 54316 invoked by uid 500); 10 Jul 2009 14:35:42 -0000 Delivered-To: apmail-lucene-java-user-archive@lucene.apache.org Received: (qmail 54249 invoked by uid 500); 10 Jul 2009 14:35:41 -0000 Mailing-List: contact java-user-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: java-user@lucene.apache.org Delivered-To: mailing list java-user@lucene.apache.org Received: (qmail 54239 invoked by uid 99); 10 Jul 2009 14:35:41 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 10 Jul 2009 14:35:41 +0000 X-ASF-Spam-Status: No, hits=3.4 required=10.0 tests=HTML_MESSAGE,SPF_NEUTRAL X-Spam-Check-By: apache.org Received-SPF: neutral (nike.apache.org: local policy) Received: from [72.167.82.86] (HELO p3plsmtpa01-06.prod.phx3.secureserver.net) (72.167.82.86) by apache.org (qpsmtpd/0.29) with SMTP; Fri, 10 Jul 2009 14:35:31 +0000 Received: (qmail 27136 invoked from network); 10 Jul 2009 14:35:09 -0000 Received: from unknown (99.141.135.247) by p3plsmtpa01-06.prod.phx3.secureserver.net (72.167.82.86) with ESMTP; 10 Jul 2009 14:35:09 -0000 From: "Andy Faibishenko" To: Subject: Is my app a good fit for Lucene? Date: Fri, 10 Jul 2009 09:35:15 -0500 Message-ID: <001501ca016b$a432c6c0$0500a8c0@D830> MIME-Version: 1.0 Content-Type: multipart/alternative; boundary="----=_NextPart_000_0016_01CA0141.BB5CBEC0" X-Mailer: Microsoft Office Outlook 11 Thread-Index: AcoBa6Ng7Fo9TYFrQ4q4l2GZdhGFLg== X-MimeOLE: Produced By Microsoft MimeOLE V6.00.2900.3350 X-Virus-Checked: Checked by ClamAV on apache.org ------=_NextPart_000_0016_01CA0141.BB5CBEC0 Content-Type: text/plain; charset="us-ascii" Content-Transfer-Encoding: 7bit I have a GUI application which needs to open large files (hundreds of MB) and be able to search through them quickly for user specified strings. These files are frequently updated while the user is viewing them and the updates are captured by the application. Also, the files contain records which are KEY=VALUE pairs separated by a non-printable ASCII character instead of normal English text. I installed Lucene in Eclipse and tried to play around with some sample code. One thing I noticed is that the wildcard searching doesn't seem to work right on this data. I am guessing it is because the text format is tripping up the tokenizing. I am trying to figure out whether using Lucene to implement this is a good thing or whether I should just try to implement my own search logic. Andy Faibishenko ------=_NextPart_000_0016_01CA0141.BB5CBEC0--