Return-Path: Delivered-To: apmail-jakarta-lucene-user-archive@www.apache.org Received: (qmail 74873 invoked from network); 5 Mar 2004 14:40:28 -0000 Received: from daedalus.apache.org (HELO mail.apache.org) (208.185.179.12) by minotaur-2.apache.org with SMTP; 5 Mar 2004 14:40:28 -0000 Received: (qmail 42726 invoked by uid 500); 5 Mar 2004 14:40:18 -0000 Delivered-To: apmail-jakarta-lucene-user-archive@jakarta.apache.org Received: (qmail 42704 invoked by uid 500); 5 Mar 2004 14:40:18 -0000 Mailing-List: contact lucene-user-help@jakarta.apache.org; run by ezmlm Precedence: bulk List-Unsubscribe: List-Subscribe: List-Help: List-Post: List-Id: "Lucene Users List" Reply-To: "Lucene Users List" Delivered-To: mailing list lucene-user@jakarta.apache.org Received: (qmail 42691 invoked from network); 5 Mar 2004 14:40:17 -0000 Received: from unknown (HELO web12706.mail.yahoo.com) (216.136.173.243) by daedalus.apache.org with SMTP; 5 Mar 2004 14:40:17 -0000 Message-ID: <20040305144019.79276.qmail@web12706.mail.yahoo.com> Received: from [194.152.209.202] by web12706.mail.yahoo.com via HTTP; Fri, 05 Mar 2004 06:40:19 PST Date: Fri, 5 Mar 2004 06:40:19 -0800 (PST) From: Otis Gospodnetic Subject: Re: using lucene to search in a 1 huge file. (aka "grep -n") To: Lucene Users List In-Reply-To: <4047FAF4.8040807@bea.com> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii X-Spam-Rating: daedalus.apache.org 1.6.2 0/1000/N X-Spam-Rating: minotaur-2.apache.org 1.6.2 0/1000/N In order for this to make sense, you would have to split your huge file into either lines or blocks, whichever you want to be your indexing and search/hit unit, and convert those to Lucene Documents, which you would then index. Searching would then return the line/block where matches are found. Otis --- prasen wrote: > Hi guys, > I am relatively new to Lucene. Can lucene be used to speed-up > search > for a string in one huge file( ~ TerraBytes ) based on its libe > numbers. Something like "grep -n " where the > indexing will be done only on one file and based on either > line-numbers/blocks. > > prasen > > > > > --------------------------------------------------------------------- > To unsubscribe, e-mail: lucene-user-unsubscribe@jakarta.apache.org > For additional commands, e-mail: lucene-user-help@jakarta.apache.org > --------------------------------------------------------------------- To unsubscribe, e-mail: lucene-user-unsubscribe@jakarta.apache.org For additional commands, e-mail: lucene-user-help@jakarta.apache.org