From: Shoba Ramachandran
To: Lucene Users List
Subject: RE: Using lucene with HSSF from Apache
Date: Fri, 2 May 2003 08:30:15 -0700 (PDT)
Message-ID: <20030502153015.52329.qmail@web40105.mail.yahoo.com>

Hi Michel,

Are you able to index and search xls and doc files with just Lucene
using SimpleAnalyzer? There is no need for POI? With Lucene, you are
able to extract the xls content as text?

Let me try as you explained. Thanks very much for your reply.

Shoba

--- MMachado@LEVI.com wrote:
> Hi,
> I did it, but I use only lucene.
> You need to create an IndexWriter with SimpleAnalyzer, an InputStream
> as new FileInputStream, and create a Document with two Fields (one
> contains the file path and one contains the file's content).
> That's all.
> Michel
>
> -----Original Message-----
> From: Shoba Ramachandran [mailto:shoba_duruvan@yahoo.com]
> Sent: Wednesday, April 30, 2003 6:10 PM
> To: lucene-user@jakarta.apache.org
> Subject: Using lucene with HSSF from Apache
>
> Hi,
>
> Has anyone tried to index xls and doc files?
> I'm trying to do it with HSSF from Apache, using Lucene 1.2.
>
> This code returns binary data, and printing it out gives
> junk characters. A file indexed like this returns nothing
> upon search.
>
> public static byte[] parse(File file) throws Exception
> {
>     POIFSFileSystem fs = new POIFSFileSystem(new FileInputStream(file));
>     HSSFWorkbook wb = new HSSFWorkbook(fs);
>     byte[] xlsInfo = wb.getBytes();
>     System.out.println("xls content : " + xlsInfo.toString());
>     return xlsInfo;
> }
>
> Thanks in advance for your help
> Shoba

---------------------------------------------------------------------
To unsubscribe, e-mail: lucene-user-unsubscribe@jakarta.apache.org
For additional commands, e-mail: lucene-user-help@jakarta.apache.org
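
A note on the quoted parse method: one likely reason the printed output looks unusable is that calling toString() on a byte[] does not decode its contents. Java arrays inherit Object.toString(), so the result is a type tag and an identity hash. The sketch below uses only the standard library; the class BinaryVsText and its method names are hypothetical and are not part of Lucene or POI. It does not extract spreadsheet text, which would presumably still require walking the sheets and cells through HSSF and handing the resulting strings to Lucene.

```java
import java.nio.charset.StandardCharsets;

// Hypothetical demo class, not part of Lucene or POI.
class BinaryVsText {

    // Mirrors xlsInfo.toString() in the quoted code: arrays inherit
    // Object.toString(), so the result is "[B@" plus a hex identity hash.
    static String arrayToString(byte[] data) {
        return data.toString();
    }

    // Decodes bytes as text; only meaningful if the bytes are encoded text.
    static String decode(byte[] data) {
        return new String(data, StandardCharsets.US_ASCII);
    }

    public static void main(String[] args) {
        byte[] data = "Lucene".getBytes(StandardCharsets.US_ASCII);
        System.out.println(arrayToString(data)); // type tag and hash, not the contents
        System.out.println(decode(data));        // prints "Lucene"
    }
}
```

Decoding works here only because the bytes were produced from text in the first place. The bytes of a workbook are a binary file format, so decoding them this way would not be expected to yield clean, searchable text.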