lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Ernesto De Santis <ernesto.desan...@colaborativa.net>
Subject Re: Zip Files
Date Tue, 01 Mar 2005 15:48:50 GMT
Hello

first, you need a parser for each file type: pdf, txt, word, etc.
and use a java api to iterate zip content, see:

http://java.sun.com/j2se/1.4.2/docs/api/java/util/zip/ZipInputStream.html

use getNextEntry() method

little example:

ZipInputStream zis = new ZipInputStream(fileInputStream);
ZipEntry zipEntry;
while(zipEntry = zis.getNextEntry() != null){
    //use zipEntry to get name, etc.
    //get properly parser for current entry
    //use parser with zis (ZipInputStream)
}

good luck
Ernesto

Luke Shannon escribi├│:

>Hello;
>
>Anyone have an ideas on how to index the contents within zip files?
>
>Thanks,
>
>Luke
>
>
>---------------------------------------------------------------------
>To unsubscribe, e-mail: lucene-user-unsubscribe@jakarta.apache.org
>For additional commands, e-mail: lucene-user-help@jakarta.apache.org
>
>
>  
>

-- 
Ernesto De Santis - Colaborativa.net
C├│rdoba 1147 Piso 6 Oficinas 3 y 4
(S2000AWO) Rosario, SF, Argentina.



---------------------------------------------------------------------
To unsubscribe, e-mail: lucene-user-unsubscribe@jakarta.apache.org
For additional commands, e-mail: lucene-user-help@jakarta.apache.org


Mime
View raw message