lucene-solr-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From anupama.gangad...@daimler.com
Subject RE: Using Solr to index zip files
Date Thu, 09 Jun 2016 07:38:31 GMT
Hi,

The nesting level is fixed. Outerzip has many inner zip files(i.e. 1.zip has many zip files).
Currently the outer zip path and inner zip name is stored in a Hive table for reference.
I use a Hive query to find the zip for me.

I intend to index the outer zip file and store all the inner zips as fields(search criteria)
for this index.

Thank you,
Regards,
Anupama

-----Original Message-----
From: Alexandre Rafalovitch [mailto:arafalov@gmail.com]
Sent: Tuesday, June 07, 2016 7:44 PM
To: solr-user
Subject: Re: Using Solr to index zip files

I _think_ DataImportHandler could handle zip files with fixed level of nesting, but not read
from HDFS.

I don't think anything else in Solr will. So, doing it outside of Solr is probably best. Especially,
since you would need to decide how you actually want to map these files (e.g. do you keep
the path for zip within zip, etc).

Regards,
    Alex.
----
Newsletter and resources for Solr beginners and intermediates:
http://www.solr-start.com/


On 7 June 2016 at 12:57,  <anupama.gangadhar@daimler.com> wrote:
> Hi,
>
> I have an use case where I need to search zip files quickly in HDFS. I intend to use
Solr but not finding any relevant information about whether it can be done for zip files.
> These are nested zip files i.e. zips within a zip file. Any help/information is much
appreciated.
>
> Thank you,
> Regards,
> Anupama
>
>
> If you are not the addressee, please inform us immediately that you have received this
e-mail by mistake, and delete it. We thank you for your support.
>

If you are not the addressee, please inform us immediately that you have received this e-mail
by mistake, and delete it. We thank you for your support.

Mime
View raw message