lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From prasanna pradhan <prasanna.prad...@gmail.com>
Subject Re: Parsing large xml files
Date Fri, 22 May 2009 18:04:45 GMT
We had similar a problem  where we had to parse 1 GB XML files.Better
transform to array like json and write a custom search API using lucene.

On Thu, May 21, 2009 at 8:12 PM, Sudarsan, Sithu D. <
Sithu.Sudarsan@fda.hhs.gov> wrote:

>
> Hi,
>
> While trying to parse xml documents of about 50MB size, we run into
> OutOfMemoryError due to java heap space. Increasing JVM to use close 2GB
> (that is the max), does not help. Is there any API that could be used to
> handle such large single xml files?
>
> If Lucene is not the right place, please let me know alternate places to
> look for,
>
> Thanks in advance,
> Sithu D Sudarsan
> sithu.sudarsan@fda.hhs.gov
> sdsudarsan@ualr.edu
>
>
>
>


-- 
Thanks,
Prasanna

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message