Date: Mon, 4 Dec 2006 09:50:13 +0100
From: "Lukas Vlcek"
Reply-To: java-user@lucene.apache.org
To: java-user@lucene.apache.org
Subject: Re: lucene - general question

Hi,

Take a look at Groovy. I haven't used it yet, but some people say it is much
easier to work with XML files in Groovy than in Java. It produces class files,
so it can be integrated with your existing Java code. A 6 MB file is not that
much unless you are working in a limited environment (like a mobile device?).

Also, if the only thing you really need is to search for some strings in two
files, and you don't need to integrate this function with other Java code,
then you can simply go with *nix command-line tools (grep, wc, ...), which
should give you what you need very quickly.

Lukas

On 12/4/06, Eshwaramoorthy Babu wrote:
> Hi Lukas,
>
> Thanks for your response.
> I was planning to search for the IDs from the first XML file in the second
> XML file, so I thought of using Lucene for the search.
> Can you please suggest a scripting solution? Is Perl the right solution?
>
> Thanks,
> Babu
>
> On 12/4/06, Lukas Vlcek wrote:
> > Hi Babu,
> >
> > Sorry, but I don't see any point in using Lucene if you don't need search
> > functionality. Also, for parsing XML files I would consider using a
> > scripting language (as opposed to a pure Java-based solution). The reason
> > is that scripting languages can be more effective when simplicity of the
> > resulting code is important, and as of Java 6 they can run right inside
> > the JVM, so integration with your Java code is very simple.
> >
> > Just my 2 cents.
> >
> > Lukas
> >
> > On 12/4/06, Eshwaramoorthy Babu wrote:
> > > Hi,
> > >
> > > We have a requirement to compare 2 XML files and generate a
> > > result (reconciliation report).
> > > The XML files are 6 MB each and the format is as below:
> > >
> > > 123
> > > 123
> > >
> > > I have to implement the below logic:
> > >
> > > Number of matching IDs in both XML files
> > > Number of non-matching IDs in both XML files
> > > Number of non-matching IDs in both XML files
> > >
> > > I am planning to use Digester and Lucene for the above requirement.
> > >
> > > Is my decision to use Lucene correct, or is there a better approach
> > > for my problem?
> > >
> > > Thanks,
> > > Babu
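
For reference, the ID comparison asked about in the original question can be
sketched in plain Java without Lucene: read the IDs of each file into a set,
then take the intersection and the two differences. This is only a rough
sketch under assumptions. The XML sample in the thread lost its tags, so the
element name is passed in as a parameter; the class name IdReconciler, the
method names and the command-line arguments are all made up for illustration.
It uses the JDK's DOM parser, which should cope with a 6 MB file on an
ordinary JVM.

```java
import java.io.InputStream;
import java.nio.file.Files;
import java.nio.file.Paths;
import java.util.HashSet;
import java.util.Set;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.NodeList;

// Hypothetical helper class; none of these names come from the thread.
class IdReconciler {

    // Collect the trimmed text of every element named `tag` into a set.
    // `tag` is an assumption: the real element name is not known.
    static Set<String> readIds(InputStream xml, String tag) {
        try {
            NodeList nodes = DocumentBuilderFactory.newInstance()
                    .newDocumentBuilder()
                    .parse(xml)
                    .getElementsByTagName(tag);
            Set<String> ids = new HashSet<>();
            for (int i = 0; i < nodes.getLength(); i++) {
                ids.add(nodes.item(i).getTextContent().trim());
            }
            return ids;
        } catch (Exception e) {
            throw new IllegalArgumentException("Could not parse XML input", e);
        }
    }

    // IDs present in both sets.
    static Set<String> inBoth(Set<String> a, Set<String> b) {
        Set<String> result = new HashSet<>(a);
        result.retainAll(b);
        return result;
    }

    // IDs present in `a` but missing from `b`.
    static Set<String> onlyIn(Set<String> a, Set<String> b) {
        Set<String> result = new HashSet<>(a);
        result.removeAll(b);
        return result;
    }

    public static void main(String[] args) throws Exception {
        if (args.length < 3) {
            System.out.println("usage: java IdReconciler first.xml second.xml idTag");
            return;
        }
        Set<String> first;
        Set<String> second;
        try (InputStream in1 = Files.newInputStream(Paths.get(args[0]));
             InputStream in2 = Files.newInputStream(Paths.get(args[1]))) {
            first = readIds(in1, args[2]);
            second = readIds(in2, args[2]);
        }
        System.out.println("Matching IDs: " + inBoth(first, second).size());
        System.out.println("Only in first file: " + onlyIn(first, second).size());
        System.out.println("Only in second file: " + onlyIn(second, first).size());
    }
}
```

Sets are used so that each lookup is a hash probe rather than a scan of the
other file. Note that duplicate IDs within one file are counted once; if
duplicates matter for the report, a map from ID to count would be needed
instead.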