Return-Path: Delivered-To: apmail-lucene-java-dev-archive@www.apache.org Received: (qmail 70765 invoked from network); 3 Jun 2009 16:09:45 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.3) by minotaur.apache.org with SMTP; 3 Jun 2009 16:09:45 -0000 Received: (qmail 34355 invoked by uid 500); 3 Jun 2009 16:09:56 -0000 Delivered-To: apmail-lucene-java-dev-archive@lucene.apache.org Received: (qmail 34285 invoked by uid 500); 3 Jun 2009 16:09:56 -0000 Mailing-List: contact java-dev-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: java-dev@lucene.apache.org Delivered-To: mailing list java-dev@lucene.apache.org Received: (qmail 34277 invoked by uid 99); 3 Jun 2009 16:09:56 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 03 Jun 2009 16:09:56 +0000 X-ASF-Spam-Status: No, hits=3.4 required=10.0 tests=HTML_MESSAGE,SPF_NEUTRAL X-Spam-Check-By: apache.org Received-SPF: neutral (athena.apache.org: local policy) Received: from [208.97.132.207] (HELO spunkymail-a12.g.dreamhost.com) (208.97.132.207) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 03 Jun 2009 16:09:46 +0000 Received: from [192.168.1.126] (adsl-76-239-166-174.dsl.pltn13.sbcglobal.net [76.239.166.174]) by spunkymail-a12.g.dreamhost.com (Postfix) with ESMTP id F268A7FA9 for ; Wed, 3 Jun 2009 09:09:19 -0700 (PDT) Message-Id: <29B5BF9B-96D9-4771-9CD2-027470E8D918@apache.org> From: Grant Ingersoll To: java-dev@lucene.apache.org In-Reply-To: <786fde50906030558v70e2523bm31e9fe43e68fcd32@mail.gmail.com> Content-Type: multipart/alternative; boundary=Apple-Mail-11--549817910 Mime-Version: 1.0 (Apple Message framework v935.3) Subject: Re: EnwikiDocMaker Date: Wed, 3 Jun 2009 09:09:14 -0700 References: <37A6F53D-A70E-4DF8-B25D-C341E4726F1D@apache.org> <786fde50906030126g2b4a093bi37383ebbb50ad75@mail.gmail.com> <9ac0c6aa0906030359s40b9f27tab328d8a47c8bf21@mail.gmail.com> <786fde50906030558v70e2523bm31e9fe43e68fcd32@mail.gmail.com> X-Mailer: Apple Mail (2.935.3) X-Virus-Checked: Checked by ClamAV on apache.org --Apple-Mail-11--549817910 Content-Type: text/plain; charset=US-ASCII; format=flowed; delsp=yes Content-Transfer-Encoding: 7bit +1 Note, Xerces Jar is not in benchmark, AFAICT. It relies on the fact that Java uses it under the hood. I'm having this really weird situation where I'm using EnwikiDocMaker outside the context of the benchmarker and I'm grasping at straws as to why it is not working. It seems to be a classpath issue, but is not Lucene related so I'll spare the details. -Grant On Jun 3, 2009, at 5:58 AM, Shai Erera wrote: > Then perhaps as part of 1595 I can change it to use Java's XML > parser, and test the Enwiki file. If all goes well, we may not need > the XERCES jar in benchmark? Anyway, I'll check that too > > On Wed, Jun 3, 2009 at 1:59 PM, Michael McCandless > wrote: > I also don't know why it's specifically using Xerces... > > Mike > > On Wed, Jun 3, 2009 at 4:26 AM, Shai Erera wrote: > > Grant, note that I'm changing the DocMakers in LUCENE-1595 > including this > > one. So whatever the decision is following your question, I can do > it as > > part of this issue, since that code will no longer be in > EnwikiDocMaker. > > > > Regarding to your question, I don't know why it should depend on > Xerces > > (rather than the default Java XML parser I assume?) > > > > Shai > > > > On Wed, Jun 3, 2009 at 2:48 AM, Grant Ingersoll > wrote: > >> > >> Is there a reason the EnwikiDocMaker assumes Xerces for the SAX > parser? > >> Line 96. > >> > >> Thanks, > >> Grant > >> > >> > --------------------------------------------------------------------- > >> To unsubscribe, e-mail: java-dev-unsubscribe@lucene.apache.org > >> For additional commands, e-mail: java-dev-help@lucene.apache.org > >> > > > > > > --------------------------------------------------------------------- > To unsubscribe, e-mail: java-dev-unsubscribe@lucene.apache.org > For additional commands, e-mail: java-dev-help@lucene.apache.org > > -------------------------- Grant Ingersoll http://www.lucidimagination.com/ Search the Lucene ecosystem (Lucene/Solr/Nutch/Mahout/Tika/Droids) using Solr/Lucene: http://www.lucidimagination.com/search --Apple-Mail-11--549817910 Content-Type: text/html; charset=US-ASCII Content-Transfer-Encoding: quoted-printable +1

Note, = Xerces Jar is not in benchmark, AFAICT.  It relies on the fact that = Java uses it under the hood.

I'm having this = really weird situation where I'm using EnwikiDocMaker outside the = context of the benchmarker and I'm grasping at straws as to why it is = not working.  It seems to be a classpath issue, but is not Lucene = related so I'll spare the = details.

-Grant
On Jun 3, 2009, at = 5:58 AM, Shai Erera wrote:

Then perhaps as part of 1595 I can change it to use Java's = XML parser, and test the Enwiki file. If all goes well, we may not need = the XERCES jar in benchmark? Anyway, I'll check that too

On Wed, Jun 3, 2009 at 1:59 PM, Michael = McCandless <lucene@mikemccandless.com> wrote:
I also don't know why it's specifically using Xerces...

= Mike

On Wed, Jun 3, 2009 at = 4:26 AM, Shai Erera <serera@gmail.com> wrote:
> = Grant, note that I'm changing the DocMakers in LUCENE-1595 including = this
> one. So whatever the decision is following your question, I = can do it as
> part of this issue, since that code will no longer be = in EnwikiDocMaker.
>
> Regarding to your question, I don't know = why it should depend on Xerces
> (rather than the default Java XML = parser I assume?)
>
> Shai
>
> On Wed, Jun 3, 2009 at = 2:48 AM, Grant Ingersoll <gsingers@apache.org> wrote:
= >>
>> Is there a reason the EnwikiDocMaker assumes Xerces for the = SAX parser?
>>  Line 96.
>>
>> Thanks,
>> Grant
= >>
>> = ---------------------------------------------------------------------
= >> To unsubscribe, e-mail: java-dev-unsubscrib= e@lucene.apache.org
>> For additional commands, e-mail: java-dev-help@lucene.apach= e.org
>>
>
>

= ---------------------------------------------------------------------
= To unsubscribe, e-mail: java-dev-unsubscrib= e@lucene.apache.org
For additional commands, e-mail: java-dev-help@lucene.apach= e.org

=


--------------------------
Grant = Ingersoll

Search the Lucene ecosystem = (Lucene/Solr/Nutch/Mahout/Tika/Droids) using Solr/Lucene:
=

= --Apple-Mail-11--549817910--