lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Oren Shir <sho...@gmail.com>
Subject Re: Lucene/Digester
Date Wed, 19 Oct 2005 10:40:03 GMT
Hi,
About your first question:
How do you know that only the last "Chapter" element is stored? could it be
that you are only getting the last one? Try using getValues() instead of
get.

Oren

On 10/16/05, Malcolm Clark <malcyclark@btinternet.com> wrote:
>
> Hi all,
> I'm using Lucene/Digester etc for my MSc I'm quite new to these API's. I'm
> trying to obtain advice but it's hard to say whether the problem is Lucene
> or Digester.
> Firstly:
> I am trying to index the INEX collection but when I try to index
> repetitive elements only the last one is indexed. For example:
> <Book>
> <Name>
> <Title>
> <Chapter></Chapter>
> <Chapter></Chapter>
> <Chapter></Chapter> //this is the only one indexed
> </Title>
> </Name>
> </Book>
> only the last Chapter element will be indexed and it will skip the first
> two.
> Secondly:
> When using the Digester/Lucene with XML does each file have to contain e.g
> <!DOCTYPE books PUBLIC "-//LBIN//DTD IEEE Mag//EN" "xmlarticle.dtd" or is
> there a way around it?
> I have tried to use the sample line from the Digester API
> digester.register("-//Example Dot Com //DTD Sample Example//EN",
> "assets/sample.dtd");
> but to no avail.
>
> Thanks very much. I really appreciate any possible solutions as I'm
> stumped.
> Malcolm
> Scotland
>
>

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message