lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Ben Litchfield <...@csh.rit.edu>
Subject Re: Missing pdf document title
Date Mon, 10 Nov 2003 17:13:27 GMT

I would try two things.

1)Is PDFBox getting the title from the document?
You can run this example to find out

java org.pdfbox.examples.pdmodel.PrintDocumentMetaData <input-pdf>

2)Is the lucene field getting properly set in the lucene database.  I
would use luke(http://www.getopt.org/luke/) to verify that lucene is
getting the field.

Other than that I would double check your code that gets the "Title" field
correctly.

Ben

On Mon, 10 Nov 2003, Zhou, Oliver wrote:

> Hi,
>
> I'm using lucene demo IndexHTML.java with pdfbox-0.6.4 to index pdf files.
> It created the index files.  However, the pdf document title was empty when
> I did search.  Any idea on why?
>
> Thanks
> Oliver
>
>
>
>
>
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: lucene-user-unsubscribe@jakarta.apache.org
> For additional commands, e-mail: lucene-user-help@jakarta.apache.org
>

---------------------------------------------------------------------
To unsubscribe, e-mail: lucene-user-unsubscribe@jakarta.apache.org
For additional commands, e-mail: lucene-user-help@jakarta.apache.org


Mime
View raw message