lucene-java-user mailing list archives

From Don Vaillancourt <d...@webimpact.com>
Subject Re: Title of PDF
Date Mon, 28 Jun 2004 15:17:01 GMT
It can't be that simple because the field will contain the whole PDF and 
not just the title.  And for PDFs that are 3 or 4 megs, it is really not 
reasonable to store the whole PDF in the collection just to get the title.
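
For illustration, the stored-versus-unstored distinction raised in this
thread can be sketched without Lucene itself.  The class below
(SketchDocument) and its methods are hypothetical stand-ins, not Lucene's
API.  The sketch assumes the title has already been extracted from the PDF
as a short string, and only models the idea that a small title field can be
stored for display while the large body text is indexed but not stored.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Locale;
import java.util.Map;

// Hypothetical stand-in for an index document; NOT Lucene's API.
final class SketchDocument {
    // Original values kept verbatim so they can be shown in a result list.
    private final Map<String, String> stored = new HashMap<>();
    // Field name -> lowercased tokens; searchable, but the text is not kept.
    private final Map<String, List<String>> indexed = new HashMap<>();

    // Index the value and also keep the original string (a "stored" field).
    void addStored(String name, String value) {
        stored.put(name, value);
        indexed.put(name, tokenize(value));
    }

    // Index the value but discard the original string (an "unstored" field).
    void addUnstored(String name, String value) {
        indexed.put(name, tokenize(value));
    }

    // Returns the original string, or null if the field was not stored.
    String get(String name) {
        return stored.get(name);
    }

    // True if the field was indexed and contains the term (case-insensitive).
    boolean matches(String name, String term) {
        List<String> tokens = indexed.get(name);
        return tokens != null && tokens.contains(term.toLowerCase(Locale.ROOT));
    }

    // Very rough tokenizer: lowercase, split on non-word characters.
    private static List<String> tokenize(String text) {
        List<String> tokens = new ArrayList<>();
        for (String t : text.toLowerCase(Locale.ROOT).split("\\W+")) {
            if (!t.isEmpty()) {
                tokens.add(t);
            }
        }
        return tokens;
    }
}

class TitleFieldSketch {
    public static void main(String[] args) {
        SketchDocument doc = new SketchDocument();
        // Small field: stored, so it can be listed with the search results.
        doc.addStored("title", "Annual Report 2004");
        // Large field: searchable only; the full text is not kept.
        doc.addUnstored("contents", "Revenue grew in the second quarter");

        System.out.println(doc.matches("contents", "revenue"));
        System.out.println(doc.get("title"));
        System.out.println(doc.get("contents"));
    }
}
```

In terms of the Field methods named in the quoted reply below, the title
would go in a Text or Keyword field, and the body in an UnStored field or a
Text field built from a Reader.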

At 11:09 AM 28/06/2004, you wrote:
>Don,
>
>Sure, just make sure you pick the Field that stores the original text
>(e.g. Keyword or Text, NOT UnStored :)).  It does not matter that the
>source of the data is (PDF or HTML or Java String), as long as you use
>the appropriate Field method.  Also, don't pass a Reader, as that won't
>save your original text.
>
>Otis
>
>
>--- Don Vaillancourt <donv@webimpact.com> wrote:
> > Is there a way to store the title of a PDF in a Lucene collection for
> > retrieval when listing search results?
> >
> > Thanks
> >
> >
> > Don Vaillancourt
> > Director of Software Development
> >
> > WEB IMPACT INC.
> > 416-815-2000 ext. 245
> > email: donv@web-impact.com
> > web: http://www.web-impact.com
> >
> > This email message is intended only for the addressee(s)
> > and contains information that may be confidential and/or
> > copyright.  If you are not the intended recipient please
> > notify the sender by reply email and immediately delete
> > this email. Use, disclosure or reproduction of this email
> > by anyone other than the intended recipient(s) is strictly
> > prohibited. No representation is made that this email or
> > any attachments are free of viruses. Virus scanning is
> > recommended and is the responsibility of the recipient.

Don Vaillancourt
Director of Software Development

WEB IMPACT INC.
416-815-2000 ext. 245
email: donv@web-impact.com
web: http://www.web-impact.com



