lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Anshum <ansh...@gmail.com>
Subject Re: Help Needed...
Date Thu, 28 May 2009 11:08:16 GMT
Indexing/Storing are at developers discretion. You may choose to store or
not store a field as per your requirement.

--
Anshum Gupta
Naukri Labs!
http://ai-cafe.blogspot.com

The facts expressed here belong to everybody, the opinions to me. The
distinction is yours to draw............


On Thu, May 28, 2009 at 4:22 PM, Alexander Aristov <
alexander.aristov@gmail.com> wrote:

> you will need to develop parser and indexer.
>
> but remember that in current implementation content is not stored in lucene
> index,
>
> indexed - yes nut not stored.
>
> Best Regards
> Alexander Aristov
>
>
> 2009/5/28 Gaurav Kumar <gaurav.bond.itbhu@gmail.com>
>
> > Hi everyone,
> >
> > I am doing a project using Lucene where i need to index HTML files. I am
> > using Tika to parse HTML files. But i need to index files according to
> > their
> > tags which means that every text present in different HTML tag (like <p>
> > <a>) should be stored in different fields. Can i do that. If yes how?
> Also
> > can i assign different weightage to the tokens present in different
> fields.
> > If yes how?
> >
>

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message