lucene-java-user mailing list archives

From Aditya <findbestopensou...@gmail.com>
Subject Re: Getting terms from unstored fields, doc-wise
Date Fri, 27 Jul 2012 06:11:24 GMT
Hi

If a field is not stored, its original text cannot be retrieved in the same
format. Using IndexReader as you listed, you can retrieve the list of terms
indexed for each doc. Those terms are the analyzer's output, not the original
text, so you may not get the exact data back.
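
The term-doc pairs from the brute-force steps still have to be regrouped by
document. A minimal sketch of that regrouping step follows; it is an
illustration, not tested against a real index. It makes no Lucene calls: the
class name UninvertField and the Map-based postings are hypothetical stand-ins
for what indexReader.terms() and indexReader.termDocs(term) would supply. In
Lucene 3.x, IndexReader.terms(Term) can also start the enumeration at the
first term of the target field, which should avoid scanning every term in the
index.

```java
import java.util.LinkedHashMap;
import java.util.Map;
import java.util.SortedSet;
import java.util.TreeMap;
import java.util.TreeSet;

public class UninvertField {

    /**
     * Regroups postings (term -> doc ids) into doc id -> sorted set of terms.
     * The postings map is a hypothetical stand-in for the (term, doc) pairs
     * collected from one field of the index.
     */
    public static Map<Integer, SortedSet<String>> uninvert(Map<String, int[]> postings) {
        Map<Integer, SortedSet<String>> termsByDoc = new TreeMap<Integer, SortedSet<String>>();
        for (Map.Entry<String, int[]> entry : postings.entrySet()) {
            for (int docId : entry.getValue()) {
                SortedSet<String> terms = termsByDoc.get(docId);
                if (terms == null) {
                    terms = new TreeSet<String>();
                    termsByDoc.put(docId, terms);
                }
                terms.add(entry.getKey());
            }
        }
        return termsByDoc;
    }

    public static void main(String[] args) {
        // Made-up postings for a single field, for illustration only.
        Map<String, int[]> postings = new LinkedHashMap<String, int[]>();
        postings.put("lucene", new int[] {0, 2});
        postings.put("index", new int[] {0, 1});
        postings.put("term", new int[] {2});

        // prints {0=[index, lucene], 1=[index], 2=[lucene, term]}
        System.out.println(uninvert(postings));
    }
}
```

The result holds analyzed terms only. Word order and duplicates within a doc
are lost, which matches the note that positional information is not needed.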

Regards
Aditya
www.findbestopensource.com

On Fri, Jul 27, 2012 at 1:34 AM, Phanindra R <phaninra@gmail.com> wrote:

> Thanks for the reply Abdul.
>
> I was exploring the API and I think we can retrieve all those words by
> using a brute-force approach.
>
> 1) Get all the terms using indexReader.terms()
>
> 2) Process the term only if it belongs to the target field.
>
> 3) Get all the docs using indexReader.termDocs(term);
>
> 4) So, we have the term-doc pairs at this point.
>
> Is there a better approach than the above procedure, which takes forever?
>
> Thanks,
> Phanindra
>
>
>
> On Thu, Jul 26, 2012 at 11:46 AM, in.abdul <in.abdul@gmail.com> wrote:
>
> > No, it's not possible to get data that is not stored.
> > On Jul 26, 2012 10:27 PM, "Phanindra R [via Lucene]"
> > <ml-node+s472066n3997487h23@n3.nabble
> > >
> > > Hi,
> > > I've an index to analyze (manually). Unfortunately, I cannot rebuild
> > > the index. Some of the fields are 'unstored'. I was wondering whether
> > > there's any way to get the terms from an unstored field for each doc.
> > > Positional information is not necessary. Lucene version is 3.5.
> > >
> > > The reason I am trying to get those terms is so that I can add that
> > > field to my own index for every doc. And, yes, there's another
> > > id-type field which allows me to identify the document in both indices.
> > >
> > > Any guidance is highly appreciated.
> > >
> > > Thanks,
> > > Phani
> > >
> > -----
> > THANKS AND REGARDS,
> > SYED ABDUL KATHER
> > --
> > View this message in context:
> > http://lucene.472066.n3.nabble.com/Getting-terms-from-unstored-fields-doc-wise-tp3997487p3997510.html
> > Sent from the Lucene - Java Users mailing list archive at Nabble.com.
>
