lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Paul Elschot <paul.elsc...@xs4all.nl>
Subject Re: SpanQuery for Terms at same position
Date Mon, 23 Nov 2009 16:56:51 GMT
Op maandag 23 november 2009 17:27:56 schreef Christopher Tignor:
> A slop of -1 doesn't work either.  I get no results returned.

I think the problem is in the NearSpansOrdered.docSpansOrdered methods.
Could you replace the < by <= in there (4 times) and try again?
That will allow spans at the same position to be considered ordered.
From a quick reading of the code both the unordered and ordered cases
might work for a slop of -1 with that modification.

> 
> this would be a *really* helpful feature for me if someone might suggest an
> implementation as I would really like to be able to do arbitrary span
> searches where tokens may be at the same position and also in other
> positions where the ordering of subsequent terms may be restricted as per
> the normal span API.

My pleasure,
Paul Elschot

> 
> thanks,
> 
> C>T>
> 
> On Sun, Nov 22, 2009 at 7:50 AM, Paul Elschot <paul.elschot@xs4all.nl>wrote:
> 
> > Op zondag 22 november 2009 04:47:50 schreef Adriano Crestani:
> > > Hi,
> > >
> > > I didn't test, but you might want to try SpanNearQuery and set slop to
> > zero.
> > > Give it a try and let me know if it worked.
> >
> > The slop is the number of positions "in between", so zero would still be
> > too
> > much to only match at the same position.
> >
> > SpanNearQuery may or may not work for a slop of -1, but one could try
> > that for both the ordered and unordered cases.
> > One way to do that is to start from the existing test cases.
> >
> > Regards,
> > Paul Elschot
> >
> > >
> > > Regards,
> > > Adriano Crestani
> > >
> > > On Thu, Nov 19, 2009 at 7:28 PM, Christopher Tignor <
> > ctignor@thinkmap.com>wrote:
> > >
> > > > Hello,
> > > >
> > > > I would like to search for all documents that contain both "plan" and
> > "_v"
> > > > (my part of speech token for verb) at the same position.
> > > > I have tokenized the documents accordingly so these tokens exists at
> > the
> > > > same location.
> > > >
> > > > I can achieve programaticaly using PhraseQueries by adding the Terms
> > > > explicitly at the same position but I need to be able to recover the
> > > > Payload
> > > > data for each
> > > > term found within the matched instance of my query.
> > > >
> > > > Unfortunately the PayloadSpanUtil doesn't seem to return the same
> > results
> > > > as
> > > > the PhraseQuery, possibly becuase it is converting it inoto Spans first
> > > > which do not support searching for Terms at the same document position?
> > > >
> > > > Any help appreciated.
> > > >
> > > > thanks,
> > > >
> > > > C>T>
> > > >
> > > > --
> > > > TH!NKMAP
> > > >
> > > > Christopher Tignor | Senior Software Architect
> > > > 155 Spring Street NY, NY 10012
> > > > p.212-285-8600 x385 f.212-285-8999
> > > >
> > >
> >
> > ---------------------------------------------------------------------
> > To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
> > For additional commands, e-mail: java-user-help@lucene.apache.org
> >
> >
> 
> 
> -- 
> TH!NKMAP
> 
> Christopher Tignor | Senior Software Architect
> 155 Spring Street NY, NY 10012
> p.212-285-8600 x385 f.212-285-8999
> 

---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-user-help@lucene.apache.org


Mime
View raw message