lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From prashant ullegaddi <prashullega...@gmail.com>
Subject Re: Unable to do exact search with Lucene.
Date Fri, 17 Jul 2009 13:53:30 GMT
Actually, the format of the query for which it worked was:

"\"Apache jakarta\""~10.

Thanks for the help.
Prashant.

On Fri, Jul 17, 2009 at 7:13 PM, Siraj Haider <siraj@jobdiva.com> wrote:

> Try doing a single word search, instead of a phrase.  I once had a similar
> problem when I indexed using Field.setOmitTf(true) which removed all the
> positional information from index, which is required to do phrase searches.
>
> -siraj
>
>
> Erick Erickson wrote:
>
>> The first thing I'd do is get a copy of Luke and look in my index
>> to see exactly what's there. Nothing in your e-mails indicates that you
>> *should* get any hits. Although I admin not getting jakarta lucene in
>> 50M pages seems unlikely.
>>
>> But Ian's suggestion that you start with a smaller index is spot on.
>>
>> Best
>> Erick
>>
>> On Thu, Jul 16, 2009 at 8:42 AM, prashant ullegaddi <
>> prashullegaddi@gmail.com> wrote:
>>
>>
>>
>>> 50 million HTML pages (part of clueweb09 dataset for TREC) were indexed
>>> using Hadoop into 56 indexes. 56 indexes were merged into a single index.
>>> Analyzer is the StandardAnalyzer.
>>>
>>>
>>>
>>> On Thu, Jul 16, 2009 at 6:07 PM, Anshum <anshumg@gmail.com> wrote:
>>>
>>>
>>>
>>>> Hi Prashant,
>>>>
>>>> What did you index? how did you index? what analyzer did you use?
>>>> without
>>>> all of these, perhaps it'd be difficult to figure out the issue.
>>>>
>>>> --
>>>> Anshum Gupta
>>>> Naukri Labs!
>>>> http://ai-cafe.blogspot.com
>>>>
>>>> The facts expressed here belong to everybody, the opinions to me. The
>>>> distinction is yours to draw............
>>>>
>>>>
>>>> On Thu, Jul 16, 2009 at 6:04 PM, prashant ullegaddi <
>>>> prashullegaddi@gmail.com> wrote:
>>>>
>>>>
>>>>
>>>>> Hi,
>>>>>
>>>>> I tried searching:
>>>>> "Apache Jakarta"~10
>>>>>
>>>>> Nothing was returned. What might be wrong?
>>>>>
>>>>> Regards,
>>>>> Prashant.
>>>>>
>>>>>
>>>>>
>>>>
>>
>>
>
>

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message