nutch-user mailing list archives

From Tolga <to...@ozses.net>
Subject Re: Search in specific website
Date Tue, 16 Oct 2012 13:01:31 GMT
The Solr list sent me to the Nutch list, but okay. Thanks,

On 10/16/2012 02:27 PM, Lewis John Mcgibbney wrote:
> Hi Tolga,
>
> Please take this to the Solr user@ list.
>
> Thank you
>
> Lewis
>
> On Tue, Oct 16, 2012 at 12:13 PM, Tolga <tolga@ozses.net> wrote:
>> Hi,
>>
>> I've tried url:fass\.sabanciuniv\.edu AND content:this, and I got results
>> from both of my URLs. What should I do?
>>
>> Regards,
>>
>>
>> On 10/13/2012 12:48 AM, Alejandro Caceres wrote:
>>> Once you've indexed it with Solr this can be done using Solr Query
>>> Syntax. Essentially what you're asking boils down to a Solr question.
>>> In your example situation you could do something like this in Solr:
>>>
>>> url:example\.net AND content:<whatever you are looking for>
>>>
>>> ...or something of the sort. This will search a url with example.net
>>> in it for whatever content you're looking for. Is this what you are
>>> looking for? If not we need more details of what you have tried and
>>> what issues you are having.
>>>
>>> On Fri, Oct 12, 2012 at 5:03 PM, Tolga <tolga@ozses.net> wrote:
>>>> Not really. Let me elaborate. If I pass it multiple URLs such as
>>>> http://example.com, example.net and example.org, how can I search only in
>>>> example.net?
>>>>
>>>> Regards,
>>>>
>>>> On 12 October 2012 23:55, Tejas Patil <tejas.patil.cs@gmail.com> wrote:
>>>>
>>>>> Hi Tolga,
>>>>>
>>>>> To search for specific content on a specific website, crawl it first,
>>>>> then index it and search for the term after loading the indexes into Solr.
>>>>> Does that answer your question?
>>>>>
>>>>> Thanks,
>>>>> Tejas
>>>>>
>>>>> On Fri, Oct 12, 2012 at 12:55 PM, Tolga <tolga@ozses.net> wrote:
>>>>>
>>>>>> Hi,
>>>>>>
>>>>>> I use Nutch to crawl my website and index to Solr. However, how can I
>>>>>> search for a piece of content in a specific website? I use multiple URLs.
>>>>>>
>>>>>> Regards,
>>>>>>
>>>
>
>
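
For reference, the query form suggested in the thread (url:example\.net AND
content:<term>) could be sent to Solr's select handler roughly as sketched
below. This is only a sketch under assumptions: the base address
http://localhost:8983/solr and the helper names are hypothetical, and the
field names url and content are simply the ones used in the thread, so they
need to match the schema Nutch actually indexed with.

```python
from urllib.parse import urlencode


def build_query(host_fragment, term):
    """Build 'url:<host> AND content:<term>', escaping dots as in the thread."""
    escaped = host_fragment.replace(".", r"\.")
    return f"url:{escaped} AND content:{term}"


def build_select_url(solr_base, host_fragment, term, rows=10):
    """Return a URL for Solr's /select handler; solr_base is an assumed address."""
    params = {
        "q": build_query(host_fragment, term),  # the query from the thread
        "wt": "json",                            # ask for a JSON response
        "rows": rows,                            # number of hits to return
    }
    return f"{solr_base.rstrip('/')}/select?{urlencode(params)}"


if __name__ == "__main__":
    # Hypothetical local Solr instance; adjust to the real host and core.
    print(build_select_url("http://localhost:8983/solr", "example.net", "nutch"))
```

Opening the printed URL in a browser or with curl shows which documents
match, which makes it easier to check whether hits from other hosts are
still coming back.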

