lucene-solr-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Brian Whitman <brian.whit...@variogr.am>
Subject Re: Solr and nutch, for reading a nutch index
Date Wed, 28 Nov 2007 06:31:48 GMT

On Nov 28, 2007, at 1:24 AM, Otis Gospodnetic wrote:

> I only glanced at Sami's post recently and what I think I saw there  
> is something different.  In other words, what Sami described is not  
> a Solr instance pointing to a Nutch-built Lucene index, but rather  
> an app that reads the appropriate Nutch/Hadoop files with fetched  
> content and posts the read content to a Solr instance using a Solr  
> java client like solrj.
> No?
>

Yes, to be clear, all you need from Sami's thing is the schema file.  
Ignore everything else. Then point solr at the nutch index directory  
(it's just a lucene index.)

Sami's entire thing is for indexing with solr instead of nutch,  
separate issue...



> Otis
> --
> Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch
>
> ----- Original Message ----
> From: Norberto Meijome <freebsd@meijome.net>
> To: solr-user@lucene.apache.org
> Cc: brian.whitman@variogr.am
> Sent: Tuesday, November 27, 2007 8:33:18 PM
> Subject: Re: Solr and nutch, for reading a nutch index
>
> On Tue, 27 Nov 2007 18:12:13 -0500
> Brian Whitman <brian.whitman@variogr.am> wrote:
>
>>
>> On Nov 27, 2007, at 6:08 PM, bbrown wrote:
>>
>>> I couldn't tell if this was asked before.  But I want to perform a
>
>>> nutch crawl
>>> without any solr plugin which will simply write to some index
>>> directory.  And
>>> then ideally I would like to use solr for searching?  I am assuming
>
>>> this is
>>> possible?
>>>
>>
>> yes, this is quite possible. You need to have a solr schema that
>> mimics the nutch schema, see sami's solrindexer for an example. Once
>
>> you've got that schema, simply set the data dir in your solrconfig to
>
>> the nutch index location and you'll be set.
>
> I think you should keep an eye on the versions of Lucene library used
> by both Nutch + Solr - differences at this layer *could* make them
> incompatible - but I am not an expert...
> B
>
> _________________________
> {Beto|Norberto|Numard} Meijome
>
> "Against logic there is no armor like ignorance."
>  Laurence J. Peter
>
> I speak for myself, not my employer. Contents may be hot. Slippery  
> when
> wet. Reading disclaimers makes you go blind. Writing them is worse.
> You have been Warned.
>
>
>

--
http://variogr.am/




Mime
View raw message