hadoop-common-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Zhou, Yunqing" <azure...@gmail.com>
Subject Re: Help for the problem of running lucene on Hadoop
Date Fri, 31 Dec 2010 12:43:07 GMT
You should implement the Directory class by your self.
Nutch provided one, named HDFSDirectory.
You can use it to build the index, but when doing search on HDFS, it is
relatively slower, especially on phrase queries.
I recommend you to download it to disk when performing a search.

On Fri, Dec 31, 2010 at 5:08 PM, Jander g <jandergj@gmail.com> wrote:

> Hi, all
>
> I want  to run lucene on Hadoop, The problem as follows:
>
> IndexWriter writer = new IndexWriter(FSDirectory.open(new
> File("index")),new StandardAnalyzer(), true,
> IndexWriter.MaxFieldLength.LIMITED);
>
> when using Hadoop, whether the first param must be the dir of HDFS? And how
> to use?
>
> Thanks in advance!
>
> --
> Regards,
> Jander
>

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message