hadoop-common-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Lukáš Vlček <lukas.vl...@gmail.com>
Subject Re: How Hadoop Works??
Date Fri, 26 Feb 2010 13:03:56 GMT
BTW you can also take a look at this book (complete draft of all chapters
available for download for free)
http://www.umiacs.umd.edu/~jimmylin/book.html

Lukas

On Fri, Feb 26, 2010 at 1:30 PM, Lukáš Vlček <lukas.vlcek@gmail.com> wrote:

> Hi,
>
> if you are serious about Hadoop then I can warmly recommend book by Tom
> White: http://www.hadoopbook.com/
> (Disclaimer: I am not paid for this commercial, I do it just because I
> found Tom's book valuable and worth buying.)
>
> Regards,
> Lukas
>
>
> On Fri, Feb 26, 2010 at 8:55 AM, Sujitha <gscse@tce.edu> wrote:
>
>>
>> > No, Hadoop do not use Lucene.
>>
>> have studied like
>>
>> http://highscalability.com/how-rackspace-now-uses-mapreduce-and-hadoop-query-terabytes-data
>>
>>  ---Given that.....
>>
>>  The way the current Hadoop based system works is:
>>  Raw logs get streamed from hundreds of mail servers to the Hadoop
>>  Distributed File System (”HDFS”) in real time.
>>  MapReduce jobs are scheduled run to index the new data using Apache
>> Lucene
>>  and Solr.
>>  Once the indexes have been built, they are compressed and stored away in
>>  HDFS.
>>  Each Hadoop datanode runs a Tomcat servlet container, which hosts a
>> number
>>  of Solr instances that pull and merge the new indexes, and provide really
>>  fast search results to our support team.
>>
>> > And do you mean slor combine the Lucene and Hadoop ?
>>
>> No..Is that Solr (search server) uses Lucene (has library) that supports
>> the search..Solr needs Lucene to perform full-text indexing and searching
>> etc.,am i right??
>>
>>
>> >
>> >
>> >
>> > On Fri, Feb 26, 2010 at 2:52 PM, <gscse@tce.edu> wrote:
>> >
>> >> hi all
>> >>    when studying how hadoop framework works i have noticed that
>> >> map reduce in turn uses apache lucene for creating index for scheduled
>> >> new data and solr for creating instances. Is that right???
>> >> thanks
>> >> sujitha
>> >>
>> >>
>> >>
>> >>
>> >> -----------------------------------------
>> >> This email was sent using TCEMail Service.
>> >> Thiagarajar College of Engineering
>> >> Madurai-625 015, India
>> >>
>> >>
>> >
>> >
>> > --
>> > Best Regards
>> >
>> > Jeff Zhang
>> >
>>
>>
>> --
>> Suji
>>
>>
>> -----------------------------------------
>> This email was sent using TCEMail Service.
>> Thiagarajar College of Engineering
>> Madurai-625 015, India
>>
>>
>

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message