hadoop-mapreduce-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Bertrand Dechoux <decho...@gmail.com>
Subject Re: Hadoop Real time help
Date Sun, 19 Aug 2012 19:34:58 GMT
Lucene allows you to build a kind of inverted index "content to document
identifier". Solr or ElasticSearch allows to scale the process.

However, if I am reading it correctly, you are saying that you can not pre
compute a structure (such an index) before the search?

If that's true and that you need to process GB of data, then you have to
allow a latency, if you can not have everything in memory before the search

I can't say anything more precisely.  It will depend on your context. One
may ask : why can't you index the content of your database and your files?


On Sun, Aug 19, 2012 at 9:06 PM, mahout user <mahoutuser@gmail.com> wrote:

> Thanks Mohit and  Bertrand,
>      I am looking into hadoop for search engine as many others. But in
> case of search engine, I know lucene is there. But in my case i have
> implemented java classes, they are searching from databases as well as from
> csv files. But i cant understand if there are GB's of data is there, then
> how can i get real time search service with hadoop. ?
> On Sun, Aug 19, 2012 at 10:06 PM, Mohit Anchlia <mohitanchlia@gmail.com>wrote:
>> On Sun, Aug 19, 2012 at 8:44 AM, mahout user <mahoutuser@gmail.com>wrote:
>>> Hello folks,
>>>    I am new to hadoop, I just want to get information that how hadoop
>>> framework is usefull for real time service.?can any one explain me..?
>>> Thanks.
>> Can you specify your use case? Each use case calls for different design
>> consideration.

Bertrand Dechoux

View raw message