hadoop-common-user mailing list archives

From "Natarajan, Prabakaran 1. (NSN - IN/Bangalore)" <prabakaran.1.natara...@nsn.com>
Subject Hadoop Realtime Queries
Date Thu, 31 Jul 2014 07:32:29 GMT

I want to run real-time queries on HDFS data. I have tried Hadoop/YARN/Hive, Shark on Spark,
Tez, etc.,
but I still couldn't get sub-second performance on the large data set that I have.
I understand Hadoop is not meant for this, but I still want to get as close to it as possible.

1) How can we tune the RHEL OS for this?
2) How can we tune YARN?
3) Is there any stable framework like Tez that can perform much better?
4) Is there any caching strategy that we can adopt?
5) Any articles related to this are welcome.

Thanks in advance.

