hbase-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From sriram <rsriram...@gmail.com>
Subject Distributed Cache
Date Thu, 01 Sep 2011 05:35:02 GMT
Using distributed cache i put a common file in the hdfs.It contains of frequent
files to remove.In the code i converted words in the table into a hashtable and
removed words from other documents if they occur.

The problem is it removes these words for smaller files.If the file size
increases then those words are not removed.

Any reason for what is the problem.

View raw message