hadoop-common-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Pankil Doshi <forpan...@gmail.com>
Subject Best Idea to deal with following situation
Date Sat, 26 Sep 2009 00:48:47 GMT
Hello everyone,

I have job whose result has  only 5 keys but but each key has long list of
values like in 100000's .
What should be best way to deal with it. I feel few of my reducers get over
loaded as two or more keys go to same reduce and hence they have lots of
work to do.

So what should be best way out with this situation?


  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message