hadoop-common-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From kartik saxena <kartik....@gmail.com>
Subject Indexing on top of Hadoop
Date Wed, 10 Jun 2009 12:49:36 GMT

I have a huge  LDIF file in order of GBs spanning some million user records.
I am running the example "Grep" job on that file. The search results have
not really been
upto expectations because of it being a basic per line , brute force.

I was thinking of building some indexes inside HDFS for that file , so that
the search results could improve. What could I possibly try to achieve this?


  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message