hadoop-user mailing list archives

From Siddharth Tiwari <siddharth.tiw...@live.com>
Subject Reading multiple lines from a microsoft doc in hadoop
Date Fri, 24 Aug 2012 05:52:13 GMT

Hi,
I have Microsoft Word files in .doc and .docx format. Each file contains entries that are separated by
an empty line. Is it possible to read one entry (the lines between two empty lines) at a time?
Also, which InputFormat should I use to read .doc and .docx files? Please help.
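
To make the "one entry at a time" part concrete, here is a minimal sketch in plain Java. It assumes the text of the Word file has already been extracted into a String (for example with a library such as Apache POI; as far as I know Hadoop does not ship an InputFormat for .doc/.docx, so a custom one would probably be needed). The class name BlankLineSplitter and the method splitEntries are hypothetical names made up for this illustration, not part of Hadoop or POI.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper: splits already-extracted plain text into entries
// that are separated by one or more blank lines.
public class BlankLineSplitter {

    public static List<String> splitEntries(String text) {
        List<String> entries = new ArrayList<>();
        // \R matches any line break (Java 8+). A separator is a line break
        // followed by one or more lines that contain only spaces or tabs.
        for (String chunk : text.split("\\R(?:[ \\t]*\\R)+")) {
            String entry = chunk.trim();
            if (!entry.isEmpty()) {
                entries.add(entry);
            }
        }
        return entries;
    }

    public static void main(String[] args) {
        // Assumed sample input standing in for text extracted from a document.
        String sample = "entry one, line 1\nentry one, line 2\n\nentry two\n\n\nentry three";
        for (String entry : splitEntries(sample)) {
            System.out.println("[" + entry + "]");
        }
    }
}
```

In a real job the same splitting logic would probably live inside a custom RecordReader that emits one entry per record, but that part is not shown here.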

*------------------------*

Cheers !!!

Siddharth Tiwari

Have a refreshing day !!!
"Every duty is holy, and devotion to duty is the highest form of worship of God."

"Maybe other people will try to limit me but I don't limit myself"
 		 	   		  