hive-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Saurabh Nanda <>
Subject Re: Importing log files in custom (non-delimited) format
Date Thu, 16 Jul 2009 08:17:04 GMT
So, I'm back to square one. Is there *any* way I can do this using Hive
> alone? I'm fine with running the data through multiple passes, putting it in
> temporary tables, if need be. Should I be looking at UDF or SerDe to achieve
> this?

One way, I'm trying out is to have multiple UDFs, each taking the raw log
entry as input and returning a specific field. For example,
extract_ip_address, extract_apache_uid, extract_uri, etc.

Anything simpler?


View raw message