drill-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Ajith Dharwar (JIRA)" <j...@apache.org>
Subject [jira] [Created] (DRILL-3672) Data Provenance and DFS metadata
Date Wed, 19 Aug 2015 12:37:45 GMT
Ajith Dharwar created DRILL-3672:
------------------------------------

             Summary: Data Provenance and DFS metadata
                 Key: DRILL-3672
                 URL: https://issues.apache.org/jira/browse/DRILL-3672
             Project: Apache Drill
          Issue Type: Bug
          Components: Metadata, Storage - Text & CSV
    Affects Versions: 1.1.0
         Environment: 1
            Reporter: Ajith Dharwar
            Assignee: Steven Phillips


My present query on Apache Drill is as follows -

select 
 CAST(columns[0] AS Integer) as ID, 
 to_timestamp(SUBSTR(columns[10],1,14),'YYYYMMddHHmmss') as RecordDt
from dfs.`/data/100Krows/`
What I would like to do is get the modificationTime metadata for each of the .csv files in
my folder as a column something like this -

select 
  CAST(columns[0] AS Integer) as ID, 
  to_timestamp(SUBSTR(columns[10],1,14),'YYYYMMddHHmmss') as RecordDt,
  modificationTime as versionStartTime
from dfs.`/data/100Krows/`
The modificationTime along with filename metadata can be found when I run the following qry

 SHOW FILES in dfs.data
Is there a way to achieve this in its current version 1.1 or 1.0?



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Mime
View raw message