arrow-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "bb (Jira)" <j...@apache.org>
Subject [jira] [Created] (ARROW-6756) Include HDFS `getfacl` in `pyarrow.hdfs.HadoopFileSystem`
Date Tue, 01 Oct 2019 20:10:00 GMT
bb created ARROW-6756:
-------------------------

             Summary: Include HDFS `getfacl` in `pyarrow.hdfs.HadoopFileSystem`
                 Key: ARROW-6756
                 URL: https://issues.apache.org/jira/browse/ARROW-6756
             Project: Apache Arrow
          Issue Type: Wish
          Components: Python
    Affects Versions: 0.13.0
            Reporter: bb


Extended HDFS filesystem attributes such as role-based access through Cloudera's Sentry are
exposed through the `getfacl` command.

It would be immensely help to have this exposed through the `pyarrow.hdfs.HadoopFileSystem`.

Link to the official Hadoop docs where this is discussed in more detail:

[https://hadoop.apache.org/docs/current/hadoop-project-dist/hadoop-common/FileSystemShell.html#getfacl]

Sample output from the *nix shell:

```$ hadoop fs -getfacl /path/to/hdfs/dir
# file: /path/to/hdfs/dir
# owner: hive
# group: hive
user::rwx
group:unix_group_with_acl_privs_defined:rwx
group::---
user:hive:rwx
group:hive:rwx
mask::rwx
other::--x```



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Mime
View raw message