Ad integration pass through for HDInsight

Hari GS 21 Reputation points
2020-07-20T10:50:21.813+00:00

HI,

I know HDInsight with ESP feature enables AD integration while connecting to cluster. Also teh access to underlying Hive tables can be controlled using Apache Ranger. But i would like to know if the access permission on storage account or datalake account can be passed through cluster.
For ex: if a used is read parquet files using spark.read.option(format,'parquet').load('<datalek gen1 path'), can the cluster use users identity while accessing the datalake. This way if a user has access to some folders in datalake , he should be able to access only those folders.

Azure Data Lake Storage
Azure Data Lake Storage
An Azure service that provides an enterprise-wide hyper-scale repository for big data analytic workloads and is integrated with Azure Blob Storage.
1,410 questions
Azure HDInsight
Azure HDInsight
An Azure managed cluster service for open-source analytics.
204 questions
{count} votes

1 answer

Sort by: Most helpful
  1. HimanshuSinha-msft 19,376 Reputation points Microsoft Employee
    2020-07-22T20:08:47.737+00:00

    Hello HariGs,

    Thanks for the question .

    The authentication part is taken care by kerberos, so the domain users will be able to access the files/folders in storage account.

    Thanks & stay safe

    Himanshu

    Please do consider to click on "Accept Answer" and "Up-vote" on the post that helps you, as it can be beneficial to other community members

    0 comments No comments