Linking Blob storage with azure-hdinsight table
Hi All, I am new to Azure data lake. My requirement is like I need to store image and each image refers to advertisement (could be of string type). For this I have stored images in azure storage account and advertisement is stored in azure-hdinsight…
Security Recommendations for Azure Data and Analytics Services
I am working on Securing Data and Analytics Services on Azure. I want to know what security controls i can apply after creating of services and what i can apply only during the service creation. Below are the recommendation i have found as of now. Could…
![](https://techprofile.blob.core.windows.net/images/VfQFAmOikEWfBHko2XlWTA.png?8D7F33)
Link HDInsight Cluster in VSCode to Show Hive Tables
Dear All I am using the latest release of VSCode 1.48.2 to connect to my HDInsight Spark Cluster (HDI 3.6, Spark 2.3). It successfully lists the Hive Databases available in my cluster when I browse to the 'Hive Databases' section insight the…
![](https://techprofile.blob.core.windows.net/images/-kBKepa7U0a5d4kNlSs0Pg.png?8D81DE)
![](https://techprofile.blob.core.windows.net/images/VfQFAmOikEWfBHko2XlWTA.png?8D7F33)
HDInsight Spark Cluster Customization with Boostrapping and Custom Action Scripts
Hello All We use both bootstrapping (via ARM templates) and action scripts to provision our HDInsight Spark Cluster (HDI 3.6, Spark 2.3). We face several challenges (in no particular order): First, some of the bootstrapping statements are not…
![](https://techprofile.blob.core.windows.net/images/-kBKepa7U0a5d4kNlSs0Pg.png?8D81DE)
![](https://techprofile.blob.core.windows.net/images/VfQFAmOikEWfBHko2XlWTA.png?8D7F33)
Networking Issue on Azure HDInsight Spark Cluster with ESP
Dear All We encounter an issue with networking / DNS on our Azure HDInsight Spark cluster. The cluster is joined to our AAD (i.e., it's a cluster with ESP enabled). The cluster gets automatically created with a PS runbook and ARM template file. This…
![](https://techprofile.blob.core.windows.net/images/-kBKepa7U0a5d4kNlSs0Pg.png?8D81DE)
![](https://techprofile.blob.core.windows.net/images/VfQFAmOikEWfBHko2XlWTA.png?8D7F33)
Configure HDFS Storage for Zeppelin Notebooks on HDInsight Spark Clusters with ESP
Dear All We followed this step-by-step tutorial to configure HDFS storage for Zeppelin notebooks on our ESP-enabled HDInsight Spark Cluster (HDI 3.9, Spark 2.3):…
![](https://techprofile.blob.core.windows.net/images/-kBKepa7U0a5d4kNlSs0Pg.png?8D81DE)
![](https://techprofile.blob.core.windows.net/images/VfQFAmOikEWfBHko2XlWTA.png?8D7F33)
HDInsight Zeppelin Notebook Not Working
Hi All We are running an ESP-enabled HDInsight Spark cluster in Azure. We have no clue why some of our domain users are not able to use Zeppelin notebooks (usint the pyspark interpreter in our case). This is the very simple code that results in…
![](https://techprofile.blob.core.windows.net/images/-kBKepa7U0a5d4kNlSs0Pg.png?8D81DE)
![](https://techprofile.blob.core.windows.net/images/-kBKepa7U0a5d4kNlSs0Pg.png?8D81DE)
How to alter kafka topic config in ESP enabled HDInsight
Hi All, I am not able to alter topic config using kafka-configs.sh binary. I am passing jaas config file and I am sure the user has sufficient permissions. But I am always getting "org.apache.zookeeper.KeeperException$NoAuthException:…
Ad integration pass through for HDInsight
HI, I know HDInsight with ESP feature enables AD integration while connecting to cluster. Also teh access to underlying Hive tables can be controlled using Apache Ranger. But i would like to know if the access permission on storage account or datalake…
Unable to run spark.sql queries on hive table using spark-shell (Class org.apache.hadoop.fs.adl.HdiAdlFileSystem not found)
I am trying to run below query on spark-shell on HDInsight cluster: val df=spark.sql("select * from hivesampletable") But it is giving below error repeatedly (irrespective of the query): 2020-08-10 07:29:21 WARN ObjectStore:568 -…
![](https://techprofile.blob.core.windows.net/images/VfQFAmOikEWfBHko2XlWTA.png?8D7F33)
Proper Cores/Executors Configuration in HD-Insight
Proper Cores/Executors Configuration in HD-Insight And for this cluster i've this configuration Which is the best way to make a proper configuration in order to run efficiently a job in Spark. Is it Ok this configuration? Thanks!
![](https://techprofile.blob.core.windows.net/images/VfQFAmOikEWfBHko2XlWTA.png?8D7F33)
Is there any kind of Powerbi connector for Hadoop?
I've been testing some visualization tools and next is PowerBi. Some tools made me use apache drill, but it seems that Powerbi is full of connectors. Is there a way to connect naturally to hadoop (not hdinsight) or an easy workaround?
![](https://techprofile.blob.core.windows.net/images/iqsEOWFvAQAAAAAAAAAAAA.png?8D838D)
![](https://techprofile.blob.core.windows.net/images/VfQFAmOikEWfBHko2XlWTA.png?8D7F33)
Using Spark action in HDInsight Hue
I have created a Spark 2.4 cluster using HDInsights in Azure. I have installed Hue over it using Script actions. Also did the necessary steps for SSH tunneling and connecting to Hue UI. However, on the Hue UI I am able to only see Pig and Hive…
![](https://techprofile.blob.core.windows.net/images/VfQFAmOikEWfBHko2XlWTA.png?8D7F33)
What are Recommended Solutions to work with RStudio and HDInsight Spark Cluster
Dear All We are currently implementing in-house / on-prem machine learning solutions in R (RStudio). We are in the process of moving our data to the cloud by the means of a sophisticated ingestion process through Apache Nifi. Currently the data lands…
![](https://techprofile.blob.core.windows.net/images/-kBKepa7U0a5d4kNlSs0Pg.png?8D81DE)
![](https://techprofile.blob.core.windows.net/images/-kBKepa7U0a5d4kNlSs0Pg.png?8D81DE)
Not able to edit core-site.xml file
Hello, I am using free version of azure.i am trying to add some entry in core-site.xml for HDInsight configuration Which is at location /etc/hadoop/conf location. But,that file is read-only.how i can change permissions of that file So that i can…
![](https://techprofile.blob.core.windows.net/images/VfQFAmOikEWfBHko2XlWTA.png?8D7F33)
How to change Azure HDInsight Hadoop to Azure Private endpoint.
I already create a Azure HDInsight Hadoop cluster but in public network. Now,I want to change it into Private network using Azure Private Link to private endpoint. How can I change "https://<CLUSTERNAME>.azurehdinsight.net" to "…
java.io.IOException: Stream is closed! Error in HDInsight with ADLS Gen 2
I am currently using Hail for the pyspark library to perform varying operations on Genomic data in ADLS Gen 2 with an HDInsight 4.0, Spark 2.4 cluster. I have been in touch with the development team regarding this error I have been getting when running a…
![](https://techprofile.blob.core.windows.net/images/MC_kzmROIEmSQPc4fvWl5w.png?8D81C3)
Unable to access Azure Blob storage from HDInsight cluster
Hi, I have spun up a HDInsight Spark cluster and am trying to access blob storage on the cluster as follows, but getting an exception: hdfs dfs -ls wasbs://deploy@nisumstorageaccount2.blob.core.windows.net/ 20/06/16 19:28:39 ERROR…
![](https://techprofile.blob.core.windows.net/images/VfQFAmOikEWfBHko2XlWTA.png?8D7F33)
Unable to change spark.executor.heartbeatInterval parameter
I try to run a Jupyter Notebook on HDInsight with Spark; after some time (observed: 15, 17, 30 minutes), execution fails with error message: Error with 400 StatusCode: "requirement failed: Session isn't active." According to Stack Overflow…
Configuring yarn alerts in Azure Monitor
Hi All, Is there any customized alerts to monitor the HDInsight yarn memory from Azure monitor.