204 questions with Azure HDInsight tags

1 answer

Zeppelin notebook - sc.textFile does not work for HDI with ESP

We have an HDI cluster with ESP enabled. From our Zeppelin notebook, reading data into a Dataset (spark.read.text) works, but reading it into an RDD (sc.textFile) throws an authentication exception. Note that, while sc.textFile…

Azure HDInsight
An Azure managed cluster service for open-source analytics.
204 questions
asked 2021-02-11T15:14:14.283+00:00
Steven Lai 1 Reputation point
commented 2022-04-01T04:03:14.057+00:00
PRADEEPCHEEKATLA-MSFT 84,531 Reputation points Microsoft Employee
0 answers

unable to access index for repository https://mran.microsoft.com/snapshot/2017-03-15/src/contrib

When I did the same thing last month it still worked, but today, when I tried to install an R package from the MRAN repository, I got this error. Checking the repository in a browser also fails: the repository could not be found. Could you please help me in the…

Azure HDInsight
asked 2022-03-19T10:46:16.717+00:00
Suryanto 16 Reputation points
commented 2022-03-21T22:31:04.06+00:00
Saurabh Sharma 23,781 Reputation points Microsoft Employee
1 answer

HDInsight: Commands to clean up the space

Hi, at my workplace we are using HDInsight 3.6. We have run into space issues before, but we were able to resolve them by running simple cleanup commands from the edge node. Unfortunately, these commands have not helped recently.…

Azure HDInsight
asked 2022-03-10T19:51:28.77+00:00
vijay singh parmar 26 Reputation points
commented 2022-03-17T10:07:15.783+00:00
PRADEEPCHEEKATLA-MSFT 84,531 Reputation points Microsoft Employee
1 answer. One of the answers was accepted by the question author.

run job in HDInsight compute linked service

Is a username and password the only way to submit a job to an HDInsight cluster? Is a managed identity (MSI) or service principal supported? Added question: can the HDI team build an API that uses AAD tokens as the password instead of a user-entered password? We have…

Azure HDInsight
Azure Data Factory
An Azure service for ingesting, preparing, and transforming data at scale.
10,027 questions
asked 2022-03-04T21:52:05.777+00:00
Bill Kan 21 Reputation points
commented 2022-03-11T03:03:01.097+00:00
PRADEEPCHEEKATLA-MSFT 84,531 Reputation points Microsoft Employee
1 answer

Kafka REST Proxy Authentication

We would like to develop an HDInsight Kafka cluster to share real-time data with a subcontractor. The REST proxy documentation indicates that "Kafka clients that need access to the REST proxy should be registered to a group by the group owner."…

Azure HDInsight
asked 2022-02-18T18:30:46.19+00:00
Mike McNulty 1 Reputation point
commented 2022-02-27T17:35:16.017+00:00
ShaikMaheer-MSFT 38,321 Reputation points Microsoft Employee
0 answers

Run C# mapreduce job

I am a beginner with Hadoop MapReduce. I have implemented a MapReduce job in Visual C# and want to run it locally. As I understand it, the HDInsight emulator hasn't been updated for a long time. What other options do I have to run the job locally?

Azure HDInsight
asked 2022-02-17T17:59:18.127+00:00
Lilukshi Silva 1 Reputation point
commented 2022-02-25T20:30:24.82+00:00
Lilukshi Silva 1 Reputation point
0 answers

Azure login audit logs not accurate

I have a user on Azure HD who shows one failed login on 2/17/22 when I search the Sign-in logs over a time frame of 1 month. I know for a fact that this user successfully logged in several times on and after 1/24/22, but nothing shows except…

Azure HDInsight
asked 2022-02-18T18:47:10.96+00:00
Ronald Flourish 1 Reputation point
commented 2022-02-22T13:23:22.677+00:00
ShaikMaheer-MSFT 38,321 Reputation points Microsoft Employee
0 answers

How to execute Hive query in Databricks?

We are calling a ".jar" file from Azure Data Factory using the Databricks JAR activity. In the JAR activity we specify the cluster ID in the Databricks linked service. In the Databricks cluster we add the Spark config below: …

Azure Data Lake Storage
An Azure service that provides an enterprise-wide hyper-scale repository for big data analytic workloads and is integrated with Azure Blob Storage.
1,410 questions
Azure HDInsight
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
2,047 questions
asked 2022-02-11T12:21:07.377+00:00
Suman Dutta 1 Reputation point
commented 2022-02-16T19:47:50.707+00:00
HimanshuSinha-msft 19,376 Reputation points Microsoft Employee
1 answer

Azure Data Residency and GDPR Compliance Criteria

All our current Azure resources, workloads, and SQL databases are located in the West Europe region. We now want to create new SQL databases in the US East and US West regions and reuse the existing Azure workloads. Just checking whether it violates…

Azure HDInsight
Azure Lab Services
An Azure service that is used to set up labs for classrooms, trials, development and testing, and other scenarios.
287 questions
asked 2022-02-11T10:54:43.41+00:00
RKG 1 Reputation point
commented 2022-02-15T17:16:30.12+00:00
ShaikMaheer-MSFT 38,321 Reputation points Microsoft Employee
1 answer

HDInsight Spark Livy server stopping as soon as it starts

I am a new user of Azure HDInsight. I have created a 4-node Spark cluster. Once the cluster was successfully created, I saw that the Spark history server and Livy were both in a stopped state. And when I try to start them from Ambari, they get stopped by…

Azure HDInsight
asked 2022-01-16T18:37:09.537+00:00
Gourav Sharma 1 Reputation point
commented 2022-01-24T20:23:37.683+00:00
HimanshuSinha-msft 19,376 Reputation points Microsoft Employee
1 answer

HDInsight Kafka Disaster Recovery Solution

Hi, we need to create a DR strategy for an HDInsight Kafka cluster. We are considering cross-region unidirectional replication via MirrorMaker from the primary to the secondary cluster. The questions we have, however, are: What kind of RTO and RPO we…

Azure HDInsight
asked 2022-01-07T13:10:37.33+00:00
Chandra Manral 1 Reputation point
commented 2022-01-24T16:49:25.893+00:00
ShaikMaheer-MSFT 38,321 Reputation points Microsoft Employee
1 answer

Accessing HBase on HDInsight cluster via public internet

Hi, is there a way to access an HBase cluster on HDInsight from the public internet, rather than from within Azure infrastructure? I'm trying to migrate smoothly from GCP Bigtable to HBase on Azure HDInsight and need to access Azure's HBase cluster from our current…

Azure HDInsight
asked 2022-01-06T09:13:37.287+00:00
wooseok 1 Reputation point
commented 2022-01-13T10:29:12.287+00:00
PRADEEPCHEEKATLA-MSFT 84,531 Reputation points Microsoft Employee
4 answers. One of the answers was accepted by the question author.

Upgrade HDInsight Python Version

I know that HDInsight uses Python 3.5, but is there a way to upgrade the minor version to Python 3.6 or above? The reason is that we have a third-party package that only works on Python 3.6 or above. …

Azure HDInsight
asked 2020-10-26T12:34:49.347+00:00
HenryX 21 Reputation points
commented 2021-12-23T07:26:55.54+00:00
Sarthak Agrawal 1 Reputation point Microsoft Employee
1 answer

Log4J vulnerability in Azure HDInsight

A Log4J vulnerability has been trending recently. May I get clarification on the points below? 1) How does the Log4J vulnerability impact the HDInsight service? Is there any impact on the Yarn/Hive/Spark logging utilities? 2) How can I prevent or take precautions from…

Azure HDInsight
asked 2021-12-14T06:38:47.437+00:00
UDAYA SRINIVASARAO KOTHAMASU 1 Reputation point
commented 2021-12-22T12:05:03.9+00:00
PRADEEPCHEEKATLA-MSFT 84,531 Reputation points Microsoft Employee
0 answers

AmbariClusterCreationFailedErrorCode

Unable to launch an ESP-enabled HDInsight cluster. Getting the error below: { "code": "DeploymentFailed", "message": "At least one resource deployment operation failed. Please list deployment operations…

Azure HDInsight
asked 2021-12-12T15:41:59.85+00:00
Shamsher Ansari 1 Reputation point
commented 2021-12-13T07:25:39.723+00:00
PRADEEPCHEEKATLA-MSFT 84,531 Reputation points Microsoft Employee
1 answer

Connect HDInsight to Local Superset

Hello, I have an HDInsight cluster and Superset installed on my local machine. I want to know whether it is possible to connect Superset to my HDInsight cluster. Best Regards, Paulo

Azure HDInsight
asked 2021-11-18T11:59:43.217+00:00
Paulo Barbosa 21 Reputation points
commented 2021-11-29T18:22:44.137+00:00
KranthiPakala-MSFT 46,437 Reputation points Microsoft Employee
1 answer. One of the answers was accepted by the question author.

How to access the HDFS NameNode UI

Hello, I have an HDInsight cluster, but I can't access the HDFS NameNode UI. In Ambari, the link to the HDFS NameNode UI is: https://{CLUSTERNAME}/da/host/hn0-testlo.nmfkjwercu3efldgcwsgpmaqge.ax.internal.cloudapp.net/port/30070/ But, when I…

Azure HDInsight
asked 2021-11-18T12:08:25.937+00:00
Paulo Barbosa 21 Reputation points
accepted 2021-11-22T16:18:05.23+00:00
Paulo Barbosa 21 Reputation points
1 answer

Azure shared dashboard contents deleted

Hello all, we have a shared dashboard in all environments and we have lost all of its contents in dev and PR. The dashboard is still present; only the content is lost, and we get an empty page without any tiles or blocks. We did not…

Azure Monitor
An Azure service that is used to collect, analyze, and act on telemetry data from Azure and on-premises environments.
2,971 questions
Azure HDInsight
asked 2021-11-19T09:40:51.183+00:00
Dilan BC 1 Reputation point
commented 2021-11-19T14:20:36.347+00:00
Dilan BC 1 Reputation point
1 answer. One of the answers was accepted by the question author.

Azure HDInsight cluster - Patching & Upgrading

We have a Kafka cluster (Azure HDInsight) running on Azure. Does it need manual intervention for OS patching and updates? https://video2.skills-academy.com/en-us/azure/hdinsight/hdinsight-os-patching Is this not handled by Azure?

Azure HDInsight
asked 2021-11-12T13:56:44.403+00:00
Senthilnath TM 241 Reputation points
accepted 2021-11-18T13:08:21.39+00:00
Senthilnath TM 241 Reputation points
1 answer

Create Hive database with Jupyter Notebook

I have an HDInsight cluster and I want to create Hive databases and tables (and load data into them) using Jupyter Notebook. Can anyone explain how I can do that? Are there any example notebooks that show this?

Azure HDInsight
asked 2021-10-25T17:08:54.737+00:00
Diogo Rodrigues 1 Reputation point
commented 2021-11-02T03:27:18.38+00:00
PRADEEPCHEEKATLA-MSFT 84,531 Reputation points Microsoft Employee