204 questions with Azure HDInsight tags

Sort by: Updated
1 answer

Azure HDinsight Ambari RBAC

Hello, Can someone please tell me is there any programmatic way to use Ambari in Azure HDinsight to managed users and Role based access control. To clarify, I am asking about Ambari RBAC not Azure RBAC And can we do ambari admin password reset using…

Azure HDInsight
Azure HDInsight
An Azure managed cluster service for open-source analytics.
204 questions
asked 2020-10-02T11:14:45.01+00:00
Akash Verma 21 Reputation points
commented 2020-11-13T23:54:28.65+00:00
MartinJaffer-MSFT 26,051 Reputation points
0 answers

HD Insights - converting the Aggregated Logs output into ADLSGen2 into Text format

Can someone share a reference to systematically output logs from the HDinsights application runs into a readable format -- for the stdout. Today when running a Spark application through HdInsights (triggered by Livy API), after the application finishes…

Azure Data Lake Storage
Azure Data Lake Storage
An Azure service that provides an enterprise-wide hyper-scale repository for big data analytic workloads and is integrated with Azure Blob Storage.
1,411 questions
Azure HDInsight
Azure HDInsight
An Azure managed cluster service for open-source analytics.
204 questions
asked 2020-10-22T17:07:06.273+00:00
Brendan Cassidy 26 Reputation points Microsoft Employee
commented 2020-11-11T18:23:27.957+00:00
Brendan Cassidy 26 Reputation points Microsoft Employee
0 answers

Need to know the actual size of traffic received in KB's/MB's/GB's in a day for per kafka topic in HDInsight Cluster

Hello All, I'm using HDI-3.6 Kafka cluster(0.1) in west Europe region with 26 Kafka brokers running in it on production environment. I'm trying to find out the actual size of traffic received in KB's/MB's/GB's in a day/hour for per Kafka topic in…

Azure HDInsight
Azure HDInsight
An Azure managed cluster service for open-source analytics.
204 questions
asked 2020-10-21T09:34:05.653+00:00
Sugumar Srinivasan 1 Reputation point
commented 2020-10-28T21:21:20.893+00:00
HimanshuSinha-msft 19,376 Reputation points Microsoft Employee
1 answer One of the answers was accepted by the question author.

HD Insights - Script Action stored in ADLSGen2

When trying to run Bash scripts as Script Actions on Cluster creation is there a way to use a ADLSGen2 location? From the current documentation it shows ADLGen1, and public HTTPS requests but does not discuss this possibility with Gen2. In scripts…

Azure HDInsight
Azure HDInsight
An Azure managed cluster service for open-source analytics.
204 questions
asked 2020-10-26T20:12:52.64+00:00
Brendan Cassidy 26 Reputation points Microsoft Employee
accepted 2020-10-28T16:49:12.527+00:00
Brendan Cassidy 26 Reputation points Microsoft Employee
1 answer

HDInsight azure adls gen2 'InternalServerError' ARM Template deployment

Creating Azure HDinsight Spark cluster with ADLS Gen 2,Userassigned managed idnetity with StorageBlobdataOwner role. Successfully assigned msi role to storage but getting error with HDInsight deployment(Internal server error) Theres some issue with…

Azure Data Lake Storage
Azure Data Lake Storage
An Azure service that provides an enterprise-wide hyper-scale repository for big data analytic workloads and is integrated with Azure Blob Storage.
1,411 questions
Azure HDInsight
Azure HDInsight
An Azure managed cluster service for open-source analytics.
204 questions
asked 2020-10-08T04:56:53.52+00:00
Anon4914 1 Reputation point
commented 2020-10-28T07:51:37.547+00:00
PRADEEPCHEEKATLA-MSFT 84,531 Reputation points Microsoft Employee
1 answer

ARM Template for Azure HDinsights

Can someone share any reference or ARM Template for Azure HDInsight which covers below things: Use Public-Private key pair together with a passphrase for SSH login Enterprise security package must be enabled TLS Version Encryption at rest…

Azure HDInsight
Azure HDInsight
An Azure managed cluster service for open-source analytics.
204 questions
asked 2020-10-02T11:08:56.783+00:00
Akash Verma 21 Reputation points
commented 2020-10-20T17:08:47.017+00:00
MartinJaffer-MSFT 26,051 Reputation points
1 answer One of the answers was accepted by the question author.

Azure HD Insight Kafka Broker Setting for max.incremental.fetch.session.cache.slots

Hi, I need to change the setting for Broker Config for max.incremental.fetch.session.cache.slots . But I am not able to see this as a property on Ambari Broker Config. How can I change it ?

Azure HDInsight
Azure HDInsight
An Azure managed cluster service for open-source analytics.
204 questions
asked 2020-10-12T21:53:44.247+00:00
Harjani, Ashish 201 Reputation points
accepted 2020-10-18T23:30:33.48+00:00
Harjani, Ashish 201 Reputation points
1 answer

Programmatic way to perform tasks on Azure HDinsight

Can someone please give reference to perform below tasks on Azure HDinsight via programmatic way: Enable diagnostics setting on HDinsights Cluster Enable Azure Monitor for the HDInsight cluster Enable NSG Flow Logs NSG Modifications (Rule Add,…

Azure HDInsight
Azure HDInsight
An Azure managed cluster service for open-source analytics.
204 questions
asked 2020-10-07T09:20:05.27+00:00
Akash Verma 21 Reputation points
commented 2020-10-09T11:08:02.917+00:00
PRADEEPCHEEKATLA-MSFT 84,531 Reputation points Microsoft Employee
1 answer

How we link livy server is end point correct in HDinsight

Hi All, i want to submit my spark job from intellij to use adf spark activity. instead of head node host which IP address i have to give. here i give password of my admin account username/password but it is error out i am new…

Azure HDInsight
Azure HDInsight
An Azure managed cluster service for open-source analytics.
204 questions
asked 2020-09-23T14:56:17.643+00:00
manish verma 441 Reputation points
commented 2020-10-07T13:06:32.897+00:00
PRADEEPCHEEKATLA-MSFT 84,531 Reputation points Microsoft Employee
1 answer

Error Setting Access Rights on ADL (Gen1) User Folders During HDInsight Cluster Creation

Hello All The issue occurs when the cluster is created. One of the last operations that Ambari performs are 'Post user creation hook for 1 users'. This fail with the attached errors. Here is some more information: It's an ESP cluster -- we…

Azure Data Lake Storage
Azure Data Lake Storage
An Azure service that provides an enterprise-wide hyper-scale repository for big data analytic workloads and is integrated with Azure Blob Storage.
1,411 questions
Azure HDInsight
Azure HDInsight
An Azure managed cluster service for open-source analytics.
204 questions
asked 2020-09-22T07:21:55.527+00:00
Christoph Kiefer 141 Reputation points
commented 2020-10-07T13:04:47.837+00:00
PRADEEPCHEEKATLA-MSFT 84,531 Reputation points Microsoft Employee
1 answer One of the answers was accepted by the question author.

Spark unable to write file onto Blob storage

We use HDInsight with Spark, v3.6. So far, our code has been working as expected. As of last night, our job started failing. The error states that "output directory already exists". When looking at the blob storage, directories appear to be…

Azure Blob Storage
Azure Blob Storage
An Azure service that stores unstructured data in the cloud as blobs.
2,578 questions
Azure HDInsight
Azure HDInsight
An Azure managed cluster service for open-source analytics.
204 questions
asked 2020-10-02T16:25:54.1+00:00
Sachin Shah 101 Reputation points
accepted 2020-10-07T12:05:08.527+00:00
Sachin Shah 101 Reputation points
0 answers

Is it possible to send data from WPF app to Azure analytics?

Is it possible to send data from WPF app to Azure analytics? If yes, which is the minimum Visualstudio version & .net framework version required ? which are the best analytic services offered by Azure which can used with WPF app?

Azure Analysis Services
Azure Analysis Services
An Azure service that provides an enterprise-grade analytics engine.
443 questions
Windows Presentation Foundation
Windows Presentation Foundation
A part of the .NET Framework that provides a unified programming model for building line-of-business desktop applications on Windows.
2,706 questions
Azure HDInsight
Azure HDInsight
An Azure managed cluster service for open-source analytics.
204 questions
asked 2020-10-01T09:42:53.153+00:00
Hi 1 Reputation point
commented 2020-10-07T00:46:17.98+00:00
MartinJaffer-MSFT 26,051 Reputation points
2 answers One of the answers was accepted by the question author.

How i call a Hdinsight Zeppelin notebook in Azure Data Factory

How i call a Hdinsight Zeppelin notebook in Azure Data Factory. i have a Zeppelin notebook in Hdinsight cluster , i need to call in azure data factory. what is best way to write spark code , and call in data factory.

Azure HDInsight
Azure HDInsight
An Azure managed cluster service for open-source analytics.
204 questions
asked 2020-09-21T11:31:03.803+00:00
Verma, Manish Kumar 131 Reputation points
accepted 2020-10-02T08:22:44.707+00:00
Verma, Manish Kumar 131 Reputation points
1 answer

How to add ambari infra to a hbase cluster?

I tried to add it by add service link next to it in the admin section but add service link is not responding. Is there any other way to add this service?

Azure HDInsight
Azure HDInsight
An Azure managed cluster service for open-source analytics.
204 questions
asked 2020-09-17T05:52:27.67+00:00
Ayushi Bhandari 1 Reputation point
commented 2020-09-30T12:16:45.453+00:00
PRADEEPCHEEKATLA-MSFT 84,531 Reputation points Microsoft Employee
1 answer

Apache Atlas on azure hdinsigt

How to install apache atlas on azure HDinsight?

Azure HDInsight
Azure HDInsight
An Azure managed cluster service for open-source analytics.
204 questions
asked 2020-09-16T08:24:28.39+00:00
Ayushi Bhandari 1 Reputation point
commented 2020-09-30T12:16:25.453+00:00
PRADEEPCHEEKATLA-MSFT 84,531 Reputation points Microsoft Employee
1 answer One of the answers was accepted by the question author.

Data Analytics Studio (DAS) on HDInsight Cluster

Dear All I am following the recommendations here to install the Data Analytics Studio (from Cloudera): https://video2.skills-academy.com/en-us/azure/hdinsight/interactive-query/apache-hive-migrate-workloads The custom script action is not working…

Azure HDInsight
Azure HDInsight
An Azure managed cluster service for open-source analytics.
204 questions
asked 2020-09-22T11:56:27.23+00:00
Christoph Kiefer 141 Reputation points
accepted 2020-09-30T07:11:23.807+00:00
Christoph Kiefer 141 Reputation points
1 answer One of the answers was accepted by the question author.

remainingCores is too small error. Spark and Kafka cluster on HDInsight problem

I tried to create a Kafka and Spark cluster in the HDInsight using either of templates in these links: …

Azure HDInsight
Azure HDInsight
An Azure managed cluster service for open-source analytics.
204 questions
Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
2,047 questions
asked 2020-09-27T06:57:32.923+00:00
Erfan Mahmoodnejad 41 Reputation points
commented 2020-09-29T06:31:58.623+00:00
PRADEEPCHEEKATLA-MSFT 84,531 Reputation points Microsoft Employee
1 answer One of the answers was accepted by the question author.

Create On Demand HDInsight from Data Factory in an existing VNet

The Azure Data Factory can create an on-demand HDInsight (linked service), which is automatically deleted after the job running on it finishes. Is it possible to have this on-demand HDInsight created in an existing VNet?

Azure HDInsight
Azure HDInsight
An Azure managed cluster service for open-source analytics.
204 questions
Azure Data Factory
Azure Data Factory
An Azure service for ingesting, preparing, and transforming data at scale.
10,028 questions
asked 2020-09-23T07:07:55.617+00:00
Beniamin Murza 26 Reputation points
commented 2020-09-28T12:11:44.993+00:00
HarithaMaddi-MSFT 10,136 Reputation points
1 answer

Apache Kafka as a Service with Confluent Cloud Azure

We are trying to connect to on premise server from our Apache Kafka as a Service with Confluent Cloud; with on-premise SQL Server source, but receive below error Do we require any specials setting from azure cloud to on-premise?

Azure HDInsight
Azure HDInsight
An Azure managed cluster service for open-source analytics.
204 questions
asked 2020-09-26T04:42:30.917+00:00
Dondapati, Navin 281 Reputation points
answered 2020-09-28T09:59:03.723+00:00
PRADEEPCHEEKATLA-MSFT 84,531 Reputation points Microsoft Employee
0 answers

creating conda environment on azure hdinsight spark cluster taking hours

trying to follow the code in this tutorial , https://github.com/Azure/MachineLearningNotebooks/blob/master/how-to-use-azureml/training/train-in-spark/train-in-spark.ipynb when i send the run to run to azure ml, the run essentially stalls on the step…

Azure Machine Learning
Azure Machine Learning
An Azure machine learning service for building and deploying models.
2,690 questions
Azure HDInsight
Azure HDInsight
An Azure managed cluster service for open-source analytics.
204 questions
asked 2020-09-10T14:14:09.1+00:00
kunal kapoor 1 Reputation point
commented 2020-09-15T22:43:58.83+00:00
MartinJaffer-MSFT 26,051 Reputation points