204 questions with Azure HDInsight tags

Sort by: Updated
1 answer

How will Azure Application Insight help to optimize my application

Hello ! I would like to know how will enabling Azure application insight help me to analyze the performance issues in my application . WIll enabling this slow down my application / is there anything we need to check before this is enabled. Thanks

Azure HDInsight
Azure HDInsight
An Azure managed cluster service for open-source analytics.
204 questions
asked 2023-10-12T09:36:42.2133333+00:00
kavipriya balaji 0 Reputation points
answered 2023-10-12T18:56:46.1966667+00:00
QuantumCache 20,186 Reputation points
1 answer

How to read and write data to hbase hdinsights with databricks

I'm doing a proof of concept to compare which tool best suits our company, Cosmos DB or A Hbase cluster on HDInsights: I'm trying to read and write data using Databricks on HDinsights HBase. I tried to use the SHC lib but it is only available for version…

Azure HDInsight
Azure HDInsight
An Azure managed cluster service for open-source analytics.
204 questions
Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
2,045 questions
asked 2023-09-21T14:38:31.71+00:00
Nobuyoshi Ishizuka 0 Reputation points
commented 2023-09-25T19:28:50.6966667+00:00
BhargavaGunnam-MSFT 28,606 Reputation points Microsoft Employee
1 answer One of the answers was accepted by the question author.

Azure HD Insights Spot instances.

How to use the Azure HDInsights Spot instances. What is the max limit here?

Azure HDInsight
Azure HDInsight
An Azure managed cluster service for open-source analytics.
204 questions
asked 2023-09-11T17:50:23.2666667+00:00
HariKrishna-7067 120 Reputation points
accepted 2023-09-12T17:01:28.91+00:00
HariKrishna-7067 120 Reputation points
2 answers One of the answers was accepted by the question author.

Suggestions for Optimizing Data Processing Speed and Migrating to Azure

Problem Description: We are currently facing a data processing challenge: our data processing workflow runs Java programs on a single machine, and data is stored in a DB server with a total dataset of over a hundred million records. Each run of our data…

Azure Migrate
Azure Migrate
A central hub of Azure cloud migration services and tools to discover, assess, and migrate workloads to the cloud.
744 questions
Azure Synapse Analytics
Azure Synapse Analytics
An Azure analytics service that brings together data integration, enterprise data warehousing, and big data analytics. Previously known as Azure SQL Data Warehouse.
4,621 questions
Azure HDInsight
Azure HDInsight
An Azure managed cluster service for open-source analytics.
204 questions
Azure Data Factory
Azure Data Factory
An Azure service for ingesting, preparing, and transforming data at scale.
10,015 questions
asked 2023-09-02T03:25:33.9366667+00:00
石德平 40 Reputation points
accepted 2023-09-02T05:53:37.9966667+00:00
石德平 40 Reputation points
1 answer

Remotly connect to AzureHDInsight 5.0 spark cluster

Hi, I have a local jupyter notebook and Im trying to connect to an AzureHDInsight 5.0 spark cluster. I found tutorials on how to create a jupyter notebook directly inside the Azure Cloud Interface of the cluster, but what if I want to connect remotly? …

Azure HDInsight
Azure HDInsight
An Azure managed cluster service for open-source analytics.
204 questions
asked 2023-08-29T12:42:49.91+00:00
Dario Bertolino 0 Reputation points
commented 2023-09-01T21:25:39.9933333+00:00
KranthiPakala-MSFT 46,437 Reputation points Microsoft Employee
1 answer

HDinsight 4.0 to 5.0

Differences between HDinsight Componant functionality of 4.0 and 5.0 major change , minor change Does the Ambari page affect usability?

Azure HDInsight
Azure HDInsight
An Azure managed cluster service for open-source analytics.
204 questions
asked 2023-08-18T10:20:04.28+00:00
PANUWAT KLAHAN 0 Reputation points
commented 2023-08-30T10:53:36.19+00:00
ShaikMaheer-MSFT 38,321 Reputation points Microsoft Employee
1 answer

How is headnode failure handled in hdinsight ?

Hi, As HDInsight have fixed number of head nodes i.e 2. I'm curious how head node failures are handled in HDinsight ? Does failed headnode gets removed and a new headnode gets added in cluster ? failed headnode remains in cluster ? Thanks, Akshit Mehta

Azure HDInsight
Azure HDInsight
An Azure managed cluster service for open-source analytics.
204 questions
asked 2023-08-11T06:20:42.81+00:00
31610895 20 Reputation points
commented 2023-08-29T05:47:47.12+00:00
QuantumCache 20,186 Reputation points
1 answer One of the answers was accepted by the question author.

Does HDInsight manage zookeeper node ? How are failures handled in zookeeper nodes ?

Hi, First question - As HDInsight have fixed number of zookeeper nodes i.e 3. I'm curious how zookeeper node failures are handled in HDinsight ? Does failed zookeeper node gets removed and a new zookeeper node gets added in cluster ? failed…

Azure HDInsight
Azure HDInsight
An Azure managed cluster service for open-source analytics.
204 questions
asked 2023-08-16T10:08:51.13+00:00
31610895 20 Reputation points
commented 2023-08-22T06:52:45.5433333+00:00
PRADEEPCHEEKATLA-MSFT 84,381 Reputation points Microsoft Employee
2 answers

How to use spot instances with HDInsight

Hi, Is there any way to use Spot VMs in HDInsight ? How to determine which VMs are spot VMs in this list - https://azure.microsoft.com/en-in/pricing/details/virtual-machines/linux/ ? Is there any max limit on number of Spot VMs that can be…

Azure HDInsight
Azure HDInsight
An Azure managed cluster service for open-source analytics.
204 questions
asked 2023-08-16T10:26:35.41+00:00
31610895 20 Reputation points
commented 2023-08-17T06:34:57.5766667+00:00
shlim@zenithn.com 1 Reputation point
2 answers

Upgrading Storage account to GPV2

Hi , I am having storage account in which I am getting an recommendation from upgrading it to GPV2 ie General Purpose V2 . But there is a dependency like it is connected with HD insight Cluster. So is there any issue like where if I upgrade the storage…

Azure Storage Accounts
Azure Storage Accounts
Globally unique resources that provide access to data management services and serve as the parent namespace for the services.
2,871 questions
Azure HDInsight
Azure HDInsight
An Azure managed cluster service for open-source analytics.
204 questions
asked 2023-07-13T12:33:11.81+00:00
Sumedh Patil 20 Reputation points
commented 2023-08-02T14:42:12.8833333+00:00
Sumarigo-MSFT 44,906 Reputation points Microsoft Employee
1 answer One of the answers was accepted by the question author.

HDInsight data governance/lineage tool or framework

Hi folks, We are implementing a bigdata solution using HDInsight (Hive Interactive Query and Spark with an azure SQL db for hive metastore), is a requirement from the client to provide a data governance, linage, data masking solution. Based on what I've…

Azure HDInsight
Azure HDInsight
An Azure managed cluster service for open-source analytics.
204 questions
asked 2023-07-18T20:41:31.2933333+00:00
Federico Sardo 91 Reputation points
accepted 2023-07-25T12:50:42.55+00:00
Federico Sardo 91 Reputation points
1 answer One of the answers was accepted by the question author.

unable to access index of repository mran.microsoft.com/

getting below error while installing r packages using mran.microsoft.com repo. this was working fine on June 2nd 2023. please help. #27 130.6 > install.packages("devtools", dependencies = TRUE) #27 130.6 Installing package into…

Azure HDInsight
Azure HDInsight
An Azure managed cluster service for open-source analytics.
204 questions
Azure R Server for HDInsight
Azure R Server for HDInsight
An Azure service that provides predictive analytics, machine learning, and statistical modeling for big data.
13 questions
asked 2023-07-11T10:31:58.0233333+00:00
Addepalli, Pavan 20 Reputation points
edited the question 2023-07-18T05:28:27.6933333+00:00
PRADEEPCHEEKATLA-MSFT 84,381 Reputation points Microsoft Employee
2 answers

trino vm access to hdi hive metastore - nsg's wide open, destination host unreachable

Within the same resource group, created trino instance, and HDI cluster. Different Vnets of course. Set wide open nsg rules (inbound/outbound, all ports, all protocols). Trino node cannot connect to hdi presto metacatalog service (connection timeout in…

Azure HDInsight
Azure HDInsight
An Azure managed cluster service for open-source analytics.
204 questions
asked 2023-05-17T17:48:49.35+00:00
DR 0 Reputation points
commented 2023-05-28T14:28:08+00:00
PRADEEPCHEEKATLA-MSFT 84,381 Reputation points Microsoft Employee
1 answer

How to set static ip for head nodes in HDInsight Cluster?

Hi, I am working with HDInsight cluster, the type of the cluster is Hadoop. I am using ARM templates to create and destroy the cluster every day (8am to 6pm). Regarding networking, I am using an azure virtual network with a subnet for hdinsight. We are…

Azure HDInsight
Azure HDInsight
An Azure managed cluster service for open-source analytics.
204 questions
asked 2023-05-08T20:29:37.1033333+00:00
Federico Sardo 91 Reputation points
commented 2023-05-23T12:10:32.36+00:00
ShaikMaheer-MSFT 38,321 Reputation points Microsoft Employee
2 answers

Azure HDInsights SQL integration - lack of Managed Identity authentication

Hello, while creating HDInsight cluster with Spark, I found out that SQL database for metadata cannot be connected using managed identity - can You improve that? Note. There is a need to use managed identity for Storage Account Access (or Lake).

Azure SQL Database
Azure HDInsight
Azure HDInsight
An Azure managed cluster service for open-source analytics.
204 questions
Microsoft Entra ID
Microsoft Entra ID
A Microsoft Entra identity service that provides identity management and access control capabilities. Replaces Azure Active Directory.
20,357 questions
asked 2023-05-16T09:48:07.7133333+00:00
Krzysztof Świdrak 166 Reputation points
answered 2023-05-16T09:55:42.8266667+00:00
Sairam Yeturi 75 Reputation points Microsoft Employee
1 answer One of the answers was accepted by the question author.

Can we upgrade commons-io in HDInsight

Hello Team, We're currently running HDInsight 5.0 for the spark runtime. We're facing the issue wrt to the common-io lib which is available as the part of HDInsight 5.0 since it has lower version of it which 2.5v. In our project, we're dependent on a lib…

Azure HDInsight
Azure HDInsight
An Azure managed cluster service for open-source analytics.
204 questions
asked 2023-05-02T16:08:39.0766667+00:00
Sharath 20 Reputation points
commented 2023-05-02T18:22:38.36+00:00
Sharath 20 Reputation points
1 answer

Where is hue service and url in Hdinsight?

Hi, I created a hdinsight (hadoop) cluster and I used the script action to install hue. The installation was Succeeded: But I was unable to find hue service or url in ambari Could you please help me? Regards Fede

Azure HDInsight
Azure HDInsight
An Azure managed cluster service for open-source analytics.
204 questions
asked 2023-03-13T20:14:16.37+00:00
Federico Sardo 91 Reputation points
edited a comment 2023-04-25T04:51:32.5566667+00:00
PRADEEPCHEEKATLA-MSFT 84,381 Reputation points Microsoft Employee
2 answers

How to setup custom database for ambari in HDInsight?

Hi, I am creating a hadoop cluster in HDInsight. For ambari I created an Azure SQL Database and during the creation I selected not to use existing data. But when I want to create the cluster and try to select the database, I am facing this warning: I…

Azure HDInsight
Azure HDInsight
An Azure managed cluster service for open-source analytics.
204 questions
asked 2023-03-27T19:50:48.9866667+00:00
Federico Sardo 91 Reputation points
commented 2023-04-10T06:44:25.6333333+00:00
PRADEEPCHEEKATLA-MSFT 84,381 Reputation points Microsoft Employee
1 answer

Big data access

Hi, i've created a student account because I'm studying for a master, but that type of account doesn't work for big data. What do I need to access Hadoop and apache spark on Azure, and what are the costs? thanks joao

Azure HDInsight
Azure HDInsight
An Azure managed cluster service for open-source analytics.
204 questions
asked 2023-03-26T10:39:06.7133333+00:00
Joao Ribeiro 20 Reputation points
commented 2023-03-29T07:57:04.3266667+00:00
KranthiPakala-MSFT 46,437 Reputation points Microsoft Employee
3 answers One of the answers was accepted by the question author.

how to access hadoop and Apache spark in azure

Hi, I've created my azure student account, but can't access HDinsight to use hadoop and Apachec spark for my master. How to i do? cheers joao acount (sba22203@student.cct.ie)

Azure HDInsight
Azure HDInsight
An Azure managed cluster service for open-source analytics.
204 questions
asked 2023-03-21T09:56:05.49+00:00
Joao Ribeiro 20 Reputation points
answered 2023-03-22T08:54:53.7633333+00:00
彬 陈 0 Reputation points