2,047 questions with Azure Databricks tags

4 answers One of the answers was accepted by the question author.

What is the difference between Databricks prepay and Databricks reservation in Azure?

Hello, We are considering ways to reduce Databricks cost in Azure other than buying RIs for the VMs behind Databricks clusters. What is the difference between Databricks prepay and Databricks reservation in Azure? It seems Databricks reservation is named as…

Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
2,047 questions
asked 2024-05-23T09:26:44.82+00:00
Anil Kumar 325 Reputation points
accepted 2024-05-27T05:32:31.28+00:00
Anil Kumar 325 Reputation points
1 answer

Cluster Start-up Delayed. Please wait while we continue to try and start the cluster. No action is required from you. (cluster-id 0524-002352-kk357210-v2n)

Hello good people, I am getting this error: "Cluster Start-up Delayed. Please wait while we continue to try and start the cluster. No action is required from you. (cluster-id 0524-002352-kk357210-v2n)" Please help. Thank you so much.

Azure Databricks
asked 2024-05-24T00:53:43.1466667+00:00
Asma Khalid 0 Reputation points
commented 2024-05-27T04:08:21.1966667+00:00
PRADEEPCHEEKATLA-MSFT 84,531 Reputation points Microsoft Employee
1 answer

I don't see the Data tab in my 14-day trial for Azure Databricks.

Azure Databricks
asked 2024-03-13T01:54:21.8533333+00:00
Venkata Subba Reddy Bovilla 5 Reputation points
edited a comment 2024-05-26T22:14:16.96+00:00
Kulkarni, Gargi Renukadas 0 Reputation points
1 answer

How to use a different version of a Spark Java library dependency (antlr4) in a Databricks notebook?

Hello. I need to use a custom-made Java library in a Databricks notebook. The library depends on Drools v8.40.1.Final, which in turn depends on ANTLR4 v4.10.1. When I try to invoke a method in my Java library, I get the following error: "ANTLR Tool version 4.10.1…

Azure Databricks
asked 2024-01-22T22:32:40.6433333+00:00
Martin Medina 5 Reputation points
commented 2024-05-24T16:55:41.88+00:00
Carlos Irazabal 0 Reputation points
0 answers

How to parse a nested JSON array of documents in an ADF data flow

Hi all, I am trying to fetch the values from a nested JSON array of documents. I have used an aggregate to convert it into objects, but I am not able to fetch the values of all child nodes, such as the following: itOffer.item itOffer.item.SplOfr itOffer.item.buy …

Azure Synapse Analytics
An Azure analytics service that brings together data integration, enterprise data warehousing, and big data analytics. Previously known as Azure SQL Data Warehouse.
4,630 questions
Azure Databricks
Azure Data Factory
An Azure service for ingesting, preparing, and transforming data at scale.
10,028 questions
asked 2024-05-02T15:44:57.8633333+00:00
venkat rao 65 Reputation points
commented 2024-05-24T09:38:47.0233333+00:00
phemanth 8,165 Reputation points Microsoft Vendor
1 answer

Azure Databricks workflow job failure

We have a streaming workflow job that runs 24x7 and loads data into a Delta table, say raw.deltaTableA. The problem is that when we try to optimize this Delta table (optimize raw.deltaTableA) while it is being loaded, we get frequent…

Azure Databricks
asked 2024-05-02T08:23:36.4266667+00:00
NIKHIL KUMAR 101 Reputation points
commented 2024-05-24T04:40:08.6533333+00:00
PRADEEPCHEEKATLA-MSFT 84,531 Reputation points Microsoft Employee
1 answer

CSV to XML conversion in Databricks when the CSV has some blank values

I am converting CSV data to XML, and the CSV data has blank values in a few columns. For example, there are 4 columns in the CSV and, for one row (record), one column value is blank, so in the XML output I am getting a missing…

Azure Databricks
Azure Data Factory
asked 2024-05-16T08:34:02.7366667+00:00
Manoj 0 Reputation points
commented 2024-05-23T16:49:27.77+00:00
ShaikMaheer-MSFT 38,321 Reputation points Microsoft Employee
0 answers

ADF | ADB Activity Execution Time on Job Clusters

Has anyone noticed ADB notebooks running (on job clusters) faster in ADF? We have sequential notebook activities and are seeing cluster start-up times as low as 2 minutes.

Azure Databricks
Azure Data Factory
asked 2024-05-21T13:11:38.6066667+00:00
Lokesh 211 Reputation points
commented 2024-05-23T16:42:52+00:00
ShaikMaheer-MSFT 38,321 Reputation points Microsoft Employee
1 answer

How to disable autoscaling local storage

I'm configuring the cluster with the 'enable_elastic_disk' parameter set to 'false' using tfvars, e.g. enable_elastic_disk = false. However, the setting on the cluster in Databricks remains true. What should I do?

Azure Databricks
asked 2023-02-24T12:13:47.3066667+00:00
Coimbra, Diego(GLOBAL-V) 0 Reputation points
commented 2024-05-23T15:59:09.1033333+00:00
Anthony Roberts (US) 0 Reputation points
1 answer One of the answers was accepted by the question author.

Connecting Azure Databricks workspace to on-premises network - peering

I was following this tutorial to deploy a workspace for on-premises database access. I created the VNET for Databricks as described, as well as the transit VNET. However, when I got to the step to peer the two VNETs, the VNET peering option seems to be…

Azure Databricks
asked 2024-05-22T14:48:27.7233333+00:00
Abdullah Humayun 40 Reputation points
accepted 2024-05-23T13:00:28.0566667+00:00
Abdullah Humayun 40 Reputation points
1 answer

[Databricks] Clusters are failing to launch. Cluster launch will be retried.

Hi all, I am a complete newbie on Azure Databricks. I have encountered the issue below, which I think is stopping me from running queries. Any help will be much appreciated. Thanks. Billy. Clusters are failing to launch. Cluster launch will be…

Azure Databricks
asked 2024-05-08T22:05:05.41+00:00
Billy Cheng 0 Reputation points
commented 2024-05-23T08:47:03.85+00:00
PRADEEPCHEEKATLA-MSFT 84,531 Reputation points Microsoft Employee
1 answer One of the answers was accepted by the question author.

How to access the Databricks managed resource group to rotate the access keys for a storage account under the managed resource group?

We want to rotate the access keys for a storage account under a Databricks managed RG. However, we keep getting the error message below: "the access is denied because of the deny assignment with name System deny assignment created by Azure…

Azure Databricks
asked 2024-05-22T06:01:54.29+00:00
Sarish Tabish Sayyed 40 Reputation points Microsoft Vendor
edited the question 2024-05-22T06:49:55.13+00:00
PRADEEPCHEEKATLA-MSFT 84,531 Reputation points Microsoft Employee
2 answers One of the answers was accepted by the question author.

Azure Databricks fails to install GeoSpark libraries from Maven

Hi Team, I am attempting to add the two GeoSpark Maven libraries below to my Azure Databricks interactive cluster with Runtime Version 14.3 LTS. However, I am getting the error below: Library installation attempted on the driver node of cluster…

Azure Databricks
asked 2024-04-15T06:24:17.8033333+00:00
Anuj, Singh (Cognizant) 50 Reputation points
accepted 2024-05-22T05:08:38.27+00:00
Anuj, Singh (Cognizant) 50 Reputation points
1 answer

Results too large error

Hi, We have a Databricks table whose underlying data is in ADLS Gen2. The table has a column named "data" (StringType) that holds very large JSON values. When we try to select rows from the table, it throws the error "Results…

Azure Databricks
asked 2024-05-21T12:31:02.8266667+00:00
Mandal, Sujalkumar (Cognizant) 0 Reputation points
answered 2024-05-22T04:53:31.9733333+00:00
PRADEEPCHEEKATLA-MSFT 84,531 Reputation points Microsoft Employee
2 answers

An error occurred when using ODBC and managed identity to connect to Azure Databricks in .NET Core

Environment description: The app service has already been configured with a managed identity, and the relevant permissions for this managed identity have been configured in Azure Databricks. When the app service uses code to access Azure Databricks, the…

Azure Databricks
asked 2024-04-23T05:55:31.6166667+00:00
answered 2024-05-22T04:47:17.49+00:00
PRADEEPCHEEKATLA-MSFT 84,531 Reputation points Microsoft Employee
1 answer One of the answers was accepted by the question author.

Do you know how to install the 'ODBC Driver 17 for PostgreSQL' on an Azure Databricks cluster?

I am attempting to run PostgreSQL stored procedures through an Azure Databricks notebook. We have stored procedures written in Azure Database for PostgreSQL and we want to run them through an Azure Databricks notebook (using…

Azure Synapse Analytics
Azure Databricks
Azure Database for PostgreSQL
asked 2024-04-30T11:06:56.06+00:00
Anuj, Singh (Cognizant) 50 Reputation points
commented 2024-05-22T04:46:59.1066667+00:00
Anuj, Singh (Cognizant) 50 Reputation points
0 answers

py4j.security.Py4JSecurityException

Hello, I am trying to run a Spark XGBoostRegression model on a Databricks cluster with Databricks Runtime 14.3 LTS. I am getting the following error: Py4JError: An error occurred while calling o547.resourceProfileManager. Trace:…

Azure Databricks
asked 2024-05-06T12:48:28.3166667+00:00
Ahuja, Rachit 0 Reputation points
commented 2024-05-21T21:04:06.7066667+00:00
BhargavaGunnam-MSFT 28,766 Reputation points Microsoft Employee
1 answer One of the answers was accepted by the question author.

How do I use the Script activity in ADF so that it uses an Azure Databricks SQL warehouse?

I want to be able to use ADF Script activity to execute SQL statements on the Azure Databricks SQL warehouses (including the serverless kind). https://video2.skills-academy.com/en-us/azure/data-factory/transform-data-using-script Azure Databricks SQL…

Azure Databricks
Azure Data Factory
asked 2024-05-01T12:21:53.01+00:00
Krzysztof Przysowa 20 Reputation points
commented 2024-05-21T17:50:46.7966667+00:00
BhargavaGunnam-MSFT 28,766 Reputation points Microsoft Employee
1 answer One of the answers was accepted by the question author.

Wrong answer in DP-203 test exam question

Hello, I have test exam questions for DP-203, and I have a question about the following one. After I answer, it shows that the answer is wrong and gives a link to the documentation …

Azure Synapse Analytics
Azure Databricks
asked 2024-05-21T11:51:02.05+00:00
Darjuš Vasiukevič 20 Reputation points
accepted 2024-05-21T13:44:15.2666667+00:00
Darjuš Vasiukevič 20 Reputation points
2 answers

How to configure ADF pipeline run, linked service, so it uses Databricks serverless compute

Databricks has recently announced serverless compute for workflows: https://video2.skills-academy.com/en-us/azure/databricks/workflows/jobs/run-serverless-jobs I would like to be able to execute Azure Data Factory (ADF) jobs using this…

Azure Databricks
Azure Data Factory
asked 2024-05-01T12:12:06.9033333+00:00
Krzysztof Przysowa 20 Reputation points
commented 2024-05-21T04:35:19.3033333+00:00
PRADEEPCHEEKATLA-MSFT 84,531 Reputation points Microsoft Employee