2,042 questions with Azure Databricks tags

Sort by: Updated
0 answers

How do I assign managed identity to a databricks resource?

I have created a linked service from ADF to Databricks cluster, following https://techcommunity.microsoft.com/t5/azure-data-factory-blog/azure-databricks-activities-now-support-managed-identity/ba-p/1922818 I have provided contributor access to the ADF…

Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
2,042 questions
asked 2024-06-12T10:24:08.21+00:00
Alexa Kiss 0 Reputation points Microsoft Employee
commented 2024-06-18T05:45:57.9033333+00:00
PRADEEPCHEEKATLA-MSFT 84,051 Reputation points Microsoft Employee
3 answers

"Premium Automated Serverless Compute - Promo DBU" expenses arise from what, how can I disable it, and why are the costs so high?

"Premium Automated Serverless Compute - Promo DBU" expenses arise from what, how can I disable it, and why are the costs so high? detail in below

Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
2,042 questions
asked 2024-06-09T16:43:36.0033333+00:00
Pratya Thanwatthanakit 5 Reputation points
commented 2024-06-18T05:43:25.1533333+00:00
PRADEEPCHEEKATLA-MSFT 84,051 Reputation points Microsoft Employee
0 answers

Azure Databricks exercise error

Keep receiving the error "No such file or directory /your_correct_source_value/wikipedia/pagecounts/staging_parquet_en_only_clean" When I checked Wikipedia, it appears this dataset has been deprecated since 2016-08-01 Could a new dataset be…

Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
2,042 questions
asked 2024-06-05T19:59:42.4166667+00:00
Joab Odera 0 Reputation points
commented 2024-06-18T05:02:02.1666667+00:00
PRADEEPCHEEKATLA-MSFT 84,051 Reputation points Microsoft Employee
1 answer One of the answers was accepted by the question author.

Issues while writing into bad_records path in Databricks

Hello All, I would like to get your inputs with a scenario that I see while writing into the bad_records file. I am reading a ‘Ԓ’ delimited CSV file based on a schema that I have already defined. I have enabled error handling while reading the file to…

Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
2,042 questions
asked 2024-06-05T01:13:41.47+00:00
Alok Thampi 111 Reputation points
accepted 2024-06-17T15:09:38.76+00:00
Alok Thampi 111 Reputation points
1 answer One of the answers was accepted by the question author.

Best option to implement departmental secrecy in Azure databricks resource?

I have an azure databricks resource created in my Azure portal. I want to achieve departmental secracy in a single existing databricks resource. Hence, I am looking for a solution where I can add multiple workspaces to my single Databricks resource. How…

Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
2,042 questions
asked 2024-06-13T09:23:30.1066667+00:00
Apurva Tiwari 40 Reputation points
accepted 2024-06-17T05:57:41.7633333+00:00
Apurva Tiwari 40 Reputation points
1 answer

How can i connect Azure Databricks to Neo4j??

Hello, I want to connect to neo4j from Azure Databricks. What are the different approaches do I have? I am trying to connect here and i getting following error. Do I need to do anything before running the code? i mean setup managed identity or enable…

Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
2,042 questions
asked 2024-06-10T03:41:31.9866667+00:00
Siddartha Reddy Jammula 20 Reputation points
commented 2024-06-14T06:43:53.42+00:00
PRADEEPCHEEKATLA-MSFT 84,051 Reputation points Microsoft Employee
1 answer One of the answers was accepted by the question author.

org.apache.hadoop.fs.FileAlreadyExistsException: Failed to rename temp file

[Repeat Question due to old thread] We have built a streaming pipeline with spark autoloader. Source Folder is a azure blob container. We've encountered a rare issue (could not replicate it). Below is the exception…

Azure Data Lake Storage
Azure Data Lake Storage
An Azure service that provides an enterprise-wide hyper-scale repository for big data analytic workloads and is integrated with Azure Blob Storage.
1,409 questions
Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
2,042 questions
asked 2024-06-07T01:40:56.47+00:00
Hiran Amarathunga 65 Reputation points
accepted 2024-06-14T02:26:30.36+00:00
Hiran Amarathunga 65 Reputation points
1 answer

Run Databricks notebook from ADF - error to find azure module to save the data in blob storage

Hi Guys, The requirement is - Call Rest API, read the records in jsonlines format and load into table in Azure SQL server. I used Databricks to read the jsonlines from Open API using Python script. It can read and keep the data into a file in Azure blob…

Azure SQL Database
Azure Data Lake Storage
Azure Data Lake Storage
An Azure service that provides an enterprise-wide hyper-scale repository for big data analytic workloads and is integrated with Azure Blob Storage.
1,409 questions
Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
2,042 questions
asked 2024-06-12T12:39:10.24+00:00
Sarmistha Sarkar 0 Reputation points
answered 2024-06-13T05:37:40.67+00:00
phemanth 7,835 Reputation points Microsoft Vendor
2 answers One of the answers was accepted by the question author.

When creating a second external location to the same path in Azure Databricks Unity Catalog it gives conflicting error for path. Is there any way to solve this?

Hello Team, When creating a second external location/external volumes to the same path with different folder or to the root location gives an error see below for details in Azure Databricks Unity Catalog as it gives conflict error for path. Is there any…

Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
2,042 questions
asked 2024-06-04T10:23:22.8733333+00:00
Ashwini Gaikwad 110 Reputation points
commented 2024-06-12T09:37:40.13+00:00
PRADEEPCHEEKATLA-MSFT 84,051 Reputation points Microsoft Employee
1 answer

The scim API is by default adding users to admins group in azure databricks

Hi, When we are invoking scim API in azure databricks it is by default adding users to the admins group and also after deleting users from only admins group they are being created again. Also calling scim API with adding groups as users also adding them…

Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
2,042 questions
asked 2024-06-05T11:29:19.43+00:00
Gupta, Neha 0 Reputation points
commented 2024-06-12T07:15:58.77+00:00
PRADEEPCHEEKATLA-MSFT 84,051 Reputation points Microsoft Employee
1 answer One of the answers was accepted by the question author.

Connect to Blob storage from Azure Databricks SQL

So I would like to read a table from a CSV file on Azure Blob Storage in my own account, and load it into a table in Unity Catalog on databricks (hopefully using SQL). I have tried this SQL command: CREATE TABLE IF NOT EXISTS <table_name>; COPY…

Azure Blob Storage
Azure Blob Storage
An Azure service that stores unstructured data in the cloud as blobs.
2,575 questions
Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
2,042 questions
asked 2024-06-11T13:36:31.3633333+00:00
Kaizad Wadia 20 Reputation points
commented 2024-06-12T06:52:52.87+00:00
PRADEEPCHEEKATLA-MSFT 84,051 Reputation points Microsoft Employee
1 answer

What Azure service should I use to deploy a complex Python program on the cloud?

Background<br> I have developed a Python program that fetches data from three different REST APIs, processes it, and inserts it into a database. The program also queries the database to identify which values to fetch from the APIs, so there is…

Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
2,042 questions
asked 2024-06-05T13:06:56.45+00:00
kman-1604 0 Reputation points
commented 2024-06-12T05:05:24.94+00:00
PRADEEPCHEEKATLA-MSFT 84,051 Reputation points Microsoft Employee
1 answer

deploying Azure databrick with datalake

Deploying Azure Databricks creates an additional resource group in the background, which includes a data lake. Is it possible to use the data lake that I have already deployed in Azure instead of the one provisioned by Azure Databricks?

Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
2,042 questions
asked 2024-06-05T08:40:13.1866667+00:00
Sujeet 0 Reputation points
commented 2024-06-12T05:04:01.3966667+00:00
PRADEEPCHEEKATLA-MSFT 84,051 Reputation points Microsoft Employee
1 answer

Integrating Databricks notebooks in Azure ML using SDK V2

Hi all, We currently have some Azure Databricks notebooks in production which we would like to integrate in Azure ML using the v2 SDK. I found resources to integrate these notebooks using the databricks_step in the v1 SDK. The official documentation…

Azure Machine Learning
Azure Machine Learning
An Azure machine learning service for building and deploying models.
2,685 questions
Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
2,042 questions
asked 2024-06-07T13:12:55.3333333+00:00
Alexander 0 Reputation points
commented 2024-06-11T09:30:30.9966667+00:00
Amira Bedhiafi 18,501 Reputation points
1 answer One of the answers was accepted by the question author.

Databricks Dev/Prod setup

We are a data team of 4 people. To make the process easy and more productive. Can we separate dev/prod environments at Databricks catalogue level rather than the workspace level? Can anyone share any thoughts on this? Thanks

Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
2,042 questions
asked 2024-06-07T06:26:24.16+00:00
vicman 20 Reputation points
accepted 2024-06-10T23:48:22.6966667+00:00
vicman 20 Reputation points
1 answer One of the answers was accepted by the question author.

My Dev, test, prod environments are in different resource groups of same subscription. How do I create a devops pipeline in this case?a DevOps pipeline to deploy a

Hi, My dev, test and prod environments are in different resource groups of the same subscription. I am involved in a data engineering project where I will be using primarily below resources - ADLS - data storage ADF - Orchestration Azure Databricks - QC…

Azure Data Lake Storage
Azure Data Lake Storage
An Azure service that provides an enterprise-wide hyper-scale repository for big data analytic workloads and is integrated with Azure Blob Storage.
1,409 questions
Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
2,042 questions
Azure Data Factory
Azure Data Factory
An Azure service for ingesting, preparing, and transforming data at scale.
9,994 questions
asked 2024-05-14T19:23:38.4366667+00:00
Shashwat Tiwary 40 Reputation points
accepted 2024-06-10T09:53:29.01+00:00
Shashwat Tiwary 40 Reputation points
2 answers

How to ignore the records in ADF Data Flows

Hi All I am building a data transamination using mapping data flows ,I have a time stamp field Like TimeStampUpdated in the target table. I want to lockup historical data with incremental data transamination and ignore the records coming in the…

Azure Synapse Analytics
Azure Synapse Analytics
An Azure analytics service that brings together data integration, enterprise data warehousing, and big data analytics. Previously known as Azure SQL Data Warehouse.
4,613 questions
Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
2,042 questions
Azure Data Factory
Azure Data Factory
An Azure service for ingesting, preparing, and transforming data at scale.
9,994 questions
asked 2024-05-23T06:58:12.53+00:00
venkat rao 65 Reputation points
commented 2024-06-10T08:32:00.3+00:00
PRADEEPCHEEKATLA-MSFT 84,051 Reputation points Microsoft Employee
2 answers

Access issue with app registration

I've created a Databricks workspace and a new notebook, but I don't have access to the secret keys under app registration, which are disabled for me. How can I solve this issue? Warning message You do not have access Your administrator has disabled the…

Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
2,042 questions
asked 2024-06-03T18:59:22.2666667+00:00
NIKHIL C 0 Reputation points
commented 2024-06-10T06:04:29.2033333+00:00
PRADEEPCHEEKATLA-MSFT 84,051 Reputation points Microsoft Employee
2 answers

Access to C:\Data not allowed . Error Code 22853

Access to C:\Data not allowed . Error Code 22853 Any workway around this ?

Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
2,042 questions
Azure Data Factory
Azure Data Factory
An Azure service for ingesting, preparing, and transforming data at scale.
9,994 questions
Azure Data Catalog
Azure Data Catalog
An Azure service that serves as a system of registration and system of discovery for enterprise data assets.
99 questions
Windows Server PowerShell
Windows Server PowerShell
Windows Server: A family of Microsoft server operating systems that support enterprise-level management, data storage, applications, and communications.PowerShell: A family of Microsoft task automation and configuration management frameworks consisting of a command-line shell and associated scripting language.
5,444 questions
asked 2023-02-24T07:06:54.58+00:00
Sushan 0 Reputation points
commented 2024-06-08T17:03:05.74+00:00
Das, Dwaipayan 0 Reputation points
1 answer One of the answers was accepted by the question author.

How to Create Delta Table in Azure Synapse Analytics with Id Auto Increment Identity Column ?

I have created the Delta Lake Delta tables In ADLS using Synapse Notebook and in that table, I want to add an identity column (Auto increment 1,1) but I am not able to create the same, Below is my Create table script and error which i am facing. Table…

Azure Data Lake Storage
Azure Data Lake Storage
An Azure service that provides an enterprise-wide hyper-scale repository for big data analytic workloads and is integrated with Azure Blob Storage.
1,409 questions
Azure Synapse Analytics
Azure Synapse Analytics
An Azure analytics service that brings together data integration, enterprise data warehousing, and big data analytics. Previously known as Azure SQL Data Warehouse.
4,613 questions
Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
2,042 questions
asked 2024-06-04T06:11:14.4766667+00:00
Vedant Desai 651 Reputation points
accepted 2024-06-07T09:52:14.62+00:00
Vedant Desai 651 Reputation points