2,042 questions with Azure Databricks tags

1 answer

Databricks Spark Scala: RaiseError throws type error

Hello, I am facing an issue in Databricks 14.3-LTS within a Scala notebook. When I try to raise an Exception using Spark Catalyst with the following Scala code: import org.apache.spark.sql.types.{StringType, DateType} val errorMessage =…

Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
2,042 questions
asked 2024-06-05T08:24:16.67+00:00
bn2302 0 Reputation points
commented 2024-06-07T04:19:55.0466667+00:00
PRADEEPCHEEKATLA-MSFT 84,051 Reputation points Microsoft Employee
2 answers

I am unable to mount containers using Databricks and Storage Gen2

What is the issue?

Azure Data Lake Storage
An Azure service that provides an enterprise-wide hyper-scale repository for big data analytic workloads and is integrated with Azure Blob Storage.
1,409 questions
Azure Databricks
asked 2024-06-05T15:57:33.72+00:00
smriti das 0 Reputation points
answered 2024-06-06T12:19:54.16+00:00
Luis Arias 5,901 Reputation points
1 answer One of the answers was accepted by the question author.

Can a single instance of Microsoft Purview scan multiple Azure Databricks Unity Catalog instances that exist in different logical data domains?

I am researching a use case where a single instance of Microsoft Purview can be used to scan multiple instances of Azure Databricks Unity Catalogs hosted in multiple logical / geographical domains, including using OpenLineage to provide lineage data to…

Azure Databricks
Microsoft Purview
A Microsoft data governance service that helps manage and govern on-premises, multicloud, and software-as-a-service data. Previously known as Azure Purview.
1,025 questions
asked 2024-06-04T21:03:01.6766667+00:00
Erich Huckschlag 20 Reputation points
accepted 2024-06-06T07:43:48.44+00:00
Erich Huckschlag 20 Reputation points
1 answer

Connecting Databricks to on-prem sources

Is there a way to connect my Azure Databricks workspace to my local SQL Server database? I am trying to read data from my local SQL Server installed on my machine, but I am looking for a way to connect the two directly. I am aware we can use a SHIR with…

Azure Databricks
Windows Network
Windows: A family of Microsoft operating systems that run across personal computers, tablets, laptops, phones, internet of things devices, self-contained mixed reality headsets, large collaboration screens, and other devices. Network: A group of devices that communicate either wirelessly or via a physical connection.
696 questions
asked 2024-05-29T11:03:25.18+00:00
Abdullah Humayun 40 Reputation points
commented 2024-06-05T21:35:50.7366667+00:00
BhargavaGunnam-MSFT 28,526 Reputation points Microsoft Employee
2 answers

Enabled IP Access Control in Databricks workspace and no one can connect to the workspace

We set up an access control list with a single entry containing the wrong IP address (the outbound NAT gateway IP), and we also turned on the "Enforce IP access list on Compute Plane Requests" toggle. Now no one can access the workspace, and there is no way for me to toggle it off. Is there a…

Azure Databricks
asked 2024-06-04T19:19:54.28+00:00
Inna Mednyk 1 Reputation point
commented 2024-06-04T22:54:45.2933333+00:00
BhargavaGunnam-MSFT 28,526 Reputation points Microsoft Employee
1 answer One of the answers was accepted by the question author.

Azure Databricks workflow that runs multiple jobs sequentially using the same job cluster

Newbie here as far as Azure Databricks and workflows are concerned. TL;DR version: Is there a way to configure the same job cluster for multiple jobs that are part of the same workflow? Long version: We have an ELT process that we have broken down into…

Azure Databricks
asked 2024-06-01T14:30:01.2+00:00
Aravind Raam 20 Reputation points
accepted 2024-06-04T19:16:12.37+00:00
Aravind Raam 20 Reputation points
1 answer One of the answers was accepted by the question author.

ADF Data Flows: flattened nested JSON array values are being populated as null

Hi all, I am building a data transformation with an ADF data flow using a nested JSON array of objects, but after parsing and flattening the JSON node itOffer.item.LeadOfer.zdeal.item[].dealNumber, I am seeing that the column values are populated as null. I…

Azure Synapse Analytics
An Azure analytics service that brings together data integration, enterprise data warehousing, and big data analytics. Previously known as Azure SQL Data Warehouse.
4,613 questions
Azure Databricks
Azure Data Factory
An Azure service for ingesting, preparing, and transforming data at scale.
9,994 questions
asked 2024-05-30T11:06:56.13+00:00
venkat rao 65 Reputation points
accepted 2024-06-04T12:37:27.4766667+00:00
venkat rao 65 Reputation points
1 answer One of the answers was accepted by the question author.

Request to clarify Databricks workspace IP access controls change for existing workspaces on August 26th

Microsoft sent an email about a change in Databricks workspace IP access controls that will impact existing workspaces on August 26, 2024. Our organization uses VNet-injected Databricks and secure cluster connectivity, and we have some questions: Do we…

Azure Databricks
asked 2024-06-03T12:21:43.8266667+00:00
Said Mohammed 20 Reputation points
accepted 2024-06-04T11:17:29.78+00:00
Said Mohammed 20 Reputation points
1 answer One of the answers was accepted by the question author.

Are High Concurrency clusters deprecated or renamed in UC Databricks workspaces?

Hello team, are High Concurrency clusters deprecated? I also don't see the Custom access mode in the UC-enabled Databricks workspace UI. I went through this article https://video2.skills-academy.com/en-us/azure/databricks/archive/compute/cluster-ui-preview but I am…

Azure Databricks
asked 2024-05-28T10:02:40.75+00:00
Ashwini Gaikwad 110 Reputation points
commented 2024-06-04T09:53:40.45+00:00
PRADEEPCHEEKATLA-MSFT 84,051 Reputation points Microsoft Employee
1 answer One of the answers was accepted by the question author.

Azure Data Factory's Databricks runtime

Hello, I am trying to ingest data from SAP into a Fabric lakehouse table using a data flow and a copy activity. However, because my sink table has change data feed enabled, the pipeline throws the following error: Operation on target Data flow1 failed:…

Azure Databricks
Azure Data Factory
asked 2024-06-03T14:24:46.9933333+00:00
Osama Tarek 20 Reputation points
accepted 2024-06-04T06:26:36.36+00:00
Osama Tarek 20 Reputation points
1 answer

Can we execute a stored procedure (Azure SQL DB) through a Databricks notebook?

I already have a stored procedure that contains MERGE statement logic, and I would like to execute it from a Databricks notebook. Is there a way to do this?
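
One possible approach, sketched here under assumptions rather than taken from the thread: open a Python DB-API connection to the database from the notebook (for example with pyodbc, which would need an ODBC driver installed on the cluster) and run the procedure with the ODBC call-escape syntax. The helper names `build_call` and `run_stored_procedure` and the procedure name `dbo.usp_merge` are hypothetical, and connection setup is deliberately left out.

```python
import re

# Assumption: procedure names are plain or schema-qualified identifiers,
# e.g. "dbo.usp_merge". Anything else is rejected to avoid SQL injection.
_PROC_NAME = re.compile(r"[A-Za-z_]\w*(\.[A-Za-z_]\w*)?")


def build_call(proc_name, n_params):
    """Build an ODBC call-escape statement such as '{CALL dbo.usp_merge (?, ?)}'."""
    if not _PROC_NAME.fullmatch(proc_name):
        raise ValueError(f"unexpected procedure name: {proc_name!r}")
    if n_params == 0:
        return f"{{CALL {proc_name}}}"
    placeholders = ", ".join("?" for _ in range(n_params))
    return f"{{CALL {proc_name} ({placeholders})}}"


def run_stored_procedure(conn, proc_name, params=()):
    """Execute the procedure on a DB-API connection and commit its changes."""
    cursor = conn.cursor()
    try:
        # Parameters are bound by the driver, never formatted into the SQL text.
        cursor.execute(build_call(proc_name, len(params)), tuple(params))
        conn.commit()
    finally:
        cursor.close()
```

With a real connection this would be called as `run_stored_procedure(conn, "dbo.usp_merge", [run_date])`, where `conn` and `run_date` are placeholders for values the notebook would supply.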

Azure SQL Database
Azure Databricks
asked 2022-04-20T08:16:44.697+00:00
Anusha Rudra 6 Reputation points
commented 2024-06-04T05:25:20.0466667+00:00
Vedant Desai 651 Reputation points
2 answers

Seeking Expertise in Spark SQL CTE Recursive Queries in Azure Databricks

I'm currently diving deep into Spark SQL and its capabilities, and I'm facing an interesting challenge. I'm eager to learn how to write CTE recursive queries in Spark SQL, but after thorough research, it seems that Spark doesn't natively support…
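
A workaround often used when recursive CTEs are unavailable is to iterate by hand: start from the anchor rows, repeatedly join the newest rows against the edge table, and stop when a pass adds nothing. The sketch below shows that loop in plain Python so it runs without a cluster; in Spark each pass would be a DataFrame join followed by a union, which is an assumption about how one would port it. The function name `expand_hierarchy` and the sample `EDGES` data are hypothetical.

```python
def expand_hierarchy(edges, root):
    """Return (node, depth) rows for root and all of its descendants.

    edges is a list of (parent, child) pairs, standing in for the table
    a recursive CTE would join against itself.
    """
    seen = {root}          # every node emitted so far (guards against cycles)
    frontier = {root}      # rows produced by the previous pass
    rows = [(root, 0)]     # anchor member of the would-be recursive CTE
    depth = 0
    while frontier:
        depth += 1
        # Recursive member: children of the previous pass that are new.
        new_nodes = sorted(
            {child for parent, child in edges
             if parent in frontier and child not in seen}
        )
        rows.extend((node, depth) for node in new_nodes)
        seen.update(new_nodes)
        frontier = set(new_nodes)
    return rows


# Hypothetical sample data: a small reporting chain.
EDGES = [("ceo", "vp1"), ("ceo", "vp2"), ("vp1", "eng1"), ("eng1", "intern")]
for node, level in expand_hierarchy(EDGES, "ceo"):
    print(level, node)
```

The `seen` set is what keeps the loop finite on cyclic data, a case where a naive port of a recursive query would otherwise never terminate.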

Azure Synapse Analytics
Azure Databricks
SQL Server
A family of Microsoft relational database management and analysis systems for e-commerce, line-of-business, and data warehousing solutions.
13,184 questions
asked 2024-05-30T04:27:35.4+00:00
Anuj, Singh (Cognizant) 50 Reputation points
edited a comment 2024-06-03T16:54:22.8866667+00:00
Smaran Thoomu 12,090 Reputation points Microsoft Vendor
1 answer

Insert records from Azure Data Factory into a Delta table created in Databricks

Hi team, I'm trying to insert values into a Delta table from Data Factory. The table was created in Databricks, and in the table properties I set "delta.enableChangeDataFeed=true"; for this reason, the protocol minWriterVersion of my delta…

Azure Databricks
Azure Data Factory
asked 2022-03-02T12:08:37.73+00:00
Antonio Nuzzo 11 Reputation points
commented 2024-06-03T14:13:46.6433333+00:00
Osama Tarek 20 Reputation points
1 answer One of the answers was accepted by the question author.

Azure Databricks with Key Vault-backed secret using RBAC

Azure Key Vault launched the RBAC access model in 2021. This allows finer-grained access to a particular secret, key, or certificate. The previous model, access policies, doesn't allow this granular access. From my knowledge, Azure Key Vault…

Azure Key Vault
An Azure service that is used to manage and protect cryptographic keys and other secrets used by cloud apps and services.
1,171 questions
Azure Databricks
asked 2023-05-17T14:10:08.1633333+00:00
Chand, Anupam SBOBNG-ITA/RX 461 Reputation points
commented 2024-06-03T13:22:57.4466667+00:00
Naveen Kumar 1 Reputation point
2 answers

How to convert a block blob to an append blob inside ADLS Gen2? Is there any process or activity that will convert the blob type without changing the content?

I have a JSON file in Data Lake Storage Gen2, and its blob type is block blob. I want to convert it to an append blob using any Azure service. I need to know the process in detail.

Azure Data Lake Storage
Azure Functions
An Azure service that provides an event-driven serverless compute platform.
4,556 questions
Azure Blob Storage
An Azure service that stores unstructured data in the cloud as blobs.
2,575 questions
Azure Databricks
Azure Data Factory
asked 2024-05-31T06:30:48.3333333+00:00
Chakraborty, Nabarun (External) 0 Reputation points
answered 2024-06-03T12:09:39.6833333+00:00
Nehruji R 4,126 Reputation points Microsoft Vendor
1 answer

The public IP address range for the Azure Databricks control plane will be updated - Is there anything else I should do?

Because I received this message regarding the update: The public IP address range for the Azure Databricks control plane will be updated on 30 May 2024—you may need to take action You're receiving this email because you use Azure Databricks. I wonder if…

Azure Databricks
asked 2024-05-29T08:23:58.0766667+00:00
Zakrzewska, Agata 0 Reputation points
commented 2024-06-03T06:52:15.9466667+00:00
PRADEEPCHEEKATLA-MSFT 84,051 Reputation points Microsoft Employee
1 answer

How to ignore records by applying an audit field column condition using ADF Data Flows

Hi all, I am building a data transformation using mapping data flows. I have a timestamp field such as TimeStampUpdated in the target table. I want to look up historical data with the incremental data transformation and ignore the records coming in the…

Azure Synapse Analytics
Azure Databricks
Azure Data Factory
asked 2024-05-30T09:28:46.6066667+00:00
venkat rao 65 Reputation points
commented 2024-06-03T05:46:23.9533333+00:00
Smaran Thoomu 12,090 Reputation points Microsoft Vendor
1 answer

Unable to create a cluster in Databricks: getting a quota exceeded error even though everything is new

Azure Quota Exceeded Exception: Error code: QuotaExceeded, error message: Operation could not be completed as it results in exceeding approved standardDADSv5Family Cores quota. Additional details - Deployment Model: Resource Manager, Location:…

Azure Databricks
Azure Data Factory
asked 2024-05-30T21:08:45.6433333+00:00
Sibbinath, Akhil 0 Reputation points
commented 2024-06-03T05:43:37.29+00:00
Smaran Thoomu 12,090 Reputation points Microsoft Vendor
2 answers

Azure to AWS

Hello, we need to transfer files in batches from ADLS to AWS (S3 bucket) for a SAS application hosted by a third party. We need to ensure data security and follow best practices. My understanding is that we can use ADF to create a linked service for AWS S3 but IT DOES…

Azure Data Lake Storage
Azure Databricks
Azure Data Factory
asked 2024-05-20T08:39:35.8366667+00:00
Sourav 80 Reputation points
edited an answer 2024-06-01T10:27:52.2833333+00:00
Sumarigo-MSFT 44,891 Reputation points Microsoft Employee
2 answers One of the answers was accepted by the question author.

Understanding the costs of an Azure Databricks job

I have an Azure Data Factory pipeline that executes multiple Databricks Notebooks using job clusters. I need to track the cost of these job clusters, including both the Databricks and the underlying VM costs, specifically for this set of jobs. Currently,…

Azure Databricks
Azure Data Factory
asked 2024-05-31T07:58:49.75+00:00
WeirdMan 220 Reputation points
answered 2024-05-31T08:55:07.06+00:00
Vinodh247-1375 12,506 Reputation points