2,042 questions with Azure Databricks tags

Sort by: Updated
0 answers

While running SQL query in Azure Databricks workspace i.e. on SQL warehouse as well as on UC enabled shared cluster facing an SSL handshake error

Hello Team, We have UC enabled Azure databricks workspace, also the Public access and delta sharing is disabled on our workspace. So while running the below SQL query on SQL Warehouse as well as on UC enabled shared cluster, I am receiving an…

Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
2,042 questions
Microsoft Purview
Microsoft Purview
A Microsoft data governance service that helps manage and govern on-premises, multicloud, and software-as-a-service data. Previously known as Azure Purview.
1,025 questions
asked 2024-06-28T10:06:18.7566667+00:00
Ashwini Gaikwad 110 Reputation points
commented 2024-07-03T08:41:38.42+00:00
Ashwini Gaikwad 110 Reputation points
1 answer

Can we connect ADX to Databricks, without using App Registration.?

I am trying to Access ADX data into Databricks, but failing to configure connection between databricks and ADX. I don't have permissions to use App registration. Is there any way to do with managed identity or any other alternative.

Azure Data Explorer
Azure Data Explorer
An Azure data analytics service for real-time analysis on large volumes of data streaming from sources including applications, websites, and internet of things devices.
501 questions
Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
2,042 questions
Microsoft Entra ID
Microsoft Entra ID
A Microsoft Entra identity service that provides identity management and access control capabilities. Replaces Azure Active Directory.
20,317 questions
asked 2024-06-25T12:25:47.8766667+00:00
Ram Ahuja, Roma 0 Reputation points
commented 2024-07-03T08:28:13.28+00:00
PRADEEPCHEEKATLA-MSFT 84,051 Reputation points Microsoft Employee
3 answers

How to reduce unnecessary high memory usage in a Databricks cluster?

We are having unnecessary high memory usage even when nothing is running on the cluster. When the cluster first starts, it's fine, but when I run a script and it finishes executing, nothing gets back to the idle (initial) state (even hours after nothing…

Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
2,042 questions
Azure Startups
Azure Startups
Azure: A cloud computing platform and infrastructure for building, deploying and managing applications and services through a worldwide network of Microsoft-managed datacenters.Startups: Companies that are in their initial stages of business and typically developing a business model and seeking financing.
236 questions
asked 2024-05-08T08:58:46.4433333+00:00
Senad Hadzikic 20 Reputation points
edited the question 2024-07-03T04:18:19.4633333+00:00
Ryan Hill 26,866 Reputation points Microsoft Employee
0 answers

Guidance on how to use Service Principal with Certificate to Authorize for EventHub Stream Read

I found this documentation https://github.com/Azure/azure-event-hubs-spark/blob/master/docs/use-aad-authentication-to-connect-eventhubs.md online on how to use service principal with certificate to use spark stream read from EventHubs, I want to do this…

Azure Event Hubs
Azure Event Hubs
An Azure real-time data ingestion service.
586 questions
Azure Synapse Analytics
Azure Synapse Analytics
An Azure analytics service that brings together data integration, enterprise data warehousing, and big data analytics. Previously known as Azure SQL Data Warehouse.
4,613 questions
Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
2,042 questions
Microsoft Entra ID
Microsoft Entra ID
A Microsoft Entra identity service that provides identity management and access control capabilities. Replaces Azure Active Directory.
20,317 questions
asked 2024-07-01T21:27:20.97+00:00
BEPV 0 Reputation points
edited a comment 2024-07-02T20:49:46.0566667+00:00
Sina Salam 6,501 Reputation points
1 answer One of the answers was accepted by the question author.

Set cloudFiles.maxFileAge and cloudFiles.backfillInterval values in Autoloader

I'm using following in the autoloader options. .option("cloudFiles.maxFileAge", "90 days")\ .option("cloudFiles.backfillInterval", "1 day")\ Our data retention policy is 7 years. Shall I use maxFileAge as 7 years…

Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
2,042 questions
asked 2024-07-02T02:12:40.21+00:00
Hiran Amarathunga 65 Reputation points
accepted 2024-07-02T07:43:44.5933333+00:00
Hiran Amarathunga 65 Reputation points
3 answers

How to specify a custom catalog name for Azure Databricks Delta Lake Dataset in ADF

Hello, I am creating an Azure Databricks Delta Lake Dataset in ADF and I am only able to choose the database name that links to Databricks's hive_metastore. How can I specify a custom catalog name that I created in Databricks instead of…

Azure Data Lake Storage
Azure Data Lake Storage
An Azure service that provides an enterprise-wide hyper-scale repository for big data analytic workloads and is integrated with Azure Blob Storage.
1,409 questions
Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
2,042 questions
Azure Data Factory
Azure Data Factory
An Azure service for ingesting, preparing, and transforming data at scale.
9,994 questions
asked 2024-01-04T06:25:53.11+00:00
Tom Young 0 Reputation points
commented 2024-07-02T04:23:57.05+00:00
Evgeniy Perfilyev 0 Reputation points
1 answer One of the answers was accepted by the question author.

Serverless warehouse suddenly stops to start up.

Hey All. From today, suddenly we are getting below error while starting a serverless warehouse. Details for the latest failure: Error: Cluster launch timeout. Type: SERVICE_FAULT Code: K8S_DBR_CLUSTER_LAUNCH_TIMEOUT Warehouse details: Type:…

Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
2,042 questions
asked 2024-06-25T12:48:53.3833333+00:00
Bhargav Shir 20 Reputation points
commented 2024-07-02T04:16:33.1166667+00:00
PRADEEPCHEEKATLA-MSFT 84,051 Reputation points Microsoft Employee
1 answer

Can we run delta live tables with free tier azure account?

While running the delta live tables pipeline in azure databricks I'm getting an error saying: QuotaExceeded, error message: Operation could not be completed as it results in exceeding approved Total Regional Cores quota. Additional details - Deployment…

Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
2,042 questions
asked 2024-07-01T10:14:34.0733333+00:00
Kartik B 0 Reputation points
answered 2024-07-01T11:17:49.9766667+00:00
PRADEEPCHEEKATLA-MSFT 84,051 Reputation points Microsoft Employee
1 answer

How to solve Invalid SessionHandle error with Azure Databricks ?

I am applying a SQLDatabaseChain Chatbot model by using LangChain SQLDatabaseChain and GPT4. I first created this model on Azure Databricks notebook like this : import json import os import langchain import mlflow from mlflow.models import…

Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
2,042 questions
asked 2024-06-17T14:38:14.4333333+00:00
Feng YIJUN 0 Reputation points
edited an answer 2024-07-01T07:42:48.6866667+00:00
Feng YIJUN 0 Reputation points
1 answer

How do I share all of my databricks notebooks with all databricks users?

Hi all, I know that I've done this in the distant past, but we have a new instance of Databricks and I need to do a global setting to share all of my notebooks with all Databricks users (read only). That way I don't need to remember to share individual…

Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
2,042 questions
asked 2024-06-27T13:57:35.7666667+00:00
Barb K 0 Reputation points
edited a comment 2024-07-01T05:28:58.0633333+00:00
PRADEEPCHEEKATLA-MSFT 84,051 Reputation points Microsoft Employee
1 answer

while creating cluster in databricks i am getting following error Azure Quota Exceeded Exception: Error code: SkuNotAvailable, error message: The requested VM size for resource 'Following SKUs have failed for Capacity Restrictions: Standard_DS3_v2' is cur

while creating cluster in databricks i am getting following error Azure Quota Exceeded Exception: Error code: SkuNotAvailable, error message: The requested VM size for resource 'Following SKUs have failed for Capacity Restrictions: Standard_DS3_v2' is…

Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
2,042 questions
asked 2024-06-28T12:09:26.57+00:00
azurecloud 0 Reputation points
edited an answer 2024-07-01T04:37:52.52+00:00
PRADEEPCHEEKATLA-MSFT 84,051 Reputation points Microsoft Employee
1 answer

array in databricks

Hi , I got the 1st row in sorted order how can i get the view on row1,row2,row3 using array and how the reshuffling will happen 1,2,3,4,5  --- sorted order   5,1,2,3,4  - 1 4,5,1,2,3  - 2 3,4,5,1,2  - 3

Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
2,042 questions
asked 2024-06-25T08:20:36.5333333+00:00
Vineet S 265 Reputation points
commented 2024-06-30T15:08:50.3966667+00:00
PRADEEPCHEEKATLA-MSFT 84,051 Reputation points Microsoft Employee
1 answer One of the answers was accepted by the question author.

Azure Databricks fails

Hello, In the databricks notebook which is provided by Microsoft training classes, when I tried to import => read a data (csv or json) like path = source + "/wikipedia/pagecounts/staging_parquet_en_only_clean/" files =…

Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
2,042 questions
asked 2023-07-20T13:58:55.4033333+00:00
Catherine 刘 30 Reputation points
edited a comment 2024-06-29T15:40:00.8866667+00:00
Choi, Seung-Rak 0 Reputation points
2 answers One of the answers was accepted by the question author.

databricks cluster sizing

Hey, how to calculate cluster core and workers node of 10gb data load every 2 hours ... what is the calculation behind this

Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
2,042 questions
asked 2024-06-26T06:13:37.5433333+00:00
Vineet S 265 Reputation points
accepted 2024-06-28T23:29:16.8466667+00:00
Vineet S 265 Reputation points
0 answers

How can I remove the `sample` catalog from Azure Databricks Workspaces?

All Azure Databricks Workspaces come with a sample catalog owned by Databricks that I cannot seem to remove or hide. I have tried dropping it, and I have also tried revoking and denying permissions on the catalog, but I keep receiving an error that is…

Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
2,042 questions
asked 2023-10-31T19:35:30.61+00:00
Tim Thein 5 Reputation points
commented 2024-06-28T04:06:26.55+00:00
PRADEEPCHEEKATLA-MSFT 84,051 Reputation points Microsoft Employee
1 answer

view in dataframe

hey, how we can create or replace view statement in spark sql in dataframe of databricks create or replace view as (select * from temp1)

Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
2,042 questions
Azure Data Factory
Azure Data Factory
An Azure service for ingesting, preparing, and transforming data at scale.
9,994 questions
asked 2024-06-25T17:31:45.62+00:00
Vineet S 265 Reputation points
commented 2024-06-27T07:06:54.6+00:00
Harishga 5,510 Reputation points Microsoft Vendor
1 answer One of the answers was accepted by the question author.

Error while implementing File Notification Mode: com.microsoft.azure.storage.StorageException: This request is not authorized to perform this operation using this permission.

While implementing File Notification Mode in Autoloader, I get the following error. Has anyone faced the similar issue? Note: The Databricks Service Principal is having Contributor role to Storage account. com.microsoft.azure.storage.StorageException:…

Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
2,042 questions
asked 2024-06-21T06:16:38.9166667+00:00
Hiran Amarathunga 65 Reputation points
commented 2024-06-27T03:46:11.5833333+00:00
PRADEEPCHEEKATLA-MSFT 84,051 Reputation points Microsoft Employee
1 answer One of the answers was accepted by the question author.

Cross tenant AAD authentication for Purview & Databricks

Hi, I want to know if for Purview & Databricks, is it possible to authenticate with cross tenant AAD? That is to say, can users belonging to AAD in tenant1, be able to login to Purview & Databricks which are setup in tenant2? Thanks

Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
2,042 questions
Active Directory
Active Directory
A set of directory-based technologies included in Windows Server.
6,133 questions
Microsoft Purview
Microsoft Purview
A Microsoft data governance service that helps manage and govern on-premises, multicloud, and software-as-a-service data. Previously known as Azure Purview.
1,025 questions
asked 2024-06-20T03:44:04.3166667+00:00
Amit Singh 60 Reputation points
accepted 2024-06-27T03:18:50.43+00:00
Amit Singh 60 Reputation points
0 answers

[INTERNAL_ERROR] The Spark SQL phase optimization failed with an internal error. You hit a bug in Spark or the Spark plugins you use.

"I am trying to extract data from Azure Cosmos DB using PySpark and I am getting the following error: Py4JJavaError: An error occurred while calling o700.save.: org.apache.spark.SparkException: [INTERNAL_ERROR] The Spark SQL phase optimization…

Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
2,042 questions
Azure Cosmos DB
Azure Cosmos DB
An Azure NoSQL database service for app development.
1,514 questions
asked 2024-03-08T08:35:10.85+00:00
Nathalia Fernandez Rodrigues 0 Reputation points
commented 2024-06-26T16:39:36.0333333+00:00
Luce PHILIBERT 0 Reputation points
2 answers

Azure Data Bricks - User Doesn't have permission to perform this action while connecting to Azure Synapse Dedicate Pool

We are connecting Azure Synapse Analytics - Dedicated Pool using the PySpark Code that runs from Azure Data Bricks using SQL Authentication. While running, we are getting the below error when we use a user with db_datawriter and db_datareader…

Azure Synapse Analytics
Azure Synapse Analytics
An Azure analytics service that brings together data integration, enterprise data warehousing, and big data analytics. Previously known as Azure SQL Data Warehouse.
4,613 questions
Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
2,042 questions
asked 2024-06-07T11:26:00.4933333+00:00
Praveen Sreeram 1 Reputation point
commented 2024-06-26T06:20:40.76+00:00
Praveen Kumar Sreeram 0 Reputation points