2,150 questions with Azure Databricks tags

1 answer One of the answers was accepted by the question author.

Downsizing subnets that are associated with a databricks workspace

Hi, is it possible to downsize the private and public subnets that currently have nothing attached to them but are used by a Databricks workspace? Would this require the Databricks cluster to be redeployed? Are there any extra steps needed so the…

Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
2,150 questions
asked 2024-06-24T11:10:32.2233333+00:00
azmag 20 Reputation points
accepted 2024-07-03T13:02:20.8933333+00:00
azmag 20 Reputation points
1 answer

Can we connect ADX to Databricks without using App Registration?

I am trying to access ADX data from Databricks, but I am failing to configure the connection between Databricks and ADX. I don't have permission to use App Registration. Is there any way to do this with a managed identity or another alternative?

Azure Data Explorer
An Azure data analytics service for real-time analysis on large volumes of data streaming from sources including applications, websites, and internet of things devices.
519 questions
Azure Databricks
asked 2024-06-25T12:25:47.8766667+00:00
Ram Ahuja, Roma 0 Reputation points
edited the question 2024-07-03T10:29:01.5066667+00:00
PRADEEPCHEEKATLA-MSFT 88,381 Reputation points Microsoft Employee
1 answer One of the answers was accepted by the question author.

Set cloudFiles.maxFileAge and cloudFiles.backfillInterval values in Autoloader

I'm using the following options in Auto Loader: .option("cloudFiles.maxFileAge", "90 days") .option("cloudFiles.backfillInterval", "1 day"). Our data retention policy is 7 years. Should I set maxFileAge to 7 years…

Azure Databricks
asked 2024-07-02T02:12:40.21+00:00
Hiran Amarathunga 95 Reputation points
accepted 2024-07-02T07:43:44.5933333+00:00
Hiran Amarathunga 95 Reputation points
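
A minimal sketch of how the two options quoted in this question might be passed to Auto Loader. The helper names `autoloader_options` and `build_stream`, the JSON file format, and the schema location parameter are assumptions for illustration and do not come from the original post. `build_stream` only works on a Databricks runtime where the `cloudFiles` source is available. The sketch does not settle whether a 7-year maxFileAge is appropriate; it only shows where the values go.

```python
from typing import Dict


def autoloader_options(schema_location: str,
                       max_file_age: str = "90 days",
                       backfill_interval: str = "1 day",
                       file_format: str = "json") -> Dict[str, str]:
    """Build an Auto Loader option map with the values from the question.

    schema_location and file_format are hypothetical additions so that the
    stream can infer and track a schema.
    """
    return {
        "cloudFiles.format": file_format,
        "cloudFiles.schemaLocation": schema_location,
        "cloudFiles.maxFileAge": max_file_age,
        "cloudFiles.backfillInterval": backfill_interval,
    }


def build_stream(spark, source_path: str, options: Dict[str, str]):
    """Return a streaming DataFrame that reads source_path with Auto Loader.

    Assumes `spark` is a SparkSession on a Databricks runtime.
    """
    return (spark.readStream
            .format("cloudFiles")
            .options(**options)
            .load(source_path))
```

Keeping the options in one dictionary makes it easy to compare candidate settings (for example "90 days" against a longer age) without touching the stream definition.
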
3 answers

How to specify a custom catalog name for Azure Databricks Delta Lake Dataset in ADF

Hello, I am creating an Azure Databricks Delta Lake Dataset in ADF and I am only able to choose the database name that links to Databricks's hive_metastore. How can I specify a custom catalog name that I created in Databricks instead of…

Azure Data Lake Storage
An Azure service that provides an enterprise-wide hyper-scale repository for big data analytic workloads and is integrated with Azure Blob Storage.
1,458 questions
Azure Databricks
Azure Data Factory
An Azure service for ingesting, preparing, and transforming data at scale.
10,512 questions
asked 2024-01-04T06:25:53.11+00:00
Tom Young 0 Reputation points
commented 2024-07-02T04:23:57.05+00:00
Evgeniy Perfilyev 0 Reputation points
1 answer One of the answers was accepted by the question author.

Serverless warehouse suddenly fails to start up.

Hey all. Since today, we are suddenly getting the error below when starting a serverless warehouse. Details for the latest failure: Error: Cluster launch timeout. Type: SERVICE_FAULT Code: K8S_DBR_CLUSTER_LAUNCH_TIMEOUT Warehouse details: Type:…

Azure Databricks
asked 2024-06-25T12:48:53.3833333+00:00
Bhargav Shir 20 Reputation points
commented 2024-07-02T04:16:33.1166667+00:00
PRADEEPCHEEKATLA-MSFT 88,381 Reputation points Microsoft Employee
1 answer

How to solve an Invalid SessionHandle error with Azure Databricks?

I am building a SQLDatabaseChain chatbot model using LangChain SQLDatabaseChain and GPT-4. I first created this model in an Azure Databricks notebook like this: import json import os import langchain import mlflow from mlflow.models import…

Azure Databricks
asked 2024-06-17T14:38:14.4333333+00:00
Feng YIJUN 0 Reputation points
edited an answer 2024-07-01T07:42:48.6866667+00:00
Feng YIJUN 0 Reputation points
1 answer

Array in Databricks

Hi, I have the first row in sorted order. How can I produce row 1, row 2, and row 3 as a view using an array, and how does the reshuffling happen?
1,2,3,4,5 (sorted order)
5,1,2,3,4 (row 1)
4,5,1,2,3 (row 2)
3,4,5,1,2 (row 3)

Azure Databricks
asked 2024-06-25T08:20:36.5333333+00:00
Vineet S 730 Reputation points
commented 2024-06-30T15:08:50.3966667+00:00
PRADEEPCHEEKATLA-MSFT 88,381 Reputation points Microsoft Employee
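
The reshuffling in this question looks like a right rotation of the sorted array, where row k is the sorted row shifted right by k positions. A minimal plain-Python sketch of that pattern follows; the function name `rotate_right` is hypothetical, and applying the same idea to a Spark array column is left out here.

```python
from typing import List, Sequence


def rotate_right(values: Sequence[int], shift: int) -> List[int]:
    """Return a copy of values rotated to the right by `shift` positions."""
    n = len(values)
    if n == 0:
        return []
    k = shift % n  # a shift of n (or any multiple) leaves the order unchanged
    # The last k elements move to the front, the rest follow in order.
    return list(values[n - k:]) + list(values[:n - k])


sorted_row = [1, 2, 3, 4, 5]
for row_number in (1, 2, 3):
    print(row_number, rotate_right(sorted_row, row_number))
# 1 [5, 1, 2, 3, 4]
# 2 [4, 5, 1, 2, 3]
# 3 [3, 4, 5, 1, 2]
```
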
1 answer One of the answers was accepted by the question author.

Azure Databricks fails

Hello, in the Databricks notebook provided by the Microsoft training classes, when I try to import (read) data (CSV or JSON) like path = source + "/wikipedia/pagecounts/staging_parquet_en_only_clean/" files =…

Azure Databricks
asked 2023-07-20T13:58:55.4033333+00:00
Catherine 刘 35 Reputation points
edited a comment 2024-06-29T15:40:00.8866667+00:00
Choi, Seung-Rak 0 Reputation points
2 answers One of the answers was accepted by the question author.

Databricks cluster sizing

Hey, how do I calculate the cluster cores and worker nodes needed for a 10 GB data load every 2 hours? What is the calculation behind this?

Azure Databricks
asked 2024-06-26T06:13:37.5433333+00:00
Vineet S 730 Reputation points
accepted 2024-06-28T23:29:16.8466667+00:00
Vineet S 730 Reputation points
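
One common back-of-the-envelope approach, sketched below, starts from the number of input partitions and works back to cores and workers. Everything except the 10 GB figure is an assumption: the 128 MB partition size mirrors Spark's default `spark.sql.files.maxPartitionBytes`, while `task_waves` (how many rounds of tasks each core runs) and `cores_per_worker` are hypothetical knobs, and the function name `estimate_cluster` is made up for this example. Real sizing also depends on the transformations, shuffles, and the time budget, so treat the result as a starting point only.

```python
import math
from typing import Dict


def estimate_cluster(data_gb: float,
                     partition_mb: int = 128,
                     task_waves: int = 4,
                     cores_per_worker: int = 4) -> Dict[str, int]:
    """Rough estimate of partitions, cores, and workers for one batch load."""
    # Number of input partitions if each one holds about partition_mb of data.
    partitions = math.ceil(data_gb * 1024 / partition_mb)
    # Each core is assumed to process task_waves partitions one after another.
    total_cores = math.ceil(partitions / task_waves)
    # Workers needed to supply that many cores.
    workers = math.ceil(total_cores / cores_per_worker)
    return {"partitions": partitions,
            "total_cores": total_cores,
            "workers": workers}


print(estimate_cluster(10))
# {'partitions': 80, 'total_cores': 20, 'workers': 5}
```
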
0 answers

How can I remove the `sample` catalog from Azure Databricks Workspaces?

All Azure Databricks Workspaces come with a sample catalog owned by Databricks that I cannot seem to remove or hide. I have tried dropping it, and I have also tried revoking and denying permissions on the catalog, but I keep receiving an error that is…

Azure Databricks
asked 2023-10-31T19:35:30.61+00:00
Tim Thein 5 Reputation points
commented 2024-06-28T04:06:26.55+00:00
PRADEEPCHEEKATLA-MSFT 88,381 Reputation points Microsoft Employee
1 answer

View in DataFrame

Hey, how can we run a CREATE OR REPLACE VIEW statement in Spark SQL on a DataFrame in Databricks? create or replace view as (select * from temp1)

Azure Databricks
Azure Data Factory
asked 2024-06-25T17:31:45.62+00:00
Vineet S 730 Reputation points
commented 2024-06-27T07:06:54.6+00:00
Harishga 5,990 Reputation points Microsoft Vendor
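
A minimal PySpark sketch of one way to do this, assuming `temp1` is a DataFrame registered as a temporary view. The helper names `create_view_sql` and `register_views` and the view name `my_view` are hypothetical. The statement in the question is missing a view name, so the sketch supplies one. To my understanding Spark does not allow a permanent view to be defined over a temporary view, which is why the sketch creates a temporary view on top of `temp1`.

```python
def create_view_sql(view_name: str, source: str, temporary: bool = True) -> str:
    """Build a CREATE OR REPLACE [TEMPORARY] VIEW statement over `source`."""
    kind = "TEMPORARY VIEW" if temporary else "VIEW"
    return f"CREATE OR REPLACE {kind} {view_name} AS SELECT * FROM {source}"


def register_views(spark, df, temp_name: str = "temp1",
                   view_name: str = "my_view") -> None:
    """Expose `df` to Spark SQL and define a second view on top of it.

    Assumes `spark` is an active SparkSession and `df` is a DataFrame.
    """
    # Make the DataFrame queryable by name from SQL.
    df.createOrReplaceTempView(temp_name)
    # Define the view from the question, with an explicit view name.
    spark.sql(create_view_sql(view_name, temp_name, temporary=True))
```

If the source is a table in the metastore rather than a temporary view, passing `temporary=False` produces the permanent-view form of the same statement.
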
1 answer One of the answers was accepted by the question author.

Error while implementing File Notification Mode: com.microsoft.azure.storage.StorageException: This request is not authorized to perform this operation using this permission.

While implementing File Notification Mode in Auto Loader, I get the following error. Has anyone faced a similar issue? Note: The Databricks service principal has the Contributor role on the storage account. com.microsoft.azure.storage.StorageException:…

Azure Databricks
asked 2024-06-21T06:16:38.9166667+00:00
Hiran Amarathunga 95 Reputation points
commented 2024-06-27T03:46:11.5833333+00:00
PRADEEPCHEEKATLA-MSFT 88,381 Reputation points Microsoft Employee
1 answer One of the answers was accepted by the question author.

Cross tenant AAD authentication for Purview & Databricks

Hi, I want to know whether cross-tenant AAD authentication is possible for Purview and Databricks. That is, can users belonging to AAD in tenant1 log in to Purview and Databricks that are set up in tenant2? Thanks

Azure Databricks
Active Directory
A set of directory-based technologies included in Windows Server.
6,396 questions
Microsoft Purview
A Microsoft data governance service that helps manage and govern on-premises, multicloud, and software-as-a-service data. Previously known as Azure Purview.
1,125 questions
asked 2024-06-20T03:44:04.3166667+00:00
Amit Singh 60 Reputation points
accepted 2024-06-27T03:18:50.43+00:00
Amit Singh 60 Reputation points
0 answers

[INTERNAL_ERROR] The Spark SQL phase optimization failed with an internal error. You hit a bug in Spark or the Spark plugins you use.

"I am trying to extract data from Azure Cosmos DB using PySpark and I am getting the following error: Py4JJavaError: An error occurred while calling o700.save.: org.apache.spark.SparkException: [INTERNAL_ERROR] The Spark SQL phase optimization…

Azure Databricks
Azure Cosmos DB
An Azure NoSQL database service for app development.
1,601 questions
asked 2024-03-08T08:35:10.85+00:00
Nathalia Fernandez Rodrigues 0 Reputation points
commented 2024-06-26T16:39:36.0333333+00:00
Luce PHILIBERT 0 Reputation points
1 answer

VM CPU utilization

Hi, I have 100 VMs in a resource group in the Azure portal for which I am collecting VM metrics (CPU usage). How can VMs from a new subscription be recognized automatically so that their CPU usage is shown when loading through an ADF or Databricks pipeline?

Azure Synapse Analytics
An Azure analytics service that brings together data integration, enterprise data warehousing, and big data analytics. Previously known as Azure SQL Data Warehouse.
4,837 questions
Azure Databricks
Azure Data Factory
Azure App Service
Azure App Service is a service used to create and deploy scalable, mission-critical web apps.
7,624 questions
asked 2024-06-20T10:33:21.4333333+00:00
Vineet S 730 Reputation points
commented 2024-06-25T17:29:58.2766667+00:00
Vineet S 730 Reputation points
1 answer One of the answers was accepted by the question author.

Connecting to databricks using .Net

Hello Team, Is it possible to connect to Databricks and perform CRUD operations on catalog schema tables and Delta tables using .NET? If so, what approach is needed to connect to Databricks using JDBC or ODBC? Are there any specific libraries in Visual…

.NET
Microsoft Technologies based on the .NET software framework.
3,775 questions
Azure Databricks
asked 2024-06-21T15:59:05.22+00:00
Nagesh CL 646 Reputation points
commented 2024-06-25T11:45:56.2666667+00:00
Nagesh CL 646 Reputation points
1 answer

How to know which type of service we need within Azure Databricks for our implementation

I need to know the different types of services available within Databricks so that I can implement my solution more cost-efficiently. Is there anyone at Azure we can reach out to, to whom we can explain our implementation and who can provide the list of…

Azure Databricks
asked 2024-06-21T09:22:35.21+00:00
Arun Kumar S 0 Reputation points
edited a comment 2024-06-25T09:41:40.9366667+00:00
Smaran Thoomu 14,140 Reputation points Microsoft Vendor
1 answer

How can I migrate Databricks from one subscription to another during a cross-subscription migration?

I am unable to move Databricks from the source subscription to the destination subscription during cross-subscription migration

Azure Databricks
asked 2024-06-19T19:20:06.0166667+00:00
Maaz Ahmed Nagori 0 Reputation points
commented 2024-06-25T08:26:08.2166667+00:00
PRADEEPCHEEKATLA-MSFT 88,381 Reputation points Microsoft Employee
0 answers

I am unable to move Databricks from the source subscription to the destination subscription during cross-subscription migration

I am unable to move Databricks from the source subscription to the destination subscription during cross-subscription migration

Azure Databricks
asked 2024-06-19T19:22:30.1166667+00:00
Maaz Ahmed Nagori 0 Reputation points
commented 2024-06-24T19:57:49.5933333+00:00
Bhargava-MSFT 30,411 Reputation points Microsoft Employee
2 answers

Databricks not sending audit logs to Event Hubs

Hi, I'm trying to push all the logs from Databricks to Event Hubs using the diagnostic tool, but it is not working; nothing is pushed. I'm using the root access policy and have already created the event hub name. What else might I be missing? Thanks in…

Azure Event Hubs
An Azure real-time data ingestion service.
618 questions
Azure Databricks
asked 2024-06-13T19:41:27.85+00:00
Danny AMAYA 0 Reputation points
commented 2024-06-24T07:31:44.73+00:00
PRADEEPCHEEKATLA-MSFT 88,381 Reputation points Microsoft Employee