Azure Databricks

1 answer

Best way to connect to a databricks datamart for further exploration in PowerBI

Situation: Databricks is used as enterprise data platform. In reality local teams are moving forward with a different speed than the global organization is. To support this but still keeping control on the "golden source" we're setting up an…

asked

Peter Verrykt 21

accepted

Peter Verrykt 21

1 answer

How do I orchestrate ML model retraining periodically?

I have to retrain every month or so a PyTorch Model trained on data obtained from processing tables sitting in Azure Data Lake Storage gen 1. So far, I have the following building blocks: A Databricks notebook that does the ETL job of…

asked

Davide Fiocco 31

answered

Ramr-msft 17,731

1 answer

What is the use of oldest-time-to-consider param in Jobs API?

Hi, I haven't found any documentation around what this value "oldest-time-to-consider": "1457570074236" is used for in the Databricks Job API. Can someone please direct me to the documentation that talks about the significance of…

asked

VishR 21

commented

VishR 21

1 answer

Can you write multiple streaming queries(same schema, different input sources) into same Azure storage without overwriting?

Hi, I have to this requirement to write multiple streaming queries(same schema, different input sources) into same Azure blob delta lake gen 3 storage without overwriting. I need the data to co-exist in the same write directory, say like in 'append'…

asked

Mayuri Kadam 81 Microsoft Employee

accepted

Mayuri Kadam 81 Microsoft Employee

1 answer

Is there any way to do Custom dynamic mapping of different number of columns in dataflow or any other options to achieve this?

My source (CSV file in ADLS) has header record(3 columns) , detail records(5 columns) and trailer record(2 columns) . The header record has less number of columns than the detail records. When I try to convert this csv file to parquet, i m getting the…

asked

Abarna C (DAAI - Cloud Data Platforms) 21

accepted

Abarna C (DAAI - Cloud Data Platforms) 21

1 answer

MLOps using Azure Databricks & Azure ML - question on data prep for model inference and retraining.

I am using this blog (https://databricks.com/blog/2020/10/13/using-mlops-with-mlflow-and-azure.html) to set-up MLOps using Azure Databricks & Azure ML. As mentioned in the blog, we deploy MLflow model into an Azure ML environment using the built in…

asked

Kiran Purushotham 11

commented

GermanM 1

1 answer

How to use ARM template to restrict publicBlobAccess to managed Databricks storage account

In our organisation we are required to disable publicBlobAccess and enable TLS1_2 as minimum version on all storage accounts. Preferably we also use StorageV2 type instead of BlobStorage. When we create Databricks workspace, using ARM template, the…

asked

Leerdam van, J (Jean-Marc) 16

commented

Janne Kujanpää 216

1 answer

How to query 3rd party Azure DataLake Gen2 and only store the results

First, what I am trying to do is I want to query and aggregate raw JSON files stored in a 3rd party's Azure Data Lake (Gen2) and store those aggregates in my own data lake or relation db. I do not want to physically copy all of those raw JSON files…

asked

JasonW-5564 161

commented

HarithaMaddi-MSFT 10,136

1 answer

Linkedin connectivity with Azure

Hi, Is there a way to extract existing content from a Linkedin company page using Azure? Thanks

asked

Darshika Rajendran 1

commented

HarithaMaddi-MSFT 10,136

0 answers

Azure repo - Can't create or push tag

I host a project on Azure DevOps Repositories and I would like to create several git tag for project version release. I'm a GitKraken user so I added a new tag and push it to origin but this error occurs: So I checked my permission on Azure…

asked

MATTONE THOMAS 1

commented

Jaliya Udagedara 2,821 MVP

1 answer

The file updated in databricks is not reflecting in Azure Portal

I have a databrick workspace, where I read a file from Azure blob storage, updated a file and uploaded it in another Azure blob storage space. Now when I access that file through any databricks workspace, I can see the file and access the content of it.…

asked

Amey Pimpley 1

commented

PRADEEPCHEEKATLA-MSFT 85,346 Microsoft Employee

1 answer

I HAVE ERROR WHEN AZURE DATABRICKS WRITE COSMOS DB

I am trying to write a spark dataframe from azure databricks to a cosmos db database and I have this error Py4JJavaError: An error occurred while calling o753.save. : org.apache.spark.SparkException: Job aborted due to stage failure: Task 0 in stage…

asked

Williams Gerard Gamboa Anchante 1

commented

HimanshuSinha-msft 19,381 Microsoft Employee

0 answers

Deletion of managed MLflow Artifacts

When using Workspace experiments in Azure Databricks with the default managed MLflow artifact location dbfs:/databricks/mlflow-tracking configured, we see the following message when deleting an MLflow experiment run: Deleted runs are restorable for…

asked

Christoph Stumpf 1

commented

HimanshuSinha-msft 19,381 Microsoft Employee

1 answer

Unable to delete folder in databricks "DBFS://

Hi, When I run the command %fs ls '/' in the results I see a folder path as "dbfs://" and name as "/". and tried to run the command in the notebook %fs ls '//' I get the java error and even not able to delete the folder. Please…

asked

prado 1

commented

prado 1

1 answer

using streaming batch for multiple operations

I am new to spark and DataBricks and was trying to look for a solution where I can utilize a batch from a eventhub stream to accomplish multiple business logic but could not find any guidance. Stream I get from EventHub is a CDC stream from multiple…

asked

Rohit Sapru 41

commented

PRADEEPCHEEKATLA-MSFT 85,346 Microsoft Employee

1 answer

Databricks readstream writestream to Azure Synapse

I am having an issue on writing stream to Azure synapse with the following error . let's have a look and see if there is idea ?

asked

sakuraime 2,321

commented

PRADEEPCHEEKATLA-MSFT 85,346 Microsoft Employee

1 answer

How to set security permissions to Databases in databricks through Notebooks

We are stuck on the way to set security permissions to Databases by using Notebooks %sql. At first, let me explain our situations and settings. We run the following code on Notebooks: %sql CREATE DATABASE X ; GRANT USAGE ON DATABASE X TO…

asked

Asuka 21

accepted

Asuka 21

1 answer

Databricks Pyspark exception handling best practices

Hi, In the current development of pyspark notebooks on Databricks, I typically use the python specific exception blocks to handle different situations that may arise. I am wondering if there are any best practices/recommendations or patterns to handle…

asked

Satya D 141

accepted

Satya D 141

1 answer