1,942 questions with Azure Databricks tags

Sort by: Updated
1 answer One of the answers was accepted by the question author.

Raising the limit in Azure Databricks identities

Context: The Azure Databricks documentation states that "You can have a maximum of 10,000 combined users and service principals and 5,000 groups in an account. Each workspace can have a maximum of 10,000 combined users and service principals and…

Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
1,942 questions
asked 2024-04-25T10:41:16.95+00:00
Kim Stig Hansen 20 Reputation points
accepted 2024-04-26T07:31:55.23+00:00
Kim Stig Hansen 20 Reputation points
1 answer One of the answers was accepted by the question author.

How to use AWS Databricks note book in ADF..?

Hi Team, I want to create Linked service in ADF with AWS Databricks notebook. Can you guys help me with list of steps that i need follow. Regards, Naveen.

Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
1,942 questions
Azure Data Factory
Azure Data Factory
An Azure service for ingesting, preparing, and transforming data at scale.
9,625 questions
asked 2023-10-30T03:44:41.43+00:00
commented 2024-04-25T10:09:37.6533333+00:00
Syed Ahamed S 0 Reputation points
1 answer

Custom text single label classification - Model API Consumption within Databricks

Hello together, i trained a model within Azure Language Studio , Custom text single label classification - and i want to consume Model API within Databricks Notebook. I get always below given error, kindly asking for your help. Thanks. Error: HTTP…

Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
1,942 questions
Azure AI services
Azure AI services
A group of Azure services, SDKs, and APIs designed to make apps more intelligent, engaging, and discoverable.
2,407 questions
asked 2024-04-18T11:00:16.9066667+00:00
Aziz Öztürk 20 Reputation points
commented 2024-04-25T05:47:24.94+00:00
PRADEEPCHEEKATLA-MSFT 77,901 Reputation points Microsoft Employee
1 answer

How to fix timeout error when clreating compute cluster on azure databricks.

Azure error message: [id: InstanceId(e8b82690e17a4624a152afb20dce5339), status: INSTANCE_LAUNCHING, workerEnvId:WorkerEnvId(workerenv-3367469989580530), lastStatusChangeTime: 1713836219115, groupIdOpt Some(0),requestIdOpt…

Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
1,942 questions
asked 2024-04-23T02:31:21.1133333+00:00
GEOFREY GETUBA 0 Reputation points
commented 2024-04-25T05:39:28.3766667+00:00
PRADEEPCHEEKATLA-MSFT 77,901 Reputation points Microsoft Employee
2 answers One of the answers was accepted by the question author.

How to ingest CDC data from SQL DATA SYNC to Event hub?

Hi we have a scenario to implement real-time CDC from SQL DATA SYNC to Azure SQL DB. Which tools are good for near real-time processes and prices as well. By using data bricks we can merge stream data into delta lake. But How to ingest real-time…

Azure SQL Database
Azure Event Hubs
Azure Event Hubs
An Azure real-time data ingestion service.
560 questions
Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
1,942 questions
Azure Stream Analytics
Azure Stream Analytics
An Azure real-time analytics service designed for mission-critical workloads.
330 questions
Azure Data Factory
Azure Data Factory
An Azure service for ingesting, preparing, and transforming data at scale.
9,625 questions
asked 2020-11-10T12:28:43.067+00:00
THIMMAIAH GARI,PRASHANTH,, 201 Reputation points
edited an answer 2024-04-24T06:51:50.8266667+00:00
Shane Blake 0 Reputation points
1 answer

How to write to datalakegen2 storage using databricks in delta format when connected using SAS tocken

I converted the data from parquet form to data format. Now I want to write the data to blob storage - datalakegen2. But facing below error while writing. I useed the below command to write my data: output_path =…

Azure Data Lake Storage
Azure Data Lake Storage
An Azure service that provides an enterprise-wide hyper-scale repository for big data analytic workloads and is integrated with Azure Blob Storage.
1,352 questions
Azure Storage Accounts
Azure Storage Accounts
Globally unique resources that provide access to data management services and serve as the parent namespace for the services.
2,718 questions
Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
1,942 questions
asked 2024-04-09T23:29:12.6666667+00:00
Sai Sunny Kothuri 0 Reputation points
commented 2024-04-24T05:00:35.0166667+00:00
KarishmaTiwari-MSFT 18,527 Reputation points Microsoft Employee
2 answers

LivyHttpRequestFailure: Something went wrong while processing your request. Please try again later. HTTP status code: 500. Trace ID: d2425fce-9179-49eb-829c-a7b2a0d963ed.

Hi Everyone, I am getting the Below Error Code while running my cells in Synapse Notebook: > LivyHttpRequestFailure: Something went wrong while processing your request. Please try again later. HTTP status code: 500. Trace ID:…

Azure Synapse Analytics
Azure Synapse Analytics
An Azure analytics service that brings together data integration, enterprise data warehousing, and big data analytics. Previously known as Azure SQL Data Warehouse.
4,405 questions
Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
1,942 questions
asked 2022-11-22T06:13:46.587+00:00
Devender 61 Reputation points
commented 2024-04-23T08:30:09.7233333+00:00
Dom Sadie 0 Reputation points
1 answer

Azure Databricks fail to install Geospark libraries from Maven

Hi Team , I am attempting to add below two geospark Maven libraries to my Azure Databricks interactive cluster with Runtime Version 14.3 LTS . However , I am getting below error Library installation attempted on the driver node of cluster…

Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
1,942 questions
asked 2024-04-15T06:24:17.8033333+00:00
Anuj, Singh (Cognizant) 25 Reputation points
commented 2024-04-23T03:38:08.5466667+00:00
PRADEEPCHEEKATLA-MSFT 77,901 Reputation points Microsoft Employee
1 answer

How to recover a accidently deleted databricks instance in a free trial account

How to recover a accidently deleted databricks instance in a free trial account.

Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
1,942 questions
asked 2024-04-19T10:01:26.8066667+00:00
Vinee Jain 0 Reputation points
commented 2024-04-22T03:54:20.6066667+00:00
Smaran Thoomu 10,080 Reputation points Microsoft Vendor
1 answer One of the answers was accepted by the question author.

Azure Delta Lake to Snowflake

Hi Team, I am creating Delta lake in Azure data lake from ADF using Dataflow Sink - inline dataset as Delta and also through Databricks. Have created External Table in Databricks which is pointing to Mounted Azure datalake location. Now, I want to load…

Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
1,942 questions
Azure Data Factory
Azure Data Factory
An Azure service for ingesting, preparing, and transforming data at scale.
9,625 questions
asked 2024-04-18T09:50:12.3466667+00:00
Vaibhav 105 Reputation points
accepted 2024-04-20T14:07:03.6+00:00
Vaibhav 105 Reputation points
0 answers

FileAlreadyExistsException: Failed to rename temp file dbfs:/mnt/delta_checkpoints/sources/0/rocksdb/__tmp_path_dir/.2.zip.52d0723f-b803-4a8a-9533-9d6e67813641.tmp to dbfs:/mnt/delta_checkpoints/sources/0/rocksdb/2.zip because file exists

I have built a streaming pipeline with spark autoloader. Source Folder is a azure blob container. We encountered a rare issue (could not replicate it). Below is the exception Message: org.apache.hadoop.fs.FileAlreadyExistsException: Failed to…

Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
1,942 questions
asked 2022-03-23T22:43:18.8+00:00
Balasubramanian Singaravelu 6 Reputation points
commented 2024-04-19T08:04:21.1366667+00:00
PRADEEPCHEEKATLA-MSFT 77,901 Reputation points Microsoft Employee
2 answers

Failing to connect to metastore when using dbx to launch ephimeral cluster in databricks

Dear all, I am using dbx to deploy and launch jobs on ephemeral clusters on databricks. I have initialized the the cicd-sample-project and connected to a fresh empty Databricks Free trial environment and everything works. But when I try to do…

Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
1,942 questions
asked 2022-12-13T12:39:33.237+00:00
Enrico Mosca 6 Reputation points
answered 2024-04-19T07:07:13.3133333+00:00
Alexander 0 Reputation points
4 answers

Databricks Cluster Size

Hi, We have a below use case. We are developing ML Development using azure data bricks cluster service's. Our data bricks will receive 20 GB size of dataset records and we are running our logic/algorithms against this dataset's. We have to run every…

Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
1,942 questions
asked 2024-04-16T14:55:11.34+00:00
james vasanth 0 Reputation points
commented 2024-04-18T09:05:04.9833333+00:00
PRADEEPCHEEKATLA-MSFT 77,901 Reputation points Microsoft Employee
1 answer

How to calculate the price of a databricks job with Photon engine?

It's trivial to calculate a databricks job cost using Azure pricing calculator. You add up Azure VM price and Azure Databricks DBU consumption price and you would get your answer. However when it comes to Databricks with Photon runtime, the pricing is…

Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
1,942 questions
asked 2024-04-05T14:53:12.39+00:00
Jiapeng Zhang 0 Reputation points
commented 2024-04-17T07:50:19.56+00:00
PRADEEPCHEEKATLA-MSFT 77,901 Reputation points Microsoft Employee
1 answer One of the answers was accepted by the question author.

ADF pipeline to read the data from UC table to adls gen2 account

Hello Team, We have a requirement to create Azure Datafactory pipeline to read the data from UC table, access on the table is provided ( to Azure Datafactory Managed Identity) and copy the data into adls gen2. Is there a way or article to implement this?…

Azure Data Lake Storage
Azure Data Lake Storage
An Azure service that provides an enterprise-wide hyper-scale repository for big data analytic workloads and is integrated with Azure Blob Storage.
1,352 questions
Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
1,942 questions
Azure Data Factory
Azure Data Factory
An Azure service for ingesting, preparing, and transforming data at scale.
9,625 questions
asked 2024-04-11T19:05:05.9733333+00:00
Ashwini Gaikwad 65 Reputation points
accepted 2024-04-15T07:35:40.09+00:00
Ashwini Gaikwad 65 Reputation points
0 answers

Enabling Azure PIM Disables user within DataBricks

We have successfully stood up Azure Databricks in our tenant. We are leveraging SCIM and User Provisioning to grant our users SSO access into DataBricks. We are trying to layer in an additional layer of security to meet our current security standards by…

Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
1,942 questions
asked 2023-09-01T14:55:42.55+00:00
Smileyville 1 Reputation point
edited a comment 2024-04-15T01:04:28.3066667+00:00
Chris 0 Reputation points
2 answers

How to Fix Error Configuring VPC Peering in Azure Databricks? Failed to add virtual network peering 'Peering' to 'workers-vnet'. Error: The client "" with object ID "" has permission to perform action

I'm facing an issue while attempting to configure VPC peering in Azure Databricks. When trying to establish VPC peering between the Azure Databricks workspace's VNET ("workers-vnet") and an external network, I encountered the following error…

Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
1,942 questions
asked 2024-04-10T17:31:13.91+00:00
Ivan David Perez Moreno 0 Reputation points
commented 2024-04-12T07:02:36.9066667+00:00
PRADEEPCHEEKATLA-MSFT 77,901 Reputation points Microsoft Employee
0 answers

Can we customize the index url to fetch Python package from a protected PyPi source

Recently Databricks provided in public preview the possibility to add library in the compute policy. We will need to install a library located in a protected PyPi repo from Azure DevOps: pkgs.dev.azure.com/{further_path}/pypi/packages. We used to have…

Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
1,942 questions
asked 2024-03-27T10:28:37.7933333+00:00
Sypula, Aleksandra 0 Reputation points
commented 2024-04-12T06:22:22.8666667+00:00
PRADEEPCHEEKATLA-MSFT 77,901 Reputation points Microsoft Employee
1 answer

How to add SSL certificate for using it on Databricks cluster with PySpark

Hi, We would like to use functions written in PySpark for calling an external service that requires SSL certificate on the cluster. Currently we are using an init script similar to explained in the documentation -…

Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
1,942 questions
asked 2024-04-03T08:20:22.1+00:00
Sypula, Aleksandra 0 Reputation points
commented 2024-04-11T04:23:37.9833333+00:00
PRADEEPCHEEKATLA-MSFT 77,901 Reputation points Microsoft Employee
1 answer

How can you use Databricks Vector Search with images?

I have been working with Databricks Vector Search and feel comfortable using it on any kind of textual data. However I am running into problems with using it for image searching, and there are no examples available in the documentation to provide more…

Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
1,942 questions
asked 2024-03-21T22:17:02.7866667+00:00
Isabel 0 Reputation points
commented 2024-04-11T04:20:46.4466667+00:00
PRADEEPCHEEKATLA-MSFT 77,901 Reputation points Microsoft Employee