1,947 questions with Azure Databricks tags

Sort by: Updated
0 answers

py4j.security.Py4JSecurityException

Hello I am trying to run spark XGBoostRegression model on Databricks cluster with Databricks runtime: 14.3 LTS. I am getting the following error: Py4JError: An error occurred while calling o547.resourceProfileManager. Trace:…

Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
1,947 questions
asked 2024-05-06T12:48:28.3166667+00:00
Ahuja, Rachit 0 Reputation points
commented 2024-05-10T23:05:39.4066667+00:00
BhargavaGunnam-MSFT 26,496 Reputation points Microsoft Employee
0 answers

Spark_Ambiguous_Executor_MaxExecutorFailures

Hi I'm running a scheduled multiple run notebook using the below configuration, but I keep getting the below error DAG = { "activities": [ { "name": "Notebook", "path":…

Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
1,947 questions
asked 2024-05-06T08:06:10.67+00:00
Seelan 0 Reputation points
commented 2024-05-10T23:04:05.6666667+00:00
BhargavaGunnam-MSFT 26,496 Reputation points Microsoft Employee
0 answers

Why create compute is taking long time?

I am trying create a compute for my workspaces i tried every combination still it is not working

Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
1,947 questions
asked 2024-05-03T13:41:31+00:00
Aditya Parida 0 Reputation points
commented 2024-05-10T22:53:53.5133333+00:00
BhargavaGunnam-MSFT 26,496 Reputation points Microsoft Employee
1 answer

[Databricks] Clusters are failing to launch. Cluster launch will be retried.

Hi all, I am a complete newbie on Databricks Azure. I have encounterd the below issue which I think is stopping me from running query. Any help will be much appreciated. Thanks. Billy Clusters are failing to launch. Cluster launch will be…

Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
1,947 questions
asked 2024-05-08T22:05:05.41+00:00
Billy Cheng 0 Reputation points
commented 2024-05-10T22:26:11.9733333+00:00
Billy Cheng 0 Reputation points
2 answers

Databricks support redirects to azure support: unexpected internal error when spinning up a Databricks all-purpose cluster

Hello, What do we do when we get this error, when spinning up a Databricks all-purpose cluster? {   "reason": {     "code": "CONTAINER_LAUNCH_FAILURE",     "type": "SERVICE_FAULT",    …

Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
1,947 questions
asked 2024-05-02T13:44:55.72+00:00
ADM.Susana Domingos 0 Reputation points
answered 2024-05-10T22:22:07.6033333+00:00
BhargavaGunnam-MSFT 26,496 Reputation points Microsoft Employee
1 answer

How do I add an inbound security rule if there is an default DenyAllInbound Rule that causes an error when attempting to create an inbound rule?

|Received an email with: The public IP address range for the Azure Databricks control plane will be updated on 30 May 2024—you may need to take action You're receiving this email because you use Azure Databricks. To support infrastructure …

Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
1,947 questions
asked 2024-04-30T17:39:32.7466667+00:00
Parris Sikorski (ALLEGIS GROUP HOLDINGS INC) 0 Reputation points Microsoft Vendor
commented 2024-05-10T16:56:15.79+00:00
Parris Sikorski (ALLEGIS GROUP HOLDINGS INC) 0 Reputation points Microsoft Vendor
1 answer One of the answers was accepted by the question author.

No Previews option in Azure Databricks user menu

I want to enable serverless compute in Azure Databricks which is in public preview, my workspace is eligible based on the details in the docs here and I am the workspace admin but I don't see a Previews option in my user menu. Is there another way to…

Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
1,947 questions
asked 2024-05-07T04:42:27.6566667+00:00
Matt 20 Reputation points
commented 2024-05-10T10:50:02.2566667+00:00
PRADEEPCHEEKATLA-MSFT 78,576 Reputation points Microsoft Employee
0 answers

Cannot read excel file which is in using adls using load_workbook of openpyxl in databricks

Cannot read excel file which is in using load_workbook of openpyxl but can read if copied to dbfs

Azure Data Lake Storage
Azure Data Lake Storage
An Azure service that provides an enterprise-wide hyper-scale repository for big data analytic workloads and is integrated with Azure Blob Storage.
1,357 questions
Azure Blob Storage
Azure Blob Storage
An Azure service that stores unstructured data in the cloud as blobs.
2,449 questions
Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
1,947 questions
Azure Data Factory
Azure Data Factory
An Azure service for ingesting, preparing, and transforming data at scale.
9,649 questions
Excel Management
Excel Management
Excel: A family of Microsoft spreadsheet software with tools for analyzing, charting, and communicating data.Management: The act or process of organizing, handling, directing or controlling something.
1,649 questions
asked 2024-05-10T10:34:09.57+00:00
Alpha 20 Reputation points
2 answers One of the answers was accepted by the question author.

Cannot See Index tagging in while uploading Blob in ADLS gen2

Azure Data Lake Storage
Azure Data Lake Storage
An Azure service that provides an enterprise-wide hyper-scale repository for big data analytic workloads and is integrated with Azure Blob Storage.
1,357 questions
Azure Storage Accounts
Azure Storage Accounts
Globally unique resources that provide access to data management services and serve as the parent namespace for the services.
2,722 questions
Azure Blob Storage
Azure Blob Storage
An Azure service that stores unstructured data in the cloud as blobs.
2,449 questions
Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
1,947 questions
Azure Role-based access control
Azure Role-based access control
An Azure service that provides fine-grained access management for Azure resources, enabling you to grant users only the rights they need to perform their jobs.
675 questions
asked 2024-05-01T06:57:01.2766667+00:00
Alpha 20 Reputation points
accepted 2024-05-10T10:00:15.3233333+00:00
Alpha 20 Reputation points
0 answers

Indexing a Pyspark dataframe

Hey guys, I am having a very large dataset as multiple parquets (like around 20,000 small files) which I am reading into a pyspark dataframe. I want to add an index column in this dataframe and then do some data profiling and data quality check…

Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
1,947 questions
asked 2024-05-09T07:29:38.0266667+00:00
Varun S Kumar 50 Reputation points
commented 2024-05-10T08:03:39.0133333+00:00
Varun S Kumar 50 Reputation points
1 answer

How to ship Azure Databricks artifacts from Dev->QA->Prod through Azure Devops Pipelines?

We have a Azure Databricks workspace and Dev/QA/Prod environments. Everytime the Data engineers have to ship the artifacts from nonprod -> prod (e.g. python notebooks, config modules, etc) they have to copy the artifacts manually over to the next…

Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
1,947 questions
asked 2024-04-29T21:17:03.8233333+00:00
Cataster 641 Reputation points
commented 2024-05-10T05:53:02.2833333+00:00
PRADEEPCHEEKATLA-MSFT 78,576 Reputation points Microsoft Employee
0 answers

How to reduce unnecessary high memory usage in a Databricks cluster?

We are having unnecessary high memory usage even when nothing is running on the cluster. When the cluster first starts, it's fine, but when I run a script and it finishes executing, nothing gets back to the idle (initial) state (even hours after nothing…

Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
1,947 questions
asked 2024-05-08T08:58:46.4433333+00:00
Senad Hadzikic 20 Reputation points
commented 2024-05-10T03:29:28.78+00:00
PRADEEPCHEEKATLA-MSFT 78,576 Reputation points Microsoft Employee
1 answer

Error while provisioning Databricks

Hi All I am receiving the below error while provisioning Databricks The resource write operation failed to complete successfully, because it reached terminal provisioning state 'Failed'. (Code: ResourceDeploymentFailure, The resource write operation…

Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
1,947 questions
asked 2023-09-05T14:04:52.76+00:00
Mohan P V 5 Reputation points
commented 2024-05-10T02:40:27.4533333+00:00
PRADEEPCHEEKATLA-MSFT 78,576 Reputation points Microsoft Employee
2 answers

How to configure ADF pipeline run, linked service, so it uses Databricks serverless compute

Databricks has recently announced serverless compute for workflows: https://learn.microsoft.com/en-us/azure/databricks/workflows/jobs/run-serverless-jobs I would like to be able to execute Azure Data Factory (ADF) jobs using this…

Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
1,947 questions
Azure Data Factory
Azure Data Factory
An Azure service for ingesting, preparing, and transforming data at scale.
9,649 questions
asked 2024-05-01T12:12:06.9033333+00:00
Krzysztof Przysowa 0 Reputation points
answered 2024-05-09T05:44:23.0466667+00:00
PRADEEPCHEEKATLA-MSFT 78,576 Reputation points Microsoft Employee
1 answer

PowerBI / Databrick can we edit data in report

When we create reports in PowerBi or in Databricks. can we edit the data in report and if it can updated in backend datasource. Please let me know if this possible

Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
1,947 questions
asked 2024-05-06T20:49:03.1766667+00:00
Pothiraj, Saranya-ADM 0 Reputation points
edited a comment 2024-05-09T05:33:25.6866667+00:00
PRADEEPCHEEKATLA-MSFT 78,576 Reputation points Microsoft Employee
0 answers

How do I figure out what public IP ranges my Databricks workspace clusters are coming from?

Relatively new to Databricks. I have an existing workspace that was created years ago. It is vnet-injected but it has secured cluster connectivity (SCC) disabled. I need to know the outbound IP addresses/ranges the clusters would communicate on to…

Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
1,947 questions
asked 2024-05-08T22:13:53.72+00:00
McDonald, Matthew 101 Reputation points
edited a comment 2024-05-09T05:13:27.48+00:00
PRADEEPCHEEKATLA-MSFT 78,576 Reputation points Microsoft Employee
1 answer

Error with Create Table USING DELTA LOCATION in training exercise

In the exercise https://microsoftlearning.github.io/mslearn-databricks/Instructions/Exercises/03-Delta-lake-in-Azure-Databricks.html the line of code spark.sql("CREATE TABLE AdventureWorks.ProductsExternal USING DELTA LOCATION…

Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
1,947 questions
Azure Training
Azure Training
Azure: A cloud computing platform and infrastructure for building, deploying and managing applications and services through a worldwide network of Microsoft-managed datacenters.Training: Instruction to develop new skills.
977 questions
asked 2024-05-01T13:00:09.32+00:00
James Mitchell 0 Reputation points
commented 2024-05-08T11:17:45.0566667+00:00
PRADEEPCHEEKATLA-MSFT 78,576 Reputation points Microsoft Employee
0 answers

Custom libraries (wheel) for ADF Databricks Python activity run on serverless compute

I want to be able to execute Python scripts (via Databricks Python) from Azure Data Factory using serverless compute. Serverless compute does not support cluster level (compute scoped) libraries. In databricks workflows, it is being done as…

Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
1,947 questions
Azure Data Factory
Azure Data Factory
An Azure service for ingesting, preparing, and transforming data at scale.
9,649 questions
asked 2024-05-01T12:30:52.2366667+00:00
Krzysztof Przysowa 0 Reputation points
commented 2024-05-08T06:46:04.15+00:00
PRADEEPCHEEKATLA-MSFT 78,576 Reputation points Microsoft Employee
0 answers

DatabricksSQL Logs and correlate with query history

Hi everyone, I'm currently working on capturing logging information about query executions and data downloads within a Databricks workspace. Here's a summary of my current setup and the issue I'm facing: Diagnostic Settings in Azure Databricks: I have…

Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
1,947 questions
asked 2024-04-22T11:09:19.4+00:00
Julio Avellaneda 0 Reputation points
commented 2024-05-06T18:07:56.88+00:00
Julio Avellaneda 0 Reputation points
1 answer

SAP latency data

Hi Expert, how to we can load the data from modified data in updated or insert fields in databricks using ADF or data bricks on trigger level instead of loading multiple times example: table updated or inserted with new records how table change and…

Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
1,947 questions
Azure Data Factory
Azure Data Factory
An Azure service for ingesting, preparing, and transforming data at scale.
9,649 questions
asked 2024-04-23T07:29:53.95+00:00
Vineet S 165 Reputation points
commented 2024-05-06T08:36:05.2166667+00:00
AnnuKumari-MSFT 31,151 Reputation points Microsoft Employee