作業因叢集管理員的核心執行個體要求限制而失敗Job fails due to cluster manager core instance request limit

問題Problem

Azure Databricks 筆記本或作業 API 會傳回下列錯誤:An Azure Databricks Notebook or Job API returns the following error:

Unexpected failure while creating the cluster for the job. Cause REQUEST_LIMIT_EXCEEDED: Your request was rejected due to API rate limit. Please retry your request later, or choose a larger node type instead.

原因Cause

此錯誤表示已超過叢集管理員服務核心實例要求限制The error indicates the Cluster Manager Service core instance request limit was exceeded.

叢集管理員核心實例最多可支援1000個要求。A Cluster Manager core instance can support a maximum of 1000 requests.

解決方法Solution

請聯絡 Azure Databricks 支援以增加核心實例中設定的限制。Contact Azure Databricks Support to increase the limit set in the core instance.

Azure Databricks 可以將作業限制增加 maxBurstyUpsizePerOrg 到2000, upsizeTokenRefillRatePerMin 最高可達120。Azure Databricks can increase the job limit maxBurstyUpsizePerOrg up to 2000, and upsizeTokenRefillRatePerMin up to 120. 當限制增加時,目前正在執行的作業會受到影響。Current running jobs are affected when the limit is increased.

增加這些值可能會停止節流問題,但也會造成 CPU 使用率過高。Increasing these values can stop the throttling issue, but can also cause high CPU utilization.

此問題的最佳解決方法是將叢集管理員核心實例取代為可支援最大資料傳輸速率的較大實例。The best solution for this issue is to replace the Cluster Manager core instance with a larger instance that can support maximum data transmission rates.

Azure Databricks 支援人員可以將目前的叢集管理員實例類型變更為較大的一種。Azure Databricks Support can change the current Cluster Manager instance type to a larger one.