Scale unit node actions in Azure Stack Hub

This article describes how to view the status of a scale unit and its nodes, and how to run node actions such as power on, power off, shut down, drain, resume, and repair. Typically, you use these node actions during field replacement of parts, or to help recover a node.

Important

All node actions described in this article should target one node at a time.

View the node status

In the administrator portal, you can view the status of a scale unit and its associated nodes.

To view the status of a scale unit:

  1. On the Region management tile, select the region.

  2. On the left, under Infrastructure resources, select Scale units.

  3. In the results, select the scale unit.

  4. On the left, under General, select Nodes.

    View the following information:

    • The list of individual nodes.
    • Operational status (see the table of node operational states below).
    • Power status (running or stopped).
    • Server model.
    • IP address of the baseboard management controller (BMC).
    • Total number of cores.
    • Total amount of memory.

    Node actions can also raise expected alerts in the administrator portal.

[Image: Status of a scale unit]

Node operational states

Status                 Description
Running                The node is actively participating in the scale unit.
Stopped                The node is unavailable.
Adding                 The node is actively being added to the scale unit.
Repairing              The node is actively being repaired.
Maintenance            The node is paused, and no active user workload is running.
Requires Remediation   An error has been detected that requires the node to be repaired.

Azure Stack Hub shows Adding status after an operation

Azure Stack Hub may show a node's operational status as Adding after an operation such as drain, resume, repair, shutdown, or start. This can happen when the Fabric Resource Provider role cache isn't refreshed after the operation.

Before applying the following steps, make sure that no operation is currently in progress. Update the endpoint to match your environment.

  1. Open PowerShell and add your Azure Stack Hub environment. This requires Azure Stack Hub PowerShell to be installed on your computer.

    Add-AzEnvironment -Name AzureStack -ARMEndpoint https://adminmanagement.local.azurestack.external
    Add-AzAccount -Environment AzureStack
    
  2. Run the following command to restart the Fabric Resource Provider role.

    Restart-AzsInfrastructureRole -Name FabricResourceProvider
    
  3. Validate that the operational status of the impacted scale unit node has changed to Running. You can use the administrator portal or the following PowerShell command:

    Get-AzsScaleUnitNode | Format-Table Name, ScaleUnitNodeStatus, PowerState
    
  4. If the node's operational status is still shown as Adding, open a support incident.
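The steps above can be combined into a small helper. The following is a minimal sketch, not an official procedure. The function name `Reset-StaleAddingStatus`, its parameters, and the polling limits are hypothetical. It assumes that you're already signed in, that no operation is in progress, and that `Get-AzsScaleUnitNode` accepts `-Location` and `-Name` to select a single node.

```powershell
# Sketch only: restart the Fabric Resource Provider role when a node is
# stuck in Adding, then poll until the node reports Running.
# Reset-StaleAddingStatus is a hypothetical helper, not part of Azs.Fabric.Admin.
function Reset-StaleAddingStatus {
    param(
        [Parameter(Mandatory)] [string]$Location,   # region name
        [Parameter(Mandatory)] [string]$NodeName,   # scale unit node name
        [int]$MaxAttempts  = 10,                    # assumed polling limit
        [int]$DelaySeconds = 60                     # assumed wait between polls
    )

    $node = Get-AzsScaleUnitNode -Location $Location -Name $NodeName

    # Nothing to do unless the node is reported as Adding.
    if ($node.ScaleUnitNodeStatus -ne 'Adding') {
        return $node.ScaleUnitNodeStatus
    }

    # Step 2 of the procedure. Output is discarded so that it doesn't
    # become part of this function's return value.
    Restart-AzsInfrastructureRole -Name FabricResourceProvider | Out-Null

    # Step 3: poll until the status changes to Running or attempts run out.
    for ($i = 1; $i -le $MaxAttempts; $i++) {
        Start-Sleep -Seconds $DelaySeconds
        $node = Get-AzsScaleUnitNode -Location $Location -Name $NodeName
        if ($node.ScaleUnitNodeStatus -eq 'Running') { break }
    }

    # If this still returns Adding, open a support incident (step 4).
    return $node.ScaleUnitNodeStatus
}
```

A call such as `Reset-StaleAddingStatus -Location <RegionName> -NodeName <NodeName>` returns the last status observed. The restart cmdlet may ask for confirmation, depending on the module version.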

Scale unit node actions

When you view information about a scale unit node, you can also perform node actions like:

  • Start and stop (depending on the current power status).
  • Drain (disable) and resume (depending on the operational status).
  • Repair.
  • Shutdown.

The operational state of the node determines which options are available.

To run these actions from PowerShell, you need to install the Azure Stack Hub PowerShell modules. The cmdlets are in the Azs.Fabric.Admin module. To install or verify your installation of PowerShell for Azure Stack Hub, see Install PowerShell for Azure Stack Hub.

Stop

The stop action turns off the node. It's the same as pressing the power button. It doesn't send a shutdown signal to the operating system. For planned stop operations, always try the shutdown operation first.

This action is typically used when a node no longer responds to requests.

To run the stop action, open an elevated PowerShell prompt, and run the following cmdlet:

  Stop-AzsScaleUnitNode -Location <RegionName> -Name <NodeName>

In the unlikely case that the stop action doesn't work, retry the operation. If it fails a second time, use the BMC web interface instead.

For more information, see Stop-AzsScaleUnitNode.

Start

The start action turns on the node. It's the same as pressing the power button.

To run the start action, open an elevated PowerShell prompt, and run the following cmdlet:

  Start-AzsScaleUnitNode -Location <RegionName> -Name <NodeName>

In the unlikely case that the start action doesn't work, retry the operation. If it fails a second time, use the BMC web interface instead.

For more information, see Start-AzsScaleUnitNode.
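The "retry once, then fall back to the BMC" guidance for the stop and start actions could be scripted along these lines. This is a sketch under assumptions. `Invoke-NodePowerAction` is a hypothetical helper name, and the sketch assumes that a failed operation surfaces as an error that `-ErrorAction Stop` turns into a catchable exception.

```powershell
# Sketch only: try a start or stop action at most twice.
# Invoke-NodePowerAction is a hypothetical helper, not part of Azs.Fabric.Admin.
function Invoke-NodePowerAction {
    param(
        [Parameter(Mandatory)] [ValidateSet('Start', 'Stop')] [string]$Action,
        [Parameter(Mandatory)] [string]$Location,   # region name
        [Parameter(Mandatory)] [string]$NodeName    # scale unit node name
    )

    foreach ($attempt in 1..2) {
        try {
            if ($Action -eq 'Start') {
                Start-AzsScaleUnitNode -Location $Location -Name $NodeName -ErrorAction Stop | Out-Null
            }
            else {
                Stop-AzsScaleUnitNode -Location $Location -Name $NodeName -ErrorAction Stop | Out-Null
            }
            return $true    # the action was accepted
        }
        catch {
            Write-Warning "Attempt $attempt of $Action failed: $($_.Exception.Message)"
        }
    }

    # Both attempts failed: use the BMC web interface instead.
    return $false
}
```

The function returns `$true` when one of the two attempts succeeds and `$false` otherwise, so the caller can decide when to switch to the BMC web interface.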

Drain

The drain action moves all active workloads to the remaining nodes in that particular scale unit.

This action is typically used during field replacement of parts, like the replacement of an entire node.

Important

Make sure you use a drain operation on a node only during a planned maintenance window, where users have been notified. Under some conditions, active workloads can experience interruptions.

To run the drain action, open an elevated PowerShell prompt, and run the following cmdlet:

  Disable-AzsScaleUnitNode -Location <RegionName> -Name <NodeName>

For more information, see Disable-AzsScaleUnitNode.

Resume

The resume action resumes a disabled node and marks it active for workload placement. Earlier workloads that were running on the node don't fail back. (If you use a drain operation on a node, be sure to power it off. When you power the node back on, it isn't marked as active for workload placement. When ready, you must use the resume action to mark the node as active.)

To run the resume action, open an elevated PowerShell prompt, and run the following cmdlet:

  Enable-AzsScaleUnitNode -Location <RegionName> -Name <NodeName>

For more information, see Enable-AzsScaleUnitNode.
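Because a node that was drained and powered off isn't marked active when it comes back, returning it to service takes two actions: start, then resume. The following sketch shows one way to chain them. `Restore-NodeAfterMaintenance` is a hypothetical helper name, and the `PowerState` value `Running` and the single-node form of `Get-AzsScaleUnitNode` are assumptions based on the status fields described earlier.

```powershell
# Sketch only: power a node back on if needed, then resume it so that it
# is marked active for workload placement.
# Restore-NodeAfterMaintenance is a hypothetical helper name.
function Restore-NodeAfterMaintenance {
    param(
        [Parameter(Mandatory)] [string]$Location,   # region name
        [Parameter(Mandatory)] [string]$NodeName    # scale unit node name
    )

    $node = Get-AzsScaleUnitNode -Location $Location -Name $NodeName

    # Start the node only if it isn't already powered on (assumed value 'Running').
    if ($node.PowerState -ne 'Running') {
        Start-AzsScaleUnitNode -Location $Location -Name $NodeName -ErrorAction Stop | Out-Null
    }

    # Resume: marks the node active. Earlier workloads don't fail back.
    Enable-AzsScaleUnitNode -Location $Location -Name $NodeName -ErrorAction Stop | Out-Null

    # Return the operational status so the caller can confirm it is Running.
    (Get-AzsScaleUnitNode -Location $Location -Name $NodeName).ScaleUnitNodeStatus
}
```

If the returned status isn't Running, check the administrator portal for alerts before retrying.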

Repair

Warning

Firmware leveling is critical for the success of the operation described in this article. Missing this step can lead to system instability, a decrease in performance, security threats, or failure when Azure Stack Hub automation deploys the operating system. Always consult your hardware partner's documentation when replacing hardware to ensure the applied firmware matches the OEM Version displayed in the Azure Stack Hub administrator portal.

For more information and links to partner documentation, see Replace a hardware component.

Hardware Partner   Region   URL
Cisco              All      Cisco Integrated System for Microsoft Azure Stack Hub Operations Guide
                            Release Notes for Cisco Integrated System for Microsoft Azure Stack Hub
Dell EMC           All      Cloud for Microsoft Azure Stack Hub 14G (account and login required)
                            Cloud for Microsoft Azure Stack Hub 13G (account and login required)
Fujitsu            JAPAN    Fujitsu managed service support desk (account and login required)
                   EMEA     Fujitsu support IT products and systems
                            Fujitsu MySupport (account and login required)
HPE                All      HPE ProLiant for Microsoft Azure Stack Hub
Lenovo             All      ThinkAgile SXM Best Recipes

The repair action repairs a node. Use it only for either of the following scenarios:

  • Full node replacement (with or without new data disks).
  • After hardware component failure and replacement (if advised in the field replaceable unit [FRU] documentation).

Important

See your OEM hardware vendor's FRU documentation for exact steps when you need to replace a node or individual hardware components. The FRU documentation specifies whether you need to run the repair action after replacing a hardware component.

When you run the repair action, you need to specify the BMC IP address.

To run the repair action, open an elevated PowerShell prompt, and run the following cmdlet:

  Repair-AzsScaleUnitNode -Location <RegionName> -Name <NodeName> -BMCIPv4Address <BMCIPv4Address>

Shutdown

The shutdown action first moves all active workloads to the remaining nodes in the same scale unit. Then the action gracefully shuts down the scale unit node.

After you start a node that was shut down, you need to run the resume action. Earlier workloads that were running on the node don't fail back.

If the shutdown operation fails, attempt the drain operation followed by the shutdown operation.

To run the shutdown action, open an elevated PowerShell prompt, and run the following cmdlet:

  Stop-AzsScaleUnitNode -Location <RegionName> -Name <NodeName> -Shutdown
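The fallback described above (drain, then shut down again) can be expressed as a short function. This is an illustrative sketch, not a supported script. `Invoke-PlannedShutdown` is a hypothetical name, and the sketch assumes the cmdlets wait for the operation to finish and report failure as an error that `-ErrorAction Stop` makes catchable.

```powershell
# Sketch only: graceful shutdown with the documented fallback of
# draining the node first and then retrying the shutdown.
# Invoke-PlannedShutdown is a hypothetical helper name.
function Invoke-PlannedShutdown {
    param(
        [Parameter(Mandatory)] [string]$Location,   # region name
        [Parameter(Mandatory)] [string]$NodeName    # scale unit node name
    )

    try {
        # Normal path: shutdown moves workloads, then powers down gracefully.
        Stop-AzsScaleUnitNode -Location $Location -Name $NodeName -Shutdown -ErrorAction Stop | Out-Null
    }
    catch {
        Write-Warning "Shutdown failed: $($_.Exception.Message). Draining, then retrying."

        # Fallback: drain explicitly, then attempt the shutdown once more.
        # An error here is not caught and propagates to the caller.
        Disable-AzsScaleUnitNode -Location $Location -Name $NodeName -ErrorAction Stop | Out-Null
        Stop-AzsScaleUnitNode -Location $Location -Name $NodeName -Shutdown -ErrorAction Stop | Out-Null
    }
}
```

Run this against one node at a time, and only during a planned maintenance window, because the drain step can interrupt active workloads.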

Next steps