管理 Azure Stack Hub 的儲存體基礎結構Manage storage infrastructure for Azure Stack Hub

本文說明 Azure Stack Hub 儲存體基礎結構資源的健全狀態和操作狀態。This article describes the health and operational status of Azure Stack Hub storage infrastructure resources. 這些資源包括儲存體磁碟機和磁碟區。These resources include storage drives and volumes. 本主題中的資訊可協助您針對各種問題 (例如磁碟機無法新增至集區) 進行疑難排解。The information in this topic helps you troubleshoot various issues, like when a drive can't be added to a pool.

磁碟區狀態Volume states

若要了解磁碟區所處的狀態,請使用下列 PowerShell 命令:To find out what state volumes are in, use the following PowerShell commands:

$scaleunit_name = (Get-AzsScaleUnit)[0].name

$subsystem_name = (Get-AzsStorageSubSystem -ScaleUnit $scaleunit_name)[0].name

Get-AzsVolume -ScaleUnit $scaleunit_name -StorageSubSystem $subsystem_name | Select-Object VolumeLabel, HealthStatus, OperationalStatus, RepairStatus, Description, Action, TotalCapacityGB, RemainingCapacityGB

下列輸出範例顯示已中斷連結的磁碟區和已降級/不完整的磁碟區:Here's an example of output showing a detached volume and a degraded/incomplete volume:

VolumeLabelVolumeLabel HealthStatusHealthStatus OperationalStatusOperationalStatus
ObjStore_1ObjStore_1 UnknownUnknown 已卸離Detached
ObjStore_2ObjStore_2 警告Warning {已降級、不完整}{Degraded, Incomplete}

以下各節列出健全狀態和操作狀態:The following sections list the health and operational states:

磁碟區健全狀態:HealthyVolume health state: Healthy

操作狀態Operational state 描述Description
[確定]OK 磁碟區狀況良好。The volume is healthy.
次佳Suboptimal 資料未平均寫入各個磁碟機。Data isn't written evenly across drives.

動作: 請連絡支援人員,以最佳化存放集區中的磁碟機使用情形。Action: Contact Support to optimize drive usage in the storage pool. 在此之前,請先參考 https://aka.ms/azurestacklogfiles 的指引開始進行記錄檔收集程序。Before you do, start the log file collection process using the guidance from https://aka.ms/azurestacklogfiles. 在還原失敗的連線之後,您可能必須從備份還原。You may have to restore from backup after the failed connection is restored.

磁碟區健全狀態:警告Volume health state: Warning

磁碟區的健全狀態為「警告」時,表示您的資料有一或多個複本無法使用,但 Azure Stack Hub 仍可讀取至少一個資料複本。When the volume is in a Warning health state, it means that one or more copies of your data are unavailable but Azure Stack Hub can still read at least one copy of your data.

操作狀態Operational state 描述Description
運作中In service Azure Stack Hub 正在修復磁碟區,例如,新增或移除磁碟機之後的修復。Azure Stack Hub is repairing the volume, like after adding or removing a drive. 修復完成後,磁碟區應該就會恢復為「正常」健全狀態。When the repair is complete, the volume should return to the OK health state.

動作: 等到 Azure Stack Hub 完成磁碟區修復後,再查看其後的狀態。Action: Wait for Azure Stack Hub to finish repairing the volume and check the status afterward.
不完整Incomplete 磁碟區的復原能力因一或多個磁碟機故障或遺失而下降。The resilience of the volume is reduced because one or more drives failed or are missing. 不過,遺失的磁碟機包含您的資料最新的複本。However, the missing drives contain up-to-date copies of your data.

動作: 重新連接任何遺失的磁碟機,並更換任何故障的磁碟機,然後使任何離線的伺服器重新上線。Action: Reconnect any missing drives, replace any failed drives, and bring online any servers that are offline.
已降級Degraded 磁碟區的復原能力因為一或多個磁碟機故障或遺失而下降,且這些磁碟機上有已過期的資料複本。The resilience of the volume is reduced because of one or more failed or missing drives as well as outdated copies of data on the drives.

動作: 重新連接任何遺失的磁碟機,並更換任何故障的磁碟機,然後使任何離線的伺服器重新上線。Action: Reconnect any missing drives, replace any failed drives, and bring online any servers that are offline.

磁碟區健全狀態:狀況不良Volume health state: Unhealthy

磁碟區處於「狀況不良」的健全狀態時,當下將無法存取磁碟區上的部分或所有資料。When a volume is in an Unhealthy health state, some or all of the data on the volume is currently inaccessible.

操作狀態Operational state 描述Description
無備援No redundancy 有太多磁碟機故障,因此磁碟區已遺失資料。The volume has lost data because too many drives failed.

動作: 請連絡支援人員。Action: Contact Support. 在此之前,請先參考 https://aka.ms/azurestacklogfiles 的指引開始進行記錄檔收集程序。Before you do, start the log file collection process using the guidance from https://aka.ms/azurestacklogfiles.

磁碟區健全狀態:UnknownVolume health state: Unknown

如果虛擬磁碟已中斷連結,磁碟區也可能處於「不明」健全狀態。The volume can also be in the Unknown health state if the virtual disk has become detached.

操作狀態Operational state 描述Description
已卸離Detached 儲存裝置發生可能導致磁碟區無法存取的故障。A storage device failure occurred which may cause the volume to be inaccessible. 部分資料可能會遺失。Some data may be lost.

動作:Action:
1.檢查所有儲存裝置的實體和網路連線。1. Check the physical and network connectivity of all storage devices.
2.如果所有裝置皆正確連線,請連絡支援人員。2. If all devices are connected correctly, contact Support. 在此之前,請先參考 https://aka.ms/azurestacklogfiles 的指引開始進行記錄檔收集程序。Before you do, start the log file collection process using the guidance from https://aka.ms/azurestacklogfiles. 在還原失敗的連線之後,您可能必須從備份還原。You may have to restore from backup after the failed connection is restored.

磁碟機狀態Drive states

請使用下列 PowerShell 命令監視磁碟機的狀態:Use the following PowerShell commands to monitor the state of drives:

$scaleunit_name = (Get-AzsScaleUnit)[0].name

$subsystem_name = (Get-AzsStorageSubSystem -ScaleUnit $scaleunit_name)[0].name

Get-AzsDrive -ScaleUnit $scaleunit_name -StorageSubSystem $subsystem_name | Select-Object StorageNode, PhysicalLocation, HealthStatus, OperationalStatus, Description, Action, Usage, CanPool, CannotPoolReason, SerialNumber, Model, MediaType, CapacityGB

以下各節說明磁碟機可能的健全狀態:The following sections describe the health states a drive can be in:

磁碟機健全狀態:HealthyDrive health state: Healthy

操作狀態Operational state 描述Description
[確定]OK 磁碟區狀況良好。The volume is healthy.
運作中In service 磁碟機正在執行某些內部內務處理作業。The drive is doing some internal housekeeping operations. 此動作完成後,磁碟機應該就會恢復為「正常」健全狀態。When the action is complete, the drive should return to the OK health state.

磁片磁碟機健全狀態:警告Drive health state: Warning

處於「警告」狀態的磁碟機可成功讀取和寫入資料,但會發生問題。A drive in the Warning state can read and write data successfully but has an issue.

操作狀態Operational state 描述Description
中斷通訊Lost communication 磁碟機的連線已中斷。Connectivity has been lost to the drive.

動作: 將所有伺服器恢復上線。Action: Bring all servers back online. 若仍無法解決問題,請重新連接磁碟機。If that doesn't fix it, reconnect the drive. 若此狀態持續發生,請更換磁碟機以確保完整的復原能力。If this state persists, replace the drive to ensure full resiliency.
預期性故障Predictive failure 預期磁碟機即將發生故障。A failure of the drive is predicted to occur soon.

動作: 盡快更換磁碟機以確保完整的復原能力。Action: Replace the drive as soon as possible to ensure full resiliency.
IO 錯誤IO error 存取磁碟機時會發生暫時性錯誤。There was a temporary error accessing the drive.

動作: 若此狀態持續發生,請更換磁碟機以確保完整的復原能力。Action: If this state persists, replace the drive to ensure full resiliency.
暫時性錯誤Transient error 磁碟機發生暫時性錯誤。There was a temporary error with the drive. 這錯誤通常表示磁碟機沒有回應,但也可能表示儲存空間直接存取的保護分割區已從磁碟機中不當移除。This error usually means the drive was unresponsive, but it could also mean that the Storage Spaces Direct protective partition was inappropriately removed from the drive.

動作: 若此狀態持續發生,請更換磁碟機以確保完整的復原能力。Action: If this state persists, replace the drive to ensure full resiliency.
異常延遲Abnormal latency 磁碟機有時沒有回應,並出現故障的跡象。The drive is sometimes unresponsive and is showing signs of failure.

動作: 若此狀態持續發生,請更換磁碟機以確保完整的復原能力。Action: If this state persists, replace the drive to ensure full resiliency.
正在從集區移除Removing from pool Azure Stack Hub 正在從存放集區移除磁碟機。Azure Stack Hub is in the process of removing the drive from its storage pool.

動作: 等到 Azure Stack Hub 完成磁碟機移除後,再查看其後的狀態。Action: Wait for Azure Stack Hub to finish removing the drive, and check the status afterward.
若狀態不變,請連絡支援人員。If the status remains, contact Support. 在此之前,請先參考 https://aka.ms/azurestacklogfiles 的指引開始進行記錄檔收集程序。Before you do, start the log file collection process using the guidance from https://aka.ms/azurestacklogfiles.
正在進入維護模式Starting maintenance mode Azure Stack Hub 正在使磁碟機進入維護模式。Azure Stack Hub is in the process of putting the drive in maintenance mode. 這是暫時性的狀態 - 磁碟機應該很快就會處於維護模式狀態。This state is temporary—the drive should soon be in the In maintenance mode state.

動作: 等到 Azure Stack Hub 完成此程序後,再查看其後的狀態。Action: Wait for Azure Stack Hub to finish the process and check the status afterward.
處於維護模式In maintenance mode 磁碟機處於維護模式,暫停磁碟機的讀取和寫入作業。The drive is in maintenance mode, halting reads and writes from the drive. 此狀態通常表示 Azure Stack Hub 正在對磁碟機執行系統管理工作,例如 PNU 或 FRU。This state usually means Azure Stack Hub administration tasks such as PNU or FRU are operating the drive. 但是,系統管理員也可能將磁碟機置於維護模式。But the admin could also place the drive in maintenance mode.

動作: 等到 Azure Stack Hub 完成系統管理工作後,再查看其後的狀態。Action: Wait for Hub Azure Stack Hub to finish the administration task, and check the status afterward.
若狀態不變,請連絡支援人員。If the status remains, contact Support. 在此之前,請先參考 https://aka.ms/azurestacklogfiles 的指引開始進行記錄檔收集程序。Before you do, start the log file collection process using the guidance from https://aka.ms/azurestacklogfiles.
正在停止維護模式Stopping maintenance mode Azure Stack Hub 正在將磁碟機恢復上線。Azure Stack Hub is in the process of bringing the drive back online. 這是暫時性的狀態 - 磁碟機應該很快就會處於另一個狀態,符合預期的「良好」。This state is temporary - the drive should soon be in another state, ideally Healthy.

動作: 等到 Azure Stack Hub 完成此程序後,再查看其後的狀態。Action: Wait for Azure Stack Hub to finish the process and check the status afterward.

磁碟機健全狀態:狀況不良Drive health state: Unhealthy

處於「狀況不良」狀態的磁碟機目前無法進行寫入或存取。A drive in the Unhealthy state can't currently be written to or accessed.

操作狀態Operational state 描述Description
SplitSplit 磁碟機已與集區分離。The drive has become separated from the pool.

動作: 更換具有新磁碟的磁碟機。Action: Replace the drive with a new disk. 如果您必須使用此磁碟,請從系統中移除磁碟,並確定磁碟上沒有任何有用的資料,再清除磁碟,然後重新安裝磁碟。If you must use this disk, remove the disk from the system, make sure there's no useful data on the disk, erase the disk, and then reseat the disk.
無法使用Not usable 實體磁碟已遭隔離,因為您的解決方案廠商並不支援。The physical disk is quarantined because it's not supported by your solution vendor. 只有通過解決方案核准、且具有正確磁碟韌體的磁碟,才受到支援。Only disks that are approved for the solution and have the correct disk firmware are supported.

動作: 更換磁碟機,且其磁碟的製造商和型號必須已通過解決方案核准。Action: Replace the drive with a disk that has an approved manufacturer and model number for the solution.
過時的中繼資料Stale metadata 替換磁碟先前已使用過,且可能包含來自不明儲存體系統的資料。The replacement disk was previously used and may contain data from an unknown storage system. 此磁碟已遭隔離。The disk is quarantined.

動作: 更換具有新磁碟的磁碟機。Action: Replace the drive with a new disk. 如果您必須使用此磁碟,請從系統中移除磁碟,並確定磁碟上沒有任何有用的資料,再清除磁碟,然後重新安裝磁碟。If you must use this disk, remove the disk from the system, make sure there's no useful data on the disk, erase the disk, and then reseat the disk.
無法辨識的中繼資料Unrecognized metadata 若在磁碟機上發現無法辨識的中繼資料,通常表示該磁碟機上有來自不同集區的中繼資料。Unrecognized metadata found on the drive, which usually means that the drive has metadata from a different pool on it.

動作: 更換具有新磁碟的磁碟機。Action: Replace the drive with a new disk. 如果您必須使用此磁碟,請從系統中移除磁碟,並確定磁碟上沒有任何有用的資料,再清除磁碟,然後重新安裝磁碟。If you must use this disk, remove the disk from the system, make sure there's no useful data on the disk, erase the disk, and then reseat the disk.
故障的媒體Failed media 磁碟機故障,且儲存空間不會再加以使用。The drive failed and won't be used by Storage Spaces anymore.

動作: 盡快更換磁碟機以確保完整的復原能力。Action: Replace the drive as soon as possible to ensure full resiliency.
裝置硬體故障Device hardware failure 此磁碟機發生硬體故障。There was a hardware failure on this drive.

動作: 盡快更換磁碟機以確保完整的復原能力。Action: Replace the drive as soon as possible to ensure full resiliency.
正在更新韌體Updating firmware Azure Stack Hub 正在更新磁碟機上的韌體。Azure Stack Hub is updating the firmware on the drive. 這是暫時性的狀態,持續時間通常少於一分鐘,且在此期間,集區中的其他磁碟機將會處理所有的讀取和寫入。This state is temporary and usually lasts less than a minute and during which time other drives in the pool handle all reads and writes.

動作: 等到 Azure Stack Hub 完成更新後,再查看其後的狀態。Action: Wait for Azure Stack Hub to finish the updating and check the status afterward.
啟動中Starting 磁碟機正在進行作業準備。The drive is getting ready for operation. 這應該是暫時性的狀態 - 完成後,磁碟機應該就會轉換成不同的操作狀態。This state should be temporary—once complete, the drive should transition to a different operational state.

動作: 等到 Azure Stack Hub 完成作業後,再查看其後的狀態。Action: Wait for Azure Stack Hub to finish the operation and check the status afterward.

磁碟機無法進入集區的原因Reasons a drive can't be pooled

有些磁碟機純粹就是因為尚未就緒,而無法放入 Azure Stack Hub 存放集區中。Some drives just aren't ready to be in Azure Stack Hub storage pool. 您可以查看磁碟機的 CannotPoolReason 屬性,以了解磁碟機無法放入集區中的原因。You can find out why a drive isn't eligible for pooling by looking at the CannotPoolReason property of a drive. 下表將對各種原因的說明稍作補充。The following table gives a little more detail on each of the reasons.

原因Reason 描述Description
硬體不符合規範Hardware not compliant 磁碟機未列於使用健康情況服務指定的合格儲存體模型清單中。The drive isn't in the list of approved storage models specified by using the Health Service.

動作: 更換具有新磁碟的磁碟機。Action: Replace the drive with a new disk.
韌體不符合規範Firmware not compliant 實體磁碟機上的韌體未列於使用健康情況服務指定的合格韌體修訂版本清單中。The firmware on the physical drive isn't in the list of approved firmware revisions by using the Health Service.

動作: 更換具有新磁碟的磁碟機。Action: Replace the drive with a new disk.
由叢集使用中In use by cluster 磁碟機目前由容錯移轉叢集使用中。The drive is currently used by a Failover Cluster.

動作: 更換具有新磁碟的磁碟機。Action: Replace the drive with a new disk.
卸除式媒體Removable media 磁碟機歸類為卸除式磁碟機。The drive is classified as a removable drive.

動作: 更換具有新磁碟的磁碟機。Action: Replace the drive with a new disk.
狀況不良Not healthy 磁碟機未處於良好狀態,可能需要更換。The drive isn't in a healthy state and might need to be replaced.

動作: 更換具有新磁碟的磁碟機。Action: Replace the drive with a new disk.
容量不足Insufficient capacity 有分割區佔用磁碟機上的可用空間。There are partitions taking up the free space on the drive.

動作: 更換具有新磁碟的磁碟機。Action: Replace the drive with a new disk. 如果您必須使用此磁碟,請從系統中移除磁碟,並確定磁碟上沒有任何有用的資料,再清除磁碟,然後重新安裝磁碟。If you must use this disk, remove the disk from the system, make sure there's no useful data on the disk, erase the disk, and then reseat the disk.
驗證進行中Verification in progress 健康情況服務正在檢查磁碟機或磁碟機上的韌體,確認是否已通過使用核准。The Health Service is checking to see if the drive or firmware on the drive is approved for use.

動作: 等到 Azure Stack Hub 完成此程序後,再查看其後的狀態。Action: Wait for Azure Stack Hub to finish the process, and check the status afterward.
驗證失敗Verification failed 健康情況服務無法檢查磁碟機或磁碟機上的韌體以確認是否已通過使用核准。The Health Service couldn't check to see if the drive or firmware on the drive is approved for use.

動作: 請連絡支援人員。Action: Contact Support. 在此之前,請先參考 https://aka.ms/azurestacklogfiles 的指引開始進行記錄檔收集程序。Before you do, start the log file collection process using the guidance from https://aka.ms/azurestacklogfiles.
離線Offline 磁碟機已離線。The drive is offline.

動作: 請連絡支援人員。Action: Contact Support. 在此之前,請先參考 https://aka.ms/azurestacklogfiles 的指引開始進行記錄檔收集程序。Before you do, start the log file collection process using the guidance from https://aka.ms/azurestacklogfiles.