S2D Nested resiliency: disable write caching on node down option not working?

gigo91 1 Reputation point
2021-02-02T09:17:31.077+00:00

Hi,
we wanted to test the optional extend to cache drives szenario with our S2D cluster according to this MS doc:
https://learn.microsoft.com/en-us/windows-server/storage/storage-spaces/nested-resiliency
But unfortonately the tolerate of a cache drive loss don't work for us.

Our szenario:
We have here a 2-node S2D cluster with SSDs and HDDs. (Win Srv 2019)
SSDs for cache and HDDs for capacity. (created automatically, 1:4 ratio)

Nested resiliency storage tier is configured with the following commands:

New-StorageTier -StoragePoolFriendlyName S2D* -FriendlyName NestedMirror -ResiliencySettingName Mirror -MediaType HDD -NumberOfDataCopies 4

2 nested resiliency volumes are created:

New-Volume -FriendlyName csv1 -FileSystem CSVFS_ReFS -StoragePoolFriendlyName "S2D on cluster1" -StorageTierFriendlyNames NestedMirror -StorageTierSize 400GB

New-Volume -FriendlyName csv2 -FileSystem CSVFS_ReFS -StoragePoolFriendlyName "S2D on cluster1" -StorageTierFriendlyNames NestedMirror -StorageTierSize 400GB

We set cache settings Read+Write for SSDs and HDDS according to the MS doc: https://learn.microsoft.com/en-us/windows-server/storage/storage-spaces/understand-the-cache

At least we wanted to disable the (write)cache, if one node goes down:

Get-StorageSubSystem Cluster* | Set-StorageHealthSetting -Name "System.Storage.NestedResiliency.DisableWriteCacheOnNodeDown.Enabled" -Value "True"

Then we made a test:

-VM on the cluster is running.
--shut down node 1
-VM still running.
--after 30 min get-clusters2d -> CacheModeHDD + CacheModeSDD chanced automatically to ReadOnly
-VM still running
--Then we tried to remove 1 ssd from node 2.
-VM stopped!

Any ideas why our VM is stopping?
According to the MS doc it should tolerate a cache drive loss.

Thanks

Hyper-V
Hyper-V
A Windows technology providing a hypervisor-based virtualization solution enabling customers to consolidate workloads onto a single server.
2,560 questions
Windows Server Clustering
Windows Server Clustering
Windows Server: A family of Microsoft server operating systems that support enterprise-level management, data storage, applications, and communications.Clustering: The grouping of multiple servers in a way that allows them to appear to be a single unit to client computers on a network. Clustering is a means of increasing network capacity, providing live backup in case one of the servers fails, and improving data security.
962 questions
Windows Server Storage
Windows Server Storage
Windows Server: A family of Microsoft server operating systems that support enterprise-level management, data storage, applications, and communications.Storage: The hardware and software system used to retain data for subsequent retrieval.
632 questions
{count} votes

1 answer

Sort by: Most helpful
  1. Xiaowei He 9,871 Reputation points
    2021-02-03T07:55:45.88+00:00

    Hi,

    Since S2D is complex to troubleshoot, it's recommended to open a case with MS for deep troubleshooting.

    Below is the link to open a case with MS:

    https://support.microsoft.com/en-us/gp/customer-service-phone-numbers

    Thanks for your time!
    Best Regards,
    Anne

    -----------------------------

    If the Answer is helpful, please click "Accept Answer" and upvote it.

    Note: Please follow the steps in our documentation to enable e-mail notifications if you want to receive the related email notification for this thread.

    0 comments No comments