How do we resolve disks stuck in stopping maintenance mode

Question

Hi there, we run Windows Server 2022 Datacenter in a 3-node cluster (running Storage Spaces Direct). The disks in one of the nodes are all stuck in 'Stopping maintenance mode, OK'. We have tried restarting the node in question,

Accepted Answer

Hi, I have a resolution for this, cobbled together from various forums. As long as you are certain the status is not a real issue and is just cosmetic / false labelling, you can clear the flags so that your disks show 'OK' in Windows Admin Center (WAC).

We were experiencing this exact issue: WAC was showing 'Stopping maintenance mode, OK', but Get-StorageJob, Get-ClusterStorageSpacesDirect, Get-StorageHealthReport and Get-PhysicalDisk from PowerShell all reported the disk operational status as OK with no jobs running, so we presumed this was cosmetic on the WAC side.

The fix involves importing a PowerShell script, identifying the disks concerned and repeating a command for each affected disk.

The script to import is called 'Clear-PhysicalDiskHealthData.ps1', downloadable from https://go.microsoft.com/fwlink/?linkid=2034205 .
You can then run Get-PhysicalDisk | Select-Object SerialNumber,UniqueID to show all your disk IDs.
Once you have the ID of the disk you want to resolve, run Clear-PhysicalDiskHealthData -Intent -Policy -UniqueID xxxxx -Verbose -Force (replacing xxxxx with your disk ID).

And that's that: the disk now shows as OK in WAC (give it 5 minutes to refresh). I think this is caused by a recent-ish Windows update, as one server in the cluster hasn't received the latest update yet and is fine.
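The steps in this answer could be scripted roughly as follows. This is a minimal sketch, not a tested procedure: the folder C:\Temp and the contents of $affectedIds are placeholders, and it assumes you have already confirmed from PowerShell that the status is purely cosmetic.

```powershell
# Assumption: the script from the link above was saved to C:\Temp (hypothetical path).
Import-Module 'C:\Temp\Clear-PhysicalDiskHealthData.ps1'

# List every disk with its serial number and unique ID so the affected ones can be picked out.
Get-PhysicalDisk |
    Sort-Object FriendlyName |
    Select-Object FriendlyName, SerialNumber, UniqueId, OperationalStatus, HealthStatus |
    Format-Table -AutoSize

# Placeholder: replace with the UniqueId values of the disks shown as stuck in WAC.
$affectedIds = @('xxxxx')

foreach ($id in $affectedIds) {
    # Same command as in the answer, repeated once per affected disk.
    Clear-PhysicalDiskHealthData -Intent -Policy -UniqueId $id -Verbose -Force
}
```

Keeping the IDs in an explicit list, rather than looping over every disk, means the flags are only cleared on disks you have checked by hand.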

Answer

Hi VMAX,

Thanks for your post. In general, start with these steps:

Confirm the make and model of SSD is certified for Windows Server 2016 and Windows Server 2019 by using the Windows Server Catalog. Confirm with the vendor that the drives are supported for Storage Spaces Direct.
Inspect the storage for any faulty drives. Use storage management software to check the status of the drives. If any of the drives are faulty, work with your vendor.
Update the storage and drive firmware if necessary. Ensure that the latest Windows Updates are installed on all nodes. You can get the latest updates for Windows Server 2016 from Windows 10 and Windows Server 2016 update history. Get the latest updates for Windows Server 2019 from Windows 10 and Windows Server 2019 update history.
Update the network adapter drivers and firmware.
Run cluster validation and review the Storage Spaces Direct section. Ensure that the drives you use for the cache are reported correctly and have no errors.
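As a rough sketch of the firmware and validation checks in the list above: the node names are placeholders, and the test categories passed to -Include are an assumption about which validation sections are relevant, so adjust them to your environment.

```powershell
# Hypothetical node names; replace with the nodes in your cluster.
$nodes = 'Node01', 'Node02', 'Node03'

# Inventory drive model, firmware and role. Cache drives are reported with Usage 'Journal'.
Get-PhysicalDisk |
    Sort-Object Model |
    Select-Object FriendlyName, Model, FirmwareVersion, MediaType, Usage, HealthStatus |
    Format-Table -AutoSize

# Run cluster validation including the Storage Spaces Direct tests, then review the report.
Test-Cluster -Node $nodes -Include 'Storage Spaces Direct', 'Inventory', 'Network', 'System Configuration'
```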

Reference: Storage Spaces Direct troubleshooting | Microsoft Learn

Also, I have found a similar issue with the same error, which I hope is helpful for your reference: Storage Spaces Direct / S2D - Disks stuck in maintenance mode (nuvotex.de)

Best Regards,

Ian Xue


Answer

First, make sure that what you see is not a UI glitch. I prefer using PowerShell for that purpose. Run the following commands against your cluster, storage pools, and storage volumes to check whether the error persists.

Get-ClusterStorageSpacesDirect
Get-StorageSubSystem Cluster* | Get-StorageHealthReport

Consider checking if any storage optimization jobs are currently running on the affected node.

Get-StorageJob
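A minimal sketch that pulls these checks together. It only reads state and changes nothing; the node name is a placeholder.

```powershell
# Hypothetical name of the affected node.
$nodeName = 'Node01'

# Any repair or rebalance jobs still running? No output means no jobs.
Get-StorageJob

# Cluster-wide health as seen by the storage subsystem.
Get-StorageSubSystem Cluster* | Get-StorageHealthReport

# Disks physically attached to the affected node, with their status.
Get-StorageNode |
    Where-Object Name -like "$nodeName*" |
    Get-PhysicalDisk -PhysicallyConnected |
    Select-Object FriendlyName, SerialNumber, OperationalStatus, HealthStatus |
    Format-Table -AutoSize
```

If PowerShell reports the disks as OK and no jobs are listed while the UI still shows a maintenance status, that points towards a display problem rather than a storage fault.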

Since Storage Spaces Direct is known for those kinds of problems, your best course of action would be to rebuild the cluster node partially (evict, clean storage pools, rejoin) or entirely from scratch. That approach is guaranteed to fix the problem and, in most cases, takes less time than further investigation (one of the benefits of an HCI environment).

Alternatively, you may consider replacing Storage Spaces Direct with virtual SAN software (https://www.starwindsoftware.com/vsan) that offers the same feature set but runs isolated from the Microsoft Failover Cluster subsystem, which makes it more reliable and easier to maintain.
