MAP 5350: Powering off a SAN Volume Controller node helps you power off a single SAN Volume Controller node to perform a service action without disrupting the host's access to disks.
Powering off a single node will not normally disrupt the operation of a SAN Volume Controller cluster. This is because, within a SAN Volume Controller cluster, nodes operate in pairs called an I/O group. An I/O group will continue to handle I/O to the disks it manages with only a single node powered on. There will, however, be degraded performance and reduced resilience to error.
Care must be taken when powering off a node to ensure the cluster is not impacted more than it need be. If the procedures outlined here are not followed, it is possible your application hosts will lose access to their data or, in the worst case, data will be lost.
It is preferable to use either the SAN Volume Controller Console or the command-line interface (CLI) to power off a node, as these methods provide a controlled handover to the partner node and provide better resilience to other faults in the system.
If a node is offline or not a member of a cluster, it must be powered off using the power button.
In some circumstances, the reason you are powering off the node might make meeting these conditions impossible; for instance, if you are replacing a broken fibre channel card, the virtual disks will not be showing an online status. You should use your judgment to decide when it is safe to proceed when a condition has not been met. Always check with the system administrator before proceeding with a power off that you know will disrupt I/O access, as they might prefer to either wait until a more suitable time or suspend the host applications
To ensure a smooth restart, a node must save the data structures it cannot recreate to its local, internal, disk drive. The amount of data it saves to local disk can be high, so this operation might take several minutes. Do not attempt to interrupt the controlled power off.
This topic describes how to power off a node using the SAN Volume Controller Console.
At the end of this process, the node powers off.
This topic describes how to power off a node using the SAN Volume Controller CLI.
Do not use the power control button to power off a node unless it is an emergency or you have been directed to do so by another procedure.
If you must use this method, notice in Figure 2 that each SAN Volume Controller model
type has a power control button
on the front.
When you have determined it is safe to do so, press and immediately release the power button. The front panel display changes to display Powering Off, and a progress bar is displayed.
The 2145-CF8 requires that you remove a power button cover before you can press the power button. The 2145-8A4, the 2145-8G4, the 2145-8F4, or 2145-8F2 might require you to use a pointed device to press the power button.
If you press the power button for too long, the node cannot write all the data to its local disk. An extended service procedure is required to restart the node, which involves deleting the node from the cluster and adding it back into the cluster.
When a node is powered off by using the power button (or because of a power failure), the partner node in its I/O group immediately stops using its cache for new write data and destages any write data already in its cache to the SAN attached disks. The time taken by this destage depends on the speed and utilization of the disk controllers; it should complete in less than 15 minutes, but it could be longer, and it cannot complete if there is data waiting to be written to a disk that is offline.
If a node powers off and restarts while its partner node continues to process I/O, it might not be able to become an active member of the I/O group immediately. It has to wait until the partner node completes its destage of the cache. If the partner node is powered off during this period, access to the SAN storage that is managed by this I/O group is lost. If one of the nodes in the I/O group is unable to service any I/O, for example, because the partner node in the I/O group is still flushing its write cache, the VDisks that are managed by that I/O group will have a status of Degraded.