1 Introduction
This document describes the Storage Engine, Automatic Handling of Network Isolation not Completed for PLDB alarm and the troubleshooting steps to take when it is raised.
1.1 Alarm Description
This alarm is raised when the Automatic Handling of Network Isolation process has failed to repair the Processing Layer Database (PLDB) cluster inconsistency between former and current master replica servers.
The alarm is issued in the following situations:
The possible alarm causes and the corresponding fault reasons, fault locations and impacts are described in Table 1.
| Alarm Cause | Description | Fault Reason | Fault Location | Impact |
|---|---|---|---|---|
| Selective Replica Check task was not completed. | Automatic Handling of Network Isolation process was unsuccessful in repairing PLDB cluster inconsistency between former and current master replica servers. | | Slave (Selective Replica Check) replica server. | Rescuing non-replicated data from former master has failed. |
| Data Repair task was not completed. | Automatic Handling of Network Isolation process was unsuccessful in repairing PLDB cluster inconsistency between former and current master replica servers. | | Master (Data Repair) replica server. | Rescuing non-replicated data from former master has failed. |
| Triggering Reconciliation task was not completed. | Automatic Handling of Network Isolation process was unsuccessful in adding local DS units that were elected master for their DSG to the Reconciliation Pending Task List. | | Master replica server. | No data reconciliation process. |
Note: An alarm can appear as a result of a maintenance activity.
The following are the consequences for the node if the alarm is not resolved:
The alarm attributes are listed and explained in Table 2.
| Attribute Name | Attribute Value |
|---|---|
| Auto Cease | No |
| Module | STORAGE-ENGINE |
| Error Code | 29 |
| Timestamp First | Date and time when the alarm was raised for the first time. |
| Repeated Counter | Number which indicates how many times the alarm was raised. |
| Timestamp Last | Date and time of the most recent alarm raised. |
| Resource ID | .1.3.6.1.4.1.193.169.1.1.29.<Timestamp> |
| Alarm Model Description | Automatic Handling of Network Isolation not Completed, Storage Engine. |
| Alarm Active Description | Storage Engine (PLDB): Automatic Handling of Network Isolation task <add_info> was not completed <add_info2> (task <taskid>, blade <Blade>), uuid: <uuid> |
| ITU Alarm Event Type | processingErrorAlarm (4) |
| ITU Alarm Probable Cause | softwareError (163) |
| ITU Alarm Perceived Severity | (4) – Major |
| Originating source IP | Node IP where the alarm was raised. |
| Sequence Number | Number which indicates the order in which the alarms are raised. |
In Table 2, the indicated variables are as follows:
For further information about attribute descriptions, refer to CUDB Node Fault Management Configuration Guide.
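The Alarm Active Description in Table 2 follows a fixed template, so its variables can be pulled out of a received alarm text when deciding which procedure applies. The following is a minimal sketch, not a product tool: the function name `parse_alarm_description`, the field formats assumed by the regular expression, and every value in the sample text are hypothetical, because this document does not specify the format of `<add_info>`, `<add_info2>`, `<taskid>`, `<Blade>`, or `<uuid>`.

```python
import re

# Pattern mirroring the Alarm Active Description template from Table 2.
# The character classes used for each field are assumptions; the real
# formats of the variables are not specified in this document.
ALARM_PATTERN = re.compile(
    r"Storage Engine \(PLDB\): Automatic Handling of Network Isolation task "
    r"(?P<add_info>.+?) was not completed\s*(?P<add_info2>.*?)\s*"
    r"\(task (?P<taskid>[^,]+), blade (?P<blade>[^)]+)\), uuid: (?P<uuid>\S+)"
)


def parse_alarm_description(text):
    """Return the template fields as a dict, or None if the text does not match."""
    match = ALARM_PATTERN.search(text)
    if match is None:
        return None
    return {name: value.strip() for name, value in match.groupdict().items()}


# Hypothetical sample text; every field value here is invented for illustration.
sample = (
    "Storage Engine (PLDB): Automatic Handling of Network Isolation task "
    "Data Repair was not completed due to timeout (task 42, blade blade_0_5), "
    "uuid: 123e4567-e89b-12d3-a456-426614174000"
)
fields = parse_alarm_description(sample)
print(fields)
```

If the text does not match the template, the function returns `None` rather than raising, so a caller can fall back to reading the alarm manually.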
1.2 Prerequisites
This section provides information on the documents, tools, and conditions that apply to the procedure.
1.2.1 Documents
This instruction references the following documents:
1.2.2 Tools
Not applicable.
1.2.3 Conditions
Not applicable.
2 Procedure
This section describes the procedure to follow when this alarm is received.
2.1 Actions for Selective Replica Check Task Was Not Completed
Perform the following steps:
2.2 Actions for Data Repair Task Was Not Completed
Do the following:
Cease the alarm manually.
Note: Full repair of system data cannot be guaranteed in this case.
2.3 Actions for Triggering Reconciliation Task Was Not Completed
Perform the following steps:
