Power Supply Failure
Cloud Execution Environment

Contents

1Introduction
1.1Alarm Description
1.2Prerequisites

2

Procedure
2.1Actions

Reference List

1   Introduction

This instruction concerns alarm handling.

This document is only applicable for systems using Extreme switches configured dynamically by the Cloud Execution Environment (CEE). Refer to the Configuration File Guide for more information about the CEE configuration types.

1.1   Alarm Description

The Power Supply Failure alarm is issued by the Managed Object (MO) PowerSupply when one or more power supplies fail in the physical switch.

The severity of the alarm is CRITICAL.

The possible alarm causes and fault locations are described in Table 1.

Table 1    Alarm Causes

Alarm
Cause

Description

Fault
Reason

Fault
Location

Impact

Power supply or supplies are down

One or more power sources have failed in the physical switch. Presumably, a redundant power supply has taken over.

Power supply is faulty

Switch power supply

Power malfunction

The following is the consequence for the node if the alarm is not solved:

The alarm attributes are listed in Table 2.

Table 2    Alarm Attributes

Attribute Name

Attribute Value

Major Type

193

Minor Type

2031686

Managed Object Class

PowerSupply

Managed Object Instance

Region=<name_of_the_region>,
Equipment=<equipment_id>,
TopOfRackSwitch=<switch_id>,
PowerSupply=<power_supply_id>

Specific Problem

Power Supply Failure

Event Type

equipmentAlarm (5)

Probable Cause

powerSupplyFailure (78)

Additional Text

One or more sources of power has failed in the ToR switch. Presumably a redundant power supply has taken over.(1)

Severity

CRITICAL (3)

(1)  Top of Rack (ToR) stands for the physical switch.


1.2   Prerequisites

This section provides information on the documents, tools, and conditions that apply to the procedure.

1.2.1   Documents

Before starting this procedure, ensure that the following document is available:

Summit Family Hardware Installation Guide, Reference [1].

1.2.2   Tools

No tools are required.

1.2.3   Conditions

No conditions.

2   Procedure

This section describes the procedure to follow when the alarm is received.

2.1   Actions

Perform the following:

  1. Locate the faulty fan module or modules by using the <power_supply_id> attribute from the Source field of the alarm.
  2. Confirm that the power supply or supplies are not functioning properly, or are down.
  3. Replace the failed power supply or supplies in the switch by referring to the instructions in Summit Family Hardware Installation Guide, Reference [1].
  4. Confirm that the alarm has ceased.

    If the alarm ceases, exit this procedure.

    If the alarm remains, proceed to Step 5.

  5. Collect troubleshooting data as described in the Data Collection Guideline.
  6. Consult the next level of maintenance support.

    Further actions are outside the scope of this instruction.

  7. The job is completed.

Reference List

[1] Summit Family Hardware Installation Guide for Switches Supported by ExtremeXOS 16 and earlier, http://documentation.extremenetworks.com/summit_16/downloads/SummitFamily_HW_Install.pdf, 121141-00