Fan Failure
Cloud Execution Environment

Contents

1Introduction
1.1Alarm Description
1.2Prerequisites

2

Procedure
2.1Actions

Reference List

1   Introduction

This instruction concerns alarm handling.

This document is only applicable for systems using Extreme switches configured dynamically by the Cloud Execution Environment (CEE). Refer to the Configuration File Guide for more information about the CEE configuration types.

1.1   Alarm Description

The alarm is issued by the Managed Object (MO) Fan when malfunction occurs in one or more cooling fans in the physical switch.

The severity of the alarm is CRITICAL.

The possible alarm causes and fault locations are described in Table 1.

Table 1    Alarm Causes

Alarm
Cause

Description

Fault
Reason

Fault
Location

Impact

Cooling fan or fans are down

Failure of one or more cooling fans in physical switch.

Fan module is faulty

Switch fan module

Cooling
malfunction

The following is the consequence for the node if the alarm is not solved:

The alarm attributes are listed in Table 2.

Table 2    Alarm Attributes

Attribute Name

Attribute Value

Major Type

193

Minor Type

2031685

Managed Object Class

Fan

Managed Object Instance

Region=<name_of_the_region>,
Equipment=<equipment_id>,
TopOfRackSwitch=<switch_id>,
Fan=<fan_id>

Specific Problem

Fan Failure

Event Type

equipmentAlarm (5)

Probable Cause

m3100CoolingFanFailure(107)

Additional Text

Failure of one or more cooling fans in ToR switch.(1)

Severity

CRITICAL (3)

(1)  Top of Rack (ToR) stands for the physical switch.


1.2   Prerequisites

This section provides information on the documents, tools, and conditions that apply to the procedure.

1.2.1   Documents

Before starting this procedure, ensure that the following document is available:

Summit Family Hardware Installation Guide, Reference [1].

1.2.2   Tools

No tools are required.

1.2.3   Conditions

No conditions.

2   Procedure

This section describes the procedure to follow when this alarm is received.

2.1   Actions

Perform the following:

  1. Locate the faulty fan module or modules by using the <fan_id> attribute from the Source field of the alarm.
  2. Confirm that the fan module or modules are not functioning properly, or are down.
  3. Replace the failed fan module or modules in the switch by referring to the instructions in Summit Family Hardware Installation Guide, Reference [1].
  4. Confirm that the alarm has ceased.

    If the alarm ceases, exit this procedure.

    If the alarm persists, proceed to Step 5.

  5. Collect troubleshooting data as described in the Data Collection Guideline.
  6. Consult the next level of maintenance support.

    Further actions are outside the scope of this instruction.

  7. The job is completed.

Reference List

[1] Summit Family Hardware Installation Guide for Switches Supported by ExtremeXOS 16 and earlier, http://documentation.extremenetworks.com/summit_16/downloads/SummitFamily_HW_Install.pdf, 121141-00