Health Check Management

Contents

1Introduction

2

Functions and Concepts
2.1Health Check Report File
2.2Report File Housekeeping
2.3Types of Operation

3

Managed Object Model

4

Configuration Management

5

Security Management

1   Introduction

This document provides an overview of the management model and concepts associated with the Health Check Management (HCM) managed area.

A managed area is represented by a group of Managed Object Classes (MOCs) within the Managed Object Model (MOM).

2   Functions and Concepts

HCM provides a management interface for reporting a summary of the Managed Element (ME) current health state highlighting any deviations from a normal behavior. It can be used to attend upgrades and to support preventive maintenance and problem resolution. The health status is obtained by verifying a set of rules.

A rule is a formal representation of a check. It contains information about what is checked: the command to be executed, the checks performed on the command printouts, and a recommendation about what to do if the result is not as expected. A rule, according to its definition, belongs to at least one category. A rule has an assigned severity (CRITICAL or WARNING) that helps understanding the severity if the check result of the rule fails. Optionally, a rule can be designed to accept input parameters through the model, that are used during the evaluation phase. A rule whose evaluation is performed using input parameters is a customizable rule, otherwise it is a simple rule. Input parameters of a rule allow the application node to customize, according to specific node characteristics, the rule check by specifying proper values to be used during the rule evaluation phase.

Rules are defined by the application node and written in XML format.

The evaluation of the health status is done through the execution of a health check job. It is only possible to execute one health check job at a time.

A health check job is created manually. It is associated to rule categories, executes all the installed rules belonging to the related categories, and computes the health status of the ME by checking the output of the rule commands against the defined evaluation criteria.

Possible ME statuses are as follows:

Health check job execution can be done in either of the following ways:

The ME status, as computed from the health check job, is available in the following ways:

2.1   Health Check Report File

The result of a health check job execution is logged in a report file. It is an XML file available by default in a predefined directory on the FileM MO but it is possible to set a local or remote used-defined location. The report file contains information about the job, the result of executed rules, and computed ME status. It is composed of the following two sections:

The body has two subsections, one for failed rules and one for successful rules, where details about each executed rule are provided.

2.1.1   Report File XML Structure

A detailed description of the XML schema for the report file is shown in Figure 1.

Figure 1   Schema Diagram for Report File

The report file starts with the tag <Report>. Only one such tag is present.

Under the <Report> tag, the children tags listed in Table 1 are present.

Table 1    <Report> Tag Children

Element Name

Cardinality

Description

HcJobId

1

This element provides the name of the job whose execution generated the report file.

RulesCategory

1..*

An instance of this tag is present for each rule category associated to the job whose execution generated the report file.

Date

1

The date and time of job execution.

Status

1

The ME state, as result of the health check job execution.

FailedRules

1

Contains information about the rules whose check evaluation failed during job execution.

SuccessRules

1

Contains information about the rules whose check evaluation succeeded during job execution.

The <FailedRules> and <SuccessRules> tags have the child tag listed in Table 2.

Table 2    <FailedRules> and <SuccessRules> Tag Children

Element Name

Cardinality

Description

HcRule

0..*

General information about the rule executed by the job.

The <HcRule> has two different definitions according to whether it is under the <FailedRules> tag or SuccessRules tag.

Under the <FailedRules> tag, the <HcRule> element has the attributes listed in Table 3.

Table 3    <HcRule> Attributes

Attribute Name

Mandatory

Description

id

yes

The identifier of the rule.

name

yes

The name of the rule.

The children tags are listed in Table 4.

Table 4    <HcRule> for <FailedRules> Children

Element Name

Cardinality

Description

Description

1

Contains a short description of the purpose of the rule.

Reason

1

Contains the error message to be shown when the rule result is not the expected one.

Severity

1

The severity of the rule.

RecommendedAction

1

Recommended action in case of check failure.

InputParameters

0..*

One of these tags is present for each, if any, user-defined value set for customizable rule.

Under the <SuccessRules> tag, the <HcRule> has the attributes listed in Table 3 and the children listed in Table 5.

Table 5    <HcRule> for <SuccesRules> Children

Element Name

Cardinality

Description

InputParameters

0..*

One of these tags is present for each, if any, user-defined value set for customizable rule.

2.2   Report File Housekeeping

The report file, reporting details about health check job execution, is produced upon successful job completion. A compressed archive file, containing logs used for the rules evaluation, is also provided, to be used for further investigation. Both the report file and the compressed archive are located in the same directory that can be either the default location under the FileM MO or a user-defined one.

For the report files stored in the default location under the FileM MO, HCM provides a housekeeping policy. That is, the maximum number of coupled report files with related compressed archive file is per default set to 10. A different value can be set by the user. If a new coupled report file and archive file are produced, exceeding the maximum number, the oldest couple is deleted.

2.3   Types of Operation

HCM supports the following operations:

For a description of how to lock or unlock a scheduler event and how to delete a scheduled execution, refer to the following documents:

3   Managed Object Model

The HCM managed area is represented in the Managed Object Model (MOM) as follows:

ManagedElement
   +-SystemFunctions
      +-HealthCheckM
         +-HcJob
            +-HcJobScheduler
               +-HcCalendarBasedPeriodicEvent
               +-HcPeriodicEvent
               +-HcSingleEvent
         +-HcRule

For general information about the MOM, MOCs, MOs, cardinality, and related concepts, refer to Managed Object Model User Guide.

The HCM MOCs are described in Table 6.

Table 6    HCM Managed Object Class Descriptions

Managed Object Class

Description

HealthCheckM

The root of the HCM model.


The maxNoOfReportFiles attribute holds the maximum amount of coupled report file and related compressed archive logs allowed in the default output directory under the FileM MO. It is used to manage the report file housekeeping policy for the default output file location under the FileM MO.

HcJob

Represents a health check job that can be executed on the ME.


The progressReport attribute is intended to report details about the job execution progress.

HcRule

Represents the health check rule. It contains detailed information necessary to define each specific health check rule.


The HcRule MO specifies the following:


  • The rule severity: each rule has an associated severity that can be warning or critical

  • The rule category: each rule belongs to at least one health check category

  • Default values for parameters of customizable rules.

HcJobScheduler

Represents the health check job scheduler to, at a specified time, start a job automatically. The following three scheduling policies are supported:


  • Calendar-based periodic event

  • Periodic event

  • Single event

HcCalendarBasedPeriodicEvent

Represents a calendar-based event for executing a health check job periodically.

HcPeriodicEvent

Represents a time interval-based event for executing a health check job periodically.

HcSingleEvent

Represents a date or time-based event for executing a health check job once.

4   Configuration Management

HCM is accessed using NETCONF or the Ericsson Command-Line Interface (ECLI) to manipulate the Management Information Base (MIB).

The following operations can be performed by the user and are described in Operating Instructions using the ECLI:

Basic Health Check Operations

Schedule Health Check Job

Manage Scheduled Health Check Jobs

5   Security Management

HCM access is managed by an authentication and authorization mechanism. For each HCM role, specific rules are applied to determine the scope of what is accessible.

The following two HCM roles are defined:

Once authenticated as a SystemAdministrator, full access (read, write, execute) is granted to the HealthCheckM MO and its attributes and actions.

Once authenticated as a SystemReadOnly, read access is granted to the HealthCheckM MO and its attributes.

Note:  
If the SystemReadOnly role is not defined, the related access rule is not installed.



Copyright

© Ericsson AB 2015. All rights reserved. No part of this document may be reproduced in any form without the written permission of the copyright owner.

Disclaimer

The contents of this document are subject to revision without notice due to continued progress in methodology, design and manufacturing. Ericsson shall have no liability for any error or damage of any kind resulting from the use of this document.

Trademark List
All trademarks mentioned herein are the property of their respective owners. These are shown in the document Trademark Information.

    Health Check Management