Data Collection Guideline for SAPC
Ericsson Service-Aware Policy Controller

Contents

1General
1.1Product Scope
1.2Typographic Conventions
1.3Prerequisites
1.4Related Data Collection Guidelines

2

Workflow

3

Mandatory Data
3.1Data Collection

4

Data Collected Based on Specific Problem Types
4.1Database General Problems in the SAPC
4.2Geographical Redundancy Problems in the SAPC
4.3Software License Problems in the SAPC

5

Other Useful Information
5.1Performance Data Collector Reports

6

Appendix A: Severity/Priority

Glossary

Abstract

The purpose of this document is to instruct what troubleshooting data to collect and enclose in a Customer Service Request (CSR). This guideline also instructs the needed procedure to collect the needed information.


1   General

The purpose of this document is to instruct what troubleshooting data to collect and enclose in a Customer Service Request (CSR). This guideline also instructs the needed procedure to collect the needed information.

1.1   Product Scope

This guideline is applicable to the following product releases:

1.2   Typographic Conventions

This document uses the following typographic conventions:

Table 1    Typographic Conventions

Convention

Description

Example

User Input

A command you must enter in a Command-Line Interface (CLI) exactly as written.

ssh -p 830 sapcadmin@<OAM VIP> -t -s cli

Command Variables

Command variables, the values of which you must supply.

SC-<X>

System Elements

Command and parameter names, program names, path names, URLs, and directory names

The files are located in /cluster/storage/user/sapc.

Output Information

Text displayed by the system

Status OK

1.2.1   Commands

Along this document most of the commands to be entered by the operator are preceded by an expression such as <user>@SC-1:/> where:

Note:  
This convention can refer to any processor: SC-1, SC-2, PL-<X> where "X" must be the number of an existent Payload.

For example, if it is required to create a directory, named test in the SC-1 processor, the command is shown as:

sapcadmin@SC-1:/> mkdir /var/log/test

1.3   Prerequisites

Not Applicable.

1.4   Related Data Collection Guidelines

Not Applicable.

2   Workflow

The workflow for collecting troubleshooting data is as follows:

  1. Collect mandatory data that is needed in connection to any problems experienced. See Section 3.
  2. Collect specific data based on the type of problem that is experienced. See Section 4.
  3. Collect other useful information in case that is available within acceptable amount of time and effort. See Section 5

3   Mandatory Data

This section describes how to collect data that is mandatory for every type of problems related to the SAPC.

The data described in this chapter must always be included in a CSR.

3.1   Data Collection

To collect the required data, perform the following procedure:

  1. Collect all application and platform essential data with the sapcCollectInfo commands script.

3.1.1   Collecting Application and Platform Logs

The sapcCollectInfo command script collects the essential data from the whole SAPC system, including the following:

Software repository list: Software level running in the SAPC system.
Core MW information and SAPC logs: Collect Core MW information.

Collect the SAPC logs, further information in Logging Events.

PMF files: SAPC performance and measurements.
SAPC configuration files: SAPC components configuration.
Last restore Last System Data Restore executed.
IMM configuration SAPC cluster configuration.
SAPC current state Collect the log execution of sapcHealthCheck command script.
DBS logs files Collect the log files of the DBS component.
DBS heap statistics Collect the DBN record heap statistics.
Core Dump Files Core dump files are generated when certain faults, such as segmentation faults, occur.
Stderr files All Third Parties which are used by the SAPC give us the standard error output to a file per process.
C-diameter logs files Collect the log files of the C-diameter component.
RMF information SAPC cluster configuration in Resource Management Framework.
SS7 CAF information Collect the log files of the SS7 CAF component.

Do the following to collect the previous logs:

  1. Log on to the COM CLI of the SAPC node as described in System Administrator Guide
  2. Caution!

    The following command consumes high amount of storage space, in /cluster/storage

    Execute sapcCollectInfo as follows:

    sapcadmin@SC-1:~> sudo sapcCollectInfo

    Note:  
    The execution of sapcCollectInfo takes about 5–20 minutes.

  3. The command output file (sapc_collect_info_<date>_<time>.tgz) is located in the following folder:

    /cluster/storage

The output of the command must be similar to the next example:

Example 1  

sapcadmin@SC-1:~> sudo sapcCollectInfo

-----------------------------------
Collecting information from cluster
-----------------------------------

Collecting repository list

Collecting SwM & SwInventory information

Collecting cmw-collect-info

Logs archived in /cluster/storage/sapc_collect_info_2017-03-23_12-24-18/cmw-collect-info.tar.gz.

Collecting measurements

Collecting configuration files

Collecting information about last restore
No information about restore.

Collecting IMM configuration

Dumping current IMM state to XML file /cluster/storage/sapc_collect_info_2017-03-23_12-24-18/immdump.xml using XMLWriter

Collecting healthcheck information

Collecting DBS logs

Collecting DBS heap statistics

Collecting coredumps (level all)
Processing coredumps

Collecting stderr files

Collecting C-Diameter logs

Collecting RMF information

Collecting SS7 CAF information

Generating tar file

Information collected on file (also link 'sapc_collect_info.tgz' updated):
/cluster/storage/sapc_collect_info_2017-03-23_12-24-18.tgz

-------------------
Collection finished
-------------------

The size of tgz output file is variable, generally between 1GB-3GB.

The size of log files is generally between 100-900 MB.

For further information about the use of the command, execute it with the help switch (sudo sapcCollectInfo -h).

4   Data Collected Based on Specific Problem Types

The data described in this chapter must be included in a CSR, depending on what type of problem is experienced.

4.1   Database General Problems in the SAPC

4.1.1   Useful Information

Include any data that could be useful for the analysis of the reported problem. The possible cause of fault, abnormal behaviors detected in network, or adjacent nodes are additional useful information. More logs and information for the particular problem can be collected, according to SAPC Troubleshooting Guide.

4.2   Geographical Redundancy Problems in the SAPC

4.2.1   Problem Description

Any problem affecting the replication in the Geographical Redundancy.

4.2.2   Data to Be Collected

4.3   Software License Problems in the SAPC

4.3.1   Problem Description

Any problem related to software license installed on the License Manager.

4.3.2   Data to Be Collected

To extract the information related to installed software license, follow the procedure described in View License Information

To activate and collect LM Traces, follow Enabling LM Trace section, in LM Troubleshooting Guideline.

5   Other Useful Information

The data described in this chapter could be included in a CSR, depending on what type of problem is experienced and within acceptable amount of time and effort.

5.1   Performance Data Collector Reports

PDC collects performance information from the SAPC regularly and generates reports. PDC could also be used on demand without having to wait scheduled daily or monthly planned execution. There are three main types of reports: daily logs, monthly packages, and specific health check snapshots.

5.1.1   Daily Report

PDC collect operation gets the information from PMF files and creates the daily reports. Include any data that could be useful for the analysis of the reported problem. More information could be read at Performance Data Collection.

5.1.2   Monthly Report

PDC package operation gets the information from daily reports and creates a monthly tar.gz file. More information could be read at Performance Data Collection.

5.1.3   Health Check Report

PDC healthcheck operation compiles the SAPC node health check information into an XML report. More information could be read at Performance Data Collection.

6   Appendix A: Severity/Priority

The correct priority ensures that critical problems are fixed before less critical problems, therefore it is crucial to apply the following guidelines. It is important to observe that for an Emergency and High priority CSR that requires immediate attention, First Line Support (FLS) has to call the Global Front Office when the fault is present. Another important point related to CSR priority is that when the priority is not clear for both parts (FLS and Second Line Support (SLS)), they have to discuss it and get an agreement defining a common priority from start. The FLS can increase priority owing to commercial reasons at any time.

Note:  
If a discrepancy exists between the definition of Severity (Emergency/High/Medium/Low) in this document and the WLA/SLA agreed upon by the customer and Ericsson, the latest takes precedence.


Glossary

CLI
Command-Line Interface
 
COM
Common Operation and Maintenance
 
CSR
Customer Service Request
 
FLS
First Line Support
 
LOTC
Linux Open Telecom Cluster
 
OPI
Operating Instructions
 
PL
Pay load.
 
RMF
Resource Management Framework
 
SAPC
Ericsson Service-Aware Policy Controller
 
SC
System Controller
 
SLS
Second Line Support
 
VIP_OAM
Operation and Maintenance Virtual IP
 
PDC
Performance Data Collector
 
PMF
Performance Management Files