1 Introduction
This document describes how to perform the troubleshooting procedure in the vMRF.
1.1 Documents
Before starting this procedure, ensure that the following documents have been read:
1.1.1 Conditions
Certain troubleshooting activities can have an impact on the node performance. For example, trace or log activation can be disturbing traffic and is not recommended without first consulting next level of maintenance support.
2 Troubleshooting Procedures
This section describes troubleshooting information for vMRF.
Problems identified that cannot be solved by using this document must be reported to the next level of maintenance support. This is to result in a Customer Service Report (CSR).
The details of the trouble reporting process is outside the scope of this document.
A manual recovery flow in Manual Recovery Flow presents a generic workflow to identify and solve problems if possible, or to collect useful data.
Trouble Cases describes specific trouble cases for various scenarios. These cases often utilize the manual recovery flow as well.
Log files of access and authorization events in the system can be collected by using the journalctl command. For more information, refer to vMRF Security Management.
It is recommended to periodically export and store the configuration of the node outside the VNF as a backup for a possible re-deployment. For more information, refer to vMRF Backup and Restore Guideline.
2.1 Manual Recovery Flow
This section describes the recommended manual recovery flow for troubleshooting purposes.
Steps
2.2 Common Procedures
This section describes procedures used during troubleshooting.
2.2.1 vMRF Status Check
This procedure describes how to verify the vMRF deployment.
The status check shows the status of all VMs in the cluster. A VM can be identified by its UUID. The UUID is visible:
in the uuid attribute of the ComputeResource MO, that is referenced from the MrfInstance MO, that represents the VM
in the cloud management tool for each VM
2.2.1.1 vMRF Status Check on Openstack
The following example shows the printout of a successful status check:
Running command: "verify_vmrf_node_status.py" on localhost
eth0: OK
eth1: OK
eth2: OK
SC role: ACTIVE
CoreMW: OK
COM: OK, RUNNING
MrfDirector: OK, RUNNING
CliDaemon: OK
IpPipeline: OK
TC-MPD: OK
MrfAgent: OK
CloudInit: OK
SEC-CERT: OK
neighbourdetection: OK
3 Trouble Cases
This section describes trouble cases for vMRF.
Follow the troubleshooting workflow as shown in Figure 1.
3.1 Deployment Related Trouble Cases
This section describes trouble cases related to deployment.
3.1.1 VNF Does not Start
3.1.1.1 Image Extraction Problem
Symptom:
Problem during image extraction.
Possible Cause:
Image is corrupted.
Procedure:
Do the following:
Steps
- Verify the image using the MD5 checksum file that is provided by Ericsson.
- If needed, repeat the download and extraction of the vMRF package as described in the corresponding deployment guide.
- If the problem still exists, contact Ericsson support.
3.1.1.2 vMRF Instantiation not Possible in OpenStack
Symptom:
Heat stack creation failed due to error: No valid host found. There are not enough hosts available.
Possible Causes:
Low vCPU resources
Low memory
Problem in network allocation. Compute nova log shows: Failed to allocate the network(s)
Procedure:
Do the following:
Steps
- In case of network allocation problem, check the neutron logs and configuration. Correct the network configuration.
- Ensure that the environment fulfills hardware, software, and network requirements. The main requirements are listed in vMRF Infrastructure Requirements.
- Follow the deployment instruction described in Deployment Guide for OpenStack.
- If the problem still exists, contact Ericsson support.
3.1.2 Cyclic Kernel Restart
Symptom:
Cyclic kernel restart.
Procedure:
Do the following:
Steps
- Perform the manual recovery flow procedure, as described in Manual Recovery Flow.
- If the problem still exists, contact Ericsson support.
3.1.3 VM Stuck
Symptoms:
ssh connection to VM is not possible:
ssh mrsv-admin@192.160.112.15 ssh: connect to host 192.160.112.15 port 22: No route to host
vMRF is disabled in vMTAS:
>show ManagedElement=1,MtasFunction=MtasFunction,⇒ MtasMediaFramework=0,MtasMrf=0,MtasMpController=0,MtasMrfpNode=1 MtasMrfpNode=1 mtasMrfpNodeAdministrativeState=UNLOCKED mtasMrfpNodeMId="[10.52.58.222]:2944" mtasMrfpNodeOperationalState=DISABLED
VM console is inaccessible or console shows problems
Procedure:
Steps
- Perform the manual recovery flow procedure, as described in Manual Recovery Flow.
- If the problem still exists, contact Ericsson support.
3.1.4 No Console Connection in Openstack
Symptoms:
Console is not available.
Procedure:
Do the following:
Steps
- Restart OpenStack nova services.
3.1.5 No ssh Connection into VM
3.1.5.1 OpenStack
Symptom:
No route to host.
ssh mrsv-admin@192.160.112.15 ssh: connect to host 192.160.112.15 port 22: No route to host
Possible Causes:
Procedure:
Steps
- Check console connection.
- Check if there is an IP address for eth1.
- Check if the floating IP is associated with the VM.
- Check if ssh is enabled in the security groups of the VM.
- Check connectivity and routing firewalls towards the cloud environment.
- Perform the manual recovery flow procedure, as described in Manual Recovery Flow.
- If the problem still exists, contact Ericsson support.
3.1.6 Wrong cloud-init Syntax
Symptom:
cloud-init process indicates problem.
Procedure:
Steps
- Check the user-data.txt file delivered in the release package. Modify it if needed.
- If the problem still exists, contact Ericsson support.
3.1.7 No Running MRF Processes on SC
Symptom:
Status check shows following printout:
Running command: "verify_vmrf_node_status.py" on localhost eth0: OK eth1: OK eth2: OK SC role: not_available CoreMW: ERROR COM: OK, NOT RUNNING MrfDirector: OK, NOT RUNNING CliDaemon: OK IpPipeline: OK TC-MPD: OK MrfAgent: ERROR CloudInit: NOT RUN YET SEC-CERT: OK neighbourdetection: OK
Procedure:
Steps
- Perform the manual recovery flow procedure, as described in Manual Recovery Flow.
- If the problem still exists, contact Ericsson support.
3.2 Scaling Related Trouble Cases
This section describes trouble cases related to scaling.
3.2.1 vMRF VM Joins Wrong Cluster
Symptoms:
A vMRF VM joins a different network cluster
mrsv-admin@42198368-bedd-898f-0d15-533ee8ad7dc4:~$ sudo journalctl | grep tipc Jan 9 00:42:18 kontron-am4024e kernel: [ 36.890082] tipc: Started in network mode Jan 9 00:42:18 kontron-am4024e kernel: [ 36.891668] tipc: Own node address <1.1.3>, network identity 4711 Jan 9 00:42:18 kontron-am4024e kernel: [ 36.893952] tipc: Enabled bearer <eth:et<eth0>, ⇒ discovery domain <1.1.0>, priority 10 Jan 9 00:42:18 kontron-am4024e kernel: [ 36.897451] tipc: Established link <1.1.3:eth0-1.1.1:eth0> ⇒ on network plane A Jan 9 00:42:18 kontron-am4024e kernel: [ 36.899559] tipc: Established link <1.1.3:eth0-1.1.2:eth0> ⇒ on network plane A
1.1.3:eth0-1.1.1:eth0 implies problem. Correct tipc connection in vMRF is from eth0 to eth0: <1.1.10:eth0-1.1.15:eth0>
Possible Cause:
Problem in cloud networking, incorrect network configuration, open network.
Procedure:
Steps
- Check and reconfigure cloud networking.
3.3 Signaling Related Trouble Cases
This section describes trouble cases related to signaling.
3.3.1 No IP Address Available for Signaling
Symptom:
IP address not available. Status check shows the following printout:
Running command: "verify_vmrf_node_status.py" on localhost: eth0: OK eth1: OK eth2: NO IPv4 ADDRESS SC role: ACTIVE CoreMW: OK COM: OK, RUNNING MrfDirector: OK, RUNNING CliDaemon: OK IpPipeline: OK TC-MPD: OK MrfAgent: OK CloudInit: OK SEC-CERT: OK neighbourdetection: OK
Possible Causes:
Hardware problem
VLAN tagging problem
OpenStack configuration problem
Procedure:
Steps
- Check DHCP server.
- Check connectivity to DHCP server, physical connectivity, vlan tagging.
- Check OpenStack security groups.
3.3.2 No Connection to NextHop
Symptoms:
SCTP operational state of MRF application is disabled:
mrsv-admin@fi2-vmrf-com-uplift-cl2:~$ cluster run cli_tool mrf_appl status Running command: "cli_tool mrf_appl status" on host: 192.168.0.3 (fi2-vmrf-com-uplift-cl2) [2017-01-09 11:50:55.383] Signalling State: =================== H248Interface-Id: 3 H248Interface-LDN: "MediaResourceFunction=1,MrfH248Control=1,MrfH248Interface=BLR2_16_2_mrf4" H248Interface association state: UNLOCKED H248Interface Service Change state: NOT_STARTED Sctp operational state: DISABLED Remote IP Address: 10.52.60.8 Remote Port: 21614 =================== H248Interface-Id: 2 H248Interface-LDN: "MediaResourceFunction=1,MrfH248Control=1,MrfH248Interface=BLR2_16_2_mrf3" H248Interface association state: UNLOCKED H248Interface Service Change state: NOT_STARTED Sctp operational state: DISABLED Remote IP Address: 10.52.60.8 Remote Port: 21613 =================== H248Interface-Id: 1 H248Interface-LDN: "MediaResourceFunction=1,MrfH248Control=1,MrfH248Interface=BLR2_16_2_mrf2" H248Interface association state: UNLOCKED H248Interface Service Change state: COMPLETED Sctp operational state: ENABLED Remote IP Address: 10.52.60.8 Remote Port: 21612 =================== LocalEndpoint Id: 3 Dscp: 40 Local port: 2944 =================== Sctp socket state: INITIATED. DHCP assigned IP: 10.52.61.219 =================== MRF instance administrative state: UNLOCKED =================== Running command: "cli_tool mrf_appl status" on host: 192.168.0.4 (fi2-vmrf-com-uplift-cl2-0) [2017-01-09 11:50:54.517] Signalling State: =================== H248Interface-Id: 3 H248Interface-LDN: "MediaResourceFunction=1,MrfH248Control=1,MrfH248Interface=BLR2_16_2_mrf4" H248Interface association state: UNLOCKED H248Interface Service Change state: NOT_STARTED Sctp operational state: DISABLED Remote IP Address: 10.52.60.8 Remote Port: 21614 =================== H248Interface-Id: 2 H248Interface-LDN: "MediaResourceFunction=1,MrfH248Control=1,MrfH248Interface=BLR2_16_2_mrf3" H248Interface association state: UNLOCKED H248Interface Service Change state: NOT_STARTED Sctp operational state: DISABLED Remote IP Address: 10.52.60.8 Remote Port: 21613 =================== H248Interface-Id: 1 H248Interface-LDN: "MediaResourceFunction=1,MrfH248Control=1,MrfH248Interface=BLR2_16_2_mrf2" H248Interface association state: UNLOCKED H248Interface Service Change state: ONGOING_COLD_BOOT Sctp operational state: DISABLED Remote IP Address: 10.52.60.8 Remote Port: 21612 =================== LocalEndpoint Id: 4 Dscp: 40 Local port: 2944 =================== Sctp socket state: INITIATED. DHCP assigned IP: 10.52.61.215 =================== MRF instance administrative state: UNLOCKED ===================
Possible Causes:
VLAN tagging problem in cloud
VLAN tagging problem in site switches or routers
Physical connectivity problem
Security group problem
Procedure:
Steps
- Check if the MRF H.248 Link Unavailable alarm is active. Clear the alarm using the alarm instruction.
- Check if VM has IP address for eth2.
- Check that SCTP is enabled in the security groups of the VM.
- Check connectivity and routing firewalls towards the cloud environment.
- Restart mrf process from CLI: sudo systemctl restart mrf_appl.service
- Perform the manual recovery flow procedure, as described in Manual Recovery Flow.
- If the problem still exists, contact Ericsson support.
3.3.3 No Connection to O&M IP Address
Symptom:
@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@ @ WARNING: REMOTE HOST IDENTIFICATION HAS CHANGED! @ @@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@ IT IS POSSIBLE THAT SOMEONE IS DOING SOMETHING NASTY! Someone could be eavesdropping on you right now (man-in-the-middle attack)! It is also possible that the RSA host key has just been changed. The fingerprint for the RSA key sent by the remote host is <fingerprint>. Please contact your system administrator. Add correct host key in ~/.ssh/known_hosts to get rid of this message. Offending key in ~/.ssh/known_hosts: <line number of the offending key> Permission denied (publickey,password).
Possible Cause:
Procedure:
Steps
3.4 Media Related Trouble Cases
This section describes trouble cases related to media.
3.4.1 No Connection to Client
Symptom:
Client is not reachable. Ping request from the IP address ends in timeout.
mrsv-admin@fi1-vmrf:~$ cli_tool ipp conf
Configuration:
Network (id:1) default_network
VLAN ID -
UDP Port Range 1024..65535
Media IP IF (id:1)
Ethdev em1 (id:0)
MAC FA:16:EE:FC:14:7A
Link UP
IP 10.52.58.133
Status DHCP OK
Media IP IF (id:2)
Ethdev em1 (id:0)
MAC FA:16:EE:FC:14:7A
Link UP
IP 2001:1b70:8298:2038::5
Status DHCP OK
Link local fe80::f816:eeff:fefc:147a
Static Route (id:4)
IP 0.0.0.0/0
Nexthop (id:4)
MAC 00:30:88:11:DB:83
IP 10.52.58.129
Static Route (id:6)
IP ::/0
Nexthop (id:6)
MAC 00:30:88:11:DB:83
IP fe80::230:88ff:fe11:db83
mrsv-admin@fi1-vmrf:~$
mrsv-admin@fi1-vmrf:~$ cli_tool ipp ping -m 1 10.52.45.129
PING 10.52.45.129 56 bytes of data
Timeout (3000 ms)
mrsv-admin@fi1-vmrf:~$
Possible Causes:
Problem in static route
Problem in client
Procedure:
Steps
3.5 Load Related Trouble Cases
This section describes trouble cases related to load.
3.5.1 Disturbances in Traffic
Symptoms:
Temporary or permanent stoppage of traffic
Possible Cause:
Software problem
Procedure:
Do the following:
Steps
- Check crash dumps.
- Collect related data and contact Ericsson support. For more information on how to collect information, refer to Data Collection Guideline for vMRF.
3.5.2 Speech Quality Problem
3.5.2.1 Bandwidth Limitation
Symptom:
Bad speech quality
Bandwidth limitation can be checked by using the ipp discard-counters command:
mrsv-admin@fi8-mrs:~$ cluster run cli_tool ipp discard-counters RX_BANDWIDTH_POLICING_DROP_TRAFFIC : 6286
Possible Cause:
Bandwidth limitation
Procedure:
Steps
- Check that needed bandwidth is not limited.
3.5.2.2 Packet Loss in vSwitch
Symptom:
Bad speech quality.
Possible Cause:
Packet loss in vSwitch.
Procedure:
Steps
3.5.2.3 Packet Loss on Site
Symptom:
Bad speech quality.
Possible Causes:
Packet loss on site.
Packet loss/error counters are incremented in site switches or routers and cloud server switches.
Procedure:
Steps
- Check connectivity, configuration, VLANs, routing firewalls towards cloud environment.
3.6 Announcement Related Trouble Cases
This section describes trouble cases related to announcements.
3.6.1 vMRF Cannot Play Announcement
Symptoms:
- vMRF cannot play announcement
mrsv-admin@fi2-vmrf-20170116-084636-cl1:~$ cli_tool mrf_appl h248-counters
[2017-01-17 09:36:08.190]
Modify Request total: 1472180 (Emergency: 0 IEPS: 0 Priority: 0)
Pendings: 0
Pending limit exceeded: 0
Retransmissions: 0
Retransmission limit exceeded: 0
24 (Emergency: 0 IEPS: 0 Priority: 0) replied with error 514 (GCP_MEDIA_GATEWAY_CANNOT_SEND_THE_SPECIFIED_ANNOUNCEMENT)
Originated from CRH at location 66 (visible as ERR_LOC_00066 in source code)
Possible Causes:
File caching failure
Missing BasicAnnouncement or VariableAnnouncement MO configuration
Variable announcement logic execution error
Incorrect audio file format
Missing audio files
Procedure:
Steps
3.6.2 Client Cannot Hear Announcement
Symptoms:
- Client cannot hear announcement while vMRF plays the announcement
Possible Causes:
Early media handling settings in vSBG is incorrect.
Procedure:
Steps
Reference List
|
Managed Object Model, 387/155 54-LZN 765 0172-V1 |

Contents