Beruflich Dokumente
Kultur Dokumente
Agenda
Alarms (ZAHO, ZAHP and alarm description in NED)
MML command log (ZIGO)
Unit diagnostics (ZUDU)
Clear Codes (ZTUT or CC trace ZTOF with OBSFAILA logical file)
Subscriber Trace (ZMCJ or ZMWC with GSMME1PR logical file)
Message monitoring
Monitor Computer Unit start-up
Output Computer Unit logs & Black Box
Alarms
Always remember to check blocked alarms (ZABO;) (ZAFP; in MGW)
Active alarms:
ZAHO;
ZAAP; in MGW
ZAHP::NR=:2006-01-26,09-00-00:;
MSCi
MSS_880091
2006-01-26 10:03:20
ALARM HISTORY
<HIST> MSS_880091
OMU-0
SWITCH 2006-01-26 09:48:36.92
NOTICE VTP-22
IOMANA
0690 WORKING STATE CHANGE
BL-SY WO-BU 0000 0000 0000 0000 0000 0000
Alarm description
Structure of the alarm:
2PAC
CMU-0
SWITCH 2007-11-04 20:27:34.13
*** ALARM CMU-0
1A001-02 CRM_SJ
(0054) 3184 RNC CONTINUOUSLY OUT OF SERVICE IN MSC
633 01 0001 01 0000
1/ type of alarm: standard, update printout, history
2/name of NE
4/computer sending alarm
5/ alarm equipment type
6/ date , time
7/urgency
8/printout type
9/computer unit
10/alarm number -> Check in NED !
11/alarm text
12/suppelementary information -> Check in NED
IPA2800
2009-04-21 09:55:06
SYSTEM
16:09:07 PAGE
USER
SYSTEM
880091 MSS_880091
OMU-VTP20
JG
SYSTEM
880091 MSS_880091
1
MML
SESSION
I/O-DEVICE
00021
OMU-VTP19
QN
00020
&
5B DIAGNOS
UNIT = OMU
1
LOG
FILE
CLASS MODE
SPARE
FILE
NAME
PHYSICAL
FILE
OBJ
SYSTEM
IND
UNIT
NAME
1 MSS_880091
DEVICE/
NAME
LOGICAL
FILE
OMU
LPT-1
2 MSS_880091 OMU
3 MSS_880091 OMU
VDU-5
VTP-19
Start diagnostics
Change Computer Unit state to TE-EX
ZUSC:SIGU,2:TE;
Then start diagnostics
ZUDU:SIGU,2;
Diagnostics history
ZUDH:SIGU,2::;
MSCi
MSS_880091
2006-01-26
11:35:43
DATE = 2006-01-26
MSCi
MSS_880091
PARTIAL DIAGNOSIS EXECUTED
2006-01-26
SIGU-2
10:42:11
POWER
MSCi
MSS_880091
PARTIAL DIAGNOSIS EXECUTED
2006-01-26
SIGU-2
10:42:35
CPU
MSCi
MSS_880091
PARTIAL DIAGNOSIS EXECUTED
2006-01-26
SIGU-2
10:42:35
RAM
MSCi
MSS_880091
PARTIAL DIAGNOSIS EXECUTED
2006-01-26
SIGU-2
10:42:40
SYSB
MSCi
2006-01-26
10:42:40
MSS_880091
DIAGNOSTIC REPORT
SIGU-2
PARTIAL DIAGNOSIS
DIAGNOSTIC PROGRAM
DIAGNOSIS
TOTAL
0000
3999
TIME = 00:00:00
normal clearing
internal congestion
external congestion
subscriber errors
The first main class 'normal clearing' (000H 3FFH) consists of clear codes which have not been caused by
an error in the exchange or by subscriber error, but which nevertheless leads to clearing of a call or to an
interruption in call set-up.
The second main class, 'internal congestion' (400H - 7FFH) consists of cases in which a call or call set-up is
interrupted because of an error in the exchange. This group contains clear codes related mainly to file
management and to communication between program blocks and different units of the exchange.
The third main class, 'external congestion' (800H -BFFH) includes all the cases in which a call or call set-up is
interrupted because of an error outside the exchange. This group consists mainly of clear codes related to
inter-exchange signalling.
The fourth main class, 'subscriber error' (C00H - FFFH) includes the cases in which a call or call set-up is
interrupted by a subscriber's error or by a failure in the subsriber's equipment or by faulty subscriber signalling
Clear Codes
About the Clear Codes (CC)
Clear codes helps greatly troubleshooting of a failed call.
Of course the call has to reach a certain level to get clear codes. (e.g.
call ID request).
Clear Codes
ZTUS:CLR,60,0;
MSCi
MSS_880091
MODE:
STATE:
RES. OUTPUT DAY:
START DATE:
STOP DATE:
RES. ACC. PERIOD:
OUTPUT DELAY:
CLR
ACTIVE
MON 00:00-24:00
TUE 00:00-24:00
WED 00:00-24:00
THU 00:00-24:00
FRI 00:00-24:00
2006-01-26 10:16:00
60
0
COMMAND EXECUTED
2006-01-26
10:16:01
Clear Codes
ZTUT:CLR;
LOADING PROGRAM VERSION 14.17-0
SIGNALLING RING
0
0
235
1
3
1
2
22
0
0
1
0
1
89
51
143
0
1
0
0
4
3
491
2
1
1
END OF REPORT
SPEECH
0
7
0
1
2
0
0
0
0
0
0
1
0
0
1
0
CLEAR CODE
000H NORMAL END OF THE CALL
005H B-SUBSCRIBER BUSY
00DH CALL TERMINATED BY OPER
015H NORMAL UNSPECIFIED
024H MAX DUR OF CALL EXCEEDED
206H CALL REJECTED
304H B-LINE OUT OF SERVICE
30AH A ONHOOK DURING SET UP
30BH A ONHOOK DUR_WAIT ANSWER
603H NO RESPONSE FROM CO-PROC
706H CALL INTER REG ANALYSIS
80FH CIRCUIT CONGESTION
812H MAP FAILURE
B13H RADIO IF FAILURE
Additional
information fields
CALL ID
CALL ID : 0A9BH-AC8FH-401CH-4050H-0F61H-12H-0AF1H-57H
0A9BH: Record index of the call for SCALLF
AC8FH:Record index of the call for charging (LCALLF)
401CH: CPU identifier of the CHU computer that is handling charging for this call
4050H: CPU identifier of the RMAPRB that controls the call
0F61H: Hand ID of the RMAPRB that controls the call
12H: Focus of the hand process of the RMAPRB that control the call
0AF1H: Hand ID of XORPRB that controls the call
57H: Focus of the hand process of the XORPRB that controls the call
Computer Logs
The Error log:
In case when alarms do not indicate enough information or there is no alarm at all,
the error logs can help.
The error logs do not necessary mean trouble or failure of the system or program
block. It gives a signal or message regarding certain state or output of the code.
Of course the error situations regarding configuration error or misbehavior of the
system. In order to gather the error logs of a certain computer unit, accessing the
service terminal is mandatory.
e.g: ZDDS:SIGU;
Using the command above will access the unit index 0.
Computer Logs
Printout of SIGU-0`s error logs:
CALLER : 05E4 05F1 00 RETURN ADDRESS: 0940 (G0296).00000219
WRITE TIME: 2007-11-06 13:37:53.58
PARAMETERS: I-08 0938.00000021 00000075 0938.00000004
USER TEXT : LST: UNKNOWN MGW, SCTP IPv4:
USER DATA : mgw ip address version = 0
mgw ip address 1: 10.88.51.141
mgw ip address 2: 00.00.00.00
mgw ip port
= 8013
User text show description regarding the error situation.
User data contains the supplementary information
The caller is the process who sends the alarm.
Finding the process based on the id:
Enter service terminal with ZDDS
44-SYS>ZSXP:HTA
ID NAME
LDT
ATTR BP MP SI HGC MQ EH IS LP HP
05E4 HTAPRB 6D18 (G3491) 18FC 01 60 0014 0006 0A28 01 00 60 60
And vice versa, finding the process family based on the process ID:
4-MAN>ZSXPI:5E4
ID NAME
LDT
ATTR BP MP SI HGC MQ EH IS LP HP
05E4 HTAPRB 6D18 (G3491) 18FC 01 60 0014 0006 0A28 01 00 60 60
For internal use
21
Nokia Siemens Networks
.
.
.
.
CALLER: 3CB TYPE: I-8 DATE: 01.01.1970 TIME: 00:01:53.70
USER TEXT: UXCPRB: NOTICE -- SDxG interrupt occurred
USER DATA: port = 0, interface = 0
unit = UNKNOWN
CALLER: 3CB TYPE: I-8 DATE: 01.01.1970 TIME: 00:02:12.30
USER TEXT: UXCPRB: NOTICE -- SDxG interrupt occurred
USER DATA: port = 0, interface = 0
unit = UNKNOWN
Message monitoring
The message monitoring:
Deeper level troubleshooting method is the message monitoring gathering.
In order to find code error or possible configuration error (not that helpful) we are able to
execute different type of monitoring.
There is possibility to gather these logs from DMX( dx and ipa) and Chorus (ipa) units.
First of all here is the structure of the hex dump we call monitoring in DMX units:
MONITORING TIME: 2003-04-15
16:58:09.34
Deeper troubleshooting
1/1
Document Type
Author
Unit/Dept.
Document Title
Date, Version
For internal use
Deeper troubleshooting
Message monitoring methods in DMX units:
The first, fastest and easiest way of message monitoring is the ZOQ command in service terminal. You need to access to
working unit of the monitored program block that is located in. (ZDDS)
ZOQ parameters:
FAM
PRO
DI
ID
GR
C_C
C_F
C_P
E_F
E_P
QL
The ZOQ monitoring will provide hex dump, decoding not needed.
Deeper troubleshooting
Message monitoring methods in Chorus units:
All messages sent between computer units are DMX messages regardless from or to which type (DMX or
Chorus) of unit they are sent. Therefore, in order to be able to send and receive DMX messages, Chorus
actors must register a DMX identity from the DMX messaging system. This identity contains the family,
process, and focus identifier, similar to the DMX processes.
Do not forget to use lower case when typing the commands in Chorus units!
In order to find out the family id of a thread you need to use command:
alist
It will list which family, process, focus are registered by DMX identity.
20000061 00000000 00000047 00000001 0071 SUP STARTED 001 yshell
It is possible that you need to load certain files before using them.
If you would like to use dmxmon or dmxshow, You need to load them with:
loadext dmxshow
loadext dmxmon
Dmxshow command will show the FAM for the related actor:
0011-$ dmxshow -m
FAM
PROC
FOCUS
actor
thread queue #
054C
0000
00
0054
0076
0000
0000
00
0023
000D
1
0F03
0000
00
0071
00BC
123
pending
0
0
allocated
0
0
0
Deeper troubleshooting
Starting the monitoring :
dmxmon f 54C l /bin/example.txt
it links the monitoring to a file
Press CNTRL-C to stop the monitoring
Command: cat will print the file out.
cat /bin/example.txt
After a successfull loading these extensions can be used.
Deeper troubleshooting
What is important in daily troubleshooting?
Focus points are:
User Plane
in MSS:
in MGW:
ZWQV::URQ%
SIGU, BSU
ZSXP:<full path>URQ
05F5
Deeper troubleshooting
Extended message monitoring:
It will provide possibility to monitor several program blocks (16) in several units (15) in the same time.
Always have to be taken from OMU!
The process is following:
1. Reserving memory for monitoring
ZOEBR:MB_address:reserved_memory_in_hex
e.g: ZOEBR:40:1000000
2. Create monitoring conditions
ZOEC:MB_address:SR:OFAM=process_id (SR means: sent, received)
e.g: ZOEC:40:SR:OFAM=132
3. Start monitoring: ZOEM:MB_address
4. Stop monitoring: ZOES:MB_address
5. Printout monitoring in hexadump to screen:
ZOEG:MB_address
ZOEGF:MB_address:W0-log.bin
It will save the log into log.bin file on W0. IDA can decode it depending on version. It is easier to print to
screen, decoding done automatically.
6. Kill monitoring conditions:
ZOEK - It will release the used buffers and delete the monitoring conditions in all units.
For internal use
31
Nokia Siemens Networks
Deeper troubleshooting
How to read the monitoring LOGs???
The main troubleshooting tool is:
IDA2
IDA2 is an interactive and automatic off-line analysis and troubleshooting tool helping users to
analyze different kind of log files. IDA2 is well-suited to analyze log files interactively and
automatically for example from the DX platform containing objects such as error logs, test logs,
alarms, MML commands and responses and DX messages.
Although IDA2 has strong support for DX platform products it is also suitable for other platforms
as well.
Creating IDA2 extractors and decoders for different network products allows the browsing log
files and automated analysis.
IDA2 can be downloaded from:
https://confluence.inside.nokiasiemensnetworks.com/display/EE/EE+IDA2
: 0CF6H-9327H-401CH-413EH-05D3H-F0H-05F1H-C4H
HAND
6.
FOCUS
The hand ID can be found in IDA under Process, the focus ID can be found under
focus. If you find it you can filter for that with a double click on process column header
button.
Deeper troubleshooting
CMREAD program block
The CMR is located in CM/CMM having tasks like digit-, barring analysis.
It can be decoded with IDA, but major segments can be seen easily.
Enter active CM/CMM`s service terminal:
ZDDS:<active CMM>;
ZOQE:<Process family ID>;
As it was described before, ZOQ can be set to certain message number. In this case it is 3215.
Set it to ID field using cursor.
Initiate the call...
1/1 troubleshooting
Deeper
Document Type
Author
Unit/Dept.
Document Title
Date, Version
For internal use
CMREAD analysation:
FAM PRO DI ID GR C_C C_F C_P E_F E_P QL
FFFF FFFF FF 3215 FFFF FFFF FFFF FFFF FFFF FFFF FF
MONITORING TIME: 2007-11-09 17:02:06.15
RECEIVED BY: 0070 0000 00
MONITORED MESSAGE: 0054 4531 0134 0001 00 19 0000 3215 0045
00 00 02 00 00 00 00 00 00 00 FF FF FF 06 04 01 0A 0A 0A 0A 01 00 00 00 00
00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
00 00 00 00 00 FF 04 06 00 64 00 00 00 00 00 00 00 01
MONITORING TIME: 2007-11-09 17:02:06.16
RECEIVED BY: 0070 0000 00
MONITORED MESSAGE: 0054 4531 0294 0001 00 19 0000 3215 0045
00 00 32 00 00 00 00 00 00 00 FF FF FF 06 04 01 06 01 02 0A 0A 0A 00 00 00
00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
00 00 00 00 00 FF 04 06 00 64 00 00 00 00 00 00 00 01
RED:
the number of the TREE in hexadecimal format
GREEN: the digit being analized
BLUE: the TON (type of number)
For internal use
35
Nokia Siemens Networks
Blackbox
Blackbox:
Black box (boxana) is useful in case of unexpected unit restart or process exception.
In case of unexpected unit restart, tha alarm may not give relevant information regarding the
restart, only unit restarted alarm. Blackbox could provide detailed background of the event.
The loadable extension is : boxana.
1. Enter in the OMU service terminal: ZDDS;
2. Load the boxana extension: ZLP:1,BOX:
Commands:
A:
output the entire operating system log.
AE:
printout of error types
AS:
outputs statistical data
AP:
outputs only exceptions
L:
log file writing
I:
will show the reason for restart
U:
will output the entire black box
BBLOG in Chorus units:
loadext bblog
bblog -c -a x (default is 2 =previous restart) x can between 2-9
For internal use
36
Nokia Siemens Networks
Mini Debugger
a short turnout...
Mini debugger:
If there is continous unit restarts it is not possible to give commands in service terminal. In this case press
RST (reset) and DBG (debugger) button in the CPU card.
Keep them pressed for few seconds, then release RST and hold DBG unit debugger loaded message
appears on screen.
It can help analizing the failures in case of unexpected continous unit restarts.