Sie sind auf Seite 1von 342

OptiX OSN 8800/6800/3800

V100R007C00

Troubleshooting

Issue 05
Date 2013-11-30

HUAWEI TECHNOLOGIES CO., LTD.


Copyright © Huawei Technologies Co., Ltd. 2013. All rights reserved.
No part of this document may be reproduced or transmitted in any form or by any means without prior written
consent of Huawei Technologies Co., Ltd.

Trademarks and Permissions

and other Huawei trademarks are trademarks of Huawei Technologies Co., Ltd.
All other trademarks and trade names mentioned in this document are the property of their respective holders.

Notice
The purchased products, services and features are stipulated by the contract made between Huawei and the
customer. All or part of the products, services and features described in this document may not be within the
purchase scope or the usage scope. Unless otherwise specified in the contract, all statements, information,
and recommendations in this document are provided "AS IS" without warranties, guarantees or representations
of any kind, either express or implied.

The information in this document is subject to change without notice. Every effort has been made in the
preparation of this document to ensure accuracy of the contents, but all statements, information, and
recommendations in this document do not constitute a warranty of any kind, express or implied.

Huawei Technologies Co., Ltd.


Address: Huawei Industrial Base
Bantian, Longgang
Shenzhen 518129
People's Republic of China

Website: http://www.huawei.com
Email: support@huawei.com

Issue 05 (2013-11-30) Huawei Proprietary and Confidential i


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting About This Document

About This Document

Related Versions
The following table lists the product versions related to this document.

Product Name Version

OptiX OSN 8800 V100R007C00

OptiX OSN 6800 V100R007C00

OptiX OSN 3800 V100R007C00

iManager U2000 V100R006C02

iManager U2000 Web LCT V100R006C02

Intended Audience
The intended audiences of this document are:

l Field maintenance engineers


l Network monitoring engineers
l System maintenance engineers

Symbol Conventions
The symbols that may be found in this document are defined as follows.

Symbol Description

Indicates an imminently hazardous situation which, if not


avoided, will result in death or serious injury.

Indicates a potentially hazardous situation which, if not


avoided, could result in death or serious injury.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential ii


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting About This Document

Symbol Description

Indicates a potentially hazardous situation which, if not


avoided, may result in minor or moderate injury.

Indicates a potentially hazardous situation which, if not


avoided, could result in equipment damage, data loss,
performance deterioration, or unanticipated results.
NOTICE is used to address practices not related to personal
injury.

Calls attention to important information, best practices and


tips.
NOTE is used to address information not related to personal
injury, equipment damage, and environment deterioration.

GUI Conventions
The GUI conventions that may be found in this document are defined as follows.

Convention Description

Boldface Buttons, menus, parameters, tabs, window, and dialog titles


are in boldface. For example, click OK.

> Multi-level menus are in boldface and separated by the ">"


signs. For example, choose File > Create > Folder.

Change History
Updates between document issues are cumulative. Therefore, the latest document issue contains
all updates made in previous issues.

Updates in Issue 05 (2013-11-30) Based on Product Version


V100R007C00
This issue is the fifth official release for V100R007C00. Compared with Issue 04, some bugs in
the manual are fixed.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential iii


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting About This Document

Updates in Issue 04 (2013-08-19) Based on Product Version


V100R007C00
This issue is the forth official release for V100R007C00. Compared with Issue 03, updating of
the following content.

Update Description

Basic Concepts Updated "Test Frame": optimized application 2 of ethernet test frame.
and Methods of
Fault Locating

Fault Location Updated the procedure of "Performing Intelligent Service Fault


Using TP-Assist Diagnosis for Ethernet Services".

Updates in Issue 03 (2012-12-15) Based on Product Version


V100R007C00
This issue is the third official release for V100R007C00. Compared with Issue 02 (2012-09-30),
updating of SPC200 is added. This issue has the following updates.

Update Description

Whole manual l Deleted "Description" in the Chapter "General Fault Handling Flow"
l Deleted "Information Collecting and Recording" "Technical
Support" "Latest Technical Documentation Obtaining" in the Chapter
"Fault Handling Flow"
l Deleted "Basic Rules for Locating Faults" in the Chapter "Basic
Concepts and Methods of Fault Locating".
l Deleted "Configuration Data Analysis" in the Chapter "Common
Methods of Locating Faults"
l Deleted "Related Information" in the Chapter "Rectifying Bit Errors"
l Deleted "Failure of Ringing of the Orderwire Phone Set upon
Incoming Calls Though the Calls Can Be Answered" in the Chapter
"Rectifying Orderwire Problems"
l "Frequent Resetting of the SCC Board" has been moved from "17
ECC Problems" to "10 NE Offline Problems".
l Added "Rectifying Coherent Board Faults".

Common Updated "Test Frame", added 100G OTU board.


Methods of
Locating Fault

Issue 05 (2013-11-30) Huawei Proprietary and Confidential iv


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting About This Document

Updates in Issue 02 (2012-09-30) Based on Product Version


V100R007C00
This issue is the second official release for V100R007C00. Only the issue number is updated.

Updates in Issue 01 (2012-07-30) Based on Product Version


V100R007C00
First release for version V100R007C00. Compared with version V100R006C03, this issue has
the following updates.

Update Description

3.1 Common Section "One-Click Data Collection" is added.


Fault Isolation
Methods

4 Fault Location Section "Fault Location Using TP-Assist" is added.


Using TP-Assist

15 Packet Section "Rectifying Packet Service Problems" is added.


Service
Problems

16 Ethernet l Section "Common LAG Faults" is added.


Problems l Section "Common MSTP Faults" is added.

17 ECC l Section "FAQs for HWECC" is added.


Problems l Section "Troubleshooting for Inband DCN" is added.

Updates in Issue 03 (2012-10-30) Based on Product Version


V100R006C03
The update of contents is described as follows:

Update Description

3.1 Common Fault Isolation l 3.1.6 PRBS Test: Updated PRBS networking
Methods applications.
l 3.1.8 Test Frame: Added the 40G service type and
the supported board.
l Added "One-Click Data Collection".

Issue 05 (2013-11-30) Huawei Proprietary and Confidential v


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting About This Document

Update Description

12 Automatic Switching 12.5 ODUk SNCP Protection: Updated the solution of


Failures troubleshooting.

17 ECC Problems Added "FAQs for HWECC".

Updates in Issue 02 (2012-06-22) Based on Product Version


V100R006C03
The update of contents is described as follows:

Update Description

3.1 Common Fault Isolation Section "Port Mirroring Analysis" is added.


Methods Common fault diagnosis methods are optimized.

10.1 Unreachability of One NE Causes and diagnosis methods for the fault are modified.
by the U2000

12.12 DLAG Protection Changed "Automatic Switching Failure of DLAG


Protection (OTN)" to "Automatic Switching Failure of
DLAG Protection".

13.10 DLAG Protection Changed "Automatic Reverting Failure of DLAG


Protection (OTN)" to "Automatic Reverting Failure of
DLAG Protection".

Automatic Switching Failure of Section are deleted.


DLAG Protection(OCS)
Automatic Reverting Failure of
DLAG Protection(OCS)

14.4 IPA End Failure in a Causes and diagnosis methods for the fault are added.
Raman System

Updates in Issue 01 (2011-12-15) Based on Product Version


V100R006C03
The update of contents is described as follows:

Some bugs in the manual are fixed.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential vi


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting About This Document

Updates in Issue 02 (2013-06-30) Based on Product Version


V100R006C01
The update of contents is described as follows:

Some bugs in the manual are fixed.

Updates in Issue 01 (2011-07-30) Based on Product Version


V100R006C01
The update of contents is described as follows:

Some bugs in the manual are fixed.

Updates in Issue 02 (2011-04-15) Based on Product Version


V100R006C00
The update of contents is described as follows:

Some bugs in the manual are fixed.

Updates in Issue 01 (2010-12-31) Based on Product Version


V100R006C00
Update Description

12.16 Inter- Section "Automatic Switching Failure of Inter-Subrack Communication


Subrack Protection" is added.
Communication
Protection

Updates in Issue 01 (2010-07-30) Based on Product Version


V100R005C00
This issue is the first official release for OptiX OSN 8800/6800/3800 V100R005C00. In this
release, the manuals for OptiX OSN 8800 V100R002C02, OptiX OSN 6800 V100R004C04,
and OptiX OSN 3800 V100R004C04 are combined into one manual. Compared with the
combined manual, this issue has the following updates.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential vii


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting About This Document

Update Description

Entire manual This manual provides descriptions according to product series OptiX
OSN 8800, OptiX OSN 6800, and OptiX OSN 3800. Any difference
between the products is described in the manual.

11 Board Offline Section "Rectifying Board Offline Problems" is added.


Problems

16 Ethernet Section "Service Interruption" is deleted.


Problems

Issue 05 (2013-11-30) Huawei Proprietary and Confidential viii


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting Contents

Contents

About This Document.....................................................................................................................ii


1 Overview.........................................................................................................................................1
1.1 Safety Precautions..........................................................................................................................................................2
1.1.1 Laser............................................................................................................................................................................2
1.1.2 Short Circuit................................................................................................................................................................4
1.1.3 ESD..............................................................................................................................................................................4
1.1.4 Working on Equipment with Power Applied..............................................................................................................6
1.1.5 Alarm and Safety Symbols..........................................................................................................................................6
1.2 Instruments and Tools....................................................................................................................................................7
1.3 Maintenance Personnel Qualifications...........................................................................................................................8

2 Troubleshooting Workflow.........................................................................................................9
2.1 General Troubleshooting..............................................................................................................................................10
2.1.1 Workflow...................................................................................................................................................................10
2.2 Emergency Troubleshooting.........................................................................................................................................12
2.2.1 Workflow...................................................................................................................................................................12
2.2.2 Description.................................................................................................................................................................15

3 Fault Isolation Basics..................................................................................................................22


3.1 Common Fault Isolation Methods................................................................................................................................24
3.1.1 Service Signal Flow Analysis....................................................................................................................................24
3.1.2 Alarm and Performance Analysis..............................................................................................................................26
3.1.3 Fault Analysis Using Test Instruments......................................................................................................................29
3.1.4 Loopback...................................................................................................................................................................31
3.1.5 Replacement..............................................................................................................................................................32
3.1.6 PRBS Test..................................................................................................................................................................33
3.1.7 Configuration Modification.......................................................................................................................................35
3.1.8 Test Frame.................................................................................................................................................................36
3.1.9 RMON Performance Analysis...................................................................................................................................41
3.1.10 Port Mirroring Analysis...........................................................................................................................................43
3.1.11 One-Click Data Collection......................................................................................................................................46

4 Fault Location Using TP-Assist.................................................................................................49


4.1 Fault Location Using IP Ping.......................................................................................................................................50

Issue 05 (2013-11-30) Huawei Proprietary and Confidential ix


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting Contents

4.2 E-LAN Service Loopback Detection............................................................................................................................52


4.3 Performing Intelligent Service Fault Diagnosis for Ethernet Services.........................................................................54

5 Service Interruptions..................................................................................................................56
5.1 Interruption of All Services..........................................................................................................................................58
5.2 Single-Channel Service Interruption on the Client Side..............................................................................................60
5.3 Single-Channel Service Interruption on the WDM Side..............................................................................................61
5.4 Service Interruption Caused by Human Misoperation.................................................................................................63
5.5 Service Interruptions Caused by Power Supply Failures.............................................................................................64
5.6 Service Interruptions Due to Grounding Fault.............................................................................................................66
5.7 Service Interruptions Due to Unfavorable Environmental Conditions.........................................................................67
5.8 Service Interruptions Due to Faulty Optical Fibers, Cables, or Connectors................................................................71
5.9 Service Interruptions Due to Inconsistency of Board Types or Board Settings...........................................................74
5.10 Service Interruptions on the VCTRUNK Port............................................................................................................75
5.11 Service Interruptions on a Single Ethernet Port.........................................................................................................77
5.12 Service Interruptions on All Ethernet Ports................................................................................................................79
5.13 Service Interruption Cases..........................................................................................................................................81

6 Transient Service Interruptions................................................................................................83


6.1 Transient Interruption of Main Path Services...............................................................................................................84
6.2 Transient Interruption of Single-Channel Services......................................................................................................85
6.3 Transient Service Interruptions on a Single VCTRUNK Port.....................................................................................87
6.4 Transient Service Interruptions on a Single Ethernet Port...........................................................................................93
6.5 Transient Service Interruptions Due to Protocol Related Causes.................................................................................97
6.6 Transient Interruption Cases.......................................................................................................................................100

7 Optical Power Abnormity........................................................................................................102


7.1 Abnormal Decline of the Optical Power....................................................................................................................103
7.2 Optical Power Abnormity Cases................................................................................................................................104

8 Bit Errors......................................................................................................................................106
8.1 Multi-Channel Bit Errors............................................................................................................................................107
8.2 Single-Channel Bit Errors..........................................................................................................................................109
8.3 Bit Errors Due to Unfavorable Environmental Conditions........................................................................................111
8.4 Bit Errors Due to Device Faults.................................................................................................................................115
8.5 Bit Error Cases............................................................................................................................................................117

9 A Fault in Pointer Justification...............................................................................................118


9.1 Pointer Justification Due to Incorrect Clock Configuration.......................................................................................119
9.2 Pointer Justification Due to Over-High Temperature and Humidity..........................................................................120
9.3 Pointer Justification Due to Over-Low Precision of the External Clock Source.......................................................121

10 NE Offline Problems..............................................................................................................123
10.1 Unreachability of One NE by the U2000.................................................................................................................124
10.2 Unreachability of All NEs in a Subnet by the U2000..............................................................................................125

Issue 05 (2013-11-30) Huawei Proprietary and Confidential x


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting Contents

10.3 Frequent Unreachability of an NE by the U2000.....................................................................................................127


10.4 The System Control Board Is Frequently Reset.......................................................................................................128
10.5 NE Offline Cases......................................................................................................................................................129

11 Board Offline Problems......................................................................................................... 130


11.1 A Board Is Offline....................................................................................................................................................131
11.2 Multiple Boards Are Offline.....................................................................................................................................131

12 Automatic Switching Failures...............................................................................................133


12.1 Optical Line Protection.............................................................................................................................................136
12.2 Intra-Board 1+1 Protection.......................................................................................................................................138
12.3 Client-side 1+1 Protection........................................................................................................................................141
12.4 SW SNCP Protection................................................................................................................................................143
12.5 ODUk SNCP Protection...........................................................................................................................................145
12.6 VLAN SNCP Protection...........................................................................................................................................149
12.7 Tributary SNCP Protection.......................................................................................................................................151
12.8 Automatic Switching Failure of DBPS Protection...................................................................................................155
12.9 Board 1+1 Protection................................................................................................................................................156
12.10 Ethernet Ring Protection........................................................................................................................................157
12.11 MS SNCP Protection..............................................................................................................................................159
12.12 DLAG Protection....................................................................................................................................................162
12.13 Board-Level Protection...........................................................................................................................................163
12.14 ODUk SPRing Protection.......................................................................................................................................165
12.15 Optical Wavelength Shared Protection...................................................................................................................168
12.16 Inter-Subrack Communication Protection..............................................................................................................171
12.17 Linear Multiplex Section Protection.......................................................................................................................171
12.18 MSP Ring...............................................................................................................................................................174
12.19 Transoceanic MSP Ring.........................................................................................................................................176
12.20 SNCP......................................................................................................................................................................178
12.21 SNCTP....................................................................................................................................................................180
12.22 Protection Cases.....................................................................................................................................................182

13 Automatic Reverting Failures............................................................................................... 184


13.1 Optical Line Protection.............................................................................................................................................187
13.2 Intra-Board 1+1 Protection.......................................................................................................................................189
13.3 Client-side 1+1 Protection........................................................................................................................................191
13.4 SW SNCP Protection................................................................................................................................................193
13.5 ODUk SNCP Protection...........................................................................................................................................195
13.6 VLAN SNCP Protection...........................................................................................................................................197
13.7 Tributary SNCP Protection.......................................................................................................................................200
13.8 Ethernet Ring Protection..........................................................................................................................................202
13.9 MS SNCP Protection................................................................................................................................................203
13.10 DLAG Protection....................................................................................................................................................205

Issue 05 (2013-11-30) Huawei Proprietary and Confidential xi


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting Contents

13.11 ODUk SPRing Protection.......................................................................................................................................207


13.12 Optical Wavelength Shared Protection...................................................................................................................209
13.13 Linear Multiplex Section Protection.......................................................................................................................211
13.14 MSP Ring...............................................................................................................................................................212
13.15 Transoceanic Ring MS Protection..........................................................................................................................213
13.16 SNCP......................................................................................................................................................................214
13.17 SNCTP....................................................................................................................................................................215
13.18 Protection Cases.....................................................................................................................................................216

14 Optical Power Management Problems................................................................................219


14.1 IPA Start Failure.......................................................................................................................................................220
14.2 IPA End Failure........................................................................................................................................................222
14.3 IPA Start Failure in a Raman System.......................................................................................................................224
14.4 IPA End Failure in a Raman System........................................................................................................................228
14.5 ALC Start Failure.....................................................................................................................................................232
14.6 APE Exceptions........................................................................................................................................................234
14.7 Handling Anomalies in the OPA Function...............................................................................................................237
14.8 Optical Power Management Cases...........................................................................................................................237
14.8.1 Automatic Power Equilibrium (APE)....................................................................................................................238
14.8.2 Automatic Level Control (ALC)...........................................................................................................................238
14.8.3 Intelligent Power Adjustment (IPA)......................................................................................................................238
14.8.4 Intelligent Power Adjustment (IPA) of Raman System........................................................................................238

15 Packet Service Problems........................................................................................................240


15.1 Handling MPLS Tunnel Faults.................................................................................................................................241
15.2 Handling PW Faults..................................................................................................................................................242
15.3 Native Ethernet Service Troubleshooting.................................................................................................................243
15.4 PWE3 Ethernet Service Troubleshooting.................................................................................................................245
15.5 Tunnel APS Troubleshooting...................................................................................................................................247

16 Ethernet Problems...................................................................................................................250
16.1 Service Degradation.................................................................................................................................................251
16.2 LCAS Startup Failure...............................................................................................................................................252
16.3 LPT Switching Failure..............................................................................................................................................255
16.4 LB Test Failure.........................................................................................................................................................255
16.5 Test Frames Unavailability.......................................................................................................................................259
16.6 Common LAG faults................................................................................................................................................259
16.7 Common MSTP faults..............................................................................................................................................262
16.8 Ethernet Service Cases.............................................................................................................................................265

17 ECC Problems...........................................................................................................................266
17.1 ECC Communication Interruption............................................................................................................................267
17.2 Intermittent ECC Communication............................................................................................................................268
17.3 HWECC FAQ...........................................................................................................................................................269

Issue 05 (2013-11-30) Huawei Proprietary and Confidential xii


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting Contents

17.4 Troubleshooting for Inband DCN.............................................................................................................................270


17.5 ECC Cases................................................................................................................................................................273

18 Clock Problems........................................................................................................................275
18.1 Failure of NodeBs to Trace Clock Signals...............................................................................................................276
18.2 Synchronization Failure of the Physical-Layer Clock..............................................................................................277

19 Orderwire Problems................................................................................................................280
19.1 Orderwire Failure.....................................................................................................................................................281
19.2 A Called Phone Set Cannot Exit the Conference Call State.....................................................................................282
19.3 Orderwire Cases.......................................................................................................................................................283

20 Coherent Board Faults............................................................................................................284


21 Interconnection Faults............................................................................................................287
21.1 Failure in Intercommunication with the Data Equipment........................................................................................288
21.2 Failure in Intercommunication with the SDH Equipment........................................................................................289
21.3 WDM and other OTN Equipment Interconnection Failures....................................................................................291
21.4 Equipment Interconnection Cases............................................................................................................................292

A Remote Maintenance Guide...................................................................................................294


A.1 Overview....................................................................................................................................................................295
A.2 Enabling Remote Maintenance User.........................................................................................................................295
A.3 Creating Remote Maintenance..................................................................................................................................296

B Glossary......................................................................................................................................298

Issue 05 (2013-11-30) Huawei Proprietary and Confidential xiii


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 1 Overview

1 Overview

About This Chapter

This topic describes safety precautions, introduces essential instruments and tools, and outlines
maintenance personnel training requirements necessary for troubleshooting.

1.1 Safety Precautions


This section describes the safety operation guidelines. It contains personal safety guidelines and
equipment operating guidelines. These guidelines must be strictly followed to avoid any injury
to human body or damage to equipment during equipment operation.

1.2 Instruments and Tools


This topic introduces the essential instruments and tools necessary for troubleshooting.

1.3 Maintenance Personnel Qualifications


To quickly and accurately locate and resolve a fault in the optical transport system, maintenance
personnel must have sufficient background knowledge and service skills and be familiar with
the network and the equipment used. This topic outlines the skills and knowledge that
maintenance personnel require for troubleshooting the OptiX OSN equipment.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 1


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 1 Overview

1.1 Safety Precautions


This section describes the safety operation guidelines. It contains personal safety guidelines and
equipment operating guidelines. These guidelines must be strictly followed to avoid any injury
to human body or damage to equipment during equipment operation.

1.1.1 Laser
When you install and maintain equipment, follow the safety precautions described below to help
prevent personal injury or equipment damage. The laser complies with IEC60825.

The safety precautions of lasers consist of two parts:

l Personal injury
l Equipment damage

Personal Injury

DANGER
Laser beams from the optical ports on boards or from the fiber connectors cause eye damage.
Do not look directly at the optical ports or fiber connectors during the installation and
maintenance of boards or fibers. Do not shine laser beams into the eyes of other workers.

DANGER
To prevent eye damage in the case of an optical port that is in use, use protective caps to cover
the optical interface and the fiber connector after you remove the fiber from the optical interface.

DANGER
The optical power of the LINE interface on the Raman amplifier board is very high. Shut down
the pump laser before you insert or remove fiber connectors on the Raman amplifier board to
help prevent personal injury that is caused by high optical power.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 2


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 1 Overview

Equipment Damage

CAUTION
Use protective caps to cover unused optical ports and fiber connectors so that they do not gather
dust.

CAUTION
When performing a hardware loopback test on optical ports using a fiber, add an optical
attenuator to prevent damage to the equipment because of the high power of the laser beam. Add
the attenuator at the receive optical port on a board that supports optical attenuators.

CAUTION
When you use the optical time domain reflectometer (OTDR), disconnect the fiber between the
opposite station and the board to prevent damage to the receive optical module because of high
optical power.

CAUTION
Exercise caution when you remove or insert a board that is connected with fibers.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 3


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 1 Overview

CAUTION
The optical power of the Raman amplifier board is very high. Observe the following precautions
when using the Raman amplifier board to prevent damage to the equipment.
l Do not use fiber connectors within 0–20 km. The fibers at every joint point must be spliced.
l The single-point additional loss within 0–10 km must be smaller than 0.1 dB (G.652) or
0.2 dB (G.655) and the single-point return loss must not be smaller than 40 dB.
l The single-point additional loss within 10–20 km must be smaller than 0.2 dB (G.652) or
0.4 dB (G.655) and the single-point return loss must not be smaller than 40 dB.
l Fiber connections must be complete before you enable the lasers on the Raman amplifier
board. Make sure that the fiber connectors are clean. Otherwise, the fiber connectors might
be damaged when you insert or remove the fiber connectors.
l The optical power of the LINE interface on the Raman amplifier board is very high. The
LSH/APC optical connectors must be used in the fiber that is connected to the LINE
interface.
l For the Raman amplifier board with backward pump, the strong pump light enters the fiber
through the input end (LINE) instead of the output end (SYS). Do not add boards or non-
fiber devices, such as attenuators or fiber jumpers, at the input end.
l The bent radius of the fiber that is connected to the LINE interface on the Raman amplifier
board must be larger than 30 mm to prevent the fiber from being burned.

1.1.2 Short Circuit


Exercise caution where you place metallic tools so that you do not cause a short circuit in any
equipment.

CAUTION
Do not place tools, such as screwdrivers, on the air baffle.

CAUTION
Ensure that screws do not fall off into the subrack or chassis.

1.1.3 ESD
During installation and maintenance, follow ESD procedures to prevent equipment damage:

l Always wear an ESD wrist strap during the operation.


l Check that the equipment is securely grounded.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 4


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 1 Overview

CAUTION
Wear a well-grounded ESD wrist strap whenever you touch equipment or boards. Make sure
that the wrist strap touches your skin. Insert the ESD strap connector into the ESD socket of the
equipment.

For information about how to wear an ESD wrist strap, see Figure 1-1.

Figure 1-1 Wearing an ESD wrist strap

NOTE

Insert the connector of the ESD strap into the equipment port. For details, see the Quick Installation Guide.

When you are following ESD procedures, take the following precautions:

l Check the validity and functionality of the wrist strap. Its resistance value must be between
0.75 mega ohm to 10 mega ohm. If the wrist strap validity period (usually two years) has
expired, or if the resistance value fails to meet requirements, replace it with a wrist strap
that provides the required resistance value.
l Do not touch a board with your clothing. Clothing generates static electricity that is not
protected by the wrist strap.
l Wear an ESD wrist strap and place the board on an ESD pad when you replace boards or
chips. Use ESD tweezers or extraction tools to replace chips. Do not touch chips, circuits,
or pins with your bare hands.
l Keep the boards and other ESD-sensitive parts you are installing in ESD bags. Place the
removed boards and components on an ESD pad or ESD material. Do not use non-antistatic
materials such as white foams, common plastic bags, or paper bags to pack boards, and do
not let these materials touch the boards.
l Wear an ESD wrist strap when operating the ports of boards because they are also ESD-
sensitive. Discharge the static electricity of cables and protective sleeves before you connect
them to the ports.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 5


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 1 Overview

l Keep packing materials (such as, ESD boxes and bags) available in the equipment room
for packing boards in the future.

ESD complies with IEC Publication 1000, EN 55022, EN 55024, IEC 61000 and GR-1089-
CORE.

1.1.4 Working on Equipment with Power Applied


When you perform operations on the equipment when power is applied, ensure that you take the
safety precautions to prevent personal injury or equipment damage.

DANGER
Do not install or disassemble equipment when power is applied.

1.1.5 Alarm and Safety Symbols


During equipment installation and maintenance, observe the precautions indicated by the alarm
and safety symbols to help prevent personal injury or equipment damage.

Table 1-1 describes the alarm and safety symbols on the WDM equipment.

Table 1-1 Symbols on the WDM equipment

Symbol Describes

ESD protection symbol.


You must wear an ESD wrist strap or glove to avoid
damage caused by electrostatic discharge to boards.

Laser level symbol.


Indicates the laser level and warns that laser beams
CAUTION can cause injuries to eyes.
HAZARD LEVEL 1M INVISIBLE
LASER RADIATION
DO NOT VIEW DIRECTLY WITH
NON-ATTENUATING OPTICAL
INSTRUMENTS

Grounding symbol.
Indicates the position of the grounding point.

Regular cleaning symbol.


Warns you to regularly clean the air filter.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 6


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 1 Overview

Symbol Describes

Fan warning symbol.


Warns you not to touch the fan blade until the fan
stops moving.

1.2 Instruments and Tools


This topic introduces the essential instruments and tools necessary for troubleshooting.

Table 1-2 lists the instruments and tools used for troubleshooting.

Table 1-2 Instruments and tools used for troubleshooting

Name Usage

Optical power meter Tests the optical power of optical ports.


Fiber patch cord

Attenuator Attenuates the receive optical power.

Optical spectrum analyzer Tests the system index.

Optical time-domain Detects optical cable faults.


reflectometer

Adjustment pin Adjusts mechanical variable optical attenuators (VOAs).

Fiberscope Checks whether the optical fiber end faces are clean.

ESD wrist strip Protects sensitive components against static generated by the
human body.

ESD bag Safely stores static-sensitive components.

ESD uniform Protects sensitive components against static generated by the


human body.

Phillips screwdriver Tightens and loosens screws on boards.

Cassette cleaner Cleans optical fiber end faces.

Dust-free cloth Cleans optical fiber heads.

Special cleaning solvent Cleans optical fibers and flanges.


(Isopropyl alcohol is
preferred to propyl alcohol)

Special compressed gas Cleans boards and optical ports.

U2000 Software used to configure and issue commands to boards

Web LCT Software used to configure and issue commands to boards

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 7


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 1 Overview

Name Usage

Label Labels the replacement pigtails.

Fiber binding strap Binds the replacement pigtails.

1.3 Maintenance Personnel Qualifications


To quickly and accurately locate and resolve a fault in the optical transport system, maintenance
personnel must have sufficient background knowledge and service skills and be familiar with
the network and the equipment used. This topic outlines the skills and knowledge that
maintenance personnel require for troubleshooting the OptiX OSN equipment.

Background Knowledge
l WDM fundamentals
l The WDM system alarm generation mechanism
l Troubleshooting methods for common alarms

Basic Operations
l Basic transmission equipment operations
l Basic network management system (NMS) operations

For basic NMS operations, refer to the OptiX iManager U2000 Operator Guide or the on-line
help.

Common Test Instruments


Commonly needed transmission system test instruments include optical power meter, SDH/
SONET tester, SmartBits tester, optical spectrum analyzer, OTDR and communication signal
analyzer. Refer to the respective operator guide for more information.

Network Layout
The network layout of the project

The service configuration, wavelength assignment, optical distribution frame (ODF), fiber
routing, board version, and equipment layout in each office.

The current equipment running status in local office

The relevant engineering files and documents (which need to be updated periodically)

On-Site Data Collection


This process includes periodic data collection both when the equipment is running normally and
when faults occur. Before handling the fault, maintenance personnel should first collect and
store the on-site data. For more information about data collection, refer to "3.1.11 One-Click
Data Collection".

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 8


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 2 Troubleshooting Workflow

2 Troubleshooting Workflow

About This Chapter

This topic describes the troubleshooting process, information collection, and how to obtain
additional technical support and the latest technical documentation.

2.1 General Troubleshooting


This topic describes how to handle general faults.

2.2 Emergency Troubleshooting


This topic describes how to handle emergencies.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 9


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 2 Troubleshooting Workflow

2.1 General Troubleshooting


This topic describes how to handle general faults.

2.1.1 Workflow
This topic describes how to handle general faults.

The following figure describes how to handle general faults.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 10


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 2 Troubleshooting Workflow

Figure 2-1 General troubleshooting flowchart

Start

Collect and record


information

Yes
External causes? Other handling process

No

Analyze and locate fault

Yes
Fault cleared?

No

Feed back to Huawei

Work out solutions


together

Try to solve

No
Service recovery?

Yes
Observe the running
equipment

No
Fault cleared?

Yes

Summary and fill out


report

End

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 11


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 2 Troubleshooting Workflow

Fault Information Collection


The following information needs to be recorded:
l Fault occurrence time
l Operations performed before and after the occurrence of the fault
Additionally, other important data, such as alarm information and performance events on the
U2000, also needs to be saved.

External Faults
Handle faults caused by any of the following external factors:
l Power failure
l Optical cable fault
l Environment variable (for example, telecommunications room temperature)
l Terminal equipment

Technical Support
Contact Huawei Customer Service Center (CSC) for assistance locating and resolving any issues
beyond the scope of this document.
You can also contact the technical support personnel in Huawei's local representative office in
either of the following ways:
l Hotline: 4008302118
l E-mail: support@huawei.com

Observation and Confirmation


After services are restored, observe services for 24 consecutive hours to confirm that the fault
has been resolved.

Troubleshooting Reports
After the fault is resolved, fill in the related troubleshooting report in a timely manner and
summarize how the fault was handled.

2.2 Emergency Troubleshooting


This topic describes how to handle emergencies.

2.2.1 Workflow
This topic describes emergency handling workflow.
The emergency handling flow refers to the fault handling flow when the service of the equipment
is interrupted due to external faults (such as power supply fault or fiber cut), misoperations, or

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 12


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 2 Troubleshooting Workflow

software/hardware faults of the equipment. In the case of the OptiX WDM optical transmission
equipment, in addition to clearing the fault according to the troubleshooting flow, take other
emergency measures (such as providing standby channels) or ask for help in time to reduce the
service interruption duration. The emergency troubleshooting process refers to the
troubleshooting process when services on the equipment are interrupted due to external faults,
such as a power failure or damaged fiber cables, incorrect operations, or software/hardware faults
of the equipment. For the OptiX WDM optical transmission equipment, in addition to clearing
the fault according to the troubleshooting process, take other emergency measures, such as
providing standby channels, or ask for help in time to reduce the duration of service interruption.

CAUTION
When handling service interruptions or other emergencies, note the following points:
l Restore services as soon as possible. If a backup channel is available, switch services to the
backup channel.
l Analyze the fault symptoms first, find the causes, and handle the fault.
l When you fail to resolve a fault, contact Huawei technical support to troubleshoot the fault
and minimize the duration of service interruption.
l During the handling process, record and save the original fault data.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 13


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 2 Troubleshooting Workflow

Figure 2-2 Emergency troubleshooting flowchart 1


Start, a fault is reported

Yes Cancel the abnormal operation,


An abnormal operation
is performed? end

No

Adjust the affected services


Yes to the backup resources, go
Have spare resources?
to 1

No The
Locate the faulty
Yes R_LOS/R_LOF/MUT_L The power Yes
Yes section along the signal Yes
1 Check alarms with the NMS OS/IN_PWR- supply/optical cable
flow and confirm
LOW/PWR_HIGH/NE- is faulty?
whether the NE is
unreachable alarm
unreachable
exists?

No No No

Query the bit error Yes Check the


Yes All channels are SCC/OA/MUX/DEMUX
performance and check
whether all channels have abnormal? board on site
excessive bit
errors
Handle
No No the
external
The fault is a system fault
fault, go to 2 Yes The BUS_ERR
alarm exists?

No
The fault is a multi-
The fault is a cross- channel fault, go to 5
connect fault, go to 3
The
OTU_LOF/OTU_LOM/
Yes
ODU_LOF/ODU_LOM/
ODUk_PM_SSF/ODUk
_PM_BDI alarm
exists?

Yes
The fault is an Check the bit error The fault is a single- Replace the faulty
OTN/electrical layer performance and locate channel fault, go to 6 board
fault, go to 4 the channel with
excessive bit errors

The system
Yes OSNR/nonlinearity/fiber
jumper between the OTU
and MUX/DEMUX board
is faulty?

Handle the fiber


jumper/optical cable No
fault
Check the
OTU/MUX/DEMUX board

Replace the faulty board

The fault is cleared, end

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 14


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 2 Troubleshooting Workflow

Figure 2-3 Emergency handling flow chart 2

3 Cross-connection fault 4 OTN/electrical


2 System fault
layer fault

The fiber
Yes The system OSNR or Yes Yes connections between
nonlinearity is improper or The cross-connection OTU/tributary/line
the fiber jumper/optical configuration is correct? boards at the
cable is faulty? local/opposite end
are normal?

No No No

Check the The


OA/MUX/DEMUX Replace the faulty Modify the cross- Yes TD/TF/LSR_WILL_DI
board at the tributary/line/cross- connection alarm exists on the a
Handle the fiber connection
local/opposite end connect board configuration OTU/line board at the fault a
local end?

No
Replace the faulty
board
The
LSR_WILL_DIE/OUT_PWR_HIGH/
OUT_PWR/LOW alarm exists on
the OTU/line board at the opposite
end?
Handle the fiber
jumper/optical cable
fault
Replace the faulty Replace the faulty
board at the local end board at the opposite
end

a: The fiber connection faults include the


inconsistency between the logical fiber
The fault is cleared, end connection and the physical fiber connection ,
and the physical fiber connection faults.

5 Multi-channel fault 6 Single-channel fault

Both
the transmit and receive Yes
optical power values are Compare the current optical power of the
abnormal? OTU/tributary/line board of the faulty
channel with the history normal value
No
Compare the current
optical power of the OA/
MUX/DEMUX board with
the history normal value
The
OTU/tributary/line Yes
board is faulty?
The
The OA/MUX/DEMUX
Yes Yes
OA/MUX/DEMUX boards at both the transmit
board is faulty? and receive ends
are No
faulty?
No No
Replace the internal fiber
Handle the fiber Replace the faulty Handle the fiber between the OTU and Replace the faulty board
jumper/optical cable fault board jumper/optical cable fault MUX/DEMUX boards

The fault is cleared, end

2.2.2 Description
This topic describes the emergency troubleshooting process.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 15


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 2 Troubleshooting Workflow

Checking for Incorrect Operations


Check for incorrect operations performed on the equipment, such as loopback or manual laser
shutdown. If there are any, cancel them as required.

Restoring Services as Soon as Possible


When a fault occurs in the WDM system, services are interrupted. If a backup channel is
available, switch the services to the backup channel to restore the services as soon as possible
before you begin troubleshooting the fault.

Handling Alarms
l If an R_LOS, R_LOF, MUT_LOS, IN_PWR_HIGH, IN_PWR_LOW, or NE-unreachable
alarm is reported, check and analyze the alarm.
– When such an alarm occurs, if the power supply or optical cable is faulty, handle these
faults first.
– Determine whether the fault is a multi-channel or single-channel issue based on the the
range of services affected, and handle it accordingly.
– If multiple channels are faulty, check the transmit/receive optical power and compare
the current optical power of the optical amplifier (OA)/multiplexer (MUX)/
demultiplexer (DEMUX) board with the historical normal value. Check the network
section by section to determine whether the line, pigtail, or OA/MUX/DEMUX board
is faulty. Then handle the line/pigtail fault or replace the board accordingly.
– If only one channel is faulty, compare the current optical power of the optical
transponder unit (OTU)/tributary/line board on the faulty channel with the historical
normal value to determine whether the board or intra-board fiber connection is faulty.
Then replace the faulty board or change the internal fiber connections accordingly.
l If no R_LOS, R_LOF, MUT_LOS, IN_PWR_HIGH, IN_PWR_LOW, or NE-unreachable
alarm is reported, check the bit error rate (BER) performance or optical power performance
for any of the following problems.
– If all channels have excessive bit errors, a system fault has occurred. Check the network
section by section along the signal flow to determine whether the system OSNR and
nonlinearity are correct and whether the pigtail/optical cable is faulty. If the system
OSNR and nonlinearity are incorrect and the pigtail/optical cable is faulty, handle the
pigtail/optical cable fault. If the pigtail/optical cable is functioning normally, the OA/
MUX/DEMUX board is faulty and should be replaced.
– If the fault is not a system fault and the BUS_ERR alarm is reported, the fault is a cross-
connection fault. If the cross-connection configuration is incorrect, modify it to resolve
the fault. If it is correct, the tributary/line/cross-connect board is faulty and should be
replaced.
– If the OTU_LOF/OTU_LOM/ODU_LOF/ODU_LOM/ODUk_PM_SSF/
ODUk_PM_BDI alarm is reported, the OTN/electrical layer is faulty. Check whether
the fiber connections between OTU boards, tributary boards, or line boards at the local
and opposite ends are faulty. If any of these is faulty, handle the fiber connection faults,
including the inconsistency between logical and physical fiber connections and physical
fiber connection faults.
– If a TD/TF/LSR_WILL_DIE alarm is reported by an OTU/line board, the board is faulty
and should be replaced.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 16


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 2 Troubleshooting Workflow

TIP
As shown in the flowchart, optical-layer faults and fiber connection faults cause OTN alarms.
Therefore, these faults must be handled first. After these faults are cleared, OTN alarms will typically
clear also.

If you isolate the fault to a specific board, replace the board.

Table 2-1 lists common alarms and their causes for the OA, MUX, DEMUX or OTU/tributary/
line board.

Table 2-1 Common alarms and their causes

Board Type Alarm Possible Cause Action


a

OA board MUT_LOS Power-related alarm Check whether the optical


Possible cause: power of the upstream
optical path and station is
The received signals are normal.
abnormal.

PUM_BCM_A Temperature-related alarm l Power on the fan.


LM Possible causes are as l Adjust the ambient
PUM_COOL_ follows: temperature.
EXC l The fan is powered off. l Replace the faulty
l The ambient board.
temperature is too high
or too low.
l The board is faulty.

Optical MUT_LOS Power-related alarm Check whether the optical


Multiplexer/ Possible cause: power of the upstream
Demultiplexe optical path and station is
The received signals are normal.
r Board abnormal.

MODULE_CO Temperature-related alarm. l Power on the fan.


OLCUR_OVE Possible causes are as l Adjust the ambient
R, follows: temperature.
MODULE_TE
MP_OVER l The fan is powered off. l Replace the faulty
l The ambient board.
temperature is too high.
l The board is faulty.

OTU/ R_LOS, Power-related alarm Check whether the optical


tributary/line IN_PWR_LO Possible cause: power of the upstream
board W, optical path and station is
IN_PWR_HIG The received signals are normal.
H abnormal.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 17


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 2 Troubleshooting Workflow

Board Type Alarm Possible Cause Action


a

R_LOF, Power-related alarm l Check whether the


B1_EXC, Possible cause: optical power of the
B1_SD, upstream optical path
B2_EXC, The received signals are and station is normal.
B2_SD abnormal.
l Check whether the
If the module in the receive optical power of the
end is faulty, these alarms receive end is normal.
will also occur.

TF, TD, Mainly related to the Check whether the


OUT_PWR_L transmit unit of the board. If pluggable optical module
OW, the optical module on the or the board is faulty.
OUT_PWR_HI board is pluggable, check
GH the module.

LSR_COOL_A Temperature-related alarm l Power on the fan.


LM, Possible causes are as l Adjust the ambient
LSR_WILL_DI follows: temperature.
E
l The fan is powered off. l Replace the faulty
l The ambient module.
temperature is too high. l Replace the faulty
l The pluggable optical board.
module is faulty.
l The board is faulty.

NOTE

a: Common alarms are listed by board type in the table. Alarms for different boards of the same type may
differ slightly.

In addition, there are various running and alarm indicators with different colors on the equipment.
The On/Off and flashing states of these indicators indicate the current running state or possible
alarm of the equipment. Such indication helps you analyze the cause of the fault and accordingly
handle the alarm. Refer to the Indicators.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 18


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 2 Troubleshooting Workflow

CAUTION
If you observe only the alarm indicators on the top of the cabinet, minor equipment alarms may
be missed (in the case of minor alarms, the alarm indicators on the top of the cabinet do not light
up, whereas the alarm indicators on the board do). In most cases, a minor alarm indicates a
potential fault at the local or opposite end. Such alarms cannot be neglected.
Equipment indicators indicate only the current operating status of the equipment, not the faults
that have already occurred and been cleared.
The indicator status that corresponds to a specific alarm category can be redefined on the U2000,
and certain alarm types can even be suppressed.
The board alarm indicator reports the alarm with the highest severity level detected by the board.

Isolating and Resolving External Faults


Before locating a fault in the system, identify and resolve external faults, including optical cable
or fiber faults, client equipment faults, and power failures.

l Troubleshooting a client equipment fault


To troubleshoot a fault on client equipment, use either of the following methods:
– Method 1: Connect a bit error meter between the receive port and transmit port of the
OTU/tributary board and perform a bit error test. Use a pigtail to short-circuit the
transmit port of the corresponding OTU/tributary board at the opposite station to the
receive port of the corresponding OTU/tributary board. Check for bit errors for 24 hours.
If no bit error is detected, the fault is occurring on the client side.

CAUTION
Before short-circuiting the transmit port of the corresponding OTU/tributary board at
the opposite station to the receive port of the corresponding OTU/tributary board,
determine whether to add an optical attenuator to the receive optical port on the OTU/
tributary board. The appropriate optical attenuator is determined based on the optical
power specification on the transmit port of the OTU/tributary board. Otherwise, optical
power overload might damage the OTU/tributary board.

– Method 2: For an OTU/tributary board that supports B1 bit error detection, compare
whether the RSES performance value of the OTU/tributary board at the local station is
the same as that at the opposite station. If they are the same, the system has no new bit
errors and is operating normally, which indicates that the fault is located on the client
side.
– Method 3: If users check client equipment on site, they can perform a self-loop (an
optical attenuator is required) on the receive and transmit optical ports of the client
equipment to check the alarms on the equipment. If alarms persist or the bit error meter
still detects bit errors, the fault is occurring on the client side.
l Troubleshooting an optical cable fault
To troubleshoot an optical cable fault, use any of the following methods:

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 19


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 2 Troubleshooting Workflow

– Method 1: Test the input optical power of the local station and the output optical power
of the upstream station. If the difference between the two values (that is, the attenuation)
is less than the designated value, the optical cable is functioning normally; otherwise,
the optical cable is faulty.
– Method 2: Check whether optical cable parameters such as the type and length meet the
design requirements. If not, the optical cable is faulty.
– Method 3: If the optical cable meets design requirements, check whether the optical
cable connector is properly attached. Ensure that the connection is normal.
– Method 4: Switch system services to the backup optical cable. If the alarm is cleared,
the optical cable is faulty.
– Method 5: Use an OTDR to measure the optical power and determine whether the optical
cable is faulty. If the reflectance of the tested optical fiber core is less than 27 dB and
the attenuation of the fiber core is less than the designated value, the optical cable is
functioning normally. Otherwise, the optical cable is faulty. Note that the OTDR has a
blind spot within a short distance (smaller than 5 km), in which the test results are
inaccurate.

CAUTION
When you use the OTDR, separate the optical fiber from the equipment. Otherwise, the intense
OTDR light might damage the equipment.

l Troubleshooting a power supply fault

If you cannot log in to a station and all boards connected to the station report alarms indicating
input signal loss, the power supply of this station might be faulty. This causes a power failure
at this station and triggers the alarm. If this station suddenly enters an abnormal operating status,
the optical power of the station will suddenly decrease, some boards will operate abnormally,
services will be interrupted, and abnormal login will occur. Check whether the power supply
voltage of the transmission equipment is or a low-frequency transient voltage surge occurred.

l Troubleshooting a grounding fault

If the equipment is struck by lightning or cannot be interconnected, check the grounding. Check
whether the equipment is grounded in compliance with the specifications, whether any
equipment is disconnected from the public ground bar, and whether all types of equipment in
the same telecommunications room are grounded similarly. Then use an earth resistance tester
to test whether the grounding resistance and the potential between working and protection
grounds are within the permitted range.

Isolating and Resolving Optical Fiber or Board Faults


To troubleshoot an optical fiber or board fault, use any of the following methods:

l Method 1: Measure the input optical power of the board reporting the alarm and the output
optical power of the corresponding board at the opposite station.
– If the transmit optical power of the corresponding board at the opposite station is normal
and the difference between the receive optical power of the board at the local station

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 20


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 2 Troubleshooting Workflow

and the transmit optical power of the corresponding board at the opposite station is
greater than the designed value, the optical fiber is faulty.
– If the transmit optical power of the corresponding board at the opposite station is too
low, the pluggable optical module on the board is faulty or the board is faulty.
l Method 2: Connect the board and optical meter using a new pigtail and measure the transmit
optical power of the board.
– If the measured optical power is normal, the original optical fiber is faulty.
– If the measured optical power is still too low, the pluggable optical module on the board
is faulty or the board is faulty.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 21


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 3 Fault Isolation Basics

3 Fault Isolation Basics

About This Chapter

This topic describes the basic concepts and methods used to locate faults.

The key to locating faults is to pinpoint a fault to a single station.

The general fault isolation rules are as follows:

l Locate external faults first, followed by internal faults.


l Locate network faults first, followed by NE faults.
l Locate high-severity alarms first, followed by low-severity alarms.
l Locate multi-channel signal alarms first, followed by single-channel signal alarms.
l Locate bidirectional signal alarms first, followed by unidirectional signal alarms.
l Locate public alarms first, followed by individual alarms.

Locate External Faults First, Followed by Internal Faults


Before locating a fault, confirm that external conditions are normal. For example, no faults are
occurring in optical cables and fibers, client equipment, or power supply.

Locate Network Faults First, Followed by NE Faults


When a fault occurs on the transmission equipment, multiple stations, rather than a single station,
may be reporting alarms. Fault analysis is required to quickly and accurately determine the
station at which the fault occurs.

Locate High-severity Alarms First, Followed by Low-severity Alarms


First analyze high-severity alarms, including critical and major alarms. Then analyze low-
severity alarms, such as minor alarms and warnings.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 22


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 3 Fault Isolation Basics

Locate Multi-channel Signal Alarms First, Followed by Single-channel Signal


Alarms
When analyzing alarms, determine whether multiple channels are faulty or only a single channel
is faulty. If multiple channels are faulty, locate the fault to the multiplex section (MS). After the
fault of the MS is resolved, the single-channel alarm will be subsequently cleared.

Locate Bidirectional Signal Alarms first, Followed by Unidirectional Signal Alarms


When analyzing alarms, if alarms are present in both the local receive and opposite transmit
directions, check for similar faults in the opposite receive and local transmit directions. If alarms
are being reported in both directions, analyze and handle them first.

Locate Public Alarms First, Followed by Individual Alarms


When analyzing alarms, check whether they are public or individual alarms and determine the
scope of their impact. That is, determine whether one or more boards are reporting the alarms.
For a board with multiple optical ports, determine whether one, multiple, or all optical ports
have bit errors.

3.1 Common Fault Isolation Methods


This topic provides common fault isolation methods.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 23


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 3 Fault Isolation Basics

3.1 Common Fault Isolation Methods


This topic provides common fault isolation methods.
Common methods for locating hardware faults can be briefly summarized as "First analyze, then
perform a loopback, and finally replace the board."
When a fault occurs, first analyze the signal flow, alarms, and performance event data to
determine the possible faulty station or optical section. Then measure the optical power section
by section, analyze the optical spectrum, and perform a loopback to isolate the fault to a specific
board. If the fault persists, replace the board or optical fiber.

3.1.1 Service Signal Flow Analysis


A common method for isolating faults is to check for faults along the signal flow, station by
station. This method allows you to quickly determine where the fault is occurring.
The following example describes service signal flow analysis.

Application 1: Each Station Uses Tributary and Line Boards


l Fault description:
Figure 3-1 shows the networking structure of this example. A channel of client services
from station A to station B is interrupted, and the client equipment at station B is either not
receiving optical signals or is receiving a large number of bit errors.

Figure 3-1 Signal flow analysis: Each station uses tributary and line boards
Tributary Line Line Tributary Client
Client
board board MUX OA OA DEMUX board board equipment
equipment

DEMUX OA OA MUX

Station A Station B

Signal Flow Fault Point

NOTE
The U2000 provides the service signal flow interface. For details on how to view the service signal
flow using the GUIs, see "Viewing the Signal Flow Diagram for a WDM Trail" in the Online
Help.

l Fault analysis:
As shown in the preceding figure, the service signal flow of the client equipment at station
B is: client equipment at station A → the tributary board at station A → the cross-connect
board at station A → the line board at station A → MUX at station A → OA at station A
→ OA at station B → DEMUX at station B → the line board at station B → the cross-
connect board at station B → the tributary board at station B → client equipment at station
B.
Possible causes are as follows:

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 24


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 3 Fault Isolation Basics

– The transmit unit of station A is faulty.


– The optical path (including the optical fibers and fiber connectors) is faulty.
– The receive unit of station B is faulty.
1. Analyze the alarms and performance of the tributary board at station A. If the client-
side port of this tributary board receives no optical signal or its receive optical power
is too low, the fault location can be refined to the transmit end of the client equipment
at station A, the pigtail from the client equipment to the tributary board, or the client-
side receive module of the tributary board.
2. If the client-side input optical power of the tributary board at station A is normal,
check whether the cross-connection configuration is correct. If it is correct, locate the
corresponding line board based on the cross-connection. Then, check the alarms and
performance events of the tributary and line boards at station B for BUS_ERR alarms.
If BUS_ERR alarms are reported, the fault may be occurring on the tributary, cross-
connect, or line board. If needed, you can change the configuration or replace the
related boards.
3. If no BUS_ERR alarms are reported, check whether the output optical power of the
line board is normal. If the optical power is abnormal, the fault is occurring on the line
board.
4. The remaining steps are similar to the fault analysis steps of a tributary/line integrated
board.

Application 2: Each Station Uses the OTU board


l Fault description:
Figure 3-2 shows the networking structure. A channel of client services from station A to
station B are interrupted, and the client equipment at station B receives no optical signals
or receives a large number of bit errors.

Figure 3-2 Each Station Uses the OTU board in Application 2

Client Client
equipment OTU MUX OA OA DEMUX OTU equipment

DEMUX OA OA MUX

Station A Station B

Signal Flow Fault Point

NOTE
The U2000 provides the service signal flow interface. For information on how to view the service
signal flow in GUIs, see "Viewing the Signal Flow Diagram for a WDM Trail" in the Online Help.

l Fault analysis:
The Client equipment at station B receives no optical signals or receives a large number of
bit errors. As shown in Figure 3-2, the service signal flow of the Client equipment at station
B is: Client equipment at station A → OTU at station A → MUX at station A → OA at

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 25


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 3 Fault Isolation Basics

station A → OA at station B → DEMUX at station B → OTU at station B → Client


equipment at station B. Possible causes are as follows:
– The transmit unit of station A is faulty.
– The optical path (including the fibers and fiber connectors) is faulty.
– The receive unit of station B is faulty.

1. Analyze the alarms and performance of the OTU at station A. If the client-side
interface of this OTU receives no optical signal or its received optical power is too
low, locate the fault to the transmit end of the Client equipment at station A, the fiber
jumper from the client equipment to the OTU, or the client side receiver module of
the OTU.
2. If the input optical power of the OTU at station A is normal, check whether its output
optical power is normal. If not, locate the fault to this OTU.
3. If the output optical power of the OTU at station A is also normal, observe whether
the optical power of the MUX at station A sharply changes. If there are many
wavelengths at station A, the loss of one of them does not results in a great change to
the optical power. Therefore, feed the signals at the MON port of the MUX to the
MCA board and query for any loss-of-wavelength alarm.
4. The key components on the MUX are passive; therefore, the MUX is not likely to be
damaged. When the MCA detects the loss of a wavelength, the most possible fault
point is between the fiber jumper from the OTU to the MUX.
5. The OA provides a function of input and output optical power detection. If the OA is
faulty, multiple wavelengths are affected. As a result, the possibility is rather low that
the OA board is faulty.
6. At station B, analyze the signal flow in this order: OA at station B → DEMUX at
station B → OTU at station B → Client equipment at station B. The method to analyze
the signal flow at station B is similar to that of station A.

3.1.2 Alarm and Performance Analysis


When a fault occurs in the system, it is usually accompanied by a large number of alarms and
abnormal performance events. By analyzing this information, you can approximate the type and
location of the fault.

To review the alarms and performance events, do as follows:


l Query the current or historical alarms and performance events on the U2000.
l Review the running status or the severities of the alarms using the indicators on the open
rack and the boards.

The U2000 fault isolation capabilities can be affected by system faults, and are classified as
follows:
l Comprehensive: You are able to obtain fault information for the entire network.
l Accurate: You are able to obtain the current alarms, alarm generation time, and historical
alarms of the equipment. In addition, you are able to obtain the specific values of the
performance events.
l Complex: In some cases, a significantly high number of alarms and performance events
can make analysis very difficult.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 26


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 3 Fault Isolation Basics

l Dependent: Fault isolation is entirely dependent on the normal operations of the computer,
software, and communication equipment. If any of these are faulty, fault isolation
capabilities may be limited or entirely lost.

CAUTION
l Before querying the performance data on the U2000, enable performance monitoring on the
U2000. Otherwise, the performance data will not be reported to the U2000.
l Before querying the alarm or performance data on the U2000, ensure that the running time
of each NE is correctly set. Otherwise, the alarm and performance data will be incorrect or
not reported to the U2000.
l During maintenance, after re-issuing the configuration to a NE, set the NE time to the current
time. Otherwise, the NE will operate at the incorrect default time.

The generation, detection, and transmission of an alarm varies with the OTU type and the type
of signals that the OTU receives. Alarm signal flow analysis allows you to quickly determine
where the fault is occurring.

Application 1: Alarm Signal Flow for a Non-convergence OTU Board to Process


SDH-compliant Signals
An R_LOS alarm generated by the OTU board at the local station is used as an example to
describe the alarm signal flow analysis when a non-convergent OTU board processes SONET/
SDH standard signals, as shown in Figure 3-3.

Figure 3-3 Alarm and performance analysis: A Non-convergence OTU board processes SDH-
compliant signals

Client Client
equipment OTU MUX OA OA DEMUX OTU equipment

DEMUX OA OA MUX

Station A Station B

Signal Flow Fault Point

NOTE

The automatic laser shutdown (ALS) function of the OTU board shown in the figure is disabled.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 27


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 3 Fault Isolation Basics

Figure 3-4 Alarm and performance analysis: fault isolation

Station A Station B Client device

Client-side WDM-side WDM-side Client-side Client-device

R_LOS
R_LOS REM_SF R_LOF

Alarm processing

xxx Detects and reports the xxx alarm.

As shown in Figure 3-4, the client side of the OTU board at station A receives R_LOS signals.
The WDM side of this OTU board reports an R_LOS alarm and sends the fault information to
station B. The client side of the OTU board at station B reports the REM_SF alarm and sends
the fault information to the downstream client device. The client device then reports an
R_LOF alarm.

If the client-side OTU board at station A reports the R_LOS alarm to the U2000, the client-side
equipment at station B reports the R_LOF alarm. In this case, the input signals on the client side
of the OTU board at station A are abnormal. Therefore, if the input signals on the client side of
the OTU board at station A are abnormal, the client device is normal and the local device is
faulty.

Application 2: Alarm Signal Flow for a Non-convergence OTU Board to Process


OTN-compliant Signals
In this example, a non-convergence OTU board is used to describe how to locate a fault using
alarm signal flow analysis after an OTU2_LOF alarm is generated when the board is processing
OTN services. For other alarms, the fault analysis is similar. In Figure 3-5, ODU2 services are
transmitted on the line, and the optical fiber between station A and station B degrades.

Figure 3-5 Alarm and performance analysis: A non-convergence OTU board processes OTN-
compliant signals

Client Client
equipment OTU OTU-W OTU-E OTU equipment

OTM OTM OTM OTM


Station A Station B Station C

NOTE

The ALS function of the OTU board shown in this figure is disabled.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 28


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 3 Fault Isolation Basics

Figure 3-6 Alarm and performance analysis: fault isolation

Station A Station B Station C


OTU-W OTU-E
WDM-side WDM-side WDM-side WDM-side

OTUk_LOF/
OTUk_LOM/ SF
OTUk_AIS OTUk_LOF/
OTUk_LOM/ ODUk_PM_AIS
OTUk_AIS

OTUk_BDI

ODUk_PM_BDI

SF: SF event detecting Alarm processing


xxx Detects and reports the xxx alarm or performance event

As shown in Figure 3-6, when the optical fiber between station A and station B degrades, an
OTU2_LOF alarm is detected on the WDM side of the OTU-W at station B. After being
processed at station B, the OTU2_LOF alarm is transmitted to stations A and C. (If the optical
fiber is severely degraded, either an LOF or LOM alarm is generated. In this example, an LOF
alarm is generated.) At station C, an ODU2_PM_AIS alarm is detected on the WDM side of the
OTU board. After being processed at station C, the ODU2_PM_AIS alarm is transparently
transmitted to station B and then to station A. Then, station A reports OTU2_BDI and
ODU2_PM_BDI alarms.

If the WDM side of the OTU board at station A reports the OTU2_BDI and ODU2_PM_BDI
alarms to the U2000, the WDM side of the OTU-W board at station B reports the OTU2_LOF
alarm, and the WDM side of the OTU board at station C reports the ODU2_PM_AIS alarm. In
this case, the input signals on the WDM side of the OTU board at station B are abnormal. Further
analysis can be performed to determine whether the fault is caused by degraded optical fibers.

3.1.3 Fault Analysis Using Test Instruments


This method is used to resolve external transmission equipment problems or interconnection
problems. The test instruments must be available on site and maintenance personnel must have
been properly trained.

Overview
The most commonly used test instruments for the WDM system include the optical power meter,
optical spectrum analyzer, SDH/SONET tester, SmartBits tester, communication signal analyzer
and multimeter. Among these tools, the optical power meter and optical spectrum analyzer are
most often used.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 29


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 3 Fault Isolation Basics

Application
l Optical power meter
The optical power of each point can be obtained from the U2000 performance data.
However, to get a precise reading for a specific point, you must measure the optical power
at that point using an optical power meter.

CAUTION
When the output optical power of the MUX boards, the input optical power of the DEMUX
boards, or the input and output optical power of the OA boards are abnormal, all services
are interrupted if the line is disconnected for testing. Therefore, do not disconnect the line
to test the optical power of the main-path signal unless absolutely necessary.

You can test the output optical power on the MON port of various boards. Use this tested
value and the ratio of the tested value to the optical power of the main-path signal to
calculate the optical power of the main-path signal.
l Optical spectrum analyzer
Use an optical spectrum analyzer to test the optical spectrum of the output signal on the
MON port of the board. Read from the spectrum analyzer the optical power, OSNR, central
wavelength and analyze the gain flatness of the OA. Compare this data with the original
data and check whether there is a significant performance deterioration (for the original
data, refer to the engineering documentation). Verify the following items:
– Per-channel optical power is normal and the gain flatness is normal.
– OSNR meets the design requirements.
– Deviation of the central wavelength is lower than the specification.
NOTE

The main-path optical spectrum on the MON ports of boards can be tested online. If all the services
carried on the main path are affected, analyze the optical spectrum of the OA boards. If only one
service channel on the main path is affected, analyze the optical spectrum of the MUX and DEMUX
boards.
l SDH/SONET tester, SmartBits tester, and communication signal analyzer
If it is suspected that the poor interconnection is caused by signal error, use these analyzers
to check whether the frame signals and overhead bytes are normal, and check whether there
are any abnormal alarms.
l Multimeter
If it is suspected that the power supply is too high or too low, use a multimeter to measure
the input voltage.
If it is suspected that the poor interconnection is caused by improper grounding, use a
multimeter to measure the voltage between the shield layers of the coaxial ports at the
transmit and receive ends of interconnected channels. If the voltage is more than 0.5 V, the
grounding is improper.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 30


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 3 Fault Isolation Basics

3.1.4 Loopback
The loopback method is the most common and effective method to locate a fault in a transport
network. The key feature of this method is that it does not require in-depth analysis of alarm and
performance information. Maintenance engineers should be familiar with this method.

There are software loopbacks and hardware loopbacks, which have the following advantages
and disadvantages:

l A hardware loopback is performed on a physical port (optical port) using an optical fiber.
Compared with the software loopback, the hardware loopback is more reliable. The
hardware loopback, however, requires on-site operations. In addition, a receive optical
power overload must be avoided when performing a hardware loopback.
l Compared with the hardware loopback, the software loopback is simpler but cannot locate
a fault as accurately. For example, during a single station test, if a software inloop is
performed on an optical port and the service is normal, the line board may not necessarily
be normal. However, if a selfloop is performed on the same optical port using a pigtail and
the service is normal, line board is normal.

For loopback configurations, refer to Performing Software Loopback and Performing Hardware
Loopback.

CAUTION
When performing a loopback on the optical path of the multiplex section, ensure that the OSNR
and dispersion meet the OTU requirements.
This method affects service quality. Therefore, it is generally used in deployment commissioning
or fault location in situations where services have been interrupted.
If the protection scheme has been configured, loopback is not recommended. Otherwise,
protection switching may fail. If a loopback is required, lock the protected services on the current
channel before performing the loopback, and unlock the services after the loopback is completed.

Application
The loopback method is applicable when the approximate range of the fault is known. The fault
is located by applying the loopback method on a section by section basis. This method clears
board and line faults. Figure 3-7 provides an example of fault isolation using the loopback
method.

Figure 3-7 Loopback method: fault isolation


Client Client
equipment OTU MUX OA OA DEMUX OTU equipment

DEMUX OA OA MUX
Inloop on
client side
Station A (OTM) Station B (OTM)

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 31


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 3 Fault Isolation Basics

In Figure 3-7, the client side of the OTU at station A reports an R_LOS alarm whereas the
WDM side at station A reports no alarm. The WDM side and client sides of the OTU board at
station B report no alarms whereas the client equipment at station B reports an R_LOF alarm.

Perform an inloop on the client side of the OTU board at station A as shown in Figure 3-7.

l If the alarms at both stations A and B are cleared, the OTU board at station A is receiving
no signal from the client equipment at station A. The fault is occurring on the client
equipment at station A or the optical fiber from the client equipment at station A to the
OTU board at station A.
l If the alarms at stations A and B persist, the fault is occurring on the OTU board at station
A.
NOTE

l The faulty point can be identified quickly using the analysis method described in the "Alarm and
Performance Analysis" and analyzing the fault based on the alarm signal flow.
l If the system is capable of automatically releasing a loopback, the automatic loopback release period
can be changed or entirely disabled on the U2000. By default, the automatic loopback release period
is five minutes.

3.1.5 Replacement
In this method, a component suspected to be malfunctioning is replaced with a functional
component, in order to locate and resolve the fault. The component might be a section of pigtail,
a board, a flange, or an attenuator.

Application
The replacement method is used to resolve problems on external equipment, such as optical
fibers, flanges, client equipment, and power supply equipment. It is also used to handle faults
on boards or modules at a single station. Figure 3-8 and Figure 3-9 provide two examples.

The advantage of the replacement method is that the fault can be located within a small range,
and is a relatively simple procedure for maintenance personnel to perform. This method requires
that spare components be available. In addition, the operator must exercise caution when
performing operations. For example, when a board is being inserted or removed, careless
operations that could damage board components should be avoided.

Figure 3-8 Interchanging pigtails: fault isolation


Client
equipment OTU
A

As shown in Figure 3-8, when the OTU or tributary board reports an R_LOS alarm on the client
side but the receive module of the client equipment reports no alarm, swap pigtails A and B. If
the R_LOS alarm persists, the transmit module of the client side or the receive module of the

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 32


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 3 Fault Isolation Basics

OTU or tributary board is faulty. If the R_LOS alarm is cleared but the client equipment reports
an R_LOS alarm, pigtail A is faulty.

CAUTION
Before performing the test, disable the automatic laser shutdown (ALS) function of the lasers
on the OTU or tributary board and the client equipment.

Figure 3-9 Replacing the board: fault isolation

Client Original Client


equipment OTU MUX OA OA DEMUX OTU equipment

DEMUX OA OA MUX

Replacement
OTU Station A (OTM) Station B (OTM)

Fiber jumper connected to the original OTU


Fiber jumper connected to the replacement OTU

As shown in Figure 3-9, when the OTU or tributary board reports an R_LOS alarm but the
receive module of the client equipment reports no alarm, replace the original OTU or tributary
board with a backup OTU board. Observe the alarms on the OTU board and client equipment.
If the alarm is cleared after the replacement, the original OTU board is faulty and needs to be
repaired. The following procedures can be used to narrow down the possible cause of the OTU
failure:

l If it is suspected that one of the tributary board channels or some a tributary board is faulty,
change the timeslot configuration to switch services to another channel or tributary board.
l If it is suspected that a slot is faulty, change the slot configuration.
l If it is suspected that a VC-4 channel is faulty, switch the time slot to another VC-4 channel.

3.1.6 PRBS Test


If no commissioning instruments are available in the network maintenance phase or on a new
network where no services are deployed, you can perform a self-transmit and self-receive test
using the pseudo random bit sequence (PRBS) function to check whether links on each section
are normal by performing loopbacks section by section.

Figure 3-10 shows a PRBS application. A local board with the PRBS test function sends PRBS
codes and analyzes the PRBS codes looped back from the peer end. By comparing the
loopbacked PRBS codes with the PRBS codes that should be received according to the
theoretical calculation, the local board determines whether equipment and the transmission line
are normal.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 33


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 3 Fault Isolation Basics

Figure 3-10 PRBS application


PRBS is enabled on the auxiliary boards
Sends PRBS codes

WDM
network

Tester Near-end Far-end


board auxiliary auxiliary
board Inloop on Outloop on board Inloop on
WDM side WDM side client side
Site A Site B

CAUTION
l When a PRBS test is in progress, only query operations can be performed, no configurations
can be delivered to involved boards, the involved boards cannot carry any services, and the
original services on the boards will be interrupted.
l The LTX board does not support a PRBS test on multiple optical ports at the same time.

NOTE

l The PRBS test function is targeted for use during deployment and fault location. After deployment and
fault location, users must set PRBS Test Status to Disabled.
l PRBS codes vary according to the client-side service types. Therefore, to perform a client-side PRBS
test, users must ensure that the client-side service types for the tester board and auxiliary board are the
same.

The PRBS test is applicable to two networking modes, as provided in Table 3-1.

For a network, a client-side PRBS test covers a larger area than a WDM-side PRBS test, as
illustrated in Table 3-1. If a board on which a PRBS test has been started does not receive the
PRBS test signals that the board has sent, the board reports a PRBS_LSS alarm.

For information about how to complete PRBS configurations, see Configuring PRBS Test on
the Meter Board in Commissioning Guide.

For the boards that support the PRBS function, refer to Basic Functions of OTUs, OTN Tributary
Boards, OTN Line Boards and Packet Service Boards in Hardware Description.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 34


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 3 Fault Isolation Basics

Table 3-1 PRBS applications

Test Mode Service Type Networking Diagram

Client-side OTN signals/SDH TX RX OUT IN TX


1
PRBS test services Tester Near 2
Remote
WDM network 3 auxiliary 4
board auxiliary board
RX TX board IN OUT
RX

Client Client WDM WDM Client


side side side side side

1 Start PRBS test


2
Loopback on the WDM side/fiber loopback
3
4 Loopback on the client side/fiber loopback

WDM-side N/A OUT IN


1
PRBS test
Remote
Tester 2
WDM network auxiliary
board IN
OUT board

WDM WDM
side side

1 Start PRBS test


2 Loopback on the WDM side/fiber loopback

Tester board: generates PRBS test signals and monitors the loopbacked PRBS test signals
from the remote board. By comparing the transmitted and received PRBS test signals, the
board determines whether the current link and equipment are normal.
Auxiliary board: connects a tester board and the network under test and transparently transmits
the PRBS test signals. On the auxiliary boards at the near end, PRBS Test Status need to be
set to Enabled, only when client-side services are other than OTN services.
When a tributary or line board is used as a tester or auxiliary board, cross-connections need
to be configured to form a service path.

3.1.7 Configuration Modification


This method also removes problems caused by configuration errors in a known faulty NE. In
addition, this method is used to troubleshoot pointer justifications. Use this method to modify
the timeslot configuration, slot configuration, and board parameter configuration.

This method is used to resume the service temporarily when no spare board is available for the
replacement, or is used in handling the problem of pointer justification. The maintenance
personnel may not find this method convenient. In addition, save the original configuration
before you use this method. Record the steps used to locate the fault.

l If some path of a tributary board or some tributary board is suspected to be faulty, modify
the timeslot configuration to shift the payload to other path or tributary board.
l If a certain slot is suspected to be faulty, change the slot configuration.
l If a VC-4 is suspected to be faulty, shift the traffic to another VC-4.
l During the upgrade or expansion, if you are unsure about the new configuration, you can
re-load the original configuration for confirmation.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 35


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 3 Fault Isolation Basics

l To modify the configuration in case of pointer justification, modify the tracing direction
and the reference source of the clock.

Modifying the timeslot configuration, however, does not help to locate the faulty point or faulty
board, for example, a line board, tributary board, cross-connect board or backplane. In this case,
use the replacement method or the loopback method to further locate the fault. This method is
applicable in the preliminary process of locating faults when spare boards are not available.
Other service channels or slots are used to resume the service temporarily.

3.1.8 Test Frame


A test frame is a data packet to test connectivity of networks carrying Ethernet services.

If a test meter or instrument is unavailable on site, the board receiving Ethernet services can be
used to send test frames. Test frames help operators commission services and locate faults during
network deployment or maintenance.

NOTE

The test frame function complies with the internal standard of Huawei.
When a test frames is in progress, only query operations can be performed, no configurations can be
delivered to involved boards, the involved boards cannot carry any services, and the original services on
the boards will be interrupted.

Application
A network can be into divided into access networks and a transmission network. On the access
networks, boards on OptiX OTN products are interconnected with client-side devices; on the
transmission network, boards on OptiX OTN products are interconnected with each other on the
WDM side.

Figure 3-11 and Figure 3-12 shows how to use test frames to assess the connectivity of network
links.

For OptiX OTN products, Ethernet boards with the test frame function send test frames in either
of the following modes: GFP test frame and Ethernet test frame.

l GFP test frame: send test frames using GFP management frame for GE and 10GE service.
The local NE compares the number of test frames transmitted with the number of response
frames received by the local board to detect network connectivity.
l Ethernet test frame: directly send test frames at MAC layer of 40GE and 100GE service.
The local NE compares the number of test frames transmitted with the number of test frames
received by the local board to detect network connectivity.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 36


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 3 Fault Isolation Basics

Figure 3-11 Application 1 of GFP test frames

Access Network Transmission Network Access Network

Router NE1 NE2 Router

Test Frame
MAC Internal Logical Internal Logical MAC
Port Port
Board A Response Frame Board B

1 The internal logical ports of After receiving the test 2


Board A transmit a test frame on Board B, Board B
frame to the internal logical transmits back a response
ports of Board B. frame.

3 • NE1 counts the number of


transmitted test frames and received
test frames on Board A.
• NE1 enables the user to determine OptiX OTN Product
the quality of the service network.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 37


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 3 Fault Isolation Basics

Figure 3-12 Application 2 of Ethernet test frame

Upward

Access Network Transmission Network Access Network

Router NE1 NE2 Router

OTN signal carrying


Test frame
an Ethernet test frame
MAC OTN OTN MAC

Board A Board B

1 The client-side port on board A


generates an Ethernet test frame and
encapsulates it into an OTN signal.
Then the OTN signal carrying the test
frame is sent to board B.
2
• NE1 counts the number of transmitted
test frames and received frames on
board A. OptiX OTN product
• NE1 enables the user to determine the
quality of the service network.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 38


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 3 Fault Isolation Basics

Downward

Access Network Transmission Network Access Network

Router NE1 NE2 Router

Test OTN signal carrying an


单板 A frame 单板 A Ethernet test frame
OTN MAC MAC OTN OTN MAC

Board A Board B Board C

2
1 The client-side port on
board A transmits an After receving the Ethernet test
Ethernet test frame to frame, board B encapsulates it into
the client-side port of an OTN signal and transmits the
board B. signal to board C.
3

• NE1 counts the number of transmitted


test frames and received frames on
board A.
• NE1 enables the user to determine the OptiX OTN product
quality of the service network.

NOTE

l Upward: The client-side port on board A generates an Ethernet test frame and send it to the WDM-side on
board B.
l Downward: The client-side port on board A generates an Ethernet test frame, send it to the WDM-side on
board C by the client-side port on board B.

l Application 1 of GFP test frames: In Figure 3-11, three counters monitor test frames
response frames. The first monitors the number of test frames transmitted by board A in
step 1, the second is the number of response frames transmitted by board B in step 2, and
the third is number of test frames received by board A in step 3. Based on service flows, if
the number of test frames transmitted by board A is equal to that of response frames received
by board A, the transmission link is normal. If the two numbers are different, packet loss
is occurring along the transmission link, indicating a fault on the link.
l Application 2 of ethernet test frames (for 40GE/100GE service): In Figure 3-12, two
counters monitor test frames response frames. The first monitors the number of test frames
transmitted by board A in step 1, the second is number of test frames received by board A
in step 2. Based on service flows, if the number of test frames transmitted by board A is
equal to that of test frames received by board A, the transmission link is normal. If the two
numbers are different, packet loss is occurring along the transmission link, indicating a
fault on the link.

For test frame configurations, refer to Configuring Test Frames in the Supporting Tasks.

Availability
Table 3-2 lists the boards that support the test frame function on OptiX OTN products.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 39


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 3 Fault Isolation Basics

Table 3-2 Boards that support the test frame function on OptiX OTN products

Service Type Board Type with the Test Frame Function

OTU Board Tributary Board

FE a TN12LDM, TN11LDMD, TN11TBE, TN54TEM28,


TN11LDMS, TN11LEM24, TN11TOM, TN52TOM,
TN11LOA, TN13LQM, TN12TQM
TN12LQMD, TN12LQMS

GE b TN11L4G, TN11LDGD, TN11TBE, TN11TDG,


TN11LDGS, TN12LDM, TN54TEM28, TN54THA,
TN11LDMD, TN11LDMS, TN54TOA, TN54TOG,
TN11LEM24, TN11LOA, TN11TOM, TN52TOM,
TN11LOG, TN12LOG, TN12TQM
TN11LOM, TN12LOM,
TN11LQG, TN13LQM,
TN12LQMD, TN12LQMS

10GE LAN c TN11LEM24, TN11LEX4 TN11TBE, TN11TDX, TN12TDX,


TN52TDX, TN53TDX,
TN54TEM28, TN55TOX,
TN11TQX, TN52TQX, TN53TQX,
TN55TQX

40GE N/A TN54TSXL

100GE TN12LSCd N/A

a: For FE service, the board supports the test frame function only when the serviec is mapping
with GFP-F.
b: For GE service, the board supports the test frame function only when the serviec is mapping
with GFP-T.
c: For 10GE LAN service, the board supports the test frame function only when Port Mapping
(WDM Interface) is set to MAC Transparent Mapping (10.7 G) or MAC Transparent
Mapping (10.7 G) support 1588.
d: The TN12LSC only support downward ethernet test frame.

When using the test frame function, take the following precautions:
l For OTU boards, except for TOM boards with tributary-line mode, test frames are
independent of electrical cross-connections. The receive board can receive test frames if
the transmission link between the transmit and receive boards is connective.
l When electrical cross-connections function from a tributary board to the WDM-side port
on an OTU board with the test frame, the OTU board processes GFP management frames.
Therefore, the tributary board cannot send or receive test frames.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 40


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 3 Fault Isolation Basics

3.1.9 RMON Performance Analysis


When Ethernet service is interrupted or degraded, remote monitoring (RMON) can be enabled
on the U2000along with Ethernet OAM, loopback, and ping functions to locate faults causing
service interruption or degradation.

RMON is used to transmit monitoring data between different network segments.

Various performance statistics and Ethernet port data need to be managed. RMON is a standard
monitoring specification that enables various network monitors and console systems to exchange
network-monitoring data and create port-statistics reports. To meet different network
requirements, RMON provides flexible detection modes and control mechanisms. RMON also
provides error diagnosis, planning, and performance event monitoring of the entire network.

Purposes of RMON performance management are as follows:


l To save all statistical data at the agent end even when NEs cannot be reached by the manager
l To save historical data for fault diagnosis
l To detect errors and report performance events to the U2000
l To provide detailed port data to the U2000
l To support multiple management stations

RMON features are implemented through the following modules:


l Ethernet statistics group: Queries the current real-time statistical data of Ethernet ports.
The following performance items are supported: different packet lengths, overall
performance, errors and collisions, and FC performance.
l Ethernet historical group: Queries statistical data on Ethernet ports over a historical
sampling period. The Ethernet history group supports the same performance items as the
Ethernet statistics group.

For details, refer to Performance Event List of RMON Function in the Alarms and Performance
Events Reference.

Application
This topic uses Ethernet convergence service as an example to describe how to locate data service
faults using the RMON function. As shown in Figure 3-13, there is an Ethernet service from
station A to station C. Ethernet boards 1, 2, and 3 are used on stations A, B, and D, respectively.
Figure 3-14 shows the service connections. The data service from Router 1 to Router 2 is
abnormal. In this example, the service from Router 1 to Router 2 is interrupted.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 41


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 3 Fault Isolation Basics

Figure 3-13 Ethernet convergence service


Router 1

iManager U2000
Ethernet
Board 1

A
Router 2 Router 3

B D
Ethernet Ethernet
Board 2 Board 3
C

Figure 3-14 Service Connections


Board 2
Router 2
VCTRUNK1 PORT1
Board 1
Router 1
PORT1 VCTRUNK1
Board 3
VCTRUNK2
Router 3
PORT1

VCTRUNK1

The RMON function is enabled to remotely monitor the service between stations C and A. To
check the service performance of the Ethernet service board at the transmit end (station A) you
can query the RMON performance of the corresponding Ethernet service board at station C.
When service is interrupted or signals are degraded, diagnose the fault as follows:
1. Set the RMON performance monitoring parameters for the Ethernet port, including the
monitored objects (ports on the Ethernet board) and the monitoring period of the
performance event.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 42


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 3 Fault Isolation Basics

2. View the historical group performance and statistical group performance of the Ethernet
ports, and query the performance of PORT1 on Ethernet boards 1, 2, and 3 for a specific
period.
3. Analyze RMON performance and check whether the data is normal.
l When the PAUSE frame is found in the statistics report of the monitored port, use the
ping function to check whether the fault is occurring on the link between Huawei
equipment and the interconnected data communication equipment.
l If the FCS error, collision, or fragment is found in the statistics report of the monitored
port, refer to the procedure of handling the "the RMON performance value is above the
upper limit performance"event.
l If the problem is related to the maximum transmission unit (MTU), for example, the
set maximum transmission unit (MTU) is smaller than expected, change it.
4. Check whether the sum of the traffic on PORT1 of Ethernet boards 2 and 3 is equal to the
traffic on PORT1 of Ethernet board 1.
l If the sum of the traffic on PORT1 of Ethernet boards 2 and 3 is equal to the traffic on
PORT1 of Ethernet board 1, the fault may be located at the interconnection point
between Huawei equipment and the customer equipment instead of on Ethernet boards
1, 2, and 3. Use the ping function to check the link between Huawei equipment and the
non-Huawei equipment.
l If the sum of the traffic on PORT1 of Ethernet boards 2 and 3 is not equal to the traffic
on PORT1 of Ethernet board 1, the fault is occurring on Ethernet board 1, 2, or 3. Check
whether the traffic on PORT1 on Ethernet board 1, 2, or 3 is equal to the sum of the
traffic on the corresponding VCTRUNK ports to locate the specific board where the
fault (congestion or signal degradation) is occurring.
5. Use the ping function to check whether the fault occurs on the link between Huawei
equipment and the interconnected data communication equipment.

For RMON configurations, refer to Configuring the RMON in the Supporting Tasks.

3.1.10 Port Mirroring Analysis


To monitor and analyze the Ethernet packets at a port, you can enable the port mirroring function
so that the received or transmitted packets on the port are duplicated to another Ethernet port to
which the Ethernet tester is connected. Then, you can monitor and analyze the packets.

Application
You can configure port mirroring to perform packet monitoring, daily maintenance and in-
service commissioning. Port mirroring has the following features:

l Port mirroring applies to online fault diagnosis. It mirrors the traffic from one or more ports
to another port, and then an analyzer is used for fault diagnosis.
l After port mirroring is used, traffic can be monitored in real time using an analyzer.

Port mirroring implements fault diagnosis by mirroring and analyzing port data and without
affecting services. This method, which takes a long time and demands high maintenance skills,
is generally used by experienced maintenance personnel.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 43


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 3 Fault Isolation Basics

Figure 3-15 Schematic diagram of Ethernet port mirroring

Ethernet processing unit

Mirroring Monitoring
port port

NE Duplication Ethernet tester

To monitor the data in different directions, port mirroring can be performed in the ingress
direction and in the egress direction.
l In the ingress direction
Also in the upstream direction. The equipment duplicates the packets received from the
mirroring port to the observing port, and then transmits the packets from the observing port
to the Ethernet tester.
l In the egress direction
Also in the downstream direction. The equipment duplicates the packets transmitted by the
mirroring port to the observing port, and then transmits the packets from the observing port
to the Ethernet tester.

NOTE

No services can be configured on an observing port.


For LEX4, LEM24, TBE, and TEM28 boards, you can set Mirrored Upstream Port or Mirrored
DownstreamPort, but cannot set them at the same time.
For packet boards such as EG16, WX2, and PND2, pay attention to the following points:
l The port where LAG protection groups are configured, can be configured as the source mirroring port but
cannot as an observing port.
l Setting tags on an observing port does not affect the receiving or transmission of mirroring packets.
l On the same physical port, you can configure upstream or downstream mirroring services.

For port mirroring configurations, see Configuring Port Mirroring in Supporting Tasks.

Availability
Table 3-3 lists the boards that support the port mirroring function.

Table 3-3 Boards that support the port mirroring for the OptiX OTN products

Board Type Equipment

TBE, L4G OptiX OSN 6800


OptiX OSN 3800

LEX4, LEM24 OptiX OSN 6800


OptiX OSN 8800

EG16, EX2, PND2 OptiX OSN 8800

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 44


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 3 Fault Isolation Basics

Related Information
For TBE, L4G, LEX4, LEM24, TEM28:

Table 3-4 Parameters

Field Value Description

Board For example: 5-L4G Selects a board for port mirroring.

Mirror Listener Port For example: PORT4 l After the mirroring function of the
port is configured, you can
monitor all the mirrored ports by
analyzing the packets at the
mirroring port only. As a result,
you can easily manage the ports.
l Mirrored Port indicates the port
that sends the packets copied from
Mirrored Upstream Port and
Mirrored Downstream Port.
l Mirrored Port cannot be set to a
port that carries any service.

Uplink Listened Port For example: PORT3 Sets the uplink listened port.
Mirrored Upstream Port can be a
PORT or a VCTRUNK. As a PORT,
the port copies the packets that it
receives; as a VCTRUNK, the port
copies the packets that it transmits.
Mirrored Port sends the packets
copied from Mirrored Upstream
Port.

Downlink Listened For example: PORT5 Sets the downlink listened port.
Port Mirrored Downstream Port can be
a PORT or a VCTRUNK. As a PORT,
the port copies the packets that it
transmits; as a VCTRUNK, the port
copies the packets that it receives.
Mirrored Port sends the packets
copied from Mirrored Downstream
Port.

Mirrored Upstream Port and Mirrored Downstream Port indicate the ports that copy
packets for Mirrored Port.
The transmit direction and receive direction mentioned in this section are related to the local
NE.

For EG16, EX2, PND2:

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 45


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 3 Fault Isolation Basics

Table 3-5 Parameters

Field Value Description

Mirror ID - Indicates the mirroring ID. The


system automatically assigns
mirroring IDs if the Automatically
Assign check box is selected.

Mirror Name - Indicates the name of a mirroring


service.

Direction INGRESS, EGRESS Indicates the direction for observing a


Default: INGRESS mirroring service.

Mirror Source Function For example: Shelf0 Indicates the source mirroring port.
Point (subrack)-2-54EX2-1
(PORT-1)

Mirror Observation For example: Shelf0 Indicates the observing port.


Point (subrack)-12-54EX2-1
(PORT-1)

3.1.11 One-Click Data Collection


One-click data collection is generally used in the event of equipment failure to collect the fault
and performance data of the equipment.

For any problems that prove complex for technicians to identify or resolve, contact Huawei
technical support and report the relevant equipment data. As shown in Table 3-6, the equipment
data to be collected includes the version information, electronic labels, basic configurations,
service configurations, current and historical alarm and performance event data, equipment
running logs, operation logs, security logs, and equipment running status.

Table 3-6 Description of fault data types

N Type Description
o.

1 NGWDM_ Collects the version information about NEs and boards, board dump
Board_Data information, service performance parameters and alarms, and chip
register information on boards.

2 40G_Modul Collects the performance status and key component parameters of the
e_Basic_Dat 40G optical and electrical modules installed on the boards on the current
a NE. Such data includes the module input and output optical power
specifications, service status, alarms, and key component parameters.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 46


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 3 Fault Isolation Basics

N Type Description
o.

3 NGWDM_ Collects the basic configuration information and electromechanical


Basic_Data information about the current NE. The basic configuration information
includes information about the active/standby boards, memory partition
and usage, file system, alarms, performance, and databases. The
electromechanical information includes the temperature alarm thresholds
of the system control boards and other boards, NE power consumption,
and NE power-saving status.

4 NGWDM_ Collects time and clock information to check the tracing status of the
Clock_Data current NE. Such information includes the IEEE 1588-related alarms,
external time sources, line time sources, line time source tracing jitter,
synchronization clock alarms, and lost external clock sources.

5 NGWDM_ Collects the communication information to check the communication


Communica status between NEs. Such information includes the NE IDs, NE routes,
tions_Data communication ports, and orderwire configurations.

6 NGWDM_ Collects the electromechanical and configuration information about the


Configure_ current NE. Such information includes the power supply and fan status,
Data interface board (EFI and ATE) status, and online status of the physical
boards.

7 NGWDM_E Collects the electrical-layer service configuration information about the


lectric_Data current NE. Such information includes the OTN overhead configurations
on service boards, management information about Ethernet ports, and
basic SDH configurations.

8 NGWDM_P Collects the cross-connection and protection configurations on the


rotected_Da current NE. Such configurations include the intra-board protection, inter-
ta board protection, client-side protection, optical line protection, and
electrical cross-connections.

9 NGWDM_L Collects the optical-layer configurations of the current NE. Such


ight_Data configurations include the logical fiber connections, optical cross-
connections, automatic level control (ALC), intelligent power adjustment
(IPA), automatic power equilibrium (APE), and windows media unit
(WMU).

1 NGWDM_P Collects the package loading data of the current NE. Such data includes
0 atch_Data the package loading alarms, status, logs, and patch loading and operating
data.

1 NGWDM_ Collects board black box data in the log files from the platform in file
1 BlackBox_ transfer protocol (SFTP) mode.
Data

1 OTN_Ason Collects data about ASON services, TE links, current alarms, historical
2 _Data alarms, and boards. Such data must be collected when an ODUk service
in the OTN ASON domain is faulty (for example, service rerouting fails
or service interruption alarms are generated), or an ODUk ASON link is
faulty.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 47


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 3 Fault Isolation Basics

N Type Description
o.

1 WDM_Aso Collects data about ASON services, TE links, current alarms, historical
3 n_Data alarms, and boards. Such data must be collected when an OCh service in
the WDM ASON domain is faulty (for example, service rerouting fails
or service interruption alarms are generated), or an OCh/OMS ASON
link is faulty.

1 SDH_Ason_ Collects data about ASON services, TE links, current alarms, historical
4 Data alarms, and boards. Such data must be collected when an SDH service in
the SDH ASON domain is faulty (for example, service rerouting fails or
service interruption alarms are generated), or an SDH ASON link is
faulty.

1 Collect_HIS Collects the alarms of the boards that support the HISI chip on the current
5 I_Chip_Dat NE.
a

1 SYS Collects information about management modules in one-click mode.


6

1 ALM Collects current and historical alarms in one-click mode.


7

1 ASON Collects information about ASON services in one-click mode.


8

1 TDM Collects data about the time division multiplexing (TDM) feature in one-
9 click mode.

2 WDM Collects data about the wavelength division multiplexing (WDM) feature
0 in one-click mode.

2 Data Collects information about service configuration modules in one-click


1 mode.

2 DBMS Collects data in the database in one-click mode.


2

2 PER Collects the current and historical performance data in one-click mode.
3

For data collection configurations, refer to Collecting Fault Data in the Supporting Tasks.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 48


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 4 Fault Location Using TP-Assist

4 Fault Location Using TP-Assist

About This Chapter

Fault location using the Transport Packet Assist (TP-Assist) simplifies operation and
maintenance (O&M) of packet services and improves O&M efficiency.

4.1 Fault Location Using IP Ping


You can use the cross-product IP ping function to quickly locate the fault that occurs on a hybrid
network containing transport network equipment and wireless network equipment.

4.2 E-LAN Service Loopback Detection


E-LAN services are broadcast on the related VLAN. If packet loopbacks occur, a broadcast
storm occurs. The E-LAN service loopback detection function is provided to detect and disable
the looped back service in time. The function helps prevent network congestion.

4.3 Performing Intelligent Service Fault Diagnosis for Ethernet Services


This section describes how to intelligently diagnose faults of Ethernet services based on service
paths.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 49


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 4 Fault Location Using TP-Assist

4.1 Fault Location Using IP Ping


You can use the cross-product IP ping function to quickly locate the fault that occurs on a hybrid
network containing transport network equipment and wireless network equipment.
The IP ping function is classified into near-end IP ping and far-end IP ping functions.
l Near-end IP ping
The wireless network equipment sends IP ping packets to a user-to-network interface (UNI)
on near-end transport network equipment. The transport network equipment responds to
the IP ping packets. The near-end IP ping function is generally used to check whether a
fault occurs on wireless network equipment. For example, the NodeB sends IP ping packets
to the UNI on NE1, as shown in Figure 4-1.
l Far-end IP ping
The wireless network equipment sends IP ping packets to a UNI on far-end transport
network equipment. The transport network equipment responds to the IP ping packets. The
far-end IP ping function is generally used to check whether a fault occurs on the transport
network. For example, the NodeB sends IP ping packets to the UNI port on NE2, as shown
in Figure 4-1.

Figure 4-1 Application of the IP ping functions


Far-end IP Ping

Near-end IP Ping

PSN

Router RNC
NodeB NE1 NE2
Near-end IP Ping

Far-end IP Ping

NOTE

Switches and routers support both strict mode and non-strict mode.
l In strict mode, switches and routers learn MAC addresses only upon receiving ARP responses.
l In non-strict mode, switches and routers learn MAC addresses upon receiving ARP request packets.
OptiX OSN equipment enabled with the IP ping function does not send ARP responses. If switches or
routers use the strict mode, IP ping will fail. Therefore, if the IP ping function is initiated by a switch or
router, use the non-restrict mode for the switch or router.
When the IP ping function is initiated at a PC that is directly connected to an NE, the packets sent by the
PC do not carry VLAN tags. To test a service with VLAN tags on the NE, set the tag attribute of the service
port to Access or Hybrid, and set the default VLAN ID of the port the same as the service VLAN ID.

The following describes how to configure the near-end IP ping function and far-end IP ping
function using a NodeB as the initiator of the IP ping function.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 50


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 4 Fault Location Using TP-Assist

Table 4-1 Mapping between port encapsulation modes and TPIDs in IP Ping packets

Encapsulation mode of the TPID in IP Ping Packets


Port

Null 0x8100

802.1Q 0x8100

QinQ 0x8100 and NE-level TPID (0x88a8 by default)

Near-End IP Ping
Step 1 Right-click NE1 in the Main Topology and choose NE Explorer from the shortcut menu.

Step 2 In the NE Explorer, click NE1 and choose Configuration > Packet Configuration > IP Ping
Test.

Step 3 Set Enable, VLAN ID, VLAN Priority, IP Address, IP Mask, Next Hop IP Address, and
Next Hop IP Mask for the target UNI.

NOTE

l The IP address of the UNI must be on the same network segment with the IP address of the NodeB, to
ensure that the near-end IP ping function can operate properly.
l The next hop IP address of the UNI is the IP address of the port connecting straightly with the NodeB.

Step 4 Click Apply.

----End

Far-End IP Ping
Step 1 Right-click NE1 in the Main Topology and choose NE Explorer from the shortcut menu.

Step 2 In the NE Explorer, click NE1 and choose Configuration > Packet Configuration > IP Ping
Test.

Step 3 Click Add and set the proxy IP address in the Proxy IP Address List area.

NOTE

In far-end IP ping tests, the configured proxy IP address ensures that IP ping packets are forwarded through
the proxy IP address.

Step 4 Click Apply.

Step 5 Set Enable, VLAN ID, VLAN Priority, IP Address, and IP Mask for a UNI of NE2 by
referring to the configuration steps applicable to NE1.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 51


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 4 Fault Location Using TP-Assist

NOTE

l The UNI IP address configured on NE2 must be the same as the proxy IP address configured on NE1.
l Before using the far-end IP ping function, ensure that the IP ping function has been enabled on both
the near-end port and far-end port.

----End

4.2 E-LAN Service Loopback Detection


E-LAN services are broadcast on the related VLAN. If packet loopbacks occur, a broadcast
storm occurs. The E-LAN service loopback detection function is provided to detect and disable
the looped back service in time. The function helps prevent network congestion.

Application Scenario
The E-LAN service loopback detection function applies to the following native Ethernet
services:

l IEEE 802.1d bridge-based E-LAN services


l IEEE 802.1q bridge-based E-LAN services
l IEEE 802.1ad bridge-based E-LAN services

Context
l As shown in Figure 4-2, E-LAN services are configured for the network. A UNI on NE1
initiates a loopback detection packet on the specified VLAN. The loopback detection packet
takes the same path as forwarded service packets.
l If the UNI receives a loopback detection packet whose destination MAC address is the
same as the source MAC address of the loopback detection packet sent by the UNI, a
loopback occurs. You can use the automatic disabling function to disable loopback services.
If the UNI does not receive a loopback detection packet within the specified duration, a
service loopback does not occur. The duration ranges from 3s to 10s and the default value
is 3s.
NOTE

The automatic disabling function disables only the looped back service. The UNI remains operational.
Therefore, disabling of the looped back service does not affect other services provided by the UNI.
l You can perform E-LAN service loopback detection for a maximum of 20 VLANs at a
time. You can perform two consecutive loopback detection operations for a VLAN at an
interval of 10 or more seconds.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 52


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 4 Fault Location Using TP-Assist

Figure 4-2 Application of E-LAN service loopback detection

Loopback
detection NE2
packet

NE1 PSN
NE3

Procedure
Step 1 Right-click NE1 in the Main Topology and choose NE Explorer from the shortcut menu.

Step 2 In the NE Explorer, click NE1 and choose Configuration > Packet Configuration > Ethernet
Service Management > E-LAN Service.

Step 3 Select the target E-LAN service and click the Loopback tab.

Step 4 Click UNI and set the port that initiates the loopback packet.
NOTE

This task uses a UNI as an example. The port varies according to services.

Step 5 Click Start.

Step 6 Set Vlans/CVLAN, Packet Timeout Period, and Disable Service When Loopback is
Detected. Then click Start.

Step 7 Check the loopback detection results.

Step 8 Optional: If the loopback services does occur, disable the looped back service in time to help
prevent network congestion.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 53


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 4 Fault Location Using TP-Assist

1. Choose NE Explore. In the NE Explorer choose Configuration > Packet


Configuration > Ethernet Service Management > E-LAN Service > Loopback, Click
Service Status List.
2. In the Service Status List box, choose the loopback service, set Service Status as
Disable,then click Enable.

----End

4.3 Performing Intelligent Service Fault Diagnosis for


Ethernet Services
This section describes how to intelligently diagnose faults of Ethernet services based on service
paths.

Prerequisites
You must be an NM user with NE administrator authority or higher.

End-to-end Ethernet services have been deployed.

Procedure
Step 1 Search for services.

Search for nodes in a VLAN 3


domain based on the port and
VLAN ID specified by a user.
Networking diagram of the domain
is displayed. The service path to which the specified port
2 and VLAN ID reside is displayed. 4

Step 2 Select a service to be diagnosed by specifying a service port.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 54


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 4 Fault Location Using TP-Assist

6 7

8
Detect a specified service.

Step 3 Diagnose the service and display the result.

Service information is
displayed.
9

Test result and progress are


displayed.
11
10

Test result about received packets


12 on ports is displayed.
13

Test result about transmitted


packets on ports is displayed.

Alarms that affect services are


displayed.

Step 4 Optional: You can also diagnose the PWE3 services as follow:
1. Choose Service > PWE3 Service > Manage PWE3 Service from the main menu.
2. In the dialog box that is displayed, set filter vaule. Click Filter to check PWE3 services.
3. Select the PWE3 service to be deployed, right-click the PWE3 services, and choose
Diagnose Service from the shortcut menu.

----End

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 55


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 5 Service Interruptions

5 Service Interruptions

About This Chapter

Many factors can cause service interruptions, such as bit errors, line interruption, decrease of
optical power, failed interconnection of equipment.

5.1 Interruption of All Services


This section introduces the analysis and handling of faults that result in service interruptions on
an NE.

5.2 Single-Channel Service Interruption on the Client Side


This section introduces the analysis and handling of faults that result in an interruption of one
client-side service on an NE.

5.3 Single-Channel Service Interruption on the WDM Side


This section introduces the analysis and handling of faults that result in an interruption of one
WDM-side service on an NE.

5.4 Service Interruption Caused by Human Misoperation


This section introduces the analysis and handling of faults that result in service interruptions on
an NE because of misoperations.

5.5 Service Interruptions Caused by Power Supply Failures


This topic describes the symptoms, impact, and possible causes of a service interruption caused
by a power supply failure. In addition, this topic describes the required tools, precautions, and
procedures for rectifying the fault.

5.6 Service Interruptions Due to Grounding Fault


This topic describes the symptoms, impact, and possible causes of a service interruption caused
by a grounding fault. In addition, this topic describes the required tools, precautions, and
procedures for rectifying the fault.

5.7 Service Interruptions Due to Unfavorable Environmental Conditions


This topic describes the symptoms, impact, and possible causes of a service interruption caused
by unfavorable environmental conditions. In addition, this topic describes the required tools,
precautions, and procedures for rectifying the fault.

5.8 Service Interruptions Due to Faulty Optical Fibers, Cables, or Connectors

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 56


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 5 Service Interruptions

This topic describes the symptoms, impact, and possible causes of a service interruption caused
by faulty fibers, cables, and or connectors. In addition, this topic describes the required tools,
precautions, and procedures for rectifying the fault.

5.9 Service Interruptions Due to Inconsistency of Board Types or Board Settings


This topic describes the symptoms, impact, and possible causes of a service interruption caused
by inconsistent board types or board settings when the board is replaced or added. In addition,
this topic describes the required tools, precautions, and procedures for rectifying the fault.

5.10 Service Interruptions on the VCTRUNK Port


This topic describes the symptoms, impact on the system, and possible causes of a service
interruption on the VCTRUNK port. In addition, this topic describes the required tools,
precautions, and procedures for rectifying the fault.

5.11 Service Interruptions on a Single Ethernet Port


This topic describes the symptoms, impact, and possible causes of a service interruption on a
single Ethernet port. In addition, this topic describes the required tools, precautions, and
procedures for rectifying the fault.

5.12 Service Interruptions on All Ethernet Ports


This topic describes the symptoms, impact, and possible causes of a service interruption on all
Ethernet ports of an Ethernet board. In addition, this topic describes the required tools,
precautions, and procedures for rectifying the fault.

5.13 Service Interruption Cases


This section lists cases related to Service Interruption.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 57


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 5 Service Interruptions

5.1 Interruption of All Services


This section introduces the analysis and handling of faults that result in service interruptions on
an NE.

Symptom
The OA (Station B) reports MUT_LOS alarm and all OTUs (Station B) report alarm R_LOS, ,
OTUk_LOF, OTUk_LOM alarm. All services are interrupted.

Networking configuration example is shown in Figure 5-1.

Figure 5-1 Networking configuration example of interruption of all services

Client OTU MUX OA FIU FIU OA DEMUX OTU Client


equipment equipment

DEMUX MUX

OSC OSC
Station A (OTM) Station B (OTM)

Impact on System
All services are interrupted.

Possible Causes
l Cause 1: The fiber is cut off.
l Cause 2: The optical power is abnormal.
l Cause 3: The optical amplifying board or the optical cable is faulty.
l Cause 4: The demultiplexing/multiplexing board at station A or B is faulty.
l Cause 5: The ring fiber adapter from the equipment to the ODF is damaged.

Tools, Equipment and Materials


U2000, Optical power meter, cassette cleaner, optical time-domain reflectometer, ring fiber
adapter

Procedure
l Cause 1: The system provides an OSC board, the optical amplifier board reports a
MUT_LOS alarm, and the optical supervisory board reports a OSC_LOS alarm. In this
case, suspect that the optical cable is cut off.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 58


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 5 Service Interruptions

1. Check the fiber jumper between the optical distribution frame (ODF) and WDM
equipment. If the fiber jumper is dirty, clean the fiber jumper; if it is faulty, replace
the fiber jumper.
2. If the fiber jumper is good, the optical cable is faulty. Clear the fault.
l Cause 2: The system provides an OSC board, the optical amplifier board reports a
MUT_LOS alarm, and the OSC board reports no OSC_LOS alarm. It indicates that the
optical power is abnormal.
1. Check the fiber jumper between the FIU and optical amplifier board. If the fiber jumper
is dirty, clean the fiber jumper.
2. Check the optical power change of the OSC. If the received optical power of the OSC
declines, check whether the attenuation of the optical cable is excessive. If yes, the
optical cable is faulty. Clear the fault.
3. If the optical cable between station A and B is good, query the optical power on the
U2000 at station A.
– If the input optical power of the optical amplifier board at station A is normal but
the output optical power is too low, replace the optical amplifier board at station
A.
– If the input optical power of the optical amplifier board at station A declines and
thus results in a low output optical power, check the upstream against the service
signal flow to locate the fault.
4. If the optical cables of the faulty station B are good, check whether the input optical
power of the optical amplifier board is normal. If yes, and the output optical power is
too low, replace the board.
l Cause 3: The system provides no OSC board, and the optical amplifier board reports a
MUT_LOS alarm. It indicates that the optical amplifying board or the optical cable is faulty.
1. Refer to Testing Optical Power by Using an Optical Power Meter. Check whether the
input optical power of the optical amplifier board at station B is normal. If yes, replace
the optical amplifier board at station B.
2. Refer to Testing Optical Power by Using an Optical Power Meter. Check whether the
input and output optical power of the optical amplifier board at station A is normal.
If the input optical power is normal but the output optical power declines, replace the
optical amplifier board at station A.
3. Check the fiber jumper between the ODF and WDM equipment. If the fiber jumper
is dirty, clean it; if it is faulty, replace it.
4. If the fiber jumper is good, the optical cable is faulty. Clear the fault.
l Cause 4: The MUX/DEMUX board at station A or B is faulty.
1. Query failure alarms of the MUX/DEMUX board at station A or B. If alarms are found,
replace the faulty board.
NOTE

Before you replace the M40V or D40V board, record the attenuation of the internal VOA of
each channel.
l Cause 5: The ring fiber adapter from the equipment to the ODF is damaged.
1. Check the ring fiber adapter between the ODF and WDM equipment. If the ring fiber
adapter is faulty, replace it.

----End

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 59


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 5 Service Interruptions

5.2 Single-Channel Service Interruption on the Client Side


This section introduces the analysis and handling of faults that result in an interruption of one
client-side service on an NE.

Symptom
The client-side RX interface of the OTU or tributary board reports alarms such as R_LOS,
R_LOF, OTUk_LOF at station A, and the client-side service is interrupted.

Networking configuration example is shown in Figure 5-2.

Figure 5-2 Networking configuration example of interruption of client-side services

Client Client
equipment OTU1 OTU1 equipment
MUX MUX
DEMUX DEMUX
OA OA
OTUn OTUn
Station A (OTM) Station B (OTM)

Impact on System
Single channel service is interrupted on the client side.

Possible Causes
l Cause 1: The output optical power of the client equipment that is interconnected to the
OTU board is abnormal.
l Cause 2: The fiber jumper between the OTU board and client equipment is faulty.
l Cause 3: The optical interfaces of the OTU board are dirty.
l Cause 4: The OTU board is faulty.
l Cause 5: The client equipment is faulty.
l Cause 6: The service type, wavelength and mode that configured on the board 1 are different
from those of the client equipment, and therefore services are interrupted.

Tools, Equipment and Materials


U2000, Optical power meter, Cassette cleaner, Optical fiber, Compressed gas

Procedure
l Cause 1: The output optical power of the client equipment that is interconnected to the
OTU board is abnormal.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 60


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 5 Service Interruptions

1. Refer to Testing Optical Power by Using an Optical Power Meter. Check whether the
output optical power of the client equipment is normal. If not, remove the fault of the
client equipment.
l Cause 2: The fiber jumper between the OTU board and client equipment is faulty.
1. If the output optical power of the client equipment is normal, check whether the input
optical power of the OTU board is normal. If not, clean the fiber jumper or replace
the fiber jumper.
l Cause 3: The optical interfaces of the OTU board are dirty.
1. If the optical interfaces of the OTU board are dirty, blow them with compressed gas
to clean them.
l Cause 4: The OTU board is faulty.
1. If the input optical power of the OTU board is normal, but the system reports an alarm
indicating that the optical power is abnormal or an alarm related to the optical module,
replace the client-side optical module on the OTU board.
2. If the fault still exists, replace the board.
l Cause 5: The client equipment is faulty.
1. Remove the fault of the client equipment.
l Cause 6: The service type, wavelength and mode that configured on the board 1 are different
from those of the client equipment, and therefore services are interrupted.
1. Query and set the client service type, wavelength and mode of the board 1. Make sure
that the service type, wavelength and mode of the board 1 board are the same as those
of the client equipment.
NOTE

For GE service, query and set the auto-negotiation modes of the boards. Make sure that the auto-
negotiation modes are the same.

----End

5.3 Single-Channel Service Interruption on the WDM Side


This section introduces the analysis and handling of faults that result in an interruption of one
WDM-side service on an NE.

Symptom
The WDM-side IN interface of the OTU, or tributary board and line board reports alarms such
as R_LOS, R_LOF, R_OOF and OTUk_LOF, and one channel of services is interrupted.

Networking configuration example is shown in Figure 5-3.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 61


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 5 Service Interruptions

Figure 5-3 Networking configuration example of interruption of one channel of services

Client Client
equipment OTU MUX OA OA DEMUX OTU equipment

DEMUX OA OA MUX

Station A Station B

Signal Flow

Impact on System
One channel of the WDM-side services of an NE is interrupted.

Possible Causes
l Cause 1: The OTU, or tributary board and line board in the opposite station (station A) is
faulty or the fiber jumper in the opposite station between the OTU board and MUX is dirty.
l Cause 2: The fiber jumper between the DEMUX and OTU, or tributary board and line board
is dirty or faulty.
l Cause 3: The loopback testing on the OTU, or tributary board and line board in the local
station and the optical power of OTU, or tributary board and line board is normal. In this
case, the FEC mode of the OTU, or tributary board and line board in the local station and
the opposite station might be inconsistent.
l Cause 4: The interconnection services type, wavelength and mode do not match.
l Cause 5: The OTU, or tributary board and line board at the local station (station B) is faulty.
l Cause 6: New service wavelengths are added to the network and the new service
wavelengths conflict with existing service wavelengths.

Tools, Equipment and Materials


U2000, Optical power meter, Cassette cleaner, Optical fiber, Compressed gas

Procedure
l Cause 1: The OTU, or tributary board and line board in the opposite station (station A) is
faulty or the fiber jumper in the opposite station between the OTU, or tributary board and
line board and MUX is dirty.
1. Check whether the input optical power on the client side and the output optical power
on the WDM side of the OTU, or tributary board and line board in the opposite station
are normal. If either optical power is abnormal, replace the corresponding board.
2. Check the input optical power of the optical amplifier board in the opposite station
(station A) to determine whether the fiber jumper between the OTU, or tributary board
and line board and MUX is dirty or faulty. If the fiber jumper is dirty, clean the fiber
jumper; if it is faulty, replace the fiber jumper.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 62


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 5 Service Interruptions

l Cause 2: The fiber jumper between the DEMUX and OTU, or tributary board and line board
is dirty or faulty.
1. Check whether the input optical power of the OTU, or tributary board and line board
is normal. If not, clean or replace the fiber jumper between the DEMUX and OTU, or
tributary board and line board.
l Cause 3: The loopback testing on the OTU, or tributary board and line board in the local
station and the optical power of OTU, or tributary board and line board is normal. In this
case, the FEC mode of the OTU, or tributary board and line board in the local station and
the opposite station might be inconsistent.
1. Refer to Querying the FEC Type to query the FEC states of the two interconnected
boards on the U2000. If the two modes differ, enable the FEC to ensure mode
matching.
l Cause 4: The interconnection services type, wavelength and mode do not match.
1. Check and ensure that the interconnection services type, wavelength and mode match.

NOTE

If the services on the WDM side do not match, there is only LOOP_ALM alarm reported in
the loopback operation, but there will be OTUk_LOF or R_LOF if the loopback operation is
canceled.
l Cause 5: The OTU, or tributary board and line board at station B is faulty.
1. Check whether the input optical power of the OTU, or tributary board and line board
is normal. If yes, the OTU, or tributary board and line board in the local station is
faulty. Replace the pluggable optical module or Replace the board.
l Cause 6: New service wavelengths are added to the network and the new service
wavelengths conflict with existing service wavelengths.
1. Check whether new services are added to the network. If new services are added,
check whether the new service wavelengths conflict with the existing service
wavelengths. Then, change the new service wavelengths so that they do not conflict
with the existing service wavelengths.

----End

5.4 Service Interruption Caused by Human Misoperation


This section introduces the analysis and handling of faults that result in service interruptions on
an NE because of misoperations.

Symptom
Human misoperation results in service interruptions.

Impact on System
The services carried on the OTU, or tributary board and line board are interrupted.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 63


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 5 Service Interruptions

Possible Causes
l Cause 1: When you disable the FEC function of an OTU, or tributary board and line board,
the equipment does not report any alarm, whereas the OTN services borne by the equipment
are interrupted.
l Cause 2: The FEC mode of any OTU, or tributary board and line board is manually changed,
causing the services to be interrupted.
l Cause 3: Hardware loopback or software loopback is set manually.

Tools, Equipment and Materials


U2000

Procedure
l Cause 1: When you disable the FEC function of an OTU, or tributary board and line board,
the equipment will not report any alarm, but the OTN services are interrupted.
1. Querying the FEC Mode, the equipment does not report any alarm when you enable
or disable the FEC function of the OTU, or tributary board and line board. The engineer
must pay attention to this during maintenance. Make sure to enable the FEC function
in application.
l Cause 2: The FEC mode of any OTU, or tributary board and line board is manually changed,
causing the services to be interrupted.
1. Check whether the FEC modes of the two interconnected boards are different.
l Cause 3: Hardware loopback or software loopback is set manually.
1. Check whether hardware loopback is set. If yes, release the hardware loopback.
2. On the U2000, check whether software loopback is set on the service channel
manually. If yes, release the software loopback.

----End

5.5 Service Interruptions Caused by Power Supply Failures


This topic describes the symptoms, impact, and possible causes of a service interruption caused
by a power supply failure. In addition, this topic describes the required tools, precautions, and
procedures for rectifying the fault.

Symptom
A large number of boards on the device of a certain site are reset simultaneously and the services
are interrupted.

Impact on System
Power supply failures may cause service interruptions.

Possible Causes
l Cause 1: The power supply of the board fails.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 64


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 5 Service Interruptions

l Cause 2: The power cable connectors are not connected properly.


l Cause 3: The external power supply is off or the power fluctuates abnormally.
l Cause 4: The power board of the device is faulty.
l Cause 5: The DC PDU is faulty.

Tools, Equipment, and Materials


U2000, Multimeter, Screwdriver

Procedure
l Cause 1: The power supply of the board is faulty. As a result, services are interrupted.
1. Query the current alarms on the U2000.

If... Then...

The NE reports the Proceed to the next step.


POWER_ABNORMAL,
POWER_FAIL alarm

The NE does not report the Check whether the fault is due to other
POWER_ABNORMAL, causes.
POWER_FAIL alarm

NOTE

Only SDH boards support POWER_ABNORMAL alarm.


2. Handle the POWER_ABNORMAL, POWER_FAIL alarm.
3. Check whether services recover. If the services do not recover, check whether the fault
is due to other causes.
l Cause 2: The power cable connectors are not connected properly. As a result, services are
interrupted.
1. Check the connections of the power cables on the external power supply module, DC
PDU, and power board of the device.

If... Then...

The power cable connectors are not Fasten the power cable connectors by
connected properly using a screwdriver or replace the power
cable connectors, and proceed to the
next step.

Power cable connectors are connected Check whether the fault is due to other
properly and the contact is good causes.

2. Check whether services recover. If the services do not recover, check whether the fault
is due to other causes.
l Cause 3: The external power supply is off or the power fluctuates abnormally. As a result,
services are interrupted.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 65


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 5 Service Interruptions

1. Switch off the DC PDU. Connect the positive pole of the multimeter to the NEG(-)
terminal of the DC PDU and the negative pole of the multimeter to the RTN(+)
terminal. Measure the voltage between the RTN(+) and NEG(-) terminals and check
whether the voltage meets the requirement. For details about voltage requirements,
refer to the Hardware Description.

If... Then...

The voltage between the RTN and -48V Proceed to the next step.
DC terminals does not meet the
requirements

The voltage between the RTN and -48V Check whether the fault is due to other
DC terminals meets the requirements causes.

2. Contact the power supply engineers to rectify the fault of the external power supply.
l Cause 4: The power board of the device is faulty. As a result, services are interrupted.
1. Replace the power board of the device.
2. Check whether services recover. If the services do not recover, check whether the fault
is due to other causes.
l Cause 5: The DC PDU is faulty. Abnormality of the power supply system causes the board
reset.
1. Replace the DC PDU.
2. Check whether services recover. If the services do not recover, check whether the fault
is due to other causes.

----End

5.6 Service Interruptions Due to Grounding Fault


This topic describes the symptoms, impact, and possible causes of a service interruption caused
by a grounding fault. In addition, this topic describes the required tools, precautions, and
procedures for rectifying the fault.

Symptom
A large number of boards on the device of a certain site are reset simultaneously and the services
are interrupted.

Impact on System
The grounding fault causes service interruptions.

Possible Causes
l Cause 1: The PGND and BGND grounding is poor, and the grounding resistance is higher
than 10 ohms or the potential difference between BGND and PGND is higher than 0.5 V.
l Cause 2: The PGND and BGND share the same grounding with the AC neutral wire.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 66


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 5 Service Interruptions

l Cause 3: The PGND of the interconnected device is not joint grounding.


NOTE

PGND means protection ground and BGND means power ground (also called working ground).

Tools, Equipment, and Materials


Multimeter, Screwdriver

NOTE

Use the multimeter to check whether the grounding resistance of the BGND and PGND meet the
requirements. Or, you can use a proper meter to check whether the waveform of the interconnected signal
is distorted.

Procedure
l Cause 1-3: The device grounding is improper. As a result, services are interrupted.
1. Check whether the ground bar in the telecommunications room is properly grounded.
If the grounding is proper, proceed to the next step; otherwise, ground the ground bar
properly.
2. Check whether the contact between the cabinet of the transmission device and the
ground bar in the telecommunications room is good. If the contact is good, proceed
to the next step; otherwise, ground the cabinet of the transmission device properly.
3. Check whether the contact between the cabinet and the front door/side cover of the
cabinet is good. If the contact is good, proceed to the next step; otherwise, ground the
front door/side cover of the cabinet properly.
4. Check whether the contact between the subrack and the cabinet is good. If the contact
is good, proceed to the next step; otherwise, ground the subrack properly.
5. Check whether the DDF and ODF are properly grounded. If the DDF and ODF are
properly grounded, proceed to the next step; otherwise, ground the DDF and ODF
properly.
6. Check whether the NMS device and various power consumption devices are properly
grounded. If these devices are properly grounded, proceed to the next step; otherwise,
ground these devices properly.
7. Check whether the grounding of the interconnection devices is joint grounding.

----End

5.7 Service Interruptions Due to Unfavorable


Environmental Conditions
This topic describes the symptoms, impact, and possible causes of a service interruption caused
by unfavorable environmental conditions. In addition, this topic describes the required tools,
precautions, and procedures for rectifying the fault.

Symptom
A large number of boards on the device of a certain site are reset simultaneously and the services
are interrupted.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 67


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 5 Service Interruptions

Impact on System
Unfavorable environmental conditions can cause a service interruption.

Possible Causes
l Cause 1: The temperature and humidity of the environment have caused equipment damage.
l Cause 2: A strong external interference source exists.
l Cause 3: Rodent animals damage equipment.

Tools, Equipment, and Materials


U2000, Thermometer, Humidiometer

Procedure
l Cause 1: The temperature and humidity of the environment have caused equipment damage.
As a result, services are interrupted.
1. Check whether the temperature and humidity of the environment meet the
requirements.

The detailed requirements are as follows:

– Temperature: 0°C to 45 °C (32°F to 113°F)


– Relative humidity: 10% to 90%

If... Then...

The temperature and humidity of the Check and adjust the air conditioner in
environment do not meet the the telecommunications room to make
requirements sure that the temperature and humidity
of the telecommunications room are in
the proper range. Check whether
services recover. If the services do not
recover, check whether the fault is due
to other causes.

The temperature and humidity of the Proceed to the next step.


environment meet the requirements

2. Check the current alarms on the U2000.

If... Then...

The NE reports the temperature Proceed to the next step.


threshold-crossing alarms
TEMP_OVER

The NE does not report the temperature Check whether the fault is due to other
threshold-crossing alarms causes.
TEMP_OVER

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 68


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 5 Service Interruptions

3. Handle the TEMP_OVER alarms.


4. Check whether services recover. If the services do not recover, check whether the fault
is due to other causes.
l Cause 2: A strong external interference source exists. As a result, services are interrupted.
1. Check whether there is a transformer, high-voltage transmission line, or high-current
device near the device.

If... Then...

There is an AC transformer within 20 m The interference should meet the


(65.6 ft.) of the device or high-voltage requirements of related standards. When
transmission line within 50 m (164.1 ft.) necessary, shielding or isolation
of the device measures should be taken to protect the
device. Check whether services recover.
If the services do not recover, proceed to
the next step.

There is no AC transformer within 20 m Proceed to the next step.


(65.6 ft.) of the device or high-voltage
transmission line within 50 m (164.1 ft.)
of the device

2. Check whether there is a high-power broadcasting transmitter near the device.

If... Then...

There is a high-power broadcasting The interference should meet the


transmitter within 500 m (1640.5 ft.) of requirements of related standards. When
the device necessary, shielding or isolation
measures should be taken to protect the
device. Check whether services recover.
If the services do not recover, proceed to
the next step.

There is no high-power broadcasting Proceed to the next step.


transmitter within 500 m (1640.5 ft.) of
the device

3. Check whether there is a mobile communication transmitter near the device.

If... Then...

There is a mobile communication The interference should meet the


transmitter near the device requirements of related standards. When
necessary, shielding or isolation
measures should be taken to protect the
device. Check whether services recover.
If the services do not recover, proceed to
the next step.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 69


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 5 Service Interruptions

If... Then...

There is no mobile communication Proceed to the next step.


transmitter near the device

4. Check whether a hand-held mobile communication device is in use near the device.

If... Then...

A hand-held mobile communication Switch off the hand-held mobile


device is in use near the device communication device. Check whether
services recover. If the services do not
recover, check whether the fault is due
to other causes.

No hand-held mobile communication Check whether the fault is due to other


device is in use near the device causes.

l Cause 3: Rodent animals damage equipment. As a result, services are interrupted.


1. Check whether there are rodent animals or their excrement inside the cabinet.

If... Then...

There are rodent animals or their Remove the rodent animals and their
excrement inside the cabinet excrement, replace any broken cable,
and install a rodent-proof mesh in the
cabinet. Check whether services
recover. If the services do not recover,
check whether the fault is due to other
causes.

There are no rodent animals or their Check whether the fault is due to other
excrement inside the cabinet causes.

----End

Related Information
Table 5-1 lists the requirement for the electromagnetic interference.

Table 5-1 Requirements of the electromagnetic interference

Electromagnetic Environment Electromagnetic


Environment Parameter

Low-frequency magnetic Frequency (Hz) 50 to 20000


field
Ampl. A/m (rms) 10 to 0.025

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 70


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 5 Service Interruptions

Electromagnetic Environment Electromagnetic


Environment Parameter

Amplitude-modulated Frequency (MHz) 0.009 to 1000


radio-frequency electrical
field Ampl. V/m (rms) 3

Impulse-modulated radio- Frequency (GHz) 1 to 20


frequency electrical field
Ampl. (V/m (peak)) 3

5.8 Service Interruptions Due to Faulty Optical Fibers,


Cables, or Connectors
This topic describes the symptoms, impact, and possible causes of a service interruption caused
by faulty fibers, cables, and or connectors. In addition, this topic describes the required tools,
precautions, and procedures for rectifying the fault.

Symptom
Services of a certain site are interrupted.

Impact on System
Faulty fibers, cables, or connectors can cause a service interruption.

Possible Causes
l Cause 1: The fibers are broken.
l Cause 2: Attenuation of the fibers and the fiber adapter is too high.
l Cause 3: The cable connectors are connected improperly or incorrectly, or are not
connected.

Tools, Equipment, and Materials


U2000, Optical time-domain reflectometer (OTDR), Optical power meter

Precautions

DANGER
l Avoid direct eye exposure to laser beams launched from the optical interface and the fiber
jumper connector.
l When removing fibers from the optical interface, make sure that the proper optical fiber is
removed; otherwise, it may cause service interruption to a larger area.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 71


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 5 Service Interruptions

Procedure
l Query alarms on the U2000 and clear the alarms according to the reported information.
1. Check the current alarms on the U2000.

If... Then...

The NE reports the R_LOS, R_LOF or Proceed to the next substep.


IN_PWR_LOW alarm

The NE does not report the R_LOS, Check whether the fault is due to other
R_LOF or IN_PWR_LOW alarm causes.

2. Handle the R_LOS, R_LOF or IN_PWR_LOW alarm.


3. Check whether services recover. If the services do not recover, check whether the fault
is due to other causes.
l Cause 1: The fibers are broken. As a result, services are interrupted.
1. Use the OTDR meter to test the fibers. According to the line attenuation curve
displayed by the meter, determine whether the fibers are broken and the detailed
position of the broken fibers. For how to use the OTDR meter, see the Operation
Guide of the OTDR meter.

If... Then...

The fibers are detected as broken Replace the fibers. Check whether
services recover. If the services do not
recover, check whether the fault is due
to other causes.

The fibers are not detected as broken Check whether the fault is due to other
causes.

l Cause 2: Attenuation of the fibers and the fiber adapter is too high. As a result, services are
interrupted.
1. Measure the transmit optical power of the board in the upstream station and the receive
optical power of the board in downstream station.

If... Then...

The transmit optical power of the board Replace the board.


in the upstream station is abnormal

The receive optical power of the board Proceed to the next substep.
in downstream station is abnormal

2. The difference between the transmit optical power of the board in the upstream station
and the receive optical power of the board in downstream station is the actual optical
power attenuation of the line. Compare the actual optical power attenuation of the line
with the designed value. If the actual attenuation is higher than the designed value,
locate and rectify the fault as follows.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 72


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 5 Service Interruptions

– Check whether the bending radius of the fibers is equal to or larger than
standard. If the bending radius of the fibers is less than standard, re-route the
fibers and make sure that the bending radius is equal to or larger than standard.
– Check whether there are many fiber connectors on the line and whether the
connectors are connected properly. If the connectors are connected improperly,
fasten the connectors with the optical modules.
– Check whether there are aerial fibers on the line, which is easy to be affected by
the weather. If there are aerial fibers on the line, provide necessary protection to
the aerial fibers.
– Check whether the fiber type and attenuation factor of the line are consistent with
the designed ones. If the fiber type and attenuation factor of the line are inconsistent
with the designed ones, replace the fiber.
– Check whether the fiber connectors of the line are dirty. For details, see the
information about checking the fiber connectors. Clean the fiber connectors.
3. Check whether services recover. If the services do not recover, check whether the fault
is due to other causes.
l Cause 3: The connection between the cable and the connector is loose, the cable connectors
are connected incorrectly, or the cable connectors are not connected. As a result, services
are interrupted.
1. Check the cables connection on the board in the downstream station.

If... Then...

The connection between the cable and Connect the cables properly. Check
the connector is loose whether services recover. If the services
do not recover, proceed to the next
substep.

The connection of the cable and the Proceed to the next substep.
connectors is good

2. Check whether the connections of the cables to the board in the downstream station
are correct.

If... Then...

Connections of the cables to the board in Connect the cables correctly. Check
the downstream station are incorrect whether services recover. If the services
do not recover, check whether the fault
is due to other causes.

Connections of the cables to the board in Check whether the fault is due to other
the downstream station are correct causes.

----End

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 73


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 5 Service Interruptions

Related Information

Table 5-2 Minimum bending radius for fibers

Fiber Type Minimum Bending Radius

G.652D 30 mm

Multi-mode fiber (A1B) 30 mm

G.657A2 10 mm

G.657B3 8 mm

Note: When fibers are connected to CRPC boards, the bending radius of the fibers must be
greater than or equal to 50 mm to ensure expected transmission performance.

5.9 Service Interruptions Due to Inconsistency of Board


Types or Board Settings
This topic describes the symptoms, impact, and possible causes of a service interruption caused
by inconsistent board types or board settings when the board is replaced or added. In addition,
this topic describes the required tools, precautions, and procedures for rectifying the fault.

Symptom
During a capacity expansion at a site, after a board is replaced or added, services are interrupted.

Impact on System
Inconsistency of board types causes service interruption.

Possible Causes
l Cause 1: The board types are inconsistent.

Tools, Equipment, and Materials


U2000

Procedure
l Cause 1: The board types are inconsistent. As a result, services are interrupted.
1. Check the current alarms on the U2000.

If... Then...

The NE reports the Proceed to the next step.


WRG_BD_TYPEalarm

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 74


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 5 Service Interruptions

If... Then...

The NE does not report the Check whether the fault is due to other
WRG_BD_TYPE alarm causes.

2. Handle the WRG_BD_TYPE alarm.

----End

Related Information
For any capacity expansion, the type of the board to be replaced or added must be consistent
with that of the original board. Especially, when replacing or adding the cross-connect boards
with the active and standby relation, make sure that the types of the active and standby boards
are the same.

NOTE

A list of compatible boards is available. For details, consult technical support engineers of Huawei.

5.10 Service Interruptions on the VCTRUNK Port


This topic describes the symptoms, impact on the system, and possible causes of a service
interruption on the VCTRUNK port. In addition, this topic describes the required tools,
precautions, and procedures for rectifying the fault.

Symptom
The receive end of the Ethernet board reports the ALM_GFP_dLFD or VCAT_LOA alarm,
indicating that the services are interrupted.

Impact on System
The services in the VCG that reports the alarms are interrupted.

Possible Causes
l Cause 1: The board fails to frame the GFP frame. As a result, services are interrupted.
l Cause 2: During the service data transmission, if the alignment delay of the virtual
concatenation is excessively long, a data frame cannot be formed. Accordingly, it causes
the packet loss and even service interruptions.
l Cause 3: The number of timeslots bound with the VCTRUNK at the downstream station
is inconsistent with the number of timeslots bound with the VCTRUNK at the upstream
station. As a result, services are interrupted.

Tools, Equipment, and Materials


U2000

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 75


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 5 Service Interruptions

Procedure
l Cause 1: The board fails to frame the GFP frame. As a result, services are interrupted
1. Query the current alarms on the U2000.

If... Then...

The NE reports the ALM_GFP_dLFD Proceed to the next substep.


alarm

The NE does not report the Check whether the fault is due to other
ALM_GFP_dLFD alarm causes.

2. Handle the ALM_GFP_dLFD alarm.


3. Check whether services recover. If the services do not recover, check whether the fault
is due to other causes.
l Cause 2: During the service data transmission, if the alignment delay of the virtual
concatenation is excessively long, a data frame cannot be formed. Accordingly, it causes
the packet loss and even service interruptions.
1. Query the current alarms on the U2000.

If... Then...

The NE reports the VCAT_LOA alarm Proceed to the next substep.

The NE does not report the Check whether the fault is due to other
VCAT_LOA alarm causes.

2. Handle the VCAT_LOA alarm.


3. Check whether services recover. If the services do not recover, check whether the fault
is due to other causes.
l Cause 3: The number of timeslots bound with the VCTRUNK at the downstream station
is inconsistent with the number of timeslots bound with the VCTRUNK at the upstream
station. As a result, services are interrupted.
1. On the U2000, query whether the LCAS function of the VCTRUNK port is enabled.

If... Then...

The LCAS function of the VCTRUNK Check whether the fault is due to other
port is enabled causes.

The LCAS function of the VCTRUNK Enable the LCAS function, and then
port is disabled check whether services recover. If the
services do not recover, check whether
the fault is due to other causes.

----End

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 76


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 5 Service Interruptions

5.11 Service Interruptions on a Single Ethernet Port


This topic describes the symptoms, impact, and possible causes of a service interruption on a
single Ethernet port. In addition, this topic describes the required tools, precautions, and
procedures for rectifying the fault.

Symptom
The service on a single Ethernet port of an Ethernet board is interrupted.

Impact on System
The service on a single Ethernet port is interrupted.

Possible Causes
l Cause 1: The configuration is incorrect.
l Cause 2: The fibers, network cables, or optical/electrical modules are faulty.
l Cause 3: The optical module does not match the optical port.
l Cause 4: The board port is faulty.

Tools, Equipment, and Materials


U2000

Procedure
l Cause 1: The configuration is incorrect. As a result, services are interrupted.
1. On the U2000, query the configuration on the board.

If... Then...

The configuration is incorrect Re-configure the service. Check


whether the service recovers. If the
service does not recover, proceed to the
next step.

The configuration is correct Proceed to the next step.

2. On the U2000, query whether the board port is configured with loopback.

If... Then...

The board port is configured with a Release the loopback. Check whether
loopback services recover. If the service does not
recover, proceed to the next step.

The board port is not configured with a Proceed to the next step.
loopback

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 77


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 5 Service Interruptions

3. On the U2000, query the attributes of the port.

If... Then...

The attributes of the port are incorrect Set the attributes of the port again,
including the tag attributes and whether
to enable the port. Check whether
services recover. If the services do not
recover, proceed to the next step.

The attributes of the port are correct Proceed to the next step.

4. On the U2000, check whether the working mode of the local port is consistent with
the working mode of the opposite port.

If... Then...

The working mode of the local port is Set the local port to the same working
inconsistent with that of the opposite mode as the opposite port. Check
port whether services recover. If the services
do not recover, check whether the fault
is due to other causes.

The working mode of the local port is Check whether the fault is due to other
consistent with that of the opposite port causes.

l Cause 2: The fibers, network cables, or optical/electrical modules are faulty. As a result,
services are interrupted.
1. Query the current alarms on the U2000.

If... Then...

The NE reports the LINK_ERR alarms Handle the LINK_ERR alarms. Check
whether services recover. If the services
do not recover, check whether the fault
is due to other causes.

The NE does not report the Check whether the fault is due to other
LINK_ERR alarms causes.

l Cause 3: The optical module does not match the optical port. As a result, services are
interrupted.
1. Query the current alarms on the U2000.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 78


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 5 Service Interruptions

If... Then...

The NE reports the Handle the LASER_MODULE_MIS-


LASER_MODULE_MISMATCH MATCH alarm.
alarm that indicates the mismatch of the Check whether services recover. If the
optical module services do not recover, check whether
the fault is due to other causes.

The NE does not report the Check whether the fault is due to other
LASER_MODULE_MISMATCH causes.
alarm that indicates the mismatch of the
optical module

l Cause 4: The board port is faulty. As a result, services are interrupted.


1. On the U2000, query the data of the Ethernet port on the Ethernet board by using the
remote monitoring (RMON) function.

If... Then...

The Ethernet port receives data, but does Replace the board. Check whether
not transmit data services recover. If the services do not
recover, check whether the fault is due
to other causes.

The Ethernet port receives and also Check whether the fault is due to other
transmits data causes.

----End

5.12 Service Interruptions on All Ethernet Ports


This topic describes the symptoms, impact, and possible causes of a service interruption on all
Ethernet ports of an Ethernet board. In addition, this topic describes the required tools,
precautions, and procedures for rectifying the fault.

Symptom
Services on all Ethernet ports of an Ethernet board are interrupted.

Impact on System
Services on all Ethernet ports are interrupted.

Possible Causes
l Cause 1: The board is offline.
l Cause 2: The configuration is incorrect.
l Cause 3: The ports of the board are faulty.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 79


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 5 Service Interruptions

Tools, Equipment, and Materials


U2000

Procedure
l Cause 1: The board is offline. As a result, services are interrupted.
1. Query the current alarms on the U2000.

If... Then...

The NE reports the BD_STATUS alarm Handle the BD_STATUS alarm. Check
whether services recover. If the services
do not recover, check whether the fault
is due to other causes.

The NE does not report the Check whether the fault is due to other
BD_STATUS alarm causes.

l Cause 2: The configuration is incorrect. As a result, services are interrupted.


1. On the U2000, query the configuration on the board.

If... Then...

The configuration is incorrect Re-configure the service. Check


whether services recover. If the service
does not recover, proceed to the next
step.

The configuration is correct Proceed to the next substep.

2. On the U2000, query whether the board port is configured with loopback.

If... Then...

The board port is configured with a Release the loopback. Check whether
loopback services recover. If the services do not
recover, proceed to the next step.

The board port is not configured with a Proceed to the next substep.
loopback

3. On the U2000, query the tag attributes of the port on the board.

If... Then...

The attributes of the port are incorrect Re-configure the attributes of the port,
such as the tag attributes and whether to
enable the port. Check whether services
recover. If the services do not recover,
proceed to the next substep.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 80


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 5 Service Interruptions

If... Then...

The attributes of the port are correct Proceed to the next substep.

4. On the U2000, check whether the working mode of the local port is consistent with
that of the opposite port.

If... Then...

The working mode of the local port is Set the local port to the same working
inconsistent with that of the opposite mode as the opposite port. Check
port whether services recover. If the services
do not recover, check whether the fault
is due to other causes.

The working mode of the local port is Check whether the fault is due to other
consistent with that of the opposite port causes.

l Cause 3: The ports of the board are faulty. As a result, services are interrupted.
1. On the U2000, query the data of the Ethernet ports by using the remote monitoring
(RMON) function.

If... Then...

The Ethernet ports receive data, but do Replace the board. Check whether
not transmit data services recover. If the services do not
recover, check whether the fault is due
to other causes.

The Ethernet ports receive and transmit Check whether the fault is due to other
data causes.

----End

5.13 Service Interruption Cases


This section lists cases related to Service Interruption.

NOTE

The following cases are related to the OptiX WDM product series.

Related Cases:
l MC-A1 The OTU_LOF Alarm is Reporting on the OTU at the Downstream Station
l MC-A4 The LOG Board Fails to Interwork With the FDG Board on the Client Sides. The
LOG board reports the R_LOS alarm on the client side. The FDG board reports the
LINK_STATUS alarm.
l MC-A8 The TN11OAU101 at the Transmit End Reports the MUT_LOS Alarm
l MC-A15 The Downstream Optical Amplifier Board Does Not Report MUT_LOS

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 81


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 5 Service Interruptions

l MC-A24 Low Optical Power on the Client Side of the OTU Board Leads to R_LOS Alarm
on the Board
l MC-A38 The Service Is Interrupted After the Protection Is Triggered
l MC-A45 During the deployment of Raman, the OPU board reports MUT_LOS
l MC-A52 Faults of End Face of the Fiber Connector Cannot Be Identified
l MC-A67 The GE Port on the Client Side Reports LINK_DOWN Alarm
l MC-A69 Shutdown of the RPC Laser Interrupts SDH-Layer Services in a DWDM Network
l MC-A114 LSX Board to which the 10GE LAN service is input reports the OTU_LOF
alarm due to inconsistent settings of the client-side mode
l MC-A117 Analysis and Handling of the ALM_DATA_RLOS and ALM_DATA_TLOS
Alarms Reported by the LBE
l MC-A118 An R_OOF Alarm Occurs When the LWC Interconnects with Third-Party
Equipment
l MC-A147 Services Are Unavailable Due to Inappropriate FEC Configuration on the 40G
Regeneration Boards, The System Reports An OTU_LOF Alarm
l MC-A188 The LDGF2 Board on the OptiX OSN 1800 Reports an OTU1_LOF Alarm
l MC-A200 Fiber Faults Result in Service Interruption, The TDX Board Reports
ODU1_PM_SSF Alarms
l MC-A206 Service Provisioning Fails on an LHP Network, The OTU2_LOF Alarm Is
Reported
l MC-A207 Inaccurate System Commissioning Causes Poor System Performance
l MC-A208 40G Service Provisioning Fails On a Network, The OTU_LOF And
DCM_INSUFF Alarms Are Reported
l MC-A211 OTUk_LOF Alarm Is Reported Due to Mismatched Mapping Modes of the LSX
Boards in the Case of 10GE LAN Services
l MC-A215 Incorrect Configuration of an Added Wavelength During Expansion Results in
Service Interruption of Another Normal Wavelength on an OTU Board, The Corresponding
OTU Board Reports OTU2_LOF Alarm
l MC-A219 EPL Service Is Interrupted Due to Mismatched Service Modes of the L4G
Boards, Sites Report The R_LOF, ODU5G_PM_AIS, OTU5G_LOF And LINK_ERR
Alarms
l MC-A227 Services Cannot Be Provisioned and The LWX2 Boards Report The R_LOC
Alarm Due to Incorrect Service Rates on LWX2 Board of OptiX OSN 1800
l MC-A234 Services Are Interrupted After an Upgrade due to Improper Database
Restoration Operations on the OptiX OSN 6800
l MC-A248 Network Services Are Interrupted After the Equipment Restarts Because the
Wavelengths Conflict After the Wavelength Configurations Are Modified

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 82


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 6 Transient Service Interruptions

6 Transient Service Interruptions

About This Chapter

Transient service interruption refers to the transient loss of signal at the millisecond level during
transmission.

6.1 Transient Interruption of Main Path Services


This section describes the transient interruption of services in the main optical path in terms of
the symptom, impact on the system, possible causes, and handling procedure, as well as the
necessary tools for fault handling and the precautions that should be taken during fault handling.

6.2 Transient Interruption of Single-Channel Services


This section describes the transient interruption of services in a single channel in terms of the
symptom, impact on the system, possible causes, and handling procedure, as well as the
necessary tools for fault handling and the precautions that should be taken during fault handling.

6.3 Transient Service Interruptions on a Single VCTRUNK Port


This topic describes the symptoms, impact, and possible causes of transient service interruptions
on a single VCTRUNK port. In addition, this topic describes the required tools, precautions, and
procedures for rectifying the fault.

6.4 Transient Service Interruptions on a Single Ethernet Port


This topic describes the symptoms, impact, and possible causes of transient service interruptions
on a single Ethernet port. In addition, this topic describes the required tools, precautions, and
procedures for rectifying the fault.

6.5 Transient Service Interruptions Due to Protocol Related Causes


This topic describes the symptoms, impact, and possible causes of transient service interruptions
due to protocol-related causes. In addition, this topic describes the required tools, precautions,
and procedures for rectifying the fault.

6.6 Transient Interruption Cases


This section lists cases related to Transient Interruption.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 83


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 6 Transient Service Interruptions

6.1 Transient Interruption of Main Path Services


This section describes the transient interruption of services in the main optical path in terms of
the symptom, impact on the system, possible causes, and handling procedure, as well as the
necessary tools for fault handling and the precautions that should be taken during fault handling.

Symptom
The OTU, or tributary board and line boards at station B may report the R_LOS, R_LOF,
OTUk_LOF, OTUk_LOM alarms on WDM side, and the alarms clear in a short time. All
services at station B are transiently interrupted.

Figure 6-1 shows an example of the network configuration in this case.

Figure 6-1 Example of the network configuration where all services are transiently interrupted

OTU MUX OA FIU FIU OA DEMUX OTU

Client Client
DEMUX MUX

OSC OSC
Station A (OTM) Station B (OTM)

Impact on System
Services are interrupted for milliseconds at station B.

Possible Causes
l Cause 1: The optical cable or fiber jumper in the multiplexing part is faulty.
l Cause 2: The equipment temperature is too high.
l Cause 3: The optical amplifier board is faulty.
l Cause 4: The FIU board is faulty.

Tools, Equipment and Materials


U2000, Cassette cleaner, Fiber jumpers

Procedure
l Cause 1: The optical cable or fiber jumper in the multiplexing part is faulty.
1. Check the fiber jumper in the multiplexing part. If it is loose, secure it; if it is dirty,
clean it; if it is faulty, replace the fiber jumper.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 84


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 6 Transient Service Interruptions

NOTE

The fiber jumpers in the multiplexing part are the fiber jumpers between:
l The equipment and ODF
l The OA and FIU
l The OA and MUX/DEMUX
l ODFs
l TDC and RDC of the OAU
2. Clear the fault of the optical cable.

If the fault is located in a section of optical line, check:

– Whether there are multiple fiber connectors in this section of optical line and
whether they are in good connection
– Whether there are aerial optical cables, which are liable to be damaged by the
weather
– Whether the type and attenuation coefficient of the optical fiber in this section is
correct
NOTE

The optical line fault is the usual cause which results in a transient interruption. Usually, the
transient interruption caused by a line fault affects all channels (including the OSC). The optical
line fault can be caused by a fiber cut or excessive attenuation.
However, the fiber cut cannot cause a transient interruption. When you have a line fault, find
the cause of excessive attenuation of the fiber and then remove the fault.
l Cause 2: The equipment temperature is too high.
1. Check whether the performance data of NE contain alarms such as TEMP_OVER. If
yes, the operating temperature of the board is too high. Check the environmental
conditions of the telecommunications room. Clean the air filter.
2. Check whether the fan tray assembly works normally. If not, replace the fan tray
assembly.
l Cause 3: The optical power fluctuates by the fault of the optical amplifier board.
1. In the case of the DWDM application, on the U2000, query the historical performance
data of optical power of OA board. Check the input and output optical power of the
optical amplifier board. If the input optical power is normal but the output optical
power does not reach the standard output optical power, replace the board.
l Cause 4: The FIU board is faulty.
1. In the case of the DWDM application, replace the FIU board. If the fault clears,
determine that the FIU board is faulty.

----End

6.2 Transient Interruption of Single-Channel Services


This section describes the transient interruption of services in a single channel in terms of the
symptom, impact on the system, possible causes, and handling procedure, as well as the
necessary tools for fault handling and the precautions that should be taken during fault handling.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 85


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 6 Transient Service Interruptions

Symptom
The IN interface on the WDM side of OTU1 at station C reports the R_LOS, R_LOF, R_OOF,
IN_PWR_LOW, and OTUk_LOF alarms, and the alarms clear in a short time. The services in
a single channel of station C are transiently interrupted.

Figure 6-2 shows an example of the network configuration in this case.

Figure 6-2 Example of the network configuration where the services in a single channel are
transiently interrupted

MUX MUX
OTU1 DEMUX DEMUX OTU1
OA
DEMUX OA
OTU2 OTU2
Client Client
Station A OTM Station C OTM

Impact on System
Services are interrupted for milliseconds on one channel at station C.

Possible Causes
l Cause 1: The services are configured with protection. When a fault in the working channel
triggers protection switching, the switching times out or occurs frequently.
l Cause 2: The fiber jumper between the MUX/DEMUX and OTU1 is faulty.
l Cause 3: The OTU1 board at station A is faulty.
l Cause 4: The equipment temperature is extremely high.

Tools, Equipment and Materials


U2000, Cassette cleaner, Fiber jumpers

Procedure
l Cause 1: The services are configured with protection. When a fault in the working channel
triggers protection switching, the switching times out or occurs frequently.
1. On the U2000, query the historical data of alarms. Check whether the protection
switching alarm exists (the equipment reports a PS alarm when protection switching
occurs). Handle the related protection switching alarm.
l Cause 2: The fiber jumper between the MUX/DEMUX and OTU1 is faulty.
1. Check the fiber jumper. If it is loose, secure it; if it is dirty, clean it; if it is faulty,
replace the fiber jumper.
l Cause 3: The OTU1 board at station A is faulty.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 86


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 6 Transient Service Interruptions

1. Query on the U2000 the historical performance data of the output optical power of
OTU1 at station A. If the data contains a performance event showing a drastic decline
of output optical power of OTU1 at station A, then OTU1 at station A is faulty. Replace
OTU1 at station A.
l Cause 4: The equipment temperature is extremely high. When the equipment temperature
is extremely high, the wavelength of the single channel deviates and the services are
interrupted.
1. On the U2000, query the historical data of alarms. If alarms such as TEMP_OVER
exists, determine that the operating temperature of the board is extremely high. Check
and lower the temperature in the telecommunications room. Clean the air filter.
2. Check whether the fan tray assembly works normally. If not, replace the fan tray
assembly.

----End

6.3 Transient Service Interruptions on a Single VCTRUNK


Port
This topic describes the symptoms, impact, and possible causes of transient service interruptions
on a single VCTRUNK port. In addition, this topic describes the required tools, precautions, and
procedures for rectifying the fault.

Symptom
l Symptom 1: The ALM_GFP_dLFD alarm jitter occurs on a single VCTRUNK port of an
Ethernet board, and services recover after a period of interruption.
l Symptom 2: The channel bound to a single VCTRUNK port on an Ethernet board
transiently reports the SDH-related alarms, such as the HP_UNEQ, HP_LOM, and
VCAT_LOA. Services are transiently interrupted.
l Symptom 3: The channel bound to a single VCTRUNK port on an Ethernet board
transiently reports the LCAS_TLCT and LCAS_TLCR bandwidth loss alarms. Services
are transiently interrupted.

Impact on System
The services in the VCG that reports the alarms are interrupted transiently.

Possible Causes
l For symptom 1, the possible causes are as follows:
– Cause 1: The fibers on the line side are faulty and recover within a short period.
– Cause 2: Bit errors exist on the line.
l For symptom 2, the possible causes are as follows:
– Cause 1: The configuration of the SDH service is changed manually.
– Cause 2: Channel binding is added or deleted on the opposite device manually.
– Cause 3: The VCTRUNK port uses multiple lines and the delay of a certain line is too
long.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 87


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 6 Transient Service Interruptions

l For symptom 3, the possible causes are as follows:


– Cause 1: The line is faulty and then recovers. As a result, the channel changes to the
WTR state.

Tools, Equipment, and Materials


U2000, Optical power meter

Procedure
Step 1 For symptom 1: The ALM_GFP_dLFD alarm jitter occurs on a single VCTRUNK port of an
Ethernet board, and services recover after a period of interruption.
1. Cause 1: The fibers on the line side are faulty and recover within a short period.
a. Query the historical line alarms on the U2000.

If... Then...

The line from the VCTRUNK port of the The line is faulty. Replace the optical
local device to the VCTRUNK port of module and the fibers. Check whether
the opposite device generates the services recover. If the services do not
R_LOS alarm recover, proceed to the next step.

The line from the VCTRUNK port of the Proceed to the next step.
local device to the VCTRUNK port of
the opposite device does not generate the
R_LOS alarm

b. Check whether the fibers or the line boards are removed and reinserted manually.

If... Then...

The fibers or the line boards are removed Make sure that the fibers or the line
and reinserted manually boards is not removed and reinserted
manually. Check whether services
recover. If the services do not recover,
proceed to the next step.

The fibers or the line boards are not Proceed to the next step.
removed and reinserted manually

c. According to the transmission distance, check whether the optical module type is
correct.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 88


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 6 Transient Service Interruptions

If... Then...

The optical module type is incorrect See the Hardware Description and
Parts Replacement to select a correct
optical module and replace the optical
module on the device with the correct
one. Check whether services recover. If
the services do not recover, proceed to
the next step.

The optical module type is correct Proceed to the next step.

d. Check whether the fibers are faulty.

If... Then...

The fibers are faulty Replace the faulty fibers. Check whether
services recover. If the services do not
recover, proceed to the next step.

The fibers are normal Check whether the fault is due to other
causes.

2. Cause 2: Bit errors exist on the line.


a. Query the historical alarms on the U2000.

If... Then...

The line from the VCTRUNK port of the See the Alarms and Performance Events
local device to the VCTRUNK port of Reference to handle the bit error alarm.
the opposite device generates the bit Check whether services recover. If the
error alarms B3_EXC, B3_SD, services do not recover, check whether
BIP_EXC, or BIP_SD the fault is due to other causes.

The line from the VCTRUNK port of the Check whether the fault is due to other
local device to the VCTRUNK port of causes.
the opposite device does not generate the
bit error alarms B3_EXC, B3_SD,
BIP_EXC, or BIP_SD

Step 2 For symptom 2: The channel bound to a single VCTRUNK port on an Ethernet board transiently
reports the SDH-related alarms, such as the HP_UNEQ, TU_LOP, HP_LOM, and VCAT_LOA.
Services are transiently interrupted.
1. Cause 1: The configuration of the SDH service is changed manually.
a. On the U2000, query the historical alarms of the VCTRUNK ports of the local and
the opposite devices.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 89


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 6 Transient Service Interruptions

If... Then...

The VCTRUNK ports of the local and Proceed to the next step.
the opposite devices generate the
alarms, such as the HP_UNEQ and
HP_LOM

Only the VCTRUNK port of the local Proceed to Cause 2.


device generates the alarms, such as the
HP_UNEQ and HP_LOM

b. Query the operation log on the U2000.

If... Then...

The SDH cross-connection has been Make sure that the SDH cross-
deleted or set up manually connection is not manually deleted and
set up again. Check whether services
recover. If the services do not recover,
check whether the fault is due to other
causes.

The SDH cross-connection has not been Check whether the fault is due to other
deleted or set up manually causes.

2. Cause 2: Channel binding is added or deleted on the opposite device manually.

a. On the U2000, query the historical alarms of the VCTRUNK ports of the local and
the opposite devices.

If... Then...

The VCTRUNK ports of the local and Proceed to Cause 1.


the opposite devices generate the
alarms, such as the HP_UNEQ and
HP_LOM

Only the VCTRUNK port of the local Proceed to the next step.
device generates the alarms, such as the
HP_UNEQ and HP_LOM

b. Query the operation log on the U2000.

If... Then...

Channel binding of the device has been Make sure that channel binding is not
added or deleted manually added or deleted on the opposite device
manually. Check whether services
recover. If the services do not recover,
check whether the fault is due to other
causes.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 90


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 6 Transient Service Interruptions

If... Then...

Channel binding of the device has not Check whether the fault is due to other
been added or deleted manually causes.

3. Cause 3: The VCTRUNK port uses multiple lines and the delay of a certain line is too long.
a. On the U2000, query the historical alarms of the VCTRUNK port of the local device.

If... Then...

The VCTRUNK port of the local device Proceed to the next step.
generates the VCAT_LOA alarm

The VCTRUNK port of the local device Check whether the fault is due to other
does not generate the VCAT_LOA causes.
alarm

b. Query the operation log on the U2000.

If... Then...

The SDH cross-connection and channel Make sure that the SDH cross-
binding have been deleted or set up connection and channel binding is not
manually deleted or set up manually. Check
whether services recover. If the services
do not recover, proceed to the next step.

The SDH cross-connection and channel Proceed to the next step.


binding have not been deleted or set up
manually

c. Query the historical line alarms on the U2000.

If... Then...

The line from the VCTRUNK port of the The line is faulty. Replace the optical
local device to the VCTRUNK port of module and the fibers. Check whether
the opposite device generates the services recover. If the services do not
R_LOS alarm recover, proceed to the next step.

The line from the VCTRUNK port of the Proceed to the next step.
local device to the VCTRUNK port of
the opposite device does not generate the
R_LOS alarm

d. Measure the hold-off time of each route.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 91


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 6 Transient Service Interruptions

If... Then...

The delay of each route is rather long Optimize the network to reduce the
delay of each route. Check whether
services recover. If the services do not
recover, check whether the fault is due
to other causes.

The delay of each route is normal Check whether the fault is due to other
causes.

Step 3 For symptom 3: The channel bound to a single VCTRUNK port on an Ethernet board transiently
reports the LCAS_TLCT and LCAS_TLCR bandwidth loss alarms. Services are transiently
interrupted.
1. Cause 1: The line is faulty and then recovers. As a result, the channel changes to the WTR
state.
a. On the U2000, query the historical alarms of the channel used by the VCTRUNK port
of the local device.

If... Then...

The LCAS_TLCT or LCAS_TLCR Proceed to the next step.


alarm is generated

The LCAS_TLCT or LCAS_TLCR Check whether the fault is due to other


alarm is not generated causes.

b. Query the historical line alarms on the U2000.

If... Then...

The line from the VCTRUNK port of the The line is faulty. Replace the optical
local device to the VCTRUNK port of module and the fibers. Check whether
the opposite device generates the services recover. If the services do not
R_LOS alarm recover, proceed to the next step.

The line from the VCTRUNK port of the Proceed to the next step.
local device to the VCTRUNK port of
the opposite device does not generate the
R_LOS alarm

c. Check the configuration of the LCAS parameters on the U2000.

If... Then...

The delay is set short It is recommended that you set the WTR
time to the default 300s and the delay to
the default 2s.

The delay are set properly Contact the Huawei engineers.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 92


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 6 Transient Service Interruptions

NOTE

l When the network is stable, the WTR time should not be set too long; otherwise, the
interruption time of the services after the restoration of the network is long. If the channel
does not need to enter the WTR time, the WTR time can be set to 0s.
l The delay should not be set too short; otherwise, when the network is unstable, the channel
changes to the WTR state frequently. As a result, services are interrupted. It is
recommended that you set the delay to 2s.

----End

Related Information
None

6.4 Transient Service Interruptions on a Single Ethernet Port


This topic describes the symptoms, impact, and possible causes of transient service interruptions
on a single Ethernet port. In addition, this topic describes the required tools, precautions, and
procedures for rectifying the fault.

Symptom
l Symptom 1: The LINK_ERR alarm jitter occurs on a single Ethernet port of an Ethernet
board that works in auto-negotiation mode, and services recover after a period of
interruption.
l Symptom 2: The ETH_CFM_LOC alarm jitter occurs on a single Ethernet port of an
Ethernet board, and services recover after a period of interruption.

Impact on System
Services on the Ethernet port are transiently interrupted.

Possible Causes
l For symptom 1, the possible causes are as follows:
– Cause 1: Optical fibers in the transmit direction of the local Ethernet port are faulty and
recover within a short period.
– Cause 2: The port interconnected to the Ethernet port switches between the auto-
negotiation mode and the fixed mode continuously.
– Cause 3: The board is faulty.
l For symptom 2, the possible causes are as follows:
– Cause 1: Optical fibers in the receive direction of the Ethernet port are faulty and recover
within a short period.
– Cause 2: The port interconnected to the Ethernet port is disabled or enabled within a
short period.
– Cause 3: The board is faulty.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 93


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 6 Transient Service Interruptions

Tools, Equipment, and Materials


U2000, Optical power meter

Procedure
l For symptom 1: The LINK_ERR alarm jitter occurs on a single Ethernet port of an Ethernet
board that works in auto-negotiation mode, and services recover after a period of
interruption.
1. Cause 1: Optical fibers in the transmit direction of thelocal Ethernet port are faulty
and recover within a short period.
a. Query the historical alarms of the port of the interconnected device on the
U2000.

If... Then...

The ETH_CFM_LOC alarm exists Indicates that the fibers in the transmit
direction of the local Ethernet port
have been broken. As a result, the
negotiation fails at the local end and
the LINK_ERR alarm is reported
accordingly. In this case, services are
transiently interrupted. Check
whether services recover. If the
services do not recover, proceed to the
next step.

The ETH_CFM_LOC alarm does not Proceed to the next step.


exist

b. Check whether the fibers or the line boards are removed and reinserted manually.
Query the historical alarms of the interconnected port of the device on the
U2000.

If... Then...

The fibers or the line boards are Make sure that the fibers or the line
removed and reinserted manually boards is not removed and reinserted
manually. Check whether services
recover. If the services do not recover,
proceed to the next step.

The fibers or the line boards are not Proceed to the next step.
removed and reinserted manually

c. According to the transmission distance, check whether the optical module types
are correct.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 94


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 6 Transient Service Interruptions

If... Then...

The optical module type is incorrect Select a correct optical module and
replace the optical module on the
device with the correct one. Check
whether services recover. If the
services do not recover, proceed to the
next step.

The optical module type is correct Proceed to the next step.

d. Check whether the fibers are faulty.

If... Then...

The fibers are faulty Replace the faulty fibers. Check


whether services recover. If the
services do not recover, proceed to the
next step.

The fibers are normal Check whether the fault is due to other
causes.

2. Cause 2: The port interconnected to the Ethernet port switches between the auto-
negotiation mode and the fixed mode continuously.
a. Query the operation log on the U2000.

If... Then...

The working mode of the Indicates that the service transient


interconnected port is changed interruption is caused by the
manually mismatch of the working mode. Make
sure that the operation mode of the
interconnected port is not changed
manually. Check whether services
recover. If the services do not recover,
check whether the fault is due to other
causes.

The working mode of the Check whether the fault is due to other
interconnected port is not changed causes.
manually

3. Cause 3: The board is faulty.


a. Replace the board.
b. Check whether services recover. If the services do not recover, check whether
the fault is due to other causes.
l For symptom 2: The ETH_CFM_LOC alarm jitter occurs on a single Ethernet port of the
Ethernet board, and the services recover after a period of interruption.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 95


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 6 Transient Service Interruptions

1. Cause 1: Optical fibers in the receive direction of the Ethernet port are faulty and
recover within a short period.
a. Query the historical alarms of the Ethernet port on the U2000.

If... Then...

The ETH_CFM_LOC alarm exists Indicates that the fibers in the receive
direction of the local Ethernet port
have been broken. Proceed to the next
step.

The ETH_CFM_LOC alarm does not Check whether the fault is due to other
exist causes.

b. Check whether the fibers or the line boards are removed and reinserted manually.
Query the historical alarms of the interconnected port of the device on the
U2000.

If... Then...

The fibers or the line boards are Make sure that the fibers or the line
removed and reinserted manually boards is not removed and reinserted
manually. Check whether services
recover. If the services do not recover,
proceed to the next step.

The fibers or the line boards are not Proceed to the next step.
removed and reinserted manually

c. According to the transmission distance, check whether the optical module types
are correct.

If... Then...

The optical module type is incorrect Select a correct optical module and
replace the optical module on the
device with the correct one. Check
whether services recover. If the
services do not recover, proceed to the
next step.

The optical module type is correct Proceed to the next step.

d. Check whether the fibers are faulty.

If... Then...

The fibers are faulty Replace the faulty fibers. Check


whether services recover. If the
services do not recover, proceed to the
next step.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 96


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 6 Transient Service Interruptions

If... Then...

The fibers are normal Check whether the fault is due to other
causes.

2. Cause 2: The port interconnected to the Ethernet port is disabled or enabled.


a. Query the operation log on the U2000.
If... Then...

The port interconnected to the Make sure that these ports are not
Ethernet port is disable or enabled enabled or disabled manually. Check
manually whether services recover. If the
services do not recover, check
whether the fault is due to other
causes.

The port interconnected to the Check whether the fault is due to other
Ethernet port is not disable or enabled causes.
manually

3. Cause 3: The board is faulty.


a. Replace the board.
b. Check whether services recover. If the services do not recover, check whether
the fault is due to other causes.
----End

Related Information
None

6.5 Transient Service Interruptions Due to Protocol Related


Causes
This topic describes the symptoms, impact, and possible causes of transient service interruptions
due to protocol-related causes. In addition, this topic describes the required tools, precautions,
and procedures for rectifying the fault.

Symptom
l Symptom 1: The Rapid Spanning Tree Protocol (RSTP) is enabled, but the status of the
protocol jitters and the services are transiently interrupted.
l Symptom 2: The Link Aggregation Control Protocol (LACP) is enabled, but the status of
the protocol jitters and the services are transiently interrupted.

Impact on System
The services are transiently interrupted.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 97


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 6 Transient Service Interruptions

Possible Causes
l For symptom 1, the possible causes are as follows:
– Cause 1: The network topology is changed and the RSTP protocol reconstructs the
network topology.
– Cause 2: The RSTP protocol is reset.
– Cause 3: The RSTP protocol packets are lost. As a result, the status of the protocol
jitters.
l For symptom 2, the possible causes are as follows:
– Cause 1: The NE reports the LAG_PORT_FAIL alarm and the services are transiently
interrupted.
– Cause 2: The LACP protocol is reset and the services are transiently interrupted.

Tools, Equipment, and Materials


U2000

Procedure
l For symptom 1: The RSTP protocol has been enabled, but the state of the protocol jitters
and the services are transiently interrupted.
1. Cause 1: The network topology is changed and the RSTP protocol reconstructs the
network topology.
a. Check whether the network topology is changed manually.

If... Then...

The network topology is changed The transient service interruption is


manually caused by the reconstruction of the
network topology implemented by
the RSTP protocol.

The network topology is not changed Proceed to the next step.


manually

b. Query the historical alarms on the U2000.

If... Then...

The service interruption alarms are After the link is faulty, the services
reported, such as the are transiently interrupted due to the
ETH_CFM_LOC and reconstruction of the network
LINK_ERRLINK-ERR alarms of the topology by the RSTP protocol.
Ethernet port and the UNEQ and
TU_LOP alarms on the SDH side

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 98


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 6 Transient Service Interruptions

If... Then...

The service interruption alarms are Check whether the fault is due to other
not reported, such as the causes.
ETH_CFM_LOC and
LINK_ERRLINK-ERR alarms of the
Ethernet port and the UNEQ and
TU_LOP alarms on the SDH side

2. Cause 2: The RSTP protocol is reset.


a. Query the operation log on the U2000.

If... Then...

Before the services are interrupted, The service interruption is caused by


the RSTP protocol has been reset or the reset of the RSTP protocol.
the board has been warm reset
manually

Before the services are interrupted, Check whether the fault is due to other
the RSTP protocol has not been reset causes.
or the board has been warm reset
manually

NOTE

After the board is warm reset, the RSTP protocol is restarted automatically.
3. Cause 3: The RSTP protocol packets are lost. As a result, the status of the protocol
jitters.
a. Check the traffic flow of the services, and then check whether the QoS function
is enabled according to the service configuration.

If... Then...

The QoS function is enabled and the The state jitter of the RSTP protocol
bandwidth is insufficient is caused by the loss of the RSTP
protocol packets.

The QoS is not enabled Check whether the fault is due to other
causes.

NOTE

Generally, the priority of the protocol packets is higher than the priority of the service
packets. If the QoS function is enabled, the priority of the service packets may be higher
than that of the protocol packets. In this case, the protocol packets may be discarded when
the bandwidth is insufficient.
l For symptom 2: The LACP protocol has been enabled, but the state of the protocol jitters
and the services are transiently interrupted.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 99


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 6 Transient Service Interruptions

1. Cause 1: The NE reports the LAG_PORT_FAIL alarm and the services are transiently
interrupted.

a. Check the current alarms on the U2000.

If... Then...

The NE reports the Handle the LAG_PORT_FAIL


LAG_PORT_FAIL alarm alarm. Check whether services
recover. If the services do not recover,
check whether the fault is due to other
causes.

The NE does not report the Check whether the fault is due to other
LAG_PORT_FAIL alarm causes.

2. Cause 2: The LACP protocol is reset and the services are transiently interrupted.

a. Query the operation log on the U2000.

If... Then...

Before the services are interrupted, The service interruption is caused by


the LACP protocol has been reset or the reset of the LACP protocol.
the board has been warm reset
manually

Before the services are interrupted, Check whether the fault is due to other
the LACP protocol has not been reset causes.
or the board has been warm reset
manually

NOTE

After the board is warm reset, the LACP protocol is restarted automatically.

----End

Related Information
None

6.6 Transient Interruption Cases


This section lists cases related to Transient Interruption.

NOTE

The following cases are related to the OptiX WDM product series.

Related Cases:

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 100


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 6 Transient Service Interruptions

l MC-A5 The LQG Board Reports the ALM_DATA_RLOS and ALM_DATA_TLOS


Alarms Transiently
l MC-A6 The LDG Board Keeps Reporting the ALM_DATA_TLOS and
ALM_DATA_RLOS Alarms Transiently
l MC-A36 The OTU in the OptiX BWS 1600G Reports IN_PWR_LOW Alarm

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 101


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 7 Optical Power Abnormity

7 Optical Power Abnormity

About This Chapter

Correct optical power is critical to the performance of the WDM system. If the input optical
power is too high or too low, the system will generate bit errors, or even interrupt services.

7.1 Abnormal Decline of the Optical Power


This section introduces the analysis and handling of the fault of drastic optical power decline.

7.2 Optical Power Abnormity Cases


This section lists cases related to Optical Power Abnormity.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 102


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 7 Optical Power Abnormity

7.1 Abnormal Decline of the Optical Power


This section introduces the analysis and handling of the fault of drastic optical power decline.

Symptom
The optical power of all OTUs in the local station sharply declines, and the optical signal-to-
noise ratio (OSNR) is lower than the OSNR tolerance of the OTU.

NOTE

Lower OSNR is caused by lower receiving optical power.

Networking configuration example is shown in Figure 7-1.

Figure 7-1 Networking configuration example of drastic decline of optical power

MUX OA FIU FIU OA DEMUX


To OTUs To OTUs
and client and client
equipment equipment
DEMUX MUX

Station A (OTM) Station B (OTM)

Impact on System
A large number of bit errors occur in the services and the services are transiently interrupted
frequently.

Possible Causes
l Cause 1: Any fiber jumper in the multiplexing part, or an optical cable degrades or is
physically damaged.
l Cause 2: The gain of the optical amplifier board in the upstream station or downstream
station declines.

Tools, Equipment and Materials


U2000, Optical power meter, Cassette cleaner, Ring fiber adapter

Procedure
l Cause 1: Any fiber jumper in the multiplexing part, or an optical cable degrades or is
physically damaged. Hence, the attenuation of the fiber increases and the received optical
power declines. As a result, the OSNR decreases.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 103


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 7 Optical Power Abnormity

1. Refer to Checking Fiber Jumpers by Using an Optical Power Meter. Check whether
the fiber jumper in the multiplexing part is faulty. If yes, replace the fiber jumper.
2. Remove the fault of the optical cable.
NOTE

The fiber jumpers in the multiplexing part are the fiber jumpers between:
l The equipment and ODF
l The OA and FIU
l The OA and MUX/DEMUX
l ODFs
l TDC and RDC of the OAU
l Cause 2: The gain of the optical amplifier board in the upstream station or downstream
station declines.
1. Check whether the gain of the optical amplifier board is abnormal. If yes, replace the
board.

----End

7.2 Optical Power Abnormity Cases


This section lists cases related to Optical Power Abnormity.

NOTE

The following cases are related to the OptiX WDM product series.

Related Cases:
l MD-A13 Bit Error Alarm (BEFFEC_EXC) Is Generated When Optical Power Gets Close
to the Threshold
l MC-A15 The Downstream Optical Amplifier Board Does Not Report MUT_LOS
l MC-A19 Use Power Monitoring To Process Problems on IN_PWR_LOW Alarm
l MC-A24 Low Optical Power on the Client Side of the OTU Board Leads to R_LOS Alarm
on the Board
l MC-A35 LWM Output Optical Power Is Unstable upon Forced Light Generation
l MC-A36 The OTU in the OptiX BWS 1600G Reports IN_PWR_LOW Alarm
l MC-A43 Too High Insertion Loss Between TDC and RDC of the E3OAUC01C
l MC-A45 During the deployment of Raman, the OPU board reports MUT_LOS
l MC-A52 Faults of End Face of the Fiber Connector Cannot Be Identified
l MC-A54 After the Lasers of the Raman Amplifier Are Disabled, the Optical Power Is
Abnormal
l MC-A62 The Minimum Optical Power of the SC2 Is Detected As -35 dBm
l MC-A66 The Received Optical Power of Downstream Stations Is not Flat and Some
Wavelengths Report the IN_PWR_LOW Alarm
l MC-A68 OTU Boards Report the IN_PWR_LOW Alarm Due to Wavelength Wander

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 104


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 7 Optical Power Abnormity

l MC-A90 The Incorrect Configuration of DWC Leads To Abnormal Optical Power and
Service Interruption, the WDM Side of the LBE Reports the IN_PWR_HIGH and
OTU_LOF Alarms Abruptly
l MC-A93 The Input Optical Power of the OTU Board Is Abnormal Due to a Fault of the
OPU Board
l MC-A101 The Wavelength Is Unstable or Changed Due to Optical Power Variation
l MC-A126 VA4 Board Reports R_LOS When the Receiver Power Is Below -23 dBm Even
If the Attenuation Is Decreased
l MC-A169 T2000 Reports Errors After the Standard Optical Power of Single Wavelength
Is Set to 7 dBm with ALC in Power Reference Mode
l MC-A174 Excessive PDL of Fiber Results in Slow Changes of the Received Optical Power
on an OTU Board
l MC-A182 Receive Optical Power Is Excessively Low Because of the End Face Problem
of the Fiber Jumper, the OAU Board Reports the MUT_LOS Alarm
l MC-A186 Inconsistency of Fiber Jumper Model and Fiber Connector Type of a Board
Causes Low Receive Optical Power, The Connected OAU Board At The Downstream
Station Reports MUT_LOS And R_LOS Alarms
l MC-A194 A Malfunctioning VA1 Board Causes Abnormal Optical Power, The NS2
Boards Reports IN_PWR_HIGH Alarms For Three Times
l MC-A196 The Insertion Loss of the FIU Board on OptiX OSN 6800 Is High Because the
Fiber and Fiber Connector Are Not Securely Connected
l MC-A198 Optical Power Can Be Detected at the IN and OUT Ports of the FIU Board
Because of Incorrect Connections of Fiber Patch Cords
l MC-A201 Frequent Fiber Cut Results in High Line Attenuation and the IN_PWR_LOW
Alarm Reported by the OTU Board
l MC-A203 System of Few Wavelengths Fails to Function
l MC-A239 Inappropriate Line Optical Power Adjustment Causes Deteriorated 40G
Performance
l MC-A242 Excessively High Incident Optical Power Results in Poor Performance of OTU
Boards
l MC-A244 The Insertion Loss of a Board Is Abnormal

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 105


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 8 Bit Errors

8 Bit Errors

About This Chapter

Bit errors refer to the errors that occur in transmitted bit stream. Bit errors are usually represented
by bit. In the SDH/SONET frame structure, bytes for monitoring bit errors include B1, B2, M1,
B3, and G1. However, in DWDM equipment, the OTU only provides non-intrusive monitoring
on B1. In the OTN frame structure, for the DWDM equipment, the OTU provides bit errors
monitoring on BIP8 bytes of SM, PM and TCM section.

8.1 Multi-Channel Bit Errors


This section introduces the analysis and handling of bit errors on multiple channels.

8.2 Single-Channel Bit Errors


This section introduces the analysis and handling of bit errors on one channel.

8.3 Bit Errors Due to Unfavorable Environmental Conditions


This topic describes the symptoms, impact, and possible causes of bit errors due to unfavorable
environmental conditions. In addition, this topic describes the required tools, precautions, and
procedures for rectifying the fault.

8.4 Bit Errors Due to Device Faults


This topic describes the symptoms, impact, and possible causes of bit errors due to device faults.
In addition, this topic describes the required tools, precautions, and procedures for rectifying the
fault.

8.5 Bit Error Cases


This section lists cases related to Bit Errors.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 106


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 8 Bit Errors

8.1 Multi-Channel Bit Errors


This section introduces the analysis and handling of bit errors on multiple channels.

Symptom
The optical power is normal but a large number of bit errors occur in services on multiple
channels.
The OTU, or tributary board and line board reports alarms such as OTUk_EXC, OTUk_DEG,
BEFFEC_EXC, B1_EXC, and B1_SD.
Networking configuration example is shown in Figure 8-1.

Figure 8-1 Networking configuration example of multi-channel bit errors

MUX OA FIU FIU OA DEMUX


To OTUs To OTUs
and client and client
equipment equipment
DEMUX MUX

Station A (OTM) Station B (OTM)

Impact on System
A large number of bit errors occur in services on multiple channels.

Possible Causes
l Cause 1: The attenuation of the optical cable or that of the fiber jumper in the multiplexing
part is too large.
l Cause 2: The equipment temperature is too high.
l Cause 3: The configuration of the dispersion compensation module (DCM) is improper.
l Cause 4: The optical power of the main path is faulty.
l Cause 5: The non-linearity of the optical fiber. Bit errors can be caused by the non-linearity
of the optical power but not often.
l Cause 6: The ring fiber adapter from the equipment to the ODF is damaged.
l Cause 7: The PMD exceeds the permitted range.

CAUTION
Multi-channel bit errors might be caused by the improper commissioning of the optical power
of the system, please check the optical power of system.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 107


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 8 Bit Errors

Tools, Equipment and Materials


U2000, Optical power meter, Cassette cleaner, Ring fiber adapter

Procedure
l Cause 1: The attenuation of the optical cable or that of the fiber jumper in the multiplexing
part is too large.
1. Refer to Checking Fiber Jumpers by Using an Optical Power Meter. Check whether
the fiber jumper in the multiplexing part is faulty. If yes, replace the fiber jumper.
2. Remove the fault of the optical cable.
NOTE

The fiber jumpers in the multiplexing part are the fiber jumpers between:
l The equipment and ODF
l The OA and FIU
l The OA and MUX/DEMUX
l ODFs
l TDC and RDC of the OAU
l Cause 2: The equipment temperature is too high.
1. Check whether the NE generates alarms such as TEMP_OVER. If yes, check the
environmental conditions of the telecommunications room. Clean the air filter.
l Cause 3: The configuration of the dispersion compensation module (DCM) is improper.
1. Check optical cable parameters such as the type and actual length of the optical cable.
Check the configuration of the DCM. After that, adjust the compensation distance of
the DCM to a value that matches the actual length of the optical cable.
l Cause 4: The optical power of the main-path is abnormal.
1. Check the input and output optical power of boards between MUX of the transfer
station and the DEMUX of the receive station. If the input optical power is normal
but the output optical power does not reach the standard output optical power, replace
the board.
l Cause 5: The non-linearity of the optical fiber. Bit errors can be caused by the non-linearity
of the optical power but not often.
1. Decrease or increase the optical power, to judge whether bit errors are caused by the
non-linearity of the optical fiber.
– Increasing optical power causes bit errors to increase.
– Decreasing optical power causes bit errors to decrease.
2. Increase the value of attenuator.

NOTE

Perform this procedure only after all other possible causes have been checked.
l Cause 6: The ring fiber adapter from the equipment to the ODF is damaged.
1. Check the ring fiber adapter between the ODF and WDM equipment. If the ring fiber
adapter is faulty, replace it.
l Cause 7: The PMD exceeds the permitted range.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 108


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 8 Bit Errors

1. Test the PMD of optical fibers in all spans to obtain the PMD values. Resplice the
splicing points of optical fibers with very large PMD and resplice the splicing points
with large attenuation. The attenuation of optical fibers will improve.
2. The PMD values of several spans are larger than the recommended value in the system
design. It is recommended that the fiber core be replaced to ensure that the total PMD
value of optical fibers between electrical regenerators of all channels is smaller than
the system design. In this manner, the system can work stably in the long run and the
system instability caused by large PMD can be resolved radically.
NOTE

The 40G services require a smaller PMD. When the accumulated PMD exceeds the threshold, a large
OSNR penalty will be introduced, deteriorating the system performance. In a short period of time, high
PMD leads to the generation of the BEFFEC_EXC alarm for some wavelengths and irregular changes to
the receive-end system performance.

----End

8.2 Single-Channel Bit Errors


This section introduces the analysis and handling of bit errors on one channel.

Symptom
The OTU, or tributary board and line board reports alarms such as OTUk_EXC, OTUk_DEG,
BEFFEC_EXC, B1_EXC, and B1_SD.

Networking configuration example is shown in Figure 8-2.

Figure 8-2 Networking configuration example of single-channel bit errors

Client Client
equipment OTU1 OTU1 equipment
MUX MUX
DEMUX DEMUX
OA OA
OTUn OTUn
Station A (OTM) Station B (OTM)

Impact on System
A large number of bit errors occur in one channel and the services are transiently interrupted.

Possible Causes
l Cause 1: The input optical power of the OTU, or tributary board and line board on the
WDM side is abnormal.
l Cause 2: The fiber jumper between the MUX and ODF is aged or bent. Hence, the
attenuation of the fiber jumper is too large.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 109


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 8 Bit Errors

l Cause 3: The equipment temperature is too high.


l Cause 4: The FEC modes of the two interconnected boards might be different.
l Cause 5: The optical interfaces of the OTU, or tributary board and line boards are dirty.
l Cause 6: The OTU, or tributary board and line board is faulty.

Tools, Equipment and Materials


U2000, Optical power meter, Cassette cleaner, Optical fiber, Compressed gas

Procedure
l Cause 1: The input optical power of the OTU, or tributary board and line board on the
WDM side is abnormal.
1. Checking fiber jumpers by using an optical power meter. Check whether the input
optical power of the OTU, or tributary board and line board is too low. If yes, clean
the fiber.
l Cause 2: The fiber jumper between the OTU, or tributary board and line board, MUX/
DMUX or the fiber jumper between the MUX and ODF is aged or bent. Hence, the
attenuation of the fiber jumper is too large.
1. Replace the fiber jumper.
l Cause 3: The equipment temperature is too high.
1. Check whether the NE contains alarms such as TEMP_OVER. If yes, the operating
temperature of the board is too high. Check the environmental conditions of the
telecommunications room. Clean the air filter.
l Cause 4: The FEC modes of the two interconnected boards might be different.
1. Query the FEC mode of the two interconnected boards on the U2000. If the two modes
differ, re-set the FEC Working state and ensure state matching.
l Cause 5: The optical interfaces of the OTU, or tributary board and line board board are
dirty.
1. If the optical interfaces of the OTU, or tributary board and line board board are dirty,
blow them with compressed gas to clean them.
l Cause 6: The OTU, or tributary board and line board is faulty.
1. If the fiber jumper, input optical power and operating temperature are normal, and the
FEC modes of the OTU, or tributary board and line boards are matching, replace the
OTU, or tributary board and line board at receive end. If the bit errors disappear, it
indicates that the OTU, or tributary board and line board at receive end is faulty.
2. If the OTU, or tributary board and line board at receive end is normal, replace the
OTU, or tributary board and line board at transmit end. If the bit errors disappear, it
indicates that the OTU, or tributary board and line board at transmit end is faulty.

----End

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 110


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 8 Bit Errors

8.3 Bit Errors Due to Unfavorable Environmental


Conditions
This topic describes the symptoms, impact, and possible causes of bit errors due to unfavorable
environmental conditions. In addition, this topic describes the required tools, precautions, and
procedures for rectifying the fault.

Symptom
The telecommunications room of a certain site does not meet the environmental requirements
of a device. The device reports bit errors.

Impact on System
Unfavorable environmental conditions may cause bit errors in services.

Possible Causes
l Cause 1: The temperature and humidity of the environment do not meet the environmental
requirements of a device.
l Cause 2: A strong external interference source exists.
l Cause 3: The grounding is improper.
l Cause 4: The cable is faulty.

Tools, Equipment, and Materials


U2000, Thermometer, Humidiometer

Procedure
l Cause 1: The temperature and humidity of the environment do not meet the requirements
of long term running of the device. As a result, bit errors are generated.
1. Check whether the temperature and humidity of the environment meet the
requirements.

The detailed requirements are as follows:

– Temperature: 0°C to +45 °C (32°F to 113°F)


– Relative humidity: 10% to 90%

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 111


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 8 Bit Errors

If... Then...

The temperature and humidity of the Check and adjust the air conditioner in
environment do not meet the the telecommunications room to make
requirements sure that the temperature and humidity
of the telecommunications room are in
the proper range. Check whether the bit
errors persist. If the bit errors persist,
check whether the fault is due to other
causes.

The temperature and humidity of the Proceed to the next step.


environment meet the requirements

2. Check the current alarms on the U2000.

If... Then...

The NE reports the temperature Handle the temperature threshold-


threshold-crossing alarms crossin alarms. Check whether the bit
errors persist. If the bit errors persist,
check whether the fault is due to other
causes.

The NE does not report the temperature Check whether the fault is due to other
threshold-crossing alarms causes.

l Cause 2: A strong external interference source exists. As a result, bit errors are generated.
1. Check whether there is a transformer, high-voltage transmission line, or high-current
device near the device.

If... Then...

There is an AC transformer within 20 m The interference should meet the


(65.6 ft.) of the device or high-voltage requirements of related standards. When
transmission line within 50 m (164.1 ft.) necessary, shielding or isolation
of the device measures should be taken to protect the
device. Check whether the bit errors
persist. If the bit errors persist, check
whether the fault is due to other causes.

There is no AC transformer within 20 m Proceed to the next step.


(65.6 ft.) of the device or high-voltage
transmission line within 50 m (164.1 ft.)
of the device

2. Check whether there is a high-power broadcasting transmitter near the device.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 112


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 8 Bit Errors

If... Then...

There is a high-power broadcasting The interference should meet the


transmitter within 500 m (1640.5 ft.) of requirements of related standards. When
the device necessary, shielding or isolation
measures should be taken to protect the
device. Check whether the bit errors
persist. If the bit errors persist, check
whether the fault is due to other causes.

There is not a high-power broadcasting Proceed to the next step.


transmitter within 500 m (1640.5 ft.) of
the device

3. Check whether there is a mobile communication transmitter near the device.

If... Then...

There is a mobile communication The interference should meet the


transmitter near the device requirements of related standards. When
necessary, shielding or isolation
measures should be taken to protect the
device. Check whether the bit errors
persist. If the bit errors persist, check
whether the fault is due to other causes.

There is not a mobile communication Proceed to the next step.


transmitter near the device

4. Check whether a hand-held mobile communication device is in use near the device.

If... Then...

A hand-held mobile communication Switch off the hand-held mobile


device is in use near the device communication device. Check whether
the bit errors persist. If the bit errors
persist, check whether the fault is due to
other causes.

No hand-held mobile communication Check whether the fault is due to other


device is in use near the device causes.

l Cause 3: The grounding is improper.


1. Check whether the ground bar in the telecommunications room is properly connected.
If yes, proceed to the next step. If not, connect the ground bar properly.
2. Check whether the ground bar is properly connected with the cabinet of the
transmission equipment. If yes, proceed to the next step. If not, connect the ground
bar properly.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 113


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 8 Bit Errors

3. Check whether the front door and side panel of the cabinet are properly connected to
the cabinet. If yes, proceed to the next step. If not, connect the front door and side
panel to the cabinet properly.
4. Check whether the subrack is connected to the cabinet properly. If yes, proceed to the
next step. If not, connect the subrack to the cabinet properly.
5. Check whether the DDF and ODF are properly grounded. If yes, proceed to the next
step. If not, ground the DDF and ODF properly.
6. Check whether the NMS devices and all power-consumption devices are properly
grounded. If yes, proceed to the next step. If not, ground the devices properly.
7. Check whether the interconnected equipment is jointly grounded.
l Cause 4: The cable is faulty.
1. Test the transmit optical power of the board at the opposite end and the receive optical
power of the board at the local end.
2. The difference between the transmit optical power and the receive optical power is
the actual loss of optical power on the line. If the difference varies much with the
design value of the line attenuation, locate and clear the fault using the following steps.
– Check whether the bending radius of fibers is equal to or larger than standard. If
the bending radius is smaller than standard, route the fiber again.
– Check whether the multiple fiber connectors are correctly connected on the line.
If certain connectors are badly connected, push the connectors into the optical
module and ensure that the connectors are caught tightly.
– Check whether there are aerial optical fiber cables on the line. If yes, provide
external protection to the cables because they are easily affected by the weather.
– Check the optical fiber type and the attenuation coefficient of the line. Ensure that
the type and attenuation coefficient are consistent with the design document. If
they are inconsistent with the design document, replace the fiber.
– Check whether the fiber connectors are dirty. If the fiber connectors are dirty, clean
them immediately.
3. Check whether services recover. If not, proceed to other causes.

----End

Related Information
Table 8-1 lists the requirement for the electromagnetic interference.

Table 8-1 Requirements of the electromagnetic interference

Electromagnetic Environment Electromagnetic


Environment Parameter

Low-frequency magnetic Frequency (Hz) 50 to 20000


field
Ampl. A/m (rms) 10 to 0.025

Amplitude-modulated Frequency (MHz) 0.009 to 1000


radio-frequency electrical
field Ampl. V/m (rms) 3

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 114


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 8 Bit Errors

Electromagnetic Environment Electromagnetic


Environment Parameter

Impulse-modulated radio- Frequency (GHz) 1 to 20


frequency electrical field
Ampl. (V/m (peak)) 3

Table 8-2 Minimum bending radius for fibers

Fiber Type Minimum Bending Radius

G.652D 30 mm

Multi-mode fiber (A1B) 30 mm

G.657A2 10 mm

G.657B3 8 mm

Note: When fibers are connected to CRPC boards, the bending radius of the fibers must be
greater than or equal to 50 mm to ensure expected transmission performance.

8.4 Bit Errors Due to Device Faults


This topic describes the symptoms, impact, and possible causes of bit errors due to device faults.
In addition, this topic describes the required tools, precautions, and procedures for rectifying the
fault.

Symptom
Large amounts of regeneration section bit errors and multiplex section bit errors are generated
on the line.

Impact on System
Device faults cause bit errors.

Possible Causes
l Cause 1: The clock unit is faulty.
l Cause 2: The cross-connect unit is faulty.
l Cause 3: The line board is faulty.

Tools, Equipment, and Materials


U2000

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 115


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 8 Bit Errors

Procedure
Step 1 Cause 1: The clock unit is faulty. As a result, bit errors are generated.
1. Check the configuration of the clock on the NE.

If... Then...

The configuration of the clock on the NE Re-configure the clock on the NE. Check
is incorrect whether the bit errors persist. If the bit
errors persist, proceed to the next step.

The configuration of the clock on the NE Proceed to the next step.


is correct

2. Check the configuration of the clock on the network.

If... Then...

The configuration of the clock on the NE Re-configure the clock on the NE. Check
is incorrect whether the bit errors persist. If the bit
errors persist, proceed to the next step.

The configuration of the clock on the NE Proceed to the next step.


is correct

3. The clock unit is faulty.


a. Replace the faulty board.
b. Check whether services recover. If the services do not recover, check whether the fault
is due to other causes.

Step 2 Cause 2: The cross-connect unit is faulty. As a result, bit errors are generated.
1. Replace the faulty board.
2. Check whether services recover. If the services do not recover, check whether the fault is
due to other causes.

Step 3 Cause 3: The line board is faulty. As a result, bit errors are generated.
1. Replace the faulty board.
2. Check whether services recover. If the services do not recover, check whether the fault is
due to other causes.

----End

Related Information
After the device reports bit errors, you need to analyze the features of the bit errors to locate the
fault to a single site.

You can use the loopback method to locate faults to a board of a certain site, and then use the
replacement method to reset or replace the doubted board.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 116


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 8 Bit Errors

8.5 Bit Error Cases


This section lists cases related to Bit Errors.

NOTE

The following cases are related to the OptiX WDM product series.

Related Cases:
l MD-A13 Bit Error Alarm (BEFFEC_EXC) Is Generated When Optical Power Gets Close
to the Threshold
l MC-A16 The R_LOF and R_OOF Alarms Are Reported in the 24-Hour Bit Error Test
Because the Line Fiber Loss Is Very Large
l MC-A31 Wrong Calculation for Dispersion in One DWDM Project
l MC-A40 Over Compensation Causes Very High Bit Error Rate of the Short Waves After
Correction and the SM_BIP8_OVER and SM_BIP8_SD Alarms Are Transiently Reported
on the Board
l MC-A44 Bit Errors Generated in the Services, the Meter Reports the HP_RDI and ERROR
Alarms
l MC-A55 Improper DCM Distribution Causes Abnormal Service and Bit Error Alarm
BEFFEC_EXC Is Detected on the Newly-added LWFS Board
l MC-A63 MSBBE Bit Errors and OTU_LOF Alarm Occur in a DWDM 10G Network Due
to the Incorrect PMD
l MC-A88 The OTU Board Reports the B1_EXC Alarm, And There Are the Difference in
BER Reported by the OTU Board and Test Instrument
l MC-A118 An R_OOF Alarm Occurs When the LWC Interconnects with Third-Party
Equipment
l MC-A124 Determine the Factors That Affect the OSNR By Using the ANT30 Tester
l MC-A199 An OTU Board Reports Bit Errors Due to High Fiber Reflection
l MC-A202 Mixing Compensation of DCMs Results in the Inconsistency of Channel
Performance
l MC-A205 WDM-Side Pre-FEC BER Changes Drastically
l MC-A240 Isolated Bit Errors Are Found in the Long-Term Bit Error Testing of a 40G
System

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 117


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 9 A Fault in Pointer Justification

9 A Fault in Pointer Justification

About This Chapter

The scenario is applicable to only the OptiX OSN 8800. This topic describes the common
methods and measures that can be adopted to rectify a fault in pointer justification.

9.1 Pointer Justification Due to Incorrect Clock Configuration


This topic describes the symptoms, impact, and possible causes of the pointer justification due
to the incorrect clock configuration. In addition, this topic describes the required tools,
precautions, and procedures for rectifying the fault.

9.2 Pointer Justification Due to Over-High Temperature and Humidity


This topic describes the symptoms, impact, and possible causes of the pointer justification due
to over-high temperature and humidity. In addition, this topic describes the required tools,
precautions, and procedures for rectifying the fault.

9.3 Pointer Justification Due to Over-Low Precision of the External Clock Source
This topic describes the symptoms, impact, and possible causes of the pointer justification due
to the over-low clock source precision. In addition, this topic describes the required tools,
precautions, and procedures for rectifying the fault.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 118


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 9 A Fault in Pointer Justification

9.1 Pointer Justification Due to Incorrect Clock


Configuration
This topic describes the symptoms, impact, and possible causes of the pointer justification due
to the incorrect clock configuration. In addition, this topic describes the required tools,
precautions, and procedures for rectifying the fault.

Symptom
After the fibers of a network are broken, services of respective NEs are switched normally. The
AU and TU pointer justifications, however, occur on the line, and this is accompanied by bit
errors. The pointer justification frequency and the bit error rate increase with time.

Impact on System
Incorrect clock configuration causes pointer justification.

Possible Causes
l Cause 1: The clock configuration is incorrect.

Tools, Equipment, and Materials


U2000

Procedure
Step 1 Cause 1: Incorrect clock configuration causes pointer justification.
1. Check whether the transmit network traces two or more clock sources and whether the
clock sources trace each other, which results in the pointer justification. If the configuration
is correct, proceed to the next step.
2. Check whether the precision of the traced clock source is low or there are too many tracing
sites. If the configuration is correct, proceed to the next step.
NOTE

Provide clock compensation for long clock chain: the number of G.812 slave clocks in a transmission
link should not be more than 10; the number of G.813 clocks between two G.812 slave clocks should
not be more than 20; the number of G.813 clocks between the G.811 and G.812 slave clocks should
not be more than 20; the number of G.813 slave clocks should not be more than 60.
3. Check whether the clock protection subnet is not configured. After the main clock is lost
(or the fiber is broken), the non-protection of the clock causes the pointer justification. If
the configuration is correct, proceed to the next step.
4. Check whether the priority configuration of the clock source is correct. After the clock
protection switching, clocks begin to trace each other, which results in the pointer
justification. If the configuration is correct, proceed to the next step.
5. Check whether the internal clock source of the main clock NE is configured with the clock
source ID. After the clock source of a higher level is lost, the NE enters the free-run mode.
Other NEs will not synchronize to the central site. As a result, all NEs in this clock subnet

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 119


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 9 A Fault in Pointer Justification

are in the free-run mode. Thus the pointer justification occurs accordingly. If the
configuration is correct, proceed to the next step.
6. Check whether the SSM clock protection is enabled. When the clock quality deteriorates,
the protection switching cannot be executed, thus causing the pointer justification. If the
configuration is correct, proceed to the next step.
7. When the settings of the SSM mode on the network are different, the NE with the SSM
mode enabled cannot extract the line clock of other NEs. In this case, the NE enters the
free-run mode, thus causing the pointer justification.

----End

Related Information
None

9.2 Pointer Justification Due to Over-High Temperature and


Humidity
This topic describes the symptoms, impact, and possible causes of the pointer justification due
to over-high temperature and humidity. In addition, this topic describes the required tools,
precautions, and procedures for rectifying the fault.

Symptom
The telecommunications room of a certain site does not meet the requirements of device running,
and the AU and TU pointer justifications occur on the line.

Impact on System
Over-high temperature and humidity cause pointer justification.

Possible Causes
l Cause 1: The temperature and humidity of the environment do not meet the requirements
of long term running of the device.

Tools, Equipment, and Materials


U2000, Thermometer

Procedure
Step 1 Cause 1: The temperature and humidity of the environment do not meet the requirements of long
term running of the device. As a result, the pointer justification occurs.
1. Check whether the temperature and humidity of the environment meet the requirements.

The detailed requirements are as follows:

l Temperature: 0°C to +45 °C


l Relative humidity: 10% to 90%

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 120


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 9 A Fault in Pointer Justification

If... Then...

The temperature and humidity of the Check and adjust the air conditioner in the
environment do not meet the requirements telecommunications room to make sure
that the temperature and humidity of the
telecommunications room are in the proper
range. Check whether the pointer
justification persists. If the pointer
justification persists, check whether the
fault is due to other causes.

The temperature and humidity of the Proceed to the next step.


environment meet the requirements

2. Check the current alarms on the U2000.

If... Then...

The NE reports the temperature threshold- See the Alarms and Performance Events
crossing alarms TEMP_OVER Reference to handle the TEMP_OVER
alarms. Check whether the pointer
justification persists. If the pointer
justification persists, check whether the
fault is due to other causes.

The NE does not report the temperature Contact Huawei engineers.


threshold-crossing alarms TEMP_OVER

----End

Related Information
None

9.3 Pointer Justification Due to Over-Low Precision of the


External Clock Source
This topic describes the symptoms, impact, and possible causes of the pointer justification due
to the over-low clock source precision. In addition, this topic describes the required tools,
precautions, and procedures for rectifying the fault.

Symptom
After the network operates for a certain period, AU pointer justification and TU pointer
justification occur on a large scale on the network.

Impact on System
Over-low precision of the external clock source causes pointer justification.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 121


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 9 A Fault in Pointer Justification

Possible Causes
l Cause 1: The precision of the external clock source is over-low.

Tools, Equipment, and Materials


U2000

Procedure
Step 1 Cause 1: Over-low precision of the external clock source causes pointer justification.
1. Check the external clock source. It is found that the central site traces the clock source of
the switch or other clock sources with lower levels.
2. Change the clock tracing of the central site into internal free-run mode. Observe the clock
tracing for 24 hours and it is found that the fault of pointer justification is rectified.
3. Introduce the clock with a higher level to solve the problem thoroughly.

----End

Related Information
The SDH device has high requirements on the external clock source, the external clock source
must meet at least the requirements in the G.813 of SDH synchronization quality level. If the
precision of the external clock source is over-low or the clock quality deteriorates, pointer
justification occurs on the entire network.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 122


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 10 NE Offline Problems

10 NE Offline Problems

About This Chapter

When the communication between an NE and the U2000 fails, this NE cannot be reached by the
U2000.

10.1 Unreachability of One NE by the U2000


This section describes the possible causes and handling methods when one NE is unreachable
by the U2000.

10.2 Unreachability of All NEs in a Subnet by the U2000


This section describes the possible causes and handling methods when all NEs in a subnet are
unreachable by the U2000.

10.3 Frequent Unreachability of an NE by the U2000


This section describes the possible causes and handling methods when an NE is frequently
unreachable by the U2000.

10.4 The System Control Board Is Frequently Reset


This topic describes the possible causes and handling methods when the system control board
is frequently reset, which is caused by an ECC communication overload.

10.5 NE Offline Cases


This section lists cases related to NE Offline.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 123


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 10 NE Offline Problems

10.1 Unreachability of One NE by the U2000


This section describes the possible causes and handling methods when one NE is unreachable
by the U2000.

Symptom
One NE at a station is unreachable by the U2000, whereas the other NEs at this station are
reachable. The NE_COMMU_BREAK and NE_NOT_LOGIN alarms are reported on the
U2000.

The services on the NE are not interrupted.

Impact on System
The NE is unreachable by the U2000.

Possible Causes
l Cause 1: The network cable is disconnected or loose. For example, between the NE and
the HUB, between the GNE and the computer where the NMS server is installed, between
the GNE and non-GNEs, between the master and slave subrack of an NE.
l Cause 2: The NE Attributes of the NE is changed by mistake, such as the IP address, ID,
Gateway Type, User Name or Password
l Cause 3: The system control board on the NE is faulty.
l Cause 4: ECC communication is interrupted.
l Cause 5: The NE(non-first gateway NE) encounters system control board active/standby
switching.
l Cause 6: Coherent and non-coherent boards are interconnected.

Tools, Equipment and Materials


U2000, Web LCT, cable tester

Procedure
l Cause 1: The network cable is disconnected or loose. For example, between the NE and
the HUB, between the GNE and the computer where the NMS server is installed, between
the GNE and non-GNEs, between the master and slave subrack of an NE.
1. Determine if the network cable connection between the NE and the HUB is loose. If
the network cable connection is loose, secure it.
2. Check whether network cables are correctly connected between the GNE and the
computer where the NMS server is installed, between the GNE and non-GNEs,
between the master and slave subrack of an NE.
3. Use a cable tester to test the network cable and the network port. If the network cable
is faulty, replace it.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 124


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 10 NE Offline Problems

4. If the network cable is good, replace the interface, to which the network cable is
connected to the HUB. If the problem clears, the interface of the HUB may be
damaged.
5. If the network cable and the HUB is good, the interface board (such as EFI, EFI1,
EFI2, or AUX) is faulty, replace it.
l Cause 2: The NE Attributes of the NE is changed by mistake, such as the IP address, ID,
Gateway Type, User Name or Password.
1. Log in to the NE on the Web LCT and restore the initial IP address , ID, gateway type,
NE user or password of the NE according to the record.
l Cause 3: The system control board on the NE is faulty.
1. Log in to the NE on the Web LCT. If you fail to log in, the system control board is
faulty. Reset the board. If the problem persists, replacing the SCC board.
l Cause 4: ECC communication is interrupted.
1. A maximum of 16 users can simultaneously log in to an NE. If the number of users
reaches the upper limit, other users are not permitted to log in to the NE. For the method
of handling the interruption of ECC communication, see 17.1 ECC Communication
Interruption.
l Cause 5: The NE(non-first gateway NE) encounters system control board active/standby
switching.
1. No handling is required. Communication between the NE and U2000 will be restored
after the system control board active/standby switching.
l Cause 6: Coherent and non-coherent boards are interconnected.
1. The DCN bytes are defined differently for coherent and non-coherent services, so the
two types of services cannot be interconnected. If DCN channels are unavailable,
check whether the interconnected boards are coherent boards.

----End

10.2 Unreachability of All NEs in a Subnet by the U2000


This section describes the possible causes and handling methods when all NEs in a subnet are
unreachable by the U2000.

Symptom
All NEs in a subnet are unreachable by the U2000. The NE_COMMU_BREAK,
NE_NOT_LOGIN, and GNE_CONNECT_FAIL alarms are reported on the U2000.

Impact on System
The NEs are unreachable by the U2000 in the subnet.

Possible Causes
l Cause 1: The gateway NE is faulty.
l Cause 2: The network cable between the gateway NE and the HUB is disconnected or the
interface to the HUB is damaged.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 125


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 10 NE Offline Problems

l Cause 3: The IP address of the gateway NE is changed.


l Cause 4: The optical cable is faulty.
l Cause 5: The gateway NE encounters the system control board active/standby switching if
only one gateway NE is provided. The primary gateway NE encounters the system control
board active/standby switching if the primary and secondary gateway NEs are provided.

Tools, Equipment and Materials


U2000, Web LCT, Cable tester

Procedure
l Cause 1: The gateway NE is faulty. No backup gateway NE is configured for the subnet.
1. Determine if the power supply of the gateway NE is faulty. If yes, clear the fault of
the power supply.
2. Log in to the gateway NE on the Web LCT. If you fail to log in, the system control
board is faulty. replacing the SCC board.
l Cause 2: The network cable between the gateway NE and the HUB is faulty.
1. Determine if the network cable connection between the gateway NE and the HUB is
loose. If the network cable connection is loose, secure it.
2. Use a cable tester to test whether the network cable is good. If the network cable is
faulty, replace it.
3. If the network cable is good, replace the interface to the HUB. If the problem clears,
the HUB is damaged.
l Cause 3: The IP address of the gateway NE is changed.
1. Log in to the NE on the Web LCT and restore the initial IP address of the gateway
NE according to the record.
l Cause 4: The optical cable is faulty.
1. When the optical cable is cut off in the main optical path, the NEs in the subnet become
unreachable by the U2000. At the same time, a large number of alarms and
performance events occur in the main optical path. Hence, the normal operation of
services is affected. In this case, the optical cable is faulty.
l Cause 5: The gateway NE encounters the system control board active/standby switching if
only one gateway NE is provided. The primary gateway NE encounters system the control
board active/standby switching if the primary and secondary gateway NEs are provided.
1. No handling is required. Communication between the gateway NE and U2000 will be
restored after the system control board active/standby switching.

----End

Solution
To prevent all NEs in a subnet from becoming unreachable by the U2000 when the gateway NE
fails, configure a backup gateway NE for the NEs in the subnet.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 126


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 10 NE Offline Problems

10.3 Frequent Unreachability of an NE by the U2000


This section describes the possible causes and handling methods when an NE is frequently
unreachable by the U2000.

Symptom
An NE frequently becomes unreachable by the U2000. The NE_COMMU_BREAK,
NE_NOT_LOGIN, and GNE_CONNECT_FAIL alarms are reported on the U2000.

Impact on System
The NE is unreachable by the U2000.

Possible Causes
l Cause 1: The IP address of the NE conflicts with that of another NE.
l Cause 2: The ECC communication is intermittent.
l Cause 3: The load of ECC communication is extremely heavy.
l Cause 4: Another NE user with the same account as that of the current NE user in the
network has logged in to the NE.
l Cause 5: When ESC and OSC coexist between station A and B, the system preferentially
selects the ESC for communication. Bit errors leads to the unstable link and the ESC
communication fails. The switching between ESC and OSC occurs frequently because the
route of ESC is shorter than the route of OSC.
l Cause 6: The scale of the DCN subnetwork is extremely large.

Tools, Equipment and Materials


U2000

Procedure
l Cause 1: The IP address of the NE conflicts with that of another NE.
1. Determine if the IP addresses of the other devices (such as routers and servers) that
are newly configured in the network are the same as the IP address of the frequently
unreachable NE. Make sure that the IP address of the NE is unique.
l Cause 2: The ECC communication is intermittent.
1. For the method of handling the intermittent ECC communication, see 17.2
Intermittent ECC Communication.
l Cause 3: The load of ECC communication is extremely heavy.
1. For the method of handling the extremely heavy load of ECC communication, see
10.4 The System Control Board Is Frequently Reset.
l Cause 4: Another NE user with the same account as that of the current NE user in the
network has logged in to the NE. As a result, the NE becomes unreachable, and the
NE_NOT_LOGIN alarm is frequently reported on the U2000.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 127


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 10 NE Offline Problems

1. Plan and manage the users that can log in to the NE. For the information on NE user
management, see "Creating an NE User".
l Cause 5: When ESC and OSC coexist between station A and B, the system preferentially
selects the ESC for communication. Bit errors leads to the unstable link and the ESC
communication fails. The switching between ESC and OSC occurs frequently. The NEs at
station B are frequently unreachable by the U2000.
1. Disable the ESC communication between station A and B on the U2000, keep the
OSC communication.
– In NE Explorer, select the NE.
– Select Communication > DCC Management from Function Tree.
– In DCC Rate Configuration, select the board. Double click the parameter area of
Enabled/Disabled.
– Choose Disabled in the drop list.
– Click Apply.
2. Rectify the bit errors on the link, for details refer to 8 Bit Errors.
3. Enable the ESC communication between station A and B after the link becomes stable.
l Cause 6: The scale of the DCN subnetwork is extremely large.
1. Due to the inherent limitations of the DCC protocol, there is a limit on the number of
NEs (physical NEs with unique IDs) in a network even though there are sufficient
GNEs. If the number of NEs in a network exceeds the limit, the network must be
divided into multiple DCN subnets. The details refer to DCN Networking Capability.
NOTE

Each GNE can connect to a maximum of 50 non-GNEs, regardless of the protocol. If there are more
than 50 non-NEs for one GNE, another GNE must be added.

----End

10.4 The System Control Board Is Frequently Reset


This topic describes the possible causes and handling methods when the system control board
is frequently reset, which is caused by an ECC communication overload.

Symptom
Due to an ECC communication overload, the system control board is frequently reset and the
U2000 is frequently unable to reach the NE.

System Impact
The U2000 cannot reach the NE.

Possible Causes
l Cause 1: The ECC communication load is extremely heavy.

Tools, Equipment, and Materials


WEB LCT, cable tester

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 128


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 10 NE Offline Problems

Procedure
l Cause 1: The ECC communication load is extremely heavy. The scale of the ECC network
is extremely large and the inter-NE ECC communication load exceeds the maximum
processing capability of the NEs in the network. As a result, the system control board
frequently resets.
1. Check the scale of the ECC network. The number of NEs in the network cannot exceed
100. If there are more than 100 NEs in the network, divide the network into subnets.
Create a data communication network (DCN) management channel for each subnet
so the subnets are independent of each other.
2. Check the per-NE scale. When multiple pieces of equipment are connected by using
a HUB or are cascaded inter subracks, the extended ECC function of the network
interfaces of the equipment is used for communication. If not more than four pieces
of equipment are connected by using one HUB, the automatic extended ECC function
can be enabled. If more than four pieces of equipment are connected by using one
HUB, the manual extended ECC function is recommended for communication to
prevent ECC storms. For details on how to set the manual extended ECC function,
refer to "Configuring Extended ECC in Specified Mode".

----End

10.5 NE Offline Cases


This section lists cases related to NE Offline.

NOTE

The following cases are related to the OptiX WDM product series.

Related Cases:
l MC-A20 The T2000 Cannot Log in to the Remote GNE Connected by the Router
l MC-A22 NE ESC Communication Interrupted Because of the Closure of the OTU Laser
l MC-A64 Many BD_STATUS Alarms Occur Due to the ECC Storm
l MC-A83 The OptiX OSN 6800 Is Not Reachable When the NE IP Is Modified
l MC-A110 Question About the Extended ECC of WDM NEs
l MC-A113 Communication Between the WDM Equipment And the DCN Network Fails
After a Hub Is Connected To the DCN Network
l MC-A135 The Communication Between the NE and the T2000 Stops Because the NE
Software Cannot Switch the ECC from an OSC Board to an OTU Board Automatically
l MC-A181 An NE Is Frequently Unreachable to the NMS Due to Insufficient Processing
Capacity of a Router
l MC-A212 Secondary GNE Becomes Unavailable After a Change of the Maximum Number
of Route Hops of an NE
l MC-A213 Internal Communication of an NE Is Abnormal And Many Boards Report
Transiently BD_STATUS or COMMUN_FAIL Alarms Due to Conflicted Subrack IDs

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 129


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 11 Board Offline Problems

11 Board Offline Problems

About This Chapter

This section describes how to troubleshoot offline boards.

11.1 A Board Is Offline


This section describes the possible causes and handling methods of a board that is offline on the
U2000.

11.2 Multiple Boards Are Offline


This section describes the possible causes and handling methods when multiple boards are offline
on the U2000.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 130


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 11 Board Offline Problems

11.1 A Board Is Offline


This section describes the possible causes and handling methods of a board that is offline on the
U2000.

Fault Description
The board is offline and unavailable on the U2000 while the other boards are online.

The board reports the BD_STATUS alarm.

Impact on the System


The boards cannot be managed or communicated with.

Possible Causes
l Cause 1: The slot that houses the board is faulty.
l Cause 2: The board is faulty.

Tools, Equipment and Materials


U2000

Procedure
l Cause 1: The slot that houses the board is faulty.
1. Reseat the board to check whether the fault is removed.
2. If the fault persists, change the slot that houses the board.
l Cause 2: The board is faulty.
1. Replace the faulty board.

----End

11.2 Multiple Boards Are Offline


This section describes the possible causes and handling methods when multiple boards are offline
on the U2000.

Fault Description
Multiple or all boards are offline and unavailable on the U2000.

Multiple or all boards on the NE report the BD_STATUS alarms.

Impact on the System


The boards cannot be managed or communicated with.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 131


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 11 Board Offline Problems

Possible Causes
l Cause 1: The NE is unreachable.
l Cause 2: The network cable between the master and slave subracks is faulty.
l Cause 3: The backplane is faulty.

Tools, Equipment and Materials


U2000

Procedure
l Cause 1: The NE is unreachable.
1. If the U2000 reports NE_COMMU_BREAK or NE_NOT_LOGIN alarm, the NE is
likely unreachable. For details, see 10.1 Unreachability of One NE by the U2000.
l Cause 2: The network cable between the master and slave subracks is faulty.
1. Check whether the network cable between the master and slave subracks is damaged
or loose and remove the fault accordingly.
l Cause 3: The backplane is faulty.
1. Replace the subrack.

----End

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 132


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 12 Automatic Switching Failures

12 Automatic Switching Failures

About This Chapter

This topic describes how to handle services that fail to be automatically switched to the protection
channel when the working channel works abnormally.
For information on the protection schemes supported by the OptiX OSN 8800, OptiX OSN 6800,
and OptiX OSN 3800, see the Feature Description"Feature Overview".
12.1 Optical Line Protection
This topic describes how to resolve switchover issues when optical line protection fails to
automatically switch services to the protection channel.
12.2 Intra-Board 1+1 Protection
This topic describes how to resolve switchover issues when intra-board 1+1 protection fails to
automatically switch services to the protection channel.
12.3 Client-side 1+1 Protection
This topic describes how to resolve switchover issues when client 1+1 protection fails to
automatically switch services to the protection channel.
12.4 SW SNCP Protection
This section describes the handling methods for services with SW SNCP protection that fail to
automatically switch to the protection channel when the working channel works abnormally.
12.5 ODUk SNCP Protection
This topic describes how to resolve switchover issues when ODUk SNCP protection fails to
automatically switch services to the protection channel.
12.6 VLAN SNCP Protection
This section describes the handling methods for services with VLAN SNCP protection that fail
to automatically switch to the protection channel when the working channel works abnormally.
12.7 Tributary SNCP Protection
This topic describes how to resolve switchover issues when tributary SNCP protection fails to
automatically switch services to the protection channel.
12.8 Automatic Switching Failure of DBPS Protection
This section describes the handling methods for services with DBPS protection that fail to
automatically switch to the protection channel when the working channel works abnormally.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 133


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 12 Automatic Switching Failures

12.9 Board 1+1 Protection


This topic describes how to resolve switchover issues when board-level protection fails to
automatically switch services to the protection channel. Board-level protection refers to an
equipment-level protection scheme, which includes M:N protection for the cross-connect board
and 1+1 protection for the system control board.

12.10 Ethernet Ring Protection


This section describes the symptoms, impact, and possible causes of the protection switching
failure of Ethernet ring protection. In addition, this section describes the required tools,
precautions, and procedures for rectifying the fault.

12.11 MS SNCP Protection


This section describes the handling methods for services with MS SNCP protection that fail to
automatically switch to the protection channel when the working channel works abnormally.

12.12 DLAG Protection


This section describes the handling methods for services with DLAG protection that fail to
automatically switch to the protection channel when the working channel works abnormally.

12.13 Board-Level Protection


This section describes the handling methods for services with board-level protection that fail to
automatically switch to the protection channel when the working channel works abnormally.

12.14 ODUk SPRing Protection


This section describes the handling methods for services with ODUk SPRing protection that fail
to automatically switch to the protection channel when the working channel works abnormally.

12.15 Optical Wavelength Shared Protection


This section describes the handling methods for services with optical wavelength shared
protection that fail to automatically switch to the protection channel when the working channel
works abnormally.

12.16 Inter-Subrack Communication Protection


This topic describes how to resolve switchover issues when inter-subrack communication
protection fails to automatically switch services to the standby route when communication
between master and slave subracks is interrupted.

12.17 Linear Multiplex Section Protection


This section describes the handling methods for services with linear multiplex section protection
that fail to automatically switch to the protection channel when the working channel works
abnormally.

12.18 MSP Ring


This section describes the handling methods for services with MSP ring protection that fail to
automatically switch to the protection channel when the working channel works abnormally.

12.19 Transoceanic MSP Ring


This section describes the handling methods for services with transoceanic MSP ring protection
that fail to automatically switch to the protection channel when the working channel works
abnormally.

12.20 SNCP
This section describes the handling methods for services with SNCP protection that fail to
automatically switch to the protection channel when the working channel works abnormally.

12.21 SNCTP

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 134


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 12 Automatic Switching Failures

This section describes the handling methods for services with tributary SNCTP protection that
fail to automatically switch to the protection channel when the working channel works
abnormally.

12.22 Protection Cases


This section lists cases related to Protection.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 135


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 12 Automatic Switching Failures

12.1 Optical Line Protection


This topic describes how to resolve switchover issues when optical line protection fails to
automatically switch services to the protection channel.

Symptom
In normal operation of a network, the working channel has generated triggering conditions for
protection switching, but services cannot be automatically switched to the protection channel.
In this case, the services are interrupted.

l During normal network operations, conditions that trigger protection switching occur on
the working channel, but services do not automatically switch to the protection channel. In
this case, the services are interrupted.
l The optical line protection (OLP) board may report an OLP_PS or OLP_STA_INDI alarm.

Possible Causes
l Cause 1: The protection group parameters are incorrectly configured.
l Cause 2: Locked or forced switching commands have been manually issued.
l Cause 3: The optical fibers are improperly connected to the corresponding boards.
l Cause 4: The protection channel is working abnormally.

Troubleshooting Workflow
The following figure illustrates the optical line protection automatic switching failure
troubleshooting process.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 136


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 12 Automatic Switching Failures

Figure 12-1 Optical line protection automatic switching failure troubleshooting flowchart

Start

Do the parameter No Are the Yes


Modify the services switched
values comply with the parameters.
specificatsions? to the protection
channel?

Yes No

Yes Are the Yes


Is a locked or Clear the locked or
services switched
forced switching command forced switching
manually issued? to the protection
command.
channel?

No No

No Are the Yes


Are the fiber Modify the fiber services switched
connections on the connections. to the protection
board correct?
channel?

Yes No

Does the No Handle the faults on the Are the Yes


protection channel work working and services switched
normally? protection channels. to the protection
channel?

Yes No
Forcibly switch the End
services to
the protection channel.

Resolve the faults on


the working
channel.

Contact Huawei
engineers.

Tools, Equipment, and Materials


U2000

Procedure
l Cause 1: The protection group parameters are incorrectly configured.
1. In the NE Explorer, click the NE and choose Configuration > Port Protection from
the Function Tree.
2. In the Port Protection window, click Query to query all protection group parameter
values. In the Operation Result dialog box, click Close.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 137


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 12 Automatic Switching Failures

3. Compare these values against those specified in "Dependency and Limitation" and
"Parameters" to ensure that the parameters are configured correctly.
l Cause 2: Locked or forced switching commands have been manually issued.
1. In the Port Protection window, click Query to check the Switching Status parameter
of the protection group.
2. If Switching Status is Locked or Force to Working Channel, click Function and
choose Clear to clear the locked or forced switching commands.
l Cause 3: The optical fibers are improperly connected to the corresponding boards.
1. Check whether the optical fibers are properly connected to optical ports 1 (RI1/TO1)
and 2 (RI2/TO2) on the OLP boards located at the two stations next to the failure point.
For details, refer to "Principles".
l Cause 4: The protection channel is working abnormally.
1. In the Port Protection interface, click Query to check the Protection Channel
Status parameter of the protection group.
2. If Protection Channel Status is SF, check whether MUT_LOS and
POWER_DIFF_OVER alarms are being reported on the corresponding boards on the
protection channel, clear the alarms.
NOTE

If Protection Channel Status is SF, faults have occurred on both the working and protection
channels.

----End

Solution
If the preceding measures are not successful in switching the services to the protection channel,
and Protection Channel Status is displayed as Normal, you need to perform a forced switching
to switch the services to the protection channel, which eventually restores the services.

For more information regarding how to handle faults that occur in the working channel, clean
the Alarms and Performance Events in the NE. If necessary, contact Huawei engineers to resolve
the problem.

12.2 Intra-Board 1+1 Protection


This topic describes how to resolve switchover issues when intra-board 1+1 protection fails to
automatically switch services to the protection channel.

Symptom
In normal operation of a network, the working channel has generated triggering conditions for
protection switching, but services cannot be automatically switched to the protection channel.
In this case, the services are interrupted.

l When the OLP board initiates intra-board 1+1 protection, the OLP board may report an
INTRA_OTU_PS or INTRA_OTU_STA_INDI alarm.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 138


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 12 Automatic Switching Failures

l When the intra-board 1+1 protection is realized by dual fed and selective receiving OTU,
the OTU board may report the INTRA_OTU_PS alarm or INTRA_OTU_STA_INDI
alarm.

Possible Causes
l The protection group parameters are incorrectly configured.
l Locked or forced switching commands have been manually issued.
l The optical fibers are improperly connected to the corresponding boards.
l The protection channel is working abnormally.

Troubleshooting Flow
The following figure illustrates the intra-board 1+1 protection automatic switching failure
troubleshooting process.

Figure 12-2 Intra-board 1+1 protection automatic switching failure troubleshooting flowchart

Start

Do the parameter No Are the Yes


Modify the services switched
values comply with the parameters.
specificatsions? to the protection
channel?

Yes No

Yes Are the Yes


Is a locked or Clear the locked or
services switched
forced switching command forced switching
manually issued? to the protection
command.
channel?

No No

No Are the Yes


Are the fiber Modify the fiber services switched
connections on the connections. to the protection
board correct?
channel?

Yes No

Does the No Handle the faults on the Are the Yes


protection channel work working and services switched
normally? protection channels. to the protection
channel?

Yes No
Forcibly switch the End
services to
the protection channel.

Resolve the faults on


the working
channel.

Contact Huawei
engineers.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 139


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 12 Automatic Switching Failures

Tools, Equipment, and Materials


U2000

Procedure
l Cause 1: The protection group parameters are incorrectly configured.
1. In the NE Explorer, click the NE and choose Configuration > Port Protection from
the Function Tree.
2. In the Port Protection window, click Query to query all protection group parameter
values. In the Operation Result dialog box, click Close.
3. Compare these values against those specified in "Dependency and Limitation" and
"Parameters" to ensure that the parameters are configured correctly.
l Cause 2: Locked or forced switching commands have been manually issued.
1. In the Port Protection window, click Query to check the Switching Status parameter
of the protection group.
2. If Switching Status is Locked or Force to Working Channel, click Function and
choose Clear to clear the locked or forced switching commands.
l Cause 3: The optical fibers are improperly connected to the corresponding boards.
1. For 2.5G OTU boards, check whether the optical fibers are properly connected to the
optical ports for working and protection channels on the OTU, or tributary or line
boards located at the two stations next to the failure point. For 5G, 10G, and 40G
OTU, or tributary or line boards, check whether the optical fibers are properly
connected to the optical ports for working and protection channels on the OLP or DCP
boards located at the two stations next to the failure point. For details, refer to
"Principles".
NOTE

OptiX OSN 3800/6800 supports 5G OTU.


l Cause 4: The protection channel is working abnormally.
1. In the Port Protection window, click Query to query the Protection Channel
Status parameter of the protection group.
2. If Protection Channel Status is SD or SF, check whether R_LOS, R_LOF, R_LOC,
OTUk_LOF, OTUk_LOM, OTUk_AIS, ODUk_PM_AIS, ODUk_PM_OCI,
ODUk_PM_LCK, ODUk_LOFLOM, B1_EXC, ODUk_PM_DEG,
ODUk_PM_EXC, POWER_DIFF_OVER, IN_PWR_HIGH and IN_PWR_LOW
alarms are being reported on the corresponding boards on the protection channel.
NOTE
If Protection Channel Status is SD, SF, faults have occurred on both the working and protection
channels.

----End

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 140


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 12 Automatic Switching Failures

Solution
If the preceding measures are not successful in switching the services to the protection channel,
and Protection Channel Status is displayed as Normal, you need to perform a forced switching
to switch the services to the protection channel, which eventually restores the services.

For more information regarding how to handle faults that occur in the working channel, clean
the Alarms and Performance Events in the NE. If necessary, contact Huawei engineers to resolve
the problem.

12.3 Client-side 1+1 Protection


This topic describes how to resolve switchover issues when client 1+1 protection fails to
automatically switch services to the protection channel.

Symptom
In normal operation of a network, the working channel has generated triggering conditions for
protection switching, but services cannot be automatically switched to the protection channel.
In this case, the services are interrupted.

l The DCP or OLP board may report a CLIENT_PORT_PS or CLIENT_PORT_STA_INDI


alarm.
l When you use the SCS board to configure the protection, the OTU or tributary board may
report the CLIENT_PORT_PS alarm or CLIENT_PORT_STA_INDI alarm.

Possible Causes
l The protection group parameters are incorrectly configured.
l Locked or forced switching commands have been manually issued.
l The optical fibers are improperly connected to the corresponding boards.
l The protection channel is working abnormally.

Troubleshooting Workflow
The following figure illustrates the client 1+1 protection automatic switching failure
troubleshooting process.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 141


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 12 Automatic Switching Failures

Figure 12-3 Client 1+1 protection automatic switching failure troubleshooting flowchart

Start

Do the No Are the Yes


parameter values comply Modify the services switched to
with the specifications? parameters. the protection
channel?

Yes No

Yes Clear the locked or Are the Yes


Is a locked or services switched to
forced switching command forced switching
the protection
manually issued? command. channel?

No No

Are the fiber No Are the Yes


Modify the fiber services switched
connections on the
connections. to the protection
board correct?
channel?

Yes No

Is Protection No Handle the faults on Are the


Channel Status the working and services switched to
protection channels. the protection
Normal?
channel? Yes

Yes No

Forcibly switch the


services to the
protection channel.

Handle the faults on


the working channel.
End

Contact Huawei
engineers.

Tools, Equipment, and Materials


U2000

Procedure
l Cause 1: The protection group parameters are incorrectly configured.
1. In the NE Explorer, click the NE and choose Configuration > Port Protection from
the Function Tree.
2. In the Port Protection window, click Query to query all protection group parameter
values. In the Operation Result dialog box, click Close.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 142


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 12 Automatic Switching Failures

3. Compare these values against those specified in "Dependency and Limitation" and
"Parameters" to ensure that the parameters are configured properly.
l Cause 2: Locked or forced switching commands have been manually issued.
1. In the Port Protection window, click Query to check the Switching Status parameter
of the protection group.
2. If Switching Status is Locked or Force to Working Channel, click Function and
choose Clear to clear the locked or forced switching commands.
l Cause 3: The optical fibers are improperly connected to the corresponding boards.
1. Ensure that the optical fibers are properly connected to the optical ports for the working
and protection channels.
l Cause 4: The protection channel is working abnormally.
1. In the Port Protection window, click Query to query the Protection Channel
Status parameter of the protection group.
2. If Protection Channel Status is SD or SF, check whether the alarms listed in the
trigger conditions are being reported on the corresponding boards on the protection
channel, clear the alarms.
NOTE
If Protection Channel Status is SD or SF, faults have occurred on both the working and protection
channels.

----End

Solution
If the preceding measures are not successful in switching the services to the protection channel,
and Protection Channel Status is displayed as Normal, you need to perform a forced switching
to switch the services to the protection channel, which eventually restores the services.

For more information regarding how to handle faults that occur in the working channel, clean
the Alarms and Performance Events in the NE. If necessary, contact Huawei engineers to resolve
the problem.

12.4 SW SNCP Protection


This section describes the handling methods for services with SW SNCP protection that fail to
automatically switch to the protection channel when the working channel works abnormally.

Fault Description
In normal operation of a network, the working channel has generated triggering conditions for
protection switching, but services cannot be automatically switched to the protection channel.
In this case, the services are interrupted.

The OTU board may report SW_SNCP_STA_INDI alarm.

Possible Causes
The possible causes of the automatic switching failure are as follows:

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 143


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 12 Automatic Switching Failures

l The parameters of the protection group are improperly configured.


l Locked or forced switching is not released.
l The electrical cross-connections are configured incorrectly.
l The protection channel works abnormally.

Fault Handling Flow

Figure 12-4 Fault handling flow when automatic switching by SW SNCP protection fails

Start

No Are the Yes


Do the parameter Modify the services switched to
values comply with the parameters. the protection
specifications? channel?
No
Yes

Yes Are the Yes


Is a locked or Clear the locked or
forced switching command services switched to
forced switching the protection
manually issued? command. channel?

No
No

Are the Modify the Are the


No Yes
configurations of electrical configurations of services switched to
cross-connections electrical cross- the protection
correct? connections. channel?

Yes No

Yes Are the Yes


Dose the protection Handle the faults on services switched to
channel work the protection channel. the protection
abnormally? channel?

No No

End
Handle the faults on the
working channel.

Contact Huawei
engineers.

Tools, Equipment and Materials


U2000

Procedure
l Cause 1: The parameters of the protection group are improperly configured.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 144


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 12 Automatic Switching Failures

1. Select the NE in the NE Explorer, and choose Configuration > WDM Service
Management from the Function Tree. Click the SNCP Service Control tab.
2. Click Query in the SNCP Service Control interface, to query the values of all
parameters of the protection group. Click Close in the Operation Result dialog box.
3. Compare the values against the principles and requirements specified in Dependency
and Limitation and Parameters in the Feature Description to ensure that the parameters
are configured properly.
l Cause 2: Locked or forced switching is not released.
1. In the SNCP Service Control interface, click Query to check the Status parameter
of the protection group.
2. If Status is displayed as Lockout or Forced (from protection to working) switching
state, select the protection group and right-click it. Choose Clear from the shortcut
menu to clear the locked or forced switching configuration.
l Cause 3: The electrical cross-connections are configured incorrectly.
1. Select the NE in the NE Explorer, and choose Configuration > WDM Service
Management from the Function Tree.
2. Click Query on the bottom of the WDM Cross-Connection Configuration interface
to check the created cross-connections.
3. According to the network design, ensure the source and sink ports of the dual fed
cross-connections that are configured on the source end are correct.
l Cause 4: The protection channel works abnormally.
1. In the SNCP Service Control interface, click Query to query the Channel Status
parameter of the protection cross-connection.
2. If Channel Status is displayed as SD, SF, check whether the following alarms of
trigger conditions exist on the relevant boards on the protection channel. For the
method of handling the alarms, see the Alarms and Performance Events Reference.
NOTE

If the Channel Status parameter of the protection cross-connection is displayed as SD, SF, it
is indicated that faults have occurred in both the working and protection channels.

----End

Solution
If the preceding measures are not successful in switching the services to the protection channel,
and Protection Channel Status is displayed as Normal, you need to perform a forced switching
to switch the services to the protection channel, which eventually restores the services.

For more information regarding how to handle faults that occur in the working channel, clean
the Alarms and Performance Events in the NE. If necessary, contact Huawei engineers to resolve
the problem.

12.5 ODUk SNCP Protection


This topic describes how to resolve switchover issues when ODUk SNCP protection fails to
automatically switch services to the protection channel.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 145


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 12 Automatic Switching Failures

Symptom
In normal operation of a network, the working channel has generated triggering conditions for
protection switching, but services cannot be automatically switched to the protection channel.
In this case, the services are interrupted.

Tributary, line or PID board may report an ODU_SNCP_STA_INDI alarm.

Possible Causes
l Cause 1: The SNCP Type is inappropriate for the service scenario.
l Cause 2: The protection group parameters are incorrectly configured.
l Cause 3: Locked or forced switching commands have been manually issued.
l Cause 4: The electrical cross-connections are incorrectly configured.
l Cause 5: The protection channel is working abnormally.
l Cause 6: The line board is faulty.

Troubleshooting Workflow
The following figure illustrates the ODUk SNCP protection automatic switching failure
troubleshooting process.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 146


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 12 Automatic Switching Failures

Figure 12-5 ODUk SNCP protection automatic switching failure troubleshooting flowchart

Start

Is the SNCP Type Are the Yes


No Change the SNCP Type
consistent with that services switched
required in the service of the protection group. to the protection
scenario? channel?
No
Yes

Are the
Do the parameter values No Yes
Modify the parameters. services switched
comply with the to the protection
specifications?
channel?
No
Yes

Is Are the
a locked or forced Yes Clear the locked or Yes
services switched
switching command forced switching
manually issued? to the protection
command.
channel?

No
No

Are Modify the Are the


the configurations No Yes
configurations of services switched
of electrical cross-
connections correct? electrical cross- to the protection
connections. channel?

Yes No

Does Are the


the protection Yes Handle the faults on Yes
services switched
channel work the working and
to the protection
abnormally? protection channels.
channel?

No No

Handle the faults on the


working channel. End

Contact Huawei
engineers.

Tools, Equipment, and Materials


U2000

Procedure
l Cause 1: The SNCP Type is inappropriate for the service scenario.
1. In the NE Explorer, click the NE and choose Configuration > WDM Service
Management from the Function Tree. Then click the SNCP Service Control tab.
2. In the SNCP Service Control window, click Query to query the Protection Type of
the protection group. In the Operation Result dialog box, click Close.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 147


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 12 Automatic Switching Failures

3. Refer to "Dependency and Limitation" and "Parameters" to check whether the


Protection Type of the protection group is appropriate for the service scenario.
l Cause 2: The protection group parameters are incorrectly configured.
1. In the SNCP Service Control window, click Query to query all protection group
parameter values.
2. Compare these values against those specified in "Dependency and Limitation" and
"Parameters" to ensure that the parameters are configured correctly.
l Cause 3: Locked switching or forced switching commands have been manually issued.
1. In the SNCP Service Control interface, click Query to check the Status parameter
of the protection group.
2. If Status is displayed as Lockout or Forced (from protection to working) switching
state, select the protection group and right-click it. Choose Clear from the shortcut
menu to clear the locked or forced switching configuration.
l Cause 4: The electrical cross-connections are configured incorrectly.
1. Select the NE in the NE Explorer, and choose Configuration > WDM Service
Management from the Function Tree.
2. Click Query on the bottom of the WDM Cross-Connection Configuration interface
to check the created cross-connections.
3. According to the network design, ensure the source and sink ports of the dual fed
cross-connections that are configured on the source end are correct.
l Cause 5: The protection channel is working abnormally.
1. In the SNCP Service Control window, click Query to query the Channel Status
parameter of the protection cross-connection.
2. If Channel Status is SD or SF, check whether the alarms listed in Trigger Conditions
are being reported on the corresponding boards on the protection channel, clear the
alarms.
NOTE
If the Channel Status parameter of the protection cross-connection is SD or SF, faults have occurred
on both the working and protection channels.
l Cause 6: The line board is faulty.
1. Forcibly switch services to the protection channel, which will eventually restores
services.
2. Replace the faulty line board.

----End

Solution
If the preceding measures are not successful in switching the services to the protection channel,
and Protection Channel Status is displayed as Normal, you need to perform a forced switching
to switch the services to the protection channel, which eventually restores the services.

Before the working channel is restored from a failure (for example, the faulty board has not yet
been replaced), services on the protection channel may run without protection for a long time.
To avoid a second failure on the network, you are advised to convert an idle and unprotected
service to the SNCP service for protection. For details, refer to "Converting a Normal WDM
Service to an SNCP Service." After the working channel is restored, convert the SNCP service

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 148


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 12 Automatic Switching Failures

back to an unprotected service. For details, refer to "Converting an SNCP Service to a Normal
WDM Service."

For more information regarding how to handle faults that occur in the working channel, clean
the Alarms and Performance Events in the NE. If necessary, contact Huawei engineers to resolve
the problem.

12.6 VLAN SNCP Protection


This section describes the handling methods for services with VLAN SNCP protection that fail
to automatically switch to the protection channel when the working channel works abnormally.

Fault Description
In normal operation of a network, the working channel has generated triggering conditions for
protection switching, but services cannot be automatically switched to the protection channel.
In this case, the services are interrupted.

L4G, TBE, LEM24, LEX4 board of OptiX OSN 6800/OptiX OSN 3800 may report
VLAN_SNCP_PS.

LEM24, LEX4 board of OptiX OSN 8800 may report VLAN_SNCP_PS.

Possible Causes
The possible causes of the automatic switching failure are as follows:
l The parameters of the protection group are improperly configured.
l Locked or forced switching is not released.
l The electrical cross-connections are configured incorrectly.
l The protection channel works abnormally.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 149


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 12 Automatic Switching Failures

Fault Handling Flow

Figure 12-6 Fault handling flow when automatic switching by VLAN SNCP protection fails

Start

No Are the Yes


Does the parameter Modify the services switched to
values comply with the parameters. the protection
specifications?
channel?
No
Yes

Yes Are the Yes


Is a locked or Clear the locked or
services switched to
forced switching command forced switching
manually issued? the protection
command.
channel?
No
No

Are the Modify the Are the


No Yes
configurations of electrical configurations of services switched to
cross-connections electrical cross- the protection
correct? connections. channel?

Yes No

Is Protection No Handle the faults on Are the Yes


Channel Status the working and services switched to
Normal? protection channels. the protection
channel?

Yes No

Forcibly switch the End


services to the
protection channel.

Handle the faults on the


working channel.

Contact Huawei
engineers.

Tools, Equipment and Materials


U2000

Procedure
l Cause 1: The parameters of the protection group are improperly configured.
1. Select the desired Ethernet board in the NE Explorer, and choose Configuration >
Ethernet Service > Ethernet Line Service from the Function Tree.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 150


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 12 Automatic Switching Failures

2. Click Query in the VLAN SNCP Service Management tab to query the values of
all parameters of the protection group.
3. Compare the values against the principles and requirements specified in Dependency
and Limitation and Parameters in the Feature Description to ensure that the parameters
are configured properly.
l Cause 2: Locked or forced switching is not released.
1. In the VLAN SNCP Service Management interface, click Set/Query Switching and
select Query Switching Status to query the Channel Status parameter of the
protection group.
2. If Channel Status for the protection channel is displayed as Lockout State or Forced
(Standby to Active) Switching State, right-click the protection group, and then
choose Clear from the shortcut menu, to clear the locked or forced switching
configuration.
l Cause 3: The electrical cross-connections are configured incorrectly.
1. Select the NE in the NE Explorer, and choose Configuration > WDM Service
Management from the Function Tree.
2. Click Query on the bottom of the WDM Cross-Connection Configuration interface
to check the created cross-connections.
3. According to the network design, ensure the source and sink ports of the dual fed
cross-connections that are configured on the source end are correct.
l Cause 4: The protection channel works abnormally.
1. In the VLAN SNCP Service Management interface, click Set/Query Switching and
select Query Switching Status to check the Link Status parameter of the protection
service.
2. If Link Status is displayed as SF, check whether the following alarms exist on the
relevant boards on the protection channel: R_LOS, R_LOF, R_OOF, OTU5G_LOF,
OTU5G_AIS, OTU5G_LOM. For the method of handling the alarms, see the Alarms
and Performance Events Reference.
NOTE

If the Link Status parameter of the protection service is displayed as SF, it is indicated that
faults have occurred in both the working and protection channels.

----End

Solution
If the preceding measures are not successful in switching the services to the protection channel,
and Protection Channel Status is displayed as Normal, you need to perform a forced switching
to switch the services to the protection channel, which eventually restores the services.
For more information regarding how to handle faults that occur in the working channel, clean
the Alarms and Performance Events in the NE. If necessary, contact Huawei engineers to resolve
the problem.

12.7 Tributary SNCP Protection


This topic describes how to resolve switchover issues when tributary SNCP protection fails to
automatically switch services to the protection channel.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 151


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 12 Automatic Switching Failures

Symptom
In normal operation of a network, the working channel has generated triggering conditions for
protection switching, but services cannot be automatically switched to the protection channel.
In this case, the services are interrupted.

Tributary, line or PID board may report an ODU_SNCP_PS alarm or ODU_SNCP_STA_INDI


alarm.

Possible Causes
l Cause 1: The SNCP Type is inappropriate for the service scenario.
l Cause 2: The protection group parameters are incorrectly configured.
l Cause 3: Locked or forced switching commands have been manually issued.
l Cause 4: The electrical cross-connections are incorrectly configured.
l Cause 5: The protection channel is working abnormally.

Troubleshooting Workflow
The following figure illustrates the tributary SNCP protection automatic switching failure
troubleshooting process.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 152


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 12 Automatic Switching Failures

Figure 12-7 Tributary SNCP protection automatic switching failure troubleshooting flowchart

Start

Is the SNCP Type Are the Yes


No Change the
consistent with that services switched
required in the service SNCP Type
of the protection group. to the protection
scenario? channel?
No
Yes

Do Are the
the parameter No Yes
Modify the parameters. services switched
values comply with the
specifications? to the protection
channel?
No
Yes

Are the Yes


Is a locked or Yes Clear the locked or
forced switching command services switched
forced switching
manually issued? to the protection
command. channel?

No
No

Are Modify the Are the Yes


the configurations No configurations of services switched
of electrical cross- electrical cross-
connections correct? to the protection
connections. channel?

Yes No

Does the Handle the faults on Are the Yes


No
protection channel work the working and services switched
protection channels. to the protection
normally? channel?

Yes No

Handle the faults on the


working channel. End

Contact Huawei
engineers.

Tools, Equipment, and Materials


U2000

Procedure
l Cause 1: The SNCP Type is inappropriate for the service scenario.
1. In the NE Explorer, click the NE and choose Configuration > WDM Service
Management from the Function Tree. Then click the SNCP Service Control tab.
2. In the SNCP Service Control window, click Query to query the Protection Type of
the protection group.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 153


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 12 Automatic Switching Failures

3. Refer to "Dependency and Limitation" and "Parameters" to check whether the


Protection Type of the protection group is appropriate for the service scenario.
l Cause 2: The protection group parameters are incorrectly configured.
1. In the SNCP Service Control window, click Query to query all protection group
parameter values.
2. Compare these values against those specified in "Dependency and Limitation" and
"Parameters" to ensure that the parameters are configured correctly.
l Cause 3: Locked switching or forced switching commands have been manually issued.
1. In the SNCP Service Control interface, click Query to check the Status parameter
of the protection group.
2. If Status is displayed as Lockout or Forced (from protection to working) switching
state, select the protection group and right-click it. Choose Clear from the shortcut
menu to clear the locked or forced switching configuration.
l Cause 4: The electrical cross-connections are configured incorrectly.
1. Select the NE in the NE Explorer, and choose Configuration > WDM Service
Management from the Function Tree.
2. Click Query on the bottom of the WDM Cross-Connection Configuration interface
to check the created cross-connections.
3. According to the network design, ensure the source and sink ports of the dual fed
cross-connections that are configured on the source end are correct.
l Cause 5: The protection channel is working abnormally.
1. In the SNCP Service Control window, click Query to query the Channel Status
parameter of the protection cross-connection.
2. If Channel Status is SD or SF, check whether the alarms listed in Trigger Conditions
are being reported on the corresponding boards on the protection channel, clear the
alarms.
NOTE
If the Channel Status parameter of the protection cross-connection is SD or SF, faults have occurred
on both the working and protection channels.

----End

Solution
If the preceding measures are not successful in switching the services to the protection channel,
and Protection Channel Status is displayed as Normal, you need to perform a forced switching
to switch the services to the protection channel, which eventually restores the services.

For more information regarding how to handle faults that occur in the working channel, clean
the Alarms and Performance Events in the NE. If necessary, contact Huawei engineers to resolve
the problem.

NOTE
If the section overhead status changes, protection switching may be triggered.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 154


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 12 Automatic Switching Failures

12.8 Automatic Switching Failure of DBPS Protection


This section describes the handling methods for services with DBPS protection that fail to
automatically switch to the protection channel when the working channel works abnormally.

Fault Description
In normal operation of a network, the working channel has generated triggering conditions for
protection switching, but services cannot be automatically switched to the protection channel.
In this case, the services are interrupted.

TBE board may report DBPS_ABNORMAL alarm.

Possible Causes
The possible causes of the automatic switching failure are as follows:
l The parameters of the protection group are improperly configured.
l There is alarm to be solved.

Fault Handling Flow


Fault handling flow in the case that services cannot be switched back to the protection channel
in the DBPS protection, see Figure 12-8.

Figure 12-8 Fault handling flow when automatic switching by DBPS protection fails

Start

No Are the Yes


Does the parameter Modify the services switched
value comply with the parameters. to the protection
specifications?
channel?

Yes No

Are the Yes


No
Are the alarms services switched
Clear the alarms. to the protection
cleared?
channel?

Yes No

Handle the faults on


the working channel.

End

Contact Huawei
engineers.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 155


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 12 Automatic Switching Failures

Tools, Equipment and Materials


U2000

Procedure
l Cause 1: The parameters of the protection group are improperly configured.
1. In the NE Explorer, select the NE and then choose Configuration > Distributed
Board-Level Protection from the Function Tree.
2. In the right pane, click Query to obtain the current information about the created
DBPS.
3. Compare the values against the principles and requirements specified in Dependencies
and Limitations and Parameters in the Feature Description to check and ensure that
the parameters are configured properly.
l Cause 2: There is alarm to be solved.
1. Check if there is the DBPS_ABNORMAL, LINK_ERR, R_LOS, R_LOF,
HARD_BAD, PORT_MODULE_OFFLINE alarms.
2. Solve the alarms. For details, refer to the Alarms and Performance Events
Reference.

----End

Solution
If the preceding measures are not successful in switching the services to the protection channel,
and Protection Channel Status is displayed as Normal, you need to perform a forced switching
to switch the services to the protection channel, which eventually restores the services.

For more information regarding how to handle faults that occur in the working channel, clean
the Alarms and Performance Events in the NE. If necessary, contact Huawei engineers to resolve
the problem.

12.9 Board 1+1 Protection


This topic describes how to resolve switchover issues when board-level protection fails to
automatically switch services to the protection channel. Board-level protection refers to an
equipment-level protection scheme, which includes M:N protection for the cross-connect board
and 1+1 protection for the system control board.

Background Information
The OptiX OSN 3800 supports 1+1 protection for only SCC boards.

The OptiX OSN 6800 supports 1+1 protection for only cross-connect boards and SCC boards.

Symptom
During normal network operations, conditions that trigger protection switching occur on the
equipment, but services do not automatically switch to the standby board.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 156


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 12 Automatic Switching Failures

l If 1+1 protection for the system control board fails to be automatically switched, the
U2000 cannot reach the equipment. The services on the equipment, however, are
unaffected.
l If the protection switching of the cross-connect boards or clock boards fails, the services
on the equipment are interrupted.

Possible Causes
l Cause 1: No standby board is configured.
l Cause 2: The standby board is faulty.

Tools, Equipment, and Materials


U2000

Procedure
l Cause 1: No standby board is configured.
1. Configure a standby board.
l Cause 2: The standby board is faulty.
1. Replace the faulty standby board.

----End

Solution
After configuring a standby board or replacing the faulty standby board, replace the faulty active
board to restore services.

NOTE
To replace a board with active/standby protection, use a board of the same type.

12.10 Ethernet Ring Protection


This section describes the symptoms, impact, and possible causes of the protection switching
failure of Ethernet ring protection. In addition, this section describes the required tools,
precautions, and procedures for rectifying the fault.

Fault Description
When the network operates normally, the working channel has generated the conditions of
triggering the protection switching, but the services cannot be automatically switched to the
protection channel. In this case, the services are interrupted.

If there is more than one RPL owner node on the ring, the system reports the
MULTI_RPL_OWNER alarm.

Impact on the System


An ERPS failure interrupts services.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 157


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 12 Automatic Switching Failures

Possible Causes
l Cause 1: The configuration of the protection group parameters is incorrect.
l Cause 2: The protection channel works abnormally.
l Cause 3: The fiber connections on the board are incorrect.
l Cause 4: The hardware is faulty.

Tools, Equipment, and Materials


U2000

Procedure
l Cause 1: The configuration of the protection group parameters is incorrect. As a result, the
protection switching fails.
1. Review the configuration of the protection group parameters on the U2000.

If... Then...

The configuration of the protection Re-configure the protection group


group parameters is incorrect parameters. Determine if the services
are restored. If the services are not
restored, determine if the fault is due to
other causes.

The configuration of the protection Determine if the fault is due to other


group parameters is correct causes.

l Cause 2: The protection channel works abnormally.


1. Determine if the following alarms exist on the relevant boards: AU_LOP, MS_AIS,
AU_AIS, B3_EXC, R_LOS, and LINK_ERR. For the method of handling the alarms,
see the Alarms and Performance Events Reference.
l Cause 3: The fiber connections on the board are incorrect. As a result, the protection
switching fails.
1. See the protection principles to determine if the fiber connections at the faulty point
are correct.

If... Then...

The fiber connections are incorrect Reconnect the fibers. Determine if the
services recover. If the services do not
recover, determine if the fault is due to
other causes.

The fiber connections are correct Determine if the fault is due to other
causes.

l Cause 4: The hardware is faulty. As a result, the protection switching fails.


1. See the Parts Replacement to replace the faulty board.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 158


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 12 Automatic Switching Failures

2. Determine if the services are restored. If the services are not restored, determine if the
fault is due to other causes.

----End

Solution
If the preceding measures are not successful in switching the services to the working channel,
contact Huawei engineers to resolve the problem.

12.11 MS SNCP Protection


This section describes the handling methods for services with MS SNCP protection that fail to
automatically switch to the protection channel when the working channel works abnormally.

Fault Description
In normal operation of a network, the working channel has generated triggering conditions for
protection switching, but services cannot be automatically switched to the protection channel.
In this case, the services are interrupted.

The OTU board may report MS_SNCP_PS alarm or MS_SNCP_STA_INDI alarm.

Possible Causes
The possible causes of the automatic switching failure are as follows:
l The related boards work abnormally.
l The parameters of the protection group are improperly configured.
l Locked switching or forced switching is not released.
l The electrical cross-connections are configured incorrectly.
l The protection channel works abnormally.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 159


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 12 Automatic Switching Failures

Fault Handling Flow

Figure 12-9 Fault handling flow when automatic switching by MS SNCP protection fails

Start

Does the No Are services Yes


corresponding Replace the
switched to the
board work board.
protection channel?
normally?
No
Yes

Do the parameter No Are services Yes


Modify the
values comply with switched to the
parameters.
the specifications? protection channel?

No
Yes

Is a locked or forced Yes Clear the locked or Are services Yes


forced switching switched to the
switching command protection
manually issued? command. channel?

No
No

Modify the
No Are services Yes
Are the configurations of configurations of switched to the
electrical cross-connections electrical cross-
correct? protection channel?
connections.

Yes No

No
Does the protection Handle the faults on the Are services Yes
channel work protection channel. switched to the
normally? protection channel?

Yes No

Handle the faults on End


the working channel.

Contact Huawei
engineers.

Tools, Equipment and Materials


U2000

Procedure
l Cause 1: The related boards work abnormally.
1. Allocate the board which works abnormally.
2. Replace the board. For details refer to Parts Replacement.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 160


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 12 Automatic Switching Failures

l Cause 2: The parameters of the protection group are improperly configured.


1. Select the NE in the NE Explorer, and choose Configuration > WDM Service
Management from the Function Tree. Click SNCP Service Control tab.
2. Click Query in the SNCP Service Control interface, to query the values of all
parameters of the protection group.
3. Compare the values against the principles and requirements specified in Dependency
and Limitation and Parameters in the OptiX OSN 6800/3800 Feature Description to
check and ensure that the parameters are configured properly.
l Cause 3: Locked switching or forced switching is not released.
1. In the SNCP Service Control interface, click Query to query the Status parameter
of the protection group.
2. If Status is displayed as Lockout or Forced (from protection to working) switching
state, right-click the protection group, and then choose Clear from the shortcut menu,
to clear the locked or forced switching configuration.
l Cause 4: The electrical cross-connections are configured incorrectly.
1. Select the NE in the NE Explorer, and choose Configuration > WDM Service
Management from the Function Tree.
2. Click Query on the bottom of the WDM Cross-Connection Configuration interface
to query the created cross-connections.
3. According to the network design, check if source and sink ports of the dual fed cross-
connections that are configured on the source end are correct.
l Cause 5: The protection channel works abnormally.
1. In the SNCP Service Control interface, click Query to query the Channel Status
parameter of the protection cross-connection.
2. If Channel Status is displayed as SD, SF, check whether the following alarms exist
on the relevant boards on the protection channel: R_LOF, R_LOS, R_LOC,
OTUk_LOF, OTUk_LOM, OTUk_AIS, ODUk_PM_AIS, ODUk_PM_OCI,
ODUk_PM_LCK, ODUk_LOFLOM, REM_SF, B1_EXC, ODUk_PM_EXC,
OTUk_EXC, REM_SD. For the method of handling the alarms, see the Alarms and
Performance Events Reference.
NOTE

If the Channel Status parameter of the protection cross-connection is displayed as SD, SF, it
is indicated that faults have occurred in both the working and protection channels.

----End

Solution
If the preceding measures are not successful in switching the services to the protection channel,
and Protection Channel Status is displayed as Normal, you need to perform a forced switching
to switch the services to the protection channel, which eventually restores the services.

For more information regarding how to handle faults that occur in the working channel, clean
the Alarms and Performance Events in the NE. If necessary, contact Huawei engineers to resolve
the problem.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 161


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 12 Automatic Switching Failures

12.12 DLAG Protection


This section describes the handling methods for services with DLAG protection that fail to
automatically switch to the protection channel when the working channel works abnormally.

Fault Description
In normal operation of a network, the working channel has generated triggering conditions for
protection switching, but services cannot be automatically switched to the protection channel.
In this case, the services are interrupted.

The board may report DLAG_PROTECT_FAIL alarm.

Possible Causes
The possible causes of the automatic switching failure are as follows:
l The parameters of the protection group are improperly configured.
l There is alarm which needs to be cleared.

Fault Handling Flow

Figure 12-10 Fault handling flow when automatic switching by DLAG protection fails

Start

No Are the Yes


Do the parameter Modify the services switched
values comply with the parameters. to the protection
specifications?
channel?

Yes No

No Are the Yes


Are the alarms cleared? Clear the alarms. services switched
to the protection
channel?

Yes No

Handle the faults on


the working channel.

End

Contact Huawei
engineers.

Tools, Equipment and Materials


U2000

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 162


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 12 Automatic Switching Failures

Procedure
l Cause 1: The parameters of the protection group are improperly configured.
1. In the NE Explorer, select the NE and then choose Configuration > Ethernet
Distributed Link Aggregation Management from the Function Tree.
2. In the right pane, click Query to obtain the current information about the created
DLAG.
3. For DLAG Protection(EOO/EOW), compare the values against the principles and
requirements specified in Parameters in the OptiX OSN 8800 Feature Description to
ensure that the parameters are configured properly.
4. For DLAG Protection(EOS), compare the values against the principles and
requirements specified in Dependency and Limitation and Parameters in the OptiX
OSN 8800 Feature Description to check and ensure that the parameters are configured
properly.
5. For DLAG Protection(EOS), compare the values against the principles and
requirements specified in Dependency and Limitation and Parameters in the OptiX
OSN 6800/3800 Feature Description to ensure that the parameters are configured
properly.
l Cause 2: There is alarm which needs to be cleared.
1. On the U2000, determine if there is DLAG_PROTECT_FAIL or other alarms on the
protection channel.
2. Clear the alarms. For more information, refer to the Alarms and Performance Events
Reference.
----End

Solution
If the preceding measures are not successful in switching the services to the protection channel,
and Protection Channel Status is displayed as Normal, you need to perform a forced switching
to switch the services to the protection channel, which eventually restores the services.
For more information regarding how to handle faults that occur in the working channel, clean
the Alarms and Performance Events in the NE. If necessary, contact Huawei engineers to resolve
the problem.

12.13 Board-Level Protection


This section describes the handling methods for services with board-level protection that fail to
automatically switch to the protection channel when the working channel works abnormally.

Fault Description
In normal operation of a network, the working channel has generated triggering conditions for
protection switching, but services cannot be automatically switched to the protection channel.
In this case, the services are interrupted.

Possible Causes
The possible causes of the automatic switching failure are as follows:

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 163


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 12 Automatic Switching Failures

l The parameters of the protection group are improperly configured.


l Forced switching is not released.
l The protection board works abnormally.

Fault Handling Flow

Figure 12-11 Fault handling flow when automatic switching by board-level protection fails

Start

No Are the Yes


Does the parameter services switched to
value comply with the Modify parameters
the protection
specification?
board?
No
Yes

Yes Are the Yes


Is a forced Clear the
services switched to
switching command forced switching
the protection
issued? command
board?
No
No

No Clear the faults in the Are the Yes


Is Protection Board services switched to
Status displayed as working and protection
boards the protection
Available? board?
No
Yes

End
Perform forced
switching

Clear the faults in the


working board

Contact Huawei
engineers

Tools, Equipment and Materials


U2000

Procedure
l Cause 1: The parameters of the protection group are improperly configured.
1. Select the NE in the NE Explorer, and choose Configuration > Board-Level
Protection from the Function Tree.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 164


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 12 Automatic Switching Failures

2. Click Query in the interface, to query the values of all parameters of the protection
group.
3. Compare the values against the principles and requirements specified in Dependency
and Limitation and Parameters in the OptiX OSN 6800/3800 Feature Description to
ensure that the parameters are configured properly.
l Cause 2: Forced switching is not released.
1. In the window, click Query to check the Switching Status parameter of the protection
group.
2. If Switching Status is displayed as Forced Switch Request, right-click the protection
group, and then choose Clear from the shortcut menu to clear the locked switching
or forced switching configuration.
l Cause 3: The protection board works abnormally.
1. In the window, click Query to check the Protection Board Availability parameter
of the protection group.
2. If Protection Board Availability is displayed as Unusable, determine if the following
alarms exist on the relevant boards on the channels: HARD_BAD, BD_STATUS,
PORT_MODULE_OFFLINE, R_LOS, R_LOF. For more information, refer to the
Alarms and Performance Events Reference.

----End

Solution
If the problem persists after the above steps are performed and Protection Board
Availability is Available, choose Forced Switching to Protection to perform a forced
switching on the protection group. Then, the services are switched to the protection board and
the services recover.

For more information regarding how to handle faults that have occurred on the working board,
refer to the Alarms and Performance Events Reference. If necessary, contact Huawei engineers
to resolve the problems.

12.14 ODUk SPRing Protection


This section describes the handling methods for services with ODUk SPRing protection that fail
to automatically switch to the protection channel when the working channel works abnormally.

Fault Description
In normal operation of a network, the working channel has generated triggering conditions for
protection switching, but services cannot be automatically switched to the protection channel.
In this case, the services are interrupted.

The NEs at both ends of the channel where the switching occurs may report the ODUKSP_PS
alarm or ODUKSP_STA_INDI alarm.

Possible Causes
The possible causes of the automatic switching failure are as follows:

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 165


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 12 Automatic Switching Failures

l The protection channel works abnormally.


l Locked switching or forced switching is not released.
l The parameters of the protection group are improperly configured.
l The SD Switching is set to Disabled.
l The protection protocol is not started.
l The fibers are improperly connected to the corresponding boards.

Fault Handling Flow

Figure 12-12 Fault handling flow when automatic switching by ODUk SPRing protection fails

Start

Does No Handle the faults on the Are the Yes


the protection channel services switched to
working and
work normally? the protection
protection channels. channel?
No
Yes

Is a locked or Yes Are the Yes


Clear the locked services switched to
forced switching command or forced switching the protection
manually issued? command. channel?

No No

Do Are the
the parameter No Yes
Modify the services switched to
values comply with parameters. the protection
the specifications? channel?

Yes No

Yes Are the Yes


Is SD Switching set services switched to
Modify the parameter.
to Disabled? the protection
channel?

No No

No Are the Yes


Is the protection Start the protection services switched to
protocol started? protocol. the protection
channel?

No
Yes

Are the fiber No Are the Yes


Modify the fiber services switched to
connections on the board the protection
correct? connections.
channel?

Yes No
End

Perform manual
Handle the faults on the Contact Huawei
switching or forced
switching. working channel. engineers.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 166


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 12 Automatic Switching Failures

Tools, Equipment and Materials


U2000

Procedure
l Cause 1: The protection channel works abnormally.
1. In the Channel Mapping Relation interface, click Query to query the Channel
Status parameter of the protection channel.
2. If Channel Status is displayed as SD, SF, check whether the following alarms exist
on the relevant boards on the protection channel: R_LOS, R_LOC, HARD_BAD,
OTUk_LOF, OTUk_LOM, OTUk_AIS, ODUk_LOFLOM, ODUk_TCMn_OCI,
ODUk_TCMn_LCK, ODUk_TCMn_AIS, OTUk_EXC, ODUk_TCM6_DEG,
ODUk_TCM6_EXC. For the method of handling the alarms, see the Alarms and
Performance Events Reference.
l Cause 2: Locked switching or forced switching is not released.
1. In the ODUk SPRing interface, click Query to query the Switching Request-East
or Switching Request-West parameter of the protection group.
2. If Switching Request-West or Switching Request-East is displayed as Locked or
Forced Switching - Ring, right-click the protection group, and then choose Clear
from the shortcut menu, to clear the locked or forced switching configuration.
l Cause 3: The parameters of the protection group are improperly configured.
1. Select the NE in the NE Explorer, and choose Configuration > ODUk SPRing from
the Function Tree.
2. Click Query in the interface, to query the values of all parameters of the protection
group.
3. Check whether Span ID has been set correctly.

For example, if services are added/dropped at sites A and B and site B is east to site
A, then the span ID for the west working unit of site B must be the same as the span
ID for the east working unit of site A.
4. Compare the values against the principles and requirements specified in Dependencies
and Limitations and Parameters in the Feature Description to check and ensure that
the other parameters are configured properly.
l Cause 4: SD Switching has been set to Disabled.
1. Click Query to query the SD Switching parameter of the protection group.

If SD Switching has been set to Disabled, services cannot be switched to the


protection channel when an OTUk_EXC, ODUk_TCM6_DEG, or
ODUk_TCM6_EXC alarm is generated on the working channel.
2. Set SD Switching to Enabled.
l Cause 5: The protection protocol is not started.
1. Click Query to query the Switching Request-East or Switching Request-West
parameter of the protection group.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 167


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 12 Automatic Switching Failures

If the protection protocol has not started, Switching Request-East or Switching


Request-West is displayed as Unknown.
2. Select the protection group and click Start Protocol.
l Cause 6: The fibers are improperly connected to the corresponding boards.
1. Refer to the protection principle in Principles. Check if the fiber connections are
correct on the optical interfaces of the east and west working channels and protection
channels of the line boards at the stations on the two ends.

----End

Solution
If the preceding measures are not successful in switching the services to the protection channel,
contact Huawei engineers to resolve the problem.

12.15 Optical Wavelength Shared Protection


This section describes the handling methods for services with optical wavelength shared
protection that fail to automatically switch to the protection channel when the working channel
works abnormally.

Fault Description
In normal operation of a network, the working channel has generated triggering conditions for
protection switching, but services cannot be automatically switched to the protection channel.
In this case, the services are interrupted.

DCP board may report OWSP_PS alarm or OWSP_STA_INDI alarm.

Possible Causes
The possible causes of the automatic switching failure are as follows:
l The parameters of the protection group are improperly configured.
l The protection protocol is not started.
l Locked switching or forced switching is not released.
l The fibers are improperly connected to the corresponding boards.
l The protection channel works abnormally.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 168


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 12 Automatic Switching Failures

Fault Handling Flow

Figure 12-13 Fault handling flow when automatic switching by OWSP protection fails

Start

Do Are the
No Yes
the parameter Modify the services switched to
values comply with the parameters. the protection
specifications?
channel?

Yes No

No Are the Yes


Is the protection protocol Start the protection services switched to
started? protocol. the protection
channel?

Yes No

Yes Are the Yes


Is a locked or Clear the locked or
services switched to
forced switching command forced switching the protection
manually issued? command. channel?

No No

Are Are the


No Modify the fiber services switched to Yes
the fiber connections
on the board correct? connections. the protection
channel?

Yes No

Does No Handle the faults on the Are the Yes


the protection channel working and services switched to
work normally? the protection
protection channels. channel?

Yes No
End
Perform a manual
or forced switching.

Handle the faults on the


working channel.

Contact Huawei
engineers.

Tools, Equipment and Materials


U2000

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 169


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 12 Automatic Switching Failures

Procedure
l Cause 1: The parameters of the protection group are improperly configured.
1. Select the NE in the NE Explorer, and choose Configuration > Optical Wavelength
Shared Protection from the Function Tree. Click OWSP Protection tab.
2. Click Query in the OWSP Protection interface, to query the values of all parameters
of the protection group.
3. Compare the values against the principles and requirements specified in Dependencies
and Limitations and Parameters in the Feature Description to check and ensure that
the parameters are configured properly.
l Cause 2: The protection protocol is not started.
1. In the OWSP Protection interface, select the protection group and click Start
Protocol.
l Cause 3: Locked switching or forced switching is not released.
1. In the OWSP Protection interface, click Query to query the Switching Request-
East or Switching Request-West parameter of the protection group.
2. If Switching Request-East or Switching Request-West is displayed as Locked or
Forced Switching - Ring, right-click the protection group, and then choose Clear
from the shortcut menu, to clear the locked or forced switching configuration.
l Cause 4: The fibers are improperly connected to the corresponding boards.
1. Refer to the protection principle in Principles. Check if the fiber connections are
correct on the optical interfaces of the east and west working channels and protection
channels of the DCP boards at the stations on the two ends.
l Cause 5: The protection channel works abnormally.
1. In the OWSP Protection interface, click Query to query the Channel Status
parameter of the protection channel.
2. If Channel Status is displayed as SD, SF, check whether the following alarms exist
on the relevant boards on the protection channel: MUT_LOS, R_LOS, R_LOC,
R_LOF, OTUk_LOF, OTUk_LOM, OTUk_AIS, ODUk_PM_AIS,
ODUk_PM_OCI, ODUk_PM_LCK, OTUk_TIM, ODUk_PM_TIM,
ODUk_LOFLOM, IN_PWR_HIGH, IN_PWR_LOW, B1_EXC, ODUk_PM_DEG,
ODUk_PM_EXC, OTUk_DEG, OTUk_EXC. For the method of handling the alarms,
see the Alarms and Performance Events Reference.
NOTE

If the Channel Status parameter of the protection channel is displayed as SD, SF, it is indicated
that faults have occurred in both the working and protection channels.

----End

Solution
If the preceding measures are not successful in switching the services to the protection channel,
and Protection Channel Status is displayed as Normal, you need to perform a forced switching
to switch the services to the protection channel, which eventually restores the services.

For more information regarding how to handle faults that occur in the working channel, clean
the Alarms and Performance Events in the NE. If necessary, contact Huawei engineers to resolve
the problem.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 170


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 12 Automatic Switching Failures

12.16 Inter-Subrack Communication Protection


This topic describes how to resolve switchover issues when inter-subrack communication
protection fails to automatically switch services to the standby route when communication
between master and slave subracks is interrupted.

Background Information
The OptiX OSN 8800 and 6800 support protection for inter-subrack communication.

Symptom
During normal network operations, the communication between master and slave subracks is
interrupted, and services do not automatically switch to the standby route.

System Impact
When inter-subrack communication protection fails to automatically switch, the U2000 cannot
reach the slave subracks. The services on the equipment, however, are unaffected.

Possible Causes
l Cause 1: The protection network cable is loosely connected or is malfunctioning.

Tools, Equipment, and Materials


U2000

Procedure
l Cause 1: The protection network cable is loosely connected or is malfunctioning.
1. Securely attach the protection network cable.
2. Replace the protection network cable.

----End

12.17 Linear Multiplex Section Protection


This section describes the handling methods for services with linear multiplex section protection
that fail to automatically switch to the protection channel when the working channel works
abnormally.

Fault Description
When the network operates normally, the working channel has generated the conditions of
triggering the protection switching, but the services cannot be automatically switched to the
protection channel. In this case, the services are interrupted.

SCC board may report APS_FAIL alarm.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 171


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 12 Automatic Switching Failures

Possible Causes
l Cause 1: The protocol fails to start and the protection switching fails.
l Cause 2: The protection switching protocol is abnormal. The protocol status is not idle.
l Cause 3: The protocol is stopped manually or a forced switching is performed manually.
As a result, the protection switching fails.
l Cause 4: The power supply of the device is faulty.
l Cause 5: The hardware is faulty.

Tools, Equipment, and Materials


U2000

Procedure
Step 1 Cause 1: The protocol fails to start and the protection switching fails.
1. In the NE Explorer, click the desired NE and choose Configuration > Linear MS from
the Function Tree.
2. In Linear MS interface, click Stop Protocol. Click Close in the Operation Result dialog
box.
3. Click Start Protocol to re-enable the protocol. Click Close in the Operation Result dialog
box.
4. Check whether the hardware is faulty. If the hardware is faulty, go to cause 5.

Step 2 Cause 2: The protection switching protocol is abnormal. The protocol status is not idle.
1. Check the settings of linear MS parameters on the U2000.

If... Then...

The settings of the linear MS parameters Set the parameters of the multiplex section
are incorrect to proper values. Check whether services
recover. If the services do not recover,
check whether the fault is due to other
causes.

The configuration of the linear MS proceed to the next substep.


parameters is correct

2. The optional switching conditions are not configured.

If... Then...

The optional switching conditions are not Configure B2_SD as an optional switching
configured. conditions. For the details, see the Feature
Description.

The working channel is configured with proceed to the next substep.


optional switching conditions.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 172


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 12 Automatic Switching Failures

3. The protection channel works abnormally.


If... Then...

Check whether the following alarms exist For the method of handling the alarms, see
on the relevant boards: the Alarms and Performance Events
R_LOS, R_L0F, R_LOC, MS_AIS, Reference.
B2_EXC, B2_SD

No alarms, proceed to the next substep.

4. Re-enable the protocol, check whether services recover. If the services do not recover,
proceed to the next substep.
5. Check whether the hardware is faulty. If the hardware is faulty, go to cause 5.
Step 3 Cause 3: The protocol is stopped manually or a forced switching is performed manually. As a
result, the protection switching fails.
1. Check the protocol controller state on the U2000.
If... Then...

The state of the protocol controller is Restart the protocol controller. Check
Protocol Stopped whether services recover. If the services do
not recover, check whether the fault is due
to other causes.

The state of the protocol controller is proceed to the next substep.


Protocol Started

2. Check the protection group switching state on the U2000.


If... Then...

The switching state of the protection group Clear protection switching. Check whether
is Lockout or Switching services recover. If the services do not
recover, check whether the fault is due to
other causes.

The switching state of the protection group Check whether the fault is due to other
is Normal causes.

Step 4 Cause 4: The power supply of the device is faulty. As a result, the protection switching fails.
1. Refer to 5.5 Service Interruptions Caused by Power Supply Failures to rectify the fault.
Step 5 Cause 5: The hardware is faulty. As a result, the protection switching fails.
1. See the Parts Replacement to replace the faulty board.

----End

Related Information
None

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 173


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 12 Automatic Switching Failures

12.18 MSP Ring


This section describes the handling methods for services with MSP ring protection that fail to
automatically switch to the protection channel when the working channel works abnormally.

Fault Description
When the network operates normally, the working channel generates the triggering conditions
for protection switching, but the services cannot be automatically switched to the protection
channel. In this case, the services are interrupted.

SCC board may report APS_FAIL alarm.

Possible Causes
l Cause 1: The protocol fails to start and the protection switching fails.
l Cause 2: The protection switching protocol is abnormal. The protocol status is not idle.
l Cause 3: The protocol is stopped manually or the forced switching is performed manually.
As a result, the protection switching fails.
l Cause 4: The power supply of the device is faulty.
l Cause 5: The hardware is faulty.

Tools, Equipment, and Materials


U2000

Procedure
Step 1 Cause 1: The protocol fails to start and the protection switching fails.
1. In the NE Explorer, click the desired NE and choose Configuration > Ring MS from the
Function Tree.
2. In Ring MS interface, click Stop Protocol. Click Close in the Operation Result dialog
box.
3. Click Start Protocol to re-enable the protocol. Click Close in the Operation Result dialog
box.
4. Re-enable the protocol, check whether services recover. If the services do not recover,
proceed to the next substep.
5. Check whether the hardware is faulty. If the hardware is faulty, go to cause 5.

Step 2 Cause 2: The protection switching protocol is abnormal. The protocol status is not idle.
1. Check the settings of the ring MS parameters on the U2000.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 174


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 12 Automatic Switching Failures

If... Then...

The settings of the ring MS parameters are Set the parameters of the ring MS to proper
incorrect values. Check whether services recover. If
the services do not recover, check whether
the fault is due to other causes.

The configuration of the ring MS Proceed to the next substep.


parameters is correct

2. The optional switching conditions are not configured.

If... Then...

The optional switching conditions are not Configure B2_SD as an optional switching
configured. conditions. For the details, see the Feature
Description.

The working channel is configured with proceed to the next substep.


optional switching conditions.

3. The protection channel works abnormally.

If... Then...

Check whether the following alarms exist For the method of handling the alarms, see
on the relevant boards: the Alarms and Performance Events
R_LOS, R_L0F, R_LOC, MS_AIS, Reference.
B2_EXC, B2_SD

No alarms, proceed to the next substep.

4. Re-enable the protocol, check whether services recover. If the services do not recover,
proceed to the next substep.
5. Check whether the hardware is faulty. If the hardware is faulty, go to cause 5.

Step 3 Cause 3: The protocol is stopped manually or the forced switching is performed manually. As
a result, the protection switching fails.
1. Check the protocol controller state on the U2000.

If... Then...

The state of the protocol controller is Restart the protocol controller. Check
Protocol Stopped whether services recover. If the services do
not recover, check whether the fault is due
to other causes.

The state of the protocol controller is Proceed to the next substep.


Protocol Started

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 175


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 12 Automatic Switching Failures

2. Check the protection group switching state on the U2000.

If... Then...

The switching state of the protection group Clear protection switching. Check whether
is Lockout or Forced Switching services recover. If the services do not
recover, check whether the fault is due to
other causes.

The switching state of the protection group Check whether the fault is due to other
is Normal causes.

Step 4 Cause 4: The power supply of the device is faulty. As a result, the protection switching fails.
1. Refer to 5.5 Service Interruptions Caused by Power Supply Failures to rectify the fault.

Step 5 Cause 5: The hardware is faulty. As a result, the protection switching fails.
1. See the Parts Replacement to replace the faulty board.

----End

Related Information
None

12.19 Transoceanic MSP Ring


This section describes the handling methods for services with transoceanic MSP ring protection
that fail to automatically switch to the protection channel when the working channel works
abnormally.

Fault Description
When the network operates normally, the working channel has generated the conditions of
triggering the protection switching, but the services cannot be automatically switched to the
protection channel. In this case, the services are interrupted.

Possible Causes
l Cause 1: The protocol fails to start and the protection switching fails.
l Cause 2: The protection switching protocol is abnormal. The protocol status is not idle.
l Cause 3: The protocol is stopped manually, the forced switching is performed manually.
As a result, the protection switching fails.
l Cause 4: The power supply of the device is faulty.
l Cause 5: The hardware is faulty.

Tools, Equipment, and Materials


U2000

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 176


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 12 Automatic Switching Failures

Procedure
Step 1 Cause 1: The protocol fails to start and the protection switching fails.
1. In the NE Explorer, click the desired NE and choose Configuration > Ring MS from the
Function Tree.
2. In Ring MS interface, click Stop Protocol. Click Close in the Operation Result dialog
box.
3. Click Start Protocol to re-enable the protocol. Click Close in the Operation Result dialog
box.
4. Check whether services recover. If the services do not recover, proceed to the next substep.
5. Check whether the hardware is faulty. If the hardware is faulty, go to cause 5.
Step 2 Cause 2: The protection switching protocol is abnormal. The protocol status is not idle.
1. Check the settings of the transoceanic MSP ring parameters on the U2000.
If... Then...

The settings of the transoceanic MSP ring Set the parameters of the multiplex section
parameters are incorrect to proper values.
For the details about the parameters, refer
to Feature Description.
Check whether services recover.
If the services do not recover, check
whether the fault is due to other causes.

The configuration of the transoceanic MSP Proceed to the next substep.


ring parameters is correct

NOTE

Transoceanic protocol must be configured on all NEs.


2. The optional switching conditions are not configured.
If... Then...

The optional switching conditions are not Configure B2_SD as an optional switching
configured. conditions. For the details, see the Feature
Description.

The working channel is configured with proceed to the next substep.


optional switching conditions.

3. The protection channel works abnormally.


If... Then...

Check whether the following alarms exist For the method of handling the alarms, see
on the relevant boards: the Alarms and Performance Events
R_LOS, R_L0F, R_LOC, MS_AIS, Reference.
B2_EXC, B2_SD

No alarms, proceed to the next substep.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 177


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 12 Automatic Switching Failures

4. Re-enable the protocol, check whether services recover. If the services do not recover,
proceed to the next substep.
5. Check whether the hardware is faulty. If the hardware is faulty, go to cause 5.

Step 3 Cause 3: The protocol is stopped manually, the forced switching is performed manually. As a
result, the protection switching fails.
1. Check the protocol controller state on the U2000.

If... Then...

The state of the protocol controller is Restart the protocol controller. Check
Protocol Stopped whether services recover. If the services do
not recover, check whether the fault is due
to other causes.

The state of the protocol controller is Proceed to the next substep.


Protocol Started

2. Check the protection group switching state on the U2000.

If... Then...

The switching state of the protection group Clear protection switching. Check whether
is Lockout or Forced Switching services recover. If the services do not
recover, check whether the fault is due to
other causes.

The switching state of the protection group Check whether the fault is due to other
is Normal causes.

Step 4 Cause 4: The power supply of the device is faulty. As a result, the protection switching fails.
1. Refer to 5.5 Service Interruptions Caused by Power Supply Failures to rectify the fault.

Step 5 Cause 5: The hardware is faulty. As a result, the protection switching fails.
1. See the Parts Replacement to replace the faulty board.

----End

Related Information
None

12.20 SNCP
This section describes the handling methods for services with SNCP protection that fail to
automatically switch to the protection channel when the working channel works abnormally.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 178


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 12 Automatic Switching Failures

Fault Description
When the network operates normally, the working channel generates the triggering conditions
for protection switching, but the services cannot be automatically switched to the protection
channel. In this case, the services are interrupted.

Possible Causes
l Cause 1: The configuration of the protection group parameters is incorrect.
l Cause 2: The protection channel works abnormally.
l Cause 3: Locked switching or forced switching is not released.
l Cause 4: The fiber connections on the board are incorrect.
l Cause 5: The hardware is faulty.

Tools, Equipment and Materials


U2000

Procedure
Step 1 Cause 1: The configuration of the protection group parameters is incorrect. As a result, the
protection switching fails.
1. Check the configuration of the protection group parameters on the U2000.

If... Then...

The configuration of the protection group See the Feature Description to re-
parameters is incorrect configure the protection group parameters.
Check whether services recover. If the
services do not recover, check whether the
fault is due to other causes.

The configuration of the protection group proceed to the next substep.


parameters is correct

2. The optional switching conditions are not configured.

If... Then...

The optional switching conditions are not Configure HP_UNEQ, B3_SD, HP_TIM
configured as the optional switching conditions. For
the details, see the Feature Description.

The optional switching conditions are Check whether the fault is due to other
configured causes.

Step 2 Cause 2: The protection channel works abnormally.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 179


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 12 Automatic Switching Failures

If... Then...

Check whether the following alarms exist on For the method of handling the alarms, see the
the relevant boards: Alarms and Performance Events Reference.
R_LOS, R_L0F, R_LOC, MS_AIS,
B2_EXC, B2_SD

No alarms Check whether the fault is due to other causes.

Step 3 Cause 3: Locked switching or forced switching is not released.


1. Choose the NE in NE Explorer.
2. Choose Configuration > SNCP Service Control from the Function Tree.
3. In the SNCP Service Control interface, click Query. Click Close in the Operation
Result dialog box.
4. Choose the protection group, and then click Function. Choose Clear from the shortcut
menu, to clear the locked switching or forced switching configuration.

Step 4 Cause 4: The fiber connections on the board are incorrect. As a result, the protection switching
fails.
1. See the protection principles to check whether the fiber connections at the faulty point are
correct. For details of the protection principles, see the Production Description.

If... Then...

The fiber connections are incorrect Reconnect the fibers. Check whether
services recover. If the services do not
recover, check whether the fault is due to
other causes.

The fiber connections are correct Check whether the fault is due to other
causes.

Step 5 Cause 5: The hardware is faulty. As a result, the protection switching fails.
1. See the Parts Replacement to replace the faulty board.
2. Check whether services recover. If the services do not recover, check whether the fault is
due to other causes.

----End

Related Information
None

12.21 SNCTP
This section describes the handling methods for services with tributary SNCTP protection that
fail to automatically switch to the protection channel when the working channel works
abnormally.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 180


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 12 Automatic Switching Failures

Fault Description
When the network operates normally, the working channel generates the triggering conditions
for protection switching, but the services cannot be automatically switched to the protection
channel. In this case, the services are interrupted.

Possible Causes
l Cause 1: The configuration of the protection group parameters is incorrect.
l Cause 2: The protection channel works abnormally.
l Cause 3: Locked switching or forced switching is not released.
l Cause 4: The fiber connections on the board are incorrect.
l Cause 5: The hardware is faulty.

Tools, Equipment, and Materials


U2000

Procedure
Step 1 Cause 1: The configuration of the protection group parameters is incorrect. As a result, the
protection switching fails.
1. Check the configuration of the protection group parameters on the U2000.

If... Then...

The configuration of the protection group See the Feature Description to re-
parameters is incorrect configure the protection group parameters.
Check whether services recover. If the
services do not recover, check whether the
fault is due to other causes.

The configuration of the protection group proceed to the next substep.


parameters is correct

2. The optional switching conditions are not configured.

If... Then...

The optional switching conditions are not Configure HP_UNEQ, B3_SD, HP_TIM
configured as the optional switching conditions. For
the details, see the Feature Description.

The optional switching conditions are Check whether the fault is due to other
configured causes.

Step 2 Cause 2: The protection channel works abnormally.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 181


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 12 Automatic Switching Failures

If... Then...

Check whether the following alarms exist on For the method of handling the alarms, see the
the relevant boards: Alarms and Performance Events Reference.
R_LOS, R_L0F, R_LOC, MS_AIS,
B2_EXC, B2_SD

No alarms Check whether the fault is due to other causes.

Step 3 Cause 3: Locked switching or forced switching is not released.


1. Choose the NE in NE Explorer.
2. Choose Configuration > SNCTP from the Function Tree.
3. In the SNCTP interface, click Query Switching Status.
4. Right-click the protection group, and then choose Clear from the shortcut menu, to clear
the locked switching or forced switching configuration.

Step 4 Cause 4: The fiber connections on the board are incorrect. As a result, the protection switching
fails.
1. See the protection principles to check whether the fiber connections at the faulty point are
correct. For details of the protection principles, see the Production Description.

If... Then...

The fiber connections are incorrect Reconnect the fibers. Check whether
services recover. If the services do not
recover, check whether the fault is due to
other causes.

The fiber connections are correct Check whether the fault is due to other
causes.

Step 5 Cause 5: The hardware is faulty. As a result, the protection switching fails.
1. See the Parts Replacement to replace the faulty board.
2. Check whether services recover. If the services do not recover, check whether the fault is
due to other causes.

----End

12.22 Protection Cases


This section lists cases related to Protection.

NOTE

The following cases are related to the OptiX WDM product series.

Related Cases:
l MC-A5 The LQG Board Reports the ALM_DATA_RLOS and ALM_DATA_TLOS
Alarms Transiently

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 182


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 12 Automatic Switching Failures

l MC-A23 When the OTU Board Accesses Light, the Laser at Output End is Disabled
l MC-A25 The System Indicates that the Channel Number Is Illegal
l MC-A38 The Service Is Interrupted After the Protection Is Triggered
l MC-A41 Unsuccessful 1: N Protection Subnet Search
l MC-A91 Creation of Intra-Board Wavelength Protection on the LDG Fails
l MC-A94 The WXCP Protection Configured for the LOG Board Is Invalid
l MC-A98 The Protection Switching Times Out Severely
l MC-A127 WXCP Protection Cannot Be Implemented Due To Wrong Configuration of the
LQG Boards
l MC-A133 In a WDM System with 1:N Optical Channel Protection, the R_LOF Alarm Is
Reported in the Receive Direction on the Client Side When the Laser on the LWC1 Board
in the Protection Channel Is Open
l MC-A139 Automatic Switching Is Caused by the Incorrect Service Configuration of ODUk
SPRing Protection
l MC-A150 How Signals Are Selected When the DCP Board Is Used
l MC-A151 OWSP Protection Switching Fails Due to Incorrect Connection of Fiber Jumpers
On The DCP Boards, DCP Boards Report The MUT_LOS Alarms And TDG Board Reports
The OPU1_PLM Alarm
l MC-A155 Using the LDMD Board to Configure Intra-Board 1+1 Protection Fails Due to
the System Limit and the System Displays "The optical port number exceeds the limit"
l MC-A157 An OptiX OSN 6800 Network Cannot Be Reverted Back to the Working
Channel After Being Restored from an ODUk SNCP Protection Switching, The Board
Reports The ODU_SNCP_PS Alarm
l MC-A159 Services Are Interrupted After the Protection Hold-Off Time Is Set to 10s During
Configuration of Client-Side 1+1 Protection on the OptiX OSN 6800, The System Reports
The R_LOS Alarm
l MC-A161 The NS2 and TDG Boards Report the BUS_ERR Alarms at the Same Time After
ODUk SNCP Protection Is Configured Because The SCC Board Is Faulty
l MC-A171 How to Configure Two TMX Boards in a Client-Side 1+1 Protection Group
Through an SCS Board
l MC-A176 Loss of the Data About an Inter-Subrack Wavelength Protection Group Results
from a Change of an NE ID
l MC-A177 What Is the Difference Between the EOLP Board and the OLP Board
l MC-A220 SNCP Protection Switching Fails Due to Incorrect Fiber Connections
l MC-A228 Intra-Board 1+1 Protection Switching Fails on the OptiX OSN 1800 After a
Change of the NE ID
l MC-A229 Switching Fails Due to Incorrect SNCP Protection Mode on the WDM
Equipment
l MC-A232 The ODUk SNCP Protection Is Abnormal due to Incorrect Fiber Connections
l MC-A233 The LOM Board Fails to Carry Two FC400 Services due to Port Specification
Limitations When Inter-Subrack 1+1 Protection or Client 1+1 Protection Is Configured
l MC-A252 Switching of ODUk SNCP Protection Fails

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 183


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 13 Automatic Reverting Failures

13 Automatic Reverting Failures

About This Chapter

This topic describes how to resolve switchover issues when services fail to automatically switch
back to the working channel even after the working channel has resumed normal operation.

For information on the protection schemes supported by the OptiX OSN 8800, OptiX OSN 6800,
and OptiX OSN 3800, see the Feature Description"Feature Overview".

13.1 Optical Line Protection


This topic describes how to resolve switchover issues when optical line protection fails to
automatically switch services back to the working channel even after the working channel has
resumed normal operations.

13.2 Intra-Board 1+1 Protection


This topic describes how to resolve switchover issues when intra-board 1+1 protection fails to
automatically switch services back to the working channel even after the working channel has
resumed normal operations.

13.3 Client-side 1+1 Protection


This topic describes how to resolve switchover issues when client 1+1 protection fails to
automatically switch services back to the working channel even after the working channel has
resumed normal operations.

13.4 SW SNCP Protection


This section describes the handling methods for services with SW SNCP protection that cannot
automatically switch back to the working channel even after the working channel resumes
normal operation and the revertive mode is properly configured.

13.5 ODUk SNCP Protection


This topic describes how to resolve switchover issues when ODUk SNCP protection fails to
automatically switch services back to the working channel even after the working channel has
resumed normal operations.

13.6 VLAN SNCP Protection


This section describes the handling methods for services with VLAN SNCP protection that
cannot automatically switch back to the working channel even after the working channel resumes
normal operation and the revertive mode is properly configured.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 184


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 13 Automatic Reverting Failures

13.7 Tributary SNCP Protection


This topic describes how to resolve switchover issues when tributary SNCP protection fails to
automatically switch services back to the working channel even after the working channel has
resumed normal operations.
13.8 Ethernet Ring Protection
This section describes the handling methods in the case that services cannot be switched back
to the working channel after the working channel resumes normal operation even though the
revertive mode is configured.
13.9 MS SNCP Protection
This section describes the handling methods for services with MS SNCP protection that cannot
automatically switch back to the working channel even after the working channel resumes
normal operation and the revertive mode is properly configured.
13.10 DLAG Protection
This section describes the handling methods for services with DLAG protection that cannot
automatically switch back to the working channel even after the working channel resumes
normal operation and the revertive mode is properly configured.
13.11 ODUk SPRing Protection
This section describes the handling methods for services with ODUk SPRing protection that
cannot automatically switch back to the working channel even after the working channel resumes
normal operation and the revertive mode is properly configured.
13.12 Optical Wavelength Shared Protection
This section describes the handling methods for services with optical wavelength shared
protection that cannot automatically switch back to the working channel even after the working
channel resumes normal operation and the revertive mode is properly configured.
13.13 Linear Multiplex Section Protection
This section describes the handling methods for services with linear multiplex section protection
that cannot automatically switch back to the working channel even after the working channel
resumes normal operation and the revertive mode is properly configured.
13.14 MSP Ring
This section describes the handling methods for services with MSP ring protection that cannot
automatically switch back to the working channel even after the working channel resumes
normal operation and the revertive mode is properly configured.
13.15 Transoceanic Ring MS Protection
This section describes the handling methods for services with transoceanic ring MS protection
that cannot automatically switch back to the working channel even after the working channel
resumes normal operation and the revertive mode is properly configured.
13.16 SNCP
This section describes the handling methods for services with SNCP protection that cannot
automatically switch back to the working channel even after the working channel resumes
normal operation and the revertive mode is properly configured.
13.17 SNCTP
This section describes the handling methods for services with SNCTP protection that cannot
automatically switch back to the working channel even after the working channel resumes
normal operation and the revertive mode is properly configured.
13.18 Protection Cases

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 185


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 13 Automatic Reverting Failures

This section lists cases related to Protection.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 186


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 13 Automatic Reverting Failures

13.1 Optical Line Protection


This topic describes how to resolve switchover issues when optical line protection fails to
automatically switch services back to the working channel even after the working channel has
resumed normal operations.

Symptom
In normal operation of a network, services are switched to the protection channel after the
working channel generates the protection triggering conditions. When the working channel
resumes normal operation, the services cannot be automatically switched back to the working
channel.

The OLP board may be reporting an OLP_PS or OLP_STA_INDI alarm.

Possible Causes
l Cause 1: The value of the Revertive Mode parameter of the protection group has been set
to Non-Revertive
l Cause 2: The working channel works abnormally or failure has not been resolved.
l Cause 3: Locked or forced switching commands have been manually issued.
l Cause 4: When the working channel is restored, service are restored to the working channel
after the time specified by WTR Time elapses.

Troubleshooting Workflow
The following figure illustrates the optical line protection automatic reverting failure
troubleshooting process.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 187


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 13 Automatic Reverting Failures

Figure 13-1 Optical line protection automatic reverting failure troubleshooting flowchart

Start

Is Are the
No Set Yes
Revertive Mode set to services switched to
Revertive Mode
Revertive? the working
to Revertive. channel ?

Yes No

Are the
No Yes
Does the working Handle the faults on the services switched to
channel work normally? working channel. the working
channel ?

Yes No

Yes Are the Yes


Is a locked or forced Clear the locked, or
services switched to
switching command forced switching
manually issued? the working
command.
channel?

No No

Are the
No Wait for a period Yes
Does the time exceed services switched to
longer than the working
WTR Time? WTR Time.
channel?

Yes No

Contact Huawei
End
engineers.

Tools, Equipment, and Materials


U2000

Procedure
l Cause 1: The value of the Revertive Mode parameter of the protection group has been set
to Non-Revertive.
1. In the NE Explorer, click the NE and choose Configuration > Port Protection from
the Function Tree.
2. In the Port Protection window, select one protection group and click Modify to
change Revertive Mode to Revertive.
l Cause 2: The working channel is working abnormally.
1. In the Port Protection window, click Query to query the Working Channel
Status parameter of the protection group.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 188


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 13 Automatic Reverting Failures

2. If Working Channel Status is not Normal, check whether MUT_LOS and


POWER_DIFF_OVER alarms are being reported on the corresponding boards on the
working channel, clear the alarms.
l Cause 3: Locked or forced switching commands have been manually issued.
1. In the Port Protection interface, click Query to check the Switching Status
parameter of the protection group.
2. If Switching Status is displayed as Lock or Force to Protection Channel, right-click
the protection group, and then choose Clear from the shortcut menu to clear the
switching configuration.
l Cause 4: When the working channel is restored, services are restored to the working channel
after the time specified by WTR Time elapses.
1. In the Port Protection window, click Query to check the WTR Time.
2. If service fails to be restored to the working channel after the time specified by WTR
Time elapses, check whether the working channel is normal.

----End

Solution
If the preceding measures are not successful in switching the services to the working channel,
contact Huawei engineers to resolve the problem.

13.2 Intra-Board 1+1 Protection


This topic describes how to resolve switchover issues when intra-board 1+1 protection fails to
automatically switch services back to the working channel even after the working channel has
resumed normal operations.

Symptom
In normal operation of a network, services are switched to the protection channel after the
working channel generates the protection triggering conditions. When the working channel
resumes normal operation, the services cannot be automatically switched back to the working
channel.

When the intra-board 1+1 protection is achieved using the OLP board, the OLP board reports
an INTRA_OTU_PS or INTRA_OTU_STA_INDI alarm.

When the intra-board 1+1 protection is realized by dual fed and selective receiving OTU, the
OTU board may report the INTRA_OTU_PS alarm or INTRA_OTU_STA_INDI alarm.

Possible Causes
l Cause 1: The value of the Revertive Mode parameter of the protection group has been set
to Non-Revertive. In the case of the intra-board 1+1 protection configured using an OLP
or DCP board, when the Detection Channel is configured, the Revertive Mode parameter
must be set to Non-Revertive. This is not considered as one of the possible causes when
the protection cannot be restored automatically.
l Cause 2: The working channel works abnormally or failure has not been resolved.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 189


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 13 Automatic Reverting Failures

l Cause 3: Locked, or forced switching commands have been manually issued.


l Cause 4: When the working channel is restored, service are restored to the working channel
after the time specified by WTR Time elapses.

Troubleshooting Workflow
The following figure illustrates the intra-board 1+1 protection automatic reverting failure
troubleshooting process.

Figure 13-2 Intra-board 1+1 protection automatic reverting failure troubleshooting flowchart

Start

Is Are the
No Set Yes
Revertive Mode set to services switched to
Revertive Mode
Revertive? the working
to Revertive. channel ?

Yes No

Are the
No Yes
Does the working Handle the faults on the services switched to
channel work normally? working channel. the working
channel ?

Yes No

Yes Are the Yes


Is a locked or forced Clear the locked, or
services switched to
switching command forced switching
manually issued? the working
command.
channel?

No No

Are the
No Wait for a period Yes
Does the time exceed services switched to
longer than the working
WTR Time? WTR Time.
channel?

Yes No

Contact Huawei
End
engineers.

Tools, Equipment, and Materials


U2000

Procedure
l Cause 1: The value of the Revertive Mode parameter of the protection group has been set
to Non-Revertive.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 190


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 13 Automatic Reverting Failures

1. In the NE Explorer, click the NE and choose Configuration > Port Protection from
the Function Tree.
2. In the Port Protection window, select one protection group and click Modify to
change Revertive Mode to Revertive.
NOTE

In the case of intra-board 1+1 protection configured using an OLP or DCP board, when the
Monitoring Channel is configured, Revertive Mode can only be set to Non-Revertive.
l Cause 2: The working channel works abnormally or failure has not been resolved.
1. In the Port Protection interface, click Query to query the Working Channel
Status parameter of the protection group.
2. If Working Channel Status is not Normal, check whether R_LOS, R_LOF, R_LOC,
OTUk_LOF, OTUk_LOM, OTUk_AIS, ODUk_PM_AIS, ODUk_PM_OCI,
ODUk_PM_LCK, ODUk_LOFLOM, B1_EXC, ODUk_PM_DEG,
ODUk_PM_EXC, POWER_DIFF_OVER, IN_PWR_HIGH and IN_PWR_LOW
alarms are being reported on the corresponding boards on the working channel, clear
the alarms.
l Cause 3: Forced switching commands have been manually issued.
1. In the Port Protection interface, click Query to check the Switching Status
parameter of the protection group.
2. If Switching Status is displayed as Lock or Force to Protection Channel, right-click
the protection group, and then choose Clear from the shortcut menu to clear the
switching configuration.
l Cause 4: When the working channel is restored, services are restored to the working channel
after the time specified by WTR Time elapses.
1. In the Port Protection window, click Query to check the WTR Time.
2. If service fails to be restored to the working channel after the time specified by WTR
Time elapses, check whether the working channel is normal.

----End

Solution
If the preceding measures are not successful in switching the services to the working channel,
contact Huawei engineers to resolve the problem.

13.3 Client-side 1+1 Protection


This topic describes how to resolve switchover issues when client 1+1 protection fails to
automatically switch services back to the working channel even after the working channel has
resumed normal operations.

Symptom
In normal operation of a network, services are switched to the protection channel after the
working channel generates the protection triggering conditions. When the working channel
resumes normal operation, the services cannot be automatically switched back to the working
channel.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 191


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 13 Automatic Reverting Failures

l The DCP or OLP board may be reporting a CLIENT_PORT_PS or


CLIENT_PORT_STA_INDI alarm.
l When you use the SCS board to configure the protection, the OTU or tributary board may
report the CLIENT_PORT_PS alarm or CLIENT_PORT_STA_INDI alarm.

Possible Causes
l The value of the Revertive Mode parameter of the protection group has been set to Non-
Revertive.
l The working channel works abnormally or failure has not been resolved.
l Locked or forced switching commands have been manually issued.
l Cause 4: When the working channel is restored, service are restored to the working channel
after the time specified by WTR Time elapses.

Troubleshooting Workflow
The following figure illustrates the client 1+1 protection automatic reverting failure
troubleshooting process.

Figure 13-3 Client 1+1 protection automatic reverting failure troubleshooting flowchart

Start

Is Are the
No Set Yes
Revertive Mode set to services switched to
Revertive Mode
Revertive? the working
to Revertive. channel ?

Yes No

Are the
No Yes
Does the working Handle the faults on the services switched to
channel work normally? working channel. the working
channel ?

Yes No

Yes Are the Yes


Is a locked or forced Clear the locked, or
services switched to
switching command forced switching
manually issued? the working
command.
channel?

No No

Are the
No Wait for a period Yes
Does the time exceed services switched to
longer than the working
WTR Time? WTR Time.
channel?

Yes No

Contact Huawei
End
engineers.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 192


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 13 Automatic Reverting Failures

Tools, Equipment, and Materials


U2000

Procedure
l Cause 1: The value of the Revertive Mode parameter of the protection group has been set
to Non-Revertive.
1. In the NE Explorer, click the NE and choose Configuration > Port Protection from
the Function Tree.
2. In the Port Protection window, select one protection group and click Modify to
change Revertive Mode to Revertive.
l Cause 2: The working channel works abnormally or failure has not been resolved.
1. In the Port Protection window, click Query to query the Working Channel
Status parameter of the protection group.
2. If Working Channel Status is not Normal, check whether the alarms listed in the
trigger conditions are being reported on the relevant boards on the working channel,
clear the alarms.
l Cause 3: Locked or forced switching commands have been manually issued.
1. In the Port Protection interface, click Query to check the Switching Status
parameter of the protection group.
2. If Switching Status is displayed as Lock or Force to Protection Channel, right-click
the protection group, and then choose Clear from the shortcut menu to clear the
switching configuration.
l Cause 4: When the working channel is restored, services are restored to the working channel
after the time specified by WTR Time elapses.
1. In the Port Protection window, click Query to check the WTR Time.
2. If service fails to be restored to the working channel after the time specified by WTR
Time elapses, check whether the working channel is normal.
----End

Solution
If the preceding measures are not successful in switching the services to the working channel,
contact Huawei engineers to resolve the problem.

13.4 SW SNCP Protection


This section describes the handling methods for services with SW SNCP protection that cannot
automatically switch back to the working channel even after the working channel resumes
normal operation and the revertive mode is properly configured.

Fault Description
In normal operation of a network, services are switched to the protection channel after the
working channel generates the protection triggering conditions. When the working channel

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 193


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 13 Automatic Reverting Failures

resumes normal operation, the services cannot be automatically switched back to the working
channel.
OTU board may report SW_SNCP_PS alarm or SW_SNCP_STA_INDI alarm.

Possible Causes
The possible causes of the automatic reverting failure are as follows:
l The value of the Revertive Mode parameter of the protection group has been set to Non-
Revertive.
l The working channel works abnormally or failure has not been resolved.
l Locked, or forced switching is not released.
l Cause 4: When the working channel is restored, service are restored to the working channel
after the time specified by WTR Time elapses.

Fault Handling Flow

Figure 13-4 Fault handling flow when automatic switching back to the working channel by SW
SNCP protection fails

Start

Is Are the
No Set Yes
Revertive Mode set to services switched to
Revertive Mode
Revertive? the working
to Revertive. channel ?

Yes No

Are the
No Yes
Does the working Handle the faults on the services switched to
channel work normally? working channel. the working
channel ?

Yes No

Yes Are the Yes


Is a locked or forced Clear the locked, or
services switched to
switching command forced switching
manually issued? the working
command.
channel?

No No

Are the
No Wait for a period Yes
Does the time exceed services switched to
longer than the working
WTR Time? WTR Time.
channel?

Yes No

Contact Huawei
End
engineers.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 194


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 13 Automatic Reverting Failures

Tools, Equipment and Materials


U2000

Procedure
l Cause 1: The value of the Revertive Mode parameter of the protection group has been set
to Non-Revertive.
1. Select the NE in the NE Explorer, and choose Configuration > WDM Service
Management from the Function Tree. Click the SNCP Service Control tab.
2. On the SNCP Service Control interface, select the desired protection group. Double-
click the Revertive Mode field, and then change Non-Revertive to Revertive.
l Cause 2: The working channel works abnormally or failure has not been resolved.
1. In the SNCP Service Control interface, click Query to query the Channel Status
parameter of the working cross-connection.
2. If Channel Status is not displayed as Normal, check whether the following alarms
of trigger conditions exist on the relevant boards on the working channel. For the
method of handling the alarms, see the Alarms and Performance Events Reference.
l Cause 3: Locked, or forced switching is not released.
1. In the SNCP Service Control interface, click Query to check the Status parameter
of the protection group.
2. If Status is displayed as Lockout or Forced (from working to protection) switching
state, choose the protection group. Click Function and then choose Clear from the
shortcut menu to clear the switching configuration.
l Cause 4: When the working channel is restored, services are restored to the working channel
after the time specified by WTR Time elapses.
1. In the SNCP Service Control interface, click Query to check the WTR Time
(mm:ss).

----End

Solution
If the preceding measures are not successful in switching the services to the working channel,
contact Huawei engineers to resolve the problem.

13.5 ODUk SNCP Protection


This topic describes how to resolve switchover issues when ODUk SNCP protection fails to
automatically switch services back to the working channel even after the working channel has
resumed normal operations.

Symptom
In normal operation of a network, services are switched to the protection channel after the
working channel generates the protection triggering conditions. When the working channel
resumes normal operation, the services cannot be automatically switched back to the working
channel.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 195


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 13 Automatic Reverting Failures

Tributary, line or PID board may report an ODU_SNCP_PS alarm or ODU_SNCP_STA_INDI


alarm.

Possible Causes
l Cause 1: The SNCP Type is inappropriate for the service scenario.
l Cause 2: The value of the Revertive Mode parameter of the protection group has been set
to Non-Revertive.
l Cause 3: The working channel works abnormally or failure has not been resolved.
l Cause 4: Locked or forced switching commands have been manually issued.
l Cause 4: When the working channel is restored, service are restored to the working channel
after the time specified by WTR Time elapses.

Troubleshooting Workflow
The following figure illustrates the ODUk SNCP protection automatic reverting failure
troubleshooting process.

Figure 13-5 ODUk SNCP protection automatic reverting failure troubleshooting flowchart

Start

Is the SNCP Type No Change the SNCP Are the Yes


consistent with that required in services switched to
Type of the protection
the service scenario? the working
group. channel?

No
Yes

Is No Are the Yes


Set
Revertive Mode set to services switched to
Revertive Mode the working
Revertive? to Revertive. channel?

No
Yes

Does Yes Are the Yes


Handle the faults on the services switched to
the working channel work
working channel. the working
abnormally? channel?

No
No

Is a forced, Clear the Are the


Yes Yes
or locked switching command forced, or locked services switched to
manually issued? switching command. the working
channel?

No
No

Wait for a moment Are the


No services switched to Yes
Does the time exceed longer than
WTR time? the working
WTR Time. channel?

Yes No

Contact Huawei
engineers. End

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 196


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 13 Automatic Reverting Failures

Tools, Equipment, and Materials


U2000

Procedure
l Cause 1: The SNCP Type is inappropriate for the service scenario.
1. In the NE Explorer, click the NE and choose Configuration > WDM Service
Management from the Function Tree. Then click the SNCP Service Control tab.
2. In the SNCP Service Control window, click Query to query the Protection Type of
the protection group. In the Operation Result dialog box, click Close.
3. Refer to "Dependency and Limitation" and "Parameters" to check whether the
Protection Type of the protection group is appropriate for the service scenario.
l Cause 2: The value of the Revertive Mode parameter of the protection group has been set
to Non-Revertive.
1. In the SNCP Service Control window, select the desired protection group. Double-
click the Revertive Mode field, and then change Non-Revertive to Revertive.
l Cause 3: The working channel works abnormally or failure has not been resolved.
1. In the SNCP Service Control window, click Query to query the Channel Status
parameter of the working cross-connection.
2. If Channel Status is not Normal, check whether the alarms listed in Trigger
Conditions are being reported on the corresponding boards on the working channel,
clear the alarms.
l Cause 4: Locked or forced switching commands have been manually issued.
1. In the SNCP Service Control interface, click Query to check the Status parameter
of the protection group.
2. If Status is displayed as Lockout or Forced (from working to protection) switching
state, choose the protection group. Click Function and then choose Clear from the
shortcut menu to clear the switching configuration.
l Cause 5: When the working channel is restored, services are restored to the working channel
after the time specified by WTR Time elapses.
1. In the SNCP Service Control window, click Query to check the WTR Time.

----End

Solution
If the preceding measures are not successful in switching the services to the working channel,
contact Huawei engineers to resolve the problem.

13.6 VLAN SNCP Protection


This section describes the handling methods for services with VLAN SNCP protection that
cannot automatically switch back to the working channel even after the working channel resumes
normal operation and the revertive mode is properly configured.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 197


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 13 Automatic Reverting Failures

Fault Description
In normal operation of a network, services are switched to the protection channel after the
working channel generates the protection triggering conditions. When the working channel
resumes normal operation, the services cannot be automatically switched back to the working
channel.

L4G, TBE, LEM24, LEX4 board of OptiX OSN 6800/OptiX OSN 3800 may report
VLAN_SNCP_PS.

LEM24, LEX4 board of OptiX OSN 8800 may report VLAN_SNCP_PS.

Possible Causes
The possible causes of the automatic reverting failure are as follows:
l The value of the Revertive Mode parameter of the protection group has been set to Non-
Revertive.
l The working channel works abnormally or failure has not been resolved.
l Manual, locked or forced switching is not released.
l Cause 4: When the working channel is restored, service are restored to the working channel
after the time specified by WTR Time elapses.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 198


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 13 Automatic Reverting Failures

Fault Handling Flow

Figure 13-6 Fault handling flow when automatic switching back to the working channel by
VLAN SNCP protection fails

Start

Is Are the
No Set Yes
Revertive Mode set to services switched to
Revertive Mode
Revertive? the working
to Revertive. channel ?

Yes No

Are the
No Yes
Does the working Handle the faults on the services switched to
channel work normally? working channel. the working
channel ?

Yes No

Yes Are the Yes


Is a locked or forced Clear the locked, or
services switched to
switching command forced switching
manually issued? the working
command.
channel?

No No

Are the
No Wait for a period Yes
Does the time exceed services switched to
longer than the working
WTR Time? WTR Time.
channel?

Yes No

Contact Huawei
End
engineers.

Tools, Equipment and Materials


U2000

Procedure
l Cause 1: The value of the Revertive Mode parameter of the protection group has been set
to Non-Revertive.
1. Select the desired Ethernet board in the NE Explorer, and choose Configuration >
Ethernet Service > Ethernet Line Service from the Function Tree. Click the VLAN
SNCP Service Management tab.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 199


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 13 Automatic Reverting Failures

2. On the VLAN SNCP Service Management interface, select the desired protection
group. Double-click the Revertive Mode field, and then change Non-Revertive to
Revertive.
l Cause 2: The working channel works abnormally or failure has not been resolved.
1. In the VLAN SNCP Service Management interface, click Query to check the Link
Status parameter of the working service.
2. If Link Status is not displayed as Normal, check whether the following alarms exist
on the relevant boards on the working channel: R_LOF, R_LOS, R_OOF,
OTUk_LOF, OTUk_LOM, OTUk_AIS. For the method of handling the alarms, see
the Alarms and Performance Events Reference.
l Cause 3: Manual, locked or forced switching is not released.
1. In the VLAN SNCP Service Management interface, click Set/Query Switching and
select Query Switching Status to check the Current Status parameter of the
protection group.
2. If Current Status is displayed as Manual (Active to Standby) Switching State,
Forced (Active to Standby) Switching State or Lockout State, click Set/Query
Switching. Then choose Clear from the shortcut menu, to clear the manual, locked
or forced switching configuration.
l Cause 4: When the working channel is restored, services are restored to the working channel
after the time specified by WTR Time elapses.
1. In the VLAN SNCP Service Management interface, click Query to check the WTR
Time (s).

----End

Solution
If the preceding measures are not successful in switching the services to the working channel,
contact Huawei engineers to resolve the problem.

13.7 Tributary SNCP Protection


This topic describes how to resolve switchover issues when tributary SNCP protection fails to
automatically switch services back to the working channel even after the working channel has
resumed normal operations.

Symptom
In normal operation of a network, services are switched to the protection channel after the
working channel generates the protection triggering conditions. When the working channel
resumes normal operation, the services cannot be automatically switched back to the working
channel.

Tributary, line may report an ODU_SNCP_PS alarm or ODU_SNCP_STA_INDI alarm.

Possible Causes
l Cause 1: The SNCP Type is inappropriate for the service scenario.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 200


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 13 Automatic Reverting Failures

l Cause 2: The value of the Revertive Mode parameter of the protection group has been set
to Non-Revertive.
l Cause 3: The working channel works abnormally or failure has not been resolved.
l Cause 4: Locked or forced switching commands have been manually issued.
l Cause 4: When the working channel is restored, service are restored to the working channel
after the time specified by WTR Time elapses.

Fault Handling Flow


The following figure illustrates the tributary SNCP protection automatic reverting failure
troubleshooting process.

Figure 13-7 Tributary SNCP protection automatic reverting failure troubleshooting flowchart

Start

Is
the SNCP Type No Are the Yes
Change the SNCP
consistent with that required services switched to
in the service Type of the protection
group. the working
scenario? channel?

No
Yes

Is No Are the Yes


Set services switched to
Revertive Mode set to Revertive Mode
the working
Revertive ? to Revertive.
channel?

No
Yes

Does Yes Are the Yes


Handle the faults on the services switched to
the working channel work
working channel. the working
abnormally? channel?

No
No

Is a forced Yes Are the Yes


Clear the
or locked switching command services switched to
forced, or locked the working
manually issued? switching command. channel?

No
No

Wait for a period Are the Yes


No
Does the time exceed longer than services switched to
WTR Time? the working
WTR Time. channel?

Yes No

Contact Huawei
engineers. End

Tools, Equipment, and Materials


U2000

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 201


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 13 Automatic Reverting Failures

Procedure
l Cause 1: The SNCP Type is inappropriate for the service scenario.
1. In the NE Explorer, click the NE and choose Configuration > WDM Service
Management from the Function Tree. Then click the SNCP Service Control tab.
2. In the SNCP Service Control window, click Query to query Protection Type of the
protection group. In the Operation Result dialog box, click Close.
3. Refer to "Dependency and Limitation" and "Parameters" to check whether Protection
Type of the protection group is appropriate for the service scenario.
l Cause 2: The value of the Revertive Mode parameter of the protection group has been set
to Non-Revertive.
1. In the SNCP Service Control window, select the desired protection group. Double-
click the Revertive Mode field, and then change Non-Revertive to Revertive.
l Cause 3: The working channel works abnormally or failure has not been resolved.
1. In the SNCP Service Control window, click Query to check the Channel Status
parameter of the working cross-connection.
2. If Channel Status is SD or SF, check whether the alarms listed in Trigger Conditions
are being reported on the corresponding boards on the working channel, clear the
alarms.
l Cause 4: Locked or forced switching commands have been manually issued.
1. In the SNCP Service Control interface, click Query to check the Status parameter
of the protection group.
2. If Status is displayed as Lockout or Forced (from working to protection) switching
state, choose the protection group. Click Function and then choose Clear from the
shortcut menu to clear the switching configuration.
l Cause 5: When the working channel is restored, services are restored to the working channel
after the time specified by WTR Time elapses.
1. In the SNCP Service Control window, click Query to check WTR Time.
----End

Solution
If the preceding measures are not successful in switching the services to the working channel,
contact Huawei engineers to resolve the problem.

13.8 Ethernet Ring Protection


This section describes the handling methods in the case that services cannot be switched back
to the working channel after the working channel resumes normal operation even though the
revertive mode is configured.

Fault Description
In the normal operation of the network, the services are switched to the protection channel after
the working channel generates the protection triggering conditions. When the working channel
resumes the normal operation, the services cannot be automatically switched back to the working
channel.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 202


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 13 Automatic Reverting Failures

Possible Causes
The possible causes of the automatic revertive failure are as follows:
l Cause 1: The working channel works abnormally.
l Cause 2: When the working channel is recovered, the working channel recovery time does
not exceed the specified WTR Time (min).
l Cause 3: The control VLAN channel on an ERPS-enabled Ethernet ring is faulty.

Tools, Equipment, and Materials


U2000

Procedure
l Cause 1: The working channel works abnormally.
1. If there are alarms on the working channel, clear the alarms. For the method of handling
the alarms, see Alarms and Events Handling.
l Cause 2: When the working channel is recovered, the working channel recovery time does
not exceed the specified WTR Time (min).
1. Select an Ethernet board in NE Explorer.
2. Choose Configuration > Ethernet Protection > ERPS Management.
3. In the ERPS Management interface, click Query to check WTR Time (min).
l Cause 3: The control VLAN channel on an ERPS-enabled Ethernet ring is faulty.
1. Check whether the Ethernet ring has a control VLAN channel. If no, create a control
VLAN channel for the Ethernet ring.

----End

Solution
If the preceding measures are not successful in switching the services to the working channel,
contact Huawei engineers to resolve the problem.

13.9 MS SNCP Protection


This section describes the handling methods for services with MS SNCP protection that cannot
automatically switch back to the working channel even after the working channel resumes
normal operation and the revertive mode is properly configured.

Fault Description
In normal operation of a network, services are switched to the protection channel after the
working channel generates the protection triggering conditions. When the working channel
resumes normal operation, the services cannot be automatically switched back to the working
channel.

SCC board may report MS_SNCP_PS alarm or MS_SNCP_STA_INDI alarm.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 203


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 13 Automatic Reverting Failures

Possible Causes
The possible causes of the automatic reverting failure are as follows:
l The value of the Revertive Mode parameter of the protection group has been set to Non-
Revertive.
l The working channel works abnormally or failure has not been resolved.
l Locked switching or forced switching is not released.
l Cause 4: When the working channel is restored, service are restored to the working channel
after the time specified by WTR Time elapses.

Fault Handling Flow


Services cannot be switched back to the working channel in the MS SNCP protection see Figure
13-8.

Figure 13-8 Fault handling flow when automatic switching back to the working channel by MS
SNCP protection fails

Start

No Are the Yes


Manually switch the services switched to
Is Revertive Mode
services to the the working
set to Revertive ?
protection channel. channel?

Yes No

Are the
No Yes
Is Working Channel Handle the faults on the services switched to
Status Normal? working channel. the working
channel?

Yes No

Is a forced Yes Clear the Are the Yes


or locked switching forced, or locked services switched to
command manually the working
issued? switching command.
channel?

No No

Are the
No Wait for a period Yes
Does the time exceed services switched to
longer than the working
WTR Time? WTR Time. channel?

Yes No

Contact Huawei
End
engineers.

Tools, Equipment and Materials


U2000

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 204


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 13 Automatic Reverting Failures

Procedure
l Cause 1: The value of the Revertive Mode parameter of the protection group has been set
to Non-Revertive.
1. Select the NE in the NE Explorer, and choose Configuration > WDM Service
Management from the Function Tree. Click SNCP Service Control tab.
2. On the SNCP Service Control interface, select the desired protection group. Double-
click the Revertive Mode field, and then change Non-Revertive to Revertive.
l Cause 2: The working channel works abnormally or failure has not been resolved.
1. In the SNCP Service Control interface, click Query to query the Channel Status
parameter of the working cross-connection.
2. If Channel Status is displayed as SD, SF check whether the following alarms exist
on the relevant boards: R_LOF, R_LOS, R_LOC, OTUk_LOF, OTUk_LOM,
OTUk_AIS, ODUk_PM_AIS, ODUk_PM_OCI, ODUk_PM_LCK,
ODUk_LOFLOM, REM_SF, B1_EXC, ODUk_PM_EXC, OTUk_EXC, REM_SD.
For the method of handling the alarms, see the Alarms and Performance Events
Reference.
l Cause 3: Locked switching or forced switching is not released.
1. In the SNCP Service Control interface, click Query to query the Status parameter
of the protection group.
2. If Status is displayed as Lockout or Forced (from working to protection) switching
state , click Function, and then choose Clear from the shortcut menu, to clear the
manual, locked or forced switching configuration.
l Cause 4: When the working channel is restored, services are restored to the working channel
after the time specified by WTR Time elapses.
1. In the SNCP Service Control interface, click Query to query the WTR Time
(mm:ss).

----End

Solution
If the preceding measures are not successful in switching the services to the working channel,
contact Huawei engineers to resolve the problem.

13.10 DLAG Protection


This section describes the handling methods for services with DLAG protection that cannot
automatically switch back to the working channel even after the working channel resumes
normal operation and the revertive mode is properly configured.

Fault Description
In normal operation of a network, services are switched to the protection channel after the
working channel generates the protection triggering conditions. When the working channel
resumes normal operation, the services cannot be automatically switched back to the working
channel.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 205


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 13 Automatic Reverting Failures

Possible Causes
The possible causes of the automatic reverting failure are as follows:
l The value of the Revertive Mode parameter of the protection group has been set to Non-
Revertive.
l The working channel works abnormally or failure has not been resolved.

Fault Handling Flow

Figure 13-9 Fault handling flow when automatic switching back to the working channel by
DLAG protection fails

Start

Is No Are the Yes


the protection Set Revertive services switched to
revertive? Mode to Revertive. the working
channel?

Yes No

No Are the Yes


Does the working services switched to
Handle the faults on the
channel work normally? the working
working channel.
channel?

Yes No

Contact Huawei
engineers. End

Tools, Equipment and Materials


U2000

Procedure
l Cause 1: The value of the Revertive Mode parameter of the protection group has been set
to Non-Revertive
1. Select the NE in the NE Explorer and then choose Configuration > Ethernet
Distributed Link Aggregation Management.
2. Click Query. Double-click Revertive Mode parameter and choose Revertive from
the drop-down list.
l Cause 2: The working channel works abnormally or failure has not been resolved.
1. Resolve the problems with the working channel. Determine if the following alarms
exist on the relevant boards: HARD_BAD, PORT_MODULE_OFFLINE,
LINK_ERR, R_LOS, R_LOF. For more information regarding how to clear the
alarms, see the Alarms and Performance Events Reference.

----End

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 206


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 13 Automatic Reverting Failures

Solution
If the preceding measures are not successful in switching the services to the working channel,
contact Huawei engineers to resolve the problem.

13.11 ODUk SPRing Protection


This section describes the handling methods for services with ODUk SPRing protection that
cannot automatically switch back to the working channel even after the working channel resumes
normal operation and the revertive mode is properly configured.

Fault Description
In normal operation of a network, services are switched to the protection channel after the
working channel generates the protection triggering conditions. When the working channel
resumes normal operation, the services cannot be automatically switched back to the working
channel.

The NEs at both ends of the channel where the switching occurs may report the ODUKSP_PS
alarm or ODUKSP_STA_INDI alarm.

Possible Causes
The possible causes of the automatic reverting failure are as follows:
l The working channel works abnormally or failure has not been resolved.
l Manual switching, locked switching or forced switching is not released.
l Cause 4: When the working channel is restored, service are restored to the working channel
after the time specified by WTR Time elapses.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 207


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 13 Automatic Reverting Failures

Fault Handling Flow

Figure 13-10 Fault handling flow when automatic switching back to the working channel by
ODUk SPRing protection fails

Start

Does Are the


No Handle the faults on the Yes
the working channel services switched to
work normally? working channel. the working
channel?

Yes No

Is a forced, or Yes Clear the forced, or Are the Yes


locked switching locked switching services switched to
command manually the working
issued? command.
channel?

No No

Does Are the


No Wait for a period Yes
the time exceed services switched to
longer than
WTR Time? the working
WTR Time. channel?

Yes No

Contact Huawei
End
engineers.

Tools, Equipment and Materials


U2000

Procedure
l Cause 1: The working channel works abnormally or failure has not been resolved.
1. Select the NE in the NE Explorer, and choose Configuration > ODUk SPRing from
the Function Tree.
2. In the Channel Mapping Relation interface, click Query to query the Channel
Status parameter of the working channel.
3. If Channel Status is displayed as SD, SF, check whether the following alarms exist
on the relevant boards on the working channel: R_LOS, R_LOC, HARD_BAD,
OTUk_LOF, OTUk_LOM, OTUk_AIS, ODUk_LOFLOM, ODUk_TCMn_OCI,
ODUk_TCMn_LCK, ODUk_TCMn_AIS, OTUk_EXC, ODUk_TCM6_DEG,
ODUk_TCM6_EXC. For the method of handling the alarms, see the Alarms and
Performance Events Reference.
l Cause 2: Locked switching or forced switching is not released.
1. In the ODUk SPRing interface, click Query to query the Switching Request-East
or Switching Request-West parameter of the protection group.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 208


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 13 Automatic Reverting Failures

2. If Switching Request-East or Switching Request-West is displayed as Forced


Switching-Ring, or Locked, right-click the protection group, and then choose
Clear from the shortcut menu, to clear the manual, locked or force switching
configuration.
l Cause 3: When the working channel is restored, services are restored to the working channel
after the time specified by WTR Time elapses.
1. In the ODUk SPRing window, click Query to view the WTR Time(mm:ss) value.
During the WTR time, the value of Switching Request-East or Switching Request-
West parameter should be WTR - Ring Switching. After the WTR time elapses, the
protection switching will automatically take place.

----End

Solution
If the preceding measures are not successful in switching the services to the working channel,
contact Huawei engineers to resolve the problem.

13.12 Optical Wavelength Shared Protection


This section describes the handling methods for services with optical wavelength shared
protection that cannot automatically switch back to the working channel even after the working
channel resumes normal operation and the revertive mode is properly configured.

Fault Description
In normal operation of a network, services are switched to the protection channel after the
working channel generates the protection triggering conditions. When the working channel
resumes normal operation, the services cannot be automatically switched back to the working
channel.

DCP board may report OWSP_PS alarm or OWSP_STA_INDI alarm.

Possible Causes
The possible causes of the automatic reverting failure are as follows:
l The working channel works abnormally or failure has not been resolved.
l Locked switching or forced switching is not released.
l Cause 4: When the working channel is restored, service are restored to the working channel
after the time specified by WTR Time elapses.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 209


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 13 Automatic Reverting Failures

Fault Handling Flow

Figure 13-11 Fault handling flow when automatic switching back to the working channel by
OWSP protection fails

Start

Does No Are the Yes


the working channel Handle the faults on the services switched to
working channel. the working
work normally? channel?

No
Yes

Is a forced Yes Clear the Are the Yes


switching command forced switching services switched to
manually issued? the working
command. channel?

No
No

No Wait for a period Are the Yes


Does the time exceed longer than services switched to
WTR Time? the working
WTR Time. channel?

No
Yes

Contact Huawei End


engineers.

Tools, Equipment and Materials


U2000

Procedure
l Cause 1: The working channel works abnormally or failure has not been resolved.
1. Select the NE in the NE Explorer, and choose Configuration > Optical Wavelength
Shared Protection from the Function Tree. Click OWSP Protection tab.
2. In the Channel interface, click Query to query the Channel Status parameter of the
working channel.
3. If Channel Status is displayed as SD, SF, check whether the following alarms exist
on the relevant boards on the working channel: MUT_LOS, R_LOS, R_LOC, R_LOF,
OTUk_LOF, OTUk_LOM, OTUk_AIS, ODUk_PM_AIS, ODUk_PM_OCI,
ODUk_PM_LCK, OTUk_TIM, ODUk_PM_TIM, ODUk_LOFLOM,
IN_PWR_HIGH, IN_PWR_LOW, B1_EXC, ODUk_PM_DEG, ODUk_PM_EXC,
OTUk_DEG, OTUk_EXC. For the method of handling the alarms, see the Alarms
and Performance Events Reference.
l Cause 2: Locked switching or forced switching is not released.
1. In the OWSP Protection interface, click Query to query the Switching Request-
East or Switching Request-West parameter of the protection group.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 210


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 13 Automatic Reverting Failures

2. If Switching Request-East or Switching Request-West is displayed as Forced


Switching-Ring, or Locked, right-click the protection group, and then choose
Clear from the shortcut menu, to clear the manual, locked or force switching
configuration.
l Cause 3: When the working channel is restored, services are restored to the working channel
after the time specified by WTR Time elapses.
1. In the OWSP Protection interface, click Query to query the WTR Time (s).

----End

Solution
If the preceding measures are not successful in switching the services to the working channel,
contact Huawei engineers to resolve the problem.

13.13 Linear Multiplex Section Protection


This section describes the handling methods for services with linear multiplex section protection
that cannot automatically switch back to the working channel even after the working channel
resumes normal operation and the revertive mode is properly configured.

Fault Description
In normal operation of a network, services are switched to the protection channel after the
working channel generates the protection triggering conditions. When the working channel
resumes normal operation, the services cannot be automatically switched back to the working
channel.

Possible Causes
The possible causes of the automatic reverting failure are as follows:
l The value of the Revertive Mode parameter of the protection group has been set to Non-
Revertive.
l The working channel works abnormally or failure has not been resolved.
l Manual switching, locked switching or forced switching is not released.
l When the working channel is recovered, the working channel recovery time does not exceed
the specified WTR Time (s).

Tools, Equipment and Materials


U2000

Procedure
l Cause 1: The value of the Revertive Mode parameter of the protection group has been set
to Non-Revertive.
1. In the NE Explorer, click the desired NE and choose Configuration > Linear MS
from the Function Tree.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 211


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 13 Automatic Reverting Failures

2. In the Linear MS interface, select one protection group and double click Revertive
Mode parameter area. Choose Revertive from drop-down list.
l Cause 2: The working channel works abnormally or failure has not been resolved.
1. Check whether the following alarms exist on the relevant boards on the working
channel: R_LOS, R_LOF, R_LOC, MS_AIS, B2_EXC, B2_SD. For the method of
handling the alarms, see the Alarms and Performance Events Reference.
l Cause 3: Locked switching or forced switching is not released.
1. Click Query in Linear MS interface and choose Query Switching Status.
2. If Switching Status-West is Switching or Lockout, right click the protection group
and choose Clear.
l Cause 4: When the working channel is recovered, the working channel recovery time does
not exceed the specified WTR Time (s).
1. In the Linear MS interface, click Query and choose Query Protection Group to
check WTR Time (s).

----End

Solution
If the preceding measures are not successful in switching the services to the working channel,
contact Huawei engineers to resolve the problem.

13.14 MSP Ring


This section describes the handling methods for services with MSP ring protection that cannot
automatically switch back to the working channel even after the working channel resumes
normal operation and the revertive mode is properly configured.

Fault Description
In normal operation of a network, services are switched to the protection channel after the
working channel generates the protection triggering conditions. When the working channel
resumes normal operation, the services cannot be automatically switched back to the working
channel.

Possible Causes
The possible causes of the automatic reverting failure are as follows:
l The working channel works abnormally or failure has not been resolved.
l Manual switching, locked switching or forced switching is not released.
l When the working channel is recovered, the working channel recovery time does not exceed
the specified WTR Time (s).

Tools, Equipment and Materials


U2000

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 212


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 13 Automatic Reverting Failures

Procedure
l Cause 1: The working channel works abnormally or failure has not been resolved.
1. Check whether the following alarms exist on the relevant boards on the working
channel: R_LOS, R_LOF, R_LOC, MS_AIS, B2_EXC, B2_SD. For the method of
handling the alarms, see the Alarms and Performance Events Reference.
l Cause 2: Locked switching or forced switching is not released.
1. In the NE Explorer, click the desired NE and choose Configuration > Ring MS from
the Function Tree.
2. Click Query in Ring MS interface and choose Query Switching Status.
3. If Switching Status is Forced switching or Lockout, right click the protection group
and choose Clear.
l Cause 3: When the working channel is recovered, the working channel recovery time does
not exceed the specified WTR Time (s).
1. In the Ring MS interface, click Query and choose Query Protection Group to check
WTR Time (s).

----End

Solution
If the preceding measures are not successful in switching the services to the working channel,
contact Huawei engineers to resolve the problem.

13.15 Transoceanic Ring MS Protection


This section describes the handling methods for services with transoceanic ring MS protection
that cannot automatically switch back to the working channel even after the working channel
resumes normal operation and the revertive mode is properly configured.

Fault Description
In normal operation of a network, services are switched to the protection channel after the
working channel generates the protection triggering conditions. When the working channel
resumes normal operation, the services cannot be automatically switched back to the working
channel.

Possible Causes
The possible causes of the automatic reverting failure are as follows:
l The working channel works abnormally or failure has not been resolved.
l Manual switching, locked switching or forced switching is not released.
l When the working channel is recovered, the working channel recovery time does not exceed
the specified WTR Time (s).

Tools, Equipment and Materials


U2000

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 213


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 13 Automatic Reverting Failures

Procedure
l Cause 1: The working channel works abnormally or failure has not been resolved.
1. Check whether the following alarms exist on the relevant boards on the working
channel: R_LOS, R_LOF, R_LOC, MS_AIS, B2_EXC, B2_SD. For the method of
handling the alarms, see the Alarms and Performance Events Reference.
l Cause 2: Manual switching, locked switching or forced switching is not released.
1. In the NE Explorer, click the desired NE and choose Configuration > Ring MS from
the Function Tree.
2. Click Query in Ring MS interface and choose Query Switching Status.
3. If Switching Status is Forced switching or Lockout, right click the protection group
and choose Clear.
l Cause 3: When the working channel is recovered, the working channel recovery time does
not exceed the specified WTR Time (s).
1. In the Ring MS interface, click Query and choose Query Protection Group to check
WTR Time (s).

----End

Solution
If the preceding measures are not successful in switching the services to the working channel,
contact Huawei engineers to resolve the problem.

13.16 SNCP
This section describes the handling methods for services with SNCP protection that cannot
automatically switch back to the working channel even after the working channel resumes
normal operation and the revertive mode is properly configured.

Fault Description
In normal operation of a network, services are switched to the protection channel after the
working channel generates the protection triggering conditions. When the working channel
resumes normal operation, the services cannot be automatically switched back to the working
channel.

Possible Causes
The possible causes of the automatic reverting failure are as follows:
l The value of the Revertive Mode parameter of the protection group has been set to Non-
Revertive.
l The working channel works abnormally or failure has not been resolved.
l Manual switching, locked switching or forced switching is not released.
l When the working channel is recovered, the working channel recovery time does not exceed
the specified WTR Time (s).

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 214


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 13 Automatic Reverting Failures

Tools, Equipment and Materials


U2000

Procedure
l Cause 1: The value of the Revertive Mode parameter of the protection group has been set
to Non-Revertive.
1. In the NE Explorer, click the desired NE and choose Configuration > SNCP Service
Control from the Function Tree.
2. In the SNCP Service Control interface, select one protection group and double click
Revertive Mode parameter area. Choose Revertive from drop-down list.
l Cause 2: The working channel works abnormally or failure has not been resolved.
1. Check whether the following alarms exist on the relevant boards on the working
channel: R_LOS, R_LOF, MS_AIS, AU_AIS, AU_LOP, HP_LOM, B2_EXC,
B3_SD, B3_EXC, HP_TIM, HP_UNEQ. For the method of handling the alarms, see
the Alarms and Performance Events Reference.
l Cause 3: Forced switching is not released.
1. Click Query in the SNCP Service Control interface.
2. Click Close in the Operation Result dialog box.
3. If Current Status is Forced switching to Protection, Manual switching to
Protection or Lockout, click Function and choose Clear.
l Cause 4: When the working channel is recovered, the working channel recovery time does
not exceed the specified WTR Time (s).
1. In the SNCP Service Control interface, click Query.
2. Click Close in the Operation Result dialog box.
3. Check WTR Time (s).

----End

Solution
If the preceding measures are not successful in switching the services to the working channel,
contact Huawei engineers to resolve the problem.

13.17 SNCTP
This section describes the handling methods for services with SNCTP protection that cannot
automatically switch back to the working channel even after the working channel resumes
normal operation and the revertive mode is properly configured.

Fault Description
In normal operation of a network, services are switched to the protection channel after the
working channel generates the protection triggering conditions. When the working channel
resumes normal operation, the services cannot be automatically switched back to the working
channel.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 215


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 13 Automatic Reverting Failures

Possible Causes
The possible causes of the automatic reverting failure are as follows:
l The value of the Revertive Mode parameter of the protection group has been set to Non-
Revertive.
l The working channel works abnormally or failure has not been resolved.
l Manual switching, locked switching or forced switching is not released.
l When the working channel is recovered, the working channel recovery time does not exceed
the specified WTR Time (s).

Tools, Equipment and Materials


U2000

Procedure
l Cause 1: The value of the Revertive Mode parameter of the protection group has been set
to Non-Revertive.
1. In the NE Explorer, click the desired NE and choose Configuration > SNCTP from
the Function Tree.
2. In the SNCTP interface, select one protection group and double click Revertive
Mode parameter area. Choose Revertive from drop-down list.
l Cause 2: The working channel works abnormally or failure has not been resolved.
1. Check whether the following alarms exist on the relevant boards on the working
channel: R_LOS, R_LOF, MS_AIS, AU_AIS, AU_LOP, HP_LOM, B2_EXC,
B3_SD, B3_EXC, HP_TIM, HP_UNEQ. For the method of handling the alarms, see
the Alarms and Performance Events Reference.
l Cause 3: Manual switching, locked switching or forced switching is not released.
1. Click Query Switching Status in SNCTP interface.
2. If Switching Status is Forced switching or Lockout, right click the protection group
and choose Clear.
l Cause 4: When the working channel is recovered, the working channel recovery time does
not exceed the specified WTR Time (s).
1. In the SNCTP interface, click Query Protection Group to check WTR Time (s).
----End

Solution
If the preceding measures are not successful in switching the services to the working channel,
contact Huawei engineers to resolve the problem.

13.18 Protection Cases


This section lists cases related to Protection.

NOTE

The following cases are related to the OptiX WDM product series.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 216


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 13 Automatic Reverting Failures

Related Cases:
l MC-A5 The LQG Board Reports the ALM_DATA_RLOS and ALM_DATA_TLOS
Alarms Transiently
l MC-A23 When the OTU Board Accesses Light, the Laser at Output End is Disabled
l MC-A25 The System Indicates that the Channel Number Is Illegal
l MC-A38 The Service Is Interrupted After the Protection Is Triggered
l MC-A41 Unsuccessful 1: N Protection Subnet Search
l MC-A91 Creation of Intra-Board Wavelength Protection on the LDG Fails
l MC-A94 The WXCP Protection Configured for the LOG Board Is Invalid
l MC-A98 The Protection Switching Times Out Severely
l MC-A127 WXCP Protection Cannot Be Implemented Due To Wrong Configuration of the
LQG Boards
l MC-A133 In a WDM System with 1:N Optical Channel Protection, the R_LOF Alarm Is
Reported in the Receive Direction on the Client Side When the Laser on the LWC1 Board
in the Protection Channel Is Open
l MC-A139 Automatic Switching Is Caused by the Incorrect Service Configuration of ODUk
SPRing Protection
l MC-A150 How Signals Are Selected When the DCP Board Is Used
l MC-A151 OWSP Protection Switching Fails Due to Incorrect Connection of Fiber Jumpers
On The DCP Boards, DCP Boards Report The MUT_LOS Alarms And TDG Board Reports
The OPU1_PLM Alarm
l MC-A155 Using the LDMD Board to Configure Intra-Board 1+1 Protection Fails Due to
the System Limit and the System Displays "The optical port number exceeds the limit"
l MC-A157 An OptiX OSN 6800 Network Cannot Be Reverted Back to the Working
Channel After Being Restored from an ODUk SNCP Protection Switching, The Board
Reports The ODU_SNCP_PS Alarm
l MC-A159 Services Are Interrupted After the Protection Hold-Off Time Is Set to 10s During
Configuration of Client-Side 1+1 Protection on the OptiX OSN 6800, The System Reports
The R_LOS Alarm
l MC-A161 The NS2 and TDG Boards Report the BUS_ERR Alarms at the Same Time After
ODUk SNCP Protection Is Configured Because The SCC Board Is Faulty
l MC-A171 How to Configure Two TMX Boards in a Client-Side 1+1 Protection Group
Through an SCS Board
l MC-A176 Loss of the Data About an Inter-Subrack Wavelength Protection Group Results
from a Change of an NE ID
l MC-A177 What Is the Difference Between the EOLP Board and the OLP Board
l MC-A220 SNCP Protection Switching Fails Due to Incorrect Fiber Connections
l MC-A228 Intra-Board 1+1 Protection Switching Fails on the OptiX OSN 1800 After a
Change of the NE ID
l MC-A229 Switching Fails Due to Incorrect SNCP Protection Mode on the WDM
Equipment
l MC-A232 The ODUk SNCP Protection Is Abnormal due to Incorrect Fiber Connections

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 217


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 13 Automatic Reverting Failures

l MC-A233 The LOM Board Fails to Carry Two FC400 Services due to Port Specification
Limitations When Inter-Subrack 1+1 Protection or Client 1+1 Protection Is Configured
l MC-A252 Switching of ODUk SNCP Protection Fails

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 218


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 14 Optical Power Management Problems

14 Optical Power Management Problems

About This Chapter

This topic describes how to troubleshoot optical power management problems.

14.1 IPA Start Failure


This topic describes how to troubleshoot the IPA function when it fails to start.

14.2 IPA End Failure


This topic describes how to troubleshoot the IPA function when it fails to end.

14.3 IPA Start Failure in a Raman System


This topic describes how to troubleshoot the IPA function when it fails to start in a Raman
system.

14.4 IPA End Failure in a Raman System


This topic describes how to troubleshoot the IPA function when it fails to end in a Raman system.

14.5 ALC Start Failure


This topic describes how to troubleshoot the ALC function when it fails to start after the line
attenuation changes when DCN communication between NEs is working normal and there are
no fiber cuts along the line.

14.6 APE Exceptions


When the communication between NEs is working normally and there are no fiber cuts along
the line, the U2000 reports an optical power equilibrium event and then starts APE adjustment.
However, APE does not function and the adjustment fails. This topic describes how to handle
such an exception in the APE function.

14.7 Handling Anomalies in the OPA Function


This section describes fault handling when the OPA function fails to calculate the VOA of the
optical-layer boards after the optical cross-connections are configured.

14.8 Optical Power Management Cases


This section lists cases related to Optical Power Management.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 219


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 14 Optical Power Management Problems

14.1 IPA Start Failure


This topic describes how to troubleshoot the IPA function when it fails to start.

Symptom
During normal network operations, when a fiber cut occurs, and the intelligent power adjustment
(IPA) function may fail to start. As a result, the OAs at both ends of the optical fiber fail to
automatically shut down the lasers on the output optical ports.

Possible Causes
l Cause 1: The IPA function is disabled.
l Cause 2: The fault is resolved before the time specified by Start Hold-off Time elapses.
l Cause 3: The IPA protection group is configured with auxiliary detection boards. When a
fiber cut occurs, some of the auxiliary detection boards do not report any alarm.
l Cause 4: The IPA protection group is configured with auxiliary detection boards. However,
the auxiliary detection board ports are incorrectly set.

Troubleshooting Workflow
Figure 14-1 illustrates the IPA start failure troubleshooting process.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 220


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 14 Optical Power Management Problems

Figure 14-1 IPA start failure troubleshooting flowchart

Start

No Yes
Is IPA Status set to
Set IPA Status to Enabled. Is IPA started?
Enabled?

No
Yes

Is the fault resolved No Yes


before the time specified Wait for a period longer
by Start Hold-off Time Is IPA started?
than Start Hold-off Time.
elapses?

No
Yes

No Check the fiber Yes


Do all the auxiliary detection Are the line services
connections of the OA
boards report IPA-triggering restored?
board.
alarms?

Yes No

No Re-set the Yes


Are auxiliary detection auxiliary detection board Is IPA started?
board ports set correctly? ports.

No
Yes

Contact Huawei
engineers.
End

Tools, Equipment, and Materials


U2000

Procedure
l Cause 1: The IPA function is disabled.
1. In the NE Explorer, select the NE and choose Configuration > IPA Management.
Click Query to check the IPA Status of IPA.
2. If IPA Status is set to Disabled, change it to Enabled. Click Apply.
3. Query the IPA Status of the NE at the opposite end. Ensure that the IPA function of
the NEs at the two ends of the line is set to Enabled.
l Cause 2: The fault is resolved before the time specified by Start Hold-off Time elapses.
In case of a fiber cut, the IPA function is enabled only after the start hold-off time expires.
1. In the IPA Management window, click Set Start Delay. In the dialog box that is
displayed, click Query to query the Start Hold-off Time.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 221


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 14 Optical Power Management Problems

2. Check whether the fault is resolved before the time specified by Start Hold-off
Time elapses.
l Cause 3: The IPA protection group is configured with auxiliary detection boards. When a
fiber cut occurs, some of the auxiliary detection boards do not report any alarm.
1. Check whether each auxiliary detection board is reporting IPA-triggering alarms, such
as R_LOS, R_LOF, and OSC_RDI. If some of the auxiliary detection boards are not
reporting such alarms, check whether the fiber connection between the OA board and
the FIU, and that between the FIU and the auxiliary detection board, comply with the
engineering design.
l Cause 4: The IPA protection group is configured with auxiliary detection boards. However,
the auxiliary detection board ports are incorrectly set.
1. Check whether the auxiliary detection board ports are receiving optical signals from
the same direction as the detection board ports. If they are not receiving optical ports,
re-set the auxiliary detection board ports.

----End

Solution
If the problem persists after you take the preceding measures, handle the alarms and faults of
the NE, and contact Huawei technical support for resolving the IPA start failure.

14.2 IPA End Failure


This topic describes how to troubleshoot the IPA function when it fails to end.

Symptom
During normal network operations, when a fiber cut occurs, the IPA function starts. The lasers
on the output optical ports of the OAs at the upstream station of the optical fiber are automatically
shut down. After the optical fiber is restored, the IPA function does not automatically turn the
lasers on the output optical ports back on. As a result, the output optical power of the OAs is not
restored.

Possible Causes
l Cause 1: When the IPA protection group is not configured with any detection board and
is configured with only auxiliary detection boards, all the auxiliary detection boards are
reporting IPA-triggering alarms.
l Cause 2: One or more of IPA function parameters are incorrectly set.

Troubleshooting Workflow
Figure 14-2 illustrates the IPA end failure troubleshooting process.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 222


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 14 Optical Power Management Problems

Figure 14-2 IPA end failure troubleshooting flowchart

Start

Do all the auxiliary No Handle the fiber connection Yes


detection boards report Are the services
errors or board faults on the
IPA-triggering alarms? restored?
auxiliary detection boards.

No
Yes

No Yes
Do the parameter values Are the services
comply with the Modify the parameters.
restored?
specifications?

No
Yes

Set IPA Status to


Disanabled to disable the
IPA function.

Yes

Are the srivices No


Query NE alarms and
recovered?
handle other faults.

Yes

Contact Huawei
End
engineers.

Tools, Equipment, and Materials


U2000

Procedure
l Cause 1: When the IPA protection group is not configured with any detection board and
is configured with only auxiliary detection boards, all the auxiliary detection boards are
reporting IPA-triggering alarms.
1. Check whether each auxiliary detection board is reporting IPA-triggering alarms, such
as R_LOS, R_LOF, and OSC_RDI. If any of these alarms is being reported, check
the fiber connection of each auxiliary detection board.
2. If the fiber connection of each auxiliary detection board is normal, check whether one
of the auxiliary detection boards is faulty.
l Cause 2: One or more of IPA function parameters are incorrectly set.
1. In the NE Explorer, click the NE and choose Configuration > IPA Management.
2. In the IPA Management window, click Query and check whether the IPA parameters
are correctly set. For more information about how to set each parameter, refer to
"Dependency and Limitation" and "Parameters".

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 223


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 14 Optical Power Management Problems

NOTE

l Check whether the Off Period, On Period, and Start Test Duration values of the two
jointly used IPA pairs are the same.
l Check whether the On Period value of the two IPA pairs is the same as the recommended
value calculated based on the number of intermediate stations.

----End

Solution
If the problem persists after you take the preceding measures, disable the IPA function by setting
the IPA Status in the IPA Management window to Disabled. In the WDM interface window
of the corresponding board, confirm that Laser Status of the transmit optical port of Laser
Control Board is set to On.
Check whether the services are restored. If services are not restored, query the alarms of the NE
and clear the alarms and faults of the NE. If the services are still not restored, contact Huawei
technical support for resolving the IPA end failure.

14.3 IPA Start Failure in a Raman System


This topic describes how to troubleshoot the IPA function when it fails to start in a Raman
system.

Symptom
During normal network operations, when a fiber cut occurs, the IPA function fails to start. As a
result, the OAs at both ends of the optical fiber fail to automatically shut down the lasers on the
output optical ports.

Possible Causes
l The IPA function is disabled.
l The fault is resolved before the time specified by Start Hold-off Time elapses.
l The communication between the case-shape Raman pump amplifier unit for C-band
(CRPC) frame and the subrack is faulty.
l The preset power threshold of the detection board is too low.
l The preset power threshold of the Raman board is too low.
l The IPA protection group is configured with auxiliary detection boards. When a fiber cut
occurs, some of the auxiliary detection boards do not report any alarm.
l The IPA protection group is configured with auxiliary detection boards. However, the
auxiliary detection board ports are incorrectly set.
l IPA parameters are incorrectly set.

CAUTION
Ensure that the optical fibers are properly connected and the configuration is correct.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 224


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 14 Optical Power Management Problems

NOTE

When the RAU1 or RAU2 board functions as the Raman amplifier,


l The TN14FIU and OSC boards function as the auxiliary detection boards to implement IPA, you do
not need to configure the detection board. In this case, IPA Detection Board Threshold cannot be
set.
l It does not support fiber-break detection and therefore the Raman Board Threshold cannot be set.

Troubleshooting Workflow
Figure 14-3 illustrates the IPA start failure in a Raman system troubleshooting process.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 225


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 14 Optical Power Management Problems

Figure 14-3 IPA start failure in a Raman system troubleshooting flowchart

Start

No Yes
Is IPA Status set to Set IPA Status to
Enabled? Is IPA started?
Enabled.

Yes No

Is the fault resolved before No Yes


Wait for a period longer Is IPA started?
the time specified by Start than Start Hold-off Time.
Hold-off Time elapses?

Yes No

Is the communication No Yes


between CRPC frame and Check the connectors and Is IPA started?
the master subrack normal? replace the network cables.

Yes No

Yes
No Re-calculate and
Is the receive optical power change the threshold. Is IPA started?
of the Raman board lower
than the threshold? ?

Yes

Yes
Is the receive optical power No Re-calculate and
Is IPA started?
of the detection board lower change the threshold.
than the threshold? ?

Yes No

No Yes
Do all auxiliary detection Check the fiber connections
Are the line services
boards report IPA-triggering of the OA board.
restored?
alarms?

Yes No

Are all auxiliary No Yes


Re-set the auxiliary
detection board ports Is IPA started?
detection board ports.
set correctly?

Yes No

Do the parameter No Yes


values comply with the Modify the parameters. Is IPA started?
specifications?

Yes No

Contact Huawei End


engineers.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 226


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 14 Optical Power Management Problems

Tools, Equipment, and Materials


U2000

Procedure
l Cause 1: The IPA function is disabled.
1. In the NE Explorer, click the NE and choose Configuration > IPA Management.
Then click Query to query IPA Status.
2. If IPA Status is set to Disabled, change it to Enabled and click Apply.
3. Query IPA Status of the opposite NE. Ensure that IPA Status of the NEs at both ends
of the line is set to Enabled.
l Cause 2: The fault is resolved before the time specified by Start Hold-off Time elapses.
After the fiber cut occurs, the IPA function is enabled only after the start hold-off time
elapses.
1. In the IPA Management window, click Set Start Delay. In the dialog box that is
displayed, click Query to query the Start Hold-off Time.
2. Check whether the fault is resolved before the time specified by Start Hold-off
Time elapses.
l Cause 3: The communication between the CRPC frame and the subrack is faulty.
1. On the U2000, check whether the CRPC board status is normal.
2. If the query fails, check whether the connectors and network cables are properly
installed.
l Cause 4: The preset optical value of the detection board threshold is too low.
1. Test the actual receive optical power of the detection board, and compare the test result
with the value of the IPA parameter IPA Detection Board Threshold.
2. If the actual receive optical power is higher than the value of Raman Board
Threshold, recalculate the Raman Board Threshold value and change it to a value
higher than the actual receive optical power of the detection board.
NOTE

For the Raman Board Threshold calculation method, refer to "Parameters".


3. If the actual receive optical power is higher than the value of IPA Detection Board
Threshold, recalculate the IPA Detection Board Threshold value and change it to
a value higher than the actual receive optical power of the detection board.
NOTE

For the IPA Detection Board Threshold calculation method, refer to "Parameters".
l Cause 5: The preset value of the Raman board threshold is too low.
1. Test the actual receive optical power of the Raman board, and compare the test result
with the IPA parameter Raman Board Threshold.
2. If the actual receive optical power is lower than the Raman board threshold, recalculate
the Raman board threshold and change it to a value lower than the actual receive
optical power.
NOTE

For the Raman Board Threshold calculation method, refer to "Parameters".

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 227


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 14 Optical Power Management Problems

l Cause 6: The IPA protection group is configured with auxiliary detection boards. When a
fiber cut occurs, some of the auxiliary detection boards do not report any alarm.
1. Check whether each auxiliary detection board is reporting IPA-triggering alarms, such
as R_LOS, R_LOF, and OSC_RDI. If some of the auxiliary detection boards are not
reporting such alarms, check whether the fiber connection between the OA board and
the FIU, and that between FIU and the auxiliary detection board, comply with the
engineering design.
l Cause 7: The IPA protection group is configured with auxiliary detection boards. However,
the auxiliary detection board ports are incorrectly set.
1. Check the auxiliary detection board ports are receiving optical signals from the same
direction as the detection board ports. If they are not receiving optical signals, re-set
the auxiliary detection board ports.
l Cause 8: IPA parameters are incorrectly set.
1. In the IPA Management window, click Query and check whether IPA parameters
are correctly set. For more information about how to set each parameter, refer to
"Dependency and Limitation" and "Parameters".
NOTE

Main IPA parameters include Detection Board, Laser Control Board, IPA Detection Board
Threshold, Raman Board Threshold, and Raman Board Alarm Reporting.

----End

Solution
If the problem persists after you take the preceding measures, clear the alarms and faults of the
NE. Contact Huawei technical support for resolving the IPA function failure.

14.4 IPA End Failure in a Raman System


This topic describes how to troubleshoot the IPA function when it fails to end in a Raman system.

Symptom
During normal network operations, when a fiber cut occurs, the IPA function starts. The lasers
on the output optical ports of the OAs at both ends of the optical fiber are automatically shut
down. After the optical fiber is restored, the IPA function does not automatically turn \the lasers
on the output optical ports back on. As a result, the output optical power of the OAs is not
restored.

Possible Causes
l When the IPA protection group is not configured with a Detection Board and is configured
with only auxiliary detection boards, all the auxiliary detection boards are reporting IPA-
triggering alarms.
l The preset value of the Raman board threshold is too high.
l The preset value of the detection board threshold is too high.
l IPA parameters are incorrectly set.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 228


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 14 Optical Power Management Problems

l The lasers on the local and remote Raman boards are not turned on manually within the
time specified by On Period or Start Test Duration.

NOTE

When the RAU1 or RAU2 board functions as the Raman amplifier,


l The TN14FIU and OSC boards function as the auxiliary detection boards to implement IPA, you do
not need to configure the detection board. In this case, IPA Detection Board Threshold (dBm)
cannot be set.
l It does not support fiber-break detection and therefore the Raman Board Threshold (dBm) cannot
be set.

Troubleshooting Workflow
Figure 14-4 illustrates the IPA function failure in a Raman system troubleshooting process.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 229


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 14 Optical Power Management Problems

Figure 14-4 IPA function failure in a Raman system troubleshooting flowchart

Start

Handle the fiber


Do all auxiliary Yes connection errors or Yes
detection boards board faults on the Are the services
report IPA-triggering auxiliary detection restored?
alarms? boards.

No
No

Is the receive optical power No Re-calculate and change Yes


the threshold. Are the services
of the Raman board higher
restored?
than the threshold?

No
Yes

Is the receive optical power No Re-calculate and Yes


change the threshold. Are the services
of the detection board
restored?
higher than the threshold?

No
Yes

No Yes
Do the other parameter Modify the Are the services
values comply with the parameters. restored?
specifications?

No
Yes

Are the lasers No Manually turn on the Yes


manually turned on lasers within the Are the services
within the specified specified time. restored?
time?

No
Yes

Set IPA Status


to Disabled to disable
the IPA function.

No
Are the services Query NE alarms and
restored? handle other faults.

Yes

End
Contact Huawei
engineers.

Tools, Equipment, and Materials


U2000

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 230


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 14 Optical Power Management Problems

Procedure
l Cause 1: When the IPA protection group is not configured with a Detection Board and is
configured with only auxiliary detection boards, all the auxiliary detection boards are
reporting IPA-triggering alarms.
1. Check whether each auxiliary detection board is reporting IPA-triggering alarms, such
as R_LOS, R_LOF, and OSC_RDI. If some of the auxiliary detection boards are
reporting such alarms, check the fiber connection of each auxiliary detection board.
2. If the fiber connection of each auxiliary detection board is normal, check whether one
of the auxiliary detection boards is faulty.
l Cause 2: The preset value of the Raman board threshold is too high.
1. Test the actual receive optical power of the Raman board, and compare the test result
with the IPA parameter Raman Board Threshold.
2. If the actual receive optical power is lower than the Raman board threshold, recalculate
the Raman board threshold and change it to a value lower than the actual receive
optical power.
NOTE

For the Raman Board Threshold calculation method, refer to "Parameters".


l Cause 3: The preset value of the detection board threshold is too high.
1. Test the actual receive optical power of the detection board, and compare the test result
with the IPA parameter IPA Detection Board Threshold.
2. If the actual receive optical power is lower than the detection board threshold,
recalculate the detection board threshold and change it to a value lower than the actual
receive optical power.
NOTE

For the IPA Detection Board Threshold calculation method, refer to "Parameters".
l Cause 4: IPA parameters are incorrectly set.
1. In the NE Explorer, click the NE and choose Configuration > IPA Management.
2. In the IPA Management window, click Query to check whether IPA parameters are
correctly set. For more information about how to set each parameter, refer to
"Dependency and Limitation" and "Parameters".
NOTE

Do the follows to check the parameters:


l Check whether the Off Period, On Period and Start Test Duration values of the two
jointly used IPA pairs are the same.
l Check whether the value of On Period value of the two IPA pairs is the same as the
recommended value calculated based on the number of intermediate stations.
l Cause 5: The lasers on the local and remote Raman boards are not turned on manually
within the time specified by On Period or Start Test Duration.
1. Check whether the On Period and Start Test Duration values of the two IPA pairs
is the same as the recommended value calculated based on the number of intermediate
stations.
2. Manually turn on the lasers on the local and remote Raman boards within the specified
time.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 231


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 14 Optical Power Management Problems

NOTE

This operation applies only to the scenario where lasers need to be manually opened after Laser
Restart Mode is set to Manual.
If the CRPC03 and ROP board are involved in the IPA function, the laser on the CRPC03 and ROP
board need to be manually opened.

----End

Solution
If the problem persists after you take the preceding measures, set the IPA Status in the IPA
Management window to Disabled to disable the IPA function. In the WDM Interface window
of the corresponding board, confirm that the Laser Status of the transmit optical interface of
the laser control board is set to On.

Check whether the services are restored. If services have not been restored, query the alarms of
the NE and clear the alarms and faults of the NE. If the services are still not restored, contact
Huawei technical support for resolving the IPA function failure.

14.5 ALC Start Failure


This topic describes how to troubleshoot the ALC function when it fails to start after the line
attenuation changes when DCN communication between NEs is working normal and there are
no fiber cuts along the line.

Symptom
During normal operations of an ALC-enabled network, the attenuation of a line section changes
and the ALC function does not start. As a result, all OAs at each downstream station cannot
make adjustments. This affects the system OSNR and service transmission.

Possible Causes
l Cause 1: The ALC Automatic Regulation Switch is set to Disabled.
l Cause 2: ALC stops abnormally due to other external causes. ALC nodes report the event
indicating that ALC has ended unexpectedly.
l Cause 3: MUT_LOS or BD_STATUS alarms are being reported on the related board.
l Cause 4: ALC parameters are configured incorrectly. Therefore, the consistency check
fails.

Troubleshooting Workflow
Figure 14-5 illustrates the ALC function failure troubleshooting process.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 232


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 14 Optical Power Management Problems

Figure 14-5 ALC function failure troubleshooting flowchart

Start

Is Automatic No Yes
Regulation Switch Start ALC. Is ALC started?
set to Enabled ?

Yes No

No Does any node report


the event indicating that
ALC has ended
unexpectedly?

Yes

Is the Yes Yes


MUT_LOS or BD_STATUS Clear the alarm. Is ALC started?
alarm being reported?

No No

No Yes
Is the consistency check Check and modify
Is ALC started?
successful? ALC parameter values.

Yes No

Contact Huawei
End
engineers.

Tools, Equipment, and Materials


U2000

Procedure
l Cause 1: The ALC Automatic Regulation Switch is set to Disabled.
1. Choose Configuration > WDM ALC Management from the Main Menu. In the
WDM ALC Management window that is displayed, click NG WDM Single Station
Configuration tab.
2. In the left-hand NE list, select the desired NE. Check whether the Automatic
Regulation Switch is set to Disabled. If it is disabled, enable ALC. For more
information, refer to "Starting Automatic Link Adjustment".
l Cause 2: ALC stops abnormally due to other external causes. ALC nodes report the event
indicating that ALC has ended unexpectedly.
1. On the left of the WDM ALC Management window, query the ALC events reported.
Check whether there are events indicating that the ALC has ended unexpectedly. If
the ALC has ended unexpectedly, identify the reason.
l Cause 3: MUT_LOS or BD_STATUS alarms are being reported on the related board.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 233


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 14 Optical Power Management Problems

1. Check whether MUT_LOS or BD_STATUS alarms are being reported on the ALC-
related OA and VOA boards at the station. If either of these alarms is being reported,
clear the alarm.
l Cause 4: ALC parameters are configured incorrectly. Therefore, the consistency check
fails.
1. Perform a consistency check of the ALC links. For more information, refer to
"Performing Consistency Check of ALC Links". If the consistency check fails, check
whether the ALC parameters of each ALC node are configured correctly. For details
on how to set each parameter, refer to "Dependency and Limitation and "Parameters".

----End

Solution
If the problem persists after you take the preceding measures, query NE alarms and clear the
alarms and faults on the NE. Contact Huawei technical support for resolving ALC function
faults.

14.6 APE Exceptions


When the communication between NEs is working normally and there are no fiber cuts along
the line, the U2000 reports an optical power equilibrium event and then starts APE adjustment.
However, APE does not function and the adjustment fails. This topic describes how to handle
such an exception in the APE function.

Background Information
The OptiX OSN 3800 does not support the APE function.

Fault Description
During normal operations on an APE-enabled network, the U2000 reports an optical power
equilibrium event and then starts APE adjustment. However, the system fails to automatically
adjust the optical power at the transmit end of each channel or report an event indicating that
APE has ended unexpectedly. In this case, the equilibrium of the receive-end optical power and
OSNR cannot be ensured.

NOTE

If a fiber cut occurs along the line or communication is abnormal, handle the network faults.
NOTE

If the U2000 reports both an ALC event and an optical power equilibrium event, start the ALC function.
If the optical power equilibrium event persists after the adjustment is completed, start the APE function.

Possible Causes
l Cause 1: APE parameters are configured incorrectly.
l Cause 2: The Optical Interface Attenuation Ratio of the APE power regulation unit has
reached a critical point and can no longer be adjusted. As a result, the adjustment cannot
be started.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 234


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 14 Optical Power Management Problems

l Cause 3: Limited APE adjustment is performed and further adjustment is required.


l Cause 4: The Standard Power Offset settings are insufficient to meet the requirements of
the live network.

Troubleshooting Workflow
Figure 14-6 illustrates the APE function exception troubleshooting process.

Figure 14-6 APE function exception troubleshooting flowchart

Start

No Are alarms being reported Yes


Are there alarms on the Is optical power
regulating and monitoring on the regulation and
monitoring units? adjusted?
units?

No
Yes

No Yes
Do the APE parameter
Modify the parameters. Is optical power
values comply with the
adjusted?
specifications?

No
Yes

Is there any margin in No Yes


Optical Interface Audust the optical power Is optical power
Attenuation Ratio of the budget of the line. adjusted?
APE regulation unit?

No
Yes

No Yes
Is the difference between
Restart APE. Is optical power
ActualPower Offset and Standard
adjusted?
Power Offset lower than the
threshold?
No
Yes

Yes
Re-calibrate the value of Is optical power
Standard Power Offset. adjusted?

No

Contact Huawei engineers. End

Tools, Equipment, and Materials


U2000

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 235


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 14 Optical Power Management Problems

Procedure
l Cause 1: APE parameters are configured incorrectly.
1. In the NE Explorer, select the desired NE and choose Configuration > Optical Power
Equilibrium from the Function Tree.
2. Click Query to check whether APE parameters are configured correctly. For more
information about how to set each parameter, refer to "Dependency and Limitation
and Parameters".
NOTE

Main APE parameters include Power Monitoring Unit, Odd Wavelength Power Regulation
Unit, Even Wavelength Power Regulation Unit, and Monitoring Flag of the related
wavelength.
l Cause 2: The Optical Interface Attenuation Ratio of the APE power regulation unit has
reached a critical point and can no longer be adjusted. As a result, the adjustment cannot
be started.
1. In the NE Explorer, select the desired power regulation unit and choose
Configuration > WDM Interface to query the Optical Interface Attenuation
Ratio board parameter.
2. Check whether the Optical Interface Attenuation Ratio of each optical port where
the APE monitoring wavelength is located has reached the critical value and there is
no margin for adjustment.
3. If the Optical Interface Attenuation Ratio value has reached this point, adjust the
optical attenuation of the regulation unit at the upstream station, thereby decreasing
the optical attenuation required by the local station.
l Cause 3: Insufficient APE adjustment has occurred and further adjustment is required.
1. In the NE Explorer, select the desired NE and choose Configuration > Optical Power
Equilibrium from the Function Tree.
2. Query the Actual Power Offset and Standard Power Offset of the APE monitoring
wavelength and the Power Unbalance Threshold.
3. If the maximum difference between Actual Power Offset and Standard Power
Offset in all monitoring wavelengths is less than 7 dB and Power Unbalance
Threshold uses the default value (1.5), start the APE function again.
4. If the maximum difference between Actual Power Offset and Standard Power
Offset in all monitoring wavelengths is greater than 7 dB and Power Unbalance
Threshold uses the default value (1.5), increase the value of Power Unbalance
Threshold and start the APE function again.
NOTE

The Power Unbalance Threshold ranges from 0.5 dBm to 3 dBm.


5. If the APE function starts successfully, set Power Unbalance Threshold to the
original value.
l Cause 4: The Standard Power Offset settings are insufficient to meet the requirements of
the live network.
1. Re-design the optical power budget and re-calibrate the Standard Power Offset.

----End

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 236


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 14 Optical Power Management Problems

Solution
If the problem persists after you take the preceding measures, contact Huawei technical support
for resolving the APE function faults.

14.7 Handling Anomalies in the OPA Function


This section describes fault handling when the OPA function fails to calculate the VOA of the
optical-layer boards after the optical cross-connections are configured.

Fault Description
After the optical cross-connections in automatic mode are configured, the optical path is
unavailable.

Possible Causes
l Cause 1: The rated optical power of the OA board is set incorrectly, and thus the NE reports
the OPA_FAIL_INDI alarm.
l Cause 2: The optical power budget is insufficient, and thus the NE reports the
OPA_FAIL_INDI alarm.

Tools, Equipment and Materials


U2000

Procedure
l Cause 1: The rated optical power of the OA board is set incorrectly, and thus the NE reports
the OPA_FAIL_INDI alarm.
1. In Explorer, choose the desired OA board.
2. In Function Tree on the left side, choose Configuration > WDM Interface and then
select Advanced Attributes.
3. Double-click the Rated Optical Power (dBm) fields of the input and output
interfaces, and modify the values.
l Cause 2: The optical power budget is insufficient, and thus the NE reports the
OPA_FAIL_INDI alarm.
1. Adjust the system optical power budget, which is insufficient.

----End

Solution
After the above steps are performed, contact Huawei engineers to remove the faults in the OPA
function.

14.8 Optical Power Management Cases


This section lists cases related to Optical Power Management.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 237


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 14 Optical Power Management Problems

14.8.1 Automatic Power Equilibrium (APE)


Case analysis related to Automatic Power Equilibrium (APE) cases.

NOTE

The following cases are the cases relevant to the OptiX WDM product series.

Related Cases:
l MC-A8 The TN11OAU101 at the Transmit End Reports the MUT_LOS Alarm

14.8.2 Automatic Level Control (ALC)


Case analysis related to Automatic Level Control cases.

NOTE

The following cases are the cases relevant to the OptiX WDM product series.

Related Cases:
l MC-A77 Improper Configuration of Protocol Channels Results in a Failure of the ALC to
Start
l MC-A81 The Created ALC Link on OptiX OSN 6800 Cannot Be Automatically Adjusted
l MC-A121 The T2000 Reports a Failure In Querying the ALC Link Status Because Of Too
Many NEs On the Link
l MC-A169 T2000 Reports Errors After the Standard Optical Power of Single Wavelength
Is Set to 7 dBm with ALC in Power Reference Mode

14.8.3 Intelligent Power Adjustment (IPA)


Case analysis related to Intelligent Power Adjustment cases.

NOTE

The following cases are the cases relevant to the OptiX WDM product series.

Related Cases:
l MC-A78 When the IPA Is Set or Deleted, the OAU Alarm Threshold Changes Due to
Version Features
l MC-A116 Failure in Configuring the SC2 as an Auxiliary Detection Board in the IPA
Protection Group
l MC-A175 How to Clear the LASER_HAZARD_WARNING Alarms Reported on Certain
OBU and OAU Boards After an Upgrade of Software

14.8.4 Intelligent Power Adjustment (IPA) of Raman System


Case analysis related to Intelligent Power Adjustment (IPA) of Raman System cases.

NOTE

The following cases are the cases relevant to the OptiX WDM product series.

Related Cases:

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 238


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 14 Optical Power Management Problems

l MC-A116 Failure in Configuring the SC2 as an Auxiliary Detection Board in the IPA
Protection Group

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 239


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 15 Packet Service Problems

15 Packet Service Problems

About This Chapter

This topic describes the symptoms, impact, and possible causes of MPLS tunnel faults such as
creation failures or service interruptions. In addition, this topic describes the required tools,
precautions, and procedures for rectifying the fault.

15.1 Handling MPLS Tunnel Faults


This section describes the MPLS tunnel faults such as the creation failure or service interruption
in terms of the symptoms, impact on the system, possible causes, tools required for
troubleshooting, troubleshooting procedure, and precautions that should be taken during the
troubleshooting.

15.2 Handling PW Faults


This section describes the PW faults such as the creation failure or service interruption in terms
of the symptoms, impact on the system, possible causes, tools required for troubleshooting,
troubleshooting procedure, and precautions that should be taken during the troubleshooting.

15.3 Native Ethernet Service Troubleshooting


The Ethernet services based on the packet mode are transmitted over a packet switched network.
The faults of native Ethernet services include service interruption and service degradation.

15.4 PWE3 Ethernet Service Troubleshooting


This topic describes how to troubleshoot PWE3 Ethernet service interruption or packet loss in
terms of the symptoms, impact on the system, possible causes, tools required for troubleshooting,
troubleshooting procedure, and precautions that should be taken during the troubleshooting.

15.5 Tunnel APS Troubleshooting


This topic describes the symptom of common faults when the Tunnel APS function is enabled
and the solution to the relevant fault.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 240


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 15 Packet Service Problems

15.1 Handling MPLS Tunnel Faults


This section describes the MPLS tunnel faults such as the creation failure or service interruption
in terms of the symptoms, impact on the system, possible causes, tools required for
troubleshooting, troubleshooting procedure, and precautions that should be taken during the
troubleshooting.

Symptoms
Symptom 1: Creating the tunnel fails, and thus the service is unavailable.

Symptom 2: The tunnel is faulty, and thus the service is interrupted.

Symptom 3: The protection switching fails, and thus the service is interrupted.

Impact on the System


If the tunnel is faulty, the service carried by this tunnel on the network is affected, but the services
carried by other tunnels are normal.

Possible Causes
l For symptom 1, the possible causes are as follows:
– Cause 1: Creating the tunnel fails.
l For symptom 2, the possible causes are as follows:
– Cause 1: The routing fails.
– Cause 2: The physical link where the tunnel resides is faulty.
l For symptom 3, the possible causes are as follows:
– Cause 1: The protection switching fails.

Tools, Instruments, and Materials


l U2000

Procedure
Step 1 For symptom 1: Creating the tunnel fails, and thus the service is unavailable.
1. Cause 1: Creating the cross-connection fails.
a. Check whether different NEs on the network use the same network segment. If
different NEs use the same network segment, change the IP address of the port.
b. Check whether the tunnel is configured with the incompatible protection feature. For
details, see the Release Notes.

Step 2 For symptom 2: The tunnel is faulty, and thus the service is interrupted.
1. Cause 1: The routing fails.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 241


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 15 Packet Service Problems

a. According to the network planning, check whether the parameters of the ports such
as the IP address of the port at the two ends of the tunnel are correctly set. Correct the
parameter settings, and then create the tunnel again.
b. Check whether there is a complete and reachable link between the NEs where the
source and sink nodes of the tunnel reside. If the link does not exist, repair the
incomplete or faulty part of the link.
2. Cause 2: The physical link where the tunnel resides is faulty.
a. Check whether the HARD_BAD or ETH_LOS alarm indicating that the physical link
used by the tunnel is faulty exists in the system. If any alarm exists, take priority to
handle it.
b. Enable the tunnel OAM function. Then, check and handle the following alarms that
exist in the system, MPLS_TUNNEL_LOCV, MPLS_TUNNEL_SD and
MPLS_TUNNEL_SF.
c. Check whether anomalies such as board faults or NE resetting exist on the opposite
equipment If yes, rectify the anomalies.

Step 3 For symptom 3: The protection switching fails, and thus the service is interrupted.
1. Cause 1: The protection switching fails.
a. The MPLS tunnel APS protection switching fails. For details, see Feature
Description.

----End

Related Information
None

15.2 Handling PW Faults


This section describes the PW faults such as the creation failure or service interruption in terms
of the symptoms, impact on the system, possible causes, tools required for troubleshooting,
troubleshooting procedure, and precautions that should be taken during the troubleshooting.

Symptoms
Symptoms 1: Creating the PW fails, and thus the service is unavailable.

Symptoms 2: The PW is faulty, and thus the service is interrupted, or packet loss or bit errors
occur in the service.

Impact on the System


If a PW is faulty, the service carried in the PW is affected, but the services carried by other PWs
are normal.

Possible Causes
l For symptom 1, the possible causes are as follows:
– Cause 1: Creating the PW cross-connection fails.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 242


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 15 Packet Service Problems

l For symptom 2, the possible causes are as follows:


– Cause 1: The physical link is faulty.
– Cause 2: The tunnel where the PW resides is faulty.
– Cause 3: The protection switching fails.
– Cause 4: The equipment is faulty.

Tools, Instruments, and Materials


l U2000

Procedure
Step 1 For symptom 1: Creating the PW fails, and thus the service is unavailable.
1. Cause 1: Creating the PW cross-connection fails.
a. Check whether the number of PWs reaches the upper limit of the NE or that of the
board. If yes, make a plan again or delete redundant PWs.
b. Check whether the label of PW is existent. If yes, modify the label of PW.
Step 2 For symptom 2: The PW is faulty, and thus the service is interrupted, or packet loss or bit
errors occur in the service.
1. Cause 1: The physical link is faulty.
a. Check whether the physical link between the source and sink nodes of the PW are
normal, and whether the R_LOS, or ETH_LOS alarm exists in the system.
b. If the alarm exists, take priority to handle the ETH_LOS alarm.
2. Cause 2: The tunnel where the PW resides is faulty.
a. For details on the tunnel faults, see 15.1 Handling MPLS Tunnel Faults.
3. Cause 3: The protection switching fails.
a. The PW APS protection switching fails.
4. Cause 4: The equipment is faulty.
a. Check whether anomalies such as board faults or NE resetting exist on the opposite
equipment. If yes, rectify the anomalies.

----End

Related Information
None

15.3 Native Ethernet Service Troubleshooting


The Ethernet services based on the packet mode are transmitted over a packet switched network.
The faults of native Ethernet services include service interruption and service degradation.

Symptom
Symptom 1: Ethernet services are interrupted.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 243


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 15 Packet Service Problems

Symptom 2: Ethernet services are degraded. For example, they have great delays, packet loss,
or incorrect packets.

Impact on System
Ethernet services are interrupted or degraded.

Possible Causes
l Cause 1: Services are incorrectly configured or parameter settings at local equipment are
inconsistent with those at its interconnected equipment. Parameter settings include settings
for loopbacks on Ethernet ports, loopbacks on transmission lines, enabling statuses/
working modes of Ethernet ports, and flow control for Ethernet ports.
l Cause 2: The local/interconnected equipment is faulty or bit errors occur on lines.
l Cause 3: The network cable is faulty.
l Cause 4: The external electromagnetic interference is severe.

Tools, Equipment, and Materials


U2000 and test instruments

Procedure
Step 1 Troubleshoot an Ethernet service fault.
1. Cause 1: Services are incorrectly configured or parameter settings at local equipment
are inconsistent with those at its interconnected equipment. Parameter settings
include settings for loopbacks on Ethernet ports, loopbacks on transmission lines,
enabling statuses/working modes of Ethernet ports, and flow control for Ethernet
ports.
a. Check whether a loopback is configured for an Ethernet port or a transmission line.
If misoperations cause a port/line loopback, correct the settings. If no loopback is
configured, go to the next step.
b. Check whether Ethernet port settings regarding enabling status, working mode, and
flow control at local equipment are consistent with those at its interconnected
equipment. If inconsistent, correct the settings; if consistent, go to the next step.
c. Check whether the Ethernet protocol and Ethernet service configurations (especially
the attributes of the Ethernet port) are correct.

Step 2 Troubleshoot an abnormal RMON performance event.


1. Cause 1: Services are incorrectly configured or parameter settings at local equipment
are inconsistent with those at its interconnected equipment. Parameter settings
include settings for loopbacks on Ethernet ports, loopbacks on transmission lines,
enabling statuses/working modes of Ethernet ports, and flow control for Ethernet
ports.
a. Check whether working rate settings and working mode settings at local equipment
are consistent with those at its interconnected equipment.
2. Cause 2: The local/interconnected equipment is faulty or bit errors occur on lines.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 244


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 15 Packet Service Problems

a. Query the real-time performance measurement statistics of the Ethernet port with
reference to Feature Description.
b. Change the Ethernet port that receives services. If the RMON performance statistics
of the new port do not contain FCS errors, the hardware of the original port is faulty;
otherwise, the network port of the interconnected equipment may be faulty. To check
it out, go to the next step.
3. Cause 3: The network cable is faulty.
a. Check the network cable. If the network cable is not qualified, replace it with a new
one.
NOTE
When an Ethernet port works in auto-negotiation mode, its interconnected port must not work
in full duplex mode.
4. Cause 4: The external electromagnetic interference is severe.
a. Reduce external electromagnetic interference.

----End

Related Information
None.

15.4 PWE3 Ethernet Service Troubleshooting


This topic describes how to troubleshoot PWE3 Ethernet service interruption or packet loss in
terms of the symptoms, impact on the system, possible causes, tools required for troubleshooting,
troubleshooting procedure, and precautions that should be taken during the troubleshooting.

Symptoms
Symptom 1: The PWE3 Ethernet service is interrupted.

Symptom 2: The PWE3 Ethernet service loses packets or has errored packets.

Impact on the System


PWE3 Ethernet service interruption, PWE3 packet loss, or errored PWE3 packets may impact
the services that the equipment receives.

Possible Causes
l For symptom 1, the possible causes are as follows:
– Cause 1: The board hardware is faulty, the board temperature is above the upper
threshold, or the communication between boards fails. As a result, the board cannot
work properly.
– Cause 2: The PWE3 Ethernet port is misconnected and the port negotiation fails.
– Cause 3: Loopback is configured for the port.
– Cause 4: Dynamically learning the entries in the ARP table fails.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 245


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 15 Packet Service Problems

– Cause 5: The service configuration is incorrect.


l For symptom 2, the possible causes are as follows:
– Cause 1: The signals on the receive side are lost.
– Cause 2: The configured traffic limit is excessively low at the port and configurations
of the source and sink ports are inconsistent.

Tools, Instruments, and Materials


l U2000

Procedure
Step 1 For symptom 1: The PWE3 Ethernet service is interrupted.
1. Cause 1: The board hardware is faulty, the board temperature is above the upper
threshold, or the communication between boards fails. As a result, the board cannot
work properly.
a. Query the current alarms of the system to check for the HARD_BAD or
COMMUN_FAIL alarm and the board that reports the alarm.
b. Handle the HARD_BAD or COMMUN_FAIL alarm. For details, refer to the Alarms
and Performance Events Reference manual.
2. Cause 2: The PWE3 Ethernet port is misconnected and the port negotiation fails.
a. Check for the ETH_LOS alarm of the system and handle the ETH_LOS alarm.
3. Cause 3: Loopback is configured for the port.
a. Check for the LOOP_ALM alarm of the system and handle the LOOP_ALM alarm.
b. Check for the ETH_EFM_LOOPBACK alarm of the system and handle the
ETH_EFM_LOOPBACK alarm.
4. Cause 4: Dynamically learning the entries in the ARP table fails.
a. Check whether the ARP table contains the mapping relationship between the opposite
IP address and the opposite MAC address. If the ARP table does not contain the
mapping relationship, it indicates that the entries in the ARP table fail to be
dynamically learned.
b. Configure the static ARP table.
5. Cause 5: The service configuration is incorrect.
a. See the Configuration Guide to re-configure the services.

Step 2 For symptom 2: The PWE3 Ethernet service loses packets or has errored packets.
1. Cause 1: The signals on the receive side are lost.
a. Check for the ETH_LOS alarm of the system and handle the ETH_LOS alarm.
b. Check for the LSR_NO_FITED, LSR_WILL_DIE, or LASER_MOD_ERR alarm of
the system and handle the LSR_NO_FITED, LSR_WILL_DIE, or
LASER_MOD_ERR alarm.
2. Cause 2: The configured traffic limit is excessively low at the port and configurations
of the source and sink ports are inconsistent.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 246


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 15 Packet Service Problems

a. Check for the FLOW_OVER alarm of the system and handle the FLOW_OVER
alarm.

----End

Related Information
None

15.5 Tunnel APS Troubleshooting


This topic describes the symptom of common faults when the Tunnel APS function is enabled
and the solution to the relevant fault.

Symptom
When the network works normally, a condition that triggers the protection switching occurs on
the working tunnel. The services, however, fail to be automatically switched from the working
tunnel to the protection tunnel. Consequently, the services are interrupted. The symptoms are as
follows:

l The switching duration exceeds the specified value.


l The MPLS Tunnel APS protection group is configured incorrectly. Thus the protection
tunnel cannot receive the APS frame. In this case, the protection fails and the following
alarms may be reported.
– ETH_APS_PATH_MISMATCH
– ETH_APS_TYPE_MISMATCH
– ETH_APS_LOST
– ETH_APS_SWITCH_FAIL
l The protection tunnel is faulty and fails to provide protection. In this case, the following
alarms may be reported.
– MPLS_TUNNEL_LOCV
– MPLS_TUNNEL_UNEXPMEG
– MPLS_TUNNEL_UNEXPMEP
– MPLS_TUNNEL_SD
– MPLS_TUNNEL_SF
– MPLS_TUNNEL_AIS
– MPLS_TUNNEL_RDI
– TUNNEL_APS_DEGRADED

Possible Causes
l Cause 1: The fibers or cables on the protection tunnel are connected incorrectly.
l Cause 2: The OAM parameters of the working tunnel and protection tunnel are set
incorrectly.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 247


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 15 Packet Service Problems

l Cause 3: The parameter values of the APS protection group at the two ends are different
from each other.
l Cause 4: The MPLS Tunnel APS protocol is disabled.

Flow Chart
Figure 15-1 shows the flow chart for rectifying a fault when the MPLS Tunnel APS function is
enabled.

Figure 15-1 Flow chart for rectifying a fault when the MPLS Tunnel APS function is enabled

Start

Are the fibers or cables No Can the APS be Yes


Reconnect the fibers or
connected correctly? cables. performed normally?

No
Yes

No Set the working and Can the APS be Yes


Are the OAM parameters of the
working and protection protection tunnels correctly. performed normally?
tunnels set correctly?
No

Yes

Set the parameters for the Yes


Are the parameter values of the No Can the APS be
APS at the two ends
APS the same at the two ends? performed normally?
correctly.

No
Yes

No Enable the MPLS APS Can the APS be Yes


Is the MPLS APS protocol protocol. performed normally?
enabled?
No
Yes

Does any alarm related to


the APS occur?

Yes

See the Alarms and


Performance Events
Reference to clear the alarm.

No

Can the APS be Yes


performed normally?

No

Conact the technical support End


engineers of Huawei.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 248


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 15 Packet Service Problems

Procedure
l Cause 1: The fibers or cables on the protection tunnel are connected incorrectly.
1. Reconnect the fibers or cables to each port and ensure that the fibers or cables are
connected correctly.
l Cause 2: The working tunnel and protection tunnel are set incorrectly.
1. In the Main Topology, right-click the NE that you want to configure and choose NE
Explorer from the shortcut menu.
2. Choose Configuration > Packet Configuration > MPLS Management > Unicast
Tunnel Management in the Function Tree.
3. Click the OAM Parameters tab, select the tunnel that realizes the MPLS Tunnel APS
function. Then, set Detection Packet Period (ms) to 3.3.
l Cause 3: The parameter values of the APS protection group at the two ends are different
from each other.
1. In the Main Topology, right-click the NE that you want to configure and choose NE
Explorer from the shortcut menu.
2. Choose Configuration > Packet Configuration > APS Protection Management in
the Function Tree.
3. Click Query to query whether the settings of the APS protection group are the same.
If the settings of the APS protection group are different from each other, set the
protection tunnel in the APS protection group to the same value as the working tunnel.
l Cause 4: The MPLS Tunnel APS protocol is disabled.
1. In the Main Topology, right-click the NE that you want to configure and choose NE
Explorer from the shortcut menu.
2. Choose Configuration > Packet Configuration > APS Protection Management in
the Function Tree.
3. Set Protocol Status to Enabled, and then click Apply.
4. In the Operation Result dialog box that is displayed, click Close.
l If the preceding operations are performed but the alarm related to the MPLS Tunnel APS
function persists, clear the alarm. If the fault persists, contact Huawei technical support
engineers for the solution.

----End

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 249


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 16 Ethernet Problems

16 Ethernet Problems

About This Chapter

This topic describes how to troubleshoot Ethernet problems.

16.1 Service Degradation


This topic describes how to handle Ethernet service degradation.

16.2 LCAS Startup Failure


Learning the common fault symptom and the fault handling flow helps you to locate and rectify
the common faults quickly.

16.3 LPT Switching Failure


This topic describes how to handle LPT switching failure.

16.4 LB Test Failure


If the LB test fails, troubleshoot the failure according to the troubleshooting flow.

16.5 Test Frames Unavailability


This topic describes the possible causes and handling methods when the test frames are
unavailable.

16.6 Common LAG faults


Following a certain troubleshooting flow facilitates your fault locating and rectification.

16.7 Common MSTP faults


Following a certain troubleshooting flow can facilitate fault locating and rectification.

16.8 Ethernet Service Cases


Case analysis related to Ethernet Service cases.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 250


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 16 Ethernet Problems

16.1 Service Degradation


This topic describes how to handle Ethernet service degradation.

Symptom
Services are degraded.

System Impact
Services are affected.

Possible Causes
l Cause 1: The bandwidth is insufficient for heavy traffic or the bandwidth is sufficient but
the burst traffic is excessive.
l Cause 2: There is a fault at the physical layer.

Tools, Equipment, and Materials


U2000

Troubleshooting Workflow
Figure 16-1 illustrates the Ethernet service degradation troubleshooting process.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 251


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 16 Ethernet Problems

Figure 16-1 Ethernet service degradation troubleshooting flowchart

Start

The bandwidth is insufficient for Yes Check for RMON Yes


Are services
heavy traffic or the bandwidth is performance events of the
insufficient but the burst traffic is restored?
NE and handle them.
excessive.

No No

Yes Enable the flow control Yes


When the traffic is extremely Are services
heavy, the opposite equipment function for the equipment
does not respond. at both ends. restored?

No No

Yes Yes
The setting of a port attribute of Modify the Are services
the local equipment is different
from that of the opposite
mismatched setting. restored?
equipment.

No No

Yes Yes
Is there a fault at the Check for alarms and Are services
physical layer? handle them. restored?

No No

Contact Huawei
engineers. End

Procedure
l Cause 1: The bandwidth is insufficient for heavy traffic or the bandwidth is sufficient but
the burst traffic is excessive.
1. Check for RMON performance events of the corresponding NE and handle them. For
information about how to handle RMON performance events, refer to the Alarms and
Performance Events Reference.
l Cause 2: There is a fault at the physical layer.
1. Check for alarms. If an ETH_8B10B_ERR alarm is being reported, handle this alarm.
----End

16.2 LCAS Startup Failure


Learning the common fault symptom and the fault handling flow helps you to locate and rectify
the common faults quickly.

Fault Symptom
The common fault phenomena are as follows:

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 252


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 16 Ethernet Problems

l The source and sink ends fail to transmit control information. Hence, the negotiation
between the two ends is incorrect and the LCAS function fails to be enabled.
l If the bound path of the data board is configured before the SDH services from the data
board to the line board are configured, the LCAS function may fail to start immediately.

Possible Causes
The possible causes are as follows:

l The configuration timeslots of SDH services at the source and sink ends are not consistent.
l The timeslots actually bound to the data boards at the source and sink ends are abnormal
or lost.
l The LCAS function at the source and sink ends is not enabled.
l Excessive bit errors cause the failure of control information transmission between the
source and sink ends.

Tools, Equipment, and Materials


U2000

Flow for Handling Faults


If the LCAS function fails to successfully start, see Figure 16-2.

Figure 16-2 Flowchart for handling LCAS faults

Start

The timeslots configured No Reconfigure the timeslots of The LCAS Yes


at the source and sink SDH services at the function starts
ends are correct and source and sink ends successfully
consistent
No
Yes

The timeslots actually No Reconfigure the timeslots The LCAS Yes


No bound to the data boards at function starts
bound to the data boards at
the source and sink ends successfully
are normal or are not lost the source and sink ends

No

Yes

No Enable the LCAS function The LCAS Yes


The LCAS function is function starts
at the source and sink
enabled at the source and successfully
ends
sink ends
No

Yes

See the Alarms and Yes


No LCAS_FOPR or No Performance Events The LCAS function
LCAS_FOPT alarm is starts successfully
Reference handle the alarms
reported
No
Yes

Contact Huawei End


engineers for handling

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 253


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 16 Ethernet Problems

Procedure
l Cause 1: The timeslots configured at the source and sink ends are not consistent.
1. Right-click the relevant NE icon in the Main Topology, and then choose NE
Explorer.
2. Choose Configuration > SDH Service Configuration from the Function Tree.
3. Select the SDH service that is configured between the source and sink ends in the pane
on the right side.
4. Click the Deactivate button in the lower right corner.
5. In the Confirm dialog box that is displayed, click OK.
6. Right-click the SDH service that is configured between the source and sink ends, and
then choose Modify.
7. In the Modify SDH Service dialog box that is displayed, verify that the service level
and timeslots configured for the source and sink ends are consistent.
l Cause 2: The timeslots actually bound to the data boards at the source and sink ends are
abnormal or lost.
1. Right-click the relevant NE icon in the Main Topology, and then choose NE
Explorer.
2. Select the relevant data board in the Object Tree, and then choose Configuration >
Ethernet Interface Management > Ethernet Interface from the Function Tree in
the left pane.
3. Select Internal Port and click the Bound path tab.
4. Ensure that the activation status of the configured bound path in the bound path list is
"active".
5. Select the configured bound path, and then click the Configuration button in the lower
right corner in the pane on the right side.
6. In the Bound Path Configuration dialog box that is displayed, re-configure
Available Resources and Available Timeslots.
l Cause 3: The LCAS function is not enabled at the source and sink ends at the same time.
1. Right-click the relevant NE icon in the Main Topology, and then choose NE
Explorer.
2. Select the relevant data board in the Object Tree, and then choose Configuration >
Ethernet Interface Management > Ethernet Interface from the Function Tree
3. Select Internal Port and click the LCAS tab.
4. Find the relevant port, double-click on the column Enabling LCAS of which is set to
Disabled, and finally set Enabling LCAS to Enabled.
5. Click the Apply button in the lower right corner.
6. In the Operation Result dialog box that is displayed, click Close.
l Cause 4: Excessive bit errors cause the failure of control information transmission between
the source and sink ends.
NOTE
You can check whether the control information transmission between the source and sink ends fails
due to excessive bit errors by querying the alarm information on the NMS.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 254


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 16 Ethernet Problems

1. Right-click the relevant NE icon in the Main Topology, and then choose Browse
Current Alarms.
2. Check whether the VCAT_LOA alarm is contained in the alarm list.
3. See the Alarms and Performance Events Reference to troubleshoot the virtual
concatenation delay and excessive bit errors.

----End

16.3 LPT Switching Failure


This topic describes how to handle LPT switching failure.

Symptom
Services are interrupted.

Possible Causes
l Cause 1: The LPT function is disabled.
l Cause 2: The LPT bearer modes of all the NEs between the service access point and
intermediate network are inconsistent.

Tools, Equipment, and Materials


U2000

Procedure
Step 1 Cause 1: The LPT function is disabled.
1. In the NE Explorer, select the desired board and choose Configuration > Ethernet
Interface Management > LPT Management from the Function Tree.
2. Click Query and the Operation Result dialog box is displayed.
3. Check the LPT field of the board and ensure that the LPT function is enabled. Then click
Close.

Step 2 Cause 2: The LPT bearer modes of all the NEs between the service access point and intermediate
network are inconsistent.
1. In the NE Explorer, select the desired board and choose Configuration > Ethernet
Interface Management > LPT Management from the Function Tree.
2. Click Query and the Operation Result dialog box is displayed.
3. Check the Bearer Mode field of the board and ensure that the bearer modes of all nodes
are consistent. Then click Close.

----End

16.4 LB Test Failure


If the LB test fails, troubleshoot the failure according to the troubleshooting flow.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 255


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 16 Ethernet Problems

Fault Symptom
Common fault phenomena are listed as follows.

l LB test failure

Possible Causes
Possible causes of the LB test failure are listed as follows.
l The external physical port where the maintenance point resides is not enabled.
l Values of Port Type of the ports where the sink and source maintenance points reside are
inconsistent.
l When values of Port Type of the ports where the sink and source maintenance points reside
are UNI or PE, the TAG flags are inconsistent.
l The VLAN ID of the maintenance points is inconsistent with the VLAN ID of the service.
l The level of the source maintenance point is inconsistent with the level of the sink
maintenance point.
l MEPs or MIPs of a higher level exist between the source and sink maintenance points.
l The Ethernet service is a unidirectional service.
l The Ethernet service is interrupted.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 256


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 16 Ethernet Problems

Troubleshooting Flow

Figure 16-3 Troubleshooting flow for the LB test failure


Start

Is
the external No Enable the external
Is LB test Yes
physical port of the MP physical port of the
MP successful
enabled
No
Yes

Are the Make the types of the Yes


types of the ports No Is LB test
1 of the source and sink MPs ports of the source and successful
consistent sink MPs consistent
No
Yes

Port type
of the ports of the Yes No Make the TAG flags Is LB test Yes
Are TAG flags
source sink MPs is UNI consistent successful
and or PE consistent

Yes No
No

Are the
VLAN IDs of the No Make the VLAN ID Is LB test Yes
MPs and the service consistent successful
consistent
No
No

Are Make levels of the


levels of the source No Is LB test Yes
source and sink MP
and sink MP consistent successful
consistent
No
No
2
Are
there MEPs or MIPs Yes Degrade the level of Yes
of a higher level between the Is LB test
source and sink the MEPs or MIPs successful
MPs
No
No

No Configure the service Is LB test Yes


Is the Ethernet service
as a bidirectional successful
bidirectional
Ethernet service
No
Yes

Refer to the
3 Ethernet service is Troubleshooting for Is LB test Yes
interrupted the handling successful

No

1 Port type of the port where the MP resides may be improperly configured Contact Huawei
engineers
2 MP attributes may be improperly configured
End
3 The Ethernet service may be interrupted

Procedure
Step 1 Cause 1: The external physical port where the maintenance point resides is not enabled.
1. In the Main View, right-click the required NE icon and choose NE Explorer from the
shortcut menu.
2. In the Object Tree, select the required Ethernet service processing board. Choose
Configuration > Ethernet Interface Management > Ethernet Interface from the
Function Tree. Click .

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 257


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 16 Ethernet Problems

3. Select External Port.


4. Click the Basic Attributes tab. Check whether Enabled/Disabled of the port where the
maintenance point resides is Enabled.

Step 2 Cause 2: Values of Port Type of the ports where the sink and source maintenance points reside
are inconsistent.
1. Select External Port or Internal Port.
2. Click the Network Attributes tab. Check whether the values of Port Type of the ports are
consistent.

Step 3 Cause 3: When values of Port Type of the ports where the sink and source maintenance points
reside are UNI or PE, the TAG flags are inconsistent.
1. Click the TAG Attribute tab. Check whether the values of TAG of the ports are consistent.
NOTE

The TAG parameter is valid only when the value of Port Type is UNI or PE.

Step 4 Cause 4: The VLAN ID of the maintenance points is inconsistent with the VLAN ID of the
service.
1. In the Object Tree, select the required Ethernet service processing board. Choose
Configuration > Ethernet Maintenance > Ethernet Service OAM from the Function
Tree. Click .
2. Click Query to check whether the VLAN ID of the maintenance points is consistent with
the VLAN ID of the service.

Step 5 Cause 5: The level of the source maintenance point is inconsistent with the level of the sink
maintenance point.
1. Click Query to check whether the level of the source maintenance point is inconsistent
with the level of the sink maintenance point.

Step 6 Cause 6: MEPs or MIPs of a higher level exist between the source and sink maintenance points.
1. Click Query to check whether any MEP or MIP of a higher level exists between the source
and sink maintenance points.

Step 7 Cause 7: The Ethernet service is a unidirectional service.


1. In the Object Tree, select the required Ethernet service processing board. Choose
Configuration > Ethernet Service from the Function Tree.

2. Select Ethernet Line Service or Ethernet LAN Service. Then, click .


3. Click Query to check whether the Ethernet service is a unidirectional service.
NOTE

The LB test is based on bidirectional Ethernet services.

Step 8 Cause 8: The Ethernet service is interrupted.


1. Check and handle the Ethernet service fault according to the Troubleshooting.

----End

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 258


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 16 Ethernet Problems

16.5 Test Frames Unavailability


This topic describes the possible causes and handling methods when the test frames are
unavailable.

Symptom
The test frame cannot function normally.

System Impact
None.

Possible Cause
l Cause 1: The timeslots in the cross-connection services of the interconnected ends are
inconsistent.
l Cause 2: The bearing modes of the VC trunk ports at the two interconnected ends are
inconsistent.
l Cause 3: The logic port on which the function of test frame is enabled is not bound with
timeslots.

Tools, Equipment, and Materials


U2000

Procedure
l Cause 1: The timeslots in the cross-connection services of the interconnected ends are
inconsistent.
1. Check whether the timeslots in the cross-connection services of the interconnected
ends are consistent. If they are inconsistent, set them to the same.
l Cause 2: The bearing modes of the VC trunk ports at the two interconnected ends are
inconsistent.
1. Check whether the bearing modes of the VC trunk ports at the two interconnected
ends are consistent. If they are inconsistent, set them to the same.
l Cause 3: The logic port on which the function of test frame is enabled is not bound with
timeslots.
1. Check whether the logic port on which the function of test frame is enabled is bound
with timeslots.

----End

16.6 Common LAG faults


Following a certain troubleshooting flow facilitates your fault locating and rectification.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 259


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 16 Ethernet Problems

Symptom
The common LAG faults are as follows:

l The member ports become unavailable, because the LAG is faulty. As a result, the services
are interrupted.
l Certain packets are discarded, because the member ports in the LAG become unavailable.

Possible Causes
l The NEs at both ends of the LAG are configured incorrectly.
l The member ports in the LAG work in half-duplex mode.
l Loopbacks are performed on the member ports in the LAG.
l The member ports in the LAG are disconnected.

Flow Chart
Following a certain troubleshooting flow facilitates your fault locating and rectification.

NOTE

This topic describes the process for rectifying a LAG fault when the LAG function is enabled in TDM
mode. This topic also provides a reference for you to rectify a LAG fault because the process for rectifying
a LAG fault when the LAG function is enabled in packet mode is similar.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 260


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 16 Ethernet Problems

Figure 16-4 Flow chart for rectifying a LAG fault

Start

The Yes
LAG_FAIL Handle the
alarm occurs? alarm

No

The Yes Modify the


configurations of
both NEs at both configurations of
ends are both NEs
incorrect?
No

Yes Modify the working


The port works
in half-duplex mode of the port to
mode? full-duplex

No

The
Yes
LOOP_ALM or Release the
ETH_EFM_LOOPBACK loopback at the port
alarm occurs?

No

The ETH_LOS or Yes


Rectify the connection
ETH_LINK_DOWN fault at the port to
alarm occurs? clear the alarm

No
The fault is
rectified?

Yes

Contact Huawei
technical support
engineers to handle the End
fault

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 261


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 16 Ethernet Problems

Procedure
Step 1 Cause 1: The NEs at both ends of the LAG are configured incorrectly.
1. On the U2000, check whether the LAG_DOWN or LAG_MEMBER_DOWN alarm is
reported.
2. If yes, see the Alarms and Performance Events Reference to clear the alarm. Then, check
whether the fault is rectified.

Step 2 Cause 2: The member ports in the LAG work in half-duplex mode.
1. Check whether the member ports in the LAG work in half-duplex mode.
2. If yes, change the working mode to full-duplex and ensure that the ports on the local and
opposite NEs work in the same mode.

Step 3 Cause 3: Loopbacks are performed on the member ports in the LAG.
1. Check whether the LOOP_ALM or ETH_EFM_LOOPBACK alarm is reported on the
member ports in the LAG.
2. If yes, release the loopback status to clear the LOOP_ALM or ETH_EFM_LOOPBACK
alarm.

Step 4 Cause 4: The member ports in the LAG are disconnected.


1. Check whether the ETH_LOS or ETH_LINK_DOWN alarm is reported on the member
ports in the LAG.
2. If yes, clear the alarm. Then, check whether the fault is rectified.

Step 5 If the fault persists, contact the technical support engineers of Huawei.

----End

16.7 Common MSTP faults


Following a certain troubleshooting flow can facilitate fault locating and rectification.

Context
Figure 16-5 shows the flow for rectifying a fault that occurs when the MSTP function is used.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 262


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 16 Ethernet Problems

Figure 16-5 Troubleshooting flowchart

Start

Check the information in the


MAC address tables of all the
equipment on the network

Check the parameter


configurations of the bridge

Check the parameter


configurations of the
Ethernet port

Check the configurations of


the MSTP function for the
other equipment

Check the VLAN


configuration of the network

End

Procedure
Step 1 Check the information in the MAC address tables of all the equipment on the network.
1. Check the information in the MAC address tables of all the equipment on the network. If
a MAC address corresponds to several ports, loops may exist. In this case, cancel any
loopbacks.

Step 2 Check the parameter configurations of the bridge.


If a fault occurs when the MSTP function is used, check whether the parameter configurations
of the MSTP bridge are incorrect. If this is the case, modify the parameter configurations to
rectify the fault.
1. Check whether the MSTP function is disabled. If this is the case, enable it.
2. Check whether the equipment works in MSTP mode or STP mode.
The OptiX OSN equipment supports the MSTP protocol, which is compatible with the STP
and RSTP protocols.
3. Check whether the priorities are correctly set for the equipment.
Modify the priorities of the bridges so that one bridge is selected as the root bridge. If the
priorities of the bridges are set incorrectly, the root bridge is incorrectly selected.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 263


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 16 Ethernet Problems

4. Check whether the values of the Hello Time, Forward Delay, and Max Age parameters
are set correctly.
l If the Hello Time parameter is set to an extremely large value, the change in network
topology may be responded in a delayed manner. If the Hello Time parameter is set to
an extremely small value, the configuration message is frequently transmitted, and thus
more CPU and network resources are consumed.
l If the Forward Delay parameter is set to an extremely large value, the spanning tree
converges slowly. If the Forward Delay parameter is set to an extremely small value,
temporary loops may exist when the network topology changes.
l If the Max Age parameter is set to an extremely large value, the fault on the link fails
to be detected. If the Max Age parameter is set to an extremely small value, the switch
incorrectly considers that a fault occurs on the link in the case of network congestion,
and thus the spanning tree is frequently computed.
NOTE
It is recommended that the three parameters take the default values.

Step 3 Check the parameter configurations of the Ethernet port.


If a fault occurs when the MSTP function is used, check whether the parameter configurations
of the Ethernet port are incorrect. If this is the case, modify the parameter configurations to
rectify the fault.
1. Check whether the MSTP function is enabled for the Ethernet port.
2. Check whether the setting of the point-to-point link status for the Ethernet port is consistent
with the actual point-to-point link status.
If the Ethernet port is set as a point-to-point link port whereas the actual situation is
inconsistent, temporary loops may exist when the network topology changes.

Step 4 Check the configurations of the MSTP function for the other equipment.
If a fault occurs when the MSTP function is used, check the configurations of the MSTP function
for the other equipment on the network. The MSTP protocol should be enabled for all the
equipment on the network (at least on one ring). Otherwise, the equipment on the network fails
to be informed of the network topology by exchanging the messages, and thus the learning of
the spanning tree fails.
1. Check whether the root bridge of the entire network is correctly selected.
In general cases, you can set the intermediate LanSwitch as the root bridge by modifying
the priorities of bridges. If the intermediate LanSwitch is not set as the root bridge, the
traffic on the entire network is not balanced. In addition, certain services become
unavailable.
2. Check whether the bridge parameters and port parameters of all the equipment are set
correctly. In general cases, the Hello Time, Forward Delay, and Max Age parameters
take the parameters values of the root bridge.

Step 5 Check the VLAN configuration of the network.


If a fault occurs when the MSTP function is used, check whether the VLAN configuration of
the network is correct. Mainly check whether the service is unavailable due to the blocking of
the Ethernet port.

----End

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 264


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 16 Ethernet Problems

16.8 Ethernet Service Cases


Case analysis related to Ethernet Service cases.

NOTE

The following cases are related to the OptiX WDM product series.

Related Cases:
l MC-A4 The LOG Board Fails to Interwork With the FDG Board on the Client Sides. The
LOG board reports the R_LOS alarm on the client side. The FDG board reports the
LINK_STATUS alarm.
l MC-A5 The LQG Board Reports the ALM_DATA_RLOS and ALM_DATA_TLOS
Alarms Transiently
l MC-A6 The LDG Board Keeps Reporting the ALM_DATA_TLOS and
ALM_DATA_RLOS Alarms Transiently
l MC-A17 The LQG Reports ALM_DATA_RLOS and ALM_DATA_TLOS Alarms
l MC-A30 LDG Board of Metro 6100 Equipment Reports the INBADOCTS_OVER Alarm
l MC-A96 Packet Loss Occurs in Ethernet Service Testing of the L4G Board
l MC-A117 Analysis and Handling of the ALM_DATA_RLOS and ALM_DATA_TLOS
Alarms Reported by the LBE
l MC-A158 How to Use the Ethernet Test Frames on the OptiX OSN 6800
l MC-A219 EPL Service Is Interrupted Due to Mismatched Service Modes of the L4G
Boards, Sites Report The R_LOF, ODU5G_PM_AIS, OTU5G_LOF And LINK_ERR
Alarms

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 265


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 17 ECC Problems

17 ECC Problems

About This Chapter

In a WDM system, communication between the U2000 and non-gateway NEs is established
with the U2000 first communicating with the gateway NE over TCP/IP. Then the gateway NE
communicates with non-gateway NEs through the embedded control channel (ECC) to complete
the communication channel.

17.1 ECC Communication Interruption


This topic describes how to handle ECC interruption.

17.2 Intermittent ECC Communication


This topic describes how to handle intermittent ECC communication.

17.3 HWECC FAQ


This topic provides answers to frequently asked questions about the HWECC solution.

17.4 Troubleshooting for Inband DCN


A troubleshooting flow facilitates fault locating and rectification.

17.5 ECC Cases


Case analysis related to ECC cases.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 266


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 17 ECC Problems

17.1 ECC Communication Interruption


This topic describes how to handle ECC interruption.

Symptom
The services on the main optical path are normal, but ECC communication is interrupted and
the NE cannot be reached by the U2000.

System Impact
The U2000 cannot reach the NE.

Possible Causes
l Cause 1: The ID of the NE is a duplicate of another NE.
l Cause 2: The pigtail connected to the OSC board is faulty.
l Cause 3: The system control board is faulty.
l Cause 4: The OSC board is faulty.
l Cause 5: The OTU or tributary or line board is faulty.
l Cause 6: Coherent and non-coherent boards are interconnected.

Tools, Equipment, and Materials


WEB LCT, optical power meter

Procedure
l Cause 1: The ID of the NE is a duplicate of another NE.
1. Log in to the NE on the WEB LCT and modify the NE ID of the NE according to the
record.
l Cause 2: The pigtail connected to the OSC board is faulty.
1. Check the pigtail connected to the OSC board on the first faulty NE along the signal
flow. If the pigtail is loose, secure it; if it is dirty, clean it; if it is faulty, replace it.
2. Ensure that the pigtail is properly connected to the OSC board on the faulty NE.
NOTE

During deployment commissioning, the pigtail might be incorrectly connected because the FIU
board and the OSC board have many ports marked with similar names. During commissioning,
to prevent incorrect pigtail connections in one direction of the OSC board, you can connect an
optical attenuator between the currently unused RM and TM optical ports in the other direction
of the OSC board to form a self-loop.
l Cause 3: The system control board is faulty.
1. Reset the system control board on the faulty NE along the signal flow, and then check
whether you can log in to the NE.
2. If you cannot log in to the NE, remove the system control board on the faulty NE so
the ECC signals pass through the backplane. Then check whether the downstream NE

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 267


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 17 ECC Problems

is restored. If the downstream NE is restored, the system control board is faulty.


Replace the system control board.
l Cause 4: The OSC board is faulty.
1. Replace the OSC board on the first faulty NE along the signal flow, and check whether
the network is restored. If the network is restored, the OSC board is faulty. Replace
the OSC board.
l Cause 5: The OTU or tributary or line board is faulty.
1. Replace the OTU, or tributary or line board on the first faulty NE along the signal
flow, and check whether the network is restored. If the network is restored, the OTU,
or tributary or line board is faulty. Replace the OTU, or tributary or line board.
l Cause 6: Coherent and non-coherent boards are interconnected.
1. The DCN bytes are defined differently for coherent and non-coherent services, so the
two types of services cannot be interconnected. If DCN channels are unavailable,
check whether the interconnected boards are coherent boards.

----End

17.2 Intermittent ECC Communication


This topic describes how to handle intermittent ECC communication.

Symptom
The ECC communication is intermittent, and the U2000 is frequently unable to reach the NE.

System Impact
The U2000 cannot reach the NE.

Possible Causes
l Cause 1: Clock tracing is set incorrectly.
l Cause 2: There are bit errors on the signal transmission.

Tools, Equipment, and Materials


WEB LCT

Procedure
l Cause 1: Clock tracing is set incorrectly.
1. Set clock tracing to the correct source. For details on how to set the correct clock
tracing, see "Synchronization".

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 268


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 17 ECC Problems

NOTE

All equipment on the network must trace the same clock source. In the case of WDM equipment,
the internal clock source of a specific NE should be given the highest priority, and the other
NEs trace the internal clock source of this NE. (In the case of chain networking, the internal
clock source of a terminal NE rather than an intermediate NE should be set as the clock source
so it can be traced network-wide.)
l Cause 2: There are bit errors on the signal transmission.
1. Refer to rectifying bit errors.

----End

17.3 HWECC FAQ


This topic provides answers to frequently asked questions about the HWECC solution.

Q: Why does the U2000 always fail to log in to an NE?

A: Common causes include:


l Communication between the U2000 and the GNE is faulty.
To locate the fault, run the ping or tracert command on the U2000 server.
l The ECC route between the GNE and a non-GNE is faulty.
To locate the fault, check the ECC route between the GNE and the non-GNE.
l NE IDs conflict.

Q: Why does the U2000 frequently fail to log in to NEs?

A: Common causes include:


l The U2000 is faulty.
In this case, the U2000 fails to log in to all NEs.
l The IP addresses of GNEs conflict.
In this case, the U2000 fails to log in to all NEs on a DCN subnet.
l The scale of a DCN subnet is excessively large.
In this case, the U2000 fails to log in to NEs that access multiple data communications
channels (DCCs).

Q: Why does the system control board board on the GNE or the NE that uses the extended
ECC frequently reset?

A: Common causes include:


l Unknown equipment is connected to the LAN to which the NE is connected, resulting in
a conflict between the NE and the equipment.
l A loop (especially between the Ethernet NM port and NE cascading port) occurs in the
LAN to which the NE is connected, resulting in a network storm.

Q: What are the hazards of a DCN subnet of excessively large scale?

A: Main hazards include:


l The stability of the ECC route is poor, the convergence time is long, and ECC route flapping
can occur.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 269


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 17 ECC Problems

l Remote loading occurs slowly.


l Alarms reported to the U2000 are lost.
l The U2000 cannot log in to certain NEs.
l The system control boards of certain NEs reset unexpectedly.

Q: Why are NEs prone to be unreachable by the U2000 due to bit errors during deployment
and expansion?

A:

If the OSC and ESC functions are used together, bit errors are often generated during deployment
and expansion because services transmitted along with ESC information are being
commissioned. Once bit errors are generated, the network frequently switches between the OSC
and ESC channels, and consequently NEs on the network become unreachable by the U2000,
leading to NE configuration and management failures. To prevent this problem, disable the ESC
function during deployment and expansion, and enable it again when deployment and expansion
are complete.

17.4 Troubleshooting for Inband DCN


A troubleshooting flow facilitates fault locating and rectification.

Symptom
The common faults regarding the inband DCN are as follows:

l The communication between the U2000 and NEs is interrupted, and the NE icons on the
U2000 become unavailable.
l No response is returned after a command is issued on the U2000. If no response is returned
more than 2 minutes later, the communication between the U2000 and an NE is interrupted.
l Part of the query result on the U2000 is missing.

Impact on System
l After the communication between an NE and the U2000 is interrupted, the NEs that
communicate with the U2000 through the NE also become unreachable, if they cannot
connect to the U2000 by other methods. The other NEs, however, are not affected.

Possible Causes
l Cause 1: The physical link between the faulty NE and the U2000 is disconnected.
l Cause 2: The ports on the faulty NE are not enabled with the inband DCN function.
l Cause 3: The DCN VLAN IDs of the NEs at both ends are not the same.
l Cause 4: The ports connecting the local and opposite NEs use different protocols.
l Cause 5: The faulty NE fails to obtain DCN packets due to loss of received signals or very
low receive optical power.
l Cause 6: A board is faulty.
l Cause 7: No response is returned for inband DCN packets, because the SCC board on the
faulty NE is being reset or active/standby protection switching occurs on another board.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 270


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 17 ECC Problems

l Cause 8: The bandwidth of the inband DCN channel is very low or other services preempt
the bandwidth for the inband DCN channel.
NOTE

When handling a fault regarding the inband DCN, do as follows:


l If communication between NEs and the U2000 is interrupted, rectify the fault on the gateway NE
prior to the fault on the non-gateway NE.
l If the communication between NEs and the U2000 is normal, rectify the fault on the non-gateway
NE prior to the fault on the gateway NE. This method ensures that the unaffected NEs can keep
communicating with the U2000 during the troubleshooting process.

Tools, Equipment, and Materials


U2000

Troubleshooting Flow
A troubleshooting flow facilitates fault locating and rectification.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 271


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 17 ECC Problems

Figure 17-1 Flow chart for handling an inband DCN fault

Start

Yes The communication Yes Reconnect the


Check whether the Is the physical link
between the U2000 and Ethernet cable or
NE icon is dimmed. disconnected?
the NE is interrupted. fiber.

No

Yes
Is the port on the Enable the port
DCN disabled? on the DCN.

No

Yes
Is the received Clear the alarm
No
signal lost? related to the optical
power, fiber, or cable.

No

Yes
Is the board Replace the board.
faulty?

No response is Yes
否returned for the The SCC board is Wait until the SCC
command on the being reset. board is reset.
U2000.

No

Yes
Is the information The bandwidth Increase the bandwidth of the
queried on the of the DCN path DCN path or degrade the
U2000 lost? is very low. priority of other services.

No
Is the fault
rectified?

Yes

Contact Huawei technical End


support engineers.

Procedure
Step 1 Cause 1: The physical link between the faulty NE and the U2000 is disconnected.
1. Check whether the Ethernet cable or fiber of the faulty NE is disconnected from the port.
If yes, reconnect it.

Step 2 Cause 2: The ports on the faulty NE are not enabled with the inband DCN function.
1. Check whether the DCN function is enabled by default for the port that is connected to the
fiber. If not, connect the fiber to a port with the DCN function enabled by default.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 272


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 17 ECC Problems

2. Check whether the inband DCN function is enabled for the ports at both ends of the link.
If not, enable DCN access function for the ports.
Step 3 Cause 3: The DCN VLAN IDs of the NEs at both ends are not the same.
1. Check whether the DCN VLAN ID of the local NE is the same as that of the opposite NE.
If not, change the DCN VLAN ID of the local NE to the DCN VLAN ID of the opposite
NE.
Step 4 Cause 4: The ports connecting the local and opposite NEs use different protocols.
1. Check whether the port on the local NE uses the same protocol as the port on the opposite
NE. If not, change the protocol to ensure that the ports connecting the local and opposite
NEs use the same protocol.
Step 5 Cause 5: The faulty NE fails to obtain DCN packets due to loss of received signals or very low
receive optical power.
1. Check whether alarms, such as ETH_LOS, ETH_LINK_DOWN, or IN_PWR_LOW, are
reported on the board that is configured with the inband DCN channel. If yes, see the Alarms
and Performance Events Reference to clear these alarms.
Step 6 Cause 6: A board is faulty.
1. Check whether the HARD_BAD alarm is reported on the board that is configured with the
inband DCN channel. If yes, see the Alarms and Performance Events Reference to clear
the alarm.
Step 7 Cause 7: No response is returned for inband DCN packets, because the SCC board on the faulty
NE is being reset or active/standby protection switching occurs on another board.
1. Check whether the PROG indicator on the system control and communication (SCC) board
is blinking green. If yes, the SCC board is being reset. When the PROG indicator is steady
green, the reset of the SCC board is completed. Then, the NE is automatically reconnected
to the DCN.
2. If no response is returned for DCN packets, check whether active/standby protection
switching occurs on another board, which results in rerouting of DCN packets.
3. If active/standby protection switching occurs on a board, the NE automatically responds
to DCN packets after rerouting of DCN packets is completed.
Step 8 Cause 8: The bandwidth of the inband DCN channel is very low or other services preempt the
bandwidth for the inband DCN channel.
1. If the number of services that are configured for the port exceeds the specified value, part
of the query result may be missing. To resolve the problem, increase the bandwidth of the
inband DCN channel properly. For details, see Setting the VLAN ID and Bandwidth Used
by the Inband DCN.
Step 9 If the fault persists, contact Huawei to handle the fault.

----End

17.5 ECC Cases


Case analysis related to ECC cases.

NOTE

The following cases are related to the OptiX WDM product series.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 273


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 17 ECC Problems

Related Cases:
l MC-A22 NE ESC Communication Interrupted Because of the Closure of the OTU Laser
l MC-A64 Many BD_STATUS Alarms Occur Due to the ECC Storm
l MC-A81 The Created ALC Link on OptiX OSN 6800 Cannot Be Automatically Adjusted
l MC-A110 Question About the Extended ECC of WDM NEs
l MC-A113 Communication Between the WDM Equipment And the DCN Network Fails
After a Hub Is Connected To the DCN Network
l MC-A135 The Communication Between the NE and the T2000 Stops Because the NE
Software Cannot Switch the ECC from an OSC Board to an OTU Board Automatically
l MC-A181 An NE Is Frequently Unreachable to the NMS Due to Insufficient Processing
Capacity of a Router
l MC-A247 NE Configuration Data Cannot Be Uploaded

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 274


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 18 Clock Problems

18 Clock Problems

About This Chapter

This section describes the possible causes of the synchronization failure in the clock, and the
troubleshooting procedure.

18.1 Failure of NodeBs to Trace Clock Signals


When a fiber on the path for transmitting IEEE 1588v2 clock signals is faulty, compensate for
the delay after rectifying the fault. Otherwise, the downstream NodeBs cannot trace the clock
signals.

18.2 Synchronization Failure of the Physical-Layer Clock


This section describes the possible causes of the synchronization failure in the physical-layer
clock, and the troubleshooting procedure.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 275


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 18 Clock Problems

18.1 Failure of NodeBs to Trace Clock Signals


When a fiber on the path for transmitting IEEE 1588v2 clock signals is faulty, compensate for
the delay after rectifying the fault. Otherwise, the downstream NodeBs cannot trace the clock
signals.

Symptom
After a fiber fault is rectified, the delay is not compensated. As a result, the downstream NodeBs
cannot trace the clock signals.

Impact on System
If protection is configured, the IEEE 1588v2 clock signals are switched from the working path
to the protection path. If no protection is configured, the IEEE 1588v2 clock signals are
interrupted and the downstream NodeBs cannot perform clock synchronization.

Tools, Equipment, and Materials


U2000 and time testing meter

Fault Handling on a Ring Network Configured with Protection


As shown in the following figure, the fiber between NE2 and NE3 on the working path is faulty.
IEEE 1588v2 clock signals are switched to protection path NE1–NE4–NE3.

Figure 18-1 Networking diagram of the ring network configured with protection
NE1 NE2
East West

BITS West East


Convergence
layer NodeB
East West

West East
NE4 NE3 NE5

: IEEE 1588v2 working path

: IEEE 1588v2 protection path

OptiX PTN
BITS OptiX OSN equipment equipment NodeB

Compensate for the delay on the faulty fiber as follows:

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 276


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 18 Clock Problems

1. Delete the west IEEE 1588v2 port on NE3 so that the clock signals will not be immediately
switched to the working path after the fiber between NE2 and NE3 is repaired.
2. Repair the faulty fiber between NE2 and NE3.
3. Check whether the time precision on NE3 and that on NE4 satisfy requirements. For the
time precision requirements, see Requirements on Clock Synchronization. If the time
precision requirements are not satisfied, the path between NE3 and NE4 is faulty. When
this occurs, remove the fault on the path.
4. Create the west IEEE 1588v2 port that is deleted in step 1 to switch the IEEE 1588v2 clock
signals to working path NE1–NE2–NE3.
5. Record the time precision on NE2 and that on NE3. Then calibrate the original compensated
delay on NE2 and that on NE3 according to the time precision on NE2 and the time precision
on NE3. For the delay compensation method, see Testing Delay Compensation.
6. Ensure that the time precision on NE2 and that on NE3 satisfy requirements.
7. Verify that the status of all PTP clock ports is normal.

Fault Handling on a Chain Network Configured with Protection


On a chain network configured with protection, IEEE 1588v2 clock signals are switched to the
protection path after a fiber on the working path is faulty. Use the same method as that used on
a ring network configured with protection to compensate for the delay on the faulty fiber.

Fault Handling on a Network Not Configured with Protection


On a network not configured with protection, the downstream NodeBs cannot perform clock
synchronization after a fiber on the working path is faulty. After the faulty fiber is repaired,
compensate for the delay on the faulty fiber by performing 5 to 7 described in the scenario of a
ring network configured with protection.

18.2 Synchronization Failure of the Physical-Layer Clock


This section describes the possible causes of the synchronization failure in the physical-layer
clock, and the troubleshooting procedure.

Symptom
In the case of a clock processing failure, the EXT_TIME_LOC, SYNC_C_LOS,
S1_SYN_CHANGE, LTI, SYN_BAD, EXT_SYNC_LOS, or CLK_NO_TRACE_MODE
alarm occurs.

When you clear the alarm reported by the equipment, the fault is rectified.

Impact on System
When the clock source of the network is lost or degrades, the services that trace the clock source
are affected. In this case, pointer justification occurs and BER increases.

Possible Causes
l Cause 1: In the priority table, the synchronous time source for the service board is lost.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 277


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 18 Clock Problems

l Cause 2: The synchronous clock source is lost and the clock of the NE is in an abnormal
state.
l Cause 3: In SSM mode, the clock source is switched and the clock source traced by the NE
is also switched.
l Cause 4: The signals of the synchronous clock source are degraded.
l Cause 5: The external clock source is lost.
l Cause 6: The clock enters the non-tracing working mode.

Tools, Equipment, and Materials


U2000

Procedure
l Cause 1: In the priority table, the synchronous time source for the service board is lost.
1. Check for the SYNC_C_LOS alarm of the STG board on U2000. For details, refer to
"Querying the Current Alarms and Performance Events of a Component on the
U2000" in Supporting Tasks.
2. If the SYNC_C_LOS alarm occurs, handle the alarm according to the Alarms and
Performance Events Reference.
l Cause 2: The synchronous clock source is lost and the clock of the NE is in an abnormal
state.
1. Check for the LTI alarm of the STG board on U2000. For details, refer to "Querying
the Current Alarms and Performance Events of a Component on the U2000" in
Supporting Tasks.
2. If the LTI alarm occurs, handle the alarm according to the Alarms and Performance
Events Reference.
l Cause 3: In SSM mode, the clock source is switched and the clock source traced by the NE
is also switched.
1. Check for the S1_SYN_CHANGE alarm of the STG board on U2000. For details,
refer to "Querying the Current Alarms and Performance Events of a Component on
the U2000" in Supporting Tasks.
2. If the S1_SYN_CHANGE alarm occurs, handle the alarm according to the Alarms
and Performance Events Reference.
l Cause 4: The signals of the synchronous clock source are degraded.
1. Check for the SYN_BAD alarm of the STG board on U2000. For details, refer to
"Querying the Current Alarms and Performance Events of a Component on the
U2000" in Supporting Tasks.
2. If the SYN_BAD alarm occurs, handle the alarm according to the Alarms and
Performance Events Reference.
l Cause 5: The external clock source is lost.
1. Check for the EXT_SYNC_LOS alarm of the STG board on U2000. For details, refer
to "Querying the Current Alarms and Performance Events of a Component on the
U2000" in Supporting Tasks.
2. If the EXT_SYNC_LOS alarm occurs, handle the alarm according to the Alarms and
Performance Events Reference.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 278


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 18 Clock Problems

l Cause 6: The clock enters the non-tracing working mode.


1. Check for the CLK_NO_TRACE_MODE alarm of the STG board on U2000. For
details, refer to "Querying the Current Alarms and Performance Events of a
Component on the U2000" in Supporting Tasks.
2. If the CLK_NO_TRACE_MODE alarm occurs, handle the alarm according to the
Alarms and Performance Events Reference.

----End

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 279


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 19 Orderwire Problems

19 Orderwire Problems

About This Chapter

This topic describes how to handle orderwire problems.

19.1 Orderwire Failure


This topic describes the possible causes and troubleshooting methods for orderwire failures that
occur when the services on the main optical path are working normally.

19.2 A Called Phone Set Cannot Exit the Conference Call State
This topic describes the possible cause and troubleshooting method when a called phone set in
a conference call fails to exit the conference call state.

19.3 Orderwire Cases


Case analysis related to Orderwire cases.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 280


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 19 Orderwire Problems

19.1 Orderwire Failure


This topic describes the possible causes and troubleshooting methods for orderwire failures that
occur when the services on the main optical path are working normally.

Symptom
The services on the main optical path from station A to station C are normal, but the orderwire
is unavailable.

Figure 19-1 shows an example of the network configuration in this case.

Figure 19-1 Network configuration in the case of an orderwire failure

OLA

OTM OTM

Station A Station B Station C

System Impact
The services are not affected.

Possible Causes
l Cause 1: The parameters are incorrectly configured.
l Cause 2: The setting of the phone set is incorrect.
l Cause 3: The phone set or phone line is faulty.
l Cause 4: The fiber connection of the SC2 board at station B is incorrect.
l Cause 5: The system control board at station B is faulty.

Tools, Equipment, and Materials


U2000

Procedure
l Cause 1: The parameters are incorrectly configured.
1. On the U2000, check the phone numbers of stations A and C. Ensure that each phone
number is correctly set to a unique value. For details, refer to "Configuring Orderwire".

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 281


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 19 Orderwire Problems

2. On the U2000, check the setting of the OSC boards at stations A, B, and C. Set
orderwire bytes to pass through each station; otherwise, the orderwire of station A and
that of station C are unavailable.
3. On the U2000, check the clock tracing settings of stations A and C. For details on how
to set the correct clock tracing, see "Setting the Priority List of Clock Sources" or
"Setting IEEE 1588v2" in the Configuration Guide.
l Cause 2: The setting of the phone set is incorrect.
1. Check the port to which the phone set is connected and ensure that the interface is
correctly selected.
NOTE

Generally, the orderwire phone line is inserted on the "PHONE1" port of the subrack.
2. Check whether the ringing current switch RING on the phone set is set to ON
(indicating that the phone rings when there is an incoming call).
3. Check whether the dialing mode switch is set to T (indicating the dual-tone
multifrequency (DTMF) dialing mode).
4. Check whether the called orderwire phone is on hook. If the red indicator in the upper
right corner on the front panel of the phone set is unlit, the phone set is on hook. If the
red indicator is lit, the phone set is not on hook. Press the "TALK" button on the front
panel of the phone set to hook it up.
l Cause 3: The phone set or phone line is faulty.
1. Install a normal phone set at stations A and C and test the calling function of the phone
set. If the function is available, the original phone set is faulty.
2. Replace the phone line between the phone set and the equipment at stations A and C,
and test the calling function. If the function is available, the phone line is faulty.
l Cause 4: The fiber connection of the SC2 board at station B is incorrect.
1. Along the signal flow, check the pigtail connected to the SC2 board at station B. Ensure
that the connection is correct.
l Cause 5: The system control board at station B is faulty.
1. Remove the system control board at station B to make orderwire bytes pass through
station B. If the orderwire becomes available, the system control board is faulty.
Replace the system control board.

----End

19.2 A Called Phone Set Cannot Exit the Conference Call


State
This topic describes the possible cause and troubleshooting method when a called phone set in
a conference call fails to exit the conference call state.

Symptom
A called phone set in a conference call fails to exit the conference call state. That is, the called
phone set is still in the conference call state after being connected and then disconnected.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 282


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 19 Orderwire Problems

System Impact
The called phone set fails to exit the conference call state, and the orderwire is unavailable.

Possible Causes
l Cause 1: During the conference call, the calling orderwire board of a certain phone set is
reset either due to an external cause or because the phone set was connected when
communication on the supervisory channel failed.

Tools, Equipment, and Materials


None

Procedure
l Cause 1: During the conference call, the calling orderwire board of a certain phone set is
reset either due to an external cause or because the phone set was connected when
communication on the supervisory channel failed.
1. Leave the phone set in the connected state for two hours. The phone set will return to
the idle state two hours later.
2. Use an idle phone set to initiate a conference call. Then, connect this phone set, causing
all phone sets in orderwire communication with this phone set to exit the conference
call state.

----End

19.3 Orderwire Cases


Case analysis related to Orderwire cases.

NOTE

The following cases are related to the OptiX WDM product series.

Related Cases:
l MC-A42 A Fault of the PMU for the OptiX BWS 1600G Results In an Orderwire Ringing
Failure
l MC-A163 How To Quickly Re-obtain the IP Address Of a Device At a Station After the
IP Address Of the Device Is Changed
l MC-A187 How to Change the IP Address of the OptiX OSN 1800 to the Default One
l MC-A223 When a Self-Made Network Cable Is Used, the OptiX OSN 6800 Reports the
COMMUN_FAIL Alarm
l MC-A224 The Network Cable at the COM Port on the EFI Board of the OptiX OSN 6800
Subrack Is Incorrectly Connected. Consequently, All the Boards on the OptiX OSN 6800
Subrack Report the COMMUN_FAIL Alarms

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 283


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 20 Coherent Board Faults

20 Coherent Board Faults

This section describes how to troubleshoot coherent board faults in terms of the symptoms,
impact on the system, possible causes, tools required for troubleshooting, troubleshooting
procedure, and precautions that should be taken during the troubleshooting.

Symptoms
l Symptom 1: The pre-FEC BER is high, services are unavailable, or the wavelength of the
received service is incorrect.
l Symptom 2: DCN communication is interrupted and some NEs are unreachable by the
NMS.

Impact on the System


On the NEs unreachable by the NMS, bit errors are generated in the services, or the services are
interrupted.

Possible Causes
l For symptom 1:
– Cause 1: The receive wavelength is incorrectly configured.
– Cause 2: The FEC type of the board is incorrectly set.
l For symptom 2:
– Cause 1: Coherent and non-coherent boards are interconnected.

Tools, Instruments, and Materials


l U2000

Procedure
Step 1 For symptom 1:
1. Cause 1: The receive wavelength is incorrectly configured.
a. Check whether the receive wavelength has been configured for the coherent board. If
not, the system automatically associates the receive and transmit wavelengths when

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 284


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 20 Coherent Board Faults

receiving coherent wavelengths. In other words, the local transmit wavelength is


consistent with the remote transmit wavelength expected to be received.
b. If the receive wavelength has been configured for the board, the configured
wavelength takes effect and there is no automatic association between the receive and
transmit wavelengths. Check whether the receive wavelengths configured for
interconnected boards are consistent with the planned ones.
c. In the following scenario, the user must set the receive wavelength for the coherent
board. In the figures, there are four nodes: A, B1, B2, and C. Nodes B1 and B2 use
the TN11LTX board in relay mode, wavelength 1 is used between node A and nodes
B1/B2, and wavelength 2 is used between node C and nodes B1/B2. Normally, the
receive wavelength and transmit wavelength of node B1 are 1 and 2 respectively, and
those of node B2 are 2 and 1 respectively. If no receive wavelength is configured for
the 11LTX boards of nodes B1 and B2, the receive and transmit wavelengths of nodes
B1 and B2 will be consistent at each node.

1 2

B1
11LTX
A C
B2
11LTX

1 2
2. Cause 2: The FEC type of the board is incorrectly set.
a. On the U2000, check whether the board supports the preset FEC type. Currently,
coherent boards support HFEC only. If the board does not support the preset FEC
type, set the FEC type as HFEC.

Step 2 For symptom 2:


1. Cause 1: Coherent and non-coherent boards are interconnected.
a. The DCN bytes are defined differently for coherent and non-coherent services, so the
two types of services cannot be interconnected. If DCN channels are unavailable,
check whether the interconnected boards are coherent boards.

----End

Related Information
Coherent boards are heavy and their components are sensitive, so special package materials are
required and boards must be packaged in strict compliance with the instructions. During
deployment, keep the removed package materials in case of return for repair in future. If boards

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 285


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 20 Coherent Board Faults

are improperly packaged, they will be easily damaged during transportation, installation, and
return for repair, resulting in unnecessary loss.

Storing Package Materials: Package materials must be free of water and must be handled with
care. They can be stacked.

CAUTION
A board must be put into a foam box with the front panel towards a correct direction. Otherwise,
connectors and optical modules on the board maybe seriously damaged by the stress of the foam
box.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 286


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 21 Interconnection Faults

21 Interconnection Faults

About This Chapter

This topic describes the possible causes and troubleshooting methods for WDM and OTN
equipment interconnection failures.

21.1 Failure in Intercommunication with the Data Equipment


This section describes the possible causes and handling methods of the failure in
intercommunication of the WDM equipment with the data equipment.

21.2 Failure in Intercommunication with the SDH Equipment


This section describes the possible causes and handling methods of the failure in
intercommunication of the WDM equipment with the SDH equipment.

21.3 WDM and other OTN Equipment Interconnection Failures


This topic describes the possible causes and troubleshooting methods for WDM and OTN
equipment interconnection failures.

21.4 Equipment Interconnection Cases


Case analysis related to Equipment Interconnection cases.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 287


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 21 Interconnection Faults

21.1 Failure in Intercommunication with the Data


Equipment
This section describes the possible causes and handling methods of the failure in
intercommunication of the WDM equipment with the data equipment.

Symptom
After a piece of WDM equipment is interconnected to a piece of data equipment, the two pieces
of equipment fail to communicate with each other and the equipment might report the R_LOS,
LINK_ERR and R_OOF alarms.

Impact on System
The two pieces of interconnected equipment fail to communicate with each other.

Possible Causes
l Cause 1: The line between the WDM equipment and the data equipment is faulty.
l Cause 2: The parameter configuration of interconnected board of the WDM equipment on
the client side is incorrect.
l Cause 3: Either or both of the interconnected boards is/are faulty.

Tools, Equipment, and Materials


U2000, optical power meter

Procedure
Step 1 Cause 1: The line between the WDM equipment and the data equipment is faulty. In this case,
interconnected board of the WDM equipment may report the R_LOS, LINK_ERR alarms on
the client side.
1. Check whether the optical modules of the interconnected boards are of the same type. If
they are of the same type, check whether the supported transmission distance of the optical
modules is equal or longer than the actual distance away from the client side. The single-
mode optical module and the multimode optical module cannot be used together.
2. Check whether the fibers of the interconnected boards match each other. The single-mode
fiber and the multimode fiber cannot be used together.
NOTE
Generally, the multimode fiber is nacarat-colored, and the single-mode fiber is yellow.
3. Check whether any fiber is inserted in an incorrect interface according to the fiber labels.
If any fiber is inserted in an incorrect interface, the equipment might not generate the
R_LOS alarm but generates a large number of performance events against anomalies. In
this case, insert the fiber in the correct interface.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 288


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 21 Interconnection Faults

4. Use an optical power meter to test the interconnect fiber jumper. If the fiber jumper is
abnormal, replace the fiber jumper. If the fiber jumper is good, check whether the optical
cable is faulty.

Step 2 Cause 2: The parameter configuration of interconnected board of the WDM equipment on the
client side is incorrect. In this case, interconnected board of the WDM equipment may report
R_LOS, LINK_ERR alarms on the client side.
1. In the case of the OTU board that can access various service data, check whether the service
type settings of the OTU board are different from those of the data equipment.
l On the interconnected board of the WDM equipment, query and set the client service
type of the board. Make sure that the service type of the WDM equipment is the same
as the interconnected data equipment.
2. Check whether the auto-negotiation modes of the optical interfaces on the interconnected
boards are different from each other.
l Querying and Setting the Working Mode of the Ethernet Port. Make sure that the auto-
negotiation modes are the same.
3. The Maximum Package Length of the optical interfaces on the interconnected boards are
different from each other.
l Log in to the U2000. In the NE Explorer interface, select the interconnected board.
l In Function Tree on the left side, choose Configuration > WDM Interface.
l Select Advanced Attributes, and click Query. Make sure that the Maximum Package
Length of the optical interfaces on the interconnected boards are same.
4. Check whether any interconnected interface is looped back.
l Log in to the U2000. In the NE Explorer interface, select the interconnected board.
l In Function Tree on the left side, choose Configuration > WDM Interface.
l Select Basic Attributes, and click Query. Check whether the optical interface on the
client side of each board is looped back. If yes, double-click the Loopback field. In the
drop-down list, select Non-Loopback and click Apply.

Step 3 Cause 3: Either or both of the interconnected boards is/are faulty.


1. At the optical interface on the client side of the interconnected board in the WDM
equipment, use the optical power meter to test the optical power. If the optical power is
abnormal, replace the board.
2. At the optical interface of the interconnected board in the data equipment, use the optical
power meter to test the optical power. If the optical power is abnormal, replace the board.

----End

21.2 Failure in Intercommunication with the SDH


Equipment
This section describes the possible causes and handling methods of the failure in
intercommunication of the WDM equipment with the SDH equipment.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 289


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 21 Interconnection Faults

Symptom
After a piece of WDM equipment is interconnected to a piece of SDH equipment, the two pieces
of equipment fail to communicate with each other and the equipment might report the R_LOS,
R_LOF, and R_OOF alarms.

If the equipment reports the B1_EXC, B1_SD, B2_EXC, and B2_SD alarms, it indicates that
bit errors occur in the one of the two pieces of interconnected equipment. For details on how to
clear the fault, see Single-Channel Bit Errors.

Impact on System
The communication between the interconnected equipment is unavailable.

Possible Causes
l Cause 1: The line between the WDM equipment and the SDH equipment is faulty.
l Cause 2: The parameter configuration of interconnected board of the WDM equipment on
the client side is incorrect.
l Cause 3: Either or both of the interconnected boards is/are faulty.

Tools, Equipment, and Materials


U2000, optical power meter

Procedure
Step 1 Cause 1: The line between the WDM equipment and the SDH equipment is faulty. In this case,
interconnected board of the WDM equipment may report the R_LOS and R_LOF alarms on the
client side.
1. Check whether the optical modules of the interconnected boards are of the same type. If
they are of the same type, check whether the supported transmission distance of the optical
modules is equal or longer than the actual distance away from the client side. The single-
mode optical module and the multimode optical module cannot be used together.
2. Check whether the fibers of the interconnected boards match each other. The single-mode
fiber and the multimode fiber cannot be used together.
NOTE
Generally, the multimode fiber is nacarat-colored, and the single-mode fiber is yellow.
3. Check whether any fiber is inserted in an incorrect interface according to the fiber labels.
If any fiber is inserted in an incorrect interface, the equipment might not generate the
R_LOS alarm but generates a large number of performance events against anomalies. In
this case, insert the fiber in the correct interface.
4. Use an optical power meter to test the interconnect fiber jumper. If the fiber jumper is
abnormal, replace the fiber jumper. If the fiber jumper is good, check whether the optical
cable is faulty.

Step 2 Cause 2: The parameter configuration of interconnected board of the WDM equipment on the
client side is incorrect. In this case, interconnected board of the WDM equipment may report
R_OOF alarm on client side.
1. Check whether the service types of the interconnected boards mismatch each other.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 290


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 21 Interconnection Faults

l On the interconnected board of the WDM equipment, query and set the client service
type of the board. Make sure that the service type of the WDM equipment is the same
as the interconnected SDH equipment.
2. Check whether any interconnected interface is looped back.
l Log in to the U2000. In the NE Explorer interface, select the interconnected board.
l In Function Tree on the left side, choose Configuration > WDM Interface.
l Select Basic Attributes, and click Query. Check whether the optical interface on the
client side of each board is looped back. If yes, double-click the Loopback field. In the
drop-down list, select Non-Loopback and click Apply.

Step 3 Cause 3: Either or both of the interconnected boards is/are faulty.


1. At the optical interface on the client side of the interconnected board in the WDM
equipment, use the optical power meter to test the optical power. If the optical power is
abnormal, replace the board.
2. At the optical interface of the interconnected board in the SDH equipment, use the optical
power meter to test the optical power. If the optical power is abnormal, replace the board.

----End

21.3 WDM and other OTN Equipment Interconnection


Failures
This topic describes the possible causes and troubleshooting methods for WDM and OTN
equipment interconnection failures.

Symptom
After local WDM equipment is interconnected with other OTN equipment, the two devices fail
to communicate with each other and the U2000 may report R_LOS, OTUk_LOF, and
OTUk_LOM alarms.

System Impact
Interconnected devices are unable to communicate with each other.

Possible Causes
l Cause 1: The line between the local WDM equipment and the other OTN equipment is
faulty.
l Cause 2: The client-side configuration for the interconnected board on the WDM equipment
is incorrect.
l Cause 3: One or both of the interconnected boards is/are faulty.

Tools, Equipment, and Materials


U2000, optical power meter

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 291


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 21 Interconnection Faults

Procedure
Step 1 Cause 1: The line between the local WDM equipment and the other OTN equipment is faulty.
In this case, the interconnected board on the WDM equipment may report the R_LOS alarm on
the client side.
1. Check whether the optical modules of the interconnected boards are of the same type. If
they are of the same type, check whether the supported transmission distance of the optical
modules is equal or longer than the actual distance away from the client side. The single-
mode optical module and the multimode optical module cannot be used together.
2. Check whether the optical fibers of the interconnected boards match each other. The single-
mode optical fiber and the multimode optical fiber cannot be used together.
NOTE
Generally, the multimode optical fiber is nacarat, and the single-mode fiber is yellow.
3. Check whether any optical fiber is installed on an incorrect port according to the optical
fiber labels. If any optical fiber is installed on an incorrect port, the R_LOS alarm might
not be generated but a large number of performance events against anomalies may be
generated on the equipment. In this case, install the optical fiber on the correct port.
4. Use an optical power meter to test the pigtail interconnected with the equipment. If the
pigtail is abnormal, replace it. If it is normal, check whether the optical cable is faulty.
Step 2 Cause 2: The client-side configuration for the ports on the interconnected board of the WDM
equipment is incorrect. In this case, the interconnected board on the WDM equipment may report
the OTUk_LOF and OTUk_LOM alarms on the client side.
1. Check whether the service types, FEC mode, working modes, board mode of the
interconnected boards are inconsistent.
l Query and set the client service type of the interconnected board of the WDM
equipment. Ensure that the service type of the WDM equipment is the same as that of
the interconnected OTN equipment.
2. Check whether a loop is occurring on any interconnected port.
l Log in to the U2000. In NE Explorer, select the interconnected NE.
l Choose Configuration > WDM Interface from the Function Tree.
l On the Basic Attributes tab, click Query to check whether a loopback is enabled on
any optical port on the client side of each board. If a loopback is enabled, double-click
the Loopback field. In the drop-down list, select Non-Loopback and click Apply.
Step 3 Cause 3: One or both of the interconnected boards is/are faulty.
1. Use the optical power meter to test the optical power of the client-side optical port on the
interconnected board of the WDM equipment. If the optical power is abnormal, replace the
board.
2. Use the optical power meter to test the optical power of the optical port on the
interconnected board of the OTN equipment. If the optical power is abnormal, replace the
board.

----End

21.4 Equipment Interconnection Cases


Case analysis related to Equipment Interconnection cases.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 292


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting 21 Interconnection Faults

NOTE

The following cases are related to the OptiX WDM product series.

Related Cases:
l MC-A4 The LOG Board Fails to Interwork With the FDG Board on the Client Sides. The
LOG board reports the R_LOS alarm on the client side. The FDG board reports the
LINK_STATUS alarm.
l MC-A5 The LQG Board Reports the ALM_DATA_RLOS and ALM_DATA_TLOS
Alarms Transiently
l MC-A6 The LDG Board Keeps Reporting the ALM_DATA_TLOS and
ALM_DATA_RLOS Alarms Transiently
l MC-A11 The SSE3LWF Board Reports the OTU_LOF Alarm When Interworking with
the SSE1TMR
l MC-A17 The LQG Reports ALM_DATA_RLOS and ALM_DATA_TLOS Alarms
l MC-A21 An Interconnection Failure between the WDM and SDH Equipment
l MC-A30 LDG Board of Metro 6100 Equipment Reports the INBADOCTS_OVER Alarm
l MC-A44 Bit Errors Generated in the Services, the Meter Reports the HP_RDI and ERROR
Alarms
l MC-A49 An Incorrect Setting of CRC Results in an Interconnection Failure
l MC-A67 The GE Port on the Client Side Reports LINK_DOWN Alarm
l MC-A74 The FDG Reports the T_DATA_LOST and R_DATA_LOST Alarms
l MC-A75 Bandwidth Decreases and Service Rate Becomes Lower Due to Improper Setting
l MC-A84 Maximal Packet Length Setting of the OptiX OSN 6800 Causes the Abnormal
Service
l MC-A117 Analysis and Handling of the ALM_DATA_RLOS and ALM_DATA_TLOS
Alarms Reported by the LBE
l MC-A118 An R_OOF Alarm Occurs When the LWC Interconnects with Third-Party
Equipment
l MC-A153 It Is Recommended to Enable the Auto-Negotiation Working Mode at a GE Port
When the LPT Function Is Enabled, Or The Services Are Interrupted And The System
Reports The R_LOS Alarms
l MC-A154 The N2SLQ16 Board Interconnected with the TQS Board Reports an R_LOS
Alarm Because The TQS Board Emits White Lights When the Input Optical Power Is
Normal
l MC-A156 An SDH Board on the SDH Equipment Reports an R_LOF Alarm When the
SDH Board Is Interconnected with a Tributary Board on the OptiX OSN 6800
l MC-A185 Incorrect Client-Side Service Type Causes Failure of Interconnection With a
Router
l MC-A249 The OTN Equipment Fails to Transparently Transmit Ethernet Clock Signals

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 293


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting A Remote Maintenance Guide

A Remote Maintenance Guide

This topic describes remote maintenance.

A.1 Overview
This topic provides remote maintenance overview.

A.2 Enabling Remote Maintenance User


This section introduces how to enable remote maintenance user.

A.3 Creating Remote Maintenance


This section introduces how to create remote maintenance between the U2000 service end and
the remote maintenance terminal.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 294


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting A Remote Maintenance Guide

A.1 Overview
This topic provides remote maintenance overview.

The Huawei optical transmission equipment series supports remote maintenance. A remote
computer (remote end) can connect to an on-site U2000 server (local end) through a public
switched telephone network (PSTN) or through the Internet. The remote computer then can
perform maintenance to the equipment.

Figure A-1 is a networking diagram illustrating remote maintenance. A modem is installed at


the remote maintenance terminal and on the U2000 server, and dial-up programs are installed
at each end. These operations are performed during the U2000 installation.

Figure A-1 Remote maintenance networking diagram

serial
port Modem
PSTN/
Internet serial
Modem port
Remote
maintenance
ternimal
U2000
Server

WDM
network

During remote maintenance, the U2000 server needs to cooperate with the remote end to perform
the following operations:

l Enable the remote maintenance user.


l Establish the remote maintenance connection.

A.2 Enabling Remote Maintenance User


This section introduces how to enable remote maintenance user.

Prerequisites
You must be an NM user with "Network administrator" authority or higher.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 295


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting A Remote Maintenance Guide

Context
A remote maintenance user is the NM user who logs in to the U2000 server to maintain the
equipment. Remote maintenance users are in Disable status by default. You need to enable it
before you start the remote maintenance.

Procedure
Step 1 In the Main Menu choose System > Remote Maintenance User Management to display the
Remote Maintenance User Management dialog box.

Step 2 Set the Disable/Enable to be Enable. Set the other attributes of the remote user.

Step 3 Click Apply.

Step 4 Click OK to confirm the settings.

----End

A.3 Creating Remote Maintenance


This section introduces how to create remote maintenance between the U2000 service end and
the remote maintenance terminal.

Prerequisites
The communication connection between the remote maintenance terminal and the U2000 server
must have been configured.

The remote maintenance user must have been enabled.

The user name and password of the U2000 server must be known to maintenance personnel at
the remote maintenance end.

Procedure
Step 1 On the U2000 server, query the connection status for remote maintenance. If the remote
maintenance connection is normal, go to step 5. If not, go to step 2.

The query methods are listed as follows.

l If the Windows operating system is installed, right-click My Network Places and select
Properties to query the connection status for remote maintenance.
l If the Solaris or Linux operating system is installed, open a terminal window and run the
ifconfig -a command to query the connection status for remote maintenance.

Step 2 Double-click the shortcut icon, which is created for dial-up connection, on the desktop, such as
Remote Maintenance to perform dial-up connection.

Step 3 Enter the user name (ppp_user) and password in the displayed dialog box.

Step 4 Enter the user name and password. Press the Enter key and click the D button at the right bottom
corner to establish the connection.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 296


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting A Remote Maintenance Guide

NOTE
After you enter the password and press the Enter key, a line of illegal characters is displayed. This is normal.

Step 5 Query the IP address for dial-up access to network.

l If the Windows operating system is installed, choose Start > All Programs > Accessories
> Command Prompt. Enter ipconfig in the dialog box and query the dynamic IP address
for dial-up access to network.
l If the Solaris or Linux operating system is installed, display a terminal window and run the
ifconfig -a command to query dynamic IP address for dial-up access to network.

NOTE
The address for dial-up access is IP address. If the dial-up connection of the server is disconnected, re-dial
to get a new IP address.

Step 6 Inform the maintenance personnel at the remote end of the queried IP address.

Step 7 Establish the dial-up connection between the remote maintenance personnel and the U2000
server. Log in to the U2000 client. After the login succeeds, you can perform the remote
maintenance.

CAUTION
To ensure network security, set the remote maintenance user to Disabled after the remote
maintenance.

----End

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 297


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting B Glossary

B Glossary

Numerics
1+1 backup A backup method in which two components mirror each other. If the active component
goes down, the standby component takes over services from the active component to
ensure that the system service is not interrupted.
1000BASE-T An Ethernet specification that uses the twisted pair cable with the transmission speed as
1000 Mbit/s and the transmission distance as 100 meters.
3R reshaping, retiming, regenerating

A
AC alternating current
ACL See access control list.
ADM add/drop multiplexer
ADSL See asymmetric digital subscriber line.
AGC automatic gain control
AIS alarm indication signal
ALC link A piece of end-to-end configuration information, which exists in the equipment (single
station) as an ALC link node. Through the ALC function of each node, it fulfills optical
power control on the line that contains the link.
ALS See automatic laser shutdown.
ANSI See American National Standards Institute.
APE See automatic power equilibrium.
APS automatic protection switching
ASE amplified spontaneous emission
ASIC See application-specific integrated circuit.
ATM asynchronous transfer mode

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 298


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting B Glossary

American National An organization that defines U.S standards for the information processing industry.
Standards Institute
(ANSI)
access control list A list of entities, together with their access rights, which are authorized to access a
(ACL) resource.
active/standby A troubleshooting technology. When an active device becomes faulty, services and
switchover control functions are automatically switched over to the standby device to ensure the
normal running of the services and functions.
alarm cable A cable used to transmit visual or audio alarms.
alarm cascading The method of cascading alarm signals from several subracks or cabinets.
alarm output Node or other signals that are sent by an alarm controller to peripheral devices when an
alarm is reported.
alarm suppression A method to suppress alarms for the alarm management purpose. Alarms that are
suppressed are no longer reported from NEs.
application-specific A special type of chip that starts out as a nonspecific collection of logic gates. Late in
integrated circuit the manufacturing process, a layer is added to connect the gates for a specific function.
(ASIC) By changing the pattern of connections, the manufacturer can make the chip suitable for
many needs.
asymmetric digital A technology for transmitting digital information at a high bandwidth on existing phone
subscriber line (ADSL) lines to homes and businesses. Unlike regular dialup phone service, ADSL provides
continuously-available, "always on" connection. ADSL is asymmetric in that it uses most
of the channel to transmit downstream to the user and only a small part to receive
information from the user. ADSL simultaneously accommodates analog (voice)
information on the same line. ADSL is generally offered at downstream data rates from
512 kbit/s to about 6 Mbit/s.
attenuation Reduction of signal magnitude or signal loss, usually expressed in decibels.
attenuator A device used to increase the attenuation of an Optical Fiber Link. Generally used to
ensure that the signal at the receive end is not too strong.
automatic laser A technique (procedure) to automatically shutdown the output power of laser transmitters
shutdown (ALS) and optical amplifiers to avoid exposure to hazardous levels.
automatic power A function to automatically equalize channel optical power at the transmitter end,
equilibrium (APE) ensuring a required optical power flatness and OSNR at the receiver end.

B
B/S browser/server
BBER background block error ratio
BC boundary clock
BDI See backward defect indication.
BGP Border Gateway Protocol
BIOS See basic input/output system.
BIP See bit interleaved parity.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 299


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting B Glossary

BIP-8 See bit interleaved parity-8.


BITS See building integrated timing supply.
BOM bill of materials
BPDU See bridge protocol data unit.
BPSK See binary phase shift keying.
BRAS See broadband remote access server.
backbone network A network that forms the central interconnection for a connected network. The
communication backbone for a country is WAN. The backbone network is an important
architectural element for building enterprise networks. It provides a path for the exchange
of information between different LANs or subnetworks. A backbone can tie together
diverse networks in the same building, in different buildings in a campus environment,
or over wide areas. Generally, the backbone network's capacity is greater than the
networks connected to it.
backplane An electronic circuit board containing circuits and sockets into which additional
electronic devices on other circuit boards or cards can be plugged.
backward defect A function that the sink node of a LSP, when detecting a defect, uses to inform the
indication (BDI) upstream end of the LSP of a downstream defect along the return path.
bandwidth A range of transmission frequencies a transmission line or channel can carry in a network.
In fact, the bandwidth is the difference between the highest and lowest frequencies in
the transmission line or channel. The greater the bandwidth, the faster the data transfer
rate.
basic input/output Firmware stored on the computer motherboard that contains basic input/output control
system (BIOS) programs, power-on self test (POST) programs, bootstraps, and system setting
information. The BIOS provides hardware setting and control functions for the computer.
binary phase shift 2-phase modulation for carrier based on binary baseband signal. In this modulation mode,
keying (BPSK) the binary character 0 represents phase 0 of the carrier, and character 1 represents the
phase 180. The phase of character 0 is 0, and the phase of character 1 needs to be
specified. This is an absolute phase shift mode that uses different phases to represent
digital information.
bit The smallest unit of information handled by a hardware component. One bit expresses
a 1 or a 0 in a binary numeral, or a true or a false logical condition, and is represented
physically by an element such as a high or low voltage at one point in a circuit or a small
spot on a disk magnetized one way or the other. A single bit conveys little information
a human would consider meaningful. A group of eight bits, however, makes up a byte,
which can be used to represent many types of information, such as a letter of the alphabet,
a decimal digit, or other character.
bit interleaved parity A method of error monitoring. With even parity, the transmitting equipment generates
(BIP) an X-bit code over a specified portion of the signal in such a manner that the first bit of
the code provides even parity over the first bit of all X-bit sequences in the covered
portion of the signal, the second bit provides even parity over the second bit of all X-bit
sequences within the specified portion, and so forth. Even parity is generated by setting
the BIP-X bits so that an even number of 1s exist in each monitored partition of the
signal. A monitored partition comprises all bits in the same bit position within the X-bit
sequences in the covered portion of the signal. The covered portion includes the BIP-X.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 300


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting B Glossary

bit interleaved parity-8 Consists of a parity byte calculated bit-wise across a large number of bytes in a
(BIP-8) transmission transport frame. Divide a frame is into several blocks with 8 bits (one byte)
in a parity unit and then arrange the blocks in matrix. Compute the number of "1" or "0"
over each column. Then fill a 1 in the corresponding bit for the result if the number is
odd, otherwise fill a 0.
bit/s See bits per second.
bits per second (bit/s) A rate at which the individual bits are transmitted through a communication link or
circuit. Its unit can be bit/s, kbit/s, and Mbit/s.
bridge A device that connects two or more networks and forwards packets among them. Bridges
operate at the physical network level. Bridges differ from repeaters because bridges store
and forward complete packets, while repeaters forward all electrical signals. Bridges
differ from routers because bridges use physical addresses, while routers use IP
addresses.
bridge protocol data Data messages exchanged across switches within an extended LAN that uses a spanning
unit (BPDU) tree protocol (STP) topology. BPDU packets contain information on ports, addresses,
priorities, and costs, and they ensure that the data reaches its intended destination. BPDU
messages are exchanged across bridges to detect loops in a network topology. These
loops are then removed by shutting down selected bridge interfaces and placing
redundant switch ports in a backup, or blocked, state.
broadband remote A new type of access gateway for broadband networks. As a bridge between backbone
access server (BRAS) networks and broadband access networks, BRAS provides methods for fundamental
access and manages the broadband access network. It is deployed at the edge of network
to provide broadband access services, convergence, and forwarding of multiple services,
meeting the demands for transmission capacity and bandwidth utilization of different
users. BRAS is a core device for the broadband users' access to a broadband network.
broadcast A means of delivering information to all members in a network. The broadcast range is
determined by the broadcast address.
broadcast address A network address in computer networking that allows information to be sent to all nodes
on a network, rather than to a specific network host.
building integrated In the situation of multiple synchronous nodes or communication devices, one can use
timing supply (BITS) a device to set up a clock system on the hinge of telecom network to connect the
synchronous network as a whole, and provide satisfactory synchronous base signals to
the building integrated device. This device is called BITS.
bus A path or channel for signal transmission. The typical case is that, the bus is an electrical
connection that connects one or more conductors. All devices that are connected to a
bus, can receive all transmission contents simultaneously.
byte A unit of computer information equal to eight bits.

C
CAR committed access rate
CBR See constant bit rate.
CE See customer edge.
CES See circuit emulation service.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 301


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting B Glossary

CPRI See common public radio interface.


CPU See central processing unit.
CRC See cyclic redundancy check.
CSF Client Signal Fail
CSMA/CD See carrier sense multiple access with collision detection.
CV connectivity verification
CWDM See coarse wavelength division multiplexing.
cabinet Free-standing and self-supporting enclosure for housing electrical and/or electronic
equipment. It is usually fitted with doors and/or side panels which may or may not be
removable.
carrier sense multiple Carrier sense multiple access with collision detection (CSMA/CD) is a computer
access with collision networking access method in which:
detection (CSMA/CD)
l A carrier sensing scheme is used.
l A transmitting data station that detects another signal while transmitting a frame,
stops transmitting that frame, transmits a jam signal, and then waits for a random
time interval before trying to send that frame again.
central processing unit The computational and control unit of a computer. The CPU is the device that interprets
(CPU) and executes instructions. The CPU has the ability to fetch, decode, and execute
instructions and to transfer information to and from other resources over the computer's
main data-transfer path, the bus.
chain network One type of network that all network nodes are connected one after one to be in series.
circuit emulation A function with which the E1/T1 data can be transmitted through ATM networks. At the
service (CES) transmission end, the interface module packs timeslot data into ATM cells. These ATM
cells are sent to the reception end through the ATM network. At the reception end, the
interface module re-assigns the data in these ATM cells to E1/T1 timeslots. The CES
technology guarantees that the data in E1/T1 timeslots can be recovered to the original
sequence at the reception end.
client trail A lower level trail in a structure where trails of different levels have an inclusion relation.
For example, a trail of a certain level contains multiple trails of lower levels.
clock source A device that provides standard time for the NTP configuration.
clock synchronization A process of synchronizing clocks, in which the signal frequency traces the reference
frequency, but the start points do not need to be consistent. This process is (also known
as frequency synchronization).
clock tracing The method of keeping the time on each node synchronized with a clock source in the
network.
coarse wavelength A signal transmission technology that multiplexes widely-spaced optical channels into
division multiplexing the same fiber. CWDM spaces wavelengths at a distance of several nm. CWDM does
(CWDM) not support optical amplifiers and is applied in short-distance chain networking.
collection A process of prompting a customer to pay outstanding bills.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 302


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting B Glossary

common public radio A common standard of the key internal interface between the REC and the RE of the
interface (CPRI) wireless base station. This standard was established by Huawei, Ericsson, NEC, Siemens,
and Nortel in June 2003. It aims at standardizing the baseband and RF interface. The
CPRI has a set of mature standards, which advance the standard and equipment. The
major feature of the CPRI is that baseband is separated from RF to reduce the cost of
engineering, equipment room, and equipment.
configuration data A command file defining hardware configurations of an NE. With this file, an NE can
collaborate with other NEs in a network. Therefore, configuration data is the key factor
that determines the operation of an entire network.
connection point A reference point where the output of a trail termination source or a connection is bound
to the input of another connection, or where the output of a connection is bound to the
input of a trail termination sink or another connection. The connection point is
characterized by the information which passes across it. A bidirectional connection point
is formed by the association of a contradirectional pair.
consistency check A function that is used to check the consistency of service data and resource data between
two softswitches that have the dual homing relation. This ensures the consistency of
service data and resource data between the softswitches.
constant bit rate (CBR) A kind of service categories defined by the ATM forum. CBR transfers cells based on
the constant bandwidth. It is applicable to service connections that depend on precise
clocking to ensure undistorted transmission.
control VLAN A VLAN that transmits only protocol packets.
convergence layer A "bridge" between the access layer and the core layer. The convergence layer provides
the convergence and forwarding functions for the access layer. It processes all the traffic
from the access layer devices, and provides the uplinks to the core layer. Compared with
the access layer, the convergence layer devices should have higher performance, fewer
interfaces and higher switching rate. In the real network, the convergence layer refers to
the network between UPEs and PE-AGGs.
cross-connection The connection of channels between the tributary board and the line board, or between
line boards inside the NE. Network services are realized through the cross-connections
of NEs.
customer edge (CE) A part of the BGP/MPLS IP VPN model that provides interfaces for directly connecting
to the Service Provider (SP) network. A CE can be a router, switch, or host.
cyclic redundancy A procedure used to check for errors in data transmission. CRC error checking uses a
check (CRC) complex calculation to generate a number based on the data transmitted. The sending
device performs the calculation before performing the transmission and includes the
generated number in the packet it sends to the receiving device. The receiving device
then repeats the same calculation. If both devices obtain the same result, the transmission
is considered to be error free. This procedure is known as a redundancy check because
each transmission includes not only data but extra (redundant) error-checking values.

D
DAPI destination access point identifier
DBPS distributed board protect system
DC direct current

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 303


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting B Glossary

DC-C See DC-return common (with ground).


DC-I See DC-return isolate (with ground).
DC-return common A power system, in which the BGND of the DC return conductor is short-circuited with
(with ground) (DC-C) the PGND on the output side of the power supply cabinet and also on the line between
the output of the power supply cabinet and the electric equipment.
DC-return isolate (with A power system, in which the BGND of the DC return conductor is short-circuited with
ground) (DC-I) the PGND on the output side of the power supply cabinet and is isolated from the PGND
on the line between the output of the power supply cabinet and the electric equipment.
DCC See data communications channel.
DCM See dispersion compensation module.
DCN See data communication network.
DGD differential group delay
DLAG See distributed link aggregation group.
DRDB dynamic random database
DRZ differential phase return to zero
DSLAM See digital subscriber line access multiplexer.
DSP See digital signal processor.
DVB-ASI digital video broadcast-asynchronous serial interface
DWDM See dense wavelength division multiplexing.
data communication A communication network used in a TMN or between TMNs to support the data
network (DCN) communication function.
data communications The data channel that uses the D1-D12 bytes in the overhead of an STM-N signal to
channel (DCC) transmit information on the operation, management, maintenance, and provisioning
(OAM&P) between NEs. The DCC channel composed of bytes D1-D3 is referred to as
the 192 kbit/s DCC-R channel. The other DCC channel composed of bytes D4-D12 is
referred to as the 576 kbit/s DCC-M channel.
dense wavelength The technology that utilizes the characteristics of broad bandwidth and low attenuation
division multiplexing of single mode optical fiber, employs multiple wavelengths with specific frequency
(DWDM) spacing as carriers, and allows multiple channels to transmit simultaneously in the same
fiber.
digital signal processor A microprocessor designed specifically for digital signal processing, generally in real
(DSP) time.
digital subscriber line A network device, usually situated in the main office of a telephone company, that
access multiplexer receives signals from multiple customer Digital Subscriber Line (DSL) connections and
(DSLAM) uses multiplexing techniques to put these signals on a high-speed backbone line.
discrete service The cross-connection that exists on an NE but cannot form trails on the network
management system.
dispersion A type of module that contains dispersion compensation fibers to compensate for the
compensation module dispersion of the transmitting fiber.
(DCM)

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 304


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting B Glossary

distributed link A board-level port protection technology that detects unidirectional fiber cuts and
aggregation group negotiates with the opposite port. In the case of a link down failure on a port or hardware
(DLAG) failure on a board, services are automatically switched to the slave board, thereby
achieving 1+1 protection for the inter-board ports.
dual feed and selective A channel used to transmit monitoring data on an optical transmission network. The
receiving monitoring data is transmitted on the data communications channel as part of the
overhead of the service signal.
dual-ended switching A protection method in which switching is performed at both ends of a protected entity,
such as a connection or path, even if a unidirectional failure occurs.

E
E-LAN See Ethernet local area network.
E-Line See Ethernet line.
E1 An European standard for high-speed data transmission at 2.048 Mbit/s. It provides
thirty-two 64 kbit/s channels. A time division multiplexing frame is divided in to 32
timeslots numbered from 0 to 31. Timeslot 0 is reserved for frame synchronization, and
timeslot 16 is reserved for signaling transmission. The rest 30 timeslots are use as speech
channels. Each timeslot sends or receives an 8-bit data per second. Each frame sends or
receives 256-bit data per second. 8000 frames will be sent or received per second.
Therefore the line data rate is 2.048 Mbit/s.
EAPE enhanced automatic power pre-equilibrium
ECC See embedded control channel.
EDFA See erbium-doped fiber amplifier.
EEPROM See electrically erasable programable read-only memory.
EMI See electromagnetic interference.
EPL See Ethernet private line.
EPLAN See Ethernet private LAN service.
EPLD See erasable programmable logical device.
EPON See Ethernet passive optical network.
ERPS Ethernet ring protection switching
ES edge server
ESC See electric supervisory channel.
ESCON See enterprise system connection.
ESD electrostatic discharge
ESN See equipment serial number.
ETS European Telecommunication Standards
ETSI See European Telecommunications Standards Institute.
EVOA electrical variable optical attenuator
EVPL See Ethernet virtual private line.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 305


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting B Glossary

EVPLAN See Ethernet virtual private LAN service.


EXP See experimental bits.
Ethernet A LAN technology that uses the carrier sense multiple access with collision detection
(CSMA/CD) media access control method. The Ethernet network is highly reliable and
easy to maintain. The speed of an Ethernet interface can be 10 Mbit/s, 100 Mbit/s, 1000
Mbit/s, or 10,000 Mbit/s.
Ethernet line (E-Line) A type of Ethernet service that is based on a point-to-point EVC (Ethernet virtual
connection).
Ethernet local area A type of Ethernet service that is based on a multipoint-to-multipoint EVC (Ethernet
network (E-LAN) virtual connection).
Ethernet passive A passive optical network based on Ethernet. It is a new generation broadband access
optical network technology that uses a point-to-multipoint structure and passive fiber transmission. It
(EPON) supports upstream/downstream symmetrical rates of 1.25 Gbit/s and a reach distance of
up to 20 km. In the downstream direction, the bandwidth is shared based on encrypted
broadcast transmission for different users. In the upstream direction, the bandwidth is
shared based on TDM. EPON meets the requirements for high bandwidth.
Ethernet private LAN A type of Ethernet service provided by SDH, PDH, ATM, or MPLS server layer
service (EPLAN) networks. This service is carried over dedicated bandwidth between multipoint-to-
multipoint connections.
Ethernet private line A type of Ethernet service provided by SDH, PDH, ATM, or MPLS server layer
(EPL) networks. This service is carried over dedicated bandwidth between point-to-point
connections.
Ethernet virtual A type of Ethernet service provided by SDH, PDH, ATM, or MPLS server layer
private LAN service networks. This service is carried over shared bandwidth between multipoint-to-
(EVPLAN) multipoint connections.
Ethernet virtual A type of Ethernet service provided by SDH, PDH, ATM, or MPLS server layer
private line (EVPL) networks. This service is carried over shared bandwidth between point-to-point
connections.
European A standards-setting body in Europe. Also the standards body responsible for GSM.
Telecommunications
Standards Institute
(ETSI)
eDQPSK enhanced differential quadrature phase shift keying
eSFP enhanced small form-factor pluggable
egress The egress LER. The group is transferred along the LSP consisting of a series of LSRs
after the group is labeled.
electric supervisory A technology that implements communication among all the nodes and transmission of
channel (ESC) monitoring data in an optical transmission network. The monitoring data of ESC is
introduced into DCC service overhead and is transmitted with service signals.
electrically erasable A type of EPROM that can be erased with an electrical signal. It is useful for stable
programable read-only storage for long periods without electricity while still allowing reprograming. EEPROMs
memory (EEPROM) contain less memory than RAM, take longer to reprogram, and can be reprogramed only
a limited number of times before wearing out.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 306


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting B Glossary

electromagnetic Any electromagnetic disturbance that interrupts, obstructs, or otherwise degrades or


interference (EMI) limits the performance of electronics/electrical equipment.
embedded control A logical channel that uses a data communications channel (DCC) as its physical layer
channel (ECC) to enable the transmission of operation, administration, and maintenance (OAM)
information between NEs.
encapsulation A technology for layered protocols, in which a lower-level protocol accepts a message
from a higher-level protocol and places it in the data portion of the lower-level frame.
Protocol A's packets have complete header information, and are carried by protocol B
as data. Packets that encapsulate protocol A have a B header, an A header, followed by
the information that protocol A is carrying. Note that A could equal to B, as in IP inside
IP.
encryption A function used to transform data so as to hide its information content to prevent it's
unauthorized use.
enterprise system A path protocol that connects the host to various control units in a storage system.
connection (ESCON) Enterprise system connection is a serial bit stream transmission protocol that operates a
rate of 200 Mbit/s.
equalization A method of avoiding selective fading of frequencies. Equalization can compensate for
the changes of amplitude frequency caused by frequency selective fading.
equipment serial A string of characters that identify a piece of equipment and ensures correct allocation
number (ESN) of a license file to the specified equipment. It is also called "equipment fingerprint".
erasable A logical array device which can be used to implement the required functions by
programmable logical programming the array. In addition, a user can modify and program the array repeatedly
device (EPLD) until the program meets the requirement.
erbium-doped fiber An optical device that amplifies optical signals. This device uses a short optical fiber
amplifier (EDFA) doped with the rare-earth element, Erbium. The signal to be amplified and a pump laser
are multiplexed into the doped fiber, and the signal is amplified by interacting with
doping ions. When the amplifier passes an external light source pump, it amplifies the
optical signals in a specific wavelength range.
experimental bits A field in the MPLS packet header, three bits long. This field is always used to identify
(EXP) the CoS of the MPLS packet.
extended ID The number of the subnet to which an NE belongs, used to identify different network
segments in a wide area network (WAN). Together, the ID and extended ID form the
physical ID of the NE.
eye pattern An oscilloscope display in which a digital data signal from a receiver is repetitively
sampled and applied to the vertical input, while the data rate is used to trigger the
horizontal sweep. It is so called because, for several types of coding, the pattern looks
like a series of eyes between a pair of rails.

F
FC See Fibre Channel.
FCS free cooling system
FDB See forwarding database.
FDDI See fiber distributed data interface.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 307


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting B Glossary

FE fast Ethernet
FEC See forward error correction.
FICON See Fiber Connect.
FOADM fixed optical add/drop multiplexer
FPGA See field programmable gate array.
FTP File Transfer Protocol
Fiber Connect A new generation connection protocol that connects the host to various control units. It
(FICON) carries a single byte command protocol through the physical path of fiber channel, and
provides a higher transmission rate and better performance than ESCON.
Fibre Channel (FC) A high-speed transport technology used to build SANs. FC is primarily used for
transporting SCSI traffic from servers to disk arrays, but it can also be used on networks
carrying ATM and IP traffic. FC supports single-mode and multi-mode fiber
connections, and can run on twisted-pair copper wires and coaxial cables. FC provides
both connection-oriented and connectionless services.
fault alarm A type of alarm caused by hardware and/or software faults, for example, board failure,
or by the exception that occurs in major functions. After handling, a fault alarm can be
cleared, upon which the NE reports a recovery alarm. Fault alarms are of higher severity
than event alarms.
fiber distributed data A standard developed by the American National Standards Institute (ANSI) for high-
interface (FDDI) speed fiber-optic LANs. FDDI provides specifications for transmission rates of 100
megabits per second on token ring networks.
field programmable A semi-customized circuit that is used in the Application Specific Integrated Circuit
gate array (FPGA) (ASIC) field and developed based on programmable components. FPGA remedies many
of the deficiencies of customized circuits, and allows the use of many more gate arrays.
forced switching The action of switching traffic signals between a working channel and protection
channel. The switching occurs even if the channel to which traffic is being switched is
faulty or an equal or higher priority switching command is in effect.
forward error A bit error correction technology that adds correction information to the payload at the
correction (FEC) transmit end. Based on the correction information, the bit errors generated during
transmission can be corrected at the receive end.
forwarding database A type of database that Includes entries for guiding multicast data forwarding. There are
(FDB) Layer 2 FDB and Layer 3 FDB. The Layer 2 FDB refers to the MAC table, which provides
information about the MAC address and outbound interface and guides Layer 2
forwarding. The Layer 3 FDB refers to the ARP table, which provides information about
the IP address and outbound interface and guides Layer 3 forwarding.

G
GCC general communication channel
GE Gigabit Ethernet
GFP See Generic Framing Procedure.
GMP Group Map Protocol
GMPLS generalized multiprotocol label switching

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 308


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting B Glossary

GNE See gateway network element.


GPON gigabit-capable passive optical network
GPS See Global Positioning System.
Generic Framing A framing and encapsulated method that can be applied to any data type. GFP is defined
Procedure (GFP) by ITU-T G.7041.
Global Positioning A global navigation satellite system that provides reliable positioning, navigation, and
System (GPS) timing services to users worldwide.
gain The difference between the optical power from the input optical interface of the optical
amplifier and the optical power from the output optical interface of the jumper fiber,
which expressed in dB.
gateway A device that connects two network segments using different protocols. It is used to
translate the data in the two network segments.
gateway network An NE that serves as a gateway for other NEs to communicate with a network
element (GNE) management system.

H
HD-SDI high definition serial digital interface
HSDPA See High Speed Downlink Packet Access.
High Speed Downlink A modulating-demodulating algorithm put forward in 3GPP R5 to meet the requirement
Packet Access for asymmetric uplink and downlink transmission of data services. It enables the
(HSDPA) maximum downlink data service rate to reach 14.4 Mbit/s without changing the
WCDMA network topology.
handle A component of the panel. It is used to insert or remove boards in and out of slots.
hardware loopback A connection mode in which a fiber jumper is used to connect the input optical interface
of a board to the output optical interface of the board to achieve signal loopback.
hop A network connection between two distant nodes. For Internet operation a hop represents
a small step on the route from one main computer to another.
hot patch A patch that is used to repair a deficiency in the software or add a new feature to a program
without restarting the software and interrupting the service. For the equipment using the
built-in system, a hot patch can be loaded, activated, confirmed, deactivated, deleted, or
queried.

I
IANA See Internet Assigned Numbers Authority.
IC See integrated circuit.
ICMP See Internet Control Message Protocol.
IEEE See Institute of Electrical and Electronics Engineers.
IETF Internet Engineering Task Force
IGMP See Internet Group Management Protocol.
IMA See inverse multiplexing over ATM.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 309


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting B Glossary

IP Internet Protocol
IP subnet A special submap used to identify an IP network segment. It is displayed as the submap
icon in the topological view.
IPA See intelligent power adjustment.
IPv4 See Internet Protocol version 4.
IPv6 See Internet Protocol version 6.
IS-IS See Intermediate System to Intermediate System.
ISDN integrated services digital network
ISO International Organization for Standardization
ITU-T International Telecommunication Union-Telecommunication Standardization Sector
Institute of Electrical A professional association of electrical and electronics engineers based in the United
and Electronics States, but with membership from numerous other countries. The IEEE focuses on
Engineers (IEEE) electrical, electronics, and computer engineering, and produces many important
technology standards.
Intermediate System to A protocol used by network devices (routers) to determine the best way to forward
Intermediate System datagram or packets through a packet-based network.
(IS-IS)
Internet Assigned A department operated by the IAB. IANA delegates authority for IP address-space
Numbers Authority allocation and domain-name assignment to the NIC and other organizations. IANA also
(IANA) maintains a database of assigned protocol identifiers used in the TCP/IP suite, including
autonomous system numbers.
Internet Control A network layer protocol that provides message control and error reporting between a
Message Protocol host server and an Internet gateway.
(ICMP)
Internet Group One of the TCP/IP protocols for managing the membership of Internet Protocol multicast
Management Protocol groups. It is used by IP hosts and adjacent multicast routers to establish and maintain
(IGMP) multicast group memberships.
Internet Protocol The current version of the Internet Protocol (IP). IPv4 utilizes a 32bit address which is
version 4 (IPv4) assigned to hosts. An address belongs to one of five classes (A, B, C, D, or E) and is
written as 4 octets separated by periods and may range from 0.0.0.0 through to
255.255.255.255. Each IPv4 address consists of a network number, an optional
subnetwork number, and a host number. The network and subnetwork numbers together
are used for routing, and the host number is used to address an individual host within the
network or subnetwork.
Internet Protocol An update version of IPv4, which is designed by the Internet Engineering Task Force
version 6 (IPv6) (IETF) and is also called IP Next Generation (IPng). It is a new version of the Internet
Protocol. The difference between IPv6 and IPv4 is that an IPv4 address has 32 bits while
an IPv6 address has 128 bits.
indicator Description of a performance feature collected from the managed devices by the
performance collector.
integrated circuit (IC) A combination of inseparable associated circuit elements that are formed in place and
interconnected on or within a single base material to perform a microcircuit function.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 310


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting B Glossary

intelligent power A technology that reduces the optical power of all the amplifiers in an adjacent
adjustment (IPA) regeneration section in the upstream to a safe level if the system detects the loss of optical
signals on the link. IPA helps ensure that maintenance engineers are not injured by the
laser escaping from a broken fiber or a connector that is not plugged in properly.
intersecting ring More than two public nodes between two ring networks. The network is more complex
if there are many intersecting nodes. Most devices support that two rings are intersected
at two nodes.
inverse multiplexing A technique that involves inverse multiplexing and de-multiplexing of ATM cells in a
over ATM (IMA) cyclical fashion among links grouped to form a higher bandwidth logical link whose rate
is approximately the sum of the link rates.

J
jitter The measure of short waveform variations caused by vibration, voltage fluctuations, and
control system instability.
jumper A connection wire for connecting two pins.

L
L2 switching The switching based on the data link layer.
L2VPN Layer 2 virtual private network
L3VPN Layer 3 virtual private network
LACP See Link Aggregation Control Protocol.
LAG See link aggregation group.
LAN See local area network.
LAPD link access procedure on the D channel
LB See loopback.
LC Lucent connector
LCAS See link capacity adjustment scheme.
LCT local craft terminal
LDP Label Distribution Protocol
LED See light emitting diode.
LMP link management protocol
LOS See loss of signal.
LP logical port
LPT link-state pass through
LSP See label switched path.
LSR See label switching router.
LTC loss of tandem connection

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 311


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting B Glossary

Layer 2 switching A data forwarding method. In a LAN, a network bridge or 802.3 Ethernet switch
transmits and distributes packet data based on the MAC address. Since the MAC address
is at the second layer of the OSI model, this data forwarding method is called Layer 2
switching.
Link Aggregation A dynamic link aggregation protocol that improves the transmission speed and
Control Protocol reliability. The two ends of the link send LACP packets to inform each other of their
(LACP) parameters and form a logical aggregation link. After the aggregation link is formed,
LACP maintains the link status in real time and dynamically adjusts the ports on the
aggregation link upon detecting the failure of a physical port.
label A short identifier that is of fixed length and local significance. It is used to uniquely
identify the FEC to which a packet belongs. It does not contain topology information. It
is carried in the header of a packet and does not contain topology information.
label switched path A sequence of hops (R0...Rn) in which a packet travels from R0 to Rn through label
(LSP) switching mechanisms. A label-switched path can be chosen dynamically, based on
common routing mechanisms or through configuration.
label switching router Basic element of an MPLS network. All LSRs support the MPLS protocol. The LSR is
(LSR) composed of two parts: control unit and forwarding unit. The former is responsible for
allocating the label, selecting the route, creating the label forwarding table, creating and
removing the label switch path; the latter forwards the labels according to groups
received in the label forwarding table.
light emitting diode A display and lighting technology used in almost every electrical and electronic product
(LED) on the market, from a tiny on/off light to digital readouts, flashlights, traffic lights, and
perimeter lighting. LEDs are also used as the light source in multimode fibers, optical
mice, and laser printers.
linear MSP linear multiplex section protection
link aggregation group An aggregation that allows one or more links to be aggregated together to form a link
(LAG) aggregation group so that a MAC client can treat the link aggregation group as if it were
a single link.
link capacity LCAS in the virtual concatenation source and sink adaptation functions provides a
adjustment scheme control mechanism to hitless increase or decrease the capacity of a link to meet the
(LCAS) bandwidth needs of the application. It also provides a means of removing member links
that have experienced failure. The LCAS assumes that in cases of capacity initiation,
increases or decreases, the construction or destruction of the end-to-end path is the
responsibility of the network and element management systems.
link group According to some principles, links are divided into the set in the logical term. A set of
links is called the link group. The division makes management more convenient.
link status The running status of a link, which can be Up, Down, backup, or unknown.
loading A process of importing information from the storage device to the memory to facilitate
processing (when the information is data) or execution (when the information is
program).
local area network A network formed by the computers and workstations within the coverage of a few square
(LAN) kilometers or within a single building, featuring high speed and low error rate. Current
LANs are generally based on switched Ethernet or Wi-Fi technology and run at 1,000
Mbit/s (that is, 1 Gbit/s).

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 312


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting B Glossary

loopback (LB) A troubleshooting technique that returns a transmitted signal to its source so that the
signal or message can be analyzed for errors. The loopback can be a inloop or outloop.
loopback test Self-test of chips, including internal and external loopback. Loopback test is used to test
whether interfaces work normally.
loss of signal (LOS) No transitions occurring in the received signal.

M
MAC mandatory access control
MAC address A link layer address or physical address. It is six bytes long.
MADM multiple add/drop multiplexer
MCA multi-channel spectrum analyzer unit
MD5 See message digest algorithm 5.
MFAS See multiframe alignment signal.
MIP maintenance association intermediate point
MPLS See Multiprotocol Label Switching.
MPLS TE multiprotocol label switching traffic engineering
MPLS VPN See multiprotocol label switching virtual private network.
MPLS-TP See MultiProtocol Label Switching Transport Profile.
MS multiplex section
MSOH multiplex section overhead
MSP See multiplex section protection.
MSTP See Multiple Spanning Tree Protocol.
MTU See maximum transmission unit.
MUX See multiplexer.
MultiProtocol Label An extension to MPLS in terms of forwarding, OAM, reliability, NMS and control plane
Switching Transport protocol standardized by IETF to provide sufficient transport functionality.
Profile (MPLS-TP)
Multiple Spanning A protocol that can be used in a loop network. Using an algorithm, the MSTP blocks
Tree Protocol (MSTP) redundant paths so that the loop network can be trimmed as a tree network. In this case,
the proliferation and endless cycling of packets is avoided in the loop network. The
protocol that introduces the mapping between VLANs and multiple spanning trees. This
solves the problem that data cannot be normally forwarded in a VLAN because in STP/
RSTP, only one spanning tree corresponds to all the VLANs.
Multiprotocol Label A technology that uses short tags of fixed length to encapsulate packets in different link
Switching (MPLS) layers, and provides connection-oriented switching for the network layer on the basis of
IP routing and control protocols.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 313


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting B Glossary

main topology A basic component of a human-machine interface. It is the default client interface of the
NMS and intuitively displays the structure of a network, NEs on the network, subnets in
the network as well as the NE communication and running status, reflecting the overall
network running status.
management node A management node consists of multiple document security management(DSM) servers
(four at most) in an area. Management nodes are created so that the DSM management
center can synchronize the department and account information of areas from each node
and then deliver all information to all nodes, thus realizing across-system authorization
and roaming access of security documents.
maximum transmission The largest packet of data that can be transmitted on a network. MTU size varies,
unit (MTU) depending on the network—576 bytes on X.25 networks, for example, 1500 bytes on
Ethernet, and 17,914 bytes on 16 Mbit/s token ring. Responsibility for determining the
size of the MTU lies with the link layer of the network. When packets are transmitted
across networks, the path MTU, or PMTU, represents the smallest packet size (the one
that all networks can transmit without breaking up the packet) among the networks
involved.
member A basic element for forming a dimension according to the hierarchy of each level. Each
member represents a data element in a dimension. For example, January 1997 is a typical
member of the time dimension.
message digest A hash function that is used in a variety of security applications to check message
algorithm 5 (MD5) integrity. MD5 processes a variable-length message into a fixed-length output of 128
bits. It breaks up an input message into 512-bit blocks (sixteen 32-bit little-endian
integers). After a series of processing, the output consists of four 32-bit words, which
are then cascaded into a 128-bit hash number.
monitoring A method that an inspector uses to inspect a service agent. By monitoring a service agent,
an inspector can check each detailed operation performed by the service agent during
the conversation and operate the GUI used by the service agent. The inspector helps the
service agent to provide better service.
mounting ear A piece of angle plate on a rack. The mounting ear has holes that can be used to fix
network elements or components.
multicast A process of transmitting data packets from one source to many destinations. The
destination address of the multicast packet uses Class D address, that is, the IP address
ranges from 224.0.0.0 to 239.255.255.255. Each multicast address represents a multicast
group rather than a host.
multiframe alignment A distinctive signal inserted into every multiframe or once into every n multiframes,
signal (MFAS) always occupying the same relative position within the multiframe, and used to establish
and maintain multiframe alignment.
multiplex section A function, which is performed to provide capability for switching a signal between and
protection (MSP) including two multiplex section termination (MST) functions, from a "working" to a
"protection" channel.
multiplexer (MUX) Equipment that combines a number of tributary channels onto a fewer number of
aggregate bearer channels, the relationship between the tributary and aggregate channels
being fixed.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 314


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting B Glossary

multiplexing A procedure by which multiple lower order path layer signals are adapted into a higher
order path or the multiple higher order path layer signals are adapted into a multiplex
section.
multiprotocol label An Internet Protocol (IP) virtual private network (VPN) based on the multiprotocol label
switching virtual switching (MPLS) technology. It applies the MPLS technology for network routers and
private network switches, simplifies the routing mode of core routers, and combines traditional routing
(MPLS VPN) technology and label switching technology. It can be used to construct the broadband
Intranet and Extranet to meet various service requirements.

N
NE Explorer The main operation interface of the NMS, which is used to manage the
telecommunication equipment. In the NE Explorer, a user can query, manage, and
maintain NEs, boards, and ports.
NE ID An ID that indicates a managed device in the network. In the network, each NE has a
unique NE ID.
NE Panel A graphical user interface, of the network management system, which displays subracks,
boards, and ports on an NE. On the NE Panel, the user can complete most of the
configuration, management and maintenance functions for an NE.
NNI network node interface
NRZ non-return to zero
NSAP See network service access point.
NTP Network Time Protocol
network layer Layer 3 of the seven-layer OSI model of computer networking. The network layer
provides routing and addressing so that two terminal systems are interconnected. In
addition, the network layer provides congestion control and traffic control. In the TCP/
IP protocol suite, the functions of the network layer are specified and implemented by
IP protocols. Therefore, the network layer is also called IP layer.
network service A service that needs to be enabled at the network layer and maintained as a basic service.
network service access A network address defined by ISO, at which the OSI Network Service is made available
point (NSAP) to a Network service user by the Network service provider.
network storm A phenomenon that occurs during data communication. To be specific, mass broadcast
packets are transmitted in a short time; the network is congested; transmission quality
and availability of the network decrease rapidly. The network storm is caused by network
connection or configuration problems.

O
O&M operation and maintenance
OA optical amplifier
OADM See optical add/drop multiplexer.
OAM See operation, administration and maintenance.
OAMS Optical fiber line Automatic Monitoring System

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 315


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting B Glossary

OAU See optical amplifier unit.


OC ordinary clock
OC-3 optical carrier level 3
OCP optical channel protection
OCS optical core switching
OCh optical channel with full functionality
OD optical demultiplexing
ODB optical duobinary
ODF optical distribution frame
ODUk optical channel data unit - k
OEQ optical equalizer
OLA optical line amplifier
OLP See optical line protection.
OM optical multiplexing
OMS optical multiplexing section
ONT See optical network terminal.
ONU See optical network unit.
OPA optical power adjust
OPU See optical channel payload unit.
OPUk optical channel payload unit - k
OSC See optical supervisory channel.
OSI open systems interconnection
OSN optical switch node
OSNR See optical signal-to-noise ratio.
OSPF See Open Shortest Path First.
OTM optical terminal multiplexer
OTN optical transport network
OTS See optical transmission section.
OTU See optical transponder unit.
OTUk optical channel transport unit - k
OWSP optical wavelength shared protection
Open Shortest Path A link-state, hierarchical interior gateway protocol (IGP) for network routing that uses
First (OSPF) cost as its routing metric. A link state database is constructed of the network topology,
which is identical on all routers in the area.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 316


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting B Glossary

operation, A set of network management functions that cover fault detection, notification, location,
administration and and repair.
maintenance (OAM)
optical add/drop A device that can be used to add the optical signals of various wavelengths to one channel
multiplexer (OADM) and drop the optical signals of various wavelengths from one channel.
optical amplifier unit A board that is mainly responsible for amplifying optical signals. The OAU can be used
(OAU) in both the transmitting direction and the receiving direction.
optical channel payload A protection architecture that allows one wavelength to provide protection for multiple
unit (OPU) services between different stations, saving wavelength resources and lowering costs.
optical interface A component that connects several transmit or receive units.
optical line protection A mechanism that protects line signals using the dual feeding and selective receiving
(OLP) principle, featuring single-ended switching.
optical network A device that terminates the fiber optical network at the customer premises.
terminal (ONT)
optical network unit A form of Access Node that converts optical signals transmitted via fiber to electrical
(ONU) signals that can be transmitted via coaxial cable or twisted pair copper wiring to
individual subscribers.
optical signal-to-noise The ratio of signal power to noise power in a transmission link. OSNR is the most
ratio (OSNR) important index for measuring the performance of a DWDM system.
optical supervisory A technology that uses specific optical wavelengths to realize communication among
channel (OSC) nodes in optical transmission network and transmit the monitoring data in a certain
channel.
optical switch A passive component possessing two or more ports that selectively transmits, redirects,
or blocks optical power in an optical fiber transmission line.
optical transmission A section in the logical structure of an optical transport network (OTN). The OTS allows
section (OTS) the network operator to perform monitoring and maintenance tasks between NEs.
optical transponder A device or subsystem that converts accessed client signals into a G.694.1/G.694.2-
unit (OTU) compliant WDM wavelength.
orderwire A channel that provides voice communication between operation engineers or
maintenance engineers of different stations.

P
P2MP point-to-multipoint
P2P See point-to-point service.
PC personal computer
PCB See printed circuit board.
PCN product change notice
PDH See plesiochronous digital hierarchy.
PDL See polarization-dependent loss.
PDU See power distribution unit.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 317


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting B Glossary

PE See provider edge.


PGND cable A cable which connects the equipment and the protection grounding bar. Usually, one
half of the cable is yellow, whereas the other half is green.
PHB See per-hop behavior.
PID photonics integrated device
PIN See personal identification number.
PM performance monitoring
PMD polarization mode dispersion
PMU power monitoring unit
POS See packet over SDH/SONET.
POTS See plain old telephone service.
PPP Point-to-Point Protocol
PRBS See pseudo random binary sequence.
PSI payload structure identifier
PSN See packet switched network.
PT payload type
PTN packet transport network
PTP Precision Time Protocol
PVC permanent virtual channel
PW See pseudo wire.
PWE3 See pseudo wire emulation edge-to-edge.
packet loss The discarding of data packets in a network when a device is overloaded and cannot
accept any incoming data at a given moment.
packet over SDH/ A MAN and WAN technology that provides point-to-point data connections. The POS
SONET (POS) interface uses SDH/SONET as the physical layer protocol, and supports the transport of
packet data (such as IP packets) in MAN and WAN.
packet switched A telecommunications network that works in packet switching mode.
network (PSN)
packet switching A network technology in which information is transmitted by means of exchanging
packets and the bandwidth of a channel can be shared by multiple connections.
paired slots Two slots of which the overheads can be passed through by using the bus on the
backplane.
per-hop behavior IETF Diff-Serv workgroup defines forwarding behaviors of network nodes as per-hop
(PHB) behaviors (PHB), such as, traffic scheduling and policing. A device in the network should
select the proper PHB behaviors, based on the value of DSCP. At present, the IETF
defines four types of PHB. They are class selector (CS), expedited forwarding (EF),
assured forwarding (AF), and best-effort (BE).
personal identification A code used by a mobile user in conjunction with a SIM card to complete a call.
number (PIN)

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 318


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting B Glossary

phase The relative position in time within a single period of a signal.


physical layer Layer 1 in the Open System Interconnection (OSI) architecture; the layer that provides
services to transmit bits or groups of bits over a transmission link between open systems
and which entails electrical, mechanical and handshaking.
physical link The link between two physical network elements (NEs). When the user creates NEs or
refreshes the device status, the system automatically creates the physical link according
to the topology structure information on the device. The remark information of a physical
link can be modified, but the physical link cannot be deleted.
plain old telephone The basic telephone service provided through the traditional cabling such as twisted pair
service (POTS) cables.
plesiochronous digital A multiplexing scheme of bit stuffing and byte interleaving. It multiplexes the minimum
hierarchy (PDH) rate 64 kit/s into rates of 2 Mbit/s, 34 Mbit/s, 140 Mbit/s, and 565 Mbit/s.
point-to-point service A service between two terminal users. In P2P services, senders and recipients are
(P2P) terminal users.
polarization-dependent A measure of the peak-to-peak insertion loss or gain variation caused by a component
loss (PDL) when stimulated by all possible polarization states. PDL is specified in dB.
policy A set of rules that are applied when the conditions for triggering an event are met.
port priority The priority that is used when a port attaches tags to Layer 2 packets. Packets received
on ports with higher priorities are forwarded preferentially.
power box A direct current power distribution box at the upper part of a cabinet, which supplies
power for the subracks in the cabinet.
power distribution unit A unit that performs AC or DC power distribution.
(PDU)
power module The module that converts the external power input into the power supply for internal
use. Power modules are classified into AC power modules and DC power modules.
power on To start up a computer; to begin a cold boot procedure; to turn on the power
printed circuit board A board used to mechanically support and electrically connect electronic components
(PCB) using conductive pathways, tracks, or traces, etched from copper sheets laminated onto
a non-conductive substrate.
private line A line, such as a subscriber cable and trunk cable, which are leased by the
telecommunication carrier and are used to meet the special user requirements.
protection path A path in a protection group that transports services when a fault occurs on the working
path.
protection service A specific service that is part of a protection group and is labeled protection.
provider edge (PE) A device that is located in the backbone network of the MPLS VPN structure. A PE is
responsible for managing VPN users, establishing LSPs between PEs, and exchanging
routing information between sites of the same VPN. A PE performs the mapping and
forwarding of packets between the private network and the public channel. A PE can be
a UPE, an SPE, or an NPE.
pseudo random binary A sequence that is random in the sense that the value of each element is independent of
sequence (PRBS) the values of any of the other elements, similar to a real random sequence.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 319


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting B Glossary

pseudo wire (PW) An emulated connection between two PEs for transmitting frames. The PW is established
and maintained by PEs through signaling protocols. The status information of a PW is
maintained by the two end PEs of a PW.
pseudo wire emulation An end-to-end Layer 2 transmission technology. It emulates the essential attributes of a
edge-to-edge (PWE3) telecommunication service such as ATM, FR or Ethernet in a packet switched network
(PSN). PWE3 also emulates the essential attributes of low speed time division
multiplexing (TDM) circuit and SONET/SDH. The simulation approximates to the real
situation.
pulse A variation above or below a normal level and a given duration in electrical energy.

Q
QinQ A layer 2 tunnel protocol based on IEEE 802.1Q encapsulation. It add a public VLAN
tag to a frame with a private VLAN tag to allow the frame with double VLAN tags to
be transmitted over the service provider's backbone network based on the public VLAN
tag. This provides a layer 2 VPN tunnel for customers and enables transparent
transmission of packets over private VLANs.
QoS See quality of service.
quality of service (QoS) A commonly-used performance indicator of a telecommunication system or channel.
Depending on the specific system and service, it may relate to jitter, delay, packet loss
ratio, bit error ratio, and signal-to-noise ratio. It functions to measure the quality of the
transmission system and the effectiveness of the services, as well as the capability of a
service provider to meet the demands of users.

R
RADIUS See Remote Authentication Dial In User Service.
RADIUS An authentication mode in which the BRAS sends the user name and the password to
authentication the RADIUS server by using the RADIUS protocol. The RADIUS server authenticates
the user, and then returns the result to the BRAS.
RB See radio bearer.
RDI remote defect indication
RFC See Request For Comments.
RJ registered jack
RMON See remote monitor.
ROADM reconfigurable optical add/drop multiplexer
RSTP See Rapid Spanning Tree Protocol.
RSVP See Resource Reservation Protocol.
RTN radio transmission node
RZ return to zero
Rapid Spanning Tree An evolution of the Spanning Tree Protocol (STP) that provides faster spanning tree
Protocol (RSTP) convergence after a topology change. The RSTP protocol is backward compatible with
the STP protocol.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 320


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting B Glossary

Remote Authentication A security service that authenticates and authorizes dial-up users and is a centralized
Dial In User Service access control mechanism. RADIUS uses the User Datagram Protocol (UDP) as its
(RADIUS) transmission protocol to ensure real-time quality. RADIUS also supports the
retransmission and multi-server mechanisms to ensure good reliability.
Request For Comments A document in which a standard, a protocol, or other information pertaining to the
(RFC) operation of the Internet is published. The RFC is actually issued, under the control of
the IAB, after discussion and serves as the standard. RFCs can be obtained from sources
such as InterNIC.
Resource Reservation A protocol that reserves resources on every node along a path. RSVP is designed for an
Protocol (RSVP) integrated services Internet.
radio bearer (RB) The service provided by the Layer 2 for the transfer of user data between UE (User
Equipment) and UTRAN (UMTS Terrestrial Radio Access Network).
reboot To start the system again. Programs or data will be reloaded to all boards.
receiver sensitivity The minimum acceptable value of mean received power at point Rn (a reference point
at an input to a receiver optical connector) to achieve a 1x10-12 BER when the FEC is
enabled.
regeneration The process of receiving and reconstructing a digital signal so that the amplitudes,
waveforms and timing of its signal elements are constrained within specified limits.
remote monitor A widely used network management standard defined by the IETF, and it enhances the
(RMON) MIB II standard greatly. It is mainly used to monitor the data traffic over a network
segment or the entire network. RMON is completely based on the SNMP architecture,
including the NMS and the Agent running on each network device.
report A tool that displays data in a specific format to intuitively present service information.
resistance The ability to impede (resist) the flow of electric current. With the exception of
superconductors, all substances have a greater or lesser degree of resistance. Substances
with very low resistance, such as metals, conduct electricity well and are called
conductors. Substances with very high resistance, such as glass and rubber, conduct
electricity poorly and are called nonconductors or insulators.
response A message that is returned to the requester to notify the requester of the status of the
request packet.
router A device on the network layer that selects routes in the network. The router selects the
optimal route according to the destination address of the received packet through a
network and forwards the packet to the next router. The last router is responsible for
sending the packet to the destination host. Can be used to connect a LAN to a LAN, a
WAN to a WAN, or a LAN to the Internet.
routing The determination of a path that a data unit (frame, packet, message) traverses from
source to destination.
routing table A table that stores and updates the locations (addresses) of network devices. Routers
regularly share routing table information to be up to date. A router relies on the
destination address and on the information in the table that gives the possible routes--in
hops or in number of jumps--between itself, intervening routers, and the destination.
Routing tables are updated frequently as new information is available.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 321


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting B Glossary

rule Rules define automatic management services or applications. Rules include triggering
conditions and execution logic. Triggering conditions refer to the conditions for
executing a rule and execution logic refers to a set of actions executed based on the rule.

S
S-VLAN service virtual local area network
SAN storage area network
SAPI source access point identifier
SD See signal degrade.
SD-SDI See standard definition-serial digital interface signal.
SDH See synchronous digital hierarchy.
SDI See serial digital interface.
SES severely errored second
SESR severely errored second ratio
SETS SDH equipment timing source
SF See signal fail.
SFP small form-factor pluggable
SFTP See Secure File Transfer Protocol.
SHDSL See single-pair high-speed digital subscriber line.
SLA See service level agreement.
SLM signaling link management
SM section monitoring
SMF See single-mode fiber.
SNC subnetwork connection
SNCP subnetwork connection protection
SNCTP subnetwork connection tunnel protection
SNMP See Simple Network Management Protocol.
SONET See synchronous optical network.
SPC soft permanent connection
SRG See shared risk group.
SRLG shared risk link group
SSL See Secure Sockets Layer.
SSM See Synchronization Status Message.
STI service trigger information
STM synchronous transfer mode
STM-1 See Synchronous Transport Module level 1.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 322


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting B Glossary

STM-16 Synchronous Transport Module level 16


STM-4 Synchronous Transport Module level 4
STM-N Synchronous Transport Module level N
STP Spanning Tree Protocol
STS synchronous transport signal
Secure File Transfer A network protocol designed to provide secure file transfer over SSH.
Protocol (SFTP)
Secure Sockets Layer A security protocol that works at a socket level. This layer exists between the TCP layer
(SSL) and the application layer to encrypt/decode data and authenticate concerned entities.
Simple Network A network management protocol of TCP/IP. It enables remote users to view and modify
Management Protocol the management information of a network element. This protocol ensures the
(SNMP) transmission of management information between any two points. The polling
mechanism is adopted to provide basic function sets. According to SNMP, agents, which
can be hardware as well as software, can monitor the activities of various devices on the
network and report these activities to the network console workstation. Control
information about each device is maintained by a management information block.
Synchronization Status A message that carries the quality levels of timing signals on a synchronous timing link.
Message (SSM) SSM messages provide upstream clock information to nodes on an SDH network or
synchronization network.
Synchronous Synchronous transfer mode at 155 Mbit/s.
Transport Module level
1 (STM-1)
security Protection of a computer system and its data from harm or loss. A major focus of
computer security, especially on systems accessed by many people or through
communication lines, is preventing system access by unauthorized individuals.
serial digital interface An interface that transmits data in a single channel in sequence.
(SDI)
serial port An input/output location (channel) that sends and receives data to and from a computer's
CPU or a communications device one bit at a time. Serial ports are used for serial data
communication and as interfaces with some peripheral devices, such as mice and printers.
service data The user and/or network information required for the normal functioning of services.
service level The level of service quality of an evaluated party in a specified period, determined by
an evaluating party.
service level agreement A service agreement between a customer and a service provider. SLA specifies the
(SLA) service level for a customer. The customer can be a user organization (source domain)
or another differentiated services domain (upstream domain). An SLA may include
traffic conditioning rules which constitute a traffic conditioning agreement as a whole
or partially.
service protection A measure that ensures that services can be received at the receive end.
shared risk group A group of resources that share a common risk component whose failure can cause the
(SRG) failure of all the resources in the group.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 323


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting B Glossary

signal degrade (SD) A signal indicating that associated data has degraded in the sense that a degraded defect
condition is active.
signal fail (SF) A signal indicating that associated data has failed in the sense that a near-end defect
condition (non-degrade defect) is active.
single-ended switching A protection mechanism that takes switching action only at the affected end of the
protected entity in the case of a unidirectional failure.
single-mode fiber A type of optical fiber through which only one type of optical signal with a fixed wave
(SMF) length can travel at a time. The inner diameter of the single-mode fiber is less than 10
microns. This type of fiber can transmit data over a long distance.
single-pair high-speed A symmetric digital subscriber line technology developed from HDSL, SDSL, and
digital subscriber line HDSL2, which is defined in ITU-T G.991.2. The SHDSL port is connected to the user
(SHDSL) terminal through the plain telephone subscriber line and uses trellis coded pulse
amplitude modulation (TC-PAM) technology to transmit high-speed data and provide
the broadband access service.
smooth upgrade Process of upgrading the system files without service interruption
span The physical reach between two pieces of WDM equipment.
standard definition- Standard definition video signal transported by serial digital interface.
serial digital interface
signal (SD-SDI)
subnet A type of smaller networks that form a larger network according to a rule, for example,
according to different districts. This facilitates the management of the large network.
synchronous digital A transmission scheme that follows ITU-T G.707, G.708, and G.709. SDH defines the
hierarchy (SDH) transmission features of digital signals, such as frame structure, multiplexing mode,
transmission rate level, and interface code. SDH is an important part of ISDN and B-
ISDN.
synchronous optical A high-speed network that provides a standard interface for communications carriers to
network (SONET) connect networks based on fiber optical cable. SONET is designed to handle multiple
data types (voice, video, and so on). It transmits at a base rate of 51.84 Mbit/s, but
multiples of this base rate go as high as 2.488 Gbit/s.

T
TCM tandem connection monitor
TCP See Transmission Control Protocol.
TCP/IP Transmission Control Protocol/Internet Protocol
TD transmit degrade
TDC tunable dispersion compensator
TDM See time division multiplexing.
TF transport format
TFTP See Trivial File Transfer Protocol.
TIM trail trace identifier mismatch
TL1 Transaction Language 1

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 324


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting B Glossary

TM See terminal multiplexer.


TMN See telecommunications management network.
TSC test system controller
TTI trail trace identifier
TX transmit
Transmission Control The protocol within TCP/IP that governs the breakup of data messages into packets to
Protocol (TCP) be sent using Internet Protocol (IP), and the reassembly and verification of the complete
messages from packets received by IP. A connection-oriented, reliable protocol (reliable
in the sense of ensuring error-free delivery), TCP corresponds to the transport layer in
the ISO/OSI reference model.
Trivial File Transfer A small and simple alternative to FTP for transferring files. TFTP is intended for
Protocol (TFTP) applications that do not need complex interactions between the client and server. TFTP
restricts operations to simple file transfers and does not provide authentication.
tangent ring A concept borrowed from geometry. Two tangent rings have a common node between
them. The common node often leads to single-point failures.
telecommunications A protocol model defined by ITU-T for managing open systems in a communications
management network network. TMN manages the planning, provisioning, installation, and OAM of
(TMN) equipment, networks, and services.
terminal multiplexer A device used at a network terminal either to multiplex multiple channels of low rate
(TM) signals into one channel of high rate signals, or to demultiplex one channel of high rate
signals into multiple channels of low rate signals.
threshold A limitation on an amount, scale, or level. Changes will occur when a threshold is
reached.
time division A multiplexing technology. TDM divides the sampling cycle of a channel into time slots
multiplexing (TDM) (TSn, n=0, 1, 2, 3…), and the sampling value codes of multiple signals engross time slots
in a certain order, forming multiple multiplexing digital signals to be transmitted over
one channel.
tolerance Permissible degree of variation from a pre-set standard.
topology view A basic component for the man-machine interface. The topology view directly displays
the networking of a network as well as the alarm and communication status of each
network element and subnet. The topology view reflects the basic running conditions of
the network.
transmission delay The period from the time when a site starts to transmit a data frame to the time when the
site finishes the data frame transmission. It consists of the transmission latency and the
equipment forwarding latency.
transparent A process during which the signaling protocol or data is not processed in the content but
transmission encapsulated in the format for the processing of the next phase.
tunnel A channel on the packet switching network that transmits service traffic between PEs.
In VPN, a tunnel is an information transmission channel between two entities. The tunnel
ensures secure and transparent transmission of VPN information. In most cases, a tunnel
is an MPLS tunnel.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 325


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting B Glossary

UAT See unavailable time event.


UNI See user-to-network interface.
UPI user payload identifier
UPM uninterruptible power module
unavailable time event An event that is reported when the monitored object generates 10 consecutive severely
(UAT) errored seconds.
unicast The process of sending data from a source to a single recipient.
unprotected Pertaining to the transmission of services that are not protected. Unprotected services
cannot be switched to the protection channel if the working channel is faulty or the service
is interrupted, because protection is not configured.
user-to-network The interface between user equipment and private or public network equipment (for
interface (UNI) example, ATM switches).

V
V-UNI See virtual user-network interface.
VB virtual bridge
VBR See variable bit rate.
VC See virtual container.
VCPLM virtual concatenation payload mismatch
VCTRUNK A virtual concatenation group applied in data service mapping, also called the internal
port of a data service processing board.
VDSL2 See very-high-speed digital subscriber line 2.
VLAN virtual local area network
VOA variable optical attenuator
VPLS See virtual private LAN service.
VPN virtual private network
VPWS See virtual private wire service.
VRRP See Virtual Router Redundancy Protocol.
VSI See virtual switching instance.
Virtual Router A protocol designed for multicast or broadcast LANs such as an Ethernet. A group of
Redundancy Protocol routers (including an active router and several backup routers) in a LAN is regarded as
(VRRP) a virtual router, which is called a backup group. The virtual router has its own IP address.
The host in the network communicates with other networks through this virtual router.
If the active router in the backup group fails, one of the backup routers in this backup
group becomes active and provides routing service for the host in the network.
variable bit rate (VBR) One of the traffic classes used by ATM (Asynchronous Transfer Mode). Unlike a
permanent CBR (Constant Bit Rate) channel, a VBR data stream varies in bandwidth
and is better suited to non real time transfers than to real-time streams such as voice calls.

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 326


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting B Glossary

very-high-speed digital An extension of the VDSL technology, which complies with ITU G.993.2, supports
subscriber line 2 multiple spectrum profiles and encapsulation modes, and provides short-distance and
(VDSL2) high-speed access solutions to the next-generation FTTx access service.
view The topological view that is presented in some rules. Customize the view according to
requirements of every product and organize the data in the view displayed by the topology
module. By default, the platform provides the physical view. The topology view can be
planned according to the domain, maintenance relationship and so on.
virtual container (VC) An information structure used to support path layer connections in the SDH. A VC
consists of a payload and path overhead (POH), which are organized in a block frame
structure that repeats every 125 μs or 500 μs.
virtual private LAN A type of point-to-multipoint L2VPN service provided over the public network. VPLS
service (VPLS) enables geographically isolated user sites to communicate with each other through the
MAN/WAN as if they are on the same LAN.
virtual private wire A technology that bears Layer 2 services. VPWS emulates services such as ATM, FR,
service (VPWS) Ethernet, low-speed TDM circuit, and SONET/SDH in a PSN.
virtual switching An instance through which the physical access links of VPLS can be mapped to the
instance (VSI) virtual links. Each VSI provides independent VPLS service. VSI has Ethernet bridge
function and can terminate PW.
virtual user-network A virtual user-network interface, works as an action point to perform service
interface (V-UNI) classification and traffic control in HQoS.

W
WAN wide area network
WDM wavelength division multiplexing
WLAN See wireless local area network.
WSS wavelength selective switching
Web LCT The local maintenance terminal of a transport network, which is located at the NE
management layer of the transport network.
wireless local area A hybrid of the computer network and the wireless communication technology. It uses
network (WLAN) wireless multiple address channels as transmission media and carriers out data interaction
through electromagnetic wave to implement the functions of the traditional LAN.
working service A specific service that is part of a protection group and is labeled working.

X
XCS cross-connect and synchronous timing board

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 327


Copyright © Huawei Technologies Co., Ltd.
OptiX OSN 8800/6800/3800
Troubleshooting B Glossary

Y.1731 The OAM protocol introduced by the ITU-T. Besides the contents defined by
IEEE802.1ag, ITU-T Recommendation Y.173 also defines the following combined
OAM messages: Alarm Indication Signal (AIS), Remote Defect Indication (RDI),
Locked Signal (LCK), Test Signal, Automatic Protection Switching (APS), Maintenance
Communication Channel (MCC), Experimental (EXP), and Vendor Specific (VSP) for
fault management and performance monitoring, such as frame loss measurement (LM),
and delay measurement (DM).

Issue 05 (2013-11-30) Huawei Proprietary and Confidential 328


Copyright © Huawei Technologies Co., Ltd.

Das könnte Ihnen auch gefallen