Beruflich Dokumente
Kultur Dokumente
V100R007C00
Troubleshooting
Issue 05
Date 2013-11-30
and other Huawei trademarks are trademarks of Huawei Technologies Co., Ltd.
All other trademarks and trade names mentioned in this document are the property of their respective holders.
Notice
The purchased products, services and features are stipulated by the contract made between Huawei and the
customer. All or part of the products, services and features described in this document may not be within the
purchase scope or the usage scope. Unless otherwise specified in the contract, all statements, information,
and recommendations in this document are provided "AS IS" without warranties, guarantees or representations
of any kind, either express or implied.
The information in this document is subject to change without notice. Every effort has been made in the
preparation of this document to ensure accuracy of the contents, but all statements, information, and
recommendations in this document do not constitute a warranty of any kind, express or implied.
Website: http://www.huawei.com
Email: support@huawei.com
Related Versions
The following table lists the product versions related to this document.
Intended Audience
The intended audiences of this document are:
Symbol Conventions
The symbols that may be found in this document are defined as follows.
Symbol Description
Symbol Description
GUI Conventions
The GUI conventions that may be found in this document are defined as follows.
Convention Description
Change History
Updates between document issues are cumulative. Therefore, the latest document issue contains
all updates made in previous issues.
Update Description
Basic Concepts Updated "Test Frame": optimized application 2 of ethernet test frame.
and Methods of
Fault Locating
Update Description
Whole manual l Deleted "Description" in the Chapter "General Fault Handling Flow"
l Deleted "Information Collecting and Recording" "Technical
Support" "Latest Technical Documentation Obtaining" in the Chapter
"Fault Handling Flow"
l Deleted "Basic Rules for Locating Faults" in the Chapter "Basic
Concepts and Methods of Fault Locating".
l Deleted "Configuration Data Analysis" in the Chapter "Common
Methods of Locating Faults"
l Deleted "Related Information" in the Chapter "Rectifying Bit Errors"
l Deleted "Failure of Ringing of the Orderwire Phone Set upon
Incoming Calls Though the Calls Can Be Answered" in the Chapter
"Rectifying Orderwire Problems"
l "Frequent Resetting of the SCC Board" has been moved from "17
ECC Problems" to "10 NE Offline Problems".
l Added "Rectifying Coherent Board Faults".
Update Description
Update Description
3.1 Common Fault Isolation l 3.1.6 PRBS Test: Updated PRBS networking
Methods applications.
l 3.1.8 Test Frame: Added the 40G service type and
the supported board.
l Added "One-Click Data Collection".
Update Description
Update Description
10.1 Unreachability of One NE Causes and diagnosis methods for the fault are modified.
by the U2000
14.4 IPA End Failure in a Causes and diagnosis methods for the fault are added.
Raman System
Update Description
Entire manual This manual provides descriptions according to product series OptiX
OSN 8800, OptiX OSN 6800, and OptiX OSN 3800. Any difference
between the products is described in the manual.
Contents
2 Troubleshooting Workflow.........................................................................................................9
2.1 General Troubleshooting..............................................................................................................................................10
2.1.1 Workflow...................................................................................................................................................................10
2.2 Emergency Troubleshooting.........................................................................................................................................12
2.2.1 Workflow...................................................................................................................................................................12
2.2.2 Description.................................................................................................................................................................15
5 Service Interruptions..................................................................................................................56
5.1 Interruption of All Services..........................................................................................................................................58
5.2 Single-Channel Service Interruption on the Client Side..............................................................................................60
5.3 Single-Channel Service Interruption on the WDM Side..............................................................................................61
5.4 Service Interruption Caused by Human Misoperation.................................................................................................63
5.5 Service Interruptions Caused by Power Supply Failures.............................................................................................64
5.6 Service Interruptions Due to Grounding Fault.............................................................................................................66
5.7 Service Interruptions Due to Unfavorable Environmental Conditions.........................................................................67
5.8 Service Interruptions Due to Faulty Optical Fibers, Cables, or Connectors................................................................71
5.9 Service Interruptions Due to Inconsistency of Board Types or Board Settings...........................................................74
5.10 Service Interruptions on the VCTRUNK Port............................................................................................................75
5.11 Service Interruptions on a Single Ethernet Port.........................................................................................................77
5.12 Service Interruptions on All Ethernet Ports................................................................................................................79
5.13 Service Interruption Cases..........................................................................................................................................81
8 Bit Errors......................................................................................................................................106
8.1 Multi-Channel Bit Errors............................................................................................................................................107
8.2 Single-Channel Bit Errors..........................................................................................................................................109
8.3 Bit Errors Due to Unfavorable Environmental Conditions........................................................................................111
8.4 Bit Errors Due to Device Faults.................................................................................................................................115
8.5 Bit Error Cases............................................................................................................................................................117
10 NE Offline Problems..............................................................................................................123
10.1 Unreachability of One NE by the U2000.................................................................................................................124
10.2 Unreachability of All NEs in a Subnet by the U2000..............................................................................................125
16 Ethernet Problems...................................................................................................................250
16.1 Service Degradation.................................................................................................................................................251
16.2 LCAS Startup Failure...............................................................................................................................................252
16.3 LPT Switching Failure..............................................................................................................................................255
16.4 LB Test Failure.........................................................................................................................................................255
16.5 Test Frames Unavailability.......................................................................................................................................259
16.6 Common LAG faults................................................................................................................................................259
16.7 Common MSTP faults..............................................................................................................................................262
16.8 Ethernet Service Cases.............................................................................................................................................265
17 ECC Problems...........................................................................................................................266
17.1 ECC Communication Interruption............................................................................................................................267
17.2 Intermittent ECC Communication............................................................................................................................268
17.3 HWECC FAQ...........................................................................................................................................................269
18 Clock Problems........................................................................................................................275
18.1 Failure of NodeBs to Trace Clock Signals...............................................................................................................276
18.2 Synchronization Failure of the Physical-Layer Clock..............................................................................................277
19 Orderwire Problems................................................................................................................280
19.1 Orderwire Failure.....................................................................................................................................................281
19.2 A Called Phone Set Cannot Exit the Conference Call State.....................................................................................282
19.3 Orderwire Cases.......................................................................................................................................................283
B Glossary......................................................................................................................................298
1 Overview
This topic describes safety precautions, introduces essential instruments and tools, and outlines
maintenance personnel training requirements necessary for troubleshooting.
1.1.1 Laser
When you install and maintain equipment, follow the safety precautions described below to help
prevent personal injury or equipment damage. The laser complies with IEC60825.
l Personal injury
l Equipment damage
Personal Injury
DANGER
Laser beams from the optical ports on boards or from the fiber connectors cause eye damage.
Do not look directly at the optical ports or fiber connectors during the installation and
maintenance of boards or fibers. Do not shine laser beams into the eyes of other workers.
DANGER
To prevent eye damage in the case of an optical port that is in use, use protective caps to cover
the optical interface and the fiber connector after you remove the fiber from the optical interface.
DANGER
The optical power of the LINE interface on the Raman amplifier board is very high. Shut down
the pump laser before you insert or remove fiber connectors on the Raman amplifier board to
help prevent personal injury that is caused by high optical power.
Equipment Damage
CAUTION
Use protective caps to cover unused optical ports and fiber connectors so that they do not gather
dust.
CAUTION
When performing a hardware loopback test on optical ports using a fiber, add an optical
attenuator to prevent damage to the equipment because of the high power of the laser beam. Add
the attenuator at the receive optical port on a board that supports optical attenuators.
CAUTION
When you use the optical time domain reflectometer (OTDR), disconnect the fiber between the
opposite station and the board to prevent damage to the receive optical module because of high
optical power.
CAUTION
Exercise caution when you remove or insert a board that is connected with fibers.
CAUTION
The optical power of the Raman amplifier board is very high. Observe the following precautions
when using the Raman amplifier board to prevent damage to the equipment.
l Do not use fiber connectors within 0–20 km. The fibers at every joint point must be spliced.
l The single-point additional loss within 0–10 km must be smaller than 0.1 dB (G.652) or
0.2 dB (G.655) and the single-point return loss must not be smaller than 40 dB.
l The single-point additional loss within 10–20 km must be smaller than 0.2 dB (G.652) or
0.4 dB (G.655) and the single-point return loss must not be smaller than 40 dB.
l Fiber connections must be complete before you enable the lasers on the Raman amplifier
board. Make sure that the fiber connectors are clean. Otherwise, the fiber connectors might
be damaged when you insert or remove the fiber connectors.
l The optical power of the LINE interface on the Raman amplifier board is very high. The
LSH/APC optical connectors must be used in the fiber that is connected to the LINE
interface.
l For the Raman amplifier board with backward pump, the strong pump light enters the fiber
through the input end (LINE) instead of the output end (SYS). Do not add boards or non-
fiber devices, such as attenuators or fiber jumpers, at the input end.
l The bent radius of the fiber that is connected to the LINE interface on the Raman amplifier
board must be larger than 30 mm to prevent the fiber from being burned.
CAUTION
Do not place tools, such as screwdrivers, on the air baffle.
CAUTION
Ensure that screws do not fall off into the subrack or chassis.
1.1.3 ESD
During installation and maintenance, follow ESD procedures to prevent equipment damage:
CAUTION
Wear a well-grounded ESD wrist strap whenever you touch equipment or boards. Make sure
that the wrist strap touches your skin. Insert the ESD strap connector into the ESD socket of the
equipment.
For information about how to wear an ESD wrist strap, see Figure 1-1.
NOTE
Insert the connector of the ESD strap into the equipment port. For details, see the Quick Installation Guide.
When you are following ESD procedures, take the following precautions:
l Check the validity and functionality of the wrist strap. Its resistance value must be between
0.75 mega ohm to 10 mega ohm. If the wrist strap validity period (usually two years) has
expired, or if the resistance value fails to meet requirements, replace it with a wrist strap
that provides the required resistance value.
l Do not touch a board with your clothing. Clothing generates static electricity that is not
protected by the wrist strap.
l Wear an ESD wrist strap and place the board on an ESD pad when you replace boards or
chips. Use ESD tweezers or extraction tools to replace chips. Do not touch chips, circuits,
or pins with your bare hands.
l Keep the boards and other ESD-sensitive parts you are installing in ESD bags. Place the
removed boards and components on an ESD pad or ESD material. Do not use non-antistatic
materials such as white foams, common plastic bags, or paper bags to pack boards, and do
not let these materials touch the boards.
l Wear an ESD wrist strap when operating the ports of boards because they are also ESD-
sensitive. Discharge the static electricity of cables and protective sleeves before you connect
them to the ports.
l Keep packing materials (such as, ESD boxes and bags) available in the equipment room
for packing boards in the future.
ESD complies with IEC Publication 1000, EN 55022, EN 55024, IEC 61000 and GR-1089-
CORE.
DANGER
Do not install or disassemble equipment when power is applied.
Table 1-1 describes the alarm and safety symbols on the WDM equipment.
Symbol Describes
Grounding symbol.
Indicates the position of the grounding point.
Symbol Describes
Table 1-2 lists the instruments and tools used for troubleshooting.
Name Usage
Fiberscope Checks whether the optical fiber end faces are clean.
ESD wrist strip Protects sensitive components against static generated by the
human body.
Name Usage
Background Knowledge
l WDM fundamentals
l The WDM system alarm generation mechanism
l Troubleshooting methods for common alarms
Basic Operations
l Basic transmission equipment operations
l Basic network management system (NMS) operations
For basic NMS operations, refer to the OptiX iManager U2000 Operator Guide or the on-line
help.
Network Layout
The network layout of the project
The service configuration, wavelength assignment, optical distribution frame (ODF), fiber
routing, board version, and equipment layout in each office.
The relevant engineering files and documents (which need to be updated periodically)
2 Troubleshooting Workflow
This topic describes the troubleshooting process, information collection, and how to obtain
additional technical support and the latest technical documentation.
2.1.1 Workflow
This topic describes how to handle general faults.
Start
Yes
External causes? Other handling process
No
Yes
Fault cleared?
No
Try to solve
No
Service recovery?
Yes
Observe the running
equipment
No
Fault cleared?
Yes
End
External Faults
Handle faults caused by any of the following external factors:
l Power failure
l Optical cable fault
l Environment variable (for example, telecommunications room temperature)
l Terminal equipment
Technical Support
Contact Huawei Customer Service Center (CSC) for assistance locating and resolving any issues
beyond the scope of this document.
You can also contact the technical support personnel in Huawei's local representative office in
either of the following ways:
l Hotline: 4008302118
l E-mail: support@huawei.com
Troubleshooting Reports
After the fault is resolved, fill in the related troubleshooting report in a timely manner and
summarize how the fault was handled.
2.2.1 Workflow
This topic describes emergency handling workflow.
The emergency handling flow refers to the fault handling flow when the service of the equipment
is interrupted due to external faults (such as power supply fault or fiber cut), misoperations, or
software/hardware faults of the equipment. In the case of the OptiX WDM optical transmission
equipment, in addition to clearing the fault according to the troubleshooting flow, take other
emergency measures (such as providing standby channels) or ask for help in time to reduce the
service interruption duration. The emergency troubleshooting process refers to the
troubleshooting process when services on the equipment are interrupted due to external faults,
such as a power failure or damaged fiber cables, incorrect operations, or software/hardware faults
of the equipment. For the OptiX WDM optical transmission equipment, in addition to clearing
the fault according to the troubleshooting process, take other emergency measures, such as
providing standby channels, or ask for help in time to reduce the duration of service interruption.
CAUTION
When handling service interruptions or other emergencies, note the following points:
l Restore services as soon as possible. If a backup channel is available, switch services to the
backup channel.
l Analyze the fault symptoms first, find the causes, and handle the fault.
l When you fail to resolve a fault, contact Huawei technical support to troubleshoot the fault
and minimize the duration of service interruption.
l During the handling process, record and save the original fault data.
No
No The
Locate the faulty
Yes R_LOS/R_LOF/MUT_L The power Yes
Yes section along the signal Yes
1 Check alarms with the NMS OS/IN_PWR- supply/optical cable
flow and confirm
LOW/PWR_HIGH/NE- is faulty?
whether the NE is
unreachable alarm
unreachable
exists?
No No No
No
The fault is a multi-
The fault is a cross- channel fault, go to 5
connect fault, go to 3
The
OTU_LOF/OTU_LOM/
Yes
ODU_LOF/ODU_LOM/
ODUk_PM_SSF/ODUk
_PM_BDI alarm
exists?
Yes
The fault is an Check the bit error The fault is a single- Replace the faulty
OTN/electrical layer performance and locate channel fault, go to 6 board
fault, go to 4 the channel with
excessive bit errors
The system
Yes OSNR/nonlinearity/fiber
jumper between the OTU
and MUX/DEMUX board
is faulty?
The fiber
Yes The system OSNR or Yes Yes connections between
nonlinearity is improper or The cross-connection OTU/tributary/line
the fiber jumper/optical configuration is correct? boards at the
cable is faulty? local/opposite end
are normal?
No No No
No
Replace the faulty
board
The
LSR_WILL_DIE/OUT_PWR_HIGH/
OUT_PWR/LOW alarm exists on
the OTU/line board at the opposite
end?
Handle the fiber
jumper/optical cable
fault
Replace the faulty Replace the faulty
board at the local end board at the opposite
end
Both
the transmit and receive Yes
optical power values are Compare the current optical power of the
abnormal? OTU/tributary/line board of the faulty
channel with the history normal value
No
Compare the current
optical power of the OA/
MUX/DEMUX board with
the history normal value
The
OTU/tributary/line Yes
board is faulty?
The
The OA/MUX/DEMUX
Yes Yes
OA/MUX/DEMUX boards at both the transmit
board is faulty? and receive ends
are No
faulty?
No No
Replace the internal fiber
Handle the fiber Replace the faulty Handle the fiber between the OTU and Replace the faulty board
jumper/optical cable fault board jumper/optical cable fault MUX/DEMUX boards
2.2.2 Description
This topic describes the emergency troubleshooting process.
Handling Alarms
l If an R_LOS, R_LOF, MUT_LOS, IN_PWR_HIGH, IN_PWR_LOW, or NE-unreachable
alarm is reported, check and analyze the alarm.
– When such an alarm occurs, if the power supply or optical cable is faulty, handle these
faults first.
– Determine whether the fault is a multi-channel or single-channel issue based on the the
range of services affected, and handle it accordingly.
– If multiple channels are faulty, check the transmit/receive optical power and compare
the current optical power of the optical amplifier (OA)/multiplexer (MUX)/
demultiplexer (DEMUX) board with the historical normal value. Check the network
section by section to determine whether the line, pigtail, or OA/MUX/DEMUX board
is faulty. Then handle the line/pigtail fault or replace the board accordingly.
– If only one channel is faulty, compare the current optical power of the optical
transponder unit (OTU)/tributary/line board on the faulty channel with the historical
normal value to determine whether the board or intra-board fiber connection is faulty.
Then replace the faulty board or change the internal fiber connections accordingly.
l If no R_LOS, R_LOF, MUT_LOS, IN_PWR_HIGH, IN_PWR_LOW, or NE-unreachable
alarm is reported, check the bit error rate (BER) performance or optical power performance
for any of the following problems.
– If all channels have excessive bit errors, a system fault has occurred. Check the network
section by section along the signal flow to determine whether the system OSNR and
nonlinearity are correct and whether the pigtail/optical cable is faulty. If the system
OSNR and nonlinearity are incorrect and the pigtail/optical cable is faulty, handle the
pigtail/optical cable fault. If the pigtail/optical cable is functioning normally, the OA/
MUX/DEMUX board is faulty and should be replaced.
– If the fault is not a system fault and the BUS_ERR alarm is reported, the fault is a cross-
connection fault. If the cross-connection configuration is incorrect, modify it to resolve
the fault. If it is correct, the tributary/line/cross-connect board is faulty and should be
replaced.
– If the OTU_LOF/OTU_LOM/ODU_LOF/ODU_LOM/ODUk_PM_SSF/
ODUk_PM_BDI alarm is reported, the OTN/electrical layer is faulty. Check whether
the fiber connections between OTU boards, tributary boards, or line boards at the local
and opposite ends are faulty. If any of these is faulty, handle the fiber connection faults,
including the inconsistency between logical and physical fiber connections and physical
fiber connection faults.
– If a TD/TF/LSR_WILL_DIE alarm is reported by an OTU/line board, the board is faulty
and should be replaced.
TIP
As shown in the flowchart, optical-layer faults and fiber connection faults cause OTN alarms.
Therefore, these faults must be handled first. After these faults are cleared, OTN alarms will typically
clear also.
Table 2-1 lists common alarms and their causes for the OA, MUX, DEMUX or OTU/tributary/
line board.
NOTE
a: Common alarms are listed by board type in the table. Alarms for different boards of the same type may
differ slightly.
In addition, there are various running and alarm indicators with different colors on the equipment.
The On/Off and flashing states of these indicators indicate the current running state or possible
alarm of the equipment. Such indication helps you analyze the cause of the fault and accordingly
handle the alarm. Refer to the Indicators.
CAUTION
If you observe only the alarm indicators on the top of the cabinet, minor equipment alarms may
be missed (in the case of minor alarms, the alarm indicators on the top of the cabinet do not light
up, whereas the alarm indicators on the board do). In most cases, a minor alarm indicates a
potential fault at the local or opposite end. Such alarms cannot be neglected.
Equipment indicators indicate only the current operating status of the equipment, not the faults
that have already occurred and been cleared.
The indicator status that corresponds to a specific alarm category can be redefined on the U2000,
and certain alarm types can even be suppressed.
The board alarm indicator reports the alarm with the highest severity level detected by the board.
CAUTION
Before short-circuiting the transmit port of the corresponding OTU/tributary board at
the opposite station to the receive port of the corresponding OTU/tributary board,
determine whether to add an optical attenuator to the receive optical port on the OTU/
tributary board. The appropriate optical attenuator is determined based on the optical
power specification on the transmit port of the OTU/tributary board. Otherwise, optical
power overload might damage the OTU/tributary board.
– Method 2: For an OTU/tributary board that supports B1 bit error detection, compare
whether the RSES performance value of the OTU/tributary board at the local station is
the same as that at the opposite station. If they are the same, the system has no new bit
errors and is operating normally, which indicates that the fault is located on the client
side.
– Method 3: If users check client equipment on site, they can perform a self-loop (an
optical attenuator is required) on the receive and transmit optical ports of the client
equipment to check the alarms on the equipment. If alarms persist or the bit error meter
still detects bit errors, the fault is occurring on the client side.
l Troubleshooting an optical cable fault
To troubleshoot an optical cable fault, use any of the following methods:
– Method 1: Test the input optical power of the local station and the output optical power
of the upstream station. If the difference between the two values (that is, the attenuation)
is less than the designated value, the optical cable is functioning normally; otherwise,
the optical cable is faulty.
– Method 2: Check whether optical cable parameters such as the type and length meet the
design requirements. If not, the optical cable is faulty.
– Method 3: If the optical cable meets design requirements, check whether the optical
cable connector is properly attached. Ensure that the connection is normal.
– Method 4: Switch system services to the backup optical cable. If the alarm is cleared,
the optical cable is faulty.
– Method 5: Use an OTDR to measure the optical power and determine whether the optical
cable is faulty. If the reflectance of the tested optical fiber core is less than 27 dB and
the attenuation of the fiber core is less than the designated value, the optical cable is
functioning normally. Otherwise, the optical cable is faulty. Note that the OTDR has a
blind spot within a short distance (smaller than 5 km), in which the test results are
inaccurate.
CAUTION
When you use the OTDR, separate the optical fiber from the equipment. Otherwise, the intense
OTDR light might damage the equipment.
If you cannot log in to a station and all boards connected to the station report alarms indicating
input signal loss, the power supply of this station might be faulty. This causes a power failure
at this station and triggers the alarm. If this station suddenly enters an abnormal operating status,
the optical power of the station will suddenly decrease, some boards will operate abnormally,
services will be interrupted, and abnormal login will occur. Check whether the power supply
voltage of the transmission equipment is or a low-frequency transient voltage surge occurred.
If the equipment is struck by lightning or cannot be interconnected, check the grounding. Check
whether the equipment is grounded in compliance with the specifications, whether any
equipment is disconnected from the public ground bar, and whether all types of equipment in
the same telecommunications room are grounded similarly. Then use an earth resistance tester
to test whether the grounding resistance and the potential between working and protection
grounds are within the permitted range.
l Method 1: Measure the input optical power of the board reporting the alarm and the output
optical power of the corresponding board at the opposite station.
– If the transmit optical power of the corresponding board at the opposite station is normal
and the difference between the receive optical power of the board at the local station
and the transmit optical power of the corresponding board at the opposite station is
greater than the designed value, the optical fiber is faulty.
– If the transmit optical power of the corresponding board at the opposite station is too
low, the pluggable optical module on the board is faulty or the board is faulty.
l Method 2: Connect the board and optical meter using a new pigtail and measure the transmit
optical power of the board.
– If the measured optical power is normal, the original optical fiber is faulty.
– If the measured optical power is still too low, the pluggable optical module on the board
is faulty or the board is faulty.
This topic describes the basic concepts and methods used to locate faults.
Figure 3-1 Signal flow analysis: Each station uses tributary and line boards
Tributary Line Line Tributary Client
Client
board board MUX OA OA DEMUX board board equipment
equipment
DEMUX OA OA MUX
Station A Station B
NOTE
The U2000 provides the service signal flow interface. For details on how to view the service signal
flow using the GUIs, see "Viewing the Signal Flow Diagram for a WDM Trail" in the Online
Help.
l Fault analysis:
As shown in the preceding figure, the service signal flow of the client equipment at station
B is: client equipment at station A → the tributary board at station A → the cross-connect
board at station A → the line board at station A → MUX at station A → OA at station A
→ OA at station B → DEMUX at station B → the line board at station B → the cross-
connect board at station B → the tributary board at station B → client equipment at station
B.
Possible causes are as follows:
Client Client
equipment OTU MUX OA OA DEMUX OTU equipment
DEMUX OA OA MUX
Station A Station B
NOTE
The U2000 provides the service signal flow interface. For information on how to view the service
signal flow in GUIs, see "Viewing the Signal Flow Diagram for a WDM Trail" in the Online Help.
l Fault analysis:
The Client equipment at station B receives no optical signals or receives a large number of
bit errors. As shown in Figure 3-2, the service signal flow of the Client equipment at station
B is: Client equipment at station A → OTU at station A → MUX at station A → OA at
1. Analyze the alarms and performance of the OTU at station A. If the client-side
interface of this OTU receives no optical signal or its received optical power is too
low, locate the fault to the transmit end of the Client equipment at station A, the fiber
jumper from the client equipment to the OTU, or the client side receiver module of
the OTU.
2. If the input optical power of the OTU at station A is normal, check whether its output
optical power is normal. If not, locate the fault to this OTU.
3. If the output optical power of the OTU at station A is also normal, observe whether
the optical power of the MUX at station A sharply changes. If there are many
wavelengths at station A, the loss of one of them does not results in a great change to
the optical power. Therefore, feed the signals at the MON port of the MUX to the
MCA board and query for any loss-of-wavelength alarm.
4. The key components on the MUX are passive; therefore, the MUX is not likely to be
damaged. When the MCA detects the loss of a wavelength, the most possible fault
point is between the fiber jumper from the OTU to the MUX.
5. The OA provides a function of input and output optical power detection. If the OA is
faulty, multiple wavelengths are affected. As a result, the possibility is rather low that
the OA board is faulty.
6. At station B, analyze the signal flow in this order: OA at station B → DEMUX at
station B → OTU at station B → Client equipment at station B. The method to analyze
the signal flow at station B is similar to that of station A.
The U2000 fault isolation capabilities can be affected by system faults, and are classified as
follows:
l Comprehensive: You are able to obtain fault information for the entire network.
l Accurate: You are able to obtain the current alarms, alarm generation time, and historical
alarms of the equipment. In addition, you are able to obtain the specific values of the
performance events.
l Complex: In some cases, a significantly high number of alarms and performance events
can make analysis very difficult.
l Dependent: Fault isolation is entirely dependent on the normal operations of the computer,
software, and communication equipment. If any of these are faulty, fault isolation
capabilities may be limited or entirely lost.
CAUTION
l Before querying the performance data on the U2000, enable performance monitoring on the
U2000. Otherwise, the performance data will not be reported to the U2000.
l Before querying the alarm or performance data on the U2000, ensure that the running time
of each NE is correctly set. Otherwise, the alarm and performance data will be incorrect or
not reported to the U2000.
l During maintenance, after re-issuing the configuration to a NE, set the NE time to the current
time. Otherwise, the NE will operate at the incorrect default time.
The generation, detection, and transmission of an alarm varies with the OTU type and the type
of signals that the OTU receives. Alarm signal flow analysis allows you to quickly determine
where the fault is occurring.
Figure 3-3 Alarm and performance analysis: A Non-convergence OTU board processes SDH-
compliant signals
Client Client
equipment OTU MUX OA OA DEMUX OTU equipment
DEMUX OA OA MUX
Station A Station B
NOTE
The automatic laser shutdown (ALS) function of the OTU board shown in the figure is disabled.
R_LOS
R_LOS REM_SF R_LOF
Alarm processing
As shown in Figure 3-4, the client side of the OTU board at station A receives R_LOS signals.
The WDM side of this OTU board reports an R_LOS alarm and sends the fault information to
station B. The client side of the OTU board at station B reports the REM_SF alarm and sends
the fault information to the downstream client device. The client device then reports an
R_LOF alarm.
If the client-side OTU board at station A reports the R_LOS alarm to the U2000, the client-side
equipment at station B reports the R_LOF alarm. In this case, the input signals on the client side
of the OTU board at station A are abnormal. Therefore, if the input signals on the client side of
the OTU board at station A are abnormal, the client device is normal and the local device is
faulty.
Figure 3-5 Alarm and performance analysis: A non-convergence OTU board processes OTN-
compliant signals
Client Client
equipment OTU OTU-W OTU-E OTU equipment
NOTE
The ALS function of the OTU board shown in this figure is disabled.
OTUk_LOF/
OTUk_LOM/ SF
OTUk_AIS OTUk_LOF/
OTUk_LOM/ ODUk_PM_AIS
OTUk_AIS
OTUk_BDI
ODUk_PM_BDI
As shown in Figure 3-6, when the optical fiber between station A and station B degrades, an
OTU2_LOF alarm is detected on the WDM side of the OTU-W at station B. After being
processed at station B, the OTU2_LOF alarm is transmitted to stations A and C. (If the optical
fiber is severely degraded, either an LOF or LOM alarm is generated. In this example, an LOF
alarm is generated.) At station C, an ODU2_PM_AIS alarm is detected on the WDM side of the
OTU board. After being processed at station C, the ODU2_PM_AIS alarm is transparently
transmitted to station B and then to station A. Then, station A reports OTU2_BDI and
ODU2_PM_BDI alarms.
If the WDM side of the OTU board at station A reports the OTU2_BDI and ODU2_PM_BDI
alarms to the U2000, the WDM side of the OTU-W board at station B reports the OTU2_LOF
alarm, and the WDM side of the OTU board at station C reports the ODU2_PM_AIS alarm. In
this case, the input signals on the WDM side of the OTU board at station B are abnormal. Further
analysis can be performed to determine whether the fault is caused by degraded optical fibers.
Overview
The most commonly used test instruments for the WDM system include the optical power meter,
optical spectrum analyzer, SDH/SONET tester, SmartBits tester, communication signal analyzer
and multimeter. Among these tools, the optical power meter and optical spectrum analyzer are
most often used.
Application
l Optical power meter
The optical power of each point can be obtained from the U2000 performance data.
However, to get a precise reading for a specific point, you must measure the optical power
at that point using an optical power meter.
CAUTION
When the output optical power of the MUX boards, the input optical power of the DEMUX
boards, or the input and output optical power of the OA boards are abnormal, all services
are interrupted if the line is disconnected for testing. Therefore, do not disconnect the line
to test the optical power of the main-path signal unless absolutely necessary.
You can test the output optical power on the MON port of various boards. Use this tested
value and the ratio of the tested value to the optical power of the main-path signal to
calculate the optical power of the main-path signal.
l Optical spectrum analyzer
Use an optical spectrum analyzer to test the optical spectrum of the output signal on the
MON port of the board. Read from the spectrum analyzer the optical power, OSNR, central
wavelength and analyze the gain flatness of the OA. Compare this data with the original
data and check whether there is a significant performance deterioration (for the original
data, refer to the engineering documentation). Verify the following items:
– Per-channel optical power is normal and the gain flatness is normal.
– OSNR meets the design requirements.
– Deviation of the central wavelength is lower than the specification.
NOTE
The main-path optical spectrum on the MON ports of boards can be tested online. If all the services
carried on the main path are affected, analyze the optical spectrum of the OA boards. If only one
service channel on the main path is affected, analyze the optical spectrum of the MUX and DEMUX
boards.
l SDH/SONET tester, SmartBits tester, and communication signal analyzer
If it is suspected that the poor interconnection is caused by signal error, use these analyzers
to check whether the frame signals and overhead bytes are normal, and check whether there
are any abnormal alarms.
l Multimeter
If it is suspected that the power supply is too high or too low, use a multimeter to measure
the input voltage.
If it is suspected that the poor interconnection is caused by improper grounding, use a
multimeter to measure the voltage between the shield layers of the coaxial ports at the
transmit and receive ends of interconnected channels. If the voltage is more than 0.5 V, the
grounding is improper.
3.1.4 Loopback
The loopback method is the most common and effective method to locate a fault in a transport
network. The key feature of this method is that it does not require in-depth analysis of alarm and
performance information. Maintenance engineers should be familiar with this method.
There are software loopbacks and hardware loopbacks, which have the following advantages
and disadvantages:
l A hardware loopback is performed on a physical port (optical port) using an optical fiber.
Compared with the software loopback, the hardware loopback is more reliable. The
hardware loopback, however, requires on-site operations. In addition, a receive optical
power overload must be avoided when performing a hardware loopback.
l Compared with the hardware loopback, the software loopback is simpler but cannot locate
a fault as accurately. For example, during a single station test, if a software inloop is
performed on an optical port and the service is normal, the line board may not necessarily
be normal. However, if a selfloop is performed on the same optical port using a pigtail and
the service is normal, line board is normal.
For loopback configurations, refer to Performing Software Loopback and Performing Hardware
Loopback.
CAUTION
When performing a loopback on the optical path of the multiplex section, ensure that the OSNR
and dispersion meet the OTU requirements.
This method affects service quality. Therefore, it is generally used in deployment commissioning
or fault location in situations where services have been interrupted.
If the protection scheme has been configured, loopback is not recommended. Otherwise,
protection switching may fail. If a loopback is required, lock the protected services on the current
channel before performing the loopback, and unlock the services after the loopback is completed.
Application
The loopback method is applicable when the approximate range of the fault is known. The fault
is located by applying the loopback method on a section by section basis. This method clears
board and line faults. Figure 3-7 provides an example of fault isolation using the loopback
method.
DEMUX OA OA MUX
Inloop on
client side
Station A (OTM) Station B (OTM)
In Figure 3-7, the client side of the OTU at station A reports an R_LOS alarm whereas the
WDM side at station A reports no alarm. The WDM side and client sides of the OTU board at
station B report no alarms whereas the client equipment at station B reports an R_LOF alarm.
Perform an inloop on the client side of the OTU board at station A as shown in Figure 3-7.
l If the alarms at both stations A and B are cleared, the OTU board at station A is receiving
no signal from the client equipment at station A. The fault is occurring on the client
equipment at station A or the optical fiber from the client equipment at station A to the
OTU board at station A.
l If the alarms at stations A and B persist, the fault is occurring on the OTU board at station
A.
NOTE
l The faulty point can be identified quickly using the analysis method described in the "Alarm and
Performance Analysis" and analyzing the fault based on the alarm signal flow.
l If the system is capable of automatically releasing a loopback, the automatic loopback release period
can be changed or entirely disabled on the U2000. By default, the automatic loopback release period
is five minutes.
3.1.5 Replacement
In this method, a component suspected to be malfunctioning is replaced with a functional
component, in order to locate and resolve the fault. The component might be a section of pigtail,
a board, a flange, or an attenuator.
Application
The replacement method is used to resolve problems on external equipment, such as optical
fibers, flanges, client equipment, and power supply equipment. It is also used to handle faults
on boards or modules at a single station. Figure 3-8 and Figure 3-9 provide two examples.
The advantage of the replacement method is that the fault can be located within a small range,
and is a relatively simple procedure for maintenance personnel to perform. This method requires
that spare components be available. In addition, the operator must exercise caution when
performing operations. For example, when a board is being inserted or removed, careless
operations that could damage board components should be avoided.
As shown in Figure 3-8, when the OTU or tributary board reports an R_LOS alarm on the client
side but the receive module of the client equipment reports no alarm, swap pigtails A and B. If
the R_LOS alarm persists, the transmit module of the client side or the receive module of the
OTU or tributary board is faulty. If the R_LOS alarm is cleared but the client equipment reports
an R_LOS alarm, pigtail A is faulty.
CAUTION
Before performing the test, disable the automatic laser shutdown (ALS) function of the lasers
on the OTU or tributary board and the client equipment.
DEMUX OA OA MUX
Replacement
OTU Station A (OTM) Station B (OTM)
As shown in Figure 3-9, when the OTU or tributary board reports an R_LOS alarm but the
receive module of the client equipment reports no alarm, replace the original OTU or tributary
board with a backup OTU board. Observe the alarms on the OTU board and client equipment.
If the alarm is cleared after the replacement, the original OTU board is faulty and needs to be
repaired. The following procedures can be used to narrow down the possible cause of the OTU
failure:
l If it is suspected that one of the tributary board channels or some a tributary board is faulty,
change the timeslot configuration to switch services to another channel or tributary board.
l If it is suspected that a slot is faulty, change the slot configuration.
l If it is suspected that a VC-4 channel is faulty, switch the time slot to another VC-4 channel.
Figure 3-10 shows a PRBS application. A local board with the PRBS test function sends PRBS
codes and analyzes the PRBS codes looped back from the peer end. By comparing the
loopbacked PRBS codes with the PRBS codes that should be received according to the
theoretical calculation, the local board determines whether equipment and the transmission line
are normal.
WDM
network
CAUTION
l When a PRBS test is in progress, only query operations can be performed, no configurations
can be delivered to involved boards, the involved boards cannot carry any services, and the
original services on the boards will be interrupted.
l The LTX board does not support a PRBS test on multiple optical ports at the same time.
NOTE
l The PRBS test function is targeted for use during deployment and fault location. After deployment and
fault location, users must set PRBS Test Status to Disabled.
l PRBS codes vary according to the client-side service types. Therefore, to perform a client-side PRBS
test, users must ensure that the client-side service types for the tester board and auxiliary board are the
same.
The PRBS test is applicable to two networking modes, as provided in Table 3-1.
For a network, a client-side PRBS test covers a larger area than a WDM-side PRBS test, as
illustrated in Table 3-1. If a board on which a PRBS test has been started does not receive the
PRBS test signals that the board has sent, the board reports a PRBS_LSS alarm.
For information about how to complete PRBS configurations, see Configuring PRBS Test on
the Meter Board in Commissioning Guide.
For the boards that support the PRBS function, refer to Basic Functions of OTUs, OTN Tributary
Boards, OTN Line Boards and Packet Service Boards in Hardware Description.
WDM WDM
side side
Tester board: generates PRBS test signals and monitors the loopbacked PRBS test signals
from the remote board. By comparing the transmitted and received PRBS test signals, the
board determines whether the current link and equipment are normal.
Auxiliary board: connects a tester board and the network under test and transparently transmits
the PRBS test signals. On the auxiliary boards at the near end, PRBS Test Status need to be
set to Enabled, only when client-side services are other than OTN services.
When a tributary or line board is used as a tester or auxiliary board, cross-connections need
to be configured to form a service path.
This method is used to resume the service temporarily when no spare board is available for the
replacement, or is used in handling the problem of pointer justification. The maintenance
personnel may not find this method convenient. In addition, save the original configuration
before you use this method. Record the steps used to locate the fault.
l If some path of a tributary board or some tributary board is suspected to be faulty, modify
the timeslot configuration to shift the payload to other path or tributary board.
l If a certain slot is suspected to be faulty, change the slot configuration.
l If a VC-4 is suspected to be faulty, shift the traffic to another VC-4.
l During the upgrade or expansion, if you are unsure about the new configuration, you can
re-load the original configuration for confirmation.
l To modify the configuration in case of pointer justification, modify the tracing direction
and the reference source of the clock.
Modifying the timeslot configuration, however, does not help to locate the faulty point or faulty
board, for example, a line board, tributary board, cross-connect board or backplane. In this case,
use the replacement method or the loopback method to further locate the fault. This method is
applicable in the preliminary process of locating faults when spare boards are not available.
Other service channels or slots are used to resume the service temporarily.
If a test meter or instrument is unavailable on site, the board receiving Ethernet services can be
used to send test frames. Test frames help operators commission services and locate faults during
network deployment or maintenance.
NOTE
The test frame function complies with the internal standard of Huawei.
When a test frames is in progress, only query operations can be performed, no configurations can be
delivered to involved boards, the involved boards cannot carry any services, and the original services on
the boards will be interrupted.
Application
A network can be into divided into access networks and a transmission network. On the access
networks, boards on OptiX OTN products are interconnected with client-side devices; on the
transmission network, boards on OptiX OTN products are interconnected with each other on the
WDM side.
Figure 3-11 and Figure 3-12 shows how to use test frames to assess the connectivity of network
links.
For OptiX OTN products, Ethernet boards with the test frame function send test frames in either
of the following modes: GFP test frame and Ethernet test frame.
l GFP test frame: send test frames using GFP management frame for GE and 10GE service.
The local NE compares the number of test frames transmitted with the number of response
frames received by the local board to detect network connectivity.
l Ethernet test frame: directly send test frames at MAC layer of 40GE and 100GE service.
The local NE compares the number of test frames transmitted with the number of test frames
received by the local board to detect network connectivity.
Test Frame
MAC Internal Logical Internal Logical MAC
Port Port
Board A Response Frame Board B
Upward
Board A Board B
Downward
2
1 The client-side port on
board A transmits an After receving the Ethernet test
Ethernet test frame to frame, board B encapsulates it into
the client-side port of an OTN signal and transmits the
board B. signal to board C.
3
NOTE
l Upward: The client-side port on board A generates an Ethernet test frame and send it to the WDM-side on
board B.
l Downward: The client-side port on board A generates an Ethernet test frame, send it to the WDM-side on
board C by the client-side port on board B.
l Application 1 of GFP test frames: In Figure 3-11, three counters monitor test frames
response frames. The first monitors the number of test frames transmitted by board A in
step 1, the second is the number of response frames transmitted by board B in step 2, and
the third is number of test frames received by board A in step 3. Based on service flows, if
the number of test frames transmitted by board A is equal to that of response frames received
by board A, the transmission link is normal. If the two numbers are different, packet loss
is occurring along the transmission link, indicating a fault on the link.
l Application 2 of ethernet test frames (for 40GE/100GE service): In Figure 3-12, two
counters monitor test frames response frames. The first monitors the number of test frames
transmitted by board A in step 1, the second is number of test frames received by board A
in step 2. Based on service flows, if the number of test frames transmitted by board A is
equal to that of test frames received by board A, the transmission link is normal. If the two
numbers are different, packet loss is occurring along the transmission link, indicating a
fault on the link.
For test frame configurations, refer to Configuring Test Frames in the Supporting Tasks.
Availability
Table 3-2 lists the boards that support the test frame function on OptiX OTN products.
Table 3-2 Boards that support the test frame function on OptiX OTN products
a: For FE service, the board supports the test frame function only when the serviec is mapping
with GFP-F.
b: For GE service, the board supports the test frame function only when the serviec is mapping
with GFP-T.
c: For 10GE LAN service, the board supports the test frame function only when Port Mapping
(WDM Interface) is set to MAC Transparent Mapping (10.7 G) or MAC Transparent
Mapping (10.7 G) support 1588.
d: The TN12LSC only support downward ethernet test frame.
When using the test frame function, take the following precautions:
l For OTU boards, except for TOM boards with tributary-line mode, test frames are
independent of electrical cross-connections. The receive board can receive test frames if
the transmission link between the transmit and receive boards is connective.
l When electrical cross-connections function from a tributary board to the WDM-side port
on an OTU board with the test frame, the OTU board processes GFP management frames.
Therefore, the tributary board cannot send or receive test frames.
Various performance statistics and Ethernet port data need to be managed. RMON is a standard
monitoring specification that enables various network monitors and console systems to exchange
network-monitoring data and create port-statistics reports. To meet different network
requirements, RMON provides flexible detection modes and control mechanisms. RMON also
provides error diagnosis, planning, and performance event monitoring of the entire network.
For details, refer to Performance Event List of RMON Function in the Alarms and Performance
Events Reference.
Application
This topic uses Ethernet convergence service as an example to describe how to locate data service
faults using the RMON function. As shown in Figure 3-13, there is an Ethernet service from
station A to station C. Ethernet boards 1, 2, and 3 are used on stations A, B, and D, respectively.
Figure 3-14 shows the service connections. The data service from Router 1 to Router 2 is
abnormal. In this example, the service from Router 1 to Router 2 is interrupted.
iManager U2000
Ethernet
Board 1
A
Router 2 Router 3
B D
Ethernet Ethernet
Board 2 Board 3
C
VCTRUNK1
The RMON function is enabled to remotely monitor the service between stations C and A. To
check the service performance of the Ethernet service board at the transmit end (station A) you
can query the RMON performance of the corresponding Ethernet service board at station C.
When service is interrupted or signals are degraded, diagnose the fault as follows:
1. Set the RMON performance monitoring parameters for the Ethernet port, including the
monitored objects (ports on the Ethernet board) and the monitoring period of the
performance event.
2. View the historical group performance and statistical group performance of the Ethernet
ports, and query the performance of PORT1 on Ethernet boards 1, 2, and 3 for a specific
period.
3. Analyze RMON performance and check whether the data is normal.
l When the PAUSE frame is found in the statistics report of the monitored port, use the
ping function to check whether the fault is occurring on the link between Huawei
equipment and the interconnected data communication equipment.
l If the FCS error, collision, or fragment is found in the statistics report of the monitored
port, refer to the procedure of handling the "the RMON performance value is above the
upper limit performance"event.
l If the problem is related to the maximum transmission unit (MTU), for example, the
set maximum transmission unit (MTU) is smaller than expected, change it.
4. Check whether the sum of the traffic on PORT1 of Ethernet boards 2 and 3 is equal to the
traffic on PORT1 of Ethernet board 1.
l If the sum of the traffic on PORT1 of Ethernet boards 2 and 3 is equal to the traffic on
PORT1 of Ethernet board 1, the fault may be located at the interconnection point
between Huawei equipment and the customer equipment instead of on Ethernet boards
1, 2, and 3. Use the ping function to check the link between Huawei equipment and the
non-Huawei equipment.
l If the sum of the traffic on PORT1 of Ethernet boards 2 and 3 is not equal to the traffic
on PORT1 of Ethernet board 1, the fault is occurring on Ethernet board 1, 2, or 3. Check
whether the traffic on PORT1 on Ethernet board 1, 2, or 3 is equal to the sum of the
traffic on the corresponding VCTRUNK ports to locate the specific board where the
fault (congestion or signal degradation) is occurring.
5. Use the ping function to check whether the fault occurs on the link between Huawei
equipment and the interconnected data communication equipment.
For RMON configurations, refer to Configuring the RMON in the Supporting Tasks.
Application
You can configure port mirroring to perform packet monitoring, daily maintenance and in-
service commissioning. Port mirroring has the following features:
l Port mirroring applies to online fault diagnosis. It mirrors the traffic from one or more ports
to another port, and then an analyzer is used for fault diagnosis.
l After port mirroring is used, traffic can be monitored in real time using an analyzer.
Port mirroring implements fault diagnosis by mirroring and analyzing port data and without
affecting services. This method, which takes a long time and demands high maintenance skills,
is generally used by experienced maintenance personnel.
Mirroring Monitoring
port port
To monitor the data in different directions, port mirroring can be performed in the ingress
direction and in the egress direction.
l In the ingress direction
Also in the upstream direction. The equipment duplicates the packets received from the
mirroring port to the observing port, and then transmits the packets from the observing port
to the Ethernet tester.
l In the egress direction
Also in the downstream direction. The equipment duplicates the packets transmitted by the
mirroring port to the observing port, and then transmits the packets from the observing port
to the Ethernet tester.
NOTE
For port mirroring configurations, see Configuring Port Mirroring in Supporting Tasks.
Availability
Table 3-3 lists the boards that support the port mirroring function.
Table 3-3 Boards that support the port mirroring for the OptiX OTN products
Related Information
For TBE, L4G, LEX4, LEM24, TEM28:
Mirror Listener Port For example: PORT4 l After the mirroring function of the
port is configured, you can
monitor all the mirrored ports by
analyzing the packets at the
mirroring port only. As a result,
you can easily manage the ports.
l Mirrored Port indicates the port
that sends the packets copied from
Mirrored Upstream Port and
Mirrored Downstream Port.
l Mirrored Port cannot be set to a
port that carries any service.
Uplink Listened Port For example: PORT3 Sets the uplink listened port.
Mirrored Upstream Port can be a
PORT or a VCTRUNK. As a PORT,
the port copies the packets that it
receives; as a VCTRUNK, the port
copies the packets that it transmits.
Mirrored Port sends the packets
copied from Mirrored Upstream
Port.
Downlink Listened For example: PORT5 Sets the downlink listened port.
Port Mirrored Downstream Port can be
a PORT or a VCTRUNK. As a PORT,
the port copies the packets that it
transmits; as a VCTRUNK, the port
copies the packets that it receives.
Mirrored Port sends the packets
copied from Mirrored Downstream
Port.
Mirrored Upstream Port and Mirrored Downstream Port indicate the ports that copy
packets for Mirrored Port.
The transmit direction and receive direction mentioned in this section are related to the local
NE.
Mirror Source Function For example: Shelf0 Indicates the source mirroring port.
Point (subrack)-2-54EX2-1
(PORT-1)
For any problems that prove complex for technicians to identify or resolve, contact Huawei
technical support and report the relevant equipment data. As shown in Table 3-6, the equipment
data to be collected includes the version information, electronic labels, basic configurations,
service configurations, current and historical alarm and performance event data, equipment
running logs, operation logs, security logs, and equipment running status.
N Type Description
o.
1 NGWDM_ Collects the version information about NEs and boards, board dump
Board_Data information, service performance parameters and alarms, and chip
register information on boards.
2 40G_Modul Collects the performance status and key component parameters of the
e_Basic_Dat 40G optical and electrical modules installed on the boards on the current
a NE. Such data includes the module input and output optical power
specifications, service status, alarms, and key component parameters.
N Type Description
o.
4 NGWDM_ Collects time and clock information to check the tracing status of the
Clock_Data current NE. Such information includes the IEEE 1588-related alarms,
external time sources, line time sources, line time source tracing jitter,
synchronization clock alarms, and lost external clock sources.
1 NGWDM_P Collects the package loading data of the current NE. Such data includes
0 atch_Data the package loading alarms, status, logs, and patch loading and operating
data.
1 NGWDM_ Collects board black box data in the log files from the platform in file
1 BlackBox_ transfer protocol (SFTP) mode.
Data
1 OTN_Ason Collects data about ASON services, TE links, current alarms, historical
2 _Data alarms, and boards. Such data must be collected when an ODUk service
in the OTN ASON domain is faulty (for example, service rerouting fails
or service interruption alarms are generated), or an ODUk ASON link is
faulty.
N Type Description
o.
1 WDM_Aso Collects data about ASON services, TE links, current alarms, historical
3 n_Data alarms, and boards. Such data must be collected when an OCh service in
the WDM ASON domain is faulty (for example, service rerouting fails
or service interruption alarms are generated), or an OCh/OMS ASON
link is faulty.
1 SDH_Ason_ Collects data about ASON services, TE links, current alarms, historical
4 Data alarms, and boards. Such data must be collected when an SDH service in
the SDH ASON domain is faulty (for example, service rerouting fails or
service interruption alarms are generated), or an SDH ASON link is
faulty.
1 Collect_HIS Collects the alarms of the boards that support the HISI chip on the current
5 I_Chip_Dat NE.
a
1 TDM Collects data about the time division multiplexing (TDM) feature in one-
9 click mode.
2 WDM Collects data about the wavelength division multiplexing (WDM) feature
0 in one-click mode.
2 PER Collects the current and historical performance data in one-click mode.
3
For data collection configurations, refer to Collecting Fault Data in the Supporting Tasks.
Fault location using the Transport Packet Assist (TP-Assist) simplifies operation and
maintenance (O&M) of packet services and improves O&M efficiency.
Near-end IP Ping
PSN
Router RNC
NodeB NE1 NE2
Near-end IP Ping
Far-end IP Ping
NOTE
Switches and routers support both strict mode and non-strict mode.
l In strict mode, switches and routers learn MAC addresses only upon receiving ARP responses.
l In non-strict mode, switches and routers learn MAC addresses upon receiving ARP request packets.
OptiX OSN equipment enabled with the IP ping function does not send ARP responses. If switches or
routers use the strict mode, IP ping will fail. Therefore, if the IP ping function is initiated by a switch or
router, use the non-restrict mode for the switch or router.
When the IP ping function is initiated at a PC that is directly connected to an NE, the packets sent by the
PC do not carry VLAN tags. To test a service with VLAN tags on the NE, set the tag attribute of the service
port to Access or Hybrid, and set the default VLAN ID of the port the same as the service VLAN ID.
The following describes how to configure the near-end IP ping function and far-end IP ping
function using a NodeB as the initiator of the IP ping function.
Table 4-1 Mapping between port encapsulation modes and TPIDs in IP Ping packets
Null 0x8100
802.1Q 0x8100
Near-End IP Ping
Step 1 Right-click NE1 in the Main Topology and choose NE Explorer from the shortcut menu.
Step 2 In the NE Explorer, click NE1 and choose Configuration > Packet Configuration > IP Ping
Test.
Step 3 Set Enable, VLAN ID, VLAN Priority, IP Address, IP Mask, Next Hop IP Address, and
Next Hop IP Mask for the target UNI.
NOTE
l The IP address of the UNI must be on the same network segment with the IP address of the NodeB, to
ensure that the near-end IP ping function can operate properly.
l The next hop IP address of the UNI is the IP address of the port connecting straightly with the NodeB.
----End
Far-End IP Ping
Step 1 Right-click NE1 in the Main Topology and choose NE Explorer from the shortcut menu.
Step 2 In the NE Explorer, click NE1 and choose Configuration > Packet Configuration > IP Ping
Test.
Step 3 Click Add and set the proxy IP address in the Proxy IP Address List area.
NOTE
In far-end IP ping tests, the configured proxy IP address ensures that IP ping packets are forwarded through
the proxy IP address.
Step 5 Set Enable, VLAN ID, VLAN Priority, IP Address, and IP Mask for a UNI of NE2 by
referring to the configuration steps applicable to NE1.
NOTE
l The UNI IP address configured on NE2 must be the same as the proxy IP address configured on NE1.
l Before using the far-end IP ping function, ensure that the IP ping function has been enabled on both
the near-end port and far-end port.
----End
Application Scenario
The E-LAN service loopback detection function applies to the following native Ethernet
services:
Context
l As shown in Figure 4-2, E-LAN services are configured for the network. A UNI on NE1
initiates a loopback detection packet on the specified VLAN. The loopback detection packet
takes the same path as forwarded service packets.
l If the UNI receives a loopback detection packet whose destination MAC address is the
same as the source MAC address of the loopback detection packet sent by the UNI, a
loopback occurs. You can use the automatic disabling function to disable loopback services.
If the UNI does not receive a loopback detection packet within the specified duration, a
service loopback does not occur. The duration ranges from 3s to 10s and the default value
is 3s.
NOTE
The automatic disabling function disables only the looped back service. The UNI remains operational.
Therefore, disabling of the looped back service does not affect other services provided by the UNI.
l You can perform E-LAN service loopback detection for a maximum of 20 VLANs at a
time. You can perform two consecutive loopback detection operations for a VLAN at an
interval of 10 or more seconds.
Loopback
detection NE2
packet
NE1 PSN
NE3
Procedure
Step 1 Right-click NE1 in the Main Topology and choose NE Explorer from the shortcut menu.
Step 2 In the NE Explorer, click NE1 and choose Configuration > Packet Configuration > Ethernet
Service Management > E-LAN Service.
Step 3 Select the target E-LAN service and click the Loopback tab.
Step 4 Click UNI and set the port that initiates the loopback packet.
NOTE
This task uses a UNI as an example. The port varies according to services.
Step 6 Set Vlans/CVLAN, Packet Timeout Period, and Disable Service When Loopback is
Detected. Then click Start.
Step 8 Optional: If the loopback services does occur, disable the looped back service in time to help
prevent network congestion.
----End
Prerequisites
You must be an NM user with NE administrator authority or higher.
Procedure
Step 1 Search for services.
6 7
8
Detect a specified service.
Service information is
displayed.
9
Step 4 Optional: You can also diagnose the PWE3 services as follow:
1. Choose Service > PWE3 Service > Manage PWE3 Service from the main menu.
2. In the dialog box that is displayed, set filter vaule. Click Filter to check PWE3 services.
3. Select the PWE3 service to be deployed, right-click the PWE3 services, and choose
Diagnose Service from the shortcut menu.
----End
5 Service Interruptions
Many factors can cause service interruptions, such as bit errors, line interruption, decrease of
optical power, failed interconnection of equipment.
This topic describes the symptoms, impact, and possible causes of a service interruption caused
by faulty fibers, cables, and or connectors. In addition, this topic describes the required tools,
precautions, and procedures for rectifying the fault.
Symptom
The OA (Station B) reports MUT_LOS alarm and all OTUs (Station B) report alarm R_LOS, ,
OTUk_LOF, OTUk_LOM alarm. All services are interrupted.
DEMUX MUX
OSC OSC
Station A (OTM) Station B (OTM)
Impact on System
All services are interrupted.
Possible Causes
l Cause 1: The fiber is cut off.
l Cause 2: The optical power is abnormal.
l Cause 3: The optical amplifying board or the optical cable is faulty.
l Cause 4: The demultiplexing/multiplexing board at station A or B is faulty.
l Cause 5: The ring fiber adapter from the equipment to the ODF is damaged.
Procedure
l Cause 1: The system provides an OSC board, the optical amplifier board reports a
MUT_LOS alarm, and the optical supervisory board reports a OSC_LOS alarm. In this
case, suspect that the optical cable is cut off.
1. Check the fiber jumper between the optical distribution frame (ODF) and WDM
equipment. If the fiber jumper is dirty, clean the fiber jumper; if it is faulty, replace
the fiber jumper.
2. If the fiber jumper is good, the optical cable is faulty. Clear the fault.
l Cause 2: The system provides an OSC board, the optical amplifier board reports a
MUT_LOS alarm, and the OSC board reports no OSC_LOS alarm. It indicates that the
optical power is abnormal.
1. Check the fiber jumper between the FIU and optical amplifier board. If the fiber jumper
is dirty, clean the fiber jumper.
2. Check the optical power change of the OSC. If the received optical power of the OSC
declines, check whether the attenuation of the optical cable is excessive. If yes, the
optical cable is faulty. Clear the fault.
3. If the optical cable between station A and B is good, query the optical power on the
U2000 at station A.
– If the input optical power of the optical amplifier board at station A is normal but
the output optical power is too low, replace the optical amplifier board at station
A.
– If the input optical power of the optical amplifier board at station A declines and
thus results in a low output optical power, check the upstream against the service
signal flow to locate the fault.
4. If the optical cables of the faulty station B are good, check whether the input optical
power of the optical amplifier board is normal. If yes, and the output optical power is
too low, replace the board.
l Cause 3: The system provides no OSC board, and the optical amplifier board reports a
MUT_LOS alarm. It indicates that the optical amplifying board or the optical cable is faulty.
1. Refer to Testing Optical Power by Using an Optical Power Meter. Check whether the
input optical power of the optical amplifier board at station B is normal. If yes, replace
the optical amplifier board at station B.
2. Refer to Testing Optical Power by Using an Optical Power Meter. Check whether the
input and output optical power of the optical amplifier board at station A is normal.
If the input optical power is normal but the output optical power declines, replace the
optical amplifier board at station A.
3. Check the fiber jumper between the ODF and WDM equipment. If the fiber jumper
is dirty, clean it; if it is faulty, replace it.
4. If the fiber jumper is good, the optical cable is faulty. Clear the fault.
l Cause 4: The MUX/DEMUX board at station A or B is faulty.
1. Query failure alarms of the MUX/DEMUX board at station A or B. If alarms are found,
replace the faulty board.
NOTE
Before you replace the M40V or D40V board, record the attenuation of the internal VOA of
each channel.
l Cause 5: The ring fiber adapter from the equipment to the ODF is damaged.
1. Check the ring fiber adapter between the ODF and WDM equipment. If the ring fiber
adapter is faulty, replace it.
----End
Symptom
The client-side RX interface of the OTU or tributary board reports alarms such as R_LOS,
R_LOF, OTUk_LOF at station A, and the client-side service is interrupted.
Client Client
equipment OTU1 OTU1 equipment
MUX MUX
DEMUX DEMUX
OA OA
OTUn OTUn
Station A (OTM) Station B (OTM)
Impact on System
Single channel service is interrupted on the client side.
Possible Causes
l Cause 1: The output optical power of the client equipment that is interconnected to the
OTU board is abnormal.
l Cause 2: The fiber jumper between the OTU board and client equipment is faulty.
l Cause 3: The optical interfaces of the OTU board are dirty.
l Cause 4: The OTU board is faulty.
l Cause 5: The client equipment is faulty.
l Cause 6: The service type, wavelength and mode that configured on the board 1 are different
from those of the client equipment, and therefore services are interrupted.
Procedure
l Cause 1: The output optical power of the client equipment that is interconnected to the
OTU board is abnormal.
1. Refer to Testing Optical Power by Using an Optical Power Meter. Check whether the
output optical power of the client equipment is normal. If not, remove the fault of the
client equipment.
l Cause 2: The fiber jumper between the OTU board and client equipment is faulty.
1. If the output optical power of the client equipment is normal, check whether the input
optical power of the OTU board is normal. If not, clean the fiber jumper or replace
the fiber jumper.
l Cause 3: The optical interfaces of the OTU board are dirty.
1. If the optical interfaces of the OTU board are dirty, blow them with compressed gas
to clean them.
l Cause 4: The OTU board is faulty.
1. If the input optical power of the OTU board is normal, but the system reports an alarm
indicating that the optical power is abnormal or an alarm related to the optical module,
replace the client-side optical module on the OTU board.
2. If the fault still exists, replace the board.
l Cause 5: The client equipment is faulty.
1. Remove the fault of the client equipment.
l Cause 6: The service type, wavelength and mode that configured on the board 1 are different
from those of the client equipment, and therefore services are interrupted.
1. Query and set the client service type, wavelength and mode of the board 1. Make sure
that the service type, wavelength and mode of the board 1 board are the same as those
of the client equipment.
NOTE
For GE service, query and set the auto-negotiation modes of the boards. Make sure that the auto-
negotiation modes are the same.
----End
Symptom
The WDM-side IN interface of the OTU, or tributary board and line board reports alarms such
as R_LOS, R_LOF, R_OOF and OTUk_LOF, and one channel of services is interrupted.
Client Client
equipment OTU MUX OA OA DEMUX OTU equipment
DEMUX OA OA MUX
Station A Station B
Signal Flow
Impact on System
One channel of the WDM-side services of an NE is interrupted.
Possible Causes
l Cause 1: The OTU, or tributary board and line board in the opposite station (station A) is
faulty or the fiber jumper in the opposite station between the OTU board and MUX is dirty.
l Cause 2: The fiber jumper between the DEMUX and OTU, or tributary board and line board
is dirty or faulty.
l Cause 3: The loopback testing on the OTU, or tributary board and line board in the local
station and the optical power of OTU, or tributary board and line board is normal. In this
case, the FEC mode of the OTU, or tributary board and line board in the local station and
the opposite station might be inconsistent.
l Cause 4: The interconnection services type, wavelength and mode do not match.
l Cause 5: The OTU, or tributary board and line board at the local station (station B) is faulty.
l Cause 6: New service wavelengths are added to the network and the new service
wavelengths conflict with existing service wavelengths.
Procedure
l Cause 1: The OTU, or tributary board and line board in the opposite station (station A) is
faulty or the fiber jumper in the opposite station between the OTU, or tributary board and
line board and MUX is dirty.
1. Check whether the input optical power on the client side and the output optical power
on the WDM side of the OTU, or tributary board and line board in the opposite station
are normal. If either optical power is abnormal, replace the corresponding board.
2. Check the input optical power of the optical amplifier board in the opposite station
(station A) to determine whether the fiber jumper between the OTU, or tributary board
and line board and MUX is dirty or faulty. If the fiber jumper is dirty, clean the fiber
jumper; if it is faulty, replace the fiber jumper.
l Cause 2: The fiber jumper between the DEMUX and OTU, or tributary board and line board
is dirty or faulty.
1. Check whether the input optical power of the OTU, or tributary board and line board
is normal. If not, clean or replace the fiber jumper between the DEMUX and OTU, or
tributary board and line board.
l Cause 3: The loopback testing on the OTU, or tributary board and line board in the local
station and the optical power of OTU, or tributary board and line board is normal. In this
case, the FEC mode of the OTU, or tributary board and line board in the local station and
the opposite station might be inconsistent.
1. Refer to Querying the FEC Type to query the FEC states of the two interconnected
boards on the U2000. If the two modes differ, enable the FEC to ensure mode
matching.
l Cause 4: The interconnection services type, wavelength and mode do not match.
1. Check and ensure that the interconnection services type, wavelength and mode match.
NOTE
If the services on the WDM side do not match, there is only LOOP_ALM alarm reported in
the loopback operation, but there will be OTUk_LOF or R_LOF if the loopback operation is
canceled.
l Cause 5: The OTU, or tributary board and line board at station B is faulty.
1. Check whether the input optical power of the OTU, or tributary board and line board
is normal. If yes, the OTU, or tributary board and line board in the local station is
faulty. Replace the pluggable optical module or Replace the board.
l Cause 6: New service wavelengths are added to the network and the new service
wavelengths conflict with existing service wavelengths.
1. Check whether new services are added to the network. If new services are added,
check whether the new service wavelengths conflict with the existing service
wavelengths. Then, change the new service wavelengths so that they do not conflict
with the existing service wavelengths.
----End
Symptom
Human misoperation results in service interruptions.
Impact on System
The services carried on the OTU, or tributary board and line board are interrupted.
Possible Causes
l Cause 1: When you disable the FEC function of an OTU, or tributary board and line board,
the equipment does not report any alarm, whereas the OTN services borne by the equipment
are interrupted.
l Cause 2: The FEC mode of any OTU, or tributary board and line board is manually changed,
causing the services to be interrupted.
l Cause 3: Hardware loopback or software loopback is set manually.
Procedure
l Cause 1: When you disable the FEC function of an OTU, or tributary board and line board,
the equipment will not report any alarm, but the OTN services are interrupted.
1. Querying the FEC Mode, the equipment does not report any alarm when you enable
or disable the FEC function of the OTU, or tributary board and line board. The engineer
must pay attention to this during maintenance. Make sure to enable the FEC function
in application.
l Cause 2: The FEC mode of any OTU, or tributary board and line board is manually changed,
causing the services to be interrupted.
1. Check whether the FEC modes of the two interconnected boards are different.
l Cause 3: Hardware loopback or software loopback is set manually.
1. Check whether hardware loopback is set. If yes, release the hardware loopback.
2. On the U2000, check whether software loopback is set on the service channel
manually. If yes, release the software loopback.
----End
Symptom
A large number of boards on the device of a certain site are reset simultaneously and the services
are interrupted.
Impact on System
Power supply failures may cause service interruptions.
Possible Causes
l Cause 1: The power supply of the board fails.
Procedure
l Cause 1: The power supply of the board is faulty. As a result, services are interrupted.
1. Query the current alarms on the U2000.
If... Then...
The NE does not report the Check whether the fault is due to other
POWER_ABNORMAL, causes.
POWER_FAIL alarm
NOTE
If... Then...
The power cable connectors are not Fasten the power cable connectors by
connected properly using a screwdriver or replace the power
cable connectors, and proceed to the
next step.
Power cable connectors are connected Check whether the fault is due to other
properly and the contact is good causes.
2. Check whether services recover. If the services do not recover, check whether the fault
is due to other causes.
l Cause 3: The external power supply is off or the power fluctuates abnormally. As a result,
services are interrupted.
1. Switch off the DC PDU. Connect the positive pole of the multimeter to the NEG(-)
terminal of the DC PDU and the negative pole of the multimeter to the RTN(+)
terminal. Measure the voltage between the RTN(+) and NEG(-) terminals and check
whether the voltage meets the requirement. For details about voltage requirements,
refer to the Hardware Description.
If... Then...
The voltage between the RTN and -48V Proceed to the next step.
DC terminals does not meet the
requirements
The voltage between the RTN and -48V Check whether the fault is due to other
DC terminals meets the requirements causes.
2. Contact the power supply engineers to rectify the fault of the external power supply.
l Cause 4: The power board of the device is faulty. As a result, services are interrupted.
1. Replace the power board of the device.
2. Check whether services recover. If the services do not recover, check whether the fault
is due to other causes.
l Cause 5: The DC PDU is faulty. Abnormality of the power supply system causes the board
reset.
1. Replace the DC PDU.
2. Check whether services recover. If the services do not recover, check whether the fault
is due to other causes.
----End
Symptom
A large number of boards on the device of a certain site are reset simultaneously and the services
are interrupted.
Impact on System
The grounding fault causes service interruptions.
Possible Causes
l Cause 1: The PGND and BGND grounding is poor, and the grounding resistance is higher
than 10 ohms or the potential difference between BGND and PGND is higher than 0.5 V.
l Cause 2: The PGND and BGND share the same grounding with the AC neutral wire.
PGND means protection ground and BGND means power ground (also called working ground).
NOTE
Use the multimeter to check whether the grounding resistance of the BGND and PGND meet the
requirements. Or, you can use a proper meter to check whether the waveform of the interconnected signal
is distorted.
Procedure
l Cause 1-3: The device grounding is improper. As a result, services are interrupted.
1. Check whether the ground bar in the telecommunications room is properly grounded.
If the grounding is proper, proceed to the next step; otherwise, ground the ground bar
properly.
2. Check whether the contact between the cabinet of the transmission device and the
ground bar in the telecommunications room is good. If the contact is good, proceed
to the next step; otherwise, ground the cabinet of the transmission device properly.
3. Check whether the contact between the cabinet and the front door/side cover of the
cabinet is good. If the contact is good, proceed to the next step; otherwise, ground the
front door/side cover of the cabinet properly.
4. Check whether the contact between the subrack and the cabinet is good. If the contact
is good, proceed to the next step; otherwise, ground the subrack properly.
5. Check whether the DDF and ODF are properly grounded. If the DDF and ODF are
properly grounded, proceed to the next step; otherwise, ground the DDF and ODF
properly.
6. Check whether the NMS device and various power consumption devices are properly
grounded. If these devices are properly grounded, proceed to the next step; otherwise,
ground these devices properly.
7. Check whether the grounding of the interconnection devices is joint grounding.
----End
Symptom
A large number of boards on the device of a certain site are reset simultaneously and the services
are interrupted.
Impact on System
Unfavorable environmental conditions can cause a service interruption.
Possible Causes
l Cause 1: The temperature and humidity of the environment have caused equipment damage.
l Cause 2: A strong external interference source exists.
l Cause 3: Rodent animals damage equipment.
Procedure
l Cause 1: The temperature and humidity of the environment have caused equipment damage.
As a result, services are interrupted.
1. Check whether the temperature and humidity of the environment meet the
requirements.
If... Then...
The temperature and humidity of the Check and adjust the air conditioner in
environment do not meet the the telecommunications room to make
requirements sure that the temperature and humidity
of the telecommunications room are in
the proper range. Check whether
services recover. If the services do not
recover, check whether the fault is due
to other causes.
If... Then...
The NE does not report the temperature Check whether the fault is due to other
threshold-crossing alarms causes.
TEMP_OVER
If... Then...
If... Then...
If... Then...
If... Then...
4. Check whether a hand-held mobile communication device is in use near the device.
If... Then...
If... Then...
There are rodent animals or their Remove the rodent animals and their
excrement inside the cabinet excrement, replace any broken cable,
and install a rodent-proof mesh in the
cabinet. Check whether services
recover. If the services do not recover,
check whether the fault is due to other
causes.
There are no rodent animals or their Check whether the fault is due to other
excrement inside the cabinet causes.
----End
Related Information
Table 5-1 lists the requirement for the electromagnetic interference.
Symptom
Services of a certain site are interrupted.
Impact on System
Faulty fibers, cables, or connectors can cause a service interruption.
Possible Causes
l Cause 1: The fibers are broken.
l Cause 2: Attenuation of the fibers and the fiber adapter is too high.
l Cause 3: The cable connectors are connected improperly or incorrectly, or are not
connected.
Precautions
DANGER
l Avoid direct eye exposure to laser beams launched from the optical interface and the fiber
jumper connector.
l When removing fibers from the optical interface, make sure that the proper optical fiber is
removed; otherwise, it may cause service interruption to a larger area.
Procedure
l Query alarms on the U2000 and clear the alarms according to the reported information.
1. Check the current alarms on the U2000.
If... Then...
The NE does not report the R_LOS, Check whether the fault is due to other
R_LOF or IN_PWR_LOW alarm causes.
If... Then...
The fibers are detected as broken Replace the fibers. Check whether
services recover. If the services do not
recover, check whether the fault is due
to other causes.
The fibers are not detected as broken Check whether the fault is due to other
causes.
l Cause 2: Attenuation of the fibers and the fiber adapter is too high. As a result, services are
interrupted.
1. Measure the transmit optical power of the board in the upstream station and the receive
optical power of the board in downstream station.
If... Then...
The receive optical power of the board Proceed to the next substep.
in downstream station is abnormal
2. The difference between the transmit optical power of the board in the upstream station
and the receive optical power of the board in downstream station is the actual optical
power attenuation of the line. Compare the actual optical power attenuation of the line
with the designed value. If the actual attenuation is higher than the designed value,
locate and rectify the fault as follows.
– Check whether the bending radius of the fibers is equal to or larger than
standard. If the bending radius of the fibers is less than standard, re-route the
fibers and make sure that the bending radius is equal to or larger than standard.
– Check whether there are many fiber connectors on the line and whether the
connectors are connected properly. If the connectors are connected improperly,
fasten the connectors with the optical modules.
– Check whether there are aerial fibers on the line, which is easy to be affected by
the weather. If there are aerial fibers on the line, provide necessary protection to
the aerial fibers.
– Check whether the fiber type and attenuation factor of the line are consistent with
the designed ones. If the fiber type and attenuation factor of the line are inconsistent
with the designed ones, replace the fiber.
– Check whether the fiber connectors of the line are dirty. For details, see the
information about checking the fiber connectors. Clean the fiber connectors.
3. Check whether services recover. If the services do not recover, check whether the fault
is due to other causes.
l Cause 3: The connection between the cable and the connector is loose, the cable connectors
are connected incorrectly, or the cable connectors are not connected. As a result, services
are interrupted.
1. Check the cables connection on the board in the downstream station.
If... Then...
The connection between the cable and Connect the cables properly. Check
the connector is loose whether services recover. If the services
do not recover, proceed to the next
substep.
The connection of the cable and the Proceed to the next substep.
connectors is good
2. Check whether the connections of the cables to the board in the downstream station
are correct.
If... Then...
Connections of the cables to the board in Connect the cables correctly. Check
the downstream station are incorrect whether services recover. If the services
do not recover, check whether the fault
is due to other causes.
Connections of the cables to the board in Check whether the fault is due to other
the downstream station are correct causes.
----End
Related Information
G.652D 30 mm
G.657A2 10 mm
G.657B3 8 mm
Note: When fibers are connected to CRPC boards, the bending radius of the fibers must be
greater than or equal to 50 mm to ensure expected transmission performance.
Symptom
During a capacity expansion at a site, after a board is replaced or added, services are interrupted.
Impact on System
Inconsistency of board types causes service interruption.
Possible Causes
l Cause 1: The board types are inconsistent.
Procedure
l Cause 1: The board types are inconsistent. As a result, services are interrupted.
1. Check the current alarms on the U2000.
If... Then...
If... Then...
The NE does not report the Check whether the fault is due to other
WRG_BD_TYPE alarm causes.
----End
Related Information
For any capacity expansion, the type of the board to be replaced or added must be consistent
with that of the original board. Especially, when replacing or adding the cross-connect boards
with the active and standby relation, make sure that the types of the active and standby boards
are the same.
NOTE
A list of compatible boards is available. For details, consult technical support engineers of Huawei.
Symptom
The receive end of the Ethernet board reports the ALM_GFP_dLFD or VCAT_LOA alarm,
indicating that the services are interrupted.
Impact on System
The services in the VCG that reports the alarms are interrupted.
Possible Causes
l Cause 1: The board fails to frame the GFP frame. As a result, services are interrupted.
l Cause 2: During the service data transmission, if the alignment delay of the virtual
concatenation is excessively long, a data frame cannot be formed. Accordingly, it causes
the packet loss and even service interruptions.
l Cause 3: The number of timeslots bound with the VCTRUNK at the downstream station
is inconsistent with the number of timeslots bound with the VCTRUNK at the upstream
station. As a result, services are interrupted.
Procedure
l Cause 1: The board fails to frame the GFP frame. As a result, services are interrupted
1. Query the current alarms on the U2000.
If... Then...
The NE does not report the Check whether the fault is due to other
ALM_GFP_dLFD alarm causes.
If... Then...
The NE does not report the Check whether the fault is due to other
VCAT_LOA alarm causes.
If... Then...
The LCAS function of the VCTRUNK Check whether the fault is due to other
port is enabled causes.
The LCAS function of the VCTRUNK Enable the LCAS function, and then
port is disabled check whether services recover. If the
services do not recover, check whether
the fault is due to other causes.
----End
Symptom
The service on a single Ethernet port of an Ethernet board is interrupted.
Impact on System
The service on a single Ethernet port is interrupted.
Possible Causes
l Cause 1: The configuration is incorrect.
l Cause 2: The fibers, network cables, or optical/electrical modules are faulty.
l Cause 3: The optical module does not match the optical port.
l Cause 4: The board port is faulty.
Procedure
l Cause 1: The configuration is incorrect. As a result, services are interrupted.
1. On the U2000, query the configuration on the board.
If... Then...
2. On the U2000, query whether the board port is configured with loopback.
If... Then...
The board port is configured with a Release the loopback. Check whether
loopback services recover. If the service does not
recover, proceed to the next step.
The board port is not configured with a Proceed to the next step.
loopback
If... Then...
The attributes of the port are incorrect Set the attributes of the port again,
including the tag attributes and whether
to enable the port. Check whether
services recover. If the services do not
recover, proceed to the next step.
The attributes of the port are correct Proceed to the next step.
4. On the U2000, check whether the working mode of the local port is consistent with
the working mode of the opposite port.
If... Then...
The working mode of the local port is Set the local port to the same working
inconsistent with that of the opposite mode as the opposite port. Check
port whether services recover. If the services
do not recover, check whether the fault
is due to other causes.
The working mode of the local port is Check whether the fault is due to other
consistent with that of the opposite port causes.
l Cause 2: The fibers, network cables, or optical/electrical modules are faulty. As a result,
services are interrupted.
1. Query the current alarms on the U2000.
If... Then...
The NE reports the LINK_ERR alarms Handle the LINK_ERR alarms. Check
whether services recover. If the services
do not recover, check whether the fault
is due to other causes.
The NE does not report the Check whether the fault is due to other
LINK_ERR alarms causes.
l Cause 3: The optical module does not match the optical port. As a result, services are
interrupted.
1. Query the current alarms on the U2000.
If... Then...
The NE does not report the Check whether the fault is due to other
LASER_MODULE_MISMATCH causes.
alarm that indicates the mismatch of the
optical module
If... Then...
The Ethernet port receives data, but does Replace the board. Check whether
not transmit data services recover. If the services do not
recover, check whether the fault is due
to other causes.
The Ethernet port receives and also Check whether the fault is due to other
transmits data causes.
----End
Symptom
Services on all Ethernet ports of an Ethernet board are interrupted.
Impact on System
Services on all Ethernet ports are interrupted.
Possible Causes
l Cause 1: The board is offline.
l Cause 2: The configuration is incorrect.
l Cause 3: The ports of the board are faulty.
Procedure
l Cause 1: The board is offline. As a result, services are interrupted.
1. Query the current alarms on the U2000.
If... Then...
The NE reports the BD_STATUS alarm Handle the BD_STATUS alarm. Check
whether services recover. If the services
do not recover, check whether the fault
is due to other causes.
The NE does not report the Check whether the fault is due to other
BD_STATUS alarm causes.
If... Then...
2. On the U2000, query whether the board port is configured with loopback.
If... Then...
The board port is configured with a Release the loopback. Check whether
loopback services recover. If the services do not
recover, proceed to the next step.
The board port is not configured with a Proceed to the next substep.
loopback
3. On the U2000, query the tag attributes of the port on the board.
If... Then...
The attributes of the port are incorrect Re-configure the attributes of the port,
such as the tag attributes and whether to
enable the port. Check whether services
recover. If the services do not recover,
proceed to the next substep.
If... Then...
The attributes of the port are correct Proceed to the next substep.
4. On the U2000, check whether the working mode of the local port is consistent with
that of the opposite port.
If... Then...
The working mode of the local port is Set the local port to the same working
inconsistent with that of the opposite mode as the opposite port. Check
port whether services recover. If the services
do not recover, check whether the fault
is due to other causes.
The working mode of the local port is Check whether the fault is due to other
consistent with that of the opposite port causes.
l Cause 3: The ports of the board are faulty. As a result, services are interrupted.
1. On the U2000, query the data of the Ethernet ports by using the remote monitoring
(RMON) function.
If... Then...
The Ethernet ports receive data, but do Replace the board. Check whether
not transmit data services recover. If the services do not
recover, check whether the fault is due
to other causes.
The Ethernet ports receive and transmit Check whether the fault is due to other
data causes.
----End
NOTE
The following cases are related to the OptiX WDM product series.
Related Cases:
l MC-A1 The OTU_LOF Alarm is Reporting on the OTU at the Downstream Station
l MC-A4 The LOG Board Fails to Interwork With the FDG Board on the Client Sides. The
LOG board reports the R_LOS alarm on the client side. The FDG board reports the
LINK_STATUS alarm.
l MC-A8 The TN11OAU101 at the Transmit End Reports the MUT_LOS Alarm
l MC-A15 The Downstream Optical Amplifier Board Does Not Report MUT_LOS
l MC-A24 Low Optical Power on the Client Side of the OTU Board Leads to R_LOS Alarm
on the Board
l MC-A38 The Service Is Interrupted After the Protection Is Triggered
l MC-A45 During the deployment of Raman, the OPU board reports MUT_LOS
l MC-A52 Faults of End Face of the Fiber Connector Cannot Be Identified
l MC-A67 The GE Port on the Client Side Reports LINK_DOWN Alarm
l MC-A69 Shutdown of the RPC Laser Interrupts SDH-Layer Services in a DWDM Network
l MC-A114 LSX Board to which the 10GE LAN service is input reports the OTU_LOF
alarm due to inconsistent settings of the client-side mode
l MC-A117 Analysis and Handling of the ALM_DATA_RLOS and ALM_DATA_TLOS
Alarms Reported by the LBE
l MC-A118 An R_OOF Alarm Occurs When the LWC Interconnects with Third-Party
Equipment
l MC-A147 Services Are Unavailable Due to Inappropriate FEC Configuration on the 40G
Regeneration Boards, The System Reports An OTU_LOF Alarm
l MC-A188 The LDGF2 Board on the OptiX OSN 1800 Reports an OTU1_LOF Alarm
l MC-A200 Fiber Faults Result in Service Interruption, The TDX Board Reports
ODU1_PM_SSF Alarms
l MC-A206 Service Provisioning Fails on an LHP Network, The OTU2_LOF Alarm Is
Reported
l MC-A207 Inaccurate System Commissioning Causes Poor System Performance
l MC-A208 40G Service Provisioning Fails On a Network, The OTU_LOF And
DCM_INSUFF Alarms Are Reported
l MC-A211 OTUk_LOF Alarm Is Reported Due to Mismatched Mapping Modes of the LSX
Boards in the Case of 10GE LAN Services
l MC-A215 Incorrect Configuration of an Added Wavelength During Expansion Results in
Service Interruption of Another Normal Wavelength on an OTU Board, The Corresponding
OTU Board Reports OTU2_LOF Alarm
l MC-A219 EPL Service Is Interrupted Due to Mismatched Service Modes of the L4G
Boards, Sites Report The R_LOF, ODU5G_PM_AIS, OTU5G_LOF And LINK_ERR
Alarms
l MC-A227 Services Cannot Be Provisioned and The LWX2 Boards Report The R_LOC
Alarm Due to Incorrect Service Rates on LWX2 Board of OptiX OSN 1800
l MC-A234 Services Are Interrupted After an Upgrade due to Improper Database
Restoration Operations on the OptiX OSN 6800
l MC-A248 Network Services Are Interrupted After the Equipment Restarts Because the
Wavelengths Conflict After the Wavelength Configurations Are Modified
Transient service interruption refers to the transient loss of signal at the millisecond level during
transmission.
Symptom
The OTU, or tributary board and line boards at station B may report the R_LOS, R_LOF,
OTUk_LOF, OTUk_LOM alarms on WDM side, and the alarms clear in a short time. All
services at station B are transiently interrupted.
Figure 6-1 Example of the network configuration where all services are transiently interrupted
Client Client
DEMUX MUX
OSC OSC
Station A (OTM) Station B (OTM)
Impact on System
Services are interrupted for milliseconds at station B.
Possible Causes
l Cause 1: The optical cable or fiber jumper in the multiplexing part is faulty.
l Cause 2: The equipment temperature is too high.
l Cause 3: The optical amplifier board is faulty.
l Cause 4: The FIU board is faulty.
Procedure
l Cause 1: The optical cable or fiber jumper in the multiplexing part is faulty.
1. Check the fiber jumper in the multiplexing part. If it is loose, secure it; if it is dirty,
clean it; if it is faulty, replace the fiber jumper.
NOTE
The fiber jumpers in the multiplexing part are the fiber jumpers between:
l The equipment and ODF
l The OA and FIU
l The OA and MUX/DEMUX
l ODFs
l TDC and RDC of the OAU
2. Clear the fault of the optical cable.
– Whether there are multiple fiber connectors in this section of optical line and
whether they are in good connection
– Whether there are aerial optical cables, which are liable to be damaged by the
weather
– Whether the type and attenuation coefficient of the optical fiber in this section is
correct
NOTE
The optical line fault is the usual cause which results in a transient interruption. Usually, the
transient interruption caused by a line fault affects all channels (including the OSC). The optical
line fault can be caused by a fiber cut or excessive attenuation.
However, the fiber cut cannot cause a transient interruption. When you have a line fault, find
the cause of excessive attenuation of the fiber and then remove the fault.
l Cause 2: The equipment temperature is too high.
1. Check whether the performance data of NE contain alarms such as TEMP_OVER. If
yes, the operating temperature of the board is too high. Check the environmental
conditions of the telecommunications room. Clean the air filter.
2. Check whether the fan tray assembly works normally. If not, replace the fan tray
assembly.
l Cause 3: The optical power fluctuates by the fault of the optical amplifier board.
1. In the case of the DWDM application, on the U2000, query the historical performance
data of optical power of OA board. Check the input and output optical power of the
optical amplifier board. If the input optical power is normal but the output optical
power does not reach the standard output optical power, replace the board.
l Cause 4: The FIU board is faulty.
1. In the case of the DWDM application, replace the FIU board. If the fault clears,
determine that the FIU board is faulty.
----End
Symptom
The IN interface on the WDM side of OTU1 at station C reports the R_LOS, R_LOF, R_OOF,
IN_PWR_LOW, and OTUk_LOF alarms, and the alarms clear in a short time. The services in
a single channel of station C are transiently interrupted.
Figure 6-2 Example of the network configuration where the services in a single channel are
transiently interrupted
MUX MUX
OTU1 DEMUX DEMUX OTU1
OA
DEMUX OA
OTU2 OTU2
Client Client
Station A OTM Station C OTM
Impact on System
Services are interrupted for milliseconds on one channel at station C.
Possible Causes
l Cause 1: The services are configured with protection. When a fault in the working channel
triggers protection switching, the switching times out or occurs frequently.
l Cause 2: The fiber jumper between the MUX/DEMUX and OTU1 is faulty.
l Cause 3: The OTU1 board at station A is faulty.
l Cause 4: The equipment temperature is extremely high.
Procedure
l Cause 1: The services are configured with protection. When a fault in the working channel
triggers protection switching, the switching times out or occurs frequently.
1. On the U2000, query the historical data of alarms. Check whether the protection
switching alarm exists (the equipment reports a PS alarm when protection switching
occurs). Handle the related protection switching alarm.
l Cause 2: The fiber jumper between the MUX/DEMUX and OTU1 is faulty.
1. Check the fiber jumper. If it is loose, secure it; if it is dirty, clean it; if it is faulty,
replace the fiber jumper.
l Cause 3: The OTU1 board at station A is faulty.
1. Query on the U2000 the historical performance data of the output optical power of
OTU1 at station A. If the data contains a performance event showing a drastic decline
of output optical power of OTU1 at station A, then OTU1 at station A is faulty. Replace
OTU1 at station A.
l Cause 4: The equipment temperature is extremely high. When the equipment temperature
is extremely high, the wavelength of the single channel deviates and the services are
interrupted.
1. On the U2000, query the historical data of alarms. If alarms such as TEMP_OVER
exists, determine that the operating temperature of the board is extremely high. Check
and lower the temperature in the telecommunications room. Clean the air filter.
2. Check whether the fan tray assembly works normally. If not, replace the fan tray
assembly.
----End
Symptom
l Symptom 1: The ALM_GFP_dLFD alarm jitter occurs on a single VCTRUNK port of an
Ethernet board, and services recover after a period of interruption.
l Symptom 2: The channel bound to a single VCTRUNK port on an Ethernet board
transiently reports the SDH-related alarms, such as the HP_UNEQ, HP_LOM, and
VCAT_LOA. Services are transiently interrupted.
l Symptom 3: The channel bound to a single VCTRUNK port on an Ethernet board
transiently reports the LCAS_TLCT and LCAS_TLCR bandwidth loss alarms. Services
are transiently interrupted.
Impact on System
The services in the VCG that reports the alarms are interrupted transiently.
Possible Causes
l For symptom 1, the possible causes are as follows:
– Cause 1: The fibers on the line side are faulty and recover within a short period.
– Cause 2: Bit errors exist on the line.
l For symptom 2, the possible causes are as follows:
– Cause 1: The configuration of the SDH service is changed manually.
– Cause 2: Channel binding is added or deleted on the opposite device manually.
– Cause 3: The VCTRUNK port uses multiple lines and the delay of a certain line is too
long.
Procedure
Step 1 For symptom 1: The ALM_GFP_dLFD alarm jitter occurs on a single VCTRUNK port of an
Ethernet board, and services recover after a period of interruption.
1. Cause 1: The fibers on the line side are faulty and recover within a short period.
a. Query the historical line alarms on the U2000.
If... Then...
The line from the VCTRUNK port of the The line is faulty. Replace the optical
local device to the VCTRUNK port of module and the fibers. Check whether
the opposite device generates the services recover. If the services do not
R_LOS alarm recover, proceed to the next step.
The line from the VCTRUNK port of the Proceed to the next step.
local device to the VCTRUNK port of
the opposite device does not generate the
R_LOS alarm
b. Check whether the fibers or the line boards are removed and reinserted manually.
If... Then...
The fibers or the line boards are removed Make sure that the fibers or the line
and reinserted manually boards is not removed and reinserted
manually. Check whether services
recover. If the services do not recover,
proceed to the next step.
The fibers or the line boards are not Proceed to the next step.
removed and reinserted manually
c. According to the transmission distance, check whether the optical module type is
correct.
If... Then...
The optical module type is incorrect See the Hardware Description and
Parts Replacement to select a correct
optical module and replace the optical
module on the device with the correct
one. Check whether services recover. If
the services do not recover, proceed to
the next step.
If... Then...
The fibers are faulty Replace the faulty fibers. Check whether
services recover. If the services do not
recover, proceed to the next step.
The fibers are normal Check whether the fault is due to other
causes.
If... Then...
The line from the VCTRUNK port of the See the Alarms and Performance Events
local device to the VCTRUNK port of Reference to handle the bit error alarm.
the opposite device generates the bit Check whether services recover. If the
error alarms B3_EXC, B3_SD, services do not recover, check whether
BIP_EXC, or BIP_SD the fault is due to other causes.
The line from the VCTRUNK port of the Check whether the fault is due to other
local device to the VCTRUNK port of causes.
the opposite device does not generate the
bit error alarms B3_EXC, B3_SD,
BIP_EXC, or BIP_SD
Step 2 For symptom 2: The channel bound to a single VCTRUNK port on an Ethernet board transiently
reports the SDH-related alarms, such as the HP_UNEQ, TU_LOP, HP_LOM, and VCAT_LOA.
Services are transiently interrupted.
1. Cause 1: The configuration of the SDH service is changed manually.
a. On the U2000, query the historical alarms of the VCTRUNK ports of the local and
the opposite devices.
If... Then...
The VCTRUNK ports of the local and Proceed to the next step.
the opposite devices generate the
alarms, such as the HP_UNEQ and
HP_LOM
If... Then...
The SDH cross-connection has been Make sure that the SDH cross-
deleted or set up manually connection is not manually deleted and
set up again. Check whether services
recover. If the services do not recover,
check whether the fault is due to other
causes.
The SDH cross-connection has not been Check whether the fault is due to other
deleted or set up manually causes.
a. On the U2000, query the historical alarms of the VCTRUNK ports of the local and
the opposite devices.
If... Then...
Only the VCTRUNK port of the local Proceed to the next step.
device generates the alarms, such as the
HP_UNEQ and HP_LOM
If... Then...
Channel binding of the device has been Make sure that channel binding is not
added or deleted manually added or deleted on the opposite device
manually. Check whether services
recover. If the services do not recover,
check whether the fault is due to other
causes.
If... Then...
Channel binding of the device has not Check whether the fault is due to other
been added or deleted manually causes.
3. Cause 3: The VCTRUNK port uses multiple lines and the delay of a certain line is too long.
a. On the U2000, query the historical alarms of the VCTRUNK port of the local device.
If... Then...
The VCTRUNK port of the local device Proceed to the next step.
generates the VCAT_LOA alarm
The VCTRUNK port of the local device Check whether the fault is due to other
does not generate the VCAT_LOA causes.
alarm
If... Then...
The SDH cross-connection and channel Make sure that the SDH cross-
binding have been deleted or set up connection and channel binding is not
manually deleted or set up manually. Check
whether services recover. If the services
do not recover, proceed to the next step.
If... Then...
The line from the VCTRUNK port of the The line is faulty. Replace the optical
local device to the VCTRUNK port of module and the fibers. Check whether
the opposite device generates the services recover. If the services do not
R_LOS alarm recover, proceed to the next step.
The line from the VCTRUNK port of the Proceed to the next step.
local device to the VCTRUNK port of
the opposite device does not generate the
R_LOS alarm
If... Then...
The delay of each route is rather long Optimize the network to reduce the
delay of each route. Check whether
services recover. If the services do not
recover, check whether the fault is due
to other causes.
The delay of each route is normal Check whether the fault is due to other
causes.
Step 3 For symptom 3: The channel bound to a single VCTRUNK port on an Ethernet board transiently
reports the LCAS_TLCT and LCAS_TLCR bandwidth loss alarms. Services are transiently
interrupted.
1. Cause 1: The line is faulty and then recovers. As a result, the channel changes to the WTR
state.
a. On the U2000, query the historical alarms of the channel used by the VCTRUNK port
of the local device.
If... Then...
If... Then...
The line from the VCTRUNK port of the The line is faulty. Replace the optical
local device to the VCTRUNK port of module and the fibers. Check whether
the opposite device generates the services recover. If the services do not
R_LOS alarm recover, proceed to the next step.
The line from the VCTRUNK port of the Proceed to the next step.
local device to the VCTRUNK port of
the opposite device does not generate the
R_LOS alarm
If... Then...
The delay is set short It is recommended that you set the WTR
time to the default 300s and the delay to
the default 2s.
NOTE
l When the network is stable, the WTR time should not be set too long; otherwise, the
interruption time of the services after the restoration of the network is long. If the channel
does not need to enter the WTR time, the WTR time can be set to 0s.
l The delay should not be set too short; otherwise, when the network is unstable, the channel
changes to the WTR state frequently. As a result, services are interrupted. It is
recommended that you set the delay to 2s.
----End
Related Information
None
Symptom
l Symptom 1: The LINK_ERR alarm jitter occurs on a single Ethernet port of an Ethernet
board that works in auto-negotiation mode, and services recover after a period of
interruption.
l Symptom 2: The ETH_CFM_LOC alarm jitter occurs on a single Ethernet port of an
Ethernet board, and services recover after a period of interruption.
Impact on System
Services on the Ethernet port are transiently interrupted.
Possible Causes
l For symptom 1, the possible causes are as follows:
– Cause 1: Optical fibers in the transmit direction of the local Ethernet port are faulty and
recover within a short period.
– Cause 2: The port interconnected to the Ethernet port switches between the auto-
negotiation mode and the fixed mode continuously.
– Cause 3: The board is faulty.
l For symptom 2, the possible causes are as follows:
– Cause 1: Optical fibers in the receive direction of the Ethernet port are faulty and recover
within a short period.
– Cause 2: The port interconnected to the Ethernet port is disabled or enabled within a
short period.
– Cause 3: The board is faulty.
Procedure
l For symptom 1: The LINK_ERR alarm jitter occurs on a single Ethernet port of an Ethernet
board that works in auto-negotiation mode, and services recover after a period of
interruption.
1. Cause 1: Optical fibers in the transmit direction of thelocal Ethernet port are faulty
and recover within a short period.
a. Query the historical alarms of the port of the interconnected device on the
U2000.
If... Then...
The ETH_CFM_LOC alarm exists Indicates that the fibers in the transmit
direction of the local Ethernet port
have been broken. As a result, the
negotiation fails at the local end and
the LINK_ERR alarm is reported
accordingly. In this case, services are
transiently interrupted. Check
whether services recover. If the
services do not recover, proceed to the
next step.
b. Check whether the fibers or the line boards are removed and reinserted manually.
Query the historical alarms of the interconnected port of the device on the
U2000.
If... Then...
The fibers or the line boards are Make sure that the fibers or the line
removed and reinserted manually boards is not removed and reinserted
manually. Check whether services
recover. If the services do not recover,
proceed to the next step.
The fibers or the line boards are not Proceed to the next step.
removed and reinserted manually
c. According to the transmission distance, check whether the optical module types
are correct.
If... Then...
The optical module type is incorrect Select a correct optical module and
replace the optical module on the
device with the correct one. Check
whether services recover. If the
services do not recover, proceed to the
next step.
If... Then...
The fibers are normal Check whether the fault is due to other
causes.
2. Cause 2: The port interconnected to the Ethernet port switches between the auto-
negotiation mode and the fixed mode continuously.
a. Query the operation log on the U2000.
If... Then...
The working mode of the Check whether the fault is due to other
interconnected port is not changed causes.
manually
1. Cause 1: Optical fibers in the receive direction of the Ethernet port are faulty and
recover within a short period.
a. Query the historical alarms of the Ethernet port on the U2000.
If... Then...
The ETH_CFM_LOC alarm exists Indicates that the fibers in the receive
direction of the local Ethernet port
have been broken. Proceed to the next
step.
The ETH_CFM_LOC alarm does not Check whether the fault is due to other
exist causes.
b. Check whether the fibers or the line boards are removed and reinserted manually.
Query the historical alarms of the interconnected port of the device on the
U2000.
If... Then...
The fibers or the line boards are Make sure that the fibers or the line
removed and reinserted manually boards is not removed and reinserted
manually. Check whether services
recover. If the services do not recover,
proceed to the next step.
The fibers or the line boards are not Proceed to the next step.
removed and reinserted manually
c. According to the transmission distance, check whether the optical module types
are correct.
If... Then...
The optical module type is incorrect Select a correct optical module and
replace the optical module on the
device with the correct one. Check
whether services recover. If the
services do not recover, proceed to the
next step.
If... Then...
If... Then...
The fibers are normal Check whether the fault is due to other
causes.
The port interconnected to the Make sure that these ports are not
Ethernet port is disable or enabled enabled or disabled manually. Check
manually whether services recover. If the
services do not recover, check
whether the fault is due to other
causes.
The port interconnected to the Check whether the fault is due to other
Ethernet port is not disable or enabled causes.
manually
Related Information
None
Symptom
l Symptom 1: The Rapid Spanning Tree Protocol (RSTP) is enabled, but the status of the
protocol jitters and the services are transiently interrupted.
l Symptom 2: The Link Aggregation Control Protocol (LACP) is enabled, but the status of
the protocol jitters and the services are transiently interrupted.
Impact on System
The services are transiently interrupted.
Possible Causes
l For symptom 1, the possible causes are as follows:
– Cause 1: The network topology is changed and the RSTP protocol reconstructs the
network topology.
– Cause 2: The RSTP protocol is reset.
– Cause 3: The RSTP protocol packets are lost. As a result, the status of the protocol
jitters.
l For symptom 2, the possible causes are as follows:
– Cause 1: The NE reports the LAG_PORT_FAIL alarm and the services are transiently
interrupted.
– Cause 2: The LACP protocol is reset and the services are transiently interrupted.
Procedure
l For symptom 1: The RSTP protocol has been enabled, but the state of the protocol jitters
and the services are transiently interrupted.
1. Cause 1: The network topology is changed and the RSTP protocol reconstructs the
network topology.
a. Check whether the network topology is changed manually.
If... Then...
If... Then...
The service interruption alarms are After the link is faulty, the services
reported, such as the are transiently interrupted due to the
ETH_CFM_LOC and reconstruction of the network
LINK_ERRLINK-ERR alarms of the topology by the RSTP protocol.
Ethernet port and the UNEQ and
TU_LOP alarms on the SDH side
If... Then...
The service interruption alarms are Check whether the fault is due to other
not reported, such as the causes.
ETH_CFM_LOC and
LINK_ERRLINK-ERR alarms of the
Ethernet port and the UNEQ and
TU_LOP alarms on the SDH side
If... Then...
Before the services are interrupted, Check whether the fault is due to other
the RSTP protocol has not been reset causes.
or the board has been warm reset
manually
NOTE
After the board is warm reset, the RSTP protocol is restarted automatically.
3. Cause 3: The RSTP protocol packets are lost. As a result, the status of the protocol
jitters.
a. Check the traffic flow of the services, and then check whether the QoS function
is enabled according to the service configuration.
If... Then...
The QoS function is enabled and the The state jitter of the RSTP protocol
bandwidth is insufficient is caused by the loss of the RSTP
protocol packets.
The QoS is not enabled Check whether the fault is due to other
causes.
NOTE
Generally, the priority of the protocol packets is higher than the priority of the service
packets. If the QoS function is enabled, the priority of the service packets may be higher
than that of the protocol packets. In this case, the protocol packets may be discarded when
the bandwidth is insufficient.
l For symptom 2: The LACP protocol has been enabled, but the state of the protocol jitters
and the services are transiently interrupted.
1. Cause 1: The NE reports the LAG_PORT_FAIL alarm and the services are transiently
interrupted.
If... Then...
The NE does not report the Check whether the fault is due to other
LAG_PORT_FAIL alarm causes.
2. Cause 2: The LACP protocol is reset and the services are transiently interrupted.
If... Then...
Before the services are interrupted, Check whether the fault is due to other
the LACP protocol has not been reset causes.
or the board has been warm reset
manually
NOTE
After the board is warm reset, the LACP protocol is restarted automatically.
----End
Related Information
None
NOTE
The following cases are related to the OptiX WDM product series.
Related Cases:
Correct optical power is critical to the performance of the WDM system. If the input optical
power is too high or too low, the system will generate bit errors, or even interrupt services.
Symptom
The optical power of all OTUs in the local station sharply declines, and the optical signal-to-
noise ratio (OSNR) is lower than the OSNR tolerance of the OTU.
NOTE
Impact on System
A large number of bit errors occur in the services and the services are transiently interrupted
frequently.
Possible Causes
l Cause 1: Any fiber jumper in the multiplexing part, or an optical cable degrades or is
physically damaged.
l Cause 2: The gain of the optical amplifier board in the upstream station or downstream
station declines.
Procedure
l Cause 1: Any fiber jumper in the multiplexing part, or an optical cable degrades or is
physically damaged. Hence, the attenuation of the fiber increases and the received optical
power declines. As a result, the OSNR decreases.
1. Refer to Checking Fiber Jumpers by Using an Optical Power Meter. Check whether
the fiber jumper in the multiplexing part is faulty. If yes, replace the fiber jumper.
2. Remove the fault of the optical cable.
NOTE
The fiber jumpers in the multiplexing part are the fiber jumpers between:
l The equipment and ODF
l The OA and FIU
l The OA and MUX/DEMUX
l ODFs
l TDC and RDC of the OAU
l Cause 2: The gain of the optical amplifier board in the upstream station or downstream
station declines.
1. Check whether the gain of the optical amplifier board is abnormal. If yes, replace the
board.
----End
NOTE
The following cases are related to the OptiX WDM product series.
Related Cases:
l MD-A13 Bit Error Alarm (BEFFEC_EXC) Is Generated When Optical Power Gets Close
to the Threshold
l MC-A15 The Downstream Optical Amplifier Board Does Not Report MUT_LOS
l MC-A19 Use Power Monitoring To Process Problems on IN_PWR_LOW Alarm
l MC-A24 Low Optical Power on the Client Side of the OTU Board Leads to R_LOS Alarm
on the Board
l MC-A35 LWM Output Optical Power Is Unstable upon Forced Light Generation
l MC-A36 The OTU in the OptiX BWS 1600G Reports IN_PWR_LOW Alarm
l MC-A43 Too High Insertion Loss Between TDC and RDC of the E3OAUC01C
l MC-A45 During the deployment of Raman, the OPU board reports MUT_LOS
l MC-A52 Faults of End Face of the Fiber Connector Cannot Be Identified
l MC-A54 After the Lasers of the Raman Amplifier Are Disabled, the Optical Power Is
Abnormal
l MC-A62 The Minimum Optical Power of the SC2 Is Detected As -35 dBm
l MC-A66 The Received Optical Power of Downstream Stations Is not Flat and Some
Wavelengths Report the IN_PWR_LOW Alarm
l MC-A68 OTU Boards Report the IN_PWR_LOW Alarm Due to Wavelength Wander
l MC-A90 The Incorrect Configuration of DWC Leads To Abnormal Optical Power and
Service Interruption, the WDM Side of the LBE Reports the IN_PWR_HIGH and
OTU_LOF Alarms Abruptly
l MC-A93 The Input Optical Power of the OTU Board Is Abnormal Due to a Fault of the
OPU Board
l MC-A101 The Wavelength Is Unstable or Changed Due to Optical Power Variation
l MC-A126 VA4 Board Reports R_LOS When the Receiver Power Is Below -23 dBm Even
If the Attenuation Is Decreased
l MC-A169 T2000 Reports Errors After the Standard Optical Power of Single Wavelength
Is Set to 7 dBm with ALC in Power Reference Mode
l MC-A174 Excessive PDL of Fiber Results in Slow Changes of the Received Optical Power
on an OTU Board
l MC-A182 Receive Optical Power Is Excessively Low Because of the End Face Problem
of the Fiber Jumper, the OAU Board Reports the MUT_LOS Alarm
l MC-A186 Inconsistency of Fiber Jumper Model and Fiber Connector Type of a Board
Causes Low Receive Optical Power, The Connected OAU Board At The Downstream
Station Reports MUT_LOS And R_LOS Alarms
l MC-A194 A Malfunctioning VA1 Board Causes Abnormal Optical Power, The NS2
Boards Reports IN_PWR_HIGH Alarms For Three Times
l MC-A196 The Insertion Loss of the FIU Board on OptiX OSN 6800 Is High Because the
Fiber and Fiber Connector Are Not Securely Connected
l MC-A198 Optical Power Can Be Detected at the IN and OUT Ports of the FIU Board
Because of Incorrect Connections of Fiber Patch Cords
l MC-A201 Frequent Fiber Cut Results in High Line Attenuation and the IN_PWR_LOW
Alarm Reported by the OTU Board
l MC-A203 System of Few Wavelengths Fails to Function
l MC-A239 Inappropriate Line Optical Power Adjustment Causes Deteriorated 40G
Performance
l MC-A242 Excessively High Incident Optical Power Results in Poor Performance of OTU
Boards
l MC-A244 The Insertion Loss of a Board Is Abnormal
8 Bit Errors
Bit errors refer to the errors that occur in transmitted bit stream. Bit errors are usually represented
by bit. In the SDH/SONET frame structure, bytes for monitoring bit errors include B1, B2, M1,
B3, and G1. However, in DWDM equipment, the OTU only provides non-intrusive monitoring
on B1. In the OTN frame structure, for the DWDM equipment, the OTU provides bit errors
monitoring on BIP8 bytes of SM, PM and TCM section.
Symptom
The optical power is normal but a large number of bit errors occur in services on multiple
channels.
The OTU, or tributary board and line board reports alarms such as OTUk_EXC, OTUk_DEG,
BEFFEC_EXC, B1_EXC, and B1_SD.
Networking configuration example is shown in Figure 8-1.
Impact on System
A large number of bit errors occur in services on multiple channels.
Possible Causes
l Cause 1: The attenuation of the optical cable or that of the fiber jumper in the multiplexing
part is too large.
l Cause 2: The equipment temperature is too high.
l Cause 3: The configuration of the dispersion compensation module (DCM) is improper.
l Cause 4: The optical power of the main path is faulty.
l Cause 5: The non-linearity of the optical fiber. Bit errors can be caused by the non-linearity
of the optical power but not often.
l Cause 6: The ring fiber adapter from the equipment to the ODF is damaged.
l Cause 7: The PMD exceeds the permitted range.
CAUTION
Multi-channel bit errors might be caused by the improper commissioning of the optical power
of the system, please check the optical power of system.
Procedure
l Cause 1: The attenuation of the optical cable or that of the fiber jumper in the multiplexing
part is too large.
1. Refer to Checking Fiber Jumpers by Using an Optical Power Meter. Check whether
the fiber jumper in the multiplexing part is faulty. If yes, replace the fiber jumper.
2. Remove the fault of the optical cable.
NOTE
The fiber jumpers in the multiplexing part are the fiber jumpers between:
l The equipment and ODF
l The OA and FIU
l The OA and MUX/DEMUX
l ODFs
l TDC and RDC of the OAU
l Cause 2: The equipment temperature is too high.
1. Check whether the NE generates alarms such as TEMP_OVER. If yes, check the
environmental conditions of the telecommunications room. Clean the air filter.
l Cause 3: The configuration of the dispersion compensation module (DCM) is improper.
1. Check optical cable parameters such as the type and actual length of the optical cable.
Check the configuration of the DCM. After that, adjust the compensation distance of
the DCM to a value that matches the actual length of the optical cable.
l Cause 4: The optical power of the main-path is abnormal.
1. Check the input and output optical power of boards between MUX of the transfer
station and the DEMUX of the receive station. If the input optical power is normal
but the output optical power does not reach the standard output optical power, replace
the board.
l Cause 5: The non-linearity of the optical fiber. Bit errors can be caused by the non-linearity
of the optical power but not often.
1. Decrease or increase the optical power, to judge whether bit errors are caused by the
non-linearity of the optical fiber.
– Increasing optical power causes bit errors to increase.
– Decreasing optical power causes bit errors to decrease.
2. Increase the value of attenuator.
NOTE
Perform this procedure only after all other possible causes have been checked.
l Cause 6: The ring fiber adapter from the equipment to the ODF is damaged.
1. Check the ring fiber adapter between the ODF and WDM equipment. If the ring fiber
adapter is faulty, replace it.
l Cause 7: The PMD exceeds the permitted range.
1. Test the PMD of optical fibers in all spans to obtain the PMD values. Resplice the
splicing points of optical fibers with very large PMD and resplice the splicing points
with large attenuation. The attenuation of optical fibers will improve.
2. The PMD values of several spans are larger than the recommended value in the system
design. It is recommended that the fiber core be replaced to ensure that the total PMD
value of optical fibers between electrical regenerators of all channels is smaller than
the system design. In this manner, the system can work stably in the long run and the
system instability caused by large PMD can be resolved radically.
NOTE
The 40G services require a smaller PMD. When the accumulated PMD exceeds the threshold, a large
OSNR penalty will be introduced, deteriorating the system performance. In a short period of time, high
PMD leads to the generation of the BEFFEC_EXC alarm for some wavelengths and irregular changes to
the receive-end system performance.
----End
Symptom
The OTU, or tributary board and line board reports alarms such as OTUk_EXC, OTUk_DEG,
BEFFEC_EXC, B1_EXC, and B1_SD.
Client Client
equipment OTU1 OTU1 equipment
MUX MUX
DEMUX DEMUX
OA OA
OTUn OTUn
Station A (OTM) Station B (OTM)
Impact on System
A large number of bit errors occur in one channel and the services are transiently interrupted.
Possible Causes
l Cause 1: The input optical power of the OTU, or tributary board and line board on the
WDM side is abnormal.
l Cause 2: The fiber jumper between the MUX and ODF is aged or bent. Hence, the
attenuation of the fiber jumper is too large.
Procedure
l Cause 1: The input optical power of the OTU, or tributary board and line board on the
WDM side is abnormal.
1. Checking fiber jumpers by using an optical power meter. Check whether the input
optical power of the OTU, or tributary board and line board is too low. If yes, clean
the fiber.
l Cause 2: The fiber jumper between the OTU, or tributary board and line board, MUX/
DMUX or the fiber jumper between the MUX and ODF is aged or bent. Hence, the
attenuation of the fiber jumper is too large.
1. Replace the fiber jumper.
l Cause 3: The equipment temperature is too high.
1. Check whether the NE contains alarms such as TEMP_OVER. If yes, the operating
temperature of the board is too high. Check the environmental conditions of the
telecommunications room. Clean the air filter.
l Cause 4: The FEC modes of the two interconnected boards might be different.
1. Query the FEC mode of the two interconnected boards on the U2000. If the two modes
differ, re-set the FEC Working state and ensure state matching.
l Cause 5: The optical interfaces of the OTU, or tributary board and line board board are
dirty.
1. If the optical interfaces of the OTU, or tributary board and line board board are dirty,
blow them with compressed gas to clean them.
l Cause 6: The OTU, or tributary board and line board is faulty.
1. If the fiber jumper, input optical power and operating temperature are normal, and the
FEC modes of the OTU, or tributary board and line boards are matching, replace the
OTU, or tributary board and line board at receive end. If the bit errors disappear, it
indicates that the OTU, or tributary board and line board at receive end is faulty.
2. If the OTU, or tributary board and line board at receive end is normal, replace the
OTU, or tributary board and line board at transmit end. If the bit errors disappear, it
indicates that the OTU, or tributary board and line board at transmit end is faulty.
----End
Symptom
The telecommunications room of a certain site does not meet the environmental requirements
of a device. The device reports bit errors.
Impact on System
Unfavorable environmental conditions may cause bit errors in services.
Possible Causes
l Cause 1: The temperature and humidity of the environment do not meet the environmental
requirements of a device.
l Cause 2: A strong external interference source exists.
l Cause 3: The grounding is improper.
l Cause 4: The cable is faulty.
Procedure
l Cause 1: The temperature and humidity of the environment do not meet the requirements
of long term running of the device. As a result, bit errors are generated.
1. Check whether the temperature and humidity of the environment meet the
requirements.
If... Then...
The temperature and humidity of the Check and adjust the air conditioner in
environment do not meet the the telecommunications room to make
requirements sure that the temperature and humidity
of the telecommunications room are in
the proper range. Check whether the bit
errors persist. If the bit errors persist,
check whether the fault is due to other
causes.
If... Then...
The NE does not report the temperature Check whether the fault is due to other
threshold-crossing alarms causes.
l Cause 2: A strong external interference source exists. As a result, bit errors are generated.
1. Check whether there is a transformer, high-voltage transmission line, or high-current
device near the device.
If... Then...
If... Then...
If... Then...
4. Check whether a hand-held mobile communication device is in use near the device.
If... Then...
3. Check whether the front door and side panel of the cabinet are properly connected to
the cabinet. If yes, proceed to the next step. If not, connect the front door and side
panel to the cabinet properly.
4. Check whether the subrack is connected to the cabinet properly. If yes, proceed to the
next step. If not, connect the subrack to the cabinet properly.
5. Check whether the DDF and ODF are properly grounded. If yes, proceed to the next
step. If not, ground the DDF and ODF properly.
6. Check whether the NMS devices and all power-consumption devices are properly
grounded. If yes, proceed to the next step. If not, ground the devices properly.
7. Check whether the interconnected equipment is jointly grounded.
l Cause 4: The cable is faulty.
1. Test the transmit optical power of the board at the opposite end and the receive optical
power of the board at the local end.
2. The difference between the transmit optical power and the receive optical power is
the actual loss of optical power on the line. If the difference varies much with the
design value of the line attenuation, locate and clear the fault using the following steps.
– Check whether the bending radius of fibers is equal to or larger than standard. If
the bending radius is smaller than standard, route the fiber again.
– Check whether the multiple fiber connectors are correctly connected on the line.
If certain connectors are badly connected, push the connectors into the optical
module and ensure that the connectors are caught tightly.
– Check whether there are aerial optical fiber cables on the line. If yes, provide
external protection to the cables because they are easily affected by the weather.
– Check the optical fiber type and the attenuation coefficient of the line. Ensure that
the type and attenuation coefficient are consistent with the design document. If
they are inconsistent with the design document, replace the fiber.
– Check whether the fiber connectors are dirty. If the fiber connectors are dirty, clean
them immediately.
3. Check whether services recover. If not, proceed to other causes.
----End
Related Information
Table 8-1 lists the requirement for the electromagnetic interference.
G.652D 30 mm
G.657A2 10 mm
G.657B3 8 mm
Note: When fibers are connected to CRPC boards, the bending radius of the fibers must be
greater than or equal to 50 mm to ensure expected transmission performance.
Symptom
Large amounts of regeneration section bit errors and multiplex section bit errors are generated
on the line.
Impact on System
Device faults cause bit errors.
Possible Causes
l Cause 1: The clock unit is faulty.
l Cause 2: The cross-connect unit is faulty.
l Cause 3: The line board is faulty.
Procedure
Step 1 Cause 1: The clock unit is faulty. As a result, bit errors are generated.
1. Check the configuration of the clock on the NE.
If... Then...
The configuration of the clock on the NE Re-configure the clock on the NE. Check
is incorrect whether the bit errors persist. If the bit
errors persist, proceed to the next step.
If... Then...
The configuration of the clock on the NE Re-configure the clock on the NE. Check
is incorrect whether the bit errors persist. If the bit
errors persist, proceed to the next step.
Step 2 Cause 2: The cross-connect unit is faulty. As a result, bit errors are generated.
1. Replace the faulty board.
2. Check whether services recover. If the services do not recover, check whether the fault is
due to other causes.
Step 3 Cause 3: The line board is faulty. As a result, bit errors are generated.
1. Replace the faulty board.
2. Check whether services recover. If the services do not recover, check whether the fault is
due to other causes.
----End
Related Information
After the device reports bit errors, you need to analyze the features of the bit errors to locate the
fault to a single site.
You can use the loopback method to locate faults to a board of a certain site, and then use the
replacement method to reset or replace the doubted board.
NOTE
The following cases are related to the OptiX WDM product series.
Related Cases:
l MD-A13 Bit Error Alarm (BEFFEC_EXC) Is Generated When Optical Power Gets Close
to the Threshold
l MC-A16 The R_LOF and R_OOF Alarms Are Reported in the 24-Hour Bit Error Test
Because the Line Fiber Loss Is Very Large
l MC-A31 Wrong Calculation for Dispersion in One DWDM Project
l MC-A40 Over Compensation Causes Very High Bit Error Rate of the Short Waves After
Correction and the SM_BIP8_OVER and SM_BIP8_SD Alarms Are Transiently Reported
on the Board
l MC-A44 Bit Errors Generated in the Services, the Meter Reports the HP_RDI and ERROR
Alarms
l MC-A55 Improper DCM Distribution Causes Abnormal Service and Bit Error Alarm
BEFFEC_EXC Is Detected on the Newly-added LWFS Board
l MC-A63 MSBBE Bit Errors and OTU_LOF Alarm Occur in a DWDM 10G Network Due
to the Incorrect PMD
l MC-A88 The OTU Board Reports the B1_EXC Alarm, And There Are the Difference in
BER Reported by the OTU Board and Test Instrument
l MC-A118 An R_OOF Alarm Occurs When the LWC Interconnects with Third-Party
Equipment
l MC-A124 Determine the Factors That Affect the OSNR By Using the ANT30 Tester
l MC-A199 An OTU Board Reports Bit Errors Due to High Fiber Reflection
l MC-A202 Mixing Compensation of DCMs Results in the Inconsistency of Channel
Performance
l MC-A205 WDM-Side Pre-FEC BER Changes Drastically
l MC-A240 Isolated Bit Errors Are Found in the Long-Term Bit Error Testing of a 40G
System
The scenario is applicable to only the OptiX OSN 8800. This topic describes the common
methods and measures that can be adopted to rectify a fault in pointer justification.
9.3 Pointer Justification Due to Over-Low Precision of the External Clock Source
This topic describes the symptoms, impact, and possible causes of the pointer justification due
to the over-low clock source precision. In addition, this topic describes the required tools,
precautions, and procedures for rectifying the fault.
Symptom
After the fibers of a network are broken, services of respective NEs are switched normally. The
AU and TU pointer justifications, however, occur on the line, and this is accompanied by bit
errors. The pointer justification frequency and the bit error rate increase with time.
Impact on System
Incorrect clock configuration causes pointer justification.
Possible Causes
l Cause 1: The clock configuration is incorrect.
Procedure
Step 1 Cause 1: Incorrect clock configuration causes pointer justification.
1. Check whether the transmit network traces two or more clock sources and whether the
clock sources trace each other, which results in the pointer justification. If the configuration
is correct, proceed to the next step.
2. Check whether the precision of the traced clock source is low or there are too many tracing
sites. If the configuration is correct, proceed to the next step.
NOTE
Provide clock compensation for long clock chain: the number of G.812 slave clocks in a transmission
link should not be more than 10; the number of G.813 clocks between two G.812 slave clocks should
not be more than 20; the number of G.813 clocks between the G.811 and G.812 slave clocks should
not be more than 20; the number of G.813 slave clocks should not be more than 60.
3. Check whether the clock protection subnet is not configured. After the main clock is lost
(or the fiber is broken), the non-protection of the clock causes the pointer justification. If
the configuration is correct, proceed to the next step.
4. Check whether the priority configuration of the clock source is correct. After the clock
protection switching, clocks begin to trace each other, which results in the pointer
justification. If the configuration is correct, proceed to the next step.
5. Check whether the internal clock source of the main clock NE is configured with the clock
source ID. After the clock source of a higher level is lost, the NE enters the free-run mode.
Other NEs will not synchronize to the central site. As a result, all NEs in this clock subnet
are in the free-run mode. Thus the pointer justification occurs accordingly. If the
configuration is correct, proceed to the next step.
6. Check whether the SSM clock protection is enabled. When the clock quality deteriorates,
the protection switching cannot be executed, thus causing the pointer justification. If the
configuration is correct, proceed to the next step.
7. When the settings of the SSM mode on the network are different, the NE with the SSM
mode enabled cannot extract the line clock of other NEs. In this case, the NE enters the
free-run mode, thus causing the pointer justification.
----End
Related Information
None
Symptom
The telecommunications room of a certain site does not meet the requirements of device running,
and the AU and TU pointer justifications occur on the line.
Impact on System
Over-high temperature and humidity cause pointer justification.
Possible Causes
l Cause 1: The temperature and humidity of the environment do not meet the requirements
of long term running of the device.
Procedure
Step 1 Cause 1: The temperature and humidity of the environment do not meet the requirements of long
term running of the device. As a result, the pointer justification occurs.
1. Check whether the temperature and humidity of the environment meet the requirements.
If... Then...
The temperature and humidity of the Check and adjust the air conditioner in the
environment do not meet the requirements telecommunications room to make sure
that the temperature and humidity of the
telecommunications room are in the proper
range. Check whether the pointer
justification persists. If the pointer
justification persists, check whether the
fault is due to other causes.
If... Then...
The NE reports the temperature threshold- See the Alarms and Performance Events
crossing alarms TEMP_OVER Reference to handle the TEMP_OVER
alarms. Check whether the pointer
justification persists. If the pointer
justification persists, check whether the
fault is due to other causes.
----End
Related Information
None
Symptom
After the network operates for a certain period, AU pointer justification and TU pointer
justification occur on a large scale on the network.
Impact on System
Over-low precision of the external clock source causes pointer justification.
Possible Causes
l Cause 1: The precision of the external clock source is over-low.
Procedure
Step 1 Cause 1: Over-low precision of the external clock source causes pointer justification.
1. Check the external clock source. It is found that the central site traces the clock source of
the switch or other clock sources with lower levels.
2. Change the clock tracing of the central site into internal free-run mode. Observe the clock
tracing for 24 hours and it is found that the fault of pointer justification is rectified.
3. Introduce the clock with a higher level to solve the problem thoroughly.
----End
Related Information
The SDH device has high requirements on the external clock source, the external clock source
must meet at least the requirements in the G.813 of SDH synchronization quality level. If the
precision of the external clock source is over-low or the clock quality deteriorates, pointer
justification occurs on the entire network.
10 NE Offline Problems
When the communication between an NE and the U2000 fails, this NE cannot be reached by the
U2000.
Symptom
One NE at a station is unreachable by the U2000, whereas the other NEs at this station are
reachable. The NE_COMMU_BREAK and NE_NOT_LOGIN alarms are reported on the
U2000.
Impact on System
The NE is unreachable by the U2000.
Possible Causes
l Cause 1: The network cable is disconnected or loose. For example, between the NE and
the HUB, between the GNE and the computer where the NMS server is installed, between
the GNE and non-GNEs, between the master and slave subrack of an NE.
l Cause 2: The NE Attributes of the NE is changed by mistake, such as the IP address, ID,
Gateway Type, User Name or Password
l Cause 3: The system control board on the NE is faulty.
l Cause 4: ECC communication is interrupted.
l Cause 5: The NE(non-first gateway NE) encounters system control board active/standby
switching.
l Cause 6: Coherent and non-coherent boards are interconnected.
Procedure
l Cause 1: The network cable is disconnected or loose. For example, between the NE and
the HUB, between the GNE and the computer where the NMS server is installed, between
the GNE and non-GNEs, between the master and slave subrack of an NE.
1. Determine if the network cable connection between the NE and the HUB is loose. If
the network cable connection is loose, secure it.
2. Check whether network cables are correctly connected between the GNE and the
computer where the NMS server is installed, between the GNE and non-GNEs,
between the master and slave subrack of an NE.
3. Use a cable tester to test the network cable and the network port. If the network cable
is faulty, replace it.
4. If the network cable is good, replace the interface, to which the network cable is
connected to the HUB. If the problem clears, the interface of the HUB may be
damaged.
5. If the network cable and the HUB is good, the interface board (such as EFI, EFI1,
EFI2, or AUX) is faulty, replace it.
l Cause 2: The NE Attributes of the NE is changed by mistake, such as the IP address, ID,
Gateway Type, User Name or Password.
1. Log in to the NE on the Web LCT and restore the initial IP address , ID, gateway type,
NE user or password of the NE according to the record.
l Cause 3: The system control board on the NE is faulty.
1. Log in to the NE on the Web LCT. If you fail to log in, the system control board is
faulty. Reset the board. If the problem persists, replacing the SCC board.
l Cause 4: ECC communication is interrupted.
1. A maximum of 16 users can simultaneously log in to an NE. If the number of users
reaches the upper limit, other users are not permitted to log in to the NE. For the method
of handling the interruption of ECC communication, see 17.1 ECC Communication
Interruption.
l Cause 5: The NE(non-first gateway NE) encounters system control board active/standby
switching.
1. No handling is required. Communication between the NE and U2000 will be restored
after the system control board active/standby switching.
l Cause 6: Coherent and non-coherent boards are interconnected.
1. The DCN bytes are defined differently for coherent and non-coherent services, so the
two types of services cannot be interconnected. If DCN channels are unavailable,
check whether the interconnected boards are coherent boards.
----End
Symptom
All NEs in a subnet are unreachable by the U2000. The NE_COMMU_BREAK,
NE_NOT_LOGIN, and GNE_CONNECT_FAIL alarms are reported on the U2000.
Impact on System
The NEs are unreachable by the U2000 in the subnet.
Possible Causes
l Cause 1: The gateway NE is faulty.
l Cause 2: The network cable between the gateway NE and the HUB is disconnected or the
interface to the HUB is damaged.
Procedure
l Cause 1: The gateway NE is faulty. No backup gateway NE is configured for the subnet.
1. Determine if the power supply of the gateway NE is faulty. If yes, clear the fault of
the power supply.
2. Log in to the gateway NE on the Web LCT. If you fail to log in, the system control
board is faulty. replacing the SCC board.
l Cause 2: The network cable between the gateway NE and the HUB is faulty.
1. Determine if the network cable connection between the gateway NE and the HUB is
loose. If the network cable connection is loose, secure it.
2. Use a cable tester to test whether the network cable is good. If the network cable is
faulty, replace it.
3. If the network cable is good, replace the interface to the HUB. If the problem clears,
the HUB is damaged.
l Cause 3: The IP address of the gateway NE is changed.
1. Log in to the NE on the Web LCT and restore the initial IP address of the gateway
NE according to the record.
l Cause 4: The optical cable is faulty.
1. When the optical cable is cut off in the main optical path, the NEs in the subnet become
unreachable by the U2000. At the same time, a large number of alarms and
performance events occur in the main optical path. Hence, the normal operation of
services is affected. In this case, the optical cable is faulty.
l Cause 5: The gateway NE encounters the system control board active/standby switching if
only one gateway NE is provided. The primary gateway NE encounters system the control
board active/standby switching if the primary and secondary gateway NEs are provided.
1. No handling is required. Communication between the gateway NE and U2000 will be
restored after the system control board active/standby switching.
----End
Solution
To prevent all NEs in a subnet from becoming unreachable by the U2000 when the gateway NE
fails, configure a backup gateway NE for the NEs in the subnet.
Symptom
An NE frequently becomes unreachable by the U2000. The NE_COMMU_BREAK,
NE_NOT_LOGIN, and GNE_CONNECT_FAIL alarms are reported on the U2000.
Impact on System
The NE is unreachable by the U2000.
Possible Causes
l Cause 1: The IP address of the NE conflicts with that of another NE.
l Cause 2: The ECC communication is intermittent.
l Cause 3: The load of ECC communication is extremely heavy.
l Cause 4: Another NE user with the same account as that of the current NE user in the
network has logged in to the NE.
l Cause 5: When ESC and OSC coexist between station A and B, the system preferentially
selects the ESC for communication. Bit errors leads to the unstable link and the ESC
communication fails. The switching between ESC and OSC occurs frequently because the
route of ESC is shorter than the route of OSC.
l Cause 6: The scale of the DCN subnetwork is extremely large.
Procedure
l Cause 1: The IP address of the NE conflicts with that of another NE.
1. Determine if the IP addresses of the other devices (such as routers and servers) that
are newly configured in the network are the same as the IP address of the frequently
unreachable NE. Make sure that the IP address of the NE is unique.
l Cause 2: The ECC communication is intermittent.
1. For the method of handling the intermittent ECC communication, see 17.2
Intermittent ECC Communication.
l Cause 3: The load of ECC communication is extremely heavy.
1. For the method of handling the extremely heavy load of ECC communication, see
10.4 The System Control Board Is Frequently Reset.
l Cause 4: Another NE user with the same account as that of the current NE user in the
network has logged in to the NE. As a result, the NE becomes unreachable, and the
NE_NOT_LOGIN alarm is frequently reported on the U2000.
1. Plan and manage the users that can log in to the NE. For the information on NE user
management, see "Creating an NE User".
l Cause 5: When ESC and OSC coexist between station A and B, the system preferentially
selects the ESC for communication. Bit errors leads to the unstable link and the ESC
communication fails. The switching between ESC and OSC occurs frequently. The NEs at
station B are frequently unreachable by the U2000.
1. Disable the ESC communication between station A and B on the U2000, keep the
OSC communication.
– In NE Explorer, select the NE.
– Select Communication > DCC Management from Function Tree.
– In DCC Rate Configuration, select the board. Double click the parameter area of
Enabled/Disabled.
– Choose Disabled in the drop list.
– Click Apply.
2. Rectify the bit errors on the link, for details refer to 8 Bit Errors.
3. Enable the ESC communication between station A and B after the link becomes stable.
l Cause 6: The scale of the DCN subnetwork is extremely large.
1. Due to the inherent limitations of the DCC protocol, there is a limit on the number of
NEs (physical NEs with unique IDs) in a network even though there are sufficient
GNEs. If the number of NEs in a network exceeds the limit, the network must be
divided into multiple DCN subnets. The details refer to DCN Networking Capability.
NOTE
Each GNE can connect to a maximum of 50 non-GNEs, regardless of the protocol. If there are more
than 50 non-NEs for one GNE, another GNE must be added.
----End
Symptom
Due to an ECC communication overload, the system control board is frequently reset and the
U2000 is frequently unable to reach the NE.
System Impact
The U2000 cannot reach the NE.
Possible Causes
l Cause 1: The ECC communication load is extremely heavy.
Procedure
l Cause 1: The ECC communication load is extremely heavy. The scale of the ECC network
is extremely large and the inter-NE ECC communication load exceeds the maximum
processing capability of the NEs in the network. As a result, the system control board
frequently resets.
1. Check the scale of the ECC network. The number of NEs in the network cannot exceed
100. If there are more than 100 NEs in the network, divide the network into subnets.
Create a data communication network (DCN) management channel for each subnet
so the subnets are independent of each other.
2. Check the per-NE scale. When multiple pieces of equipment are connected by using
a HUB or are cascaded inter subracks, the extended ECC function of the network
interfaces of the equipment is used for communication. If not more than four pieces
of equipment are connected by using one HUB, the automatic extended ECC function
can be enabled. If more than four pieces of equipment are connected by using one
HUB, the manual extended ECC function is recommended for communication to
prevent ECC storms. For details on how to set the manual extended ECC function,
refer to "Configuring Extended ECC in Specified Mode".
----End
NOTE
The following cases are related to the OptiX WDM product series.
Related Cases:
l MC-A20 The T2000 Cannot Log in to the Remote GNE Connected by the Router
l MC-A22 NE ESC Communication Interrupted Because of the Closure of the OTU Laser
l MC-A64 Many BD_STATUS Alarms Occur Due to the ECC Storm
l MC-A83 The OptiX OSN 6800 Is Not Reachable When the NE IP Is Modified
l MC-A110 Question About the Extended ECC of WDM NEs
l MC-A113 Communication Between the WDM Equipment And the DCN Network Fails
After a Hub Is Connected To the DCN Network
l MC-A135 The Communication Between the NE and the T2000 Stops Because the NE
Software Cannot Switch the ECC from an OSC Board to an OTU Board Automatically
l MC-A181 An NE Is Frequently Unreachable to the NMS Due to Insufficient Processing
Capacity of a Router
l MC-A212 Secondary GNE Becomes Unavailable After a Change of the Maximum Number
of Route Hops of an NE
l MC-A213 Internal Communication of an NE Is Abnormal And Many Boards Report
Transiently BD_STATUS or COMMUN_FAIL Alarms Due to Conflicted Subrack IDs
Fault Description
The board is offline and unavailable on the U2000 while the other boards are online.
Possible Causes
l Cause 1: The slot that houses the board is faulty.
l Cause 2: The board is faulty.
Procedure
l Cause 1: The slot that houses the board is faulty.
1. Reseat the board to check whether the fault is removed.
2. If the fault persists, change the slot that houses the board.
l Cause 2: The board is faulty.
1. Replace the faulty board.
----End
Fault Description
Multiple or all boards are offline and unavailable on the U2000.
Possible Causes
l Cause 1: The NE is unreachable.
l Cause 2: The network cable between the master and slave subracks is faulty.
l Cause 3: The backplane is faulty.
Procedure
l Cause 1: The NE is unreachable.
1. If the U2000 reports NE_COMMU_BREAK or NE_NOT_LOGIN alarm, the NE is
likely unreachable. For details, see 10.1 Unreachability of One NE by the U2000.
l Cause 2: The network cable between the master and slave subracks is faulty.
1. Check whether the network cable between the master and slave subracks is damaged
or loose and remove the fault accordingly.
l Cause 3: The backplane is faulty.
1. Replace the subrack.
----End
This topic describes how to handle services that fail to be automatically switched to the protection
channel when the working channel works abnormally.
For information on the protection schemes supported by the OptiX OSN 8800, OptiX OSN 6800,
and OptiX OSN 3800, see the Feature Description"Feature Overview".
12.1 Optical Line Protection
This topic describes how to resolve switchover issues when optical line protection fails to
automatically switch services to the protection channel.
12.2 Intra-Board 1+1 Protection
This topic describes how to resolve switchover issues when intra-board 1+1 protection fails to
automatically switch services to the protection channel.
12.3 Client-side 1+1 Protection
This topic describes how to resolve switchover issues when client 1+1 protection fails to
automatically switch services to the protection channel.
12.4 SW SNCP Protection
This section describes the handling methods for services with SW SNCP protection that fail to
automatically switch to the protection channel when the working channel works abnormally.
12.5 ODUk SNCP Protection
This topic describes how to resolve switchover issues when ODUk SNCP protection fails to
automatically switch services to the protection channel.
12.6 VLAN SNCP Protection
This section describes the handling methods for services with VLAN SNCP protection that fail
to automatically switch to the protection channel when the working channel works abnormally.
12.7 Tributary SNCP Protection
This topic describes how to resolve switchover issues when tributary SNCP protection fails to
automatically switch services to the protection channel.
12.8 Automatic Switching Failure of DBPS Protection
This section describes the handling methods for services with DBPS protection that fail to
automatically switch to the protection channel when the working channel works abnormally.
12.20 SNCP
This section describes the handling methods for services with SNCP protection that fail to
automatically switch to the protection channel when the working channel works abnormally.
12.21 SNCTP
This section describes the handling methods for services with tributary SNCTP protection that
fail to automatically switch to the protection channel when the working channel works
abnormally.
Symptom
In normal operation of a network, the working channel has generated triggering conditions for
protection switching, but services cannot be automatically switched to the protection channel.
In this case, the services are interrupted.
l During normal network operations, conditions that trigger protection switching occur on
the working channel, but services do not automatically switch to the protection channel. In
this case, the services are interrupted.
l The optical line protection (OLP) board may report an OLP_PS or OLP_STA_INDI alarm.
Possible Causes
l Cause 1: The protection group parameters are incorrectly configured.
l Cause 2: Locked or forced switching commands have been manually issued.
l Cause 3: The optical fibers are improperly connected to the corresponding boards.
l Cause 4: The protection channel is working abnormally.
Troubleshooting Workflow
The following figure illustrates the optical line protection automatic switching failure
troubleshooting process.
Figure 12-1 Optical line protection automatic switching failure troubleshooting flowchart
Start
Yes No
No No
Yes No
Yes No
Forcibly switch the End
services to
the protection channel.
Contact Huawei
engineers.
Procedure
l Cause 1: The protection group parameters are incorrectly configured.
1. In the NE Explorer, click the NE and choose Configuration > Port Protection from
the Function Tree.
2. In the Port Protection window, click Query to query all protection group parameter
values. In the Operation Result dialog box, click Close.
3. Compare these values against those specified in "Dependency and Limitation" and
"Parameters" to ensure that the parameters are configured correctly.
l Cause 2: Locked or forced switching commands have been manually issued.
1. In the Port Protection window, click Query to check the Switching Status parameter
of the protection group.
2. If Switching Status is Locked or Force to Working Channel, click Function and
choose Clear to clear the locked or forced switching commands.
l Cause 3: The optical fibers are improperly connected to the corresponding boards.
1. Check whether the optical fibers are properly connected to optical ports 1 (RI1/TO1)
and 2 (RI2/TO2) on the OLP boards located at the two stations next to the failure point.
For details, refer to "Principles".
l Cause 4: The protection channel is working abnormally.
1. In the Port Protection interface, click Query to check the Protection Channel
Status parameter of the protection group.
2. If Protection Channel Status is SF, check whether MUT_LOS and
POWER_DIFF_OVER alarms are being reported on the corresponding boards on the
protection channel, clear the alarms.
NOTE
If Protection Channel Status is SF, faults have occurred on both the working and protection
channels.
----End
Solution
If the preceding measures are not successful in switching the services to the protection channel,
and Protection Channel Status is displayed as Normal, you need to perform a forced switching
to switch the services to the protection channel, which eventually restores the services.
For more information regarding how to handle faults that occur in the working channel, clean
the Alarms and Performance Events in the NE. If necessary, contact Huawei engineers to resolve
the problem.
Symptom
In normal operation of a network, the working channel has generated triggering conditions for
protection switching, but services cannot be automatically switched to the protection channel.
In this case, the services are interrupted.
l When the OLP board initiates intra-board 1+1 protection, the OLP board may report an
INTRA_OTU_PS or INTRA_OTU_STA_INDI alarm.
l When the intra-board 1+1 protection is realized by dual fed and selective receiving OTU,
the OTU board may report the INTRA_OTU_PS alarm or INTRA_OTU_STA_INDI
alarm.
Possible Causes
l The protection group parameters are incorrectly configured.
l Locked or forced switching commands have been manually issued.
l The optical fibers are improperly connected to the corresponding boards.
l The protection channel is working abnormally.
Troubleshooting Flow
The following figure illustrates the intra-board 1+1 protection automatic switching failure
troubleshooting process.
Figure 12-2 Intra-board 1+1 protection automatic switching failure troubleshooting flowchart
Start
Yes No
No No
Yes No
Yes No
Forcibly switch the End
services to
the protection channel.
Contact Huawei
engineers.
Procedure
l Cause 1: The protection group parameters are incorrectly configured.
1. In the NE Explorer, click the NE and choose Configuration > Port Protection from
the Function Tree.
2. In the Port Protection window, click Query to query all protection group parameter
values. In the Operation Result dialog box, click Close.
3. Compare these values against those specified in "Dependency and Limitation" and
"Parameters" to ensure that the parameters are configured correctly.
l Cause 2: Locked or forced switching commands have been manually issued.
1. In the Port Protection window, click Query to check the Switching Status parameter
of the protection group.
2. If Switching Status is Locked or Force to Working Channel, click Function and
choose Clear to clear the locked or forced switching commands.
l Cause 3: The optical fibers are improperly connected to the corresponding boards.
1. For 2.5G OTU boards, check whether the optical fibers are properly connected to the
optical ports for working and protection channels on the OTU, or tributary or line
boards located at the two stations next to the failure point. For 5G, 10G, and 40G
OTU, or tributary or line boards, check whether the optical fibers are properly
connected to the optical ports for working and protection channels on the OLP or DCP
boards located at the two stations next to the failure point. For details, refer to
"Principles".
NOTE
----End
Solution
If the preceding measures are not successful in switching the services to the protection channel,
and Protection Channel Status is displayed as Normal, you need to perform a forced switching
to switch the services to the protection channel, which eventually restores the services.
For more information regarding how to handle faults that occur in the working channel, clean
the Alarms and Performance Events in the NE. If necessary, contact Huawei engineers to resolve
the problem.
Symptom
In normal operation of a network, the working channel has generated triggering conditions for
protection switching, but services cannot be automatically switched to the protection channel.
In this case, the services are interrupted.
Possible Causes
l The protection group parameters are incorrectly configured.
l Locked or forced switching commands have been manually issued.
l The optical fibers are improperly connected to the corresponding boards.
l The protection channel is working abnormally.
Troubleshooting Workflow
The following figure illustrates the client 1+1 protection automatic switching failure
troubleshooting process.
Figure 12-3 Client 1+1 protection automatic switching failure troubleshooting flowchart
Start
Yes No
No No
Yes No
Yes No
Contact Huawei
engineers.
Procedure
l Cause 1: The protection group parameters are incorrectly configured.
1. In the NE Explorer, click the NE and choose Configuration > Port Protection from
the Function Tree.
2. In the Port Protection window, click Query to query all protection group parameter
values. In the Operation Result dialog box, click Close.
3. Compare these values against those specified in "Dependency and Limitation" and
"Parameters" to ensure that the parameters are configured properly.
l Cause 2: Locked or forced switching commands have been manually issued.
1. In the Port Protection window, click Query to check the Switching Status parameter
of the protection group.
2. If Switching Status is Locked or Force to Working Channel, click Function and
choose Clear to clear the locked or forced switching commands.
l Cause 3: The optical fibers are improperly connected to the corresponding boards.
1. Ensure that the optical fibers are properly connected to the optical ports for the working
and protection channels.
l Cause 4: The protection channel is working abnormally.
1. In the Port Protection window, click Query to query the Protection Channel
Status parameter of the protection group.
2. If Protection Channel Status is SD or SF, check whether the alarms listed in the
trigger conditions are being reported on the corresponding boards on the protection
channel, clear the alarms.
NOTE
If Protection Channel Status is SD or SF, faults have occurred on both the working and protection
channels.
----End
Solution
If the preceding measures are not successful in switching the services to the protection channel,
and Protection Channel Status is displayed as Normal, you need to perform a forced switching
to switch the services to the protection channel, which eventually restores the services.
For more information regarding how to handle faults that occur in the working channel, clean
the Alarms and Performance Events in the NE. If necessary, contact Huawei engineers to resolve
the problem.
Fault Description
In normal operation of a network, the working channel has generated triggering conditions for
protection switching, but services cannot be automatically switched to the protection channel.
In this case, the services are interrupted.
Possible Causes
The possible causes of the automatic switching failure are as follows:
Figure 12-4 Fault handling flow when automatic switching by SW SNCP protection fails
Start
No
No
Yes No
No No
End
Handle the faults on the
working channel.
Contact Huawei
engineers.
Procedure
l Cause 1: The parameters of the protection group are improperly configured.
1. Select the NE in the NE Explorer, and choose Configuration > WDM Service
Management from the Function Tree. Click the SNCP Service Control tab.
2. Click Query in the SNCP Service Control interface, to query the values of all
parameters of the protection group. Click Close in the Operation Result dialog box.
3. Compare the values against the principles and requirements specified in Dependency
and Limitation and Parameters in the Feature Description to ensure that the parameters
are configured properly.
l Cause 2: Locked or forced switching is not released.
1. In the SNCP Service Control interface, click Query to check the Status parameter
of the protection group.
2. If Status is displayed as Lockout or Forced (from protection to working) switching
state, select the protection group and right-click it. Choose Clear from the shortcut
menu to clear the locked or forced switching configuration.
l Cause 3: The electrical cross-connections are configured incorrectly.
1. Select the NE in the NE Explorer, and choose Configuration > WDM Service
Management from the Function Tree.
2. Click Query on the bottom of the WDM Cross-Connection Configuration interface
to check the created cross-connections.
3. According to the network design, ensure the source and sink ports of the dual fed
cross-connections that are configured on the source end are correct.
l Cause 4: The protection channel works abnormally.
1. In the SNCP Service Control interface, click Query to query the Channel Status
parameter of the protection cross-connection.
2. If Channel Status is displayed as SD, SF, check whether the following alarms of
trigger conditions exist on the relevant boards on the protection channel. For the
method of handling the alarms, see the Alarms and Performance Events Reference.
NOTE
If the Channel Status parameter of the protection cross-connection is displayed as SD, SF, it
is indicated that faults have occurred in both the working and protection channels.
----End
Solution
If the preceding measures are not successful in switching the services to the protection channel,
and Protection Channel Status is displayed as Normal, you need to perform a forced switching
to switch the services to the protection channel, which eventually restores the services.
For more information regarding how to handle faults that occur in the working channel, clean
the Alarms and Performance Events in the NE. If necessary, contact Huawei engineers to resolve
the problem.
Symptom
In normal operation of a network, the working channel has generated triggering conditions for
protection switching, but services cannot be automatically switched to the protection channel.
In this case, the services are interrupted.
Possible Causes
l Cause 1: The SNCP Type is inappropriate for the service scenario.
l Cause 2: The protection group parameters are incorrectly configured.
l Cause 3: Locked or forced switching commands have been manually issued.
l Cause 4: The electrical cross-connections are incorrectly configured.
l Cause 5: The protection channel is working abnormally.
l Cause 6: The line board is faulty.
Troubleshooting Workflow
The following figure illustrates the ODUk SNCP protection automatic switching failure
troubleshooting process.
Figure 12-5 ODUk SNCP protection automatic switching failure troubleshooting flowchart
Start
Are the
Do the parameter values No Yes
Modify the parameters. services switched
comply with the to the protection
specifications?
channel?
No
Yes
Is Are the
a locked or forced Yes Clear the locked or Yes
services switched
switching command forced switching
manually issued? to the protection
command.
channel?
No
No
Yes No
No No
Contact Huawei
engineers.
Procedure
l Cause 1: The SNCP Type is inappropriate for the service scenario.
1. In the NE Explorer, click the NE and choose Configuration > WDM Service
Management from the Function Tree. Then click the SNCP Service Control tab.
2. In the SNCP Service Control window, click Query to query the Protection Type of
the protection group. In the Operation Result dialog box, click Close.
----End
Solution
If the preceding measures are not successful in switching the services to the protection channel,
and Protection Channel Status is displayed as Normal, you need to perform a forced switching
to switch the services to the protection channel, which eventually restores the services.
Before the working channel is restored from a failure (for example, the faulty board has not yet
been replaced), services on the protection channel may run without protection for a long time.
To avoid a second failure on the network, you are advised to convert an idle and unprotected
service to the SNCP service for protection. For details, refer to "Converting a Normal WDM
Service to an SNCP Service." After the working channel is restored, convert the SNCP service
back to an unprotected service. For details, refer to "Converting an SNCP Service to a Normal
WDM Service."
For more information regarding how to handle faults that occur in the working channel, clean
the Alarms and Performance Events in the NE. If necessary, contact Huawei engineers to resolve
the problem.
Fault Description
In normal operation of a network, the working channel has generated triggering conditions for
protection switching, but services cannot be automatically switched to the protection channel.
In this case, the services are interrupted.
L4G, TBE, LEM24, LEX4 board of OptiX OSN 6800/OptiX OSN 3800 may report
VLAN_SNCP_PS.
Possible Causes
The possible causes of the automatic switching failure are as follows:
l The parameters of the protection group are improperly configured.
l Locked or forced switching is not released.
l The electrical cross-connections are configured incorrectly.
l The protection channel works abnormally.
Figure 12-6 Fault handling flow when automatic switching by VLAN SNCP protection fails
Start
Yes No
Yes No
Contact Huawei
engineers.
Procedure
l Cause 1: The parameters of the protection group are improperly configured.
1. Select the desired Ethernet board in the NE Explorer, and choose Configuration >
Ethernet Service > Ethernet Line Service from the Function Tree.
2. Click Query in the VLAN SNCP Service Management tab to query the values of
all parameters of the protection group.
3. Compare the values against the principles and requirements specified in Dependency
and Limitation and Parameters in the Feature Description to ensure that the parameters
are configured properly.
l Cause 2: Locked or forced switching is not released.
1. In the VLAN SNCP Service Management interface, click Set/Query Switching and
select Query Switching Status to query the Channel Status parameter of the
protection group.
2. If Channel Status for the protection channel is displayed as Lockout State or Forced
(Standby to Active) Switching State, right-click the protection group, and then
choose Clear from the shortcut menu, to clear the locked or forced switching
configuration.
l Cause 3: The electrical cross-connections are configured incorrectly.
1. Select the NE in the NE Explorer, and choose Configuration > WDM Service
Management from the Function Tree.
2. Click Query on the bottom of the WDM Cross-Connection Configuration interface
to check the created cross-connections.
3. According to the network design, ensure the source and sink ports of the dual fed
cross-connections that are configured on the source end are correct.
l Cause 4: The protection channel works abnormally.
1. In the VLAN SNCP Service Management interface, click Set/Query Switching and
select Query Switching Status to check the Link Status parameter of the protection
service.
2. If Link Status is displayed as SF, check whether the following alarms exist on the
relevant boards on the protection channel: R_LOS, R_LOF, R_OOF, OTU5G_LOF,
OTU5G_AIS, OTU5G_LOM. For the method of handling the alarms, see the Alarms
and Performance Events Reference.
NOTE
If the Link Status parameter of the protection service is displayed as SF, it is indicated that
faults have occurred in both the working and protection channels.
----End
Solution
If the preceding measures are not successful in switching the services to the protection channel,
and Protection Channel Status is displayed as Normal, you need to perform a forced switching
to switch the services to the protection channel, which eventually restores the services.
For more information regarding how to handle faults that occur in the working channel, clean
the Alarms and Performance Events in the NE. If necessary, contact Huawei engineers to resolve
the problem.
Symptom
In normal operation of a network, the working channel has generated triggering conditions for
protection switching, but services cannot be automatically switched to the protection channel.
In this case, the services are interrupted.
Possible Causes
l Cause 1: The SNCP Type is inappropriate for the service scenario.
l Cause 2: The protection group parameters are incorrectly configured.
l Cause 3: Locked or forced switching commands have been manually issued.
l Cause 4: The electrical cross-connections are incorrectly configured.
l Cause 5: The protection channel is working abnormally.
Troubleshooting Workflow
The following figure illustrates the tributary SNCP protection automatic switching failure
troubleshooting process.
Figure 12-7 Tributary SNCP protection automatic switching failure troubleshooting flowchart
Start
Do Are the
the parameter No Yes
Modify the parameters. services switched
values comply with the
specifications? to the protection
channel?
No
Yes
No
No
Yes No
Yes No
Contact Huawei
engineers.
Procedure
l Cause 1: The SNCP Type is inappropriate for the service scenario.
1. In the NE Explorer, click the NE and choose Configuration > WDM Service
Management from the Function Tree. Then click the SNCP Service Control tab.
2. In the SNCP Service Control window, click Query to query the Protection Type of
the protection group.
----End
Solution
If the preceding measures are not successful in switching the services to the protection channel,
and Protection Channel Status is displayed as Normal, you need to perform a forced switching
to switch the services to the protection channel, which eventually restores the services.
For more information regarding how to handle faults that occur in the working channel, clean
the Alarms and Performance Events in the NE. If necessary, contact Huawei engineers to resolve
the problem.
NOTE
If the section overhead status changes, protection switching may be triggered.
Fault Description
In normal operation of a network, the working channel has generated triggering conditions for
protection switching, but services cannot be automatically switched to the protection channel.
In this case, the services are interrupted.
Possible Causes
The possible causes of the automatic switching failure are as follows:
l The parameters of the protection group are improperly configured.
l There is alarm to be solved.
Figure 12-8 Fault handling flow when automatic switching by DBPS protection fails
Start
Yes No
Yes No
End
Contact Huawei
engineers.
Procedure
l Cause 1: The parameters of the protection group are improperly configured.
1. In the NE Explorer, select the NE and then choose Configuration > Distributed
Board-Level Protection from the Function Tree.
2. In the right pane, click Query to obtain the current information about the created
DBPS.
3. Compare the values against the principles and requirements specified in Dependencies
and Limitations and Parameters in the Feature Description to check and ensure that
the parameters are configured properly.
l Cause 2: There is alarm to be solved.
1. Check if there is the DBPS_ABNORMAL, LINK_ERR, R_LOS, R_LOF,
HARD_BAD, PORT_MODULE_OFFLINE alarms.
2. Solve the alarms. For details, refer to the Alarms and Performance Events
Reference.
----End
Solution
If the preceding measures are not successful in switching the services to the protection channel,
and Protection Channel Status is displayed as Normal, you need to perform a forced switching
to switch the services to the protection channel, which eventually restores the services.
For more information regarding how to handle faults that occur in the working channel, clean
the Alarms and Performance Events in the NE. If necessary, contact Huawei engineers to resolve
the problem.
Background Information
The OptiX OSN 3800 supports 1+1 protection for only SCC boards.
The OptiX OSN 6800 supports 1+1 protection for only cross-connect boards and SCC boards.
Symptom
During normal network operations, conditions that trigger protection switching occur on the
equipment, but services do not automatically switch to the standby board.
l If 1+1 protection for the system control board fails to be automatically switched, the
U2000 cannot reach the equipment. The services on the equipment, however, are
unaffected.
l If the protection switching of the cross-connect boards or clock boards fails, the services
on the equipment are interrupted.
Possible Causes
l Cause 1: No standby board is configured.
l Cause 2: The standby board is faulty.
Procedure
l Cause 1: No standby board is configured.
1. Configure a standby board.
l Cause 2: The standby board is faulty.
1. Replace the faulty standby board.
----End
Solution
After configuring a standby board or replacing the faulty standby board, replace the faulty active
board to restore services.
NOTE
To replace a board with active/standby protection, use a board of the same type.
Fault Description
When the network operates normally, the working channel has generated the conditions of
triggering the protection switching, but the services cannot be automatically switched to the
protection channel. In this case, the services are interrupted.
If there is more than one RPL owner node on the ring, the system reports the
MULTI_RPL_OWNER alarm.
Possible Causes
l Cause 1: The configuration of the protection group parameters is incorrect.
l Cause 2: The protection channel works abnormally.
l Cause 3: The fiber connections on the board are incorrect.
l Cause 4: The hardware is faulty.
Procedure
l Cause 1: The configuration of the protection group parameters is incorrect. As a result, the
protection switching fails.
1. Review the configuration of the protection group parameters on the U2000.
If... Then...
If... Then...
The fiber connections are incorrect Reconnect the fibers. Determine if the
services recover. If the services do not
recover, determine if the fault is due to
other causes.
The fiber connections are correct Determine if the fault is due to other
causes.
2. Determine if the services are restored. If the services are not restored, determine if the
fault is due to other causes.
----End
Solution
If the preceding measures are not successful in switching the services to the working channel,
contact Huawei engineers to resolve the problem.
Fault Description
In normal operation of a network, the working channel has generated triggering conditions for
protection switching, but services cannot be automatically switched to the protection channel.
In this case, the services are interrupted.
Possible Causes
The possible causes of the automatic switching failure are as follows:
l The related boards work abnormally.
l The parameters of the protection group are improperly configured.
l Locked switching or forced switching is not released.
l The electrical cross-connections are configured incorrectly.
l The protection channel works abnormally.
Figure 12-9 Fault handling flow when automatic switching by MS SNCP protection fails
Start
No
Yes
No
No
Modify the
No Are services Yes
Are the configurations of configurations of switched to the
electrical cross-connections electrical cross-
correct? protection channel?
connections.
Yes No
No
Does the protection Handle the faults on the Are services Yes
channel work protection channel. switched to the
normally? protection channel?
Yes No
Contact Huawei
engineers.
Procedure
l Cause 1: The related boards work abnormally.
1. Allocate the board which works abnormally.
2. Replace the board. For details refer to Parts Replacement.
If the Channel Status parameter of the protection cross-connection is displayed as SD, SF, it
is indicated that faults have occurred in both the working and protection channels.
----End
Solution
If the preceding measures are not successful in switching the services to the protection channel,
and Protection Channel Status is displayed as Normal, you need to perform a forced switching
to switch the services to the protection channel, which eventually restores the services.
For more information regarding how to handle faults that occur in the working channel, clean
the Alarms and Performance Events in the NE. If necessary, contact Huawei engineers to resolve
the problem.
Fault Description
In normal operation of a network, the working channel has generated triggering conditions for
protection switching, but services cannot be automatically switched to the protection channel.
In this case, the services are interrupted.
Possible Causes
The possible causes of the automatic switching failure are as follows:
l The parameters of the protection group are improperly configured.
l There is alarm which needs to be cleared.
Figure 12-10 Fault handling flow when automatic switching by DLAG protection fails
Start
Yes No
Yes No
End
Contact Huawei
engineers.
Procedure
l Cause 1: The parameters of the protection group are improperly configured.
1. In the NE Explorer, select the NE and then choose Configuration > Ethernet
Distributed Link Aggregation Management from the Function Tree.
2. In the right pane, click Query to obtain the current information about the created
DLAG.
3. For DLAG Protection(EOO/EOW), compare the values against the principles and
requirements specified in Parameters in the OptiX OSN 8800 Feature Description to
ensure that the parameters are configured properly.
4. For DLAG Protection(EOS), compare the values against the principles and
requirements specified in Dependency and Limitation and Parameters in the OptiX
OSN 8800 Feature Description to check and ensure that the parameters are configured
properly.
5. For DLAG Protection(EOS), compare the values against the principles and
requirements specified in Dependency and Limitation and Parameters in the OptiX
OSN 6800/3800 Feature Description to ensure that the parameters are configured
properly.
l Cause 2: There is alarm which needs to be cleared.
1. On the U2000, determine if there is DLAG_PROTECT_FAIL or other alarms on the
protection channel.
2. Clear the alarms. For more information, refer to the Alarms and Performance Events
Reference.
----End
Solution
If the preceding measures are not successful in switching the services to the protection channel,
and Protection Channel Status is displayed as Normal, you need to perform a forced switching
to switch the services to the protection channel, which eventually restores the services.
For more information regarding how to handle faults that occur in the working channel, clean
the Alarms and Performance Events in the NE. If necessary, contact Huawei engineers to resolve
the problem.
Fault Description
In normal operation of a network, the working channel has generated triggering conditions for
protection switching, but services cannot be automatically switched to the protection channel.
In this case, the services are interrupted.
Possible Causes
The possible causes of the automatic switching failure are as follows:
Figure 12-11 Fault handling flow when automatic switching by board-level protection fails
Start
End
Perform forced
switching
Contact Huawei
engineers
Procedure
l Cause 1: The parameters of the protection group are improperly configured.
1. Select the NE in the NE Explorer, and choose Configuration > Board-Level
Protection from the Function Tree.
2. Click Query in the interface, to query the values of all parameters of the protection
group.
3. Compare the values against the principles and requirements specified in Dependency
and Limitation and Parameters in the OptiX OSN 6800/3800 Feature Description to
ensure that the parameters are configured properly.
l Cause 2: Forced switching is not released.
1. In the window, click Query to check the Switching Status parameter of the protection
group.
2. If Switching Status is displayed as Forced Switch Request, right-click the protection
group, and then choose Clear from the shortcut menu to clear the locked switching
or forced switching configuration.
l Cause 3: The protection board works abnormally.
1. In the window, click Query to check the Protection Board Availability parameter
of the protection group.
2. If Protection Board Availability is displayed as Unusable, determine if the following
alarms exist on the relevant boards on the channels: HARD_BAD, BD_STATUS,
PORT_MODULE_OFFLINE, R_LOS, R_LOF. For more information, refer to the
Alarms and Performance Events Reference.
----End
Solution
If the problem persists after the above steps are performed and Protection Board
Availability is Available, choose Forced Switching to Protection to perform a forced
switching on the protection group. Then, the services are switched to the protection board and
the services recover.
For more information regarding how to handle faults that have occurred on the working board,
refer to the Alarms and Performance Events Reference. If necessary, contact Huawei engineers
to resolve the problems.
Fault Description
In normal operation of a network, the working channel has generated triggering conditions for
protection switching, but services cannot be automatically switched to the protection channel.
In this case, the services are interrupted.
The NEs at both ends of the channel where the switching occurs may report the ODUKSP_PS
alarm or ODUKSP_STA_INDI alarm.
Possible Causes
The possible causes of the automatic switching failure are as follows:
Figure 12-12 Fault handling flow when automatic switching by ODUk SPRing protection fails
Start
No No
Do Are the
the parameter No Yes
Modify the services switched to
values comply with parameters. the protection
the specifications? channel?
Yes No
No No
No
Yes
Yes No
End
Perform manual
Handle the faults on the Contact Huawei
switching or forced
switching. working channel. engineers.
Procedure
l Cause 1: The protection channel works abnormally.
1. In the Channel Mapping Relation interface, click Query to query the Channel
Status parameter of the protection channel.
2. If Channel Status is displayed as SD, SF, check whether the following alarms exist
on the relevant boards on the protection channel: R_LOS, R_LOC, HARD_BAD,
OTUk_LOF, OTUk_LOM, OTUk_AIS, ODUk_LOFLOM, ODUk_TCMn_OCI,
ODUk_TCMn_LCK, ODUk_TCMn_AIS, OTUk_EXC, ODUk_TCM6_DEG,
ODUk_TCM6_EXC. For the method of handling the alarms, see the Alarms and
Performance Events Reference.
l Cause 2: Locked switching or forced switching is not released.
1. In the ODUk SPRing interface, click Query to query the Switching Request-East
or Switching Request-West parameter of the protection group.
2. If Switching Request-West or Switching Request-East is displayed as Locked or
Forced Switching - Ring, right-click the protection group, and then choose Clear
from the shortcut menu, to clear the locked or forced switching configuration.
l Cause 3: The parameters of the protection group are improperly configured.
1. Select the NE in the NE Explorer, and choose Configuration > ODUk SPRing from
the Function Tree.
2. Click Query in the interface, to query the values of all parameters of the protection
group.
3. Check whether Span ID has been set correctly.
For example, if services are added/dropped at sites A and B and site B is east to site
A, then the span ID for the west working unit of site B must be the same as the span
ID for the east working unit of site A.
4. Compare the values against the principles and requirements specified in Dependencies
and Limitations and Parameters in the Feature Description to check and ensure that
the other parameters are configured properly.
l Cause 4: SD Switching has been set to Disabled.
1. Click Query to query the SD Switching parameter of the protection group.
----End
Solution
If the preceding measures are not successful in switching the services to the protection channel,
contact Huawei engineers to resolve the problem.
Fault Description
In normal operation of a network, the working channel has generated triggering conditions for
protection switching, but services cannot be automatically switched to the protection channel.
In this case, the services are interrupted.
Possible Causes
The possible causes of the automatic switching failure are as follows:
l The parameters of the protection group are improperly configured.
l The protection protocol is not started.
l Locked switching or forced switching is not released.
l The fibers are improperly connected to the corresponding boards.
l The protection channel works abnormally.
Figure 12-13 Fault handling flow when automatic switching by OWSP protection fails
Start
Do Are the
No Yes
the parameter Modify the services switched to
values comply with the parameters. the protection
specifications?
channel?
Yes No
Yes No
No No
Yes No
Yes No
End
Perform a manual
or forced switching.
Contact Huawei
engineers.
Procedure
l Cause 1: The parameters of the protection group are improperly configured.
1. Select the NE in the NE Explorer, and choose Configuration > Optical Wavelength
Shared Protection from the Function Tree. Click OWSP Protection tab.
2. Click Query in the OWSP Protection interface, to query the values of all parameters
of the protection group.
3. Compare the values against the principles and requirements specified in Dependencies
and Limitations and Parameters in the Feature Description to check and ensure that
the parameters are configured properly.
l Cause 2: The protection protocol is not started.
1. In the OWSP Protection interface, select the protection group and click Start
Protocol.
l Cause 3: Locked switching or forced switching is not released.
1. In the OWSP Protection interface, click Query to query the Switching Request-
East or Switching Request-West parameter of the protection group.
2. If Switching Request-East or Switching Request-West is displayed as Locked or
Forced Switching - Ring, right-click the protection group, and then choose Clear
from the shortcut menu, to clear the locked or forced switching configuration.
l Cause 4: The fibers are improperly connected to the corresponding boards.
1. Refer to the protection principle in Principles. Check if the fiber connections are
correct on the optical interfaces of the east and west working channels and protection
channels of the DCP boards at the stations on the two ends.
l Cause 5: The protection channel works abnormally.
1. In the OWSP Protection interface, click Query to query the Channel Status
parameter of the protection channel.
2. If Channel Status is displayed as SD, SF, check whether the following alarms exist
on the relevant boards on the protection channel: MUT_LOS, R_LOS, R_LOC,
R_LOF, OTUk_LOF, OTUk_LOM, OTUk_AIS, ODUk_PM_AIS,
ODUk_PM_OCI, ODUk_PM_LCK, OTUk_TIM, ODUk_PM_TIM,
ODUk_LOFLOM, IN_PWR_HIGH, IN_PWR_LOW, B1_EXC, ODUk_PM_DEG,
ODUk_PM_EXC, OTUk_DEG, OTUk_EXC. For the method of handling the alarms,
see the Alarms and Performance Events Reference.
NOTE
If the Channel Status parameter of the protection channel is displayed as SD, SF, it is indicated
that faults have occurred in both the working and protection channels.
----End
Solution
If the preceding measures are not successful in switching the services to the protection channel,
and Protection Channel Status is displayed as Normal, you need to perform a forced switching
to switch the services to the protection channel, which eventually restores the services.
For more information regarding how to handle faults that occur in the working channel, clean
the Alarms and Performance Events in the NE. If necessary, contact Huawei engineers to resolve
the problem.
Background Information
The OptiX OSN 8800 and 6800 support protection for inter-subrack communication.
Symptom
During normal network operations, the communication between master and slave subracks is
interrupted, and services do not automatically switch to the standby route.
System Impact
When inter-subrack communication protection fails to automatically switch, the U2000 cannot
reach the slave subracks. The services on the equipment, however, are unaffected.
Possible Causes
l Cause 1: The protection network cable is loosely connected or is malfunctioning.
Procedure
l Cause 1: The protection network cable is loosely connected or is malfunctioning.
1. Securely attach the protection network cable.
2. Replace the protection network cable.
----End
Fault Description
When the network operates normally, the working channel has generated the conditions of
triggering the protection switching, but the services cannot be automatically switched to the
protection channel. In this case, the services are interrupted.
Possible Causes
l Cause 1: The protocol fails to start and the protection switching fails.
l Cause 2: The protection switching protocol is abnormal. The protocol status is not idle.
l Cause 3: The protocol is stopped manually or a forced switching is performed manually.
As a result, the protection switching fails.
l Cause 4: The power supply of the device is faulty.
l Cause 5: The hardware is faulty.
Procedure
Step 1 Cause 1: The protocol fails to start and the protection switching fails.
1. In the NE Explorer, click the desired NE and choose Configuration > Linear MS from
the Function Tree.
2. In Linear MS interface, click Stop Protocol. Click Close in the Operation Result dialog
box.
3. Click Start Protocol to re-enable the protocol. Click Close in the Operation Result dialog
box.
4. Check whether the hardware is faulty. If the hardware is faulty, go to cause 5.
Step 2 Cause 2: The protection switching protocol is abnormal. The protocol status is not idle.
1. Check the settings of linear MS parameters on the U2000.
If... Then...
The settings of the linear MS parameters Set the parameters of the multiplex section
are incorrect to proper values. Check whether services
recover. If the services do not recover,
check whether the fault is due to other
causes.
If... Then...
The optional switching conditions are not Configure B2_SD as an optional switching
configured. conditions. For the details, see the Feature
Description.
Check whether the following alarms exist For the method of handling the alarms, see
on the relevant boards: the Alarms and Performance Events
R_LOS, R_L0F, R_LOC, MS_AIS, Reference.
B2_EXC, B2_SD
4. Re-enable the protocol, check whether services recover. If the services do not recover,
proceed to the next substep.
5. Check whether the hardware is faulty. If the hardware is faulty, go to cause 5.
Step 3 Cause 3: The protocol is stopped manually or a forced switching is performed manually. As a
result, the protection switching fails.
1. Check the protocol controller state on the U2000.
If... Then...
The state of the protocol controller is Restart the protocol controller. Check
Protocol Stopped whether services recover. If the services do
not recover, check whether the fault is due
to other causes.
The switching state of the protection group Clear protection switching. Check whether
is Lockout or Switching services recover. If the services do not
recover, check whether the fault is due to
other causes.
The switching state of the protection group Check whether the fault is due to other
is Normal causes.
Step 4 Cause 4: The power supply of the device is faulty. As a result, the protection switching fails.
1. Refer to 5.5 Service Interruptions Caused by Power Supply Failures to rectify the fault.
Step 5 Cause 5: The hardware is faulty. As a result, the protection switching fails.
1. See the Parts Replacement to replace the faulty board.
----End
Related Information
None
Fault Description
When the network operates normally, the working channel generates the triggering conditions
for protection switching, but the services cannot be automatically switched to the protection
channel. In this case, the services are interrupted.
Possible Causes
l Cause 1: The protocol fails to start and the protection switching fails.
l Cause 2: The protection switching protocol is abnormal. The protocol status is not idle.
l Cause 3: The protocol is stopped manually or the forced switching is performed manually.
As a result, the protection switching fails.
l Cause 4: The power supply of the device is faulty.
l Cause 5: The hardware is faulty.
Procedure
Step 1 Cause 1: The protocol fails to start and the protection switching fails.
1. In the NE Explorer, click the desired NE and choose Configuration > Ring MS from the
Function Tree.
2. In Ring MS interface, click Stop Protocol. Click Close in the Operation Result dialog
box.
3. Click Start Protocol to re-enable the protocol. Click Close in the Operation Result dialog
box.
4. Re-enable the protocol, check whether services recover. If the services do not recover,
proceed to the next substep.
5. Check whether the hardware is faulty. If the hardware is faulty, go to cause 5.
Step 2 Cause 2: The protection switching protocol is abnormal. The protocol status is not idle.
1. Check the settings of the ring MS parameters on the U2000.
If... Then...
The settings of the ring MS parameters are Set the parameters of the ring MS to proper
incorrect values. Check whether services recover. If
the services do not recover, check whether
the fault is due to other causes.
If... Then...
The optional switching conditions are not Configure B2_SD as an optional switching
configured. conditions. For the details, see the Feature
Description.
If... Then...
Check whether the following alarms exist For the method of handling the alarms, see
on the relevant boards: the Alarms and Performance Events
R_LOS, R_L0F, R_LOC, MS_AIS, Reference.
B2_EXC, B2_SD
4. Re-enable the protocol, check whether services recover. If the services do not recover,
proceed to the next substep.
5. Check whether the hardware is faulty. If the hardware is faulty, go to cause 5.
Step 3 Cause 3: The protocol is stopped manually or the forced switching is performed manually. As
a result, the protection switching fails.
1. Check the protocol controller state on the U2000.
If... Then...
The state of the protocol controller is Restart the protocol controller. Check
Protocol Stopped whether services recover. If the services do
not recover, check whether the fault is due
to other causes.
If... Then...
The switching state of the protection group Clear protection switching. Check whether
is Lockout or Forced Switching services recover. If the services do not
recover, check whether the fault is due to
other causes.
The switching state of the protection group Check whether the fault is due to other
is Normal causes.
Step 4 Cause 4: The power supply of the device is faulty. As a result, the protection switching fails.
1. Refer to 5.5 Service Interruptions Caused by Power Supply Failures to rectify the fault.
Step 5 Cause 5: The hardware is faulty. As a result, the protection switching fails.
1. See the Parts Replacement to replace the faulty board.
----End
Related Information
None
Fault Description
When the network operates normally, the working channel has generated the conditions of
triggering the protection switching, but the services cannot be automatically switched to the
protection channel. In this case, the services are interrupted.
Possible Causes
l Cause 1: The protocol fails to start and the protection switching fails.
l Cause 2: The protection switching protocol is abnormal. The protocol status is not idle.
l Cause 3: The protocol is stopped manually, the forced switching is performed manually.
As a result, the protection switching fails.
l Cause 4: The power supply of the device is faulty.
l Cause 5: The hardware is faulty.
Procedure
Step 1 Cause 1: The protocol fails to start and the protection switching fails.
1. In the NE Explorer, click the desired NE and choose Configuration > Ring MS from the
Function Tree.
2. In Ring MS interface, click Stop Protocol. Click Close in the Operation Result dialog
box.
3. Click Start Protocol to re-enable the protocol. Click Close in the Operation Result dialog
box.
4. Check whether services recover. If the services do not recover, proceed to the next substep.
5. Check whether the hardware is faulty. If the hardware is faulty, go to cause 5.
Step 2 Cause 2: The protection switching protocol is abnormal. The protocol status is not idle.
1. Check the settings of the transoceanic MSP ring parameters on the U2000.
If... Then...
The settings of the transoceanic MSP ring Set the parameters of the multiplex section
parameters are incorrect to proper values.
For the details about the parameters, refer
to Feature Description.
Check whether services recover.
If the services do not recover, check
whether the fault is due to other causes.
NOTE
The optional switching conditions are not Configure B2_SD as an optional switching
configured. conditions. For the details, see the Feature
Description.
Check whether the following alarms exist For the method of handling the alarms, see
on the relevant boards: the Alarms and Performance Events
R_LOS, R_L0F, R_LOC, MS_AIS, Reference.
B2_EXC, B2_SD
4. Re-enable the protocol, check whether services recover. If the services do not recover,
proceed to the next substep.
5. Check whether the hardware is faulty. If the hardware is faulty, go to cause 5.
Step 3 Cause 3: The protocol is stopped manually, the forced switching is performed manually. As a
result, the protection switching fails.
1. Check the protocol controller state on the U2000.
If... Then...
The state of the protocol controller is Restart the protocol controller. Check
Protocol Stopped whether services recover. If the services do
not recover, check whether the fault is due
to other causes.
If... Then...
The switching state of the protection group Clear protection switching. Check whether
is Lockout or Forced Switching services recover. If the services do not
recover, check whether the fault is due to
other causes.
The switching state of the protection group Check whether the fault is due to other
is Normal causes.
Step 4 Cause 4: The power supply of the device is faulty. As a result, the protection switching fails.
1. Refer to 5.5 Service Interruptions Caused by Power Supply Failures to rectify the fault.
Step 5 Cause 5: The hardware is faulty. As a result, the protection switching fails.
1. See the Parts Replacement to replace the faulty board.
----End
Related Information
None
12.20 SNCP
This section describes the handling methods for services with SNCP protection that fail to
automatically switch to the protection channel when the working channel works abnormally.
Fault Description
When the network operates normally, the working channel generates the triggering conditions
for protection switching, but the services cannot be automatically switched to the protection
channel. In this case, the services are interrupted.
Possible Causes
l Cause 1: The configuration of the protection group parameters is incorrect.
l Cause 2: The protection channel works abnormally.
l Cause 3: Locked switching or forced switching is not released.
l Cause 4: The fiber connections on the board are incorrect.
l Cause 5: The hardware is faulty.
Procedure
Step 1 Cause 1: The configuration of the protection group parameters is incorrect. As a result, the
protection switching fails.
1. Check the configuration of the protection group parameters on the U2000.
If... Then...
The configuration of the protection group See the Feature Description to re-
parameters is incorrect configure the protection group parameters.
Check whether services recover. If the
services do not recover, check whether the
fault is due to other causes.
If... Then...
The optional switching conditions are not Configure HP_UNEQ, B3_SD, HP_TIM
configured as the optional switching conditions. For
the details, see the Feature Description.
The optional switching conditions are Check whether the fault is due to other
configured causes.
If... Then...
Check whether the following alarms exist on For the method of handling the alarms, see the
the relevant boards: Alarms and Performance Events Reference.
R_LOS, R_L0F, R_LOC, MS_AIS,
B2_EXC, B2_SD
Step 4 Cause 4: The fiber connections on the board are incorrect. As a result, the protection switching
fails.
1. See the protection principles to check whether the fiber connections at the faulty point are
correct. For details of the protection principles, see the Production Description.
If... Then...
The fiber connections are incorrect Reconnect the fibers. Check whether
services recover. If the services do not
recover, check whether the fault is due to
other causes.
The fiber connections are correct Check whether the fault is due to other
causes.
Step 5 Cause 5: The hardware is faulty. As a result, the protection switching fails.
1. See the Parts Replacement to replace the faulty board.
2. Check whether services recover. If the services do not recover, check whether the fault is
due to other causes.
----End
Related Information
None
12.21 SNCTP
This section describes the handling methods for services with tributary SNCTP protection that
fail to automatically switch to the protection channel when the working channel works
abnormally.
Fault Description
When the network operates normally, the working channel generates the triggering conditions
for protection switching, but the services cannot be automatically switched to the protection
channel. In this case, the services are interrupted.
Possible Causes
l Cause 1: The configuration of the protection group parameters is incorrect.
l Cause 2: The protection channel works abnormally.
l Cause 3: Locked switching or forced switching is not released.
l Cause 4: The fiber connections on the board are incorrect.
l Cause 5: The hardware is faulty.
Procedure
Step 1 Cause 1: The configuration of the protection group parameters is incorrect. As a result, the
protection switching fails.
1. Check the configuration of the protection group parameters on the U2000.
If... Then...
The configuration of the protection group See the Feature Description to re-
parameters is incorrect configure the protection group parameters.
Check whether services recover. If the
services do not recover, check whether the
fault is due to other causes.
If... Then...
The optional switching conditions are not Configure HP_UNEQ, B3_SD, HP_TIM
configured as the optional switching conditions. For
the details, see the Feature Description.
The optional switching conditions are Check whether the fault is due to other
configured causes.
If... Then...
Check whether the following alarms exist on For the method of handling the alarms, see the
the relevant boards: Alarms and Performance Events Reference.
R_LOS, R_L0F, R_LOC, MS_AIS,
B2_EXC, B2_SD
Step 4 Cause 4: The fiber connections on the board are incorrect. As a result, the protection switching
fails.
1. See the protection principles to check whether the fiber connections at the faulty point are
correct. For details of the protection principles, see the Production Description.
If... Then...
The fiber connections are incorrect Reconnect the fibers. Check whether
services recover. If the services do not
recover, check whether the fault is due to
other causes.
The fiber connections are correct Check whether the fault is due to other
causes.
Step 5 Cause 5: The hardware is faulty. As a result, the protection switching fails.
1. See the Parts Replacement to replace the faulty board.
2. Check whether services recover. If the services do not recover, check whether the fault is
due to other causes.
----End
NOTE
The following cases are related to the OptiX WDM product series.
Related Cases:
l MC-A5 The LQG Board Reports the ALM_DATA_RLOS and ALM_DATA_TLOS
Alarms Transiently
l MC-A23 When the OTU Board Accesses Light, the Laser at Output End is Disabled
l MC-A25 The System Indicates that the Channel Number Is Illegal
l MC-A38 The Service Is Interrupted After the Protection Is Triggered
l MC-A41 Unsuccessful 1: N Protection Subnet Search
l MC-A91 Creation of Intra-Board Wavelength Protection on the LDG Fails
l MC-A94 The WXCP Protection Configured for the LOG Board Is Invalid
l MC-A98 The Protection Switching Times Out Severely
l MC-A127 WXCP Protection Cannot Be Implemented Due To Wrong Configuration of the
LQG Boards
l MC-A133 In a WDM System with 1:N Optical Channel Protection, the R_LOF Alarm Is
Reported in the Receive Direction on the Client Side When the Laser on the LWC1 Board
in the Protection Channel Is Open
l MC-A139 Automatic Switching Is Caused by the Incorrect Service Configuration of ODUk
SPRing Protection
l MC-A150 How Signals Are Selected When the DCP Board Is Used
l MC-A151 OWSP Protection Switching Fails Due to Incorrect Connection of Fiber Jumpers
On The DCP Boards, DCP Boards Report The MUT_LOS Alarms And TDG Board Reports
The OPU1_PLM Alarm
l MC-A155 Using the LDMD Board to Configure Intra-Board 1+1 Protection Fails Due to
the System Limit and the System Displays "The optical port number exceeds the limit"
l MC-A157 An OptiX OSN 6800 Network Cannot Be Reverted Back to the Working
Channel After Being Restored from an ODUk SNCP Protection Switching, The Board
Reports The ODU_SNCP_PS Alarm
l MC-A159 Services Are Interrupted After the Protection Hold-Off Time Is Set to 10s During
Configuration of Client-Side 1+1 Protection on the OptiX OSN 6800, The System Reports
The R_LOS Alarm
l MC-A161 The NS2 and TDG Boards Report the BUS_ERR Alarms at the Same Time After
ODUk SNCP Protection Is Configured Because The SCC Board Is Faulty
l MC-A171 How to Configure Two TMX Boards in a Client-Side 1+1 Protection Group
Through an SCS Board
l MC-A176 Loss of the Data About an Inter-Subrack Wavelength Protection Group Results
from a Change of an NE ID
l MC-A177 What Is the Difference Between the EOLP Board and the OLP Board
l MC-A220 SNCP Protection Switching Fails Due to Incorrect Fiber Connections
l MC-A228 Intra-Board 1+1 Protection Switching Fails on the OptiX OSN 1800 After a
Change of the NE ID
l MC-A229 Switching Fails Due to Incorrect SNCP Protection Mode on the WDM
Equipment
l MC-A232 The ODUk SNCP Protection Is Abnormal due to Incorrect Fiber Connections
l MC-A233 The LOM Board Fails to Carry Two FC400 Services due to Port Specification
Limitations When Inter-Subrack 1+1 Protection or Client 1+1 Protection Is Configured
l MC-A252 Switching of ODUk SNCP Protection Fails
This topic describes how to resolve switchover issues when services fail to automatically switch
back to the working channel even after the working channel has resumed normal operation.
For information on the protection schemes supported by the OptiX OSN 8800, OptiX OSN 6800,
and OptiX OSN 3800, see the Feature Description"Feature Overview".
Symptom
In normal operation of a network, services are switched to the protection channel after the
working channel generates the protection triggering conditions. When the working channel
resumes normal operation, the services cannot be automatically switched back to the working
channel.
Possible Causes
l Cause 1: The value of the Revertive Mode parameter of the protection group has been set
to Non-Revertive
l Cause 2: The working channel works abnormally or failure has not been resolved.
l Cause 3: Locked or forced switching commands have been manually issued.
l Cause 4: When the working channel is restored, service are restored to the working channel
after the time specified by WTR Time elapses.
Troubleshooting Workflow
The following figure illustrates the optical line protection automatic reverting failure
troubleshooting process.
Figure 13-1 Optical line protection automatic reverting failure troubleshooting flowchart
Start
Is Are the
No Set Yes
Revertive Mode set to services switched to
Revertive Mode
Revertive? the working
to Revertive. channel ?
Yes No
Are the
No Yes
Does the working Handle the faults on the services switched to
channel work normally? working channel. the working
channel ?
Yes No
No No
Are the
No Wait for a period Yes
Does the time exceed services switched to
longer than the working
WTR Time? WTR Time.
channel?
Yes No
Contact Huawei
End
engineers.
Procedure
l Cause 1: The value of the Revertive Mode parameter of the protection group has been set
to Non-Revertive.
1. In the NE Explorer, click the NE and choose Configuration > Port Protection from
the Function Tree.
2. In the Port Protection window, select one protection group and click Modify to
change Revertive Mode to Revertive.
l Cause 2: The working channel is working abnormally.
1. In the Port Protection window, click Query to query the Working Channel
Status parameter of the protection group.
----End
Solution
If the preceding measures are not successful in switching the services to the working channel,
contact Huawei engineers to resolve the problem.
Symptom
In normal operation of a network, services are switched to the protection channel after the
working channel generates the protection triggering conditions. When the working channel
resumes normal operation, the services cannot be automatically switched back to the working
channel.
When the intra-board 1+1 protection is achieved using the OLP board, the OLP board reports
an INTRA_OTU_PS or INTRA_OTU_STA_INDI alarm.
When the intra-board 1+1 protection is realized by dual fed and selective receiving OTU, the
OTU board may report the INTRA_OTU_PS alarm or INTRA_OTU_STA_INDI alarm.
Possible Causes
l Cause 1: The value of the Revertive Mode parameter of the protection group has been set
to Non-Revertive. In the case of the intra-board 1+1 protection configured using an OLP
or DCP board, when the Detection Channel is configured, the Revertive Mode parameter
must be set to Non-Revertive. This is not considered as one of the possible causes when
the protection cannot be restored automatically.
l Cause 2: The working channel works abnormally or failure has not been resolved.
Troubleshooting Workflow
The following figure illustrates the intra-board 1+1 protection automatic reverting failure
troubleshooting process.
Figure 13-2 Intra-board 1+1 protection automatic reverting failure troubleshooting flowchart
Start
Is Are the
No Set Yes
Revertive Mode set to services switched to
Revertive Mode
Revertive? the working
to Revertive. channel ?
Yes No
Are the
No Yes
Does the working Handle the faults on the services switched to
channel work normally? working channel. the working
channel ?
Yes No
No No
Are the
No Wait for a period Yes
Does the time exceed services switched to
longer than the working
WTR Time? WTR Time.
channel?
Yes No
Contact Huawei
End
engineers.
Procedure
l Cause 1: The value of the Revertive Mode parameter of the protection group has been set
to Non-Revertive.
1. In the NE Explorer, click the NE and choose Configuration > Port Protection from
the Function Tree.
2. In the Port Protection window, select one protection group and click Modify to
change Revertive Mode to Revertive.
NOTE
In the case of intra-board 1+1 protection configured using an OLP or DCP board, when the
Monitoring Channel is configured, Revertive Mode can only be set to Non-Revertive.
l Cause 2: The working channel works abnormally or failure has not been resolved.
1. In the Port Protection interface, click Query to query the Working Channel
Status parameter of the protection group.
2. If Working Channel Status is not Normal, check whether R_LOS, R_LOF, R_LOC,
OTUk_LOF, OTUk_LOM, OTUk_AIS, ODUk_PM_AIS, ODUk_PM_OCI,
ODUk_PM_LCK, ODUk_LOFLOM, B1_EXC, ODUk_PM_DEG,
ODUk_PM_EXC, POWER_DIFF_OVER, IN_PWR_HIGH and IN_PWR_LOW
alarms are being reported on the corresponding boards on the working channel, clear
the alarms.
l Cause 3: Forced switching commands have been manually issued.
1. In the Port Protection interface, click Query to check the Switching Status
parameter of the protection group.
2. If Switching Status is displayed as Lock or Force to Protection Channel, right-click
the protection group, and then choose Clear from the shortcut menu to clear the
switching configuration.
l Cause 4: When the working channel is restored, services are restored to the working channel
after the time specified by WTR Time elapses.
1. In the Port Protection window, click Query to check the WTR Time.
2. If service fails to be restored to the working channel after the time specified by WTR
Time elapses, check whether the working channel is normal.
----End
Solution
If the preceding measures are not successful in switching the services to the working channel,
contact Huawei engineers to resolve the problem.
Symptom
In normal operation of a network, services are switched to the protection channel after the
working channel generates the protection triggering conditions. When the working channel
resumes normal operation, the services cannot be automatically switched back to the working
channel.
Possible Causes
l The value of the Revertive Mode parameter of the protection group has been set to Non-
Revertive.
l The working channel works abnormally or failure has not been resolved.
l Locked or forced switching commands have been manually issued.
l Cause 4: When the working channel is restored, service are restored to the working channel
after the time specified by WTR Time elapses.
Troubleshooting Workflow
The following figure illustrates the client 1+1 protection automatic reverting failure
troubleshooting process.
Figure 13-3 Client 1+1 protection automatic reverting failure troubleshooting flowchart
Start
Is Are the
No Set Yes
Revertive Mode set to services switched to
Revertive Mode
Revertive? the working
to Revertive. channel ?
Yes No
Are the
No Yes
Does the working Handle the faults on the services switched to
channel work normally? working channel. the working
channel ?
Yes No
No No
Are the
No Wait for a period Yes
Does the time exceed services switched to
longer than the working
WTR Time? WTR Time.
channel?
Yes No
Contact Huawei
End
engineers.
Procedure
l Cause 1: The value of the Revertive Mode parameter of the protection group has been set
to Non-Revertive.
1. In the NE Explorer, click the NE and choose Configuration > Port Protection from
the Function Tree.
2. In the Port Protection window, select one protection group and click Modify to
change Revertive Mode to Revertive.
l Cause 2: The working channel works abnormally or failure has not been resolved.
1. In the Port Protection window, click Query to query the Working Channel
Status parameter of the protection group.
2. If Working Channel Status is not Normal, check whether the alarms listed in the
trigger conditions are being reported on the relevant boards on the working channel,
clear the alarms.
l Cause 3: Locked or forced switching commands have been manually issued.
1. In the Port Protection interface, click Query to check the Switching Status
parameter of the protection group.
2. If Switching Status is displayed as Lock or Force to Protection Channel, right-click
the protection group, and then choose Clear from the shortcut menu to clear the
switching configuration.
l Cause 4: When the working channel is restored, services are restored to the working channel
after the time specified by WTR Time elapses.
1. In the Port Protection window, click Query to check the WTR Time.
2. If service fails to be restored to the working channel after the time specified by WTR
Time elapses, check whether the working channel is normal.
----End
Solution
If the preceding measures are not successful in switching the services to the working channel,
contact Huawei engineers to resolve the problem.
Fault Description
In normal operation of a network, services are switched to the protection channel after the
working channel generates the protection triggering conditions. When the working channel
resumes normal operation, the services cannot be automatically switched back to the working
channel.
OTU board may report SW_SNCP_PS alarm or SW_SNCP_STA_INDI alarm.
Possible Causes
The possible causes of the automatic reverting failure are as follows:
l The value of the Revertive Mode parameter of the protection group has been set to Non-
Revertive.
l The working channel works abnormally or failure has not been resolved.
l Locked, or forced switching is not released.
l Cause 4: When the working channel is restored, service are restored to the working channel
after the time specified by WTR Time elapses.
Figure 13-4 Fault handling flow when automatic switching back to the working channel by SW
SNCP protection fails
Start
Is Are the
No Set Yes
Revertive Mode set to services switched to
Revertive Mode
Revertive? the working
to Revertive. channel ?
Yes No
Are the
No Yes
Does the working Handle the faults on the services switched to
channel work normally? working channel. the working
channel ?
Yes No
No No
Are the
No Wait for a period Yes
Does the time exceed services switched to
longer than the working
WTR Time? WTR Time.
channel?
Yes No
Contact Huawei
End
engineers.
Procedure
l Cause 1: The value of the Revertive Mode parameter of the protection group has been set
to Non-Revertive.
1. Select the NE in the NE Explorer, and choose Configuration > WDM Service
Management from the Function Tree. Click the SNCP Service Control tab.
2. On the SNCP Service Control interface, select the desired protection group. Double-
click the Revertive Mode field, and then change Non-Revertive to Revertive.
l Cause 2: The working channel works abnormally or failure has not been resolved.
1. In the SNCP Service Control interface, click Query to query the Channel Status
parameter of the working cross-connection.
2. If Channel Status is not displayed as Normal, check whether the following alarms
of trigger conditions exist on the relevant boards on the working channel. For the
method of handling the alarms, see the Alarms and Performance Events Reference.
l Cause 3: Locked, or forced switching is not released.
1. In the SNCP Service Control interface, click Query to check the Status parameter
of the protection group.
2. If Status is displayed as Lockout or Forced (from working to protection) switching
state, choose the protection group. Click Function and then choose Clear from the
shortcut menu to clear the switching configuration.
l Cause 4: When the working channel is restored, services are restored to the working channel
after the time specified by WTR Time elapses.
1. In the SNCP Service Control interface, click Query to check the WTR Time
(mm:ss).
----End
Solution
If the preceding measures are not successful in switching the services to the working channel,
contact Huawei engineers to resolve the problem.
Symptom
In normal operation of a network, services are switched to the protection channel after the
working channel generates the protection triggering conditions. When the working channel
resumes normal operation, the services cannot be automatically switched back to the working
channel.
Possible Causes
l Cause 1: The SNCP Type is inappropriate for the service scenario.
l Cause 2: The value of the Revertive Mode parameter of the protection group has been set
to Non-Revertive.
l Cause 3: The working channel works abnormally or failure has not been resolved.
l Cause 4: Locked or forced switching commands have been manually issued.
l Cause 4: When the working channel is restored, service are restored to the working channel
after the time specified by WTR Time elapses.
Troubleshooting Workflow
The following figure illustrates the ODUk SNCP protection automatic reverting failure
troubleshooting process.
Figure 13-5 ODUk SNCP protection automatic reverting failure troubleshooting flowchart
Start
No
Yes
No
Yes
No
No
No
No
Yes No
Contact Huawei
engineers. End
Procedure
l Cause 1: The SNCP Type is inappropriate for the service scenario.
1. In the NE Explorer, click the NE and choose Configuration > WDM Service
Management from the Function Tree. Then click the SNCP Service Control tab.
2. In the SNCP Service Control window, click Query to query the Protection Type of
the protection group. In the Operation Result dialog box, click Close.
3. Refer to "Dependency and Limitation" and "Parameters" to check whether the
Protection Type of the protection group is appropriate for the service scenario.
l Cause 2: The value of the Revertive Mode parameter of the protection group has been set
to Non-Revertive.
1. In the SNCP Service Control window, select the desired protection group. Double-
click the Revertive Mode field, and then change Non-Revertive to Revertive.
l Cause 3: The working channel works abnormally or failure has not been resolved.
1. In the SNCP Service Control window, click Query to query the Channel Status
parameter of the working cross-connection.
2. If Channel Status is not Normal, check whether the alarms listed in Trigger
Conditions are being reported on the corresponding boards on the working channel,
clear the alarms.
l Cause 4: Locked or forced switching commands have been manually issued.
1. In the SNCP Service Control interface, click Query to check the Status parameter
of the protection group.
2. If Status is displayed as Lockout or Forced (from working to protection) switching
state, choose the protection group. Click Function and then choose Clear from the
shortcut menu to clear the switching configuration.
l Cause 5: When the working channel is restored, services are restored to the working channel
after the time specified by WTR Time elapses.
1. In the SNCP Service Control window, click Query to check the WTR Time.
----End
Solution
If the preceding measures are not successful in switching the services to the working channel,
contact Huawei engineers to resolve the problem.
Fault Description
In normal operation of a network, services are switched to the protection channel after the
working channel generates the protection triggering conditions. When the working channel
resumes normal operation, the services cannot be automatically switched back to the working
channel.
L4G, TBE, LEM24, LEX4 board of OptiX OSN 6800/OptiX OSN 3800 may report
VLAN_SNCP_PS.
Possible Causes
The possible causes of the automatic reverting failure are as follows:
l The value of the Revertive Mode parameter of the protection group has been set to Non-
Revertive.
l The working channel works abnormally or failure has not been resolved.
l Manual, locked or forced switching is not released.
l Cause 4: When the working channel is restored, service are restored to the working channel
after the time specified by WTR Time elapses.
Figure 13-6 Fault handling flow when automatic switching back to the working channel by
VLAN SNCP protection fails
Start
Is Are the
No Set Yes
Revertive Mode set to services switched to
Revertive Mode
Revertive? the working
to Revertive. channel ?
Yes No
Are the
No Yes
Does the working Handle the faults on the services switched to
channel work normally? working channel. the working
channel ?
Yes No
No No
Are the
No Wait for a period Yes
Does the time exceed services switched to
longer than the working
WTR Time? WTR Time.
channel?
Yes No
Contact Huawei
End
engineers.
Procedure
l Cause 1: The value of the Revertive Mode parameter of the protection group has been set
to Non-Revertive.
1. Select the desired Ethernet board in the NE Explorer, and choose Configuration >
Ethernet Service > Ethernet Line Service from the Function Tree. Click the VLAN
SNCP Service Management tab.
2. On the VLAN SNCP Service Management interface, select the desired protection
group. Double-click the Revertive Mode field, and then change Non-Revertive to
Revertive.
l Cause 2: The working channel works abnormally or failure has not been resolved.
1. In the VLAN SNCP Service Management interface, click Query to check the Link
Status parameter of the working service.
2. If Link Status is not displayed as Normal, check whether the following alarms exist
on the relevant boards on the working channel: R_LOF, R_LOS, R_OOF,
OTUk_LOF, OTUk_LOM, OTUk_AIS. For the method of handling the alarms, see
the Alarms and Performance Events Reference.
l Cause 3: Manual, locked or forced switching is not released.
1. In the VLAN SNCP Service Management interface, click Set/Query Switching and
select Query Switching Status to check the Current Status parameter of the
protection group.
2. If Current Status is displayed as Manual (Active to Standby) Switching State,
Forced (Active to Standby) Switching State or Lockout State, click Set/Query
Switching. Then choose Clear from the shortcut menu, to clear the manual, locked
or forced switching configuration.
l Cause 4: When the working channel is restored, services are restored to the working channel
after the time specified by WTR Time elapses.
1. In the VLAN SNCP Service Management interface, click Query to check the WTR
Time (s).
----End
Solution
If the preceding measures are not successful in switching the services to the working channel,
contact Huawei engineers to resolve the problem.
Symptom
In normal operation of a network, services are switched to the protection channel after the
working channel generates the protection triggering conditions. When the working channel
resumes normal operation, the services cannot be automatically switched back to the working
channel.
Possible Causes
l Cause 1: The SNCP Type is inappropriate for the service scenario.
l Cause 2: The value of the Revertive Mode parameter of the protection group has been set
to Non-Revertive.
l Cause 3: The working channel works abnormally or failure has not been resolved.
l Cause 4: Locked or forced switching commands have been manually issued.
l Cause 4: When the working channel is restored, service are restored to the working channel
after the time specified by WTR Time elapses.
Figure 13-7 Tributary SNCP protection automatic reverting failure troubleshooting flowchart
Start
Is
the SNCP Type No Are the Yes
Change the SNCP
consistent with that required services switched to
in the service Type of the protection
group. the working
scenario? channel?
No
Yes
No
Yes
No
No
No
No
Yes No
Contact Huawei
engineers. End
Procedure
l Cause 1: The SNCP Type is inappropriate for the service scenario.
1. In the NE Explorer, click the NE and choose Configuration > WDM Service
Management from the Function Tree. Then click the SNCP Service Control tab.
2. In the SNCP Service Control window, click Query to query Protection Type of the
protection group. In the Operation Result dialog box, click Close.
3. Refer to "Dependency and Limitation" and "Parameters" to check whether Protection
Type of the protection group is appropriate for the service scenario.
l Cause 2: The value of the Revertive Mode parameter of the protection group has been set
to Non-Revertive.
1. In the SNCP Service Control window, select the desired protection group. Double-
click the Revertive Mode field, and then change Non-Revertive to Revertive.
l Cause 3: The working channel works abnormally or failure has not been resolved.
1. In the SNCP Service Control window, click Query to check the Channel Status
parameter of the working cross-connection.
2. If Channel Status is SD or SF, check whether the alarms listed in Trigger Conditions
are being reported on the corresponding boards on the working channel, clear the
alarms.
l Cause 4: Locked or forced switching commands have been manually issued.
1. In the SNCP Service Control interface, click Query to check the Status parameter
of the protection group.
2. If Status is displayed as Lockout or Forced (from working to protection) switching
state, choose the protection group. Click Function and then choose Clear from the
shortcut menu to clear the switching configuration.
l Cause 5: When the working channel is restored, services are restored to the working channel
after the time specified by WTR Time elapses.
1. In the SNCP Service Control window, click Query to check WTR Time.
----End
Solution
If the preceding measures are not successful in switching the services to the working channel,
contact Huawei engineers to resolve the problem.
Fault Description
In the normal operation of the network, the services are switched to the protection channel after
the working channel generates the protection triggering conditions. When the working channel
resumes the normal operation, the services cannot be automatically switched back to the working
channel.
Possible Causes
The possible causes of the automatic revertive failure are as follows:
l Cause 1: The working channel works abnormally.
l Cause 2: When the working channel is recovered, the working channel recovery time does
not exceed the specified WTR Time (min).
l Cause 3: The control VLAN channel on an ERPS-enabled Ethernet ring is faulty.
Procedure
l Cause 1: The working channel works abnormally.
1. If there are alarms on the working channel, clear the alarms. For the method of handling
the alarms, see Alarms and Events Handling.
l Cause 2: When the working channel is recovered, the working channel recovery time does
not exceed the specified WTR Time (min).
1. Select an Ethernet board in NE Explorer.
2. Choose Configuration > Ethernet Protection > ERPS Management.
3. In the ERPS Management interface, click Query to check WTR Time (min).
l Cause 3: The control VLAN channel on an ERPS-enabled Ethernet ring is faulty.
1. Check whether the Ethernet ring has a control VLAN channel. If no, create a control
VLAN channel for the Ethernet ring.
----End
Solution
If the preceding measures are not successful in switching the services to the working channel,
contact Huawei engineers to resolve the problem.
Fault Description
In normal operation of a network, services are switched to the protection channel after the
working channel generates the protection triggering conditions. When the working channel
resumes normal operation, the services cannot be automatically switched back to the working
channel.
Possible Causes
The possible causes of the automatic reverting failure are as follows:
l The value of the Revertive Mode parameter of the protection group has been set to Non-
Revertive.
l The working channel works abnormally or failure has not been resolved.
l Locked switching or forced switching is not released.
l Cause 4: When the working channel is restored, service are restored to the working channel
after the time specified by WTR Time elapses.
Figure 13-8 Fault handling flow when automatic switching back to the working channel by MS
SNCP protection fails
Start
Yes No
Are the
No Yes
Is Working Channel Handle the faults on the services switched to
Status Normal? working channel. the working
channel?
Yes No
No No
Are the
No Wait for a period Yes
Does the time exceed services switched to
longer than the working
WTR Time? WTR Time. channel?
Yes No
Contact Huawei
End
engineers.
Procedure
l Cause 1: The value of the Revertive Mode parameter of the protection group has been set
to Non-Revertive.
1. Select the NE in the NE Explorer, and choose Configuration > WDM Service
Management from the Function Tree. Click SNCP Service Control tab.
2. On the SNCP Service Control interface, select the desired protection group. Double-
click the Revertive Mode field, and then change Non-Revertive to Revertive.
l Cause 2: The working channel works abnormally or failure has not been resolved.
1. In the SNCP Service Control interface, click Query to query the Channel Status
parameter of the working cross-connection.
2. If Channel Status is displayed as SD, SF check whether the following alarms exist
on the relevant boards: R_LOF, R_LOS, R_LOC, OTUk_LOF, OTUk_LOM,
OTUk_AIS, ODUk_PM_AIS, ODUk_PM_OCI, ODUk_PM_LCK,
ODUk_LOFLOM, REM_SF, B1_EXC, ODUk_PM_EXC, OTUk_EXC, REM_SD.
For the method of handling the alarms, see the Alarms and Performance Events
Reference.
l Cause 3: Locked switching or forced switching is not released.
1. In the SNCP Service Control interface, click Query to query the Status parameter
of the protection group.
2. If Status is displayed as Lockout or Forced (from working to protection) switching
state , click Function, and then choose Clear from the shortcut menu, to clear the
manual, locked or forced switching configuration.
l Cause 4: When the working channel is restored, services are restored to the working channel
after the time specified by WTR Time elapses.
1. In the SNCP Service Control interface, click Query to query the WTR Time
(mm:ss).
----End
Solution
If the preceding measures are not successful in switching the services to the working channel,
contact Huawei engineers to resolve the problem.
Fault Description
In normal operation of a network, services are switched to the protection channel after the
working channel generates the protection triggering conditions. When the working channel
resumes normal operation, the services cannot be automatically switched back to the working
channel.
Possible Causes
The possible causes of the automatic reverting failure are as follows:
l The value of the Revertive Mode parameter of the protection group has been set to Non-
Revertive.
l The working channel works abnormally or failure has not been resolved.
Figure 13-9 Fault handling flow when automatic switching back to the working channel by
DLAG protection fails
Start
Yes No
Yes No
Contact Huawei
engineers. End
Procedure
l Cause 1: The value of the Revertive Mode parameter of the protection group has been set
to Non-Revertive
1. Select the NE in the NE Explorer and then choose Configuration > Ethernet
Distributed Link Aggregation Management.
2. Click Query. Double-click Revertive Mode parameter and choose Revertive from
the drop-down list.
l Cause 2: The working channel works abnormally or failure has not been resolved.
1. Resolve the problems with the working channel. Determine if the following alarms
exist on the relevant boards: HARD_BAD, PORT_MODULE_OFFLINE,
LINK_ERR, R_LOS, R_LOF. For more information regarding how to clear the
alarms, see the Alarms and Performance Events Reference.
----End
Solution
If the preceding measures are not successful in switching the services to the working channel,
contact Huawei engineers to resolve the problem.
Fault Description
In normal operation of a network, services are switched to the protection channel after the
working channel generates the protection triggering conditions. When the working channel
resumes normal operation, the services cannot be automatically switched back to the working
channel.
The NEs at both ends of the channel where the switching occurs may report the ODUKSP_PS
alarm or ODUKSP_STA_INDI alarm.
Possible Causes
The possible causes of the automatic reverting failure are as follows:
l The working channel works abnormally or failure has not been resolved.
l Manual switching, locked switching or forced switching is not released.
l Cause 4: When the working channel is restored, service are restored to the working channel
after the time specified by WTR Time elapses.
Figure 13-10 Fault handling flow when automatic switching back to the working channel by
ODUk SPRing protection fails
Start
Yes No
No No
Yes No
Contact Huawei
End
engineers.
Procedure
l Cause 1: The working channel works abnormally or failure has not been resolved.
1. Select the NE in the NE Explorer, and choose Configuration > ODUk SPRing from
the Function Tree.
2. In the Channel Mapping Relation interface, click Query to query the Channel
Status parameter of the working channel.
3. If Channel Status is displayed as SD, SF, check whether the following alarms exist
on the relevant boards on the working channel: R_LOS, R_LOC, HARD_BAD,
OTUk_LOF, OTUk_LOM, OTUk_AIS, ODUk_LOFLOM, ODUk_TCMn_OCI,
ODUk_TCMn_LCK, ODUk_TCMn_AIS, OTUk_EXC, ODUk_TCM6_DEG,
ODUk_TCM6_EXC. For the method of handling the alarms, see the Alarms and
Performance Events Reference.
l Cause 2: Locked switching or forced switching is not released.
1. In the ODUk SPRing interface, click Query to query the Switching Request-East
or Switching Request-West parameter of the protection group.
----End
Solution
If the preceding measures are not successful in switching the services to the working channel,
contact Huawei engineers to resolve the problem.
Fault Description
In normal operation of a network, services are switched to the protection channel after the
working channel generates the protection triggering conditions. When the working channel
resumes normal operation, the services cannot be automatically switched back to the working
channel.
Possible Causes
The possible causes of the automatic reverting failure are as follows:
l The working channel works abnormally or failure has not been resolved.
l Locked switching or forced switching is not released.
l Cause 4: When the working channel is restored, service are restored to the working channel
after the time specified by WTR Time elapses.
Figure 13-11 Fault handling flow when automatic switching back to the working channel by
OWSP protection fails
Start
No
Yes
No
No
No
Yes
Procedure
l Cause 1: The working channel works abnormally or failure has not been resolved.
1. Select the NE in the NE Explorer, and choose Configuration > Optical Wavelength
Shared Protection from the Function Tree. Click OWSP Protection tab.
2. In the Channel interface, click Query to query the Channel Status parameter of the
working channel.
3. If Channel Status is displayed as SD, SF, check whether the following alarms exist
on the relevant boards on the working channel: MUT_LOS, R_LOS, R_LOC, R_LOF,
OTUk_LOF, OTUk_LOM, OTUk_AIS, ODUk_PM_AIS, ODUk_PM_OCI,
ODUk_PM_LCK, OTUk_TIM, ODUk_PM_TIM, ODUk_LOFLOM,
IN_PWR_HIGH, IN_PWR_LOW, B1_EXC, ODUk_PM_DEG, ODUk_PM_EXC,
OTUk_DEG, OTUk_EXC. For the method of handling the alarms, see the Alarms
and Performance Events Reference.
l Cause 2: Locked switching or forced switching is not released.
1. In the OWSP Protection interface, click Query to query the Switching Request-
East or Switching Request-West parameter of the protection group.
----End
Solution
If the preceding measures are not successful in switching the services to the working channel,
contact Huawei engineers to resolve the problem.
Fault Description
In normal operation of a network, services are switched to the protection channel after the
working channel generates the protection triggering conditions. When the working channel
resumes normal operation, the services cannot be automatically switched back to the working
channel.
Possible Causes
The possible causes of the automatic reverting failure are as follows:
l The value of the Revertive Mode parameter of the protection group has been set to Non-
Revertive.
l The working channel works abnormally or failure has not been resolved.
l Manual switching, locked switching or forced switching is not released.
l When the working channel is recovered, the working channel recovery time does not exceed
the specified WTR Time (s).
Procedure
l Cause 1: The value of the Revertive Mode parameter of the protection group has been set
to Non-Revertive.
1. In the NE Explorer, click the desired NE and choose Configuration > Linear MS
from the Function Tree.
2. In the Linear MS interface, select one protection group and double click Revertive
Mode parameter area. Choose Revertive from drop-down list.
l Cause 2: The working channel works abnormally or failure has not been resolved.
1. Check whether the following alarms exist on the relevant boards on the working
channel: R_LOS, R_LOF, R_LOC, MS_AIS, B2_EXC, B2_SD. For the method of
handling the alarms, see the Alarms and Performance Events Reference.
l Cause 3: Locked switching or forced switching is not released.
1. Click Query in Linear MS interface and choose Query Switching Status.
2. If Switching Status-West is Switching or Lockout, right click the protection group
and choose Clear.
l Cause 4: When the working channel is recovered, the working channel recovery time does
not exceed the specified WTR Time (s).
1. In the Linear MS interface, click Query and choose Query Protection Group to
check WTR Time (s).
----End
Solution
If the preceding measures are not successful in switching the services to the working channel,
contact Huawei engineers to resolve the problem.
Fault Description
In normal operation of a network, services are switched to the protection channel after the
working channel generates the protection triggering conditions. When the working channel
resumes normal operation, the services cannot be automatically switched back to the working
channel.
Possible Causes
The possible causes of the automatic reverting failure are as follows:
l The working channel works abnormally or failure has not been resolved.
l Manual switching, locked switching or forced switching is not released.
l When the working channel is recovered, the working channel recovery time does not exceed
the specified WTR Time (s).
Procedure
l Cause 1: The working channel works abnormally or failure has not been resolved.
1. Check whether the following alarms exist on the relevant boards on the working
channel: R_LOS, R_LOF, R_LOC, MS_AIS, B2_EXC, B2_SD. For the method of
handling the alarms, see the Alarms and Performance Events Reference.
l Cause 2: Locked switching or forced switching is not released.
1. In the NE Explorer, click the desired NE and choose Configuration > Ring MS from
the Function Tree.
2. Click Query in Ring MS interface and choose Query Switching Status.
3. If Switching Status is Forced switching or Lockout, right click the protection group
and choose Clear.
l Cause 3: When the working channel is recovered, the working channel recovery time does
not exceed the specified WTR Time (s).
1. In the Ring MS interface, click Query and choose Query Protection Group to check
WTR Time (s).
----End
Solution
If the preceding measures are not successful in switching the services to the working channel,
contact Huawei engineers to resolve the problem.
Fault Description
In normal operation of a network, services are switched to the protection channel after the
working channel generates the protection triggering conditions. When the working channel
resumes normal operation, the services cannot be automatically switched back to the working
channel.
Possible Causes
The possible causes of the automatic reverting failure are as follows:
l The working channel works abnormally or failure has not been resolved.
l Manual switching, locked switching or forced switching is not released.
l When the working channel is recovered, the working channel recovery time does not exceed
the specified WTR Time (s).
Procedure
l Cause 1: The working channel works abnormally or failure has not been resolved.
1. Check whether the following alarms exist on the relevant boards on the working
channel: R_LOS, R_LOF, R_LOC, MS_AIS, B2_EXC, B2_SD. For the method of
handling the alarms, see the Alarms and Performance Events Reference.
l Cause 2: Manual switching, locked switching or forced switching is not released.
1. In the NE Explorer, click the desired NE and choose Configuration > Ring MS from
the Function Tree.
2. Click Query in Ring MS interface and choose Query Switching Status.
3. If Switching Status is Forced switching or Lockout, right click the protection group
and choose Clear.
l Cause 3: When the working channel is recovered, the working channel recovery time does
not exceed the specified WTR Time (s).
1. In the Ring MS interface, click Query and choose Query Protection Group to check
WTR Time (s).
----End
Solution
If the preceding measures are not successful in switching the services to the working channel,
contact Huawei engineers to resolve the problem.
13.16 SNCP
This section describes the handling methods for services with SNCP protection that cannot
automatically switch back to the working channel even after the working channel resumes
normal operation and the revertive mode is properly configured.
Fault Description
In normal operation of a network, services are switched to the protection channel after the
working channel generates the protection triggering conditions. When the working channel
resumes normal operation, the services cannot be automatically switched back to the working
channel.
Possible Causes
The possible causes of the automatic reverting failure are as follows:
l The value of the Revertive Mode parameter of the protection group has been set to Non-
Revertive.
l The working channel works abnormally or failure has not been resolved.
l Manual switching, locked switching or forced switching is not released.
l When the working channel is recovered, the working channel recovery time does not exceed
the specified WTR Time (s).
Procedure
l Cause 1: The value of the Revertive Mode parameter of the protection group has been set
to Non-Revertive.
1. In the NE Explorer, click the desired NE and choose Configuration > SNCP Service
Control from the Function Tree.
2. In the SNCP Service Control interface, select one protection group and double click
Revertive Mode parameter area. Choose Revertive from drop-down list.
l Cause 2: The working channel works abnormally or failure has not been resolved.
1. Check whether the following alarms exist on the relevant boards on the working
channel: R_LOS, R_LOF, MS_AIS, AU_AIS, AU_LOP, HP_LOM, B2_EXC,
B3_SD, B3_EXC, HP_TIM, HP_UNEQ. For the method of handling the alarms, see
the Alarms and Performance Events Reference.
l Cause 3: Forced switching is not released.
1. Click Query in the SNCP Service Control interface.
2. Click Close in the Operation Result dialog box.
3. If Current Status is Forced switching to Protection, Manual switching to
Protection or Lockout, click Function and choose Clear.
l Cause 4: When the working channel is recovered, the working channel recovery time does
not exceed the specified WTR Time (s).
1. In the SNCP Service Control interface, click Query.
2. Click Close in the Operation Result dialog box.
3. Check WTR Time (s).
----End
Solution
If the preceding measures are not successful in switching the services to the working channel,
contact Huawei engineers to resolve the problem.
13.17 SNCTP
This section describes the handling methods for services with SNCTP protection that cannot
automatically switch back to the working channel even after the working channel resumes
normal operation and the revertive mode is properly configured.
Fault Description
In normal operation of a network, services are switched to the protection channel after the
working channel generates the protection triggering conditions. When the working channel
resumes normal operation, the services cannot be automatically switched back to the working
channel.
Possible Causes
The possible causes of the automatic reverting failure are as follows:
l The value of the Revertive Mode parameter of the protection group has been set to Non-
Revertive.
l The working channel works abnormally or failure has not been resolved.
l Manual switching, locked switching or forced switching is not released.
l When the working channel is recovered, the working channel recovery time does not exceed
the specified WTR Time (s).
Procedure
l Cause 1: The value of the Revertive Mode parameter of the protection group has been set
to Non-Revertive.
1. In the NE Explorer, click the desired NE and choose Configuration > SNCTP from
the Function Tree.
2. In the SNCTP interface, select one protection group and double click Revertive
Mode parameter area. Choose Revertive from drop-down list.
l Cause 2: The working channel works abnormally or failure has not been resolved.
1. Check whether the following alarms exist on the relevant boards on the working
channel: R_LOS, R_LOF, MS_AIS, AU_AIS, AU_LOP, HP_LOM, B2_EXC,
B3_SD, B3_EXC, HP_TIM, HP_UNEQ. For the method of handling the alarms, see
the Alarms and Performance Events Reference.
l Cause 3: Manual switching, locked switching or forced switching is not released.
1. Click Query Switching Status in SNCTP interface.
2. If Switching Status is Forced switching or Lockout, right click the protection group
and choose Clear.
l Cause 4: When the working channel is recovered, the working channel recovery time does
not exceed the specified WTR Time (s).
1. In the SNCTP interface, click Query Protection Group to check WTR Time (s).
----End
Solution
If the preceding measures are not successful in switching the services to the working channel,
contact Huawei engineers to resolve the problem.
NOTE
The following cases are related to the OptiX WDM product series.
Related Cases:
l MC-A5 The LQG Board Reports the ALM_DATA_RLOS and ALM_DATA_TLOS
Alarms Transiently
l MC-A23 When the OTU Board Accesses Light, the Laser at Output End is Disabled
l MC-A25 The System Indicates that the Channel Number Is Illegal
l MC-A38 The Service Is Interrupted After the Protection Is Triggered
l MC-A41 Unsuccessful 1: N Protection Subnet Search
l MC-A91 Creation of Intra-Board Wavelength Protection on the LDG Fails
l MC-A94 The WXCP Protection Configured for the LOG Board Is Invalid
l MC-A98 The Protection Switching Times Out Severely
l MC-A127 WXCP Protection Cannot Be Implemented Due To Wrong Configuration of the
LQG Boards
l MC-A133 In a WDM System with 1:N Optical Channel Protection, the R_LOF Alarm Is
Reported in the Receive Direction on the Client Side When the Laser on the LWC1 Board
in the Protection Channel Is Open
l MC-A139 Automatic Switching Is Caused by the Incorrect Service Configuration of ODUk
SPRing Protection
l MC-A150 How Signals Are Selected When the DCP Board Is Used
l MC-A151 OWSP Protection Switching Fails Due to Incorrect Connection of Fiber Jumpers
On The DCP Boards, DCP Boards Report The MUT_LOS Alarms And TDG Board Reports
The OPU1_PLM Alarm
l MC-A155 Using the LDMD Board to Configure Intra-Board 1+1 Protection Fails Due to
the System Limit and the System Displays "The optical port number exceeds the limit"
l MC-A157 An OptiX OSN 6800 Network Cannot Be Reverted Back to the Working
Channel After Being Restored from an ODUk SNCP Protection Switching, The Board
Reports The ODU_SNCP_PS Alarm
l MC-A159 Services Are Interrupted After the Protection Hold-Off Time Is Set to 10s During
Configuration of Client-Side 1+1 Protection on the OptiX OSN 6800, The System Reports
The R_LOS Alarm
l MC-A161 The NS2 and TDG Boards Report the BUS_ERR Alarms at the Same Time After
ODUk SNCP Protection Is Configured Because The SCC Board Is Faulty
l MC-A171 How to Configure Two TMX Boards in a Client-Side 1+1 Protection Group
Through an SCS Board
l MC-A176 Loss of the Data About an Inter-Subrack Wavelength Protection Group Results
from a Change of an NE ID
l MC-A177 What Is the Difference Between the EOLP Board and the OLP Board
l MC-A220 SNCP Protection Switching Fails Due to Incorrect Fiber Connections
l MC-A228 Intra-Board 1+1 Protection Switching Fails on the OptiX OSN 1800 After a
Change of the NE ID
l MC-A229 Switching Fails Due to Incorrect SNCP Protection Mode on the WDM
Equipment
l MC-A232 The ODUk SNCP Protection Is Abnormal due to Incorrect Fiber Connections
l MC-A233 The LOM Board Fails to Carry Two FC400 Services due to Port Specification
Limitations When Inter-Subrack 1+1 Protection or Client 1+1 Protection Is Configured
l MC-A252 Switching of ODUk SNCP Protection Fails
Symptom
During normal network operations, when a fiber cut occurs, and the intelligent power adjustment
(IPA) function may fail to start. As a result, the OAs at both ends of the optical fiber fail to
automatically shut down the lasers on the output optical ports.
Possible Causes
l Cause 1: The IPA function is disabled.
l Cause 2: The fault is resolved before the time specified by Start Hold-off Time elapses.
l Cause 3: The IPA protection group is configured with auxiliary detection boards. When a
fiber cut occurs, some of the auxiliary detection boards do not report any alarm.
l Cause 4: The IPA protection group is configured with auxiliary detection boards. However,
the auxiliary detection board ports are incorrectly set.
Troubleshooting Workflow
Figure 14-1 illustrates the IPA start failure troubleshooting process.
Start
No Yes
Is IPA Status set to
Set IPA Status to Enabled. Is IPA started?
Enabled?
No
Yes
No
Yes
Yes No
No
Yes
Contact Huawei
engineers.
End
Procedure
l Cause 1: The IPA function is disabled.
1. In the NE Explorer, select the NE and choose Configuration > IPA Management.
Click Query to check the IPA Status of IPA.
2. If IPA Status is set to Disabled, change it to Enabled. Click Apply.
3. Query the IPA Status of the NE at the opposite end. Ensure that the IPA function of
the NEs at the two ends of the line is set to Enabled.
l Cause 2: The fault is resolved before the time specified by Start Hold-off Time elapses.
In case of a fiber cut, the IPA function is enabled only after the start hold-off time expires.
1. In the IPA Management window, click Set Start Delay. In the dialog box that is
displayed, click Query to query the Start Hold-off Time.
2. Check whether the fault is resolved before the time specified by Start Hold-off
Time elapses.
l Cause 3: The IPA protection group is configured with auxiliary detection boards. When a
fiber cut occurs, some of the auxiliary detection boards do not report any alarm.
1. Check whether each auxiliary detection board is reporting IPA-triggering alarms, such
as R_LOS, R_LOF, and OSC_RDI. If some of the auxiliary detection boards are not
reporting such alarms, check whether the fiber connection between the OA board and
the FIU, and that between the FIU and the auxiliary detection board, comply with the
engineering design.
l Cause 4: The IPA protection group is configured with auxiliary detection boards. However,
the auxiliary detection board ports are incorrectly set.
1. Check whether the auxiliary detection board ports are receiving optical signals from
the same direction as the detection board ports. If they are not receiving optical ports,
re-set the auxiliary detection board ports.
----End
Solution
If the problem persists after you take the preceding measures, handle the alarms and faults of
the NE, and contact Huawei technical support for resolving the IPA start failure.
Symptom
During normal network operations, when a fiber cut occurs, the IPA function starts. The lasers
on the output optical ports of the OAs at the upstream station of the optical fiber are automatically
shut down. After the optical fiber is restored, the IPA function does not automatically turn the
lasers on the output optical ports back on. As a result, the output optical power of the OAs is not
restored.
Possible Causes
l Cause 1: When the IPA protection group is not configured with any detection board and
is configured with only auxiliary detection boards, all the auxiliary detection boards are
reporting IPA-triggering alarms.
l Cause 2: One or more of IPA function parameters are incorrectly set.
Troubleshooting Workflow
Figure 14-2 illustrates the IPA end failure troubleshooting process.
Start
No
Yes
No Yes
Do the parameter values Are the services
comply with the Modify the parameters.
restored?
specifications?
No
Yes
Yes
Yes
Contact Huawei
End
engineers.
Procedure
l Cause 1: When the IPA protection group is not configured with any detection board and
is configured with only auxiliary detection boards, all the auxiliary detection boards are
reporting IPA-triggering alarms.
1. Check whether each auxiliary detection board is reporting IPA-triggering alarms, such
as R_LOS, R_LOF, and OSC_RDI. If any of these alarms is being reported, check
the fiber connection of each auxiliary detection board.
2. If the fiber connection of each auxiliary detection board is normal, check whether one
of the auxiliary detection boards is faulty.
l Cause 2: One or more of IPA function parameters are incorrectly set.
1. In the NE Explorer, click the NE and choose Configuration > IPA Management.
2. In the IPA Management window, click Query and check whether the IPA parameters
are correctly set. For more information about how to set each parameter, refer to
"Dependency and Limitation" and "Parameters".
NOTE
l Check whether the Off Period, On Period, and Start Test Duration values of the two
jointly used IPA pairs are the same.
l Check whether the On Period value of the two IPA pairs is the same as the recommended
value calculated based on the number of intermediate stations.
----End
Solution
If the problem persists after you take the preceding measures, disable the IPA function by setting
the IPA Status in the IPA Management window to Disabled. In the WDM interface window
of the corresponding board, confirm that Laser Status of the transmit optical port of Laser
Control Board is set to On.
Check whether the services are restored. If services are not restored, query the alarms of the NE
and clear the alarms and faults of the NE. If the services are still not restored, contact Huawei
technical support for resolving the IPA end failure.
Symptom
During normal network operations, when a fiber cut occurs, the IPA function fails to start. As a
result, the OAs at both ends of the optical fiber fail to automatically shut down the lasers on the
output optical ports.
Possible Causes
l The IPA function is disabled.
l The fault is resolved before the time specified by Start Hold-off Time elapses.
l The communication between the case-shape Raman pump amplifier unit for C-band
(CRPC) frame and the subrack is faulty.
l The preset power threshold of the detection board is too low.
l The preset power threshold of the Raman board is too low.
l The IPA protection group is configured with auxiliary detection boards. When a fiber cut
occurs, some of the auxiliary detection boards do not report any alarm.
l The IPA protection group is configured with auxiliary detection boards. However, the
auxiliary detection board ports are incorrectly set.
l IPA parameters are incorrectly set.
CAUTION
Ensure that the optical fibers are properly connected and the configuration is correct.
NOTE
Troubleshooting Workflow
Figure 14-3 illustrates the IPA start failure in a Raman system troubleshooting process.
Start
No Yes
Is IPA Status set to Set IPA Status to
Enabled? Is IPA started?
Enabled.
Yes No
Yes No
Yes No
Yes
No Re-calculate and
Is the receive optical power change the threshold. Is IPA started?
of the Raman board lower
than the threshold? ?
Yes
Yes
Is the receive optical power No Re-calculate and
Is IPA started?
of the detection board lower change the threshold.
than the threshold? ?
Yes No
No Yes
Do all auxiliary detection Check the fiber connections
Are the line services
boards report IPA-triggering of the OA board.
restored?
alarms?
Yes No
Yes No
Yes No
Procedure
l Cause 1: The IPA function is disabled.
1. In the NE Explorer, click the NE and choose Configuration > IPA Management.
Then click Query to query IPA Status.
2. If IPA Status is set to Disabled, change it to Enabled and click Apply.
3. Query IPA Status of the opposite NE. Ensure that IPA Status of the NEs at both ends
of the line is set to Enabled.
l Cause 2: The fault is resolved before the time specified by Start Hold-off Time elapses.
After the fiber cut occurs, the IPA function is enabled only after the start hold-off time
elapses.
1. In the IPA Management window, click Set Start Delay. In the dialog box that is
displayed, click Query to query the Start Hold-off Time.
2. Check whether the fault is resolved before the time specified by Start Hold-off
Time elapses.
l Cause 3: The communication between the CRPC frame and the subrack is faulty.
1. On the U2000, check whether the CRPC board status is normal.
2. If the query fails, check whether the connectors and network cables are properly
installed.
l Cause 4: The preset optical value of the detection board threshold is too low.
1. Test the actual receive optical power of the detection board, and compare the test result
with the value of the IPA parameter IPA Detection Board Threshold.
2. If the actual receive optical power is higher than the value of Raman Board
Threshold, recalculate the Raman Board Threshold value and change it to a value
higher than the actual receive optical power of the detection board.
NOTE
For the IPA Detection Board Threshold calculation method, refer to "Parameters".
l Cause 5: The preset value of the Raman board threshold is too low.
1. Test the actual receive optical power of the Raman board, and compare the test result
with the IPA parameter Raman Board Threshold.
2. If the actual receive optical power is lower than the Raman board threshold, recalculate
the Raman board threshold and change it to a value lower than the actual receive
optical power.
NOTE
l Cause 6: The IPA protection group is configured with auxiliary detection boards. When a
fiber cut occurs, some of the auxiliary detection boards do not report any alarm.
1. Check whether each auxiliary detection board is reporting IPA-triggering alarms, such
as R_LOS, R_LOF, and OSC_RDI. If some of the auxiliary detection boards are not
reporting such alarms, check whether the fiber connection between the OA board and
the FIU, and that between FIU and the auxiliary detection board, comply with the
engineering design.
l Cause 7: The IPA protection group is configured with auxiliary detection boards. However,
the auxiliary detection board ports are incorrectly set.
1. Check the auxiliary detection board ports are receiving optical signals from the same
direction as the detection board ports. If they are not receiving optical signals, re-set
the auxiliary detection board ports.
l Cause 8: IPA parameters are incorrectly set.
1. In the IPA Management window, click Query and check whether IPA parameters
are correctly set. For more information about how to set each parameter, refer to
"Dependency and Limitation" and "Parameters".
NOTE
Main IPA parameters include Detection Board, Laser Control Board, IPA Detection Board
Threshold, Raman Board Threshold, and Raman Board Alarm Reporting.
----End
Solution
If the problem persists after you take the preceding measures, clear the alarms and faults of the
NE. Contact Huawei technical support for resolving the IPA function failure.
Symptom
During normal network operations, when a fiber cut occurs, the IPA function starts. The lasers
on the output optical ports of the OAs at both ends of the optical fiber are automatically shut
down. After the optical fiber is restored, the IPA function does not automatically turn \the lasers
on the output optical ports back on. As a result, the output optical power of the OAs is not
restored.
Possible Causes
l When the IPA protection group is not configured with a Detection Board and is configured
with only auxiliary detection boards, all the auxiliary detection boards are reporting IPA-
triggering alarms.
l The preset value of the Raman board threshold is too high.
l The preset value of the detection board threshold is too high.
l IPA parameters are incorrectly set.
l The lasers on the local and remote Raman boards are not turned on manually within the
time specified by On Period or Start Test Duration.
NOTE
Troubleshooting Workflow
Figure 14-4 illustrates the IPA function failure in a Raman system troubleshooting process.
Start
No
No
No
Yes
No
Yes
No Yes
Do the other parameter Modify the Are the services
values comply with the parameters. restored?
specifications?
No
Yes
No
Yes
No
Are the services Query NE alarms and
restored? handle other faults.
Yes
End
Contact Huawei
engineers.
Procedure
l Cause 1: When the IPA protection group is not configured with a Detection Board and is
configured with only auxiliary detection boards, all the auxiliary detection boards are
reporting IPA-triggering alarms.
1. Check whether each auxiliary detection board is reporting IPA-triggering alarms, such
as R_LOS, R_LOF, and OSC_RDI. If some of the auxiliary detection boards are
reporting such alarms, check the fiber connection of each auxiliary detection board.
2. If the fiber connection of each auxiliary detection board is normal, check whether one
of the auxiliary detection boards is faulty.
l Cause 2: The preset value of the Raman board threshold is too high.
1. Test the actual receive optical power of the Raman board, and compare the test result
with the IPA parameter Raman Board Threshold.
2. If the actual receive optical power is lower than the Raman board threshold, recalculate
the Raman board threshold and change it to a value lower than the actual receive
optical power.
NOTE
For the IPA Detection Board Threshold calculation method, refer to "Parameters".
l Cause 4: IPA parameters are incorrectly set.
1. In the NE Explorer, click the NE and choose Configuration > IPA Management.
2. In the IPA Management window, click Query to check whether IPA parameters are
correctly set. For more information about how to set each parameter, refer to
"Dependency and Limitation" and "Parameters".
NOTE
NOTE
This operation applies only to the scenario where lasers need to be manually opened after Laser
Restart Mode is set to Manual.
If the CRPC03 and ROP board are involved in the IPA function, the laser on the CRPC03 and ROP
board need to be manually opened.
----End
Solution
If the problem persists after you take the preceding measures, set the IPA Status in the IPA
Management window to Disabled to disable the IPA function. In the WDM Interface window
of the corresponding board, confirm that the Laser Status of the transmit optical interface of
the laser control board is set to On.
Check whether the services are restored. If services have not been restored, query the alarms of
the NE and clear the alarms and faults of the NE. If the services are still not restored, contact
Huawei technical support for resolving the IPA function failure.
Symptom
During normal operations of an ALC-enabled network, the attenuation of a line section changes
and the ALC function does not start. As a result, all OAs at each downstream station cannot
make adjustments. This affects the system OSNR and service transmission.
Possible Causes
l Cause 1: The ALC Automatic Regulation Switch is set to Disabled.
l Cause 2: ALC stops abnormally due to other external causes. ALC nodes report the event
indicating that ALC has ended unexpectedly.
l Cause 3: MUT_LOS or BD_STATUS alarms are being reported on the related board.
l Cause 4: ALC parameters are configured incorrectly. Therefore, the consistency check
fails.
Troubleshooting Workflow
Figure 14-5 illustrates the ALC function failure troubleshooting process.
Start
Is Automatic No Yes
Regulation Switch Start ALC. Is ALC started?
set to Enabled ?
Yes No
Yes
No No
No Yes
Is the consistency check Check and modify
Is ALC started?
successful? ALC parameter values.
Yes No
Contact Huawei
End
engineers.
Procedure
l Cause 1: The ALC Automatic Regulation Switch is set to Disabled.
1. Choose Configuration > WDM ALC Management from the Main Menu. In the
WDM ALC Management window that is displayed, click NG WDM Single Station
Configuration tab.
2. In the left-hand NE list, select the desired NE. Check whether the Automatic
Regulation Switch is set to Disabled. If it is disabled, enable ALC. For more
information, refer to "Starting Automatic Link Adjustment".
l Cause 2: ALC stops abnormally due to other external causes. ALC nodes report the event
indicating that ALC has ended unexpectedly.
1. On the left of the WDM ALC Management window, query the ALC events reported.
Check whether there are events indicating that the ALC has ended unexpectedly. If
the ALC has ended unexpectedly, identify the reason.
l Cause 3: MUT_LOS or BD_STATUS alarms are being reported on the related board.
1. Check whether MUT_LOS or BD_STATUS alarms are being reported on the ALC-
related OA and VOA boards at the station. If either of these alarms is being reported,
clear the alarm.
l Cause 4: ALC parameters are configured incorrectly. Therefore, the consistency check
fails.
1. Perform a consistency check of the ALC links. For more information, refer to
"Performing Consistency Check of ALC Links". If the consistency check fails, check
whether the ALC parameters of each ALC node are configured correctly. For details
on how to set each parameter, refer to "Dependency and Limitation and "Parameters".
----End
Solution
If the problem persists after you take the preceding measures, query NE alarms and clear the
alarms and faults on the NE. Contact Huawei technical support for resolving ALC function
faults.
Background Information
The OptiX OSN 3800 does not support the APE function.
Fault Description
During normal operations on an APE-enabled network, the U2000 reports an optical power
equilibrium event and then starts APE adjustment. However, the system fails to automatically
adjust the optical power at the transmit end of each channel or report an event indicating that
APE has ended unexpectedly. In this case, the equilibrium of the receive-end optical power and
OSNR cannot be ensured.
NOTE
If a fiber cut occurs along the line or communication is abnormal, handle the network faults.
NOTE
If the U2000 reports both an ALC event and an optical power equilibrium event, start the ALC function.
If the optical power equilibrium event persists after the adjustment is completed, start the APE function.
Possible Causes
l Cause 1: APE parameters are configured incorrectly.
l Cause 2: The Optical Interface Attenuation Ratio of the APE power regulation unit has
reached a critical point and can no longer be adjusted. As a result, the adjustment cannot
be started.
Troubleshooting Workflow
Figure 14-6 illustrates the APE function exception troubleshooting process.
Start
No
Yes
No Yes
Do the APE parameter
Modify the parameters. Is optical power
values comply with the
adjusted?
specifications?
No
Yes
No
Yes
No Yes
Is the difference between
Restart APE. Is optical power
ActualPower Offset and Standard
adjusted?
Power Offset lower than the
threshold?
No
Yes
Yes
Re-calibrate the value of Is optical power
Standard Power Offset. adjusted?
No
Procedure
l Cause 1: APE parameters are configured incorrectly.
1. In the NE Explorer, select the desired NE and choose Configuration > Optical Power
Equilibrium from the Function Tree.
2. Click Query to check whether APE parameters are configured correctly. For more
information about how to set each parameter, refer to "Dependency and Limitation
and Parameters".
NOTE
Main APE parameters include Power Monitoring Unit, Odd Wavelength Power Regulation
Unit, Even Wavelength Power Regulation Unit, and Monitoring Flag of the related
wavelength.
l Cause 2: The Optical Interface Attenuation Ratio of the APE power regulation unit has
reached a critical point and can no longer be adjusted. As a result, the adjustment cannot
be started.
1. In the NE Explorer, select the desired power regulation unit and choose
Configuration > WDM Interface to query the Optical Interface Attenuation
Ratio board parameter.
2. Check whether the Optical Interface Attenuation Ratio of each optical port where
the APE monitoring wavelength is located has reached the critical value and there is
no margin for adjustment.
3. If the Optical Interface Attenuation Ratio value has reached this point, adjust the
optical attenuation of the regulation unit at the upstream station, thereby decreasing
the optical attenuation required by the local station.
l Cause 3: Insufficient APE adjustment has occurred and further adjustment is required.
1. In the NE Explorer, select the desired NE and choose Configuration > Optical Power
Equilibrium from the Function Tree.
2. Query the Actual Power Offset and Standard Power Offset of the APE monitoring
wavelength and the Power Unbalance Threshold.
3. If the maximum difference between Actual Power Offset and Standard Power
Offset in all monitoring wavelengths is less than 7 dB and Power Unbalance
Threshold uses the default value (1.5), start the APE function again.
4. If the maximum difference between Actual Power Offset and Standard Power
Offset in all monitoring wavelengths is greater than 7 dB and Power Unbalance
Threshold uses the default value (1.5), increase the value of Power Unbalance
Threshold and start the APE function again.
NOTE
----End
Solution
If the problem persists after you take the preceding measures, contact Huawei technical support
for resolving the APE function faults.
Fault Description
After the optical cross-connections in automatic mode are configured, the optical path is
unavailable.
Possible Causes
l Cause 1: The rated optical power of the OA board is set incorrectly, and thus the NE reports
the OPA_FAIL_INDI alarm.
l Cause 2: The optical power budget is insufficient, and thus the NE reports the
OPA_FAIL_INDI alarm.
Procedure
l Cause 1: The rated optical power of the OA board is set incorrectly, and thus the NE reports
the OPA_FAIL_INDI alarm.
1. In Explorer, choose the desired OA board.
2. In Function Tree on the left side, choose Configuration > WDM Interface and then
select Advanced Attributes.
3. Double-click the Rated Optical Power (dBm) fields of the input and output
interfaces, and modify the values.
l Cause 2: The optical power budget is insufficient, and thus the NE reports the
OPA_FAIL_INDI alarm.
1. Adjust the system optical power budget, which is insufficient.
----End
Solution
After the above steps are performed, contact Huawei engineers to remove the faults in the OPA
function.
NOTE
The following cases are the cases relevant to the OptiX WDM product series.
Related Cases:
l MC-A8 The TN11OAU101 at the Transmit End Reports the MUT_LOS Alarm
NOTE
The following cases are the cases relevant to the OptiX WDM product series.
Related Cases:
l MC-A77 Improper Configuration of Protocol Channels Results in a Failure of the ALC to
Start
l MC-A81 The Created ALC Link on OptiX OSN 6800 Cannot Be Automatically Adjusted
l MC-A121 The T2000 Reports a Failure In Querying the ALC Link Status Because Of Too
Many NEs On the Link
l MC-A169 T2000 Reports Errors After the Standard Optical Power of Single Wavelength
Is Set to 7 dBm with ALC in Power Reference Mode
NOTE
The following cases are the cases relevant to the OptiX WDM product series.
Related Cases:
l MC-A78 When the IPA Is Set or Deleted, the OAU Alarm Threshold Changes Due to
Version Features
l MC-A116 Failure in Configuring the SC2 as an Auxiliary Detection Board in the IPA
Protection Group
l MC-A175 How to Clear the LASER_HAZARD_WARNING Alarms Reported on Certain
OBU and OAU Boards After an Upgrade of Software
NOTE
The following cases are the cases relevant to the OptiX WDM product series.
Related Cases:
l MC-A116 Failure in Configuring the SC2 as an Auxiliary Detection Board in the IPA
Protection Group
This topic describes the symptoms, impact, and possible causes of MPLS tunnel faults such as
creation failures or service interruptions. In addition, this topic describes the required tools,
precautions, and procedures for rectifying the fault.
Symptoms
Symptom 1: Creating the tunnel fails, and thus the service is unavailable.
Symptom 3: The protection switching fails, and thus the service is interrupted.
Possible Causes
l For symptom 1, the possible causes are as follows:
– Cause 1: Creating the tunnel fails.
l For symptom 2, the possible causes are as follows:
– Cause 1: The routing fails.
– Cause 2: The physical link where the tunnel resides is faulty.
l For symptom 3, the possible causes are as follows:
– Cause 1: The protection switching fails.
Procedure
Step 1 For symptom 1: Creating the tunnel fails, and thus the service is unavailable.
1. Cause 1: Creating the cross-connection fails.
a. Check whether different NEs on the network use the same network segment. If
different NEs use the same network segment, change the IP address of the port.
b. Check whether the tunnel is configured with the incompatible protection feature. For
details, see the Release Notes.
Step 2 For symptom 2: The tunnel is faulty, and thus the service is interrupted.
1. Cause 1: The routing fails.
a. According to the network planning, check whether the parameters of the ports such
as the IP address of the port at the two ends of the tunnel are correctly set. Correct the
parameter settings, and then create the tunnel again.
b. Check whether there is a complete and reachable link between the NEs where the
source and sink nodes of the tunnel reside. If the link does not exist, repair the
incomplete or faulty part of the link.
2. Cause 2: The physical link where the tunnel resides is faulty.
a. Check whether the HARD_BAD or ETH_LOS alarm indicating that the physical link
used by the tunnel is faulty exists in the system. If any alarm exists, take priority to
handle it.
b. Enable the tunnel OAM function. Then, check and handle the following alarms that
exist in the system, MPLS_TUNNEL_LOCV, MPLS_TUNNEL_SD and
MPLS_TUNNEL_SF.
c. Check whether anomalies such as board faults or NE resetting exist on the opposite
equipment If yes, rectify the anomalies.
Step 3 For symptom 3: The protection switching fails, and thus the service is interrupted.
1. Cause 1: The protection switching fails.
a. The MPLS tunnel APS protection switching fails. For details, see Feature
Description.
----End
Related Information
None
Symptoms
Symptoms 1: Creating the PW fails, and thus the service is unavailable.
Symptoms 2: The PW is faulty, and thus the service is interrupted, or packet loss or bit errors
occur in the service.
Possible Causes
l For symptom 1, the possible causes are as follows:
– Cause 1: Creating the PW cross-connection fails.
Procedure
Step 1 For symptom 1: Creating the PW fails, and thus the service is unavailable.
1. Cause 1: Creating the PW cross-connection fails.
a. Check whether the number of PWs reaches the upper limit of the NE or that of the
board. If yes, make a plan again or delete redundant PWs.
b. Check whether the label of PW is existent. If yes, modify the label of PW.
Step 2 For symptom 2: The PW is faulty, and thus the service is interrupted, or packet loss or bit
errors occur in the service.
1. Cause 1: The physical link is faulty.
a. Check whether the physical link between the source and sink nodes of the PW are
normal, and whether the R_LOS, or ETH_LOS alarm exists in the system.
b. If the alarm exists, take priority to handle the ETH_LOS alarm.
2. Cause 2: The tunnel where the PW resides is faulty.
a. For details on the tunnel faults, see 15.1 Handling MPLS Tunnel Faults.
3. Cause 3: The protection switching fails.
a. The PW APS protection switching fails.
4. Cause 4: The equipment is faulty.
a. Check whether anomalies such as board faults or NE resetting exist on the opposite
equipment. If yes, rectify the anomalies.
----End
Related Information
None
Symptom
Symptom 1: Ethernet services are interrupted.
Symptom 2: Ethernet services are degraded. For example, they have great delays, packet loss,
or incorrect packets.
Impact on System
Ethernet services are interrupted or degraded.
Possible Causes
l Cause 1: Services are incorrectly configured or parameter settings at local equipment are
inconsistent with those at its interconnected equipment. Parameter settings include settings
for loopbacks on Ethernet ports, loopbacks on transmission lines, enabling statuses/
working modes of Ethernet ports, and flow control for Ethernet ports.
l Cause 2: The local/interconnected equipment is faulty or bit errors occur on lines.
l Cause 3: The network cable is faulty.
l Cause 4: The external electromagnetic interference is severe.
Procedure
Step 1 Troubleshoot an Ethernet service fault.
1. Cause 1: Services are incorrectly configured or parameter settings at local equipment
are inconsistent with those at its interconnected equipment. Parameter settings
include settings for loopbacks on Ethernet ports, loopbacks on transmission lines,
enabling statuses/working modes of Ethernet ports, and flow control for Ethernet
ports.
a. Check whether a loopback is configured for an Ethernet port or a transmission line.
If misoperations cause a port/line loopback, correct the settings. If no loopback is
configured, go to the next step.
b. Check whether Ethernet port settings regarding enabling status, working mode, and
flow control at local equipment are consistent with those at its interconnected
equipment. If inconsistent, correct the settings; if consistent, go to the next step.
c. Check whether the Ethernet protocol and Ethernet service configurations (especially
the attributes of the Ethernet port) are correct.
a. Query the real-time performance measurement statistics of the Ethernet port with
reference to Feature Description.
b. Change the Ethernet port that receives services. If the RMON performance statistics
of the new port do not contain FCS errors, the hardware of the original port is faulty;
otherwise, the network port of the interconnected equipment may be faulty. To check
it out, go to the next step.
3. Cause 3: The network cable is faulty.
a. Check the network cable. If the network cable is not qualified, replace it with a new
one.
NOTE
When an Ethernet port works in auto-negotiation mode, its interconnected port must not work
in full duplex mode.
4. Cause 4: The external electromagnetic interference is severe.
a. Reduce external electromagnetic interference.
----End
Related Information
None.
Symptoms
Symptom 1: The PWE3 Ethernet service is interrupted.
Symptom 2: The PWE3 Ethernet service loses packets or has errored packets.
Possible Causes
l For symptom 1, the possible causes are as follows:
– Cause 1: The board hardware is faulty, the board temperature is above the upper
threshold, or the communication between boards fails. As a result, the board cannot
work properly.
– Cause 2: The PWE3 Ethernet port is misconnected and the port negotiation fails.
– Cause 3: Loopback is configured for the port.
– Cause 4: Dynamically learning the entries in the ARP table fails.
Procedure
Step 1 For symptom 1: The PWE3 Ethernet service is interrupted.
1. Cause 1: The board hardware is faulty, the board temperature is above the upper
threshold, or the communication between boards fails. As a result, the board cannot
work properly.
a. Query the current alarms of the system to check for the HARD_BAD or
COMMUN_FAIL alarm and the board that reports the alarm.
b. Handle the HARD_BAD or COMMUN_FAIL alarm. For details, refer to the Alarms
and Performance Events Reference manual.
2. Cause 2: The PWE3 Ethernet port is misconnected and the port negotiation fails.
a. Check for the ETH_LOS alarm of the system and handle the ETH_LOS alarm.
3. Cause 3: Loopback is configured for the port.
a. Check for the LOOP_ALM alarm of the system and handle the LOOP_ALM alarm.
b. Check for the ETH_EFM_LOOPBACK alarm of the system and handle the
ETH_EFM_LOOPBACK alarm.
4. Cause 4: Dynamically learning the entries in the ARP table fails.
a. Check whether the ARP table contains the mapping relationship between the opposite
IP address and the opposite MAC address. If the ARP table does not contain the
mapping relationship, it indicates that the entries in the ARP table fail to be
dynamically learned.
b. Configure the static ARP table.
5. Cause 5: The service configuration is incorrect.
a. See the Configuration Guide to re-configure the services.
Step 2 For symptom 2: The PWE3 Ethernet service loses packets or has errored packets.
1. Cause 1: The signals on the receive side are lost.
a. Check for the ETH_LOS alarm of the system and handle the ETH_LOS alarm.
b. Check for the LSR_NO_FITED, LSR_WILL_DIE, or LASER_MOD_ERR alarm of
the system and handle the LSR_NO_FITED, LSR_WILL_DIE, or
LASER_MOD_ERR alarm.
2. Cause 2: The configured traffic limit is excessively low at the port and configurations
of the source and sink ports are inconsistent.
a. Check for the FLOW_OVER alarm of the system and handle the FLOW_OVER
alarm.
----End
Related Information
None
Symptom
When the network works normally, a condition that triggers the protection switching occurs on
the working tunnel. The services, however, fail to be automatically switched from the working
tunnel to the protection tunnel. Consequently, the services are interrupted. The symptoms are as
follows:
Possible Causes
l Cause 1: The fibers or cables on the protection tunnel are connected incorrectly.
l Cause 2: The OAM parameters of the working tunnel and protection tunnel are set
incorrectly.
l Cause 3: The parameter values of the APS protection group at the two ends are different
from each other.
l Cause 4: The MPLS Tunnel APS protocol is disabled.
Flow Chart
Figure 15-1 shows the flow chart for rectifying a fault when the MPLS Tunnel APS function is
enabled.
Figure 15-1 Flow chart for rectifying a fault when the MPLS Tunnel APS function is enabled
Start
No
Yes
Yes
No
Yes
Yes
No
No
Procedure
l Cause 1: The fibers or cables on the protection tunnel are connected incorrectly.
1. Reconnect the fibers or cables to each port and ensure that the fibers or cables are
connected correctly.
l Cause 2: The working tunnel and protection tunnel are set incorrectly.
1. In the Main Topology, right-click the NE that you want to configure and choose NE
Explorer from the shortcut menu.
2. Choose Configuration > Packet Configuration > MPLS Management > Unicast
Tunnel Management in the Function Tree.
3. Click the OAM Parameters tab, select the tunnel that realizes the MPLS Tunnel APS
function. Then, set Detection Packet Period (ms) to 3.3.
l Cause 3: The parameter values of the APS protection group at the two ends are different
from each other.
1. In the Main Topology, right-click the NE that you want to configure and choose NE
Explorer from the shortcut menu.
2. Choose Configuration > Packet Configuration > APS Protection Management in
the Function Tree.
3. Click Query to query whether the settings of the APS protection group are the same.
If the settings of the APS protection group are different from each other, set the
protection tunnel in the APS protection group to the same value as the working tunnel.
l Cause 4: The MPLS Tunnel APS protocol is disabled.
1. In the Main Topology, right-click the NE that you want to configure and choose NE
Explorer from the shortcut menu.
2. Choose Configuration > Packet Configuration > APS Protection Management in
the Function Tree.
3. Set Protocol Status to Enabled, and then click Apply.
4. In the Operation Result dialog box that is displayed, click Close.
l If the preceding operations are performed but the alarm related to the MPLS Tunnel APS
function persists, clear the alarm. If the fault persists, contact Huawei technical support
engineers for the solution.
----End
16 Ethernet Problems
Symptom
Services are degraded.
System Impact
Services are affected.
Possible Causes
l Cause 1: The bandwidth is insufficient for heavy traffic or the bandwidth is sufficient but
the burst traffic is excessive.
l Cause 2: There is a fault at the physical layer.
Troubleshooting Workflow
Figure 16-1 illustrates the Ethernet service degradation troubleshooting process.
Start
No No
No No
Yes Yes
The setting of a port attribute of Modify the Are services
the local equipment is different
from that of the opposite
mismatched setting. restored?
equipment.
No No
Yes Yes
Is there a fault at the Check for alarms and Are services
physical layer? handle them. restored?
No No
Contact Huawei
engineers. End
Procedure
l Cause 1: The bandwidth is insufficient for heavy traffic or the bandwidth is sufficient but
the burst traffic is excessive.
1. Check for RMON performance events of the corresponding NE and handle them. For
information about how to handle RMON performance events, refer to the Alarms and
Performance Events Reference.
l Cause 2: There is a fault at the physical layer.
1. Check for alarms. If an ETH_8B10B_ERR alarm is being reported, handle this alarm.
----End
Fault Symptom
The common fault phenomena are as follows:
l The source and sink ends fail to transmit control information. Hence, the negotiation
between the two ends is incorrect and the LCAS function fails to be enabled.
l If the bound path of the data board is configured before the SDH services from the data
board to the line board are configured, the LCAS function may fail to start immediately.
Possible Causes
The possible causes are as follows:
l The configuration timeslots of SDH services at the source and sink ends are not consistent.
l The timeslots actually bound to the data boards at the source and sink ends are abnormal
or lost.
l The LCAS function at the source and sink ends is not enabled.
l Excessive bit errors cause the failure of control information transmission between the
source and sink ends.
Start
No
Yes
Yes
Procedure
l Cause 1: The timeslots configured at the source and sink ends are not consistent.
1. Right-click the relevant NE icon in the Main Topology, and then choose NE
Explorer.
2. Choose Configuration > SDH Service Configuration from the Function Tree.
3. Select the SDH service that is configured between the source and sink ends in the pane
on the right side.
4. Click the Deactivate button in the lower right corner.
5. In the Confirm dialog box that is displayed, click OK.
6. Right-click the SDH service that is configured between the source and sink ends, and
then choose Modify.
7. In the Modify SDH Service dialog box that is displayed, verify that the service level
and timeslots configured for the source and sink ends are consistent.
l Cause 2: The timeslots actually bound to the data boards at the source and sink ends are
abnormal or lost.
1. Right-click the relevant NE icon in the Main Topology, and then choose NE
Explorer.
2. Select the relevant data board in the Object Tree, and then choose Configuration >
Ethernet Interface Management > Ethernet Interface from the Function Tree in
the left pane.
3. Select Internal Port and click the Bound path tab.
4. Ensure that the activation status of the configured bound path in the bound path list is
"active".
5. Select the configured bound path, and then click the Configuration button in the lower
right corner in the pane on the right side.
6. In the Bound Path Configuration dialog box that is displayed, re-configure
Available Resources and Available Timeslots.
l Cause 3: The LCAS function is not enabled at the source and sink ends at the same time.
1. Right-click the relevant NE icon in the Main Topology, and then choose NE
Explorer.
2. Select the relevant data board in the Object Tree, and then choose Configuration >
Ethernet Interface Management > Ethernet Interface from the Function Tree
3. Select Internal Port and click the LCAS tab.
4. Find the relevant port, double-click on the column Enabling LCAS of which is set to
Disabled, and finally set Enabling LCAS to Enabled.
5. Click the Apply button in the lower right corner.
6. In the Operation Result dialog box that is displayed, click Close.
l Cause 4: Excessive bit errors cause the failure of control information transmission between
the source and sink ends.
NOTE
You can check whether the control information transmission between the source and sink ends fails
due to excessive bit errors by querying the alarm information on the NMS.
1. Right-click the relevant NE icon in the Main Topology, and then choose Browse
Current Alarms.
2. Check whether the VCAT_LOA alarm is contained in the alarm list.
3. See the Alarms and Performance Events Reference to troubleshoot the virtual
concatenation delay and excessive bit errors.
----End
Symptom
Services are interrupted.
Possible Causes
l Cause 1: The LPT function is disabled.
l Cause 2: The LPT bearer modes of all the NEs between the service access point and
intermediate network are inconsistent.
Procedure
Step 1 Cause 1: The LPT function is disabled.
1. In the NE Explorer, select the desired board and choose Configuration > Ethernet
Interface Management > LPT Management from the Function Tree.
2. Click Query and the Operation Result dialog box is displayed.
3. Check the LPT field of the board and ensure that the LPT function is enabled. Then click
Close.
Step 2 Cause 2: The LPT bearer modes of all the NEs between the service access point and intermediate
network are inconsistent.
1. In the NE Explorer, select the desired board and choose Configuration > Ethernet
Interface Management > LPT Management from the Function Tree.
2. Click Query and the Operation Result dialog box is displayed.
3. Check the Bearer Mode field of the board and ensure that the bearer modes of all nodes
are consistent. Then click Close.
----End
Fault Symptom
Common fault phenomena are listed as follows.
l LB test failure
Possible Causes
Possible causes of the LB test failure are listed as follows.
l The external physical port where the maintenance point resides is not enabled.
l Values of Port Type of the ports where the sink and source maintenance points reside are
inconsistent.
l When values of Port Type of the ports where the sink and source maintenance points reside
are UNI or PE, the TAG flags are inconsistent.
l The VLAN ID of the maintenance points is inconsistent with the VLAN ID of the service.
l The level of the source maintenance point is inconsistent with the level of the sink
maintenance point.
l MEPs or MIPs of a higher level exist between the source and sink maintenance points.
l The Ethernet service is a unidirectional service.
l The Ethernet service is interrupted.
Troubleshooting Flow
Is
the external No Enable the external
Is LB test Yes
physical port of the MP physical port of the
MP successful
enabled
No
Yes
Port type
of the ports of the Yes No Make the TAG flags Is LB test Yes
Are TAG flags
source sink MPs is UNI consistent successful
and or PE consistent
Yes No
No
Are the
VLAN IDs of the No Make the VLAN ID Is LB test Yes
MPs and the service consistent successful
consistent
No
No
Refer to the
3 Ethernet service is Troubleshooting for Is LB test Yes
interrupted the handling successful
No
1 Port type of the port where the MP resides may be improperly configured Contact Huawei
engineers
2 MP attributes may be improperly configured
End
3 The Ethernet service may be interrupted
Procedure
Step 1 Cause 1: The external physical port where the maintenance point resides is not enabled.
1. In the Main View, right-click the required NE icon and choose NE Explorer from the
shortcut menu.
2. In the Object Tree, select the required Ethernet service processing board. Choose
Configuration > Ethernet Interface Management > Ethernet Interface from the
Function Tree. Click .
Step 2 Cause 2: Values of Port Type of the ports where the sink and source maintenance points reside
are inconsistent.
1. Select External Port or Internal Port.
2. Click the Network Attributes tab. Check whether the values of Port Type of the ports are
consistent.
Step 3 Cause 3: When values of Port Type of the ports where the sink and source maintenance points
reside are UNI or PE, the TAG flags are inconsistent.
1. Click the TAG Attribute tab. Check whether the values of TAG of the ports are consistent.
NOTE
The TAG parameter is valid only when the value of Port Type is UNI or PE.
Step 4 Cause 4: The VLAN ID of the maintenance points is inconsistent with the VLAN ID of the
service.
1. In the Object Tree, select the required Ethernet service processing board. Choose
Configuration > Ethernet Maintenance > Ethernet Service OAM from the Function
Tree. Click .
2. Click Query to check whether the VLAN ID of the maintenance points is consistent with
the VLAN ID of the service.
Step 5 Cause 5: The level of the source maintenance point is inconsistent with the level of the sink
maintenance point.
1. Click Query to check whether the level of the source maintenance point is inconsistent
with the level of the sink maintenance point.
Step 6 Cause 6: MEPs or MIPs of a higher level exist between the source and sink maintenance points.
1. Click Query to check whether any MEP or MIP of a higher level exists between the source
and sink maintenance points.
----End
Symptom
The test frame cannot function normally.
System Impact
None.
Possible Cause
l Cause 1: The timeslots in the cross-connection services of the interconnected ends are
inconsistent.
l Cause 2: The bearing modes of the VC trunk ports at the two interconnected ends are
inconsistent.
l Cause 3: The logic port on which the function of test frame is enabled is not bound with
timeslots.
Procedure
l Cause 1: The timeslots in the cross-connection services of the interconnected ends are
inconsistent.
1. Check whether the timeslots in the cross-connection services of the interconnected
ends are consistent. If they are inconsistent, set them to the same.
l Cause 2: The bearing modes of the VC trunk ports at the two interconnected ends are
inconsistent.
1. Check whether the bearing modes of the VC trunk ports at the two interconnected
ends are consistent. If they are inconsistent, set them to the same.
l Cause 3: The logic port on which the function of test frame is enabled is not bound with
timeslots.
1. Check whether the logic port on which the function of test frame is enabled is bound
with timeslots.
----End
Symptom
The common LAG faults are as follows:
l The member ports become unavailable, because the LAG is faulty. As a result, the services
are interrupted.
l Certain packets are discarded, because the member ports in the LAG become unavailable.
Possible Causes
l The NEs at both ends of the LAG are configured incorrectly.
l The member ports in the LAG work in half-duplex mode.
l Loopbacks are performed on the member ports in the LAG.
l The member ports in the LAG are disconnected.
Flow Chart
Following a certain troubleshooting flow facilitates your fault locating and rectification.
NOTE
This topic describes the process for rectifying a LAG fault when the LAG function is enabled in TDM
mode. This topic also provides a reference for you to rectify a LAG fault because the process for rectifying
a LAG fault when the LAG function is enabled in packet mode is similar.
Start
The Yes
LAG_FAIL Handle the
alarm occurs? alarm
No
No
The
Yes
LOOP_ALM or Release the
ETH_EFM_LOOPBACK loopback at the port
alarm occurs?
No
No
The fault is
rectified?
Yes
Contact Huawei
technical support
engineers to handle the End
fault
Procedure
Step 1 Cause 1: The NEs at both ends of the LAG are configured incorrectly.
1. On the U2000, check whether the LAG_DOWN or LAG_MEMBER_DOWN alarm is
reported.
2. If yes, see the Alarms and Performance Events Reference to clear the alarm. Then, check
whether the fault is rectified.
Step 2 Cause 2: The member ports in the LAG work in half-duplex mode.
1. Check whether the member ports in the LAG work in half-duplex mode.
2. If yes, change the working mode to full-duplex and ensure that the ports on the local and
opposite NEs work in the same mode.
Step 3 Cause 3: Loopbacks are performed on the member ports in the LAG.
1. Check whether the LOOP_ALM or ETH_EFM_LOOPBACK alarm is reported on the
member ports in the LAG.
2. If yes, release the loopback status to clear the LOOP_ALM or ETH_EFM_LOOPBACK
alarm.
Step 5 If the fault persists, contact the technical support engineers of Huawei.
----End
Context
Figure 16-5 shows the flow for rectifying a fault that occurs when the MSTP function is used.
Start
End
Procedure
Step 1 Check the information in the MAC address tables of all the equipment on the network.
1. Check the information in the MAC address tables of all the equipment on the network. If
a MAC address corresponds to several ports, loops may exist. In this case, cancel any
loopbacks.
4. Check whether the values of the Hello Time, Forward Delay, and Max Age parameters
are set correctly.
l If the Hello Time parameter is set to an extremely large value, the change in network
topology may be responded in a delayed manner. If the Hello Time parameter is set to
an extremely small value, the configuration message is frequently transmitted, and thus
more CPU and network resources are consumed.
l If the Forward Delay parameter is set to an extremely large value, the spanning tree
converges slowly. If the Forward Delay parameter is set to an extremely small value,
temporary loops may exist when the network topology changes.
l If the Max Age parameter is set to an extremely large value, the fault on the link fails
to be detected. If the Max Age parameter is set to an extremely small value, the switch
incorrectly considers that a fault occurs on the link in the case of network congestion,
and thus the spanning tree is frequently computed.
NOTE
It is recommended that the three parameters take the default values.
Step 4 Check the configurations of the MSTP function for the other equipment.
If a fault occurs when the MSTP function is used, check the configurations of the MSTP function
for the other equipment on the network. The MSTP protocol should be enabled for all the
equipment on the network (at least on one ring). Otherwise, the equipment on the network fails
to be informed of the network topology by exchanging the messages, and thus the learning of
the spanning tree fails.
1. Check whether the root bridge of the entire network is correctly selected.
In general cases, you can set the intermediate LanSwitch as the root bridge by modifying
the priorities of bridges. If the intermediate LanSwitch is not set as the root bridge, the
traffic on the entire network is not balanced. In addition, certain services become
unavailable.
2. Check whether the bridge parameters and port parameters of all the equipment are set
correctly. In general cases, the Hello Time, Forward Delay, and Max Age parameters
take the parameters values of the root bridge.
----End
NOTE
The following cases are related to the OptiX WDM product series.
Related Cases:
l MC-A4 The LOG Board Fails to Interwork With the FDG Board on the Client Sides. The
LOG board reports the R_LOS alarm on the client side. The FDG board reports the
LINK_STATUS alarm.
l MC-A5 The LQG Board Reports the ALM_DATA_RLOS and ALM_DATA_TLOS
Alarms Transiently
l MC-A6 The LDG Board Keeps Reporting the ALM_DATA_TLOS and
ALM_DATA_RLOS Alarms Transiently
l MC-A17 The LQG Reports ALM_DATA_RLOS and ALM_DATA_TLOS Alarms
l MC-A30 LDG Board of Metro 6100 Equipment Reports the INBADOCTS_OVER Alarm
l MC-A96 Packet Loss Occurs in Ethernet Service Testing of the L4G Board
l MC-A117 Analysis and Handling of the ALM_DATA_RLOS and ALM_DATA_TLOS
Alarms Reported by the LBE
l MC-A158 How to Use the Ethernet Test Frames on the OptiX OSN 6800
l MC-A219 EPL Service Is Interrupted Due to Mismatched Service Modes of the L4G
Boards, Sites Report The R_LOF, ODU5G_PM_AIS, OTU5G_LOF And LINK_ERR
Alarms
17 ECC Problems
In a WDM system, communication between the U2000 and non-gateway NEs is established
with the U2000 first communicating with the gateway NE over TCP/IP. Then the gateway NE
communicates with non-gateway NEs through the embedded control channel (ECC) to complete
the communication channel.
Symptom
The services on the main optical path are normal, but ECC communication is interrupted and
the NE cannot be reached by the U2000.
System Impact
The U2000 cannot reach the NE.
Possible Causes
l Cause 1: The ID of the NE is a duplicate of another NE.
l Cause 2: The pigtail connected to the OSC board is faulty.
l Cause 3: The system control board is faulty.
l Cause 4: The OSC board is faulty.
l Cause 5: The OTU or tributary or line board is faulty.
l Cause 6: Coherent and non-coherent boards are interconnected.
Procedure
l Cause 1: The ID of the NE is a duplicate of another NE.
1. Log in to the NE on the WEB LCT and modify the NE ID of the NE according to the
record.
l Cause 2: The pigtail connected to the OSC board is faulty.
1. Check the pigtail connected to the OSC board on the first faulty NE along the signal
flow. If the pigtail is loose, secure it; if it is dirty, clean it; if it is faulty, replace it.
2. Ensure that the pigtail is properly connected to the OSC board on the faulty NE.
NOTE
During deployment commissioning, the pigtail might be incorrectly connected because the FIU
board and the OSC board have many ports marked with similar names. During commissioning,
to prevent incorrect pigtail connections in one direction of the OSC board, you can connect an
optical attenuator between the currently unused RM and TM optical ports in the other direction
of the OSC board to form a self-loop.
l Cause 3: The system control board is faulty.
1. Reset the system control board on the faulty NE along the signal flow, and then check
whether you can log in to the NE.
2. If you cannot log in to the NE, remove the system control board on the faulty NE so
the ECC signals pass through the backplane. Then check whether the downstream NE
----End
Symptom
The ECC communication is intermittent, and the U2000 is frequently unable to reach the NE.
System Impact
The U2000 cannot reach the NE.
Possible Causes
l Cause 1: Clock tracing is set incorrectly.
l Cause 2: There are bit errors on the signal transmission.
Procedure
l Cause 1: Clock tracing is set incorrectly.
1. Set clock tracing to the correct source. For details on how to set the correct clock
tracing, see "Synchronization".
NOTE
All equipment on the network must trace the same clock source. In the case of WDM equipment,
the internal clock source of a specific NE should be given the highest priority, and the other
NEs trace the internal clock source of this NE. (In the case of chain networking, the internal
clock source of a terminal NE rather than an intermediate NE should be set as the clock source
so it can be traced network-wide.)
l Cause 2: There are bit errors on the signal transmission.
1. Refer to rectifying bit errors.
----End
Q: Why does the system control board board on the GNE or the NE that uses the extended
ECC frequently reset?
Q: Why are NEs prone to be unreachable by the U2000 due to bit errors during deployment
and expansion?
A:
If the OSC and ESC functions are used together, bit errors are often generated during deployment
and expansion because services transmitted along with ESC information are being
commissioned. Once bit errors are generated, the network frequently switches between the OSC
and ESC channels, and consequently NEs on the network become unreachable by the U2000,
leading to NE configuration and management failures. To prevent this problem, disable the ESC
function during deployment and expansion, and enable it again when deployment and expansion
are complete.
Symptom
The common faults regarding the inband DCN are as follows:
l The communication between the U2000 and NEs is interrupted, and the NE icons on the
U2000 become unavailable.
l No response is returned after a command is issued on the U2000. If no response is returned
more than 2 minutes later, the communication between the U2000 and an NE is interrupted.
l Part of the query result on the U2000 is missing.
Impact on System
l After the communication between an NE and the U2000 is interrupted, the NEs that
communicate with the U2000 through the NE also become unreachable, if they cannot
connect to the U2000 by other methods. The other NEs, however, are not affected.
Possible Causes
l Cause 1: The physical link between the faulty NE and the U2000 is disconnected.
l Cause 2: The ports on the faulty NE are not enabled with the inband DCN function.
l Cause 3: The DCN VLAN IDs of the NEs at both ends are not the same.
l Cause 4: The ports connecting the local and opposite NEs use different protocols.
l Cause 5: The faulty NE fails to obtain DCN packets due to loss of received signals or very
low receive optical power.
l Cause 6: A board is faulty.
l Cause 7: No response is returned for inband DCN packets, because the SCC board on the
faulty NE is being reset or active/standby protection switching occurs on another board.
l Cause 8: The bandwidth of the inband DCN channel is very low or other services preempt
the bandwidth for the inband DCN channel.
NOTE
Troubleshooting Flow
A troubleshooting flow facilitates fault locating and rectification.
Start
No
Yes
Is the port on the Enable the port
DCN disabled? on the DCN.
No
Yes
Is the received Clear the alarm
No
signal lost? related to the optical
power, fiber, or cable.
No
Yes
Is the board Replace the board.
faulty?
No response is Yes
否returned for the The SCC board is Wait until the SCC
command on the being reset. board is reset.
U2000.
No
Yes
Is the information The bandwidth Increase the bandwidth of the
queried on the of the DCN path DCN path or degrade the
U2000 lost? is very low. priority of other services.
No
Is the fault
rectified?
Yes
Procedure
Step 1 Cause 1: The physical link between the faulty NE and the U2000 is disconnected.
1. Check whether the Ethernet cable or fiber of the faulty NE is disconnected from the port.
If yes, reconnect it.
Step 2 Cause 2: The ports on the faulty NE are not enabled with the inband DCN function.
1. Check whether the DCN function is enabled by default for the port that is connected to the
fiber. If not, connect the fiber to a port with the DCN function enabled by default.
2. Check whether the inband DCN function is enabled for the ports at both ends of the link.
If not, enable DCN access function for the ports.
Step 3 Cause 3: The DCN VLAN IDs of the NEs at both ends are not the same.
1. Check whether the DCN VLAN ID of the local NE is the same as that of the opposite NE.
If not, change the DCN VLAN ID of the local NE to the DCN VLAN ID of the opposite
NE.
Step 4 Cause 4: The ports connecting the local and opposite NEs use different protocols.
1. Check whether the port on the local NE uses the same protocol as the port on the opposite
NE. If not, change the protocol to ensure that the ports connecting the local and opposite
NEs use the same protocol.
Step 5 Cause 5: The faulty NE fails to obtain DCN packets due to loss of received signals or very low
receive optical power.
1. Check whether alarms, such as ETH_LOS, ETH_LINK_DOWN, or IN_PWR_LOW, are
reported on the board that is configured with the inband DCN channel. If yes, see the Alarms
and Performance Events Reference to clear these alarms.
Step 6 Cause 6: A board is faulty.
1. Check whether the HARD_BAD alarm is reported on the board that is configured with the
inband DCN channel. If yes, see the Alarms and Performance Events Reference to clear
the alarm.
Step 7 Cause 7: No response is returned for inband DCN packets, because the SCC board on the faulty
NE is being reset or active/standby protection switching occurs on another board.
1. Check whether the PROG indicator on the system control and communication (SCC) board
is blinking green. If yes, the SCC board is being reset. When the PROG indicator is steady
green, the reset of the SCC board is completed. Then, the NE is automatically reconnected
to the DCN.
2. If no response is returned for DCN packets, check whether active/standby protection
switching occurs on another board, which results in rerouting of DCN packets.
3. If active/standby protection switching occurs on a board, the NE automatically responds
to DCN packets after rerouting of DCN packets is completed.
Step 8 Cause 8: The bandwidth of the inband DCN channel is very low or other services preempt the
bandwidth for the inband DCN channel.
1. If the number of services that are configured for the port exceeds the specified value, part
of the query result may be missing. To resolve the problem, increase the bandwidth of the
inband DCN channel properly. For details, see Setting the VLAN ID and Bandwidth Used
by the Inband DCN.
Step 9 If the fault persists, contact Huawei to handle the fault.
----End
NOTE
The following cases are related to the OptiX WDM product series.
Related Cases:
l MC-A22 NE ESC Communication Interrupted Because of the Closure of the OTU Laser
l MC-A64 Many BD_STATUS Alarms Occur Due to the ECC Storm
l MC-A81 The Created ALC Link on OptiX OSN 6800 Cannot Be Automatically Adjusted
l MC-A110 Question About the Extended ECC of WDM NEs
l MC-A113 Communication Between the WDM Equipment And the DCN Network Fails
After a Hub Is Connected To the DCN Network
l MC-A135 The Communication Between the NE and the T2000 Stops Because the NE
Software Cannot Switch the ECC from an OSC Board to an OTU Board Automatically
l MC-A181 An NE Is Frequently Unreachable to the NMS Due to Insufficient Processing
Capacity of a Router
l MC-A247 NE Configuration Data Cannot Be Uploaded
18 Clock Problems
This section describes the possible causes of the synchronization failure in the clock, and the
troubleshooting procedure.
Symptom
After a fiber fault is rectified, the delay is not compensated. As a result, the downstream NodeBs
cannot trace the clock signals.
Impact on System
If protection is configured, the IEEE 1588v2 clock signals are switched from the working path
to the protection path. If no protection is configured, the IEEE 1588v2 clock signals are
interrupted and the downstream NodeBs cannot perform clock synchronization.
Figure 18-1 Networking diagram of the ring network configured with protection
NE1 NE2
East West
West East
NE4 NE3 NE5
OptiX PTN
BITS OptiX OSN equipment equipment NodeB
1. Delete the west IEEE 1588v2 port on NE3 so that the clock signals will not be immediately
switched to the working path after the fiber between NE2 and NE3 is repaired.
2. Repair the faulty fiber between NE2 and NE3.
3. Check whether the time precision on NE3 and that on NE4 satisfy requirements. For the
time precision requirements, see Requirements on Clock Synchronization. If the time
precision requirements are not satisfied, the path between NE3 and NE4 is faulty. When
this occurs, remove the fault on the path.
4. Create the west IEEE 1588v2 port that is deleted in step 1 to switch the IEEE 1588v2 clock
signals to working path NE1–NE2–NE3.
5. Record the time precision on NE2 and that on NE3. Then calibrate the original compensated
delay on NE2 and that on NE3 according to the time precision on NE2 and the time precision
on NE3. For the delay compensation method, see Testing Delay Compensation.
6. Ensure that the time precision on NE2 and that on NE3 satisfy requirements.
7. Verify that the status of all PTP clock ports is normal.
Symptom
In the case of a clock processing failure, the EXT_TIME_LOC, SYNC_C_LOS,
S1_SYN_CHANGE, LTI, SYN_BAD, EXT_SYNC_LOS, or CLK_NO_TRACE_MODE
alarm occurs.
When you clear the alarm reported by the equipment, the fault is rectified.
Impact on System
When the clock source of the network is lost or degrades, the services that trace the clock source
are affected. In this case, pointer justification occurs and BER increases.
Possible Causes
l Cause 1: In the priority table, the synchronous time source for the service board is lost.
l Cause 2: The synchronous clock source is lost and the clock of the NE is in an abnormal
state.
l Cause 3: In SSM mode, the clock source is switched and the clock source traced by the NE
is also switched.
l Cause 4: The signals of the synchronous clock source are degraded.
l Cause 5: The external clock source is lost.
l Cause 6: The clock enters the non-tracing working mode.
Procedure
l Cause 1: In the priority table, the synchronous time source for the service board is lost.
1. Check for the SYNC_C_LOS alarm of the STG board on U2000. For details, refer to
"Querying the Current Alarms and Performance Events of a Component on the
U2000" in Supporting Tasks.
2. If the SYNC_C_LOS alarm occurs, handle the alarm according to the Alarms and
Performance Events Reference.
l Cause 2: The synchronous clock source is lost and the clock of the NE is in an abnormal
state.
1. Check for the LTI alarm of the STG board on U2000. For details, refer to "Querying
the Current Alarms and Performance Events of a Component on the U2000" in
Supporting Tasks.
2. If the LTI alarm occurs, handle the alarm according to the Alarms and Performance
Events Reference.
l Cause 3: In SSM mode, the clock source is switched and the clock source traced by the NE
is also switched.
1. Check for the S1_SYN_CHANGE alarm of the STG board on U2000. For details,
refer to "Querying the Current Alarms and Performance Events of a Component on
the U2000" in Supporting Tasks.
2. If the S1_SYN_CHANGE alarm occurs, handle the alarm according to the Alarms
and Performance Events Reference.
l Cause 4: The signals of the synchronous clock source are degraded.
1. Check for the SYN_BAD alarm of the STG board on U2000. For details, refer to
"Querying the Current Alarms and Performance Events of a Component on the
U2000" in Supporting Tasks.
2. If the SYN_BAD alarm occurs, handle the alarm according to the Alarms and
Performance Events Reference.
l Cause 5: The external clock source is lost.
1. Check for the EXT_SYNC_LOS alarm of the STG board on U2000. For details, refer
to "Querying the Current Alarms and Performance Events of a Component on the
U2000" in Supporting Tasks.
2. If the EXT_SYNC_LOS alarm occurs, handle the alarm according to the Alarms and
Performance Events Reference.
----End
19 Orderwire Problems
19.2 A Called Phone Set Cannot Exit the Conference Call State
This topic describes the possible cause and troubleshooting method when a called phone set in
a conference call fails to exit the conference call state.
Symptom
The services on the main optical path from station A to station C are normal, but the orderwire
is unavailable.
OLA
OTM OTM
System Impact
The services are not affected.
Possible Causes
l Cause 1: The parameters are incorrectly configured.
l Cause 2: The setting of the phone set is incorrect.
l Cause 3: The phone set or phone line is faulty.
l Cause 4: The fiber connection of the SC2 board at station B is incorrect.
l Cause 5: The system control board at station B is faulty.
Procedure
l Cause 1: The parameters are incorrectly configured.
1. On the U2000, check the phone numbers of stations A and C. Ensure that each phone
number is correctly set to a unique value. For details, refer to "Configuring Orderwire".
2. On the U2000, check the setting of the OSC boards at stations A, B, and C. Set
orderwire bytes to pass through each station; otherwise, the orderwire of station A and
that of station C are unavailable.
3. On the U2000, check the clock tracing settings of stations A and C. For details on how
to set the correct clock tracing, see "Setting the Priority List of Clock Sources" or
"Setting IEEE 1588v2" in the Configuration Guide.
l Cause 2: The setting of the phone set is incorrect.
1. Check the port to which the phone set is connected and ensure that the interface is
correctly selected.
NOTE
Generally, the orderwire phone line is inserted on the "PHONE1" port of the subrack.
2. Check whether the ringing current switch RING on the phone set is set to ON
(indicating that the phone rings when there is an incoming call).
3. Check whether the dialing mode switch is set to T (indicating the dual-tone
multifrequency (DTMF) dialing mode).
4. Check whether the called orderwire phone is on hook. If the red indicator in the upper
right corner on the front panel of the phone set is unlit, the phone set is on hook. If the
red indicator is lit, the phone set is not on hook. Press the "TALK" button on the front
panel of the phone set to hook it up.
l Cause 3: The phone set or phone line is faulty.
1. Install a normal phone set at stations A and C and test the calling function of the phone
set. If the function is available, the original phone set is faulty.
2. Replace the phone line between the phone set and the equipment at stations A and C,
and test the calling function. If the function is available, the phone line is faulty.
l Cause 4: The fiber connection of the SC2 board at station B is incorrect.
1. Along the signal flow, check the pigtail connected to the SC2 board at station B. Ensure
that the connection is correct.
l Cause 5: The system control board at station B is faulty.
1. Remove the system control board at station B to make orderwire bytes pass through
station B. If the orderwire becomes available, the system control board is faulty.
Replace the system control board.
----End
Symptom
A called phone set in a conference call fails to exit the conference call state. That is, the called
phone set is still in the conference call state after being connected and then disconnected.
System Impact
The called phone set fails to exit the conference call state, and the orderwire is unavailable.
Possible Causes
l Cause 1: During the conference call, the calling orderwire board of a certain phone set is
reset either due to an external cause or because the phone set was connected when
communication on the supervisory channel failed.
Procedure
l Cause 1: During the conference call, the calling orderwire board of a certain phone set is
reset either due to an external cause or because the phone set was connected when
communication on the supervisory channel failed.
1. Leave the phone set in the connected state for two hours. The phone set will return to
the idle state two hours later.
2. Use an idle phone set to initiate a conference call. Then, connect this phone set, causing
all phone sets in orderwire communication with this phone set to exit the conference
call state.
----End
NOTE
The following cases are related to the OptiX WDM product series.
Related Cases:
l MC-A42 A Fault of the PMU for the OptiX BWS 1600G Results In an Orderwire Ringing
Failure
l MC-A163 How To Quickly Re-obtain the IP Address Of a Device At a Station After the
IP Address Of the Device Is Changed
l MC-A187 How to Change the IP Address of the OptiX OSN 1800 to the Default One
l MC-A223 When a Self-Made Network Cable Is Used, the OptiX OSN 6800 Reports the
COMMUN_FAIL Alarm
l MC-A224 The Network Cable at the COM Port on the EFI Board of the OptiX OSN 6800
Subrack Is Incorrectly Connected. Consequently, All the Boards on the OptiX OSN 6800
Subrack Report the COMMUN_FAIL Alarms
This section describes how to troubleshoot coherent board faults in terms of the symptoms,
impact on the system, possible causes, tools required for troubleshooting, troubleshooting
procedure, and precautions that should be taken during the troubleshooting.
Symptoms
l Symptom 1: The pre-FEC BER is high, services are unavailable, or the wavelength of the
received service is incorrect.
l Symptom 2: DCN communication is interrupted and some NEs are unreachable by the
NMS.
Possible Causes
l For symptom 1:
– Cause 1: The receive wavelength is incorrectly configured.
– Cause 2: The FEC type of the board is incorrectly set.
l For symptom 2:
– Cause 1: Coherent and non-coherent boards are interconnected.
Procedure
Step 1 For symptom 1:
1. Cause 1: The receive wavelength is incorrectly configured.
a. Check whether the receive wavelength has been configured for the coherent board. If
not, the system automatically associates the receive and transmit wavelengths when
1 2
B1
11LTX
A C
B2
11LTX
1 2
2. Cause 2: The FEC type of the board is incorrectly set.
a. On the U2000, check whether the board supports the preset FEC type. Currently,
coherent boards support HFEC only. If the board does not support the preset FEC
type, set the FEC type as HFEC.
----End
Related Information
Coherent boards are heavy and their components are sensitive, so special package materials are
required and boards must be packaged in strict compliance with the instructions. During
deployment, keep the removed package materials in case of return for repair in future. If boards
are improperly packaged, they will be easily damaged during transportation, installation, and
return for repair, resulting in unnecessary loss.
Storing Package Materials: Package materials must be free of water and must be handled with
care. They can be stacked.
CAUTION
A board must be put into a foam box with the front panel towards a correct direction. Otherwise,
connectors and optical modules on the board maybe seriously damaged by the stress of the foam
box.
21 Interconnection Faults
This topic describes the possible causes and troubleshooting methods for WDM and OTN
equipment interconnection failures.
Symptom
After a piece of WDM equipment is interconnected to a piece of data equipment, the two pieces
of equipment fail to communicate with each other and the equipment might report the R_LOS,
LINK_ERR and R_OOF alarms.
Impact on System
The two pieces of interconnected equipment fail to communicate with each other.
Possible Causes
l Cause 1: The line between the WDM equipment and the data equipment is faulty.
l Cause 2: The parameter configuration of interconnected board of the WDM equipment on
the client side is incorrect.
l Cause 3: Either or both of the interconnected boards is/are faulty.
Procedure
Step 1 Cause 1: The line between the WDM equipment and the data equipment is faulty. In this case,
interconnected board of the WDM equipment may report the R_LOS, LINK_ERR alarms on
the client side.
1. Check whether the optical modules of the interconnected boards are of the same type. If
they are of the same type, check whether the supported transmission distance of the optical
modules is equal or longer than the actual distance away from the client side. The single-
mode optical module and the multimode optical module cannot be used together.
2. Check whether the fibers of the interconnected boards match each other. The single-mode
fiber and the multimode fiber cannot be used together.
NOTE
Generally, the multimode fiber is nacarat-colored, and the single-mode fiber is yellow.
3. Check whether any fiber is inserted in an incorrect interface according to the fiber labels.
If any fiber is inserted in an incorrect interface, the equipment might not generate the
R_LOS alarm but generates a large number of performance events against anomalies. In
this case, insert the fiber in the correct interface.
4. Use an optical power meter to test the interconnect fiber jumper. If the fiber jumper is
abnormal, replace the fiber jumper. If the fiber jumper is good, check whether the optical
cable is faulty.
Step 2 Cause 2: The parameter configuration of interconnected board of the WDM equipment on the
client side is incorrect. In this case, interconnected board of the WDM equipment may report
R_LOS, LINK_ERR alarms on the client side.
1. In the case of the OTU board that can access various service data, check whether the service
type settings of the OTU board are different from those of the data equipment.
l On the interconnected board of the WDM equipment, query and set the client service
type of the board. Make sure that the service type of the WDM equipment is the same
as the interconnected data equipment.
2. Check whether the auto-negotiation modes of the optical interfaces on the interconnected
boards are different from each other.
l Querying and Setting the Working Mode of the Ethernet Port. Make sure that the auto-
negotiation modes are the same.
3. The Maximum Package Length of the optical interfaces on the interconnected boards are
different from each other.
l Log in to the U2000. In the NE Explorer interface, select the interconnected board.
l In Function Tree on the left side, choose Configuration > WDM Interface.
l Select Advanced Attributes, and click Query. Make sure that the Maximum Package
Length of the optical interfaces on the interconnected boards are same.
4. Check whether any interconnected interface is looped back.
l Log in to the U2000. In the NE Explorer interface, select the interconnected board.
l In Function Tree on the left side, choose Configuration > WDM Interface.
l Select Basic Attributes, and click Query. Check whether the optical interface on the
client side of each board is looped back. If yes, double-click the Loopback field. In the
drop-down list, select Non-Loopback and click Apply.
----End
Symptom
After a piece of WDM equipment is interconnected to a piece of SDH equipment, the two pieces
of equipment fail to communicate with each other and the equipment might report the R_LOS,
R_LOF, and R_OOF alarms.
If the equipment reports the B1_EXC, B1_SD, B2_EXC, and B2_SD alarms, it indicates that
bit errors occur in the one of the two pieces of interconnected equipment. For details on how to
clear the fault, see Single-Channel Bit Errors.
Impact on System
The communication between the interconnected equipment is unavailable.
Possible Causes
l Cause 1: The line between the WDM equipment and the SDH equipment is faulty.
l Cause 2: The parameter configuration of interconnected board of the WDM equipment on
the client side is incorrect.
l Cause 3: Either or both of the interconnected boards is/are faulty.
Procedure
Step 1 Cause 1: The line between the WDM equipment and the SDH equipment is faulty. In this case,
interconnected board of the WDM equipment may report the R_LOS and R_LOF alarms on the
client side.
1. Check whether the optical modules of the interconnected boards are of the same type. If
they are of the same type, check whether the supported transmission distance of the optical
modules is equal or longer than the actual distance away from the client side. The single-
mode optical module and the multimode optical module cannot be used together.
2. Check whether the fibers of the interconnected boards match each other. The single-mode
fiber and the multimode fiber cannot be used together.
NOTE
Generally, the multimode fiber is nacarat-colored, and the single-mode fiber is yellow.
3. Check whether any fiber is inserted in an incorrect interface according to the fiber labels.
If any fiber is inserted in an incorrect interface, the equipment might not generate the
R_LOS alarm but generates a large number of performance events against anomalies. In
this case, insert the fiber in the correct interface.
4. Use an optical power meter to test the interconnect fiber jumper. If the fiber jumper is
abnormal, replace the fiber jumper. If the fiber jumper is good, check whether the optical
cable is faulty.
Step 2 Cause 2: The parameter configuration of interconnected board of the WDM equipment on the
client side is incorrect. In this case, interconnected board of the WDM equipment may report
R_OOF alarm on client side.
1. Check whether the service types of the interconnected boards mismatch each other.
l On the interconnected board of the WDM equipment, query and set the client service
type of the board. Make sure that the service type of the WDM equipment is the same
as the interconnected SDH equipment.
2. Check whether any interconnected interface is looped back.
l Log in to the U2000. In the NE Explorer interface, select the interconnected board.
l In Function Tree on the left side, choose Configuration > WDM Interface.
l Select Basic Attributes, and click Query. Check whether the optical interface on the
client side of each board is looped back. If yes, double-click the Loopback field. In the
drop-down list, select Non-Loopback and click Apply.
----End
Symptom
After local WDM equipment is interconnected with other OTN equipment, the two devices fail
to communicate with each other and the U2000 may report R_LOS, OTUk_LOF, and
OTUk_LOM alarms.
System Impact
Interconnected devices are unable to communicate with each other.
Possible Causes
l Cause 1: The line between the local WDM equipment and the other OTN equipment is
faulty.
l Cause 2: The client-side configuration for the interconnected board on the WDM equipment
is incorrect.
l Cause 3: One or both of the interconnected boards is/are faulty.
Procedure
Step 1 Cause 1: The line between the local WDM equipment and the other OTN equipment is faulty.
In this case, the interconnected board on the WDM equipment may report the R_LOS alarm on
the client side.
1. Check whether the optical modules of the interconnected boards are of the same type. If
they are of the same type, check whether the supported transmission distance of the optical
modules is equal or longer than the actual distance away from the client side. The single-
mode optical module and the multimode optical module cannot be used together.
2. Check whether the optical fibers of the interconnected boards match each other. The single-
mode optical fiber and the multimode optical fiber cannot be used together.
NOTE
Generally, the multimode optical fiber is nacarat, and the single-mode fiber is yellow.
3. Check whether any optical fiber is installed on an incorrect port according to the optical
fiber labels. If any optical fiber is installed on an incorrect port, the R_LOS alarm might
not be generated but a large number of performance events against anomalies may be
generated on the equipment. In this case, install the optical fiber on the correct port.
4. Use an optical power meter to test the pigtail interconnected with the equipment. If the
pigtail is abnormal, replace it. If it is normal, check whether the optical cable is faulty.
Step 2 Cause 2: The client-side configuration for the ports on the interconnected board of the WDM
equipment is incorrect. In this case, the interconnected board on the WDM equipment may report
the OTUk_LOF and OTUk_LOM alarms on the client side.
1. Check whether the service types, FEC mode, working modes, board mode of the
interconnected boards are inconsistent.
l Query and set the client service type of the interconnected board of the WDM
equipment. Ensure that the service type of the WDM equipment is the same as that of
the interconnected OTN equipment.
2. Check whether a loop is occurring on any interconnected port.
l Log in to the U2000. In NE Explorer, select the interconnected NE.
l Choose Configuration > WDM Interface from the Function Tree.
l On the Basic Attributes tab, click Query to check whether a loopback is enabled on
any optical port on the client side of each board. If a loopback is enabled, double-click
the Loopback field. In the drop-down list, select Non-Loopback and click Apply.
Step 3 Cause 3: One or both of the interconnected boards is/are faulty.
1. Use the optical power meter to test the optical power of the client-side optical port on the
interconnected board of the WDM equipment. If the optical power is abnormal, replace the
board.
2. Use the optical power meter to test the optical power of the optical port on the
interconnected board of the OTN equipment. If the optical power is abnormal, replace the
board.
----End
NOTE
The following cases are related to the OptiX WDM product series.
Related Cases:
l MC-A4 The LOG Board Fails to Interwork With the FDG Board on the Client Sides. The
LOG board reports the R_LOS alarm on the client side. The FDG board reports the
LINK_STATUS alarm.
l MC-A5 The LQG Board Reports the ALM_DATA_RLOS and ALM_DATA_TLOS
Alarms Transiently
l MC-A6 The LDG Board Keeps Reporting the ALM_DATA_TLOS and
ALM_DATA_RLOS Alarms Transiently
l MC-A11 The SSE3LWF Board Reports the OTU_LOF Alarm When Interworking with
the SSE1TMR
l MC-A17 The LQG Reports ALM_DATA_RLOS and ALM_DATA_TLOS Alarms
l MC-A21 An Interconnection Failure between the WDM and SDH Equipment
l MC-A30 LDG Board of Metro 6100 Equipment Reports the INBADOCTS_OVER Alarm
l MC-A44 Bit Errors Generated in the Services, the Meter Reports the HP_RDI and ERROR
Alarms
l MC-A49 An Incorrect Setting of CRC Results in an Interconnection Failure
l MC-A67 The GE Port on the Client Side Reports LINK_DOWN Alarm
l MC-A74 The FDG Reports the T_DATA_LOST and R_DATA_LOST Alarms
l MC-A75 Bandwidth Decreases and Service Rate Becomes Lower Due to Improper Setting
l MC-A84 Maximal Packet Length Setting of the OptiX OSN 6800 Causes the Abnormal
Service
l MC-A117 Analysis and Handling of the ALM_DATA_RLOS and ALM_DATA_TLOS
Alarms Reported by the LBE
l MC-A118 An R_OOF Alarm Occurs When the LWC Interconnects with Third-Party
Equipment
l MC-A153 It Is Recommended to Enable the Auto-Negotiation Working Mode at a GE Port
When the LPT Function Is Enabled, Or The Services Are Interrupted And The System
Reports The R_LOS Alarms
l MC-A154 The N2SLQ16 Board Interconnected with the TQS Board Reports an R_LOS
Alarm Because The TQS Board Emits White Lights When the Input Optical Power Is
Normal
l MC-A156 An SDH Board on the SDH Equipment Reports an R_LOF Alarm When the
SDH Board Is Interconnected with a Tributary Board on the OptiX OSN 6800
l MC-A185 Incorrect Client-Side Service Type Causes Failure of Interconnection With a
Router
l MC-A249 The OTN Equipment Fails to Transparently Transmit Ethernet Clock Signals
A.1 Overview
This topic provides remote maintenance overview.
A.1 Overview
This topic provides remote maintenance overview.
The Huawei optical transmission equipment series supports remote maintenance. A remote
computer (remote end) can connect to an on-site U2000 server (local end) through a public
switched telephone network (PSTN) or through the Internet. The remote computer then can
perform maintenance to the equipment.
serial
port Modem
PSTN/
Internet serial
Modem port
Remote
maintenance
ternimal
U2000
Server
WDM
network
During remote maintenance, the U2000 server needs to cooperate with the remote end to perform
the following operations:
Prerequisites
You must be an NM user with "Network administrator" authority or higher.
Context
A remote maintenance user is the NM user who logs in to the U2000 server to maintain the
equipment. Remote maintenance users are in Disable status by default. You need to enable it
before you start the remote maintenance.
Procedure
Step 1 In the Main Menu choose System > Remote Maintenance User Management to display the
Remote Maintenance User Management dialog box.
Step 2 Set the Disable/Enable to be Enable. Set the other attributes of the remote user.
----End
Prerequisites
The communication connection between the remote maintenance terminal and the U2000 server
must have been configured.
The user name and password of the U2000 server must be known to maintenance personnel at
the remote maintenance end.
Procedure
Step 1 On the U2000 server, query the connection status for remote maintenance. If the remote
maintenance connection is normal, go to step 5. If not, go to step 2.
l If the Windows operating system is installed, right-click My Network Places and select
Properties to query the connection status for remote maintenance.
l If the Solaris or Linux operating system is installed, open a terminal window and run the
ifconfig -a command to query the connection status for remote maintenance.
Step 2 Double-click the shortcut icon, which is created for dial-up connection, on the desktop, such as
Remote Maintenance to perform dial-up connection.
Step 3 Enter the user name (ppp_user) and password in the displayed dialog box.
Step 4 Enter the user name and password. Press the Enter key and click the D button at the right bottom
corner to establish the connection.
NOTE
After you enter the password and press the Enter key, a line of illegal characters is displayed. This is normal.
l If the Windows operating system is installed, choose Start > All Programs > Accessories
> Command Prompt. Enter ipconfig in the dialog box and query the dynamic IP address
for dial-up access to network.
l If the Solaris or Linux operating system is installed, display a terminal window and run the
ifconfig -a command to query dynamic IP address for dial-up access to network.
NOTE
The address for dial-up access is IP address. If the dial-up connection of the server is disconnected, re-dial
to get a new IP address.
Step 6 Inform the maintenance personnel at the remote end of the queried IP address.
Step 7 Establish the dial-up connection between the remote maintenance personnel and the U2000
server. Log in to the U2000 client. After the login succeeds, you can perform the remote
maintenance.
CAUTION
To ensure network security, set the remote maintenance user to Disabled after the remote
maintenance.
----End
B Glossary
Numerics
1+1 backup A backup method in which two components mirror each other. If the active component
goes down, the standby component takes over services from the active component to
ensure that the system service is not interrupted.
1000BASE-T An Ethernet specification that uses the twisted pair cable with the transmission speed as
1000 Mbit/s and the transmission distance as 100 meters.
3R reshaping, retiming, regenerating
A
AC alternating current
ACL See access control list.
ADM add/drop multiplexer
ADSL See asymmetric digital subscriber line.
AGC automatic gain control
AIS alarm indication signal
ALC link A piece of end-to-end configuration information, which exists in the equipment (single
station) as an ALC link node. Through the ALC function of each node, it fulfills optical
power control on the line that contains the link.
ALS See automatic laser shutdown.
ANSI See American National Standards Institute.
APE See automatic power equilibrium.
APS automatic protection switching
ASE amplified spontaneous emission
ASIC See application-specific integrated circuit.
ATM asynchronous transfer mode
American National An organization that defines U.S standards for the information processing industry.
Standards Institute
(ANSI)
access control list A list of entities, together with their access rights, which are authorized to access a
(ACL) resource.
active/standby A troubleshooting technology. When an active device becomes faulty, services and
switchover control functions are automatically switched over to the standby device to ensure the
normal running of the services and functions.
alarm cable A cable used to transmit visual or audio alarms.
alarm cascading The method of cascading alarm signals from several subracks or cabinets.
alarm output Node or other signals that are sent by an alarm controller to peripheral devices when an
alarm is reported.
alarm suppression A method to suppress alarms for the alarm management purpose. Alarms that are
suppressed are no longer reported from NEs.
application-specific A special type of chip that starts out as a nonspecific collection of logic gates. Late in
integrated circuit the manufacturing process, a layer is added to connect the gates for a specific function.
(ASIC) By changing the pattern of connections, the manufacturer can make the chip suitable for
many needs.
asymmetric digital A technology for transmitting digital information at a high bandwidth on existing phone
subscriber line (ADSL) lines to homes and businesses. Unlike regular dialup phone service, ADSL provides
continuously-available, "always on" connection. ADSL is asymmetric in that it uses most
of the channel to transmit downstream to the user and only a small part to receive
information from the user. ADSL simultaneously accommodates analog (voice)
information on the same line. ADSL is generally offered at downstream data rates from
512 kbit/s to about 6 Mbit/s.
attenuation Reduction of signal magnitude or signal loss, usually expressed in decibels.
attenuator A device used to increase the attenuation of an Optical Fiber Link. Generally used to
ensure that the signal at the receive end is not too strong.
automatic laser A technique (procedure) to automatically shutdown the output power of laser transmitters
shutdown (ALS) and optical amplifiers to avoid exposure to hazardous levels.
automatic power A function to automatically equalize channel optical power at the transmitter end,
equilibrium (APE) ensuring a required optical power flatness and OSNR at the receiver end.
B
B/S browser/server
BBER background block error ratio
BC boundary clock
BDI See backward defect indication.
BGP Border Gateway Protocol
BIOS See basic input/output system.
BIP See bit interleaved parity.
bit interleaved parity-8 Consists of a parity byte calculated bit-wise across a large number of bytes in a
(BIP-8) transmission transport frame. Divide a frame is into several blocks with 8 bits (one byte)
in a parity unit and then arrange the blocks in matrix. Compute the number of "1" or "0"
over each column. Then fill a 1 in the corresponding bit for the result if the number is
odd, otherwise fill a 0.
bit/s See bits per second.
bits per second (bit/s) A rate at which the individual bits are transmitted through a communication link or
circuit. Its unit can be bit/s, kbit/s, and Mbit/s.
bridge A device that connects two or more networks and forwards packets among them. Bridges
operate at the physical network level. Bridges differ from repeaters because bridges store
and forward complete packets, while repeaters forward all electrical signals. Bridges
differ from routers because bridges use physical addresses, while routers use IP
addresses.
bridge protocol data Data messages exchanged across switches within an extended LAN that uses a spanning
unit (BPDU) tree protocol (STP) topology. BPDU packets contain information on ports, addresses,
priorities, and costs, and they ensure that the data reaches its intended destination. BPDU
messages are exchanged across bridges to detect loops in a network topology. These
loops are then removed by shutting down selected bridge interfaces and placing
redundant switch ports in a backup, or blocked, state.
broadband remote A new type of access gateway for broadband networks. As a bridge between backbone
access server (BRAS) networks and broadband access networks, BRAS provides methods for fundamental
access and manages the broadband access network. It is deployed at the edge of network
to provide broadband access services, convergence, and forwarding of multiple services,
meeting the demands for transmission capacity and bandwidth utilization of different
users. BRAS is a core device for the broadband users' access to a broadband network.
broadcast A means of delivering information to all members in a network. The broadcast range is
determined by the broadcast address.
broadcast address A network address in computer networking that allows information to be sent to all nodes
on a network, rather than to a specific network host.
building integrated In the situation of multiple synchronous nodes or communication devices, one can use
timing supply (BITS) a device to set up a clock system on the hinge of telecom network to connect the
synchronous network as a whole, and provide satisfactory synchronous base signals to
the building integrated device. This device is called BITS.
bus A path or channel for signal transmission. The typical case is that, the bus is an electrical
connection that connects one or more conductors. All devices that are connected to a
bus, can receive all transmission contents simultaneously.
byte A unit of computer information equal to eight bits.
C
CAR committed access rate
CBR See constant bit rate.
CE See customer edge.
CES See circuit emulation service.
common public radio A common standard of the key internal interface between the REC and the RE of the
interface (CPRI) wireless base station. This standard was established by Huawei, Ericsson, NEC, Siemens,
and Nortel in June 2003. It aims at standardizing the baseband and RF interface. The
CPRI has a set of mature standards, which advance the standard and equipment. The
major feature of the CPRI is that baseband is separated from RF to reduce the cost of
engineering, equipment room, and equipment.
configuration data A command file defining hardware configurations of an NE. With this file, an NE can
collaborate with other NEs in a network. Therefore, configuration data is the key factor
that determines the operation of an entire network.
connection point A reference point where the output of a trail termination source or a connection is bound
to the input of another connection, or where the output of a connection is bound to the
input of a trail termination sink or another connection. The connection point is
characterized by the information which passes across it. A bidirectional connection point
is formed by the association of a contradirectional pair.
consistency check A function that is used to check the consistency of service data and resource data between
two softswitches that have the dual homing relation. This ensures the consistency of
service data and resource data between the softswitches.
constant bit rate (CBR) A kind of service categories defined by the ATM forum. CBR transfers cells based on
the constant bandwidth. It is applicable to service connections that depend on precise
clocking to ensure undistorted transmission.
control VLAN A VLAN that transmits only protocol packets.
convergence layer A "bridge" between the access layer and the core layer. The convergence layer provides
the convergence and forwarding functions for the access layer. It processes all the traffic
from the access layer devices, and provides the uplinks to the core layer. Compared with
the access layer, the convergence layer devices should have higher performance, fewer
interfaces and higher switching rate. In the real network, the convergence layer refers to
the network between UPEs and PE-AGGs.
cross-connection The connection of channels between the tributary board and the line board, or between
line boards inside the NE. Network services are realized through the cross-connections
of NEs.
customer edge (CE) A part of the BGP/MPLS IP VPN model that provides interfaces for directly connecting
to the Service Provider (SP) network. A CE can be a router, switch, or host.
cyclic redundancy A procedure used to check for errors in data transmission. CRC error checking uses a
check (CRC) complex calculation to generate a number based on the data transmitted. The sending
device performs the calculation before performing the transmission and includes the
generated number in the packet it sends to the receiving device. The receiving device
then repeats the same calculation. If both devices obtain the same result, the transmission
is considered to be error free. This procedure is known as a redundancy check because
each transmission includes not only data but extra (redundant) error-checking values.
D
DAPI destination access point identifier
DBPS distributed board protect system
DC direct current
distributed link A board-level port protection technology that detects unidirectional fiber cuts and
aggregation group negotiates with the opposite port. In the case of a link down failure on a port or hardware
(DLAG) failure on a board, services are automatically switched to the slave board, thereby
achieving 1+1 protection for the inter-board ports.
dual feed and selective A channel used to transmit monitoring data on an optical transmission network. The
receiving monitoring data is transmitted on the data communications channel as part of the
overhead of the service signal.
dual-ended switching A protection method in which switching is performed at both ends of a protected entity,
such as a connection or path, even if a unidirectional failure occurs.
E
E-LAN See Ethernet local area network.
E-Line See Ethernet line.
E1 An European standard for high-speed data transmission at 2.048 Mbit/s. It provides
thirty-two 64 kbit/s channels. A time division multiplexing frame is divided in to 32
timeslots numbered from 0 to 31. Timeslot 0 is reserved for frame synchronization, and
timeslot 16 is reserved for signaling transmission. The rest 30 timeslots are use as speech
channels. Each timeslot sends or receives an 8-bit data per second. Each frame sends or
receives 256-bit data per second. 8000 frames will be sent or received per second.
Therefore the line data rate is 2.048 Mbit/s.
EAPE enhanced automatic power pre-equilibrium
ECC See embedded control channel.
EDFA See erbium-doped fiber amplifier.
EEPROM See electrically erasable programable read-only memory.
EMI See electromagnetic interference.
EPL See Ethernet private line.
EPLAN See Ethernet private LAN service.
EPLD See erasable programmable logical device.
EPON See Ethernet passive optical network.
ERPS Ethernet ring protection switching
ES edge server
ESC See electric supervisory channel.
ESCON See enterprise system connection.
ESD electrostatic discharge
ESN See equipment serial number.
ETS European Telecommunication Standards
ETSI See European Telecommunications Standards Institute.
EVOA electrical variable optical attenuator
EVPL See Ethernet virtual private line.
F
FC See Fibre Channel.
FCS free cooling system
FDB See forwarding database.
FDDI See fiber distributed data interface.
FE fast Ethernet
FEC See forward error correction.
FICON See Fiber Connect.
FOADM fixed optical add/drop multiplexer
FPGA See field programmable gate array.
FTP File Transfer Protocol
Fiber Connect A new generation connection protocol that connects the host to various control units. It
(FICON) carries a single byte command protocol through the physical path of fiber channel, and
provides a higher transmission rate and better performance than ESCON.
Fibre Channel (FC) A high-speed transport technology used to build SANs. FC is primarily used for
transporting SCSI traffic from servers to disk arrays, but it can also be used on networks
carrying ATM and IP traffic. FC supports single-mode and multi-mode fiber
connections, and can run on twisted-pair copper wires and coaxial cables. FC provides
both connection-oriented and connectionless services.
fault alarm A type of alarm caused by hardware and/or software faults, for example, board failure,
or by the exception that occurs in major functions. After handling, a fault alarm can be
cleared, upon which the NE reports a recovery alarm. Fault alarms are of higher severity
than event alarms.
fiber distributed data A standard developed by the American National Standards Institute (ANSI) for high-
interface (FDDI) speed fiber-optic LANs. FDDI provides specifications for transmission rates of 100
megabits per second on token ring networks.
field programmable A semi-customized circuit that is used in the Application Specific Integrated Circuit
gate array (FPGA) (ASIC) field and developed based on programmable components. FPGA remedies many
of the deficiencies of customized circuits, and allows the use of many more gate arrays.
forced switching The action of switching traffic signals between a working channel and protection
channel. The switching occurs even if the channel to which traffic is being switched is
faulty or an equal or higher priority switching command is in effect.
forward error A bit error correction technology that adds correction information to the payload at the
correction (FEC) transmit end. Based on the correction information, the bit errors generated during
transmission can be corrected at the receive end.
forwarding database A type of database that Includes entries for guiding multicast data forwarding. There are
(FDB) Layer 2 FDB and Layer 3 FDB. The Layer 2 FDB refers to the MAC table, which provides
information about the MAC address and outbound interface and guides Layer 2
forwarding. The Layer 3 FDB refers to the ARP table, which provides information about
the IP address and outbound interface and guides Layer 3 forwarding.
G
GCC general communication channel
GE Gigabit Ethernet
GFP See Generic Framing Procedure.
GMP Group Map Protocol
GMPLS generalized multiprotocol label switching
H
HD-SDI high definition serial digital interface
HSDPA See High Speed Downlink Packet Access.
High Speed Downlink A modulating-demodulating algorithm put forward in 3GPP R5 to meet the requirement
Packet Access for asymmetric uplink and downlink transmission of data services. It enables the
(HSDPA) maximum downlink data service rate to reach 14.4 Mbit/s without changing the
WCDMA network topology.
handle A component of the panel. It is used to insert or remove boards in and out of slots.
hardware loopback A connection mode in which a fiber jumper is used to connect the input optical interface
of a board to the output optical interface of the board to achieve signal loopback.
hop A network connection between two distant nodes. For Internet operation a hop represents
a small step on the route from one main computer to another.
hot patch A patch that is used to repair a deficiency in the software or add a new feature to a program
without restarting the software and interrupting the service. For the equipment using the
built-in system, a hot patch can be loaded, activated, confirmed, deactivated, deleted, or
queried.
I
IANA See Internet Assigned Numbers Authority.
IC See integrated circuit.
ICMP See Internet Control Message Protocol.
IEEE See Institute of Electrical and Electronics Engineers.
IETF Internet Engineering Task Force
IGMP See Internet Group Management Protocol.
IMA See inverse multiplexing over ATM.
IP Internet Protocol
IP subnet A special submap used to identify an IP network segment. It is displayed as the submap
icon in the topological view.
IPA See intelligent power adjustment.
IPv4 See Internet Protocol version 4.
IPv6 See Internet Protocol version 6.
IS-IS See Intermediate System to Intermediate System.
ISDN integrated services digital network
ISO International Organization for Standardization
ITU-T International Telecommunication Union-Telecommunication Standardization Sector
Institute of Electrical A professional association of electrical and electronics engineers based in the United
and Electronics States, but with membership from numerous other countries. The IEEE focuses on
Engineers (IEEE) electrical, electronics, and computer engineering, and produces many important
technology standards.
Intermediate System to A protocol used by network devices (routers) to determine the best way to forward
Intermediate System datagram or packets through a packet-based network.
(IS-IS)
Internet Assigned A department operated by the IAB. IANA delegates authority for IP address-space
Numbers Authority allocation and domain-name assignment to the NIC and other organizations. IANA also
(IANA) maintains a database of assigned protocol identifiers used in the TCP/IP suite, including
autonomous system numbers.
Internet Control A network layer protocol that provides message control and error reporting between a
Message Protocol host server and an Internet gateway.
(ICMP)
Internet Group One of the TCP/IP protocols for managing the membership of Internet Protocol multicast
Management Protocol groups. It is used by IP hosts and adjacent multicast routers to establish and maintain
(IGMP) multicast group memberships.
Internet Protocol The current version of the Internet Protocol (IP). IPv4 utilizes a 32bit address which is
version 4 (IPv4) assigned to hosts. An address belongs to one of five classes (A, B, C, D, or E) and is
written as 4 octets separated by periods and may range from 0.0.0.0 through to
255.255.255.255. Each IPv4 address consists of a network number, an optional
subnetwork number, and a host number. The network and subnetwork numbers together
are used for routing, and the host number is used to address an individual host within the
network or subnetwork.
Internet Protocol An update version of IPv4, which is designed by the Internet Engineering Task Force
version 6 (IPv6) (IETF) and is also called IP Next Generation (IPng). It is a new version of the Internet
Protocol. The difference between IPv6 and IPv4 is that an IPv4 address has 32 bits while
an IPv6 address has 128 bits.
indicator Description of a performance feature collected from the managed devices by the
performance collector.
integrated circuit (IC) A combination of inseparable associated circuit elements that are formed in place and
interconnected on or within a single base material to perform a microcircuit function.
intelligent power A technology that reduces the optical power of all the amplifiers in an adjacent
adjustment (IPA) regeneration section in the upstream to a safe level if the system detects the loss of optical
signals on the link. IPA helps ensure that maintenance engineers are not injured by the
laser escaping from a broken fiber or a connector that is not plugged in properly.
intersecting ring More than two public nodes between two ring networks. The network is more complex
if there are many intersecting nodes. Most devices support that two rings are intersected
at two nodes.
inverse multiplexing A technique that involves inverse multiplexing and de-multiplexing of ATM cells in a
over ATM (IMA) cyclical fashion among links grouped to form a higher bandwidth logical link whose rate
is approximately the sum of the link rates.
J
jitter The measure of short waveform variations caused by vibration, voltage fluctuations, and
control system instability.
jumper A connection wire for connecting two pins.
L
L2 switching The switching based on the data link layer.
L2VPN Layer 2 virtual private network
L3VPN Layer 3 virtual private network
LACP See Link Aggregation Control Protocol.
LAG See link aggregation group.
LAN See local area network.
LAPD link access procedure on the D channel
LB See loopback.
LC Lucent connector
LCAS See link capacity adjustment scheme.
LCT local craft terminal
LDP Label Distribution Protocol
LED See light emitting diode.
LMP link management protocol
LOS See loss of signal.
LP logical port
LPT link-state pass through
LSP See label switched path.
LSR See label switching router.
LTC loss of tandem connection
Layer 2 switching A data forwarding method. In a LAN, a network bridge or 802.3 Ethernet switch
transmits and distributes packet data based on the MAC address. Since the MAC address
is at the second layer of the OSI model, this data forwarding method is called Layer 2
switching.
Link Aggregation A dynamic link aggregation protocol that improves the transmission speed and
Control Protocol reliability. The two ends of the link send LACP packets to inform each other of their
(LACP) parameters and form a logical aggregation link. After the aggregation link is formed,
LACP maintains the link status in real time and dynamically adjusts the ports on the
aggregation link upon detecting the failure of a physical port.
label A short identifier that is of fixed length and local significance. It is used to uniquely
identify the FEC to which a packet belongs. It does not contain topology information. It
is carried in the header of a packet and does not contain topology information.
label switched path A sequence of hops (R0...Rn) in which a packet travels from R0 to Rn through label
(LSP) switching mechanisms. A label-switched path can be chosen dynamically, based on
common routing mechanisms or through configuration.
label switching router Basic element of an MPLS network. All LSRs support the MPLS protocol. The LSR is
(LSR) composed of two parts: control unit and forwarding unit. The former is responsible for
allocating the label, selecting the route, creating the label forwarding table, creating and
removing the label switch path; the latter forwards the labels according to groups
received in the label forwarding table.
light emitting diode A display and lighting technology used in almost every electrical and electronic product
(LED) on the market, from a tiny on/off light to digital readouts, flashlights, traffic lights, and
perimeter lighting. LEDs are also used as the light source in multimode fibers, optical
mice, and laser printers.
linear MSP linear multiplex section protection
link aggregation group An aggregation that allows one or more links to be aggregated together to form a link
(LAG) aggregation group so that a MAC client can treat the link aggregation group as if it were
a single link.
link capacity LCAS in the virtual concatenation source and sink adaptation functions provides a
adjustment scheme control mechanism to hitless increase or decrease the capacity of a link to meet the
(LCAS) bandwidth needs of the application. It also provides a means of removing member links
that have experienced failure. The LCAS assumes that in cases of capacity initiation,
increases or decreases, the construction or destruction of the end-to-end path is the
responsibility of the network and element management systems.
link group According to some principles, links are divided into the set in the logical term. A set of
links is called the link group. The division makes management more convenient.
link status The running status of a link, which can be Up, Down, backup, or unknown.
loading A process of importing information from the storage device to the memory to facilitate
processing (when the information is data) or execution (when the information is
program).
local area network A network formed by the computers and workstations within the coverage of a few square
(LAN) kilometers or within a single building, featuring high speed and low error rate. Current
LANs are generally based on switched Ethernet or Wi-Fi technology and run at 1,000
Mbit/s (that is, 1 Gbit/s).
loopback (LB) A troubleshooting technique that returns a transmitted signal to its source so that the
signal or message can be analyzed for errors. The loopback can be a inloop or outloop.
loopback test Self-test of chips, including internal and external loopback. Loopback test is used to test
whether interfaces work normally.
loss of signal (LOS) No transitions occurring in the received signal.
M
MAC mandatory access control
MAC address A link layer address or physical address. It is six bytes long.
MADM multiple add/drop multiplexer
MCA multi-channel spectrum analyzer unit
MD5 See message digest algorithm 5.
MFAS See multiframe alignment signal.
MIP maintenance association intermediate point
MPLS See Multiprotocol Label Switching.
MPLS TE multiprotocol label switching traffic engineering
MPLS VPN See multiprotocol label switching virtual private network.
MPLS-TP See MultiProtocol Label Switching Transport Profile.
MS multiplex section
MSOH multiplex section overhead
MSP See multiplex section protection.
MSTP See Multiple Spanning Tree Protocol.
MTU See maximum transmission unit.
MUX See multiplexer.
MultiProtocol Label An extension to MPLS in terms of forwarding, OAM, reliability, NMS and control plane
Switching Transport protocol standardized by IETF to provide sufficient transport functionality.
Profile (MPLS-TP)
Multiple Spanning A protocol that can be used in a loop network. Using an algorithm, the MSTP blocks
Tree Protocol (MSTP) redundant paths so that the loop network can be trimmed as a tree network. In this case,
the proliferation and endless cycling of packets is avoided in the loop network. The
protocol that introduces the mapping between VLANs and multiple spanning trees. This
solves the problem that data cannot be normally forwarded in a VLAN because in STP/
RSTP, only one spanning tree corresponds to all the VLANs.
Multiprotocol Label A technology that uses short tags of fixed length to encapsulate packets in different link
Switching (MPLS) layers, and provides connection-oriented switching for the network layer on the basis of
IP routing and control protocols.
main topology A basic component of a human-machine interface. It is the default client interface of the
NMS and intuitively displays the structure of a network, NEs on the network, subnets in
the network as well as the NE communication and running status, reflecting the overall
network running status.
management node A management node consists of multiple document security management(DSM) servers
(four at most) in an area. Management nodes are created so that the DSM management
center can synchronize the department and account information of areas from each node
and then deliver all information to all nodes, thus realizing across-system authorization
and roaming access of security documents.
maximum transmission The largest packet of data that can be transmitted on a network. MTU size varies,
unit (MTU) depending on the network—576 bytes on X.25 networks, for example, 1500 bytes on
Ethernet, and 17,914 bytes on 16 Mbit/s token ring. Responsibility for determining the
size of the MTU lies with the link layer of the network. When packets are transmitted
across networks, the path MTU, or PMTU, represents the smallest packet size (the one
that all networks can transmit without breaking up the packet) among the networks
involved.
member A basic element for forming a dimension according to the hierarchy of each level. Each
member represents a data element in a dimension. For example, January 1997 is a typical
member of the time dimension.
message digest A hash function that is used in a variety of security applications to check message
algorithm 5 (MD5) integrity. MD5 processes a variable-length message into a fixed-length output of 128
bits. It breaks up an input message into 512-bit blocks (sixteen 32-bit little-endian
integers). After a series of processing, the output consists of four 32-bit words, which
are then cascaded into a 128-bit hash number.
monitoring A method that an inspector uses to inspect a service agent. By monitoring a service agent,
an inspector can check each detailed operation performed by the service agent during
the conversation and operate the GUI used by the service agent. The inspector helps the
service agent to provide better service.
mounting ear A piece of angle plate on a rack. The mounting ear has holes that can be used to fix
network elements or components.
multicast A process of transmitting data packets from one source to many destinations. The
destination address of the multicast packet uses Class D address, that is, the IP address
ranges from 224.0.0.0 to 239.255.255.255. Each multicast address represents a multicast
group rather than a host.
multiframe alignment A distinctive signal inserted into every multiframe or once into every n multiframes,
signal (MFAS) always occupying the same relative position within the multiframe, and used to establish
and maintain multiframe alignment.
multiplex section A function, which is performed to provide capability for switching a signal between and
protection (MSP) including two multiplex section termination (MST) functions, from a "working" to a
"protection" channel.
multiplexer (MUX) Equipment that combines a number of tributary channels onto a fewer number of
aggregate bearer channels, the relationship between the tributary and aggregate channels
being fixed.
multiplexing A procedure by which multiple lower order path layer signals are adapted into a higher
order path or the multiple higher order path layer signals are adapted into a multiplex
section.
multiprotocol label An Internet Protocol (IP) virtual private network (VPN) based on the multiprotocol label
switching virtual switching (MPLS) technology. It applies the MPLS technology for network routers and
private network switches, simplifies the routing mode of core routers, and combines traditional routing
(MPLS VPN) technology and label switching technology. It can be used to construct the broadband
Intranet and Extranet to meet various service requirements.
N
NE Explorer The main operation interface of the NMS, which is used to manage the
telecommunication equipment. In the NE Explorer, a user can query, manage, and
maintain NEs, boards, and ports.
NE ID An ID that indicates a managed device in the network. In the network, each NE has a
unique NE ID.
NE Panel A graphical user interface, of the network management system, which displays subracks,
boards, and ports on an NE. On the NE Panel, the user can complete most of the
configuration, management and maintenance functions for an NE.
NNI network node interface
NRZ non-return to zero
NSAP See network service access point.
NTP Network Time Protocol
network layer Layer 3 of the seven-layer OSI model of computer networking. The network layer
provides routing and addressing so that two terminal systems are interconnected. In
addition, the network layer provides congestion control and traffic control. In the TCP/
IP protocol suite, the functions of the network layer are specified and implemented by
IP protocols. Therefore, the network layer is also called IP layer.
network service A service that needs to be enabled at the network layer and maintained as a basic service.
network service access A network address defined by ISO, at which the OSI Network Service is made available
point (NSAP) to a Network service user by the Network service provider.
network storm A phenomenon that occurs during data communication. To be specific, mass broadcast
packets are transmitted in a short time; the network is congested; transmission quality
and availability of the network decrease rapidly. The network storm is caused by network
connection or configuration problems.
O
O&M operation and maintenance
OA optical amplifier
OADM See optical add/drop multiplexer.
OAM See operation, administration and maintenance.
OAMS Optical fiber line Automatic Monitoring System
operation, A set of network management functions that cover fault detection, notification, location,
administration and and repair.
maintenance (OAM)
optical add/drop A device that can be used to add the optical signals of various wavelengths to one channel
multiplexer (OADM) and drop the optical signals of various wavelengths from one channel.
optical amplifier unit A board that is mainly responsible for amplifying optical signals. The OAU can be used
(OAU) in both the transmitting direction and the receiving direction.
optical channel payload A protection architecture that allows one wavelength to provide protection for multiple
unit (OPU) services between different stations, saving wavelength resources and lowering costs.
optical interface A component that connects several transmit or receive units.
optical line protection A mechanism that protects line signals using the dual feeding and selective receiving
(OLP) principle, featuring single-ended switching.
optical network A device that terminates the fiber optical network at the customer premises.
terminal (ONT)
optical network unit A form of Access Node that converts optical signals transmitted via fiber to electrical
(ONU) signals that can be transmitted via coaxial cable or twisted pair copper wiring to
individual subscribers.
optical signal-to-noise The ratio of signal power to noise power in a transmission link. OSNR is the most
ratio (OSNR) important index for measuring the performance of a DWDM system.
optical supervisory A technology that uses specific optical wavelengths to realize communication among
channel (OSC) nodes in optical transmission network and transmit the monitoring data in a certain
channel.
optical switch A passive component possessing two or more ports that selectively transmits, redirects,
or blocks optical power in an optical fiber transmission line.
optical transmission A section in the logical structure of an optical transport network (OTN). The OTS allows
section (OTS) the network operator to perform monitoring and maintenance tasks between NEs.
optical transponder A device or subsystem that converts accessed client signals into a G.694.1/G.694.2-
unit (OTU) compliant WDM wavelength.
orderwire A channel that provides voice communication between operation engineers or
maintenance engineers of different stations.
P
P2MP point-to-multipoint
P2P See point-to-point service.
PC personal computer
PCB See printed circuit board.
PCN product change notice
PDH See plesiochronous digital hierarchy.
PDL See polarization-dependent loss.
PDU See power distribution unit.
pseudo wire (PW) An emulated connection between two PEs for transmitting frames. The PW is established
and maintained by PEs through signaling protocols. The status information of a PW is
maintained by the two end PEs of a PW.
pseudo wire emulation An end-to-end Layer 2 transmission technology. It emulates the essential attributes of a
edge-to-edge (PWE3) telecommunication service such as ATM, FR or Ethernet in a packet switched network
(PSN). PWE3 also emulates the essential attributes of low speed time division
multiplexing (TDM) circuit and SONET/SDH. The simulation approximates to the real
situation.
pulse A variation above or below a normal level and a given duration in electrical energy.
Q
QinQ A layer 2 tunnel protocol based on IEEE 802.1Q encapsulation. It add a public VLAN
tag to a frame with a private VLAN tag to allow the frame with double VLAN tags to
be transmitted over the service provider's backbone network based on the public VLAN
tag. This provides a layer 2 VPN tunnel for customers and enables transparent
transmission of packets over private VLANs.
QoS See quality of service.
quality of service (QoS) A commonly-used performance indicator of a telecommunication system or channel.
Depending on the specific system and service, it may relate to jitter, delay, packet loss
ratio, bit error ratio, and signal-to-noise ratio. It functions to measure the quality of the
transmission system and the effectiveness of the services, as well as the capability of a
service provider to meet the demands of users.
R
RADIUS See Remote Authentication Dial In User Service.
RADIUS An authentication mode in which the BRAS sends the user name and the password to
authentication the RADIUS server by using the RADIUS protocol. The RADIUS server authenticates
the user, and then returns the result to the BRAS.
RB See radio bearer.
RDI remote defect indication
RFC See Request For Comments.
RJ registered jack
RMON See remote monitor.
ROADM reconfigurable optical add/drop multiplexer
RSTP See Rapid Spanning Tree Protocol.
RSVP See Resource Reservation Protocol.
RTN radio transmission node
RZ return to zero
Rapid Spanning Tree An evolution of the Spanning Tree Protocol (STP) that provides faster spanning tree
Protocol (RSTP) convergence after a topology change. The RSTP protocol is backward compatible with
the STP protocol.
Remote Authentication A security service that authenticates and authorizes dial-up users and is a centralized
Dial In User Service access control mechanism. RADIUS uses the User Datagram Protocol (UDP) as its
(RADIUS) transmission protocol to ensure real-time quality. RADIUS also supports the
retransmission and multi-server mechanisms to ensure good reliability.
Request For Comments A document in which a standard, a protocol, or other information pertaining to the
(RFC) operation of the Internet is published. The RFC is actually issued, under the control of
the IAB, after discussion and serves as the standard. RFCs can be obtained from sources
such as InterNIC.
Resource Reservation A protocol that reserves resources on every node along a path. RSVP is designed for an
Protocol (RSVP) integrated services Internet.
radio bearer (RB) The service provided by the Layer 2 for the transfer of user data between UE (User
Equipment) and UTRAN (UMTS Terrestrial Radio Access Network).
reboot To start the system again. Programs or data will be reloaded to all boards.
receiver sensitivity The minimum acceptable value of mean received power at point Rn (a reference point
at an input to a receiver optical connector) to achieve a 1x10-12 BER when the FEC is
enabled.
regeneration The process of receiving and reconstructing a digital signal so that the amplitudes,
waveforms and timing of its signal elements are constrained within specified limits.
remote monitor A widely used network management standard defined by the IETF, and it enhances the
(RMON) MIB II standard greatly. It is mainly used to monitor the data traffic over a network
segment or the entire network. RMON is completely based on the SNMP architecture,
including the NMS and the Agent running on each network device.
report A tool that displays data in a specific format to intuitively present service information.
resistance The ability to impede (resist) the flow of electric current. With the exception of
superconductors, all substances have a greater or lesser degree of resistance. Substances
with very low resistance, such as metals, conduct electricity well and are called
conductors. Substances with very high resistance, such as glass and rubber, conduct
electricity poorly and are called nonconductors or insulators.
response A message that is returned to the requester to notify the requester of the status of the
request packet.
router A device on the network layer that selects routes in the network. The router selects the
optimal route according to the destination address of the received packet through a
network and forwards the packet to the next router. The last router is responsible for
sending the packet to the destination host. Can be used to connect a LAN to a LAN, a
WAN to a WAN, or a LAN to the Internet.
routing The determination of a path that a data unit (frame, packet, message) traverses from
source to destination.
routing table A table that stores and updates the locations (addresses) of network devices. Routers
regularly share routing table information to be up to date. A router relies on the
destination address and on the information in the table that gives the possible routes--in
hops or in number of jumps--between itself, intervening routers, and the destination.
Routing tables are updated frequently as new information is available.
rule Rules define automatic management services or applications. Rules include triggering
conditions and execution logic. Triggering conditions refer to the conditions for
executing a rule and execution logic refers to a set of actions executed based on the rule.
S
S-VLAN service virtual local area network
SAN storage area network
SAPI source access point identifier
SD See signal degrade.
SD-SDI See standard definition-serial digital interface signal.
SDH See synchronous digital hierarchy.
SDI See serial digital interface.
SES severely errored second
SESR severely errored second ratio
SETS SDH equipment timing source
SF See signal fail.
SFP small form-factor pluggable
SFTP See Secure File Transfer Protocol.
SHDSL See single-pair high-speed digital subscriber line.
SLA See service level agreement.
SLM signaling link management
SM section monitoring
SMF See single-mode fiber.
SNC subnetwork connection
SNCP subnetwork connection protection
SNCTP subnetwork connection tunnel protection
SNMP See Simple Network Management Protocol.
SONET See synchronous optical network.
SPC soft permanent connection
SRG See shared risk group.
SRLG shared risk link group
SSL See Secure Sockets Layer.
SSM See Synchronization Status Message.
STI service trigger information
STM synchronous transfer mode
STM-1 See Synchronous Transport Module level 1.
signal degrade (SD) A signal indicating that associated data has degraded in the sense that a degraded defect
condition is active.
signal fail (SF) A signal indicating that associated data has failed in the sense that a near-end defect
condition (non-degrade defect) is active.
single-ended switching A protection mechanism that takes switching action only at the affected end of the
protected entity in the case of a unidirectional failure.
single-mode fiber A type of optical fiber through which only one type of optical signal with a fixed wave
(SMF) length can travel at a time. The inner diameter of the single-mode fiber is less than 10
microns. This type of fiber can transmit data over a long distance.
single-pair high-speed A symmetric digital subscriber line technology developed from HDSL, SDSL, and
digital subscriber line HDSL2, which is defined in ITU-T G.991.2. The SHDSL port is connected to the user
(SHDSL) terminal through the plain telephone subscriber line and uses trellis coded pulse
amplitude modulation (TC-PAM) technology to transmit high-speed data and provide
the broadband access service.
smooth upgrade Process of upgrading the system files without service interruption
span The physical reach between two pieces of WDM equipment.
standard definition- Standard definition video signal transported by serial digital interface.
serial digital interface
signal (SD-SDI)
subnet A type of smaller networks that form a larger network according to a rule, for example,
according to different districts. This facilitates the management of the large network.
synchronous digital A transmission scheme that follows ITU-T G.707, G.708, and G.709. SDH defines the
hierarchy (SDH) transmission features of digital signals, such as frame structure, multiplexing mode,
transmission rate level, and interface code. SDH is an important part of ISDN and B-
ISDN.
synchronous optical A high-speed network that provides a standard interface for communications carriers to
network (SONET) connect networks based on fiber optical cable. SONET is designed to handle multiple
data types (voice, video, and so on). It transmits at a base rate of 51.84 Mbit/s, but
multiples of this base rate go as high as 2.488 Gbit/s.
T
TCM tandem connection monitor
TCP See Transmission Control Protocol.
TCP/IP Transmission Control Protocol/Internet Protocol
TD transmit degrade
TDC tunable dispersion compensator
TDM See time division multiplexing.
TF transport format
TFTP See Trivial File Transfer Protocol.
TIM trail trace identifier mismatch
TL1 Transaction Language 1
V
V-UNI See virtual user-network interface.
VB virtual bridge
VBR See variable bit rate.
VC See virtual container.
VCPLM virtual concatenation payload mismatch
VCTRUNK A virtual concatenation group applied in data service mapping, also called the internal
port of a data service processing board.
VDSL2 See very-high-speed digital subscriber line 2.
VLAN virtual local area network
VOA variable optical attenuator
VPLS See virtual private LAN service.
VPN virtual private network
VPWS See virtual private wire service.
VRRP See Virtual Router Redundancy Protocol.
VSI See virtual switching instance.
Virtual Router A protocol designed for multicast or broadcast LANs such as an Ethernet. A group of
Redundancy Protocol routers (including an active router and several backup routers) in a LAN is regarded as
(VRRP) a virtual router, which is called a backup group. The virtual router has its own IP address.
The host in the network communicates with other networks through this virtual router.
If the active router in the backup group fails, one of the backup routers in this backup
group becomes active and provides routing service for the host in the network.
variable bit rate (VBR) One of the traffic classes used by ATM (Asynchronous Transfer Mode). Unlike a
permanent CBR (Constant Bit Rate) channel, a VBR data stream varies in bandwidth
and is better suited to non real time transfers than to real-time streams such as voice calls.
very-high-speed digital An extension of the VDSL technology, which complies with ITU G.993.2, supports
subscriber line 2 multiple spectrum profiles and encapsulation modes, and provides short-distance and
(VDSL2) high-speed access solutions to the next-generation FTTx access service.
view The topological view that is presented in some rules. Customize the view according to
requirements of every product and organize the data in the view displayed by the topology
module. By default, the platform provides the physical view. The topology view can be
planned according to the domain, maintenance relationship and so on.
virtual container (VC) An information structure used to support path layer connections in the SDH. A VC
consists of a payload and path overhead (POH), which are organized in a block frame
structure that repeats every 125 μs or 500 μs.
virtual private LAN A type of point-to-multipoint L2VPN service provided over the public network. VPLS
service (VPLS) enables geographically isolated user sites to communicate with each other through the
MAN/WAN as if they are on the same LAN.
virtual private wire A technology that bears Layer 2 services. VPWS emulates services such as ATM, FR,
service (VPWS) Ethernet, low-speed TDM circuit, and SONET/SDH in a PSN.
virtual switching An instance through which the physical access links of VPLS can be mapped to the
instance (VSI) virtual links. Each VSI provides independent VPLS service. VSI has Ethernet bridge
function and can terminate PW.
virtual user-network A virtual user-network interface, works as an action point to perform service
interface (V-UNI) classification and traffic control in HQoS.
W
WAN wide area network
WDM wavelength division multiplexing
WLAN See wireless local area network.
WSS wavelength selective switching
Web LCT The local maintenance terminal of a transport network, which is located at the NE
management layer of the transport network.
wireless local area A hybrid of the computer network and the wireless communication technology. It uses
network (WLAN) wireless multiple address channels as transmission media and carriers out data interaction
through electromagnetic wave to implement the functions of the traditional LAN.
working service A specific service that is part of a protection group and is labeled working.
X
XCS cross-connect and synchronous timing board
Y.1731 The OAM protocol introduced by the ITU-T. Besides the contents defined by
IEEE802.1ag, ITU-T Recommendation Y.173 also defines the following combined
OAM messages: Alarm Indication Signal (AIS), Remote Defect Indication (RDI),
Locked Signal (LCK), Test Signal, Automatic Protection Switching (APS), Maintenance
Communication Channel (MCC), Experimental (EXP), and Vendor Specific (VSP) for
fault management and performance monitoring, such as frame loss measurement (LM),
and delay measurement (DM).