Beruflich Dokumente
Kultur Dokumente
V100R002C00
Issue
02
Date
2009-09-30
Huawei Technologies Co., Ltd. provides customers with comprehensive technical support and service. For any
assistance, please contact our local office or company headquarters.
Website:
http://www.huawei.com
Email:
support@huawei.com
Notice
The information in this document is subject to change without notice. Every effort has been made in the
preparation of this document to ensure accuracy of the contents, but the statements, information, and
recommendations in this document do not constitute a warranty of any kind, express or implied.
Contents
Contents
About This Document.....................................................................................................................1
1 Troubleshooting Overview......................................................................................................1-1
1.1 Classification of Troubleshooting...................................................................................................................1-2
1.1.1 Connectivity Faults................................................................................................................................1-2
1.1.2 Performance Faults.................................................................................................................................1-2
1.2 Objectives of Troubleshooting........................................................................................................................1-2
1.3 Methods of Fault Diagnosis............................................................................................................................1-3
1.4 Collection of Fault Information.......................................................................................................................1-3
1.5 Troubleshooting Assistance............................................................................................................................1-5
1.5.1 Customer Center.....................................................................................................................................1-5
1.5.2 Huawei Technical Support Website.......................................................................................................1-5
Contents
5 CF Card Troubleshooting.........................................................................................................5-1
5.1 CF Card Overview..........................................................................................................................................5-2
5.2 Troubleshooting CF Card................................................................................................................................5-2
5.2.1 Troubleshooting Flowchart....................................................................................................................5-2
5.2.2 Troubleshooting Procedure....................................................................................................................5-3
5.3 FAQs...............................................................................................................................................................5-3
6 POE Troubleshooting................................................................................................................6-1
6.1 Overview of PoE.............................................................................................................................................6-2
6.1.1 Introduction to PoE................................................................................................................................6-2
6.1.2 Power-on Process of PoE.......................................................................................................................6-2
6.1.3 PoE Working Mode Supported by theS9300.........................................................................................6-3
6.2 Troubleshooting Cases....................................................................................................................................6-3
6.2.1 PoE Board Fails to Be Registered..........................................................................................................6-3
6.2.2 PSE Cannot Detect Any PD...................................................................................................................6-4
6.2.3 PSE Cannot Provide Power for PDs......................................................................................................6-5
6.3 Known Anomalies...........................................................................................................................................6-6
ii
Issue 02 (2009-09-30)
Figures
Figures
Figure 3-1 Board loading troubleshooting flowchart...........................................................................................3-3
Figure 4-1 Board registration flowchart...............................................................................................................4-3
Figure 5-1 CF card troubleshooting flowchart.....................................................................................................5-3
Issue 02 (2009-09-30)
iii
Tables
Tables
Table 6-1 Table 1-1 Commands used in manual mode........................................................................................6-3
Issue 02 (2009-09-30)
Related Versions
The following table lists the product versions related to this document.
Product Name
Version
S9300
V100R002C00
Intended Audience
The intended audiences of this document are:
l
Commissioning engineer
Organization
This document is organized as follows.
Issue 02 (2009-09-30)
Chapter
Description
1 Troubleshooting
Overview
Chapter
Description
2 Routine Device
Troubleshooting
3 Board Loading
Troubleshooting
4 Board Registration
Troubleshooting
5 CF Card Troubleshooting
6 POE Troubleshooting
Conventions
Symbol Conventions
The symbols that may be found in this document are defined as follows.
Symbol
Description
DANGER
WARNING
CAUTION
TIP
NOTE
General Conventions
The general conventions that may be found in this document are defined as follows.
2
Issue 02 (2009-09-30)
Convention
Description
Boldface
Italic
Courier New
Command Conventions
The command conventions that may be found in this document are defined as follows.
Convention
Description
Boldface
Italic
[]
{ x | y | ... }
[ x | y | ... ]
{ x | y | ... }*
[ x | y | ... ]*
&<1-n>
GUI Conventions
The GUI conventions that may be found in this document are defined as follows.
Issue 02 (2009-09-30)
Convention
Description
Boldface
>
Keyboard Operations
The keyboard operations that may be found in this document are defined as follows.
Format
Description
Key
Press the key. For example, press Enter and press Tab.
Key 1+Key 2
Key 1, Key 2
Mouse Operations
The mouse operations that may be found in this document are defined as follows.
Action
Description
Click
Double-click
Drag
Press and hold the primary mouse button and move the
pointer to a certain position.
Update History
Updates between document versions are cumulative. Therefore, the latest document version
contains all updates made to previous versions.
6 POE Troubleshooting
Issue 02 (2009-09-30)
1 Troubleshooting Overview
Troubleshooting Overview
Issue 02 (2009-09-30)
1-1
1 Troubleshooting Overview
Configuration error
Improper interactions
Network congestion
Routing loop
Network error
Fault diagnosis: obtaining the diagnostic information with the tools, locating the fault and
restoring the network
Network optimizing: finding out the defects in network plan and configuration, and
improving the performance
Routine maintenance: observing the running of the network, and detecting the
communication quality in time
Fault diagnosis is to use diagnostic tools to obtain diagnostic information, locate the fault, find
the reason, remove the fault, and make the network device operate in normal state.
Generally, a device fault may be caused by:
1-2
Connection failure of devices on the physical layer or hardware and line fault
Issue 02 (2009-09-30)
1 Troubleshooting Overview
You should diagnose the fault in terms of the OSI model from the physical layer. In this way,
locate the fault to recover the system.
View the routing table.The ping, tracert and display, debugging commands are useful
tools to obtain the diagnostic information.
Run the display interface command to obtain the information about each interface to be
detected.
Run the display cpu-usage command to display the usage of the CPU.
Run the display memory-usage command to display the usage of the memory.
Generally, the maintenance personnel troubleshoot the fault by the following five steps:
1.
2.
3.
4.
5.
Who Is Involved?
1-3
1 Troubleshooting Overview
You should give questions continuously on the basis of the answer that the customer gives until
you obtain an exact knowledge about the fault.
Who Is Involved?
Does the failure involve one user, a group of users with common attributes, or all the users in
the network?
As for one user, you can ask questions from the following aspects:
l
The physical layer, including the network cable connecting the user's device
As for a group of users with common attributes or all the users, you can ask questions from the
following aspects:
l
The server
Hardware
Routing protocols
As for a partial connectivity fault, you can ask questions from the following aspects:
l
As for a performance fault, you can ask questions from the following aspects:
1-4
Network congestion
Routing loop
Non-optimal route
Huawei Proprietary and Confidential
Copyright Huawei Technologies Co., Ltd.
Issue 02 (2009-09-30)
1 Troubleshooting Overview
Routing loop
As for a fault that occurred just now, you can ask questions from the following aspects:
l
Rerouting
Generally, a fault in the edge area is associated with the access list.
Possibly, a fault in the access area is associated with all the preceding questions.
1-5
1 Troubleshooting Overview
Supporting network
Documentation
Community
Documentation
The contents in Documentation involving datacom mainly include:
l
Product manuals that embody all the attached documentation about Huawei datacom
products
Functions and features that embody the application of new features of Huawei datacom
products
Case database that embodies all the troubleshooting cases and resolutions experienced by
Huawei technical support personnel
Precautions that embody all the points deserving attention in the use of Huawei datacom
products
Community
It provides a site for discussing the communications technology in terms of forum.
Any question about a fault can be posed here. A reply will be given promptly.
1-6
Issue 02 (2009-09-30)
Issue 02 (2009-09-30)
2-1
Generic fault: refers to the fault found in routine device maintenance, which has minor
influence on the performance of the device.
Urgent fault: refers to the fault that results in system breakdown or service interruption,
which needs to be removed urgently.
The common device fault refers to the fault found in routine maintenance of the device.
Command
Description
Basic information
display diagnosticinformation
2-2
Version information
display version
Patch information
display patchinformation
Issue 02 (2009-09-30)
Information
Command
Description
Detailed running
information of the
board
display device
System environment
information
System temperature
Run the display temperature slot slotid command to display the temperature
on the board.
Current configuration
display currentconfiguration
Current time
display clock
Log
display logbuffer
Trap
display trapbuffer
Information on the
interface
display interface
Memory usage
display memory-usage
CPU usage
display cpu-usage
2-3
After collecting the information about faults, analyze it, judge the type and range of faults and
locate the faults.
The following common methods can be used to locate faults:
l
Performing loopback
Comparing the running status with faults by shifting the interface, the sub-card or the
board to another one
Resetting the register without power-off and with the configuration being saved
2.
3.
4.
5.
2-4
Be cautious to use the debugging command, especially the debugging all command, because
it can slow down the system. Disable the debugging immediately after it is finished.
Before enabling the debugging, run the terminal debugging and terminal monitor
commands so that the debugging information is displayed on the terminal.
Issue 02 (2009-09-30)
Command
debugging all
CAUTION
Do confirm whether to run the following commands because the commands may slow down the
system or even interrupt services.
Run the following commands in the user view or the system view.
Issue 02 (2009-09-30)
View
Action
Command
User view
reboot
2-5
View
Action
Command
System
view
slave restart
Perform active/standby
switchover by force.
slave switchover
Command
display version
2-6
display power
display patch-information
display fan
Issue 02 (2009-09-30)
Issue 02 (2009-09-30)
3-1
BootROM
Logical chip
The types of the loaded software on different hardware are different, including:
l
Start-up loading: After the main system started, the board downloads the software to the
storage area of the relevant board according to the hardware type. The whole process
completes automatically. The loading files vary with board types, so the S9300 identifies
the loading files with different IDs.
On-line loading: It is also called force-loading and indicates that the relevant software
(except EPLD) or logical file is loaded to the board by the command line.
3-2
Issue 02 (2009-09-30)
Enable
debugging
Is there
loading
information?
Upgrade the
system
through JTAG
No
Yes
Run display
load fail-info
Fault
removed?
No
Technical
support
Yes
End
----End
3-3
Fault Analysis
You can enable the debugging:
<Quidway> debugging load packet snd
<Quidway> debugging load packet rcv
<Quidway> debugging load event
In normal case, three files should be downloaded, but the debugging information shows that no
file is downloaded. That is to say, an error occurs when the first file is downloaded.
Run the dir command to check the system files:
<Quidway> dir
Directory of cfcard:/
0
-rw- 46424064
1
-rw44
2
-rw12001
3
-rw5709
4
-rw947
Oct
Oct
Oct
Oct
Oct
19
20
19
19
20
2008
2008
2008
2008
2008
15:49:18
14:30:22
18:06:58
18:07:04
14:30:24
s9300.cc
private-data.txt
paf.txt
license.txt
connex-pe1.cfg
The size of the system file is 46424064 bytes, while the size of the large package file released
with the product is 70000000 bytes.
Procedure
Step 1 Obtain the latest system software version and load it to the S9300.
Step 2 Run the startup system-software filename command to specify system software for the S9300
and restart the S9300.
If the fault persists, contact Huawei engineers.
----End
Summary
If the "file's id error" message is displayed, it indicates that the system file is incorrect. It is
possible that the loading of system file aborts. Therefore, some LPUs cannot register.
3-4
Issue 02 (2009-09-30)
3.4 FAQs
This section lists frequently asked questions and their answers.
l
Q:Why Is the "write flash error" Message Displayed During LPU Start-up?
A: After reboot, the S9300 can work only after the system file is loaded from the Master
Control Unit to the Flash memory of the LPU. If the message is displayed, it indicates that
the board should be repaired.
Description
NOTE
The display load fail-info command can run only in the user view.
Issue 02 (2009-09-30)
Command
Description
3-5
3-6
Command
Description
Issue 02 (2009-09-30)
Issue 02 (2009-09-30)
4-1
Sends the registration request to the Master Control Unit. The request contains the version
and initialization self-check information of each module.
2.
Receives the registration request from the LPU and check the self-check information.
2.
If the self-check information is correct, records the LPU information and sends a response
to the LPU. If the self-check information is incorrect, resets the LPU and records the
resetting reason in log and reports the alarm.
CAUTION
Do not draw out the board or reset the board when the bootrom program or bootload program
is loading on the LPU; otherwise, the program will be damaged, and you should contact Huawei
engineers to repair it.
4.2.1 Troubleshooting Flowchart
4.2.2 Troubleshooting Procedure
4-2
Issue 02 (2009-09-30)
Board
registration
error?
No
Yes
Enable
debugging
Fault removed?
No
Technical
support
Yes
End
The start-up time of the Master Control Unit is within three minutes. If the S9300 is restarted
after system upgrade, the start-up time is within five minutes.
The start-up time of the LPU is within five minutes. If the system software is upgraded, the
start-up time is within 10 minutes.
Step 2 Check whether the board start-up failure is caused by board registration fault.
l
Check whether the board registration fails. If it is a board registration fault, remove the fault
by referring to "3 Board Loading Troubleshooting".
Issue 02 (2009-09-30)
4-3
If the reason why the board does not register cannot be displayed by the display logbuffer
slot slot-id command, it means that the board never registers. Then you need to check that the
board is loaded successfully by connecting the board with a serial port cable. For detailed
procedure, see "3 Board Loading Troubleshooting".
If the fault persists, contact Huawei engineers.
----End
Fault Analysis
The possible reasons are as follows:
l
When the S9300 is started, the main system loads logical software automatically. The
versions of the small system and the main system must be matched; otherwise, the loading
cannot complete.
Procedure
Step 1 Run the check version command to check whether the versions of the main system and the small
system are matched. If they are unmatched, upgrade the version which is earlier.
<Quidway> upgrade all startup
Issue 02 (2009-09-30)
Fault Analysis
The LPU cannot register for a long time. The log is as follows:
%Oct 15 11:34:19 2008 Quidway ALML/4/ENTRESET:
LPU6 is reset, The reason is:
Warm reset board for no receiving message in a long time
The preceding log is generated if the LPU cannot communicate with the active Master Control
Unit in three minutes.
The possible causes of communication failure are as follows:
l
Procedure
Step 1 Insert the LPU again.
Step 2 Upgrade the EPLD through JTAG. For detailed procedure, see 4.3.1 Failed to Register the
LPU.
Step 3 Wait three minutes. If the fault is not removed, upgrade the software version of the main system
and the small version through JTAG. For detailed procedure, see 4.3.1 Failed to Register the
LPU.
If the fault persists, contact Huawei engineers.
----End
Fault Analysis
The possible reasons are as follows:
l
The Master Control Unit is faulty. The LPUs cannot be powered on if the Master Control
Unit is not powered on.
Issue 02 (2009-09-30)
4-5
Procedure
Step 1 Pull out all the LPUs and power on the S9300 again. Check whether the Master Control Unit
can be powered on.
----End
4.4 FAQs
This section lists frequently asked questions and their answers.
l
4-6
Command
Description
display device
Issue 02 (2009-09-30)
5 CF Card Troubleshooting
CF Card Troubleshooting
Issue 02 (2009-09-30)
5-1
5 CF Card Troubleshooting
CAUTION
The compact flash (CF) card on the panel of the Master Control Unit does not support hot swap.
The S9300 has one CF card, which is installed on the panel of the Master Control Unit. The CF
card stores system file, configuration file, PAF, license file, and log files.
The log files of the S9300 are stored in the CF card, so the storage space of the CF card may be
full when the S9300 operates for a long time. In this case, the S9300 deletes the log files by
generation date. The earliest log file is deleted first. To save the log files, you can transfer the
files through FTP or TFTP, and then delete the files from the CF card.
5-2
Issue 02 (2009-09-30)
5 CF Card Troubleshooting
Is CF card
plugged?
Yes
Fault
removed?
Yes
No
No
Is storage
space full?
Yes
Fault
removed?
Yes
No
No
Replace the
CF card
Fault
removed?
Yes
End
No
Technical
support
5.3 FAQs
This section lists the frequently asked questions and their answers.
l
Issue 02 (2009-09-30)
5-3
6 POE Troubleshooting
POE Troubleshooting
Issue 02 (2009-09-30)
6-1
6 POE Troubleshooting
In manual mode, you need to run certain commands to complete the operation.
2.
3.
6-2
4.
RTP & Power management: The PSE provides functions of over-current protection,
current/voltage detection, short-circuit protection, open-circuit protection, and
troubleshooting.
5.
Issue 02 (2009-09-30)
6 POE Troubleshooting
The PSE detects whether the PD is disconnected by using a special detection method. If
the PD is disconnected, the PSE shuts down the port to stop providing the voltage. The port
then enters the detection state.
Manual Mode
In manual mode, the PSE automatically detects and classifies PDs but does not power on PDs.
You need to run certain commands to power on PDs. To manually power on or power off PDs,
you must set the PoE working mode of the relevant board to manual mode in system mode.
Table 6-1 describes the commands that are used in manual mode.
Table 6-1 Table 1-1 Commands used in manual mode
Function
Command
Power on PDs on a
specified port manually.
Issue 02 (2009-09-30)
6-3
6 POE Troubleshooting
Fault Analysis
This fault occurs usually because the PoE power supply does not provide power.
1.
2.
2. If the PoE power supply is well connected, ensure that the PoE power supply is powered
on and works normally.
Run the following command to view information about the PoE power supply in the
corresponding slot.
<Quidway> display poe-power
Available total POE power(mW) : 800000
System reserved POE power(mW) : 48000
User reserved POE power percent : 20
POE power backup-mode
: 2+0
POE power 1 :
Power value(mW) : Voltage value(V) : Current value(A) : POE power 2 :
Power value(mW) : 800000
Voltage value(V) : 53.57
Current value(A) : 0.24
If the PoE power supply works normally, it indicates that the fault lies in the DIMM. In
this case, you need to replace the DIMM. Remove the PoE board from the subrack, and
then replace the DIMM.
Fault Analysis
Theoretically, a port can automatically detect and classify a PD are the PD is connected. If the
power of the port is sufficient, the port can provide power for the PD automatically. If the power
of the port is insufficient, the port keeps in Classification completed state. If the port retains in
the Detecting state after a PD is connected, locate the fault as follows:
1.
2.
Run the following command to view the status of each PoE port on the board.
<Quidway> display poe power-state slot 4
PortName
PowerOn/Off Enabled Priority
Status
------------------------------------------------------------------------------GigabitEthernet4/0/0
off
enable
Low
Classification
completed
GigabitEthernet4/0/1
off
enable
Low
Detecting
GigabitEthernet4/0/2
off
enable
Low
Detecting
GigabitEthernet4/0/3
off
enable
Low
Detecting
GigabitEthernet4/0/4
off
enable
Low
Detecting
6-4
3.
Replace the network cable of the port and check whether the fault is rectified.
4.
If the network cable is normal, check the grounding of the S9300. If the S9300 is not
grounded, ground it.
Huawei Proprietary and Confidential
Copyright Huawei Technologies Co., Ltd.
Issue 02 (2009-09-30)
5.
6 POE Troubleshooting
6.
Fault Analysis
Theoretically, a port can automatically detect and classify a PD when the PD is connected to the
port. If the power of the port is sufficient, the port can provide power for the PD automatically.
If the power of the port is insufficient, the port keeps in Classification completed state. If the
port keeps in Classification completed state but cannot power on the PD, do as follows to locate
the fault:
1.
Run the following command to view the status of each PoE port on the board.
<Quidway> display poe power-state slot 4
PortName
PowerOn/Off Enabled Priority
Status
------------------------------------------------------------------------------GigabitEthernet4/0/0
off
enable
Low
Power condition is
good
GigabitEthernet4/0/1
off
enable
Low
Power condition is
good
GigabitEthernet4/0/2
off
enable
Low
Power condition is
good
GigabitEthernet4/0/3
off
enable
Low
Power condition is
good
GigabitEthernet4/0/4
off
enable
Low
Classification
completed
2.
Run the following command to view information about the port that fails to be powered
on.
<Qwidway> display poe power-state interface GigabitEthernet 4/0/4
Port power enabled
: enable
Port power ON/OFF
: off
Port power status
: Classification completed
Port PD class
: 0
Port reference power(mW)
: 15400
Port power priority
: low
Port max power(mW)
: 20000
Port current power(mW)
: 0
Port peak power(mW)
: 0
Port average power(mW)
: 0
Port current(mA)
: 0.00
Port voltage(V)
: 0.00
Issue 02 (2009-09-30)
The Port reference power(mW) field indicates the reference power of the port. If this
port needs to power a PD, the available power of the board must be greater than or equal
to the reference power of the port.
Huawei Proprietary and Confidential
Copyright Huawei Technologies Co., Ltd.
6-5
6 POE Troubleshooting
l
3.
The Port max power(mW) field indicates the maximum power of the port. If this port
needs to power a PD, the maximum power of the port must be greater than or equal to
the reference power of the port.
Run the following command to check the power supply information in the corresponding
slot.
<Quidway> display poe information slot 4
Codes: USMPW(User Set Max Power), AVTPW(Available Total Power),
TRPW(Total Reference Power), TPWC(Total Power Consumption),
PKVAL(Peak Value), PWMGM(Power-Management mode)
Slot 4
------------------------------------------------------------------------------------------------------------------USMPW(mW) AVTPW(mW) TRPW(mW) TPWC(mW) PKVAL(mW) PWMGM
1440000
592000
60000
4586
5000
auto
-------------------------------------------------------------------------------------------------------------------
(1) Check the value of AVTPW. If the value is 0, it indicates that the board does not
obtain power, and hence cannot provide power for the port.
(2) The value of PWMGM should be auto. If the value of PWMGM is manual, run the
poe power-on interface { interface-name | interface-type interface-num } command
in the system view to power on the port manually.
(3) Calculate the available power of the board by using the formula: Available power =
Min. (USMPW, AVTPW) TPWC.
Compare the reference power of the port with the available power of the board. If the
reference power of the port is larger, it indicates that the available power of the board
is insufficient for the PD.
(4) 4. Run the following command to view information about the PoE power supplies.
Check whether at least one PoE power supply works normally.
<Quidway> display poe-power
Available total POE power(mW) : 800000
System reserved POE power(mW) : 48000
User reserved POE power percent : 20
POE power backup-mode
: 4+0
POE power 1 :
Power value(mW)
: Voltage value(V) : Current value(A) : POE power 2 :
Power value(mW)
: Voltage value(V) : Current value(A) : POE power 3 :
Power value(mW) : 800000
Voltage value(V) : 53.47
Current value(A) : 0.21
POE power 4 :
Power value(mW)
: Voltage value(V) : Current value(A) : -
(5) 5. If at least one PoE power supply works normally, check the value of User reserved
POE power percent. If the value is 100, it indicates that all the power of the PoE
power supply is reserved, so the port cannot be powered on.
Issue 02 (2009-09-30)
6 POE Troubleshooting
PoE Function Is Unavailable on All Ports Due to Faults of PoE Power Supplies
When the S9300 Works Under Full Power
Assume that two 800 W PoE power supplies are installed on an S9312, so the total power of the
system is 1600 W. When the total power consumption of the S9300 exceeds 800 W and a PoE
power supply is faulty at this time, the system software cannot shut down PoE ports in time.
Therefore, the instant power is too large, causing over-load protection of the other PoE. Then
all PDs are powered off. After the PoE power recovers, the S9300 powers on the PoE ports
according to their priorities.
Issue 02 (2009-09-30)
6-7