Beruflich Dokumente
Kultur Dokumente
1. 2. 3. 4.
QoS Reports Delivery Investigation Reports delivery Investigation methodology Migration Reports/templates delivery
5.
6. 7.
Migration methodology
QoS Alerter Investigation of problems
Delivery of 4 reports permitting to follow the quality of service per WAC and available on commercial network
General remark: concerning the reports, when 2 scales are present, the scale on left part is dedicated to the columns and the scale on right part is dedicated to the lines Rem: it is an update of the reports done in W3MR1ed2b. So the report names are kept even if the new release is W3MR1ed2D.
Investigation methodology
The goal is from a global overview to go deeper in the analysis permitting:
to check the root cause of the problem to identify the entity concerned by the problem to take an action to solve/clarify the problem
The reports delivered concerned the attach setup procedure Run the report NP_Mono_W3MR1ed2b_1 with periodicity day If bad performance for attach setup procedure, for the concerned day, run the report NP_Mono_attach_setup_WAC with periodicity day. This will permit to fix if the problem is before/after the attachment to the BS. If before, you can see the view with reason of the failure If before attachment to the BS, run the warning report NP_attach_setup with periodicity day giving the day corresponding to the problem. Check for the concerned view, what is the worst cell. Ensure for the concerned cell you have enough samples (ex: if you have 100% failure with 2 request and 0 success, it is not meaningful) Run the mono report NP_Mono_attach_setup for the concerned day with periodicity 1/4. This permit to check if the problem is occasional of spread during all the day.
Investigation methodology
Re-Run the report NP_Mono_attach_setup for the previous days and the concerned day with periodicity 1/4 to check if the problem is periodic or episodic
On NPO, for the concerned view, right click and select properties. In some views, I added in the description field some comments permitting to check in priority some specific points
If not enough, the analysis above permit to say when and on which BS/WAC to start the trace
FOR HANDOVER
Investigation methodology
If handover execution failure case for the day concerned: select all the cells of the WAC and drag&drop of indicators HO_WAC_intraWAC_exec_req_sBS, HO_WAC_intraWAC_succ_sBS, NP_HO_WAC_intraW_sBS_fail Check if the problem is important on a site or spread over all the sites If problem on one site (for example), select object W-adjacency, select all the adjacencies related to this site and drag&drop of indicators HO_WAC_intraWAC_exec_req_sBS, HO_WAC_intraWAC_succ_sBS, NP_HO_WAC_intraW_sBS_fail you will have the list of the worst adjacencies
Migration reports/templates
Delivery of 5 reports for migration
Migration_WAC mono-report available at WAC level to run before OMC migration. Migration_BS multi-report available at BS level only to run before OMC migration Migration_WAC to run after WAC migration T_Mono_Evolution_BS to run after BS migration for one particular BS when doubts T_Multi_Migration_BS to run on all the BS after migration
Migration methodology
Migration methodology
The migration is done in 2 part: OMC migration WAC/BS migration WAC/BS migration is usually done 2 or 3 days after OMC migration
During the OMC migration there is a risk to loose partially/completely the indicator database. It takes time to recover the database. During this period, we have no reference. So before OMC migration: Export the customers dictionary for report/indicators Run the Mono-Report Migration_WAC for each WAC: With periodicity week, for the 10 previous week With periodicity day, for the 21 previous days With periodicity hour, for the 7 previous days Run the Multi-Report Migration_BS for all the BS: With periodicity day for the 7 previous days Save the pm_xml files regularly before the OMC migration As the pm xml files are present during a period of 5 days, knowing the OMC migration date, the backup must be done regularly (at least during the 10 days before the migration). This files contain all the counters. So in case of missing information in the reports, it is possible to recover it in these files
Migration methodology
After OMC migration: Check if NPO is always running If KO during a long time, need to work with pm xml files Check the customers views/reports/indicators are always present If no more present, re-import them Run the mono and multi reports (used before migration) for the different periodicities and with the same date to check they are equivalent If database corrupted, the comparison will be done with the reports done before the OMC migration
Migration methodology
After the WAC/BS migration: First day, reports to run each hour: Mono report Migration_WAC starting from the previous complete days with periodicity hour (if possible with periodicity 1/4 but that means the Granularity Period is 15mn else no meaning). To have a good reference, you have to compare the previous hour with the same hour of the previous day (and if possible with the same hour and same day of the previous week) Multi-report T_Multi_Migration_BS with periodicity hour for the current day to check all the BS are running and generating traffic( attach setup attempt/success) Results sent as soon as BS not operational or important degradation Results of the previous hour with the cell migrated and taking off the bad BS compared to before migration(due to any reasons) to give the list of the bad BS (rem: can be done creating a cell zone)
Migration methodology
Second day: Same as first day checking regularly if no problem seen before Compare the previous day with the same day of the previous week should be similar Results of the previous days for all the BS Results of the previous days taking off the BS bad compared to before migration(due to any reasons) to give the list of the bad BS (rem: can be done creating a cell zone)
Other days: Regular check for the current day Results of the previous days for all the BS Results of the previous days taking off the bad BS bad Follow of the bad BS if any action has been done First week: comparison of the previous week with the week before QoS follow up
QoS Alerter
QoS Alerter
Regarding QoS alerters, please check out the dedicated KTS session & respective slides on the Sleeping, Dead & Lazy cell topic: KTS QoS Alerter link Sleeping cell:
Description: BS-WAC link down Impact: will trigger a BS reset [the keep alive mechanism will not work] - all users on that BS will be affected being disconnected Workaround: use WBS monitoring tool to determine/measuring the impact
Dead cell:
Description: WBS strops to make traffic Impact: connected users will not make traffic anymore and new users can't connect (BS is not detected by the CPE) Workaround: QoS alerter to be used (usage depends on SW release) => if a BS is dead we have many steps as workaround: check status of the WBS and other alarms; check if the WBS is not having traffic because no CPEs on that area; if you find no reason for the BS no having traffic => Reset Telecom => if the problem still exists on the next GP, Reset WBS
Lazy cell:
Description: Traffic ongoing but no new CPE can connect Impact: Some mobiles can't connect into the WBS (this could impact all the mobiles or few mobiles only) Workaround: detection available in W3.1 Ed2D P2 only. Final QoS alerter formula is under test in a commercial network. As soon we will have the final formula with final threshold, we will announce it.
22 | Presentation Title | Month 2006 All Rights Reserved Alcatel-Lucent 2006, #####
INVESTIGATION OF PROBLEMS
Attach Setup Success Rate - WAC: waclu01 ( RanCtrl-01-Cluster01.packet1.com ) 2008 Week 23 To 2008 Week 28 (Working Zone: Global - Medium)
16000 14000 12000 10000 0.8% 0.7% 0.6% 0.5% 0.4% 0.3% 0.2% 0.1% 0.% 06/02/2008 06/09/2008 06/16/2008 06/23/2008 06/30/2008 07/07/2008
No unit
No unit
98.% 97.5%
50000 0 97.%
07 /0 6/ 20 08 07 /0 7/ 20 08 07 /0 8/ 20 08 07 /0 9/ 20 08 07 /1 0/ 20 08 07 /1 1/ 20 08 07 /1 2/ 20 08 07 /1 3/ 20 08 07 /1 4/ 20 08 07 /1 5/ 20 08
All Rights Reserved Alcatel-Lucent 2006, #####
Attach fail at BS level - WAC: waclu01 ( RanCtrl-01-Cluster01.packet1.com ) 07/06/2008 To 07/15/2008 (Working Zone: Global - Medium)
350000 300000 250000 80.00%
No unit
120.00%
100.00%
auth fail setup succ %attach fail %T17 expired %RAC rej
20.00%
0.00%
No unit
C el l_ P P
70000
Cell RESTSRIRATU2
setup req %other fail
Attach_setup_req=73374
Running the warning report NP_attach_setup with periodicity day: In view Attach fail cause other, we took the first worst cell (containing many attempts)
Attach fail cause other - BSCELL - 07/15/2008 (Working Zone: Global - Medium)
100%
No unit
auth fail setup succ %attach fail %T17 expired %RAC rej
07 /1 07 5 /2 /1 0 0 07 5 /2 8 0 /1 0 0 0: 07 5 /2 8 0 00 /1 0 0 1: 07 5 /2 8 0 15 /1 0 0 2: 07 5 /2 8 0 30 /1 0 0 3: 07 5 /2 8 0 45 /1 0 0 5: 07 5 /2 8 0 00 /1 0 0 6: 07 5 /2 8 0 15 /1 0 0 7: 07 5 /2 8 0 30 /1 0 0 8: 07 5 /2 8 1 45 /1 0 0 0: 07 5 /2 8 1 00 /1 0 0 1: 07 5 /2 8 1 15 /1 0 0 2: 07 5 /2 8 1 30 /1 0 0 3: 07 5 /2 8 1 45 /1 0 0 5: 07 5 /2 8 1 00 /1 0 0 6: 07 5 /2 8 1 15 /1 0 0 7: 07 5 /2 8 1 30 /1 0 0 8: 07 5 /2 8 2 45 /1 0 0 0: 07 5 /2 8 2 00 /1 0 0 1: 5 / 8 15 20 2 08 2:3 23 0 :4 5
All Rights Reserved Alcatel-Lucent 2006, #####
As it is not possible to systematically make wireshark trace to have a list of MAC address with bad anonymous identity, another solution is to use the WAC log file CC_callControlThread_x.log. On WAC machine, you have to collect all the CC_callControlThread_x.log going in /diagnosis/log/WUM_<PID>_<date>>/CC_<PID> with <date> corresponding to the more recent date. A tool is in preparation to give the list of MAC address with bad anonymous identity and the corresponding cell When the tool will be available, this check must be done each day to verify if new CPEs will have bad anonymous identity
Session Release - WAC: waclu01 ( RanCtrl-01-Cluster01.packet1.com ) - 2008 Week 23 To 2008 Week 28 (Working Zone: Global - Medium)
16000 14000 12000 10000 16000 14000 12000 Rel other 10000 WAC Abnormal Rel
No unit
No unit
8000 6000 4000 2000 0 06/02/2008 06/09/2008 06/16/2008 06/23/2008 06/30/2008 07/07/2008
Rem: you can see a case with %HO succ > 100%. It is under investigation. If you see similar anomaly, let me know (Nicolas Palumbo) to be able to investigate the problem
INTRA WAC HO Exec Proc - WAC: waclu01 ( RanCtrl-01-Cluster01.packet1.com ) 2008 Week 23 To 2008 Week 28 (Working Zone: Global - Medium)
160.% 140.% 120.% 100.% 80.% 60.% 40.% 20.% .% 06/02/2008 06/09/2008 06/16/2008 06/23/2008 06/30/2008 07/07/2008
No unit
20000000 15000000
s
10000000
40 20 0
5000000 0
07 /1 6/ 20 08 07 /1 7/ 20 08 07 /1 8/ 20 08 07 /1 9/ 20 08 07 /2 0/ 20 08 07 /2 1/ 20 08 07 /2 2/ 20 08 07 /2 3/ 20 08
1000 0
1.2E+11 1E+11
bytes
2000000 0
2E+10 0
07 /1 6/ 20 08 07 /1 7/ 20 08 07 /1 8/ 20 08 07 /1 9/ 20 08 07 /2 0/ 20 08 07 /2 1/ 20 08 07 /2 2/ 20 08 07 /2 3/ 20 08
Handover Preparation - WAC: waclu01- 07/16/2008 To 07/23/2008 Prep fail Prep succ %Prep fail 80.00% %RAC Rej
70.00% 60.00% 50.00% 40.00% 30.00% 20.00% 10.00% 0.00% 07/16/200807/17/200807/18/200807/19/200807/20/200807/21/200807/22/200807/23/2008
%
We can see a small reduction of WAC abnormal release and a big decrease of %HO succ.
The next step is to see if it is due to one cell or if it is spread over all the cells.
No unit
INTRA WAC HO Exec Proc - WAC: waclu01- 07/16/2008 To 07/23/2008 HO fail HO succ %HO succ 100.%
90.% 80.% 70.% 60.% 50.% 40.% 30.% 20.% 10.% .% 07/16/2008 07/17/2008 07/18/2008 07/19/200807/20/2008 07/21/2008 07/22/2008 07/23/2008
%
07/23/2008
On BS Cell object, select the 3 sites with modification of hard HO hysterisis margin and check the indicators relative to HO and release. We can see the main problem is on site DANAUKOTA. Now we have to select object W-adjacency and select all the adjacencies related to DANAUKOTA site (so 1,2,3)
HO_WAC_intraWAC_succ_sBS
3 2 9 7 4 0 0 0
20 0 221 247 69 80 70 7
93 293 41 33 14 16 24 0
15 11 35 41 52 77 17 1
26 81 25 29 4 16 9 0
07/23/2008
642 762
231 732
Conclusion: the HO failures are mainly due to the adjacency: DANAUKOTA3-DANAUKOTA1 DANAUKOTA2-DANAUKOTA1 DANAUKOTA2-PPRSGBONUS1
43 | Presentation Title | Month 2006
HO_WAC_intraWAC_exec_req_sBS
6 7 56 7 50 274 10 9
274 10 9
_NP_HO_WAC_intraWAC_sBS_fail
3 5 47 0 46
Cell_DANAUKOTA1 Cell_DANAUKOTA2 Cell_DANAUKOTA3 Cell_HOTELTUNE1 Cell_HOTELTUNE2 Cell_HOTELTUNE3 Cell_PPRSGBONUS1 Cell_PPRSGBONUS2 Cell_PPRSGBONUS3 Sum of the 3 sites WACLU1
13
393
MS_rel_other
41
No unit
60000 50000 40000 30000 20000 10000 0 06/02/2008 06/09/2008 06/16/2008 06/23/2008 06/30/2008 07/07/2008
No unit
Prep ko RTVR flush expired Prep ko NRT flush expired Prep ko empty list
%
10.%
07 /1 6/ 20 08 07 /1 7/ 20 08 07 /1 8/ 20 08 07 /1 9/ 20 08 07 /2 0/ 20 08 07 /2 1/ 20 08 07 /2 2/ 20 08 07 /2 3/ 20 08
All Rights Reserved Alcatel-Lucent 2006, #####
Due to the fact the CPE is not on the good cell, the ranging req vs cdma rate is directly impacted (poor coverage and so many ranging CDMA sent)
.% 07/16/200807/17/200807/18/200807/19/200807/20/200807/21/200807/22/200807/23/2008
The 21/07/2008
51 | Presentation Title | Month 2006 All Rights Reserved Alcatel-Lucent 2006, #####
The 21/07/2008 at 15h58, WAC log traces have been collected on WACLU01. So USB key has been plugged and the WAC has transferred around 500MB of log file. It has taken around 30mn Doing actions on WAC can increase the CPU and so to have a direct influence on network behavior
07 /2 1/ 20 08 07 14 /2 1/ :0 20 0 0 07 8 14 /2 1/ :1 20 5 08 07 /2 14 1/ :3 20 0 08 07 14 /2 1/ :4 20 5 0 07 8 15 /2 1/ :0 20 0 0 07 8 15 /2 1/ :1 20 5 08 07 /2 15 1/ :3 20 0 08 07 15 /2 1/ :4 20 5 08 07 16 /2 1/ :0 20 0 0 07 8 /2 16 1/ :1 20 5 0 07 8 16 /2 1/ :3 20 0 0 07 8 16 /2 1/ :4 20 5 08 17 :0 0
70 60 50
No u nit
No u nit
Rel other WAC Abnormal Rel BS Rel WAC Normal Rel MS Rel
40 30 20 10 0
07 /21 / 07 20 0 81 /21 4:0 07 /20 0 0 /21 8 /20 14:1 07 /21 08 1 5 4:3 / 07 20 0 81 0 /21 4:4 / 07 20 0 81 5 /21 5:0 / 07 20 0 81 0 /21 5:1 07 /20 0 5 /21 8 /20 15:3 07 /21 08 1 0 5:4 / 07 20 0 81 5 /21 6:0 / 07 20 0 81 0 /21 6:1 07 /20 0 5 /21 8 /20 16:3 07 /21 08 1 0 6: /20 08 45 17 :00
40 30 20 10 0
07 /21 /20 07 08 /21 14 :00 /20 08 07 /21 14 :15 /20 07 08 /21 14 :30 /20 07 08 /21 14 :45 /20 08 07 /21 15 :00 /20 07 08 /21 15 :15 /20 07 08 /21 15 :30 /20 08 07 /21 15 :45 /20 07 08 /21 16 :00 /20 07 08 /21 16 :15 /20 08 07 /21 16 :30 /20 07 08 /21 16 :45 /20 08 17 :00
Session Ended
No u nit
50
90 80 70 60 50 40 30 20 10 0
www.alcatel-lucent.com