Beruflich Dokumente
Kultur Dokumente
in VoIP
Shveni P Mehta
ABSTRACT
Voice over IP is an upcoming technology that enables voice
communication through the Internet. Packet-based network links
are shared between different connections, which gives rise to
interaction between various traffic types. Excessive delay, packet
loss, and high delay jitter all impair the communication quality.
Although voice over IP is very economical, people become
hesitant to use it due to the above-mentioned facts which affect
the voice quality greatly. Delay jitter and packet loss are the two
factors that affect the voice quality the most. The delay jitter
manifests itself as packet loss in the dejitter buffer, which drops
the late packets. It is impossible to remove packet loss from the
network but it can definitely be minimized. Several techniques
have been proposed to minimize the effect of packet loss. This
paper describes some of those techniques and then compares the
following two techniques on the basis of certain criteria:
1) A New Technique for improving quality of speech in voice
over IP using time-scale modification [1]
2) Adaptive playout scheduling and loss concealment for voice
communication over IP Network [3]
MOS
1. INTRODUCTION
For non-real-time applications like Telnet, FTP, and email, TCP
offers reliable delivery of data, but for real-time applications like
voice over IP, TCP is not appropriate because it introduces too
much delay in creating that reliability. The three main problems
occurring in real-time applications like Voice over IP (VoIP) are:
1) End-to-End delay: The total delay experienced by the packet
from the sender till it reaches the receiver.
2) Jitter: The variation in packet interarrival time. The
difference between when the packet is expected and when it
is actually received is jitter.
3) Packet loss: Loss of voice packets from sender to receiver.
The total packet loss is composed of two elements: 1) packets lost
over the network due to congestion, and 2) packets arriving late
after their expected playout time that are discarded by the
receiver. The jitter caused by variable delays in the network is
ultimately translated into the effect of packet loss in the network,
as the packets arriving after the playout time are considered as
lost. Another factor that could affect the quality of VoIP is the
choice of codec used to transform and compress analog signals
into digital signals. Currently available codecs are listed in Table
1.
The Mean Opinion Score (MOS) is a measure to determine or
compare the quality of audio transmissions. It is widely applied
Keywords
VoIP, Packet loss, Adaptive playout, Time-Scale Modification.
Bitrate
64 kbps
64 kbps
40 kbps
32 kbps
24 kbps
16 kbps
13 kbps
8 kbps
5.3 kbps
1
2
3
4
5
2.1.1 Interleaving
This technique distributes the effect of the lost packets in order to
reduce the impact on quality. The information of a speech part is
distributed in multiple packets. The data units are regrouped in a
crossed form before transmission such that they are distributed,
and at the receiver they are rearranged in their original form.
Thus, instead of losing the whole packet small parts from
distributed packets are lost, Figure 1.
1
Before
Transmission
Interleaving
Lost
2.1.2
Repetition:
2.1.3
The data are interleaved before sending and then any missing part
is substituted using the repetition technique at the receiver.
2.1.5
Simple Interpolation:
2.1.4
3. EVALUATION OF TECHNIQUES
The most important characteristics in determining a good
solution for VoIP include the following:
1) Receiver-based technique, as they are faster and independent of
the network delay characteristics, and also have lower
computational overhead at the sender or network [3].
2) Minimizes overall delay and packet loss.
3) Flexible arrival delay cut-offs for late arriving packets, to
reduce the packet loss-rate at the receiver.
4) Uses adaptive playout to minimize overall delay and effects of
lost packets.
5) Low complexity.
6) Maintains pitch frequencies and intelligibility of speech.
7) Technique itself includes the solution for recovering from burst
losses.
Of the techniques described above, the two that best meet these
criteria and are similar enough to be compared effectively are:
1) Time-scale modification Approach [1].
2) Adaptive Playout scheduling and Loss Concealment [3].
The specific characteristics to be considered for comparison are:
1) Receiver-based
2) Minimizes overall delay and packet loss
3) Flexible arrival delay cut-offs to late arriving packets, reducing
the packet loss-rate at the receiver
4) Considers both portions of packet lost, i.e., packets lost in the
network due to congestion at some intermediate node and at the
receiver due to packets arriving later than their scheduled playout
times
5) Generic and relative computational overhead
Seq. #
3.5
3
Seq. #
10
20
30
37 40
5
PESQ Score
(a)
50
6
2.5
2
1.5
1
0.5
0
0
13
25
37
42
10
20
30
40
50
30
40
50
30
40
50
RPL %
50
4
PESQ Score
Seq. #
3.5
(b)
0
13
25
37 40
50
3
2.5
2
1.5
1
0.5
0
0
10
20
RPL %
4
3.5
3
PESQ Score
2.5
2
1.5
1
0.5
0
0
10
20
RPL %
. Network Delay
___ Total End-to-End Delay
Notation
t si
tri
tpi
dni
dbi
db
dti
dmax
dmaxi
n
l
b
R
P
B
N
Lo
Li
Lo
i+1
Playout
dmaxi
i+4
tr
tr i
dni
i+3
ts
Receiver
i+2
dbi
tp i
tp
Time
Li
Pitch Period
1
2
Input
Input
Packet
Input
Template
4
Input
Packet
i-2
Weighting Window
3
2
Found
Similar
Segment
Lost
i
i-1
i+1
i+2
Output
i- 1
Extend to 2Lo
Similar Segment
i+2
i+1
Extend to 1.3Lo
(a)
Input
Search Region
2
1
Search Region
4
3
i -1
Lost
i+2
i+1
i+3
i+4
L
Output
Realignment
3 Template
4
Lost
i
and
Overlap Add
i-1
i+1
Extend to 2Lo
i+3
Extend to 1.3Lo
Extend to 2Lo
(b)
Input
i-1
Similar Segment
Lost
Lost
i+1
i+4
L
i+2
i+3
5
Output
1/ 2
2/3
3/4
Output
2/3
3/4
i+1
i-1
Extend to 2Lo
Waveform Repetition
(c)
1
(a)
160
160
(b)
Time (Sample)
Used for Correlation
i+2
Extend to 1.3Lo
i+3
Time
STD of
Network
Delay (ms)
Maximum
Jitter
(ms)
1
2
3
4
23.7
15.9
5.9
13.7
130.0
86.0
39.0
47.0
Link
Loss
Rate (%)
STD of
Total
Delay (ms)
Buffering
Delay
(ms)
Total
Loss
Rate (%)
Burst
Loss
Rate (%)
MOS
0
8.3
0
0
7.5
8.5
2.6
7.4
54.6
26.1
23.0
25.7
2.8
8.3
0.3
1.1
0.7
0
0
0
3.7
2.8
4.3
4.1
1
2
Receiver-based
Minimizes overall
delay and packet loss
Considers both
portions of packet
loss, i.e., packets lost
in the network due to
congestion at some
intermediate node and
at the receiver due to
packets arriving later
than their scheduled
playout times
Yes
Yes
Yes
Yes
Yes
5.0 REFERENCES
[1] Agnihotri, S., Aravindhan, K., Jamadagni, H.S., Pawate,
B.I. A new technique for improving quality of speech in
Voice Over IP using time-scale modification, Acoustics,
Speech, and Signal Processing, 2002 Proceedings. (ICASSP
'02). IEEE International Conference on, Volume: 2, 2002
Pages:2085 2088.
[2] Duysburgh, B., Vanhastel, S., De Vreese, B., Petrisor, C.,
Demeester, P. On the influence of best-effort network
conditions on the perceived speech quality of VoIP
connections, Computer Communications and Networks,
2001 Proceedings. Tenth International Conference on 15-17
Oct 2001, Pages 334-339.