Beruflich Dokumente
Kultur Dokumente
2
5. an editable button that specifies the frame shift, Rm , (in msec) for short-time analysis; (the default value is
Rm = 10 msec),
6. a popupmenu button that lets the user choose a pitch range for search for the current pitch period estimate
depending on the gender of the talker; (the default is Male pitch range),
7. an editable button that specifies the AMDF threshold, amdfthresh, for determining that a frame of signal is
voiced (minimum AMDF below threshold) or not voiced (minimum AMDF above threshold),
8. a text button that displays the starting sample, ss, of the current frame for frame analysis; (the default value is
1 for starting sample),
9. a pushbutton to determine the single frame starting sample, ss, using the iterative method described below; this
starting sample defines the current analysis frame,
10. a pushbutton to run the analysis code and display the signal processing results using the current frame of the
speech signal; this button can be pressed and used as often as desired, changing one or more analysis parameters
while keeping the frame starting sample the same,
11. a pushbutton to run the analysis code and display the signal processing results using the next frame of signal;
i.e., the frame with starting sample set to ss+R where R is the frame shift in samples; this button can be pushed
repeatedly to provide a frame-by-frame analysis,
12. a pushbutton to run the pitch detector code and to display the smoothed pitch period and confidence score
contours on the graphics panels,
13. a pushbutton to close the GUI.
Figure 1: Sequence of waveform plots defining how the user can interactively choose a starting sample for the current
analysis frame.
4
8. hit the Run Current Frame button to initiate single frame analysis of the speech beginning at the current frame
starting sample, ss; the results of AMDF analysis are shown in the various graphical plots; the Run Current
Frame button can be hit repeatedly after making changes in the analysis frame parameters; a red vertical line
indicates the estimate of pitch period for the current frame of speech,
9. hit the Run Next Frame button to initiate single frame analysis on the next frame of speech, i.e., where the
starting sample of the next frame is set to ss+R, where R is the frame shift in samples,
10. hit the Run Pitch Detector to run in pitch detection mode; the resulting pitch period contour is displayed in
the upper graphics panel, and the amdf minimum value (which serves as a confidence score) is displayed in the
lower graphics panel,
11. experiment with different choices of speech file, and with different values for Lm , Rm , gender and amdfthresh,
12. hit the Close GUI button to terminate the run.
Examples of the graphical output obtained from this exercise using the speech file:
we were away a year ago suzanne.wav
are shown in Figure 2 (for the frame mode graphics), and in Figure 3 (for the pitch detection mode graphics). The
displays for the frame mode graphics are the short-time analysis frame (upper graphics panel) and the short-time amdf
function (lower graphics panel). The displays for the pitch detection mode graphics are the smoothed pitch period
contour (upper graphics panel) and the smoothed confidence score contour (lower graphics panel).
Figure 2: Plots of short-time AMDF analysis in frame mode: the upper graphics panel shows the current speech frame,
and the lower graphics panel shows the short-time amdf with the best estimate of pitch period denoted by a red vertical
line at the pitch period location.
Figure 3: Plots of amdf pitch detector output. The upper graphics panel shows the smoothed pitch period contour. The
lower graphics panel shows the smoothed confidence scores.