Abstract
This work describes a speech fundamental period estimation algorithm that estimates the time of excitation of the vocal tract using a pattern classifier, the multi-layer perceptron (MLP). The pattern classifier was trained using speech semi-automatically labelled by means of an algorithm that makes use of the output from a Laryngograph. Various issues arising in the training of the system were explored. Three basic configurations of the system were compared using different pre-processing strategies. It was found that processing the sampled speech time - waveform directly with the pattern classifier gave better results than using one of two filterbanks. The performance of the algorithm was evaluated against that of a simple peak-picking algorithm and the well known cepstrum algorithm using quantitative frequency contour comparisons. The performance of the new algorithm on a difficult set of test data was shown to be better than the peak-picker and comparable to the cepstrum algorithm. The advantage of the scheme is that fundamental period estimates are made on a period-by-period basis, thus preserving the irregularity in the speech excitation that is lost by techniques that produce as average period estimate. In addition, its simple structure lends itself to real-time implementation (Howard & Walliker, 9; Walliker & Howard, 14).
Original language | English |
---|---|
Pages (from-to) | 340-344 |
Number of pages | 0 |
Journal | IEE Conference Publication |
Volume | 0 |
Issue number | 349 |
Publication status | Published - 1 Dec 1991 |