TY - JOUR
T1 - A Comparison of Fitness Functions in a Genetic Algorithm for Acoustic–Articulatory Parameter Inversion of Vowels
AU - Drayton, J
AU - Miranda, ER
AU - Kirke, A
PY - 2017/7/15
Y1 - 2017/7/15
N2 - Articulatory speech synthesis provides an alternative to the state of the art concatenative and formant systems, holding potential for more versatile and expressive artificial speech due to its physical modelling basis. However, a major limitation of practical articulatory synthesis is gaining adequate control of the complex underlying physical models, which stems from a lack of articulatory data. In an effort to procure more data, a Genetic Algorithm approach to Acoustic-Articulatory Parameter Inversion is taken. This paper presents the initial results from testing a number of fitness functions for the Acoustic-Articulatory Parameter Inversion of three vowels, /a/, /o/, and /e/. Three feature vector representations of the vowels were tested; Hertz, Mel-scale, and Cents, in conjunction with three distance metrics. The distance metrics defined the fitness score by calculating the similarity between a candidate and targets feature vector. A Voiced/Un-Voiced constraint was also added as a penalty function, and an indicator of loudness was implemented using a Root Mean Square based co-efficient. The results indicated that certain combinations of the above could lead to convergence towards all three vowels. However, the quality of convergence was not uniform.
AB - Articulatory speech synthesis provides an alternative to the state of the art concatenative and formant systems, holding potential for more versatile and expressive artificial speech due to its physical modelling basis. However, a major limitation of practical articulatory synthesis is gaining adequate control of the complex underlying physical models, which stems from a lack of articulatory data. In an effort to procure more data, a Genetic Algorithm approach to Acoustic-Articulatory Parameter Inversion is taken. This paper presents the initial results from testing a number of fitness functions for the Acoustic-Articulatory Parameter Inversion of three vowels, /a/, /o/, and /e/. Three feature vector representations of the vowels were tested; Hertz, Mel-scale, and Cents, in conjunction with three distance metrics. The distance metrics defined the fitness score by calculating the similarity between a candidate and targets feature vector. A Voiced/Un-Voiced constraint was also added as a penalty function, and an indicator of loudness was implemented using a Root Mean Square based co-efficient. The results indicated that certain combinations of the above could lead to convergence towards all three vowels. However, the quality of convergence was not uniform.
UR - https://pearl.plymouth.ac.uk/context/ada-research/article/1042/viewcontent/GECCO_2017.pdf
U2 - 10.1145/3067695.3076112
DO - 10.1145/3067695.3076112
M3 - Conference proceedings published in a journal
VL - 0
JO - Proceedings of Genetic and Evolutionary Computation Conference
JF - Proceedings of Genetic and Evolutionary Computation Conference
IS - 0
T2 - Genetic and Evolutionary Computation Conference 2017
Y2 - 15 July 2017 through 19 July 2017
ER -