Accurate parameter estimation for an articulatory speech synthesizer with an improved neural network mapping
Küçük Resim Yok
Tarih
2001
Yazarlar
Dergi Başlığı
Dergi ISSN
Cilt Başlığı
Yayıncı
Erişim Hakkı
info:eu-repo/semantics/openAccess
Özet
Neural network (NN) applications have recently been employed to extract the parameters of an articulatory speech synthesizer from a given speech signal. Results from these attempts showed that a single NN is insufficient to cover all of the possible configurations uniquely. Moreover, apart from their computational advantages, NN mapping is so far not superior to the other mapping techniques [1]. Thus there is a clear need to improve NN solution to the inverse problem. Results from our earlier experiments with an articulatory speech synthesizer have shown that the statistical characteristic of the articulatory target pattern vectors can be exploited for an improvement in the estimation performance of a Multi-Layer Perceptron (MLP) NN [2]. In this paper, the effect of the modification to the distribution characteristic of the acoustic input pattern vectors will be investigated. The theoretical background for the effect of the input distribution characteristics on neural learning has been detailed elsewhere [3]. Empirical results for a more correct estimation of articulatory speech synthesizer parameters through exploiting the behavior of the Back Propagation (BP) algorithm are focused on here.
Neural network (NN) applications have recently been employed to extract the parameters of an articulatory speech synthesizer from a given speech signal. Results from these attempts showed that a single NN is insufficient to cover all of the possible configurations uniquely. Moreover, apart from their computational advantages, NN mapping is so far not superior to the other mapping techniques [1]. Thus there is a clear need to improve NN solution to the inverse problem. Results from our earlier experiments with an articulatory speech synthesizer have shown that the statistical characteristic of the articulatory target pattern vectors can be exploited for an improvement in the estimation performance of a Multi-Layer Perceptron (MLP) NN [2]. In this paper, the effect of the modification to the distribution characteristic of the acoustic input pattern vectors will be investigated. The theoretical background for the effect of the input distribution characteristics on neural learning has been detailed elsewhere [3]. Empirical results for a more correct estimation of articulatory speech synthesizer parameters through exploiting the behavior of the Back Propagation (BP) algorithm are focused on here.
Neural network (NN) applications have recently been employed to extract the parameters of an articulatory speech synthesizer from a given speech signal. Results from these attempts showed that a single NN is insufficient to cover all of the possible configurations uniquely. Moreover, apart from their computational advantages, NN mapping is so far not superior to the other mapping techniques [1]. Thus there is a clear need to improve NN solution to the inverse problem. Results from our earlier experiments with an articulatory speech synthesizer have shown that the statistical characteristic of the articulatory target pattern vectors can be exploited for an improvement in the estimation performance of a Multi-Layer Perceptron (MLP) NN [2]. In this paper, the effect of the modification to the distribution characteristic of the acoustic input pattern vectors will be investigated. The theoretical background for the effect of the input distribution characteristics on neural learning has been detailed elsewhere [3]. Empirical results for a more correct estimation of articulatory speech synthesizer parameters through exploiting the behavior of the Back Propagation (BP) algorithm are focused on here.
Açıklama
Anahtar Kelimeler
Mühendislik, Elektrik ve Elektronik
Kaynak
Turkish Journal of Electrical Engineering and Computer Sciences
WoS Q Değeri
Scopus Q Değeri
Q3
Cilt
9
Sayı
2