Investigation of the Effect of LSTM Hyperparameters on Speech Recognition Performance

Dokuz, Yeşim; Tüfekçi, Zekeriya

Investigation of the Effect of LSTM Hyperparameters on Speech Recognition Performance

dc.contributor.author	Dokuz, Yeşim
dc.contributor.author	Tüfekçi, Zekeriya
dc.date.accessioned	2024-11-07T13:19:01Z
dc.date.available	2024-11-07T13:19:01Z
dc.date.issued	2020
dc.department	Niğde Ömer Halisdemir Üniversitesi
dc.description.abstract	With the recent advances in hardware technologies and computational methods, computers became more powerful for analyzingdifficult tasks, such as speech recognition and image processing. Speech recognition is the task of extraction of text representation ofa speech signal using computational or analytical methods. Speech recognition is a challenging problem due to variations in accents and languages, powerful hardware requirements, big dataset needs for generating accurate models, and environmental factors thataffect signal quality. Recently, with the increasing processing ability of hardware devices, such as Graphical Processing Units, deeplearning methods became more prevalent and state-of-the-art method for speech recognition, especially Recurrent Neural Networks(RNNs) and Long-Short Term Memory (LSTMs) networks which is a variant of RNNs. In the literature, RNNs and LSTMs are usedfor speech recognition and the applications of speech recognition with various parameters, i.e. number of layers, number of hiddenunits, and batch size. It is not investigated that how the parameter values of the literature are selected and whether these values couldbe used in future studies. In this study, we investigated the effect of LSTMs hyperparameters on speech recognition performance interms of error rates and deep architecture cost. Each parameter is investigated separately while other parameters remain constant andthe effect of each parameter is observed on a speech corpus. Experimental results show that each parameter has its specific values forthe selected number of training instances to provide lower error rates and better speech recognition performance. It is shown in thisstudy that before selecting appropriate values for each LSTM parameters, there should be several experiments performed on thespeech corpus to find the most eligible value for each parameter.
dc.identifier.doi	10.31590/ejosat.araconf21
dc.identifier.endpage	168
dc.identifier.issn	2148-2683
dc.identifier.issue	Ejosat Özel Sayı 2020 (ARACONF)
dc.identifier.startpage	161
dc.identifier.trdizinid	364761
dc.identifier.uri	https://doi.org/10.31590/ejosat.araconf21
dc.identifier.uri	https://search.trdizin.gov.tr/tr/yayin/detay/364761
dc.identifier.uri	https://hdl.handle.net/11480/12828
dc.identifier.volume	0
dc.indekslendigikaynak	TR-Dizin
dc.language.iso	en
dc.relation.ispartof	Avrupa Bilim ve Teknoloji Dergisi
dc.relation.publicationcategory	Makale - Ulusal Hakemli Dergi - Kurum Öğretim Elemanı
dc.rights	info:eu-repo/semantics/openAccess
dc.snmz	KA_20241107
dc.subject	Bilgisayar Bilimleri
dc.subject	Yazılım Mühendisliği
dc.subject	Bilgisayar Bilimleri
dc.subject	Sibernitik
dc.subject	Bilgisayar Bilimleri
dc.subject	Bilgi Sistemleri
dc.subject	Bilgisayar Bilimleri
dc.subject	Donanım ve Mimari
dc.subject	Bilgisayar Bilimleri
dc.subject	Teori ve Metotlar
dc.subject	Akustik
dc.subject	Bilgisayar Bilimleri
dc.subject	Yapay Zeka
dc.title	Investigation of the Effect of LSTM Hyperparameters on Speech Recognition Performance
dc.type	Article

Koleksiyon

TR-Dizin İndeksli Yayınlar Koleksiyonu

Investigation of the Effect of LSTM Hyperparameters on Speech Recognition Performance

Dosyalar

Koleksiyon