Mini-batch sample selection strategies for deep learning based speech recognition

Dokuz, Yesim; Tufekci, Zekeriya

Mini-batch sample selection strategies for deep learning based speech recognition

Tarih

2021

Yazarlar

Dokuz, Yesim

Tufekci, Zekeriya

Yayıncı

Elsevier Sci Ltd

Erişim Hakkı

info:eu-repo/semantics/closedAccess

Özet

With the use of deep learning technologies, speech recognition systems gained more success and human-computer interactions became more prevalent. Deep learning based speech recognition systems are getting more attention and are having tremendous success in all areas of speech recognition, such as voice search, mobile communication, and personal digital assistance. However, speech recognition is still challenging due to hardness of adapting new languages, difficulty in handling variations in speech datasets, and overcoming distorting factors. Deep learning systems have the ability to overcome these challenges using high-level abstractions in the datasets by using a deep graph with multiple processing layers using training algorithms, such as gradient descent optimization. In this study, a variant of gradient descent optimization, mini-batch gradient descent is used. We proposed four strategies for selecting mini-batch samples to represent variations of each feature in the dataset for speech recognition tasks to increase model performance of deep learning based speech recognition. For this purpose, gender and accent adjusted strategies are proposed for selecting mini-batch samples. The experiments show that proposed strategies perform better in comparison with standard mini-batch sample selection strategy. (C) 2020 Elsevier Ltd. All rights reserved.

Anahtar Kelimeler

Mini-batch gradient descent, Sample selection strategies, Deep learning, Speech recognition, LSTM

Kaynak

Applied Acoustics

WoS Q Değeri

Q1

Scopus Q Değeri

Q1

Cilt

171

Bağlantı

https://doi.org/10.1016/j.apacoust.2020.107573
https://hdl.handle.net/11480/14793

Koleksiyon

WoS İndeksli Yayınlar Koleksiyonu
Scopus İndeksli Yayınlar Koleksiyonu

Detaylı Öğe Kaydı

Mini-batch sample selection strategies for deep learning based speech recognition

Tarih

Yazarlar

Dergi Başlığı

Dergi ISSN

Cilt Başlığı

Yayıncı

Erişim Hakkı

Özet

Açıklama

Anahtar Kelimeler

Kaynak

WoS Q Değeri

Scopus Q Değeri

Cilt

Sayı

Künye

Bağlantı

Koleksiyon