Computer Aided Qur’an Pronunciation using DNN

Al-Marri, Mubarak; Raafat, Hazem; Abdallah, Mustafa; Abdou, Sherif; Rashwan, Mohsen

doi:10.3233/JIFS-169508

Issue title: Intelligent and Fuzzy Systems applied to Language & Knowledge Engineering

Guest editors: David Pinto, Vivek Kumar Singh, Aline Villavicencio, Philipp Mayr-Schlegel and Efstathios Stamatatos

Article type: Research Article

Authors: Al-Marri, Mubarak^a | Raafat, Hazem^{a; *} | Abdallah, Mustafa^b | Abdou, Sherif^c | Rashwan, Mohsen^b

Affiliations: [a] Computer Science Department, Kuwait University, Kuwait | [b] Faculty of Engineering, Cairo University, Egypt | [c] Faculty of Computers and Information, Cairo University, Egypt

Correspondence: [*] Corresponding author. Hazem Raafat, Computer Science Department, Kuwait University, Kuwait. E-mail: hazem@cs.ku.edu.kw.

Abstract: This paper presents a system for improving pronunciation error detection and correction in Qur’an recitation by non-Arabic speakers. Most classical speech recognition systems are built on the Hidden Markov Model (HMM) with a Gaussian Mixture Model (GMM). This paper attempts to improve on the GMM-HMM model’s performance by using Deep Neural Networks (DNNs). The major part of the work lies in collecting and processing speakers’ data, and in building and evaluating the baseline GMM system and the proposed DNN acoustic models for the Qur’an recitation framework. To address some pronunciation problems and improve the overall performance of such a speech recognition system, we replace the mixture of Gaussians with a DNN. The DNN-HMM model outperforms the GMM-HMM model by 1.02% based on HTK’s word accuracy equation. On insertion errors, the DNN-HMM model improves on the GMM-HMM model by 2.59%. On substitution errors, it improves on the confusable phonemes DAA and DHA by 15.09% and 17.28%, respectively. All experiments and results are presented and discussed in detail.
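
For readers unfamiliar with the metric named in the abstract, the following is a minimal sketch of the accuracy figures as HTK's HResults tool defines them: percent correct is (N - D - S) / N and accuracy is (N - D - S - I) / N, where N is the number of reference labels and D, S and I are the deletion, substitution and insertion counts. The function names and the counts below are hypothetical and chosen only for illustration; they are not taken from the paper's experiments, and the abstract does not say how its percentage gains were derived from these quantities.

```python
def htk_percent_correct(n: int, deletions: int, substitutions: int) -> float:
    """HTK-style percent correct: (N - D - S) / N * 100."""
    if n <= 0:
        raise ValueError("n (number of reference labels) must be positive")
    return 100.0 * (n - deletions - substitutions) / n


def htk_accuracy(n: int, deletions: int, substitutions: int, insertions: int) -> float:
    """HTK-style accuracy: (N - D - S - I) / N * 100.

    Unlike percent correct, this also penalizes insertion errors,
    so it can be negative when insertions are very frequent.
    """
    if n <= 0:
        raise ValueError("n (number of reference labels) must be positive")
    return 100.0 * (n - deletions - substitutions - insertions) / n


if __name__ == "__main__":
    # Hypothetical counts for illustration only (not results from the paper).
    n, d, s, i = 1000, 20, 50, 30
    print(f"Percent correct: {htk_percent_correct(n, d, s):.2f}%")
    print(f"Accuracy:        {htk_accuracy(n, d, s, i):.2f}%")
```

Because accuracy subtracts insertions while percent correct does not, a reduction in insertion errors such as the one the abstract reports shows up in the accuracy figure but leaves percent correct unchanged.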

Keywords: Computer Aided Language Pronunciation, Hidden Markov Model, Automatic Speech Recognition, Deep Neural Network

Journal: Journal of Intelligent & Fuzzy Systems, vol. 34, no. 5, pp. 3257-3271, 2018

Published: 24 May 2018
