Searching for just a few words should be enough to get started. If you need to make more complex queries, use the tips below to guide you.
Article type: Research Article
Authors: Vaishnavi, V.a; * | Suveetha Dhanaselvam, P.b
Affiliations: [a] Department of Electrical and Electronics Engineering, Sethu Institute of Technology, Pulloor, Tamil Nadu, India | [b] Department of Electronics and Communication Engineering, Velammal College of Engineering and Technology, Madurai, Tamil Nadu, India
Correspondence: [*] Corresponding author. V. Vaishnavi, Department of Electrical and Electronics Engineering, Sethu Institute of Technology, Pulloor, Tamil Nadu 626115, India. E-mail: suveethapapers@gmail.com.
Abstract: The study of neonatal cry signals is always an interesting topic and still researcher works interminably to develop some module to predict the actual reason for the baby cry. It is really hard to predict the reason for their cry. The main focus of this paper is to develop a Dense Convolution Neural network (DCNN) to predict the cry. The target cry signal is categorized into five class based on their sound as “Eair”, “Eh”, “Neh”, “Heh” and “Owh”. Prediction of these signals helps in the detection of infant cry reason. The audio and speech features (AS Features) were exacted using Mel-Bark frequency cepstral coefficient from the spectrogram cry signal and fed into DCNN network. The systematic DCNN architecture is modelled with modified activation layer to classify the cry signal. The cry signal is collected in different growth phase of the infants and tested in proposed DCNN architecture. The performance of the system is calculated through parameters accuracy, specificity and sensitivity are calculated. The output of proposed system yielded a balanced accuracy of 92.31%. The highest accuracy level 95.31%, highest specificity level 94.58% and highest sensitivity level 93% attain through proposed technique. From this study, it is concluded that the proposed technique is more efficient in detecting cry signal compared to the existing techniques.
Keywords: Infant cry signal, spectrogram images, audio and speech features, mel-bark frequency cepstral domain, dense convolution neural network
DOI: 10.3233/JIFS-212473
Journal: Journal of Intelligent & Fuzzy Systems, vol. 42, no. 6, pp. 6103-6116, 2022
IOS Press, Inc.
6751 Tepper Drive
Clifton, VA 20124
USA
Tel: +1 703 830 6300
Fax: +1 703 830 2300
sales@iospress.com
For editorial issues, like the status of your submitted paper or proposals, write to editorial@iospress.nl
IOS Press
Nieuwe Hemweg 6B
1013 BG Amsterdam
The Netherlands
Tel: +31 20 688 3355
Fax: +31 20 687 0091
info@iospress.nl
For editorial issues, permissions, book requests, submissions and proceedings, contact the Amsterdam office info@iospress.nl
Inspirees International (China Office)
Ciyunsi Beili 207(CapitaLand), Bld 1, 7-901
100025, Beijing
China
Free service line: 400 661 8717
Fax: +86 10 8446 7947
china@iospress.cn
For editorial issues, like the status of your submitted paper or proposals, write to editorial@iospress.nl
如果您在出版方面需要帮助或有任何建, 件至: editorial@iospress.nl