Downloads 769

..............................

Views 2k

..............................

Cited by

..............................

Received date February 14, 2018

Accepted date April 13, 2018

Using Deep Learning for Automatically Determining Correct Application of Basic Quranic

Author Jordan University of Science and Technology, Jordan,

Keywords #Articulation rules (Ahkam Al-Tajweed) #Mel-Frequency Cepstral Coefficient (MFCC) #Linear predictive Code (LPC) #Wavelet Packet Decomposition (WPD) #Hidden Markov Model based Spectral Peak Location (HMM-SPL) #Convolutional Deep Belief Network (CDBN); k-Nearest Neighbors (KNN); Support Vector Machines (SVM); Artificial Neural Network (NN) #Random Forest (RF) #multiclass classifier #bagging; t-Test

Abstract Quranic Recitation Rules (Ahkam Al-Tajweed) are the articulation rules that should be applied properly when reciting the Holy Quran. Most of the current automatic Quran recitation systems focus on the basic aspects of recitation, which are concerned with the correct pronunciation of words and neglect the other Ahkam Al-Tajweed that are related to the rhythmic and melodious way of recitation such as where to stop and how to “stretch” or “merge” certain letters. The only existing works on the latter parts are limited in terms of the rules they consider or the parts of Quran they cover. This paper comes to fill these gaps. It addresses the problem of identifying the correct usage of Ahkam Al-Tajweed in the entire Quran. Specifically, we focus on eight Ahkam Al-Tajweed faced by early learners of recitation. In the first part of our work, we used traditional audio processing techniques for feature extraction (such as Linear predictive Code (LPC), Mel-Frequency Cepstral Coefficient (MFCC), Wavelet Packet Decomposition (WPD) and Markov Model based Spectral Peak Location (HMM-SPL)) and classification (such as k-Nearest Neighbors (KNN), Support Vector Machines (SVM), and Random Forest (RF)) on an in- house dataset of thousands of audio recordings covering all occurrences of the rules under consideration in the entire Holy Quran by different reciters of both genders. In this part, we show how to improve the classification accuracy to surpass 97.7% by incorporating deep learning techniques. Specifically, this result is obtained by incorporating most traditional features with ones extracted using Convolutional Deep Belief Network (CDBN) while the classification is performed using SVM.

References

[1] Abdurrahman S., Abdo S., Khalil A., and Rashwan M., Enhancing Usability of CAPL System for Qur'an Recitation Learning, in Proceedings of Interspeech, Antwerp, pp. 214- 217, 2007.

[2] Abro B., Naqvi A., and Hussain A., Qur'an Recognition for The Purpose of Memorisation Using Speech Recognition Technique, in Proceedings of the 15th International Multitopic Conference, Islamabad, pp. 30-34, 2012.

[3] Al-Ayyoub M., Nuseir A., Alsmearat K., Jararweh,Y., and Gupta B., Deep Learning for Arabic NLP: A Survey, Journal of Computational Science, vol. 26, pp. 522-531, 2017.

[4] Al-Ayyoub M., Rihani M., Dalgamoni N., and Abdulla N., Spoken Arabic Dialects Identification: The case of Egyptian and Jordanian Dialects, in Proceedings of the 5th International Conference on Information and Communication Systems, Irbid, pp. 1-6, 2004.

[5] Asda T., Gunawan T., Kartiwi M., and Mansor, H., Development of Quran Reciter Identification System Using MFCC and Neural Network, Indonesian Journal of Electrical Engineering and Computer Science, vol. 1, no. 1, pp. 168-175, 2016.

[6] Breiman L., Bagging Predictors, Machine Learning, vol. 24, no. 2, pp. 123-140, 1996.

[7] Damer N., Al-Ayyoub M., and Hmeidi I., Automatically Determining Correct Application of Basic Quranic Recitation Rules, in Proceedings of the International Arab Conference on Information Technology, Yassmine Hammamet, 2017.

[8] Ehab M., Ahamd S., and Mousa A., Speaker Independent Quranic Recognizer Based on Maximum Likelihood Linear Regression, International Journal of Electrical and Computer Engineering, vol. 1, no. 12, pp. 1755-1761, 2007.

[9] Geiger J., Schuller B., and Rigoll G., Large- Scale Audio Feature Extraction And SVM For Acoustic Scene Classification, in Proceedings of the Workshop on Applications of Signal Processing to Audio and Acoustics, New Paltz, pp. 1-4, 2013.

[10] Hassan H., Nasrudin N., Khalid M., Zabidi A., and Yassin A., Pattern Classification in Recognizing Qalqalah Kubra Pronuncation using Multilayer Perceptrons, in Proceedings of the International Symposium on Computer Applications and Industrial Electronics, Kota Kinabalu, pp. 209-212, 2012.

[11] Ibrahim N., Razak Z., Idris M., Mohd, Tamil E., Yakub M., Yusoff Z., and Rahman N., Quranic Verse Recitation Recognition Module for Support in J-QAF Learning: a Review, International Journal of Computer Science and Network Security, vol. 8, no. 8, pp. 207-216, 2008.

[12] Ibrahim N., Razak Z., Yusoff Z., Idris M., and Tamil E., Quranic Verse Recitation Feature Extraction using Mel-Frequency Cepstral Coefficient (MFCC), in Proceedings of the 4th International Colloquium on Signal Processing and its Applications, Kuala Lumpur, 2008.

[13] Ibrahim N., Yusoff Z., and Razak Z., Quranic verse Recitation Recognition Module for Educational Programme, in Proceedings of the International Seminar on Research in Islamic Studies, Kuala Lumpur, 2008.

[14] Ibrahim J., Mohd. Y., Zaidi R., and Noor N., Automated Tajweed Checking Rules Engine for Quranic Learning, Multicultural Education and Technology Journal, vol. 7, no. 4, pp. 275-287, 2013. Using Deep Learning for Automatically Determining Correct Application of ... 625

[15] Lee H., Pham P., Largman Y., and Ng Y., Unsupervised Feature Learning for Audio Classification using Convolutional Deep Belief Networks, in Proceedings of Neural Information Processing Systems, Vancouver, 2009.

[16] Lin C., and Wang H., Language Identification Using Pitch Contour Information, in Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, Philadelphia, pp. 1-1, 2005.

[17] Norouzi M., Ranjbar M., and Mori G., Stacks of Convolutional Restricted Boltzmann Machines for Shift-Invariant Feature Learning, in Proceedings of Computer Vision and Pattern Recognition, Miami, pp. 2735-2742, 2009.

[18] Peeters G., Automatic Classification of Large Musical Instrument Databases Using Hierarchical Classifiers with Inertia Ratio Maximization, Audio Engineering Society Convention, 2003.

[19] Ridhwan M., Zeki A., and Olowolayemo A., Differential Qiraat Processing Applications using Spectrogram Voice Analysis, in \proceedings of the International Conference on Data Mining, Multimedia, Image Processing and their Applications, Kuala Lumpur, 2016.

[20] Rodriguez-Galiano V., Ghimire B., Rogan J., Chica-Olmo M., and Rigol-Sanchez, J., An Assessment of the Effectiveness of a Random Forest Classifier for Land-Cover Classification, ISPRS Journal of Photogrammetry and Remote Sensing, vol. 67, pp. 93-104, 2012.

[21] Rashwan M., Metwally S., Abdou S., Nazih W., Hamid O., Shahin M., and Samir A., Computer Aided Pronunciation Learning System using Speech Recognition Techniques, in Proceedings of Interspeech, Pittsburgh, 2006.

[22] Tabbal H., El Falou W., and Monla B., Analysis and Implementation of a Quranic Verses Delimitation System in Audio Files using Speech Recognition Techniques, in Proceedings of the 2nd International Conference on Information and Communication Technologies, Damascus, 2006.

[23] Tzanetakis G., and Cook P., Musical Genre Classification of Audio Signals, IEEE Transactions On Speech And Audio Processing, vol. 10, no. 5, pp. 293-302, 2002.

[24] Waqar M., Muhammad R., Muhammad A., and Martines-Enriguez A., Voice Content Matching System for Quran Readers, in Proceedings of the Artificial Intelligence, Pachuca, pp. 148-153, 2010.

[25] Witten I., Frank E., Trigg L., Hall M., Holmes G., and Cunningham S., Weka: Practical machine learning tools and techniques with Java implementations, Seminar, University of Waikato, 1999. Mahmoud Al-Ayyoub received his M.Sc. and Ph.D. degrees in Computer Science from the State University of New York at Stony Brook, NY, USA, in 2006 and 2010, respectively. He is currently an associate professor of Computer Science and the vice dean of the Faculty of Computer and Information Technology (ICT) at the Jordan University of Science and Technology (JUST), Irbid, Jordan. His research interests include cloud computing, high performance computing, machine learning and AI. Nour Alhuda Damer received her M.Sc. degree in Computer Science from the Jordan University of Science and Technology (JUST) in 2017. Her research interests include image and speech processing, machine learning and AI. Ismail Hmeidi received his M.Sc. degree in Computer Science from Eastern Michigan University, USA, in 1987, and his Ph.D. degree in Computer Science from Illinois Institute of Technology, USA, in 1995. He is currently aprofessor of Computer Information Systems at the Jordan University of Science and Technology (JUST), Irbid, Jordan. His research interests include Information Retrieval and Natural Language Processing.

,abstract={Quranic Recitation Rules (Ahkam Al-Tajweed) are the articulation rules that should be applied properly when reciting the Holy Quran. Most of the current automatic Quran recitation systems focus on the basic aspects of recitation, which are concerned with the correct pronunciation of words and neglect the other Ahkam Al-Tajweed that are related to the rhythmic and melodious way of recitation such as where to stop and how to “stretch” or “merge” certain letters. The only existing works on the latter parts are limited in terms of the rules they consider or the parts of Quran they cover. This paper comes to fill these gaps. It addresses the problem of identifying the correct usage of Ahkam Al-Tajweed in the entire Quran. Specifically, we focus on eight Ahkam Al-Tajweed faced by early learners of recitation. In the first part of our work, we used traditional audio processing techniques for feature extraction (such as Linear predictive Code (LPC), Mel-Frequency Cepstral Coefficient (MFCC), Wavelet Packet Decomposition (WPD) and Markov Model based Spectral Peak Location (HMM-SPL)) and classification (such as k-Nearest Neighbors (KNN), Support Vector Machines (SVM), and Random Forest (RF)) on an in- house dataset of thousands of audio recordings covering all occurrences of the rules under consideration in the entire Holy Quran by different reciters of both genders. In this part, we show how to improve the classification accuracy to surpass 97.7% by incorporating deep learning techniques. Specifically, this result is obtained by incorporating most traditional features with ones extracted using Convolutional Deep Belief Network (CDBN) while the classification is performed using SVM.},
keywords={Articulation rules (Ahkam Al-Tajweed), Mel-Frequency Cepstral Coefficient (MFCC), Linear predictive Code (LPC), Wavelet Packet Decomposition (WPD), Hidden Markov Model based Spectral Peak Location (HMM-SPL), Convolutional Deep Belief Network (CDBN); k-Nearest Neighbors (KNN); Support Vector Machines (SVM); Artificial Neural Network (NN), Random Forest (RF), multiclass classifier, bagging; t-Test},
ISSN={2413-9351},
month={Jan}}