Mispronunciation Detection of Basic Quranic Recitation Rules using Deep Learning

by   Ahmad Al Harere, et al.

In Islam, readers must apply a set of pronunciation rules called Tajweed rules to recite the Quran in the same way that the angel Jibrael taught the Prophet, Muhammad. The traditional process of learning the correct application of these rules requires a human who must have a license and great experience to detect mispronunciation. Due to the increasing number of Muslims around the world, the number of Tajweed teachers is not enough nowadays for daily recitation practice for every Muslim. Therefore, lots of work has been done for automatic Tajweed rules' mispronunciation detection to help readers recite Quran correctly in an easier way and shorter time than traditional learning ways. All previous works have three common problems. First, most of them focused on machine learning algorithms only. Second, they used private datasets with no benchmark to compare with. Third, they did not take into consideration the sequence of input data optimally, although the speech signal is time series. To overcome these problems, we proposed a solution that consists of Mel-Frequency Cepstral Coefficient (MFCC) features with Long Short-Term Memory (LSTM) neural networks which use the time series, to detect mispronunciation in Tajweed rules. In addition, our experiments were performed on a public dataset, the QDAT dataset, which contains more than 1500 voices of the correct and incorrect recitation of three Tajweed rules (Separate stretching , Tight Noon , and Hide ). To the best of our knowledge, the QDAT dataset has not been used by any research paper yet. We compared the performance of the proposed LSTM model with traditional machine learning algorithms used in SoTA. The LSTM model with time series showed clear superiority over traditional machine learning. The accuracy achieved by LSTM on the QDAT dataset was 96 three rules (Separate stretching, Tight Noon, and Hide), respectively.


page 1

page 2

page 3

page 4


Emotion-Inspired Deep Structure (EiDS) for EEG Time Series Forecasting

Accurate forecasting of an electroencephalogram (EEG) time series is cru...

Extreme-Long-short Term Memory for Time-series Prediction

The emergence of Long Short-Term Memory (LSTM) solves the problems of va...

Use of 1D-CNN for input data size reduction of LSTM in Hourly Rainfall-Runoff modeling

An architecture consisting of a serial coupling of the one-dimensional c...

A Deep Learning Model for Heterogeneous Dataset Analysis – Application to Winter Wheat Crop Yield Prediction

Western countries rely heavily on wheat, and yield prediction is crucial...

The VVAD-LRS3 Dataset for Visual Voice Activity Detection

Robots are becoming everyday devices, increasing their interaction with ...

Estimating value at risk: LSTM vs. GARCH

Estimating value-at-risk on time series data with possibly heteroscedast...

Please sign up or login with your details

Forgot password? Click here to reset