Automated Recognition, Processing and Sentiment Analysis of Speech
- Designed an LSTM-based recurrent neural network for speech recognition using Mel Frequency Cepstral Coefficients (MFCC) and compared its performance to the baseline Convolution Neural Network model using confusion matrix
- Evaluated and then fine-tuned a pre-trained CRDNN model and utilized it to design a sentiment detection model
- Obtained word error rate of 0.37 and sentiment detection accuracy of 76% for our model on the Hugging Face dataset
Download Presentation ; Download Poster ; Github Repository