Automated Recognition, Processing and Sentiment Analysis of Speech

Designed an LSTM-based recurrent neural network for speech recognition using Mel Frequency Cepstral Coefficients (MFCC) and compared its performance to the baseline Convolution Neural Network model using confusion matrix
Evaluated and then fine-tuned a pre-trained CRDNN model and utilized it to design a sentiment detection model
Obtained word error rate of 0.37 and sentiment detection accuracy of 76% for our model on the Hugging Face dataset

Download Presentation ; Download Poster ; Github Repository

Share on

Twitter Facebook LinkedIn