Multimodal Speech Sentiment Analysis

Rohan Katyal, Sanskriti Agrahari, Shagun Chauhan, Md. Shahid

Abstract

In today’s world, communication is not limited to text; it can take many forms, including speech, visual, and textual cues. Drawing on these multiple modalities makes sentiment analysis both easier and more accurate. This paper introduces Multimodal Speech Sentiment Analysis (MSSA), which integrates Convolutional Neural Network (CNN) and Bidirectional Encoder Representations from Transformers (BERT) models for comprehensive analysis of sentiments expressed through multiple modalities, such as acoustic features from speech. The paper sets a new standard for sentiment analysis by incorporating speech data alongside conventional textual analysis. We focus on multimodal approaches not only to interpret sentiments accurately but also to capture the complexity of human expression in the real world.
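The abstract does not specify how the CNN speech stream and the BERT text stream are combined, so the following is only a minimal late-fusion sketch under stated assumptions: a toy 1-D convolution stands in for one CNN layer over acoustic frame features, a precomputed hypothetical vector stands in for a BERT sentence embedding, and a logistic layer scores the fused representation. All names, kernel values, and weights here are illustrative, not the paper's actual architecture.

```python
import math

def conv1d(signal, kernel):
    """Valid-mode 1-D convolution: a stand-in for one CNN layer
    over acoustic features (e.g. frame-level energies)."""
    n = len(signal) - len(kernel) + 1
    return [sum(signal[i + j] * k for j, k in enumerate(kernel)) for i in range(n)]

def relu(xs):
    return [max(0.0, x) for x in xs]

def mean_pool(xs):
    return sum(xs) / len(xs)

def fuse_and_classify(acoustic_frames, text_embedding, weights, bias):
    """Late fusion: pool the CNN output of the speech stream, concatenate it
    with a (hypothetical, precomputed) BERT-style text embedding, and score
    sentiment with a single logistic layer."""
    edge_kernel = [1.0, -1.0]          # toy fixed kernel; a real CNN learns these
    speech_feat = mean_pool(relu(conv1d(acoustic_frames, edge_kernel)))
    fused = [speech_feat] + list(text_embedding)
    z = sum(w * f for w, f in zip(weights, fused)) + bias
    return 1.0 / (1.0 + math.exp(-z))  # probability of positive sentiment

# Hypothetical inputs: per-frame acoustic energies and a 3-d "BERT" vector.
frames = [0.2, 0.9, 0.4, 0.8, 0.1]
text_vec = [0.5, -0.2, 0.7]
score = fuse_and_classify(frames, text_vec, weights=[0.6, 1.0, 0.3, 0.8], bias=-0.4)
print(round(score, 3))
```

In practice each stream would be a trained network (e.g. a CNN over MFCC spectrograms and a fine-tuned BERT encoder), but the fusion step, concatenating pooled per-modality features before classification, has the same shape as this sketch.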
