Abstract
The ability to recognize emotions in speech-based applications represents a significant advancement in human-computer interaction, offering a more intuitive and empathetic interface between users and technology. This paper presents a framework for speech-based emotional artificial intelligence detection utilizing an Efficient Capsule Network (CapsNet) model, trained and validated with the CREMA-D dataset. The Efficient-CapsNet model's unique architecture excels in capturing the intricate spatial relationships within speech data, resulting in superior accuracy and robustness compared to traditional models. This enhanced capability enables real-time, nuanced emotion detection, making it applicable in diverse domains such as mental health monitoring, customer service, and interactive entertainment. The use of the CREMA-D dataset ensures the model is well-equipped to handle a wide range of emotional expressions, facilitating generalization across varied real-world scenarios. The findings demonstrate the model's efficiency in terms of computational load and processing speed, highlighting its suitability for deployment in real-time applications. This research underscores the importance of emotion recognition in improving user experiences, providing timely mental health support, and creating more engaging and empathetic interactions in technology-driven environments. Future work will focus on optimizing the model, expanding its emotional detection capabilities, and ensuring ethical standards are upheld in its deployment.
| Original language | English |
|---|---|
| Title of host publication | 2025 IEEE 15th Annual Computing and Communication Workshop and Conference, CCWC 2025 |
| Editors | Rajashree Paul, Arpita Kundu |
| Place of Publication | usa |
| Publisher | Institute of Electrical and Electronics Engineers Inc. |
| Pages | 178-182 |
| Number of pages | 5 |
| ISBN (Electronic) | 9798331507695 |
| DOIs | |
| State | Published - Jan 1 2025 |
| Event | 15th IEEE Annual Computing and Communication Workshop and Conference, CCWC 2025 - Las Vegas, United States Duration: Jan 6 2025 → Jan 8 2025 |
Conference
| Conference | 15th IEEE Annual Computing and Communication Workshop and Conference, CCWC 2025 |
|---|---|
| Country/Territory | United States |
| City | Las Vegas |
| Period | 01/6/25 → 01/8/25 |
UN SDGs
This output contributes to the following UN Sustainable Development Goals (SDGs)
-
SDG 3 Good Health and Well-being
Keywords
- Artificial intelligence
- audio
- emotions
- machine learning
- neural networks
- speech
- video
Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver