Please use this identifier to cite or link to this item:
http://studentrepo.iium.edu.my/handle/123456789/10994
Title: | Speech emotion recognition and depression prediction using deep neural networks | Authors: | AlGhifari, Muhammad Fahreza | Supervisor: | Teddy Surya Gunawan, Ph.D Mimi Aminah Wan Nordin, Ph.D Nik Nur Wahidah Nik Hashim, Ph.D |
Subject: | Speech processing systems Signal processing -- Digital techniques Neural networks (Computer science) |
Year: | 2021 | Publisher: | Kuala Lumpur : Kulliyyah of Engineering, International Islamic University Malaysia, 2021 | Abstract in English: | Speech signals contain ample information from which computers can gain insight into the user's state, including emotion recognition and depression prediction. The applications are numerous, from customer service to suicide prevention due to depression. In this research, we propose several deep-learning-based methodologies to detect emotion, as well as depression. Deep neural networks variations such as deep feedforward networks and convolutional networks were used. The deep learning model training, multi-languages emotion and depression database have been utilized, using well-known databases such as the Berlin Emotion Database and DAIC-WOZ Depression Dataset. For speech emotion recognition, the algorithm yields an accuracy of 80.5% across 4 languages, English, German, French and Italian. For depression detection, the current algorithm obtains an accuracy of 60.1% tested with the DAIC-WOZ dataset. This research has also created the Sorrow Analysis Dataset – an English depression audio dataset that contains 64 individuals samples of depressed and not-depressed. Further testing achieved an average accuracy of 97% with 5-fold validation using 1-dimensional convolutional networks. Finally, a prototype currently in development with Skymind Xpress.ai is presented, outlining the design and possible applications in the real world. It has been shown that the model is capable of performing both training and inference on a Raspberry Pi 3B+. | Call Number: | t TK 7882 S65 G423S 2021 | Kullliyah: | Kulliyyah of Engineering | Programme: | Master of Computing (Computer Science and Information Technology) | URI: | http://studentrepo.iium.edu.my/handle/123456789/10994 |
Appears in Collections: | KOE Thesis |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
t11100437191MuhammadFahrezaAlGhifari_24.pdf | 24 pages file | 355.6 kB | Adobe PDF | View/Open |
t11100437191MuhammadFahrezaAlGhifari_SEC.pdf Restricted Access | Full text secured file | 3.2 MB | Adobe PDF | View/Open Request a copy |
Google ScholarTM
Check
Items in this repository are protected by copyright, with all rights reserved, unless otherwise indicated. Please give due acknowledgement and credits to the original authors and IIUM where applicable. No items shall be used for commercialization purposes except with written consent from the author.