Тип публикации: доклад, тезисы доклада, статья из сборника материалов конференций
Конференция: International Conference on Language Resources and Evaluation, LREC 2016; Portoroz; Portoroz
Год издания: 2016
Ключевые слова: Adaptive speech-based emotion recognition, Classification performances, Corpora evaluation
Аннотация: Emotion Recognition (ER) is an important part of dialogue analysis which can be used in order to improve the quality of Spoken Dialogue Systems (SDSs). The emotional hypothesis of the current response of an end-user might be utilised by the dialogue manager component in order to change the SDS strategy which could result in a quality enhancement. In this study additional speaker-related information is used to improve the performance of the speech-based ER process. The analysed information is the speaker identity, gender and age of a user. Two schemes are described here, namely, using additional information as an independent variable within the feature vector and creating separate emotional models for each speaker, gender or age-cluster independently. The performances of the proposed approaches were compared against the baseline ER system, where no additional information has been used, on a number of emotional speech corpora of German, English, Japanese and Russian. The study revealed that for some of the corpora the proposed approach significantly outperforms the baseline methods with a relative difference of up to 11.9%.
Издание
Журнал: Proceedings of the 10th International Conference on Language Resources and Evaluation, LREC 2016
Номера страниц: 61-68
Персоны
- Sidorov M. (Ulm University)
- Schmitt A. (Ulm University)
- Minker W. (Ulm University)
- Semenkin E. (Siberian State Aerospace University)
Вхождение в базы данных
Информация о публикациях загружается с сайта службы поддержки публикационной активности СФУ. Сообщите, если заметили неточности.