1. bookVolume 72 (2021): Issue 2 (December 2021)
    NLP, Corpus Linguistics and Interdisciplinarity
Journal Details
License
Format
Journal
eISSN
1338-4287
First Published
05 Mar 2010
Publication timeframe
2 times per year
Languages
English
access type Open Access

StressDat – Database of speech under stress in Slovak

Published Online: 30 Dec 2021
Volume & Issue: Volume 72 (2021) - Issue 2 (December 2021) - NLP, Corpus Linguistics and Interdisciplinarity
Page range: 579 - 589
Journal Details
License
Format
Journal
eISSN
1338-4287
First Published
05 Mar 2010
Publication timeframe
2 times per year
Languages
English
Abstract

The paper describes methodology for creating a Slovak database of speech under stress and pilot observations. While the relationship between stress and speech characteristics can be utilized in a wide domain of speech technology applications, its research suffers from the lack of suitable databases, particularly in conversational speech. We propose a novel procedure to record acted speech in the home of actors and using their own smartphones. We describe both the collection of speech material under three levels of stress and the subsequent annotation of stress levels in this material. First observations suggest a reasonable inter-annotator agreement, as well as interesting avenues for the relationship between the intended stress levels and those perceived in speech.

Keywords

[1] Yogesh, C. K. et al. (2017). Bispectral features and mean shift clustering for stress and emotion recognition from natural speech. In Computers & Electrical Engineering, 62, pages 676–691. Search in Google Scholar

[2] Robinson, C., and Nicolas, R. A. (2019). Sequence-to-sequence modelling of f0 for speech emotion conversion. In ICASSP 2019, pages 6830–6834.10.1109/ICASSP.2019.8683865 Search in Google Scholar

[3] Rusko, M., Trnka, M., Darjaa, S., Stelkens-Kobsch, T., and Finke, M., (2018). Weaknesses of voice biometrics – sensitivity of speaker verification to emotional arousal. In ICSV25: 25th International Congress on Sound and Vibration. Hiroshima, Japan, pages 1–8. Search in Google Scholar

[4] Alimuradov, A. K. et al. (2020). Development of Natural Emotional Speech Database for Training Automatic Recognition Systems of Stressful Emotions in Human-Robot Interaction. In 4th Scientific School on Dynamics of Complex Networks and their Application in Intellectual Robotics (DCNAIR), pages 11–16.10.1109/DCNAIR50402.2020.9216940 Search in Google Scholar

[5] Busso, C. et al. (2008). IEMOCAP: Interactive emotional dyadic motion capture database. In Language resources and evaluation, pages 335–359.10.1007/s10579-008-9076-6 Search in Google Scholar

[6] Hansen, J. H. et al. (1997). Getting started with SUSAS: a speech under simulated and actual stress database. In Eurospeech, 97(1), pages 1743–1746.10.21437/Eurospeech.1997-494 Search in Google Scholar

[7] Burkhardt, F. et al. (2005). A database of German emotional speech. In Ninth European Conference on Speech Communication and Technology.10.21437/Interspeech.2005-446 Search in Google Scholar

[8] Campbell, N. (2000). Databases of emotional speech. In ISCA Tutorial and Research Workshop (ITRW) on Speech and Emotion. Search in Google Scholar

[9] Rusko, M., Darjaa, S., Trnka, M., Sabo, R., and Ritomský, M. (2015). Expressive Speech Synthesis for Critical Situations. COMPUTING AND INFORMATICS, 33(6), pages 1312–1332. Search in Google Scholar

[10] Enos F., and Hirschberg J. (2006). A framework for eliciting emotional speech: Capitalizing on the actors process. In First International Workshop on Emotion: Corpora for Research on Emotion and Affect LREC 2006, Genoa, Italy, pages 6–10. Search in Google Scholar

[11] Wolpe, J. (1969). The Practice of Behavior Therapy. Pergamon Press, 314 p. Search in Google Scholar

[12] Accessible at: https://ccp.net.au/suds-thermometer/. Search in Google Scholar

[13] SATIE project: “D4.2 – Traffic Management Intrusion and Compliance System”, Status: submitted. Search in Google Scholar

[14] Accessible at: https://en.wikipedia.org/wiki/Standard_score. Search in Google Scholar

[15] Fleiss, J. L. (1971). Measuring nominal scale agreement among many raters. Psychological Bulletin, 76(5), pages 378–382.10.1037/h0031619 Search in Google Scholar

Recommended articles from Trend MD

Plan your remote conference with Sciendo