RAMAS: Russian Multimodal Corpus of Dyadic Interaction for Affective Computing

Olga Perepelkina (ORCID: 0000-0001-9357-8407),
Evdokia Kazimirova &
Maria Konstantinova

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 11096)

Included in the following conference series:

International Conference on Speech and Computer

Abstract

Emotion expression encompasses various types of information, including face and eye movement, voice and body motion. Emotions collected from real conversations are difficult to classify using a single channel, which is why multimodal techniques have recently become more popular in automatic emotion recognition. Multimodal databases that include audio, video, 3D motion capture and physiological data are quite rare. We collected the Russian Acted Multimodal Affective Set (RAMAS), the first multimodal corpus in the Russian language. Our database contains approximately 7 h of high-quality close-up video recordings of faces, speech, motion-capture data and physiological signals such as electrodermal activity and photoplethysmogram. The subjects were 10 actors who played out interactive dyadic scenarios. Each scenario involved one of the basic emotions (Anger, Sadness, Disgust, Happiness, Fear or Surprise) and characteristics of social interaction such as Domination and Submission. To record the emotions that the subjects actually felt during the process, we asked them to fill in short questionnaires (self-reports) after each played scenario. The recordings were annotated by 21 annotators, with at least five annotators marking each scenario. We present our multimodal data collection, annotation process, inter-rater agreement analysis and a comparison between the self-reports and the received annotations. RAMAS is an open database that provides the research community with multimodal data on the interrelation of faces, speech, gestures and physiology. Such material is useful for a range of investigations and for the development of automatic affective systems.

Acknowledgements

Supported by Neurodata Lab LLC. The authors would like to thank Elena Arkova for finding the actors and helping with the scenarios and experimental procedure, and Irina Vetrova for evaluating the emotional intelligence of the annotators with the MSCEIT V2.0 test.

Author information

Authors and Affiliations

Neurodata Lab LLC, Miami, FL, USA
Olga Perepelkina, Evdokia Kazimirova & Maria Konstantinova
Lomonosov Moscow State University, Moscow, Russia
Olga Perepelkina & Maria Konstantinova

Corresponding author

Correspondence to Olga Perepelkina.

Editor information

Editors and Affiliations

SPIIRAS, St. Petersburg, Russia
Alexey Karpov
Leipzig University of Telecommunications, Leipzig, Germany
Oliver Jokisch
Moscow State Linguistic University, Moscow, Russia
Rodmonga Potapova

About this paper

Cite this paper

Perepelkina, O., Kazimirova, E., Konstantinova, M. (2018). RAMAS: Russian Multimodal Corpus of Dyadic Interaction for Affective Computing. In: Karpov, A., Jokisch, O., Potapova, R. (eds) Speech and Computer. SPECOM 2018. Lecture Notes in Computer Science, vol 11096. Springer, Cham. https://doi.org/10.1007/978-3-319-99579-3_52

DOI: https://doi.org/10.1007/978-3-319-99579-3_52
Published: 25 August 2018
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-99578-6
Online ISBN: 978-3-319-99579-3
eBook Packages: Computer Science, Computer Science (R0)
