Call for Papers

Multimodal human understanding and analysis is an emerging research area that cuts through several disciplines like Computer Vision, Natural Language Processing (NLP), Speech Processing, Human-Computer Interaction, and Multimedia. Several multimodal learning techniques have recently shown the benefit of combining multiple modalities in image-text, audio-visual and video representation learning and various downstream multimodal tasks. At the core, these methods focus on modelling the modalities and their complex interactions by using large amounts of data, different loss functions and deep neural network architectures. However, for many Web and Social media applications, there is the need to model the human, including the understanding of human behaviour and perception. For this, it becomes important to consider interdisciplinary approaches, including social sciences, semiotics and psychology. The core is understanding various cross-modal relations, quantifying bias such as social biases, and the applicability of models to real-world problems. Interdisciplinary theories such as semiotics or gestalt psychology can provide additional insights and analysis on perceptual understanding through signs and symbols via multiple modalities. In general, these theories provide a compelling view of multimodality and perception that can further expand computational research and multimedia applications on the Web and Social media.

The theme of the MUWS workshop, multimodal human understanding, includes various interdisciplinary challenges related to social bias analyses, multimodal representation learning, detection of human impressions or sentiment, hate speech, sarcasm in multimodal data, multimodal rhetoric and semantics, and related topics. The MUWS workshop will be an interactive event and include keynotes by relevant experts, poster and demo sessions, research presentations and discussion.

  • Modeling human impressions in the context of the Web and Social Media
  • Cross-modal and semantic relations
  • Incorporating multi-disciplinary theories such as Semiotics or Gestalt-Theory into multimodal analyses
  • Measuring and analyzing biases such as cultural bias, social bias, multilingual bias, and related topics in the context of the Web and Social Media
  • Multimodal human perception understanding
  • Multimodal sentiment/emotion/sarcasm recognition
  • Multimodal hate speech detection
  • Multimodal misinformation detection
  • Multimodal content understanding and analysis
  • Multimodal rhetoric in online media

Important dates

  • Submission deadline: April 7, 2024
  • Paper notification: April 21, 2024
  • Camera ready: April 25, 2024
  • Workshop date: June 10, 2024 in Phuket, Thailand

Submission Page


If you have any problems or questions, please contact us via e-mail at:

Copyright (c) 2022-2024, MUWS Workshop; all rights reserved.