Multimodality and Immersion

Emergent Technologies & Media (ETM)

Andy Weeger

Neu-Ulm University of Applied Sciences

September 23, 2025

Recap

The steam enginge

Why?

Characteristics of emerging technologies

Rotolo et al. (2015) outlines five attributes that classify emerging technologies and differentiate them from other technologies:

  1. Radical novelty
  2. Relatively fast growth
  3. Coherence
  4. Prominent impact
  5. Uncertainty and ambiguity

Stages of emergence

Pre-emergence, emergence, and post-emergence: attributes and ’stylised’trends. (Rotolo et al., 2015, p. 1833)

AI — towards a definition

‘AI system’ means a machine-based systems designed to operate with varying levels of autonomy and that may exhibit adaptiveness after deployment and that, for explicit or implicit objectives, infers, from the input it received, how to generate output such as content, predictions, recommendations, or decisions, that can influence physical or virtual environment. (European Commission, 2024)

Systems that perceive, learn, think and act human-like.

AI an emergent technology

  • Radical novelty: While the underlying concepts of AI have been around for decades, its current applications are truly groundbreaking.
  • Relatively fast growth: New advancements and discoveries are happening at an incredible pace, pushing the boundaries of what’s possible.
  • Coherence: AI research isn’t happening in isolation. It draws on various disciplines like computer science, mathematics, linguistics, and neuroscience.
  • Prominent impact: AI is transforming industries and shaping the future in a significant way. Its influence is felt in healthcare, finance, entertainment, and countless other sectors.
  • Uncertainty and ambiguity: The full potential and limitations of AI are still being explored as well as the manifold ethical concerns.

Discussion

What are distinct characteristics of emerging digital technologies?

Hypothesis 1

Emerging digital technologies enable multimodal and immersive experiences.

Multimodality

Discussion

Which senses have you used when you have interacted with (information) technology?

Give examples.

How the
computer
sees us.

For traditional computers, humans are reduced to an eye and a finger.
Courtesy Dan O’Sullivan and Tom Igoe.

Multimodality — definition

Multimodality

Using more than one mode of communication to create meaning simultaneously.

Multimodality emphasizes the importance of multiple modes (e.g., visual, linguistics, audio, spatial, gestural) to form overall understanding of a message.

Multimodality is a theoretical concept about communication and meaning-making.

Multimedia — definition

Multimedia

Using more than one media formats in the presentation of content.

Multimedia emphasizes the technical format and delivery systems used to present information.
A multimedia presentation might include slides with text, embedded videos, and audio narration.

Multimedia is a practical term about content delivery and technology.

Distinction

Every multimodal system is a multimedia system
as it uses multiple methods for content delivery.

Not every multimedia system is multimodal
as it might just throw different media together without considering how they work together.

Reading

Writing

Texting

Multimodal interactions
are natural

Usage of modalities in current computer systems

Usage of modalities in current computer systems

Advantages of multimodal systems

Improved accuracy and robustness, enhanced bandwidth, flexibility and user preference, naturalness and ease of use, redundancy and error correction, accessibility, support for complex tasks.

Fundamental problems

Multimodal fusion

The integration of communication modalities in interactive systems (Input)

and

Multimodal fission

The re-partitioning of information among several communication modalities (Output)

Human-machine interaction loop

A representation of multimodal man machine interaction loop based on Dumas et al. (2009, p. 8)

 

 

 

 

 

 

Multimodal AI in action

Example: Multimodal AI

Example: The Future of Windows

Exercise

Modality examples

Identify specific examples of multimodal systems. Analyze which modalities are used for input (data and control) as well as output (data and control).

Homework

Read Suh & Prophet (2018) and figure out what immersive technologies are and how this concept relates to multimodality.

Recap

AI as emergent tech

Many systems currently discussed under the umbrella-term AI qualify as emergent technologies.

What is AI and why can it be considered as emergent tech?

‘AI system’ means a machine-based systems designed to operate with varying levels of autonomy and that may exhibit adaptiveness after deployment and that, for explicit or implicit objectives, infers, from the input it received, how to generate output such as content, predictions, recommendations, or decisions, that can influence physical or virtual environment. (European Commission, 2024)

Systems that perceive, learn, think and act human-like.

Hypothesis 1

Emerging digital technologies enable multimodal and immersive experiences.

Multimodality

Multimodality

Using more than one mode of communication (e.g., visual, linguistics, audio, spatial or gestural) to create meaning simultaneously.

The main challenges in creating multimodal systems are:

Multimodal fusion and fission

That is, the integration of communication modalities in interactive systems (input fusion) and the re-partitioning of information among several communication modalities (output fission).

Immersiveness

Immersion in games

Exercise

Take 10 minutes and recap your findings from reading Suh & Prophet (2018) along following questions:

  1. How can immersive technology be defined?
  2. What are examples of immersive technology provided in the paper?
  3. What technological features are key to enhancing user experience in immersive environments?
  4. How does immersiveness relate to multimodality?
  5. What fields of application are revealed in the paper?

Immersion

Immersion refers to the state of being deeply engaged, absorbed, or submerged in an environment, either physically or mentally.

Immersion implies that the consciousness of the immersed person is detached from their physical self. Immersiveness is the quality or degree of being immersive.

Discussion

Have you ever felt detached from your physical self? What caused that feeling?

Immersive technology

Immersive technology blurs the line between the physical, virtual, and simulated worlds, thereby creating a sense of immersion.

Technology has different abilities to create a sense of presence and engagement in the user.

Immersive?

Design challenge

Redesign the weather experience
to be more immersive.

Group with your neighbor and ideate on a more immersive weather experience.

Guiding questions:

  • What modalities will you add?
  • How do they work together (multimodal fusion)?
  • What stimuli create the feeling of immersion??
  • How are the stimuli created?
  • What limitation do you hit with current technologies?

Mechanisms

The feeling of immersion is created by temporarily altering a person’s sense of presence by tricking their cognitive and perceptual systems into believing they are in a place other than their actual physical location.

Virtuality continuum

Simplified representation of a “virtuality continuum” based on Milgram & Kishino (1994)

 

 

Stimuli

Important stimuli that determine the immersiveness of environments created by technology are

Visual stimuli, auditory stimuli, tactile stimuli, olfactory stimuli, and interactive stimuli.

In order to create these, technology needs visual displays with high representational fidelity, auditory and haptic interfaces, olfactory delivery systems (e.g., scent devices), and the capability to track movements and enable real-time interaction.

Augmented reality (AR)

Augmented reality refers to the combination of a real scene viewed by a user with a virtual scene that augments the scene with additional information.

AR technology superimposes virtual objects onto a live view of physical environments, helping users visualize how these objects fit into their physical world.

Discussion

Immersion refers to the state of being deeply engaged, absorbed, or submerged in an environment, either physically or mentally.

How is immersion reflected in augmented reality?

Hardware components

AR technologies require following hardware components:

  1. Devices for multi-sensory output: visual displays, audio systems, haptic actuators, and (potentially) olfactory generators
  2. Input devices to detect users’ interactions with real and virtual objects
  3. Sensors to capture users’ positions and movements
  4. Computing devices that combine input data and create the virtual overlays

Advancements in mobile computing and telecommunications infrastructures (will) increase the mobility and, thus, usability of these devices significantly.

Exercise

What are examples of AR technologies?

Briefly introduce one example and name application fields and benefits.

Virtual reality (VR)

Virtual Reality refers to technology that generates an interactive virtual environment that is designed to simulate a real life experience.

VR technology reveals different levels of immersion ranging from non-immersive VR (e.g., second life) to immersive VR (e.g., PlayStation VR2).

Exercise

Search for immersive technologies used in the healthcare context.

Take 15 minutes and …

  • research and portray a technology based on the characteristics discussed here;
  • discuss the reasons for the prominent impact of immersive technologies in healthcare;
  • and prepare yourself to present your findings.

Q&A

Literature

Adami, E. (2016). Introducing multimodality. The Oxford Handbook of Language and Society, 451–472.
Dumas, B., Lalanne, D., & Oviatt, S. (2009). Multimodal interfaces: A survey of principles, models and frameworks. In Human machine interaction: Research results of the mmi program (pp. 3–26). Springer.
European Commission. (2024). Artificial intelligence in the european commission — a strategic vision to foster the development and use of lawful, safe and trustworthy artificial intelligence systems in the european commission. C(2024) 380.
Handa, M., Aul, E. G., & Bajaj, S. (2012). Immersive technology–uses, challenges and opportunities. International Journal of Computing & Business Research, 6(2), 1–11.
Lee, H.-G., Chung, S., & Lee, W.-H. (2013). Presence in virtual golf simulators: The effects of presence on perceived enjoyment, perceived value, and behavioral intention. New Media & Society, 15(6), 930–946.
Milgram, P., & Kishino, F. (1994). A taxonomy of mixed reality visual displays. IEICE TRANSACTIONS on Information and Systems, 77(12), 1321–1329.
Oviatt, S. (1999). Ten myths of multimodal interaction. Communications of the ACM, 42(11), 74–81.
Rotolo, D., Hicks, D., & Martin, B. R. (2015). What is an emerging technology? Research Policy, 44(10), 1827–1843.
Suh, A., & Prophet, J. (2018). The state of immersive technology research: A literature analysis. Computers in Human Behavior, 86, 77–90.
Sunyaev, A. (2020). Internet computing: Principles of distributed systems and emerging internet-based technologies (1st ed.). Springer Nature Switzerland AG.