Introduction to AI (I2AI)
Neu-Ulm University of Applied Sciences
March 17, 2026
In the neural networks unit we built a language model from the ground up:
You already know how an LLM works. This unit asks what that machinery makes possible.
When you hear “Generative AI”, what comes to mind?
And what do you think it actually means for a machine to create something?
Neural networks learn to recognize patterns.
Generative AI learns to create them.
This shift from recognition to generation is fundamental (Goodfellow et al., 2016; Urbach et al., 2026):
Generative AI has rapidly transitioned from a niche research domain to a significant driver of innovation across industries (Urbach et al., 2026).
Two major families of foundational models dominate today:
Beyond standalone models, Agentic AI combines these capabilities with planning, memory, and tool use and, thus, enable AI to act, not just generate.
The introduction of ChatGPT by OpenAI in November 2022 marked a turning point:
ChatGPT is a catalyst, not the full picture.
An LLM is the transformer you already built, scaled up and trained on internet-scale text (Brown et al., 2020; Vaswani et al., 2017).
Think of the transformer as the engine and the LLM as the vehicle: the same attention, embeddings, and next-token prediction, now scaled and refined into a usable system.
The next-token prediction you saw last unit is only the first of three training phases (Ouyang et al., 2022):
Consider the tasks you do in a typical working day. Where could an LLM genuinely help? And where might it do more harm than good?
LLMs are applied across a broad spectrum of domains (Gimpel et al., 2023, 2024):
The neural networks unit introduced the core failure modes; deploying LLMs at scale raises the stakes and adds new ones (Riemer & Peter, 2023; Verma & Oremus, 2023).
Carried over from the transformer
New once you deploy
Static knowledge is the limitation that motivates retrieval-augmented generation, which we meet under agentic AI.
While LLMs generate text token by token, diffusion models generate images, video, and audio through an iterative denoising process inspired by physics (Ho et al., 2020; Urbach et al., 2026).
The core intuition:
The most prominent application of diffusion models is generating images from text descriptions (Rombach et al., 2022):
Diffusion models extend naturally to other modalities:
All three modalities share the same fundamental mechanism: embedding the prompt to iterative denoising to structured output (Liu et al., 2023; Singh, 2023).
From generation to action
Agentic AI is an emerging paradigm in AI that refers to autonomous systems designed to pursue complex goals with minimal human intervention. Acharya et al. (2025, p. 18912)
Core characteristics
| Feature | Traditional AI | Agentic AI |
|---|---|---|
| Primary purpose | Task-specific automation | Goal-oriented autonomy |
| Human intervention | High (predefined parameters) | Low (autonomous adaptability) |
| Adaptability | Limited | High |
| Environment interaction | Static or limited context | Dynamic and context-aware |
| Learning type | Primarily supervised | Reinforcement and self-supervised |
| Decision-making | Data-driven, static rules | Autonomous, contextual reasoning |
Four key components transform LLMs into agents (Urbach et al., 2026):
Anthrophic (2024) discusses five key patterns for designing agentic AI workflows:
RAG combines the generative power of LLMs with dynamic access to external, up-to-date knowledge (Lewis et al., 2020).
The mechanism:
Key advantages: factual accuracy, updatability without retraining, and interpretability (users can inspect which sources were used)
Multi-agent systems (MAS) represent the frontier of agentic AI: multiple specialised agents collaborating to solve problems beyond any single agent’s capabilities (Doran et al., 1997; Hoek & Wooldridge, 2008).
Architecture:
Benefits:
You are advising a media company that produces news articles, photographs, video segments, and podcast episodes.
A startup has built a general-purpose LLM through pretraining on a large web corpus. They now want to deploy it as a legal document assistant for law firms.
A pharmaceutical company wants to use generative AI to visualise molecular structures and simulate how proposed drug compounds might interact with target proteins. They are considering adapting diffusion models for this scientific domain.
Your university wants to deploy a student support chatbot that can answer questions about study regulations, course requirements, examination procedures, and administrative processes. The university updates its regulations each semester.
Consider the following agentic AI scenario: a university deploys an AI research assistant agent that, given a research question, autonomously searches academic databases, reads and synthesises relevant papers, identifies gaps in the literature, and produces a structured literature review draft.