Generative AI

Introduction to AI (I2AI)

Andy Weeger

Neu-Ulm University of Applied Sciences

March 17, 2026

Introduction

Discussion

When you hear “Generative AI”, what comes to mind?

And what do you think it actually means for a machine to create something?

From recognition to generation

Neural networks learn to recognize patterns. Generative AI learns to create them.

The shift is fundamental (Goodfellow et al., 2016; Urbach et al., 2026):

  • Discriminative AI asks:
    “What is this?” (i.e., classifying, predicting, deciding)
  • Generative AI asks:
    “What could this be?” (i.e., creating text, images, audio, video)
  • Instead of mapping inputs to labels, generative models learn the underlying distribution of data
  • They can then sample new instances from that distribution (i.e., producing new content)

The generative AI landscape

Generative AI has rapidly transitioned from a niche research domain to a significant driver of innovation across industries (Urbach et al., 2026).

Two major families of foundational models dominate today:

  • Large Language Models (LLMs): generate coherent, contextually relevant text
    Examples: GPT-4, Gemini, Claude, LLaMA
  • Diffusion Models: generate high-quality visual and audio content from noise
    Examples: DALL-E, Midjourney, Stable Diffusion, AudioLDM

Beyond standalone models, Agentic AI combines these capabilities with planning, memory, and tool use and thus enables AI to act, not just generate.

The catalyst: ChatGPT

The introduction of ChatGPT by OpenAI in November 2022 marked a turning point:

  • Built on the GPT architecture (Generative Pre-trained Transformer)
  • Its simple, user-friendly interface made advanced AI accessible to a mass audience
  • Within 2 months it attracted over 100 million users, one of the fastest-growing applications in digital history (Reuters, 2023)
  • Major tech companies (Microsoft, Google) immediately intensified their generative AI investments (The Verge, 2024)

ChatGPT is a catalyst, not the full picture.

Large Language Models

What are LLMs?

LLMs are neural networks trained on vast amounts of text, capable of generating coherent, contextually appropriate language (Brown et al., 2020; Vaswani et al., 2017).

Key characteristics

  • Built on the Transformer architecture with self-attention
  • Trained via next-token prediction on internet-scale text corpora
  • Process all tokens in a sequence simultaneously (unlike sequential RNNs)
  • Represent words as high-dimensional embedding vectors capturing semantic meaning
  • Scale dramatically: GPT-3 has 175 billion parameters; GPT-4 is estimated at over 1 trillion
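The self-attention mechanism behind these characteristics can be sketched in a few lines. This is a toy illustration in pure Python, not a real implementation: it uses tiny 2-dimensional "embeddings" and omits the learned query/key/value projection matrices that actual Transformers apply first.

```python
import math

def softmax(xs):
    # Numerically stable softmax over a list of scores
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    s = sum(exps)
    return [e / s for e in exps]

def self_attention(X):
    """Scaled dot-product self-attention over a toy token sequence.

    X is a list of token vectors. Queries, keys, and values are all
    taken to be X itself here; real models first apply learned
    projections W_Q, W_K, W_V.
    """
    d = len(X[0])
    out = []
    for q in X:  # one output vector per token
        # Similarity of this token to every token, scaled by sqrt(d)
        scores = [sum(qi * ki for qi, ki in zip(q, k)) / math.sqrt(d)
                  for k in X]
        weights = softmax(scores)  # attention weights sum to 1
        # Output is the attention-weighted mix of all token vectors
        out.append([sum(w * v[j] for w, v in zip(weights, X))
                    for j in range(d)])
    return out

# Three toy 2-d token embeddings, processed simultaneously
tokens = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
result = self_attention(tokens)
```

Note that every output vector is computed from all tokens at once, which is exactly why Transformers can process a sequence in parallel rather than step by step like an RNN.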

How LLMs generate text

The generation process in an LLM follows a clear probabilistic pipeline (Sanderson, 2024):

  1. Tokenization: input text is split into tokens; each token becomes a numerical ID
  2. Embedding: token IDs are mapped to high-dimensional vectors capturing semantic meaning
  3. Transformer layers: self-attention and feed-forward layers refine contextual representations
  4. Output projection: final vector is mapped to scores over the entire vocabulary
  5. Softmax & sampling: scores become probabilities; the next token is sampled
  6. Repeat: the generated token is appended to the context and the process continues
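The loop above can be sketched end to end. This toy in pure Python collapses steps 1-4 into a hand-written lookup table of scores (a real LLM computes logits with billions of Transformer parameters); the vocabulary, logits, and sentence are invented for illustration.

```python
import math
import random

random.seed(0)

# Toy vocabulary; each token's "model output" is a hand-written list
# of raw scores (logits) over the whole vocabulary, indexed by the
# previous token. A real LLM computes these with transformer layers.
VOCAB = ["the", "cat", "sat", "on", "mat", "."]
LOGITS = {
    "the": [0.1, 2.0, 0.1, 0.1, 1.5, 0.1],
    "cat": [0.1, 0.1, 2.5, 0.3, 0.1, 0.1],
    "sat": [0.2, 0.1, 0.1, 2.5, 0.1, 0.1],
    "on":  [2.5, 0.1, 0.1, 0.1, 0.5, 0.1],
    "mat": [0.1, 0.1, 0.1, 0.1, 0.1, 2.5],
    ".":   [0.5, 0.5, 0.5, 0.5, 0.5, 0.5],
}

def softmax(xs):
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    return [e / sum(exps) for e in exps]

def generate(prompt, steps=5):
    context = list(prompt)
    for _ in range(steps):            # step 6: repeat
        logits = LOGITS[context[-1]]  # steps 1-4, collapsed into a lookup
        probs = softmax(logits)       # step 5: scores -> probabilities
        token = random.choices(VOCAB, weights=probs)[0]  # sampling
        context.append(token)
        if token == ".":
            break
    return context

sentence = generate(["the"])
```

Because the next token is sampled rather than chosen deterministically, running the loop twice with different random seeds can yield different continuations of the same prompt.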

Training phases

LLMs are not trained in a single step; they go through three distinct phases (Ouyang et al., 2022):

  1. Pretraining: self-supervised learning on massive, unlabelled text corpora (internet archives, books, scientific articles); the model learns linguistic patterns, factual associations, and reasoning structures through next-token prediction
  2. Fine-tuning: supervised learning on smaller, high-quality, labelled datasets; adapts the general model to specific tasks (summarisation, Q&A, coding) with more precise, contextually relevant outputs
  3. Reinforcement Learning from Human Feedback (RLHF): human evaluators score model outputs; a reward model is trained on these scores; the LLM is then optimised to produce outputs humans prefer (i.e., aligning behaviour with expectations and ethical considerations)
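The pretraining objective in phase 1 can be stated in miniature: the loss for one position is the negative log-probability the model assigned to the token that actually followed in the training text (cross-entropy). The probability values below are made up for illustration.

```python
import math

def next_token_loss(predicted_probs, true_token, vocab):
    """Cross-entropy loss for a single next-token prediction.

    predicted_probs: the model's probability distribution over vocab.
    true_token: the token that actually appeared in the training text.
    """
    p = predicted_probs[vocab.index(true_token)]
    return -math.log(p)

vocab = ["the", "cat", "mat"]
# A confident, correct prediction is penalised little ...
low = next_token_loss([0.8, 0.1, 0.1], "the", vocab)
# ... while a confident, wrong one is penalised heavily.
high = next_token_loss([0.05, 0.05, 0.9], "the", vocab)
```

Minimising this loss over internet-scale text is what drives the model to absorb linguistic patterns and factual associations; fine-tuning and RLHF then reuse the same network with different training signals.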

Discussion

Consider the tasks you do in a typical working day. Where could an LLM genuinely help? And where might it do more harm than good?

Application scenarios

LLMs are applied across a broad spectrum of domains (Gimpel et al., 2023, 2024):

  • Content creation: drafting emails, blog posts, reports, code, and creative writing
  • Text summarisation: condensing lengthy documents into concise, actionable summaries
  • Knowledge dissemination: explaining complex concepts accessibly for varied audiences
  • Research support: structuring literature reviews, drafting paper sections, suggesting methodology
  • Customer service: powering conversational agents that handle routine queries at scale
  • Code generation: writing, explaining, and debugging software across programming languages
  • Translation: converting documents between languages while preserving context and register

Limitations of LLMs

Despite remarkable capabilities, LLMs have fundamental limitations that any responsible deployment must address (Riemer & Peter, 2023; Verma & Oremus, 2023).

  • Hallucination: Generates plausible but factually incorrect information.
  • No genuine understanding: Relies on statistical patterns rather than logic or reasoning.
  • Training data bias: Reflects and amplifies prejudices found in its source material.
  • Legal & privacy risks: Can leak sensitive data or infringe on intellectual property.
  • Static knowledge: Limited by a “cutoff date” and cannot learn post-training.
  • Resource intensive: Demands massive energy and expensive infrastructure.

Diffusion Models

A different generative paradigm

While LLMs generate text token by token, diffusion models generate images, video, and audio through an iterative denoising process inspired by physics (Ho et al., 2020; Urbach et al., 2026).

The core intuition:

  • Forward diffusion: Take a real image and gradually add Gaussian noise over many timesteps until it becomes pure random noise
  • Reverse diffusion: Train a neural network to reverse this process (to predict and remove the noise at each step)
  • At generation time: start from pure noise and apply the learned denoising process to produce coherent, realistic content
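The forward direction has a convenient closed form: the noisy sample at timestep t is a blend of the original signal and fresh Gaussian noise, with the signal's share shrinking as t grows. A minimal sketch in pure Python, using the standard linear beta schedule from Ho et al. (2020) on a toy four-value "image":

```python
import math
import random

random.seed(0)

def forward_diffusion(x0, t, T=1000, beta_start=1e-4, beta_end=0.02):
    """Noise a data point x0 forward to timestep t in one shot.

    alpha_bar is the cumulative product of (1 - beta); it tracks how
    much of the original signal survives at timestep t.
    """
    betas = [beta_start + (beta_end - beta_start) * i / (T - 1)
             for i in range(T)]
    alpha_bar = 1.0
    for b in betas[:t]:
        alpha_bar *= (1.0 - b)
    noise = [random.gauss(0.0, 1.0) for _ in x0]
    # Blend: sqrt(alpha_bar) of signal plus sqrt(1 - alpha_bar) of noise
    x_t = [math.sqrt(alpha_bar) * xi + math.sqrt(1.0 - alpha_bar) * n
           for xi, n in zip(x0, noise)]
    return x_t, alpha_bar

pixel_row = [0.9, 0.1, 0.5, 0.7]  # toy "image"
slightly_noisy, a_early = forward_diffusion(pixel_row, t=10)
almost_noise, a_late = forward_diffusion(pixel_row, t=990)
```

Early timesteps leave the signal almost intact (alpha_bar near 1), while late timesteps are essentially pure noise (alpha_bar near 0). The denoising network is trained to predict the added noise at every timestep, which is what makes the reverse, generative direction possible.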

Text-to-image generation

The most prominent application of diffusion models is generating images from text descriptions (Rombach et al., 2022):

  1. Text embedding: the text prompt is encoded into a high-dimensional semantic vector
  2. Noise-to-image: a diffusion model starts from random noise and iteratively denoises it, conditioned on the text embedding
  3. Super-resolution: the rough initial image (e.g., 64×64 pixels) is progressively upscaled through further diffusion passes to high resolution (e.g., 1024×1024)

Beyond images: video and audio

Diffusion models extend naturally to other modalities:

  • Text-to-video (Runway Gen-2, Imagen Video)
    Temporal coherence between frames is learned from large video datasets; enables visualisation of storyboards, animation, and promotional content directly from text
  • Text-to-audio (AudioLDM)
    Musical pieces and soundscapes from descriptive prompts; learns from mood, instrumentation, and genre cues embedded in text; applications in entertainment scoring, advertising jingles, and rapid audio prototyping

All three modalities share the same fundamental mechanism: the prompt is embedded, iterative denoising is conditioned on that embedding, and structured output emerges (Liu et al., 2023; Singh, 2023).

Limitations of diffusion models

  • Limited controllability: precise steering of outputs (exact positions, specific faces, fine typography) remains difficult; users often resort to trial and error for nuanced requirements (Peng, 2024)
  • Embedded bias: models trained on internet imagery inevitably reflect and can amplify societal biases relating to gender, race, and culture (“data mirror effect”) (Milne, 2023)
  • Copyright and IP concerns: outputs may closely resemble training images; legal questions about ownership, attribution, and infringement are unresolved (Brittain, 2023)
  • Deepfakes and misinformation: realistic images and videos can be used to fabricate convincing false narratives, threatening trust in media
  • Safety vs. freedom trade-offs: content filters introduce new complexities; defining universally acceptable generation criteria is culturally contested

Agentic AI

From generation to action

Definition

Agentic AI is an emerging paradigm in AI that refers to autonomous systems designed to pursue complex goals with minimal human intervention (Acharya et al., 2025, p. 18912).

Core characteristics

  • Autonomy & goal complexity: handles multiple complex goals simultaneously; operates independently over extended periods
  • Adaptability: functions in dynamic and unpredictable environments; makes decisions with incomplete information
  • Independent decision-making: learns from experience; reconceptualizes approaches based on new information

Agentic AI vs. Traditional AI

Comparison of traditional AI and agentic AI based on Acharya et al. (2025)
Feature                  Traditional AI                 Agentic AI
Primary purpose          Task-specific automation       Goal-oriented autonomy
Human intervention       High (predefined parameters)   Low (autonomous adaptability)
Adaptability             Limited                        High
Environment interaction  Static or limited context      Dynamic and context-aware
Learning type            Primarily supervised           Reinforcement and self-supervised
Decision-making          Data-driven, static rules      Autonomous, contextual reasoning

Building blocks

Four key components transform LLMs into agents (Urbach et al., 2026):

  1. Reasoning-augmented LLMs: chain-of-thought prompting and multi-path reasoning enable systematic, verifiable problem-solving rather than surface-level pattern matching
  2. Retrieval-Augmented Generation (RAG): integrates real-time access to external knowledge bases, addressing the static knowledge limitation of standard LLMs
  3. Conversational agents: maintain context over extended dialogues; bridge human intent and machine execution; manage conversation history within token limits
  4. Multi-agent systems (MAS): multiple specialised agents collaborate and delegate tasks, enabling scalable, modular architectures for complex domains
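The RAG idea in component 2 can be sketched compactly: retrieve relevant snippets from an external knowledge base, then prepend them to the prompt so the model answers from fresh, grounded context. This is a toy illustration: real systems use vector embeddings and a proper LLM, whereas here retrieval is plain word overlap and the knowledge base is hard-coded.

```python
# Hypothetical knowledge base standing in for a document store
KNOWLEDGE_BASE = [
    "The 2026 budget was approved in January 2026.",
    "Diffusion models generate images by iterative denoising.",
    "Office hours are Tuesdays from 14:00 to 16:00.",
]

def retrieve(query, documents, k=1):
    # Score each document by how many query words it shares;
    # real RAG systems compare embedding vectors instead
    q_words = set(query.lower().split())
    scored = sorted(documents,
                    key=lambda d: len(q_words & set(d.lower().split())),
                    reverse=True)
    return scored[:k]

def build_prompt(query):
    # Stuff the retrieved context into the prompt that the LLM
    # would then complete
    context = "\n".join(retrieve(query, KNOWLEDGE_BASE))
    return f"Context:\n{context}\n\nQuestion: {query}\nAnswer:"

prompt = build_prompt("When was the 2026 budget approved?")
```

Because the retrieved text is injected at query time, the answer can reflect information added to the knowledge base long after the model's training cutoff, which is precisely how RAG addresses the static-knowledge limitation.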