Large Language Models are increasingly used to generate technical content: documentation, reports, and even conference presentations. The results are often fluent, confident, and well-structured, which makes their mistakes harder to spot.
In this talk, we run a simple experiment: an LLM is asked to generate an entire presentation on General Relativity, covering gravitational time dilation, gravitational waves, and black holes, using real scientific sources. The output looks convincing: it has equations, citations, and, hidden among them, misconceptions. Several explanations are subtly but fundamentally wrong.
General Relativity is an unforgiving domain. Concepts that sound intuitive (“light slows down in gravity”, “gravitational waves are ripples in space”, “black holes suck everything in”) fail as soon as you frame them in terms of measurements, observables, and invariants. This makes physics an ideal stress test for AI-generated explanations.
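To see why the first of these fails, consider what is actually measured: locally, light always travels at c; a distant observer sees clocks ticking at different rates, not light slowing down. One standard way to make this precise, using the textbook Schwarzschild metric (an illustration, not taken from the generated slides):

```latex
% Gravitational time dilation for a static clock at radius r in the
% Schwarzschild metric: proper time d\tau versus far-away coordinate time dt.
d\tau = \sqrt{1 - \frac{2GM}{r c^{2}}}\, dt
% The observable is a ratio of clock rates (equivalently, the redshift
% \nu_\infty / \nu_r = \sqrt{1 - 2GM/(r c^{2})}), not a changed speed of light.
```

Explanations that cannot be restated in terms of such measurable ratios are exactly the ones the talk dissects.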
Using the generated slides as a case study, we show:
- where LLMs consistently succeed (structure, narrative, pedagogy),
- where they fail (measurement-based reasoning and physical constraints),
- and how to design agent pipelines that combine AI generation with deterministic validation and human review (a minimal sketch follows below).
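To make that last point concrete, here is a minimal sketch of such a pipeline, assuming sympy for the deterministic step; `llm_generate_formula` is a hypothetical stand-in for the model call, not a real API:

```python
# Sketch: generate -> deterministic validation -> human review.
import sympy as sp

# Reference expression: the Schwarzschild time-dilation factor.
G, M, r, c = sp.symbols("G M r c", positive=True)
REFERENCE = sp.sqrt(1 - 2 * G * M / (r * c**2))

def llm_generate_formula() -> str:
    """Hypothetical stand-in for an LLM call; returns a candidate formula."""
    return "sqrt(1 - 2*G*M/(r*c**2))"

def validate(candidate: str) -> bool:
    """Deterministic check: compare expressions symbolically, not as text."""
    try:
        expr = sp.sympify(candidate, locals={"G": G, "M": M, "r": r, "c": c})
    except sp.SympifyError:
        return False
    return sp.simplify(expr - REFERENCE) == 0

candidate = llm_generate_formula()
if validate(candidate):
    print("accepted:", candidate)
else:
    print("flagged for human review:", candidate)
```

The design point is that the validator compares meaning rather than surface text: an algebraically equivalent formula passes even if written differently, while a plausible-looking wrong one is routed to a human.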
