Speaker Details

Tiffany Jernigan
Grafana Labs

Tiffany is senior developer advocate at Grafana Labs and a CNCF Ambassador. She also formerly worked as a software developer and developer advocate at VMware, Amazon, Docker, and Intel. Prior to that, she graduated from Georgia Tech with a degree in electrical engineering. In her free time, she likes to travel and dabble in photography. You can find her at tiffanyfay.dev (and for Bluesky) and elsewhere on linktr.ee/tiffanyfay.

View
Breaking conversational barriers through multi-modal distributed AI agents
Conference (INTERMEDIATE level)
Banquet

GenAI is no longer a new buzzword and we currently stand at the convergence of speech, vision and intelligence. The boundaries of human interactions with machines are blurring into a more seamless, interactive ecosystem. This talk will present an approach to building distributed, multi-modal AI agents at scale.

Our live demonstration will showcase:

- Voice capture: The Automatic Speech Recognition (ASR) service will transcribe the voice input into a precise text prompt

- Visual Generation: The text prompt will generate a contextually rich image

- Text Generation: The generated image will be analyzed, producing an ALT text description

We’ll explore how these agents can be run at scale, using a globally distributed system. We’ll share the thought process behind making these architectural decisions and what it can take to go from prototype to production in this space. We’ve also instrumented the entire system with distributed tracing to provide visibility into each stage of the pipeline.

Whether you’re curious about building AI agents or want to take your GenAI projects to production, this talk will show you what that journey can look like—end to end.

More

Searching for speaker images...