Thursday
Room 2 - Level 3
16:20 - 17:20
(UTC±00)
Talk (60 min)
Using GPT Visual Capabilities to Solve a Wordle Puzzle
The visual capabilities of GPT-4 open up new scenarios of possibility with a multimodal model.
In this session, we will explore what this model can do, and rather than just showing a perfect polished final demo, I will walk you through my entire journey of trying to use the model to solve Wordle puzzles, starting with "Hello World". Along the way, you will gain a good understanding of the model's capabilities, along with learning some prompt engineering techniques that drove progress in this journey (along with what didn't work!). We'll close with a live demo to attempt to solve today's Wordle! This session will tackle a fun problem, but the underlying prompt engineering techniques for image understanding that you will learn are applicable to a wide variety of business problems.
