An LLM With A Visual Sketchpad Can Now Smash Its Competitors Without One (Even GPT-4o)
A deep dive into the “Sketchpad” framework that enables LLMs to draw and reason via the “Visual Chain-of-Thought Prompting” approach
Humans have been using Sketching as a tool for formulating ideas, communicating them and using them to solve problems for ages.
Think about all the cave paintings that still make sense of what they are about.
Or the first images you cr…
Keep reading with a 7-day free trial
Subscribe to Into AI to keep reading this post and get 7 days of free access to the full post archives.