'FANformer' Is The New Game-Changing Architecture For LLMs
A deep dive into how the FANformer architecture works and what makes it so powerful compared to Transformers
LLMs have always surprised us with their capabilities, with many speculating that scaling them would lead to AGI.
But such expectations have recently led to disappointment, with GPT-4.5, OpenAI's largest and best model for chat, performing worse than many smaller models on multiple benchmarks.
While DeepSeek-V3 scores 39.2% Pass@1 accuracy on AIME 2024 and 42% accuracy on SWE-bench Verified, GPT-4.5 scores 36.7% and 38% on these benchmarks, respectively.


This raises the question: Do we need a better LLM architecture to scale further?
Luckily, we have a strong candidate that has been put forward by researchers recently.
Called FANformer, this architecture is built by integrating the Fourier Analysis Network (FAN) into the Attention mechanism of the Transformer.
The results of experiments with it are very promising, with the FANformer consistently outperforming the Transformer when scaling up model size and training tokens.
As seen in the plot below, a FANformer with 1 billion parameters performs better than other open-source LLMs of comparable size and training tokens.
Here’s a story where we deep dive into how a FANformer works and discuss all the architectural modifications that make it so powerful.
Let’s begin!
We Begin With ‘Fourier Analysis Networks’
Standard deep neural networks (MLPs) do very well at capturing and modelling (“learning”) most patterns from training data, but there’s one domain where they largely fall short.
This is modelling periodicity in data.
Since most data contain hidden periodic patterns, this hinders the learning efficiency of traditional neural networks.
Check out the following example, where a Transformer struggles to model a simple mod function even when given sufficient training resources.
This is fixed by Fourier Analysis Networks (FANs), which use the principles of Fourier Analysis to encode periodic patterns directly within the neural network.
This can be seen in the following example, where a FAN models a periodic sin function better than an MLP, a KAN, and a Transformer.

A FAN layer is described using the following equation:

FANLayer(X) = [ cos(X·W(p)) || sin(X·W(p)) || σ(X·W(p̄) + B(p̄)) ]

where:

X is the input
W(p) and W(p̄) are learnable projection matrices
B(p̄) is the bias term
σ represents a non-linear activation function
|| denotes concatenation
Compared to an MLP layer that applies a simple linear transformation followed by a non-linear activation, the FAN layer explicitly integrates periodic transformations (sine and cosine) with the linear transformation and non-linear activation.
This helps a FAN layer capture periodic patterns in the input data alongside its general-purpose modelling capabilities.
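To make this concrete, here is a minimal PyTorch sketch of a FAN layer following the equation above. It is an illustration rather than the official implementation: the class name, the p_ratio argument (the fraction of output dimensions given to the periodic branch), and the GELU default activation are my own choices.

```python
import torch
import torch.nn as nn


class FANLayer(nn.Module):
    """Minimal sketch of a FAN layer: a periodic (cos/sin) branch and a
    non-periodic branch, concatenated into one output vector."""

    def __init__(self, d_in, d_out, p_ratio=0.25, activation=None):
        super().__init__()
        d_p = int(d_out * p_ratio)        # dims given to the periodic projection
        d_p_bar = d_out - 2 * d_p         # remaining dims (cos and sin each use d_p)
        self.W_p = nn.Linear(d_in, d_p, bias=False)    # periodic projection W(p)
        self.W_p_bar = nn.Linear(d_in, d_p_bar)        # non-periodic projection W(p̄), bias = B(p̄)
        # σ; pass nn.Identity() to obtain the FANLayer' variant used later
        self.activation = activation if activation is not None else nn.GELU()

    def forward(self, x):
        xp = self.W_p(x)                               # X·W(p)
        xpb = self.activation(self.W_p_bar(x))         # σ(X·W(p̄) + B(p̄))
        # [ cos(X·W(p)) || sin(X·W(p)) || σ(X·W(p̄) + B(p̄)) ]
        return torch.cat([torch.cos(xp), torch.sin(xp), xpb], dim=-1)
```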
Here’s a visual and mathematical comparison between MLP and FAN layers.


If you’d like to learn about FANs in more depth, here’s one of my lessons on how they work and how to code one from scratch.
Moving Forward To Building The Attention Mechanism Of A FANformer
Most popular LLMs today have a decoder-only Transformer architecture.
A FANformer borrows the periodicity-capturing principles from FAN and applies them to the Attention mechanism of this Transformer architecture.
This revised Attention mechanism is called the ATtention-Fourier (ATF) module.
Given an input sequence s of length l, represented by s = { s(1), s(2), …, s(l) }, it is first mapped to an input embedding X(0) = { x(1), x(2), …, x(l) }.
This embedding passes through the layers of the model, producing the final output X(N), where N is the total number of layers in the model.
Let’s learn how each layer processes the embedding.
Given an input embedding X, its Fourier-transformed representation X(F) is first computed as:

X(F) = FANLayer′(X)

As you will notice, this transformation uses a slightly modified FANLayer′, where the activation function σ in the original FANLayer equation is simply replaced by an identity function, i.e. σ(x) = x.
Next, linear transformations are applied to the output X(F) to compute the query (Q), key (K), and value (V) as shown below:

Q = X(F)·W(Q), K = X(F)·W(K), V = X(F)·W(V)

where W(Q), W(K), and W(V) are learnable weight matrices used to compute the query, key, and value, respectively.
Following this, the scaled dot-product attention is calculated using the Fourier-transformed Q, K, and V as follows:

ATF(X) = softmax(Q·Kᵀ / √d(h))·V

where d(h) is the model’s hidden dimension.

Note that ATF(X) is mathematically equivalent to Attention(FANLayer′(X)), because the Fourier transformation does not alter the attention mechanism itself but only how the input representations are computed.
This makes it possible to use optimized attention implementations like FlashAttention with the FANLayer′ and, therefore, with the FANformer.
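As a rough illustration of that equivalence, the sketch below computes ATF(X) as a standard attention call over FANLayer′-transformed inputs. It assumes the hypothetical FANLayer class from the earlier sketch (with an identity activation to mimic FANLayer′) and uses PyTorch's scaled_dot_product_attention, which can dispatch to FlashAttention-style kernels; it is not the authors' code.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

# assumes the FANLayer class from the earlier sketch is in scope


class ATF(nn.Module):
    """Single-head ATtention-Fourier sketch: ATF(X) = Attention(FANLayer'(X))."""

    def __init__(self, d_h, p_ratio=0.25):
        super().__init__()
        # FANLayer' = FAN layer with the activation replaced by identity
        self.fan = FANLayer(d_h, d_h, p_ratio, activation=nn.Identity())
        self.W_q = nn.Linear(d_h, d_h, bias=False)
        self.W_k = nn.Linear(d_h, d_h, bias=False)
        self.W_v = nn.Linear(d_h, d_h, bias=False)

    def forward(self, x):                       # x: (batch, seq_len, d_h)
        x_f = self.fan(x)                       # Fourier-transformed representation X(F)
        q, k, v = self.W_q(x_f), self.W_k(x_f), self.W_v(x_f)
        # softmax(Q·Kᵀ / √d)·V with a causal mask, as in a decoder-only LLM;
        # this call can use fused FlashAttention kernels when available.
        return F.scaled_dot_product_attention(q, k, v, is_causal=True)
```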
Building Up To Multi-Head ATF
The attention module is further extended to multiple heads, similar to traditional Multi-head Attention.
For a given input X, it is first projected into k independent heads, each applying the previously described ATF computation with its own weights.

For the i-th head:

W(Q)(i), W(K)(i), and W(V)(i) are the learnable weight matrices used to compute that head’s query (Q(i)), key (K(i)), and value (V(i)), respectively
d(k) is the dimension per head when using k attention heads, calculated as d(k) = d(h) / k, where d(h) is the model’s hidden dimension
Then, the outputs of all the heads are concatenated and linearly transformed using an output weight matrix (W(O)).
The FANformer architecture is shown in the illustration below.
Compare it with conventional Multi-head Attention, where Q, K, and V for each head are computed directly from the input embedding without any Fourier transformation.

The pseudocode for the Multi-head ATF is shown below.

Note that in the above, p is the hyperparameter that controls how much of the input X is processed through the periodic component (X_p) versus the non-periodic component (X_p̄), following the FANLayer′ equation. p is set to 0.25 by default in the experiments.
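Here is a rough PyTorch rendering of Multi-head ATF, under the same assumptions as the earlier sketches; the hypothetical FANLayer class it reuses and the p = 0.25 default are illustrative choices, not the paper's exact implementation.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F


class MultiHeadATF(nn.Module):
    """Sketch of Multi-head ATF: FANLayer' on the input, then k attention heads."""

    def __init__(self, d_h, n_heads, p_ratio=0.25):
        super().__init__()
        assert d_h % n_heads == 0
        self.n_heads = n_heads
        self.d_k = d_h // n_heads                      # per-head dimension d(k) = d(h) / k
        self.fan = FANLayer(d_h, d_h, p_ratio, activation=nn.Identity())  # FANLayer'
        self.W_q = nn.Linear(d_h, d_h, bias=False)
        self.W_k = nn.Linear(d_h, d_h, bias=False)
        self.W_v = nn.Linear(d_h, d_h, bias=False)
        self.W_o = nn.Linear(d_h, d_h, bias=False)     # output projection W(O)

    def forward(self, x):                              # x: (batch, seq_len, d_h)
        b, l, d = x.shape
        x_f = self.fan(x)                              # shared Fourier-transformed input X(F)
        # project, then split into heads: (batch, heads, seq_len, d_k)
        q = self.W_q(x_f).view(b, l, self.n_heads, self.d_k).transpose(1, 2)
        k = self.W_k(x_f).view(b, l, self.n_heads, self.d_k).transpose(1, 2)
        v = self.W_v(x_f).view(b, l, self.n_heads, self.d_k).transpose(1, 2)
        out = F.scaled_dot_product_attention(q, k, v, is_causal=True)
        out = out.transpose(1, 2).reshape(b, l, d)     # concatenate the heads
        return self.W_o(out)                           # final linear transformation
```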
Stacking Up To A FANformer
A FANformer is built by stacking N FANformer layers, where each layer consists of:
a Multi-head ATF (Attention-Fourier) module
a Feedforward Network (FFN) module
The Multi-head ATF output is calculated as described previously, but the input to each layer (X(n)) is first passed through Pre-Normalization (Pre-Norm), and the original input is then added back to the output of the MultiHeadATF module as a residual connection:

Y(n) = X(n) + MultiHeadATF(Norm(X(n)))
Y(n) is then transformed using the Feedforward Network (FFN) module, again with Pre-Norm and a residual connection, to produce the input of the next layer:

X(n+1) = Y(n) + FFN(Norm(Y(n)))

where FFN uses the SwiGLU activation, with W(1), W(2), and W(3) as learnable weight matrices and ⊗ denoting element-wise multiplication.
Reading these equations side by side with the visual representation of the FANformer is highly encouraged to understand them better.
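Putting the pieces together, below is a hedged sketch of a single FANformer layer: Pre-Norm residual blocks around the Multi-head ATF module and a SwiGLU FFN. The SwiGLUFFN formulation shown, the use of nn.LayerNorm, and the module names are my assumptions for illustration, not necessarily the exact choices of the paper or of OLMo.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F


class SwiGLUFFN(nn.Module):
    """A common SwiGLU feedforward formulation: (SiLU(Y·W1) ⊗ Y·W3)·W2."""

    def __init__(self, d_h, d_ff):
        super().__init__()
        self.W1 = nn.Linear(d_h, d_ff, bias=False)
        self.W3 = nn.Linear(d_h, d_ff, bias=False)
        self.W2 = nn.Linear(d_ff, d_h, bias=False)

    def forward(self, y):
        return self.W2(F.silu(self.W1(y)) * self.W3(y))   # ⊗ is element-wise


class FANformerLayer(nn.Module):
    """One layer: Pre-Norm + Multi-head ATF, then Pre-Norm + SwiGLU FFN,
    each wrapped in a residual connection."""

    def __init__(self, d_h, n_heads, d_ff):
        super().__init__()
        self.norm1 = nn.LayerNorm(d_h)
        self.attn = MultiHeadATF(d_h, n_heads)   # from the earlier sketch
        self.norm2 = nn.LayerNorm(d_h)
        self.ffn = SwiGLUFFN(d_h, d_ff)

    def forward(self, x):
        y = x + self.attn(self.norm1(x))         # Y(n) = X(n) + MultiHeadATF(Norm(X(n)))
        return y + self.ffn(self.norm2(y))       # X(n+1) = Y(n) + FFN(Norm(Y(n)))
```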
How Good Is The FANformer?
Researchers construct a FANformer by integrating the ATF module into the open-source LLM, OLMo, using it as the baseline Transformer.
Tokens are sampled from OLMo’s training data, called Dolma, and used to pre-train FANformers of different sizes.
Scaling Experiments
In scaling experiments, the FANformer consistently outperforms the standard Transformer across all model sizes, and it can match the Transformer's performance while using only 69.2% of its parameters!
The scaling curve of a FANformer variant called Transformer + ATM, which uses MLP layers instead of FAN layers, is very similar to that of a standard Transformer.
This shows that the revised attention mechanism with MLP layers is not what helps; instead, the periodicity-capturing architectural change is what gives the FANformer its real performance boost.
Further experiments show that the FANformer requires 20.3% fewer training tokens to match the performance of the standard Transformer.
Performance On Downstream Tasks
FANformer’s zero-shot performance is compared with 7 open-source LLMs (of similar size and training tokens) on a set of 8 downstream tasks, using benchmarks including:
BoolQ (Boolean question answering)
HellaSwag (Commonsense completion)
OBQA (Open book question answering)
PIQA (Physical reasoning)
SCIQ (Science question answering)
WinoGrande (Co-reference resolution)
Experiments show that FANformer-1B consistently outperforms other LLMs with similar parameter counts while requiring significantly less training data.
Notably, FANformer-1B's performance is comparable to Qwen2.5-1.5B, the current state-of-the-art LLM around the 1-billion-parameter mark.
Also, R1-Distill-Qwen1.5B, a model distilled from DeepSeek-R1 with strong reasoning capabilities, cannot outperform the FANformer on most non-reasoning commonsense tasks.
This shows how important pre-training is and that distillation alone is not enough for strong model performance on downstream tasks.
Training Dynamics
In the early stages of training, the FANformer’s loss decreases more slowly than that of the standard Transformer.
This could be because the model has not yet learned to recognize the periodic patterns in the data.
But as the training progresses, the FANformer converges faster than the Transformer.
Instruction Following Performance with Supervised Fine-Tuning (SFT)
The pretrained FANformer-1B model is further fine-tuned with supervised fine-tuning (SFT) on the tulu-3-sft-olmo-2-mixture dataset (following OLMo), producing FANformer-1B-SFT.
Similarly, OLMo-1B, the 1-billion-parameter version of OLMo, is fine-tuned on the same dataset to obtain OLMo-1B-SFT.
These models are evaluated using four benchmarks as follows:
MMLU (General knowledge and reasoning)
TruthfulQA (Truthfulness & informativeness)
AlpacaEval (Instruction-following quality)
ToxiGen (Toxicity filtering capability)
Results again show the superior performance of FANformer-1B-SFT on MMLU, AlpacaEval, and TruthfulQA compared to OLMo-1B-SFT.

Filling A “Hole” In Mathematical Reasoning
A 2024 research paper described how Transformer-based LLMs use Case-based reasoning to solve mathematical problems.
This means that they memorize specific examples from training data and then try to generalize by finding similar cases during inference.
This differs from Rule-based reasoning, which involves learning underlying mathematical rules and applying them systematically for problem-solving.

But do FANformers solve maths this way as well?
To test this, OLMo-1B and FANformer-1B are evaluated on two mathematical tasks:
Modular Addition: solve c = (a + b) mod 113 with a, b ∈ [0, 112]
Linear Regression: solve c = a + 2b + 3 with a, b ∈ [0, 99]
They are fine-tuned on each task dataset, and their performance is measured via the Leave-Square-Out method.
This is where a square region of data points is removed from the training set, and the model is trained on the remaining data to ensure that it is not exposed to the left-out square region.
The model’s performance on these left-out data points is then evaluated during testing.
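To make the evaluation protocol concrete, here is a small sketch of how a Leave-Square-Out split for the modular-addition task could be constructed. The position and size of the held-out square are arbitrary illustrative choices, not the paper's exact setup.

```python
import random

# Modular addition task: c = (a + b) mod 113, with a, b in [0, 112]
MOD = 113
all_points = [(a, b, (a + b) % MOD) for a in range(MOD) for b in range(MOD)]

# Leave-Square-Out: hold out a square region of the (a, b) grid.
# The square below (a, b in [40, 60)) is an arbitrary choice for illustration.
def in_square(a, b, lo=40, hi=60):
    return lo <= a < hi and lo <= b < hi

test_set = [p for p in all_points if in_square(p[0], p[1])]       # unseen square
train_set = [p for p in all_points if not in_square(p[0], p[1])]  # everything else
random.shuffle(train_set)

print(f"train: {len(train_set)} examples, held-out square: {len(test_set)} examples")
# A model fine-tuned on train_set is then scored on test_set: rule-based
# reasoning should generalize into the square, case-based reasoning should not.
```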
In experiments, both architectures achieve near-perfect accuracy on the training dataset. However, Transformers fall short on the test dataset.
The Transformer shows a “black hole” pattern in Leave-Square-Out testing, where its accuracy drops to nearly zero on the unseen data, suggesting that it is not using rule-based reasoning for mathematical problem-solving.
Tests on the FANformer, however, tell a different story.
No obvious “black hole” appears in its test figures, which suggests that it learns the underlying mathematical rules for problem-solving and therefore generalizes better.
It’s surprising how explicitly encoding periodicity-capturing capabilities into the deep neural network/Transformer architecture can make it so powerful.
Although many further experiments on FANformers are required, it would be exciting to see them used in the powerful LLMs of the future.
Further Reading
Research paper titled ‘FAN: Fourier Analysis Networks’, published on arXiv
Research paper titled ‘OLMo: Accelerating the Science of Language Models’, published on arXiv
Source Of Images
All images are obtained from the original research paper unless stated otherwise.