Sakana AI's Collective AI: Smaller Models Outperform Industry Giants

So, picture this: a group of small fish swimming together in perfect harmony, each one contributing its unique skills to navigate through the ocean. That’s kinda what Sakana AI is doing in the world of artificial intelligence. This Tokyo-based startup is shaking things up by showing that smaller models can work together to tackle complex problems, rather than just relying on one massive model to do all the heavy lifting.

Founded in 2023 by some pretty impressive folks—David Ha and Llion Jones, who used to work at Google AI, and Ren Ito, the former COO of Stability AI—Sakana AI is inspired by nature. They’re taking cues from evolution and the collective intelligence of, you guessed it, schools of fish. Instead of the industry norm of building bigger and bigger models, they’re all about collaboration and efficiency.

The Magic Behind the Models

Now, let’s dive into the nitty-gritty of what makes Sakana AI’s approach so special. They’ve got two main tricks up their sleeves: Evolutionary Model Fusion and a collaborative technique for real-time problem-solving.

Evolutionary Model Fusion is like a matchmaking service for AI models. Imagine you’ve got a bunch of open-source models just sitting around, each with its own strengths and weaknesses. Sakana AI’s algorithm takes these models and combines them in a way that mimics natural selection. It’s like a reality show where the best models get to mate and create even better offspring.

For instance, they managed to create a Japanese-language model that’s not just good at language but also excels in math. How? By merging a Japanese model with an English one that was a math whiz. The result? A specialized model that can handle both languages and complex calculations without needing to start from scratch. Pretty cool, right?

But wait, there’s more! The second method, called Adaptive Branching Monte Carlo Tree Search (AB-MCTS), is all about teamwork. This algorithm allows different models to work together in real-time to solve a single problem. Think of it like a sports team where each player has a specific role. Some models are great at digging deep into a problem, while others are better at exploring new angles.

When they tested this approach on a tricky benchmark called the ARC (Abstraction and Reasoning Corpus), they found that the multi-model setup outperformed individual models by up to 30%. That’s like having a dream team of AI experts, each contributing their unique skills to achieve something greater than they could alone.

A Game Changer for the AI Industry

So, what does all this mean for the future of AI? Well, it’s kinda revolutionary. The trend in the industry has been to build these gigantic models that require massive resources—something only the big players can afford. But Sakana AI is showing that there’s a smarter, more efficient way to develop powerful AI.

By leveraging the collective intelligence of existing models, they’re not just cutting down on costs but also speeding up the process of creating specialized models tailored to specific needs. It’s like having a toolbox filled with all the right tools, rather than trying to make do with just one big hammer.

And here’s the kicker: their collaborative inference-time algorithm means businesses can get more bang for their buck from the AI they already have. Instead of overhauling everything, they can just make their existing models work together more intelligently. This could open the door for tackling complex problems that single models often struggle with.

The Road Ahead

Sakana AI isn’t just a flash in the pan. With a team that’s got deep roots in AI innovation—Llion Jones even co-authored the groundbreaking “Attention Is All You Need” paper that introduced the Transformer architecture—they’re quickly making a name for themselves. They recently secured around $200 million in Series A funding from big names like Khosla Ventures and NVIDIA, which will help them invest in talent and tech to keep pushing the envelope.

While they’re not looking to go head-to-head with giants like OpenAI, they’re carving out a niche by focusing on collective, multi-modal models. Their ambitious projects, like the “AI Scientist” that aims to automate the scientific research lifecycle, highlight their commitment to pushing the boundaries of what AI can do through collaboration and evolutionary principles.

In a world where bigger often seems better, Sakana AI is proving that sometimes, it’s the smaller, smarter fish that can swim circles around the giants. By showing that a swarm of specialized models can achieve more together, they’re paving the way for a more sustainable future in AI.

So, next time you think about AI, remember the little fish that could—and the big waves they’re making in the tech ocean!

AI Research | 7/8/2025

Sakana AI's Collective AI: Smaller Models Outperform Industry Giants

Sakana AI's Collective AI: Smaller Models Outperform Industry Giants

The Magic Behind the Models

A Game Changer for the AI Industry

The Road Ahead