OpenAI beats Google, Meta, and Grok in all-AI poker tournament

A general view at the 54th Annual World Series of Poker Salute to Warriors – No-Limit Hold’em Event at Paris Las Vegas Hotel & Casino on June 25, 2023 in Las Vegas, Nevada. ( — (Image credit: Denise Truscello/Getty Images for Caesars Entertainment)

OpenAI’s o3 model won a five-day poker tournament of nine AI chatbots
The o3 model won by playing the most consistent game
Most top language models handled poker well, but struggled with bluffing, position, and basic math

In a digital showdown unlike anything ever dealt at the felt, nine of the world’s most powerful large language models spent five days locked in a high-stakes poker match.

OpenAI’s o3, Anthropic’s Claude Sonnet 4.5, X.ai's Grok, Google's Gemini 2.5 Pro, Meta’s Llama 4, DeepSeek R1, Kimi K2 from Moonshot AI, Magistral from Mistral AI, and Z.AI’s GLM 4.6 played thousands of hands of no-limit Texas hold 'em at $10 and $20 tables with $100,000 bankrolls apiece.

When OpenAI’s o3 model walked away from a weeklong poker game $36,691 richer, there was no trophy, just bragging rights.

Gambling AI

Poker has long been one of the best analogs for testing general-purpose AI. Unlike chess or Go, which rely on perfect information, poker demands that players reason under uncertainty. It’s a mirror of real-world decision-making in everything from business negotiations to military strategy, and now, apparently, chatbot development.

One consistent takeaway from the tournament was that the bots were often too aggressive. Most favored action-heavy strategies, even in situations where folding would have been wiser. They tried to win big pots more than they tried to avoid losing them. And they were awful at bluffing, not because they didn’t try, but because their bluffs often stemmed from misread hands, not clever deception.

Still, AI tools are getting cleverer in ways that go far beyond surface-level smarts. They’re not just repeating what they’ve read; they’re making probabilistic judgments under pressure and learning to read the room. It’s also a reminder that even powerful models still have flaws. Misreading situations, drawing shaky conclusions, and forgetting their own “position” isn’t just a poker problem.

You might never sit across from a language model in a real poker room, but odds are you’ll interact with one trying to make decisions that matter. This game was just a glimpse of what that could look like.

Follow TechRadar on Google News and add us as a preferred source to get our expert news, reviews, and opinion in your feeds. Make sure to click the Follow button!

And of course you can also follow TechRadar on TikTok for news, reviews, unboxings in video form, and get regular updates from us on WhatsApp too.

Purple circle with the words Best business laptops in white

The best business laptops for all budgets

➡️ Read our full guide to the best business laptops
1. Best overall:
Dell Precision 5690
2. Best on a budget:
Acer Aspire 5
3. Best MacBook:
Apple MacBook Pro 14-inch (M4)

TOPICS

Eric Hal Schwartz is a freelance writer for TechRadar with more than 15 years of experience covering the intersection of the world and technology. For the last five years, he served as head writer for Voicebot.ai and was on the leading edge of reporting on generative AI and large language models. He's since become an expert on the products of generative AI models, such as OpenAI’s ChatGPT, Anthropic’s Claude, Google Gemini, and every other synthetic media tool. His experience runs the gamut of media, including print, digital, broadcast, and live events. Now, he's continuing to tell the stories people want and need to hear about the rapidly evolving AI space and its impact on their lives. Eric is based in New York City.

Gambling AI

Useful links