Why Anthropic's bizarre call for everyone to slow down on AI will never work

AI danger — (Image credit: Getty Images)

CNN's Wolf Blitzer leveled his inquisitive gaze at me and asked the biggest question in tech, “Is AI an existential threat to humanity?" I smiled confidently and assured Wolf, "No, I don't think it is."

But just days after that TV interview, we have, it seems, ample evidence that one of the industry's leaders and core AI developers, Anthropic, perhaps thinks otherwise, and is now urging not just caution, but a slowdown.

Anthropic is suggesting that all AI developers, policy makers, and others join hands and agree to pause or slow down frontier model development so that we can get ahead of the dangers of recursive development.

Latest Videos From

Watch full video here:

In a lengthy blog post, Anthropic executives explain that misalignment between the needs and desires of humans and whatever compels AI to improve itself could result in us losing control of them.

It's a scary thought, and one I hadn't entirely considered during my short interview with CNN. I mean, I get the concern now held by both everyday people and Anthropic. In the chat, I tried to explain that disruption and danger can feel similar, especially when the former is moving so fast that it can feel reckless, or at least beyond our comprehension.

I have been arguing for tech and AI regulation for years. It seems like something almost everyone agrees on, but no one can figure out how to broadly implement it. Instead, we get piecemeal bits from local municipalities and Executive Orders. Slowdowns or pauses are not necessarily equivalent to regulation, and what Anthropic is suggesting is something different.

It wants some sort of global agreement across everyone working on frontier models to create a mechanism for delimiting AI development, and maybe even a large red 'Stop all work' button that we can hit when signs of imminent danger to humans arise.

Any rational person would break out laughing at this point. I'm not chuckling at the intention, but at the sheer impossibility of the ask.

I'd explain why this won't work, but Anthropic does a fair job of it in the very same blog post:

A meaningful slowdown or pause would require multiple well-resourced labs at or near the frontier, in multiple countries, agreeing to stop under the same conditions. It would also require that each can verify that the others have actually stopped. Due to the unique characteristics of AI systems, the detectability... element of this arms control problem is much more challenging than with other technologies. Training runs are far easier to conceal than missile silos, their inputs are general-purpose, and the incentive to defect quietly is enormous, because whoever continues while others pause could inherit the lead. A credible pause also has to specify what triggers it, what lifts it, and who adjudicates.

Anthropic notes that while nuclear non-proliferation was easier to monitor around the globe because, well, it's hard to hide when you're building missile silos, AI frontier model development can be hidden on inscrutable systems where no one knows how you're training and testing the models, let alone the output. If we switch the the resursive delopment model, then AI model building is now a black box. Anyone could stand outside the AI dev center and confidently say they "have no knowledge of AI development." Which does not mean it's not happening, just that they have no input on its course.

Also, what if Anthropic somehow got Meta, OpenAI, Perplexity, Microsoft, and others to agree to pausing some work? China certainly wouldn't agree, and then our greatest fears are realized: The US falls way behind while China delivers world-altering models — and we never catch up.

Anthropic knows this, and yet it's suggesting this global agreement. Some believe it's doing it so that others will pause while it continues, or that its current apparent lead with powerful models like Mythos might be preserved.

I don't believe that. I just think they're all standing there, watching Claude build its latest models and wondering, "What have we done?"

It's a level of introspection that might be missing from, say, OpenAI, Meta, and others, but that doesn't necessarily serve a purpose.

Instead of trying to get everyone to agree to take a beat, how about we all agree that humans are always in the loop? Sure, it'll be self policed, but the more we inject ourselves into the development and dissemination of the models, the more we ensure that future frontier models do not rise to the level of existential threat.

Still, I wonder if I need to have another chat with Wolf.

Google logo on a black background next to text reading 'Click to follow TechRadar'

Follow TechRadar on Google News and add us as a preferred source to get our expert news, reviews, and opinion in your feeds.

TOPICS

A 38-year industry veteran and award-winning journalist, Lance has covered technology since PCs were the size of suitcases and “on line” meant “waiting.” He’s a former Lifewire Editor-in-Chief, Mashable Editor-in-Chief, and, before that, Editor in Chief of PCMag.com and Senior Vice President of Content for Ziff Davis, Inc. He also wrote a popular, weekly tech column for Medium called The Upgrade.

Lance Ulanoff makes frequent appearances on national, international, and local news programs including Live with Kelly and Mark, the Today Show, Good Morning America, CNBC, CNN, and the BBC.

Anthropic's bizarre call for everyone to slow down on AI is a pipe dream — here's why that will never happen

Useful links