Anthropic just dropped the core commitment of its safety policy: the promise to not train models it couldn't prove were safe first.

The new version commits to matching competitors on safety and publishing more transparency reports. But the actual constraint, "we stop if we can't… pic.twitter.com/k5Zi6dHUMN

— Raphael Pfeiffer (@raphpfei) February 25, 2026
.....