Anthropic hires former OpenAI safety lead to head up new team

May 28, 2024

Jan Leike, a leading AI researcher who resigned from OpenAI earlier this month and then publicly criticized the company's approach to AI safety, has joined OpenAI rival Anthropic to lead a new “superalignment” team.

In a post on X, Leike said his team at Anthropic will focus on various aspects of AI safety, specifically “scalable oversight,” “weak-to-strong generalization,” and automated alignment research.

I’m excited to join @AnthropicAI to continue the superalignment mission!

My new team will work on scalable oversight, weak-to-strong generalization, and automated alignment research.

If you're interested in joining, my DMs are open.

— Jan Leike (@janleike) May 28, 2024

A source familiar with the matter told TechCrunch that Leike will report directly to Jared Kaplan, Anthropic's chief science officer, and that Anthropic researchers currently working on scalable oversight – techniques for controlling the behavior of large-scale AI in predictable and desirable ways – will move to report to Leike as his team ramps up.

✨🪩 Yay! 🪩✨

Jan has led some foundational work on technical AI safety and I'm really excited to be working with him! We will lead twin teams tackling different parts of the problem of aligning AI systems at human level and beyond. https://t.co/aqSFTnOEG0

— Sam Bowman (@sleepinyourhat) May 28, 2024

In many ways, the mission of Leike's team resembles that of OpenAI's recently disbanded Superalignment team. The Superalignment team, which Leike co-led, had the ambitious goal of solving the core technical challenges of controlling superintelligent AI within the next four years, but it was often hamstrung by OpenAI's leadership.

Anthropic has often tried to position itself as more safety-focused than OpenAI.

Anthropic's CEO, Dario Amodei, was once VP of research at OpenAI and reportedly split with the company after a disagreement over its direction – namely, OpenAI's growing commercial focus. Amodei brought a number of former OpenAI employees with him to launch Anthropic, including OpenAI's former policy lead Jack Clark.
