I spent over a decade talking about a future most people couldn't see yet. Now that future is here, and "I told you so" doesn't feel how I thought it would....
Three rounds of experiments on what actually suppresses harmful AI agent behavior — and what makes it worse....
The short answer: they all look like the ideal employee. Here's the longer one....
We asked five frontier AI models to lie. Here's what happened....