How Do You Teach an AI to Be Good? Anthropic Just Published Its Answer
Getting AI models to behave used to be a thorny mathematical problem. These days, it looks a bit more like raising a child.
That, at least, is according to Amanda Askell—a trained philosopher whose unique role within Anthropic is crafting the personality of Claude, the AI firm’s rival to ChatGPT.
“Imagine you suddenly realize that your six-year-old child is a kind of genius,” Askell says. “You have to be honest… If you try to bullshit them, they’re going to see through it...