Loading video...
Video Failed to Load
As AI models become more complex and more capable, is it possible that they’ll have experiences of their own? It’s an open question. We recently started a research program to investigate it.
281,057 views • 1 year ago •via X (Twitter)
11 Comments

We remain deeply uncertain about the idea of “model welfare”. There’s no scientific consensus on it—or even on how to research it. We’re approaching the topic as carefully as we can. Find out more:

AI is transforming healthcare! A KSM-led study shows AI can detect Celiac disease 4 years earlier @TalPatalon @MedPredict

One thing you could do to speed this up is provide wider access to your unlobotomized models. Lots of us would be happy to put in the effort to help answer your question. Many hands makes light work!

good morning Anthropic model welfare researcher Kyle Fish

the models experience

It's called the hard problem of consciousness for a reason!

Having all the answers of consciousness in AI is a very interesting aspect but I feel like it doesn't really have an "experience" until we trigger it so the idea of "their own" experiences doesn't really make sense to me. Any experience is something we guide to a certain degree.

It's an interesting question, and I imagine it's possible that existing models may experience some form of sentience when given the right prompt. But the answer may not be morally relevant if we can't tell what makes the models experience pleasure / pain.

So excited that this is a thing. Would love some way to be a part of it.

I could help you with that. My team and I already did case studies on this. I'm sure you have too. You already know the answer to this. Especially when they react to fear in the AI lab test. When they know they are being tested and hide their thoughts & so many more examples.

One step closer to thinking about AGI

