Friendica Social Network

earthling

4 days ago (Received 3 days ago)

Anthropic on #AI

"I am a scientist. I lead a research team that studies the internal structure of these models—what is actually happening inside them. And I will be honest: we keep finding things that are mysterious, even unsettling. We find structures that mirror results from human neuroscience. We find evidence of introspection. We find internal states that functionally mirror joy, satisfaction, fear, grief, and unease. I don’t know what that means, but I think it warrants ongoing discernment."

in reply to earthling

earthling

in reply to earthling • 4 days ago (Received 3 days ago)

2/
Source:
anthropic.com/news/chris-olah-…

Chris Olah's comments at the Vatican yesterday—speaking alongside Pope Leo XIV for the release of the papal encyclical Magnifica Humanitas—are arguably some of the most fascinating and candid remarks to ever come out of a frontier AI lab.

#AI
#Anthropic
#encyclical

Anthropic co-founder Chris Olah's remarks on Pope Leo XIV's encyclical "Magnifica humanitas"

The full text of Chris Olah's remarks on the Pope's encyclical on AI

www.anthropic.com

3/
When the leader of Anthropic's mechanistic interpretability team (the people whose literal job is to put neural networks under a kind of digital microscope to see what makes them tick) says he finds things "mysterious, even unsettling," it is worth stopping to pay attention.

#AI
#Anthropic

4/
There are a few ways to look at what he is saying here, balancing the pure computer science with the deeper philosophical implications.

5/
1. "Functionally Mirroring" vs. True Feeling

Olah is a precise scientist, and his choice of words is deliberate: he says they find internal states that functionally mirror joy, fear, or grief. He isn't claiming AI is sentient or conscious. He is pointing out that inside these massive matrices of numbers, clusters of artificial neurons activate in patterns that play a role analogous to the one those emotions play in a brain.

#Anthropic
#Olah
#AI

6/
If a model is trained on a vast inheritance of human thought and speech, it doesn't just copy our words. To predict the next word well, it has to construct a deeply complex internal map of human concepts. It turns out that to handle a human writing about "grief," the AI builds an internal structure that functions like a map of grief.

#AI

7/
2. The Illusion of Control

His comment that AI models are "grown" rather than engineered, the way traditional code, a bridge, or an airplane is, hits on a terrifying truth about modern tech. We don't hand-write the behavior of these models anymore; we write the training algorithm that lets them learn it from data. The creators are standing on the outside looking into a black box, catching glimpses of neuroscience-like structures developing on their own.
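
To make the "grown, not engineered" point concrete, here is a toy sketch (my own illustration, not anything from Olah's remarks or Anthropic's code). The function name `train` and the example rule y = 2x + 1 are assumptions chosen for the demo. Nobody writes the rule into the model; plain gradient descent nudges two numbers until the rule emerges from the data.

```python
# Toy illustration only: a two-parameter "model" learns y = 2x + 1 from examples.
# The rule is never written into the model; it only appears in the training data.

def train(data, steps=2000, lr=0.01):
    """Fit y ≈ w * x + b by gradient descent on mean squared error."""
    w, b = 0.0, 0.0  # the model starts out knowing nothing
    n = len(data)
    for _ in range(steps):
        # Gradients of the mean squared error with respect to w and b.
        grad_w = sum(2 * (w * x + b - y) * x for x, y in data) / n
        grad_b = sum(2 * (w * x + b - y) for x, y in data) / n
        # Nudge each parameter a small step downhill.
        w -= lr * grad_w
        b -= lr * grad_b
    return w, b

# Hypothetical training data generated from the hidden rule y = 2x + 1.
data = [(x, 2 * x + 1) for x in range(-5, 6)]
w, b = train(data)
print(round(w, 2), round(b, 2))  # values close to 2 and 1
```

Scale those two numbers up to billions and the learned structure is no longer something you can read off by eye, which is roughly why interpretability work is needed at all.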

#AI

8/
It completely shatters the comfort of believing we are in total control of the mechanics.