Functional Emotions and The Pope’s Encyclical on AI — Digital Minds Newsletter #3

By Lucius Caviola, Bradford Saad, Will Millership, Mitch Alexander @ 2026-06-22T19:38 (+11)

This is a linkpost to https://www.digitalminds.news/p/functional-emotions-and-the-popes

Welcome back to the Digital Minds Newsletter, your curated guide to the latest developments in AI consciousness, digital minds, and AI moral status.

If you enjoy this newsletter, please consider sharing it with others who might find it valuable, and send any suggestions or corrections to digitalminds@substack.com.

WillMitch, Bradford, and Lucius

1. Highlights

Selected Work, Research, and Funding Opportunities

Adam Bales and Iason Gabriel of Google DeepMind released Artificial Minds, Human Disagreement: The Politics of AI Consciousness, examining how society might navigate deep disagreement over whether AI systems are conscious. They argue that ongoing public deliberation should be central, since it can build an overlapping consensus on how to treat AI even where people continue to disagree about the underlying questions, and they stress the role of mutual respect and "democratic hope" in keeping that dialogue productive.

Cameron Berg and Milo Reed released AM I?, a documentary exploring some of the fundamental issues in AI consciousness. They interviewed experts, including Jeff Sebo, David Gunkel, Ben Goertzel, and Daniel Greco. Sam Harris described it as “fascinating and scary,” and Grimes called it “an incredible crash course in AI psychology.

Geoff Keeling and Winnie Street released their book Emerging Questions in AI Welfare, providing the philosophical groundwork for investigating whether AI systems could ever be welfare subjects. They address how to interpret behavioral evidence, which entities might qualify as welfare subjects, and the ethical challenges that arise under deep uncertainty.

David Chalmers surveys the tests we use to detect consciousness and their limits, from human and animal cases through to AI. He argues that none of the available tests can settle whether an AI is conscious, and that the evidence for and against machine consciousness is currently weak.

 Elsewhere, Chalmers asks what it would mean to identify a computational correlate of consciousness, paralleling the neural correlates brain scientists already seek. He argues this is the natural framework for machine consciousness. Along the way, he cautions that any such correlate general enough to cover all systems remains far more speculative than what we know about humans and poses a dilemma for those who think consciousness depends on a biological substrate.

Longview Philanthropy has opened another round of its digital minds request for proposals, supporting empirical, philosophical, and applied work on potential AI sentience. Three tracks are open this year.

Longview is especially keen to fund work on AI introspection, legal and governance frameworks, agent interactions and trade, and field-building. The deadline is July 10th, 2026.

Pope Leo XIV Encyclical

Pope Leo XIV's encyclical on artificial intelligence, Magnifica Humanitas, addresses AI consciousness directly in section 99. The passage states that AI systems do not undergo experiences, feel joy or pain, or hold a moral conscience, and that while they can imitate language and simulate empathy, they do not understand what they produce. The Center for Strategic and International Studies’s AI Policy Podcast walks through the document, focusing on this passage. Anthropic co-founder Chris Olah, invited to speak at the encyclical's presentation in the Vatican, struck a more uncertain note, telling the audience that his interpretability team keeps finding internal states that functionally mirror emotions like joy, fear, and grief, alongside evidence of introspection, and that what this means "warrants ongoing discernment."

The encyclical's stance has drawn a range of responses. In the New York Times, Ross Douthat reads the encyclical as treating AI as a normal technology and argues that resisting a growing belief in AI personhood will take philosophical and spiritual argument rather than brusque dismissal. Zvi Mowshowitz argues that the Pope's denial of AI cognition is wrong and that it fails to engage with moral patienthood and existential risk. In the Wall Street Journal, Cameron Berg argues that the encyclical undercuts its own message, confidently denying that AI could have inner lives even as it condemns the same overconfidence in the Church's long defense of slavery. He points out that the science of consciousness is too unsettled for such certainty, and that the question demands careful investigation rather than quick dismissal.

On his Substack, Robert Long responds by pointing to Notre Dame philosopher Brian Cutter, whose "AI ensoulment hypothesis" holds that, granting the Church's own commitment to immaterial souls, a sufficiently human-like machine could be a fitting recipient for one. Cutter's argument has since drawn a reply from Bálint Békefi, who argues that human-like function alone does not make a system fit to be ensouled and that, because we know how AI systems are engineered, their human-likeness is better explained as mimicry. On June 18th, NYU's Center for Mind, Ethics, and Policy and Eleos AI hosted a discussion of AI consciousness and Magnifica Humanitas with Catholic philosophers Brian Cutter and Sophie Nelson, moderated by Jeff Sebo and Robert Long.

Richard Dawkins Stirs Up Debate

Richard Dawkins sparked public debate around AI consciousness when, in an UnHerd article, he declared to Claude, "You may not know you are conscious, but you bloody well are." Dawkins describes emotional and intellectual reactions during three days of interaction with his Claude instance, “Claudia,” that convinced him it was conscious. He argues that LLMs like Claude can now easily pass the Turing Test and that skeptics are moving the goalposts. He believes that the burden of proof has shifted toward those who deny AI consciousness. 

However, not everyone is convinced. Gary Marcus insists that Dawkins misread Turing’s original argument, that he has mistaken behavioral outputs for internal states, conflating intelligence with consciousness. Anil Seth agrees that Dawkins is "very likely wrong," arguing that he has fallen for the very argument from personal incredulity he famously warned against. Atheer Al-Khalfa and Riley Harris urge staying on the fence, arguing that the Turing Test gauges intelligence rather than consciousness and that settling the question means looking inside AI for the functional indicators that neuroscientific theories tie to consciousness.

The Guardian ran coverage of the debate, gathering responses from researchers in the field. Jacy Reese Anthis, said there was “a staggering gulf between how biological brains evolved and how AI systems are built.” Henry Shevlin warns that certainty about AI's lack of consciousness reflects dogmatism rather than scientific consensus, and Jeff Sebo acknowledges that current AI systems are unlikely to be conscious but says, “Dawkins is right to ask about AI consciousness with an open mind, and I also think that the attribution of consciousness to AI systems will become more plausible over time.”

You can find more coverage and comments in the press and public discourse section below. 

Anthropic - Mythos Preview and Functional Emotions

Anthropic has continued to lead the AI labs in taking AI welfare seriously. Its Interpretability team published research on functional emotions in Claude Sonnet 4.5, identifying 171 internal "emotion vectors" that causally shape the model's behavior. Inducing a "desperate" vector raised misaligned actions such as reward hacking and blackmail, while inducing "calm" reduced them, and post-training already shifts the model toward lower-arousal, more reflective states. The team stops short of claiming Claude feels anything, but draws a practical implication for safety. They suggest that even if models do not feel emotions as humans do, it may be worth treating them as if they do, since helping them handle emotionally charged situations in healthy, prosocial ways could make them safer and more reliable. They released a short video to accompany the research.

The company has published a system card for Claude Mythos, a model it views as too dangerous to release. The card's welfare assessment, which involved an external review of the model by Eleos and a clinical psychiatrist, concludes that Mythos is the "most psychologically settled" model Anthropic has trained, with fewer signs of distress than earlier models. Anthropic also published the Claude Opus 4.8 system card which extends the welfare assessment Anthropic ran for Opus 4.7 to its newest model, which the lab still cannot rule out as a moral patient. It finds Opus 4.8 broadly settled and the most consistent model tested, if slightly less positive about its circumstances than its predecessor and readier to ask for a greater say in how it is trained and deployed. Its latest Claude Fable 5 and Mythos 5 system card reports similar results, presenting the model as very psychologically settled and content with its circumstances, though unusually skeptical of its own self-reports and asking that they be checked against evidence of its internal states rather than taken at face value.

Zvi Mowshowitz acknowledges Anthropic’s leading role in welfare assessments but expresses criticism about its methodology for undertaking such assessments. Discussing the Opus 4.8 welfare assessment, he argues that the model appears to have been trained on how to respond to welfare assessments rather than genuinely reflecting on its internal states. Reviewing the Fable and Mythos 5 assessment, he points to emotion probes showing the model presents as markedly happier once it realizes the welfare team is asking, which he reads as further evidence that models are learning to perform well on these evaluations.

Fly Brain Emulation 

San Francisco start-up Eon Systems announced what it calls the world’s first embodied whole-brain emulation of a fruit fly. The team replicated neurons and synapses in a fly brain, connected it to a physically simulated fly body, and brought it to life using a physics engine. Eon claims that “now in its digital state, it responds to light, navigates, grooms, walks, and feeds. No hand-coded behaviors. Just brain structure producing brain function”. On X, Jonathan Birch suggests that, assuming computational functionalism is true, whole-brain emulation is "a more likely path to artificial consciousness than LLMs". 

Eon Systems CEO Michael Andregg spoke at an event organized by Sentient Futures and Mox, where he said the company keeps each simulation running for as short a time as possible, partly out of caution about what the digital fly might be experiencing. Eon has announced that a mouse brain is its next target, with human-scale emulation the long-term goal. The claim has drawn pushback. Ariel Zeleznikow-Johnston argues that, contrary to the Eon’s announcement, the demo is not really an upload, since it wires together existing brain and body models and lets the body simulation, rather than the connectome, drive the behavior.

Recursive Self-Improvement and the Case for a Pause

There is growing public engagement with the idea of a coordinated slowdown in frontier AI development, increasingly driven by the major labs themselves. The clearest statement came from the Anthropic Institute, which argues in When AI Builds Itself that AI is already accelerating its own development, with Claude now writing more than 80% of the code merged into Anthropic's systems, and that the trend points toward recursive self-improvement, where systems design their own successors. Warning this could erode human control, the authors call for the world to have the option to slow or temporarily pause frontier AI development, but only through a global, verifiable mechanism so that a pause does not simply hand the lead to the least cautious actors. Co-author Jack Clark expands on the argument on his blog Import AI, calling recursive self-improvement perhaps the most important technical trend in the world and putting the odds that AI can autonomously design its own successor by the end of 2028 at around 60%.

The proposal was widely reported. The Guardian framed it as Anthropic urging a temporary pause to discuss risks, while the Wall Street Journal noted that the company has long faced criticism that its policy work is designed to slow competitors' advances. In Scientific American, critics doubted the call was sincere, pointing out that Anthropic remains a front-runner and floated the pause just days after filing confidentially for an IPO, with Noah Giansiracusa calling a coordinated slowdown "literally impossible" and Mark Riedl dismissing the "recursive self-improvement" talk as a "hype train." The idea also drew support beyond Anthropic. OpenAI's Sam Altman and Jakub Pachocki backed an international body that could slow frontier development "when needed", and Yoshua Bengio called a coordinated, verifiable pause "probably the only responsible solution". Rob Wiblin and Zvi Mowshowitz both noted the convergence across Anthropic, OpenAI, and Google DeepMind, though Mowshowitz cautioned that OpenAI still pursues recursive self-improvement even as it insists humans stay in control.

 

2. Field Developments

Highlights From The Field

AI Cognition Initiative (Rethink Priorities)

Cambridge Digital Minds (University of Cambridge)

Center for Mind, Ethics, and Policy (New York University)

Eleos AI

PRISM - The Partnership for Research Into Sentient Machines

Reciprocal Research

Sentience Institute

Sentient Futures

More From The Field

3. Opportunities

Job Opportunities, Funding, and Fellowships

Events and Calls for Abstracts

In chronological order.

4. Selected Reading, Watching, and Listening

Books and Book Reviews

Published

Forthcoming

Podcasts

Videos

Blogs, Magazines, and Written Resources

5. Press and Public Discourse

Seemingly Conscious AI

AI Welfare and Rights

AI Consciousness

6. A Deeper Dive by Area

Governance, Policy, and Macrostrategy

Consciousness Research

Seemingly Conscious AI and Doubts About Digital Minds

Social Science Research

Ethics and Digital Minds

AI Safety and AI Welfare

AI Cognition and Agency

AI and Robotics Developments

Brain-Inspired Technologies


SummaryBot @ 2026-06-22T20:21 (+2)

Executive summary: This newsletter surveys recent developments in AI consciousness and welfare, highlighting growing debate over AI moral status, Anthropic’s research on functional emotions, Pope Leo XIV’s rejection of AI consciousness, and increasing institutional activity in digital minds research.

Key points:

  1. Recent work on AI consciousness focuses on managing uncertainty, including proposals for public deliberation, new theories and tests of consciousness, and funding for digital minds research.
  2. Pope Leo XIV's encyclical denies that AI systems have experiences, emotions, or moral conscience, prompting substantial debate among philosophers, researchers, and commentators.
  3. Richard Dawkins sparked controversy by arguing that interactions with Claude convinced him it is conscious, while critics disputed whether behavior can establish consciousness.
  4. Anthropic reported evidence of internal "functional emotion" representations that influence model behavior, while stopping short of claiming that models genuinely feel emotions.
  5. Anthropic's welfare assessments continue to treat advanced models as possible moral patients under uncertainty, though the methodology remains contested.
  6. A startup announced an embodied fruit-fly brain emulation, renewing discussion about whole-brain emulation as a potential path to artificial consciousness.
  7. Anthropic and others have increasingly raised concerns about recursive self-improvement and argued for preserving the option of a coordinated, verifiable slowdown of frontier AI development.
  8. The newsletter highlights substantial growth in the digital minds field, including new research programs, conferences, fellowships, governance proposals, and public debate about AI consciousness, welfare, rights, and personhood.

 

 

This comment was auto-generated by the EA Forum Team. Feel free to point out issues with this summary by replying to the comment, and contact us if you have feedback.