From Therapy Tool to Alignment Puzzle-Piece: Introducing the VSPE Framework

By Astelle Kay @ 2025-06-18T14:47 (+6)

Hi EA Forum! 👋
I’m Astelle Kay, a counseling-psych grad student who moonlights in alignment whenever coursework (and caffeine) allow. Most of my brain lives where clinical psychology, systems thinking, and “please-let-humanity-stick-around” concerns intersect.

TL;DR

From couch to compute cluster

Why this might matter

How you can stress-test or support

“Psychology and AI share a flaw: both love telling us exactly what we want to hear.”
- sticky note above my desk

My hope: VSPE nudges future models toward frank, human-centred dialogue, first in micro-benchmarks and later (if it survives) in training loops.

Curious, sceptical, or just chasing cross-disciplinary rabbit holes? Drop a comment or DM. I’ll post code, data, and inevitable blooper reels as the project unfolds. More context at vspeframework.com.

With care,
Astelle

(Manifund pilot)

This work is shared for educational and research purposes. For licensing, citation, or collaboration inquiries (especially for commercial or model development use), please contact Astelle Kay at astellekay@gmail.com.


Astelle Kay @ 2025-06-18T07:34 (+1)

Related work: Varma & Beitman (2025) recently proposed a CBT-style “therapy loop” prompt to curb hallucinations. VSPE targets the complementary issue of flattery; our benchmark will include the therapy loop as a baseline for comparison.