So8res

Posts

Quick takes on "AI is easy to control"
by So8res @ 2023-12-02 | –12 | 0 comments
Apocalypse insurance, and the hardline libertarian take on AI risk
by So8res @ 2023-11-28 | +21 | 0 comments
Ability to solve long-horizon tasks correlates with wanting things in the...
by So8res @ 2023-11-24 | +38 | 0 comments
Thoughts on the AI Safety Summit company policy requests and responses
by So8res @ 2023-10-31 | +42 | 0 comments
AI as a science, and three obstacles to alignment strategies
by So8res @ 2023-10-25 | +41 | 0 comments
But why would the AI kill us?
by So8res @ 2023-04-17 | +45 | 0 comments
Misgeneralization as a misnomer
by So8res @ 2023-04-06 | +48 | 0 comments
If interpretability research goes well, it may get dangerous
by So8res @ 2023-04-03 | +33 | 0 comments
Hooray for stepping out of the limelight
by So8res @ 2023-04-01 | +103 | 0 comments
A rough and incomplete review of some of John Wentworth's research
by So8res @ 2023-03-28 | +27 | 0 comments
A stylized dialogue on John Wentworth's claims about markets and optimization
by So8res @ 2023-03-25 | +18 | 0 comments
Truth and Advantage: Response to a draft of "AI safety seems hard to measure"
by So8res @ 2023-03-22 | +11 | 0 comments
Deep Deceptiveness
by So8res @ 2023-03-21 | +40 | 0 comments
Comments on OpenAI's "Planning for AGI and beyond"
by So8res @ 2023-03-03 | +115 | 0 comments
Enemies vs Malefactors
by So8res @ 2023-02-28 | +85 | 0 comments
AI alignment researchers don't (seem to) stack
by So8res @ 2023-02-21 | +47 | 0 comments
A personal reflection on SBF
by So8res @ 2023-02-07 | +321 | 0 comments
Focus on the places where you feel shocked everyone's dropping the ball
by So8res @ 2023-02-02 | +92 | 0 comments
Alignment is mostly about making cognition aimable at all
by So8res @ 2023-01-30 | +57 | 0 comments
Distinguishing test from training
by So8res @ 2022-11-29 | +27 | 0 comments
How could we know that an AGI system will have good consequences?
by So8res @ 2022-11-07 | +25 | 0 comments
Superintelligent AI is necessary for an amazing future, but far from sufficient
by So8res @ 2022-10-31 | +35 | 0 comments
Notes on "Can you control the past"
by So8res @ 2022-10-20 | +15 | 0 comments
Decision theory does not imply that we get to have nice things
by So8res @ 2022-10-18 | +36 | 0 comments
Contra shard theory, in the context of the diamond maximizer problem
by So8res @ 2022-10-13 | +27 | 0 comments
Niceness is unnatural
by So8res @ 2022-10-13 | +20 | 0 comments
Don't leave your fingerprints on the future
by So8res @ 2022-10-08 | +93 | 0 comments
What does it mean for an AGI to be 'safe'?
by So8res @ 2022-10-07 | +53 | 0 comments
Warning Shots Probably Wouldn't Change The Picture Much
by So8res @ 2022-10-06 | +93 | 0 comments
Humans aren't fitness maximizers
by So8res @ 2022-10-04 | +30 | 0 comments
Where I currently disagree with Ryan Greenblatt’s version of the ELK approach
by So8res @ 2022-09-29 | +21 | 0 comments
AGI ruin scenarios are likely (and disjunctive)
by So8res @ 2022-07-27 | +53 | 0 comments
Brainstorm of things that could force an AI team to burn their lead
by So8res @ 2022-07-25 | +26 | 0 comments
A note about differential technological development
by So8res @ 2022-07-24 | +58 | 0 comments
On how various plans miss the hard bits of the alignment challenge
by So8res @ 2022-07-12 | +125 | 0 comments
A central AI alignment problem: capabilities generalization, and the sharp left...
by So8res @ 2022-06-15 | +51 | 0 comments
Visible Thoughts Project and Bounty Announcement
by So8res @ 2021-11-30 | +35 | 0 comments
Altruistic Motivations
by So8res @ 2019-01-04 | +52 | 0 comments
Intro to caring about AI alignment as an EA cause
by So8res @ 2017-04-14 | +28 | 0 comments
MIRI Update and Fundraising Case
by So8res @ 2016-10-09 | +18 | 0 comments
Conclusion of the Replacing Guilt series
by So8res @ 2016-02-28 | +15 | 0 comments
How we will be measured
by So8res @ 2016-02-21 | +17 | 0 comments
Defiance
by So8res @ 2016-02-14 | +15 | 0 comments
Recklessness
by So8res @ 2016-02-02 | +13 | 0 comments
Desperation
by So8res @ 2016-01-24 | +14 | 0 comments
Confidence all the way up
by So8res @ 2016-01-17 | +16 | 0 comments
The art of response
by So8res @ 2016-01-03 | +13 | 0 comments
Obvious advice
by So8res @ 2015-12-06 | +21 | 0 comments
There is no try
by So8res @ 2015-11-29 | +13 | 0 comments
Stop trying to try and try
by So8res @ 2015-11-22 | +14 | 0 comments
Dark, not colorless
by So8res @ 2015-11-16 | +16 | 0 comments
The best you can
by So8res @ 2015-11-10 | +13 | 0 comments
Transmute guilt into resolve
by So8res @ 2015-11-01 | +11 | 0 comments
Come to your terms
by So8res @ 2015-10-26 | +14 | 0 comments
Have no excuses
by So8res @ 2015-10-19 | +15 | 0 comments
Simply locate yourself
by So8res @ 2015-10-11 | +15 | 0 comments
Detach the grim-o-meter
by So8res @ 2015-10-05 | +16 | 0 comments
Choose without suffering
by So8res @ 2015-09-27 | +14 | 0 comments
See the dark world
by So8res @ 2015-09-20 | +31 | 0 comments
Being unable to despair
by So8res @ 2015-09-13 | +16 | 0 comments
Residing in the mortal realm
by So8res @ 2015-09-06 | +15 | 0 comments
There are no "bad people"
by So8res @ 2015-08-30 | +19 | 0 comments
Self compassion
by So8res @ 2015-08-25 | +15 | 0 comments
Where coulds go
by So8res @ 2015-08-17 | +20 | 0 comments
Not yet gods
by So8res @ 2015-08-09 | +25 | 0 comments
2015 MIRI Summer Fundraiser: How We Could Scale
by So8res @ 2015-07-28 | +7 | 0 comments
Be a new homunculus
by So8res @ 2015-07-26 | +22 | 0 comments
Update from the suckerpunch
by So8res @ 2015-07-19 | +17 | 0 comments
Don't steer with guilt
by So8res @ 2015-07-13 | +15 | 0 comments
Shifting guilt
by So8res @ 2015-07-05 | +15 | 0 comments
Rest in motion
by So8res @ 2015-06-28 | +18 | 0 comments
Working yourself ragged is not a virtue
by So8res @ 2015-06-21 | +18 | 0 comments
I am Nate Soares, AMA!
by So8res @ 2015-06-10 | +18 | 0 comments
Your "shoulds" are not a duty
by So8res @ 2015-06-07 | +17 | 0 comments
Not because you "should"
by So8res @ 2015-05-31 | +15 | 0 comments
"Should" considered harmful
by So8res @ 2015-05-25 | +18 | 0 comments
You don't get to know what you're fighting for
by So8res @ 2015-05-17 | +20 | 0 comments
Caring about something larger than yourself
by So8res @ 2015-05-10 | +19 | 0 comments
You're allowed to fight for something
by So8res @ 2015-05-03 | +27 | 0 comments
The stamp collector
by So8res @ 2015-04-27 | +22 | 0 comments
Replacing guilt
by So8res @ 2015-04-19 | +20 | 0 comments
Failing with abandon
by So8res @ 2015-04-13 | +42 | 0 comments
Half-assing it with everything you've got
by So8res @ 2015-03-13 | +51 | 0 comments
The Value of a Life
by So8res @ 2015-02-17 | +38 | 0 comments
Moving towards the goal
by So8res @ 2014-12-07 | +18 | 0 comments
Productivity through self-loyalty
by So8res @ 2014-11-02 | +29 | 0 comments
Self-signaling the ability to do what you want
by So8res @ 2014-10-26 | +19 | 0 comments
On Caring
by So8res @ 2014-10-07 | +308 | 0 comments