So8res

Quick takes on "AI is easy to control"
by So8res @ 2023-12-02 | –12 | 0 comments

Apocalypse insurance, and the hardline libertarian take on AI risk
by So8res @ 2023-11-28 | +21 | 0 comments

Ability to solve long-horizon tasks correlates with wanting things in the...
by So8res @ 2023-11-24 | +38 | 0 comments

Thoughts on the AI Safety Summit company policy requests and responses
by So8res @ 2023-10-31 | +42 | 0 comments

AI as a science, and three obstacles to alignment strategies
by So8res @ 2023-10-25 | +41 | 0 comments

But why would the AI kill us?
by So8res @ 2023-04-17 | +45 | 0 comments

Misgeneralization as a misnomer
by So8res @ 2023-04-06 | +48 | 0 comments

If interpretability research goes well, it may get dangerous
by So8res @ 2023-04-03 | +33 | 0 comments

Hooray for stepping out of the limelight
by So8res @ 2023-04-01 | +103 | 0 comments

A rough and incomplete review of some of John Wentworth's research
by So8res @ 2023-03-28 | +27 | 0 comments

A stylized dialogue on John Wentworth's claims about markets and optimization
by So8res @ 2023-03-25 | +18 | 0 comments

Truth and Advantage: Response to a draft of "AI safety seems hard to measure"
by So8res @ 2023-03-22 | +11 | 0 comments

Deep Deceptiveness
by So8res @ 2023-03-21 | +40 | 0 comments

Comments on OpenAI's "Planning for AGI and beyond"
by So8res @ 2023-03-03 | +115 | 0 comments

Enemies vs Malefactors
by So8res @ 2023-02-28 | +85 | 0 comments

AI alignment researchers don't (seem to) stack
by So8res @ 2023-02-21 | +47 | 0 comments

A personal reflection on SBF
by So8res @ 2023-02-07 | +321 | 0 comments

Focus on the places where you feel shocked everyone's dropping the ball
by So8res @ 2023-02-02 | +92 | 0 comments

What I mean by "alignment is in large part about making cognition aimable at...
by So8res @ 2023-01-30 | +57 | 0 comments

Distinguishing test from training
by So8res @ 2022-11-29 | +27 | 0 comments

How could we know that an AGI system will have good consequences?
by So8res @ 2022-11-07 | +25 | 0 comments

Superintelligent AI is necessary for an amazing future, but far from sufficient
by So8res @ 2022-10-31 | +35 | 0 comments

Notes on "Can you control the past"
by So8res @ 2022-10-20 | +15 | 0 comments

Decision theory does not imply that we get to have nice things
by So8res @ 2022-10-18 | +36 | 0 comments

Contra shard theory, in the context of the diamond maximizer problem
by So8res @ 2022-10-13 | +27 | 0 comments

Niceness is unnatural
by So8res @ 2022-10-13 | +20 | 0 comments

Don't leave your fingerprints on the future
by So8res @ 2022-10-08 | +95 | 0 comments

What does it mean for an AGI to be 'safe'?
by So8res @ 2022-10-07 | +53 | 0 comments

Warning Shots Probably Wouldn't Change The Picture Much
by So8res @ 2022-10-06 | +95 | 0 comments

Humans aren't fitness maximizers
by So8res @ 2022-10-04 | +30 | 0 comments

Where I currently disagree with Ryan Greenblatt’s version of the ELK approach
by So8res @ 2022-09-29 | +21 | 0 comments

AGI ruin scenarios are likely (and disjunctive)
by So8res @ 2022-07-27 | +54 | 0 comments

Brainstorm of things that could force an AI team to burn their lead
by So8res @ 2022-07-25 | +26 | 0 comments

A note about differential technological development
by So8res @ 2022-07-24 | +58 | 0 comments

On how various plans miss the hard bits of the alignment challenge
by So8res @ 2022-07-12 | +126 | 0 comments

A central AI alignment problem: capabilities generalization, and the sharp left...
by So8res @ 2022-06-15 | +53 | 0 comments

Visible Thoughts Project and Bounty Announcement
by So8res @ 2021-11-30 | +35 | 0 comments

Altruistic Motivations
by So8res @ 2019-01-04 | +54 | 0 comments

Intro to caring about AI alignment as an EA cause
by So8res @ 2017-04-14 | +28 | 0 comments

MIRI Update and Fundraising Case
by So8res @ 2016-10-09 | +18 | 0 comments

Conclusion of the Replacing Guilt series
by So8res @ 2016-02-28 | +15 | 0 comments

How we will be measured
by So8res @ 2016-02-21 | +17 | 0 comments

Defiance
by So8res @ 2016-02-14 | +15 | 0 comments

Recklessness
by So8res @ 2016-02-02 | +13 | 0 comments

Desperation
by So8res @ 2016-01-24 | +14 | 0 comments

Confidence all the way up
by So8res @ 2016-01-17 | +16 | 0 comments

The art of response
by So8res @ 2016-01-03 | +13 | 0 comments

Obvious advice
by So8res @ 2015-12-06 | +21 | 0 comments

There is no try
by So8res @ 2015-11-29 | +13 | 0 comments

Stop trying to try and try
by So8res @ 2015-11-22 | +15 | 0 comments

Dark, not colorless
by So8res @ 2015-11-16 | +16 | 0 comments

The best you can
by So8res @ 2015-11-10 | +13 | 0 comments

Transmute guilt into resolve
by So8res @ 2015-11-01 | +11 | 0 comments

Come to your terms
by So8res @ 2015-10-26 | +14 | 0 comments

Have no excuses
by So8res @ 2015-10-19 | +15 | 0 comments

Simply locate yourself
by So8res @ 2015-10-11 | +15 | 0 comments

Detach the grim-o-meter
by So8res @ 2015-10-05 | +16 | 0 comments

Choose without suffering
by So8res @ 2015-09-27 | +14 | 0 comments

See the dark world
by So8res @ 2015-09-20 | +31 | 0 comments

Being unable to despair
by So8res @ 2015-09-13 | +16 | 0 comments

Residing in the mortal realm
by So8res @ 2015-09-06 | +15 | 0 comments

There are no "bad people"
by So8res @ 2015-08-30 | +20 | 0 comments

Self compassion
by So8res @ 2015-08-25 | +15 | 0 comments

Where coulds go
by So8res @ 2015-08-17 | +20 | 0 comments

Not yet gods
by So8res @ 2015-08-09 | +25 | 0 comments

2015 MIRI Summer Fundraiser: How We Could Scale
by So8res @ 2015-07-28 | +7 | 0 comments

Be a new homunculus
by So8res @ 2015-07-26 | +22 | 0 comments

Update from the suckerpunch
by So8res @ 2015-07-19 | +17 | 0 comments

Don't steer with guilt
by So8res @ 2015-07-13 | +15 | 0 comments

Shifting guilt
by So8res @ 2015-07-05 | +18 | 0 comments

Rest in motion
by So8res @ 2015-06-28 | +18 | 0 comments

Working yourself ragged is not a virtue
by So8res @ 2015-06-21 | +18 | 0 comments

I am Nate Soares, AMA!
by So8res @ 2015-06-10 | +18 | 0 comments

Your "shoulds" are not a duty
by So8res @ 2015-06-07 | +17 | 0 comments

Not because you "should"
by So8res @ 2015-05-31 | +15 | 0 comments

"Should" considered harmful
by So8res @ 2015-05-25 | +18 | 0 comments

You don't get to know what you're fighting for
by So8res @ 2015-05-17 | +20 | 0 comments

Caring about something larger than yourself
by So8res @ 2015-05-10 | +19 | 0 comments

You're allowed to fight for something
by So8res @ 2015-05-03 | +27 | 0 comments

The stamp collector
by So8res @ 2015-04-27 | +22 | 0 comments

Replacing guilt
by So8res @ 2015-04-19 | +20 | 0 comments

Failing with abandon
by So8res @ 2015-04-13 | +42 | 0 comments

Half-assing it with everything you've got
by So8res @ 2015-03-13 | +52 | 0 comments

The Value of a Life
by So8res @ 2015-02-17 | +39 | 0 comments

Moving towards the goal
by So8res @ 2014-12-07 | +18 | 0 comments

Productivity through self-loyalty
by So8res @ 2014-11-02 | +29 | 0 comments

Self-signaling the ability to do what you want
by So8res @ 2014-10-26 | +19 | 0 comments

On Caring
by So8res @ 2014-10-07 | +334 | 0 comments

So8res

Posts