Debate experiments at The Curve, LessOnline and Manifest

By Nathan Young @ 2025-06-13T22:35 (+19)

This is a linkpost to https://nathanpmyoung.substack.com/p/debate-experiments

I like debate. I have done for years. So I have been slowly trying to improve it. Here is a set of theories I had and the experiments I've run so far.

Theory: Any debates are good.

Are any debates actually good at all? Should I give up?

Test: Watch different debates.

Evidence: I much prefer some debates to others.

Good debates:

Bad debates:

Unclear:

Status: Theory survived attempted falsification[2].

Theory: The format is the problem.

Test: Run some different debate formats (see next).

Theory: Debates are bad because debaters focus on their own status.

They have to focus on how they appear to the audience and this stops them admitting points where they are wrong.

Test 1: Find ways to protect the status of the debaters

Evidence:

I tried running two debates like this at The Curve (Daniel Kokotajlo vs. Sayash Kapoor; Dean W. Ball vs. Gabriel Weil). I tried to moderate a bit more strongly than people tend to, ensuring that there were blocks of time where each debater was in control of the discussion.

The debates were okay but not great.

In both, it took us a long time to get to what felt like the meat of the discussion. I recall Ball and Weil saying they didn’t really understand one another’s position coming in.

In the Ball vs. Weil debate, they weren’t really interested in being moderated. To me it felt like Ball therefore spent a lot more time defending his position and had less control over the discussion than I would have liked (though I think he was fine with it).

Kokotajlo and Kapoor felt like a solid debate, though not spectacular.

Test 2: Try and remove the status of the debaters and place it somewhere else.

Evidence: Courtly debates, Future of the Democratic Party, China discussion.

Ray Rafiq and I have had a goofy idea for a while of debates in a court style: king, knight, fool, etc. So at LessOnline I tried this out. Each debate had a king (or queen) to set the topic, two knights to argue it and a fool to ask questions. They took about 10 minutes each.

I think our debaters (knights) were much less focused on their own positions than they would have been in other rapid-fire debates we could have run. In many ways it was a role-playing game. But it did feel like I partly succeeded in my aim - to pull status away from the debaters and put it somewhere else.

Later, Oliver Habryka wanted to run a session about the future of the Democratic Party. I pushed to try a new format there too, suggesting that Oliver would stand as the questioner and the discussion would be about what interested him. Whether somebody would speak, and whether the audience would be able to ask questions, would really be up to him, and I would serve as a meta-moderator to guard his time and attention. Habryka is a good candidate here because he's high status within the community (CEO of Lightcone Infrastructure, who organise LessOnline) and people respect his thinking.

This felt really good. There was a single questioner which provided a single viewpoint, rather than many questions from the audience or a rambling discussion from the panel. To me, this gave the event shape. Questions were answered, things were put to the side as new directions were investigated.

A couple of anecdotes:

This felt like a genuine success in that we had a panel and they were being called on to answer questions that felt interesting to someone we respected. For me, a failure mode of debates is that debaters are scared of losing or are trying to take turns, and so what’s being discussed is not really of interest to anyone.

Next, I ran the discussion after a talk by Steve Hsu where he and Noah Smith discussed China. This was okay. At points it felt quite alive between them. But it could have been better for having somebody who was more willing to argue for US values. And perhaps someone to pin down Steve on specific facts about China, which Noah didn't really do (nor did he claim he would, professing not to be an expert[4]).

Status: This theory is doing okay. I have had a couple of good events, but it’s unclear to me what great might look like.

Current top theory: A good investigator is best

My current top theory is that it really matters who is moderating/investigating. If this person is willing to hold the debaters/panel to account and force them to answer the difficult questions or engage with them, that makes for a much more interesting debate than otherwise.

I suggest that Dwarkesh is a particularly good podcast host because he is so knowledgeable on AI topics and so willing to actually chase down his guests and say things like "okay, but what about the data centre built in Saudi Arabia?"

Suggested test: Future conferences, podcasts.

For the next set of conferences I run, I might like to focus on finding a good investigator for a topic, then choosing panelists afterwards and building an event around trying to understand that topic, e.g. AI, China or the Ukraine war.

It's possible I'll also try the strategy for my podcast, which I haven't done episodes for in a while.

Other theories I may test later

One more thing...

Duncan Haldane built a homemade Nielsen rating system that allowed audience members to twist a knob to display either red or green lights on their head. If they were interested, they turned it to green. If they were bored, they turned it to red. I didn't catch any discussions where this was used, but being able to monitor people's interest in real time felt like a pretty interesting thing to do. And I can imagine using tools like this with a set of trusted “tastemakers” to guide an investigator on what interested some relevant group.

I'm not super interested in giving every audience member these because in general I think large groups of people can have quite poor taste[5].

  1. ^

    The main issue with Surrounded is that the circle often removes good debaters because they disagree with the specific arguments as opposed to because they are doing badly. If you don’t follow, watch one! They are really good. eg here

  2. ^

    Does anyone have a better way to describe "survived attempted falsification"?

    Validated seems wrong.

  3. ^

    A better version of this would be to have an app where people could upvote questions and allow the questioner to see these in case any lines of inquiry were interesting to them.

  4. ^

    To me this felt too humble. Smith is a solid commentator on geopolitical issues with a moderate knowledge of China, better than that of almost all of the attendees, I’d guess.

  5. ^

    The median of a large group is quite accurate, but I tend to think the media they produce is not very interesting. Accurate but not tasteful. One to consider for LLMs perhaps.