Announcing the Distillation for Alignment Practicum (DAP)
By Jonas Hallgren, TheMcDouglas @ 2022-08-18T22:31 (+20)
Hello there, fellow EAs!
Around the time of John Wentworth's post A Call for Distillers, we started to work on a distillation course which now has turned into a practicum. (A practicum is more close to a study circle than a course and a lot more open-ended)
Five months later and timely to the launch of the CAIS competition, we're launching the trial run of DAP! (Distillation for Alignment Practicum), starting on the week of September 16th. The course is six weeks long and structured similarly to AGISF, with core readings and exercises during and in-between sessions.
The difference is that we're going to have slightly smaller cohorts containing 3-4 fellows (along with one facilitator), and we're also focusing on letting you develop your distillations. Through this process, we hope you will explore your views of what distillation is helpful for and what techniques might be practical when creating a distillation.
Practicum structure
Below, you can see our proposed loop for how we think of improving your distilling skills. It is essentially the scientific method. The first two weeks of the course are about working on a hypothesis on what makes a helpful distillation and what techniques you could use to implement the values you identify in the first week in practice. You then pick a distillation and apply the methodology you came up with in practice. Lastly, in week six, you review how it went and whether your models could have been improved for the specific distillation you were doing. (For more detail, check out the end of the post)
The hope is that this gives a new meta methodology for improving your writing. As this is a practicum, we aim to provide an open space for discussing what makes a distillation great, as we're not sure of it ourselves.
We hope this will provide a space for people to continuously grow and become great distillers!
Give us feedback, please
It is the first iteration of the distillation practicum, so if you have any ways you can think to improve it, we're very open to feedback! We're specifically looking for you to do one or more of the following:
- Roast the living crap out of the current exercises (Trial by fire)
- Come up with new exercises that would be better than the existing ones
- Suggest things which might be missing in the general research loop
- Come up with improvements to exercises
You can send this to us privately, or you could do it through whatever means you feel like, for example, while partaking in the practicum! We're looking for around 3-5 cohorts in the trial run, and we might expand it after, depending on how it goes.
Links to curriculum and application
Here's the link to the current version of the curriculum (as we're still in the alpha phase, the curriculum will most likely change before the beginning of September): DAP curriculum
Here's the application form: https://airtable.com/shrckaJjpnp3RZGTQ
In general, we're looking for people who have finished AGISF/ have equivalent knowledge and who are interested in distilling a work.
More detail on the structure
Week 1 will focus on why distillations are valuable and what they contribute to the broader alignment community. We hope that by the end of the week, you will have a better idea of how to assess the impactfulness of a distillation.
Week 2 will look at how you structure and write a distillation. We will read about the different research & writing processes of successful people in the EA and AI safety communities. We will also examine specific foundational papers in various technical fields, and explore what made them so compelling - from the writing techniques to the overall structure.
The next three weeks will mainly focus on writing distillations. You will put the lessons from weeks 1 & 2 into practice while you write, and the discussion sessions will be an opportunity to get feedback and share thoughts.
Week 3 focuses on understanding the material in the first place. We will discuss skills such as performing literature reviews, breaking down and understanding complex concepts, forming gears-level models, etc.
Weeks 4 & 5 will be mainly devoted to writing your distillation. We hope that you will have a minimum viable product by the end of week 4, and it will be fleshed out into a full post by the end of week 5.
Week 6 will be an opportunity to update your distillation models and how you might improve them in the future. We will also reflect on the next steps you can take towards doing distillation work for a living in the most impactful way for the alignment field.
Thanks for the help
(Personal note from Jonas): Lastly I wanted to thank everyone who helped us develop the course and gave me the support and ideas needed to get it off the ground. A special thanks to Jamie Bernardi for being a chill dude and supporting this shit in its conception stages. Thanks to Kay Kozaronek, Rudolf Laine, and Oliver Zhang for reviewing the curriculum and giving great support along the way (you're superb). Thanks to John Wentworth for telling us to rebuild the practicum from scratch; it would have been a lot worse without your input. Huge thanks to Callum Mcdougall for being a great co-lead in turning this practicum into something tangible; I seriously couldn't have done it without you! (thanks man, you're great). Finally, thanks to everyone else who gave feedback and supported DAP along the way; you're simply fantastic.
We hope you enjoy this practicum!
Jay Bailey @ 2022-08-19T03:01 (+4)
I've put in an application! I'm currently doing a distillation of how Deep Q Networks work for an AGISF project, and distillation is my Plan B if research engineering doesn't work out, so this would be a great course to do in parallel with my current upskilling! I am in Australia, so the timezones might be difficult, but I'll definitely try to arrange a group to do it with if not selected.
berglund @ 2022-08-29T14:10 (+2)
This looks exciting! I plan to apply.
One reaction I have looking at the syllabus is that it's too theoretical for me in the beginning. I feel like it would be better to have an applied component from the start. Maybe the first two weeks could be theory paired with writing a short distillation in the first week, getting feedback, and then refining it. The feedback loop of actually writing is probably by far the best way to improve distillation skills. This is just an impression I have though, so I could be wrong.