The Guardrail: A free tool for tracking AI safety research from arXiv (feedback welcome)

By Craig Dickson @ 2026-02-06T19:42 (+1)

I've built The Guardrail, a website that aggregates and curates AI safety research, and I'd value feedback from this community on whether it's useful and how to improve it.

What it does

The site pulls new papers from arXiv daily and uses an LLM (Gemini 3 Flash) to:

I've also processed papers from NeurIPS and ICLR (2025 only for now) with the same tagging system.

There's a weekly Editor's Choice that ranks the top 10 papers by significance and novelty, available as an email digest for those who want it.
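
For anyone curious what the daily arXiv pull might look like in practice, here is a minimal sketch using only the Python standard library and the public arXiv API. It is an illustration under stated assumptions, not the site's actual code (that lives in the GitHub repo): the function names `build_query_url`, `parse_feed`, and `fetch_recent` are hypothetical, and the category list in the usage note is my guess at a plausible starting point. The LLM tagging step is deliberately left out.

```python
import urllib.parse
import urllib.request
import xml.etree.ElementTree as ET

# Public arXiv API endpoint; responses are Atom feeds.
ARXIV_API = "http://export.arxiv.org/api/query"
ATOM = {"atom": "http://www.w3.org/2005/Atom"}


def build_query_url(categories, max_results=100):
    """Build a query URL for the newest submissions in the given categories."""
    # Hypothetical helper: joins categories as "cat:cs.AI OR cat:cs.LG".
    search = " OR ".join(f"cat:{c}" for c in categories)
    params = {
        "search_query": search,
        "sortBy": "submittedDate",
        "sortOrder": "descending",
        "max_results": max_results,
    }
    return f"{ARXIV_API}?{urllib.parse.urlencode(params)}"


def _text(entry, tag):
    """Return an entry's child text with whitespace runs collapsed."""
    raw = entry.findtext(f"atom:{tag}", default="", namespaces=ATOM)
    return " ".join(raw.split())


def parse_feed(atom_xml):
    """Turn an Atom feed (str or bytes) into a list of paper dicts."""
    root = ET.fromstring(atom_xml)
    return [
        {
            "id": _text(entry, "id"),
            "title": _text(entry, "title"),
            "abstract": _text(entry, "summary"),
            "published": _text(entry, "published"),
        }
        for entry in root.findall("atom:entry", ATOM)
    ]


def fetch_recent(categories, max_results=100):
    """Fetch and parse the latest papers (makes a network request)."""
    url = build_query_url(categories, max_results)
    with urllib.request.urlopen(url, timeout=30) as resp:
        return parse_feed(resp.read())
```

Calling `fetch_recent(["cs.AI", "cs.LG", "cs.CL"])` once a day would return a list of dicts whose `abstract` field could then be passed to an LLM for relevance filtering and tagging. Keeping `parse_feed` separate from the network call makes the parsing easy to test offline.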

Why I built it

The volume of potentially safety-relevant research on arXiv is overwhelming. I wanted a way to stay current without manually scanning hundreds of abstracts. The LLM judge isn't perfect, but it catches most of the relevant papers and greatly reduces the manual filtering burden.

Current limitations

What I'm looking for

The site is open source (GitHub) and funded by a BlueDot Impact rapid grant, so I'm committed to maintaining and improving it.

Happy to answer questions about how the filtering works or take feature requests.