Center for AI Safety

AISN #66: Evaluating Frontier Models, New Gemini and Claude, Preemption is Back
by Center for AI Safety, Nick_Stockton, Dan H @ 2025-12-02 | +6 | 0 comments

AISN #65: Measuring Automation and Superintelligence Moratorium Letter
by Center for AI Safety, Alice Blair, Dan H @ 2025-10-29 | +8 | 0 comments

AISN #64: New AGI Definition and Senate Bill Would Establish Liability for AI...
by Center for AI Safety, Corin Katzke, Dan H @ 2025-10-16 | +8 | 0 comments

AISN #63: California’s SB-53 Passes the Legislature
by Center for AI Safety, Corin Katzke, Dan H @ 2025-09-24 | +6 | 0 comments

AISN #61: OpenAI Releases GPT-5
by Center for AI Safety, Corin Katzke, Dan H @ 2025-08-12 | +6 | 0 comments

AISN #60: The AI Action Plan
by Center for AI Safety, Corin Katzke, Dan H @ 2025-07-31 | +6 | 0 comments

AISN #59: EU Publishes General-Purpose AI Code of Practice
by Center for AI Safety, Corin Katzke, Dan H @ 2025-07-15 | +8 | 0 comments

AISN #58: Senate Removes State AI Regulation Moratorium
by Center for AI Safety, Corin Katzke, Dan H @ 2025-07-03 | +6 | 0 comments

AISN #57: The RAISE Act
by Center for AI Safety, Corin Katzke, Dan H @ 2025-06-17 | +12 | 0 comments

AISN #56: Google Releases Veo 3
by Center for AI Safety, Corin Katzke, Dan H @ 2025-05-28 | +6 | 0 comments

AISN #55: Trump Administration Rescinds AI Diffusion Rule, Allows Chip Sales to...
by Center for AI Safety, Corin Katzke, Dan H @ 2025-05-20 | +7 | 0 comments

AISN #54: OpenAI Updates Restructure Plan
by Center for AI Safety, Corin Katzke, Dan H @ 2025-05-13 | +7 | 0 comments

AIs Are Expert-Level at Many Virology Skills
by Center for AI Safety, SecureBio, JasperGo, Dan H @ 2025-05-02 | +22 | 0 comments

AISN #53: An Open Letter Attempts to Block OpenAI Restructuring
by Center for AI Safety, Corin Katzke, Dan H @ 2025-04-29 | +6 | 0 comments

AISN#52: An Expert Virology Benchmark
by Center for AI Safety, Corin Katzke, Dan H @ 2025-04-22 | +6 | 0 comments

AISN #51: AI Frontiers
by Center for AI Safety, Corin Katzke, Dan H @ 2025-04-15 | +8 | 0 comments

AISN #50: AI Action Plan Responses
by Center for AI Safety, Corin Katzke, Dan H @ 2025-03-31 | +10 | 0 comments

AISN #49: Superintelligence Strategy
by Center for AI Safety, Corin Katzke, Dan H @ 2025-03-06 | +8 | 0 comments

AISN #48: Utility Engineering and EnigmaEval
by Center for AI Safety, Corin Katzke, Dan H @ 2025-02-18 | +6 | 0 comments

AISN #47: Reasoning Models
by Center for AI Safety, Corin Katzke, Dan H @ 2025-02-06 | +8 | 0 comments

AISN #46: The Transition
by Center for AI Safety, Corin Katzke, Dan H @ 2025-01-23 | +10 | 0 comments

AISN #45: Center for AI Safety 2024 Year in Review
by Center for AI Safety, Corin Katzke, Dan H @ 2024-12-19 | +11 | 0 comments

AISN #44: The Trump Circle on AI Safety Plus, Chinese researchers used Llama to...
by Center for AI Safety, Corin Katzke, Julius, andrewz, Dan H @ 2024-11-19 | +11 | 0 comments

AI Safety Newsletter #43: White House Issues First National Security Memo on AI...
by Center for AI Safety, Corin Katzke, AlexaPanYue, Dan H @ 2024-10-28 | +6 | 0 comments

AI Safety Newsletter #42: Newsom Vetoes SB 1047 Plus, OpenAI’s o1, and AI...
by Center for AI Safety, Corin Katzke, Julius, AlexaPanYue, andrewz, Dan H @ 2024-10-01 | +10 | 0 comments

AI Safety Newsletter #41: The Next Generation of Compute Scale Plus, Ranking...
by Center for AI Safety, Corin Katzke, Julius, andrewz, Dan H @ 2024-09-11 | +12 | 0 comments

AI forecasting bots incoming
by Center for AI Safety, Long Phan, andrewz, Mantas Mazeika, Adam Austin Khoja, Dan H @ 2024-09-09 | –2 | 0 comments

AI Safety Newsletter #40: California AI Legislation Plus, NVIDIA Delays Chip...
by Center for AI Safety, Corin Katzke, Julius, AlexaPanYue, Dan H @ 2024-08-21 | +17 | 0 comments

AI Safety Newsletter #39: Implications of a Trump Administration for AI...
by Center for AI Safety, Corin Katzke, AlexaPanYue, Julius, Dan H @ 2024-07-29 | +6 | 0 comments

AISN #38: Supreme Court Decision Could Limit Federal Ability to Regulate...
by Center for AI Safety, Corin Katzke, AlexaPanYue, Julius, Dan H @ 2024-07-09 | +8 | 0 comments

AI Safety Newsletter #37: US Launches Antitrust Investigations Plus, recent...
by Center for AI Safety, Corin Katzke, AlexaPanYue, Julius, Dan H @ 2024-06-18 | +15 | 0 comments

AISN #36: Voluntary Commitments are Insufficient Plus, a Senate AI Policy...
by Center for AI Safety, Corin Katzke, Julius, Dan H @ 2024-05-30 | +6 | 0 comments

AISN #35: Lobbying on AI Regulation Plus, New Models from OpenAI and Google,...
by Center for AI Safety, Corin Katzke, Dan H @ 2024-05-16 | +14 | 0 comments

AISN #34: New Military AI Systems Plus, AI Labs Fail to Uphold Voluntary...
by Center for AI Safety, Corin Katzke, Dan H @ 2024-05-02 | +21 | 0 comments

AISN #33: Reassessing AI and Biorisk Plus, Consolidation in the Corporate AI...
by Center for AI Safety, Corin Katzke, AlexaPanYue, Dan H @ 2024-04-12 | +19 | 0 comments

$250K in Prizes: SafeBench Competition Announcement
by Center for AI Safety @ 2024-04-03 | +47 | 0 comments

Cybersecurity and AI: The Evolving Security Landscape
by Center for AI Safety @ 2024-03-14 | +9 | 0 comments

AISN #32: Measuring and Reducing Hazardous Knowledge in LLMs Plus, Forecasting...
by Center for AI Safety, Corin Katzke, Dan H @ 2024-03-07 | +15 | 0 comments

Biosecurity and AI: Risks and Opportunities
by Center for AI Safety @ 2024-02-27 | +7 | 0 comments

AISN #31: A New AI Policy Bill in California Plus, Precedents for AI Governance...
by Center for AI Safety, Dan H @ 2024-02-21 | +27 | 0 comments

AISN #30: Investments in Compute and Military AI Plus, Japan and Singapore’s...
by Center for AI Safety, Dan H, Corin Katzke @ 2024-01-24 | +7 | 0 comments

AISN #29: Progress on the EU AI Act Plus, the NY Times sues OpenAI for Copyright...
by Center for AI Safety, Dan H, Corin Katzke @ 2024-01-04 | +5 | 0 comments

AISN #28: Center for AI Safety 2023 Year in Review
by Center for AI Safety, Dan H @ 2023-12-23 | +17 | 0 comments

AISN #27: Defensive Accelerationism, A Retrospective On The OpenAI Board Saga,...
by Center for AI Safety, Dan H, Corin Katzke, allisoncyhuang @ 2023-12-07 | +10 | 0 comments

AISN #26: National Institutions for AI Safety, Results From the UK Summit, and...
by Center for AI Safety, Corin Katzke, allisoncyhuang, Dan H @ 2023-11-15 | +11 | 0 comments

Center for AI Safety’s Bi-Weekly Reading and Learning
by Center for AI Safety @ 2023-11-02 | +5 | 0 comments

AISN #25: White House Executive Order on AI, UK AI Safety Summit, and Progress...
by Center for AI Safety, Dan H @ 2023-10-31 | +21 | 0 comments

AISN #24: Kissinger Urges US-China Cooperation on AI, China's New AI Law, US...
by Center for AI Safety, Dan H, Corin Katzke @ 2023-10-18 | +16 | 0 comments

AISN #23: New OpenAI Models, News from Anthropic, and Representation Engineering
by Center for AI Safety, Dan H @ 2023-10-04 | +7 | 0 comments

AISN #22: The Landscape of US AI Legislation - Hearings, Frameworks, Bills, and...
by Center for AI Safety, Dan H @ 2023-09-19 | +15 | 0 comments

MLSN: #10 Adversarial Attacks Against Language and Vision Models, Improving LLM...
by Center for AI Safety, Dan H @ 2023-09-13 | +7 | 0 comments

AISN #21: Google DeepMind’s GPT-4 Competitor, Military Investments in Autonomous...
by Center for AI Safety, Dan H @ 2023-09-05 | +13 | 0 comments

AISN #20: LLM Proliferation, AI Deception, and Continuing Drivers of AI...
by Center for AI Safety, Dan H @ 2023-08-29 | +12 | 0 comments

An Overview of Catastrophic AI Risks
by Center for AI Safety, Dan H, Mantas Mazeika, TW123 @ 2023-08-15 | +37 | 0 comments

AISN #18: Challenges of Reinforcement Learning from Human Feedback, Microsoft’s...
by Center for AI Safety, Dan H @ 2023-08-08 | +12 | 0 comments

AISN #17: Automatically Circumventing LLM Guardrails, the Frontier Model Forum,...
by Center for AI Safety, Dan H @ 2023-08-01 | +15 | 0 comments

AISN #16: White House Secures Voluntary Commitments from Leading AI Labs and...
by Center for AI Safety, Dan H, Corin Katzke @ 2023-07-25 | +7 | 0 comments

AISN#15: China and the US take action to regulate AI, results from a tournament...
by Center for AI Safety, Dan H, Corin Katzke @ 2023-07-19 | +5 | 0 comments

AISN#14: OpenAI’s ‘Superalignment’ team, Musk’s xAI launches, and developments...
by Center for AI Safety, Dan H @ 2023-07-12 | +26 | 0 comments

Cost-effectiveness of professional field-building programs for AI safety...
by Center for AI Safety @ 2023-07-10 | +45 | 0 comments

Cost-effectiveness of student programs for AI safety research
by Center for AI Safety @ 2023-07-10 | +53 | 0 comments

Modeling the impact of AI safety field-building programs
by Center for AI Safety @ 2023-07-10 | +86 | 0 comments

AISN #13: An interdisciplinary perspective on AI proxy failures, new competitors...
by Center for AI Safety, Dan H @ 2023-07-05 | +25 | 0 comments

Catastrophic Risks from AI #6: Discussion and FAQ
by Center for AI Safety @ 2023-06-27 | +10 | 0 comments

Catastrophic Risks from AI #5: Rogue AIs
by Center for AI Safety @ 2023-06-27 | +16 | 0 comments

AISN #12: Policy Proposals from NTIA’s Request for Comment and Reconsidering...
by Center for AI Safety, Dan H @ 2023-06-27 | +30 | 0 comments

AISN #9: Statement on Extinction Risks, Competitive Pressures, and When Will AI...
by Center for AI Safety, Dan H @ 2023-06-06 | +12 | 0 comments

AI Safety Newsletter #8: Rogue AIs, how to screen for AI risks, and grants for...
by Center for AI Safety, Dan H @ 2023-05-30 | +16 | 0 comments

Statement on AI Extinction - Signed by AGI Labs, Top Academics, and Many Other...
by Center for AI Safety @ 2023-05-30 | +429 | 0 comments

AI Safety Newsletter #7: Disinformation, Governance Recommendations for AI labs,...
by Center for AI Safety, Dan H @ 2023-05-23 | +23 | 0 comments

AI Safety Newsletter #6: Examples of AI safety progress, Yoshua Bengio proposes...
by Center for AI Safety, Dan H @ 2023-05-16 | +32 | 0 comments

AI Safety Newsletter #5: Geoffrey Hinton speaks out on AI risk, the White House...
by Center for AI Safety, Dan H @ 2023-05-09 | +60 | 0 comments

AI Safety Newsletter #4: AI and Cybersecurity, Persuasive AIs, Weaponization,...
by Center for AI Safety, Dan H @ 2023-05-02 | +35 | 0 comments

Center for AI Safety

Posts