Center for AI Safety
Posts
AI Safety Newsletter #43: White House Issues First National Security Memo on AI...
by Center for AI Safety, Corin Katzke, AlexaPanYue, Dan H @ 2024-10-28 | +6 | 0 comments
by Center for AI Safety, Corin Katzke, AlexaPanYue, Dan H @ 2024-10-28 | +6 | 0 comments
AI Safety Newsletter #42: Newsom Vetoes SB 1047
Plus, OpenAI’s o1, and AI...
by Center for AI Safety, Corin Katzke, Julius, AlexaPanYue, andrewz, Dan H @ 2024-10-01 | +10 | 0 comments
by Center for AI Safety, Corin Katzke, Julius, AlexaPanYue, andrewz, Dan H @ 2024-10-01 | +10 | 0 comments
AI Safety Newsletter #41: The Next Generation of Compute Scale
Plus, Ranking...
by Center for AI Safety, Corin Katzke, Julius, andrewz, Dan H @ 2024-09-11 | +12 | 0 comments
by Center for AI Safety, Corin Katzke, Julius, andrewz, Dan H @ 2024-09-11 | +12 | 0 comments
AI forecasting bots incoming
by Center for AI Safety, Long Phan, andrewz, Mantas Mazeika, Adam Austin Khoja, Dan H @ 2024-09-09 | –2 | 0 comments
by Center for AI Safety, Long Phan, andrewz, Mantas Mazeika, Adam Austin Khoja, Dan H @ 2024-09-09 | –2 | 0 comments
AI Safety Newsletter #40: California AI Legislation
Plus, NVIDIA Delays Chip...
by Center for AI Safety, Corin Katzke, Julius, AlexaPanYue, Dan H @ 2024-08-21 | +17 | 0 comments
by Center for AI Safety, Corin Katzke, Julius, AlexaPanYue, Dan H @ 2024-08-21 | +17 | 0 comments
AI Safety Newsletter #39: Implications of a Trump Administration for AI...
by Center for AI Safety, Corin Katzke, AlexaPanYue, Julius, Dan H @ 2024-07-29 | +6 | 0 comments
by Center for AI Safety, Corin Katzke, AlexaPanYue, Julius, Dan H @ 2024-07-29 | +6 | 0 comments
AISN #38: Supreme Court Decision Could Limit Federal Ability to Regulate...
by Center for AI Safety, Corin Katzke, AlexaPanYue, Julius, Dan H @ 2024-07-09 | +8 | 0 comments
by Center for AI Safety, Corin Katzke, AlexaPanYue, Julius, Dan H @ 2024-07-09 | +8 | 0 comments
AI Safety Newsletter #37: US Launches Antitrust Investigations
Plus, recent...
by Center for AI Safety, Corin Katzke, AlexaPanYue, Julius, Dan H @ 2024-06-18 | +15 | 0 comments
by Center for AI Safety, Corin Katzke, AlexaPanYue, Julius, Dan H @ 2024-06-18 | +15 | 0 comments
AISN #36: Voluntary Commitments are Insufficient
Plus, a Senate AI Policy...
by Center for AI Safety, Corin Katzke, Julius, Dan H @ 2024-05-30 | +6 | 0 comments
by Center for AI Safety, Corin Katzke, Julius, Dan H @ 2024-05-30 | +6 | 0 comments
AISN #35: Lobbying on AI Regulation
Plus, New Models from OpenAI and Google,...
by Center for AI Safety, aogara, Corin Katzke, Dan H @ 2024-05-16 | +14 | 0 comments
by Center for AI Safety, aogara, Corin Katzke, Dan H @ 2024-05-16 | +14 | 0 comments
AISN #34: New Military AI Systems
Plus, AI Labs Fail to Uphold Voluntary...
by Center for AI Safety, aogara, Corin Katzke, Dan H @ 2024-05-02 | +21 | 0 comments
by Center for AI Safety, aogara, Corin Katzke, Dan H @ 2024-05-02 | +21 | 0 comments
AISN #33: Reassessing AI and Biorisk
Plus, Consolidation in the Corporate AI...
by Center for AI Safety, aogara, Corin Katzke, AlexaPanYue, Dan H @ 2024-04-12 | +19 | 0 comments
by Center for AI Safety, aogara, Corin Katzke, AlexaPanYue, Dan H @ 2024-04-12 | +19 | 0 comments
$250K in Prizes: SafeBench Competition Announcement
by Center for AI Safety @ 2024-04-03 | +47 | 0 comments
by Center for AI Safety @ 2024-04-03 | +47 | 0 comments
Cybersecurity and AI: The Evolving Security Landscape
by Center for AI Safety @ 2024-03-14 | +9 | 0 comments
by Center for AI Safety @ 2024-03-14 | +9 | 0 comments
AISN #32: Measuring and Reducing Hazardous Knowledge in LLMs
Plus, Forecasting...
by Center for AI Safety, aogara, Corin Katzke, Dan H @ 2024-03-07 | +15 | 0 comments
by Center for AI Safety, aogara, Corin Katzke, Dan H @ 2024-03-07 | +15 | 0 comments
AISN #31: A New AI Policy Bill in California
Plus, Precedents for AI Governance...
by Center for AI Safety, aogara, Dan H @ 2024-02-21 | +27 | 0 comments
by Center for AI Safety, aogara, Dan H @ 2024-02-21 | +27 | 0 comments
AISN #30: Investments in Compute and Military AI
Plus, Japan and Singapore’s...
by Center for AI Safety, aogara, Dan H, Corin Katzke @ 2024-01-24 | +7 | 0 comments
by Center for AI Safety, aogara, Dan H, Corin Katzke @ 2024-01-24 | +7 | 0 comments
AISN #29: Progress on the EU AI Act
Plus, the NY Times sues OpenAI for Copyright...
by Center for AI Safety, aogara, Dan H, Corin Katzke @ 2024-01-04 | +5 | 0 comments
by Center for AI Safety, aogara, Dan H, Corin Katzke @ 2024-01-04 | +5 | 0 comments
AISN #28: Center for AI Safety 2023 Year in Review
by Center for AI Safety, aogara, Dan H @ 2023-12-23 | +17 | 0 comments
by Center for AI Safety, aogara, Dan H @ 2023-12-23 | +17 | 0 comments
AISN #27: Defensive Accelerationism, A Retrospective On The OpenAI Board Saga,...
by Center for AI Safety, aogara, Dan H, Corin Katzke, allisoncyhuang @ 2023-12-07 | +10 | 0 comments
by Center for AI Safety, aogara, Dan H, Corin Katzke, allisoncyhuang @ 2023-12-07 | +10 | 0 comments
AISN #26: National Institutions for AI Safety, Results From the UK Summit, and...
by Center for AI Safety, aogara, Corin Katzke, allisoncyhuang, Dan H @ 2023-11-15 | +11 | 0 comments
by Center for AI Safety, aogara, Corin Katzke, allisoncyhuang, Dan H @ 2023-11-15 | +11 | 0 comments
Center for AI Safety’s Bi-Weekly Reading and Learning
by Center for AI Safety @ 2023-11-02 | +5 | 0 comments
by Center for AI Safety @ 2023-11-02 | +5 | 0 comments
AISN #25:
White House Executive Order on AI, UK AI Safety Summit, and Progress...
by Center for AI Safety, aogara, Dan H @ 2023-10-31 | +21 | 0 comments
by Center for AI Safety, aogara, Dan H @ 2023-10-31 | +21 | 0 comments
AISN #24:
Kissinger Urges US-China Cooperation on AI, China's New AI Law, US...
by Center for AI Safety, aogara, Dan H, Corin Katzke @ 2023-10-18 | +16 | 0 comments
by Center for AI Safety, aogara, Dan H, Corin Katzke @ 2023-10-18 | +16 | 0 comments
AISN #23:
New OpenAI Models, News from Anthropic, and Representation Engineering
by Center for AI Safety, aogara, Dan H @ 2023-10-04 | +7 | 0 comments
by Center for AI Safety, aogara, Dan H @ 2023-10-04 | +7 | 0 comments
AISN #22: The Landscape of US AI Legislation -
Hearings, Frameworks, Bills, and...
by Center for AI Safety, aogara, Dan H @ 2023-09-19 | +15 | 0 comments
by Center for AI Safety, aogara, Dan H @ 2023-09-19 | +15 | 0 comments
MLSN: #10
Adversarial Attacks Against Language and Vision Models, Improving LLM...
by Center for AI Safety, aogara, Dan H @ 2023-09-13 | +7 | 0 comments
by Center for AI Safety, aogara, Dan H @ 2023-09-13 | +7 | 0 comments
AISN #21:
Google DeepMind’s GPT-4 Competitor, Military Investments in Autonomous...
by Center for AI Safety, aogara, Dan H @ 2023-09-05 | +13 | 0 comments
by Center for AI Safety, aogara, Dan H @ 2023-09-05 | +13 | 0 comments
AISN #20: LLM Proliferation, AI Deception, and Continuing Drivers of AI...
by Center for AI Safety, aogara, Dan H @ 2023-08-29 | +12 | 0 comments
by Center for AI Safety, aogara, Dan H @ 2023-08-29 | +12 | 0 comments
An Overview of Catastrophic AI Risks
by Center for AI Safety, Dan H, Mantas Mazeika, TW123 @ 2023-08-15 | +37 | 0 comments
by Center for AI Safety, Dan H, Mantas Mazeika, TW123 @ 2023-08-15 | +37 | 0 comments
AISN #18: Challenges of Reinforcement Learning from Human Feedback, Microsoft’s...
by Center for AI Safety, aogara, Dan H @ 2023-08-08 | +12 | 0 comments
by Center for AI Safety, aogara, Dan H @ 2023-08-08 | +12 | 0 comments
AISN #17: Automatically Circumventing LLM Guardrails, the Frontier Model Forum,...
by Center for AI Safety, Dan H, aogara @ 2023-08-01 | +15 | 0 comments
by Center for AI Safety, Dan H, aogara @ 2023-08-01 | +15 | 0 comments
AISN #16: White House Secures Voluntary Commitments from Leading AI Labs and...
by Center for AI Safety, Dan H, Corin Katzke, aogara @ 2023-07-25 | +7 | 0 comments
by Center for AI Safety, Dan H, Corin Katzke, aogara @ 2023-07-25 | +7 | 0 comments
AISN#15: China and the US take action to regulate AI, results from a tournament...
by Center for AI Safety, Dan H, Corin Katzke @ 2023-07-19 | +5 | 0 comments
by Center for AI Safety, Dan H, Corin Katzke @ 2023-07-19 | +5 | 0 comments
AISN#14: OpenAI’s ‘Superalignment’ team, Musk’s xAI launches, and developments...
by Center for AI Safety, Dan H @ 2023-07-12 | +26 | 0 comments
by Center for AI Safety, Dan H @ 2023-07-12 | +26 | 0 comments
Cost-effectiveness of professional field-building programs for AI safety...
by Center for AI Safety @ 2023-07-10 | +38 | 0 comments
by Center for AI Safety @ 2023-07-10 | +38 | 0 comments
Cost-effectiveness of student programs for AI safety research
by Center for AI Safety @ 2023-07-10 | +53 | 0 comments
by Center for AI Safety @ 2023-07-10 | +53 | 0 comments
Modeling the impact of AI safety field-building programs
by Center for AI Safety @ 2023-07-10 | +83 | 0 comments
by Center for AI Safety @ 2023-07-10 | +83 | 0 comments
AISN #13: An interdisciplinary perspective on AI proxy failures, new competitors...
by Center for AI Safety, Dan H, aogara @ 2023-07-05 | +25 | 0 comments
by Center for AI Safety, Dan H, aogara @ 2023-07-05 | +25 | 0 comments
Catastrophic Risks from AI #6: Discussion and FAQ
by Center for AI Safety @ 2023-06-27 | +10 | 0 comments
by Center for AI Safety @ 2023-06-27 | +10 | 0 comments
AISN #12: Policy Proposals from NTIA’s Request for Comment and Reconsidering...
by Center for AI Safety, Dan H, aogara @ 2023-06-27 | +30 | 0 comments
by Center for AI Safety, Dan H, aogara @ 2023-06-27 | +30 | 0 comments
AISN #9: Statement on Extinction Risks, Competitive Pressures, and When Will AI...
by Center for AI Safety, Dan H, aogara @ 2023-06-06 | +12 | 0 comments
by Center for AI Safety, Dan H, aogara @ 2023-06-06 | +12 | 0 comments
AI Safety Newsletter #8: Rogue AIs, how to screen for AI risks, and grants for...
by Center for AI Safety, Dan H, Akash, aogara @ 2023-05-30 | +16 | 0 comments
by Center for AI Safety, Dan H, Akash, aogara @ 2023-05-30 | +16 | 0 comments
Statement on AI Extinction - Signed by AGI Labs, Top Academics, and Many Other...
by Center for AI Safety @ 2023-05-30 | +427 | 0 comments
by Center for AI Safety @ 2023-05-30 | +427 | 0 comments
AI Safety Newsletter #7: Disinformation, Governance Recommendations for AI labs,...
by Center for AI Safety, Dan H, Akash, aogara @ 2023-05-23 | +23 | 0 comments
by Center for AI Safety, Dan H, Akash, aogara @ 2023-05-23 | +23 | 0 comments
AI Safety Newsletter #6: Examples of AI safety progress, Yoshua Bengio proposes...
by Center for AI Safety, Dan H, Akash, aogara @ 2023-05-16 | +32 | 0 comments
by Center for AI Safety, Dan H, Akash, aogara @ 2023-05-16 | +32 | 0 comments
AI Safety Newsletter #5: Geoffrey Hinton speaks out on AI risk, the White House...
by Center for AI Safety, Dan H, Akash, aogara @ 2023-05-09 | +60 | 0 comments
by Center for AI Safety, Dan H, Akash, aogara @ 2023-05-09 | +60 | 0 comments
AI Safety Newsletter #4: AI and Cybersecurity, Persuasive AIs, Weaponization,...
by Center for AI Safety, Dan H, Akash, aogara @ 2023-05-02 | +35 | 0 comments
by Center for AI Safety, Dan H, Akash, aogara @ 2023-05-02 | +35 | 0 comments