AI Safety blog posts
September 20, 2024
June 2024 AI Alignment project prize categories
AI alignment
September 16, 2024
Modular AI Safety courses proposal
September 13, 2024
Summary of AI alignment participant user interviews
AI alignment
August 19, 2024
Introduction to Mechanistic Interpretability
AI alignment
August 19, 2024
Problems with Reinforcement Learning from Human Feedback (RLHF) for AI safety
AI alignment
August 13, 2024
What we didn’t cover in our June 2024 AI Alignment course
AI alignment
August 11, 2024
AI alignment project evaluation criteria
AI alignment
July 22, 2024
How to avoid the 2 mistakes behind 89% of rejected AI alignment applications
AI alignment
July 4, 2024
Submitting your AI governance project
July 2, 2024
What we learnt from running our AI alignment course in March 2024
AI alignment
June 24, 2024
What we changed for the June 2024 AI alignment course
AI alignment
June 17, 2024
3 articles on AI safety we’d like to exist
June 5, 2024
AI governance project ideas
AI governance
May 25, 2024
Why are people building AI systems?
May 24, 2024
Announcing the AI Governance Project Phase
AI governance
May 23, 2024
Submitting your AI alignment project
AI alignment
May 20, 2024
2022 AI Alignment Course: 5→37% working on AI safety
AI alignment
April 30, 2024
March 2024 AI Alignment project prize categories
AI alignment
April 12, 2024
AI alignment project ideas
AI alignment
April 10, 2024
How to do an excellent AI alignment project
AI alignment
April 9, 2024
What we didn’t cover in our Early 2024 AI Alignment course
AI alignment
April 5, 2024
How to avoid the 4 mistakes behind 92% of rejected AI governance applications
AI governance
April 5, 2024
Alignment careers guide
AI alignment
March 18, 2024
Can we scale human feedback for complex AI tasks? An intro to scalable oversight.
AI alignment
March 1, 2024
What is AI alignment?
AI alignment
February 21, 2024
What risks does AI pose?
September 11, 2023
Some Talent Needs in AI Governance
AI governance
September 11, 2023
AI Governance Needs Technical Work
AI governance
August 29, 2023
Historical case studies of technology governance and international agreements (compilation – various authors)
AI governance
August 23, 2023
Primer on AI Chips and AI Governance
AI governance
August 16, 2023
Primer on Safety Standards and Regulations for Industrial-Scale AI Development
AI governance
August 15, 2023
The State of AI in Different Countries — An Overview
AI governance
August 8, 2023
Why Might Misaligned, Advanced AI Cause Catastrophe? (Compilation)
January 1, 2023
The Need For Work On Technical AI Alignment
AI alignment
November 27, 2022
Career Resources on U.S. AI Policy
AI governance
November 27, 2022
Career Resources – European AI Policy
AI governance
November 27, 2022
A Brief Introduction to some Approaches to AI Alignment
AI alignment
November 27, 2022
Overviews of Some Basic Models of Governments and International Cooperation
AI governance
November 8, 2022
Avoiding Extreme Global Vulnerability as a Core AI Governance Problem
AI governance
November 8, 2022
Career Resources on AI Strategy Research