AI Safety blog posts

September 20, 2024
June 2024 AI Alignment project prize categories
By Adam Jones
AI alignment
September 16, 2024
Modular AI Safety courses proposal
By Adam Jones
 
September 13, 2024
Summary of AI alignment participant user interviews
By Adam Jones
AI alignment
August 19, 2024
Introduction to Mechanistic Interpretability
By Sarah Hastings-Woodhouse
AI alignment
August 19, 2024
Problems with Reinforcement Learning from Human Feedback (RLHF) for AI safety
By Sarah Hastings-Woodhouse
AI alignment
August 13, 2024
What we didn’t cover in our June 2024 AI Alignment course
By Adam Jones
AI alignment
August 11, 2024
AI alignment project evaluation criteria
By Adam Jones
AI alignment
July 22, 2024
How to avoid the 2 mistakes behind 89% of rejected AI alignment applications
By Adam Jones
AI alignment
July 4, 2024
Submitting your AI governance project
By Luke Drago
 
July 2, 2024
What we learnt from running our AI alignment course in March 2024
By Adam Jones
AI alignment
June 24, 2024
What we changed for the June 2024 AI alignment course
By Adam Jones
AI alignment
June 17, 2024
3 articles on AI safety we’d like to exist
By Adam Jones
 
June 5, 2024
AI governance project ideas
By Luke Drago
AI governance
May 25, 2024
Why are people building AI systems?
By Adam Jones
 
May 24, 2024
Announcing the AI Governance Project Phase
By Luke Drago
AI governance
May 23, 2024
Submitting your AI alignment project
By Adam Jones
AI alignment
May 20, 2024
2022 AI Alignment Course: 5→37% working on AI safety
By Dewi Erwan
AI alignment
April 30, 2024
March 2024 AI Alignment project prize categories
By Adam Jones
AI alignment
April 12, 2024
AI alignment project ideas
By Adam Jones
AI alignment
April 10, 2024
How to do an excellent AI alignment project
By Adam Jones
AI alignment
April 9, 2024
What we didn’t cover in our Early 2024 AI Alignment course
By Adam Jones
AI alignment
April 5, 2024
How to avoid the 4 mistakes behind 92% of rejected AI governance applications
By Adam Jones
AI governance
April 5, 2024
Alignment careers guide
By Charlie Rogers-Smith, with minor updates by Adam Jones
AI alignment
March 18, 2024
Can we scale human feedback for complex AI tasks? An intro to scalable oversight.
By Adam Jones
AI alignment
March 1, 2024
What is AI alignment?
By Adam Jones
AI alignment
February 21, 2024
What risks does AI pose?
By Adam Jones
 
September 11, 2023
Some Talent Needs in AI Governance
By Sam Clarke
AI governance
September 11, 2023
AI Governance Needs Technical Work
By AI Safety Fundamentals team
AI governance
August 29, 2023
Historical case studies of technology governance and international agreements (compilation – various authors)
By AI Safety Fundamentals Team
AI governance
August 23, 2023
Primer on AI Chips and AI Governance
By AI Safety Fundamentals Team
AI governance
August 16, 2023
Primer on Safety Standards and Regulations for Industrial-Scale AI Development
By AI Safety Fundamentals Team
AI governance
August 15, 2023
The State of AI in Different Countries — An Overview
By Lizka Vaintrob
AI governance
August 8, 2023
Why Might Misaligned, Advanced AI Cause Catastrophe? (Compilation)
By AI Safety Fundamentals Team
 
January 1, 2023
The Need For Work On Technical AI Alignment
By Daniel Eth
AI alignment
November 27, 2022
Career Resources on U.S. AI Policy
By AI Safety Fundamentals Team
AI governance
November 27, 2022
Career Resources – European AI Policy
By AI Safety Fundamentals Team
AI governance
November 27, 2022
A Brief Introduction to some Approaches to AI Alignment
By AI Safety Fundamentals Team
AI alignment
November 27, 2022
Overviews of Some Basic Models of Governments and International Cooperation
By AI Safety Fundamentals Team
AI governance
November 8, 2022
Avoiding Extreme Global Vulnerability as a Core AI Governance Problem
By AI Safety Fundamentals Team
AI governance
November 8, 2022
Career Resources on AI Strategy Research
By AI Safety Fundamentals Team
 