As of April 2023, there is a lot of new interest in the field of AI Alignment. However, this repo has been unmaintained for almost three years, since I gave up hope that we would solve alignment in time as a species.
AI Safety Support is probably one of the definitive resources right now.
I will, however, accept PRs on this repo.
Welcome to Awesome AI Alignment - a curated list of awesome resources for getting into and staying in touch with research in AI Alignment.
AI Alignment is also known as AI Safety, Beneficial AI, Human-aligned AI, Friendly AI, etc.
If you are a newcomer to this field, start with the Crash Course below.
Pull requests are welcome.
- Awesome Artificial Intelligence Alignment
- Can we build AI without losing control over it? - Sam Harris
- What happens when our computers get smarter than we are? - Nick Bostrom
- WaitButWhy on AI Safety: Part 1 and Part 2
- A Reply by Luke Muehlhauser correcting a few things
- Superintelligence: Paths, Dangers, Strategies by Nick Bostrom
- Life 3.0 by Max Tegmark
- Artificial Intelligence Safety and Security by Roman Yampolskiy (Editor)
- CS 294-149: Safety and Control for Artificial General Intelligence (Fall 2018) by Andrew Critch and Stuart Russell [UC Berkeley]
- CS 521: Seminar on AI Safety by Dorsa Sadigh [Stanford]
- Paul Christiano’s Agenda summarised by Ajeya Cotra
- The Learning-Theoretic AI Alignment Research Agenda by Vadim Kosoy
- MIRI Machine Learning Agenda by Jessica Taylor, Eliezer Yudkowsky, Patrick LaVictoire, and Andrew Critch
- MIRI Agent Foundations Agenda by Nate Soares and Benya Fallenstein
- Concrete Problems in AI Safety by Dario Amodei, Chris Olah, Jacob Steinhardt, Paul Christiano, John Schulman, and Dan Mané
- DeepMind Scalable Agent Alignment Agenda by Jan Leike, David Krueger, Tom Everitt, Miljan Martic, Vishal Maini, and Shane Legg
- MIRI 2018 Research Directions by Nate Soares
- Integrative Biological Simulation, Neuropsychology, and AI Safety by Gopal P. Sarma, Adam Safron, and Nick J. Hay
- Reframing Superintelligence: Comprehensive AI Services as General Intelligence by Eric Drexler
- AGI Safety Literature Review by Tom Everitt, Gary Lea, and Marcus Hutter
- Towards Safe Artificial General Intelligence - Tom Everitt’s PhD Thesis
- 2018 AI Alignment Literature Review and Charity Comparison by Larks
- FLI AI Policy Resources
- An Overview of Technical AI Alignment with Rohin Shah
- AI Alignment Research Overview by Jacob Steinhardt
- Can we build AI without losing control over it? - Sam Harris (2016)
- What happens when our computers get smarter than we are? - Nick Bostrom (2014)
- 3 principles for creating safer AI - Stuart Russell (2017)
- How to get empowered, not overpowered, by AI - Max Tegmark (2018)
- Risks of Artificial Intelligence by Johannes Heidecke
- Embedded Agency by Scott Garrabrant and Abram Demski
- Value Learning Sequence by Rohin Shah et al.
- Future of Life Institute
- Future of Humanity Institute
- Machine Intelligence Research Institute
- Ought
- OpenAI
- DeepMind Safety Team
- Center for Human-Compatible AI
- Nick Bostrom on *This Week in Machine Learning & AI*
- Eliezer Yudkowsky on *Waking Up With Sam Harris*
- Stuart Russell on *Waking Up With Sam Harris*
- AI Alignment Podcast by Lucas Perry [Future of Life Institute]
- 80,000 Hours Podcast by Rob Wiblin
- Alignment Newsletter by Rohin Shah