Aligning

Aligning AI with human values

Senior Audrey Lorvo is researching AI safety, which seeks to make sure...

Aligning language models to follow instructions

We’ve trained language models that are much better at following user intentions than GPT-3 while also making them more truthful and less toxic, using techniques developed through our alignment research. These InstructGPT models, that...
