John Schulman - Reinforcement Learning from Human Feedback: Progress and Challenges

Video
Apr 20, 2023
#Naturallanguageprocessing #ArtificialIntelligence

John Schulman

@johnschulman2

(Speaker)

Watch on Youtube

1,151 64 min

2 Recommenders

2 Mentions

John discusses the issue of hallucination and factual accuracy with large language models. He argues that behavior cloning or supervised learning is not enough to avoid the hallucin... Show More

Mentions

See All

Jim Fan @DrJimFan · May 12, 2023

Post
From Twitter

Don't watch AI FOMO and fear-mongering videos on YouTube. Watch the excellent talk from John Schulman @johnschulman2, creator of RLHF that powers GPT-4. Now *this* qualifies as "insane" ingenuity, if you ask me.

Nando de Freitas 🏳️‍🌈 @NandoDF · Apr 24, 2023

Post
From Twitter

It’s a brilliant talk by one of the best RL researchers in the world. Recommended.