upcarta
  • Sign In
  • Sign Up
  • Explore
  • Search

John Schulman - Reinforcement Learning from Human Feedback: Progress and Challenges

  • Video
  • Apr 20, 2023
  • #Naturallanguageprocessing #ArtificialIntelligence
John Schulman
@johnschulman2
(Speaker)
www.youtube.com
Watch on Youtube
1,151 64 min
2 Recommenders
2 Mentions
John discusses the issue of hallucination and factual accuracy with large language models. He argues that behavior cloning or supervised learning is not enough to avoid the hallucin... Show More

John discusses the issue of hallucination and factual accuracy with large language models. He argues that behavior cloning or supervised learning is not enough to avoid the hallucination problem. Reinforcement learning from human feedback can help improve the model's truthfulness and ability to express uncertainty.

Show Less
Recommend
Post
Save
Complete
Collect
Mentions
See All
Jim Fan @DrJimFan · May 12, 2023
  • Post
  • From Twitter
Don't watch AI FOMO and fear-mongering videos on YouTube. Watch the excellent talk from John Schulman @johnschulman2, creator of RLHF that powers GPT-4. Now *this* qualifies as "insane" ingenuity, if you ask me.
Nando de Freitas 🏳️‍🌈 @NandoDF · Apr 24, 2023
  • Post
  • From Twitter
It’s a brilliant talk by one of the best RL researchers in the world. Recommended.
  • upcarta ©2025
  • Home
  • About
  • Terms
  • Privacy
  • Cookies
  • @upcarta