upcarta
  • Sign In
  • Sign Up
  • Explore
  • Search

The Scaling Hypothesis · Gwern.net

  • Article
  • May 28, 2020
  • #ComputerScience
Gwern
@Gwern
(Author)
www.gwern.net
Read on www.gwern.net
2 Recommenders
2 Mentions
On GPT-3: meta-learning, scaling, implications, and deep theory. The scaling hypothesis: neural nets absorb data & compute, generalizing and becoming more Bayesian as problems get h... Show More

On GPT-3: meta-learning, scaling, implications, and deep theory. The scaling hypothesis: neural nets absorb data & compute, generalizing and becoming more Bayesian as problems get harder, manifesting new abilities even at trivial-by-global-standards-scale. The deep learning revolution has begun as foretold.

Show Less
Recommend
Post
Save
Complete
Collect
Mentions
See All
Michael Nielsen @michael_nielsen · Apr 14, 2022
  • Post
  • From Twitter
Much more here, very highly recommended:
Matt Clifford @matthewclifford · Apr 16, 2023
  • Post
  • From Twitter
This is very true. I’ve probably shared “The Scaling Hypothesis” more than any other post. From 2020, but still essential reading:
  • upcarta ©2025
  • Home
  • About
  • Terms
  • Privacy
  • Cookies
  • @upcarta