upcarta
  • Sign In
  • Sign Up
  • Explore
  • Search

Let's build GPT: from scratch, in code, spelled out.

  • Video
  • Jan 17, 2023
  • #ComputerProgramming #ComputerScience #ChatGPT
Andrej Karpathy
@karpathy
(Speaker)
www.youtube.com
Watch on Youtube
16,753
2 Recommenders
1 Mention
We build a Generatively Pretrained Transformer (GPT), following the paper "Attention is All You Need" and OpenAI's GPT-2 / GPT-3. We talk about connections to ChatGPT, which has tak... Show More

We build a Generatively Pretrained Transformer (GPT), following the paper "Attention is All You Need" and OpenAI's GPT-2 / GPT-3. We talk about connections to ChatGPT, which has taken the world by storm. We watch GitHub Copilot, itself a GPT, help us write a GPT (meta :D!) . I recommend people watch the earlier makemore videos to get comfortable with the autoregressive language modeling framework and basics of tensors and PyTorch nn, which we take for granted in this video.

Show Less
Recommend
Post
Save
Complete
Collect
Mentions
See All
Andrew Chen @AndrewChen · May 21, 2023
  • Post
  • From Twitter
Kudos to @karpathy (OpenAI/Tesla/Stanford) for his incredible video showing the details on how to build GPT from scratch, in code: I found it useful to actually watch him tokenize a bunch of data, turn it into a tensor, use PyTorch, try various models,…
  • upcarta ©2025
  • Home
  • About
  • Terms
  • Privacy
  • Cookies
  • @upcarta