ChatGPT is clearly more capable than previous language models. It’s not clear why, from the scant available info about its construction. This paper has some interesting hypotheses: