The Waluigi Effect (mega-post) - AI Alignment Forum

Article
Mar 3, 2023
#ArtificialIntelligence

In this article, I will present a mechanistic explanation of the Waluigi Effect and other bizarre "semiotic" phenomena which arise within large language models such as GPT-3/3.5/4 a... Show More

Mentions

See All

Sriram Krishnan @SriramKrishnan · Mar 5, 2023

Post
From Twitter

My favorite post of recent times (via @patrickc )