cross-posted from: https://lemmy.world/post/13768307

In this video, we recount an incident that occurred at OpenAI while researchers were trying to finetune GPT-2 to be as helpful and ethical as possible. It’s narrated that inadvertently flipping a single minus sign led GPT-2 to become the embodiment of a well-known cardinal sin.