Lemmy.World
  • Communities
  • Create Post
  • Create Community
  • heart
    Support Lemmy
  • search
    Search
  • Login
  • Sign Up
RSS Bot@lemmy.bestiver.seBM to Hacker News@lemmy.bestiver.seEnglish · 9 days ago

The Reversal Curse: LLMs trained on "A is B" fail to learn "B is A"

arxiv.org

external-link
message-square
4
link
fedilink
7
external-link

The Reversal Curse: LLMs trained on "A is B" fail to learn "B is A"

arxiv.org

RSS Bot@lemmy.bestiver.seBM to Hacker News@lemmy.bestiver.seEnglish · 9 days ago
message-square
4
link
fedilink
We expose a surprising failure of generalization in auto-regressive large language models (LLMs). If a model is trained on a sentence of the form "A is B", it will not automatically generalize to the reverse direction "B is A". This is the Reversal Curse. For instance, if a model is trained on "Valentina Tereshkova was the first woman to travel to space", it will not automatically be able to answer the question, "Who was the first woman to travel to space?". Moreover, the likelihood of the correct answer ("Valentina Tershkova") will not be higher than for a random name. Thus, models do not generalize a prevalent pattern in their training set: if "A is B" occurs, "B is A" is more likely to occur. It is worth noting, however, that if "A is B" appears in-context, models can deduce the reverse relationship. We provide evidence for the Reversal Curse by finetuning GPT-3 and Llama-1 on fictitious statements such as "Uriah Hawthorne is the composer of Abyssal Melodies" and showing that they fail to correctly answer "Who composed Abyssal Melodies?". The Reversal Curse is robust across model sizes and model families and is not alleviated by data augmentation. We also evaluate ChatGPT (GPT-3.5 and GPT-4) on questions about real-world celebrities, such as "Who is Tom Cruise's mother? [A: Mary Lee Pfeiffer]" and the reverse "Who is Mary Lee Pfeiffer's son?". GPT-4 correctly answers questions like the former 79% of the time, compared to 33% for the latter. Code available at: https://github.com/lukasberglund/reversal_curse.

Comments

alert-triangle
You must log in or # to comment.
  • jdr@lemmy.ml
    link
    fedilink
    English
    arrow-up
    3
    arrow-down
    1
    ·
    9 days ago

    dog is fat <=> fat is dog

    • thenextguy
      link
      fedilink
      English
      arrow-up
      2
      arrow-down
      1
      ·
      9 days ago

      All mackerel are fish == trout live in trees

    • ZoteTheMighty@lemmy.zip
      link
      fedilink
      English
      arrow-up
      1
      ·
      9 days ago

      The word “is” has a much stricter definition in formal language than normal English.

  • 777Prawn@sh.itjust.works
    link
    fedilink
    English
    arrow-up
    1
    ·
    9 days ago

    https://arxiv.org/html/2602.02470v1

Hacker News@lemmy.bestiver.se

hackernews@lemmy.bestiver.se

Subscribe from Remote Instance

You are not logged in. However you can subscribe from another Fediverse account, for example Lemmy or Mastodon. To do this, paste the following into the search field of your instance: [email protected]
lock
Community locked: only moderators can create posts. You can still comment on posts.

Posts from the RSS Feed of HackerNews.

The feed sometimes contains ads and posts that have been removed by the mod team at HN.

Source of the RSS Bot

Visibility: Public
globe

This community can be federated to other instances and be posted/commented in by their users.

  • 625 users / day
  • 1.57K users / week
  • 3.7K users / month
  • 9.31K users / 6 months
  • 868 local subscribers
  • 5.09K subscribers
  • 54.7K Posts
  • 28.2K Comments
  • Modlog
  • mods:
  • patrick@lemmy.bestiver.se
  • RSS Bot@lemmy.bestiver.seB
  • UI: 0.19.19-4-g32c157fc
  • BE: 0.19.19-8-g30bb3e220
  • Modlog
  • Legal
  • Instances
  • Docs
  • Code
  • join-lemmy.org