It’s pretty easy to see the problem here: The Internet is brimming with misinformation, and most large language models are trained on a massive body of text obtained from the Internet.

Ideally, substantially higher volumes of accurate information would overwhelm the lies. But is that really the case? A new study by researchers at New York University examines how much medical misinformation can be included in a large language model (LLM) training set before it spits out inaccurate answers. While the study doesn’t identify a lower bound, it does show that by the time misinformation accounts for 0.001 percent of the training data, the resulting LLM is compromised.
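For a rough sense of scale (the corpus size below is an illustrative assumption, not a figure from the study), 0.001 percent is a tiny fraction but still a lot of text in absolute terms:

```python
# Back-of-envelope scale check: how many tokens is 0.001 percent of a large corpus?
# The corpus size is a hypothetical assumption for illustration, not from the NYU study.
corpus_tokens = 1_000_000_000_000     # assume a 1-trillion-token training corpus
poison_fraction = 0.001 / 100         # 0.001 percent expressed as a fraction (1e-5)
poison_tokens = int(corpus_tokens * poison_fraction)
print(f"{poison_tokens:,} tokens of misinformation")  # 10,000,000
```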

  • @ribhu · 14 hours ago

    How old is this study? The LLMs mentioned are Llama 2 and GPT-3.5, which in current terms are almost archaic.

    • @Zron · 14 hours ago

      Unfortunately, it’s a lot harder to rigorously test something than it is to shit a new product out into the wild with no regard for its impact.