AI watermarking must be watertight to be effective (on SynthID, DeepMind's "watermarking" tool for detecting LLM outputs)

@zlatiah · 5 months ago

AI watermarking must be watertight to be effective (on SynthID, DeepMind's "watermarking" tool for detecting LLM outputs)

@[email protected] · edit-2 5 months ago

However, it is still comparatively easy for a determined individual to remove a watermark and make AI-generated text look as if it was written by a person.

And that’s assuming people are using a model specifically designed with watermarking in the first place. In practice, this will only affect the absolute dumbest adversaries. It won’t apply at all to open source or custom-built tools. Any additional step in a workflow is going to wash this right out either way.

My fear is that regulators will try to ban open models because the can’t possibly control them. That wouldn’t actually work, of course, but it might sound good enough for an election campaign, and I’m sure Microsoft and Google would dump a pile of cash on their doorstep for it.

@[email protected] · 5 months ago

I would be in favor of open models actively defaulting to watermarking by default if they could make unobtrusive. An opt out watermarking scheme honestly would handle most bad actors to me tbh.

Bad guys are normally not the brightest. Yes even big expensive bad actors.

Making it the norm also makes it a red flag if content has other markers but seems intentionally obfuscated.