PSA: The Lemmy federation convention of hotlinking images to other peer federation servers makes it easy for a rogue instance to collect end-user IP addresses & browser strings, don't assume otherwise

RoundSparrow · edit-2 2 years ago

PSA: The Lemmy federation convention of hotlinking images to other peer federation servers makes it easy for a rogue instance to collect end-user IP addresses & browser strings, don't assume otherwise

Dick Justice · 2 years ago

Is rhere anything end users can do about it, or is it a choice between using the fediverse or not using the fediverse?

RoundSparrow · 2 years ago

At this point, raise awareness that it isn’t like a major identifiable social media site who has a reputation to worry about. Right now, anyone can create a Lemmy instance and join the federation, there is no approval or application process.

Dick Justice · 2 years ago

I agree 100%, people need to know what they aare using and doing, especially less savvy or less curious users who wont seek out this kind of information on their own.

Im asking though, is their anything that can be done by the user themselves to obscure their IP when loading images, or is it just something that has to be accepted as part of using the fediverse if you want to use it, and thats just that.

RoundSparrow · edit-2 2 years ago

Some people use VPN… and every day they are getting a different IP address when they connect to their VPN…

It’s normal, what’s ‘new’ to people is that the images are loaded from peer. Especially coming from Reddit (where it uses thumbs.redditmedia.com), that is different from what most people expect. They assume you have to click a link to go off-site. “hotlinking” the images as Lemmy is typically doing right now involves more parties.

static · 2 years ago

It’s a lemmy thing, kbin appears not to hotlink. and mastodon allso doesn’t hotlink.

@[email protected] · 2 years ago

Use a trust worthy VPN or Tor. Nothing is perfect. But one of those would help.

earthling · 2 years ago

The Mullvad Browser. Think of it like the Tor Browser without Tor. That combined with Mullvad’s VPN service would address both of the legitimate issues brought up in the OP.

@kadu · 2 years ago

deleted by creator

RoundSparrow · 2 years ago

Have a basic firewall enabled.

This isn’t about a firewall. This is about some image hoster figuring out and keeping records of your usage times, location tracking, for your account that comments and posts.

Lemmy is at a point where typical new-user sign-up had no terms of service, privacy policy, cookie policy - people need to understand that this is not a hardened and mature platform. There could be a lot of things that people didn’t expect - that a major corporation like Reddit or Twitter would be scrutinized more closely over.

finn · 2 years ago

Sharing IP on a message board is not optimal if it can be avoided. I’m curious if the solution is to serve content through the users current instance? That would have major implications for performance.

@[email protected] · 2 years ago

GDPR believes an IP address is a private information. This can be used to mount a legal attack on EU-hosted lemmy instances.

Lenins2ndCat · edit-2 2 years ago

If IP address sharing via hotlinked images, embedded content, etc were breaking GDPR I think the entire internet is breaking it. If I visit a blog, and then click an embedded video or image on that page, then my IP has been shared to someone else while visiting that page. This occurs on the vast majority of the internet.

EDIT: It wouldn’t just be EU-hosted lemmy instances either. GDPR applies to servers outside of EU jurisdiction whenever they’re serving residents of the EU.

@[email protected] · 2 years ago

Indeed. Most of the web is broken under GDPR’s privacy requirements.

@[email protected] · 2 years ago

I’d imagine that applies to a website sharing or selling logs of user ips that visited, if you go hit some other server, you give them your ip so they can talk back to you. You’re basically doing this on almost every website because almost all will use third party cdns like fonts.google.com or others for Javascript libraries etc. If linking content from another site is the actual interpretation, the modern web basically can’t legally exist in the EU.

@styraco · 2 years ago

Third party cdns like google fonts are a problem under GDPR: https://www.theregister.com/2022/01/31/website_fine_google_fonts_gdpr/

@[email protected] · 2 years ago

Most EU legislators don’t understand how the internet works, so that’s not really a surprise.

I’m not saying that small organizations shouldn’t comply with their laws, but those laws shouldn’t be used to judge other practices.

RoundSparrow · 2 years ago

Now that is purely passive, just data collection from routine serving of images.

A rogue site could start serving crafted images to confuse users or cause NSFW content to appear on all other Lemmy sites where users were browsing a meme.

When you have small-time image hosts being used by the dozens, they likely aren’t as worried about their reputation being burned by pranks like this.

Ulu-Mulu-no-die · 2 years ago

I know very little about networks, but aren’t end-users IP addresses dynamically assigned by their Internet providers? Don’t providers use NAT systems so the public IP you see is not actually the one assigned to someone’s “house”?

RoundSparrow · 2 years ago

Don’t providers use NAT systems so the public IP you see is not actually the one assigned to someone’s “house”?

Typically NAT ends at the building. So a coffee shop it is going to use private addresses for every person on WiFi inside the coffee shop, but the servers are still going to get the single IP address of that building.

You can see what a typical database might say about your IP address: https://www.iplocation.net/ and compare with another service: https://iplocation.com/

Ulu-Mulu-no-die · edit-2 2 years ago

Thanks, iplocation.net has a lot more detail than iplocation.com, but they both show exactly the same public IP for me (that is my provider), were they supposed to be different on the 2 sites?

RoundSparrow · edit-2 2 years ago

hey both show exactly the same public IP

Your Public IP won’t change, but the different databases guessing where that is, interpretation changes.

It was coincidence that they had similar names, I went through the first 5 ot 6 I found on a Google Search and picked two that had different actual information.

With the same IP, the first site thinks I am 2 states away in the USA (1300 miles wrong), the 2nd site shows exactly where I am. Almost all the sites show me 2 states away, but iplocation.com must have had someone identify the hotel I am at.

Ulu-Mulu-no-die · 2 years ago

Oh I see, in my case both show the same location, probably because I’m at home and what they show is my internet provider, it’s probably easy to know exactly where providers actually are.

@[email protected] · 2 years ago

Your ISP has assigned that IP to you. It may be temporary for anywhere from a month or so to a year. But either way that is the IP sites will see when you visit a web site or view the images OP is talking about. It can fingerprint you to a degree. And ISPs can and do keep track of who they have assigned ip addresses to.

@[email protected] · 2 years ago

While most residential ISP customers are dynamically assigned a public IP address, those addresses usually don’t change unless you reset (not just restart) your router. My “dynamic” IP address has been the same for over a year (ever since I switched to my current ISP).

As the other person said, that address is then shared among all devices connected to your router. Even most IPv6 networks (without NAT) share the same address prefix.

RGB · 2 years ago

This applies to any website. by visiting the website you give them your IP. the only way to mitigate this is to use a VPN.

Kushan · 2 years ago

Big instances surfing up content from smaller instances is invariably going to cripple them unless larger instances start locally caching that content.

@[email protected] · 2 years ago

A “solution” to this might be for lemmy instances/apps to have the option to “load remote images” the way e-mail providers do. So I have to actively click to load the images. Could even be done per subscription, so I could declare that I just want everything loaded when I see lemmy.ml/c/pics but not in general, where I usually want text.

@[email protected] · 2 years ago

IP Address isn’t extremely precise, and I always run on the latest version of whatever browser I’m on, be it Safari or Edge.

neo (he/him) · 2 years ago

You can trust that I’m too incompetent to actually access that data in any meaningful way, but idk about some of you real tech knowers.

@GlitzyArmrest · 2 years ago

Ever heard of a VPN?

RoundSparrow · edit-2 2 years ago

If you use a VPN, the analysis of timestamp probability in logs is still going to reveal that you are likely the user who made x comment at y time. Another lemmy instance you never signed up at can start to look in logs for your comment id: #734995 and cross-reference it to image loading.

@GlitzyArmrest · 2 years ago

This can happen with literally any web host. It’s not something exclusive to Lemmy, I’m not sure why you’re acting like it is?