[[email protected]](https://lemm.ee/u/LWD) presents information relating to accusations of source obfuscation around rising search engine Kagi

@IncogCyberspaceUser · edit-2 4 days ago

[[email protected]](https://lemm.ee/u/LWD) presents information relating to accusations of source obfuscation around rising search engine Kagi

@asap · 4 days ago

What’s the issue with this? API calls to other search engines are anonymised, and naturally I want Kagi to search as many sources as possible.

So what’s the issue?

You don’t need Wayback machine to find their old page, it’s all open on Github: https://github.com/kagisearch/kagi-docs/commit/6baff1c066db9b3d804653ea19bc9d1c076a710b

I don’t see any “conspiracy” removing an itemised list - it’s just something which is a pain to keep up-to-date over time. Better to say “all major search engines worldwide”, which is what they’re now saying.

Again, I don’t really see any issue here…

@[email protected] · 2 days ago

So what’s the issue? You don’t need Wayback machine to find their old page…

The issue is that the page is old and hidden! Normal people don’t hunt through repositories, let alone archives. It is extremely unreasonable to assume someone should do this.

Kagi Corp should provide this information transparently, not respond with mild hostility when that information is politely requested. In addition, they should keep it up to date! Acting cagey with users and hiding data isn’t how an honest corporation should behave!

@asap · edit-2 2 days ago

The topic is “using Yandex”. What’s the issue with that? I wasn’t asking what is the issue with changing the wording on some random page 🤦

@[email protected] · edit-2 2 days ago

The topic is “using Yandex”.

No it was not. Read the post. The topic was obfuscation.

That obfuscation did come from an interlinked issue (Vladimir Prelovac getting caught paying Yandex for search data), but this year that concerned me was Kagi hiding this information from the community.

Do we not agree that being cagey about things and hiding once-public information is a red flag when it comes to privacy?

@asap · 21 hours ago

As I said in my original comment, I think it makes perfect sense to replace an itemised list - which needs to be constantly updated - with a generic “all major search engines” which covers everything.

If Yandex being removed isn’t an issue, then I’m not sure what could be termed as Proton being cagey.

For me, from a privacy perspective as a user it’s largely irrelevant what third-parties they use on the backend as long as my searches stay private.

Adding the statistics for third parties to their stats page would be neat from the user perspective, but I can’t imagine what value there would be in publishing that information from Kagi’s perspective.

@[email protected] · 21 hours ago

Let’s agree to disagree, then. I also disagree on the framing: you call it inconvenient, I call it a clear step backwards in terms of transparency. We can even compromise: CEO Vladimir Prelovac can simply update the list when he finds time. (He can even take time off from his occasional searches for online critics!)

For me, from a privacy perspective as a user it’s largely irrelevant what third-parties they use on the backend as long as my searches stay private.

This is largely circular logic. “It’s irrelevant what Kagi is doing, as long as I can trust what Kagi is doing.” You should never trust an opaque company. A company that becomes more opaque upon scrutiny is not a trustworthy company.

@asap · 20 hours ago

This is largely circular logic. “It’s irrelevant what Kagi is doing, as long as I can trust what Kagi is doing.”

The problem is that you can never know what software a company is running in production. So for any service you don’t host yourself, at some point you have to just cross your fingers and hope.

We can certainly agree to disagree. I don’t encourage you to use Kagi - quite the opposite, I would say it is a terrible fit.

Evkob (they/them) · edit-2 4 days ago

I truly don’t understand the hype around Kagi. Maybe I’m just too wrapped-up in my own little FOSS and privacy world, but paying for a closed-source solution that ties you into an account just doesn’t make sense to me.

For basically as much money as an unlimited Kagi plan, you can rent a small VPS to host Searxng and a VPN service to run it behind and you’d have a much more privacy-respecting search solution, plus a VPS to play with. Kagi just seems like such poor value for your money. Granted, I’ve never used it, but I can’t imagine its results could realistically be good enough to justify paying 10$/month.

@[email protected] · 2 days ago

I think I do. Kagi generates hype by being a paid service.

Typically, people don’t think about their search engine. They have it set up and they just use it. Kagi requires conscious thought and conscious payment.
Because they have already “bought in” to Kagi, their users will then consciously consider using anything else - after all, they don’t want to abandon a service they paid for!

This kind of mentality is similar to how many paid services, like Amazon Prime, capture customers. Once you have bought into the monthly subscription, it becomes a mental default. Why shop around when you’ve already sunk some money into Prime?

@asap · edit-2 4 days ago

Granted, I’ve never used it

This is probably the reason you don’t get the value.

I do rent multiple VPS’s and I have run Searxng, and for me Kagi is worth paying for. It’s certainly not worth it for everybody though.

Regarding the account, I used an anonymous email and paid with an anonymous method. Perhaps they could assemble my identity based on my searches, but considering the new support for Privacy Pass it appears they are walking their talk.

Evkob (they/them) · 4 days ago

Care to share what makes it worth it for you? I’m honestly curious.

@asap · edit-2 4 days ago

The results, first and foremost. When I’m getting better results it saves me time and it’s worth it to me.
Site ranking up/down/block
Removal of AI generated images from image searches
Custom “lenses” mean that you can create subsets of the internet to search on
The filtering options
Modification of result URLS. For example I rewrite all Reddit links to old.reddit.com - ^https://(www.|)reddit.com|https://old.reddit.com
AI model integration (this one is definitely going to be polarising…). I’m already paying for Kagi, and this lets me query a stack of different models at no extra cost.
Custom CSS (a minor thing, and sure I could use a browser extension, but this way it’s consistent on all my devices with no extra effort)
Compared to Searxng, other than the results being better, it’s the ease of configuration. I can’t be bothered trying to tweak Searxng through YAML when I could just get exactly what I need OOTB.

There’s probably more that I’ve simply gotten used to over time, but that’s a few off the top of my head.

Evkob (they/them) · 4 days ago

Thanks for the overview! I appreciate it, and can definitely see how these features could be worth the investment depending on your use case. I guess I’ve mostly seen Kagi pushed as a privacy-respecting option, which doesn’t appeal to me considering the alternatives such as Searxng.

Feature-wise, I can definitely see why some people would prefer Kagi.

@asap · edit-2 3 days ago

I’ve mostly seen Kagi pushed as a privacy-respecting option

Interesting. If that’s what your primary goal is, then absolutely I think Searxng would be better. Kagi would more aptly be thought of as “better search” - as in rather than being the product (a la Google), you pay for a better product.

setVeryLoud(true); · 4 days ago

I’m a Kagi user, and basically all of this.

The experience is far better than any free search engine for me, and the “summarize search” feature seems to be factual and complete, compared to Google’s Gemini feature that gives you 5 words that then turn out to be wrong or dangerous.

@IncogCyberspaceUser · edit-2 4 days ago

I thought this was a good presentation of concerns surrounding Kagi, a search engine I’ve heard championed as a great Google replacement, one that I’m definitely interested in using. But I’ve also seen many that critize it for a number of reasons. So I wanted to see if others wanted to share their perspective. Thanks to LWD for laying out what he did.