A great use for reddit is the ability to search posts and opinions about any niche topic. Will that be possible with Lemmy as it grows? Will I be able to Google “instant rice Lemmy” and get a comprehensive tier list of each brand?
I imagine search engines will have trouble with all the different instances(?). EDIT: Especially with instances that don’t have Lemmy in their name, I don’t think search engines would return them for Lemmy searches?
So I’ve been working on a solution for this.
As I see it Google and others are going to have a hard if not impossible time to incorporate the fediverse, and the fact that the same content can exist on multiple servers.
So I’m working on a search engine specifically build, for Lemmy at least. Where it’ll take you to whatever your preferred instance is when tapping on a search result.
I hope to have a MVP up and running in a few more days.
Can’t emphasize enough how important this is for the growth of Lemmy. Many people I know only access Reddit through google searches.
Yep and I’m one of them. Go look me up on Reddit and I think I have maybe 20 posts over the 14+ years I was on the site. …joined Lemmy and immediately got frustrated that I couldn’t find anything. So I figured I take a crack at it. Especially since I couldn’t see how Google would ever be able to link me to my instance. Let alone make it easy to search the entire fediverse without having to write out every possible site, with new ones popping up every day.
I wonder if it’s possible to have a sophisticated search engine similar to Google’s, with BERT and kNN or vice versa. It would be the closest thing to Google search but specifically for Lemmy posts.
Easier to find a Reddit post through Google than by Reddit search.
Please pop a reminder here. Commenting for a bump.
Search their name on GitHub and you’ll find it. Star it to follow.
Can the reminder bots be migrated too?
reminder: https://lemmy.world/post/963301
If this guy changes the internet include me in the screenshot.
I’ll invest in seed funding stage. 😂
Interesting. I hadn’t even thought about how the fact that instance1.[post] and instance2.[post@instance1] is essentially the same thing and how search engines would handle it. Interested in what you come up with!
Thanks. If you do some digging you can find the project on GitHub but note that it’s a work in progress still. The UI is lacking and it’s rough around the edges but it’s “working”. And I still need to do some optimizations on the crawler itself, etc…
It’s also going to be completely self-hostable just like Lemmy, etc…
Hey, can you dm me the git link, i would like to contribute if i can : )
Search their name on GitHub
Do you mean the ‘crawler’ repo?
probably, that’s pretty much what a search engine is. A crawler that saves it’s results into an optimized database, to make it easily searchable.
I can see this being helpful
(commenting so I can bookmark)
IDK, isn’t it the same for reddit? It also encourages crossposting, so the same content is on there several times. Maybe I don’t understand the fediverse well enough yet, so please correct me if I’m wrong.
deleted by creator
The mastodon crowd was verry anti on search engines, and killed projects like this.
But yea, do it! I think the lemmy/kbin crowd would mostly like it
Ya I only index Lemmy instances, for now. Mastodon and other ActivityPub servers may be in the future, but ActivityPub has some limitations so I’m stuck using the Lemmy-specific APIs.
we need a search engine for the entire fediverse, it would be a game changer
That is great. Thanks for the initiative. Have you considered contacting the people at DuckDuckGo so that that search engine can access Lemmy/Kbin content?
That sounds awesome. Can’t wait to see it.
This is a great project
deleted by creator
deleted by creator
This would be awesome!
Lmk once it’s up
Shit dude, that’d be a sweet little tool.