I'm about to release (this week or the next, not much later) a search engine for the Fediverse (including Mastodon, Pleroma, PeerTube, etc.).
If you want to know more about this project, feel free to subscribe to my account here.
I will publish most of the source code under the GPL license, and translation help will be appreciated! See you then!
@archeodon > most source code?
@devnull yeah, there are a few small parts of the indexer I'm not sure I will publish right away:
1. because it's damn ugly
2. because it may trigger unexpected reactions from people managing some instances (although I respect strict rate limits and robots noindex rules...)
@archeodon So, basically, you're expecting people to blindly trust unknown code (that might trigger some people)?
@devnull I don't know what you mean by "blindly trust" or "trigger some people"
@archeodon Blindly trust -> Trust unpublished source code
"trigger some people" -> I was referring to your own statement "because it may trigger unexpected reactions from people managing some instances". You know better than I do what it means.
@devnull
oh, this one :)
In the past I worked for a search engine that indexed websites such as Craigslist,
and people who manage such websites have mixed feelings about indexers...
(even if we respect robots.txt)
I'd rather they use a standard protocol (robots.txt or meta tags) than try to block my bots.
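For readers wondering what "respecting robots.txt or meta tags" can look like in practice, here is a minimal sketch in Python using only the standard library. It is not the indexer discussed in this thread; the bot name `ExampleFediBot`, the host `example.social`, and the helper names `may_fetch` and `may_index` are all hypothetical, chosen for illustration only.

```python
from html.parser import HTMLParser
from urllib.robotparser import RobotFileParser


class RobotsMetaParser(HTMLParser):
    """Collects directives from <meta name="robots" content="..."> tags."""

    def __init__(self):
        super().__init__()
        self.directives = set()

    def handle_starttag(self, tag, attrs):
        if tag != "meta":
            return
        attr = {name: (value or "") for name, value in attrs}
        if attr.get("name", "").lower() == "robots":
            for directive in attr.get("content", "").split(","):
                self.directives.add(directive.strip().lower())


def may_fetch(robots_txt, user_agent, url):
    """Return True if the given robots.txt text lets user_agent fetch url."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


def may_index(html):
    """Return False if the page asks not to be indexed via a robots meta tag."""
    meta = RobotsMetaParser()
    meta.feed(html)
    return "noindex" not in meta.directives and "none" not in meta.directives


# Hypothetical robots.txt and page, as an instance admin might publish them.
robots_txt = "User-agent: *\nDisallow: /private/\n"
page = '<html><head><meta name="robots" content="noindex, nofollow"></head></html>'

fetch_public = may_fetch(robots_txt, "ExampleFediBot", "https://example.social/public")
fetch_private = may_fetch(robots_txt, "ExampleFediBot", "https://example.social/private/x")
index_page = may_index(page)
```

With both checks in place, an admin who wants to opt out only has to publish a `Disallow` rule or a `noindex` meta tag, rather than blocking the bot at the network level.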
@devnull
as for the "trust unpublished source code" part, I'm only talking about some of the bots for now,
the search part will be published, for sure, as will some of the indexing bots.
But you can't trust any code running on a remote server anyway, right?
@archeodon > But you can't trust any code running on a remote server anyway, right?
Of course not, who knows what code the remote server actually runs.