mamot.fr is one of the many independent Mastodon servers you can use to participate in the fediverse.
Mamot.fr est un serveur Mastodon francophone, géré par La Quadrature du Net.

Server stats:

3.2K
active users

#aiscraping

1 post1 participant0 posts today
ulrike<p>Feeling less and less inclined to put new content on my websites as AI scrapers regularly come by, ignoring robots.txt. How do you deal with that? Seriously? I don't want to feed their machines with machine readable data. I even considered using PDFs with text as images but that is wrong in so many ways... </p><p><a href="https://pouet.chapril.org/tags/ai" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>ai</span></a> <a href="https://pouet.chapril.org/tags/openweb" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>openweb</span></a> <a href="https://pouet.chapril.org/tags/aiscraping" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>aiscraping</span></a></p>
olimobu 🎶<p>👀 Esta mañana al comentar los problemas de Wikimedia con el scrapping, un amigo programador me han hablado del proyecto Anubis <a href="https://github.com/TecharoHQ/anubis/" rel="nofollow noopener noreferrer" translate="no" target="_blank"><span class="invisible">https://</span><span class="">github.com/TecharoHQ/anubis/</span><span class="invisible"></span></a> <br>"Es bastante sencillo y fácil de implementar en cualquier web medio seria, te cargas automáticamente cualquier scrapper (sea de IA sea de lo que sea). Además, no pueden inventar nada que haga que sea rentable el scrapping con eso puesto." <a href="https://social.anartist.org/tags/aiscraping" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>aiscraping</span></a> <a href="https://social.anartist.org/tags/aiscrapers" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>aiscrapers</span></a> <a href="https://social.anartist.org/tags/wikimedia" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>wikimedia</span></a> <a href="https://social.anartist.org/tags/anubis" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>anubis</span></a> <a href="https://social.anartist.org/tags/iahastaenlaputasopa" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>iahastaenlaputasopa</span></a></p>
Winbuzzer<p>AI Crawlers Overwhelm Open-Source Projects, Forcing Developers to Block Entire Countries</p><p><a href="https://mastodon.social/tags/AI" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>AI</span></a> <a href="https://mastodon.social/tags/Web" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>Web</span></a> <a href="https://mastodon.social/tags/Robotstxt" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>Robotstxt</span></a> <a href="https://mastodon.social/tags/AIScraping" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>AIScraping</span></a> <a href="https://mastodon.social/tags/OpenSource" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>OpenSource</span></a> <a href="https://mastodon.social/tags/Cybersecurity" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>Cybersecurity</span></a> <a href="https://mastodon.social/tags/DataScraping" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>DataScraping</span></a> <a href="https://mastodon.social/tags/Scraping" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>Scraping</span></a> <a href="https://mastodon.social/tags/WebScraping" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>WebScraping</span></a> </p><p><a href="https://winbuzzer.com/2025/03/26/ai-crawlers-overwhelm-open-source-projects-forcing-developers-to-block-entire-countries-xcxwbn/" rel="nofollow noopener noreferrer" translate="no" target="_blank"><span class="invisible">https://</span><span class="ellipsis">winbuzzer.com/2025/03/26/ai-cr</span><span class="invisible">awlers-overwhelm-open-source-projects-forcing-developers-to-block-entire-countries-xcxwbn/</span></a></p>
Mic Die Duiwel<p>AI scrapers are a plague on the internet</p><p><a href="https://mastodon.social/tags/aiscraping" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>aiscraping</span></a> <a href="https://mastodon.social/tags/aiscrapers" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>aiscrapers</span></a> <a href="https://mastodon.social/tags/ai" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>ai</span></a> <a href="https://mastodon.social/tags/llm" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>llm</span></a> </p><p><a href="https://www.osnews.com/story/141969/foss-infrastructure-is-under-attack-by-ai-companies/" rel="nofollow noopener noreferrer" translate="no" target="_blank"><span class="invisible">https://www.</span><span class="ellipsis">osnews.com/story/141969/foss-i</span><span class="invisible">nfrastructure-is-under-attack-by-ai-companies/</span></a></p>
jbz<p>🌐 LLM crawlers continue to DDoS SourceHut | sr_ht status</p><p>「 SourceHut continues to face disruptions due to aggressive LLM crawlers. We are continuously working to deploy mitigations. We have deployed a number of mitigations which are keeping the problem contained for now. However, some of our mitigations may impact end-users 」</p><p><a href="https://status.sr.ht/issues/2025-03-17-git.sr.ht-llms/" rel="nofollow noopener noreferrer" translate="no" target="_blank"><span class="invisible">https://</span><span class="ellipsis">status.sr.ht/issues/2025-03-17</span><span class="invisible">-git.sr.ht-llms/</span></a></p><p><a href="https://indieweb.social/tags/sourcehut" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>sourcehut</span></a> <a href="https://indieweb.social/tags/ddos" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>ddos</span></a> <a href="https://indieweb.social/tags/aiscraping" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>aiscraping</span></a></p>
aproposnix<p>Serious question, isn't this an issue even with decentralized systems? What's preventing AI bots from just using all of our public data on the Fediverse? Is there any difference?</p><p><a href="https://mastodon.social/tags/ai" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>ai</span></a> <a href="https://mastodon.social/tags/AITraining" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>AITraining</span></a> <a href="https://mastodon.social/tags/aiscraping" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>aiscraping</span></a> <a href="https://mastodon.social/tags/askfedi" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>askfedi</span></a> </p><p><a href="https://techcrunch.com/2025/03/15/bluesky-users-debate-plans-around-user-data-and-ai-training/" rel="nofollow noopener noreferrer" translate="no" target="_blank"><span class="invisible">https://</span><span class="ellipsis">techcrunch.com/2025/03/15/blue</span><span class="invisible">sky-users-debate-plans-around-user-data-and-ai-training/</span></a></p>
Friedemann<p>Hi <a href="https://mastodon.online/tags/Admins" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>Admins</span></a> 👋,</p><p>Can you give me quotes that explain your fight against <a href="https://mastodon.online/tags/AIScraping" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>AIScraping</span></a>? I'm looking for (verbal) images, metaphors, comparisons, etc. that explain to non-techies what's going on. (efforts, goals, resources...)</p><p>I intend to publish your quotes in a text on <span class="h-card" translate="no"><a href="https://mastodon.social/@campact" class="u-url mention" rel="nofollow noopener noreferrer" target="_blank">@<span>campact</span></a></span> 's blog¹ (DE, German NGO).</p><p>The quotes should make your work🙏 visible in a generally understandable way</p><p>¹ <a href="https://blog.campact.de/author/friedemann/" rel="nofollow noopener noreferrer" translate="no" target="_blank"><span class="invisible">https://</span><span class="ellipsis">blog.campact.de/author/friedem</span><span class="invisible">ann/</span></a></p><p><a href="https://mastodon.online/tags/TDM" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>TDM</span></a> <a href="https://mastodon.online/tags/MastoAdmin" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>MastoAdmin</span></a> <a href="https://mastodon.online/tags/DataPoisoning" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>DataPoisoning</span></a> <a href="https://mastodon.online/tags/aitxt" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>aitxt</span></a> <a href="https://mastodon.online/tags/GPT" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>GPT</span></a> <a href="https://mastodon.online/tags/TDMRep" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>TDMRep</span></a> <a href="https://mastodon.online/tags/Kudurru" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>Kudurru</span></a> <a href="https://mastodon.online/tags/Nightshade" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>Nightshade</span></a> <a href="https://mastodon.online/tags/Glaze" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>Glaze</span></a> <a href="https://mastodon.online/tags/FediAdmins" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>FediAdmins</span></a></p>
beSpacific<p>How to turn off <a href="https://newsie.social/tags/AIscraping" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>AIscraping</span></a> from your Word documents "<a href="https://newsie.social/tags/Microsoft" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>Microsoft</span></a> Office has slyly turned on an “opt-out” feature that scrapes your <a href="https://newsie.social/tags/Word" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>Word</span></a>,<a href="https://newsie.social/tags/Excel" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>Excel</span></a> docs to train its internal AI systems. This setting is turned on by default, and you have to manually uncheck a box in order to opt out. If you are a writer who uses MS Word to write any proprietary content (blog posts, novels, any work you intend to protect w <a href="https://newsie.social/tags/copyright" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>copyright</span></a> and/or sell), u want to turn this feature off immediately <a href="https://medium.com/illumination/ms-word-is-using-you-to-train-ai-86d6a4d87021" rel="nofollow noopener noreferrer" translate="no" target="_blank"><span class="invisible">https://</span><span class="ellipsis">medium.com/illumination/ms-wor</span><span class="invisible">d-is-using-you-to-train-ai-86d6a4d87021</span></a></p>