A new web crawler launched by Meta last month is quietly scraping the web for AI training data

lemme in@lemm.ee · 1 year ago

Aniki 🌱🌿 · 1 year ago

Mate, we have an absurdly restrictive robots.txt, including a custom WordPress plugin that automatically generates the file, and the bots don’t give a fuck.

GarrulousBrevity@lemmy.world · 1 year ago

But Meta’s will, and so will AltaVista’s. I’m not angry at them when a script kiddie makes a bad crawler.