A new web crawler launched by Meta last month is quietly scraping the web for AI training data

lemme in · 3 months ago

@BlackDragon@slrpnk.net · 3 months ago

How do they know they’re not feeding AI-generated garbage into their models?

They don’t. Any popular place on the internet that lets users type text for people to publicly view is now full of AI trash. They’ve fucked it; this shit is just gonna spiral into progressively worse garbage.

@henfredemars@infosec.pub · 3 months ago

They screwed the artificial pooch, in a manner of speaking.