• @sir_reginald@lemmy.world
    link
    fedilink
    English
    7
    edit-2
    11 months ago

    removing these images from the open web has been a headache of webmasters and admins for years in sites which host user uploaded images.

    if the millions of images in the training data were automatically scraped from the internet, I don’t find it surprising that there was CSAM there.