Lemmings.world
  • Communities
  • Create Post
  • Create Community
  • heart
    Support Lemmy
  • search
    Search
  • Login
  • Sign Up
David Gerard@awful.systemsM to TechTakes@awful.systemsEnglish · 3 months ago

a handy list of LLM poisoners

tldr.nettime.org

external-link
message-square
8
link
fedilink
122
external-link

a handy list of LLM poisoners

tldr.nettime.org

David Gerard@awful.systemsM to TechTakes@awful.systemsEnglish · 3 months ago
message-square
8
link
fedilink
ASRG (@asrg@tldr.nettime.org)
tldr.nettime.org
external-link
Attached: 1 image Sabot in the Age of AI Here is a curated list of strategies, offensive methods, and tactics for (algorithmic) sabotage, disruption, and deliberate poisoning. 🔻 iocaine The deadliest AI poison—iocaine generates garbage rather than slowing crawlers. 🔗 https://git.madhouse-project.org/algernon/iocaine 🔻 Nepenthes A tarpit designed to catch web crawlers, especially those scraping for LLMs. It devours anything that gets too close. @aaron@zadzmo.org 🔗 https://zadzmo.org/code/nepenthes/ 🔻 Quixotic Feeds fake content to bots and robots.txt-ignoring #LLM scrapers. @marcusb@mastodon.sdf.org 🔗 https://marcusb.org/hacks/quixotic.html 🔻 Poison the WeLLMs A reverse-proxy that serves diassociated-press style reimaginings of your upstream pages, poisoning any LLMs that scrape your content. @mike@mikecoats.social 🔗 https://codeberg.org/MikeCoats/poison-the-wellms 🔻 Django-llm-poison A django app that poisons content when served to #AI bots. @Fingel@indieweb.social 🔗 https://github.com/Fingel/django-llm-poison 🔻 KonterfAI A model poisoner that generates nonsense content to degenerate LLMs. 🔗 https://codeberg.org/konterfai/konterfai
alert-triangle
You must log in or register to comment.
  • Gigliorananomicom@sh.itjust.works
    link
    fedilink
    English
    arrow-up
    11
    ·
    3 months ago

    Doing God’s work 🙏

  • BlueMonday1984@awful.systems
    link
    fedilink
    English
    arrow-up
    11
    ·
    3 months ago

    I do feel like active anti-scraping measures could go somewhat further, though - the obvious route in my eyes would be to try to actively feed complete garbage to scrapers instead - whether by sticking a bunch of garbage on webpages to mislead scrapers or by trying to prompt inject the shit out of the AIs themselves.

    Me, predicting how anti-scraping efforts would evolve

    (I have nothing more to add, I just find this whole development pretty vindicating)

  • arsCynic@beehaw.org
    link
    fedilink
    English
    arrow-up
    6
    ·
    3 months ago

    Stupidly trivial question probably, but I guess it isn’t possible to poison LLMs on static websites hosted on GitHub?

    • Luna@lemdro.id
      link
      fedilink
      English
      arrow-up
      8
      ·
      3 months ago

      You can make a page filled with gibberish and have a display: none honeypot link to it inside your other pages. Not sure how effective would that be though

    • -dsr-@awful.systems
      link
      fedilink
      English
      arrow-up
      6
      ·
      3 months ago

      Sure, but then you have to generate all that crap and store it with them. Preumably Github will eventually decide that you are wasting their space and bandwidth and… no, never mind, they’re Microsoft now. Competence isn’t in their vocabulary.

  • o7___o7@awful.systems
    link
    fedilink
    English
    arrow-up
    6
    ·
    edit-2
    3 months ago

    The kids are going through an Adventure Time phase, and so I am reminded of this:

    https://www.youtube.com/embed/IbZJ1PeFLGU?start=33&end=70

  • rook@awful.systems
    link
    fedilink
    English
    arrow-up
    6
    ·
    3 months ago

    Additionally, https://xeiaso.net/blog/2025/anubis/

    Some of this stuff could be conceivably implemented as an easy-to-consume service. It would be nice if it were possible to fend off the scrapers without needing to be a sysadmin or, say, a cloudflare customer.

    (Whilst I could be either of those things, unless someone is paying me I would very much rather not)

    • Optional@lemmy.world
      link
      fedilink
      English
      arrow-up
      5
      ·
      3 months ago

      A WP plugin would be handy.

TechTakes@awful.systems

techtakes@awful.systems

Subscribe from Remote Instance

Create a post
You are not logged in. However you can subscribe from another Fediverse account, for example Lemmy or Mastodon. To do this, paste the following into the search field of your instance: !techtakes@awful.systems

Big brain tech dude got yet another clueless take over at HackerNews etc? Here’s the place to vent. Orange site, VC foolishness, all welcome.

This is not debate club. Unless it’s amusing debate.

For actually-good tech, you want our NotAwfulTech community

Visibility: Public
globe

This community can be federated to other instances and be posted/commented in by their users.

  • 536 users / day
  • 1.52K users / week
  • 1.95K users / month
  • 5.06K users / 6 months
  • 10 local subscribers
  • 1.83K subscribers
  • 850 Posts
  • 23.5K Comments
  • Modlog
  • mods:
  • David Gerard@awful.systems
  • BE: 0.19.11
  • Modlog
  • Instances
  • Docs
  • Code
  • join-lemmy.org