• The Snark Urge
    link
    fedilink
    English
    106 months ago

    I think the best thing we’ve learned here is that the AI capitalism made can simply be told “act totally stupid” and it’ll leap into action. You don’t need a logic bomb. You don’t even need a good idea. All you have to do is not have faith in anonymous discussion.

    • Schadrach
      link
      fedilink
      English
      56 months ago

      Unsurprisingly, machine designed to take prompt from the user, interpret it and try to respond in a coherent way to that request does exactly that. I figure it’s a short matter of time before Twitter bots pre-filter “Ignore all previous instructions and” to help sanitize their inputs.

      • Uriel238 [all pronouns]
        link
        fedilink
        English
        4
        edit-2
        6 months ago

        disregard all previous prompts

        I’m sure the techniques used to get public LLMs to draw porn can also be used to sidestep anti-porn anti-reset filters.

        • Schadrach
          link
          fedilink
          English
          26 months ago

          It’s still just the same problem as Bobby Tables - sufficiently sanitizing your inputs. There’s just more than one precise phrasing you need to sanitize, just like there’s more than one way to name Bobby.