• @Hackworth@lemmy.world
    link
    fedilink
    English
    15 months ago

    Humans are really bad at determining whether a chat is with a human or a bot

    Eliza is not indistinguishable from a human at 22%.

    Passing the Turing test stood largely out of reach for 70 years precisely because Humans are pretty good at spotting counterfeit humans.

    This is a monumental achievement.

    • @dustyData@lemmy.world
      link
      fedilink
      English
      0
      edit-2
      5 months ago

      First, that is not how that statistic works, like you are reading it entirely wrong.

      Second, this test is intentionally designed to be misleading. Comparing ChatGPT to Eliza is the equivalent of me claiming that the Chevy Bolt is the fastest car to ever enter a highway by comparing it to a 1908 Ford Model T. It completely ignores a huge history of technological developments. There have been just as successful chatbots before ChatGPT, just they weren’t LLM and they were measured by other methods and systematic trials. Because the Turing test is not actually a scientific test of anything, so it isn’t standardized in any way. Anyone is free to claim to do a Turing Test whenever and however without too much control. It is meaningless and proves nothing.