@brbposting@sh.itjust.works to

Technology@lemmy.worldEnglish • 5 months ago

ChatGPT outperforms undergrads in intro-level courses, falls short later

arstechnica.com

188

ChatGPT outperforms undergrads in intro-level courses, falls short later

arstechnica.com

@brbposting@sh.itjust.works to

Technology@lemmy.worldEnglish • 5 months ago

Software that promises to detect AI-produced text fails to deliver.

Researchers create 30 fake student accounts to submit model-generated responses to real exams. Professors grade the 200 or 1500 word responses from the AI undergrads and gave them better grades than real students 84% of the time. 6% of the bot respondents did get caught, though… for being too good. Meanwhile, AI detection tools? Total bunk.

Will AI be the new calculator… or the death of us all (obviously the only alternative).

Note: the software was NOT as good on the advanced exams, even though it handled the easier stuff.

Chat

vortic
link
fedilink
English
17•5 months ago
I wonder how undergrads would do on the same exams given unlimited time and internet access but with LLMs blocked. That’s essentially what the LLMs have.
- @technocrit@lemmy.dbzer0.com
  link
  fedilink
  English
  2•5 months ago
  The LLMs blocked themselves?
  - vortic
    link
    fedilink
    English
    1•5 months ago
    I don’t think they really query one another. Maybe they do though?

Technology@lemmy.world

!technology@lemmy.world

You are not logged in. However you can subscribe from another Fediverse account, for example Lemmy or Mastodon. To do this, paste the following into the search field of your instance: !technology@lemmy.world

This is a most excellent place for technology news and articles.

Our Rules

Follow the lemmy.world rules.
Only tech related content.
Be excellent to each another!
Mod approved content bots can post up to 10 articles per day.
Threads asking for personal tech support may be deleted.
Politics threads may be removed.
No memes allowed as posts, OK to post as comments.
Only approved bots from the list below, to ask if your bot can be added please contact us.
Check for duplicates before posting, duplicates may be removed

Approved Bots

3.25K users / day
8.84K users / week
17.1K users / month
34.7K users / 6 months
59.5K subscribers
12.5K Posts
544K Comments
Modlog