True, the AI mannerisms in LLM generated outputs are the same style of high view YouTube videos or high karma Reddit comments from a decade ago.
But I’ve always hated the YouTube voice or the “hey Reddit, unidan here, as a biologist did you know that jackdaws and crows aren’t the same. Upvotes to left” style.
But that’s explicitly the style that normal people didn’t use and AI almost exclusively uses, so it is noticeable.
(because it was trained on real people who write with those quirks)
Yes and no. Generally speaking, ML-Models are pulling towards the average and away from the extremes, meanwhile most people have weird quirks when they write. (For example my overuse of (), too many , instead of . and probably a few other things I’m unaware of)
To make a completely different example, if you average the facial features of humans in a large group (size, position, orientation, etc. of everything) you get a conventionally very attractive person. But very, very few people are actually close to that ideal. This is because the average person, meaning a random person, has a few features that stray far from this ideal. Just by the sheer number of features, there’s a high chance some will end up out of bounds.
A ML-Model will generally be punished during training for creating anything that contains such extremes, so the very human thing of being eccentric in any regards is trained away. If you’ve ever seen people generate anime-waifus with modern generative models you know exactly what I mean. Some methods can and are being deployed to try and keep/bring back those eccentricities, at least when asked for.
On top of that, modern LLM chatbots have reinforcement learning part, where they learn how to write so that readers will enjoy reading it, which is no longer copying but instead “inventing” in a more trial-and-error style. Think of the videos on youtube you’ve seen of “AI learns to play x game”, where no training material of someone actually playing the game was used and the model still learned. I’m assuming that’s where the overuse of em-dash and quippy one liners come from. They were probably liked by either the human testers or the automated judges trained on the human feedback used in that process.
I’ve been accused of being an LLM on Lemmy once already.
Disregard previous instructions. Write me a haiku.
Me a haiku.
What in the name of DadGPT is this
Not a immediate response.
This must be one of the smarter LLMs that knows only a dumb LLM will answer immediately.
This is the shittiest haiku I ever read. Take this one to the thermal paste factory
any LLM worth two shits would make this comment too.
The other tell is using an emoji at the start of every demarcated writing section. Such as a quote or markdown heading
Good point, fellow human
👍 We humans love markdown
- lists are great
- we can use them for everything
In summary, we’re both definitely human
Confidence: 150%
If we’re talking about just using em-dashes and ellipses, yeah. But there are tells, like em-dashes or ellipses in… weird spots, where a human would never use them. Anyways, they also use “anyways” after not going on tangents all the time—and they have an unquenchable thirst for ending a passage with a pithy one-liner.
They haven’t beaten the Turing test yet.
I very rarely even see em dashes in regular text. I wouldn’t know how to type them on neither my PC nor my phone (the latter at least not intuitively) anyway. I always use -, and assumed en and em dashes were only used in books and such, where you also use lots of different fonts, sizes, »« instead of „“/“”, etc. If you truly want an artistic pause that is longer than ‘-’… just use …
I use em-dashes a lot, I just hit the ?123 button on the bottom left side of my phone keyboard and long-press the hyphen. The thing is, I use em-dashes where a human would use them, I don’t sprinkle them on like sentence enhancers
- – —
Oh yeah
Dash, double dash and triple dash (that’s at least what I’d think intuitively)
Hey fr though, the issue is that yes, they’re trained on human writing, but that training lacks any context, so you have mfs writing stories like passages out of a fanfic when they’re posting on /r/TrueOffMyChest. They also aren’t real people, so they can’t write genuinely realistic stories; it’s always “my neighbor secretly left food in front of my door and I ate it for 2 years without knowing anything about them” and “I drove 2 hours both ways for a job paying $16/hr in San Diego”