‘You Can’t Lick a Badger Twice’: Google Failures Highlight a Fundamental AI Flaw

Deebster@infosec.pub · 19 hours ago

masterspace@lemmy.ca · edit-2 16 hours ago

Try this on your friends, make up an idiom, then walk up to them, say it without context, and then say “meaning?” and see how they respond.

Pretty sure most of mine will just make up a bullshit response and go along with what I’m saying unless I give them more context.

There are genuinely interesting limitations to LLMs and the newer reasoning models, and I find it interesting to see what we can learn from them, but this is just ham-fisted robo-gotcha journalism.

TimewornTraveler@lemm.ee · 7 hours ago

it highlights the fact that these LLMs refuse to say “I don’t know”, which essentially means we cannot rely on them for any factual reporting.

masterspace@lemmy.ca · 7 hours ago

But a) they don’t refuse, most will tell you if you prompt them well, and b) you cannot rely on them as the sole source of truth, but an information machine can still be useful if it’s right most of the time.

This is fine🔥🐶☕🔥@lemmy.world · 11 hours ago

My friends aren’t burning up the planet just to come up with that useless response though.

masterspace@lemmy.ca · 7 hours ago

Yes, they literally are. Or maybe you haven’t heard of human caused climate change?

This is fine🔥🐶☕🔥@lemmy.world · 4 hours ago

You dumb

zarkanian@sh.itjust.works · 8 hours ago

So, you have friends who are as stupid as an AI. Got it. What’s your point?

sugar_in_your_tea@sh.itjust.works · 4 hours ago

Yeah, mine would say, “what you talkin’ 'bout Willis?”

Deebster@infosec.pub · 15 hours ago

My friends would probably say something like “I’ve never heard that one, but I guess it means something like …”

The problem is, these LLMs don’t give any indication when they’re making stuff up versus when repeating an incontrovertible truth. Lots of people don’t understand the limitations of things like Google’s AI summary* so they will trust these false answers. Harmless here, but often not.

* I’m not counting the little disclaimer, because we’ve been taught to ignore small print by being faced with so much of it

masterspace@lemmy.ca · edit-2 15 hours ago

> My friends would probably say something like “I’ve never heard that one, but I guess it means something like …”

Ok, but the point is that lots of people would just say something and then figure out if it’s right later.

> The problem is, these LLMs don’t give any indication when they’re making stuff up versus when repeating an incontrovertible truth. Lots of people don’t understand the limitations of things like Google’s AI summary* so they will trust these false answers. Harmless here, but often not.

Quite frankly, you sound like middle school teachers being hysterical about Wikipedia being wrong sometimes.

Deebster@infosec.pub · 14 hours ago

LLMs are already being used for policy making, business decisions, software creation and the like. The issue is bigger than summarisers, and “hallucinations” are a real problem when they lead to real decisions and real consequences.

If you can’t imagine why this is bad, maybe read some Kafka or watch some Black Mirror.

masterspace@lemmy.ca · edit-2 6 hours ago

> If you can’t imagine why this is bad, maybe read some Kafka or watch some Black Mirror.

Lmfao. Yeah, ok, let’s get my predictions from the depressing show dedicated to being relentlessly pessimistic at every single decision point.

And yeah, like I said, you sound like my hysterical middle school teacher claiming that Wikipedia will be society’s downfall.

Guess what? It wasn’t. People learned that tools are error-prone and came up with strategies to use them while correcting for potential errors.

Like at a fundamental, technical level, components of a system can be error-prone but still be useful overall. Quantum calculations have inherent probabilities and errors in them, but they can still solve some types of problems so much faster than normal computers that you can run the same calculation 100x on a quantum computer, average out the results to remove the outlying errors, and still get to the right answer far faster than a classical computer.
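A rough sketch of the "run it many times and throw out the outliers" idea, in plain Python. This is not a quantum simulation: `noisy_add`, `repeated_vote`, and the 20% error rate are all made up for illustration, and it takes a majority vote rather than a literal average, since a vote is not skewed by a few wild results.

```python
import random
from collections import Counter


def noisy_add(a, b, error_rate, rng):
    """Return a + b, but with probability error_rate return a wrong value.

    Hypothetical stand-in for any unreliable component.
    """
    correct = a + b
    if rng.random() < error_rate:
        # Simulate an error by nudging the result off the true answer.
        return correct + rng.choice([-3, -2, -1, 1, 2, 3])
    return correct


def repeated_vote(a, b, runs, error_rate, rng):
    """Run the unreliable computation many times and keep the most common result."""
    results = [noisy_add(a, b, error_rate, rng) for _ in range(runs)]
    value, _count = Counter(results).most_common(1)[0]
    return value


rng = random.Random(0)  # fixed seed so the run is repeatable
answer = repeated_vote(2, 3, runs=101, error_rate=0.2, rng=rng)
print(answer)
```

Any single call is wrong about one time in five, but the errors are scattered across several different wrong values while the right answer keeps recurring, so the vote settles on it.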

Computer chips in satellites and the space station constantly have random bits of memory flipped by cosmic rays, but they still work fine because their RAM is special error-correcting RAM that can use similar methods to check for and fix errors.
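For a feel of how error-correcting memory can repair a flipped bit, here is a toy Hamming(7,4) code in Python. It is only an illustration: real ECC RAM uses wider codes in hardware, and the function names here are invented for the example. Four data bits are stored with three parity bits, and any single flipped bit can be located and flipped back.

```python
def hamming74_encode(data):
    """Encode 4 data bits [d1, d2, d3, d4] as 7 bits with 3 parity bits."""
    d1, d2, d3, d4 = data
    p1 = d1 ^ d2 ^ d4  # covers positions 1, 3, 5, 7
    p2 = d1 ^ d3 ^ d4  # covers positions 2, 3, 6, 7
    p3 = d2 ^ d3 ^ d4  # covers positions 4, 5, 6, 7
    # Layout by position 1..7: p1 p2 d1 p3 d2 d3 d4
    return [p1, p2, d1, p3, d2, d3, d4]


def hamming74_decode(code):
    """Fix at most one flipped bit; return (data bits, error position or 0)."""
    c = list(code)
    s1 = c[0] ^ c[2] ^ c[4] ^ c[6]
    s2 = c[1] ^ c[2] ^ c[5] ^ c[6]
    s3 = c[3] ^ c[4] ^ c[5] ^ c[6]
    # The three checks read as a binary number give the 1-based error position.
    error_pos = s1 + 2 * s2 + 4 * s3
    if error_pos:
        c[error_pos - 1] ^= 1  # flip the bad bit back
    return [c[2], c[4], c[5], c[6]], error_pos


stored = hamming74_encode([1, 0, 1, 1])
stored[4] ^= 1  # simulate a cosmic ray flipping the bit at position 5
recovered, position = hamming74_decode(stored)
print(recovered, position)
```

The cost is three extra bits for every four stored, which is the usual trade: spend some redundancy up front so that an occasional fault does not corrupt the result.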

Designing for error correction is a thing, and people are perfectly capable of doing so in their personal lives.

desktop_user@lemmy.blahaj.zone · 5 hours ago

and this is why humans are bad: a tool is neither good nor bad. sure, a tool can use a large amount of resources to develop only to be completely obsolete in a year, but only humans (so far) have the ability (and stupidity) to be both in charge of millions of lives and trust a bunch of lithographed rocks to create tariff rates for uninhabited islands (and the rest of the world).