Large language models (LLMs) are notorious for "hallucinating" responses that contain false information bearing little resemblance to reality.
But are the models making mistakes because they're ignorant, or is some other problem causing them to slip up?
Answering this question is becoming ever more important as LLMs move away from simply answering consumers' questions and start to become embedded in core mission-critical enterprise functions in which a wrong answer could be an expensive mistake or a major security risk.
The thorny topic of hallucinations is made even more complex by the fact that wrong answers can "snowball", so that one mistake is followed by subsequent errors as the model attempts to justify or compensate for its slip-ups.
To work out whether GenAI models are stupid or just plain ignorant, researchers from Google and the Technion - Israel Institute of Technology devised a new system called Wrong Answer despite having Correct Knowledge (WACK) that can test why hallucinations take place.
"Large language models are susceptible to hallucinations - outputs that are ungrounded, factually incorrect, or inconsistent with prior generations," they wrote in a pre-print paper.
The research sets out two types of hallucinations which take place either when the model "does not hold the correct answer in its parameters" or if it "answers incorrectly despite having the required knowledge". Or, in other words, whether it doesn't know the answer or has simply made a mistake.
"We argue that distinguishing these cases is crucial for detecting and mitigating hallucinations," they continued.
"This differentiation is crucial for understanding hallucinations’ underlying mechanisms and developing targeted detection and mitigation strategies."
Understanding why a model hallucinates requires a deep understanding of its "inner state" - with ignorance and error represented differently.
To work out whether models are ignorant or careless, the team built a dataset made specifically for each model which "captures the distinction between the two types of hallucinations".
This allowed them to work out whether hallucinations were caused by a lack of knowledge - a relatively easy problem to fix - or some other issue which is causing them to answer incorrectly despite knowing the answer (which is contained in their training dataset).
"We show it is possible to distinguish between the two hallucination types," the team wrote.
However, each model requires its own training dataset in order to identify the nature of its hallucinations, showing that each GenAI model trips out in its own unique way.
"Datasets constructed using WACK exhibit variations across models, demonstrating that even when models share knowledge of certain facts, they still vary in the specific examples that lead to hallucinations," the researchers wrote.
"Training a probe on our WACK datasets leads to better hallucination detection... than using the common generic one-size-fits-all datasets."
In a report released earlier in October, investigators from the Government Accountability Office (GAO) warned that vendors' focus on the capabilities of their models often comes at the expense of honesty about the true problems with GenAI.
The GAO probe set out a range of major issues with GenAI and warned that the public has been left in the dark about key aspects of the training of models as well as their propensity for hallucinations and "confabulations".
"Commercial developers face some limitations in responsibly developing and deploying generative AI technologies to ensure that they are safe and trustworthy," GAO wrote. "Developers recognize that their models are not fully reliable, and that user judgment should play a role in accepting model outputs.