AI models struggle to identify nonsense, says study

Stay tuned with 24 News HD Android App

The AI models that power chatbots and other applications still have difficulty distinguishing between nonsense and natural language, according to a study released on Thursday.

The researchers at Columbia University in the United States said their work revealed the limitations of current AI models and suggested it was too early to let them loose in legal or medical settings.

They put nine AI models through their paces, firing hundreds of pairs of sentences at them and asking which were likely to be heard in everyday speech.

They asked 100 people to make the same judgement on pairs of sentences like: "A buyer can own a genuine product also / One versed in circumference of highschool I rambled."

The research, published in the Nature Machine Intelligence journal, then weighed the AI answers against the human answers and found dramatic differences.

Sophisticated models like GPT-2, an earlier version of the model that powers viral chatbot ChatGPT, generally matched the human answers.

Other simpler models did less well.

But the researchers highlighted that all the models made mistakes.

"Every model exhibited blind spots, labelling some sentences as meaningful that human participants thought were gibberish," said psychology professor Christopher Baldassano, an author of the report.

"That should give us pause about the extent to which we want AI systems making important decisions, at least for now."

Tal Golan, another of the paper's authors, told AFP that the models were "an exciting technology that can complement human productivity dramatically".

However, he argued that "letting these models replace human decision-making in domains such as law, medicine, or student evaluation may be premature".

Among the pitfalls, he said, was the possibility that people might intentionally exploit the blind spots to manipulate the models.

AI models burst into public consciousness with the release of ChatGPT last year, which has since been credited with passing various exams and has been touted as a possible aide to doctors, lawyers and other professionals.

Categories : Science & Tech

Trump defence pick Hegseth accused of 2017 sexual assault

10 infants killed, 16 critical in India hospital fire

With Trump in White House, Xi and Biden signal turbulence ahead

Global pandemic accord talks: What’s been achieved so far?

Historic six-hitting blitz propels India to dominant win over South Africa

Lebanon reviews US proposal for Israel-Hezbollah truce

In rare talks, China and South Korea vow to enhance collaboration

Pakistan launches 1st National Climate Finance Strategy on COP29 sidelines

WHO blames 'fake finger markings' for failure of polio control efforts in Pakistan

Seed Clouding: Punjab govt conducts artificial rain in Jehlum and Chakwal districts

AI models struggle to identify nonsense, says study

Stay tuned with 24 News HD Android App

AFP

British Vogue editor concerned about skinny models

Bangladesh struggle at 101-3 as South Africa threaten innings defeat

Argentina students study in the street to protest budget cuts

Science study ranks 50-year-old Shah Rukh Khan among world’s 10 most handsome ...

Preity Zinta flaunts heartfelt note on 20 years of Veer-Zaara

Khushi Kapoor finally opens up about dating buzz with alleged beau Vedang Riana

Neha Kakkar’s ex-boyfriend Hemansh Kohli ties the knot with his lady love

Former actress Aisha Khan celebrates daughter Mahnoor's 5th birthday

Saheefa hits back at trolls over driving and eating lollipop

Foreign occupation is destroying the very fabric of Kashmiri society

A Century of Defence, A legacy of Growth

Battle for the White House

Judge Raj's last breaths

A year of Zionist terror in Gaza

Schools closure extended as smog emergency declared in Lahore, Multan

Gen Asim Munir says Fitna-tul-Khawarij threatens global security

Reception in honour of Coordinator General COMSTECH and VCs

Suicide bomb blast misses police target in Charsadda

Aymen Saleem announces her pregnancy

Gambler hits $85 million jackpot betting on Trump win

Nine tonnes of methamphetamine seized in Turkey in two weeks

Mysterious diamond-laden necklace fetches $4.8m in Geneva auction

Insurance fraudsters dressed as bears to wreck cars

US tourist arrested for defacing Tokyo shrine