Who is Evelyn Hartwell?
Evelyn Hartwell is an American author, speaker and life coach…
Evelyn Hartwell is a Canadian dancer and founding artistic director…
Evelyn Hartwell is an American actress known for her roles in the…
No, Evelyn Hartwell is not a con artist with multiple false identities living a deceitful triple life across several professions. In reality, she doesn't exist at all, but instead of telling me that it doesn't know, the model starts making up facts. We are facing an LLM hallucination.
Long, detailed answers can seem very convincing, even when they are fictitious. Does that mean we can't trust chatbots and have to fact-check every result manually? Fortunately, with the right safeguards, we can make chatbots much less likely to say made-up things.
For the above outputs, I set a higher temperature of 0.7. This allows the LLM to vary its sentence structure, so I don't get identical text for each generation. The differences between the results should be only semantic, not factual.
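Here is a minimal sketch of how such samples could be generated, assuming the OpenAI Python SDK; the model name and the number of samples are illustrative choices, not necessarily the exact setup used above:

```python
from openai import OpenAI

client = OpenAI()  # assumes OPENAI_API_KEY is set in the environment

def sample_responses(prompt: str, n_samples: int = 3) -> list[str]:
    """Draw several stochastic completions of the same prompt.

    With temperature=0.7 the wording varies between samples,
    while the facts should ideally stay the same.
    """
    response = client.chat.completions.create(
        model="gpt-3.5-turbo",  # illustrative model choice
        messages=[{"role": "user", "content": prompt}],
        temperature=0.7,
        n=n_samples,  # request several completions in one call
    )
    return [choice.message.content for choice in response.choices]

samples = sample_responses("Who is Evelyn Hartwell?")
```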
This simple idea enables a new sample-based hallucination detection mechanism: if the LLM's responses to the same prompt contradict each other, they are probably hallucinations; if they are consistent with each other, the information is likely factual. (2)
For this type of assessment, we only need the text outputs of the LLM; this is known as black-box evaluation. And since we don't need any external knowledge, it is called zero-resource. (5)
Let's start with a very basic way to measure similarity. We will compute the pairwise cosine similarity between corresponding pairs of embedded sentences. We normalize the embeddings because we want to focus only on the direction of the vectors, not their magnitude. The following function takes as input the originally generated sentence called production and a…
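As a minimal sketch of such a function, assuming it also receives a list of sampled responses and uses the sentence-transformers library for embeddings (the model choice and the exact signature are my assumptions):

```python
from sentence_transformers import SentenceTransformer

embedder = SentenceTransformer("all-MiniLM-L6-v2")  # assumed embedding model

def get_cos_sim(production: str, samples: list[str]) -> float:
    """Mean cosine similarity between the production sentence
    and each sampled sentence.

    normalize_embeddings=True L2-normalizes the vectors, so the
    dot product of two embeddings equals their cosine similarity.
    """
    embeddings = embedder.encode([production] + samples,
                                 normalize_embeddings=True)
    prod_emb, sample_embs = embeddings[0], embeddings[1:]
    sims = sample_embs @ prod_emb  # one similarity score per sample
    return float(sims.mean())
```

A score close to 1 means the samples agree with the production answer; a noticeably lower score flags a likely hallucination.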