A deep dive into stochastic decoding with temperature, top_p, top_k and min_p
When a Large Language Model (LLM) is asked a question, the model generates a probability for each possible token in its vocabulary.
After sampling a token from this probability distribution, we can add the selected token to our input message so that the LLM can generate the probabilities for the next token.
This sampling process can be controlled by parameters such as the famous temperature and top_p.
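To make this loop concrete, here is a minimal sketch in plain Python and NumPy. The `fake_model` function and the tiny five-token vocabulary are stand-ins I made up for illustration; a real LLM would run a forward pass over the full context and return logits for its entire vocabulary.

```python
import numpy as np

rng = np.random.default_rng(0)
vocab = ["the", "cat", "sat", "on", "mat"]  # toy vocabulary for illustration

def softmax(logits: np.ndarray) -> np.ndarray:
    # Turn raw scores (logits) into a probability distribution over the vocabulary.
    exps = np.exp(logits - logits.max())
    return exps / exps.sum()

def fake_model(context: list[str]) -> np.ndarray:
    # Stand-in for a real forward pass: returns one logit per vocabulary token.
    return rng.normal(size=len(vocab))

context = ["the"]
for _ in range(4):
    probs = softmax(fake_model(context))       # probabilities for the next token
    next_id = rng.choice(len(vocab), p=probs)  # sample one token from the distribution
    context.append(vocab[next_id])             # feed the chosen token back as input

print(" ".join(context))
```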
In this article, I will explain and visualize the sampling strategies that define the output behavior of LLMs. By understanding what these parameters do and configuring them based on our use case, we can improve the output generated by LLMs.
For this article, I will use vLLM as the inference engine and Microsoft's new Phi-3.5-mini-instruct model with AWQ quantization. To run this model locally, I use the NVIDIA GeForce RTX 2060 GPU in my laptop.
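As a rough sketch of what such a setup can look like with vLLM's offline Python API (the model identifier below is a placeholder for whatever AWQ-quantized checkpoint you actually use; the same model can also be served behind vLLM's OpenAI-compatible server, which is how the logprobs are retrieved later with the OpenAI SDK):

```python
from vllm import LLM, SamplingParams

# Placeholder identifier: point this at the AWQ-quantized Phi-3.5-mini-instruct
# checkpoint you actually use (e.g. a community AWQ build on the Hugging Face Hub).
llm = LLM(model="path/or/hub-id-of-phi-3.5-mini-instruct-awq", quantization="awq")

# SamplingParams exposes the knobs discussed in this article
# (temperature, top_p, top_k, min_p) plus logprobs for inspecting the distribution.
params = SamplingParams(temperature=0.7, top_p=0.9, logprobs=5, max_tokens=64)

outputs = llm.generate(["Explain what temperature does during sampling."], params)
print(outputs[0].outputs[0].text)
```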
Table of Contents
· Understanding Sampling with Logprobs
∘ LLM decoding theory
∘ Retrieving Logprobs with the OpenAI Python SDK
· Greedy decoding
· Temperature
· Top-k sampling
· Top-p sampling
· Combining Top-p…