Scikit-learn's dummy models. Always have a pacifier by your side. | by Yoann Mocquin | February 2024

If you like or want to learn machine learning with scikit-learn, check out my tutorial series on this amazing package:

Sklearn Tutorial

All images by author.

Dummy models are very simplistic models that should be used as a basis for comparing real models. A baseline is just a kind of reference point to compare yourself to. When you calculate your first cross-validation results to estimate the performance of your model, you usually know that the higher the score the better, and if the score is pretty high on the first try, that's great. It is not usually this way.

What to do if the first accuracy score is quite low, or lower than what you would like or expect? Is it because of the data? Is it because of your model? Both? How can we quickly know if our model is not poorly tuned?

Dummy models are here to answer these questions. Their complexity and “intelligence” are very low: the idea is that you can compare your models with them to see how much better you are than the “stupider” models. Note that they don't intentionally predict stupid values, they just take the easiest and most simplistic smart guess. If your model performs worse than the dummy model, you should adjust or change your model entirely.

A simple example for a dummy regressor would be to always predict the mean value of the training target, whatever the input: it's not ideal, but on average it gives a reasonable simplistic assumption. If your real model gives worse results than this very, very simple approach, you might want to review your model.

Scikit-learn's dummy models. Always have a pacifier by your side. | by Yoann Mocquin | February 2024

Technical Terrence Team

Taco Bell Menu Brings Back Fan-Favorite Spicy Dishes Nationwide

Leave a Reply Cancel reply

Recommended.

£10,000 in an ISA? This is how you would aim to earn £33 every week in passive income

MemoryFormer: A Novel Transformative Architecture for Efficient and Scalable Large Language Models

Meet PayHOA, a once-starting, profitable SaaS startup that just raised a $27.5 million Series A.

Mistral 7B-V0.2: Tuning the new Mistral open source LLM with Hugging Face

Sam Altman is “embarrassed” that OpenAI has threatened to revoke equity if outgoing employees do not sign confidentiality agreement

Categories

Important Links

Scikit-learn's dummy models. Always have a pacifier by your side. | by Yoann Mocquin | February 2024

Sklearn Tutorial

Related

Technical Terrence Team

Taco Bell Menu Brings Back Fan-Favorite Spicy Dishes Nationwide

Leave a Reply Cancel reply

Recommended.

£10,000 in an ISA? This is how you would aim to earn £33 every week in passive income

MemoryFormer: A Novel Transformative Architecture for Efficient and Scalable Large Language Models

Meet PayHOA, a once-starting, profitable SaaS startup that just raised a $27.5 million Series A.

Mistral 7B-V0.2: Tuning the new Mistral open source LLM with Hugging Face

Sam Altman is “embarrassed” that OpenAI has threatened to revoke equity if outgoing employees do not sign confidentiality agreement

Categories

Important Links

Get daily news updates to your inbox!