Recommendations are ubiquitous in our digital lives, from e-commerce giants to streaming services. However, behind every great recommendation system lies a challenge that can significantly affect its effectiveness: sampling bias.
In this article, I will explain how sampling bias arises during the training of recommendation models and how we can address it in practice.
Let’s dive in!
In general, we can formulate the recommendation problem as follows: given a query x (which may contain user information, context, previously clicked items, etc.), find the set of items {y1, ..., yk} that the user will most likely be interested in.
One of the main challenges for large-scale recommender systems is the low-latency requirement. The sets of users and items are vast and dynamic, so exhaustively scoring every candidate and greedily picking the best ones is infeasible. To meet the latency budget, recommender systems are therefore generally split into two main stages: retrieval and ranking.
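The two-stage idea can be sketched as follows. This is a toy illustration, not a real system: `cheap_score` and `precise_score` are hypothetical stand-ins for the retrieval similarity and the heavy ranking model.

```python
# Toy sketch of the two-stage pipeline described above. `cheap_score` and
# `precise_score` are illustrative stand-ins, not real models.

def cheap_score(query, item):
    # Stand-in for a fast similarity, e.g. a dot product of precomputed embeddings.
    return -abs(query - item)

def precise_score(query, item):
    # Stand-in for an expensive ranking model, run only on the retrieved candidates.
    return -abs(query - item) ** 2

def recommend(query, catalog, retrieve_k=300, final_k=10):
    # Stage 1 (retrieval): cheap score over the full catalog, keep a few hundred.
    candidates = sorted(catalog, key=lambda y: cheap_score(query, y), reverse=True)[:retrieve_k]
    # Stage 2 (ranking): expensive score over the survivors only.
    return sorted(candidates, key=lambda y: precise_score(query, y), reverse=True)[:final_k]

print(recommend(42, list(range(100_000)), final_k=5))  # → [42, 41, 43, 40, 44]
```

The key design point is that the expensive model never sees the full catalog: it only scores the few hundred candidates that survive retrieval.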
Retrieval is a cheap and efficient way to quickly narrow the vast pool of candidates (millions or billions) down to the top few hundred. Retrieval optimization has two main objectives:
- During the training phase, we want to encode users and items into embeddings that capture user behavior and preferences.
- During inference, we want to quickly retrieve relevant elements through Approximate Nearest Neighbors (ANN).
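To make the second objective concrete, here is a minimal sketch of inner-product retrieval over precomputed item embeddings. For clarity it computes the exact top-k with NumPy; a production system would replace this brute-force step with an ANN index (e.g. FAISS or ScaNN) to stay within the latency budget. The shapes and embeddings are arbitrary assumptions for illustration.

```python
import numpy as np

rng = np.random.default_rng(0)
item_embeddings = rng.normal(size=(100_000, 64)).astype(np.float32)  # precomputed item tower outputs
query_embedding = rng.normal(size=(64,)).astype(np.float32)          # query tower output at request time

# Score every item by inner product with the query embedding.
scores = item_embeddings @ query_embedding

# Exact top-k; an ANN index would approximate this step much faster.
k = 300
top_k = np.argpartition(scores, -k)[-k:]
top_k = top_k[np.argsort(scores[top_k])[::-1]]  # sort the k survivors by score, best first
```

`argpartition` finds the k best candidates in linear time without fully sorting all 100,000 scores; only the k survivors are sorted.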
For the first objective, one of the most common approaches is the two-tower neural network. The model gained popularity for addressing cold-start issues by incorporating item content features.
In detail, queries and items are encoded by their corresponding DNN towers so that relevant (query, item) embedding pairs remain…
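A minimal two-tower sketch in NumPy might look like the following. All dimensions and weights here are arbitrary assumptions for illustration; a real model would learn the tower parameters by gradient descent.

```python
import numpy as np

rng = np.random.default_rng(0)

def tower(x, w1, w2):
    # One hidden layer with ReLU, then L2-normalize the output embedding
    # so the dot product between the two towers is a cosine similarity.
    h = np.maximum(x @ w1, 0.0)
    e = h @ w2
    return e / np.linalg.norm(e, axis=-1, keepdims=True)

# Hypothetical sizes: 32-dim query features, 48-dim item features, 16-dim embeddings.
wq1, wq2 = rng.normal(size=(32, 64)), rng.normal(size=(64, 16))  # query tower weights
wi1, wi2 = rng.normal(size=(48, 64)), rng.normal(size=(64, 16))  # item tower weights

query = rng.normal(size=(1, 32))   # one query's features
items = rng.normal(size=(5, 48))   # five candidate items' features

# Relevance is the dot product of the two towers' embeddings.
similarity = tower(query, wq1, wq2) @ tower(items, wi1, wi2).T  # shape (1, 5)
```

Because the item tower does not depend on the query, item embeddings can be precomputed offline and indexed, leaving only the query tower to run at request time.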