Abstract

This talk explores the application of reinforcement learning (RL) in large-scale recommendation systems to optimize user retention at scale - the true north star of effective recommendation engines. We'll discuss how RL can learn patterns and attribute future retention behavior to content consumed in current sessions, providing a more holistic approach than traditional methods.

We'll share insights from implementing this strategy in a production environment serving millions of users, highlighting significant improvements in key retention metrics. The presentation will address major challenges, including noisy attributions and proving causality within the system.

Our findings demonstrate the potential of RL in creating more sustainable, user-centric recommendation systems across various digital platforms, with important implications for the future of personalized content delivery.

Key Takeaways:

Using long-term rewards at different future horizons leads to incrementality in long-term metrics like Daily active users and sessions (will share results in the talk).
Optimal to try different horizons and approaches since duration is a tradeoff between causality of the lever and correlation with the final long-term metric.
High ROI of reward shaping to encode product intuition and strategy (e.g. private sharing)
Lessons on model architecture, co-investment in infrastructure (GPU inference, user history processing, sequence modeling) required to derive benefit at scale.

Speaker

Saurabh Gupta

Senior Engineering Leader @Meta, Veteran in the Video Recommendations Domain, Helping Scale Video Consumption

Saurabh Gupta is a Senior Engineering Leader at Meta Inc. He leads the video recommendations core ranking organization which is directly responsible for serving personalized video recommendations to billions of users on Facebook. These video recommendations constitute one third of time consumed by all users on Facebook app. He is a veteran in the video recommendations domain and has helped scale video consumption by many folds on Facebook in the last 9+ years. His work focuses on building scalable retrieval systems, understanding user interests, building large scale ML models to predict user actions and delivering personalized and highly relevant video feeds involving short form and long form videos on parts of Facebook app. He received his M.S. in Computer Science with specialization in Machine Learning from Georgia Institute of Technology. He holds several US patents and many more in pipeline in areas of machine learning, recommendations and software engineering.

Gaurav Chakravorty

Uber TL @Meta, Previously Worked on Facebook Video Recommendations and Instagram Friending and Growth

Gaurav is an Uber TL at Meta Inc, previously in Facebook video recommendations and more recently in Instagram friending and growth. He has worked on end-to-end recommender system advances, including user retention modeling. Prior to this, he has led applied ML based initiatives at Discord and Google, and high frequency trading.

From the same track

Session AI/ML

Recommender and Search Ranking Systems in Large Scale Real World Applications

Monday Nov 18 / 01:35PM PST

Recommendation and search systems are two of the key applications of machine learning models in industry. Current state of the art approaches have evolved from tree based ensembles models to large deep learning models within the last few years.

Moumita Bhattacharya

Senior Research Scientist @Netflix, Previously @Etsy, Specialized in Machine Learning, Deep Learning, Big Data, Scala, Tensorflow, and Python

Session Knowledge Graphs

Enhance LLMs’ Explainability and Trustworthiness With Knowledge Graphs

Monday Nov 18 / 10:35AM PST

Graphs, especially knowledge graphs, are powerful tools for structuring data into interconnected networks. The structured format of knowledge graphs enhances the performance of LLM-based systems by improving information retrieval and ensuring the use of reliable sources.

Leann Chen

AI Developer Advocate @Diffbot, Creator of AI and Knowledge Graph Content on YouTube, Passionate About Knowledge Graphs & Generative AI

Session AI/ML

Why Most Machine Learning Projects Fail to Reach Production and How to Beat the Odds

Monday Nov 18 / 02:45PM PST

Despite the hype around AI, many ML projects fail, with only 15% of businesses' ML projects succeeding, according to McKinsey. Particularly with the significant investments in large language models and generative AI, only a small portion of companies have managed to realize their true value.

Wenjie Zi

Senior Machine Learning Engineer and Tech Lead @Grammarly, Specializing in Natural Language Processing, 10+ Years of Industrial Experience in Artificial Intelligence Applications

Session

Unconference: AI and ML for Software Engineers

Monday Nov 18 / 03:55PM PST

Session

Scale Out Batch Inference with Ray

Monday Nov 18 / 11:45AM PST

As AI technologies continue to evolve, the demand for processing both structured and unstructured data across diverse industries is rapidly growing.

Cody Yu

Staff Software Engineer and Tech Lead @Anyscale, Ex-Amazonian, vLLM Committer, Apache TVM PMC

Reinforcement Learning for User Retention in Large-Scale Recommendation Systems

Abstract

Key Takeaways:

Speaker

Saurabh Gupta

Speaker

Gaurav Chakravorty

Speaker

Saurabh Gupta

Speaker

Gaurav Chakravorty

Date

Location

Track

Topics

Video

Slides

Share

From the same track

Recommender and Search Ranking Systems in Large Scale Real World Applications

Enhance LLMs’ Explainability and Trustworthiness With Knowledge Graphs

Why Most Machine Learning Projects Fail to Reach Production and How to Beat the Odds

Unconference: AI and ML for Software Engineers

Scale Out Batch Inference with Ray

Follow QCon

Contact

Menu

Conferences around the World