Skip to main content
Download free report
SoftBlues
Back to Projects

Live Streaming Reinforcement Learning Recommendation System

Lead AI Engineer / Architect2020-2024Anton O.
AO
Anton O.

Lead ML & Data Engineer

Data Engineer & Big Data

Key Expertise

Reinforcement LearningML InfrastructureReal-time MLApplied NLPData Platform Architect

Experience

10+ years

Timezone

CET (GMT +1)

7-day risk-free trial
Response within 24 hours
View Full Profile

Overview

A consumer-facing live streaming platform needed to improve real-time content recommendations under strict latency constraints. Traditional offline-trained models were slow to adapt to user behavior and changing content dynamics. The goal was to design a system that could learn continuously from live user interactions.

Achievements

A reinforcement learning–based recommendation system was deployed to production, adapting recommendations in near real time based on user feedback signals. The system improved engagement metrics while remaining stable under high request rates.

Responsibilities

  • Designed the end-to-end recommendation architecture combining offline training and online learning.
  • Defined reward signals based on user interactions (watch time, skips, engagement events).
  • Built a real-time inference service with tight latency budgets.
  • Implemented safeguards to prevent feedback loops and degraded user experience during exploration.
  • Worked closely with product and backend teams to integrate the model into the live serving stack.
AO

This project was delivered by

Anton O.

View Full Profile

Ready to Build Your AI Team?

Get matched with the right AI experts for your project. Book a free discovery call to discuss your requirements.

We respond within 24 hours.