RL

Adaptive Bitrate Estimation using Reinforcement Learning

Advanced the state-of-the-art A3C (Asynchronous Advantage Actor-Critic) model implemented in Pensieve by increasing exploration with the Follow then Forage technique.