dc.contributor
Universitat Politècnica de Catalunya. Departament d'Enginyeria de Sistemes, Automàtica i Informàtica Industrial
dc.contributor
Angulo Bahón, Cecilio
dc.contributor.author
Izquierdo Enfedaque, Héctor
dc.date.issued
2021-10-22
dc.identifier
https://hdl.handle.net/2117/355912
dc.identifier
ETSEIB-240.161351
dc.description.abstract
Recommender Systems aim to help customers find content of their interest by presenting them suggestions they are most likely to prefer. Reinforcement Learning, a Machine Learning paradigm where agents learn by interaction which actions to perform in an environment so as to maximize a reward, can be trained to give good recommendations. One of the problems when working with Reinforcement Learning algorithms is the dimensionality explosion, especially in the observation space. On the other hand, Industrial recommender systems deal with extremely large observation spaces. New Deep Reinforcement Learning algorithms can deal with this problem, but they are mainly focused on images. A new technique has been developed able to convert raw data into images, enabling DRL algorithms to be properly applied. This project addresses this line of investigation. The contributions of the project are: (1) defining a generalization of the Markov Decision Process formulation for Recommender Systems, (2) defining a way to express the observation as an image, and (3) demonstrating the use of both concepts by addressing a particular Recommender System case through Reinforcement Learning. Results show how the trained agents offer better recommendations than the arbitrary choice. However, the system does not achieve a great performance mainly due to the lack of interactions in the dataset
dc.format
application/pdf
dc.publisher
Universitat Politècnica de Catalunya
dc.rights
http://creativecommons.org/licenses/by-nc-nd/3.0/es/
dc.subject
Àrees temàtiques de la UPC::Informàtica
dc.subject
Recommender systems (Information filtering) -- Mathematical models -- Software
dc.subject
Reinforcement learning -- Mathematical models
dc.subject
Sistemes recomanadors (Filtratge d'informació) -- Models matemàtics -- Programari
dc.subject
Aprenentatge per reforç -- Models matemàtics
dc.title
Deep Reinforcement Learning in Recommender Systems