What is it about?

Many modern sequential recommender systems use deep neural networks, which can effectively estimate the relevance of items but require a lot of time to train. Slow training increases expenses, hinders product development timescales and prevents the model from being regularly updated to adapt to changing user preferences. Training such sequential models involves appropriately sampling past user interactions to create a realistic training objective. The existing training objectives have limitations. For instance, next item prediction never uses the beginning of the sequence as a learning target, thereby potentially discarding valuable data. On the other hand, the item masking used by BERT4Rec is only weakly related to the goal of the sequential recommendation; therefore, it requires much more time to obtain an effective model. Hence, we propose a novel Recency-based Sampling of Sequences training objective that addresses both limitations. We apply our method to various recent and state-of-the-art model architectures - such as GRU4Rec, Caser, and SASRec. We show that the models enhanced with our method can achieve performances exceeding or very close to stateof-the-art BERT4Rec, but with much less training time.

Featured Image

Why is it important?

Slow model training is a problem for both academic researchers and industry. For academic researchers it limits the number of the experiments, which researchers can run. In the industry it may cause delays between rapid changes of user interests and reflection of these changes in the model.

Read the Original

This page is a summary of: Effective and Efficient Training for Sequential Recommendation using Recency Sampling, September 2022, ACM (Association for Computing Machinery),
DOI: 10.1145/3523227.3546785.
You can read the full text:

Read

Contributors

The following have contributed to this page