SML
Problem formulation
There are two major components in this SML framework: (1) building an expressive component that transfers the knowledge gained in previous training to the training on new interactions, and (2) optimizing the transfer component towards the recommendation performance in the near future.
Model overview
Model overview of transfer-based retraining for the t-th time period. $W_{t-1}$ represents the previous recommender, and $\hat{W}_t$ is a recommender learned on the new data $D_t$ only. The transfer component combines the "knowledge" in $W_{t-1}$ and $\hat{W}_t$ to obtain the new recommender $W_t$ for serving the next period.
We propose a new retraining method with two major considerations: (1) building an expressive component that transfers the knowledge gained in previous training to the training on new interactions, and (2) optimizing the transfer component towards the recommendation performance in the near future.
To achieve the first goal, we devise the transfer component as a convolutional neural network (CNN) that takes the previous model parameters as constant inputs and the present model parameters as trainable inputs. The rationale is that the knowledge gained in previous training is condensed in the model parameters, so an expressive neural network should be able to distill that knowledge towards the desired purpose. To achieve the second goal, in addition to normal training on newly collected interactions, we further train the transfer CNN on the future interactions of the next time period. As such, the CNN learns how to combine the old parameters with the present parameters, with the objective of predicting the user interactions of the near future.
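To make the idea of combining old and present parameters concrete, the following is a minimal sketch and not the authors' architecture. It assumes the transfer input is a stack of three rows (the previous parameters `w_prev`, the parameters learned on new data `w_hat`, and their element-wise product) and applies a single 3x1 convolution filter down each column. The product row, the function names, and the fixed `kernel` values are illustrative assumptions; in the method described above the filter weights would be learned, and a full CNN would have more filters and layers.

```python
def stack_inputs(w_prev, w_hat):
    """Stack old params, new params, and their element-wise product as rows.

    The product row is an illustrative assumption, not taken from the text.
    """
    product = [p * h for p, h in zip(w_prev, w_hat)]
    return [list(w_prev), list(w_hat), product]


def transfer(w_prev, w_hat, kernel):
    """Apply one 3x1 filter down each column of the stacked 3 x d input."""
    rows = stack_inputs(w_prev, w_hat)
    d = len(w_prev)
    return [sum(kernel[k] * rows[k][j] for k in range(3)) for j in range(d)]


# Hypothetical 2-dimensional parameter vectors.
w_prev = [1.0, 2.0]   # previous model, treated as a constant input
w_hat = [3.0, 4.0]    # model fitted on the new period only
w_new = transfer(w_prev, w_hat, kernel=[0.5, 0.5, 0.0])
print(w_new)
```

With the kernel `[0.5, 0.5, 0.0]` the filter reduces to a plain average of old and new parameters; other kernel values weight the two sources differently per the learned filter.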
In real-world recommender systems, user interaction data streams in continuously. To keep the predictive model fresh with recent data, a common choice is to retrain the model periodically. We represent the data as $\{D_0, \dots, D_t, D_{t+1}, \dots\}$, where $D_t$ denotes the data newly collected in time period $t$. Assume each retraining is triggered right after $D_t$ is collected. A period can be any length of time, e.g., a day, a week, or however long it takes to collect a certain number of interactions, depending on the system requirements and implementation capacity. In the retraining of time period $t$, the system has access to all previous data, i.e., $\{D_m : m < t\}$, and the new data $D_t$.
Since the retrained model serves the near future, it is reasonable to judge its effectiveness on $D_{t+1}$, the data collected in the next time period. As such, we set the recommendation performance on $D_{t+1}$ as the generalization goal of the $t$-th period retraining. Let the model parameters after the $t$-th period retraining be $W_t$. We treat each retraining as a task, formulating it as $(\{D_m : m \le t\}) \xrightarrow{\text{get}} W_t \xrightarrow{\text{test}} D_{t+1}$. In this work, we aim to use only the newly collected data $D_t$ plus the previous model parameters $W_{t-1}$ to obtain a retrained model that performs well on $D_{t+1}$. Thus we reformulate the retraining process as $(D_t, W_{t-1}) \xrightarrow{\text{get}} W_t \xrightarrow{\text{test}} D_{t+1}$.
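The retraining protocol described above (build the period-t model from the new data and the previous model only, then judge it on the next period's data) can be sketched as a simple loop. This is a toy illustration under stated assumptions: the "model" is a decayed item-popularity counter standing in for a real recommender, and `retrain`, `hit_rate`, `run_periods`, and the sample interactions are all hypothetical.

```python
from collections import Counter


def retrain(new_data, prev_model, decay=0.5):
    """Stand-in for (new data, previous model) -> new model.

    Decays the previous item scores and adds counts from the new period.
    """
    model = Counter({item: decay * score for item, score in prev_model.items()})
    for _user, item in new_data:
        model[item] += 1.0
    return model


def hit_rate(model, test_data, k=1):
    """Fraction of test interactions whose item is in the model's top-k."""
    if not test_data:
        return 0.0
    top_k = {item for item, _score in model.most_common(k)}
    return sum(item in top_k for _user, item in test_data) / len(test_data)


def run_periods(periods):
    """Retrain at each period t and evaluate on period t + 1."""
    model = Counter()
    scores = []
    for t in range(len(periods) - 1):
        model = retrain(periods[t], model)               # new data + old model only
        scores.append(hit_rate(model, periods[t + 1]))   # judged on the next period
    return scores


# Hypothetical (user, item) interactions for three periods.
periods = [
    [("u1", "a"), ("u2", "a"), ("u3", "b")],
    [("u1", "b"), ("u2", "b"), ("u3", "b")],
    [("u1", "b"), ("u2", "c")],
]
scores = run_periods(periods)
print(scores)
```

Note that the loop never revisits earlier periods' raw data: everything the model knows about the past is carried in the previous model, which is the constraint the reformulated task imposes.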
Training procedure
Sequential training procedure of SML.
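The idea of tuning the transfer component against next-period data can be illustrated with a deliberately simplified stand-in. Instead of a CNN, the sketch below uses a single scalar blend weight `alpha` (a swapped-in technique, not the method described here) and fits it by gradient descent so that the blended model predicts hypothetical next-period targets. All names and numbers are assumptions for illustration.

```python
def blend(alpha, w_prev, w_hat):
    """Scalar stand-in for the transfer component: a convex-style blend."""
    return alpha * w_prev + (1.0 - alpha) * w_hat


def fit_alpha(w_prev, w_hat, future_targets, lr=0.1, steps=50):
    """Fit alpha by gradient descent on mean squared error over future data."""
    alpha = 0.5
    for _ in range(steps):
        pred = blend(alpha, w_prev, w_hat)
        # d/d_alpha of mean((pred - y)^2) = 2 * mean(pred - y) * (w_prev - w_hat)
        mean_err = sum(pred - y for y in future_targets) / len(future_targets)
        alpha -= lr * 2.0 * mean_err * (w_prev - w_hat)
    return alpha


# Hypothetical scalar "models": the old one says 2.0, the new-data one says 4.0,
# and the next period's observations sit at 3.5.
alpha = fit_alpha(w_prev=2.0, w_hat=4.0, future_targets=[3.5, 3.5])
w_t = blend(alpha, 2.0, 4.0)
print(round(alpha, 4), round(w_t, 4))
```

The point of the sketch is the training signal: the combination rule is optimized on data from the following period rather than on the data the present model was fitted to.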
Evaluation procedure
Model evaluation and update procedure.
Performance
Average recommendation performance over online testing periods on Adressa and Yelp. "RI" indicates the relative improvement of SML over the corresponding baseline.
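As a reading aid for the "RI" column, the sketch below assumes the usual definition of relative improvement, (SML score minus baseline score) divided by the baseline score, expressed as a percentage of period-averaged metrics. The function names and the per-period numbers are made up for illustration and are not results from the evaluation.

```python
def average(scores):
    """Mean of a metric over the online testing periods."""
    return sum(scores) / len(scores)


def relative_improvement(sml_score, baseline_score):
    """Relative improvement of SML over a baseline, in percent (assumed definition)."""
    return (sml_score - baseline_score) / baseline_score * 100.0


# Illustrative per-period metric values only; not taken from any reported table.
sml_per_period = [0.5, 0.75]
baseline_per_period = [0.25, 0.75]

ri = relative_improvement(average(sml_per_period), average(baseline_per_period))
print(f"RI = {ri:.1f}%")
```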