sobazaar | v1 | data from a social fashion app ‘Sobazaar’ developed by Telenor Digital. |
amazon electronics | v1 | amazon electronics ratings data. |
wesee tencent video | v1 | data from two popular video applications, “Wesee” and “Tencent Video”, both reported to have billions of daily active users. The dataset contains watching histories from both applications for three consecutive days, from June 26, 2020, to June 28, 2020. |
epinions | v1 | processed epinions Social Network Dataset. |
citeulike | v1 | processed citations data obtained from this site. |
taobao | v1 | taobao processed data, originally stored on Google Drive here. |
mind | v1 | processed MIND news data obtained from this site. |
xing | v1 | xing 2016 data, obtained from this site. |
movielens 1m | v1 | movielens 1m data. |
| v2 | slightly processed, obtained from here. |
anime | v1 | myanimelist zipped data. |
filmtrust | v1 | filmtrust csv data file. |
dota | v1 | dota heroes binary file. |
ifashion | v1 | processed alibaba ifashion data. |
| v2 | processed for graph neural net based models. |
retail rocket | v1 | processed data. |
| v2 | raw zipped. |
aotm | v1 | raw zipped art of the mix playlist dataset. |
30music | v1 | raw zipped. |
| v2 | raw zipped from a different source, with some variations. |
tmall | v1 | processed. |
| v2 | raw zipped. |
gowalla | v1 | processed gowalla social-networking data. |
| v2 | processed differently. |
retail session | v1 | zipped retail sessions data. |
retail general | v1 | general retail data files. |
beibei | v1 | beibei data files. |
adressa | v1 | adressa processed. |
yelp | v1 | processed. |
| v2 | processed differently. |
| v3 | train test splits. |
| v4 | processed differently. |
amazon books | v1 | processed. |
| v2 | train test splits. |
| v3 | processed differently. |
spotify | v1 | spotify music data. |
diginetica | v1 | diginetica miscellaneous. |
| v2 | processed. |
| v3 | raw zipped. |
| v4 | processed csv. |
ibm watson articles | v1 | user-article interactions on ibm watson studio. |
lastfm | v1 | lastfm data. |
| v2 | csv data file. |
nowplaying | v1 | processed. |
| v2 | raw zipped. |
| v3 | csv data file. |
weeplaces | v1 | random sample of 1k users. |
| v2 | split files. |
sample session | v1 | sample session data csv file. |
| v2 | processed txt file. |
wsdm 2022 | v1 | wsdm cup 2022 data on cross market recommendations. |
amazon music instruments | v1 | raw json file. |
sample ctr | v1 | split csv files. |
| v2 | similar to v1, but the sequence column also contains data. |
avazu | v1 | split csv files. |
criteo | v1 | sample txt files. |
| v2 | split csv files. |
mts kion | v1 | mts kion data. |
douban | v1 | combined dataset of music, movies and books. |
amazon beauty | v1 | ratings data. |
yoochoose | v1 | split processed txt files. |
| v2 | raw zipped. |
| v3 | csv file. |
| v4 | processed v2. |
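
Rows that begin with a pipe leave the dataset cell empty and inherit the name from the nearest named row above them. The sketch below shows one way to read the rows into a lookup keyed by dataset and version. It is only an illustration: the function name `parse_dataset_rows` is hypothetical and not part of any library, and it assumes descriptions never contain a pipe character.

```python
from typing import Dict, List, Optional, Tuple


def parse_dataset_rows(lines: List[str]) -> Dict[Tuple[str, str], str]:
    """Map (dataset, version) -> description, carrying the dataset name forward.

    Hypothetical helper for illustration; assumes no '|' inside descriptions.
    """
    catalog: Dict[Tuple[str, str], str] = {}
    current_name: Optional[str] = None
    for line in lines:
        text = line.strip()
        if not text:
            continue
        # A leading pipe marks a continuation row whose dataset cell is empty.
        is_continuation = text.startswith("|")
        cells = [cell.strip() for cell in text.strip("|").split("|")]
        if is_continuation:
            if current_name is None or len(cells) != 2:
                raise ValueError(f"unexpected continuation row: {line!r}")
            version, description = cells
        else:
            if len(cells) != 3:
                raise ValueError(f"unexpected row: {line!r}")
            current_name, version, description = cells
        catalog[(current_name, version)] = description
    return catalog


# A few rows copied from the table above.
rows = [
    "tmall | v1 | processed. |",
    "| v2 | raw zipped. |",
    "gowalla | v1 | processed gowalla social-networking data. |",
    "| v2 | processed differently. |",
]
catalog = parse_dataset_rows(rows)
print(catalog[("tmall", "v2")])  # raw zipped.
```

Keying on the `(dataset, version)` pair keeps versions of the same dataset distinct, since the same version label (`v1`, `v2`, ...) recurs across datasets.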