Datasets

| Title | Link | Description |
|---|---|---|
| sobazaar | v1 | data from the social fashion app ‘Sobazaar’, developed by Telenor Digital. |
| amazon electronics | v1 | amazon electronics ratings data. |
| wesee tencent video | v1 | data from two popular video applications, “Wesee” and “Tencent Video”, both with billions of daily active users. The dataset contains watch histories from both applications for three consecutive days, from June 26, 2020, to June 28, 2020. |
| epinions | v1 | processed epinions Social Network Dataset. |
| citeulike | v1 | processed citations data obtained from this site. |
| taobao | v1 | taobao processed data, originally stored in gdrive here. |
| mind | v1 | processed MIND news data obtained from this site. |
| xing | v1 | xing 2016 data, obtained from this site. |
| movielens 1m | v1 | movielens 1m data. |
| | v2 | slightly processed, obtained from here. |
| anime | v1 | myanimelist zipped data. |
| filmtrust | v1 | filmtrust csv data file. |
| dota | v1 | dota heroes binary file. |
| ifashion | v1 | processed alibaba ifashion data. |
| | v2 | processed for graph neural network based models. |
| retail rocket | v1 | processed data. |
| | v2 | raw zipped. |
| aotm | v1 | raw zipped art of the mix playlist dataset. |
| 30music | v1 | raw zipped. |
| | v2 | raw zipped from a different source, with some variations. |
| tmall | v1 | processed. |
| | v2 | raw zipped. |
| gowalla | v1 | processed gowalla social-networking data. |
| | v2 | processed differently. |
| retail session | v1 | zipped retail sessions data. |
| retail general | v1 | general retail data files. |
| beibei | v1 | beibei data files. |
| adressa | v1 | adressa processed. |
| yelp | v1 | processed. |
| | v2 | processed differently. |
| | v3 | train test splits. |
| | v4 | processed differently. |
| amazon books | v1 | processed. |
| | v2 | train test splits. |
| | v3 | processed differently. |
| spotify | v1 | spotify music data. |
| diginetica | v1 | diginetica miscellaneous. |
| | v2 | processed. |
| | v3 | raw zipped. |
| | v4 | processed csv. |
| ibm watson articles | v1 | user-article interactions on ibm watson studio. |
| lastfm | v1 | lastfm data. |
| | v2 | csv data file. |
| nowplaying | v1 | processed. |
| | v2 | raw zipped. |
| | v3 | csv data file. |
| weeplaces | v1 | random sample of 1k users. |
| | v2 | split files. |
| sample session | v1 | sample session data csv file. |
| | v2 | processed txt file. |
| wsdm 2022 | v1 | wsdm cup 2022 data on cross-market recommendations. |
| amazon music instruments | v1 | raw json file. |
| sample ctr | v1 | split csv files. |
| | v2 | similar to v1, but the sequence column is also populated. |
| avazu | v1 | split csv files. |
| criteo | v1 | sample txt files. |
| | v2 | split csv files. |
| mts kion | v1 | mts kion data. |
| douban | v1 | combined dataset of music, movies and books. |
| amazon beauty | v1 | ratings data. |
| yoochoose | v1 | split processed txt files. |
| | v2 | raw zipped. |
| | v3 | csv file. |
| | v4 | processed version of v2. |