Modeling Popularity and Temporal Drift of Music Genre Preferences

Elisabeth Lex; Dominik Kowald; Markus Schedl

doi:10.5334/tismir.39

Modeling Popularity and Temporal Drift of Music Genre Preferences

Transactions of the International Society for Music Information Retrieval

Volume 3 (2020): Issue 1

By: Elisabeth Lex, Dominik Kowald and Markus Schedl

Open Access

|Mar 2020

Figures & Tables

Table 1

Dataset statistics for the LowMS, MedMS, and HighMS Last.fm user groups. Here, |U| is the number of distinct users, |A| is the number of distinct artists, |G| is the number of distinct genres, |LE| is the number of listening events, |GA| is the number of genre assignments, |GA|/|LE| is the number of genre assignments per listening event, $\bar{G_{u}}$ is the average number of genres a user u has listened to, $\bar{MS}$ is the average mainstreaminess value, and $\bar{Age}$ is the average age of users in the group.

User Group	\|U\|	\|A\|	\|G\|	\|LE\|	\|GA\|	\|GA\|/\|LE\|	$\bar{G_{u}}$	$\bar{MS}$	$\bar{Age}$
LowMS	1,000	82,417	931	6,915,352	14,573,028	2.107	85.771	.125	24.582
MedMS	1,000	86,249	933	7,900,726	20,264,870	2.565	126.439	.379	25.352
HighMS	1,000	92,690	973	8,251,022	22,498,370	2.727	186.010	.688	21.486

Boxplots show the average pairwise user similarity in our user groups using the cosine similarity metric computed on the users’ genre distributions. While users in the LowMS group show a very individual listening behavior, users in the HighMS group tend to listen to similar music genres.

Number of listening events LE (in millions) for the top-30 genres of our LowMS, MedMS, and HighMS Last.fm user groups. We find that there are some dominating genres in the HighMS group, while the genre distribution in the LowMS group is more evenly distributed.

The effect of time on genre relistening behavior for the LowMS, MedMS, and HighMS Last.fm user groups. For all three groups, we find that the shorter the time since the last listening event of a genre, the higher its relistening count. Additionally, we plot the linear fits of the data and report the corresponding R² estimates as well as the slopes α. We can observe a very good fit of the data, which indicates that the data likely follows a power-law distribution.

Boxplots showing the average duration in days per user we have available in our three test sets. Across all three users groups, the average duration per user is evenly distributed with a median value of 11.8 days.

Recall/precision plots of the baselines and our *BLL_u* approach for the three user groups LowMS, MedMS, and HighMS. We see that *BLL_u* provides the best results for all groups and for all k = 1…10 predicted genres.

Table 2

Comparison of our five baselines as well as our approach based on the BLL equation for modeling and predicting music genre preferences. In this table, a “✔” indicates that a specific approach covers a specific feature. While TOP, CF_u and CF_i also consider collaboration among users (i.e., investigate listening events of all users), our BLL_u approach is the only one that is personalized and accounts for the features of popularity as well as temporal drifts.

Feature	TOP	CF_u	CF_i	POP_u	TIME_u	BLL_u
Personalization		✔	✔	✔	✔	✔
Collaboration	✔	✔	✔
Popularity	✔	✔	✔	✔		✔
Temporal drifts					✔	✔

Table 3

Genre prediction accuracy results of our study comparing our BLL_u approach with a group-based baseline (TOP), a user-based collaborative filtering baseline (CF_u), an item-based collaborative filtering baseline (CF_i), a frequency-based baseline (POP_u) and a recency-based baseline (TIME_u). For all three user groups (i.e., LowMS, MedMS, and HighMS), the combination of popularity and temporal drift of music genre preferences in the form of BLL_u provides the best results for all metrics. According to a t-test with α = .001, “***” indicates statistically significant differences between BLL_u and all other approaches for all user groups.

User group	Evaluation metric	TOP	CF_u	CF_i	POP_u	TIME_u	BLL_u
LowMS	F1@5	.108	.311	.341	.356	.368	.397***
LowMS	MRR@10	.101	.389	.425	.443	.445	.492***
MAP@10	.112	.461	.505	.533	.550	.601***
nDCG@10	.180	.541	.590	.618	.625	.679***
MedMS	F1@5	.196	.271	.284	.292	.293	.338***
MedMS	MRR@10	.146	.248	.264	.274	.272	.320***
MAP@10	.187	.319	.336	.351	.365	.419***
nDCG@10	.277	.419	.441	.460	.452	.523***
HighMS	F1@5	.247	.273	.266	.282	.228	.304***
HighMS	MRR@10	.188	.232	.229	.242	.201	.266***
MAP@10	.246	.304	.298	.314	.267	.348***
nDCG@10	.354	.413	.402	.429	.357	.462***

Recall/precision plot of our *BLL_u* approach for *k =* 1…10 predicted genres for the three user groups LowMS, MedMS and HighMS. We see that *BLL_u* provides good prediction accuracy results for all groups but especially in the LowMS setting. This shows that our approach is especially useful for predicting the music genre preferences of users with low mainstreaminess values.

Recall/precision plot for our *BLL_u* approach and our five baselines in a cold-start setting. We see that *BLL_u* also provides the best results in cases where users only have a few listening events available for training.

References

Authors

Metrics

Articles in this issue

DOI: https://doi.org/10.5334/tismir.39 | Journal eISSN: 2514-3298

Journal RSS Feed

Language: English

Submitted on: Jun 19, 2019

Accepted on: Nov 15, 2019

Published on: Mar 25, 2020

Published by: Ubiquity Press

In partnership with: Paradigm Publishing Services

Publication frequency: 1 issue per year

Keywords:

Music Genre Preference Prediction,

Music Recommendation,

Music Retrieval,

Personalized Music Access,

Time-Aware Recommendation,

ACT-R

© 2020 Elisabeth Lex, Dominik Kowald, Markus Schedl, published by Ubiquity Press
This work is licensed under the Creative Commons Attribution 4.0 License.

Volume 3 (2020): Issue 1

Modeling Popularity and Temporal Drift of Music Genre Preferences

Figures & Tables

Table 1

Figure 1

Figure 2

Figure 3

Figure 4

Figure 5

Table 2

Table 3

Figure 6

Figure 7

Paradigm

My account