Skip to main content
Have a personal or library account? Click to login
Modeling Popularity and Temporal Drift of Music Genre Preferences Cover

Modeling Popularity and Temporal Drift of Music Genre Preferences

Open Access
|Mar 2020

Figures & Tables

Table 1

Dataset statistics for the LowMS, MedMS, and HighMS Last.fm user groups. Here, |U| is the number of distinct users, |A| is the number of distinct artists, |G| is the number of distinct genres, |LE| is the number of listening events, |GA| is the number of genre assignments, |GA|/|LE| is the number of genre assignments per listening event, Gu¯ is the average number of genres a user u has listened to, MS¯ is the average mainstreaminess value, and Age¯ is the average age of users in the group.

User Group|U||A||G||LE||GA||GA|/|LE|Gu¯MS¯Age¯
LowMS1,00082,4179316,915,35214,573,0282.10785.771.12524.582
MedMS1,00086,2499337,900,72620,264,8702.565126.439.37925.352
HighMS1,00092,6909738,251,02222,498,3702.727186.010.68821.486
Figure 1

Boxplots show the average pairwise user similarity in our user groups using the cosine similarity metric computed on the users’ genre distributions. While users in the LowMS group show a very individual listening behavior, users in the HighMS group tend to listen to similar music genres.

Figure 2

Number of listening events LE (in millions) for the top-30 genres of our LowMS, MedMS, and HighMS Last.fm user groups. We find that there are some dominating genres in the HighMS group, while the genre distribution in the LowMS group is more evenly distributed.

Figure 3

The effect of time on genre relistening behavior for the LowMS, MedMS, and HighMS Last.fm user groups. For all three groups, we find that the shorter the time since the last listening event of a genre, the higher its relistening count. Additionally, we plot the linear fits of the data and report the corresponding R2 estimates as well as the slopes α. We can observe a very good fit of the data, which indicates that the data likely follows a power-law distribution.

Figure 4

Boxplots showing the average duration in days per user we have available in our three test sets. Across all three users groups, the average duration per user is evenly distributed with a median value of 11.8 days.

Figure 5

Recall/precision plots of the baselines and our BLLu approach for the three user groups LowMS, MedMS, and HighMS. We see that BLLu provides the best results for all groups and for all k = 1…10 predicted genres.

Table 2

Comparison of our five baselines as well as our approach based on the BLL equation for modeling and predicting music genre preferences. In this table, a “✔” indicates that a specific approach covers a specific feature. While TOP, CFu and CFi also consider collaboration among users (i.e., investigate listening events of all users), our BLLu approach is the only one that is personalized and accounts for the features of popularity as well as temporal drifts.

FeatureTOPCFuCFiPOPuTIMEuBLLu
Personalization
Collaboration
Popularity
Temporal drifts
Table 3

Genre prediction accuracy results of our study comparing our BLLu approach with a group-based baseline (TOP), a user-based collaborative filtering baseline (CFu), an item-based collaborative filtering baseline (CFi), a frequency-based baseline (POPu) and a recency-based baseline (TIMEu). For all three user groups (i.e., LowMS, MedMS, and HighMS), the combination of popularity and temporal drift of music genre preferences in the form of BLLu provides the best results for all metrics. According to a t-test with α = .001, “***” indicates statistically significant differences between BLLu and all other approaches for all user groups.

User groupEvaluation metricTOPCFuCFiPOPuTIMEuBLLu
LowMSF1@5.108.311.341.356.368.397***
MRR@10.101.389.425.443.445.492***
MAP@10.112.461.505.533.550.601***
nDCG@10.180.541.590.618.625.679***
MedMSF1@5.196.271.284.292.293.338***
MRR@10.146.248.264.274.272.320***
MAP@10.187.319.336.351.365.419***
nDCG@10.277.419.441.460.452.523***
HighMSF1@5.247.273.266.282.228.304***
MRR@10.188.232.229.242.201.266***
MAP@10.246.304.298.314.267.348***
nDCG@10.354.413.402.429.357.462***
Figure 6

Recall/precision plot of our BLLu approach for k = 1…10 predicted genres for the three user groups LowMS, MedMS and HighMS. We see that BLLu provides good prediction accuracy results for all groups but especially in the LowMS setting. This shows that our approach is especially useful for predicting the music genre preferences of users with low mainstreaminess values.

Figure 7

Recall/precision plot for our BLLu approach and our five baselines in a cold-start setting. We see that BLLu also provides the best results in cases where users only have a few listening events available for training.

DOI: https://doi.org/10.5334/tismir.39 | Journal eISSN: 2514-3298
Language: English
Submitted on: Jun 19, 2019
Accepted on: Nov 15, 2019
Published on: Mar 25, 2020
Published by: Ubiquity Press
In partnership with: Paradigm Publishing Services
Publication frequency: 1 issue per year

© 2020 Elisabeth Lex, Dominik Kowald, Markus Schedl, published by Ubiquity Press
This work is licensed under the Creative Commons Attribution 4.0 License.