Cookies?
Library Header Image
LSE Research Online LSE Library Services

Popularity prediction for social media over arbitrary time horizons

Haimovich, Daniel, Karamshuk, Dima, Leeper, Thomas J., Riabenko, Evgeniy and Vojnovic, Milan ORCID: 0000-0003-1382-022X (2021) Popularity prediction for social media over arbitrary time horizons. Proceedings of the VLDB Endowment, 15 (4). 841 - 849. ISSN 2150-8097

[img] Text (3503585.3503593) - Published Version
Available under License Creative Commons Attribution Non-commercial No Derivatives.

Download (1MB)

Identification Number: 10.14778/3503585.3503593

Abstract

Predicting the popularity of social media content in real time requires approaches that efficiently operate at global scale. Popularity prediction is important for many applications, including detection of harmful viral content to enable timely content moderation. The prediction task is difficult because views result from interactions between user interests, content features, resharing, feed ranking, and network structure. We consider the problem of accurately predicting popularity both at any given prediction time since a content item’s creation and for arbitrary time horizons into the future. In order to achieve high accuracy for different prediction time horizons, it is essential for models to use static features (of content and user) as well as observed popularity growth up to prediction time. We propose a feature-based approach based on a self-excited Hawkes point process model, which involves prediction of the con-tent’s popularity at one or more reference horizons in tandem with a point predictor of an effective growth parameter that reflects the timescale of popularity growth. This results in a highly scalable method for popularity prediction over arbitrary prediction time horizons that also achieves a high degree of accuracy, compared to several leading baselines, on a dataset of public page content on Facebook over a two-month period, covering billions of content views and hundreds of thousands of distinct content items. The model has shown competitive prediction accuracy against a strong baseline that consists of separately trained models for specific prediction time horizons.

Item Type: Article
Official URL: https://dl.acm.org/journal/pvldb
Additional Information: © 2021 The Authors
Divisions: Statistics
Subjects: H Social Sciences > HM Sociology
Date Deposited: 01 Jun 2022 15:33
Last Modified: 20 Dec 2024 00:44
URI: http://eprints.lse.ac.uk/id/eprint/115272

Actions (login required)

View Item View Item

Downloads

Downloads per month over past year

View more statistics