Modeling Social Media Memes as a Contagious Process

Similar documents
Thursday. Threshold and Sensitivity Analysis

Diffusion of information and social contagion

Epidemics and information spreading

Inferring the origin of an epidemic with a dynamic message-passing algorithm

KINETICS OF COMPLEX SOCIAL CONTAGION. János Kertész Central European University. Pohang, May 27, 2016

Web Structure Mining Nodes, Links and Influence

KINETICS OF SOCIAL CONTAGION. János Kertész Central European University. SNU, June

The Spreading of Epidemics in Complex Networks

Time varying networks and the weakness of strong ties

Global Stability of a Computer Virus Model with Cure and Vertical Transmission

Modeling, Analysis, and Control of Information Propagation in Multi-layer and Multiplex Networks. Osman Yağan

Three Disguises of 1 x = e λx

CSCI 3210: Computational Game Theory. Cascading Behavior in Networks Ref: [AGT] Ch 24

Virgili, Tarragona (Spain) Roma (Italy) Zaragoza, Zaragoza (Spain)

arxiv: v2 [physics.soc-ph] 5 Sep 2016

Lecture VI Introduction to complex networks. Santo Fortunato

Machine Learning and Modeling for Social Networks

Supplementary Information Activity driven modeling of time varying networks

Epidemics in Networks Part 2 Compartmental Disease Models

Any live cell with less than 2 live neighbours dies. Any live cell with 2 or 3 live neighbours lives on to the next step.

Mathematical Analysis of Epidemiological Models: Introduction

MATHEMATICAL MODELS Vol. III - Mathematical Models in Epidemiology - M. G. Roberts, J. A. P. Heesterbeek

CS224W: Analysis of Networks Jure Leskovec, Stanford University

Australian Journal of Basic and Applied Sciences

Preventive behavioural responses and information dissemination in network epidemic models

arxiv: v3 [physics.soc-ph] 14 Nov 2018

SOCIAL MEDIA IN THE COMMUNICATIONS CENTRE

Epidemics in Complex Networks and Phase Transitions

MobiHoc 2014 MINIMUM-SIZED INFLUENTIAL NODE SET SELECTION FOR SOCIAL NETWORKS UNDER THE INDEPENDENT CASCADE MODEL

Identifying critical nodes in multi-layered networks under multi-vector malware attack Rafael Vida 1,2,3, Javier Galeano 3, and Sara Cuenda 4

arxiv: v2 [physics.soc-ph] 27 Apr 2015

arxiv: v1 [cs.si] 14 Dec 2018

Brief Glimpse of Agent-Based Modeling

Theory and Methods of Statistical Inference. PART I Frequentist likelihood methods

Modelling of the Hand-Foot-Mouth-Disease with the Carrier Population

Dynamical models of HIV-AIDS e ect on population growth

Reducing Computation Time for the Analysis of Large Social Science Datasets

Social Influence in Online Social Networks. Epidemiological Models. Epidemic Process

Simulating stochastic epidemics

Reliability and Efficiency of Generalized Rumor Spreading Model on Complex Social Networks

Personalized Social Recommendations Accurate or Private

Distributed Systems Gossip Algorithms

Role of GIS in Tracking and Controlling Spread of Disease

Transmission in finite populations

A Mathematical Analysis on the Transmission Dynamics of Neisseria gonorrhoeae. Yk j N k j

WITH the recent advancements of information technologies,

arxiv: v1 [cs.si] 16 Jun 2016

Modelling spatio-temporal patterns of disease

Theory and Methods of Statistical Inference. PART I Frequentist theory and methods

Mathematical Epidemiology Lecture 1. Matylda Jabłońska-Sabuka

Lecture 10. Under Attack!

GlobalStabilityofStificrsImpactonMemeTransmissionModel. Global Stability of Stificrs Impact on Meme Transmission Model

Theory and Methods of Statistical Inference

Models of Infectious Disease Formal Demography Stanford Summer Short Course James Holland Jones, Instructor. August 15, 2005

GEMF: GENERALIZED EPIDEMIC MODELING FRAMEWORK SOFTWARE IN PYTHON

Agent-Based Models: Statistical Issues and Challenges

A Class of Rumor Spreading Models with Population Dynamics

Trends, Helpouts and Hidden Gems: Additional Google Tools to Strengthen Your Brand

ANALYSIS OF DIPHTHERIA DISSEMINATION BY USING MULTI GROUPS OF DYNAMIC SYSTEM METHOD APPROACH

Fixed Point Analysis of Kermack Mckendrick SIR Model

Quarantine generated phase transition in epidemic spreading. Abstract

Detectability of Patient Zero Depending on its Position in the Network

Diffusion of Innovation and Influence Maximization

CS224W: Social and Information Network Analysis Jure Leskovec, Stanford University

Network Science & Telecommunications

The Friendship Paradox in Scale-Free Networks

Displaying Scientific Evidence for Making Valid Decisions: Lessons from Two Case Studies

Electronic appendices are refereed with the text. However, no attempt has been made to impose a uniform editorial style on the electronic appendices.

Degree Distribution: The case of Citation Networks

Comparing two samples

COOPERATIVE EPIDEMIC SPREADING IN COMPLEX NETWORKS

Modeling Controversy Within Populations

Non-Linear Models Cont d: Infectious Diseases. Non-Linear Models Cont d: Infectious Diseases

Introduction to SEIR Models

Spam ain t as Diverse as It Seems: Throttling OSN Spam with Templates Underneath

Section 8.1 Def. and Examp. Systems

New Journal of Physics

Finding central nodes in large networks

A Framework of Detecting Burst Events from Micro-blogging Streams

Confidence Limits and Intervals 3: Various other topics. Roger Barlow SLUO Lectures on Statistics August 2006

Epidemic Modeling with Contact Heterogeneity and Multiple Routes of Transmission

Epidemic reemergence in adaptive complex networks

ECS 289 / MAE 298, Lecture 7 April 22, Percolation and Epidemiology on Networks, Part 2 Searching on networks

Analysis and Control of Epidemics

Social Networks. Chapter 9

On a stochastic epidemic SEIHR model and its diffusion approximation

Network Models in Epidemiology

Understanding the contribution of space on the spread of Influenza using an Individual-based model approach

Week 12: Visual Argument (Visual and Statistical Thinking by Tufte) March 28, 2017

IDENTIFYING INFECTION SOURCES IN A NETWORK LUO WUQIONG

Modeling Dynamic Evolution of Online Friendship Network

A note on modeling retweet cascades on Twitter

arxiv: v1 [physics.soc-ph] 6 Nov 2013

Analysis of Numerical and Exact solutions of certain SIR and SIS Epidemic models

DM-Group Meeting. Subhodip Biswas 10/16/2014

Numerical solution of stochastic epidemiological models

arxiv: v2 [physics.soc-ph] 3 May 2018

Complex contagions and hybrid phase transitions

Degree (k)

Bayesian Inference for Contact Networks Given Epidemic Data

Transcription:

Modeling Social Media Memes as a Contagious Process S.Towers 1,, A.Person 2, C.Castillo-Chavez 1 1 Arizona State University, Tempe, AZ, USA 2 Some University, Nowhereville, NE, USA E-mail: smtowers@asu.edu Abstract In recent years, memes have become increasingly prevalent on the Internet and social media platforms such as Facebook and Twitter. Understanding the dynamics behind the spread of memes has important implications for understanding the spread of general information on social media, and how the spread of such information might be promoted or suppressed. In this analysis, we use a mathematical model to simulate the spread of memes as a contagious process among a heterogenous population in which one group is quick to be infected with the meme (ie; they spread the meme on social media), and quick to recover (ie; they quickly cease spreading the meme). A second group in the population is slower to begin spreading the meme, and slower to stop spreading it. We will refer to the former group as Hip, and the latter group as Grannies. We show that the contagion model has good explanatory capabilities for describing the spread of popular memes on the Internet. 1 Introduction Here is where you motivate the analysis: why do we care about memes? While memes might seem like a frivolous topic, understanding the dynamics behind their spread has important implications for understanding the spread of other information, including rumours, on the Internet and social media. There are many people and/or companies that would like to exploit these dynamics to maximize the spread of their message, such as advertising as product, or maximizing their influence on the way a population thinks. In the motivation, we would include references to relevant literature. For instance, 1

Reference [1] used a network model to simulate the spread of information on networks; they developed criteria for identifying influential spreaders of information, and point out that their work has implications for both sociological studies and the design of efficient commercial viral processes. Reference [3] also examines the issue of treating memes as a contagious process. Here is where you give the objective of the analysis. Here we will use a mathematical compartmental contagion model of a heterogenous population to simulate the spread of memes in a population that is composed of Hip users who are quick to adopt popular practices, but also quick to abandon them in favour of a newly emerging popular practice. The population is also composed of Granny users who are much slower to adopt popular practices, but also much slower torealizethatthepracticeisnolongerpopularamongthe Hip, andthusareslowertoceasethepractice. We examine various memes that have been popular in recent years, and show that the mathematical model provides a good fit to the data. We estimate that YYY% of the population are Hip, and the remainder are Grannies. The reproduction number of the memes ranges from YY to ZZ. In the following sections we describe the sources of data used in this study, and the mathematical model employed, followed by Results and Discussion. 2 Methods 2.1 Data In this analysis we examine the following memes: Good Guy Greg, Bad Luck Brian, Scumbag Steve, Seems Legit. Data used in this analysis consist of the time series of Google search trend data in the US between 2010 to present for each of the memes, as shown in Figure 1. 2

Figure 1: Time series of monthly Google search trend data related to the memes examined in this analysis. 2.2 Mathematical Contagion Model In our analysis, we model the spread of memes on the Internet using a compartmental Susceptible, Infected, Recovered (SIR) mathematical model with two groups that can be infected. On group (the Hip ) is quick to adopt a meme, but quick to cease spreading it. Another group (the Grannies ) is slower to adopt, and slower to cease spreading it. The model equations are: S hip = β hh S hip I hip /N hip S granny = β hg S granny I hip /N hip β gg S granny I granny /N granny I hip = +βs hip I hip /N hip γ h I hip I granny = +β hg S granny I hip /N hip +β gg S granny I granny /N granny γ g I granny R hip = γ h I hip R granny = γ g I granny (1) In our model, we assume that Hip people can never be infected by a Granny, but Grannies can be infected bythehip. Wealsoignorevitaldynamicsinthepopulation,undertheassumptionthatthedurationofthe meme epidemic is much shorter than the average human lifetime, thus the population size is constant, and equal to N = N hip +N granny, with N hip = S hip +I hip +R hip and N granny = S granny +I granny +R granny. 3

Table 1: Compartments and parameters of the mathematical contagion model of Equations 1. In the analysis, we fit the parameters of the model by comparing the model prediction of the total prevalence, I hip +I granny, to the observed total prevalence as determined from the Google Trends data. Compartment Description S hip Number susceptible Hip people I hip Number infected Hip people R hip Number recovered Hip people S granny Number susceptible Granny people I granny Number infected Granny people R granny Number recovered Granny people Parameter β hh Transmission rate between Hip people β hg Transmission rate from Hip people to Grannies β gg Transmission rate between Grannies γ h Recovery rate of Hip people Recovery rate of Grannies people γ g The parameter β hh is the contact rate, sufficient to transmit infection, among the Hip population. The parameter β hg is the transmission rate between Hip and Grannies, and β gg is the transmission rate among Grannies. The parameters γ h and γ g are the recovery rates of the Hip and Grannies, respectively. We assume that γ h > γ g. A summary of the model parameters is shown in Table 1. Here is where I would also put in a figure showing the compartmental diagram. Here is also where I would discuss the mathematical analysis of the model, and the expression for the reproduction number, R 0. 2.3 Statistical Methods To estimate the model parameters that optimally describe each data sample, we use the Monte Carlo method for solution of inverse problems to randomly sample the parameters of the model from broad uniform distributions, and calculate the Pearson χ 2 goodness-of-fit statistic comparing the model to the data sample [2, 5]. The uniform distribution sampling range is chosen to be large enough to ultimately include the parameter optimal value and at least a ±5 standard deviation range about that value. In order to determine the parameter hypotheses that minimize the Pearson χ 2 goodness-of-fit statistic, this procedure is repeated at least one million times To determine the parameter 95% confidence intervals, the Pearson χ 2 statistic is corrected for over-dispersion using the ansatz of McCullagh and Nelder (1989) [4]. 4

3 Results and Discussion 4 Summary References [1] Javier Borge-Holthoefer and Yamir Moreno, Absence of influential spreaders in rumor dynamics, Physical Review E 85 (2012), no. 2, 026116. [2] Glen Cowan, Statistical data analysis, Oxford University Press, 1998. [3] Patrick Grim, Daniel J Singer, Christopher Reade, and Steven Fisher, Germs, genes, and memes: Function and fitness dynamics on information networks, Philosophy of Science 82 (2015), no. 2, 219 243. [4] Peter McCullagh and John A Nelder, Generalized linear models., London England Chapman and Hall 1983., 1989. [5] Klaus Mosegaard and Albert Tarantola, Monte carlo sampling of solutions to inverse problems, Journal of Geophysical Research: Solid Earth (1978 2012) 100 (1995), no. B7, 12431 12447. 5