
1 Introduction

Social media has empowered human society in many ways. It is easier than ever to keep in touch with those we wish to, allowing an enormous variety of relationships to transcend physical isolation [19]. More than ever before, social media bears responsibility for our mental wellbeing, as the arbiter of interactions between colleagues, friends and loved ones [13, 24]. It is therefore of the utmost importance to make these platforms a safe environment, protected against those who wish to corrupt the service with fake news [20].

Various methods have been used to tackle the misinformation problem. Content-based misinformation analysis models apply natural language processing tools to the text content of claims [23]. On their own, content-based models fail to trace the dynamics of spread for tasks such as early detection or spread forecasting. Recent misinformation analysis models use static graph neural networks to extract geometric propagation patterns; others leverage time-series analysis by treating misinformation spread as a temporal event sequence [4, 15]. However, each of these approaches neglects the other's view of propagation: neither leverages geometric and temporal dissemination features together.

Propagation-based misinformation analysis makes use of patterns that can be attributed to the dynamics of spread. Our principal goal is to exploit as much of this spreading information as possible, so as to make the most effective use of the available data. Specifically, we first formulate misinformation propagation as a dynamic graph, then we employ a continuous-time temporal point process to extract both the temporal evolution patterns and the geometric features. Furthermore, we use a power law to model the growth of the temporal network scale, so as to forecast the future rate of spread for a claim identified as misinformation. The contributions of this study can thus be summarised as follows. (i) We formulate misinformation propagation as a dynamic graph. (ii) We design temporal point processes (TPPs) that utilise both temporal and geometric features of the dynamic graph for misinformation detection. (iii) This study is the first to introduce forecasting of user engagements to misinformation analysis.

2 Related Work

To distinguish true from false statements, most researchers work from three angles: textual content, multimedia features and social context. Misinformation often contains opinionated language [2], which motivates textual content-based detection [1]. Sentiment features such as positive words (e.g., love, sweet) and negating words (e.g., not, never) are reported to help detect rumours [6]. Misinformation also relies on sensational images to provoke an emotional response in consumers. For example, deepfake techniques [3] employ deep learning to generate fake images and videos that convey misleading information.

In social media, every piece of news is correlated to other posts and users. User engagements (e.g., commenting) provide rich reference evidence in two ways: by aggregation with relevant posts for a specific affair, and by temporal evolution. The first way relies on the “wisdom of crowds” to locate potential misinformation [1], while the second way captures temporal propagation patterns. For example, Hawkes processes are used to analyze how user stance changes temporally in [11]. However, these methods neglect geometric propagation features.

Graph neural networks can extract geometric propagation patterns. Graph Convolutional Networks (GCN) are used in [14] to encapsulate the propagation structure of heterogeneous data. The Graph-Aware Co-Attention Network (GCAN) is proposed in [4, 8] to apply a co-attention mechanism to graph modeling. However, each of these works uses static graphs and neglects temporal information.

3 Problem Formulation

This section gives definitions and describes notation. A source claim takes the form of \({{\varvec{c}}} = ({{\varvec{x}}}, t)\), where \({{\varvec{x}}}\) is a concatenation of the posting user account features and the claim’s text features, i.e. \({{\varvec{x}}} = [{{\varvec{u}}} \mid \mid {{\varvec{M}}}]\). Here, \({{\varvec{u}}}\) is the user account representation and \({\varvec{M}}\) is the text message representation. t is initially zero, as ensuing dissemination events are timestamped with respect to the source claim.

Suppose the claim \({\varvec{c}}\) is accompanied by a sequence of social engagements \(\mathcal {S}=\{{\varvec{v}}_{1}, {\varvec{v}}_{2}, \dots , {\varvec{v}}_{j}, \dots , {\varvec{v}}_{N}\}\), where \({\varvec{v}}_{j} = ({\varvec{x}}_{j}, t_j)\). Similarly, \({\varvec{x}}_j\) is the feature of an engaging node and \(t_{j}\) is the engagement time with respect to claim post time. Social engagements include all forms of interactions that users conduct with claims on social media platforms, such as reposting, commenting and tagging.

Our temporal, dynamic graph is represented as a sequence of time-stamped snapshots \(\mathcal {G} = \{\mathcal {G}(t_0), \mathcal {G}(t_1), \cdots , \mathcal {G}(t_j), \cdots , \mathcal {G}(t_N)\}\), where the first snapshot simply represents the source claim node and further snapshots are added with each representing the state of the dissemination network when a new node is connected. Let \(\mathcal {G}(t)= \langle \mathcal {V}(t), \mathcal {E}(t)\rangle \) denote the state of the temporal graph \(\mathcal {G}\) at time t, where \(\mathcal {V}(t) = \{{\varvec{c}}, {\varvec{v}}_{1}, {\varvec{v}}_{2}, \dots , {\varvec{v}}_{j}, \dots , {\varvec{v}}_{N(t)}\}\), with N(t) being the number of nodes that have directly or indirectly interacted with the claim \({\varvec{c}}\) as of time t. A new graph snapshot \(\mathcal {G}(t_{j+1})\) is generated when a node \({\varvec{v}}_{j+1}\) is added to the sequence of social engagements. The graph structure of an example false claim's dissemination tree is shown in Fig. 1.
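For concreteness, the formulation above can be mirrored by a small data structure. The following Python sketch is illustrative only: the class and method names are ours, not from the paper's implementation, and node features are assumed to be plain NumPy vectors.

```python
from dataclasses import dataclass, field
from typing import List, Tuple
import numpy as np

@dataclass
class Node:
    x: np.ndarray  # feature vector: [user features || text features] for the claim,
                   # or engaging-node features for an engagement
    t: float       # timestamp relative to the source claim (the claim itself has t = 0)

@dataclass
class TemporalGraph:
    nodes: List[Node] = field(default_factory=list)             # V(t), the claim node first
    edges: List[Tuple[int, int]] = field(default_factory=list)  # E(t), (parent, child) index pairs

    def add_engagement(self, node: Node, parent: int) -> None:
        """Adding an engagement node yields the next snapshot G(t_{j+1})."""
        self.nodes.append(node)
        self.edges.append((parent, len(self.nodes) - 1))

    def snapshot_size(self) -> int:
        """N(t): number of nodes that have interacted with the claim so far."""
        return len(self.nodes)
```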

Fig. 1. Graph representation of a source claim's dissemination tree, where nodes represent interaction events such as comments and retweets.

4 Model Description

With the temporal evolution of the propagation graph \(\mathcal {G}(t)\), new engagement nodes establish edges with existing nodes and thus update the graph. To capture both geometric and temporal propagation features, we view the addition of new engagement nodes as chronological events and develop a temporal point process that generates node embeddings of the dynamic graph \(\mathcal {G}(t)\).

4.1 Propagation by Temporal Point Processes

A temporal point process (TPP) is a stochastic process that is realised as a list of discrete events in the continuous time domain \(t \in \mathbb {R}^{+}\). TPPs usually rely on an intensity function, which is defined as the probability of the occurrence of an event in an infinitesimal time interval [22], to describe the temporal dynamics. They have been used to model dynamic graphs in [10, 17, 25].
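For reference, the conditional intensity mentioned above is conventionally written as the instantaneous event rate given the history \(\mathcal {H}(t)\); this is the standard textbook definition rather than anything specific to our model,

$$\begin{aligned} \lambda (t \mid \mathcal {H}(t)) = \lim _{\Delta t \rightarrow 0} \frac{\mathbb {P}\left( \text {an event occurs in } [t, t+\Delta t) \mid \mathcal {H}(t)\right) }{\Delta t}. \end{aligned}$$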

In our propagation-graph use case, each timestamped event corresponds to a new engagement node joining the graph, and the full event sequence builds the propagation graph whose final snapshot is the complete misinformation dissemination tree. Symbolically, \(\mathcal {S} = \{({\varvec{x}}_j, t_j)\}^N_{j=1}\), where \({\varvec{x}}_j\) are the event features (previously node features) and \(t_j\) is the timestamp of the \(j^{th}\) event in the sequence \(\mathcal {S}\). Intuitively, the added edge \({\varvec{e}}_{i,j}\) between the source node \({\varvec{v}}_i\) and the new node \({\varvec{v}}_j\) is influenced not only by \({\varvec{v}}_i\) and \({\varvec{v}}_j\), but also by the history nodes of \({\varvec{v}}_i\). With this assumption, we define the intensity function associated with adding the new edge \({\varvec{e}}_{i,j}\) as,

$$\begin{aligned} \lambda _{i,j}(t) = g({\varvec{x}}_i, {\varvec{x}}_j) + \sum _{i' \in \mathcal {H}^{i}}\alpha _{i'j}(t) f({\varvec{x}}_{i'}, {\varvec{x}}_j) \kappa (t - t_{i'}). \end{aligned}$$
(1)

where \(\mathcal {H}^i\) contains the history events of node i. The affinity between two nodes is computed by the bilinear function \(f(\cdot )\) with the trainable parameter \(\mathbf {W}_1\), i.e., \(f({\varvec{x}}_{i}, {\varvec{x}}_{j}) = {\varvec{x}}_{i}^{\top } \mathbf {W}_1 {\varvec{x}}_{j}\). A non-linear ReLU activation then defines the base intensity \(g(\cdot ) = \mathrm {ReLU}(f(\cdot ))\).
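As a minimal PyTorch sketch, the affinity \(f(\cdot )\) and base intensity \(g(\cdot )\) can be written as follows; function names and tensor shapes are our own assumptions, not the authors' code.

```python
import torch
import torch.nn.functional as F

def affinity(x_i: torch.Tensor, x_j: torch.Tensor, W1: torch.Tensor) -> torch.Tensor:
    """Bilinear interaction f(x_i, x_j) = x_i^T W_1 x_j with trainable W_1 of shape (d, d)."""
    return x_i @ W1 @ x_j

def base_intensity(x_i: torch.Tensor, x_j: torch.Tensor, W1: torch.Tensor) -> torch.Tensor:
    """Base intensity g(.) = ReLU(f(.))."""
    return F.relu(affinity(x_i, x_j, W1))
```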

The influence from history nodes is measured via the self-attention mechanism proposed in [21, 22]. For the history nodes before time t, we calculate an attention weight for each node,

$$\begin{aligned} \alpha _{i'j} = \frac{\exp (f({\varvec{x}}_{i'}, {\varvec{x}}_j))}{\sum _{k \in \mathcal {H}^{i}} \exp (f({\varvec{x}}_{k}, {\varvec{x}}_j))}. \end{aligned}$$
(2)
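A hedged sketch of Eq. (2): a softmax over the bilinear affinities between each history node of \({\varvec{v}}_i\) and the new node \({\varvec{v}}_j\). The shapes and names below are illustrative.

```python
import torch

def attention_weights(hist_x: torch.Tensor, x_j: torch.Tensor, W1: torch.Tensor) -> torch.Tensor:
    """hist_x: (H, d) features of the history nodes of v_i; x_j: (d,); returns (H,) weights."""
    scores = hist_x @ W1 @ x_j            # f(x_{i'}, x_j) for every history node
    return torch.softmax(scores, dim=0)   # alpha_{i'j}, Eq. (2)
```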

With the intensity function, we derive the probability of having a new node \({\varvec{v}}_j\) following an existing node \({\varvec{v}}_i\) at the timestamp t,

$$\begin{aligned} p\left( {\varvec{v}}_i,{\varvec{v}}_j \mid \mathcal {H}^{i}(t)\right) =\frac{\lambda _{i, j}(t)}{\sum _{i^{\prime } \in \mathcal {H}^{i}(t)} \lambda _{i^{\prime }, j}(t)}. \end{aligned}$$
(3)
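Putting Eqs. (1)–(3) together, one possible PyTorch sketch is shown below. The decay kernel \(\kappa \) is assumed to be exponential, which the text does not specify, and all names are illustrative.

```python
import torch
import torch.nn.functional as F

def intensity(x_i, x_j, hist_x, hist_t, t, W1, decay=1.0):
    """Eq. (1): base intensity plus attention-weighted, time-decayed history influence."""
    base = F.relu(x_i @ W1 @ x_j)               # g(x_i, x_j)
    scores = hist_x @ W1 @ x_j                  # f(x_{i'}, x_j) for each history node
    alpha = torch.softmax(scores, dim=0)        # Eq. (2)
    kappa = torch.exp(-decay * (t - hist_t))    # assumed exponential kernel kappa(t - t_{i'})
    return base + (alpha * scores * kappa).sum()

def edge_probability(lambda_ij: torch.Tensor, lambda_candidates: torch.Tensor) -> torch.Tensor:
    """Eq. (3): normalise the intensity of edge (i, j) over the candidate history sources."""
    return lambda_ij / lambda_candidates.sum()
```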

The objective function to minimize is the negative log-likelihood of all the events in the sequence, \( \mathcal {L}_{TPP}=-\sum _{t \in \mathcal {T}} \sum _{({\varvec{v}}_i,{\varvec{v}}_j, t) \in \mathcal {E}} \log p\left( {\varvec{v}}_i,{\varvec{v}}_j \mid \mathcal {H}^{i}(t)\right) . \) Negative sampling is used to generate non-existing edges in the objective function as done in [9], so that the learnt node embeddings are able to distinguish which two nodes are connected and which two are not, i.e., the geometric structure. Maximizing the intensity at occurrence timestamps while minimizing the intensity otherwise will enforce the node embeddings to capture temporal dynamics.
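A hedged sketch of how the objective with negative sampling might be computed; the specific penalty applied to sampled non-edges is our own choice and is not taken from [9].

```python
import torch

def tpp_loss(pos_prob: torch.Tensor, neg_prob: torch.Tensor, eps: float = 1e-8) -> torch.Tensor:
    """pos_prob: p(v_i, v_j | H^i(t)) for observed edges (Eq. 3);
    neg_prob: the same quantity evaluated on sampled non-existing edges."""
    nll = -torch.log(pos_prob + eps).sum()             # L_TPP over observed events
    contrast = -torch.log(1.0 - neg_prob + eps).sum()  # push down probability of sampled non-edges
    return nll + contrast
```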

4.2 Predictive Task

Macro-dynamics describe the evolution pattern of the network scale. We assume the network scale can be described with a certain dynamics equation. Given a dynamic graph \(\mathcal {G}\), we have the cumulative number of nodes N(t) by timestamp t. We empirically find that N(t) increases in a power law, which is presented in Sect. 5. To approximate the power law, we define the following predictive equation

$$\begin{aligned} \hat{N}(t) = N_{max}*(1 - \alpha *\exp (-\beta *t)), \end{aligned}$$
(4)

where \(N_{max}\), \(\alpha \) and \(\beta \) are learnable parameters. \(N_{max}\) is the maximum number of nodes that this graph will contain while \(\alpha \) and \(\beta \) control how fast the graph scale will increase. Predictive loss is measured by \(\mathcal {L}_{Pred} = (N(t) - \hat{N}(t))^2\).
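A small PyTorch sketch of Eq. (4) and the predictive loss; the initial parameter values are placeholders, and summing the squared error over observed timestamps is our assumption.

```python
import torch
import torch.nn as nn

class ScalePredictor(nn.Module):
    """N_hat(t) = N_max * (1 - alpha * exp(-beta * t)), Eq. (4)."""
    def __init__(self):
        super().__init__()
        self.N_max = nn.Parameter(torch.tensor(100.0))  # asymptotic graph size (placeholder init)
        self.alpha = nn.Parameter(torch.tensor(1.0))
        self.beta = nn.Parameter(torch.tensor(0.1))

    def forward(self, t: torch.Tensor) -> torch.Tensor:
        return self.N_max * (1.0 - self.alpha * torch.exp(-self.beta * t))

def pred_loss(N_true: torch.Tensor, N_hat: torch.Tensor) -> torch.Tensor:
    """L_Pred, summed over the observed timestamps."""
    return ((N_true - N_hat) ** 2).sum()
```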

4.3 Veracity Classification

We have designed a temporal point process to capture the geometric structure and temporal evolution of the propagation graph. With the node embeddings, we obtain the graph embedding by concatenating the mean pooling and the maximum pooling of all nodes as well as the source claim being verified, \( {\varvec{x}}_G = \left[ MeanPool(\mathcal {S}) || MaxPool(\mathcal {S}) || {\varvec{c}} \right] . \) The graph embedding is then concatenated with the parameters of the predictive task, i.e., \({\varvec{x}} = [{\varvec{x}}_G || N_{max} || \alpha || \beta ]\). The veracity prediction is conducted by a Multi-Layer Perceptron (MLP), \( \hat{\mathbf {y}}={\text {softmax}}\left( {\text {ReLU}}\left( \mathbf { W}_{2}{\varvec{x}}+\mathbf {b}\right) \right) , \) where \(\mathbf {W}_2\) and \(\mathbf {b}\) are trainable parameters. The classification loss is the cross-entropy: \( \mathcal {L}_{MLP}=-y \log \left( \hat{y}_{1}\right) -(1-y) \log \left( \hat{y}_{0}\right) . \) We take the weighted sum of the TPP loss, the predictive loss and the MLP loss as the final loss function, \( \mathcal {L} = \mathcal {L}_{TPP} + \omega _1 * \mathcal {L}_{Pred} + \omega _2 * \mathcal {L}_{MLP}. \)
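The readout and classifier described above can be sketched as follows; the feature dimensions, the pooling over the engagement embeddings, and the loss weights are placeholders rather than the authors' exact configuration.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class VeracityClassifier(nn.Module):
    def __init__(self, node_dim: int):
        super().__init__()
        # input: [mean pool || max pool || claim embedding || N_max || alpha || beta]
        self.W2 = nn.Linear(3 * node_dim + 3, 2)

    def forward(self, node_emb, claim_emb, N_max, alpha, beta):
        # node_emb: (N, d) engagement embeddings; claim_emb: (d,);
        # N_max, alpha, beta: 0-dim tensors from the macro-dynamics predictor
        x_G = torch.cat([node_emb.mean(dim=0), node_emb.max(dim=0).values, claim_emb])
        x = torch.cat([x_G, torch.stack([N_max, alpha, beta])])
        return F.softmax(F.relu(self.W2(x)), dim=-1)   # [y_hat_0, y_hat_1]

def total_loss(l_tpp, l_pred, l_mlp, w1: float = 1.0, w2: float = 1.0):
    """L = L_TPP + w1 * L_Pred + w2 * L_MLP (weights are hyperparameters)."""
    return l_tpp + w1 * l_pred + w2 * l_mlp
```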

Table 1. Statistics of the used datasets.

5 Experiments

We use two Twitter datasets [12], Twitter15 and Twitter16, in the experiments. Each dataset is a collection of stories comprising a source tweet being verified and the sequence of its retweets. We select the “True” and “False” source tweets to build the experimental datasets, and split each dataset into training, validation and test sets of 70%, 10% and 20%, respectively. We train the model on the training set, tune hyperparameters on the validation set and report performance on the test set. We crawl user information according to user IDs via the Twitter API (Table 1).
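A minimal sketch of the 70/10/20 split described above, assuming scikit-learn; the stratification by label and the random seed are our assumptions, not stated in the paper.

```python
from sklearn.model_selection import train_test_split

def split_claims(claims, labels, seed=42):
    # 70% train; the remaining 30% is split into 10% validation and 20% test
    train_x, rest_x, train_y, rest_y = train_test_split(
        claims, labels, test_size=0.3, stratify=labels, random_state=seed)
    val_x, test_x, val_y, test_y = train_test_split(
        rest_x, rest_y, test_size=2 / 3, stratify=rest_y, random_state=seed)
    return (train_x, train_y), (val_x, val_y), (test_x, test_y)
```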

As we set out to tackle the misinformation detection task, we compare our model with state-of-the-art baselines. RFC [5] is a random forest model with features from the source tweets and engaged user profiles. CRNN [7] combines convolutional and recurrent neural networks to extract features from engaged users and retweet texts. CSI [15] incorporates relevant articles and analyses the group behaviour of engaged users. dEFEND [16] uses a co-attention mechanism to study the source claims and user features. The graph-based baseline GCAN was described in Sect. 2.

Fig. 2. Average number of nodes in a dissemination tree as a function of time since the source claim's publication (left: Twitter15; right: Twitter16). The solid curves follow the power law approximation.

6 Results and Analysis

To demonstrate the dissemination trends of true and false claims, we plot the mean number of nodes in the temporal graphs of each veracity class at 5-minute intervals for the first 200 min after a source tweet's posting time. In Fig. 2, we make three interesting observations. (1) Both claim veracity types exhibit a similar power-law trend with a plateauing gradient. (2) Contrary to much of the misinformation literature, which suggests that fake news spreads faster than true news [18], within our datasets true news stories spread faster and reach more users on average. (3) There is a far greater disparity between the mean spreading curves in the Twitter16 dataset than in the Twitter15 dataset. This indicates that it is easier to extract temporal features that are consistent within a given veracity class in Twitter16.
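The curves in Fig. 2 can be reproduced with a short script of the following form; the data layout (one sorted array of engagement times per claim, in minutes) is an assumption.

```python
import numpy as np

def mean_growth_curve(claims, horizon: int = 200, step: int = 5) -> np.ndarray:
    """claims: list of sorted 1-D arrays of engagement times (minutes since the source post)."""
    grid = np.arange(0, horizon + step, step)
    # N(t) per claim: 1 (the source node) + number of engagements observed by time t
    counts = np.stack([1 + np.searchsorted(ts, grid, side="right") for ts in claims])
    return counts.mean(axis=0)  # mean dissemination-tree size at each grid point
```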

Table 2. Test results on the two experimental datasets.

Table 2 reports the misinformation detection performance of our model against state-of-the-art baselines on the test subsets. Our model achieves performance comparable to GCAN and, notably, outperforms it on the Twitter16 dataset. This can be explained by the fact that Twitter16 displays a greater disparity between the mean spreading curves of true and false claims, and our model captures such patterns to reach higher performance.

7 Conclusion

This study sets out to detect and forecast misinformation. We model misinformation propagation as a continuous-time dynamic graph and employ temporal point processes to capture the geometric and temporal patterns of the graph. We also develop a power law equation to forecast the growth of the graph scale. Experiments show that our model achieves state-of-the-art performance in misinformation detection. Future work will investigate more comprehensive methods of combining temporal and geometric features for propagation-based misinformation detection.