Quantifying the uncertainty of LLM hallucination spreading in complex adaptive social networks

Guozhi Hao, Jun Wu*, Qianqian Pan, Rosario Morello

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

Abstract

Large language models (LLMs) are becoming a significant source of content generation in social networks, which is a typical complex adaptive system (CAS). However, due to their hallucinatory nature, LLMs produce false information that can spread through social networks, which will impact the stability of the whole society. The uncertainty of LLMs false information spread within social networks is attributable to the diversity of individual behaviors, intricate interconnectivity, and dynamic network structures. Quantifying the uncertainty of false information spread by LLMs in social networks is beneficial for preemptively devising strategies to defend against threats. To address these challenges, we propose an LLMs hallucination-aware dynamic modeling method via agent-based probability distributions, spread popularity and community affiliation, to quantify the uncertain spreading of LLMs hallucination in social networks. We set up the node attributes and behaviors in the model based on real-world data. For evaluation, we consider the spreaders, informed people, discerning and unwilling non-spreaders as indicators, and quantified the spreading under different LLMs task situations, such as QA, dialogue, and summarization, as well as LLMs versions. Furthermore, we conduct experiments using real-world LLM hallucination data combined with social network features to ensure the validity of the proposed quantifying scheme.

Original languageEnglish
Article number16375
JournalScientific reports
Volume14
Issue number1
DOIs
Publication statusPublished - 2024 Dec

ASJC Scopus subject areas

  • General

Fingerprint

Dive into the research topics of 'Quantifying the uncertainty of LLM hallucination spreading in complex adaptive social networks'. Together they form a unique fingerprint.

Cite this