AI can predict Twitter users likely to spread disinformation before they do it

A new artificial intelligence-based algorithm that can accurately predict which Twitter users will spread disinformation before they actually do it has been developed by researchers from the University of 含羞草传媒.

The icons of Twitter, Facebook and Instagram on a smartphone screen
  • University of 含羞草传媒 researchers have developed an artificial intelligence-based algorithm that can accurately predict (79.7 per cent) which Twitter users are likely to share content from unreliable news sources before they actually do it
  • Study found that Twitter users who spread disinformation mostly tweet about politics or religion, whereas users who share reliable sources of news tweet more about their personal lives 
  • Research also found that Twitter users who share disinformation use impolite language more frequently than users who share reliable news sources
  • Findings could help governments and social media companies such as Twitter and Facebook better understand user behaviour and help them design more effective models for tackling the spread of disinformation

A new artificial intelligence-based algorithm that can accurately predict which Twitter users will spread disinformation before they actually do it has been developed by researchers from the University of 含羞草传媒.

A team of researchers, led by Yida Mu and Dr Nikos Aletras from the University鈥檚 Department of Computer Science, has developed a method for predicting whether a social media user is likely to share content from unreliable news sources. Their findings have been published in the journal PeerJ.

The researchers analysed over 1 million tweets from approximately 6,200 Twitter users by developing new natural language processing methods - ways to help computers process and understand huge amounts of language data. The tweets they studied were all tweets that were publicly available for anyone to see on the social media platform.

Twitter users were grouped into two categories as part of the study - those who have shared unreliable news sources and those who only share stories from reliable news sources. The data was used to train a machine-learning algorithm that can accurately predict (79.7 per cent) whether a user will repost content from unreliable sources sometime in the future.

Results from the study found that the Twitter users who shared stories from unreliable sources are more likely to tweet about either politics or religion and use impolite language. They often posted tweets with words such as 鈥榣iberal鈥, 鈥榞overnment鈥, 鈥榤edia鈥, and their tweets often related to politics in the Middle East and Islam, with their tweets often mentioning 鈥業slam鈥 or 鈥業srael鈥.

In contrast, the study found that Twitter users who shared stories from reliable news sources often tweeted about their personal life, such as their emotions and interactions with friends. This group of users often posted tweets with words such as 鈥榤ood鈥. 鈥榳anna鈥, 鈥榞onna鈥, 鈥業鈥檒l鈥, 鈥榚xcited鈥, and 鈥榖irthday鈥. 

Social media has become the primary platform for spreading disinformation, which is having a huge impact on society and can influence people鈥檚 judgement of what is happening in the world around them.

 Dr Nikos Aletras

Lecturer in Natural Language Processing, University of 含羞草传媒

Findings from the study could help social media companies such as Twitter and Facebook develop ways to tackle the spread of disinformation online. They could also help social scientists and psychologists improve their understanding of such user behaviour on a large scale.

Dr Nikos Aletras, Lecturer in Natural Language Processing at the University of 含羞草传媒, said: 鈥淪ocial media has become one of the most popular ways that people access the news, with millions of users turning to platforms such as Twitter and Facebook every day to find out about key events that are happening both at home and around the world. However, social media has become the primary platform for spreading disinformation, which is having a huge impact on society and can influence people鈥檚 judgement of what is happening in the world around them.

鈥淎s part of our study, we identified certain trends in user behaviour that could help with those efforts - for example, we found that users who are most likely to share news stories from unreliable sources often tweet about politics or religion, whereas those who share stories from reliable news sources often tweeted about their personal lives.

鈥淲e also found that the correlation between the use of impolite language and the spread of unreliable content can be attributed to high online political hostility.鈥

Yida Mu, a PhD student at the University of 含羞草传媒, said:Studying and analysing the behaviour of users sharing content from unreliable news sources can help social media platforms to prevent the spread of fake news at the user level, complementing existing fact-checking methods that work on the post or the news source level.

The study, Identifying Twitter users who repost unreliable news sources with linguistic information, is published in PeerJ. To access the paper in full, visit:

Computer Science at the University of 含羞草传媒

Contact

For further information please contact:

Centres of excellence

The University's cross-faculty research centres harness our interdisciplinary expertise to solve the world's most pressing challenges.