情报学报  2019, Vol. 38 Issue (1): 89-96    DOI: 10.3772/j.issn.1000-0135.2019.01.010
Research on Microblog Rumor Identification Based on LDA and Random Forest
Zeng Ziming1,2, Wang Jing1,2
1. Center for the Study of Information Resources, Wuhan 430072;
2. Laboratory Center for Library and Information Science, Wuhan 430072
Abstract  The spread of Internet rumors harms everyday life and social stability. To support rumor control, this paper analyzes posts about “haze” rumors on the Sina Weibo microblogging platform in 2016 and constructs reliability and influence variables from Weibo data and prior research. An LDA model is then used to obtain the topic distribution of the experimental text data. Using the reliability variable, the influence variable, and the topic probabilities as features, the paper applies a random forest classifier to identify rumors. The experimental results show that the topic probabilities play an important role in rumor identification and that the LDA-based random forest model improves identification accuracy.
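The pipeline described in the abstract (LDA topic probabilities combined with reliability and influence variables, then classified with a random forest) can be illustrated with a minimal sketch. This is not the authors' implementation. It assumes scikit-learn, and the toy posts, the numeric reliability and influence values, the number of topics, and the forest size are all hypothetical placeholders, since the abstract does not specify them.

```python
import numpy as np
from sklearn.decomposition import LatentDirichletAllocation
from sklearn.ensemble import RandomForestClassifier
from sklearn.feature_extraction.text import CountVectorizer

# Made-up, pre-tokenized posts for illustration only; 1 = rumor, 0 = non-rumor.
posts = [
    "haze causes cancer shocking forward quickly",
    "haze contains drug resistant bacteria share now",
    "weather bureau issues haze warning today",
    "city publishes air quality index haze report",
    "shocking haze secret forward before deleted",
    "environment ministry reports haze data update",
]
labels = np.array([1, 1, 0, 0, 1, 0])

# Hypothetical stand-ins for the paper's reliability and influence variables
# (one column each); the real variables are built from Weibo data.
reliability = np.array([[0.2], [0.3], [0.9], [0.8], [0.1], [0.9]])
influence = np.array([[0.7], [0.6], [0.4], [0.3], [0.8], [0.5]])


def build_features(texts, reliability, influence, n_topics=2, seed=0):
    """Concatenate reliability, influence, and LDA topic probabilities."""
    # Bag-of-words counts are the input that LDA expects.
    counts = CountVectorizer().fit_transform(texts)
    lda = LatentDirichletAllocation(n_components=n_topics, random_state=seed)
    # Each row is a per-document topic distribution that sums to 1.
    topic_probs = lda.fit_transform(counts)
    return np.hstack([reliability, influence, topic_probs])


features = build_features(posts, reliability, influence)

# Random forest trained on the combined feature matrix.
clf = RandomForestClassifier(n_estimators=50, random_state=0)
clf.fit(features, labels)
predictions = clf.predict(features)
```

Predictions here are made on the training posts only to keep the sketch short; a real evaluation would use held-out data or cross-validation, and Chinese microblog text would need word segmentation before the counting step.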
Key words: Weibo; rumor identification; LDA; random forest; haze
Received: 03 November 2017     
Cite this article:   
Zeng Ziming,Wang Jing. Research on Microblog Rumor Identification Based on LDA and Random Forest[J]. 情报学报, 2019, 38(1): 89-96.
URL:  
https://qbxb.istic.ac.cn/EN/10.3772/j.issn.1000-0135.2019.01.010     OR     https://qbxb.istic.ac.cn/EN/Y2019/V38/I1/89