|
|
A Method for Constructing Topic Map in Professional Social Media:A Case Study of Automobile Forum |
Lin Jie, Miao Runsheng |
School of Economics and Management, Tongji University, Shanghai 200092 |
|
|
Abstract The content of topic maps in professional social media includes topics in forums and the relationship among these topics. Topic maps are important in different applications such as determining the direction of professional product innovation and building professional knowledge indices. Based on deep learning and text mining technology, this paper proposes a method for constructing topic maps in professional social media. First, the Skip-Gram model is trained using professional social media text. The hidden layer weight of the model is regarded as the semantic similarities of words, while prediction in the model is regarded as the contextual relevance between words. Second, the existing seed ontology vocabulary is expanded based on semantic similarity and contextual relevance; the aim of this step is to provide high-quality domain vocabulary for the next step of topic extraction. Topics are then extracted by the ontology-based latent Dirichlet allocation (LDA) topic model. Finally, the weight of relevancy between words is defined using semantic similarity and context relevance. The relevancy and hierarchical structure between topics or sub-topics are obtained through graph modeling and spectral clustering. In this paper, the car forum corpus is used to develop a topic map. The results show that the proposed method improved keyword purity by 20.2% compared to the LDA model, and could display the relationships among these topics both clearly and reasonably.
|
Received: 25 March 2019
|
|
|
|
1 卜曲. 品牌社区网络结构及成员互动内容研究[J]. 现代商贸工业, 2016, 37(4): 55-56. 2 韩永青, 陈卓群, 夏立新. 国内外主题图应用研究述评[J]. 图书情报知识, 2008(6): 105-109, 128. 3 Wang H C,Jhou H T,Tsai Y S. Adapting topic map and social influence to the personalized hybrid recommender system[J]. Information Sciences, 2018, in press. 4 吴婧. 试论网络论坛的文本构建特色[J]. 新闻研究导刊, 2016, 7(4): 66-67. 5 Ellouze N,Ben A M,Metais E, et al. State of the art on topic map building approaches[C]// Proceedings of the 14th International Conference on Applications of Natural Language to Information Systems. Heidelberg: Springer, 2008: 102-112. 6 Lacher M S,Decker S. RDF, topic maps, and the semantic Web[J]. Markup Languages: Theory and Practice, 2001, 3(3): 313-331. 7 Jose-Garcia A,Lopez-Arevalo I,Sosa-Sosa V. Building topic maps from relational databases[C]// Proceedings of the 9th International Conference on Electrical Engineering, Computing Science and Automatic Control. New York: IEEE, 2012: 1-6. 8 Bohm K,Heyer G,Wolff C. Topic map generation using text mining[J]. Journal of Universal Computer Science, 2002, 8(6): 623-633. 9 夏火松, 李保国, 杨培. 基于改进K-means聚类的在线新闻评论主题抽取[J]. 情报学报, 2016, 35(1): 55-65. 10 彭云, 万常选, 江腾蛟, 等. 基于语义约束 LDA 的商品特征和情感词提取[J]. 软件学报, 2017, 28(3): 676-693. 11 Roberson S,Dicheva D. Semi-automatic ontology extraction to create draft topic maps[C]// Proceedings of the 45th Annual Southeast Regional Conference. New York: ACM Press, 2007: 100-105. 12 Kasler L,Venczel Z,Varga L Z. Framework for semi-automatically generating topic maps[C]// Proceedings of the 3rd International Workshop on Text-based Information Retrieval. CEUR-WS, 2006: 24-30. 13 Dicheva D,Dichev C. TM4L: Creating and browsing educational topic maps[J]. British Journal of Educational Technology, 2006, 37(3): 391-404. 14 李煜, 刘虹, 孙建军. 中国图书馆学博士论文研究主题图谱分析[J]. 图书馆杂志, 2018, 37(6): 22-30. 15 白如江, 冷伏海, 廖君华. 一种基于科技规划文本的研究前沿主题地图构建方法[J]. 图书情报工作, 2017, 61(23): 114-121. 16 蔡晓妍, 戴冠中, 杨黎斌. 谱聚类算法综述[J]. 计算机科学, 2008, 35(7): 14-18. 17 公路交通科学名词审定委员会. 公路交通科技名词[M]. 北京: 科学出版社, 2016. |
|
|
|