|
|
Comparisons of Data Set Construction Methods in Domain Analysis of Science and Technology: Consistency and Reliability |
Chen Guo1,2, Shao Yu1, Wang Yuefen1,2 |
1.Department of Information Management, Nanjing University of Science and Technology, Nanjing 210094 2.Jiangsu Science and Technology Collaborative Innovation Center of Social Public Safety, Nanjing 210094 |
|
|
Abstract In scientific and technical intelligence analysis, there are several methods in which documents can be collected; however, differences between these methods and their reliabilities are not well-understood. A manner in which scientific intelligence analysis can be promoted is to explore methods to collect field documents that can guarantee the reliability of analysis results. Therefore, this study aims to address this question quantitatively via an empirical analysis. First, the field of artificial intelligence is considered as an example along with the various task scenarios of scientific and technical intelligence analysis (including different analysis elements, element numbers, and sorting methods). Three progressive experiments are then designed in this study, each addressing one of the following: (1) the differences among the mainstream methods used to collect documents; (2) the reliability of these methods; and (3) the reliability of the final analysis results when the different methods are combined. The experimental results demonstrate that there is a slight difference and high reliability between these collection methods in intelligence analysis tasks of coarse-grained elements such as countries. However, when the intelligence analysis tasks can implemented by different researchers, the results of the existing collected methods are evidently different and their reliability is low. In different intelligence analysis tasks, we can select a relatively suitable manner to collect documents. The improvement in the reliability is not evident when the documents are collected by combining different methods. In other words, optimizing the collection of documents is crucial for the analysis of scientific and technical intelligence. Finally, based on quantitative indicators, this paper presents method to improve the reliability of scientific and technical intelligence analysis in terms of collecting documents along with future scope.
|
Received: 15 October 2019
|
|
|
|
1 邱均平. 信息计量学(七)第七讲 文献信息分布的集中与离散规律——布-齐-洛分布系及理论[J]. 情报理论与实践, 2001, 24(1): 77-80. 2 冯璐. 面向学科信息集成的领域分析数据集构建[M]. 北京: 北京邮电大学出版社, 2013. 3 Morris S A, van der Veer Martens B. Mapping research specialties[J]. Annual Review of Information Science and Technology, 2008, 42(1): 213-295. 4 Shu F, Julien C A, Zhang L, et al. Comparing journal and paper level classifications of science[J]. Journal of Informetrics, 2019, 13(1): 202-225. 5 沈艳红, 张娣. 文献计量分析中的数据准备工作研究[J]. 图书馆建设, 2012(5): 90-92. 6 钟丽萍, 冷伏海, 罗世猛. 情报研究有效性的影响因素分析[J]. 情报理论与实践, 2013, 36(7): 6-9. 7 Lu C, Bu Y, Dong X L, et al. Analyzing linguistic complexity and scientific impact[J]. Journal of Informetrics, 2019, 13(3): 817-829. 8 谢瑞霞, 李秀霞. 基于合著网络的作者影响力评价指标[J]. 情报理论与实践, 2019, 42(1): 100-104. 9 Najmi A, Rashidi T H, Abbasi A, et al. Reviewing the transport domain: An evolutionary bibliometrics and network analysis[J]. Scientometrics, 2017, 110(2): 843-865. 10 刘自强, 岳丽欣, 王效岳, 等. 主题演化视角下的国际情报学研究热点与前沿分析[J]. 图书馆, 2017(3):14-22. 11 侯剑华, 杨秀财, 周莉娟. 国际图书情报领域研究的前沿主题及其演化趋势分析[J]. 图书情报工作, 2016, 60(13): 82-90. 12 Iqbal W, Qadir J, Tyson G, et al. A bibliometric analysis of publications in computer networking research[J]. Scientometrics, 2019, 119(2): 1121-1155. 13 Shu F, Dinneen J D, Asadi B, et al. Mapping science using library of congress subject headings[J]. Journal of Informetrics, 2017, 11(4): 1080-1094. 14 Lv Y, Ding Y, Song M, et al. Topology-driven trend analysis for drug discovery[J]. Journal of Informetrics, 2018, 12(3): 893-905. 15 Sinha A, Shen Z H, Song Y, et al. An overview of microsoft academic service (MAS) and applications[C]// Proceedings of the 24th International Conference on World Wide Web. New York: ACM Press, 2015: 243-246. 16 人工智能领域的会议和期刊[EB/OL]. (2012-09-03) [2019-09-06]. https://blog.csdn.net/ariessurfer/article/details/7936860. 17 Open Academic Graph[EB/OL]. [2019-09-06]. https://www.aminer.cn/open-academic-graph?tdsourcetag=s_pcqq_aiomsg. 18 Shu F, Julien C A, Larivière V. Does the web of science accurately represent Chinese scientific performance?[J]. Journal of the Association for Information Science and Technology, 2019, 70(10): 1138-1152. 19 Spearman C. The proof and measurement of association between two things[J]. The American Journal of Psychology, 1904, 15(1): 72-101. |
|
|
|