|
|
|
| Contribution Attribution Based on Semantic Similarity: Revealing Implicitlyd Sources in Directs |
| Yang Siluo1,2, Wu Lijuan1,2, Wu Biyao1,2 |
1.School of Information Management, Wuhan University, Wuhan 430072 2.Research Center for Chinese Science Evaluation, Wuhan University, Wuhan 430072 |
|
|
|
|
Abstract To accurately identify the hidden contributions and unlabeled implicitly cited sources in direct citations, in this paper, citation contribution attribution (CCA) methods are constructed based on citation markers and semantic similarity, and the necessity of integrating hidden contributions into academic evaluation is discussed to provide a new perspective for research contribution attribution. Using the CL-SciSumm 2017 dataset as an empirical object, two types of CCA methods are used to calculate the explicit contributions of focal papers. The remaining contributions are defined as hidden contributions of their references and allocated according to the equal allocation principle and semantic similarity. Two traditional methods, direct citation evaluation and citation cascade evaluation, are employed as benchmarks to compare the differences in contribution scores of implicitly cited papers. The references of focal papers provide hidden contributions to their citing papers. Semantic similarity-based CCA methods identify implicitly cited sources more comprehensively but need further improvement in identification precision. The contribution scores obtained by the proposed and traditional methods differ significantly, suggesting that the hidden contributions should be incorporated into the academic contribution evaluation system of scientific literature.
|
|
Received: 08 March 2025
|
|
|
|
1 Zhu X D, Turney P, Lemire D, et al. Measuring academic influence: not all citations are equal[J]. Journal of the Association for Information Science and Technology, 2015, 66(2): 408-427. 2 Cai L W, Tian J H, Liu J Y, et al. Scholarly impact assessment: a survey of citation weighting solutions[J]. Scientometrics, 2019, 118(2): 453-478. 3 Min C, Chen Q Y, Yan E J, et al. Citation cascade and the evolution of topic relevance[J]. Journal of the Association for Information Science and Technology, 2021, 72(1): 110-127. 4 Liu Y Y, Wu Q, Wu S J, et al. Weighted citation based on ranking-related contribution: a new index for evaluating article impact[J]. Scientometrics, 2021, 126(10): 8653-8672. 5 刘运梅, 李冉, 盛小平. 隐形三角引用: 科学论文中的不规范引用行为[J]. 现代情报, 2024, 44(4): 127-141. 6 Liu Y M, Yang L, Chen M. A new citation concept: triangular citation in the literature[J]. Journal of Informetrics, 2021, 15(2): 101141. 7 Lyu H H, Bu Y, Zhao Z Y, et al. Citation bias in measuring knowledge flow: evidence from the web of science at the discipline level[J]. Journal of Informetrics, 2022, 16(4): 101338. 8 Hu X J, Rousseau R. Scientific influence is not always visible: the phenomenon of under-cited influential publications[J]. Journal of Informetrics, 2016, 10(4): 1079-1091. 9 Ahlgren P, Chen Y W, Colliander C, et al. Enhancing direct citations: a comparison of relatedness measures for community detection in a large set of PubMed publications[J]. Quantitative Science Studies, 2020, 1(2): 714-729. 10 Fragkiadaki E, Evangelidis G. Three novel indirect indicators for the assessment of papers and authors based on generations of citations[J]. Scientometrics, 2016, 106(2): 657-694. 11 吕海华, 李江. 1987—2016年跨学科知识流动的规律: 一个新的视角“学科势能”[J]. 图书情报知识, 2021, 38(4): 125-135. 12 鞠秀芳. 基于文本相似度分析的期刊引文有效性识别研究[J]. 现代情报, 2018, 38(11): 14-17. 13 MacRoberts M H, MacRoberts B R. The mismeasure of science: citation analysis[J]. Journal of the Association for Information Science and Technology, 2018, 69(3): 474-482. 14 Yang S L, Xiao A X, Nie Y, et al. Measuring coauthors’ credit in medicine field—based on author contribution statement and citation context analysis[J]. Information Processing & Management, 2022, 59(3): 102924. 15 Liu J S, Kuan C H. A new approach for main path analysis: decay in knowledge diffusion[J]. Journal of the Association for Information Science and Technology, 2016, 67(2): 465-476. 16 Hou J H, Yang X C, Zhang Y. The effect of social media knowledge cascade: an analysis of scientific papers diffusion[J]. Scientometrics, 2023, 128(9): 5169-5195. 17 Asubiaro T V, Ajiferuke I. Semantic similarity-based credit attribution on citation paths: a method for allocating residual citation to and investigating depth of influence of scientific communications[J]. Scientometrics, 2022, 127(11): 6257-6277. 18 罗卓然, 蔡乐, 钱佳佳, 等. 学术论文创新贡献句识别研究[J]. 图书情报工作, 2021, 65(12): 93-100. 19 Fiala D, Tutoky G. PageRank-based prediction of award-winning researchers and the impact of citations[J]. Journal of Informetrics, 2017, 11(4): 1044-1068. 20 Waltman L. A review of the literature on citation impact indicators[J]. Journal of Informetrics, 2016, 10(2): 365-391. 21 Feng S D, Li H J, Qi Y B. How to detect the sleeping beauty papers and princes in technology considering indirect citations?[J]. Journal of Informetrics, 2023, 17(3): 101431. 22 康旭东, 邓乐乐, 王宇开, 等. 基于全代引证的专利累积影响力评价——一个诺奖得主专利的案例研究[J]. 情报学报, 2021, 40(3): 267-277. 23 Tahamtan I, Bornmann L. What do citation counts measure? An updated review of studies on citations in scientific documents published between 2006 and 2018[J]. Scientometrics, 2019, 121(3): 1635-1684. 24 卢超, 章成志, 王玉琢, 等. 语义特征分析的深化——学术文献的全文计量分析研究综述[J]. 中国图书馆学报, 2021, 47(2): 110-131. 25 金贤日, 欧石燕. 无监督引用文本自动识别与分析[J]. 数据分析与知识发现, 2021, 5(1): 66-77. 26 Fragkiadaki E, Evangelidis G, Samaras N, et al. Cascading citations indexing framework algorithm implementation and testing[C]// Proceedings of the 13th Panhellenic Conference on Informatics. Piscataway: IEEE, 2009: 70-74. 27 Dervos D A, Kalkanis T. Cc-IFF: a cascading citations impact factor framework for the automatic ranking of research publications[C]// Proceedings of the 2005 IEEE Intelligent Data Acquisition and Advanced Computing Systems: Technology and Applications. Piscataway: IEEE, 2005: 668-673. 责任编辑 冯家琪) |
|
|
|