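Data Intelligence Empowered Innovation: An Exploration of the Innovation Assistance Framework Based on Data Intelligence Technology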

doi:10.3772/j.issn.1000-0135.2023.09.001

Journal of the China Society for Scientific and Technical Information (情报学报)

2023, Vol. 42

Issue (9): 1009-1017

Special Topic

Lu Wei^1,2, Ma Yongqiang^1,2, Liu Jiawei^1,2, Yang Jinqing^3, Cheng Qikai^1,2

1.School of Information Management, Wuhan University, Wuhan 430072
2.Information Retrieval and Knowledge Mining Laboratory, Wuhan University, Wuhan 430072
3.School of Information Management, Central China Normal University, Wuhan 430079

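Abstract: Large artificial intelligence models, represented by ChatGPT (Chat Generative Pre-trained Transformer), have shown excellent performance in text generation and human-machine dialogue. Against this backdrop, data intelligence technologies such as big data and artificial intelligence have demonstrated significant practical value in empowering scientific research innovation. Current scientific and technical information resource management and knowledge services can supply research with reasonably accurate information and routine knowledge aggregation services, but they have not yet been deeply integrated with research innovation activities. Meanwhile, researchers face challenges in their research activities, such as insufficient information-processing capacity and limited cognitive capacity. Accordingly, this paper first analyzes the new characteristics of scientific research activities in the era of data intelligence, then proposes an innovation assistance framework based on data intelligence technology, and examines the framework in depth, describing its functional positioning, service modes, and key empowerment paths across the whole innovation process. In the future, as big data and artificial intelligence technologies continue to mature, data-intelligence-empowered scientific and technical information resource management will be further embedded in the whole process of research innovation. Innovation assistance services based on data intelligence technology can provide researchers with personalized, fine-grained knowledge and scenario-specific solutions, such as assistance services for literature reading, experimental design, and paper writing, and thereby better serve scientific research innovation.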

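Keywords: data intelligence empowerment, ChatGPT, large artificial intelligence models, AI for Science, whole process of scientific research innovation, innovation assistance framework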

Received: 2023-04-05

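Funding: Key Program of the National Natural Science Foundation of China, "Theoretical Transformation of Scientific and Technical Information Resources and Knowledge Management Empowered by Data Intelligence" (72234005); General Program of the National Natural Science Foundation of China, "Recognition of Argumentation Logic in Scientific Proposition Texts Based on Machine Reading Comprehension" (72174157).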

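About the authors: Lu Wei, male, born in 1974, professor and doctoral supervisor; research interests: information retrieval, AI governance, and human-machine collaboration; E-mail: weilu@whu.edu.cn. Ma Yongqiang, male, born in 1997, PhD candidate; research interests: information extraction and document intelligence. Liu Jiawei, male, born in 1994, PhD candidate; research interests: information retrieval and information security. Yang Jinqing, male, born in 1991, PhD, lecturer; research interests: scientific and technical intelligence and the evolution of disciplinary knowledge. Cheng Qikai, male, born in 1989, PhD, associate professor; research interests: text mining and information retrieval.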

Cite this article:

Lu Wei, Ma Yongqiang, Liu Jiawei, Yang Jinqing, Cheng Qikai. Data Intelligence Empowered Innovation: An Exploration of the Innovation Assistance Framework Based on Data Intelligence Technology[J]. Journal of the China Society for Scientific and Technical Information, 2023, 42(9): 1009-1017.

Link to this article:

https://qbxb.istic.ac.cn/CN/10.3772/j.issn.1000-0135.2023.09.001 or https://qbxb.istic.ac.cn/CN/Y2023/V42/I9/1009
