|
|
Automatic Mapping Method and Empirical Research of U.S. Commerce Control List Data and Patent Data |
Lyu Lucheng1,2, Han Tao1,2, Chen Fang1, Wang Xuezhao1,2, Zhao Yajuan1,2, Guo Shijie1,2 |
1.National Science Library, Chinese Academy of Sciences, Beijing 100190
2.Department of Library, Information and Archives Management, School of Economics and Management, University of Chinese Academy of Sciences, Beijing 100190 |
|
|
Abstract To efficiently reveal the technology gap between China and the United States in technologies recorded in the U.S. Commerce Control List (CCL), under the circumstances of the highly unstructured characteristics of CCL data, this paper proposes an automatic method to map the CCL and patent data, which can automatically reveal the technology gap from the perspective of the patent. Based on the theory of text mining, the text standardization process of CCL text is formulated, and the automatic mapping method and effect evaluation indicator of CCL data and patent data based on TF-IDF and Word2Vec are proposed. Taking the U.S. CCL data in 2019 and the global Patent Cooperation Treaty (PCT) patent application data in 2018 as an example, the empirical research is conducted. By evaluating the effect of the model, it is finally found that the automatic mapping result obtained when the text similarity threshold of Word2Vec model is 0.87 is optimal, and the technology gap analysis is carried out based on this model. The method proposed in this paper can automatically map CCL and patent data and carry out an intelligence analysis, and the analysis results are highly interpretable. It is a useful tool to improve the timeliness of intelligence analysis, and has high practical value.
|
Received: 18 January 2021
|
|
|
|
1 张建军. 中美技术出口管理法律制度的比较研究[D]. 西安: 西北大学, 2004.
2 彭爽, 张晓东. 论美国的出口管制体制[J]. 经济资料译丛, 2015(2): 24-41.
3 靳风. 美国出口管制体系概览[J]. 当代美国评论, 2018, 2(2): 117-120.
4 祝捷频, 赵蕴华. 基于美国对华技术管制清单的专利分析——以数控系统领域为例[J]. 情报杂志, 2014, 33(11): 46-53.
5 魏简康凯, 宿铮. 美国出口管制改革的竞争情报分析[J]. 情报杂志, 2019, 38(4): 4-8.
6 陈峰. 应对国外对华技术出口限制的竞争情报问题分析[J]. 情报杂志, 2018, 37(1): 9-13, 33.
7 陈峰. 中国实施高技术出口管制需要高度倚重竞争情报[J]. 情报杂志, 2018, 37(8): 1, 5, 37, 2-4.
8 陆天驰, 闵超, 高伊林, 等. 竞争情报视角下的中美人工智能技术领域差距分析——以美国商品管制清单为例[J]. 情报杂志, 2019, 38(11): 25-33.
9 周磊, 杨威, 余玲珑, 等. 美国对华技术出口管制的实体清单分析及其启示[J]. 情报杂志, 2020, 39(7): 23-28.
10 茹丽洁, 张娴. 专利技术相关性研究方法进展评述与展望[J]. 图书情报工作, 2016, 60(6): 128-134, 141.
11 Passing F, Moehrle M G. Measuring technological convergence in the field of smart grids: a semantic patent analysis approach using textual corpora of technologies[C]// Proceedings of the 2015 Portland International Conference on Management of Engineering and Technology. IEEE, 2015: 559-570.
12 曾文, 徐红姣, 李颖, 等. 基于VSM的科技期刊文献与专利文献的相似度计算方法研究[J]. 情报工程, 2016, 2(3): 37-42.
13 徐红姣, 曾文, 张运良. 基于Word2Vec的论文和专利主题关联演化分析方法研究[J]. 情报杂志, 2018, 37(12): 36-42.
14 田创, 赵亚娟. 一种基于相似度的专利与产业类目映射模型——以《国际专利分类》与《国民经济行业分类》为例[J]. 图书情报工作, 2016, 60(20): 123-131.
15 詹文青, 肖国华. 面向技术需求的潜在技术转移专利识别[J]. 情报理论与实践, 2019, 42(5): 117-121, 176.
16 吕璐成, 韩涛, 周健, 等. 基于深度学习的中文专利自动分类方法研究[J]. 图书情报工作, 2020, 64(10): 75-85.
17 Mikolov T, Chen K, Corrado G, et al. Efficient estimation of word representations in vector space[OL]. (2013-09-07). https://arxiv.org/pdf/1301.3781.pdf.
18 吕璐成, 韩涛. AI在图情: 人工智能赋能图情服务——2019年图书馆前沿技术论坛(IT4L)会议综述[J]. 农业图书情报学报, 2020, 32(5): 13-18. |
|
|
|