|
|
Methodological and Automatic Sentence Extraction from Academic Article s Full-text |
Zhang Yingyi, Zhang Chengzhi |
Department of Information Management, School of Economics and Management, Nanjing University of Science & Technology, Nanjing 210094 |
|
|
Abstract Research methods are essential in the scientific literature. These include methods, tools, or techniques for solving problems in the field. Theresearch method s description is usually presented through sentences. Summarizing these scattered sentences in the scientific literature can help researchers to quickly explore appropriate research methods. According to the method s purpose in the research paper, the research method sentence is further divided into method used and method cited sentences. The method used sentence refers to the sentence that describes the research method used in the paper and the method cited sentence refers to that cited by the paper. In this study, a variety of neural network-based sentence classification models are used for extracting the method sentences from the scientific literature s full-text. At the word vector representation layer, the study uses two-word vector models: BERT and word2vec. In the feature selection layer, three different networks are utilized: convolutional neural network (CNN), bidirectional LSTM (BiLSTM), and attention mechanism network. In addition, the study uses two model training methods a single-level structure and a two-level structure. The experimental results show that the BERT-based BiLSTM model with single-level structure achieves the best performance. This paper analyzes the distribution of research method sentences extracted from the Journal of The China Society for Scientific and Technical Information. The analysis indicates that this journal paid more attention to the theoretical developments of information science; in addition, the journal also focused on constructing theoretical systems for this discipline.
|
Received: 13 October 2019
|
|
|
|
1 Bornmann L, Mutz R. Growth rates of modern science: A bibliometric analysis based on the number of publications and cited references[J]. Journal of the Association for Information Science and Technology, 2015, 66(11): 2215-2222. 2 Gupta S, Manning C. Analyzing the dynamics of research by extracting key aspects of scientific papers[C]// Proceedings of the 5th International Joint Conference on Natural Language Processing. Asian Federation of Natural Language Processing, 2011: 1-9. 3 Kova?evi? A, Konjovi? Z, Milosavljevi? B, et al. Mining methodologies from NLP publications: A case study in automatic terminology recognition[J]. Computer Speech & Language, 2012, 26(2): 105-126. 4 Singh M, Dan S, Agarwal S, et al. AppTechMiner: Mining applications and techniques from scientific articles[C]// Proceedings of the 6th International Workshop on Mining Scientific Publications. New York: ACM Press, 2017: 1-8. 5 蒋婷. 学科领域本体学习及学术资源语义标注研究[D]. 南京: 南京大学, 2017. 6 Hirohata K, Okazaki N, Ananiadou S, et al. Identifying sections in scientific abstracts using conditional random fields[C]// Proceedings of the Third International Joint Conference on Natural Language Processing, Hyderabad, India, 2008: 381-388. 7 Liakata M, Saha S, Dobnik S, et al. Automatic recognition of conceptualization zones in scientific articles and two life science applications[J]. Bioinformatics, 2012, 28(7): 991-1000. 8 Hayes P J, Andersen P M, Nirenburg I B, et al. TCS: A shell for content-based text categorization[C]// Proceedings of the Sixth Conference on Artificial Intelligence for Applications. IEEE Press, 1990: 320-326. 9 华秀丽, 徐凡, 王中卿, 等. 细粒度科技论文摘要句子分类方法[J]. 计算机工程, 2012, 38(14): 138-140. 10 Nomponkrang T, Sanrach C. The comparison of algorithms for Thai-sentence classification[J]. International Journal of Information and Education Technology, 2016, 6(10): 801-808. 11 Yamamoto Y, Takagi T. A sentence classification system for multi biomedical literature summarization[C]// Proceedings of the 21st International Conference on Data Engineering Workshops. IEEE Press, 2005: 1163-1163. 12 Hsu S T, Moon C, Jones P, et al. A hybrid CNN-RNN alignment model for phrase-aware sentence classification[C]// Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics. Stroudsburg: Association for Computational Linguistics, 2017, 2: 443-449. 13 Limsopatham N, Collier N. Modelling the combination of generic and target domain embeddings in a convolutional neural network for sentence classification[C]// Proceedings of the 15th Workshop on Biomedical Natural Language Processing. Stroudsburg: Association for Computational Linguistics, 2016: 136-140. 14 Chung J, Gulcehre C, Cho K H, et al. Empirical evaluation of gated recurrent neural networks on sequence modeling[OL]. https://arxiv.org/pdf/1412.3555v1.pdf. 15 Devlin J, Chang M W, Lee K, et al. BERT: Pre-training of deep bidirectional transformers for language understanding[C]// Proceedings of the Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies. Stroudsburg: Association for Computational Linguistics, 2019, 1: 4171-4186. 16 Mikolov T, Sutskever I, Chen K, et al. Distributed representations of words and phrases and their compositionality[C]// Proceedings of the 26th International Conference on Neural Information Processing Systems. Red Hook: Curran Associates, 2013, 2: 3111-3119. 17 Vaswani A, Shazeer N, Parmar N, et al. Attention is all you need[C]// Proceedings of the 31st International Conference on Neural Information Processing Systems. Red Hook: Curran Associates, 2017: 6000-6010. 18 Graves A, Schmidhuber J. Framewise phoneme classification with bidirectional LSTM and other neural network architectures[J]. Neural Networks, 2005, 18(5-6): 602-610. 19 McCallum A, Nigam K. A comparison of event models for naive Bayes text classification[C]// Proceedings of the AAAI/ICML -98 Workshop on Learning for Text Categorization. Palo Alto: AAAI Press, 1998: 41-48. 20 Salton G, Buckley C. Term-weighting approaches in automatic text retrieval[J]. Information Processing & Management, 1988, 24(5): 513-523. 21 Keller J M, Gray M R, Givens J A. A fuzzy K-nearest neighbor algorithm[J]. IEEE Transactions on Systems, Man, and Cybernetics, 1985, 15(4): 580-585. 22 Joachims T. Text categorization with Support Vector Machines: Learning with many relevant features[C]// Proceedings of the European Conference on Machine Learning. Heidelberg: Springer, 1998: 137-142. 23 王芳, 陈锋, 祝娜, 等. 我国情报学理论的来源、应用及学科专属度研究[J]. 情报学报, 2016, 35(11): 1148-1164. |
|
|
|