Full Abstracts

2018 Vol. 37, No. 12
Published: 2018-12-28

1175 Using Behavior and Influence Assessment of Algorithms Based on Full-text Academic Articles
Zhang Chengzhi, Ding Ruiyi, Wang Yuzhuo
DOI: 10.3772/j.issn.1000-0135.2018.12.001
Data mining algorithms have been widely used in scientific research and practice. Investigating the mentions of data mining algorithms in academic papers and assessing their influence can help researchers comprehensively understand algorithms used in their field and select those that are appropriate based on a given research task. We used full-text of academic articles to conduct an analysis using the behavior of algorithms and evaluating their influence. This paper considers the field of natural language processing and collects the full-text proceedings accumulated by the China National Conference on Computing Linguistics (CCL) from 1993 to 2016 to conduct a comprehensive examination of the top 10 data mining algorithms based on four aspects: frequency of mention, location of mention, motivation for mention, and age distribution. The impacts of the algorithms are evaluated according to these four aspects. The experimental results show that obvious differences exist among the 10 algorithms; the SVM algorithm has the highest influence, while the CART and Apriori algorithms have low influence. This investigation can provide recommendations for researchers, especially novices whose works are related to data-driven research or applications, as well as introduce new ideas toward the assessment of algorithm influence.
2018 Vol. 37 (12): 1175-1187 [Abstract] ( 168 ) HTML (1 KB)  PDF (909 KB)  ( 1357 )
1188 Comprehensive Amendment and Research on h Index: htop Index
Yu Liping, Wang Zuogong, and Zhang Zaijie
DOI: 10.3772/j.issn.1000-0135.2018.12.002
To avoid defects of the h index, which include decreased importance of highly cited papers, greater impact from downloads of papers, and low discrimination, this paper proposes the htop index, which uses the h index divided by the logarithm of downloads and multiplied by the proportion of highly cited papers. It is shown that the htop index is a great bibliometric indicator that is suitable for defining the time ranges of assessed objectives in the basis of the CNKI database and CSSCI library. Further efforts to revise the htop index may help solve the issue of cooperation between the author and measure and to generalize the htop index as a bibliometric indicator.
2018 Vol. 37 (12): 1188-1192 [Abstract] ( 219 ) HTML (1 KB)  PDF (381 KB)  ( 615 )
1193 Methods of Extracting Non-Categorical Semantic Relations between Chinese Terms
Zhu Hui, Wang Hao, Su Xinning, Deng Sanhong
DOI: 10.3772/j.issn.1000-0135.2018.12.003
Ontology is an effective method of knowledge organization, and it is also the important link in constructing the Semantic Web. The non-categorical semantic relations between concepts are important parts of ontology. Because the term is the external expression of a concept, this paper introduces co-occurrence, structural analysis, template construction, logical reasoning, and other methods to construct a model that can extract non-categorical semantic relations between terms from Chinese unstructured texts. The model extracts the relations from two different perspectives: content and structure. The paper puts forward the main operation flow of the model and the main components of each functional module, discusses the specific realization of the main components, and discusses the limitations of the methods. The research will provide new ideas for the extraction of non-categorical semantic relations between terms, enrich the methods of knowledge discovery, and provide references for the implementation of feasible and effective knowledge organization.
2018 Vol. 37 (12): 1193-1203 [Abstract] ( 187 ) HTML (1 KB)  PDF (694 KB)  ( 649 )
1204 Part-of-Speech Automated Annotation of Food Safety Events Based on BiLSTM-CRF
Xu Fei, Ye Wenhao, Song Yinghua
DOI: 10.3772/j.issn.1000-0135.2018.12.004
The accuracy and recall rate of part-of-speech annotation directly affects the overall effect of knowledge and strategy mining of subsequent food-safety incidents, which not only directly affects the performance of term and entity extraction in food-safety events, but also, to some extent, determines the accuracy of classification, clustering, and association knowledge mining related to food-safety events. The experiment of part-of-speech annotation is conducted based on traditional machine learning and deep learning models, such as CRF, RNN BiLSTM, and BiLSTM- CRF. The result of forty groups of experiments shows that the annotation F-scores of the deep learning models is higher than those of the CRF, among which, the average F-score of RNN and BiLSTM is 2.43% and 3.93% higher, respectively. The overall performance of BiLSTM-CRF, which systemically integrates the optimal characteristics of both BiLSTM and CRF, reaches the best level, in which the F-score is 7.12% higher than that of BiLSTM and the F-score of the best model is 95.89%.
2018 Vol. 37 (12): 1204-1211 [Abstract] ( 202 ) HTML (1 KB)  PDF (425 KB)  ( 896 )
1212 Effects of Intelligent Situation, Learning Acuity, and Smart Service: Empirical Analysis of 97 Librarians of “Double First-Rate” Universities
Zhang Qingpu, Chen Mang, Hou Zhiping
DOI: 10.3772/j.issn.1000-0135.2018.12.005
The rapid popularization of advanced information technology and intelligent situations has brought new opportunities for the development of smart services in university libraries. However, the willingness of librarians to innovate and provide efficient intelligent services is the key issue affecting the effectiveness of smart services. Based on systems theory and the technology adoption model (TAM), this paper constructs a model that considers librarians’ willingness to innovate and smart service behavior, from the perspective of learning acuity (intrinsic motivation) and intelligence situation (exterior situation). This paper explores the relationship between the intelligence situation, learning acuity, and smart service, and as well as the path of action and the mechanism of influence. Then, through empirical analysis of 97 librarians of “double first-rate” universities, this paper verifies the research hypothesis and conceptual model. The results showed that the learning acuity and the intelligence situation are the key factors that have a significant positive impact on librarians' intelligent service behavior and willingness to innovate. The librarians' willingness to innovate in service is an important intermediate variable for intelligent situations, learning acuity, and smart services. Finally, based on the research results, this paper puts forward corresponding suggestions and strategies for service innovation and management of the university library’s smart service.
2018 Vol. 37 (12): 1212-1223 [Abstract] ( 185 ) HTML (1 KB)  PDF (589 KB)  ( 811 )
1224 Research Focus of Three Northeast Provinces Based on Scientific Knowledge Map
Chen Xiaoling, Liu Dongliang
DOI: 10.3772/j.issn.1000-0135.2018.12.006
The aim of this work is to detect the research focus and disciplinary trend of the northeast provinces in the past 5 years by the visual analysis tool CiteSpace. This study is based on the original data of 8535 articles published in Jilin, Heilongjiang, and Liaoning province in the Web of Science database during 2012-2016. First, the number of published papers and highly cited documents per year was measured. Second, CiteSpace software was used to analyze co-institutional, keyword co-word analysis, and disciplinary co-occurrence. The number of published papers in the northeast provinces has been on the rise. The core institutions are Jilin University, the Chinese Academy of Sciences, and China Medical University. In the field of chemistry, medicine, and engineering, they have achieved fruitful research results.
2018 Vol. 37 (12): 1224-1231 [Abstract] ( 221 ) HTML (1 KB)  PDF (1078 KB)  ( 712 )
1232 Growth of Talent from the Perspective of Academic Genealogy: Taking the Laureates of Turing Award Artificial Intelligence Field for Example
Wang Shuang, Zhao Xiaoyuan, Pan Yuntao, Wang Yunhong
DOI: 10.3772/j.issn.1000-0135.2018.12.007
Academic genealogy research is an academic group consisting of scholars linked together based on academic inheritance. Advanced academic inheritance is an inherent motivation for the continuous propagation of an outstanding academic genealogy. This research constructs an analytical framework of academic background, commitment, and network three-dimensionality. It also uses the curriculum vitae and academic output of the Turing Award for artificial intelligence as a data source for empirical analysis, and comprehensively uses scientific measurement and intelligence analysis methods, combining résumé analysis and social network analysis to explore the general characteristics and patterns of the growth of scientific and technological talents. The research results can provide a reference for discovering and introducing talents.
2018 Vol. 37 (12): 1232-1240 [Abstract] ( 226 ) HTML (1 KB)  PDF (1033 KB)  ( 659 )
1241 Research on Sentiment Evaluation of Online Public Opinion Based on the Bayesian Model in a Mobile Environment: The Case of “China Women’s Volleyball Won the Championship in the Rio Olympics” in SinaWeibo
Wang Xiwei, Zhang Liu, Wen Qing, Wang Nan’axue
DOI: 10.3772/j.issn.1000-0135.2018.12.008
It is extremely important to understand and master the evolution rules of the emotional evolution of online public opinion forum users and to build an emotional evolution model within the mobile environment. It is also important for the relevant departments to strengthen the supervision of online public opinion information within the mobile environment and to guide the trends of online public opinion correctly. On the basis of the naive Bayes model, this paper uses training of word vectors, text preprocessing, and performance evaluation to process emotional evolution. Using the three visual dimensions of word frequency, region, and time, we constructed an emotional analysis model of users' online public opinion comments within a mobile environment, and, as an example, we used the sinaWeibo “China Women’s Volleyball Won the Championship in the Rio Olympics” topic to visualize and analyze the process of emotional evolution. The emotional analysis model constructed in this paper can be effectively applied to research on the emotional evolution of online public opinion users within the mobile environment. The empirical results demonstrate that Sina micro-blog users mainly applied positive emotions to the “China Women’s Volleyball Won the Championship in the Rio Olympics” topic. Developed areas, such as the municipalities directly under the central government and the coastal provinces, compared to the southwest, the less developed provinces in Northeast China and other less developed provinces, were more optimistic about the topic.
2018 Vol. 37 (12): 1241-1248 [Abstract] ( 185 ) HTML (1 KB)  PDF (1032 KB)  ( 1137 )
1249 An Essay on Theoretical Design and Practical Basis of Experimental Information Science
Liu Qianli, Fred Ying Ye
DOI: 10.3772/j.issn.1000-0135.2018.12.009
With reference to experimental psychology and experimental economics, we proposed a theoretical design for experimental information science, where the principle of empirical research and the principle of data induction become two pillars. The practical basis was characterized by experimental studies, laboratory establishing, and experimental courses. It is indicated that the experimental information science may mark a turn of information science, and promote its progress.
2018 Vol. 37 (12): 1249-1261 [Abstract] ( 241 ) HTML (1 KB)  PDF (1701 KB)  ( 657 )
1262 An International Comparison on the Use of Theories in Information Science
Wang Fang, Yang Jing, Chen Feng
DOI: 10.3772/j.issn.1000-0135.2018.12.010
This study combined dictionary and rule extraction methods to automatically identify theories used in 2872 articles of three international journals in information science from 2007 to 2016. On the basis of four coding schemas developed for country, theory, subject, and source discipline of theories, this paper analyzed the difference in the use and source disciplines of theories between countries by adopting the research methods of bibliometrics and content analysis. Then, the results of the three international research journals and the Journal of the China Society for Scientific and Technical Information were compared; results revealed the characteristics and differences in the ways theories are used and sourced in the research related to information science in various countries.
2018 Vol. 37 (12): 1262-1274 [Abstract] ( 286 ) HTML (1 KB)  PDF (586 KB)  ( 697 )