|
|
2020 Vol. 39, No. 6
Published: 2020-06-28 |
|
|
|
|
|
|
|
565 |
A Method for Institution Name Normalization Based on Institution-Author Vectors Hot! |
|
|
Lyu Dongqing, Lu Hongru, Cheng Ying, Sun Haixia |
|
|
DOI: 10.3772/j.issn.1000-0135.2020.06.001 |
|
|
Institution transition is one reason behind variety in institution names. Normalization of institution names benefits both information retrieval recall and the reliability of bibliometric research results. Thus, this paper proposes a method for institution name normalization based on the stable feature of personnel in an academic institution in the short term. Specifically, institution-author and institution-annual vectors are constructed for each academic institution, and the similarity of the integrated institution-author vectors, the number of co-authors, and mapping rules are used to identify transition relationships between two institutions, including renaming, merger, split, and reorganization. The method was tested using data from the CSSCI database between 1999 and 2015. After controlling for the impact of personnel turnover and homonymous authors, the proposed method demonstrated excellent performance in both accuracy and recall. |
|
|
2020 Vol. 39 (6): 565-578
[Abstract]
(
319
)
HTML
(138 KB)
PDF
(2447 KB)
(
794
) |
|
|
|
589 |
Automatic Recognition of Research Methods from the Full-text of Academic Articles Hot! |
|
|
Zhang Chengzhi, Zhang Yingyi |
|
|
DOI: 10.3772/j.issn.1000-0135.2020.06.003 |
|
|
The degree ofresearch methods standardization marks the maturity of a discipline s development. In information science, theoretical analysis and normative research has gradually started to attract the attention of researchers. However, there is a lack of research on quantitative analyses of research methods. In addition, when a research method appears in an academic article, this implies that either the research method is used in the article, or it is just cited for analysis or comparison. Through research methods, researchers can quickly understand the key contents of the academic articles. Summarizing the research methods cited by academic papers helps in clarifying their evolution and development mode in the field. Thus, this paper divides research methods into those reported by and those cited in academic articles. First, this article compares a variety of automatic named entity recognition method, such as BiLSTM (bi-directional long short-term memory), from which an optimal model for final research method entities identification would be selected. The experimental results show that the character vector based BiLSTM joint training model combined with a CRF (conditional random field) yields the best performance. This paper analyzes research methods’ use in information science through the extracted research method entities. The results of statistical analysis show that the usage and citation frequency of experimental methods is the highest in information science. |
|
|
2020 Vol. 39 (6): 589-600
[Abstract]
(
351
)
HTML
(174 KB)
PDF
(1551 KB)
(
1038
) |
|
|
|
630 |
Topic Mining of Online Reviews Based on Gaussian Latent Dirichlet Allocation Hot! |
|
|
Guo Xianda, Zhao Narisa, Gao Huan, Yang Xinyi |
|
|
DOI: 10.3772/j.issn.1000-0135.2020.06.007 |
|
|
This study proposes a method based on Gaussian latent Dirichlet allocation (LDA) for online comments to overcome the limitations of the current topic mining methods, such as sparseness and semantic incoherence of generated topics, that result in a poor applicability. The word vectors of online comments are obtained by word2vec training, and the topic distribution of online comments is achieved based on the Gaussian LDA model. The topic distribution is then used to calculate the similarity matrix of comments, and the affinity propagation clustering algorithm is employed to cluster online comments. The topic discovery is realized by analyzing the clustering results. Finally, the TextRank algorithm is used to extract the key sentences of each topic to generate the topic summary so that the description of the topic can be completed. The proposed method effectively alleviates the information overload problem of consumers online comments. The effectiveness and practical application value of the proposed method have been established through experiments and calculations performed on online product reviews from seven platforms, such as Taobao, Jingdong, and Douban. |
|
|
2020 Vol. 39 (6): 630-639
[Abstract]
(
385
)
HTML
(137 KB)
PDF
(1634 KB)
(
1061
) |
|
|
|
640 |
Methodological and Automatic Sentence Extraction from Academic Article s Full-text Hot! |
|
|
Zhang Yingyi, Zhang Chengzhi |
|
|
DOI: 10.3772/j.issn.1000-0135.2020.06.008 |
|
|
Research methods are essential in the scientific literature. These include methods, tools, or techniques for solving problems in the field. Theresearch method s description is usually presented through sentences. Summarizing these scattered sentences in the scientific literature can help researchers to quickly explore appropriate research methods. According to the method s purpose in the research paper, the research method sentence is further divided into method used and method cited sentences. The method used sentence refers to the sentence that describes the research method used in the paper and the method cited sentence refers to that cited by the paper. In this study, a variety of neural network-based sentence classification models are used for extracting the method sentences from the scientific literature s full-text. At the word vector representation layer, the study uses two-word vector models: BERT and word2vec. In the feature selection layer, three different networks are utilized: convolutional neural network (CNN), bidirectional LSTM (BiLSTM), and attention mechanism network. In addition, the study uses two model training methods a single-level structure and a two-level structure. The experimental results show that the BERT-based BiLSTM model with single-level structure achieves the best performance. This paper analyzes the distribution of research method sentences extracted from the Journal of The China Society for Scientific and Technical Information. The analysis indicates that this journal paid more attention to the theoretical developments of information science; in addition, the journal also focused on constructing theoretical systems for this discipline. |
|
|
2020 Vol. 39 (6): 640-650
[Abstract]
(
330
)
HTML
(154 KB)
PDF
(1677 KB)
(
1253
) |
|
|
|
651 |
Review of Research Progress on Emerging Technologies Identification Based on Quantitative and Evolutionary Perspectives Hot! |
|
|
Lu Xiaobin, Yang Guancan, Xu Shuo, Zhang Yangyi |
|
|
DOI: 10.3772/j.issn.1000-0135.2020.06.009 |
|
|
Emerging technologies identification has always been the focus of scientific and technological innovation management, scientific and technological policy-making, and technologically competitive intelligence research. Although significant academic research has been performed in this field, the conceptual definition of “emerging technology” has seriously restricted its development, which is attributed to expanding the conceptual boundary of emerging technologies from two different cognitive perspectives: quantitative and evolutionary. Therefore, the basis for clarifying the concept of emerging technologies identification is to first understand the characteristics and application scenarios of the two perspectives. In this paper, first, a framework consisting of three parts is proposed: characteristics, data representations and methods of emerging technologies identification. This framework can comprehensively cover the practical progress of emerging technologies identification from the current quantitative perspective. Subsequently, through literature comparison, it was found that the rationality of the evolutionary perspective lies in the fact that the proposed framework cannot explain the following four issues: radical innovation based on technological recombination, fusion effect of disciplines and technological networks, driving effect by technological practicability and efficiency, and disruptive innovation, which promote the transformation of data representation and recognition methods from the evolutionary perspective. Finally, preliminary research prospects of the development of data representation and recognition methods of emerging technologies identification are described. The research will provide references for the further development of emerging technologies identification activities, by comparing its understanding and practice from two different perspectives. |
|
|
2020 Vol. 39 (6): 651-661
[Abstract]
(
247
)
HTML
(125 KB)
PDF
(1368 KB)
(
808
) |
|
|
|
662 |
Origin, Application, and Development of Message Framing Theory in Foreign Health Behavior Research Hot! |
|
|
Yang Mengqing, Zhao Yuxiang, Song Shijie, Qinghua nd Zhu |
|
|
DOI: 10.3772/j.issn.1000-0135.2020.06.010 |
|
|
This paper reviews the origin, application, and development of the message framing theory in foreign health behavior research, and to provide theoretical reference for health information behavior research in China. By discussing the main characteristics in the different application phases of the message framing theory in health behavior studies, we first described the evolution of this research topic. Then, a detailed analysis of the application of the message framing theory was created from two perspectives, namely framing and cognition. Finally, we put forward possible breakthroughs for future research based on the results of the previous review. The development of the message framing theory in health behavior research can be divided into three stages, namely theme formation stage, theme development stage, and theme maturity stage. Each stage has distinct characteristics. Based on different framing design ideas, the related research can be divided into three categories, gain- and loss-framed message, temporal framing, and narrative. Self-efficacy, information processing, behavioral motivation, and health beliefs are common cognitive perspectives when discussing the framing effect in health behaviors. At the end of this paper, combined with the research theme of information management, we propose that new research work can be carried out in three directions: information tailoring in the online health community, framing context in health information application adoption, and misinformation in health information dissemination. |
|
|
2020 Vol. 39 (6): 662-674
[Abstract]
(
598
)
HTML
(152 KB)
PDF
(3431 KB)
(
1128
) |
|
|
|