|
|
2020 Vol. 39, No. 4
Published: 2020-04-28 |
|
|
|
|
|
|
|
345 |
Construction of Information System Oriented to Identifying the Frontier of Science and Technology in Key Areas Hot! |
|
|
Liu Qiyan, Zeng Wen, Che Yao |
|
|
DOI: 10.3772/j.issn.1000-0135.2020.04.001 |
|
|
Recently, the development trend of science and technology in China has entered the stage of “following, merging, and leading.” To adapt to the new situation of science and technology, we need to develop frontier knowledge of science and technology in key areas, follow up the new developments of science and technology in major foreign countries in an all-round way, and perceive and judge the future development trend of science and technology. Additionally, we need to play a better role in ensuring the development of information systems. Based on the analysis of the current research situation all around the world, this study proposes and expounds the framework and methods for building an information system to identify the frontier of science and technology in key areas, and introduces the information practice in relevant research fields. The results show that the intelligence system and method of using multi-dimensional data to realize intelligence perception for identifying the science and technology frontier in key fields are operable. |
|
|
2020 Vol. 39 (4): 345-356
[Abstract]
(
591
)
HTML
(92 KB)
PDF
(4942 KB)
(
1069
) |
|
|
|
357 |
Evaluation of Academic Papers Impact Based on Scientific Communication Path: A Case Study of Chinese International Academic Papers in Social Sciences Hot! |
|
|
Guo Fengjiao, Zhao Rongying, Sun Shaomin |
|
|
DOI: 10.3772/j.issn.1000-0135.2020.04.002 |
|
|
Based on the scientific communication path of academic papers, the academic papers impact is divided into native impact, offline impact, Web 1.0 impact, and Web 2.0 impact, and the generation processes of various types of impact are discussed. An evaluation model of the academic papers impact is constructed, and two sets of data (including a control group) obtained from the Chinese international academic papers in social sciences are empirically studied. This study has obtained the following research results andfirstly, the score of the academic papers impact demonstrates the highest correlation with relative impact factor and citation frequency. Secondly, the impact score of data group 1 is still lower than that of the overall control group, and the trend line of the academic papers impact score in both groups shows a right-leaning trend. Thirdly, the impact score of the 50 research institutes with the highest total score of academic papers decreases exponentially, and universities are the backbone of international social science research in China. Lastly, among the 79 subjects covered by the two data groups, the average score of data group 1 with 24 subjects is higher than that of the control group, and the subjects covered by academic papers are classified according to their relative influence. |
|
|
2020 Vol. 39 (4): 357-366
[Abstract]
(
299
)
HTML
(141 KB)
PDF
(1234 KB)
(
831
) |
|
|
|
367 |
Application of Network Representation Learning in the Prediction of Scholar Academic Cooperation Hot! |
|
|
Lin Yuan, Wang Kaiqiao, Liu Haifeng, Xu Kan, Ding Kun, Sun Xiaoling |
|
|
DOI: 10.3772/j.issn.1000-0135.2020.04.003 |
|
|
In the context of Big Data, scientific cooperation has become an important means to improve the level of scientific research and output. A research focus in recent years includes accurately identifying the cooperation objects that are suitable for scholars, institutions, and fields among the vast numbers of these entities. This study constructs a co-occurrence network of author-author, institution-institution, author-institution, author-keyword, and institution-keyword through the recorded data of scientific literature in the field of science of science. The network representation method is used to learn the context information of authors, institutions, and keywords in a network, and the information entity is represented as a low-dimensional dense vector of the same space. Finally, the mining of the cooperation object is achieved based on the similarity calculation of representation vector. The network representation learning method can realize a variety of heterogeneous information fusion, quantitatively calculate the correlation strength between each information entity, capture the relationship between scholars-scholars, scholars-institutions, and scholars-keywords in the research network, accurately explore potential collaborators, and partner institutions and keywords for scholars. |
|
|
2020 Vol. 39 (4): 367-373
[Abstract]
(
279
)
HTML
(71 KB)
PDF
(2280 KB)
(
924
) |
|
|
|
374 |
Query-oriented Opinion Summarization Model Using Debatepedia as Datasource Hot! |
|
|
Yu Chuanming, Zheng Zhiliang, Zhu Xingyu, An Lu |
|
|
DOI: 10.3772/j.issn.1000-0135.2020.04.004 |
|
|
This study systematically studies query-oriented opinion summarization, aiming to construct a query-oriented opinion summarization framework to explore the impact of different text summarization methods on the opinion summarization. Considering sentiment orientation and the similarity between the sentence and query, we extract the sentences from the original documents and employ neural networks and word embeddings to achieve abstractive summarization. The query-oriented opinion summarization framework is then constructed upon this. We crawl the topics and arguments to build the experimental opinion summarization dataset from the Debatepedia websites and apply the proposed method to the dataset to validate its effect. The experimental results show that on this dataset, the summaries generated by the extractive method are of higher quality; the highest average ROUGE score, deep semantic similarity score, and emotional score are 6.58%, 1.79%, and 11.52% higher than the generative method, and 8.33%, 2.80%, and 13.86% higher than the combined method, respectively. Furthermore, the evaluation indicator deep sentence similarity and the sentiment score proposed in this study can better evaluate the effects of the query-oriented opinion summarization model. The research results are of great significance for improving the effects of query-oriented opinion summarization and promoting the application of the opinion summarization model in the field of information science. |
|
|
2020 Vol. 39 (4): 374-386
[Abstract]
(
249
)
HTML
(191 KB)
PDF
(1580 KB)
(
539
) |
|
|
|
387 |
Learning Concept Hierarchies from Chinese Academic Literature for Domain Ontology Construction Hot! |
|
|
Tang Lin, Guo Chonghui, Chen Jingfeng, Sun Leilei |
|
|
DOI: 10.3772/j.issn.1000-0135.2020.04.005 |
|
|
Constructing domain ontology from academic literature has great significance in promoting discipline development. Taking Chinese academic literature as a data source, this study proposed a semi-automatic method for extracting concept hierarchy. First, a fine-grained universal research framework for constructing hierarchical relations of the domain ontology was proposed. Then, a novel concept representation fusion method was developed, considering concepts semantic features based on deep learning and concept frequency in time series. Combined with an affinity propagation (AP) clustering algorithm, Prim s algorithm, and data from a Web search engine, the ontology concept hierarchy extraction algorithm was proposed via rule-based reasoning (RROCHE). Concept hierarchy relations are learned semi-automatically. The algorithm was then applied to the academic literature on Chinese word segmentation. Numerical experiments examined the feasibility and effectiveness of the proposed methods. The proposed method can also be applied effectively and widely to other domains. |
|
|
2020 Vol. 39 (4): 387-398
[Abstract]
(
322
)
HTML
(159 KB)
PDF
(3549 KB)
(
870
) |
|
|
|
409 |
Automatic Extraction of Chinese Terminology Based on BERT Embedding and BiLSTM-CRF Model Hot! |
|
|
Wu Jun, Cheng Yao, Hao Han, Ailiyaer·Aizezi, Liu Feixue, Su Yipo |
|
|
DOI: 10.3772/j.issn.1000-0135.2020.04.007 |
|
|
High quality professional term recognition and its extraction play an important role in the fields of domain information retrieval and knowledge graph building. To improve the precision and recall rate of terminology recognition, we propose a Chinese terminology recognition and extraction approach that does not rely on specific domain knowledge or artificial features. Using the latest developments in representation learning, this study introduces BERT embedding as a character-based pre-trained model and incorporates it with a bi-directional long short-term memory (BiLSTM) and a conditional random field (CRF) to extract deep learning terminologies from 1278 annotated datasets collected from domain e-books. The experimental results show that the proposed model reaches 92.96% in F-score and outperforms other competing algorithms, such as left and right entropy, mutual information, a word2vec based similar terminology recognition algorithm, and a BiLSTM-CRF model. The best practices and related procedures for the implementation of the proposed model are also provided to guide its users in its further improvement. |
|
|
2020 Vol. 39 (4): 409-418
[Abstract]
(
261
)
HTML
(106 KB)
PDF
(2255 KB)
(
1733
) |
|
|
|
419 |
Analysis of Users Passive Use Behavior Patterns on Social Network Platforms Based on Time Characteristics Hot! |
|
|
Lu Xinyuan, Huang Mengmei, Lu Quan, Wang Xuelin |
|
|
DOI: 10.3772/j.issn.1000-0135.2020.04.008 |
|
|
This study considers the Zhihu platform and uses the Python software to obtain the data related to users information behaviors, such as publishing articles, answering questions, asking questions, and participating in live activities. The time characteristics, frequency, and time interval of users behaviors are then analyzed, and the time characteristics and rules of users negative behaviors on social network platforms are examined. The results show that some users may reduce their use behavior, intermittently drop out, or interrupt during their use of social network platforms, thereby presenting different degrees of negative behavior characteristics. For identifying users passive use behavior, obtaining an insight into the underlying causes and formulating targeted measures is necessary. This analysis can guide the users to continuously use the social network platforms. |
|
|
2020 Vol. 39 (4): 419-426
[Abstract]
(
471
)
HTML
(81 KB)
PDF
(2115 KB)
(
795
) |
|
|
|
436 |
Modeling Research of Users Dynamic Interests Based on Ontology and Folksonomy Hot! |
|
|
Li Yuanyuan, Li Xuhui |
|
|
DOI: 10.3772/j.issn.1000-0135.2020.04.010 |
|
|
Web 2.0 application technologies, such as blog, instant messaging, social network, community sharing, and Folksonomy, enable the combination of users, information, and resources to form a closely related web network. In recent times, there have been limited research studies that apply ontology to analyze the dynamic interests of the users of social tags. Therefore, a platform’s requirements related to an accurate understanding of its users’ interests are not fulfilled. Our study discusses the construction method of interest model from this standpoint. The ontology of interest labels for Douban readers is constructed in this study on the basis of restriction rules and relationship definitions of words in the Chinese Classification Thesaurus and Chinese Library Classification. According to the index of reproduction, coverage, and calorific rate, the interest intensity and stability of tags are predicted, and the expression form of interest is determined. The initial interest model is then constructed, and the corresponding renewal process of interest nodes is proposed. The user’s interest model combined with ontology and its update process have improved the expression depth and width of user interests to a certain extent. Thus, this work demonstrates a higher scientific reference significance in the applicability of resource recommendation, retrieval, and other applications. |
|
|
2020 Vol. 39 (4): 436-449
[Abstract]
(
250
)
HTML
(127 KB)
PDF
(3965 KB)
(
656
) |
|
|
|