|
|
Mapping and Migration of Medical and Health Big Data with SNOMED CT |
Chen Donghua1, Zhang Runtong1, Fu Lei2, Shang Xiaopu1, Zhu Xiaomin3 |
1. School of Economics and Management, Beijing Jiaotong University, Beijing 100044; 2. Core Laboratory of Translational Medicine, Chinese PLA General Hospital, Beijing 100853; 3. School of Mechanical, Electronic and Control Engineering, Beijing Jiaotong University, Beijing 100044 |
|
|
Abstract This paper reports the application of the core system of Systematized Nomenclature of Medicine—Clinical Terms (SNOMED CT) to the analyzing process of medical and health big data, and proposes a mapping and migration approach through SNOMED CT. The system provides reference of relevant processes, models, and algorithms to analyze medical and health big data effectively. First, the implicit relationships between the data and SNOMED CT are analyzed. Then, through the use of the SNOMED CT, four stages in our method are illustrated; evaluating the mapping requirement, establishing the mapping model, verifying the model, and maintaining the model. Finally, we use a real mapping case of big data to demonstrate the feasibility of the proposed method. By examining the complex medical concepts and their semantic relationship set in SNOMED CT, our proposed method promotes deeper mining of knowledge from existing medical and health big data, which holds great significance for the development of medical informatics.
|
Received: 15 November 2017
|
|
|
|
[1] Koopman B, Zuccon G, Nguyen A, et al.Automatic ICD-10 classification of cancers from free-text death certificates[J]. International Journal of Medical Informatics, 2015, 84(11): 956-65. [2] Ji X, Ritter A, Yen P Y.Using ontology-based semantic similarity to facilitate the article screening process for systematic reviews[J]. Journal of Biomedical Informatics, 2017, 69: 33-42. [3] Pinto V B, Rabelo C R D O, Girao I P T. SNOMED-CT as standard language for organization and representation of the information in patient records[J]. Knowledge Organization, 2014, 41(4): 311-318. [4] 曹馨宇, 郭玉峰, 虞雪云, 等. SNOMED CT运行管理机制及对中医临床术语管理方式的启示[J]. 中国科技术语, 2016, 18(5): 8-13. [5] 钟伶, 林丹红, 林晓华. 临床医学系统术语SNOMED CT的特点及其应用[J]. 中华医学图书情报杂志, 2007, 16(2): 58-60. [6] 董燕, 朱玲, 朱彦, 等. 中医临床术语系统分类与顶层本体分类的比较[J]. 中国数字医学, 2017, 12(8): 32-34. [7] 郭玉峰, 刘保延, 姚乃礼, 等. 基于SNOMED CT核心构架研究的中医临床术语集标准化特征要素初探[J]. 中国中医药信息杂志, 2008, 15(9): 96-97. [8] 李莎莎, 董燕, 孟凡红, 等. SNOMED CT的应用现状及发展趋势[J]. 中国数字医学, 2016, 11(1): 100-102. [9] 郭玉峰, 刘保延, 周雪忠. SNOMED CT 2007的顶级概念分类详解[J]. 中华中医药学刊, 2008, 26(9): 1928-1932. [10] Monsen K A, Finn R S, Fleming T E, et al.Rigor in electronic health record knowledge representation: Lessons learned from a SNOMED CT clinical content encoding exercise[J]. Informatics for Health and Social Care, 2016, 41(2): 97-111. [11] 夏光辉, 李军莲, 李晓瑛, 等. SNOMEDCT概念关系表达与语义检索[J]. 医学信息学杂志, 2017(3): 49-53. [12] Lópezgarcía P, Schulz S.Structural patterns under X-rays: Is SNOMED CT growing straight?[J]. PLoS ONE, 2016, 11(11): e0165619. [13] Souvignet J, Declerck G, Asfari H, et al.OntoADR a semantic resource describing adverse drug reactions to support searching, coding, and information retrieval[J]. Journal of Biomedical Informatics, 2016, 63: 100-107. [14] 牟冬梅, 张艳侠, 黄丽丽, 等. 基于SNOMED CT和FCA的医学领域本体构建研究[J]. 情报学报, 2013, 32(6): 653-662. [15] Mortensen J M, Minty E P, Januszyk M, et al.Using the wisdom of the crowds to find critical errors in biomedical ontologies: a study of SNOMED CT[J]. Journal of the American Medical Informatics Association, 2015, 22(3): 640-648. [16] Dugas M, Meidt A, Neuhaus P, et al.ODMedit: uniform semantic annotation for data integration in medicine based on a public metadata repository[J]. BMC Medical Research Methodology, 2016, 16: 65. [17] 李斌. SNOMED CT与OpenEHR的整合探究[J]. 医学信息学杂志, 2012, 33(1): 46-49. [18] Roldángarcía M D M, Garcíagodoy M J, Aldanamontes J F. Dione: An OWL representation of ICD-10-CM for classifying patients’ diseases[J]. Journal of Biomedical Semantics, 2016, 7(1): 62. [19] Perez-Rey D, Alonso-Calvo R, Paraiso-Medina S, et al.SNOMED2HL7: A tool to normalize and bind SNOMED CT concepts to the HL7 Reference Information Model[J]. Computer Methods and Programs in Biomedicine, 2017, 149: 1-9 [20] Petrova A, Ma Y, Tsatsaronis G, et al.Formalizing biomedical concepts from textual definitions[J]. Journal of Biomedical Semantics, 2015, 6: 22. [21] Rui Z, Liu J L, Huang Y, et al.Enriching the international clinical nomenclature with Chinese daily used synonyms and concept recognition in physician notes[J]. BMC Medical Informatics and Decision Making, 2017, 17: 54. [22] Harris M R, Langford L H, Miller H, et al.Harmonizing and extending standards from a domain-specific and bottom-up approach: an example from development through use in clinical applications[J]. Journal of the American Medical Informatics Association, 2015, 22(3): 545-552. [23] Pang C, Sollie A, Sijtsma A, et al. SORTA: a system for ontology-based re-coding and technical annotation of biomedical phenotype data[J]. Database (Oxford), 2015, 2015: Article ID bav089. [24] Haroske G, Schrader T.A reference model based interface terminology for generic observations in Anatomic Pathology Structured Reports[J]. Diagnostic Pathology, 2014, 9: S4. [25] Højen A R, Sundvall E, Gøeg K R.Methods and applications for visualization of SNOMED CT concept sets[J]. Applied Clinical Informatics, 2014, 5(1): 127-152. [26] Cui L, Zhu W, Tao S, et al.Mining non-lattice subgraphs for detecting missing hierarchical relations and concepts in SNOMED CT[J]. Journal of the American Medical Informatics Association, 2017, 24(4): 788-798. [27] Xing G, Zhang G Q, Cui L.FEDRR: fast, exhaustive detection of redundant hierarchical relations for quality improvement of large biomedical ontologies[J]. Biodata Mining, 2016, 9(1): 31. [28] Zvára K, Tomečková M, Peleška J, et al.Tool-supported interactive correction and semantic annotation of Narrative Clinical Reports[J]. Methods of Information in Medicine, 2017, 56(3): 217-229. |
|
|
|