CN111599427A - A recommended method, device, electronic device and storage medium for unified diagnosis - Google Patents
A recommended method, device, electronic device and storage medium for unified diagnosis Download PDFInfo
- Publication number
- CN111599427A CN111599427A CN202010409050.0A CN202010409050A CN111599427A CN 111599427 A CN111599427 A CN 111599427A CN 202010409050 A CN202010409050 A CN 202010409050A CN 111599427 A CN111599427 A CN 111599427A
- Authority
- CN
- China
- Prior art keywords
- diagnosis
- disease
- cluster
- unified
- similarity
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
Images
Classifications
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16H—HEALTHCARE INFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR THE HANDLING OR PROCESSING OF MEDICAL OR HEALTHCARE DATA
- G16H10/00—ICT specially adapted for the handling or processing of patient-related medical or healthcare data
- G16H10/60—ICT specially adapted for the handling or processing of patient-related medical or healthcare data for patient-specific data, e.g. for electronic patient records
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/20—Natural language analysis
- G06F40/279—Recognition of textual entities
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/30—Semantic analysis
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16H—HEALTHCARE INFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR THE HANDLING OR PROCESSING OF MEDICAL OR HEALTHCARE DATA
- G16H15/00—ICT specially adapted for medical reports, e.g. generation or transmission thereof
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16H—HEALTHCARE INFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR THE HANDLING OR PROCESSING OF MEDICAL OR HEALTHCARE DATA
- G16H50/00—ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics
- G16H50/20—ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics for computer-aided diagnosis, e.g. based on medical expert systems
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y02—TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
- Y02A—TECHNOLOGIES FOR ADAPTATION TO CLIMATE CHANGE
- Y02A90/00—Technologies having an indirect contribution to adaptation to climate change
- Y02A90/10—Information and communication technologies [ICT] supporting adaptation to climate change, e.g. for weather forecasting or climate simulation
Landscapes
- Engineering & Computer Science (AREA)
- Health & Medical Sciences (AREA)
- General Health & Medical Sciences (AREA)
- Medical Informatics (AREA)
- Public Health (AREA)
- Theoretical Computer Science (AREA)
- Epidemiology (AREA)
- Primary Health Care (AREA)
- Physics & Mathematics (AREA)
- Computational Linguistics (AREA)
- Audiology, Speech & Language Pathology (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Artificial Intelligence (AREA)
- Biomedical Technology (AREA)
- Data Mining & Analysis (AREA)
- Databases & Information Systems (AREA)
- Pathology (AREA)
- Medical Treatment And Welfare Office Work (AREA)
- Measuring And Recording Apparatus For Diagnosis (AREA)
Abstract
本发明涉及健康管理中疾病辅助诊断的技术领域,具体涉及一种一元化诊断的推荐方法、装置、电子设备及存储介质。根据受检者的诊断编码,确定所述诊断编码与簇Ck中的主/次要典型疾病诊断编码集的相似性度量结果;所述簇Ck是根据N个受检者诊断信息之间的相似性划分的K个簇;选择大于预设阈值的相似度值所属的簇,从一元化诊断库中获取每个簇所对应的疾病共现模式;将所获得的疾病共现模式结合所述诊断编码在ICD本体结构进行可视化分析,按照同一分支下所述诊断编码间的语义关系,识别最近父节点,得到疾病类型并重新排序得到该受检者的一元化诊断,解决了对于同一位受检者,不同主检医师给出的一元化诊断结果存在较大差异的技术问题。
The invention relates to the technical field of auxiliary diagnosis of diseases in health management, in particular to a recommended method, device, electronic device and storage medium for unified diagnosis. According to the diagnostic codes of the subjects, the similarity measurement result between the diagnostic codes and the primary/secondary typical disease diagnostic code sets in the cluster C k is determined; the cluster C k is based on the difference between the diagnostic information of N subjects. K clusters divided by the similarity of the The diagnostic codes are visually analyzed in the ontology structure of the ICD. According to the semantic relationship between the diagnostic codes under the same branch, the nearest parent node is identified, and the disease type is obtained and reordered to obtain the unified diagnosis of the subject, which solves the problem of the same subject. However, there is a technical problem that the unified diagnosis results given by different chief inspectors are quite different.
Description
技术领域technical field
本发明涉及健康管理中疾病辅助诊断的技术领域,具体涉及一种一元化诊断的推荐方法、装置、电子设备及存储介质。The invention relates to the technical field of auxiliary diagnosis of diseases in health management, in particular to a recommended method, device, electronic device and storage medium for unified diagnosis.
背景技术Background technique
健康体检报告是健康管理(体检)机构给受检者出具的医学文书,包括健康体检报告首页、主检报告、体格检查记录、实验室和医学影像检查报告等。主检报告作为体检报告的核心组成部分,目前尚无具体实施细则可供参考,全国各地不同体检机构的书写方式各不相同,质量参差不齐。一元化诊断是主检报告的主要内容,主检医师应本着临床诊断思维的基本原则,特别是一元论原则,按照疾病系统合理归类,尽量用一种疾病去概括或解释疾病的多种临床表现,合理归类可以使主检报告条理清晰。目前各体检机构主检报告多借用相关软件完成,疾病的诊断结果多为各种阳性检查结果的罗列。A health examination report is a medical document issued by a health management (physical examination) institution to the subject, including the first page of the health examination report, the main examination report, the physical examination records, laboratory and medical imaging examination reports, etc. As the core component of the physical examination report, the main inspection report has no specific implementation rules for reference. Unified diagnosis is the main content of the main inspection report. The chief inspector should follow the basic principles of clinical diagnosis thinking, especially the principle of monism, reasonably classify diseases according to the system, and try to use one disease to summarize or explain multiple clinical manifestations of the disease. , Reasonable classification can make the main inspection report clear. At present, the main inspection reports of various medical institutions are mostly completed by borrowing relevant software, and the diagnosis results of diseases are mostly a list of various positive inspection results.
在实际工作中,主检医师通常需要严格核对受检者的基本信息,各项检查结果,总结主、次要诊断和阳性发现,结合自身医学知识,给出相应的诊断结果。In practical work, the chief examiner usually needs to strictly check the basic information of the examinee, various inspection results, summarize the primary and secondary diagnoses and positive findings, and give the corresponding diagnosis results based on their own medical knowledge.
发明人在实践中,发现上述现有技术存在以下缺陷:In practice, the inventor found that the above-mentioned prior art has the following defects:
由于主检医师专业水平的差异、受检者的检查项目不同、主/次要诊断与阳性体征的变异,现有的仅依赖主检医师自身专业知识的一元化诊断方法耗时耗力,效率低、耗时长,同时对于同一位受检者的体检结论,不同主检医师给出的一元化诊断结果也存在较大差异。Due to the differences in the professional level of the chief inspector, the different inspection items of the subjects, the variation of primary/secondary diagnosis and positive signs, the existing unified diagnosis methods that only rely on the professional knowledge of the chief inspector are time-consuming, labor-intensive, and inefficient. , time-consuming, and at the same time, for the same subject's physical examination conclusions, there are also great differences in the unified diagnosis results given by different chief examiners.
发明内容SUMMARY OF THE INVENTION
为了解决上述技术问题,本发明的目的在于提供一种一元化诊断的推荐方法、装置、设备及存储介质,所采用的技术方案具体如下:In order to solve the above-mentioned technical problems, the purpose of the present invention is to provide a recommended method, device, equipment and storage medium for unified diagnosis. The technical solutions adopted are as follows:
第一方面,本发明实施例提供了一种一元化诊断的推荐方法,该方法包括以下步骤:In a first aspect, an embodiment of the present invention provides a recommended method for unified diagnosis, which includes the following steps:
根据受检者的诊断编码,确定所述诊断编码与簇Ck中的主/次要典型疾病诊断编码集的相似性度量结果;所述簇Ck是根据N个受检者诊断信息之间的相似性划分的K个簇;According to the diagnostic codes of the subjects, the similarity measurement result between the diagnostic codes and the primary/secondary typical disease diagnostic code sets in the cluster C k is determined; the cluster C k is based on the difference between the diagnostic information of N subjects. K clusters divided by similarity;
选择大于预设阈值的相似度值所属的簇,从一元化诊断库中获取每个簇所对应的疾病共现模式;Select the cluster to which the similarity value greater than the preset threshold belongs, and obtain the disease co-occurrence pattern corresponding to each cluster from the unified diagnosis database;
将所获得的疾病共现模式结合所述诊断编码在ICD本体结构进行可视化分析,按照同一分支下所述诊断编码间的语义关系,识别最近父节点,得到疾病类型并重新排序得到该受检者的一元化诊断。Visually analyze the obtained disease co-occurrence pattern in combination with the diagnosis code in the ontology structure of the ICD, identify the nearest parent node according to the semantic relationship between the diagnosis codes under the same branch, obtain the disease type and reorder to obtain the subject unified diagnosis.
第二方面,本发明实施例提供了一种一元化诊断的推荐装置,所述装置包括:In a second aspect, an embodiment of the present invention provides a recommendation device for unified diagnosis, the device comprising:
相似性度量模块,用于根据受检者的诊断编码,确定所述诊断编码与簇Ck中的主/次要典型疾病诊断编码集的相似性度量结果;所述簇Ck是根据N个受检者诊断信息之间的相似性划分的K个簇;The similarity measurement module is used to determine the similarity measurement result of the diagnosis code and the primary/secondary typical disease diagnosis code set in the cluster C k according to the diagnosis code of the examinee; the cluster C k is based on N K clusters divided by the similarity between subjects' diagnostic information;
匹配检测模块,用于选择大于预设阈值的相似度值所属的簇,从一元化诊断库中获取每个簇所对应的疾病共现模式;和a matching detection module for selecting clusters to which a similarity value greater than a preset threshold belongs, and obtaining a disease co-occurrence pattern corresponding to each cluster from the unified diagnosis database; and
推荐模块,用于将所获得的疾病共现模式结合所述诊断编码在ICD本体结构进行可视化分析,按照同一分支下所述诊断编码间的语义关系,识别最近父节点,得到疾病类型并重新排序得到该受检者的一元化诊断。The recommendation module is used to visually analyze the obtained disease co-occurrence pattern in combination with the diagnostic codes in the ICD ontology structure, identify the nearest parent node according to the semantic relationship between the diagnostic codes under the same branch, obtain the disease types and reorder them A unified diagnosis for the subject is obtained.
第三方面,本发明实施例提供了一种电子设备,包括:In a third aspect, an embodiment of the present invention provides an electronic device, including:
处理单元;processing unit;
用于存储处理单元可执行指令的存储单元;a storage unit for storing instructions executable by the processing unit;
其中,所述处理单元被配置为:执行上述任意一项所述的方法。Wherein, the processing unit is configured to: execute any of the methods described above.
第四方面,本发明实施例提供了一种存储介质,该存储介质中存储有计算机可读的程序指令,所述程序指令被处理单元执行时实现上述任意一项所述的方法。In a fourth aspect, an embodiment of the present invention provides a storage medium, where computer-readable program instructions are stored in the storage medium, and when the program instructions are executed by a processing unit, any one of the methods described above is implemented.
本发明具有如下有益效果:The present invention has the following beneficial effects:
本发明实施例通过将受检者的诊断编码与主/次要典型疾病诊断编码集的相似性度量结果,从一元化诊断库中获取每个簇所对应的疾病共现模式,并对获取的疾病共现模式结合诊断编码在ICD本体结构进行可视化分析,进而得到疾病类型并重新排序得到该受检者的一元化诊断。针对受检者多种疾病诊断及阳性体征,为主检医师快速、自动化确定主要疾病的一元化诊断提供辅助参考,并根据一元化诊断及其排序结果可以让受检者清楚当前自身健康存在的主要问题与次要问题。解决了对于同一位受检者,不同主检医师给出的一元化诊断结果存在较大差异的技术问题。In the embodiment of the present invention, the disease co-occurrence pattern corresponding to each cluster is obtained from the unified diagnosis database by measuring the similarity between the diagnostic code of the subject and the primary/secondary typical disease diagnosis code set, and the obtained disease The co-occurrence patterns combined with the diagnostic codes are used for visual analysis in the ontology structure of the ICD, and then the disease types are obtained and rearranged to obtain the unified diagnosis of the subject. Aiming at the diagnosis of various diseases and positive signs of the examinee, it provides auxiliary reference for the primary examiner to quickly and automatically determine the unified diagnosis of major diseases, and according to the unified diagnosis and its ranking results, the examinee can clearly understand the main problems of their own health. with secondary issues. It solves the technical problem that for the same subject, the unified diagnosis results given by different chief inspectors are quite different.
附图说明Description of drawings
为了更清楚地说明本发明实施例或现有技术中的技术方案和优点,下面将对实施例或现有技术描述中所需要使用的附图作简单的介绍,显而易见地,下面描述中的附图仅仅是本发明的一些实施例,对于本领域普通技术人员来讲,在不付出创造性劳动的前提下,还可以根据这些附图获得其它附图。In order to more clearly illustrate the technical solutions and advantages in the embodiments of the present invention or in the prior art, the following briefly introduces the accompanying drawings that need to be used in the description of the embodiments or the prior art. Obviously, the accompanying drawings in the following description The drawings are only some embodiments of the present invention, and for those of ordinary skill in the art, other drawings can also be obtained based on these drawings without any creative effort.
图1为本发明一个实施例所提供的一种一元化诊断的推荐方法的流程图;FIG. 1 is a flowchart of a recommended method for unified diagnosis provided by an embodiment of the present invention;
图2为本发明一个实施例所提供的关于一元化诊断库的构建方法的流程图;2 is a flowchart of a method for constructing a unified diagnostic library provided by an embodiment of the present invention;
图3为本发明一个实施例所提供的关于获得主/次要典型疾病诊断编码集的方法流程图;3 is a flowchart of a method for obtaining a primary/secondary typical disease diagnosis code set provided by an embodiment of the present invention;
图4为本发明一个实施例所提供的关于一元化诊断库的构建方法和一元化诊断的推荐方法的整体流程图;4 is an overall flowchart of a method for constructing a unified diagnosis library and a recommended method for unified diagnosis provided by an embodiment of the present invention;
图5为本发明一个实施例所提供的关于ICD-10疾病分类体系局部结构;FIG. 5 is a partial structure of the ICD-10 disease classification system provided by an embodiment of the present invention;
图6为本发明另一个实施例所提供的关于3个簇的典型疾病诊断编码集的结构示意图;6 is a schematic structural diagram of a typical disease diagnosis code set for three clusters provided by another embodiment of the present invention;
图7为本发明另一个实施例所提供的关于簇3的典型疾病共现模式的结构示意图;7 is a schematic structural diagram of a typical disease co-occurrence pattern of
图8为本发明另一个实施例所提供的关于该患者的一元化诊断结果的结构示意图;8 is a schematic structural diagram of a unified diagnosis result about the patient provided by another embodiment of the present invention;
图9为本发明一个实施例所提供的一种一元化诊断的推荐装置的结构框图;9 is a structural block diagram of a recommendation device for unified diagnosis provided by an embodiment of the present invention;
图10为本发明实施例提供的一种电子设备的结构示意图。FIG. 10 is a schematic structural diagram of an electronic device according to an embodiment of the present invention.
具体实施方式Detailed ways
为了更进一步阐述本发明为达成预定发明目的所采取的技术手段及功效,以下结合附图及较佳实施例,对依据本发明提出的一种一元化诊断的推荐方法、装置、电子设备及存储介质,其具体实施方式、结构、特征及其功效,详细说明如下。在下述说明中,不同的“一个实施例”或“另一个实施例”指的不一定是同一实施例。此外,一或多个实施例中的特定特征、结构、或特点可由任何合适形式组合。In order to further illustrate the technical means and effects adopted by the present invention to achieve the predetermined purpose of the invention, the following describes a recommended method, device, electronic device and storage medium for a unified diagnosis according to the present invention with reference to the accompanying drawings and preferred embodiments. , its specific implementation, structure, features and effects are described in detail as follows. In the following description, different "one embodiment" or "another embodiment" are not necessarily referring to the same embodiment. Furthermore, the particular features, structures, or characteristics in one or more embodiments may be combined in any suitable form.
除非另有定义,本文所使用的所有的技术和科学术语与属于本发明的技术领域的技术人员通常理解的含义相同。Unless otherwise defined, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this invention belongs.
下面结合附图具体的说明本发明实施例所提供的一种一元化诊断的推荐方法、装置、电子设备及存储介质的具体方案。The following specifically describes a specific scheme of a recommended method, apparatus, electronic device, and storage medium for unified diagnosis provided by the embodiments of the present invention with reference to the accompanying drawings.
请参阅图1,其示出了本发明实施例所提供的一种一元化诊断的推荐方法的流程图,为了解决不同的主检医师给出的一元化诊断结果存在较大差异的技术问题,本发明实施例通过疾病诊断编码的相似性度量、可视化分析与主检医师的评估,推荐合理的一元化诊断结果,该一元化诊断的推荐方法包括以下步骤:Please refer to FIG. 1 , which shows a flowchart of a recommended method for unified diagnosis provided by an embodiment of the present invention. The embodiment recommends a reasonable unified diagnosis result through the similarity measurement of the disease diagnosis code, the visual analysis and the evaluation of the chief examiner, and the method for recommending the unified diagnosis includes the following steps:
步骤S001,根据受检者的诊断编码,确定所述诊断编码与簇Ck中的主/次要典型疾病诊断编码集的相似性度量结果;所述簇Ck是根据N个受检者诊断信息之间的相似性划分的 K个簇。Step S001, according to the diagnostic code of the examinee, determine the similarity measurement result of the diagnostic code and the primary/secondary typical disease diagnosis code set in the cluster C k ; the cluster C k is based on the diagnosis of N subjects. The similarity between the information is divided into K clusters.
步骤S002,选择大于预设阈值的相似度值所属的簇,从一元化诊断库中获取每个簇所对应的疾病共现模式。Step S002, select the cluster to which the similarity value greater than the preset threshold belongs, and obtain the disease co-occurrence pattern corresponding to each cluster from the unified diagnosis database.
步骤S003,将所获取的疾病共现模式结合所述诊断编码在ICD本体结构进行可视化分析,按照同一分支下所述诊断编码间的语义关系,识别最近父节点,得到疾病类型并重新排序得到该受检者的疾病共现模式。Step S003, the acquired disease co-occurrence pattern is combined with the diagnosis code to perform a visual analysis in the ICD ontology structure, according to the semantic relationship between the diagnosis codes under the same branch, identify the nearest parent node, obtain the disease type and reorder to obtain the disease type. Disease co-occurrence patterns in subjects.
具体到本实施例中,当新受检者进行健康体检后,主检医师根据受检者本次的检查结果,确定一系列主/次要诊断和阳性发现的诊断编码,首先根据相似性度量方法计算该受检者与每个簇的主/次要典型疾病诊断编码集的相似度,得到一组相似性度量结果。将相似性度量结果进行排序,识别最高的相似度值或者排名前五的相似度值,将其所属簇的结果赋予该受检者,即将一元化诊断库中的典型疾病共现模式推荐给主检医师。将该受检者的疾病诊断编码与被推荐的典型疾病共现模式在ICD本体结构进行可视化分析,按照同一分支下疾病诊断编码间的概念语义关系,自动识别最近父节点,并定义新的疾病类型及其次序,确定该受检者的一元化诊断。主检医师根据自身领域知识,对得到的疾病共现模式进行校正、归类、评估与定义等操作,进而得到针对该受检者本次体检的一元化诊断结果。Specifically in this embodiment, after the new subject has undergone a health checkup, the chief examiner determines a series of primary/secondary diagnoses and diagnostic codes for positive findings according to the subject's current examination results, firstly according to the similarity measure Methods Calculate the similarity between the subject and the primary/secondary typical disease diagnosis code set of each cluster, and obtain a set of similarity measurement results. Sort the similarity measurement results, identify the highest similarity value or the top five similarity values, and assign the result of the cluster to which it belongs to the subject, that is, recommend the typical disease co-occurrence pattern in the unified diagnosis database to the main examiner. physician. Visually analyze the subject's disease diagnosis code and the recommended typical disease co-occurrence pattern in the ICD ontology structure, according to the conceptual semantic relationship between disease diagnosis codes under the same branch, automatically identify the nearest parent node, and define a new disease Type and its order, determine the unified diagnosis for this subject. The chief examiner can correct, classify, evaluate and define the obtained disease co-occurrence patterns according to their own domain knowledge, and then obtain a unified diagnosis result for the subject's physical examination.
综上所述,本发明实施例通过将受检者的诊断编码与主/次要典型疾病诊断编码集的相似性度量结果,从一元化诊断库中获取每个簇所对应的疾病共现模式,并对获取的疾病共现模式结合诊断编码在ICD本体结构进行可视化分析,进而得到疾病类型并重新排序得到该受检者的疾病共现模式。针对受检者多种疾病诊断及阳性体征,为主检医师快速、自动化确定主要疾病的一元化诊断提供辅助参考,并根据一元化诊断及其排序结果可以让受检者清楚当前自身健康存在的主要问题与次要问题。解决了对于同一位受检者,不同主检医师给出的一元化诊断结果存在较大差异的技术问题。To sum up, the embodiment of the present invention obtains the disease co-occurrence pattern corresponding to each cluster from the unified diagnostic database by measuring the similarity between the diagnostic code of the subject and the primary/secondary typical disease diagnostic code set, The obtained disease co-occurrence patterns combined with diagnostic codes were visualized in the ontology structure of the ICD, and then the disease types were obtained and reordered to obtain the disease co-occurrence patterns of the subject. Aiming at the diagnosis of various diseases and positive signs of the examinee, it provides auxiliary reference for the primary examiner to quickly and automatically determine the unified diagnosis of major diseases, and according to the unified diagnosis and its ranking results, the examinee can clearly understand the main problems of their own health. with secondary issues. It solves the technical problem that for the same subject, the unified diagnosis results given by different chief inspectors are quite different.
请参阅图2~4,作为本发明的一个优选实施例,以ICD-10疾病分类体系为例,上述步骤S002中的一元化诊断库的构建方法,包括以下步骤:Please refer to FIGS. 2 to 4 , as a preferred embodiment of the present invention, taking the ICD-10 disease classification system as an example, the method for constructing a unified diagnosis library in the above step S002 includes the following steps:
步骤S201,将受检者的主检报告映射到ICD疾病体系中,用疾病诊断编码表示。In step S201, the main inspection report of the subject is mapped to the ICD disease system, which is represented by the disease diagnosis code.
具体的,在规范的主检报告中以疾病诊断编码(如ICD-10)的形式表示受检者的本次的主/次要诊断与阳性体征。通常一位受检者被标注为一系列疾病编码的集合,且编码具有一定的顺序性,越靠前的诊断编码表示受检者的主要疾病类型或者阳性特征。受检者的诊断信息定义为:Specifically, the current primary/secondary diagnosis and positive signs of the subject are expressed in the form of disease diagnosis codes (such as ICD-10) in the standard primary examination report. Usually, a subject is marked as a set of a series of disease codes, and the codes have a certain sequence, and the higher the diagnostic code is, the main disease type or positive characteristic of the subject is represented. The subject's diagnostic information is defined as:
Di={(d1,Seq(d1)),(d2,Seq(d2)),…}Di={(d 1 ,Seq(d 1 )),(d 2 ,Seq(d 2 )),…}
其中,d1表示受检者的疾病编码,Seq(d1)表示受检者的疾病编码的次序。Wherein, d 1 represents the disease code of the subject, and Seq(d 1 ) represents the order of the disease code of the subject.
步骤S202,根据疾病诊断编码,在ICD疾病分类体系的基础上,构建疾病编码本体结构。Step S202, construct a disease coding ontology structure on the basis of the ICD disease classification system according to the disease diagnosis code.
为了度量受检者诊断信息的相似度,需要在ICD疾病分类体系的基础上,构建疾病编码本体结构(树状结构)。以ICD-10为例,请参阅图5,其示出了ICD-10疾病分类体系局部结构,共包括22章(第一层level-1),262节(第二层level-2),2051个3位类目(第三层level-3),9505个4位亚目(第四层level-4),22908个6位扩展编码(第五层level-5)。In order to measure the similarity of the diagnostic information of the subjects, it is necessary to construct a disease coding ontology structure (tree structure) on the basis of the ICD disease classification system. Taking ICD-10 as an example, please refer to Figure 5, which shows the local structure of the ICD-10 disease classification system, including 22 chapters (the first level-1), 262 sections (the second level-2), 2051 There are 3-bit categories (the third level-3), 9505 4-bit subcategories (the fourth level-4), and 22908 6-bit extended codes (the fifth level-5).
步骤S203,疾病诊断编码在疾病编码本体结构中的语义关系的相似性度量。Step S203, the similarity measure of the semantic relationship of the disease diagnosis code in the disease code ontology structure.
ICD-10疾病分类体系是一类具有分类体系的层次树,同一分支下的诊断编码具有一定的相似性,本发明实施例采用基于语义关系的相似性度量方法,该相似性度量方法包括疾病诊断编码的信息量度量、疾病诊断编码间的相似性度量以及疾病诊断编码集间的相似性度量。The ICD-10 disease classification system is a hierarchical tree with a classification system. The diagnostic codes under the same branch have a certain similarity. The embodiment of the present invention adopts a similarity measurement method based on semantic relationship, and the similarity measurement method includes disease diagnosis. The measure of the information content of the codes, the measure of the similarity between the codes of the disease diagnosis, and the measure of the similarity between the sets of the codes of the disease diagnosis.
针对疾病诊断编码信息量度量方法,具体的,在ICD-10疾病分类体系中,每一个编码表示一个概念,且分类概念之间存在语义相似性,同一分支下的概念比不同分支的概念语义更相似。本发明实施例采用层次深度度量方法,即每一层赋予一个特定的数值,概念层次越深数值越大。对于一个ICD-10编码d1信息量定义为:For the measurement method of disease diagnosis coding information, specifically, in the ICD-10 disease classification system, each code represents a concept, and there is semantic similarity between classification concepts, and concepts in the same branch are more semantic than concepts in different branches. resemblance. The embodiment of the present invention adopts a hierarchical depth measurement method, that is, each layer is assigned a specific numerical value, and the deeper the conceptual level, the greater the numerical value. For an ICD- 10 code the d1 information volume is defined as:
IC(d1)=path(d1→r)IC(d 1 )=path(d 1 →r)
其中,r表示ICD-10疾病分类体系的根节点,path(·)定义为从ICD-10编码d1走到根节点r的路径长度。因此,根节点的信息量为0,第一层章的信息量为1,第二层节的信息量为2,第三层类目的信息量为3,第四层亚目的信息量为4,第五层扩展编码的信息量为5。Among them, r represents the root node of the ICD-10 disease classification system, and path(·) is defined as the path length from the ICD-10 code d 1 to the root node r. Therefore, the information volume of the root node is 0, the information volume of the first-level chapter is 1, the information volume of the second-level section is 2, the information volume of the third-level category is 3, and the information volume of the fourth-level sub-object is 4. , and the information amount of the fifth layer extended coding is 5.
针对疾病诊断编码间相似性度量方法,具体的,根据疾病诊断编码的信息量度量两个编码之间的相似性。本发明实施例采用两个语义概念的最近父节点的方法来计算相似度,对于两个编码之间的相似性定义为:For the method for measuring the similarity between the disease diagnosis codes, specifically, the similarity between the two codes is measured according to the information amount of the disease diagnosis codes. The embodiment of the present invention adopts the method of the nearest parent node of two semantic concepts to calculate the similarity, and the similarity between two codes is defined as:
其中,d1与d2为ICD-10疾病分类体系中的两个疾病诊断编码,LCA(d1,d2)定义为疾病诊断编码d1与疾病诊断编码d2的最近父节点,若d1=d2,则LCA(d1,d2)=d1=d2,IC(LCA(d1,d2))= IC(d1)=IC(d2);若d1≠d2,且LCA(d1,d2)=根节点,则IC(LCA(d1,d2))=0。Among them, d 1 and d 2 are the two disease diagnosis codes in the ICD-10 disease classification system, and LCA(d 1 , d 2 ) is defined as the nearest parent node of the disease diagnosis code d 1 and the disease diagnosis code d 2 , if d 1 =d 2 , then LCA(d 1 ,d 2 )=d 1 =d 2 , IC(LCA(d 1 ,d 2 ))=IC(d 1 )=IC(d 2 ); if d 1 ≠d 2 , and LCA(d 1 , d 2 )=root node, then IC(LCA(d 1 ,d 2 ))=0.
针对疾病诊断编码集间相似性度量方法,具体的,在健康体检报告中,受检者通常被诊断为一组疾病诊断编码的集合,通过度量两个疾病诊断编码集的相似性来表示受检者疾病诊断信息的相似性。本发明实施例采用一种考虑最相似编码成对平均值来度量编码集间的相似性,假设受检者i和受检者j的诊断信息分别定义为Di′i={di1,di2,…,dig,…}和Di′j={dj1,dj2,…,djh,…}(这里不考虑疾病诊断编码的次序),则受检者i和受检者j的诊断信息相似度定义为:For the similarity measurement method between disease diagnosis code sets, specifically, in the health examination report, the subject is usually diagnosed as a set of disease diagnosis code sets, and by measuring the similarity of the two disease diagnosis code sets to represent the subject. Similarity of disease diagnosis information among patients. In the embodiment of the present invention, a pairwise average value of the most similar codes is used to measure the similarity between code sets, and it is assumed that the diagnostic information of subject i and subject j is defined as Di′ i ={d i1 ,d i2 ,...,d ig ,...} and Di' j ={d j1 ,d j2 ,...,d jh ,...} (the order of disease diagnosis codes is not considered here), then the diagnosis of subject i and subject j Information similarity is defined as:
其中,Di′i|为受检者i疾病诊断信息中疾病诊断编码的个数,|Di′j|为受检者j疾病诊断信息中疾病诊断编码的个数,dig为受检者i的第g个疾病诊断编码,djh受检者j的第h个疾病诊断编码。Among them, Di' i | is the number of disease diagnosis codes in the disease diagnosis information of subject i, |Di' j | is the number of disease diagnosis codes in the disease diagnosis information of subject j, and dig is the number of disease diagnosis codes of subject i The gth disease diagnosis code of d jh is the hth disease diagnosis code of subject j.
通过对所有体检人群的诊断信息进行两两相似度度量,得到该体检人群的相似度矩阵 S。By measuring the pairwise similarity of the diagnostic information of all the physical examination populations, the similarity matrix S of the physical examination population is obtained.
步骤S204,根据相似度度量结果,对受检者进行聚类,获得簇CK。Step S204, according to the similarity measurement result, cluster the subjects to obtain the cluster C K .
在体检人群诊断信息相似度矩阵基础上,利用聚类算法将相似的受检者聚成一类,保证在一个簇内的受检者相似,簇外的受检者不相似。On the basis of the similarity matrix of diagnostic information of the physical examination population, a clustering algorithm is used to cluster similar subjects into a group to ensure that subjects within a cluster are similar and subjects outside a cluster are not similar.
聚类算法是一种有效的无监督学习算法,主要分为划分方法、层次方法、基于密度的方法等,最常见的为K-means、层次聚类、基于密度峰值的聚类等。本发明实施例以AP聚类(仿射传播聚类)为例,将体检人群划分为不同的簇中。Clustering algorithm is an effective unsupervised learning algorithm, which is mainly divided into division method, hierarchical method, density-based method, etc. The most common ones are K-means, hierarchical clustering, and density peak-based clustering. The embodiment of the present invention takes AP clustering (affine propagation clustering) as an example, and divides the medical examination population into different clusters.
在AP聚类中,簇的数量选择由偏好参数p值控制,本发明实施例采用偏好系数pc来控制簇的数量:In AP clustering, the selection of the number of clusters is controlled by the value of the preference parameter p. In the embodiment of the present invention, the preference coefficient p c is used to control the number of clusters:
p=mean(S)-pc·Np=mean(S)-p c ·N
其中,S为所有体检人群诊断信息的相似度矩阵,N为受检者数量。Among them, S is the similarity matrix of the diagnostic information of all physical examination populations, and N is the number of subjects.
根据受检者的疾病诊断信息,将N个受检者划分为K个簇(C1,C2,…,CK)中,每个簇的受检者人数定义为:According to the disease diagnosis information of the subjects, the N subjects are divided into K clusters (C 1 , C 2 , ..., C K ), and the number of subjects in each cluster is defined as:
其中,Ck(Di′j)表示被划分到簇CK中的受检者j,E(CK)表示簇Ck的人群代表。λ(·)为示性函数,当受检者j被划分到簇Ck中,Ck(Di′j)=E(Ck),λ(Ck(Di′j),E(Ck))=1,否则λ(Ck(Di′j),E(Ck))=0。换句话说,当受检者j被划分到簇Ck中,被赋予该簇的类标签E(CK),即用该簇的代表人群表示这个受检者被划分到这个簇中。Here, C k (Di' j ) represents the subject j classified into the cluster C K , and E(C K ) represents the population representative of the cluster C k . λ(·) is an indicative function, when subject j is divided into clusters C k , C k (Di′ j )=E(C k ), λ(C k (Di′ j ), E(C k ) ))=1, otherwise λ(C k (Di′ j ), E(C k ))=0. In other words, when subject j is divided into cluster C k , it is assigned the class label E(C K ) of this cluster, that is, the representative population of this cluster indicates that this subject is divided into this cluster.
步骤S205,获得主/次要典型疾病诊断编码集。Step S205, obtaining a primary/secondary typical disease diagnosis code set.
请参阅图3,获得主/次要典型疾病诊断编码集的方法,包括以下步骤:Referring to Figure 3, the method for obtaining the primary/secondary typical disease diagnosis code set includes the following steps:
步骤S301,在簇Ck中选择与受检者相似度大于相似度阈值的人群代表作为核心体检人群。Step S301 , in the cluster C k , the representative of the population whose similarity with the subject is greater than the similarity threshold is selected as the core physical examination population.
由于ICD-10疾病分类体系存在复杂的语义关系,不同层次间的编码之间存在明显的差异,使得简单采用人群代表的ICD-10疾病诊断编码不能完全描述所属簇的共性特征。本发明实施例采用从每个簇中定义核心区域,选择与该簇人群代表相似度大的受检者,分析其诊断信息并抽取高频繁的疾病诊断编码的方法,来表示该簇的共性特征。Due to the complex semantic relationship in the ICD-10 disease classification system, there are obvious differences between the codes at different levels, so that the ICD-10 disease diagnosis codes that simply use the population representative cannot fully describe the common characteristics of the clusters to which they belong. The embodiment of the present invention adopts a method of defining a core region from each cluster, selecting subjects with a high similarity to the representative of the cluster, analyzing their diagnostic information, and extracting high-frequent disease diagnosis codes to represent the common features of the cluster. .
定义簇Ck的核心区域,即核心体检人群的选择:Define the core area of cluster C k , that is, the selection of the core physical examination population:
Corek={Di′j|S(Di′j,E(Ck))≥τk}Core k ={Di′ j |S(Di′ j ,E(C k ))≥τ k }
其中,E(Ck)表示簇Ck的人群代表,τk为提前设定的相似度阈值。Among them, E(C k ) represents the crowd representative of cluster C k , and τ k is the similarity threshold set in advance.
步骤S302,在核心体检人群中选择出现的频率大于频率阈值的受检者的诊断编码组成典型疾病诊断编码集。Step S302, selecting the diagnostic codes of the subjects whose frequency is greater than the frequency threshold in the core physical examination population to form a typical disease diagnostic code set.
疾病诊断编码dh在簇Ck中出现的频率定义为:The frequency of disease diagnosis code d h in cluster C k is defined as:
其中,|Corek|为簇Ck核心体检人群的个数。λ(·)为示性函数,当疾病诊断编码dh属于第j位受检者的诊断编码时,λ(dh,Di′j)=1,否则λ(dh,Di′j)=0。Among them, |Core k | is the number of core physical examination populations in cluster C k . λ(·) is an indicative function. When the disease diagnosis code d h belongs to the diagnosis code of the jth subject, λ(d h , Di′ j )=1, otherwise λ(d h , Di′ j )= 0.
作为一个示例,簇Ck中有100位受检者,其中50位受检者都有高血压(dh)这种诊断编码,则返回概率为50/100=0.5。As an example, if there are 100 subjects in cluster C k , and 50 subjects have the diagnosis code of hypertension ( dh ), the return probability is 50/100=0.5.
通过设定疾病诊断编码出现频率阈值δ,定义簇Ck的典型疾病诊断编码集:By setting the frequency threshold δ of disease diagnosis codes, the typical disease diagnosis code set of cluster C k is defined:
TICDSk={dh|Frequencyk(dh)>δ}TICDS k ={d h |Frequency k (d h )>δ}
步骤S303,计算簇Ck中每种典型疾病诊断编码的平均次序。典型疾病诊断编码的平均次序越小,成为主要诊断的概率越大。Step S303, calculate the average order of diagnostic codes of each typical disease in the cluster C k . The smaller the average order of typical disease diagnosis codes, the greater the probability of being the main diagnosis.
定义簇Ck中每种典型疾病诊断编码的平均次序:Define the average order of diagnostic codes for each typical disease in cluster C k :
其中,H'为典型疾病诊断编码集的个数,为典型疾病诊断编码dh在受检者 j的诊断信息Dij中的次序。Among them, H' is the number of typical disease diagnosis code sets, The order of the typical disease diagnosis code dh in the diagnostic information Di j of subject j.
步骤S304,根据所述平均次序,对簇Ck中抽取的所述典型疾病诊断编码集进行重新排序,得到簇Ck中主要的疾病诊断类型和次要的疾病诊断类型。Step S304, according to the average order, reorder the typical disease diagnosis code sets extracted from the cluster C k to obtain the main disease diagnosis type and the secondary disease diagnosis type in the cluster C k .
通过对簇Ck中抽取的典型疾病诊断编码集进行重新排序,可以识别出该簇中主要的疾病诊断类型,次要的疾病诊断类型。By reordering the typical disease diagnosis code sets extracted from the cluster C k , the main disease diagnosis types and the secondary disease diagnosis types in the cluster can be identified.
定义排序函数Rank(·),得出具有次序特征的主/次要典型疾病诊断编码集:Define the ranking function Rank( ) to obtain the primary/secondary typical disease diagnosis code set with sequence features:
Averagek(d1)函数的返回值为正数。The return value of the Average k (d 1 ) function is a positive number.
作为一个示例,假如有A、B和C,3种疾病,相应的average分别为5.3、2.5、3.6,则经过Rank函数重新排序后,顺序依次为变为B、C、A,排序越靠前越可能是主要诊断。As an example, if there are three diseases, A, B, and C, and the corresponding averages are 5.3, 2.5, and 3.6, respectively, then after the Rank function is reordered, the order becomes B, C, and A, and the higher the order is. The more likely it is the primary diagnosis.
步骤S206,在ICD本体结构中展示簇Ck的主/次要典型疾病诊断编码集,按照同一分支下疾病诊断编码间的概念语义关系,识别最近父节点,得到新疾病类型,将新疾病类型重新排序得到典型疾病共现模式。Step S206, display the primary/secondary typical disease diagnosis code set of cluster C k in the ICD ontology structure, identify the nearest parent node according to the conceptual semantic relationship between the disease diagnosis codes under the same branch, obtain the new disease type, and assign the new disease type to the new disease type. Reordering yields typical disease co-occurrence patterns.
具体的,结合可视化分析技术,在ICD-10疾病分类体系中展示簇Ck的主/次要典型疾病诊断编码集,按照同一分支下疾病的诊断编码间的语义关系,自动识别最近父节点,定义新的疾病类型ei,即Specifically, combined with visual analysis technology, the primary/secondary typical disease diagnosis code set of cluster C k is displayed in the ICD-10 disease classification system, and the nearest parent node is automatically identified according to the semantic relationship between the diagnosis codes of diseases under the same branch. Define a new disease type e i , i.e.
ei=LCA(d1,d2,…,dm)e i =LCA(d 1 ,d 2 ,...,d m )
其中,d1、d2……dm为m个ICD-10体系中的疾病诊断编码,LCA(d1,d2,…,dm)为编码d1、 d2……dm的最近父节点。Among them, d 1 , d 2 ...... d m are the disease diagnosis codes in m ICD-10 systems, and LCA(d 1 , d 2 , ..., d m ) is the nearest code d 1 , d 2 ...... d m parent node.
因此,簇Ck的主/次要典型疾病诊断编码集转换为一系列新疾病类型的集合{e1,e2,…,em0,},并通过重新排序,得到典型疾病共现模式,即一元化诊断的初步结果。Therefore, the primary/secondary typical disease diagnosis code set of cluster C k is transformed into a set of new disease types {e 1 ,e 2 ,…,e m0 ,}, and by reordering, the typical disease co-occurrence pattern is obtained, That is, the preliminary results of a unified diagnosis.
步骤S207,接收主检医师的辅助操作,对所述典型疾病共现模式进行调整,得到一元化诊断库。Step S207: Receive an auxiliary operation of the chief examiner, adjust the typical disease co-occurrence pattern, and obtain a unified diagnosis database.
主检医师根据自身领域知识,对识别的典型疾病共现模式进行校正、归类、评估与定义等操作,得到最终的一元化诊断结果。According to their own domain knowledge, the chief examiner can correct, classify, evaluate and define the identified typical disease co-occurrence patterns, and obtain the final unified diagnosis result.
通过对所有簇的典型疾病共现模式进行总结概况,构建针对所选体检人群的典型疾病共现模式库,即一元化诊断库。By summarizing the typical disease co-occurrence patterns of all clusters, a typical disease co-occurrence pattern library for the selected physical examination population, that is, a unified diagnosis library is constructed.
下面以电子病历临床诊断数据为例,验证实验结果的可行性。The following takes the clinical diagnosis data of electronic medical records as an example to verify the feasibility of the experimental results.
选择公开的MIMIC-III数据集中4418例尿毒症患者的诊断信息,该诊断信息以ICD-9 编码的形式表示(纯数字表示,无字母),通过诊断信息相似性度量、AP聚类、主/次要典型疾病诊断编码集抽取,最终确定3个簇,其中典型疾病诊断编码集如图6所示。The diagnostic information of 4418 patients with uremia in the publicly available MIMIC-III dataset was selected, and the diagnostic information was represented in the form of ICD-9 codes (pure numbers, no letters), through the diagnostic information similarity measure, AP clustering, main/ The secondary typical disease diagnosis code set is extracted, and finally three clusters are determined, of which the typical disease diagnosis code set is shown in Figure 6.
以簇3中的患者诊断信息为例,将典型疾病诊断编码集及其次序在ICD-9本体结构中进行可视化分析,如图7所示,其中一元化诊断为泌尿生殖系统疾病、循环系统疾病、以及急性呼吸衰竭,次要诊断(并发症)为脓毒症、败血症、以及贫血等。Taking the patient diagnosis information in
对于新的入院患者,其诊断编码分别为519.09、518.81、491.21、38.9、995.92、785.52、 584.9、482.1、427.31、519.19,计算该患者的诊断信息与3个簇的典型疾病诊断编码集的相似性度量结果,发现与簇3的相似度最大,将簇3的一元化诊断推荐给该患者,因此该患者的一元化诊断结果是簇3诊断结果与本次检查诊断结果的融合,即主要诊断为呼吸系统疾病、泌尿系统疾病、以及循环系统疾病,次要诊断(并发症)为脓毒症、败血症、以及贫血等如图8所示。For newly admitted patients, their diagnostic codes are 519.09, 518.81, 491.21, 38.9, 995.92, 785.52, 584.9, 482.1, 427.31, 519.19, and the similarity between the patient's diagnostic information and the typical disease diagnostic code set of 3 clusters is calculated. The measurement result shows that the similarity with
请参阅图9,基于与上述方法实施例相同的发明构思,本发明另一个实施例还提供了一种一元化诊断的推荐装置,所述装置包括相似性度量模块901、匹配检测模块902和推荐模块903。Referring to FIG. 9, based on the same inventive concept as the above method embodiments, another embodiment of the present invention further provides a recommendation apparatus for unified diagnosis, the apparatus includes a
具体的,相似性度量模块901用于根据受检者的诊断编码,确定所述诊断编码与簇Ck中的主/次要典型疾病诊断编码集的相似性度量结果;所述簇Ck是根据N个受检者诊断信息之间的相似性划分的K个簇。匹配检测模块902用于选择大于预设阈值的相似度值所属的簇,从一元化诊断库中获取每个簇所对应的疾病共现模式。推荐模块903用于将所获得的疾病共现模式结合所述诊断编码在ICD本体结构进行可视化分析,按照同一分支下所述诊断编码间的语义关系,识别最近父节点,得到疾病类型并重新排序得到该受检者的疾病共现模式。Specifically, the
优选的,该装置还包括一元化诊断库构建模块,所述一元化诊断库构建模块包括典型疾病共现模式获取模块和一元化诊断库生成模块。具体的,典型疾病共现模式获取模块用于在ICD本体结构中展示所述主/次要典型疾病诊断编码集,按照同一分支下疾病诊断编码间的语义关系,识别最近父节点,得到新疾病类型,将新疾病类型重新排序得到典型疾病共现模式。一元化诊断库生成模块用于接收主检医师的辅助操作,对所述典型疾病共现模式进行调整,得到一元化诊断库。Preferably, the device further includes a unified diagnosis library building module, and the unified diagnosis library building module includes a typical disease co-occurrence pattern acquisition module and a unified diagnosis library generation module. Specifically, the typical disease co-occurrence pattern acquisition module is used to display the primary/secondary typical disease diagnosis code set in the ICD ontology structure, identify the nearest parent node according to the semantic relationship between the disease diagnosis codes under the same branch, and obtain a new disease Types, reordering new disease types to obtain typical disease co-occurrence patterns. The unified diagnosis library generation module is used for receiving the auxiliary operation of the chief examiner, and adjusting the typical disease co-occurrence pattern to obtain the unified diagnosis library.
优选的,该装置还包括主/次要典型疾病诊断编码集获取模块,主/次要典型疾病诊断编码集获取模块包括核心体检人群确定模块、典型疾病诊断编码集生成模块、平均次序计算模块和获取模块。具体的,核心体检人群确定模块用于在簇Ck中选择与受检者相似度大于相似度阈值的人群代表作为核心体检人群。典型疾病诊断编码集生成模块用于在所述核心体检人群中选择出现的频率大于频率阈值的受检者的诊断编码组成典型疾病诊断编码集。平均次序计算模块用于计算簇Ck中每种典型疾病诊断编码的平均次序。获取模块用于根据所述平均次序,对簇Ck中抽取的所述典型疾病诊断编码集进行重新排序,得到簇Ck中主要的疾病诊断类型和次要的疾病诊断类型。Preferably, the device further comprises a primary/secondary typical disease diagnosis code set acquisition module, the primary/secondary typical disease diagnosis code set acquisition module includes a core physical examination population determination module, a typical disease diagnosis code set generation module, an average order calculation module and Get the module. Specifically, the core physical examination population determination module is used to select the representative of the population whose similarity with the subject is greater than the similarity threshold in the cluster C k as the core physical examination population. The typical disease diagnosis code set generation module is used to select the diagnosis codes of the subjects whose frequency of occurrence is greater than the frequency threshold in the core physical examination population to form a typical disease diagnosis code set. The mean order calculation module is used to calculate the mean order of diagnostic codes for each typical disease in cluster C k . The obtaining module is configured to reorder the typical disease diagnosis code sets extracted from the cluster C k according to the average order, so as to obtain the main disease diagnosis type and the secondary disease diagnosis type in the cluster C k .
优选的,该装置还包括相似性度量模块,所述相似性度量模块包括疾病诊断编码的信息量度量模块、疾病诊断编码间的相似性度量模块以及疾病诊断编码集间的相似性度量模块。Preferably, the device further includes a similarity measurement module, the similarity measurement module includes an information amount measurement module for disease diagnosis codes, a similarity measurement module between disease diagnosis codes, and a similarity measurement module between disease diagnosis code sets.
请参阅图10,图10示出了上述实施例中所涉及的电子设备的一种可能的结构示意图。该电子设备可以包括:处理单元1001、存储单元1002和通信单元1003。处理单元1001 可以设置为与存储单元1002通信。存储单元1002用于存储处理单元1001可执行指令和/ 或程序代码等,其中,处理单元被配置为执行上述任意一个方法实施例所提供的一种一元化诊断的推荐方法。该通信单元1003用于支持该电子设备与其他网络实体的通信,以实现数据交互等功能,如该通信模块1003支持电子设备与其他智能终端的通信,以实现数据交互功能。Please refer to FIG. 10. FIG. 10 shows a possible schematic structural diagram of the electronic device involved in the above embodiment. The electronic device may include: a
其中,处理单元1001可以是处理器或控制器。通信模块1003可以是收发器、RF电路或通信接口等。存储单元1002可以是存储器。The
图10仅仅是本申请实施例的一种可能的实现方式,在实际应用中,该电子设备还可以包括更多或更少的部件,这里不作限制。FIG. 10 is only a possible implementation manner of the embodiment of the present application. In practical applications, the electronic device may further include more or less components, which is not limited here.
需要说明的是,该电子设备可以是服务器,也可以是智能终端,该智能终端可以是计算机、平板电脑或者智能手机等。It should be noted that the electronic device may be a server or an intelligent terminal, and the intelligent terminal may be a computer, a tablet computer, a smart phone, or the like.
本发明实施例还提供了一种存储介质,该存储介质中存储有计算机可读的程序指令,所述程序指令被处理单元执行时实现上述任意一个实施例中所提供的一种一元化诊断的推荐方法。例如,该计算机可读存储介质可以是只读存储器(Read-Only Memory,ROM)、随机存取存储器(Random Access Memory,RAM)、只读光盘(Compact Disc Read-OnlyMemory,CD-ROM)、磁带、软盘和光数据存储设备等。Embodiments of the present invention further provide a storage medium, where computer-readable program instructions are stored in the storage medium, and when the program instructions are executed by the processing unit, a unified diagnosis recommendation provided in any one of the foregoing embodiments is implemented. method. For example, the computer-readable storage medium may be Read-Only Memory (ROM), Random Access Memory (RAM), Compact Disc Read-Only Memory (CD-ROM), magnetic tape , floppy disks and optical data storage devices.
需要说明的是:上述本发明实施例先后顺序仅仅为了描述,不代表实施例的优劣。且上述对本说明书特定实施例进行了描述。其它实施例在所附权利要求书的范围内。在一些情况下,在权利要求书中记载的动作或步骤可以按照不同于实施例中的顺序来执行并且仍然可以实现期望的结果。另外,在附图中描绘的过程不一定要求示出的特定顺序或者连续顺序才能实现期望的结果。在某些实施方式中,多任务处理和并行处理也是可以的或者可能是有利的。It should be noted that: the above-mentioned order of the embodiments of the present invention is only for description, and does not represent the advantages and disadvantages of the embodiments. And the foregoing describes specific embodiments of the present specification. Other embodiments are within the scope of the appended claims. In some cases, the actions or steps recited in the claims can be performed in an order different from that in the embodiments and still achieve desirable results. Additionally, the processes depicted in the figures do not necessarily require the particular order shown, or sequential order, to achieve desirable results. In some embodiments, multitasking and parallel processing are also possible or may be advantageous.
本说明书中的各个实施例均采用递进的方式描述,各个实施例之间相同相似的部分互相参见即可,每个实施例重点说明的都是与其他实施例的不同之处。尤其,对于装置和服务器实施例而言,由于其基本相似于方法实施例,所以描述的比较简单,相关之处参见方法实施例的部分说明即可。Each embodiment in this specification is described in a progressive manner, and the same and similar parts between the various embodiments may be referred to each other, and each embodiment focuses on the differences from other embodiments. In particular, for the apparatus and server embodiments, since they are basically similar to the method embodiments, the description is relatively simple, and for related parts, please refer to the partial descriptions of the method embodiments.
本领域普通技术人员可以理解实现上述实施例的全部或部分步骤可以通过硬件来完成,也可以通过程序来指令相关的硬件完成,所述的程序可以存储于一种计算机可读存储介质中,上述提到的存储介质可以是只读存储器,磁盘或光盘等。Those of ordinary skill in the art can understand that all or part of the steps of implementing the above embodiments can be completed by hardware, or can be completed by instructing relevant hardware through a program, and the program can be stored in a computer-readable storage medium. The storage medium mentioned may be a read-only memory, a magnetic disk or an optical disk, etc.
以上所述仅为本发明的较佳实施例,并不用以限制本发明,凡在本发明的精神和原则之内,所作的任何修改、等同替换、改进等,均应包含在本发明的保护范围之内。The above are only preferred embodiments of the present invention and are not intended to limit the present invention. Any modifications, equivalent replacements, improvements, etc. made within the spirit and principles of the present invention shall be included in the protection of the present invention. within the range.
Claims (10)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202010409050.0A CN111599427B (en) | 2020-05-14 | 2020-05-14 | Recommendation method and device for unified diagnosis, electronic equipment and storage medium |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202010409050.0A CN111599427B (en) | 2020-05-14 | 2020-05-14 | Recommendation method and device for unified diagnosis, electronic equipment and storage medium |
Publications (2)
Publication Number | Publication Date |
---|---|
CN111599427A true CN111599427A (en) | 2020-08-28 |
CN111599427B CN111599427B (en) | 2023-03-31 |
Family
ID=72192212
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202010409050.0A Active CN111599427B (en) | 2020-05-14 | 2020-05-14 | Recommendation method and device for unified diagnosis, electronic equipment and storage medium |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN111599427B (en) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN112017788A (en) * | 2020-09-07 | 2020-12-01 | 平安科技(深圳)有限公司 | Disease sorting method, device, equipment and medium based on reinforcement learning model |
Citations (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
TWM478423U (en) * | 2013-11-29 | 2014-05-21 | Univ Southern Taiwan Sci & Tec | Patient disease diagnosis and exploration system |
CN104298344A (en) * | 2013-07-16 | 2015-01-21 | 精工爱普生株式会社 | Information processing apparatus, information processing method, and information processing system |
CN106934235A (en) * | 2017-03-09 | 2017-07-07 | 中国科学院软件研究所 | Patient's similarity measurement migratory system between a kind of disease areas based on transfer learning |
CN106951684A (en) * | 2017-02-28 | 2017-07-14 | 北京大学 | A kind of method of entity disambiguation in medical conditions idagnostic logout |
US20170228507A1 (en) * | 2014-08-08 | 2017-08-10 | Icahn School Of Medicine At Mount Sinai | Automatic disease diagnoses using longitudinal medical record data |
CN108154928A (en) * | 2017-12-27 | 2018-06-12 | 北京嘉和美康信息技术有限公司 | A kind of methods for the diagnosis of diseases and device |
WO2019132685A1 (en) * | 2017-12-29 | 2019-07-04 | Общество С Ограниченной Ответственностью "Интеллоджик" | Method and system for supporting medical decision making |
CN110299209A (en) * | 2019-06-25 | 2019-10-01 | 北京百度网讯科技有限公司 | Similar case history lookup method, device, equipment and readable storage medium storing program for executing |
CN110660459A (en) * | 2019-08-30 | 2020-01-07 | 腾讯科技(深圳)有限公司 | Method, device, server and storage medium for controlling medical record quality |
-
2020
- 2020-05-14 CN CN202010409050.0A patent/CN111599427B/en active Active
Patent Citations (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN104298344A (en) * | 2013-07-16 | 2015-01-21 | 精工爱普生株式会社 | Information processing apparatus, information processing method, and information processing system |
TWM478423U (en) * | 2013-11-29 | 2014-05-21 | Univ Southern Taiwan Sci & Tec | Patient disease diagnosis and exploration system |
US20170228507A1 (en) * | 2014-08-08 | 2017-08-10 | Icahn School Of Medicine At Mount Sinai | Automatic disease diagnoses using longitudinal medical record data |
CN106951684A (en) * | 2017-02-28 | 2017-07-14 | 北京大学 | A kind of method of entity disambiguation in medical conditions idagnostic logout |
CN106934235A (en) * | 2017-03-09 | 2017-07-07 | 中国科学院软件研究所 | Patient's similarity measurement migratory system between a kind of disease areas based on transfer learning |
CN108154928A (en) * | 2017-12-27 | 2018-06-12 | 北京嘉和美康信息技术有限公司 | A kind of methods for the diagnosis of diseases and device |
WO2019132685A1 (en) * | 2017-12-29 | 2019-07-04 | Общество С Ограниченной Ответственностью "Интеллоджик" | Method and system for supporting medical decision making |
CN110299209A (en) * | 2019-06-25 | 2019-10-01 | 北京百度网讯科技有限公司 | Similar case history lookup method, device, equipment and readable storage medium storing program for executing |
CN110660459A (en) * | 2019-08-30 | 2020-01-07 | 腾讯科技(深圳)有限公司 | Method, device, server and storage medium for controlling medical record quality |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN112017788A (en) * | 2020-09-07 | 2020-12-01 | 平安科技(深圳)有限公司 | Disease sorting method, device, equipment and medium based on reinforcement learning model |
CN112017788B (en) * | 2020-09-07 | 2023-07-04 | 平安科技(深圳)有限公司 | Disease ordering method, device, equipment and medium based on reinforcement learning model |
Also Published As
Publication number | Publication date |
---|---|
CN111599427B (en) | 2023-03-31 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US10929420B2 (en) | Structured report data from a medical text report | |
CN108648827B (en) | Cardiovascular and cerebrovascular disease risk prediction method and device | |
CN111933281B (en) | Disease typing determination system, method, device and storage medium | |
CN111512322B (en) | Using neural networks | |
Dack et al. | Artificial intelligence and interstitial lung disease: diagnosis and prognosis | |
CN112633601B (en) | Method, device, equipment and computer medium for predicting disease event occurrence probability | |
CN112151130B (en) | Decision support system based on literature retrieval and construction method | |
CN111243753B (en) | Multi-factor correlation interactive analysis method for medical data | |
JP2020518050A (en) | Learning and applying contextual similarity between entities | |
CN112967803A (en) | Early mortality prediction method and system for emergency patients based on integrated model | |
CN116884631B (en) | Comprehensive liver failure prediction and treatment reference system based on AI and similar patient analysis | |
CN113128654A (en) | Improved random forest model for coronary heart disease pre-diagnosis and pre-diagnosis system thereof | |
CN110610766A (en) | Apparatus and storage medium for deriving probability of disease based on symptom feature weight | |
CN111581969B (en) | Medical term vector representation method, device, storage medium and electronic equipment | |
Zhu et al. | Severity detection of COVID-19 infection with machine learning of clinical records and CT images | |
CN111640517A (en) | Medical record encoding method and device, storage medium and electronic equipment | |
CN111599427B (en) | Recommendation method and device for unified diagnosis, electronic equipment and storage medium | |
Zhang et al. | Development of automated neural network prediction for echocardiographic left ventricular ejection fraction | |
CN110957046A (en) | Medical health case knowledge matching method and system | |
CN110033862B (en) | Traditional Chinese medicine quantitative diagnosis system based on weighted directed graph and storage medium | |
Lin et al. | Automatic mammographic breast density classification in Chinese women: clinical validation of a deep learning model | |
CN114201613A (en) | Test question generation method, test question generation device, electronic device, and storage medium | |
CN111445993A (en) | Disease hierarchical relation processing method and device and electronic equipment | |
CN119324064B (en) | Method, device, electronic device and storage medium for determining severity of coronary artery stenosis | |
CN116052889B (en) | A sFLC prediction system based on blood routine index detection |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |