CN110874409A - Disease grading prediction system, method, electronic device and readable storage medium - Google Patents

Disease grading prediction system, method, electronic device and readable storage medium Download PDF

Info

Publication number
CN110874409A
CN110874409A CN201911007766.1A CN201911007766A CN110874409A CN 110874409 A CN110874409 A CN 110874409A CN 201911007766 A CN201911007766 A CN 201911007766A CN 110874409 A CN110874409 A CN 110874409A
Authority
CN
China
Prior art keywords
text
medical record
electronic medical
model
record information
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201911007766.1A
Other languages
Chinese (zh)
Inventor
陈挺
王光宇
刘晓鸿
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Tsinghua University
Original Assignee
Tsinghua University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Tsinghua University filed Critical Tsinghua University
Priority to CN201911007766.1A priority Critical patent/CN110874409A/en
Publication of CN110874409A publication Critical patent/CN110874409A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/35Clustering; Classification
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33Querying
    • G06F16/335Filtering based on additional data, e.g. user or group profiles
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/34Browsing; Visualisation therefor
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00Administration; Management
    • G06Q10/04Forecasting or optimisation specially adapted for administrative or management purposes, e.g. linear programming or "cutting stock problem"
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16HHEALTHCARE INFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR THE HANDLING OR PROCESSING OF MEDICAL OR HEALTHCARE DATA
    • G16H10/00ICT specially adapted for the handling or processing of patient-related medical or healthcare data
    • G16H10/60ICT specially adapted for the handling or processing of patient-related medical or healthcare data for patient-specific data, e.g. for electronic patient records

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Business, Economics & Management (AREA)
  • Data Mining & Analysis (AREA)
  • Economics (AREA)
  • Human Resources & Organizations (AREA)
  • Strategic Management (AREA)
  • Entrepreneurship & Innovation (AREA)
  • Primary Health Care (AREA)
  • General Business, Economics & Management (AREA)
  • Quality & Reliability (AREA)
  • Operations Research (AREA)
  • Health & Medical Sciences (AREA)
  • Epidemiology (AREA)
  • General Health & Medical Sciences (AREA)
  • Medical Informatics (AREA)
  • Tourism & Hospitality (AREA)
  • Public Health (AREA)
  • Marketing (AREA)
  • Game Theory and Decision Science (AREA)
  • Development Economics (AREA)
  • Computational Linguistics (AREA)
  • Medical Treatment And Welfare Office Work (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

本申请公开了一种基于电子病历的病情分级预测系统、方法、电子设备及存储介质。所述系统包括:通过第一存储模块存储电子病历信息,并通过第一信息过滤器读取电子病历信息并对电子病历信息进行过滤,针对结构化和文本的多模态数据,通过基于深度学习的分级预测器并结合注意力机制,采用分级预测模型对结构化数据和文本数据进行处理,得到文本特征挖掘信息、病情分级预测结果以及可视化分析文本。通过采用本申请分级预测系统,能够较为快速的对电子病历信息进行处理并得到病情分级预测结果,获得的病情分级预测结果、文本特征挖掘信息和可视化分析文本可作为医生的参考,协助医生分级以提高分级速度和分级准确度。

Figure 201911007766

The present application discloses a disease classification prediction system, method, electronic device and storage medium based on electronic medical records. The system includes: storing electronic medical record information through a first storage module, reading the electronic medical record information through a first information filter and filtering the electronic medical record information, for structured and textual multimodal data, through deep learning-based The hierarchical predictor combined with the attention mechanism uses the hierarchical prediction model to process the structured data and text data to obtain text feature mining information, disease classification prediction results and visual analysis text. By using the classification prediction system of the present application, the electronic medical record information can be processed relatively quickly and the disease classification prediction results can be obtained. Improve grading speed and grading accuracy.

Figure 201911007766

Description

病情分级预测系统、方法、电子设备及可读存储介质Disease grading prediction system, method, electronic device and readable storage medium

技术领域technical field

本发明涉及信息处理技术领域,特别是涉及一种病情分级预测系统、方法、电子设备及可读存储介质。The present invention relates to the technical field of information processing, and in particular, to a disease classification prediction system, method, electronic device and readable storage medium.

背景技术Background technique

分诊作为急诊患者就诊的第一道关口,是影响急诊科拥堵的最重要因素。根据患者病情危重程度分级就诊,有助于充分利用急诊资源,维持急诊患者就诊秩序,缩短危重患者候诊时间,提高工作效率,防止因分诊不足或分诊过度所导致急诊资源提前耗尽。Triage, as the first gateway for emergency patients to see a doctor, is the most important factor affecting emergency department congestion. According to the severity of the patient's condition, it is helpful to make full use of emergency resources, maintain the order of emergency patients, shorten the waiting time of critically ill patients, improve work efficiency, and prevent emergency resources from being exhausted in advance due to insufficient triage or excessive triage.

国外先进的分诊标准均采用病情分级,且在特定的医疗环境和社会背景下产生,而国内与国外的医疗保险制度、急诊模式和就医方式有着较大的区别,故无法照搬或套用他们的预检分诊标准,因此需要建立一套既与国际接轨又符合我国国情的简便、有效、科学的利于病情分级的急诊预检分诊标准。The advanced triage standards in foreign countries all adopt disease grading and are generated in a specific medical environment and social background. However, domestic and foreign medical insurance systems, emergency models and medical treatment methods are quite different, so it is impossible to copy or apply theirs. Therefore, it is necessary to establish a set of simple, effective and scientific emergency pre-examination and triage standards that are in line with international standards and in line with my country's national conditions.

我国现阶段主要采用经验分诊模式,这对医生的水平有较高的要求,而且分诊速度较慢,而随着急诊患者的数量剧增,容易出现分诊不足或分诊过度所导致急诊资源提前耗尽的情况。At present, my country mainly adopts the empirical triage model, which has higher requirements on the doctor's level, and the triage speed is relatively slow. With the sharp increase in the number of emergency patients, it is easy to cause insufficient or excessive triage, resulting in emergency department. A situation where resources are exhausted prematurely.

发明内容SUMMARY OF THE INVENTION

鉴于上述问题,提出了本发明实施例以便提供一种克服上述问题或者至少部分地解决上述问题的一种病情分级预测系统、方法、电子设备及可读存储介质。In view of the above problems, embodiments of the present invention are proposed to provide a disease classification prediction system, method, electronic device, and readable storage medium that overcome the above problems or at least partially solve the above problems.

第一方面,本申请实施例提供了一种基于电子病历的病情分级预测系统,所述系统包括:In a first aspect, an embodiment of the present application provides a disease classification prediction system based on an electronic medical record, the system comprising:

第一存储模块,用于存储电子病历信息,所述电子病历信息包括客观指标、现病史和体格检查结果;a first storage module for storing electronic medical record information, the electronic medical record information including objective indicators, current illness history and physical examination results;

第一信息过滤器,与所述第一存储模块连接,用于从所述第一存储模块中读取所述电子病历信息并对所述电子病历信息进行过滤,得到过滤后的电子病历信息,包括结构化数据和文本数据;a first information filter, connected to the first storage module, for reading the electronic medical record information from the first storage module and filtering the electronic medical record information to obtain filtered electronic medical record information, including structured data and textual data;

分级预测器,用于将所述结构化数据和所述文本数据分别输入急诊病情分级预测模型,得到文本特征挖掘信息、病情分级预测结果以及可视化分析文本。The classification predictor is used for inputting the structured data and the text data into the emergency disease classification prediction model respectively to obtain text feature mining information, disease classification prediction results and visual analysis text.

可选地,所述急诊病情分级预测模型包括表示模型和融合模型,所述分级预测器包括:Optionally, the emergency condition classification prediction model includes a representation model and a fusion model, and the classification predictor includes:

第一表示模块,用于将所述文本数据输入所述表示模型,得到所述可视化分析文本、所述文本特征挖掘信息和文本特征向量;a first representation module, configured to input the text data into the representation model to obtain the visual analysis text, the text feature mining information and the text feature vector;

第二表示模块,用于将所述结构化数据输入所述表示模型,得到结构化特征向量;a second representation module, for inputting the structured data into the representation model to obtain a structured feature vector;

融合模块,用于将所述结构化特征向量和所述文本特征向量输入所述融合模型,得到所述病情分级预测结果。The fusion module is used for inputting the structured feature vector and the text feature vector into the fusion model to obtain the disease classification prediction result.

可选地,所述第一表示模块包括:Optionally, the first presentation module includes:

映射子模块,用于将所述文本数据输入所述表示模型,将文本数据的每一个汉字映射到一个向量,并提取所述文本数据嵌入的上下文信息,得到嵌入向量;A mapping submodule for inputting the text data into the representation model, mapping each Chinese character of the text data to a vector, and extracting the context information embedded in the text data to obtain an embedded vector;

注意力子模块,用于对所述嵌入向量进行注意力处理,得到所述可视化分析文本和文本特征向量;An attention submodule, for performing attention processing on the embedded vector to obtain the visual analysis text and text feature vector;

提取子模块,用于对所述可视化分析文本进行短语提取,得到所述文本特征挖掘信息。The extraction submodule is used to extract phrases from the visual analysis text to obtain the text feature mining information.

可选地,所述系统还包括:Optionally, the system further includes:

第二存储模块,用于存储多份电子病历信息样本;The second storage module is used to store a plurality of electronic medical record information samples;

第二信息过滤器,与所述第二存储模块连接,用于从所述第二存储模块中读取所述多份电子病历信息样本并对所述多份电子病历信息样本进行过滤,得到多份过滤后的病历信息样本,包括多份结构化数据样本和文本数据样本;The second information filter is connected to the second storage module, and is used for reading the multiple electronic medical record information samples from the second storage module and filtering the multiple electronic medical record information samples to obtain multiple electronic medical record information samples. Filtered medical record information samples, including multiple structured data samples and text data samples;

模型训练器,用于将所述多份结构化数据样本和文本数据样本分为训练集、验证集和测试集,对预设模型进行多轮训练,直至训练后的预设模型的评估得分在预设轮数内均不再上升,停止训练,将评估得分最高所对应的模型确定为所述急诊病情分级预测模型。A model trainer is used to divide the multiple structured data samples and text data samples into a training set, a verification set and a test set, and perform multiple rounds of training on the preset model until the evaluation score of the trained preset model is in the If there is no increase in the preset number of rounds, the training is stopped, and the model corresponding to the highest evaluation score is determined as the emergency disease classification prediction model.

第二方面,本申请实施例还提供了一种基于电子病历的病情分级预测方法,所述方法包括:In a second aspect, the embodiments of the present application further provide a method for predicting disease classification based on electronic medical records, the method comprising:

存储电子病历信息,所述电子病历信息包括客观指标、现病史和体格检查结果;storing electronic medical record information, the electronic medical record information including objective indicators, current illness history and physical examination results;

读取所述电子病历信息并对所述电子病历信息进行过滤,得到过滤后的电子病历信息,包括结构化数据和文本数据;reading the electronic medical record information and filtering the electronic medical record information to obtain filtered electronic medical record information, including structured data and text data;

将所述结构化数据和所述文本数据分别输入急诊病情分级预测模型,得到文本特征挖掘信息、病情分级预测结果以及可视化分析文本。The structured data and the text data are respectively input into the emergency condition classification prediction model to obtain text feature mining information, disease classification prediction results and visual analysis text.

可选地,所述急诊病情分级预测模型包括表示模型和融合模型,将所述结构化数据和所述文本数据分别输入急诊病情分级预测模型,得到文本特征挖掘信息、病情分级预测结果以及可视化分析文本,包括:Optionally, the emergency disease classification prediction model includes a representation model and a fusion model, and the structured data and the text data are respectively input into the emergency disease classification prediction model to obtain text feature mining information, disease classification prediction results and visual analysis. text, including:

将所述文本数据输入所述表示模型,得到所述可视化分析文本、所述文本特征挖掘信息和文本特征向量;Inputting the text data into the representation model to obtain the visual analysis text, the text feature mining information and the text feature vector;

将所述结构化数据输入所述表示模型,得到结构化特征向量;Inputting the structured data into the representation model to obtain a structured feature vector;

将所述结构化特征向量和所述文本特征向量输入所述融合模型,得到所述病情分级预测结果。Inputting the structured feature vector and the text feature vector into the fusion model to obtain the disease classification prediction result.

可选地,将所述文本数据输入表示模型,得到所述可视化分析文本、所述文本特征挖掘信息和文本特征向量,包括:Optionally, inputting the text data into a representation model to obtain the visual analysis text, the text feature mining information and the text feature vector, including:

将所述文本数据输入所述表示模型,将文本数据的每一个汉字映射到一个向量,并提取所述文本数据嵌入的上下文信息,得到嵌入向量;Inputting the text data into the representation model, mapping each Chinese character of the text data to a vector, and extracting the context information embedded in the text data to obtain an embedded vector;

对所述嵌入向量进行注意力处理,得到所述可视化分析文本和文本特征向量;Perform attention processing on the embedded vector to obtain the visual analysis text and text feature vector;

对所述可视化分析文本进行短语提取,得到所述文本特征挖掘信息。Phrase extraction is performed on the visual analysis text to obtain the text feature mining information.

可选地,所述方法还包括:Optionally, the method further includes:

存储多份电子病历信息样本;Store multiple electronic medical record information samples;

读取所述多份电子病历信息样本并对所述多份电子病历信息样本进行过滤,得到多份过滤后的电子病历信息样本,包括多份结构化数据样本和文本数据样本;reading the plurality of electronic medical record information samples and filtering the plurality of electronic medical record information samples to obtain a plurality of filtered electronic medical record information samples, including a plurality of structured data samples and text data samples;

将所述多份结构化数据样本和文本数据样本分为训练集、验证集和测试集,对预设模型进行多轮训练,直至训练后的预设模型的评估得分在预设轮数内均不再上升,停止训练,将评估得分最高所对应的模型确定为所述急诊病情分级预测模型。The multiple structured data samples and text data samples are divided into training sets, verification sets and test sets, and the preset model is trained for multiple rounds until the evaluation score of the trained preset model is equal to the preset number of rounds. If it stops rising, the training is stopped, and the model corresponding to the highest evaluation score is determined as the emergency condition classification prediction model.

第三方面,本申请实施例还提供了一种电子设备,包括:存储器、处理器及存储在所述存储器上并可在所述处理器上运行的计算机程序,所述计算机程序被所述处理器执行时,实现如第二方面所述的基于电子病历的病情分级预测方法的步骤。In a third aspect, an embodiment of the present application further provides an electronic device, including: a memory, a processor, and a computer program stored on the memory and executable on the processor, the computer program being processed by the processor When the device is executed, the steps of the method for predicting disease classification based on electronic medical records as described in the second aspect are implemented.

第四方面,本申请实施例还提供了一种计算机可读存储介质,所述计算机可读存储介质上存储有计算机程序,所述计算机程序被处理器执行时,实现如第二方面所述的基于电子病历的病情分级预测方法的步骤。In a fourth aspect, an embodiment of the present application further provides a computer-readable storage medium, where a computer program is stored on the computer-readable storage medium, and when the computer program is executed by a processor, the computer program described in the second aspect is implemented. Steps of a disease classification prediction method based on electronic medical records.

本发明实施例包括以下优点:The embodiments of the present invention include the following advantages:

在本发明实施例中,通过第一存储模块存储电子病历信息,并通过第一信息过滤器读取电子病历信息并对电子病历信息进行过滤,得到结构化数据和文本数据,通过分级预测器,采用分级预测模型对结构化数据和文本数据进行处理,得到文本特征挖掘信息、病情分级预测结果以及可视化分析文本。通过采用本申请分级预测系统,能够较为快速的对电子病历信息进行处理并得到病情分级预测结果,其中使用的分级预测模型具有较快的分级速度和较高的准确度,获得的病情分级预测结果、文本特征挖掘信息和可视化分析文本可作为医生的参考,协助医生分级以提高分级速度和分级准确度,使有限的急诊资源的到充分的利用。In the embodiment of the present invention, the electronic medical record information is stored by the first storage module, and the electronic medical record information is read and filtered by the first information filter to obtain structured data and text data. The structured data and text data are processed by the hierarchical prediction model, and the text feature mining information, the disease classification prediction results and the visual analysis text are obtained. By using the classification prediction system of the present application, the electronic medical record information can be processed relatively quickly and the disease classification prediction result can be obtained. The classification prediction model used has a faster classification speed and higher accuracy, and the obtained disease classification prediction result , Text feature mining information and visual analysis text can be used as a reference for doctors to assist doctors in grading to improve grading speed and grading accuracy, so that limited emergency resources can be fully utilized.

附图说明Description of drawings

图1是本发明的一种基于电子病历的病情分级预测系统的结构框图;Fig. 1 is a structural block diagram of a disease classification prediction system based on electronic medical records of the present invention;

图2是本发明的一种分级预测器的结构框图;Fig. 2 is the structural block diagram of a kind of hierarchical predictor of the present invention;

图3是本发明的另一种基于电子病历的病情分级预测系统的结构框图;Fig. 3 is the structural block diagram of another kind of disease grading prediction system based on electronic medical record of the present invention;

图4是本发明的一种基于电子病历的病情分级预测方法的步骤流程图。FIG. 4 is a flow chart of steps of a method for predicting disease classification based on electronic medical records of the present invention.

具体实施方式Detailed ways

为使本发明的上述目的、特征和优点能够更加明显易懂,下面结合附图和具体实施方式对本发明作进一步详细的说明。In order to make the above objects, features and advantages of the present invention more clearly understood, the present invention will be described in further detail below with reference to the accompanying drawings and specific embodiments.

请参考图1,图1示出了本申请实施例的一种基于电子病历的病情分级预测系统的结构框图,如图1所示,所述系统包括以下结构:Please refer to FIG. 1. FIG. 1 shows a structural block diagram of an electronic medical record-based disease classification prediction system according to an embodiment of the present application. As shown in FIG. 1, the system includes the following structures:

第一存储模块101,用于存储电子病历信息,所述电子病历信息包括客观指标、现病史和体格检查结果。The first storage module 101 is configured to store electronic medical record information, where the electronic medical record information includes objective indicators, current illness history and physical examination results.

在本实施方式中,在第一存储模块存储电子病历信息之前,医护人员已经通过各种医疗手段获取了病人的电子病历信息,包括客观指标、现病史和体格检查结果,具体地,客观指标可包括呼吸率、收缩压、舒张压、脉搏、体温、血氧饱和度、年龄和性别,现病史可包括外伤、意识丧失、吞咽困难、呕吐、肢体肿痛、癫痫等,体格检查可包括精神状态、反射功能、肢体张力、呼吸音、心律、压痛、反跳痛等。通过医护人员将客观指标、现病史和体格检查结果输入本基于电子病历的病情分级预测,并由第一存储模块进行存储。In this embodiment, before the electronic medical record information is stored in the first storage module, the medical staff has obtained the electronic medical record information of the patient through various medical means, including objective indicators, current illness history and physical examination results. Specifically, the objective indicators can be Including respiratory rate, systolic blood pressure, diastolic blood pressure, pulse, body temperature, blood oxygen saturation, age and gender, history of present illness may include trauma, loss of consciousness, dysphagia, vomiting, limb swelling and pain, epilepsy, etc. Physical examination may include mental status , reflex function, limb tension, breath sounds, heart rhythm, tenderness, rebound tenderness, etc. The medical staff input objective indicators, current illness history and physical examination results into this electronic medical record-based disease classification prediction, and the first storage module stores them.

第一信息过滤器102,与所述第一存储模块连接,用于从所述第一存储模块中读取所述电子病历信息并对所述电子病历信息进行过滤,得到过滤后的电子病历信息,包括结构化数据和文本数据。A first information filter 102, connected to the first storage module, for reading the electronic medical record information from the first storage module and filtering the electronic medical record information to obtain filtered electronic medical record information , including structured data and text data.

在本实施方式中,第一信息过滤器与第一存储模块连接,进而从第一存储模块中读取电子病历信息,并对电子病历信息进行过滤,具体地,删除缺失数据和错误的记录数据,例如,删除以下数据:In this embodiment, the first information filter is connected to the first storage module, and then reads the electronic medical record information from the first storage module, and filters the electronic medical record information, specifically, deletes missing data and erroneous record data , for example, to delete the following data:

(1)温度<30℃,(2)SBP>400mmHg,(3)DBP<5mmHg。(1) Temperature<30℃, (2)SBP>400mmHg, (3)DBP<5mmHg.

进而得到过滤后的电子病历信息,包括结构化数据和文本数据,其中,结构化数据包括客观指标,文本数据包括现病史和体格检查结果,以便提高分类准确率。Further, the filtered electronic medical record information includes structured data and text data, wherein the structured data includes objective indicators, and the text data includes current illness history and physical examination results, so as to improve the classification accuracy.

分级预测器103,用于将所述结构化数据和所述文本数据分别输入急诊病情分级预测模型,得到文本特征挖掘信息、病情分级预测结果以及可视化分析文本。The classification predictor 103 is configured to input the structured data and the text data into the emergency disease classification prediction model respectively, and obtain text feature mining information, disease classification prediction results and visual analysis text.

在本实施方式中,分级预测器将结构化数据和文本数据输入训练好的急诊病情分级预测模型,通过急诊病情分级预测模型对结构化数据和文本数据分别进行处理,通过LSTM(长短期记忆网络)结合注意力机制,得到文本特征挖掘信息、病情分级预测结果以及可视化分析文本,以作为医生分级的参考,从而提高分级速率和分级准确率。In this embodiment, the classification predictor inputs structured data and text data into the trained emergency disease classification prediction model, processes the structured data and text data respectively through the emergency disease classification prediction model, and uses LSTM (Long Short Term Memory Network) to process the structured data and text data respectively. ) combined with the attention mechanism to obtain text feature mining information, disease grading prediction results, and visual analysis text, which can be used as a reference for doctors' grading, thereby improving the grading rate and grading accuracy.

在本发明实施例中,通过第一存储模块存储电子病历信息,并通过第一信息过滤器读取电子病历信息并对电子病历信息进行过滤,得到结构化数据和文本数据,通过分级预测器,采用分级预测模型对结构化数据和文本数据进行处理,得到文本特征挖掘信息、病情分级预测结果以及可视化分析文本。通过采用本申请分级预测系统,能够较为快速的对电子病历信息进行处理并得到病情分级预测结果,其中使用的分级预测模型具有较快的分级速度和较高的准确度,获得的病情分级预测结果、文本特征挖掘信息和可视化分析文本可作为医生的参考,协助医生分级以提高分级速度和分级准确度,使有限的急诊资源的到充分的利用。In the embodiment of the present invention, the electronic medical record information is stored by the first storage module, and the electronic medical record information is read and filtered by the first information filter to obtain structured data and text data. The structured data and text data are processed by the hierarchical prediction model, and the text feature mining information, the disease classification prediction results and the visual analysis text are obtained. By using the classification prediction system of the present application, the electronic medical record information can be processed relatively quickly and the disease classification prediction result can be obtained. The classification prediction model used has a faster classification speed and higher accuracy, and the obtained disease classification prediction result , Text feature mining information and visual analysis text can be used as a reference for doctors to assist doctors in grading to improve grading speed and grading accuracy, so that limited emergency resources can be fully utilized.

请参考图2,图2示出了本申请实施例的一种分级预测器的结构框图,如图2所示,在一种可行的实施方式中,所述急诊病情分级预测模型包括表示模型和融合模型,所述分级预测器包括以下201-203三个模块:Please refer to FIG. 2. FIG. 2 shows a structural block diagram of a classification predictor according to an embodiment of the present application. As shown in FIG. 2, in a feasible implementation, the emergency disease classification prediction model includes a representation model and a Fusion model, the hierarchical predictor includes the following 201-203 three modules:

第一表示模块201,用于将所述文本数据输入所述表示模型,得到所述可视化分析文本、所述文本特征挖掘信息和文本特征向量。The first representation module 201 is configured to input the text data into the representation model to obtain the visual analysis text, the text feature mining information and the text feature vector.

在本实施方式中,第一表示模块能够将文本数据输入表示模型,进行处理,得到可视化分析文本、文本特征挖掘信息和文本特征向量。其中,可视化分析文本和文本特征挖掘信息直接输出,文本特征向量用于后续继续处理。In this embodiment, the first representation module can input the text data into the representation model and process it to obtain the visual analysis text, text feature mining information and text feature vector. Among them, the visual analysis text and text feature mining information are directly output, and the text feature vector is used for subsequent processing.

在一种可行的实施方式中,所述第一表示模块包括:In a feasible implementation manner, the first presentation module includes:

映射子模块,用于将所述文本数据输入所述表示模型,将文本数据的每一个汉字映射到一个向量,并提取所述文本数据嵌入的上下文信息,得到嵌入向量;A mapping submodule for inputting the text data into the representation model, mapping each Chinese character of the text data to a vector, and extracting the context information embedded in the text data to obtain an embedded vector;

注意力子模块,用于对所述嵌入向量进行注意力处理,得到所述可视化分析文本和文本特征向量;An attention submodule, for performing attention processing on the embedded vector to obtain the visual analysis text and text feature vector;

提取子模块,用于对所述可视化分析文本进行短语提取,得到所述文本特征挖掘信息。The extraction submodule is used to extract phrases from the visual analysis text to obtain the text feature mining information.

在本实施方式中,映射子模块将文本数据输入表示模型,通过Embedding层将文本数据的每一个汉字映射到一个向量,再通过BiLSTM层提取所述文本数据嵌入的上下文信息,得到嵌入向量,并通过注意力子模块对嵌入向量进行注意力处理,得到可视化分析文本和文本特征向量,其中,可视化分析文本通过提取子模块进一步处理,提取其中的短语级别的文本特征,得到文本特征挖掘信息,其中,可视化分析文本和文本特征挖掘信息可直接输出,文本特征向量用于后续进一步处理。提取短语级别的文本特征更有意义,能够提高分级准确率,如:“无发热”如果被切分为“无”、“发热”,则无法正确挖掘出“无发热”。可视化分析文本和文本特征挖掘信息可满足临床解释性,为了便于医生快速获取重点信息,在具体的实施方式中,可视化分析文本中的重点信息(即文本特征挖掘信息)可以使用不同的颜色进行展示,越重要的信息使用越深的颜色。In this embodiment, the mapping sub-module inputs the text data into the representation model, maps each Chinese character of the text data to a vector through the Embedding layer, and then extracts the context information embedded in the text data through the BiLSTM layer to obtain the embedding vector, and The embedded vector is subjected to attention processing by the attention sub-module, and the visual analysis text and text feature vector are obtained. The visual analysis text is further processed by the extraction sub-module, and the text features at the phrase level are extracted, and the text feature mining information is obtained, wherein , the visual analysis text and text feature mining information can be directly output, and the text feature vector is used for subsequent further processing. Extracting text features at the phrase level is more meaningful and can improve the classification accuracy. For example, if "no fever" is divided into "no fever" and "fever", "no fever" cannot be correctly mined. Visual analysis text and text feature mining information can satisfy clinical interpretability. In order to facilitate doctors to quickly obtain key information, in specific implementations, the key information in the visual analysis text (ie, text feature mining information) can be displayed in different colors. , use darker colors for more important information.

第二表示模块202,用于将所述结构化数据输入所述表示模型,得到结构化特征向量。The second representation module 202 is configured to input the structured data into the representation model to obtain a structured feature vector.

在本实施方式中,第二表示模块可将所述结构化数据输入所述表示模型,通过两个全连接层对结构化数据进行特征提取,得到结构化特征向量。In this embodiment, the second representation module may input the structured data into the representation model, and perform feature extraction on the structured data through two fully connected layers to obtain a structured feature vector.

在上述方式中,使用不同的结构层对结构化数据和文本数据分别进行处理,能够获取更加准确的数据。In the above manner, different structural layers are used to process structured data and text data respectively, so that more accurate data can be obtained.

融合模块203,用于将所述结构化特征向量和所述文本特征向量输入所述融合模型,得到所述病情分级预测结果。The fusion module 203 is configured to input the structured feature vector and the text feature vector into the fusion model to obtain the disease classification prediction result.

在本实施方式中,融合模块可将结构化特征向量和文本特征向量融合进行处理,从而得到病情分级预测结果,病情分级预测结果输出后可与文本特征挖掘信息和可视化分析文本一起作为医生分级的参考,从而提高医生分级速率和分级准确率。In this embodiment, the fusion module can fuse the structured feature vector and the text feature vector for processing, so as to obtain the disease classification prediction result. After the disease classification prediction result is output, it can be used as the doctor classification together with the text feature mining information and the visual analysis text. Reference, thereby improving the doctor's grading rate and grading accuracy.

请参考图3,图3示出了本申请实施例的另一种基于电子病历的病情分级预测系统的结构框图,如图3所示,在一种可行的实施方式中,所述系统还包括:Please refer to FIG. 3 . FIG. 3 shows a structural block diagram of another electronic medical record-based disease classification prediction system according to an embodiment of the present application. As shown in FIG. 3 , in a feasible implementation manner, the system further includes: :

第二存储模块301,用于存储多份电子病历信息样本。The second storage module 301 is configured to store a plurality of electronic medical record information samples.

在对电子病历信息进行分类之前,还需要训练得到急诊病情分级预测模型,所以,第二存储模块中需要存储多份电子病历信息样本,其中,由人工获取多份已有的电子病历信息,将其作为训练样本,且每一份电子病历信息样本均由人工标注有分类结果,再将其输入第二存储模块。Before classifying the electronic medical record information, it is also necessary to train to obtain the emergency disease classification prediction model. Therefore, the second storage module needs to store multiple electronic medical record information samples. It is used as a training sample, and each electronic medical record information sample is manually marked with a classification result, and then input into the second storage module.

第二信息过滤器302,与所述第二存储模块连接,用于从所述第二存储模块中读取所述多份电子病历信息样本并对所述多份电子病历信息样本进行过滤,得到多份过滤后的病历信息样本,包括多份结构化数据样本和文本数据样本;The second information filter 302, connected to the second storage module, is configured to read the plurality of electronic medical record information samples from the second storage module and filter the plurality of electronic medical record information samples to obtain Multiple filtered medical record information samples, including multiple structured data samples and text data samples;

第二信息过滤器的过滤操作与第一信息过滤器的过滤操作相同,第二信息分类器与第一信息分类器相似,具体可参照上述对第一信息过滤器和第一信息分类器的具体操作解释,在此不再赘述。The filtering operation of the second information filter is the same as the filtering operation of the first information filter, and the second information classifier is similar to the first information classifier. The operation explanation will not be repeated here.

模型训练器303,用于将所述多份结构化数据样本和文本数据样本分为训练集、验证集和测试集,对预设模型进行多轮训练,直至训练后的预设模型的评估得分在预设轮数内均不再上升,停止训练,将评估得分最高所对应的模型确定为所述急诊病情分级预测模型。The model trainer 303 is used to divide the multiple structured data samples and text data samples into a training set, a verification set and a test set, and perform multiple rounds of training on the preset model until the evaluation score of the preset model after training If there is no increase in the preset number of rounds, the training is stopped, and the model corresponding to the highest evaluation score is determined as the emergency disease classification prediction model.

在本实施方式中,将多份结构化数据样本和文本数据样本分为训练集、验证集和测试集,例如,可将多份结构化数据样本和文本数据样本分为训练集70%、验证集10%和测试集20%,对预设模型进行多轮训练,具体地,使用训练集对预设模型进行训练,使用验证集对训练后的预设模型进行验证,并计算误差值,通过反向传播算法更新预设模型,并使用测试集对训练后的预设模型进行评估,再重复上述步骤,对更新后的预设模型进行多轮训练操作,直至训练后的预设模型的评估得分在预设轮数内均不再上升,停止训练,将评估得分最高所对应的模型确定为所述急诊病情分级预测模型。模型训练器与分级预测器连接,可将训练好的急诊病情分级预测模型输入分级预测器。In this embodiment, multiple structured data samples and text data samples are divided into training sets, validation sets and test sets. For example, multiple structured data samples and text data samples can be divided into Set 10% of the test set and 20% of the test set, perform multiple rounds of training on the preset model, specifically, use the training set to train the preset model, use the validation set to verify the trained preset model, and calculate the error value, through The back-propagation algorithm updates the preset model, and uses the test set to evaluate the trained preset model, and repeats the above steps to perform multiple rounds of training operations on the updated preset model until the evaluation of the trained preset model If the score no longer increases within the preset number of rounds, the training is stopped, and the model corresponding to the highest evaluation score is determined as the emergency disease classification prediction model. The model trainer is connected with the classification predictor, and the trained emergency disease classification prediction model can be input into the classification predictor.

在本实施方式中,通过上述方式训练得到的急诊病情分级预测模型,能够对结构化数据和文本数据进行快速处理,且得到的分类结果准确率较高。In this embodiment, the emergency disease classification prediction model trained in the above manner can quickly process structured data and text data, and the obtained classification result has a high accuracy rate.

基于同一发明构思,本申请一实施例提供一种基于电子病历的病情分级预测方法,参考图4,图4是申请实施例的一种基于电子病历的病情分级预测方法的步骤流程图,如图4所示,所述方法包括:Based on the same inventive concept, an embodiment of the present application provides a method for predicting disease classification based on electronic medical records. Referring to FIG. 4 , FIG. 4 is a flowchart of steps of a method for predicting disease classification based on electronic medical records according to an embodiment of the application, as shown in FIG. 4, the method includes:

步骤S401:存储电子病历信息,所述电子病历信息包括客观指标、现病史和体格检查结果;Step S401: Store electronic medical record information, where the electronic medical record information includes objective indicators, current illness history and physical examination results;

步骤S402:读取所述电子病历信息并对所述电子病历信息进行过滤,得到过滤后的电子病历信息,包括结构化数据和文本数据;Step S402: reading the electronic medical record information and filtering the electronic medical record information to obtain filtered electronic medical record information, including structured data and text data;

步骤S403:将所述结构化数据和所述文本数据分别输入急诊病情分级预测模型,得到文本特征挖掘信息、病情分级预测结果以及可视化分析文本。Step S403: Input the structured data and the text data into the emergency disease classification prediction model respectively, and obtain text feature mining information, disease classification prediction results and visual analysis text.

可选地,所述急诊病情分级预测模型包括表示模型和融合模型,将所述结构化数据和所述文本数据分别输入急诊病情分级预测模型,得到文本特征挖掘信息、病情分级预测结果以及可视化分析文本,包括:Optionally, the emergency disease classification prediction model includes a representation model and a fusion model, and the structured data and the text data are respectively input into the emergency disease classification prediction model to obtain text feature mining information, disease classification prediction results and visual analysis. text, including:

将所述文本数据输入所述表示模型,得到所述可视化分析文本、所述文本特征挖掘信息和文本特征向量;Inputting the text data into the representation model to obtain the visual analysis text, the text feature mining information and the text feature vector;

将所述结构化数据输入所述表示模型,得到结构化特征向量;Inputting the structured data into the representation model to obtain a structured feature vector;

将所述结构化特征向量和所述文本特征向量输入所述融合模型,得到所述病情分级预测结果。Inputting the structured feature vector and the text feature vector into the fusion model to obtain the disease classification prediction result.

可选地,将所述文本数据输入表示模型,得到所述可视化分析文本、所述文本特征挖掘信息和文本特征向量,包括:Optionally, inputting the text data into a representation model to obtain the visual analysis text, the text feature mining information and the text feature vector, including:

将所述文本数据输入所述表示模型,通过所述表示模型将文本数据的每一个汉字映射到一个向量,得到嵌入向量;Inputting the text data into the representation model, and mapping each Chinese character of the text data to a vector by the representation model to obtain an embedded vector;

对所述嵌入向量进行注意力处理,得到所述可视化分析文本和文本特征向量;Perform attention processing on the embedded vector to obtain the visual analysis text and text feature vector;

对所述可视化分析文本进行短语提取,得到所述文本特征挖掘信息。Phrase extraction is performed on the visual analysis text to obtain the text feature mining information.

可选地,所述方法还包括:Optionally, the method further includes:

存储多份电子病历信息样本;Store multiple electronic medical record information samples;

读取所述多份电子病历信息样本并对所述多份电子病历信息样本进行过滤,得到多份过滤后的电子病历信息样本;reading the plurality of electronic medical record information samples and filtering the plurality of electronic medical record information samples to obtain a plurality of filtered electronic medical record information samples;

对所述多份过滤后的电子病历信息进行分类,得到多份结构化数据样本和文本数据样本;Classifying the plurality of filtered electronic medical record information to obtain a plurality of structured data samples and text data samples;

将所述多份结构化数据样本和文本数据样本分为训练集、验证集和测试集,对预设模型进行多轮训练,直至训练后的预设模型的评估得分在预设轮数内均不再上升,停止训练,将评估得分最高所对应的模型确定为所述急诊病情分级预测模型。The multiple structured data samples and text data samples are divided into training sets, verification sets and test sets, and the preset model is trained for multiple rounds until the evaluation score of the trained preset model is equal to the preset number of rounds. If it stops rising, the training is stopped, and the model corresponding to the highest evaluation score is determined as the emergency condition classification prediction model.

基于同一发明构思,本申请另一实施例提供一种电子设备,包括:存储器、处理器及存储在所述存储器上并可在所述处理器上运行的计算机程序,所述计算机程序被所述处理器执行时,实现上述任一实施例所述的方法中的步骤。Based on the same inventive concept, another embodiment of the present application provides an electronic device, including: a memory, a processor, and a computer program stored on the memory and executable on the processor, the computer program being When executed by the processor, the steps in the method described in any of the foregoing embodiments are implemented.

基于同一发明构思,本申请另一实施例提供一种计算机可读存储介质,所述计算机可读存储介质上存储有计算机程序,所述计算机程序被处理器执行时,实现如本申请上述任一实施例所述的方法中的步骤。Based on the same inventive concept, another embodiment of the present application provides a computer-readable storage medium, where a computer program is stored on the computer-readable storage medium, and when the computer program is executed by a processor, any one of the above-mentioned applications in the present application is implemented. Steps in the methods described in the Examples.

对于方法实施例而言,由于其与系统实施例基本相似,所以描述的比较简单,相关之处参见系统实施例的部分说明即可。As for the method embodiment, since it is basically similar to the system embodiment, the description is relatively simple, and reference may be made to the partial description of the system embodiment for related parts.

本说明书中的各个实施例均采用递进的方式描述,每个实施例重点说明的都是与其他实施例的不同之处,各个实施例之间相同相似的部分互相参见即可。The various embodiments in this specification are described in a progressive manner, and each embodiment focuses on the differences from other embodiments, and the same and similar parts between the various embodiments may be referred to each other.

本领域内的技术人员应明白,本发明实施例的实施例可提供为方法、装置、或计算机程序产品。因此,本发明实施例可采用完全硬件实施例、完全软件实施例、或结合软件和硬件方面的实施例的形式。而且,本发明实施例可采用在一个或多个其中包含有计算机可用程序代码的计算机可用存储介质(包括但不限于磁盘存储器、CD-ROM、光学存储器等)上实施的计算机程序产品的形式。It should be understood by those skilled in the art that the embodiments of the embodiments of the present invention may be provided as a method, an apparatus, or a computer program product. Accordingly, embodiments of the present invention may take the form of an entirely hardware embodiment, an entirely software embodiment, or an embodiment combining software and hardware aspects. Furthermore, embodiments of the present invention may take the form of a computer program product implemented on one or more computer-usable storage media having computer-usable program code embodied therein, including but not limited to disk storage, CD-ROM, optical storage, and the like.

本发明实施例是参照根据本发明实施例的方法、终端设备(系统)、和计算机程序产品的流程图和/或方框图来描述的。应理解可由计算机程序指令实现流程图和/或方框图中的每一流程和/或方框、以及流程图和/或方框图中的流程和/或方框的结合。可提供这些计算机程序指令到通用计算机、专用计算机、嵌入式处理机或其他可编程数据处理终端设备的处理器以产生一个机器,使得通过计算机或其他可编程数据处理终端设备的处理器执行的指令产生用于实现在流程图一个流程或多个流程和/或方框图一个方框或多个方框中指定的功能的装置。Embodiments of the present invention are described with reference to flowcharts and/or block diagrams of methods, terminal devices (systems), and computer program products according to embodiments of the present invention. It will be understood that each flow and/or block in the flowchart illustrations and/or block diagrams, and combinations of flows and/or blocks in the flowchart illustrations and/or block diagrams, can be implemented by computer program instructions. These computer program instructions may be provided to the processor of a general purpose computer, special purpose computer, embedded processor or other programmable data processing terminal equipment to produce a machine that causes the instructions to be executed by the processor of the computer or other programmable data processing terminal equipment Means are created for implementing the functions specified in the flow or flows of the flowcharts and/or the blocks or blocks of the block diagrams.

这些计算机程序指令也可存储在能引导计算机或其他可编程数据处理终端设备以特定方式工作的计算机可读存储器中,使得存储在该计算机可读存储器中的指令产生包括指令装置的制造品,该指令装置实现在流程图一个流程或多个流程和/或方框图一个方框或多个方框中指定的功能。These computer program instructions may also be stored in a computer readable memory capable of directing a computer or other programmable data processing terminal equipment to operate in a particular manner, such that the instructions stored in the computer readable memory result in an article of manufacture comprising instruction means, the The instruction means implement the functions specified in the flow or flow of the flowcharts and/or the block or blocks of the block diagrams.

这些计算机程序指令也可装载到计算机或其他可编程数据处理终端设备上,使得在计算机或其他可编程终端设备上执行一系列操作步骤以产生计算机实现的处理,从而在计算机或其他可编程终端设备上执行的指令提供用于实现在流程图一个流程或多个流程和/或方框图一个方框或多个方框中指定的功能的步骤。These computer program instructions can also be loaded on a computer or other programmable data processing terminal equipment, so that a series of operational steps are performed on the computer or other programmable terminal equipment to produce a computer-implemented process, thereby executing on the computer or other programmable terminal equipment The instructions executed on the above provide steps for implementing the functions specified in the flowchart or blocks and/or the block or blocks of the block diagrams.

尽管已描述了本发明实施例的优选实施例,但本领域内的技术人员一旦得知了基本创造性概念,则可对这些实施例做出另外的变更和修改。所以,所附权利要求意欲解释为包括优选实施例以及落入本发明实施例范围的所有变更和修改。Although preferred embodiments of the embodiments of the present invention have been described, additional changes and modifications to these embodiments may be made by those skilled in the art once the basic inventive concepts are known. Therefore, the appended claims are intended to be construed to include the preferred embodiments as well as all changes and modifications that fall within the scope of the embodiments of the present invention.

最后,还需要说明的是,在本文中,诸如第一和第二等之类的关系术语仅仅用来将一个实体或者操作与另一个实体或操作区分开来,而不一定要求或者暗示这些实体或操作之间存在任何这种实际的关系或者顺序。而且,术语“包括”、“包含”或者其任何其他变体意在涵盖非排他性的包含,从而使得包括一系列要素的过程、方法、物品或者终端设备不仅包括那些要素,而且还包括没有明确列出的其他要素,或者是还包括为这种过程、方法、物品或者终端设备所固有的要素。在没有更多限制的情况下,由语句“包括一个……”限定的要素,并不排除在包括所述要素的过程、方法、物品或者终端设备中还存在另外的相同要素。Finally, it should also be noted that in this document, relational terms such as first and second are used only to distinguish one entity or operation from another, and do not necessarily require or imply these entities or that there is any such actual relationship or sequence between operations. Moreover, the terms "comprising", "comprising" or any other variation thereof are intended to encompass non-exclusive inclusion, such that a process, method, article or terminal device comprising a list of elements includes not only those elements, but also a non-exclusive list of elements. other elements, or also include elements inherent to such a process, method, article or terminal equipment. Without further limitation, an element defined by the phrase "comprises a..." does not preclude the presence of additional identical elements in the process, method, article or terminal device comprising said element.

以上对本发明所提供的一种基于电子病历的病情分级预测系统、一种基于电子病历的病情分级预测方法、一种电子设备和一种计算机可读存储介质进行了详细介绍,本文中应用了具体个例对本发明的原理及实施方式进行了阐述,以上实施例的说明只是用于帮助理解本发明的方法及其核心思想;同时,对于本领域的一般技术人员,依据本发明的思想,在具体实施方式及应用范围上均会有改变之处,综上所述,本说明书内容不应理解为对本发明的限制。A disease classification prediction system based on electronic medical records, a disease classification prediction method based on electronic medical records, an electronic device and a computer-readable storage medium provided by the present invention have been described in detail above. The principles and implementations of the present invention are described with a single example, and the descriptions of the above embodiments are only used to help understand the method and the core idea of the present invention; There will be changes in the implementation manner and the application scope. To sum up, the contents of this specification should not be construed as limiting the present invention.

Claims (10)

1. A system for staging and predicting a medical condition based on an electronic medical record, the system comprising:
the first storage module is used for storing electronic medical record information, and the electronic medical record information comprises objective indexes, current medical history and physical examination results;
the first information filter is connected with the first storage module and used for reading the electronic medical record information from the first storage module and filtering the electronic medical record information to obtain the filtered electronic medical record information which comprises structured data and text data;
and the hierarchical predictor is used for respectively inputting the structured data and the text data into an emergency disease hierarchical prediction model to obtain text characteristic mining information, a disease hierarchical prediction result and a visual analysis text.
2. The system of claim 1, wherein the emergency condition staging prediction model comprises a representation model and a fusion model, and wherein the staging predictor comprises:
the first representation module is used for inputting the text data into the representation model to obtain the visual analysis text, the text feature mining information and a text feature vector;
the second representation module is used for inputting the structured data into the representation model to obtain a structured feature vector;
and the fusion module is used for inputting the structural feature vector and the text feature vector into the fusion model to obtain the disease condition grading prediction result.
3. The system of claim 2, wherein the first representation module comprises:
the mapping submodule is used for inputting the text data into the representation model, mapping each Chinese character of the text data to a vector, and extracting context information embedded in the text data to obtain an embedded vector;
the attention submodule is used for carrying out attention processing on the embedded vector to obtain the visual analysis text and the text feature vector;
and the extraction submodule is used for carrying out phrase extraction on the visual analysis text to obtain the text feature mining information.
4. The system of claim 1, further comprising:
the second storage module is used for storing a plurality of electronic medical record information samples;
the second information filter is connected with the second storage module and used for reading the plurality of electronic medical record information samples from the second storage module and filtering the plurality of electronic medical record information samples to obtain a plurality of filtered medical record information samples which comprise a plurality of structured data samples and text data samples;
and the model trainer is used for dividing the multiple structured data samples and the text data samples into a training set, a verification set and a test set, performing multiple rounds of training on the preset model until the evaluation score of the trained preset model does not rise within the preset number of rounds, stopping training, and determining the model corresponding to the highest evaluation score as the emergency disease condition graded prediction model.
5. A disease grading prediction method based on an electronic medical record is characterized by comprising the following steps:
storing electronic medical record information, wherein the electronic medical record information comprises objective indexes, current medical history and physical examination results;
reading the electronic medical record information and filtering the electronic medical record information to obtain filtered electronic medical record information, wherein the filtered electronic medical record information comprises
Structured data and text data;
and respectively inputting the structured data and the text data into an emergency disease condition grading prediction model to obtain text characteristic mining information, a disease condition grading prediction result and a visual analysis text.
6. The method of claim 5, wherein the emergency disease grading prediction model comprises a representation model and a fusion model, and the structured data and the text data are respectively input into the emergency disease grading prediction model to obtain text feature mining information, a disease grading prediction result and a visual analysis text, and the method comprises the following steps:
inputting the text data into the representation model to obtain the visual analysis text, the text feature mining information and a text feature vector;
inputting the structured data into the representation model to obtain a structured feature vector;
and inputting the structural feature vector and the text feature vector into the fusion model to obtain the disease condition grading prediction result.
7. The method of claim 6, wherein inputting the text data into a representation model, resulting in the visual analysis text, the text feature mining information, and a text feature vector, comprises:
inputting the text data into the representation model, mapping each Chinese character of the text data to a vector, and extracting context information embedded in the text data to obtain an embedded vector;
performing attention processing on the embedded vector to obtain the visual analysis text and a text feature vector;
and performing phrase extraction on the visual analysis text to obtain text feature mining information.
8. The method of claim 5, further comprising:
storing a plurality of electronic medical record information samples;
reading the plurality of electronic medical record information samples and filtering the plurality of electronic medical record information samples to obtain a plurality of filtered electronic medical record information samples, wherein the plurality of filtered electronic medical record information samples comprise a plurality of structured data samples and text data samples;
and dividing the multiple structured data samples and the text data samples into a training set, a verification set and a test set, performing multiple rounds of training on the preset model until the evaluation score of the trained preset model does not rise within the preset rounds, stopping training, and determining the model corresponding to the highest evaluation score as the emergency disease condition grading prediction model.
9. An electronic device, comprising: a memory, a processor and a computer program stored on the memory and executable on the processor, the computer program when executed by the processor implementing the steps of the method for predicting a medical condition grade based on an electronic medical record according to any one of claims 5 to 8.
10. A computer-readable storage medium, having a computer program stored thereon, which, when being executed by a processor, implements the steps of the method for medical condition stratification prediction based on electronic medical record according to any of claims 5 to 8.
CN201911007766.1A 2019-10-22 2019-10-22 Disease grading prediction system, method, electronic device and readable storage medium Pending CN110874409A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201911007766.1A CN110874409A (en) 2019-10-22 2019-10-22 Disease grading prediction system, method, electronic device and readable storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201911007766.1A CN110874409A (en) 2019-10-22 2019-10-22 Disease grading prediction system, method, electronic device and readable storage medium

Publications (1)

Publication Number Publication Date
CN110874409A true CN110874409A (en) 2020-03-10

Family

ID=69717848

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201911007766.1A Pending CN110874409A (en) 2019-10-22 2019-10-22 Disease grading prediction system, method, electronic device and readable storage medium

Country Status (1)

Country Link
CN (1) CN110874409A (en)

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111724897A (en) * 2020-06-12 2020-09-29 电子科技大学 A kind of sports function data processing method and system
CN112687367A (en) * 2020-12-29 2021-04-20 中国人民解放军总医院 Medical record grouping method, device and equipment based on dynamic disease condition and storage medium
WO2021159761A1 (en) * 2020-09-09 2021-08-19 平安科技(深圳)有限公司 Pathological data analysis method and apparatus, and computer device and storage medium
CN113793678A (en) * 2021-08-31 2021-12-14 中国医学科学院北京协和医院 Emergency grading assistance method and device for atypical critically ill patients
CN117009924A (en) * 2023-10-07 2023-11-07 之江实验室 Multi-mode self-adaptive multi-center data fusion method and system guided by electronic medical records
CN117558391A (en) * 2024-01-11 2024-02-13 天津市胸科医院 Postoperative condition deep learning method and system based on aortic dissection medical record data

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20140257122A1 (en) * 2013-03-08 2014-09-11 Singapore Health Services Pte Ltd System and method of determining a risk score for triage
CN107169298A (en) * 2017-05-26 2017-09-15 深圳市第二人民医院 A kind of intelligent emergency treatment classification system for distribution of out-patient department
US20190035505A1 (en) * 2017-07-31 2019-01-31 Boe Technology Group Co., Ltd. Intelligent triage server, terminal and system based on medical knowledge base (mkb)
CN109508377A (en) * 2018-11-26 2019-03-22 南京云思创智信息科技有限公司 Text feature, device, chat robots and storage medium based on Fusion Model
CN110097955A (en) * 2019-03-07 2019-08-06 南通奕霖智慧医学科技有限公司 A kind of paediatrics intelligence emergency treatment previewing triage system based on support vector machine classifier
CN110164549A (en) * 2019-05-20 2019-08-23 南通奕霖智慧医学科技有限公司 A kind of paediatrics based on neural network classifier point examines method and system

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20140257122A1 (en) * 2013-03-08 2014-09-11 Singapore Health Services Pte Ltd System and method of determining a risk score for triage
CN107169298A (en) * 2017-05-26 2017-09-15 深圳市第二人民医院 A kind of intelligent emergency treatment classification system for distribution of out-patient department
US20190035505A1 (en) * 2017-07-31 2019-01-31 Boe Technology Group Co., Ltd. Intelligent triage server, terminal and system based on medical knowledge base (mkb)
CN109508377A (en) * 2018-11-26 2019-03-22 南京云思创智信息科技有限公司 Text feature, device, chat robots and storage medium based on Fusion Model
CN110097955A (en) * 2019-03-07 2019-08-06 南通奕霖智慧医学科技有限公司 A kind of paediatrics intelligence emergency treatment previewing triage system based on support vector machine classifier
CN110164549A (en) * 2019-05-20 2019-08-23 南通奕霖智慧医学科技有限公司 A kind of paediatrics based on neural network classifier point examines method and system

Cited By (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111724897A (en) * 2020-06-12 2020-09-29 电子科技大学 A kind of sports function data processing method and system
CN111724897B (en) * 2020-06-12 2022-07-01 电子科技大学 Motion function data processing method and system
WO2021159761A1 (en) * 2020-09-09 2021-08-19 平安科技(深圳)有限公司 Pathological data analysis method and apparatus, and computer device and storage medium
CN112687367A (en) * 2020-12-29 2021-04-20 中国人民解放军总医院 Medical record grouping method, device and equipment based on dynamic disease condition and storage medium
CN113793678A (en) * 2021-08-31 2021-12-14 中国医学科学院北京协和医院 Emergency grading assistance method and device for atypical critically ill patients
CN117009924A (en) * 2023-10-07 2023-11-07 之江实验室 Multi-mode self-adaptive multi-center data fusion method and system guided by electronic medical records
CN117009924B (en) * 2023-10-07 2024-01-26 之江实验室 Multi-mode self-adaptive multi-center data fusion method and system guided by electronic medical records
CN117558391A (en) * 2024-01-11 2024-02-13 天津市胸科医院 Postoperative condition deep learning method and system based on aortic dissection medical record data
CN117558391B (en) * 2024-01-11 2024-03-12 天津市胸科医院 Postoperative condition deep learning method and system based on aortic dissection medical record data

Similar Documents

Publication Publication Date Title
CN110874409A (en) Disease grading prediction system, method, electronic device and readable storage medium
CN109697285B (en) Hierarchical BilSt Chinese electronic medical record disease coding and labeling method for enhancing semantic representation
CN110162779B (en) Medical record quality evaluation method, device and equipment
Bilano et al. Risk factors of pre-eclampsia/eclampsia and its adverse outcomes in low-and middle-income countries: a WHO secondary analysis
CN107833603B (en) Electronic medical record document classification method and device, electronic equipment and storage medium
CN107578798B (en) Method and system for processing electronic medical record
US20090299977A1 (en) Method for Automatic Labeling of Unstructured Data Fragments From Electronic Medical Records
CN108280149A (en) A kind of doctor-patient dispute class case recommendation method based on various dimensions tag along sort
CN110136788A (en) A medical record quality inspection method, device, equipment and storage medium based on automatic detection
CN106844351B (en) A multi-data source-oriented medical institution organization entity identification method and device
CN110010217A (en) An electronic medical record labeling method and device
US20200293528A1 (en) Systems and methods for automatically generating structured output documents based on structural rules
CN112541066A (en) Text-structured-based medical and technical report detection method and related equipment
CN109299279B (en) Data processing method, device, system and medium
CN109299467B (en) Medical text recognition method and device and sentence recognition model training method and device
CN116884612A (en) Intelligent analysis method, device, equipment and storage medium for disease risk level
CN117828355A (en) Emotion quantitative model training method and emotion quantitative method based on multi-modal information
Zweigenbaum et al. Multiple Methods for Multi-class, Multi-label ICD-10 Coding of Multi-granularity, Multilingual Death Certificates.
CN107122582B (en) Multi-data source-oriented diagnosis and treatment entity recognition method and device
CN117672440A (en) Electronic medical record text information extraction method and system based on neural network
CN112802598A (en) Real-time auxiliary diagnosis and treatment method and system based on voice diagnosis and treatment data
CN111724873B (en) Data processing method and device
CN112071431B (en) Clinical path automatic generation method and system based on deep learning and knowledge graph
CN108831560B (en) Method and device for determining medical data attribute data
CN117766133A (en) Intelligent algorithm-based traditional Chinese medicine syndrome identification method and device

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication
RJ01 Rejection of invention patent application after publication

Application publication date: 20200310