WO2022147910A1 - Medical record information verification method and apparatus, and computer device and storage medium - Google Patents

Medical record information verification method and apparatus, and computer device and storage medium Download PDF

Info

Publication number
WO2022147910A1
WO2022147910A1 PCT/CN2021/083196 CN2021083196W WO2022147910A1 WO 2022147910 A1 WO2022147910 A1 WO 2022147910A1 CN 2021083196 W CN2021083196 W CN 2021083196W WO 2022147910 A1 WO2022147910 A1 WO 2022147910A1
Authority
WO
WIPO (PCT)
Prior art keywords
case
medical record
information
vector
department
Prior art date
Application number
PCT/CN2021/083196
Other languages
French (fr)
Chinese (zh)
Inventor
朱昭苇
孙行智
胡岗
Original Assignee
平安科技(深圳)有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 平安科技(深圳)有限公司 filed Critical 平安科技(深圳)有限公司
Publication of WO2022147910A1 publication Critical patent/WO2022147910A1/en

Links

Images

Classifications

    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16HHEALTHCARE INFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR THE HANDLING OR PROCESSING OF MEDICAL OR HEALTHCARE DATA
    • G16H10/00ICT specially adapted for the handling or processing of patient-related medical or healthcare data
    • G16H10/60ICT specially adapted for the handling or processing of patient-related medical or healthcare data for patient-specific data, e.g. for electronic patient records
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/045Combinations of networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16HHEALTHCARE INFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR THE HANDLING OR PROCESSING OF MEDICAL OR HEALTHCARE DATA
    • G16H40/00ICT specially adapted for the management or administration of healthcare resources or facilities; ICT specially adapted for the management or operation of medical equipment or devices
    • G16H40/20ICT specially adapted for the management or administration of healthcare resources or facilities; ICT specially adapted for the management or operation of medical equipment or devices for the management or administration of healthcare resources or facilities, e.g. managing hospital staff or surgery rooms
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16HHEALTHCARE INFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR THE HANDLING OR PROCESSING OF MEDICAL OR HEALTHCARE DATA
    • G16H50/00ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics
    • G16H50/70ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics for mining of medical data, e.g. analysing previous cases of other patients
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02ATECHNOLOGIES FOR ADAPTATION TO CLIMATE CHANGE
    • Y02A90/00Technologies having an indirect contribution to adaptation to climate change
    • Y02A90/10Information and communication technologies [ICT] supporting adaptation to climate change, e.g. for weather forecasting or climate simulation

Definitions

  • the present application relates to the technical field of detection models, and in particular, to a medical record information verification method, device, computer equipment and storage medium.
  • Medical record quality monitoring is one of the effective means to standardize medical behavior.
  • the manual verification method is inefficient, which in turn causes the problem of low quality monitoring accuracy.
  • the embodiments of the present application provide a medical record information verification method, device, computer equipment and storage medium, so as to solve the problem of low accuracy of quality monitoring due to incomplete utilization of case information.
  • a medical record information verification method comprising:
  • the medical record text to be verified includes case information, department information associated with the case information, and diagnostic information;
  • the diagnostic information is matched with each of the case judgment results, and when the diagnostic information is successfully matched with any one of the case judgment results, it is determined that the text of the medical record to be verified is successfully verified.
  • a medical record information verification device comprising:
  • a medical record text acquisition module used for acquiring the medical record text to be verified;
  • the medical record text to be verified includes case information, department information associated with the case information, and diagnosis information;
  • the first vector characterization module is used to input the case information into the case characterization model to obtain a case characterization vector corresponding to the case information; meanwhile, input the department information into the department characterization model to obtain the same The department representation vector corresponding to the department information;
  • a vector splicing module for splicing the case representation vector and the department representation vector to obtain a medical record splicing vector
  • a case judgment module configured to input the medical record splicing vector into a case judgment network model, and determine at least one case judgment result corresponding to the medical record text to be verified;
  • the case matching module is configured to match the diagnosis information with each of the case judgment results, and when the diagnosis information and any one of the case judgment results are successfully matched, it is determined that the verification of the medical record text to be verified is successful.
  • a computer device comprising a memory, a processor, and computer-readable instructions stored in the memory and executable on the processor, wherein the processor implements the following steps when executing the computer-readable instructions:
  • the medical record text to be verified includes case information, department information and diagnostic information associated with the case information;
  • the diagnostic information is matched with each of the case judgment results, and when the diagnostic information is successfully matched with any one of the case judgment results, it is determined that the text of the medical record to be verified is successfully verified.
  • One or more readable storage media storing computer-readable instructions, wherein the computer-readable instructions, when executed by one or more processors, cause the one or more processors to perform the following steps:
  • the medical record text to be verified includes case information, department information associated with the case information, and diagnostic information;
  • the diagnostic information is matched with each of the case judgment results, and when the diagnostic information is successfully matched with any one of the case judgment results, it is determined that the text of the medical record to be verified is successfully verified.
  • the above-mentioned medical record information verification method, device, computer equipment and storage medium obtains the medical record text to be verified;
  • the medical record text to be verified includes case information, department information and diagnosis information associated with the case information;
  • the case information is input into the case representation model, and a case representation vector corresponding to the case information is obtained;
  • the department information is input into the department representation model to obtain a department representation vector corresponding to the department information;
  • the case characterization vector and the department characterization vector are spliced to obtain a medical record splicing vector;
  • the medical record splicing vector is input into the case discrimination network model, and at least one case judgment result corresponding to the medical record text to be verified is determined. ;
  • the application learns the correlation between case information and department information through the case representation model and department representation model, so that the case discrimination network model predicts and outputs the case discrimination result based on the case information and department information. It has higher accuracy and improves the efficiency of medical record information verification and monitoring.
  • FIG. 1 is a schematic diagram of an application environment of a method for verifying medical record information in an embodiment of the present application
  • FIG. 2 is a flowchart of a method for verifying medical record information in an embodiment of the present application
  • FIG. 3 is another flowchart of a method for verifying medical record information in an embodiment of the present application.
  • step S40 is a flowchart of step S40 in the method for verifying medical record information in an embodiment of the present application
  • FIG. 5 is a schematic block diagram of a medical record information verification device in an embodiment of the present application.
  • FIG. 6 is another principle block diagram of the device for verifying medical record information in an embodiment of the present application.
  • FIG. 7 is a schematic block diagram of a case judgment module in a medical record information verification device in an embodiment of the present application.
  • FIG. 8 is a schematic diagram of a computer device in an embodiment of the present application.
  • the medical record information verification method provided by the embodiment of the present application can be applied in the application environment shown in FIG. 1 .
  • the medical record information verification method is applied in a medical record information verification system.
  • the medical record information verification system includes a client and a server as shown in FIG. Incomplete utilization of information leads to the problem of low accuracy of quality control.
  • the client also known as the client, refers to the program corresponding to the server and providing local services for the client.
  • Clients can be installed on, but not limited to, various personal computers, laptops, smartphones, tablets, and portable wearable devices.
  • the server can be implemented as an independent server or a server cluster composed of multiple servers.
  • a method for verifying medical record information is provided, and the method is applied to the server in FIG. 1 as an example for description, including the following steps:
  • the medical record text to be verified refers to the historical medical record text waiting to be verified
  • the medical record text to be verified contains case information, such as the patient's basic information (such as name, gender, test date, etc.), symptom information (such as chief complaint symptoms, test information, etc.), department information related to case information (such as the respiratory department corresponding to the symptoms of cough and sore throat is department information) and diagnostic information (such as the doctor's judgment for the symptoms of cough and sore throat as throat inflammation for diagnostic information).
  • case information such as the patient's basic information (such as name, gender, test date, etc.)
  • symptom information Such as chief complaint symptoms, test information, etc.
  • department information related to case information such as the respiratory department corresponding to the symptoms of cough and sore throat is department information
  • diagnostic information such as the doctor's judgment for the symptoms of cough and sore throat as throat inflammation for diagnostic information
  • S20 Input the case information into a case representation model to obtain a case representation vector corresponding to the case information; at the same time, input the department information into the department representation model to obtain a department representation corresponding to the department information vector;
  • the case representation model and the department representation model are both constructed based on the convolutional neural network model.
  • the case representation model is used to convert case information into case representation vectors
  • the department representation model is used to convert department information into department representation vectors.
  • the case information in the medical record text to be verified is input into the case representation model, and the case information is processed by convolution pooling, etc., to obtain the case representation corresponding to the case information.
  • the department information in the medical record text to be verified is input into the department characterization model, and the department information is processed by convolution pooling, etc., to obtain the department characterization vector corresponding to the department information.
  • the case information may be preprocessed before inputting the case information into the case representation model.
  • the case information is "I started coughing about three days ago”
  • the case information is trimmed into
  • the shorter sentence pair form of "cough for three days” means that the text length of the case information can be shortened while the important information in the case information is not changed, and then the model vector can be shortened when the case information is input into the case representation model.
  • the conversion time improves the efficiency of medical record information verification; similarly, before inputting the department information into the department representation model, the department information can also be preprocessed. Hospital Respiratory Department”, the department information is cut into a shorter sentence pair form of “Respiratory Department”.
  • step S20 it further includes:
  • S01 Obtain a preset medical record sample text set; the preset medical record sample text set includes at least one medical record sample text; the medical record sample text includes case sample information and department sample information corresponding to the case sample information; The medical record sample text is associated with a medical record sample label;
  • the medical record sample text can be obtained by crawling the medical record information text library, and the medical record sample text contains case sample information, such as the basic information of the patient (such as name, gender, test date, etc.), symptom information (such as chief complaint symptoms, test information, etc.), and the department sample information corresponding to the case sample information (for example, the respiratory department corresponding to the symptoms of cough and sore throat is the department information).
  • case sample information such as the basic information of the patient (such as name, gender, test date, etc.), symptom information (such as chief complaint symptoms, test information, etc.), and the department sample information corresponding to the case sample information (for example, the respiratory department corresponding to the symptoms of cough and sore throat is the department information).
  • a medical record sample text is associated with a medical record sample label, the medical record sample label is determined according to the case sample information and the department sample information, and the medical record sample label includes a positive medical record sample label and a negative medical record sample label; It is understandable that in the medical record sample text If the case sample information and department sample information match each other, the medical record sample label associated with the medical record sample text is the positive medical record sample label; if the case sample information does not match the department sample information in the medical record sample text, the medical record sample text The associated medical record sample labels are negative medical record sample labels.
  • the case sample information is "cough for 3 days", if the department sample information is "respiratory department", the medical record sample text is the positive medical record sample text, and the medical record sample label is the positive medical record sample label; if the department sample information is " Psychiatry”, the medical record sample text is the negative medical record sample text, and the medical record sample label is the negative medical record sample label.
  • the case training model and the department training model of the preset twin representation model in step S02 are trained through different positive medical record sample texts and negative medical record sample texts, so that the case training model and the department training model can achieve better training effects. , you can distinguish whether the case sample information matches the department sample information.
  • S02 Input the medical record sample text into a preset twin representation model, and perform vector representation on the case sample information through a case training model that includes a first initial parameter in the preset twin representation model, to obtain a case sample vector
  • vector characterization is carried out on the sample information of the department through the department training model including the second initial parameter in the preset twin characterization model to obtain the department sample vector;
  • the preset twin representation model is used to learn the representation of case sample information and department sample information.
  • the preset twin representation model includes a case training model and a department training model. Both the case training model and the department training model are based on convolution. Generated by neural network model building.
  • the medical record sample text is input into the preset twin representation model, and the case is trained by the case training model including the first initial parameter in the preset twin representation model.
  • the sample information is represented by a vector, that is, the case sample information is subjected to convolution pooling and other processing to obtain a case sample vector; at the same time, the department is trained by the department training model that includes the second initial parameter in the preset twin representation model.
  • the sample information is represented by a vector, that is, the sample information of the department is processed by convolution and pooling, and the sample vector of the department is obtained.
  • the model cannot learn because the department sample information name is too short and does not have rich semantic information.
  • the model training is performed through the case sample information and the department sample information, so that the department training model can also learn the department information representation that contains the semantic information rich in the case sample information.
  • S03 Perform splicing processing on the case sample vector and the department sample vector to obtain a sample splicing vector, and input the sample splicing vector into an initial regression model to determine the label prediction probability corresponding to the medical record sample text;
  • the case sample information is represented by a vector through a case training model that includes the first initial parameter in the preset twin representation model, and a case is obtained.
  • sample vector at the same time, vector representation is performed on the department sample information through the department training model including the second initial parameter in the preset twin representation model, and after the department sample vector is obtained, the department sample vector is spliced to the case
  • the sample splicing vector is obtained, and the sample splicing vector is input into the initial regression model to determine the label prediction probability corresponding to the sample splicing vector, that is, to determine whether the department sample vector matches the case sample vector.
  • S04 Determine the predicted loss value of the preset twin representation model according to the medical record sample label and the label prediction probability
  • a sample splicing vector is obtained, and the sample splicing vector is input into the initial regression model, and the label prediction corresponding to the medical record sample text is determined.
  • the predicted loss value is determined by the cross-entropy loss function; the cross-entropy loss function is:
  • Loss is the prediction loss value
  • w1 and w0 are the weights of the preset twin representation model
  • y is the label of the medical record sample
  • p is the label prediction probability.
  • the medical record sample text includes positive medical record sample text and negative medical record sample text.
  • the associated medical record sample label is a positive medical record sample label.
  • the label value of the label is 1; when the medical record sample text is the negative medical record sample text, the associated medical record sample label is the negative medical record sample label, and the label value of the negative medical record sample label is 0; therefore, when input to the preset twin representation model
  • the medical record sample text is positive medical record sample text.
  • y is 1, and p represents the probability that the predicted department sample information matches the case sample information; when the medical record sample text input to the preset twin representation model is negative
  • y is 0, and 1-p represents the probability that the predicted department sample information does not match the case sample information.
  • w1 and w0 in the above-mentioned cross-entropy loss function are weight values. It is understandable that w1 is used to predict the positive medical record sample text into the negative medical record sample text (that is, the department sample information is matched with the case sample information, predicting The prediction loss function of the department sample information does not match the case sample information) has a larger loss of rotation, w0 is to predict the negative medical record sample text into the positive medical record sample text (that is, the department sample information does not match the case sample information, the prediction is The prediction loss function of the department sample information and the case sample information match) is smaller and the loss is reversed, so that the recall rate of the preset twin representation model can be improved, the generalization ability of the preset twin representation model can be improved, and the result obtained in step S20 can be prevented. Case representation vectors and department representation vectors are filtered out of too much important information.
  • the convergence condition can be the condition that the predicted loss value is less than the set threshold, that is, when the predicted loss value is less than the set threshold, the training is stopped; the convergence condition can also be that the predicted loss value after 10,000 calculations is The condition is very small and will not decrease again, that is, when the predicted loss value is small and will not decrease after 10,000 calculations, stop training, and record the case training model after convergence as the case representation model, The department training model after convergence is recorded as the department characterization model.
  • the predicted loss value adjusts the first initial parameter of the case training model and the second initial parameter of the department training model, and re-inputs the case sample text into the preset twin representation model after adjusting the first and second initial parameters , so that when the predicted loss value corresponding to the medical record sample text reaches the preset convergence condition, select another medical record sample text in the preset medical record sample text set, and execute the above steps S01 to S04, and obtain the corresponding medical record sample text.
  • Predict the loss value and when the predicted loss value does not reach the preset convergence condition, adjust the first initial parameter of the case training model and the second initial parameter of the department training model again according to the predicted loss value, so that the medical record sample text The corresponding prediction loss value reaches the preset convergence condition.
  • the output results of the preset twin representation model can continue to be closer to the accurate results, so that the recognition accuracy is getting higher and higher.
  • the case training model after convergence is recorded as the case representation model
  • the department training model after convergence is recorded as the department Representation model.
  • the case information is input into the case representation model, and the case representation vector corresponding to the case information is obtained; at the same time, the department information is input into the department representation model, and the corresponding department information is obtained. After the department representation vector, the department representation vector is spliced to the back end of the case representation vector to obtain the medical record splicing vector.
  • S40 Input the medical record splicing vector into the case discrimination network model, and determine at least one case judgment result corresponding to the medical record text to be verified;
  • the case discrimination network model is used to determine the case determination result corresponding to the medical record to be verified according to the medical record splicing vector (ie, case information and department information).
  • the medical record splicing vector is input into the case discrimination network model, so as to diagnose and predict the medical records to be verified according to the medical record splicing vector, and then determine At least one case judgment result corresponding to the medical record text to be verified. Understandably, for case information and department information, one or more different case judgment results may be included.
  • the medical record splicing vector is input into the case discrimination network model, and the medical record splicing vector is subjected to convolution pooling.
  • the medical record splicing vector is input into the case discrimination network model, and the medical record splicing vector is subjected to convolution pooling.
  • at least one case judgment result corresponding to the medical record text to be verified is obtained, and one case judgment result is also associated with a judgment probability, that is, according to the case information and department information in the medical record text to be verified, it can be determined.
  • the probability that the corresponding diagnostic information is a case judgment result is the judgment probability.
  • step S40 includes:
  • S401 Perform convolution pooling on the medical record splicing vector through a preset convolutional neural network in the case discrimination network model to obtain a medical record output vector;
  • the medical record splicing vector is subjected to convolution pooling processing through the preset convolutional neural network in the case discrimination network model, Get the medical record output vector.
  • the preset convolutional neural network may be a TextCNN network (text classification convolutional neural network).
  • the medical record splicing vector is input into the case discrimination network model, it also includes:
  • the third initial parameter refers to the parameters of the case representation model obtained after the training of the case training model in steps S01-S05 is completed. It is understandable that the model parameters of the case training model are the first initial parameters, and the case is obtained after the training is completed. The model parameters characterizing the model are updated to the third initial parameters.
  • the fourth initial parameter refers to the parameters of the department representation model obtained after the training of the department training model in steps S01-S05 is completed. It is understandable that the model parameters of the department training model are the second initial parameters, which are obtained after the training is completed. The model parameters of the department characterization model are updated to the fourth initial parameters.
  • the average value of the third initial parameter and the fourth initial parameter is recorded as the discriminative initial parameter of the preset convolutional neural network.
  • the case discrimination network model can be trained by a preset training sample (such as the positive medical record sample text in step S01), so that the case discrimination network model can be trained.
  • the positive diagnosis information contained in the positive medical record sample text can be regarded as In order to be correct information, the case judgment results output by the case discrimination network model are close to or even the same as the positive diagnosis information.
  • S402 Perform case classification on the medical record output vector through a preset classification network in the case discrimination network model, and determine a case judgment result corresponding to the medical record text to be verified.
  • the predetermined classification network in the case discrimination network model is used to perform convolution and pooling processing.
  • the medical record output vector is used for case classification, and a case judgment result corresponding to the medical record text to be verified is determined.
  • the preset classification network is the softmax layer in the case discrimination network model.
  • S50 Match the diagnostic information with each of the case judgment results, and when the diagnostic information and any one of the case judgment results are successfully matched, determine that the text of the medical record to be verified is successfully verified.
  • the diagnostic information is matched with each case determination result, exemplarily , for example, by determining the similarity between the diagnosis information and the judgment results of each case, or by performing character matching between the diagnosis information and the judgment results of each case through regular expressions, and then when the diagnosis information and any case judgment result are successfully matched , if the similarity between the diagnostic information and the case judgment result is greater than the preset similarity threshold (such as 95%), or the character matching between the diagnostic information and the case judgment result reaches more than 95%, it is determined that the text of the medical record to be verified is verified. Success, that is, it is determined that the diagnostic information in the medical record text to be verified is correct.
  • the preset similarity threshold such as 95%)
  • a case judgment result is also associated with a judgment probability, so in inputting the medical record splicing vector into the case judgment network model, at least one case judgment corresponding to the medical record text to be verified is determined.
  • the judgment probability corresponding to the result is not the largest, and then all the case judgment results in the case judgment sequence before the judgment result to be confirmed are sent to the preset receiver, so that the preset receiver can judge whether the medical record to be verified is verified. success.
  • the preset recipient may be a medical record manager or a medical record inspector.
  • the method further includes:
  • the representative diagnosis information may not match the case information and department information, and then it is determined that the diagnosis information does not match the case information and the department information.
  • the text verification of the medical record to be verified fails, and a risk of misjudgment of the diagnostic information is prompted, so as to wait for the preset recipient to manually verify the medical record to be verified.
  • case information and department information by introducing case information and department information, the correlation between case information and department information is learned through the case representation model and department representation model, so that the case discrimination network model can predict and output the case information and department information.
  • the results of the case judgment have higher accuracy, and improve the efficiency of medical record information verification and monitoring.
  • a medical record information verification apparatus is provided, and the medical record information verification apparatus corresponds one-to-one with the medical record information verification method in the above embodiment.
  • the medical record information verification device includes a medical record text acquisition module 10 , a first vector representation module 20 , a vector splicing module 30 , a case judgment module 40 and a case matching module 50 .
  • the detailed description of each functional module is as follows:
  • the medical record text acquisition module 10 is used to acquire the medical record text to be verified; the medical record text to be verified includes case information, department information associated with the case information, and diagnosis information;
  • the first vector representation module 20 is used for inputting the case information into the case representation model to obtain a case representation vector corresponding to the case information; meanwhile, inputting the department information into the department representation model to obtain a case representation vector corresponding to the case information. Describe the department representation vector corresponding to the department information;
  • the vector splicing module 30 is used for splicing the case representation vector and the department representation vector to obtain a medical record splicing vector;
  • the case judgment module 40 is used to input the medical record splicing vector into the case judgment network model, and determine at least one case judgment result corresponding to the medical record text to be verified;
  • the case matching module 50 is used to match the diagnosis information with each of the case judgment results, and when the diagnosis information is successfully matched with any of the case judgment results, it is determined that the verification of the medical record text to be verified is successful .
  • the medical record information verification device further includes:
  • a medical record sample text set acquisition module 01 configured to acquire a preset medical record sample text set; the preset medical record sample text set includes at least one medical record sample text; the medical record sample text includes case sample information and corresponds to the case sample information Department sample information; a medical record sample text associated with a medical record sample label;
  • the second vector characterization module 02 is configured to input the text of the medical record sample into a preset twin representation model, and perform vectorization on the case sample information through the case training model including the first initial parameter in the preset twin representation model Characterization, to obtain a case sample vector; at the same time, through the department training model including the second initial parameter in the preset twin representation model, the department sample information is vectorized to obtain the department sample vector;
  • Label prediction module 03 configured to perform splicing processing on the case sample vector and the department sample vector to obtain a sample splicing vector, and input the sample splicing vector into the initial regression model, and determine that it corresponds to the medical record sample text
  • a predicted loss value determination module 04 configured to determine the predicted loss value of the preset twin representation model according to the medical record sample label and the label prediction probability;
  • a parameter update module 05 configured to update and iterate the first initial parameters of the case training model and the second initial parameters of the department training model when the predicted loss value does not reach a preset convergence condition, until the When the predicted loss value reaches the preset convergence condition, the case training model after convergence is recorded as the case characterization model, and the department training model after convergence is recorded as the department characterization model.
  • the predicted loss value determination module includes:
  • a predicted loss value determination unit configured to determine the predicted loss value through a cross-entropy loss function according to the medical record sample label and the label prediction probability; the cross-entropy loss function is:
  • Loss is the prediction loss value
  • w1 and w0 are the weights of the preset twin representation model
  • y is the label of the medical record sample
  • p is the label prediction probability.
  • the vector splicing module 30 includes:
  • the vector splicing unit is configured to obtain the medical record splicing vector after splicing the department representation vector to the back end of the case representation vector.
  • the case judgment module 40 includes:
  • a convolution pooling unit 401 configured to perform convolution pooling on the medical record splicing vector through a preset convolutional neural network in the case discrimination network model to obtain a medical record output vector;
  • the case classification unit 402 is configured to perform case classification on the medical record output vector through a preset classification network in the case discrimination network model, and determine a case judgment result corresponding to the medical record text to be verified.
  • the medical record information verification device further includes:
  • an initial parameter acquisition module for acquiring the third initial parameter of the case characterization model and the fourth initial parameter of the department characterization model
  • the initial parameter recording module is configured to record the mean value of the third initial parameter and the fourth initial parameter as the initial parameter for discrimination of the case discrimination network model.
  • the medical record information verification device further includes:
  • a verification failure prompting module is configured to determine that the text verification of the medical record to be verified fails when the diagnostic information does not match the judgment results of all the cases, and prompt that the diagnostic information has a risk of misjudgment.
  • Each module in the above-mentioned medical record information verification device may be implemented in whole or in part by software, hardware and combinations thereof.
  • the above modules can be embedded in or independent of the processor in the computer device in the form of hardware, or stored in the memory in the computer device in the form of software, so that the processor can call and execute the operations corresponding to the above modules.
  • a computer device is provided, and the computer device may be a server, and its internal structure diagram may be as shown in FIG. 8 .
  • the computer device includes a processor, memory, a network interface, and a database connected by a system bus. Among them, the processor of the computer device is used to provide computing and control capabilities.
  • the memory of the computer device includes a readable storage medium, an internal memory.
  • the readable storage medium stores an operating system, computer readable instructions and a database.
  • the internal memory provides an environment for the execution of the operating system and computer-readable instructions in the readable storage medium.
  • the database of the computer device is used to store the data used in the medical record information verification method in the above embodiment.
  • the network interface of the computer device is used to communicate with an external terminal through a network connection.
  • the computer-readable instructions when executed by the processor, implement a method for verifying medical record information.
  • the readable storage medium provided in this embodiment includes a non-volatile readable storage medium and a volatile readable storage medium.
  • a computer apparatus comprising a memory, a processor, and computer readable instructions stored in the memory and executable on the processor, wherein the processor executes the computer
  • the following steps are implemented when readable instructions:
  • the medical record text to be verified includes case information, department information associated with the case information, and diagnostic information;
  • the diagnostic information is matched with each of the case judgment results, and when the diagnostic information is successfully matched with any one of the case judgment results, it is determined that the text of the medical record to be verified is successfully verified.
  • one or more readable storage media having computer-readable instructions stored thereon, wherein the computer-readable instructions, when executed by one or more processors, cause the one or more processing The device performs the following steps:
  • the medical record text to be verified includes case information, department information associated with the case information, and diagnostic information;
  • the diagnostic information is matched with each of the case judgment results, and when the diagnostic information is successfully matched with any one of the case judgment results, it is determined that the text of the medical record to be verified is successfully verified.
  • Nonvolatile memory may include read only memory (ROM), programmable ROM (PROM), electrically programmable ROM (EPROM), electrically erasable programmable ROM (EEPROM), or flash memory.
  • Volatile memory may include random access memory (RAM) or external cache memory.
  • RAM is available in various forms such as static RAM (SRAM), dynamic RAM (DRAM), synchronous DRAM (SDRAM), double data rate SDRAM (DDRSDRAM), enhanced SDRAM (ESDRAM), synchronous chain Road (Synchlink) DRAM (SLDRAM), memory bus (Rambus) direct RAM (RDRAM), direct memory bus dynamic RAM (DRDRAM), and memory bus dynamic RAM (RDRAM), etc.

Landscapes

  • Engineering & Computer Science (AREA)
  • Health & Medical Sciences (AREA)
  • General Health & Medical Sciences (AREA)
  • Physics & Mathematics (AREA)
  • Medical Informatics (AREA)
  • Public Health (AREA)
  • Data Mining & Analysis (AREA)
  • Biomedical Technology (AREA)
  • Theoretical Computer Science (AREA)
  • Epidemiology (AREA)
  • Primary Health Care (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Computing Systems (AREA)
  • Artificial Intelligence (AREA)
  • Biophysics (AREA)
  • Computational Linguistics (AREA)
  • Evolutionary Computation (AREA)
  • Molecular Biology (AREA)
  • General Business, Economics & Management (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Mathematical Physics (AREA)
  • Software Systems (AREA)
  • Business, Economics & Management (AREA)
  • Databases & Information Systems (AREA)
  • Pathology (AREA)
  • Medical Treatment And Welfare Office Work (AREA)

Abstract

A medical record information verification method and apparatus, and a computer device and a storage medium, which relate to the technical field of detection models, and can be applied to the field of smart healthcare, thereby promoting the construction of smart cities. The method comprises: inputting, into a case representation model, case information of medical record text to be verified, so as to obtain a case representation vector; inputting department information into a department representation model, so as to obtain a department representation vector; performing splicing processing on the case representation vector and the department representation vector, so as to obtain a medical record splicing vector; inputting the medical record splicing vector into a case discrimination network model, and determining at least one case determination result corresponding to said medical record text; and matching diagnosis information with each case determination result, and when the diagnosis information successfully matches any case determination result, determining that said medical record text is successfully verified. By means of the method, the efficiency and accuracy of medical record information verification can be improved.

Description

病历信息校验方法、装置、计算机设备及存储介质Medical record information verification method, device, computer equipment and storage medium
本申请要求于2021年1月11日提交中国专利局、申请号为202110032946.6,发明名称为“病历信息校验方法、装置、计算机设备及存储介质”的中国专利申请的优先权,其全部内容通过引用结合在本申请中。This application claims the priority of the Chinese patent application filed on January 11, 2021 with the application number 202110032946.6 and the invention titled "Medical Record Information Verification Method, Device, Computer Equipment and Storage Medium", the entire content of which is approved by Reference is incorporated in this application.
技术领域technical field
本申请涉及检测模型技术领域,尤其涉及一种病历信息校验方法、装置、计算机设备及存储介质。The present application relates to the technical field of detection models, and in particular, to a medical record information verification method, device, computer equipment and storage medium.
背景技术Background technique
随着科学技术的发展,医疗体系也逐渐完善。病历质量监控是规范医疗行为的有效手段之一,发明人意识到,目前,针对于病历质量监控大多数仍然采用人工手动校验的方式,但是随着就医人群增多进而导致病历数量庞大,通过人工手动校验的方式效率较低,进而造成质量监控准确率较低的问题。With the development of science and technology, the medical system has gradually improved. Medical record quality monitoring is one of the effective means to standardize medical behavior. The inventor realized that at present, most of the medical record quality monitoring still adopts manual manual verification. The manual verification method is inefficient, which in turn causes the problem of low quality monitoring accuracy.
申请内容Application content
本申请实施例提供一种病历信息校验方法、装置、计算机设备及存储介质,以解决由于病例信息利用不全导致质量监控准确率较低的问题。The embodiments of the present application provide a medical record information verification method, device, computer equipment and storage medium, so as to solve the problem of low accuracy of quality monitoring due to incomplete utilization of case information.
一种病历信息校验方法,包括:A medical record information verification method, comprising:
获取待校验病历文本;所述待校验病历文本包含病例信息、与所述病例信息关联的科室信息以及诊断信息;Obtain the medical record text to be verified; the medical record text to be verified includes case information, department information associated with the case information, and diagnostic information;
将所述病例信息输入至病例表征模型中,得到与所述病例信息对应的病例表征向量;同时,将所述科室信息输入至科室表征模型中,得到与所述科室信息对应的科室表征向量;Inputting the case information into a case representation model to obtain a case representation vector corresponding to the case information; at the same time, inputting the department information into the department representation model to obtain a department representation vector corresponding to the department information;
对所述病例表征向量以及所述科室表征向量进行拼接处理,得到病历拼接向量;Perform splicing processing on the case representation vector and the department representation vector to obtain a medical record splicing vector;
将所述病历拼接向量输入至病例判别网络模型中,确定与所述待校验病历文本对应的至少一个病例判断结果;Inputting the medical record splicing vector into the case discrimination network model, and determining at least one case judgment result corresponding to the medical record text to be verified;
将所述诊断信息与各所述病例判断结果进行匹配,在所述诊断信息与任意一个所述病例判断结果匹配成功时,确定所述待校验病历文本校验成功。The diagnostic information is matched with each of the case judgment results, and when the diagnostic information is successfully matched with any one of the case judgment results, it is determined that the text of the medical record to be verified is successfully verified.
一种病历信息校验装置,包括:A medical record information verification device, comprising:
病历文本获取模块,用于获取待校验病历文本;所述待校验病历文本包含病例信息、与所述病例信息关联的科室信息以及诊断信息;a medical record text acquisition module, used for acquiring the medical record text to be verified; the medical record text to be verified includes case information, department information associated with the case information, and diagnosis information;
第一向量表征模块,用于将所述病例信息输入至病例表征模型中,得到与所述病例信息对应的病例表征向量;同时,将所述科室信息输入至科室表征模型中,得到与所述科室信息对应的科室表征向量;The first vector characterization module is used to input the case information into the case characterization model to obtain a case characterization vector corresponding to the case information; meanwhile, input the department information into the department characterization model to obtain the same The department representation vector corresponding to the department information;
向量拼接模块,用于对所述病例表征向量以及所述科室表征向量进行拼接处理,得到病历拼接向量;a vector splicing module for splicing the case representation vector and the department representation vector to obtain a medical record splicing vector;
病例判断模块,用于将所述病历拼接向量输入至病例判别网络模型中,确定与所述待校验病历文本对应的至少一个病例判断结果;A case judgment module, configured to input the medical record splicing vector into a case judgment network model, and determine at least one case judgment result corresponding to the medical record text to be verified;
病例匹配模块,用于将所述诊断信息与各所述病例判断结果进行匹配,在所述诊断信息与任意一个所述病例判断结果匹配成功时,确定所述待校验病历文本校验成功。The case matching module is configured to match the diagnosis information with each of the case judgment results, and when the diagnosis information and any one of the case judgment results are successfully matched, it is determined that the verification of the medical record text to be verified is successful.
一种计算机设备,包括存储器、处理器以及存储在所述存储器中并可在所述处理器上运行的计算机可读指令,其中,所述处理器执行所述计算机可读指令时实现如下步骤:A computer device comprising a memory, a processor, and computer-readable instructions stored in the memory and executable on the processor, wherein the processor implements the following steps when executing the computer-readable instructions:
获取待校验病历文本;所述待校验病历文本包含病例信息、与所述病例信息关联的科 室信息以及诊断信息;Obtain the medical record text to be verified; the medical record text to be verified includes case information, department information and diagnostic information associated with the case information;
将所述病例信息输入至病例表征模型中,得到与所述病例信息对应的病例表征向量;同时,将所述科室信息输入至科室表征模型中,得到与所述科室信息对应的科室表征向量;Inputting the case information into a case representation model to obtain a case representation vector corresponding to the case information; at the same time, inputting the department information into the department representation model to obtain a department representation vector corresponding to the department information;
对所述病例表征向量以及所述科室表征向量进行拼接处理,得到病历拼接向量;Perform splicing processing on the case representation vector and the department representation vector to obtain a medical record splicing vector;
将所述病历拼接向量输入至病例判别网络模型中,确定与所述待校验病历文本对应的至少一个病例判断结果;Inputting the medical record splicing vector into the case discrimination network model, and determining at least one case judgment result corresponding to the medical record text to be verified;
将所述诊断信息与各所述病例判断结果进行匹配,在所述诊断信息与任意一个所述病例判断结果匹配成功时,确定所述待校验病历文本校验成功。The diagnostic information is matched with each of the case judgment results, and when the diagnostic information is successfully matched with any one of the case judgment results, it is determined that the text of the medical record to be verified is successfully verified.
一个或多个存储有计算机可读指令的可读存储介质,其中,所述计算机可读指令被一个或多个处理器执行时,使得所述一个或多个处理器执行如下步骤:One or more readable storage media storing computer-readable instructions, wherein the computer-readable instructions, when executed by one or more processors, cause the one or more processors to perform the following steps:
获取待校验病历文本;所述待校验病历文本包含病例信息、与所述病例信息关联的科室信息以及诊断信息;Obtain the medical record text to be verified; the medical record text to be verified includes case information, department information associated with the case information, and diagnostic information;
将所述病例信息输入至病例表征模型中,得到与所述病例信息对应的病例表征向量;同时,将所述科室信息输入至科室表征模型中,得到与所述科室信息对应的科室表征向量;Inputting the case information into a case representation model to obtain a case representation vector corresponding to the case information; at the same time, inputting the department information into the department representation model to obtain a department representation vector corresponding to the department information;
对所述病例表征向量以及所述科室表征向量进行拼接处理,得到病历拼接向量;Perform splicing processing on the case representation vector and the department representation vector to obtain a medical record splicing vector;
将所述病历拼接向量输入至病例判别网络模型中,确定与所述待校验病历文本对应的至少一个病例判断结果;Inputting the medical record splicing vector into the case discrimination network model, and determining at least one case judgment result corresponding to the medical record text to be verified;
将所述诊断信息与各所述病例判断结果进行匹配,在所述诊断信息与任意一个所述病例判断结果匹配成功时,确定所述待校验病历文本校验成功。The diagnostic information is matched with each of the case judgment results, and when the diagnostic information is successfully matched with any one of the case judgment results, it is determined that the text of the medical record to be verified is successfully verified.
上述病历信息校验方法、装置、计算机设备及存储介质,该方法通过获取待校验病历文本;所述待校验病历文本包含病例信息、与所述病例信息关联的科室信息以及诊断信息;将所述病例信息输入至病例表征模型中,得到与所述病例信息对应的病例表征向量;同时,将所述科室信息输入至科室表征模型中,得到与所述科室信息对应的科室表征向量;对所述病例表征向量以及所述科室表征向量进行拼接处理,得到病历拼接向量;将所述病历拼接向量输入至病例判别网络模型中,确定与所述待校验病历文本对应的至少一个病例判断结果;将所述诊断信息与各所述病例判断结果进行匹配,在所述诊断信息与任意一个所述病例判断结果匹配成功时,确定所述待校验病历文本校验成功。The above-mentioned medical record information verification method, device, computer equipment and storage medium, the method obtains the medical record text to be verified; the medical record text to be verified includes case information, department information and diagnosis information associated with the case information; The case information is input into the case representation model, and a case representation vector corresponding to the case information is obtained; at the same time, the department information is input into the department representation model to obtain a department representation vector corresponding to the department information; The case characterization vector and the department characterization vector are spliced to obtain a medical record splicing vector; the medical record splicing vector is input into the case discrimination network model, and at least one case judgment result corresponding to the medical record text to be verified is determined. ; Match the diagnostic information with each of the case judgment results, and when the diagnostic information and any one of the case judgment results are successfully matched, determine that the text of the medical record to be verified is successfully verified.
本申请通过引入病例信息以及科室信息,通过病例表征模型以及科室表征模型学习了病例信息与科室信息之间的关联性,使得病例判别网络模型根据病例信息以及科室信息进行预测后输出的病例判断结果具有更高的准确性,并且提高了病历信息校验监控的效率。By introducing case information and department information, the application learns the correlation between case information and department information through the case representation model and department representation model, so that the case discrimination network model predicts and outputs the case discrimination result based on the case information and department information. It has higher accuracy and improves the efficiency of medical record information verification and monitoring.
本申请的一个或多个实施例的细节在下面的附图和描述中提出,本申请的其他特征和优点将从说明书、附图以及权利要求变得明显。The details of one or more embodiments of the application are set forth in the accompanying drawings and the description below, and other features and advantages of the application will become apparent from the description, drawings, and claims.
附图说明Description of drawings
为了更清楚地说明本申请实施例的技术方案,下面将对本申请实施例的描述中所需要使用的附图作简单地介绍,显而易见地,下面描述中的附图仅仅是本申请的一些实施例,对于本领域普通技术人员来讲,在不付出创造性劳动性的前提下,还可以根据这些附图获得其他的附图。In order to illustrate the technical solutions of the embodiments of the present application more clearly, the following briefly introduces the drawings that are used in the description of the embodiments of the present application. Obviously, the drawings in the following description are only some embodiments of the present application. , for those of ordinary skill in the art, other drawings can also be obtained from these drawings without creative labor.
图1是本申请一实施例中病历信息校验方法的一应用环境示意图;1 is a schematic diagram of an application environment of a method for verifying medical record information in an embodiment of the present application;
图2是本申请一实施例中病历信息校验方法的一流程图;2 is a flowchart of a method for verifying medical record information in an embodiment of the present application;
图3是本申请一实施例中病历信息校验方法的另一流程图;3 is another flowchart of a method for verifying medical record information in an embodiment of the present application;
图4是本申请一实施例中病历信息校验方法中步骤S40的一流程图;4 is a flowchart of step S40 in the method for verifying medical record information in an embodiment of the present application;
图5是本申请一实施例中病历信息校验装置的一原理框图;5 is a schematic block diagram of a medical record information verification device in an embodiment of the present application;
图6是本申请一实施例中病历信息校验装置的另一原理框图;FIG. 6 is another principle block diagram of the device for verifying medical record information in an embodiment of the present application;
图7是本申请一实施例中病历信息校验装置中病例判断模块的一原理框图;7 is a schematic block diagram of a case judgment module in a medical record information verification device in an embodiment of the present application;
图8是本申请一实施例中计算机设备的一示意图。FIG. 8 is a schematic diagram of a computer device in an embodiment of the present application.
具体实施方式Detailed ways
下面将结合本申请实施例中的附图,对本申请实施例中的技术方案进行清楚、完整地描述,显然,所描述的实施例是本申请一部分实施例,而不是全部的实施例。基于本申请中的实施例,本领域普通技术人员在没有作出创造性劳动前提下所获得的所有其他实施例,都属于本申请保护的范围。The technical solutions in the embodiments of the present application will be clearly and completely described below with reference to the accompanying drawings in the embodiments of the present application. Obviously, the described embodiments are part of the embodiments of the present application, not all of the embodiments. Based on the embodiments in the present application, all other embodiments obtained by those of ordinary skill in the art without creative work fall within the protection scope of the present application.
本申请实施例提供的病历信息校验方法,该病历信息校验方法可应用如图1所示的应用环境中。具体地,该病历信息校验方法应用在病历信息校验系统中,该病历信息校验系统包括如图1所示的客户端和服务器,客户端与服务器通过网络进行通信,用于解决由于病例信息利用不全导致质量监控准确率较低的问题。其中,客户端又称为用户端,是指与服务器相对应,为客户提供本地服务的程序。客户端可安装在但不限于各种个人计算机、笔记本电脑、智能手机、平板电脑和便携式可穿戴设备上。服务器可以用独立的服务器或者是多个服务器组成的服务器集群来实现。The medical record information verification method provided by the embodiment of the present application can be applied in the application environment shown in FIG. 1 . Specifically, the medical record information verification method is applied in a medical record information verification system. The medical record information verification system includes a client and a server as shown in FIG. Incomplete utilization of information leads to the problem of low accuracy of quality control. Among them, the client, also known as the client, refers to the program corresponding to the server and providing local services for the client. Clients can be installed on, but not limited to, various personal computers, laptops, smartphones, tablets, and portable wearable devices. The server can be implemented as an independent server or a server cluster composed of multiple servers.
在一实施例中,如图2所示,提供一种病历信息校验方法,以该方法应用在图1中的服务器为例进行说明,包括如下步骤:In one embodiment, as shown in FIG. 2 , a method for verifying medical record information is provided, and the method is applied to the server in FIG. 1 as an example for description, including the following steps:
S10:获取待校验病历文本;所述待校验病历文本包含病例信息、与所述病例信息关联的科室信息以及诊断信息;S10: Obtain the medical record text to be verified; the medical record text to be verified includes case information, department information associated with the case information, and diagnosis information;
可以理解地,待校验病历文本指的是等待校验的历史病历文本,该待校验病历文本中包含病例信息,如患者的基本信息(如姓名、性别、检测日期等)、症状信息(如主诉症状、检测信息等),与病例信息关联的科室信息(如针对于咳嗽喉咙痛症状对应的呼吸科科室即为科室信息)以及诊断信息(如医生对于咳嗽喉咙痛症状判定为喉咙发炎即为诊断信息)。Understandably, the medical record text to be verified refers to the historical medical record text waiting to be verified, and the medical record text to be verified contains case information, such as the patient's basic information (such as name, gender, test date, etc.), symptom information ( Such as chief complaint symptoms, test information, etc.), department information related to case information (such as the respiratory department corresponding to the symptoms of cough and sore throat is department information) and diagnostic information (such as the doctor's judgment for the symptoms of cough and sore throat as throat inflammation for diagnostic information).
S20:将所述病例信息输入至病例表征模型中,得到与所述病例信息对应的病例表征向量;同时,将所述科室信息输入至科室表征模型中,得到与所述科室信息对应的科室表征向量;S20: Input the case information into a case representation model to obtain a case representation vector corresponding to the case information; at the same time, input the department information into the department representation model to obtain a department representation corresponding to the department information vector;
其中,病例表征模型以及科室表征模型均是基于卷积神经网络模型构建的,病例表征模型用于将病例信息转换成病例表征向量,科室表征模型用于将科室信息转换成科室表征向量。The case representation model and the department representation model are both constructed based on the convolutional neural network model. The case representation model is used to convert case information into case representation vectors, and the department representation model is used to convert department information into department representation vectors.
具体地,在获取待校验病历文本之后,将待校验病历文本中的病例信息输入至病例表征模型中,对所述病例信息进行卷积池化等处理,得到与病例信息对应的病例表征向量;同时,将待校验病历文本中的科室信息输入至科室表征模型中,对科室信息进行卷积池化等处理,得到与科室信息对应的科室表征向量。Specifically, after obtaining the medical record text to be verified, the case information in the medical record text to be verified is input into the case representation model, and the case information is processed by convolution pooling, etc., to obtain the case representation corresponding to the case information. At the same time, the department information in the medical record text to be verified is input into the department characterization model, and the department information is processed by convolution pooling, etc., to obtain the department characterization vector corresponding to the department information.
作为优选,在将病例信息输入病例表征模型之前,可以对病例信息进行预处理,示例性地,假设病例信息为“我大概在三天前开始一直在咳嗽”,则将该病例信息裁剪处理成“咳嗽三天”较短的句对形式,也即在保证病例信息中重要信息不改变的情况下,缩减病例信息的文本长度,进而在将病例信息输入至病例表征模型时,可以缩短模型向量转换的时间,提高病历信息校验效率;同理,在将科室信息输入至科室表征模型之前,也可以对科室信息进行预处理,示例性地,假设科室信息为“广东省深圳市第三人民医院呼吸科”,则将科室信息裁剪处理成“呼吸科”较短的句对形式。Preferably, before inputting the case information into the case representation model, the case information may be preprocessed. For example, assuming that the case information is "I started coughing about three days ago", the case information is trimmed into The shorter sentence pair form of "cough for three days" means that the text length of the case information can be shortened while the important information in the case information is not changed, and then the model vector can be shortened when the case information is input into the case representation model. The conversion time improves the efficiency of medical record information verification; similarly, before inputting the department information into the department representation model, the department information can also be preprocessed. Hospital Respiratory Department”, the department information is cut into a shorter sentence pair form of “Respiratory Department”.
在一具体实施例中,如图3所示,步骤S20之前,还包括:In a specific embodiment, as shown in FIG. 3, before step S20, it further includes:
S01:获取预设病历样本文本集;所述预设病历样本文本集中包含至少一个病历样本文本;所述病历样本文本包含病例样本信息以及与所述病例样本信息对应的科室样本信息;一个所述病历样本文本关联一个病历样本标签;S01: Obtain a preset medical record sample text set; the preset medical record sample text set includes at least one medical record sample text; the medical record sample text includes case sample information and department sample information corresponding to the case sample information; The medical record sample text is associated with a medical record sample label;
可以理解地,病历样本文本可以通过爬取病历信息文本库得到,该病历样本文本中包 含病例样本信息,如患者的基本信息(如姓名、性别、检测日期等)、症状信息(如主诉症状、检测信息等),与病例样本信息对应的科室样本信息(如针对于咳嗽喉咙痛症状对应的呼吸科科室即为科室信息)。Understandably, the medical record sample text can be obtained by crawling the medical record information text library, and the medical record sample text contains case sample information, such as the basic information of the patient (such as name, gender, test date, etc.), symptom information (such as chief complaint symptoms, test information, etc.), and the department sample information corresponding to the case sample information (for example, the respiratory department corresponding to the symptoms of cough and sore throat is the department information).
进一步地,一个病历样本文本关联一个病历样本标签,该病历样本标签根据病例样本信息以及科室样本信息确定,病历样本标签包括正病历样本标签以及负病历样本标签;可以理解地,在病历样本文本中病例样本信息与科室样本信息是相互匹配的,则该病历样本文本关联的病历样本标签为正病历样本标签;在病历样本文本中病例样本信息与科室样本信息是不匹配的,则该病历样本文本关联的病历样本标签为负病历样本标签。示例性地,病例样本信息为“咳嗽3天”,若科室样本信息为“呼吸科”,则该病历样本文本为正病历样本文本,病历样本标签为正病历样本标签;若科室样本信息为“精神科”,则该病历样本文本为负病历样本文本,病历样本标签为负病历样本标签。进而通过不同的正病历样本文本以及负病历样本文本对步骤S02中的预设孪生表征模型的病例训练模型,以及科室训练模型进行训练,可以使得病例训练模型以及科室训练模型达到更好的训练效果,可以区分病例样本信息是否与科室样本信息匹配。Further, a medical record sample text is associated with a medical record sample label, the medical record sample label is determined according to the case sample information and the department sample information, and the medical record sample label includes a positive medical record sample label and a negative medical record sample label; It is understandable that in the medical record sample text If the case sample information and department sample information match each other, the medical record sample label associated with the medical record sample text is the positive medical record sample label; if the case sample information does not match the department sample information in the medical record sample text, the medical record sample text The associated medical record sample labels are negative medical record sample labels. Exemplarily, the case sample information is "cough for 3 days", if the department sample information is "respiratory department", the medical record sample text is the positive medical record sample text, and the medical record sample label is the positive medical record sample label; if the department sample information is " Psychiatry", the medical record sample text is the negative medical record sample text, and the medical record sample label is the negative medical record sample label. Then, the case training model and the department training model of the preset twin representation model in step S02 are trained through different positive medical record sample texts and negative medical record sample texts, so that the case training model and the department training model can achieve better training effects. , you can distinguish whether the case sample information matches the department sample information.
S02:将所述病历样本文本输入至预设孪生表征模型中,通过所述预设孪生表征模型中包含第一初始参数的病例训练模型对所述病例样本信息进行向量表征,得到得到病例样本向量;同时,通过所述预设孪生表征模型中包含第二初始参数的科室训练模型对所述科室样本信息进行向量表征,得到科室样本向量;S02: Input the medical record sample text into a preset twin representation model, and perform vector representation on the case sample information through a case training model that includes a first initial parameter in the preset twin representation model, to obtain a case sample vector At the same time, vector characterization is carried out on the sample information of the department through the department training model including the second initial parameter in the preset twin characterization model to obtain the department sample vector;
可以理解地,预设孪生表征模型用于学习病例样本信息以及科室样本信息的表征,该预设孪生表征模型中包含病例训练模型以及科室训练模型,病例训练模型以及科室训练模型均是基于卷积神经网络模型构建生成的。Understandably, the preset twin representation model is used to learn the representation of case sample information and department sample information. The preset twin representation model includes a case training model and a department training model. Both the case training model and the department training model are based on convolution. Generated by neural network model building.
进一步地,在获取预设病历样本文本集之后,将所述病历样本文本输入至预设孪生表征模型中,通过所述预设孪生表征模型中包含第一初始参数的病例训练模型对所述病例样本信息进行向量表征,也即对病例样本信息进行卷积池化等处理,得到得到病例样本向量;同时,通过包含所述预设孪生表征模型中第二初始参数的科室训练模型对所述科室样本信息进行向量表征,也即对科室样本信息进行卷积池化等处理,得到科室样本向量。Further, after obtaining the preset medical record sample text set, the medical record sample text is input into the preset twin representation model, and the case is trained by the case training model including the first initial parameter in the preset twin representation model. The sample information is represented by a vector, that is, the case sample information is subjected to convolution pooling and other processing to obtain a case sample vector; at the same time, the department is trained by the department training model that includes the second initial parameter in the preset twin representation model. The sample information is represented by a vector, that is, the sample information of the department is processed by convolution and pooling, and the sample vector of the department is obtained.
进一步地,若仅采用科室样本信息进行模型训练,也即不采用病例样本信息和科室样本信息进行模型训练,则会由于科室样本信息名称太短且不具有丰富的语义信息,导致模型无法学习到能够将各个科室样本信息进行区分的能力,因此本实施例中通过病例样本信息以及科室样本信息进行模型训练,使得科室训练模型也可以学习到包含丰富病例样本信息的语义信息的科室信息表征。Further, if only the department sample information is used for model training, that is, the case sample information and department sample information are not used for model training, the model cannot learn because the department sample information name is too short and does not have rich semantic information. The ability to distinguish the sample information of each department, so in this embodiment, the model training is performed through the case sample information and the department sample information, so that the department training model can also learn the department information representation that contains the semantic information rich in the case sample information.
S03:对所述病例样本向量以及所述科室样本向量进行拼接处理,得到样本拼接向量,并将所述样本拼接向量输入至初始回归模型中,确定与所述病历样本文本对应的标签预测概率;S03: Perform splicing processing on the case sample vector and the department sample vector to obtain a sample splicing vector, and input the sample splicing vector into an initial regression model to determine the label prediction probability corresponding to the medical record sample text;
具体地,在将所述病历样本文本输入至预设孪生表征模型中,通过所述预设孪生表征模型中包含第一初始参数的病例训练模型对所述病例样本信息进行向量表征,得到得到病例样本向量;同时,通过包含所述预设孪生表征模型中第二初始参数的科室训练模型对所述科室样本信息进行向量表征,得到科室样本向量之后,将所述科室样本向量拼接至所述病例样本向量的后端,得到样本拼接向量,并将样本拼接向量输入至初始回归模型中,确定与样本拼接向量对应的标签预测概率,也即判断科室样本向量是否与病例样本向量相匹配。Specifically, when the medical record sample text is input into a preset twin representation model, the case sample information is represented by a vector through a case training model that includes the first initial parameter in the preset twin representation model, and a case is obtained. sample vector; at the same time, vector representation is performed on the department sample information through the department training model including the second initial parameter in the preset twin representation model, and after the department sample vector is obtained, the department sample vector is spliced to the case At the back end of the sample vector, the sample splicing vector is obtained, and the sample splicing vector is input into the initial regression model to determine the label prediction probability corresponding to the sample splicing vector, that is, to determine whether the department sample vector matches the case sample vector.
S04:根据所述病历样本标签以及所述标签预测概率确定所述预设孪生表征模型的预测损失值;S04: Determine the predicted loss value of the preset twin representation model according to the medical record sample label and the label prediction probability;
具体地,在对所述病例样本向量以及所述科室样本向量进行拼接处理,得到样本拼接向量,并将所述样本拼接向量输入至初始回归模型中,确定与所述病历样本文本对应的标 签预测概率之后,根据病历样本标签以及标签预测概率,通过交叉熵损失函数确定所述预测损失值;所述交叉熵损失函数为:Specifically, after splicing the case sample vector and the department sample vector, a sample splicing vector is obtained, and the sample splicing vector is input into the initial regression model, and the label prediction corresponding to the medical record sample text is determined. After the probability, according to the medical record sample label and the label prediction probability, the predicted loss value is determined by the cross-entropy loss function; the cross-entropy loss function is:
Loss=w1*y*log(p)+w0*(1-y)*log(1-p)Loss=w1*y*log(p)+w0*(1-y)*log(1-p)
其中,Loss为所述预测损失值;w1以及w0为所述预设孪生表征模型的权重;y为所述病历样本标签;p为所述标签预测概率。Wherein, Loss is the prediction loss value; w1 and w0 are the weights of the preset twin representation model; y is the label of the medical record sample; p is the label prediction probability.
可以理解地,在步骤S01指出,病历样本文本包含正病历样本文本以及负病历样本文本,当病历样本文本为正病历样本文本时,其关联的病历样本标签为正病历样本标签,该正病历样本标签的标签值为1;当病历样本文本为负病历样本文本时,其关联的病历样本标签为负病历样本标签,该负病历样本标签的标签值为0;因此当输入至预设孪生表征模型的病历样本文本为正病历样本文本,根据上述交叉熵损失函数可知,y为1,p表征预测科室样本信息与病例样本信息匹配的概率;当输入至预设孪生表征模型的病历样本文本为负病历样本文本,根据上述交叉熵损失函数可知,y为0,1-p表征预测科室样本信息与病例样本信息不匹配的概率。Understandably, it is pointed out in step S01 that the medical record sample text includes positive medical record sample text and negative medical record sample text. When the medical record sample text is a positive medical record sample text, the associated medical record sample label is a positive medical record sample label. The label value of the label is 1; when the medical record sample text is the negative medical record sample text, the associated medical record sample label is the negative medical record sample label, and the label value of the negative medical record sample label is 0; therefore, when input to the preset twin representation model The medical record sample text is positive medical record sample text. According to the above cross-entropy loss function, y is 1, and p represents the probability that the predicted department sample information matches the case sample information; when the medical record sample text input to the preset twin representation model is negative For the medical record sample text, according to the above cross-entropy loss function, y is 0, and 1-p represents the probability that the predicted department sample information does not match the case sample information.
进一步地,上述交叉熵损失函数中的w1以及w0为权重值,可以理解地,w1是为了给将正病历样本文本预测成负病历样本文本(也即将科室样本信息与病例样本信息相匹配,预测成科室样本信息与病例样本信息不匹配)的预测损失函数更大的损失回转,w0是为了给将负病历样本文本预测成正病历样本文本(也即将科室样本信息与病例样本信息不匹配,预测成科室样本信息与病例样本信息匹配)的预测损失函数更小的损失回转,如此可以提高预设孪生表征模型的召回率,提高预设孪生表征模型的泛化能力,防止在步骤S20中,得到的病例表征向量以及科室表征向量被过滤掉过多的重要信息。Further, w1 and w0 in the above-mentioned cross-entropy loss function are weight values. It is understandable that w1 is used to predict the positive medical record sample text into the negative medical record sample text (that is, the department sample information is matched with the case sample information, predicting The prediction loss function of the department sample information does not match the case sample information) has a larger loss of rotation, w0 is to predict the negative medical record sample text into the positive medical record sample text (that is, the department sample information does not match the case sample information, the prediction is The prediction loss function of the department sample information and the case sample information match) is smaller and the loss is reversed, so that the recall rate of the preset twin representation model can be improved, the generalization ability of the preset twin representation model can be improved, and the result obtained in step S20 can be prevented. Case representation vectors and department representation vectors are filtered out of too much important information.
S05:在所述预测损失值未达到预设的收敛条件时,更新迭代所述病例训练模型的第一初始参数,以及所述科室训练模型的第二初始参数,直至所述预测损失值达到所述预设的收敛条件时,将收敛之后的所述病例训练模型记录为所述病例表征模型,将收敛之后的所述科室训练模型记录为所述科室表征模型。S05: When the predicted loss value does not reach the preset convergence condition, update and iterate the first initial parameter of the case training model and the second initial parameter of the department training model, until the predicted loss value reaches the predetermined value. When the preset convergence conditions are met, the case training model after convergence is recorded as the case characterization model, and the department training model after convergence is recorded as the department characterization model.
可以理解地,该收敛条件可以为预测损失值小于设定阈值的条件,也即在预测损失值小于设定阈值时,停止训练;收敛条件还可以为预测损失值经过了10000次计算后值为很小且不会再下降的条件,也即预测损失值经过10000次计算后值很小且不会下降时,停止训练,,将收敛之后的所述病例训练模型记录为所述病例表征模型,将收敛之后的所述科室训练模型记录为所述科室表征模型。Understandably, the convergence condition can be the condition that the predicted loss value is less than the set threshold, that is, when the predicted loss value is less than the set threshold, the training is stopped; the convergence condition can also be that the predicted loss value after 10,000 calculations is The condition is very small and will not decrease again, that is, when the predicted loss value is small and will not decrease after 10,000 calculations, stop training, and record the case training model after convergence as the case representation model, The department training model after convergence is recorded as the department characterization model.
进一步地,根据与病例样本文本对应的所述病历样本标签以及所述标签预测概率确定所述预设孪生表征模型的预测损失值之后,在预测损失值未达到预设的收敛条件时,根据该预测损失值调整病例训练模型的第一初始参数,以及科室训练模型的第二初始参数,并将该病例样本文本重新输入至调整第一初始参数以及第二初始参数后的预设孪生表征模型中,以在该病历样本文本对应的预测损失值达到预设的收敛条件时,选取预设病历样本文本集中另一个病历样本文本,并执行上述步骤S01至S04,并得到与该病历样本文本对应的预测损失值,并在该预测损失值未达到预设的收敛条件时,根据该预测损失值再次调整病例训练模型的第一初始参数,以及科室训练模型的第二初始参数,使得该病历样本文本对应的预测损失值达到预设的收敛条件。Further, after determining the predicted loss value of the preset twin representation model according to the medical record sample label corresponding to the case sample text and the label prediction probability, when the predicted loss value does not reach the preset convergence condition, according to the The predicted loss value adjusts the first initial parameter of the case training model and the second initial parameter of the department training model, and re-inputs the case sample text into the preset twin representation model after adjusting the first and second initial parameters , so that when the predicted loss value corresponding to the medical record sample text reaches the preset convergence condition, select another medical record sample text in the preset medical record sample text set, and execute the above steps S01 to S04, and obtain the corresponding medical record sample text. Predict the loss value, and when the predicted loss value does not reach the preset convergence condition, adjust the first initial parameter of the case training model and the second initial parameter of the department training model again according to the predicted loss value, so that the medical record sample text The corresponding prediction loss value reaches the preset convergence condition.
如此,在通过预设病历样本文本集中所有病历样本文本对预设孪生表征模型进行训练之后,使得预设孪生表征模型输出的结果可以不断向准确地结果靠拢,让识别准确率越来 越高,直至所有病历样本文本对应的预测损失值均达到预设的收敛条件时,将收敛之后的所述病例训练模型记录为所述病例表征模型,将收敛之后的所述科室训练模型记录为所述科室表征模型。In this way, after the preset twin representation model is trained through all the medical record sample texts in the preset medical record sample text set, the output results of the preset twin representation model can continue to be closer to the accurate results, so that the recognition accuracy is getting higher and higher. Until the predicted loss values corresponding to all the medical record sample texts reach the preset convergence condition, the case training model after convergence is recorded as the case representation model, and the department training model after convergence is recorded as the department Representation model.
S30:对所述病例表征向量以及所述科室表征向量进行拼接处理,得到病历拼接向量;S30: Perform splicing processing on the case representation vector and the department representation vector to obtain a medical record splicing vector;
具体地,在将所述病例信息输入至病例表征模型中,得到与所述病例信息对应的病例表征向量;同时,将所述科室信息输入至科室表征模型中,得到与所述科室信息对应的科室表征向量之后,将科室表征向量拼接至病例表征向量的后端,得到病历拼接向量。Specifically, the case information is input into the case representation model, and the case representation vector corresponding to the case information is obtained; at the same time, the department information is input into the department representation model, and the corresponding department information is obtained. After the department representation vector, the department representation vector is spliced to the back end of the case representation vector to obtain the medical record splicing vector.
S40:将所述病历拼接向量输入至病例判别网络模型中,确定与所述待校验病历文本对应的至少一个病例判断结果;S40: Input the medical record splicing vector into the case discrimination network model, and determine at least one case judgment result corresponding to the medical record text to be verified;
可以理解地,病例判别网络模型用于根据病历拼接向量(也即病例信息以及科室信息)判定与待校验病历对应的病例判断结果。在对所述病例表征向量以及所述科室表征向量进行拼接处理,得到病历拼接向量之后,将病历拼接向量输入至病例判别网络模型中,以根据病历拼接向量对待校验病历进行诊断预测,进而确定与待校验病历文本对应的至少一个病例判断结果。可以理解地,针对于病例信息以及科室信息,可以包含一个或者多个不同的病例判断结果。Understandably, the case discrimination network model is used to determine the case determination result corresponding to the medical record to be verified according to the medical record splicing vector (ie, case information and department information). After splicing the case characterization vector and the department characterization vector to obtain the medical record splicing vector, the medical record splicing vector is input into the case discrimination network model, so as to diagnose and predict the medical records to be verified according to the medical record splicing vector, and then determine At least one case judgment result corresponding to the medical record text to be verified. Understandably, for case information and department information, one or more different case judgment results may be included.
进一步地,在对所述病例表征向量以及所述科室表征向量进行拼接处理,得到病历拼接向量之后,将所述病历拼接向量输入至病例判别网络模型中,经过对病历拼接向量进行卷积池化分类等处理后,得到与待校验病历文本对一个的至少一个病例判断结果,其中一个病例判断结果还关联一个判断概率,也即根据待校验病历文本中的病例信息以及科室信息,可以确定对应的诊断信息为病例判断结果的概率即为判断概率。Further, after splicing the case representation vector and the department representation vector to obtain the medical record splicing vector, the medical record splicing vector is input into the case discrimination network model, and the medical record splicing vector is subjected to convolution pooling. After classification and other processing, at least one case judgment result corresponding to the medical record text to be verified is obtained, and one case judgment result is also associated with a judgment probability, that is, according to the case information and department information in the medical record text to be verified, it can be determined. The probability that the corresponding diagnostic information is a case judgment result is the judgment probability.
在一实施例中,如图4所示,步骤S40中,包括:In one embodiment, as shown in FIG. 4 , step S40 includes:
S401:通过所述病例判别网络模型中的预设卷积神经网络对所述病历拼接向量进行卷积池化处理,得到病历输出向量;S401: Perform convolution pooling on the medical record splicing vector through a preset convolutional neural network in the case discrimination network model to obtain a medical record output vector;
具体地,在对所述病例表征向量以及所述科室表征向量进行拼接处理,得到病历拼接向量之后,通过病例判别网络模型中的预设卷积神经网络对病历拼接向量进行卷积池化处理,得到病历输出向量。可选地,预设卷积神经网络可以为TextCNN网络(文本分类卷积神经网络)。Specifically, after splicing the case representation vector and the department representation vector to obtain the medical record splicing vector, the medical record splicing vector is subjected to convolution pooling processing through the preset convolutional neural network in the case discrimination network model, Get the medical record output vector. Optionally, the preset convolutional neural network may be a TextCNN network (text classification convolutional neural network).
进一步地,在将所述病历拼接向量输入至病例判别网络模型中之前,还包括:Further, before the medical record splicing vector is input into the case discrimination network model, it also includes:
获取所述病例表征模型的第三初始参数,以及所述科室表征模型的第四初始参数;obtaining the third initial parameter of the case characterization model, and the fourth initial parameter of the department characterization model;
其中,第三初始参数指的是经过步骤S01‐S05对病例训练模型训练完成之后得到的病例表征模型的参数,可以理解地,病例训练模型的模型参数为第一初始参数,在训练完成得到病例表征模型的模型参数更新为第三初始参数。同理,第四初始参数指的是经过步骤S01‐S05对科室训练模型训练完成之后得到的科室表征模型的参数,可以理解地,科室训练模型的模型参数为第二初始参数,在训练完成得到科室表征模型的模型参数更新为第四初始参数。Among them, the third initial parameter refers to the parameters of the case representation model obtained after the training of the case training model in steps S01-S05 is completed. It is understandable that the model parameters of the case training model are the first initial parameters, and the case is obtained after the training is completed. The model parameters characterizing the model are updated to the third initial parameters. Similarly, the fourth initial parameter refers to the parameters of the department representation model obtained after the training of the department training model in steps S01-S05 is completed. It is understandable that the model parameters of the department training model are the second initial parameters, which are obtained after the training is completed. The model parameters of the department characterization model are updated to the fourth initial parameters.
将所述第三初始参数与所述第四初始参数的均值记录为所述预设卷积神经网络的判别初始参数。The average value of the third initial parameter and the fourth initial parameter is recorded as the discriminative initial parameter of the preset convolutional neural network.
可以理解地,与随机初始化参数相比,采用第三初始参数与第四初始参数的均值作为预设卷积神经网路的判别初始参数,一方面可以给预设卷积神经网络一个较好的初始参数分布空间,另一方面加快了对病例判别网络模型的训练。进一步地,在将所述病历拼接向量输入至病例判别网络模型中之前,可以通过预设训练样本(如步骤S01中的正病历样本文本)对病例判别网络模型进行训练,使得病例判别网络模型可以学习正病历样本文本中的病例样本信息以及科室样本信息,并根据病例样本信息以及科室样本信息预测出更加准确的病例判断结果;可以理解地,正病历样本文本中包含的正诊断信息可以说视为是正确的信息,进而使得病例判别网络模型输出的病例判断结果接近甚至于正诊断信息相同。Understandably, compared with random initialization parameters, using the average value of the third initial parameter and the fourth initial parameter as the initial discriminant parameter of the preset convolutional neural network can give the preset convolutional neural network a better value. The initial parameter distribution space, on the other hand, speeds up the training of the case discrimination network model. Further, before the medical record splicing vector is input into the case discrimination network model, the case discrimination network model can be trained by a preset training sample (such as the positive medical record sample text in step S01), so that the case discrimination network model can be trained. Learn the case sample information and department sample information in the positive medical record sample text, and predict more accurate case judgment results based on the case sample information and department sample information; understandably, the positive diagnosis information contained in the positive medical record sample text can be regarded as In order to be correct information, the case judgment results output by the case discrimination network model are close to or even the same as the positive diagnosis information.
S402:通过所述病例判别网络模型中的预设分类网络对所述病历输出向量进行病例分类,确定与所述待校验病历文本对应的病例判断结果。S402: Perform case classification on the medical record output vector through a preset classification network in the case discrimination network model, and determine a case judgment result corresponding to the medical record text to be verified.
具体地,在通过所述病例判别网络模型中的预设卷积神经网络对所述病历拼接向量进行卷积池化处理,得到病历输出向量之后,通过病例判别网络模型中的预设分类网络对所述病历输出向量进行病例分类,确定与所述待校验病历文本对应的病例判断结果。可选地,预设分类网络为病例判别网络模型中的softmax层。Specifically, after the medical record splicing vector is subjected to convolution pooling processing by the preset convolutional neural network in the case discrimination network model to obtain the medical record output vector, the predetermined classification network in the case discrimination network model is used to perform convolution and pooling processing. The medical record output vector is used for case classification, and a case judgment result corresponding to the medical record text to be verified is determined. Optionally, the preset classification network is the softmax layer in the case discrimination network model.
S50:将所述诊断信息与各所述病例判断结果进行匹配,在所述诊断信息与任意一个所述病例判断结果匹配成功时,确定所述待校验病历文本校验成功。S50: Match the diagnostic information with each of the case judgment results, and when the diagnostic information and any one of the case judgment results are successfully matched, determine that the text of the medical record to be verified is successfully verified.
具体地,在将所述病历拼接向量输入至病例判别网络模型中,确定与所述待校验病历文本对应的至少一个病例判断结果之后,将诊断信息与各病例判断结果进行匹配,示例性地,如通过确定诊断信息与各病例判断结果之间的相似度,亦或者通过正则表达式对诊断信息与各病例判断结果之间进行字符匹配,进而在诊断信息与任意一个病例判断结果匹配成功时,如诊断信息与病例判断结果之间的相似度大于预设相似阈值(如95%),亦或者诊断信息与病例判断结果之间字符匹配度达到95%以上,确定待校验病历文本校验成功,也即确定待校验病历文本中诊断信息正确。Specifically, after inputting the medical record splicing vector into the case discrimination network model, after determining at least one case determination result corresponding to the medical record text to be verified, the diagnostic information is matched with each case determination result, exemplarily , for example, by determining the similarity between the diagnosis information and the judgment results of each case, or by performing character matching between the diagnosis information and the judgment results of each case through regular expressions, and then when the diagnosis information and any case judgment result are successfully matched , if the similarity between the diagnostic information and the case judgment result is greater than the preset similarity threshold (such as 95%), or the character matching between the diagnostic information and the case judgment result reaches more than 95%, it is determined that the text of the medical record to be verified is verified. Success, that is, it is determined that the diagnostic information in the medical record text to be verified is correct.
进一步地,在步骤S40中指出,一个病例判断结果还关联一个判断概率,因此在将所述病历拼接向量输入至病例判别网络模型中,确定与所述待校验病历文本对应的至少一个病例判断结果之后,将各病例判断结果按照判断概率从大到小的顺序插入病例判断序列中;自病例判断序列中第一个病例判断结果起,将各病例判断结果与诊断信息进行比较;在诊断信息与任意一个病例判断结果匹配成功时,将该病例判断结果记录为待确认判断结果;在该待确认判断结果不为病例判断序列中处于第一位置的病例判断结果时,也即该待确认判断结果对应的判断概率不是最大的,进而将病例判断序列中处于该待确认判断结果之前的所有病例判断结果发送至预设接收方,以令所述预设接收方判断待校验病历是否校验成功。其中,预设接收方可以为病历管理人员或者病历检验人员。Further, in step S40, it is pointed out that a case judgment result is also associated with a judgment probability, so in inputting the medical record splicing vector into the case judgment network model, at least one case judgment corresponding to the medical record text to be verified is determined. After the results, insert the judgment results of each case into the case judgment sequence in descending order of judgment probability; from the first case judgment result in the case judgment sequence, compare the judgment results of each case with the diagnostic information; When it is successfully matched with any case judgment result, the case judgment result is recorded as the judgment result to be confirmed; when the judgment result to be confirmed is not the case judgment result in the first position in the case judgment sequence, that is, the judgment result to be confirmed The judgment probability corresponding to the result is not the largest, and then all the case judgment results in the case judgment sequence before the judgment result to be confirmed are sent to the preset receiver, so that the preset receiver can judge whether the medical record to be verified is verified. success. The preset recipient may be a medical record manager or a medical record inspector.
在一具体实施方式中,步骤S50之后,将所述诊断信息与各所述病例判断结果进行匹配之后,还包括:In a specific embodiment, after step S50, after matching the diagnosis information with each of the case judgment results, the method further includes:
在所述诊断信息与所有所述病例判断结果均不匹配时,确定所述待校验病历文本校验失败,并提示所述诊断信息存在误判风险。When the diagnostic information does not match all the case judgment results, it is determined that the text verification of the medical record to be verified fails, and a risk of misjudgment exists in the diagnostic information.
可以理解地,在将所述诊断信息与各所述病例判断结果进行匹配之后,若诊断信息与所有病例判断结果均不匹配时,表征诊断信息可能与病例信息和科室信息不匹配,进而确定所述待校验病历文本校验失败,并提示所述诊断信息存在误判风险,以等待预设接收方对该待校验病历进行人工校验。Understandably, after the diagnosis information is matched with the judgment results of each of the cases, if the diagnosis information does not match the judgment results of all the cases, the representative diagnosis information may not match the case information and department information, and then it is determined that the diagnosis information does not match the case information and the department information. The text verification of the medical record to be verified fails, and a risk of misjudgment of the diagnostic information is prompted, so as to wait for the preset recipient to manually verify the medical record to be verified.
在本实施例中,通过引入病例信息以及科室信息,通过病例表征模型以及科室表征模型学习了病例信息与科室信息之间的关联性,使得病例判别网络模型根据病例信息以及科室信息进行预测后输出的病例判断结果具有更高的准确性,并且提高了病历信息校验监控的效率。In this embodiment, by introducing case information and department information, the correlation between case information and department information is learned through the case representation model and department representation model, so that the case discrimination network model can predict and output the case information and department information. The results of the case judgment have higher accuracy, and improve the efficiency of medical record information verification and monitoring.
应理解,上述实施例中各步骤的序号的大小并不意味着执行顺序的先后,各过程的执行顺序应以其功能和内在逻辑确定,而不应对本申请实施例的实施过程构成任何限定。It should be understood that the size of the sequence numbers of the steps in the above embodiments does not mean the sequence of execution, and the execution sequence of each process should be determined by its function and internal logic, and should not constitute any limitation to the implementation process of the embodiments of the present application.
在一实施例中,提供一种病历信息校验装置,该病历信息校验装置与上述实施例中病历信息校验方法一一对应。如图5所示,该病历信息校验装置包括病历文本获取模块10、第一向量表征模块20、向量拼接模块30、病例判断模块40和病例匹配模块50。各功能模块详细说明如下:In one embodiment, a medical record information verification apparatus is provided, and the medical record information verification apparatus corresponds one-to-one with the medical record information verification method in the above embodiment. As shown in FIG. 5 , the medical record information verification device includes a medical record text acquisition module 10 , a first vector representation module 20 , a vector splicing module 30 , a case judgment module 40 and a case matching module 50 . The detailed description of each functional module is as follows:
病历文本获取模块10,用于获取待校验病历文本;所述待校验病历文本包含病例信息、与所述病例信息关联的科室信息以及诊断信息;The medical record text acquisition module 10 is used to acquire the medical record text to be verified; the medical record text to be verified includes case information, department information associated with the case information, and diagnosis information;
第一向量表征模块20,用于将所述病例信息输入至病例表征模型中,得到与所述病例 信息对应的病例表征向量;同时,将所述科室信息输入至科室表征模型中,得到与所述科室信息对应的科室表征向量;The first vector representation module 20 is used for inputting the case information into the case representation model to obtain a case representation vector corresponding to the case information; meanwhile, inputting the department information into the department representation model to obtain a case representation vector corresponding to the case information. Describe the department representation vector corresponding to the department information;
向量拼接模块30,用于对所述病例表征向量以及所述科室表征向量进行拼接处理,得到病历拼接向量;The vector splicing module 30 is used for splicing the case representation vector and the department representation vector to obtain a medical record splicing vector;
病例判断模块40,用于将所述病历拼接向量输入至病例判别网络模型中,确定与所述待校验病历文本对应的至少一个病例判断结果;The case judgment module 40 is used to input the medical record splicing vector into the case judgment network model, and determine at least one case judgment result corresponding to the medical record text to be verified;
病例匹配模块50,用于将所述诊断信息与各所述病例判断结果进行匹配,在所述诊断信息与任意一个所述病例判断结果匹配成功时,确定所述待校验病历文本校验成功。The case matching module 50 is used to match the diagnosis information with each of the case judgment results, and when the diagnosis information is successfully matched with any of the case judgment results, it is determined that the verification of the medical record text to be verified is successful .
优选地,如图6所示,病历信息校验装置还包括:Preferably, as shown in Figure 6, the medical record information verification device further includes:
病历样本文本集获取模块01,用于获取预设病历样本文本集;所述预设病历样本文本集中包含至少一个病历样本文本;所述病历样本文本包含病例样本信息以及与所述病例样本信息对应的科室样本信息;一个所述病历样本文本关联一个病历样本标签;A medical record sample text set acquisition module 01, configured to acquire a preset medical record sample text set; the preset medical record sample text set includes at least one medical record sample text; the medical record sample text includes case sample information and corresponds to the case sample information Department sample information; a medical record sample text associated with a medical record sample label;
第二向量表征模块02,用于将所述病历样本文本输入至预设孪生表征模型中,通过所述预设孪生表征模型中包含第一初始参数的病例训练模型对所述病例样本信息进行向量表征,得到得到病例样本向量;同时,通过所述预设孪生表征模型中包含第二初始参数的科室训练模型对所述科室样本信息进行向量表征,得到科室样本向量;The second vector characterization module 02 is configured to input the text of the medical record sample into a preset twin representation model, and perform vectorization on the case sample information through the case training model including the first initial parameter in the preset twin representation model Characterization, to obtain a case sample vector; at the same time, through the department training model including the second initial parameter in the preset twin representation model, the department sample information is vectorized to obtain the department sample vector;
标签预测模块03,用于对所述病例样本向量以及所述科室样本向量进行拼接处理,得到样本拼接向量,并将所述样本拼接向量输入至初始回归模型中,确定与所述病历样本文本对应的标签预测概率; Label prediction module 03, configured to perform splicing processing on the case sample vector and the department sample vector to obtain a sample splicing vector, and input the sample splicing vector into the initial regression model, and determine that it corresponds to the medical record sample text The label prediction probability of ;
预测损失值确定模块04,用于根据所述病历样本标签以及所述标签预测概率确定所述预设孪生表征模型的预测损失值;A predicted loss value determination module 04, configured to determine the predicted loss value of the preset twin representation model according to the medical record sample label and the label prediction probability;
参数更新模块05,用于在所述预测损失值未达到预设的收敛条件时,更新迭代所述病例训练模型的第一初始参数,以及所述科室训练模型的第二初始参数,直至所述预测损失值达到所述预设的收敛条件时,将收敛之后的所述病例训练模型记录为所述病例表征模型,将收敛之后的所述科室训练模型记录为所述科室表征模型。A parameter update module 05, configured to update and iterate the first initial parameters of the case training model and the second initial parameters of the department training model when the predicted loss value does not reach a preset convergence condition, until the When the predicted loss value reaches the preset convergence condition, the case training model after convergence is recorded as the case characterization model, and the department training model after convergence is recorded as the department characterization model.
优选地,预测损失值确定模块包括:Preferably, the predicted loss value determination module includes:
预测损失值确定单元,用于根据所述病历样本标签以及所述标签预测概率,通过交叉熵损失函数确定所述预测损失值;所述交叉熵损失函数为:A predicted loss value determination unit, configured to determine the predicted loss value through a cross-entropy loss function according to the medical record sample label and the label prediction probability; the cross-entropy loss function is:
Loss=w1*y*log(p)+w0*(1-y)*log(1-p)Loss=w1*y*log(p)+w0*(1-y)*log(1-p)
其中,Loss为所述预测损失值;w1以及w0为所述预设孪生表征模型的权重;y为所述病历样本标签;p为所述标签预测概率。Wherein, Loss is the prediction loss value; w1 and w0 are the weights of the preset twin representation model; y is the label of the medical record sample; p is the label prediction probability.
优选地,向量拼接模块30包括:Preferably, the vector splicing module 30 includes:
向量拼接单元,用于将所述科室表征向量拼接至所述病例表征向量的后端之后,得到所述病历拼接向量。The vector splicing unit is configured to obtain the medical record splicing vector after splicing the department representation vector to the back end of the case representation vector.
优选地,如图7所示,病例判断模块40包括:Preferably, as shown in FIG. 7 , the case judgment module 40 includes:
卷积池化单元401,用于通过所述病例判别网络模型中的预设卷积神经网络对所述病历拼接向量进行卷积池化处理,得到病历输出向量;A convolution pooling unit 401, configured to perform convolution pooling on the medical record splicing vector through a preset convolutional neural network in the case discrimination network model to obtain a medical record output vector;
病例分类单元402,用于通过所述病例判别网络模型中的预设分类网络对所述病历输出向量进行病例分类,确定与所述待校验病历文本对应的病例判断结果。The case classification unit 402 is configured to perform case classification on the medical record output vector through a preset classification network in the case discrimination network model, and determine a case judgment result corresponding to the medical record text to be verified.
优选地,病历信息校验装置还包括:Preferably, the medical record information verification device further includes:
初始参数获取模块,用于获取所述病例表征模型的第三初始参数,以及所述科室表征模型的第四初始参数;an initial parameter acquisition module for acquiring the third initial parameter of the case characterization model and the fourth initial parameter of the department characterization model;
初始参数记录模块,用于将所述第三初始参数与所述第四初始参数的均值记录为病例判别网络模型的判别初始参数。The initial parameter recording module is configured to record the mean value of the third initial parameter and the fourth initial parameter as the initial parameter for discrimination of the case discrimination network model.
优选地,病历信息校验装置还包括:Preferably, the medical record information verification device further includes:
校验失败提示模块,用于在所述诊断信息与所有所述病例判断结果均不匹配时,确定所述待校验病历文本校验失败,并提示所述诊断信息存在误判风险。A verification failure prompting module is configured to determine that the text verification of the medical record to be verified fails when the diagnostic information does not match the judgment results of all the cases, and prompt that the diagnostic information has a risk of misjudgment.
关于病历信息校验装置的具体限定可以参见上文中对于病历信息校验方法的限定,在此不再赘述。上述病历信息校验装置中的各个模块可全部或部分通过软件、硬件及其组合来实现。上述各模块可以硬件形式内嵌于或独立于计算机设备中的处理器中,也可以以软件形式存储于计算机设备中的存储器中,以便于处理器调用执行以上各个模块对应的操作。For specific limitations on the medical record information verification device, reference may be made to the above limitations on the medical record information verification method, which will not be repeated here. Each module in the above-mentioned medical record information verification device may be implemented in whole or in part by software, hardware and combinations thereof. The above modules can be embedded in or independent of the processor in the computer device in the form of hardware, or stored in the memory in the computer device in the form of software, so that the processor can call and execute the operations corresponding to the above modules.
在一个实施例中,提供了一种计算机设备,该计算机设备可以是服务器,其内部结构图可以如图8所示。该计算机设备包括通过系统总线连接的处理器、存储器、网络接口和数据库。其中,该计算机设备的处理器用于提供计算和控制能力。该计算机设备的存储器包括可读存储介质、内存储器。该可读存储介质存储有操作系统、计算机可读指令和数据库。该内存储器为可读存储介质中的操作系统和计算机可读指令的运行提供环境。该计算机设备的数据库用于存储上述实施例中病历信息校验方法所使用到的数据。该计算机设备的网络接口用于与外部的终端通过网络连接通信。该计算机可读指令被处理器执行时以实现一种病历信息校验方法。本实施例所提供的可读存储介质包括非易失性可读存储介质和易失性可读存储介质。In one embodiment, a computer device is provided, and the computer device may be a server, and its internal structure diagram may be as shown in FIG. 8 . The computer device includes a processor, memory, a network interface, and a database connected by a system bus. Among them, the processor of the computer device is used to provide computing and control capabilities. The memory of the computer device includes a readable storage medium, an internal memory. The readable storage medium stores an operating system, computer readable instructions and a database. The internal memory provides an environment for the execution of the operating system and computer-readable instructions in the readable storage medium. The database of the computer device is used to store the data used in the medical record information verification method in the above embodiment. The network interface of the computer device is used to communicate with an external terminal through a network connection. The computer-readable instructions, when executed by the processor, implement a method for verifying medical record information. The readable storage medium provided in this embodiment includes a non-volatile readable storage medium and a volatile readable storage medium.
在一个实施例中,提供了一种计算机设备,包括存储器、处理器以及存储在所述存储器中并可在所述处理器上运行的计算机可读指令,其中,所述处理器执行所述计算机可读指令时实现如下步骤:In one embodiment, there is provided a computer apparatus comprising a memory, a processor, and computer readable instructions stored in the memory and executable on the processor, wherein the processor executes the computer The following steps are implemented when readable instructions:
获取待校验病历文本;所述待校验病历文本包含病例信息、与所述病例信息关联的科室信息以及诊断信息;Obtain the medical record text to be verified; the medical record text to be verified includes case information, department information associated with the case information, and diagnostic information;
将所述病例信息输入至病例表征模型中,得到与所述病例信息对应的病例表征向量;同时,将所述科室信息输入至科室表征模型中,得到与所述科室信息对应的科室表征向量;Inputting the case information into a case representation model to obtain a case representation vector corresponding to the case information; at the same time, inputting the department information into the department representation model to obtain a department representation vector corresponding to the department information;
对所述病例表征向量以及所述科室表征向量进行拼接处理,得到病历拼接向量;Perform splicing processing on the case representation vector and the department representation vector to obtain a medical record splicing vector;
将所述病历拼接向量输入至病例判别网络模型中,确定与所述待校验病历文本对应的至少一个病例判断结果;Inputting the medical record splicing vector into the case discrimination network model, and determining at least one case judgment result corresponding to the medical record text to be verified;
将所述诊断信息与各所述病例判断结果进行匹配,在所述诊断信息与任意一个所述病例判断结果匹配成功时,确定所述待校验病历文本校验成功。The diagnostic information is matched with each of the case judgment results, and when the diagnostic information is successfully matched with any one of the case judgment results, it is determined that the text of the medical record to be verified is successfully verified.
在一个实施例中,提供了一个或多个存储有计算机可读指令的可读存储介质,其中,所述计算机可读指令被一个或多个处理器执行时,使得所述一个或多个处理器执行如下步骤:In one embodiment, one or more readable storage media are provided having computer-readable instructions stored thereon, wherein the computer-readable instructions, when executed by one or more processors, cause the one or more processing The device performs the following steps:
获取待校验病历文本;所述待校验病历文本包含病例信息、与所述病例信息关联的科室信息以及诊断信息;Obtain the medical record text to be verified; the medical record text to be verified includes case information, department information associated with the case information, and diagnostic information;
将所述病例信息输入至病例表征模型中,得到与所述病例信息对应的病例表征向量;同时,将所述科室信息输入至科室表征模型中,得到与所述科室信息对应的科室表征向量;Inputting the case information into a case representation model to obtain a case representation vector corresponding to the case information; at the same time, inputting the department information into the department representation model to obtain a department representation vector corresponding to the department information;
对所述病例表征向量以及所述科室表征向量进行拼接处理,得到病历拼接向量;Perform splicing processing on the case representation vector and the department representation vector to obtain a medical record splicing vector;
将所述病历拼接向量输入至病例判别网络模型中,确定与所述待校验病历文本对应的至少一个病例判断结果;Inputting the medical record splicing vector into the case discrimination network model, and determining at least one case judgment result corresponding to the medical record text to be verified;
将所述诊断信息与各所述病例判断结果进行匹配,在所述诊断信息与任意一个所述病例判断结果匹配成功时,确定所述待校验病历文本校验成功。The diagnostic information is matched with each of the case judgment results, and when the diagnostic information is successfully matched with any one of the case judgment results, it is determined that the text of the medical record to be verified is successfully verified.
本领域普通技术人员可以理解实现上述实施例方法中的全部或部分流程,是可以通过计算机可读指令来指令相关的硬件来完成,所述的计算机可读指令可存储于一非易失性计 算机可读取存储介质或者易失性计算机可读取存储介质中,该计算机可读指令在执行时,可包括如上述各方法的实施例的流程。其中,本申请所提供的各实施例中所使用的对存储器、存储、数据库或其它介质的任何引用,均可包括非易失性和/或易失性存储器。非易失性存储器可包括只读存储器(ROM)、可编程ROM(PROM)、电可编程ROM(EPROM)、电可擦除可编程ROM(EEPROM)或闪存。易失性存储器可包括随机存取存储器(RAM)或者外部高速缓冲存储器。作为说明而非局限,RAM以多种形式可得,诸如静态RAM(SRAM)、动态RAM(DRAM)、同步DRAM(SDRAM)、双数据率SDRAM(DDRSDRAM)、增强型SDRAM(ESDRAM)、同步链路(Synchlink)DRAM(SLDRAM)、存储器总线(Rambus)直接RAM(RDRAM)、直接存储器总线动态RAM(DRDRAM)、以及存储器总线动态RAM(RDRAM)等。Those of ordinary skill in the art can understand that all or part of the processes in the methods of the above embodiments can be implemented by instructing relevant hardware through computer-readable instructions, and the computer-readable instructions can be stored in a non-volatile computer. In a readable storage medium or a volatile computer-readable storage medium, the computer-readable instructions, when executed, may include the processes of the foregoing method embodiments. Wherein, any reference to memory, storage, database or other medium used in the various embodiments provided in this application may include non-volatile and/or volatile memory. Nonvolatile memory may include read only memory (ROM), programmable ROM (PROM), electrically programmable ROM (EPROM), electrically erasable programmable ROM (EEPROM), or flash memory. Volatile memory may include random access memory (RAM) or external cache memory. By way of illustration and not limitation, RAM is available in various forms such as static RAM (SRAM), dynamic RAM (DRAM), synchronous DRAM (SDRAM), double data rate SDRAM (DDRSDRAM), enhanced SDRAM (ESDRAM), synchronous chain Road (Synchlink) DRAM (SLDRAM), memory bus (Rambus) direct RAM (RDRAM), direct memory bus dynamic RAM (DRDRAM), and memory bus dynamic RAM (RDRAM), etc.
所属领域的技术人员可以清楚地了解到,为了描述的方便和简洁,仅以上述各功能单元、模块的划分进行举例说明,实际应用中,可以根据需要而将上述功能分配由不同的功能单元、模块完成,即将所述装置的内部结构划分成不同的功能单元或模块,以完成以上描述的全部或者部分功能。Those skilled in the art can clearly understand that, for the convenience and simplicity of description, only the division of the above-mentioned functional units and modules is used as an example for illustration. In practical applications, the above-mentioned functions can be allocated to different functional units, Module completion, that is, dividing the internal structure of the device into different functional units or modules to complete all or part of the functions described above.
以上所述实施例仅用以说明本申请的技术方案,而非对其限制;尽管参照前述实施例对本申请进行了详细的说明,本领域的普通技术人员应当理解:其依然可以对前述各实施例所记载的技术方案进行修改,或者对其中部分技术特征进行等同替换;而这些修改或者替换,并不使相应技术方案的本质脱离本申请各实施例技术方案的精神和范围,均应包含在本申请的保护范围之内。The above-mentioned embodiments are only used to illustrate the technical solutions of the present application, but not to limit them; although the present application has been described in detail with reference to the above-mentioned embodiments, those of ordinary skill in the art should understand that: it can still be used for the above-mentioned implementations. The technical solutions described in the examples are modified, or some technical features thereof are equivalently replaced; and these modifications or replacements do not make the essence of the corresponding technical solutions deviate from the spirit and scope of the technical solutions in the embodiments of the application, and should be included in the within the scope of protection of this application.

Claims (20)

  1. 一种病历信息校验方法,其中,包括:A medical record information verification method, comprising:
    获取待校验病历文本;所述待校验病历文本包含病例信息、与所述病例信息关联的科室信息以及诊断信息;Obtain the medical record text to be verified; the medical record text to be verified includes case information, department information associated with the case information, and diagnostic information;
    将所述病例信息输入至病例表征模型中,得到与所述病例信息对应的病例表征向量;同时,将所述科室信息输入至科室表征模型中,得到与所述科室信息对应的科室表征向量;Inputting the case information into a case representation model to obtain a case representation vector corresponding to the case information; at the same time, inputting the department information into the department representation model to obtain a department representation vector corresponding to the department information;
    对所述病例表征向量以及所述科室表征向量进行拼接处理,得到病历拼接向量;Perform splicing processing on the case representation vector and the department representation vector to obtain a medical record splicing vector;
    将所述病历拼接向量输入至病例判别网络模型中,确定与所述待校验病历文本对应的至少一个病例判断结果;Inputting the medical record splicing vector into the case discrimination network model, and determining at least one case judgment result corresponding to the medical record text to be verified;
    将所述诊断信息与各所述病例判断结果进行匹配,在所述诊断信息与任意一个所述病例判断结果匹配成功时,确定所述待校验病历文本校验成功。The diagnostic information is matched with each of the case judgment results, and when the diagnostic information is successfully matched with any one of the case judgment results, it is determined that the text of the medical record to be verified is successfully verified.
  2. 如权利要求1所述的病历信息校验方法,其中,所述将所述病例信息输入至病例表征模型中,得到与所述病例信息对应的病例表征向量之前,包括:The method for verifying medical record information according to claim 1, wherein before the inputting the case information into the case representation model and obtaining the case representation vector corresponding to the case information, the method comprises:
    获取预设病历样本文本集;所述预设病历样本文本集中包含至少一个病历样本文本;所述病历样本文本包含病例样本信息以及与所述病例样本信息对应的科室样本信息;一个所述病历样本文本关联一个病历样本标签;Obtain a preset medical record sample text set; the preset medical record sample text set includes at least one medical record sample text; the medical record sample text includes case sample information and department sample information corresponding to the case sample information; one of the medical record sample information The text is associated with a medical record sample label;
    将所述病历样本文本输入至预设孪生表征模型中,通过所述预设孪生表征模型中包含第一初始参数的病例训练模型对所述病例样本信息进行向量表征,得到得到病例样本向量;同时,通过所述预设孪生表征模型中包含第二初始参数的科室训练模型对所述科室样本信息进行向量表征,得到科室样本向量;Inputting the medical record sample text into a preset twin representation model, and performing vector representation on the case sample information through a case training model that includes the first initial parameter in the preset twin representation model to obtain a case sample vector; , performing vector representation on the department sample information through the department training model including the second initial parameter in the preset twin characterization model, to obtain a department sample vector;
    对所述病例样本向量以及所述科室样本向量进行拼接处理,得到样本拼接向量,并将所述样本拼接向量输入至初始回归模型中,确定与所述病历样本文本对应的标签预测概率;Perform splicing processing on the case sample vector and the department sample vector to obtain a sample splicing vector, and input the sample splicing vector into an initial regression model to determine the label prediction probability corresponding to the medical record sample text;
    根据所述病历样本标签以及所述标签预测概率确定所述预设孪生表征模型的预测损失值;Determine the predicted loss value of the preset twin representation model according to the medical record sample label and the label prediction probability;
    在所述预测损失值未达到预设的收敛条件时,更新迭代所述病例训练模型的第一初始参数,以及所述科室训练模型的第二初始参数,直至所述预测损失值达到所述预设的收敛条件时,将收敛之后的所述病例训练模型记录为所述病例表征模型,将收敛之后的所述科室训练模型记录为所述科室表征模型。When the predicted loss value does not reach the preset convergence condition, update and iterate the first initial parameter of the case training model and the second initial parameter of the department training model until the predicted loss value reaches the predicted loss value. When the convergence condition is set, the case training model after convergence is recorded as the case characterization model, and the department training model after convergence is recorded as the department characterization model.
  3. 如权利要求2所述的病历信息校验方法,其中,所述根据所述病历样本标签以及所述标签预测概率确定所述预设孪生表征模型的预测损失值,包括:The method for verifying medical record information according to claim 2, wherein the determining the predicted loss value of the preset twin representation model according to the medical record sample label and the label prediction probability comprises:
    根据所述病历样本标签以及所述标签预测概率,通过交叉熵损失函数确定所述预测损失值;所述交叉熵损失函数为:According to the medical record sample label and the label prediction probability, the predicted loss value is determined by a cross-entropy loss function; the cross-entropy loss function is:
    Loss=w1*y*log(p)+w0*(1-y)*log(1-p)Loss=w1*y*log(p)+w0*(1-y)*log(1-p)
    其中,Loss为所述预测损失值;w1以及w0为所述预设孪生表征模型的权重;y为所述病历样本标签;p为所述标签预测概率。Wherein, Loss is the prediction loss value; w1 and w0 are the weights of the preset twin representation model; y is the label of the medical record sample; p is the label prediction probability.
  4. 如权利要求1所述的病历信息校验方法,其中,所述对所述病例表征向量以及所述科室表征向量进行拼接处理,得到病历拼接向量,包括:The medical record information verification method according to claim 1, wherein the splicing process is performed on the case representation vector and the department representation vector to obtain a medical record splicing vector, comprising:
    将所述科室表征向量拼接至所述病例表征向量的后端之后,得到所述病历拼接向量。After splicing the department representation vector to the back end of the case representation vector, the medical record splicing vector is obtained.
  5. 如权利要求1所述的病历信息校验方法,其中,所述将所述病历拼接向量输入至输入至病例判别网络模型中,确定与所述待校验病历文本对应的病例判断结果,包括:The medical record information verification method according to claim 1, wherein the inputting the medical record splicing vector into the input case discrimination network model, and determining the case judgment result corresponding to the medical record text to be verified, comprising:
    通过所述病例判别网络模型中的预设卷积神经网络对所述病历拼接向量进行卷积池化处理,得到病历输出向量;The medical record splicing vector is subjected to convolution pooling processing through the preset convolutional neural network in the case discrimination network model to obtain a medical record output vector;
    通过所述病例判别网络模型中的预设分类网络对所述病历输出向量进行病例分类,确定与所述待校验病历文本对应的病例判断结果。Case classification is performed on the medical record output vector through a preset classification network in the case discrimination network model, and a case judgment result corresponding to the medical record text to be verified is determined.
  6. 如权利要求5所述的病历信息校验方法,其中,所述将所述病历拼接向量输入至输入至病例判别网络模型中之前,包括:The medical record information verification method according to claim 5, wherein, before the inputting the medical record splicing vector into the case discrimination network model, the method comprises:
    获取所述病例表征模型的第三初始参数,以及所述科室表征模型的第四初始参数;obtaining the third initial parameter of the case characterization model, and the fourth initial parameter of the department characterization model;
    将所述第三初始参数与所述第四初始参数的均值记录为所述预设卷积神经网络的判别初始参数。The average value of the third initial parameter and the fourth initial parameter is recorded as the discriminative initial parameter of the preset convolutional neural network.
  7. 如权利要求1所述的病历信息校验方法,其中,所述将所述诊断信息与各所述病例判断结果进行匹配之后,还包括:The method for verifying medical record information according to claim 1, wherein after the matching of the diagnosis information with each of the case judgment results, the method further comprises:
    在所述诊断信息与所有所述病例判断结果均不匹配时,确定所述待校验病历文本校验失败,并提示所述诊断信息存在误判风险。When the diagnostic information does not match all the case judgment results, it is determined that the text verification of the medical record to be verified fails, and a risk of misjudgment exists in the diagnostic information.
  8. 一种病历信息校验装置,其中,包括:A medical record information verification device, comprising:
    病历文本获取模块,用于获取待校验病历文本;所述待校验病历文本包含病例信息、与所述病例信息关联的科室信息以及诊断信息;a medical record text acquisition module, used for acquiring the medical record text to be verified; the medical record text to be verified includes case information, department information associated with the case information, and diagnosis information;
    第一向量表征模块,用于将所述病例信息输入至病例表征模型中,得到与所述病例信息对应的病例表征向量;同时,将所述科室信息输入至科室表征模型中,得到与所述科室信息对应的科室表征向量;The first vector characterization module is used to input the case information into the case characterization model to obtain a case characterization vector corresponding to the case information; meanwhile, input the department information into the department characterization model to obtain the same The department representation vector corresponding to the department information;
    向量拼接模块,用于对所述病例表征向量以及所述科室表征向量进行拼接处理,得到病历拼接向量;a vector splicing module for splicing the case representation vector and the department representation vector to obtain a medical record splicing vector;
    病例判断模块,用于将所述病历拼接向量输入至病例判别网络模型中,确定与所述待校验病历文本对应的至少一个病例判断结果;A case judgment module, configured to input the medical record splicing vector into a case judgment network model, and determine at least one case judgment result corresponding to the medical record text to be verified;
    病例匹配模块,用于将所述诊断信息与各所述病例判断结果进行匹配,在所述诊断信息与任意一个所述病例判断结果匹配成功时,确定所述待校验病历文本校验成功。The case matching module is configured to match the diagnosis information with each of the case judgment results, and when the diagnosis information and any one of the case judgment results are successfully matched, it is determined that the verification of the medical record text to be verified is successful.
  9. 一种计算机设备,包括存储器、处理器以及存储在所述存储器中并可在所述处理器上运行的计算机可读指令,其中,所述处理器执行所述计算机可读指令时实现如下步骤:A computer device comprising a memory, a processor, and computer-readable instructions stored in the memory and executable on the processor, wherein the processor implements the following steps when executing the computer-readable instructions:
    获取待校验病历文本;所述待校验病历文本包含病例信息、与所述病例信息关联的科室信息以及诊断信息;Obtain the medical record text to be verified; the medical record text to be verified includes case information, department information associated with the case information, and diagnostic information;
    将所述病例信息输入至病例表征模型中,得到与所述病例信息对应的病例表征向量;同时,将所述科室信息输入至科室表征模型中,得到与所述科室信息对应的科室表征向量;Inputting the case information into a case representation model to obtain a case representation vector corresponding to the case information; at the same time, inputting the department information into the department representation model to obtain a department representation vector corresponding to the department information;
    对所述病例表征向量以及所述科室表征向量进行拼接处理,得到病历拼接向量;Perform splicing processing on the case representation vector and the department representation vector to obtain a medical record splicing vector;
    将所述病历拼接向量输入至病例判别网络模型中,确定与所述待校验病历文本对应的至少一个病例判断结果;Inputting the medical record splicing vector into the case discrimination network model, and determining at least one case judgment result corresponding to the medical record text to be verified;
    将所述诊断信息与各所述病例判断结果进行匹配,在所述诊断信息与任意一个所述病例判断结果匹配成功时,确定所述待校验病历文本校验成功。The diagnostic information is matched with each of the case judgment results, and when the diagnostic information is successfully matched with any one of the case judgment results, it is determined that the text of the medical record to be verified is successfully verified.
  10. 如权利要求9所述的计算机设备,其中,所述将所述病例信息输入至病例表征模型中,得到与所述病例信息对应的病例表征向量之前,所述处理器执行所述计算机可读指令时还实现如下步骤:The computer device of claim 9, wherein the processor executes the computer readable instructions before the case information is input into a case representation model to obtain a case representation vector corresponding to the case information Also implement the following steps:
    获取预设病历样本文本集;所述预设病历样本文本集中包含至少一个病历样本文本;所述病历样本文本包含病例样本信息以及与所述病例样本信息对应的科室样本信息;一个所述病历样本文本关联一个病历样本标签;Obtain a preset medical record sample text set; the preset medical record sample text set includes at least one medical record sample text; the medical record sample text includes case sample information and department sample information corresponding to the case sample information; one of the medical record sample information The text is associated with a medical record sample label;
    将所述病历样本文本输入至预设孪生表征模型中,通过所述预设孪生表征模型中包含第一初始参数的病例训练模型对所述病例样本信息进行向量表征,得到得到病例样本向量;同时,通过所述预设孪生表征模型中包含第二初始参数的科室训练模型对所述科室样 本信息进行向量表征,得到科室样本向量;Inputting the medical record sample text into a preset twin representation model, and performing vector representation on the case sample information through a case training model that includes the first initial parameter in the preset twin representation model to obtain a case sample vector; , performing vector representation on the department sample information through the department training model including the second initial parameter in the preset twin characterization model, to obtain a department sample vector;
    对所述病例样本向量以及所述科室样本向量进行拼接处理,得到样本拼接向量,并将所述样本拼接向量输入至初始回归模型中,确定与所述病历样本文本对应的标签预测概率;Perform splicing processing on the case sample vector and the department sample vector to obtain a sample splicing vector, and input the sample splicing vector into an initial regression model to determine the label prediction probability corresponding to the medical record sample text;
    根据所述病历样本标签以及所述标签预测概率确定所述预设孪生表征模型的预测损失值;Determine the predicted loss value of the preset twin representation model according to the medical record sample label and the label prediction probability;
    在所述预测损失值未达到预设的收敛条件时,更新迭代所述病例训练模型的第一初始参数,以及所述科室训练模型的第二初始参数,直至所述预测损失值达到所述预设的收敛条件时,将收敛之后的所述病例训练模型记录为所述病例表征模型,将收敛之后的所述科室训练模型记录为所述科室表征模型。When the predicted loss value does not reach the preset convergence condition, update and iterate the first initial parameter of the case training model and the second initial parameter of the department training model until the predicted loss value reaches the predicted loss value. When the convergence condition is set, the case training model after convergence is recorded as the case characterization model, and the department training model after convergence is recorded as the department characterization model.
  11. 如权利要求10所述的计算机设备,其中,所述根据所述病历样本标签以及所述标签预测概率确定所述预设孪生表征模型的预测损失值,包括:The computer device according to claim 10, wherein the determining the predicted loss value of the preset twin representation model according to the medical record sample label and the label prediction probability comprises:
    根据所述病历样本标签以及所述标签预测概率,通过交叉熵损失函数确定所述预测损失值;所述交叉熵损失函数为:According to the medical record sample label and the label prediction probability, the predicted loss value is determined by a cross-entropy loss function; the cross-entropy loss function is:
    Loss=w1*y*log(p)+w0*(1-y)*log(1-p)Loss=w1*y*log(p)+w0*(1-y)*log(1-p)
    其中,Loss为所述预测损失值;w1以及w0为所述预设孪生表征模型的权重;y为所述病历样本标签;p为所述标签预测概率。Wherein, Loss is the prediction loss value; w1 and w0 are the weights of the preset twin representation model; y is the label of the medical record sample; p is the label prediction probability.
  12. 如权利要求9所述的计算机设备,其中,所述对所述病例表征向量以及所述科室表征向量进行拼接处理,得到病历拼接向量,包括:The computer device according to claim 9, wherein the splicing process is performed on the case representation vector and the department representation vector to obtain a medical record splicing vector, comprising:
    将所述科室表征向量拼接至所述病例表征向量的后端之后,得到所述病历拼接向量。After splicing the department representation vector to the back end of the case representation vector, the medical record splicing vector is obtained.
  13. 如权利要求9所述的计算机设备,其中,所述将所述病历拼接向量输入至输入至病例判别网络模型中,确定与所述待校验病历文本对应的病例判断结果,包括:The computer device according to claim 9, wherein the inputting the medical record splicing vector into the input case discrimination network model, and determining the case judgment result corresponding to the medical record text to be verified, comprising:
    通过所述病例判别网络模型中的预设卷积神经网络对所述病历拼接向量进行卷积池化处理,得到病历输出向量;The medical record splicing vector is subjected to convolution pooling processing through the preset convolutional neural network in the case discrimination network model to obtain a medical record output vector;
    通过所述病例判别网络模型中的预设分类网络对所述病历输出向量进行病例分类,确定与所述待校验病历文本对应的病例判断结果。Case classification is performed on the medical record output vector through a preset classification network in the case discrimination network model, and a case judgment result corresponding to the medical record text to be verified is determined.
  14. 如权利要求13所述的计算机设备,其中,所述将所述病历拼接向量输入至输入至病例判别网络模型中之前,所述处理器执行所述计算机可读指令时还实现如下步骤:The computer device according to claim 13, wherein, before the medical record splicing vector is input into the case discrimination network model, the processor further implements the following steps when executing the computer-readable instructions:
    获取所述病例表征模型的第三初始参数,以及所述科室表征模型的第四初始参数;obtaining the third initial parameter of the case characterization model, and the fourth initial parameter of the department characterization model;
    将所述第三初始参数与所述第四初始参数的均值记录为所述预设卷积神经网络的判别初始参数。The average value of the third initial parameter and the fourth initial parameter is recorded as the discriminative initial parameter of the preset convolutional neural network.
  15. 一个或多个存储有计算机可读指令的可读存储介质,其中,所述计算机可读指令被一个或多个处理器执行时,使得所述一个或多个处理器执行如下步骤:One or more readable storage media storing computer-readable instructions, wherein the computer-readable instructions, when executed by one or more processors, cause the one or more processors to perform the following steps:
    获取待校验病历文本;所述待校验病历文本包含病例信息、与所述病例信息关联的科室信息以及诊断信息;Obtain the medical record text to be verified; the medical record text to be verified includes case information, department information associated with the case information, and diagnostic information;
    将所述病例信息输入至病例表征模型中,得到与所述病例信息对应的病例表征向量;同时,将所述科室信息输入至科室表征模型中,得到与所述科室信息对应的科室表征向量;Inputting the case information into a case representation model to obtain a case representation vector corresponding to the case information; at the same time, inputting the department information into the department representation model to obtain a department representation vector corresponding to the department information;
    对所述病例表征向量以及所述科室表征向量进行拼接处理,得到病历拼接向量;Perform splicing processing on the case representation vector and the department representation vector to obtain a medical record splicing vector;
    将所述病历拼接向量输入至病例判别网络模型中,确定与所述待校验病历文本对应的至少一个病例判断结果;Inputting the medical record splicing vector into the case discrimination network model, and determining at least one case judgment result corresponding to the medical record text to be verified;
    将所述诊断信息与各所述病例判断结果进行匹配,在所述诊断信息与任意一个所述病例判断结果匹配成功时,确定所述待校验病历文本校验成功。The diagnostic information is matched with each of the case judgment results, and when the diagnostic information is successfully matched with any one of the case judgment results, it is determined that the text of the medical record to be verified is successfully verified.
  16. 如权利要求15所述的可读存储介质,其中,所述将所述病例信息输入至病例表征模型中,得到与所述病例信息对应的病例表征向量之前,所述计算机可读指令被一个或多个处理器执行时,使得所述一个或多个处理器还执行如下步骤:16. The readable storage medium of claim 15, wherein before the case information is input into the case representation model and the case representation vector corresponding to the case information is obtained, the computer-readable instructions are executed by one or more When the multiple processors are executed, the one or more processors are caused to further perform the following steps:
    获取预设病历样本文本集;所述预设病历样本文本集中包含至少一个病历样本文本;所述病历样本文本包含病例样本信息以及与所述病例样本信息对应的科室样本信息;一个所述病历样本文本关联一个病历样本标签;Obtain a preset medical record sample text set; the preset medical record sample text set includes at least one medical record sample text; the medical record sample text includes case sample information and department sample information corresponding to the case sample information; one of the medical record sample information The text is associated with a medical record sample label;
    将所述病历样本文本输入至预设孪生表征模型中,通过所述预设孪生表征模型中包含第一初始参数的病例训练模型对所述病例样本信息进行向量表征,得到得到病例样本向量;同时,通过所述预设孪生表征模型中包含第二初始参数的科室训练模型对所述科室样本信息进行向量表征,得到科室样本向量;Inputting the medical record sample text into a preset twin representation model, and performing vector representation on the case sample information through a case training model that includes the first initial parameter in the preset twin representation model to obtain a case sample vector; , performing vector representation on the department sample information through the department training model including the second initial parameter in the preset twin characterization model, to obtain a department sample vector;
    对所述病例样本向量以及所述科室样本向量进行拼接处理,得到样本拼接向量,并将所述样本拼接向量输入至初始回归模型中,确定与所述病历样本文本对应的标签预测概率;Perform splicing processing on the case sample vector and the department sample vector to obtain a sample splicing vector, and input the sample splicing vector into an initial regression model to determine the label prediction probability corresponding to the medical record sample text;
    根据所述病历样本标签以及所述标签预测概率确定所述预设孪生表征模型的预测损失值;Determine the predicted loss value of the preset twin representation model according to the medical record sample label and the label prediction probability;
    在所述预测损失值未达到预设的收敛条件时,更新迭代所述病例训练模型的第一初始参数,以及所述科室训练模型的第二初始参数,直至所述预测损失值达到所述预设的收敛条件时,将收敛之后的所述病例训练模型记录为所述病例表征模型,将收敛之后的所述科室训练模型记录为所述科室表征模型。When the predicted loss value does not reach the preset convergence condition, update and iterate the first initial parameter of the case training model and the second initial parameter of the department training model until the predicted loss value reaches the predicted loss value. When the convergence condition is set, the case training model after convergence is recorded as the case characterization model, and the department training model after convergence is recorded as the department characterization model.
  17. 如权利要求16所述的可读存储介质,其中,所述根据所述病历样本标签以及所述标签预测概率确定所述预设孪生表征模型的预测损失值,包括:The readable storage medium according to claim 16, wherein the determining the prediction loss value of the preset twin representation model according to the medical record sample label and the label prediction probability comprises:
    根据所述病历样本标签以及所述标签预测概率,通过交叉熵损失函数确定所述预测损失值;所述交叉熵损失函数为:According to the medical record sample label and the label prediction probability, the predicted loss value is determined by a cross-entropy loss function; the cross-entropy loss function is:
    Loss=w1*y*log(p)+w0*(1-y)*log(1-p)Loss=w1*y*log(p)+w0*(1-y)*log(1-p)
    其中,Loss为所述预测损失值;w1以及w0为所述预设孪生表征模型的权重;y为所述病历样本标签;p为所述标签预测概率。Wherein, Loss is the prediction loss value; w1 and w0 are the weights of the preset twin representation model; y is the label of the medical record sample; p is the label prediction probability.
  18. 如权利要求15所述的可读存储介质,其中,所述对所述病例表征向量以及所述科室表征向量进行拼接处理,得到病历拼接向量,包括:The readable storage medium according to claim 15, wherein the splicing process on the case representation vector and the department representation vector to obtain a medical record splicing vector, comprising:
    将所述科室表征向量拼接至所述病例表征向量的后端之后,得到所述病历拼接向量。After splicing the department representation vector to the back end of the case representation vector, the medical record splicing vector is obtained.
  19. 如权利要求15所述的可读存储介质,其中,所述将所述病历拼接向量输入至输入至病例判别网络模型中,确定与所述待校验病历文本对应的病例判断结果,包括:The readable storage medium according to claim 15, wherein the inputting the medical record splicing vector into the input case discrimination network model, and determining the case judgment result corresponding to the medical record text to be verified comprises:
    通过所述病例判别网络模型中的预设卷积神经网络对所述病历拼接向量进行卷积池化处理,得到病历输出向量;The medical record splicing vector is subjected to convolution pooling processing through the preset convolutional neural network in the case discrimination network model to obtain a medical record output vector;
    通过所述病例判别网络模型中的预设分类网络对所述病历输出向量进行病例分类,确定与所述待校验病历文本对应的病例判断结果。Case classification is performed on the medical record output vector through a preset classification network in the case discrimination network model, and a case judgment result corresponding to the medical record text to be verified is determined.
  20. 如权利要求19所述的可读存储介质,其中,所述将所述病历拼接向量输入至输入至病例判别网络模型中之前,所述计算机可读指令被一个或多个处理器执行时,使得所述一个或多个处理器还执行如下步骤:19. The readable storage medium of claim 19, wherein, before the input of the medical record splicing vector into the case discrimination network model, the computer-readable instructions, when executed by one or more processors, cause The one or more processors also perform the following steps:
    获取所述病例表征模型的第三初始参数,以及所述科室表征模型的第四初始参数;obtaining the third initial parameter of the case characterization model, and the fourth initial parameter of the department characterization model;
    将所述第三初始参数与所述第四初始参数的均值记录为所述预设卷积神经网络的判别初始参数。The average value of the third initial parameter and the fourth initial parameter is recorded as the discriminative initial parameter of the preset convolutional neural network.
PCT/CN2021/083196 2021-01-11 2021-03-26 Medical record information verification method and apparatus, and computer device and storage medium WO2022147910A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202110032946.6 2021-01-11
CN202110032946.6A CN112820367B (en) 2021-01-11 2021-01-11 Medical record information verification method and device, computer equipment and storage medium

Publications (1)

Publication Number Publication Date
WO2022147910A1 true WO2022147910A1 (en) 2022-07-14

Family

ID=75869015

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2021/083196 WO2022147910A1 (en) 2021-01-11 2021-03-26 Medical record information verification method and apparatus, and computer device and storage medium

Country Status (2)

Country Link
CN (1) CN112820367B (en)
WO (1) WO2022147910A1 (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115691742A (en) * 2023-01-03 2023-02-03 江西曼荼罗软件有限公司 Electronic medical record quality control method, system, storage medium and equipment
CN115880704A (en) * 2023-02-16 2023-03-31 中国人民解放军总医院第一医学中心 Automatic case cataloging method, system, equipment and storage medium
CN118098475A (en) * 2024-04-22 2024-05-28 来未来科技(浙江)有限公司 Method, device, equipment and storage medium for generating differential diagnosis document

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114491040B (en) * 2022-01-28 2022-12-02 北京百度网讯科技有限公司 Information mining method and device

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20170076046A1 (en) * 2015-09-10 2017-03-16 Roche Molecular Systems, Inc. Informatics platform for integrated clinical care
US20180046773A1 (en) * 2016-08-11 2018-02-15 Htc Corporation Medical system and method for providing medical prediction
CN111402979A (en) * 2020-03-24 2020-07-10 清华大学 Method and device for detecting consistency of disease description and diagnosis
CN111883222A (en) * 2020-09-28 2020-11-03 平安科技(深圳)有限公司 Text data error detection method and device, terminal equipment and storage medium
CN112148882A (en) * 2020-11-27 2020-12-29 北京惠及智医科技有限公司 Medical record text processing method and device, electronic equipment and storage medium

Family Cites Families (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111767707B (en) * 2020-06-30 2023-10-31 平安科技(深圳)有限公司 Method, device, equipment and storage medium for detecting Leideogue cases
CN112016279B (en) * 2020-09-04 2023-11-14 平安科技(深圳)有限公司 Method, device, computer equipment and storage medium for structuring electronic medical record
CN112016318B (en) * 2020-09-08 2023-11-21 平安科技(深圳)有限公司 Triage information recommendation method, device, equipment and medium based on interpretation model

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20170076046A1 (en) * 2015-09-10 2017-03-16 Roche Molecular Systems, Inc. Informatics platform for integrated clinical care
US20180046773A1 (en) * 2016-08-11 2018-02-15 Htc Corporation Medical system and method for providing medical prediction
CN111402979A (en) * 2020-03-24 2020-07-10 清华大学 Method and device for detecting consistency of disease description and diagnosis
CN111883222A (en) * 2020-09-28 2020-11-03 平安科技(深圳)有限公司 Text data error detection method and device, terminal equipment and storage medium
CN112148882A (en) * 2020-11-27 2020-12-29 北京惠及智医科技有限公司 Medical record text processing method and device, electronic equipment and storage medium

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115691742A (en) * 2023-01-03 2023-02-03 江西曼荼罗软件有限公司 Electronic medical record quality control method, system, storage medium and equipment
CN115880704A (en) * 2023-02-16 2023-03-31 中国人民解放军总医院第一医学中心 Automatic case cataloging method, system, equipment and storage medium
CN115880704B (en) * 2023-02-16 2023-06-16 中国人民解放军总医院第一医学中心 Automatic cataloging method, system, equipment and storage medium for cases
CN118098475A (en) * 2024-04-22 2024-05-28 来未来科技(浙江)有限公司 Method, device, equipment and storage medium for generating differential diagnosis document

Also Published As

Publication number Publication date
CN112820367B (en) 2023-06-30
CN112820367A (en) 2021-05-18

Similar Documents

Publication Publication Date Title
WO2022147910A1 (en) Medical record information verification method and apparatus, and computer device and storage medium
CN110021439B (en) Medical data classification method and device based on machine learning and computer equipment
US11816439B2 (en) Multi-turn dialogue response generation with template generation
WO2020232877A1 (en) Question answer selection method and apparatus, computer device, and storage medium
CN108536800B (en) Text classification method, system, computer device and storage medium
WO2020237869A1 (en) Question intention recognition method and apparatus, computer device, and storage medium
WO2021114620A1 (en) Medical-record quality control method, apparatus, computer device, and storage medium
CN112016318B (en) Triage information recommendation method, device, equipment and medium based on interpretation model
WO2020034801A1 (en) Medical feature screening method and apparatus, computer device, and storage medium
WO2021189960A1 (en) Method and apparatus for training adversarial network, method and apparatus for supplementing medical data, and device and medium
CN112016295A (en) Symptom data processing method and device, computer equipment and storage medium
CN113705685B (en) Disease feature recognition model training, disease feature recognition method, device and equipment
CN111667890A (en) Medication recommendation method, device, equipment and storage medium based on artificial intelligence
EP3832485A1 (en) Question answering systems
CN112035611A (en) Target user recommendation method and device, computer equipment and storage medium
US11037549B1 (en) System and method for automating the training of enterprise customer response systems using a range of dynamic or generic data sets
WO2021001517A1 (en) Question answering systems
CN111723870A (en) Data set acquisition method, device, equipment and medium based on artificial intelligence
US20220230095A1 (en) Active learning via a surrogate machine learning model using knowledge distillation
WO2022257468A1 (en) Method and apparatus for updating dialogue management system, and computer device and storage medium
US11880798B2 (en) Determining section conformity and providing recommendations
US20210342530A1 (en) Framework for Managing Natural Language Processing Tools
CN111882046B (en) Multimedia data identification method, device, equipment and computer storage medium
US10929761B2 (en) Systems and methods for automatically detecting and repairing slot errors in machine learning training data for a machine learning-based dialogue system
WO2021139078A1 (en) Artificial intelligence system risk detection method and apparatus, and computer device and medium

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 21916961

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 21916961

Country of ref document: EP

Kind code of ref document: A1