CN112749307B - Medical data processing method and device and storage medium - Google Patents

Medical data processing method and device and storage medium Download PDF

Info

Publication number
CN112749307B
CN112749307B CN202011604300.2A CN202011604300A CN112749307B CN 112749307 B CN112749307 B CN 112749307B CN 202011604300 A CN202011604300 A CN 202011604300A CN 112749307 B CN112749307 B CN 112749307B
Authority
CN
China
Prior art keywords
information
original
standard
medical data
operation information
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202011604300.2A
Other languages
Chinese (zh)
Other versions
CN112749307A (en
Inventor
郑永升
周世正
梁平
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Hangzhou Yitu Healthcare Technology Co ltd
Original Assignee
Hangzhou Yitu Healthcare Technology Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Hangzhou Yitu Healthcare Technology Co ltd filed Critical Hangzhou Yitu Healthcare Technology Co ltd
Priority to CN202011604300.2A priority Critical patent/CN112749307B/en
Publication of CN112749307A publication Critical patent/CN112749307A/en
Application granted granted Critical
Publication of CN112749307B publication Critical patent/CN112749307B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/906Clustering; Classification
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/35Clustering; Classification
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16HHEALTHCARE INFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR THE HANDLING OR PROCESSING OF MEDICAL OR HEALTHCARE DATA
    • G16H10/00ICT specially adapted for the handling or processing of patient-related medical or healthcare data

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Databases & Information Systems (AREA)
  • Data Mining & Analysis (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Health & Medical Sciences (AREA)
  • General Health & Medical Sciences (AREA)
  • Computational Linguistics (AREA)
  • Computing Systems (AREA)
  • Biophysics (AREA)
  • Artificial Intelligence (AREA)
  • Evolutionary Computation (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Molecular Biology (AREA)
  • Biomedical Technology (AREA)
  • Mathematical Physics (AREA)
  • Software Systems (AREA)
  • Epidemiology (AREA)
  • Medical Informatics (AREA)
  • Primary Health Care (AREA)
  • Public Health (AREA)
  • Medical Treatment And Welfare Office Work (AREA)

Abstract

The present disclosure relates to a processing method of medical data, a processing apparatus of medical data, and a computer-readable storage medium, the processing method including acquiring original medical data, the original medical data containing original surgery information; matching the original operation information with standard words of a standard operation information table to obtain a matching relation; screening out to-be-processed operation information respectively comprising original operation information and standard words based on the matching relationship; and carrying out normalization processing on the operation information to be processed through a classification algorithm to obtain target medical data. Through the embodiments of the disclosure, the medical data containing the operation information with complex description can be accurately and efficiently processed, so that the operation information can be intelligently processed.

Description

Medical data processing method and device and storage medium
Technical Field
The present disclosure relates to the field of medical data intelligent processing technology, and in particular, to a medical data processing method, a medical data processing apparatus, and a computer-readable storage medium.
Background
In the process of processing medical data containing operation information, the prior art only realizes data processing and information sorting based on short text operation names. In medical data related to operation information, for example, a complete operation record generally includes an operation name and an operation pass, when an electronic medical record is written, because there are many details of the operation, and some differences in habits or specifications, there are great differences in the recording results of different recording modes for the same operation, where the case of lack of operation details or lack of individual operations in the operation name is very common, and even the case of an incorrect operation name may occur more seriously.
Disclosure of Invention
The present disclosure is intended to provide a medical data processing method, a medical data processing apparatus, and a computer-readable storage medium, which can accurately and efficiently process medical data containing operation information of a complex description, thereby intelligently processing the operation information.
According to one aspect of the present disclosure, there is provided a method for processing medical data, including:
acquiring original medical data, wherein the original medical data comprises original operation information;
matching the original operation information with standard words of a standard operation information table to obtain a matching relation;
screening out to-be-processed operation information respectively comprising original operation information and standard words based on the matching relationship;
and carrying out normalization processing on the operation information to be processed through a classification algorithm to obtain target medical data.
In some embodiments, the first and second light sources, wherein,
the obtaining of the matching relationship comprises: combining every two standard words of the original operation information and a standard operation information table, and respectively calculating matching degrees;
based on the matching relationship, screening out the to-be-processed operation information respectively containing the original operation information and the standard words, and the method comprises the following steps: and under the condition that the matching degree meets a preset condition, taking the original operation information and the standard words of the standard operation information table as the operation information to be processed.
In some embodiments, the normalizing the surgical information to be processed by the classification algorithm includes:
inputting the operation information to be processed into a neural network classification model;
analyzing whether the original operation information in the operation information to be processed corresponds to the standard words or not;
according to the analysis result, normalization processing is carried out on the corresponding original operation information and the standard words;
outputting standard surgical information corresponding to the original surgical information for generating the target medical data.
In some embodiments, the neural network classification model is obtained based on a pre-training mode, where the pre-training mode includes:
acquiring original medical data containing original surgical information;
extracting operation name information and operation passing information in the original operation information;
combining the operation passing information with standard words in an operation name library, wherein the operation name library is constructed based on the operation name information and comprises an original operation name and a standard operation name of original operation information;
training a pre-training model by using the combined surgical procedure information and standard words in a surgical name library as training data;
and obtaining the neural network classification model based on the pre-training model.
In some embodiments, the method for constructing the operation name library includes:
inputting the operation name information into a neural network model for coding an operation name so as to extract a standard operation name code from the operation name information;
and constructing an operation name library containing the original operation names and the coded standard operation names based on the operation name information and the standard operation name codes.
In some embodiments, wherein obtaining the target medical data comprises:
characterizing the original surgical information in a standard surgical information format;
outputting the target medical data to update the raw medical data.
In some embodiments, among others, further comprising:
and analyzing the content of the original operation information to obtain an operation name and a long text operation process.
According to one aspect of the present disclosure, there is provided a processing apparatus of medical data, comprising:
an acquisition unit configured for acquiring raw medical data, the raw medical data containing raw surgical information;
the matching module is configured to match the original operation information with standard words of a standard operation information table to obtain a matching relation;
the screening module is configured to screen out to-be-processed surgical information respectively comprising original surgical information and standard words based on the matching relationship;
the processing module is configured to perform normalization processing on the surgical information to be processed through a classification algorithm to obtain target medical data.
According to one aspect of the present disclosure, there is provided a computer-readable storage medium having stored thereon computer-executable instructions that, when executed by a processor, implement:
the method for processing medical data according to the above.
The medical data processing method, the medical data processing device and the computer readable storage medium of various embodiments of the present disclosure are realized by acquiring original medical data, wherein the original medical data contains original surgery information; matching the original operation information with standard words of a standard operation information table to obtain a matching relation; screening out to-be-processed operation information respectively comprising original operation information and standard words based on the matching relationship; and performing normalization processing on the operation information to be processed through a classification algorithm to obtain target medical data, so that the target medical data can be generated based on a standard operation information table to process and sort the original medical data based on the original operation information, particularly the original medical data containing the operation process description content of the long text, to obtain a unified normalization processing result. According to the processing method, the idea and the method for extracting the standard operation name from the long text of the operation process can be applied to the scene of supplementing and correcting the original operation name by using the content of the operation process, the recall rate of the data processing performance from the upper end to the upper end reaches 95%, various operation information is completely, accurately and efficiently expressed by outputting the standard information and a uniform coding mode, and therefore the accuracy and the efficiency of medical research and medical diagnosis and treatment are improved.
It is to be understood that both the foregoing general description and the following detailed description are exemplary and explanatory only and are not restrictive of the disclosure, as claimed.
Drawings
In the drawings, which are not necessarily drawn to scale, like reference numerals may designate similar components in different views. Like reference numerals with letter suffixes or like reference numerals with different letter suffixes may represent different instances of like components. The drawings illustrate various embodiments generally, by way of example and not by way of limitation, and together with the description and claims, serve to explain the disclosed embodiments.
Fig. 1 shows a flow chart of a method of processing medical data to which an embodiment of the present disclosure relates;
fig. 2 shows an architecture diagram of a medical data processing apparatus according to an embodiment of the present disclosure.
Detailed Description
In order to make the objects, technical solutions and advantages of the embodiments of the present disclosure more clear, the technical solutions of the embodiments of the present disclosure will be described below clearly and completely with reference to the accompanying drawings of the embodiments of the present disclosure. It is to be understood that the described embodiments are only a few embodiments of the present disclosure, and not all embodiments. All other embodiments, which can be derived by a person skilled in the art from the described embodiments of the disclosure without inventive step, are within the scope of protection of the disclosure.
Unless otherwise defined, technical or scientific terms used herein shall have the ordinary meaning as understood by one of ordinary skill in the art to which this disclosure belongs. The word "comprising" or "comprises", and the like, means that the element or item listed before the word covers the element or item listed after the word and its equivalents, but does not exclude other elements or items.
To maintain the following description of the embodiments of the present disclosure clear and concise, a detailed description of known functions and known components have been omitted from the present disclosure.
The present disclosure relates to processing of medical data for processing of surgical information against a raw recording. In the process of processing medical data containing operation information, the prior art only realizes data processing and information sorting based on short text operation names, for example, realizes coding based on short text operation names, and cannot perform corresponding data processing based on long text, for example, aiming at texts with more than hundreds of characters. In medical data related to operation information, for example, a complete operation record generally includes an operation name and an operation pass, when an electronic medical record is written, because the operation has a lot of details and some differences in habits or specifications, different recording modes, for example, different doctors in different hospitals have a great difference in recording results of the same operation, wherein the case that the operation details are lacked or individual operations are lacked in the operation name is very common, and the case that the operation name is wrong is more serious, even occurs. Based on the existing difference, a more complete operation record needs to be generated, so as to unify the standardized output of the operation information to solve some problems encountered in practical clinics, such as supplementing and correcting the operation name.
As one aspect, as shown in fig. 1, an embodiment of the present disclosure provides a method for processing medical data, including:
s101: acquiring original medical data, wherein the original medical data comprises original operation information;
s102: matching the original operation information with standard words of a standard operation information table to obtain a matching relation;
s103: screening out to-be-processed operation information respectively comprising original operation information and standard words based on the matching relationship;
s104: and carrying out normalization processing on the operation information to be processed through a classification algorithm to obtain target medical data.
One of the inventive concepts of the present disclosure is to process and sort information of original medical data based on a standard operation information table according to original operation information, particularly original medical data of operation procedure description contents including a long text, so as to obtain a unified normalization processing result to generate target medical data.
The original medical data in the embodiments of the present disclosure, which belongs to the data source, need not be particularly limited, and may be historical data or current real-time data. From the aspect of data format, the medical record text data, video data, audio data and the like can be used as long as the original operation information which can be contained in the medical record text data, the original operation information can be identified by identification means, such as text identification (for example, NLP identification, OCR identification and the like), some medical information such as operation names and operation courses which are described in the medical record text data, the medical information content which is identified by character splitting, word splitting and the like, and the like can be identified. The ICD9-CM-3 standard information table can be referred to, and the raw medical data of the embodiments of the present disclosure includes various operation information from which the operation name information can be extracted and information describing the operation process, especially for the long text content information, the long text content can include hundreds of and thousands of words, and can be recorded in the raw medical data based on the form of data entry. In some embodiments, the raw medical data of the present disclosure may also be medical records, diagnostic books, surgical reports, which include a plurality of or a plurality of surgical names and surgical descriptions, and these information may or may not include codes characterizing the respective information. In various data information analysis scenarios, the original medical data in the embodiment of the present disclosure may be a medical text of the original medical data input by a user through an interactive interface and an input device, and may be used for interpretation of relevant medical information by way of annotation or analysis, such as manual or machine.
In each embodiment, in the implementation process of the present disclosure, the operation name information and the operation pass description information in the original medical operation information of the present embodiment may be extracted through a neural network model. In the implementation process, the specific neural network model is not particularly limited, and can be implemented by adopting a neural network model which meets the requirements and is matched with the architecture. In a more preferable scheme, the extraction accuracy of various information can be further optimized through an adaptive neural network model on the basis of a pre-training model. For extracting medical entity content, entity extraction can be performed based on a text recognition mode, for example, a text recognition mode such as NLP (natural language processing), and the entity is divided and classified in combination with medical concepts. More preferably, the entity can be analyzed by combining with a standard medical information table, such as various information tables of ICD, and extracted on the basis of combining with a proper medical rule analysis result.
In some embodiments, said obtaining a matching relationship of the present disclosure comprises: combining the original operation information and each standard word of a standard operation information table in pairs, and respectively calculating matching degrees;
based on the matching relationship, screening out the to-be-processed operation information respectively containing the original operation information and the standard words, and the method comprises the following steps: and under the condition that the matching degree meets a preset condition, taking the original operation information and standard words of a standard operation information table as operation information to be processed.
Specifically, the standard surgical information table of the present embodiment can be obtained through the ICD-9-CM-3 standard table, and the surgical information can be calibrated in detail and the corresponding surgical code can be given through the ICD-9-CM-3 standard table.
The method is characterized in that two combinations are performed on original operation information in original medical data and standard medical information in an ICD-9-CM-3 standard table or standard words. For example, the input in this embodiment is surgical case history data, wherein the original text records the surgical information as
"name of operation: coronary angiography + PCI
The operation process comprises the following steps: the operation area is disinfected conventionally, a sterile hole towel is laid, the right radial artery puncture is successful, and the radiography shows that: the left trunk is normal, the front descending branch is 7-8 sections and is 80% narrow, the circumgyrating branch is 11 sections and is 80% narrow, the right crown is 2-3 sections and is 40% narrow, and the PDA opening is 70% narrow. PCI (peripheral component interconnect) technology: replacing AL1guiding6F to the left coronary mouth, sending a BMW guide wire to the distal end of an anterior descending branch, sending Tazuna 2.0X 15mm saccule to the lesion with 7-8 sections to expand for 7-8 seconds at 12-13atm, then implanting Firenbird 3.0X 20mm stent to expand for 8 seconds at 14atm, releasing 3.5X 18mm stent to release at 7 sections to 12atm8 seconds, expanding the saccule to expand stent with 3.0X 10mm for 7-8 seconds, sending BMW guide wire to the distal end of a circumflex branch with 2.0X 15mm saccule to expand for 16atm8 seconds, releasing 3.5X 15mm stent to circumflex 12atm8 seconds, repeating the radiography to show that the stent expands well without residual stenosis, treating the blood flow TIMI3 grade, removing the catheter and the sheath, local compression hemostasis after the operation,
the name of the operation is only written as 'PCI operation', but according to the recorded content of the operation and the medical knowledge, the type of the stent can be determined, several stents are actually implanted, and the patient also has percutaneous coronary artery sacculus dilatation and plasty. Then, based on the data processing method of the present disclosure, at least error correction of the procedure name according to the procedure pass can be realized for the input.
The standard surgical information table of this embodiment includes a plurality of rows of data entries containing information such as primary code, additional code, surgical name, category, etc. And combining the input text original text and each standard word in the standard operation information table in pairs to form a data item in the format of 'original text and standard word'. Such as forming
"1. (original, non-drug eluting coronary stents placed)
2. (textual, coronary artery drug-coated stent placement)
3. (plain, drug-eluting coronary stent set)
… … "such a pair of information (pair).
The operation passing information is described through text, so that the operation passing information is the detailed content of the operation name, and the operation passing information and the operation name have a corresponding relation. Based on the corresponding relationship, each pair of combined pair can be used as a piece of training data in the embodiment.
The matching degree is calculated for each pair combination of the above forms, and the combination with the matching degree lower than a preset threshold is discarded and is not used as a data normalization target. And taking the combination with the matching degree meeting the preset threshold as a to-be-selected normalization object.
Specifically, the determination method of the matching degree in this embodiment may be calculated based on the number of words included in each of the original text and the standard word, for example, the matching degree is calculated by the number of words in intersection between the original text and the standard word and the number of words in union of the original text and the standard word, or is referred to as the similarity between the original text and the standard word. By way of example with simple text numbers, the original text contains 'bilateral thyroidectomy', a standard word after two-two combination is 'thyroidectomy', the intersection of the two is 'thyroidectomy', the number of words is 4, and the union of the two is the sum of the lengths of two character strings minus the number of intersection words: 9+8-4=13, so the similarity of this pairwise combination pair is "4/13". The embodiment can perform similarity judgment on long text operation passes, such as operation description characters in the shape of the Chinese character thousand.
In some embodiments, the normalizing the surgical information to be processed by the classification algorithm of the present disclosure includes:
inputting the operation information to be processed into a neural network classification model;
analyzing whether the original operation information in the operation information to be processed corresponds to the standard words or not;
according to the analysis result, normalization processing is carried out on the corresponding original operation information and the standard words;
outputting standard surgical information corresponding to the original surgical information for generating the target medical data.
Specifically, continuing with the above example, assuming that the pair of pair is used as the surgical information to be processed, the present embodiment may use the pair of pair as the surgical information to be processed
"1. (original, non-drug eluting coronary stents placed)
2. (textual, coronary artery drug-coated stent placement)
3. (in original text, drug-eluting coronary stents are placed)
……”
And inputting the neural network classification model. The method can construct an adaptive neural network classification model, and analyze and normalize the operation information to be processed in the modes of adaptively configuring corresponding frameworks, the number of neurons and the like.
For example, each group of information is classified, and it is determined whether the operation information from the original medical data and the standard word screened from the standard information table have related or similar medical concepts, for example, whether the two have a relationship of inclusion. If the neural network classification model considers that the two belong to the situation with the inclusion relationship, the neural network classification model outputs a corresponding result, for example, outputs "1" to characterize a positive conclusion. If the neural network classification model considers that the two do not belong to the situation with the inclusion relationship, the neural network classification model outputs a corresponding result, for example, outputs "0" to characterize a negative conclusion.
In order to optimize the performance of the neural network classification model, in various embodiments of the present disclosure, the performance of the neural network classification model may be increased by optimizing a pre-training model. In particular, large-scale medical anticipation data may be collected, with the data sources targeting surgical information, including but not limited to: medical record data of each medical structure, medical data of each medical research institution, medical data in each medical information system, medical textbooks of various languages, medical dictionaries, medical treatises, medical treatment data in online databases, department libraries, website webpage data, forum data and the like. In the embodiment of the disclosure, a pre-training model can be constructed to pre-train the acquired large-scale medical anticipation data, so that the model learns various medical knowledge from the large-scale data, and can be applied to specific tasks on the basis to improve the performance of the neural network model of the disclosure. Pre-training models include, but are not limited to: BERT model, XLNET model, roBERTa model, etc.
"1" for the aforementioned input neural network classification model based on the neural network classification model of the present embodiment (plain, non-drug eluting coronary stent placement)
2. (textual, coronary artery drug-coated stent placement)
3. (in original text, drug-eluting coronary stents are placed)
… … "can output standard surgical information corresponding to the original text. Since the present disclosure can be aided with the ICD-9-CM-3 table, such standard surgical information may include standard surgical names, standard surgical name codes, etc., such as output:
"1.36.0700 | drug eluting coronary artery stenting
2.00.6600 x 004| percutaneous coronary balloon dilatation
3.00.4700 | Stent for implanting into three blood vessels
4.88.5700 other and unspecified coronary angiography "
Intuitively, it can be understood that through the above results outputted by the neural network classification model of the present embodiment, accurate information description of the operation can be obtained, including details about what type of stent is, how many stents are actually implanted, and how the patient performs percutaneous coronary angioplasty and the like.
As another example, an embodiment of the present disclosure provides an input of
"name of operation: temporary pacemaker Implantation permanent pacemaker Implantation (DDD)
The operation process comprises the following steps: the patient takes the horizontal position, and is disinfected and paved with 2 percent lidocaine local anesthesia by a conventional method. Puncturing the right femoral vein, feeding a guide steel wire along the puncture point, confirming that the guide steel wire is positioned in the inferior vena cava by an X-ray, feeding the guide steel wire into a sheath along the guide steel wire, extracting the guide steel wire and an expansion sheath, feeding the temporary pacing electrode into the sheath, feeding the electrode into the right ventricular cardiac apex part under the X-ray, connecting the pacing electrode with a pacemaker, measuring parameters, sensing pacing well, setting pacing voltage to be 5V, and setting the pacing frequency to be 40 times/minute. The skin is cut in parallel for about 5cm at the scar of the original operation in the left thoracic operation area, the subcutaneous tissue is separated layer by layer and blunt, the pacemaker is fully exposed and taken out, the size of the original bag is adjusted to be consistent with that of the new pacemaker, and the hemostatic gauze is fully filled for standby. The parameter threshold of the original atrial electrode is tested to be 0.8V, the impedance is 560 omega, and 3mv is sensed. The parameter threshold of the original ventricular electrode is tested to be 0.7V, the impedance is 480 omega, and 12mv is sensed. Order the patient to breathe cough deeply, the electrode does not have the dislocation, and the signal of pacing does not drop, and maximum voltage pacing diaphragm does not have the stimulation. Connecting the pulse generator and screwing the fixing screw. The pulse generator is placed into the pacemaker sac, and the pacemaker is fixed by stitching. Sewing the closed bag layer by layer, wrapping by local pressurization, and pressing with sand bag. The temporary pacing electrode catheter was withdrawn, the right inguinal area was bandaged, and the patient was ordered to brake for 6 hours on the right lower limb. The postoperative patient has nausea and vomiting, the blood pressure is measured to be 200/100mmHg, isosorbide dinitrate is given for quiescent point to control the blood pressure, the antiemetic treatment is carried out, the symptom of the patient is relieved, the operation name is 'permanent pacemaker implantation', but the old pacemaker can be taken out and replaced by a new pacemaker according to the recorded content of the operation process and the medical knowledge, the replacement is the replacement of the pacemaker instead of the implantation, and the replacement is the operation comprising the taking out and the re-implantation. Then, based on the data processing method of the present disclosure, at least the correction of the operation name according to the operation pass can be realized for the input, and the output can be the neural network classification model implemented by the present embodiment
"37.7800| temporary transvenous pacemaker System Placement
37.8701| dual chamber permanent pacemaker replacement.
Intuitively, it can be understood that the above results output by the neural network classification model of the present embodiment can obtain accurate information description of the operation, that is, the replacement of the pacemaker instead of the implantation, the replacement including the operations of extraction and re-implantation, and the like, when the old pacemaker is actually extracted and the new pacemaker is replaced.
In some embodiments, the neural network classification model of the present disclosure is obtained based on pre-training, and the pre-training mode includes:
acquiring original medical data containing original surgical information;
extracting operation name information and operation passing information in the original operation information;
combining the operation passing information with standard words in an operation name library, wherein the operation name library is constructed based on the operation name information and comprises an original operation name and a standard operation name of original operation information;
training a pre-training model by using the combined surgical procedure information and standard words in a surgical name library as training data;
and obtaining the neural network classification model based on the pre-training model.
In particular, embodiments of the present disclosure may construct and iteratively train a training model to which the present disclosure relates based on a sufficient amount of data to satisfy the training, such as one or more batches, or massive acquisitions of raw medical data containing raw surgical information.
Continuing with the foregoing example, the foregoing procedure information, entitled "coronary angiography + PCI procedure," may be used as input data to the pre-trained model of the present embodiment. Based on the surgical name "coronarography + PCI surgery", the present embodiment can build a surgical name library that is intended to incorporate the original surgical name, and a standard surgical name as the primary purpose, to build data for training in combination with the surgical name. These data contain pairwise combinations of the original procedure and each standard word in the library of procedure names, each combination being the training data for training. On the basis of training the pre-training model of the present embodiment, the pre-training model can be used as an initialization model to train an end-to-end neural network classification model related to the embodiments of the present disclosure.
In some embodiments, the method of constructing the library of procedure names of the present disclosure comprises:
inputting the operation name information into a neural network model for coding an operation name so as to extract a standard operation name code from the operation name information;
and constructing an operation name library containing the original operation names and the coded standard operation names based on the operation name information and the standard operation name codes.
Specifically, with the operation name "coronary angiography + PCI operation", the ICD standard operation name code can be extracted from the operation name text in the original medical data, preferably by a neural network model encoding the operation name, in combination with the operation name "coronary angiography + PCI operation", based on the result of the original operation name text code
"1. Other and unspecified coronary angiography
2. Non-drug eluting coronary stent placement "
The operation name library of the present embodiment is constructed, which includes the original operation name and the encoded standard operation name.
In the embodiment, the original operation name is combined with the standard word in the operation name library after the standard processing, so that a corresponding pre-training model is constructed, and an end-to-end neural network classification model is trained, thereby greatly improving the intelligent processing efficiency of the embodiment.
In detail, if the pre-trained model of the embodiments of the present disclosure is used, the neural network classification model of the embodiments of the present disclosure may be substantially consistent with the pre-trained model, and the pre-trained model is required to initialize the neural network classification model of the embodiments of the present disclosure. The neural network classification model of the present embodiment may use any other classification model if the pre-trained model is not employed, including but not limited to: CNN, LSTM, transformer, etc. Based on the pre-trained model finetune in some embodiments, the model converges faster with a 2% improvement in performance.
In some embodiments, obtaining target medical data of the present disclosure includes:
characterizing the original surgical information in a standard surgical information format;
outputting the target medical data to update the raw medical data.
Specifically, with the above contents being combined, on the basis that the neural network classification model of the present embodiment outputs a corresponding result, and for a combination in which the output result is a positive conclusion, the present embodiment may extract the operation name and the operation code in the ICD-9-CM-3 standard table, and perform a unified expression process on the originally input operation data. Compared with the originally input operation data, it can be seen that the target medical data output by the embodiment can be used for carrying out precision processing on the original medical data in the modes of supplementing omission and correcting errors.
In some embodiments, the standard surgical information table of the present disclosure is constructed in a manner including:
and constructing based on the label of the ICD standard table, wherein the labeled result comprises the result according to the operation name and the coded result.
In combination with the foregoing, the present embodiment may construct a standard information table based on the labeling result, which includes information such as main codes, additional codes, operation names, and categories.
As one of the aspects of the present disclosure, as shown in fig. 2, the present disclosure also provides a processing apparatus of medical data, including:
an acquisition unit configured for acquiring raw medical data, the raw medical data containing raw surgical information;
the matching module is configured to match the original operation information with standard words of a standard operation information table to obtain a matching relation;
the screening module is configured to screen out to-be-processed surgical information respectively comprising original surgical information and standard words based on the matching relationship;
the processing module is configured to perform normalization processing on the surgical information to be processed through a classification algorithm to obtain target medical data.
In some embodiments, the obtaining unit of the present disclosure may be an input device, a screen capture device, a text recognition device, and the like, and is intended to enable obtaining a text generated based on an original input; and/or generating medical data based on AI algorithm recognition.
In combination with the foregoing, in some embodiments, the matching module of the present disclosure may be further configured to:
and combining the original operation information and each standard word of the standard operation information table in pairs, and respectively calculating the matching degree.
In combination with the foregoing, in some embodiments, the screening module of the present disclosure may be further configured to:
and under the condition that the matching degree meets a preset condition, taking the original operation information and the standard words of the standard operation information table as the operation information to be processed.
In combination with the foregoing, in some embodiments, the processing module of the present disclosure may be further configured to:
inputting the surgical information to be processed into a neural network classification model;
analyzing whether the original operation information in the operation information to be processed corresponds to the standard words or not;
according to the analysis result, normalization processing is carried out on the corresponding original operation information and the standard words;
outputting standard surgical information corresponding to the original surgical information for generating the target medical data.
In some embodiments, the processing module of the present disclosure may be further configured to:
acquiring original medical data containing original surgical information;
extracting operation name information and operation passing information in the original operation information;
combining the operation passing information with standard words in an operation name library, wherein the operation name library is constructed based on the operation name information and comprises an original operation name and a standard operation name of original operation information;
training a pre-training model by using the combined surgical procedure information and standard words in a surgical name library as training data;
and obtaining the neural network classification model based on the pre-training model.
In some embodiments, the processing module of the present disclosure may be further configured to:
inputting the operation name information into a neural network model for coding an operation name so as to extract a standard operation name code from the operation name information;
and constructing an operation name library containing the original operation names and the coded standard operation names based on the operation name information and the standard operation name codes.
In particular, one of the inventive concepts of the present disclosure is directed to a method for medical procedure planning by obtaining raw medical data, the raw medical data containing raw surgical information; matching the original operation information with standard words of a standard operation information table to obtain a matching relation; screening out to-be-processed operation information respectively comprising the original operation information and the standard words based on the matching relationship; and performing normalization processing on the operation information to be processed through a classification algorithm to obtain target medical data, so that the target medical data can be generated based on a standard operation information table to process and sort the original medical data based on the original operation information, particularly the original medical data containing the operation process description content of the long text, to obtain a unified normalization processing result. According to the processing method, the idea and the method for extracting the standard operation name from the long text of the operation process can be applied to the scene of supplementing and correcting the original operation name by using the content of the operation process, the recall rate of the data processing performance from the upper end to the upper end reaches 95%, various operation information is completely, accurately and efficiently expressed by outputting the standard information and a uniform coding mode, and therefore the accuracy and the efficiency of medical research and medical diagnosis and treatment are improved.
As one of the aspects of the present disclosure, the present disclosure also provides a computer-readable storage medium having stored thereon computer-executable instructions, which when executed by a processor, mainly implement a processing method according to the medical data described above, including at least:
acquiring original medical data, wherein the original medical data comprises original operation information;
matching the original operation information with standard words of a standard operation information table to obtain a matching relation;
screening out to-be-processed operation information respectively comprising original operation information and standard words based on the matching relationship;
and carrying out normalization processing on the operation information to be processed through a classification algorithm to obtain target medical data.
In some embodiments, a processor executing computer-executable instructions may be a processing device including more than one general-purpose processing device, such as a microprocessor, central Processing Unit (CPU), graphics Processing Unit (GPU), or the like. More specifically, the processor may be a Complex Instruction Set Computing (CISC) microprocessor, reduced Instruction Set Computing (RISC) microprocessor, very Long Instruction Word (VLIW) microprocessor, processor running other instruction sets, or processors running a combination of instruction sets. The processor may also be one or more special-purpose processing devices such as an Application Specific Integrated Circuit (ASIC), a Field Programmable Gate Array (FPGA), a Digital Signal Processor (DSP), system on chip (SoC), or the like.
In some embodiments, the computer-readable storage medium may be a memory, such as a read-only memory (ROM), a random-access memory (RAM), a phase-change random-access memory (PRAM), a static random-access memory (SRAM), a dynamic random-access memory (DRAM), an electrically erasable programmable read-only memory (EEPROM), other types of random-access memory (RAM), a flash disk or other form of flash memory, a cache, a register, a static memory, a compact disc read-only memory (CD-ROM), a Digital Versatile Disc (DVD) or other optical storage, a tape cartridge or other magnetic storage device, or any other possible non-transitory medium that may be used to store information or instructions that may be accessed by a computer device, and so forth.
In some embodiments, the computer-executable instructions may be implemented as a plurality of program modules that collectively implement the method of displaying medical images according to any one of the present disclosure.
The present disclosure describes various operations or functions that may be implemented as or defined as software code or instructions. The display unit may be implemented as software code or modules of instructions stored on a memory, which when executed by a processor may implement the respective steps and methods.
Such content may be source code or differential code ("delta" or "patch" code) that may be executed directly ("object" or "executable" form). A software implementation of the embodiments described herein may be provided through an article of manufacture having code or instructions stored thereon or through a method of operating a communication interface to transmit data through the communication interface. A machine or computer-readable storage medium may cause a machine to perform the functions or operations described, and includes any mechanism for storing information in a form accessible by a machine (e.g., a computing display device, an electronic system, etc.), such as recordable/non-recordable media (e.g., read Only Memory (ROM), random Access Memory (RAM), magnetic disk storage media, optical storage media, flash memory display devices, etc.). The communication interface includes any mechanism for interfacing with any of a hardwired, wireless, optical, etc. medium to communicate with other display devices, such as a memory bus interface, a processor bus interface, an internet connection, a disk controller, etc. The communication interface may be configured by providing configuration parameters and/or sending signals to prepare the communication interface to provide data signals describing the software content. The communication interface may be accessed by sending one or more commands or signals to the communication interface.
The computer-executable instructions of embodiments of the present disclosure may be organized into one or more computer-executable components or modules. Aspects of the disclosure may be implemented with any number and combination of such components or modules. For example, aspects of the disclosure are not limited to the specific computer-executable instructions or the specific components or modules illustrated in the figures and described herein. Other embodiments may include different computer-executable instructions or components having more or less functionality than illustrated and described herein.
The above description is intended to be illustrative, and not restrictive. For example, the above-described examples (or one or more versions thereof) may be used in combination with each other. For example, other embodiments may be used by those of ordinary skill in the art upon reading the above description. In addition, in the foregoing detailed description, various features may be grouped together to streamline the disclosure. This should not be interpreted as an intention that a disclosed feature not claimed is essential to any claim. Rather, the subject matter of the present disclosure may lie in less than all features of a particular disclosed embodiment. Thus, the following claims are hereby incorporated into the detailed description as examples or embodiments, with each claim standing on its own as a separate embodiment, and it is contemplated that the embodiments can be combined with each other in various combinations or permutations. The scope of the disclosure should be determined with reference to the appended claims, along with the full scope of equivalents to which such claims are entitled.
The above embodiments are merely exemplary embodiments of the present disclosure, which is not intended to limit the present disclosure, and the scope of the present disclosure is defined by the claims. Various modifications and equivalents of the disclosure may occur to those skilled in the art within the spirit and scope of the disclosure, and such modifications and equivalents are considered to be within the scope of the disclosure.

Claims (7)

1. A method of processing medical data, comprising:
acquiring original medical data, wherein the original medical data comprises original operation information;
matching the original operation information with standard words of a standard operation information table to obtain a matching relation: combining the original operation information and each standard word of a standard operation information table in pairs, and respectively calculating matching degrees;
screening out the to-be-processed operation information respectively comprising the original operation information and the standard words based on the matching relationship: under the condition that the matching degree meets a preset condition, taking the original operation information and standard words of a standard operation information table as operation information to be processed;
performing normalization processing on the surgical information to be processed through a classification algorithm to obtain target medical data: inputting the surgical information to be processed into a neural network classification model; analyzing whether the original operation information in the operation information to be processed corresponds to the standard words or not; according to the analysis result, normalization processing is carried out on the corresponding original operation information and the standard words; outputting standard surgical information corresponding to the original surgical information for generating the target medical data;
the neural network classification model is obtained based on pre-training, and the pre-training mode comprises the following steps:
acquiring original medical data containing original surgical information;
extracting operation name information and operation passing information in the original operation information;
combining the operation passing information with standard words in an operation name library, wherein the operation name library is constructed based on the operation name information and comprises an original operation name and a standard operation name of original operation information;
training a pre-training model by using the combined surgical procedure information and standard words in a surgical name library as training data;
and obtaining the neural network classification model based on the pre-training model.
2. The method of claim 1, wherein the operation name library is constructed by:
inputting the operation name information into a neural network model for encoding an operation name to extract a standard operation name code from the operation name information;
and constructing an operation name library containing the original operation names and the coded standard operation names based on the operation name information and the standard operation name codes.
3. The method of claim 1, wherein obtaining target medical data comprises:
characterizing the original surgical information in a standard surgical information format;
outputting the target medical data to update the raw medical data.
4. The method of claim 1, wherein the standard surgical information table is constructed in a manner comprising:
and constructing based on the label of the ICD standard table, wherein the labeled result comprises the result according to the operation name and the coded result.
5. The method of claim 1, further comprising:
and analyzing the content of the original operation information to obtain an operation name and a long text operation process.
6. Apparatus for processing medical data, comprising:
an acquisition unit configured for acquiring raw medical data, the raw medical data containing raw surgical information;
a matching module configured to match the original surgery information with standard words of a standard surgery information table to obtain a matching relationship: combining the original operation information and each standard word of a standard operation information table in pairs, and respectively calculating matching degrees;
a screening module configured to screen out to-be-processed surgical information including the original surgical information and the standard words, respectively, based on the matching relationship: under the condition that the matching degree meets a preset condition, taking the original operation information and standard words of a standard operation information table as operation information to be processed;
a processing module configured to perform normalization processing on the surgical information to be processed through a classification algorithm to obtain target medical data: inputting the operation information to be processed into a neural network classification model; analyzing whether the original operation information in the operation information to be processed corresponds to the standard words or not; according to the analysis result, normalization processing is carried out on the corresponding original operation information and the standard words; outputting standard operation information corresponding to the original operation information for generating target medical data;
the pre-training module is used for obtaining the neural network classification model based on pre-training, and the pre-training mode comprises the following steps:
acquiring original medical data containing original surgical information;
extracting operation name information and operation passing information in the original operation information;
combining the operation passing information with standard words in an operation name library, wherein the operation name library is constructed based on the operation name information and comprises an original operation name and a standard operation name of original operation information;
training a pre-training model by using the combined surgical procedure information and standard words in a surgical name library as training data;
and obtaining the neural network classification model based on the pre-training model.
7. A computer-readable storage medium having computer-executable instructions stored thereon which, when executed by a processor, implement:
the method of processing medical data according to any one of claims 1 to 5.
CN202011604300.2A 2020-12-30 2020-12-30 Medical data processing method and device and storage medium Active CN112749307B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202011604300.2A CN112749307B (en) 2020-12-30 2020-12-30 Medical data processing method and device and storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202011604300.2A CN112749307B (en) 2020-12-30 2020-12-30 Medical data processing method and device and storage medium

Publications (2)

Publication Number Publication Date
CN112749307A CN112749307A (en) 2021-05-04
CN112749307B true CN112749307B (en) 2022-11-08

Family

ID=75647160

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202011604300.2A Active CN112749307B (en) 2020-12-30 2020-12-30 Medical data processing method and device and storage medium

Country Status (1)

Country Link
CN (1) CN112749307B (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115689609A (en) * 2022-10-28 2023-02-03 上海润益互联网科技有限公司 Business opportunity pushing method, system, medium and equipment based on industrial information matching

Family Cites Families (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9946846B2 (en) * 2007-08-03 2018-04-17 A-Life Medical, Llc Visualizing the documentation and coding of surgical procedures
CN105069123B (en) * 2015-08-13 2018-06-26 易保互联医疗信息科技(北京)有限公司 A kind of automatic coding and system of Chinese surgical procedure information
CN106919793B (en) * 2017-02-24 2019-12-06 黑龙江特士信息技术有限公司 Data standardization processing method and device for medical big data
CN109256216B (en) * 2018-08-14 2023-06-27 平安医疗健康管理股份有限公司 Medical data processing method, medical data processing device, computer equipment and storage medium
CN110767296A (en) * 2019-10-09 2020-02-07 北京雅丁信息技术有限公司 Operation coding method based on semantic similarity
CN111191415A (en) * 2019-12-16 2020-05-22 山东众阳健康科技集团有限公司 Operation classification coding method based on original operation data
CN111192667A (en) * 2019-12-16 2020-05-22 山东众阳健康科技集团有限公司 Method for prompting operation code based on intelligent operation

Also Published As

Publication number Publication date
CN112749307A (en) 2021-05-04

Similar Documents

Publication Publication Date Title
CN112802575B (en) Medication decision support method, device, equipment and medium based on graphic state machine
Moons et al. Using ChatGPT and Google Bard to improve the readability of written patient information: a proof of concept
CN109920540A (en) Construction method, device and the computer equipment of assisting in diagnosis and treatment decision system
CN112735544B (en) Medical record data processing method, device and storage medium
CN107578798A (en) The processing method and system of electronic health record
CN108366750A (en) The system and method for ECG waveform adaptively interpreted
CN107958708A (en) Risk trend appraisal procedure and system after institute
CN114913953B (en) Medical entity relationship identification method and device, electronic equipment and storage medium
CN112749307B (en) Medical data processing method and device and storage medium
CN106844351A (en) A kind of medical institutions towards multi-data source organize class entity recognition method and device
CN112735545B (en) Self-training method, model, processing method, device and storage medium
CN110164524A (en) A kind of hemiplegia patient training mission adaptive matching method and its system
CN112542242A (en) Data transformation/symptom scoring
CN117497115A (en) Automatic generation method, system, terminal and medium for medical information abstract
CN110895580B (en) ICD operation and operation code automatic matching method based on deep learning
KR102623753B1 (en) Method and apparatus for generating dental electronic chart using speech recognition
CN112700825B (en) Medical data processing method, device and storage medium
CN110377698B (en) Reading understanding-based task processing method and device, equipment and readable medium
CN112989971A (en) Electrocardiogram data fusion method and device for different data sources
CN113392633B (en) Medical named entity identification method, device and storage medium
US9408550B2 (en) Evolving serial comparison system with critical alert notifications
Hu et al. Generating medical assessments using a neural network model: algorithm development and validation
WO2022050310A1 (en) Program, information processing device, information processing method and model generation method
CN109036503A (en) A kind of system and method generating electrocardiographic diagnosis report
KR102330360B1 (en) Method, Apparatus and Recording for Guiding Of Recommendation Treatment

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant