WO2023029506A1 - Illness state analysis method and apparatus, electronic device, and storage medium - Google Patents

Illness state analysis method and apparatus, electronic device, and storage medium Download PDF

Info

Publication number
WO2023029506A1
WO2023029506A1 PCT/CN2022/087710 CN2022087710W WO2023029506A1 WO 2023029506 A1 WO2023029506 A1 WO 2023029506A1 CN 2022087710 W CN2022087710 W CN 2022087710W WO 2023029506 A1 WO2023029506 A1 WO 2023029506A1
Authority
WO
WIPO (PCT)
Prior art keywords
data
information
treatment
target
disease
Prior art date
Application number
PCT/CN2022/087710
Other languages
French (fr)
Chinese (zh)
Inventor
成芳
Original Assignee
康键信息技术(深圳)有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 康键信息技术(深圳)有限公司 filed Critical 康键信息技术(深圳)有限公司
Publication of WO2023029506A1 publication Critical patent/WO2023029506A1/en

Links

Images

Classifications

    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16HHEALTHCARE INFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR THE HANDLING OR PROCESSING OF MEDICAL OR HEALTHCARE DATA
    • G16H50/00ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics
    • G16H50/20ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics for computer-aided diagnosis, e.g. based on medical expert systems
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/35Clustering; Classification
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/279Recognition of textual entities
    • G06F40/289Phrasal analysis, e.g. finite state techniques or chunking
    • G06F40/295Named entity recognition
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/044Recurrent networks, e.g. Hopfield networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • G06N3/084Backpropagation, e.g. using gradient descent
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16HHEALTHCARE INFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR THE HANDLING OR PROCESSING OF MEDICAL OR HEALTHCARE DATA
    • G16H10/00ICT specially adapted for the handling or processing of patient-related medical or healthcare data
    • G16H10/60ICT specially adapted for the handling or processing of patient-related medical or healthcare data for patient-specific data, e.g. for electronic patient records
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16HHEALTHCARE INFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR THE HANDLING OR PROCESSING OF MEDICAL OR HEALTHCARE DATA
    • G16H15/00ICT specially adapted for medical reports, e.g. generation or transmission thereof
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16HHEALTHCARE INFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR THE HANDLING OR PROCESSING OF MEDICAL OR HEALTHCARE DATA
    • G16H50/00ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics
    • G16H50/70ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics for mining of medical data, e.g. analysing previous cases of other patients

Definitions

  • the present application relates to the field of artificial intelligence and digital medical technology, and in particular to a disease analysis method, device, electronic equipment and storage medium.
  • the main purpose of the embodiments of the present application is to provide a disease analysis method, device, electronic equipment and storage medium, aiming to obtain reference pathological data by analyzing the patient's historical disease, and improve the efficiency of disease analysis.
  • the embodiment of the present application proposes a disease analysis method, the method comprising:
  • a condition analysis report is generated according to the diagnostic conclusion label.
  • the embodiment of the present application proposes a disease analysis device, which includes:
  • Electronic medical record data acquisition module used to obtain electronic medical record data
  • a feature extraction module configured to extract entity features from the electronic medical record data to obtain target disease information
  • a processing module configured to use a pre-trained disease recognition model to process the target disease information, and generate a treatment reminder corresponding to the target disease information;
  • the treatment feedback data receiving module is used to receive the treatment feedback data that the client responds to according to the treatment reminder;
  • a diagnosis conclusion label generating module configured to generate a diagnosis conclusion label according to the treatment feedback data and the target condition information
  • a condition analysis report generating module configured to generate a condition analysis report according to the diagnosis conclusion label.
  • the embodiment of the present application provides an electronic device, the electronic device includes a memory, a processor, a program stored in the memory and operable on the processor, and a program for implementing the processor
  • a data bus connecting and communicating with the memory when the program is executed by the processor, a disease analysis method is implemented, wherein the disease analysis method includes: acquiring electronic medical record data; Perform entity feature extraction to obtain target condition information; use a pre-trained condition recognition model to process the target condition information to generate a treatment reminder corresponding to the target condition information; receive treatment feedback from the user terminal based on the treatment reminder response data; generate a diagnosis conclusion label according to the treatment feedback data and the target condition information; generate a condition analysis report according to the diagnosis conclusion label.
  • the embodiment of the present application provides a computer-readable storage medium for computer-readable storage, the computer-readable storage medium stores one or more programs, and the one or more programs can be stored by one Or executed by multiple processors to implement a disease analysis method, wherein the disease analysis method includes: obtaining electronic medical record data; performing entity feature extraction on the electronic medical record data to obtain target disease information; using pre-trained disease
  • the recognition model processes the target condition information to generate a treatment reminder corresponding to the target condition information; receives treatment feedback data from the user end according to the treatment reminder response; according to the treatment feedback data and the target condition information, Generate a diagnosis conclusion label; generate a condition analysis report according to the diagnosis conclusion label.
  • the disease analysis method, device, electronic equipment and storage medium proposed in this application obtain electronic medical record data and perform entity feature extraction on electronic medical record data to obtain target disease information.
  • This method can realize feature extraction of electronic medical record data. Reducing the total amount of data makes it easier to extract the required disease information; then use the pre-trained disease identification model to process the target disease information, and generate treatment reminders corresponding to the target disease information, so that users can take medication according to the treatment reminder or Seek medical attention.
  • Fig. 1 is the flowchart of the condition analysis method provided by the embodiment of the present application.
  • Fig. 2 is the flowchart of step S102 in Fig. 1;
  • Fig. 3 is the flowchart of step S103 in Fig. 1;
  • Fig. 4 is a partial flowchart of a disease analysis method provided by another embodiment of the present application.
  • Fig. 5 is the flowchart of step S105 in Fig. 1;
  • Fig. 6 is the flowchart of step S106 in Fig. 1;
  • Fig. 7 is a schematic structural diagram of a disease analysis device provided by an embodiment of the present application.
  • FIG. 8 is a schematic diagram of a hardware structure of an electronic device provided by an embodiment of the present application.
  • Artificial Intelligence It is a new technical science that studies and develops theories, methods, technologies and application systems for simulating, extending and expanding human intelligence; artificial intelligence is a branch of computer science. Intelligence attempts to understand the essence of intelligence and produce a new intelligent machine that can respond in a manner similar to human intelligence. Research in this field includes robotics, language recognition, image recognition, natural language processing, and expert systems. Artificial intelligence can simulate the information process of human consciousness and thinking. Artificial intelligence is also a theory, method, technology and application system that uses digital computers or machines controlled by digital computers to simulate, extend and expand human intelligence, perceive the environment, acquire knowledge and use knowledge to obtain the best results.
  • Natural language processing uses computers to process, understand and use human languages (such as Chinese, English, etc.). NLP belongs to a branch of artificial intelligence and is an interdisciplinary subject between computer science and linguistics. Known as computational linguistics. Natural language processing includes syntax analysis, semantic analysis, text understanding, etc. Natural language processing is often used in technical fields such as machine translation, handwritten and printed character recognition, speech recognition and text-to-speech conversion, information retrieval, information extraction and filtering, text classification and clustering, public opinion analysis and opinion mining. It involves language processing Related data mining, machine learning, knowledge acquisition, knowledge engineering, artificial intelligence research and linguistics research related to language computing, etc.
  • Information Extraction A text processing technology that extracts specified types of factual information such as entities, relationships, and events from natural language texts, and forms structured data output.
  • Information extraction is a technique to extract specific information from text data.
  • Text data is composed of some specific units, such as sentences, paragraphs, and chapters.
  • Text information is composed of some small specific units, such as words, words, phrases, sentences, paragraphs, or combinations of these specific units. . Extracting noun phrases, personal names, and place names in text data is all text information extraction.
  • the information extracted by text information extraction technology can be various types of information.
  • Association analysis is a simple and practical analysis technique, which is to discover the association or correlation existing in a large number of data sets, so as to describe the laws and patterns of the simultaneous appearance of certain attributes in a thing. Association analysis is to discover interesting associations and correlations between item sets from a large amount of data.
  • Support count An item set appears in several transactions, and its support count is how many.
  • Strong association rules Rules greater than or equal to the minimum support threshold and minimum confidence threshold are called strong association rules. The ultimate goal of association analysis is to find out strong association rules.
  • Apriori algorithm It is a basic algorithm for mining frequent itemsets required to generate Boolean association rules.
  • Apriori property Any subset of a frequent itemset should also be a frequent itemset. Prove that by definition, if an itemset I does not meet the minimum support threshold min_sup, then I is not frequent, that is, P(I) ⁇ min_sup. If an item A is added to the item set I, the resulting new item set (I ⁇ A) is not frequent, and the number of occurrences in the entire transaction database cannot be more than the number of occurrences of the original item set I, so P (I ⁇ A) ⁇ min_sup, that is (I ⁇ A) is not frequent. In this way, it can be easily confirmed that the Apriori property holds according to the inverse axiom.
  • FP-tree After Pattern Tree, referred to as FP-tree; It is an algorithm for discovering frequent patterns based on frequent pattern trees.
  • FP-growth algorithm by scanning the transaction database twice, each transaction The included frequent items are compressed and stored in FP-tree in descending order of their support.
  • the process of discovering frequent patterns in the future there is no need to scan the transaction database, but only to search in the FP-Tree, and the frequent pattern is directly generated by recursively calling the FP-growth method, so the whole discovery process is also Candidate patterns need not be generated.
  • Collaborative filtering algorithm It is a relatively well-known and commonly used recommendation algorithm. It discovers the user's preferences based on the mining of user historical behavior data, and predicts the products that users may like to recommend, or finds similar users (based on users) or Items (based on items). The realization of the user-based collaborative filtering algorithm mainly needs to solve two problems. One is how to find people who have similar hobbies as you, that is, to calculate the similarity of data.
  • BERT Bit Encoder Representations from Transformers: It is a language representation model (language representation model). BERT uses the Transformer Encoder block for connection, which is a typical two-way encoding model.
  • MLP Multilayer Perceptron
  • An MLP can be viewed as a directed graph consisting of multiple layers of nodes, each fully connected to the next layer. Except for the input node, each node is a neuron (or processing unit) with a nonlinear activation function.
  • MLP is an extension of the perceptron, which overcomes the weakness that the perceptron cannot recognize linearly inseparable data.
  • the simplest MLP is a three-layer structure (input layer-hidden layer-output layer). The layers of the multi-layer perceptron are fully connected, that is, any neuron in each layer is connected to all neurons in the previous layer. This connection actually represents a weight summation.
  • BP Error backpropagation algorithm
  • BP Error backpropagation algorithm
  • This method computes the gradient of the loss function for all weights in the network. This gradient is fed back to the optimization method to update the weights to minimize the loss function.
  • BP algorithm is suitable for a learning algorithm of multi-layer neural network, which is based on the gradient descent method.
  • the input-output relationship of the BP network is essentially a mapping relationship: the function completed by a BP neural network with n inputs and m outputs is a continuous mapping from n-dimensional Euclidean space to a finite field in m-dimensional Euclidean space.
  • the learning process of BP algorithm is composed of forward propagation process and back propagation process.
  • the input information is processed layer by layer through the hidden layer through the input layer and transmitted to the output layer. If the desired output value cannot be obtained at the output layer, take the sum of the squares of the output and the expected error as the objective function, transfer to backpropagation, and calculate the partial derivative of the objective function with respect to the weight of each neuron layer by layer to form the objective
  • the gradient of the function to the weight vector is used as the basis for modifying the weight, and the learning of the network is completed in the process of modifying the weight. When the error reaches the expected value, the network learning ends.
  • Logistic function (Logistic function or Logistic curve): Logistic function or Logistic curve is a common S-shaped function, and the generalized Logistic curve can imitate the S-shaped curve of population growth (P) in some cases.
  • P population growth
  • AI artificial intelligence
  • digital computers or machines controlled by digital computers to simulate, extend and expand human intelligence, perceive the environment, acquire knowledge and use knowledge to obtain the best results.
  • Artificial intelligence basic technologies generally include technologies such as sensors, dedicated artificial intelligence chips, cloud computing, distributed storage, big data processing technology, operation/interaction systems, and mechatronics.
  • Artificial intelligence software technology mainly includes computer vision technology, robotics technology, biometrics technology, speech processing technology, natural language processing technology, and machine learning/deep learning.
  • the embodiment of the present application can realize the analysis of the patient's historical condition based on the medical cloud technology.
  • medical cloud refers to the use of "cloud computing" to create a medical and health service cloud based on new technologies such as cloud computing, mobile technology, multimedia, 4G communication, big data, and the Internet of Things, combined with medical technology.
  • the platform realizes the sharing of medical resources and the expansion of medical coverage.
  • medical cloud improves the efficiency of medical institutions and facilitates residents to seek medical treatment. For example, appointment registration, electronic medical records, and medical insurance in hospitals are all products of the combination of cloud computing and the medical field.
  • Medical cloud also has the advantages of data security, information sharing, dynamic expansion, and overall layout.
  • the embodiment of the present application provides a condition analysis method, device, electronic equipment and storage medium, which can obtain a condition analysis report by analyzing the patient's historical condition, and provide reference pathological data for the subsequent diagnosis process, improving the The efficiency of disease analysis can also reduce the cost of medical treatment and medication for patients.
  • the disease analysis method, device, electronic device, and storage medium provided in the embodiments of the present application are specifically described through the following embodiments. First, the disease analysis method in the embodiments of the present application is described.
  • the disease analysis method provided in the embodiment of the present application relates to the fields of artificial intelligence and digital medical technology.
  • the disease analysis method provided in the embodiment of the present application can be applied to a terminal, can also be applied to a server, and can also be software running on a terminal or a server.
  • the terminal can be a smart phone, a tablet computer, a notebook computer, a desktop computer, etc.
  • the server end can be configured as an independent physical server, or can be configured as a server cluster or a distributed system composed of multiple physical servers, or It can be configured as a cloud that provides basic cloud computing services such as cloud services, cloud databases, cloud computing, cloud functions, cloud storage, network services, cloud communications, middleware services, domain name services, security services, CDN, and big data and artificial intelligence platforms.
  • the server; the software can be the application of the disease analysis method, etc., but is not limited to the above forms.
  • Fig. 1 is an optional flowchart of the disease analysis method provided by the embodiment of the present application.
  • the method in Fig. 1 may include but not limited to include steps S101 to S106.
  • Step S101 obtaining electronic medical record data
  • Step S102 extracting entity features from electronic medical record data to obtain target disease information
  • Step S103 using the pre-trained disease identification model to process the target disease information, and generate a treatment reminder corresponding to the target disease information;
  • Step S104 receiving the treatment feedback data that the client responds to according to the treatment reminder
  • Step S105 generating a diagnosis conclusion label according to the treatment feedback data and the target condition information
  • Step S106 generating a disease analysis report according to the diagnosis conclusion label.
  • an electronic medical record is generated according to the personal historical medical records uploaded by the patient, and the electronic medical record data is obtained, wherein the electronic medical record data includes patient information, medical records, medication records and so on.
  • the entity feature extraction is performed on the electronic medical record data to obtain the target condition information, among which the target condition information includes the patient's age, gender, basic health indicators, medical diagnosis data, medication records, adverse drug reaction information, operation records, etc.
  • This method can realize the feature extraction of electronic medical record data, reduce the total amount of data, and make it easier to extract the required disease information.
  • the pre-trained condition recognition model to process the above target condition information, identify the patient's historical target condition, generate a treatment reminder corresponding to the target condition information, and feed back this treatment reminder to the patient, so that the patient can follow the treatment reminder Seek medical attention or take medication. After the treatment reminder is fed back to the patient, it is also necessary to obtain treatment feedback from the patient, for example, the treatment feedback data that the user terminal responds to according to the treatment reminder can be received.
  • the diagnostic conclusion label By analyzing the treatment feedback data and target condition information, the diagnostic conclusion label can be obtained, and the treatment effect can be evaluated more conveniently; it should be noted that when analyzing the treatment feedback data and target condition information, a big data analysis model can be used To match the treatment feedback data with the target condition information, generate the corresponding diagnostic conclusion label according to the matching situation, and finally generate the condition analysis report according to the diagnosis conclusion label, and obtain the condition analysis report by analyzing the patient's historical condition, which will be used for the subsequent diagnosis process Provide reference pathological data, improve the efficiency of disease analysis, and also reduce the cost of medical treatment and medication for patients.
  • the above data is medical data, such as personal health records, prescriptions, examination reports and other data.
  • step S102 may include but not limited to include steps S201 to S205:
  • Step S201 extracting natural language text in electronic medical record data
  • Step S202 using a preset lexical analysis model to identify entity features in the natural language text
  • Step S203 segmenting the natural language text to obtain a natural language vocabulary sequence
  • Step S204 constructing a list of feature sequences according to entity features and natural language vocabulary sequences
  • Step S205 determine the target condition information according to the feature sequence table.
  • the unstructured data in the electronic medical record data is first converted into unified structured data, and the required natural language text is extracted from the structured data.
  • the preset lexical analysis model uses the preset lexical analysis model to identify entity features in natural language text.
  • a medical thesaurus is pre-built, and the medical thesaurus may include medical names, medical terms, non-medical names, folk names, international medical terms, etc. related to various medical pathologies.
  • the preset lexical analysis model can enumerate medical specific names.
  • Input the natural language text into the preset lexical analysis model, and identify the entity features in the natural language text through the medical specific names contained in the preset lexical analysis model and the preset part-of-speech categories, and the entity features may include
  • the above-mentioned multi-dimensional entity vocabulary related to medical pathology such as medical names, medical terms, non-medical names, folk names, international medical terms, modifiers, and time information.
  • a sequence classifier can also be built based on the bi-LSTM algorithm. In the model based on the bi-LSTM algorithm, the word wi and character embedding are input, and the long and short memory from left to right and the length from right to left are used.
  • the sequence classifier can pass the input entity features directly to the softmax classifier through this output layer, and create a probability distribution on the preset label through the softmax classifier, so as to mark and classify the entity parameters according to the probability distribution, and finally
  • the entity features after the classification process are subjected to feature extraction to obtain the required entity features.
  • the BERT encoder can also be used to convert the entity feature string from text form to encoded form through the preset encoding function to realize the storage of entity features.
  • a corresponding part-of-speech is assigned to a word segment in the natural language text, such as noun, verb, adjective, and so on.
  • the continuous natural language text is segmented into natural language lexical sequences with semantic rationality and integrity.
  • a feature sequence table is constructed, and the data in the electronic medical record data can be searched and compared according to the feature sequence table, and the target disease information can be determined more conveniently.
  • the above-mentioned natural language text is a medical text
  • the medical text can be an electronic healthcare record (Electronic Healthcare Record), an electronic personal health record, including medical records, electrocardiograms, medical images, etc.
  • An electronic record of value is a medical text
  • the medical text can be an electronic healthcare record (Electronic Healthcare Record), an electronic personal health record, including medical records, electrocardiograms, medical images, etc.
  • An electronic record of value is a medical text
  • the medical text can be an electronic healthcare record (Electronic Healthcare Record), an electronic personal health record, including medical records, electrocardiograms, medical images, etc.
  • An electronic record of value is an electronic healthcare record of value.
  • step S103 may include but not limited to include step S301 or step S303:
  • Step S301 inputting the target condition information into the pre-trained condition identification model
  • Step S302 performing quadrant partitioning on the target disease information through the disease identification model to obtain the partitioned disease information
  • Step S303 performing a fitting process on the partitioned disease information of each quadrant to generate a medication reminder or a medical reminder.
  • the BP algorithm can be used for deep learning to establish a disease recognition model, and the target disease information is input into the pre-trained disease recognition model.
  • the information is divided into tasks, and the condition information of the partitions is obtained.
  • the disease data in the target disease information is prioritized to obtain partitioned disease information of different importance levels.
  • the condition information of each quadrant is fitted through the fitting function, and the fitting result is output to the appearance layer for the judgment of accuracy and excellent agent for iterative optimization.
  • the iterative optimization meets the preset
  • the fitting result is output, and the corresponding medication reminder or medical reminder is generated according to the fitting result.
  • the preset iteration condition may be that the number of iterations reaches the preset number, etc., or other conditions, and is not limited thereto.
  • the disease recognition model is a multi-layer neural network, which contains two layers of processing units and two hidden layers, and each feedback can only be sent to the previous output layer or hidden layer.
  • the multi-layer neural network is a neural network that uses a backward propagation algorithm to learn classification or prediction. Specifically, the construction process of the disease recognition model is as follows:
  • w ij is the weight of the connection from unit i to unit j in the previous layer
  • O i is the output of unit i in the previous layer
  • ⁇ j is the bias of unit j, which is used as a threshold to change the unit’s active
  • Step 5 Calculate the errors of hidden nodes and output nodes respectively, where the output layer error formula is shown in formula (3); the hidden layer formula is shown in formula (4);
  • O j is the actual output of unit j
  • T j is the known target value of j given the training tuple.
  • O j (1-O j ) is the derivative of the logistic function
  • wjk is the weight of the connection from unit k to unit j in the next higher layer, and is the error of unit k.
  • Step 6 Extract the eliminated agents and recycle the corresponding root parts of speech after segmenting the corresponding natural language text.
  • a condition identification model that meets the requirements can be constructed, and then the reference condition information is input into the condition identification model, and the reference condition information is divided into quadrants through the condition identification model, and the reference condition information is assigned tasks according to the four-quadrant rule. Differentiate, and classify the disease data in the reference disease information according to priority, and obtain the partitioned disease information of different importance levels. Furthermore, the fitting function is used to fit the partition reference disease information of each quadrant, and the fitting result is output to the appearance layer for the judgment of accuracy and excellent agent for iterative optimization. When the iterative optimization meets the expected When the iterative condition is set, the iterative optimization is stopped, so as to complete the training of the pathological recognition model.
  • the patient's historical target condition is identified through the trained condition recognition model.
  • the target condition information is input into the trained condition recognition model to obtain the fitting result, and the target condition information is generated according to the fitting result.
  • the corresponding treatment reminder is given, and the treatment reminder is fed back to the patient, so that the patient can seek medical treatment or take medicine according to the treatment reminder.
  • the method may include, but is not limited to, step S401 to step S402:
  • Step S401 analyzing the drug data in the target condition information to obtain drug reaction information; wherein, the drug reaction information includes adverse drug reaction information;
  • Step S402 identifying abnormal medication data in the treatment feedback data according to the adverse drug reaction information.
  • the automatic text generation model of words is used to extract drug data, and the automatic text generation model can perform different data processing according to the type of input data.
  • Input keywords or text sentences or fields in historical medical records and historical medication records to the pre-trained text automatic generation model if the input keywords, text sentences or fields can match the preset reference text, it indicates The current input meets the requirements. If the current input is a keyword, select the same sentence set as the input keyword in the basic corpus, and generate the corresponding drug data field according to this sentence set.
  • the current input is a text sentence or field
  • the candidate sentence is copied and supplemented, and the drug data field is generated according to the supplemented candidate sentence.
  • these drug data fields are analyzed to obtain drug response information, which includes adverse drug reaction information, drug efficacy information, and medication guidance information. wait. Furthermore, by comparing and analyzing the adverse drug reaction information and the treatment feedback data, if the treatment feedback data involves data that matches the adverse drug reaction information, it can be determined that the data is abnormal drug use data, which can be more conveniently identified Abnormal medication data, and mark the abnormal medication data in the treatment feedback data, so as to play a prompt role in subsequent diagnosis and treatment, avoid the occurrence of abnormal medication again, and improve the reliability of medication.
  • step S104 the method also includes, but is not limited to:
  • the treatment feedback data and target condition information can be written into local files through the logback component (open source log component) in the data management layer, and the interface log including treatment feedback data and target condition information can be recorded through the interceptor.
  • the log collection system (Flume) in the management layer visualizes the treatment feedback data and the target disease information respectively to obtain the corresponding target visualization data, and then inputs these target visualization data into the Hive database and the HBase database in the data management layer for integration , and finally import the target visualization data from the PostgreSql database into the Hive database through the Sqoop transmission component in the data management layer, generate corresponding charts, and sort the series of charts according to the preset part-of-speech ranking sequence to generate visual data picture.
  • the preset part-of-speech level is that the level of nouns is higher than that of verbs, and the level of verbs is higher than that of adjectives.
  • step S105 in some embodiments may include, but is not limited to, step S501 to step S502:
  • Step S501 encoding treatment feedback data and target condition information respectively to obtain treatment feedback data in encoded form and target condition information in encoded form;
  • Step S502 using the preset big data analysis model to perform data analysis on the coded treatment feedback data and the coded target condition information to generate a diagnostic conclusion label.
  • the treatment feedback data and the target condition information can be respectively encoded by a preset encoder
  • the preset encoder can be a BERT-based encoder, that is, by obtaining the treatment feedback data and the target condition information , and tokenize treatment feedback data and target condition information, build a BERT token generator, pre-train the BERT token generator, and form a BERT encoder that meets the requirements, so that the BERT encoder can pass the preset
  • the encoding function converts the treatment feedback data and the target condition information from the text form into the code form, and obtains the treatment feedback data in the code form and the target condition information in the code form.
  • the collaborative filtering algorithm in the preset big data analysis model is used to calculate the similarity between the treatment feedback data in the coded form and the target disease information in the coded form. According to the degree of similarity, the treatment feedback data in the coded form and the target disease information in the coded form are correlated and matched to generate corresponding diagnostic conclusion labels.
  • the collaborative filtering algorithm may be a Jaccard similarity coefficient method, an included angle cosine method, or a similarity measurement method such as Euclidean distance or Manhattan distance, without limitation.
  • the preset big data analysis model may also use an association analysis algorithm to associate and match the encoded treatment feedback data and the encoded target disease information. Commonly used correlation analysis algorithms include Apriori algorithm, FP-growth algorithm and so on. This method improves the efficiency of data analysis, and also improves the matching accuracy of treatment feedback data and target disease information.
  • step S106 may include but not limited to include steps S601 to S603:
  • Step S601 purifying the diagnostic conclusion label according to the target condition information to obtain the purified diagnostic conclusion label
  • Step S602 verifying and analyzing the purified diagnosis conclusion label to obtain a standard diagnosis conclusion label
  • Step S603 generating a disease analysis report according to the standard diagnosis conclusion label.
  • the select statement function can also be preset, fill in the required option data in the preset select statement function, and use the select statement function filled with option data to extract the target disease information.
  • the data set is compared with the diagnosis conclusion label, and the abnormal diagnosis conclusion label is eliminated to obtain the purified diagnosis conclusion label.
  • the judgment of the abnormal diagnosis conclusion label can be determined according to the similarity between the data set and the diagnosis conclusion label. If the similarity between the two is less than the preset similarity threshold, the diagnosis conclusion label is determined to be an abnormal diagnosis conclusion label.
  • the purified diagnostic conclusion label is verified and analyzed, the target condition information and treatment feedback data corresponding to the purified diagnostic conclusion label are reviewed, and the target condition information and treatment feedback data are corrected and adjusted to obtain a standard diagnosis conclusion label.
  • the condition analysis report is generated according to the standard diagnosis conclusion label, and the condition analysis report is obtained by analyzing the patient's historical condition, which provides reference pathological data for the subsequent diagnosis process, improves the efficiency of condition analysis, and can also reduce the cost of medical treatment for patients and drug costs.
  • this method can realize feature extraction of electronic medical record data, reduce the total amount of data, and make it more convenient to extract the required data.
  • the patient s condition information; and then use the pre-trained condition recognition model to process the target condition information, and generate a treatment reminder corresponding to the target condition information, so that the patient can take medication or seek medical treatment according to the treatment reminder.
  • the embodiment of the present application also provides a disease analysis device, which can implement the above disease analysis method, the device includes:
  • the processing module 703 is configured to use the pre-trained disease recognition model to process the target disease information, and generate a treatment reminder corresponding to the target disease information;
  • the treatment feedback data receiving module 704 is used to receive the treatment feedback data that the client responds to according to the treatment reminder;
  • the diagnosis conclusion label generation module 705 is used for generating diagnosis conclusion labels according to the treatment feedback data and the target condition information
  • the disease analysis report generating module 706 is configured to generate a disease analysis report according to the diagnosis conclusion label.
  • the embodiment of the present application also provides an electronic device, the electronic device includes: a memory, a processor, a program stored in the memory and operable on the processor, and a data bus for realizing connection and communication between the processor and the memory , when the program is executed by the processor, the above disease analysis method is realized.
  • the electronic device may be any intelligent terminal including a tablet computer, a vehicle-mounted computer, and the like.
  • FIG. 8 illustrates a hardware structure of an electronic device in another embodiment.
  • the electronic device includes:
  • the processor 801 may be implemented by a general-purpose CPU (Central Processing Unit, central processing unit), a microprocessor, an application-specific integrated circuit (Application Specific Integrated Circuit, ASIC), or one or more integrated circuits, and is used to execute related programs to realize The technical solutions provided by the embodiments of the present application;
  • a general-purpose CPU Central Processing Unit, central processing unit
  • a microprocessor an application-specific integrated circuit (Application Specific Integrated Circuit, ASIC), or one or more integrated circuits, and is used to execute related programs to realize The technical solutions provided by the embodiments of the present application;
  • ASIC Application Specific Integrated Circuit
  • the memory 802 may be implemented in the form of a read-only memory (ReadOnlyMemory, ROM), a static storage device, a dynamic storage device, or a random access memory (RandomAccessMemory, RAM).
  • the memory 802 can store operating systems and other application programs.
  • the relevant program codes are stored in the memory 802, and are invoked by the processor 801 to execute a condition Analytical method;
  • the input/output interface 803 is used to realize information input and output
  • the communication interface 804 is used to realize the communication interaction between the device and other devices, and the communication can be realized through a wired method (such as USB, network cable, etc.), or can be realized through a wireless method (such as a mobile network, WIFI, Bluetooth, etc.); and
  • a bus 805 which transmits information between various components of the device (such as a processor 801, a memory 802, an input/output interface 803, and a communication interface 804);
  • the processor 801 , the memory 802 , the input/output interface 803 and the communication interface 804 are connected to each other within the device through the bus 805 .
  • condition analysis method includes:
  • An embodiment of the present application also provides a computer-readable storage medium for computer-readable storage.
  • the computer-readable storage medium may be non-volatile or volatile.
  • the computer-readable storage medium stores one or more programs, and the one or more programs can be executed by one or more processors to implement a disease analysis method, wherein the disease analysis method includes: obtaining electronic medical record data; Extract entity features from medical record data to obtain target condition information; use pre-trained condition recognition model to process target condition information to generate treatment reminders corresponding to target condition information; receive treatment feedback data from user terminals based on treatment reminder responses; Feedback data and target condition information to generate a diagnosis conclusion label; generate a condition analysis report based on the diagnosis conclusion label.
  • memory can be used to store non-transitory software programs and non-transitory computer-executable programs.
  • the memory may include high-speed random access memory, and may also include non-transitory memory, such as at least one magnetic disk storage device, flash memory device, or other non-transitory solid-state storage devices.
  • the memory optionally includes memory located remotely from the processor, and these remote memories may be connected to the processor via a network. Examples of the aforementioned networks include, but are not limited to, the Internet, intranets, local area networks, mobile communication networks, and combinations thereof.
  • the device embodiments described above are only illustrative, and the units described as separate components may or may not be physically separated, that is, they may be located in one place, or may be distributed to multiple network units. Part or all of the modules can be selected according to actual needs to achieve the purpose of the solution of this embodiment.
  • each functional unit in each embodiment of the present application may be integrated into one processing unit, each unit may exist separately physically, or two or more units may be integrated into one unit.
  • the above-mentioned integrated units can be implemented in the form of hardware or in the form of software functional units.
  • the integrated unit is realized in the form of a software function unit and sold or used as an independent product, it can be stored in a computer-readable storage medium.
  • the technical solution of the present application is essentially or part of the contribution to the prior art or all or part of the technical solution can be embodied in the form of a software product, and the computer software product is stored in a storage medium , including multiple instructions to make a computer device (which may be a personal computer, a server, or a network device, etc.) execute all or part of the steps of the method in each embodiment of the present application.
  • the aforementioned storage media include: U disk, mobile hard disk, read-only memory (Read-Only Memory, referred to as ROM), random access memory (Random Access Memory, referred to as RAM), magnetic disk or optical disc, etc., which can store programs. medium.
  • ROM read-only memory
  • RAM random access memory
  • magnetic disk or optical disc etc., which can store programs. medium.

Landscapes

  • Engineering & Computer Science (AREA)
  • Health & Medical Sciences (AREA)
  • Theoretical Computer Science (AREA)
  • General Health & Medical Sciences (AREA)
  • Public Health (AREA)
  • Medical Informatics (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • Biomedical Technology (AREA)
  • Primary Health Care (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Epidemiology (AREA)
  • Computational Linguistics (AREA)
  • Artificial Intelligence (AREA)
  • Databases & Information Systems (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Biophysics (AREA)
  • Evolutionary Computation (AREA)
  • Molecular Biology (AREA)
  • Computing Systems (AREA)
  • Mathematical Physics (AREA)
  • Software Systems (AREA)
  • Pathology (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Medical Treatment And Welfare Office Work (AREA)

Abstract

Provided are an illness state analysis method and apparatus, an electronic device, and a storage medium. The method comprises: acquiring electronic medical record data (S101); performing entity feature extraction on the electronic medical record data to obtain target illness state information (S102); processing the target illness state information by using a pre-trained illness state recognition model to generate a treatment reminder corresponding to the target illness state information (S103); receiving treatment feedback data responded by a user side according to the treatment reminder (S104); generating a diagnosis conclusion label according to the treatment feedback data and the target illness state information (S105); and generating an illness state analysis report according to the diagnosis conclusion label (S106).

Description

病情分析方法、装置、电子设备及存储介质Disease analysis method, device, electronic device and storage medium
本申请要求于2021年8月30日提交中国专利局、申请号为202111007202.5,发明名称为“病情分析方法、装置、电子设备及存储介质”的中国专利申请的优先权,其全部内容通过引用结合在本申请中。This application claims the priority of the Chinese patent application with the application number 202111007202.5 and the title of the invention "disease analysis method, device, electronic equipment and storage medium" submitted to the China Patent Office on August 30, 2021, the entire contents of which are incorporated by reference in this application.
技术领域technical field
本申请涉及人工智能及数字医疗技术领域,尤其涉及一种病情分析方法、装置、电子设备及存储介质。The present application relates to the field of artificial intelligence and digital medical technology, and in particular to a disease analysis method, device, electronic equipment and storage medium.
背景技术Background technique
目前,由于患者的历史病历往往是纸质病历或者历史病历无法进行数据共享,发明人意识到这一情况常常给患者后续的就医带来不便,影响病情分析准确性及分析效率,因此,如何通过对患者的历史病情进行分析得到可参考的病理数据,提高病情分析效率,成为了亟待解决的技术问题。At present, since the historical medical records of patients are often paper medical records or historical medical records cannot be shared, the inventor realized that this situation often brings inconvenience to patients' follow-up medical treatment and affects the accuracy and efficiency of disease analysis. Therefore, how to pass Analyzing the historical conditions of patients to obtain reference pathological data and improving the efficiency of condition analysis has become a technical problem that needs to be solved urgently.
技术问题technical problem
以下是对本文详细描述的主题的概述。本概述并非是为了限制权利要求的保护范围。The following is an overview of the topics described in detail in this article. This summary is not intended to limit the scope of the claims.
本申请实施例的主要目的在于提出一种病情分析方法、装置、电子设备及存储介质,旨在通过对患者的历史病情进行分析得到可参考的病理数据,提高病情分析效率。The main purpose of the embodiments of the present application is to provide a disease analysis method, device, electronic equipment and storage medium, aiming to obtain reference pathological data by analyzing the patient's historical disease, and improve the efficiency of disease analysis.
技术解决方案technical solution
第一方面,本申请实施例提出了一种病情分析方法,所述方法包括:In the first aspect, the embodiment of the present application proposes a disease analysis method, the method comprising:
获取电子病历数据;Access to electronic medical record data;
对所述电子病历数据进行实体特征提取,得到目标病情信息;Extracting entity features from the electronic medical record data to obtain target disease information;
利用预先训练的病情识别模型对所述目标病情信息进行处理,生成与所述目标病情信息对应的治疗提醒;Processing the target condition information by using a pre-trained condition identification model to generate a treatment reminder corresponding to the target condition information;
接收用户端根据所述治疗提醒响应的治疗反馈数据;receiving the treatment feedback data that the user terminal responds to according to the treatment reminder;
根据所述治疗反馈数据和所述目标病情信息,生成诊断结论标签;generating a diagnosis conclusion label according to the treatment feedback data and the target condition information;
根据所述诊断结论标签生成病情分析报告。A condition analysis report is generated according to the diagnostic conclusion label.
第二方面,本申请实施例提出了一种病情分析装置,所述装置包括:In the second aspect, the embodiment of the present application proposes a disease analysis device, which includes:
电子病历数据获取模块,用于获取电子病历数据;Electronic medical record data acquisition module, used to obtain electronic medical record data;
特征提取模块,用于对所述电子病历数据进行实体特征提取,得到目标病情信息;A feature extraction module, configured to extract entity features from the electronic medical record data to obtain target disease information;
处理模块,用于利用预先训练的病情识别模型对所述目标病情信息进行处理,生成与所述目标病情信息对应的治疗提醒;A processing module, configured to use a pre-trained disease recognition model to process the target disease information, and generate a treatment reminder corresponding to the target disease information;
治疗反馈数据接收模块,用于接收用户端根据所述治疗提醒响应的治疗反馈数据;The treatment feedback data receiving module is used to receive the treatment feedback data that the client responds to according to the treatment reminder;
诊断结论标签生成模块,用于根据所述治疗反馈数据和所述目标病情信息,生成诊断结论标签;A diagnosis conclusion label generating module, configured to generate a diagnosis conclusion label according to the treatment feedback data and the target condition information;
病情分析报告生成模块,用于根据所述诊断结论标签生成病情分析报告。A condition analysis report generating module, configured to generate a condition analysis report according to the diagnosis conclusion label.
第三方面,本申请实施例提出了一种电子设备,所述电子设备包括存储器、处理器、存储在所述存储器上并可在所述处理器上运行的程序以及用于实现所述处理器和所述存储器之间的连接通信的数据总线,所述程序被所述处理器执行时实现一种病情分析方法,其中,所述病情分析方法包括:获取电子病历数据;对所述电子病历数据进行实体特征提取,得到目标病情信息;利用预先训练的病情识别模型对所述目标病情信息进行处理,生成与所述目标病情信息对应的治疗提醒;接收用户端根据所述治疗提醒响应的治疗反馈数据;根据所述治疗反馈数据和所述目标病情信息,生成诊断结论标签;根据所述诊断结论标签生成病情分析 报告。In the third aspect, the embodiment of the present application provides an electronic device, the electronic device includes a memory, a processor, a program stored in the memory and operable on the processor, and a program for implementing the processor A data bus connecting and communicating with the memory, when the program is executed by the processor, a disease analysis method is implemented, wherein the disease analysis method includes: acquiring electronic medical record data; Perform entity feature extraction to obtain target condition information; use a pre-trained condition recognition model to process the target condition information to generate a treatment reminder corresponding to the target condition information; receive treatment feedback from the user terminal based on the treatment reminder response data; generate a diagnosis conclusion label according to the treatment feedback data and the target condition information; generate a condition analysis report according to the diagnosis conclusion label.
第四方面,本申请实施例提出了一种计算机可读存储介质,用于计算机可读存储,所述计算机可读存储介质存储有一个或者多个程序,所述一个或者多个程序可被一个或者多个处理器执行,以实现一种病情分析方法,其中,所述病情分析方法包括:获取电子病历数据;对所述电子病历数据进行实体特征提取,得到目标病情信息;利用预先训练的病情识别模型对所述目标病情信息进行处理,生成与所述目标病情信息对应的治疗提醒;接收用户端根据所述治疗提醒响应的治疗反馈数据;根据所述治疗反馈数据和所述目标病情信息,生成诊断结论标签;根据所述诊断结论标签生成病情分析报告。In the fourth aspect, the embodiment of the present application provides a computer-readable storage medium for computer-readable storage, the computer-readable storage medium stores one or more programs, and the one or more programs can be stored by one Or executed by multiple processors to implement a disease analysis method, wherein the disease analysis method includes: obtaining electronic medical record data; performing entity feature extraction on the electronic medical record data to obtain target disease information; using pre-trained disease The recognition model processes the target condition information to generate a treatment reminder corresponding to the target condition information; receives treatment feedback data from the user end according to the treatment reminder response; according to the treatment feedback data and the target condition information, Generate a diagnosis conclusion label; generate a condition analysis report according to the diagnosis conclusion label.
有益效果Beneficial effect
本申请提出的病情分析方法、装置、电子设备及存储介质,其通过获取电子病历数据,对电子病历数据进行实体特征提取,得到目标病情信息,这一方式能够实现对电子病历数据的特征抽取,缩小数据总量,使得更为方便提取到所需要的病情信息;进而利用预先训练的病情识别模型对目标病情信息进行处理,生成与目标病情信息对应的治疗提醒,以便用户根据治疗提醒进行用药或者就医。而后接收用户端根据治疗提醒响应的治疗反馈数据,对治疗反馈数据和目标病情信息进行分析,得到诊断结论标签,能够较为方便地评估出治疗效果;最后根据诊断结论标签生成病情分析报告,通过对用户的历史病情进行分析得到病情分析报告,为后续的诊断过程提供可参考的病理数据,提高了病情分析效率,同时也可以降低患者的就医成本和用药成本。The disease analysis method, device, electronic equipment and storage medium proposed in this application obtain electronic medical record data and perform entity feature extraction on electronic medical record data to obtain target disease information. This method can realize feature extraction of electronic medical record data. Reducing the total amount of data makes it easier to extract the required disease information; then use the pre-trained disease identification model to process the target disease information, and generate treatment reminders corresponding to the target disease information, so that users can take medication according to the treatment reminder or Seek medical attention. Then receive the treatment feedback data from the user terminal according to the treatment reminder response, analyze the treatment feedback data and the target condition information, and obtain the diagnosis conclusion label, which can evaluate the treatment effect more conveniently; finally, generate the condition analysis report according to the diagnosis conclusion label, through the The user's historical condition is analyzed to obtain a condition analysis report, which provides reference pathological data for the subsequent diagnosis process, improves the efficiency of condition analysis, and can also reduce the cost of medical treatment and medication for patients.
本申请的其它特征和优点将在随后的说明书中阐述,并且,部分地从说明书中变得显而易见,或者通过实施本申请而了解。本申请的目的和其他优点可通过在说明书、权利要求书以及附图中所特别指出的结构来实现和获得。Additional features and advantages of the application will be set forth in the description which follows, and, in part, will be obvious from the description, or may be learned by practice of the application. The objectives and other advantages of the application will be realized and attained by the structure particularly pointed out in the written description and claims hereof as well as the appended drawings.
附图说明Description of drawings
附图用来提供对本申请技术方案的进一步理解,并且构成说明书的一部分,与本申请的实施例一起用于解释本申请的技术方案,并不构成对本申请技术方案的限制。The accompanying drawings are used to provide a further understanding of the technical solution of the present application, and constitute a part of the specification, and are used together with the embodiments of the present application to explain the technical solution of the present application, and do not constitute a limitation to the technical solution of the present application.
图1是本申请实施例提供的病情分析方法的流程图;Fig. 1 is the flowchart of the condition analysis method provided by the embodiment of the present application;
图2是图1中的步骤S102的流程图;Fig. 2 is the flowchart of step S102 in Fig. 1;
图3是图1中的步骤S103的流程图;Fig. 3 is the flowchart of step S103 in Fig. 1;
图4是本申请另一实施例提供的病情分析方法的部分流程图;Fig. 4 is a partial flowchart of a disease analysis method provided by another embodiment of the present application;
图5是图1中的步骤S105的流程图;Fig. 5 is the flowchart of step S105 in Fig. 1;
图6是图1中的步骤S106的流程图;Fig. 6 is the flowchart of step S106 in Fig. 1;
图7是本申请实施例提供的病情分析装置的结构示意图;Fig. 7 is a schematic structural diagram of a disease analysis device provided by an embodiment of the present application;
图8是本申请实施例提供的电子设备的硬件结构示意图。FIG. 8 is a schematic diagram of a hardware structure of an electronic device provided by an embodiment of the present application.
本申请的实施方式Embodiment of this application
为了使本申请的目的、技术方案及优点更加清楚明白,以下结合附图及实施例,对本申请进行进一步详细说明。应当理解,此处所描述的具体实施例仅用以解释本申请,并不用于限定本申请。In order to make the purpose, technical solution and advantages of the present application clearer, the present application will be further described in detail below in conjunction with the accompanying drawings and embodiments. It should be understood that the specific embodiments described here are only used to explain the present application, not to limit the present application.
需要说明的是,虽然在装置示意图中进行了功能模块划分,在流程图中示出了逻辑顺序,但是在某些情况下,可以以不同于装置中的模块划分,或流程图中的顺序执行所示出或描述的步骤。说明书和权利要求书及上述附图中的术语“第一”、“第二”等是用于区别类似的对象,而不必用于描述特定的顺序或先后次序。It should be noted that although the functional modules are divided in the schematic diagram of the device, and the logical sequence is shown in the flowchart, in some cases, it can be executed in a different order than the module division in the device or the flowchart in the flowchart. steps shown or described. The terms "first", "second" and the like in the specification and claims and the above drawings are used to distinguish similar objects, and not necessarily used to describe a specific sequence or sequence.
除非另有定义,本文所使用的所有的技术和科学术语与属于本申请的技术领域的技术人员通常理解的含义相同。本文中所使用的术语只是为了描述本申请实施例的目的,不是旨在限制本申请。Unless otherwise defined, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the technical field to which this application belongs. The terms used herein are only for the purpose of describing the embodiments of the present application, and are not intended to limit the present application.
首先,对本申请中涉及的若干名词进行解析:First, analyze some nouns involved in this application:
人工智能(artificial intelligence,AI):是研究、开发用于模拟、延伸和扩展人的智能的理论、方法、技术及应用系统的一门新的技术科学;人工智能是计算机科学的一个分支,人工智能企图了解智能的实质,并生产出一种新的能以人类智能相似的方式做出反应的智能机器,该领域的研究包括机器人、语言识别、图像识别、自然语言处理和专家系统等。人工智能可以对人的意识、思维的信息过程的模拟。人工智能还是利用数字计算机或者数字计算机控制的机器模拟、延伸和扩展人的智能,感知环境、获取知识并使用知识获得最佳结果的理论、方法、技术及应用系统。Artificial Intelligence (AI): It is a new technical science that studies and develops theories, methods, technologies and application systems for simulating, extending and expanding human intelligence; artificial intelligence is a branch of computer science. Intelligence attempts to understand the essence of intelligence and produce a new intelligent machine that can respond in a manner similar to human intelligence. Research in this field includes robotics, language recognition, image recognition, natural language processing, and expert systems. Artificial intelligence can simulate the information process of human consciousness and thinking. Artificial intelligence is also a theory, method, technology and application system that uses digital computers or machines controlled by digital computers to simulate, extend and expand human intelligence, perceive the environment, acquire knowledge and use knowledge to obtain the best results.
自然语言处理(natural language processing,NLP):NLP用计算机来处理、理解以及运用人类语言(如中文、英文等),NLP属于人工智能的一个分支,是计算机科学与语言学的交叉学科,又常被称为计算语言学。自然语言处理包括语法分析、语义分析、篇章理解等。自然语言处理常用于机器翻译、手写体和印刷体字符识别、语音识别及文语转换、信息检索、信息抽取与过滤、文本分类与聚类、舆情分析和观点挖掘等技术领域,它涉及与语言处理相关的数据挖掘、机器学习、知识获取、知识工程、人工智能研究和与语言计算相关的语言学研究等。Natural language processing (NLP): NLP uses computers to process, understand and use human languages (such as Chinese, English, etc.). NLP belongs to a branch of artificial intelligence and is an interdisciplinary subject between computer science and linguistics. Known as computational linguistics. Natural language processing includes syntax analysis, semantic analysis, text understanding, etc. Natural language processing is often used in technical fields such as machine translation, handwritten and printed character recognition, speech recognition and text-to-speech conversion, information retrieval, information extraction and filtering, text classification and clustering, public opinion analysis and opinion mining. It involves language processing Related data mining, machine learning, knowledge acquisition, knowledge engineering, artificial intelligence research and linguistics research related to language computing, etc.
信息抽取(Information Extraction,NER):从自然语言文本中抽取指定类型的实体、关系、事件等事实信息,并形成结构化数据输出的文本处理技术。信息抽取是从文本数据中抽取特定信息的一种技术。文本数据是由一些具体的单位构成的,例如句子、段落、篇章,文本信息正是由一些小的具体的单位构成的,例如字、词、词组、句子、段落或是这些具体的单位的组合。抽取文本数据中的名词短语、人名、地名等都是文本信息抽取,当然,文本信息抽取技术所抽取的信息可以是各种类型的信息。Information Extraction (Information Extraction, NER): A text processing technology that extracts specified types of factual information such as entities, relationships, and events from natural language texts, and forms structured data output. Information extraction is a technique to extract specific information from text data. Text data is composed of some specific units, such as sentences, paragraphs, and chapters. Text information is composed of some small specific units, such as words, words, phrases, sentences, paragraphs, or combinations of these specific units. . Extracting noun phrases, personal names, and place names in text data is all text information extraction. Of course, the information extracted by text information extraction technology can be various types of information.
关联分析:关联分析是一种简单、实用的分析技术,就是发现存在于大量数据集中的关联性或相关性,从而描述了一个事物中某些属性同时出现的规律和模式。关联分析是从大量数据中发现项集之间有趣的关联和相关联系。Association analysis: Association analysis is a simple and practical analysis technique, which is to discover the association or correlation existing in a large number of data sets, so as to describe the laws and patterns of the simultaneous appearance of certain attributes in a thing. Association analysis is to discover interesting associations and correlations between item sets from a large amount of data.
(1)支持度计数:一个项集出现在几个事务当中,它的支持度计数就是几。(1) Support count: An item set appears in several transactions, and its support count is how many.
(2)支持度:支持度计数除于总的事务数。(2) Support: support count divided by the total number of transactions.
(3)置信度:对于规则{Diaper}→{Beer},{Diaper,Beer}的支持度计数除于{Diaper}的支持度计数,为这个规则的置信度。(3) Confidence: For the rule {Diaper}→{Beer}, the support count of {Diaper, Beer} divided by the support count of {Diaper} is the confidence of this rule.
(4)强关联规则:大于或等于最小支持度阈值和最小置信度阈值的规则叫做强关联规则。关联分析的最终目标就是要找出强关联规则。(4) Strong association rules: Rules greater than or equal to the minimum support threshold and minimum confidence threshold are called strong association rules. The ultimate goal of association analysis is to find out strong association rules.
(5)Apriori算法:是挖掘产生布尔关联规则所需频繁项集的基本算法。Apriori性质:一个频繁项集的任一子集也应该是频繁项集。证明根据定义,若一个项集I不满足最小支持度阈值min_sup,则I不是频繁的,即P(I)<min_sup。若增加一个项A到项集I中,则结果新项集(I∪A)也不是频繁的,在整个事务数据库中所出现的次数也不可能多于原项集I出现的次数,因此P(I∪A)<min_sup,即(I∪A)也不是频繁的。这样就可以根据逆反公理很容易地确定Apriori性质成立。(5) Apriori algorithm: It is a basic algorithm for mining frequent itemsets required to generate Boolean association rules. Apriori property: Any subset of a frequent itemset should also be a frequent itemset. Prove that by definition, if an itemset I does not meet the minimum support threshold min_sup, then I is not frequent, that is, P(I)<min_sup. If an item A is added to the item set I, the resulting new item set (I∪A) is not frequent, and the number of occurrences in the entire transaction database cannot be more than the number of occurrences of the original item set I, so P (I∪A)<min_sup, that is (I∪A) is not frequent. In this way, it can be easily confirmed that the Apriori property holds according to the inverse axiom.
(6)FP-growth算法(Frequent Pattern Tree,简称为FP-tree;):是基于频繁模式树的发现频繁模式的算法,在FP-growth算法中,通过两次扫描事务数据库,把每个事务所包含的频繁项目按其支持度降序压缩存储到FP—tree中。在以后发现频繁模式的过程中,不需要再扫描事务数据库,而仅在FP-Tree中进行查找即可,并通过递归调用FP-growth的方法来直接产生频繁模式,因此在整个发现过程中也不需产生候选模式。(6) FP-growth algorithm (Frequent Pattern Tree, referred to as FP-tree;): It is an algorithm for discovering frequent patterns based on frequent pattern trees. In the FP-growth algorithm, by scanning the transaction database twice, each transaction The included frequent items are compressed and stored in FP-tree in descending order of their support. In the process of discovering frequent patterns in the future, there is no need to scan the transaction database, but only to search in the FP-Tree, and the frequent pattern is directly generated by recursively calling the FP-growth method, so the whole discovery process is also Candidate patterns need not be generated.
协同过滤算法:是一种较为著名和常用的推荐算法,它基于对用户历史行为数据的挖掘发现用户的喜好偏向,并预测用户可能喜好的产品进行推荐,或者找到相似的用户(基于用户)或物品(基于物品)。基于用户的协同过滤算法的实现主要需要解决两个问题,一是如何找到和你有相似爱好的人,也就是要计算数据的相似度。Collaborative filtering algorithm: It is a relatively well-known and commonly used recommendation algorithm. It discovers the user's preferences based on the mining of user historical behavior data, and predicts the products that users may like to recommend, or finds similar users (based on users) or Items (based on items). The realization of the user-based collaborative filtering algorithm mainly needs to solve two problems. One is how to find people who have similar hobbies as you, that is, to calculate the similarity of data.
BERT(Bidirectional Encoder Representations from Transformers):是一个语言表示模型(language representation model)。BERT采用了Transformer Encoder block进行 连接,是一个典型的双向编码模型。BERT (Bidirectional Encoder Representations from Transformers): It is a language representation model (language representation model). BERT uses the Transformer Encoder block for connection, which is a typical two-way encoding model.
多层感知器(Multilayer Perceptron,MLP):MLP是一种前向结构的人工神经网络,映射一组输入向量到一组输出向量。MLP可以被看作是一个有向图,由多个的节点层所组成,每一层都全连接到下一层。除了输入节点,每个节点都是一个带有非线性激活函数的神经元(或称处理单元)。MLP是感知器的推广,克服了感知器不能对线性不可分数据进行识别的弱点。最简单的MLP是三层结构(输入层-隐藏层-输出层)。多层感知器的层与层之间是全连接的,即每一层的任意一个神经元均与其前一层的所有神经元有连接,这种连接其实代表了一种权重加和。Multilayer Perceptron (MLP): MLP is a forward-structured artificial neural network that maps a set of input vectors to a set of output vectors. An MLP can be viewed as a directed graph consisting of multiple layers of nodes, each fully connected to the next layer. Except for the input node, each node is a neuron (or processing unit) with a nonlinear activation function. MLP is an extension of the perceptron, which overcomes the weakness that the perceptron cannot recognize linearly inseparable data. The simplest MLP is a three-layer structure (input layer-hidden layer-output layer). The layers of the multi-layer perceptron are fully connected, that is, any neuron in each layer is connected to all neurons in the previous layer. This connection actually represents a weight summation.
误差反向传播算法(Backpropagation,缩写为BP):,是一种与最优化方法(如梯度下降法)结合使用的,用来训练人工神经网络的常见方法。该方法对网络中所有权重计算损失函数的梯度。这个梯度会反馈给最优化方法,用来更新权值以最小化损失函数。BP算法适合于多层神经元网络的一种学习算法,它建立在梯度下降法的基础上。BP网络的输入输出关系实质上是一种映射关系:一个n输入m输出的BP神经网络所完成的功能是从n维欧氏空间向m维欧氏空间中一有限域的连续映射,这一映射具有高度非线性。它的信息处理能力来源于简单非线性函数的多次复合,因此具有很强的函数复现能力。BP算法的学习过程由正向传播过程和反向传播过程组成。在正向传播过程中,输入信息通过输入层经隐含层,逐层处理并传向输出层。如果在输出层得不到期望的输出值,则取输出与期望的误差的平方和作为目标函数,转入反向传播,逐层求出目标函数对各神经元权值的偏导数,构成目标函数对权值向量的梯量,作为修改权值的依据,网络的学习在权值修改过程中完成。误差达到所期望值时,网络学习结束。Error backpropagation algorithm (Backpropagation, abbreviated as BP): It is a common method used in combination with optimization methods (such as gradient descent method) to train artificial neural networks. This method computes the gradient of the loss function for all weights in the network. This gradient is fed back to the optimization method to update the weights to minimize the loss function. BP algorithm is suitable for a learning algorithm of multi-layer neural network, which is based on the gradient descent method. The input-output relationship of the BP network is essentially a mapping relationship: the function completed by a BP neural network with n inputs and m outputs is a continuous mapping from n-dimensional Euclidean space to a finite field in m-dimensional Euclidean space. Mapping is highly non-linear. Its information processing ability comes from the multiple compounding of simple nonlinear functions, so it has a strong ability to reproduce functions. The learning process of BP algorithm is composed of forward propagation process and back propagation process. In the forward propagation process, the input information is processed layer by layer through the hidden layer through the input layer and transmitted to the output layer. If the desired output value cannot be obtained at the output layer, take the sum of the squares of the output and the expected error as the objective function, transfer to backpropagation, and calculate the partial derivative of the objective function with respect to the weight of each neuron layer by layer to form the objective The gradient of the function to the weight vector is used as the basis for modifying the weight, and the learning of the network is completed in the process of modifying the weight. When the error reaches the expected value, the network learning ends.
逻辑斯蒂函数(Logistic函数或Logistic曲线):Logistic函数或Logistic曲线是一种常见的S形函数,广义Logistic曲线可以模仿一些情况人口增长(P)的S形曲线。起初阶段大致是指数增长;然后随着开始变得饱和,增加变慢;最后,达到成熟时增加停止。Logistic function (Logistic function or Logistic curve): Logistic function or Logistic curve is a common S-shaped function, and the generalized Logistic curve can imitate the S-shaped curve of population growth (P) in some cases. The initial stage is roughly exponential growth; then the increase slows as the initiation becomes saturated; finally, the increase stops when maturity is reached.
本申请实施例可以基于人工智能技术对相关的数据进行获取和处理。其中,人工智能(Artificial Intelligence,AI)是利用数字计算机或者数字计算机控制的机器模拟、延伸和扩展人的智能,感知环境、获取知识并使用知识获得最佳结果的理论、方法、技术及应用系统。The embodiments of the present application may acquire and process relevant data based on artificial intelligence technology. Among them, artificial intelligence (AI) is a theory, method, technology and application system that uses digital computers or machines controlled by digital computers to simulate, extend and expand human intelligence, perceive the environment, acquire knowledge and use knowledge to obtain the best results. .
人工智能基础技术一般包括如传感器、专用人工智能芯片、云计算、分布式存储、大数据处理技术、操作/交互系统、机电一体化等技术。人工智能软件技术主要包括计算机视觉技术、机器人技术、生物识别技术、语音处理技术、自然语言处理技术以及机器学习/深度学习等几大方向。Artificial intelligence basic technologies generally include technologies such as sensors, dedicated artificial intelligence chips, cloud computing, distributed storage, big data processing technology, operation/interaction systems, and mechatronics. Artificial intelligence software technology mainly includes computer vision technology, robotics technology, biometrics technology, speech processing technology, natural language processing technology, and machine learning/deep learning.
本申请实施例可以基于医疗云技术实现对患者的历史病情的分析。其中,医疗云(Medical cloud),是指在云计算、移动技术、多媒体、4G通信、大数据、以及物联网等新技术基础上,结合医疗技术,使用“云计算”来创建医疗健康服务云平台,实现了医疗资源的共享和医疗范围的扩大。因为云计算技术的运用于结合,医疗云提高医疗机构的效率,方便居民就医。像现在医院的预约挂号、电子病历、医保等都是云计算与医疗领域结合的产物,医疗云还具有数据安全、信息共享、动态扩展、布局全局的优势。The embodiment of the present application can realize the analysis of the patient's historical condition based on the medical cloud technology. Among them, medical cloud refers to the use of "cloud computing" to create a medical and health service cloud based on new technologies such as cloud computing, mobile technology, multimedia, 4G communication, big data, and the Internet of Things, combined with medical technology. The platform realizes the sharing of medical resources and the expansion of medical coverage. Because of the combination of cloud computing technology, medical cloud improves the efficiency of medical institutions and facilitates residents to seek medical treatment. For example, appointment registration, electronic medical records, and medical insurance in hospitals are all products of the combination of cloud computing and the medical field. Medical cloud also has the advantages of data security, information sharing, dynamic expansion, and overall layout.
基于此,本申请实施例提供一种病情分析方法、装置、电子设备及存储介质,可以通过对患者的历史病情进行分析得到病情分析报告,为后续的诊断过程提供可参考的病理数据,提高了病情分析效率,同时也可以降低患者的就医成本和用药成本。Based on this, the embodiment of the present application provides a condition analysis method, device, electronic equipment and storage medium, which can obtain a condition analysis report by analyzing the patient's historical condition, and provide reference pathological data for the subsequent diagnosis process, improving the The efficiency of disease analysis can also reduce the cost of medical treatment and medication for patients.
本申请实施例提供的病情分析方法、装置、电子设备及存储介质,具体通过如下实施例进行说明,首先描述本申请实施例中的病情分析方法。The disease analysis method, device, electronic device, and storage medium provided in the embodiments of the present application are specifically described through the following embodiments. First, the disease analysis method in the embodiments of the present application is described.
本申请实施例提供的病情分析方法,涉及人工智能及数字医疗技术领域。本申请实施例提供的病情分析方法可应用于终端中,也可应用于服务器端中,还可以是运行于终端或服务器端中的软件。在一些实施例中,终端可以是智能手机、平板电脑、笔记本电脑、台式计算机等;服务器端可以配置成独立的物理服务器,也可以配置成多个物理服务器构成的服务器 集群或者分布式系统,还可以配置成提供云服务、云数据库、云计算、云函数、云存储、网络服务、云通信、中间件服务、域名服务、安全服务、CDN以及大数据和人工智能平台等基础云计算服务的云服务器;软件可以是实现病情分析方法的应用等,但并不局限于以上形式。The disease analysis method provided in the embodiment of the present application relates to the fields of artificial intelligence and digital medical technology. The disease analysis method provided in the embodiment of the present application can be applied to a terminal, can also be applied to a server, and can also be software running on a terminal or a server. In some embodiments, the terminal can be a smart phone, a tablet computer, a notebook computer, a desktop computer, etc.; the server end can be configured as an independent physical server, or can be configured as a server cluster or a distributed system composed of multiple physical servers, or It can be configured as a cloud that provides basic cloud computing services such as cloud services, cloud databases, cloud computing, cloud functions, cloud storage, network services, cloud communications, middleware services, domain name services, security services, CDN, and big data and artificial intelligence platforms. The server; the software can be the application of the disease analysis method, etc., but is not limited to the above forms.
图1是本申请实施例提供的病情分析方法的一个可选的流程图,图1中的方法可以包括但不限于包括步骤S101至步骤S106。Fig. 1 is an optional flowchart of the disease analysis method provided by the embodiment of the present application. The method in Fig. 1 may include but not limited to include steps S101 to S106.
步骤S101,获取电子病历数据;Step S101, obtaining electronic medical record data;
步骤S102,对电子病历数据进行实体特征提取,得到目标病情信息;Step S102, extracting entity features from electronic medical record data to obtain target disease information;
步骤S103,利用预先训练的病情识别模型对目标病情信息进行处理,生成与目标病情信息对应的治疗提醒;Step S103, using the pre-trained disease identification model to process the target disease information, and generate a treatment reminder corresponding to the target disease information;
步骤S104,接收用户端根据治疗提醒响应的治疗反馈数据;Step S104, receiving the treatment feedback data that the client responds to according to the treatment reminder;
步骤S105,根据治疗反馈数据和目标病情信息,生成诊断结论标签;Step S105, generating a diagnosis conclusion label according to the treatment feedback data and the target condition information;
步骤S106,根据诊断结论标签生成病情分析报告。Step S106, generating a disease analysis report according to the diagnosis conclusion label.
经过以上步骤S101至步骤S106,首先根据患者上传的个人历史病历生成电子病历,获取电子病历数据,其中,电子病历数据包括患者信息、就医记录、用药记录等等。对电子病历数据进行实体特征提取,得到目标病情信息,其中,目标病情信息包括患者的年龄、性别、基本健康指标、就医诊断数据、用药记录以及不良药物反应信息、手术记录等等。这一方式能够实现对电子病历数据的特征抽取,缩小数据总量,使得更为方便提取到所需要的病情信息。利用预先训练的病情识别模型对上述目标病情信息进行处理,对患者的历史目标病情进行识别,生成与该目标病情信息对应的治疗提醒,并将这一治疗提醒反馈至患者,以便患者根据治疗提醒进行就医或者服用药物。在将治疗提醒反馈至患者之后,还需要获取来自患者的治疗反馈,例如,可以接收用户端根据治疗提醒响应的治疗反馈数据。通过对治疗反馈数据和目标病情信息进行分析,得到诊断结论标签,能够较为方便地评估出治疗效果;需要说明的是,在对治疗反馈数据和目标病情信息进行分析时,可以采用大数据分析模型来对治疗反馈数据与目标病情信息进行匹配,根据匹配情况生成对应的诊断结论标签,最后根据诊断结论标签生成病情分析报告,通过对患者的历史病情进行分析得到病情分析报告,为后续的诊断过程提供可参考的病理数据,提高了病情分析效率,同时也可以降低患者的就医成本和用药成本。After the above steps S101 to S106, firstly, an electronic medical record is generated according to the personal historical medical records uploaded by the patient, and the electronic medical record data is obtained, wherein the electronic medical record data includes patient information, medical records, medication records and so on. The entity feature extraction is performed on the electronic medical record data to obtain the target condition information, among which the target condition information includes the patient's age, gender, basic health indicators, medical diagnosis data, medication records, adverse drug reaction information, operation records, etc. This method can realize the feature extraction of electronic medical record data, reduce the total amount of data, and make it easier to extract the required disease information. Use the pre-trained condition recognition model to process the above target condition information, identify the patient's historical target condition, generate a treatment reminder corresponding to the target condition information, and feed back this treatment reminder to the patient, so that the patient can follow the treatment reminder Seek medical attention or take medication. After the treatment reminder is fed back to the patient, it is also necessary to obtain treatment feedback from the patient, for example, the treatment feedback data that the user terminal responds to according to the treatment reminder can be received. By analyzing the treatment feedback data and target condition information, the diagnostic conclusion label can be obtained, and the treatment effect can be evaluated more conveniently; it should be noted that when analyzing the treatment feedback data and target condition information, a big data analysis model can be used To match the treatment feedback data with the target condition information, generate the corresponding diagnostic conclusion label according to the matching situation, and finally generate the condition analysis report according to the diagnosis conclusion label, and obtain the condition analysis report by analyzing the patient's historical condition, which will be used for the subsequent diagnosis process Provide reference pathological data, improve the efficiency of disease analysis, and also reduce the cost of medical treatment and medication for patients.
在一些医学应用场景中,在一种可能的实现方式中,上述数据是医疗数据,如个人健康档案、处方、检查报告等数据。In some medical application scenarios, in a possible implementation manner, the above data is medical data, such as personal health records, prescriptions, examination reports and other data.
请参阅图2,在一些实施例中,步骤S102可以包括但不限于包括步骤S201至步骤S205:Referring to FIG. 2, in some embodiments, step S102 may include but not limited to include steps S201 to S205:
步骤S201,提取电子病历数据中的自然语言文本;Step S201, extracting natural language text in electronic medical record data;
步骤S202,利用预设的词法分析模型识别自然语言文本中的实体特征;Step S202, using a preset lexical analysis model to identify entity features in the natural language text;
步骤S203,对自然语言文本进行分割处理,得到自然语言词汇序列;Step S203, segmenting the natural language text to obtain a natural language vocabulary sequence;
步骤S204,根据实体特征和自然语言词汇序列,构建特征序列表;Step S204, constructing a list of feature sequences according to entity features and natural language vocabulary sequences;
步骤S205,根据特征序列表确定目标病情信息。Step S205, determine the target condition information according to the feature sequence table.
具体地,首先将电子病历数据中的非结构化数据转化为统一的结构化数据,从结构化数据中提取所需要的自然语言文本。利用预设的词法分析模型识别自然语言文本中的实体特征。例如,预先构建医学词库,该医学词库可以包括各类医学病理相关的医学名、医学术语、非医学名称、民间俗称、国际医学名词等等。通过这一医学词库,预设的词法分析模型可以将医学特定名称进行列举。将自然语言文本输入至预设的词法分析模型中,通过预设的词法分析模型中包含的医学特定名称以及预设的词性类别,对自然语言文本中的实体特征进行识别,该实体特征可以包括上述与医学病理相关的医学名、医学术语、非医学名称、民间俗称、国际医学名词、修饰词、时间信息等多个维度的实体词汇。为了更准确地提取实体特征,还可以基于bi-LSTM算法构建序列分类器,在基于bi-LSTM算法的模型中,输入单词wi和字符嵌入,通过左到右的长短记忆和右向左的长短时记忆,使得在输出被连接的位置生成单一的输出层。序列分类器通过这一输出层可以将输入的实体特征直接传递到softmax分类器上,通过 softmax分类器在预设的标签上创建一个概率分布,从而根据概率分布对实体参数进行标记分类,最后对分类处理之后的实体特征进行特征提取,得到所需要的实体特征。另外,为了实现数据存储,还可以采用BERT编码器,通过预设的编码函数将实体特征串由文本形式转化为编码形式,以实现实体特征的存储。另外,除了需要从自然语言文本进行实体特征提取以外,还需要对自然语言文本进行分割处理,以得到自然语言词汇序列。例如,根据词性条件,对自然语言文本中的词段赋予对应的词性,例如,名词、动词、形容词等等。根据基础自然语言的词根、词性将连续的自然语言文本切分成具有语义合理性和完整性的自然语言词汇序列。进而,根据实体特征与自然语言词汇序列在词性、词根上的对应关系,构建出特征序列表,根据特征序列表对电子病历数据中的数据进行搜索和对比,可以较为方便地确定出目标病情信息。Specifically, the unstructured data in the electronic medical record data is first converted into unified structured data, and the required natural language text is extracted from the structured data. Use the preset lexical analysis model to identify entity features in natural language text. For example, a medical thesaurus is pre-built, and the medical thesaurus may include medical names, medical terms, non-medical names, folk names, international medical terms, etc. related to various medical pathologies. Through this medical lexicon, the preset lexical analysis model can enumerate medical specific names. Input the natural language text into the preset lexical analysis model, and identify the entity features in the natural language text through the medical specific names contained in the preset lexical analysis model and the preset part-of-speech categories, and the entity features may include The above-mentioned multi-dimensional entity vocabulary related to medical pathology, such as medical names, medical terms, non-medical names, folk names, international medical terms, modifiers, and time information. In order to extract entity features more accurately, a sequence classifier can also be built based on the bi-LSTM algorithm. In the model based on the bi-LSTM algorithm, the word wi and character embedding are input, and the long and short memory from left to right and the length from right to left are used. time memory, so that a single output layer is generated where the outputs are concatenated. The sequence classifier can pass the input entity features directly to the softmax classifier through this output layer, and create a probability distribution on the preset label through the softmax classifier, so as to mark and classify the entity parameters according to the probability distribution, and finally The entity features after the classification process are subjected to feature extraction to obtain the required entity features. In addition, in order to achieve data storage, the BERT encoder can also be used to convert the entity feature string from text form to encoded form through the preset encoding function to realize the storage of entity features. In addition, in addition to extracting entity features from natural language texts, it is also necessary to segment natural language texts to obtain natural language vocabulary sequences. For example, according to the part-of-speech condition, a corresponding part-of-speech is assigned to a word segment in the natural language text, such as noun, verb, adjective, and so on. According to the root and part of speech of the basic natural language, the continuous natural language text is segmented into natural language lexical sequences with semantic rationality and integrity. Furthermore, according to the corresponding relationship between entity features and natural language lexical sequences in parts of speech and roots, a feature sequence table is constructed, and the data in the electronic medical record data can be searched and compared according to the feature sequence table, and the target disease information can be determined more conveniently. .
在一种可能的实现方式中,上述自然语言文本为医疗文本,医疗文本可以是医疗电子记录(Electronic Healthcare Record),电子化的个人健康记录,包括病历、心电图、医学影像等一系列具备保存备查价值的电子化记录。In a possible implementation, the above-mentioned natural language text is a medical text, and the medical text can be an electronic healthcare record (Electronic Healthcare Record), an electronic personal health record, including medical records, electrocardiograms, medical images, etc. An electronic record of value.
请参阅图3,在一些实施例中,步骤S103可以包括但不限于包括步骤S301或者步骤S303:Referring to FIG. 3, in some embodiments, step S103 may include but not limited to include step S301 or step S303:
步骤S301,将目标病情信息输入至预先训练的病情识别模型中;Step S301, inputting the target condition information into the pre-trained condition identification model;
步骤S302,通过病情识别模型对目标病情信息进行象限分区,得到分区病情信息;Step S302, performing quadrant partitioning on the target disease information through the disease identification model to obtain the partitioned disease information;
步骤S303,对每一象限的分区病情信息进行拟合处理,生成用药提醒或者就医提醒。Step S303 , performing a fitting process on the partitioned disease information of each quadrant to generate a medication reminder or a medical reminder.
在一些实施例中,可以利用BP算法进行深度学习以建立病情识别模型,将目标病情信息输入至预先训练的病情识别模型中,通过病情识别模型对目标病情信息进行象限分区,利用象限对目标病情信息进行任务区分,得到分区病情信息。例如,根据四象限法则,对目标病情信息中的病情数据进行优先级分类,得到不同重要等级的分区病情信息。进而,通过拟合函数对每一象限的分区病情信息进行拟合处理,并将拟合结果输出到表象层进行准确率及优秀的智能体的判断以进行迭代优化,当迭代优化至满足预设的迭代条件时,停止迭代优化,输出拟合结果,并根据拟合结果生成对应的用药提醒或者就医提醒。需要说明的是,预设的迭代条件可是迭代次数达到预设次数等等,也可以是其他,不限于此。另外,病情识别模型为多层神经网络,该多层神经网络含有两层处理单元和两个隐藏层,且每一个反馈只能发送到前面的输出层或隐藏层。该多层神经网络是一个采用后向传播算法进行学习分类或者预测的神经网络。具体地,该病情识别模型的构建过程如下:In some embodiments, the BP algorithm can be used for deep learning to establish a disease recognition model, and the target disease information is input into the pre-trained disease recognition model. The information is divided into tasks, and the condition information of the partitions is obtained. For example, according to the four-quadrant rule, the disease data in the target disease information is prioritized to obtain partitioned disease information of different importance levels. Furthermore, the condition information of each quadrant is fitted through the fitting function, and the fitting result is output to the appearance layer for the judgment of accuracy and excellent agent for iterative optimization. When the iterative optimization meets the preset When the iterative condition is met, the iterative optimization is stopped, the fitting result is output, and the corresponding medication reminder or medical reminder is generated according to the fitting result. It should be noted that the preset iteration condition may be that the number of iterations reaches the preset number, etc., or other conditions, and is not limited thereto. In addition, the disease recognition model is a multi-layer neural network, which contains two layers of processing units and two hidden layers, and each feedback can only be sent to the previous output layer or hidden layer. The multi-layer neural network is a neural network that uses a backward propagation algorithm to learn classification or prediction. Specifically, the construction process of the disease recognition model is as follows:
第一步:以第一个训练元祖X={1,0,1}其类标号为1;初始化神经网络的所有权重和偏倚;网络的权重一般初始为小随机数(例如,-1.0到1.0);The first step: take the first training ancestor X={1,0,1} and its class label is 1; initialize all weights and biases of the neural network; the weights of the network are generally initially small random numbers (for example, -1.0 to 1.0 );
第二步:在终止循环条件下循环每个训练元祖,设X={1,0,1},其类标号Y为1;The second step: loop each training tuple ancestor under the termination loop condition, set X={1,0,1}, and its class label Y is 1;
第三步:循环输入单元,其中,输入单元的输入=输出,及I 1=O 1=1;I 2=O 2=0;I 3=O 2=1;第四步:计算隐藏层或输出层的输入和输出,其中,输入公式如公式(1)所示,输出公式如公式(2)所示,按照输入和输出公式,可以分别求预设节点的输入I j和输出θ jThe third step: cyclic input unit, wherein, the input of the input unit = output, and I 1 =O 1 =1; I 2 =O 2 =0; I 3 =O 2 =1; the fourth step: calculate the hidden layer or The input and output of the output layer, wherein, the input formula is as shown in formula (1), and the output formula is as shown in formula (2), according to the input and output formulas, the input I j and output θ j of the preset node can be obtained respectively;
I j=Σ iw ijO ij公式(1) I j = Σ i w ij O i + θ j Formula (1)
其中,w ij是由上一层的单元i到单元j的连接的权重;O i是上一层的单元i的输出,而θ j是单元j的偏倚,偏倚作为阈值,用来改变单元的活性; Among them, w ij is the weight of the connection from unit i to unit j in the previous layer; O i is the output of unit i in the previous layer, and θ j is the bias of unit j, which is used as a threshold to change the unit’s active;
O j=1+(1+E[I j])公式(2) O j =1+(1+E[I j ]) Formula (2)
第五步:分别计算隐藏节点和输出节点的误差,其中,输出层误差公式如公式(3)所示;隐藏层公式如公式(4)所示;Step 5: Calculate the errors of hidden nodes and output nodes respectively, where the output layer error formula is shown in formula (3); the hidden layer formula is shown in formula (4);
Figure PCTCN2022087710-appb-000001
Figure PCTCN2022087710-appb-000001
其中,O j是单元j的实际输出,而T j是j给定训练元组的已知目标值,需要说明的是,O j(1-O j)是逻辑斯蒂函数的导数; Among them, O j is the actual output of unit j, and T j is the known target value of j given the training tuple. It should be noted that O j (1-O j ) is the derivative of the logistic function;
Figure PCTCN2022087710-appb-000002
Figure PCTCN2022087710-appb-000002
其中,w jk是由下一较高层的单元k到单元j的连接的权重,而
Figure PCTCN2022087710-appb-000003
是单元k的误差。
where wjk is the weight of the connection from unit k to unit j in the next higher layer, and
Figure PCTCN2022087710-appb-000003
is the error of unit k.
第六步:提取淘汰下的智能体并将对应的自然语言文本分割后对应的词根词性进行回收。Step 6: Extract the eliminated agents and recycle the corresponding root parts of speech after segmenting the corresponding natural language text.
通过上述过程能够构建出符合需求的病情识别模型,进而,将参考病情信息输入至这一病情识别模型中,通过病情识别模型对参考病情信息进行象限分区,根据四象限法则对参考病情信息进行任务区分,对参考病情信息中的病情数据进行优先级分类,得到不同重要等级的分区病情信息。进而,通过拟合函数对每一象限的分区参考病情信息进行拟合处理,并将拟合结果输出到表象层进行准确率及优秀的智能体的判断以进行迭代优化,当迭代优化至满足预设的迭代条件时,停止迭代优化,从而完成对病理识别模型的训练。最后通过训练后的病情识别模型对患者的历史目标病情进行识别,具体地,将目标病情信息输入至训练后的病情识别模型中,得到拟合结果,并根据拟合结果生成与该目标病情信息对应的治疗提醒,并将这一治疗提醒反馈至患者,以便患者根据治疗提醒进行就医或者服用药物。Through the above process, a condition identification model that meets the requirements can be constructed, and then the reference condition information is input into the condition identification model, and the reference condition information is divided into quadrants through the condition identification model, and the reference condition information is assigned tasks according to the four-quadrant rule. Differentiate, and classify the disease data in the reference disease information according to priority, and obtain the partitioned disease information of different importance levels. Furthermore, the fitting function is used to fit the partition reference disease information of each quadrant, and the fitting result is output to the appearance layer for the judgment of accuracy and excellent agent for iterative optimization. When the iterative optimization meets the expected When the iterative condition is set, the iterative optimization is stopped, so as to complete the training of the pathological recognition model. Finally, the patient's historical target condition is identified through the trained condition recognition model. Specifically, the target condition information is input into the trained condition recognition model to obtain the fitting result, and the target condition information is generated according to the fitting result. The corresponding treatment reminder is given, and the treatment reminder is fed back to the patient, so that the patient can seek medical treatment or take medicine according to the treatment reminder.
请参阅图4,在一些实施例中,在步骤S104之后,该方法可以包括但不限于包括步骤S401至步骤S402:Referring to FIG. 4, in some embodiments, after step S104, the method may include, but is not limited to, step S401 to step S402:
步骤S401,对目标病情信息中的药物数据进行分析,得到药物反应信息;其中,药物反应信息包括不良药物反应信息;Step S401, analyzing the drug data in the target condition information to obtain drug reaction information; wherein, the drug reaction information includes adverse drug reaction information;
步骤S402,根据不良药物反应信息识别出治疗反馈数据中的异常用药数据。Step S402, identifying abnormal medication data in the treatment feedback data according to the adverse drug reaction information.
具体地,为了更好地对患者的历史病情进行分析,还需要对目标病情信息中的药物数据进行分析,将患者历史就医记录和历史用药记录中包含的药物数据进行提取,可以是通过基于关键词的文本自动生成模型来对药物数据进行提取,该文本自动生成模型可以根据输入数据的类型,进行不同的数据处理。将历史就医记录和历史用药记录中的关键词或者文本语句或者字段等等输入至预先训练的文本自动生成模型,若输入的关键词、文本语句或者字段与预设的参考文本能够进行匹配,表明当前输入符合要求,若当前输入的是关键词,则在基础语料库中选择和输入关键词相同的语句集合,根据这一语句集合生成对应的药物数据字段。若当前输入的是文本语句或字段,则需要在基础语料库中选取候选语句,并确定选取的候选语句是否符合要求;其中,候选语句为基础语料库中与输入的文本语句或字段相似度大于预设阈值的语句,若选取的候选语句符合要求,则根据候选语句直接生成药物数据字段;若选取的候选语句不符合要求,则对候选语句进行语句补充,例如,填充同义词、或者根据对应的输入信息对候选语句进行复写补充等等,并根据补充之后的候选语句生成药物数据字段。进而,基于国内外文献、医学资料库、医疗数据平台等等中的相关数据对这些药物数据字段进行分析,得到药物反应信息,药物反应信息包括不良药物反应信息、药物疗效信息、用药指导信息等等。进而通过将不良药物反应信息和治疗反馈数据进行对比分析,若治疗反馈数据中涉及到不良药物反应信息相匹配的数据,则可以确定该数据为异常用药数据,通过这一方式能够较为方便地识别异常用药数据,并在治疗反馈数据中对异常用药数据进行标记,以对后续的诊断治疗起到提示作用,避免再次出现异常用药情况,提高用药可靠性。Specifically, in order to better analyze the patient's historical condition, it is also necessary to analyze the drug data in the target condition information, and extract the drug data contained in the patient's historical medical records and historical medication records, which can be based on key The automatic text generation model of words is used to extract drug data, and the automatic text generation model can perform different data processing according to the type of input data. Input keywords or text sentences or fields in historical medical records and historical medication records to the pre-trained text automatic generation model, if the input keywords, text sentences or fields can match the preset reference text, it indicates The current input meets the requirements. If the current input is a keyword, select the same sentence set as the input keyword in the basic corpus, and generate the corresponding drug data field according to this sentence set. If the current input is a text sentence or field, you need to select a candidate sentence in the basic corpus, and determine whether the selected candidate sentence meets the requirements; wherein, the candidate sentence is that the similarity between the basic corpus and the input text sentence or field is greater than the preset Threshold sentence, if the selected candidate sentence meets the requirements, the drug data field will be directly generated according to the candidate sentence; if the selected candidate sentence does not meet the requirements, the sentence supplement will be performed on the candidate sentence, for example, filling in synonyms, or according to the corresponding input information The candidate sentence is copied and supplemented, and the drug data field is generated according to the supplemented candidate sentence. Furthermore, based on relevant data in domestic and foreign literature, medical databases, medical data platforms, etc., these drug data fields are analyzed to obtain drug response information, which includes adverse drug reaction information, drug efficacy information, and medication guidance information. wait. Furthermore, by comparing and analyzing the adverse drug reaction information and the treatment feedback data, if the treatment feedback data involves data that matches the adverse drug reaction information, it can be determined that the data is abnormal drug use data, which can be more conveniently identified Abnormal medication data, and mark the abnormal medication data in the treatment feedback data, so as to play a prompt role in subsequent diagnosis and treatment, avoid the occurrence of abnormal medication again, and improve the reliability of medication.
另外,在一些具体实施例中,步骤S104之后,该方法还包括但不限于包括:In addition, in some specific embodiments, after step S104, the method also includes, but is not limited to:
对治疗反馈数据、目标病情信息进行可视化处理,生成可视化数据图。Visualize the treatment feedback data and target disease information to generate a visualized data map.
具体地,可以通过数据管理层中的logback组件(开源日志组件)将治疗反馈数据和目标病情信息写入到本地文件中,通过拦截器记录包括治疗反馈数据和目标病情信息的接口日志,通过数据管理层中的日志收集系统(Flume)分别对治疗反馈数据和目标病情信息进行可视化处理,得到对应的目标可视化数据,将这些目标可视化数据输入到数据管理层中的Hive数据库与HBase数据库中进行整合,最后通过数据管理层中的Sqoop传送组件把目标可视化 数据从PostgreSql数据库导入Hive数据库,生成对应的图表,并根据预先设定的词性等级序列,对这一系列的图表进行等级排序,生成可视化数据图。例如,预设词性等级为名词等级高于动词、动词等级高于形容词等等;可视化数据图包括病情信息来源、就医区域、患者基本信息、初诊情况、复诊率、用药情况等等。通过可视化数据图能够较为方便地将患者的历史病情呈现出来,以便患者和医生进行查阅。Specifically, the treatment feedback data and target condition information can be written into local files through the logback component (open source log component) in the data management layer, and the interface log including treatment feedback data and target condition information can be recorded through the interceptor. The log collection system (Flume) in the management layer visualizes the treatment feedback data and the target disease information respectively to obtain the corresponding target visualization data, and then inputs these target visualization data into the Hive database and the HBase database in the data management layer for integration , and finally import the target visualization data from the PostgreSql database into the Hive database through the Sqoop transmission component in the data management layer, generate corresponding charts, and sort the series of charts according to the preset part-of-speech ranking sequence to generate visual data picture. For example, the preset part-of-speech level is that the level of nouns is higher than that of verbs, and the level of verbs is higher than that of adjectives. Through the visualized data graph, the historical condition of the patient can be presented more conveniently, so that the patient and the doctor can consult it.
请参阅图5,在一些实施例的步骤S105可以包括但不限于包括步骤S501至步骤S502:Referring to FIG. 5, step S105 in some embodiments may include, but is not limited to, step S501 to step S502:
步骤S501,分别对治疗反馈数据和目标病情信息进行编码处理,得到编码形式的治疗反馈数据和编码形式的目标病情信息;Step S501, encoding treatment feedback data and target condition information respectively to obtain treatment feedback data in encoded form and target condition information in encoded form;
步骤S502,利用预设的大数据分析模型对编码形式的治疗反馈数据和编码形式的目标病情信息进行数据分析,生成诊断结论标签。Step S502, using the preset big data analysis model to perform data analysis on the coded treatment feedback data and the coded target condition information to generate a diagnostic conclusion label.
在一些实施例中,可以通过预设的编码器分别对治疗反馈数据和目标病情信息进行编码处理,该预设的编码器可以是基于BERT的编码器,即通过获取治疗反馈数据和目标病情信息,并对治疗反馈数据和目标病情信息进行令牌化处理,构建出BERT令牌生成器,对BERT令牌生成器进行预训练,形成符合需求的BERT编码器,使得BERT编码器能够通过预设的编码函数将治疗反馈数据和目标病情信息由文本形式转化为编码形式,得到编码形式的治疗反馈数据和编码形式的目标病情信息。进而,通过预设的大数据分析模型中的协同过滤算法计算编码形式的治疗反馈数据与编码形式的目标病情信息的相似度大小。根据相似度大小对编码形式的治疗反馈数据与编码形式的目标病情信息进行关联匹配,生成对应的诊断结论标签。需要说明的是,该协同过滤算法可以是杰卡德相似系数法、夹角余弦法或者欧式距离、曼哈顿距离等相似性度量方法,不做限制。在一些其他实施例中,预设的大数据分析模型也可以采用关联分析算法,通过关联分析算法来对对编码形式的治疗反馈数据与编码形式的目标病情信息进行关联匹配。常用的关联分析算法包括Apriori算法、FP-growth算法等等。这一方式提高了数据分析效率,也提高了治疗反馈数据与目标病情信息的匹配准确性。In some embodiments, the treatment feedback data and the target condition information can be respectively encoded by a preset encoder, and the preset encoder can be a BERT-based encoder, that is, by obtaining the treatment feedback data and the target condition information , and tokenize treatment feedback data and target condition information, build a BERT token generator, pre-train the BERT token generator, and form a BERT encoder that meets the requirements, so that the BERT encoder can pass the preset The encoding function converts the treatment feedback data and the target condition information from the text form into the code form, and obtains the treatment feedback data in the code form and the target condition information in the code form. Furthermore, the collaborative filtering algorithm in the preset big data analysis model is used to calculate the similarity between the treatment feedback data in the coded form and the target disease information in the coded form. According to the degree of similarity, the treatment feedback data in the coded form and the target disease information in the coded form are correlated and matched to generate corresponding diagnostic conclusion labels. It should be noted that the collaborative filtering algorithm may be a Jaccard similarity coefficient method, an included angle cosine method, or a similarity measurement method such as Euclidean distance or Manhattan distance, without limitation. In some other embodiments, the preset big data analysis model may also use an association analysis algorithm to associate and match the encoded treatment feedback data and the encoded target disease information. Commonly used correlation analysis algorithms include Apriori algorithm, FP-growth algorithm and so on. This method improves the efficiency of data analysis, and also improves the matching accuracy of treatment feedback data and target disease information.
请参阅图6,在一些实施例中,步骤S106可以包括但不限于包括步骤S601至步骤S603:Referring to FIG. 6, in some embodiments, step S106 may include but not limited to include steps S601 to S603:
步骤S601,根据目标病情信息对诊断结论标签进行提纯处理,得到提纯后的诊断结论标签;Step S601, purifying the diagnostic conclusion label according to the target condition information to obtain the purified diagnostic conclusion label;
步骤S602,对提纯后的诊断结论标签进行验证分析,得到标准诊断结论标签;Step S602, verifying and analyzing the purified diagnosis conclusion label to obtain a standard diagnosis conclusion label;
步骤S603,根据标准诊断结论标签,生成病情分析报告。Step S603, generating a disease analysis report according to the standard diagnosis conclusion label.
具体地,为了提高诊断结论标签的准确性,还可以预设select语句函数,在预设的select语句函数中填充所需要的选项数据,利用已填充选项数据的select语句函数摘录目标病情信息中的数据集,通过数据集与诊断结论标签进行比对,剔除异常诊断结论标签,得到提纯后的诊断结论标签。其中,对异常诊断结论标签的判断可以根据数据集与诊断结论标签的相似度来确定,若两者的相似度小于预设的相似度阈值,则确定该诊断结论标签为异常诊断结论标签。进而,对提纯后的诊断结论标签进行验证分析,将提纯后的诊断结论标签对应的目标病情信息和治疗反馈数据进行复核,对目标病情信息与治疗反馈数据进行修正调整,得到标准诊断结论标签,最终根据标准诊断结论标签生成病情分析报告,通过对患者的历史病情进行分析得到病情分析报告,为后续的诊断过程提供可参考的病理数据,提高了病情分析效率,同时也可以降低患者的就医成本和用药成本。Specifically, in order to improve the accuracy of the diagnostic conclusion label, the select statement function can also be preset, fill in the required option data in the preset select statement function, and use the select statement function filled with option data to extract the target disease information. The data set is compared with the diagnosis conclusion label, and the abnormal diagnosis conclusion label is eliminated to obtain the purified diagnosis conclusion label. Among them, the judgment of the abnormal diagnosis conclusion label can be determined according to the similarity between the data set and the diagnosis conclusion label. If the similarity between the two is less than the preset similarity threshold, the diagnosis conclusion label is determined to be an abnormal diagnosis conclusion label. Furthermore, the purified diagnostic conclusion label is verified and analyzed, the target condition information and treatment feedback data corresponding to the purified diagnostic conclusion label are reviewed, and the target condition information and treatment feedback data are corrected and adjusted to obtain a standard diagnosis conclusion label. Finally, the condition analysis report is generated according to the standard diagnosis conclusion label, and the condition analysis report is obtained by analyzing the patient's historical condition, which provides reference pathological data for the subsequent diagnosis process, improves the efficiency of condition analysis, and can also reduce the cost of medical treatment for patients and drug costs.
本申请实施例通过获取电子病历数据,对电子病历数据进行实体特征提取,得到目标病情信息,这一方式能够实现对电子病历数据的特征抽取,缩小数据总量,使得更为方便提取到所需要的病情信息;进而利用预先训练的病情识别模型对目标病情信息进行处理,生成与目标病情信息对应的治疗提醒,以便患者根据治疗提醒进行用药或者就医。而后接收用户端根据治疗提醒响应的治疗反馈数据,对治疗反馈数据和目标病情信息进行分析,得到诊断结论标签,能够较为方便地评估出治疗效果;最后根据诊断结论标签生成病情分析报告,通过对患者的历史病情进行分析得到病情分析报告,为后续的诊断过程提供可参考的病理数据,提高了病情分析效率,同时也可以降低患者的就医成本和用药成本。In the embodiment of the present application, by obtaining electronic medical record data, extracting entity features from electronic medical record data, and obtaining target disease information, this method can realize feature extraction of electronic medical record data, reduce the total amount of data, and make it more convenient to extract the required data. The patient’s condition information; and then use the pre-trained condition recognition model to process the target condition information, and generate a treatment reminder corresponding to the target condition information, so that the patient can take medication or seek medical treatment according to the treatment reminder. Then receive the treatment feedback data from the user terminal according to the treatment reminder response, analyze the treatment feedback data and the target condition information, and obtain the diagnosis conclusion label, which can evaluate the treatment effect more conveniently; finally, generate the condition analysis report according to the diagnosis conclusion label, through the The patient's historical condition is analyzed to obtain a condition analysis report, which provides reference pathological data for the subsequent diagnosis process, improves the efficiency of condition analysis, and can also reduce the cost of medical treatment and medication for patients.
请参阅图7,本申请实施例还提供一种病情分析装置,可以实现上述病情分析方法,该装置包括:Please refer to Figure 7, the embodiment of the present application also provides a disease analysis device, which can implement the above disease analysis method, the device includes:
电子病历数据获取模块701,用于获取电子病历数据;Electronic medical record data acquisition module 701, used to acquire electronic medical record data;
特征提取模块702,用于对电子病历数据进行实体特征提取,得到目标病情信息;A feature extraction module 702, configured to extract entity features from electronic medical record data to obtain target disease information;
处理模块703,用于利用预先训练的病情识别模型对目标病情信息进行处理,生成与目标病情信息对应的治疗提醒;The processing module 703 is configured to use the pre-trained disease recognition model to process the target disease information, and generate a treatment reminder corresponding to the target disease information;
治疗反馈数据接收模块704,用于接收用户端根据治疗提醒响应的治疗反馈数据;The treatment feedback data receiving module 704 is used to receive the treatment feedback data that the client responds to according to the treatment reminder;
诊断结论标签生成模块705,用于根据治疗反馈数据和目标病情信息,生成诊断结论标签;The diagnosis conclusion label generation module 705 is used for generating diagnosis conclusion labels according to the treatment feedback data and the target condition information;
病情分析报告生成模块706,用于根据诊断结论标签生成病情分析报告。The disease analysis report generating module 706 is configured to generate a disease analysis report according to the diagnosis conclusion label.
本申请实施例还提供了一种电子设备,电子设备包括:存储器、处理器、存储在存储器上并可在处理器上运行的程序以及用于实现处理器和存储器之间的连接通信的数据总线,程序被处理器执行时实现上述病情分析方法。该电子设备可以为包括平板电脑、车载电脑等任意智能终端。The embodiment of the present application also provides an electronic device, the electronic device includes: a memory, a processor, a program stored in the memory and operable on the processor, and a data bus for realizing connection and communication between the processor and the memory , when the program is executed by the processor, the above disease analysis method is realized. The electronic device may be any intelligent terminal including a tablet computer, a vehicle-mounted computer, and the like.
请参阅图8,图8示意了另一实施例的电子设备的硬件结构,电子设备包括:Please refer to FIG. 8. FIG. 8 illustrates a hardware structure of an electronic device in another embodiment. The electronic device includes:
处理器801,可以采用通用的CPU(CentralProcessingUnit,中央处理器)、微处理器、应用专用集成电路(ApplicationSpecificIntegratedCircuit,ASIC)、或者一个或多个集成电路等方式实现,用于执行相关程序,以实现本申请实施例所提供的技术方案;The processor 801 may be implemented by a general-purpose CPU (Central Processing Unit, central processing unit), a microprocessor, an application-specific integrated circuit (Application Specific Integrated Circuit, ASIC), or one or more integrated circuits, and is used to execute related programs to realize The technical solutions provided by the embodiments of the present application;
存储器802,可以采用只读存储器(ReadOnlyMemory,ROM)、静态存储设备、动态存储设备或者随机存取存储器(RandomAccessMemory,RAM)等形式实现。存储器802可以存储操作系统和其他应用程序,在通过软件或者固件来实现本说明书实施例所提供的技术方案时,相关的程序代码保存在存储器802中,并由处理器801来调用执行一种病情分析方法;The memory 802 may be implemented in the form of a read-only memory (ReadOnlyMemory, ROM), a static storage device, a dynamic storage device, or a random access memory (RandomAccessMemory, RAM). The memory 802 can store operating systems and other application programs. When implementing the technical solutions provided by the embodiments of this specification through software or firmware, the relevant program codes are stored in the memory 802, and are invoked by the processor 801 to execute a condition Analytical method;
输入/输出接口803,用于实现信息输入及输出;The input/output interface 803 is used to realize information input and output;
通信接口804,用于实现本设备与其他设备的通信交互,可以通过有线方式(例如USB、网线等)实现通信,也可以通过无线方式(例如移动网络、WIFI、蓝牙等)实现通信;和The communication interface 804 is used to realize the communication interaction between the device and other devices, and the communication can be realized through a wired method (such as USB, network cable, etc.), or can be realized through a wireless method (such as a mobile network, WIFI, Bluetooth, etc.); and
总线805,在设备的各个组件(例如处理器801、存储器802、输入/输出接口803和通信接口804)之间传输信息;A bus 805, which transmits information between various components of the device (such as a processor 801, a memory 802, an input/output interface 803, and a communication interface 804);
其中处理器801、存储器802、输入/输出接口803和通信接口804通过总线805实现彼此之间在设备内部的通信连接。The processor 801 , the memory 802 , the input/output interface 803 and the communication interface 804 are connected to each other within the device through the bus 805 .
其中,本申请实施例所提供的病情分析方法包括:Wherein, the condition analysis method provided by the embodiment of the present application includes:
获取电子病历数据;Access to electronic medical record data;
对电子病历数据进行实体特征提取,得到目标病情信息;Extract entity features from electronic medical record data to obtain target disease information;
利用预先训练的病情识别模型对目标病情信息进行处理,生成与目标病情信息对应的治疗提醒;Use the pre-trained disease recognition model to process the target disease information and generate treatment reminders corresponding to the target disease information;
接收用户端根据治疗提醒响应的治疗反馈数据;Receive the treatment feedback data that the client responds to according to the treatment reminder;
根据治疗反馈数据和目标病情信息,生成诊断结论标签;Generate diagnostic conclusion labels based on treatment feedback data and target condition information;
根据诊断结论标签生成病情分析报告。Generate a condition analysis report based on the diagnostic conclusion label.
本申请实施例还提供了一种计算机可读存储介质,用于计算机可读存储,计算机可读存储介质可以是非易失性,也可以是易失性。计算机可读存储介质存储有一个或者多个程序,一个或者多个程序可被一个或者多个处理器执行,以实现一种病情分析方法,其中,病情分析方法包括:获取电子病历数据;对电子病历数据进行实体特征提取,得到目标病情信息;利用预先训练的病情识别模型对目标病情信息进行处理,生成与目标病情信息对应的治疗提醒;接收用户端根据治疗提醒响应的治疗反馈数据;根据治疗反馈数据和目标病情信息,生成诊断结论标签;根据诊断结论标签生成病情分析报告。An embodiment of the present application also provides a computer-readable storage medium for computer-readable storage. The computer-readable storage medium may be non-volatile or volatile. The computer-readable storage medium stores one or more programs, and the one or more programs can be executed by one or more processors to implement a disease analysis method, wherein the disease analysis method includes: obtaining electronic medical record data; Extract entity features from medical record data to obtain target condition information; use pre-trained condition recognition model to process target condition information to generate treatment reminders corresponding to target condition information; receive treatment feedback data from user terminals based on treatment reminder responses; Feedback data and target condition information to generate a diagnosis conclusion label; generate a condition analysis report based on the diagnosis conclusion label.
存储器作为一种非暂态计算机可读存储介质,可用于存储非暂态软件程序以及非暂态性计算机可执行程序。此外,存储器可以包括高速随机存取存储器,还可以包括非暂态存储器, 例如至少一个磁盘存储器件、闪存器件、或其他非暂态固态存储器件。在一些实施方式中,存储器可选包括相对于处理器远程设置的存储器,这些远程存储器可以通过网络连接至该处理器。上述网络的实例包括但不限于互联网、企业内部网、局域网、移动通信网及其组合。As a non-transitory computer-readable storage medium, memory can be used to store non-transitory software programs and non-transitory computer-executable programs. In addition, the memory may include high-speed random access memory, and may also include non-transitory memory, such as at least one magnetic disk storage device, flash memory device, or other non-transitory solid-state storage devices. In some embodiments, the memory optionally includes memory located remotely from the processor, and these remote memories may be connected to the processor via a network. Examples of the aforementioned networks include, but are not limited to, the Internet, intranets, local area networks, mobile communication networks, and combinations thereof.
本申请实施例描述的实施例是为了更加清楚的说明本申请实施例的技术方案,并不构成对于本申请实施例提供的技术方案的限定,本领域技术人员可知,随着技术的演变和新应用场景的出现,本申请实施例提供的技术方案对于类似的技术问题,同样适用。The embodiments described in the embodiments of the present application are to illustrate the technical solutions of the embodiments of the present application more clearly, and do not constitute a limitation to the technical solutions provided by the embodiments of the present application. Those skilled in the art know that with the evolution of technology and new For the emergence of application scenarios, the technical solutions provided by the embodiments of the present application are also applicable to similar technical problems.
本领域技术人员可以理解的是,图1-6中示出的技术方案并不构成对本申请实施例的限定,可以包括比图示更多或更少的步骤,或者组合某些步骤,或者不同的步骤。Those skilled in the art can understand that the technical solutions shown in Figures 1-6 do not constitute a limitation to the embodiments of the present application, and may include more or fewer steps than those shown in the illustrations, or combine certain steps, or be different A step of.
以上所描述的装置实施例仅仅是示意性的,其中作为分离部件说明的单元可以是或者也可以不是物理上分开的,即可以位于一个地方,或者也可以分布到多个网络单元上。可以根据实际的需要选择其中的部分或者全部模块来实现本实施例方案的目的。The device embodiments described above are only illustrative, and the units described as separate components may or may not be physically separated, that is, they may be located in one place, or may be distributed to multiple network units. Part or all of the modules can be selected according to actual needs to achieve the purpose of the solution of this embodiment.
本领域普通技术人员可以理解,上文中所公开方法中的全部或某些步骤、系统、设备中的功能模块/单元可以被实施为软件、固件、硬件及其适当的组合。Those of ordinary skill in the art can understand that all or some of the steps in the methods disclosed above, the functional modules/units in the system, and the device can be implemented as software, firmware, hardware, and an appropriate combination thereof.
另外,在本申请各个实施例中的各功能单元可以集成在一个处理单元中,也可以是各个单元单独物理存在,也可以两个或两个以上单元集成在一个单元中。上述集成的单元既可以采用硬件的形式实现,也可以采用软件功能单元的形式实现。In addition, each functional unit in each embodiment of the present application may be integrated into one processing unit, each unit may exist separately physically, or two or more units may be integrated into one unit. The above-mentioned integrated units can be implemented in the form of hardware or in the form of software functional units.
集成的单元如果以软件功能单元的形式实现并作为独立的产品销售或使用时,可以存储在一个计算机可读取存储介质中。基于这样的理解,本申请的技术方案本质上或者说对现有技术做出贡献的部分或者该技术方案的全部或部分可以以软件产品的形式体现出来,该计算机软件产品存储在一个存储介质中,包括多指令用以使得一台计算机设备(可以是个人计算机,服务器,或者网络设备等)执行本申请各个实施例的方法的全部或部分步骤。而前述的存储介质包括:U盘、移动硬盘、只读存储器(Read-Only Memory,简称ROM)、随机存取存储器(Random Access Memory,简称RAM)、磁碟或者光盘等各种可以存储程序的介质。If the integrated unit is realized in the form of a software function unit and sold or used as an independent product, it can be stored in a computer-readable storage medium. Based on this understanding, the technical solution of the present application is essentially or part of the contribution to the prior art or all or part of the technical solution can be embodied in the form of a software product, and the computer software product is stored in a storage medium , including multiple instructions to make a computer device (which may be a personal computer, a server, or a network device, etc.) execute all or part of the steps of the method in each embodiment of the present application. The aforementioned storage media include: U disk, mobile hard disk, read-only memory (Read-Only Memory, referred to as ROM), random access memory (Random Access Memory, referred to as RAM), magnetic disk or optical disc, etc., which can store programs. medium.
以上参照附图说明了本申请实施例的优选实施例,并非因此局限本申请实施例的权利范围。本领域技术人员不脱离本申请实施例的范围和实质内所作的任何修改、等同替换和改进,均应在本申请实施例的权利范围之内。The preferred embodiments of the embodiments of the present application have been described above with reference to the accompanying drawings, which does not limit the scope of rights of the embodiments of the present application. Any modifications, equivalent replacements and improvements made by those skilled in the art without departing from the scope and essence of the embodiments of the present application shall fall within the scope of rights of the embodiments of the present application.

Claims (20)

  1. 一种病情分析方法,其中,所述方法包括:A disease analysis method, wherein the method comprises:
    获取电子病历数据;Access to electronic medical record data;
    对所述电子病历数据进行实体特征提取,得到目标病情信息;Extracting entity features from the electronic medical record data to obtain target disease information;
    利用预先训练的病情识别模型对所述目标病情信息进行处理,生成与所述目标病情信息对应的治疗提醒;Processing the target condition information by using a pre-trained condition identification model to generate a treatment reminder corresponding to the target condition information;
    接收用户端根据所述治疗提醒响应的治疗反馈数据;receiving the treatment feedback data that the user terminal responds to according to the treatment reminder;
    根据所述治疗反馈数据和所述目标病情信息,生成诊断结论标签;generating a diagnosis conclusion label according to the treatment feedback data and the target condition information;
    根据所述诊断结论标签生成病情分析报告。A condition analysis report is generated according to the diagnostic conclusion label.
  2. 根据权利要求1所述的病情分析方法,其中,所述对所述电子病历数据进行实体特征提取,得到目标病情信息的步骤,包括:The disease analysis method according to claim 1, wherein the step of extracting entity features from the electronic medical record data to obtain target disease information includes:
    提取所述电子病历数据中的自然语言文本;Extracting natural language text in the electronic medical record data;
    利用预设的词法分析模型识别所述自然语言文本中的实体特征;Using a preset lexical analysis model to identify entity features in the natural language text;
    对所述自然语言文本进行分割处理,得到自然语言词汇序列;Segmenting the natural language text to obtain a natural language vocabulary sequence;
    根据所述实体特征和所述自然语言词汇序列,构建特征序列表;Constructing a feature sequence table according to the entity feature and the natural language vocabulary sequence;
    根据所述特征序列表确定目标病情信息。Determine the target condition information according to the feature sequence table.
  3. 根据权利要求1所述的病情分析方法,其中,所述利用预先训练的病情识别模型对所述目标病情信息进行处理,生成与所述目标病情信息对应的治疗提醒的步骤,包括:The disease analysis method according to claim 1, wherein the step of using a pre-trained disease identification model to process the target disease information and generate a treatment reminder corresponding to the target disease information includes:
    将所述目标病情信息输入至预先训练的病情识别模型中;Inputting the target condition information into the pre-trained condition identification model;
    通过所述病情识别模型对所述目标病情信息进行象限分区,得到分区病情信息;performing quadrant partitioning on the target disease information through the disease identification model to obtain partitioned disease information;
    对每一象限的分区病情信息进行拟合处理,生成用药提醒或者就医提醒。The condition information of each quadrant is fitted and processed to generate medication reminders or medical reminders.
  4. 根据权利要求1所述的病情分析方法,其中,在所述接收用户端根据所述治疗提醒响应的治疗反馈数据的步骤之后,所述方法还包括:The disease analysis method according to claim 1, wherein, after the step of receiving the treatment feedback data that the client responds to according to the treatment reminder, the method further comprises:
    对所述目标病情信息中的药物数据进行分析,得到药物反应信息;其中,所述药物反应信息包括不良药物反应信息;Analyzing the drug data in the target condition information to obtain drug reaction information; wherein the drug reaction information includes adverse drug reaction information;
    根据所述不良药物反应信息识别出所述治疗反馈数据中的异常用药数据。Identifying abnormal medication data in the treatment feedback data according to the adverse drug reaction information.
  5. 根据权利要求1至4任一项所述的病情分析方法,其中,在所述接收用户端根据所述治疗提醒响应的治疗反馈数据的步骤之后,所述方法还包括:The disease analysis method according to any one of claims 1 to 4, wherein, after the step of receiving the treatment feedback data that the client responds to according to the treatment reminder, the method further includes:
    对所述治疗反馈数据、所述目标病情信息进行可视化处理,生成可视化数据图。The treatment feedback data and the target condition information are visualized to generate a visualized data map.
  6. 根据权利要求1至4任一项所述的病情分析方法,其中,所述根据所述治疗反馈数据和所述目标病情信息,生成诊断结论标签的步骤,包括:The condition analysis method according to any one of claims 1 to 4, wherein the step of generating a diagnostic conclusion label according to the treatment feedback data and the target condition information includes:
    分别对所述治疗反馈数据和所述目标病情信息进行编码处理,得到编码形式的治疗反馈数据和编码形式的目标病情信息;Coding the treatment feedback data and the target condition information respectively to obtain the treatment feedback data in coded form and the target condition information in coded form;
    利用预设的大数据分析模型对所述编码形式的治疗反馈数据和所述编码形式的目标病情信息进行数据分析,生成诊断结论标签。Using a preset big data analysis model to perform data analysis on the coded treatment feedback data and the coded target condition information to generate a diagnosis conclusion label.
  7. 根据权利要求1至4任一项所述的病情分析方法,其中,所述根据所述诊断结论标签生成病情分析报告的步骤,包括:The disease analysis method according to any one of claims 1 to 4, wherein the step of generating a disease analysis report according to the diagnostic conclusion label includes:
    根据所述目标病情信息对所述诊断结论标签进行提纯处理,得到提纯后的诊断结论标签;Purifying the diagnostic conclusion label according to the target condition information to obtain the purified diagnostic conclusion label;
    对所述提纯后的诊断结论标签进行验证分析,得到标准诊断结论标签;Perform verification analysis on the purified diagnostic conclusion label to obtain a standard diagnostic conclusion label;
    根据所述标准诊断结论标签,生成病情分析报告。According to the standard diagnosis conclusion label, a condition analysis report is generated.
  8. 一种病情分析装置,其中,所述装置包括:A condition analysis device, wherein the device includes:
    电子病历数据获取模块,用于获取电子病历数据;Electronic medical record data acquisition module, used to obtain electronic medical record data;
    特征提取模块,用于对所述电子病历数据进行实体特征提取,得到目标病情信息;A feature extraction module, configured to extract entity features from the electronic medical record data to obtain target disease information;
    处理模块,用于利用预先训练的病情识别模型对所述目标病情信息进行处理,生成与所述目标病情信息对应的治疗提醒;A processing module, configured to use a pre-trained disease recognition model to process the target disease information, and generate a treatment reminder corresponding to the target disease information;
    治疗反馈数据接收模块,用于接收用户端根据所述治疗提醒响应的治疗反馈数据;The treatment feedback data receiving module is used to receive the treatment feedback data that the client responds to according to the treatment reminder;
    诊断结论标签生成模块,用于根据所述治疗反馈数据和所述目标病情信息,生成诊断结论标签;A diagnosis conclusion label generating module, configured to generate a diagnosis conclusion label according to the treatment feedback data and the target condition information;
    病情分析报告生成模块,用于根据所述诊断结论标签生成病情分析报告。A condition analysis report generating module, configured to generate a condition analysis report according to the diagnosis conclusion label.
  9. 一种电子设备,其中,所述电子设备包括存储器、处理器、存储在所述存储器上并可在所述处理器上运行的程序以及用于实现所述处理器和所述存储器之间的连接通信的数据总线,所述程序被所述处理器执行时实现如一种病情分析方法,其中,所述病情分析方法包括:An electronic device, wherein the electronic device includes a memory, a processor, a program stored on the memory and operable on the processor, and a program for realizing the connection between the processor and the memory A data bus for communication, when the program is executed by the processor, it is implemented as a disease analysis method, wherein the disease analysis method includes:
    获取电子病历数据;Access to electronic medical record data;
    对所述电子病历数据进行实体特征提取,得到目标病情信息;Extracting entity features from the electronic medical record data to obtain target disease information;
    利用预先训练的病情识别模型对所述目标病情信息进行处理,生成与所述目标病情信息对应的治疗提醒;Processing the target condition information by using a pre-trained condition identification model to generate a treatment reminder corresponding to the target condition information;
    接收用户端根据所述治疗提醒响应的治疗反馈数据;receiving the treatment feedback data that the user terminal responds to according to the treatment reminder;
    根据所述治疗反馈数据和所述目标病情信息,生成诊断结论标签;generating a diagnosis conclusion label according to the treatment feedback data and the target condition information;
    根据所述诊断结论标签生成病情分析报告。A condition analysis report is generated according to the diagnostic conclusion label.
  10. 根据权利要求9所述的电子设备,其中,所述对所述电子病历数据进行实体特征提取,得到目标病情信息的步骤,包括:The electronic device according to claim 9, wherein the step of extracting entity features from the electronic medical record data to obtain target disease information includes:
    提取所述电子病历数据中的自然语言文本;Extracting natural language text in the electronic medical record data;
    利用预设的词法分析模型识别所述自然语言文本中的实体特征;Using a preset lexical analysis model to identify entity features in the natural language text;
    对所述自然语言文本进行分割处理,得到自然语言词汇序列;Segmenting the natural language text to obtain a natural language vocabulary sequence;
    根据所述实体特征和所述自然语言词汇序列,构建特征序列表;Constructing a feature sequence table according to the entity feature and the natural language vocabulary sequence;
    根据所述特征序列表确定目标病情信息。Determine the target condition information according to the feature sequence table.
  11. 根据权利要求9所述的电子设备,其中,所述利用预先训练的病情识别模型对所述目标病情信息进行处理,生成与所述目标病情信息对应的治疗提醒的步骤,包括:The electronic device according to claim 9, wherein the step of using a pre-trained disease identification model to process the target disease information to generate a treatment reminder corresponding to the target disease information includes:
    将所述目标病情信息输入至预先训练的病情识别模型中;Inputting the target condition information into the pre-trained condition identification model;
    通过所述病情识别模型对所述目标病情信息进行象限分区,得到分区病情信息;performing quadrant partitioning on the target disease information through the disease identification model to obtain partitioned disease information;
    对每一象限的分区病情信息进行拟合处理,生成用药提醒或者就医提醒。The condition information of each quadrant is fitted and processed to generate medication reminders or medical reminders.
  12. 根据权利要求9所述的电子设备,其中,在所述接收用户端根据所述治疗提醒响应的治疗反馈数据的步骤之后,所述方法还包括:The electronic device according to claim 9, wherein, after the step of receiving the treatment feedback data that the user terminal responds to according to the treatment reminder, the method further comprises:
    对所述目标病情信息中的药物数据进行分析,得到药物反应信息;其中,所述药物反应信息包括不良药物反应信息;Analyzing the drug data in the target condition information to obtain drug reaction information; wherein the drug reaction information includes adverse drug reaction information;
    根据所述不良药物反应信息识别出所述治疗反馈数据中的异常用药数据。Identifying abnormal medication data in the treatment feedback data according to the adverse drug reaction information.
  13. 根据权利要求9至12任一项所述的电子设备,其中,在所述接收用户端根据所述治疗提醒响应的治疗反馈数据的步骤之后,所述方法还包括:The electronic device according to any one of claims 9 to 12, wherein, after the step of receiving the treatment feedback data that the user terminal responds to according to the treatment reminder, the method further comprises:
    对所述治疗反馈数据、所述目标病情信息进行可视化处理,生成可视化数据图。The treatment feedback data and the target condition information are visualized to generate a visualized data map.
  14. 根据权利要求9至12任一项所述的电子设备,其中,所述根据所述治疗反馈数据和所述目标病情信息,生成诊断结论标签的步骤,包括:The electronic device according to any one of claims 9 to 12, wherein the step of generating a diagnostic conclusion label according to the treatment feedback data and the target condition information includes:
    分别对所述治疗反馈数据和所述目标病情信息进行编码处理,得到编码形式的治疗反馈数据和编码形式的目标病情信息;Coding the treatment feedback data and the target condition information respectively to obtain the treatment feedback data in coded form and the target condition information in coded form;
    利用预设的大数据分析模型对所述编码形式的治疗反馈数据和所述编码形式的目标病情信息进行数据分析,生成诊断结论标签。Using a preset big data analysis model to perform data analysis on the coded treatment feedback data and the coded target condition information to generate a diagnosis conclusion label.
  15. 一种计算机可读存储介质,用于计算机可读存储,其中,所述计算机可读存储介质存储有一个或者多个程序,所述一个或者多个程序可被一个或者多个处理器执行,以实现一种病情分析方法,其中,所述病情分析方法包括:A computer-readable storage medium for computer-readable storage, wherein the computer-readable storage medium stores one or more programs, and the one or more programs can be executed by one or more processors to Realize a kind of condition analysis method, wherein, described condition analysis method comprises:
    获取电子病历数据;Access to electronic medical record data;
    对所述电子病历数据进行实体特征提取,得到目标病情信息;Extracting entity features from the electronic medical record data to obtain target disease information;
    利用预先训练的病情识别模型对所述目标病情信息进行处理,生成与所述目标病情信息 对应的治疗提醒;Processing the target condition information using a pre-trained condition recognition model to generate a treatment reminder corresponding to the target condition information;
    接收用户端根据所述治疗提醒响应的治疗反馈数据;receiving the treatment feedback data that the user terminal responds to according to the treatment reminder;
    根据所述治疗反馈数据和所述目标病情信息,生成诊断结论标签;generating a diagnosis conclusion label according to the treatment feedback data and the target condition information;
    根据所述诊断结论标签生成病情分析报告。A condition analysis report is generated according to the diagnostic conclusion label.
  16. 根据权利要求15所述的计算机可读存储介质,其中,所述对所述电子病历数据进行实体特征提取,得到目标病情信息的步骤,包括:The computer-readable storage medium according to claim 15, wherein the step of extracting entity features from the electronic medical record data to obtain target disease information includes:
    提取所述电子病历数据中的自然语言文本;Extracting natural language text in the electronic medical record data;
    利用预设的词法分析模型识别所述自然语言文本中的实体特征;Using a preset lexical analysis model to identify entity features in the natural language text;
    对所述自然语言文本进行分割处理,得到自然语言词汇序列;Segmenting the natural language text to obtain a natural language vocabulary sequence;
    根据所述实体特征和所述自然语言词汇序列,构建特征序列表;Constructing a feature sequence table according to the entity feature and the natural language vocabulary sequence;
    根据所述特征序列表确定目标病情信息。Determine the target condition information according to the feature sequence table.
  17. 根据权利要求15所述的计算机可读存储介质,其中,所述利用预先训练的病情识别模型对所述目标病情信息进行处理,生成与所述目标病情信息对应的治疗提醒的步骤,包括:The computer-readable storage medium according to claim 15, wherein the step of using a pre-trained disease recognition model to process the target disease information to generate a treatment reminder corresponding to the target disease information includes:
    将所述目标病情信息输入至预先训练的病情识别模型中;Inputting the target condition information into the pre-trained condition identification model;
    通过所述病情识别模型对所述目标病情信息进行象限分区,得到分区病情信息;performing quadrant partitioning on the target disease information through the disease identification model to obtain partitioned disease information;
    对每一象限的分区病情信息进行拟合处理,生成用药提醒或者就医提醒。The condition information of each quadrant is fitted and processed to generate medication reminders or medical reminders.
  18. 根据权利要求15所述的计算机可读存储介质,其中,在所述接收用户端根据所述治疗提醒响应的治疗反馈数据的步骤之后,所述方法还包括:The computer-readable storage medium according to claim 15, wherein, after the step of receiving the treatment feedback data from the user terminal according to the treatment reminder response, the method further comprises:
    对所述目标病情信息中的药物数据进行分析,得到药物反应信息;其中,所述药物反应信息包括不良药物反应信息;Analyzing the drug data in the target condition information to obtain drug reaction information; wherein the drug reaction information includes adverse drug reaction information;
    根据所述不良药物反应信息识别出所述治疗反馈数据中的异常用药数据。Identifying abnormal medication data in the treatment feedback data according to the adverse drug reaction information.
  19. 根据权利要求15至18任一项所述的计算机可读存储介质,其中,在所述接收用户端根据所述治疗提醒响应的治疗反馈数据的步骤之后,所述方法还包括:The computer-readable storage medium according to any one of claims 15 to 18, wherein, after the step of receiving the treatment feedback data that the user responds to according to the treatment reminder, the method further comprises:
    对所述治疗反馈数据、所述目标病情信息进行可视化处理,生成可视化数据图。The treatment feedback data and the target condition information are visualized to generate a visualized data map.
  20. 根据权利要求15至18任一项所述的计算机可读存储介质,其中,所述根据所述治疗反馈数据和所述目标病情信息,生成诊断结论标签的步骤,包括:The computer-readable storage medium according to any one of claims 15 to 18, wherein the step of generating a diagnostic conclusion label according to the treatment feedback data and the target condition information includes:
    分别对所述治疗反馈数据和所述目标病情信息进行编码处理,得到编码形式的治疗反馈数据和编码形式的目标病情信息;Coding the treatment feedback data and the target condition information respectively to obtain the treatment feedback data in coded form and the target condition information in coded form;
    利用预设的大数据分析模型对所述编码形式的治疗反馈数据和所述编码形式的目标病情信息进行数据分析,生成诊断结论标签。Using a preset big data analysis model to perform data analysis on the coded treatment feedback data and the coded target condition information to generate a diagnosis conclusion label.
PCT/CN2022/087710 2021-08-30 2022-04-19 Illness state analysis method and apparatus, electronic device, and storage medium WO2023029506A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202111007202.5 2021-08-30
CN202111007202.5A CN113707307A (en) 2021-08-30 2021-08-30 Disease analysis method and device, electronic equipment and storage medium

Publications (1)

Publication Number Publication Date
WO2023029506A1 true WO2023029506A1 (en) 2023-03-09

Family

ID=78657041

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2022/087710 WO2023029506A1 (en) 2021-08-30 2022-04-19 Illness state analysis method and apparatus, electronic device, and storage medium

Country Status (2)

Country Link
CN (1) CN113707307A (en)
WO (1) WO2023029506A1 (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116259422A (en) * 2023-03-13 2023-06-13 暨南大学 Virtual data enhancement-based ophthalmic disease diagnosis and treatment opinion generation method, system, medium and equipment
CN116343980A (en) * 2023-05-30 2023-06-27 深圳市即达健康医疗科技有限公司 Intelligent medical review follow-up data processing method and system

Families Citing this family (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113707307A (en) * 2021-08-30 2021-11-26 康键信息技术(深圳)有限公司 Disease analysis method and device, electronic equipment and storage medium
CN115132354B (en) * 2022-07-06 2023-05-30 哈尔滨医科大学 Patient type identification method and device, electronic equipment and storage medium
CN116386869B (en) * 2023-04-11 2024-01-19 中国科学技术大学 Disease critical degree assessment method based on multiple variables
CN116313162B (en) * 2023-05-12 2023-08-18 北京梆梆安全科技有限公司 Medical inquiry system based on AI model

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20110231205A1 (en) * 2010-03-19 2011-09-22 Letts Gary A Method and system for cutaneous medicine diagnostics
US8548828B1 (en) * 2012-05-09 2013-10-01 DermTap Method, process and system for disease management using machine learning process and electronic media
CN108899064A (en) * 2018-05-31 2018-11-27 平安医疗科技有限公司 Electronic health record generation method, device, computer equipment and storage medium
CN109493939A (en) * 2018-10-27 2019-03-19 平安医疗健康管理股份有限公司 A kind of Drug use administration method, server and computer storage medium
CN110675951A (en) * 2019-08-26 2020-01-10 北京百度网讯科技有限公司 Intelligent disease diagnosis method and device, computer equipment and readable medium
CN111933239A (en) * 2020-09-17 2020-11-13 泰康保险集团股份有限公司 Data processing method, device, system and storage medium
CN113707307A (en) * 2021-08-30 2021-11-26 康键信息技术(深圳)有限公司 Disease analysis method and device, electronic equipment and storage medium

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20110231205A1 (en) * 2010-03-19 2011-09-22 Letts Gary A Method and system for cutaneous medicine diagnostics
US8548828B1 (en) * 2012-05-09 2013-10-01 DermTap Method, process and system for disease management using machine learning process and electronic media
CN108899064A (en) * 2018-05-31 2018-11-27 平安医疗科技有限公司 Electronic health record generation method, device, computer equipment and storage medium
CN109493939A (en) * 2018-10-27 2019-03-19 平安医疗健康管理股份有限公司 A kind of Drug use administration method, server and computer storage medium
CN110675951A (en) * 2019-08-26 2020-01-10 北京百度网讯科技有限公司 Intelligent disease diagnosis method and device, computer equipment and readable medium
CN111933239A (en) * 2020-09-17 2020-11-13 泰康保险集团股份有限公司 Data processing method, device, system and storage medium
CN113707307A (en) * 2021-08-30 2021-11-26 康键信息技术(深圳)有限公司 Disease analysis method and device, electronic equipment and storage medium

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116259422A (en) * 2023-03-13 2023-06-13 暨南大学 Virtual data enhancement-based ophthalmic disease diagnosis and treatment opinion generation method, system, medium and equipment
CN116259422B (en) * 2023-03-13 2024-02-06 暨南大学 Virtual data enhancement-based ophthalmic disease diagnosis and treatment opinion generation method, system, medium and equipment
CN116343980A (en) * 2023-05-30 2023-06-27 深圳市即达健康医疗科技有限公司 Intelligent medical review follow-up data processing method and system
CN116343980B (en) * 2023-05-30 2023-08-29 深圳市即达健康医疗科技有限公司 Intelligent medical review follow-up data processing method and system

Also Published As

Publication number Publication date
CN113707307A (en) 2021-11-26

Similar Documents

Publication Publication Date Title
Babu et al. Sentiment analysis in social media data for depression detection using artificial intelligence: a review
WO2023029506A1 (en) Illness state analysis method and apparatus, electronic device, and storage medium
CN111708873B (en) Intelligent question-answering method, intelligent question-answering device, computer equipment and storage medium
CN112131366B (en) Method, device and storage medium for training text classification model and text classification
CN112131393B (en) Medical knowledge graph question-answering system construction method based on BERT and similarity algorithm
US10331659B2 (en) Automatic detection and cleansing of erroneous concepts in an aggregated knowledge base
CN113505243A (en) Intelligent question-answering method and device based on medical knowledge graph
CN113704428B (en) Intelligent inquiry method, intelligent inquiry device, electronic equipment and storage medium
US20200410050A1 (en) Deep learning approach to computing spans
US20230315994A1 (en) Natural Language Processing for Addressing Bias
CN115858886B (en) Data processing method, device, equipment and readable storage medium
Liu et al. A genetic algorithm enabled ensemble for unsupervised medical term extraction from clinical letters
CN116383430A (en) Knowledge graph construction method, device, equipment and storage medium
CN113657086B (en) Word processing method, device, equipment and storage medium
Mulwad Tabel–a domain independent and extensible framework for inferring the semantics of tables
US11281855B1 (en) Reinforcement learning approach to decode sentence ambiguity
CN116956934A (en) Task processing method, device, equipment and storage medium
Bandhu et al. Health care chatbot using natural language processing with SGD and ADAM optimizer parameter optimization
CN116030271A (en) Depression emotion prediction system based on deep learning and bimodal data
CN111339252B (en) Searching method, searching device and storage medium
CN114360715A (en) Constitution identification method and device, electronic equipment and storage medium
US11182552B2 (en) Routine evaluation of accuracy of a factoid pipeline and staleness of associated training data
Maktapwong et al. Message Classification for Breast Cancer Chatbot Using Bidirectional LSTM
Chowdhury et al. Improving medical nli using context-aware domain knowledge
Egleston et al. Statistical inference for natural language processing algorithms with a demonstration using type 2 diabetes prediction from electronic health record notes

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 22862652

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE