WO2022217713A1 - Syndrome monitoring and early warning method and apparatus, computer device, and storage medium - Google Patents

Syndrome monitoring and early warning method and apparatus, computer device, and storage medium Download PDF

Info

Publication number
WO2022217713A1
WO2022217713A1 PCT/CN2021/097415 CN2021097415W WO2022217713A1 WO 2022217713 A1 WO2022217713 A1 WO 2022217713A1 CN 2021097415 W CN2021097415 W CN 2021097415W WO 2022217713 A1 WO2022217713 A1 WO 2022217713A1
Authority
WO
WIPO (PCT)
Prior art keywords
syndrome
target
symptom
keyword
network
Prior art date
Application number
PCT/CN2021/097415
Other languages
French (fr)
Chinese (zh)
Inventor
唐蕊
Original Assignee
平安科技(深圳)有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 平安科技(深圳)有限公司 filed Critical 平安科技(深圳)有限公司
Publication of WO2022217713A1 publication Critical patent/WO2022217713A1/en

Links

Images

Classifications

    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16HHEALTHCARE INFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR THE HANDLING OR PROCESSING OF MEDICAL OR HEALTHCARE DATA
    • G16H50/00ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics
    • G16H50/80ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics for detecting, monitoring or modelling epidemics or pandemics, e.g. flu
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02ATECHNOLOGIES FOR ADAPTATION TO CLIMATE CHANGE
    • Y02A90/00Technologies having an indirect contribution to adaptation to climate change
    • Y02A90/10Information and communication technologies [ICT] supporting adaptation to climate change, e.g. for weather forecasting or climate simulation

Definitions

  • the present application relates to the field of computer technology, and in particular, to a method, device, computer equipment and storage medium for monitoring and early warning of symptoms.
  • Public health is a public undertaking related to the public health of the people in a country or a region.
  • One of the important tasks of public health is the prevention, monitoring and treatment of major diseases, especially infectious diseases (such as tuberculosis, AIDS, SARS, new coronary pneumonia, etc.).
  • syndrome monitoring and early warning is a kind of public health abnormal event monitoring and early warning mechanism ahead of infectious disease monitoring and early warning. Syndrome monitoring and early warning can actively monitor the budding state of public health events and give early warnings, providing a basis for public health personnel to take effective prevention and control strategies.
  • Syndrome monitoring can detect abnormalities or characteristics of disease outbreak signals in time. Syndrome early warning based on syndrome monitoring Through the capture of abnormal syndromes, early warning signals can be issued in advance of specific disease warnings.
  • the present application provides a syndrome monitoring and early warning method, device, computer equipment and storage medium, which can efficiently and effectively provide a syndrome early warning risk level through an intelligent method, so as to provide a basis for public health personnel to take effective prevention and treatment strategies.
  • the present application provides a method for monitoring and early warning of symptoms, the method comprising:
  • the early warning information of the target syndrome is released.
  • the present application provides a syndrome monitoring and early warning device, including:
  • the symptom extraction module is used to extract symptom keywords from the electronic medical record of the patient;
  • the symptom matching module is used to judge whether the patient belongs to a case of one of the target syndromes according to the matching situation of the symptom keywords and the keywords of several preset syndromes;
  • a target syndrome determination module configured to determine the syndrome as the target syndrome if it is determined that the patient belongs to one of the syndrome cases
  • a case statistics module used to obtain the number of cases of the target syndrome within a preset time unit
  • the index calculation module is used to calculate some statistical indexes of the target syndrome according to the number of cases of the target syndrome in the current preset time unit;
  • a risk prediction module for inputting several statistical indicators of the target syndrome into a risk prediction model to obtain the risk level of the target syndrome
  • An early warning module configured to issue early warning information of the target syndrome according to the risk level of the target syndrome.
  • the present application provides a computer device, the computer device includes a memory and a processor; the memory is used for storing a computer program; the processor is used for executing the computer program and executing the computer During the program, the above-mentioned symptom monitoring and early warning method is realized.
  • the present application provides a computer-readable storage medium, where a computer program is stored in the computer-readable storage medium, and if the computer program is executed by a processor, the above-mentioned method for monitoring and early warning of a syndrome is implemented.
  • the present application discloses a syndrome monitoring method, device, computer equipment and storage medium.
  • symptom keywords from an electronic medical record of a patient; according to the matching situation of the symptom keywords and the keywords of a target syndrome, the patient is judged. Whether it belongs to the case of the target syndrome; count the number of cases of the target syndrome according to the current preset time unit; calculate some statistical indicators of the target syndrome according to the number of cases of the target syndrome; input the statistical indicators into the risk prediction model,
  • artificial intelligence-based syndrome monitoring and early warning can be realized, and the efficiency and accuracy of syndrome monitoring and early warning can be improved.
  • FIG. 1 is a schematic flowchart of a syndrome monitoring and early warning method provided by an embodiment of the present application
  • FIG. 2 is a schematic block diagram of the structure of a syndrome monitoring and early warning device provided by an embodiment of the present application
  • FIG. 3 is a schematic structural block diagram of a computer device provided by an embodiment of the present application.
  • Embodiments of the present application provide a syndrome monitoring and early warning method, apparatus, computer device, and computer-readable storage medium. It is used to improve the efficiency and accuracy of syndrome monitoring and early warning based on artificial intelligence technology. Exemplarily, in the monitoring and early warning of the syndrome, if the historical data is analyzed and then the risk level is judged according to the manually set threshold, it requires a lot of manpower, and the early warning effect is limited by the threshold set based on experience.
  • the early warning risk level is efficiently provided by the method based on artificial intelligence, and the problem of false positives or false negatives caused by "one size fits all" according to the threshold value in the traditional method when the risk level is judged by relying on the manual setting of the threshold value is avoided, Improves the accuracy of risk level predictions.
  • the syndrome monitoring and early warning method can be applied to a server, and of course can also be applied to a terminal, wherein the terminal can be an electronic device such as a mobile phone, a tablet computer, a notebook computer, a desktop computer, etc.; the server can be, for example, a separate server or a server cluster.
  • the terminal can be an electronic device such as a mobile phone, a tablet computer, a notebook computer, a desktop computer, etc.
  • the server can be, for example, a separate server or a server cluster.
  • the following embodiments will be described in detail with a method for monitoring and early warning of symptoms applied to a server.
  • FIG. 1 is a schematic flowchart of a method for monitoring and early warning of a syndrome provided by an embodiment of the present application.
  • the syndrome monitoring and early warning method may include the following steps S110-S170.
  • Step S110 extracting symptom keywords from the electronic medical record of the patient.
  • the symptom keywords may include symptoms and related attributes of the symptoms, and the related attributes may be the time, duration, and severity of the symptoms.
  • the corresponding symptom keyword is "fever for three days", where "three days” is the manifestation of symptoms, and "three days” is one of the related attributes of symptoms.
  • the composition of the symptom keyword can be determined according to the composition of the keyword of the target syndrome. For example, the keyword of the target syndrome only has symptoms, then the symptom keyword also only retains symptoms such as "fever”. It is sufficient to express. If the keyword of the target syndrome includes the manifestation of the symptom and the related attributes of the symptom, the symptom keyword also includes the manifestation of the symptom and the related attribute of the symptom.
  • step S110 specifically includes steps S111-S112:
  • Step S111 through named entity recognition, extract the initial keywords of the symptoms of the patient from the text of the electronic medical record;
  • Named Entity Recognition is a natural language processing technology that recognizes entities with specific meanings in text. Through named entity recognition, the desired entity type can be extracted from the text. For example, the symptom of the patient is the type of entity that is expected to be obtained. The tool developed through the named entity recognition technology can extract specific symptoms such as "fever” and "nausea" in the sentence from the electronic medical record.
  • Step S112 based on a preset standardized vocabulary, replace the alias vocabulary in the symptom initial keyword with a standard vocabulary to obtain the symptom keyword.
  • nausea is an alias word
  • symptom keyword obtained by replacing the standard word is "diarrhea”.
  • the standardized vocabulary can be formed by collecting and arranging symptom-related alias words and standard words.
  • structured data also known as row data
  • row data is data that is logically expressed and implemented by a two-dimensional table structure. It strictly follows the data format and length specifications, and is mainly stored and managed through relational databases. Data management can be facilitated through structuring. , use, display.
  • Step S120 according to the matching situation of the symptom keyword and the keywords of several preset syndrome groups, determine whether the patient belongs to a case of one of the syndrome groups.
  • Syndrome refers to multiple clinical symptoms that often appear at the same time when certain possible diseases appear. For example, as shown in Table 1, there are five preset syndromes and keywords corresponding to their symptoms in an embodiment. During specific implementation, new syndromes can be added or defined as needed.
  • step S120 specifically includes an exact matching step and/or a fuzzy matching step.
  • the exact matching step includes:
  • a number of preset syndromes include the first syndrome, the keyword of the first syndrome is ⁇ A, B, C, D ⁇ , the symptom keyword of the patient A is ⁇ A, B, C, D ⁇ , and the patient B's symptom keywords are ⁇ C,D,E,F,G ⁇ .
  • the fuzzy matching step includes steps S122a-step S122e:
  • Step S122a construct a multi-hot encoding of the symptom keyword and a multi-hot encoding of the keyword of a certain syndrome according to the symptom keyword and the keyword of a certain syndrome.
  • the symptom keywords of patient B are ⁇ C, D, E, F, G ⁇
  • the keywords of the first syndrome are ⁇ A, B, C, D ⁇
  • the symptom keywords of patient B correspond to the multiple fever
  • the encoding is ⁇ 0, 0, 1, 1, 1, 1, 1 ⁇
  • the elements in the multi-hot encoding respectively represent the absence of keyword A, the absence of keyword B, the presence of keyword C...
  • the key of the first syndrome The corresponding multi-hot encoding of words is ⁇ 1, 1, 1, 1, 0, 0, 0 ⁇ .
  • Step S122b Input the multi-hot encoding of the symptom keyword into an autoencoder, and obtain an autoencoder vector output by the autoencoder.
  • the method further includes a training step S210 of acquiring the autoencoder, specifically including steps S211-S215.
  • Step S211 Acquire training data, where the training data includes multi-hot encoding.
  • the multi-hot encoding of the symptom keyword or the multi-hot encoding of the keyword of a certain syndrome may be used as the training data, or the multi-hot encoding as the training data may be obtained by random generation.
  • Step S212 Obtain an autoencoder to be trained, where the autoencoder includes a first network and a second network.
  • the structures of the first network and the second network are symmetrical; both the first network and the second network include multiple fully connected layers.
  • a fully-connected layer that is, each node of this layer is connected to all nodes of the previous layer, and the nonlinear expression capability of the autoencoder can be improved by using multiple fully-connected layers.
  • Step S213 Input the multi-hot encoding in the training data into the first network to obtain an auto-encoding vector output by the first network, where the dimension of the self-encoding vector is lower than the dimension of the multi-hot encoding.
  • the first network constitutes an encoder to compress the high-dimensional sparse multi-hot encoding into the low-dimensional dense self-encoding vector.
  • Step S214 inputting the self-encoding vector output by the first network into the second network to obtain a vector output by the second network, and the dimension of the vector output by the second network is equal to the dimension of the multi-hot encoding;
  • the second network constitutes a decoder that reconstructs the low-dimensional dense self-encoding vector into a high-dimensional sparse multi-hot encoding.
  • Step S215 Train the first network and the second network according to the vector output by the second network and the multi-hot encoding.
  • the network parameters of the first network and the second network are adjusted, so that the vector output by the second network and the Multi-hot encoding tends to be consistent.
  • the multi-hot encoding is input into the autoencoder, and the autoencoder vector is obtained at the output of the first network.
  • Step S122c according to the multi-hot encoding of the keyword of a certain syndrome, obtain the self-encoding vector of the keyword of the certain syndrome by the autoencoder;
  • the multi-hot encoding of the keywords of the certain syndrome is input into the autoencoder, and the autoencoder vector of the certain syndrome is obtained at the output of the first network.
  • Step S122d calculating the similarity of the self-encoding vector of the symptom keyword and the self-encoding vector of the keyword of a certain syndrome
  • the similarity is calculated by cosine similarity, and the calculation formula of cosine similarity is:
  • X is the self-encoding vector of the symptom keyword
  • Y is the self-encoding vector of the keyword of the certain syndrome
  • n is the dimension of the self-encoding vector
  • the similarity may also be calculated by methods such as Euclidean distance, Manhattan distance, and Minkowski distance.
  • Step S122e judging whether the patient belongs to a case of the certain syndrome according to the similarity.
  • a similarity threshold is preset, for example, the similarity threshold is set to 0.8. If the similarity exceeds the similarity threshold, the corresponding patient is a case of the certain syndrome; otherwise, the corresponding patient is not a case of the certain syndrome.
  • the precise matching step and the fuzzy matching step are used in combination, for example, the precise matching step is performed first, and it is not possible to determine whether the patient belongs to the certain syndrome after the precise matching step. , and then perform the fuzzy matching step to further determine whether the patient belongs to the case of the certain syndrome.
  • Step S130 if it is determined that the patient belongs to a case of one of the syndromes, determine that the syndrome is the target syndrome.
  • the electronic medical records are processed one by one through steps S110 to S130 to obtain all the target syndromes; if after all the electronic medical records are processed, the number of the target syndromes is still 0, it means that there is currently no forecast. No risk warning is required for cases of the designated syndromes.
  • Step S140 Acquire the number of cases of the target syndrome within a preset time unit.
  • the preset time unit is days, and the number of cases of the target syndrome in days is obtained.
  • the number of cases of the target syndrome may be counted during the execution of step S120 or step S130. For example, if it is determined that a patient belongs to one of the syndrome cases, the number of cases of the syndrome corresponding to the visit date is increased by one.
  • Step S150 Calculate several statistical indexes of the target syndrome according to the number of cases of the target syndrome within the current preset time unit.
  • the number of historical cases of the target syndrome in the current preset time unit calculate several statistical indicators of the target syndrome, and the statistical indicators include the chain growth rate, One or more of year-over-year growth rate, historical percentile.
  • the chain growth rate represents the change ratio of the statistics in two consecutive statistical periods. During the specific implementation, the chain ratio corresponding to multiple statistical periods can be calculated.
  • Week-to-month growth rate calculated for the number of cases of the target syndrome described on 10th August 2020, based on the number of cases of the target syndrome described in the period from 18th August 2020 to 17th September 2020 and the The number of cases of the target syndrome mentioned on the 17th is calculated to calculate the month-on-month growth rate; the year-on-year growth rate generally refers to the change ratio of the statistics in a certain statistical period this year relative to the amount in the same period last year. Similarly, the corresponding statistics can be calculated.
  • the number of cases of the target syndrome from January 1, 2019 to December 31, 2019 is used as historical reference data.
  • the number of cases of the target syndrome on the 17th and the number of cases of the target syndrome from January 1, 2019 to December 31, 2019 Calculation date-historical percentile, based on September 11, 2020 to September 2020
  • the number of cases of the target syndrome described on the 17th and the number of cases of the target syndrome described in January 1, 2019 to December 31, 2019 Calculation week-historical percentile, based on August 18, 2020 to September 2020 Month-historical percentiles were calculated for the number of cases of the target syndrome on the 17th and the number of cases of the target syndrome from January 1, 2019 to December 31, 2019.
  • the statistical indicators adopted in this method are not limited to the above-mentioned ones.
  • the growth rate of the number of cases relative to the number of cases of the target syndrome described on September 14, 2020, the number of cases of the target syndrome described on September 15, 2020, and other statistical indicators that can reflect changes in data such as the fixed-base ratio can also be used in this method.
  • the number of cases of the target syndrome is used as a statistic to calculate the statistical index.
  • the data obtained according to the number of cases of the target syndrome can be used as a statistic to calculate the statistical index.
  • the statistical index is calculated by taking the ratio of the number of cases of the target syndrome to the patient as a statistic.
  • Step S160 Input several statistical indicators of the target syndrome into a risk prediction model to obtain a predicted risk level.
  • the statistical indicators and risk factors are input into a risk prediction model to obtain the risk level of the target syndrome
  • the risk factors include weather data and/or environmental data.
  • the weather data includes the current temperature, humidity, and whether it is a sunny, cloudy or rainy day
  • the environmental data includes the current air quality, water quality level, and the like.
  • the risk prediction model can be trained by the following methods: respectively inputting the statistical indicators and the risk factors corresponding to a certain time in the past into the initial model, the initial model in this embodiment is the XGboost model, and the specific implementation can also be Use feedforward neural network, support vector machine and other machine learning models; use the risk level at a certain time as the label of the statistical index, and use the label as the expected output of the model to train the model.
  • the risk level at a certain time in the past if there are corresponding query channels, you can directly query through these channels.
  • the target syndrome is fever respiratory syndrome, and the society has the corresponding risk level of the fever respiratory tract announced, you can use these channels to query.
  • the target syndrome is febrile respiratory syndrome
  • the main disease related to febrile respiratory syndrome is the new crown epidemic
  • the fever respiratory tract can be obtained according to the risk level of the new crown epidemic at that time. The risk level of the syndrome at that time.
  • the risk prediction model is explained by an explanation model (SHAP, SHapley Additive exPlanations).
  • HTP HyperText Transfer Protocol
  • SHapley Additive exPlanations the impact of the statistical indicators in the input of the risk model or the risk factors in obtaining the risk prediction result can be evaluated, so as to know which data has an important impact on the risk level, Subsequent focus on data with significant impact improves the usability of the risk prediction model.
  • Step S170 Publish early warning information of the target syndrome according to the risk level of the target syndrome.
  • the risk level of the target syndrome includes high, medium and low, and the target syndrome is febrile respiratory syndrome.
  • the early warning information of the target syndrome to be released includes: high risk of fever and respiratory syndrome, heightened vigilance, and implementation of high-level prevention and control measures; if the risk level of the target syndrome is medium, all published
  • the early warning information of the target syndrome includes: the risk of febrile respiratory syndrome, pay close attention to the situation of the disease, and implement general prevention and control measures; if the risk level of the target syndrome is low, the published early warning information of the target syndrome includes: fever respiratory syndrome low risk.
  • the present application predicts the risk level by the method based on the risk prediction model, which avoids the defects of labor-intensive and low efficiency in the manual analysis.
  • the risk level is obtained by the risk prediction model through nonlinear calculation, which avoids judging the risk level by the threshold value. It is easy to cause "one size fits all", which improves the accuracy of risk level prediction.
  • the above symptom keywords, statistical indicators, the risk prediction model and risk levels can also be stored in one area. in the nodes of the blockchain.
  • the blockchain referred to in this application is a new application mode of computer technologies such as distributed data storage, point-to-point transmission, consensus mechanism, and encryption algorithm.
  • Blockchain essentially a decentralized database, is a series of data blocks associated with cryptographic methods. Each data block contains a batch of network transaction information to verify its Validity of information (anti-counterfeiting) and generation of the next block.
  • the blockchain can include the underlying platform of the blockchain, the platform product service layer, and the application service layer.
  • the syndrome monitoring and early warning device includes: a symptom extraction module 110 , a symptom matching module 120 , a target syndrome determination module 130 , a case statistics module 140 , an index calculation module 150 , a risk prediction module 160 and an early warning module 170 .
  • the symptom extraction module 110 is used for extracting symptom keywords from the electronic medical record of the patient;
  • the symptom extraction module 110 includes an initial keyword extraction module and a normalization module.
  • an initial keyword extraction module used for extracting the initial keywords of the symptoms of the patient from the text of the electronic medical record through named entity recognition
  • the standardization module is configured to replace the alias words in the symptom initial keywords with standard words based on a preset tagging vocabulary, so as to obtain the symptom keywords.
  • the standardized vocabulary can be formed by collecting and arranging symptom-related alias words and standard words.
  • structured data also known as row data
  • row data is data that is logically expressed and implemented by a two-dimensional table structure. It strictly follows the data format and length specifications, and is mainly stored and managed through relational databases. Data management can be facilitated through structuring. , use, display.
  • the symptom matching module 120 is configured to judge whether the patient belongs to a case of one of the syndromes according to the matching situation of the symptom keyword and the keywords of several preset syndromes.
  • Exemplary symptom matching modules 120 include exact matching modules and fuzzy matching modules.
  • an exact matching module is used to compare the symptom keywords with each keyword of each syndrome, and if all the symptom keywords have corresponding keywords in one of the syndromes, determine the The patient belongs to the case of one of the syndromes described.
  • the fuzzy matching module includes a multi-hot encoding building module, a symptom keyword encoding module, a target syndrome keyword encoding module, a similarity calculation module, and a case judgment module.
  • a multi-hot encoding building module is configured to construct a multi-hot encoding of the symptom keyword and a multi-hot encoding of the keyword of a certain syndrome according to the symptom keyword and the keyword of a certain syndrome.
  • the symptom keyword encoding module is configured to input the multi-hot encoding of the symptom keyword into the autoencoder, and obtain the autoencoder vector output by the autoencoder.
  • the syndrome keyword encoding module is configured to obtain the autoencoding vector of the keyword of the certain syndrome through the autoencoder according to the multi-hot encoding of the keyword of the certain syndrome.
  • the similarity calculation module is used to calculate the self-encoding vector of the symptom keyword and the similarity of the keyword of a certain syndrome.
  • the similarity is calculated by cosine similarity, and the calculation formula of cosine similarity is:
  • X is the self-encoding vector of the symptom keyword
  • Y is the self-encoding vector of the keyword of the certain syndrome
  • n is the dimension of the self-encoding vector
  • a case judgment module configured to judge whether the patient belongs to a case of the certain syndrome group according to the similarity.
  • a similarity threshold is preset, for example, the similarity threshold is set to 0.8. If the similarity exceeds the similarity threshold, the corresponding patient is a case of the certain syndrome; otherwise, the corresponding patient is not a case of the certain syndrome.
  • the apparatus further includes an autoencoder training module
  • the autoencoder training module includes a training data acquisition module, an autoencoder initial module, a compression module, a decompression module, and an autoencoder training module.
  • a training data acquisition module for acquiring training data, the training data including multi-hot encoding
  • the autoencoder initial module is used to obtain the autoencoder to be trained, and the autoencoder includes a first network and a second network;
  • a compression module configured to input the multi-hot encoding in the training data into the first network to obtain an auto-encoding vector output by the first network, the dimension of the self-encoding vector is lower than the dimension of the multi-hot encoding ;
  • a decompression module configured to input the self-encoding vector output by the first network into the second network, to obtain a vector output by the second network, the dimension of the vector output by the second network is equal to the multi-hot encoding dimension;
  • An autoencoder training module configured to train the first network and the second network according to the vector output by the second network and the multi-hot encoding.
  • the target syndrome determination module 130 is configured to determine the syndrome as the target syndrome if it is determined that the patient belongs to one of the syndrome cases.
  • a case statistics module 140 configured to acquire the number of cases of the target syndrome within a preset time unit
  • the index calculation module 150 is configured to calculate several statistical indexes of the target syndrome according to the number of cases of the target syndrome in a preset time unit;
  • the number of cases of the target syndrome in the current preset time unit calculate several statistical indicators of the target syndrome, and the statistical indicators include a chain growth rate, a year-on-year growth rate, One or more of growth rate, historical percentile.
  • the risk prediction module 160 is configured to input several statistical indicators of the target syndrome into a risk prediction model to obtain the risk level of the target syndrome.
  • the statistical indicators and risk factors are input into a risk prediction model to obtain the risk level of the target syndrome
  • the risk factors include weather data and/or environmental data.
  • the weather data includes the current temperature, humidity, and whether it is a sunny, cloudy or rainy day
  • the environmental data includes the current air quality, water quality level, and the like.
  • the risk prediction model is explained by an explanation model (SHAP, SHapley Additive exPlanations).
  • HTP HyperText Transfer Protocol
  • SHapley Additive exPlanations the impact of the statistical indicators in the input of the risk model or the risk factors in obtaining the risk prediction result can be evaluated, so as to know which data has an important impact on the risk level, Subsequent focus on data with significant impact improves the usability of the risk prediction model.
  • the early warning module 170 is configured to issue early warning information of the target syndrome according to the risk level.
  • the methods and apparatus of the present application may be used in numerous general purpose or special purpose computing system environments or configurations.
  • the above-mentioned method and apparatus can be implemented in the form of a computer program, and the computer program can be executed on a computer device as shown in FIG. 3 .
  • FIG. 3 is a schematic diagram of a computer device provided by an embodiment of the present application.
  • the computer device can be a server or a terminal.
  • the computer device includes a processor, a memory, and a network interface connected through a system bus, wherein the memory may include a non-volatile storage medium and an internal memory.
  • the nonvolatile storage medium can store operating systems and computer programs.
  • the computer program includes program instructions, which, when executed, can cause the processor to execute any one of the syndrome monitoring and early warning methods.
  • the processor is used to provide computing and control capabilities to support the operation of the entire computer equipment.
  • the internal memory provides an environment for running the computer program in the non-volatile storage medium.
  • the processor can execute any one of the syndrome monitoring and early warning methods.
  • the network interface is used for network communication, such as sending assigned tasks.
  • the structure of the computer device is only a block diagram of a part of the structure related to the solution of the present application, and does not constitute a limitation on the computer device to which the solution of the present application is applied. More or fewer components are shown in the figures, either in combination or with different arrangements of components.
  • the processor may be a central processing unit (Central Processing Unit, CPU), and the processor may also be other general-purpose processors, digital signal processors (Digital Signal Processors, DSP), application specific integrated circuits (Application Specific Integrated circuits) Circuit, ASIC), Field-Programmable Gate Array (FPGA) or other programmable logic devices, discrete gate or transistor logic devices, discrete hardware components, etc.
  • the general-purpose processor can be a microprocessor or the processor can also be any conventional processor or the like.
  • the processor is configured to run a computer program stored in the memory, so as to realize the following steps: extracting symptom keywords from the electronic medical record of the patient; If it is determined that the patient belongs to a case of one of the syndrome groups according to the matching of the keywords of the syndrome, it is determined whether the patient belongs to a case of one of the syndrome groups, and the syndrome is determined as the target syndrome; number of cases; according to the number of cases of the target syndrome in the current preset time unit, calculate some statistical indicators of the target syndrome; input some statistical indicators of the target syndrome into a risk prediction model to obtain the risk level of the target syndrome ; According to the risk level of the target syndrome, the early warning information of the target syndrome is released.
  • the processor is configured to extract symptom keywords from an electronic medical record of a patient, to achieve: extracting initial keywords of symptoms of a patient from the text of the medical record through named entity recognition; based on a preset standardized vocabulary , and replace the alias words in the symptom initial keywords with standard words to obtain the symptom keywords.
  • the processor is configured to realize that when judging whether the patient belongs to a case of the target syndrome according to the matching situation of the symptom keyword and the keywords of a number of preset syndromes, realize: compare the symptom keyword with each syndrome Each keyword is compared, and if all the symptom keywords have corresponding keywords in one of the syndromes, it is determined that the patient belongs to the case of the one of the syndromes; or/and realize: according to the Symptom keywords and keywords of a certain syndrome, construct the multi-hot encoding of the symptom keywords and the multi-hot encoding of the keywords of a certain syndrome; input the multi-hot encoding of the symptom keywords into the self-encoder, Obtain the self-encoding vector of the symptom keyword output by the autoencoder; according to the multi-hot encoding of the keyword of the certain syndrome, obtain the self-encoding of the keyword of the certain syndrome through the autoencoder vector; calculate the similarity between the self-encoding vector of the symptom keyword and the self-encoding vector of the
  • the processor when configured to acquire the autoencoder, it is implemented: acquiring training data, where the training data includes multi-hot encoding; acquiring an autoencoder to be trained, where the autoencoder includes a first network and a second network; input the multi-hot encoding in the training data into the first network, and obtain the self-encoding vector output by the first network, the dimension of the self-encoding vector is lower than the dimension of the multi-hot encoding; The self-encoding vector output by the first network is input into the second network to obtain a vector output by the second network, and the dimension of the vector output by the second network is equal to the dimension of the multi-hot encoding; according to the second network A vector of network outputs and the multi-hot encoding to train the first network and the second network.
  • the processor is used for the number of cases of the target syndrome in the current preset time unit, and when calculating several statistical indicators of the target syndrome, it is realized: according to the number of cases of the target syndrome in the current preset time unit, and The number of historical cases of the target syndrome is calculated, and several historical statistical indicators of the target syndrome are calculated, and the statistical indicators include one or more of a month-on-month growth rate, a year-on-year growth rate, and a historical percentile.
  • the processor when configured to input several statistical indicators of the target syndrome into a risk prediction model to obtain a predicted risk level, it is achieved: inputting the statistical indicators and risk factors into a risk prediction model to obtain the target syndrome.
  • the risk factors include weather data and/or environmental data.
  • the weather data includes the current temperature, humidity, and whether it is a sunny, cloudy or rainy day
  • the environmental data includes the current air quality, water quality level, and the like.
  • a computer-readable storage medium where the computer-readable storage medium stores a computer program, the computer program includes program instructions, and the processor executes the program instructions to implement any one of the syndromes provided in the embodiments of the present application Monitoring and early warning methods.
  • the computer-readable storage medium may be an internal storage unit of the computer device described in the foregoing embodiments, such as a hard disk or a memory of the computer device.
  • the computer-readable storage medium may also be an external storage device of the computer device, such as a plug-in hard disk equipped on the computer device, a smart memory card (Smart Media Card, SMC), a secure digital (Secure Digital, SD) ) card, Flash Card, etc.
  • the computer-readable storage medium may be non-volatile or volatile.

Landscapes

  • Public Health (AREA)
  • Health & Medical Sciences (AREA)
  • Medical Informatics (AREA)
  • Engineering & Computer Science (AREA)
  • Biomedical Technology (AREA)
  • Databases & Information Systems (AREA)
  • Data Mining & Analysis (AREA)
  • Pathology (AREA)
  • Epidemiology (AREA)
  • General Health & Medical Sciences (AREA)
  • Primary Health Care (AREA)
  • Measuring And Recording Apparatus For Diagnosis (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)
  • Medical Treatment And Welfare Office Work (AREA)

Abstract

A syndrome monitoring and early warning method and apparatus, a computer device, and a readable storage medium, which relate to smart decision-making technology, and relate in particular to a prediction model. The method comprises: extracting symptom keywords from an electronic medical record of a patient (S110); according to the matching situation of the symptom keywords and keywords of a plurality of preset syndromes, determining whether the patient belongs to one of the syndrome cases therein (S120); if determined that the patient belongs to one of the syndrome cases therein, determining the syndrome as a target syndrome (S130); acquiring the number of cases of the target syndrome within a preset time unit (S140); according to the number of cases of the target syndrome within a current preset time unit, calculating a plurality of statistical indexes of the target syndrome (S150); inputting the plurality of statistical indexes of the target syndrome into a risk prediction model, so as to acquire a risk level of the target syndrome (S160); and issuing early warning information of the target syndrome according to the risk level of the target syndrome (S170). The present invention further relates to blockchain technology, and the obtained risk level may be stored in a blockchain.

Description

症候群监测预警方法、装置、计算机设备及存储介质Syndrome monitoring and early warning method, device, computer equipment and storage medium
本申请要求于2021年4月16日提交中国专利局、申请号为202110412510.X,发明名称为“症候群监测预警方法、装置、计算机设备及存储介质”的中国专利申请的优先权,其全部内容通过引用结合在本申请中。This application claims the priority of the Chinese patent application filed on April 16, 2021 with the application number 202110412510.X and the invention titled "Method, Device, Computer Equipment and Storage Medium for Syndrome Monitoring and Early Warning", the entire contents of which are Incorporated herein by reference.
技术领域technical field
本申请涉及计算机技术领域,尤其涉及一种症候群监测预警方法、装置、计算机设备及存储介质。The present application relates to the field of computer technology, and in particular, to a method, device, computer equipment and storage medium for monitoring and early warning of symptoms.
背景技术Background technique
公共卫生是关系到一国或一个地区人民大众健康的公共事业。公共卫生的重要工作之一是对重大疾病尤其是传染病(如结核、艾滋病、SARS、新冠肺炎等)的预防、监控和治疗。其中,症候群监测预警就是一种提前于传染病监测预警的公共卫生异常事件监测预警机制。症候群监测预警能够主动监测公共卫生事件的萌芽状态并进行预警,为公共卫生人员采取有效的防治策略提供依据。Public health is a public undertaking related to the public health of the people in a country or a region. One of the important tasks of public health is the prevention, monitoring and treatment of major diseases, especially infectious diseases (such as tuberculosis, AIDS, SARS, new coronary pneumonia, etc.). Among them, syndrome monitoring and early warning is a kind of public health abnormal event monitoring and early warning mechanism ahead of infectious disease monitoring and early warning. Syndrome monitoring and early warning can actively monitor the budding state of public health events and give early warnings, providing a basis for public health personnel to take effective prevention and control strategies.
症候群监测能够及时发现异常或者疾病爆发信号的特点。基于症候群监测的症候群预警通过对症候群异常的捕获,能够提前于特定疾病预警之前发出预警信号。Syndrome monitoring can detect abnormalities or characteristics of disease outbreak signals in time. Syndrome early warning based on syndrome monitoring Through the capture of abnormal syndromes, early warning signals can be issued in advance of specific disease warnings.
发明人意识到,传统的症候群监测预警主要通过对历史数据进行分析,然后根据人工设置的阈值判断风险等级,需要耗费大量人力,预警效果受限于凭经验设置的阈值,且通过阈值判断风险等级容易造成“一刀切”,容易造成风险等级误报或漏报。The inventor realized that the traditional monitoring and early warning of symptoms mainly analyzes historical data, and then judges the risk level according to the threshold value set manually, which requires a lot of manpower, and the early warning effect is limited by the threshold value set based on experience, and the risk level is judged by the threshold value. It is easy to cause "one size fits all", and it is easy to cause false alarms or omissions of risk levels.
发明内容SUMMARY OF THE INVENTION
本申请提供了一种症候群监测预警方法、装置、计算机设备及存储介质,能够通过智能化的方法高效、有效地提供症候群预警风险等级,以为公共卫生人员采取有效的防治策略提供依据。The present application provides a syndrome monitoring and early warning method, device, computer equipment and storage medium, which can efficiently and effectively provide a syndrome early warning risk level through an intelligent method, so as to provide a basis for public health personnel to take effective prevention and treatment strategies.
第一方面,本申请提供了一种症候群监测预警方法,所述方法包括:In a first aspect, the present application provides a method for monitoring and early warning of symptoms, the method comprising:
从就诊人的电子病历中提取症状关键词;Extract symptom keywords from the patient's electronic medical record;
根据所述症状关键词及预设的若干症候群的关键词的匹配情况,判断就诊人是否属于其中一个症候群的病例;Judging whether the patient belongs to a case of one of the syndromes according to the matching situation of the symptom keywords and the keywords of several preset syndromes;
若确定就诊人属于其中一个症候群的病例,确定所述症候群为目标症候群;If it is determined that the patient belongs to a case of one of the syndromes, determine the syndrome as the target syndrome;
获取预设的时间单位内所述目标症候群的病例数;Obtain the number of cases of the target syndrome within a preset time unit;
根据当前的预设时间单位内所述目标症候群的病例数,计算目标症候群的若干统计指标;Calculate several statistical indicators of the target syndrome according to the number of cases of the target syndrome within the current preset time unit;
将所述目标症候群的若干统计指标输入风险预测模型,以获取所述目标症候群的风险等级;Inputting several statistical indicators of the target syndrome into a risk prediction model to obtain the risk level of the target syndrome;
根据所述目标症候群的风险等级,发布所述目标症候群的预警信息。According to the risk level of the target syndrome, the early warning information of the target syndrome is released.
第二方面,本申请提供了一种症候群监测预警装置,包括:In a second aspect, the present application provides a syndrome monitoring and early warning device, including:
症状提取模块,用于从就诊人的电子病历中提取症状关键词;The symptom extraction module is used to extract symptom keywords from the electronic medical record of the patient;
症状匹配模块,用于根据所述症状关键词及预设的若干症候群的关键词的匹配情况,判断就诊人是否属于其中一个目标症候群的病例;The symptom matching module is used to judge whether the patient belongs to a case of one of the target syndromes according to the matching situation of the symptom keywords and the keywords of several preset syndromes;
目标症候群确定模块,用于若确定就诊人属于其中一个症候群的病例,确定所述症候群为目标症候群;a target syndrome determination module, configured to determine the syndrome as the target syndrome if it is determined that the patient belongs to one of the syndrome cases;
病例统计模块,用于获取预设的时间单位内所述目标症候群的病例数;A case statistics module, used to obtain the number of cases of the target syndrome within a preset time unit;
指标计算模块,用于根据当前的预设时间单位内所述目标症候群的病例数,计算目标症候群的若干统计指标;The index calculation module is used to calculate some statistical indexes of the target syndrome according to the number of cases of the target syndrome in the current preset time unit;
风险预测模块,用于将所述目标症候群的若干统计指标输入风险预测模型,以获取所述目标症候群的风险等级;a risk prediction module for inputting several statistical indicators of the target syndrome into a risk prediction model to obtain the risk level of the target syndrome;
预警模块,用于根据所述目标症候群的风险等级,发布所述目标症候群的预警信息。An early warning module, configured to issue early warning information of the target syndrome according to the risk level of the target syndrome.
第三方面,本申请提供了一种计算机设备,所述计算机设备包括存储器和处理器;所述存储器用于存储计算机程序;所述处理器,用于执行所述计算机程序并在执行所述计算机程序时实现上述的症候群监测预警方法。In a third aspect, the present application provides a computer device, the computer device includes a memory and a processor; the memory is used for storing a computer program; the processor is used for executing the computer program and executing the computer During the program, the above-mentioned symptom monitoring and early warning method is realized.
第四方面,本申请提供了一种计算机可读存储介质,所述计算机可读存储介质存储有计算机程序,若所述计算机程序被处理器执行,实现上述的症候群监测预警方法。In a fourth aspect, the present application provides a computer-readable storage medium, where a computer program is stored in the computer-readable storage medium, and if the computer program is executed by a processor, the above-mentioned method for monitoring and early warning of a syndrome is implemented.
本申请公开了一种症候群监测方法、装置、计算机设备及存储介质,通过从就诊人的电子病历中提取症状关键词;根据所述症状关键词及目标症候群的关键词的匹配情况,判断就诊人是否属于目标症候群的病例;根据当前的预设时间单位统计所述目标症候群的病例数;根据所述目标症候群的病例数,计算目标症候群的若干统计指标;将所述统计指标输入风险预测模型,以获取预测的风险等级,实现基于人工智能的症候群监测预警,提升症候群监测预警的效率及准确性。The present application discloses a syndrome monitoring method, device, computer equipment and storage medium. By extracting symptom keywords from an electronic medical record of a patient; according to the matching situation of the symptom keywords and the keywords of a target syndrome, the patient is judged. Whether it belongs to the case of the target syndrome; count the number of cases of the target syndrome according to the current preset time unit; calculate some statistical indicators of the target syndrome according to the number of cases of the target syndrome; input the statistical indicators into the risk prediction model, In order to obtain the predicted risk level, artificial intelligence-based syndrome monitoring and early warning can be realized, and the efficiency and accuracy of syndrome monitoring and early warning can be improved.
附图说明Description of drawings
为了更清楚地说明本申请实施例技术方案,下面将对实施例描述中所需要使用的附图作简单地介绍,显而易见地,下面描述中的附图是本申请的一些实施例,对于本领域普通技术人员来讲,在不付出创造性劳动的前提下,还可以根据这些附图获得其他的附图。In order to explain the technical solutions of the embodiments of the present application more clearly, the following briefly introduces the accompanying drawings used in the description of the embodiments. For those of ordinary skill, other drawings can also be obtained from these drawings without any creative effort.
图1是本申请一实施例提供的一种症候群监测预警方法的流程示意图;1 is a schematic flowchart of a syndrome monitoring and early warning method provided by an embodiment of the present application;
图2是本申请一实施例提供的一种症候群监测预警装置的结构示意框图;2 is a schematic block diagram of the structure of a syndrome monitoring and early warning device provided by an embodiment of the present application;
图3是本申请一实施例提供的一种计算机设备的结构示意框图。FIG. 3 is a schematic structural block diagram of a computer device provided by an embodiment of the present application.
具体实施方式Detailed ways
下面将结合本申请实施例中的附图,对本申请实施例中的技术方案进行清楚、完整地描述,显然,所描述的实施例是本申请一部分实施例,而不是全部的实施例。基于本申请中的 实施例,本领域普通技术人员在没有做出创造性劳动前提下所获得的所有其他实施例,都属于本申请保护的范围。The technical solutions in the embodiments of the present application will be clearly and completely described below with reference to the accompanying drawings in the embodiments of the present application. Obviously, the described embodiments are part of the embodiments of the present application, not all of the embodiments. Based on the embodiments in the present application, all other embodiments obtained by those of ordinary skill in the art without creative work fall within the protection scope of the present application.
附图中所示的流程图仅是示例说明,不是必须包括所有的内容和操作/步骤,也不是必须按所描述的顺序执行。例如,有的操作/步骤还可以分解、组合或部分合并,因此实际执行的顺序有可能根据实际情况改变。另外,虽然在装置示意图中进行了功能模块的划分,但是在某些情况下,可以以不同于装置示意图中的模块划分。The flowcharts shown in the figures are for illustration only, and do not necessarily include all contents and operations/steps, nor do they have to be performed in the order described. For example, some operations/steps can also be decomposed, combined or partially combined, so the actual execution order may be changed according to the actual situation. In addition, although the functional modules are divided in the schematic diagram of the device, in some cases, the modules may be divided differently from the schematic diagram of the device.
本申请的实施例提供了一种症候群监测预警方法、装置、计算机设备及计算机可读存储介质。用于基于人工智能技术,提高症候群监测预警的效率及准确性。示例性的,在症候群的监测预警中,若通过对历史数据进行分析,然后根据人工设置的阈值判断风险等级,需要耗费大量人力,预警效果受限于凭经验设置的阈值。根据本申请实施例的症候群预警方法,通过基于人工智能的方法高效地提供预警风险等级,避免了传统方法中依赖人工设置阈值判断风险等级时根据阈值“一刀切”造成误报或漏报的问题,提高了风险等级预测的准确性。Embodiments of the present application provide a syndrome monitoring and early warning method, apparatus, computer device, and computer-readable storage medium. It is used to improve the efficiency and accuracy of syndrome monitoring and early warning based on artificial intelligence technology. Exemplarily, in the monitoring and early warning of the syndrome, if the historical data is analyzed and then the risk level is judged according to the manually set threshold, it requires a lot of manpower, and the early warning effect is limited by the threshold set based on experience. According to the syndrome early warning method of the embodiment of the present application, the early warning risk level is efficiently provided by the method based on artificial intelligence, and the problem of false positives or false negatives caused by "one size fits all" according to the threshold value in the traditional method when the risk level is judged by relying on the manual setting of the threshold value is avoided, Improves the accuracy of risk level predictions.
其中,该症候群监测预警方法可以用于服务器,当然也可以用于终端,其中,终端可以是手机、平板电脑、笔记本电脑、台式电脑等电子设备;服务器例如可以为单独的服务器或服务器集群。但为了便于理解,以下实施例将以应用于服务器的症候群监测预警方法进行详细介绍。Wherein, the syndrome monitoring and early warning method can be applied to a server, and of course can also be applied to a terminal, wherein the terminal can be an electronic device such as a mobile phone, a tablet computer, a notebook computer, a desktop computer, etc.; the server can be, for example, a separate server or a server cluster. However, in order to facilitate understanding, the following embodiments will be described in detail with a method for monitoring and early warning of symptoms applied to a server.
下面结合附图,对本申请的一些实施方式作详细说明。在不冲突的情况下,下述的实施例及实施例中的特征可以相互组合。Some embodiments of the present application will be described in detail below with reference to the accompanying drawings. The embodiments described below and features in the embodiments may be combined with each other without conflict.
请参阅图1,图1是本申请实施例提供的一种症候群监测预警方法的示意流程图。Please refer to FIG. 1. FIG. 1 is a schematic flowchart of a method for monitoring and early warning of a syndrome provided by an embodiment of the present application.
如图1所示,该症候群监测预警方法可以包括以下步骤S110-步骤S170。As shown in FIG. 1 , the syndrome monitoring and early warning method may include the following steps S110-S170.
步骤S110、从就诊人的电子病历中提取症状关键词。Step S110, extracting symptom keywords from the electronic medical record of the patient.
所述症状关键词可以包括症状的表现和症状的相关属性,相关属性可以是症状发生的时间、持续时间及严重程度等,例如就诊人的电子病例中记载有“就诊人三天前开始持续发热”,对应的症状关键词为“发热三天”,其中“三天”是症状的表现,“三天”是症状的相关属性之一。具体实施时,可以根据所述目标症候群的关键词的构成确定所述症状关键词的构成,例如所述目标症候群的关键词只有症状表现,那么所述症状关键词同样只保留“发热”等症状表现即可,如果所述目标症候群的关键词包括症状的表现及症状的相关属性,那么所述症状关键词同样包括症状的表现及症状的相关属性。The symptom keywords may include symptoms and related attributes of the symptoms, and the related attributes may be the time, duration, and severity of the symptoms. ", the corresponding symptom keyword is "fever for three days", where "three days" is the manifestation of symptoms, and "three days" is one of the related attributes of symptoms. In specific implementation, the composition of the symptom keyword can be determined according to the composition of the keyword of the target syndrome. For example, the keyword of the target syndrome only has symptoms, then the symptom keyword also only retains symptoms such as "fever". It is sufficient to express. If the keyword of the target syndrome includes the manifestation of the symptom and the related attributes of the symptom, the symptom keyword also includes the manifestation of the symptom and the related attribute of the symptom.
示例性的,步骤S110具体包括步骤S111-S112:Exemplarily, step S110 specifically includes steps S111-S112:
步骤S111、通过命名实体识别,从所述电子病历的文本中提取就诊人的症状初始关键词;Step S111, through named entity recognition, extract the initial keywords of the symptoms of the patient from the text of the electronic medical record;
命名实体识别技术(NER,Named Entity Recognition),是一项识别文本中具有特定意义的实体的自然语言处理技术。通过命名实体识别,可以将希望获取到的实体类型,从文本里提取出来。例如,就诊人的症状是希望获取到的实体类型,通过命名实体识别技术开发的工具可以将句子中的“发热”、“恶心”等具体的症状从电子病例从提取出来。Named Entity Recognition (NER, Named Entity Recognition) is a natural language processing technology that recognizes entities with specific meanings in text. Through named entity recognition, the desired entity type can be extracted from the text. For example, the symptom of the patient is the type of entity that is expected to be obtained. The tool developed through the named entity recognition technology can extract specific symptoms such as "fever" and "nausea" in the sentence from the electronic medical record.
步骤S112、基于预设的标准化词汇表,将所述症状初始关键词中的别名词汇替换成标准词汇,以获取所述症状关键词。Step S112 , based on a preset standardized vocabulary, replace the alias vocabulary in the symptom initial keyword with a standard vocabulary to obtain the symptom keyword.
例如,症状初始关键词为“闹肚子”,而医学上对应的标准词汇应为“腹泻”,则“闹肚子”为别名词汇,通过标准词汇替换后获得的所述症状关键词为“腹泻”。For example, if the initial keyword of the symptom is "nausea", and the corresponding standard medical term should be "diarrhea", then "nausea" is an alias word, and the symptom keyword obtained by replacing the standard word is "diarrhea".
可通过收集整理症状相关的别名词汇及标准词汇,以形成所述标准化词汇表。The standardized vocabulary can be formed by collecting and arranging symptom-related alias words and standard words.
示例性的,将所述症状关键词整理成结构化数据。结构化数据也称作行数据,是由二维表结构来逻辑表达和实现的数据,严格地遵循数据格式与长度规范,主要通过关系型数据库进行存储和管理,通过结构化可方便数据的管理、使用、展示。Exemplarily, the symptom keywords are organized into structured data. Structured data, also known as row data, is data that is logically expressed and implemented by a two-dimensional table structure. It strictly follows the data format and length specifications, and is mainly stored and managed through relational databases. Data management can be facilitated through structuring. , use, display.
步骤S120、根据所述症状关键词及预设的若干症候群的关键词的匹配情况,判断就诊人是否属于其中一个症候群的病例。Step S120 , according to the matching situation of the symptom keyword and the keywords of several preset syndrome groups, determine whether the patient belongs to a case of one of the syndrome groups.
症候群,指在某些可能的疾病出现时,经常会同时出现的多种临床症状。例如,如表1所示,是一实施例中预设的五个症候群及与其症状对应的关键词。具体实施时,可以根据需要添加或定义新的症候群。Syndrome refers to multiple clinical symptoms that often appear at the same time when certain possible diseases appear. For example, as shown in Table 1, there are five preset syndromes and keywords corresponding to their symptoms in an embodiment. During specific implementation, new syndromes can be added or defined as needed.
表1:症候群及其对应的关键词Table 1: Syndromes and their corresponding keywords
Figure PCTCN2021097415-appb-000001
Figure PCTCN2021097415-appb-000001
Figure PCTCN2021097415-appb-000002
Figure PCTCN2021097415-appb-000002
一些实施例中,步骤S120具体包括精确匹配步骤和/或模糊匹配步骤。In some embodiments, step S120 specifically includes an exact matching step and/or a fuzzy matching step.
示例性的,所述精确匹配步骤包括:Exemplarily, the exact matching step includes:
将所述症状关键词与各个症候群的每个关键词进行比较,若所有所述症状关键词在其中一个症候群中均有对应的关键词,则确定所述就诊人属于所述其中一个症候群的病例。Compare the symptom keywords with each keyword of each syndrome, and if all the symptom keywords have corresponding keywords in one of the syndromes, then determine that the patient belongs to the case of the one of the syndromes .
例如,预设的若干症候群中包括第一症候群,第一症候群的关键词为{A,B,C,D},就诊人甲的症状关键词为{A,B,C,D},就诊人乙的症状关键词为{C,D,E,F,G}。将就诊人甲的症状关键词与第一症候群的每个关键词进行一对一比较,根据匹配度计算公式:匹配度=被比较的症候群的关键词中与所述症状关键词一致的个数/所述症状关键词的个数,得到匹配度为4/4=1,说明就诊人甲的每个所述症状关键词均有与其一致的第一症候群的关键词,因此就诊人甲为第一症候群的病例;将就诊人乙的症状关键词与第一症候群的每个关键词进行一对一比较,根据所述匹配度计算公式得到的匹配度为2/5=0.4,匹配度小于1,说明就诊人乙的症状关键词中存在部分无与其一致的第一症候群的关键词,目前不能判定就诊人乙是第一症候群的病例。For example, a number of preset syndromes include the first syndrome, the keyword of the first syndrome is {A, B, C, D}, the symptom keyword of the patient A is {A, B, C, D}, and the patient B's symptom keywords are {C,D,E,F,G}. One-to-one comparison is made between the symptom keywords of patient A and each keyword of the first syndrome, and the formula is calculated according to the matching degree: matching degree=the number of the keywords of the syndrome being compared that are consistent with the symptom keywords / The number of the symptom keywords, the matching degree is 4/4=1, indicating that each symptom keyword of the patient A has the same keyword of the first syndrome, so the patient A is the first symptom keyword. A case of a syndrome; one-to-one comparison between the symptom keywords of the patient B and each keyword of the first syndrome, the matching degree obtained according to the matching degree calculation formula is 2/5=0.4, and the matching degree is less than 1 , indicating that there are some keywords that do not have the first syndrome in the symptom keywords of the patient B, and it is currently impossible to determine that the patient B is a case of the first syndrome.
示例性的,所述模糊匹配步骤包括步骤S122a-步骤S122e:Exemplarily, the fuzzy matching step includes steps S122a-step S122e:
步骤S122a、根据所述症状关键词及某个症候群的关键词,构建所述症状关键词的多热编码(multi-hot encoding)及所述某个症候群的关键词的多热编码。Step S122a, construct a multi-hot encoding of the symptom keyword and a multi-hot encoding of the keyword of a certain syndrome according to the symptom keyword and the keyword of a certain syndrome.
例如,就诊人乙的症状关键词为{C、D、E、F、G},第一症候群的关键词为{A、B、C、D},就诊人乙的症状关键词对应的多热编码为{0、0、1、1、1、1、1},该多热编码中的元素分别代表没有关键词A、没有关键词B、存在关键词C……,而第一症候群的关键词对应的多热编码为{1、1、1、1、0、0、0}。For example, the symptom keywords of patient B are {C, D, E, F, G}, the keywords of the first syndrome are {A, B, C, D}, and the symptom keywords of patient B correspond to the multiple fever The encoding is {0, 0, 1, 1, 1, 1, 1}, the elements in the multi-hot encoding respectively represent the absence of keyword A, the absence of keyword B, the presence of keyword C..., and the key of the first syndrome The corresponding multi-hot encoding of words is {1, 1, 1, 1, 0, 0, 0}.
步骤S122b、将所述症状关键词的多热编码输入自编码器(autoencoder),获取所述自编码器输出的自编码向量。Step S122b: Input the multi-hot encoding of the symptom keyword into an autoencoder, and obtain an autoencoder vector output by the autoencoder.
示例性的,本方法还包括获取所述自编码器的训练步骤S210,具体包括步骤S211-步骤S215。Exemplarily, the method further includes a training step S210 of acquiring the autoencoder, specifically including steps S211-S215.
步骤S211、获取训练数据,所述训练数据包括多热编码。Step S211: Acquire training data, where the training data includes multi-hot encoding.
示例性的,可以将所述症状关键词的多热编码或所述某个症候群的关键词的多热编码作为训练数据,也可以通过随机生成的方式获得作为训练数据的多热编码。Exemplarily, the multi-hot encoding of the symptom keyword or the multi-hot encoding of the keyword of a certain syndrome may be used as the training data, or the multi-hot encoding as the training data may be obtained by random generation.
步骤S212、获取待训练的自编码器,所述自编码器包括第一网络和第二网络。Step S212: Obtain an autoencoder to be trained, where the autoencoder includes a first network and a second network.
示例性的,所述第一网络与所述第二网络结构对称;所述第一网络及所述第二网络均包括多层全连接层。全连接层,即该层的每一个结点均与上一层的所有结点相连,通过多层全连接层可提高所述自编码器的非线性表达能力。Exemplarily, the structures of the first network and the second network are symmetrical; both the first network and the second network include multiple fully connected layers. A fully-connected layer, that is, each node of this layer is connected to all nodes of the previous layer, and the nonlinear expression capability of the autoencoder can be improved by using multiple fully-connected layers.
步骤S213、将所述训练数据中的多热编码输入所述第一网络,得到所述第一网络输出的自编码向量,所述自编码向量的维度低于所述多热编码的维度。Step S213: Input the multi-hot encoding in the training data into the first network to obtain an auto-encoding vector output by the first network, where the dimension of the self-encoding vector is lower than the dimension of the multi-hot encoding.
所述第一网络,构成一个编码器,将高维稀疏的多热编码压缩成低维密实的所述自编码向量。The first network constitutes an encoder to compress the high-dimensional sparse multi-hot encoding into the low-dimensional dense self-encoding vector.
步骤S214、将所述第一网络输出的自编码向量输入所述第二网络,得到所述第二网络输出的向量,所述第二网络输出的向量的维度等于所述多热编码的维度;Step S214, inputting the self-encoding vector output by the first network into the second network to obtain a vector output by the second network, and the dimension of the vector output by the second network is equal to the dimension of the multi-hot encoding;
所述第二网络,构成一个解码器,将低维密实的自编码向量重构成高维稀疏的多热编码。The second network constitutes a decoder that reconstructs the low-dimensional dense self-encoding vector into a high-dimensional sparse multi-hot encoding.
步骤S215、根据所述第二网络输出的向量和所述多热编码,对所述第一网络和所述第二网络进行训练。Step S215: Train the first network and the second network according to the vector output by the second network and the multi-hot encoding.
示例性,根据所述第二网络输出的向量和所述多热编码的误差,调整所述第一网络及所述第二网络的网络参数,以使所述第二网络输出的向量和所述多热编码趋于一致。Exemplarily, according to the vector output by the second network and the error of the multi-hot encoding, the network parameters of the first network and the second network are adjusted, so that the vector output by the second network and the Multi-hot encoding tends to be consistent.
示例性的,完成自编码器的训练后,将所述多热编码输入所述自编码器,在所述第一网络的输出获取所述自编码向量。Exemplarily, after the training of the autoencoder is completed, the multi-hot encoding is input into the autoencoder, and the autoencoder vector is obtained at the output of the first network.
步骤S122c、根据所述某个症候群的关键词的多热编码,通过所述自编码器获取所述某个症候群的关键词的自编码向量;Step S122c, according to the multi-hot encoding of the keyword of a certain syndrome, obtain the self-encoding vector of the keyword of the certain syndrome by the autoencoder;
示例性的,将所述某个症候群的关键词的多热编码输入所述自编码器,在所述第一网络的输出获取所述某个症候群的自编码向量。Exemplarily, the multi-hot encoding of the keywords of the certain syndrome is input into the autoencoder, and the autoencoder vector of the certain syndrome is obtained at the output of the first network.
步骤S122d、计算所述症状关键词的自编码向量及所述某个症候群的关键词的自编码向量的相似度;Step S122d, calculating the similarity of the self-encoding vector of the symptom keyword and the self-encoding vector of the keyword of a certain syndrome;
示例性的,通过余弦相似度计算所述相似度,余弦相似度的计算公式为:Exemplarily, the similarity is calculated by cosine similarity, and the calculation formula of cosine similarity is:
Figure PCTCN2021097415-appb-000003
Figure PCTCN2021097415-appb-000003
其中,X为所述症状关键词的自编码向量、Y为所述某个症候群的关键词的自编码向量,n为所述自编码向量的维度。Wherein, X is the self-encoding vector of the symptom keyword, Y is the self-encoding vector of the keyword of the certain syndrome, and n is the dimension of the self-encoding vector.
其他实施例中,也可以通过欧几里得距离、曼哈顿距离、明可夫斯基距离等方法计算所述相似度。In other embodiments, the similarity may also be calculated by methods such as Euclidean distance, Manhattan distance, and Minkowski distance.
步骤S122e、根据所述相似度判断所述就诊人是否属于所述某个症候群的病例。Step S122e, judging whether the patient belongs to a case of the certain syndrome according to the similarity.
示例性的,预设一个相似度阈值,例如,将所述相似度阈值设置为0.8。若所述相似度超过所述相似度阈值,则对应的就诊人是所述某个症候群的病例;否则,对应的就诊人不是所述某个症候群的病例。Exemplarily, a similarity threshold is preset, for example, the similarity threshold is set to 0.8. If the similarity exceeds the similarity threshold, the corresponding patient is a case of the certain syndrome; otherwise, the corresponding patient is not a case of the certain syndrome.
示例性的,将所述精确匹配步骤及所述模糊匹配步骤结合使用,例如,先执行所述精确匹配步骤,对于经过所述精确匹配步骤还不能判断就诊人是否属于所述某个症候群的病例的,再执行所述模糊匹配步骤以进一步判断该就诊人是否属于所述某个症候群的病例。Exemplarily, the precise matching step and the fuzzy matching step are used in combination, for example, the precise matching step is performed first, and it is not possible to determine whether the patient belongs to the certain syndrome after the precise matching step. , and then perform the fuzzy matching step to further determine whether the patient belongs to the case of the certain syndrome.
步骤S130、若确定就诊人属于其中一个症候群的病例,确定所述症候群为目标症候群。Step S130 , if it is determined that the patient belongs to a case of one of the syndromes, determine that the syndrome is the target syndrome.
示例性的,通过步骤S110至步骤S130对一个个电子病历进行处理,以获取所有所述目标症候群;若处理完所有电子病历后,所述目标症候群的个数仍为0,则说明当前无预设的 症候群的病例,不需要进行风险预警。Exemplarily, the electronic medical records are processed one by one through steps S110 to S130 to obtain all the target syndromes; if after all the electronic medical records are processed, the number of the target syndromes is still 0, it means that there is currently no forecast. No risk warning is required for cases of the designated syndromes.
步骤S140、获取预设的时间单位内所述目标症候群的病例数。Step S140: Acquire the number of cases of the target syndrome within a preset time unit.
例如,预设的时间单位为天,获取以天为单位的所述目标症候群的病例数。For example, the preset time unit is days, and the number of cases of the target syndrome in days is obtained.
示例性的,所述目标症候群的病例数,可以在步骤S120或步骤S130的执行过程中进行统计。例如,确定一个就诊人属于其中一个症候群的病例,就将该症候群对应就诊日期的病例数增加一个。Exemplarily, the number of cases of the target syndrome may be counted during the execution of step S120 or step S130. For example, if it is determined that a patient belongs to one of the syndrome cases, the number of cases of the syndrome corresponding to the visit date is increased by one.
步骤S150、根据当前的预设时间单位内所述目标症候群的病例数,计算目标症候群的若干统计指标。Step S150: Calculate several statistical indexes of the target syndrome according to the number of cases of the target syndrome within the current preset time unit.
示例性的,根据当前的预设时间单位内所述目标症候群的历史病例数,以及所述目标症候群的历史病例数,计算所述目标症候群的若干统计指标,所述统计指标包括环比增长率、同比增长率、历史百分位中的一种或多种。环比增长率,表示连续2个统计周期内的统计量的变化比,具体实施时,可以计算对应多个统计周期的环比,例如,当天为2020年9月17日,需对2020年9月18日进行预警,则可以根据2020年9月17日所述目标症候群的病例数(当前的预设时间单位内所述目标症候群的病例数),及2020年9月16日所述目标症候群的病例数(所述目标症候群的历史病例数)计算日-环比增长率,根据2020年9月11日至2020年9月17日所述目标症候群的病例数及2020年9月4日至2020年9月10日所述目标症候群的病例数计算周-环比增长率,根据2020年8月18日至2020年9月17日所述目标症候群的病例数及2020年7月18日至2020年8月17日所述目标症候群的病例数计算月-环比增长率;同比增长率一般是指今年某个统计周期内的统计量相对于去年同期的量的变化比,类似的,可以计算对应多个统计周期的同比,例如,根据2020年9月11日至2020年9月17日所述目标症候群病例数及2019年9月11日至2019年9月17日所述目标症候群的病例数计算周-同比增长率,根据2020年8月18日至2020年9月17日所述目标症候群的病例数及2019年8月18日至2020年9月17日所述目标症候群的病例数计算月-同比增长率;历史百分位,指某个周期内的统计量在历史参考数据中通过百分比量化的位置,例如,过去四天的所述病例数分别为100、500、300、200,今天的病例数为400,过去4天共有3天所述病例数低于今天的病例数,那么以过去四天的病例数作为历史参考数据,今天的病例数的历史百分位为3/4=75%;类似的,可以计算对应多个统计周期的历史百分位,例如,以2019年1月1日至2019年12月31日所述目标症候群的病例数作为历史参考数据,根据2020年9月17日所述目标症候群的病例数及2019年1月1日至2019年12月31日所述目标症候群的病例数计算日-历史百分位,根据2020年9月11日至2020年9月17日所述目标症候群的病例数及2019年1月1日至2019年12月31日所述目标症候群的病例数计算周-历史百分位,根据2020年8月18日至2020年9月17日所述目标症候群的病例数及2019年1月1日至2019年12月31日所述目标症候群的病例数计算月-历史百分位。当然,具体实施时,本方法中采纳的统计指标并不局限于上述几种,例如,还可以类似上述求所述日-环比增长率的方法,求2020年9月17日所述目标症候群的病例数分别相对于2020年9月14日所述目标症候群的病例数、2020年9月15日所述目标症候群的病例数的增长率,此外,定基比等其他可反映 数据变化情况的统计指标也可以使用在本方法中。Exemplarily, according to the number of historical cases of the target syndrome in the current preset time unit, and the number of historical cases of the target syndrome, calculate several statistical indicators of the target syndrome, and the statistical indicators include the chain growth rate, One or more of year-over-year growth rate, historical percentile. The chain growth rate represents the change ratio of the statistics in two consecutive statistical periods. During the specific implementation, the chain ratio corresponding to multiple statistical periods can be calculated. If an early warning is carried out on September 17, 2020 (the number of cases of the target syndrome in the current preset time unit), and the number of cases of the target syndrome on September 16, 2020 (the number of historical cases of the target syndrome) to calculate the daily-monthly growth rate, based on the number of cases of the target syndrome from September 11, 2020 to September 17, 2020 and the number of cases of the target syndrome from September 4, 2020 to September 2020. Week-to-month growth rate calculated for the number of cases of the target syndrome described on 10th August 2020, based on the number of cases of the target syndrome described in the period from 18th August 2020 to 17th September 2020 and the The number of cases of the target syndrome mentioned on the 17th is calculated to calculate the month-on-month growth rate; the year-on-year growth rate generally refers to the change ratio of the statistics in a certain statistical period this year relative to the amount in the same period last year. Similarly, the corresponding statistics can be calculated. The year-on-year period of the cycle, for example, is calculated based on the number of cases of the target syndrome from September 11, 2020 to September 17, 2020 and the number of cases of the target syndrome from September 11, 2019 to September 17, 2019- Year-on-year growth rate, calculated based on the number of cases of the target syndrome from August 18, 2020 to September 17, 2020 and the number of cases of the target syndrome from August 18, 2019 to September 17, 2020 month-year-on-year Growth rate; historical percentile, which refers to the position at which the statistic in a certain period is quantified by percentage in the historical reference data, for example, the number of said cases in the past four days were 100, 500, 300, 200, and today's cases If the number of cases is 400, and the number of cases in the past 4 days is lower than the number of cases today, then the number of cases in the past four days is used as the historical reference data, and the historical percentile of the number of cases today is 3/4=75% ; Similarly, historical percentiles corresponding to multiple statistical periods can be calculated. For example, the number of cases of the target syndrome from January 1, 2019 to December 31, 2019 is used as historical reference data. The number of cases of the target syndrome on the 17th and the number of cases of the target syndrome from January 1, 2019 to December 31, 2019 Calculation date-historical percentile, based on September 11, 2020 to September 2020 The number of cases of the target syndrome described on the 17th and the number of cases of the target syndrome described in January 1, 2019 to December 31, 2019 Calculation week-historical percentile, based on August 18, 2020 to September 2020 Month-historical percentiles were calculated for the number of cases of the target syndrome on the 17th and the number of cases of the target syndrome from January 1, 2019 to December 31, 2019. Of course, during the specific implementation, the statistical indicators adopted in this method are not limited to the above-mentioned ones. The growth rate of the number of cases relative to the number of cases of the target syndrome described on September 14, 2020, the number of cases of the target syndrome described on September 15, 2020, and other statistical indicators that can reflect changes in data such as the fixed-base ratio can also be used in this method.
本实施例中,是以所述目标症候群的病例数作为统计量计算所述统计指标,其他实施例中,也可以以根据所述目标症候群的病例数得到的数据作为统计量计算所述统计指标,例如以所述目标症候群的病例数占所述就诊人的比例作为统计量计算所述统计指标。In this embodiment, the number of cases of the target syndrome is used as a statistic to calculate the statistical index. In other embodiments, the data obtained according to the number of cases of the target syndrome can be used as a statistic to calculate the statistical index. , for example, the statistical index is calculated by taking the ratio of the number of cases of the target syndrome to the patient as a statistic.
步骤S160、将所述目标症候群的若干统计指标输入风险预测模型,以获取预测的风险等级。Step S160: Input several statistical indicators of the target syndrome into a risk prediction model to obtain a predicted risk level.
示例性的,将所述统计指标以及风险因子输入风险预测模型,以获取所述目标症候群的风险等级,所述风险因子包括天气数据和/或环境数据。例如,所述天气数据包括当时的温度、湿度及是否为晴天阴天或雨天,所述环境数据包括当时的空气质量、水质等级等。通过引入所述风险因子,使得所述风险等级模型可以从更丰富的数据维度对风险等级进行预测,使得风险等级的预测更加准确。所述风险预测模型可通过以下方法训练得到:分别将过去某个时间对应的所述统计指标及所述风险因子输入初始模型,本实施例中所述初始模型为XGboost模型,具体实施时也可以采用前馈神经网络、支持向量机等其他机器学习模型;将所述某个时间的风险等级作为所述统计指标的标签,将所述标签作为模型的期望输出,对模型进行训练。过去某个时间的风险等级,若有相应的查询渠道,则可以直接通过这些渠道进行查询,例如所述目标症候群为发热呼吸道症候群,而社会上有相应的发热呼吸道的风险等级公布,则可以通过对以往公布的发热呼吸道的风险等级,获得过去某个时间的发热呼吸道症候群的风险等级;若无相应的查询渠道,则可以根据所述目标症候群能查询到的主要相关疾病的风险等级进行判断得到,例如所述目标症候群为发热呼吸道症候群,目前没有相应的发热呼吸道的风险等级,而发热呼吸道症候群相关的主要疾病是新冠疫情,则可以根据该时间的新冠疫情的风险等级,得到所述发热呼吸道症候群在该时间的风险等级。Exemplarily, the statistical indicators and risk factors are input into a risk prediction model to obtain the risk level of the target syndrome, and the risk factors include weather data and/or environmental data. For example, the weather data includes the current temperature, humidity, and whether it is a sunny, cloudy or rainy day, and the environmental data includes the current air quality, water quality level, and the like. By introducing the risk factor, the risk level model can predict the risk level from more abundant data dimensions, so that the prediction of the risk level is more accurate. The risk prediction model can be trained by the following methods: respectively inputting the statistical indicators and the risk factors corresponding to a certain time in the past into the initial model, the initial model in this embodiment is the XGboost model, and the specific implementation can also be Use feedforward neural network, support vector machine and other machine learning models; use the risk level at a certain time as the label of the statistical index, and use the label as the expected output of the model to train the model. The risk level at a certain time in the past, if there are corresponding query channels, you can directly query through these channels. For example, the target syndrome is fever respiratory syndrome, and the society has the corresponding risk level of the fever respiratory tract announced, you can use these channels to query. For the risk level of the fever respiratory tract published in the past, obtain the risk level of the fever respiratory syndrome at a certain time in the past; if there is no corresponding query channel, it can be judged according to the risk level of the main related diseases that can be queried for the target syndrome. For example, the target syndrome is febrile respiratory syndrome, there is currently no corresponding risk level of febrile respiratory tract, and the main disease related to febrile respiratory syndrome is the new crown epidemic, then the fever respiratory tract can be obtained according to the risk level of the new crown epidemic at that time. The risk level of the syndrome at that time.
示例性的,通过解释模型(SHAP,SHapley Additive exPlanations)对所述风险预测模型进行解释。通过解释模型,可以对所述风险模型的输入中的所述统计指标或所述风险因子在得到风险预测结果中所起的影响进行评估,从而可以知道是哪些数据对风险等级产生了重要影响,后续可对产生重要影响的数据进行重点关注,提高了所述风险预测模型的可用性。Exemplarily, the risk prediction model is explained by an explanation model (SHAP, SHapley Additive exPlanations). By interpreting the model, the impact of the statistical indicators in the input of the risk model or the risk factors in obtaining the risk prediction result can be evaluated, so as to know which data has an important impact on the risk level, Subsequent focus on data with significant impact improves the usability of the risk prediction model.
步骤S170、根据所述目标症候群的风险等级,发布所述目标症候群的预警信息。Step S170: Publish early warning information of the target syndrome according to the risk level of the target syndrome.
例如,所述目标症候群的风险等级包括高、中、低,所述目标症候群为发热呼吸道症候群。若所述目标症候群的风险等级为高,发布所述目标症候群的预警信息包括:发热呼吸道症候群高风险,提高警惕,实施高级别防治措施;若所述目标症候群的风险等级为中,发布的所述目标症候群的预警信息包括:发热呼吸道症候群中风险,密切关注病情动态,实施一般防治措施;若所述目标症候群的风险等级为低,发布的所述目标症候群的预警信息包括:发热呼吸道症候群低风险。For example, the risk level of the target syndrome includes high, medium and low, and the target syndrome is febrile respiratory syndrome. If the risk level of the target syndrome is high, the early warning information of the target syndrome to be released includes: high risk of fever and respiratory syndrome, heightened vigilance, and implementation of high-level prevention and control measures; if the risk level of the target syndrome is medium, all published The early warning information of the target syndrome includes: the risk of febrile respiratory syndrome, pay close attention to the situation of the disease, and implement general prevention and control measures; if the risk level of the target syndrome is low, the published early warning information of the target syndrome includes: fever respiratory syndrome low risk.
本申请通过基于风险预测模型的方法对风险等级预测,避免了的人工分析中耗费人力、效率低的缺陷,同时获取风险等级由风险预测模型通过非线性的计算得到,避免了通过阈值判断风险等级时容易造成“一刀切”,提高了风险等级预测的准确性。The present application predicts the risk level by the method based on the risk prediction model, which avoids the defects of labor-intensive and low efficiency in the manual analysis. At the same time, the risk level is obtained by the risk prediction model through nonlinear calculation, which avoids judging the risk level by the threshold value. It is easy to cause "one size fits all", which improves the accuracy of risk level prediction.
需要强调的是,为进一步保证上述症状关键词、统计指标、风险预测模型及风险等级的 私密和安全性,上述症状关键词、统计指标、所述风险预测模型以及风险等级还可以存储于一区块链的节点中。It should be emphasized that, in order to further ensure the privacy and security of the above symptom keywords, statistical indicators, risk prediction models and risk levels, the above symptom keywords, statistical indicators, the risk prediction model and risk levels can also be stored in one area. in the nodes of the blockchain.
本申请所指区块链是分布式数据存储、点对点传输、共识机制、加密算法等计算机技术的新型应用模式。区块链(Blockchain),本质上是一个去中心化的数据库,是一串使用密码学方法相关联产生的数据块,每一个数据块中包含了一批次网络交易的信息,用于验证其信息的有效性(防伪)和生成下一个区块。区块链可以包括区块链底层平台、平台产品服务层以及应用服务层等。The blockchain referred to in this application is a new application mode of computer technologies such as distributed data storage, point-to-point transmission, consensus mechanism, and encryption algorithm. Blockchain, essentially a decentralized database, is a series of data blocks associated with cryptographic methods. Each data block contains a batch of network transaction information to verify its Validity of information (anti-counterfeiting) and generation of the next block. The blockchain can include the underlying platform of the blockchain, the platform product service layer, and the application service layer.
如图2所示,该症候群监测预警装置,包括:症状提取模块110、症状匹配模块120、目标症候群确定模块130、病例统计模块140、指标计算模块150、风险预测模块160及预警模块170。As shown in FIG. 2 , the syndrome monitoring and early warning device includes: a symptom extraction module 110 , a symptom matching module 120 , a target syndrome determination module 130 , a case statistics module 140 , an index calculation module 150 , a risk prediction module 160 and an early warning module 170 .
症状提取模块110,用于从就诊人的电子病历中提取症状关键词;The symptom extraction module 110 is used for extracting symptom keywords from the electronic medical record of the patient;
示例性的,症状提取模块110包括初始关键词提取模块及标准化模块。Exemplarily, the symptom extraction module 110 includes an initial keyword extraction module and a normalization module.
初始关键词提取模块,用于通过命名实体识别,从所述电子病历的文本中提取就诊人的症状初始关键词;an initial keyword extraction module, used for extracting the initial keywords of the symptoms of the patient from the text of the electronic medical record through named entity recognition;
标准化模块,用于基于预设的标注化词汇表,将所述症状初始关键词中的别名词汇替换成标准词汇,以获取所述症状关键词。可通过收集整理症状相关的别名词汇及标准词汇,以形成所述标准化词汇表。The standardization module is configured to replace the alias words in the symptom initial keywords with standard words based on a preset tagging vocabulary, so as to obtain the symptom keywords. The standardized vocabulary can be formed by collecting and arranging symptom-related alias words and standard words.
示例性的,将所述症状关键词整理成结构化数据。结构化数据也称作行数据,是由二维表结构来逻辑表达和实现的数据,严格地遵循数据格式与长度规范,主要通过关系型数据库进行存储和管理,通过结构化可方便数据的管理、使用、展示。Exemplarily, the symptom keywords are organized into structured data. Structured data, also known as row data, is data that is logically expressed and implemented by a two-dimensional table structure. It strictly follows the data format and length specifications, and is mainly stored and managed through relational databases. Data management can be facilitated through structuring. , use, display.
症状匹配模块120,用于根据所述症状关键词及预设的若干症候群的关键词的匹配情况,判断就诊人是否属于其中一个症候群的病例。The symptom matching module 120 is configured to judge whether the patient belongs to a case of one of the syndromes according to the matching situation of the symptom keyword and the keywords of several preset syndromes.
示例性的症状匹配模块120包括精确匹配模块及模糊匹配模块。Exemplary symptom matching modules 120 include exact matching modules and fuzzy matching modules.
示例性的,精确匹配模块,用于将所述症状关键词与各个症候群的每个关键词进行比较,若所有所述症状关键词在其中一个症候群中均有对应的关键词,则确定所述就诊人属于所述其中一个症候群的病例。Exemplarily, an exact matching module is used to compare the symptom keywords with each keyword of each syndrome, and if all the symptom keywords have corresponding keywords in one of the syndromes, determine the The patient belongs to the case of one of the syndromes described.
示例性的,模糊匹配模块包括多热编码构建模块、症状关键词编码模块、目标症候群关键词编码模块、相似度计算模块及病例判断模块。Exemplarily, the fuzzy matching module includes a multi-hot encoding building module, a symptom keyword encoding module, a target syndrome keyword encoding module, a similarity calculation module, and a case judgment module.
多热编码构建模块,用于根据所述症状关键词及某个症候群的关键词,构建所述症状关键词的多热编码及所述某个症候群的关键词的多热编码。A multi-hot encoding building module is configured to construct a multi-hot encoding of the symptom keyword and a multi-hot encoding of the keyword of a certain syndrome according to the symptom keyword and the keyword of a certain syndrome.
症状关键词编码模块,用于将所述症状关键词的多热编码输入自编码器,获取所述自编码器输出的自编码向量。The symptom keyword encoding module is configured to input the multi-hot encoding of the symptom keyword into the autoencoder, and obtain the autoencoder vector output by the autoencoder.
症候群关键词编码模块,用于根据所述某个症候群的关键词的多热编码,通过所述自编码器获取所述某个症候群的关键词的自编码向量。The syndrome keyword encoding module is configured to obtain the autoencoding vector of the keyword of the certain syndrome through the autoencoder according to the multi-hot encoding of the keyword of the certain syndrome.
相似度计算模块,用于计算所述症状关键词的自编码向量及所述某个症候群的关键词的The similarity calculation module is used to calculate the self-encoding vector of the symptom keyword and the similarity of the keyword of a certain syndrome.
Figure PCTCN2021097415-appb-000004
Figure PCTCN2021097415-appb-000004
自编码向量的相似度。Similarity of autoencoding vectors.
示例性的,通过余弦相似度计算所述相似度,余弦相似度的计算公式为:Exemplarily, the similarity is calculated by cosine similarity, and the calculation formula of cosine similarity is:
其中,X为所述症状关键词的自编码向量、Y为所述某个症候群的关键词的自编码向量,n为所述自编码向量的维度。Wherein, X is the self-encoding vector of the symptom keyword, Y is the self-encoding vector of the keyword of the certain syndrome, and n is the dimension of the self-encoding vector.
病例判断模块,用于根据所述相似度判断所述就诊人是否属于所述某个症候群的病例。A case judgment module, configured to judge whether the patient belongs to a case of the certain syndrome group according to the similarity.
示例性的,预设一个相似度阈值,例如,将所述相似度阈值设置为0.8。若所述相似度超过所述相似度阈值,则对应的就诊人是所述某个症候群的病例;否则,对应的就诊人不是所述某个症候群的病例。Exemplarily, a similarity threshold is preset, for example, the similarity threshold is set to 0.8. If the similarity exceeds the similarity threshold, the corresponding patient is a case of the certain syndrome; otherwise, the corresponding patient is not a case of the certain syndrome.
示例性的,本装置还包括自编码器训练模块,所述自编码器的训练模块包括训练数据获取模块、自编码器初始模块、压缩模块、解压模块及自编码器训练模块。Exemplarily, the apparatus further includes an autoencoder training module, and the autoencoder training module includes a training data acquisition module, an autoencoder initial module, a compression module, a decompression module, and an autoencoder training module.
训练数据获取模块,用于获取训练数据,所述训练数据包括多热编码;a training data acquisition module for acquiring training data, the training data including multi-hot encoding;
自编码器初始模块,用于获取待训练的自编码器,所述自编码器包括第一网络和第二网络;The autoencoder initial module is used to obtain the autoencoder to be trained, and the autoencoder includes a first network and a second network;
压缩模块,用于将所述训练数据中的多热编码输入所述第一网络,得到所述第一网络输出的自编码向量,所述自编码向量的维度低于所述多热编码的维度;a compression module, configured to input the multi-hot encoding in the training data into the first network to obtain an auto-encoding vector output by the first network, the dimension of the self-encoding vector is lower than the dimension of the multi-hot encoding ;
解压模块,用于将所述第一网络输出的自编码向量输入所述第二网络,得到所述第二网络输出的向量,所述第二网络输出的向量的维度等于所述多热编码的维度;A decompression module, configured to input the self-encoding vector output by the first network into the second network, to obtain a vector output by the second network, the dimension of the vector output by the second network is equal to the multi-hot encoding dimension;
自编码器训练模块,用于根据所述第二网络输出的向量和所述多热编码,对所述第一网络和所述第二网络进行训练。An autoencoder training module, configured to train the first network and the second network according to the vector output by the second network and the multi-hot encoding.
目标症候群确定模块130,用于若确定就诊人属于其中一个症候群的病例,确定所述症候群为目标症候群。The target syndrome determination module 130 is configured to determine the syndrome as the target syndrome if it is determined that the patient belongs to one of the syndrome cases.
病例统计模块140,用于获取预设的时间单位内所述目标症候群的病例数;A case statistics module 140, configured to acquire the number of cases of the target syndrome within a preset time unit;
指标计算模块150,用于根据预设的时间单位内所述目标症候群的病例数,计算目标症候群的若干统计指标;The index calculation module 150 is configured to calculate several statistical indexes of the target syndrome according to the number of cases of the target syndrome in a preset time unit;
示例性的,根据当前的预设时间单位内所述目标症候群的病例数,以及所述目标症候群的历史病例数,计算所述目标症候群的若干统计指标,所述统计指标包括环比增长率、同比增长率、历史百分位中的一种或多种。Exemplarily, according to the number of cases of the target syndrome in the current preset time unit, and the number of historical cases of the target syndrome, calculate several statistical indicators of the target syndrome, and the statistical indicators include a chain growth rate, a year-on-year growth rate, One or more of growth rate, historical percentile.
风险预测模块160,用于将所述目标症候群的若干统计指标输入风险预测模型,以获取所述目标症候群的风险等级。The risk prediction module 160 is configured to input several statistical indicators of the target syndrome into a risk prediction model to obtain the risk level of the target syndrome.
示例性的,将所述统计指标以及风险因子输入风险预测模型,以获取所述目标症候群的风险等级,所述风险因子包括天气数据和/或环境数据。例如,所述天气数据包括当时的温度、湿度及是否为晴天阴天或雨天,所述环境数据包括当时的空气质量、水质等级等。通过引入所述风险因子,使得所述风险等级模型可以从更丰富的数据维度对风险等级进行预测,使得风险等级的预测更加准确。Exemplarily, the statistical indicators and risk factors are input into a risk prediction model to obtain the risk level of the target syndrome, and the risk factors include weather data and/or environmental data. For example, the weather data includes the current temperature, humidity, and whether it is a sunny, cloudy or rainy day, and the environmental data includes the current air quality, water quality level, and the like. By introducing the risk factor, the risk level model can predict the risk level from more abundant data dimensions, so that the prediction of the risk level is more accurate.
示例性的,通过解释模型(SHAP,SHapley Additive exPlanations)对所述风险预测 模型进行解释。通过解释模型,可以对所述风险模型的输入中的所述统计指标或所述风险因子在得到风险预测结果中所起的影响进行评估,从而可以知道是哪些数据对风险等级产生了重要影响,后续可对产生重要影响的数据进行重点关注,提高了所述风险预测模型的可用性。Exemplarily, the risk prediction model is explained by an explanation model (SHAP, SHapley Additive exPlanations). By interpreting the model, the impact of the statistical indicators in the input of the risk model or the risk factors in obtaining the risk prediction result can be evaluated, so as to know which data has an important impact on the risk level, Subsequent focus on data with significant impact improves the usability of the risk prediction model.
预警模块170,用于根据所述风险等级,发布所述目标症候群的预警信息。The early warning module 170 is configured to issue early warning information of the target syndrome according to the risk level.
需要说明的是,所属领域的技术人员可以清楚地了解到,为了描述的方便和简洁,上述描述的装置和各模块、单元的具体工作过程,可以参考前述方法实施例中的对应过程,在此不再赘述。It should be noted that those skilled in the art can clearly understand that, for the convenience and brevity of description, for the specific working process of the above-described device and each module and unit, reference may be made to the corresponding process in the foregoing method embodiments. No longer.
本申请的方法、装置可用于众多通用或专用的计算系统环境或配置中。例如:个人计算机、服务器计算机、手持设备或便携式设备、平板型设备、多处理器系统、基于微处理器的系统、机顶盒、可编程的消费电子设备、网络PC、小型计算机、大型计算机、包括以上任何系统或设备的分布式计算环境等等。The methods and apparatus of the present application may be used in numerous general purpose or special purpose computing system environments or configurations. For example: personal computers, server computers, handheld or portable devices, tablet devices, multiprocessor systems, microprocessor-based systems, set-top boxes, programmable consumer electronics, network PCs, minicomputers, mainframe computers, including the above Distributed computing environment of any system or device, etc.
示例性地,上述的方法、装置可以实现为一种计算机程序的形式,该计算机程序可以在如图3所示的计算机设备上运行。Exemplarily, the above-mentioned method and apparatus can be implemented in the form of a computer program, and the computer program can be executed on a computer device as shown in FIG. 3 .
请参阅图3,图3是本申请实施例提供的一种计算机设备的示意图。该计算机设备可以是服务器或终端。Please refer to FIG. 3 , which is a schematic diagram of a computer device provided by an embodiment of the present application. The computer device can be a server or a terminal.
如图3所示,该计算机设备包括通过系统总线连接的处理器、存储器和网络接口,其中,存储器可以包括非易失性存储介质和内存储器。As shown in FIG. 3, the computer device includes a processor, a memory, and a network interface connected through a system bus, wherein the memory may include a non-volatile storage medium and an internal memory.
非易失性存储介质可存储操作系统和计算机程序。该计算机程序包括程序指令,该程序指令被执行时,可使得处理器执行任意一种症候群监测预警方法。The nonvolatile storage medium can store operating systems and computer programs. The computer program includes program instructions, which, when executed, can cause the processor to execute any one of the syndrome monitoring and early warning methods.
处理器用于提供计算和控制能力,支撑整个计算机设备的运行。The processor is used to provide computing and control capabilities to support the operation of the entire computer equipment.
内存储器为非易失性存储介质中的计算机程序的运行提供环境,该计算机程序被处理器执行时,可使得处理器执行任意一种症候群监测预警方法。The internal memory provides an environment for running the computer program in the non-volatile storage medium. When the computer program is executed by the processor, the processor can execute any one of the syndrome monitoring and early warning methods.
该网络接口用于进行网络通信,如发送分配的任务等。本领域技术人员可以理解,该计算机设备的结构,仅仅是与本申请方案相关的部分结构的框图,并不构成对本申请方案所应用于其上的计算机设备的限定,具体的计算机设备可以包括比图中所示更多或更少的部件,或者组合某些部件,或者具有不同的部件布置。The network interface is used for network communication, such as sending assigned tasks. Those skilled in the art can understand that the structure of the computer device is only a block diagram of a part of the structure related to the solution of the present application, and does not constitute a limitation on the computer device to which the solution of the present application is applied. More or fewer components are shown in the figures, either in combination or with different arrangements of components.
应当理解的是,处理器可以是中央处理单元(Central Processing Unit,CPU),该处理器还可以是其他通用处理器、数字信号处理器(Digital Signal Processor,DSP)、专用集成电路(Application Specific Integrated Circuit,ASIC)、现场可编程门阵列(Field-Programmable Gate Array,FPGA)或者其他可编程逻辑器件、分立门或者晶体管逻辑器件、分立硬件组件等。其中,通用处理器可以是微处理器或者该处理器也可以是任何常规的处理器等。It should be understood that the processor may be a central processing unit (Central Processing Unit, CPU), and the processor may also be other general-purpose processors, digital signal processors (Digital Signal Processors, DSP), application specific integrated circuits (Application Specific Integrated circuits) Circuit, ASIC), Field-Programmable Gate Array (FPGA) or other programmable logic devices, discrete gate or transistor logic devices, discrete hardware components, etc. Wherein, the general-purpose processor can be a microprocessor or the processor can also be any conventional processor or the like.
其中,在一些实施方式中,所述处理器用于运行存储在存储器中的计算机程序,以实现如下步骤:从就诊人的电子病历中提取症状关键词;根据所述症状关键词及预设的若干症候群的关键词的匹配情况,判断就诊人是否属于其中一个症候群的病例;若确定就诊人属于其中一个症候群的病例,确定所述症候群为目标症候群;获取预设的时间单位内所述目标症候群的病例数;根据当前的预设时间单位内所述目标症候群的病例数,计算目标症候群的若干 统计指标;将所述目标症候群的若干统计指标输入风险预测模型,以获取所述目标症候群的风险等级;根据所述目标症候群的风险等级,发布所述目标症候群的预警信息。Wherein, in some embodiments, the processor is configured to run a computer program stored in the memory, so as to realize the following steps: extracting symptom keywords from the electronic medical record of the patient; If it is determined that the patient belongs to a case of one of the syndrome groups according to the matching of the keywords of the syndrome, it is determined whether the patient belongs to a case of one of the syndrome groups, and the syndrome is determined as the target syndrome; number of cases; according to the number of cases of the target syndrome in the current preset time unit, calculate some statistical indicators of the target syndrome; input some statistical indicators of the target syndrome into a risk prediction model to obtain the risk level of the target syndrome ; According to the risk level of the target syndrome, the early warning information of the target syndrome is released.
示例性的,处理器用于实现从就诊人的电子病历中提取症状关键词,实现:通过命名实体识别,从所述病历的文本中提取就诊人的症状初始关键词;基于预设的标准化词汇表,将所述症状初始关键词中的别名词汇替换成标准词汇,以获取所述症状关键词。Exemplarily, the processor is configured to extract symptom keywords from an electronic medical record of a patient, to achieve: extracting initial keywords of symptoms of a patient from the text of the medical record through named entity recognition; based on a preset standardized vocabulary , and replace the alias words in the symptom initial keywords with standard words to obtain the symptom keywords.
示例性的,处理器用于实现根据所述症状关键词及预设的若干症候群的关键词的匹配情况,判断就诊人是否属于目标症候群的病例时,实现:将所述症状关键词与各个症候群的每个关键词进行比较,若所有所述症状关键词均在其中一个症候群中均有对应的关键词,则确定所述就诊人属于所述其中一个症候群的病例;或/与实现:根据所述症状关键词及某个症候群的关键词,构建所述症状关键词的多热编码及所述某个症候群的关键词的多热编码;将所述症状关键词的多热编码输入自编码器,获取所述自编码器输出的所述症状关键词的自编码向量;根据所述某个症候群的关键词的多热编码,通过所述自编码器获取所述某个症候群的关键词的自编码向量;计算所述症状关键词的自编码向量及所述某个症候群的关键词的自编码向量的相似度;根据所述相似度判断所述就诊人是否属于所述某个症候群的病例。Exemplarily, the processor is configured to realize that when judging whether the patient belongs to a case of the target syndrome according to the matching situation of the symptom keyword and the keywords of a number of preset syndromes, realize: compare the symptom keyword with each syndrome Each keyword is compared, and if all the symptom keywords have corresponding keywords in one of the syndromes, it is determined that the patient belongs to the case of the one of the syndromes; or/and realize: according to the Symptom keywords and keywords of a certain syndrome, construct the multi-hot encoding of the symptom keywords and the multi-hot encoding of the keywords of a certain syndrome; input the multi-hot encoding of the symptom keywords into the self-encoder, Obtain the self-encoding vector of the symptom keyword output by the autoencoder; according to the multi-hot encoding of the keyword of the certain syndrome, obtain the self-encoding of the keyword of the certain syndrome through the autoencoder vector; calculate the similarity between the self-encoding vector of the symptom keyword and the self-encoding vector of the keyword of a certain syndrome; determine whether the patient belongs to a case of the certain syndrome according to the similarity.
示例性的,处理器用于获取所述自编码器时,实现:获取训练数据,所述训练数据包括多热编码;获取待训练的自编码器,所述自编码器包括第一网络和第二网络;将所述训练数据中的多热编码输入所述第一网络,得到所述第一网络输出的自编码向量,所述自编码向量的维度低于所述多热编码的维度;将所述第一网络输出的自编码向量输入所述第二网络,得到所述第二网络输出的向量,所述第二网络输出的向量的维度等于所述多热编码的维度;根据所述第二网络输出的向量和所述多热编码,对所述第一网络和所述第二网络进行训练。Exemplarily, when the processor is configured to acquire the autoencoder, it is implemented: acquiring training data, where the training data includes multi-hot encoding; acquiring an autoencoder to be trained, where the autoencoder includes a first network and a second network; input the multi-hot encoding in the training data into the first network, and obtain the self-encoding vector output by the first network, the dimension of the self-encoding vector is lower than the dimension of the multi-hot encoding; The self-encoding vector output by the first network is input into the second network to obtain a vector output by the second network, and the dimension of the vector output by the second network is equal to the dimension of the multi-hot encoding; according to the second network A vector of network outputs and the multi-hot encoding to train the first network and the second network.
示例性的,处理器用于当前的预设时间单位内所述目标症候群的病例数,计算目标症候群的若干统计指标时,实现:根据当前的预设时间单位内所述目标症候群的病例数,以及所述目标症候群的历史病例数,计算所述目标症候群的若干历史统计指标,所述统计指标包括环比增长率、同比增长率、历史百分位中的一种或多种。Exemplarily, the processor is used for the number of cases of the target syndrome in the current preset time unit, and when calculating several statistical indicators of the target syndrome, it is realized: according to the number of cases of the target syndrome in the current preset time unit, and The number of historical cases of the target syndrome is calculated, and several historical statistical indicators of the target syndrome are calculated, and the statistical indicators include one or more of a month-on-month growth rate, a year-on-year growth rate, and a historical percentile.
示例性的,处理器用于将所述目标症候群的若干统计指标输入风险预测模型,以获取预测的风险等级时,实现:将所述统计指标以及风险因子输入风险预测模型,以获取所述目标症候群的风险等级,所述风险因子包括天气数据和/或环境数据。例如,所述天气数据包括当时的温度、湿度及是否为晴天阴天或雨天,所述环境数据包括当时的空气质量、水质等级等。Exemplarily, when the processor is configured to input several statistical indicators of the target syndrome into a risk prediction model to obtain a predicted risk level, it is achieved: inputting the statistical indicators and risk factors into a risk prediction model to obtain the target syndrome. , the risk factors include weather data and/or environmental data. For example, the weather data includes the current temperature, humidity, and whether it is a sunny, cloudy or rainy day, and the environmental data includes the current air quality, water quality level, and the like.
一种计算机可读存储介质,所述计算机可读存储介质存储有计算机程序,所述计算机程序中包括程序指令,所述处理器执行所述程序指令,实现本申请实施例提供的任一项症候群监测预警方法。A computer-readable storage medium, where the computer-readable storage medium stores a computer program, the computer program includes program instructions, and the processor executes the program instructions to implement any one of the syndromes provided in the embodiments of the present application Monitoring and early warning methods.
其中,所述计算机可读存储介质可以是前述实施例所述的计算机设备的内部存储单元,例如所述计算机设备的硬盘或内存。所述计算机可读存储介质也可以是所述计算机设备的外部存储设备,例如所述计算机设备上配备的插接式硬盘,智能存储卡(Smart Media Card,SMC),安全数字(Secure Digital,SD)卡,闪存卡(Flash Card)等。所述计算机可读存储介质可以是非易失性,也可以是易失性。The computer-readable storage medium may be an internal storage unit of the computer device described in the foregoing embodiments, such as a hard disk or a memory of the computer device. The computer-readable storage medium may also be an external storage device of the computer device, such as a plug-in hard disk equipped on the computer device, a smart memory card (Smart Media Card, SMC), a secure digital (Secure Digital, SD) ) card, Flash Card, etc. The computer-readable storage medium may be non-volatile or volatile.
以上所述,仅为本申请的具体实施方式,但本申请的保护范围并不局限于此,任何熟悉本技术领域的技术人员在本申请揭露的技术范围内,可轻易想到各种等效的修改或替换,这些修改或替换都应涵盖在本申请的保护范围之内。因此,本申请的保护范围应以权利要求的保护范围为准。The above are only specific embodiments of the present application, but the protection scope of the present application is not limited thereto. Any person skilled in the art can easily think of various equivalents within the technical scope disclosed in the present application. Modifications or substitutions shall be covered by the protection scope of this application. Therefore, the protection scope of the present application shall be subject to the protection scope of the claims.

Claims (20)

  1. 一种症候群监测预警方法,其中,所述方法包括:A syndrome monitoring and early warning method, wherein the method comprises:
    从就诊人的电子病历中提取症状关键词;Extract symptom keywords from the patient's electronic medical record;
    根据所述症状关键词及预设的若干症候群的关键词的匹配情况,判断就诊人是否属于其中一个症候群的病例;Judging whether the patient belongs to a case of one of the syndromes according to the matching situation of the symptom keywords and the keywords of several preset syndromes;
    若确定就诊人属于其中一个症候群的病例,确定所述症候群为目标症候群;If it is determined that the patient belongs to a case of one of the syndromes, determine the syndrome as the target syndrome;
    获取预设的时间单位内所述目标症候群的病例数;Obtain the number of cases of the target syndrome within a preset time unit;
    根据当前的预设时间单位内所述目标症候群的病例数,计算目标症候群的若干统计指标;Calculate several statistical indicators of the target syndrome according to the number of cases of the target syndrome within the current preset time unit;
    将所述目标症候群的若干统计指标输入风险预测模型,以获取所述目标症候群的风险等级;Inputting several statistical indicators of the target syndrome into a risk prediction model to obtain the risk level of the target syndrome;
    根据所述目标症候群的风险等级,发布所述目标症候群的预警信息。According to the risk level of the target syndrome, the early warning information of the target syndrome is released.
  2. 根据权利要求1所述的症候群监测预警方法,其中,所述根据所述症状关键词及预设的若干症候群的关键词的匹配情况,判断就诊人是否属于其中一个症候群的病例,包括:The syndrome monitoring and early warning method according to claim 1, wherein, according to the matching situation of the symptom keywords and the keywords of several preset syndromes, judging whether the patient belongs to a case of one of the syndromes includes:
    将所述症状关键词与各个症候群的每个关键词进行比较,若所有所述症状关键词在其中一个症候群中均有对应的关键词,则确定所述就诊人属于所述其中一个症候群的病例。Compare the symptom keywords with each keyword of each syndrome, and if all the symptom keywords have corresponding keywords in one of the syndromes, then determine that the patient belongs to the case of the one of the syndromes .
  3. 根据权利要求1所述的症候群监测预警方法,其中,根据所述症状关键词及预设的若干症候群的关键词的匹配情况,判断就诊人是否属于其中一个症候群的病例,包括:The syndrome monitoring and early warning method according to claim 1, wherein, according to the matching situation of the symptom keywords and the keywords of several preset syndromes, it is judged whether the patient belongs to a case of one of the syndromes, including:
    根据所述症状关键词及某个症候群的关键词,构建所述症状关键词的多热编码及所述某个症候群的关键词的多热编码;According to the symptom keyword and the keyword of a certain syndrome, construct the multi-hot encoding of the symptom keyword and the multi-hot encoding of the keyword of the certain syndrome;
    将所述症状关键词的多热编码输入自编码器,获取所述自编码器输出的所述症状关键词的自编码向量;Input the multi-hot encoding of the symptom keyword into an autoencoder, and obtain the autoencoder vector of the symptom keyword output by the autoencoder;
    根据所述某个症候群的关键词的多热编码,通过所述自编码器获取所述某个症候群的关键词的自编码向量;According to the multi-hot encoding of the keyword of the certain syndrome, the autoencoder vector of the keyword of the certain syndrome is obtained by the autoencoder;
    计算所述症状关键词的自编码向量及所述某个症候群的关键词的自编码向量的相似度;Calculate the similarity between the self-encoding vector of the symptom keyword and the self-encoding vector of the keyword of a certain syndrome;
    根据所述相似度判断所述就诊人是否属于所述某个症候群的病例。According to the similarity, it is determined whether the patient belongs to a case of the certain syndrome.
  4. 根据权利要求3所述的症候群监测预警方法,其中:所述方法还包括:The syndrome monitoring and early warning method according to claim 3, wherein: the method further comprises:
    获取训练数据,所述训练数据包括多热编码;obtaining training data, the training data includes multi-hot encoding;
    获取待训练的自编码器,所述自编码器包括第一网络和第二网络;Obtain an autoencoder to be trained, the autoencoder includes a first network and a second network;
    将所述训练数据中的多热编码输入所述第一网络,得到所述第一网络输出的自编码向量,所述自编码向量的维度低于所述多热编码的维度;Inputting the multi-hot encoding in the training data into the first network to obtain an auto-encoding vector output by the first network, where the dimension of the self-encoding vector is lower than the dimension of the multi-hot encoding;
    将所述第一网络输出的自编码向量输入所述第二网络,得到所述第二网络输出的向量,所述第二网络输出的向量的维度等于所述多热编码的维度;Inputting the self-encoding vector output by the first network into the second network, to obtain a vector output by the second network, the dimension of the vector output by the second network is equal to the dimension of the multi-hot encoding;
    根据所述第二网络输出的向量和所述多热编码,对所述第一网络和所述第二网络进行训练。The first network and the second network are trained according to the vector output by the second network and the multi-hot encoding.
  5. 根据权利要求1-4中任一项所述的症候群监测预警方法,其中:所述根据当前的预设时间单位内所述目标症候群的病例数,计算目标症候群的若干统计指标,包括:The syndrome monitoring and early warning method according to any one of claims 1-4, wherein: according to the number of cases of the target syndrome in the current preset time unit, calculate some statistical indicators of the target syndrome, including:
    根据当前的预设时间单位内所述目标症候群的病例数,以及所述目标症候群的历史病例数,计算所述目标症候群的若干统计指标,所述统计指标包括环比增长率、同比增长率、历史百分位中的一种或多种。According to the number of cases of the target syndrome in the current preset time unit, and the number of historical cases of the target syndrome, calculate several statistical indicators of the target syndrome, and the statistical indicators include the chain growth rate, year-on-year growth rate, historical One or more of the percentiles.
  6. 根据权利要求1-4中任一项所述的症候群监测预警方法,其中:所述将所述统计指标输入风险预测模型,以获取所述目标症候群的风险等级,包括:The syndrome monitoring and early warning method according to any one of claims 1-4, wherein: inputting the statistical index into a risk prediction model to obtain the risk level of the target syndrome, comprising:
    将所述统计指标以及风险因子输入风险预测模型,以获取所述目标症候群的风险等级,所述风险因子包括天气数据和/或环境数据。The statistical indicators and risk factors are input into a risk prediction model to obtain the risk level of the target syndrome, and the risk factors include weather data and/or environmental data.
  7. 根据权利要求1所述的症候群监测预警方法,其中,所述从就诊人的电子病历中提取症状关键词,包括:The syndrome monitoring and early warning method according to claim 1, wherein the extracting symptom keywords from the electronic medical record of the patient comprises:
    通过命名实体识别,从所述病历的文本中提取就诊人的症状初始关键词;Through named entity recognition, the initial keywords of the symptoms of the patient are extracted from the text of the medical record;
    根据预设的标准化词汇表,将所述症状初始关键词中的别名词汇替换成标准词汇,以获取所述症状关键词。According to a preset standardized vocabulary, the alias vocabulary in the symptom initial keyword is replaced with a standard vocabulary to obtain the symptom keyword.
  8. 一种症候群监测预警装置,其中,所述装置包括:A syndrome monitoring and early warning device, wherein the device comprises:
    症状提取模块,用于从就诊人的电子病历中提取症状关键词;The symptom extraction module is used to extract symptom keywords from the electronic medical record of the patient;
    症状匹配模块,用于根据所述症状关键词及预设的若干症候群的关键词的匹配情况,判断就诊人是否属于其中一个目标症候群的病例;The symptom matching module is used to judge whether the patient belongs to a case of one of the target syndromes according to the matching situation of the symptom keywords and the keywords of several preset syndromes;
    目标症候群确定模块,用于若确定就诊人属于其中一个症候群的病例,确定所述症候群为目标症候群;a target syndrome determination module, configured to determine the syndrome as the target syndrome if it is determined that the patient belongs to one of the syndrome cases;
    病例统计模块,用于获取预设的时间单位内所述目标症候群的病例数;A case statistics module, used to obtain the number of cases of the target syndrome within a preset time unit;
    指标计算模块,用于根据预设的时间单位内所述目标症候群的病例数,计算目标症候群的若干统计指标;an index calculation module, used for calculating several statistical indexes of the target syndrome according to the number of cases of the target syndrome in a preset time unit;
    风险预测模块,用于将所述目标症候群的若干统计指标输入风险预测模型,以获取所述目标症候群的风险等级;a risk prediction module for inputting several statistical indicators of the target syndrome into a risk prediction model to obtain the risk level of the target syndrome;
    预警模块,用于根据所述目标症候群的风险等级,发布所述目标症候群的预警信息。An early warning module, configured to issue early warning information of the target syndrome according to the risk level of the target syndrome.
  9. 一种计算机设备,其中,所述计算机设备包括存储器和处理器;A computer device, wherein the computer device includes a memory and a processor;
    所述存储器,用于存储计算机程序;the memory for storing computer programs;
    所述处理器,用于执行所述的计算机程序并在执行所述的计算机程序时实现如下步骤:The processor is configured to execute the computer program and implement the following steps when executing the computer program:
    从就诊人的电子病历中提取症状关键词;Extract symptom keywords from the patient's electronic medical record;
    根据所述症状关键词及预设的若干症候群的关键词的匹配情况,判断就诊人是否属于其中一个症候群的病例;Judging whether the patient belongs to a case of one of the syndromes according to the matching situation of the symptom keywords and the keywords of several preset syndromes;
    若确定就诊人属于其中一个症候群的病例,确定所述症候群为目标症候群;If it is determined that the patient belongs to a case of one of the syndromes, determine the syndrome as the target syndrome;
    获取预设的时间单位内所述目标症候群的病例数;Obtain the number of cases of the target syndrome within a preset time unit;
    根据当前的预设时间单位内所述目标症候群的病例数,计算目标症候群的若干统计指标;Calculate several statistical indicators of the target syndrome according to the number of cases of the target syndrome within the current preset time unit;
    将所述目标症候群的若干统计指标输入风险预测模型,以获取所述目标症候群的风险等 级;Inputting several statistical indicators of the target syndrome into a risk prediction model to obtain the risk level of the target syndrome;
    根据所述目标症候群的风险等级,发布所述目标症候群的预警信息。According to the risk level of the target syndrome, the early warning information of the target syndrome is released.
  10. 根据权利要求9所述的计算机设备,其中,所述处理器在实现所述根据所述症状关键词及预设的若干症候群的关键词的匹配情况,判断就诊人是否属于其中一个症候群的病例时,用于实现:The computer device according to claim 9, wherein, when the processor determines whether the patient belongs to a case of one of the syndromes according to the matching situation of the symptom keyword and the keywords of several preset syndromes , which is used to implement:
    将所述症状关键词与各个症候群的每个关键词进行比较,若所有所述症状关键词在其中一个症候群中均有对应的关键词,则确定所述就诊人属于所述其中一个症候群的病例。Compare the symptom keywords with each keyword of each syndrome, and if all the symptom keywords have corresponding keywords in one of the syndromes, then determine that the patient belongs to the case of the one of the syndromes .
  11. 根据权利要求9所述的计算机设备,其中,所述处理器在实现所述根据所述症状关键词及预设的若干症候群的关键词的匹配情况,判断就诊人是否属于其中一个症候群的病例时,用于实现:The computer device according to claim 9, wherein, when the processor determines whether the patient belongs to a case of one of the syndromes according to the matching situation of the symptom keyword and the keywords of several preset syndromes , which is used to implement:
    根据所述症状关键词及某个症候群的关键词,构建所述症状关键词的多热编码及所述某个症候群的关键词的多热编码;According to the symptom keyword and the keyword of a certain syndrome, construct the multi-hot encoding of the symptom keyword and the multi-hot encoding of the keyword of the certain syndrome;
    将所述症状关键词的多热编码输入自编码器,获取所述自编码器输出的所述症状关键词的自编码向量;Input the multi-hot encoding of the symptom keyword into an autoencoder, and obtain the autoencoder vector of the symptom keyword output by the autoencoder;
    根据所述某个症候群的关键词的多热编码,通过所述自编码器获取所述某个症候群的关键词的自编码向量;According to the multi-hot encoding of the keyword of the certain syndrome, the autoencoder vector of the keyword of the certain syndrome is obtained by the autoencoder;
    计算所述症状关键词的自编码向量及所述某个症候群的关键词的自编码向量的相似度;Calculate the similarity between the self-encoding vector of the symptom keyword and the self-encoding vector of the keyword of a certain syndrome;
    根据所述相似度判断所述就诊人是否属于所述某个症候群的病例。According to the similarity, it is determined whether the patient belongs to a case of the certain syndrome.
  12. 根据权利要求11所述的计算机设备,其中,所述处理器还用于实现:The computer device of claim 11, wherein the processor is further configured to implement:
    获取训练数据,所述训练数据包括多热编码;obtaining training data, the training data includes multi-hot encoding;
    获取待训练的自编码器,所述自编码器包括第一网络和第二网络;Obtain an autoencoder to be trained, the autoencoder includes a first network and a second network;
    将所述训练数据中的多热编码输入所述第一网络,得到所述第一网络输出的自编码向量,所述自编码向量的维度低于所述多热编码的维度;Inputting the multi-hot encoding in the training data into the first network to obtain an auto-encoding vector output by the first network, where the dimension of the self-encoding vector is lower than the dimension of the multi-hot encoding;
    将所述第一网络输出的自编码向量输入所述第二网络,得到所述第二网络输出的向量,所述第二网络输出的向量的维度等于所述多热编码的维度;Inputting the self-encoding vector output by the first network into the second network, to obtain a vector output by the second network, the dimension of the vector output by the second network is equal to the dimension of the multi-hot encoding;
    根据所述第二网络输出的向量和所述多热编码,对所述第一网络和所述第二网络进行训练。The first network and the second network are trained according to the vector output by the second network and the multi-hot encoding.
  13. 根据权利要求9-12中任一项所述的计算机设备,其中,所述处理器在实现所述根据当前的预设时间单位内所述目标症候群的病例数,计算目标症候群的若干统计指标时,用于实现:The computer device according to any one of claims 9-12, wherein, when the processor calculates several statistical indicators of the target syndrome according to the number of cases of the target syndrome in the current preset time unit , which is used to implement:
    根据当前的预设时间单位内所述目标症候群的病例数,以及所述目标症候群的历史病例数,计算所述目标症候群的若干统计指标,所述统计指标包括环比增长率、同比增长率、历史百分位中的一种或多种。According to the number of cases of the target syndrome in the current preset time unit, and the number of historical cases of the target syndrome, calculate several statistical indicators of the target syndrome, and the statistical indicators include the chain growth rate, year-on-year growth rate, historical One or more of the percentiles.
  14. 根据权利要求9-12中任一项所述的计算机设备,其中,所述处理器在实现所述将所述统计指标输入风险预测模型,以获取所述目标症候群的风险等级时,用于实现:The computer device according to any one of claims 9-12, wherein, when the processor implements the inputting the statistical index into a risk prediction model to obtain the risk level of the target syndrome, the processor is configured to implement :
    将所述统计指标以及风险因子输入风险预测模型,以获取所述目标症候群的风险等级, 所述风险因子包括天气数据和/或环境数据。The statistical indicators and risk factors are input into a risk prediction model to obtain the risk level of the target syndrome, and the risk factors include weather data and/or environmental data.
  15. 一种计算机可读存储介质,其中,所述计算机可读存储介质存储有计算机程序,所述计算机程序被处理器执行时使所述处理器实现如下步骤:A computer-readable storage medium, wherein the computer-readable storage medium stores a computer program, and when the computer program is executed by a processor, the processor implements the following steps:
    从就诊人的电子病历中提取症状关键词;Extract symptom keywords from the patient's electronic medical record;
    根据所述症状关键词及预设的若干症候群的关键词的匹配情况,判断就诊人是否属于其中一个症候群的病例;Judging whether the patient belongs to a case of one of the syndromes according to the matching situation of the symptom keywords and the keywords of several preset syndromes;
    若确定就诊人属于其中一个症候群的病例,确定所述症候群为目标症候群;If it is determined that the patient belongs to a case of one of the syndromes, determine the syndrome as the target syndrome;
    获取预设的时间单位内所述目标症候群的病例数;Obtain the number of cases of the target syndrome within a preset time unit;
    根据当前的预设时间单位内所述目标症候群的病例数,计算目标症候群的若干统计指标;Calculate several statistical indicators of the target syndrome according to the number of cases of the target syndrome within the current preset time unit;
    将所述目标症候群的若干统计指标输入风险预测模型,以获取所述目标症候群的风险等级;Inputting several statistical indicators of the target syndrome into a risk prediction model to obtain the risk level of the target syndrome;
    根据所述目标症候群的风险等级,发布所述目标症候群的预警信息。According to the risk level of the target syndrome, the early warning information of the target syndrome is released.
  16. 根据权利要求15所述的计算机可读存储介质,其中,所述处理器在实现所述根据所述症状关键词及预设的若干症候群的关键词的匹配情况,判断就诊人是否属于其中一个症候群的病例时,用于实现:The computer-readable storage medium according to claim 15, wherein the processor determines whether the patient belongs to one of the syndromes according to the matching situation of the symptom keyword and the keywords of a plurality of preset syndromes. The case is used to achieve:
    将所述症状关键词与各个症候群的每个关键词进行比较,若所有所述症状关键词在其中一个症候群中均有对应的关键词,则确定所述就诊人属于所述其中一个症候群的病例。Compare the symptom keywords with each keyword of each syndrome, and if all the symptom keywords have corresponding keywords in one of the syndromes, then determine that the patient belongs to the case of the one of the syndromes .
  17. 根据权利要求15所述的计算机可读存储介质,其中,所述处理器在实现所述根据所述症状关键词及预设的若干症候群的关键词的匹配情况,判断就诊人是否属于其中一个症候群的病例时,用于实现:The computer-readable storage medium according to claim 15, wherein the processor determines whether the patient belongs to one of the syndromes according to the matching situation of the symptom keyword and the keywords of a plurality of preset syndromes. The case is used to achieve:
    根据所述症状关键词及某个症候群的关键词,构建所述症状关键词的多热编码及所述某个症候群的关键词的多热编码;According to the symptom keyword and the keyword of a certain syndrome, construct the multi-hot encoding of the symptom keyword and the multi-hot encoding of the keyword of the certain syndrome;
    将所述症状关键词的多热编码输入自编码器,获取所述自编码器输出的所述症状关键词的自编码向量;Input the multi-hot encoding of the symptom keyword into an autoencoder, and obtain the autoencoder vector of the symptom keyword output by the autoencoder;
    根据所述某个症候群的关键词的多热编码,通过所述自编码器获取所述某个症候群的关键词的自编码向量;According to the multi-hot encoding of the keyword of the certain syndrome, the autoencoder vector of the keyword of the certain syndrome is obtained by the autoencoder;
    计算所述症状关键词的自编码向量及所述某个症候群的关键词的自编码向量的相似度;Calculate the similarity between the self-encoding vector of the symptom keyword and the self-encoding vector of the keyword of a certain syndrome;
    根据所述相似度判断所述就诊人是否属于所述某个症候群的病例。According to the similarity, it is determined whether the patient belongs to a case of the certain syndrome.
  18. 根据权利要求17所述的计算机可读存储介质,其中,所述处理器还用于实现:The computer-readable storage medium of claim 17, wherein the processor is further configured to implement:
    获取训练数据,所述训练数据包括多热编码;obtaining training data, the training data includes multi-hot encoding;
    获取待训练的自编码器,所述自编码器包括第一网络和第二网络;Obtain an autoencoder to be trained, the autoencoder includes a first network and a second network;
    将所述训练数据中的多热编码输入所述第一网络,得到所述第一网络输出的自编码向量,所述自编码向量的维度低于所述多热编码的维度;Inputting the multi-hot encoding in the training data into the first network to obtain an auto-encoding vector output by the first network, where the dimension of the self-encoding vector is lower than the dimension of the multi-hot encoding;
    将所述第一网络输出的自编码向量输入所述第二网络,得到所述第二网络输出的向量,所述第二网络输出的向量的维度等于所述多热编码的维度;Inputting the self-encoding vector output by the first network into the second network, to obtain a vector output by the second network, the dimension of the vector output by the second network is equal to the dimension of the multi-hot encoding;
    根据所述第二网络输出的向量和所述多热编码,对所述第一网络和所述第二网络进行训 练。The first network and the second network are trained based on the vector output by the second network and the multi-hot encoding.
  19. 根据权利要求15-18任一项所述的计算机可读存储介质,其中,所述处理器在实现所述根据当前的预设时间单位内所述目标症候群的病例数,计算目标症候群的若干统计指标时,用于实现:The computer-readable storage medium according to any one of claims 15-18, wherein the processor calculates several statistics of the target syndrome according to the number of cases of the target syndrome within the current preset time unit. When the indicator is used, it is used to achieve:
    根据当前的预设时间单位内所述目标症候群的病例数,以及所述目标症候群的历史病例数,计算所述目标症候群的若干统计指标,所述统计指标包括环比增长率、同比增长率、历史百分位中的一种或多种。According to the number of cases of the target syndrome in the current preset time unit, and the number of historical cases of the target syndrome, calculate several statistical indicators of the target syndrome, and the statistical indicators include the chain growth rate, year-on-year growth rate, historical One or more of the percentiles.
  20. 根据权利要求15-18任一项所述的计算机可读存储介质,其中,所述处理器在实现所述将所述统计指标输入风险预测模型,以获取所述目标症候群的风险等级时,用于实现:The computer-readable storage medium according to any one of claims 15-18, wherein, when the processor implements the inputting the statistical index into a risk prediction model to obtain the risk level of the target syndrome, use To achieve:
    将所述统计指标以及风险因子输入风险预测模型,以获取所述目标症候群的风险等级,所述风险因子包括天气数据和/或环境数据。The statistical indicators and risk factors are input into a risk prediction model to obtain the risk level of the target syndrome, and the risk factors include weather data and/or environmental data.
PCT/CN2021/097415 2021-04-16 2021-05-31 Syndrome monitoring and early warning method and apparatus, computer device, and storage medium WO2022217713A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202110412510.X 2021-04-16
CN202110412510.XA CN112951443B (en) 2021-04-16 2021-04-16 Syndrome monitoring and early warning method, device, computer equipment and storage medium

Publications (1)

Publication Number Publication Date
WO2022217713A1 true WO2022217713A1 (en) 2022-10-20

Family

ID=76232767

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2021/097415 WO2022217713A1 (en) 2021-04-16 2021-05-31 Syndrome monitoring and early warning method and apparatus, computer device, and storage medium

Country Status (2)

Country Link
CN (1) CN112951443B (en)
WO (1) WO2022217713A1 (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116959692A (en) * 2023-09-18 2023-10-27 北方健康医疗大数据科技有限公司 Electronic medical record quality control method, system, terminal and storage medium

Families Citing this family (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113345598B (en) * 2021-07-21 2021-11-26 深圳市知酷信息技术有限公司 Regional epidemic monitoring and early warning system based on data analysis
CN113569058A (en) * 2021-08-05 2021-10-29 武汉美之修行信息科技有限公司 Information query method and device and computer readable storage medium
CN113724891A (en) * 2021-08-31 2021-11-30 平安国际智慧城市科技股份有限公司 Hospital epidemic situation monitoring method, device and related equipment
CN113871019A (en) * 2021-12-06 2021-12-31 江西易卫云信息技术有限公司 Disease public opinion monitoring method, system, storage medium and equipment
CN115148373A (en) * 2022-07-28 2022-10-04 深圳市腾云数据系统有限公司 Epidemic situation monitoring method and system based on symptom monitoring
CN115862897B (en) * 2023-02-21 2023-05-23 江西曼荼罗软件有限公司 Syndrome monitoring method and system based on clinical data

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20160283686A1 (en) * 2015-03-23 2016-09-29 International Business Machines Corporation Identifying And Ranking Individual-Level Risk Factors Using Personalized Predictive Models
CN111785380A (en) * 2020-07-01 2020-10-16 医渡云(北京)技术有限公司 Method, device, medium and equipment for predicting infection disease risk grade
CN112037930A (en) * 2020-11-02 2020-12-04 平安科技(深圳)有限公司 Infectious disease prediction equipment, method, device and storage medium
CN112216402A (en) * 2020-10-16 2021-01-12 平安国际智慧城市科技股份有限公司 Epidemic situation prediction method and device based on artificial intelligence, computer equipment and medium

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110659411B (en) * 2019-08-21 2022-03-11 桂林电子科技大学 Personalized recommendation method based on neural attention self-encoder
CN111767707B (en) * 2020-06-30 2023-10-31 平安科技(深圳)有限公司 Method, device, equipment and storage medium for detecting Leideogue cases

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20160283686A1 (en) * 2015-03-23 2016-09-29 International Business Machines Corporation Identifying And Ranking Individual-Level Risk Factors Using Personalized Predictive Models
CN111785380A (en) * 2020-07-01 2020-10-16 医渡云(北京)技术有限公司 Method, device, medium and equipment for predicting infection disease risk grade
CN112216402A (en) * 2020-10-16 2021-01-12 平安国际智慧城市科技股份有限公司 Epidemic situation prediction method and device based on artificial intelligence, computer equipment and medium
CN112037930A (en) * 2020-11-02 2020-12-04 平安科技(深圳)有限公司 Infectious disease prediction equipment, method, device and storage medium

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116959692A (en) * 2023-09-18 2023-10-27 北方健康医疗大数据科技有限公司 Electronic medical record quality control method, system, terminal and storage medium

Also Published As

Publication number Publication date
CN112951443A (en) 2021-06-11
CN112951443B (en) 2023-08-04

Similar Documents

Publication Publication Date Title
WO2022217713A1 (en) Syndrome monitoring and early warning method and apparatus, computer device, and storage medium
CN109036577B (en) Diabetes complication analysis method and device
CN111612041B (en) Abnormal user identification method and device, storage medium and electronic equipment
WO2019141144A1 (en) Method and apparatus for determining network failure
WO2020215671A1 (en) Method and device for smart analysis of data, and computer device and storage medium
Fang et al. Feature Selection Method Based on Class Discriminative Degree for Intelligent Medical Diagnosis.
CN113707297A (en) Medical data processing method, device, equipment and storage medium
CN112786205B (en) Data model-based syndrome early warning method, device, medium and equipment
WO2019200786A1 (en) Method for forecasting public sentiment data, device, terminal, and storage medium
US20220044809A1 (en) Systems and methods for using deep learning to generate acuity scores for critically ill or injured patients
CN111489800A (en) Analysis method and system for identifying and storing medical record and report list images
CN113094477B (en) Data structuring method and device, computer equipment and storage medium
CN113688248B (en) Medical event identification method and system under condition of small sample weak labeling
CN116580849B (en) Medical data acquisition and analysis system and method thereof
US11615361B2 (en) Machine learning model for predicting litigation risk in correspondence and identifying severity levels
EP3779757B1 (en) Simulated risk contributions
WO2021008601A1 (en) Method for testing medical data
CN116821646A (en) Data processing chain construction method, data reduction method, device, equipment and medium
CN114358001A (en) Method for standardizing diagnosis result, and related device, equipment and storage medium thereof
WO2021051496A1 (en) Diagnosis result identification and model training method, computer device, and storage medium
Zhao et al. Multivariate time series data imputation using attention-based mechanism
CN117316462A (en) Medical data management method
CN115083550B (en) Patient similarity classification method based on multi-source information
CN116468043A (en) Nested entity identification method, device, equipment and storage medium
Henao et al. Patient clustering with uncoded text in electronic medical records

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 21936585

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 21936585

Country of ref document: EP

Kind code of ref document: A1