WO2021159751A1 - Semantic and image recognition-based electrocardiography information extraction method and apparatus, computer device, and storage medium - Google Patents
Semantic and image recognition-based electrocardiography information extraction method and apparatus, computer device, and storage medium Download PDFInfo
- Publication number
- WO2021159751A1 WO2021159751A1 PCT/CN2020/125059 CN2020125059W WO2021159751A1 WO 2021159751 A1 WO2021159751 A1 WO 2021159751A1 CN 2020125059 W CN2020125059 W CN 2020125059W WO 2021159751 A1 WO2021159751 A1 WO 2021159751A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- text
- information
- recognized
- classification
- res2net
- Prior art date
Links
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F2218/00—Aspects of pattern recognition specially adapted for signal processing
- G06F2218/12—Classification; Matching
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/30—Semantic analysis
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/045—Combinations of networks
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/08—Learning methods
Definitions
- This application relates to the technical field of artificial intelligence intelligent decision-making, and in particular to a method, device, computer equipment and storage medium for extracting ECG information based on semantic and image recognition.
- the ECG intelligent diagnosis system has been widely used in daily life.
- some smart wearable devices such as Apple's Apple Watch
- the inventor realizes that the above method has the following defects:
- ECG information is obtained based on smart portable measurement, and the accuracy and reliability are far inferior to medical electrocardiographs;
- the embodiments of this application provide a method, device, computer equipment and storage medium for extracting ECG information based on semantic and image recognition, aiming to solve the problem that the ECG information in the prior art is obtained based on smart portable measurement, with accuracy and reliability The problem of low accuracy and low accuracy of the image recognition model that recognizes the electrocardiogram.
- an embodiment of the present application provides a method for extracting ECG information based on semantic and image recognition, which includes:
- the text description information includes the keyword, acquiring the text description information as the current text information to be recognized;
- the pre-trained Light GBM model is called, the semantic vector and the output vector are input to the Light GBM model for classification, and the corresponding classification result is obtained.
- an embodiment of the present application provides an electrocardiographic information extraction device based on semantic and image recognition, which includes:
- the text description information receiving unit is used to receive the text description information uploaded by the client;
- the keyword judgment unit is used to judge whether the text description information includes preset keywords
- the first text information obtaining unit is configured to obtain the text description information as the current text information to be recognized if the keyword is included in the text description information;
- a guiding question set sending unit configured to call a pre-stored guiding question set and send it to the user terminal if the keyword is not included in the text description information
- the second text information acquiring unit is configured to receive the reply text information correspondingly sent by the user terminal according to the guide question set as the current text information to be recognized;
- a semantic vector acquiring unit configured to perform semantic recognition on the currently to-be-recognized text information to obtain a semantic vector corresponding to the currently-to-be-recognized text information
- the image classification unit is configured to receive uploaded electrocardiogram images, call a pre-trained Res2Net classification network based on the attention mechanism, and classify the electrocardiogram images according to the Res2Net classification network based on the attention mechanism to obtain a corresponding output vector; as well as
- the classification result obtaining unit is configured to call a pre-trained Light GBM model, input the semantic vector and the output vector to the Light GBM model for classification, and obtain a corresponding classification result.
- an embodiment of the present application further provides a computer device, which includes a memory, a processor, and a computer program that is stored on the memory and can run on the processor, and the processor executes the following steps:
- the text description information includes the keyword, acquiring the text description information as the current text information to be recognized;
- the pre-trained Light GBM model is called, the semantic vector and the output vector are input to the Light GBM model for classification, and the corresponding classification result is obtained.
- the embodiments of the present application also provide a computer-readable storage medium, wherein the computer-readable storage medium stores a computer program, and the computer program is executed by a processor to perform the following operations:
- the text description information includes the keyword, acquiring the text description information as the current text information to be recognized;
- the pre-trained Light GBM model is called, the semantic vector and the output vector are input to the Light GBM model for classification, and the corresponding classification result is obtained.
- the embodiments of the present application provide a method, device, computer equipment and storage medium for extracting ECG information based on semantic and image recognition, including semantic recognition of the current text information to be recognized, so as to obtain information corresponding to the current text information to be recognized
- Receiving the uploaded ECG image calling the pre-trained Res2Net classification network based on the attention mechanism, classifying the ECG image according to the Res2Net classification network based on the attention mechanism, and obtaining the corresponding output vector; and calling The pre-trained Light GBM model inputs the semantic vector and the output vector to the Light GBM model for classification, and obtains a corresponding classification result.
- This method combines the text information uploaded by the user terminal and the image information corresponding to the ECG image, and then performs classification through the Light GBM algorithm, which improves the accuracy and credibility of the classification results.
- FIG. 1 is a schematic diagram of an application scenario of an ECG information extraction method based on semantic and image recognition provided by an embodiment of the application;
- FIG. 2 is a schematic flowchart of a method for extracting ECG information based on semantic and image recognition provided by an embodiment of the application;
- FIG. 3 is a schematic diagram of a sub-process of a method for extracting ECG information based on semantics and image recognition provided by an embodiment of the application;
- FIG. 4 is a schematic diagram of another sub-process of the method for extracting ECG information based on semantics and image recognition provided by an embodiment of the application;
- FIG. 5 is a schematic block diagram of an electrocardiographic information extraction device based on semantic and image recognition provided by an embodiment of the application;
- FIG. 6 is a schematic block diagram of subunits of an electrocardiographic information extraction device based on semantic and image recognition provided by an embodiment of the application;
- FIG. 7 is a schematic block diagram of another subunit of the device for extracting ECG information based on semantics and image recognition according to an embodiment of the application;
- FIG. 8 is a schematic block diagram of a computer device provided by an embodiment of the application.
- Figure 1 is a schematic diagram of an application scenario of an ECG information extraction method based on semantic and image recognition provided by an embodiment of this application
- Figure 2 is an ECG based on semantic and image recognition provided by an embodiment of this application
- the schematic flow chart of the information extraction method The ECG information extraction method based on semantic and image recognition is applied to a server, and the method is executed by application software installed in the server.
- the method includes steps S110 to S180.
- the user terminal may be prompted to upload the text description information at this time.
- the user edits a self-description on the user terminal according to his own situation and uploads it to the server.
- S120 Determine whether the text description information includes preset keywords.
- the server can first detect and determine the description. Whether the text description information includes preset keywords (such as chest tightness, shortness of breath, fast heartbeat, etc.).
- the text description information when it is determined that the text description information includes the keyword, it means that the text description information includes valid information, and the text description information is directly acquired as the current text information to be recognized.
- the server needs to call the guided question set and send the guided question set to the client To guide users to supplement information.
- the guidance question set includes multiple guidance questions such as "whether chest tightness and shortness of breath?" and "whether the heartbeat is fast?"
- S150 Receive the reply text information correspondingly sent by the user terminal according to the guide question set as the current text information to be recognized.
- the text answers of the users are integrated, and the reply text information can be obtained as the current text information to be recognized.
- the currently obtained response text information includes more effective information.
- S160 Perform semantic recognition on the currently to-be-recognized text information to obtain a semantic vector corresponding to the currently-to-be-recognized text information.
- the word vectors corresponding to the keywords can be obtained to form a semantic vector.
- step S160 includes:
- S162 Perform one-hot encoding on each text keyword in the text keyword set to obtain a word vector corresponding to each text keyword;
- S163 Calculate the semantic vector corresponding to the current text to be recognized according to the word vector corresponding to each text keyword and the weight value corresponding to each text keyword.
- keyword extraction is performed on the current text information to be recognized through the BERT model (that is, the two-way encoder representation model of the Transformers model), and then each keyword can be extracted
- the correction corresponds to the medical terminology (for example, the fast correction is replaced by fast heartbeat).
- the corresponding keywords are converted into word vectors and then the semantic vectors corresponding to the current text to be recognized are calculated.
- the BERT model uses Transformer Encoder (ie the encoder in the Transformer structure) as the feature extractor, which is composed of Nx exactly the same layers, and each layer has 2 sub-layers (ie sub-layers), which are: Multi- Head Self-Attention mechanism (ie multi-head self-attention mechanism), Position-Wise fully connected forward neural network. For each sub-layer, two operations are added: Residual Connection and Normalization.
- the input of the BERT model is a linear sequence, which supports single sentence text and sentence pair text.
- the beginning of the sentence is represented by the symbol [CLS]
- the end of the sentence is represented by the symbol [SEP]. If it is a sentence pair, the symbol [SEP] is added between the sentences.
- MLM Masked LM
- NSP Next Sentence Prediction
- each text keyword in the text keyword set is one-hot encoded, Obtain the word vector corresponding to each text keyword. Since the weight value of each keyword is known in the corpus, at this time, according to the word vector corresponding to each text keyword and the weight value corresponding to each text keyword, the semantic vector corresponding to the current text to be recognized is calculated . The semantic vector extracted in this way can more accurately represent the current text information to be recognized.
- S170 Receive the uploaded electrocardiogram image, call a pre-trained Res2Net classification network based on the attention mechanism, and classify the electrocardiogram image according to the Res2Net classification network based on the attention mechanism to obtain a corresponding output vector.
- the content of the vector representation may be less, which affects the final The classification result, at this time, can further prompt the user to upload the ECG image, and add some picture features combined with the semantic vector, so that the final vector representation content is rich, which is more conducive to obtaining accurate classification results.
- step S170 includes:
- S172 Use the pixel matrix as an input of the Res2Net network in the Res2Net classification network based on the attention mechanism to perform operations to obtain a morphological feature vector;
- the pixel matrix corresponding to the ECG image is input to the deep learning Res2Net network to learn the morphological special diagnosis of the picture, and then input it into the attention structure, allowing the model to focus more on finding the input.
- Res2Net which is an upgraded version of the ResNet network, that is, the residual network.
- Res2Net Compared with ResNet, Res2Net not only improves the accuracy of recognition, but also optimizes the size and parameters of the model. This more lightweight model can improve Response speed and reduce the server's hardware requirements.
- step S172 includes:
- the pixel matrix is input into the Res2Net network for sequential convolution, identity mapping, pooling, and full connection are performed in a multi-layer residual structure to obtain a morphological feature vector.
- the method before step S170, the method further includes:
- the server provides two interfaces for uploading ECG images from the client and the smart electrocardiograph.
- the server can send to the client or the smart electrocardiograph for obtaining the ECG images. Prompt information. Through this notification method, after the extraction of the semantic vector, the process of obtaining the image classification result can be triggered more quickly.
- the server after integrating the semantic vector and the output vector corresponding to the ECG image, the server obtains a complete set of feature vectors, and performs learning and judgment based on the current Light GBM model to obtain the corresponding classification results.
- the Light GBM model is a learning algorithm based on decision trees, which has faster training speed, higher accuracy and big data processing capabilities.
- step S180 includes:
- the graphic feature vector is classified through the histogram-based decision in the Light GBM model to obtain a corresponding classification result.
- the purpose of combining the semantic vector and the output vector with independent features is to reduce feature dimensions and improve calculation efficiency. Since the semantic vector and the output vector are mutually exclusive, the two features are bundled together so that no information will be lost.
- the histogram-based decision is used for classification. Since the histogram only needs to calculate the information gain for the histogram statistics, it is compared with the pre-sorting algorithm, which traverses all values every time , The calculation amount of information gain is much smaller, and the memory space needs to be relatively small.
- step S180 the method further includes:
- the classification by the Light GBM model is more accurate, and the classification result determined by the semantic vector and the output vector can be determined (for example, there are heart disease).
- the server can be used as a blockchain node device to upload the current text to the blockchain network, making full use of the non-tamperable characteristics of the blockchain data, and realizing the solidification of data evidence.
- the corresponding summary information is obtained based on the current text.
- the summary information is obtained by hashing the current text, for example, obtained by using the sha256 algorithm.
- Uploading summary information to the blockchain can ensure its security and fairness and transparency to users.
- the user equipment can download the summary information from the blockchain to verify whether the current text has been tampered with.
- the blockchain referred to in this example is a new application mode of computer technology such as distributed data storage, point-to-point transmission, consensus mechanism, and encryption algorithm.
- Blockchain essentially a decentralized database, is a series of data blocks associated with cryptographic methods. Each data block contains a batch of network transaction information for verification. The validity of the information (anti-counterfeiting) and the generation of the next block.
- the blockchain can include the underlying platform of the blockchain, the platform product service layer, and the application service layer.
- This method combines the text information uploaded by the user terminal and the image information corresponding to the ECG image, and then performs classification through the Light GBM algorithm, which improves the accuracy and credibility of the classification results.
- the embodiment of the present application also provides an electrocardiographic information extraction device based on semantic and image recognition.
- the electrocardiographic information extraction device based on semantic and image recognition is used to perform any of the foregoing electrocardiographic information extraction methods based on semantic and image recognition. Examples. Specifically, please refer to FIG. 5, which is a schematic block diagram of an electrocardiographic information extraction device based on semantic and image recognition provided by an embodiment of the present application.
- the electrocardiographic information extraction device 100 based on semantic and image recognition can be configured in a server.
- the electrocardiographic information extraction device 100 based on semantic and image recognition includes: a text description information receiving unit 110, a keyword judgment unit 120, a first text information acquiring unit 130, a guide question set sending unit 140, and a second The text information acquisition unit 150, the semantic vector acquisition unit 160, the image classification unit 170, and the classification result acquisition unit 180.
- the text description information receiving unit 110 is configured to receive the text description information uploaded by the client.
- the user terminal may be prompted to upload the text description information at this time.
- the user edits a self-description on the user terminal according to his own situation and uploads it to the server.
- the keyword judgment unit 120 is configured to judge whether the text description information includes preset keywords.
- the server can first detect and determine the description. Whether the text description information includes preset keywords (such as chest tightness, shortness of breath, fast heartbeat, etc.).
- the first text information obtaining unit 130 is configured to obtain the text description information as the current text information to be recognized if the keyword is included in the text description information.
- the text description information when it is determined that the text description information includes the keyword, it means that the text description information includes valid information, and the text description information is directly acquired as the current text information to be recognized.
- the guiding question set sending unit 140 is configured to call a pre-stored guiding question set and send it to the user terminal if the keyword is not included in the text description information.
- the server needs to call the guided question set and send the guided question set to the client To guide users to supplement information.
- the guidance question set includes multiple guidance questions such as "whether chest tightness and shortness of breath?" and "whether the heartbeat is fast?"
- the second text information acquiring unit 150 is configured to receive the reply text information correspondingly sent by the user terminal according to the guide question set as the current text information to be recognized.
- the text answers of the users are integrated, and the reply text information can be obtained as the current text information to be recognized.
- the currently obtained response text information includes more effective information.
- the semantic vector obtaining unit 160 is configured to perform semantic recognition on the current text information to be recognized to obtain a semantic vector corresponding to the current text information to be recognized.
- the word vectors corresponding to the keywords can be obtained to form a semantic vector.
- the semantic vector obtaining unit 160 includes:
- the keyword extraction unit 161 is configured to call a pre-trained BERT model, extract keywords from the current text information to be recognized through the BERT model, and obtain a text keyword set corresponding to the current text information to be recognized; wherein ,
- the BERT model represents a bidirectional encoder representation model of the Transformers model;
- the word vector obtaining unit 162 is configured to perform one-hot encoding on each text keyword in the text keyword set to obtain the word vector corresponding to each text keyword;
- the semantic vector calculation unit 163 is configured to calculate the semantic vector corresponding to the current text to be recognized according to the word vector corresponding to each text keyword and the weight value corresponding to each text keyword.
- keyword extraction is performed on the current text information to be recognized through the BERT model (that is, the two-way encoder representation model of the Transformers model), and then each keyword can be extracted
- the correction corresponds to the medical terminology (for example, the fast correction is replaced by fast heartbeat).
- the corresponding keywords are converted into word vectors and then the semantic vectors corresponding to the current text to be recognized are calculated.
- the BERT model uses Transformer Encoder (ie the encoder in the Transformer structure) as the feature extractor, which is composed of Nx exactly the same layers, and each layer has 2 sub-layers (ie sub-layers), which are: Multi- Head Self-Attention mechanism (ie multi-head self-attention mechanism), Position-Wise fully connected forward neural network. For each sub-layer, two operations are added: Residual Connection and Normalization.
- the input of the BERT model is a linear sequence, which supports single sentence text and sentence pair text.
- the beginning of the sentence is represented by the symbol [CLS]
- the end of the sentence is represented by the symbol [SEP]. If it is a sentence pair, the symbol [SEP] is added between the sentences.
- MLM Masked LM
- NSP Next Sentence Prediction
- each text keyword in the text keyword set is one-hot encoded, Obtain the word vector corresponding to each text keyword. Since the weight value of each keyword is known in the corpus, at this time, according to the word vector corresponding to each text keyword and the weight value corresponding to each text keyword, the semantic vector corresponding to the current text to be recognized is calculated . The semantic vector extracted in this way can more accurately represent the current text information to be recognized.
- the image classification unit 170 is configured to receive the uploaded electrocardiogram images, call the pre-trained Res2Net classification network based on the attention mechanism, and classify the electrocardiogram images according to the attention mechanism-based Res2Net classification network to obtain the corresponding output vector .
- the content of the vector representation may be less, which affects the final The classification result, at this time, can further prompt the user to upload the ECG image, and add some picture features combined with the semantic vector, so that the final vector representation content is rich, which is more conducive to obtaining accurate classification results.
- the image classification unit 170 includes:
- the matrix obtaining unit 171 is configured to obtain a pixel matrix corresponding to the electrocardiogram image
- the morphological feature vector obtaining unit 172 is configured to use the pixel matrix as the input of the Res2Net network in the Res2Net classification network based on the attention mechanism to perform operations to obtain a morphological feature vector;
- the output vector calculation unit 173 is configured to use the morphological feature vector as the attention mechanism structure in the attention mechanism-based Res2Net classification network to perform operations to obtain an output vector.
- the pixel matrix corresponding to the ECG image is input to the deep learning Res2Net network to learn the morphological special diagnosis of the picture, and then input it into the attention structure, allowing the model to focus more on finding the input.
- Res2Net which is an upgraded version of the ResNet network, that is, the residual network.
- Res2Net Compared with ResNet, Res2Net not only improves the accuracy of recognition, but also optimizes the size and parameters of the model. This more lightweight model can improve Response speed and reduce the server's hardware requirements.
- the morphological feature vector obtaining unit 172 is further configured to:
- the pixel matrix is input into the Res2Net network for sequential convolution, identity mapping, pooling, and full connection are performed in a multi-layer residual structure to obtain a morphological feature vector.
- the device 100 for extracting ECG information based on semantic and image recognition further includes:
- the reminder information sending unit is used to send the reminder information for obtaining the electrocardiogram image to the user terminal or the smart electrocardiograph;
- the electrocardiogram image receiving unit is used to receive the electrocardiogram image sent by the user terminal or the smart electrocardiograph according to the prompt information.
- the server provides two interfaces for uploading ECG images on the client side and uploading ECG images from the smart electrocardiograph.
- the server can send ECG images to the client or the smart electrocardiograph when acquiring the ECG images. Prompt information. Through this notification method, after the extraction of the semantic vector, the process of obtaining the image classification result can be triggered more quickly.
- the classification result obtaining unit 180 is configured to call a pre-trained Light GBM model, input the semantic vector and the output vector into the Light GBM model for classification, and obtain a corresponding classification result.
- the server after integrating the semantic vector and the output vector corresponding to the ECG image, the server obtains a complete set of feature vectors, and performs learning and judgment based on the current Light GBM model to obtain the corresponding classification results.
- the Light GBM model is a learning algorithm based on decision trees, which has faster training speed, higher accuracy and big data processing capabilities.
- the classification result obtaining unit 180 includes:
- a graphic feature vector obtaining unit configured to merge the semantic vector and the output vector with independent features to obtain a graphic feature vector
- the decision classification unit is configured to classify the graphic feature vector through a histogram-based decision in the Light GBM model to obtain a corresponding classification result.
- the purpose of combining the semantic vector and the output vector with independent features is to reduce feature dimensions and improve calculation efficiency. Since the semantic vector and the output vector are mutually exclusive, the two features are bundled together so that no information will be lost.
- the histogram-based decision is used for classification. Since the histogram only needs to calculate the information gain for the histogram statistics, it is compared with the pre-sorting algorithm, which traverses all values every time , The calculation amount of information gain is much smaller, and the memory space needs to be relatively small.
- the device 100 for extracting ECG information based on semantic and image recognition further includes:
- the current text generation unit is configured to call a pre-stored text template, and fill the classification result into the text template to obtain the current text;
- the current text sending unit is used to send the current text to the user terminal
- the on-chain unit is used to upload the current text to the blockchain network.
- the classification by the Light GBM model is more accurate, and the classification result determined by the semantic vector and the output vector can be determined (for example, there are heart disease).
- the server can be used as a blockchain node device to upload the current text to the blockchain network, making full use of the non-tamperable characteristics of the blockchain data, and realizing the solidification of data evidence.
- the corresponding summary information is obtained based on the current text.
- the summary information is obtained by hashing the current text, for example, obtained by using the sha256 algorithm.
- Uploading summary information to the blockchain can ensure its security and fairness and transparency to users.
- the user equipment can download the summary information from the blockchain to verify whether the current text has been tampered with.
- the blockchain referred to in this example is a new application mode of computer technology such as distributed data storage, point-to-point transmission, consensus mechanism, and encryption algorithm.
- Blockchain essentially a decentralized database, is a series of data blocks associated with cryptographic methods. Each data block contains a batch of network transaction information for verification. The validity of the information (anti-counterfeiting) and the generation of the next block.
- the blockchain can include the underlying platform of the blockchain, the platform product service layer, and the application service layer.
- the device realizes the combination of the text information uploaded by the user terminal and the image information corresponding to the ECG image, and then the Light GBM algorithm is used for classification, which improves the accuracy and credibility of the classification results.
- the above-mentioned apparatus for extracting ECG information based on semantics and image recognition can be implemented in the form of a computer program, and the computer program can be run on a computer device as shown in FIG. 8.
- FIG. 8 is a schematic block diagram of a computer device according to an embodiment of the present application.
- the computer device 500 is a server, and the server may be an independent server or a server cluster composed of multiple servers.
- the computer device 500 includes a processor 502, a memory, and a network interface 505 connected through a system bus 501, where the memory may include a non-volatile storage medium 503 and an internal memory 504.
- the non-volatile storage medium 503 can store an operating system 5031 and a computer program 5032.
- the processor 502 can execute an electrocardiographic information extraction method based on semantics and image recognition.
- the processor 502 is used to provide calculation and control capabilities, and support the operation of the entire computer device 500.
- the internal memory 504 provides an environment for the operation of the computer program 5032 in the non-volatile storage medium 503.
- the processor 502 can execute the method for extracting ECG information based on semantics and image recognition. .
- the network interface 505 is used for network communication, such as providing data information transmission.
- the structure shown in FIG. 8 is only a block diagram of a part of the structure related to the solution of the present application, and does not constitute a limitation on the computer device 500 to which the solution of the present application is applied.
- the specific computer device 500 may include more or fewer components than shown in the figure, or combine certain components, or have a different component arrangement.
- the processor 502 is configured to run a computer program 5032 stored in a memory to implement the method for extracting ECG information based on semantics and image recognition disclosed in the embodiments of the present application.
- the embodiment of the computer device shown in FIG. 8 does not constitute a limitation on the specific configuration of the computer device.
- the computer device may include more or less components than those shown in the figure. Or some parts are combined, or different parts are arranged.
- the computer device may only include a memory and a processor. In such an embodiment, the structures and functions of the memory and the processor are consistent with the embodiment shown in FIG. 8 and will not be repeated here.
- the processor 502 may be a central processing unit (Central Processing Unit, CPU), and the processor 502 may also be other general-purpose processors, digital signal processors (Digital Signal Processors, DSPs), Application Specific Integrated Circuit (ASIC), Field-Programmable Gate Array (FPGA) or other programmable logic devices, discrete gates or transistor logic devices, discrete hardware components, etc.
- the general-purpose processor may be a microprocessor or the processor may also be any conventional processor.
- a computer-readable storage medium In another embodiment of the present application, a computer-readable storage medium is provided.
- the computer-readable storage medium may be non-volatile or volatile.
- the computer-readable storage medium stores a computer program, where the computer program is executed by a processor to realize the method for extracting electrocardiographic information based on semantic and image recognition disclosed in the embodiments of the present application.
- the disclosed equipment, device, and method may be implemented in other ways.
- the device embodiments described above are only illustrative.
- the division of the units is only a logical function division. In actual implementation, there may be other division methods, or the units with the same function may be combined into one. Units, for example, multiple units or components can be combined or integrated into another system, or some features can be omitted or not implemented.
- the displayed or discussed mutual coupling or direct coupling or communication connection may be indirect coupling or communication connection through some interfaces, devices or units, and may also be electrical, mechanical or other forms of connection.
- the units described as separate components may or may not be physically separated, and the components displayed as units may or may not be physical units, that is, they may be located in one place, or they may be distributed on multiple network units. Some or all of the units may be selected according to actual needs to achieve the objectives of the solutions of the embodiments of the present application.
- the functional units in the various embodiments of the present application may be integrated into one processing unit, or each unit may exist alone physically, or two or more units may be integrated into one unit.
- the above-mentioned integrated unit can be implemented in the form of hardware or software functional unit.
- the integrated unit is implemented in the form of a software functional unit and sold or used as an independent product, it can be stored in a storage medium.
- the technical solution of this application is essentially or the part that contributes to the existing technology, or all or part of the technical solution can be embodied in the form of a software product, and the computer software product is stored in a storage medium. It includes several instructions to make a computer device (which may be a personal computer, a server, or a network device, etc.) execute all or part of the steps of the method described in each embodiment of the present application.
- the aforementioned storage media include: U disk, mobile hard disk, read-only memory (ROM, Read-Only Memory), magnetic disk or optical disk and other media that can store program codes.
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Artificial Intelligence (AREA)
- General Engineering & Computer Science (AREA)
- General Health & Medical Sciences (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Software Systems (AREA)
- Molecular Biology (AREA)
- Biophysics (AREA)
- Life Sciences & Earth Sciences (AREA)
- Data Mining & Analysis (AREA)
- Evolutionary Computation (AREA)
- Mathematical Physics (AREA)
- Biomedical Technology (AREA)
- Computing Systems (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Signal Processing (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Measurement And Recording Of Electrical Phenomena And Electrical Characteristics Of The Living Body (AREA)
- Medical Treatment And Welfare Office Work (AREA)
Abstract
A semantic and image recognition-based electrocardiography information extraction method and apparatus, a computer device, and a storage medium, which relate to artificial intelligence technology, and may be applied to smart medical treatment scenarios. The method comprises: performing semantic recognition on current text information to be recognized, and obtaining a semantic vector corresponding to the current text information to be recognized; receiving an uploaded electrocardiogram image, calling an attention mechanism-based Res2Net classification network, classifying the electrocardiogram image according to the attention mechanism-based Res2Net classification network, and obtaining a corresponding output vector; calling a pre-trained Light GBM model, inputting the semantic vector and the output vector into the Light GBM model for classification, and obtaining a corresponding classification result. The method also relates to medical technology and blockchain technology, in that text information uploaded by a user end and image information corresponding to an electrocardiogram image are combined, and classification is then performed by means of a Light GBM algorithm, thereby improving the accuracy and credibility of a classification result.
Description
本申请要求于2020年9月22日提交中国专利局、申请号为202011001748.5,申请名称为“基于语义和图像识别的心电信息提取方法及装置”的中国专利申请的优先权,其全部内容通过引用结合在本申请中。This application claims the priority of a Chinese patent application filed with the Chinese Patent Office on September 22, 2020, the application number is 202011001748.5, and the application title is "Electrocardiographic Information Extraction Method and Apparatus Based on Semantic and Image Recognition". The entire content of the application is approved The reference is incorporated in this application.
本申请涉及人工智能的智能决策技术领域,尤其涉及一种基于语义和图像识别的心电信息提取方法、装置、计算机设备及存储介质。This application relates to the technical field of artificial intelligence intelligent decision-making, and in particular to a method, device, computer equipment and storage medium for extracting ECG information based on semantic and image recognition.
目前,心电智能诊断系统在日常生活中得到了广泛应用,例如一些智能穿戴设备(如苹果公司的Apple Watch)可以采集用户的心电信息,还可将采集的心电信息生成心电图后上传至服务器进行后续图像识别,以生成报告信息。发明人意识到上述方式存在以下缺陷:At present, the ECG intelligent diagnosis system has been widely used in daily life. For example, some smart wearable devices (such as Apple's Apple Watch) can collect the user's ECG information, and can also generate the collected ECG information and upload it to The server performs subsequent image recognition to generate report information. The inventor realizes that the above method has the following defects:
1)上传的仅仅只有心电图,缺乏文字描述;1) Only the electrocardiogram is uploaded, and the text description is lacking;
2)心电图信息都是基于智能便携式测量得到的,精度与可靠性远远不如医用心电仪;2) ECG information is obtained based on smart portable measurement, and the accuracy and reliability are far inferior to medical electrocardiographs;
3)对心电图进行识别的图像识别模型准确度较低。3) The accuracy of the image recognition model that recognizes the electrocardiogram is low.
发明内容Summary of the invention
本申请实施例提供了一种基于语义和图像识别的心电信息提取方法、装置、计算机设备及存储介质,旨在解决现有技术中心电图信息都是基于智能便携式测量得到的,精度与可靠性较低,而且对心电图进行识别的图像识别模型准确度较低的问题。The embodiments of this application provide a method, device, computer equipment and storage medium for extracting ECG information based on semantic and image recognition, aiming to solve the problem that the ECG information in the prior art is obtained based on smart portable measurement, with accuracy and reliability The problem of low accuracy and low accuracy of the image recognition model that recognizes the electrocardiogram.
第一方面,本申请实施例提供了一种基于语义和图像识别的心电信息提取方法,其包括:In the first aspect, an embodiment of the present application provides a method for extracting ECG information based on semantic and image recognition, which includes:
接收用户端上传的文本描述信息;Receive text description information uploaded by the client;
判断所述文本描述信息中是否包括预设的关键词;Determine whether the text description information includes preset keywords;
若所述文本描述信息中包括所述关键词,获取所述文本描述信息以作为当前待识别文本信息;If the text description information includes the keyword, acquiring the text description information as the current text information to be recognized;
若所述文本描述信息中不包括所述关键词,调用预先存储的引导问题集发送至用户端;If the keyword is not included in the text description information, call the pre-stored guide question set and send it to the user terminal;
接收用户端根据所述引导问题集对应发送的回复文本信息以作为当前待识别文本信息;Receiving the reply text information correspondingly sent by the user terminal according to the guide question set as the current text information to be recognized;
对所述当前待识别文本信息进行语义识别,以得到与所述当前待识别文本信息对应的语义向量;Performing semantic recognition on the currently to-be-recognized text information to obtain a semantic vector corresponding to the currently-to-be-recognized text information;
接收上传的心电图影像,调用预先训练的基于注意力机制的Res2Net分类网络,将所述心电图影像根据所述基于注意力机制的Res2Net分类网络进行分类,得到对应的输出向量;以及Receiving the uploaded electrocardiogram image, invoking the pre-trained Res2Net classification network based on the attention mechanism, and classifying the electrocardiogram image according to the attention mechanism-based Res2Net classification network to obtain the corresponding output vector; and
调用预先训练的Light GBM模型,将所述语义向量及所述输出向量输入至所述Light GBM模型进行分类,得到对应的分类结果。The pre-trained Light GBM model is called, the semantic vector and the output vector are input to the Light GBM model for classification, and the corresponding classification result is obtained.
第二方面,本申请实施例提供了一种基于语义和图像识别的心电信息提取装置,其包括:In the second aspect, an embodiment of the present application provides an electrocardiographic information extraction device based on semantic and image recognition, which includes:
文本描述信息接收单元,用于接收用户端上传的文本描述信息;The text description information receiving unit is used to receive the text description information uploaded by the client;
关键词判断单元,用于判断所述文本描述信息中是否包括预设的关键词;The keyword judgment unit is used to judge whether the text description information includes preset keywords;
第一文本信息获取单元,用于若所述文本描述信息中包括所述关键词,获取所述文本描述信息以作为当前待识别文本信息;The first text information obtaining unit is configured to obtain the text description information as the current text information to be recognized if the keyword is included in the text description information;
引导问题集发送单元,用于若所述文本描述信息中不包括所述关键词,调用预先存储的引导问题集发送至用户端;A guiding question set sending unit, configured to call a pre-stored guiding question set and send it to the user terminal if the keyword is not included in the text description information;
第二文本信息获取单元,用于接收用户端根据所述引导问题集对应发送的回复文本信息以作为当前待识别文本信息;The second text information acquiring unit is configured to receive the reply text information correspondingly sent by the user terminal according to the guide question set as the current text information to be recognized;
语义向量获取单元,用于对所述当前待识别文本信息进行语义识别,以得到与所述当前 待识别文本信息对应的语义向量;A semantic vector acquiring unit, configured to perform semantic recognition on the currently to-be-recognized text information to obtain a semantic vector corresponding to the currently-to-be-recognized text information;
影像分类单元,用于接收上传的心电图影像,调用预先训练的基于注意力机制的Res2Net分类网络,将所述心电图影像根据所述基于注意力机制的Res2Net分类网络进行分类,得到对应的输出向量;以及The image classification unit is configured to receive uploaded electrocardiogram images, call a pre-trained Res2Net classification network based on the attention mechanism, and classify the electrocardiogram images according to the Res2Net classification network based on the attention mechanism to obtain a corresponding output vector; as well as
分类结果获取单元,用于调用预先训练的Light GBM模型,将所述语义向量及所述输出向量输入至所述Light GBM模型进行分类,得到对应的分类结果。The classification result obtaining unit is configured to call a pre-trained Light GBM model, input the semantic vector and the output vector to the Light GBM model for classification, and obtain a corresponding classification result.
第三方面,本申请实施例又提供了一种计算机设备,其包括存储器、处理器及存储在所述存储器上并可在所述处理器上运行的计算机程序,所述处理器执行以下步骤:In a third aspect, an embodiment of the present application further provides a computer device, which includes a memory, a processor, and a computer program that is stored on the memory and can run on the processor, and the processor executes the following steps:
接收用户端上传的文本描述信息;Receive text description information uploaded by the client;
判断所述文本描述信息中是否包括预设的关键词;Determine whether the text description information includes preset keywords;
若所述文本描述信息中包括所述关键词,获取所述文本描述信息以作为当前待识别文本信息;If the text description information includes the keyword, acquiring the text description information as the current text information to be recognized;
若所述文本描述信息中不包括所述关键词,调用预先存储的引导问题集发送至用户端;If the keyword is not included in the text description information, call the pre-stored guide question set and send it to the user terminal;
接收用户端根据所述引导问题集对应发送的回复文本信息以作为当前待识别文本信息;Receiving the reply text information correspondingly sent by the user terminal according to the guide question set as the current text information to be recognized;
对所述当前待识别文本信息进行语义识别,以得到与所述当前待识别文本信息对应的语义向量;Performing semantic recognition on the currently to-be-recognized text information to obtain a semantic vector corresponding to the currently-to-be-recognized text information;
接收上传的心电图影像,调用预先训练的基于注意力机制的Res2Net分类网络,将所述心电图影像根据所述基于注意力机制的Res2Net分类网络进行分类,得到对应的输出向量;以及Receiving the uploaded electrocardiogram image, invoking the pre-trained Res2Net classification network based on the attention mechanism, and classifying the electrocardiogram image according to the attention mechanism-based Res2Net classification network to obtain the corresponding output vector; and
调用预先训练的Light GBM模型,将所述语义向量及所述输出向量输入至所述Light GBM模型进行分类,得到对应的分类结果。The pre-trained Light GBM model is called, the semantic vector and the output vector are input to the Light GBM model for classification, and the corresponding classification result is obtained.
第四方面,本申请实施例还提供了一种计算机可读存储介质,其中所述计算机可读存储介质存储有计算机程序,所述计算机程序当被处理器执行以下操作:In a fourth aspect, the embodiments of the present application also provide a computer-readable storage medium, wherein the computer-readable storage medium stores a computer program, and the computer program is executed by a processor to perform the following operations:
接收用户端上传的文本描述信息;Receive text description information uploaded by the client;
判断所述文本描述信息中是否包括预设的关键词;Determine whether the text description information includes preset keywords;
若所述文本描述信息中包括所述关键词,获取所述文本描述信息以作为当前待识别文本信息;If the text description information includes the keyword, acquiring the text description information as the current text information to be recognized;
若所述文本描述信息中不包括所述关键词,调用预先存储的引导问题集发送至用户端;If the keyword is not included in the text description information, call the pre-stored guide question set and send it to the user terminal;
接收用户端根据所述引导问题集对应发送的回复文本信息以作为当前待识别文本信息;Receiving the reply text information correspondingly sent by the user terminal according to the guide question set as the current text information to be recognized;
对所述当前待识别文本信息进行语义识别,以得到与所述当前待识别文本信息对应的语义向量;Performing semantic recognition on the currently to-be-recognized text information to obtain a semantic vector corresponding to the currently-to-be-recognized text information;
接收上传的心电图影像,调用预先训练的基于注意力机制的Res2Net分类网络,将所述心电图影像根据所述基于注意力机制的Res2Net分类网络进行分类,得到对应的输出向量;以及Receiving the uploaded electrocardiogram image, invoking the pre-trained Res2Net classification network based on the attention mechanism, and classifying the electrocardiogram image according to the attention mechanism-based Res2Net classification network to obtain the corresponding output vector; and
调用预先训练的Light GBM模型,将所述语义向量及所述输出向量输入至所述Light GBM模型进行分类,得到对应的分类结果。The pre-trained Light GBM model is called, the semantic vector and the output vector are input to the Light GBM model for classification, and the corresponding classification result is obtained.
本申请实施例提供了一种基于语义和图像识别的心电信息提取方法、装置、计算机设备及存储介质,包括对当前待识别文本信息进行语义识别,以得到与所述当前待识别文本信息对应的语义向量;接收上传的心电图影像,调用预先训练的基于注意力机制的Res2Net分类网络,将所述心电图影像根据所述基于注意力机制的Res2Net分类网络进行分类,得到对应的输出向量;以及调用预先训练的Light GBM模型,将所述语义向量及所述输出向量输入至所述Light GBM模型进行分类,得到对应的分类结果。该方法实现了结合用户端上传的文字信息与心电图影像对应的图像信息,再经过Light GBM算法来进行分类,提升了分类结果的准确度与可信度。The embodiments of the present application provide a method, device, computer equipment and storage medium for extracting ECG information based on semantic and image recognition, including semantic recognition of the current text information to be recognized, so as to obtain information corresponding to the current text information to be recognized Receiving the uploaded ECG image, calling the pre-trained Res2Net classification network based on the attention mechanism, classifying the ECG image according to the Res2Net classification network based on the attention mechanism, and obtaining the corresponding output vector; and calling The pre-trained Light GBM model inputs the semantic vector and the output vector to the Light GBM model for classification, and obtains a corresponding classification result. This method combines the text information uploaded by the user terminal and the image information corresponding to the ECG image, and then performs classification through the Light GBM algorithm, which improves the accuracy and credibility of the classification results.
为了更清楚地说明本申请实施例技术方案,下面将对实施例描述中所需要使用的附图作简单地介绍,显而易见地,下面描述中的附图是本申请的一些实施例,对于本领域普通技术人员来讲,在不付出创造性劳动的前提下,还可以根据这些附图获得其他的附图。In order to explain the technical solutions of the embodiments of the present application more clearly, the following will briefly introduce the drawings used in the description of the embodiments. Obviously, the drawings in the following description are some embodiments of the present application. Ordinary technicians can obtain other drawings based on these drawings without creative work.
图1为本申请实施例提供的基于语义和图像识别的心电信息提取方法的应用场景示意图;FIG. 1 is a schematic diagram of an application scenario of an ECG information extraction method based on semantic and image recognition provided by an embodiment of the application;
图2为本申请实施例提供的基于语义和图像识别的心电信息提取方法的流程示意图;2 is a schematic flowchart of a method for extracting ECG information based on semantic and image recognition provided by an embodiment of the application;
图3为本申请实施例提供的基于语义和图像识别的心电信息提取方法的子流程示意图;FIG. 3 is a schematic diagram of a sub-process of a method for extracting ECG information based on semantics and image recognition provided by an embodiment of the application;
图4为本申请实施例提供的基于语义和图像识别的心电信息提取方法的另一子流程示意图;4 is a schematic diagram of another sub-process of the method for extracting ECG information based on semantics and image recognition provided by an embodiment of the application;
图5为本申请实施例提供的基于语义和图像识别的心电信息提取装置的示意性框图;FIG. 5 is a schematic block diagram of an electrocardiographic information extraction device based on semantic and image recognition provided by an embodiment of the application;
图6为本申请实施例提供的基于语义和图像识别的心电信息提取装置的子单元示意性框图;FIG. 6 is a schematic block diagram of subunits of an electrocardiographic information extraction device based on semantic and image recognition provided by an embodiment of the application; FIG.
图7为本申请实施例提供的基于语义和图像识别的心电信息提取装置的另一子单元示意性框图;FIG. 7 is a schematic block diagram of another subunit of the device for extracting ECG information based on semantics and image recognition according to an embodiment of the application;
图8为本申请实施例提供的计算机设备的示意性框图。FIG. 8 is a schematic block diagram of a computer device provided by an embodiment of the application.
下面将结合本申请实施例中的附图,对本申请实施例中的技术方案进行清楚、完整地描述,显然,所描述的实施例是本申请一部分实施例,而不是全部的实施例。基于本申请中的实施例,本领域普通技术人员在没有做出创造性劳动前提下所获得的所有其他实施例,都属于本申请保护的范围。The technical solutions in the embodiments of the present application will be described clearly and completely in conjunction with the accompanying drawings in the embodiments of the present application. Obviously, the described embodiments are part of the embodiments of the present application, rather than all of them. Based on the embodiments in this application, all other embodiments obtained by those of ordinary skill in the art without creative work shall fall within the protection scope of this application.
应当理解,当在本说明书和所附权利要求书中使用时,术语“包括”和“包含”指示所描述特征、整体、步骤、操作、元素和/或组件的存在,但并不排除一个或多个其它特征、整体、步骤、操作、元素、组件和/或其集合的存在或添加。It should be understood that when used in this specification and appended claims, the terms "including" and "including" indicate the existence of the described features, wholes, steps, operations, elements and/or components, but do not exclude one or The existence or addition of multiple other features, wholes, steps, operations, elements, components, and/or collections thereof.
还应当理解,在此本申请说明书中所使用的术语仅仅是出于描述特定实施例的目的而并不意在限制本申请。如在本申请说明书和所附权利要求书中所使用的那样,除非上下文清楚地指明其它情况,否则单数形式的“一”、“一个”及“该”意在包括复数形式。It should also be understood that the terms used in the specification of this application are only for the purpose of describing specific embodiments and are not intended to limit the application. As used in the specification of this application and the appended claims, unless the context clearly indicates other circumstances, the singular forms "a", "an" and "the" are intended to include plural forms.
还应当进一步理解,在本申请说明书和所附权利要求书中使用的术语“和/或”是指相关联列出的项中的一个或多个的任何组合以及所有可能组合,并且包括这些组合。It should be further understood that the term "and/or" used in the specification and appended claims of this application refers to any combination and all possible combinations of one or more of the associated listed items, and includes these combinations .
请参阅图1和图2,图1为本申请实施例提供的基于语义和图像识别的心电信息提取方法的应用场景示意图;图2为本申请实施例提供的基于语义和图像识别的心电信息提取方法的流程示意图,该基于语义和图像识别的心电信息提取方法应用于服务器中,该方法通过安装于服务器中的应用软件进行执行。Please refer to Figures 1 and 2. Figure 1 is a schematic diagram of an application scenario of an ECG information extraction method based on semantic and image recognition provided by an embodiment of this application; Figure 2 is an ECG based on semantic and image recognition provided by an embodiment of this application The schematic flow chart of the information extraction method. The ECG information extraction method based on semantic and image recognition is applied to a server, and the method is executed by application software installed in the server.
如图2所示,该方法包括步骤S110~S180。As shown in Figure 2, the method includes steps S110 to S180.
S110、接收用户端上传的文本描述信息。S110: Receive the text description information uploaded by the user terminal.
在本实施例中,为了更加完整的获取基于心电图影像和描述文字的输出文本信息,此时可以先提示用户端上传文本描述信息。用户结合自身情况在用户端上编辑一段自我描述后上传至服务器。In this embodiment, in order to obtain the output text information based on the electrocardiogram image and the description text more completely, the user terminal may be prompted to upload the text description information at this time. The user edits a self-description on the user terminal according to his own situation and uploads it to the server.
S120、判断所述文本描述信息中是否包括预设的关键词。S120: Determine whether the text description information includes preset keywords.
在本实施例中,由于用户通过用户端上传的文本描述信息可能是一些与用户自身健康状态不太相关的描述,此时为了更准确的获取文本信息,可以先在服务器中检测并判断所述文本描述信息中是否包括预设的关键词(如胸闷气短、心跳过快等)。In this embodiment, since the text description information uploaded by the user through the user terminal may be some descriptions that are not closely related to the user’s own health status, at this time, in order to obtain the text information more accurately, the server can first detect and determine the description. Whether the text description information includes preset keywords (such as chest tightness, shortness of breath, fast heartbeat, etc.).
S130、若所述文本描述信息中包括所述关键词,获取所述文本描述信息以作为当前待识别文本信息。S130: If the text description information includes the keyword, obtain the text description information as the current text information to be recognized.
在本实施例中,当判断所述文本描述信息中包括所述关键词,表示所述文本描述信息中包括有效信息,直接获取所述文本描述信息以作为当前待识别文本信息。In this embodiment, when it is determined that the text description information includes the keyword, it means that the text description information includes valid information, and the text description information is directly acquired as the current text information to be recognized.
S140、若所述文本描述信息中不包括所述关键词,调用预先存储的引导问题集发送至用 户端。S140: If the keyword is not included in the text description information, call a pre-stored guide question set and send it to the user terminal.
在本实施例中,当判断所述文本描述信息中不包括所述关键词,表示所述文本描述信息中不包括有效信息,此时需要服务器调用引导问题集,将引导问题集发送至用户端,以指导用户补充信息。例如,引导问题集包括“是否胸闷气短?”、“是否心跳过快?”等多个引导问题。通过设置这一引导问题集,可以高效的引导用户回复有效信息,提高后续关键词提取和语义向量提取的效率。In this embodiment, when it is judged that the text description information does not include the keyword, it means that the text description information does not include valid information. At this time, the server needs to call the guided question set and send the guided question set to the client To guide users to supplement information. For example, the guidance question set includes multiple guidance questions such as "whether chest tightness and shortness of breath?" and "whether the heartbeat is fast?" By setting up this guide question set, users can efficiently guide users to reply to valid information, and the efficiency of subsequent keyword extraction and semantic vector extraction can be improved.
S150、接收用户端根据所述引导问题集对应发送的回复文本信息以作为当前待识别文本信息。S150: Receive the reply text information correspondingly sent by the user terminal according to the guide question set as the current text information to be recognized.
在本实施例中,在用户端与服务器基于所述引导问题集的多轮对话之后,整合用户的文字回答,即可得到回复文本信息,以作为当前待识别文本信息。通过这一引导回答式的方式,当前获取的回复文本信息中包括的有效信息更多。In this embodiment, after multiple rounds of dialogue between the user and the server based on the guide question set, the text answers of the users are integrated, and the reply text information can be obtained as the current text information to be recognized. Through this guided response approach, the currently obtained response text information includes more effective information.
S160、对所述当前待识别文本信息进行语义识别,以得到与所述当前待识别文本信息对应的语义向量。S160. Perform semantic recognition on the currently to-be-recognized text information to obtain a semantic vector corresponding to the currently-to-be-recognized text information.
在本实施例中,当获取了当前待识别文本信息后,为了提取其中的关键信息,可以获取其中关键词对应的词向量以组成语义向量。In this embodiment, after obtaining the current text information to be recognized, in order to extract the key information therein, the word vectors corresponding to the keywords can be obtained to form a semantic vector.
在一实施例中,如图3所示,步骤S160包括:In one embodiment, as shown in FIG. 3, step S160 includes:
S161、调用预先训练的BERT模型,将所述当前待识别文本信息通过所述BERT模型进行关键词提取,得到与所述当前待识别文本信息对应的文本关键词集;其中,所述BERT模型表示Transformers模型的双向编码器表示模型;S161. Invoke a pre-trained BERT model, extract keywords from the current text information to be recognized through the BERT model, and obtain a text keyword set corresponding to the current text information to be recognized; wherein, the BERT model represents Two-way encoder representation model of Transformers model;
S162、将所述文本关键词集中各文本关键词进行独热编码,得到各文本关键词分别对应的词向量;S162: Perform one-hot encoding on each text keyword in the text keyword set to obtain a word vector corresponding to each text keyword;
S163、根据各文本关键词分别对应的词向量以及各文本关键词分别对应的权重值,计算得到与所述当前待识别文本对应的语义向量。S163: Calculate the semantic vector corresponding to the current text to be recognized according to the word vector corresponding to each text keyword and the weight value corresponding to each text keyword.
在本实施例中,在获取文本中的关键词时,通过BERT模型(即Transformers模型的双向编码器表示模型)对所述当前待识别文本信息进行关键词提取,之后还可对各关键词进行校正以与医学上的专业术语对应(例如将跳很快校正替换为心跳过快),最后将各关键词对应转化为词向量后计算得到与所述当前待识别文本对应的语义向量。In this embodiment, when acquiring keywords in the text, keyword extraction is performed on the current text information to be recognized through the BERT model (that is, the two-way encoder representation model of the Transformers model), and then each keyword can be extracted The correction corresponds to the medical terminology (for example, the fast correction is replaced by fast heartbeat). Finally, the corresponding keywords are converted into word vectors and then the semantic vectors corresponding to the current text to be recognized are calculated.
其中,BERT模型采用Transformer Encoder(即Transformer结构中的编码器)作为特征提取器,由Nx个完全一样的layer组成,每个layer有2个sub-layer(即子层),分别是:Multi-Head Self-Attention机制(即多头自注意力机制)、Position-Wise全连接前向神经网络。对于每个sub-layer,都添加了2个操作:残差连接Residual Connection和归一化Normalization。Among them, the BERT model uses Transformer Encoder (ie the encoder in the Transformer structure) as the feature extractor, which is composed of Nx exactly the same layers, and each layer has 2 sub-layers (ie sub-layers), which are: Multi- Head Self-Attention mechanism (ie multi-head self-attention mechanism), Position-Wise fully connected forward neural network. For each sub-layer, two operations are added: Residual Connection and Normalization.
而且BERT模型的输入是一个线性序列,支持单句文本和句对文本,句首用符号[CLS]表示,句尾用符号[SEP]表示,如果是句对,句子之间添加符号[SEP]。Moreover, the input of the BERT model is a linear sequence, which supports single sentence text and sentence pair text. The beginning of the sentence is represented by the symbol [CLS], and the end of the sentence is represented by the symbol [SEP]. If it is a sentence pair, the symbol [SEP] is added between the sentences.
BERT模型的预训练采用了MLM(MLM是Masked LM的简称,表示掩盖式语言模型)和NSP(NSP是Next Sentence Prediction的简称,表示预测下一句模型)两种策略用于模型预训练。The pre-training of the BERT model adopts two strategies for model pre-training: MLM (MLM is the abbreviation of Masked LM, which stands for masked language model) and NSP (NSP is the abbreviation of Next Sentence Prediction, which stands for predicting the next sentence model).
当通过BERT模型将所述当前待识别文本信息进行关键词提取,得到与所述当前待识别文本信息对应的文本关键词集后,将所述文本关键词集中各文本关键词进行独热编码,得到各文本关键词分别对应的词向量。由于在语料中是已知各关键词的权重值,此时根据各文本关键词分别对应的词向量以及各文本关键词分别对应的权重值,计算得到与所述当前待识别文本对应的语义向量。通过这一方式提取的语义向量,能更加准确的表征所述当前待识别文本信息。After keyword extraction is performed on the current text information to be recognized through the BERT model, and the text keyword set corresponding to the current text information to be recognized is obtained, each text keyword in the text keyword set is one-hot encoded, Obtain the word vector corresponding to each text keyword. Since the weight value of each keyword is known in the corpus, at this time, according to the word vector corresponding to each text keyword and the weight value corresponding to each text keyword, the semantic vector corresponding to the current text to be recognized is calculated . The semantic vector extracted in this way can more accurately represent the current text information to be recognized.
S170、接收上传的心电图影像,调用预先训练的基于注意力机制的Res2Net分类网络,将所述心电图影像根据所述基于注意力机制的Res2Net分类网络进行分类,得到对应的输出向量。S170. Receive the uploaded electrocardiogram image, call a pre-trained Res2Net classification network based on the attention mechanism, and classify the electrocardiogram image according to the Res2Net classification network based on the attention mechanism to obtain a corresponding output vector.
在本实施例中,由于之前已根据当前待识别文本信息进行语义识别后,得到了对应的语 义向量,如果仅仅基于语义向量作为分类模型的数据,可能因导致向量表征的内容较少,影响最终分类结果,此时可以进一步提示用户上传心电图影像,增加一些图片特征与语义向量相结合,使得最终的向量表征内容丰富,更有利于得到准确的分类结果。In this embodiment, since the corresponding semantic vector has been obtained after semantic recognition based on the current text information to be recognized, if the semantic vector is only used as the data of the classification model, the content of the vector representation may be less, which affects the final The classification result, at this time, can further prompt the user to upload the ECG image, and add some picture features combined with the semantic vector, so that the final vector representation content is rich, which is more conducive to obtaining accurate classification results.
在一实施例中,如图4所示,步骤S170包括:In one embodiment, as shown in FIG. 4, step S170 includes:
S171、获取所述心电图影像对应的像素矩阵;S171. Obtain a pixel matrix corresponding to the electrocardiogram image;
S172、将所述像素矩阵作为所述基于注意力机制的Res2Net分类网络中Res2Net网络的输入进行运算,得到形态特征向量;S172: Use the pixel matrix as an input of the Res2Net network in the Res2Net classification network based on the attention mechanism to perform operations to obtain a morphological feature vector;
S173、将所述形态特征向量作为所述基于注意力机制的Res2Net分类网络中注意力机制结构进行运算,得到输出向量。S173. Use the morphological feature vector as the attention mechanism structure in the attention mechanism-based Res2Net classification network to perform operations to obtain an output vector.
在本实施例中,当接收到了心电图影像后,将心电图影像对应的像素矩阵输入到深度学习的Res2Net网络,学习图片的形态特诊,再输入到注意力结构中,让模型更专注于找到输入数据中与输出更加相关的有用信息,然后对这一区域投入更多的注意力资源,从而提高输出的质量。Res2Net(其是ResNet网络也即残差网络的升级版),Res2Net相对于ResNet不仅在识别的准确率上有提升,对模型的大小和参数也进行了优化,这种更加轻量化的模型可以提高反应速度并且减少服务器对硬件的要求。In this embodiment, when the ECG image is received, the pixel matrix corresponding to the ECG image is input to the deep learning Res2Net network to learn the morphological special diagnosis of the picture, and then input it into the attention structure, allowing the model to focus more on finding the input The useful information in the data is more relevant to the output, and then more attention resources are devoted to this area, thereby improving the quality of the output. Res2Net (which is an upgraded version of the ResNet network, that is, the residual network). Compared with ResNet, Res2Net not only improves the accuracy of recognition, but also optimizes the size and parameters of the model. This more lightweight model can improve Response speed and reduce the server's hardware requirements.
在一实施例中,步骤S172包括:In an embodiment, step S172 includes:
将所述像素矩阵输入至所述Res2Net网络中依次卷积、在多层残差结构进行恒等映射、池化及全连接,得到形态特征向量。The pixel matrix is input into the Res2Net network for sequential convolution, identity mapping, pooling, and full connection are performed in a multi-layer residual structure to obtain a morphological feature vector.
在本实施例中,Res2Net网络中是将卷积神经网络中的多层卷积层中除了第一层卷积层之外的卷积层均做一个残差块的改造以实现恒等映射,从而提高整个Res2Net网络识别的准确率。In this embodiment, in the Res2Net network, all the convolutional layers in the multi-layer convolutional layer in the convolutional neural network except the first convolutional layer are transformed into a residual block to realize the identity mapping. Thereby improving the accuracy of recognition of the entire Res2Net network.
在一实施例中,步骤S170之前,还包括:In an embodiment, before step S170, the method further includes:
将用于获取心电图影像的提示信息发送至用户端或智能心电仪;Send the prompt information for obtaining the ECG image to the client or the smart electrocardiograph;
接收用户端或智能心电仪根据所述提示信息发送的心电图影像。Receive the electrocardiogram image sent by the user terminal or the smart electrocardiograph according to the prompt information.
在本实施例中,服务器中提供了用户端上传心电图影像和智能心电仪上传心电图影像的两个接口,服务器在获取心电图影像时,可以向用户端或智能心电仪发送用于获取心电图影像的提示信息。通过这一通知方式,能在进行语义向量的提取后,更加快速的触发获取图像分类结果的处理过程。In this embodiment, the server provides two interfaces for uploading ECG images from the client and the smart electrocardiograph. When acquiring the ECG images, the server can send to the client or the smart electrocardiograph for obtaining the ECG images. Prompt information. Through this notification method, after the extraction of the semantic vector, the process of obtaining the image classification result can be triggered more quickly.
S180、调用预先训练的Light GBM模型,将所述语义向量及所述输出向量输入至所述Light GBM模型进行分类,得到对应的分类结果。S180. Invoke a pre-trained Light GBM model, input the semantic vector and the output vector into the Light GBM model for classification, and obtain a corresponding classification result.
在本实施例中,在整合了语义向量和心电图影像对应的输出向量之后,服务器得到了一组完整的特征向量,基于目前的Light GBM模型,进行学习与判断,从而得到对应的分类结果。其中,Light GBM模型是一个使用基于决策树的学习算法,其拥有更快的训练速度,更高的准确率和大数据处理能力。In this embodiment, after integrating the semantic vector and the output vector corresponding to the ECG image, the server obtains a complete set of feature vectors, and performs learning and judgment based on the current Light GBM model to obtain the corresponding classification results. Among them, the Light GBM model is a learning algorithm based on decision trees, which has faster training speed, higher accuracy and big data processing capabilities.
在一实施例中,步骤S180包括:In an embodiment, step S180 includes:
将所述语义向量及所述输出向量进行独立特征合并,得到图文特征向量;Combining the semantic vector and the output vector with independent features to obtain a graphic feature vector;
将所述图文特征向量通过所述Light GBM模型中基于直方图的决策进行分类,得到对应的分类结果。The graphic feature vector is classified through the histogram-based decision in the Light GBM model to obtain a corresponding classification result.
在本实施例中,将所述语义向量及所述输出向量进行独立特征合并是为了减少特征维度,以提升计算效率。由于所述语义向量及所述输出向量是互斥的,这样两个特征捆绑起来才不会丢失信息。当完成了独立特征合并得到了图文特征向量后,采用基于直方图的决策进行分类,由于直方图只需对直方图统计量计算信息增益,相比较于预排序算法每次都遍历所有的值,信息增益的计算量要小很多,而且内存空间需要相对小很多。In this embodiment, the purpose of combining the semantic vector and the output vector with independent features is to reduce feature dimensions and improve calculation efficiency. Since the semantic vector and the output vector are mutually exclusive, the two features are bundled together so that no information will be lost. When the independent feature combination is completed and the graphic feature vector is obtained, the histogram-based decision is used for classification. Since the histogram only needs to calculate the information gain for the histogram statistics, it is compared with the pre-sorting algorithm, which traverses all values every time , The calculation amount of information gain is much smaller, and the memory space needs to be relatively small.
在一实施例中,步骤S180之后还包括:In an embodiment, after step S180, the method further includes:
调用预先存储的文本模板,将所述分类结果填充至文本模板得到当前文本;Call a pre-stored text template, and fill the classification result into the text template to obtain the current text;
将所述当前文本发送至用户端;Sending the current text to the client;
将所述当前文本上传至区块链网络。Upload the current text to the blockchain network.
在本实施例中,当结合了文本语义识别得到的语义向量以及心电图影像对应的输出向量后,通过所述Light GBM模型进行分类更加准确,能确定语义向量及输出向量确定的分类结果(例如有心脏疾病)。此时,可以调用服务器中预先存储的文本模板,将所述分类结果填充至文本模板后得到当前文本(除了包括分类结果,还有一些专业名称解释等),之后将所述当前文本发送至用户端以供用户查看。In this embodiment, when the semantic vector obtained by the text semantic recognition and the output vector corresponding to the ECG image are combined, the classification by the Light GBM model is more accurate, and the classification result determined by the semantic vector and the output vector can be determined (for example, there are heart disease). At this point, you can call the pre-stored text template in the server, fill the classification result into the text template and get the current text (in addition to the classification result, there are some professional name explanations, etc.), and then send the current text to the user End for users to view.
而且服务器可以作为一个区块链节点设备,以将所述当前文本上传至区块链网络,充分利用区块链数据不可篡改的特性,实现数据证据固化。In addition, the server can be used as a blockchain node device to upload the current text to the blockchain network, making full use of the non-tamperable characteristics of the blockchain data, and realizing the solidification of data evidence.
其中,基于所述当前文本得到对应的摘要信息,具体来说,摘要信息由所述当前文本进行散列处理得到,比如利用sha256算法处理得到。将摘要信息上传至区块链可保证其安全性和对用户的公正透明性。用户设备可以从区块链中下载得该摘要信息,以便查证所述当前文本是否被篡改。本示例所指区块链是分布式数据存储、点对点传输、共识机制、加密算法等计算机技术的新型应用模式。区块链(Blockchain),本质上是一个去中心化的数据库,是一串使用密码学方法相关联产生的数据块,每一个数据块中包含了一批次网络交易的信息,用于验证其信息的有效性(防伪)和生成下一个区块。区块链可以包括区块链底层平台、平台产品服务层以及应用服务层等。Wherein, the corresponding summary information is obtained based on the current text. Specifically, the summary information is obtained by hashing the current text, for example, obtained by using the sha256 algorithm. Uploading summary information to the blockchain can ensure its security and fairness and transparency to users. The user equipment can download the summary information from the blockchain to verify whether the current text has been tampered with. The blockchain referred to in this example is a new application mode of computer technology such as distributed data storage, point-to-point transmission, consensus mechanism, and encryption algorithm. Blockchain, essentially a decentralized database, is a series of data blocks associated with cryptographic methods. Each data block contains a batch of network transaction information for verification. The validity of the information (anti-counterfeiting) and the generation of the next block. The blockchain can include the underlying platform of the blockchain, the platform product service layer, and the application service layer.
该方法实现了结合用户端上传的文字信息与心电图影像对应的图像信息,再经过Light GBM算法来进行分类,提升了分类结果的准确度与可信度。This method combines the text information uploaded by the user terminal and the image information corresponding to the ECG image, and then performs classification through the Light GBM algorithm, which improves the accuracy and credibility of the classification results.
本申请实施例还提供一种基于语义和图像识别的心电信息提取装置,该基于语义和图像识别的心电信息提取装置用于执行前述基于语义和图像识别的心电信息提取方法的任一实施例。具体地,请参阅图5,图5是本申请实施例提供的基于语义和图像识别的心电信息提取装置的示意性框图。该基于语义和图像识别的心电信息提取装置100可以配置于服务器中。The embodiment of the present application also provides an electrocardiographic information extraction device based on semantic and image recognition. The electrocardiographic information extraction device based on semantic and image recognition is used to perform any of the foregoing electrocardiographic information extraction methods based on semantic and image recognition. Examples. Specifically, please refer to FIG. 5, which is a schematic block diagram of an electrocardiographic information extraction device based on semantic and image recognition provided by an embodiment of the present application. The electrocardiographic information extraction device 100 based on semantic and image recognition can be configured in a server.
如图5所示,基于语义和图像识别的心电信息提取装置100包括:文本描述信息接收单元110、关键词判断单元120、第一文本信息获取单元130、引导问题集发送单元140、第二文本信息获取单元150、语义向量获取单元160、影像分类单元170、分类结果获取单元180。As shown in FIG. 5, the electrocardiographic information extraction device 100 based on semantic and image recognition includes: a text description information receiving unit 110, a keyword judgment unit 120, a first text information acquiring unit 130, a guide question set sending unit 140, and a second The text information acquisition unit 150, the semantic vector acquisition unit 160, the image classification unit 170, and the classification result acquisition unit 180.
文本描述信息接收单元110,用于接收用户端上传的文本描述信息。The text description information receiving unit 110 is configured to receive the text description information uploaded by the client.
在本实施例中,为了更加完整的获取基于心电图影像和描述文字的输出文本信息,此时可以先提示用户端上传文本描述信息。用户结合自身情况在用户端上编辑一段自我描述后上传至服务器。In this embodiment, in order to obtain the output text information based on the electrocardiogram image and the description text more completely, the user terminal may be prompted to upload the text description information at this time. The user edits a self-description on the user terminal according to his own situation and uploads it to the server.
关键词判断单元120,用于判断所述文本描述信息中是否包括预设的关键词。The keyword judgment unit 120 is configured to judge whether the text description information includes preset keywords.
在本实施例中,由于用户通过用户端上传的文本描述信息可能是一些与用户自身健康状态不太相关的描述,此时为了更准确的获取文本信息,可以先在服务器中检测并判断所述文本描述信息中是否包括预设的关键词(如胸闷气短、心跳过快等)。In this embodiment, since the text description information uploaded by the user through the user terminal may be some descriptions that are not closely related to the user’s own health status, at this time, in order to obtain the text information more accurately, the server can first detect and determine the description. Whether the text description information includes preset keywords (such as chest tightness, shortness of breath, fast heartbeat, etc.).
第一文本信息获取单元130,用于若所述文本描述信息中包括所述关键词,获取所述文本描述信息以作为当前待识别文本信息。The first text information obtaining unit 130 is configured to obtain the text description information as the current text information to be recognized if the keyword is included in the text description information.
在本实施例中,当判断所述文本描述信息中包括所述关键词,表示所述文本描述信息中包括有效信息,直接获取所述文本描述信息以作为当前待识别文本信息。In this embodiment, when it is determined that the text description information includes the keyword, it means that the text description information includes valid information, and the text description information is directly acquired as the current text information to be recognized.
引导问题集发送单元140,用于若所述文本描述信息中不包括所述关键词,调用预先存储的引导问题集发送至用户端。The guiding question set sending unit 140 is configured to call a pre-stored guiding question set and send it to the user terminal if the keyword is not included in the text description information.
在本实施例中,当判断所述文本描述信息中不包括所述关键词,表示所述文本描述信息中不包括有效信息,此时需要服务器调用引导问题集,将引导问题集发送至用户端,以指导用户补充信息。例如,引导问题集包括“是否胸闷气短?”、“是否心跳过快?”等多个引导问题。通过设置这一引导问题集,可以高效的引导用户回复有效信息,提高后续关键词提取和语义向量提取的效率。In this embodiment, when it is judged that the text description information does not include the keyword, it means that the text description information does not include valid information. At this time, the server needs to call the guided question set and send the guided question set to the client To guide users to supplement information. For example, the guidance question set includes multiple guidance questions such as "whether chest tightness and shortness of breath?" and "whether the heartbeat is fast?" By setting up this guide question set, users can efficiently guide users to reply to valid information, and the efficiency of subsequent keyword extraction and semantic vector extraction can be improved.
第二文本信息获取单元150,用于接收用户端根据所述引导问题集对应发送的回复文本信息以作为当前待识别文本信息。The second text information acquiring unit 150 is configured to receive the reply text information correspondingly sent by the user terminal according to the guide question set as the current text information to be recognized.
在本实施例中,在用户端与服务器基于所述引导问题集的多轮对话之后,整合用户的文字回答,即可得到回复文本信息,以作为当前待识别文本信息。通过这一引导回答式的方式,当前获取的回复文本信息中包括的有效信息更多。In this embodiment, after multiple rounds of dialogue between the user and the server based on the guide question set, the text answers of the users are integrated, and the reply text information can be obtained as the current text information to be recognized. Through this guided response approach, the currently obtained response text information includes more effective information.
语义向量获取单元160,用于对所述当前待识别文本信息进行语义识别,以得到与所述当前待识别文本信息对应的语义向量。The semantic vector obtaining unit 160 is configured to perform semantic recognition on the current text information to be recognized to obtain a semantic vector corresponding to the current text information to be recognized.
在本实施例中,当获取了当前待识别文本信息后,为了提取其中的关键信息,可以获取其中关键词对应的词向量以组成语义向量。In this embodiment, after obtaining the current text information to be recognized, in order to extract the key information therein, the word vectors corresponding to the keywords can be obtained to form a semantic vector.
在一实施例中,如图6所示,语义向量获取单元160包括:In an embodiment, as shown in FIG. 6, the semantic vector obtaining unit 160 includes:
关键词提取单元161,用于调用预先训练的BERT模型,将所述当前待识别文本信息通过所述BERT模型进行关键词提取,得到与所述当前待识别文本信息对应的文本关键词集;其中,所述BERT模型表示Transformers模型的双向编码器表示模型;The keyword extraction unit 161 is configured to call a pre-trained BERT model, extract keywords from the current text information to be recognized through the BERT model, and obtain a text keyword set corresponding to the current text information to be recognized; wherein , The BERT model represents a bidirectional encoder representation model of the Transformers model;
词向量获取单元162,用于将所述文本关键词集中各文本关键词进行独热编码,得到各文本关键词分别对应的词向量;The word vector obtaining unit 162 is configured to perform one-hot encoding on each text keyword in the text keyword set to obtain the word vector corresponding to each text keyword;
语义向量计算单元163,用于根据各文本关键词分别对应的词向量以及各文本关键词分别对应的权重值,计算得到与所述当前待识别文本对应的语义向量。The semantic vector calculation unit 163 is configured to calculate the semantic vector corresponding to the current text to be recognized according to the word vector corresponding to each text keyword and the weight value corresponding to each text keyword.
在本实施例中,在获取文本中的关键词时,通过BERT模型(即Transformers模型的双向编码器表示模型)对所述当前待识别文本信息进行关键词提取,之后还可对各关键词进行校正以与医学上的专业术语对应(例如将跳很快校正替换为心跳过快),最后将各关键词对应转化为词向量后计算得到与所述当前待识别文本对应的语义向量。In this embodiment, when acquiring keywords in the text, keyword extraction is performed on the current text information to be recognized through the BERT model (that is, the two-way encoder representation model of the Transformers model), and then each keyword can be extracted The correction corresponds to the medical terminology (for example, the fast correction is replaced by fast heartbeat). Finally, the corresponding keywords are converted into word vectors and then the semantic vectors corresponding to the current text to be recognized are calculated.
其中,BERT模型采用Transformer Encoder(即Transformer结构中的编码器)作为特征提取器,由Nx个完全一样的layer组成,每个layer有2个sub-layer(即子层),分别是:Multi-Head Self-Attention机制(即多头自注意力机制)、Position-Wise全连接前向神经网络。对于每个sub-layer,都添加了2个操作:残差连接Residual Connection和归一化Normalization。Among them, the BERT model uses Transformer Encoder (ie the encoder in the Transformer structure) as the feature extractor, which is composed of Nx exactly the same layers, and each layer has 2 sub-layers (ie sub-layers), which are: Multi- Head Self-Attention mechanism (ie multi-head self-attention mechanism), Position-Wise fully connected forward neural network. For each sub-layer, two operations are added: Residual Connection and Normalization.
而且BERT模型的输入是一个线性序列,支持单句文本和句对文本,句首用符号[CLS]表示,句尾用符号[SEP]表示,如果是句对,句子之间添加符号[SEP]。Moreover, the input of the BERT model is a linear sequence, which supports single sentence text and sentence pair text. The beginning of the sentence is represented by the symbol [CLS], and the end of the sentence is represented by the symbol [SEP]. If it is a sentence pair, the symbol [SEP] is added between the sentences.
BERT模型的预训练采用了MLM(MLM是Masked LM的简称,表示掩盖式语言模型)和NSP(NSP是Next Sentence Prediction的简称,表示预测下一句模型)两种策略用于模型预训练。The pre-training of the BERT model adopts two strategies for model pre-training: MLM (MLM is the abbreviation of Masked LM, which stands for masked language model) and NSP (NSP is the abbreviation of Next Sentence Prediction, which stands for predicting the next sentence model).
当通过BERT模型将所述当前待识别文本信息进行关键词提取,得到与所述当前待识别文本信息对应的文本关键词集后,将所述文本关键词集中各文本关键词进行独热编码,得到各文本关键词分别对应的词向量。由于在语料中是已知各关键词的权重值,此时根据各文本关键词分别对应的词向量以及各文本关键词分别对应的权重值,计算得到与所述当前待识别文本对应的语义向量。通过这一方式提取的语义向量,能更加准确的表征所述当前待识别文本信息。After keyword extraction is performed on the current text information to be recognized through the BERT model, and the text keyword set corresponding to the current text information to be recognized is obtained, each text keyword in the text keyword set is one-hot encoded, Obtain the word vector corresponding to each text keyword. Since the weight value of each keyword is known in the corpus, at this time, according to the word vector corresponding to each text keyword and the weight value corresponding to each text keyword, the semantic vector corresponding to the current text to be recognized is calculated . The semantic vector extracted in this way can more accurately represent the current text information to be recognized.
影像分类单元170,用于接收上传的心电图影像,调用预先训练的基于注意力机制的Res2Net分类网络,将所述心电图影像根据所述基于注意力机制的Res2Net分类网络进行分类,得到对应的输出向量。The image classification unit 170 is configured to receive the uploaded electrocardiogram images, call the pre-trained Res2Net classification network based on the attention mechanism, and classify the electrocardiogram images according to the attention mechanism-based Res2Net classification network to obtain the corresponding output vector .
在本实施例中,由于之前已根据当前待识别文本信息进行语义识别后,得到了对应的语义向量,如果仅仅基于语义向量作为分类模型的数据,可能因导致向量表征的内容较少,影响最终分类结果,此时可以进一步提示用户上传心电图影像,增加一些图片特征与语义向量相结合,使得最终的向量表征内容丰富,更有利于得到准确的分类结果。In this embodiment, since the corresponding semantic vector has been obtained after semantic recognition based on the current text information to be recognized, if the semantic vector is only used as the data of the classification model, the content of the vector representation may be less, which affects the final The classification result, at this time, can further prompt the user to upload the ECG image, and add some picture features combined with the semantic vector, so that the final vector representation content is rich, which is more conducive to obtaining accurate classification results.
在一实施例中,如图7所示,影像分类单元170包括:In one embodiment, as shown in FIG. 7, the image classification unit 170 includes:
矩阵获取单元171,用于获取所述心电图影像对应的像素矩阵;The matrix obtaining unit 171 is configured to obtain a pixel matrix corresponding to the electrocardiogram image;
形态特征向量获取单元172,用于将所述像素矩阵作为所述基于注意力机制的Res2Net分类网络中Res2Net网络的输入进行运算,得到形态特征向量;The morphological feature vector obtaining unit 172 is configured to use the pixel matrix as the input of the Res2Net network in the Res2Net classification network based on the attention mechanism to perform operations to obtain a morphological feature vector;
输出向量计算单元173,用于将所述形态特征向量作为所述基于注意力机制的Res2Net 分类网络中注意力机制结构进行运算,得到输出向量。The output vector calculation unit 173 is configured to use the morphological feature vector as the attention mechanism structure in the attention mechanism-based Res2Net classification network to perform operations to obtain an output vector.
在本实施例中,当接收到了心电图影像后,将心电图影像对应的像素矩阵输入到深度学习的Res2Net网络,学习图片的形态特诊,再输入到注意力结构中,让模型更专注于找到输入数据中与输出更加相关的有用信息,然后对这一区域投入更多的注意力资源,从而提高输出的质量。Res2Net(其是ResNet网络也即残差网络的升级版),Res2Net相对于ResNet不仅在识别的准确率上有提升,对模型的大小和参数也进行了优化,这种更加轻量化的模型可以提高反应速度并且减少服务器对硬件的要求。In this embodiment, when the ECG image is received, the pixel matrix corresponding to the ECG image is input to the deep learning Res2Net network to learn the morphological special diagnosis of the picture, and then input it into the attention structure, allowing the model to focus more on finding the input The useful information in the data is more relevant to the output, and then more attention resources are devoted to this area, thereby improving the quality of the output. Res2Net (which is an upgraded version of the ResNet network, that is, the residual network). Compared with ResNet, Res2Net not only improves the accuracy of recognition, but also optimizes the size and parameters of the model. This more lightweight model can improve Response speed and reduce the server's hardware requirements.
在一实施例中,形态特征向量获取单元172还用于:In an embodiment, the morphological feature vector obtaining unit 172 is further configured to:
将所述像素矩阵输入至所述Res2Net网络中依次卷积、在多层残差结构进行恒等映射、池化及全连接,得到形态特征向量。The pixel matrix is input into the Res2Net network for sequential convolution, identity mapping, pooling, and full connection are performed in a multi-layer residual structure to obtain a morphological feature vector.
在本实施例中,Res2Net网络中是将卷积神经网络中的多层卷积层中除了第一层卷积层之外的卷积层均做一个残差块的改造以实现恒等映射,从而提高整个Res2Net网络识别的准确率。In this embodiment, in the Res2Net network, all the convolutional layers in the multi-layer convolutional layer in the convolutional neural network except the first convolutional layer are transformed into a residual block to realize the identity mapping. Thereby improving the accuracy of recognition of the entire Res2Net network.
在一实施例中,基于语义和图像识别的心电信息提取装置100还包括:In an embodiment, the device 100 for extracting ECG information based on semantic and image recognition further includes:
提示信息发送单元,用于将用于获取心电图影像的提示信息发送至用户端或智能心电仪;The reminder information sending unit is used to send the reminder information for obtaining the electrocardiogram image to the user terminal or the smart electrocardiograph;
心电图影像接收单元,用于接收用户端或智能心电仪根据所述提示信息发送的心电图影像。The electrocardiogram image receiving unit is used to receive the electrocardiogram image sent by the user terminal or the smart electrocardiograph according to the prompt information.
在本实施例中,服务器中提供了用户端上传心电图影像和智能心电仪上传心电图影像的两个接口,服务器在获取心电图影像是,可以向用户端或智能心电仪发送用于获取心电图影像的提示信息。通过这一通知方式,能在进行语义向量的提取后,更加快速的触发获取图像分类结果的处理过程。In this embodiment, the server provides two interfaces for uploading ECG images on the client side and uploading ECG images from the smart electrocardiograph. The server can send ECG images to the client or the smart electrocardiograph when acquiring the ECG images. Prompt information. Through this notification method, after the extraction of the semantic vector, the process of obtaining the image classification result can be triggered more quickly.
分类结果获取单元180,用于调用预先训练的Light GBM模型,将所述语义向量及所述输出向量输入至所述Light GBM模型进行分类,得到对应的分类结果。The classification result obtaining unit 180 is configured to call a pre-trained Light GBM model, input the semantic vector and the output vector into the Light GBM model for classification, and obtain a corresponding classification result.
在本实施例中,在整合了语义向量和心电图影像对应的输出向量之后,服务器得到了一组完整的特征向量,基于目前的Light GBM模型,进行学习与判断,从而得到对应的分类结果。其中,Light GBM模型是一个使用基于决策树的学习算法,其拥有更快的训练速度,更高的准确率和大数据处理能力。In this embodiment, after integrating the semantic vector and the output vector corresponding to the ECG image, the server obtains a complete set of feature vectors, and performs learning and judgment based on the current Light GBM model to obtain the corresponding classification results. Among them, the Light GBM model is a learning algorithm based on decision trees, which has faster training speed, higher accuracy and big data processing capabilities.
在一实施例中,分类结果获取单元180包括:In an embodiment, the classification result obtaining unit 180 includes:
图文特征向量获取单元,用于将所述语义向量及所述输出向量进行独立特征合并,得到图文特征向量;A graphic feature vector obtaining unit, configured to merge the semantic vector and the output vector with independent features to obtain a graphic feature vector;
决策分类单元,用于将所述图文特征向量通过所述Light GBM模型中基于直方图的决策进行分类,得到对应的分类结果。The decision classification unit is configured to classify the graphic feature vector through a histogram-based decision in the Light GBM model to obtain a corresponding classification result.
在本实施例中,将所述语义向量及所述输出向量进行独立特征合并是为了减少特征维度,以提升计算效率。由于所述语义向量及所述输出向量是互斥的,这样两个特征捆绑起来才不会丢失信息。当完成了独立特征合并得到了图文特征向量后,采用基于直方图的决策进行分类,由于直方图只需对直方图统计量计算信息增益,相比较于预排序算法每次都遍历所有的值,信息增益的计算量要小很多,而且内存空间需要相对小很多。In this embodiment, the purpose of combining the semantic vector and the output vector with independent features is to reduce feature dimensions and improve calculation efficiency. Since the semantic vector and the output vector are mutually exclusive, the two features are bundled together so that no information will be lost. When the independent feature combination is completed and the graphic feature vector is obtained, the histogram-based decision is used for classification. Since the histogram only needs to calculate the information gain for the histogram statistics, it is compared with the pre-sorting algorithm, which traverses all values every time , The calculation amount of information gain is much smaller, and the memory space needs to be relatively small.
在一实施例中,基于语义和图像识别的心电信息提取装置100还包括:In an embodiment, the device 100 for extracting ECG information based on semantic and image recognition further includes:
当前文本生成单元,用于调用预先存储的文本模板,将所述分类结果填充至文本模板得到当前文本;The current text generation unit is configured to call a pre-stored text template, and fill the classification result into the text template to obtain the current text;
当前文本发送单元,用于将所述当前文本发送至用户端;The current text sending unit is used to send the current text to the user terminal;
上链单元,用于将所述当前文本上传至区块链网络。The on-chain unit is used to upload the current text to the blockchain network.
在本实施例中,当结合了文本语义识别得到的语义向量以及心电图影像对应的输出向量后,通过所述Light GBM模型进行分类更加准确,能确定语义向量及输出向量确定的分类结果(例如有心脏疾病)。此时,可以调用服务器中预先存储的文本模板,将所述分类结果填充至文本模板后得到当前文本(除了包括分类结果,还有一些专业名称解释等),之后将所述当前文本发送至用户端以供用户查看。In this embodiment, when the semantic vector obtained by the text semantic recognition and the output vector corresponding to the ECG image are combined, the classification by the Light GBM model is more accurate, and the classification result determined by the semantic vector and the output vector can be determined (for example, there are heart disease). At this point, you can call the pre-stored text template in the server, fill the classification result into the text template and get the current text (in addition to the classification result, there are some professional name explanations, etc.), and then send the current text to the user End for users to view.
而且服务器可以作为一个区块链节点设备,以将所述当前文本上传至区块链网络,充分利用区块链数据不可篡改的特性,实现数据证据固化。In addition, the server can be used as a blockchain node device to upload the current text to the blockchain network, making full use of the non-tamperable characteristics of the blockchain data, and realizing the solidification of data evidence.
其中,基于所述当前文本得到对应的摘要信息,具体来说,摘要信息由所述当前文本进行散列处理得到,比如利用sha256算法处理得到。将摘要信息上传至区块链可保证其安全性和对用户的公正透明性。用户设备可以从区块链中下载得该摘要信息,以便查证所述当前文本是否被篡改。本示例所指区块链是分布式数据存储、点对点传输、共识机制、加密算法等计算机技术的新型应用模式。区块链(Blockchain),本质上是一个去中心化的数据库,是一串使用密码学方法相关联产生的数据块,每一个数据块中包含了一批次网络交易的信息,用于验证其信息的有效性(防伪)和生成下一个区块。区块链可以包括区块链底层平台、平台产品服务层以及应用服务层等。Wherein, the corresponding summary information is obtained based on the current text. Specifically, the summary information is obtained by hashing the current text, for example, obtained by using the sha256 algorithm. Uploading summary information to the blockchain can ensure its security and fairness and transparency to users. The user equipment can download the summary information from the blockchain to verify whether the current text has been tampered with. The blockchain referred to in this example is a new application mode of computer technology such as distributed data storage, point-to-point transmission, consensus mechanism, and encryption algorithm. Blockchain, essentially a decentralized database, is a series of data blocks associated with cryptographic methods. Each data block contains a batch of network transaction information for verification. The validity of the information (anti-counterfeiting) and the generation of the next block. The blockchain can include the underlying platform of the blockchain, the platform product service layer, and the application service layer.
该装置实现了结合用户端上传的文字信息与心电图影像对应的图像信息,再经过Light GBM算法来进行分类,提升了分类结果的准确度与可信度。The device realizes the combination of the text information uploaded by the user terminal and the image information corresponding to the ECG image, and then the Light GBM algorithm is used for classification, which improves the accuracy and credibility of the classification results.
上述基于语义和图像识别的心电信息提取装置可以实现为计算机程序的形式,该计算机程序可以在如图8所示的计算机设备上运行。The above-mentioned apparatus for extracting ECG information based on semantics and image recognition can be implemented in the form of a computer program, and the computer program can be run on a computer device as shown in FIG. 8.
请参阅图8,图8是本申请实施例提供的计算机设备的示意性框图。该计算机设备500是服务器,服务器可以是独立的服务器,也可以是多个服务器组成的服务器集群。Please refer to FIG. 8, which is a schematic block diagram of a computer device according to an embodiment of the present application. The computer device 500 is a server, and the server may be an independent server or a server cluster composed of multiple servers.
参阅图8,该计算机设备500包括通过系统总线501连接的处理器502、存储器和网络接口505,其中,存储器可以包括非易失性存储介质503和内存储器504。Referring to FIG. 8, the computer device 500 includes a processor 502, a memory, and a network interface 505 connected through a system bus 501, where the memory may include a non-volatile storage medium 503 and an internal memory 504.
该非易失性存储介质503可存储操作系统5031和计算机程序5032。该计算机程序5032被执行时,可使得处理器502执行基于语义和图像识别的心电信息提取方法。The non-volatile storage medium 503 can store an operating system 5031 and a computer program 5032. When the computer program 5032 is executed, the processor 502 can execute an electrocardiographic information extraction method based on semantics and image recognition.
该处理器502用于提供计算和控制能力,支撑整个计算机设备500的运行。The processor 502 is used to provide calculation and control capabilities, and support the operation of the entire computer device 500.
该内存储器504为非易失性存储介质503中的计算机程序5032的运行提供环境,该计算机程序5032被处理器502执行时,可使得处理器502执行基于语义和图像识别的心电信息提取方法。The internal memory 504 provides an environment for the operation of the computer program 5032 in the non-volatile storage medium 503. When the computer program 5032 is executed by the processor 502, the processor 502 can execute the method for extracting ECG information based on semantics and image recognition. .
该网络接口505用于进行网络通信,如提供数据信息的传输等。本领域技术人员可以理解,图8中示出的结构,仅仅是与本申请方案相关的部分结构的框图,并不构成对本申请方案所应用于其上的计算机设备500的限定,具体的计算机设备500可以包括比图中所示更多或更少的部件,或者组合某些部件,或者具有不同的部件布置。The network interface 505 is used for network communication, such as providing data information transmission. Those skilled in the art can understand that the structure shown in FIG. 8 is only a block diagram of a part of the structure related to the solution of the present application, and does not constitute a limitation on the computer device 500 to which the solution of the present application is applied. The specific computer device 500 may include more or fewer components than shown in the figure, or combine certain components, or have a different component arrangement.
其中,所述处理器502用于运行存储在存储器中的计算机程序5032,以实现本申请实施例公开的基于语义和图像识别的心电信息提取方法。Wherein, the processor 502 is configured to run a computer program 5032 stored in a memory to implement the method for extracting ECG information based on semantics and image recognition disclosed in the embodiments of the present application.
本领域技术人员可以理解,图8中示出的计算机设备的实施例并不构成对计算机设备具体构成的限定,在其他实施例中,计算机设备可以包括比图示更多或更少的部件,或者组合某些部件,或者不同的部件布置。例如,在一些实施例中,计算机设备可以仅包括存储器及处理器,在这样的实施例中,存储器及处理器的结构及功能与图8所示实施例一致,在此不再赘述。Those skilled in the art can understand that the embodiment of the computer device shown in FIG. 8 does not constitute a limitation on the specific configuration of the computer device. In other embodiments, the computer device may include more or less components than those shown in the figure. Or some parts are combined, or different parts are arranged. For example, in some embodiments, the computer device may only include a memory and a processor. In such an embodiment, the structures and functions of the memory and the processor are consistent with the embodiment shown in FIG. 8 and will not be repeated here.
应当理解,在本申请实施例中,处理器502可以是中央处理单元(Central Processing Unit,CPU),该处理器502还可以是其他通用处理器、数字信号处理器(Digital Signal Processor,DSP)、专用集成电路(Application Specific Integrated Circuit,ASIC)、现成可编程门阵列(Field-Programmable Gate Array,FPGA)或者其他可编程逻辑器件、分立门或者晶体管逻辑器件、分立硬件组件等。其中,通用处理器可以是微处理器或者该处理器也可以是任何常规的处理器等。It should be understood that, in this embodiment of the application, the processor 502 may be a central processing unit (Central Processing Unit, CPU), and the processor 502 may also be other general-purpose processors, digital signal processors (Digital Signal Processors, DSPs), Application Specific Integrated Circuit (ASIC), Field-Programmable Gate Array (FPGA) or other programmable logic devices, discrete gates or transistor logic devices, discrete hardware components, etc. Among them, the general-purpose processor may be a microprocessor or the processor may also be any conventional processor.
在本申请的另一实施例中提供计算机可读存储介质。所述计算机可读存储介质可以是非易失性,也可以是易失性。该计算机可读存储介质存储有计算机程序,其中计算机程序被处理器执行时实现本申请实施例公开的基于语义和图像识别的心电信息提取方法。In another embodiment of the present application, a computer-readable storage medium is provided. The computer-readable storage medium may be non-volatile or volatile. The computer-readable storage medium stores a computer program, where the computer program is executed by a processor to realize the method for extracting electrocardiographic information based on semantic and image recognition disclosed in the embodiments of the present application.
所属领域的技术人员可以清楚地了解到,为了描述的方便和简洁,上述描述的设备、装置和单元的具体工作过程,可以参考前述方法实施例中的对应过程,在此不再赘述。本领域 普通技术人员可以意识到,结合本文中所公开的实施例描述的各示例的单元及算法步骤,能够以电子硬件、计算机软件或者二者的结合来实现,为了清楚地说明硬件和软件的可互换性,在上述说明中已经按照功能一般性地描述了各示例的组成及步骤。这些功能究竟以硬件还是软件方式来执行取决于技术方案的特定应用和设计约束条件。专业技术人员可以对每个特定的应用来使用不同方法来实现所描述的功能,但是这种实现不应认为超出本申请的范围。Those skilled in the art can clearly understand that, for the convenience and conciseness of description, the specific working process of the above-described equipment, device, and unit can refer to the corresponding process in the foregoing method embodiment, which will not be repeated here. A person of ordinary skill in the art may be aware that the units and algorithm steps of the examples described in the embodiments disclosed herein can be implemented by electronic hardware, computer software, or a combination of both, in order to clearly illustrate the hardware and software Interchangeability, in the above description, the composition and steps of each example have been generally described in accordance with the function. Whether these functions are executed by hardware or software depends on the specific application and design constraint conditions of the technical solution. Professionals and technicians can use different methods for each specific application to implement the described functions, but such implementation should not be considered beyond the scope of this application.
在本申请所提供的几个实施例中,应该理解到,所揭露的设备、装置和方法,可以通过其它的方式实现。例如,以上所描述的装置实施例仅仅是示意性的,例如,所述单元的划分,仅仅为逻辑功能划分,实际实现时可以有另外的划分方式,也可以将具有相同功能的单元集合成一个单元,例如多个单元或组件可以结合或者可以集成到另一个系统,或一些特征可以忽略,或不执行。另外,所显示或讨论的相互之间的耦合或直接耦合或通信连接可以是通过一些接口、装置或单元的间接耦合或通信连接,也可以是电的,机械的或其它的形式连接。In the several embodiments provided in this application, it should be understood that the disclosed equipment, device, and method may be implemented in other ways. For example, the device embodiments described above are only illustrative. For example, the division of the units is only a logical function division. In actual implementation, there may be other division methods, or the units with the same function may be combined into one. Units, for example, multiple units or components can be combined or integrated into another system, or some features can be omitted or not implemented. In addition, the displayed or discussed mutual coupling or direct coupling or communication connection may be indirect coupling or communication connection through some interfaces, devices or units, and may also be electrical, mechanical or other forms of connection.
所述作为分离部件说明的单元可以是或者也可以不是物理上分开的,作为单元显示的部件可以是或者也可以不是物理单元,即可以位于一个地方,或者也可以分布到多个网络单元上。可以根据实际的需要选择其中的部分或者全部单元来实现本申请实施例方案的目的。The units described as separate components may or may not be physically separated, and the components displayed as units may or may not be physical units, that is, they may be located in one place, or they may be distributed on multiple network units. Some or all of the units may be selected according to actual needs to achieve the objectives of the solutions of the embodiments of the present application.
另外,在本申请各个实施例中的各功能单元可以集成在一个处理单元中,也可以是各个单元单独物理存在,也可以是两个或两个以上单元集成在一个单元中。上述集成的单元既可以采用硬件的形式实现,也可以采用软件功能单元的形式实现。In addition, the functional units in the various embodiments of the present application may be integrated into one processing unit, or each unit may exist alone physically, or two or more units may be integrated into one unit. The above-mentioned integrated unit can be implemented in the form of hardware or software functional unit.
所述集成的单元如果以软件功能单元的形式实现并作为独立的产品销售或使用时,可以存储在一个存储介质中。基于这样的理解,本申请的技术方案本质上或者说对现有技术做出贡献的部分,或者该技术方案的全部或部分可以以软件产品的形式体现出来,该计算机软件产品存储在一个存储介质中,包括若干指令用以使得一台计算机设备(可以是个人计算机,服务器,或者网络设备等)执行本申请各个实施例所述方法的全部或部分步骤。而前述的存储介质包括:U盘、移动硬盘、只读存储器(ROM,Read-Only Memory)、磁碟或者光盘等各种可以存储程序代码的介质。If the integrated unit is implemented in the form of a software functional unit and sold or used as an independent product, it can be stored in a storage medium. Based on this understanding, the technical solution of this application is essentially or the part that contributes to the existing technology, or all or part of the technical solution can be embodied in the form of a software product, and the computer software product is stored in a storage medium. It includes several instructions to make a computer device (which may be a personal computer, a server, or a network device, etc.) execute all or part of the steps of the method described in each embodiment of the present application. The aforementioned storage media include: U disk, mobile hard disk, read-only memory (ROM, Read-Only Memory), magnetic disk or optical disk and other media that can store program codes.
以上所述,仅为本申请的具体实施方式,但本申请的保护范围并不局限于此,任何熟悉本技术领域的技术人员在本申请揭露的技术范围内,可轻易想到各种等效的修改或替换,这些修改或替换都应涵盖在本申请的保护范围之内。因此,本申请的保护范围应以权利要求的保护范围为准。The above are only specific implementations of this application, but the protection scope of this application is not limited to this. Any person skilled in the art can easily think of various equivalents within the technical scope disclosed in this application. Modifications or replacements, these modifications or replacements shall be covered within the protection scope of this application. Therefore, the protection scope of this application shall be subject to the protection scope of the claims.
Claims (21)
- 一种基于语义和图像识别的心电信息提取方法,包括:An ECG information extraction method based on semantics and image recognition, including:接收用户端上传的文本描述信息;Receive text description information uploaded by the client;判断所述文本描述信息中是否包括预设的关键词;Determine whether the text description information includes preset keywords;若所述文本描述信息中包括所述关键词,获取所述文本描述信息以作为当前待识别文本信息;If the text description information includes the keyword, acquiring the text description information as the current text information to be recognized;若所述文本描述信息中不包括所述关键词,调用预先存储的引导问题集发送至用户端;If the keyword is not included in the text description information, call the pre-stored guide question set and send it to the user terminal;接收用户端根据所述引导问题集对应发送的回复文本信息以作为当前待识别文本信息;Receiving the reply text information correspondingly sent by the user terminal according to the guide question set as the current text information to be recognized;对所述当前待识别文本信息进行语义识别,以得到与所述当前待识别文本信息对应的语义向量;Performing semantic recognition on the currently to-be-recognized text information to obtain a semantic vector corresponding to the currently-to-be-recognized text information;接收上传的心电图影像,调用预先训练的基于注意力机制的Res2Net分类网络,将所述心电图影像根据所述基于注意力机制的Res2Net分类网络进行分类,得到对应的输出向量;以及Receiving the uploaded electrocardiogram image, invoking the pre-trained Res2Net classification network based on the attention mechanism, and classifying the electrocardiogram image according to the attention mechanism-based Res2Net classification network to obtain the corresponding output vector; and调用预先训练的Light GBM模型,将所述语义向量及所述输出向量输入至所述Light GBM模型进行分类,得到对应的分类结果。The pre-trained Light GBM model is called, the semantic vector and the output vector are input to the Light GBM model for classification, and the corresponding classification result is obtained.
- 根据权利要求1所述的基于语义和图像识别的心电信息提取方法,其中,所述对所述当前待识别文本信息进行语义识别,以得到与所述当前待识别文本信息对应的语义向量,包括:The method for extracting ECG information based on semantic and image recognition according to claim 1, wherein said performing semantic recognition on said current text information to be recognized to obtain a semantic vector corresponding to said current text information to be recognized, include:调用预先训练的BERT模型,将所述当前待识别文本信息通过所述BERT模型进行关键词提取,得到与所述当前待识别文本信息对应的文本关键词集;其中,所述BERT模型表示Transformers模型的双向编码器表示模型;Invoke a pre-trained BERT model, extract keywords from the current text information to be recognized through the BERT model, and obtain a text keyword set corresponding to the current text information to be recognized; wherein, the BERT model represents the Transformers model The two-way encoder representation model;将所述文本关键词集中各文本关键词进行独热编码,得到各文本关键词分别对应的词向量;Performing one-hot encoding on each text keyword in the text keyword set to obtain the word vector corresponding to each text keyword;根据各文本关键词分别对应的词向量以及各文本关键词分别对应的权重值,计算得到与所述当前待识别文本对应的语义向量。According to the word vector corresponding to each text keyword and the weight value corresponding to each text keyword, the semantic vector corresponding to the current text to be recognized is calculated.
- 根据权利要求1所述的基于语义和图像识别的心电信息提取方法,其中,所述接收上传的心电图影像,调用预先训练的基于注意力机制的Res2Net分类网络,将所述心电图影像根据所述基于注意力机制的Res2Net分类网络进行分类,得到对应的输出向量之前,还包括:The method for extracting ECG information based on semantic and image recognition according to claim 1, wherein said receiving the uploaded ECG image, calling a pre-trained Res2Net classification network based on the attention mechanism, and classifying the ECG image according to the The Res2Net classification network based on the attention mechanism performs classification, and before the corresponding output vector is obtained, it also includes:将用于获取心电图影像的提示信息发送至用户端或智能心电仪;Send the prompt information for obtaining the ECG image to the client or the smart electrocardiograph;接收用户端或智能心电仪根据所述提示信息发送的心电图影像。Receive the electrocardiogram image sent by the user terminal or the smart electrocardiograph according to the prompt information.
- 根据权利要求1所述的基于语义和图像识别的心电信息提取方法,其中,所述调用预先训练的基于注意力机制的Res2Net分类网络,将所述心电图影像根据所述基于注意力机制的Res2Net分类网络进行分类,得到对应的输出向量,包括:The method for extracting ECG information based on semantic and image recognition according to claim 1, wherein the pre-trained Res2Net classification network based on the attention mechanism is invoked, and the ECG image is based on the Res2Net based on the attention mechanism. The classification network performs classification and obtains the corresponding output vector, including:获取所述心电图影像对应的像素矩阵;Acquiring a pixel matrix corresponding to the electrocardiogram image;将所述像素矩阵作为所述基于注意力机制的Res2Net分类网络中Res2Net网络的输入进行运算,得到形态特征向量;Using the pixel matrix as an input of the Res2Net network in the Res2Net classification network based on the attention mechanism to perform operations to obtain a morphological feature vector;将所述形态特征向量作为所述基于注意力机制的Res2Net分类网络中注意力机制结构进行运算,得到输出向量。The morphological feature vector is used as the attention mechanism structure in the Res2Net classification network based on the attention mechanism to perform operations to obtain an output vector.
- 根据权利要求4所述的基于语义和图像识别的心电信息提取方法,其中,所述将所述像素矩阵作为所述基于注意力机制的Res2Net分类网络中Res2Net网络的输入进行运算,得到形态特征向量,包括:The method for extracting ECG information based on semantics and image recognition according to claim 4, wherein the pixel matrix is used as the input of the Res2Net network in the Res2Net classification network based on the attention mechanism to perform operations to obtain morphological features Vectors, including:将所述像素矩阵输入至所述Res2Net网络中依次卷积、在多层残差结构进行恒等映射、池化及全连接,得到形态特征向量。The pixel matrix is input into the Res2Net network for sequential convolution, identity mapping, pooling, and full connection are performed in a multi-layer residual structure to obtain a morphological feature vector.
- 根据权利要求1所述的基于语义和图像识别的心电信息提取方法,其中,所述将所述语义向量及所述输出向量输入至所述Light GBM模型进行分类,得到对应的分类结果,包括:The method for extracting ECG information based on semantics and image recognition according to claim 1, wherein said inputting said semantic vector and said output vector to said Light GBM model for classification to obtain a corresponding classification result comprises :将所述语义向量及所述输出向量进行独立特征合并,得到图文特征向量;Combining the semantic vector and the output vector with independent features to obtain a graphic feature vector;将所述图文特征向量通过所述Light GBM模型中基于直方图的决策进行分类,得到对应的分类结果。The graphic feature vector is classified through the histogram-based decision in the Light GBM model to obtain a corresponding classification result.
- 根据权利要求1所述的基于语义和图像识别的心电信息提取方法,其中,所述调用预先训练的Light GBM模型,将所述语义向量及所述输出向量输入至所述Light GBM模型进行分类,得到对应的分类结果之后,还包括:The method for extracting ECG information based on semantics and image recognition according to claim 1, wherein said calling a pre-trained Light GBM model, and inputting said semantic vector and said output vector to said Light GBM model for classification , After obtaining the corresponding classification results, it also includes:调用预先存储的文本模板,将所述分类结果填充至文本模板得到当前文本;Call a pre-stored text template, and fill the classification result into the text template to obtain the current text;将所述当前文本发送至用户端;Sending the current text to the client;将所述当前文本上传至区块链网络。Upload the current text to the blockchain network.
- 根据权利要求1所述的基于语义和图像识别的心电信息提取方法,其中,所述接收用户端根据所述引导问题集对应发送的回复文本信息以作为当前待识别文本信息,包括:The method for extracting ECG information based on semantics and image recognition according to claim 1, wherein the receiving text information corresponding to the response sent by the user terminal according to the guide question set is used as the current text information to be recognized, comprising:获取用户端与服务器根据所述引导问题集回复的多轮对话文本,在所述多轮对话文本文本中提取回复文本信息以作为当前待识别文本信息。Obtain multiple rounds of dialogue texts replies from the client and the server according to the guide question set, and extract the reply text information from the multiple rounds of dialogue text as the current text information to be recognized.
- 一种基于语义和图像识别的心电信息提取装置,包括:An ECG information extraction device based on semantic and image recognition, including:文本描述信息接收单元,用于接收用户端上传的文本描述信息;The text description information receiving unit is used to receive the text description information uploaded by the client;关键词判断单元,用于判断所述文本描述信息中是否包括预设的关键词;The keyword judgment unit is used to judge whether the text description information includes preset keywords;第一文本信息获取单元,用于若所述文本描述信息中包括所述关键词,获取所述文本描述信息以作为当前待识别文本信息;The first text information obtaining unit is configured to obtain the text description information as the current text information to be recognized if the keyword is included in the text description information;引导问题集发送单元,用于若所述文本描述信息中不包括所述关键词,调用预先存储的引导问题集发送至用户端;A guiding question set sending unit, configured to call a pre-stored guiding question set and send it to the user terminal if the keyword is not included in the text description information;第二文本信息获取单元,用于接收用户端根据所述引导问题集对应发送的回复文本信息以作为当前待识别文本信息;The second text information acquiring unit is configured to receive the reply text information correspondingly sent by the user terminal according to the guide question set as the current text information to be recognized;语义向量获取单元,用于对所述当前待识别文本信息进行语义识别,以得到与所述当前待识别文本信息对应的语义向量;A semantic vector obtaining unit, configured to perform semantic recognition on the current text information to be recognized to obtain a semantic vector corresponding to the current text information to be recognized;影像分类单元,用于接收上传的心电图影像,调用预先训练的基于注意力机制的Res2Net分类网络,将所述心电图影像根据所述基于注意力机制的Res2Net分类网络进行分类,得到对应的输出向量;以及The image classification unit is configured to receive uploaded electrocardiogram images, call a pre-trained Res2Net classification network based on the attention mechanism, and classify the electrocardiogram images according to the Res2Net classification network based on the attention mechanism to obtain a corresponding output vector; as well as分类结果获取单元,用于调用预先训练的Light GBM模型,将所述语义向量及所述输出向量输入至所述Light GBM模型进行分类,得到对应的分类结果。The classification result obtaining unit is configured to call a pre-trained Light GBM model, input the semantic vector and the output vector to the Light GBM model for classification, and obtain a corresponding classification result.
- 一种计算机设备,包括存储器、处理器及存储在所述存储器上并可在所述处理器上运行的计算机程序,所述处理器执行所述计算机程序时实现以下步骤:A computer device includes a memory, a processor, and a computer program that is stored on the memory and can run on the processor, and the processor implements the following steps when the processor executes the computer program:接收用户端上传的文本描述信息;Receive text description information uploaded by the client;判断所述文本描述信息中是否包括预设的关键词;Determine whether the text description information includes preset keywords;若所述文本描述信息中包括所述关键词,获取所述文本描述信息以作为当前待识别文本信息;If the text description information includes the keyword, acquiring the text description information as the current text information to be recognized;若所述文本描述信息中不包括所述关键词,调用预先存储的引导问题集发送至用户端;If the keyword is not included in the text description information, call the pre-stored guide question set and send it to the user terminal;接收用户端根据所述引导问题集对应发送的回复文本信息以作为当前待识别文本信息;Receiving the reply text information correspondingly sent by the user terminal according to the guide question set as the current text information to be recognized;对所述当前待识别文本信息进行语义识别,以得到与所述当前待识别文本信息对应的语义向量;Performing semantic recognition on the currently to-be-recognized text information to obtain a semantic vector corresponding to the currently-to-be-recognized text information;接收上传的心电图影像,调用预先训练的基于注意力机制的Res2Net分类网络,将所述心电图影像根据所述基于注意力机制的Res2Net分类网络进行分类,得到对应的输出向量;以及Receiving the uploaded electrocardiogram image, invoking the pre-trained Res2Net classification network based on the attention mechanism, and classifying the electrocardiogram image according to the attention mechanism-based Res2Net classification network to obtain the corresponding output vector; and调用预先训练的Light GBM模型,将所述语义向量及所述输出向量输入至所述Light GBM模型进行分类,得到对应的分类结果。The pre-trained Light GBM model is called, the semantic vector and the output vector are input to the Light GBM model for classification, and the corresponding classification result is obtained.
- 根据权利要求9所述的计算机设备,其中,所述对所述当前待识别文本信息进行语义识别,以得到与所述当前待识别文本信息对应的语义向量,包括:The computer device according to claim 9, wherein said performing semantic recognition on said current text information to be recognized to obtain a semantic vector corresponding to said current text information to be recognized comprises:调用预先训练的BERT模型,将所述当前待识别文本信息通过所述BERT模型进行关键 词提取,得到与所述当前待识别文本信息对应的文本关键词集;其中,所述BERT模型表示Transformers模型的双向编码器表示模型;Invoke a pre-trained BERT model, extract keywords from the current text information to be recognized through the BERT model, and obtain a text keyword set corresponding to the current text information to be recognized; wherein, the BERT model represents the Transformers model The two-way encoder representation model;将所述文本关键词集中各文本关键词进行独热编码,得到各文本关键词分别对应的词向量;Performing one-hot encoding on each text keyword in the text keyword set to obtain the word vector corresponding to each text keyword;根据各文本关键词分别对应的词向量以及各文本关键词分别对应的权重值,计算得到与所述当前待识别文本对应的语义向量。According to the word vector corresponding to each text keyword and the weight value corresponding to each text keyword, the semantic vector corresponding to the current text to be recognized is calculated.
- 根据权利要求9所述的计算机设备,其中,所述接收上传的心电图影像,调用预先训练的基于注意力机制的Res2Net分类网络,将所述心电图影像根据所述基于注意力机制的Res2Net分类网络进行分类,得到对应的输出向量之前,还包括:The computer device according to claim 9, wherein said receiving the uploaded electrocardiogram image, calling a pre-trained Res2Net classification network based on the attention mechanism, and processing the electrocardiogram image according to the Res2Net classification network based on the attention mechanism Before classifying and obtaining the corresponding output vector, it also includes:将用于获取心电图影像的提示信息发送至用户端或智能心电仪;Send the prompt information for obtaining the ECG image to the client or the smart electrocardiograph;接收用户端或智能心电仪根据所述提示信息发送的心电图影像。Receive the electrocardiogram image sent by the user terminal or the smart electrocardiograph according to the prompt information.
- 根据权利要求9所述的计算机设备,其中,所述调用预先训练的基于注意力机制的Res2Net分类网络,将所述心电图影像根据所述基于注意力机制的Res2Net分类网络进行分类,得到对应的输出向量,包括:The computer device according to claim 9, wherein the pre-trained Res2Net classification network based on the attention mechanism is invoked to classify the electrocardiogram image according to the Res2Net classification network based on the attention mechanism to obtain the corresponding output Vectors, including:获取所述心电图影像对应的像素矩阵;Acquiring a pixel matrix corresponding to the electrocardiogram image;将所述像素矩阵作为所述基于注意力机制的Res2Net分类网络中Res2Net网络的输入进行运算,得到形态特征向量;Using the pixel matrix as an input of the Res2Net network in the Res2Net classification network based on the attention mechanism to perform operations to obtain a morphological feature vector;将所述形态特征向量作为所述基于注意力机制的Res2Net分类网络中注意力机制结构进行运算,得到输出向量。The morphological feature vector is used as the attention mechanism structure in the Res2Net classification network based on the attention mechanism to perform operations to obtain an output vector.
- 根据权利要求12所述的计算机设备,其中,所述将所述像素矩阵作为所述基于注意力机制的Res2Net分类网络中Res2Net网络的输入进行运算,得到形态特征向量,包括:The computer device according to claim 12, wherein said using the pixel matrix as the input of the Res2Net network in the Res2Net classification network based on the attention mechanism to perform operations to obtain a morphological feature vector comprises:将所述像素矩阵输入至所述Res2Net网络中依次卷积、在多层残差结构进行恒等映射、池化及全连接,得到形态特征向量。The pixel matrix is input into the Res2Net network for sequential convolution, identity mapping, pooling, and full connection are performed in a multi-layer residual structure to obtain a morphological feature vector.
- 根据权利要求9所述的计算机设备,其中,所述将所述语义向量及所述输出向量输入至所述Light GBM模型进行分类,得到对应的分类结果,包括:The computer device according to claim 9, wherein said inputting said semantic vector and said output vector to said Light GBM model for classification to obtain a corresponding classification result comprises:将所述语义向量及所述输出向量进行独立特征合并,得到图文特征向量;Combining the semantic vector and the output vector with independent features to obtain a graphic feature vector;将所述图文特征向量通过所述Light GBM模型中基于直方图的决策进行分类,得到对应的分类结果。The graphic feature vector is classified through the histogram-based decision in the Light GBM model to obtain a corresponding classification result.
- 根据权利要求9所述的计算机设备,其中,所述调用预先训练的Light GBM模型,将所述语义向量及所述输出向量输入至所述Light GBM模型进行分类,得到对应的分类结果之后,还包括:The computer device according to claim 9, wherein the pre-trained Light GBM model is invoked, the semantic vector and the output vector are input to the Light GBM model for classification, and after the corresponding classification result is obtained, further include:调用预先存储的文本模板,将所述分类结果填充至文本模板得到当前文本;Call a pre-stored text template, and fill the classification result into the text template to obtain the current text;将所述当前文本发送至用户端;Sending the current text to the client;将所述当前文本上传至区块链网络。Upload the current text to the blockchain network.
- 根据权利要求9所述的计算机设备,其中,所述接收用户端根据所述引导问题集对应发送的回复文本信息以作为当前待识别文本信息,包括:9. The computer device according to claim 9, wherein the receiving text information corresponding to the response sent by the user terminal according to the guide question set is used as the current text information to be recognized, comprising:获取用户端与服务器根据所述引导问题集回复的多轮对话文本,在所述多轮对话文本文本中提取回复文本信息以作为当前待识别文本信息。Obtain multiple rounds of dialogue texts replies from the client and the server according to the guide question set, and extract the reply text information from the multiple rounds of dialogue text as the current text information to be recognized.
- 一种计算机可读存储介质,所述计算机可读存储介质存储有计算机程序,所述计算机程序当被处理器执行时使所述处理器执行以下操作:A computer-readable storage medium that stores a computer program that, when executed by a processor, causes the processor to perform the following operations:接收用户端上传的文本描述信息;Receive text description information uploaded by the client;判断所述文本描述信息中是否包括预设的关键词;Determine whether the text description information includes preset keywords;若所述文本描述信息中包括所述关键词,获取所述文本描述信息以作为当前待识别文本信息;If the text description information includes the keyword, acquiring the text description information as the current text information to be recognized;若所述文本描述信息中不包括所述关键词,调用预先存储的引导问题集发送至用户端;If the keyword is not included in the text description information, call the pre-stored guide question set and send it to the user terminal;接收用户端根据所述引导问题集对应发送的回复文本信息以作为当前待识别文本信息;Receiving the reply text information correspondingly sent by the user terminal according to the guide question set as the current text information to be recognized;对所述当前待识别文本信息进行语义识别,以得到与所述当前待识别文本信息对应的语义向量;Performing semantic recognition on the currently to-be-recognized text information to obtain a semantic vector corresponding to the currently-to-be-recognized text information;接收上传的心电图影像,调用预先训练的基于注意力机制的Res2Net分类网络,将所述心电图影像根据所述基于注意力机制的Res2Net分类网络进行分类,得到对应的输出向量;以及Receiving the uploaded electrocardiogram image, invoking the pre-trained Res2Net classification network based on the attention mechanism, and classifying the electrocardiogram image according to the attention mechanism-based Res2Net classification network to obtain the corresponding output vector; and调用预先训练的Light GBM模型,将所述语义向量及所述输出向量输入至所述Light GBM模型进行分类,得到对应的分类结果。The pre-trained Light GBM model is called, the semantic vector and the output vector are input to the Light GBM model for classification, and the corresponding classification result is obtained.
- 根据权利要求17所述的计算机可读存储介质,其中,所述对所述当前待识别文本信息进行语义识别,以得到与所述当前待识别文本信息对应的语义向量,包括:18. The computer-readable storage medium according to claim 17, wherein said performing semantic recognition on said currently to-be-recognized text information to obtain a semantic vector corresponding to said currently-to-be-recognized text information comprises:调用预先训练的BERT模型,将所述当前待识别文本信息通过所述BERT模型进行关键词提取,得到与所述当前待识别文本信息对应的文本关键词集;其中,所述BERT模型表示Transformers模型的双向编码器表示模型;Invoke a pre-trained BERT model, extract keywords from the current text information to be recognized through the BERT model, and obtain a text keyword set corresponding to the current text information to be recognized; wherein, the BERT model represents the Transformers model The two-way encoder representation model;将所述文本关键词集中各文本关键词进行独热编码,得到各文本关键词分别对应的词向量;Performing one-hot encoding on each text keyword in the text keyword set to obtain the word vector corresponding to each text keyword;根据各文本关键词分别对应的词向量以及各文本关键词分别对应的权重值,计算得到与所述当前待识别文本对应的语义向量。According to the word vector corresponding to each text keyword and the weight value corresponding to each text keyword, the semantic vector corresponding to the current text to be recognized is calculated.
- 根据权利要求17所述的计算机可读存储介质,其中,所述接收上传的心电图影像,调用预先训练的基于注意力机制的Res2Net分类网络,将所述心电图影像根据所述基于注意力机制的Res2Net分类网络进行分类,得到对应的输出向量之前,还包括:The computer-readable storage medium according to claim 17, wherein said receiving the uploaded ECG image, calling a pre-trained Res2Net classification network based on the attention mechanism, and classifying the ECG image according to the attention mechanism-based Res2Net Before the classification network performs classification and obtains the corresponding output vector, it also includes:将用于获取心电图影像的提示信息发送至用户端或智能心电仪;Send the prompt information for obtaining the ECG image to the client or the smart electrocardiograph;接收用户端或智能心电仪根据所述提示信息发送的心电图影像。Receive the electrocardiogram image sent by the user terminal or the smart electrocardiograph according to the prompt information.
- 根据权利要求17所述的计算机可读存储介质,其中,所述调用预先训练的基于注意力机制的Res2Net分类网络,将所述心电图影像根据所述基于注意力机制的Res2Net分类网络进行分类,得到对应的输出向量,包括:The computer-readable storage medium according to claim 17, wherein the pre-trained Res2Net classification network based on the attention mechanism is invoked, and the ECG image is classified according to the attention mechanism-based Res2Net classification network to obtain The corresponding output vector includes:获取所述心电图影像对应的像素矩阵;Acquiring a pixel matrix corresponding to the electrocardiogram image;将所述像素矩阵作为所述基于注意力机制的Res2Net分类网络中Res2Net网络的输入进行运算,得到形态特征向量;Using the pixel matrix as an input of the Res2Net network in the Res2Net classification network based on the attention mechanism to perform operations to obtain a morphological feature vector;将所述形态特征向量作为所述基于注意力机制的Res2Net分类网络中注意力机制结构进行运算,得到输出向量。The morphological feature vector is used as the attention mechanism structure in the Res2Net classification network based on the attention mechanism to perform operations to obtain an output vector.
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202011001748.5 | 2020-09-22 | ||
CN202011001748.5A CN111931717B (en) | 2020-09-22 | 2020-09-22 | Semantic and image recognition-based electrocardiogram information extraction method and device |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2021159751A1 true WO2021159751A1 (en) | 2021-08-19 |
Family
ID=73333988
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/CN2020/125059 WO2021159751A1 (en) | 2020-09-22 | 2020-10-30 | Semantic and image recognition-based electrocardiography information extraction method and apparatus, computer device, and storage medium |
Country Status (2)
Country | Link |
---|---|
CN (1) | CN111931717B (en) |
WO (1) | WO2021159751A1 (en) |
Cited By (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN113887422A (en) * | 2021-09-30 | 2022-01-04 | 中国平安人寿保险股份有限公司 | Table picture content extraction method, device and equipment based on artificial intelligence |
CN114092813A (en) * | 2021-11-25 | 2022-02-25 | 中国科学院空天信息创新研究院 | Industrial park image extraction method, model, electronic equipment and storage medium |
CN114299522A (en) * | 2022-01-10 | 2022-04-08 | 北京百度网讯科技有限公司 | Image recognition method, device and storage medium |
CN115860783A (en) * | 2022-12-23 | 2023-03-28 | 广东南粤分享汇控股有限公司 | E-commerce platform user feedback analysis method and system based on artificial intelligence |
CN117219249A (en) * | 2023-05-10 | 2023-12-12 | 威海市立医院 | Equipment positioning management system based on intelligent medical digital platform |
CN117540935A (en) * | 2024-01-09 | 2024-02-09 | 上海银行股份有限公司 | DAO operation management method based on block chain technology |
CN118334430A (en) * | 2024-04-18 | 2024-07-12 | 西南石油大学 | Cardiomyopathy classification method, system and medium based on deep neural network |
Families Citing this family (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN112597289B (en) * | 2020-12-24 | 2024-03-01 | 北京明略软件系统有限公司 | Query information classification method, model training method, device and electronic equipment |
CN112603330B (en) * | 2020-12-28 | 2022-07-26 | 中国科学院微电子研究所 | Electrocardiosignal identification and classification method |
CN113221869B (en) * | 2021-05-25 | 2023-09-05 | 中国平安人寿保险股份有限公司 | Medical invoice structured information extraction method, device equipment and storage medium |
CN113496228B (en) * | 2021-07-30 | 2024-03-26 | 大连海事大学 | Human body semantic segmentation method based on Res2Net, transUNet and cooperative attention |
CN114446434A (en) * | 2021-11-11 | 2022-05-06 | 中国科学院深圳先进技术研究院 | Report generation method, system and terminal equipment |
CN113920512B (en) * | 2021-12-08 | 2022-03-15 | 共道网络科技有限公司 | Image recognition method and device |
CN116431004B (en) * | 2023-06-01 | 2023-08-29 | 山东协和学院 | Control method and system for interactive behavior of rehabilitation robot |
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN109512442A (en) * | 2018-12-21 | 2019-03-26 | 杭州电子科技大学 | A kind of EEG fatigue state classification method based on LightGBM |
CN110543822A (en) * | 2019-07-29 | 2019-12-06 | 浙江理工大学 | finger vein identification method based on convolutional neural network and supervised discrete hash algorithm |
US10599774B1 (en) * | 2018-02-26 | 2020-03-24 | Facebook, Inc. | Evaluating content items based upon semantic similarity of text |
CN111444960A (en) * | 2020-03-26 | 2020-07-24 | 上海交通大学 | Skin disease image classification system based on multi-mode data input |
CN111678531A (en) * | 2020-06-18 | 2020-09-18 | 李�昊 | Subway path planning method based on LightGBM |
Family Cites Families (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN106778685A (en) * | 2017-01-12 | 2017-05-31 | 司马大大(北京)智能系统有限公司 | Electrocardiogram image-recognizing method, device and service terminal |
CN108764268A (en) * | 2018-04-02 | 2018-11-06 | 华南理工大学 | A kind of multi-modal emotion identification method of picture and text based on deep learning |
CN110363439A (en) * | 2019-07-19 | 2019-10-22 | 山东浪潮人工智能研究院有限公司 | A kind of credit-graded approach based on consumer demographics' portrait |
CN111159378B (en) * | 2019-12-30 | 2023-07-18 | 支付宝(杭州)信息技术有限公司 | Method and device for classifying problem description information |
CN111317464B (en) * | 2020-02-13 | 2021-06-22 | 中国科学院自动化研究所 | Electrocardiogram analysis method and device |
CN111341408B (en) * | 2020-03-26 | 2024-06-28 | 联影智能医疗科技(北京)有限公司 | Image report template generation method, computer equipment and storage medium |
CN111627531A (en) * | 2020-06-02 | 2020-09-04 | 中国医学科学院阜外医院深圳医院(深圳市孙逸仙心血管医院) | Medical image classification processing system based on artificial intelligence |
-
2020
- 2020-09-22 CN CN202011001748.5A patent/CN111931717B/en active Active
- 2020-10-30 WO PCT/CN2020/125059 patent/WO2021159751A1/en active Application Filing
Patent Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US10599774B1 (en) * | 2018-02-26 | 2020-03-24 | Facebook, Inc. | Evaluating content items based upon semantic similarity of text |
CN109512442A (en) * | 2018-12-21 | 2019-03-26 | 杭州电子科技大学 | A kind of EEG fatigue state classification method based on LightGBM |
CN110543822A (en) * | 2019-07-29 | 2019-12-06 | 浙江理工大学 | finger vein identification method based on convolutional neural network and supervised discrete hash algorithm |
CN111444960A (en) * | 2020-03-26 | 2020-07-24 | 上海交通大学 | Skin disease image classification system based on multi-mode data input |
CN111678531A (en) * | 2020-06-18 | 2020-09-18 | 李�昊 | Subway path planning method based on LightGBM |
Cited By (12)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN113887422A (en) * | 2021-09-30 | 2022-01-04 | 中国平安人寿保险股份有限公司 | Table picture content extraction method, device and equipment based on artificial intelligence |
CN113887422B (en) * | 2021-09-30 | 2024-05-31 | 中国平安人寿保险股份有限公司 | Table picture content extraction method, device and equipment based on artificial intelligence |
CN114092813A (en) * | 2021-11-25 | 2022-02-25 | 中国科学院空天信息创新研究院 | Industrial park image extraction method, model, electronic equipment and storage medium |
CN114299522A (en) * | 2022-01-10 | 2022-04-08 | 北京百度网讯科技有限公司 | Image recognition method, device and storage medium |
CN114299522B (en) * | 2022-01-10 | 2023-08-29 | 北京百度网讯科技有限公司 | Image recognition method device, apparatus and storage medium |
CN115860783A (en) * | 2022-12-23 | 2023-03-28 | 广东南粤分享汇控股有限公司 | E-commerce platform user feedback analysis method and system based on artificial intelligence |
CN115860783B (en) * | 2022-12-23 | 2023-09-26 | 广东南粤分享汇控股有限公司 | E-commerce platform user feedback analysis method and system based on artificial intelligence |
CN117219249A (en) * | 2023-05-10 | 2023-12-12 | 威海市立医院 | Equipment positioning management system based on intelligent medical digital platform |
CN117219249B (en) * | 2023-05-10 | 2024-07-05 | 威海市立医院 | Equipment positioning management system based on intelligent medical digital platform |
CN117540935A (en) * | 2024-01-09 | 2024-02-09 | 上海银行股份有限公司 | DAO operation management method based on block chain technology |
CN117540935B (en) * | 2024-01-09 | 2024-04-05 | 上海银行股份有限公司 | DAO operation management method based on block chain technology |
CN118334430A (en) * | 2024-04-18 | 2024-07-12 | 西南石油大学 | Cardiomyopathy classification method, system and medium based on deep neural network |
Also Published As
Publication number | Publication date |
---|---|
CN111931717B (en) | 2021-01-26 |
CN111931717A (en) | 2020-11-13 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
WO2021159751A1 (en) | Semantic and image recognition-based electrocardiography information extraction method and apparatus, computer device, and storage medium | |
CN109919928B (en) | Medical image detection method and device and storage medium | |
CN111710412B (en) | Diagnostic result verification method and device and electronic equipment | |
US20230119593A1 (en) | Method and apparatus for training facial feature extraction model, method and apparatus for extracting facial features, device, and storage medium | |
WO2022105118A1 (en) | Image-based health status identification method and apparatus, device and storage medium | |
CN109919078A (en) | A kind of method, the method and device of model training of video sequence selection | |
CN111414946B (en) | Artificial intelligence-based medical image noise data identification method and related device | |
US20200372639A1 (en) | Method and system for identifying skin texture and skin lesion using artificial intelligence cloud-based platform | |
CN112232276B (en) | Emotion detection method and device based on voice recognition and image recognition | |
CN111128391A (en) | Information processing apparatus, method and storage medium | |
CN113052090B (en) | Method and apparatus for generating caption generator and outputting caption | |
CN114886404B (en) | Electronic equipment, device and storage medium | |
CN112971718A (en) | Syndrome identification method and device, electronic equipment and storage medium | |
CN110083842A (en) | Translation quality detection method, device, machine translation system and storage medium | |
CN112256827A (en) | Sign language translation method and device, computer equipment and storage medium | |
US20240274275A1 (en) | Image analysis and insight generation | |
CN116992879A (en) | Entity identification method, device, equipment and medium based on artificial intelligence | |
CN111191035B (en) | Method and device for recognizing lung cancer clinical database text entity | |
WO2020215682A1 (en) | Fundus image sample expansion method and apparatus, electronic device, and computer non-volatile readable storage medium | |
CN116955529A (en) | Data processing method and device and electronic equipment | |
CN113903433B (en) | Image processing method and device and electronic equipment | |
US12099536B2 (en) | Extracting knowledge from a knowledge database | |
CN116955601A (en) | Multi-mode emotion recognition method and device | |
CN114090789A (en) | Intelligent multi-round interaction system for traditional Chinese medicine health maintenance based on knowledge graph | |
US11636262B2 (en) | Method and system for providing user-specific explanations for output generated by artificial neural network model |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 20918182 Country of ref document: EP Kind code of ref document: A1 |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 20918182 Country of ref document: EP Kind code of ref document: A1 |