WO2021169101A1 - Method and apparatus for generating medical image recognition model, computer device and medium - Google Patents

Method and apparatus for generating medical image recognition model, computer device and medium Download PDF

Info

Publication number
WO2021169101A1
WO2021169101A1 PCT/CN2020/097964 CN2020097964W WO2021169101A1 WO 2021169101 A1 WO2021169101 A1 WO 2021169101A1 CN 2020097964 W CN2020097964 W CN 2020097964W WO 2021169101 A1 WO2021169101 A1 WO 2021169101A1
Authority
WO
WIPO (PCT)
Prior art keywords
image
medical
information
annotation
file
Prior art date
Application number
PCT/CN2020/097964
Other languages
French (fr)
Chinese (zh)
Inventor
丁思雨
丁文静
Original Assignee
平安国际智慧城市科技股份有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 平安国际智慧城市科技股份有限公司 filed Critical 平安国际智慧城市科技股份有限公司
Publication of WO2021169101A1 publication Critical patent/WO2021169101A1/en

Links

Images

Classifications

    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16HHEALTHCARE INFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR THE HANDLING OR PROCESSING OF MEDICAL OR HEALTHCARE DATA
    • G16H30/00ICT specially adapted for the handling or processing of medical images
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/50Information retrieval; Database structures therefor; File system structures therefor of still image data
    • G06F16/58Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
    • G06F16/5866Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using information manually generated, e.g. tags, keywords, comments, manually generated location and time information
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N20/00Machine learning

Definitions

  • This application relates to a method, device, computer equipment and medium for generating a medical image recognition model.
  • a method, device, computer device, and medium for generating a medical image recognition model are provided.
  • a method for generating a medical image recognition model including:
  • the image annotation file is input into a machine learning model to learn the feature relationship between the image features in the image annotation file and the image annotation information through the machine learning model, and a medical image recognition model is generated according to the feature relationship.
  • a medical image recognition model generation device including:
  • Data acquisition module for acquiring medical data
  • the feature extraction module is used to extract medical images in the medical data, and perform feature extraction on the medical images to obtain image features;
  • An annotation generation module configured to extract medical documents in the medical data, analyze the medical documents to generate annotation information, and search for image annotation information corresponding to the medical image in the annotation information;
  • An image file generating module configured to establish a mapping relationship according to the image characteristics and the image annotation information, and generate an image annotation file according to the mapping relationship;
  • the model generation module is configured to input the image annotation file into a machine learning model to learn the feature relationship between the image feature in the image annotation file and the image annotation information through the machine learning model, and according to the feature relationship Generate medical image recognition model.
  • a computer device including a memory and one or more processors, the memory stores computer readable instructions, and when the computer readable instructions are executed by the processor, the one or more processors execute The following steps:
  • the image annotation file is input into a machine learning model to learn the feature relationship between the image features in the image annotation file and the image annotation information through the machine learning model, and a medical image recognition model is generated according to the feature relationship.
  • One or more computer-readable storage media storing computer-readable instructions.
  • the one or more processors perform the following steps:
  • the image annotation file is input into a machine learning model to learn the feature relationship between the image features in the image annotation file and the image annotation information through the machine learning model, and a medical image recognition model is generated according to the feature relationship.
  • Fig. 1 is an application scenario diagram of a method for generating a medical image recognition model according to one or more embodiments.
  • Fig. 2 is a schematic flowchart of a method for generating a medical image recognition model according to one or more embodiments.
  • FIG. 3 is a schematic flowchart of a method for generating and using a medical image recognition model according to one or more embodiments.
  • Fig. 4 is a schematic flowchart of a method for generating a medical image recognition model in another embodiment.
  • Fig. 5 is a structural block diagram of a device for generating a medical image recognition model according to one or more embodiments.
  • Figure 6 is a block diagram of a computer device according to one or more embodiments.
  • the medical image recognition model generation method provided in this application can be applied to the application environment as shown in FIG. 1.
  • the user terminal 102 communicates with the server 104 through the network.
  • the server 104 obtains medical data; the server 104 extracts medical images in the medical data, and extracts features from the medical images to obtain image features; the server 104 extracts medical files in the medical data, analyzes the medical files to generate annotation information, and the server 104 searches for the annotation information
  • the image annotation information corresponding to the medical image in the image; the server 104 establishes a mapping relationship according to the image characteristics and the image annotation information, and generates an image annotation file according to the mapping relationship; inputs the image annotation file into the machine learning model to learn the image annotation file through the machine learning model Based on the feature relationship between the image feature and the image annotation information, a medical image recognition model is generated according to the feature relationship.
  • the medical image recognition model can also be pushed to the user terminal 102.
  • the user terminal 102 may be, but is not limited to, various personal computers, notebook computers, smart phones, tablet computers, and portable wearable devices.
  • the server 104 may be implemented by an independent server or a server cluster composed of multiple servers. When the server 104 is an independent server, multiple databases can be deployed in the server 104, and each database can store specific medical files; when the server 104 is a server cluster composed of multiple servers, the database deployed in each server Can store specific medical file data tables.
  • FIG. 2 a schematic flow chart of a method for generating a medical image recognition model is provided.
  • the method is applied to the server 104 in FIG. 1 as an example for description.
  • the The method can also be applied to the terminal, and the method includes the following steps:
  • Step 210 Obtain medical data.
  • Medical data includes at least medical images and medical documents generated during medical procedures. Specifically, the user terminal uploads medical data with the help of the client, and the server obtains the medical data uploaded by the user terminal to perform data analysis on the medical data.
  • Step 220 Extract medical images from the medical data, and perform feature extraction on the medical images to obtain image features.
  • Medical images can be human body structure diagrams collected by medical equipment. Medical images not only include the medical image itself, but may also have attribute information associated with the medical image. For example, the attribute information can be the label information, diagnosis description information, and annotations corresponding to the medical image. Auxiliary information of medical images such as result information. And medical images can also have multiple storage formats.
  • the server may use an image feature extraction algorithm to extract image features corresponding to the medical image.
  • an image feature extraction algorithm For example, use the Histogram of Oriented Gradient (HOG) feature extraction algorithm, Local Binary Pattern (LBP) feature extraction algorithm, or Haar feature extraction algorithm to extract image features corresponding to medical images, such as image features It can be "raised”, “dark tone” and so on.
  • HOG Histogram of Oriented Gradient
  • LBP Local Binary Pattern
  • Haar feature extraction algorithm to extract image features corresponding to medical images, such as image features It can be "raised”, “dark tone” and so on.
  • the server preprocesses the acquired medical image, such as performing enhancement processing or filtering processing on the medical image, so as to improve the accuracy of the feature extraction of the medical image by the server.
  • Step 230 Extract medical files in the medical data, analyze the medical files to generate annotation information, and search for image annotation information corresponding to the medical image in the annotation information.
  • Medical documents can be textual information generated during medical treatment, etc., and medical documents can have multiple formats, such as medical paper medical records or medical electronic medical record documents, etc., more specifically, medical electronic medical record documents can be excel, xml And other file formats.
  • the server extracts medical documents from the medical data, analyzes the medical documents to generate annotation information, and the annotation information may be diagnostic information given by medical professionals.
  • the medical file may contain labeling information for the medical image, and the labeling information may be the diagnostic information for the medical image given by the medical professional.
  • the server may use keyword extraction technology to extract annotation information from medical documents, and when the acquired medical document is a medical paper medical record, the server also includes electronic processing of the paper medical record to obtain an electronic medical record document, Then the required annotation information is parsed from the electronic medical record document. Parsing refers to the process of extracting the required information from the medical file.
  • the annotation information can be the cause of the disease, diagnosis suggestions, or diagnosis results.
  • the server can extract the annotation information of the medical image in the medical text according to the keyword matching technology, for example, the annotation information corresponding to the diagnosis result of the medical image may be "lumps", "shadows", and so on.
  • Step 240 Establish a mapping relationship according to the image characteristics and the image annotation information, and generate an image annotation file according to the mapping relationship.
  • an image annotation file can be: shadow-pulmonary cyst, indicating that when the medical image feature extracted by the server is "shadow”, the corresponding image annotation information should be "pulmonary cyst” at this time, so that the annotation information can be automatically obtained from the medical image Function.
  • the server can also perform data processing on the acquired image annotation files, such as data cleaning, data management, data verification, etc., to make the acquired data more standard.
  • Step 250 Input the image annotation file into the machine learning model to learn the feature relationship between the image features in the image annotation file and the image annotation information through the machine learning model, and generate a medical image recognition model based on the feature relationship.
  • the server uses the sample set to train the machine learning model to obtain the association relationship between the image features and the image annotation information, and obtain the medical image recognition model according to the association relationship. Since the medical image recognition model stores the correspondence between image features and image annotation information, the medical image recognition model can be used to realize automatic acquisition of medical image features or image annotation information.
  • the server extracts medical images and medical texts from medical data, extracts image features corresponding to medical images and annotation information in medical texts, realizes automatic extraction of medical information from medical data, and improves the processing of medical data. Efficiency and accuracy.
  • an image annotation file is established, and the image annotation file is used as a sample set to train the machine learning model to obtain a medical image recognition model.
  • the medical image recognition model can then be used to automatically extract image annotation information, which improves The recognition accuracy and efficiency of medical data.
  • establishing a mapping relationship based on image features and image annotation information, and generating an image annotation file based on the mapping relationship includes: obtaining an image identifier corresponding to the medical image; searching for an image analysis method corresponding to the image identifier; The attribute information associated with the medical image is analyzed to generate image label information; the mapping relationship is established according to the image characteristics, the image annotation information, and the image label information, and the image annotation file is generated according to the mapping relationship.
  • the image identifier is used to uniquely identify the category of the medical image, and the category of the medical image can be the storage format of the image.
  • Different medical images can be stored in different medical image formats.
  • the information contained in different formats of medical images is different, and the usage scenarios are different. In traditional technology, they are read and viewed in different formats through corresponding client tools. Medical imaging requires users to download multiple tools and know the instructions for use of the corresponding tools, which brings great inconvenience to users and also makes the efficiency of viewing medical images inefficient.
  • the server extracts the medical image from the medical data, obtains the image identifier corresponding to the medical image, and calls the image analysis method associated with the image identifier to automatically analyze the attribute information associated with the medical image according to the image analysis method to generate image tag information.
  • the storage format can be DICOM, ndpi, tiff, jpg, bmp, png, etc.
  • C++-based DCMTK, Java-based dcm4che, and python-based pydicom can be used for analysis.
  • Another example is the use of dcm4che to analyze the tag information of DICOM files, and the use of opencv to analyze and process medical images such as pathology and fundus.
  • the image analysis method is used to extract and analyze the attribute information corresponding to the medical image to obtain the required image tag information.
  • the attribute information associated with a medical image can be encoded information, and the server parses and converts the encoded information according to the image analysis method found to obtain image tag information.
  • the parsed image tag information is: Tag. Tag.PatientName--patient name, Tag.PatientAge--patient age, Tag.PatientSex--patient gender, Tag.StudyID--examination ID, Tag.StudyDate--examination date, etc.
  • performing feature extraction on medical images to obtain image features includes: extracting annotation information in the medical image; obtaining location coordinates corresponding to the annotation information; extracting regional medical images corresponding to the location coordinates from the medical images; using Image feature extraction algorithm extracts image features of regional medical images.
  • the annotation information in the medical image may be information for identifying areas that need to be focused on in the medical image.
  • the annotation information may be parsed based on files such as xml and json generated by an internal annotation platform that has been desensitized.
  • the location coordinates can be the coordinates of the labeled information in the medical image, and the server can obtain the regional medical image corresponding to the area that needs to be focused on through the coordinates.
  • the server may use HOG feature extraction algorithm, LBP feature extraction algorithm, Haar feature extraction algorithm, etc. to automatically extract image features corresponding to regional medical images.
  • the extracted image features can be "bulge” or “dark tone”.
  • the server can extract the labeling information of the regional medical image in the medical text according to the keyword matching technology, for example, the labeling information can be "lumps", "shadows", and so on.
  • the regional medical image in the medical image is automatically extracted according to the coordinates in the medical image, and the key areas in the medical image and the regional features of the key areas are automatically extracted, so that the operation steps of the regional medical image can be realized. Improve the efficiency of medical image data processing.
  • after parsing the medical file to generate the annotation information it further includes: dividing the annotation information into unit annotation information according to preset rules; matching the unit annotation information with the information in the annotation information database, and matching The information with the highest degree is extracted as label information.
  • the labeling information generated by the server parsing the medical documents can be professionals, such as doctors’ diagnosis suggestions for patients. Since different doctors may have different expressions for the same disease, the labeling information obtained at this time may exist. Inconsistent textual representations of the same matter.
  • the server can use Chinese natural language processing word segmentation methods, such as HanLP for Chinese word segmentation, and segment and adjust the obtained labeling information according to certain rules to obtain multiple unit labeling information, and then label the unit
  • Chinese natural language processing word segmentation methods such as HanLP for Chinese word segmentation
  • segment and adjust the obtained labeling information according to certain rules to obtain multiple unit labeling information, and then label the unit
  • the information is matched with words in an annotation information database such as a dictionary, and the information with the largest matching degree in the dictionary is extracted as annotation information. Since the information stored in the dictionary is standardized information, the use of dictionary matching technology can realize the standardized processing of labeling information.
  • Chinese word segmentation methods include word segmentation methods based on string matching, word segmentation methods based on understanding, and word segmentation methods based on statistics.
  • Chinese word segmentation is performed based on HanLP to generate labeling information, which realizes the standardization of labeling information and facilitates subsequent data retrieval.
  • the method further includes: receiving the query request sent by the terminal, extracting the query identifier carried in the query request; extracting the image identifier corresponding to the image annotation file; calculating the query identifier and each image The matching degree of the logo; the image annotation file associated with the image logo with the largest matching degree is pushed to the terminal.
  • the image annotation file after the image annotation file is established according to each image feature and corresponding image annotation information, it also includes associating each image annotation file with the corresponding image identifier to generate an image annotation file library, that is, the image annotation file library stores different images Mark the corresponding image annotation file, such as "lung-lung image annotation file, stomach-stomach image annotation file, etc.” are stored.
  • the user terminal wants to find medical image diagnosis suggestions from the image annotation file library
  • the user terminal sends a query request to the server, and the server matches the corresponding image annotation file from the image annotation file library according to the obtained query request, and annotates the image
  • the file is pushed to the user terminal, and then the user can perform operations based on the queried image annotation file. For example, users can match their own medical images with the medical images in the acquired image annotation files, and then query the annotation information of the medical images in the image annotation file library to obtain diagnosis suggestions for personal pathological conditions.
  • the user terminal may send a query request to the server, for example, it may be an image query request.
  • the server extracts the query identifier carried in the query request, for example, extracts the image query identifier carried in the image query request. Then the server matches the obtained query identifier with the image identifier corresponding to the image annotation file, and pushes the image annotation file associated with the successfully matched image identifier to the user terminal, so as to realize the function that the user can obtain the corresponding image annotation file through the query request .
  • the user wants to obtain the image annotation file corresponding to a certain type of disease, he can obtain the corresponding file only according to the query request. Specifically, the user can use the obtained image annotation file as a reference file, and can refer to the medical image of his concern. Compare with the medical images in the acquired reference image annotation files, and then obtain the reference image features and reference image annotation information of the medical images that they are concerned about, and help users obtain professional medical image suggestions in time.
  • the server obtains the user's query request, and then can push the corresponding image annotation file to the user, which realizes the automatic acquisition of the image annotation file and improves the efficiency of file acquisition.
  • a flow diagram of a method for generating and using a medical image recognition model including:
  • Step 310 Obtain medical data.
  • Step 320 Extract medical images from the medical data, and perform feature extraction on the medical images to obtain image features.
  • Step 330 Extract medical files in the medical data, analyze the medical files to generate annotation information, and search for image annotation information corresponding to the medical image in the annotation information.
  • Step 340 Establish a mapping relationship according to the image characteristics and the image annotation information, and generate an image annotation file according to the mapping relationship.
  • Step 350 Input the image annotation file into the machine learning model to learn the feature relationship between the image features in the image annotation file and the image annotation information through the machine learning model, and generate a medical image recognition model based on the feature relationship.
  • step 310 to step 350 For specific embodiments of step 310 to step 350, reference may be made to the description of step 210 to step 250 in the foregoing, which is not repeated here.
  • the method further includes:
  • Step 360 Receive the medical image sent by the terminal.
  • the user terminal may send medical images to the server, and there is no restriction on the format of the medical images sent by the user terminal.
  • the server receives the medical images sent by the user terminal.
  • Step 370 Input the medical image into the medical image recognition model to extract image features corresponding to the medical image through the medical image recognition model, and obtain image annotation information based on the medical image features.
  • the server recognizes the acquired medical images according to the trained medical image recognition model, and then obtains the image annotation information corresponding to the medical image, and realizes the medical image Automatic recognition.
  • Step 380 Push the image annotation information to the terminal.
  • the server pushes the image annotation information recognized according to the medical image recognition model to the user terminal.
  • the trained medical image recognition model can be used to recognize medical images in any format, realize automatic recognition and automatic labeling of medical images, and use the medical image recognition model obtained by automatic training of big data.
  • the recognition result is more accurate and the recognition efficiency is higher.
  • a schematic diagram of the process of generating a medical image recognition model in another embodiment including:
  • Step 400 Obtain files in DICOM, ndpi, tiff, jpg, bmp, png and other formats.
  • the server can obtain medical files in multiple formats, and the server can obtain files uploaded by the user terminal.
  • Step 410 Obtain a diagnosis file, an annotation file, and an electronic medical record.
  • the server can obtain the diagnosis file, the annotation file, the electronic medical record, etc. in the medical file.
  • the server can obtain the files uploaded by the user terminal.
  • Step 420 upload the client.
  • the medical files obtained in step 400 and step 410 are uploaded to the client.
  • the server can call the corresponding analysis algorithm for analysis based on the obtained medical files in multiple formats, and obtain the medical image in step 430 and the metadata information in step 440.
  • the metadata information can be label information, including tag information and diagnosis. Information and other label information, etc.
  • Step 450 Obtain a medical tag library.
  • the medical tag library may be an image annotation file.
  • Step 460 the medical image management platform.
  • the server uploads the data acquired in step 430 and step 450 to the medical image management platform.
  • the medical image management platform may be an image annotation file library, which stores multiple types of image annotation files. Then, the medical image management platform can be used to perform the medical image retrieval in step 470, the medical image export in step 480, and the intelligent labeling function in step 490.
  • a medical image recognition model generation device including: a data acquisition module 510, a feature extraction module 520, an annotation generation module 530, an image file generation module 540, and a model generation module 550, of which:
  • the data acquisition module 510 is used to acquire medical data.
  • the feature extraction module 520 is configured to extract medical images in the medical data, and perform feature extraction on the medical images to obtain image features.
  • the annotation generating module 530 is configured to extract medical documents in the medical data, analyze the medical documents to generate annotation information, and search for image annotation information corresponding to the medical image in the annotation information.
  • the image file generating module 540 is configured to establish a mapping relationship according to the image characteristics and the image annotation information, and generate an image annotation file according to the mapping relationship.
  • the model generation module 550 is configured to input the image annotation file into a machine learning model to learn the feature relationship between the image feature in the image annotation file and the image annotation information through the machine learning model, and according to the feature Relationship generation medical image recognition model.
  • the feature extraction module 520 includes:
  • the image annotation information extraction unit is used to extract the annotation information in the medical image.
  • the coordinate acquisition unit is used to acquire the position coordinates corresponding to the label information.
  • the regional image extraction unit is used to extract regional medical images corresponding to the position coordinates from the medical images.
  • the regional feature extraction unit is used to extract image features of the regional medical image by using an image feature extraction algorithm.
  • the device further includes:
  • the segmentation module is configured to segment the label information into unit label information according to preset rules.
  • the information extraction module is configured to extract and match the unit label information with the information in the label information database, and extract the information with the greatest degree of matching as label information.
  • the image file generating module 540 includes:
  • the identification acquiring unit is used to acquire the image identification corresponding to the medical image.
  • the method search unit is used to search for the image analysis method corresponding to the image identifier.
  • the information generating unit is configured to analyze the attribute information associated with the medical image by using the image analysis method to generate image tag information.
  • the file generating unit is configured to establish a mapping relationship according to the image characteristics, the image annotation information, and the image tag information, and generate an image annotation file according to the mapping relationship.
  • the device further includes:
  • the query identifier extraction module is used to receive the query request sent by the terminal, and extract the query identifier carried in the query request.
  • the identification extraction module is used to extract the image identification corresponding to the image annotation file.
  • the calculation module is used to calculate the matching degree between the query identifier and each of the image identifiers.
  • the first pushing module is configured to push the image annotation file associated with the image identifier with the greatest matching degree to the terminal.
  • the device further includes:
  • the receiving module is used to receive medical images sent by the terminal.
  • the annotation information recognition module is used to input the medical image into the medical image recognition model to extract the image feature corresponding to the medical image through the medical image recognition model, and obtain the image according to the medical image feature Annotate information.
  • the second push module is used to push the image annotation information to the terminal.
  • Each module in the above-mentioned medical image recognition model generating device can be implemented in whole or in part by software, hardware, and a combination thereof.
  • the above-mentioned modules may be embedded in the form of hardware or independent of the processor in the computer equipment, or may be stored in the memory of the computer equipment in the form of software, so that the processor can call and execute the operations corresponding to the above-mentioned modules.
  • a computer device is provided.
  • the computer device may be a server, and its internal structure diagram may be as shown in FIG. 6.
  • the computer equipment includes a processor, a memory, a network interface, and a database connected through a system bus.
  • the processor of the computer device is used to provide calculation and control capabilities.
  • the memory of the computer device includes non-volatile or volatile storage media and internal memory.
  • the non-volatile or volatile storage medium stores an operating system, computer readable instructions, and a database.
  • the internal memory provides an environment for the operation of the operating system and computer-readable instructions in the non-volatile or volatile storage medium.
  • the computer equipment database is used for medical data processing related data.
  • the network interface of the computer device is used to communicate with an external terminal through a network connection.
  • FIG. 6 is only a block diagram of part of the structure related to the solution of the present application, and does not constitute a limitation on the computer device to which the solution of the present application is applied.
  • the specific computer device may Including more or fewer parts than shown in the figure, or combining some parts, or having a different arrangement of parts.
  • a computer device includes a memory and one or more processors.
  • the memory stores computer-readable instructions.
  • the one or more processors perform the following steps: acquiring medical data; extracting For the medical images in the medical data, feature extraction is performed on the medical images to obtain image features; the medical files in the medical data are extracted, the medical files are parsed to generate annotation information, and the annotation information is searched for and the The image annotation information corresponding to the medical image; establish a mapping relationship according to the image characteristics and the image annotation information, and generate an image annotation file according to the mapping relationship; and input the image annotation file into a machine learning model to pass machine learning The model learns the feature relationship between the image feature and the image tag information in the image tag file, and generates a medical image recognition model according to the feature relationship.
  • the processor when the processor executes the computer-readable instructions to implement the step of extracting features from the medical image to obtain image features, it is also used to: extract the annotation information in the medical image; and obtain the annotation information Corresponding location coordinates; extracting regional medical images corresponding to the location coordinates from the medical images; and extracting image features of the regional medical images using an image feature extraction algorithm.
  • the processor when the processor executes the computer-readable instructions, after the processor executes the parsing of the medical file to generate annotation information, it is further used to: divide the annotation information into unit annotation information according to a preset rule; And matching the unit labeling information with the information in the labeling information database, and extracting the information with the greatest degree of matching as labeling information.
  • the step of establishing a mapping relationship based on the image characteristics and the image annotation information, and generating an image annotation file based on the mapping relationship is further used for: Obtain the image identifier corresponding to the medical image; find the image analysis method corresponding to the image identifier; use the image analysis method to analyze the attribute information associated with the medical image to generate image tag information; and according to the image feature Establishing a mapping relationship between the image annotation information and the image tag information, and generating an image annotation file according to the mapping relationship.
  • the processor when the processor executes the computer-readable instruction, after the image annotation file is generated according to the mapping relationship, the processor is further used to: receive the query request sent by the terminal, and extract the query identifier carried in the query request Extracting the image identification corresponding to the image annotation file; calculating the matching degree between the query identification and each of the image identification; and pushing the image annotation file associated with the image identification with the largest matching degree to the terminal.
  • the processor when the processor executes the computer-readable instructions to realize the generation of the medical image recognition model, it is further used to: receive the medical image sent by the terminal; input the medical image into the medical image recognition model to Extracting the image feature corresponding to the medical image through the medical image recognition model, obtaining the image annotation information according to the medical image feature; and pushing the image annotation information to the terminal.
  • One or more computer-readable storage media storing computer-readable instructions.
  • the one or more processors perform the following steps: acquiring medical data; extracting the medical The medical image in the data is extracted from the medical image to obtain the image feature; the medical file in the medical data is extracted, the medical file is parsed to generate annotation information, and the annotation information is searched for in the annotation information and the medical image Corresponding image annotation information; establish a mapping relationship according to the image features and the image annotation information, and generate an image annotation file according to the mapping relationship; and input the image annotation file into a machine learning model to learn all the information through the machine learning model According to the feature relationship between the image feature and the image tag information in the image tag file, a medical image recognition model is generated according to the feature relationship.
  • the computer-readable storage medium may be non-volatile or volatile.
  • the computer-readable instruction when executed by the processor to implement the step of extracting features from the medical image to obtain image features, it is also used to: extract the annotation information in the medical image; and obtain the annotation information Corresponding location coordinates; extracting regional medical images corresponding to the location coordinates from the medical images; and extracting image features of the regional medical images using an image feature extraction algorithm.
  • the computer-readable instruction when executed by the processor, after realizing the parsing of the medical file to generate annotation information, it is further used to: divide the annotation information into unit annotation information according to a preset rule; And matching the unit labeling information with the information in the labeling information database, and extracting the information with the greatest degree of matching as labeling information.
  • the step of establishing a mapping relationship based on the image characteristics and the image annotation information, and generating an image annotation file based on the mapping relationship is further used to: Obtain the image identifier corresponding to the medical image; find the image analysis method corresponding to the image identifier; use the image analysis method to analyze the attribute information associated with the medical image to generate image tag information; and according to the image feature Establishing a mapping relationship between the image annotation information and the image tag information, and generating an image annotation file according to the mapping relationship.
  • the computer-readable instruction when executed by the processor, after the image annotation file is generated according to the mapping relationship, it is further used to: receive a query request sent by the terminal, and extract the query identifier carried in the query request Extracting the image identification corresponding to the image annotation file; calculating the matching degree between the query identification and each of the image identification; and pushing the image annotation file associated with the image identification with the largest matching degree to the terminal.
  • the computer-readable instruction when executed by the processor to realize the generation of the medical image recognition model, it is further used to: receive the medical image sent by the terminal; input the medical image into the medical image recognition model to Extracting the image feature corresponding to the medical image through the medical image recognition model, obtaining the image annotation information according to the medical image feature; and pushing the image annotation information to the terminal.
  • Non-volatile memory may include read only memory (ROM), programmable ROM (PROM), electrically programmable ROM (EPROM), electrically erasable programmable ROM (EEPROM), or flash memory.
  • Volatile memory may include random access memory (RAM) or external cache memory.
  • RAM is available in many forms, such as static RAM (SRAM), dynamic RAM (DRAM), synchronous DRAM (SDRAM), double data rate SDRAM (DDRSDRAM), enhanced SDRAM (ESDRAM), synchronous chain Channel (Synchlink) DRAM (SLDRAM), memory bus (Rambus) direct RAM (RDRAM), direct memory bus dynamic RAM (DRDRAM), and memory bus dynamic RAM (RDRAM), etc.
  • SRAM static RAM
  • DRAM dynamic RAM
  • SDRAM synchronous DRAM
  • DDRSDRAM double data rate SDRAM
  • ESDRAM enhanced SDRAM
  • SLDRAM synchronous chain Channel
  • memory bus Radbus direct RAM
  • RDRAM direct memory bus dynamic RAM
  • RDRAM memory bus dynamic RAM

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Software Systems (AREA)
  • Medical Informatics (AREA)
  • Health & Medical Sciences (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • Artificial Intelligence (AREA)
  • Nuclear Medicine, Radiotherapy & Molecular Imaging (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Measuring And Recording Apparatus For Diagnosis (AREA)
  • Databases & Information Systems (AREA)
  • Evolutionary Computation (AREA)
  • Computing Systems (AREA)
  • Mathematical Physics (AREA)
  • Radiology & Medical Imaging (AREA)
  • Epidemiology (AREA)
  • General Health & Medical Sciences (AREA)
  • Primary Health Care (AREA)
  • Public Health (AREA)
  • Medical Treatment And Welfare Office Work (AREA)
  • Library & Information Science (AREA)

Abstract

A method for generating a medical image recognition model, which relates to the technical field of artificial intelligence. The method comprises: acquiring medical data; extracting a medical image in the medical data, and performing feature extraction on the medical image to obtain image features; extracting a medical file in the medical data, analyzing the medical file to generate annotation information, and searching within the annotation information for image annotation information corresponding to the medical image; establishing a mapping relationship according to the image features and the image annotation information, and generating an image annotation file according to the mapping relationship; and inputting the image annotation file into a machine learning model so as to learn the feature relationship between the image features in the image annotation file and the image annotation information by means of the machine learning model, and generating a medical image recognition model according to the feature relationship.

Description

医疗影像识别模型生成方法、装置、计算机设备和介质Medical image recognition model generation method, device, computer equipment and medium
相关申请的交叉引用Cross-references to related applications
本申请要求于2020年02月28日提交中国专利局,申请号为2020101284650,申请名称为“医疗影像识别模型生成方法、装置、计算机设备和介质”的中国专利申请的优先权,其全部内容通过引用结合在本申请中。This application claims the priority of a Chinese patent application filed with the Chinese Patent Office on February 28, 2020, the application number is 2020101284650, and the application name is "Medical Image Recognition Model Generation Method, Apparatus, Computer Equipment, and Medium". The entire content of the application is approved The reference is incorporated in this application.
技术领域Technical field
本申请涉及一种医疗影像识别模型生成方法、装置、计算机设备和介质。This application relates to a method, device, computer equipment and medium for generating a medical image recognition model.
背景技术Background technique
随着经济的快速发展和生活水平的提高,人们对健康问题越来越关注,如今利用医疗设备采集人体器官影像图,已经得到了广泛的应用。With the rapid economic development and the improvement of living standards, people are paying more and more attention to health problems. Nowadays, the use of medical equipment to collect images of human organs has been widely used.
但是发明人意识到,通过医疗设备采取的医疗影像必须依靠医疗专业的人士才能识别,并且人工进行识别也不能保证对医疗影像中信息识别的准确度,虽然传统技术中有根据医疗文件建立的医疗文件库,通过查询医疗文件库可获取医疗影像信息,但是目前构建医疗文件库的方法大多是人工建立,使得模型建立的效率低下。However, the inventor realizes that medical images taken through medical equipment must be recognized by medical professionals, and manual recognition cannot guarantee the accuracy of information recognition in medical images. Although there are medical records established based on medical documents in traditional technology. The file library, medical imaging information can be obtained by querying the medical file library, but the current methods for constructing the medical file library are mostly manual establishment, which makes the model establishment inefficient.
发明内容Summary of the invention
根据本申请公开的各种实施例,提供一种医疗影像识别模型生成方法、装置、计算机设备和介质。According to various embodiments disclosed in the present application, a method, device, computer device, and medium for generating a medical image recognition model are provided.
一种医疗影像识别模型生成方法,包括:A method for generating a medical image recognition model, including:
获取医疗数据;Access to medical data;
提取所述医疗数据中的医疗影像,对所述医疗影像进行特征提取得到影像特征;Extracting medical images in the medical data, and performing feature extraction on the medical images to obtain image features;
提取所述医疗数据中的医疗文件,对所述医疗文件进行解析生成标注信息,查找所述标注信息中与所述医疗影像对应的影像标注信息;Extracting medical documents in the medical data, analyzing the medical documents to generate annotation information, and searching for image annotation information corresponding to the medical image in the annotation information;
根据所述影像特征以及所述影像标注信息建立映射关系,根据所述映射关系生成影像标注文件;及Establishing a mapping relationship according to the image characteristics and the image annotation information, and generating an image annotation file according to the mapping relationship; and
将所述影像标注文件输入机器学习模型,以通过机器学习模型学习所述影像标注文件中所述影像特征与所述影像标注信息之间的特征关系,根据所述特征关系生成医疗影像识别模型。The image annotation file is input into a machine learning model to learn the feature relationship between the image features in the image annotation file and the image annotation information through the machine learning model, and a medical image recognition model is generated according to the feature relationship.
一种医疗影像识别模型生成装置,包括:A medical image recognition model generation device, including:
数据获取模块,用于获取医疗数据;Data acquisition module for acquiring medical data;
特征提取模块,用于提取所述医疗数据中的医疗影像,对所述医疗影像进行特征提取得 到影像特征;The feature extraction module is used to extract medical images in the medical data, and perform feature extraction on the medical images to obtain image features;
标注生成模块,用于提取所述医疗数据中的医疗文件,对所述医疗文件进行解析生成标注信息,查找所述标注信息中与所述医疗影像对应的影像标注信息;An annotation generation module, configured to extract medical documents in the medical data, analyze the medical documents to generate annotation information, and search for image annotation information corresponding to the medical image in the annotation information;
影像文件生成模块,用于根据所述影像特征以及所述影像标注信息建立映射关系,根据所述映射关系生成影像标注文件;及An image file generating module, configured to establish a mapping relationship according to the image characteristics and the image annotation information, and generate an image annotation file according to the mapping relationship; and
模型生成模块,用于将所述影像标注文件输入机器学习模型,以通过机器学习模型学习所述影像标注文件中所述影像特征与所述影像标注信息之间的特征关系,根据所述特征关系生成医疗影像识别模型。The model generation module is configured to input the image annotation file into a machine learning model to learn the feature relationship between the image feature in the image annotation file and the image annotation information through the machine learning model, and according to the feature relationship Generate medical image recognition model.
一种计算机设备,包括存储器和一个或多个处理器,所述存储器中储存有计算机可读指令,所述计算机可读指令被所述处理器执行时,使得所述一个或多个处理器执行以下步骤:A computer device, including a memory and one or more processors, the memory stores computer readable instructions, and when the computer readable instructions are executed by the processor, the one or more processors execute The following steps:
获取医疗数据;Access to medical data;
提取所述医疗数据中的医疗影像,对所述医疗影像进行特征提取得到影像特征;Extracting medical images in the medical data, and performing feature extraction on the medical images to obtain image features;
提取所述医疗数据中的医疗文件,对所述医疗文件进行解析生成标注信息,查找所述标注信息中与所述医疗影像对应的影像标注信息;Extracting medical documents in the medical data, analyzing the medical documents to generate annotation information, and searching for image annotation information corresponding to the medical image in the annotation information;
根据所述影像特征以及所述影像标注信息建立映射关系,根据所述映射关系生成影像标注文件;及Establishing a mapping relationship according to the image characteristics and the image annotation information, and generating an image annotation file according to the mapping relationship; and
将所述影像标注文件输入机器学习模型,以通过机器学习模型学习所述影像标注文件中所述影像特征与所述影像标注信息之间的特征关系,根据所述特征关系生成医疗影像识别模型。The image annotation file is input into a machine learning model to learn the feature relationship between the image features in the image annotation file and the image annotation information through the machine learning model, and a medical image recognition model is generated according to the feature relationship.
一个或多个存储有计算机可读指令的计算机可读存储介质,计算机可读指令被一个或多个处理器执行时,使得一个或多个处理器执行以下步骤:One or more computer-readable storage media storing computer-readable instructions. When the computer-readable instructions are executed by one or more processors, the one or more processors perform the following steps:
获取医疗数据;Access to medical data;
提取所述医疗数据中的医疗影像,对所述医疗影像进行特征提取得到影像特征;Extracting medical images in the medical data, and performing feature extraction on the medical images to obtain image features;
提取所述医疗数据中的医疗文件,对所述医疗文件进行解析生成标注信息,查找所述标注信息中与所述医疗影像对应的影像标注信息;Extracting medical documents in the medical data, analyzing the medical documents to generate annotation information, and searching for image annotation information corresponding to the medical image in the annotation information;
根据所述影像特征以及所述影像标注信息建立映射关系,根据所述映射关系生成影像标注文件;及Establishing a mapping relationship according to the image characteristics and the image annotation information, and generating an image annotation file according to the mapping relationship; and
将所述影像标注文件输入机器学习模型,以通过机器学习模型学习所述影像标注文件中所述影像特征与所述影像标注信息之间的特征关系,根据所述特征关系生成医疗影像识别模型。The image annotation file is input into a machine learning model to learn the feature relationship between the image features in the image annotation file and the image annotation information through the machine learning model, and a medical image recognition model is generated according to the feature relationship.
本申请的一个或多个实施例的细节在下面的附图和描述中提出。本申请的其它特征和优点将从说明书、附图以及权利要求书变得明显。The details of one or more embodiments of the present application are set forth in the following drawings and description. Other features and advantages of this application will become apparent from the description, drawings and claims.
附图说明Description of the drawings
为了更清楚地说明本申请实施例中的技术方案,下面将对实施例中所需要使用的附图作简单地介绍,显而易见地,下面描述中的附图仅仅是本申请的一些实施例,对于本领域普通 技术人员来讲,在不付出创造性劳动的前提下,还可以根据这些附图获得其它的附图。In order to more clearly describe the technical solutions in the embodiments of the present application, the following will briefly introduce the drawings needed in the embodiments. Obviously, the drawings in the following description are only some embodiments of the present application. A person of ordinary skill in the art can obtain other drawings based on these drawings without creative work.
图1为根据一个或多个实施例中医疗影像识别模型生成方法的应用场景图。Fig. 1 is an application scenario diagram of a method for generating a medical image recognition model according to one or more embodiments.
图2为根据一个或多个实施例中医疗影像识别模型生成方法的流程示意图。Fig. 2 is a schematic flowchart of a method for generating a medical image recognition model according to one or more embodiments.
图3为根据一个或多个实施例中医疗影像识别模型生成方法及使用方法的流程示意图。FIG. 3 is a schematic flowchart of a method for generating and using a medical image recognition model according to one or more embodiments.
图4为另一个实施例中医疗影像识别模型生成方法的流程示意图。Fig. 4 is a schematic flowchart of a method for generating a medical image recognition model in another embodiment.
图5为根据一个或多个实施例中医疗影像识别模型生成装置的结构框图。Fig. 5 is a structural block diagram of a device for generating a medical image recognition model according to one or more embodiments.
图6为根据一个或多个实施例中计算机设备的框图。Figure 6 is a block diagram of a computer device according to one or more embodiments.
具体实施方式Detailed ways
为了使本申请的技术方案及优点更加清楚明白,以下结合附图及实施例,对本申请进行进一步详细说明。应当理解,此处描述的具体实施例仅仅用以解释本申请,并不用于限定本申请。In order to make the technical solutions and advantages of the present application clearer, the following further describes the present application in detail with reference to the accompanying drawings and embodiments. It should be understood that the specific embodiments described herein are only used to explain the present application, and are not used to limit the present application.
本申请提供的医疗影像识别模型生成方法,可以应用于如图1所示的应用环境中。用户终端102通过网络与服务器104进行通信。服务器104获取医疗数据;服务器104提取医疗数据中的医疗影像,对医疗影像进行特征提取得到影像特征;服务器104提取医疗数据中的医疗文件,对医疗文件进行解析生成标注信息,服务器104查找标注信息中与医疗影像对应的影像标注信息;服务器104根据影像特征以及影像标注信息建立映射关系,根据映射关系生成影像标注文件;将影像标注文件输入机器学习模型,以通过机器学习模型学习影像标注文件中影像特征与影像标注信息之间的特征关系,根据特征关系生成医疗影像识别模型,进一步地,还可以将医疗影像识别模型推送至用户终端102。用户终端102可以但不限于是各种个人计算机、笔记本电脑、智能手机、平板电脑和便携式可穿戴设备,服务器104可以用独立的服务器或者是多个服务器组成的服务器集群来实现。当服务器104为独立的服务器时,服务器104中可以部署多个数据库,每个数据库中可以存储特定的医疗文件;当服务器104为多个服务器组成的服务器集群时,每个服务器中部署的数据库中可以存储特定的医疗文件数据表。The medical image recognition model generation method provided in this application can be applied to the application environment as shown in FIG. 1. The user terminal 102 communicates with the server 104 through the network. The server 104 obtains medical data; the server 104 extracts medical images in the medical data, and extracts features from the medical images to obtain image features; the server 104 extracts medical files in the medical data, analyzes the medical files to generate annotation information, and the server 104 searches for the annotation information The image annotation information corresponding to the medical image in the image; the server 104 establishes a mapping relationship according to the image characteristics and the image annotation information, and generates an image annotation file according to the mapping relationship; inputs the image annotation file into the machine learning model to learn the image annotation file through the machine learning model Based on the feature relationship between the image feature and the image annotation information, a medical image recognition model is generated according to the feature relationship. Furthermore, the medical image recognition model can also be pushed to the user terminal 102. The user terminal 102 may be, but is not limited to, various personal computers, notebook computers, smart phones, tablet computers, and portable wearable devices. The server 104 may be implemented by an independent server or a server cluster composed of multiple servers. When the server 104 is an independent server, multiple databases can be deployed in the server 104, and each database can store specific medical files; when the server 104 is a server cluster composed of multiple servers, the database deployed in each server Can store specific medical file data tables.
在其中一个实施例中,如图2所示,提供了一种医疗影像识别模型生成方法的流程示意图,以该方法应用于图1中的服务器104为例进行说明,在其他实施例中,该方法也可以应用于终端,方法包括以下步骤:In one of the embodiments, as shown in FIG. 2, a schematic flow chart of a method for generating a medical image recognition model is provided. The method is applied to the server 104 in FIG. 1 as an example for description. In other embodiments, the The method can also be applied to the terminal, and the method includes the following steps:
步骤210,获取医疗数据。Step 210: Obtain medical data.
医疗数据至少包含医疗过程中产生的医疗影像以及医疗文件。具体地,用户终端借助客户端上传医疗数据,服务器获取用户终端上传的医疗数据,以对医疗数据进行数据分析。Medical data includes at least medical images and medical documents generated during medical procedures. Specifically, the user terminal uploads medical data with the help of the client, and the server obtains the medical data uploaded by the user terminal to perform data analysis on the medical data.
步骤220,提取医疗数据中的医疗影像,对医疗影像进行特征提取得到影像特征。Step 220: Extract medical images from the medical data, and perform feature extraction on the medical images to obtain image features.
医疗影像可为医疗设备采集到的人体结构图,医疗影像不仅包含医疗影像本身,还可能会具有与医疗影像关联的属性信息,如属性信息可为医疗影像对应的标签信息、诊断描述信息以及标注结果信息等医疗影像的辅助信息。并且医疗影像也可具有多种存储格式。Medical images can be human body structure diagrams collected by medical equipment. Medical images not only include the medical image itself, but may also have attribute information associated with the medical image. For example, the attribute information can be the label information, diagnosis description information, and annotations corresponding to the medical image. Auxiliary information of medical images such as result information. And medical images can also have multiple storage formats.
具体地,服务器可以利用图像特征提取算法提取医疗影像对应的影像特征。如利用方向 梯度直方图(Histogram of Oriented Gradient,HOG)特征提取算法、局部二值模式(Local Binary Pattern,LBP)特征提取算法、或者Haar特征提取算法等提取医疗影像对应的影像特征,如影像特征可为“凸起”、“色调暗”等。Specifically, the server may use an image feature extraction algorithm to extract image features corresponding to the medical image. For example, use the Histogram of Oriented Gradient (HOG) feature extraction algorithm, Local Binary Pattern (LBP) feature extraction algorithm, or Haar feature extraction algorithm to extract image features corresponding to medical images, such as image features It can be "raised", "dark tone" and so on.
进一步地,在对医疗影像进行特征提取之前还包括:服务器对获取的医疗影像进行预处理,如对医疗影像进行增强处理或者滤波处理等,以提高服务器对医疗影像进行特征提取的精度。Further, before the feature extraction of the medical image, it also includes: the server preprocesses the acquired medical image, such as performing enhancement processing or filtering processing on the medical image, so as to improve the accuracy of the feature extraction of the medical image by the server.
步骤230,提取医疗数据中的医疗文件,对医疗文件进行解析生成标注信息,查找标注信息中与医疗影像对应的影像标注信息。Step 230: Extract medical files in the medical data, analyze the medical files to generate annotation information, and search for image annotation information corresponding to the medical image in the annotation information.
医疗文件可为就医过程中产生的文字信息等,并且医疗文件可具有多种格式,如可为医疗纸质病历本或者医疗电子病历文档等,更加具体地,医疗电子病历文档可为excel、xml等文件格式。具体地,服务器从医疗数据中提取医疗文件,对医疗文件进行解析生成标注信息,标注信息可为医疗专业人员给出的诊断信息等。如医疗文件中可包含对医疗影像的标注信息,标注信息可为医疗专业人员给出的针对医疗影像的诊断信息。Medical documents can be textual information generated during medical treatment, etc., and medical documents can have multiple formats, such as medical paper medical records or medical electronic medical record documents, etc., more specifically, medical electronic medical record documents can be excel, xml And other file formats. Specifically, the server extracts medical documents from the medical data, analyzes the medical documents to generate annotation information, and the annotation information may be diagnostic information given by medical professionals. For example, the medical file may contain labeling information for the medical image, and the labeling information may be the diagnostic information for the medical image given by the medical professional.
具体地,服务器可利用关键词提取技术从医疗文件中提取标注信息,并且当获取的医疗文件为医疗纸质病历本时,服务器还包括对纸质病历本进行电子化处理,得到电子病历文档,然后从电子病历文档中解析出所需的标注信息,解析是指从医疗文件中提取所需信息的过程,标注信息可为病因、诊断建议或者诊断结果等。如服务器可以根据关键词匹配技术提取医疗文本中对医疗影像的标注信息,如针对该医疗影像的诊断结果对应的标注信息可以为“肿块”、“阴影”等。Specifically, the server may use keyword extraction technology to extract annotation information from medical documents, and when the acquired medical document is a medical paper medical record, the server also includes electronic processing of the paper medical record to obtain an electronic medical record document, Then the required annotation information is parsed from the electronic medical record document. Parsing refers to the process of extracting the required information from the medical file. The annotation information can be the cause of the disease, diagnosis suggestions, or diagnosis results. For example, the server can extract the annotation information of the medical image in the medical text according to the keyword matching technology, for example, the annotation information corresponding to the diagnosis result of the medical image may be "lumps", "shadows", and so on.
步骤240,根据影像特征以及影像标注信息建立映射关系,根据映射关系生成影像标注文件。Step 240: Establish a mapping relationship according to the image characteristics and the image annotation information, and generate an image annotation file according to the mapping relationship.
将影像特征与影像标注信息进行关联,建立映射关系,得到影像标注文件。如一条影像标注文件可为:阴影-肺囊肿,表明当服务器提取的医疗影像特征为“阴影”时,此时对应的影像标注信息应该为“肺囊肿”,实现根据医疗影像自动获取标注信息的功能。The image features are associated with the image annotation information, a mapping relationship is established, and the image annotation file is obtained. For example, an image annotation file can be: shadow-pulmonary cyst, indicating that when the medical image feature extracted by the server is "shadow", the corresponding image annotation information should be "pulmonary cyst" at this time, so that the annotation information can be automatically obtained from the medical image Function.
进一步地,服务器还可以将获取的影像标注文件进行数据处理,如进行数据清洗、数据治理、数据核验等,以使得获取的数据更加标准。Further, the server can also perform data processing on the acquired image annotation files, such as data cleaning, data management, data verification, etc., to make the acquired data more standard.
步骤250,将影像标注文件输入机器学习模型,以通过机器学习模型学习影像标注文件中影像特征与影像标注信息之间的特征关系,根据特征关系生成医疗影像识别模型。Step 250: Input the image annotation file into the machine learning model to learn the feature relationship between the image features in the image annotation file and the image annotation information through the machine learning model, and generate a medical image recognition model based on the feature relationship.
将预先获取的影像标注文件作为样本集,服务器利用样本集对机器学习模型进行训练得到影像特征与影像标注信息的关联关系,根据关联关系得到医疗影像识别模型。由于医疗影像识别模型中存储了影像特征与影像标注信息之间的对应关系,进而可以利用医疗影像识别模型实现对医疗影像特征或者影像标注信息的自动获取。Taking pre-acquired image annotation files as a sample set, the server uses the sample set to train the machine learning model to obtain the association relationship between the image features and the image annotation information, and obtain the medical image recognition model according to the association relationship. Since the medical image recognition model stores the correspondence between image features and image annotation information, the medical image recognition model can be used to realize automatic acquisition of medical image features or image annotation information.
在本实施例中,服务器提取医疗数据中的医疗影像以及医疗文本,提取医疗影像对应的影像特征以及医疗文本中的标注信息,实现了自动从医疗数据中提取医疗信息,提高对医疗数据的处理效率和准确性。并根据影像特征以及标注信息建立影像标注文件,将影像标注文件作为样本集对机器学习模型进行训练,得到医疗影像识别模型,进而可以利用医疗影像识 别模型实现对影像标注信息的自动提取,提高了对医疗数据的识别准确率以及识别效率。In this embodiment, the server extracts medical images and medical texts from medical data, extracts image features corresponding to medical images and annotation information in medical texts, realizes automatic extraction of medical information from medical data, and improves the processing of medical data. Efficiency and accuracy. According to the image features and annotation information, an image annotation file is established, and the image annotation file is used as a sample set to train the machine learning model to obtain a medical image recognition model. The medical image recognition model can then be used to automatically extract image annotation information, which improves The recognition accuracy and efficiency of medical data.
并且,在本实施例中不仅可以实现对医疗文本的自动处理,还可以实现对医疗影像的自动处理,提高了对不同格式的医疗数据处理的能力,并且医疗影像中具有更多的医疗特征信息,通过识别医疗影像数据也使得医疗数据的获取更加全面。Moreover, in this embodiment, not only can the automatic processing of medical texts be realized, but also the automatic processing of medical images can be realized, which improves the ability to process medical data in different formats, and the medical images have more medical feature information. , Recognizing medical image data also makes the acquisition of medical data more comprehensive.
在其中一个实施例中,根据影像特征以及影像标注信息建立映射关系,根据映射关系生成影像标注文件,包括:获取医疗影像对应的影像标识;查找影像标识对应的影像解析方法;利用影像解析方法对医疗影像关联的属性信息进行解析,生成影像标签信息;根据影像特征、影像标注信息以及影像标签信息建立映射关系,根据映射关系生成影像标注文件。In one of the embodiments, establishing a mapping relationship based on image features and image annotation information, and generating an image annotation file based on the mapping relationship includes: obtaining an image identifier corresponding to the medical image; searching for an image analysis method corresponding to the image identifier; The attribute information associated with the medical image is analyzed to generate image label information; the mapping relationship is established according to the image characteristics, the image annotation information, and the image label information, and the image annotation file is generated according to the mapping relationship.
影像标识用于唯一识别医疗影像的类别,医疗影像的类别可为影像的存储格式。不同的医疗影像可存储为不同的医疗影像格式,不同格式的医疗影像包含的信息各不相同,且使用场景也不一样,在传统技术中都是通过相应的客户端工具读取查看不同格式的医疗影像,这就需要用户下载多种工具且知道对应工具的使用说明,给用户带来了极大的不便,并且也使得对医疗影像查看的效率低下。The image identifier is used to uniquely identify the category of the medical image, and the category of the medical image can be the storage format of the image. Different medical images can be stored in different medical image formats. The information contained in different formats of medical images is different, and the usage scenarios are different. In traditional technology, they are read and viewed in different formats through corresponding client tools. Medical imaging requires users to download multiple tools and know the instructions for use of the corresponding tools, which brings great inconvenience to users and also makes the efficiency of viewing medical images inefficient.
具体地,不同格式的医疗影像需要不同的解析方法实现对医疗影像的解析。服务器从医疗数据中提取医疗影像,获取医疗影像对应的影像标识,调用与影像标识关联的影像解析方法,以根据影像解析方法对医疗影像关联的属性信息进行自动解析生成影像标签信息。例如存储格式可为DICOM、ndpi、tiff、jpg、bmp、png等。如对于DICOM格式的医疗影像,可利用基于C++的DCMTK、基于Java的dcm4che以及基于python的pydicom进行解析。再例如利用dcm4che解析DICOM文件的tag信息,利用opencv解析处理病理和眼底等医疗影像。Specifically, different formats of medical images require different analysis methods to realize the analysis of medical images. The server extracts the medical image from the medical data, obtains the image identifier corresponding to the medical image, and calls the image analysis method associated with the image identifier to automatically analyze the attribute information associated with the medical image according to the image analysis method to generate image tag information. For example, the storage format can be DICOM, ndpi, tiff, jpg, bmp, png, etc. For example, for medical images in DICOM format, C++-based DCMTK, Java-based dcm4che, and python-based pydicom can be used for analysis. Another example is the use of dcm4che to analyze the tag information of DICOM files, and the use of opencv to analyze and process medical images such as pathology and fundus.
利用影像解析方法对医疗影像对应的属性信息进行提取、解析,得到所需的影像标签信息。如医疗影像关联的属性信息可为编码信息,服务器根据查找得到的影像解析方法对编码信息进行解析、转换得到影像标签信息,如解析出来的影像标签信息为:Tag.PatientID--患者唯一标识、Tag.PatientName--患者姓名、Tag.PatientAge--患者年龄、Tag.PatientSex--患者性别、Tag.StudyID--检查标识、Tag.StudyDate--检查日期等。The image analysis method is used to extract and analyze the attribute information corresponding to the medical image to obtain the required image tag information. For example, the attribute information associated with a medical image can be encoded information, and the server parses and converts the encoded information according to the image analysis method found to obtain image tag information. For example, the parsed image tag information is: Tag. Tag.PatientName--patient name, Tag.PatientAge--patient age, Tag.PatientSex--patient gender, Tag.StudyID--examination ID, Tag.StudyDate--examination date, etc.
在本实施例中,通过识别医疗影像的格式,根据不同的格式选择对应的医疗影像解析方法,进而实现了利用不同的医疗影像解析方法识别不同格式下的医疗影像,提高了对医疗影像的识别范围以及对医疗影像格式的适应性,提高了对医疗影像的处理能力。解决传统技术中对于不同格式的医疗影像的查看过程需要获取其他医疗文件辅助说明,如标注结果、诊断结果、电子病历等,并且这些辅助文件格式、内容区别也是很大,使得用户对医疗影像的存储、管理、查看、对比等带来了极大不便。In this embodiment, by identifying the format of the medical image, and selecting the corresponding medical image analysis method according to different formats, different medical image analysis methods are used to identify medical images in different formats, which improves the recognition of medical images. The scope and adaptability to medical image formats have improved the processing capabilities of medical images. To solve the problem that the viewing process of different formats of medical images in the traditional technology needs to obtain other medical document auxiliary instructions, such as labeling results, diagnosis results, electronic medical records, etc., and the format and content of these auxiliary files are also very different, making users feel Storage, management, viewing, and comparison have brought great inconvenience.
在其中一个实施例中,对医疗影像进行特征提取得到影像特征,包括:提取医疗影像中的标注信息;获取标注信息对应的位置坐标;从医疗影像中提取与位置坐标对应的区域医疗影像;利用图像特征提取算法提取区域医疗影像的影像特征。In one of the embodiments, performing feature extraction on medical images to obtain image features includes: extracting annotation information in the medical image; obtaining location coordinates corresponding to the annotation information; extracting regional medical images corresponding to the location coordinates from the medical images; using Image feature extraction algorithm extracts image features of regional medical images.
医疗影像中的标注信息可以为对医疗影像中需要重点关注的区域进行标识的信息,具体地,标注信息可以基于已经脱敏的内部标注平台生成的xml、json等文件解析出。The annotation information in the medical image may be information for identifying areas that need to be focused on in the medical image. Specifically, the annotation information may be parsed based on files such as xml and json generated by an internal annotation platform that has been desensitized.
位置坐标可为标注信息在医疗影像中的坐标,服务器通过坐标可以获取需要重点关注区 域对应的区域医疗影像。The location coordinates can be the coordinates of the labeled information in the medical image, and the server can obtain the regional medical image corresponding to the area that needs to be focused on through the coordinates.
具体地,服务器可以利用HOG特征提取算法、LBP特征提取算法、Haar特征提取算法等自动提取区域医疗影像对应的影像特征,例如提取到的影像特征可为“凸起”、“色调暗”。服务器可以根据关键词匹配技术提取医疗文本中对区域医疗影像的标注信息,例如标注信息可以为“肿块”、“阴影”等。Specifically, the server may use HOG feature extraction algorithm, LBP feature extraction algorithm, Haar feature extraction algorithm, etc. to automatically extract image features corresponding to regional medical images. For example, the extracted image features can be "bulge" or "dark tone". The server can extract the labeling information of the regional medical image in the medical text according to the keyword matching technology, for example, the labeling information can be "lumps", "shadows", and so on.
在本实施例中,根据医疗影像中的坐标自动提取医疗影像中的区域医疗影像,实现自动化提取医疗影像中的重点区域,以及重点区域的区域特征,进而可以实现对区域医疗影像的操作步骤,提高了对医疗影像数据处理的效率。In this embodiment, the regional medical image in the medical image is automatically extracted according to the coordinates in the medical image, and the key areas in the medical image and the regional features of the key areas are automatically extracted, so that the operation steps of the regional medical image can be realized. Improve the efficiency of medical image data processing.
在其中一个实施例中,对医疗文件进行解析生成标注信息之后,还包括:将标注信息按照预设的规则分割为单元标注信息;将单元标注信息与标注信息数据库中的信息进行匹配,将匹配度最大的信息提取为标注信息。In one of the embodiments, after parsing the medical file to generate the annotation information, it further includes: dividing the annotation information into unit annotation information according to preset rules; matching the unit annotation information with the information in the annotation information database, and matching The information with the highest degree is extracted as label information.
服务器对医疗文件进行解析生成的标注信息可为专业人士,例如医生对患者的诊断建议,由于不同的医生对同一个疾病的描述可能会有不同的表述方式,故而此时得到的标注信息可能存在对同一个事项的文字表述不一致情况。The labeling information generated by the server parsing the medical documents can be professionals, such as doctors’ diagnosis suggestions for patients. Since different doctors may have different expressions for the same disease, the labeling information obtained at this time may exist. Inconsistent textual representations of the same matter.
为了对标注信息进行标准化处理,服务器可利用中文自然语言处理分词方法,如HanLP进行汉语分词处理,将获得的标注信息根据一定规则进行切分和调整,得到多个单元标注信息,然后将单元标注信息与标注信息数据库如词典中的词语进行匹配,将词典中匹配度最大的信息提取为标注信息。由于词典中存储的信息都是标准化的信息,故而利用词典匹配技术可以实现对标注信息的标准化处理。In order to standardize the labeling information, the server can use Chinese natural language processing word segmentation methods, such as HanLP for Chinese word segmentation, and segment and adjust the obtained labeling information according to certain rules to obtain multiple unit labeling information, and then label the unit The information is matched with words in an annotation information database such as a dictionary, and the information with the largest matching degree in the dictionary is extracted as annotation information. Since the information stored in the dictionary is standardized information, the use of dictionary matching technology can realize the standardized processing of labeling information.
如中文分词方法有基于字符串匹配的分词方法、基于理解的分词方法、基于统计的分词方法等。For example, Chinese word segmentation methods include word segmentation methods based on string matching, word segmentation methods based on understanding, and word segmentation methods based on statistics.
在本实施例中,基于HanLP进行汉语分词处理生成标注信息,实现了对标注信息的标准化处理,便于后续数据检索。In this embodiment, Chinese word segmentation is performed based on HanLP to generate labeling information, which realizes the standardization of labeling information and facilitates subsequent data retrieval.
在其中一个实施例中,根据映射关系生成影像标注文件之后,还包括:接收终端发送的查询请求,提取查询请求中携带的查询标识;提取影像标注文件对应的影像标识;计算查询标识与各影像标识的匹配度;将匹配度最大的影像标识关联的影像标注文件推送至终端。In one of the embodiments, after generating the image annotation file according to the mapping relationship, the method further includes: receiving the query request sent by the terminal, extracting the query identifier carried in the query request; extracting the image identifier corresponding to the image annotation file; calculating the query identifier and each image The matching degree of the logo; the image annotation file associated with the image logo with the largest matching degree is pushed to the terminal.
进一步地,根据各影像特征以及对应的影像标注信息建立影像标注文件后,还包括将各影像标注文件与对应的影像标识进行关联,生成影像标注文件库,即影像标注文件库中存储了不同影像标识对应的影像标注文件,如存储了“肺部—肺部影像标注文件、胃部-胃部影像标注文件等”。Further, after the image annotation file is established according to each image feature and corresponding image annotation information, it also includes associating each image annotation file with the corresponding image identifier to generate an image annotation file library, that is, the image annotation file library stores different images Mark the corresponding image annotation file, such as "lung-lung image annotation file, stomach-stomach image annotation file, etc." are stored.
当用户终端想要从影像标注文件库中查找医疗影像诊断建议时,用户终端向服务器发送查询请求,服务器根据获取到的查询请求从影像标注文件库中匹配到对应的影像标注文件,将影像标注文件推送至用户终端,进而用户可以跟据查询到的影像标注文件进行操作。例如,用户可以将自己的医疗影像与获取的影像标注文件中的医疗影像进行匹配,进而通过查询影像标注文件库中对医疗影像的标注信息,实现获取对个人病理情况的诊断建议。When the user terminal wants to find medical image diagnosis suggestions from the image annotation file library, the user terminal sends a query request to the server, and the server matches the corresponding image annotation file from the image annotation file library according to the obtained query request, and annotates the image The file is pushed to the user terminal, and then the user can perform operations based on the queried image annotation file. For example, users can match their own medical images with the medical images in the acquired image annotation files, and then query the annotation information of the medical images in the image annotation file library to obtain diagnosis suggestions for personal pathological conditions.
具体地,用户终端可向服务器发送查询请求,如可为影像查询请求。服务器提取查询请 求中携带的查询标识,如提取影像查询请求中携带的影像查询标识。然后服务器将获取的查询标识与影像标注文件对应的影像标识进行匹配,将匹配成功的影像标识关联的影像标注文件推送至用户终端,以实现用户通过查询请求就可以获取对应的影像标注文件的功能。Specifically, the user terminal may send a query request to the server, for example, it may be an image query request. The server extracts the query identifier carried in the query request, for example, extracts the image query identifier carried in the image query request. Then the server matches the obtained query identifier with the image identifier corresponding to the image annotation file, and pushes the image annotation file associated with the successfully matched image identifier to the user terminal, so as to realize the function that the user can obtain the corresponding image annotation file through the query request .
再如用户想要获取关于某一类型疾病对应的影像标注文件,只需根据查询请求就可以获取相应的文件,具体地,用户可将获取影像标注文件作为参考文件,可以将其关注的医疗影像与获取的参考影像标注文件中的医疗影像进行比对,进而可以获取关于其关注的医疗影像的参考影像特征以及参考影像标注信息,帮助用户及时获取专业的医疗影像建议。For another example, if the user wants to obtain the image annotation file corresponding to a certain type of disease, he can obtain the corresponding file only according to the query request. Specifically, the user can use the obtained image annotation file as a reference file, and can refer to the medical image of his concern. Compare with the medical images in the acquired reference image annotation files, and then obtain the reference image features and reference image annotation information of the medical images that they are concerned about, and help users obtain professional medical image suggestions in time.
在本实施例中,服务器获取用户的查询请求,进而可以向用户推送对应的影像标注文件,实现了影像标注文件的自动获取,提高了文件的获取效率。In this embodiment, the server obtains the user's query request, and then can push the corresponding image annotation file to the user, which realizes the automatic acquisition of the image annotation file and improves the efficiency of file acquisition.
如图3所示,提供了一种医疗影像识别模型生成方法以及使用方法的流程示意图,包括:As shown in Figure 3, a flow diagram of a method for generating and using a medical image recognition model is provided, including:
步骤310,获取医疗数据。Step 310: Obtain medical data.
步骤320,提取医疗数据中的医疗影像,对医疗影像进行特征提取得到影像特征。Step 320: Extract medical images from the medical data, and perform feature extraction on the medical images to obtain image features.
步骤330,提取医疗数据中的医疗文件,对医疗文件进行解析生成标注信息,查找标注信息中与医疗影像对应的影像标注信息。Step 330: Extract medical files in the medical data, analyze the medical files to generate annotation information, and search for image annotation information corresponding to the medical image in the annotation information.
步骤340,根据影像特征以及影像标注信息建立映射关系,根据映射关系生成影像标注文件。Step 340: Establish a mapping relationship according to the image characteristics and the image annotation information, and generate an image annotation file according to the mapping relationship.
步骤350,将影像标注文件输入机器学习模型,以通过机器学习模型学习影像标注文件中影像特征与影像标注信息之间的特征关系,根据特征关系生成医疗影像识别模型。Step 350: Input the image annotation file into the machine learning model to learn the feature relationship between the image features in the image annotation file and the image annotation information through the machine learning model, and generate a medical image recognition model based on the feature relationship.
关于步骤310至步骤350的具体实施例可以参考前文中步骤210至步骤250的描述,在此不做赘述。For specific embodiments of step 310 to step 350, reference may be made to the description of step 210 to step 250 in the foregoing, which is not repeated here.
在其中一个实施例中,生成医疗影像识别模型之后,还包括:In one of the embodiments, after generating the medical image recognition model, the method further includes:
步骤360,接收终端发送的医疗影像。Step 360: Receive the medical image sent by the terminal.
具体地,用户终端可以向服务器发送医疗影像,对用户终端发送的医疗影像的格式不做限制。服务器接收用户终端发送的医疗影像。Specifically, the user terminal may send medical images to the server, and there is no restriction on the format of the medical images sent by the user terminal. The server receives the medical images sent by the user terminal.
步骤370,将医疗影像输入医疗影像识别模型,以通过医疗影像识别模型提取医疗影像对应的影像特征,根据医疗影像特征得到影像标注信息。Step 370: Input the medical image into the medical image recognition model to extract image features corresponding to the medical image through the medical image recognition model, and obtain image annotation information based on the medical image features.
由于医疗影像识别模型具有识别医疗影像特征以及医疗影像标注信息的功能,故而服务器根据训练好的医疗影像识别模型对获取的医疗影像进行识别,进而得到医疗影像对应的影像标注信息,实现对医疗影像的自动识别。Since the medical image recognition model has the function of recognizing medical image characteristics and medical image annotation information, the server recognizes the acquired medical images according to the trained medical image recognition model, and then obtains the image annotation information corresponding to the medical image, and realizes the medical image Automatic recognition.
步骤380,将影像标注信息推送至终端。Step 380: Push the image annotation information to the terminal.
服务器将根据医疗影像识别模型识别到的影像标注信息推送至用户终端。The server pushes the image annotation information recognized according to the medical image recognition model to the user terminal.
在本实施例中,利用训练好的医疗影像识别模型可以实现对任何格式的医疗影像的识别,实现对医疗影像的自动识别以及自动标注,并且利用大数据自动训练得到的医疗影像识别模型得到的识别结果更加准确,识别效率更高。In this embodiment, the trained medical image recognition model can be used to recognize medical images in any format, realize automatic recognition and automatic labeling of medical images, and use the medical image recognition model obtained by automatic training of big data. The recognition result is more accurate and the recognition efficiency is higher.
如图4所示,提供了另一个实施例中医疗影像识别模型生成的流程示意图,包括:As shown in Fig. 4, a schematic diagram of the process of generating a medical image recognition model in another embodiment is provided, including:
步骤400,获取DICOM、ndpi、tiff、jpg、bmp、png等格式的文件。Step 400: Obtain files in DICOM, ndpi, tiff, jpg, bmp, png and other formats.
具体地,服务器可以获取多种格式的医疗文件,服务器可获取用户终端上传的文件。Specifically, the server can obtain medical files in multiple formats, and the server can obtain files uploaded by the user terminal.
步骤410,获取诊断文件、标注文件、电子病历。Step 410: Obtain a diagnosis file, an annotation file, and an electronic medical record.
具体地,服务器可以获取医疗文件中的诊断文件、标注文件以及电子病历等。服务器可获取用户终端上传的文件。Specifically, the server can obtain the diagnosis file, the annotation file, the electronic medical record, etc. in the medical file. The server can obtain the files uploaded by the user terminal.
步骤420,上传客户端。 Step 420, upload the client.
具体地,将在步骤400以及步骤410中获取的医疗文件,上传至客户端。然后服务器可以基于获取的多种格式的医疗文件调用相应的解析算法进行解析,得到步骤430中的医疗影像以及步骤440中的元数据信息,元数据信息可为标注信息,包括tag标签信息、诊断信息以及其他标注信息等。Specifically, the medical files obtained in step 400 and step 410 are uploaded to the client. Then the server can call the corresponding analysis algorithm for analysis based on the obtained medical files in multiple formats, and obtain the medical image in step 430 and the metadata information in step 440. The metadata information can be label information, including tag information and diagnosis. Information and other label information, etc.
步骤450,获取医疗标签库。Step 450: Obtain a medical tag library.
包括服务器对在步骤440中获取的元数据信息进行清洗、治理等操作,得到标准化后的元数据信息,根据标准化后的元数据信息生成医疗标签库,医疗标签库可为影像标注文件。This includes operations such as cleaning and management of the metadata information obtained in step 440 by the server to obtain standardized metadata information, and generating a medical tag library based on the standardized metadata information. The medical tag library may be an image annotation file.
步骤460,医疗影像管理平台。 Step 460, the medical image management platform.
服务器将在步骤430以及步骤450中获取的数据上传至医疗影像管理平台,医疗影像管理平台可为影像标注文件库,存储了多种类型的影像标注文件。然后利用医疗影像管理平台可以进行步骤470医疗影像检索、步骤480医疗影像导出以及步骤490中的智能标注功能。The server uploads the data acquired in step 430 and step 450 to the medical image management platform. The medical image management platform may be an image annotation file library, which stores multiple types of image annotation files. Then, the medical image management platform can be used to perform the medical image retrieval in step 470, the medical image export in step 480, and the intelligent labeling function in step 490.
应该理解的是,虽然图2-4的流程图中的各个步骤按照箭头的指示依次显示,但是这些步骤并不是必然按照箭头指示的顺序依次执行。除非本文中有明确的说明,这些步骤的执行并没有严格的顺序限制,这些步骤可以以其它的顺序执行。而且,图2-4中的至少一部分步骤可以包括多个子步骤或者多个阶段,这些子步骤或者阶段并不必然是在同一时刻执行完成,而是可以在不同的时刻执行,这些子步骤或者阶段的执行顺序也不必然是依次进行,而是可以与其它步骤或者其它步骤的子步骤或者阶段的至少一部分轮流或者交替地执行。It should be understood that although the various steps in the flowcharts of FIGS. 2-4 are displayed in sequence as indicated by the arrows, these steps are not necessarily performed in sequence in the order indicated by the arrows. Unless specifically stated in this article, the execution of these steps is not strictly limited in order, and these steps can be executed in other orders. Moreover, at least part of the steps in Figures 2-4 may include multiple sub-steps or multiple stages. These sub-steps or stages are not necessarily executed at the same time, but can be executed at different times. These sub-steps or stages The execution order of is not necessarily performed sequentially, but may be performed alternately or alternately with at least a part of other steps or sub-steps or stages of other steps.
在其中一个实施例中,如图5所示,提供了一种医疗影像识别模型生成装置,包括:数据获取模块510、特征提取模块520、标注生成模块530、影像文件生成模块540以及模型生成模块550,其中:In one of the embodiments, as shown in FIG. 5, a medical image recognition model generation device is provided, including: a data acquisition module 510, a feature extraction module 520, an annotation generation module 530, an image file generation module 540, and a model generation module 550, of which:
数据获取模块510,用于获取医疗数据。The data acquisition module 510 is used to acquire medical data.
特征提取模块520,用于提取所述医疗数据中的医疗影像,对所述医疗影像进行特征提取得到影像特征。The feature extraction module 520 is configured to extract medical images in the medical data, and perform feature extraction on the medical images to obtain image features.
标注生成模块530,用于提取所述医疗数据中的医疗文件,对所述医疗文件进行解析生成标注信息,查找所述标注信息中与所述医疗影像对应的影像标注信息。The annotation generating module 530 is configured to extract medical documents in the medical data, analyze the medical documents to generate annotation information, and search for image annotation information corresponding to the medical image in the annotation information.
影像文件生成模块540,用于根据所述影像特征以及所述影像标注信息建立映射关系,根据所述映射关系生成影像标注文件。The image file generating module 540 is configured to establish a mapping relationship according to the image characteristics and the image annotation information, and generate an image annotation file according to the mapping relationship.
模型生成模块550,用于将所述影像标注文件输入机器学习模型,以通过机器学习模型学习所述影像标注文件中所述影像特征与所述影像标注信息之间的特征关系,根据所述特征关系生成医疗影像识别模型。The model generation module 550 is configured to input the image annotation file into a machine learning model to learn the feature relationship between the image feature in the image annotation file and the image annotation information through the machine learning model, and according to the feature Relationship generation medical image recognition model.
在其中一个实施例中,所述特征提取模块520包括:In one of the embodiments, the feature extraction module 520 includes:
影像标注信息提取单元,用于提取所述医疗影像中的标注信息。The image annotation information extraction unit is used to extract the annotation information in the medical image.
坐标获取单元,用于获取所述标注信息对应的位置坐标。The coordinate acquisition unit is used to acquire the position coordinates corresponding to the label information.
区域影像提取单元,用于从医疗影像中提取与所述位置坐标对应的区域医疗影像。The regional image extraction unit is used to extract regional medical images corresponding to the position coordinates from the medical images.
区域特征提取单元,用于利用图像特征提取算法提取所述区域医疗影像的影像特征。The regional feature extraction unit is used to extract image features of the regional medical image by using an image feature extraction algorithm.
在其中一个实施例中,所述装置,还包括:In one of the embodiments, the device further includes:
分割模块,用于将所述标注信息按照预设的规则分割为单元标注信息。The segmentation module is configured to segment the label information into unit label information according to preset rules.
信息提取模块,用于提取将所述单元标注信息与标注信息数据库中的信息进行匹配,将匹配度最大的信息提取为标注信息。The information extraction module is configured to extract and match the unit label information with the information in the label information database, and extract the information with the greatest degree of matching as label information.
在其中一个实施例中,影像文件生成模块540,包括:In one of the embodiments, the image file generating module 540 includes:
标识获取单元,用于获取所述医疗影像对应的影像标识。The identification acquiring unit is used to acquire the image identification corresponding to the medical image.
方法查找单元,用于查找所述影像标识对应的影像解析方法。The method search unit is used to search for the image analysis method corresponding to the image identifier.
信息生成单元,用于利用所述影像解析方法对所述医疗影像关联的属性信息进行解析,生成影像标签信息。The information generating unit is configured to analyze the attribute information associated with the medical image by using the image analysis method to generate image tag information.
文件生成单元,用于根据所述影像特征、所述影像标注信息以及所述影像标签信息建立映射关系,根据映射关系生成影像标注文件。The file generating unit is configured to establish a mapping relationship according to the image characteristics, the image annotation information, and the image tag information, and generate an image annotation file according to the mapping relationship.
在其中一个实施例中,所述装置还包括:In one of the embodiments, the device further includes:
查询标识提取模块,用于接收终端发送的查询请求,提取所述查询请求中携带的查询标识。The query identifier extraction module is used to receive the query request sent by the terminal, and extract the query identifier carried in the query request.
标识提取模块,用于提取所述影像标注文件对应的影像标识。The identification extraction module is used to extract the image identification corresponding to the image annotation file.
计算模块,用于计算所述查询标识与各所述影像标识的匹配度。The calculation module is used to calculate the matching degree between the query identifier and each of the image identifiers.
第一推送模块,用于将所述匹配度最大的所述影像标识关联的所述影像标注文件推送至终端。The first pushing module is configured to push the image annotation file associated with the image identifier with the greatest matching degree to the terminal.
在其中一个实施例中,所述装置还包括:In one of the embodiments, the device further includes:
接收模块,用于接收终端发送的医疗影像。The receiving module is used to receive medical images sent by the terminal.
标注信息识别模块,用于将所述医疗影像输入所述医疗影像识别模型,以通过所述医疗影像识别模型提取所述医疗影像对应的所述影像特征,根据所述医疗影像特征得到所述影像标注信息。The annotation information recognition module is used to input the medical image into the medical image recognition model to extract the image feature corresponding to the medical image through the medical image recognition model, and obtain the image according to the medical image feature Annotate information.
第二推送模块,用于将影像标注信息推送至所述终端。The second push module is used to push the image annotation information to the terminal.
关于医疗影像识别模型生成装置的具体限定可以参见上文中对于医疗影像识别模型生成方法的限定,在此不再赘述。上述医疗影像识别模型生成装置中的各个模块可全部或部分通过软件、硬件及其组合来实现。上述各模块可以硬件形式内嵌于或独立于计算机设备中的处理器中,也可以以软件形式存储于计算机设备中的存储器中,以便于处理器调用执行以上各个模块对应的操作。Regarding the specific limitations of the medical image recognition model generating device, please refer to the above limitation on the medical image recognition model generating method, which will not be repeated here. Each module in the above-mentioned medical image recognition model generating device can be implemented in whole or in part by software, hardware, and a combination thereof. The above-mentioned modules may be embedded in the form of hardware or independent of the processor in the computer equipment, or may be stored in the memory of the computer equipment in the form of software, so that the processor can call and execute the operations corresponding to the above-mentioned modules.
在其中一个实施例中,提供了一种计算机设备,该计算机设备可以是服务器,其内部结构图可以如图6所示。该计算机设备包括通过系统总线连接的处理器、存储器、网络接口和数据库。该计算机设备的处理器用于提供计算和控制能力。该计算机设备的存储器包括非易 失性或易失性存储介质、内存储器。该非易失性或易失性存储介质存储有操作系统、计算机可读指令和数据库。该内存储器为非易失性或易失性存储介质中的操作系统和计算机可读指令的运行提供环境。该计算机设备的数据库用于医疗数据处理相关数据。该计算机设备的网络接口用于与外部的终端通过网络连接通信。该计算机可读指令被处理器执行时以实现一种医疗影像识别模型生成方法。In one of the embodiments, a computer device is provided. The computer device may be a server, and its internal structure diagram may be as shown in FIG. 6. The computer equipment includes a processor, a memory, a network interface, and a database connected through a system bus. The processor of the computer device is used to provide calculation and control capabilities. The memory of the computer device includes non-volatile or volatile storage media and internal memory. The non-volatile or volatile storage medium stores an operating system, computer readable instructions, and a database. The internal memory provides an environment for the operation of the operating system and computer-readable instructions in the non-volatile or volatile storage medium. The computer equipment database is used for medical data processing related data. The network interface of the computer device is used to communicate with an external terminal through a network connection. When the computer-readable instructions are executed by the processor, a method for generating a medical image recognition model is realized.
本领域技术人员可以理解,图6中示出的结构,仅仅是与本申请方案相关的部分结构的框图,并不构成对本申请方案所应用于其上的计算机设备的限定,具体的计算机设备可以包括比图中所示更多或更少的部件,或者组合某些部件,或者具有不同的部件布置。Those skilled in the art can understand that the structure shown in FIG. 6 is only a block diagram of part of the structure related to the solution of the present application, and does not constitute a limitation on the computer device to which the solution of the present application is applied. The specific computer device may Including more or fewer parts than shown in the figure, or combining some parts, or having a different arrangement of parts.
一种计算机设备,包括存储器和一个或多个处理器,存储器中储存有计算机可读指令,计算机可读指令被处理器执行时,使得一个或多个处理器执行以下步骤:获取医疗数据;提取所述医疗数据中的医疗影像,对所述医疗影像进行特征提取得到影像特征;提取所述医疗数据中的医疗文件,对所述医疗文件进行解析生成标注信息,查找所述标注信息中与所述医疗影像对应的影像标注信息;根据所述影像特征以及所述影像标注信息建立映射关系,根据所述映射关系生成影像标注文件;及将所述影像标注文件输入机器学习模型,以通过机器学习模型学习所述影像标注文件中所述影像特征与所述影像标注信息之间的特征关系,根据所述特征关系生成医疗影像识别模型。A computer device includes a memory and one or more processors. The memory stores computer-readable instructions. When the computer-readable instructions are executed by the processor, the one or more processors perform the following steps: acquiring medical data; extracting For the medical images in the medical data, feature extraction is performed on the medical images to obtain image features; the medical files in the medical data are extracted, the medical files are parsed to generate annotation information, and the annotation information is searched for and the The image annotation information corresponding to the medical image; establish a mapping relationship according to the image characteristics and the image annotation information, and generate an image annotation file according to the mapping relationship; and input the image annotation file into a machine learning model to pass machine learning The model learns the feature relationship between the image feature and the image tag information in the image tag file, and generates a medical image recognition model according to the feature relationship.
在其中一个实施例中,该处理器执行计算机可读指令时实现所述对医疗影像进行特征提取得到影像特征的步骤时还用于:提取所述医疗影像中的标注信息;获取所述标注信息对应的位置坐标;从医疗影像中提取与所述位置坐标对应的区域医疗影像;及利用图像特征提取算法提取所述区域医疗影像的影像特征。In one of the embodiments, when the processor executes the computer-readable instructions to implement the step of extracting features from the medical image to obtain image features, it is also used to: extract the annotation information in the medical image; and obtain the annotation information Corresponding location coordinates; extracting regional medical images corresponding to the location coordinates from the medical images; and extracting image features of the regional medical images using an image feature extraction algorithm.
在其中一个实施例中,该处理器执行计算机可读指令时实现所述对所述医疗文件进行解析生成标注信息之后还用于:将所述标注信息按照预设的规则分割为单元标注信息;及将所述单元标注信息与标注信息数据库中的信息进行匹配,将匹配度最大的信息提取为标注信息。In one of the embodiments, when the processor executes the computer-readable instructions, after the processor executes the parsing of the medical file to generate annotation information, it is further used to: divide the annotation information into unit annotation information according to a preset rule; And matching the unit labeling information with the information in the labeling information database, and extracting the information with the greatest degree of matching as labeling information.
在其中一个实施例中,该处理器执行计算机可读指令时实现所述根据所述影像特征以及所述影像标注信息建立映射关系,根据所述映射关系生成影像标注文件的步骤时还用于:获取所述医疗影像对应的影像标识;查找所述影像标识对应的影像解析方法;利用所述影像解析方法对所述医疗影像关联的属性信息进行解析,生成影像标签信息;及根据所述影像特征、所述影像标注信息以及所述影像标签信息建立映射关系,根据映射关系生成影像标注文件。In one of the embodiments, when the processor executes the computer-readable instructions, the step of establishing a mapping relationship based on the image characteristics and the image annotation information, and generating an image annotation file based on the mapping relationship is further used for: Obtain the image identifier corresponding to the medical image; find the image analysis method corresponding to the image identifier; use the image analysis method to analyze the attribute information associated with the medical image to generate image tag information; and according to the image feature Establishing a mapping relationship between the image annotation information and the image tag information, and generating an image annotation file according to the mapping relationship.
在其中一个实施例中,该处理器执行计算机可读指令时实现所述根据所述映射关系生成影像标注文件之后还用于:接收终端发送的查询请求,提取所述查询请求中携带的查询标识;提取所述影像标注文件对应的影像标识;计算所述查询标识与各所述影像标识的匹配度;及将所述匹配度最大的所述影像标识关联的所述影像标注文件推送至终端。In one of the embodiments, when the processor executes the computer-readable instruction, after the image annotation file is generated according to the mapping relationship, the processor is further used to: receive the query request sent by the terminal, and extract the query identifier carried in the query request Extracting the image identification corresponding to the image annotation file; calculating the matching degree between the query identification and each of the image identification; and pushing the image annotation file associated with the image identification with the largest matching degree to the terminal.
在其中一个实施例中,该处理器执行计算机可读指令时实现所述生成医疗影像识别模型之后还用于:接收终端发送的医疗影像;将所述医疗影像输入所述医疗影像识别模型,以通过所述医疗影像识别模型提取所述医疗影像对应的所述影像特征,根据所述医疗影像特征得 到所述影像标注信息;及将影像标注信息推送至所述终端。In one of the embodiments, when the processor executes the computer-readable instructions to realize the generation of the medical image recognition model, it is further used to: receive the medical image sent by the terminal; input the medical image into the medical image recognition model to Extracting the image feature corresponding to the medical image through the medical image recognition model, obtaining the image annotation information according to the medical image feature; and pushing the image annotation information to the terminal.
一个或多个存储有计算机可读指令的计算机可读存储介质,计算机可读指令被一个或多个处理器执行时,使得一个或多个处理器执行以下步骤:获取医疗数据;提取所述医疗数据中的医疗影像,对所述医疗影像进行特征提取得到影像特征;提取所述医疗数据中的医疗文件,对所述医疗文件进行解析生成标注信息,查找所述标注信息中与所述医疗影像对应的影像标注信息;根据所述影像特征以及所述影像标注信息建立映射关系,根据所述映射关系生成影像标注文件;及将所述影像标注文件输入机器学习模型,以通过机器学习模型学习所述影像标注文件中所述影像特征与所述影像标注信息之间的特征关系,根据所述特征关系生成医疗影像识别模型。One or more computer-readable storage media storing computer-readable instructions. When the computer-readable instructions are executed by one or more processors, the one or more processors perform the following steps: acquiring medical data; extracting the medical The medical image in the data is extracted from the medical image to obtain the image feature; the medical file in the medical data is extracted, the medical file is parsed to generate annotation information, and the annotation information is searched for in the annotation information and the medical image Corresponding image annotation information; establish a mapping relationship according to the image features and the image annotation information, and generate an image annotation file according to the mapping relationship; and input the image annotation file into a machine learning model to learn all the information through the machine learning model According to the feature relationship between the image feature and the image tag information in the image tag file, a medical image recognition model is generated according to the feature relationship.
该计算机可读存储介质可以是非易失性,也可以是易失性的。The computer-readable storage medium may be non-volatile or volatile.
在其中一个实施例中,计算机可读指令被处理器执行时实现所述对医疗影像进行特征提取得到影像特征的步骤时还用于:提取所述医疗影像中的标注信息;获取所述标注信息对应的位置坐标;从医疗影像中提取与所述位置坐标对应的区域医疗影像;及利用图像特征提取算法提取所述区域医疗影像的影像特征。In one of the embodiments, when the computer-readable instruction is executed by the processor to implement the step of extracting features from the medical image to obtain image features, it is also used to: extract the annotation information in the medical image; and obtain the annotation information Corresponding location coordinates; extracting regional medical images corresponding to the location coordinates from the medical images; and extracting image features of the regional medical images using an image feature extraction algorithm.
在其中一个实施例中,计算机可读指令被处理器执行时实现所述对所述医疗文件进行解析生成标注信息之后还用于:将所述标注信息按照预设的规则分割为单元标注信息;及将所述单元标注信息与标注信息数据库中的信息进行匹配,将匹配度最大的信息提取为标注信息。In one of the embodiments, when the computer-readable instruction is executed by the processor, after realizing the parsing of the medical file to generate annotation information, it is further used to: divide the annotation information into unit annotation information according to a preset rule; And matching the unit labeling information with the information in the labeling information database, and extracting the information with the greatest degree of matching as labeling information.
在其中一个实施例中,计算机可读指令被处理器执行时实现所述根据所述影像特征以及所述影像标注信息建立映射关系,根据所述映射关系生成影像标注文件的步骤时还用于:获取所述医疗影像对应的影像标识;查找所述影像标识对应的影像解析方法;利用所述影像解析方法对所述医疗影像关联的属性信息进行解析,生成影像标签信息;及根据所述影像特征、所述影像标注信息以及所述影像标签信息建立映射关系,根据映射关系生成影像标注文件。In one of the embodiments, when the computer-readable instructions are executed by the processor, the step of establishing a mapping relationship based on the image characteristics and the image annotation information, and generating an image annotation file based on the mapping relationship is further used to: Obtain the image identifier corresponding to the medical image; find the image analysis method corresponding to the image identifier; use the image analysis method to analyze the attribute information associated with the medical image to generate image tag information; and according to the image feature Establishing a mapping relationship between the image annotation information and the image tag information, and generating an image annotation file according to the mapping relationship.
在其中一个实施例中,计算机可读指令被处理器执行时实现所述根据所述映射关系生成影像标注文件之后还用于:接收终端发送的查询请求,提取所述查询请求中携带的查询标识;提取所述影像标注文件对应的影像标识;计算所述查询标识与各所述影像标识的匹配度;及将所述匹配度最大的所述影像标识关联的所述影像标注文件推送至终端。In one of the embodiments, when the computer-readable instruction is executed by the processor, after the image annotation file is generated according to the mapping relationship, it is further used to: receive a query request sent by the terminal, and extract the query identifier carried in the query request Extracting the image identification corresponding to the image annotation file; calculating the matching degree between the query identification and each of the image identification; and pushing the image annotation file associated with the image identification with the largest matching degree to the terminal.
在其中一个实施例中,计算机可读指令被处理器执行时实现所述生成医疗影像识别模型之后还用于:接收终端发送的医疗影像;将所述医疗影像输入所述医疗影像识别模型,以通过所述医疗影像识别模型提取所述医疗影像对应的所述影像特征,根据所述医疗影像特征得到所述影像标注信息;及将影像标注信息推送至所述终端。In one of the embodiments, when the computer-readable instruction is executed by the processor to realize the generation of the medical image recognition model, it is further used to: receive the medical image sent by the terminal; input the medical image into the medical image recognition model to Extracting the image feature corresponding to the medical image through the medical image recognition model, obtaining the image annotation information according to the medical image feature; and pushing the image annotation information to the terminal.
本领域普通技术人员可以理解实现上述实施例方法中的全部或部分流程,是可以通过计算机可读指令来指令相关的硬件来完成,所述的计算机可读指令可存储于一计算机可读取存储介质中,该计算机可读指令在执行时,可包括如上述各方法的实施例的流程。本申请所提供的各实施例中所使用的对存储器、存储、数据库或其它介质的任何引用,均可包括非易失性和/或易失性存储器。非易失性存储器可包括只读存储器(ROM)、可编程ROM(PROM)、 电可编程ROM(EPROM)、电可擦除可编程ROM(EEPROM)或闪存。易失性存储器可包括随机存取存储器(RAM)或者外部高速缓冲存储器。作为说明而非局限,RAM以多种形式可得,诸如静态RAM(SRAM)、动态RAM(DRAM)、同步DRAM(SDRAM)、双数据率SDRAM(DDRSDRAM)、增强型SDRAM(ESDRAM)、同步链路(Synchlink)DRAM(SLDRAM)、存储器总线(Rambus)直接RAM(RDRAM)、直接存储器总线动态RAM(DRDRAM)、以及存储器总线动态RAM(RDRAM)等。A person of ordinary skill in the art can understand that all or part of the processes in the above-mentioned embodiment methods can be implemented by instructing relevant hardware through computer-readable instructions. The computer-readable instructions can be stored in a computer-readable storage. In the medium, when the computer-readable instructions are executed, they may include the processes of the above-mentioned method embodiments. Any reference to memory, storage, database or other media used in the embodiments provided in this application may include non-volatile and/or volatile memory. Non-volatile memory may include read only memory (ROM), programmable ROM (PROM), electrically programmable ROM (EPROM), electrically erasable programmable ROM (EEPROM), or flash memory. Volatile memory may include random access memory (RAM) or external cache memory. As an illustration and not a limitation, RAM is available in many forms, such as static RAM (SRAM), dynamic RAM (DRAM), synchronous DRAM (SDRAM), double data rate SDRAM (DDRSDRAM), enhanced SDRAM (ESDRAM), synchronous chain Channel (Synchlink) DRAM (SLDRAM), memory bus (Rambus) direct RAM (RDRAM), direct memory bus dynamic RAM (DRDRAM), and memory bus dynamic RAM (RDRAM), etc.
以上实施例的各技术特征可以进行任意的组合,为使描述简洁,未对上述实施例中的各个技术特征所有可能的组合都进行描述,然而,只要这些技术特征的组合不存在矛盾,都应当认为是本说明书记载的范围。The technical features of the above embodiments can be combined arbitrarily. In order to make the description concise, all possible combinations of the technical features in the above embodiments are not described. However, as long as there is no contradiction in the combination of these technical features, they should be It is considered as the range described in this specification.
以上所述实施例仅表达了本申请的几种实施方式,其描述较为具体和详细,但并不能因此而理解为对发明专利范围的限制。应当指出的是,对于本领域的普通技术人员来说,在不脱离本申请构思的前提下,还可以做出若干变形和改进,这些都属于本申请的保护范围。因此,本申请专利的保护范围应以所附权利要求为准。The above-mentioned embodiments only express several implementation manners of the present application, and the description is relatively specific and detailed, but it should not be understood as a limitation on the scope of the invention patent. It should be pointed out that for those of ordinary skill in the art, without departing from the concept of this application, several modifications and improvements can be made, and these all fall within the protection scope of this application. Therefore, the scope of protection of the patent of this application shall be subject to the appended claims.

Claims (19)

  1. 一种医疗影像识别模型生成方法,包括:A method for generating a medical image recognition model, including:
    获取医疗数据;Access to medical data;
    提取所述医疗数据中的医疗影像,对所述医疗影像进行特征提取得到影像特征;Extracting medical images in the medical data, and performing feature extraction on the medical images to obtain image features;
    提取所述医疗数据中的医疗文件,对所述医疗文件进行解析生成标注信息,查找所述标注信息中与所述医疗影像对应的影像标注信息;Extracting medical documents in the medical data, analyzing the medical documents to generate annotation information, and searching for image annotation information corresponding to the medical image in the annotation information;
    根据所述影像特征以及所述影像标注信息建立映射关系,根据所述映射关系生成影像标注文件;及Establishing a mapping relationship according to the image characteristics and the image annotation information, and generating an image annotation file according to the mapping relationship; and
    将所述影像标注文件输入机器学习模型,以通过机器学习模型学习所述影像标注文件中所述影像特征与所述影像标注信息之间的特征关系,根据所述特征关系生成医疗影像识别模型。The image annotation file is input into a machine learning model to learn the feature relationship between the image features in the image annotation file and the image annotation information through the machine learning model, and a medical image recognition model is generated according to the feature relationship.
  2. 根据权利要求1所述的方法,其中,所述对所述医疗影像进行特征提取得到影像特征,包括:The method according to claim 1, wherein said performing feature extraction on said medical image to obtain image features comprises:
    提取所述医疗影像中的标注信息;Extract the annotation information in the medical image;
    获取所述标注信息对应的位置坐标;Acquiring position coordinates corresponding to the label information;
    从医疗影像中提取与所述位置坐标对应的区域医疗影像;及Extracting regional medical images corresponding to the position coordinates from the medical images; and
    利用图像特征提取算法提取所述区域医疗影像的影像特征。The image feature extraction algorithm is used to extract the image feature of the regional medical image.
  3. 根据权利要求1所述的方法,其中,所述对所述医疗文件进行解析生成标注信息之后,所述方法还包括:The method according to claim 1, wherein after the analysis of the medical file to generate annotation information, the method further comprises:
    将所述标注信息按照预设的规则分割为单元标注信息;及Dividing the label information into unit label information according to preset rules; and
    将所述单元标注信息与标注信息数据库中的信息进行匹配,将匹配度最大的信息提取为标注信息。The unit labeling information is matched with the information in the labeling information database, and the information with the largest matching degree is extracted as labeling information.
  4. 根据权利要求1所述的方法,其中,所述根据所述影像特征以及所述影像标注信息建立映射关系,根据所述映射关系生成影像标注文件,包括:The method according to claim 1, wherein the establishing a mapping relationship according to the image characteristics and the image annotation information, and generating an image annotation file according to the mapping relationship, comprises:
    获取所述医疗影像对应的影像标识;Acquiring an image identifier corresponding to the medical image;
    查找所述影像标识对应的影像解析方法;Searching for an image analysis method corresponding to the image identifier;
    利用所述影像解析方法对所述医疗影像关联的属性信息进行解析,生成影像标签信息;及Using the image analysis method to analyze the attribute information associated with the medical image to generate image tag information; and
    根据所述影像特征、所述影像标注信息以及所述影像标签信息建立映射关系,根据映射关系生成影像标注文件。A mapping relationship is established according to the image feature, the image annotation information, and the image tag information, and an image annotation file is generated according to the mapping relationship.
  5. 根据权利要求1或4所述的方法,其中,所述根据所述映射关系生成影像标注文件之后,所述方法还包括:The method according to claim 1 or 4, wherein, after the image annotation file is generated according to the mapping relationship, the method further comprises:
    接收终端发送的查询请求,提取所述查询请求中携带的查询标识;Receiving the query request sent by the terminal, and extracting the query identifier carried in the query request;
    提取所述影像标注文件对应的影像标识;Extracting the image identifier corresponding to the image annotation file;
    计算所述查询标识与各所述影像标识的匹配度;及Calculating the degree of matching between the query identifier and each of the image identifiers; and
    将所述匹配度最大的所述影像标识关联的所述影像标注文件推送至终端。Push the image annotation file associated with the image identifier with the greatest degree of matching to the terminal.
  6. 根据权利要求1所述的方法,其中,所述生成医疗影像识别模型之后,所述方法还包括:The method according to claim 1, wherein after said generating the medical image recognition model, the method further comprises:
    接收终端发送的医疗影像;Receive medical images sent by the terminal;
    将所述医疗影像输入所述医疗影像识别模型,以通过所述医疗影像识别模型提取所述医疗影像对应的所述影像特征,根据所述医疗影像特征得到所述影像标注信息;及Inputting the medical image into the medical image recognition model to extract the image feature corresponding to the medical image through the medical image recognition model, and obtain the image annotation information according to the medical image feature; and
    将影像标注信息推送至所述终端。Push the image annotation information to the terminal.
  7. 一种医疗影像识别模型生成装置,包括:A medical image recognition model generation device, including:
    数据获取模块,用于获取医疗数据;Data acquisition module for acquiring medical data;
    特征提取模块,用于提取所述医疗数据中的医疗影像,对所述医疗影像进行特征提取得到影像特征;The feature extraction module is used to extract medical images in the medical data, and perform feature extraction on the medical images to obtain image features;
    标注生成模块,用于提取所述医疗数据中的医疗文件,对所述医疗文件进行解析生成标注信息,查找所述标注信息中与所述医疗影像对应的影像标注信息;An annotation generation module, configured to extract medical documents in the medical data, analyze the medical documents to generate annotation information, and search for image annotation information corresponding to the medical image in the annotation information;
    影像文件生成模块,用于根据所述影像特征以及所述影像标注信息建立映射关系,根据所述映射关系生成影像标注文件;及An image file generating module, configured to establish a mapping relationship according to the image characteristics and the image annotation information, and generate an image annotation file according to the mapping relationship; and
    模型生成模块,用于将所述影像标注文件输入机器学习模型,以通过机器学习模型学习所述影像标注文件中所述影像特征与所述影像标注信息之间的特征关系,根据所述特征关系生成医疗影像识别模型。The model generation module is configured to input the image annotation file into a machine learning model to learn the feature relationship between the image feature in the image annotation file and the image annotation information through the machine learning model, and according to the feature relationship Generate medical image recognition model.
  8. 一种计算机设备,包括存储器及一个或多个处理器,所述存储器中储存有计算机可读指令,所述计算机可读指令被所述一个或多个处理器执行时,使得所述一个或多个处理器执行以下步骤:A computer device includes a memory and one or more processors. The memory stores computer-readable instructions. When the computer-readable instructions are executed by the one or more processors, the one or more Each processor performs the following steps:
    获取医疗数据;Access to medical data;
    提取所述医疗数据中的医疗影像,对所述医疗影像进行特征提取得到影像特征;Extracting medical images in the medical data, and performing feature extraction on the medical images to obtain image features;
    提取所述医疗数据中的医疗文件,对所述医疗文件进行解析生成标注信息,查找所述标注信息中与所述医疗影像对应的影像标注信息;Extracting medical documents in the medical data, analyzing the medical documents to generate annotation information, and searching for image annotation information corresponding to the medical image in the annotation information;
    根据所述影像特征以及所述影像标注信息建立映射关系,根据所述映射关系生成影像标注文件;及Establishing a mapping relationship according to the image characteristics and the image annotation information, and generating an image annotation file according to the mapping relationship; and
    将所述影像标注文件输入机器学习模型,以通过机器学习模型学习所述影像标注文件中所述影像特征与所述影像标注信息之间的特征关系,根据所述特征关系生成医疗影像识别模型。The image annotation file is input into a machine learning model to learn the feature relationship between the image features in the image annotation file and the image annotation information through the machine learning model, and a medical image recognition model is generated according to the feature relationship.
  9. 根据权利要求8所述的计算机设备,其中,所述处理器执行所述计算机可读指令时所实现的所述对所述医疗影像进行特征提取得到影像特征,包括:8. The computer device according to claim 8, wherein the feature extraction of the medical image to obtain the image feature, which is implemented when the processor executes the computer readable instruction, comprises:
    提取所述医疗影像中的标注信息;Extract the annotation information in the medical image;
    获取所述标注信息对应的位置坐标;Acquiring position coordinates corresponding to the label information;
    从医疗影像中提取与所述位置坐标对应的区域医疗影像;及Extracting regional medical images corresponding to the position coordinates from the medical images; and
    利用图像特征提取算法提取所述区域医疗影像的影像特征。The image feature extraction algorithm is used to extract the image feature of the regional medical image.
  10. 根据权利要求8所述的计算机设备,其中,所述处理器执行所述计算机可读指令时 所实现的所述对所述医疗文件进行解析生成标注信息之后,还包括:The computer device according to claim 8, wherein, after the analysis of the medical file to generate annotation information, which is realized when the processor executes the computer-readable instruction, further comprises:
    将所述标注信息按照预设的规则分割为单元标注信息;及Dividing the label information into unit label information according to preset rules; and
    将所述单元标注信息与标注信息数据库中的信息进行匹配,将匹配度最大的信息提取为标注信息。The unit labeling information is matched with the information in the labeling information database, and the information with the largest matching degree is extracted as labeling information.
  11. 根据权利要求8所述的计算机设备,其中,所述处理器执行所述计算机可读指令时所实现的所述根据所述影像特征以及所述影像标注信息建立映射关系,根据所述映射关系生成影像标注文件,包括:8. The computer device according to claim 8, wherein the establishment of a mapping relationship based on the image feature and the image annotation information, which is implemented when the processor executes the computer-readable instruction, is generated based on the mapping relationship Image annotation files, including:
    获取所述医疗影像对应的影像标识;Acquiring an image identifier corresponding to the medical image;
    查找所述影像标识对应的影像解析方法;Searching for an image analysis method corresponding to the image identifier;
    利用所述影像解析方法对所述医疗影像关联的属性信息进行解析,生成影像标签信息;及Using the image analysis method to analyze the attribute information associated with the medical image to generate image tag information; and
    根据所述影像特征、所述影像标注信息以及所述影像标签信息建立映射关系,根据映射关系生成影像标注文件。A mapping relationship is established according to the image feature, the image annotation information, and the image tag information, and an image annotation file is generated according to the mapping relationship.
  12. 根据权利要求8或11所述的计算机设备,其中,所述处理器执行所述计算机可读指令时所实现的所述根据所述映射关系生成影像标注文件之后,还包括:The computer device according to claim 8 or 11, wherein after the generating of the image annotation file according to the mapping relationship, which is implemented when the processor executes the computer-readable instruction, further comprises:
    接收终端发送的查询请求,提取所述查询请求中携带的查询标识;Receiving the query request sent by the terminal, and extracting the query identifier carried in the query request;
    提取所述影像标注文件对应的影像标识;Extracting the image identifier corresponding to the image annotation file;
    计算所述查询标识与各所述影像标识的匹配度;及Calculating the degree of matching between the query identifier and each of the image identifiers; and
    将所述匹配度最大的所述影像标识关联的所述影像标注文件推送至终端。Push the image annotation file associated with the image identifier with the greatest degree of matching to the terminal.
  13. 根据权利要求8所述的计算机设备,其中,所述处理器执行所述计算机可读指令时所实现的所述生成医疗影像识别模型之后,还包括:8. The computer device according to claim 8, wherein after the generating of the medical image recognition model realized when the processor executes the computer-readable instruction, the method further comprises:
    接收终端发送的医疗影像;Receive medical images sent by the terminal;
    将所述医疗影像输入所述医疗影像识别模型,以通过所述医疗影像识别模型提取所述医疗影像对应的所述影像特征,根据所述医疗影像特征得到所述影像标注信息;及Inputting the medical image into the medical image recognition model to extract the image feature corresponding to the medical image through the medical image recognition model, and obtain the image annotation information according to the medical image feature; and
    将影像标注信息推送至所述终端。Push the image annotation information to the terminal.
  14. 一个或多个存储有计算机可读指令的计算机可读存储介质,所述计算机可读指令被一个或多个处理器执行时,使得所述一个或多个处理器执行以下步骤:One or more computer-readable storage media storing computer-readable instructions, which when executed by one or more processors, cause the one or more processors to perform the following steps:
    获取医疗数据;Access to medical data;
    提取所述医疗数据中的医疗影像,对所述医疗影像进行特征提取得到影像特征;Extracting medical images in the medical data, and performing feature extraction on the medical images to obtain image features;
    提取所述医疗数据中的医疗文件,对所述医疗文件进行解析生成标注信息,查找所述标注信息中与所述医疗影像对应的影像标注信息;Extracting medical documents in the medical data, analyzing the medical documents to generate annotation information, and searching for image annotation information corresponding to the medical image in the annotation information;
    根据所述影像特征以及所述影像标注信息建立映射关系,根据所述映射关系生成影像标注文件;及Establishing a mapping relationship according to the image characteristics and the image annotation information, and generating an image annotation file according to the mapping relationship; and
    将所述影像标注文件输入机器学习模型,以通过机器学习模型学习所述影像标注文件中所述影像特征与所述影像标注信息之间的特征关系,根据所述特征关系生成医疗影像识别模型。The image annotation file is input into a machine learning model to learn the feature relationship between the image features in the image annotation file and the image annotation information through the machine learning model, and a medical image recognition model is generated according to the feature relationship.
  15. 根据权利要求14所述的存储介质,其中,所述计算机可读指令被所述处理器执行时所实现的所述对所述医疗影像进行特征提取得到影像特征,包括:The storage medium according to claim 14, wherein the feature extraction of the medical image to obtain the image feature, which is realized when the computer-readable instruction is executed by the processor, comprises:
    提取所述医疗影像中的标注信息;Extract the annotation information in the medical image;
    获取所述标注信息对应的位置坐标;Acquiring position coordinates corresponding to the label information;
    从医疗影像中提取与所述位置坐标对应的区域医疗影像;及Extracting regional medical images corresponding to the position coordinates from the medical images; and
    利用图像特征提取算法提取所述区域医疗影像的影像特征。The image feature extraction algorithm is used to extract the image feature of the regional medical image.
  16. 根据权利要求14所述的存储介质,其中,所述计算机可读指令被所述处理器执行时所实现的所述对所述医疗文件进行解析生成标注信息之后,还包括:The storage medium according to claim 14, wherein, after the parsing of the medical file to generate annotation information realized when the computer-readable instructions are executed by the processor, the method further comprises:
    将所述标注信息按照预设的规则分割为单元标注信息;及Dividing the label information into unit label information according to preset rules; and
    将所述单元标注信息与标注信息数据库中的信息进行匹配,将匹配度最大的信息提取为标注信息。The unit labeling information is matched with the information in the labeling information database, and the information with the largest matching degree is extracted as labeling information.
  17. 根据权利要求14所述的存储介质,其中,所述计算机可读指令被所述处理器执行时所实现的所述根据所述影像特征以及所述影像标注信息建立映射关系,根据所述映射关系生成影像标注文件,包括:The storage medium according to claim 14, wherein the establishment of a mapping relationship based on the image feature and the image annotation information, which is implemented when the computer-readable instruction is executed by the processor, is based on the mapping relationship Generate image annotation files, including:
    获取所述医疗影像对应的影像标识;Acquiring an image identifier corresponding to the medical image;
    查找所述影像标识对应的影像解析方法;Searching for an image analysis method corresponding to the image identifier;
    利用所述影像解析方法对所述医疗影像关联的属性信息进行解析,生成影像标签信息;及Using the image analysis method to analyze the attribute information associated with the medical image to generate image tag information; and
    根据所述影像特征、所述影像标注信息以及所述影像标签信息建立映射关系,根据映射关系生成影像标注文件。A mapping relationship is established according to the image feature, the image annotation information, and the image tag information, and an image annotation file is generated according to the mapping relationship.
  18. 根据权利要求14或17所述的存储介质,其中,所述计算机可读指令被所述处理器执行时所实现的所述根据所述映射关系生成影像标注文件之后,还包括:The storage medium according to claim 14 or 17, wherein, after the generating of the image annotation file according to the mapping relationship, which is realized when the computer-readable instruction is executed by the processor, further comprises:
    接收终端发送的查询请求,提取所述查询请求中携带的查询标识;Receiving the query request sent by the terminal, and extracting the query identifier carried in the query request;
    提取所述影像标注文件对应的影像标识;Extracting the image identifier corresponding to the image annotation file;
    计算所述查询标识与各所述影像标识的匹配度;及Calculating the degree of matching between the query identifier and each of the image identifiers; and
    将所述匹配度最大的所述影像标识关联的所述影像标注文件推送至终端。Push the image annotation file associated with the image identifier with the greatest degree of matching to the terminal.
  19. 根据权利要求14所述的存储介质,其中,所述计算机可读指令被所述处理器执行时所实现的所述生成医疗影像识别模型之后,还包括:14. The storage medium according to claim 14, wherein after the generating of the medical image recognition model realized when the computer-readable instructions are executed by the processor, the method further comprises:
    接收终端发送的医疗影像;Receive medical images sent by the terminal;
    将所述医疗影像输入所述医疗影像识别模型,以通过所述医疗影像识别模型提取所述医疗影像对应的所述影像特征,根据所述医疗影像特征得到所述影像标注信息;及Inputting the medical image into the medical image recognition model to extract the image feature corresponding to the medical image through the medical image recognition model, and obtain the image annotation information according to the medical image feature; and
    将影像标注信息推送至所述终端。Push the image annotation information to the terminal.
PCT/CN2020/097964 2020-02-28 2020-06-24 Method and apparatus for generating medical image recognition model, computer device and medium WO2021169101A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202010128465.0 2020-02-28
CN202010128465.0A CN111462865A (en) 2020-02-28 2020-02-28 Medical image recognition model generation method and device, computer equipment and medium

Publications (1)

Publication Number Publication Date
WO2021169101A1 true WO2021169101A1 (en) 2021-09-02

Family

ID=71684198

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2020/097964 WO2021169101A1 (en) 2020-02-28 2020-06-24 Method and apparatus for generating medical image recognition model, computer device and medium

Country Status (2)

Country Link
CN (1) CN111462865A (en)
WO (1) WO2021169101A1 (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114067964A (en) * 2021-11-22 2022-02-18 武汉联影医疗科技有限公司 Medical image data processing method and device, computer equipment and storage medium

Families Citing this family (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111816284B (en) * 2020-09-04 2020-12-25 平安国际智慧城市科技股份有限公司 Batch generation method, device, equipment and storage medium of medical test data
CN113192607B (en) * 2021-04-13 2024-03-26 复旦大学附属中山医院 Labeling processing method, labeling processing device, computer equipment and storage medium
CN113744845A (en) * 2021-09-17 2021-12-03 平安好医投资管理有限公司 Medical image processing method, device, equipment and medium based on artificial intelligence
CN114095757A (en) * 2021-11-17 2022-02-25 南通市肿瘤医院 Self-learning image transmission system for hospital radiology department based on cloud

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20110022615A1 (en) * 2009-07-21 2011-01-27 National Taiwan University Digital data processing method for personalized information retrieval and computer readable storage medium and information retrieval system thereof
CN110503151A (en) * 2019-08-26 2019-11-26 北京推想科技有限公司 A kind of processing method and system of image
CN110737714A (en) * 2019-10-18 2020-01-31 西安电子科技大学 Computer vision-based electronic competition data visualization system and method
CN110796180A (en) * 2019-10-12 2020-02-14 吉林大学 Model training system and method based on artificial intelligence

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109063740A (en) * 2018-07-05 2018-12-21 高镜尧 The detection model of ultrasonic image common-denominator target constructs and detection method, device
CN110598782B (en) * 2019-09-06 2020-07-07 上海杏脉信息科技有限公司 Method and device for training classification network for medical image

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20110022615A1 (en) * 2009-07-21 2011-01-27 National Taiwan University Digital data processing method for personalized information retrieval and computer readable storage medium and information retrieval system thereof
CN110503151A (en) * 2019-08-26 2019-11-26 北京推想科技有限公司 A kind of processing method and system of image
CN110796180A (en) * 2019-10-12 2020-02-14 吉林大学 Model training system and method based on artificial intelligence
CN110737714A (en) * 2019-10-18 2020-01-31 西安电子科技大学 Computer vision-based electronic competition data visualization system and method

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114067964A (en) * 2021-11-22 2022-02-18 武汉联影医疗科技有限公司 Medical image data processing method and device, computer equipment and storage medium

Also Published As

Publication number Publication date
CN111462865A (en) 2020-07-28

Similar Documents

Publication Publication Date Title
WO2021169101A1 (en) Method and apparatus for generating medical image recognition model, computer device and medium
WO2021068321A1 (en) Information pushing method and apparatus based on human-computer interaction, and computer device
WO2021184567A1 (en) Electronic health record query method and apparatus, computer device, and storage medium
CN108899064A (en) Electronic health record generation method, device, computer equipment and storage medium
CN112015917A (en) Data processing method and device based on knowledge graph and computer equipment
WO2020177365A1 (en) Data mining-based social insurance data processing method and apparatus, and computer device
CN112800766A (en) Chinese medical entity identification and labeling method and system based on active learning
CN110580942A (en) novel physical examination report generation method, device, medium and terminal equipment
CN113111162A (en) Department recommendation method and device, electronic equipment and storage medium
CN112530550A (en) Image report generation method and device, computer equipment and storage medium
CN114358001A (en) Method for standardizing diagnosis result, and related device, equipment and storage medium thereof
CN113435200A (en) Entity recognition model training and electronic medical record processing method, system and equipment
CN113903422A (en) Medical image diagnosis report entity extraction method, device and equipment
CN109065015B (en) Data acquisition method, device and equipment and readable storage medium
CN113111660A (en) Data processing method, device, equipment and storage medium
CN116469505A (en) Data processing method, device, computer equipment and readable storage medium
CN115859984B (en) Medical named entity recognition model training method, device, equipment and medium
CN112037904A (en) Online diagnosis and treatment data processing method and device, computer equipment and storage medium
Nagarajan et al. Content-based medical image annotation and retrieval using perceptual hashing algorithm
CN116719840A (en) Medical information pushing method based on post-medical-record structured processing
CN115662562A (en) Medical record diagnosis and treatment data management method, device, equipment and storage medium
CN111724873B (en) Data processing method and device
Li et al. Extrinsic factors affecting the accuracy of biomedical NER
CN113343680B (en) Structured information extraction method based on multi-type medical record text
Shivamurthy Procedures Design and Development of Framework for Content Based Image Retrieval

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 20921773

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

32PN Ep: public notification in the ep bulletin as address of the adressee cannot be established

Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM 1205A DATED 16/01/2023)

122 Ep: pct application non-entry in european phase

Ref document number: 20921773

Country of ref document: EP

Kind code of ref document: A1