WO2020168647A1 - Image recognition method and related device - Google Patents

Image recognition method and related device Download PDF

Info

Publication number
WO2020168647A1
WO2020168647A1 PCT/CN2019/088825 CN2019088825W WO2020168647A1 WO 2020168647 A1 WO2020168647 A1 WO 2020168647A1 CN 2019088825 W CN2019088825 W CN 2019088825W WO 2020168647 A1 WO2020168647 A1 WO 2020168647A1
Authority
WO
WIPO (PCT)
Prior art keywords
image
images
neural network
nodule
probability map
Prior art date
Application number
PCT/CN2019/088825
Other languages
French (fr)
Chinese (zh)
Inventor
王健宗
肖京
Original Assignee
平安科技(深圳)有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 平安科技(深圳)有限公司 filed Critical 平安科技(深圳)有限公司
Publication of WO2020168647A1 publication Critical patent/WO2020168647A1/en

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/24Classification techniques
    • G06F18/241Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches
    • G06F18/2415Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches based on parametric or probabilistic models, e.g. based on likelihood ratio or false acceptance rate versus a false rejection rate
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/40Extraction of image or video features
    • G06V10/44Local feature extraction by analysis of parts of the pattern, e.g. by detecting edges, contours, loops, corners, strokes or intersections; Connectivity analysis, e.g. of connected components
    • G06V10/457Local feature extraction by analysis of parts of the pattern, e.g. by detecting edges, contours, loops, corners, strokes or intersections; Connectivity analysis, e.g. of connected components by analysing connectivity, e.g. edge linking, connected component analysis or slices
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16HHEALTHCARE INFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR THE HANDLING OR PROCESSING OF MEDICAL OR HEALTHCARE DATA
    • G16H50/00ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics
    • G16H50/20ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics for computer-aided diagnosis, e.g. based on medical expert systems
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V2201/00Indexing scheme relating to image or video recognition or understanding
    • G06V2201/03Recognition of patterns in medical or anatomical images

Definitions

  • This application relates to the field of data processing technology, and mainly relates to an image recognition method and related equipment.
  • Lung cancer is one of the malignant tumors with the fastest increase in morbidity and mortality and the greatest threat to the health and life of the population. In the past 50 years, many countries have reported that the incidence and mortality of lung cancer have increased significantly. Traditional lung cancer screening relies on professional medical personnel to interpret lung images and screen out suspicious lung nodules. This is a problem for medical personnel. The workload is extremely demanding and false positive diagnosis is prone to occur. Therefore, how to improve the accuracy of image recognition of lung cancer lesions is a technical problem to be solved by those skilled in the art.
  • the embodiments of the present application provide an image recognition method and related equipment, which can recognize the lung cancer prevalence probability of a patient through lung scan images, and improve the accuracy of image recognition of lung cancer lesions.
  • an embodiment of the present application provides an image recognition method, wherein:
  • Each nodular unit in the multiple nodular units is input to a third neural network to obtain a third category probability map for the nodule type of each nodular unit in the multiple nodular units, so
  • the nodule types include benign nodules and malignant nodules, and the third neural network is used to identify the nodule type of each nodular unit of the plurality of nodular units;
  • the second category probability map and the third category probability map are input to a fourth neural network to obtain the lung cancer probability of the target patient corresponding to the target lung scan image, and the fourth neural network is used for Classify the second category probability map and the third category probability map.
  • an embodiment of the present application provides an image recognition device, wherein:
  • the first processing unit is configured to input the scanned image of the target lung to a first neural network to obtain a first category probability map for nodules and no nodules, and the first neural network is used to identify the target lung Nodules in the scanned images;
  • the second processing unit is configured to input the first category probability map to a second neural network to obtain a second category probability map for benign nodules, malignant nodules and no nodules, and the second neural network uses To identify the nodule type of the nodule image in the first category probability map;
  • the third processing unit is configured to extract nodule units in the scan image of the target lung according to the first category probability map to obtain a plurality of nodule units; respectively, each of the multiple nodule units
  • the nodule unit is input to the third neural network to obtain a third category probability map for the nodule type of each nodule unit in the plurality of nodule units, where the nodule types include benign nodules and malignant nodules,
  • the third neural network is used to identify the nodule type of each nodular unit in the multiple nodular units;
  • the fourth processing unit is used to input the second category probability map and the third category probability map to a fourth neural network to obtain the lung cancer probability of the target patient corresponding to the target lung scan image, so The fourth neural network is used to classify the second category probability map and the third category probability map.
  • embodiments of the present application provide an electronic device, including a processor, a memory, a communication interface, and one or more programs, wherein the one or more programs are stored in the memory and configured to be processed by the above
  • the program includes instructions for some or all of the steps described in the first aspect.
  • an embodiment of the present application provides a computer-readable storage medium, wherein the computer-readable storage medium stores a computer program, wherein the computer program causes the computer to execute the Some or all of the steps described.
  • the electronic device After adopting the above-mentioned image recognition method and related equipment, the electronic device first recognizes the nodule image of the lung scan image, and then determines the lung cancer probability through the locally recognized nodule type and the globally recognized nodule type, which improves the lung cancer focus Accuracy of image recognition of parts.
  • FIG. 1 is a schematic flowchart of an image recognition method provided by an embodiment of this application.
  • FIG. 2 is a schematic structural diagram of an image recognition device provided by an embodiment of this application.
  • FIG. 3 is a schematic structural diagram of an electronic device provided by an embodiment of the application.
  • an embodiment of the present application provides a schematic flowchart of an image recognition method.
  • the image recognition method is applied to electronic equipment.
  • the electronic devices involved in the embodiments of the present application may include various handheld devices with wireless communication functions, wearable devices, computing devices or other processing devices connected to a wireless modem, as well as various forms of user equipment (UE). ), mobile station (mobile station, MS), terminal device (terminal device), etc.
  • UE user equipment
  • MS mobile station
  • terminal device terminal device
  • an image recognition method is applied to an electronic device, where:
  • S101 Input the scanned image of the target lung to the first neural network to obtain the first category probability map for nodules and no nodules.
  • the target lung scan image is an image obtained by a patient’s lung computed tomography (CT) in a hospital.
  • CT computed tomography
  • the patient can be placed in the supine position with the head advanced.
  • the acquisition spiral scanning method scans from the tip of the lung to the bottom of the lung.
  • the thickness of the acquisition layer is less than or equal to 1 mm.
  • the spacing is 5 to 7 mm, the width of the mediastinal window is 300 to 500 HU, and the window level is 30 to 50 HU; the width of the lung window is 800 to 1500 HU, and the window level is -600 to 800 HU.
  • HU is the unit of CT value, also known as the Heinz unit, used to express the relative density of the tissue structure on the CT image.
  • the method before inputting the scanned image of the target lung to the first neural network to obtain the first category probability map for nodules and no nodules, the method further includes: obtaining Multiple lung scan images to be recognized; perform morphological denoising on each lung scan image in the multiple lung scan images to obtain multiple first processed images; Each first processed image undergoes pixel normalization processing to obtain multiple second processed images; according to the scan sequence and preset size of the multiple lung scan images, the multiple second processed images are stereoscopically stacked To obtain a scanned image of the target lung.
  • the multiple lung scan images are planar scan images, and the pixel value range is (-1024, 3071), which corresponds to the radiodensity unit of the houns field.
  • Morphology operation is an image processing method developed for binary images based on the set theory method of Mathematical Morphology. It can be understood that the lung scan images inevitably have noise, for example, the original CT includes clothing, medical equipment, etc., which is not limited here.
  • denoising processing is performed based on morphology, which can remove noise in the lung scan image, which facilitates the improvement of the recognition efficiency and accuracy of image recognition.
  • the morphological denoising is performed on each lung scan image in the multiple lung scan images to Obtaining multiple first processed images includes: performing an expansion operation on the target first processed image to obtain a first vector; performing an erosion operation on the target first processed image to obtain a second vector; Merge with the second vector to obtain a first processed image corresponding to the target first processed image.
  • the dilation operation is to expand the highlighted part of the image, similar to the field expansion, the effect picture has a larger highlight area than the original image.
  • the Erosion operation is that the highlight part of the original image is corroded, similar to the area being eroded, the effect image has a smaller highlight area than the original image.
  • the expansion operation and the erosion operation are to convolve the image with the kernel, which can be of any shape and size. It can be understood that taking the target first processed image in the multiple first processed images as an example, the expansion operation and the erosion operation are performed on the target first processed image respectively, and then vector addition is used to merge the two sets to obtain the denoising First process the image. In this way, noise in the lung scan image can be removed, which is convenient for improving the recognition efficiency and accuracy of image recognition.
  • the performing morphological denoising on each lung scan image in the multiple lung scan images to obtain multiple first processed images includes: denoising the multiple lung scan images.
  • Each lung scan image in the scanned image is preprocessed to obtain multiple fourth processed images; each fourth processed image in the multiple fourth processed images is morphologically denoised to obtain the multiple first processed images Process images.
  • preprocessing may include image format conversion processing, image deletion filling processing, average subtraction, normalization, principal component analysis (PCA), whiten, and so on.
  • PCA principal component analysis
  • the fourth processed image obtained by preprocessing the lung scan image can further improve the recognition efficiency and accuracy of image recognition.
  • This application does not limit the preset size, which can be 512*512*512, and the real aspect ratio can be maintained as much as possible. It can be understood that the morphological denoising of lung scan images obtained by multiple scans is performed to obtain multiple first processed images with noise removed, which is convenient to improve the recognition efficiency and accuracy of image recognition. Then perform pixel normalization processing on each of the first processed images to obtain multiple second processed images whose pixel values are normalized to the range (0,1), which can eliminate the dimensional influence between the indicators to improve the data indicators. Comparability between. Then, the multiple second processed images are stereo-stacked according to the scanning sequence and the preset size of the multiple lung scan images to obtain a three-dimensional target lung scan image. In this way, it is convenient to meet the processing requirements of the neural network, and it is convenient to improve the recognition efficiency and accuracy of image recognition.
  • the first neural network is used to identify the nodule image in the target lung scan image, that is, input the first neural network to obtain the first category probability map for nodules and no nodules.
  • step S101 the training of the first neural network is completed, and the training method thereof is not limited in this application.
  • the method further includes: dividing each marked image in the plurality of marked images to obtain a plurality of first images; from each of the plurality of first images Extracting a second threshold of the uniform grid images to obtain multiple second images; performing size processing on each of the multiple second images to obtain multiple third images; according to the multiple marks Obtain the reference nodule position corresponding to each third image in the multiple third images according to the nodule marking information included in each marked image in the image; according to the multiple third images and the multiple third images Training the first initial neural network to obtain the first network parameters of the first neural network according to the reference nodule position corresponding to each third image in the third image; according to the first initial neural network and the first neural network The network parameter obtains the first neural network.
  • each marked image includes nodule marking information, using the aforementioned scanning method and processing method, and each marked image is manually marked. For example: three radiologists or a designated number of radiologists agree on the number, location, size or type of nodules in each first image and other nodule marking information.
  • Each first image includes a plurality of uniform grid images, and the size of each uniform network image is a first threshold.
  • This application does not limit the first threshold, which can be 16*16*16. That is, each marker image is divided into regions, so that the size of each uniform grid image in the first image obtained after the region division is the first threshold.
  • Each second image includes a second threshold of uniform grid images.
  • This application does not limit the second threshold, which can be 128, that is, only a specified number of uniform grids in each first image are extracted Images, in this way, can improve computing efficiency.
  • the first initial neural network is the first neural network without network parameters defined, and the size of each third image meets the input size defined by the first initial neural network.
  • This application also does not limit the size processing method of the second image, which can be zero-filled; it can also copy a uniform grid image with nodules to maintain class balance; it can also use 3D convolution to merge and use (1*1 *1) Convolution replaces the global average merge operation to obtain an image that meets the size of the training image.
  • This application does not limit the size of the third image, which can be 32*32*32. It can be understood that due to the smaller input size, the calculation efficiency can be improved.
  • each marked image includes nodule marking information
  • the third image is a processed image corresponding to the marked image.
  • the reference nodule position corresponding to the third image can be obtained according to the nodule marking information of the marked image, that is, the position to be The reference nodule position of the training image.
  • the performing size processing on each of the plurality of second images to obtain a plurality of third images includes: extracting nodules in the plurality of second images To obtain a plurality of fourth images; and perform copy processing on the fourth image of each second image in the plurality of second images to obtain the plurality of third images.
  • the fourth image is a uniform network image with nodules.
  • This application does not limit the method for extracting uniform grid images with nodules.
  • the multiple second images include The target second image, where the target second image corresponds to multiple target second uniform network images
  • the method further includes: dividing the multiple target second uniform grid images to obtain multiple uniform grid images Set; superimpose the nodule probability corresponding to each uniform network image set in the plurality of uniform network image sets to obtain multiple superposition values; superimpose corresponding to each uniform network image set in the plurality of uniform network image sets Perform an averaging operation on the values to obtain a plurality of average values; extract the uniform grid images in the uniform grid image set corresponding to the average value of the plurality of averages greater than the third threshold to obtain the plurality of fourth images.
  • the method of dividing the uniform grid image set can be randomly assigned, for example, scanning to 10 uniform grid images is divided into a group.
  • This application does not limit the third threshold, which can be 0.5. It can be understood that the second uniform grid images of multiple targets are collected to obtain a uniform grid image set, and the nodule probabilities corresponding to each uniform grid image set are superimposed to obtain multiple superimposed values, and for each uniform grid The superimposed values corresponding to the grid image set are averaged to obtain multiple average values. If the average value is greater than the third threshold, it is determined that nodules exist in each uniform grid image in the uniform grid image set corresponding to the average value. In this way, determining whether a nodule is included in the image set can improve the efficiency of extracting the fourth image.
  • This application does not limit the training process of the first initial neural network.
  • Batch Gradient Descent (BGD), Stochastic Gradient Descent (SGD), or mini-batch gradient descent (mini-batch) algorithm can be used. SGD) and so on for training.
  • a training cycle is completed by a single forward operation and reverse gradient propagation, that is, the image to be trained is forwardly input to the neural network to be trained to obtain the output target object. If the target object fails to match the reference object, the target object Obtain the loss function from the reference object, and then input the loss function back to the neural network to adjust the network parameters of the neural network, such as weight and bias. Then, input the next image to be trained until the matching is successful or the training of all images is completed.
  • the reference object is the reference nodule position
  • the target object is the target nodule position.
  • the first initial neural network is trained based on the multiple third images and the reference nodule position corresponding to each third image in the multiple third images
  • To obtain the first network parameters of the first neural network including: dividing the multiple third images according to a preset ratio to obtain multiple first training images and multiple first verification images; Classify the first initial neural network to obtain the to-be-verified network parameters of the first neural network according to the reference nodule position corresponding to each first training image in the first training images; The image verifies the network parameter to be verified to obtain the first network parameter.
  • This application does not limit the preset ratio, which can be 7:3.
  • This application does not limit the classification algorithm.
  • Logistic regression or decision tree algorithm can be used to classify the image features and reference nodule positions corresponding to the multiple first training images, so as to obtain the network parameters to be verified of the first neural network. .
  • the verification process is used to train the neural network to be verified for which the network parameters have been obtained according to the multiple first verification images to obtain the first network parameters of the first neural network.
  • the test image can be input, that is, S101 is executed.
  • the plurality of third images are divided according to a preset ratio to obtain a plurality of first training images and a plurality of first verification images, and then the first initial neural network is analyzed according to the plurality of first training images.
  • the network is classified to obtain the network parameters to be verified of the first neural network, and finally the network parameters to be verified of the first neural network are verified according to the multiple first verification images to obtain the first neural network.
  • Network parameters In this way, the batch gradient descent algorithm is used for training and verification, which improves the training speed of the first neural network.
  • the training parameters of the first initial neural network of this application are also not limited. For example, 24 small batches of 10,000 iterations are used for training, the learning rate is 0.01, and the weight attenuation is 0.0001.
  • a linear rectification (Rectified Linear Units, Relu) function is used as an activation function (Activation function).
  • a weighted cross entropy function is used as the loss function. In this way, a strong class imbalance can be avoided. In addition, the loss can be balanced by the weight of each batch and applied to the weaker category.
  • This application does not limit the probability map of the first category, and may be a density histogram used to describe the nodule probability of each uniform grid image.
  • each marked image is divided into regions to obtain multiple first images with the same grid size, and then a specified number of uniform grid images are extracted to obtain multiple second images. In this way, the computational efficiency is improved.
  • the multiple second images are further subjected to size processing to obtain multiple third images.
  • the reference nodule position corresponding to each third image is obtained according to the nodule marking information of each marked image.
  • the first initial neural network is trained according to the multiple third images and the position of the reference nodule corresponding to each third image to obtain the first network parameters of the first neural network, so that according to the first initial neural network and the first network Parameter acquisition of the first neural network. In this way, the training speed of the first neural network is improved.
  • S102 Input the probability map of the first category to the second neural network to obtain a probability map of the second category for benign nodules, malignant nodules and no nodules.
  • the second neural network is used to identify the nodule type of the nodule image, that is, to further identify the nodule type of the nodule image in the first category probability map, and input the first category probability map to the first category probability map.
  • the second category probability map for benign nodules, malignant nodules and no nodules can be obtained. It can be understood that directly inputting the probability map of the first category to the second neural network can save the time for identifying no nodules and improve the efficiency of recognition.
  • This application does not limit the probability map of the second category, and may be a density histogram used to describe the nodule type probability of each uniform grid image.
  • This application does not limit the labeling method of the target nodule type. All nodules of patients with cancer can be marked as malignant, and all nodules of non-cancer patients can be marked as benign.
  • the diagnosis time of cancer is 1 year. That is, the nodules in the scan pictures diagnosed with cancer within 1 year are all marked as malignant.
  • the second neural network is trained.
  • the training method can refer to the training method of the first neural network, which will not be repeated here.
  • the reference object is the reference nodule type
  • the target object is the target nodule. Types of.
  • This application also does not limit the training parameters of the second neural network.
  • the training phase performs 20,000 iterations with a learning rate of 0.01
  • the verification phase performs 30,000 iterations with a learning rate of 0.001.
  • S103 Extract nodule units in the scan image of the target lung according to the first category probability map to obtain multiple nodule units.
  • a nodule unit is a unit that is identified as a unit in the first category probability map. If the uniform grid image intersects the bounding box of the nodule, the uniform grid image can be determined to be a nodule unit.
  • S104 Input each nodular unit of the multiple nodular units to a third neural network to obtain a third category probability map for the nodule type of each nodular unit of the multiple nodular units ,
  • the nodule types include benign nodules and malignant nodules.
  • the third neural network is used to separately identify the nodule type of each nodule unit, that is, to further identify the nodule type of each nodule unit corresponding to the first category probability map, and the multiple nodules
  • the probability that each nodular unit is a benign nodule or a malignant nodule can be determined. It can be understood that directly inputting multiple nodule images extracted from the first category probability map to the third neural network can improve the accuracy of identifying the nodule type.
  • This application does not limit the probability map of the third category, and may be a density histogram to describe the nodule type probability of each nodular unit.
  • the marking information of each first image in the first image set further includes a target nodule type
  • the method further includes: performing an operation on each fourth image in the plurality of fourth images. Perform data enhancement to obtain multiple fifth images; obtain the reference nodule type corresponding to each fifth image in the multiple fifth images according to the nodule marking information included in each marked image in the multiple marked images ; According to the multiple fifth images and the reference nodule type corresponding to each fifth image in the multiple fifth images, the second initial neural network is trained to obtain the second network parameters of the third neural network .
  • the data enhancement is performed on each fourth image in the plurality of fourth images to obtain a plurality of fifth images , Including: performing rotation processing on the mask corresponding to the target fourth image according to a first angle to obtain a first sub-processed image; subtracting an average value from the first sub-processed image to obtain a second sub-processing Image; according to the first multiple, size processing of the width of the mask corresponding to the second sub-processed image to obtain a third sub-processed image; according to the second multiple, the size of the mask corresponding to the third sub-processed image The length is subjected to size processing to obtain the fourth sub-processed image; according to the third multiple, the fourth sub-processed image is subjected to size processing to obtain the fifth sub-processed image; according to the second angle,
  • the first angle may be less than or equal to 270 degrees
  • the first multiple may be 0.9 or 1.1
  • the second The multiple can be 0.9 or 1.1
  • the third multiple can be 0.8 or 1.2
  • the second angle can be less than or equal to 270 degrees.
  • the display object can be rotated by setting the rotation property, that is, setting this property to a number (0-360), in degrees, which represents the amount of rotation applied to the object.
  • the above-mentioned multiple processing steps are performed, that is, the target third image is rotated , Subtract the average value, size, and mirror inversion processing, so that the fifth image corresponding to the target third image undergoes data enhancement processing.
  • the definition of the image is improved, which is convenient for improving the recognition efficiency of the second neural network.
  • the second initial neural network is the third neural network without defining network parameters.
  • the training method of the third neural network can refer to the training method of the first neural network, wherein the reference object is the reference nodule type, and the target object is the target nodule type. That is, input multiple fifth images to the neural network to be trained or to be verified to obtain the target nodule type in each fifth image. If the target nodule type fails to match the previously marked reference nodule type, then The target nodule type and the reference nodule type obtain a loss function, and update the network parameters of the neural network according to the loss function.
  • This application also does not limit the training parameters of the third neural network.
  • the batch size is 32
  • the Adam optimizer is used for 6000 iterations
  • the learning rate is 0.01
  • the weight attenuation is 0.0001.
  • the second initial neural network is trained to obtain the second network parameters of the third neural network, and the second initial neural network
  • the network is the third neural network with no network parameters defined. In this way, the training efficiency of the third neural network is improved.
  • the training image of the third neural network can be a batch of images different from the training image of the first neural network, and the processing method before training can refer to the method of the training image of the first neural network, which will not be repeated here. .
  • S105 Input the second category probability map and the third category probability map to a fourth neural network to obtain the lung cancer probability of the target patient corresponding to the target lung scan image.
  • the fourth neural network is used to classify the second category probability map and the third category probability map. That is to say, classify the globally recognized nodule type obtained by the second neural network and the locally recognized nodule type obtained by the third neural network to obtain the lung cancer probability of the target patient corresponding to the target lung scan image, namely
  • the probability that the target patient corresponding to the target lung scan image has lung cancer can be determined. It can be understood that the probability of lung cancer is determined by the recognition results of locally recognizing nodule types and global recognizing nodule types, which further improves the accuracy of recognizing lung cancer.
  • the training method of the fourth neural network can refer to the training method of the first neural network, wherein the reference object is the reference lung cancer probability, and the target object is the target lung cancer probability.
  • This application also does not limit the training parameters of the fourth neural network. For example, all data is used as a batch, and the Adam optimizer is used for 2000 iterations, and the weight is attenuated to 0.0001.
  • the second category probability map and the third category probability map are input to a fourth neural network to obtain the lung cancer patients of the target patient corresponding to the target lung scan image.
  • the disease probability includes: performing data enhancement on the second category probability map and the third category probability map to obtain the target second category probability map and the target third category probability map; and the target second category probability map And the target third category probability map is input to the fourth neural network to obtain the lung cancer probability.
  • the data enhancement can perform volume transposition enhancement or tailoring, and can also refer to the data enhancement operation of the third neural network, which is not limited here. It can be understood that through the data enhancement operation, the clarity of the image is improved, which facilitates the improvement of the recognition efficiency of the fourth neural network.
  • the second category probability map and the third category probability map are input to a fourth neural network to obtain the lung cancer disease of the target patient corresponding to the target lung scan image
  • the probability includes: performing feature weighting on the second category probability map and the third category probability map to obtain a fourth category probability map for the nodule type of each nodular unit in the plurality of nodular units ; Input the fourth category probability map to a fourth neural network to obtain the lung cancer probability.
  • This application does not limit the probability map of the fourth category, and may be a density histogram to describe the nodule type probability of each uniform grid image.
  • This application can calculate the second neural network and the third neural network based on the number, minimum, maximum, average, standard deviation, and integration of all maximum outputs in the second category probability map and the third category probability map Then, the feature weights are performed according to their weights.
  • the recognition results of the nodule types in the locally and globally determined target lung scan images are feature-weighted to obtain the fourth category probability map, and then the lung cancer patients are determined for the nodule types of each nodule in the fourth category probability map.
  • the disease probability improves the accuracy of identifying lung cancer.
  • the nodule image of the lung scan image is first recognized, and then the locally recognized nodule type and the globally recognized nodule type are used to determine the probability of lung cancer, which improves the location of the lung cancer lesion. The accuracy of image recognition.
  • FIG. 2 is a schematic structural diagram of an image recognition device provided by an embodiment of the present application, and the device is applied to electronic equipment. As shown in FIG. 2, the above-mentioned image recognition device 200 includes:
  • the first processing unit 201 is configured to input a scanned image of the target lung to a first neural network to obtain a first category probability map for nodules and no nodules, and the first neural network is used to identify the target Nodules in the lung scan image;
  • the second processing unit 202 is configured to input the first category probability map to a second neural network to obtain a second category probability map for benign nodules, malignant nodules and no nodules, the second neural network Used to identify the nodule type of the nodule image in the first category probability map;
  • the third processing unit 203 is configured to extract a nodule unit in the target lung scan image according to the first category probability map to obtain a plurality of nodule units; each of the multiple nodule units The nodule unit is input to the third neural network to obtain a third category probability map for the nodule type of each nodule unit in the plurality of nodule units, and the nodule types include benign nodules and malignant nodules , The third neural network is used to identify the nodule type of each nodular unit in the plurality of nodular units;
  • the fourth processing unit 204 is configured to input the second category probability map and the third category probability map to a fourth neural network to obtain the lung cancer probability of the target patient corresponding to the target lung scan image,
  • the fourth neural network is used to classify the second category probability map and the third category probability map.
  • the image recognition device first recognizes the nodule image of the lung scan image, and then determines the probability of lung cancer through the locally recognized nodule types and the globally recognized nodule types, which improves the accuracy of image recognition of lung cancer lesions. .
  • the device 200 further includes:
  • the preprocessing unit 205 is configured to obtain multiple lung scan images to be recognized; perform morphological denoising on each lung scan image in the multiple lung scan images to obtain multiple first processed images; Each of the plurality of first processed images is subjected to pixel normalization processing to obtain a plurality of second processed images; according to the scan sequence and preset size of the plurality of lung scan images, all The multiple second processed images are three-dimensionally stacked to obtain the target lung scan image.
  • the preprocessing unit 205 is further configured to divide each marked image in the multiple marked images to obtain multiple first images, and each first image includes multiple uniform grids.
  • Image the size of each uniform grid image is a first threshold, and each marked image includes nodule marking information; extracting a second threshold of the uniform grid images from each first image in the plurality of first images , In order to obtain multiple second images; the size of each second image in the multiple second images is processed to obtain multiple third images, the size of each third image meets the definition of the first initial neural network Input size, the first initial neural network is the first neural network without defined network parameters; according to the nodule marking information included in each marked image in the multiple marked images, the multiple third images are acquired The position of the reference nodule corresponding to each third image in the image; the device 200 further includes:
  • the training unit 206 is configured to train the first initial neural network according to the multiple third images and the reference nodule position corresponding to each third image in the multiple third images to obtain the The first network parameter of the first neural network; the first neural network is obtained according to the first initial neural network and the first network parameter.
  • the preprocessing unit 205 is specifically configured to extract the plurality of third images. There are uniform grid images of nodules in the second images to obtain multiple fourth images; copy processing is performed on the fourth image of each second image in the multiple second images to obtain the multiple The third image.
  • the preprocessing unit 205 is specifically configured to use the The multiple target second uniform grid images are divided to obtain multiple uniform grid image sets; the nodule probability corresponding to each uniform network image set in the multiple uniform network image sets is superimposed to obtain multiple Superimposed value; averaging the superimposed value corresponding to each uniform network image set in the plurality of uniform network image sets to obtain multiple average values; extracting the average value of the multiple average values greater than the third threshold A uniform grid image in a uniform grid image set to obtain the plurality of fourth images.
  • the preprocessing unit 205 is further configured to perform data enhancement on each fourth image in the multiple fourth images to obtain multiple fifth images; according to the multiple labeled images
  • the nodule marking information included in each marked image is used to obtain the reference nodule type corresponding to each fifth image in the plurality of fifth images; according to the plurality of fifth images and each of the plurality of fifth images
  • a second initial neural network is trained to obtain the second network parameters of the third neural network, and the second initial neural network is the first network parameter without defined network parameters.
  • the label information of each first image in the first image set further includes a target nodule type
  • the training unit is further configured to determine the type of nodule included in each of the multiple labeled images. Section mark information to obtain the reference nodule type corresponding to each fifth image in the multiple fifth images; according to the multiple fifth images and the reference corresponding to each fifth image in the multiple fifth images Nodules type, training the second initial neural network to obtain the second network parameters of the third neural network.
  • the preprocessing unit is specifically configured to perform rotation processing on the mask corresponding to the target fourth image according to the first angle, so as to Obtain the first sub-processed image; subtract the average value from the first sub-processed image to obtain the second sub-processed image; perform the width of the mask corresponding to the second sub-processed image according to the first multiple Size processing to obtain the third sub-processed image; according to the second multiple, size processing is performed on the length of the mask corresponding to the third sub-processed image to obtain the fourth sub-processed image; according to the third multiple, the The fourth sub-processed image is subjected to size processing to obtain the fifth sub-processed image; according to the second angle, the mask of the sixth sub-processed image is mirrored and inverted to obtain the first corresponding to the target fourth processed image Five images.
  • the fourth processing unit 204 is specifically configured to perform feature weighting on the probability map of the second category and the probability map of the third category, so as to obtain a reference to each of the multiple nodule units.
  • the fourth category probability map of the nodule type of the nodular unit; the fourth category probability map is input to the fourth neural network to obtain the lung cancer probability.
  • FIG. 3 is a schematic structural diagram of an electronic device provided by an embodiment of the present application.
  • the electronic device 300 includes a processor 310, a memory 320, a communication interface 330, and one or more programs 340.
  • the one or more programs 340 are stored in the memory 320 and are configured by
  • the foregoing processor 310 executes, and the foregoing program 340 includes instructions for executing the following steps:
  • Each nodular unit in the multiple nodular units is input to a third neural network to obtain a third category probability map for the nodule type of each nodular unit in the multiple nodular units, so
  • the nodule types include benign nodules and malignant nodules, and the third neural network is used to identify the nodule type of each nodular unit of the plurality of nodular units;
  • the second category probability map and the third category probability map are input to a fourth neural network to obtain the lung cancer probability of the target patient corresponding to the target lung scan image, and the fourth neural network is used for Classify the second category probability map and the third category probability map.
  • the electronic device first recognizes the nodule image of the lung scan image, and then determines the probability of lung cancer through the locally recognized nodule type and the globally recognized nodule type, which improves the accuracy of image recognition of lung cancer lesions.
  • the program 340 is also used to execute the instructions of the following steps:
  • the multiple second processed images are stereoscopically stacked to obtain the target lung scan image.
  • the program 340 is also used to execute the instructions of the following steps:
  • Each marker image in the multiple marker images is divided into regions to obtain multiple first images.
  • Each first image includes multiple uniform grid images.
  • the size of each uniform grid image is the first threshold.
  • the marked image includes nodule marking information;
  • the network is the first neural network without defining network parameters
  • the first initial neural network is trained to obtain the first neural network A network parameter
  • the program 340 is specifically used to execute instructions of the following steps:
  • Copy processing is performed on the fourth image of each second image in the plurality of second images to obtain the plurality of third images.
  • the program 340 is specifically configured to execute the instructions of the following steps :
  • the program 340 is also used to execute the instructions of the following steps:
  • the second initial neural network is trained to obtain the second network of the third neural network Parameters, the second initial neural network is the third neural network without defining network parameters.
  • the marking information of each first image in the first image set further includes the target nodule type
  • the program 340 is further used to execute instructions of the following steps:
  • Copy processing is performed on the fourth image of each second image in the plurality of second images to obtain the plurality of third images.
  • the program 340 is further used to execute the instructions of the following steps:
  • size processing is performed on the width of the mask corresponding to the second sub-processed image to obtain a third sub-processed image
  • size processing is performed on the length of the mask corresponding to the third sub-processed image to obtain a fourth sub-processed image
  • the mask of the sixth sub-processed image is mirrored and reversed to obtain the fifth image corresponding to the target fourth processed image.
  • the program 340 is specifically used to execute instructions of the following steps:
  • the fourth category probability map is input to a fourth neural network to obtain the lung cancer probability.
  • the embodiments of the present application also provide a computer-readable storage medium, where the computer-readable storage medium stores a computer program for storing a computer program that enables a computer to execute part or all of the steps of any method as recorded in the method embodiment ,
  • Computers include electronic equipment.
  • the embodiments of the present application also provide a computer program product.
  • the computer program product includes a non-transitory computer-readable storage medium storing a computer program.
  • the computer program is operable to make a computer execute a part of any method described in the method embodiment. Or all steps.
  • the computer program product may be a software installation package, and the computer includes electronic equipment.
  • the functions described in this application can be implemented by hardware, software, firmware or any combination thereof. When implemented by software, these functions can be stored in a computer-readable medium or transmitted as one or more instructions or codes on the computer-readable medium.
  • the computer-readable medium includes a computer storage medium and a communication medium, where the communication medium includes any medium that facilitates the transfer of a computer program from one place to another.
  • the storage medium may be any available medium that can be accessed by a general-purpose or special-purpose computer.

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • Health & Medical Sciences (AREA)
  • Medical Informatics (AREA)
  • General Physics & Mathematics (AREA)
  • Public Health (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Biomedical Technology (AREA)
  • General Engineering & Computer Science (AREA)
  • Pathology (AREA)
  • Evolutionary Computation (AREA)
  • Artificial Intelligence (AREA)
  • Multimedia (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Evolutionary Biology (AREA)
  • Databases & Information Systems (AREA)
  • Probability & Statistics with Applications (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Epidemiology (AREA)
  • General Health & Medical Sciences (AREA)
  • Primary Health Care (AREA)
  • Image Analysis (AREA)
  • Image Processing (AREA)
  • Apparatus For Radiation Diagnosis (AREA)

Abstract

The embodiments of the present application disclose an image recognition method and a related device. Said method comprises: inputting a target lung scanning image into a first neural network, so as to obtain a first category probability map; inputting the first category probability map into a second neural network, so as to obtain a second category probability map; extracting nodule units in the target lung scanning image according to the first category probability map, so as to obtain a plurality of nodule units; inputting each of the plurality of nodule units into a third neural network respectively, so as to obtain a third category probability map for the nodule type of each of the plurality of nodule units; and inputting the second category probability map and the third category probability map into a fourth neural network, so as to obtain a lung cancer prevalence rate of a target patient corresponding to the target lung scanning image. The present application improves the accuracy of lung cancer lesion site image recognitions.

Description

图像识别方法及相关设备Image recognition method and related equipment
本申请要求于2019年2月21日提交中国专利局、申请号为201910135802.6、申请名称为“图像识别方法及相关设备”的中国专利申请的优先权,其全部内容通过引用结合在本申请中。This application claims the priority of a Chinese patent application filed with the Chinese Patent Office on February 21, 2019, the application number is 201910135802.6, and the application name is "Image Recognition Method and Related Equipment", the entire content of which is incorporated into this application by reference.
技术领域Technical field
本申请涉及数据处理技术领域,主要涉及了一种图像识别方法及相关设备。This application relates to the field of data processing technology, and mainly relates to an image recognition method and related equipment.
背景技术Background technique
肺癌是发病率和死亡率增长最快,对人群健康和生命威胁最大的恶性肿瘤之一。近50年来许多国家都报道肺癌的发病率和死亡率均明显增高,传统的肺癌筛查依靠专业的医疗人员对肺部影像进行解读,筛查出可疑的肺部结节,这对于医疗人员的工作量要求极高,且容易出现假阳性诊断,因此,如何提高肺癌病灶部位的图像识别的准确率是本领域技术人员待解决的技术问题。Lung cancer is one of the malignant tumors with the fastest increase in morbidity and mortality and the greatest threat to the health and life of the population. In the past 50 years, many countries have reported that the incidence and mortality of lung cancer have increased significantly. Traditional lung cancer screening relies on professional medical personnel to interpret lung images and screen out suspicious lung nodules. This is a problem for medical personnel. The workload is extremely demanding and false positive diagnosis is prone to occur. Therefore, how to improve the accuracy of image recognition of lung cancer lesions is a technical problem to be solved by those skilled in the art.
发明内容Summary of the invention
本申请实施例提供了一种图像识别方法及相关设备,可通过肺部扫描图像识别患者的肺癌患病概率,提高了肺癌病灶部位的图像识别的准确率。The embodiments of the present application provide an image recognition method and related equipment, which can recognize the lung cancer prevalence probability of a patient through lung scan images, and improve the accuracy of image recognition of lung cancer lesions.
第一方面,本申请实施例提供一种图像识别方法,其中:In the first aspect, an embodiment of the present application provides an image recognition method, wherein:
将目标肺部扫描图像输入至第一神经网络,以得到针对有结节和无结节的第一类别概率图,所述第一神经网络用于识别所述目标肺部扫描图像中的结节图像;Input the target lung scan image to a first neural network to obtain a first category probability map for nodules and no nodules, and the first neural network is used to identify nodules in the target lung scan image image;
将所述第一类别概率图输入至第二神经网络,以得到针对良性结节、恶性结节和无结节的第二类别概率图,所述第二神经网络用于识别所述第一类别概率图中的结节图像的结节类型;Input the first category probability map to a second neural network to obtain a second category probability map for benign nodules, malignant nodules and no nodules, and the second neural network is used to identify the first category The nodule type of the nodule image in the probability map;
根据所述第一类别概率图提取所述目标肺部扫描图像中的结节单元,以得到多个结节单元;Extracting nodular units in the scan image of the target lung according to the first category probability map to obtain multiple nodular units;
分别将所述多个结节单元中每一结节单元输入至第三神经网络,以得到针对所述多个结节单元中每一结节单元的结节类型的第三类别概率图,所述结节类型包括良性结节和恶性结节,所述第三神经网络用于分别识别所述多个结节单元中每一结节单元的结节类型;Each nodular unit in the multiple nodular units is input to a third neural network to obtain a third category probability map for the nodule type of each nodular unit in the multiple nodular units, so The nodule types include benign nodules and malignant nodules, and the third neural network is used to identify the nodule type of each nodular unit of the plurality of nodular units;
将所述第二类别概率图和所述第三类别概率图输入至第四神经网络,以得到所述目标肺部扫描图像对应的目标患者的肺癌患病概率,所述第四神经网络用于对所述第二类别概率图和所述第三类别概率图进行分类。The second category probability map and the third category probability map are input to a fourth neural network to obtain the lung cancer probability of the target patient corresponding to the target lung scan image, and the fourth neural network is used for Classify the second category probability map and the third category probability map.
第二方面,本申请实施例提供一种图像识别装置,其中:In the second aspect, an embodiment of the present application provides an image recognition device, wherein:
第一处理单元,用于将目标肺部扫描图像输入至第一神经网络,以得到针对有结节和无结节的第一类别概率图,所述第一神经网络用于识别所述目标肺部扫描图像中的结节图像;The first processing unit is configured to input the scanned image of the target lung to a first neural network to obtain a first category probability map for nodules and no nodules, and the first neural network is used to identify the target lung Nodules in the scanned images;
第二处理单元,用于将所述第一类别概率图输入至第二神经网络,以得到针对良性结节、恶性结节和无结节的第二类别概率图,所述第二神经网络用于识别所述第一类别概率图中的结节图像的结节类型;The second processing unit is configured to input the first category probability map to a second neural network to obtain a second category probability map for benign nodules, malignant nodules and no nodules, and the second neural network uses To identify the nodule type of the nodule image in the first category probability map;
第三处理单元,用于根据所述第一类别概率图提取所述目标肺部扫描图像中的结节单元,以得到多个结节单元;分别将所述多个结节单元中每一结节单元输入至第三神经网络,以得到针对所述多个结节单元中每一结节单元的结节类型的第三类别概率图,所述结节类型包括良性结节和恶性结节,所述第三神经网络用于分别识别所述多个结节单元中每一结节单元的结节类型;The third processing unit is configured to extract nodule units in the scan image of the target lung according to the first category probability map to obtain a plurality of nodule units; respectively, each of the multiple nodule units The nodule unit is input to the third neural network to obtain a third category probability map for the nodule type of each nodule unit in the plurality of nodule units, where the nodule types include benign nodules and malignant nodules, The third neural network is used to identify the nodule type of each nodular unit in the multiple nodular units;
第四处理单元,用于将所述第二类别概率图和所述第三类别概率图输入至第四神经网络,以得到所述目标肺部扫描图像对应的目标患者的肺癌患病概率,所述第四神经网络用于对所述第二类别概率图和所述第三类别概率图进行分类。The fourth processing unit is used to input the second category probability map and the third category probability map to a fourth neural network to obtain the lung cancer probability of the target patient corresponding to the target lung scan image, so The fourth neural network is used to classify the second category probability map and the third category probability map.
第三方面,本申请实施例提供一种电子设备,包括处理器、存储器、通信接口以及一个或多个程序,其中,上述一个或多个程序被存储在上述存储器中,并且被配置由上述处理器执行,所述程序包括用于如第一方面中所描述的部分或全部步骤的指令。In a third aspect, embodiments of the present application provide an electronic device, including a processor, a memory, a communication interface, and one or more programs, wherein the one or more programs are stored in the memory and configured to be processed by the above The program includes instructions for some or all of the steps described in the first aspect.
第四方面,本申请实施例提供了一种计算机可读存储介质,其中,所述计算机可读存储介质存储计算机程序,其中,所述计算机程序使得计算机执行如本申请实施例第一方面中所描述的部分或全部步骤。In a fourth aspect, an embodiment of the present application provides a computer-readable storage medium, wherein the computer-readable storage medium stores a computer program, wherein the computer program causes the computer to execute the Some or all of the steps described.
采用了上述的图像识别方法及相关设备之后,电子设备先识别肺部扫描图像的结节图像,再通过局部识别的结节类型和全局识别的结节类型确定肺癌患病概率,提高了肺癌病灶部位的图像识别的准确率。After adopting the above-mentioned image recognition method and related equipment, the electronic device first recognizes the nodule image of the lung scan image, and then determines the lung cancer probability through the locally recognized nodule type and the globally recognized nodule type, which improves the lung cancer focus Accuracy of image recognition of parts.
附图说明Description of the drawings
为了更清楚地说明本申请实施例中的技术方案,下面将对实施例中所需要使用的附图作简单地介绍,显而易见地,下面描述中的附图仅仅是本申请的一些实施例,对于本领域普通技术人员来讲,在不付出创造性劳动的前提下,还可以根据这些附图获得其他的附图。In order to more clearly describe the technical solutions in the embodiments of the present application, the following will briefly introduce the drawings needed in the embodiments. Obviously, the drawings in the following description are only some embodiments of the present application. For those of ordinary skill in the art, without creative work, other drawings can be obtained based on these drawings.
图1为本申请实施例提供的一种图像识别方法的流程示意图;FIG. 1 is a schematic flowchart of an image recognition method provided by an embodiment of this application;
图2为本申请实施例提供的一种图像识别装置的结构示意图;2 is a schematic structural diagram of an image recognition device provided by an embodiment of this application;
图3为本申请实施例提供的一种电子设备的结构示意图。FIG. 3 is a schematic structural diagram of an electronic device provided by an embodiment of the application.
具体实施方式detailed description
为了使本技术领域的人员更好地理解本申请方案,下面将结合本申请实施例中的附图,对本申请实施例中的技术方案进行清楚、完整地描述。下面对本申请实施例进行详细介绍。In order to enable those skilled in the art to better understand the solutions of the present application, the technical solutions in the embodiments of the present application will be clearly and completely described below in conjunction with the drawings in the embodiments of the present application. The following describes the embodiments of the present application in detail.
请参照图1,本申请实施例提供一种图像识别方法的流程示意图。该图像识别方法应用于电子设备。本申请实施例所涉及到的电子设备可以包括各种具有无线通信功能的手持设备、可穿戴设备、计算设备或连接到无线调制解调器的其他处理设备,以及各种形式的用户设备(user equipment,UE),移动台(mobile station,MS),终端设备(terminal device)等等。为方便描述,上面提到的设备统称为电子设备。Please refer to FIG. 1, an embodiment of the present application provides a schematic flowchart of an image recognition method. The image recognition method is applied to electronic equipment. The electronic devices involved in the embodiments of the present application may include various handheld devices with wireless communication functions, wearable devices, computing devices or other processing devices connected to a wireless modem, as well as various forms of user equipment (UE). ), mobile station (mobile station, MS), terminal device (terminal device), etc. For ease of description, the devices mentioned above are collectively referred to as electronic devices.
具体的,如图1所示,一种图像识别方法,应用于电子设备,其中:Specifically, as shown in Figure 1, an image recognition method is applied to an electronic device, where:
S101:将目标肺部扫描图像输入至第一神经网络,以得到针对有结节和无结节的第一类别概率图。S101: Input the scanned image of the target lung to the first neural network to obtain the first category probability map for nodules and no nodules.
在本申请中,目标肺部扫描图像为患者在医院中进行肺部计算机体层摄影(Computed Tomography,CT)得到的图像。本申请对于具体的扫描方式不做限定,可让患者仰卧位,头先进,采集螺旋扫描方式从肺尖开始扫描至肺底,采集层厚小于等于1毫米,重建层厚5至7毫米,层间距5至7毫米,纵膈窗窗宽300至500HU,窗位30至50HU;肺窗窗宽800至1500HU,窗位-600至800HU。其中HU为CT值单位,又称亨氏单位,用来表示CT图像上组织结构的相对密度。In the present application, the target lung scan image is an image obtained by a patient’s lung computed tomography (CT) in a hospital. This application does not limit the specific scanning method. The patient can be placed in the supine position with the head advanced. The acquisition spiral scanning method scans from the tip of the lung to the bottom of the lung. The thickness of the acquisition layer is less than or equal to 1 mm. The spacing is 5 to 7 mm, the width of the mediastinal window is 300 to 500 HU, and the window level is 30 to 50 HU; the width of the lung window is 800 to 1500 HU, and the window level is -600 to 800 HU. Among them, HU is the unit of CT value, also known as the Heinz unit, used to express the relative density of the tissue structure on the CT image.
在一种可能的实施例中,在所述将目标肺部扫描图像输入至第一神经网络,以得到针对有结节和无结节的第一类别概率图之前,所述方法还包括:获取待识别的多张肺部扫描图像;对所述多张肺部扫描图像中每一肺部扫描图像进行形态学去噪以得到多张第一处理图像;对所述多张第一处理图像中每一第一处理图像进行像素归一化处理以得到多张第二处理图像;根据所述多张肺部扫描图像的扫描序列和预设尺寸,对所述多张第二处理图像进行立体堆叠以得到所述目标肺部扫描图像。In a possible embodiment, before inputting the scanned image of the target lung to the first neural network to obtain the first category probability map for nodules and no nodules, the method further includes: obtaining Multiple lung scan images to be recognized; perform morphological denoising on each lung scan image in the multiple lung scan images to obtain multiple first processed images; Each first processed image undergoes pixel normalization processing to obtain multiple second processed images; according to the scan sequence and preset size of the multiple lung scan images, the multiple second processed images are stereoscopically stacked To obtain a scanned image of the target lung.
其中,多张肺部扫描图像为平面扫描图像,像素值范围是(-1024,3071),对应于houns场的放射密度单位。形态学运算是针对二值图像依据数学形态学(Mathematical Morphology)的集合论方法发展起来的图像处理方法。可以理解,肺部扫描图像难免存在噪音,例如:原始CT包含衣物、医疗设备等,在此不做限定。在该实施例中,根据形态学进行去噪处理,可去除肺部扫描图像中的噪声,便于提高图像识别的识别效率和准确率。Among them, the multiple lung scan images are planar scan images, and the pixel value range is (-1024, 3071), which corresponds to the radiodensity unit of the houns field. Morphology operation is an image processing method developed for binary images based on the set theory method of Mathematical Morphology. It can be understood that the lung scan images inevitably have noise, for example, the original CT includes clothing, medical equipment, etc., which is not limited here. In this embodiment, denoising processing is performed based on morphology, which can remove noise in the lung scan image, which facilitates the improvement of the recognition efficiency and accuracy of image recognition.
在一种可能的实施例中,若所述多张第一处理图像包括目标第一处理图像,则所述对所述多张肺部扫描图像中每一肺部扫描图像进行形态学去噪以得到多张第一处理图像,包括:对所述目标第一处理图像进行膨胀运算以得到第 一向量;对所述目标第一处理图像进行腐蚀运算以得到第二向量;对所述第一向量和所述第二向量进行合并以得到所述目标第一处理图像对应的第一处理图像。In a possible embodiment, if the multiple first processed images include the target first processed image, the morphological denoising is performed on each lung scan image in the multiple lung scan images to Obtaining multiple first processed images includes: performing an expansion operation on the target first processed image to obtain a first vector; performing an erosion operation on the target first processed image to obtain a second vector; Merge with the second vector to obtain a first processed image corresponding to the target first processed image.
其中,膨胀(Dilation)运算是图像中的高亮部分进行膨胀,类似于领域扩张,效果图拥有比原图更大的高亮区域。腐蚀(Erosion)运算是原图的高亮部分被腐蚀,类似于领域被蚕食,效果图拥有比原图更小的高亮区域。从数学的角度来说,膨胀运算和腐蚀运算就是将图像与核进行卷积,核可以是任意形状和大小的。可以理解,以多张第一处理图像中的目标第一处理图像为例,分别对目标第一处理图像进行膨胀运算和腐蚀运算,然后采用向量加法来实现两个集合的合并以得到去噪的第一处理图像。如此,可去除肺部扫描图像中的噪声,便于提高图像识别的识别效率和准确率。Among them, the dilation operation is to expand the highlighted part of the image, similar to the field expansion, the effect picture has a larger highlight area than the original image. The Erosion operation is that the highlight part of the original image is corroded, similar to the area being eroded, the effect image has a smaller highlight area than the original image. From a mathematical point of view, the expansion operation and the erosion operation are to convolve the image with the kernel, which can be of any shape and size. It can be understood that taking the target first processed image in the multiple first processed images as an example, the expansion operation and the erosion operation are performed on the target first processed image respectively, and then vector addition is used to merge the two sets to obtain the denoising First process the image. In this way, noise in the lung scan image can be removed, which is convenient for improving the recognition efficiency and accuracy of image recognition.
在一种可能的实施例中,所述对所述多张肺部扫描图像中每一肺部扫描图像进行形态学去噪以得到多张第一处理图像,包括:对所述多张肺部扫描图像中每一肺部扫描图像进行预处理以得到多张第四处理图像;对所述多张第四处理图像中每一第四处理图像进行形态学去噪以得到所述多张第一处理图像。In a possible embodiment, the performing morphological denoising on each lung scan image in the multiple lung scan images to obtain multiple first processed images includes: denoising the multiple lung scan images. Each lung scan image in the scanned image is preprocessed to obtain multiple fourth processed images; each fourth processed image in the multiple fourth processed images is morphologically denoised to obtain the multiple first processed images Process images.
其中,预处理可包括图像格式转换处理、图像缺失填补处理、减去平均值、规范化(normalization)、主成分分析(principal components analysis,PCA)和白化(whiten)等。在该实施例中,通过对肺部扫描图像进行预处理得到的第四处理图像,可进一步提高图像识别的识别效率和准确率。Among them, preprocessing may include image format conversion processing, image deletion filling processing, average subtraction, normalization, principal component analysis (PCA), whiten, and so on. In this embodiment, the fourth processed image obtained by preprocessing the lung scan image can further improve the recognition efficiency and accuracy of image recognition.
本申请对于预设尺寸不做限定,可以为512*512*512,可尽量保持真实的宽高比。可以理解,先将多张扫描得到的肺部扫描图像进行形态学去噪以得到去除噪声的多张第一处理图像,便于提高图像识别的识别效率和准确率。然后对各张第一处理图像进行像素归一化处理以得到像素值归一到(0,1)范围的多张第二处理图像,可消除指标之间的量纲影响,以提高数据指标之间的可比性。再根据多张肺部扫描图像的扫描序列和预设尺寸对多张第二处理图像进行立体堆叠以得到立体的目标肺部扫描图像。如此,便于满足神经网络的处理要求,以及便于提高图像识别的识别效率和准确率。This application does not limit the preset size, which can be 512*512*512, and the real aspect ratio can be maintained as much as possible. It can be understood that the morphological denoising of lung scan images obtained by multiple scans is performed to obtain multiple first processed images with noise removed, which is convenient to improve the recognition efficiency and accuracy of image recognition. Then perform pixel normalization processing on each of the first processed images to obtain multiple second processed images whose pixel values are normalized to the range (0,1), which can eliminate the dimensional influence between the indicators to improve the data indicators. Comparability between. Then, the multiple second processed images are stereo-stacked according to the scanning sequence and the preset size of the multiple lung scan images to obtain a three-dimensional target lung scan image. In this way, it is convenient to meet the processing requirements of the neural network, and it is convenient to improve the recognition efficiency and accuracy of image recognition.
在本申请中,第一神经网络用于识别目标肺部扫描图像中的结节图像,即输入该第一神经网络,可获取针对有结节和无结节的第一类别概率图。在执行步骤S101之前,该第一神经网络是训练完成的,本申请对于其训练方法不做限定。在一种可能的实施例中,所述方法还包括:将多张标记图像中每一标记图像进行区域划分以得到多张第一图像;从所述多张第一图像中每一第一图像提取第二阈值个所述均匀网格图像以得到多张第二图像;对所述多张第二图像中每一第二图像进行尺寸处理以得到多张第三图像;根据所述多张标记图像中每一标记图像包括的结节标记信息,获取所述多张第三图像中每一第三图像对应的参考结节位置;根据所述多张第三图像和所述多张第三图像中每一第三图 像对应的参考结节位置,对所述第一初始神经网络进行训练以得到所述第一神经网络的第一网络参数;根据所述第一初始神经网络和所述第一网络参数获取所述第一神经网络。In this application, the first neural network is used to identify the nodule image in the target lung scan image, that is, input the first neural network to obtain the first category probability map for nodules and no nodules. Before step S101 is executed, the training of the first neural network is completed, and the training method thereof is not limited in this application. In a possible embodiment, the method further includes: dividing each marked image in the plurality of marked images to obtain a plurality of first images; from each of the plurality of first images Extracting a second threshold of the uniform grid images to obtain multiple second images; performing size processing on each of the multiple second images to obtain multiple third images; according to the multiple marks Obtain the reference nodule position corresponding to each third image in the multiple third images according to the nodule marking information included in each marked image in the image; according to the multiple third images and the multiple third images Training the first initial neural network to obtain the first network parameters of the first neural network according to the reference nodule position corresponding to each third image in the third image; according to the first initial neural network and the first neural network The network parameter obtains the first neural network.
在本申请中,每一标记图像包括结节标记信息,均采用前述的扫描方法和处理方法,且每一标记图像均进行了人工标记。例如:由三位或指定位数的放射科医生商定每个第一图像中结节的数量、位置、大小或类型等结节标记信息。In this application, each marked image includes nodule marking information, using the aforementioned scanning method and processing method, and each marked image is manually marked. For example: three radiologists or a designated number of radiologists agree on the number, location, size or type of nodules in each first image and other nodule marking information.
每一第一图像包括多张均匀网格图像,每一均匀网络图像的大小为第一阈值。本申请对于第一阈值不做限定,可以为16*16*16。也就是说,将各个标记图像进行区域划分,以使区域划分之后得到的第一图像中各个均匀网格图像的大小为第一阈值。Each first image includes a plurality of uniform grid images, and the size of each uniform network image is a first threshold. This application does not limit the first threshold, which can be 16*16*16. That is, each marker image is divided into regions, so that the size of each uniform grid image in the first image obtained after the region division is the first threshold.
每一第二图像中包括第二阈值个均匀网格图像,本申请对于第二阈值也不做限定,可以为128,也就是说,只提取每一第一图像中的指定数量的均匀网格图像,如此,可提高运算效率。Each second image includes a second threshold of uniform grid images. This application does not limit the second threshold, which can be 128, that is, only a specified number of uniform grids in each first image are extracted Images, in this way, can improve computing efficiency.
第一初始神经网络为没有定义网络参数的所述第一神经网络,每一第三图像的大小满足第一初始神经网络定义的输入尺寸。本申请对于第二图像的尺寸处理方法也不做限定,可以进行零填充;也可复制有结节的均匀网格图像,可保持阶级平衡;还可使用3D卷积合并,并用(1*1*1)卷积代替全局平均合并操作,以此得到满足训练图像大小的图像。本申请对于第三图像的大小不做限定,可以为32*32*32。可以理解,由于输入的尺寸较小,可提高运算效率。The first initial neural network is the first neural network without network parameters defined, and the size of each third image meets the input size defined by the first initial neural network. This application also does not limit the size processing method of the second image, which can be zero-filled; it can also copy a uniform grid image with nodules to maintain class balance; it can also use 3D convolution to merge and use (1*1 *1) Convolution replaces the global average merge operation to obtain an image that meets the size of the training image. This application does not limit the size of the third image, which can be 32*32*32. It can be understood that due to the smaller input size, the calculation efficiency can be improved.
如前所述,每一标记图像包括结节标记信息,第三图像为标记图像对应的处理图像,则可根据标记图像的结节标记信息获取第三图像对应的参考结节位置,即获取待训练图像的参考结节位置。As mentioned above, each marked image includes nodule marking information, and the third image is a processed image corresponding to the marked image. The reference nodule position corresponding to the third image can be obtained according to the nodule marking information of the marked image, that is, the position to be The reference nodule position of the training image.
在一种可能的实施例中,所述对所述多张第二图像中每一第二图像进行尺寸处理以得到多张第三图像,包括:提取所述多张第二图像中存在结节的均匀网格图像以得到多张第四图像;对所述多张第二图像中每一第二图像的第四图像进行复制处理以得到所述多张第三图像。In a possible embodiment, the performing size processing on each of the plurality of second images to obtain a plurality of third images includes: extracting nodules in the plurality of second images To obtain a plurality of fourth images; and perform copy processing on the fourth image of each second image in the plurality of second images to obtain the plurality of third images.
其中,第四图像为存在结节的均匀网络图像,本申请对于提取存在结节的均匀网格图像的方法不做限定,在一种可能的实施例中,若所述多张第二图像包括目标第二图像,所述目标第二图像对应多张目标第二均匀网络图像,则所述方法还包括:将所述多张目标第二均匀网格图像进行划分以得到多个均匀网格图像集;对所述多张均匀网络图像集中每一均匀网络图像集对应的结节概率进行叠加运算以得到多个叠加值;对所述多张均匀网络图像集中每一均匀网络图像集对应的叠加值进行平均运算以得到多个平均值;提取所述多个平均值中的平均值大于第三阈值对应的均匀网格图像集中的均匀网格图像以得到所述多张第四图像。Wherein, the fourth image is a uniform network image with nodules. This application does not limit the method for extracting uniform grid images with nodules. In a possible embodiment, if the multiple second images include The target second image, where the target second image corresponds to multiple target second uniform network images, and the method further includes: dividing the multiple target second uniform grid images to obtain multiple uniform grid images Set; superimpose the nodule probability corresponding to each uniform network image set in the plurality of uniform network image sets to obtain multiple superposition values; superimpose corresponding to each uniform network image set in the plurality of uniform network image sets Perform an averaging operation on the values to obtain a plurality of average values; extract the uniform grid images in the uniform grid image set corresponding to the average value of the plurality of averages greater than the third threshold to obtain the plurality of fourth images.
其中,均匀网格图像集的划分方法可以随机分配,例如,扫描到10个均 匀网格图像分为一组。本申请对于第三阈值不做限定,可以为0.5。可以理解,将多个目标第二均匀网格图像进行集合得到均匀网格图像集,并对各个均匀网格图像集对应的结节概率进行叠加运算以得到多个叠加值,以及对各个均匀网格图像集对应的叠加值进行平均运算以得到多个平均值,若平均值大于第三阈值,则确定该平均值对应的均匀网格图像集中每一均匀网格图像存在结节。如此,通过图像集的方式确定是否包含结节,可提高提取第四图像的效率。Among them, the method of dividing the uniform grid image set can be randomly assigned, for example, scanning to 10 uniform grid images is divided into a group. This application does not limit the third threshold, which can be 0.5. It can be understood that the second uniform grid images of multiple targets are collected to obtain a uniform grid image set, and the nodule probabilities corresponding to each uniform grid image set are superimposed to obtain multiple superimposed values, and for each uniform grid The superimposed values corresponding to the grid image set are averaged to obtain multiple average values. If the average value is greater than the third threshold, it is determined that nodules exist in each uniform grid image in the uniform grid image set corresponding to the average value. In this way, determining whether a nodule is included in the image set can improve the efficiency of extracting the fourth image.
本申请对于第一初始神经网络的训练过程不做限定,可采用批量梯度下降算法(Batch Gradient Descent,BGD)、随机梯度下降算法(Stochastic Gradient Descent,SGD)或小批量梯度下降算法(mini-batch SGD)等进行训练。一个训练周期由单次正向运算和反向梯度传播完成,即将待训练的图像正向输入至待训练的神经网络以得到输出的目标对象,若目标对象与参考对象匹配失败,则根据目标对象与参考对象获取损失函数,再根据该损失函数反向输入至神经网络,以调整该神经网络的网络参数,例如:权值和偏置。然后,再输入下一个待训练的图像,直至匹配成功或完成所有图像的训练。在第一神经网络的训练过程中,参考对象为参考结节位置,目标对象为目标结节位置。This application does not limit the training process of the first initial neural network. Batch Gradient Descent (BGD), Stochastic Gradient Descent (SGD), or mini-batch gradient descent (mini-batch) algorithm can be used. SGD) and so on for training. A training cycle is completed by a single forward operation and reverse gradient propagation, that is, the image to be trained is forwardly input to the neural network to be trained to obtain the output target object. If the target object fails to match the reference object, the target object Obtain the loss function from the reference object, and then input the loss function back to the neural network to adjust the network parameters of the neural network, such as weight and bias. Then, input the next image to be trained until the matching is successful or the training of all images is completed. In the training process of the first neural network, the reference object is the reference nodule position, and the target object is the target nodule position.
在一种可能的实施例中,所述根据所述多张第三图像和所述多张第三图像中每一第三图像对应的参考结节位置,对所述第一初始神经网络进行训练以得到所述第一神经网络的第一网络参数,包括:按照预设比例将所述多张第三图像进行划分以得到多张第一训练图像和多张第一验证图像;根据所述多张第一训练图像中每一第一训练图像对应的参考结节位置对所述第一初始神经网络进行分类以得到所述第一神经网络的待验证网络参数;根据所述多张第一验证图像对所述待验证网络参数进行验证以得到所述第一网络参数。In a possible embodiment, the first initial neural network is trained based on the multiple third images and the reference nodule position corresponding to each third image in the multiple third images To obtain the first network parameters of the first neural network, including: dividing the multiple third images according to a preset ratio to obtain multiple first training images and multiple first verification images; Classify the first initial neural network to obtain the to-be-verified network parameters of the first neural network according to the reference nodule position corresponding to each first training image in the first training images; The image verifies the network parameter to be verified to obtain the first network parameter.
本申请对于预设比例不做限定,可以为7:3。本申请对于分类算法也不做限定,可采用逻辑回归或者决策树算法,对该多个第一训练图像对应的图像特征和参考结节位置进行分类,从而得到第一神经网络的待验证网络参数。This application does not limit the preset ratio, which can be 7:3. This application does not limit the classification algorithm. Logistic regression or decision tree algorithm can be used to classify the image features and reference nodule positions corresponding to the multiple first training images, so as to obtain the network parameters to be verified of the first neural network. .
验证处理用于根据多张第一验证图像将已得到网络参数的待验证神经网络进行训练以得到第一神经网络的第一网络参数,具体可参照前述的训练周期的方法,在此不再赘述。如此,就可输入测试图像,即执行S101。The verification process is used to train the neural network to be verified for which the network parameters have been obtained according to the multiple first verification images to obtain the first network parameters of the first neural network. For details, please refer to the method of the aforementioned training period, which will not be repeated here. . In this way, the test image can be input, that is, S101 is executed.
可以理解,按照预设比例将所述多张第三图像进行划分以得到多张第一训练图像和多张第一验证图像,然后根据所述多张第一训练图像对所述第一初始神经网络进行分类以得到第一神经网络的待验证网络参数,最后根据所述多张第一验证图像对所述第一神经网络的待验证网络参数进行验证以得到所述第一神经网络的第一网络参数。如此,采用批量梯度下降算法进行训练和验证,提高了第一神经网络的训练速度。It can be understood that the plurality of third images are divided according to a preset ratio to obtain a plurality of first training images and a plurality of first verification images, and then the first initial neural network is analyzed according to the plurality of first training images. The network is classified to obtain the network parameters to be verified of the first neural network, and finally the network parameters to be verified of the first neural network are verified according to the multiple first verification images to obtain the first neural network. Network parameters. In this way, the batch gradient descent algorithm is used for training and verification, which improves the training speed of the first neural network.
本申请第一初始神经网络的训练参数也不做限定,例如:训练采用24个小批次的10000次迭代,学习率为0.01,体重衰减为0.0001使用默认参数 (β 1=0.9,β 2=0.999)的第j个Adam优化器。 The training parameters of the first initial neural network of this application are also not limited. For example, 24 small batches of 10,000 iterations are used for training, the learning rate is 0.01, and the weight attenuation is 0.0001. The default parameters (β 1 =0.9, β 2 = 0.999) the jth Adam optimizer.
在一种可能的实施例中,采用线性整流(Rectified Linear Units,Relu)函数作为激活函数(Activation function)。In a possible embodiment, a linear rectification (Rectified Linear Units, Relu) function is used as an activation function (Activation function).
其中:Relu函数的表达式:f(x)=max(0,x)。可以理解,Relu函数作为激励函数,可增强判定函数和整个神经网络的非线性特性,而本身并不会改变卷积层。Among them: the expression of Relu function: f(x)=max(0,x). It can be understood that the Relu function as an excitation function can enhance the non-linear characteristics of the decision function and the entire neural network without changing the convolutional layer itself.
在一种可能的实施例中,采用加权交叉熵函数作为损失函数。如此,可避免出现较强的阶层失衡。此外,还可通过每批次的重量来平衡损失,并将其应用于较弱的类别。In a possible embodiment, a weighted cross entropy function is used as the loss function. In this way, a strong class imbalance can be avoided. In addition, the loss can be balanced by the weight of each batch and applied to the weaker category.
本申请对于第一类别概率图不做限定,可以是一种密度直方图,用于描述各个均匀网格图像的结节概率。This application does not limit the probability map of the first category, and may be a density histogram used to describe the nodule probability of each uniform grid image.
可以理解,先将各个标记图像进行区域划分以得到多张网格大小相同的第一图像,再提取指定数量的均匀网格图像以得到多张第二图像。如此,提高了运算效率。为了满足运行条件,进一步对多张第二图像进行尺寸处理以得到多张第三图像。然后根据各个标记图像的结节标记信息获取各个第三图像对应的参考结节位置。最后根据多张第三图像以及每一第三图像对应的参考结节位置对第一初始神经网络进行训练以得到第一神经网络的第一网络参数,从而根据第一初始神经网络和第一网络参数获取第一神经网络。如此,提高了第一神经网络的训练速度。It can be understood that firstly, each marked image is divided into regions to obtain multiple first images with the same grid size, and then a specified number of uniform grid images are extracted to obtain multiple second images. In this way, the computational efficiency is improved. In order to meet the operating conditions, the multiple second images are further subjected to size processing to obtain multiple third images. Then, the reference nodule position corresponding to each third image is obtained according to the nodule marking information of each marked image. Finally, the first initial neural network is trained according to the multiple third images and the position of the reference nodule corresponding to each third image to obtain the first network parameters of the first neural network, so that according to the first initial neural network and the first network Parameter acquisition of the first neural network. In this way, the training speed of the first neural network is improved.
S102:将所述第一类别概率图输入至第二神经网络,以得到针对良性结节、恶性结节和无结节的第二类别概率图。S102: Input the probability map of the first category to the second neural network to obtain a probability map of the second category for benign nodules, malignant nodules and no nodules.
在本申请中,第二神经网络用于识别结节图像的结节类型,即进一步识别所述第一类别概率图中的结节图像的结节类型,在将第一类别概率图输入至第二神经网络时,可获取针对良性结节、恶性结节和无结节的第二类别概率图。可以理解,直接将第一类别概率图输入至第二神经网络,可节省识别无结节的时间,提高识别效率。In this application, the second neural network is used to identify the nodule type of the nodule image, that is, to further identify the nodule type of the nodule image in the first category probability map, and input the first category probability map to the first category probability map. In the second neural network, the second category probability map for benign nodules, malignant nodules and no nodules can be obtained. It can be understood that directly inputting the probability map of the first category to the second neural network can save the time for identifying no nodules and improve the efficiency of recognition.
本申请对于第二类别概率图不做限定,可以是一种密度直方图,用于描述各个均匀网格图像的结节类型概率。This application does not limit the probability map of the second category, and may be a density histogram used to describe the nodule type probability of each uniform grid image.
本申请对于目标结节类型的标记方法不做限定,可以将患有癌症的患者的所有结节标记为恶性,无癌患者的所有结节标记为良性,其中,癌症的诊断时长为1年,即在1年内被诊断患有癌症的扫描图片中的结节均被标记为恶性。This application does not limit the labeling method of the target nodule type. All nodules of patients with cancer can be marked as malignant, and all nodules of non-cancer patients can be marked as benign. The diagnosis time of cancer is 1 year. That is, the nodules in the scan pictures diagnosed with cancer within 1 year are all marked as malignant.
在执行步骤S102之前,第二神经网络是训练完成的,其训练方法可参照第一神经网络的训练方法,在此不做赘述,其中,参考对象为参考结节类型,目标对象为目标结节类型。Before step S102 is performed, the second neural network is trained. The training method can refer to the training method of the first neural network, which will not be repeated here. The reference object is the reference nodule type, and the target object is the target nodule. Types of.
本申请对于第二神经网络的训练参数也不做限定,例如:训练阶段进行20000次迭代,学习率为0.01,验证阶段进行30000次迭代,学习率为0.001。This application also does not limit the training parameters of the second neural network. For example, the training phase performs 20,000 iterations with a learning rate of 0.01, and the verification phase performs 30,000 iterations with a learning rate of 0.001.
S103:根据所述第一类别概率图提取所述目标肺部扫描图像中的结节单元,以得到多个结节单元。S103: Extract nodule units in the scan image of the target lung according to the first category probability map to obtain multiple nodule units.
在本申请中,结节单元为第一类别概率图中识别为识别的单元,若均匀网格图像与结节的边界框相交,则可确定该均匀网格图像为结节单元。In this application, a nodule unit is a unit that is identified as a unit in the first category probability map. If the uniform grid image intersects the bounding box of the nodule, the uniform grid image can be determined to be a nodule unit.
S104:分别将所述多个结节单元中每一结节单元输入至第三神经网络,以得到针对所述多个结节单元中每一结节单元的结节类型的第三类别概率图,所述结节类型包括良性结节和恶性结节。S104: Input each nodular unit of the multiple nodular units to a third neural network to obtain a third category probability map for the nodule type of each nodular unit of the multiple nodular units , The nodule types include benign nodules and malignant nodules.
在本申请中,第三神经网络用于分别识别各个结节单元的结节类型,即进一步识别所述第一类别概率图对应的每一结节单元的结节类型,在将多个结节单元输入至该第三神经网络时,可确定每一结节单元为良性结节还是恶性结节的概率。可以理解,直接将第一类别概率图中提取的多个结节图像分别输入至第三神经网络,可提高识别结节类型的准确性。In this application, the third neural network is used to separately identify the nodule type of each nodule unit, that is, to further identify the nodule type of each nodule unit corresponding to the first category probability map, and the multiple nodules When the unit is input to the third neural network, the probability that each nodular unit is a benign nodule or a malignant nodule can be determined. It can be understood that directly inputting multiple nodule images extracted from the first category probability map to the third neural network can improve the accuracy of identifying the nodule type.
本申请对于第三类别概率图不做限定,可以是一种密度直方图,用于描述各个结节单元的结节类型概率。This application does not limit the probability map of the third category, and may be a density histogram to describe the nodule type probability of each nodular unit.
在一种可能的实施例中,所述第一图像集中每一第一图像的标记信息还包括目标结节类型,所述方法还包括:对所述多张第四图像中每一第四图像进行数据增强以得到多张第五图像;根据所述多张标记图像中每一标记图像包括的结节标记信息,获取所述多张第五图像中每一第五图像对应的参考结节类型;根据所述多张第五图像和所述多张第五图像中每一第五图像对应的参考结节类型对第二初始神经网络进行训练以得到所述第三神经网络的第二网络参数。In a possible embodiment, the marking information of each first image in the first image set further includes a target nodule type, and the method further includes: performing an operation on each fourth image in the plurality of fourth images. Perform data enhancement to obtain multiple fifth images; obtain the reference nodule type corresponding to each fifth image in the multiple fifth images according to the nodule marking information included in each marked image in the multiple marked images ; According to the multiple fifth images and the reference nodule type corresponding to each fifth image in the multiple fifth images, the second initial neural network is trained to obtain the second network parameters of the third neural network .
本申请对于数据增强的方法不做限定,可包括音量增强、旋转、减去平均值、放大和缩小等。在一种可能的实施例中,若所述多张第四图像包括目标第四图像,则所述对所述多张第四图像中每一第四图像进行数据增强以得到多张第五图像,包括:按照第一角度,对所述目标第四图像对应的掩膜进行旋转处理以得到第一子处理图像;对所述第一子处理图像进行减去平均值处理以得到第二子处理图像;按照第一倍数,对所述第二子处理图像对应的掩膜的宽度进行尺寸处理以得到第三子处理图像;按照第二倍数,对所述第三子处理图像对应的掩膜的长度进行尺寸处理以得到第四子处理图像;按照第三倍数,对所述第四子处理图像进行尺寸处理以得到第五子处理图像;按照第二角度,对所述第六子处理图像的掩膜进行镜像翻转处理以得到所述目标第四处理图像对应的第五图像。This application does not limit the method of data enhancement, which may include volume enhancement, rotation, average subtraction, enlargement and reduction, etc. In a possible embodiment, if the plurality of fourth images include a target fourth image, the data enhancement is performed on each fourth image in the plurality of fourth images to obtain a plurality of fifth images , Including: performing rotation processing on the mask corresponding to the target fourth image according to a first angle to obtain a first sub-processed image; subtracting an average value from the first sub-processed image to obtain a second sub-processing Image; according to the first multiple, size processing of the width of the mask corresponding to the second sub-processed image to obtain a third sub-processed image; according to the second multiple, the size of the mask corresponding to the third sub-processed image The length is subjected to size processing to obtain the fourth sub-processed image; according to the third multiple, the fourth sub-processed image is subjected to size processing to obtain the fifth sub-processed image; according to the second angle, the size of the sixth sub-processed image is The mask undergoes mirror inversion processing to obtain a fifth image corresponding to the target fourth processed image.
本申请对于第一角度、第一倍数、第二倍数、第三倍数和第四角度不做限定,其中,第一角度可以为小于或等于270度,第一倍数可以为0.9或1.1,第二倍数可以为0.9或1.1,第三倍数可以为0.8或1.2,第二角度可以小于或等于270度。通过设置rotation属性可以旋转显示对象,即将此属性设置为一个数字(0-360),以度为单位,表示应用于该对象的旋转量。This application does not limit the first angle, the first multiple, the second multiple, the third multiple, and the fourth angle. The first angle may be less than or equal to 270 degrees, the first multiple may be 0.9 or 1.1, and the second The multiple can be 0.9 or 1.1, the third multiple can be 0.8 or 1.2, and the second angle can be less than or equal to 270 degrees. The display object can be rotated by setting the rotation property, that is, setting this property to a number (0-360), in degrees, which represents the amount of rotation applied to the object.
可以理解,在该实施例中,以目标第三图像为例,则多张第三图像中的任一第三图像在训练之前,均执行上述多种处理步骤,即对目标第三图像进行旋转、减去平均值、尺寸以及镜像翻转处理,使得目标第三图像对应的第五图像进行数据增强处理。如此,提高了图像的清晰度,便于提高第二神经网络的识别效率。It can be understood that, in this embodiment, taking the target third image as an example, before training any third image in the multiple third images, the above-mentioned multiple processing steps are performed, that is, the target third image is rotated , Subtract the average value, size, and mirror inversion processing, so that the fifth image corresponding to the target third image undergoes data enhancement processing. In this way, the definition of the image is improved, which is convenient for improving the recognition efficiency of the second neural network.
在本申请中,所述第二初始神经网络为没有定义网络参数的所述第三神经网络。第三神经网络的训练方法可参照第一神经网络的训练方法,其中,参考对象为参考结节类型,目标对象为目标结节类型。即将多张第五图像输入至待训练或待验证的神经网络以得到该每一第五图像中的目标结节类型,若该目标结节类型与之前标记的参考结节类型匹配失败,则根据该目标结节类型和参考结节类型获取损失函数,根据该损失函数对神经网络的网络参数进行更新。In this application, the second initial neural network is the third neural network without defining network parameters. The training method of the third neural network can refer to the training method of the first neural network, wherein the reference object is the reference nodule type, and the target object is the target nodule type. That is, input multiple fifth images to the neural network to be trained or to be verified to obtain the target nodule type in each fifth image. If the target nodule type fails to match the previously marked reference nodule type, then The target nodule type and the reference nodule type obtain a loss function, and update the network parameters of the neural network according to the loss function.
本申请对于第三神经网络的训练参数也不做限定,例如:批量大小为32,使用Adam优化器进行6000次迭代,学习率为0.01,权重衰减为0.0001。This application also does not limit the training parameters of the third neural network. For example, the batch size is 32, the Adam optimizer is used for 6000 iterations, the learning rate is 0.01, and the weight attenuation is 0.0001.
可以理解,先提取多张第二图像中存在结节的均匀网格图像以得到多张第四图像,即仅提取结节单元。再对多张第四图像中每一第四图像进行数据增强以得到多张第五图像,可提高数据处理效率。然后根据多张标记图像中每一标记图像包括的结节标记信息,获取多张第五图像中每一第五图像对应的参考结节类型。最后根据多张第五图像和多张第五图像中每一第五图像对应的参考结节类型,对第二初始神经网络进行训练以得到第三神经网络的第二网络参数,第二初始神经网络为没有定义网络参数的第三神经网络。如此,提高了第三神经网络的训练效率。It can be understood that, firstly extract the uniform grid images with nodules in the multiple second images to obtain multiple fourth images, that is, only extract the nodular units. Then, data enhancement is performed on each fourth image among the multiple fourth images to obtain multiple fifth images, which can improve the data processing efficiency. Then, according to the nodule marking information included in each marked image in the multiple marked images, the reference nodule type corresponding to each fifth image in the multiple fifth images is obtained. Finally, according to the reference nodule type corresponding to each fifth image in the multiple fifth images and the multiple fifth images, the second initial neural network is trained to obtain the second network parameters of the third neural network, and the second initial neural network The network is the third neural network with no network parameters defined. In this way, the training efficiency of the third neural network is improved.
需要说明的是,第三神经网络的训练图像可以是与第一神经网络的训练图像不同的一批图像,其训练之前的处理方法可参照第一神经网络的训练图像的方法,在此不在赘述。It should be noted that the training image of the third neural network can be a batch of images different from the training image of the first neural network, and the processing method before training can refer to the method of the training image of the first neural network, which will not be repeated here. .
S105:将所述第二类别概率图和所述第三类别概率图输入至第四神经网络,以得到所述目标肺部扫描图像对应的目标患者的肺癌患病概率。S105: Input the second category probability map and the third category probability map to a fourth neural network to obtain the lung cancer probability of the target patient corresponding to the target lung scan image.
在本申请中,第四神经网络用于对所述第二类别概率图和所述第三类别概率图进行分类。也就是说,对第二神经网络得到的全局识别结节类型和第三神经网络得到的局部识别结节类型进行分类以得到所述目标肺部扫描图像对应的目标患者的肺癌患病概率,即在将第二类别概率图和第三类别概率图输入至该第四神经网络时,可确定该目标肺部扫描图像对应的目标患者患有肺癌的概率。可以理解,通过局部识别结节类型和全局识别结节类型的识别结果确定肺癌患病概率,进一步提高了识别肺癌的准确性。In this application, the fourth neural network is used to classify the second category probability map and the third category probability map. That is to say, classify the globally recognized nodule type obtained by the second neural network and the locally recognized nodule type obtained by the third neural network to obtain the lung cancer probability of the target patient corresponding to the target lung scan image, namely When the second category probability map and the third category probability map are input to the fourth neural network, the probability that the target patient corresponding to the target lung scan image has lung cancer can be determined. It can be understood that the probability of lung cancer is determined by the recognition results of locally recognizing nodule types and global recognizing nodule types, which further improves the accuracy of recognizing lung cancer.
在本申请中,第四神经网络的训练方法可参照第一神经网络的训练方法,其中,参考对象为参考肺癌概率,目标对象为目标肺癌概率。本申请对于第四神经网络的训练参数也不做限定,例如:所有的数据作为一个批次,使用Adam 优化器进行2000次迭代,权重衰减为0.0001。In this application, the training method of the fourth neural network can refer to the training method of the first neural network, wherein the reference object is the reference lung cancer probability, and the target object is the target lung cancer probability. This application also does not limit the training parameters of the fourth neural network. For example, all data is used as a batch, and the Adam optimizer is used for 2000 iterations, and the weight is attenuated to 0.0001.
在一种可能的实施例中,所述将所述第二类别概率图和所述第三类别概率图输入至第四神经网络,以得到所述目标肺部扫描图像对应的目标患者的肺癌患病概率,包括:分别对所述第二类别概率图和所述第三类别概率图进行数据增强以得到目标第二类别概率图和目标第三类别概率图;将所述目标第二类别概率图和所述目标第三类别概率图输入至所述第四神经网络以得到所述肺癌患病概率。In a possible embodiment, the second category probability map and the third category probability map are input to a fourth neural network to obtain the lung cancer patients of the target patient corresponding to the target lung scan image. The disease probability includes: performing data enhancement on the second category probability map and the third category probability map to obtain the target second category probability map and the target third category probability map; and the target second category probability map And the target third category probability map is input to the fourth neural network to obtain the lung cancer probability.
其中,数据增强可进行音量转置增强,也可进行剪裁,还可参照第三神经网络的数据增强操作,在此不做限定。可以理解,通过数据增强操作,提高了图像的清晰度,便于提高第四神经网络的识别效率。Among them, the data enhancement can perform volume transposition enhancement or tailoring, and can also refer to the data enhancement operation of the third neural network, which is not limited here. It can be understood that through the data enhancement operation, the clarity of the image is improved, which facilitates the improvement of the recognition efficiency of the fourth neural network.
在一种可能的实施例中,所述将所述第二类别概率图和所述第三类别概率图输入至第四神经网络以得到所述目标肺部扫描图像对应的目标患者的肺癌患病概率,包括:将所述第二类别概率图和所述第三类别概率图进行特征加权,以得到针对所述多个结节单元中每一结节单元的结节类型的第四类别概率图;将所述第四类别概率图输入至第四神经网络以得到所述肺癌患病概率。In a possible embodiment, the second category probability map and the third category probability map are input to a fourth neural network to obtain the lung cancer disease of the target patient corresponding to the target lung scan image The probability includes: performing feature weighting on the second category probability map and the third category probability map to obtain a fourth category probability map for the nodule type of each nodular unit in the plurality of nodular units ; Input the fourth category probability map to a fourth neural network to obtain the lung cancer probability.
本申请对于第四类别概率图不做限定,可以是一种密度直方图,用于描述各个均匀网格图像的结节类型概率。This application does not limit the probability map of the fourth category, and may be a density histogram to describe the nodule type probability of each uniform grid image.
本申请可根据第二类别概率图和第三类别概率图中的结节的数量、最小值、最大值、平均值、标准偏差和所有最大输出的综合等计算第二神经网络和第三神经网络的权值,然后,分别根据其权值进行特征加权。This application can calculate the second neural network and the third neural network based on the number, minimum, maximum, average, standard deviation, and integration of all maximum outputs in the second category probability map and the third category probability map Then, the feature weights are performed according to their weights.
可以理解,先对局部和全局确定目标肺部扫描图像的结节类型的识别结果进行特征加权以得到第四类别概率图,然后针对第四类别概率图中各个结节的结节类型确定肺癌患病概率,提高了识别肺癌的准确性。It can be understood that firstly, the recognition results of the nodule types in the locally and globally determined target lung scan images are feature-weighted to obtain the fourth category probability map, and then the lung cancer patients are determined for the nodule types of each nodule in the fourth category probability map. The disease probability improves the accuracy of identifying lung cancer.
在如图1所示的图像识别方法中,先识别肺部扫描图像的结节图像,再通过局部识别的结节类型和全局识别的结节类型确定肺癌患病概率,提高了肺癌病灶部位的图像识别的准确率。In the image recognition method shown in Figure 1, the nodule image of the lung scan image is first recognized, and then the locally recognized nodule type and the globally recognized nodule type are used to determine the probability of lung cancer, which improves the location of the lung cancer lesion. The accuracy of image recognition.
与图1的实施例一致,请参照图2,图2是本申请实施例提供的一种图像识别装置的结构示意图,所述装置应用于电子设备。如图2所示,上述图像识别装置200包括:Consistent with the embodiment in FIG. 1, please refer to FIG. 2. FIG. 2 is a schematic structural diagram of an image recognition device provided by an embodiment of the present application, and the device is applied to electronic equipment. As shown in FIG. 2, the above-mentioned image recognition device 200 includes:
第一处理单元201,用于将目标肺部扫描图像输入至第一神经网络,以得到针对有结节和无结节的第一类别概率图,所述第一神经网络用于识别所述目标肺部扫描图像中的结节图像;The first processing unit 201 is configured to input a scanned image of the target lung to a first neural network to obtain a first category probability map for nodules and no nodules, and the first neural network is used to identify the target Nodules in the lung scan image;
第二处理单元202,用于将所述第一类别概率图输入至第二神经网络,以得到针对良性结节、恶性结节和无结节的第二类别概率图,所述第二神经网络用于识别所述第一类别概率图中的结节图像的结节类型;The second processing unit 202 is configured to input the first category probability map to a second neural network to obtain a second category probability map for benign nodules, malignant nodules and no nodules, the second neural network Used to identify the nodule type of the nodule image in the first category probability map;
第三处理单元203,用于根据所述第一类别概率图提取所述目标肺部扫描 图像中的结节单元,以得到多个结节单元;分别将所述多个结节单元中每一结节单元输入至第三神经网络,以得到针对所述多个结节单元中每一结节单元的结节类型的第三类别概率图,所述结节类型包括良性结节和恶性结节,所述第三神经网络用于分别识别所述多个结节单元中每一结节单元的结节类型;The third processing unit 203 is configured to extract a nodule unit in the target lung scan image according to the first category probability map to obtain a plurality of nodule units; each of the multiple nodule units The nodule unit is input to the third neural network to obtain a third category probability map for the nodule type of each nodule unit in the plurality of nodule units, and the nodule types include benign nodules and malignant nodules , The third neural network is used to identify the nodule type of each nodular unit in the plurality of nodular units;
第四处理单元204,用于将所述第二类别概率图和所述第三类别概率图输入至第四神经网络,以得到所述目标肺部扫描图像对应的目标患者的肺癌患病概率,所述第四神经网络用于对所述第二类别概率图和所述第三类别概率图进行分类。The fourth processing unit 204 is configured to input the second category probability map and the third category probability map to a fourth neural network to obtain the lung cancer probability of the target patient corresponding to the target lung scan image, The fourth neural network is used to classify the second category probability map and the third category probability map.
可以理解,该图像识别装置先识别肺部扫描图像的结节图像,再通过局部识别的结节类型和全局识别的结节类型确定肺癌患病概率,提高了肺癌病灶部位的图像识别的准确率。It can be understood that the image recognition device first recognizes the nodule image of the lung scan image, and then determines the probability of lung cancer through the locally recognized nodule types and the globally recognized nodule types, which improves the accuracy of image recognition of lung cancer lesions. .
在一个可能的示例中,所述装置200还包括:In a possible example, the device 200 further includes:
预处理单元205,用于获取待识别的多张肺部扫描图像;对所述多张肺部扫描图像中每一肺部扫描图像进行形态学去噪,以得到多张第一处理图像;对所述多张第一处理图像中每一第一处理图像进行像素归一化处理,以得到多张第二处理图像;根据所述多张肺部扫描图像的扫描序列和预设尺寸,对所述多张第二处理图像进行立体堆叠,以得到所述目标肺部扫描图像。The preprocessing unit 205 is configured to obtain multiple lung scan images to be recognized; perform morphological denoising on each lung scan image in the multiple lung scan images to obtain multiple first processed images; Each of the plurality of first processed images is subjected to pixel normalization processing to obtain a plurality of second processed images; according to the scan sequence and preset size of the plurality of lung scan images, all The multiple second processed images are three-dimensionally stacked to obtain the target lung scan image.
在一个可能的示例中,所述预处理单元205,还用于将多张标记图像中每一标记图像进行区域划分,以得到多张第一图像,每一第一图像包括多张均匀网格图像,每一均匀网格图像的大小为第一阈值,每一标记图像包括结节标记信息;从所述多张第一图像中每一第一图像提取第二阈值个所述均匀网格图像,以得到多张第二图像;对所述多张第二图像中每一第二图像进行尺寸处理,以得到多张第三图像,每一第三图像的大小满足第一初始神经网络定义的输入尺寸,所述第一初始神经网络为没有定义网络参数的所述第一神经网络;根据所述多张标记图像中每一标记图像包括的结节标记信息,获取所述多张第三图像中每一第三图像对应的参考结节位置;所述装置200还包括:In a possible example, the preprocessing unit 205 is further configured to divide each marked image in the multiple marked images to obtain multiple first images, and each first image includes multiple uniform grids. Image, the size of each uniform grid image is a first threshold, and each marked image includes nodule marking information; extracting a second threshold of the uniform grid images from each first image in the plurality of first images , In order to obtain multiple second images; the size of each second image in the multiple second images is processed to obtain multiple third images, the size of each third image meets the definition of the first initial neural network Input size, the first initial neural network is the first neural network without defined network parameters; according to the nodule marking information included in each marked image in the multiple marked images, the multiple third images are acquired The position of the reference nodule corresponding to each third image in the image; the device 200 further includes:
训练单元206,用于根据所述多张第三图像和所述多张第三图像中每一第三图像对应的参考结节位置,对所述第一初始神经网络进行训练,以得到所述第一神经网络的第一网络参数;根据所述第一初始神经网络和所述第一网络参数获取所述第一神经网络。The training unit 206 is configured to train the first initial neural network according to the multiple third images and the reference nodule position corresponding to each third image in the multiple third images to obtain the The first network parameter of the first neural network; the first neural network is obtained according to the first initial neural network and the first network parameter.
在一个可能的示例中,在所述对所述多张第二图像中每一第二图像进行尺寸处理,以得到多张第三图像方面,所述预处理单元205具体用于提取所述多张第二图像中存在结节的均匀网格图像,以得到多张第四图像;对所述多张第二图像中每一第二图像的第四图像进行复制处理,以得到所述多张第三图像。In a possible example, in the aspect of performing size processing on each second image of the plurality of second images to obtain a plurality of third images, the preprocessing unit 205 is specifically configured to extract the plurality of third images. There are uniform grid images of nodules in the second images to obtain multiple fourth images; copy processing is performed on the fourth image of each second image in the multiple second images to obtain the multiple The third image.
在一个可能的示例中,若所述多张第二图像包括目标第二图像,所述目标第二图像对应多张目标第二均匀网络图像,则所述预处理单元205具体用于将 所述多张目标第二均匀网格图像进行划分,以得到多个均匀网格图像集;对所述多张均匀网络图像集中每一均匀网络图像集对应的结节概率进行叠加运算,以得到多个叠加值;对所述多张均匀网络图像集中每一均匀网络图像集对应的叠加值进行平均运算,以得到多个平均值;提取所述多个平均值中的平均值大于第三阈值对应的均匀网格图像集中的均匀网格图像,以得到所述多张第四图像。In a possible example, if the multiple second images include a target second image, and the target second image corresponds to multiple target second uniform network images, the preprocessing unit 205 is specifically configured to use the The multiple target second uniform grid images are divided to obtain multiple uniform grid image sets; the nodule probability corresponding to each uniform network image set in the multiple uniform network image sets is superimposed to obtain multiple Superimposed value; averaging the superimposed value corresponding to each uniform network image set in the plurality of uniform network image sets to obtain multiple average values; extracting the average value of the multiple average values greater than the third threshold A uniform grid image in a uniform grid image set to obtain the plurality of fourth images.
在一个可能的示例中,所述预处理单元205还用于对所述多张第四图像中每一第四图像进行数据增强,以得到多张第五图像;根据所述多张标记图像中每一标记图像包括的结节标记信息,获取所述多张第五图像中每一第五图像对应的参考结节类型;根据所述多张第五图像和所述多张第五图像中每一第五图像对应的参考结节类型,对第二初始神经网络进行训练,以得到所述第三神经网络的第二网络参数,所述第二初始神经网络为没有定义网络参数的所述第三神经网络。In a possible example, the preprocessing unit 205 is further configured to perform data enhancement on each fourth image in the multiple fourth images to obtain multiple fifth images; according to the multiple labeled images The nodule marking information included in each marked image is used to obtain the reference nodule type corresponding to each fifth image in the plurality of fifth images; according to the plurality of fifth images and each of the plurality of fifth images For a reference nodule type corresponding to a fifth image, a second initial neural network is trained to obtain the second network parameters of the third neural network, and the second initial neural network is the first network parameter without defined network parameters. Three neural networks.
在一个可能的示例中,所述第一图像集中每一第一图像的标记信息还包括目标结节类型,所述训练单元还用于根据所述多张标记图像中每一标记图像包括的结节标记信息,获取所述多张第五图像中每一第五图像对应的参考结节类型;根据所述多张第五图像和所述多张第五图像中每一第五图像对应的参考结节类型,对第二初始神经网络进行训练,以得到所述第三神经网络的第二网络参数。In a possible example, the label information of each first image in the first image set further includes a target nodule type, and the training unit is further configured to determine the type of nodule included in each of the multiple labeled images. Section mark information to obtain the reference nodule type corresponding to each fifth image in the multiple fifth images; according to the multiple fifth images and the reference corresponding to each fifth image in the multiple fifth images Nodules type, training the second initial neural network to obtain the second network parameters of the third neural network.
在一个可能的示例中,若所述多张第四图像包括目标第四图像,所述预处理单元具体用于按照第一角度,对所述目标第四图像对应的掩膜进行旋转处理,以得到第一子处理图像;对所述第一子处理图像进行减去平均值处理,以得到第二子处理图像;按照第一倍数,对所述第二子处理图像对应的掩膜的宽度进行尺寸处理,以得到第三子处理图像;按照第二倍数,对所述第三子处理图像对应的掩膜的长度进行尺寸处理,以得到第四子处理图像;按照第三倍数,对所述第四子处理图像进行尺寸处理,以得到第五子处理图像;按照第二角度,对所述第六子处理图像的掩膜进行镜像翻转处理,以得到所述目标第四处理图像对应的第五图像。In a possible example, if the multiple fourth images include a target fourth image, the preprocessing unit is specifically configured to perform rotation processing on the mask corresponding to the target fourth image according to the first angle, so as to Obtain the first sub-processed image; subtract the average value from the first sub-processed image to obtain the second sub-processed image; perform the width of the mask corresponding to the second sub-processed image according to the first multiple Size processing to obtain the third sub-processed image; according to the second multiple, size processing is performed on the length of the mask corresponding to the third sub-processed image to obtain the fourth sub-processed image; according to the third multiple, the The fourth sub-processed image is subjected to size processing to obtain the fifth sub-processed image; according to the second angle, the mask of the sixth sub-processed image is mirrored and inverted to obtain the first corresponding to the target fourth processed image Five images.
在一个可能的示例中,所述第四处理单元204具体用于将所述第二类别概率图和所述第三类别概率图进行特征加权,以得到针对所述多个结节单元中每一结节单元的结节类型的第四类别概率图;将所述第四类别概率图输入至第四神经网络,以得到所述肺癌患病概率。In a possible example, the fourth processing unit 204 is specifically configured to perform feature weighting on the probability map of the second category and the probability map of the third category, so as to obtain a reference to each of the multiple nodule units. The fourth category probability map of the nodule type of the nodular unit; the fourth category probability map is input to the fourth neural network to obtain the lung cancer probability.
与图1的实施例一致,请参照图3,图3是本申请实施例提供的一种电子设备的结构示意图。如图3所示,该电子设备300包括处理器310、存储器320、通信接口330以及一个或多个程序340,其中,上述一个或多个程序340被存储在上述存储器320中,并且被配置由上述处理器310执行,上述程序340 包括用于执行以下步骤的指令:Consistent with the embodiment of FIG. 1, please refer to FIG. 3. FIG. 3 is a schematic structural diagram of an electronic device provided by an embodiment of the present application. As shown in FIG. 3, the electronic device 300 includes a processor 310, a memory 320, a communication interface 330, and one or more programs 340. The one or more programs 340 are stored in the memory 320 and are configured by The foregoing processor 310 executes, and the foregoing program 340 includes instructions for executing the following steps:
将目标肺部扫描图像输入至第一神经网络,以得到针对有结节和无结节的第一类别概率图,所述第一神经网络用于识别所述目标肺部扫描图像中的结节图像;Input the target lung scan image to a first neural network to obtain a first category probability map for nodules and no nodules, and the first neural network is used to identify nodules in the target lung scan image image;
将所述第一类别概率图输入至第二神经网络,以得到针对良性结节、恶性结节和无结节的第二类别概率图,所述第二神经网络用于识别所述第一类别概率图中的结节图像的结节类型;Input the first category probability map to a second neural network to obtain a second category probability map for benign nodules, malignant nodules and no nodules, and the second neural network is used to identify the first category The nodule type of the nodule image in the probability map;
根据所述第一类别概率图提取所述目标肺部扫描图像中的结节单元,以得到多个结节单元;Extracting nodular units in the scan image of the target lung according to the first category probability map to obtain multiple nodular units;
分别将所述多个结节单元中每一结节单元输入至第三神经网络,以得到针对所述多个结节单元中每一结节单元的结节类型的第三类别概率图,所述结节类型包括良性结节和恶性结节,所述第三神经网络用于分别识别所述多个结节单元中每一结节单元的结节类型;Each nodular unit in the multiple nodular units is input to a third neural network to obtain a third category probability map for the nodule type of each nodular unit in the multiple nodular units, so The nodule types include benign nodules and malignant nodules, and the third neural network is used to identify the nodule type of each nodular unit of the plurality of nodular units;
将所述第二类别概率图和所述第三类别概率图输入至第四神经网络,以得到所述目标肺部扫描图像对应的目标患者的肺癌患病概率,所述第四神经网络用于对所述第二类别概率图和所述第三类别概率图进行分类。The second category probability map and the third category probability map are input to a fourth neural network to obtain the lung cancer probability of the target patient corresponding to the target lung scan image, and the fourth neural network is used for Classify the second category probability map and the third category probability map.
可以理解,该电子设备先识别肺部扫描图像的结节图像,再通过局部识别的结节类型和全局识别的结节类型确定肺癌患病概率,提高了肺癌病灶部位的图像识别的准确率。It can be understood that the electronic device first recognizes the nodule image of the lung scan image, and then determines the probability of lung cancer through the locally recognized nodule type and the globally recognized nodule type, which improves the accuracy of image recognition of lung cancer lesions.
在一个可能的示例中,所述程序340还用于执行以下步骤的指令:In a possible example, the program 340 is also used to execute the instructions of the following steps:
获取待识别的多张肺部扫描图像;Acquire multiple lung scan images to be recognized;
对所述多张肺部扫描图像中每一肺部扫描图像进行形态学去噪,以得到多张第一处理图像;Performing morphological denoising on each lung scan image in the plurality of lung scan images to obtain a plurality of first processed images;
对所述多张第一处理图像中每一第一处理图像进行像素归一化处理,以得到多张第二处理图像;Performing pixel normalization processing on each first processed image in the plurality of first processed images to obtain a plurality of second processed images;
根据所述多张肺部扫描图像的扫描序列和预设尺寸,对所述多张第二处理图像进行立体堆叠,以得到所述目标肺部扫描图像。According to the scanning sequence and the preset size of the multiple lung scan images, the multiple second processed images are stereoscopically stacked to obtain the target lung scan image.
在一个可能的示例中,所述程序340还用于执行以下步骤的指令:In a possible example, the program 340 is also used to execute the instructions of the following steps:
将多张标记图像中每一标记图像进行区域划分,以得到多张第一图像,每一第一图像包括多张均匀网格图像,每一均匀网格图像的大小为第一阈值,每一标记图像包括结节标记信息;Each marker image in the multiple marker images is divided into regions to obtain multiple first images. Each first image includes multiple uniform grid images. The size of each uniform grid image is the first threshold. The marked image includes nodule marking information;
从所述多张第一图像中每一第一图像提取第二阈值个所述均匀网格图像,以得到多张第二图像;Extracting a second threshold number of the uniform grid images from each of the plurality of first images to obtain a plurality of second images;
对所述多张第二图像中每一第二图像进行尺寸处理,以得到多张第三图像,每一第三图像的大小满足第一初始神经网络定义的输入尺寸,所述第一初始神经网络为没有定义网络参数的所述第一神经网络;Perform size processing on each second image in the plurality of second images to obtain a plurality of third images, and the size of each third image meets the input size defined by the first initial neural network, and the first initial neural network The network is the first neural network without defining network parameters;
根据所述多张标记图像中每一标记图像包括的结节标记信息,获取所述多张第三图像中每一第三图像对应的参考结节位置;Acquiring a reference nodule position corresponding to each third image in the plurality of third images according to nodule marking information included in each of the plurality of marked images;
根据所述多张第三图像和所述多张第三图像中每一第三图像对应的参考结节位置,对所述第一初始神经网络进行训练,以得到所述第一神经网络的第一网络参数;According to the multiple third images and the reference nodule position corresponding to each third image in the multiple third images, the first initial neural network is trained to obtain the first neural network A network parameter;
根据所述第一初始神经网络和所述第一网络参数获取所述第一神经网络。Acquiring the first neural network according to the first initial neural network and the first network parameters.
在一个可能的示例中,所述程序340具体用于执行以下步骤的指令:In a possible example, the program 340 is specifically used to execute instructions of the following steps:
提取所述多张第二图像中存在结节的均匀网格图像,以得到多张第四图像;Extracting uniform grid images with nodules in the plurality of second images to obtain a plurality of fourth images;
对所述多张第二图像中每一第二图像的第四图像进行复制处理,以得到所述多张第三图像。Copy processing is performed on the fourth image of each second image in the plurality of second images to obtain the plurality of third images.
在一个可能的示例中,若所述多张第二图像包括目标第二图像,所述目标第二图像对应多张目标第二均匀网络图像,则所述程序340具体用于执行以下步骤的指令:In a possible example, if the multiple second images include a target second image, and the target second image corresponds to multiple target second uniform network images, the program 340 is specifically configured to execute the instructions of the following steps :
将所述多张目标第二均匀网格图像进行划分,以得到多个均匀网格图像集;Dividing the multiple target second uniform grid images to obtain multiple uniform grid image sets;
对所述多张均匀网络图像集中每一均匀网络图像集对应的结节概率进行叠加运算,以得到多个叠加值;Performing a superposition operation on the probabilities of nodules corresponding to each uniform network image set in the plurality of uniform network image sets to obtain a plurality of superposition values;
对所述多张均匀网络图像集中每一均匀网络图像集对应的叠加值进行平均运算,以得到多个平均值;Performing an averaging operation on the superimposed values corresponding to each uniform network image set in the plurality of uniform network image sets to obtain multiple average values;
提取所述多个平均值中的平均值大于第三阈值对应的均匀网格图像集中的均匀网格图像,以得到所述多张第四图像。Extracting the uniform grid image in the uniform grid image set corresponding to the average value of the plurality of average values greater than the third threshold to obtain the plurality of fourth images.
在一个可能的示例中,所述程序340还用于执行以下步骤的指令:In a possible example, the program 340 is also used to execute the instructions of the following steps:
对所述多张第四图像中每一第四图像进行数据增强,以得到多张第五图像;Performing data enhancement on each fourth image in the plurality of fourth images to obtain a plurality of fifth images;
根据所述多张标记图像中每一标记图像包括的结节标记信息,获取所述多张第五图像中每一第五图像对应的参考结节类型;Obtaining a reference nodule type corresponding to each fifth image in the plurality of fifth images according to the nodule marking information included in each of the plurality of marked images;
根据所述多张第五图像和所述多张第五图像中每一第五图像对应的参考结节类型,对第二初始神经网络进行训练,以得到所述第三神经网络的第二网络参数,所述第二初始神经网络为没有定义网络参数的所述第三神经网络。According to the multiple fifth images and the reference nodule type corresponding to each fifth image in the multiple fifth images, the second initial neural network is trained to obtain the second network of the third neural network Parameters, the second initial neural network is the third neural network without defining network parameters.
在一个可能的示例中,所述第一图像集中每一第一图像的标记信息还包括目标结节类型,所述程序340还用于执行以下步骤的指令:In a possible example, the marking information of each first image in the first image set further includes the target nodule type, and the program 340 is further used to execute instructions of the following steps:
提取所述多张第二图像中存在结节的均匀网格图像,以得到多张第四图像;Extracting uniform grid images with nodules in the plurality of second images to obtain a plurality of fourth images;
对所述多张第二图像中每一第二图像的第四图像进行复制处理,以得到所述多张第三图像。Copy processing is performed on the fourth image of each second image in the plurality of second images to obtain the plurality of third images.
在一个可能的示例中,若所述多张第四图像包括目标第四图像,则所述程序340还用于执行以下步骤的指令:In a possible example, if the multiple fourth images include the target fourth image, the program 340 is further used to execute the instructions of the following steps:
按照第一角度,对所述目标第四图像对应的掩膜进行旋转处理,以得到第一子处理图像;Performing rotation processing on the mask corresponding to the target fourth image according to the first angle to obtain the first sub-processed image;
对所述第一子处理图像进行减去平均值处理,以得到第二子处理图像;Performing average value subtraction processing on the first sub-processed image to obtain a second sub-processed image;
按照第一倍数,对所述第二子处理图像对应的掩膜的宽度进行尺寸处理,以得到第三子处理图像;According to the first multiple, size processing is performed on the width of the mask corresponding to the second sub-processed image to obtain a third sub-processed image;
按照第二倍数,对所述第三子处理图像对应的掩膜的长度进行尺寸处理,以得到第四子处理图像;According to the second multiple, size processing is performed on the length of the mask corresponding to the third sub-processed image to obtain a fourth sub-processed image;
按照第三倍数,对所述第四子处理图像进行尺寸处理,以得到第五子处理图像;Performing size processing on the fourth sub-processed image according to the third multiple to obtain a fifth sub-processed image;
按照第二角度,对所述第六子处理图像的掩膜进行镜像翻转处理,以得到所述目标第四处理图像对应的第五图像。According to the second angle, the mask of the sixth sub-processed image is mirrored and reversed to obtain the fifth image corresponding to the target fourth processed image.
在一个可能的示例中,所述程序340具体用于执行以下步骤的指令:In a possible example, the program 340 is specifically used to execute instructions of the following steps:
将所述第二类别概率图和所述第三类别概率图进行特征加权,以得到针对所述多个结节单元中每一结节单元的结节类型的第四类别概率图;Performing feature weighting on the second category probability map and the third category probability map to obtain a fourth category probability map for the nodule type of each nodular unit in the plurality of nodular units;
将所述第四类别概率图输入至第四神经网络,以得到所述肺癌患病概率。The fourth category probability map is input to a fourth neural network to obtain the lung cancer probability.
本申请实施例还提供一种计算机可读存储介质,其中,该计算机可读存储介质存储用于存储计算机程序,该计算机程序使得计算机执行如方法实施例中记载的任一方法的部分或全部步骤,计算机包括电子设备。The embodiments of the present application also provide a computer-readable storage medium, where the computer-readable storage medium stores a computer program for storing a computer program that enables a computer to execute part or all of the steps of any method as recorded in the method embodiment , Computers include electronic equipment.
本申请实施例还提供一种计算机程序产品,计算机程序产品包括存储了计算机程序的非瞬时性计算机可读存储介质,计算机程序可操作来使计算机执行如方法实施例中记载的任一方法的部分或全部步骤。该计算机程序产品可以为一个软件安装包,计算机包括电子设备。The embodiments of the present application also provide a computer program product. The computer program product includes a non-transitory computer-readable storage medium storing a computer program. The computer program is operable to make a computer execute a part of any method described in the method embodiment. Or all steps. The computer program product may be a software installation package, and the computer includes electronic equipment.
在上述实施例中,对各个实施例的描述都各有侧重,某个实施例中没有详述的部分,可以参见其他实施例的相关描述。In the above-mentioned embodiments, the description of each embodiment has its own focus. For parts that are not described in detail in an embodiment, reference may be made to related descriptions of other embodiments.
本领域技术人员应该可以意识到,在上述一个或多个示例中,本申请所描述的功能可以用硬件、软件、固件或它们的任意组合来实现。当使用软件实现时,可以将这些功能存储在计算机可读介质中或者作为计算机可读介质上的一个或多个指令或代码进行传输。计算机可读介质包括计算机存储介质和通信介质,其中通信介质包括便于从一个地方向另一个地方传送计算机程序的任何介质。存储介质可以是通用或专用计算机能够存取的任何可用介质。Those skilled in the art should be aware that in one or more of the above examples, the functions described in this application can be implemented by hardware, software, firmware or any combination thereof. When implemented by software, these functions can be stored in a computer-readable medium or transmitted as one or more instructions or codes on the computer-readable medium. The computer-readable medium includes a computer storage medium and a communication medium, where the communication medium includes any medium that facilitates the transfer of a computer program from one place to another. The storage medium may be any available medium that can be accessed by a general-purpose or special-purpose computer.
以上所述的具体实施方式,对本申请的目的、技术方案和有益效果进行了进一步详细说明,所应理解的是,以上所述仅为本申请的具体实施方式而已,并不用于限定本申请的保护范围,凡在本申请的技术方案的基础之上,所做的任何修改、等同替换、改进等,均应包括在本申请的保护范围之内。The specific implementations described above further describe the purpose, technical solutions and beneficial effects of this application in detail. It should be understood that the above are only specific implementations of this application and are not intended to limit the scope of this application. The protection scope, any modification, equivalent replacement, improvement, etc. made on the basis of the technical solution of this application shall be included in the protection scope of this application.

Claims (20)

  1. 一种图像识别方法,其特征在于,包括:An image recognition method, characterized in that it comprises:
    将目标肺部扫描图像输入至第一神经网络,以得到针对有结节和无结节的第一类别概率图,所述第一神经网络用于识别所述目标肺部扫描图像中的结节图像;Input the target lung scan image to a first neural network to obtain a first category probability map for nodules and no nodules, and the first neural network is used to identify nodules in the target lung scan image image;
    将所述第一类别概率图输入至第二神经网络,以得到针对良性结节、恶性结节和无结节的第二类别概率图,所述第二神经网络用于识别所述第一类别概率图中的结节图像的结节类型;Input the first category probability map to a second neural network to obtain a second category probability map for benign nodules, malignant nodules and no nodules, and the second neural network is used to identify the first category The nodule type of the nodule image in the probability map;
    根据所述第一类别概率图提取所述目标肺部扫描图像中的结节单元,以得到多个结节单元;Extracting nodular units in the scan image of the target lung according to the first category probability map to obtain multiple nodular units;
    分别将所述多个结节单元中每一结节单元输入至第三神经网络,以得到针对所述多个结节单元中每一结节单元的结节类型的第三类别概率图,所述结节类型包括良性结节和恶性结节,所述第三神经网络用于分别识别所述多个结节单元中每一结节单元的结节类型;Each nodular unit in the multiple nodular units is input to a third neural network to obtain a third category probability map for the nodule type of each nodular unit in the multiple nodular units, so The nodule types include benign nodules and malignant nodules, and the third neural network is used to identify the nodule type of each nodular unit of the plurality of nodular units;
    将所述第二类别概率图和所述第三类别概率图输入至第四神经网络,以得到所述目标肺部扫描图像对应的目标患者的肺癌患病概率,所述第四神经网络用于对所述第二类别概率图和所述第三类别概率图进行分类。The second category probability map and the third category probability map are input to a fourth neural network to obtain the lung cancer probability of the target patient corresponding to the target lung scan image, and the fourth neural network is used for Classify the second category probability map and the third category probability map.
  2. 根据权利要求1所述的方法,其特征在于,在所述将目标肺部扫描图像输入至第一神经网络,以得到针对有结节和无结节的第一类别概率图之前,所述方法还包括:The method according to claim 1, characterized in that, before inputting the target lung scan image to the first neural network to obtain the first category probability map for nodules and no nodules, the method Also includes:
    获取待识别的多张肺部扫描图像;Acquire multiple lung scan images to be recognized;
    对所述多张肺部扫描图像中每一肺部扫描图像进行形态学去噪,以得到多张第一处理图像;Performing morphological denoising on each lung scan image in the plurality of lung scan images to obtain a plurality of first processed images;
    对所述多张第一处理图像中每一第一处理图像进行像素归一化处理,以得到多张第二处理图像;Performing pixel normalization processing on each first processed image in the plurality of first processed images to obtain a plurality of second processed images;
    根据所述多张肺部扫描图像的扫描序列和预设尺寸,对所述多张第二处理图像进行立体堆叠,以得到所述目标肺部扫描图像。According to the scanning sequence and the preset size of the multiple lung scan images, the multiple second processed images are stereoscopically stacked to obtain the target lung scan image.
  3. 根据权利要求1所述的方法,其特征在于,在所述将目标肺部扫描图像输入至第一神经网络,以得到针对有结节和无结节的第一类别概率图之前,所述方法还包括:The method according to claim 1, characterized in that, before inputting the target lung scan image to the first neural network to obtain the first category probability map for nodules and no nodules, the method Also includes:
    将多张标记图像中每一标记图像进行区域划分,以得到多张第一图像,每一第一图像包括多张均匀网格图像,每一均匀网格图像的大小为第一阈值,每一标记图像包括结节标记信息;Each marker image in the multiple marker images is divided into regions to obtain multiple first images. Each first image includes multiple uniform grid images. The size of each uniform grid image is the first threshold. The marked image includes nodule marking information;
    从所述多张第一图像中每一第一图像提取第二阈值个所述均匀网格图像,以得到多张第二图像;Extracting a second threshold number of the uniform grid images from each of the plurality of first images to obtain a plurality of second images;
    对所述多张第二图像中每一第二图像进行尺寸处理,以得到多张第三图 像,每一第三图像的大小满足第一初始神经网络定义的输入尺寸,所述第一初始神经网络为没有定义网络参数的所述第一神经网络;Perform size processing on each second image in the plurality of second images to obtain a plurality of third images, and the size of each third image meets the input size defined by the first initial neural network, and the first initial neural network The network is the first neural network without defining network parameters;
    根据所述多张标记图像中每一标记图像包括的结节标记信息,获取所述多张第三图像中每一第三图像对应的参考结节位置;Acquiring a reference nodule position corresponding to each third image in the plurality of third images according to nodule marking information included in each of the plurality of marked images;
    根据所述多张第三图像和所述多张第三图像中每一第三图像对应的参考结节位置,对所述第一初始神经网络进行训练,以得到所述第一神经网络的第一网络参数;According to the multiple third images and the reference nodule position corresponding to each third image in the multiple third images, the first initial neural network is trained to obtain the first neural network A network parameter;
    根据所述第一初始神经网络和所述第一网络参数获取所述第一神经网络。Acquiring the first neural network according to the first initial neural network and the first network parameters.
  4. 根据权利要求3所述的方法,其特征在于,所述对所述多张第二图像中每一第二图像进行尺寸处理,以得到多张第三图像,包括:The method according to claim 3, wherein the performing size processing on each second image in the plurality of second images to obtain a plurality of third images comprises:
    提取所述多张第二图像中存在结节的均匀网格图像,以得到多张第四图像;Extracting uniform grid images with nodules in the plurality of second images to obtain a plurality of fourth images;
    对所述多张第二图像中每一第二图像的第四图像进行复制处理,以得到所述多张第三图像。Copy processing is performed on the fourth image of each second image in the plurality of second images to obtain the plurality of third images.
  5. 根据权利要求4所述的方法,其特征在于,若所述多张第二图像包括目标第二图像,所述目标第二图像对应多张目标第二均匀网络图像,则所述提取所述多张第二图像中存在结节的均匀网格图像,以得到多张第四图像,包括:The method according to claim 4, wherein if the plurality of second images includes a target second image, and the target second image corresponds to a plurality of target second uniform network images, the extracting the plurality of second images A uniform grid image with nodules in the second image to obtain multiple fourth images, including:
    将所述多张目标第二均匀网格图像进行划分,以得到多个均匀网格图像集;Dividing the multiple target second uniform grid images to obtain multiple uniform grid image sets;
    对所述多张均匀网络图像集中每一均匀网络图像集对应的结节概率进行叠加运算,以得到多个叠加值;Performing a superposition operation on the probabilities of nodules corresponding to each uniform network image set in the plurality of uniform network image sets to obtain a plurality of superposition values;
    对所述多张均匀网络图像集中每一均匀网络图像集对应的叠加值进行平均运算,以得到多个平均值;Performing an averaging operation on the superimposed values corresponding to each uniform network image set in the plurality of uniform network image sets to obtain multiple average values;
    提取所述多个平均值中的平均值大于第三阈值对应的均匀网格图像集中的均匀网格图像,以得到所述多张第四图像。Extracting the uniform grid image in the uniform grid image set corresponding to the average value of the plurality of average values greater than the third threshold to obtain the plurality of fourth images.
  6. 根据权利要求4所述的方法,其特征在于,所述方法还包括:The method according to claim 4, wherein the method further comprises:
    对所述多张第四图像中每一第四图像进行数据增强,以得到多张第五图像;Performing data enhancement on each fourth image in the plurality of fourth images to obtain a plurality of fifth images;
    根据所述多张标记图像中每一标记图像包括的结节标记信息,获取所述多张第五图像中每一第五图像对应的参考结节类型;Acquiring the reference nodule type corresponding to each fifth image in the plurality of fifth images according to the nodule marking information included in each of the plurality of marked images;
    根据所述多张第五图像和所述多张第五图像中每一第五图像对应的参考结节类型,对第二初始神经网络进行训练,以得到所述第三神经网络的第二网络参数,所述第二初始神经网络为没有定义网络参数的所述第三神经网络。According to the multiple fifth images and the reference nodule type corresponding to each fifth image in the multiple fifth images, the second initial neural network is trained to obtain the second network of the third neural network Parameters, the second initial neural network is the third neural network without defining network parameters.
  7. 根据权利要求6所述的方法,其特征在于,所述第一图像集中每一第一图像的标记信息还包括目标结节类型,在所述对所述多张第四图像中每一第 四图像进行数据增强,以得到多张第五图像之后,所述方法还包括:The method according to claim 6, wherein the marking information of each first image in the first image set further includes a target nodule type, and each fourth image in the pair of fourth images After the image is data-enhanced to obtain multiple fifth images, the method further includes:
    根据所述多张标记图像中每一标记图像包括的结节标记信息,获取所述多张第五图像中每一第五图像对应的参考结节类型;Acquiring the reference nodule type corresponding to each fifth image in the plurality of fifth images according to the nodule marking information included in each of the plurality of marked images;
    根据所述多张第五图像和所述多张第五图像中每一第五图像对应的参考结节类型,对第二初始神经网络进行训练,以得到所述第三神经网络的第二网络参数。According to the multiple fifth images and the reference nodule type corresponding to each fifth image in the multiple fifth images, the second initial neural network is trained to obtain the second network of the third neural network parameter.
  8. 根据权利要求6或7所述的方法,其特征在于,若所述多张第四图像包括目标第四图像,则所述对所述多张第四图像中每一第四图像进行数据增强,以得到多张第五图像,包括:The method according to claim 6 or 7, wherein if the plurality of fourth images include a target fourth image, then performing data enhancement on each fourth image in the plurality of fourth images, To get multiple fifth images, including:
    按照第一角度,对所述目标第四图像对应的掩膜进行旋转处理,以得到第一子处理图像;Performing rotation processing on the mask corresponding to the target fourth image according to the first angle to obtain the first sub-processed image;
    对所述第一子处理图像进行减去平均值处理,以得到第二子处理图像;Performing average value subtraction processing on the first sub-processed image to obtain a second sub-processed image;
    按照第一倍数,对所述第二子处理图像对应的掩膜的宽度进行尺寸处理,以得到第三子处理图像;According to the first multiple, size processing is performed on the width of the mask corresponding to the second sub-processed image to obtain a third sub-processed image;
    按照第二倍数,对所述第三子处理图像对应的掩膜的长度进行尺寸处理,以得到第四子处理图像;According to the second multiple, size processing is performed on the length of the mask corresponding to the third sub-processed image to obtain a fourth sub-processed image;
    按照第三倍数,对所述第四子处理图像进行尺寸处理,以得到第五子处理图像;Performing size processing on the fourth sub-processed image according to the third multiple to obtain a fifth sub-processed image;
    按照第二角度,对所述第六子处理图像的掩膜进行镜像翻转处理,以得到所述目标第四处理图像对应的第五图像。According to the second angle, the mask of the sixth sub-processed image is mirrored and reversed to obtain the fifth image corresponding to the target fourth processed image.
  9. 根据权利要求1-8任一项所述的方法,其特征在于,所述将所述第二类别概率图和所述第三类别概率图输入至第四神经网络,以得到所述目标肺部扫描图像对应的目标患者的肺癌患病概率,包括:The method according to any one of claims 1-8, wherein the second category probability map and the third category probability map are input to a fourth neural network to obtain the target lung The lung cancer probability of the target patient corresponding to the scanned image includes:
    将所述第二类别概率图和所述第三类别概率图进行特征加权,以得到针对所述多个结节单元中每一结节单元的结节类型的第四类别概率图;Performing feature weighting on the second category probability map and the third category probability map to obtain a fourth category probability map for the nodule type of each nodular unit in the plurality of nodular units;
    将所述第四类别概率图输入至第四神经网络,以得到所述肺癌患病概率。The fourth category probability map is input to a fourth neural network to obtain the lung cancer probability.
  10. 一种图像识别装置,其特征在于,包括:An image recognition device, characterized by comprising:
    第一处理单元,用于将目标肺部扫描图像输入至第一神经网络,以得到针对有结节和无结节的第一类别概率图,所述第一神经网络用于识别所述目标肺部扫描图像中的结节图像;The first processing unit is configured to input the scanned image of the target lung to a first neural network to obtain a first category probability map for nodules and no nodules, and the first neural network is used to identify the target lung Nodules in the scanned images;
    第二处理单元,用于将所述第一类别概率图输入至第二神经网络,以得到针对良性结节、恶性结节和无结节的第二类别概率图,所述第二神经网络用于识别所述第一类别概率图中的结节图像的结节类型;The second processing unit is configured to input the first category probability map to a second neural network to obtain a second category probability map for benign nodules, malignant nodules and no nodules, and the second neural network uses To identify the nodule type of the nodule image in the first category probability map;
    第三处理单元,用于根据所述第一类别概率图提取所述目标肺部扫描图像中的结节单元,以得到多个结节单元;分别将所述多个结节单元中每一结节单元输入至第三神经网络,以得到针对所述多个结节单元中每一结节单元的结节 类型的第三类别概率图,所述结节类型包括良性结节和恶性结节,所述第三神经网络用于分别识别所述多个结节单元中每一结节单元的结节类型;The third processing unit is configured to extract nodule units in the scan image of the target lung according to the first category probability map to obtain a plurality of nodule units; respectively, each of the multiple nodule units The nodule unit is input to the third neural network to obtain a third category probability map for the nodule type of each nodule unit in the plurality of nodule units, where the nodule types include benign nodules and malignant nodules, The third neural network is used to identify the nodule type of each nodular unit in the multiple nodular units;
    第四处理单元,用于将所述第二类别概率图和所述第三类别概率图输入至第四神经网络,以得到所述目标肺部扫描图像对应的目标患者的肺癌患病概率,所述第四神经网络用于对所述第二类别概率图和所述第三类别概率图进行分类。The fourth processing unit is used to input the second category probability map and the third category probability map to a fourth neural network to obtain the lung cancer probability of the target patient corresponding to the target lung scan image, so The fourth neural network is used to classify the second category probability map and the third category probability map.
  11. 根据权利要求10所述的装置,其特征在于,所述装置还包括:The device according to claim 10, wherein the device further comprises:
    预处理单元,用于获取待识别的多张肺部扫描图像;对所述多张肺部扫描图像中每一肺部扫描图像进行形态学去噪,以得到多张第一处理图像;对所述多张第一处理图像中每一第一处理图像进行像素归一化处理,以得到多张第二处理图像;根据所述多张肺部扫描图像的扫描序列和预设尺寸,对所述多张第二处理图像进行立体堆叠,以得到所述目标肺部扫描图像。The preprocessing unit is used to obtain multiple lung scan images to be recognized; perform morphological denoising on each lung scan image in the multiple lung scan images to obtain multiple first processed images; Each of the plurality of first processed images is subjected to pixel normalization processing to obtain a plurality of second processed images; according to the scan sequence and preset size of the plurality of lung scan images, the A plurality of second processed images are three-dimensionally stacked to obtain the target lung scan image.
  12. 根据权利要求10所述的装置,其特征在于,所述装置还包括:The device according to claim 10, wherein the device further comprises:
    预处理单元,用于将多张标记图像中每一标记图像进行区域划分,以得到多张第一图像,每一第一图像包括多张均匀网格图像,每一均匀网格图像的大小为第一阈值,每一标记图像包括结节标记信息;从所述多张第一图像中每一第一图像提取第二阈值个所述均匀网格图像,以得到多张第二图像;对所述多张第二图像中每一第二图像进行尺寸处理,以得到多张第三图像,每一第三图像的大小满足第一初始神经网络定义的输入尺寸,所述第一初始神经网络为没有定义网络参数的所述第一神经网络;根据所述多张标记图像中每一标记图像包括的结节标记信息,获取所述多张第三图像中每一第三图像对应的参考结节位置;The pre-processing unit is used to divide each marked image in the multiple marked images to obtain multiple first images, each first image includes multiple uniform grid images, and the size of each uniform grid image is A first threshold, each marked image includes nodule marking information; a second threshold of the uniform grid images is extracted from each first image in the plurality of first images to obtain a plurality of second images; Each of the multiple second images is subjected to size processing to obtain multiple third images, and the size of each third image meets the input size defined by the first initial neural network, and the first initial neural network is The first neural network without defining network parameters; according to the nodule marking information included in each marked image in the multiple marked images, obtain the reference nodule corresponding to each third image in the multiple third images position;
    训练单元,用于根据所述多张第三图像和所述多张第三图像中每一第三图像对应的参考结节位置,对所述第一初始神经网络进行训练,以得到所述第一神经网络的第一网络参数;根据所述第一初始神经网络和所述第一网络参数获取所述第一神经网络。The training unit is configured to train the first initial neural network according to the multiple third images and the reference nodule position corresponding to each third image in the multiple third images to obtain the first A first network parameter of a neural network; obtaining the first neural network according to the first initial neural network and the first network parameters.
  13. 根据权利要求12所述的装置,其特征在于,所述预处理单元具体用于提取所述多张第二图像中存在结节的均匀网格图像,以得到多张第四图像;对所述多张第二图像中每一第二图像的第四图像进行复制处理,以得到所述多张第三图像。The device according to claim 12, wherein the preprocessing unit is specifically configured to extract uniform grid images with nodules in the multiple second images to obtain multiple fourth images; The fourth image of each second image among the plurality of second images is copied to obtain the plurality of third images.
  14. 根据权利要求13所述的装置,其特征在于,若所述多张第二图像包括目标第二图像,所述目标第二图像对应多张目标第二均匀网络图像,则所述预处理单元具体用于将所述多张目标第二均匀网格图像进行划分,以得到多个均匀网格图像集;对所述多张均匀网络图像集中每一均匀网络图像集对应的结节概率进行叠加运算,以得到多个叠加值;对所述多张均匀网络图像集中每一均匀网络图像集对应的叠加值进行平均运算,以得到多个平均值;提取所述多 个平均值中的平均值大于第三阈值对应的均匀网格图像集中的均匀网格图像,以得到所述多张第四图像。The device according to claim 13, wherein if the plurality of second images include a target second image, and the target second image corresponds to a plurality of target second uniform network images, the preprocessing unit specifically Used to divide the multiple target second uniform grid images to obtain multiple uniform grid image sets; superimpose the nodule probability corresponding to each uniform network image set in the multiple uniform network image sets , In order to obtain multiple superimposed values; averaging the superimposed values corresponding to each uniform network image set in the multiple uniform network image sets to obtain multiple average values; extracting the average value of the multiple average values greater than The uniform grid image in the uniform grid image set corresponding to the third threshold is used to obtain the plurality of fourth images.
  15. 根据权利要求13所述的装置,其特征在于,所述预处理单元还用于对所述多张第四图像中每一第四图像进行数据增强,以得到多张第五图像;根据所述多张标记图像中每一标记图像包括的结节标记信息,获取所述多张第五图像中每一第五图像对应的参考结节类型;根据所述多张第五图像和所述多张第五图像中每一第五图像对应的参考结节类型,对第二初始神经网络进行训练,以得到所述第三神经网络的第二网络参数,所述第二初始神经网络为没有定义网络参数的所述第三神经网络。The device according to claim 13, wherein the preprocessing unit is further configured to perform data enhancement on each fourth image in the plurality of fourth images to obtain a plurality of fifth images; The nodule marking information included in each marked image in the plurality of marked images is obtained, and the reference nodule type corresponding to each fifth image in the plurality of fifth images is acquired; according to the plurality of fifth images and the plurality of fifth images For the reference nodule type corresponding to each fifth image in the fifth image, the second initial neural network is trained to obtain the second network parameters of the third neural network, and the second initial neural network is an undefined network Parameters of the third neural network.
  16. 根据权利要求15所述的装置,其特征在于,所述第一图像集中每一第一图像的标记信息还包括目标结节类型,所述训练单元还用于根据所述多张标记图像中每一标记图像包括的结节标记信息,获取所述多张第五图像中每一第五图像对应的参考结节类型;根据所述多张第五图像和所述多张第五图像中每一第五图像对应的参考结节类型,对第二初始神经网络进行训练,以得到所述第三神经网络的第二网络参数。The device according to claim 15, wherein the label information of each first image in the first image set further includes a target nodule type, and the training unit is further configured to perform according to each of the multiple labeled images. The nodule marking information included in a marked image is obtained, and the reference nodule type corresponding to each fifth image in the plurality of fifth images is acquired; according to the plurality of fifth images and each of the plurality of fifth images For the reference nodule type corresponding to the fifth image, the second initial neural network is trained to obtain the second network parameters of the third neural network.
  17. 根据权利要求15或16所述的装置,其特征在于,若所述多张第四图像包括目标第四图像,所述预处理单元具体用于按照第一角度,对所述目标第四图像对应的掩膜进行旋转处理,以得到第一子处理图像;对所述第一子处理图像进行减去平均值处理,以得到第二子处理图像;按照第一倍数,对所述第二子处理图像对应的掩膜的宽度进行尺寸处理,以得到第三子处理图像;按照第二倍数,对所述第三子处理图像对应的掩膜的长度进行尺寸处理,以得到第四子处理图像;按照第三倍数,对所述第四子处理图像进行尺寸处理,以得到第五子处理图像;按照第二角度,对所述第六子处理图像的掩膜进行镜像翻转处理,以得到所述目标第四处理图像对应的第五图像。The device according to claim 15 or 16, wherein if the plurality of fourth images include a target fourth image, the preprocessing unit is specifically configured to correspond to the target fourth image according to a first angle Perform rotation processing on the mask of the first sub-processed image to obtain the first sub-processed image; subtract the average value from the first sub-processed image to obtain the second sub-processed image; perform the second sub-processed image according to the first multiple Performing size processing on the width of the mask corresponding to the image to obtain a third sub-processed image; performing size processing on the length of the mask corresponding to the third sub-processed image according to the second multiple to obtain a fourth sub-processed image; According to the third multiple, the size of the fourth sub-processed image is processed to obtain the fifth sub-processed image; according to the second angle, the mask of the sixth sub-processed image is mirrored and inverted to obtain the The fifth image corresponding to the fourth processed image of the target.
  18. 根据权利要求10-17任一项所述的装置,其特征在于,所述第四处理单元具体用于将所述第二类别概率图和所述第三类别概率图进行特征加权,以得到针对所述多个结节单元中每一结节单元的结节类型的第四类别概率图;将所述第四类别概率图输入至第四神经网络,以得到所述肺癌患病概率。The device according to any one of claims 10-17, wherein the fourth processing unit is specifically configured to perform feature weighting on the second category probability map and the third category probability map to obtain The fourth category probability map of the nodule type of each nodular unit in the plurality of nodular units; the fourth category probability map is input to a fourth neural network to obtain the lung cancer probability.
  19. 一种电子设备,其特征在于,包括处理器、存储器、通信接口以及一个或多个程序,其中,所述一个或多个程序被存储在所述存储器中,并且被配置由所述处理器执行,所述程序包括用于执行权利要求1-9任一项方法中的步骤的指令。An electronic device characterized by comprising a processor, a memory, a communication interface, and one or more programs, wherein the one or more programs are stored in the memory and configured to be executed by the processor The program includes instructions for executing the steps in any one of the methods of claims 1-9.
  20. 一种计算机可读存储介质,其特征在于,其用于存储计算机程序,其中,所述计算机程序使得计算机执行如权利要求1-9任一项所述的方法。A computer-readable storage medium, characterized in that it is used to store a computer program, wherein the computer program causes a computer to execute the method according to any one of claims 1-9.
PCT/CN2019/088825 2019-02-21 2019-05-28 Image recognition method and related device WO2020168647A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201910135802.6 2019-02-21
CN201910135802.6A CN109978004B (en) 2019-02-21 2019-02-21 Image recognition method and related equipment

Publications (1)

Publication Number Publication Date
WO2020168647A1 true WO2020168647A1 (en) 2020-08-27

Family

ID=67077245

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2019/088825 WO2020168647A1 (en) 2019-02-21 2019-05-28 Image recognition method and related device

Country Status (2)

Country Link
CN (1) CN109978004B (en)
WO (1) WO2020168647A1 (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112785562A (en) * 2021-01-13 2021-05-11 北京智拓视界科技有限责任公司 System for evaluating based on neural network model and related products
CN112884706A (en) * 2021-01-13 2021-06-01 北京智拓视界科技有限责任公司 Image evaluation system based on neural network model and related product
CN114283290A (en) * 2021-09-27 2022-04-05 腾讯科技(深圳)有限公司 Training of image processing model, image processing method, device, equipment and medium

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112907533A (en) * 2021-02-10 2021-06-04 武汉精测电子集团股份有限公司 Detection model training method, device, equipment and readable storage medium

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2013154998A1 (en) * 2012-04-09 2013-10-17 Duke University Serum biomarkers and pulmonary nodule size for the early detection of lung cancer
CN108280487A (en) * 2018-02-05 2018-07-13 深圳天琴医疗科技有限公司 A kind of good pernicious determination method and device of tubercle
CN108346154A (en) * 2018-01-30 2018-07-31 浙江大学 The method for building up of Lung neoplasm segmenting device based on Mask-RCNN neural networks
CN108364006A (en) * 2018-01-17 2018-08-03 超凡影像科技股份有限公司 Medical Images Classification device and its construction method based on multi-mode deep learning
CN109035234A (en) * 2018-07-25 2018-12-18 腾讯科技(深圳)有限公司 A kind of nodule detection methods, device and storage medium

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9305351B2 (en) * 2013-06-16 2016-04-05 Larry D. Partain Method of determining the probabilities of suspect nodules being malignant
CN106504232B (en) * 2016-10-14 2019-06-14 北京网医智捷科技有限公司 A kind of pulmonary nodule automatic checkout system based on 3D convolutional neural networks
CN108765369B (en) * 2018-04-20 2023-05-02 平安科技(深圳)有限公司 Method, apparatus, computer device and storage medium for detecting lung nodule
CN109003260B (en) * 2018-06-28 2021-02-09 深圳视见医疗科技有限公司 CT image pulmonary nodule detection method, device and equipment and readable storage medium

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2013154998A1 (en) * 2012-04-09 2013-10-17 Duke University Serum biomarkers and pulmonary nodule size for the early detection of lung cancer
CN108364006A (en) * 2018-01-17 2018-08-03 超凡影像科技股份有限公司 Medical Images Classification device and its construction method based on multi-mode deep learning
CN108346154A (en) * 2018-01-30 2018-07-31 浙江大学 The method for building up of Lung neoplasm segmenting device based on Mask-RCNN neural networks
CN108280487A (en) * 2018-02-05 2018-07-13 深圳天琴医疗科技有限公司 A kind of good pernicious determination method and device of tubercle
CN109035234A (en) * 2018-07-25 2018-12-18 腾讯科技(深圳)有限公司 A kind of nodule detection methods, device and storage medium

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112785562A (en) * 2021-01-13 2021-05-11 北京智拓视界科技有限责任公司 System for evaluating based on neural network model and related products
CN112884706A (en) * 2021-01-13 2021-06-01 北京智拓视界科技有限责任公司 Image evaluation system based on neural network model and related product
CN114283290A (en) * 2021-09-27 2022-04-05 腾讯科技(深圳)有限公司 Training of image processing model, image processing method, device, equipment and medium
CN114283290B (en) * 2021-09-27 2024-05-03 腾讯科技(深圳)有限公司 Training of image processing model, image processing method, device, equipment and medium

Also Published As

Publication number Publication date
CN109978004B (en) 2024-03-29
CN109978004A (en) 2019-07-05

Similar Documents

Publication Publication Date Title
CN109461495B (en) Medical image recognition method, model training method and server
WO2020168647A1 (en) Image recognition method and related device
Gan et al. Toward accurate tooth segmentation from computed tomography images using a hybrid level set model
JP4999163B2 (en) Image processing method, apparatus, and program
WO2020077962A1 (en) Method and device for breast image recognition
JP2019022477A (en) Crowdsourcing and deep learning based segmenting and karyotyping of chromosomes
JP2008520345A (en) Method and system for detecting and classifying lesions in ultrasound images
US11501431B2 (en) Image processing method and apparatus and neural network model training method
WO2021136368A1 (en) Method and apparatus for automatically detecting pectoralis major region in molybdenum target image
Palma et al. Detection of masses and architectural distortions in digital breast tomosynthesis images using fuzzy and a contrario approaches
JP2006325937A (en) Image determination device, image determination method, and program therefor
US10748282B2 (en) Image processing system, apparatus, method and storage medium
US20120099771A1 (en) Computer aided detection of architectural distortion in mammography
JP2006006359A (en) Image generator, image generator method, and its program
US11684333B2 (en) Medical image analyzing system and method thereof
CN114140378A (en) Scanned image processing method, electronic device, and readable medium
CN113160153A (en) Lung nodule screening method and system based on deep learning technology
CN114332132A (en) Image segmentation method and device and computer equipment
JP2016016265A (en) Image processing apparatus, image processing method and medical image diagnostic apparatus
Lee et al. Hybrid airway segmentation using multi-scale tubular structure filters and texture analysis on 3D chest CT scans
CN113888566B (en) Target contour curve determination method and device, electronic equipment and storage medium
WO2022160731A1 (en) Image processing method and apparatus, electronic device, storage medium, and program
CN113168537A (en) Segmentation of deep neural networks
Mortaheb et al. Metal artifact reduction and segmentation of dental computerized tomography images using least square support vector machine and mean shift algorithm
CN115294401B (en) CT image-based classification method, device, equipment and storage medium

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 19916220

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

32PN Ep: public notification in the ep bulletin as address of the adressee cannot be established

Free format text: NOTING OF LOSS OF RIGHTS (EPO FORM 1205A DATED 12.10.2021)