WO2021189910A1 - Image recognition method and apparatus, and electronic device and computer-readable storage medium - Google Patents

Image recognition method and apparatus, and electronic device and computer-readable storage medium Download PDF

Info

Publication number
WO2021189910A1
WO2021189910A1 PCT/CN2020/131990 CN2020131990W WO2021189910A1 WO 2021189910 A1 WO2021189910 A1 WO 2021189910A1 CN 2020131990 W CN2020131990 W CN 2020131990W WO 2021189910 A1 WO2021189910 A1 WO 2021189910A1
Authority
WO
WIPO (PCT)
Prior art keywords
picture set
picture
recognition
initial
standard
Prior art date
Application number
PCT/CN2020/131990
Other languages
French (fr)
Chinese (zh)
Inventor
李楠楠
叶苓
刘新卉
周云舒
黄凌云
Original Assignee
平安科技(深圳)有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 平安科技(深圳)有限公司 filed Critical 平安科技(深圳)有限公司
Publication of WO2021189910A1 publication Critical patent/WO2021189910A1/en

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00Image analysis
    • G06T7/10Segmentation; Edge detection
    • G06T7/11Region-based segmentation
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/21Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
    • G06F18/214Generating training patterns; Bootstrap methods, e.g. bagging or boosting
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/045Combinations of networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/20Image preprocessing
    • G06V10/25Determination of region of interest [ROI] or a volume of interest [VOI]
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/20Special algorithmic details
    • G06T2207/20112Image segmentation details
    • G06T2207/20132Image cropping
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/30Subject of image; Context of image processing
    • G06T2207/30004Biomedical image processing
    • G06T2207/30061Lung
    • G06T2207/30064Lung nodule

Definitions

  • This application relates to the field of artificial intelligence, and in particular to an image recognition method, device, electronic equipment, and computer-readable storage medium.
  • the inventor realizes that the current picture recognition model recognizes pictures globally, and it is easy to ignore local subtle features, resulting in low accuracy of picture recognition.
  • a picture recognition method provided by this application includes:
  • the first recognition model and the second recognition model are used to recognize and determine the result of the picture to be recognized to obtain a recognition result.
  • the present application also provides a picture recognition device, the device includes:
  • the global model generation module is used to obtain an initial picture set, perform global target area cropping conversion and pixel normalization processing on the initial picture set to obtain a first standard picture set;
  • the first deep learning model is trained to obtain the first recognition model;
  • the local model generation module is used to perform local target area cropping conversion, data enhancement, and pixel normalization processing on the initial picture set to obtain a second standard picture set; use the second standard picture set to compare the pre-built second
  • the deep learning model is trained to obtain the second recognition model
  • the picture recognition module is configured to, when the picture to be recognized is received, use the first recognition model and the second recognition model to recognize and determine the result of the picture to be recognized to obtain a recognition result.
  • an electronic device which includes:
  • At least one processor and,
  • a memory communicatively connected with the at least one processor; wherein,
  • the memory stores computer program instructions executable by the at least one processor, and the computer program instructions are executed by the at least one processor, so that the at least one processor can execute the following steps:
  • the first recognition model and the second recognition model are used to recognize and determine the result of the picture to be recognized to obtain a recognition result.
  • the present application also provides a computer-readable storage medium in which a computer program is stored, and when the computer program is executed by a processor, the following steps are implemented:
  • the first recognition model and the second recognition model are used to recognize and determine the result of the picture to be recognized to obtain a recognition result.
  • FIG. 1 is a schematic flowchart of a picture recognition method provided by an embodiment of this application.
  • FIG. 2 is a detailed flowchart of one of the steps in the image recognition method provided in FIG. 1;
  • FIG. 3 is a schematic diagram of modules of a picture recognition device provided by an embodiment of the application.
  • FIG. 4 is a schematic diagram of the internal structure of an electronic device for implementing a picture recognition method provided by an embodiment of the application;
  • This application provides a picture recognition method.
  • FIG. 1 it is a schematic flowchart of a picture recognition method provided by an embodiment of this application.
  • the method can be executed by a device, and the device can be implemented by software and/or hardware.
  • the picture recognition method includes:
  • the initial picture set may be a medical image picture set containing initial tags, such as a patient chest X-ray film set containing tags, wherein the initial tags are preset disease identification tags, such as tuberculosis and tuberculosis. Non-tuberculosis.
  • the embodiment of the present application crops the first region of interest (Region Of Interest, ROI region for short) of each picture in the initial picture set to obtain the first global Picture collection, preferably, in this embodiment of the present application, the first region of interest is the whole lung region.
  • the first region of interest is the whole lung region.
  • the embodiment of the present application fills and interpolates each picture in the first global picture set to a preset size to obtain the second global picture set.
  • the embodiment of the present application interpolating the filling of each picture in the first global picture set to a preset size includes: filling blank pixels in each picture in the first global picture set according to a preset rule to obtain the filling Picture set; picture interpolation is performed on each picture in the filling picture set to a preset size to obtain the second global picture set, wherein the pictures in the filling picture set are the same as the pictures in the second global picture set Have the same aspect ratio, for example: fill and interpolate picture A in the first global picture set to a preset size, the default size is 1024*1024, the size of picture A is 256*240, and picture A is filled with blank pixels It is the smallest square picture B containing picture A, the size of picture B is 256*256, and then picture B is interpolated to obtain picture C with a size of 1024*1024, and the filling and interpolation of picture A is completed.
  • normalization processing is performed on each pixel value in each picture in the second global picture set to obtain the first standard picture set. Further, in this embodiment of the present application, the normalization of each original pixel value in each picture in the second global picture set can be calculated by the following formula:
  • P x represents the original pixel value
  • P g represents the original pixel value after normalization
  • performing global target area cropping conversion and pixel normalization processing on the initial picture set to obtain the first standard picture set includes: cropping each picture in the initial picture set To obtain the first global picture set; fill and interpolate each picture in the first global picture set to a preset size to obtain the second global picture set; for each picture in the second global picture set Each pixel value in a picture is normalized to obtain the first standard picture set.
  • the first deep learning model may be a convolutional neural network model or a residual network model.
  • first standard picture set to train the pre-built first deep learning model in the embodiment of the present application includes:
  • Step A Perform a convolution pooling operation on the first standard picture set according to the preset number of convolution pooling times to obtain a feature set;
  • Step B Use the preset activation function to calculate the predicted value of the feature set, obtain the label value of the initial label corresponding to each picture in the standard picture set, and according to the predicted value and the label value, Calculate using the pre-built first loss function to obtain the first loss value;
  • the label value corresponds to the initial label one-to-one.
  • the initial label has two labels: tuberculosis and non-tuberculosis, the label value corresponding to the tuberculosis label is 1, and the label corresponding to the non-tuberculosis label The value is 0.
  • Step C Compare the magnitude of the first loss value with the preset first loss threshold, and when the first loss value is greater than or equal to the first preset threshold, return to the step A; When a loss value is less than the first preset threshold, stop training to obtain the first recognition model.
  • performing a convolution pooling operation on the first standard picture set to obtain the first feature set includes: performing a convolution operation on the first standard picture set to obtain the first convolution data Set; maximum pooling operation is performed on the first convolutional data set to obtain the first feature set.
  • ⁇ ' represents the number of channels of the first convolution data set
  • represents the number of channels of the first standard picture set
  • k is the size of the preset convolution kernel
  • f is the step size of the preset convolution operation
  • P is the preset data zero-filling matrix
  • the first activation function in the preferred embodiment of the present application includes:
  • ⁇ t represents the predicted value
  • s represents the data in the feature set
  • the first loss function described in the preferred embodiment of the present application includes:
  • L ce represents the first loss value
  • N is the number of data in the first standard picture set
  • i is a positive integer
  • y i is the label value
  • p i is the predicted value
  • the pictures in the first standard picture set are global pictures, so the first recognition model is a global recognition model, but the actual application of the global recognition model usually ignores the subtle features corresponding to the local position, resulting in omissions. Therefore, it is necessary to use the local location picture set to train a local recognition model as a supplement to the first recognition model.
  • the first recognition model is a global recognition model for recognizing the whole lung, but the above The situation where there are slight lung lesions (such as fibrosis, multiple small spots) is easy to miss, therefore, the local recognition model of the upper lung part class is used to train the upper lung picture as a supplement to the first recognition model; Therefore, the local target area cropping conversion, data enhancement, and pixel normalization processing are performed on the initial picture set to obtain a second standard picture set, wherein the pictures in the second standard picture set are local position pictures, for example: The pictures in the first standard picture set are pictures of the whole lung area, and the pictures in the second standard picture set are pictures of the upper lung area.
  • performing local target region cropping conversion, data enhancement, and pixel normalization processing on the initial picture set in the embodiment of the present application includes:
  • the second region of interest is an upper lung region.
  • S32 Mark corresponding pictures in the initial partial picture set according to the initial label corresponding to each picture in the initial picture set to obtain a first partial picture set;
  • the label of picture A in the initial picture set is tuberculosis
  • the label position is the upper left lung.
  • the upper left lung picture a and the upper right lung picture b are obtained from picture A, and the picture a is marked as tuberculosis according to the label of picture A.
  • the following picture processing process only processes pictures in the first partial picture set, and does not affect the labels corresponding to the pictures.
  • S33 Filling and interpolating pictures in the first partial picture set to a preset size to obtain a second partial picture set;
  • the data processing method of self-supervised learning model training well known to those skilled in the art is used to adjust the angle and the corresponding angle of each picture in the third partial picture set.
  • Label marking for example, random 0°, 90°, 180°, 270° rotation of pictures in the third partial picture set, and label marking of rotation angles, to obtain the second standard picture set.
  • the pictures in the second standard picture set have double labels, which are the initial label and the rotation angle label.
  • the initial label of picture A is tuberculosis
  • the rotation angle label is 90°.
  • the second deep learning model may be a convolutional neural network model or a residual network model.
  • using the second standard picture set to train the pre-built second deep learning model includes:
  • Step I Perform weight calculation according to the preset second loss function and the preset third loss function to obtain the target loss function
  • the pictures in the second standard picture set have double labels, which are the initial label and the rotation angle label. Therefore, two types of prediction results will be generated during the model training process.
  • two loss functions are required, namely the second loss function and the third loss function, and the second loss function is the loss function corresponding to the initial label ,
  • the third loss function is a loss function corresponding to the rotation angle label.
  • the weight calculation is performed according to the preset second loss function and the preset third loss function, and the weight calculation can be expressed by the following formula:
  • L is the target loss function
  • L tb is the second loss function
  • L rot is the third loss function
  • is a preset weight coefficient
  • the weight coefficient is 0.1.
  • Step II According to the target loss function, use the second standard picture set to train the second deep learning model; when the value of the target loss function is less than a second preset threshold, stop training to obtain The second recognition model.
  • the first recognition model and the second recognition model are used to recognize and determine the result of the picture to be recognized to obtain a target recognition result.
  • the format of the picture to be recognized in the embodiment of this application is the same as that of the picture in the initial picture collection.
  • the picture to be recognized in the embodiment of this application is a medical imaging picture in medical technology, such as a patient's chest X-ray. .
  • the first recognition model and the second recognition model are used to recognize the picture to be recognized, and the first recognition model is used to recognize the picture to be recognized to obtain a first recognition. Result; using the second recognition model to recognize the picture to be recognized to obtain a second recognition result, wherein the second recognition result includes a disease recognition result and a picture rotation angle result, preferably, the embodiment of the present application
  • the disease recognition result mentioned in is the recognition result of tuberculosis.
  • the logical operation is performed according to the first recognition result and the second recognition result to obtain the target recognition result, wherein the logical operation in the embodiment of the present application is OR, AND Two logical operations, for example: the first recognition result is positive for pulmonary tuberculosis, the disease recognition result in the disease recognition result in the second recognition result is negative for pulmonary tuberculosis, or the first recognition result is negative for pulmonary tuberculosis, the If the disease recognition result in the second recognition result is positive for pulmonary tuberculosis, the target recognition result is positive for pulmonary tuberculosis; when the disease recognition results in the first recognition result and the second recognition result are both negative for pulmonary tuberculosis, then The target recognition result is negative for pulmonary tuberculosis; when the disease recognition results in the first recognition result and the second recognition result are both positive for pulmonary tuberculosis, the target recognition result is positive for pulmonary tuberculosis.
  • the picture to be identified may be stored in the blockchain.
  • the global target area cropping conversion and pixel normalization processing are performed on the initial picture set to obtain a first standard picture set, and the first standard picture set is used to train the pre-built first deep learning model , Improve the training speed and accuracy of the model, realize the global recognition of the picture; perform local target area cropping conversion, data enhancement and pixel normalization processing on the initial picture set, and use the second standard picture set to pre-build
  • the second deep learning model is trained to obtain the second recognition model, which improves the training speed of the model, enhances the robustness of the model, and realizes the local recognition of the picture; when the picture to be recognized is received, the first recognition model is used.
  • a recognition model and the second recognition model recognize and determine the result of the picture to be recognized to obtain a recognition result, and perform complementary operations of global recognition of the picture and local recognition of the picture through a dual model to improve the accuracy of picture recognition.
  • FIG. 3 it is a functional block diagram of the picture recognition device of the present application.
  • the picture recognition apparatus 100 described in this application can be installed in an electronic device.
  • the picture recognition device may include a global model generation module 101, a local model generation module 102, and a picture recognition module 103.
  • the module described in the present invention can also be called a unit, which refers to a series of computer program segments that can be executed by the processor of an electronic device and can complete fixed functions, and are stored in the memory of the electronic device.
  • each module/unit is as follows:
  • the global model generation module 101 is used to obtain an initial picture set, perform global target area cropping conversion and pixel normalization processing on the initial picture set to obtain a first standard picture set;
  • the constructed first deep learning model is trained to obtain the first recognition model.
  • the initial picture set may be a medical image picture set containing initial tags, such as a patient chest X-ray film set containing tags, wherein the initial tags are preset disease identification tags, such as tuberculosis and tuberculosis. Non-tuberculosis.
  • the global model generation module 101 crops the first region of interest of each picture in the initial picture set to obtain the first global picture set
  • the first region of interest is a whole lung region.
  • the global model generation module 101 described in this embodiment of the present application fills and interpolates each picture in the first global picture set to a preset size to obtain a second global picture set.
  • the global model generation module 101 of the embodiment of the present application uses the following means to fill and interpolate each picture in the first global picture set to a preset size, including: inserting each picture in the first global picture set Fill in blank pixels according to preset rules to obtain a filled picture set; perform picture interpolation and adjust each picture in the filled picture set to a preset size to obtain the second global picture set, wherein the pictures in the filled picture set It has the same aspect ratio as the pictures in the second global picture set.
  • the interpolation value for picture A in the first global picture set is a preset size
  • the preset size is 1024*1024
  • the size of picture A is 256 *240
  • the size of picture B is 256*256
  • interpolate picture B to get a size of 1024*1024 picture C completes the filling and interpolation of picture A .
  • the global model generation module 101 normalizes each pixel value in each picture in the second global picture set to obtain The first standard picture collection. Further, the global model generation module 101 according to the embodiment of the present application normalizes each original pixel value in each picture in the second global picture set and can be calculated by the following formula:
  • P x represents the original pixel value
  • P g represents the original pixel value after normalization
  • the global model generation module 101 performs global target area cropping conversion and pixel normalization processing on the initial picture set to obtain the first standard picture set, including: cropping the initial picture set.
  • the first region of interest of each picture in the picture set is used to obtain the first global picture set;
  • the filling and interpolation of each picture in the first global picture set is the preset size to obtain the second global picture set;
  • Each pixel value in each picture in the global picture set is normalized to obtain the first standard picture set.
  • the first deep learning model may be a convolutional neural network model or a residual network model.
  • the global model generation module 101 in the embodiment of the present application uses the following methods to train the pre-built first deep learning model, including:
  • Step A Perform a convolution pooling operation on the first standard picture set according to the preset number of convolution pooling times to obtain a feature set;
  • Step B Use the preset activation function to calculate the predicted value of the feature set, obtain the label value of the initial label corresponding to each picture in the standard picture set, and according to the predicted value and the label value, Calculate using the pre-built first loss function to obtain the first loss value;
  • the label value corresponds to the initial label one-to-one.
  • the initial label has two labels: tuberculosis and non-tuberculosis, the label value corresponding to the tuberculosis label is 1, and the label corresponding to the non-tuberculosis label The value is 0.
  • Step C Compare the magnitude of the first loss value with the preset first loss threshold, and when the first loss value is greater than or equal to the first preset threshold, return to the step A; When a loss value is less than the first preset threshold, stop training to obtain the first recognition model.
  • the global model generation module 101 performs a convolution pooling operation on the first standard picture set to obtain a first feature set, including: performing a convolution operation on the first standard picture set to obtain A first convolution data set; performing a maximum pooling operation on the first convolution data set to obtain the first feature set.
  • ⁇ ' represents the number of channels of the first convolution data set
  • represents the number of channels of the first standard picture set
  • k is the size of the preset convolution kernel
  • f is the step size of the preset convolution operation
  • P is the preset data zero-filling matrix
  • activation function described in the preferred embodiment of the present application includes:
  • ⁇ t represents the predicted value
  • s represents the data in the feature set
  • the first loss function described in the preferred embodiment of the present application includes:
  • L ce represents the first loss value
  • N is the number of data in the first standard picture set
  • i is a positive integer
  • y i is the label value
  • p i is the predicted value
  • the local model generation module 102 is used to perform local target region cropping conversion, data enhancement and pixel normalization processing on the initial picture set to obtain a second standard picture set;
  • the second deep learning model is trained to obtain the second recognition model.
  • the pictures in the first standard picture set are global pictures, so the first recognition model is a global recognition model, but the actual application of the global recognition model usually ignores the subtle features corresponding to the local position, resulting in omissions. Therefore, it is necessary to use the local location picture set to train a local recognition model as a supplement to the first recognition model.
  • the first recognition model is a global recognition model for recognizing the whole lung, but the above The situation where there are slight lung lesions (such as fibrosis, multiple small spots) is easy to miss, therefore, the local recognition model of the upper lung part class is used to train the upper lung picture as a supplement to the first recognition model; Therefore, the local target area cropping conversion, data enhancement, and pixel normalization processing are performed on the initial picture set to obtain a second standard picture set, wherein the pictures in the second standard picture set are local position pictures, for example: The pictures in the first standard picture set are pictures of the whole lung area, and the pictures in the second standard picture set are pictures of the upper lung area.
  • the local model generation module 102 in the embodiment of the present application uses the following methods to perform local target region cropping conversion, data enhancement, and pixel normalization processing on the initial picture set, including:
  • the second region of interest is an upper lung region.
  • the label of picture A in the initial picture set is tuberculosis
  • the label position is the upper left lung.
  • the upper left lung picture a and the upper right lung picture b are obtained from picture A, and picture a is labeled as tuberculosis label according to the label of picture A.
  • picture b is labeled as a non-tuberculosis label.
  • the following picture processing process only processes pictures in the first partial picture set, and does not affect the labels corresponding to the pictures.
  • the data processing method of self-supervised learning model training well known to those skilled in the art is used to adjust the angle and the corresponding angle of each picture in the third partial picture set.
  • Label marking for example, random 0°, 90°, 180°, 270° rotation of pictures in the third partial picture set, and label marking of rotation angles, to obtain the second standard picture set.
  • the pictures in the second standard picture set have double labels, which are the initial label and the rotation angle label.
  • the initial label of picture A is tuberculosis
  • the rotation angle label is 90°.
  • the second deep learning model may be a convolutional neural network model or a residual network model.
  • the local model generation module 102 uses the following means to train the pre-built second deep learning model including:
  • the pictures in the second standard picture set have double labels as the initial label and the rotation angle label. Therefore, two types of prediction results will be generated during the model training process.
  • the prediction results of the two categories are measured separately, and two loss functions are required, namely the second loss function and the third loss function, and the second loss function is the loss function corresponding to the initial label,
  • the third loss function is a loss function corresponding to the rotation angle label.
  • the weight calculation is performed according to the preset second loss function and the preset third loss function, and the weight calculation can be expressed by the following formula:
  • the weight calculation can be expressed by the following formula:
  • L is the target loss function
  • L tb is the second loss function
  • L rot is the third loss function
  • is a preset weight coefficient
  • the weight coefficient is 0.1.
  • the target loss function use the second standard picture set to train the second deep learning model; when the value of the target loss function is less than a second preset threshold, stop training to obtain the second Identify the model.
  • the picture recognition module 103 is configured to, when a picture to be recognized is received, use the first recognition model and the second recognition model to recognize and determine the result of the picture to be recognized to obtain a recognition result.
  • the format of the picture to be recognized in the embodiment of this application is the same as that of the picture in the initial picture collection.
  • the picture to be recognized in the embodiment of this application is a medical imaging picture in medical technology, such as a patient's chest X-ray. .
  • the picture recognition module 103 in the embodiment of the present application respectively uses the first recognition model and the second recognition model to recognize the picture to be recognized, and uses the first recognition model to recognize the picture to be recognized. Perform recognition to obtain a first recognition result; use the second recognition model to recognize the picture to be recognized to obtain a second recognition result, wherein the second recognition result includes a disease recognition result and a picture rotation angle result, preferably Specifically, the disease recognition result in the embodiment of the present application is the pulmonary tuberculosis recognition result.
  • the picture recognition module 103 in the embodiment of the present application uses the following means to perform logical operations according to the first recognition result and the second recognition result to obtain the target recognition result.
  • the logical operation is two logical operations of OR and AND, for example: the first recognition result is positive for tuberculosis, the disease recognition result in the disease recognition result in the second recognition result is negative for tuberculosis, or the first recognition result
  • the result is pulmonary tuberculosis negative, and the disease recognition result in the second recognition result is positive for pulmonary tuberculosis, then the target recognition result is positive for pulmonary tuberculosis; when the first recognition result and the disease recognition results in the second recognition result are both Is negative for pulmonary tuberculosis, the target recognition result is negative for pulmonary tuberculosis; when the disease recognition results in the first recognition result and the second recognition result are both positive for pulmonary tuberculosis, the target recognition result is positive for pulmonary tuberculosis .
  • the picture to be identified may be stored in the blockchain.
  • FIG. 4 it is a schematic diagram of the structure of an electronic device that implements the image recognition method of the present application.
  • the electronic device 1 may include a processor 10, a memory 11, and a bus, and may also include a computer program stored in the memory 11 and running on the processor 10, such as a picture recognition program.
  • the memory 11 may be volatile or non-volatile.
  • the memory 11 includes at least one type of readable storage medium.
  • the readable storage medium includes flash memory, mobile hard disk, and multimedia card.
  • Card-type memory for example: SD or DX memory, etc.
  • magnetic memory magnetic disk, optical disk, etc.
  • the memory 11 may be an internal storage unit of the electronic device 1 in some embodiments, for example, a mobile hard disk of the electronic device 1.
  • the memory 11 may also be an external storage device of the electronic device 1, such as a plug-in mobile hard disk, a smart media card (SMC), and a secure digital (Secure Digital) equipped on the electronic device 1.
  • SD Secure Digital
  • flash Card Flash Card
  • the memory 11 may also include both an internal storage unit of the electronic device 1 and an external storage device.
  • the memory 11 can be used not only to store application software and various data installed in the electronic device 1, such as the code of a picture recognition program, etc., but also to temporarily store data that has been output or will be output.
  • the processor 10 may be composed of integrated circuits in some embodiments, for example, may be composed of a single packaged integrated circuit, or may be composed of multiple integrated circuits with the same function or different functions, including one or more Combinations of central processing unit (CPU), microprocessor, digital processing chip, graphics processor, and various control chips, etc.
  • the processor 10 is the control unit of the electronic device, which uses various interfaces and lines to connect the various components of the entire electronic device, and runs or executes programs or modules (such as pictures) stored in the memory 11 Recognition program, etc.), and call the data stored in the memory 11 to execute various functions of the electronic device 1 and process data.
  • the bus may be a peripheral component interconnect standard (PCI) bus or an extended industry standard architecture (EISA) bus, etc.
  • PCI peripheral component interconnect standard
  • EISA extended industry standard architecture
  • the bus can be divided into address bus, data bus, control bus and so on.
  • the bus is configured to implement connection and communication between the memory 11 and at least one processor 10 and the like.
  • FIG. 4 only shows an electronic device with components. Those skilled in the art can understand that the structure shown in FIG. 3 does not constitute a limitation on the electronic device 1, and may include fewer or more components than shown in the figure. Components, or combinations of certain components, or different component arrangements.
  • the electronic device 1 may also include a power source (such as a battery) for supplying power to various components.
  • the power source may be logically connected to the at least one processor 10 through a power management device, thereby controlling power
  • the device implements functions such as charge management, discharge management, and power consumption management.
  • the power supply may also include any components such as one or more DC or AC power supplies, recharging devices, power failure detection circuits, power converters or inverters, and power status indicators.
  • the electronic device 1 may also include various sensors, Bluetooth modules, Wi-Fi modules, etc., which will not be repeated here.
  • the electronic device 1 may also include a network interface.
  • the network interface may include a wired interface and/or a wireless interface (such as a Wi-Fi interface, a Bluetooth interface, etc.), which is usually used in the electronic device 1 Establish a communication connection with other electronic devices.
  • the electronic device 1 may also include a user interface.
  • the user interface may be a display (Display) and an input unit (such as a keyboard (Keyboard)).
  • the user interface may also be a standard wired interface or a wireless interface.
  • the display may be an LED display, a liquid crystal display, a touch liquid crystal display, an OLED (Organic Light-Emitting Diode, organic light-emitting diode) touch device, etc.
  • the display can also be appropriately called a display screen or a display unit, which is used to display the information processed in the electronic device 1 and to display a visualized user interface.
  • the picture recognition program 12 stored in the memory 11 in the electronic device 1 is a combination of multiple instructions. When running in the processor 10, it can realize:
  • the first recognition model and the second recognition model are used to recognize and determine the result of the picture to be recognized to obtain a recognition result.
  • the integrated module/unit of the electronic device 1 can be stored in a computer-readable storage medium. It can be volatile or non-volatile.
  • the computer-readable medium may include: any entity or device capable of carrying the computer program code, recording medium, U disk, mobile hard disk, magnetic disk, optical disk, computer memory, read-only memory (ROM, Read-Only Memory) .
  • the computer usable storage medium may mainly include a storage program area and a storage data area, wherein the storage program area may store an operating system, an application program required by at least one function, etc.; the storage data area may store a block chain node Use the created data, etc.
  • modules described as separate components may or may not be physically separated, and the components displayed as modules may or may not be physical units, that is, they may be located in one place, or they may be distributed on multiple network units. Some or all of the modules can be selected according to actual needs to achieve the goals of the solutions of the embodiments.
  • the functional modules in the various embodiments of the present application may be integrated into one processing unit, or each unit may exist alone physically, or two or more units may be integrated into one unit.
  • the above-mentioned integrated unit may be implemented in the form of hardware, or may be implemented in the form of hardware plus software functional modules.
  • the blockchain referred to in this application is a new application mode of computer technology such as distributed data storage, point-to-point transmission, consensus mechanism, and encryption algorithm.
  • Blockchain essentially a decentralized database, is a series of data blocks associated with cryptographic methods. Each data block contains a batch of network transaction information for verification. The validity of the information (anti-counterfeiting) and the generation of the next block.
  • the blockchain can include the underlying platform of the blockchain, the platform product service layer, and the application service layer.

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • General Engineering & Computer Science (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Evolutionary Computation (AREA)
  • Artificial Intelligence (AREA)
  • Biomedical Technology (AREA)
  • General Health & Medical Sciences (AREA)
  • Evolutionary Biology (AREA)
  • Health & Medical Sciences (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Biophysics (AREA)
  • Computational Linguistics (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Molecular Biology (AREA)
  • Computing Systems (AREA)
  • Mathematical Physics (AREA)
  • Software Systems (AREA)
  • Multimedia (AREA)
  • Image Analysis (AREA)

Abstract

An image recognition method and apparatus, and an electronic device and a storage medium, which relate to the field of artificial intelligence, and can be applied to an application scenario of medical image recognition. The method comprises: performing global target region clipping and pixel normalization processing on an initial image set so as to obtain a first standard image set (S1); performing first model training using the first standard image set so as to obtain a first recognition model (S2); performing local target region clipping, data enhancement and pixel normalization processing on the initial image set so as to obtain a second standard image set (S3); performing second model training using the second standard image set so as to obtain a second recognition model (S4); and when an image to be subjected to recognition is received, performing recognition and result determination on said image using the first recognition model and the second recognition model so as to obtain a recognition result (S5). The image to be subjected to recognition can be stored in a blockchain node. By means of the method, the accuracy of image recognition can be improved.

Description

图片识别方法、装置、电子设备及计算机可读存储介质Picture recognition method, device, electronic equipment and computer readable storage medium
本申请要求于2020年9月24日提交中国专利局、申请号为CN202011015349.4、名称为“图片识别方法、装置、电子设备及计算机可读存储介质”的中国专利申请的优先权,其全部内容通过引用结合在本申请中。This application claims the priority of a Chinese patent application filed with the Chinese Patent Office, the application number is CN202011015349.4, and the title is "picture recognition method, device, electronic equipment, and computer-readable storage medium" on September 24, 2020, all of which The content is incorporated in this application by reference.
技术领域Technical field
本申请涉及人工智能领域,尤其涉及一种图片识别方法、装置、电子设备及计算机可读存储介质。This application relates to the field of artificial intelligence, and in particular to an image recognition method, device, electronic equipment, and computer-readable storage medium.
背景技术Background technique
随着人工智能的发展,利用图片识别模型对图片进行识别的应用越来越广泛,不仅能够应用在生活中,在医疗科技中也得到了广泛的应用,例如:对病人的胸部CT图片进行识别,辅助医生进行肺结核的诊断。With the development of artificial intelligence, the use of image recognition models to recognize images has become more and more widely used, not only in life, but also in medical technology, such as: recognizing patients’ chest CT images Assist the doctor in the diagnosis of tuberculosis.
但是,发明人意识到,目前的图片识别模型识别图片是对图片进行全局识别,容易忽略局部的细微特征,导致图片识别的准确度低。However, the inventor realizes that the current picture recognition model recognizes pictures globally, and it is easy to ignore local subtle features, resulting in low accuracy of picture recognition.
发明内容Summary of the invention
本申请提供的一种图片识别方法,包括:A picture recognition method provided by this application includes:
获取初始图片集,对所述初始图片集进行全局目标区域裁剪转换及像素归一化处理,得到第一标准图片集;Acquiring an initial picture set, and performing global target area cropping conversion and pixel normalization processing on the initial picture set to obtain a first standard picture set;
利用所述第一标准图片集对预构建的第一深度学习模型进行训练,得到第一识别模型;Training the pre-built first deep learning model by using the first standard picture set to obtain the first recognition model;
对所述初始图片集进行局部目标区域裁剪转换、数据增强及像素归一化处理,得到第二标准图片集;Performing local target region cropping conversion, data enhancement and pixel normalization processing on the initial picture set to obtain a second standard picture set;
利用所述第二标准图片集对预构建的第二深度学习模型进行训练,得到第二识别模型;Training the pre-built second deep learning model by using the second standard picture set to obtain a second recognition model;
当接收到待识别图片时,利用所述第一识别模型及所述第二识别模型对所述待识别图片进行识别及结果判断,得到识别结果。When the picture to be recognized is received, the first recognition model and the second recognition model are used to recognize and determine the result of the picture to be recognized to obtain a recognition result.
本申请还提供一种图片识别装置,所述装置包括:The present application also provides a picture recognition device, the device includes:
全局模型生成模块,用于获取初始图片集,对所述初始图片集进行全局目标区域裁剪转换及像素归一化处理,得到第一标准图片集;利用所述第一标准图片集对预构建的第一深度学习模型进行训练,得到第一识别模型;The global model generation module is used to obtain an initial picture set, perform global target area cropping conversion and pixel normalization processing on the initial picture set to obtain a first standard picture set; The first deep learning model is trained to obtain the first recognition model;
局部模型生成模块,用于对所述初始图片集进行局部目标区域裁剪转换、数据增强及像素归一化处理,得到第二标准图片集;利用所述第二标准图片集对预构建的第二深度学习模型进行训练,得到第二识别模型;The local model generation module is used to perform local target area cropping conversion, data enhancement, and pixel normalization processing on the initial picture set to obtain a second standard picture set; use the second standard picture set to compare the pre-built second The deep learning model is trained to obtain the second recognition model;
图片识别模块,用于当接收到待识别图片时,利用所述第一识别模型及所述第二识别模型对所述待识别图片进行识别及结果判断,得到识别结果。The picture recognition module is configured to, when the picture to be recognized is received, use the first recognition model and the second recognition model to recognize and determine the result of the picture to be recognized to obtain a recognition result.
为了解决上述问题,本申请还提供一种电子设备,所述电子设备包括:In order to solve the above-mentioned problems, the present application also provides an electronic device, which includes:
至少一个处理器;以及,At least one processor; and,
与所述至少一个处理器通信连接的存储器;其中,A memory communicatively connected with the at least one processor; wherein,
所述存储器存储有可被所述至少一个处理器执行的计算机程序指令,所述计算机程序指令被所述至少一个处理器执行,以使所述至少一个处理器能够执行如下步骤:The memory stores computer program instructions executable by the at least one processor, and the computer program instructions are executed by the at least one processor, so that the at least one processor can execute the following steps:
获取初始图片集,对所述初始图片集进行全局目标区域裁剪转换及像素归一化处理,得到第一标准图片集;Acquiring an initial picture set, and performing global target area cropping conversion and pixel normalization processing on the initial picture set to obtain a first standard picture set;
利用所述第一标准图片集对预构建的第一深度学习模型进行训练,得到第一识别模型;Training the pre-built first deep learning model by using the first standard picture set to obtain the first recognition model;
对所述初始图片集进行局部目标区域裁剪转换、数据增强及像素归一化处理,得到第二标准图片集;Performing local target region cropping conversion, data enhancement and pixel normalization processing on the initial picture set to obtain a second standard picture set;
利用所述第二标准图片集对预构建的第二深度学习模型进行训练,得到第二识别模型;Training the pre-built second deep learning model by using the second standard picture set to obtain a second recognition model;
当接收到待识别图片时,利用所述第一识别模型及所述第二识别模型对所述待识别图片进行识别及结果判断,得到识别结果。When the picture to be recognized is received, the first recognition model and the second recognition model are used to recognize and determine the result of the picture to be recognized to obtain a recognition result.
为了解决上述问题,本申请还提供一种计算机可读存储介质,所述计算机可读存储介质中存储有计算机程序,所述计算机程序被处理器执行时实现如下步骤:In order to solve the above-mentioned problems, the present application also provides a computer-readable storage medium in which a computer program is stored, and when the computer program is executed by a processor, the following steps are implemented:
获取初始图片集,对所述初始图片集进行全局目标区域裁剪转换及像素归一化处理,得到第一标准图片集;Acquiring an initial picture set, and performing global target area cropping conversion and pixel normalization processing on the initial picture set to obtain a first standard picture set;
利用所述第一标准图片集对预构建的第一深度学习模型进行训练,得到第一识别模型;Training the pre-built first deep learning model by using the first standard picture set to obtain the first recognition model;
对所述初始图片集进行局部目标区域裁剪转换、数据增强及像素归一化处理,得到第二标准图片集;Performing local target region cropping conversion, data enhancement and pixel normalization processing on the initial picture set to obtain a second standard picture set;
利用所述第二标准图片集对预构建的第二深度学习模型进行训练,得到第二识别模型;Training the pre-built second deep learning model by using the second standard picture set to obtain a second recognition model;
当接收到待识别图片时,利用所述第一识别模型及所述第二识别模型对所述待识别图片进行识别及结果判断,得到识别结果。When the picture to be recognized is received, the first recognition model and the second recognition model are used to recognize and determine the result of the picture to be recognized to obtain a recognition result.
附图说明Description of the drawings
图1为本申请一实施例提供的图片识别方法的流程示意图;FIG. 1 is a schematic flowchart of a picture recognition method provided by an embodiment of this application;
图2为图1提供的图片识别方法中其中一个步骤的详细流程示意图;FIG. 2 is a detailed flowchart of one of the steps in the image recognition method provided in FIG. 1;
图3为本申请一实施例提供的图片识别装置的模块示意图;FIG. 3 is a schematic diagram of modules of a picture recognition device provided by an embodiment of the application;
图4为本申请一实施例提供的实现图片识别方法的电子设备的内部结构示意图;4 is a schematic diagram of the internal structure of an electronic device for implementing a picture recognition method provided by an embodiment of the application;
本申请目的的实现、功能特点及优点将结合实施例,参照附图做进一步说明。The realization, functional characteristics, and advantages of the purpose of this application will be further described in conjunction with the embodiments and with reference to the accompanying drawings.
具体实施方式Detailed ways
应当理解,此处所描述的具体实施例仅仅用以解释本申请,并不用于限定本申请。It should be understood that the specific embodiments described here are only used to explain the present application, and are not used to limit the present application.
本申请提供一种图片识别方法。参照图1所示,为本申请一实施例提供的图片识别方法的流程示意图。该方法可以由一个装置执行,该装置可以由软件和/或硬件实现。This application provides a picture recognition method. Referring to FIG. 1, it is a schematic flowchart of a picture recognition method provided by an embodiment of this application. The method can be executed by a device, and the device can be implemented by software and/or hardware.
在本实施例中,所述图片识别方法包括:In this embodiment, the picture recognition method includes:
S1、获取初始图片集,对所述初始图片集进行全局目标区域裁剪转换及像素归一化处理,得到第一标准图片集;S1. Obtain an initial picture set, and perform global target area cropping conversion and pixel normalization processing on the initial picture set to obtain a first standard picture set;
本申请实施例中,所述初始图片集可以为包含初始标签的医疗影像图片集合,如包含标签的病人胸部X光片集合,其中,所述初始标签为预设疾病判别标签,如:肺结核与非肺结核。In the embodiment of the present application, the initial picture set may be a medical image picture set containing initial tags, such as a patient chest X-ray film set containing tags, wherein the initial tags are preset disease identification tags, such as tuberculosis and tuberculosis. Non-tuberculosis.
进一步地,本申请实施例为了排除背景区域的干扰,提高后续模型的训练精度,裁剪所述初始图片集中每张图片的第一感兴趣区域(Region Of Interest,简称ROI区域),得到第一全局图片集,较佳地,本申请实施例中,所述第一感兴趣区域为全肺区域。Further, in order to eliminate the interference of the background region and improve the training accuracy of the subsequent model, the embodiment of the present application crops the first region of interest (Region Of Interest, ROI region for short) of each picture in the initial picture set to obtain the first global Picture collection, preferably, in this embodiment of the present application, the first region of interest is the whole lung region.
进一步地,为了便于后续模型统一处理,本申请实施例将所述第一全局图片集中的每张图片填充插值为预设大小,得到第二全局图片集。Further, in order to facilitate subsequent unified processing of the model, the embodiment of the present application fills and interpolates each picture in the first global picture set to a preset size to obtain the second global picture set.
详细地,本申请实施例将所述第一全局图片集中的每张图片填充插值为预设大小,包括:将所述第一全局图片集中的每张图片按照预设规则填充空白像素,得到填充图片集;对所述填充图片集中的每张图片进行图片插值调整为预设大小,得到所述第二全局图片集,其中,所述填充图片集中的图片与所述第二全局图片集中的图片具有相同的长宽比,例如:对所述第一全局图片集中图片A填充插值为预设大小,预设大小为1024*1024,图片A的大小为256*240,利用空白像素将图片A填充为包含图片A的最小正方形图片B,图片B的大小为256*256,再对图片B进行插值得到大小为1024*1024图片C,完成对图片A的 填充插值。In detail, the embodiment of the present application interpolating the filling of each picture in the first global picture set to a preset size includes: filling blank pixels in each picture in the first global picture set according to a preset rule to obtain the filling Picture set; picture interpolation is performed on each picture in the filling picture set to a preset size to obtain the second global picture set, wherein the pictures in the filling picture set are the same as the pictures in the second global picture set Have the same aspect ratio, for example: fill and interpolate picture A in the first global picture set to a preset size, the default size is 1024*1024, the size of picture A is 256*240, and picture A is filled with blank pixels It is the smallest square picture B containing picture A, the size of picture B is 256*256, and then picture B is interpolated to obtain picture C with a size of 1024*1024, and the filling and interpolation of picture A is completed.
进一步地,本申请实施例中,为了加快后续模型的训练速度,对所述第二全局图片集中的每张图片中的每个像素值进行归一化处理,得到所述第一标准图片集。进一步地,本申请实施例将所述第二全局图片集中的每张图片中的每个原始像素值归一化可用如下公式进行计算:Further, in the embodiment of the present application, in order to speed up the training speed of the subsequent model, normalization processing is performed on each pixel value in each picture in the second global picture set to obtain the first standard picture set. Further, in this embodiment of the present application, the normalization of each original pixel value in each picture in the second global picture set can be calculated by the following formula:
P g=P x/256 P g =P x /256
其中,P x表示所述原始像素值,P g表示归一化后的所述原始像素值。 Wherein, P x represents the original pixel value, and P g represents the original pixel value after normalization.
综上所述,本申请实施例中,所述对所述初始图片集进行全局目标区域裁剪转换及像素归一化处理,得到第一标准图片集,包括:裁剪所述初始图片集中每张图片的第一感兴趣区域,得到第一全局图片集;将所述第一全局图片集中的每张图片填充插值为预设大小,得到第二全局图片集;对所述第二全局图片集中的每张图片中的每个像素值进行归一化处理,得到所述第一标准图片集。In summary, in the embodiment of the present application, performing global target area cropping conversion and pixel normalization processing on the initial picture set to obtain the first standard picture set includes: cropping each picture in the initial picture set To obtain the first global picture set; fill and interpolate each picture in the first global picture set to a preset size to obtain the second global picture set; for each picture in the second global picture set Each pixel value in a picture is normalized to obtain the first standard picture set.
S2、利用所述第一标准图片集对预构建的第一深度学习模型进行训练,得到第一识别模型;S2. Use the first standard picture set to train the pre-built first deep learning model to obtain the first recognition model;
本申请实施例中,所述第一深度学习模型可以为卷积神经网络模型、残差网络模型。In the embodiment of the present application, the first deep learning model may be a convolutional neural network model or a residual network model.
进一步地,本申请实施例中利用所述第一标准图片集对预构建的第一深度学习模型进行训练,包括:Further, using the first standard picture set to train the pre-built first deep learning model in the embodiment of the present application includes:
步骤A:根据预设的卷积池化次数,对所述第一标准图片集进行卷积池化操作,得到特征集;Step A: Perform a convolution pooling operation on the first standard picture set according to the preset number of convolution pooling times to obtain a feature set;
步骤B:利用预设的激活函数对所述特征集进行计算得到预测值,获取所述标准图片集中每张图片对应的所述初始标签的标签值,根据所述预测值及所述标签值,利用预构建的第一损失函数进行计算,得到第一损失值;Step B: Use the preset activation function to calculate the predicted value of the feature set, obtain the label value of the initial label corresponding to each picture in the standard picture set, and according to the predicted value and the label value, Calculate using the pre-built first loss function to obtain the first loss value;
本申请实施例中所述标签值与所述初始标签是一一对应的,例如:所述初始标签共有肺结核与非肺结核两种标签,肺结核标签对应的标签值为1,非肺结核标签对应的标签值为0。In the examples of this application, the label value corresponds to the initial label one-to-one. For example, the initial label has two labels: tuberculosis and non-tuberculosis, the label value corresponding to the tuberculosis label is 1, and the label corresponding to the non-tuberculosis label The value is 0.
步骤C:对比所述第一损失值与预设的第一损失阈值的大小,当所述第一损失值大于或等于所述第一预设阈值时,返回所述步骤A;当所述第一损失值小于所述第一预设阈值时,停止训练,得到所述第一识别模型。Step C: Compare the magnitude of the first loss value with the preset first loss threshold, and when the first loss value is greater than or equal to the first preset threshold, return to the step A; When a loss value is less than the first preset threshold, stop training to obtain the first recognition model.
详细地,本申请实施例中所述对所述第一标准图片集进行卷积池化操作得到第一特征集,包括:对所述第一标准图片集进行卷积操作得到第一卷积数据集;对所述第一卷积数据集进行最大池化操作得到所述第一特征集。In detail, in the embodiment of the present application, performing a convolution pooling operation on the first standard picture set to obtain the first feature set includes: performing a convolution operation on the first standard picture set to obtain the first convolution data Set; maximum pooling operation is performed on the first convolutional data set to obtain the first feature set.
进一步地,所述卷积操作为:Further, the convolution operation is:
Figure PCTCN2020131990-appb-000001
Figure PCTCN2020131990-appb-000001
其中,ω’表示所述第一卷积数据集的通道数,ω表示所述第一标准图片集的通道数,k为预设卷积核的大小,f为预设卷积操作的步幅,p为预设数据补零矩阵。Where, ω'represents the number of channels of the first convolution data set, ω represents the number of channels of the first standard picture set, k is the size of the preset convolution kernel, and f is the step size of the preset convolution operation , P is the preset data zero-filling matrix.
进一步地,本申请较佳实施例所述第一激活函数包括:Further, the first activation function in the preferred embodiment of the present application includes:
Figure PCTCN2020131990-appb-000002
Figure PCTCN2020131990-appb-000002
其中,μ t表示所述预测值,s表示所述特征集中的数据。 Wherein, μ t represents the predicted value, and s represents the data in the feature set.
详细地,本申请较佳实施例所述第一损失函数包括:In detail, the first loss function described in the preferred embodiment of the present application includes:
Figure PCTCN2020131990-appb-000003
Figure PCTCN2020131990-appb-000003
其中,L ce表示所述第一损失值,N为所述第一标准图片集的数据数目,i为正整数,y i为 所述标签值,p i为所述预测值。 Wherein, L ce represents the first loss value, N is the number of data in the first standard picture set, i is a positive integer, y i is the label value, and p i is the predicted value.
S3、对所述初始图片集进行局部目标区域裁剪转换、数据增强及像素归一化处理,得到第二标准图片集;S3. Perform local target region cropping conversion, data enhancement, and pixel normalization processing on the initial picture set to obtain a second standard picture set;
本申请实施例中,所述第一标准图片集中的图片是全局图片,所以所述第一识别模型为全局识别模型,但是实际应用所述全局识别模型通常会忽略对应局部位置的细微特征导致漏检,因此,需要利用局部位置图片集合再训练一个局部识别的模型作为对所述第一识别模型的补充,例如:所述第一识别模型为对全肺进行识别的全局识别的模型,但是上肺部有轻微的病灶的情形(如纤维化、多发性小斑点)容易漏掉,因此,利用上肺部图片训练对上肺部分类的局部识别模型作为对所述第一识别模型的补充;所以对所述初始图片集进行局部目标区域裁剪转换、数据增强及像素归一化处理,得到第二标准图片集,其中,所述第二标准图片集中的图片为局部位置图片,例如:所述第一标准图片集中的图片为全肺区域图片,所述第二标准图片集中的图片为上肺部区域图片。In the embodiment of the present application, the pictures in the first standard picture set are global pictures, so the first recognition model is a global recognition model, but the actual application of the global recognition model usually ignores the subtle features corresponding to the local position, resulting in omissions. Therefore, it is necessary to use the local location picture set to train a local recognition model as a supplement to the first recognition model. For example, the first recognition model is a global recognition model for recognizing the whole lung, but the above The situation where there are slight lung lesions (such as fibrosis, multiple small spots) is easy to miss, therefore, the local recognition model of the upper lung part class is used to train the upper lung picture as a supplement to the first recognition model; Therefore, the local target area cropping conversion, data enhancement, and pixel normalization processing are performed on the initial picture set to obtain a second standard picture set, wherein the pictures in the second standard picture set are local position pictures, for example: The pictures in the first standard picture set are pictures of the whole lung area, and the pictures in the second standard picture set are pictures of the upper lung area.
详细地,参照图2所示,本申请实施例中所述对所述初始图片集进行局部目标区域裁剪转换、数据增强及像素归一化处理,包括:In detail, referring to FIG. 2, performing local target region cropping conversion, data enhancement, and pixel normalization processing on the initial picture set in the embodiment of the present application includes:
S31、裁剪所述初始图片集中每张图片的第二感兴趣区域,得到初始局部图片集;S31. Crop the second region of interest of each picture in the initial picture set to obtain an initial partial picture set;
较佳地,本申请实施例中,所述第二感兴趣区域为上肺部区域。Preferably, in this embodiment of the present application, the second region of interest is an upper lung region.
S32、根据所述初始图片集中每张图片对应的所述初始标签对所述初始局部图片集中对应的图片进行标记,得到第一局部图片集;S32: Mark corresponding pictures in the initial partial picture set according to the initial label corresponding to each picture in the initial picture set to obtain a first partial picture set;
例如:所述初始图片集中的图片A标签为肺结核,标签位置为左上肺,经过S31的处理,通过图片A得到左上肺图片a及右上肺图片b,根据图片A的标签将图片a标记为肺结核标签,将图片b标记为非肺结核标签。For example: the label of picture A in the initial picture set is tuberculosis, and the label position is the upper left lung. After S31 processing, the upper left lung picture a and the upper right lung picture b are obtained from picture A, and the picture a is marked as tuberculosis according to the label of picture A. Label, mark picture b as a non-TB label.
本申请实施例中,下述图片处理过程只是对第一局部图片集中的图片进行处理,不会影响图片对应的标签。In the embodiment of the present application, the following picture processing process only processes pictures in the first partial picture set, and does not affect the labels corresponding to the pictures.
S33、将所述第一局部图片集中的图片填充插值为预设大小,得到第二局部图片集;S33: Filling and interpolating pictures in the first partial picture set to a preset size to obtain a second partial picture set;
S34、对所述第二局部图片集中的每张图片中的每个像素值进行归一化处理,得到第三局部图片集;S34. Perform normalization processing on each pixel value in each picture in the second partial picture set to obtain a third partial picture set;
S35、对所述第三局部图片集中的每张图片进行预设角度旋转,以对应的旋转角度进行标签标记,得到所述第二标准图片集;S35: Rotate each picture in the third partial picture set by a preset angle, and mark the picture with the corresponding rotation angle to obtain the second standard picture set;
本申请实施例中,为了提高后续模型的泛化能力,利用本领域技术人员熟知的自监督学习模型训练的数据处理方法对所述第三局部图片集中的每张图片进行角度调整及对应的角度标签标记,例如:对所述第三局部图片集中的图片进行随机0°、90°、180°、270°旋转,且进行旋转角度标签标记,得到所述第二标准图片集。In the embodiment of the application, in order to improve the generalization ability of the subsequent model, the data processing method of self-supervised learning model training well known to those skilled in the art is used to adjust the angle and the corresponding angle of each picture in the third partial picture set. Label marking, for example, random 0°, 90°, 180°, 270° rotation of pictures in the third partial picture set, and label marking of rotation angles, to obtain the second standard picture set.
详细地,所述第二标准图片集中的图片具有双重标签,分别为所述初始标签及所述旋转角度标签,例如:图片A的初始标签为肺结核,旋转角度标签为90°。In detail, the pictures in the second standard picture set have double labels, which are the initial label and the rotation angle label. For example, the initial label of picture A is tuberculosis, and the rotation angle label is 90°.
S4、利用所述第二标准图片集对预构建的第二深度学习模型进行训练,得到第二识别模型;S4. Use the second standard picture set to train the pre-built second deep learning model to obtain a second recognition model;
本申请实施例中,所述第二深度学习模型可以为卷积神经网络模型、残差网络模型。In the embodiment of the present application, the second deep learning model may be a convolutional neural network model or a residual network model.
详细地,本申请实施例中,利用所述第二标准图片集对预构建的第二深度学习模型进行训练包括:In detail, in this embodiment of the present application, using the second standard picture set to train the pre-built second deep learning model includes:
步骤I:根据预设的第二损失函数及预设的第三损失函数进行权重计算,得到目标损失函数;Step I: Perform weight calculation according to the preset second loss function and the preset third loss function to obtain the target loss function;
详细地,本申请实施例中,所述第二标准图片集中的图片具有双重标签,分别为所述初始标签与所述旋转角度标签,因此模型训练过程中会产生两种类别的预测结果,为了对所述两种类别的预测结果分别进行衡量,需要两个损失函数,分别为所述第二损失函数及所述第三损失函数,所述第二损失函数为所述初始标签对应的损失函数,所述第三损失函 数为所述旋转角度标签对应的损失函数。In detail, in the embodiment of the present application, the pictures in the second standard picture set have double labels, which are the initial label and the rotation angle label. Therefore, two types of prediction results will be generated during the model training process. To measure the prediction results of the two categories separately, two loss functions are required, namely the second loss function and the third loss function, and the second loss function is the loss function corresponding to the initial label , The third loss function is a loss function corresponding to the rotation angle label.
进一步地,为了很好的衡量模型的训练进度,根据预设的第二损失函数及预设的第三损失函数进行权重计算,所述权重计算可用以下公式表示:Further, in order to better measure the training progress of the model, the weight calculation is performed according to the preset second loss function and the preset third loss function, and the weight calculation can be expressed by the following formula:
L=L tb+αL rot L=L tb +αL rot
其中,L为所述目标损失函数,L tb为所述第二损失函数;L rot为所述第三损失函数,α为预设权重系数。 Wherein, L is the target loss function, L tb is the second loss function; L rot is the third loss function, and α is a preset weight coefficient.
较佳地,所述权重系数为0.1。Preferably, the weight coefficient is 0.1.
步骤II:根据所述目标损失函数,利用所述第二标准图片集对所述第二深度学习模型进行训练;当所述目标损失函数的值小于第二预设阈值时,停止训练,得到所述第二识别模型。Step II: According to the target loss function, use the second standard picture set to train the second deep learning model; when the value of the target loss function is less than a second preset threshold, stop training to obtain The second recognition model.
S5、当接收到待识别图片时,利用所述第一识别模型及所述第二识别模型对所述待识别图片进行识别及结果判断,得到目标识别结果。S5. When the picture to be recognized is received, the first recognition model and the second recognition model are used to recognize and determine the result of the picture to be recognized to obtain a target recognition result.
本申请实施例中所述待识别图片与所述初始图片集中图片格式相同,较佳地,本申请实施例中所述待识别图片为医疗科技中的医学影像图片,如:病人胸部X光片。The format of the picture to be recognized in the embodiment of this application is the same as that of the picture in the initial picture collection. Preferably, the picture to be recognized in the embodiment of this application is a medical imaging picture in medical technology, such as a patient's chest X-ray. .
进一步地,本申请实施例分别利用所述第一识别模型及所述第二识别模型对所述待识别图片进行识别,利用所述第一识别模型对所述待识别图片进行识别得到第一识别结果;利用所述第二识别模型对所述待识别图片进行识别得到第二识别结果,其中,所述第二识别结果中包括疾病识别结果及图片旋转角度结果,较佳地,本申请实施例中所述疾病识别结果为肺结核识别结果。Further, in this embodiment of the application, the first recognition model and the second recognition model are used to recognize the picture to be recognized, and the first recognition model is used to recognize the picture to be recognized to obtain a first recognition. Result; using the second recognition model to recognize the picture to be recognized to obtain a second recognition result, wherein the second recognition result includes a disease recognition result and a picture rotation angle result, preferably, the embodiment of the present application The disease recognition result mentioned in is the recognition result of tuberculosis.
进一步地,本申请实施例中所述根据所述第一识别结果及所述第二识别结果进行逻辑运算,得到所述目标识别结果,其中,本申请实施例中所述逻辑运算为或、与两种逻辑运算,例如:所述第一识别结果为肺结核阳性,所述第二识别结果中的疾病识别结果中的疾病识别结果为肺结核阴性,或所述第一识别结果为肺结核阴性,所述第二识别结果中的疾病识别结果为肺结核阳性,则所述目标识别结果为肺结核阳性;当所述第一识别结果与所述第二识别结果中的疾病识别结果均为肺结核阴性,则所述目标识别结果为肺结核阴性;当所述当所述第一识别结果与所述第二识别结果中的疾病识别结果均为肺结核阳性,则所述目标识别结果为肺结核阳性。Further, in the embodiment of the present application, the logical operation is performed according to the first recognition result and the second recognition result to obtain the target recognition result, wherein the logical operation in the embodiment of the present application is OR, AND Two logical operations, for example: the first recognition result is positive for pulmonary tuberculosis, the disease recognition result in the disease recognition result in the second recognition result is negative for pulmonary tuberculosis, or the first recognition result is negative for pulmonary tuberculosis, the If the disease recognition result in the second recognition result is positive for pulmonary tuberculosis, the target recognition result is positive for pulmonary tuberculosis; when the disease recognition results in the first recognition result and the second recognition result are both negative for pulmonary tuberculosis, then The target recognition result is negative for pulmonary tuberculosis; when the disease recognition results in the first recognition result and the second recognition result are both positive for pulmonary tuberculosis, the target recognition result is positive for pulmonary tuberculosis.
本申请的另一实施例中,为了保证数据的隐私性,所述待识别图片可以存储在区块链中。In another embodiment of the present application, in order to ensure the privacy of the data, the picture to be identified may be stored in the blockchain.
本申请实施例中,对所述初始图片集进行全局目标区域裁剪转换及像素归一化处理得到第一标准图片集,利用所述第一标准图片集对预构建的第一深度学习模型进行训练,提高了模型的训练速度及精度,实现对图片的全局识别;对所述初始图片集进行局部目标区域裁剪转换、数据增强及像素归一化处理,利用所述第二标准图片集对预构建的第二深度学习模型进行训练,得到第二识别模型,提高了模型的训练速度,增强了模型的鲁棒性,实现了对图片的局部识别;当接收到待识别图片时,利用所述第一识别模型及所述第二识别模型对所述待识别图片进行识别及结果判断,得到识别结果,通过双模型的进行图片全局识别与图片局部识别的互补操作,提高图片识别的准确度。In the embodiment of the present application, the global target area cropping conversion and pixel normalization processing are performed on the initial picture set to obtain a first standard picture set, and the first standard picture set is used to train the pre-built first deep learning model , Improve the training speed and accuracy of the model, realize the global recognition of the picture; perform local target area cropping conversion, data enhancement and pixel normalization processing on the initial picture set, and use the second standard picture set to pre-build The second deep learning model is trained to obtain the second recognition model, which improves the training speed of the model, enhances the robustness of the model, and realizes the local recognition of the picture; when the picture to be recognized is received, the first recognition model is used. A recognition model and the second recognition model recognize and determine the result of the picture to be recognized to obtain a recognition result, and perform complementary operations of global recognition of the picture and local recognition of the picture through a dual model to improve the accuracy of picture recognition.
如图3所示,是本申请图片识别装置的功能模块图。As shown in Figure 3, it is a functional block diagram of the picture recognition device of the present application.
本申请所述图片识别装置100可以安装于电子设备中。根据实现的功能,所述图片识别装置可以包括全局模型生成模块101、局部模型生成模块102、图片识别模块103。本发所述模块也可以称之为单元,是指一种能够被电子设备处理器所执行,并且能够完成固定功能的一系列计算机程序段,其存储在电子设备的存储器中。The picture recognition apparatus 100 described in this application can be installed in an electronic device. According to the realized functions, the picture recognition device may include a global model generation module 101, a local model generation module 102, and a picture recognition module 103. The module described in the present invention can also be called a unit, which refers to a series of computer program segments that can be executed by the processor of an electronic device and can complete fixed functions, and are stored in the memory of the electronic device.
在本实施例中,关于各模块/单元的功能如下:In this embodiment, the functions of each module/unit are as follows:
所述全局模型生成模块101用于获取初始图片集,对所述初始图片集进行全局目标区 域裁剪转换及像素归一化处理,得到第一标准图片集;利用所述第一标准图片集对预构建的第一深度学习模型进行训练,得到第一识别模型。The global model generation module 101 is used to obtain an initial picture set, perform global target area cropping conversion and pixel normalization processing on the initial picture set to obtain a first standard picture set; The constructed first deep learning model is trained to obtain the first recognition model.
本申请实施例中,所述初始图片集可以为包含初始标签的医疗影像图片集合,如包含标签的病人胸部X光片集合,其中,所述初始标签为预设疾病判别标签,如:肺结核与非肺结核。In the embodiment of the present application, the initial picture set may be a medical image picture set containing initial tags, such as a patient chest X-ray film set containing tags, wherein the initial tags are preset disease identification tags, such as tuberculosis and tuberculosis. Non-tuberculosis.
进一步地,本申请实施例为了排除背景区域的干扰,提高后续模型的训练精度,所述全局模型生成模块101裁剪所述初始图片集中每张图片的第一感兴趣区域,得到第一全局图片集,较佳地,本申请实施例中,所述第一感兴趣区域为全肺区域。Further, in the embodiment of the present application, in order to eliminate the interference of the background area and improve the training accuracy of the subsequent model, the global model generation module 101 crops the first region of interest of each picture in the initial picture set to obtain the first global picture set Preferably, in the embodiment of the present application, the first region of interest is a whole lung region.
进一步地,为了便于后续模型统一处理,本申请实施例所述全局模型生成模块101将所述第一全局图片集中的每张图片填充插值为预设大小,得到第二全局图片集。Further, in order to facilitate subsequent unified model processing, the global model generation module 101 described in this embodiment of the present application fills and interpolates each picture in the first global picture set to a preset size to obtain a second global picture set.
详细地,本申请实施例所述全局模型生成模块101利用如下手段将所述第一全局图片集中的每张图片填充插值为预设大小,包括:将所述第一全局图片集中的每张图片按照预设规则填充空白像素,得到填充图片集;对所述填充图片集中的每张图片进行图片插值调整为预设大小,得到所述第二全局图片集,其中,所述填充图片集中的图片与所述第二全局图片集中的图片具有相同的长宽比,例如:对所述第一全局图片集中图片A填充插值为预设大小,预设大小为1024*1024,图片A的大小为256*240,利用空白像素将图片A填充为包含图片A的最小正方形图片B,图片B的大小为256*256,再对图片B进行插值得到大小为1024*1024图片C完成对图片A的填充插值。In detail, the global model generation module 101 of the embodiment of the present application uses the following means to fill and interpolate each picture in the first global picture set to a preset size, including: inserting each picture in the first global picture set Fill in blank pixels according to preset rules to obtain a filled picture set; perform picture interpolation and adjust each picture in the filled picture set to a preset size to obtain the second global picture set, wherein the pictures in the filled picture set It has the same aspect ratio as the pictures in the second global picture set. For example, the interpolation value for picture A in the first global picture set is a preset size, the preset size is 1024*1024, and the size of picture A is 256 *240, use blank pixels to fill picture A into the smallest square picture B containing picture A, the size of picture B is 256*256, and then interpolate picture B to get a size of 1024*1024 picture C completes the filling and interpolation of picture A .
进一步地,本申请实施例中,为了加快后续模型的训练速度,所述全局模型生成模块101对所述第二全局图片集中的每张图片中的每个像素值进行归一化处理,得到所述第一标准图片集。进一步地,本申请实施例所述全局模型生成模块101将所述第二全局图片集中的每张图片中的每个原始像素值归一化可用如下公式进行计算:Further, in this embodiment of the application, in order to speed up the training of subsequent models, the global model generation module 101 normalizes each pixel value in each picture in the second global picture set to obtain The first standard picture collection. Further, the global model generation module 101 according to the embodiment of the present application normalizes each original pixel value in each picture in the second global picture set and can be calculated by the following formula:
P g=P x/256 P g =P x /256
其中,P x表示所述原始像素值,P g表示归一化后的所述原始像素值。 Wherein, P x represents the original pixel value, and P g represents the original pixel value after normalization.
综上所述,本申请实施例中,所述全局模型生成模块101对所述初始图片集进行全局目标区域裁剪转换及像素归一化处理,得到第一标准图片集,包括:裁剪所述初始图片集中每张图片的第一感兴趣区域,得到第一全局图片集;将所述第一全局图片集中的每张图片填充插值为预设大小,得到第二全局图片集;对所述第二全局图片集中的每张图片中的每个像素值进行归一化处理,得到所述第一标准图片集。In summary, in the embodiment of the present application, the global model generation module 101 performs global target area cropping conversion and pixel normalization processing on the initial picture set to obtain the first standard picture set, including: cropping the initial picture set. The first region of interest of each picture in the picture set is used to obtain the first global picture set; the filling and interpolation of each picture in the first global picture set is the preset size to obtain the second global picture set; Each pixel value in each picture in the global picture set is normalized to obtain the first standard picture set.
本申请实施例中,所述第一深度学习模型可以为卷积神经网络模型、残差网络模型。In the embodiment of the present application, the first deep learning model may be a convolutional neural network model or a residual network model.
进一步地,本申请实施例中所述全局模型生成模块101利用如下手段对预构建的第一深度学习模型进行训练,包括:Further, the global model generation module 101 in the embodiment of the present application uses the following methods to train the pre-built first deep learning model, including:
步骤A:根据预设的卷积池化次数,对所述第一标准图片集进行卷积池化操作,得到特征集;Step A: Perform a convolution pooling operation on the first standard picture set according to the preset number of convolution pooling times to obtain a feature set;
步骤B:利用预设的激活函数对所述特征集进行计算得到预测值,获取所述标准图片集中每张图片对应的所述初始标签的标签值,根据所述预测值及所述标签值,利用预构建的第一损失函数进行计算,得到第一损失值;Step B: Use the preset activation function to calculate the predicted value of the feature set, obtain the label value of the initial label corresponding to each picture in the standard picture set, and according to the predicted value and the label value, Calculate using the pre-built first loss function to obtain the first loss value;
本申请实施例中所述标签值与所述初始标签是一一对应的,例如:所述初始标签共有肺结核与非肺结核两种标签,肺结核标签对应的标签值为1,非肺结核标签对应的标签值为0。In the examples of this application, the label value corresponds to the initial label one-to-one. For example, the initial label has two labels: tuberculosis and non-tuberculosis, the label value corresponding to the tuberculosis label is 1, and the label corresponding to the non-tuberculosis label The value is 0.
步骤C:对比所述第一损失值与预设的第一损失阈值的大小,当所述第一损失值大于或等于所述第一预设阈值时,返回所述步骤A;当所述第一损失值小于所述第一预设阈值时,停止训练,得到所述第一识别模型。Step C: Compare the magnitude of the first loss value with the preset first loss threshold, and when the first loss value is greater than or equal to the first preset threshold, return to the step A; When a loss value is less than the first preset threshold, stop training to obtain the first recognition model.
详细地,本申请实施例中所述全局模型生成模块101对所述第一标准图片集进行卷积池化操作得到第一特征集,包括:对所述第一标准图片集进行卷积操作得到第一卷积数据 集;对所述第一卷积数据集进行最大池化操作得到所述第一特征集。In detail, in the embodiment of the present application, the global model generation module 101 performs a convolution pooling operation on the first standard picture set to obtain a first feature set, including: performing a convolution operation on the first standard picture set to obtain A first convolution data set; performing a maximum pooling operation on the first convolution data set to obtain the first feature set.
进一步地,所述卷积操作为:Further, the convolution operation is:
Figure PCTCN2020131990-appb-000004
Figure PCTCN2020131990-appb-000004
其中,ω’表示所述第一卷积数据集的通道数,ω表示所述第一标准图片集的通道数,k为预设卷积核的大小,f为预设卷积操作的步幅,p为预设数据补零矩阵。Where, ω'represents the number of channels of the first convolution data set, ω represents the number of channels of the first standard picture set, k is the size of the preset convolution kernel, and f is the step size of the preset convolution operation , P is the preset data zero-filling matrix.
进一步地,本申请较佳实施例所述激活函数包括:Further, the activation function described in the preferred embodiment of the present application includes:
Figure PCTCN2020131990-appb-000005
Figure PCTCN2020131990-appb-000005
其中,μ t表示所述预测值,s表示所述特征集中的数据。 Wherein, μ t represents the predicted value, and s represents the data in the feature set.
详细地,本申请较佳实施例所述第一损失函数包括:In detail, the first loss function described in the preferred embodiment of the present application includes:
Figure PCTCN2020131990-appb-000006
Figure PCTCN2020131990-appb-000006
其中,L ce表示所述第一损失值,N为所述第一标准图片集的数据数目,i为正整数,y i为所述标签值,p i为所述预测值。 Wherein, L ce represents the first loss value, N is the number of data in the first standard picture set, i is a positive integer, y i is the label value, and p i is the predicted value.
所述局部模型生成模块102用于对所述初始图片集进行局部目标区域裁剪转换、数据增强及像素归一化处理,得到第二标准图片集;利用所述第二标准图片集对预构建的第二深度学习模型进行训练,得到第二识别模型。The local model generation module 102 is used to perform local target region cropping conversion, data enhancement and pixel normalization processing on the initial picture set to obtain a second standard picture set; The second deep learning model is trained to obtain the second recognition model.
本申请实施例中,所述第一标准图片集中的图片是全局图片,所以所述第一识别模型为全局识别模型,但是实际应用所述全局识别模型通常会忽略对应局部位置的细微特征导致漏检,因此,需要利用局部位置图片集合再训练一个局部识别的模型作为对所述第一识别模型的补充,例如:所述第一识别模型为对全肺进行识别的全局识别的模型,但是上肺部有轻微的病灶的情形(如纤维化、多发性小斑点)容易漏掉,因此,利用上肺部图片训练对上肺部分类的局部识别模型作为对所述第一识别模型的补充;所以对所述初始图片集进行局部目标区域裁剪转换、数据增强及像素归一化处理,得到第二标准图片集,其中,所述第二标准图片集中的图片为局部位置图片,例如:所述第一标准图片集中的图片为全肺区域图片,所述第二标准图片集中的图片为上肺部区域图片。In the embodiment of the present application, the pictures in the first standard picture set are global pictures, so the first recognition model is a global recognition model, but the actual application of the global recognition model usually ignores the subtle features corresponding to the local position, resulting in omissions. Therefore, it is necessary to use the local location picture set to train a local recognition model as a supplement to the first recognition model. For example, the first recognition model is a global recognition model for recognizing the whole lung, but the above The situation where there are slight lung lesions (such as fibrosis, multiple small spots) is easy to miss, therefore, the local recognition model of the upper lung part class is used to train the upper lung picture as a supplement to the first recognition model; Therefore, the local target area cropping conversion, data enhancement, and pixel normalization processing are performed on the initial picture set to obtain a second standard picture set, wherein the pictures in the second standard picture set are local position pictures, for example: The pictures in the first standard picture set are pictures of the whole lung area, and the pictures in the second standard picture set are pictures of the upper lung area.
详细地,本申请实施例中所述局部模型生成模块102利用如下手段对所述初始图片集进行局部目标区域裁剪转换、数据增强及像素归一化处理,包括:In detail, the local model generation module 102 in the embodiment of the present application uses the following methods to perform local target region cropping conversion, data enhancement, and pixel normalization processing on the initial picture set, including:
裁剪所述初始图片集中每张图片的第二感兴趣区域,得到初始局部图片集;Crop the second region of interest of each picture in the initial picture set to obtain the initial partial picture set;
较佳地,本申请实施例中,所述第二感兴趣区域为上肺部区域。Preferably, in this embodiment of the present application, the second region of interest is an upper lung region.
根据所述初始图片集中每张图片对应的所述初始标签对所述初始局部图片集中对应的图片进行标记,得到第一局部图片集;Marking the corresponding pictures in the initial partial picture set according to the initial label corresponding to each picture in the initial picture set to obtain a first partial picture set;
例如:所述初始图片集中的图片A标签为肺结核,标签位置为左上肺,经过上述处理,通过图片A得到左上肺图片a及右上肺图片b,根据图片A的标签将图片a标记为肺结核标签,将图片b标记为非肺结核标签。For example: the label of picture A in the initial picture set is tuberculosis, and the label position is the upper left lung. After the above processing, the upper left lung picture a and the upper right lung picture b are obtained from picture A, and picture a is labeled as tuberculosis label according to the label of picture A. , And mark picture b as a non-tuberculosis label.
本申请实施例中,下述图片处理过程只是对第一局部图片集中的图片进行处理,不会影响图片对应的标签。In the embodiment of the present application, the following picture processing process only processes pictures in the first partial picture set, and does not affect the labels corresponding to the pictures.
将所述第一局部图片集中的图片填充插值为预设大小,得到第二局部图片集;Filling and interpolating pictures in the first partial picture set to a preset size to obtain a second partial picture set;
对所述第二局部图片集中的每张图片中的每个像素值进行归一化处理,得到第三局部图片集;Performing normalization processing on each pixel value in each picture in the second partial picture set to obtain a third partial picture set;
对所述第三局部图片集中的每张图片进行预设角度旋转,以对应的旋转角度进行标签标记,得到所述第二标准图片集;Performing a preset angle rotation on each picture in the third partial picture set, and labeling with a corresponding rotation angle to obtain the second standard picture set;
本申请实施例中,为了提高后续模型的泛化能力,利用本领域技术人员熟知的自监督 学习模型训练的数据处理方法对所述第三局部图片集中的每张图片进行角度调整及对应的角度标签标记,例如:对所述第三局部图片集中的图片进行随机0°、90°、180°、270°旋转,且进行旋转角度标签标记,得到所述第二标准图片集。In the embodiment of the application, in order to improve the generalization ability of the subsequent model, the data processing method of self-supervised learning model training well known to those skilled in the art is used to adjust the angle and the corresponding angle of each picture in the third partial picture set. Label marking, for example, random 0°, 90°, 180°, 270° rotation of pictures in the third partial picture set, and label marking of rotation angles, to obtain the second standard picture set.
详细地,所述第二标准图片集中的图片具有双重标签,分别为所述初始标签及所述旋转角度标签,例如:图片A的初始标签为肺结核,旋转角度标签为90°。In detail, the pictures in the second standard picture set have double labels, which are the initial label and the rotation angle label. For example, the initial label of picture A is tuberculosis, and the rotation angle label is 90°.
本申请实施例中,所述第二深度学习模型可以为卷积神经网络模型、残差网络模型。In the embodiment of the present application, the second deep learning model may be a convolutional neural network model or a residual network model.
详细地,本申请实施例中,所述局部模型生成模块102利用如下手段对预构建的第二深度学习模型进行训练包括:In detail, in the embodiment of the present application, the local model generation module 102 uses the following means to train the pre-built second deep learning model including:
根据预设的第二损失函数及预设的第三损失函数进行权重计算,得到目标损失函数;Perform weight calculation according to the preset second loss function and the preset third loss function to obtain the target loss function;
详细地,本申请实施例中,所述第二标准图片集中的图片具有双重标签分别为所述初始标签与所述旋转角度标签,因此模型训练过程中会产生两种类别的预测结果,为了对所述两种类别的预测结果分别进行衡量,需要两个损失函数,分别为所述第二损失函数及所述第三损失函数,所述第二损失函数为所述初始标签对应的损失函数,所述第三损失函数为所述旋转角度标签对应的损失函数。In detail, in the embodiment of the present application, the pictures in the second standard picture set have double labels as the initial label and the rotation angle label. Therefore, two types of prediction results will be generated during the model training process. The prediction results of the two categories are measured separately, and two loss functions are required, namely the second loss function and the third loss function, and the second loss function is the loss function corresponding to the initial label, The third loss function is a loss function corresponding to the rotation angle label.
进一步地,为了很好的衡量模型的训练进度,根据预设的第二损失函数及预设的第三损失函数进行权重计算,所述权重计算可用以下公式表示:Further, in order to better measure the training progress of the model, the weight calculation is performed according to the preset second loss function and the preset third loss function, and the weight calculation can be expressed by the following formula:
所述权重计算可用以下公式表示:The weight calculation can be expressed by the following formula:
L=L tb+αL rot L=L tb +αL rot
其中,L为所述目标损失函数,L tb为所述第二损失函数;L rot为所述第三损失函数,α为预设权重系数。 Wherein, L is the target loss function, L tb is the second loss function; L rot is the third loss function, and α is a preset weight coefficient.
较佳地,所述权重系数为0.1。Preferably, the weight coefficient is 0.1.
根据所述目标损失函数,利用所述第二标准图片集对所述第二深度学习模型进行训练;当所述目标损失函数的值小于第二预设阈值时,停止训练,得到所述第二识别模型。According to the target loss function, use the second standard picture set to train the second deep learning model; when the value of the target loss function is less than a second preset threshold, stop training to obtain the second Identify the model.
所述图片识别模块103用于当接收到待识别图片时,利用所述第一识别模型及所述第二识别模型对所述待识别图片进行识别及结果判断,得到识别结果。The picture recognition module 103 is configured to, when a picture to be recognized is received, use the first recognition model and the second recognition model to recognize and determine the result of the picture to be recognized to obtain a recognition result.
本申请实施例中所述待识别图片与所述初始图片集中图片格式相同,较佳地,本申请实施例中所述待识别图片为医疗科技中的医学影像图片,如:病人胸部X光片。The format of the picture to be recognized in the embodiment of this application is the same as that of the picture in the initial picture collection. Preferably, the picture to be recognized in the embodiment of this application is a medical imaging picture in medical technology, such as a patient's chest X-ray. .
进一步地,本申请实施例所述图片识别模块103分别利用所述第一识别模型及所述第二识别模型对所述待识别图片进行识别,利用所述第一识别模型对所述待识别图片进行识别得到第一识别结果;利用所述第二识别模型对所述待识别图片进行识别得到第二识别结果,其中,所述第二识别结果中包括疾病识别结果及图片旋转角度结果,较佳地,本申请实施例中所述疾病识别结果为肺结核识别结果。Further, the picture recognition module 103 in the embodiment of the present application respectively uses the first recognition model and the second recognition model to recognize the picture to be recognized, and uses the first recognition model to recognize the picture to be recognized. Perform recognition to obtain a first recognition result; use the second recognition model to recognize the picture to be recognized to obtain a second recognition result, wherein the second recognition result includes a disease recognition result and a picture rotation angle result, preferably Specifically, the disease recognition result in the embodiment of the present application is the pulmonary tuberculosis recognition result.
进一步地,本申请实施例中所述图片识别模块103利用如下手段根据所述第一识别结果及所述第二识别结果进行逻辑运算,得到所述目标识别结果,其中,本申请实施例中所述逻辑运算为或、与两种逻辑运算,例如:所述第一识别结果为肺结核阳性,所述第二识别结果中的疾病识别结果中的疾病识别结果为肺结核阴性,或所述第一识别结果为肺结核阴性,所述第二识别结果中的疾病识别结果为肺结核阳性,则所述目标识别结果为肺结核阳性;当所述第一识别结果与所述第二识别结果中的疾病识别结果均为肺结核阴性,则所述目标识别结果为肺结核阴性;当所述当所述第一识别结果与所述第二识别结果中的疾病识别结果均为肺结核阳性,则所述目标识别结果为肺结核阳性。Further, the picture recognition module 103 in the embodiment of the present application uses the following means to perform logical operations according to the first recognition result and the second recognition result to obtain the target recognition result. The logical operation is two logical operations of OR and AND, for example: the first recognition result is positive for tuberculosis, the disease recognition result in the disease recognition result in the second recognition result is negative for tuberculosis, or the first recognition result The result is pulmonary tuberculosis negative, and the disease recognition result in the second recognition result is positive for pulmonary tuberculosis, then the target recognition result is positive for pulmonary tuberculosis; when the first recognition result and the disease recognition results in the second recognition result are both Is negative for pulmonary tuberculosis, the target recognition result is negative for pulmonary tuberculosis; when the disease recognition results in the first recognition result and the second recognition result are both positive for pulmonary tuberculosis, the target recognition result is positive for pulmonary tuberculosis .
本申请的另一实施例中,为了保证数据的隐私性,所述待识别图片可以存储在区块链中。In another embodiment of the present application, in order to ensure the privacy of the data, the picture to be identified may be stored in the blockchain.
如图4所示,是本申请实现图片识别方法的电子设备的结构示意图。As shown in FIG. 4, it is a schematic diagram of the structure of an electronic device that implements the image recognition method of the present application.
所述电子设备1可以包括处理器10、存储器11和总线,还可以包括存储在所述存储器11中并可在所述处理器10上运行的计算机程序,如图片识别程序。The electronic device 1 may include a processor 10, a memory 11, and a bus, and may also include a computer program stored in the memory 11 and running on the processor 10, such as a picture recognition program.
其中,所述存储器11可以是易失性的,也可以是非易失性的,所述存储器11至少包括一种类型的可读存储介质,所述可读存储介质包括闪存、移动硬盘、多媒体卡、卡型存储器(例如:SD或DX存储器等)、磁性存储器、磁盘、光盘等。所述存储器11在一些实施例中可以是电子设备1的内部存储单元,例如该电子设备1的移动硬盘。所述存储器11在另一些实施例中也可以是电子设备1的外部存储设备,例如电子设备1上配备的插接式移动硬盘、智能存储卡(Smart Media Card,SMC)、安全数字(Secure Digital,SD)卡、闪存卡(Flash Card)等。进一步地,所述存储器11还可以既包括电子设备1的内部存储单元也包括外部存储设备。所述存储器11不仅可以用于存储安装于电子设备1的应用软件及各类数据,例如图片识别程序的代码等,还可以用于暂时地存储已经输出或者将要输出的数据。The memory 11 may be volatile or non-volatile. The memory 11 includes at least one type of readable storage medium. The readable storage medium includes flash memory, mobile hard disk, and multimedia card. , Card-type memory (for example: SD or DX memory, etc.), magnetic memory, magnetic disk, optical disk, etc. The memory 11 may be an internal storage unit of the electronic device 1 in some embodiments, for example, a mobile hard disk of the electronic device 1. In other embodiments, the memory 11 may also be an external storage device of the electronic device 1, such as a plug-in mobile hard disk, a smart media card (SMC), and a secure digital (Secure Digital) equipped on the electronic device 1. , SD) card, flash card (Flash Card), etc. Further, the memory 11 may also include both an internal storage unit of the electronic device 1 and an external storage device. The memory 11 can be used not only to store application software and various data installed in the electronic device 1, such as the code of a picture recognition program, etc., but also to temporarily store data that has been output or will be output.
所述处理器10在一些实施例中可以由集成电路组成,例如可以由单个封装的集成电路所组成,也可以是由多个相同功能或不同功能封装的集成电路所组成,包括一个或者多个中央处理器(Central Processing unit,CPU)、微处理器、数字处理芯片、图形处理器及各种控制芯片的组合等。所述处理器10是所述电子设备的控制核心(Control Unit),利用各种接口和线路连接整个电子设备的各个部件,通过运行或执行存储在所述存储器11内的程序或者模块(例如图片识别程序等),以及调用存储在所述存储器11内的数据,以执行电子设备1的各种功能和处理数据。The processor 10 may be composed of integrated circuits in some embodiments, for example, may be composed of a single packaged integrated circuit, or may be composed of multiple integrated circuits with the same function or different functions, including one or more Combinations of central processing unit (CPU), microprocessor, digital processing chip, graphics processor, and various control chips, etc. The processor 10 is the control unit of the electronic device, which uses various interfaces and lines to connect the various components of the entire electronic device, and runs or executes programs or modules (such as pictures) stored in the memory 11 Recognition program, etc.), and call the data stored in the memory 11 to execute various functions of the electronic device 1 and process data.
所述总线可以是外设部件互连标准(peripheral component interconnect,简称PCI)总线或扩展工业标准结构(extended industry standard architecture,简称EISA)总线等。该总线可以分为地址总线、数据总线、控制总线等。所述总线被设置为实现所述存储器11以及至少一个处理器10等之间的连接通信。The bus may be a peripheral component interconnect standard (PCI) bus or an extended industry standard architecture (EISA) bus, etc. The bus can be divided into address bus, data bus, control bus and so on. The bus is configured to implement connection and communication between the memory 11 and at least one processor 10 and the like.
图4仅示出了具有部件的电子设备,本领域技术人员可以理解的是,图3示出的结构并不构成对所述电子设备1的限定,可以包括比图示更少或者更多的部件,或者组合某些部件,或者不同的部件布置。FIG. 4 only shows an electronic device with components. Those skilled in the art can understand that the structure shown in FIG. 3 does not constitute a limitation on the electronic device 1, and may include fewer or more components than shown in the figure. Components, or combinations of certain components, or different component arrangements.
例如,尽管未示出,所述电子设备1还可以包括给各个部件供电的电源(比如电池),优选地,电源可以通过电源管理装置与所述至少一个处理器10逻辑相连,从而通过电源管理装置实现充电管理、放电管理、以及功耗管理等功能。电源还可以包括一个或一个以上的直流或交流电源、再充电装置、电源故障检测电路、电源转换器或者逆变器、电源状态指示器等任意组件。所述电子设备1还可以包括多种传感器、蓝牙模块、Wi-Fi模块等,在此不再赘述。For example, although not shown, the electronic device 1 may also include a power source (such as a battery) for supplying power to various components. Preferably, the power source may be logically connected to the at least one processor 10 through a power management device, thereby controlling power The device implements functions such as charge management, discharge management, and power consumption management. The power supply may also include any components such as one or more DC or AC power supplies, recharging devices, power failure detection circuits, power converters or inverters, and power status indicators. The electronic device 1 may also include various sensors, Bluetooth modules, Wi-Fi modules, etc., which will not be repeated here.
进一步地,所述电子设备1还可以包括网络接口,可选地,所述网络接口可以包括有线接口和/或无线接口(如WI-FI接口、蓝牙接口等),通常用于在该电子设备1与其他电子设备之间建立通信连接。Further, the electronic device 1 may also include a network interface. Optionally, the network interface may include a wired interface and/or a wireless interface (such as a Wi-Fi interface, a Bluetooth interface, etc.), which is usually used in the electronic device 1 Establish a communication connection with other electronic devices.
可选地,该电子设备1还可以包括用户接口,用户接口可以是显示器(Display)、输入单元(比如键盘(Keyboard)),可选地,用户接口还可以是标准的有线接口、无线接口。可选地,在一些实施例中,显示器可以是LED显示器、液晶显示器、触控式液晶显示器以及OLED(Organic Light-Emitting Diode,有机发光二极管)触摸器等。其中,显示器也可以适当的称为显示屏或显示单元,用于显示在电子设备1中处理的信息以及用于显示可视化的用户界面。Optionally, the electronic device 1 may also include a user interface. The user interface may be a display (Display) and an input unit (such as a keyboard (Keyboard)). Optionally, the user interface may also be a standard wired interface or a wireless interface. Optionally, in some embodiments, the display may be an LED display, a liquid crystal display, a touch liquid crystal display, an OLED (Organic Light-Emitting Diode, organic light-emitting diode) touch device, etc. Among them, the display can also be appropriately called a display screen or a display unit, which is used to display the information processed in the electronic device 1 and to display a visualized user interface.
应该了解,所述实施例仅为说明之用,在专利申请范围上并不受此结构的限制。It should be understood that the embodiments are only for illustrative purposes, and are not limited by this structure in the scope of the patent application.
所述电子设备1中的所述存储器11存储的图片识别程序12是多个指令的组合,在所述处理器10中运行时,可以实现:The picture recognition program 12 stored in the memory 11 in the electronic device 1 is a combination of multiple instructions. When running in the processor 10, it can realize:
获取初始图片集,对所述初始图片集进行全局目标区域裁剪转换及像素归一化处理,得到第一标准图片集;Acquiring an initial picture set, and performing global target area cropping conversion and pixel normalization processing on the initial picture set to obtain a first standard picture set;
利用所述第一标准图片集对预构建的第一深度学习模型进行训练,得到第一识别模型;Training the pre-built first deep learning model by using the first standard picture set to obtain the first recognition model;
对所述初始图片集进行局部目标区域裁剪转换、数据增强及像素归一化处理,得到第二标准图片集;Performing local target region cropping conversion, data enhancement and pixel normalization processing on the initial picture set to obtain a second standard picture set;
利用所述第二标准图片集对预构建的第二深度学习模型进行训练,得到第二识别模型;Training the pre-built second deep learning model by using the second standard picture set to obtain a second recognition model;
当接收到待识别图片时,利用所述第一识别模型及所述第二识别模型对所述待识别图片进行识别及结果判断,得到识别结果。When the picture to be recognized is received, the first recognition model and the second recognition model are used to recognize and determine the result of the picture to be recognized to obtain a recognition result.
具体地,所述处理器10对上述指令的具体实现方法可参考图1对应实施例中相关步骤的描述,在此不赘述。Specifically, for the specific implementation method of the above-mentioned instructions by the processor 10, reference may be made to the description of the relevant steps in the embodiment corresponding to FIG. 1, which will not be repeated here.
进一步地,所述电子设备1集成的模块/单元如果以软件功能单元的形式实现并作为独立的产品销售或使用时,可以存储在一个计算机可读取存储介质中,所述计算机可读存储介质可以是易失性的,也可以是非易失性的,。所述计算机可读介质可以包括:能够携带所述计算机程序代码的任何实体或装置、记录介质、U盘、移动硬盘、磁碟、光盘、计算机存储器、只读存储器(ROM,Read-Only Memory)。Further, if the integrated module/unit of the electronic device 1 is implemented in the form of a software functional unit and sold or used as an independent product, it can be stored in a computer-readable storage medium. It can be volatile or non-volatile. The computer-readable medium may include: any entity or device capable of carrying the computer program code, recording medium, U disk, mobile hard disk, magnetic disk, optical disk, computer memory, read-only memory (ROM, Read-Only Memory) .
进一步地,所述计算机可用存储介质可主要包括存储程序区和存储数据区,其中,存储程序区可存储操作系统、至少一个功能所需的应用程序等;存储数据区可存储根据区块链节点的使用所创建的数据等。Further, the computer usable storage medium may mainly include a storage program area and a storage data area, wherein the storage program area may store an operating system, an application program required by at least one function, etc.; the storage data area may store a block chain node Use the created data, etc.
在本申请所提供的几个实施例中,应该理解到,所揭露的设备,装置和方法,可以通过其它的方式实现。例如,以上所描述的装置实施例仅仅是示意性的,例如,所述模块的划分,仅仅为一种逻辑功能划分,实际实现时可以有另外的划分方式。In the several embodiments provided in this application, it should be understood that the disclosed equipment, device, and method may be implemented in other ways. For example, the device embodiments described above are merely illustrative. For example, the division of the modules is only a logical function division, and there may be other division methods in actual implementation.
所述作为分离部件说明的模块可以是或者也可以不是物理上分开的,作为模块显示的部件可以是或者也可以不是物理单元,即可以位于一个地方,或者也可以分布到多个网络单元上。可以根据实际的需要选择其中的部分或者全部模块来实现本实施例方案的目标。The modules described as separate components may or may not be physically separated, and the components displayed as modules may or may not be physical units, that is, they may be located in one place, or they may be distributed on multiple network units. Some or all of the modules can be selected according to actual needs to achieve the goals of the solutions of the embodiments.
另外,在本申请各个实施例中的各功能模块可以集成在一个处理单元中,也可以是各个单元单独物理存在,也可以两个或两个以上单元集成在一个单元中。上述集成的单元既可以采用硬件的形式实现,也可以采用硬件加软件功能模块的形式实现。In addition, the functional modules in the various embodiments of the present application may be integrated into one processing unit, or each unit may exist alone physically, or two or more units may be integrated into one unit. The above-mentioned integrated unit may be implemented in the form of hardware, or may be implemented in the form of hardware plus software functional modules.
对于本领域技术人员而言,显然本申请不限于上述示范性实施例的细节,而且在不背离本申请的精神或基本特征的情况下,能够以其他的具体形式实现本申请。For those skilled in the art, it is obvious that the present application is not limited to the details of the foregoing exemplary embodiments, and the present application can be implemented in other specific forms without departing from the spirit or basic characteristics of the application.
因此,无论从哪一点来看,均应将实施例看作是示范性的,而且是非限制性的,本申请的范围由所附权利要求而不是上述说明限定,因此旨在将落在权利要求的等同要件的含义和范围内的所有变化涵括在本申请内。不应将权利要求中的任何附关联图标记视为限制所涉及的权利要求。Therefore, no matter from which point of view, the embodiments should be regarded as exemplary and non-limiting. The scope of this application is defined by the appended claims rather than the above description, and therefore it is intended to fall into the claims. All changes in the meaning and scope of the equivalent elements of are included in this application. Any associated diagram marks in the claims should not be regarded as limiting the claims involved.
本申请所指区块链是分布式数据存储、点对点传输、共识机制、加密算法等计算机技术的新型应用模式。区块链(Blockchain),本质上是一个去中心化的数据库,是一串使用密码学方法相关联产生的数据块,每一个数据块中包含了一批次网络交易的信息,用于验证其信息的有效性(防伪)和生成下一个区块。区块链可以包括区块链底层平台、平台产品服务层以及应用服务层等。The blockchain referred to in this application is a new application mode of computer technology such as distributed data storage, point-to-point transmission, consensus mechanism, and encryption algorithm. Blockchain, essentially a decentralized database, is a series of data blocks associated with cryptographic methods. Each data block contains a batch of network transaction information for verification. The validity of the information (anti-counterfeiting) and the generation of the next block. The blockchain can include the underlying platform of the blockchain, the platform product service layer, and the application service layer.
此外,显然“包括”一词不排除其他单元或步骤,单数不排除复数。系统权利要求中陈述的多个单元或装置也可以由一个单元或装置通过软件或者硬件来实现。第二等词语用来表示名称,而并不表示任何特定的顺序。In addition, it is obvious that the word "including" does not exclude other units or steps, and the singular does not exclude the plural. Multiple units or devices stated in the system claims can also be implemented by one unit or device through software or hardware. The second class words are used to indicate names, and do not indicate any specific order.
最后应说明的是,以上实施例仅用以说明本申请的技术方案而非限制,尽管参照较佳实施例对本申请进行了详细说明,本领域的普通技术人员应当理解,可以对本申请的技术方案进行修改或等同替换,而不脱离本申请技术方案的精神和范围。Finally, it should be noted that the above embodiments are only used to illustrate the technical solutions of the application and not to limit them. Although the application has been described in detail with reference to the preferred embodiments, those of ordinary skill in the art should understand that the technical solutions of the application can be Make modifications or equivalent replacements without departing from the spirit and scope of the technical solution of the present application.

Claims (20)

  1. 一种图片识别方法,其中,所述方法包括:A picture recognition method, wherein the method includes:
    获取初始图片集,对所述初始图片集进行全局目标区域裁剪转换及像素归一化处理,得到第一标准图片集;Acquiring an initial picture set, and performing global target area cropping conversion and pixel normalization processing on the initial picture set to obtain a first standard picture set;
    利用所述第一标准图片集对预构建的第一深度学习模型进行训练,得到第一识别模型;Training the pre-built first deep learning model by using the first standard picture set to obtain the first recognition model;
    对所述初始图片集进行局部目标区域裁剪转换、数据增强及像素归一化处理,得到第二标准图片集;Performing local target region cropping conversion, data enhancement and pixel normalization processing on the initial picture set to obtain a second standard picture set;
    利用所述第二标准图片集对预构建的第二深度学习模型进行训练,得到第二识别模型;Training the pre-built second deep learning model by using the second standard picture set to obtain a second recognition model;
    当接收到待识别图片时,利用所述第一识别模型及所述第二识别模型对所述待识别图片进行识别及结果判断,得到识别结果。When the picture to be recognized is received, the first recognition model and the second recognition model are used to recognize and determine the result of the picture to be recognized to obtain a recognition result.
  2. 如权利要求1所述的图片识别方法,其中,所述对所述初始图片集进行全局目标区域裁剪转换及像素归一化处理,得到第一标准图片集,包括:8. The picture recognition method according to claim 1, wherein said performing global target area cropping conversion and pixel normalization processing on said initial picture set to obtain a first standard picture set comprises:
    裁剪所述初始图片集中每张图片的第一感兴趣区域,得到第一全局图片集;Crop the first region of interest of each picture in the initial picture set to obtain the first global picture set;
    将所述第一全局图片集中的每张图片填充插值为预设大小,得到第二全局图片集;Filling and interpolating each picture in the first global picture set to a preset size to obtain a second global picture set;
    对所述第二全局图片集中的每张图片中的每个像素值进行归一化处理,得到所述第一标准图片集。Perform normalization processing on each pixel value in each picture in the second global picture set to obtain the first standard picture set.
  3. 如权利要求1所述的图片识别方法,其中,所述对所述初始图片集进行局部目标区域裁剪转换、数据增强及像素归一化处理,得到第二标准图片集,包括:8. The picture recognition method according to claim 1, wherein said performing partial target region cropping conversion, data enhancement and pixel normalization processing on said initial picture set to obtain a second standard picture set comprises:
    裁剪所述初始图片集中每张图片的第二感兴趣区域,得到初始局部图片集;Crop the second region of interest of each picture in the initial picture set to obtain the initial partial picture set;
    根据所述初始图片集中每张图片对应的初始标签对所述初始局部图片集中对应的图片进行标记,得到第一局部图片集;Marking the corresponding pictures in the initial partial picture set according to the initial label corresponding to each picture in the initial picture set to obtain the first partial picture set;
    将所述第一局部图片集中的图片填充插值为预设大小,得到第二局部图片集;Filling and interpolating pictures in the first partial picture set to a preset size to obtain a second partial picture set;
    对所述第二局部图片集中的每张图片中的每个像素值进行归一化处理,得到第三局部图片集;Performing normalization processing on each pixel value in each picture in the second partial picture set to obtain a third partial picture set;
    对所述第三局部图片集中的每张图片进行预设角度旋转,以对应的旋转角度进行标签标记,得到所述第二标准图片集。Rotate each picture in the third partial picture set by a preset angle, and mark it with a corresponding rotation angle to obtain the second standard picture set.
  4. 如权利要求3所述的图片识别方法,其中,所述利用所述第一标准图片集对预构建的第一深度学习模型进行训练,得到第一识别模型,包括:5. The picture recognition method according to claim 3, wherein said training a pre-built first deep learning model using said first standard picture set to obtain a first recognition model comprises:
    步骤A:根据预设的卷积池化次数,对所述第一标准图片集进行卷积池化操作,得到特征集;Step A: Perform a convolution pooling operation on the first standard picture set according to the preset number of convolution pooling times to obtain a feature set;
    步骤B:利用预设的激活函数对所述特征集进行计算,得到预测值,获取所述第一标准图片集中每张图片对应的所述初始标签的标签值,根据所述预测值及所述标签值,利用预构建的第一损失函数进行计算,得到第一损失值;Step B: Calculate the feature set by using a preset activation function to obtain a predicted value, obtain the label value of the initial label corresponding to each picture in the first standard picture set, according to the predicted value and the The label value is calculated by using the pre-built first loss function to obtain the first loss value;
    步骤C:对比所述第一损失值与预设的第一损失阈值的大小,当所述第一损失值大于或等于所述第一预设阈值时,返回所述步骤A;当所述第一损失值小于所述第一预设阈值时,停止训练,得到所述第一识别模型。Step C: Compare the magnitude of the first loss value with the preset first loss threshold, and when the first loss value is greater than or equal to the first preset threshold, return to the step A; When a loss value is less than the first preset threshold, stop training to obtain the first recognition model.
  5. 如权利要求1所述的图片识别方法,其中,所述利用所述第二标准图片集对预构建的第二深度学习模型进行训练,得到第二识别模型,包括:5. The picture recognition method according to claim 1, wherein said training a pre-built second deep learning model using said second standard picture set to obtain a second recognition model comprises:
    根据预设的第二损失函数及预设的第三损失函数进行权重计算,得到目标损失函数;Perform weight calculation according to the preset second loss function and the preset third loss function to obtain the target loss function;
    根据所述目标损失函数,利用所述第二标准图片集对所述第二深度学习模型进行训练;Training the second deep learning model by using the second standard picture set according to the target loss function;
    当所述目标损失函数的值小于第二预设阈值时,停止训练,得到所述第二识别模型。When the value of the target loss function is less than the second preset threshold, the training is stopped to obtain the second recognition model.
  6. 如权利要求1所述的图片识别方法,其中,所述利用所述第一识别模型及所述第二识别模型对所述待识别图片进行识别及结果判断,得到目标识别结果,包括:8. The picture recognition method of claim 1, wherein said using said first recognition model and said second recognition model to recognize and determine the result of the picture to be recognized to obtain a target recognition result comprises:
    利用所述第一识别模型对所述待识别图片进行识别,得到第一识别结果;Recognizing the picture to be recognized by using the first recognition model to obtain a first recognition result;
    利用所述第二识别模型对所述待识别图片进行识别,得到第二识别结果;Recognizing the picture to be recognized by using the second recognition model to obtain a second recognition result;
    根据所述第一识别结果及所述第二识别结果进行逻辑运算,得到所述目标识别结果。Perform logical operations according to the first recognition result and the second recognition result to obtain the target recognition result.
  7. 如权利要求1至6中任何一项所述的图片识别方法,其中,所述初始图片集为病人胸部X光图片集合,所述第一标准图片集为全肺区域图片集合,所述第二标准图片集为上肺部区域图片集合。The picture recognition method according to any one of claims 1 to 6, wherein the initial picture set is a patient chest X-ray picture set, the first standard picture set is a whole lung area picture set, and the second The standard picture set is a picture set of the upper lung area.
  8. 一种图片识别装置,其中,所述装置包括:A picture recognition device, wherein the device includes:
    全局模型生成模块,用于获取初始图片集,对所述初始图片集进行全局目标区域裁剪转换及像素归一化处理,得到第一标准图片集;利用所述第一标准图片集对预构建的第一深度学习模型进行训练,得到第一识别模型;The global model generation module is used to obtain an initial picture set, perform global target area cropping conversion and pixel normalization processing on the initial picture set to obtain a first standard picture set; The first deep learning model is trained to obtain the first recognition model;
    局部模型生成模块,用于对所述初始图片集进行局部目标区域裁剪转换、数据增强及像素归一化处理,得到第二标准图片集;利用所述第二标准图片集对预构建的第二深度学习模型进行训练,得到第二识别模型;The local model generation module is used to perform local target area cropping conversion, data enhancement, and pixel normalization processing on the initial picture set to obtain a second standard picture set; use the second standard picture set to compare the pre-built second The deep learning model is trained to obtain the second recognition model;
    图片识别模块,用于当接收到待识别图片时,利用所述第一识别模型及所述第二识别模型对所述待识别图片进行识别及结果判断,得到识别结果。The picture recognition module is configured to, when the picture to be recognized is received, use the first recognition model and the second recognition model to recognize and determine the result of the picture to be recognized to obtain a recognition result.
  9. 一种电子设备,其中,所述电子设备包括:An electronic device, wherein the electronic device includes:
    至少一个处理器;以及,At least one processor; and,
    与所述至少一个处理器通信连接的存储器;其中,A memory communicatively connected with the at least one processor; wherein,
    所述存储器存储有可被所述至少一个处理器执行的计算机程序指令,所述计算机程序指令被所述至少一个处理器执行,以使所述至少一个处理器能够执行如下步骤:The memory stores computer program instructions executable by the at least one processor, and the computer program instructions are executed by the at least one processor, so that the at least one processor can execute the following steps:
    获取初始图片集,对所述初始图片集进行全局目标区域裁剪转换及像素归一化处理,得到第一标准图片集;Acquiring an initial picture set, and performing global target area cropping conversion and pixel normalization processing on the initial picture set to obtain a first standard picture set;
    利用所述第一标准图片集对预构建的第一深度学习模型进行训练,得到第一识别模型;Training the pre-built first deep learning model by using the first standard picture set to obtain the first recognition model;
    对所述初始图片集进行局部目标区域裁剪转换、数据增强及像素归一化处理,得到第二标准图片集;Performing local target region cropping conversion, data enhancement and pixel normalization processing on the initial picture set to obtain a second standard picture set;
    利用所述第二标准图片集对预构建的第二深度学习模型进行训练,得到第二识别模型;Training the pre-built second deep learning model by using the second standard picture set to obtain a second recognition model;
    当接收到待识别图片时,利用所述第一识别模型及所述第二识别模型对所述待识别图片进行识别及结果判断,得到识别结果。When the picture to be recognized is received, the first recognition model and the second recognition model are used to recognize and determine the result of the picture to be recognized to obtain a recognition result.
  10. 如权利要求9所述的电子设备,其中,所述对所述初始图片集进行全局目标区域裁剪转换及像素归一化处理,得到第一标准图片集,包括:9. The electronic device according to claim 9, wherein said performing global target area cropping conversion and pixel normalization processing on said initial picture set to obtain a first standard picture set comprises:
    裁剪所述初始图片集中每张图片的第一感兴趣区域,得到第一全局图片集;Crop the first region of interest of each picture in the initial picture set to obtain the first global picture set;
    将所述第一全局图片集中的每张图片填充插值为预设大小,得到第二全局图片集;Filling and interpolating each picture in the first global picture set to a preset size to obtain a second global picture set;
    对所述第二全局图片集中的每张图片中的每个像素值进行归一化处理,得到所述第一标准图片集。Perform normalization processing on each pixel value in each picture in the second global picture set to obtain the first standard picture set.
  11. 如权利要求9所述的电子设备,其中,所述对所述初始图片集进行局部目标区域裁剪转换、数据增强及像素归一化处理,得到第二标准图片集,包括:9. The electronic device according to claim 9, wherein said performing local target region cropping conversion, data enhancement and pixel normalization processing on said initial picture set to obtain a second standard picture set comprises:
    裁剪所述初始图片集中每张图片的第二感兴趣区域,得到初始局部图片集;Crop the second region of interest of each picture in the initial picture set to obtain the initial partial picture set;
    根据所述初始图片集中每张图片对应的初始标签对所述初始局部图片集中对应的图片进行标记,得到第一局部图片集;Marking the corresponding pictures in the initial partial picture set according to the initial label corresponding to each picture in the initial picture set to obtain the first partial picture set;
    将所述第一局部图片集中的图片填充插值为预设大小,得到第二局部图片集;Filling and interpolating pictures in the first partial picture set to a preset size to obtain a second partial picture set;
    对所述第二局部图片集中的每张图片中的每个像素值进行归一化处理,得到第三局部图片集;Performing normalization processing on each pixel value in each picture in the second partial picture set to obtain a third partial picture set;
    对所述第三局部图片集中的每张图片进行预设角度旋转,以对应的旋转角度进行标签标记,得到所述第二标准图片集。Rotate each picture in the third partial picture set by a preset angle, and mark it with a corresponding rotation angle to obtain the second standard picture set.
  12. 如权利要求11所述的电子设备,其中,所述利用所述第一标准图片集对预构建的第一深度学习模型进行训练,得到第一识别模型,包括:11. The electronic device according to claim 11, wherein said training a pre-built first deep learning model using said first standard picture set to obtain a first recognition model comprises:
    步骤A:根据预设的卷积池化次数,对所述第一标准图片集进行卷积池化操作,得到 特征集;Step A: Perform a convolution pooling operation on the first standard picture set according to the preset number of convolution pooling times to obtain a feature set;
    步骤B:利用预设的激活函数对所述特征集进行计算,得到预测值,获取所述第一标准图片集中每张图片对应的所述初始标签的标签值,根据所述预测值及所述标签值,利用预构建的第一损失函数进行计算,得到第一损失值;Step B: Calculate the feature set by using a preset activation function to obtain a predicted value, obtain the label value of the initial label corresponding to each picture in the first standard picture set, according to the predicted value and the The label value is calculated by using the pre-built first loss function to obtain the first loss value;
    步骤C:对比所述第一损失值与预设的第一损失阈值的大小,当所述第一损失值大于或等于所述第一预设阈值时,返回所述步骤A;当所述第一损失值小于所述第一预设阈值时,停止训练,得到所述第一识别模型。Step C: Compare the magnitude of the first loss value with the preset first loss threshold, and when the first loss value is greater than or equal to the first preset threshold, return to the step A; When a loss value is less than the first preset threshold, stop training to obtain the first recognition model.
  13. 如权利要求9所述的电子设备,其中,所述利用所述第二标准图片集对预构建的第二深度学习模型进行训练,得到第二识别模型,包括:9. The electronic device according to claim 9, wherein said training a pre-built second deep learning model using said second standard picture set to obtain a second recognition model comprises:
    根据预设的第二损失函数及预设的第三损失函数进行权重计算,得到目标损失函数;Perform weight calculation according to the preset second loss function and the preset third loss function to obtain the target loss function;
    根据所述目标损失函数,利用所述第二标准图片集对所述第二深度学习模型进行训练;Training the second deep learning model by using the second standard picture set according to the target loss function;
    当所述目标损失函数的值小于第二预设阈值时,停止训练,得到所述第二识别模型。When the value of the target loss function is less than the second preset threshold, the training is stopped to obtain the second recognition model.
  14. 如权利要求9所述的电子设备,其中,所述利用所述第一识别模型及所述第二识别模型对所述待识别图片进行识别及结果判断,得到目标识别结果,包括:9. The electronic device according to claim 9, wherein said using said first recognition model and said second recognition model to recognize and determine the result of said image to be recognized to obtain a target recognition result comprises:
    利用所述第一识别模型对所述待识别图片进行识别,得到第一识别结果;Recognizing the picture to be recognized by using the first recognition model to obtain a first recognition result;
    利用所述第二识别模型对所述待识别图片进行识别,得到第二识别结果;Recognizing the picture to be recognized by using the second recognition model to obtain a second recognition result;
    根据所述第一识别结果及所述第二识别结果进行逻辑运算,得到所述目标识别结果。Perform logical operations according to the first recognition result and the second recognition result to obtain the target recognition result.
  15. 如权利要求9至14中任何一项所述的电子设备,其中,所述初始图片集为病人胸部X光图片集合,所述第一标准图片集为全肺区域图片集合,所述第二标准图片集为上肺部区域图片集合。The electronic device according to any one of claims 9 to 14, wherein the initial picture set is a patient's chest X-ray picture set, the first standard picture set is a whole lung area picture set, and the second standard picture set is a whole lung area picture set. The picture collection is a collection of pictures of the upper lung area.
  16. 一种计算机可读存储介质,存储有计算机程序,其中,所述计算机程序被处理器执行时实现如下步骤:A computer-readable storage medium storing a computer program, wherein the computer program is executed by a processor to implement the following steps:
    获取初始图片集,对所述初始图片集进行全局目标区域裁剪转换及像素归一化处理,得到第一标准图片集;Acquiring an initial picture set, and performing global target area cropping conversion and pixel normalization processing on the initial picture set to obtain a first standard picture set;
    利用所述第一标准图片集对预构建的第一深度学习模型进行训练,得到第一识别模型;Training the pre-built first deep learning model by using the first standard picture set to obtain the first recognition model;
    对所述初始图片集进行局部目标区域裁剪转换、数据增强及像素归一化处理,得到第二标准图片集;Performing local target region cropping conversion, data enhancement and pixel normalization processing on the initial picture set to obtain a second standard picture set;
    利用所述第二标准图片集对预构建的第二深度学习模型进行训练,得到第二识别模型;Training the pre-built second deep learning model by using the second standard picture set to obtain a second recognition model;
    当接收到待识别图片时,利用所述第一识别模型及所述第二识别模型对所述待识别图片进行识别及结果判断,得到识别结果。When the picture to be recognized is received, the first recognition model and the second recognition model are used to recognize and determine the result of the picture to be recognized to obtain a recognition result.
  17. 如权利要求16所述的计算机可读存储介质,其中,所述对所述初始图片集进行全局目标区域裁剪转换及像素归一化处理,得到第一标准图片集,包括:15. The computer-readable storage medium according to claim 16, wherein said performing global target area cropping conversion and pixel normalization processing on said initial picture set to obtain a first standard picture set comprises:
    裁剪所述初始图片集中每张图片的第一感兴趣区域,得到第一全局图片集;Crop the first region of interest of each picture in the initial picture set to obtain the first global picture set;
    将所述第一全局图片集中的每张图片填充插值为预设大小,得到第二全局图片集;Filling and interpolating each picture in the first global picture set to a preset size to obtain a second global picture set;
    对所述第二全局图片集中的每张图片中的每个像素值进行归一化处理,得到所述第一标准图片集。Perform normalization processing on each pixel value in each picture in the second global picture set to obtain the first standard picture set.
  18. 如权利要求16所述的计算机可读存储介质,其中,所述对所述初始图片集进行局部目标区域裁剪转换、数据增强及像素归一化处理,得到第二标准图片集,包括:16. The computer-readable storage medium of claim 16, wherein the performing partial target region cropping conversion, data enhancement, and pixel normalization processing on the initial picture set to obtain a second standard picture set comprises:
    裁剪所述初始图片集中每张图片的第二感兴趣区域,得到初始局部图片集;Crop the second region of interest of each picture in the initial picture set to obtain the initial partial picture set;
    根据所述初始图片集中每张图片对应的初始标签对所述初始局部图片集中对应的图片进行标记,得到第一局部图片集;Marking the corresponding pictures in the initial partial picture set according to the initial label corresponding to each picture in the initial picture set to obtain the first partial picture set;
    将所述第一局部图片集中的图片填充插值为预设大小,得到第二局部图片集;Filling and interpolating pictures in the first partial picture set to a preset size to obtain a second partial picture set;
    对所述第二局部图片集中的每张图片中的每个像素值进行归一化处理,得到第三局部图片集;Performing normalization processing on each pixel value in each picture in the second partial picture set to obtain a third partial picture set;
    对所述第三局部图片集中的每张图片进行预设角度旋转,以对应的旋转角度进行标签 标记,得到所述第二标准图片集。Each picture in the third partial picture set is rotated by a preset angle, and a label is marked with a corresponding rotation angle to obtain the second standard picture set.
  19. 如权利要求18所述的计算机可读存储介质,其中,所述利用所述第一标准图片集对预构建的第一深度学习模型进行训练,得到第一识别模型,包括:18. The computer-readable storage medium according to claim 18, wherein the training of the pre-built first deep learning model by using the first standard picture set to obtain the first recognition model comprises:
    步骤A:根据预设的卷积池化次数,对所述第一标准图片集进行卷积池化操作,得到特征集;Step A: Perform a convolution pooling operation on the first standard picture set according to the preset number of convolution pooling times to obtain a feature set;
    步骤B:利用预设的激活函数对所述特征集进行计算,得到预测值,获取所述第一标准图片集中每张图片对应的所述初始标签的标签值,根据所述预测值及所述标签值,利用预构建的第一损失函数进行计算,得到第一损失值;Step B: Calculate the feature set by using a preset activation function to obtain a predicted value, obtain the label value of the initial label corresponding to each picture in the first standard picture set, according to the predicted value and the The label value is calculated by using the pre-built first loss function to obtain the first loss value;
    步骤C:对比所述第一损失值与预设的第一损失阈值的大小,当所述第一损失值大于或等于所述第一预设阈值时,返回所述步骤A;当所述第一损失值小于所述第一预设阈值时,停止训练,得到所述第一识别模型。Step C: Compare the magnitude of the first loss value with the preset first loss threshold, and when the first loss value is greater than or equal to the first preset threshold, return to the step A; When a loss value is less than the first preset threshold, stop training to obtain the first recognition model.
  20. 如权利要求16所述的计算机可读存储介质,其中,所述利用所述第二标准图片集对预构建的第二深度学习模型进行训练,得到第二识别模型,包括:15. The computer-readable storage medium according to claim 16, wherein said training a pre-built second deep learning model using said second standard picture set to obtain a second recognition model comprises:
    根据预设的第二损失函数及预设的第三损失函数进行权重计算,得到目标损失函数;Perform weight calculation according to the preset second loss function and the preset third loss function to obtain the target loss function;
    根据所述目标损失函数,利用所述第二标准图片集对所述第二深度学习模型进行训练;Training the second deep learning model by using the second standard picture set according to the target loss function;
    当所述目标损失函数的值小于第二预设阈值时,停止训练,得到所述第二识别模型。When the value of the target loss function is less than the second preset threshold, the training is stopped to obtain the second recognition model.
PCT/CN2020/131990 2020-09-24 2020-11-27 Image recognition method and apparatus, and electronic device and computer-readable storage medium WO2021189910A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202011015349.4A CN111932564B (en) 2020-09-24 2020-09-24 Picture identification method and device, electronic equipment and computer readable storage medium
CN202011015349.4 2020-09-24

Publications (1)

Publication Number Publication Date
WO2021189910A1 true WO2021189910A1 (en) 2021-09-30

Family

ID=73334154

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2020/131990 WO2021189910A1 (en) 2020-09-24 2020-11-27 Image recognition method and apparatus, and electronic device and computer-readable storage medium

Country Status (2)

Country Link
CN (1) CN111932564B (en)
WO (1) WO2021189910A1 (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114511569A (en) * 2022-04-20 2022-05-17 中南大学湘雅医院 Tumor marker-based medical image identification method, device, equipment and medium
CN115564656A (en) * 2022-11-11 2023-01-03 成都智元汇信息技术股份有限公司 Multi-graph merging and graph recognizing method and device based on scheduling
CN116129206A (en) * 2023-04-14 2023-05-16 吉林大学 Processing method and device for image decoupling characterization learning and electronic equipment

Families Citing this family (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111932564B (en) * 2020-09-24 2021-03-02 平安科技(深圳)有限公司 Picture identification method and device, electronic equipment and computer readable storage medium
CN112951233A (en) * 2021-03-30 2021-06-11 平安科技(深圳)有限公司 Voice question and answer method and device, electronic equipment and readable storage medium
CN113469296A (en) * 2021-09-03 2021-10-01 广东众聚人工智能科技有限公司 Image classification method and system based on small amount of label data self-supervision joint learning
CN113961067B (en) * 2021-09-28 2024-04-05 广东新王牌智能信息技术有限公司 Non-contact doodling drawing method and recognition interaction system based on deep learning

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20190073569A1 (en) * 2017-09-07 2019-03-07 International Business Machines Corporation Classifying medical images using deep convolution neural network (cnn) architecture
CN110738235A (en) * 2019-09-16 2020-01-31 平安科技(深圳)有限公司 Pulmonary tuberculosis determination method, pulmonary tuberculosis determination device, computer device, and storage medium
CN111598867A (en) * 2020-05-14 2020-08-28 国家卫生健康委科学技术研究所 Method, apparatus, and computer-readable storage medium for detecting specific facial syndrome
CN111695522A (en) * 2020-06-15 2020-09-22 重庆邮电大学 In-plane rotation invariant face detection method and device and storage medium
CN111932564A (en) * 2020-09-24 2020-11-13 平安科技(深圳)有限公司 Picture identification method and device, electronic equipment and computer readable storage medium

Family Cites Families (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105139390A (en) * 2015-08-14 2015-12-09 四川大学 Image processing method for detecting pulmonary tuberculosis focus in chest X-ray DR film
CN107729911A (en) * 2017-07-26 2018-02-23 江西中科九峰智慧医疗科技有限公司 A kind of pulmonary tuberculosis intelligent identification Method and system based on DR
CN110838103B (en) * 2019-10-29 2023-05-16 重庆金山医疗技术研究院有限公司 Image processing method, device, diagnosis equipment and computer storage medium

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20190073569A1 (en) * 2017-09-07 2019-03-07 International Business Machines Corporation Classifying medical images using deep convolution neural network (cnn) architecture
CN110738235A (en) * 2019-09-16 2020-01-31 平安科技(深圳)有限公司 Pulmonary tuberculosis determination method, pulmonary tuberculosis determination device, computer device, and storage medium
CN111598867A (en) * 2020-05-14 2020-08-28 国家卫生健康委科学技术研究所 Method, apparatus, and computer-readable storage medium for detecting specific facial syndrome
CN111695522A (en) * 2020-06-15 2020-09-22 重庆邮电大学 In-plane rotation invariant face detection method and device and storage medium
CN111932564A (en) * 2020-09-24 2020-11-13 平安科技(深圳)有限公司 Picture identification method and device, electronic equipment and computer readable storage medium

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114511569A (en) * 2022-04-20 2022-05-17 中南大学湘雅医院 Tumor marker-based medical image identification method, device, equipment and medium
CN115564656A (en) * 2022-11-11 2023-01-03 成都智元汇信息技术股份有限公司 Multi-graph merging and graph recognizing method and device based on scheduling
CN115564656B (en) * 2022-11-11 2023-04-28 成都智元汇信息技术股份有限公司 Multi-graph merging and graph identifying method and device based on scheduling
CN116129206A (en) * 2023-04-14 2023-05-16 吉林大学 Processing method and device for image decoupling characterization learning and electronic equipment

Also Published As

Publication number Publication date
CN111932564A (en) 2020-11-13
CN111932564B (en) 2021-03-02

Similar Documents

Publication Publication Date Title
WO2021189910A1 (en) Image recognition method and apparatus, and electronic device and computer-readable storage medium
WO2021151338A1 (en) Medical imagery analysis method, apparatus, electronic device and readable storage medium
WO2021189912A1 (en) Method and apparatus for detecting target object in image, and electronic device and storage medium
WO2021217851A1 (en) Abnormal cell automatic labeling method and apparatus, electronic device, and storage medium
WO2022121156A1 (en) Method and apparatus for detecting target object in image, electronic device and readable storage medium
WO2021151345A1 (en) Method and apparatus for parameter acquisition for recognition model, electronic device, and storage medium
WO2021189913A1 (en) Method and apparatus for target object segmentation in image, and electronic device and storage medium
WO2021189909A1 (en) Lesion detection and analysis method and apparatus, and electronic device and computer storage medium
CN111047609B (en) Pneumonia focus segmentation method and device
CN110276366A (en) Carry out test object using Weakly supervised model
WO2021143103A1 (en) Video data processing method, apparatus and device, and computer-readable storage medium
WO2021189901A1 (en) Image segmentation method and apparatus, and electronic device and computer-readable storage medium
WO2022048209A1 (en) License plate recognition method and apparatus, electronic device, and storage medium
JP6636506B2 (en) Image fingerprint generation
WO2021189911A1 (en) Target object position detection method and apparatus based on video stream, and device and medium
WO2021151275A1 (en) Image segmentation method and apparatus, device, and storage medium
WO2021114817A1 (en) Oct image lesion detection method and apparatus based on neural network, and medium
WO2021135474A1 (en) Method and apparatus for fusing data from multiple data sources, electronic device, and storage medium
WO2022126914A1 (en) Living body detection method and apparatus, electronic device, and storage medium
CN111063422A (en) Medical image labeling method, device, equipment and medium
WO2021184576A1 (en) Medical image generation method and apparatus, electronic device and medium
WO2021189856A1 (en) Certificate check method and apparatus, and electronic device and medium
WO2021151278A1 (en) Method and apparatus for vehicle damage assessment based on artificial intelligence, electronic device, and medium
CN111933274A (en) Disease classification diagnosis method and device, electronic equipment and storage medium
WO2021189914A1 (en) Electronic device, medical image index generation method and apparatus, and storage medium

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 20926475

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 20926475

Country of ref document: EP

Kind code of ref document: A1