TW202221568A

TW202221568A - Image recognition method, electronic device and computer readable storage medium

Info

Publication number: TW202221568A
Application number: TW110131345A
Authority: TW
Inventors: 陳翼男
Original assignee: 大陸商上海商湯智能科技有限公司
Priority date: 2020-09-24
Filing date: 2021-08-24
Publication date: 2022-06-01
Also published as: CN112036506A; WO2022062590A1

Abstract

The application discloses an image recognition method, an electronic device and a computer readable storage medium, wherein the image recognition method includes: acquiring a plurality of medical images to be recognized; extracting the style feature representation of each medical image to be recognized; classifying the style feature representation of a plurality of medical images to be recognized to obtain the scanned image category of each medical image to be recognized.

Description

Image recognition method, electronic device, computer-readable storage medium

本發明關於人工智慧技術領域，關於但不限於一種圖像識別方法、電子設備、電腦可讀儲存介質。The present invention relates to the technical field of artificial intelligence, and relates to, but is not limited to, an image recognition method, an electronic device, and a computer-readable storage medium.

電腦斷層掃描（Computed Tomography，CT）和核磁共振掃描（Magnetic Resonance Imaging，MRI）等醫學圖像在臨床具有重要意義。為了使醫學圖像應用於臨床，一般需要掃描得到至少一種掃描圖像類別的醫學圖像。以與肝臟相關的臨床為例，掃描圖像類別往往包括與時序有關的造影前平掃、動脈早期、動脈晚期、門脈期、延遲期等等，此外，掃描圖像類別還可以包含與掃描參數有關的T1加權反相成像、T1加權同相成像、T2加權成像、擴散加權成像、表面擴散係數成像等等。Medical images such as Computed Tomography (CT) and Magnetic Resonance Imaging (MRI) are of great clinical significance. In order to apply a medical image to the clinic, it is generally necessary to scan to obtain a medical image of at least one type of scanned image. Taking liver-related clinics as an example, scan image categories often include time-series-related pre-contrast scan, early arterial phase, late arterial phase, portal phase, delayed phase, etc. Parameter-related T1-weighted in-phase imaging, T1-weighted in-phase imaging, T2-weighted imaging, diffusion-weighted imaging, surface diffusion coefficient imaging, etc.

在掃描過程中，通常需要放射科醫師鑒別掃描得到的醫學圖像的掃描圖像類別，以確保獲取所需要的醫學圖像；或者，在住院或門診診療時，通常需要醫生對掃描得到的醫學圖像進行識別，判斷每一醫學圖像的掃描圖像類別，再進行閱片。During the scanning process, a radiologist is usually required to identify the scanned image category of the scanned medical image to ensure that the required medical image is obtained; The image is recognized, the scanned image type of each medical image is judged, and then the image is read.

本發明實施例提供一種圖像識別方法、電子設備、電腦可讀儲存介質。Embodiments of the present invention provide an image recognition method, an electronic device, and a computer-readable storage medium.

本發明實施例提供了一種圖像識別方法，包括：獲取多個待識別醫學圖像；分別提取每一待識別醫學圖像的風格特徵表示；對多個待識別醫學圖像的風格特徵表示進行分類處理，得到每一待識別醫學圖像的掃描圖像類別。An embodiment of the present invention provides an image recognition method, including: acquiring a plurality of medical images to be recognized; extracting a style feature representation of each medical image to be recognized; The classification process is performed to obtain the scanned image category of each medical image to be identified.

因此，通過獲取多個待識別醫學圖像，並提取每一待識別醫學圖像的風格特徵表示，從而對多個待識別醫學圖像的風格特徵表示進行分類處理，故能夠在分類處理時，考慮多個待識別醫學圖像在各自風格特徵上的差異，進而能夠提高識別得到的掃描圖像類別的準確性，且由於能夠對多個待識別醫學圖像的風格特徵表示進行分類處理，並得到每一待識別醫學圖像的掃描圖像類別，故能夠一次得到多個待識別醫學圖像的掃描圖像類別，從而能夠提高圖像識別的效率，故此，上述方案能夠提高圖像識別的效率和準確性。Therefore, by acquiring a plurality of medical images to be recognized, and extracting the style feature representation of each medical image to be recognized, the style feature representation of the plurality of medical images to be recognized can be classified and processed. Considering the differences in the respective style features of multiple medical images to be identified, the accuracy of the recognized scanned image category can be improved, and because the style feature representation of multiple medical images to be identified can be classified and processed, and The scanned image category of each medical image to be recognized is obtained, so multiple scanned image categories of the medical image to be recognized can be obtained at one time, thereby improving the efficiency of image recognition. Therefore, the above solution can improve the efficiency of image recognition. Efficiency and accuracy.

本發明的一些實施例中，對多個待識別醫學圖像的風格特徵表示進行分類處理，得到每一待識別醫學圖像的掃描圖像類別包括：將多個待識別醫學圖像的風格特徵表示進行第一融合處理，得到最終風格特徵表示；對最終風格特徵表示進行分類處理，得到每一待識別醫學圖像的掃描圖像類別。In some embodiments of the present invention, classifying the style feature representations of multiple medical images to be identified, and obtaining the scanned image category of each medical image to be identified includes: classifying the style features of the multiple medical images to be identified. The first fusion processing is performed to obtain the final style feature representation; the final style feature representation is classified and processed to obtain the scanned image category of each medical image to be recognized.

因此，在對多個待識別醫學圖像的風格特徵表示進行分類處理時，將多個待識別醫學圖像的風格特徵表示進行第一融合處理，得到最終風格特徵表示，故最終風格特徵表示能夠表示每一待識別醫學圖像的風格特徵表示與其他待識別醫學圖像的風格特徵表示之間的差異，故利用最終風格特徵表示進行分類處理能夠提高識別得到的掃描圖像類別的準確性。Therefore, when classifying the style feature representations of multiple medical images to be recognized, the first fusion processing is performed on the style feature representations of the multiple unidentified medical images to obtain the final style feature representation, so the final style feature representation can be It represents the difference between the style feature representation of each to-be-recognized medical image and the style-feature representations of other to-be-recognized medical images, so the classification process using the final style feature representation can improve the accuracy of the recognized scanned image category.

本發明的一些實施例中，對多個待識別醫學圖像的風格特徵表示進行分類處理，得到每一待識別醫學圖像的掃描圖像類別之後，圖像識別方法還包括以下至少一者：將多個待識別醫學圖像按照其掃描圖像類別進行排序；將按照掃描圖像類別進行排序後的至少一個待識別醫學圖像進行同屏顯示；若待識別醫學圖像的掃描圖像類別存在重複，則輸出第一預警資訊，以提示掃描人員；若多個待識別醫學圖像的掃描圖像類別中不存在預設掃描圖像類別，則輸出第二預警資訊，以提示掃描人員；若待識別醫學圖像的掃描圖像類別的分類置信度小於預設置信度閾值，則輸出第三預警資訊，以提示掃描人員。In some embodiments of the present invention, the image recognition method further includes at least one of the following after classifying and processing the style feature representations of a plurality of medical images to be recognized to obtain the scanned image category of each medical image to be recognized: Sort the multiple medical images to be recognized according to their scanned image categories; display at least one medical image to be recognized after being sorted according to the scanned image categories; if the scanned image category of the medical image to be recognized If there is duplication, output the first warning information to remind the scanning personnel; if there is no preset scanning image category in the scanned image categories of the multiple medical images to be identified, output the second warning information to remind the scanning personnel; If the classification confidence of the scanned image category of the medical image to be recognized is smaller than the preset reliability threshold, output third warning information to prompt the scanning personnel.

因此，在確定得到每一待識別醫學圖像所屬的掃描圖像類別之後，執行將至少一個待識別醫學圖像按照其掃描圖像類別進行排序，能夠提高醫生閱片的便捷性；將按照掃描圖像類別進行排序後的至少一個待識別醫學圖像進行同屏顯示，能夠免去醫生翻閱待識別醫學圖像時來回對照，從而能夠提高醫生閱片的效率；在待識別醫學圖像的掃描圖像類別存在重複時，輸出第一預警資訊，以提示掃描人員，在至少一個待識別醫學圖像的掃描圖像類別中不存在預設掃描圖像類別時，輸出第二預警資訊，以提示掃描人員，在待識別醫學圖像的掃描圖像類別的分類置信度小於預設置信度閾值時，輸出第三預警資訊，以提示掃描人員，能夠在掃描過程中實現圖像質控，以在與實際相悖時，能夠及時糾錯，避免病人二次掛號。Therefore, after determining the scanned image category to which each medical image to be recognized belongs, sorting at least one medical image to be recognized according to its scanned image category can improve the convenience for doctors to read images; After the image categories are sorted, at least one to-be-recognized medical image is displayed on the same screen, which can eliminate the need for doctors to check back and forth when viewing the to-be-recognized medical image, thereby improving the efficiency of the doctor's reading; when scanning the to-be-recognized medical image When the image categories are duplicated, output the first warning information to remind the scanning personnel, and when there is no preset scanning image category in the scanned image category of at least one medical image to be identified, output the second warning information to remind the scanning image category. When the classification confidence of the scanned image category of the medical image to be recognized is less than the preset reliability threshold, the scanner will output the third warning information to remind the scanner, and the image quality control can be realized during the scanning process, so as to ensure the quality of the image in the scanning process. When it is contrary to the actual situation, it can correct the error in time and avoid the second registration of the patient.

本發明的一些實施例中，分別提取每一待識別醫學圖像的風格特徵表示之前，上述方法還包括：對每一待識別醫學圖像進行預處理，其中，預處理包括以下至少一種：將待識別醫學圖像的圖像尺寸調整至預設尺寸，將待識別醫學圖像的圖像強度歸一化至預設範圍。In some embodiments of the present invention, before extracting the style feature representation of each to-be-recognized medical image, the above method further includes: preprocessing each to-be-recognized medical image, wherein the preprocessing includes at least one of the following: The image size of the medical image to be recognized is adjusted to a preset size, and the image intensity of the medical image to be recognized is normalized to a preset range.

因此，在提取風格特徵表示之前，對每一目的地區域的圖像資料進行預處理，且預處理包括以下至少一種：將目的地區域的圖像尺寸調整至預設尺寸，將目的地區域的圖像強度歸一化至預設範圍，故能夠有利於提高後續圖像識別的準確性。Therefore, before extracting the style feature representation, the image data of each destination area is preprocessed, and the preprocessing includes at least one of the following: adjusting the image size of the destination area to a preset size, The image intensity is normalized to a preset range, which can help to improve the accuracy of subsequent image recognition.

本發明的一些實施例中，圖像識別方法還包括：分別提取每一待識別醫學圖像的內容特徵表示；對多個待識別醫學圖像的內容特徵表示進行病灶識別，得到每一待識別醫學圖像中的病灶區域。In some embodiments of the present invention, the image recognition method further includes: extracting the content feature representation of each to-be-recognized medical image; Lesion areas in medical images.

因此，通過提取每一待識別醫學圖像的內容特徵表示，並對多個待識別醫學圖像的內容特徵表示進行病灶識別，得到每一待識別醫學圖像中的病灶區域，能夠在得到每一待識別醫學圖像的掃描圖像類別的同時，確定其中的病灶區域，故能夠有利於提高整體閱片效能，同時能夠有利於消除病灶對掃描圖像類別識別帶來的干擾，從而能夠提高圖像識別的準確性。Therefore, by extracting the content feature representation of each to-be-recognized medical image, and performing lesion identification on the content-feature representations of a plurality of to-be-recognized medical images, the lesion area in each to-be-recognized medical image can be obtained. When the scanned image category of the medical image is to be identified, the lesion area in the medical image is determined, which can help to improve the overall reading performance, and at the same time, it can help to eliminate the interference caused by the lesion to the identification of the scanned image category, so as to improve the Image recognition accuracy.

本發明的一些實施例中，對多個待識別醫學圖像的內容特徵表示進行病灶識別，得到每一待識別醫學圖像中的病灶區域包括：將多個待識別醫學圖像的內容特徵表示進行第二融合處理，得到最終內容特徵表示；對最終內容特徵表示進行病灶識別，得到每一待識別醫學圖像中的病灶區域。In some embodiments of the present invention, performing lesion identification on the content feature representations of a plurality of medical images to be identified, and obtaining a lesion area in each medical image to be identified includes: representing the content feature representations of the multiple medical images to be identified. The second fusion process is performed to obtain the final content feature representation; the lesion identification is performed on the final content feature representation to obtain the lesion area in each to-be-recognized medical image.

因此，將多個待識別醫學圖像的內容特徵表示進行第二融合處理，得到最終內容特徵表示，能夠有利於使最終內容特徵表示補償單一待識別醫學圖像中可能存在的病灶不明顯或運動干擾產生的偽影等問題，從而在利用最終內容特徵表示進行病灶識別時，能夠提高病灶識別的準確性。Therefore, the second fusion processing is performed on the content feature representations of multiple medical images to be recognized to obtain the final content feature representation, which can help the final content feature representation to compensate for inconspicuous or moving lesions that may exist in a single medical image to be recognized. Problems such as artifacts generated by interference, so that the accuracy of lesion identification can be improved when using the final content feature representation for lesion identification.

本發明的一些實施例中，圖像識別方法還包括：提示當前顯示的待識別醫學圖像的病灶區域。In some embodiments of the present invention, the image recognition method further includes: prompting the currently displayed lesion area of the medical image to be recognized.

因此，通過提示當前顯示的待識別醫學圖像的病灶區域，能夠提升醫生閱片體驗。Therefore, by prompting the lesion area of the currently displayed medical image to be identified, the doctor's reading experience can be improved.

本發明的一些實施例中，將所述多個待識別醫學圖像的內容特徵表示進行第二融合處理，包括以下任一者：將多個待識別醫學圖像的內容特徵表示進行拼接處理；將多個待識別醫學圖像的內容特徵表示進行相加處理；其中，最終內容特徵表示和多個待識別醫學圖像的內容特徵表示的維度相同。In some embodiments of the present invention, performing a second fusion process on the content feature representations of the plurality of medical images to be recognized includes any one of the following: performing a splicing process on the content feature representations of the plurality of medical images to be recognized; The content feature representations of the multiple medical images to be recognized are added; wherein, the final content feature representation has the same dimension as the content feature representations of the multiple medical images to be recognized.

因此，通過將多個待識別醫學圖像的內容特徵表示進行拼接處理，或者將多個待識別醫學圖像的內容特徵表示進行相加處理中的任一者，得到最終內容特徵表示，且最終內容特徵表示和多個待識別醫學圖像的內容特徵表示的維度相同，能夠通過多種方式得到最終內容特徵表示，從而能夠提高圖像識別的魯棒性。Therefore, by performing a splicing process on the content feature representations of multiple medical images to be recognized, or performing an addition process on the content feature representations of multiple medical images to be recognized, the final content feature representation is obtained, and finally The content feature representation has the same dimension as the content feature representation of multiple medical images to be recognized, and the final content feature representation can be obtained in various ways, thereby improving the robustness of image recognition.

本發明的一些實施例中，分別提取每一待識別醫學圖像的風格特徵表示，包括：利用識別網路的風格編碼子網路分別提取每一待識別醫學圖像的風格特徵表示；對多個待識別醫學圖像的風格特徵表示進行分類處理，得到每一待識別醫學圖像的掃描圖像類別，包括：利用識別網路的分類處理子網路對多個待識別醫學圖像的風格特徵表示進行分類處理，得到每一待識別醫學圖像的掃描圖像類別；分別提取每一待識別醫學圖像的內容特徵表示，包括：利用識別網路的內容編碼子網路分別提取每一待識別醫學圖像的內容特徵表示；對多個待識別醫學圖像的內容特徵表示進行病灶識別，得到每一待識別醫學圖像中的病灶區域，包括：利用識別網路的區域分割子網路對多個待識別醫學圖像的內容特徵表示進行病灶識別，得到每一待識別醫學圖像中的病灶區域。In some embodiments of the present invention, extracting the style feature representation of each to-be-recognized medical image separately includes: extracting the style feature representation of each to-be-recognized medical image by using the style coding sub-network of the recognition network; The style feature representation of each medical image to be recognized is classified and processed to obtain the scanned image category of each medical image to be recognized, including: using the classification processing sub-network of the recognition network to classify the styles of the medical images to be recognized. The feature representation is classified and processed to obtain the scanned image category of each medical image to be recognized; the content feature representation of each medical image to be recognized is extracted separately, including: using the content coding sub-network of the recognition network to extract each The content feature representation of the medical image to be recognized; the lesion identification is performed on the content feature representation of a plurality of medical images to be recognized, and the lesion area in each to-be-recognized medical image is obtained, including: using the region of the recognition network to segment the subnet The road performs lesion identification on the content feature representations of multiple medical images to be identified, and obtains the lesion area in each medical image to be identified.

因此，利用識別網路的風格編碼子網路分別提取每一待識別醫學圖像的風格特徵表示，利用識別網路的分類處理子網路對多個待識別醫學圖像的風格特徵表示進行分類處理，得到每一待識別醫學圖像的掃描圖像類別，利用識別網路的內容編碼子網路分別提取每一待識別醫學圖像的內容特徵表示，利用識別網路的區域分割子網路對多個待識別醫學圖像的內容特徵表示進行病灶識別，得到每一待識別醫學圖像中的病灶區域，能夠利用識別網路執行風格特徵表示的提取、分類處理、內容特徵表示的提取以及病灶識別等任務，故能夠有利於提高圖像識別的效率。Therefore, use the style coding sub-network of the recognition network to extract the style feature representation of each medical image to be recognized, and use the classification processing sub-network of the recognition network to classify the style feature representations of multiple medical images to be recognized Process to obtain the scanned image category of each medical image to be recognized, use the content coding sub-network of the recognition network to extract the content feature representation of each medical image to be recognized, and use the area of the recognition network to divide the sub-network Perform lesion identification on the content feature representations of multiple medical images to be identified, and obtain the lesion area in each to-be-recognized medical image, and can use the recognition network to perform extraction of style feature representations, classification processing, extraction of content feature representations, and extraction of content feature representations. It can help to improve the efficiency of image recognition.

本發明的一些實施例中，分別提取每一待識別醫學圖像的風格特徵表示之前，圖像識別方法還包括：獲取多個樣本醫學圖像，其中，多個樣本醫學圖像標注有其真實掃描圖像類別和真實病灶區域；利用風格編碼子網路分別提取每一樣本醫學圖像的樣本風格特徵表示，並利用內容編碼子網路分別提取每一樣本醫學圖像的樣本內容特徵表示；利用分類處理子網路對多個樣本醫學圖像的樣本風格特徵表示進行分類處理，得到每一樣本醫學圖像的預測掃描圖像類別，並利用區域分割子網路對多個樣本醫學圖像的樣本內容特徵表示進行病灶識別，得到每一樣本醫學圖像中的預測病灶區域；利用真實掃描圖像類別和預測掃描圖像類別的差異，調整風格編碼子網路和分類處理子網路的網路參數，以及利用真實病灶區域和預測病灶區域的差異，調整內容編碼子網路和區域分割子網路的網路參數。In some embodiments of the present invention, before extracting the style feature representation of each to-be-recognized medical image, the image recognition method further includes: acquiring a plurality of sample medical images, wherein the plurality of sample medical images are marked with their real Scan the image category and the real lesion area; use the style coding sub-network to extract the sample style feature representation of each sample medical image, and use the content coding sub-network to extract the sample content feature representation of each sample medical image respectively; The classification processing sub-network is used to classify the sample style feature representation of multiple sample medical images, and the predicted scanning image category of each sample medical image is obtained, and the region segmentation sub-network is used to classify the multiple sample medical images. The feature representation of the sample content is used to identify the lesions, and obtain the predicted lesion area in each sample medical image; use the difference between the real scanned image category and the predicted scanned image category to adjust the style coding sub-network and the classification processing sub-network. network parameters, as well as network parameters for the content coding sub-network and the region segmentation sub-network using the difference between the actual lesion area and the predicted lesion area.

因此，在對風格編碼子網路和分類處理子網路進行訓練的同時，加入對內容編碼子網路和區域分割子網路的訓練，從而能夠在提高區域分割子網路的病灶識別能力的同時，提高內容編碼子網路對於病灶相關的內容特徵的獲取度，進而能夠有利於使得風格編碼子網路不響應與病灶相關的特徵，使後續分類時不受病灶相關特徵的影響，故能夠提高圖像識別的魯棒性。Therefore, while training the style coding sub-network and the classification processing sub-network, the training of the content-coding sub-network and the region segmentation sub-network can be added, so as to improve the lesion identification ability of the region segmentation sub-network. At the same time, improving the acquisition of content features related to lesions by the content coding sub-network can help to make the style coding sub-network not respond to features related to lesions, so that subsequent classification is not affected by features related to lesions, so it can be Improve the robustness of image recognition.

本發明的一些實施例中，圖像識別方法還包括：獲取每一樣本醫學圖像的樣本風格特徵表示的樣本資料分佈情況；利用樣本資料分佈情況之間的差異，調整風格編碼子網路的網路參數。In some embodiments of the present invention, the image recognition method further includes: acquiring the distribution of sample data represented by the sample style features of each sample medical image; network parameters.

因此，在訓練過程中，同時獲取樣本資料分佈情況，並利用樣本資料分佈情況之間的差異，調整風格編碼子網路的網路參數，故能夠有利於使後續提取到風格特徵表示之間相互獨立，從而能夠有利於提高識別到的掃描圖像類別的準確性。Therefore, in the training process, the distribution of the sample data is obtained at the same time, and the difference between the distribution of the sample data is used to adjust the network parameters of the style coding sub-network, so it can be beneficial to make the subsequent extraction of the style feature representations. independent, which can help to improve the accuracy of the recognized scanned image categories.

本發明的一些實施例中，圖像識別方法還包括：利用一樣本風格特徵表示和一內容特徵表示，構建得到與樣本風格特徵表示對應的重建圖像；利用重建圖像與對應的樣本風格特徵表示所屬的樣本醫學圖像之間的差異，調整風格編碼子網路和內容編碼子網路的網路參數。In some embodiments of the present invention, the image recognition method further includes: using a sample style feature representation and a content feature representation to construct a reconstructed image corresponding to the sample style feature representation; using the reconstructed image and the corresponding sample style feature To represent the differences between the belonging sample medical images, adjust the network parameters of the style coding sub-network and the content-coding sub-network.

因此，在訓練過程中，同時利用一樣本風格特徵表示和一內容特徵表示，構建得到與樣本風格特徵表示對應的重建圖像，並利用重建圖像與對應的樣本風格特徵表示所屬的樣本醫學圖像之間的差異，調整風格編碼子網路和內容編碼子網路的網路參數，從而能夠使風格編碼子網路盡可能地提取到完整準確的風格特徵，而內容編碼子網路盡可能地提取到完整準確的風格特徵，進而能夠有利於提高後續掃描圖像類別以及病灶識別的準確性。Therefore, in the training process, a sample style feature representation and a content feature representation are used at the same time to construct a reconstructed image corresponding to the sample style feature representation, and the reconstructed image and the corresponding sample style feature are used to represent the sample medical map to which they belong. Adjust the network parameters of the style coding subnet and the content coding subnet, so that the style coding subnet can extract as complete and accurate style features as possible, and the content coding subnet can extract as much as possible the complete and accurate style features. It can extract complete and accurate style features, which can help to improve the accuracy of subsequent scanning image categories and lesion identification.

本發明的一些實施例中，風格編碼子網路包括：順序連接的下採樣層和全域池化層；和/或，內容編碼子網路包括以下任一者：順序連接的下採樣層和殘差塊、順序連接的卷積層和池化層。In some embodiments of the present invention, the style encoding sub-network includes: a sequentially connected downsampling layer and a global pooling layer; and/or, the content encoding sub-network includes any one of the following: a sequentially connected downsampling layer and a residual Difference blocks, sequentially connected convolutional layers and pooling layers.

因此，通過將風格編碼子網路設置為包括順序連接的下採樣層和全域池化層，能夠有利於在簡化網路結構的同時便於網路訓練；通過將內容編碼子網路設置為包括以下任一者：順序連接的下採樣層和殘差塊、順序連接的卷積層和池化層，能夠有利於在簡化網路結構的同時便於網路訓練。Therefore, by setting the style encoding sub-network to include sequentially connected downsampling layers and global pooling layers, it can facilitate network training while simplifying the network structure; by setting the content encoding sub-network to include the following Either: sequentially connected downsampling layers and residual blocks, sequentially connected convolutional layers and pooling layers, can facilitate network training while simplifying the network structure.

本發明實施例還提供了一種圖像識別裝置，包括圖像獲取模組、風格提取模組和分類處理模組，圖像獲取模組配置為獲取多個待識別醫學圖像；風格提取模組配置為分別提取每一待識別醫學圖像的風格特徵表示；分類處理模組配置為對多個待識別醫學圖像的風格特徵表示進行分類處理，得到每一待識別醫學圖像的掃描圖像類別。The embodiment of the present invention also provides an image recognition device, including an image acquisition module, a style extraction module and a classification processing module, the image acquisition module is configured to acquire a plurality of medical images to be recognized; the style extraction module It is configured to separately extract the style feature representation of each medical image to be recognized; the classification processing module is configured to classify and process the style feature representation of a plurality of medical images to be recognized, and obtain a scanned image of each medical image to be recognized. category.

本發明實施例還提供了一種電子設備，包括相互耦接的記憶體和處理器，處理器配置為執行記憶體中儲存的程式指令，以實現上述任意一種圖像識別方法。An embodiment of the present invention further provides an electronic device, including a memory and a processor coupled to each other, the processor is configured to execute program instructions stored in the memory, so as to implement any one of the above image recognition methods.

本發明實施例還提供了一種電腦可讀儲存介質，其上儲存有程式指令，程式指令被處理器執行時實現上述任意一種圖像識別方法。Embodiments of the present invention further provide a computer-readable storage medium, which stores program instructions, and when the program instructions are executed by a processor, any one of the above image recognition methods is implemented.

本發明實施例還提供了一種電腦程式，包括電腦可讀代碼，當所述電腦可讀代碼在電子設備中運行時，所述電子設備中的處理器執行用於實現上述任意一種圖像識別方法。An embodiment of the present invention further provides a computer program, including computer-readable code, when the computer-readable code is executed in an electronic device, a processor in the electronic device executes any one of the image recognition methods described above. .

本發明實施例中，通過獲取多個待識別醫學圖像，並提取每一待識別醫學圖像的風格特徵表示，從而對多個待識別醫學圖像的風格特徵表示進行分類處理，故能夠在分類處理時，考慮多個待識別醫學圖像在各自風格特徵上的差異，進而能夠提高識別得到的掃描圖像類別的準確性，且由於能夠對多個待識別醫學圖像的風格特徵表示進行分類處理，並得到每一待識別醫學圖像的掃描圖像類別，故能夠一次得到多個待識別醫學圖像的掃描圖像類別，從而能夠提高圖像識別的效率，故此，本發明實施例能夠提高圖像識別的效率和準確性。In the embodiment of the present invention, by acquiring a plurality of medical images to be recognized, and extracting the style feature representation of each medical image to be recognized, the style feature representation of the plurality of medical images to be recognized is classified and processed, so it is possible to During the classification process, the differences in the respective style features of the multiple medical images to be identified are considered, so as to improve the accuracy of the recognized scanned image categories, and because the style feature representation of the multiple medical images to be identified can be performed. classification processing, and obtain the scanned image category of each medical image to be recognized, so multiple scanned image categories of the medical image to be recognized can be obtained at one time, so that the efficiency of image recognition can be improved. Therefore, the embodiment of the present invention It can improve the efficiency and accuracy of image recognition.

下面結合說明書附圖，對本發明實施例的方案進行詳細說明。The solutions of the embodiments of the present invention will be described in detail below with reference to the accompanying drawings.

以下描述中，為了說明而不是為了限定，提出了諸如特定系統結構、介面、技術之類的具體細節，以便透徹理解本發明。In the following description, for purposes of illustration and not limitation, specific details such as specific system structures, interfaces, techniques, etc. are set forth in order to provide a thorough understanding of the present invention.

本文中術語“系統”和“網路”在本文中常被可互換使用。本文中術語“和/或”，僅僅是一種描述關聯對象的關聯關係，表示可以存在三種關係，例如，A和/或B，可以表示：單獨存在A，同時存在A和B，單獨存在B這三種情況。另外，本文中字元“/”，一般表示前後關聯對象是一種“或”的關係。此外，本文中的“多”表示兩個或者多於兩個。The terms "system" and "network" are often used interchangeably herein. The term "and/or" in this article is only an association relationship to describe associated objects, indicating that there can be three kinds of relationships, for example, A and/or B, which can mean that A exists alone, A and B exist at the same time, and B exists alone. three conditions. In addition, the character "/" in this text generally indicates that the related objects are an "or" relationship. Also, "multiple" herein means two or more than two.

請參閱圖1，圖1是本發明圖像識別方法一實施例的流程示意圖。具體而言，可以包括如下步驟。Please refer to FIG. 1 , which is a schematic flowchart of an embodiment of an image recognition method of the present invention. Specifically, the following steps may be included.

步驟S11：獲取多個待識別醫學圖像。Step S11: Acquire a plurality of medical images to be recognized.

待識別醫學圖像可以包括CT圖像、MR圖像，在此不做限定。在一個實施場景中，待識別醫學圖像可以是對腹部、胸部等區域進行掃描得到的，具體可以根據實際應用情況而設置，在此不做限定。例如，當肝臟、脾臟、腎臟為需要診療的臟器時，可以對腹部進行掃描，得到待識別醫學圖像；或者，當心臟、肺為需要診療的臟器時，可以對胸部進行掃描，得到待識別醫學圖像，其他情況可以以此類推，在此不再一一舉例。在另一個實施場景中，掃描方式可以是平掃、增強掃描等方式，在此不做限定。在又一個實施場景中，待識別醫學圖像可以是三維圖像，在此不做限定。在又一實施場景中，多個待識別醫學圖像可以對同一對象掃描得到的。The medical images to be identified may include CT images and MR images, which are not limited herein. In an implementation scenario, the medical image to be recognized may be obtained by scanning regions such as the abdomen, chest, etc., which may be specifically set according to the actual application, which is not limited herein. For example, when the liver, spleen, and kidney are the organs that need diagnosis and treatment, the abdomen can be scanned to obtain the medical image to be identified; or, when the heart and lungs are the organs that need diagnosis and treatment, the chest can be scanned to obtain For medical images to be recognized, other situations can be deduced by analogy, and examples are not given here. In another implementation scenario, the scanning mode may be a flat scanning, an enhanced scanning, or the like, which is not limited herein. In yet another implementation scenario, the medical image to be recognized may be a three-dimensional image, which is not limited herein. In yet another implementation scenario, multiple medical images to be recognized may be obtained by scanning the same object.

步驟S12：分別提取每一待識別醫學圖像的風格特徵表示。Step S12: Extract the style feature representation of each medical image to be recognized, respectively.

風格特徵表示用於描述待識別醫學圖像的風格，例如，待識別醫學圖像中所表示的注射進血管中的造影劑的強化程度。以肝臟為例，不同掃描圖像類別的待識別醫學圖像中肝臟的靜脈、門脈等血管造影劑強化程度各不相同，其中，掃描圖像類別為動脈早期的待識別醫學圖像中門靜脈尚未增強，掃描圖像類別為動脈晚期的待識別醫學圖像中門靜脈已被增強，掃描圖像類別為門脈期的待識別醫學圖像中門靜脈已充分增強且肝臟血管已被前向性血流增強、肝臟軟細胞組織在標記物下已達到峰值，掃描圖像類別為延遲期的待識別醫學圖像中門脈和動脈處於增強狀態並弱於門脈期、且肝臟軟細胞組織處於增強狀態並弱於門脈期，其他掃描圖像類別在此不再一一舉例。The style feature represents the style used to describe the medical image to be recognized, eg, the degree of enhancement of a contrast agent injected into a blood vessel represented in the medical image to be recognized. Taking the liver as an example, the degree of enhancement of angiographic contrast agents such as veins and portal veins in the liver in the medical images to be identified in different scan image categories is different. Among them, the scan image category is the portal vein in the medical images to be identified in the early arterial stage. Not yet enhanced, the portal vein has been enhanced in the medical image to be identified whose scan image category is the late arterial stage, and the portal vein has been fully enhanced in the unidentified medical image whose scan image category is the portal venous phase, and the liver blood vessels have been Flow enhancement, the liver parenchyma has reached the peak value under the marker, the portal vein and arteries are in an enhanced state and weaker than the portal phase in the scan image category to be identified in the delayed phase, and the liver parenchyma is enhanced The state is weaker than that of the portal venous phase, and other scan image types are not listed one by one here.

在一個實施場景中，風格特徵表示可以由一向量表示，向量的大小可以根據實際情況進行設置，例如，可以將向量的大小設置為8比特，在此不做限定。In an implementation scenario, the style feature representation may be represented by a vector, and the size of the vector may be set according to the actual situation. For example, the size of the vector may be set to 8 bits, which is not limited herein.

在一個實施場景中，為了提高風格特徵表示提取的便利性，可以預先訓練一識別網路，且識別網路中包含一風格編碼子網路，並利用識別網路的風格編碼子網路分別提取每一待識別醫學圖像的風格特徵表示。在一個實施場景中，為了簡化網路結構，風格編碼子網路可以包括順序連接的下採樣層和全域池化層，從而待識別醫學圖像經過下採樣處理之後，再利用全域池化層進行池化處理，得到風格特徵表示。In an implementation scenario, in order to improve the convenience of extracting the style feature representation, a recognition network can be pre-trained, and the recognition network includes a style encoding sub-network, and the style encoding sub-network of the recognition network is used to extract the A representation of style features for each medical image to be identified. In one implementation scenario, in order to simplify the network structure, the style encoding sub-network may include a downsampling layer and a global pooling layer that are connected in sequence, so that after the medical image to be recognized is subjected to downsampling processing, the global pooling layer is used to perform Pooling to get the style feature representation.

在一個實施場景中，為了提高後續圖像識別的準確性，在對待識別醫學圖像的風格特徵表示進行提取之前，還可以對每一待識別醫學圖像進行預處理，示例性地，預處理可以包括將待識別醫學圖像的圖像尺寸調整至預設尺寸（例如，32*256*256）。或者，預處理還可以包括將待識別醫學圖像的圖像強度歸一化至預設範圍（例如，0至1的範圍），在一個實施場景中，可以採用灰度累積分佈函數下預設比例（例如，99.9%）對應的灰度值作為歸一化的鉗位值，從而能夠加強待識別醫學圖像的對比度，有利於提升後續圖像識別的準確性。In an implementation scenario, in order to improve the accuracy of subsequent image recognition, before extracting the style feature representation of the medical image to be recognized, each medical image to be recognized may also be preprocessed, for example, preprocessing It may include resizing the image of the medical image to be recognized to a preset size (eg, 32*256*256). Alternatively, the preprocessing may further include normalizing the image intensity of the medical image to be recognized to a preset range (for example, a range of 0 to 1). In one implementation scenario, a preset grayscale cumulative distribution function may be used. The gray value corresponding to the ratio (for example, 99.9%) is used as the normalized clamp value, so that the contrast of the medical image to be recognized can be enhanced, which is beneficial to improve the accuracy of subsequent image recognition.

步驟S13：對多個待識別醫學圖像的風格特徵表示進行分類處理，得到每一待識別醫學圖像的掃描圖像類別。Step S13: Perform classification processing on the style feature representations of a plurality of medical images to be recognized, to obtain a scanned image category of each medical image to be recognized.

掃描圖像類別具體可以根據實際情況進行設置。例如，仍以對肝臟進行掃描得到的待識別醫學圖像為例，掃描圖像類別可以包括與時序有關的造影前平掃、動脈早期、動脈晚期、門脈期、延遲期；或者，掃描圖像類別還可以包括與描參數有關的T1加權反相成像、T1加權同相成像、T2加權成像、擴散加權成像、表面擴散係數成像。示例性地，動脈早期可以表示門靜脈尚未增強，動脈晚期可以表示門靜脈已被增強，門脈期可以表示門靜脈已充分增強且肝臟血管已被前向性血流增強、肝臟軟細胞組織在標記物下已達到峰值，延遲期可以表示門脈和動脈處於增強狀態並弱於門脈期、且肝臟軟細胞組織處於增強狀態並弱於門脈期，其他掃描圖像類別在此不再一一舉例。當待識別醫學圖像為對其他臟器掃描得到的醫學圖像時，可以以此類推，在此不再一一舉例。The scanned image category can be set according to the actual situation. For example, still taking the medical image to be identified obtained by scanning the liver as an example, the scan image category may include time-series-related pre-contrast scan, early arterial, late arterial, portal venous phase, and delayed phase; or, scan images Image categories may also include T1-weighted in-phase imaging, T1-weighted in-phase imaging, T2-weighted imaging, diffusion-weighted imaging, and surface diffusion coefficient imaging related to scanning parameters. Illustratively, the early arterial phase may indicate that the portal vein has not been enhanced, the late arterial phase may indicate that the portal vein has been enhanced, and the portal venous phase may indicate that the portal vein is sufficiently enhanced and the liver vessels have been enhanced by forward blood flow, and liver parenchyma tissue is under the marker. The peak value has been reached, and the delay period can indicate that the portal vein and arteries are in an enhanced state and weaker than the portal venous phase, and the liver parenchyma tissue is in an enhanced state and weaker than the portal venous phase. Other scan image categories are not listed here. When the medical image to be identified is a medical image obtained by scanning other organs, it can be deduced by analogy, and no examples will be given here.

由於屬於不同掃描圖像類別的待識別醫學圖像各自的風格特徵表示存在差異，故對其各自的風格特徵表示進行分類處理，能夠得到每一待識別醫學圖像的掃描圖像類別。在一個實施場景中，為了提高分類處理的便利性，上述識別網路中還包括分類處理子網路，從而可以利用分類處理子網路對多個待識別醫學圖像的風格特徵表示進行分類處理，得到每一待識別醫學圖像的掃描圖像類別，進而只需分類處理子網路對風格特徵表示進行分類處理即可得到待識別醫學圖像分別所屬的掃描圖像類別，故此能夠提高分類處理的便利性。在一個實施場景中，為了簡化網路結構，分類處理子網路可以包括順序連接的全連接層和softmax層，在此不做限定。Since there are differences in the respective style feature representations of the medical images to be recognized belonging to different scanned image categories, the respective style feature representations of the respective style feature representations are classified to obtain the scanned image category of each medical image to be recognized. In an implementation scenario, in order to improve the convenience of classification processing, the above-mentioned identification network further includes a classification processing sub-network, so that the classification processing sub-network can be used to classify and process the style feature representations of multiple medical images to be identified. , obtain the scanned image category of each medical image to be recognized, and then only need to classify the style feature representation by the classification processing sub-network to obtain the scanned image category to which the medical image to be recognized belongs, so the classification can be improved. Ease of handling. In an implementation scenario, in order to simplify the network structure, the classification processing sub-network may include sequentially connected fully connected layers and softmax layers, which are not limited herein.

在一個實施場景中，為了提高圖像識別準確性，還可以將多個待識別醫學圖像的風格特徵表示進行第一融合處理，得到最終風格特徵表示，並對最終風格特徵表示進行分類處理，得到每一待識別醫學圖像的掃描圖像類別。示例性地，第一融合處理的操作可以是將多個待識別醫學圖像的風格特徵表示進行首尾拼接，從而得到最終風格特徵表示；或者，第一融合處理的操作還可以是將多個待識別醫學圖像的風格特徵表示進行堆疊處理，從而得到最終風格特徵表示，在此不做限定。通過將多個待識別醫學圖像的風格特徵表示進行第一融合處理而得到的最終風格特徵表示，能夠表示每一待識別醫學圖像的風格特徵表示與其他待識別醫學圖像的風格特徵表示之間的差異，故利用最終風格特徵表示進行分類處理能夠提高識別得到的掃描圖像類別的準確性。In an implementation scenario, in order to improve the accuracy of image recognition, the style feature representations of multiple medical images to be recognized can also be subjected to a first fusion process to obtain a final style feature representation, and the final style feature representation is classified. Obtain the scanned image category of each medical image to be recognized. Exemplarily, the operation of the first fusion process may be to perform end-to-end splicing of the style feature representations of a plurality of medical images to be recognized, so as to obtain a final style feature representation; The style feature representations of the identified medical images are stacked to obtain the final style feature representation, which is not limited here. The final style feature representation obtained by performing the first fusion process on the style feature representations of multiple medical images to be recognized can represent the style feature representation of each to-be-recognized medical image and the style feature representations of other to-be-recognized medical images Therefore, using the final style feature representation for classification processing can improve the accuracy of the recognized scanned image categories.

在一個實施場景中，為了提高閱片效能，以及掃描圖像類別識別的準確性，還可以提取每一待識別醫學圖像的內容特徵表示，並對多個待識別醫學圖像的內容特徵表示進行病灶識別，得到每一待識別醫學圖像中的病灶區域，故能夠在得到每一待識別醫學圖像的掃描圖像類別的同時，確定其中的病灶區域，故能夠有利於提高整體閱片效能，同時能夠有利於消除病灶對掃描圖像類別識別帶來的干擾，從而能夠提高圖像識別的準確性。內容特徵表示待識別醫學圖像中的內容，例如，待識別醫學圖像中器髒的解剖特徵。仍以肝臟為例，內容特徵表示可以描述肝臟及其毗鄰器臟（如，脾臟、腎臟）之間的生理位置關係、肝臟的外形特徵、質地（如，軟、硬等）特徵、成分（如，含水、含脂等）特徵等等，在此不做限定。此外，病灶可以包括腫瘤、血栓、結節等，具體可以根據實際情況進行設置，在此不做限定。In an implementation scenario, in order to improve the reading efficiency and the accuracy of scanning image category recognition, the content feature representation of each to-be-recognized medical image can also be extracted, and the content feature representation of multiple to-be-recognized medical images can be represented. Perform lesion identification to obtain the lesion area in each medical image to be identified, so the lesion area can be determined while obtaining the scanned image category of each medical image to be identified, so it can be beneficial to improve the overall reading. At the same time, it can help to eliminate the interference caused by the lesions to the classification recognition of the scanned image, so that the accuracy of the image recognition can be improved. The content feature represents the content in the medical image to be recognized, for example, the anatomical features of an organ in the medical image to be recognized. Still taking the liver as an example, the content feature representation can describe the physiological positional relationship between the liver and its adjacent organs (such as spleen, kidney), the shape characteristics of the liver, the texture (such as soft, hard, etc.) characteristics, the composition (such as spleen, kidney, etc.) characteristics. , water-containing, fat-containing, etc.) characteristics, etc., are not limited here. In addition, the lesions may include tumors, thrombus, nodules, etc., which may be set according to actual conditions, and are not limited herein.

在一個實施場景中，為了提高提取內容特徵表示的便利性，識別網路中還可以包括一內容編碼子網路，從而可以利用識別網路的內容編碼子網路分別提取每一待識別醫學圖像的內容特徵表示。示例性地，為了簡化網路結構，內容編碼子網路可以採用順序連接的下採樣層和殘差塊（resblock），殘差塊的數量可以根據實際情況進行設置，通過設置殘差塊可以提升網路深度，從而可以提高提取得到的內容特徵表示的深度；或者，內容編碼子網路還可以採用順序連接的卷積層和池化層，卷積層和池化層的組數可以根據實際情況進行設置，例如，可以採用一組順序連接的卷積層和池化層、兩組順序連接的卷積層和池化層、三組順序連接的卷積層和池化層等等，在此不做限定。In an implementation scenario, in order to improve the convenience of extracting the content feature representation, the identification network may further include a content encoding sub-network, so that each medical image to be identified can be extracted separately by using the content encoding sub-network of the identification network. Image content feature representation. Exemplarily, in order to simplify the network structure, the content coding sub-network can use sequentially connected downsampling layers and residual blocks (resblocks), and the number of residual blocks can be set according to the actual situation. network depth, which can improve the depth of the extracted content feature representation; alternatively, the content coding sub-network can also use sequentially connected convolutional layers and pooling layers, and the number of groups of convolutional layers and pooling layers can be determined according to the actual situation. The setting, for example, can adopt a set of sequentially connected convolution layers and pooling layers, two sets of sequentially connected convolution layers and pooling layers, three sets of sequentially connected convolution layers and pooling layers, etc., which are not limited here.

在另一個實施場景中，為了提高病灶識別的便利性，識別網路中還可以包括一區域分割子網路，從而利用識別網路的區域分割子網路對多個待識別醫學圖像的內容特徵表示進行病灶識別，得到每一待識別醫學圖像中的病灶區域。示例性地，區域分割子網路可以採用Unet、Vnet等，在此不做限定。此外，病灶區域可以包括包圍病灶的區域，例如，病灶的輪廓等等，在此不做限定。In another implementation scenario, in order to improve the convenience of lesion identification, the identification network may further include a region segmentation sub-network, so that the content of multiple medical images to be identified is analyzed by using the region segmentation sub-network of the identification network. The feature representation is used for lesion identification, and the lesion area in each medical image to be identified is obtained. Exemplarily, the area dividing subnet may adopt Unet, Vnet, etc., which is not limited herein. In addition, the lesion area may include an area surrounding the lesion, for example, the outline of the lesion, etc., which is not limited herein.

在又一個實施場景中，內容特徵表示可以由一張量來表示，例如，可以通過一低解析度的張量來表示內容特徵表示，在此不做限定。In yet another implementation scenario, the content feature representation may be represented by a tensor, for example, the content feature representation may be represented by a low-resolution tensor, which is not limited herein.

在又一個實施場景中，為了提高病灶識別的準確性，還可以將多個待識別醫學圖像的內容特徵表示進行第二融合處理，得到最終內容特徵表示，並對最終內容特徵表示進行病灶識別，得到每一待識別醫學圖像中的病灶區域，故能夠有利於使最終內容特徵表示補償單一待識別醫學圖像中可能存在的病灶不明顯或運動干擾產生的偽影等問題，從而在利用最終內容特徵表示進行病灶識別時，能夠提高病灶識別的準確性。示例性地，第二融合處理具體可以是將多個待識別醫學圖像的內容特徵表示進行拼接（concatenate）處理以實現對內容特徵表示的融合處理，在一些實施例中，在進行拼接處理後，可以經過幾個簡單的沒有池化的卷積層得到最終內容特徵表示；這裡，對內容特徵進行拼接處理可以視為張量的串聯運算；或者，也可以是將多個待識別醫學圖像的內容特徵表示進行相加（add）處理以實現對內容特徵表示的融合處理，具體可以視為張量的求和運算；或者，還可以將多個待識別醫學圖像的內容特徵表示進行堆疊，再利用諸如1*1的卷積核對堆疊後的內容特徵表示進行卷積運算，以實現對內容特徵表示的融合處理；或者，還可以將多個待識別醫學圖像的內容特徵表示進行加權處理，在此不做限定。此外，最終內容特徵表示和多個待識別醫學圖像的內容特徵表示的維度相同。In yet another implementation scenario, in order to improve the accuracy of lesion identification, a second fusion process may be performed on the content feature representations of multiple medical images to be identified to obtain a final content feature representation, and the final content feature representation is used for lesion identification. , to obtain the lesion area in each medical image to be identified, so that the final content feature representation can help to compensate for the problems of inconspicuous lesions or artifacts caused by motion interference that may exist in a single medical image to be identified. The final content feature representation can improve the accuracy of lesion identification when performing lesion identification. Exemplarily, the second fusion process may specifically be performing a concatenation process on the content feature representations of a plurality of medical images to be recognized to implement a fusion process on the content feature representations. In some embodiments, after the concatenation process is performed, , the final content feature representation can be obtained through several simple convolutional layers without pooling; here, the splicing of content features can be regarded as a concatenation operation of tensors; The content feature representation is added (add) to realize the fusion processing of the content feature representation, which can be regarded as the summation operation of tensors; or, the content feature representation of multiple medical images to be recognized can also be stacked, Then use a convolution kernel such as 1*1 to perform a convolution operation on the stacked content feature representations to realize the fusion processing of the content feature representations; or, the content feature representations of multiple medical images to be recognized can also be weighted. , which is not limited here. In addition, the final content feature representation and the content feature representation of the plurality of medical images to be recognized have the same dimension.

在又一個實施場景中，為了提升醫生體驗，還可以在得到待識別醫學圖像中的病灶區域之後，提示當前顯示的待識別醫學圖像的病灶區域。例如，可以採用預設線條（如，加粗線、點劃線、雙實線等）和/或預設顏色（如，黃色、紅色、綠色等）表示病灶區域；或者，還可以採用預設符號（例如，指向病灶區域的箭頭等）表示病灶區域，具體可以根據實際情況進行設置，在此不做限定。In yet another implementation scenario, in order to improve the doctor's experience, after obtaining the lesion area in the medical image to be recognized, the currently displayed lesion area of the medical image to be recognized may be prompted. For example, preset lines (eg, bold lines, dot-dash lines, double solid lines, etc.) and/or preset colors (eg, yellow, red, green, etc.) can be used to represent the lesion area; or, preset lines can also be used. Symbols (for example, arrows pointing to the lesion area, etc.) represent the lesion area, which can be set according to the actual situation, which is not limited here.

在又一個實施場景中，可以將上述經訓練的識別網路設置於影像後處理工作站、攝片工作站、電腦輔助閱片系統、遠端醫療診斷場景，雲平臺輔助智慧診斷場景等，從而能夠實現對待識別醫學圖像的自動識別，提高識別效率。In yet another implementation scenario, the above-mentioned trained identification network can be set up in image post-processing workstations, imaging workstations, computer-assisted image reading systems, remote medical diagnosis scenarios, cloud platform-assisted smart diagnosis scenarios, etc., so as to realize Automatic recognition of medical images to be recognized to improve recognition efficiency.

在一個實施場景中，至少一個待識別醫學圖像為對同一對象掃描得到的，故為了便於醫生閱片，在得到每一待識別醫學圖像所屬的掃描圖像類別之後，還可以將至少一個待識別醫學圖像按照其掃描圖像類別進行排序，例如，可以按照T1加權反相成像、T1加權同相成像、造影前平掃、動脈早期、動脈晚期、門脈期、延遲期、T2加權成像、擴散加權成像、表面擴散係數成像的預設順序進行排序，此外，預設順序還可以根據醫生習慣進行設置，在此不做限定，從而能夠提高醫生閱片的便捷性。In an implementation scenario, at least one medical image to be recognized is obtained by scanning the same object, so in order to facilitate the doctor to read the image, after obtaining the scanned image category to which each medical image to be recognized belongs, at least one medical image can also be scanned. The medical images to be identified are sorted according to their scan image categories, for example, T1-weighted inverse imaging, T1-weighted in-phase imaging, pre-contrast scan, early arterial, late arterial, portal venous phase, delayed phase, and T2-weighted imaging , Diffusion Weighted Imaging, and Surface Diffusion Coefficient Imaging in the preset order. In addition, the preset order can also be set according to the doctor's habit, which is not limited here, so as to improve the convenience of the doctor's reading.

在另一個實施場景中，為了進一步提高閱片的便捷性，還可以將按照掃描圖像類別進行排序後的至少一個待識別醫學圖像進行同屏顯示，例如，待識別醫學圖像的數量為5個，則可以在5個顯示視窗中分別顯示待識別醫學圖像。故此，能夠降低醫生翻閱多個待識別醫學圖像來回對照的時間，提升閱片效率。In another implementation scenario, in order to further improve the convenience of image reading, at least one medical image to be recognized sorted according to the scan image category can also be displayed on the same screen. For example, the number of medical images to be recognized is 5, the medical images to be recognized can be displayed in the 5 display windows respectively. Therefore, it is possible to reduce the time for doctors to review multiple medical images to be recognized and compare them back and forth, and improve the efficiency of reading images.

在又一個實施場景中，至少一個待識別醫學圖像為對同一對象掃描得到的，故為了在掃描過程中進行品質控制，在得到每一待識別醫學圖像所屬的掃描圖像類別之後，還可以判斷待識別醫學圖像的掃描圖像類別是否存在重複，並在存在重複時，輸出第一預警資訊，以提示掃描人員。例如，若存在兩張掃描圖像類別均為“延遲期”的待識別醫學圖像，則可以認為掃描過程中存在掃描品質不合規的情況，故為了提示掃描人員，可以輸出第一預警資訊，示例性地，可以輸出預警原因（如，存在掃描圖像類別重複的待識別醫學圖像等）。或者，在得到每一待識別醫學圖像所屬的掃描圖像類別之後，還可以判斷至少一個待識別醫學圖像的掃描圖像類別中不存在預設掃描圖像類別，並在不存在預設掃描圖像類別時，輸出第二預警資訊，以提示掃描人員。例如，預設掃描圖像類別為“門脈期”，若至少一個待識別醫學圖像中不存在掃描圖像類別為“門脈期”的圖像，則可以認為掃描過程中存在掃描品質不合規的情況，故為了提示掃描人員，可以輸出第二預警資訊，示例性地，可以輸出預警原因（如，待識別醫學圖像中不存在門脈期圖像等）。或者，在得到每一待識別醫學圖像所屬的掃描圖像類別之後，還可以判斷待識別醫學圖像的掃描圖像類別的分類置信度是否小於預設置信度閾值，並在待識別醫學圖像的掃描圖像類別的分類置信度小於預設置信度閾值時，輸出第三預警資訊，以提示掃描人員。示例性地，分類置信度可以是分類處理子網路在進行分類處理時預測得到的。例如，分類處理子網路在進行分類處理時，預測得到每個待識別醫學圖像的掃描圖像類別和對應的分類置信度，若掃描圖像類別為“門脈期”的待識別醫學圖像的分類置信度（如，20%）低於預設置信度閾值（如，90%），則可以認為掃描圖像類別為“門脈期”的待識別醫學圖像的圖像品質不合規，故為了提示掃描人員，可以輸出第三預警資訊，示例性地，可以輸出預警原因（如，待識別醫學圖像可能存在掃描品質不佳的問題）。故此，能夠在掃描過程中實現圖像質控，以在與實際相悖時，能夠及時糾錯，避免病人二次掛號。In yet another implementation scenario, at least one medical image to be recognized is obtained by scanning the same object, so in order to perform quality control during the scanning process, after obtaining the scanned image category to which each medical image to be recognized belongs, the It can be judged whether the scanned image categories of the medical images to be identified are duplicated, and when duplicates exist, first warning information is output to prompt the scanning personnel. For example, if there are two medical images to be identified with both scanned image categories as "delayed period", it can be considered that the scanning quality is not compliant during the scanning process. Therefore, in order to remind the scanning personnel, the first warning information can be output , exemplarily, the cause of the warning (eg, there are medical images to be identified with duplicate types of scanned images, etc.) can be output. Alternatively, after obtaining the scanned image category to which each to-be-recognized medical image belongs, it can also be determined that there is no preset scanned image category in the scanned image category of at least one to-be-identified medical image, and if there is no preset scanned image category. When scanning the image category, output the second warning information to remind the scanning personnel. For example, the preset scanned image category is "portal phase". If there is no image with the scanned image category of "portal phase" in at least one of the medical images to be identified, it can be considered that there is an inconsistency in scan quality during the scanning process. In order to prompt the scanning personnel, the second warning information may be output, for example, the warning reason (for example, there is no portal phase image in the to-be-recognized medical image, etc.) may be output. Alternatively, after obtaining the scanned image category to which each medical image to be recognized belongs, it is also possible to determine whether the classification confidence of the scanned image category of the medical image to be recognized is less than a preset reliability threshold, and the When the classification confidence of the scanned image category of the image is less than the preset confidence threshold, the third warning information is output to remind the scanning personnel. Exemplarily, the classification confidence may be predicted by the classification processing sub-network during the classification processing. For example, when the classification processing sub-network performs the classification processing, it predicts the scanned image category and the corresponding classification confidence of each medical image to be recognized. The classification confidence (for example, 20%) of the image is lower than the preset confidence threshold (for example, 90%), it can be considered that the image quality of the medical image to be recognized whose scan image category is "portal phase" is not suitable. Therefore, in order to prompt the scanning personnel, the third warning information can be output, for example, the warning reason can be output (for example, the medical image to be recognized may have a problem of poor scanning quality). Therefore, the image quality control can be realized during the scanning process, so that when it is contrary to the actual situation, the error can be corrected in time and the second registration of the patient can be avoided.

上述方案，通過獲取多個待識別醫學圖像，並提取每一待識別醫學圖像的風格特徵表示，從而對多個待識別醫學圖像的風格特徵表示進行分類處理，故能夠在分類處理時，考慮多個待識別醫學圖像在各自風格特徵上的差異，進而能夠提高識別得到的掃描圖像類別的準確性，且由於能夠對多個待識別醫學圖像的風格特徵表示進行分類處理，並得到每一待識別醫學圖像的掃描圖像類別，故能夠一次得到多個待識別醫學圖像的掃描圖像類別，從而能夠提高圖像識別的效率，故此，上述方案能夠提高圖像識別的效率和準確性。In the above scheme, by acquiring a plurality of medical images to be recognized, and extracting the style feature representation of each medical image to be recognized, the style feature representation of the plurality of medical images to be recognized is classified and processed, so it can be used in the classification process. , considering the differences in the respective style features of the multiple medical images to be identified, thereby improving the accuracy of the recognized scanned image category, and because the style feature representation of the multiple medical images to be identified can be classified and processed, and obtain the scanned image category of each medical image to be recognized, so multiple scanned image categories of the medical image to be recognized can be obtained at one time, thereby improving the efficiency of image recognition. Therefore, the above solution can improve image recognition. efficiency and accuracy.

請參閱圖2，圖2是訓練識別網路一實施例的流程示意圖。具體而言，識別網路包括前述實施例中的內容編碼子網路、風格編碼子網路、分類處理子網路和區域分割子網路，具體過程如下。Please refer to FIG. 2 , which is a schematic flowchart of an embodiment of training a recognition network. Specifically, the identification network includes the content encoding sub-network, the style encoding sub-network, the classification processing sub-network and the area segmentation sub-network in the foregoing embodiments, and the specific process is as follows.

步驟S21：獲取多個樣本醫學圖像，其中，多個樣本醫學圖像標注有其真實掃描圖像類別和真實病灶區域。Step S21 : acquiring multiple sample medical images, wherein the multiple sample medical images are marked with their real scanned image categories and real lesion areas.

一次訓練過程中所使用的多個樣本醫學圖像可以是對同一對象掃描得到的。例如，某次訓練所採用的樣本醫學圖像可以是對對象A掃描得到的，另一次訓練所採用的樣本醫學圖像可以是對對象B掃描得到的。The multiple sample medical images used in a training process can be obtained by scanning the same object. For example, the sample medical image used in a certain training may be obtained by scanning object A, and the sample medical image used in another training may be obtained by scanning object B.

樣本醫學圖像也可以包括CT圖像、MR圖像，在此不做限定。具體可以參閱前述實施例中的待識別醫學圖像，在此不再贅述。The sample medical images may also include CT images and MR images, which are not limited herein. For details, reference may be made to the medical images to be recognized in the foregoing embodiments, and details are not described herein again.

樣本醫學圖像所標注的真實掃描圖像類別和真實病灶區域可以是臨床醫生、影像科醫生等具有醫學影像知識的人員標注的。掃描圖像類別具體可以根據實際情況進行設置，例如，樣本醫學圖像是對肝臟進行掃描得到的，則掃描圖像類別具體可以包括與時序有關的造影前平掃、動脈早期、動脈晚期、門脈期、延遲期；或者，掃描圖像類別還可以包括與描參數有關的T1加權反相成像、T1加權同相成像、T2加權成像、擴散加權成像、表面擴散係數成像，具體可以參閱前述實施例中的相關步驟，在此不再贅述。當樣本醫學圖像是對其他臟器掃描得到時，可以以此類推，在此不再一一舉例。真實病灶區域可以採用多邊形進行標注，例如，可以採用多邊形標注病灶輪廓等，在此不做限定。The real scanned image category and real lesion area marked by the sample medical image can be marked by clinicians, radiologists and other personnel with medical imaging knowledge. The scan image category can be set according to the actual situation. For example, if the sample medical image is obtained by scanning the liver, the scan image category can specifically include the timing-related pre-contrast scan, early arterial, late arterial, and portal scans. Pulse phase, delay phase; or, the scan image category may also include T1-weighted inverse imaging, T1-weighted in-phase imaging, T2-weighted imaging, diffusion-weighted imaging, and surface diffusion coefficient imaging related to scanning parameters. For details, please refer to the foregoing embodiments. The relevant steps in are not repeated here. When the sample medical image is obtained by scanning other organs, it can be deduced by analogy, and examples will not be given here. The real lesion area may be marked with polygons, for example, the contour of the lesion may be marked with polygons, etc., which is not limited herein.

請結合參閱圖3，圖3是訓練識別網路一實施例的狀態示意圖。如圖3所示，在一次訓練過程中，可以獲取樣本醫學圖像1、樣本醫學圖像2、樣本醫學圖像3、……、樣本醫學圖像n，其中，n的數值可以根據實際情況進行設置，例如，當掃描圖像類別包括與時序有關的造影前平掃、動脈早期、動脈晚期、門脈期、延遲期時，n的數值可以設置為小於或等於5的整數，例如，5、4、3等等；或者，當掃描圖像類別包括與描參數有關的T1加權反相成像、T1加權同相成像、T2加權成像、擴散加權成像、表面擴散係數成像時，n的數值可以設置為小於或等於5的整數，例如，5、4、3等等；或者，當掃描圖像類別既包括與描參數有關的T1加權反相成像、T1加權同相成像、T2加權成像、擴散加權成像、表面擴散係數成像，也包括與時序有關的造影前平掃、動脈早期、動脈晚期、門脈期、延遲期時，n的數值可以設置為小於或等於10的整數，例如，10、9、8等等，具體可以根據實際情況進行設置，在此不做限定。Please refer to FIG. 3 . FIG. 3 is a schematic state diagram of an embodiment of a training identification network. As shown in Figure 3, in a training process, sample medical image 1, sample medical image 2, sample medical image 3, ..., sample medical image n can be obtained, where the value of n can be based on the actual situation For example, when the scan image category includes time-related pre-contrast scan, early arterial, late arterial, portal venous phase, and delayed phase, the value of n can be set to an integer less than or equal to 5, for example, 5 , 4, 3, etc.; or, when the scanned image category includes T1-weighted inverse imaging, T1-weighted in-phase imaging, T2-weighted imaging, diffusion-weighted imaging, and surface diffusion coefficient imaging related to scanning parameters, the value of n can be set is an integer less than or equal to 5, for example, 5, 4, 3, etc.; or, when the scan image category includes both T1-weighted inverse imaging, T1-weighted in-phase imaging, T2-weighted imaging, and diffusion-weighted imaging related to scanning parameters , surface diffusion coefficient imaging, including timing-related pre-contrast scan, early arterial, late arterial, portal venous phase, and delayed phase, the value of n can be set to an integer less than or equal to 10, for example, 10, 9, 8, etc., which can be set according to the actual situation, which is not limited here.

步驟S22：利用風格編碼子網路分別提取每一樣本醫學圖像的樣本風格特徵表示，並利用內容編碼子網路分別提取每一樣本醫學圖像的樣本內容特徵表示。Step S22 : using the style coding sub-network to extract the sample style feature representation of each sample medical image, and using the content coding sub-network to separately extract the sample content feature representation of each sample medical image.

風格編碼子網路和內容編碼子網路的網路結構具體可以參閱前述實施例中的相關步驟，在此不再贅述。請結合參閱圖3，樣本醫學圖像1、樣本醫學圖像2、樣本醫學圖像3、……、樣本醫學圖像n在經過風格編碼子網路進行風格特徵提取之後，可以分別得到樣本風格特徵表示1、樣本風格特徵表示2、樣本風格特徵表示3、……、樣本風格特徵表示n；經過內容編碼子網路進行內容特徵提取之後，可以分別得到樣本內容特徵表示1、樣本內容特徵表示2、樣本內容特徵表示3、……、樣本內容特徵表示n。樣本風格特徵表示和樣本內容特徵表示具體可以參考前述實施例中的風格特徵表示、內容特徵表示，在此不再贅述。For the network structures of the style encoding sub-network and the content encoding sub-network, reference may be made to the relevant steps in the foregoing embodiments, which will not be repeated here. Please refer to Figure 3 in conjunction with the sample medical image 1, the sample medical image 2, the sample medical image 3, ..., and the sample medical image n after the style feature extraction is performed by the style coding sub-network, the sample styles can be obtained respectively. Feature representation 1, sample style feature representation 2, sample style feature representation 3, ..., sample style feature representation n; after content feature extraction by the content coding sub-network, sample content feature representation 1, sample content feature representation can be obtained respectively 2. Sample content feature representation 3, ..., sample content feature representation n. For the sample style feature representation and the sample content feature representation, reference may be made to the style feature representation and the content feature representation in the foregoing embodiments, which will not be repeated here.

步驟S23：利用分類處理子網路對多個樣本醫學圖像的樣本風格特徵表示進行分類處理，得到每一樣本醫學圖像的預測掃描圖像類別，並利用區域分割子網路對多個樣本醫學圖像的樣本內容特徵表示進行病灶識別，得到每一樣本醫學圖像中的預測病灶區域。Step S23: Use the classification processing sub-network to classify and process the sample style feature representations of the multiple sample medical images to obtain the predicted scanning image category of each sample medical image, and use the region segmentation sub-network to classify the multiple samples. The feature representation of the sample content of the medical image is used for lesion identification, and the predicted lesion area in each sample medical image is obtained.

在一個實施場景中，可以對多個樣本醫學圖像的樣本風格特徵表示進行拼接處理，得到最終樣本風格特徵表示，並利用分類處理子網路對最終樣本風格特徵表示進行分類處理，得到每一樣本醫學圖像的預測掃描圖像類別，從而最終樣本風格特徵表示能夠表示每一樣本醫學圖像的樣本風格特徵表示與其他樣本醫學圖像的樣本風格特徵表示之間的差異，故利用分類處理子網路對最終樣本風格特徵表示進行分類處理時，能夠提高分類處理的準確性。In an implementation scenario, the sample style feature representations of multiple sample medical images can be spliced to obtain the final sample style feature representation, and the final sample style feature representation can be classified by using the classification processing sub-network to obtain each sample style feature representation. The predicted scan image category of this medical image, so that the final sample style feature representation can represent the difference between the sample style feature representation of each sample medical image and the sample style feature representation of other sample medical images, so the classification process is used. When the sub-network performs classification processing on the final sample style feature representation, the accuracy of the classification processing can be improved.

如圖3所示，在利用分類處理子網路對樣本醫學圖像1、樣本醫學圖像2、樣本醫學圖像3、……、樣本醫學圖像n的樣本風格特徵表示進行分類處理時，可以將上述樣本醫學圖像的樣本風格特徵進行拼接處理（或堆疊處理等其他處理方式，具體可以參閱前述公開實施例，在此不再贅述），得到最終樣本風格特徵表示，從而利用分類處理子網路對最終樣本風格特徵表示進行分類處理，得到樣本醫學圖像1、樣本醫學圖像2、樣本醫學圖像3、……、樣本醫學圖像n的預測掃描圖像類別。As shown in FIG. 3 , when the sample medical image 1, sample medical image 2, sample medical image 3, . The sample style features of the above-mentioned sample medical images can be spliced (or other processing methods such as stacking processing, for details, please refer to the aforementioned disclosed embodiments, which will not be repeated here) to obtain the final sample style feature representation, so as to use the classification processor. The network performs classification processing on the final sample style feature representation, and obtains the predicted scanning image categories of sample medical image 1, sample medical image 2, sample medical image 3, ..., and sample medical image n.

在一個實施場景中，可以將多個樣本醫學圖像的樣本內容特徵表示進行融合處理，得到最終樣本內容特徵表示，並利用區域分割子網路對最終樣本內容特徵表示進行病灶識別，得到每一樣本醫學圖像中的預測病灶區域，從而能夠有利於使最終樣本內容特徵表示補償單一樣本醫學圖像中可能存在的病灶不明顯或運動干擾產生的偽影等問題，從而在利用區域分割子網路對最終樣本內容特徵表示進行病灶識別時，能夠提高病灶識別的準確性。示例性地，上述融合處理的操作可以包括以下任一者：將多個樣本醫學圖像的樣本內容特徵表示進行拼接處理，利用至少一個卷積層對多個樣本醫學圖像的樣本內容特徵表示進行特徵提取，且最終樣本內容特徵表示和多個樣本醫學圖像的內容特徵表示的維度相同。具體可以參閱前述實施例中的相關步驟，在此不再贅述。In one implementation scenario, the sample content feature representations of multiple sample medical images can be fused to obtain the final sample content feature representation, and the region segmentation sub-network can be used to identify lesions on the final sample content feature representation to obtain each sample content feature representation. The predicted lesion area in this medical image can help to make the final sample content feature representation to compensate for problems such as inconspicuous lesions or artifacts caused by motion interference that may exist in a single sample medical image, so as to use the area segmentation subnet. When the path is used to identify the final sample content feature representation, the accuracy of the lesion identification can be improved. Exemplarily, the operation of the fusion processing may include any one of the following: performing a splicing process on the sample content feature representations of multiple sample medical images, and performing a splicing process on the sample content feature representations of the multiple sample medical images by using at least one convolution layer. Feature extraction, and the final sample content feature representation has the same dimension as the content feature representation of multiple sample medical images. For details, reference may be made to the relevant steps in the foregoing embodiments, which will not be repeated here.

如圖3所示，在利用區域分割子網路對樣本醫學圖像1、樣本醫學圖像2、樣本醫學圖像3、……、樣本醫學圖像n的樣本內容特徵表示進行病灶識別時，可以將上述樣本醫學圖像的樣本內容特徵表示進行融合處理（如，拼接處理、相加處理等，具體可以參閱前述公開實施例，在此不再贅述），得到最終樣本內容特徵表示，從而利用區域分割子網路對最終樣本內容特徵表示進行病灶識別，得到樣本醫學圖像1、樣本醫學圖像2、樣本醫學圖像3、……、樣本醫學圖像n中的預測病灶區域。As shown in FIG. 3 , when using the region segmentation sub-network to perform lesion identification on the sample content feature representation of sample medical image 1, sample medical image 2, sample medical image 3, . . . , sample medical image n, The sample content feature representation of the above-mentioned sample medical images can be subjected to fusion processing (eg, splicing processing, addition processing, etc., for details, please refer to the aforementioned disclosed embodiments, which will not be repeated here) to obtain the final sample content feature representation. The region segmentation sub-network performs lesion identification on the final sample content feature representation, and obtains the predicted lesion regions in the sample medical image 1, the sample medical image 2, the sample medical image 3, ... and the sample medical image n.

步驟S24：利用真實掃描圖像類別和預測掃描圖像類別的差異，調整風格編碼子網路和分類處理子網路的網路參數，以及利用真實病灶區域和預測病灶區域的差異，調整內容編碼子網路和區域分割子網路的網路參數。Step S24 : Adjust the network parameters of the style coding sub-network and the classification processing sub-network by using the difference between the real scanned image category and the predicted scanned image category, and adjust the content coding by using the difference between the real lesion area and the predicted lesion area Network parameters for subnetworks and zone-split subnetworks.

在一個實施場景中，可以利用真實掃描圖像類別和預測掃描圖像類別，計算得到風格編碼子網路和分類處理子網路的第一損失值，並利用第一損失值調整風格編碼子網路和分類處理子網路的網路參數。在一個實施場景中，可以採用交叉熵損失（cross entropy loss）或者softmax邏輯損失（logistic softmax loss）等來計算第一損失值，在此不做限定。In an implementation scenario, the real scanned image category and the predicted scanned image category can be used to calculate the first loss value of the style encoding sub-network and the classification processing sub-network, and use the first loss value to adjust the style encoding sub-network Roads and classes handle network parameters for subnetworks. In an implementation scenario, a cross entropy loss (cross entropy loss) or a softmax logistic loss (logistic softmax loss) or the like may be used to calculate the first loss value, which is not limited herein.

在一個實施場景中，可以利用真實病灶區域和預測病灶區域，計算得到內容編碼子網路和區域分割子網路的第二損失值，並利用第二損失值調整內容編碼子網路和區域分割子網路的網路參數。在一個實施場景中，可以採用二分類交叉熵損失（binary cross-entropy loss）或者dice係數損失來計算第二損失值，在此不做限定。示例性地，dice係數損失是一種集合相似度度量函數，通常用於計算兩個樣本的相似度（範圍為0~1），具體可以採用公式（1）進行計算：

（1）上式中，

表示以dice係數損失計算得到的第二損失值，

表示真實病灶區域，

表示預測病灶區域，

表示真實病灶區域與預測病灶區域的交集。 In an implementation scenario, the actual lesion area and the predicted lesion area can be used to calculate the second loss value of the content coding sub-network and the region segmentation sub-network, and the second loss value can be used to adjust the content coding sub-network and the region segmentation Network parameters for the subnet. In an implementation scenario, a binary cross-entropy loss (binary cross-entropy loss) or a dice coefficient loss may be used to calculate the second loss value, which is not limited herein. Exemplarily, the dice coefficient loss is an ensemble similarity measure function, which is usually used to calculate the similarity between two samples (ranging from 0 to 1). Specifically, formula (1) can be used to calculate:

(1) In the above formula,

represents the second loss value calculated by the dice coefficient loss,

represents the real lesion area,

represents the predicted lesion area,

Represents the intersection of the real lesion area and the predicted lesion area.

在一個實施場景中，為了提高掃描圖像類別識別的準確性，還可以在訓練過程中，獲取每一樣本醫學圖像的樣本風格特徵表示的樣本資料分佈情況，並利用樣本資料分佈情況之間的差異，調整風格編碼子網路的網路參數，進而能夠有利於使得後續提取到的風格特徵表示之間相互獨立，故能夠有利於提高識別到的掃描圖像類別的準確性。示例性地，可以在訓練過程中，採用KL散度度量樣本資料分佈情況之間的差異，並將其作為第三損失值，以此來約束風格特徵表示的分佈。KL散度（Kullback–Leibler divergence），又稱相對熵（relative entropy），是描述兩個概率分佈P和Q差異的一種方法，具體可以採用公式（2）進行計算：

（2）上式中，

和

分別表示兩個樣本風格特徵表示的樣本資料分佈情況，

表示其中一個樣本風格特徵表示的概率分佈期望，

和

分別表示兩個樣本風格特徵表示中的元素 x在各自樣本資料分佈情況中的分佈概率。在一個實施場景中，可以採用高斯分佈函數獲取樣本風格特徵表示的樣本資料分佈情況，從而通過上述訓練，能夠使風格編碼子網路後續提取得到的風格特徵表示為中心相同且各向異性的高斯分佈，進而故能夠有利於提高識別到的掃描圖像類別的準確性。 In an implementation scenario, in order to improve the accuracy of scanning image category recognition, the distribution of sample data represented by the sample style features of each sample medical image can also be obtained during the training process, and use the difference between the distribution of sample data. Adjust the network parameters of the style coding sub-network, which can help to make the subsequent extracted style feature representations independent of each other, so it can help to improve the accuracy of the identified scanned image categories. Exemplarily, in the training process, the KL divergence can be used to measure the difference between the distributions of the sample data, and use it as the third loss value, so as to constrain the distribution of the style feature representation. KL divergence (Kullback–Leibler divergence), also known as relative entropy (relative entropy), is a method to describe the difference between two probability distributions P and Q, which can be calculated by formula (2):

(2) In the above formula,

and

respectively represent the distribution of the sample data represented by the two sample style features,

represents the probability distribution expectation represented by one of the sample style features,

and

Respectively represent the distribution probability of the element x in the two sample style feature representations in the distribution of the respective sample data. In an implementation scenario, a Gaussian distribution function can be used to obtain the distribution of sample data represented by the sample style features, so that through the above training, the style features obtained by the style coding sub-network subsequently extracted can be represented as Gaussians with the same center and anisotropy distribution, and thus can help to improve the accuracy of the identified scanned image categories.

在一個實施場景中，為了能夠使風格編碼子網路盡可能地提取到完整準確的風格特徵，而內容編碼子網路盡可能地提取到完整準確的風格特徵，還可以利用一樣本風格特徵表示和一樣本內容特徵表示，構建得到與樣本風格特徵表示對應的重建圖像，並利用重建圖像與對應的樣本風格特徵表示所屬的樣本醫學圖像之間的差異，調整風格編碼子網路和內容編碼子網路的網路參數，從而能夠在訓練過程中使重建圖像與對應的樣本醫學圖像盡可能地相同，進而能夠使風格編碼子網路盡可能地提取到完整準確的風格特徵，而內容編碼子網路盡可能地提取到完整準確的風格特徵，故此能夠有利於提高後續掃描圖像類別以及病灶識別的準確性，示例性地，可以將重建圖像與對應的樣本風格特徵表示所屬的樣本醫學圖像之間的差異作為第四損失值。在一個實施場景中，可以利用每一樣本醫學圖像的樣本風格特徵表示和樣本內容特徵表示，得到對應樣本醫學圖像的域內重建圖像，並利用每一樣本醫學圖像及其域內重建圖像的差異，調整風格編碼子網路和內容編碼子網路的網路參數，從而確保分解得到的樣本內容特徵表示和樣本風格特徵表示能夠穩定重建其自身，防止中途變異。在另一個實施場景中，還可以利用每一樣本醫學圖像的樣本風格特徵表示和任一其他樣本醫學圖像的樣本內容特徵表示（或最終樣本內容特徵表示），得到對應樣本醫學圖像的跨域重建圖像，並利用每一樣本醫學圖像及其跨域重建圖像的差異，調整風格編碼子網路和內容編碼子網路的網路參數，從而確保提取得到的內容特徵表示是醫學圖像之間真正彼此重合的基礎特徵集。示例性地，在訓練過程中，可以利用生成器實現重建，利用鑒別器鑒定重建圖像是真實的樣本醫學圖像還是重建得到的圖像，故可以採用生成對抗損失（Generative Adversarial Networks loss，GAN loss）來度量上述跨域重建的損失值，以及L1範數損失來度量上述域內重建的損失值，具體在此不再贅述。In an implementation scenario, in order to enable the style encoding sub-network to extract as complete and accurate style features as possible, and the content encoding sub-network to extract as complete and accurate style features as possible, a sample style feature representation can also be used. With the same sample content feature representation, a reconstructed image corresponding to the sample style feature representation is constructed, and the difference between the reconstructed image and the corresponding sample style feature representation belongs to the sample medical image, and the style coding sub-network and the corresponding sample medical image are adjusted. The network parameters of the content coding subnet can make the reconstructed image and the corresponding sample medical image as similar as possible during the training process, so that the style coding subnet can extract the complete and accurate style features as much as possible. , and the content coding sub-network can extract complete and accurate style features as much as possible, so it can help improve the accuracy of subsequent scanned image categories and lesion identification. Exemplarily, the reconstructed image can be compared with the corresponding sample style features The difference between the belonging sample medical images is represented as the fourth loss value. In an implementation scenario, the sample style feature representation and the sample content feature representation of each sample medical image can be used to obtain an intra-domain reconstructed image of the corresponding sample medical image, and each sample medical image and its in-domain reconstructed image can be obtained. Reconstruct the difference of the image, adjust the network parameters of the style coding sub-network and the content coding sub-network, so as to ensure that the decomposed sample content feature representation and sample style feature representation can reconstruct themselves stably and prevent midway variation. In another implementation scenario, the sample style feature representation of each sample medical image and the sample content feature representation (or final sample content feature representation) of any other sample medical image can also be used to obtain the corresponding sample medical image Cross-domain reconstructed images, and use the differences between each sample medical image and its cross-domain reconstructed images to adjust the network parameters of the style coding sub-network and the content coding sub-network, so as to ensure that the extracted content feature representation is correct. The underlying feature set that truly coincides with each other between medical images. Exemplarily, in the training process, the generator can be used to achieve reconstruction, and the discriminator can be used to identify whether the reconstructed image is a real sample medical image or a reconstructed image, so Generative Adversarial Networks loss (GAN) can be used. loss) to measure the loss value of the above-mentioned cross-domain reconstruction, and L1 norm loss to measure the loss value of the above-mentioned intra-domain reconstruction, which will not be repeated here.

如圖3所示，在進行圖像重建時，可以將樣本醫學圖像1、樣本醫學圖像2、樣本醫學圖像3、……、樣本醫學圖像n分別對應的樣本內容特徵表示1、樣本內容特徵表示2、樣本內容特徵表示3、……、樣本內容特徵表示n進行融合處理，得到最終樣本內容特徵表示，並分別將樣本醫學圖像1、樣本醫學圖像2、樣本醫學圖像3、……、樣本醫學圖像n對應的樣本風格特徵表示1、樣本風格特徵表示2、樣本風格特徵表示3、……、樣本風格特徵表示n與最終樣本內容特徵表示進行重建，得到重建圖像1、重建圖像2、重建圖像3、……、重建圖像n，從而實現域內重建，並採用L1範數損失來度量上述域內重建的損失值。此外，還可以採用樣本醫學圖像1對應的樣本風格特徵表示1與其他樣本醫學圖像（即樣本醫學圖像2~n）對應的樣本內容特徵表示進行跨域重建，其他樣本醫學圖像以此類推，並採用生成對抗損失（GAN loss）來度量上述跨域重建的損失值。As shown in Fig. 3, when performing image reconstruction, the sample content features corresponding to sample medical image 1, sample medical image 2, sample medical image 3, ... and sample medical image n can be represented as 1, The sample content feature representation 2, the sample content feature representation 3, ..., the sample content feature representation n are fused to obtain the final sample content feature representation, and the sample medical image 1, the sample medical image 2, and the sample medical image are respectively 3, ..., the sample style feature representation corresponding to the sample medical image n 1, the sample style feature representation 2, the sample style feature representation 3, ..., the sample style feature representation n and the final sample content feature representation are reconstructed to obtain a reconstructed image Like 1, reconstructed image 2, reconstructed image 3, ..., reconstructed image n, so as to achieve intra-domain reconstruction, and use the L1 norm loss to measure the loss value of the above-mentioned intra-domain reconstruction. In addition, the sample style feature representation 1 corresponding to the sample medical image 1 and the sample content feature representation corresponding to other sample medical images (ie, sample medical images 2~n) can also be used for cross-domain reconstruction, and other sample medical images are represented by And so on, and adopt the generative adversarial loss (GAN loss) to measure the loss value of the above cross-domain reconstruction.

在一個實施場景中，還可以同時計算上述第一損失值、第二損失值、第三損失值和第四損失值，並根據這些損失值來對識別網路的網路參數進行調整，以提高內容編碼子網路對於病灶相關的內容特徵的獲取度，使風格編碼子網路不回應與病灶相關的特徵，提高圖像識別的魯棒性，並使得後續提取到的風格特徵表示之間相互獨立，且使風格編碼子網路能夠提取到完整準確的風格特徵表示，內容編碼子網路能夠提取到完整準確的內容特徵表示，從而提高圖像識別的準確性。In an implementation scenario, the above-mentioned first loss value, second loss value, third loss value and fourth loss value may also be calculated at the same time, and the network parameters of the identification network may be adjusted according to these loss values, so as to improve the The degree of acquisition of content features related to lesions by the content coding sub-network makes the style coding sub-network not respond to features related to lesions, improves the robustness of image recognition, and makes the subsequent extracted style feature representations interact with each other. It is independent and enables the style encoding sub-network to extract complete and accurate style feature representation, and the content encoding sub-network can extract complete and accurate content feature representation, thereby improving the accuracy of image recognition.

區別於前述實施例，在對風格編碼子網路和分類處理子網路進行訓練的同時，加入對內容編碼子網路和區域分割子網路的訓練，從而能夠在提高區域分割子網路的病灶識別能力的同時，提高內容編碼子網路對於病灶相關的內容特徵的獲取度，進而能夠有利於使得風格編碼子網路不響應與病灶相關的特徵，使後續分類時不受病灶相關特徵的影響，故能夠提高圖像識別的魯棒性。Different from the foregoing embodiments, while training the style coding sub-network and the classification processing sub-network, the training of the content coding sub-network and the region segmentation sub-network is added, so that the performance of the region segmentation sub-network can be improved. At the same time, it can improve the acquisition of content features related to lesions by the content coding subnet, which can help make the style coding subnet not respond to the features related to lesions, so that the subsequent classification will not be affected by the features related to lesions. Therefore, the robustness of image recognition can be improved.

請參閱圖4，圖4是本發明圖像識別裝置40一實施例的方塊示意圖。圖像識別裝置40包括圖像獲取模組41、風格提取模組42和分類處理模組43，圖像獲取模組41配置為獲取多個待識別醫學圖像；風格提取模組42配置為分別提取每一待識別醫學圖像的風格特徵表示；分類處理模組43配置為對多個待識別醫學圖像的風格特徵表示進行分類處理，得到每一待識別醫學圖像的掃描圖像類別。Please refer to FIG. 4 , which is a block diagram of an embodiment of an image recognition apparatus 40 of the present invention. The image recognition device 40 includes an image acquisition module 41, a style extraction module 42 and a classification processing module 43. The image acquisition module 41 is configured to acquire a plurality of medical images to be recognized; the style extraction module 42 is configured to separately Extract the style feature representation of each to-be-recognized medical image; the classification processing module 43 is configured to classify and process the style-feature representations of a plurality of to-be-recognized medical images to obtain the scanned image category of each to-be-recognized medical image.

在一些實施例中，分類處理模組43包括第一融合處理子模組，配置為將多個待識別醫學圖像的風格特徵表示進行第一融合處理，得到最終風格特徵表示；分類處理模組43包括分類處理子模組，配置為對最終風格特徵表示進行分類處理，得到每一待識別醫學圖像的掃描圖像類別。In some embodiments, the classification processing module 43 includes a first fusion processing sub-module, configured to perform a first fusion processing on the style feature representations of a plurality of to-be-recognized medical images to obtain a final style feature representation; the classification processing module 43 includes a classification processing sub-module configured to perform classification processing on the final style feature representation to obtain the scanned image category of each medical image to be identified.

區別於前述實施例，在對多個待識別醫學圖像的風格特徵表示進行分類處理時，將多個待識別醫學圖像的風格特徵表示進行第一融合處理，得到最終風格特徵表示，故最終風格特徵表示能夠表示每一待識別醫學圖像的風格特徵表示與其他待識別醫學圖像的風格特徵表示之間的差異，故利用最終風格特徵表示進行分類處理能夠提高識別得到的掃描圖像類別的準確性。Different from the foregoing embodiments, when the style feature representations of multiple medical images to be recognized are classified and processed, the style feature representations of the multiple unidentified medical images are subjected to a first fusion process to obtain the final style feature representation, so the final style feature representation is obtained. The style feature representation can represent the difference between the style feature representation of each medical image to be recognized and the style feature representation of other to-be-recognized medical images, so using the final style feature representation for classification processing can improve the classification of scanned images obtained by recognition. accuracy.

在一些實施例中，圖像識別裝置40還包括圖像排除模組、圖像顯示模組、第一預警模組、第二預警模組和第三預警模組中的至少一個模組。In some embodiments, the image recognition device 40 further includes at least one of an image exclusion module, an image display module, a first early warning module, a second early warning module and a third early warning module.

所述圖像排除模組，配置為將多個待識別醫學圖像按照其掃描圖像類別進行排序；所述圖像顯示模組，配置為將按照掃描圖像類別進行排序後的至少一個待識別醫學圖像進行同屏顯示；所述第一預警模組，配置為在待識別醫學圖像的掃描圖像類別存在重複時，輸出第一預警資訊，以提示掃描人員；所述第二預警模組，配置為在多個待識別醫學圖像的掃描圖像類別中不存在預設掃描圖像類別時，輸出第二預警資訊，以提示掃描人員；所述第三預警模組，配置為在所述待識別醫學圖像的掃描圖像類別的分類置信度小於預設置信度閾值時，輸出第三預警資訊，提示掃描人員。The image exclusion module is configured to sort a plurality of medical images to be identified according to their scanned image categories; the image display module is configured to sort at least one to-be-identified image according to the scanned image categories. Identifying medical images and displaying them on the same screen; the first warning module is configured to output first warning information to prompt the scanning personnel when there is a repetition of the scanned image categories of the medical images to be identified; the second warning The module is configured to output the second early warning information to prompt the scanning personnel when the preset scanning image category does not exist in the scanned image categories of the medical images to be identified; the third early warning module is configured as When the classification confidence of the scanned image category of the to-be-recognized medical image is less than the preset confidence threshold, output third warning information to prompt the scanning personnel.

區別於前述實施例，在確定得到每一待識別醫學圖像所屬的掃描圖像類別之後，執行將至少一個待識別醫學圖像按照其掃描圖像類別進行排序，能夠提高醫生閱片的便捷性；將按照掃描圖像類別進行排序後的至少一個待識別醫學圖像進行同屏顯示，能夠免去醫生翻閱待識別醫學圖像時來回對照，從而能夠提高醫生閱片的效率；在待識別醫學圖像的掃描圖像類別存在重複時，輸出第一預警資訊，以提示掃描人員，在至少一個待識別醫學圖像的掃描圖像類別中不存在預設掃描圖像類別時，輸出第二預警資訊，以提示掃描人員，能夠在掃描過程中實現圖像質控，以在與實際相悖時，能夠及時糾錯，避免病人二次掛號。Different from the foregoing embodiments, after the scanning image category to which each medical image to be recognized belongs is determined, the at least one medical image to be recognized is sorted according to its scanning image category, which can improve the convenience of doctor reading. ; Display at least one to-be-recognized medical image sorted according to the scanned image category on the same screen, which can save the doctor from checking back and forth when reading the to-be-recognized medical image, thereby improving the efficiency of the doctor's image reading; When the scanned image categories of the images are repeated, output the first warning information to prompt the scanning personnel, and output the second warning when there is no preset scanned image category in the scanned image categories of at least one of the medical images to be identified. Information to remind the scanning personnel, to achieve image quality control during the scanning process, to correct errors in time when it is inconsistent with the actual situation, and avoid the second registration of the patient.

在一些實施例中，圖像識別裝置40還包括預處理模組，配置為對每一待識別醫學圖像進行預處理，其中，預處理包括以下至少一種：將待識別醫學圖像的圖像尺寸調整至預設尺寸、將待識別醫學圖像的圖像強度歸一化至預設範圍。In some embodiments, the image recognition apparatus 40 further includes a preprocessing module configured to preprocess each medical image to be recognized, wherein the preprocessing includes at least one of the following: The size is adjusted to a preset size, and the image intensity of the medical image to be recognized is normalized to a preset range.

區別於前述實施例，在提取風格特徵表示之前，對每一目的地區域的圖像資料進行預處理，且預處理包括以下至少一種：將目的地區域的圖像尺寸調整至預設尺寸，將目的地區域的圖像強度歸一化至預設範圍，故能夠有利於提高後續圖像識別的準確性。Different from the foregoing embodiments, before extracting the style feature representation, preprocessing is performed on the image data of each destination area, and the preprocessing includes at least one of the following: adjusting the image size of the destination area to a preset size, The image intensity of the destination area is normalized to a preset range, which can help improve the accuracy of subsequent image recognition.

在一些實施例中，圖像識別裝置40還包括內容提取模組，配置為分別提取每一待識別醫學圖像的內容特徵表示；圖像識別裝置40還包括病灶識別模組，配置為對多個待識別醫學圖像的內容特徵表示進行病灶識別，得到每一待識別醫學圖像中的病灶區域。In some embodiments, the image recognition apparatus 40 further includes a content extraction module, configured to extract the content feature representation of each medical image to be recognized, respectively; the image recognition apparatus 40 further includes a lesion identification module, configured to multiple The content features of each medical image to be identified represent the lesion identification, and the lesion area in each medical image to be identified is obtained.

區別於前述實施例，通過提取每一待識別醫學圖像的內容特徵表示，並對多個待識別醫學圖像的內容特徵表示進行病灶識別，得到每一待識別醫學圖像中的病灶區域，能夠在得到每一待識別醫學圖像的掃描圖像類別的同時，確定其中的病灶區域，故能夠有利於提高整體閱片效能，同時能夠有利於消除病灶對掃描圖像類別識別帶來的干擾，從而能夠提高圖像識別的準確性。Different from the foregoing embodiments, by extracting the content feature representation of each to-be-recognized medical image, and performing lesion identification on the content-feature representations of a plurality of to-be-recognized medical images, the lesion area in each to-be-recognized medical image is obtained, It is possible to determine the lesion area while obtaining the scanned image category of each medical image to be identified, so it can help improve the overall reading performance, and at the same time, it can help eliminate the interference caused by lesions to the scanning image category recognition. , which can improve the accuracy of image recognition.

在一些實施例中，病灶識別模組包括第二融合處理子模組，配置為將多個待識別醫學圖像的內容特徵表示進行第二融合處理，得到最終內容特徵表示；病灶識別模組包括病灶識別子模組，配置為對最終內容特徵表示進行病灶識別，得到每一待識別醫學圖像中的病灶區域。In some embodiments, the lesion identification module includes a second fusion processing sub-module configured to perform a second fusion process on the content feature representations of multiple medical images to be identified to obtain a final content feature representation; the lesion identification module includes The lesion identification sub-module is configured to perform lesion identification on the final content feature representation to obtain the lesion area in each medical image to be identified.

區別於前述實施例，將多個待識別醫學圖像的內容特徵表示進行第二融合處理，得到最終內容特徵表示，能夠有利於使最終內容特徵表示補償單一待識別醫學圖像中可能存在的病灶不明顯或運動干擾產生的偽影等問題，從而在利用最終內容特徵表示進行病灶識別時，能夠提高病灶識別的準確性。Different from the foregoing embodiments, the second fusion processing is performed on the content feature representations of a plurality of medical images to be recognized to obtain a final content feature representation, which can help the final content feature representation to compensate for possible lesions in a single to-be-recognized medical image. Therefore, the accuracy of lesion identification can be improved when using the final content feature representation for lesion identification.

在一些實施例中，病灶識別模組還包括病灶提示子模組，配置為提示當前顯示的待識別醫學圖像的病灶區域。In some embodiments, the lesion identification module further includes a lesion prompting sub-module configured to prompt the lesion area of the currently displayed medical image to be identified.

區別於前述實施例，通過提示當前顯示的待識別醫學圖像的病灶區域，能夠提升醫生閱片體驗。Different from the foregoing embodiments, by prompting the lesion area of the currently displayed medical image to be identified, the doctor's reading experience can be improved.

在一些實施例中，第二融合處理子模組具體配置為執行以下任一者：將多個待識別醫學圖像的內容特徵表示進行拼接處理；將多個待識別醫學圖像的內容特徵表示進行相加處理；其中，最終內容特徵表示和多個待識別醫學圖像的內容特徵表示的維度相同。In some embodiments, the second fusion processing sub-module is specifically configured to perform any one of the following: perform splicing processing on the content feature representations of a plurality of to-be-recognized medical images; Addition processing is performed; wherein, the final content feature representation and the content feature representations of multiple medical images to be identified have the same dimension.

區別於前述實施例，通過將多個待識別醫學圖像的內容特徵表示進行拼接處理，或者將多個待識別醫學圖像的內容特徵表示進行相加處理中的任一者，得到最終內容特徵表示，且最終內容特徵表示和多個待識別醫學圖像的內容特徵表示的維度相同，能夠通過多種方式得到最終內容特徵表示，從而能夠提高圖像識別的魯棒性。Different from the foregoing embodiments, the final content feature is obtained by splicing the content feature representations of a plurality of medical images to be recognized, or performing an addition process on the content feature representations of a plurality of medical images to be recognized. and the final content feature representation has the same dimension as the content feature representation of multiple medical images to be recognized, the final content feature representation can be obtained in various ways, thereby improving the robustness of image recognition.

在一些實施例中，風格提取模組42具體配置為利用識別網路的風格編碼子網路分別提取每一待識別醫學圖像的風格特徵表示；分類處理模組43具體配置為利用識別網路的分類處理子網路對多個待識別醫學圖像的風格特徵表示進行分類處理，得到每一待識別醫學圖像的掃描圖像類別；內容提取模組具體配置為利用識別網路的內容編碼子網路分別提取每一待識別醫學圖像的內容特徵表示；病灶識別模組具體配置為利用識別網路的區域分割子網路對多個待識別醫學圖像的內容特徵表示進行病灶識別，得到每一待識別醫學圖像中的病灶區域。In some embodiments, the style extraction module 42 is specifically configured to use the style coding sub-network of the recognition network to extract the style feature representation of each medical image to be recognized; the classification processing module 43 is specifically configured to use the recognition network The classification processing sub-network performs classification processing on the style feature representation of a plurality of medical images to be recognized, and obtains the scanned image category of each medical image to be recognized; the content extraction module is specifically configured to use the content coding of the recognition network. The sub-network separately extracts the content feature representation of each medical image to be identified; the lesion identification module is specifically configured to use the region segmentation sub-network of the identification network to perform lesion identification on the content feature representation of a plurality of medical images to be identified, Obtain the lesion area in each medical image to be identified.

區別於前述實施例，利用識別網路的風格編碼子網路分別提取每一待識別醫學圖像的風格特徵表示，利用識別網路的分類處理子網路對多個待識別醫學圖像的風格特徵表示進行分類處理，得到每一待識別醫學圖像的掃描圖像類別，利用識別網路的內容編碼子網路分別提取每一待識別醫學圖像的內容特徵表示，利用識別網路的區域分割子網路對多個待識別醫學圖像的內容特徵表示進行病灶識別，得到每一待識別醫學圖像中的病灶區域，能夠利用識別網路執行風格特徵表示的提取、分類處理、內容特徵表示的提取以及病灶識別等任務，故能夠有利於提高圖像識別的效率。Different from the foregoing embodiments, the style feature representation of each medical image to be recognized is extracted by the style coding sub-network of the recognition network, and the style of the medical images to be recognized is analyzed by the classification processing sub-network of the recognition network. The feature representation is classified to obtain the scanned image category of each medical image to be recognized, and the content feature representation of each medical image to be recognized is extracted by using the content coding sub-network of the recognition network. The segmentation sub-network performs lesion identification on the content feature representation of multiple medical images to be identified, and obtains the lesion area in each medical image to be identified. The identification network can be used to perform extraction of style feature representation, classification processing, and content feature representation. It can help to improve the efficiency of image recognition because of tasks such as representation extraction and lesion recognition.

在一些實施例中，圖像識別裝置40還包括樣本獲取模組，配置為獲取多個樣本醫學圖像，其中，多個樣本醫學圖像標注有其真實掃描圖像類別和真實病灶區域；圖像識別裝置40還包括特徵提取模組，配置為利用風格編碼子網路分別提取每一樣本醫學圖像的樣本風格特徵表示，並利用內容編碼子網路分別提取每一樣本醫學圖像的樣本內容特徵表示；圖像識別裝置40還包括識別處理模組，配置為利用分類處理子網路對多個樣本醫學圖像的樣本風格特徵表示進行分類處理，得到每一樣本醫學圖像的預測掃描圖像類別，並利用區域分割子網路對多個樣本醫學圖像的樣本內容特徵表示進行病灶識別，得到每一樣本醫學圖像中的預測病灶區域；圖像識別裝置40還包括第一調整模組，配置為利用真實掃描類別和預測掃描類別的差異，調整風格編碼子網路和分類處理子網路的網路參數，以及利用真實病灶區域和預測病灶區域的差異，調整內容編碼子網路和區域分割子網路的網路參數。In some embodiments, the image recognition device 40 further includes a sample acquisition module configured to acquire multiple sample medical images, wherein the multiple sample medical images are marked with their real scanned image categories and real lesion areas; Fig. The image recognition device 40 also includes a feature extraction module configured to extract the sample style feature representation of each sample medical image by using the style coding sub-network, and extract the sample of each sample medical image by using the content coding sub-network. Content feature representation; the image recognition device 40 further includes a recognition processing module, configured to use a classification processing sub-network to classify and process the sample style feature representations of a plurality of sample medical images to obtain a predictive scan of each sample medical image image category, and use the region segmentation sub-network to perform lesion identification on the sample content feature representation of multiple sample medical images to obtain the predicted lesion area in each sample medical image; the image recognition device 40 also includes a first adjustment A module configured to adjust the network parameters of the style coding subnet and the classification processing subnet using the difference between the actual scan category and the predicted scan category, and to adjust the content coding subnet using the difference between the actual lesion area and the predicted lesion area Network parameters for road and area split subnets.

在一些實施例中，圖像識別裝置40還包括分佈獲取模組，配置為獲取每一樣本醫學圖像的樣本風格特徵表示的樣本資料分佈情況；圖像識別裝置40還包括第二調整模組，配置為利用樣本資料分佈情況之間的差異，調整風格編碼子網路的網路參數。In some embodiments, the image recognition apparatus 40 further includes a distribution acquisition module configured to acquire the distribution of sample data represented by the sample style features of each sample medical image; the image recognition apparatus 40 further includes a second adjustment module , which is configured to adjust the network parameters of the style coding sub-network by taking advantage of the differences between the distributions of the sample data.

區別於前述實施例，在訓練過程中，同時獲取樣本資料分佈情況，並利用樣本資料分佈情況之間的差異，調整風格編碼子網路的網路參數，故能夠有利於使後續提取到風格特徵表示之間相互獨立，從而能夠有利於提高識別到的掃描圖像類別的準確性。Different from the previous embodiment, in the training process, the distribution of sample data is obtained at the same time, and the difference between the distribution of sample data is used to adjust the network parameters of the style coding sub-network, so it can be beneficial to the subsequent extraction of style features. The representations are independent of each other, which can help improve the accuracy of the scanned image categories identified.

在一些實施例中，圖像識別裝置40還包括圖像重建模組，配置為利用一樣本風格特徵表示和一內容特徵表示，構建得到與樣本風格特徵表示對應的重建圖像；圖像識別裝置40還包括第三調整模組，配置為利用重建圖像與對應的樣本風格特徵表示所屬的樣本醫學圖像之間的差異，調整風格編碼子網路和內容編碼子網路的網路參數。In some embodiments, the image recognition apparatus 40 further includes an image reconstruction module configured to use a sample style feature representation and a content feature representation to construct a reconstructed image corresponding to the sample style feature representation; the image recognition apparatus 40 also includes a third adjustment module configured to adjust the network parameters of the style coding sub-network and the content coding sub-network by using the difference between the reconstructed image and the corresponding sample medical image to which the sample style feature representation belongs.

區別於前述實施例，在訓練過程中，同時利用一樣本風格特徵表示和一內容特徵表示，構建得到與樣本風格特徵表示對應的重建圖像，並利用重建圖像與對應的樣本風格特徵表示所屬的樣本醫學圖像之間的差異，調整風格編碼子網路和內容編碼子網路的網路參數，從而能夠使風格編碼子網路盡可能地提取到完整準確的風格特徵，而內容編碼子網路盡可能地提取到完整準確的風格特徵，進而能夠有利於提高後續掃描圖像類別以及病灶識別的準確性。Different from the foregoing embodiments, in the training process, a sample style feature representation and a content feature representation are used to construct a reconstructed image corresponding to the sample style feature representation, and the reconstructed image and the corresponding sample style feature are used to represent the The difference between the sample medical images, adjust the network parameters of the style coding subnet and the content coding subnet, so that the style coding subnet can extract as complete and accurate style features as possible, while the content coding subnet can extract the complete and accurate style features as much as possible. The network extracts complete and accurate style features as much as possible, which can help to improve the classification of subsequent scanned images and the accuracy of lesion identification.

在一些實施例中，風格編碼子網路包括：順序連接的下採樣層和全域池化層；和/或，內容編碼子網路包括以下任一者：順序連接的下採樣層和殘差塊、順序連接的卷積層和池化層。In some embodiments, the style encoding sub-network includes: sequentially connected downsampling layers and global pooling layers; and/or, the content encoding sub-network includes any of the following: sequentially connected downsampling layers and residual blocks , sequentially connected convolutional and pooling layers.

區別於前述實施例，通過將風格編碼子網路設置為包括順序連接的下採樣層和全域池化層，能夠有利於在簡化網路結構的同時便於網路訓練；通過將內容編碼子網路設置為包括以下任一者：順序連接的下採樣層和殘差塊、順序連接的卷積層和池化層，能夠有利於在簡化網路結構的同時便於網路訓練。Different from the foregoing embodiments, by setting the style encoding sub-network to include a sequentially connected downsampling layer and a global pooling layer, it can facilitate network training while simplifying the network structure; Setting to include any of the following: sequentially connected downsampling layers and residual blocks, sequentially connected convolutional layers and pooling layers, can facilitate network training while simplifying the network structure.

請參閱圖5，圖5是本發明電子設備50一實施例的方塊示意圖。電子設備50包括相互耦接的記憶體51和處理器52，處理器52配置為執行記憶體51中儲存的程式指令，以實現上述任一圖像識別方法實施例的步驟。在一個實施場景中，電子設備50可以包括但不限於：微型電腦、伺服器，此外，電子設備50還可以包括筆記型電腦、平板電腦等移動設備，在此不做限定。Please refer to FIG. 5 , which is a block diagram illustrating an embodiment of an electronic device 50 of the present invention. The electronic device 50 includes a memory 51 and a processor 52 coupled to each other, and the processor 52 is configured to execute program instructions stored in the memory 51 to implement the steps of any of the above image recognition method embodiments. In an implementation scenario, the electronic device 50 may include, but is not limited to, a microcomputer and a server. In addition, the electronic device 50 may also include mobile devices such as a notebook computer and a tablet computer, which are not limited herein.

具體而言，處理器52用於控制其自身以及記憶體51以實現上述任一圖像識別方法實施例的步驟。處理器52還可以稱為中央處理單元（Central Processing Unit，CPU）。處理器52可能是一種積體電路晶片，具有信號的處理能力。處理器52還可以是通用處理器、數位訊號處理器（Digital Signal Processor, DSP）、專用積體電路（Application Specific Integrated Circuit, ASIC）、現場可程式設計閘陣列（Field-Programmable Gate Array, FPGA）或者其他可程式設計邏輯器件、分立門或者電晶體邏輯器件、分立硬體元件。通用處理器可以是微處理器或者該處理器也可以是任何常規的處理器等。另外，處理器52可以由積體電路晶片共同實現。Specifically, the processor 52 is configured to control itself and the memory 51 to implement the steps of any of the image recognition method embodiments described above. The processor 52 may also be referred to as a central processing unit (Central Processing Unit, CPU). The processor 52 may be an integrated circuit chip with signal processing capabilities. The processor 52 may also be a general-purpose processor, a digital signal processor (DSP), an application specific integrated circuit (ASIC), or a field-programmable gate array (FPGA) Or other programmable logic devices, discrete gate or transistor logic devices, discrete hardware components. A general purpose processor may be a microprocessor or the processor may be any conventional processor or the like. Additionally, the processor 52 may be commonly implemented by an integrated circuit die.

上述方案，能夠提高圖像識別的效率和準確性。The above solution can improve the efficiency and accuracy of image recognition.

請參閱圖6，圖6為本發明電腦可讀儲存介質60一實施例的方塊示意圖。電腦可讀儲存介質60儲存有能夠被處理器運行的程式指令601，程式指令601用於實現上述任一圖像識別方法。Please refer to FIG. 6 , which is a block diagram illustrating an embodiment of a computer-readable storage medium 60 according to the present invention. The computer-readable storage medium 60 stores program instructions 601 that can be executed by the processor, and the program instructions 601 are used to implement any of the above-mentioned image recognition methods.

相應地，本發明實施例還提供了一種電腦程式，包括電腦可讀代碼，當所述電腦可讀代碼在電子設備中運行時，所述電子設備中的處理器執行用於實現上述任意一種圖像識別方法。Correspondingly, an embodiment of the present invention also provides a computer program, including computer-readable code, when the computer-readable code is executed in an electronic device, a processor in the electronic device executes any one of the above diagrams. like identification methods.

在本發明所提供的幾個實施例中，應該理解到，所揭露的方法和裝置，可以通過其它的方式實現。例如，以上所描述的裝置實施方式僅僅是示意性的，例如，模組或單元的劃分，僅僅為一種邏輯功能劃分，實際實現時可以有另外的劃分方式，例如單元或元件可以結合或者可以集成到另一個系統，或一些特徵可以忽略，或不執行。另一點，所顯示或討論的相互之間的耦合或直接耦合或通信連接可以是通過一些介面，裝置或單元的間接耦合或通信連接，可以是電性、機械或其它的形式。In the several embodiments provided by the present invention, it should be understood that the disclosed method and apparatus may be implemented in other manners. For example, the device implementations described above are only illustrative. For example, the division of modules or units is only a logical function division. In actual implementation, there may be other divisions. For example, units or elements may be combined or integrated. to another system, or some features can be ignored, or not implemented. On the other hand, the shown or discussed mutual coupling or direct coupling or communication connection may be through some interfaces, indirect coupling or communication connection of devices or units, and may be in electrical, mechanical or other forms.

作為分離部件說明的單元可以是或者也可以不是物理上分開的，作為單元顯示的部件可以是或者也可以不是物理單元，即可以位於一個地方，或者也可以分佈到網路單元上。可以根據實際的需要選擇其中的部分或者全部單元來實現本實施方式方案的目的。Units described as separate components may or may not be physically separated, and components shown as units may or may not be physical units, that is, may be located in one place, or may be distributed over network units. Some or all of the units may be selected according to actual needs to achieve the purpose of the solution in this implementation manner.

另外，在本發明各個實施例中的各功能單元可以集成在一個處理單元中，也可以是各個單元單獨物理存在，也可以兩個或兩個以上單元集成在一個單元中。上述集成的單元既可以採用硬體的形式實現，也可以採用軟體功能單元的形式實現。In addition, each functional unit in each embodiment of the present invention may be integrated into one processing unit, or each unit may exist physically alone, or two or more units may be integrated into one unit. The above-mentioned integrated units can be implemented in the form of hardware, or can be implemented in the form of software functional units.

集成的單元如果以軟體功能單元的形式實現並作為獨立的產品銷售或使用時，可以儲存在一個電腦可讀取儲存介質中。基於這樣的理解，本發明的技術方案本質上或者說對現有技術做出貢獻的部分或者該技術方案的全部或部分可以以軟體產品的形式體現出來，該電腦軟體產品儲存在一個儲存介質中，包括若干指令用以使得一台電腦設備（可以是個人電腦，伺服器，或者網路設備等）或處理器（processor）執行本發明各個實施方式方法的全部或部分步驟。而前述的儲存介質包括：U盤、移動硬碟、唯讀記憶體（ROM，Read-Only Memory）、隨機存取記憶體（RAM，Random Access Memory）、磁碟或者光碟等各種可以儲存程式碼的介質。The integrated unit, if implemented in the form of a software functional unit and sold or used as a stand-alone product, may be stored in a computer-readable storage medium. Based on this understanding, the technical solution of the present invention is essentially or the part that contributes to the prior art or the whole or part of the technical solution can be embodied in the form of a software product, and the computer software product is stored in a storage medium, Several instructions are included to cause a computer device (which may be a personal computer, a server, or a network device, etc.) or a processor to execute all or part of the steps of the methods of the various embodiments of the present invention. The aforementioned storage medium includes: U disk, removable hard disk, Read-Only Memory (ROM, Read-Only Memory), Random Access Memory (RAM, Random Access Memory), magnetic disk or CD, etc. medium.

工業實用性本發明實施例公開了一種圖像識別方法、電子設備、電腦可讀儲存介質，圖像識別方法包括：獲取多個待識別醫學圖像；分別提取每一所述待識別醫學圖像的風格特徵表示；對所述多個待識別醫學圖像的風格特徵表示進行分類處理，得到每一所述待識別醫學圖像的掃描圖像類別。上述方案，能夠提高圖像識別的效率和準確性。 Industrial Applicability The embodiment of the present invention discloses an image recognition method, an electronic device, and a computer-readable storage medium. The image recognition method includes: acquiring a plurality of medical images to be recognized; extracting a style feature of each medical image to be recognized separately Representation; classifying and processing the style feature representations of the plurality of medical images to be recognized, to obtain a scanned image category of each of the medical images to be recognized. The above solution can improve the efficiency and accuracy of image recognition.

40:圖像識別裝置 41:圖像獲取模組 42:風格提取模組 43:分類處理模組 50:電子設備 51:記憶體 52:處理器 60:電腦可讀儲存介質 601:程式指令 S11~S13,S21~S24:步驟 40: Image recognition device 41: Image acquisition module 42: Style Extraction Module 43: Classification processing module 50: Electronics 51: Memory 52: Processor 60: Computer-readable storage medium 601: Program command S11~S13, S21~S24: Steps

圖1是本發明圖像識別方法一實施例的流程示意圖；圖2是訓練識別網路一實施例的流程示意圖；圖3是訓練識別網路一實施例的狀態示意圖；圖4是本發明圖像識別裝置一實施例的方塊示意圖；圖5是本發明電子設備一實施例的方塊示意圖；圖6是本發明電腦可讀儲存介質一實施例的方塊示意圖。 1 is a schematic flowchart of an embodiment of an image recognition method of the present invention; 2 is a schematic flow chart of an embodiment of a training identification network; 3 is a schematic state diagram of an embodiment of a training recognition network; 4 is a schematic block diagram of an embodiment of an image recognition apparatus of the present invention; 5 is a schematic block diagram of an embodiment of an electronic device of the present invention; FIG. 6 is a schematic block diagram of an embodiment of a computer-readable storage medium of the present invention.

S11~S13:步驟 S11~S13: Steps

Claims

An image recognition method, comprising: Obtain multiple medical images to be recognized; Extracting the style feature representation of each of the medical images to be identified, respectively; The style feature representation of the plurality of medical images to be recognized is classified and processed to obtain a scanned image category of each of the medical images to be recognized.

The image recognition method according to claim 1, wherein, by performing classification processing on the style feature representations of the plurality of medical images to be recognized, obtaining the scanned image category of each of the medical images to be recognized includes the following steps: : Perform a first fusion process on the style feature representations of the plurality of medical images to be recognized to obtain a final style feature representation; The final style feature representation is classified to obtain a scanned image category of each of the medical images to be identified.

The image recognition method according to claim 1 or 2, wherein the style feature representation of the plurality of medical images to be recognized is classified to obtain a scanned image of each of the medical images to be recognized After the classification, the image recognition method further includes at least one of the following: sorting the plurality of medical images to be identified according to their scanned image categories; Displaying at least one of the medical images to be identified, sorted according to the scanned image categories, on the same screen; If the scanned image categories of the to-be-recognized medical images are duplicated, output first warning information to prompt the scanning personnel; If there is no preset scanned image category in the scanned image categories of the plurality of medical images to be identified, outputting second warning information to prompt the scanning personnel; If the classification confidence of the scanned image category of the to-be-recognized medical image is less than the preset confidence threshold, output third warning information to prompt the scanning personnel.

The image recognition method according to claim 1 or 2, wherein, before extracting the style feature representation of each of the medical images to be recognized, the method further comprises: Preprocessing is performed on each of the medical images to be recognized, wherein the preprocessing includes at least one of the following: adjusting the image size of the medical images to be recognized to a preset size; The image intensity of the image is normalized to a preset range.

The image recognition method according to claim 1 or 2, wherein the method further comprises: Respectively extract the content feature representation of each of the medical images to be identified; Perform lesion identification on the content feature representations of the plurality of medical images to be identified, to obtain a lesion area in each of the medical images to be identified.

The image recognition method according to claim 5, wherein the performing focus recognition on the content feature representations of the plurality of medical images to be recognized, and obtaining the focus area in each of the medical images to be recognized includes: performing a second fusion process on the content feature representations of the plurality of to-be-identified medical images to obtain a final content feature representation; Perform lesion identification on the final content feature representation to obtain a lesion area in each of the to-be-identified medical images; and/or, The method also includes: Prompt the currently displayed lesion area of the medical image to be identified.

The image recognition method according to claim 6, wherein performing a second fusion process on the content feature representations of the plurality of medical images to be recognized includes any one of the following: Perform splicing processing on the content feature representations of the plurality of medical images to be identified; performing addition processing on the content feature representations of the plurality of medical images to be identified; The dimensions of the final content feature representation and the content feature representations of the multiple medical images to be recognized are the same.

The image recognition method according to claim 5, wherein the separately extracting the style feature representation of each of the medical images to be recognized comprises: Extract the style feature representation of each medical image to be identified by using the style coding sub-network of the identification network; The classification processing is performed on the style feature representations of the plurality of medical images to be recognized, and the scanned image category of each medical image to be recognized is obtained, including: Use the classification processing sub-network of the recognition network to classify and process the style feature representations of the plurality of medical images to be recognized, to obtain a scanned image category of each of the medical images to be recognized; The separately extracting the content feature representation of each of the medical images to be identified includes: Extract the content feature representation of each of the medical images to be identified by using the content coding sub-network of the identification network; Performing focus identification on the content feature representations of the plurality of medical images to be identified, to obtain a focus area in each of the medical images to be identified, including: Use the region segmentation sub-network of the identification network to perform lesion identification on the content feature representation of the plurality of medical images to be identified, to obtain a lesion area in each of the medical images to be identified.

The image recognition method according to claim 8, wherein, before extracting the style feature representation of each of the medical images to be recognized, the image recognition method further comprises: acquiring a plurality of sample medical images, wherein the plurality of sample medical images are marked with their real scanned image categories and real lesion areas; Using the style coding sub-network to extract the sample style feature representation of each of the sample medical images respectively, and using the content coding sub-network to separately extract the sample content feature representation of each of the sample medical images; Use the classification processing sub-network to classify and process the sample style feature representations of the multiple sample medical images to obtain the predicted scanned image category of each of the sample medical images, and use the region segmentation sub-network performing lesion identification on the sample content feature representation of the plurality of sample medical images to obtain a predicted lesion area in each of the sample medical images; Using the difference between the real scanned image category and the predicted scanned image category, adjust the network parameters of the style encoding sub-network and the classification processing sub-network, and use the real lesion area and the The difference of the lesion area is predicted, and the network parameters of the content coding sub-network and the area segmentation sub-network are adjusted.

The image recognition method according to claim 9, wherein the image recognition method further comprises: Obtaining the distribution of sample data represented by the sample style feature of each of the sample medical images; Using the difference between the distributions of the sample data, the network parameters of the style encoding sub-network are adjusted.

The image recognition method according to claim 9, wherein the image recognition method further comprises: Using one of the sample style feature representations and one of the content feature representations, constructing a reconstructed image corresponding to the sample style feature representations; The network parameters of the style encoding sub-network and the content encoding sub-network are adjusted by using the difference between the reconstructed image and the sample medical image to which the corresponding sample style feature representation belongs.

The image recognition method according to claim 8, wherein the style encoding sub-network comprises: a sequentially connected downsampling layer and a global pooling layer; and/or, The content encoding sub-network includes any of the following: sequentially connected downsampling layers and residual blocks, sequentially connected convolutional layers and pooling layers.

An electronic device includes a mutually coupled memory and a processor, wherein the processor is configured to execute program instructions stored in the memory, so as to implement the image recognition method according to any one of claim 1 to 12.

A computer-readable storage medium on which program instructions are stored, and when the program instructions are executed by a processor, implement the image recognition method described in any one of claim 1 to 12.