TWI789932B - Imaging detection method and device for colorectal polyps and system thereof - Google Patents

Imaging detection method and device for colorectal polyps and system thereof Download PDF

Info

Publication number
TWI789932B
TWI789932B TW110136684A TW110136684A TWI789932B TW I789932 B TWI789932 B TW I789932B TW 110136684 A TW110136684 A TW 110136684A TW 110136684 A TW110136684 A TW 110136684A TW I789932 B TWI789932 B TW I789932B
Authority
TW
Taiwan
Prior art keywords
image
location block
inference
switch
information
Prior art date
Application number
TW110136684A
Other languages
Chinese (zh)
Other versions
TW202316445A (en
Inventor
洪志聖
李嘉龍
葉肇元
李青怡
陳星豪
Original Assignee
國泰醫療財團法人國泰綜合醫院
雲象科技股份有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 國泰醫療財團法人國泰綜合醫院, 雲象科技股份有限公司 filed Critical 國泰醫療財團法人國泰綜合醫院
Priority to TW110136684A priority Critical patent/TWI789932B/en
Application granted granted Critical
Publication of TWI789932B publication Critical patent/TWI789932B/en
Publication of TW202316445A publication Critical patent/TW202316445A/en

Links

Images

Landscapes

  • Endoscopes (AREA)
  • Accessory Devices And Overall Control Thereof (AREA)
  • Measurement Of Velocity Or Position Using Acoustic Or Ultrasonic Waves (AREA)
  • Air Bags (AREA)

Abstract

The invention provides an imaging detection method and device for colorectal polyps and system thereof. The imaging detection device for colorectal polyps inputs an image received from an endoscopic device to an inference module to obtain a plurality of location block information and their corresponding probability information, and marks a location block on the image based on the filtered location block information. At the same time, the control pedal can be used to control the switch of the inference module. This convenience imaging detection system for colorectal polyps helps doctors reduce the rate of misdiagnosis.

Description

大腸瘜肉影像偵測方法、裝置及其系統Colorectal polyp image detection method, device and system

本發明關於一種大腸瘜肉影像偵測方法、裝置及其系統,特別關於應用卷積深度學習模型之一種大腸瘜肉影像偵測方法、裝置及其系統。The present invention relates to a colorectal polyp image detection method, device and system, in particular to a colorectal polyp image detection method, device and system using a convolution deep learning model.

在大腸鏡臨床實務上,會透過大腸鏡來檢查大腸是否有病灶。檢查開始時,大腸鏡會沿著大腸的走向深入到迴盲瓣,在抽出大腸鏡時,醫師會仔細檢查腸壁,若有可疑病灶會依據瘜肉大小以及癌化的程度,進行大腸鏡切除或是開刀切除,並送至病理科進行更詳細的診斷,有效的偵測並切除瘜肉可以降低大腸癌的風險。In the clinical practice of colonoscopy, the colonoscope is used to check whether there are lesions in the large intestine. At the beginning of the examination, the colonoscope will go deep into the ileocecal valve along the direction of the large intestine. When the colonoscope is pulled out, the doctor will carefully check the intestinal wall. If there are suspicious lesions, the colonoscope will be removed according to the size of the polyp and the degree of cancerization. Alternatively, it can be surgically removed and sent to the pathology department for a more detailed diagnosis. Effective detection and removal of polyps can reduce the risk of colorectal cancer.

目前瘜肉之偵測乃由腸胃內科醫師以大腸內視鏡進行人眼判讀。傳統內視鏡使用氙氣(Xenon)做為光源,讓內視鏡醫師可以見到有如白色日光光源的影像。然而,腸道瘜肉在初長出來時,不但小且扁平,甚至顏色與周圍正常組織一模一樣。就如同像一顆綠豆掉到草地上,很難發現,也讓內視鏡初學者容易在檢查過程中錯過瘜肉的存在。研究顯示,在大腸鏡檢查中有17%~28%的瘜肉會被遺漏,這也會提高大腸癌未被檢測到的風險 (Kim, N. H., Jung, Y. S., Jeong, W. S., Yang, H. J., Park, S. K., Choi, K., & Park, D. I. (2017). Miss rate of colorectal neoplastic polyps and risk factors for missed polyps in consecutive colonoscopies. Intestinal research, 15(3), 411.)。At present, the detection of polyps is performed by human eyes by gastroenterologists using a colon endoscope. Traditional endoscopes use xenon gas (Xenon) as a light source, allowing endoscopists to see images similar to white daylight light sources. However, intestinal polyps are not only small and flat when they first grow out, but even have the same color as the surrounding normal tissue. Just like a mung bean falling on the grass, it is difficult to find, and it also makes it easy for beginners in endoscopy to miss the existence of polyps during the inspection process. Studies have shown that 17%-28% of polyps are missed during colonoscopy, which also increases the risk of undetected colorectal cancer (Kim, N. H., Jung, Y. S., Jeong, W. S., Yang, H. J., Park, S. K., Choi, K., & Park, D. I. (2017). Miss rate of colorectal neoplastic polyps and risk factors for missed polyps in consecutive colonoscopies. Intestinal research, 15(3), 411.).

因此,醫學界亟需能夠方便攜帶使用的即時大腸瘜肉人工智慧影像偵測系統,可以彈性搭配各種廠牌之大腸鏡裝置,自動標示出病灶區域,協助醫師或技師發現瘜肉的存在,並發出提醒的警示聲,減少誤診率。Therefore, the medical field urgently needs a real-time colorectal polyp artificial intelligence image detection system that can be easily carried and used. It can be flexibly matched with various brands of colonoscopy devices, automatically mark the lesion area, and assist physicians or technicians to detect the existence of polyps. Send out a warning sound to reduce the misdiagnosis rate.

本發明提供一種大腸瘜肉影像偵測系統,包含:一影像偵測裝置,係搭配一內視鏡裝置及一顯示裝置使用,其包含:一影像讀取模組,其係用以讀取接收自該內視鏡裝置之一影像;一推論模組,其在一推論開關為開時,降低該影像之解析度後將該影像輸入一推論模型以得到複數個位置區塊資訊及各自相對應之機率資訊,該推論模型採用監督式卷積深度學習模型,其係使用一Resnet50分類模型搭配類別活化映射的方法標記之瘜肉影像進行訓練;以及一顯示模組,將該機率資訊大於一過濾閥值之相對應之該位置區塊資訊進行一重複偵測篩選,根據篩選出之位置區塊資訊標示一位置區塊於該影像上;以及一推論開關踏板,係與該影像偵測裝置連接,其發出一第一控制訊號以切換該推論開關。The present invention provides an image detection system for large intestine polyps, which includes: an image detection device used in conjunction with an endoscope device and a display device, which includes: an image reading module, which is used to read and receive An image from the endoscope device; an inference module, which reduces the resolution of the image when an inference switch is turned on, and then inputs the image into an inference model to obtain a plurality of location block information and their respective correspondences Probability information, the inference model uses a supervised convolutional deep learning model, which is trained using a Resnet50 classification model with polyp images marked by the method of category activation mapping; and a display module, the probability information is greater than a filter The location block information corresponding to the threshold is subjected to a repeated detection screening, and a location block is marked on the image according to the filtered location block information; and a deduction switch pedal is connected with the image detection device , which sends a first control signal to switch the inference switch.

本發明又提供一種大腸瘜肉影像偵測裝置,係搭配一內視鏡裝置及一顯示裝置使用,其包含:一影像讀取模組,其係用以讀取接收自該內視鏡裝置之一影像;一推論模組,其在一推論開關為開時,降低該影像之解析度後將該影像輸入一推論模型以得到複數個位置區塊資訊及各自相對應之機率資訊,該推論模型採用監督式卷積深度學習模型,其係使用一Resnet50分類模型搭配類別活化映射的方法標記之瘜肉影像進行訓練;以及一顯示模組,將該機率資訊大於一過濾閥值之相對應之該位置區塊資訊進行一重複偵測篩選,根據篩選出之位置區塊資訊標示一位置區塊於該影像上。The present invention also provides a large intestine polyp image detection device, which is used with an endoscope device and a display device, which includes: an image reading module, which is used to read the image received from the endoscope device An image; an inference module, which reduces the resolution of the image and then inputs the image into an inference model to obtain a plurality of location block information and their corresponding probability information when an inference switch is turned on, the inference model A supervised convolutional deep learning model is used, which is trained by using a Resnet50 classification model with class activation mapping method to mark polyp images; and a display module, which corresponds to the probability information greater than a filtering threshold The location block information is subjected to a repeated detection filter, and a location block is marked on the image according to the filtered location block information.

本發明還提供一種大腸瘜肉影像偵測方法,包含:讀取接收自一內視鏡裝置之一影像;在一推論開關為開時,降低該影像之解析度後將該影像輸入一推論模型以得到複數個位置區塊資訊及各自相對應之機率資訊,該推論模型採用監督式卷積深度學習模型,其係使用一Resnet50分類模型搭配類別活化映射的方法標記之瘜肉影像進行訓練;以及將該機率資訊大於一過濾閥值之相對應之該位置區塊資訊進行一重複偵測篩選,根據篩選出之位置區塊資訊標示一位置區塊於該影像上。The present invention also provides an image detection method for large intestine polyps, comprising: reading an image received from an endoscope device; when an inference switch is turned on, reducing the resolution of the image and then inputting the image into an inference model In order to obtain a plurality of location block information and their corresponding probability information, the inference model adopts a supervised convolutional deep learning model, which is trained using a Resnet50 classification model and a class activation mapping method to mark polyp images; and The location block information corresponding to the probability information greater than a filtering threshold is subjected to a repetitive detection screening, and a location block is marked on the image according to the filtered location block information.

於某些具體實施例中,進一步包含:一提示音開關踏板,係與該影像偵測裝置連接,其發出一第二控制訊號以切換一提示音開關;以及一提示音量踏板,係與該影像偵測裝置連接,其發出一第三控制訊號以切換一提示音音量;其中該影像偵測裝置進一步包含一通知模組,在該提示音開關為開時,當該顯示模組標示該位置區塊時,觸發一提示音裝置根據該提示音音量發出提示音。In some specific embodiments, it further includes: a tone switch pedal, connected to the image detection device, which sends a second control signal to switch a tone switch; and a tone volume pedal, connected to the image The detection device is connected, and it sends a third control signal to switch the volume of a prompt sound; wherein the image detection device further includes a notification module, when the prompt sound switch is on, when the display module marks the location area block, a prompting sound device is triggered to emit a prompting sound according to the volume of the prompting sound.

於某些具體實施例中,該推論模型係執行於一圖形處理器中,該推論模型採用一Yolov4架構,並運作在一TensorRT框架上。In some embodiments, the inference model is executed on a GPU, the inference model adopts a Yolov4 architecture, and runs on a TensorRT framework.

於某些具體實施例中,該影像之解析度降至672 x 352。In some embodiments, the resolution of the image is reduced to 672 x 352.

於某些具體實施例中,該TensorRT框架採用16浮點數精度。In some embodiments, the TensorRT framework uses 16 floating point precision.

於某些具體實施例中,該過濾閥值為0.15,且該重複偵測篩選係將大於該過濾閥值之機率資訊依大小排序,依排序陸續計算每兩個該些機率資訊對應之該些位置區塊資訊之間之一重疊資訊,當該重疊資訊超過一定值時,將兩個該些位置區塊資訊中相對應之機率資訊較小的該位置區塊資訊刪除。In some specific embodiments, the filtering threshold value is 0.15, and the repetition detection screening is to sort the probability information greater than the filtering threshold value in order of size, and successively calculate the corresponding numbers of every two pieces of the probability information according to the sorting. One piece of overlapping information among the location block information, when the overlapping information exceeds a certain value, the location block information corresponding to the smaller probability information among the two location block information is deleted.

於某些具體實施例中,影像讀取模組係讀取一列緩存空間擷取卡自該內視鏡裝置接收之該影像。In some embodiments, the image reading module reads the image received from the endoscope device by a buffer space capture card.

於某些具體實施例中,該影像偵測裝置進一步包含一輸入偵測模組,用以持續確認該第一控制訊號、該第二控制訊號、或該第三控制訊號。In some embodiments, the image detection device further includes an input detection module for continuously confirming the first control signal, the second control signal, or the third control signal.

於某些具體實施例中,該影像偵測裝置進一步包含一初始模組,用以設定該提示音裝置、設定一踏板訊息管道及確定一程序檔存在。In some embodiments, the image detection device further includes an initial module for setting the prompt sound device, setting a pedal message channel and determining the existence of a program file.

本發明所提供之大腸瘜肉影像偵測系統不但方便攜帶移動,可以彈性搭配各種廠牌之大腸鏡裝置,其中利用人工智慧開發之大腸瘜肉影像偵測方法可以即時標示出病灶區域,協助醫師或技師發現瘜肉的存在,並發出提醒的警示聲,減少誤診率。同時,在醫師雙手忙於大腸鏡之掃描時,隨時簡單可以用腳踏板調整是否要應用人工智慧協助偵測瘜肉,或是調整是否要開啟提示音及調整提示音的音量。The colorectal polyp image detection system provided by the present invention is not only convenient to carry and move, but also can be flexibly matched with various brands of colonoscope devices. Among them, the colorectal polyp image detection method developed by artificial intelligence can immediately mark the lesion area and assist doctors Or the technician discovers the existence of polyps and sends out a warning sound to reduce the rate of misdiagnosis. At the same time, when the doctor's hands are busy with the colonoscopy scan, he can easily use the foot pedal to adjust whether to use artificial intelligence to assist in the detection of polyps at any time, or to adjust whether to turn on the prompt sound and adjust the volume of the prompt sound.

有關於本發明其他技術內容、特點與功效,在以下配合參考圖式之較佳實施例的詳細說明中,將可清楚的呈現。Other technical contents, features and effects of the present invention will be clearly presented in the following detailed description of preferred embodiments with reference to the drawings.

除非另有定義,本文使用的所有技術和科學術語具有與本發明所屬領域中的技術人員所通常理解相同的含義。Unless otherwise defined, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this invention belongs.

本發明說明書之內容所揭示用於描述結構組合關係的「設置」之用語,泛指多個結構在組合後不會輕易的分離或掉落,可以是固定連接,也可以是可拆式的連接,亦可以是一體成型地連接;可以是機械連接,也可以是電連接;可以是直接的物理相連,亦也可以通過中間媒介間接相連,可以是兩個件內部的連通,例如:使用螺紋、卡榫、扣具、釘子、黏著劑或高週波任一方式結合者。The term "installation" disclosed in the description of the present invention is used to describe the combination of structures. It generally refers to multiple structures that cannot be easily separated or dropped after being combined. They can be fixed connections or detachable connections. , It can also be integrally connected; it can be mechanically connected or electrically connected; it can be directly physically connected or indirectly connected through an intermediary, and it can be internal communication between two parts, for example: using threads, Tenon, buckle, nail, adhesive or any combination of high frequency.

本發明說明書之內容所揭示用於描述結構組合關係的「連接」、「連結」或「電性連接」之用語,係指使用電線、電路板、連接線、網路線、轉接裝置、轉換裝置、藍芽或無線網路任一方式進行通電或網路通訊的結合。The terms "connection", "connection" or "electrical connection" disclosed in the description of the present invention are used to describe the structural combination relationship, which refers to the use of electric wires, circuit boards, connecting wires, network cables, switching devices, and conversion devices. , bluetooth or wireless network for power or network communication.

如本文所用,冠詞「一」、「一個」以及「任何」是指一個或多於一個(即至少一個)的物品的文法物品。例如,「一個元件」意指一個元件或多於一個元件。As used herein, the articles "a", "an" and "any" refer to one or more than one (ie, at least one) of the grammatical items of the item. For example, "an element" means one element or more than one element.

本文所使用的「約」、「大約」或「近乎」一詞實質上代表所述之數值或數值範圍為基準在20%以內,較佳為於10%以內,以及更佳者為於5%以內浮動。於文中所提供之數字化的量為近似值,意旨若術語「約」、「大約」或「近乎」沒有被使用時亦可被推得。The term "about", "approximately" or "approximately" as used herein essentially means that the stated value or range of values is within 20%, preferably within 10%, and more preferably within 5% float within. Numerical quantities provided herein are approximations and are intended to be inferred if the terms "about", "approximately" or "approximately" are not used.

如圖1所示,本發明實施例之一種大腸瘜肉影像偵測系統1包含一影像偵測裝置10以及一推論開關踏板21,影像偵測裝置10包含:一影像讀取模組14、一推論模組15以及一顯示模組16。影像偵測裝置10可以進一步包含一通知模組17,除此之外,還可進一步包含一初始模組11、一輸入偵測模組12以及一執行模組13。大腸瘜肉影像偵測系統1可以進一步包含多個控制踏板,例如推論開關踏板21與一提示音開關踏板22及一提示音量踏板33形成一控制踏板組20。As shown in Figure 1, a large intestine polyp image detection system 1 according to an embodiment of the present invention includes an image detection device 10 and an inference switch pedal 21, and the image detection device 10 includes: an image reading module 14, a Inference module 15 and a display module 16 . The image detection device 10 may further include a notification module 17 , besides, it may further include an initialization module 11 , an input detection module 12 and an execution module 13 . The large intestine polyp image detection system 1 may further include a plurality of control pedals, for example, the inference switch pedal 21 forms a control pedal group 20 with a prompt tone switch pedal 22 and a prompt volume pedal 33 .

影像偵測裝置10係採用一種大腸瘜肉影像偵測方法偵測大腸鏡影像中之瘜肉特徵。大腸瘜肉影像偵測方法包含:讀取接收自該內視鏡裝置40之一影像;在一推論開關為開時,降低該影像之解析度後將該影像輸入一推論模型以得到複數個位置區塊資訊及各自相對應之機率資訊,該推論模型採用監督式卷積深度學習模型,其係使用一Resnet50分類模型搭配類別活化映射的方法標記之瘜肉影像進行訓練;以及將該機率資訊大於一過濾閥值之相對應之該位置區塊資訊進行一重複偵測篩選,根據篩選出之位置區塊資訊標示一位置區塊於該影像上。The image detection device 10 adopts a colorectal polyp image detection method to detect the polyp feature in the colonoscope image. The image detection method for large intestine polyps includes: reading an image received from the endoscope device 40; when an inference switch is turned on, reducing the resolution of the image and then inputting the image into an inference model to obtain a plurality of positions Block information and their corresponding probability information, the inference model adopts a supervised convolutional deep learning model, which uses a Resnet50 classification model with a class activation mapping method to mark polyp images for training; and the probability information is greater than The location block information corresponding to a filtering threshold is subjected to a repeated detection filter, and a location block is marked on the image according to the filtered location block information.

一般檢測大腸的內視鏡裝置40影像約為30幀/秒(frame per second, fps)。為了能在大腸鏡影像上即時標註大腸瘜肉的位置,影像偵測裝置10可以為一運算主機,其具有包含中央處理器及圖形處理器之主機板,例如採用Nvidia的邊緣運算設備Jetson Xavier,可以讓硬體體積降到最小,同時運算效率亦能滿足上述影像處理要求,再配合下述的模型運算及系統軟硬體整合,可以達到每張影像處理速度33ms以內的表現。硬體規格示例如表1。 1 硬體規格表 硬體 規格 圖形處理器 512核心 Volta GPU 配備 Tensor Cores 中央處理器 8核心 ARM v8.2 64-bit CPU, 8MB L2+4MB L3 記憶體 32GB 256-bit LPDDR4x | 137GB/s 儲存裝置 32GB eMMC 5.1 深度學習加速器 [2x] NVIDIA Engines 視覺加速器 7-way VLIW Vision Processor 編碼器/解碼器 [2x] 4Kp60 | HEVC/[2x] 4Kp60 | 12-Bit Support Generally, the image of the endoscope device 40 for detecting the large intestine is about 30 frames per second (frame per second, fps). In order to mark the position of the large intestine polyp on the colonoscopy image in real time, the image detection device 10 can be a computing host, which has a main board including a central processing unit and a graphics processing unit, such as using Nvidia's edge computing device Jetson Xavier, It can minimize the size of the hardware, and at the same time, the computing efficiency can also meet the above image processing requirements. Combined with the following model computing and system hardware and software integration, it can achieve the performance of each image processing speed within 33ms. Examples of hardware specifications are shown in Table 1. Table 1 Hardware specification table Hardware Specification graphics processor 512-core Volta GPU with Tensor Cores CPU 8-core ARM v8.2 64-bit CPU, 8MB L2+4MB L3 Memory 32GB 256-bit LPDDR4x | 137GB/s storage device 32GB eMMC 5.1 Deep Learning Accelerator [2x] NVIDIA Engines visual accelerator 7-way VLIW Vision Processor encoder/decoder [2x] 4Kp60 | HEVC/[2x] 4Kp60 | 12-Bit Support

請同時參考圖2,影像偵測裝置10可以設計成單手可攜的輕便裝置,例如尺寸約為118.2mm x 135mm x 93.1mm。為了方便連接上其他裝置,外觀設置有USB、HDMI、SDI等連接埠,使影像偵測裝置10可以搭配不同廠牌的內視鏡裝置40、顯示裝置30使用,也可以接上控制踏板來控制影像偵測功能。例如影像偵測裝置10透過USB連接埠與控制踏板組20連接,透過HDMI連接埠與一顯示裝置30連接,以及透過一影像擷取卡之SDI連接埠與一內視鏡裝置40連接。Please also refer to FIG. 2 , the image detection device 10 can be designed as a portable device that can be carried by one hand, for example, the size is about 118.2mm x 135mm x 93.1mm. In order to facilitate the connection to other devices, the appearance is equipped with USB, HDMI, SDI and other connection ports, so that the image detection device 10 can be used with different brands of endoscope devices 40 and display devices 30, and can also be connected to control pedals to control Image detection function. For example, the image detection device 10 is connected to the control pedal set 20 through a USB connection port, connected to a display device 30 through an HDMI connection port, and connected to an endoscope device 40 through an SDI connection port of an image capture card.

舉例來說,影像偵測裝置10可以經由USB連接埠與推論開關踏板21連接,以控制人工智慧大腸瘜肉影像偵測功能。推論開關踏板21被觸發時能發出一第一控制訊號,第一控制訊號可以輪流切換推論開關。提示音開關踏板22與提示音量踏板23亦可以經由USB連接埠與影像偵測裝置10連接,提示音開關踏板22可觸發出一第二控制訊號以切換提示音開關。提示音量踏板23可觸發出一第三控制訊號以切換該提示音音量(例如:大/中/小)。For example, the image detection device 10 can be connected to the inference switch pedal 21 via a USB port to control the artificial intelligence colorectal polyp image detection function. When the inference switch pedal 21 is triggered, a first control signal can be sent, and the first control signal can switch the inference switch in turn. The prompt sound switch pedal 22 and the prompt volume pedal 23 can also be connected to the image detection device 10 via the USB port, and the prompt sound switch pedal 22 can trigger a second control signal to switch the prompt sound switch. The prompt volume pedal 23 can trigger a third control signal to switch the volume of the prompt sound (for example: large/medium/small).

如圖3所示,影像偵測裝置10可以進一步包含初始模組11,初始程式的程序並檢查設定檔是否存在(步驟S10),若設定檔存在則接著設定提示音裝置(步驟S11)、設定踏板訊息管道(步驟S12)以供接收第一控制訊號至第三控制訊號、確認程序檔存在(步驟S13),步驟S13即確認檔案完整性,其中程序檔包含執行檔及設定檔,若程序檔存在則啟動系統全功率模式(步驟S14),使系統運轉最高時脈速度。As shown in Figure 3, the image detection device 10 can further include an initial module 11, the program of the initial program checks whether the configuration file exists (step S10), if the configuration file exists, then set the prompting sound device (step S11), set The pedal message pipeline (step S12) is used to receive the first control signal to the third control signal, and confirm the existence of the program file (step S13). If it exists, start the full power mode of the system (step S14 ), so that the system runs at the highest clock speed.

影像偵測裝置10可以進一步包含輸入偵測模組12。踏板訊息管道設定完成後,輸入偵測模組12會持續確認踏板輸入(步驟S15),若有來自踏板被觸發而輸出的第一控制訊號、第二控制訊號或第三控制訊號就傳送給程式。每次的踏板控制訊號都是切換一個選項,若是關於開關的控制踏板,每次觸發就是在開與關之間交互切換,若是關於控制三個選項的控制踏板(例如三種不同大小的音量),每次觸發就是在三個選項間依次切換。The image detection device 10 may further include an input detection module 12 . After the setting of the pedal signal channel is completed, the input detection module 12 will continue to confirm the pedal input (step S15), and if there is a first control signal, a second control signal or a third control signal output from the pedal being triggered, it will be sent to the program . Each pedal control signal is to switch an option. If it is a control pedal of a switch, each trigger is an interactive switch between on and off. If it is a control pedal that controls three options (such as three different volumes), Each trigger is a sequential switch between the three options.

影像偵測裝置10可以進一步包含一執行模組13,其會初始化影像讀取模組14、推論模組15及顯示模組16,並將訓練好的大腸瘜肉影像偵測推論模型載入圖形處理器。The image detection device 10 may further include an execution module 13, which will initialize the image reading module 14, the inference module 15 and the display module 16, and load the trained image detection inference model of large intestine polyps into the graphics processor.

如圖4所示,影像偵測裝置10之影像擷取卡會從內視鏡裝置40持續擷取掃描影像,大約是每秒30張。為了減少延遲,達到能匹配大腸鏡裝置傳來之影像速度,本發明實施例之系統選擇列緩存空間(line buffer)的擷取卡,整個影像資訊在擷取卡內處理單位為一列,相較於常見的幀緩存空間(frame buffer)擷取卡,可以避免因需要等整張影像擷取完成才開始後續處理的延遲(大約 33ms)。As shown in FIG. 4 , the image capture card of the image detection device 10 will continuously capture scanning images from the endoscope device 40 at approximately 30 images per second. In order to reduce the delay and achieve a speed that can match the image speed transmitted from the colonoscope device, the system of the embodiment of the present invention selects a capture card with a line buffer space (line buffer). The entire image information is processed in a single line in the capture card. In the common frame buffer space (frame buffer) capture card, it can avoid the delay (about 33ms) due to the need to wait for the entire image capture to complete before starting the subsequent processing.

影像讀取模組14係用以讀取接收自該內視鏡裝置40之一影像,示例影像解析度大約為1920x1080 (影像長寬比為16:9)。影像讀取模組14會持續等待輸入影像檔(步驟S20),接著確認推論開/關(步驟S21)。若推論開關之狀態為開,則傳送影像檔至推論模組15(步驟S22)進行後續推論處理。若推論開關之狀態為關,則直接在影像前景加入系統資訊(步驟S23),再輸出影像(步驟S24)。系統資訊可以為商標或系統名稱或圖樣、系統版本資訊、推論開關標示、提示音開關標示或音量大小標示。The image reading module 14 is used to read an image received from the endoscope device 40 , and the example image resolution is about 1920×1080 (image aspect ratio is 16:9). The image reading module 14 will continue to wait for the input image file (step S20), and then confirm the inference on/off (step S21). If the state of the inference switch is on, the image file is sent to the inference module 15 (step S22) for subsequent inference processing. If the state of the inference switch is off, directly add system information to the foreground of the image (step S23), and then output the image (step S24). The system information can be a trademark or system name or pattern, system version information, inference switch mark, prompt tone switch mark or volume mark.

如圖5所示,在推論開關為開時,推論模組15接收到影像讀取模組14傳來的影像後,先降低影像解析度(步驟S30),為了能滿足本發明實施例之模型及軟體推論的時間必須小於33ms之目標,搭配本發明實施例之軟硬體整合,至少需將解析度降至704 x 384或704x 352,最佳降低至672 x 352,可以達到最適合的推論時間及推論準確度。除此之外,亦可將每個像素的RGB值進行標準化,也就是將每個數值除以255。之後,輸入影像至一推論模型(步驟S31)。As shown in Figure 5, when the inference switch is turned on, after the inference module 15 receives the image sent by the image reading module 14, it first reduces the image resolution (step S30), in order to satisfy the model of the embodiment of the present invention And the goal of software inference time must be less than 33ms. With the integration of software and hardware in the embodiment of the present invention, at least the resolution needs to be reduced to 704 x 384 or 704x 352, and it is best to reduce it to 672 x 352 to achieve the most suitable inference Time and inference accuracy. In addition, the RGB value of each pixel can also be normalized, that is, each value is divided by 255. Afterwards, the image is input into an inference model (step S31).

TensorRT是 Nvidia推出的一個高性能的深度學習推理框架,Yolov4架構經過TensoRT的優化可使推論更加快速,其係藉由簡化捲積層來提高運算效率,維持正確率且降低GPU記憶體使用率。因此訓練完成的 Yolov4推論模型會轉換成TensorRT的形式,並且將浮點數32降至浮點數(floating point, FP)16的精度,進而部署至硬體中。上述軟硬體整合的架構同時降低浮點數,可以在準確率損失(accuracy loss)相對小的情況下達到最快速的推論能力。TensorRT is a high-performance deep learning inference framework launched by Nvidia. The Yolov4 architecture is optimized by TensoRT to make inferences faster. It improves computing efficiency by simplifying convolutional layers, maintains accuracy and reduces GPU memory usage. Therefore, the trained Yolov4 inference model will be converted into the form of TensorRT, and the floating point number 32 will be reduced to the precision of floating point number (floating point, FP) 16, and then deployed to the hardware. The above-mentioned software-hardware integrated architecture reduces floating-point numbers at the same time, and can achieve the fastest inference ability with a relatively small accuracy loss.

推論模型係以約40萬張之大腸鏡影像訓練而得,採用監督式深度學習。訓練影像來源可以有DICOM格式的大腸鏡影像,或是將mp4格式之大腸鏡影片每秒一張轉成的大腸鏡影像。每張影像皆在有瘜肉的地方標註出瘜肉位置,瘜肉影像搭配瘜肉位置的資訊輸入推論模型進行訓練,透過多次迭代及參數優化,使其損失(Loss)下降至區域最小值,得到一個經訓練過的成熟推論模型。The inference model is trained with approximately 400,000 colonoscopy images, using supervised deep learning. The source of the training image can be a colonoscopy image in DICOM format, or a colonoscopy image converted from a colonoscopy video in mp4 format one per second. Each image is marked with the position of the polyp, and the polyp image is combined with the information of the polyp position to input the inference model for training. Through multiple iterations and parameter optimization, the loss (Loss) is reduced to the minimum value in the area. , to get a trained mature inference model.

接著先解釋瘜肉影像搭配瘜肉位置的資訊要如何取得,標註瘜肉的方法有兩種,一種是由腸胃內科專業訓練的醫師,對照病理報告結果在有瘜肉的地方標註出瘜肉位置。另外一種替代方案為僅標註出是否有瘜肉,若是影像中有瘜肉出現,醫師會將此影像標記為有瘜肉之影像(但不需圈選出瘜肉位置),影像分為有瘜肉及沒瘜肉後,會進入一Resnet50分類模型,Resnet50分類模型含有49層捲積層,1層平均池化層及1層全連接層,總共降維32次,模型每層中會學習不同的特徵,最後輸出每張影像有瘜肉和沒瘜肉的分類機率值。這樣的替代方案可以大幅減少開發的時間。Next, explain how to obtain polyp images and polyp location information. There are two ways to mark polyps. One is to use a doctor trained in gastroenterology to mark the location of polyps according to the results of the pathology report. . Another alternative is to only mark whether there are polyps. If there are polyps in the image, the doctor will mark this image as an image with polyps (but there is no need to circle the location of the polyps). The images are divided into polyps After the polyp is removed, it will enter a Resnet50 classification model. The Resnet50 classification model contains 49 layers of convolutional layers, 1 layer of average pooling layer and 1 layer of fully connected layer, with a total of 32 times of dimensionality reduction. Each layer of the model will learn different features. , and finally output the classification probability value of each image with polyp and without polyp. Such alternatives can drastically reduce development time.

如圖6A所示,其為大腸鏡的原始影像。經過分類模型後,分類機率值輸出後,可以利用模型視覺化類別活化映射(Class Activation Mapping, CAM)的方法找出分類模型將圖片中的哪個範圍判斷為瘜肉,也就是將最後一個卷積層之後使用全域平均池化(Global Average Pooling, GAP),再依不同輸出節點,將所有GAP前的特徵圖乘上一組GAP後的全連結層,當作對於不同特徵圖乘上不同的比重,並將其所有輸出疊合在一起,即可得到類似熱點圖(Heatmap)的CAM結果圖。As shown in FIG. 6A , it is the original image of the colonoscope. After the classification model, after the classification probability value is output, you can use the method of model visualization Class Activation Mapping (CAM) to find out which range in the picture is judged as a polyp by the classification model, that is, the last convolutional layer Then use the global average pooling (Global Average Pooling, GAP), and then according to different output nodes, multiply all the feature maps before GAP by a set of fully connected layers after GAP, as multiplying different proportions for different feature maps, And superimpose all its outputs together, you can get a CAM result map similar to a heat map (Heatmap).

如圖6B所示,大腸鏡影像偵測熱點圖之顏色代表該顏色所處位置區塊之大腸瘜肉的機率值。之後,如圖6C所示,將熱點圖中機率值大於 0.5、0.6或0.7之區域,更佳為機率大於等於0.7之區域設為1,其他區域設為0,藉此可以使用邊緣檢測演算法(opencv之find_contour)找到瘜肉周圍的輪廓,若是輪廓內的面積小於原圖的 1/400 則移除此輪廓,若是大於等於1/400 則取出輪廓上的左下及右上位置(Xmin, Ymin, Xmax, Ymax)形成一矩形框,將偵測到的大腸瘜肉區域B框起來,以輸出一大腸鏡掃描偵測結果,此結果可代替專家標註瘜肉位置影像以節省訓練時間。As shown in FIG. 6B , the color of the heat map detected by the colonoscopy image represents the probability value of the colorectal polyps in the block where the color is located. Afterwards, as shown in Figure 6C, set the area in the heat map with a probability value greater than 0.5, 0.6 or 0.7, more preferably, the area with a probability greater than or equal to 0.7 is set to 1, and other areas are set to 0, so that the edge detection algorithm can be used (find_contour of opencv) Find the contour around the polyp. If the area inside the contour is less than 1/400 of the original image, remove the contour. If it is greater than or equal to 1/400, take out the lower left and upper right positions on the contour (Xmin, Ymin, Xmax, Ymax) form a rectangular frame to frame the detected colorectal polyp area B to output the detection result of the colonoscopy scan. This result can replace the position image of the polyp marked by an expert to save training time.

在將上述影像及大腸瘜肉區域資訊進入使用Yolov4物件偵測架構之推論模型訓練之前,先將上述處理好的複數個影像及其中所標註的複數大腸瘜肉區域進行K-means群集演算法分析,以求出九組錨箱(anchor box)資訊,作為後續Yolov4物件偵測架構針對大腸瘜肉影像所使用的錨箱參數,九組資訊是較適合的計算數量。每個標註的大腸瘜肉區域至少包含區塊面積(H, W)資訊,先隨機取出九組大腸瘜肉區域作為群心,之後將所有影像的大腸瘜肉區域與個別群心進行重疊資訊(Intersection over Union, IoU)計算。IoU=(大腸瘜肉區域與個別群心的交集區域/大腸瘜肉區域與個別群心的聯集區域)。將大腸瘜肉區域判定給重疊資訊大的群心,新的群組可以依照每個群組中群體之平均數值計算出新的群心,反覆再依照重疊資訊分出新的群組,並計算新的群組中群體之平均數值,直到群心不再變動,最終的九個群心就是九組錨箱(anchor box)資訊。Before putting the above image and colorectal polyp area information into the inference model training using the Yolov4 object detection framework, the above-mentioned processed multiple images and the marked multiple colorectal polyp area are analyzed by K-means clustering algorithm , to obtain nine sets of anchor box (anchor box) information, as the anchor box parameters used by the subsequent Yolov4 object detection framework for polyp images of the large intestine, nine sets of information are more suitable calculation quantities. Each marked large intestine polyp region contains at least block area (H, W) information. First, nine groups of large intestine polyp regions are randomly selected as group centers, and then the large intestine polyp regions of all images are overlapped with individual group centers ( Intersection over Union, IoU) calculation. IoU=(intersection area of large intestine polyp region and individual cluster centers/union area of large intestine polyp area and individual cluster centers). Determine the large intestine polyp area to the group center with large overlapping information, the new group can calculate the new group center according to the average value of the group in each group, and repeatedly divide the new group according to the overlapping information, and calculate The average value of the group in the new group, until the group hearts do not change, the final nine group hearts are nine sets of anchor box (anchor box) information.

推論模型為一卷積深度學習模型(Convolutional Neural Network, CNN),此模型是由多層的卷積層堆疊而成,採用Yolov4物件偵測架構,搭配使用Darknet53骨架(backbone)。Darknet53係由52層捲積層堆疊而成,最後省略全連接層,整個卷積過程中降維(down-sampling)32次。經過頸部(neck)處理,使用3層最大池化層,經過不同的升維次數(最多2次),在頭部(head)分別產生出三個不同尺寸的網格(grid)特徵圖,分別為降維8次﹑降維16次以及降維32次的特徵圖。降維8次的特徵圖對於小物體偵測較有利,降維32次的特徵圖對於大物體偵測較有利。其中降維 8 次的特徵圖會用最小的三組錨箱並在每個網格的中心點進行回歸預測,降維 16 次的特徵圖會用中間大小的三組錨箱在每個網格的中心點進行回歸預測,降維 32 次的特徵圖會用最大的三組錨箱在每個網格的中心點進行回歸預測。The inference model is a convolutional deep learning model (Convolutional Neural Network, CNN). This model is formed by stacking multiple convolutional layers. It adopts the Yolov4 object detection architecture and uses the Darknet53 backbone (backbone). Darknet53 is composed of 52 convolutional layers stacked, and the fully connected layer is omitted at the end, and the dimensionality reduction (down-sampling) is performed 32 times during the entire convolution process. After neck (neck) processing, using 3 layers of maximum pooling layers, after different times of dimensionality enhancement (up to 2 times), three grid (grid) feature maps of different sizes are generated on the head (head), They are the feature maps of 8 times of dimensionality reduction, 16 times of dimensionality reduction and 32 times of dimensionality reduction. The feature map with dimensionality reduction 8 times is more beneficial for the detection of small objects, and the feature map with dimensionality reduction 32 times is more beneficial for the detection of large objects. Among them, the feature map with dimensionality reduction 8 times will use the smallest three sets of anchor boxes and perform regression prediction at the center point of each grid, and the feature map with dimensionality reduction 16 times will use three sets of anchor boxes of the middle size in each grid. The center point of each grid is used for regression prediction, and the feature map with dimensionality reduction 32 times will use the largest three groups of anchor boxes to perform regression prediction at the center point of each grid.

每張即時大腸影像輸入經訓練過的成熟推論模型後,可以經過上述計算得到複數個位置區塊資訊(x, y, w, h)及各自相對應之機率資訊(p),機率資訊即是偵測到的物件為瘜肉的機率。由此可知,最後每張影像至少會有(網格數*3) *3種網格組資訊,每組資訊至少有五項資料(x, y, w, h, p)。最後,推論模組15輸出影像檔以及輸出複數個位置區塊資訊及各自相對應之機率資訊(步驟S32)至顯示模組16。After each real-time colorectal image is input into the trained and mature inference model, multiple location block information (x, y, w, h) and their corresponding probability information (p) can be obtained through the above calculation. The probability information is Chance that the detected object is a polyp. It can be seen that each image will have at least (number of grids*3)*3 grid group information, and each group of information will have at least five items of data (x, y, w, h, p). Finally, the inference module 15 outputs the image file and a plurality of location block information and their corresponding probability information (step S32 ) to the display module 16 .

如圖7所示,顯示模組16可以先等待垂直消隱期間(步驟S41)再傳入影像檔,以避免不同的影像同時出現在螢幕的上下半部造成誤診。根據從推論模組15傳送來的複數個位置區塊資訊及各自相對應之機率資訊判斷有無檢測到瘜肉(步驟S42)。若無檢測到瘜肉,則在影像前景加入系統資訊(步驟S43)後輸出影像(步驟S44)。As shown in FIG. 7 , the display module 16 can wait for the vertical blanking period (step S41 ) before uploading the image file, so as to avoid misdiagnosis caused by different images appearing on the upper and lower half of the screen at the same time. It is judged whether a polyp is detected or not according to the plurality of location block information transmitted from the inference module 15 and the corresponding probability information (step S42). If no polyp is detected, the system information is added to the foreground of the image (step S43) and then the image is output (step S44).

若有檢測到瘜肉,將該機率資訊大於一過濾閥值之相對應之該位置區塊資訊進行一重複偵測篩選(步驟S45)。為了找到最適合的框,例如可以設定過濾閥值為0.15,所有機率資訊大於0.15之相對應的位置區塊資訊都進行重複偵測篩選。每個位置區塊資訊包含了位置中心位置(X, Y)及區塊面積(H, W)資訊,利用位置區塊資訊即可計算出每兩個位置區塊間的重疊資訊(IoU)。IoU=(兩個位置區塊間的交集區域/兩個位置區塊間的聯集區域)。If a polyp is detected, the location block information corresponding to the probability information greater than a filtering threshold is subjected to a repeated detection screening (step S45 ). In order to find the most suitable frame, for example, a filter threshold value of 0.15 can be set, and all location block information corresponding to probability information greater than 0.15 will be filtered for duplicate detection. Each location block information includes location center position (X, Y) and block area (H, W) information, and the overlap information (IoU) between every two location blocks can be calculated by using the location block information. IoU=(intersection area between two location blocks/union area between two location blocks).

先將機率資訊大於0.15之機率資訊由大排到小,先將最大機率資訊相對應的位置區塊開始分別依序與其他機率資訊相對應的位置區塊計算重疊資訊。若兩個位置區塊之重疊資訊超過0.5時,即將兩個位置區塊中,機率資訊較小之機率資訊與其相對應的位置區塊資訊刪除。再繼續將留下的機率資訊中,第二大機率資訊相對應的位置區塊開始分別依序與其他機率資訊相對應的位置區塊計算重疊資訊,並依上述條件刪除,再繼續將留下的機率資訊中第三大機率資訊進行相同操作,以此類推直到所有剩下的位置區塊都與其他位置區塊比較完畢。最後剩下的位置區塊資訊即是篩選出來的位置區塊資訊,根據篩選出之位置區塊資訊標示其對應之一位置區塊於原始影像上(步驟S46),在影像前景加入系統資訊(步驟S47),最後輸出影像及偵測到之瘜肉位置。The probability information whose probability information is greater than 0.15 is first sorted from large to small, and the location block corresponding to the maximum probability information is firstly calculated to overlap with the location blocks corresponding to other probability information respectively. If the overlapping information of the two location blocks exceeds 0.5, among the two location blocks, the probability information with the smaller probability information and the corresponding location block information are deleted. Then continue to calculate the overlapping information of the position block corresponding to the second largest probability information in the remaining probability information, and delete it according to the above conditions, and then continue to leave The same operation is performed for the third largest probability information among the probability information, and so on until all remaining location blocks are compared with other location blocks. The last remaining location block information is the filtered location block information. According to the filtered location block information, one of the corresponding location blocks is marked on the original image (step S46), and the system information ( Step S47), finally output the image and the detected polyp position.

影像偵測裝置10可以進一步包含通知模組17,通知模組17可將其訊號傳至與主機板連接之一提示音裝置,例如喇叭。若有檢測到瘜肉,確認提示音開關開啟與否(步驟S48),在該提示音開關為開時,確認三音階提示音音量(步驟S49),提示音裝置則根據提示音量狀態發出設定之提示音(步驟S50)。當該顯示模組標示該位置區塊時,觸發一提示音裝置根據該提示音音量發出提示音。The image detection device 10 can further include a notification module 17, and the notification module 17 can transmit its signal to a prompt sound device connected to the motherboard, such as a speaker. If a polyp is detected, confirm whether the prompt tone switch is on or not (step S48), and when the prompt tone switch is on, confirm the volume of the three-tone scale prompt tone (step S49), and the prompt tone device sends the setting according to the prompt volume state. Prompt sound (step S50). When the display module marks the location block, a prompting sound device is triggered to emit a prompting sound according to the volume of the prompting sound.

如圖8,在進行大腸鏡檢查時,藉由本發明實施例之大腸瘜肉影像偵測方法、裝置及其系統,可以即時協助醫師偵測到瘜肉的存在,在大腸鏡影像中標示出來瘜肉位置並發出提示音告知醫師。醫師在雙手進行大腸鏡掃描時,在醫師想要自行確認大腸鏡影像或想要暫時觀察大腸鏡原始影像時,可以利用腳踏板關閉大腸瘜肉自動偵測,在需要時再開啟偵測,亦可以控制提示音的開關及音量。As shown in Figure 8, during a colonoscopy examination, the colorectal polyp image detection method, device and system of the embodiment of the present invention can assist the physician to detect the presence of polyp in real time, and mark the polyp in the colonoscopy image The location of the meat and a beep sound to inform the doctor. When the doctor scans the colonoscope with both hands, when the doctor wants to confirm the colonoscope image or temporarily observe the original image of the colonoscope, he can use the foot pedal to turn off the automatic detection of large bowel polyps, and then turn on the detection when needed , you can also control the switch and volume of the prompt sound.

如圖9所示,開啟推論開關時,醫師可以在顯示裝置30中看到大腸鏡影像A,人工智慧偵測到的大腸瘜肉區域B,關閉推論開關時,醫師就只能看到大腸鏡影像A,需自行判讀大腸瘜肉位置。另外,可以隨時透過圖形使用者介面了解推論開關狀態C1、提示音開關狀態C2或提示音量狀態C3。As shown in Figure 9, when the inference switch is turned on, the doctor can see the colonoscope image A on the display device 30, and the large intestine polyp region B detected by artificial intelligence; when the inference switch is turned off, the doctor can only see the colonoscope In image A, you need to judge the position of the large intestine polyp by yourself. In addition, the inference switch state C1 , the prompt sound switch state C2 or the prompt volume state C3 can be known at any time through the graphical user interface.

本發明已透過上述之實施例及各項示例揭露如上,僅是發明部分較佳的實施例選擇,然其並非用以限定本發明,任何熟悉此一技術領域具有通常知識者,在瞭解本發明前述的技術特徵及實施例,並在不脫離本發明之精神和範圍內所做的均等變化或潤飾,仍屬本發明涵蓋之範圍,而本發明之專利保護範圍須視本說明書所附之請求項所界定者為準。The present invention has been disclosed through the above-mentioned embodiments and various examples. The above are only some preferred embodiments of the invention, but they are not used to limit the present invention. The aforementioned technical features and embodiments, and the equivalent changes or modifications made within the spirit and scope of the present invention, still belong to the scope covered by the present invention, and the scope of patent protection of the present invention depends on the requirements attached to this specification. The one defined in the item shall prevail.

1:大腸瘜肉影像偵測系統 10:影像偵測裝置 11:初始模組 12:輸入偵測模組 13:執行模組 14:影像讀取模組 15:推論模組 16:顯示模組 17:通知模組 20:控制踏板組 21:推論開關踏板 22:提示音開關踏板 23:提示音量踏板 30:顯示裝置 40:內視鏡裝置 A:大腸鏡影像 B:大腸瘜肉區域 C1:推論開關狀態 C2:提示音開關狀態 C3:提示音量狀態 1: Colorectal polyps image detection system 10: Image detection device 11: Initial module 12: Input detection module 13: Execute the module 14: Image reading module 15: Deduction module 16: Display module 17:Notification module 20:Control Pedal Set 21: Inference switch pedal 22: Prompt sound switch pedal 23: Cue Volume Pedal 30: Display device 40: Endoscope device A: Colonoscopy image B: Large intestine polyp area C1: Deduce switch state C2: Prompt sound switch status C3: Prompt volume status

圖1為本發明實施例之大腸瘜肉影像偵測系統方塊圖。FIG. 1 is a block diagram of a colorectal polyp image detection system according to an embodiment of the present invention.

圖2為本發明實施例之影像偵測裝置外型示意圖。FIG. 2 is a schematic diagram of an image detection device according to an embodiment of the present invention.

圖3為本發明實施例之初始模組流程圖。FIG. 3 is a flow chart of the initial module of the embodiment of the present invention.

圖4為本發明實施例之影像讀取模組流程圖。FIG. 4 is a flowchart of an image reading module according to an embodiment of the present invention.

圖5為本發明實施例之推論模組流程圖。FIG. 5 is a flowchart of the inference module of the embodiment of the present invention.

圖6A、6B、6C分別依序為本發明實施例之大腸鏡影像、大腸鏡影像偵測熱點圖、大腸鏡掃描偵測結果圖。6A, 6B, and 6C respectively show the colonoscope image, the colonoscope image detection heat map, and the colonoscope scan detection result map according to the embodiment of the present invention.

圖7為本發明實施例之顯示模組及通知模組流程圖。FIG. 7 is a flowchart of a display module and a notification module according to an embodiment of the present invention.

圖8為本發明實施例之大腸瘜肉檢測情境示意圖。Fig. 8 is a schematic diagram of the detection situation of large intestine polyp according to the embodiment of the present invention.

圖9為本發明實施例之大腸瘜肉檢測顯示示意圖。Fig. 9 is a schematic diagram showing the detection of large intestine polyp according to the embodiment of the present invention.

1:大腸瘜肉影像偵測系統 1: Colorectal polyps image detection system

10:影像偵測裝置 10: Image detection device

11:初始模組 11: Initial module

12:輸入偵測模組 12: Input detection module

13:執行模組 13: Execute the module

14:影像讀取模組 14: Image reading module

15:推論模組 15: Deduction module

16:顯示模組 16: Display module

17:通知模組 17:Notification module

20:控制踏板組 20:Control Pedal Set

21:推論開關踏板 21: Inference switch pedal

22:提示音開關踏板 22: Prompt sound switch pedal

23:提示音量踏板 23: Cue Volume Pedal

30:顯示裝置 30: Display device

40:內視鏡裝置 40: Endoscope device

Claims (10)

一種大腸瘜肉影像偵測系統,包含:一影像偵測裝置,係搭配一內視鏡裝置及一顯示裝置使用,其包含:一影像讀取模組,其係用以讀取接收自該內視鏡裝置之一影像;一推論模組,其在一推論開關為開時,降低該影像之解析度至672 x 352後將該影像輸入一推論模型以得到複數個位置區塊資訊及各自相對應之機率資訊,該推論模型採用監督式卷積深度學習模型,其係使用一Resnet50分類模型搭配類別活化映射的方法標記之瘜肉影像進行訓練,該推論模型係執行於一圖形處理器中,該推論模型採用一Yolov4架構,並運作在一TensorRT框架上,且該TensorRT框架採用16浮點數精度;以及一顯示模組,將該機率資訊大於一過濾閥值之相對應之該位置區塊資訊進行一重複偵測篩選,根據篩選出之位置區塊資訊標示一位置區塊於該影像上;以及一推論開關踏板,係與該影像偵測裝置連接,其發出一第一控制訊號以切換該推論開關。 An image detection system for large intestinal polyps, comprising: an image detection device, used with an endoscope device and a display device, which includes: an image reading module, which is used to read the An image of the viewing mirror device; an inference module, which reduces the resolution of the image to 672 x 352 when an inference switch is turned on, and then inputs the image into an inference model to obtain a plurality of location block information and their respective relative Corresponding to the probability information, the inference model adopts a supervised convolutional deep learning model, which is trained by using a Resnet50 classification model with polyp images marked by the method of class activation mapping, and the inference model is executed in a graphics processor. The inference model adopts a Yolov4 architecture, and operates on a TensorRT framework, and the TensorRT framework adopts 16 floating-point precision; and a display module, the location block corresponding to the probability information greater than a filter threshold The information is subjected to a repeated detection screening, and a location block is marked on the image according to the filtered location block information; and a deduction switch pedal is connected with the image detection device, which sends a first control signal to switch The inference switch. 如請求項1所述之偵測系統,其進一步包含:一提示音開關踏板,係與該影像偵測裝置連接,其發出一第二控制訊號以切換一提示音開關;以及一提示音量踏板,係與該影像偵測裝置連接,其發出一第三控制訊號以切換一提示音音量; 其中該影像偵測裝置進一步包含一通知模組,在該提示音開關為開時,當該顯示模組標示該位置區塊時,觸發一提示音裝置根據該提示音音量發出提示音。 The detection system as described in claim 1, further comprising: a beep sound switch pedal connected to the image detection device, which sends a second control signal to switch a beep sound switch; and a beep volume pedal, is connected with the image detection device, which sends out a third control signal to switch the volume of a prompt sound; Wherein the image detection device further includes a notification module, when the notification sound switch is on, when the display module marks the location block, a notification sound device is triggered to emit a notification sound according to the volume of the notification sound. 如請求項1所述之偵測系統,其中該過濾閥值為0.15,且該重複偵測篩選係將大於該過濾閥值之機率資訊依大小排序,依排序陸續計算每兩個該些機率資訊對應之每兩個該些位置區塊資訊之間之一重疊資訊,當該重疊資訊超過一定值時,將兩個該些位置區塊資訊中相對應之機率資訊較小的該位置區塊資訊刪除。 The detection system as described in claim 1, wherein the filter threshold is 0.15, and the repeated detection filter is to sort the probability information greater than the filter threshold in order of size, and calculate every two of the probability information successively according to the sorting Corresponding to one of the overlapping information between each two location block information, when the overlapping information exceeds a certain value, the location block information corresponding to the smaller probability information among the two location block information delete. 如請求項1所述之偵測系統,其中該影像讀取模組係讀取一列緩存空間擷取卡自該內視鏡裝置接收之該影像。 The detection system as described in claim 1, wherein the image reading module reads the image received from the endoscope device by a cache space capture card. 如請求項2所述之偵測系統,其中該影像偵測裝置進一步包含一輸入偵測模組,用以持續確認該第一控制訊號、該第二控制訊號、或該第三控制訊號。 The detection system as described in claim 2, wherein the image detection device further includes an input detection module for continuously confirming the first control signal, the second control signal, or the third control signal. 如請求項2所述之偵測系統,其中該影像偵測裝置進一步包含一初始模組,用以設定該提示音裝置、設定一踏板訊息管道及確定一程序檔存在。 The detection system as described in claim 2, wherein the image detection device further includes an initial module for setting the prompt sound device, setting a pedal message channel and determining the existence of a program file. 一種大腸瘜肉影像偵測裝置,係搭配一內視鏡裝置及一顯示裝置使用,其包含:一影像讀取模組,其係用以讀取接收自該內視鏡裝置之一影像;一推論模組,其在一推論開關為開時,降低該影像之解析度至672 x 352後將該影像輸入一推論模型以得到複數個位置區塊資訊及各自相對應之機率資訊,該推論模型採用監督式卷積深度學習模型,其係使用一Resnet50分類模型搭配類別活化映射的方法標記之瘜肉影像進行訓練,該推論模型係執行於 一圖形處理器中,該推論模型採用一Yolov4架構,並運作在一TensorRT框架上,且該TensorRT框架採用16浮點數精度;以及一顯示模組,將該機率資訊大於一過濾閥值之相對應之該位置區塊資訊進行一重複偵測篩選,根據篩選出之位置區塊資訊標示一位置區塊於該影像上。 A large intestine polyp image detection device is used with an endoscope device and a display device, which includes: an image reading module, which is used to read an image received from the endoscope device; The inference module, when an inference switch is turned on, reduces the resolution of the image to 672 x 352 and then inputs the image into an inference model to obtain a plurality of location block information and their corresponding probability information, the inference model A supervised convolutional deep learning model was used for training on polyp images labeled with a Resnet50 classification model coupled with class activation mapping. The inference model was implemented on In a graphics processor, the inference model adopts a Yolov4 architecture and operates on a TensorRT framework, and the TensorRT framework adopts 16 floating-point precision; and a display module, the probability information is greater than a filter threshold A repetition detection filter is performed on the corresponding location block information, and a location block is marked on the image according to the filtered location block information. 如請求項7所述之偵測裝置,進一步包含:一通知模組,在一提示音開關為開時,當該顯示模組標示該位置區塊時,觸發一提示音裝置根據一提示音音量發出提示音。 The detection device as described in claim 7, further comprising: a notification module, when a prompt sound switch is on, when the display module marks the location block, trigger a prompt sound device according to a prompt sound volume Beep. 一種大腸瘜肉影像偵測方法,包含:讀取接收自一內視鏡裝置之一影像;在一推論開關為開時,降低該影像之解析度至672 x 352後將該影像輸入一推論模型以得到複數個位置區塊資訊及各自相對應之機率資訊,該推論模型採用監督式卷積深度學習模型,其係使用一Resnet50分類模型搭配類別活化映射的方法標記之瘜肉影像進行訓練,該推論模型係執行於一圖形處理器中,該推論模型採用一Yolov4架構,並運作在一TensorRT框架上,且該TensorRT框架採用16浮點數精度;以及將該機率資訊大於一過濾閥值之相對應之該位置區塊資訊進行一重複偵測篩選,根據篩選出之位置區塊資訊標示一位置區塊於該影像上。 An image detection method for large intestinal polyps, comprising: reading an image received from an endoscope device; when an inference switch is turned on, reducing the resolution of the image to 672 x 352 and then inputting the image into an inference model In order to obtain a plurality of location block information and their corresponding probability information, the inference model uses a supervised convolutional deep learning model, which uses a Resnet50 classification model and a class activation mapping method to mark polyp images for training. The inference model is executed in a graphics processor, the inference model adopts a Yolov4 architecture, and operates on a TensorRT framework, and the TensorRT framework adopts 16 floating-point precision; and the probability information is greater than a filtering threshold. A repetition detection filter is performed on the corresponding location block information, and a location block is marked on the image according to the filtered location block information. 如請求項9所述之偵測方法,進一步包含:在一提示音開關為開時,當標示該位置區塊時,觸發一提示音裝置根據一提示音音量發出提示音。 The detection method as described in claim 9, further comprising: when a prompt sound switch is on, when the location block is marked, triggering a prompt sound device to emit a prompt sound according to a volume of a prompt sound.
TW110136684A 2021-10-01 2021-10-01 Imaging detection method and device for colorectal polyps and system thereof TWI789932B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
TW110136684A TWI789932B (en) 2021-10-01 2021-10-01 Imaging detection method and device for colorectal polyps and system thereof

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
TW110136684A TWI789932B (en) 2021-10-01 2021-10-01 Imaging detection method and device for colorectal polyps and system thereof

Publications (2)

Publication Number Publication Date
TWI789932B true TWI789932B (en) 2023-01-11
TW202316445A TW202316445A (en) 2023-04-16

Family

ID=86670139

Family Applications (1)

Application Number Title Priority Date Filing Date
TW110136684A TWI789932B (en) 2021-10-01 2021-10-01 Imaging detection method and device for colorectal polyps and system thereof

Country Status (1)

Country Link
TW (1) TWI789932B (en)

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110600122A (en) * 2019-08-23 2019-12-20 腾讯医疗健康(深圳)有限公司 Digestive tract image processing method and device and medical system
CN111655116A (en) * 2017-10-30 2020-09-11 公益财团法人癌研究会 Image diagnosis support device, data collection method, image diagnosis support method, and image diagnosis support program
TW202037327A (en) * 2018-11-21 2020-10-16 日商Ai醫療服務股份有限公司 Disease diagnostic assistance method based on digestive organ endoscopic images, diagnostic assistance system, diagnostic assistance program, and computer-readable recording medium having diagnostic assistance program stored thereon
TWM602876U (en) * 2020-07-21 2020-10-21 華碩電腦股份有限公司 Assisted detection system
CN112863699A (en) * 2020-12-31 2021-05-28 山东大学齐鲁医院 ESD preoperative discussion system based on mobile terminal
TWM621648U (en) * 2021-10-01 2021-12-21 國泰醫療財團法人國泰綜合醫院 Colorectal polyp image detection system

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111655116A (en) * 2017-10-30 2020-09-11 公益财团法人癌研究会 Image diagnosis support device, data collection method, image diagnosis support method, and image diagnosis support program
TW202037327A (en) * 2018-11-21 2020-10-16 日商Ai醫療服務股份有限公司 Disease diagnostic assistance method based on digestive organ endoscopic images, diagnostic assistance system, diagnostic assistance program, and computer-readable recording medium having diagnostic assistance program stored thereon
CN110600122A (en) * 2019-08-23 2019-12-20 腾讯医疗健康(深圳)有限公司 Digestive tract image processing method and device and medical system
TWM602876U (en) * 2020-07-21 2020-10-21 華碩電腦股份有限公司 Assisted detection system
CN112863699A (en) * 2020-12-31 2021-05-28 山东大学齐鲁医院 ESD preoperative discussion system based on mobile terminal
TWM621648U (en) * 2021-10-01 2021-12-21 國泰醫療財團法人國泰綜合醫院 Colorectal polyp image detection system

Also Published As

Publication number Publication date
TW202316445A (en) 2023-04-16

Similar Documents

Publication Publication Date Title
US10101890B2 (en) System and method for displaying portions of in-vivo images
WO2021147429A1 (en) Endoscopic image display method, apparatus, computer device, and storage medium
US20220296081A1 (en) Method for real-time detection of objects, structures or patterns in a video, an associated system and an associated computer readable medium
US10743752B2 (en) Endoscope system, capsule endoscope device, and method for operating endoscope system
CN111936032A (en) Image processing apparatus, endoscope system, and image processing method
CN110738655B (en) Image report generation method, device, terminal and storage medium
US8768024B1 (en) System and method for real time detection of villi texture in an image stream of the gastrointestinal tract
CN113498323B (en) Medical image processing device, processor device, endoscope system, medical image processing method, and recording medium
WO2020162275A1 (en) Medical image processing device, endoscope system, and medical image processing method
WO2020008834A1 (en) Image processing device, method, and endoscopic system
WO2012042986A1 (en) Information processing device and capsule endoscope system
US20220414880A1 (en) Medical system, information processing method, and computer-readable medium
US8986198B2 (en) Image display apparatus and capsule endoscope system
CN114945314A (en) Medical image processing device, endoscope system, diagnosis support method, and program
CN113744266B (en) Method and device for displaying focus detection frame, electronic equipment and storage medium
TWM621648U (en) Colorectal polyp image detection system
TWI789932B (en) Imaging detection method and device for colorectal polyps and system thereof
JP7335976B2 (en) Endoscope system, control program, and display method
US10726553B2 (en) Image processing apparatus, image processing system, operation method of image processing apparatus, and computer-readable recording medium
JP2013063179A (en) Observation system
CN116681681B (en) Endoscopic image processing method, device, user equipment and medium
CN108292427A (en) Operation control device, operation control method, program and surgery systems
CN106780429B (en) Method for extracting key frame of WCE video time sequence redundant image data based on perception color space and key corner
WO2023067922A1 (en) Endoscope image processing device, endoscope image processing method, and endoscope system
WO2022210508A1 (en) Processor device, medical image processing device, medical image processing system, and endoscopic system