TWI781760B - Application of deep learning image recognition technology to door collision detection method and system - Google Patents

Application of deep learning image recognition technology to door collision detection method and system Download PDF

Info

Publication number
TWI781760B
TWI781760B TW110133829A TW110133829A TWI781760B TW I781760 B TWI781760 B TW I781760B TW 110133829 A TW110133829 A TW 110133829A TW 110133829 A TW110133829 A TW 110133829A TW I781760 B TWI781760 B TW I781760B
Authority
TW
Taiwan
Prior art keywords
image
deep learning
warning
image recognition
information
Prior art date
Application number
TW110133829A
Other languages
Chinese (zh)
Other versions
TW202311087A (en
Inventor
王圳木
黃子瀚
Original Assignee
國立勤益科技大學
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 國立勤益科技大學 filed Critical 國立勤益科技大學
Priority to TW110133829A priority Critical patent/TWI781760B/en
Application granted granted Critical
Publication of TWI781760B publication Critical patent/TWI781760B/en
Publication of TW202311087A publication Critical patent/TW202311087A/en

Links

Images

Landscapes

  • Image Analysis (AREA)
  • Image Processing (AREA)

Abstract

一種應用深度學習影像辨識技術於車門碰撞檢測方法及系統,其包括影像擷取裝置、資訊處理單元及資訊輸出單元。影像擷取裝置可連續對汽車後方進行影像擷取而成像為複數動態影像。資訊處理單元之影像辨識處理模組建立深度學習演算模型,將每一動態影像的預設之感興趣區域以外的影像部分去除,以像進行影像特徵擷取後輸入至影像辨識模型,使影像辨識模型對動態影像進行物件分類與物件定位的影像辨識處理,以預測出移動物件的類別資訊與位置資訊,並判斷是否會進入警戒區之閾值範圍,判斷結果是,則輸出警示訊號。資訊輸出單元之顯示幕可顯示包含有框選移動中之物件以代表位置資訊的預測框格、警戒框格,俾能藉由整合深度學習影像辨識與警戒閾值等技術,讓駕駛及乘客獲得車後方的人車動態及發出後方車輛進入警戒區的警告訊號,以大幅降低於門開遭到撞擊的發生機率,進而提升汽車出入的安全防護性。 A method and system for applying deep learning image recognition technology to vehicle door collision detection, which includes an image capture device, an information processing unit, and an information output unit. The image capture device can continuously capture images of the rear of the car and form multiple dynamic images. The image recognition processing module of the information processing unit establishes a deep learning algorithm model, removes the image part outside the preset region of interest of each dynamic image, and uses the image to perform image feature extraction and then input it to the image recognition model to make the image recognition The model performs image recognition processing of object classification and object positioning on dynamic images to predict the category information and location information of moving objects, and judge whether it will enter the threshold range of the warning zone. If the judgment result is yes, it will output a warning signal. The display screen of the information output unit can display prediction frames and warning frames that include frame selection of moving objects to represent position information, so that by integrating deep learning image recognition and warning threshold technologies, drivers and passengers can obtain The movement of people and vehicles in the rear and the warning signal for the vehicles in the rear to enter the warning zone can greatly reduce the probability of being hit when the door is opened, thereby improving the safety protection of vehicles entering and exiting.

Description

應用深度學習影像辨識技術於車門碰撞檢測方法及系統 Applying Deep Learning Image Recognition Technology to Vehicle Door Collision Detection Method and System

本發明係關於一種應用深度學習影像辨識技術於車門碰撞檢測方法及系統,尤指一種可以藉由整合深度學習影像辨識與警戒閾值等技術讓駕駛及乘客獲得車後方的人車動態及發出進入警戒區的警告訊號的汽車門開防撞安全防護技術。 The present invention relates to a method and system for applying deep learning image recognition technology to car door collision detection, especially to a method and system that can enable drivers and passengers to obtain the dynamics of people and vehicles behind the car and issue entry warnings by integrating deep learning image recognition and warning threshold technologies Zone warning signal of car door opening anti-collision safety protection technology.

按,汽車的各個車門大多是與車體形成轉動的樞接,所以每個車門皆可以各自獨立的相對車體做向外的往復開合動作,於此,以供乘客及駕駛者作為出入車輛的用途。然而,時常可以在新聞或者是社群網站上看到一些交通事故,主要是由汽車駕駛未依規定二段式開門或者是未注意後方來車突然開啟車門,因而導致後方機車驚嚇使機車駕駛偏離路線跌倒;或是突然被開啟的車門撞到而飛出路線,無論是發生何種開門事件都會讓機車駕駛受到傷害嚴重則使機車駕駛傷亡,在交通事故中,此事故確實已顯得屢見不鮮,致使造成無謂的人命傷亡以及財產上的損失。也因如此,各國政府當局也已經正視此一問題的嚴重性,並開始做相關二段式開門操作的安全開導與宣傳;惟此類的交通意外事故發生率仍無降低的趨勢,因此,上述類型的交通事故實已成為各國當局政府、相關學界以及業者所急亟欲解決的技術課題與挑戰。 By the way, most of the doors of the car are pivotally connected with the car body, so each door can be independently opened and closed relative to the car body. Here, it is used as an entry and exit vehicle for passengers and drivers. the use of. However, you can often see some traffic accidents on the news or on social networking sites. The main reason is that the driver of the car did not open the door in two stages according to the regulations or did not pay attention to the car coming from behind and suddenly opened the door, which caused the rear locomotive to frighten and make the locomotive driver deviate. The line falls; or is suddenly hit by the open door and flies out of the line. No matter what kind of door opening event occurs, the locomotive driver will be injured seriously, and the locomotive driver will be seriously injured. In traffic accidents, this accident has indeed become common. Cause unnecessary casualties and property losses. Also because of this, government authorities in various countries have also faced up to the seriousness of this problem, and have begun to do relevant two-stage door opening operation safety enlightenment and propaganda; Types of traffic accidents have become a technical issue and challenge that governments, relevant academic circles, and industry players in various countries urgently want to solve.

針對上述缺失,相關業者已研發出一種於下車開門前可以顯 示後方動態影像的習知技術,該習知技術的代表性專利如中國大陸發明第CN1128600A『車輛後方確認裝置』及中華民國新型公告第M376460號『開門防追撞反射鏡』等專利所示。該等專利可於下車開門前關閉車門鎖一段時間,並顯示後方動態影像供乘客或駕駛者觀看,再由乘客或駕駛者自行決定是否開啟車門,當乘客或駕駛者因漫不精心或是視力出現問題而導致無法辨識後方是否有來車時,同樣仍會發生門開遭撞的交通意外事故,因而造成乘客在門開下車時的極大壓力困擾與不便。 In response to the above-mentioned shortcomings, related companies have developed a device that can be displayed before getting off the car and opening the door. The conventional technology of displaying the rear dynamic image, the representative patent of this conventional technology is shown in the patents such as CN1128600A "Vehicle Rear Confirmation Device" of Chinese Mainland Invention and No. M376460 "Door Anti-Collision Mirror" of the Republic of China New Announcement. These patents can close the door lock for a period of time before getting out of the car and opening the door, and display the rear dynamic image for the passenger or driver to watch, and then the passenger or driver decides whether to open the door. When a problem occurs and it is impossible to identify whether there is an oncoming car from the rear, a traffic accident in which the door is opened and bumped will still occur, thus causing great pressure, distress and inconvenience for passengers when the door is opened and get off.

另有一種習知技術係利用感測技術(如超音波感測器、紅外線感測器及雷達感測器)來感測接近車輛後方的人車動態,當有車輛靠近時,則將車門暫時鎖定卡制而無法開啟,當無車輛靠近時,則將車門解除鎖定而可被開啟;該習知技術的代表性專利如中國大陸新型第CN201633624號『車門開啟安全監測裝置』、中華民國新型公告第516534號『檢視後方來車之開門警示裝置』及中華民國新型公告第M318541號『車門警示裝置結構改良』等專利所示;該等專利雖然可以透過上述感測技術來感測車輛後方的行車動態;惟,超音波感測器與紅外線感測器的感測範圍距離不足以捕捉到急速行駛的車輛動態,以致會產生感測死角與盲點的出現問題,因而同樣會發生門開遭到撞擊的交通事故;此外,雷達的感測範圍距離雖然較遠,但是造價過於高價昂貴,以致較無法全面普及採用,因而造成商品化實現的重要阻礙因素。 Another known technology is to use sensing technology (such as ultrasonic sensor, infrared sensor and radar sensor) to sense the dynamics of people and vehicles approaching the rear of the vehicle. When a vehicle approaches, the door is temporarily closed. Locking and jamming cannot be opened, and when no vehicle is approaching, the door is unlocked and can be opened; representative patents of this known technology such as the Chinese Mainland New Model No. CN201633624 "Car Door Opening Safety Monitoring Device", the Republic of China New Announcement Patents such as No. 516534 "Door Opening Warning Device for Checking Cars Coming from Behind" and ROC New Announcement No. M318541 "Structure Improvement of Vehicle Door Warning Device" are shown; although these patents can use the above-mentioned sensing technology to sense the driving behind the vehicle dynamics; however, the distance between the sensing range of the ultrasonic sensor and the infrared sensor is not enough to capture the dynamics of a fast-moving vehicle, so that there will be problems with sensing dead angles and blind spots, so door openings will also be hit traffic accidents; in addition, although the sensing range of the radar is relatively long, the cost is too high, so that it cannot be fully popularized and adopted, thus causing an important obstacle to the realization of commercialization.

又有一種如中華民國發明公告第I531500號『利用影像處理的汽車車門開啟警示方法』,該專利之影像處理單元可以根據預設之透視轉換公式,將所獲得之目標物件在第n幅影像中之中心之像素位置轉換為該 目標物件在該路面上之位置的二維座標;並根據該目標物件的二維座標及該汽車之車門在該路面上之位置的二維座標來判定該目標物件與該汽車之車門之距離是否小於一預設距離;判定結果為是時,藉由該影像處理單元,將警示輸出至輸出單元。該專利雖然具備影像辨識目標物件與車門距離而發出警示的功能;惟,該專利並無整合深度學習影像辨識與警戒閾值範圍等技術的建置,所以僅能以距離值來作為警示發佈的依據,況且該專利非以目標物件是否進入警戒閾值範圍來作為警示發佈的依據,理論上位處較側邊不會進入警戒區的目標物件只要達到預設距離值仍會發佈警示,以致會因偵測誤差過大而造成誤報的情事產生,因而造成使用上的不便與困擾的情事產生。 There is another such as the Republic of China Invention Announcement No. I531500 "Automobile Door Opening Warning Method Using Image Processing", the image processing unit of this patent can convert the obtained target object in the nth image according to the preset perspective conversion formula The pixel position of the center is converted to the The two-dimensional coordinates of the position of the target object on the road surface; and determine whether the distance between the target object and the car door of the car is less than a preset distance; when the determination result is yes, the image processing unit outputs a warning to the output unit. Although this patent has the function of image recognition of the distance between the target object and the car door to issue a warning; however, this patent does not integrate deep learning image recognition and the establishment of warning threshold range technology, so the distance value can only be used as the basis for warning release Moreover, the patent does not use whether the target object enters the warning threshold range as the basis for issuing the warning. The error is too large to cause false positives, thus causing inconvenience and troubles in use.

有鑑於此,本發明人認為上述習知技術與該等專利前案確實未臻完善,仍然有再改善的必要性,因此,本發明人等乃再深入研究,進而研發出如本發明所揭露的技術成果。 In view of this, the inventor believes that the above-mentioned prior art and the prior patents are indeed not perfect, and there is still a need for further improvement. Therefore, the inventors conducted further research and developed a patent as disclosed in the present invention. technical achievements.

本發明第一目的,在於提供一種應用深度學習影像辨識技術於車門碰撞檢測方法及系統,主要藉由整合深度學習影像辨識與警戒閾值等技術,讓駕駛及乘客獲得車後方的人車動態及發出後方車輛進入警戒區的警告訊號,以大幅降低於門開遭到撞擊的發生機率,進而提升汽車出入的安全防護性。達成本發明第一目的之技術手段,係包括影像擷取裝置、資訊處理單元及資訊輸出單元。影像擷取裝置可連續對汽車後方進行影像擷取而成像為複數動態影像。資訊處理單元之影像辨識處理模組建立深度學習演算模型,將每一動態影像的預設之感興趣區域以外的影像部分去 除,以像進行影像特徵擷取後輸入至影像辨識模型,使影像辨識模型對動態影像進行物件分類與物件定位的影像辨識處理,以預測出移動物件的類別資訊與位置資訊,並判斷是否會進入警戒區之閾值範圍,判斷結果是,則輸出警示訊號。資訊輸出單元之顯示幕可顯示包含有框選移動中之物件以代表位置資訊的預測框格、警戒框格。 The first object of the present invention is to provide a method and system for applying deep learning image recognition technology to vehicle door collision detection, mainly by integrating deep learning image recognition and warning threshold technology, so that drivers and passengers can obtain the dynamics and signals of people and vehicles behind the car. The warning signal of the rear vehicle entering the warning area can greatly reduce the probability of being hit when the door is opened, thereby improving the safety protection of car entry and exit. The technical means to achieve the first objective of the present invention includes an image capture device, an information processing unit and an information output unit. The image capture device can continuously capture images of the rear of the car and form multiple dynamic images. The image recognition processing module of the information processing unit establishes a deep learning algorithm model, and removes the image parts other than the preset interest area of each dynamic image In addition, image features are extracted by images and then input to the image recognition model, so that the image recognition model performs image recognition processing for object classification and object positioning on dynamic images, so as to predict the category information and location information of moving objects, and judge whether they will Enter the threshold range of the warning zone, if the judgment result is yes, then output a warning signal. The display screen of the information output unit can display a prediction frame and a warning frame that include frame selection of moving objects to represent position information.

本發明第二目的,在於提供一種具備物件合併功能使預測框格可以快速框出更準確範圍的應用深度學習影像辨識技術於車門碰撞檢測方法及系統。達成本發明第二目的之技術手段,係包括影像擷取裝置、資訊處理單元及資訊輸出單元。影像擷取裝置可連續對汽車後方進行影像擷取而成像為複數動態影像。資訊處理單元之影像辨識處理模組建立深度學習演算模型,將每一動態影像的預設之感興趣區域以外的影像部分去除,以像進行影像特徵擷取後輸入至影像辨識模型,使影像辨識模型對動態影像進行物件分類與物件定位的影像辨識處理,以預測出移動物件的類別資訊與位置資訊,並判斷是否會進入警戒區之閾值範圍,判斷結果是,則輸出警示訊號。資訊輸出單元之顯示幕可顯示包含有框選移動中之物件以代表位置資訊的預測框格、警戒框格。其中,該影像辨識處理模組執行時更包含物件合併步驟,其包括下列步驟:步驟一,將預測完成的相關類別以信心度從大至小進行排序;步驟二,判斷是否有人的物件,物件則是將其他的類別進行框選,若有人的物件將會針對人的物件與其他預測出來的類別以IoU的方式來判斷是否有重疊的部分;步驟三,IoU出來的參數大於閾值範圍時,則合併該兩物件成為一個新的物件;及步驟四,若經過比對並 無任何重複時,則分別框出與所預測的類別。 The second object of the present invention is to provide a method and system for applying deep learning image recognition technology to vehicle door collision detection with the function of merging objects so that the prediction frame can quickly frame a more accurate range. The technical means to achieve the second objective of the present invention includes an image capture device, an information processing unit and an information output unit. The image capture device can continuously capture images of the rear of the car and form multiple dynamic images. The image recognition processing module of the information processing unit establishes a deep learning algorithm model, removes the image part outside the preset region of interest of each dynamic image, and uses the image to perform image feature extraction and then input it to the image recognition model to make the image recognition The model performs image recognition processing of object classification and object positioning on dynamic images to predict the category information and location information of moving objects, and judge whether it will enter the threshold range of the warning zone. If the judgment result is yes, it will output a warning signal. The display screen of the information output unit can display a prediction frame and a warning frame that include frame selection of moving objects to represent position information. Wherein, the image recognition processing module further includes an object merging step during execution, which includes the following steps: Step 1, sorting the predicted related categories from large to small according to the degree of confidence; Step 2, judging whether there are human objects, objects It is to frame other categories. If there is a human object, it will use IoU to judge whether there is any overlap between the human object and other predicted categories; step 3, when the IoU parameter is greater than the threshold range, Then merge the two objects into a new object; and step 4, if compared and merged When there is no repetition, frame the predicted category separately.

本發明第三目的,在於提供一種可以針對小車因閃避大車而突然往車門方向靠近所致的危險而發出危險預警的應用深度學習影像辨識技術於車門碰撞檢測方法及系統。達成本發明第三目的之技術手段,係包括影像擷取裝置、資訊處理單元及資訊輸出單元。影像擷取裝置可連續對汽車後方進行影像擷取而成像為複數動態影像。資訊處理單元之影像辨識處理模組建立深度學習演算模型,將每一動態影像的預設之感興趣區域以外的影像部分去除,以像進行影像特徵擷取後輸入至影像辨識模型,使影像辨識模型對動態影像進行物件分類與物件定位的影像辨識處理,以預測出移動物件的類別資訊與位置資訊,並判斷是否會進入警戒區之閾值範圍,判斷結果是,則輸出警示訊號。資訊輸出單元之顯示幕可顯示包含有框選移動中之物件以代表位置資訊的預測框格、警戒框格。其中,當該影像辨識處理模組影像辨識出二該物件為自行車與大型車或是機車與大型車並行且判定該自行車或該機車判斷該物件不會進入該警戒區及超過該閾值範圍時,則發出可能會進入該警戒區及超過該閾值範圍的第二警示訊號。 The third object of the present invention is to provide a door collision detection method and system that can issue an early warning against the danger caused by a small car dodging a large car and suddenly approaching the door. The technical means to achieve the third objective of the present invention includes an image capture device, an information processing unit and an information output unit. The image capture device can continuously capture images of the rear of the car and form multiple dynamic images. The image recognition processing module of the information processing unit establishes a deep learning algorithm model, removes the image part outside the preset region of interest of each dynamic image, and uses the image to perform image feature extraction and then input it to the image recognition model to make the image recognition The model performs image recognition processing of object classification and object positioning on dynamic images to predict the category information and location information of moving objects, and judge whether it will enter the threshold range of the warning zone. If the judgment result is yes, it will output a warning signal. The display screen of the information output unit can display a prediction frame and a warning frame that include frame selection of moving objects to represent position information. Wherein, when the image recognition processing module recognizes that the object is a bicycle and a large vehicle or a motorcycle and a large vehicle, and it is determined that the bicycle or the motorcycle judges that the object will not enter the warning zone and exceed the threshold range, Then send out the second warning signal that may enter the warning zone and exceed the threshold range.

1:汽車 1: car

10:影像擷取裝置 10: Image capture device

20:資訊處理單元 20: Information processing unit

21:影像辨識處理模組 21: Image recognition processing module

210:深度學習演算模型 210:Deep Learning Calculus Model

211:座標參數資料庫 211: Coordinate parameter database

30:資訊輸出單元 30: Information output unit

31:顯示幕 31: display screen

32:警示裝置 32:Warning device

40:供電單元 40: Power supply unit

50:藍芽通訊系統 50:Bluetooth communication system

60:記憶模組 60: Memory module

A1:感興趣區域 A1: Region of interest

A2:警戒區 A2: warning area

A3:閾值範圍 A3: Threshold range

A4:預測框格 A4: Forecast grid

A5:警戒框格 A5: warning sash

B1,B2:中心像素位置 B1, B2: center pixel position

圖1為本發明具體實施架構的功能方塊示意圖。 FIG. 1 is a schematic functional block diagram of a specific implementation architecture of the present invention.

圖2係本發明深度學習演算模組於預測階段的流程實施示意圖。 FIG. 2 is a schematic diagram of the implementation process of the deep learning calculation module in the prediction stage of the present invention.

圖3係本發明深度學習演算模組於訓練階段的流程實施示意圖。 FIG. 3 is a schematic diagram of the implementation process of the deep learning algorithm module in the training phase of the present invention.

圖4為本發明顯示幕顯示動態影像之分類與定位資訊的實施示意圖。 FIG. 4 is a schematic diagram of the implementation of displaying dynamic image classification and positioning information on the display screen of the present invention.

圖5為本發明顯示幕顯示動態影像中之二物件合併前的實施示意圖。 FIG. 5 is a schematic diagram showing the implementation of the display screen before combining two objects in the dynamic image according to the present invention.

圖6為本發明顯示幕顯示動態影像中之二物件合併後的實施示意圖。 FIG. 6 is a schematic diagram of the implementation of the combination of two objects in a dynamic image displayed on the display screen of the present invention.

圖7為本發明顯示幕顯示動態影像之分類與定位資訊的又一實施示意圖。 FIG. 7 is a schematic diagram of another implementation of displaying dynamic image classification and positioning information on the display screen of the present invention.

圖8為本發明影像處理取得物件影像之中心像素位置的實施示意圖。 FIG. 8 is a schematic diagram of the implementation of image processing to obtain the center pixel position of an object image according to the present invention.

圖9為圖7、8物件影像之中心像素位置的座標軌跡分析示意圖。 FIG. 9 is a schematic diagram of the coordinate track analysis of the central pixel position of the image of the object in FIGS. 7 and 8 .

圖10為本發明合併物件的流程控制實施示意圖。 Fig. 10 is a schematic diagram of the process control implementation of the merged object of the present invention.

圖11為本發明深度學習演算模型訓練完成後的LOSS示意圖。 Fig. 11 is a schematic diagram of LOSS after the training of the deep learning calculation model of the present invention is completed.

圖12為本發明深度學習演算模型訓練完成後的權重測試實施示意圖。 FIG. 12 is a schematic diagram of the implementation of the weight test after the training of the deep learning calculation model of the present invention is completed.

為讓 貴審查委員能進一步瞭解本發明整體的技術特徵與達成本發明目的之技術手段,玆以具體實施例並配合圖式加以詳細說明:請參看圖1~4所示,為達成本發明第一目的之第一實施例,係包括一影像擷取裝置10、一資訊處理單元20(如電腦或微處理器MCU;但不以此為限)、一資訊輸出單元30一用以供應所需電源的供電單元40(如汽車電瓶與電源處理電路的組合;但不以此為限)。該影像擷取裝置10設在汽車1上(如左後照鏡;或行李箱等位置)且以鏡頭朝向汽車1後方,而可於下車時段自動或半自動連續對汽車1後方或左側後方的人車動態進行影像擷取而成像為複數連續的動態影像。該資訊處理單元20包含一影像辨識處理模組21,並建立有一深度學習演算模型210,該影像辨識處理模組21將輸入之每一動態影像的一預設區域設定為一感興趣區域A1,並將感興趣區域A1以外的影像部分去除,以對保留有感興趣區域A1的動態影像進行影像特徵擷取後輸入至深度學習演算模型210,使深度學習演算模型210對每一輸入之動態影像進行物件之分類與物件之定位的影像辨識處理,以預測出動態影像中移動之物件的類別資訊與位置資訊,並由類別資訊與位置資訊來判斷物件是否會進入預設警戒區A2的閾值範圍A3,當判斷結果是,則輸出警示訊號。該資訊輸出單元30包含用以顯示動態影像的顯示幕31 及用以將警示訊號輸出為音頻或語音的警示資訊(如危險不可開門,車後方即將有車輛通過)的警示裝置32,該顯示幕31之動態影像上顯示包含有框選移動中之物件以代表位置資訊的預測框格A4、位於預測框格A4附近的類別資訊(如汽車、機車、自行車或人)及代表警戒區A2的警戒框格A5,如果預測框格A4內的物件會通過閾值範圍A3,預測框格A4則會以紅色框格及閃爍來示警,於是即可讓駕駛或乘客知悉,此時處於危險狀態,不可開啟車門下車;如果預測框格A4內的物件不會通過閾值範圍A3,預測框格A4則會以綠色框格來表示,於是即可讓駕駛或乘客知悉,此時處理安全狀態,可以放心地開啟車門下車。 In order to allow your examiner to further understand the overall technical characteristics of the present invention and the technical means to achieve the purpose of the present invention, the specific embodiments are described in detail in conjunction with the drawings: please refer to Figures 1 to 4, in order to achieve the first part of the present invention The first embodiment of an object includes an image capture device 10, an information processing unit 20 (such as a computer or a microprocessor MCU; but not limited thereto), an information output unit 30 for supplying required The power supply unit 40 of the power supply (such as a combination of a car battery and a power processing circuit; but not limited thereto). The image capture device 10 is set on the car 1 (such as the left rear-view mirror; or the trunk, etc.) and faces the rear of the car 1 with the camera lens, and can automatically or semi-automatically continuously monitor the people behind the car 1 or on the left side during the time of getting off the car. Car dynamic image capture and imaging into a plurality of continuous dynamic images. The information processing unit 20 includes an image recognition processing module 21, and establishes a deep learning algorithm model 210. The image recognition processing module 21 sets a preset region of each input dynamic image as a region of interest A1, And the part of the image other than the region of interest A1 is removed, so as to extract the image features of the dynamic image that retains the region of interest A1, and then input it to the deep learning calculation model 210, so that the deep learning calculation model 210 can perform a dynamic image analysis for each input Carry out image recognition processing of object classification and object positioning to predict the category information and location information of moving objects in the dynamic image, and judge whether the object will enter the threshold range of the default warning area A2 based on the category information and location information A3, when the judgment result is yes, then output a warning signal. The information output unit 30 includes a display screen 31 for displaying dynamic images And a warning device 32 for outputting the warning signal as audio or voice warning information (such as the danger that the door cannot be opened, and there will be a vehicle passing behind the car). Prediction frame A4 representing location information, category information (such as automobile, motorcycle, bicycle or person) near prediction frame A4 and warning frame A5 representing warning area A2, if the object in prediction frame A4 will pass the threshold In the range A3, the prediction box A4 will give a warning with a red box and flashing, so that the driver or passengers can know that they are in a dangerous state at this time and cannot open the door to get off; if the object in the prediction box A4 will not pass the threshold The range A3 and the prediction frame A4 will be represented by a green frame, so that the driver or passengers can know that the processing is safe at this time, and you can safely open the door and get off the car.

請參看圖2~3所示的實施例,該影像辨識處理模組21執行時則包含下列步驟: Please refer to the embodiment shown in Figures 2 to 3, the image recognition processing module 21 includes the following steps when executed:

(a)訓練階段步驟,係建立深度學習演算模型210,於深度學習演算模型210輸入巨量物件的特徵樣本、警戒區辨識參數、影像分類參數及影像定位參數,並由深度學習演算模型210測試影像辨識的正確率,再判斷影像辨識正確率是否足夠,當判斷結果為是,則將辨識結果輸出及儲存,當判斷結果為否,則使深度學習演算模型210自我修正學習。 (a) The step of the training stage is to establish a deep learning calculation model 210, input the feature samples of a huge number of objects, warning area identification parameters, image classification parameters and image positioning parameters into the deep learning calculation model 210, and test by the deep learning calculation model 210 The correct rate of image recognition is then judged whether the correct rate of image recognition is sufficient. If the judgment result is yes, then the recognition result is output and stored.

(b)運行預測階段步驟,係於深度學習演算模型210依序輸入即時擷取之動態影像,並由深度學習演算模型210計算出相應的物件特徵,以預測辨識出物件的類別資訊、位置資訊及物件是否超過警戒區A2所預設的閾值範圍A3等之辨識結果資訊。 (b) The step of running the prediction stage is to input the real-time captured dynamic images into the deep learning algorithm model 210 in sequence, and calculate the corresponding object features by the deep learning algorithm model 210, so as to predict the category information and location information of the recognized object and identification result information such as whether the object exceeds the threshold range A3 preset by the alert area A2.

本發明的一種具體實施例中,該深度學習演算模型210係為採用YOLO v4作為網路架構進行物件之分類與定位辨識的偵測,該深度學習演算模型210的資料集係採用VOC資料集,以VOC2012資料集作為訓練 階段步驟使用,並以VOC2007資料集做為預測階段步驟使用。 In a specific embodiment of the present invention, the deep learning calculation model 210 uses YOLO v4 as the network architecture to detect object classification and location identification, and the data set of the deep learning calculation model 210 uses the VOC data set. Using the VOC2012 data set as training The stage step is used, and the VOC2007 data set is used as the prediction stage step.

請參看圖1~6及圖10所示,為達成本發明第二目的之第二實施例,本實施例除了包括上述第一實施例的整體技術內容之外,係於影像辨識處理模組21執行時更包含物件合併步驟,其包括下列步驟: Please refer to Figures 1 to 6 and Figure 10, in order to achieve the second embodiment of the second purpose of the present invention, this embodiment is based on the image recognition processing module 21 in addition to the overall technical content of the first embodiment above. Execution further includes an object merging step, which includes the following steps:

步驟一,將預測完成的相關類別以信心度從大至小進行排序。 Step 1: sort the predicted related categories in descending order of confidence.

步驟二,判斷是否有人的物件,物件則是將其他的類別進行框選,若有人的物件將會針對人的物件與其他預測出來的類別以IoU的方式來判斷是否有重疊的部分。 Step 2. Determine whether there is a human object, and the object is to frame other categories. If there is a human object, it will judge whether there is an overlap between the human object and other predicted categories in the form of IoU.

步驟三,IoU出來的參數大於閾值範圍A3時,則合併兩物件成為一個新的物件,如圖5、6所示,將機車與人合併為騎士。 Step 3, when the IoU parameter is greater than the threshold range A3, merge the two objects into a new object, as shown in Figures 5 and 6, merging the motorcycle and the person into a knight.

步驟四,若經過比對並無任何重複時,則分別框出與所預測的類別。 Step 4, if there is no duplication after comparison, frame the predicted category respectively.

請參看圖1、7及圖8所示,該影像辨識處理模組21建立一座標參數資料庫211,該座標參數資料庫211設定儲存有一與動態影像對應的座標參數數資及一用以圍起以定位警戒區A2的閾值參數資料,該影像辨識處理模組21更包括執行一影像定位步驟,係將保留有感興趣區域A1的動態影像由RGB格式轉換至YIQ平面,再由YIQ平面取得物件影像,再使用YIQ平面的I值,取閥值除去大部份影像,僅保留該物件之物件影像,再經小面積雜訊過濾除去小面積非物件影像,以取得物件影像底部或左側底部的中心像素位置,再將中心像素位置與閾值參數資料帶入座標參數資 料庫211而分別計算出物件影像底部(如圖7機車輪胎底部);或左側底部(如圖7巴士左側輪胎底部)的中心像素位置B1,B2與閾值範圍A3所處的實際座標位置,以作為物件影像進入警戒區A2及達到閾值範圍A3的判斷依據。 1, 7 and 8, the image recognition processing module 21 establishes a coordinate parameter database 211, and the coordinate parameter database 211 is set to store a coordinate parameter data corresponding to the dynamic image and a frame for surrounding To locate the threshold parameter data of the warning area A2, the image recognition processing module 21 further includes an image positioning step, which is to convert the dynamic image retaining the area of interest A1 from the RGB format to the YIQ plane, and then obtain it from the YIQ plane Object image, and then use the I value of the YIQ plane, take the threshold to remove most of the image, keep only the object image of the object, and then filter out the small area of non-object image through small area noise filtering to obtain the bottom or left bottom of the object image The center pixel position, and then bring the center pixel position and threshold parameter data into the coordinate parameter data The material warehouse 211 calculates the actual coordinate positions of the center pixel positions B1, B2 and the threshold range A3 of the bottom of the object image (as shown in Figure 7, the bottom of the locomotive tire); or the left bottom (as shown in Figure 7, the bottom of the left tire of the bus), respectively. It is used as the basis for judging that the object image enters the warning area A2 and reaches the threshold range A3.

具體的,請再參看圖8所示,中心像素位置B1會通過閾值範圍A3,所以會立即發佈警示,至於中心像素位置B2雖然會通過警戒區A2,但不會通過閾值範圍A3,所以不會發佈警示。 Specifically, please refer to Figure 8 again. The central pixel position B1 will pass through the threshold range A3, so a warning will be issued immediately. As for the central pixel position B2, although it will pass through the warning area A2, it will not pass through the threshold range A3, so it will not Issue a warning.

於本發明的一種具體實施例中,該資訊輸出單元30可以是智慧型手機、智能手錶、智能手環以及智能眼鏡的其中一種,該資訊輸出單元30透過藍芽通訊系統50與資訊處理單元20資訊連結,並以其顯示幕31來顯示動態影像、類別資訊及位置資訊,再以其警示裝置32來輸出警示資訊。 In a specific embodiment of the present invention, the information output unit 30 can be one of smart phones, smart watches, smart bracelets and smart glasses, and the information output unit 30 communicates with the information processing unit 20 through the Bluetooth communication system 50 Information links, and use its display screen 31 to display dynamic images, category information and location information, and then use its warning device 32 to output warning information.

請參看圖1~4及圖7所示,達成本發明第三目的之第三實施例,本實施例除了包括上述第一實施例的整體技術內容之外,當影像辨識處理模組21影像辨識出二物件為自行車與大型車;或是機車與大型車並行且判定自行車或機車不會進入警戒區A2與閾值範圍A3時,則發出可能會進入警戒區A2與閾值範圍A3的第二警示訊號,讓駕駛會乘客知悉此一並行狀況,於是即可避免因閃避大車而突然往汽車之車門方向靠近所致的危險情事發生。 Please refer to Figures 1 to 4 and Figure 7, the third embodiment to achieve the third purpose of the present invention, this embodiment includes the overall technical content of the first embodiment, when the image recognition processing module 21 image recognition When the second object is a bicycle and a large vehicle; or a motorcycle and a large vehicle parallel and it is determined that the bicycle or motorcycle will not enter the warning zone A2 and the threshold range A3, then a second warning signal that may enter the warning zone A2 and the threshold range A3 will be issued , let the driver and passengers know this parallel situation, so that the dangerous situation caused by dodging the large vehicle and approaching the direction of the car door of the car suddenly can be avoided.

於本發明的應用實施例中係為一種行車記錄器,如圖1所示,係包括一用以記錄連續動態之動態影像的記憶模組60及一供影像擷取裝置10、資訊處理單元20、記憶模組60及警示裝置32容裝的機殼組(本圖 式例未示),該機殼組設於汽車1內靠近後車窗使鏡頭可以朝向汽車後方的位置上。 In the application embodiment of the present invention, it is a driving recorder, as shown in Figure 1, it includes a memory module 60 for recording continuous dynamic dynamic images, an image capture device 10, and an information processing unit 20 , memory module 60 and warning device 32 accommodating casing groups (this figure formula example is not shown), the casing group is located in the car 1 near the rear window so that the camera lens can face the position behind the car.

必須陳明的是,上述所指的下車時段係可透過擷取汽車1原本之每一車門開關的開門訊號及關門訊號來觸發資訊處理單元20而加以實現;舉例來說,當其中一個車門被開啟至第一角度時,此車門的車門開關則會產生開門訊號,當資訊處理單元20受到此開門訊號的觸發時,則認定為下車時段的起始時間,並啟動影像擷取裝置10而進行影像擷取;當資訊處理單元20受到關門訊號的觸發時,則認定為下車時段的結束時間,並關閉影像擷取裝置10而結束影像擷取的動作。 It must be stated that the time period for getting off the car mentioned above can be realized by capturing the original door opening signal and door closing signal of each door switch of the car 1 to trigger the information processing unit 20; When it is opened to the first angle, the door switch of the car door will generate a door-opening signal, and when the information processing unit 20 is triggered by the door-opening signal, it will be identified as the start time of the getting-off period, and the image capture device 10 will be activated to carry out Image capture; when the information processing unit 20 is triggered by the door closing signal, it will be determined as the end time of the getting off period, and the image capture device 10 will be closed to end the image capture operation.

上述感興趣區域A1(Region of Interest,RoI)是針對動態影像設計一塊感興趣區域A1如圖4所示,從圖像中框出一塊區域,所框選出來的區域將會是所分析的一關重點之一,對框選出的區域進行下一步處理,故而原本要處理一張較大的圖片將會變成一個圖片的小區域,可以大幅度的減少處理時間也增加了精確度。 The above-mentioned region of interest A1 (Region of Interest, RoI) is to design a region of interest A1 for dynamic images, as shown in Figure 4. A region is framed from the image, and the region selected by the frame will be a part of the analysis. One of the key points is to carry out the next step of processing on the area selected by the frame, so a larger image to be processed will become a small area of the image, which can greatly reduce the processing time and increase the accuracy.

圖11所示為YOLOv4從VOC2012經過訓練完成後的相關LOSS值表現,從該圖可以得知在訓練的過程中可以非常快速的下降LOSS並且收斂完成,相關的val loss也與train loss較為相近。圖12所示為將即時訓練完成的權重進行測試,測試資料為VOC2007資料集,可以看到該圖中person達到85%的AP而car和motorbike有將近80%的AP,但在bicycle上準確度逼近60%AP,而四種類別在IoU 0.5的情況下mAP有達到74.76%的 表現,因此在沒經過訓練的圖片中,YOLOv4的表現還是非常的突出,也透過測試資料我們也可以得知該權重並未產生過擬合(Over-fitting)的問題。 Figure 11 shows the relevant LOSS value performance of YOLOv4 after training from VOC2012. From this figure, it can be seen that the LOSS can be reduced very quickly and the convergence is completed during the training process. The related val loss is also similar to the train loss. Figure 12 shows the test of the weights completed in real-time training. The test data is the VOC2007 data set. It can be seen that the person in the figure reaches 85% AP, while the car and motorbike have nearly 80% AP, but the accuracy on the bicycle Approaching 60% AP, and the four categories have a mAP of 74.76% in the case of IoU 0.5 performance, so in untrained pictures, the performance of YOLOv4 is still very prominent, and through the test data, we can also know that the weight has not caused the problem of over-fitting (Over-fitting).

YOLOv4網路架構可分為輸入(Input)、骨架(Backbone)、頸部(Neck)、頭部(Head),而頭部可分為Dense Prediction(一階段)與Sparse Prediction(二階段),YOLOv4在骨架採用CSPDarknet53、頸部採用SPP與PAN、頭部採用YOLOv3。YOLOv4於FPS表現和AP表現上有著突出,以YOLOv4與ASFF進行比較,可以從YOLOv4位於趨近FPS70的位置AP精準度還有在43.5%而ASFF的FPS趨近於30的位置AP表現趨近於40%,相較於YOLOv4的表現相差甚遠,而YOLOv4與EfficienDet網路比較雖然EfficienDet的AP將近高達50%但在FPS的表現上只有12左右,距離即時運算還是有一段距離,若與EfficienDet在即時運算的表現AP低於YOLOv4的43.5%,因此,本發明使用YOLOv4的網路架構,AP為預測結果與基準真相所大於IoU閾值0.50:0.05:0.95之平均值同為所有10個IoU閾值所有80個類別的平均值。 YOLOv4 network architecture can be divided into input (Input), skeleton (Backbone), neck (Neck), head (Head), and the head can be divided into Dense Prediction (one stage) and Sparse Prediction (two stages), YOLOv4 The skeleton uses CSPDarknet53, the neck uses SPP and PAN, and the head uses YOLOv3. YOLOv4 is outstanding in FPS performance and AP performance. Comparing YOLOv4 with ASFF, it can be seen that YOLOv4's AP accuracy is close to FPS70, and its AP performance is close to 43.5%, while ASFF's FPS is close to 30. 40%, which is far from the performance of YOLOv4. Compared with the EfficienDet network, although the AP of YOLOv4 and EfficienDet is nearly as high as 50%, the FPS performance is only about 12. There is still a distance from real-time computing. The performance AP of the calculation is lower than 43.5% of YOLOv4. Therefore, the present invention uses the network architecture of YOLOv4. The AP is the average value of the predicted result and the benchmark truth greater than the IoU threshold 0.50:0.05:0.95. All 10 IoU thresholds are all 80 average of the categories.

再者,YOLOv4網路輸出時,會分別預測出人、機車、腳踏車、汽車物件,預測結果顯示如圖5所示,若預測出來的過程中,人與機車(腳踏車)有進行重疊並超過所設定之重疊度,後面將會進行合併為新的物件為騎士(rider),合併後顯示如圖6所示。 Furthermore, when the YOLOv4 network outputs, it will predict people, locomotives, bicycles, and car objects respectively. The prediction results are shown in Figure 5. The set overlapping degree will be merged later to form a new object called rider (rider). After the merge, it will be displayed as shown in Figure 6.

如圖4所示,未進入警戒區A2的閾值範圍A3時圖像中央上方會顯示無警告(No Alarm),當騎士進入警戒區A2時將會顯示警告(Alarm),若騎士未超過進入警戒區A2之閾值將不會顯示警告。 As shown in Figure 4, when the rider does not enter the threshold range A3 of the warning zone A2, No Alarm will be displayed above the center of the image. When the knight enters the warning zone A2, a warning will be displayed. Thresholds in zone A2 will not display warnings.

本發明採用深度學習YOLOv4進行檢測後方來車系統,當汽車1駕駛忘記二段式開啟車門或未注意後方來車時,多一項輔助工具警告汽車1駕駛後方目前是否有來車或者是在警告提示聲警告駕駛,YOLOv4的架構可以迅速的辨識物件且可以進行準確的框選位置與物件類別,在輸入大小為608x608的尺寸中擁有將近15(Fps;每秒幀數)的辨識速度使在車門碰撞檢測中可達到即使檢測的判斷能力。 The present invention adopts the deep learning YOLOv4 to detect the coming car system. When the car 1 forgets to open the door in two stages or does not pay attention to the car coming from behind, an additional auxiliary tool warns the car 1 whether there is a car coming from behind or is warning The beep warns driving, the architecture of YOLOv4 can quickly identify objects and can accurately frame the position and object category, and has a recognition speed of nearly 15 (Fps; frames per second) in the input size of 608x608. In the collision detection, the judgment ability of even detection can be achieved.

經上述詳細具體說明後,本發明確實具有下列所述特點: After the above detailed description, the present invention has the following features:

1.本發明確實可以藉由整合深度學習影像辨識與警戒閾值範圍等技術,讓駕駛及乘客獲得車後方的人車動態及發出進入警戒區的警告訊號,以大幅降低於門開遭到撞擊的發生機率,進而提升汽車出入的安全防護性。 1. The present invention can indeed, by integrating technologies such as deep learning image recognition and warning threshold range, allow drivers and passengers to obtain the dynamics of people and vehicles behind the car and issue a warning signal of entering the warning area, so as to greatly reduce the impact on the door opening The probability of occurrence, thereby improving the safety protection of car access.

2.本發明確實是一種具備物件合併功能使預測框格可以更為快速框出準確範圍的應用深度學習影像辨識技術。 2. The present invention is indeed a deep learning image recognition technology that has the function of merging objects so that the prediction frame can frame an accurate range more quickly.

3.本發明確實是一種可以針對小車因閃避大車而突然往車門方向靠近所致的危險而發出警示的應用深度學習影像辨識技術。 3. The present invention is indeed an application of deep learning image recognition technology that can issue a warning against the danger caused by the small car dodging the large car and suddenly approaching the door.

以上所述,僅為本發明之一可行實施例,並非用以限定本發明之專利範圍,凡舉依據下列申請專利範圍所述之內容、特徵以及其精神而為之其他變化的等效實施,皆應包含於本發明之專範圍內。本發明所其體界定於申請專利範圍之結構特徵,未見於同類物品,且具實用性與進步性,已符合發明專利要件,爰依法具文提出申請,謹請 鈞局依法核予專利,以維護本申請人合法之權益。 The above is only a feasible embodiment of the present invention, and is not intended to limit the patent scope of the present invention. All equivalent implementations of other changes based on the content, features and spirit of the following patent scopes are cited. All should be included within the scope of the present invention. The structural features of this invention, which are defined in the scope of the patent application, are not found in similar products, and are practical and progressive, and have met the requirements of an invention patent. Please file an application in accordance with the law. I would like to ask the Jun Bureau to approve the patent in accordance with the law. Safeguard the legitimate rights and interests of the applicant.

10:影像擷取裝置 10: Image capture device

20:資訊處理單元 20: Information processing unit

21:影像辨識處理模組 21: Image recognition processing module

211:座標參數資料庫 211: Coordinate parameter database

30:資訊輸出單元 30: Information output unit

31:顯示幕 31: display screen

32:警示裝置 32:Warning device

40:供電單元 40: Power supply unit

50:藍芽通訊系統 50:Bluetooth communication system

60:記憶模組 60: Memory module

Claims (10)

一種應用深度學習影像辨識技術於車門碰撞檢測方法,其包括: A method of applying deep learning image recognition technology to vehicle door collision detection, comprising: 提供一影像擷取裝置、一資訊處理單元及一資訊輸出單元;其中,該資訊處理單元包含一影像辨識處理模組,該資訊輸出單元包含一顯示幕及一警示裝置; Provide an image capture device, an information processing unit, and an information output unit; wherein, the information processing unit includes an image recognition processing module, and the information output unit includes a display screen and a warning device; 將該影像擷取裝置設在一汽車上且以一鏡頭朝向該汽車的後方,用以於下車時段連續對該後方的人車動態進行影像擷取而成像為複數動態影像; The image capture device is installed on a car with a lens facing the rear of the car, and is used to continuously capture images of the dynamics of people and vehicles at the rear during the time of getting off the car and form multiple dynamic images; 於該影像辨識處理模組建立有一深度學習演算模型,該影像辨識處理模組將輸入之每一該動態影像的一預設區域設定為一感興趣區域,並將該感興趣區域以外的影像部分去除,以對保留有該感興趣區域的該動態影像進行影像特徵擷取後輸入至該深度學習演算模型,使該深度學習演算模型對每一輸入之該動態影像進行物件之分類與該物件之定位的影像辨識處理,以預測出該動態影像中移動之該物件的類別資訊與位置資訊,並由該類別資訊與該位置資訊來判斷該物件是否會進入一警戒區之一閾值範圍,判斷結果是,則輸出一警示訊號;及 A deep learning algorithm model is established in the image recognition processing module, and the image recognition processing module sets a preset region of each input dynamic image as a region of interest, and sets the image part outside the region of interest Remove, to extract the image features of the dynamic image that retains the region of interest and then input it to the deep learning calculation model, so that the deep learning calculation model can classify the object and the object for each input dynamic image Positioning image recognition processing to predict the type information and location information of the object moving in the dynamic image, and judge whether the object will enter a threshold range of a warning zone based on the type information and the location information, and judge the result Yes, output a warning signal; and 以該顯示幕來顯示該動態影像,並以該警示裝置將該警示訊號輸出為音頻或語音警示資訊,再於該顯示幕之該動態影像上顯示包含有框選移動中之該物件以代表該位置資訊的至少一預測框格、位於該至少一預測框格附近的該類別資訊及代表該警戒區的一警戒框格。 Use the display screen to display the dynamic image, and use the warning device to output the warning signal as audio or voice warning information, and then display the moving object on the display screen including the object in frame selection to represent the At least one prediction frame of location information, the category information located near the at least one prediction frame, and a warning frame representing the warning area. 如請求項1所述之應用深度學習影像辨識技術於車門碰撞檢測方法,其中,該影像辨識處理模組執行時則包含下列步驟: As described in Claim 1, the application of deep learning image recognition technology to the vehicle door collision detection method, wherein the image recognition processing module includes the following steps when executed: (a)訓練階段步驟,係建立該深度學習演算模型,於該深度學習演算模型輸入巨量的該物件的特徵樣本、警戒區辨識參數、影像分類參數及影像 定位參數,並由該深度學習演算模型測試影像辨識的正確率,再判斷影像辨識正確率是否足夠,當判斷結果為是,則將辨識結果輸出及儲存;當判斷結果為否,則使該深度學習演算模型自我修正學習;及 (a) The step of the training stage is to establish the deep learning calculation model, and input a huge amount of feature samples of the object, warning area identification parameters, image classification parameters and images into the deep learning calculation model Positioning parameters, and the deep learning calculation model tests the accuracy of image recognition, and then judges whether the accuracy of image recognition is sufficient. If the judgment result is yes, the recognition result is output and stored; when the judgment result is no, the depth learning algorithm model self-modification learning; and (b)運行預測階段步驟,係於該深度學習演算模型依序輸入即時擷取之該動態影像,並由該深度學習演算模型計算出相應的該物件特徵,以預測辨識出該物件的該類別資訊、該位置資訊及該物件是否超過該警戒區所預設的該閾值範圍等之辨識結果資訊。 (b) The step of running the prediction stage is to sequentially input the dynamic image captured in real time into the deep learning calculation model, and calculate the corresponding characteristics of the object by the deep learning calculation model, so as to predict and recognize the category of the object information, the location information, and identification result information such as whether the object exceeds the preset threshold range of the warning zone. 如請求項1所述之應用深度學習影像辨識技術於車門碰撞檢測方法,其中,該深度學習演算模型係為採用YOLO v4作為網路架構進行物件之分類與定位辨識的偵測,該深度學習演算模型的資料集係採用VOC資料集,以VOC2012資料集作為訓練階段步驟使用,並以VOC2007資料集做為預測階段步驟使用。 Applying deep learning image recognition technology to the car door collision detection method as described in claim 1, wherein the deep learning algorithm model is to use YOLO v4 as the network architecture for detection of object classification and positioning recognition, the deep learning algorithm The data set of the model uses the VOC data set, the VOC2012 data set is used as the training stage step, and the VOC2007 data set is used as the prediction stage step. 如請求項3所述之應用深度學習影像辨識技術於車門碰撞檢測方法,其中,該VOC資料集係針對人、汽車、機車及自行車等四項類別而於該深度學習演算模型進行訓練、驗證及測試。 The application of deep learning image recognition technology to the car door collision detection method as described in claim 3, wherein the VOC data set is trained, verified and processed on the deep learning algorithm model for the four categories of people, cars, motorcycles and bicycles. test. 如請求項1所述之應用深度學習影像辨識技術於車門碰撞檢測方法,其中,該影像辨識處理模組執行時更包含物件合併步驟,其包括下列步驟: As described in Claim 1, the application of deep learning image recognition technology to the vehicle door collision detection method, wherein the image recognition processing module further includes an object merging step during execution, which includes the following steps: 步驟一,將預測完成的相關類別以信心度從大至小進行排序; Step 1, sort the predicted related categories in order of confidence from large to small; 步驟二,判斷是否有人的物件,物件則是將其他的類別進行框選,若有人的物件將會針對人的物件與其他預測出來的類別以IoU的方式來判斷 是否有重疊的部分; Step 2. Determine whether there is a human object. The object is to frame other categories. If there is a human object, it will be judged by IoU against the human object and other predicted categories. whether there are overlapping parts; 步驟三,IoU出來的參數大於閾值範圍時,則合併該兩物件成為一個新的物件;及 Step 3, when the parameter from IoU is greater than the threshold range, merge the two objects into a new object; and 步驟四,若經過比對並無任何重複時,則分別框出與所預測的類別。 Step 4, if there is no duplication after comparison, frame the predicted category respectively. 如請求項1所述之應用深度學習影像辨識技術於車門碰撞檢測方法,其中,該影像辨識處理模組建立一座標參數資料庫,該座標參數資料庫設定儲存有一與該動態影像對應的座標參數數資及一用以圍起以定位該警戒區的閾值參數資料,該影像辨識處理模組更包括執行一影像定位步驟,係將保留有該感興趣區域的該動態影像由RGB格式轉換至YIQ平面,再由該YIQ平面取得該物件影像,再使用該YIQ平面的I值,取閥值除去大部份影像,僅保留該物件之物件影像,再經小面積雜訊過濾除去小面積非該物件影像,以取得該物件影像底部或左側底部的中心像素位置,再將該中心像素位置與該閾值參數資料帶入該座標參數資料庫而分別計算出該物件影像之該中心像素位置與該閾值範圍所處座標位置,以作為該物件影像進入該警戒區及超過該閾值範圍的判斷依據。 As described in claim 1, the application of deep learning image recognition technology to the vehicle door collision detection method, wherein the image recognition processing module establishes a coordinate parameter database, and the coordinate parameter database is set to store a coordinate parameter corresponding to the dynamic image Data and a threshold parameter data used to surround and locate the warning area, the image recognition processing module further includes performing an image positioning step, which is to convert the dynamic image with the region of interest from RGB format to YIQ plane, then obtain the image of the object from the YIQ plane, then use the I value of the YIQ plane, take the threshold value to remove most of the image, and only keep the object image of the object, and then filter out the small area of the non-target image by small-area noise filtering Object image to obtain the center pixel position of the bottom or left bottom of the object image, and then bring the center pixel position and the threshold parameter data into the coordinate parameter database to calculate the center pixel position and the threshold of the object image respectively The coordinate position of the range is used as the basis for judging that the object image enters the warning zone and exceeds the threshold range. 如請求項1所述之應用深度學習影像辨識技術於車門碰撞檢測方法,其中,當該影像辨識處理模組影像辨識出二該物件為自行車與大型車或是機車與大型車並行且判定該自行車或該機車不會進入該警戒區與該閾值範圍時,則發出可能會進入該警戒區及超過該閾值範圍的第二警示訊號。 The application of deep learning image recognition technology to the vehicle door collision detection method as described in claim 1, wherein, when the image recognition processing module recognizes that the object is a bicycle and a large vehicle or a locomotive and a large vehicle, it is determined that the bicycle Or when the locomotive will not enter the warning zone and the threshold range, it will send a second warning signal that it may enter the warning zone and exceed the threshold range. 一種應用深度學習影像辨識技術於車門碰撞檢測系統,其包括: A vehicle door collision detection system applying deep learning image recognition technology, comprising: 一影像擷取裝置,其設在一汽車上且以一鏡頭朝向該汽車的後方,用 以於下車時段連續對該後方的人車動態進行影像擷取而成像為複數動態影像; An image capture device, which is installed on a car and faces the rear of the car with a lens, for Continuously capture images of the dynamics of people and vehicles behind the vehicle during the time of getting off the vehicle to form multiple dynamic images; 一資訊處理單元,其包含一影像辨識處理模組,並建立有一深度學習演算模型,該影像辨識處理模組將輸入之每一該動態影像的一預設區域設定為一感興趣區域,並將該感興趣區域以外的影像部分去除,以對保留有該感興趣區域的該動態影像進行影像特徵擷取後輸入至該深度學習演算模型,使該深度學習演算模型對每一輸入之該動態影像進行物件之分類與該物件之定位的影像辨識處理,以預測出該動態影像中移動之該物件的類別資訊與位置資訊,並由該類別資訊與該位置資訊來判斷該物件是否會進入一警戒區之一閾值範圍,判斷結果是,則輸出一警示訊號;及 An information processing unit, which includes an image recognition processing module, and establishes a deep learning algorithm model, and the image recognition processing module sets a preset area of each input dynamic image as an interest area, and The part of the image outside the region of interest is removed, so as to extract the image features of the dynamic image that retains the region of interest, and then input it to the deep learning algorithm model, so that the deep learning algorithm model can be used for each input of the dynamic image. Perform object classification and image recognition processing for the location of the object to predict the category information and location information of the object moving in the dynamic image, and judge whether the object will enter a warning based on the category information and the location information One of the threshold ranges of the zone, if the judgment result is yes, then output a warning signal; and 一資訊輸出單元,其包含一用以顯示該動態影像的顯示幕及一用以將該警示訊號輸出為音頻或語音警示資訊的景示裝置,該顯示幕之該動態影像上顯示包含有框選移動中之該物件以代表該位置資訊的至少一預測框格、位於該至少一預測框格附近的該類別資訊及代表該警戒區的一警戒框格。 An information output unit, which includes a display screen for displaying the dynamic image and a scene display device for outputting the warning signal as audio or voice warning information, the dynamic image displayed on the display screen includes a frame selection The moving object is represented by at least one predicted frame of the location information, the category information near the at least one predicted frame, and a warning frame representing the warning area. 如請求項8所述之應用深度學習影像辨識技術於車門碰撞檢測系統,其中,該資訊輸出單元係選自智慧型手機、智能手錶、智能手環以及智能眼鏡的其中一種,該資訊輸出單元透過藍芽通訊系統與該資訊處理單元資訊連結,並以其顯示幕來顯示該動態影像、該類別資訊及該位置資訊,再以其警示裝置來輸出警示資訊。 Applying deep learning image recognition technology to the car door collision detection system as described in claim 8, wherein the information output unit is selected from one of smart phones, smart watches, smart bracelets and smart glasses, and the information output unit is passed through The bluetooth communication system is connected with the information processing unit, and uses its display screen to display the dynamic image, the category information and the location information, and then uses its warning device to output warning information. 一種應用請求項8之應用深度學習影像辨識技術於車門碰撞檢測系統的行車記錄器,其包括一用以記錄連續動態之該動態影像的記憶模組及 一供該影像擷取裝置、資訊處理單元、該記憶模組及該警示裝置容裝的機殼組,該機殼組設於該汽車內靠近後車窗使該鏡頭可以朝向該後方的位置上。 A driving recorder applying the deep learning image recognition technology to the door collision detection system of application claim 8, which includes a memory module for recording the dynamic image of continuous dynamics and A housing set for the image capture device, the information processing unit, the memory module and the warning device, the housing set is installed in the car near the rear window so that the lens can face the rear .
TW110133829A 2021-09-10 2021-09-10 Application of deep learning image recognition technology to door collision detection method and system TWI781760B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
TW110133829A TWI781760B (en) 2021-09-10 2021-09-10 Application of deep learning image recognition technology to door collision detection method and system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
TW110133829A TWI781760B (en) 2021-09-10 2021-09-10 Application of deep learning image recognition technology to door collision detection method and system

Publications (2)

Publication Number Publication Date
TWI781760B true TWI781760B (en) 2022-10-21
TW202311087A TW202311087A (en) 2023-03-16

Family

ID=85462546

Family Applications (1)

Application Number Title Priority Date Filing Date
TW110133829A TWI781760B (en) 2021-09-10 2021-09-10 Application of deep learning image recognition technology to door collision detection method and system

Country Status (1)

Country Link
TW (1) TWI781760B (en)

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
TWM540067U (en) * 2016-12-13 2017-04-21 Nat Chin-Yi Univ Of Tech Intelligent automobile door opening anti-collision system
CN111267724A (en) * 2020-03-31 2020-06-12 东风柳州汽车有限公司 Method and system for protecting and reminding vehicle door opening
CN112572291A (en) * 2020-12-22 2021-03-30 的卢技术有限公司 Vehicle door opening early warning method
CN113239827A (en) * 2021-05-19 2021-08-10 平安科技(深圳)有限公司 Intelligent anti-collision method, system, equipment and storage medium based on deep learning

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
TWM540067U (en) * 2016-12-13 2017-04-21 Nat Chin-Yi Univ Of Tech Intelligent automobile door opening anti-collision system
CN111267724A (en) * 2020-03-31 2020-06-12 东风柳州汽车有限公司 Method and system for protecting and reminding vehicle door opening
CN112572291A (en) * 2020-12-22 2021-03-30 的卢技术有限公司 Vehicle door opening early warning method
CN113239827A (en) * 2021-05-19 2021-08-10 平安科技(深圳)有限公司 Intelligent anti-collision method, system, equipment and storage medium based on deep learning

Also Published As

Publication number Publication date
TW202311087A (en) 2023-03-16

Similar Documents

Publication Publication Date Title
US11816983B2 (en) Helmet wearing determination method, helmet wearing determination system, helmet wearing determination apparatus, and program
CN106709420B (en) Method for monitoring driving behavior of commercial vehicle driver
CN102765365B (en) Pedestrian detection method based on machine vision and pedestrian anti-collision warning system based on machine vision
CN110866427A (en) Vehicle behavior detection method and device
CN103366506A (en) Device and method for automatically monitoring telephone call behavior of driver when driving
CN112686090A (en) Intelligent monitoring system for abnormal behaviors in bus
CN109740477A (en) Study in Driver Fatigue State Surveillance System and its fatigue detection method
CN110901385B (en) Active speed limiting method based on fatigue state of driver
US11417108B2 (en) Two-wheel vehicle riding person number determination method, two-wheel vehicle riding person number determination system, two-wheel vehicle riding person number determination apparatus, and program
CN115937830A (en) Special vehicle-oriented driver fatigue detection method
CN118025093B (en) Automatic emergency brake system and method for buses based on passenger state identification
CN113561983A (en) Intelligent security detection system for personnel in vehicle and detection method thereof
CN110992709A (en) Active speed limiting system based on fatigue state of driver
TWM626569U (en) car accident severity prediction device
CN103177248B (en) A kind of rapid pedestrian detection method of view-based access control model
TWI781760B (en) Application of deep learning image recognition technology to door collision detection method and system
CN108492527B (en) Fatigue driving monitoring method based on overtaking behavior characteristics
CN101667323A (en) Real-time monitoring and alarming system for distraction of driver
CN116729254A (en) Bus cockpit safe driving behavior monitoring system based on overhead view image
CN103942541A (en) Electric vehicle automatic detection method based on vehicle-mounted vision within blind zone
JP7277511B2 (en) car estimator
CN115691215A (en) Road guardrail collision detection method based on YOLOv5-GRU
TWI408612B (en) Method and system for dynamically and simultaneously determining the relative relation between moving objects
Shelke et al. A Review on Identification of Number Plate and Wrong Way Vehicles Detection
KR101460902B1 (en) Intelligent black box systems and methods for car theft symptoms sensing

Legal Events

Date Code Title Description
GD4A Issue of patent certificate for granted invention patent