TWI781760B - Application of deep learning image recognition technology to door collision detection method and system - Google Patents
Application of deep learning image recognition technology to door collision detection method and system Download PDFInfo
- Publication number
- TWI781760B TWI781760B TW110133829A TW110133829A TWI781760B TW I781760 B TWI781760 B TW I781760B TW 110133829 A TW110133829 A TW 110133829A TW 110133829 A TW110133829 A TW 110133829A TW I781760 B TWI781760 B TW I781760B
- Authority
- TW
- Taiwan
- Prior art keywords
- image
- deep learning
- warning
- image recognition
- information
- Prior art date
Links
Images
Landscapes
- Image Analysis (AREA)
- Image Processing (AREA)
Abstract
一種應用深度學習影像辨識技術於車門碰撞檢測方法及系統,其包括影像擷取裝置、資訊處理單元及資訊輸出單元。影像擷取裝置可連續對汽車後方進行影像擷取而成像為複數動態影像。資訊處理單元之影像辨識處理模組建立深度學習演算模型,將每一動態影像的預設之感興趣區域以外的影像部分去除,以像進行影像特徵擷取後輸入至影像辨識模型,使影像辨識模型對動態影像進行物件分類與物件定位的影像辨識處理,以預測出移動物件的類別資訊與位置資訊,並判斷是否會進入警戒區之閾值範圍,判斷結果是,則輸出警示訊號。資訊輸出單元之顯示幕可顯示包含有框選移動中之物件以代表位置資訊的預測框格、警戒框格,俾能藉由整合深度學習影像辨識與警戒閾值等技術,讓駕駛及乘客獲得車後方的人車動態及發出後方車輛進入警戒區的警告訊號,以大幅降低於門開遭到撞擊的發生機率,進而提升汽車出入的安全防護性。 A method and system for applying deep learning image recognition technology to vehicle door collision detection, which includes an image capture device, an information processing unit, and an information output unit. The image capture device can continuously capture images of the rear of the car and form multiple dynamic images. The image recognition processing module of the information processing unit establishes a deep learning algorithm model, removes the image part outside the preset region of interest of each dynamic image, and uses the image to perform image feature extraction and then input it to the image recognition model to make the image recognition The model performs image recognition processing of object classification and object positioning on dynamic images to predict the category information and location information of moving objects, and judge whether it will enter the threshold range of the warning zone. If the judgment result is yes, it will output a warning signal. The display screen of the information output unit can display prediction frames and warning frames that include frame selection of moving objects to represent position information, so that by integrating deep learning image recognition and warning threshold technologies, drivers and passengers can obtain The movement of people and vehicles in the rear and the warning signal for the vehicles in the rear to enter the warning zone can greatly reduce the probability of being hit when the door is opened, thereby improving the safety protection of vehicles entering and exiting.
Description
本發明係關於一種應用深度學習影像辨識技術於車門碰撞檢測方法及系統,尤指一種可以藉由整合深度學習影像辨識與警戒閾值等技術讓駕駛及乘客獲得車後方的人車動態及發出進入警戒區的警告訊號的汽車門開防撞安全防護技術。 The present invention relates to a method and system for applying deep learning image recognition technology to car door collision detection, especially to a method and system that can enable drivers and passengers to obtain the dynamics of people and vehicles behind the car and issue entry warnings by integrating deep learning image recognition and warning threshold technologies Zone warning signal of car door opening anti-collision safety protection technology.
按,汽車的各個車門大多是與車體形成轉動的樞接,所以每個車門皆可以各自獨立的相對車體做向外的往復開合動作,於此,以供乘客及駕駛者作為出入車輛的用途。然而,時常可以在新聞或者是社群網站上看到一些交通事故,主要是由汽車駕駛未依規定二段式開門或者是未注意後方來車突然開啟車門,因而導致後方機車驚嚇使機車駕駛偏離路線跌倒;或是突然被開啟的車門撞到而飛出路線,無論是發生何種開門事件都會讓機車駕駛受到傷害嚴重則使機車駕駛傷亡,在交通事故中,此事故確實已顯得屢見不鮮,致使造成無謂的人命傷亡以及財產上的損失。也因如此,各國政府當局也已經正視此一問題的嚴重性,並開始做相關二段式開門操作的安全開導與宣傳;惟此類的交通意外事故發生率仍無降低的趨勢,因此,上述類型的交通事故實已成為各國當局政府、相關學界以及業者所急亟欲解決的技術課題與挑戰。 By the way, most of the doors of the car are pivotally connected with the car body, so each door can be independently opened and closed relative to the car body. Here, it is used as an entry and exit vehicle for passengers and drivers. the use of. However, you can often see some traffic accidents on the news or on social networking sites. The main reason is that the driver of the car did not open the door in two stages according to the regulations or did not pay attention to the car coming from behind and suddenly opened the door, which caused the rear locomotive to frighten and make the locomotive driver deviate. The line falls; or is suddenly hit by the open door and flies out of the line. No matter what kind of door opening event occurs, the locomotive driver will be injured seriously, and the locomotive driver will be seriously injured. In traffic accidents, this accident has indeed become common. Cause unnecessary casualties and property losses. Also because of this, government authorities in various countries have also faced up to the seriousness of this problem, and have begun to do relevant two-stage door opening operation safety enlightenment and propaganda; Types of traffic accidents have become a technical issue and challenge that governments, relevant academic circles, and industry players in various countries urgently want to solve.
針對上述缺失,相關業者已研發出一種於下車開門前可以顯 示後方動態影像的習知技術,該習知技術的代表性專利如中國大陸發明第CN1128600A『車輛後方確認裝置』及中華民國新型公告第M376460號『開門防追撞反射鏡』等專利所示。該等專利可於下車開門前關閉車門鎖一段時間,並顯示後方動態影像供乘客或駕駛者觀看,再由乘客或駕駛者自行決定是否開啟車門,當乘客或駕駛者因漫不精心或是視力出現問題而導致無法辨識後方是否有來車時,同樣仍會發生門開遭撞的交通意外事故,因而造成乘客在門開下車時的極大壓力困擾與不便。 In response to the above-mentioned shortcomings, related companies have developed a device that can be displayed before getting off the car and opening the door. The conventional technology of displaying the rear dynamic image, the representative patent of this conventional technology is shown in the patents such as CN1128600A "Vehicle Rear Confirmation Device" of Chinese Mainland Invention and No. M376460 "Door Anti-Collision Mirror" of the Republic of China New Announcement. These patents can close the door lock for a period of time before getting out of the car and opening the door, and display the rear dynamic image for the passenger or driver to watch, and then the passenger or driver decides whether to open the door. When a problem occurs and it is impossible to identify whether there is an oncoming car from the rear, a traffic accident in which the door is opened and bumped will still occur, thus causing great pressure, distress and inconvenience for passengers when the door is opened and get off.
另有一種習知技術係利用感測技術(如超音波感測器、紅外線感測器及雷達感測器)來感測接近車輛後方的人車動態,當有車輛靠近時,則將車門暫時鎖定卡制而無法開啟,當無車輛靠近時,則將車門解除鎖定而可被開啟;該習知技術的代表性專利如中國大陸新型第CN201633624號『車門開啟安全監測裝置』、中華民國新型公告第516534號『檢視後方來車之開門警示裝置』及中華民國新型公告第M318541號『車門警示裝置結構改良』等專利所示;該等專利雖然可以透過上述感測技術來感測車輛後方的行車動態;惟,超音波感測器與紅外線感測器的感測範圍距離不足以捕捉到急速行駛的車輛動態,以致會產生感測死角與盲點的出現問題,因而同樣會發生門開遭到撞擊的交通事故;此外,雷達的感測範圍距離雖然較遠,但是造價過於高價昂貴,以致較無法全面普及採用,因而造成商品化實現的重要阻礙因素。 Another known technology is to use sensing technology (such as ultrasonic sensor, infrared sensor and radar sensor) to sense the dynamics of people and vehicles approaching the rear of the vehicle. When a vehicle approaches, the door is temporarily closed. Locking and jamming cannot be opened, and when no vehicle is approaching, the door is unlocked and can be opened; representative patents of this known technology such as the Chinese Mainland New Model No. CN201633624 "Car Door Opening Safety Monitoring Device", the Republic of China New Announcement Patents such as No. 516534 "Door Opening Warning Device for Checking Cars Coming from Behind" and ROC New Announcement No. M318541 "Structure Improvement of Vehicle Door Warning Device" are shown; although these patents can use the above-mentioned sensing technology to sense the driving behind the vehicle dynamics; however, the distance between the sensing range of the ultrasonic sensor and the infrared sensor is not enough to capture the dynamics of a fast-moving vehicle, so that there will be problems with sensing dead angles and blind spots, so door openings will also be hit traffic accidents; in addition, although the sensing range of the radar is relatively long, the cost is too high, so that it cannot be fully popularized and adopted, thus causing an important obstacle to the realization of commercialization.
又有一種如中華民國發明公告第I531500號『利用影像處理的汽車車門開啟警示方法』,該專利之影像處理單元可以根據預設之透視轉換公式,將所獲得之目標物件在第n幅影像中之中心之像素位置轉換為該 目標物件在該路面上之位置的二維座標;並根據該目標物件的二維座標及該汽車之車門在該路面上之位置的二維座標來判定該目標物件與該汽車之車門之距離是否小於一預設距離;判定結果為是時,藉由該影像處理單元,將警示輸出至輸出單元。該專利雖然具備影像辨識目標物件與車門距離而發出警示的功能;惟,該專利並無整合深度學習影像辨識與警戒閾值範圍等技術的建置,所以僅能以距離值來作為警示發佈的依據,況且該專利非以目標物件是否進入警戒閾值範圍來作為警示發佈的依據,理論上位處較側邊不會進入警戒區的目標物件只要達到預設距離值仍會發佈警示,以致會因偵測誤差過大而造成誤報的情事產生,因而造成使用上的不便與困擾的情事產生。 There is another such as the Republic of China Invention Announcement No. I531500 "Automobile Door Opening Warning Method Using Image Processing", the image processing unit of this patent can convert the obtained target object in the nth image according to the preset perspective conversion formula The pixel position of the center is converted to the The two-dimensional coordinates of the position of the target object on the road surface; and determine whether the distance between the target object and the car door of the car is less than a preset distance; when the determination result is yes, the image processing unit outputs a warning to the output unit. Although this patent has the function of image recognition of the distance between the target object and the car door to issue a warning; however, this patent does not integrate deep learning image recognition and the establishment of warning threshold range technology, so the distance value can only be used as the basis for warning release Moreover, the patent does not use whether the target object enters the warning threshold range as the basis for issuing the warning. The error is too large to cause false positives, thus causing inconvenience and troubles in use.
有鑑於此,本發明人認為上述習知技術與該等專利前案確實未臻完善,仍然有再改善的必要性,因此,本發明人等乃再深入研究,進而研發出如本發明所揭露的技術成果。 In view of this, the inventor believes that the above-mentioned prior art and the prior patents are indeed not perfect, and there is still a need for further improvement. Therefore, the inventors conducted further research and developed a patent as disclosed in the present invention. technical achievements.
本發明第一目的,在於提供一種應用深度學習影像辨識技術於車門碰撞檢測方法及系統,主要藉由整合深度學習影像辨識與警戒閾值等技術,讓駕駛及乘客獲得車後方的人車動態及發出後方車輛進入警戒區的警告訊號,以大幅降低於門開遭到撞擊的發生機率,進而提升汽車出入的安全防護性。達成本發明第一目的之技術手段,係包括影像擷取裝置、資訊處理單元及資訊輸出單元。影像擷取裝置可連續對汽車後方進行影像擷取而成像為複數動態影像。資訊處理單元之影像辨識處理模組建立深度學習演算模型,將每一動態影像的預設之感興趣區域以外的影像部分去 除,以像進行影像特徵擷取後輸入至影像辨識模型,使影像辨識模型對動態影像進行物件分類與物件定位的影像辨識處理,以預測出移動物件的類別資訊與位置資訊,並判斷是否會進入警戒區之閾值範圍,判斷結果是,則輸出警示訊號。資訊輸出單元之顯示幕可顯示包含有框選移動中之物件以代表位置資訊的預測框格、警戒框格。 The first object of the present invention is to provide a method and system for applying deep learning image recognition technology to vehicle door collision detection, mainly by integrating deep learning image recognition and warning threshold technology, so that drivers and passengers can obtain the dynamics and signals of people and vehicles behind the car. The warning signal of the rear vehicle entering the warning area can greatly reduce the probability of being hit when the door is opened, thereby improving the safety protection of car entry and exit. The technical means to achieve the first objective of the present invention includes an image capture device, an information processing unit and an information output unit. The image capture device can continuously capture images of the rear of the car and form multiple dynamic images. The image recognition processing module of the information processing unit establishes a deep learning algorithm model, and removes the image parts other than the preset interest area of each dynamic image In addition, image features are extracted by images and then input to the image recognition model, so that the image recognition model performs image recognition processing for object classification and object positioning on dynamic images, so as to predict the category information and location information of moving objects, and judge whether they will Enter the threshold range of the warning zone, if the judgment result is yes, then output a warning signal. The display screen of the information output unit can display a prediction frame and a warning frame that include frame selection of moving objects to represent position information.
本發明第二目的,在於提供一種具備物件合併功能使預測框格可以快速框出更準確範圍的應用深度學習影像辨識技術於車門碰撞檢測方法及系統。達成本發明第二目的之技術手段,係包括影像擷取裝置、資訊處理單元及資訊輸出單元。影像擷取裝置可連續對汽車後方進行影像擷取而成像為複數動態影像。資訊處理單元之影像辨識處理模組建立深度學習演算模型,將每一動態影像的預設之感興趣區域以外的影像部分去除,以像進行影像特徵擷取後輸入至影像辨識模型,使影像辨識模型對動態影像進行物件分類與物件定位的影像辨識處理,以預測出移動物件的類別資訊與位置資訊,並判斷是否會進入警戒區之閾值範圍,判斷結果是,則輸出警示訊號。資訊輸出單元之顯示幕可顯示包含有框選移動中之物件以代表位置資訊的預測框格、警戒框格。其中,該影像辨識處理模組執行時更包含物件合併步驟,其包括下列步驟:步驟一,將預測完成的相關類別以信心度從大至小進行排序;步驟二,判斷是否有人的物件,物件則是將其他的類別進行框選,若有人的物件將會針對人的物件與其他預測出來的類別以IoU的方式來判斷是否有重疊的部分;步驟三,IoU出來的參數大於閾值範圍時,則合併該兩物件成為一個新的物件;及步驟四,若經過比對並
無任何重複時,則分別框出與所預測的類別。
The second object of the present invention is to provide a method and system for applying deep learning image recognition technology to vehicle door collision detection with the function of merging objects so that the prediction frame can quickly frame a more accurate range. The technical means to achieve the second objective of the present invention includes an image capture device, an information processing unit and an information output unit. The image capture device can continuously capture images of the rear of the car and form multiple dynamic images. The image recognition processing module of the information processing unit establishes a deep learning algorithm model, removes the image part outside the preset region of interest of each dynamic image, and uses the image to perform image feature extraction and then input it to the image recognition model to make the image recognition The model performs image recognition processing of object classification and object positioning on dynamic images to predict the category information and location information of moving objects, and judge whether it will enter the threshold range of the warning zone. If the judgment result is yes, it will output a warning signal. The display screen of the information output unit can display a prediction frame and a warning frame that include frame selection of moving objects to represent position information. Wherein, the image recognition processing module further includes an object merging step during execution, which includes the following steps:
本發明第三目的,在於提供一種可以針對小車因閃避大車而突然往車門方向靠近所致的危險而發出危險預警的應用深度學習影像辨識技術於車門碰撞檢測方法及系統。達成本發明第三目的之技術手段,係包括影像擷取裝置、資訊處理單元及資訊輸出單元。影像擷取裝置可連續對汽車後方進行影像擷取而成像為複數動態影像。資訊處理單元之影像辨識處理模組建立深度學習演算模型,將每一動態影像的預設之感興趣區域以外的影像部分去除,以像進行影像特徵擷取後輸入至影像辨識模型,使影像辨識模型對動態影像進行物件分類與物件定位的影像辨識處理,以預測出移動物件的類別資訊與位置資訊,並判斷是否會進入警戒區之閾值範圍,判斷結果是,則輸出警示訊號。資訊輸出單元之顯示幕可顯示包含有框選移動中之物件以代表位置資訊的預測框格、警戒框格。其中,當該影像辨識處理模組影像辨識出二該物件為自行車與大型車或是機車與大型車並行且判定該自行車或該機車判斷該物件不會進入該警戒區及超過該閾值範圍時,則發出可能會進入該警戒區及超過該閾值範圍的第二警示訊號。 The third object of the present invention is to provide a door collision detection method and system that can issue an early warning against the danger caused by a small car dodging a large car and suddenly approaching the door. The technical means to achieve the third objective of the present invention includes an image capture device, an information processing unit and an information output unit. The image capture device can continuously capture images of the rear of the car and form multiple dynamic images. The image recognition processing module of the information processing unit establishes a deep learning algorithm model, removes the image part outside the preset region of interest of each dynamic image, and uses the image to perform image feature extraction and then input it to the image recognition model to make the image recognition The model performs image recognition processing of object classification and object positioning on dynamic images to predict the category information and location information of moving objects, and judge whether it will enter the threshold range of the warning zone. If the judgment result is yes, it will output a warning signal. The display screen of the information output unit can display a prediction frame and a warning frame that include frame selection of moving objects to represent position information. Wherein, when the image recognition processing module recognizes that the object is a bicycle and a large vehicle or a motorcycle and a large vehicle, and it is determined that the bicycle or the motorcycle judges that the object will not enter the warning zone and exceed the threshold range, Then send out the second warning signal that may enter the warning zone and exceed the threshold range.
1:汽車 1: car
10:影像擷取裝置 10: Image capture device
20:資訊處理單元 20: Information processing unit
21:影像辨識處理模組 21: Image recognition processing module
210:深度學習演算模型 210:Deep Learning Calculus Model
211:座標參數資料庫 211: Coordinate parameter database
30:資訊輸出單元 30: Information output unit
31:顯示幕 31: display screen
32:警示裝置 32:Warning device
40:供電單元 40: Power supply unit
50:藍芽通訊系統 50:Bluetooth communication system
60:記憶模組 60: Memory module
A1:感興趣區域 A1: Region of interest
A2:警戒區 A2: warning area
A3:閾值範圍 A3: Threshold range
A4:預測框格 A4: Forecast grid
A5:警戒框格 A5: warning sash
B1,B2:中心像素位置 B1, B2: center pixel position
圖1為本發明具體實施架構的功能方塊示意圖。 FIG. 1 is a schematic functional block diagram of a specific implementation architecture of the present invention.
圖2係本發明深度學習演算模組於預測階段的流程實施示意圖。 FIG. 2 is a schematic diagram of the implementation process of the deep learning calculation module in the prediction stage of the present invention.
圖3係本發明深度學習演算模組於訓練階段的流程實施示意圖。 FIG. 3 is a schematic diagram of the implementation process of the deep learning algorithm module in the training phase of the present invention.
圖4為本發明顯示幕顯示動態影像之分類與定位資訊的實施示意圖。 FIG. 4 is a schematic diagram of the implementation of displaying dynamic image classification and positioning information on the display screen of the present invention.
圖5為本發明顯示幕顯示動態影像中之二物件合併前的實施示意圖。 FIG. 5 is a schematic diagram showing the implementation of the display screen before combining two objects in the dynamic image according to the present invention.
圖6為本發明顯示幕顯示動態影像中之二物件合併後的實施示意圖。 FIG. 6 is a schematic diagram of the implementation of the combination of two objects in a dynamic image displayed on the display screen of the present invention.
圖7為本發明顯示幕顯示動態影像之分類與定位資訊的又一實施示意圖。 FIG. 7 is a schematic diagram of another implementation of displaying dynamic image classification and positioning information on the display screen of the present invention.
圖8為本發明影像處理取得物件影像之中心像素位置的實施示意圖。 FIG. 8 is a schematic diagram of the implementation of image processing to obtain the center pixel position of an object image according to the present invention.
圖9為圖7、8物件影像之中心像素位置的座標軌跡分析示意圖。 FIG. 9 is a schematic diagram of the coordinate track analysis of the central pixel position of the image of the object in FIGS. 7 and 8 .
圖10為本發明合併物件的流程控制實施示意圖。 Fig. 10 is a schematic diagram of the process control implementation of the merged object of the present invention.
圖11為本發明深度學習演算模型訓練完成後的LOSS示意圖。 Fig. 11 is a schematic diagram of LOSS after the training of the deep learning calculation model of the present invention is completed.
圖12為本發明深度學習演算模型訓練完成後的權重測試實施示意圖。 FIG. 12 is a schematic diagram of the implementation of the weight test after the training of the deep learning calculation model of the present invention is completed.
為讓 貴審查委員能進一步瞭解本發明整體的技術特徵與達成本發明目的之技術手段,玆以具體實施例並配合圖式加以詳細說明:請參看圖1~4所示,為達成本發明第一目的之第一實施例,係包括一影像擷取裝置10、一資訊處理單元20(如電腦或微處理器MCU;但不以此為限)、一資訊輸出單元30一用以供應所需電源的供電單元40(如汽車電瓶與電源處理電路的組合;但不以此為限)。該影像擷取裝置10設在汽車1上(如左後照鏡;或行李箱等位置)且以鏡頭朝向汽車1後方,而可於下車時段自動或半自動連續對汽車1後方或左側後方的人車動態進行影像擷取而成像為複數連續的動態影像。該資訊處理單元20包含一影像辨識處理模組21,並建立有一深度學習演算模型210,該影像辨識處理模組21將輸入之每一動態影像的一預設區域設定為一感興趣區域A1,並將感興趣區域A1以外的影像部分去除,以對保留有感興趣區域A1的動態影像進行影像特徵擷取後輸入至深度學習演算模型210,使深度學習演算模型210對每一輸入之動態影像進行物件之分類與物件之定位的影像辨識處理,以預測出動態影像中移動之物件的類別資訊與位置資訊,並由類別資訊與位置資訊來判斷物件是否會進入預設警戒區A2的閾值範圍A3,當判斷結果是,則輸出警示訊號。該資訊輸出單元30包含用以顯示動態影像的顯示幕31
及用以將警示訊號輸出為音頻或語音的警示資訊(如危險不可開門,車後方即將有車輛通過)的警示裝置32,該顯示幕31之動態影像上顯示包含有框選移動中之物件以代表位置資訊的預測框格A4、位於預測框格A4附近的類別資訊(如汽車、機車、自行車或人)及代表警戒區A2的警戒框格A5,如果預測框格A4內的物件會通過閾值範圍A3,預測框格A4則會以紅色框格及閃爍來示警,於是即可讓駕駛或乘客知悉,此時處於危險狀態,不可開啟車門下車;如果預測框格A4內的物件不會通過閾值範圍A3,預測框格A4則會以綠色框格來表示,於是即可讓駕駛或乘客知悉,此時處理安全狀態,可以放心地開啟車門下車。
In order to allow your examiner to further understand the overall technical characteristics of the present invention and the technical means to achieve the purpose of the present invention, the specific embodiments are described in detail in conjunction with the drawings: please refer to Figures 1 to 4, in order to achieve the first part of the present invention The first embodiment of an object includes an
請參看圖2~3所示的實施例,該影像辨識處理模組21執行時則包含下列步驟:
Please refer to the embodiment shown in Figures 2 to 3, the image
(a)訓練階段步驟,係建立深度學習演算模型210,於深度學習演算模型210輸入巨量物件的特徵樣本、警戒區辨識參數、影像分類參數及影像定位參數,並由深度學習演算模型210測試影像辨識的正確率,再判斷影像辨識正確率是否足夠,當判斷結果為是,則將辨識結果輸出及儲存,當判斷結果為否,則使深度學習演算模型210自我修正學習。
(a) The step of the training stage is to establish a deep
(b)運行預測階段步驟,係於深度學習演算模型210依序輸入即時擷取之動態影像,並由深度學習演算模型210計算出相應的物件特徵,以預測辨識出物件的類別資訊、位置資訊及物件是否超過警戒區A2所預設的閾值範圍A3等之辨識結果資訊。
(b) The step of running the prediction stage is to input the real-time captured dynamic images into the deep
本發明的一種具體實施例中,該深度學習演算模型210係為採用YOLO v4作為網路架構進行物件之分類與定位辨識的偵測,該深度學習演算模型210的資料集係採用VOC資料集,以VOC2012資料集作為訓練
階段步驟使用,並以VOC2007資料集做為預測階段步驟使用。
In a specific embodiment of the present invention, the deep
請參看圖1~6及圖10所示,為達成本發明第二目的之第二實施例,本實施例除了包括上述第一實施例的整體技術內容之外,係於影像辨識處理模組21執行時更包含物件合併步驟,其包括下列步驟:
Please refer to Figures 1 to 6 and Figure 10, in order to achieve the second embodiment of the second purpose of the present invention, this embodiment is based on the image
步驟一,將預測完成的相關類別以信心度從大至小進行排序。 Step 1: sort the predicted related categories in descending order of confidence.
步驟二,判斷是否有人的物件,物件則是將其他的類別進行框選,若有人的物件將會針對人的物件與其他預測出來的類別以IoU的方式來判斷是否有重疊的部分。
步驟三,IoU出來的參數大於閾值範圍A3時,則合併兩物件成為一個新的物件,如圖5、6所示,將機車與人合併為騎士。
步驟四,若經過比對並無任何重複時,則分別框出與所預測的類別。
請參看圖1、7及圖8所示,該影像辨識處理模組21建立一座標參數資料庫211,該座標參數資料庫211設定儲存有一與動態影像對應的座標參數數資及一用以圍起以定位警戒區A2的閾值參數資料,該影像辨識處理模組21更包括執行一影像定位步驟,係將保留有感興趣區域A1的動態影像由RGB格式轉換至YIQ平面,再由YIQ平面取得物件影像,再使用YIQ平面的I值,取閥值除去大部份影像,僅保留該物件之物件影像,再經小面積雜訊過濾除去小面積非物件影像,以取得物件影像底部或左側底部的中心像素位置,再將中心像素位置與閾值參數資料帶入座標參數資
料庫211而分別計算出物件影像底部(如圖7機車輪胎底部);或左側底部(如圖7巴士左側輪胎底部)的中心像素位置B1,B2與閾值範圍A3所處的實際座標位置,以作為物件影像進入警戒區A2及達到閾值範圍A3的判斷依據。
1, 7 and 8, the image
具體的,請再參看圖8所示,中心像素位置B1會通過閾值範圍A3,所以會立即發佈警示,至於中心像素位置B2雖然會通過警戒區A2,但不會通過閾值範圍A3,所以不會發佈警示。 Specifically, please refer to Figure 8 again. The central pixel position B1 will pass through the threshold range A3, so a warning will be issued immediately. As for the central pixel position B2, although it will pass through the warning area A2, it will not pass through the threshold range A3, so it will not Issue a warning.
於本發明的一種具體實施例中,該資訊輸出單元30可以是智慧型手機、智能手錶、智能手環以及智能眼鏡的其中一種,該資訊輸出單元30透過藍芽通訊系統50與資訊處理單元20資訊連結,並以其顯示幕31來顯示動態影像、類別資訊及位置資訊,再以其警示裝置32來輸出警示資訊。
In a specific embodiment of the present invention, the
請參看圖1~4及圖7所示,達成本發明第三目的之第三實施例,本實施例除了包括上述第一實施例的整體技術內容之外,當影像辨識處理模組21影像辨識出二物件為自行車與大型車;或是機車與大型車並行且判定自行車或機車不會進入警戒區A2與閾值範圍A3時,則發出可能會進入警戒區A2與閾值範圍A3的第二警示訊號,讓駕駛會乘客知悉此一並行狀況,於是即可避免因閃避大車而突然往汽車之車門方向靠近所致的危險情事發生。
Please refer to Figures 1 to 4 and Figure 7, the third embodiment to achieve the third purpose of the present invention, this embodiment includes the overall technical content of the first embodiment, when the image
於本發明的應用實施例中係為一種行車記錄器,如圖1所示,係包括一用以記錄連續動態之動態影像的記憶模組60及一供影像擷取裝置10、資訊處理單元20、記憶模組60及警示裝置32容裝的機殼組(本圖
式例未示),該機殼組設於汽車1內靠近後車窗使鏡頭可以朝向汽車後方的位置上。
In the application embodiment of the present invention, it is a driving recorder, as shown in Figure 1, it includes a
必須陳明的是,上述所指的下車時段係可透過擷取汽車1原本之每一車門開關的開門訊號及關門訊號來觸發資訊處理單元20而加以實現;舉例來說,當其中一個車門被開啟至第一角度時,此車門的車門開關則會產生開門訊號,當資訊處理單元20受到此開門訊號的觸發時,則認定為下車時段的起始時間,並啟動影像擷取裝置10而進行影像擷取;當資訊處理單元20受到關門訊號的觸發時,則認定為下車時段的結束時間,並關閉影像擷取裝置10而結束影像擷取的動作。
It must be stated that the time period for getting off the car mentioned above can be realized by capturing the original door opening signal and door closing signal of each door switch of the
上述感興趣區域A1(Region of Interest,RoI)是針對動態影像設計一塊感興趣區域A1如圖4所示,從圖像中框出一塊區域,所框選出來的區域將會是所分析的一關重點之一,對框選出的區域進行下一步處理,故而原本要處理一張較大的圖片將會變成一個圖片的小區域,可以大幅度的減少處理時間也增加了精確度。 The above-mentioned region of interest A1 (Region of Interest, RoI) is to design a region of interest A1 for dynamic images, as shown in Figure 4. A region is framed from the image, and the region selected by the frame will be a part of the analysis. One of the key points is to carry out the next step of processing on the area selected by the frame, so a larger image to be processed will become a small area of the image, which can greatly reduce the processing time and increase the accuracy.
圖11所示為YOLOv4從VOC2012經過訓練完成後的相關LOSS值表現,從該圖可以得知在訓練的過程中可以非常快速的下降LOSS並且收斂完成,相關的val loss也與train loss較為相近。圖12所示為將即時訓練完成的權重進行測試,測試資料為VOC2007資料集,可以看到該圖中person達到85%的AP而car和motorbike有將近80%的AP,但在bicycle上準確度逼近60%AP,而四種類別在IoU 0.5的情況下mAP有達到74.76%的 表現,因此在沒經過訓練的圖片中,YOLOv4的表現還是非常的突出,也透過測試資料我們也可以得知該權重並未產生過擬合(Over-fitting)的問題。 Figure 11 shows the relevant LOSS value performance of YOLOv4 after training from VOC2012. From this figure, it can be seen that the LOSS can be reduced very quickly and the convergence is completed during the training process. The related val loss is also similar to the train loss. Figure 12 shows the test of the weights completed in real-time training. The test data is the VOC2007 data set. It can be seen that the person in the figure reaches 85% AP, while the car and motorbike have nearly 80% AP, but the accuracy on the bicycle Approaching 60% AP, and the four categories have a mAP of 74.76% in the case of IoU 0.5 performance, so in untrained pictures, the performance of YOLOv4 is still very prominent, and through the test data, we can also know that the weight has not caused the problem of over-fitting (Over-fitting).
YOLOv4網路架構可分為輸入(Input)、骨架(Backbone)、頸部(Neck)、頭部(Head),而頭部可分為Dense Prediction(一階段)與Sparse Prediction(二階段),YOLOv4在骨架採用CSPDarknet53、頸部採用SPP與PAN、頭部採用YOLOv3。YOLOv4於FPS表現和AP表現上有著突出,以YOLOv4與ASFF進行比較,可以從YOLOv4位於趨近FPS70的位置AP精準度還有在43.5%而ASFF的FPS趨近於30的位置AP表現趨近於40%,相較於YOLOv4的表現相差甚遠,而YOLOv4與EfficienDet網路比較雖然EfficienDet的AP將近高達50%但在FPS的表現上只有12左右,距離即時運算還是有一段距離,若與EfficienDet在即時運算的表現AP低於YOLOv4的43.5%,因此,本發明使用YOLOv4的網路架構,AP為預測結果與基準真相所大於IoU閾值0.50:0.05:0.95之平均值同為所有10個IoU閾值所有80個類別的平均值。 YOLOv4 network architecture can be divided into input (Input), skeleton (Backbone), neck (Neck), head (Head), and the head can be divided into Dense Prediction (one stage) and Sparse Prediction (two stages), YOLOv4 The skeleton uses CSPDarknet53, the neck uses SPP and PAN, and the head uses YOLOv3. YOLOv4 is outstanding in FPS performance and AP performance. Comparing YOLOv4 with ASFF, it can be seen that YOLOv4's AP accuracy is close to FPS70, and its AP performance is close to 43.5%, while ASFF's FPS is close to 30. 40%, which is far from the performance of YOLOv4. Compared with the EfficienDet network, although the AP of YOLOv4 and EfficienDet is nearly as high as 50%, the FPS performance is only about 12. There is still a distance from real-time computing. The performance AP of the calculation is lower than 43.5% of YOLOv4. Therefore, the present invention uses the network architecture of YOLOv4. The AP is the average value of the predicted result and the benchmark truth greater than the IoU threshold 0.50:0.05:0.95. All 10 IoU thresholds are all 80 average of the categories.
再者,YOLOv4網路輸出時,會分別預測出人、機車、腳踏車、汽車物件,預測結果顯示如圖5所示,若預測出來的過程中,人與機車(腳踏車)有進行重疊並超過所設定之重疊度,後面將會進行合併為新的物件為騎士(rider),合併後顯示如圖6所示。 Furthermore, when the YOLOv4 network outputs, it will predict people, locomotives, bicycles, and car objects respectively. The prediction results are shown in Figure 5. The set overlapping degree will be merged later to form a new object called rider (rider). After the merge, it will be displayed as shown in Figure 6.
如圖4所示,未進入警戒區A2的閾值範圍A3時圖像中央上方會顯示無警告(No Alarm),當騎士進入警戒區A2時將會顯示警告(Alarm),若騎士未超過進入警戒區A2之閾值將不會顯示警告。 As shown in Figure 4, when the rider does not enter the threshold range A3 of the warning zone A2, No Alarm will be displayed above the center of the image. When the knight enters the warning zone A2, a warning will be displayed. Thresholds in zone A2 will not display warnings.
本發明採用深度學習YOLOv4進行檢測後方來車系統,當汽車1駕駛忘記二段式開啟車門或未注意後方來車時,多一項輔助工具警告汽車1駕駛後方目前是否有來車或者是在警告提示聲警告駕駛,YOLOv4的架構可以迅速的辨識物件且可以進行準確的框選位置與物件類別,在輸入大小為608x608的尺寸中擁有將近15(Fps;每秒幀數)的辨識速度使在車門碰撞檢測中可達到即使檢測的判斷能力。
The present invention adopts the deep learning YOLOv4 to detect the coming car system. When the
經上述詳細具體說明後,本發明確實具有下列所述特點: After the above detailed description, the present invention has the following features:
1.本發明確實可以藉由整合深度學習影像辨識與警戒閾值範圍等技術,讓駕駛及乘客獲得車後方的人車動態及發出進入警戒區的警告訊號,以大幅降低於門開遭到撞擊的發生機率,進而提升汽車出入的安全防護性。 1. The present invention can indeed, by integrating technologies such as deep learning image recognition and warning threshold range, allow drivers and passengers to obtain the dynamics of people and vehicles behind the car and issue a warning signal of entering the warning area, so as to greatly reduce the impact on the door opening The probability of occurrence, thereby improving the safety protection of car access.
2.本發明確實是一種具備物件合併功能使預測框格可以更為快速框出準確範圍的應用深度學習影像辨識技術。 2. The present invention is indeed a deep learning image recognition technology that has the function of merging objects so that the prediction frame can frame an accurate range more quickly.
3.本發明確實是一種可以針對小車因閃避大車而突然往車門方向靠近所致的危險而發出警示的應用深度學習影像辨識技術。 3. The present invention is indeed an application of deep learning image recognition technology that can issue a warning against the danger caused by the small car dodging the large car and suddenly approaching the door.
以上所述,僅為本發明之一可行實施例,並非用以限定本發明之專利範圍,凡舉依據下列申請專利範圍所述之內容、特徵以及其精神而為之其他變化的等效實施,皆應包含於本發明之專範圍內。本發明所其體界定於申請專利範圍之結構特徵,未見於同類物品,且具實用性與進步性,已符合發明專利要件,爰依法具文提出申請,謹請 鈞局依法核予專利,以維護本申請人合法之權益。 The above is only a feasible embodiment of the present invention, and is not intended to limit the patent scope of the present invention. All equivalent implementations of other changes based on the content, features and spirit of the following patent scopes are cited. All should be included within the scope of the present invention. The structural features of this invention, which are defined in the scope of the patent application, are not found in similar products, and are practical and progressive, and have met the requirements of an invention patent. Please file an application in accordance with the law. I would like to ask the Jun Bureau to approve the patent in accordance with the law. Safeguard the legitimate rights and interests of the applicant.
10:影像擷取裝置 10: Image capture device
20:資訊處理單元 20: Information processing unit
21:影像辨識處理模組 21: Image recognition processing module
211:座標參數資料庫 211: Coordinate parameter database
30:資訊輸出單元 30: Information output unit
31:顯示幕 31: display screen
32:警示裝置 32:Warning device
40:供電單元 40: Power supply unit
50:藍芽通訊系統 50:Bluetooth communication system
60:記憶模組 60: Memory module
Claims (10)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
TW110133829A TWI781760B (en) | 2021-09-10 | 2021-09-10 | Application of deep learning image recognition technology to door collision detection method and system |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
TW110133829A TWI781760B (en) | 2021-09-10 | 2021-09-10 | Application of deep learning image recognition technology to door collision detection method and system |
Publications (2)
Publication Number | Publication Date |
---|---|
TWI781760B true TWI781760B (en) | 2022-10-21 |
TW202311087A TW202311087A (en) | 2023-03-16 |
Family
ID=85462546
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
TW110133829A TWI781760B (en) | 2021-09-10 | 2021-09-10 | Application of deep learning image recognition technology to door collision detection method and system |
Country Status (1)
Country | Link |
---|---|
TW (1) | TWI781760B (en) |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
TWM540067U (en) * | 2016-12-13 | 2017-04-21 | Nat Chin-Yi Univ Of Tech | Intelligent automobile door opening anti-collision system |
CN111267724A (en) * | 2020-03-31 | 2020-06-12 | 东风柳州汽车有限公司 | Method and system for protecting and reminding vehicle door opening |
CN112572291A (en) * | 2020-12-22 | 2021-03-30 | 的卢技术有限公司 | Vehicle door opening early warning method |
CN113239827A (en) * | 2021-05-19 | 2021-08-10 | 平安科技(深圳)有限公司 | Intelligent anti-collision method, system, equipment and storage medium based on deep learning |
-
2021
- 2021-09-10 TW TW110133829A patent/TWI781760B/en active
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
TWM540067U (en) * | 2016-12-13 | 2017-04-21 | Nat Chin-Yi Univ Of Tech | Intelligent automobile door opening anti-collision system |
CN111267724A (en) * | 2020-03-31 | 2020-06-12 | 东风柳州汽车有限公司 | Method and system for protecting and reminding vehicle door opening |
CN112572291A (en) * | 2020-12-22 | 2021-03-30 | 的卢技术有限公司 | Vehicle door opening early warning method |
CN113239827A (en) * | 2021-05-19 | 2021-08-10 | 平安科技(深圳)有限公司 | Intelligent anti-collision method, system, equipment and storage medium based on deep learning |
Also Published As
Publication number | Publication date |
---|---|
TW202311087A (en) | 2023-03-16 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US11816983B2 (en) | Helmet wearing determination method, helmet wearing determination system, helmet wearing determination apparatus, and program | |
CN106709420B (en) | Method for monitoring driving behavior of commercial vehicle driver | |
CN102765365B (en) | Pedestrian detection method based on machine vision and pedestrian anti-collision warning system based on machine vision | |
CN110866427A (en) | Vehicle behavior detection method and device | |
CN103366506A (en) | Device and method for automatically monitoring telephone call behavior of driver when driving | |
CN112686090A (en) | Intelligent monitoring system for abnormal behaviors in bus | |
CN109740477A (en) | Study in Driver Fatigue State Surveillance System and its fatigue detection method | |
CN110901385B (en) | Active speed limiting method based on fatigue state of driver | |
US11417108B2 (en) | Two-wheel vehicle riding person number determination method, two-wheel vehicle riding person number determination system, two-wheel vehicle riding person number determination apparatus, and program | |
CN115937830A (en) | Special vehicle-oriented driver fatigue detection method | |
CN118025093B (en) | Automatic emergency brake system and method for buses based on passenger state identification | |
CN113561983A (en) | Intelligent security detection system for personnel in vehicle and detection method thereof | |
CN110992709A (en) | Active speed limiting system based on fatigue state of driver | |
TWM626569U (en) | car accident severity prediction device | |
CN103177248B (en) | A kind of rapid pedestrian detection method of view-based access control model | |
TWI781760B (en) | Application of deep learning image recognition technology to door collision detection method and system | |
CN108492527B (en) | Fatigue driving monitoring method based on overtaking behavior characteristics | |
CN101667323A (en) | Real-time monitoring and alarming system for distraction of driver | |
CN116729254A (en) | Bus cockpit safe driving behavior monitoring system based on overhead view image | |
CN103942541A (en) | Electric vehicle automatic detection method based on vehicle-mounted vision within blind zone | |
JP7277511B2 (en) | car estimator | |
CN115691215A (en) | Road guardrail collision detection method based on YOLOv5-GRU | |
TWI408612B (en) | Method and system for dynamically and simultaneously determining the relative relation between moving objects | |
Shelke et al. | A Review on Identification of Number Plate and Wrong Way Vehicles Detection | |
KR101460902B1 (en) | Intelligent black box systems and methods for car theft symptoms sensing |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
GD4A | Issue of patent certificate for granted invention patent |