TW201820260A

TW201820260A - All-weather thermal image-type pedestrian detecting method to express the LBP encoding in the same window by HOG as the feature representation, and use SVM and Adaboost to proceed classifier training

Info

Publication number: TW201820260A
Application number: TW105138846A
Authority: TW
Inventors: 黃世勳; 簡士哲; 張峰嘉; 蕭簡浩; 蕭有崧
Original assignee: 國家中山科學研究院
Priority date: 2016-11-25
Filing date: 2016-11-25
Publication date: 2018-06-01
Also published as: TWI628623B

Abstract

An all-weather thermal image-type pedestrian detecting method is disclosed, which comprises the following steps: (a) capturing a pedestrian's and a non-pedestrian's daytime thermal image and nighttime thermal image in the same window to establish a sample database of thermal image, wherein the sample database comprises plural pedestrians' and plural non-pedestrians' samples; (b) for the pedestrians' and non-pedestrians' samples in the sample database, proceeding with LBP encoding in the same window, wherein the LBP encodings having complementarity in the same window are treated as the same LBP encoding; (c) expressing the LBP encoding of the same window by the Histogram of Oriented Gradient (HOG) as the feature representation, so as to obtain the feature training samples of the pedestrian sample and the non-pedestrian sample; (d) using SVM (Support Vector Machine) and Adaboost (Adaptive Boosting) to proceed with classifier training; and (e) based on the sliding window method, searching and detecting the thermal image, and determining if there is a pedestrian.

Description

All-weather thermal image type pedestrian detection method

本發明係關於一種熱影像之行人偵測技術，更特別的是關於一種適用於全天候且基於區塊LBP編碼技術之全天候熱影像式行人偵測方法。The invention relates to a thermal image pedestrian detection technology, and more particularly to an all-weather thermal image-type pedestrian detection method based on block LBP coding technology.

先前基於熱影像之行人偵測技術主要基於熱影像中人形為高亮度區域假設前提下，應用門檻值演算法(Thresholding)對熱影像進行切割，以獲得數個高亮度之可能行人區域，接著透過人形樣本或特徵比對方式達到熱影像行人偵測之目的。然而，此類演算法其效能取決於門檻值之選定，因此其並不適用於多種環境場景與天候狀況。為避免門檻值選定之問題，目前熱影像行人偵測改以紋理特徵描述人形外觀，再給定一個訓練資料庫條件下，其中包含大量人形及非人形樣本，透過機器學習方式，訓練一個能夠有效分辨人形與非人形之分類器，利用此分類器直接對熱影像進行掃描，以避免門檻值選定所產生之錯誤切割問題。Previously, the pedestrian detection technology based on thermal images was mainly based on the assumption that the human figure in the thermal image is a high-brightness area. The thermal image was cut using a threshold algorithm to obtain several possible pedestrian areas with high brightness. Human shape samples or feature comparison methods achieve the purpose of thermal image pedestrian detection. However, the effectiveness of such algorithms depends on the selection of thresholds, so they are not suitable for many environmental scenarios and weather conditions. In order to avoid the problem of threshold selection, the current thermal image pedestrian detection uses texture features to describe the appearance of human figures. Given a training database, it contains a large number of human figures and non-human figures. Using machine learning, training one can effectively A classifier that distinguishes between humanoid and non-humanoid, and uses this classifier to directly scan the thermal image to avoid the problem of erroneous cutting caused by threshold selection.

透過機器學習方式之技術雖可避免切割問題，並處理熱影像中衣物明亮度不同(Cloth Distortion)與行人外觀差異(Appearance Variation)等問題外，但其無法有效克服熱感測器所產生之未校正亮度兩極化現象(Un-Calibrated White-Black Polarity Change)，未校正亮度兩極化為熱影像中人形區域之亮度產生，為與環境溫度之對比結果，當環境溫度較低(例如，黃昏與夜晚)，熱影像人形區域為高亮度區域；反之，當環境溫度較低(例如，中午與下午)，熱影像人形區域為低亮度區域。因此，先前技術可有效應用於單一狀況，並無法有效運用於全天候(包含：日間與夜晚)的狀況。Although the technology of machine learning can avoid cutting problems and deal with issues such as clothing brightness differences (Appothance Variation) and other issues in thermal images, it cannot effectively overcome the problems caused by thermal sensors. Un-Calibrated White-Black Polarity Change. Uncorrected brightness bipolarization is caused by the brightness of the human-shaped area in the thermal image. It is the result of comparison with the ambient temperature. When the ambient temperature is low (for example, dusk and night ), The thermal image human-shaped area is a high-brightness area; conversely, when the ambient temperature is low (for example, noon and afternoon), the thermal image human-shaped area is a low-brightness area. Therefore, the prior art can be effectively applied to a single condition and cannot be effectively applied to all-weather (including day and night) conditions.

本發明係針對現有技術之不足，提出一種基於LBP索引之多層級(Multi-Level)機器學習演算法，以達到熱影像行人偵測之目的。主要提出之演算法分為訓練(Training)及測試(Testing)兩個階段，首先，給定一組熱影像行人資料庫，其中包含行人樣本與非行人樣本，對所有定義之矩形區塊(Rectangular Block)，透過LBP對所有樣本於此區塊之熱影像進行紋理編碼(Texture Encoding)，並將該訓練區塊熱影像依LBP編碼歸類，接著對相同矩形區塊之相同LBP編碼之訓練影像，以梯度方向直方圖(Histogram of Oriented Gradient, HOG)作為特徵表示，並透過學習所獲得之支援向量機(Support Vector Machine, SVM)作為該區塊與編碼之行人與非行人分類器(Pedestrian/Non-Pedestrian Classifier)，然後將所有屬於相同區塊之SVM作為弱分類器(Weak Classifier)，並透過自適應增強(Adaptive Boosting, Adaboost)選取出具鑑別能力之矩形區塊(對應至一弱分類器)，以形成行人分類器，稱為強分類器(Strong Classifier)，最為後續行人偵測使用。其次，於測試階段，使用傳統之滑動視窗(Sliding Window)方法，將行人偵測問題轉換成二元分類問題(Binary Classification)，針對每一個滑動視窗，透過上述所學習之行人分類器，判斷其是否有行人存在，以達到行人偵測之目的。The invention aims at the shortcomings of the prior art, and proposes a multi-level machine learning algorithm based on LBP index to achieve the purpose of thermal image pedestrian detection. The proposed algorithm is mainly divided into two phases: Training and Testing. First, a set of thermal image pedestrian database is given, which contains pedestrian samples and non-pedestrian samples. For all defined rectangular blocks (Rectangular Block), using LBP to texture-encode the thermal images of all samples in this block, classify the training block thermal images according to LBP encoding, and then encode the same LBP-encoded training images of the same rectangular block. Histogram of Oriented Gradient (HOG) is used as the feature representation, and the Support Vector Machine (SVM) obtained through learning is used as the block and coded pedestrian and non-pedestrian classifier (Pedestrian / Non-Pedestrian Classifier), and then use all SVMs that belong to the same block as Weak Classifier, and select adaptive rectangular blocks (corresponding to a weak classifier) through adaptive boosting (Adaboost) ) To form a pedestrian classifier, called a strong classifier (Strong Classifier), which is used for subsequent pedestrian detection. Secondly, in the testing phase, the traditional sliding window method is used to convert the pedestrian detection problem into a binary classification problem. For each sliding window, the pedestrian classifier learned above is used to determine the pedestrian classification problem. Whether there are any pedestrians to achieve the purpose of pedestrian detection.

為達上述目的及其他目的，本發明提出一種全天候熱影像式行人偵測方法，包含下列步驟：(a)擷取同一行人及非行人於一同一定義區塊中之日間熱影像及夜間熱影像，以建立熱影像的一樣本資料庫，其中該樣本資料庫包括複數個行人及複數個非行人樣本；(b)對於該樣本資料庫中的該等行人及該等非行人樣本，於該同一定義區塊中進行LBP編碼，其中將該同一定義區塊中具有互補性之LBP編碼作為相同之LBP編碼；(c)將該同一定義區塊中之LBP編碼以梯度方向直方圖(HOG)作為特徵表示，以得到該等行人樣本及該等非行人樣本之特徵訓練樣本；(d) 將該等特徵訓練樣本輸入至SVM並以Adaboost進行訓練，形成一強分類器；(e) 進行行人偵測，基於滑動視窗法對於熱影像中的強分類器進行搜索檢測並判定是否為行人。To achieve the above and other objectives, the present invention proposes an all-weather thermal image type pedestrian detection method, including the following steps: (a) capturing daytime thermal images and nighttime thermal images of the same pedestrian and non-pedestrian in a same defined block In order to establish a sample database of thermal images, wherein the sample database includes a plurality of pedestrians and a plurality of non-pedestrian samples; (b) for the pedestrians and the non-pedestrian samples in the sample database, the same LBP encoding in the definition block, where the complementary LBP encoding in the same definition block is the same LBP encoding; (c) the LBP encoding in the same definition block is the gradient direction histogram (HOG) as the Feature representation to obtain the feature training samples of the pedestrian samples and the non-pedestrian samples; (d) input the feature training samples to the SVM and train with Adaboost to form a strong classifier; (e) conduct pedestrian detection Detection, based on the sliding window method, searches and detects the strong classifiers in the thermal image and determines whether they are pedestrians.

於本發明之一實施例中，其中該步驟(b)包含下列步驟：(b1) 將該等行人及該等非行人樣本之日間熱影像及夜間熱影像進行LBP編碼；(b2) 將該同一定義區塊中具有互補性之LBP編碼作為相同之LBP編碼。In one embodiment of the present invention, the step (b) includes the following steps: (b1) LBP encoding the daytime and nighttime thermal images of the pedestrians and the non-pedestrian samples; (b2) the same A complementary LBP code in a block is defined as the same LBP code.

於本發明之一實施例中，其中該步驟(c)包含下列步驟：(c1) 將該同一定義區塊劃分成複數個區塊區域；(c2) 將各該區塊區域劃分成複數個單元區域，各該單元區域具有複數個LBP編碼；(c3) 對於各該區塊區域中所有的LBP編碼進行梯度強度與梯度方向計算；(c4) 對於各該單元區域中所有的LBP編碼，依照其梯度強度與梯度方向進行投票統計而得到各該單元區域的特徵向量，其中各該單元區域的特徵向量組成各該區塊區域的HOG特徵，及各該區塊區域的HOG特徵組成該同一定義區塊的HOG特徵，從而得到該等行人樣本及該等非行人樣本之特徵訓練樣本。In an embodiment of the present invention, the step (c) includes the following steps: (c1) dividing the same defined block into a plurality of block regions; (c2) dividing each of the block regions into a plurality of units Area, each unit area has a plurality of LBP codes; (c3) calculate the gradient strength and gradient direction for all LBP codes in each block area; (c4) for all the LBP codes in each unit area, according to its Voting statistics of gradient intensity and gradient direction to obtain the feature vector of each unit area, where the feature vector of each unit area constitutes the HOG feature of each block area, and the HOG features of each block area form the same defined area Block the HOG features to obtain feature training samples for the pedestrian samples and the non-pedestrian samples.

於本發明之一實施例中，其中該步驟(d)包含下列步驟：(d1) 掃描整張影像上之複數個不同大小的定義區域；(d2) 各該定義區塊透過步驟(a)~(c)得到複數個行人樣本及非行人樣本之特徵訓練樣本；(d3) 將該等特徵訓練樣本輸入至SVM進行訓練以得到複數個弱分類器；(d4) 透過Adaboost 運算，於該等弱分類器中找出具有行人之關鍵位置之至少一強分類器。In an embodiment of the present invention, the step (d) includes the following steps: (d1) scanning a plurality of defined areas of different sizes on the entire image; (d2) each of the defined blocks passes through steps (a) ~ (c) Obtaining feature training samples for multiple pedestrian and non-pedestrian samples; (d3) inputting these feature training samples to the SVM for training to obtain multiple weak classifiers; (d4) using Adaboost operations on the weak The classifier finds at least one strong classifier with a key position of the pedestrian.

於本發明之一實施例中，其中該步驟(e)包含下列步驟：(e1) 基於滑動視窗法對於熱影像中的強分類器進行掃描；(e2) 將該強分類器的區塊作LBP編碼；(e3) 將該LBP編碼以HOG作為特徵表示；(e4) 將該HOG特徵輸入SVM分類器進行行人之判定。In an embodiment of the present invention, the step (e) includes the following steps: (e1) scanning the strong classifier in the thermal image based on the sliding window method; (e2) using the block of the strong classifier as LBP Encoding; (e3) encode the LBP with HOG as the feature; (e4) input the HOG feature into the SVM classifier for pedestrian judgment.

藉此，本發明之基於區塊LBP編碼之多層級人形分類器之全天候熱影像式行人偵測方法係透過上述所提出之LBP紋理編碼與歸類方式，可避免因未校正亮度兩極化現象，導致特徵過於混亂以至於影響辨識結果；以及利用Adaboost訓練分類器可有效選取與整合具鑑別度之區塊，以克服因行人外觀姿態變異與遮蔽所產生之錯誤辨識。Therefore, the all-weather thermal image pedestrian detection method of the multi-layer humanoid classifier based on block LBP coding of the present invention can avoid the uncorrected brightness polarization phenomenon through the above-mentioned LBP texture coding and classification method. The features are too chaotic to affect the recognition results; and Adaboost training classifiers can effectively select and integrate discriminative blocks to overcome misidentification caused by pedestrian appearance and posture variations and obscuration.

為充分瞭解本發明之目的、特徵及功效，茲藉由下述具體之實施例，並配合所附之圖式，對本發明做一詳細說明，說明如後：In order to fully understand the purpose, features and effects of the present invention, the following specific embodiments are used in conjunction with the accompanying drawings to make a detailed description of the present invention, which will be described later:

如圖1所示，本發明之全天候熱影像式行人偵測方法，其具體包含步驟(a)~ 步驟(e)，其中步驟(a)~步驟(d)為訓練階段，步驟(e)為測試階段：As shown in FIG. 1, the all-weather thermal image type pedestrian detection method of the present invention specifically includes steps (a) to (e), where steps (a) to (d) are training stages, and step (e) is Test phase:

步驟(a)、擷取同一行人及非行人於一同一定義區塊中之日間熱影像及夜間熱影像，並重複擷取複數個行人及非行人熱影像，以建立熱影像的一樣本資料庫，因此該樣本資料庫包括複數個行人及複數個非行人之日間熱影像及夜間熱影像樣本。Step (a): Acquire daytime thermal images and nighttime thermal images of the same pedestrian and non-pedestrian in the same defined block, and repeatedly retrieve multiple thermal images of pedestrians and non-pedestrians to create a database of thermal images. Therefore, the sample database includes a plurality of pedestrians and a plurality of non-pedestrians during the day and night at night.

步驟(b)、對於該樣本資料庫中的該等行人及該等非行人樣本，於該同一定義區塊中進行LBP編碼，其中將該同一定義區塊中具有互補性之LBP編碼作為相同之LBP編碼，以克服熱影像日夜間特性相反問題。其中，(b1) 將該等行人及該等非行人樣本之日間熱影像及夜間熱影像進行LBP編碼；(b2) 將該同一定義區塊中具有互補性之LBP編碼作為相同之LBP編碼。如圖2所示，一行人的身體部分之日間LBP編碼=124 與夜間LBP編碼=131為互補，而將其設定為互補特徵。Step (b): For the pedestrians and the non-pedestrian samples in the sample database, perform LBP encoding in the same defined block, where the complementary LBP codes in the same defined block are regarded as the same LBP coding to overcome the opposite problem of day and night characteristics of thermal images. Among them, (b1) encode the daytime and nighttime thermal images of these pedestrian and non-pedestrian samples by LBP encoding; (b2) use the complementary LBP encoding in the same definition block as the same LBP encoding. As shown in Figure 2, the daytime LBP code of a person's body part = 124 with night LBP coding = 131 is complementary, and it is set as a complementary feature.

步驟(c)、將該同一定義區塊中之LBP編碼以HOG作為特徵表示，以得到該等行人樣本及該等非行人樣本之特徵訓練樣本。其中，(c1) 將該同一定義區塊(windows)劃分成複數個區塊區域(block)；(c2) 將各該區塊區域劃分成複數個單元區域(cell)，各該單元區域具有複數個LBP編碼；(c3) 對於各該區塊區域中所有的LBP編碼進行梯度強度與梯度方向計算；(c4) 對於各該單元區域中所有的LBP編碼，依照其梯度強度與梯度方向進行投票統計而得到各該單元區域的特徵向量，其中各該單元區域的特徵向量組成各該區塊區域的HOG特徵，及各該區塊區域的HOG特徵組成該同一定義區塊的HOG特徵，從而得到該等行人樣本及該等非行人樣本之特徵訓練樣本。上述之HOG主要基於權重式HOG以描述人形局部影像紋理之特徵表示式(Feature Descriptor)，目前已被證實為有效之人形表示方式，而HOG特徵表示主要包含以下三個步驟：(1)區塊定義，(2)梯度強度與方向計算及(3)直方圖統計。以下將針對此三個步驟進行說明：Step (c): The LBP code in the same definition block is represented by HOG as a feature to obtain characteristic training samples of the pedestrian samples and the non-pedestrian samples. Among them, (c1) divides the same defined block (windows) into a plurality of block areas (block); (c2) divides each block area into a plurality of cell areas (cells), each of which has a plurality of block areas (C3) calculate the gradient strength and gradient direction for all LBP codes in each block area; (c4) vote statistics for all LBP codes in each unit area according to their gradient strength and gradient direction The feature vector of each unit area is obtained, wherein the feature vector of each unit area constitutes the HOG feature of each block area, and the HOG features of each block area form the HOG feature of the same defined block, thereby obtaining the Equally pedestrian samples and feature training samples of these non-pedestrian samples. The above HOG is mainly based on the weighted HOG to describe the feature descriptor of the human image local image texture (Feature Descriptor), which has been proven to be an effective human form representation. The HOG feature representation mainly includes the following three steps: (1) Block Definition, (2) calculation of gradient intensity and direction, and (3) histogram statistics. These three steps are explained below:

(1)區塊定義：HOG演算法會先將影像畫面劃分為多種不同大小及數量的視窗區域，例如，圖3係說明HOG演算法劃分之視窗區域、區塊區域、單元區域及像素(pixel)之示意圖。視窗區域100是由固定大小及數量的區塊區域(例如區塊區域110~區塊區域140)所組成的區域，區塊區域是由固定大小及數量的單元區域(例如，單元區域111~單元區域114)所組成的區域，各區塊區域之間可以重疊(例如，區塊區域110與區塊區域120)，單元區域是由固定大小及數量的像素(例如，像素1111及像素1112)所組成的區域，各單元區域之間不重疊。以圖3所示之區塊區域110為例，區塊區域110劃分為四個單元區域111~114，即其單元維度大小為4×4 pixel。(1) Block definition: The HOG algorithm will first divide the image screen into a variety of different size and number of window areas. For example, Figure 3 illustrates the window area, block area, unit area, and pixel (pixel) divided by the HOG algorithm. ). The window area 100 is an area composed of a fixed size and number of block areas (for example, block area 110 to block area 140). The block area is a fixed size and number of unit areas (for example, unit area 111 to unit). Area 114). Each block area can overlap (for example, block area 110 and block area 120). The unit area is a fixed size and number of pixels (for example, pixels 1111 and 1112). The composition area, each unit area does not overlap. Taking the block area 110 shown in FIG. 3 as an example, the block area 110 is divided into four unit areas 111 to 114, that is, the unit dimension size is 4 × 4 pixel.

(2)梯度強度與方向計算：對區塊中所有像素點，透過水平遮罩與垂直遮罩分別計算出其水平分量與垂直分量，其中表示像素點之亮度。而之梯度強度與梯度方向計算如(式一)及(式二)：(2) Gradient intensity and direction calculation: for all pixels in the block Through horizontal mask With vertical mask Calculate their horizontal components separately With vertical component ,among them Pixels The brightness. and Gradient intensity With gradient direction The calculations are as in (Formula 1) and (Formula 2):

…………………………….(式一) ………………………………. (Formula 1)

……………………………………...(式二) …………………………………… (Formula 2)

(3)直方圖統計：經過上述計算後，針對每一個單元區域中所有的像素點，依其在梯度強度大小對梯度方向進行投票統計，一般而言，將方向以為一個區間(Bin)，因此共可區分為9個區間(Bin)，像素點所對應之區間索引值為，投票統計權重為該像素之梯度強度，因此對各細胞可獲得一9維特徵向量(Feature vector)，該9維特徵向量用以描述各該單元區域之紋理特性，最後將4個單元區域之特徵向量以串接方式(Concatenation)結合而成一個36維(9x4=36)HOG特徵向量(Feature vector)。進一步言，本發明係將上述之HOG特徵表示中的像素點取代為步驟(b)得到之LBP編碼，因此本發明係利用區塊LBP編碼的方式來提取熱影像之特徵。(3) Histogram statistics: After the above calculations, for all pixels in each unit area, the gradient direction is voted according to its gradient intensity. Generally speaking, the direction is calculated by Is an interval (Bin), so Can be divided into 9 intervals (Bin), pixels The corresponding interval index value is , The voting statistical weight is the gradient intensity of the pixel , So a 9-dimensional feature vector can be obtained for each cell The 9-dimensional feature vector is used to describe the texture characteristics of each unit area. Finally, the feature vectors of the 4 unit areas are combined in a concatenation manner to form a 36-dimensional (9x4 = 36) HOG feature vector. ) . Furthermore, the present invention replaces the pixels in the HOG feature representation with the LBP code obtained in step (b). Therefore, the present invention uses the method of block LBP encoding to extract the features of the thermal image.

步驟(d)、將該等特徵訓練樣本輸入至SVM並以Adaboost進行訓練，形成一強分類器。其中，(d1) 掃描整張影像上之複數個不同大小的定義區域；(d2) 各該定義區塊透過步驟(a)~(c)得到複數個行人樣本及非行人樣本之特徵訓練樣本；(d3) 將該等特徵訓練樣本輸入至SVM進行訓練以得到複數個弱分類器；(d4) 透過Adaboost 運算，於該等弱分類器中找出具有行人之關鍵位置之至少一強分類器。Step (d): input the feature training samples to the SVM and train them with Adaboost to form a strong classifier. Among them, (d1) scans a plurality of defined areas of different sizes on the entire image; (d2) each of the defined blocks obtains a plurality of pedestrian samples and non-pedestrian sample characteristic training samples through steps (a) to (c); (d3) input the feature training samples to the SVM for training to obtain a plurality of weak classifiers; (d4) find at least one strong classifier with the key position of the pedestrian in the weak classifiers through the Adaboost operation.

步驟(e)、進行行人偵測，基於滑動視窗法對於熱影像中的強分類器進行搜索檢測並判定是否為行人。其中，(e1) 基於滑動視窗法對於熱影像中的強分類器進行掃描；(e2) 將該強分類器的區塊作LBP編碼；(e3) 將該LBP編碼以HOG作為特徵表示；(e4) 將該HOG特徵輸入SVM分類器進行行人之判定。如此，本發明之全天候熱影像式行人偵測方法於測試階段時，不須對熱影像中的每一個區塊區域皆進行行人辨識，而是只要對於步驟(d)中藉由Adaboost訓練得到的強分類器(即一行人分類器)進行掃描即可，從而讓使用本發明之全天候熱影像式行人偵測方法之系統能減少運算量。Step (e): Perform pedestrian detection, search and detect the strong classifier in the thermal image based on the sliding window method, and determine whether it is a pedestrian. Among them, (e1) scans the strong classifier in the thermal image based on the sliding window method; (e2) encodes the block of the strong classifier by LBP; (e3) encodes the LBP with HOG as the feature; (e4) ) Enter the HOG feature into the SVM classifier for pedestrian determination. In this way, in the test phase of the all-weather thermal image type pedestrian detection method of the present invention, it is not necessary to perform pedestrian identification for each block area in the thermal image, but only for the area obtained by the Adaboost training in step (d). The strong classifier (that is, a pedestrian classifier) can be scanned, so that the system using the all-weather thermal image type pedestrian detection method of the present invention can reduce the calculation amount.

需說明的是，上述之LBP編碼、SVM及Adaboost分類器訓練皆為圖像辨識技術領域中已相當成熟之技術，因此於本說明書中便不再對其操作步驟進行說明。It should be noted that the above-mentioned LBP coding, SVM and Adaboost classifier training are all quite mature technologies in the field of image recognition technology, so the operation steps will not be described in this specification.

基於上述，本發明透過自行建構之熱像人形資料庫，其中主要包含兩天中四個時段(日間、中午、黃昏與夜間)之影像，對上述所提之全天候熱影像式行人偵測方法進行四個實驗分析，此四個實驗設計分別為，實驗1 (訓練：第一天四個時段所有影像；測試：第二天四個時段所有影像)；實驗2 (訓練：第二天四個時段所有影像；測試：第一天四個時段所有影像)；實驗3 (訓練：兩天中日間與中午時段所有影像；測試：兩天中黃昏與夜間時段所有影像)；實驗4 (訓練：兩天中黃昏與夜間時段所有影像；測試：兩天中日間與中午時段所有影像)，所獲得之實驗數據如下表一所示，其中本發明使用之效能評估之準則分別為準確性(Precision)、召回(Recall)及F-Measure，由下表一可明顯發現本發明所提出之全天候熱影像式行人偵測方法的準確度高達98%以上，召回(Recall)率亦高於80%，此可說明本發明所提出之全天候熱影像式行人偵測方法可有效應用於全天候之場景；另外，由圖示4中所示之熱像式行人檢測結果圖亦顯示出本發明之全天候熱影像式行人偵測方法能精確的檢測出熱像式人形。Based on the above, the present invention uses the self-constructed thermal image humanoid database, which mainly includes images in four periods of two days (day, noon, dusk and night) to perform the above-mentioned all-weather thermal image type pedestrian detection method. Four experimental analyses. The four experimental designs are: Experiment 1 (training: all images in the first day and four periods; test: all images in the second day and four periods); Experiment 2 (training: four periods of the second day) All images; test: all images on the first day and four periods); experiment 3 (training: all images during the two days during the day and noon time; test: all images on two days during the dusk and night periods); experiment 4 (training: two days) All images at mid-dusk and night time; test: all images during two days during day and noon time), the experimental data obtained are shown in Table 1 below, where the performance evaluation criteria used in the present invention are accuracy and recall respectively (Recall) and F-Measure, from Table 1 below, it can be clearly found that the accuracy of the all-weather thermal image type pedestrian detection method proposed by the present invention is as high as 98% or more, and the recall rate is also high. At 80%, this can show that the all-weather thermal image pedestrian detection method proposed by the present invention can be effectively applied to all-weather scenes; in addition, the thermal image pedestrian detection result diagram shown in Figure 4 also shows the present invention The all-weather thermal image-type pedestrian detection method can accurately detect thermal image-type human figures.

表一、本發明之全天候熱影像式行人偵測方法之效能評估 Table 1. Effectiveness evaluation of the all-weather thermal image type pedestrian detection method of the present invention

藉此，本發明之基於區塊LBP編碼之多層級人形分類器之全天候熱影像式行人偵測方法係透過所提出之LBP紋理編碼與歸類方式，可避免因未校正亮度兩極化現象，導致特徵過於混亂以至於影響辨識結果；以及利用Adaboost訓練分類器可有效選取與整合具鑑別度之區塊，以克服因行人外觀姿態變異與遮蔽所產生之錯誤辨識。Therefore, the all-weather thermal image type pedestrian detection method of the multi-layer humanoid classifier based on block LBP coding of the present invention can avoid the uncorrected brightness polarization phenomenon through the proposed LBP texture coding and classification method. Features are too chaotic to affect the recognition results; and Adaboost training classifiers can effectively select and integrate discriminative blocks to overcome misidentification caused by pedestrian appearance and posture variation and obscuration.

本發明在上文中已以較佳實施例揭露，然熟習本項技術者應理解的是，該實施例僅用於描繪本發明，而不應解讀為限制本發明之範圍。應注意的是，舉凡與該實施例等效之變化與置換，均應設為涵蓋於本發明之範疇內。因此，本發明之保護範圍當以申請專利範圍所界定者為準。The present invention has been disclosed in the foregoing with a preferred embodiment, but those skilled in the art should understand that this embodiment is only for describing the present invention, and should not be interpreted as limiting the scope of the present invention. It should be noted that all changes and substitutions equivalent to this embodiment should be included in the scope of the present invention. Therefore, the scope of protection of the present invention shall be defined by the scope of the patent application.

(a)~(e)‧‧‧步驟(a) ~ (e) ‧‧‧step

100‧‧‧視窗區域100‧‧‧window area

110~140‧‧‧區塊區域110 ~ 140‧‧‧ Block area

111~114‧‧‧單元區域111 ~ 114‧‧‧Unit area

1111~1112‧‧‧像素1111 ~ 1112‧‧‧pixels

［圖1］係為本發明一實施例中之全天候熱影像式行人偵測方法之偵測流程圖。［圖2］係為本發明一實施例中之LBP編碼之互補特性改進圖。［圖3］係為本發明一實施例中之HOG特徵提取之區塊定義示意圖。［圖4］係為本發明一實施例中之利用全天候熱影像式行人偵測方法之熱像式行人檢測結果圖。[Fig. 1] is a detection flowchart of an all-weather thermal image type pedestrian detection method according to an embodiment of the present invention. [Fig. 2] is a diagram for improving the complementary characteristics of LBP coding in an embodiment of the present invention. [Fig. 3] is a block definition diagram of HOG feature extraction in an embodiment of the present invention. [Fig. 4] It is a thermal image type pedestrian detection result diagram using an all-weather thermal image type pedestrian detection method in an embodiment of the present invention.

Claims

An all-weather thermal image type pedestrian detection method includes the following steps: (a) capturing daytime thermal images and nighttime thermal images of the same pedestrian and non-pedestrian in a same defined block to create a sample database of thermal images, The sample database includes a plurality of pedestrians and a plurality of non-pedestrian samples; (b) for the pedestrians and the non-pedestrian samples in the sample database, LBP encoding is performed in the same defined block, where the The complementary LBP codes in the same definition block are regarded as the same LBP codes; (c) the LBP codes in the same definition block are represented by the gradient direction histogram (HOG) as a feature to obtain the pedestrian samples and the Feature training samples such as non-pedestrian samples; (d) Input these feature training samples to SVM and train them with Adaboost to form a strong classifier; (e) Perform pedestrian detection. The strong classifier performs search detection and determines whether it is a pedestrian.

The all-weather thermal image type pedestrian detection method described in claim 1, wherein step (b) includes the following steps: (b1) LBP encoding the daytime and nighttime thermal images of the pedestrians and the non-pedestrian samples (B2) Treat the complementary LBP codes in the same definition block as the same LBP codes.

The all-weather thermal image type pedestrian detection method as described in claim 1, wherein step (c) includes the following steps: (c1) dividing the same defined block into a plurality of block regions; (c2) dividing each of the regions The block area is divided into a plurality of unit areas, each of which has a plurality of LBP codes; (c3) gradient intensity and gradient direction calculations are performed for all LBP codes in each of the block areas; (c4) for each of the unit areas All LBP codes are voted according to their gradient intensity and gradient direction to obtain the feature vector of each unit area, where the feature vector of each unit area constitutes the HOG feature of each block area, and the The HOG features form the HOG features of the same defined block, so as to obtain the feature training samples of the pedestrian samples and the non-pedestrian samples.

The all-weather thermal image type pedestrian detection method as described in claim 1, wherein step (d) includes the following steps: (d1) scanning a plurality of defined areas of different sizes on the entire image; (d2) each of the defined areas The block obtains feature training samples of a plurality of pedestrian samples and non-pedestrian samples through steps (a) ~ (c); (d3) inputs these feature training samples to the SVM for training to obtain a plurality of weak classifiers; (d4) through Adaboost operation finds at least one strong classifier with the key position of the pedestrian among the weak classifiers.

The all-weather thermal image type pedestrian detection method as described in claim 4, wherein step (e) includes the following steps: (e1) scanning the strong classifier in the thermal image based on the sliding window method; (e2) The block of the classifier is LBP coded; (e3) the LBP code is represented by HOG as a feature; (e4) the HOG feature is input into the SVM classifier for pedestrian judgment.