TWI417796B

TWI417796B - Method of recognizing objects in images

Info

Publication number: TWI417796B
Application number: TW98116102A
Authority: TW
Inventors: C Smith Gregory
Original assignee: Univ Nat Taiwan
Priority date: 2009-05-15
Filing date: 2009-05-15
Publication date: 2013-12-01
Also published as: TW201040847A

Description

Method of identifying objects in images

本發明係關於一種在影像中辨識物體的方法，特別係關於一種可準確及迅速辨識出擷取影像之形狀、大小或位置之在影像中辨識物體的方法。The present invention relates to a method for recognizing an object in an image, and more particularly to a method for recognizing an object in an image that accurately and quickly recognizes the shape, size or position of the captured image.

習知物體辨識系統，係包含一擷取輸入影像用之照相機(或類似裝置)，並包含切割輸入影像、從切割之影像中擷取物體、表示輸入之影像物體，及將待辨識擷取物體分類。A conventional object recognition system includes a camera (or similar device) for capturing an input image, and includes cutting an input image, extracting an object from the cut image, representing the input image object, and extracting the object to be recognized classification.

經擷取輸入影像一般係由數位格式之色彩或灰階像素資料組成，並被排列為X與Y方向之二維陣列，其中，該影像可包含數千或數萬個獨立像素。The captured input image is generally composed of color or grayscale pixel data in a digital format and arranged in a two-dimensional array of X and Y directions, wherein the image may contain thousands or tens of thousands of independent pixels.

在最低階層的影像表示部份，物體可被模式化為全部的影像，並將該影像與一在資料庫中的原始輸入的影像，一個像素一個像素地做比對。不過多數物體辨識系統採用不同的切割影像及擷取、表示及分類影像物體的方法，以提升速度與/或準確度(Krumm,U.S. Patent 7,092,566)。At the lowest level of the image representation, the object can be modeled as a full image, and the image is compared to the original input image in the database, pixel by pixel. However, most object recognition systems use different cut images and methods of capturing, representing, and classifying image objects to increase speed and/or accuracy (Krumm, U.S. Patent 7,092,566).

特別指出的是，色彩長條統計圖已被用於各種不同的處理步驟中，其主要係用以提升彩色影像中物體辨識的速度。In particular, the color strip chart has been used in a variety of different processing steps, mainly to improve the speed of object recognition in color images.

在美國7,020,329號專利中，Prempraneerach等人描述了一種利用色彩長條統計圖切割一彩色影像成複數個區域的技術，其係藉由將影像轉換為三維色彩空間，並產生該色彩空間之每一維度的色彩長條統計圖，使用該長條統計圖在該三維色彩空間中產生複數個連接盒，以計算每一連接盒之一正規化變異值，而形成連接盒叢組，對應該等經切割與對應相同色彩特性之影像區域之叢組化像素影像，以從該影像中取出經切割區域，並分類該影像域中之叢組化像素，以辨識該影像中的物體。對於其中之分類方法，可使用類神經網路(具可適性樣板匹配技術)、頻敏競爭學習、對手處罰競爭學習或統計分類分級法。In US Patent No. 7,020,329, Prempraneerach et al. describe a technique for cutting a color image into a plurality of regions using a color strip chart by converting the image into a three-dimensional color space and generating each of the color spaces. a color strip chart of the dimension, using the strip chart to generate a plurality of connection boxes in the three-dimensional color space, to calculate a normalized variation value of each connection box, and forming a connected box group, corresponding to the same A clustered pixel image of the image region corresponding to the same color characteristic is cut to extract the cut region from the image, and the clustered pixels in the image domain are classified to identify the object in the image. For the classification method, a neural network (with adaptive template matching technology), frequency-sensitive competition learning, opponent penalty competition learning or statistical classification and classification method can be used.

Prempraneerach等人說明其所提出之叢組化技術較過去疊代叢組化技術更具效力，並還指出其所提出之切割方法更藉每一像素僅處理一次的方式，降低計算時間，以建立其後叢組化所需之長條統計圖的強度、色度及飽和度。Prempraneerach et al. stated that the proposed clustering technique is more effective than the past iterative clustering technique, and also pointed out that the proposed cutting method is only processed once per pixel, reducing computation time to establish The intensity, chroma, and saturation of the long chart required for subsequent clustering.

不過，該方法在達成物體辨識處理之前仍需要一連串複雜的計算。首先，輸入影像必須被一邊緣預留濾鏡加以處理，使彩色影像得到平滑化，並減少在後續求得之色彩長條統計圖中的不連續性，接著該影像中的像素自RBG轉換為LUV，並以方程式組將LUV轉換為IHS色彩空間值，計算出IHS長條統計圖，過濾該長條統計圖，以移除高頻雜訊，且每一長條統計圖中的最低處被找出，以形成連接盒(藉以一高斯核或高斯濾鏡旋繞每一長條統計圖的方式為之)。However, this method still requires a series of complicated calculations before the object recognition process is achieved. First, the input image must be processed by an edge reservation filter to smooth the color image and reduce the discontinuity in the subsequently determined color strip chart. The pixels in the image are then converted from RBG to LUV, and convert the LUV into the IHS color space value by the equation group, calculate the IHS strip chart, filter the strip chart to remove the high frequency noise, and the lowest point in each strip chart Find out to form a junction box (by means of a Gaussian kernel or a Gaussian filter to spiral each strip chart).

接著，連接盒叢組被計算每一連接盒中之像素值之正規變異的方式形成，且連接盒被根據正規化變異值連結為樹狀結構。連接盒之具區域最小正規化變異值者被作為每一樹狀結構之根節點，其餘者則被利用一最陡梯度下降演算法連結為根節點的分支節點。Next, the connected cassette group is formed by calculating the normal variation of the pixel values in each of the connection boxes, and the connection boxes are connected into a tree structure according to the normalized variation values. The minimum normalized variability of the region of the junction box is used as the root node of each tree structure, and the rest is linked to the branch node of the root node by a steepest gradient descent algorithm.

為使該方法正常工作，影像中的同質區域必須被叢組化成該三維長條統計圖中的良好定義間距，不過真實影像色彩變異可能會難與雜訊或自RGB轉換成IHS色彩空間之非線性轉換所形成之色彩變異相區隔。In order for this method to work properly, the homogenous regions in the image must be clustered into well-defined spacings in the three-dimensional strip chart, but the true image color variation may be difficult to convert from noise or RGB to IHS color space. The color variation phase formed by linear transformation is separated.

在美國第7,092,566、6,952,496及6,611,622號專利中，Krumm描述使用色彩長條統計圖，以表示及分類一輸入影像的技術。其物體辨識方法首先建立並儲存待辨識物體之模型長條統計圖，並接著切割一輸入影像，以取出可能對應該等待辨識物體之區域，自該等經切割區域中求出長條統計圖，接著比較經求出長條統計圖與該經儲存之模型長條統計圖。如果兩長條統計圖的相似程度超過預定臨界值，代表輸入影像與模型物體相匹配。相匹配之輸入長條統計圖亦可被加至該固定物體之模型長條統計圖的資料庫中。In U.S. Patent Nos. 7,092,566, 6,952,496 and 6,611,622, Krumm describes a technique for using color strip charts to represent and classify an input image. The object recognition method firstly establishes and stores a model strip chart of the object to be identified, and then cuts an input image to extract an area that may be corresponding to the object to be recognized, and obtains a long graph from the cut areas. Then, the long graph and the stored model strip graph are obtained. If the similarity of the two long bars exceeds a predetermined threshold, the input image is matched to the model object. The matching input bar graph can also be added to the database of the model strip chart of the fixed object.

上述方法係藉下列方式建立模型與輸入影像長條統計圖：判定一模型或輸入影像區域中，像素所呈現之真實RGB色彩，將真實像素色彩之全部範圍切分成一系列分立色彩範圍或量化色彩類目，將擷取出之模型或輸入影像區域之每一像素指定至經量化之色彩類目，及建立被指定予每一經量化色彩類目之像素數的計數值。在一較佳實施例中，RGB像素值被量化為27個色彩類目，藉由計算在每一經量化色彩類目中的像素計數值做輸入影像及模型長條統計圖的比較。模型長條統計圖必須與輸入待辨識物體相似之一前序(prefatory)影像中求出，求出模型長條統計圖之影像區域亦被用於後續輸入影像中，以取出待辨識物體。The above method establishes a model and input image strip chart by determining the true RGB color represented by the pixel in a model or input image area, and dividing the entire range of the real pixel color into a series of discrete color ranges or quantized colors. The category assigns each pixel of the extracted model or input image area to the quantized color category and establishes a count value for the number of pixels assigned to each quantized color category. In a preferred embodiment, the RGB pixel values are quantized into 27 color categories, and the comparison of the input image and the model strip chart is performed by calculating the pixel count value in each of the quantized color categories. The model strip chart must be obtained from a prefatory image similar to the input object to be identified. The image area of the model strip chart is also used in the subsequent input image to extract the object to be identified.

在一較佳實施例中，模型影像被藉下列方式切割：自相同影像裝置分析諸時序影像，並藉分辨時序影像上不明顯改變之像素值判定一靜態背景影像，藉自一後續影像中抽出該背景影像形成一前景影像及藉分辨各平滑改變像素值群的方式切割該前景影像為各物體區域。In a preferred embodiment, the model image is cut by: analyzing the time series images from the same image device, and determining a static background image by distinguishing pixel values that are not significantly changed on the time series image, and extracting from a subsequent image. The background image forms a foreground image and cuts the foreground image into object regions by distinguishing each of the smoothed pixel values.

然而，該方法主要係用於追蹤時序影像中的物體。此外，該方法仍需要大量之建立模型長條統計圖與比較輸入影像長條統計圖與模型長條統計圖之處理時間。該方法所用之色彩長條統計圖產生技術並不預留原幾何形狀，係為固定色彩之影像像素數目的總結，故真實物體形狀、大小及位置不能被判定。最後，不同物體之類似長條統計圖可能會得到不正確的物體辨識結果。However, this method is mainly used to track objects in time-series images. In addition, the method still requires a large amount of time to establish a model strip chart and compare the processing time of the input image strip graph and the model strip graph. The color strip chart generation technique used in the method does not reserve the original geometric shape, which is a summary of the number of pixels of the fixed color image, so the shape, size and position of the real object cannot be determined. Finally, similar strip graphs of different objects may result in incorrect object recognition results.

在第6,532,301及6,477,272號美國專利中，Krum說明了使用共生(co-occurrence)長條統計圖代表及確認一搜尋影像中一經模型化之物體之位置的技術。In U.S. Patent Nos. 6,532,301 and 6,477,272, Krum describes the use of a co-occurrence strip chart to represent and confirm the location of a modeled object in a search image.

上述方法先建立物體的模型影像，並接著計算每一模型影像之一共生長條統計圖。其中模型影像之建立，係藉該物體周環之彼此以等角相隔，但不同間距之視點擷取該待被確認物體之影像組，共生長條統計圖之計算則係藉由確認該模型影像中每一種可能，且唯一，之未排列像素對及產生像素對落於相同色彩範圍之數量。The above method first establishes a model image of the object, and then calculates a co-growth bar graph for each model image. The image of the model is created by the equiangular separation of the circumferences of the object, but the viewpoints of the different distances capture the image group of the object to be confirmed, and the calculation of the common growth bar chart is performed by confirming the image of the model. Each of the possible, and unique, unaligned pairs of pixels and the number of pairs of pixels that fall within the same color range.

接著，預定大小之搜尋窗，係自該搜尋影像之覆疊部份產生，且每一搜尋窗之共生長條統計圖，係利用該技術及模型影像共生長條統計圖建立之像素色彩及間離範圍產生。Then, a search window of a predetermined size is generated from the overlay portion of the search image, and a common growth bar graph of each search window is a pixel color and a space established by using the technology and the model image co-growth bar graph. Produced from the range.

最後，每一模型影像共生長條統計圖與每一搜尋窗共生長條統計圖被比較，以評估出其相似度。每一模型影像共生長條統計圖匹配之搜尋窗共生長條統計圖，係被設定為可能包含該待辨識物體，其中匹配係由相似值大於一臨界值的方式進行。接著，待辨識物體之位置被判定為位於可能包含該待辨識物體之所有搜尋窗中之具最大相似度計量值之單一搜尋區內，並可以重覆往上、往下、往左或往右移動一像素位置，接著計算該搜尋窗共生長條統計圖，及比較該搜尋窗共生長條統計圖與每一模型長條統計圖，藉以求出可能較高之相似度計量值。該系統及方法必須令其搜尋窗大小、色彩範圍及距離範圍在影像搜尋開始即被選定。Finally, each model image co-growth bar chart is compared with each search window co-growth bar chart to evaluate its similarity. Each model image co-growth bar graph matching search window co-growth bar graph is set to possibly include the object to be identified, wherein the matching system is performed by a similar value greater than a critical value. Then, the position of the object to be recognized is determined to be located in a single search area having the largest similarity measure among all search windows that may include the object to be recognized, and may be repeated upward, downward, left or right. Moving a pixel position, then calculating the search window co-growth bar graph, and comparing the search window co-growth bar graph with each model strip graph to obtain a possibly higher similarity measure value. The system and method must have its search window size, color range, and range of distances selected at the beginning of the image search.

Krum說明該方法之數個優點，其中特別提到共生長條統計圖係代表辨識影像中物體的有效方法。藉持續追蹤具匹配色彩，並在其間具有一固定距離之像素，能夠將可變之幾何資訊量加至一規則唯色彩長條統計圖中。接著，藉考慮色彩與幾何資訊，物體辨識方法在背景凌亂及適量閉塞及物體屈曲的條件下仍可工作。Krum illustrates several advantages of this approach, with particular reference to co-growth bar graphs representing an efficient way to identify objects in an image. By continuously tracking pixels with matching colors and having a fixed distance therebetween, the variable geometric information can be added to a regular color strip chart. Then, by considering the color and geometric information, the object recognition method can still work under the condition that the background is messy and the amount of occlusion and the object is buckling.

不過，長條統計圖匹配所需之模型影像資料庫的建立相當耗時，且藉由計算模型影像及搜尋影像中，針對每一種可能唯一非排列像素距離進行計量，在計算共生長條統計圖上亦相當消耗計算成本。However, the establishment of a model image database required for long-status chart matching is quite time consuming, and by calculating the model image and the search image, the distance of each possible unique non-arranged pixel is measured, and the co-growth bar graph is calculated. It also consumes considerable computational costs.

此外，物體的表示，並不包含模型與搜尋影像中同色彩像素間距以外的其它詳細幾何資訊，真實物體形狀、大小及位置之資訊並不存在，故最後的物體位置判定不是準確的，且後續之判定位置連續限定需要大量的計算量。In addition, the representation of the object does not include detailed geometric information other than the distance between the model and the searched image, and the information of the shape, size and position of the real object does not exist, so the final object position determination is not accurate, and subsequent The determination of the position continuously requires a large amount of calculation.

再者，搜尋窗大小等方法參數會影響物體辨識準確度，且影像必須被調整大小，以令搜尋與模型影像間的總大小差得以受到處理。Furthermore, method parameters such as the size of the search window affect the object recognition accuracy, and the image must be resized so that the total size difference between the search and the model image is processed.

物體辨識系統之一有效用，且定義相當良好的應用為在移動之車輛上即時辨識交通號誌，且此交通號誌系統一般必須能夠快速抽出物體及正確對物體加以分類。One of the object recognition systems is effective, and a well-defined application is to instantly identify traffic signs on a moving vehicle, and the traffic sign system must generally be able to quickly extract objects and properly classify objects.

在美國第6,801,638號專利中，Jassen等人說明一種辨識交通號誌之方法與裝置，其能靠記憶體協助顯示該等號誌給觀看者。在該方法與裝置中，影像係由一影像感測器擷取，並透過一資訊處理單元中之分類器進行分析及分類。接著，一交通號誌之合成影像被產生，該影像被存於一記憶單元中，並透過一顯示單元顯示。In U.S. Patent No. 6,801,638, Jassen et al. describe a method and apparatus for identifying traffic signs that can assist in displaying the number to the viewer by means of memory. In the method and device, the image is captured by an image sensor and analyzed and classified by a classifier in an information processing unit. Then, a synthetic image of the traffic sign is generated, and the image is stored in a memory unit and displayed through a display unit.

輸入影像先被藉色彩與/或空間位置資訊搜尋，大於平均機率之區域，則判定可能包含的交通號誌物體。物體在經判定區域中被辨識的方式係為，以階層及順序方式藉交通號誌之各分立已知特性，針對儲存中特性資料分類該等影像區域，如為確認外形(圓或正方形)及內部符號用之校正程序等。分類器比較輸入物體特性資料與記憶單元中儲存之典型特性資料組，物體在比較距離低於組臨界值時被辨識出。The input image is first searched by color and/or spatial position information, and the area larger than the average probability determines the traffic sign object that may be included. The manner in which the object is identified in the determined region is to classify the image regions for the in-store characteristic data by hierarchically and sequentially by means of the discrete known characteristics of the traffic signs, such as to confirm the shape (circle or square) and Correction procedures for internal symbols, etc. The classifier compares the input object characteristic data with the typical characteristic data set stored in the memory unit, and the object is recognized when the comparison distance is lower than the group threshold.

然而，如此設計之分類器必須被訓練數次，以處理因天氣變化及光條件之影像品質差異。此外，該分類器亦與具記憶單元中儲存形狀之輸入物體形狀資料間的關聯性相關。一般而言，與關聯性相關之分類器可能不夠準確或速度慢。關聯性與訓練、觀視環境及/或儲存中形狀資料之品質相關。However, the classifier so designed must be trained several times to handle image quality differences due to weather changes and light conditions. In addition, the classifier is also related to the correlation between the shape data of the input object having the shape stored in the memory unit. In general, classifiers associated with associations may not be accurate or slow. Relevance is related to the quality of training, viewing environment and/or shape data in storage.

改善儲存中形狀資料需要大量的訓練或大型的儲存資料庫。其次，大型儲存資料庫需要更多的處理時間進行關聯處理。舉例而言，圓形及正方形物體在不同觀視角度時會以不同的卵形及長方形出現，如此，可能會降低物體辨識的準確度。Improving the shape data in storage requires a lot of training or a large storage database. Second, large storage databases require more processing time for correlation processing. For example, circular and square objects appear in different ovals and rectangles at different viewing angles, which may reduce the accuracy of object recognition.

上述方法亦需要對輸入影像可能包含的色彩值及/或空間位置相關之交通號誌之區域加以搜尋。The above method also requires searching for the area of the traffic signal associated with the color value and/or spatial position that the input image may contain.

在美國第6,813,545號專利中，Stromme說明了一種提醒司機至少一特定交通號誌存在的系統，其係由一影像單元、一資料庫，一自動辨識單元、一雙號誌間選擇機制及一聲音與/或可視指示器組成，其中該影像單元被連接於車輛前之靠近道路處，該資料庫包含至少一預先儲存之交通號誌形狀，該自動辨識單元則被用以偵測及確認連續影像之交通號誌，且該偵測及確認，係藉由搜尋該資料庫中之形狀的方式達成，該雙訊號間選擇機制包含於相同影像中，用以判定車輛與號誌間的距離，該聲音及/或可視指示器通知一經確認之交通號誌已出現於車輛前的路上。In U.S. Patent No. 6,813,545, Stromme describes a system for alerting a driver to at least one particular traffic sign, which consists of an image unit, a database, an automatic identification unit, a dual-choice selection mechanism, and a sound. And/or a visual indicator, wherein the image unit is connected to the road in front of the vehicle, the database includes at least one pre-stored traffic symbol shape, and the automatic identification unit is used to detect and confirm the continuous image. The traffic sign, and the detection and confirmation is achieved by searching for the shape in the database. The dual signal selection mechanism is included in the same image to determine the distance between the vehicle and the sign. The sound and/or visual indicator informs that a confirmed traffic sign has appeared on the road in front of the vehicle.

輸入影像被定期根據車速擷取，且每一輸入影像在一形狀辨識處理器中被分析，以偵測交通號誌形狀與含於形狀資訊資料中的交通號誌符號形狀。The input images are periodically captured according to the speed of the vehicle, and each input image is analyzed in a shape recognition processor to detect the shape of the traffic signal and the shape of the traffic symbol contained in the shape information.

該系統中的形狀搜尋與辨識單元係使用傳統影像處理方法，如Canny邊緣偵測，接著再於被處理影像上進行簡易的逐一像素匹配程序。號誌形狀之數個方向觀看結果被存於形狀匹配資料庫中。對於號誌中的三角形、圓形或長方形符號亦被利用圖案或用於經偵測形狀上之辨識演算法確認。色彩偵測亦被執行以核對一經偵測形狀確為一交通號誌。The shape search and recognition unit in the system uses traditional image processing methods, such as Canny edge detection, and then performs a simple pixel-by-pixel matching procedure on the processed image. The results of viewing in several directions of the shape of the logo are stored in the shape matching database. The triangular, circular or rectangular symbols in the symbol are also confirmed using a pattern or a recognition algorithm for the detected shape. Color detection is also performed to verify that the detected shape is a traffic sign.

然而，邊緣偵測與逐一像素匹配方法一般都不準確，且速度相當慢。此外，同一號誌與號誌符號形狀及大小隨車位置之不同的差異相當大，這會使得以邊緣偵測及逐一像素匹配方式執行之形狀匹配與物體辨識的準確度受制。However, edge detection and pixel-by-pixel matching methods are generally inaccurate and relatively slow. In addition, the shape and size of the same number and symbol are quite different depending on the position of the vehicle, which makes the accuracy of shape matching and object recognition performed by edge detection and pixel-by-pixel matching.

在美國專利第5,926,564號中，Masayuki Kimura係說明一種以長條統計圖模式進行影像辨識，然而，其主要係針對X軸及Y軸進行掃描，並以”0”、”1”方式進行影像比對，如此的比對模式，一旦影像擷取裝置與拍攝物產生角度差時，即無法正確辨識出物體之形狀及大小尺寸。In US Patent No. 5,926,564, Masayuki Kimura describes an image recognition in a long chart mode. However, it mainly scans the X and Y axes and performs image ratios in "0" and "1" modes. Yes, in such a comparison mode, once the image capturing device is in a different angle from the subject, the shape and size of the object cannot be correctly recognized.

由此可見，上述習用方式仍有諸多缺失，實非一良善之設計，而亟待加以改良。It can be seen that there are still many shortcomings in the above-mentioned methods of use, which is not a good design, but needs to be improved.

本案發明人鑑於上述習用方式所衍生的各項缺點，乃亟思加以改良創新，並經多年苦心孤詣潛心研究後，終於成功研發完成本件在影像中辨識物體的方法。In view of the shortcomings derived from the above-mentioned conventional methods, the inventor of the present invention has improved and innovated, and after years of painstaking research, he finally successfully developed a method for identifying objects in the image.

本發明之目的即在於提供一種藉由X軸或Y軸長條統計圖上取得之最小特定點及最大特定點，配合多項式回歸分析，即可快速並簡易判定物體形狀、大小及位置之在影像中辨識物體的方法。The object of the present invention is to provide a minimum specific point and a maximum specific point obtained by the X-axis or Y-axis bar graph, and the polynomial regression analysis can quickly and easily determine the shape, size and position of the object. A method of identifying objects.

本發明之次一目的係在於提供一種在影像中辨識物體的方法，係可擷取不同的可辨識影像特徵，如RGB色彩特徵或灰階影像特徵或視頻影像特徵，並迅速及準確對該擷取影像進行形狀、大小及位置的辨識，使其可應用於交通號誌或其他領域上。A second object of the present invention is to provide a method for recognizing an object in an image, which can capture different recognizable image features, such as RGB color features or grayscale image features or video image features, and quickly and accurately The image is identified by its shape, size and position so that it can be applied to traffic signs or other fields.

達成上述發明目的之在影像中辨識物體的方法，其步驟為：A method for recognizing an object in an image to achieve the above object is as follows:

歩驟1：先在影像資料中進行掃描，以取得一數位輸入影像；Step 1: First scanning in the image data to obtain a digital input image;

歩驟2：擷取該數位輸入影像中其中一種可辨識的特徵；例如，色彩特徵(如RGB或IHS色彩)或黑白特徵或光譜特徵等；Step 2: capturing one of the recognizable features of the digital input image; for example, a color feature (such as RGB or IHS color) or a black and white feature or spectral feature;

步驟3：同時建立可辨識影像特徵之一維X軸或Y軸長條統計圖，該X軸或Y軸長條統計圖中保有該擷取出影像之幾何資訊；Step 3: Simultaneously establish one of the identifiable image features, the X-axis or the Y-axis long bar graph, and the X-axis or Y-axis bar graph retains the geometric information of the extracted image;

歩驟4：求出X軸或Y軸長條統計圖上之最小特定點及最大特定點，以找出該等長條統計圖中的物體；Step 4: Find the minimum specific point and the maximum specific point on the X-axis or Y-axis strip chart to find the objects in the strip chart;

歩驟5：藉由X軸或Y軸長條統計圖上之最小特定點及最大特定點，以多項式回歸分析的方式判定物體的形狀；若判定物體為線性走向即可判斷為三角形；若為直線走向則判定為方行；若符合二次方程式則判斷為圓形；Step 5: Determine the shape of the object by polynomial regression analysis by using the minimum specific point and the maximum specific point on the X-axis or Y-axis bar graph; if the object is determined to be linear, it can be judged as a triangle; The straight line is judged as a square line; if it conforms to the quadratic equation, it is judged to be a circle;

歩驟6：藉由X軸或Y軸長條統計圖上之兩最小特定點間的距離判定物體大小；Step 6: determining the size of the object by the distance between the two minimum specific points on the X-axis or Y-axis bar graph;

歩驟7：藉由X軸或Y軸長條統計圖上之最小特定點判定物體位置；Step 7: determining the position of the object by a minimum specific point on the X-axis or Y-axis bar graph;

歩驟8：再抓取其他可辨識特徵的物體影像，並重覆步驟2至步驟7，以辨識該影像中之其他特徵。Step 8: Grab the image of the object of other identifiable features and repeat steps 2 through 7 to identify other features in the image.

請參閱圖一所示，為本發明在影像中辨識物體的方法之流程步驟圖，主要包括：Please refer to FIG. 1 , which is a flow chart of a method for identifying an object in an image according to the present invention, which mainly includes:

步驟1：係藉由數位影像擷取裝置對物體進行數位影像的拍攝，以取得一數位影像資料；另外，該數位影像資料亦可藉由一類比影像擷取裝置取得，再將該類比信號轉換成數位信號，同樣可取一數位影像資料，該數位影像資料即為待辨識物體；再者，該數位影像擷取裝置或類比影像擷取裝置可為照相機或攝影機或其他可擷取影像之裝置801；Step 1: The digital image capturing device performs digital image capturing on the object to obtain a digital image data. In addition, the digital image data can also be obtained by an analog image capturing device, and then the analog signal is converted. The digital signal can also take a digital image, and the digital image data is the object to be identified; further, the digital image capturing device or the analog image capturing device can be a camera or a camera or other device capable of capturing images. ;

步驟2：再將數位影像可之辨識特徵擷取出802；例如，該影像為彩色影像時，即可依照RGB之色彩特性，擷取某單一顏色的可辨識影像特徵；或者，當影像為一灰階影像時，即可擷取該影像可辨識灰階的特徵；或者，可以擷取之數位影像資料為可見光譜頻率範圍時，同樣係僅擷取該單一光譜之影像可辨識特徵；Step 2: The digital image can be identified and extracted 802; for example, when the image is a color image, the identifiable image feature of a single color can be captured according to the color characteristics of RGB; or, when the image is gray In the case of the order image, the image can be captured to identify the features of the gray scale; or, when the digital image data that can be captured is in the visible spectral frequency range, only the image identifiable feature of the single spectrum is captured;

步驟3：將影像可辨識特徵擷取出後，即可同時計算出該擷取影像之X軸方向(水平)或Y軸方向(垂直)長條統計圖803；而取得X軸或Y軸長條統計圖的方式，係計算該影像可辨識特徵之每一行或每一列上的像素，如此，即可取得X軸或Y軸長條統計圖；Step 3: After extracting the image recognizable feature, the X-axis direction (horizontal) or Y-axis direction (vertical) strip chart 803 of the captured image can be simultaneously calculated; and the X-axis or Y-axis strip is obtained. The method of calculating the graph is to calculate the pixels on each row or column of the image recognizable feature, so that the X-axis or Y-axis strip chart can be obtained;

步驟4：取得X軸或Y軸長條統計圖後，再求出X軸或Y軸長條統計圖之最小特定點及最大特定點，該最小特定點及最大特定點係為可辨識特徵影像中的零點像素及最大值點像素804；另外，最小及最大特定點之找尋方式，係藉由線性搜尋模式於X軸或Y軸長條統計圖中被求出，並同時並註記與記錄該最小特定點及最大特定點的位置；另外，亦可以不同搜尋演算法或方法求出X軸或Y軸長條統計圖資料中的最小特定點及最大特定點；Step 4: After obtaining the X-axis or Y-axis bar graph, the minimum specific point and the maximum specific point of the X-axis or Y-axis bar graph are obtained, and the minimum specific point and the largest specific point are the identifiable feature images. The zero point pixel and the maximum point point pixel 804; in addition, the minimum and maximum specific point finding method is obtained by the linear search mode in the X-axis or Y-axis strip chart, and simultaneously note and record the The minimum specific point and the position of the largest specific point; in addition, different search algorithms or methods may be used to find the minimum specific point and the maximum specific point in the X-axis or Y-axis long chart data;

步驟5：再透過多項式回歸分析方式，根據求出的最小特定點及最大特定點定義出物體形狀805；例如：若最小特定點至最大特定點間係呈線性走向，即可判定該待辨識物體應為三角形；若最小特定點至最大特定點間係呈直線走向，即可判定該待辨識物體應為四方形或矩形；若最小特定點至最大特定點間係符合二次方程式，即可判定該待辨識物體應為圓形或橢圓形；Step 5: Through the polynomial regression analysis method, the object shape 805 is defined according to the minimum specific point and the largest specific point obtained; for example, if the minimum specific point to the maximum specific point is linear, the object to be identified can be determined. It should be a triangle; if the minimum specific point to the maximum specific point is a straight line, it can be determined that the object to be identified should be square or rectangular; if the minimum specific point to the maximum specific point is in accordance with the quadratic equation, it can be determined The object to be identified should be circular or elliptical;

步驟6：再依照影像之X軸或Y軸長條統計圖顯示資料與最小特定點及最大特定點的位置，即可取得影像在X軸或Y軸方向的像素值，透過該像素值即可準確計算出拍攝物體的大小尺寸806；Step 6: According to the X-axis or Y-axis long chart of the image, the position of the data and the minimum specific point and the maximum specific point can be obtained, and the pixel value of the image in the X-axis or the Y-axis direction can be obtained, and the pixel value can be obtained through the pixel value. Accurately calculate the size and size of the object 806;

步驟7：再根據影像之X軸及Y軸長條統計圖資料與最小特點的位置判定出物體的正確位置807；Step 7: According to the X-axis and Y-axis long chart data of the image and the position of the smallest feature, the correct position of the object is determined 807;

步驟8：再擷取上述數位影像資料中不同辨識特徵的影像資料(如其他不同色彩或光譜頻率的特徵)，並重複步驟2至步驟7之處理流程，如此，將同一影像中不同的辨識特徵作多次個別的辨識處理，以達到更準確及更快速辨識物體形狀、大小及位置之目的。Step 8: Retrieve the image data of different identification features (such as other characteristics of different colors or spectral frequencies) in the above digital image data, and repeat the processing flow of steps 2 to 7, so that different identification features in the same image are obtained. Perform multiple identification processes to achieve more accurate and faster identification of the shape, size and position of the object.

請參閱圖二至圖五所示，係本發明之第一實施示意圖，該影像辨識步驟如下：Please refer to FIG. 2 to FIG. 5 , which are schematic diagrams of the first implementation of the present invention. The image recognition steps are as follows:

步驟1：如圖二所示，該白色背景上具有五種態樣的數位影像資料，分別為一紅色三角形1、一紅色圓形2、一紅色四方形3、一綠色三角形4及一藍色四方形5；該影像可透過數位電腦繪製而成，或透過數位或類比影像擷取裝置拍攝而成；Step 1: As shown in FIG. 2, the white background has five kinds of digital image data, which are a red triangle 1, a red circle 2, a red square 3, a green triangle 4, and a blue color. Square 5; the image can be drawn through a digital computer or captured by a digital or analog image capture device;

步驟2：如圖三所示，當取得數位影像質料後，即可依照影像可辨識之特徵，決定先擷取具有紅色特徵之影像，因此，圖二中具有紅色特徵之紅色三角形1、紅色圓形2及紅色四方形3皆會被擷取出，並開始進行影像辨識，而非紅色特徵之影像，如綠色三角形4及藍色四方形5會在影像擷取過程中被移除；Step 2: As shown in Figure 3, after obtaining the digital image material, it is possible to determine the image with the red feature according to the identifiable features of the image. Therefore, the red triangle with the red feature in Figure 2 is a red circle. Both Shape 2 and Red Square 3 will be removed and image recognition will begin, instead of red features, such as Green Triangle 4 and Blue Square 5 will be removed during image capture;

步驟3：計算X軸及Y軸長條統計圖；如圖四所示，同時計算圖三中所擷取影像內的每一行上的紅色(非黑)像素，以取得X軸(水平)長條統計圖，該X軸長條統計圖上會顯示出與紅色三角形1、紅色圓形2及紅色四方形3相對應之三角長條統計圖101、曲線長條統計圖201及方形長條統計圖301，而X軸長條統計圖之水平方向所載數值係代表行數，而垂直方向則代表像素點值；該X軸長條統計圖所顯示的資料保有足夠資訊，以判定在原始輸入影像中之物體形狀與水平物體大小及位置；如圖五所示，係計算圖三中經擷取影像內每一列上的紅色(非黑)像素，以取得之Y軸(垂直)長條統計圖；該Y軸長條統計圖上會顯示出與紅色三角形1、紅色圓形2及紅色四方形3相對應之三角長條統計圖102、曲線長條統計圖202及方形長條統計圖302，而Y軸長條統計圖之水平方向所載數值係代表列數，而垂直方向則代表像素點值；該Y軸長條統計圖資料保有足夠資訊，以判定在原始輸入影像中之物體形狀與垂直物體大小及位置；Step 3: Calculate the X-axis and Y-axis strip charts; as shown in Figure 4, simultaneously calculate the red (non-black) pixels on each line in the captured image in Figure 3 to obtain the X-axis (horizontal) length. For the chart, the X-axis strip chart will display the triangle strip chart 101 corresponding to the red triangle 1, the red circle 2 and the red square 3, the curve strip chart 201 and the square strip statistics. Figure 301, and the value in the horizontal direction of the X-axis bar graph represents the number of rows, while the vertical direction represents the pixel value; the data displayed in the X-axis bar graph retains enough information to determine the original input. The shape of the object in the image and the size and position of the horizontal object; as shown in Figure 5, calculate the red (non-black) pixels in each column of the captured image in Figure 3 to obtain the Y-axis (vertical) strip statistics. Figure; the Y-axis strip chart will display the triangular strip chart 102 corresponding to the red triangle 1, the red circle 2 and the red square 3, the curve strip chart 202 and the square strip chart 302 , and the values in the horizontal direction of the Y-axis bar graph represent the number of columns. The vertical direction represents pixel values; the Y-axis strip chart data retain enough information to determine the object size and the shape of the vertical position of the object in the original input image;

步驟4：再依照X軸或Y軸長條統計圖求出最小特定點及最大特定點，如圖四所示，係以線性搜尋方式求出二最小特定點及一最大特定點，同時將求出之二最小特定點及最大特定點的位置(行數)註記與記錄；以X軸長條統計圖為例，該二最小特定點之像素值皆為0點，故經由像素0點朝對應行數的位置逐行尋找，即可在第50行中蒐尋到三角長條統計圖101的第一個最小特定點A，並將第50行加以註記及記錄，緊接著會搜尋到三角長型統計圖之第二個最小特定點C為第170行，及最大特定點B為像素85點，並將所搜尋的特定點B、C註記及記錄；接著搜尋出曲線長條統計圖201及方形長條統計圖301之最小特定點D、F、G、J及最大特定點E、H、I；其中，該最小特定點D為273行，F為362行，G為498行，J為563行，而最大特定點E像素為86點，H及I像素為86點；最後搜尋至X軸長條統計圖的終端；同理，圖五所示之Y軸長條統計圖所顯示之三角長條統計圖102、曲線長條統計圖202及方形長條統計圖302的最小特定點及最大特定點同樣以相同方式求出，以此不在贅述，可得到三角長條統計圖102的最小特定點K及M分別為50行及136行，而最大特定點L為像素119點；該曲線長條統計圖202之最小特定點N及P分別為160行及245行，最高特定點O為像素88點；該方形長條統計圖302之最小特定點Q及T分別為280行及365行，最高特定點R及S為像素65點，依序搜尋到Y軸長條統計圖的終端；Step 4: According to the X-axis or Y-axis bar graph, the minimum specific point and the maximum specific point are obtained. As shown in FIG. 4, the second minimum specific point and the largest specific point are obtained by linear search, and will be sought The minimum specific point and the position of the largest specific point (the number of rows) are recorded and recorded. Taking the X-axis long chart as an example, the pixel values of the two minimum specific points are all 0 points, so The position of the number of rows is searched line by line, and the first smallest specific point A of the triangular strip chart 101 can be searched in the 50th line, and the 50th line is noted and recorded, and then the triangle long type is searched. The second minimum specific point C of the statistical map is the 170th line, and the maximum specific point B is the pixel 85 points, and the specific points B and C searched for are recorded and recorded; then the curve strip chart 201 and the square are searched. The minimum specific points D, F, G, J and the maximum specific points E, H, and I of the long chart 301; wherein the minimum specific point D is 273 lines, F is 362 lines, G is 498 lines, and J is 563. Line, and the maximum specific point E pixel is 86 points, H and I pixels are 86 points; finally search to the end of the X-axis strip chart Similarly, the minimum specific point and the maximum specific point of the triangular strip chart 102, the curve strip chart 202 and the square strip chart 302 shown in the Y-axis bar graph shown in FIG. 5 are also in the same manner. The minimum specific points K and M of the triangular strip chart 102 are 50 rows and 136 rows, respectively, and the maximum specific point L is 119 dots; the minimum of the curve bar graph 202 is obtained. The specific points N and P are 160 lines and 245 lines, respectively, and the highest specific point O is 88 points; the minimum specific points Q and T of the square strip chart 302 are 280 lines and 365 lines, respectively, and the highest specific points R and S For the pixel 65 points, sequentially search for the terminal of the Y-axis long chart;

步驟5：取得最小特定點及最大特定點後，請參照圖四所示，係透過X軸長條統計圖中加以回歸(regression)分析，以判斷出可能之物體形狀；係經由被註記及記錄為最小特定點及最大特定點之位置，而定義出三角長條統計圖101之線條1011及1012與曲線長條統計圖201之線條2011與方形長條統計圖301之線條3011、3012、3013的邊界形狀；Step 5: After obtaining the minimum specific point and the maximum specific point, please refer to Figure 4, and perform regression analysis through the X-axis bar graph to determine the shape of the possible object; For the minimum specific point and the position of the largest specific point, the lines 1011 and 1012 of the triangular strip chart 101 and the line 2011 of the curve strip chart 201 and the lines 3011, 3012, 3013 of the square strip chart 301 are defined. Boundary shape

首先，係針對三角長條統計圖101之線條1011的多項式回歸分析的結果：First, the results of the polynomial regression analysis for the line 1011 of the triangular strip chart 101:

Degree 1:-70.28+1.398x,α=0.05,p<0.0001,p<0.0001,R^2=1.00；Degree 1:-70.28+1.398x, α=0.05, p<0.0001, p<0.0001, R^2=1.00;

Degree 2：-70.71+1.41x-7.0424e-5x^2,a=0.05,p<0.0001,p<0.0001,p=0.6085,R^2=1.00；Degree 2: -70.71+1.41x-7.0424e-5x^2, a=0.05, p<0.0001, p<0.0001, p=0.6085, R^2=1.00;

結果顯示三角長條統計圖101很可能與斜率1.398互為線性，多項次回歸分析結果顯示，第二級項(Degree 2)在a=0.05時在統計上為不顯著。The results show that the triangular strip chart 101 is likely to be linear with the slope 1.398, and the results of the multiple regression analysis show that the second level (Degree 2) is statistically insignificant at a=0.05.

而針對線條1012的多項式回歸分析結果如下：The result of the polynomial regression analysis for line 1012 is as follows:

Degree 1:237.2-1.398x,a=0.05,p<0.0001,p<0.0001,R^2=1.00Degree 1:237.2-1.398x, a=0.05, p<0.0001, p<0.0001, R^2=1.00

Degree 2：238.5-1.418x+7.0424e-5x^2,a=0.05,p<0.0001,p<0.0001,p=0.6085,R^2=1.00Degree 2:238.5-1.418x+7.0424e-5x^2, a=0.05, p<0.0001, p<0.0001, p=0.6085, R^2=1.00

結果顯示三角長條統計圖101可能與斜率1.398互為線性，多項式回歸分析結果顯示，第二級項(Degree 2)在α=0.05時在統計上為不顯著。The results show that the triangular strip chart 101 may be linear with the slope 1.398, and the results of the polynomial regression analysis show that the second level item (Degree 2) is statistically insignificant when α=0.05.

因此，線條1011與1021之聯合結果顯示，在圖二中物體為紅色三角形1；接著，進行曲線長條統計圖201之線條2011的多項式回歸分析結果如下：Therefore, the joint result of the lines 1011 and 1021 shows that the object is a red triangle 1 in Fig. 2; then, the polynomial regression analysis of the line 2011 of the curve strip chart 201 is as follows:

Degree 1:48.71+0.05037x,α=0.05,p=0.0784,p=0.5589,R^2=0.00；Degree 1:48.71+0.05037x, α=0.05, p=0.0784, p=0.5589, R^2=0.00;

Degree 2:-3331+21.48x-0.03375x^2,α=0.05,p<0.0001,p<0.0001,p<0.0001,R^2=0.95；Degree 2: -3331 + 21.48x - 0.03375x^2, α = 0.05, p < 0.0001, p < 0.0001, p < 0.0001, R^2 = 0.95;

結果顯示曲線長條統計圖201之結果很可能為圓形或卵形，多項式回歸分析結果顯示，第二級項(Degree 2)在α=0.05時在統計上為顯著。The results show that the results of the curve bar graph 201 are likely to be round or oval, and the results of the polynomial regression analysis show that the second term (Degree 2) is statistically significant at α=0.05.

因此，結果顯示圖二中物體為紅色圓形2。Therefore, the result shows that the object in Figure 2 is a red circle 2.

接著，由於方形長條統計圖301之線條3011、3012及3013為垂直與水平之線條，故無法進多項式回歸分析；但是，該線條3011係代表零點最小特定值至最大特定值之直線，而線條3012包含所有相等之最大特定值，且線條3013代表最大特定值至零值最小特定值之直線，故該線條3011、3012、3013之組成物體為一紅色四方形3。Then, since the lines 3011, 3012, and 3013 of the square strip chart 301 are vertical and horizontal lines, the polynomial regression analysis cannot be performed; however, the line 3011 represents a line from the minimum specific value of the zero point to the maximum specific value, and the line 3012 contains all equal maximum specific values, and line 3013 represents a straight line from the maximum specific value to the minimum specific value of zero value, so the constituent objects of the lines 3011, 3012, and 3013 are a red square 3.

步驟6：得知物體形狀後，再進行物體大小之判定，係如圖四所示，該三角長條統計圖101之最小特定點A、C為50行及170行，最大特定點B為85個像素，因此，紅色三角形1之實際大小尺寸可由照相機及影像特性與照相機在實際環境中的位置判定出。Step 6: After knowing the shape of the object, the object size is determined. As shown in FIG. 4, the minimum specific points A and C of the triangular strip chart 101 are 50 lines and 170 lines, and the maximum specific point B is 85. The pixels, therefore, the actual size of the red triangle 1 can be determined by the camera and image characteristics and the position of the camera in the actual environment.

該曲線長條統計圖201之最小特定點D、F為273行及362行，最大特定點E為86個像素，因此，若欲計算圖二之紅色圓形2之大小，可由照相機及影像特性與照相機在實際環境中的位置判定出。The minimum specific points D and F of the curve bar graph 201 are 273 lines and 362 lines, and the maximum specific point E is 86 pixels. Therefore, if the size of the red circle 2 of FIG. 2 is to be calculated, the camera and image characteristics can be obtained. Determined with the position of the camera in the actual environment.

該方形長條統計圖301之最小特定點G、J為498行及563行，最大特定點H、I為86個像素，因此，若欲計算圖二之紅色四方形3之大小，可由照相機及影像特性與照相機在實際環境中的位置判定出。The minimum specific points G and J of the square strip chart 301 are 498 lines and 563 lines, and the maximum specific points H and I are 86 pixels. Therefore, if the size of the red square 3 in FIG. 2 is to be calculated, the camera and The image characteristics are determined by the position of the camera in the actual environment.

步驟7：最後，計算物體的位置，請同時參閱圖四及圖五所示，係透過X軸長條統計圖及Y軸長條統計圖所得之最小特定點A、C、D、F、G、J、K、M、N、P、Q、T進行物體位置的判斷，其中，該三角長條統計圖101之X軸最小特定點A、C為50至170；而Y軸最小特定點K、M則為52至136，透過X軸及Y軸之最小特定點A、C、K、M資料，即可得知三角長條統計圖的位置；該曲線長條統計圖201之X軸最小特定點D、F為273至362；而Y軸最小特定點N、P則為160至245，透過X軸及Y軸之最小特定點D、F、N、P資料，即可得知曲線長條統計圖的位置；該方形長條統計圖301之X軸最小特定點G、J為498至563；而Y軸最小特定點Q、T則為280至365，透過X軸及Y軸之最小特定點G、J、Q、T資料，即可得知方形長條統計圖的位置；另外，在必要時，真實物體的大小通常可由照相機及影像特性與照相機在實際環境中的位置判定出。Step 7: Finally, calculate the position of the object. Please refer to Figure 4 and Figure 5 below. The minimum specific points A, C, D, F, G obtained through the X-axis strip chart and the Y-axis strip chart. J, K, M, N, P, Q, T determine the position of the object, wherein the minimum specific points A and C of the X-axis of the triangular strip chart 101 are 50 to 170; and the minimum specific point K of the Y-axis And M is 52 to 136. The position of the triangular stripe chart can be known through the minimum specific points A, C, K, and M of the X-axis and the Y-axis; the X-axis of the curve strip chart 201 is the smallest. The specific points D and F are 273 to 362; and the minimum specific points N and P of the Y axis are 160 to 245. The minimum specific points D, F, N, and P of the X and Y axes can be used to know the long curve. The position of the bar chart; the minimum specific point G and J of the X-axis of the square bar chart 301 is 498 to 563; and the minimum specific point Q and T of the Y-axis is 280 to 365, and the minimum of the X-axis and the Y-axis At a specific point G, J, Q, T data, you can know the position of the square strip chart; in addition, when necessary, the size of the real object can usually be determined by the camera and image characteristics and the camera in the actual environment. The position is judged.

步驟8：可換擷取綠色辨識特徵或藍色辨識特徵的影像資料，並重複上述歩驟2~7的步驟流程，即可取得辨識物體之形狀、大小及位置。Step 8: The image data of the green identification feature or the blue identification feature can be exchanged, and the steps of the steps 2 to 7 can be repeated to obtain the shape, size and position of the identified object.

因此，本發明之影像辨識係在無圖案比對與圖案比對資料庫的條件下完成，較習用物體辨識方法改善了物體辨識速度與準確度。另外，物體形狀資訊被求出，係利用多項式回歸分析，如此亦根據R^2適合分析提供對形狀辨識準確度之計量。此外，辨識物體形狀時不會有不準確與耗時直接輸入影像搜尋之情形。Therefore, the image recognition system of the present invention is completed under the condition of no pattern comparison and pattern comparison database, and the object recognition method improves the object recognition speed and accuracy. In addition, the object shape information is obtained by using polynomial regression analysis, and thus the measurement of the shape identification accuracy is also provided according to the R^2 suitable analysis. In addition, there is no inaccurate and time-consuming input image search when identifying the shape of the object.

請同時參閱圖六至圖九所示，係本發明之第二實施示意圖，其主要係應用在交通號誌影像辨識上，主要辨識步驟為：Please refer to FIG. 6 to FIG. 9 at the same time, which is a schematic diagram of the second embodiment of the present invention, which is mainly applied to image recognition of traffic signals, and the main identification steps are as follows:

歩驟1：如圖六所示，係透過影像擷取裝置拍攝拍攝道路邊之交通號誌，以取得該交通號誌影像6，該交通號誌影像6具有紅色外環61、白色內環62及黑色字體63等影像可辨識特徵，同時，該交通號誌影像6中亦包含其它紅色特徵物體，如：樹上的紅花71與複數朵在灌木中的紅花71，該紅花71有些部位被綠葉所遮蔽；Step 1: As shown in FIG. 6, the traffic sign is taken by the image capturing device to obtain the traffic sign image 6 of the road. The traffic sign image 6 has a red outer ring 61 and a white inner ring 62. And the black font 63 and other image recognizable features, at the same time, the traffic sign image 6 also contains other red features, such as: safflower 71 on the tree and a plurality of safflower 71 in the shrub, the safflower 71 is partially covered by green leaves Obscured

歩驟2：如圖七所示，係抓取圖六中具有紅色特徵得影像資料，故可取得交通號誌影像6之紅色外環61及樹上紅花71與灌木叢中紅花71等影像特徵；Step 2: As shown in Figure 7, the image data with red features in Figure 6 is captured, so the image features of the red outer ring 61 of the traffic sign image 6, the safflower 71 on the tree, and the safflower 71 in the bush can be obtained. ;

步驟3：如圖八及圖九所示，係經由計算而取得X軸及Y軸長條統計圖，該X軸及Y軸長條統計圖上包含紅色外環長條統計圖501、樹上紅花統計圖502及灌木叢上複數朵紅花統計圖503，其中，長條統計圖501之獨特圖形，是原始環狀物體形狀的特徵；且經由圖九之Y軸長條統計圖可知紅色外環61與樹上紅花71在Y軸(垂直)方向上重疊，故將該紅色外環長條統計圖501及樹上紅花長條統計圖502資料彼此相加，以在待辨識物體對應之紅色外環長條統計圖501資料上加入雜訊；另外，長條統計圖501之獨特圖形，是原始環狀物體形狀的特徵；上述X軸及Y軸長條統計圖之計算模式與圖四及圖五相同，於此不在贅述；Step 3: As shown in FIG. 8 and FIG. 9 , the X-axis and Y-axis bar graphs are obtained through calculation, and the X-axis and Y-axis bar graphs include a red outer ring strip graph 501 and a tree. The safflower chart 502 and the plurality of safflower statistics 503 on the bush, wherein the unique figure of the long chart 501 is a feature of the shape of the original ring object; and the red outer ring is known by the Y-axis bar chart of FIG. 61 overlaps with the safflower 71 on the tree in the Y-axis (vertical) direction, so the red outer ring strip graph 501 and the tree safflower strip graph 502 data are added to each other to match the red color corresponding to the object to be identified. The ring strip chart 501 adds noise to the data; in addition, the unique graph of the long chart 501 is the feature of the original ring shape; the calculation mode of the X-axis and Y-axis strip charts and Figure 4 and The same is true, and it is not mentioned here;

歩驟4：如圖八及圖九所示，係利用零行至3264行掃描X軸及Y軸長條統計圖資料，以求出X軸紅色外環長條統計圖501之最小特定點A’、D’及最大特定點B’、C’及Y軸紅色外環長條統計圖501之最小特定點E’、H’及最大特定點F’、G’；該最小特定點A’為1112行，D’為2040行、E’為1099行、H’為2078行；該最大特定點B’及C’之像素值皆為710點，而F’及G’之像素值為660及650；另外，該最小特定點A’、D’、E’、H’及最大特定點B’、C’、F’、G’之找尋方式皆與圖四及圖五相同，於此不在贅述；Step 4: As shown in FIG. 8 and FIG. 9 , the X-axis and Y-axis long bar graph data are scanned from zero rows to 3264 rows to obtain the minimum specific point A of the X-axis red outer loop strip graph 501. ', D' and the maximum specific point B', C' and the Y-axis red outer ring strip chart 501 minimum specific points E', H' and the maximum specific points F', G'; the minimum specific point A' is 1112 lines, D' is 2040 lines, E' is 1099 lines, H' is 2078 lines; the maximum specific points B' and C' have pixel values of 710 points, and F' and G' have pixel values of 660 and 650; In addition, the search methods of the minimum specific points A', D', E', H' and the maximum specific points B', C', F', G' are the same as those in FIG. 4 and FIG. 5, and are not described herein. ;

歩驟5：取得最小特定點及最大特定點後，利用圖八所示之三組線條5011、5012、5013搭配多項式回歸分析(regression)判斷物體之形狀；Step 5: After obtaining the minimum specific point and the maximum specific point, use the three sets of lines 5011, 5012, and 5013 shown in FIG. 8 to determine the shape of the object by using polynomial regression analysis (regression);

圖八中線條5011之多項式回歸分析結果如下：The polynomial regression analysis of line 5011 in Figure 8 is as follows:

Degree 1:-4439+4.145x,α=0.05,p<0.0001,p<0.0001,R^2=0.93Degree 1:-4439+4.145x, α=0.05, p<0.0001, p<0.0001, R^2=0.93

Degree 2:-42765+68.91x-0.02733x^2,α=0.05,p<0.0001,p<0.0001,p=0.0001,R^2=0.99Degree 2:-42765+68.91x-0.02733x^2, α=0.05, p<0.0001, p<0.0001, p=0.0001, R^2=0.99

結果顯示該線條5011很可能為圓形或卵形，且多項式回歸分析結果顯示，第二級項在α=0.05時在統計上為顯著；The results show that the line 5011 is likely to be round or oval, and the polynomial regression analysis shows that the second level term is statistically significant at α=0.05;

圖八中線條5012之多項式回歸分析結果如下：The polynomial regression analysis of line 5012 in Figure 8 is as follows:

Degree 1:352.3+0.008745x,α=0.05,p<0.0001,p=0.6111,R^2=0.00Degree 1:352.3+0.008745x, α=0.05, p<0.0001, p=0.6111, R^2=0.00

Degree 2:6456-7.861x+0.002503x^2,α=0.05,p<0.0001,p<0.0001,p<0.0001,R^2=0.89Degree 2: 6456-7.861x+0.002503x^2, α=0.05, p<0.0001, p<0.0001, p<0.0001, R^2=0.89

結果顯示線條5012可能為圓形或卵形，且多項次回歸分析結果顯示，第二級項在α=0.05時在統計上為顯著；圖八中線條5013多項式回歸分析結果如下：The results show that the line 5012 may be round or oval, and the results of multiple regression analysis show that the second level term is statistically significant at α=0.05; the results of the line 5013 polynomial regression analysis in Figure 8 are as follows:

Degree 1:9155-4.43x,α=0.05,p<0.0001,p<0.0001,R^2=0.92Degree 1:9155-4.43x, α=0.05, p<0.0001, p<0.0001, R^2=0.92

Degree 2:-102241+109x-0.02887x^2,α=0.05,p<0.0001,p<0.0001,p=0.0001,R^2=0.99Degree 2:-102241+109x-0.02887x^2, α=0.05, p<0.0001, p<0.0001, p=0.0001, R^2=0.99

結果顯示線條5013可能為圓形或卵形，且多項式回歸分析，第二級項在α=0.05時在統計上為顯著；因此，線條5011、5012、5013之聯合結果顯示，圖八所顯示之長條統計圖501很可能為環狀體之圓形或卵形。The results show that line 5013 may be circular or oval, and polynomial regression analysis, the second level term is statistically significant at α = 0.05; therefore, the combined results of lines 5011, 5012, 5013 show that Figure 8 shows The strip chart 501 is likely to be circular or oval in shape.

歩驟6：得知物體形狀後，再進行物體大小之判定，係參照圖八所示，該X軸方向之紅色外環長條統計圖501之最小特定點A’、D’為1112行及2040行，最大特定點B’、C’為710個像素；而圖九所示之Y軸方向最小特定點E’、H’為1099行及2078行，的最大特定點F，為660個像素、G’為650個像素，經由上述條件及可記算出物體正確的大小尺寸；另外，物體之正確大小尺寸可由照相機及影像特性與照相機在實際環境中的位置判定出，於此不在贅述。Step 6: After the shape of the object is known, the object size is determined. Referring to FIG. 8, the minimum specific points A' and D' of the red outer ring strip chart 501 in the X-axis direction are 1112 lines and In 2040 lines, the maximum specific points B' and C' are 710 pixels; and the minimum specific points E' and H' in the Y-axis direction shown in FIG. 9 are 1099 lines and 2078 lines, and the maximum specific point F is 660 pixels. G' is 650 pixels. According to the above conditions, the correct size of the object can be calculated. In addition, the correct size of the object can be determined by the camera and image characteristics and the position of the camera in the actual environment, and will not be described here.

歩驟7：最後，計算物體的位置，請同時參閱圖八及圖九所示，係透過X軸長條統計圖及Y軸長條統計圖所得之最小特定點A’、D’、E’、H’進行物體位置的判斷，而判斷方式與上述圖四及圖五所揭露之方式相同，於此不在贅述；Step 7: Finally, calculate the position of the object. Please refer to Figure 8 and Figure 9. The minimum specific points A', D', E' obtained through the X-axis strip chart and the Y-axis strip chart. And H' judge the position of the object, and the manner of judgment is the same as that disclosed in the above FIG. 4 and FIG. 5, and is not described herein;

歩驟8：再重新擷取其他影像可辨識特徵，並重複步驟2~7即可重新進行影像之辨識。Step 8: Re-take the other image identifiable features and repeat steps 2~7 to re-identify the image.

因此，透過本發明方法所取得物體之色彩、形狀、大小及/或位置等資訊，除可應用於交通號誌上，並可應用於其他用途或領域上。Therefore, the information such as the color, shape, size and/or position of the object obtained by the method of the present invention can be applied to traffic signs and can be applied to other purposes or fields.

另外，圖八所顯示之樹上紅花統計圖502，經由多項式回歸分析顯示，該物體不具有良好定義的線性或曲線，因為第一與第二級分析之R^2值很小：In addition, the safflower chart 502 on the tree shown in FIG. 8 shows that the object does not have a well-defined linearity or curve via polynomial regression analysis because the R^2 values of the first and second levels of analysis are small:

Degree 1:-1653+0.5543x,α=0.05,p<0.0001,p<0.0001,R^2=0.48Degree 1:-1653+0.5543x, α=0.05, p<0.0001, p<0.0001, R^2=0.48

Degree 2:-155477+102x-0.01672x^2,α=0.05,p<0.0001,p<0.0001,p=0.0001,R^2=0.66Degree 2:-155477+102x-0.01672x^2, α=0.05, p<0.0001, p<0.0001, p=0.0001, R^2=0.66

因此，該物體明顯非為圓形或線性交通號誌物體。Therefore, the object is clearly not a circular or linear traffic sign object.

上述利用多項式回歸分析影像形狀時，並不會受到雜訊存在影響，如圖九所示，若針對Y軸長條統計圖所顯示之線條5014進行多項式回歸分析結果如下：When using the polynomial regression to analyze the image shape, it is not affected by noise. As shown in Figure 9, the polynomial regression analysis results for the line 5014 displayed on the Y-axis strip chart are as follows:

Degree 1:393-0.02945x,α=0.05,p<0.0001,p=0.0370,R^2=0.01Degree 1:393-0.02945x, α=0.05, p<0.0001, p=0.0370, R^2=0.01

Degree 2:5350-6.353x+0.001987x^2,α=0.05,p<0.0001,p<0.0001,p<0.0001,R^2=0.9Degree 2: 5350-6.353x+0.001987x^2, α=0.05, p<0.0001, p<0.0001, p<0.0001, R^2=0.9

結果顯示長條統計圖501之線條5014可能為圓形或卵形(非為線性)，且多項式回歸分析結果顯示第二級項在α=0.05時為統計上顯著，此與圖八X軸長條統計圖之線條5012相似，故Y軸長條統計圖之形狀辨識不會受到樹上紅花之雜訊影響。The results show that the line 5014 of the long chart 501 may be circular or oval (not linear), and the result of the polynomial regression analysis shows that the second level term is statistically significant at α=0.05, which is longer than the X-axis of Figure 8. The line 5012 of the chart is similar, so the shape recognition of the Y-axis strip chart is not affected by the noise of the safflower on the tree.

本發明所提供之在影像中辨識物體的方法，與其他習用技術相互比較時，更具備下列優點：The method for recognizing an object in an image provided by the present invention has the following advantages when compared with other conventional technologies:

1.本發明係藉由X軸或Y軸長條統計圖上取得之最小特定點及最大特定點，配合多項式回歸分析，即可快速並簡易判定物體形狀、大小及位置之在影像中辨識物體的方法。1. The present invention is capable of quickly and easily determining the shape, size and position of an object in an image by using a minimum specific point and a maximum specific point obtained on an X-axis or Y-axis strip chart. Methods.

2.本發明係可擷取不同的可辨識影像特徵，如RGB 色彩特徵或灰階影像特徵或視頻影像特徵，並迅速及準確對該擷取影像進行形狀、大小及位置的辨識，使其可應用於交通號誌或其他領域上。2. The invention can capture different identifiable image features, such as RGB color features or grayscale image features or video image features, and quickly and accurately identify the shape, size and position of the captured image, so that Used in traffic signs or other fields.

上列詳細說明係針對本發明之一可行實施例之具體說明，惟該實施例並非用以限制本發明之專利範圍，凡未脫離本發明技藝精神所為之等效實施或變更，均應包含於本案之專利範圍中。The detailed description of the preferred embodiments of the present invention is intended to be limited to the scope of the invention, and is not intended to limit the scope of the invention. The patent scope of this case.

綜上所述，本案不但在技術思想上確屬創新，並能較習用物品增進上述多項功效，應已充分符合新穎性及進步性之法定發明專利要件，爰依法提出申請，懇請　貴局核准本件發明專利申請案，以勵發明，至感德便。To sum up, this case is not only innovative in terms of technical thinking, but also able to enhance the above-mentioned multiple functions compared with conventional articles. It should fully comply with the statutory invention patent requirements of novelty and progressiveness, and apply in accordance with the law. I urge you to approve this article. Invention patent application, in order to invent invention, to the sense of virtue.

1．．．紅色三角形1. . . Red triangle

2．．．紅色圓形2. . . Red circle

3．．．紅色四方形3. . . Red square

4．．．綠色三角形4. . . Green triangle

5．．．藍色四方形5. . . Blue square

6．．．交通標誌6. . . Traffic signs

61．．．紅色外環61. . . Red outer ring

62．．．白色內環62. . . White inner ring

63．．．黑色字體63. . . Black font

71．．．紅花71. . . safflower

101．．．三角長條統計圖101. . . Triangle strip chart

1011．．．線條1011. . . line

1012．．．線條1012. . . line

102．．．三角長條統計圖102. . . Triangle strip chart

201．．．曲線長條統計圖201. . . Curve strip chart

2011．．．線條2011. . . line

202．．．曲線長條統計圖202. . . Curve strip chart

301．．．方形長條統計圖301. . . Square strip chart

3011．．．線條3011. . . line

3012．．．線條3012. . . line

3013．．．線條3013. . . line

302．．．方形長條統計圖302. . . Square strip chart

501．．．外環長條統計圖501. . . Outer ring strip chart

5011．．．線條5011. . . line

5012．．．線條5012. . . line

5013．．．線條5013. . . line

5014．．．線條5014. . . line

502．．．樹木紅花長條統計圖502. . . Tree safflower strip chart

503．．．灌木紅花長條統計圖503. . . Shrub red flower strip chart

A、C、D、F、G、J．．．最小特定點A, C, D, F, G, J. . . Minimum specific point

B、E、H、I．．．最大特定點B, E, H, I. . . Maximum specific point

R、M、N、P、Q、T．．．最小特定點R, M, N, P, Q, T. . . Minimum specific point

L、O、R、S．．．最大特定點L, O, R, S. . . Maximum specific point

A’、D’、E’、H’．．．最小特定點A', D', E', H'. . . Minimum specific point

B’、C’、F’、G’．．．最大特定點B', C', F', G'. . . Maximum specific point

圖一為本發明在影像中辨識物體的方法之步驟圖；1 is a step diagram of a method for identifying an object in an image according to the present invention;

圖二為本發明在影像中辨識物體的方法之第一實施之五種物體色彩圖；2 is a color diagram of five objects of the first implementation of the method for identifying an object in an image;

圖三為抓取圖二中可辨識影像特徵示意圖；Figure 3 is a schematic diagram of the features of the identifiable image captured in Figure 2;

圖四為計算圖三中每一行彩色像素所得之X軸彩長條統計圖；Figure 4 is a graph of the X-axis color strips obtained by calculating the color pixels of each row in Figure 3;

圖五為計算圖三中每一列彩色像素所得之Y軸彩長條統計圖；Figure 5 is a statistical diagram of the Y-axis color strip obtained by calculating each column of color pixels in Figure 3.

圖六為本發明在影像中辨識物體的方法之第二實施彩色交通號誌圖；Figure 6 is a second embodiment of a color traffic locomogram of the method for identifying an object in an image of the present invention;

圖七為抓取圖六中可辨識影像特徵示意圖；Figure 7 is a schematic diagram of capturing the identifiable image features in Figure 6.

圖八為計算圖六中每一行彩色像素所得之X軸彩長條統計圖；以及Figure 8 is a graph of the X-axis color strips obtained by calculating the color pixels of each row in Figure 6;

圖九為計算圖六中每一列彩色像素所得之Y軸彩長條統計圖。Figure 9 is a statistical diagram of the Y-axis color strip obtained by calculating the color pixels of each column in Figure 6.

無元件代表符號No component representation symbol

Claims

A method for recognizing an object in an image, comprising: step 1: first obtaining a digital image data; step 2: extracting at least one object according to the identifiable feature of the digital image; and step 3: simultaneously calculating step 2 Take the X-axis or Y-axis long chart of the image; Step 4: Obtain the X-axis or Y-axis long bar graph, and then find the minimum specific point and the maximum specific point of the X-axis or Y-axis bar graph; 5: Through the polynomial regression analysis method, the shape of each object is determined according to the minimum specific point and the maximum specific point of the identifiable object.

The method for recognizing an object in an image as described in claim 1 further includes a step 6 of capturing the image data of the same image but different identifiable features in step 1, and repeating step 2 Go to the processing flow of step 5.

The method for recognizing an object in an image as described in claim 1 further includes a step 6, wherein the step 6 determines the size of the object according to the minimum specific point and the position of the largest specific point.

The method for recognizing an object in an image as described in claim 1 further includes a step 6 of determining the correct position of the object according to the minimum specific point and the position of the largest specific point.

The method for recognizing an object in an image according to the first aspect of the patent application, wherein the digital image data obtained in the step 1 is obtained by photographing an object by a digital image capturing device.

The party that identifies the object in the image as described in item 5 of the patent application scope The method wherein the digital image capturing device is a camera or a camera.

The method for recognizing an object in an image according to the first aspect of the patent application, wherein the digital image data obtained in the step 1 is obtained by capturing an object by an analog image capturing device, and converting the analog signal into a digital signal. To obtain a digital image.

A method for recognizing an object in an image as described in claim 7 wherein the analog image capturing device is a camera or a camera.

A method for recognizing an object in an image as described in claim 1 wherein the step 2 identifies an RGB or IHS color identifiable feature characterized by a single color.

The method for recognizing an object in an image according to the first aspect of the patent application, wherein the step 2 identifiable feature is a gray-scale identifiable feature.

A method for recognizing an object in an image as described in claim 1, wherein the step 2 identifiable feature is a video spectral frequency identifiable feature.

The method for recognizing an object in an image as described in claim 1, wherein the calculation method of the X-axis or Y-axis bar graph of the step 3 is performed on each row or column of the image recognizable feature. Pixels.

The method for recognizing an object in an image as described in claim 1, wherein the minimum specific point of the step 4 is obtained on the X-axis or Y-axis bar graph, and is obtained in a linear search mode. Zero pixels.

The method for recognizing an object in an image as described in claim 1, wherein the maximum specific point of the step 4 is obtained on the X-axis or Y-axis bar graph, and is obtained in a linear search mode. Maximum Value point pixels.

The method for recognizing an object in an image as described in claim 1, wherein the minimum specific point and the maximum specific point position obtained in the step 4 must be noted and recorded.

The method for recognizing an object in an image according to the first aspect of the patent application, wherein the polynomial regression analysis method of the step 5 is based on analyzing the line on the X-axis or the Y-axis strip chart, when the X-axis is Or the Y-axis strip chart has two lines connecting the two minimum specific points and one maximum specific point, and the two lines are linearly oriented, and it can be determined that the object to be recognized should be a triangle.

For example, in the method of identifying an object in an image as described in claim 15, when there are four lines on the X-axis or Y-axis strip chart, the two minimum specific points and the two largest specific points are connected, and the four lines are in a straight line. , it can be determined that the object to be recognized should be square or rectangular.

The method for recognizing an object in an image as described in claim 15 wherein the minimum specific point to the maximum specific point of the line conforms to a quadratic equation, and the object to be identified should be determined to be circular or elliptical. .

A method for recognizing an object in an image, comprising: step 1: first obtaining a digital image data; step 2: extracting at least one object according to the identifiable feature of the digital image; and step 3: simultaneously calculating step 2 Take the X-axis or Y-axis long chart of the image; Step 4: Obtain the X-axis or Y-axis long bar graph, and then find the minimum specific point and the maximum specific point of the X-axis or Y-axis bar graph; Step 5: Determine the shape of each object according to the minimum specific point and the maximum specific point of the identifiable object through the polynomial regression analysis method. Step 6: Determine the size and size of the object according to the minimum specific point and the position of the largest specific point. .

A method for recognizing an object in an image, comprising: step 1: first obtaining a digital image data; step 2: extracting at least one object according to the identifiable feature of the digital image; and step 3: simultaneously calculating step 2 Take the X-axis or Y-axis long chart of the image; Step 4: Obtain the X-axis or Y-axis long bar graph, and then find the minimum specific point and the maximum specific point of the X-axis or Y-axis bar graph; 5: Further, through the polynomial regression analysis method, the shape of each object is determined according to the minimum specific point and the maximum specific point of the identifiable object; Step 6: determining the size and size of the object according to the minimum specific point and the position of the largest specific point; Step 7: Determine the correct position of the object based on the minimum specific point and the position of the largest specific point.