TWI618027B

TWI618027B - 3d hand gesture image recognition method and system thereof with ga

Info

Publication number: TWI618027B
Application number: TW105113887A
Authority: TW
Inventors: 王敬文
Original assignee: 國立高雄應用科技大學
Priority date: 2016-05-04
Filing date: 2016-05-04
Publication date: 2018-03-11
Also published as: TW201740347A

Abstract

一種基因演算三維手勢影像辨識系統包含一光場攝影單元、一演算單元及一輸出單元。利用該光場攝影單元攝取一手勢動作，以獲得一3D手勢影像。該演算單元連接至該光場攝影單元，再將該3D手勢影像投影至一預定辨識空間，以獲得一特徵向量，利用一基因演算法處理該特徵向量，以獲得一適應性特徵向量，再將該適應性特徵向量與數個樣本特徵向量以進行比對分類，以分類該3D手勢影像，以便辨識該3D手勢影像之種類。該輸出單元連接至該演算單元，以便輸出該3D手勢影像之種類。 A genetic algorithm three-dimensional gesture image recognition system comprises a light field photography unit, a calculation unit and an output unit. A light gesture is taken by the light field photographing unit to obtain a 3D gesture image. The calculating unit is connected to the light field photographing unit, and then projecting the 3D gesture image to a predetermined recognition space to obtain a feature vector, and processing the feature vector by using a genetic algorithm to obtain an adaptive feature vector, and then The adaptive feature vector and the plurality of sample feature vectors are compared for classification to classify the 3D gesture image to identify the type of the 3D gesture image. The output unit is connected to the calculation unit to output the type of the 3D gesture image.

Description

Gene calculus three-dimensional gesture image recognition method and system thereof

本發明係關於一種基因演算三維手勢影像辨識方法及其系統；特別是關於一種二維最佳主成分分析〔2D optimal principal component analysis〕結合基因演算法〔GA，genetic algorithm〕之三維手勢影像辨識方法及其系統。 The invention relates to a method and system for recognizing a three-dimensional gesture image of a gene algorithm; in particular, a method for recognizing a three-dimensional gesture image of a 2D optimal principal component analysis combined with a genetic algorithm (GA) And its system.

習用手勢影像辨識之相關應用裝置，例如：中華民國專利公告第M382675號之〝以手勢辨識為基礎之監控攝影機操控裝置〞新型專利，其揭示有關於一種可以下達監控攝影機鏡頭上下左右轉向、拉近及拉遠等動作指令的操控裝置。利用手勢取像攝影機所拍攝使用者的手勢影像，辨識出使用者手勢的上、下、左、右、前、後移動情形，並對監控攝影機發出鏡頭往上、往下、往左、往右、拉近、拉遠的操控訊息，而不需藉由操作滑鼠或操縱桿來下達上述操控訊息。 A related application device for recognizing gesture image recognition, for example, the Republic of China Patent Publication No. M382675, a new type of surveillance camera control device based on gesture recognition, which discloses that a camera lens can be turned up and down, left and right, and zoomed in. And remote control and other action command control devices. Using the gesture image of the user captured by the gesture camera to recognize the movement of the user's gestures up, down, left, right, front, and back, and to send the camera up, down, left, and right to the surveillance camera. To zoom in and out, without having to manipulate the mouse or joystick to release the above control message.

前述第M382675號先利用手勢取像攝影機拍攝使用者手部影像，再利用手勢位移偵測模組偵測與辨識使用者的手部位置與趨勢，進而計算出欲控制監控攝影機的方向，再傳遞訊號控制監控攝影機之鏡頭執行往上、往下、往左、往右、拉近及拉遠的動作。該手勢位移偵測模組先計算出手勢取像攝影機中之手部出現的位置與面積，做為手勢辨識的原點，此部份亦可以在手勢取像攝影機的畫面上預先設定一手勢辨識的原點，做為後續手勢辨識的位移偵測參考點，並分別定義出一個手勢最小面積的門檻值、一個手勢最小移動距離的門檻值及一個手勢最小面積變化的門檻值。 The aforementioned M382675 first uses a gesture camera to capture the user's hand image, and then uses the gesture displacement detection module to detect and recognize the user's hand position and trend, thereby calculating the direction of the camera to be controlled, and then transmitting The signal control monitors the lens of the camera to perform actions of going up, down, left, right, zooming in and out. The gesture displacement detecting module first calculates the position and area of the hand in the gesture camera, and serves as the origin of the gesture recognition. This part can also be used in the gesture camera. The origin of a gesture recognition is preset on the screen as a displacement detection reference point for subsequent gesture recognition, and a threshold value of a minimum area of the gesture, a threshold value of a minimum moving distance of the gesture, and a minimum area change of a gesture are respectively defined. Threshold value.

前述第M382675號在使用者的手勢面積大小大於預先設定之手勢最小面積門檻值後，裝置便會開始啟動手勢辨識功能，利用該手勢位移偵測模組偵測目前手勢位置與手勢原點的位移量。當手勢移動的位移量大於預先設定的門檻值時，則視為有手勢移動發生，反之，當手勢移動的位移量小於門檻值時，則視為沒有手勢移動發生；又，該手勢位移偵測模組亦會偵測目前手勢位置與手勢原點的位移方向，若手勢往上、下、左或右移動，則觸發一控制訊號以驅動監控攝影機鏡頭往上、下、左或右移動。另外，該手勢位移偵測模組以影像辨識技術判斷手勢在活動範圍內的前後移動；手勢往前移時其面積會比面積門檻值大，則觸發一控制訊號以驅動監控攝影機鏡頭拉近影像；手勢往後移時其面積會比面積門檻值小，則觸發一控制訊號以驅動監控攝影機鏡頭拉遠影像。 In the foregoing No. M382675, after the size of the gesture area of the user is greater than a preset minimum threshold value of the gesture, the device starts to activate the gesture recognition function, and uses the gesture displacement detection module to detect the displacement of the current gesture position and the origin of the gesture. the amount. When the displacement of the gesture movement is greater than a preset threshold value, the gesture movement is considered to occur. Conversely, when the displacement amount of the gesture movement is less than the threshold value, no gesture movement occurs; and the gesture displacement detection is performed. The module also detects the current gesture position and the direction of displacement of the gesture origin. If the gesture moves up, down, left or right, a control signal is triggered to drive the camera lens to move up, down, left or right. In addition, the gesture displacement detection module uses image recognition technology to determine the movement of the gesture in the range of motion; when the gesture moves forward, the area thereof is larger than the area threshold, and a control signal is triggered to drive the surveillance camera lens to zoom in. When the gesture moves backwards, its area will be smaller than the area threshold, triggering a control signal to drive the camera lens to zoom out.

另一習用手勢影像辨識之相關應用裝置，例如：中華民國專利公告第I298461號之〝手勢辨識系統及其方法〞發明專利，其揭示一種手勢辨識系統及其方法應用於一具有影像擷取器之筆記型電腦。使用者可直接對準此影像擷取器比出一預設的手勢，而筆記型電腦即會執行此手勢動作相對應之應用程式或是功能選項，以增加使用者執行應用程式或是功能選項時之方便性。 Another application device for recognizing gesture image recognition, for example, the gesture recognition system of the Republic of China Patent No. I298461 and its method and invention patent, which discloses a gesture recognition system and a method thereof applied to an image capture device Notebook computer. The user can directly align the image capture device with a preset gesture, and the notebook computer will execute the corresponding application or function option of the gesture action to increase the user execution application or function options. The convenience of time.

另一習用手勢影像辨識之相關應用裝置，例如：中華民國專利公開第I395145號之〝手勢辨識系統及其方法〞發明專利，其揭示一種手勢辨識系統包括：一攝影裝置用於取得可能含有自然手勢的影像；一處理器用以從影像中找出膚色部份的膚色輪廓〔edge〕，再將膚色輪廓分類為多個不同角度的輪廓碎片；一運算引擎具有數個平行運算單元及數個不同角度類別的手勢模板庫，該數個平行運算單元分別在不同角度類別的手勢模板庫中找出和輪廓碎片最近似的手勢模板；一最佳模板選取手段，自由該數個平行運算單元找出的數個近似的手勢模板中再選出一個最佳的手勢模板；及一顯示終端用以顯示最佳的手勢模板的影像；藉此達到無需使用任何標記〔marker less〕且能夠即時辨識手勢的目的。 Another related application device for recognizing gesture image recognition, for example, the gesture recognition system of the Republic of China Patent No. I395145 and the method thereof, and the invention patent, which discloses that a gesture recognition system includes: a photographing device for obtaining a possible natural gesture Image; one processor used Find the skin contour of the skin color part from the image, and then classify the skin color contour into a plurality of contour fragments of different angles; an operation engine has a plurality of parallel operation units and a plurality of gesture template libraries of different angle categories, A plurality of parallel computing units respectively find a gesture template that is closest to the contour fragments in the gesture template library of different angle categories; an optimal template selection means, which is free from the plurality of approximate gesture templates found by the plurality of parallel operation units Then, an optimal gesture template is selected; and a display terminal is used to display an image of the best gesture template; thereby achieving the purpose of not recognizing the marker without using any marker.

另一習用手勢影像辨識相關方法及系統，例如：中華民國專利公告第I431538號之〝基於影像之動作手勢辨識方法及系統〞發明專利，其揭示一種基於影像之動作手勢辨識方法及系統。該方法包含下列步驟：接收複數張影像畫面；根據複數張影像畫面執行一手勢偵測，以得到一第一手勢，如果該第一手勢符合一預設開始手勢，則根據複數張影像畫面中手部位置執行一移動追蹤，以取得一移動手勢；於執行移動追蹤之過程中，根據複數張影像畫面執行手勢偵測，以得到一第二手勢，若該第二手勢符合一預設結束手勢，停止移動追蹤。 Another conventional gesture image recognition related method and system, for example, the image-based motion gesture recognition method and system invention patent of the Republic of China Patent Publication No. I431538, discloses an image-based motion gesture recognition method and system. The method includes the steps of: receiving a plurality of image frames; performing a gesture detection according to the plurality of image images to obtain a first gesture, and if the first gesture conforms to a preset start gesture, according to the plurality of image frames The part position performs a movement tracking to obtain a movement gesture; during the execution of the movement tracking, performing gesture detection according to the plurality of image images to obtain a second gesture, if the second gesture meets a preset end Gesture, stop moving tracking.

另一習用手勢影像辨識相關方法及系統，例如：發明人提出中華民國專利公告第I444907號之〝採用奇異值分解處理複雜背景之手勢影像辨識方法及其系統〞發明專利，其揭示一種採用奇異值分解處理複雜背景之手勢影像辨識方法。該方法包含：利用一奇異值分解法分解一原始手勢影像，以獲得一增益手勢影像；自該增益手勢紋影像去除深色背景，以獲得至少一類皮膚圖素區塊；及利用一膚色偵測方法於該類皮膚圖素區塊進行膚色偵測，以去除該類皮膚圖素區塊之剩餘背景。該手勢影像辨識系統包含一輸入單元、一演算單元及一輸出單元。該輸入單元用以輸入該原始手勢影像，該演算單元用以自該增益手勢紋影像去除深色背景及剩餘背景，而該輸出單元用以輸出一膚色手勢影像。 Another conventional method and system for recognizing gesture image recognition, for example, the inventor proposes a gesture image recognition method for singular value decomposition processing complex background using the singular value decomposition method and a system patent of the invention, which discloses a singular value Decompose a gesture image recognition method that handles complex backgrounds. The method comprises: decomposing an original gesture image by using a singular value decomposition method to obtain a gain gesture image; removing a dark background from the gain gesture image to obtain at least one type of skin pixel block; and utilizing a skin color detection method The method performs skin color detection on the skin pixel block to remove the remaining background of the skin pixel block. The gesture image recognition system comprises an input unit, a calculation unit and an output unit. The input form The input unit is configured to input the original gesture image, and the calculation unit is configured to remove the dark background and the remaining background from the gain gesture image, and the output unit is configured to output a skin motion gesture image.

另一習用手勢影像辨識相關方法及系統，例如：發明人提出中華民國專利公告第I444908號之〝採用影像方向對正處理之手勢影像辨識方法及其系統〞發明專利，其揭示一種採用影像方向對正處理之手勢影像辨識方法。該方法包含：輸入一膚色手勢影像；於該膚色手勢影像計算一全域性質心；於該膚色手勢影像選擇一感興趣區塊；利用該感興趣區塊選擇一子區域；於該子區域計算一區域性質心；利用該全域性質心及區域性質心計算一對正角度。該手勢影像辨識系統包含一輸入單元、一演算單元及一輸出單元。該輸入單元用以輸入該膚色手勢影像，該演算單元於該膚色手勢影像選擇該感興趣區塊及子區域，以計算該全域性質心及區域性質心，再計算該對正角度，而該輸出單元用以輸出該對正角度。 Another conventional method and system for recognizing gesture image recognition, for example, the inventor proposes a gesture image recognition method using image direction alignment and a system invention patent after the Republic of China Patent Publication No. I444908, which discloses an image orientation pair The gesture image recognition method being processed. The method includes: inputting a skin color gesture image; calculating a global heart in the skin color gesture image; selecting a region of interest in the skin color gesture image; selecting a sub region using the region of interest; and calculating a subregion in the subregion Regional nature; use this global nature and regional nature to calculate a pair of positive angles. The gesture image recognition system comprises an input unit, a calculation unit and an output unit. The input unit is configured to input the skin color gesture image, and the calculation unit selects the region of interest and the sub-region in the skin color gesture image to calculate the global nature heart and the region property core, and then calculate the alignment angle, and the output is The unit is used to output the pair of positive angles.

另一習用手勢影像辨識相關方法及系統，例如：發明人提出中華民國專利公告第I444909號之〝採用奇異值分解進行光線補償處理之手勢影像辨識方法及其系統〞發明專利，其揭示一種採用奇異值分解光線補償處理之手勢影像辨識方法。該方法包含：輸入一手勢影像；利用一奇異值分解法應用於該手勢影像；以一光線補償法計算至少一光線補償係數；及利用該光線補償係數進行光線補償處理該手勢影像，以獲得一光線補償手勢影像。該手勢影像辨識系統包含一輸入單元、一演算單元及一輸出單元。該輸入單元用以輸入該原始手勢影像，該演算單元以該光線補償法計算該光線補償係數，並利用該光線補償係數進行光線補償處理該手勢影像，以獲得該光線補償手勢影像，而該輸出單元用以輸出該光線補償手勢影像。 Another method and system for recognizing gesture image recognition, for example, the inventor proposes a gesture image recognition method using singular value decomposition for ray compensation processing and a system for inventing the invention, which discloses a singularity Value decomposition light compensation processing gesture image recognition method. The method comprises: inputting a gesture image; applying a singular value decomposition method to the gesture image; calculating at least one ray compensation coefficient by a ray compensation method; and performing ray compensation processing on the gesture image by using the ray compensation coefficient to obtain a gesture image Light compensated gesture image. The gesture image recognition system comprises an input unit, a calculation unit and an output unit. The input unit is configured to input the original gesture image, and the calculating unit calculates the light compensation coefficient by using the light compensation method, and performs light compensation processing on the gesture image by using the light compensation coefficient to obtain the light compensation gesture image, and the output is The unit is configured to output the light compensation gesture image.

另外，習用手勢影像辨識之相關應用技術已揭示於部分美國專利，例如：美國專利第7,702,130號之〝User interface apparatus using hand gesture recognition and method thereof〞、第7,680,295號之〝Hand-gesture based interface apparatus〞、第6,215,890號之〝Hand gesture recognizing device〞、第6,002,808號之〝Hand gesture control system〞及第5,594,469號之〝Hand gesture machine control system〞等。前述中華民國專利及美國專利僅為本發明技術背景之參考及說明目前技術發展狀態而已，其並非用以限制本發明之範圍。 In addition, the related application techniques of conventional gesture image recognition have been disclosed in some U.S. patents, for example, U.S. Patent No. 7,702,130, User interface apparatus using hand gesture recognition and method thereof, No. 7,680,295, Hand-gesture based interface apparatus〞 No. 6,215,890, Hand gesture recognizing device〞, No. 6,002,808, Hand gesture control system〞, and No. 5,594,469, Hand gesture machine control system〞. The foregoing Japanese patents and U.S. patents are only for the purpose of the present invention and are not intended to limit the scope of the present invention.

雖然前述專利已揭示相關手勢影像辨識技術，但其並未提供三維手勢影像辨識之相關技術。事實上，就手勢影像辨識技術而言，其必然需要簡化複雜的系統架構或省略複雜的前置處理程序，否則其影響手勢影像辨識的可靠度。因此，習用手勢影像辨識技術必然存在進一步提供三維手勢影像辨識的需求。 Although the aforementioned patents have disclosed related gesture image recognition techniques, they do not provide related techniques for three-dimensional gesture image recognition. In fact, in terms of gesture image recognition technology, it is necessary to simplify the complicated system architecture or omit complicated pre-processing procedures, otherwise it will affect the reliability of gesture image recognition. Therefore, the conventional gesture image recognition technology necessarily has the need to further provide three-dimensional gesture image recognition.

有鑑於此，本發明為了滿足上述需求，其提供一種基因演算三維手勢影像辨識方法及其系統，其利用一光場攝影單元攝取一手勢動作，以獲得一3D手勢影像，而將該3D手勢影像投影至一預定辨識空間，以獲得一特徵向量，且利用一基因演算法處理該特徵向量，以獲得一適應性特徵向量，且將該適應性特徵向量與數個樣本特徵向量進行比對分類，即可完成手勢影像辨識，以提升習用手勢影像辨識技術之可靠度。 In view of the above, the present invention provides a genetic algorithm for three-dimensional gesture image recognition and a system thereof, which utilizes a light field photographing unit to take a gesture to obtain a 3D gesture image, and the 3D gesture image. Projecting to a predetermined recognition space to obtain a feature vector, and processing the feature vector by using a genetic algorithm to obtain an adaptive feature vector, and comparing the adaptive feature vector with the plurality of sample feature vectors. Gesture image recognition can be completed to improve the reliability of the conventional gesture image recognition technology.

本發明較佳實施例之主要目的係提供一種基因演算三維手勢影像辨識方法及其系統，其利用一光場攝影單元攝取一手勢動作，以獲得一3D手勢影像，而將該3D手勢影像投影至一預定辨識空間，以獲得一特徵向量，且利用一基因演算法處理該特徵向量，以獲得一適應性特徵向量，且將該適應性特徵向量與數個樣本特徵向量進行比對分類，以達成提升手勢影像辨識可靠度之目的。 The main object of the preferred embodiment of the present invention is to provide a method for recognizing a three-dimensional gesture image of a genetic algorithm and a system thereof, which utilizes a light field photographing unit to take a gesture motion to obtain a 3D gesture image, and project the 3D gesture image to a predetermined recognition space to obtain a feature vector, The eigenvector is processed by a genetic algorithm to obtain an adaptive eigenvector, and the adaptive eigenvector is compared with the plurality of sample eigenvectors to achieve the purpose of improving the reliability of the gesture image recognition.

為了達成上述目的，本發明較佳實施例之基因演算三維手勢影像辨識方法包含：利用一光場攝影單元攝取一手勢動作，以獲得一3D手勢影像；將該3D手勢影像投影至一預定辨識空間，以獲得一特徵向量；利用一基因演算法處理該特徵向量，以獲得一適應性特徵向量；及將該適應性特徵向量與數個樣本特徵向量以進行比對分類，以分類該3D手勢影像，以便辨識該3D手勢影像之種類。 In order to achieve the above object, a three-dimensional gesture image recognition method for a genetic algorithm according to a preferred embodiment of the present invention includes: capturing a gesture by using a light field photographing unit to obtain a 3D gesture image; and projecting the 3D gesture image to a predetermined recognition space. Obtaining a feature vector; processing the feature vector by using a gene algorithm to obtain an adaptive feature vector; and classifying the adaptive feature vector and the plurality of sample feature vectors to classify the 3D gesture image In order to identify the type of the 3D gesture image.

本發明較佳實施例之該3D手勢影像包含一平面影像資訊及一深度資訊。 In the preferred embodiment of the present invention, the 3D gesture image includes a plane image information and a depth information.

本發明較佳實施例之該3D手勢影像為一3D手勢輪廓影像或一3D手勢實心影像。 In the preferred embodiment of the present invention, the 3D gesture image is a 3D gesture contour image or a 3D gesture solid image.

本發明較佳實施例利用投影色彩空間方式將該3D手勢影像進行投影色彩空間轉換，以獲得一R通道影像資訊、一G通道影像資訊及一B通道影像資訊。 In a preferred embodiment of the present invention, the 3D gesture image is converted into a projected color space by using a projected color space method to obtain an R channel image information, a G channel image information, and a B channel image information.

本發明較佳實施例在將該3D手勢影像投影至該預定辨識空間時，採用主成分分析法或二維最佳主成分分析法進行投影。 In the preferred embodiment of the present invention, when the 3D gesture image is projected to the predetermined recognition space, projection is performed by principal component analysis or two-dimensional optimal principal component analysis.

本發明較佳實施例在利用該基因演算法處理該特徵向量前，採用一適應性函數或一貝式概似函數。 In the preferred embodiment of the present invention, an adaptive function or a shell-like approximate function is used before the feature vector is processed by the genetic algorithm.

本發明較佳實施例在將該適應性特徵向量與數個樣本特徵向量以進行比對分類時，採用最近鄰居法進行比對分類。 In the preferred embodiment of the present invention, when the adaptive feature vector and the plurality of sample feature vectors are compared for classification, the nearest neighbor method is used for comparison and classification.

本發明較佳實施例在分類該適應性特徵向量前，利用一馬式距離函數或一馬式餘弦距離函數處理該3D手勢影像。 The preferred embodiment of the present invention processes the 3D gesture image using a horse distance function or a horse cosine distance function before classifying the adaptive feature vector.

為了達成上述目的，本發明較佳實施例之基因演算三維手勢影像辨識系統包含：一光場攝影單元，其攝取一手勢動作，以獲得一3D手勢影像；一演算單元，其連接至該光場攝影單元，將該3D手勢影像投影至一預定辨識空間，以獲得一特徵向量，且利用一基因演算法處理該特徵向量，以獲得一適應性特徵向量，再將該適應性特徵向量與數個樣本特徵向量以進行比對分類，以分類該3D手勢影像，以便辨識該3D手勢影像之種類；及一輸出單元，其連接至該演算單元，以便輸出該3D手勢影像之種類。 In order to achieve the above object, a genetic algorithm three-dimensional gesture image recognition system according to a preferred embodiment of the present invention comprises: a light field photographing unit that takes a gesture motion to obtain a 3D gesture image; and an arithmetic unit connected to the light field The photographing unit projects the 3D gesture image into a predetermined recognition space to obtain a feature vector, and processes the feature vector by using a genetic algorithm to obtain an adaptive feature vector, and then the adaptive feature vector and the plurality of The sample feature vectors are categorized to classify the 3D gesture image to identify the type of the 3D gesture image; and an output unit coupled to the calculation unit to output the type of the 3D gesture image.

本發明較佳實施例在將該3D手勢影像投影至該預定辨識空間時，採用主成分分析法或二維最佳主成分分析法進行投影，而在將該適應性特徵向量與數個樣本特徵向量以進行比對分類時，採用最近鄰居法進行比對分類。 In the preferred embodiment of the present invention, when the 3D gesture image is projected into the predetermined recognition space, the principal component analysis method or the two-dimensional optimal principal component analysis method is used for projection, and the adaptive feature vector and the plurality of sample features are used. When the vectors are compared for classification, the nearest neighbor method is used for comparison classification.

S1‧‧‧步驟 S1‧‧‧ steps

S2‧‧‧步驟 S2‧‧‧ steps

S3‧‧‧步驟 S3‧‧‧ steps

S4‧‧‧步驟 S4‧‧‧ steps

10‧‧‧光場攝影單元 10‧‧‧Light field photography unit

20‧‧‧演算單元 20‧‧‧ calculus unit

30‧‧‧輸出單元 30‧‧‧Output unit

第1圖：本發明較佳實施例之基因演算三維手勢影像辨識方法之流程示意圖。 FIG. 1 is a flow chart showing a method for recognizing a three-dimensional gesture image of a genetic algorithm according to a preferred embodiment of the present invention.

第2圖：本發明較佳實施例之基因演算三維手勢影像辨識系統之方塊示意圖。 Figure 2: Image analysis of three-dimensional gestures of genetic algorithm in a preferred embodiment of the present invention A block diagram of the system.

第3A及3B圖：本發明較佳實施例之基因演算三維手勢影像辨識方法及其系統攝取九個3D手勢輪廓影像及九個3D手勢實心影像之示意圖。 3A and 3B are diagrams showing a method for recognizing a three-dimensional gesture image of a genetic algorithm according to a preferred embodiment of the present invention and a system for capturing nine 3D gesture contour images and nine solid images of 3D gestures.

第4圖揭示本發明較佳實施例之基因演算三維手勢影像辨識方法及其系統以k=3條件進行辨識3D手勢輪廓影像及3D手勢實心影像後，其辨識率與特徵向量關係之曲線示意圖。 FIG. 4 is a schematic diagram showing the relationship between the recognition rate and the feature vector after the 3D gesture contour image and the 3D gesture solid image are recognized by the k =3 condition in the preferred embodiment of the present invention.

第5A及5B圖：本發明較佳實施例之基因演算三維手勢影像辨識方法及其系統採用每影像取三種旋轉變異條件分別攝取九種3D手勢輪廓影像及九種3D手勢實心影像之示意圖。 5A and 5B are diagrams showing a method for recognizing a three-dimensional gesture image of a genetic algorithm according to a preferred embodiment of the present invention, and a system for taking nine kinds of 3D gesture contour images and nine solid images of 3D gestures for each of the three rotation variation conditions.

第6(A)及6(B)圖：本發明較佳實施例之基因演算三維手勢影像辨識方法及其系統由原始3D手勢實心經RGB影像及其處理後產生一PCS投影影像之示意圖。 6(A) and 6(B) are diagrams showing a method for recognizing a three-dimensional gesture image of a genetic algorithm according to a preferred embodiment of the present invention, and a system for generating a PCS projection image from an original 3D gesture through an RGB image and processing thereof.

第7(a)至7(d)圖：本發明較佳實施例之基因演算三維手勢影像辨識方法及其系統採用添加各種高斯雜訊變異條件進行測試後，將原始3D手勢輪廓影像與一系列測試3D手勢輪廓影像之比較示意圖。 7(a) to 7(d): a method for recognizing a three-dimensional gesture image of a genetic algorithm according to a preferred embodiment of the present invention and a system thereof, which are tested by adding various Gaussian noise variations conditions, and then the original 3D gesture contour image and a series of images A comparison diagram for comparing 3D gesture contour images.

第8(a)至8(d)圖：本發明較佳實施例之基因演算三維手勢影像辨識方法及其系統採用添加各種高斯雜訊變異條件進行測試後，將原始3D手勢實心影像與一系列測試3D手勢實心影像之比較示意圖。 8(a) to 8(d): a method for recognizing a three-dimensional gesture image of a genetic algorithm according to a preferred embodiment of the present invention, and a system for adding a variety of Gaussian noise variations to test a solid image of the original 3D gesture with a series of Test a comparison of solid images of 3D gestures.

為了充分瞭解本發明，於下文將舉例較佳實施例並配合所附圖式作詳細說明，且其並非用以限定本發明。 In order to fully understand the present invention, the preferred embodiments of the present invention are described in detail below, and are not intended to limit the invention.

本發明較佳實施例之基因演算三維手勢影像辨識方法及其系統適用於各種手勢辨識裝置及其相關應用設備，例如：各類型電腦系統、家電產品控制系統〔如物聯網〕、自動化控制系統、醫療照護系統或保全系統，但其並非用以限定本發明之範圍。 The gene calculus three-dimensional gesture image recognition method and system thereof according to the preferred embodiment of the present invention are applicable to various gesture recognition devices and related application devices, for example, various types of computer systems, home appliance product control systems (such as objects) Networking], automated control systems, medical care systems, or security systems, but are not intended to limit the scope of the invention.

第1圖揭示本發明較佳實施例之基因演算三維手勢影像辨識方法之流程示意圖。第2圖揭示本發明較佳實施例之基因演算三維手勢影像辨識系統之方塊示意圖，其對應於第1圖。請參照第1及2圖所示，舉例而言，本發明較佳實施例之三維手勢影像辨識系統包含一光場攝影〔light field capturing〕單元10、一演算單元20及一輸出單元30，而該演算單元20適當連接至該光場攝影單元10，且該輸出單元30適當連接至該演算單元20。 FIG. 1 is a flow chart showing a method for recognizing a three-dimensional gesture image of a genetic algorithm according to a preferred embodiment of the present invention. 2 is a block diagram showing a three-dimensional gesture image recognition system for genetic algorithm according to a preferred embodiment of the present invention, which corresponds to FIG. Referring to FIGS. 1 and 2, for example, a three-dimensional gesture image recognition system according to a preferred embodiment of the present invention includes a light field capturing unit 10, a calculation unit 20, and an output unit 30. The calculation unit 20 is suitably connected to the light field photographing unit 10, and the output unit 30 is suitably connected to the calculation unit 20.

請再參照第1及2圖所示，舉例而言，該光場攝影單元10包含數個微光學鏡頭〔micro lens〕，而該微光學鏡頭形成一微鏡頭陣列，且該微鏡頭陣列用以主動攝取一場深度〔depth of field〕影像。另外，該場深度影像之深度資訊由利用數個深度計算參數〔depth calculation parameter〕進行計算。該深度計算參數包含最大虛擬深度〔maximum virtual depth〕、近似解析度基準〔approximate resolution level〕、像素等級〔pixel step〕、最小標準差〔minimum standard deviation〕、最小相關度〔minimum correlation〕、曲率極值〔extremum curvature〕、相關性調節部數〔correlation patch diameter〕、相關性調節因子〔correlation patch stride〕、填補深度〔depth fill〕及後處理深度〔depth post-process〕。 Referring to FIGS. 1 and 2, for example, the light field photographing unit 10 includes a plurality of micro lenses, and the microlens lens forms a microlens array, and the microlens array is used for Actively take a depth of field image. In addition, the depth information of the depth image of the field is calculated by using a plurality of depth calculation parameters. The depth calculation parameter includes a maximum virtual depth, an approximate resolution level, a pixel step, a minimum standard deviation, a minimum correlation, and a curvature pole. Extremum curvature, correlation patch diameter, correlation patch stride, depth fill, and depth post-process.

請再參照第1及2圖所示，本發明較佳實施例之三維手勢影像辨識方法包含步驟S1：首先，利用該光場攝影單元10或具類似功能的光場影像輸入單元以一預定距離〔例如：50至70公分〕攝取一手勢動作，以獲得一3D手勢影像〔例如：100x100像素〕。舉例而言，該3D手勢影像包含一平面影像資訊及一深度資訊。該3D手勢影像為一3D手勢輪廓〔contour〕影像或一3D手勢實心〔solid〕影像。 Referring to FIG. 1 and FIG. 2 again, the three-dimensional gesture image recognition method according to the preferred embodiment of the present invention includes the step S1: first, using the light field photographing unit 10 or the light field image input unit having a similar function at a predetermined distance. [Example: 50 to 70 cm] Take a gesture to get a 3D gesture image (for example: 100x100 pixels). For example, the 3D gesture image includes a plane image information and a depth information. The 3D gesture The image is a 3D gesture contour image or a 3D gesture solid image.

第3A圖揭示本發明較佳實施例之基因演算三維手勢影像辨識方法及其系統攝取九個3D手勢輪廓影像〔平面旋轉手勢輪廓影像，IC〕之示意圖，其包含零至八的3D手勢灰階〔grayscale〕輪廓影像，以便用於平面旋轉〔in-plane rotation〕變異。相對的，第3B圖揭示本發明較佳實施例之基因演算三維手勢影像辨識方法及其系統攝取九個3D手勢實心影像〔平面旋轉手勢實心影像，IS〕之示意圖，其包含零至八的3D手勢灰階實心影像，以便用於平面旋轉變異。 FIG. 3A is a schematic diagram of a three-dimensional gesture image recognition method for a gene calculus according to a preferred embodiment of the present invention, and a schematic diagram of a system for capturing nine 3D gesture contour images (a planar rotation gesture contour image, IC), which includes a zero-to-eight 3D gesture gray scale. [grayscale] contour image for use in in-plane rotation variation. In contrast, FIG. 3B discloses a schematic diagram of a three-dimensional gesture image recognition method for a genetic algorithm according to a preferred embodiment of the present invention, and a schematic diagram of a system for capturing nine solid images of a 3D gesture (solid image of a planar rotation gesture, IS), which includes zero to eight 3D. Gesture grayscale solid image for use in planar rotation variations.

請再參照第1及2圖所示，本發明較佳實施例之三維手勢影像辨識方法包含步驟S2：接著，將該3D手勢影像投影至一預定辨識空間，以獲得一特徵向量。舉例而言，本發明較佳實施例採用主成分分析〔PCA，principal component analysis〕或二維最佳主成分分析〔2D optimal PCA〕方法將該3D手勢影像之特徵投影至比較容易辨別的空間。 Referring to FIG. 1 and FIG. 2 again, the three-dimensional gesture image recognition method according to the preferred embodiment of the present invention includes the step S2: then, the 3D gesture image is projected to a predetermined recognition space to obtain a feature vector. For example, the preferred embodiment of the present invention projects the features of the 3D gesture image into a relatively easily discernible space using a principal component analysis (PCA) or a 2D optimal PCA method.

該主成分分析可運用於高維資料的降維，且保留資料的變異程度，由於一維PCA需要將輸入訓練影像的維度變直。就(m×n)大小的影像為例，在進行共變異矩陣進算時，會產生(m×n)×(m×n)大小的矩陣，對計算特徵向量的步驟上需要極大的計算時間。因此，將原本的共變異矩陣改為下式： The principal component analysis can be applied to the dimensionality reduction of high-dimensional data, and the degree of variation of the data is preserved, because the one-dimensional PCA needs to straighten the dimensions of the input training image. Taking the image of ( m × n ) size as an example, when the covariation matrix is calculated, a matrix of ( m × n ) × ( m × n ) is generated, which requires a great calculation time for the step of calculating the feature vector. . Therefore, the original covariation matrix is changed to the following formula:

其中C _i為改變過的共變異矩陣，L為訓練樣本數量， X _tr為訓練影像，為全部訓練影像的平均值。如此，將共變異矩陣維度大小降為L×L，且大幅減少計算投影基底所需的時間。本發明較佳實施例採用奇異值分解〔SVD〕對共變異矩陣進行運算如下式： Where C _i is the changed covariation matrix, L is the number of training samples, and X _tr is the training image. The average of all training images. In this way, the covariation matrix dimension is reduced to L × L and the time required to calculate the projection substrate is greatly reduced. The preferred embodiment of the present invention uses the singular value decomposition [SVD] to operate the covariation matrix as follows:

上式所求得的特徵值矩陣Σ_i與原本的共變異矩陣C經奇異值分解所得特徵值矩陣Σ相同，而特徵向量矩陣則須以(X _tr-)*U _i方形成特徵向量矩陣U。接著，利用特徵向量矩陣將原本的訓練資料投影至PCA空間，以獲得經過PCA處理過後的訓練樣本特徵向量F _tr。 The formula obtained eigenvalue matrix Σ _i resulting eigenvalue matrix [Sigma same original covariance matrix C through singular value decomposition, and the eigenvector matrix shall be to (X _tr - ) * U _i square forms the eigenvector matrix U . Then, the original training data is projected into the PCA space by using the feature vector matrix to obtain the PCA processed training sample feature vector F _tr .

本發明另一較佳實施例利用二維最佳主成分分析自n個總特徵向量〔特徵向量集U={1,2,...,n}〕中選取k個特徵向量。特徵向量集U={1,2,...,n}包含第一子集〔X〕及第二子集〔Y〕，其中第一子集〔X〕為選取特徵向量之子集，第二子集〔Y〕為剩餘特徵向量之子集。本發明較佳實施例採用運算子φ用以自X選取最小判別特徵子集〔least discriminant feature subset〕及運算子ψ用以自Y選取最大判別〔most discriminant〕特徵子集，因此|Ω|=κ，，，本發明較佳實施例採用函數如下：,κ D Another preferred embodiment of the present invention utilizes two-dimensional optimal principal component analysis to select k feature vectors from n total feature vectors [feature vector set U = {1, 2, ..., n }]. The eigenvector set U = {1, 2, ..., n } includes a first subset [ X ] and a second subset [ Y ], wherein the first subset [ X ] is a subset of the selected feature vectors, and the second The subset [ Y ] is a subset of the remaining feature vectors. Preferred embodiment of the present invention to employ operators φ is determined from the X minimum feature subset selection least discriminant feature subset [] and ψ operators to select the maximum is determined from the [Y] most discriminant feature subset, thus | Ω | = κ , , The preferred embodiment of the present invention employs the following functions: , κ D

J(Ω)為用以留一法〔leave-one-out〕評估子集Ω。 J (Ω) is used to evaluate the subset Ω by leave-one-out.

請再參照第1及2圖所示，本發明較佳實施例之三維手勢影像辨識方法包含步驟S3：接著，利用一基因演算法〔GA〕處理該特徵向量，以獲得一適應性特徵向量。舉例而言，在利用該基因演算法處理該特徵向量時，採用一適應性函數〔fitness function〕或一貝式概似函數〔Bayesian likelihood function〕。 Referring again to Figures 1 and 2, a preferred embodiment of the present invention The three-dimensional gesture image recognition method comprises the step S3: Next, the feature vector is processed by a genetic algorithm [GA] to obtain an adaptive feature vector. For example, when the feature vector is processed by the genetic algorithm, a fitness function or a Bayesian likelihood function is used.

本發明較佳實施例採用貝式概似函數如下： The preferred embodiment of the present invention adopts a Bayesian approximate function as follows:

κ/n為選取特徵向量數量與總特徵向量數量之比例，η ₁、η ₂、η ₃、η ₄為作用係數，TPR為真陽性率〔true positive rate〕、TNR為真陰性率〔true negative rate〕、FNR為偽陽性率〔false negative rate〕、FPR為偽陰性率〔false posi κ/n is the ratio of the number of eigenvectors to the total number of eigenvectors, η ₁ , η ₂ , η ₃ , η ₄ are the action coefficients, TPR is the true positive rate, and TNR is the true negative rate. Rate], FNR is false negative rate, FPR is false negative rate [false posi

本發明較佳實施例輸入該3D手勢影像〔矩陣 a R ^ζ，ζ n〕之單一欄〔column per image〕，以獲得二維最佳主成分分析，其中ζ為影像尺寸大小；接著，利用基因演算法〔GA〕選取κ個二維最佳主成分，以便對每個訓練主向量〔training subject vector〕_{c ml}產生投影向量〔projection vector〕_{a ml}，且該投影向量之運算如下式：，m=1,2,...,M，l=1,2,...,L，，κ<n The preferred embodiment of the present invention inputs the 3D gesture image [matrix a R ^ζ ,ζ n 〕 a column (column per image) to obtain a two-dimensional optimal principal component analysis, where ζ is the size of the image; then, using the genetic algorithm [GA] to select κ two-dimensional optimal principal components, so that for each Training subject vector _{c ml} produces a projection vector [ a projection vector ] _{a ml} , and the operation of the projection vector is as follows: , m =1,2,..., M , l =1,2,..., L , , κ < n

其中a _ml為第l影像之第m主向量，為第κ二維最佳主成分對應於選自樣本共變異〔co-variance〕矩陣之第m主向量。 Wherein a _ml is the main vector of the m l of images, The two-dimensional optimal principal component of the κ corresponds to the m -th main vector selected from the co-variance matrix of the sample.

請再參照第1及2圖所示，本發明較佳實施例之三維手勢影像辨識方法包含步驟S4：接著，將該適應性特徵向量與數個樣本特徵向量以進行比對分類，以分類該3D手勢影像，以便辨識該3D手勢影像之種類。舉例而言，在將該適應性特徵向量與數個樣本特徵向量以進行比對分類時，採用最近鄰居法〔KNN，k-nearest neighbors〕進行比對分類。利用在特徵空間中最接近測試影像的k個訓練樣本進行分類，將測試影像投影至PCA空間，並與訓練樣本比較，以便進行相似度的計算如下式： Referring to FIG. 1 and FIG. 2 again, the three-dimensional gesture image recognition method according to the preferred embodiment of the present invention includes the step S4: then, the adaptive feature vector and the plurality of sample feature vectors are compared and classified to classify the 3D gesture image to identify the type of the 3D gesture image. For example, when the adaptive feature vector is compared with a plurality of sample feature vectors for comparison, the nearest neighbor method [ KNN , k- nearest neighbors] is used for comparison classification. The k training samples closest to the test image in the feature space are classified, the test image is projected into the PCA space, and compared with the training samples, so that the similarity is calculated as follows:

其中S _k為計算出的相似度矩陣，k為所設定最鄰近的訓練樣本數量，N為特徵根數的最大值，F _te、F _tr為測試及訓練樣本的特徵向量，依照k值選擇相似度最近的k個訓練樣本，判斷測試資料與選定訓練樣本中的哪一類最相近，以進行分類的預測。 Where S _k is the calculated similarity matrix, k is the number of the nearest training samples, N is the maximum value of the feature roots, F _te and F _tr are the feature vectors of the test and training samples, and the similarity is selected according to the k value. The nearest k training samples determine which of the selected training samples is closest to the classification of the selected training samples for classification prediction.

本發明較佳實施例在分類該適應性特徵向量前，利用一馬式距離函數〔Mahalanobis distance function〕或一馬式餘弦〔cosine〕距離函數處理該3D手勢影像。本發明較佳實施例之馬式距離函數的計算如下式： In a preferred embodiment of the present invention, the 3D gesture image is processed using a Mahalanobis distance function or a cosine distance function before classifying the adaptive feature vector. The horse distance function of the preferred embodiment of the present invention is calculated as follows:

其中 c _t為第t測試二維最佳主成分之投影向量，ζ_mt為第m範疇〔category〕至第t測試影像之距離，均值u _m及變異值v _m為自第m範疇選取之二維最佳主成分，以留一交叉驗證〔leave-one-out〕法進行驗證，γ顯示為範疇標示碼，δ顯示為辨識門檻值〔threshold〕。 Where c _t is the projection vector of the two-dimensional optimal principal component of the t- test, ζ _mt is the distance from the m- category to the t- test image, and the mean u _m and the variation value v _m are selected from the m- th category. The optimal principal component of the dimension is verified by the leave-one-out method. γ is displayed as the category identification code, and δ is displayed as the threshold value.

第4圖揭示本發明較佳實施例之基因演算三維手勢影像辨識方法及其系統以k=3條件進行辨識3D手勢輪廓影像及3D手勢實心影像後，其辨識率與特徵向量關係之曲線示意圖。請參照第4圖所示，本發明較佳實施例採用辨識3D手勢影像包含平面旋轉手勢輪廓影像〔IC〕、平面旋轉手勢實心影像〔IS〕、深度旋轉手勢輪廓影像〔OC〕、深度旋轉手勢實心影像〔OSG〕及投影色彩空間深度旋轉手勢實心影像〔OSG PCS〕。 FIG. 4 is a schematic diagram showing the relationship between the recognition rate and the feature vector after the 3D gesture contour image and the 3D gesture solid image are recognized by the k =3 condition in the preferred embodiment of the present invention. Referring to FIG. 4, the preferred embodiment of the present invention uses a recognition 3D gesture image including a plane rotation gesture contour image (IC), a plane rotation gesture solid image [IS], a depth rotation gesture contour image [OC], and a deep rotation gesture. Solid image [OSG] and projected color space depth rotation gesture solid image [OSG PCS].

第5A圖揭示本發明較佳實施例之基因演算三維手勢影像辨識方法及其系統採用每影像取三種平面旋轉變異條件分別攝取九種3D手勢輪廓影像〔平面旋轉手勢輪廓影像，OC〕之示意圖。相對的，第5B圖揭示本發明較佳實施例之基因演算三維手勢影像辨識方法及其系統採用每影像取三種深度〔out-of-plane rotation〕旋轉變異條件分別攝取九種3D手勢實心影像〔深度旋轉手勢實心影像，OSG〕之示意圖。舉例而言，本發明較佳實施例採用旋轉角度分別為0度、±15度及±30度〔三種旋轉角度手勢由左至右方式排列〕。 FIG. 5A is a schematic diagram of a three-dimensional gesture image recognition method for a genetic algorithm according to a preferred embodiment of the present invention, and a system for capturing nine kinds of 3D gesture contour images (plane rotation gesture contour image, OC) by using three plane rotation variation conditions per image. In contrast, FIG. 5B discloses a three-dimensional gesture image recognition method for a genetic algorithm according to a preferred embodiment of the present invention, and a system for taking nine kinds of 3D gesture solid images by using three out-of-plane rotation conditions for each image. Deep rotation gesture solid image, OSG] schematic. For example, the preferred embodiment of the present invention uses rotation angles of 0 degrees, ±15 degrees, and ±30 degrees, respectively (three rotation angle gestures are arranged from left to right).

第6(A)及6(B)圖揭示本發明較佳實施例之基因演算三維手勢影像辨識方法及其系統由原始3D手勢實心經RGB影像及其處理後產生一PCS投影影像之示意圖。本發明較佳實施例之基因演算三維手勢影像辨識方法將選擇擷取至少一深度旋轉彩色手勢實心影像，如第6(A)圖所示。接著，將該深度旋轉彩色手勢實心影像以投影色彩空間方法取得RGB三通道的影像資訊。在投影色彩空間時，先產生一R通道影像、一G通道影像及一B通道影像，再將該RGB三通道的影像資訊投影至同一空間，以獲得一PCS手勢實心投影影像，如第6(B)圖所示。 6(A) and 6(B) are diagrams showing a method for recognizing a three-dimensional gesture image of a genetic algorithm according to a preferred embodiment of the present invention, and a system for generating a PCS projection image from an original 3D gesture through an RGB image and processing thereof. The genetic algorithm three-dimensional gesture image recognition method according to the preferred embodiment of the present invention will select at least one deep-rotation color gesture solid image, as shown in FIG. 6(A). Then, the depth-rotating color gesture solid image is used to obtain RGB three-channel image information by a projection color space method. When projecting the color space, first generate an R channel image, a G channel image and a B channel image, and then project the RGB three channel image information into the same space to obtain a PCS gesture solid projection image, such as the sixth ( B) The picture shows.

第7(a)至7(d)圖揭示本發明較佳實施例之基因演算三維手勢影像辨識方法及其系統採用添加三種高斯雜訊〔Gaussian noise〕變異條件〔添加10%、20%、30%雜訊〕進行測試後，將原始3D手勢輪廓影像與一系列測試 3D手勢輪廓影像之比較示意圖。請參照第7(a)至7(d)圖所示，將該原始3D手勢輪廓影像〔如第7(a)圖所示〕添加的高斯雜訊平均值均為0，且在該3D手勢輪廓影像添加10%、20%、30%雜訊的變異數進行辨識，如第7(b)至7(d)圖所示。表1顯示在該3D手勢輪廓影像添加10%、20%、30%雜訊的變異數下，且其TPR及TNR辨識率仍高達100%。 7(a) to 7(d) show a method for identifying a three-dimensional gesture image of a genetic algorithm according to a preferred embodiment of the present invention, and a system for adding three Gaussian noise variations conditions [adding 10%, 20%, 30) % noise] After testing, the original 3D gesture contour image and a series of tests A schematic diagram of the comparison of 3D gesture contour images. Referring to Figures 7(a) through 7(d), the average of the Gaussian noise added to the original 3D gesture contour image (as shown in Fig. 7(a)) is 0, and the 3D gesture is The contour image is identified by adding 10%, 20%, and 30% noise, as shown in Figures 7(b) through 7(d). Table 1 shows that the variation of the 10%, 20%, and 30% noise is added to the 3D gesture contour image, and the TPR and TNR recognition rates are still as high as 100%.

第8(a)至8(d)圖揭示本發明較佳實施例之基因演算三維手勢影像辨識方法及其系統採用添加三種高斯雜訊變異條件〔10%、20%、30%〕進行測試後，將原始3D手勢實心影像與一系列測試3D手勢實心影像之比較示意圖。請參照第8(a)至8(d)圖所示，將該原始3D手勢實心影像〔如第8(a)圖所示〕添加的高斯雜訊平均值均為0，且在該3D手勢實心影像添加10%、20%、30%雜訊的變異數進行辨識，如第8(b)至8(d)圖所示。表2顯示在該3D手勢實心影像添加10%、20%、30%雜訊的變異數下，且其TPR及TNR辨識率仍高達100%。 8(a) to 8(d) show a method for identifying a three-dimensional gesture image of a genetic algorithm according to a preferred embodiment of the present invention, and a system for adding three Gaussian noise variations conditions (10%, 20%, 30%) for testing. A comparison of the original 3D gesture solid image with a series of test 3D gesture solid images. Referring to Figures 8(a) to 8(d), the average of the Gaussian noise added to the original 3D gesture solid image (as shown in Fig. 8(a)) is 0, and the 3D gesture is Solid images were identified by adding 10%, 20%, and 30% noise, as shown in Figures 8(b) through 8(d). Table 2 shows that the 10%, 20%, and 30% noise is added to the solid image of the 3D gesture, and the TPR and TNR recognition rates are still as high as 100%.

表2：添加高斯雜訊3D手勢實心影像之辨識結果 Table 2: Identification results of adding solid image of Gaussian noise 3D gesture

前述較佳實施例僅舉例說明本發明及其技術特徵，該實施例之技術仍可適當進行各種實質等效修飾及/或替換方式予以實施；因此，本發明之權利範圍須視後附申請專利範圍所界定之範圍為準。本案著作權限制使用於中華民國專利申請用途。 The foregoing preferred embodiments are merely illustrative of the invention and the technical features thereof, and the techniques of the embodiments can be carried out with various substantial equivalent modifications and/or alternatives; therefore, the scope of the invention is subject to the appended claims. The scope defined by the scope shall prevail. The copyright limitation of this case is used for the purpose of patent application in the Republic of China.

Claims

A method for recognizing a three-dimensional gesture image of a genetic algorithm, comprising: capturing a gesture by using a light field photographing unit to obtain a 3D gesture image, wherein the 3D gesture image is a 3D light field gesture image, and the 3D light field gesture image Include a 3D light field information; project the 3D light field gesture image into a predetermined recognition space, and the predetermined identification space is a space to obtain a 3D light field feature vector; and process the 3D light field feature by using a genetic algorithm Obtaining an adaptive 3D light field feature vector; and classifying the adaptive 3D light field feature vector and the plurality of sample feature vectors to classify the 3D gesture image to identify the type of the 3D gesture image .

The method for recognizing a three-dimensional gesture image according to the first aspect of the patent application scope, wherein the 3D light field gesture image comprises a plane image information and a depth information.

The method for recognizing a three-dimensional gesture image according to the first aspect of the patent application scope, wherein the 3D light field gesture image is a 3D light field gesture contour image or a 3D light field gesture solid image.

According to the third aspect of the patent application scope, the three-dimensional gesture image recognition method is performed, wherein the 3D light field gesture image is converted into a projection color space by using a projection color space method to obtain an R channel image information and a G channel image information. And a B channel image information.

The method for recognizing a three-dimensional gesture image according to the first aspect of the patent application scope, wherein when the 3D light field gesture image is projected to the predetermined recognition space, the principal component analysis method or the two-dimensional optimal principal component analysis method is used. projection.

The three-dimensional gesture image recognition method according to the first aspect of the patent application scope, wherein when the adaptive 3D light field feature vector and the plurality of sample feature vectors are compared for classification, the nearest neighbor method is used for comparison and classification. .

A gene calculus three-dimensional gesture image recognition system includes: a light field photographing unit that takes a gesture motion, and the 3D gesture image is a 3D light field gesture image, and the 3D light field gesture image includes a 3D light field information Obtaining a 3D gesture image; a calculation unit connected to the light field photography unit, projecting the 3D light field gesture image to a predetermined recognition space, and the predetermined recognition space is a space to obtain a 3D light field The feature vector is processed by a gene algorithm to obtain an adaptive 3D light field feature vector, and the adaptive 3D light field feature vector and the plurality of sample feature vectors are compared and classified. The 3D light field gesture image is classified to identify the type of the 3D light field gesture image; and an output unit is connected to the calculation unit to output the type of the 3D light field gesture image.

The three-dimensional gesture image recognition system according to the seventh aspect of the patent application scope, wherein the 3D light field gesture image is a 3D light field gesture contour image or a 3D light field gesture solid image.

According to the genetic algorithm of claim 8, the three-dimensional gesture image recognition system, wherein the 3D light field gesture image is converted into a projection color space by using a projection color space method to obtain an R channel image information and a G channel image information. And a B channel image information.

The three-dimensional gesture image recognition system according to the seventh aspect of the patent application scope, wherein when the 3D light field gesture image is projected to the predetermined recognition space, the principal component analysis method or the two-dimensional optimal principal component analysis method is used. Projection, and when the adaptive 3D light field feature vector is compared with a plurality of sample feature vectors for comparison, the nearest neighbor method is used for comparison classification.