TWI494865B

TWI494865B - Depth-based subject extraction method

Info

Publication number: TWI494865B
Application number: TW100143816A
Authority: TW
Inventors: Chu Sing Yang
Original assignee: Univ Nat Cheng Kung
Priority date: 2011-11-29
Filing date: 2011-11-29
Publication date: 2015-08-01
Also published as: TW201322146A

Description

Method of intercepting a target from depth information

本發明涉及從深度資訊中截取目標物的方法，尤指一種從深度資訊中截取目標人體的方法。The invention relates to a method for intercepting a target object from depth information, in particular to a method for intercepting a target human body from depth information.

使用攝影機拍攝的人體影像需要先分離目標人體(前景)與背景。前景擷取方法已有許多研究，例如幀差法、背景建模與光流法等，茲簡述如下。Human images taken with a camera need to be separated from the target body (foreground) and background. There have been many studies on foreground extraction methods, such as frame difference method, background modeling and optical flow method, which are briefly described below.

幀差法：將影像與前一幀的影像相減，取變化值大於一閥值的部分作為前景。Frame difference method: subtracts the image from the image of the previous frame, and takes the portion whose change value is greater than a threshold as the foreground.

背景建模：將畫面中較無變動的部分，建立成一背景模型，再將輸入影像與背景相減得到前景。Background Modeling: Create a background model with less changes in the picture, and subtract the input image from the background to get the foreground.

光流法：計算畫面中各點的移動向量，取等速移動或移動量大的部分作為前景。Optical flow method: Calculate the motion vector of each point in the picture, and take the part with constant speed movement or large amount of movement as the foreground.

但這些方法多基於「移動物體為前景」的假設，且受限於色彩影像輸入，易受到畫面中的光線、人物陰影與變動背景部分干擾，更有無法辨別同色背景的問題。However, these methods are mostly based on the assumption that "moving objects are foreground", and are limited by color image input, which is easily interfered by the light in the picture, the shadow of the person and the background of the change, and there is a problem that the background of the same color cannot be discerned.

職是之故，發明人鑒於習知技術之缺失，乃思及改良發明之意念，終能發明出本案之「從深度資訊中截取目標物的方法」。As a result of his position, the inventor, in view of the lack of prior art, thought of and improved the idea of invention, and finally invented the "method of intercepting objects from in-depth information" in this case.

本發明提出的方法適用於深度感測裝置所獲得的深度矩陣，與用於灰階或彩色二維影像的方法相比，能利用額外的距離維度資訊來分離出空間中的物件。與一般使用深度閥值(將目標物的前後深度定為最小/最大深度，過濾掉之外的部分)的方式相比，本發明所提出的方法不須定義精確的閥值大小，容許較大的人體移動範圍，可以更有效的截取人體輪廓。另與使用深度資訊來定位人體各部位、關節位置的方法相比，本發明所提出的方法不需先前資料的訓練與比對，能降低運行時的計算量。The method proposed by the present invention is applicable to a depth matrix obtained by a depth sensing device, and can utilize additional distance dimension information to separate objects in a space compared to a method for grayscale or color two-dimensional images. Compared with the general use of the depth threshold (the front and rear depth of the target is set to the minimum/maximum depth, the portion other than the filtering), the method proposed by the present invention does not need to define an accurate threshold value, and allows a larger The range of human body movement can more effectively intercept the contours of the human body. In addition, compared with the method of using depth information to locate various parts and joint positions of the human body, the method proposed by the invention does not require training and comparison of previous data, and can reduce the calculation amount at the time of operation.

本案之主要目的在於提供一種從深度資訊中截取目標物的方法，能利用額外的距離維度資訊來分離出空間中的物件，不須定義精確的閥值大小，容許較大的人體移動範圍，可以更有效的截取人體輪廓，且本發明所提出的方法不需先前資料的訓練與比對，能降低運行時的計算量。The main purpose of this case is to provide a method for intercepting objects from deep information. It can use additional distance dimension information to separate objects in space without defining an accurate threshold value and allowing a larger range of human body movement. The human body contour is intercepted more effectively, and the method proposed by the invention can reduce the calculation amount at the time of operation without training and comparison of the previous data.

本案之又一主要目的在於提供一種從一深度資訊中截取一目標物的方法，包含下列之步驟：提供一深度矩陣，用以產生複數個三維空間點；依據該複數個三維空間點以重新取樣並補入至少一延伸點，俾產生複數個三維連通區塊；當該複數個三維連通區塊的其中之任一具有一體積大於一預設值時，則計算該三維連通區塊之一質心與一已知之目標區塊之一質心的一距離；計算各該距離中之一最小值，並將該距離為該最小值之一特定之該三維連通區塊設定為一主區塊；當該複數個三維連通區塊的其中之任一具有該體積小於或等於該預設值與位於該主區塊之一前視圖邊界，且具有一深度值小於該主區塊之一深度值時，則將該體積小於或等於該預設值之一特定之該三維連通區塊設定為一前端區塊；以及將該主區塊與該前端區塊之一點集合聯集作為該已知之目標區塊。Another main object of the present invention is to provide a method for intercepting an object from a depth information, comprising the steps of: providing a depth matrix for generating a plurality of three-dimensional spatial points; and resampling according to the plurality of three-dimensional spatial points And adding at least one extension point to generate a plurality of three-dimensional connected blocks; when any one of the plurality of three-dimensional connected blocks has a volume greater than a predetermined value, calculating one of the three-dimensional connected blocks a distance from a centroid of a heart to a known target block; calculating a minimum value of each of the distances, and setting the three-dimensional connected block specified as one of the minimum values as a primary block; When any one of the plurality of three-dimensional connected blocks has the volume less than or equal to the preset value and is located at a front view boundary of the main block, and has a depth value smaller than a depth value of the main block And setting the three-dimensional connected block whose volume is less than or equal to one of the preset values as a front end block; and associating the main block with a point of the front end block as the known item Blocks.

本案之下一主要目的在於提供一種從一深度資訊中截取一目標物的方法，包含下列之步驟：提供一深度矩陣，用以產生複數個三維連通區塊；當該複數個三維連通區塊的其中之任一具有一體積大於一預設值時，則計算該三維連通區塊之一質心與一已知之目標區塊之一質心的一距離；計算各該距離中之一最小值，並將該距離為該最小值之一特定之該三維連通區塊設定為一主區塊；當該複數個三維連通區塊的其中之任一具有該體積小於或等於該預設值，且具有一深度值小於該主區塊之一深度值時，則將該體積小於或等於該預設值之一特定之該三維連通區塊設定為一前端區塊；以及將該主區塊與該前端區塊之一點集合聯集作為該已知之目標區塊。A primary object of the present invention is to provide a method for intercepting an object from a depth information, comprising the steps of: providing a depth matrix for generating a plurality of three-dimensional connected blocks; and when the plurality of three-dimensional connected blocks are When any one of the volumes has a volume greater than a predetermined value, calculating a distance between a centroid of the three-dimensional connected block and a centroid of a known target block; calculating a minimum value of each of the distances, And setting the three-dimensional connected block whose distance is one of the minimum values to a primary block; and when any one of the plurality of three-dimensional connected blocks has the volume being less than or equal to the preset value, and having When a depth value is less than a depth value of the primary block, the three-dimensional connected block whose volume is less than or equal to one of the preset values is set as a front end block; and the main block and the front end are One of the blocks is a collection of points as the known target block.

本案之再一主要目的在於提供一種從一深度矩陣中截取一目標物的方法，包含下列之步驟：從該深度矩陣中獲取複數個區塊；從該等區塊中確定一主區塊；從該等區塊中確定一前端區塊；以及聯集該主區塊與該前端區塊以作為該目標物。A further main object of the present invention is to provide a method for intercepting a target from a depth matrix, comprising the steps of: obtaining a plurality of blocks from the depth matrix; determining a primary block from the blocks; Determining a front end block in the blocks; and associating the main block with the front end block as the target.

為了讓本發明之上述目的、特徵、和優點能更明顯易懂，下文特舉較佳實施例，並配合所附圖式，作詳細說明如下：The above described objects, features, and advantages of the present invention will become more apparent and understood.

本發明所提岀之方法是一種基於深度資訊的目標人體擷取方法，係利用一深度感測裝置取得場景的連續深度矩陣，再藉由對此深度資訊進行後續處理，定義出空間中的物件，以取出感興趣的目標物。首先，此方法對單張深度矩陣做前處理轉為空間點，並重新對深度值取樣並延伸來連接一物件的點。接著從處理後的空間點判斷所屬物件與其體積，過濾掉過小的物件，根據前張的位置定位人體，並處理連接處被遮蔽的問題。藉此達成擷取目標人體之目的。The method proposed by the present invention is a target human body capturing method based on depth information, which uses a depth sensing device to obtain a continuous depth matrix of a scene, and then performs subsequent processing on the depth information to define objects in the space. To take out the target of interest. First, this method converts the single depth matrix into a spatial point and resamples the depth value and extends to connect the points of an object. Then, the object and its volume are judged from the processed space point, the small object is filtered out, the human body is positioned according to the position of the front sheet, and the problem that the joint is shielded is dealt with. In this way, the goal of capturing the target human body is achieved.

本發明所提岀之方法將從一感測裝置所得到的連續深度矩陣中取出目標物所占的位置。深度矩陣為紀錄一場景深度資訊的矩陣，表示此場景中物體到感測裝置的平行距離。若此感測裝置之感測範圍為長h寬w，則此深度矩陣即為h行w列之矩陣。而矩陣之一元素為此點到感測裝置面上的深度值，越近越小，隨著距離越遠而有越大的值。如第一圖所示，場景中有一向前舉臂之人體，感測裝置的感測範圍為長120寬160，則可以得到一160x120之矩陣。其中位於(75,50)的元素值代表原畫面中人體手臂距離感測裝置的值。The method of the present invention extracts the position occupied by the target from the continuous depth matrix obtained by a sensing device. The depth matrix is a matrix that records a scene depth information, indicating the parallel distance of the object to the sensing device in the scene. If the sensing range of the sensing device is a long h width w, the depth matrix is a matrix of h rows and w columns. The depth of one of the elements of the matrix from the point to the surface of the sensing device is smaller and smaller, and the larger the distance, the larger the value. As shown in the first figure, there is a human body with a forward arm in the scene. The sensing range of the sensing device is 120 pixels long and 160, so that a matrix of 160x120 can be obtained. The element value at (75, 50) represents the value of the human arm distance sensing device in the original picture.

本例中此距離值以公釐記數，範圍約從500至6000。而軀幹部份的值皆大於手掌部分，亦即手掌部分離裝置較近。設若此裝置每秒可傳回多張深度矩陣，將一段時間內的所有深度矩陣依序收集，稱之為連續深度矩陣，如第二圖所示，可用以表示每個時間點上的深度資訊。此深度矩陣可由多種方式取得，如TOF camera,Stereo Camera等。In this example, the distance value is measured in metric and ranges from about 500 to 6,000. The value of the trunk portion is greater than the palm portion, that is, the palm separation device is closer. If the device can return multiple depth matrices per second, all the depth matrices in a period of time are collected sequentially, which is called a continuous depth matrix. As shown in the second figure, it can be used to represent the depth information at each time point. . This depth matrix can be obtained in a variety of ways, such as TOF camera, Stereo Camera, and the like.

以下動作流程參考第三圖，對連續深度矩陣其中之一張執行，設為第k張深度矩陣：The following action flow refers to the third figure and is performed on one of the continuous depth matrices, and is set as the kth depth matrix:

1.　將深度矩陣轉為空間點：即原本位於矩陣中(x,y)的元素其值為z，則將此元素轉為一三維座標點(x,y,z)。如第四圖，可見場景中各物件在三維空間上有深度上的區隔。但一物件上的點未完全鄰接，因此需要下述兩步驟處理。1. Convert the depth matrix to a spatial point: the element originally located in the matrix (x, y) has a value of z, which is converted into a three-dimensional coordinate point (x, y, z). As shown in the fourth figure, it can be seen that the objects in the scene have a depth interval in the three-dimensional space. However, the points on an object are not completely adjacent, so the following two steps are required.

i.深度重新取樣：將所有點上的z軸值同除一取樣範圍s，使各點的距離縮近。代表此範圍內，只要有座標點存在即視為有值。i. Depth resampling: Divide the z-axis values at all points by a sampling range s to make the distance of each point narrow. Representing this range, as long as there is a punctuation point, it is considered to have a value.

ii.補延伸點：在每點的z軸上向後一定距離t作補點動作。經過此二步驟後將盡可能使畫面中屬於同一物件的點連通。上述該兩步驟其執行後之結果分別如第五圖(a)與第五圖(b)所示。亦即第五圖(a)與第五圖(b)分別顯示一依據本發明之較佳實施例與第三圖所示之流程圖所獲得之未處理的點集之二維上視圖與重新取樣並補點後之二維上視圖。Ii. Complementary extension point: Make a supplementary point motion at a certain distance t backward on the z-axis of each point. After these two steps, the points belonging to the same object in the picture are connected as much as possible. The results of the above two steps after execution are as shown in the fifth (a) and fifth (b), respectively. That is, the fifth (a) and fifth (b) drawings respectively show a two-dimensional top view and a re-set of the unprocessed point set obtained according to the preferred embodiment of the present invention and the flowchart shown in the third figure. A two-dimensional top view after sampling and complementing.

2.三維連通區塊標記：對上述取樣補點後的點集合進行連通區塊標記，即將相連的點標示為同一區域。例如(2,1,2)與(2,1,3)、(2,2,2)會被判斷為相連區塊，但(9,8,7)則不屬於上述區塊(若其中沒有其他相連點)。經過連通區塊標記過程的步驟，其結果如第六圖所示，其係顯示一依據本發明之較佳實施例與第三圖所示之流程圖所獲得之三維連通區塊標記後的二維示意圖。圖上編號為該區塊的標記編號(在顯示幕的畫面上可以顏色區分)，圓圈標示該區塊質心位置，圓圈大小則代表區塊點數量。2. Three-dimensional connected block mark: Connect the block set after the above-mentioned sampling and complementing points, and mark the connected points as the same area. For example, (2,1,2) and (2,1,3), (2,2,2) will be judged as connected blocks, but (9,8,7) will not belong to the above block (if there is no Other connected points). After the step of connecting the block marking process, the result is as shown in the sixth figure, which shows a two-dimensional connected block mark obtained according to the preferred embodiment of the present invention and the flow chart shown in the third figure. Schematic diagram. The figure is numbered as the mark number of the block (can be color-coded on the screen of the display screen), the circle indicates the centroid position of the block, and the circle size represents the number of block points.

3.對各區塊做如下判斷：3. Make the following judgments for each block:

i.　將區塊以其體積分為大區塊與小區塊i. Divide the block into large blocks and blocks by its volume

ii.　從大區塊中根據第k-1張深度矩陣之目標物位置，選出質心距離最近的區塊做為主區塊。(初始化k=0時，使用手動圈選或人臉辨識來決定初始目標位置。)Ii. From the large block, according to the target position of the k-1th depth matrix, the block with the closest centroid distance is selected as the main block. (When initializing k=0, use manual circle selection or face recognition to determine the initial target position.)

iii.　從小區塊中根據上一步驟之主區塊，經下述判斷選出：Iii. From the cell block, according to the main block of the previous step, select the following:

a.　在前視角上，其邊緣有相鄰。且a. In the front view, the edges are adjacent. And

b.　深度值小於主區塊b. The depth value is smaller than the main block

將符合條件的小區塊加入為前端區塊Add eligible cell blocks as front-end blocks

最後合併主區塊與前端區塊，即為新的目標物，第七圖為最後的單張目標擷取結果，第八圖則為連續深度矩陣的目標擷取結果。Finally, the main block and the front block are merged, which is the new target. The seventh picture is the final single target extraction result, and the eighth picture is the continuous depth matrix target extraction result.

Example:

1.一種從一深度資訊中截取一目標物的方法，包含下列之步驟：1. A method of intercepting a target from a depth information, comprising the steps of:

提供一深度矩陣，用以產生複數個三維空間點；依據該複數個三維空間點以重新取樣並補入至少一延伸點，俾產生複數個三維連通區塊；當該複數個三維連通區塊的其中之任一具有一體積大於一預設值時，則計算該三維連通區塊之一質心與一已知之目標區塊之一質心的一距離；計算各該距離中之一最小值，並將該距離為該最小值之一特定之該三維連通區塊設定為一主區塊；當該複數個三維連通區塊的其中之任一具有該體積小於或等於該預設值與位於該主區塊之一前視圖邊界，且具有一深度值小於該主區塊之一深度值時，則將該體積小於或等於該預設值之一特定之該三維連通區塊設定為一前端區塊；以及將該主區塊與該前端區塊之一點集合聯集作為該已知之目標區塊。Providing a depth matrix for generating a plurality of three-dimensional spatial points; re-sampling and supplementing at least one extension point according to the plurality of three-dimensional spatial points to generate a plurality of three-dimensional connected blocks; and when the plurality of three-dimensional connected blocks are When any one of the volumes has a volume greater than a predetermined value, calculating a distance between a centroid of the three-dimensional connected block and a centroid of a known target block; calculating a minimum value of each of the distances, And setting the three-dimensional connected block whose distance is one of the minimum values to a primary block; and when any one of the plurality of three-dimensional connected blocks has the volume less than or equal to the preset value and located at the a front view boundary of one of the main blocks, and having a depth value smaller than a depth value of the main block, setting the three-dimensional connected block whose volume is less than or equal to one of the preset values as a front end area a block; and associating the primary block with a point set of the front end block as the known target block.

2.根據實施例1所述之方法，其中該提供一深度矩陣步驟，更包括下列之步驟：2. The method of embodiment 1, wherein the step of providing a depth matrix further comprises the steps of:

進行一深度重新取樣，將各該複數個三維空間點的一z軸座標值同除一取樣範圍，其中當該取樣範圍內，有至少一座標點存在時，即視為有一值；以及在各該複數個三維空間點的一z軸上，向後一固定距離作一補延伸點動作。Performing a depth resampling to divide a z-axis coordinate value of each of the plurality of three-dimensional spatial points by a sampling range, wherein when at least one punctuation exists in the sampling range, the value is regarded as having a value; On a z-axis of a plurality of three-dimensional space points, a complementary extension point action is performed for a fixed distance.

3.根據實施例1或2所述之方法，其中該提供一深度矩陣步驟，更包括下列之步驟：3. The method of embodiment 1 or 2, wherein the step of providing a depth matrix further comprises the steps of:

將彼此相鄰接的各該複數個三維空間點標示為同一區域，以產生該複數個三維連通區塊。Each of the plurality of three-dimensional spatial points adjacent to each other is marked as the same area to generate the plurality of three-dimensional connected blocks.

4.根據以上任一實施例所述之方法，其中該已知之目標區塊之一初始目標位置是以一手動圈選方式或以一人臉辯識方式來決定，且該目標物為一人體。4. The method according to any of the preceding embodiments, wherein the initial target position of one of the known target blocks is determined by a manual circle or by a face recognition mode, and the target is a human body.

5.一種從一深度資訊中截取一目標物的方法，包含下列之步驟：5. A method of intercepting a target from a depth information, comprising the steps of:

提供一深度矩陣，用以產生複數個三維連通區塊；當該複數個三維連通區塊的其中之任一具有一體積大於一預設值時，則計算該三維連通區塊之一質心與一已知之目標區塊之一質心的一距離；計算各該距離中之一最小值，並將該距離為該最小值之一特定之該三維連通區塊設定為一主區塊；當該複數個三維連通區塊的其中之任一具有該體積小於或等於該預設值，且具有一深度值小於該主區塊之一深度值時，則將該體積小於或等於該預設值之一特定之該三維連通區塊設定為一前端區塊；以及將該主區塊與該前端區塊之一點集合聯集作為該已知之目標區塊。Providing a depth matrix for generating a plurality of three-dimensional connected blocks; and when any one of the plurality of three-dimensional connected blocks has a volume greater than a predetermined value, calculating a centroid of the three-dimensional connected block a distance from a centroid of a known target block; calculating a minimum value of each of the distances, and setting the three-dimensional connected block specified as one of the minimum values as a primary block; If any one of the plurality of three-dimensional connected blocks has the volume less than or equal to the preset value, and has a depth value smaller than a depth value of the primary block, the volume is less than or equal to the preset value. A specific three-dimensional connected block is set as a front end block; and the main block is combined with a point of the front end block as the known target block.

6.根據實施例5所述之方法，其中該前端區塊位於該主區塊之一前視圖邊界，且該提供一深度矩陣步驟更包括下列之步驟：6. The method of embodiment 5, wherein the front end block is located at a front view boundary of the main block, and the step of providing a depth matrix further comprises the steps of:

進行一深度重新取樣，將各該複數個三維空間點的一z軸座標值同除一取樣範圍，其中當該取樣範圍內，有至少一座標點存在時，即視為有值；在各該複數個三維空間點的一z軸上向後一定距離t作一補延伸點動作；以及將彼此相鄰接的各該複數個三維空間點標示為同一區域，以產生該複數個三維連通區塊。Performing a depth resampling to divide a z-axis coordinate value of each of the plurality of three-dimensional spatial points by a sampling range, wherein when at least one punctuation exists in the sampling range, the value is regarded as having a value; a z-axis of a three-dimensional space point is made a complementary extension point motion by a certain distance t; and each of the plurality of three-dimensional space points adjacent to each other is marked as the same area to generate the plurality of three-dimensional connected blocks.

7.一種從一深度矩陣中截取一目標物的方法，包含下列之步驟：7. A method of intercepting a target from a depth matrix, comprising the steps of:

從該深度矩陣中獲取複數個區塊；從該等區塊中確定一主區塊；從該等區塊中確定一前端區塊；以及聯集該主區塊與該前端區塊以作為該目標物。Obtaining a plurality of blocks from the depth matrix; determining a primary block from the blocks; determining a front end block from the blocks; and associating the main block with the front end block as the Target.

8.根據實施例7所述之方法，其中該從該深度矩陣中獲取複數個區塊步驟，更包括下列之步驟：8. The method of embodiment 7, wherein the step of obtaining a plurality of blocks from the depth matrix further comprises the steps of:

使用該深度矩陣中之一元素(x,y)之一深度值z，以產生一三維座標點(x,y,z)；當該三維座標點(x,y,z)之該深度值z小於一預定值時，即以該三維座標點(x,y,z)代表一三維空間點，使產生複數個三維空間點；以及依據該複數個三維空間點以重新取樣並補入至少一延伸點，俾產生該複數個區塊。Using a depth value z of one of the elements (x, y) in the depth matrix to generate a three-dimensional coordinate point (x, y, z); when the depth value z of the three-dimensional coordinate point (x, y, z) When less than a predetermined value, the three-dimensional coordinate point (x, y, z) represents a three-dimensional spatial point, so that a plurality of three-dimensional spatial points are generated; and according to the plurality of three-dimensional spatial points, re-sampling and supplementing at least one extension Point, 俾 produces the plurality of blocks.

9.根據實施例7或8所述之方法，其中該從該等區塊中確定一主區塊步驟，更包括下列之步驟：9. The method of embodiment 7 or 8, wherein the step of determining a primary block from the blocks further comprises the steps of:

當該複數個區塊的其中之任一具有一體積大於一預設值時，則計算該區塊之一質心與一已知之目標區塊之一質心的一距離；以及計算各該距離中之一最小值，並將該距離為該最小值之一特定之該區塊設定為一主區塊，且該從該等區塊中確定一前端區塊之步驟，更包括一步驟：當該複數個三維連通區塊的其中之任一具有該體積小於或等於該預設值與位於該主區塊之一前視圖邊界，且具有一深度值小於該主區塊之一深度值時，則將該體積小於或等於該預設值之一特定之該三維連通區塊設定為一前端區塊。When any one of the plurality of blocks has a volume greater than a predetermined value, calculating a distance between a centroid of the block and a centroid of a known target block; and calculating the distance One of the minimum values, and the distance is specified as one of the minimum values, the block is set as a main block, and the step of determining a front end block from the blocks further includes a step: when When any one of the plurality of three-dimensional connected blocks has a volume less than or equal to the preset value and is located at a front view boundary of the main block, and has a depth value smaller than a depth value of the main block, Then, the three-dimensional connected block whose volume is less than or equal to one of the preset values is set as a front end block.

10.根據以上任一實施例所述之方法，其中各該區塊為一三維連通區塊，該已知之目標區塊之一初始目標位置是以一手動圈選方式或以一人臉辯識方式來決定，且該目標物為一人體。10. The method according to any of the preceding embodiments, wherein each of the blocks is a three-dimensional connected block, and an initial target position of the known target block is manually circled or recognized by a face. To decide, and the target is a human body.

綜上所述，本發明提供一種從深度資訊中截取目標物的方法，能利用額外的距離維度資訊來分離出空間中的物件，不須定義精確的閥值大小，容許較大的人體移動範圍，可以更有效的截取人體輪廓，且本發明所提出的方法不需先前資料的訓練與比對，能降低運行時的計算量，故其確實具有進步性與新穎性。In summary, the present invention provides a method for intercepting a target object from depth information, which can utilize additional distance dimension information to separate objects in space without defining an accurate threshold value and allowing a larger range of human body movement. The human body contour can be intercepted more effectively, and the method proposed by the invention does not require the training and comparison of the prior data, and can reduce the calculation amount at the time of operation, so it is indeed progressive and novel.

是以，縱使本案已由上述之實施例所詳細敘述而可由熟悉本技藝之人士任施匠思而為諸般修飾，然皆不脫如附申請專利範圍所欲保護者。Therefore, even though the present invention has been described in detail by the above-described embodiments, it can be modified by those skilled in the art, and is not intended to be protected as claimed.

第一圖：其係顯示一以灰階顏色對應表示其各點深度值的輸入之深度矩陣的示意圖；First: a schematic diagram showing a depth matrix corresponding to the input of the depth value of each point in grayscale color;

第二圖：其係顯示一以灰階顏色對應表示其各點深度值的連續深度矩陣的示意圖；The second figure: a schematic diagram showing a continuous depth matrix corresponding to the depth value of each point in gray scale color;

第三圖：其係顯示一依據本發明之較佳實施例所提出之從一深度資訊中截取一目標物的方法之流程圖；FIG. 3 is a flow chart showing a method for intercepting an object from a depth information according to a preferred embodiment of the present invention;

第四圖：其係顯示一將深度矩陣轉換至三維空間點之三維示意圖；The fourth picture: a three-dimensional diagram showing the conversion of the depth matrix to the three-dimensional space point;

第五圖(a)與第五圖(b)：其係分別顯示一依據本發明之較佳實施例與第三圖所示之流程圖所獲得之未處理的點集之二維上視圖與重新取樣並補點後之二維上視圖；Fifth (a) and fifth (b): respectively showing a two-dimensional top view of an unprocessed set of points obtained according to the preferred embodiment of the present invention and the flow chart shown in FIG. Re-sampling and replenishing the 2D top view;

第六圖：其係顯示一依據本發明之較佳實施例與第三圖所示之流程圖所獲得之三維連通區塊標記後的二維示意圖；Figure 6 is a two-dimensional diagram showing a three-dimensional connected block mark obtained according to the preferred embodiment of the present invention and the flow chart shown in Figure 3;

第七圖：其係顯示一單張目標擷取結果的二維示意圖；以及Figure 7: A two-dimensional diagram showing the results of a single target capture;

第八圖：其係顯示一連續前景擷取結果的二維示意圖。Figure 8: A two-dimensional diagram showing the results of a continuous foreground capture.

Claims

A method for intercepting an object from a depth information, comprising the steps of: providing a depth matrix for generating a plurality of three-dimensional spatial points; re-sampling and supplementing at least one extension point according to the plurality of three-dimensional spatial points;俾 generating a plurality of three-dimensional connected blocks; when any one of the plurality of three-dimensional connected blocks has a volume greater than a predetermined value, calculating a centroid of the three-dimensional connected block and a known target block a distance of one centroid; calculating a minimum value of each of the distances, and setting the three-dimensional connected block specified as one of the minimum values as a main block; when the plurality of three-dimensional connected blocks If any one of the volumes has a volume less than or equal to the preset value and is located at a front view boundary of the main block, and has a depth value smaller than a depth value of the main block, the volume is less than or equal to One of the preset values specifies that the three-dimensional connected block is set as a front end block; and the main block is combined with a point of the front end block as the known target block.

The method of claim 1, wherein the step of providing a depth matrix further comprises the steps of: performing a depth resampling, and dividing a z-axis coordinate value of each of the plurality of three-dimensional spatial points by one sampling. a range, wherein when at least one punctuation exists in the sampling range, it is regarded as having a value; and on a z-axis of each of the plurality of three-dimensional space points, a complementary extension point action is performed on the latter fixed distance.

The method of claim 2, wherein the providing a depth matrix step further comprises the steps of: marking each of the plurality of three-dimensional spatial points adjacent to each other as the same area to generate the plurality of three-dimensional Connected blocks.

The method of claim 1, wherein the initial target position of the known target block is determined by a manual circle or by a face recognition method, and the target is a human body.

A method for intercepting an object from a depth information, comprising the steps of: providing a depth matrix for generating a plurality of three-dimensional connected blocks; and when any one of the plurality of three-dimensional connected blocks has a volume greater than a predetermined value, calculating a distance between a centroid of the three-dimensional connected block and a centroid of a known target block; calculating a minimum value of each of the distances, and the distance is the minimum One of the three-dimensional connected blocks is set as a main block; when any one of the plurality of three-dimensional connected blocks has the small size When the value is equal to or greater than the preset value, and the depth value is less than the depth value of the primary block, the three-dimensional connected block whose volume is less than or equal to one of the preset values is set as a front end block. And associating the primary block with a point set of the front end block as the known target block.

The method of claim 5, wherein the front end block is located at a front view boundary of the main block, and the step of providing a depth matrix further comprises the steps of: performing a depth resampling, each of which A z-axis coordinate value of a plurality of three-dimensional spatial points is equal to a sampling range, wherein when at least one punctuation exists in the sampling range, it is regarded as having a value; on a z-axis of each of the plurality of three-dimensional spatial points A complementary extension point action is performed for a certain distance t; and each of the plurality of three-dimensional space points adjacent to each other is marked as the same area to generate the plurality of three-dimensional connected blocks.

A method for extracting a target from a depth matrix, comprising the steps of: obtaining a plurality of blocks from the depth matrix; and when any one of the plurality of blocks has a volume greater than a predetermined value , calculating one of the centroids of the block and one of the known target blocks a distance of the centroid; calculating a minimum value of each of the distances, and setting the block to be one of the minimum values to be a primary block; determining a front end region from the plurality of blocks a block; and a union of the main block and the front end block as the target.

The method of claim 7, wherein the step of obtaining a plurality of blocks from the depth matrix further comprises the step of using a depth value z of one of the elements (x, y) in the depth matrix To generate a three-dimensional coordinate point (x, y, z); when the depth value z of the three-dimensional coordinate point (x, y, z) is less than a predetermined value, that is, the three-dimensional coordinate point (x, y, z) Representing a three-dimensional spatial point such that a plurality of three-dimensional spatial points are generated; and the plurality of blocks are generated by re-sampling and supplementing at least one extended point based on the plurality of three-dimensional spatial points.

The method of claim 7, wherein the step of determining a front end block from the blocks further comprises the step of: when any one of the plurality of three-dimensional connected blocks has the volume is less than Or equal to the preset value and a front view boundary of the main block, and having a depth value smaller than a depth value of the main block, the volume is less than or equal to one of the preset values. The three-dimensional connected block is set as a front end block.

The method of claim 7, wherein each of the blocks is a three-dimensional connected block, and an initial target position of the known target block is A manual circle selection method or a face recognition method is used, and the target object is a human body.