JPWO2020184066A1

JPWO2020184066A1 - Joint position estimation device, joint position estimation method and program

Info

Publication number: JPWO2020184066A1
Application number: JP2021505611A
Authority: JP
Inventors: 夏美鈴木
Original assignee: NEC Solution Innovators Ltd
Current assignee: NEC Solution Innovators Ltd
Priority date: 2019-03-13
Filing date: 2020-02-14
Publication date: 2021-11-11
Anticipated expiration: 2040-02-14
Also published as: WO2020184066A1; JP7120690B2; CN113330482A

Abstract

関節位置判定装置１は、画像データを取得する画像データ取得部２と、取得した画像データから人体部位を推定する推定部３と、推定部３が推定した人体部位の領域、および、人体部位の領域以外の領域それぞれに対して、一または複数の画素にマーキングするマーキング部４と、マーキング部４によるマーキング後の画像データに対して、分水嶺アルゴリズムを適用して、画像データ内の領域を複数に分割する分割部５と、分割部５が分割した領域のうち、関節に隣接する人体部位に対応する２つの領域から、関節位置を計算する計算部６と、を備える。The joint position determination device 1 includes an image data acquisition unit 2 for acquiring image data, an estimation unit 3 for estimating a human body part from the acquired image data, a region of the human body part estimated by the estimation unit 3, and a human body part. For each area other than the area, the marking unit 4 that marks one or more pixels and the image data after marking by the marking unit 4 are applied with the water diversion ridge algorithm to make a plurality of areas in the image data. It includes a division unit 5 to be divided, and a calculation unit 6 to calculate a joint position from two areas corresponding to human body parts adjacent to the joint among the areas divided by the division unit 5.

Description

本発明は、画像データから関節位置を推定する関節位置推定装置および関節位置推定方法に関し、さらには、これらを実現するためのプログラムを記憶したコンピュータ読み取り可能な記録媒体に関する。 The present invention relates to a joint position estimation device and a joint position estimation method for estimating a joint position from image data, and further relates to a computer-readable recording medium in which a program for realizing these is stored.

従来、人物を撮像装置で撮像し、その人物の姿勢、人物が行う動作（ジェスチャなど）を推定する様々な技術が提案されている。例えば、非特許文献１には、人物の関節位置を推定して、人物の姿勢を推定する技術が開示されている。非特許文献１では、深度画像から人物の人体部位を推定し、その推定結果を用いて関節位置推定器により関節位置を推定している。 Conventionally, various techniques have been proposed in which a person is imaged with an imaging device and the posture of the person and the movement (gesture, etc.) performed by the person are estimated. For example, Non-Patent Document 1 discloses a technique of estimating the joint position of a person and estimating the posture of the person. In Non-Patent Document 1, a human body part of a person is estimated from a depth image, and the joint position is estimated by a joint position estimator using the estimation result.

高橋龍平、“特別研究報告書ＣＮＮによる深度画像を用いた視点不変な人物姿勢推定”、平成３０年２月８日、インターネット＜URL:http://www.mm.media.kyoto-u.ac.jp/wp-content/uploads/2018/02/2017-b-takahashi.pdf>Ryuhei Takahashi, "Special Research Report CNN's Estimating Human Posture with Invariant Perspective Using Depth Images", February 8, 2018, Internet <URL: http: //www.mm.media.kyoto-u.ac .jp / wp-content / uploads / 2018/02/2017-b-takahashi.pdf>

しかしながら、非特許文献１では、人体部位の推定を精度よく行わないと、関節位置の推定精度が低下するという問題点があった。 However, Non-Patent Document 1 has a problem that the estimation accuracy of the joint position is lowered unless the human body part is estimated accurately.

本発明の目的の一例は、関節位置を精度よく推定できる、関節位置推定装置、関節位置推定方法およびコンピュータ読み取り可能な記録媒体を提供することにある。 An example of an object of the present invention is to provide a joint position estimation device, a joint position estimation method, and a computer-readable recording medium capable of accurately estimating a joint position.

上記目的を達成するため、本発明の一側面における関節位置推定装置は、
画像データを取得する画像データ取得部と、
前記画像データ取得部が取得した画像データから人体部位を推定する推定部と、
前記推定部が複数の人体部位を推定した場合、前記推定部が推定した人体部位の領域、および、前記人体部位の領域以外の領域それぞれに対して、一または複数の画素にマーキングするマーキング部と、
前記マーキング部によるマーキング後の画像データに対して、分水嶺アルゴリズムを適用して、前記画像データ内の領域を複数に分割する分割部と、
前記分割部が分割した領域のうち、関節に隣接する人体部位に対応する２つの領域から、関節位置を計算する計算部と、
を備える。In order to achieve the above object, the joint position estimation device in one aspect of the present invention is used.
The image data acquisition unit that acquires image data and
An estimation unit that estimates a human body part from the image data acquired by the image data acquisition unit, and an estimation unit.
When the estimation unit estimates a plurality of human body parts, a marking unit that marks one or a plurality of pixels for each of the region of the human body part estimated by the estimation unit and the region other than the region of the human body part. ,
A watershed algorithm is applied to the image data after marking by the marking unit to divide the area in the image data into a plurality of divided units.
A calculation unit that calculates the joint position from two regions corresponding to the human body part adjacent to the joint among the regions divided by the division portion.
To be equipped.

また、上記目的を達成するため、本発明の一側面における関節位置推定方法は、
画像データを取得するステップと、
前記画像データに対して、人体部位を推定するステップと、
前記画像データに対して複数の人体部位を推定した場合、推定した人体部位の領域、および、前記人体部位の領域以外の領域それぞれに対して、一または複数の画素にマーキングするステップと、
マーキング後の画像データに対して、分水嶺アルゴリズムを適用して、前記画像データ内の領域を複数に分割するステップと、
分割した領域のうち、関節に隣接する人体部位に対応する２つの領域から、関節位置を計算するステップと、
を備える。Further, in order to achieve the above object, the joint position estimation method in one aspect of the present invention is:
Steps to get image data and
A step of estimating a human body part based on the image data,
When a plurality of human body parts are estimated from the image data, one or a plurality of pixels are marked for each of the estimated human body part region and the region other than the human body part region.
A step of applying a watershed algorithm to the image data after marking to divide the area in the image data into a plurality of areas, and
A step of calculating the joint position from two regions corresponding to the human body part adjacent to the joint among the divided regions, and
To be equipped.

更に、上記目的を達成するため、本発明の一側面におけるコンピュータ読み取り可能な記録媒体は、画像データを取得するコンピュータに、
前記画像データに対して、人体部位を推定するステップと、
複数の人体部位を推定した場合、推定した人体部位の領域、および、前記人体部位の領域以外の領域それぞれに対して、一または複数の画素にマーキングするステップと、
マーキング後の画像データに対して、分水嶺アルゴリズムを適用して、前記画像データ内の領域を複数に分割するステップと、
分割した領域のうち、関節に隣接する人体部位に対応する２つの領域から、関節位置を計算するステップと、
を実行させる命令を含む、プログラムを記録している。Further, in order to achieve the above object, the computer-readable recording medium in one aspect of the present invention is used in a computer for acquiring image data.
A step of estimating a human body part based on the image data,
When a plurality of human body parts are estimated, one or a plurality of pixels are marked for each of the estimated human body part region and the region other than the human body region.
A step of applying a watershed algorithm to the image data after marking to divide the area in the image data into a plurality of areas, and
A step of calculating the joint position from two regions corresponding to the human body part adjacent to the joint among the divided regions, and
The program is recorded, including the instruction to execute.

以上のように本発明によれば、関節位置を精度よく推定できる。 As described above, according to the present invention, the joint position can be estimated accurately.

図１は、実施形態１における関節位置推定装置の概略構成を示すブロック図である。FIG. 1 is a block diagram showing a schematic configuration of the joint position estimation device according to the first embodiment. 図２は、関節位置推定装置の構成を具体的に示すブロック図である。FIG. 2 is a block diagram specifically showing the configuration of the joint position estimation device. 図３は、画像データ一例を示す図である。FIG. 3 is a diagram showing an example of image data. 図４は、特徴ベクトル計算部が求めた特徴量を示す図である。FIG. 4 is a diagram showing a feature amount obtained by the feature vector calculation unit. 図５は、図４に示す推定部による推定結果に対して設定した領域を示す図である。FIG. 5 is a diagram showing a region set for the estimation result by the estimation unit shown in FIG. 図６は、マーキング部によるマーキング結果を示す図である。FIG. 6 is a diagram showing a marking result by the marking unit. 図７は、分割部による領域分割結果を示す図である。FIG. 7 is a diagram showing the result of region division by the division portion. 図８は、計算部による計算方法を説明するための図である。FIG. 8 is a diagram for explaining a calculation method by the calculation unit. 図９は、関節位置推定装置による動作を示すフロー図である。FIG. 9 is a flow chart showing the operation of the joint position estimation device. 図１０は、実施形態における関節位置推定装置を実現するコンピュータの一例を示すブロック図である。FIG. 10 is a block diagram showing an example of a computer that realizes the joint position estimation device according to the embodiment.

以下、本発明における一実施形態の関節位置推定装置および関節位置推定方法について、図１〜図１０を参照しながら説明する。 Hereinafter, the joint position estimation device and the joint position estimation method according to the embodiment of the present invention will be described with reference to FIGS. 1 to 10.

［装置構成］
図１は、実施形態１における関節位置推定装置１の概略構成を示すブロック図である。[Device configuration]
FIG. 1 is a block diagram showing a schematic configuration of the joint position estimation device 1 according to the first embodiment.

関節位置推定装置１は、画像データから、人体の部位（例えば手のひらと前腕など）を推定して、推定した人体部位から関節の位置を計算する装置である。この関節位置推定装置１は、画像データ取得部２、推定部３、マーキング部４、分割部５、および計算部６を備えている。 The joint position estimation device 1 is a device that estimates parts of the human body (for example, palm and forearm) from image data and calculates the position of joints from the estimated human body parts. The joint position estimation device 1 includes an image data acquisition unit 2, an estimation unit 3, a marking unit 4, a division unit 5, and a calculation unit 6.

画像データ取得部２は画像データを取得する。 The image data acquisition unit 2 acquires image data.

推定部３は、画像データ取得部２が取得した画像データから人体部位を推定する。 The estimation unit 3 estimates a human body part from the image data acquired by the image data acquisition unit 2.

マーキング部４は、推定部３が複数の人体部位を推定した場合、推定部３が推定した人体部位の領域、および、その人体部位の領域以外の領域それぞれに対して、一または複数の画素にマーキングする。 When the estimation unit 3 estimates a plurality of human body parts, the marking unit 4 has one or a plurality of pixels for each of the region of the human body part estimated by the estimation unit 3 and the region other than the region of the human body part. Mark.

分割部５は、マーキング部４によるマーキング後の画像データに対して、分水嶺アルゴリズムを適用して、画像データ内の領域を複数に分割する。 The dividing unit 5 applies the watershed algorithm to the image data after marking by the marking unit 4 to divide the area in the image data into a plurality of areas.

計算部６は、分割部５が分割した領域のうち、関節に隣接する人体部位に対応する２つの領域から、関節位置を計算する。 The calculation unit 6 calculates the joint position from two regions corresponding to the human body portion adjacent to the joint among the regions divided by the division unit 5.

このように、本実施形態においては、人体部位の推定した結果に対してマーキングをし、分水嶺アルゴリズムを適用している。このため、マーキングを精度よく行うことができ、その結果を入力として、分水嶺アルゴリズムを適用することで、画像データの領域分割を精度よく行える。その結果、関節に隣接する人体部位の領域も精度よく推定できるため、信頼度の高い、関節（手首）の位置の計算結果が得られる。 As described above, in the present embodiment, the estimation result of the human body part is marked and the watershed algorithm is applied. Therefore, the marking can be performed with high accuracy, and the area division of the image data can be performed with high accuracy by applying the watershed algorithm with the result as an input. As a result, the region of the human body adjacent to the joint can be estimated accurately, and a highly reliable calculation result of the joint (wrist) position can be obtained.

続いて、図２を参照して、本実施形態における関節位置推定装置１の構成をより具体的に説明する。図２は、関節位置推定装置１の構成を具体的に示すブロック図である。 Subsequently, with reference to FIG. 2, the configuration of the joint position estimation device 1 in the present embodiment will be described more specifically. FIG. 2 is a block diagram specifically showing the configuration of the joint position estimation device 1.

関節位置推定装置１は、上記した画像データ取得部２、推定部３、マーキング部４、分割部５、および計算部６に加え、領域設定部７を備えている。 The joint position estimation device 1 includes an area setting unit 7 in addition to the image data acquisition unit 2, the estimation unit 3, the marking unit 4, the division unit 5, and the calculation unit 6 described above.

画像データ取得部２は画像データ５０を取得する。画像データ取得部２は、有線または無線ＬＡＮ（Local Area Network）等のネットワークを介して、画像データ５０を取得してもよいし、近距離無線通信などにより画像データ５０を取得してもよい。また、関節位置推定装置１には撮像装置が直接接続され、または、関節位置推定装置１は撮像装置を備え、画像データ取得部２は、その撮像装置から画像データ５０を取得してもよい。 The image data acquisition unit 2 acquires the image data 50. The image data acquisition unit 2 may acquire the image data 50 via a network such as a wired or wireless LAN (Local Area Network), or may acquire the image data 50 by short-range wireless communication or the like. Further, an image pickup device may be directly connected to the joint position estimation device 1, or the joint position estimation device 1 may include an image pickup device, and the image data acquisition unit 2 may acquire image data 50 from the image pickup device.

図３は、画像データ５０の一例を示す図である。画像データ５０は、人体の前腕から指にかけて撮像された画像である。本実施形態における画像データ５０は、画素毎の深度が付加された画像データ（以下、ＴＯＦ画像と言う）である。ＴＯＦ画像は、撮像装置から撮像対象までの距離を示す情報（深度情報）が各画素に対応づけられた画像である。なお、画像データ５０は、ＴＯＦ画像に限定されるものではなく、２次元画像であってもよい。 FIG. 3 is a diagram showing an example of image data 50. The image data 50 is an image captured from the forearm to the finger of the human body. The image data 50 in this embodiment is image data to which a depth for each pixel is added (hereinafter, referred to as a TOF image). The TOF image is an image in which information (depth information) indicating the distance from the imaging device to the imaging target is associated with each pixel. The image data 50 is not limited to the TOF image, and may be a two-dimensional image.

また、画像データ５０は、人物の全体像が撮像された画像であってもよい。この場合、例えば画像データ取得部２が、人物の全体像が撮像された画像から関節を含む部位（例えば、手全体）を推定して、その部位の画像データを抽出するようにしてもよい。そして、以降の処理は、抽出した画像データに対して行われるようにしてもよい。 Further, the image data 50 may be an image in which the entire image of the person is captured. In this case, for example, the image data acquisition unit 2 may estimate a portion including a joint (for example, the entire hand) from an image in which the entire image of the person is captured, and extract image data of that portion. Then, the subsequent processing may be performed on the extracted image data.

推定部３は、画像データ５０に写っている人体部位の種別（前腕、指など）を推定する。推定部３は、特徴ベクトル計算部３１と、推定実行部３２とを有している。 The estimation unit 3 estimates the type (forearm, finger, etc.) of the human body part shown in the image data 50. The estimation unit 3 has a feature vector calculation unit 31 and an estimation execution unit 32.

特徴ベクトル計算部３１は、画像データ５０の画素ごとに特徴量（特徴ベクトル）を求める。図４は、特徴ベクトル計算部３１が求めた特徴量を示す図である。図４に示すように、特徴ベクトル計算部３１は、各特徴量５１、５２、５３、５４を求める。推定実行部３２は、ニューラルネットワークなどによる機械学習を行い、その学習モデル６０を用いて、特徴ベクトル計算部３１が求めた特徴量から、画像データ５０に写っている物体の種別を推定する。図４の場合、推定実行部３２は、特徴量５１は前腕の特徴量であり、特徴量５２は手のひらの特徴量であり、特徴量５３は指の特徴量であると推定する。また、推定実行部３２は、特徴量５４は、人体部位以外であり、背景に写った物体の特徴量であると推定する。なお、図４では、特徴ベクトル計算部３１が求めた特徴量の理解のために、図３で示す人体部位を破線で示している。 The feature vector calculation unit 31 obtains a feature amount (feature vector) for each pixel of the image data 50. FIG. 4 is a diagram showing a feature amount obtained by the feature vector calculation unit 31. As shown in FIG. 4, the feature vector calculation unit 31 obtains each feature amount 51, 52, 53, 54. The estimation execution unit 32 performs machine learning by a neural network or the like, and uses the learning model 60 to estimate the type of the object shown in the image data 50 from the feature amount obtained by the feature vector calculation unit 31. In the case of FIG. 4, the estimation execution unit 32 estimates that the feature amount 51 is the feature amount of the forearm, the feature amount 52 is the feature amount of the palm, and the feature amount 53 is the feature amount of the finger. Further, the estimation execution unit 32 estimates that the feature amount 54 is a feature amount of an object reflected in the background other than the human body part. In FIG. 4, the human body part shown in FIG. 3 is shown by a broken line in order to understand the feature amount obtained by the feature vector calculation unit 31.

なお、画像データ５０がＴＯＦ画像であることにより、推定部３での人体部位の推定精度は高くなる。例えば、画像データ５０が関節を含む画像である場合、関節が曲げられた姿勢であっても、特徴ベクトル計算部３１は、画素ごとの深度情報を利用することで、特徴量を精度よく求めることができる。その結果、推定実行部３２による人体部位の推定精度は高くなる。 Since the image data 50 is a TOF image, the estimation accuracy of the human body part by the estimation unit 3 is high. For example, when the image data 50 is an image including joints, the feature vector calculation unit 31 can accurately obtain the feature amount by using the depth information for each pixel even if the joints are in a bent posture. Can be done. As a result, the estimation accuracy of the human body part by the estimation execution unit 32 becomes high.

領域設定部７は、推定部３が推定した人体部位のうち、関節に隣接する人体部位である前腕および手のひらを包含する領域を設定する。図５は、図４に示す推定部３による推定結果に対して設定した領域５５を示す図である。図５では、領域５５は、指の特徴量５３を包含しているが、少なくとも、関節（手首）に隣接する前腕の特徴量５１、および手のひらの特徴量５２を包含していればよい。領域５５を設定することで、以降の処理において、不要な領域に対して処理が行われないようにできる。 The area setting unit 7 sets a region including the forearm and the palm, which are human body parts adjacent to the joints, among the human body parts estimated by the estimation unit 3. FIG. 5 is a diagram showing a region 55 set for the estimation result by the estimation unit 3 shown in FIG. In FIG. 5, the region 55 includes the finger feature 53, but at least the forearm feature 51 adjacent to the joint (wrist) and the palm feature 52 may be included. By setting the area 55, it is possible to prevent the unnecessary area from being processed in the subsequent processing.

マーキング部４は、領域設定部７が設定した領域５５内の一または複数の画素をマーキングする。図６は、マーキング部４によるマーキング結果を示す図である。マーキング部４は、領域５５内において、関節（手首）に隣接する人体部位（前腕、手のひら）と、人体部位以外の領域（背景）に対して初期マーキングする。図６に示すマーキング５６は、前腕の特徴量５１に対して設定した初期マーキングである。マーキング５７は、手のひらの特徴量５２に対して設定した初期マーキングである。マーキング５８は、背景の領域に対して設定した初期マーキングを示す。 The marking unit 4 marks one or more pixels in the area 55 set by the area setting unit 7. FIG. 6 is a diagram showing a marking result by the marking unit 4. The marking portion 4 initially marks the human body part (forearm, palm) adjacent to the joint (wrist) and the area other than the human body part (background) in the area 55. The marking 56 shown in FIG. 6 is an initial marking set for the feature amount 51 of the forearm. The marking 57 is an initial marking set for the feature amount 52 of the palm. The marking 58 indicates an initial marking set for the background area.

マーキング部４は、前腕の特徴量５１、および、手のひらの特徴量５２に対して初期マーキングする場合、他の領域との境界に近いとノイズが影響するため、ノイズの影響を防ぐために、各特徴量の領域の中心部をマーキングする。また、マーキング部４は、背景の領域に対して初期マーキングする場合、マーキング５６、５７より一定距離範囲外の位置にマーキングする。なお、マーキング部４は、一画素のみをマーキングしてもよいが、後述する分水嶺アルゴリズムにおいて、ＴＯＦ画像の深度値の変化が滑らかではない場合、マーキングした領域が広がらないことがある。したがって、マーキング部４は、複数の画素をマーキングすることが好ましい。 When the marking portion 4 initially marks the feature amount 51 of the forearm and the feature amount 52 of the palm, noise affects when it is close to the boundary with other regions. Therefore, in order to prevent the influence of noise, each feature is used. Mark the center of the area of quantity. Further, when the marking portion 4 initially marks the background area, the marking portion 4 marks a position outside a certain distance range from the markings 56 and 57. The marking unit 4 may mark only one pixel, but in the watershed algorithm described later, if the change in the depth value of the TOF image is not smooth, the marked area may not be widened. Therefore, it is preferable that the marking unit 4 marks a plurality of pixels.

分割部５は、マーキング部４が設定したマーキング５６、５７、５８を入力とする分水嶺アルゴリズムを適用して、領域設定部７が設定した領域５５を複数の領域に分割する。分水嶺アルゴリズムでは、マーキング５６、５７、５８を、画素値（深度値）の極小値（または極大値）として、領域を広げつつ画素値の極大値（または極小値）となる点を求めていく。その極大値となる点を境界として、画素値（深度値）が近い領域を一つの領域とする。図７は、分割部５による領域分割結果を示す図である。図７では、領域５５は、前腕領域５９Ａと、手のひら領域５９Ｂと、背景領域５９Ｃとに分割されている。 The dividing unit 5 applies a watershed algorithm that inputs markings 56, 57, and 58 set by the marking unit 4 to divide the area 55 set by the area setting unit 7 into a plurality of areas. In the watershed algorithm, the markings 56, 57, and 58 are set as the minimum value (or maximum value) of the pixel value (depth value), and the point where the maximum value (or minimum value) of the pixel value is obtained while expanding the area is obtained. A region where the pixel values (depth values) are close to each other is defined as one region with the point at which the maximum value is the boundary as a boundary. FIG. 7 is a diagram showing the result of region division by the division unit 5. In FIG. 7, the region 55 is divided into a forearm region 59A, a palm region 59B, and a background region 59C.

なお、分水嶺アルゴリズムでは、必ずしもマーキング数と同じ数の領域に分割されるわけではなく、図７において、４つ以上の領域に分割される場合もある。また、分割部５は、深度値を用いてマーキングした領域を広げているが、画像データ５０の各画素の情報であれば、輝度値などを用いてもよい。 The watershed algorithm is not necessarily divided into the same number of regions as the number of markings, and may be divided into four or more regions in FIG. 7. Further, although the division portion 5 expands the marked area using the depth value, the luminance value or the like may be used as long as it is the information of each pixel of the image data 50.

計算部６は、分割部５が分割した領域のうち、人体部位に対応する２つの領域５９Ａ、５９Ｂから、関節位置を計算する。図８は、計算部６による計算方法を説明するための図である。計算部６は、領域５９Ａ、５９Ｂそれぞれに対してラベリング処理を行い、各領域の輪郭を取得する。図８では、輪郭は概略的に示している。図８に示すように、領域５９Ａと、領域５９Ｂとが離れている場合、計算部６は、２つの領域５９Ａ、５９Ｂにおいて、互いに最も近くなる座標位置５９Ａ１、５９Ｂ１を求める。計算部６は、座標位置５９Ａ１、５９Ｂ１を繋ぐ最短直線Ｌを包含する領域５９Ｄを計算して、関節位置とする。なお、２つの領域５９Ａ、５９Ｂの輪郭が重なる場合、計算部６は、重なり部分を関節位置としてもよい。 The calculation unit 6 calculates the joint position from the two regions 59A and 59B corresponding to the human body part among the regions divided by the division unit 5. FIG. 8 is a diagram for explaining a calculation method by the calculation unit 6. The calculation unit 6 performs labeling processing on each of the areas 59A and 59B, and acquires the contour of each area. In FIG. 8, the contour is shown schematically. As shown in FIG. 8, when the area 59A and the area 59B are separated from each other, the calculation unit 6 obtains the coordinate positions 59A1 and 59B1 closest to each other in the two areas 59A and 59B. The calculation unit 6 calculates a region 59D including the shortest straight line L connecting the coordinate positions 59A1 and 59B1 to obtain the joint position. When the contours of the two regions 59A and 59B overlap, the calculation unit 6 may use the overlapping portion as the joint position.

［動作説明］
次に、本実施形態における関節位置判定装置の動作について図９を用いて説明する。図９は、関節位置推定装置１による動作を示すフロー図である。以下の説明においては、適宜図１〜図８を参酌する。また、本実施形態では、関節位置推定装置を動作させることによって、関節位置推定法が実施される。よって、本実施形態における関節位置推定方法の説明は、以下の関節位置推定装置の動作説明に代える。[Operation description]
Next, the operation of the joint position determination device in the present embodiment will be described with reference to FIG. FIG. 9 is a flow chart showing the operation of the joint position estimation device 1. In the following description, FIGS. 1 to 8 will be referred to as appropriate. Further, in the present embodiment, the joint position estimation method is carried out by operating the joint position estimation device. Therefore, the description of the joint position estimation method in the present embodiment is replaced with the following operation description of the joint position estimation device.

まず、前提として、撮像装置で関節を含む人体部位が撮像され、その画像データが関節位置推定装置１に入力されるものとする。 First, as a premise, it is assumed that a human body part including a joint is imaged by an imaging device and the image data is input to the joint position estimation device 1.

上述の前提において、画像データ取得部２は入力された画像データを取得する（Ｓ１）。次に、推定部３の特徴ベクトル計算部３１は、画像データ５０の画素ごとに特徴量５１〜５４（図４参照）を求める（Ｓ２）。そして、推定部３の推定実行部３２は、特徴ベクトル計算部３１が求めた特徴量から、その特徴量をもつ物体の種別を認識し、画像データ５０内における人体部位を推定する（Ｓ３）。 Under the above premise, the image data acquisition unit 2 acquires the input image data (S1). Next, the feature vector calculation unit 31 of the estimation unit 3 obtains the feature amounts 51 to 54 (see FIG. 4) for each pixel of the image data 50 (S2). Then, the estimation execution unit 32 of the estimation unit 3 recognizes the type of the object having the feature amount from the feature amount obtained by the feature vector calculation unit 31, and estimates the human body part in the image data 50 (S3).

続いて、領域設定部７は、位置推定対象の関節に隣接する、推定部３が推定した人体部位を包含する領域５５（図５参照）を設定する（Ｓ４）。マーキング部４は、領域設定部７が設定した領域５５内において、初期マーキングする（Ｓ５）。Ｓ５では、マーキング部４は、位置推定対象の関節（手首）に隣接する前腕のマーキング５６（図６参照）、および、手のひらのマーキング５７と、人体部位以外の領域（背景）のマーキング５８とを設定する。 Subsequently, the region setting unit 7 sets a region 55 (see FIG. 5) adjacent to the joint whose position is to be estimated and which includes the human body portion estimated by the estimation unit 3 (S4). The marking unit 4 initially marks in the area 55 set by the area setting unit 7 (S5). In S5, the marking portion 4 provides a marking 56 for the forearm (see FIG. 6) adjacent to the joint (wrist) to be estimated for position, a marking 57 for the palm, and a marking 58 for an area (background) other than the human body part. Set.

分割部５は、マーキング５６、５７、５８を入力とする分水嶺アルゴリズムを適用して、Ｓ４で設定した領域５５を複数の領域５９Ａ、５９Ｂ、５９Ｃ（図７参照）に分割する（Ｓ６）。そして、計算部６は、分割部５が分割した領域のうち、人体部位に対応する２つの領域５９Ａ、５９Ｂから、関節位置を計算する（Ｓ７）。 The dividing unit 5 applies a watershed algorithm that inputs markings 56, 57, and 58 to divide the region 55 set in S4 into a plurality of regions 59A, 59B, and 59C (see FIG. 7) (S6). Then, the calculation unit 6 calculates the joint position from the two regions 59A and 59B corresponding to the human body portion among the regions divided by the division unit 5 (S7).

［本実施形態による効果］
以上のように本実施形態によれば、画像データ５０から人体部位を推定し、推定結果に対して初期マーキングしている。このため、画像データ５０において、マーキングする対象領域、例えば前腕または手のひらの形状が、複雑な形状であっても、初期マーキングを精度よく行える。その初期マーキングを入力として、分水嶺アルゴリズムを利用することで、関節に隣接する部位である、前腕および手のひらの２つの領域の推定精度は高くなる。前腕または手のひらの一方の領域が明確に判断できないと、２つの領域の間にある関節（手首）の位置を精度よく計算できないが、２つの領域を精度よく推定することで、信頼度の高い関節（手首）の位置の計算結果を得ることができる。[Effect of this embodiment]
As described above, according to the present embodiment, the human body part is estimated from the image data 50, and the estimation result is initially marked. Therefore, in the image data 50, even if the target area to be marked, for example, the shape of the forearm or the palm is a complicated shape, the initial marking can be performed with high accuracy. By using the watershed algorithm with the initial marking as an input, the estimation accuracy of the two regions adjacent to the joint, the forearm and the palm, is improved. If one area of the forearm or palm cannot be clearly determined, the position of the joint (wrist) between the two areas cannot be calculated accurately, but by estimating the two areas accurately, a highly reliable joint can be calculated. The calculation result of the position of (wrist) can be obtained.

また、関節、例えば手首の位置が決まると、手首の先にある指の位置を推定しやすくなり、また、手の動作、手の姿勢も推定しやすくなる。その結果、手を動かしたジェスチャーを認識する装置などに適用することができる。 Further, once the position of the joint, for example, the wrist, is determined, it becomes easy to estimate the position of the finger at the tip of the wrist, and it becomes easy to estimate the movement of the hand and the posture of the hand. As a result, it can be applied to a device that recognizes a gesture of moving a hand.

［プログラム］
本発明の実施形態におけるプログラムは、コンピュータに、図９に示すステップＳ１〜Ｓ７を実行させるプログラムであれば良い。このプログラムをコンピュータにインストールし、実行することによって、本実施形態における関節位置推定装置と関節位置推定方法とを実現することができる。この場合、コンピュータのプロセッサは、画像データ取得部２、推定部３、マーキング部４、分割部５および計算部６として機能し、処理を行なう。[program]
The program according to the embodiment of the present invention may be any program that causes a computer to execute steps S1 to S7 shown in FIG. By installing this program on a computer and executing it, the joint position estimation device and the joint position estimation method according to the present embodiment can be realized. In this case, the computer processor functions as an image data acquisition unit 2, an estimation unit 3, a marking unit 4, a division unit 5, and a calculation unit 6 to perform processing.

また、本実施形態では、画像データ取得部２、推定部３、マーキング部４、分割部５および計算部６は、コンピュータに備えられたハードディスク等の記憶装置に、これらを構成するデータファイルを格納することによって、またはこのデータファイルが格納された記録媒体をコンピュータと接続された読取装置に搭載することによって実現されている。 Further, in the present embodiment, the image data acquisition unit 2, the estimation unit 3, the marking unit 4, the division unit 5, and the calculation unit 6 store the data files constituting them in a storage device such as a hard disk provided in the computer. This is achieved by doing so or by mounting the recording medium in which this data file is stored in a reader connected to a computer.

また、本実施形態におけるプログラムは、複数のコンピュータによって構築されたコンピュータシステムによって実行されても良い。この場合は、例えば、各コンピュータが、それぞれ、画像データ取得部２、推定部３、マーキング部４、分割部５および計算部６のいずれかとして機能しても良い。 Moreover, the program in this embodiment may be executed by a computer system constructed by a plurality of computers. In this case, for example, each computer may function as any of the image data acquisition unit 2, the estimation unit 3, the marking unit 4, the division unit 5, and the calculation unit 6, respectively.

［物理構成］
ここで、本実施形態におけるプログラムを実行することによって、関節位置推定装置１を実現するコンピュータについて図１０を用いて説明する。図１０は、本実施形態における関節位置推定装置１を実現するコンピュータの一例を示すブロック図である。[Physical configuration]
Here, a computer that realizes the joint position estimation device 1 by executing the program in the present embodiment will be described with reference to FIG. FIG. 10 is a block diagram showing an example of a computer that realizes the joint position estimation device 1 according to the present embodiment.

図１０に示すように、コンピュータ１１０は、ＣＰＵ１１１と、メインメモリ１１２と、記憶装置１１３と、入力インターフェイス１１４と、表示コントローラ１１５と、データリーダ／ライタ１１６と、通信インターフェイス１１７とを備える。これらの各部は、バス１２１を介して、互いにデータ通信可能に接続される。なお、コンピュータ１１０は、ＣＰＵ１１１に加えて、またはＣＰＵ１１１に代えて、ＧＰＵ（Graphics Processing Unit）、又はＦＰＧＡ（Field-Programmable Gate Array）を備えていても良い。 As shown in FIG. 10, the computer 110 includes a CPU 111, a main memory 112, a storage device 113, an input interface 114, a display controller 115, a data reader / writer 116, and a communication interface 117. Each of these parts is connected to each other via a bus 121 so as to be capable of data communication. The computer 110 may include a GPU (Graphics Processing Unit) or an FPGA (Field-Programmable Gate Array) in addition to the CPU 111 or in place of the CPU 111.

ＣＰＵ１１１は、記憶装置１１３に格納された、本実施形態におけるプログラム（コード）をメインメモリ１１２に展開し、これらを所定順序で実行することにより、各種の演算を実施する。メインメモリ１１２は、典型的には、ＤＲＡＭ（Dynamic Random Access Memory）等の揮発性の記憶装置である。また、本実施形態におけるプログラムは、コンピュータ読み取り可能な記録媒体１２０に格納された状態で提供される。なお、本実施形態におけるプログラムは、通信インターフェイス１１７を介して接続されたインターネット上で流通するものであっても良い。 The CPU 111 expands the programs (codes) of the present embodiment stored in the storage device 113 into the main memory 112 and executes them in a predetermined order to perform various operations. The main memory 112 is typically a volatile storage device such as a DRAM (Dynamic Random Access Memory). Further, the program in the present embodiment is provided in a state of being stored in a computer-readable recording medium 120. The program in the present embodiment may be distributed on the Internet connected via the communication interface 117.

また、記憶装置１１３の具体例としては、ハードディスクドライブの他、フラッシュメモリ等の半導体記憶装置が挙げられる。入力インターフェイス１１４は、ＣＰＵ１１１と、キーボード及びマウスといった入力機器１１８との間のデータ伝送を仲介する。表示コントローラ１１５は、ディスプレイ装置１１９と接続され、ディスプレイ装置１１９での表示を制御する。 Further, specific examples of the storage device 113 include a semiconductor storage device such as a flash memory in addition to a hard disk drive. The input interface 114 mediates data transmission between the CPU 111 and an input device 118 such as a keyboard and mouse. The display controller 115 is connected to the display device 119 and controls the display on the display device 119.

データリーダ／ライタ１１６は、ＣＰＵ１１１と記録媒体１２０との間のデータ伝送を仲介し、記録媒体１２０からのプログラムの読み出し、及びコンピュータ１１０における処理結果の記録媒体１２０への書き込みを実行する。通信インターフェイス１１７は、ＣＰＵ１１１と、他のコンピュータとの間のデータ伝送を仲介する。 The data reader / writer 116 mediates data transmission between the CPU 111 and the recording medium 120, reads a program from the recording medium 120, and writes a processing result in the computer 110 to the recording medium 120. The communication interface 117 mediates data transmission between the CPU 111 and another computer.

また、記録媒体１２０の具体例としては、ＣＦ（Compact Flash（登録商標））及びＳＤ（Secure Digital）等の汎用的な半導体記憶デバイス、フレキシブルディスク（Flexible Disk）等の磁気記録媒体、またはＣＤ−ＲＯＭ（Compact Disk Read Only Memory）などの光学記録媒体が挙げられる。 Specific examples of the recording medium 120 include a general-purpose semiconductor storage device such as CF (Compact Flash (registered trademark)) and SD (Secure Digital), a magnetic recording medium such as a flexible disk, or a CD-. Examples include optical recording media such as ROM (Compact Disk Read Only Memory).

なお、本実施形態における関節位置推定装置１は、プログラムがインストールされたコンピュータではなく、各部に対応したハードウェアを用いることによっても実現可能である。更に、関節位置推定装置１は、一部がプログラムで実現され、残りの部分がハードウェアで実現されていてもよい。 The joint position estimation device 1 in the present embodiment can also be realized by using hardware corresponding to each part instead of the computer in which the program is installed. Further, the joint position estimation device 1 may be partially realized by a program and the rest may be realized by hardware.

上述した実施形態の一部又は全部は、以下に記載する（付記１）〜（付記１５）によって表現することができるが、以下の記載に限定されるものではない。 A part or all of the above-described embodiments can be expressed by the following descriptions (Appendix 1) to (Appendix 15), but the present invention is not limited to the following description.

（付記１）
画像データを取得する画像データ取得部と、
前記画像データ取得部が取得した画像データから人体部位を推定する推定部と、
前記推定部が複数の人体部位を推定した場合、前記推定部が推定した人体部位の領域、および、前記人体部位の領域以外の領域それぞれに対して、一または複数の画素にマーキングするマーキング部と、
前記マーキング部によるマーキング後の画像データに対して、分水嶺アルゴリズムを適用して、前記画像データ内の領域を複数に分割する分割部と、
前記分割部が分割した領域のうち、関節に隣接する人体部位に対応する２つの領域から、関節位置を計算する計算部と、
を備える、関節位置推定装置。(Appendix 1)
The image data acquisition unit that acquires image data and
An estimation unit that estimates a human body part from the image data acquired by the image data acquisition unit, and an estimation unit.
When the estimation unit estimates a plurality of human body parts, a marking unit that marks one or a plurality of pixels for each of the region of the human body part estimated by the estimation unit and the region other than the region of the human body part. ,
A watershed algorithm is applied to the image data after marking by the marking unit to divide the area in the image data into a plurality of divided units.
A calculation unit that calculates the joint position from two regions corresponding to the human body part adjacent to the joint among the regions divided by the division portion.
A joint position estimation device.

（付記２）
付記１に記載の関節位置推定装置であって、
関節に隣接する、前記推定部が推定した人体部位を包含する領域を設定する領域設定部、
をさらに備え、
前記マーキング部は、前記領域設定部が設定した領域内でマーキングし、
前記分割部は、前記領域設定部が設定した領域を複数に分割する、
関節位置推定装置。(Appendix 2)
The joint position estimation device according to Appendix 1.
An area setting unit that sets an area adjacent to a joint that includes a human body part estimated by the estimation unit,
With more
The marking unit marks within the area set by the area setting unit.
The division unit divides the area set by the area setting unit into a plurality of areas.
Joint position estimation device.

（付記３）
付記１または付記２に記載の関節位置推定装置であって、
前記画像データは、画素毎の深度が付加された画像データである、
関節位置推定装置。(Appendix 3)
The joint position estimation device according to Appendix 1 or Appendix 2.
The image data is image data to which a depth for each pixel is added.
Joint position estimation device.

（付記４）
付記１から付記３までのいずれか一つに記載の関節位置推定装置であって、
前記分割部は、前記マーキング部がマーキングした画素と、画素値が前記画素の所定範囲内である画素とを同じ領域として、領域を複数に分割する、
関節位置推定装置。(Appendix 4)
The joint position estimation device according to any one of Supplementary note 1 to Supplementary note 3.
The dividing portion divides the region into a plurality of regions, with the pixels marked by the marking portion and the pixels whose pixel values are within a predetermined range of the pixels as the same region.
Joint position estimation device.

（付記５）
付記１から付記４までのいずれか一つに記載の関節位置推定装置であって、
前記計算部は、前記２つの領域間が最小距離となる前記２つの領域それぞれにおける位置を基準として、関節位置を計算する、
関節位置推定装置。(Appendix 5)
The joint position estimation device according to any one of Supplementary note 1 to Supplementary note 4.
The calculation unit calculates the joint position with reference to the position in each of the two regions where the minimum distance between the two regions is the minimum distance.
Joint position estimation device.

（付記６）
画像データを取得するステップと、
取得した画像データに対して、人体部位を推定するステップと、
複数の人体部位を推定した場合、推定した人体部位の領域、および、前記人体部位の領域以外の領域それぞれに対して、一または複数の画素にマーキングするステップと、
マーキング後の画像データに対して、分水嶺アルゴリズムを適用して、前記画像データ内の領域を複数に分割するステップと、
分割した領域のうち、関節に隣接する人体部位に対応する２つの領域から、関節位置を計算するステップと、
を備える、関節位置推定方法。(Appendix 6)
Steps to get image data and
Steps to estimate the human body part from the acquired image data,
When a plurality of human body parts are estimated, one or a plurality of pixels are marked for each of the estimated human body part region and the region other than the human body region.
A step of applying a watershed algorithm to the image data after marking to divide the area in the image data into a plurality of areas, and
A step of calculating the joint position from two regions corresponding to the human body part adjacent to the joint among the divided regions, and
A method for estimating joint position.

（付記７）
付記６に記載の関節位置推定方法であって、
関節に隣接する、推定した人体部位を包含する領域を設定するステップ、
をさらに備え、
前記一または複数の画素にマーキングするステップでは、設定した領域内でマーキングし、
前記画像データ内の領域を複数に分割するステップでは、設定した領域を複数に分割する、
関節位置推定方法。(Appendix 7)
The joint position estimation method described in Appendix 6
Steps to set the area adjacent to the joint and containing the estimated human body part,
With more
In the step of marking one or more pixels, marking is performed within the set area.
In the step of dividing the area in the image data into a plurality of areas, the set area is divided into a plurality of areas.
Joint position estimation method.

（付記８）
付記６または付記７に記載の関節位置推定方法であって、
前記画像データは、画素毎の深度が付加された画像データである、
関節位置推定方法。(Appendix 8)
The joint position estimation method according to Appendix 6 or Appendix 7, wherein the joint position is estimated.
The image data is image data to which a depth for each pixel is added.
Joint position estimation method.

（付記９）
付記６から付記８までのいずれか一つに記載の関節位置推定方法であって、
前記画像データ内の領域を複数に分割するステップでは、マーキングした画素と、画素値が前記画素の所定範囲内である画素とを同じ領域として、領域を複数に分割する、
関節位置推定方法。(Appendix 9)
The joint position estimation method according to any one of Supplementary note 6 to Supplementary note 8.
In the step of dividing the area in the image data into a plurality of areas, the marked pixel and the pixel whose pixel value is within a predetermined range of the pixel are set as the same area, and the area is divided into a plurality of areas.
Joint position estimation method.

（付記１０）
付記６から付記９までのいずれか一つに記載の関節位置推定方法であって、
前記関節位置を計算するステップでは、前記２つの領域間が最小距離となる前記２つの領域それぞれにおける位置を基準として、関節位置を計算する、
関節位置推定方法。(Appendix 10)
The joint position estimation method according to any one of Supplementary note 6 to Supplementary note 9.
In the step of calculating the joint position, the joint position is calculated with reference to the position in each of the two regions where the distance between the two regions is the minimum distance.
Joint position estimation method.

（付記１１）
画像データを取得するコンピュータに、
前記画像データに対して、人体部位を推定するステップと、
複数の人体部位を推定した場合、推定した人体部位の領域、および、前記人体部位の領域以外の領域それぞれに対して、一または複数の画素にマーキングするステップと、
マーキング後の画像データに対して、分水嶺アルゴリズムを適用して、前記画像データ内の領域を複数に分割するステップと、
分割した領域のうち、関節に隣接する人体部位に対応する２つの領域から、関節位置を計算するステップと、
を実行させる命令を含む、プログラムを記録したコンピュータ読み取り可能な記録媒体。(Appendix 11)
On the computer that acquires the image data,
A step of estimating a human body part based on the image data,
When a plurality of human body parts are estimated, one or a plurality of pixels are marked for each of the estimated human body part region and the region other than the human body region.
A step of applying a watershed algorithm to the image data after marking to divide the area in the image data into a plurality of areas, and
A step of calculating the joint position from two regions corresponding to the human body part adjacent to the joint among the divided regions, and
A computer-readable recording medium on which a program is recorded, including instructions to execute.

（付記１２）
付記１１に記載のコンピュータ読み取り可能な記録媒体であって、
前記プログラムは、
前記コンピュータに、
関節に隣接する、推定された人体部位を包含する領域を設定するステップ、
を実行させる命令をさらに含み、
前記前記一または複数の画素にマーキングするステップでは、設定された領域内でマーキングし、
前記画像データ内の領域を複数に分割するステップでは、設定された領域を複数に分割する、
コンピュータ読み取り可能な記録媒体。(Appendix 12)
The computer-readable recording medium according to Appendix 11, which is a computer-readable recording medium.
The program
On the computer
Steps to set the area adjacent to the joint and containing the estimated human body part,
Including further instructions to execute
In the step of marking one or more pixels, marking is performed within a set area.
In the step of dividing the area in the image data into a plurality of areas, the set area is divided into a plurality of areas.
A computer-readable recording medium.

（付記１３）
付記１１または付記１２に記載のコンピュータ読み取り可能な記録媒体であって、
前記画像データは、画素毎の深度が付加された画像データである、
コンピュータ読み取り可能な記録媒体。(Appendix 13)
The computer-readable recording medium according to Appendix 11 or Appendix 12, wherein the recording medium is readable.
The image data is image data to which a depth for each pixel is added.
A computer-readable recording medium.

（付記１４）
付記１１から付記１３までのいずれか一つに記載のコンピュータ読み取り可能な記録媒体であって、
前記画像データ内の領域を複数に分割するステップでは、マーキングされた画素と、画素値が前記画素の所定範囲内である画素とを同じ領域として、領域を複数に分割する、
コンピュータ読み取り可能な記録媒体。(Appendix 14)
The computer-readable recording medium according to any one of Supplementary note 11 to Supplementary note 13.
In the step of dividing the area in the image data into a plurality of areas, the marked pixel and the pixel whose pixel value is within a predetermined range of the pixel are set as the same area, and the area is divided into a plurality of areas.
A computer-readable recording medium.

（付記１５）
付記１１から付記１４までのいずれか一つに記載のコンピュータ読み取り可能な記録媒体であって、
前記関節位置を計算するステップでは、前記２つの領域間が最小距離となる前記２つの領域それぞれにおける位置を基準として、関節位置を計算する、
コンピュータ読み取り可能な記録媒体。(Appendix 15)
The computer-readable recording medium according to any one of Supplementary note 11 to Supplementary note 14.
In the step of calculating the joint position, the joint position is calculated with reference to the position in each of the two regions where the distance between the two regions is the minimum distance.
A computer-readable recording medium.

以上、実施の形態を参照して本願発明を説明したが、本願発明は上記実施の形態に限定されるものではない。本願発明の構成や詳細には、本願発明のスコープ内で当業者が理解し得る様々な変更をすることができる。 Although the present invention has been described above with reference to the embodiments, the present invention is not limited to the above embodiments. Various changes that can be understood by those skilled in the art can be made within the scope of the present invention in terms of the structure and details of the present invention.

この出願は、２０１９年３月１３日に出願された日本出願特願２０１９−４６３９０を基礎とする優先権を主張し、その開示の全てをここに取り込む。 This application claims priority on the basis of Japanese application Japanese Patent Application No. 2019-46390 filed on March 13, 2019, and the entire disclosure thereof is incorporated herein by reference.

本発明は、関節位置を精度よく推定できる。 The present invention can accurately estimate the joint position.

１：関節位置推定装置
２：画像データ取得部
３：推定部
４：マーキング部
５：分割部
６：計算部
７：領域設定部
３１：特徴ベクトル計算部
３２：推定実行部
５０：画像データ
６０：学習モデル
５１、５２、５３、５４：特徴量
５５：領域
５６、５７、５８：マーキング1: Joint position estimation device 2: Image data acquisition unit 3: Estimate unit 4: Marking unit 5: Division unit 6: Calculation unit 7: Area setting unit 31: Feature vector calculation unit 32: Estimation execution unit 50: Image data 60: Learning models 51, 52, 53, 54: Feature 55: Areas 56, 57, 58: Marking

本発明は、画像データから関節位置を推定する関節位置推定装置および関節位置推定方法に関し、さらには、これらを実現するためのプログラムに関する。
The present invention relates to a joint position estimation device and joint position estimating method for estimating a joint position from the image data, and further relates to a program for realizing these.

本発明の目的の一例は、関節位置を精度よく推定できる、関節位置推定装置、関節位置推定方法およびプログラムを提供することにある。
An example of an object of the present invention is to provide a joint position estimation device, a joint position estimation method, and a program capable of accurately estimating a joint position.

更に、上記目的を達成するため、本発明の一側面におけるプログラムは、画像データを取得するコンピュータに、
前記画像データに対して、人体部位を推定するステップと、
複数の人体部位を推定した場合、推定した人体部位の領域、および、前記人体部位の領域以外の領域それぞれに対して、一または複数の画素にマーキングするステップと、
マーキング後の画像データに対して、分水嶺アルゴリズムを適用して、前記画像データ内の領域を複数に分割するステップと、
分割した領域のうち、関節に隣接する人体部位に対応する２つの領域から、関節位置を計算するステップと、
を実行させる命令を含む。
Further, in order to achieve the above object, the program in one aspect of the present invention is applied to a computer that acquires image data.
A step of estimating a human body part based on the image data,
When a plurality of human body parts are estimated, one or a plurality of pixels are marked for each of the estimated human body part region and the region other than the human body region.
A step of applying a watershed algorithm to the image data after marking to divide the area in the image data into a plurality of areas, and
A step of calculating the joint position from two regions corresponding to the human body part adjacent to the joint among the divided regions, and
Including an instruction to the execution.

（付記１１）
画像データを取得するコンピュータに、
前記画像データに対して、人体部位を推定するステップと、
複数の人体部位を推定した場合、推定した人体部位の領域、および、前記人体部位の領域以外の領域それぞれに対して、一または複数の画素にマーキングするステップと、
マーキング後の画像データに対して、分水嶺アルゴリズムを適用して、前記画像データ内の領域を複数に分割するステップと、
分割した領域のうち、関節に隣接する人体部位に対応する２つの領域から、関節位置を計算するステップと、
を実行させる命令を含む、プログラム。
(Appendix 11)
On the computer that acquires the image data,
A step of estimating a human body part based on the image data,
When a plurality of human body parts are estimated, one or a plurality of pixels are marked for each of the estimated human body part region and the region other than the human body region.
A step of applying a watershed algorithm to the image data after marking to divide the area in the image data into a plurality of areas, and
A step of calculating the joint position from two regions corresponding to the human body part adjacent to the joint among the divided regions, and
The including the instruction to be executed, program.

（付記１２）
付記１１に記載のプログラムであって、
前記コンピュータに、
関節に隣接する、推定された人体部位を包含する領域を設定するステップ、
を実行させる命令をさらに含み、
前記前記一または複数の画素にマーキングするステップでは、設定された領域内でマーキングし、
前記画像データ内の領域を複数に分割するステップでは、設定された領域を複数に分割する、
プログラム。
(Appendix 12)
The program described in Appendix 11
Before Symbol computer,
Steps to set the area adjacent to the joint and containing the estimated human body part,
Including further instructions to execute
In the step of marking one or more pixels, marking is performed within a set area.
In the step of dividing the area in the image data into a plurality of areas, the set area is divided into a plurality of areas.
Program .

（付記１３）
付記１１または付記１２に記載のプログラムであって、
前記画像データは、画素毎の深度が付加された画像データである、
プログラム。
(Appendix 13)
The program according to Appendix 11 or Appendix 12, which is described in Appendix 12.
The image data is image data to which a depth for each pixel is added.
Program .

（付記１４）
付記１１から付記１３までのいずれか一つに記載のプログラムであって、
前記画像データ内の領域を複数に分割するステップでは、マーキングされた画素と、画素値が前記画素の所定範囲内である画素とを同じ領域として、領域を複数に分割する、
プログラム。
(Appendix 14)
The program described in any one of Supplementary Note 11 to Supplementary Note 13.
In the step of dividing the area in the image data into a plurality of areas, the marked pixel and the pixel whose pixel value is within a predetermined range of the pixel are set as the same area, and the area is divided into a plurality of areas.
Program .

（付記１５）
付記１１から付記１４までのいずれか一つに記載のプログラムであって、
前記関節位置を計算するステップでは、前記２つの領域間が最小距離となる前記２つの領域それぞれにおける位置を基準として、関節位置を計算する、
プログラム。
(Appendix 15)
The program described in any one of Supplementary note 11 to Supplementary note 14.
In the step of calculating the joint position, the joint position is calculated with reference to the position in each of the two regions where the distance between the two regions is the minimum distance.
Program .

Claims

Image data acquisition means for acquiring image data and
An estimation means for estimating a human body part from the image data acquired by the image data acquisition means, and an estimation means.
When the estimation means estimates a plurality of human body parts, the marking means for marking one or a plurality of pixels for each of the region of the human body part estimated by the estimation means and the region other than the region of the human body part. ,
A watershed algorithm is applied to the image data after marking by the marking means to divide the area in the image data into a plurality of parts.
Of the regions divided by the dividing means, a calculation means for calculating the joint position from two regions corresponding to the human body part adjacent to the joint, and
A joint position estimation device.

The joint position estimation device according to claim 1.
A region setting means that sets a region adjacent to a joint and includes a human body part estimated by the estimation means,
With more
The marking means marks within the area set by the area setting means.
The dividing means divides the area set by the area setting means into a plurality of areas.
Joint position estimation device.

The joint position estimation device according to claim 1 or 2.
The image data is image data to which a depth for each pixel is added.
Joint position estimation device.

The joint position estimation device according to any one of claims 1 to 3.
The dividing means divides a region into a plurality of regions with the pixels marked by the marking means and the pixels whose pixel values are within a predetermined range of the pixels as the same region.
Joint position estimation device.

The joint position estimation device according to any one of claims 1 to 4.
The calculation means calculates the joint position with reference to the position in each of the two regions where the minimum distance between the two regions is the minimum distance.
Joint position estimation device.

Get image data,
The human body part is estimated from the image data, and
When a plurality of human body parts are estimated with respect to the image data, one or a plurality of pixels are marked for each of the estimated human body part region and the region other than the human body part region.
A watershed algorithm is applied to the image data after marking to divide the area in the image data into a plurality of areas.
The joint position is calculated from two regions corresponding to the human body part adjacent to the joint among the plurality of divided regions.
Joint position estimation method.

The joint position estimation method according to claim 6.
Of the human body parts estimated from the image data, a region including the human body part adjacent to the joint is set.
When marking the one or more pixels, mark within the area including the human body part adjacent to the joint.
A joint position estimation method for dividing a region including a human body part adjacent to a joint into a plurality of regions when the region in the image data is divided into a plurality of regions.

The joint position estimation method according to claim 6 or 7.
The image data is image data to which a depth for each pixel is added.
Joint position estimation method.

The joint position estimation method according to any one of claims 6 to 8.
When the area in the image data is divided into a plurality of areas, the area is divided into a plurality of areas with the marked one or a plurality of pixels and the pixels whose pixel values are within a predetermined range of the pixels as the same area.
Joint position estimation method.

The joint position estimation method according to any one of claims 6 to 9.
When calculating the joint position, the joint position is calculated based on the position in each of the two regions where the minimum distance between the two regions is the minimum distance.
Joint position estimation method.

On the computer that acquires the image data,
The human body part is estimated from the image data, and
When a plurality of human body parts are estimated, one or a plurality of pixels are marked for each of the estimated human body part region and the region other than the human body region.
A watershed algorithm is applied to the image data after the marking to divide the area in the image data into a plurality of areas.
Of the divided regions, the joint position is calculated from the two regions corresponding to the human body parts adjacent to the joints.
A computer-readable recording medium that records a program that executes an instruction.

The computer-readable recording medium according to claim 11.
The program
On the computer
Have the area set up adjacent to the joint and containing the estimated human body part,
Including more instructions
When marking one or more pixels, mark within the set area.
When the area in the image data is divided into a plurality of areas, the set area is divided into a plurality of areas.
A computer-readable recording medium.

The computer-readable recording medium according to claim 11 or 12.
The image data is image data to which a depth for each pixel is added.
A computer-readable recording medium.

The computer-readable recording medium according to any one of claims 11 to 13.
When the area in the image data is divided into a plurality of areas, the marked pixel and the pixel whose pixel value is within a predetermined range of the pixel are set as the same area, and the area is divided into a plurality of areas.
A computer-readable recording medium.

The computer-readable recording medium according to any one of claims 11 to 14.
When calculating the joint position, the joint position is calculated based on the position in each of the two regions where the minimum distance between the two regions is the minimum distance.
A computer-readable recording medium.