KR20220004009A

KR20220004009A - Key point detection method, apparatus, electronic device and storage medium

Info

Publication number: KR20220004009A
Application number: KR1020217021260A
Authority: KR
Inventors: 솅 진; 웬타오 리우; 첸 치안
Original assignee: 베이징 센스타임 테크놀로지 디벨롭먼트 컴퍼니 리미티드
Priority date: 2020-06-30
Filing date: 2021-02-03
Publication date: 2022-01-11
Also published as: CN111783882A; TW202203211A; JP2022542199A; TWI774271B; WO2022001106A1; CN111783882B

Abstract

본 발명은 키 포인트 검출 방법, 장치, 전자 기기 및 저장 매체를 제공하고, 상기 방법은, 검출될 이미지를 획득하는 단계; 상기 검출될 이미지에 기반하여, 이미지 특징맵 및 복수 개의 키 포인트 히트맵을 생성하는 단계 - 상기 이미지 특징맵은 상기 검출될 이미지 중 각 목표 대상 사이의 상대적 위치 관계를 나타내기 위한 것이고, 각 상기 키 포인트 히트맵은 상기 검출될 이미지의 하나의 카테고리의 키 포인트를 포함하며, 상이한 카테고리의 키 포인트는 상기 목표 대상의 상이한 부위에 대응됨 - ; 상기 이미지 특징맵 및 복수 개의 상기 키 포인트 히트맵에 기반하여, 키 포인트 이미지 모델을 생성하는 단계 - 상기 키 포인트 이미지 모델은 상기 검출될 이미지 중 상이한 카테고리의 키 포인트의 정보 및 연결 가장자리의 정보를 포함하고, 각 연결 가장자리는 두 개의 상이한 카테고리의 키 포인트 사이의 가장자리임 - ; 및 상기 키 포인트 이미지 모델에 기반하여, 동일한 목표 대상에 속하는 각 키 포인트를 결정하는 단계를 포함한다.The present invention provides a key point detection method, apparatus, electronic device and storage medium, the method comprising the steps of: acquiring an image to be detected; generating an image feature map and a plurality of key point heat maps based on the image to be detected, wherein the image feature map is for indicating a relative positional relationship between each target object in the image to be detected, each of the key points the point heat map includes key points of one category of the image to be detected, and key points of different categories correspond to different parts of the target object; generating a key point image model based on the image feature map and the plurality of key point heat maps, wherein the key point image model includes information on key points of different categories among the images to be detected and information on connection edges , and each connecting edge is an edge between key points of two different categories; and determining each key point belonging to the same target object based on the key point image model.

Description

Key point detection method, apparatus, electronic device and storage medium

본 발명은 출원 번호가 202010622132.3이고, 출원일이 2020년 6월 30일인 중국 특허 출원에 기반하여 제출하였고, 상기 중국 특허 출원의 우선권을 주장하는 바, 상기 중국 특허 출원의 모든 내용은 참조로서 본 출원에 인용된다.The present invention is filed on the basis of a Chinese patent application with an application number of 202010622132.3 and an application date of June 30, 2020, and claims the priority of the Chinese patent application, all contents of the Chinese patent application are incorporated herein by reference. are cited

본 발명은 이미지 검출 기술 분야에 관한 것으로, 특히 키 포인트 검출 방법, 장치, 전자 기기 및 저장 매체에 관한 것이다.The present invention relates to the field of image detection technology, and more particularly to a key point detection method, apparatus, electronic device and storage medium.

인체 키 포인트 검출은 비디오에서 사람의 행동을 분석하여 상위 계층 정보의 지원을 제공하는 것이고, 비디오 인체 동작 인식 및 인간-기계 인터페이스를 구현하는 기반이다. 최근, 딥 뉴럴 네트워크에 기반한 인체 키 포인트 검출 방법은, 입력 이미지가 간단하고 획득하기 쉬우며, 검출 효과가 효율적이고 정확하므로, 연구의 화제가 되었다.Human body key point detection is to provide support for higher layer information by analyzing human behavior in video, and is the basis for implementing video human motion recognition and human-machine interface. Recently, a human body key point detection method based on a deep neural network has become a topic of research because the input image is simple and easy to acquire, and the detection effect is efficient and accurate.

일반적으로, 딥 뉴럴 네트워크에 기반한 인체 키 포인트 검출 방법에 있어서, 키 포인트의 검출 및 키 포인트의 클러스터링은 두 개의 독립적인 단계이고, 즉 키 포인트의 클러스터링은 일반적으로 미분할 수 없는 후처리 작업이다. 그러나, 이러한 방식으로, 키 포인트 클러스터링 과정의 정확성이 비교적 낮고, 즉 클러스터링 오류가 발생할 수 있으며, 서로 다른 사람의 키 포인트를 동일한 사람의 키 포인트로 하여 함께 클러스터링되어, 검출 결과에 오류가 발생하게 된다.In general, in a human key point detection method based on a deep neural network, the key point detection and the key point clustering are two independent steps, that is, the key point clustering is generally an indifferentiable post-processing operation. However, in this way, the accuracy of the key point clustering process is relatively low, that is, clustering errors may occur, and different people's key points are clustered together with the same person's key points, resulting in an error in the detection result. .

이 점을 고려하여, 본 발명의 실시예는 적어도 키 포인트 검출 방법, 장치, 전자 기기 및 저장 매체를 제공한다.In view of this, embodiments of the present invention provide at least a key point detection method, an apparatus, an electronic device, and a storage medium.

제1 측면에 있어서, 본 발명의 실시예는 키 포인트 검출 방법을 제공하고,In a first aspect, an embodiment of the present invention provides a key point detection method,

검출될 이미지를 획득하는 단계;acquiring an image to be detected;

상기 검출될 이미지에 기반하여, 이미지 특징맵 및 복수 개의 키 포인트 히트맵을 생성하는 단계 - 상기 이미지 특징맵은 상기 검출될 이미지 중 각 목표 대상 사이의 상대적 위치 관계를 나타내기 위한 것이고, 각 상기 키 포인트 히트맵은 상기 검출될 이미지의 하나의 카테고리의 키 포인트를 포함하며, 상이한 카테고리의 키 포인트는 상기 목표 대상의 상이한 부위에 대응됨 - ;generating an image feature map and a plurality of key point heat maps based on the image to be detected, wherein the image feature map is for indicating a relative positional relationship between each target object in the image to be detected, each of the key points the point heat map includes key points of one category of the image to be detected, and key points of different categories correspond to different parts of the target object;

상기 이미지 특징맵 및 복수 개의 상기 키 포인트 히트맵에 기반하여, 키 포인트 이미지 모델을 생성하는 단계 - 상기 키 포인트 이미지 모델은 상기 검출될 이미지 중 상이한 카테고리의 키 포인트의 정보 및 연결 가장자리의 정보를 포함하고, 각 연결 가장자리는 두 개의 상이한 카테고리의 키 포인트 사이의 가장자리임 - ; 및generating a key point image model based on the image feature map and the plurality of key point heat maps, wherein the key point image model includes information on key points of different categories among the images to be detected and information on connection edges , and each connecting edge is an edge between key points of two different categories; and

상기 키 포인트 이미지 모델에 기반하여, 동일한 목표 대상에 속하는 각 키 포인트를 결정하는 단계를 포함한다.and determining, based on the key point image model, each key point belonging to the same target object.

상기 방법을 사용하여, 생성한 이미지 특징맵 및 복수 개의 키 포인트 히트맵에 기반하여, 검출될 이미지에 대응되는 키 포인트 이미지 모델을 생성할 수 있고, 키 포인트 이미지 모델은 이미지 특징맵 및 키 포인트 히트맵에서의 정보를 포함하고, 이미지 특징맵은 검출될 이미지 중 상이한 목표 대상 사이의 상대적 위치 관계를 나타낼 수 있으므로, 키 포인트 이미지 모델에 기반하여, 상이한 목표 대상의 키 포인트에 대해 비교적 정확하게 구분하여, 키 포인트 클러스터링의 정확성을 향상시킬 수 있다.Using the above method, based on the generated image feature map and the plurality of key point heat maps, a key point image model corresponding to the image to be detected may be generated, wherein the key point image model includes an image feature map and a key point hit map. Since it contains information in the map, and the image feature map can indicate the relative positional relationship between different target objects among the images to be detected, based on the key point image model, the key points of different target objects are distinguished relatively accurately, It can improve the accuracy of key point clustering.

가능한 실시형태에 있어서, 상기 키 포인트 이미지 모델에 기반하여, 동일한 목표 대상에 속하는 각 키 포인트를 결정하는 단계는,In a possible embodiment, based on the key point image model, determining each key point belonging to the same target object comprises:

상기 키 포인트 이미지 모델 중 각 키 포인트의 정보 및 상기 연결 가장자리의 정보에 기반하여, 상기 키 포인트 이미지 모델에서 연결 관계가 있는 두 개의 키 포인트 사이의 상관관계를 결정하는 단계; 및determining a correlation between two key points having a connection relationship in the key point image model based on information on each key point in the key point image model and information on the connection edge; and

결정된 상기 상관관계에 기반하여, 동일한 목표 대상에 속하는 각 키 포인트를 결정하는 단계를 포함한다.and determining each key point belonging to the same target object based on the determined correlation.

가능한 실시형태에 있어서, 결정된 상기 상관관계에 기반하여, 동일한 목표 대상에 속하는 각 키 포인트를 결정하는 단계는, 상기 기설정된 임계값보다 큰 대응되는 상기 상관관계를 갖는 각 키 포인트를 동일한 목표 대상의 키 포인트로 취하는 단계를 포함한다.In a possible embodiment, the determining, based on the determined correlation, each key point belonging to the same target object comprises: assigning each key point having the corresponding correlation greater than the predetermined threshold value to the same target object. and taking as key points.

가능한 실시형태에 있어서, 상기 키 포인트 이미지 모델 중 각 키 포인트의 정보 및 상기 연결 가장자리의 정보에 기반하여, 상기 키 포인트 이미지 모델에서 연결 관계가 있는 두 개의 키 포인트 사이의 상관관계를 결정하는 단계는, 각 키 포인트에 대해, 상기 키 포인트의 정보, 및 상기 키 포인트 이미지 모델 중 상기 키 포인트와 연결 관계가 있는 다른 키 포인트의 정보에 기반하여, 상기 키 포인트의 융합 특징을 결정하는 단계; 및 각 키 포인트에 각각 대응하는 융합 특징에 기반하여, 상기 키 포인트 이미지 모델에서 연결 관계가 있는 두 개의 키 포인트 사이의 상관관계를 결정하는 단계를 포함한다.In a possible embodiment, the determining of a correlation between two key points having a connection relationship in the key point image model based on the information of each key point in the key point image model and the information of the connection edge includes: , for each key point, determining a fusion characteristic of the key point based on information on the key point and information on other key points having a connection relationship with the key point in the key point image model; and determining a correlation between two key points having a connection relationship in the key point image model, based on the fusion feature respectively corresponding to each key point.

가능한 실시형태에 있어서, 상기 키 포인트의 정보는 위치 정보, 카테고리 정보, 및 픽셀 특징 정보를 포함하고; 상기 키 포인트 히트맵에 기반하여, 각 키 포인트의 위치 정보를 결정하는 단계; 및 각 상기 키 포인트의 위치 정보에 기반하여, 상기 이미지 특징맵에서 상기 키 포인트의 픽셀 특징 정보를 추출하고, 상기 키 포인트가 속하는 키 포인트 히트맵의 카테고리 라벨에 기반하여, 상기 키 포인트에 대응되는 카테고리 정보를 결정하는 단계에 따라 상기 키 포인트 이미지 모델 중 각 키 포인트의 정보를 결정한다.In a possible embodiment, the information of the key point includes location information, category information, and pixel characteristic information; determining location information of each key point based on the key point heat map; and extracting pixel feature information of the key point from the image feature map based on the location information of each key point, and based on the category label of the key point heat map to which the key point belongs, corresponding to the key point According to the step of determining the category information, information of each key point in the key point image model is determined.

가능한 실시형태에 있어서, 각 키 포인트에 대응되는 상기 카테고리 정보에 기반하여, 각 키 포인트를 상기 키 포인트가 속하는 카테고리와 상이한 다른 키 포인트와 연결하고, 상기 키 포인트 이미지 모델에서의 연결 가장자리를 형성하는 단계에 따라 상기 키 포인트 이미지 모델에서의 각 연결 가장자리를 생성한다.In a possible embodiment, based on the category information corresponding to each key point, each key point is associated with another key point different from the category to which the key point belongs, and a connection edge in the key point image model is formed. Create each connecting edge in the key point image model according to the steps.

가능한 실시형태에 있어서, 각 키 포인트에 대응되는 상기 카테고리, 및 기설정된 상이한 카테고리 사이의 매칭 관계에 기반하여, 각 키 포인트를 상기 키 포인트가 속하는 카테고리와 매칭되는 목표 카테고리에 대응되는 키 포인트와 연결하여, 상기 키 포인트 이미지 모델에서의 연결 가장자리를 형성하는 단계에 따라 상기 키 포인트 이미지 모델에서의 각 연결 가장자리를 생성한다.In a possible embodiment, each key point is associated with a key point corresponding to a target category matching a category to which the key point belongs, based on a matching relationship between the category corresponding to each key point and a predetermined different category Thus, each connection edge in the key point image model is generated according to the step of forming a connection edge in the key point image model.

가능한 실시형태에 있어서, 상기 키 포인트 이미지 모델에 기반하여, 동일한 목표 대상에 속하는 각 키 포인트를 결정한 후, 각 목표 대상에 대응되는 각 키 포인트의 정보에 기반하여, 상기 목표 대상의 행동 타입을 결정하는 단계를 더 포함한다.In a possible embodiment, after determining each key point belonging to the same target target based on the key point image model, the behavior type of the target target is determined based on information of each key point corresponding to each target target further comprising the step of

가능한 실시형태에 있어서, 상기 키 포인트 이미지 모델에 기반하여, 동일한 목표 대상에 속하는 각 키 포인트를 결정한 후, 각 목표 대상에 대응되는 각 키 포인트의 정보에 기반하여, 상기 목표 대상에 대한 특수 효과 정보를 생성하는 단계를 더 포함한다.In a possible embodiment, after determining each key point belonging to the same target object based on the key point image model, based on the information of each key point corresponding to each target object, special effect information for the target object It further comprises the step of generating

아래의 장치, 전자 기기 등의 효과 설명은 상기 방법의 설명을 참조하면 되고, 여기서 더이상 반복하여 설명하지 않는다.The description of the effects of the device, electronic device, etc. below may refer to the description of the method, and will not be repeated any longer.

제2 측면에 있어서, 본 발명의 실시예는 키 포인트 검출 장치를 제공하고,In a second aspect, an embodiment of the present invention provides a key point detection device,

검출될 이미지를 획득하도록 구성된 획득 모듈;an acquiring module configured to acquire an image to be detected;

상기 검출될 이미지에 기반하여, 이미지 특징맵 및 복수 개의 키 포인트 히트맵을 생성하도록 구성된 제1 생성 모듈 - 상기 이미지 특징맵은 상기 검출될 이미지 중 각 목표 대상 사이의 상대적 위치 관계를 나타내기 위한 것이고, 각 상기 키 포인트 히트맵은 상기 검출될 이미지의 하나의 카테고리의 키 포인트를 포함하며, 상이한 카테고리의 키 포인트는 상기 목표 대상의 상이한 부위에 대응됨 - ;a first generating module, configured to generate, based on the image to be detected, an image feature map and a plurality of key point heat maps, wherein the image feature map is for indicating a relative positional relationship between each target object in the image to be detected , each of the key point heatmaps includes key points of one category of the image to be detected, and key points of different categories correspond to different parts of the target object;

상기 이미지 특징맵 및 복수 개의 상기 키 포인트 히트맵에 기반하여, 키 포인트 이미지 모델을 생성하도록 구성된 제2 생성 모듈 - 상기 키 포인트 이미지 모델은 상기 검출될 이미지 중 상이한 카테고리의 키 포인트의 정보 및 연결 가장자리의 정보를 포함하고, 각 연결 가장자리는 두 개의 상이한 카테고리의 키 포인트 사이의 가장자리임 - ; 및a second generating module, configured to generate a key point image model based on the image feature map and the plurality of key point heat maps, wherein the key point image model includes information and connecting edges of key points of different categories among the to-be-detected images contains information from, each connecting edge being an edge between key points of two different categories; and

상기 키 포인트 이미지 모델에 기반하여, 동일한 목표 대상에 속하는 각 키 포인트를 결정하도록 구성된 결정 모듈을 포함한다.and a determining module, configured to determine, based on the key point image model, each key point belonging to the same target object.

제3 측면에 있어서, 본 발명의 실시예는 전자 기기를 제공하고, 상기 전자 기기는, 프로세서, 메모리 및 버스를 포함하며, 상기 메모리에는 상기 프로세서가 실행 가능한 기계 판독 가능한 명령어가 저장되어 있으며, 전자 기기가 작동될 경우, 상기 프로세서와 상기 메모리 사이는 버스를 통해 통신하고, 상기 기계 판독 가능한 명령어가 상기 프로세서에 의해 실행될 경우 상기 제1 측면 또는 어느 하나의 실시형태에 따른 키 포인트 검출 방법의 단계를 실행한다.In a third aspect, an embodiment of the present invention provides an electronic device, the electronic device comprising a processor, a memory and a bus, wherein the memory stores machine-readable instructions executable by the processor; When a device is operated, communication between the processor and the memory is via a bus, and when the machine readable instructions are executed by the processor, the steps of the key point detection method according to the first aspect or any one of the embodiments run

제4 측면에 있어서, 본 발명의 실시예는 컴퓨터 판독 가능한 저장 매체를 제공하고, 상기 컴퓨터 판독 가능한 저장 매체에는 컴퓨터 프로그램이 저장되어 있으며, 상기 컴퓨터 프로그램이 프로세서에 의해 작동될 경우 상기 제1 측면 또는 어느 하나의 실시형태에 따른 키 포인트 검출 방법의 단계를 실행한다.In a fourth aspect, an embodiment of the present invention provides a computer-readable storage medium, wherein a computer program is stored in the computer-readable storage medium, and when the computer program is operated by a processor, the first aspect or Execute the steps of the key point detection method according to any one of the embodiments.

본 발명의 실시예는 컴퓨터 판독 가능 코드를 포함한 컴퓨터 프로그램 제품을 더 제공하며, 상기 컴퓨터 판독 가능 코드가 전자 기기에서 작동될 때, 상기 전자 기기에서의 프로세서는 상기 하나 또는 복수 개의 실시예에서 서버에 의해 실행되는 상기 방법을 구현하기 위한 것이다.An embodiment of the present invention further provides a computer program product comprising computer readable code, wherein when the computer readable code is operated in an electronic device, a processor in the electronic device is configured to provide a server in the one or more embodiments to implement the method executed by

본 발명의 상기 목적, 특징 및 장점을 보다 이해하기 쉽게 하기 위해, 아래에 첨부된 도면과 함께 바람직한 실시예를 상세히 설명한다.In order to make it easier to understand the above objects, features and advantages of the present invention, preferred embodiments will be described in detail below in conjunction with the accompanying drawings.

본 발명의 실시예의 기술방안을 더욱 선명하게 설명하기 위해, 아래에 실시예에서 사용되어야 하는 도면을 간단하게 설명하고, 여기서 도면은 명세서의 일부분으로서 명세서 전체를 구성하며, 이러한 도면은 본 발명에 부합되는 실시예를 도시하며, 명세서와 함께 본 발명의 기술방안을 설명하기 위한 것이다. 이해해야 할 것은, 아래의 도면은 본 발명의 일부 실시예만 도시할 뿐이므로, 범위에 대한 한정으로 간주되어서는 안되며, 본 분야의 기술자는 창조성 노동을 부여하지 않는 전제 하에서도, 이러한 도면에 따라 다른 연관된 도면을 획득할 수 있다.
도 1은 본 발명의 실시예에서 제공한 키포인트 검출 방법의 흐름 예시도이다.
도 2a는 본 발명의 실시예에서 제공한 키 포인트 이미지 모델의 예시도이다.
도 2b는 본 발명의 실시예에서 제공한 키 포인트 검출 방법의 시스템 블록도이다.
도 2c는 본 발명의 실시예에서 제공한 키 포인트 검출 방법의 응용 시나리오도이다.
도 3은 본 발명의 실시예에서 제공한 키 포인트 검출 장치의 아키텍처 예시도이다.
도 4는 본 발명의 실시예에서 제공한 전자 기기(400)의 구조 예시도이다.In order to more clearly explain the technical solutions of the embodiments of the present invention, the drawings to be used in the embodiments are briefly described below, wherein the drawings are a part of the specification and constitute the entire specification, and these drawings are consistent with the present invention. It shows an embodiment to be used, and is for explaining the technical solution of the present invention together with the specification. It is to be understood that the drawings below only show some embodiments of the present invention, and therefore should not be regarded as limiting in scope, and those skilled in the art may differ according to these drawings, even under the premise that creative labor is not imparted Associated drawings can be obtained.
1 is a flowchart illustrating a keypoint detection method provided in an embodiment of the present invention.
2A is an exemplary diagram of a key point image model provided in an embodiment of the present invention.
2B is a system block diagram of a key point detection method provided in an embodiment of the present invention.
Figure 2c is an application scenario diagram of the key point detection method provided in the embodiment of the present invention.
3 is an exemplary architecture diagram of a key point detection apparatus provided in an embodiment of the present invention.
4 is an exemplary structural diagram of an electronic device 400 provided in an embodiment of the present invention.

본 발명의 실시예의 목적, 기술적 해결수단 및 장점을 더욱 명확하게 하기 위해, 아래에, 본 발명의 실시예에서의 도면과 결합하여, 본 발명의 실시예에서의 기술적 해결수단에 대해 명확하고 완전하게 설명하며, 설명된 실시예는 단지 본 발명의 전부 실시예가 아닌 일부 실시예라는 것은 명백하다. 일반적으로 이곳의 도면에서 설명되고 도시된 본 발명의 실시예의 컴포넌트는 다양하고 상이한 구성으로 배치되고 설계될 수 있다. 따라서, 아래에 도면에서 제공한 본 발명의 실시예의 상세한 설명은 본 발명의 보호 범위를 한정하려는 것이 아닌 본 발명의 선택된 실시예를 나타내기 위한 것이다. 본 발명의 실시예에 기반하여, 당업자가 창조성 노동 없이 얻은 다른 실시예는 전부 본 발명의 보호 범위에 속해야 한다.In order to make the objects, technical solutions and advantages of the embodiments of the present invention more clear, below, in combination with the drawings in the embodiments of the present invention, the technical solutions in the embodiments of the present invention are clearly and completely While describing, it is clear that the described embodiments are merely some, but not all, embodiments of the present invention. In general, the components of the embodiments of the invention described and illustrated in the drawings herein may be arranged and designed in a variety of different configurations. Accordingly, the detailed description of the embodiments of the present invention provided in the drawings below is not intended to limit the protection scope of the present invention, but to represent selected embodiments of the present invention. Based on the embodiment of the present invention, all other embodiments obtained by those skilled in the art without creative labor should fall within the protection scope of the present invention.

일반적으로, 딥 뉴럴 네트워크를 사용하여 이미지를 검출하여, 이미지에 포함된 목표 대상의 키 포인트 정보를 결정할 수 있으며, 여기서, 딥 뉴럴 네트워크에 기반한 키 포인트 검출 방법은, 키 포인트의 검출 및 키 포인트의 클러스터링을 포함한다.In general, by using a deep neural network to detect an image, it is possible to determine key point information of a target object included in the image, wherein the key point detection method based on the deep neural network includes the detection of the key point and the Includes clustering.

현재, 키 포인트의 검출 및 키 포인트의 클러스터링은 두 개의 독립적인 단계이고, 즉 키 포인트의 클러스터링은 일반적으로 미분할 수 없는 후처리 작업이다. 그러나, 이러한 방식을 사용하는 경우, 키 포인트 클러스터링 과정의 정확성이 비교적 낮고, 즉 클러스터링 오류가 발생할 수 있으며, 상이한 사람의 키 포인트를 동일한 사람의 키 포인트로 취하여 함께 클러스터링함으로써, 검출 결과에 오류가 발생하게 된다.Currently, the detection of key points and the clustering of key points are two independent steps, that is, the clustering of key points is generally a non-differentiable post-processing operation. However, when this method is used, the accuracy of the key point clustering process is relatively low, that is, clustering errors may occur. will do

상기 문제를 해결하기 위해, 본 발명의 실시예는 키 포인트 검출 방법을 제공한다.In order to solve the above problem, an embodiment of the present invention provides a key point detection method.

본 발명의 실시예를 용이하게 이해하기 위해, 먼저 본 발명의 실시예에서 개시되는 키 포인트 검출 방법에 대해 상세히 설명하고자 한다.In order to easily understand the embodiment of the present invention, first, a key point detection method disclosed in the embodiment of the present invention will be described in detail.

본 발명의 실시예에서 제공하는 키 포인트 검출 방법의 실행 주체는 서버일 수 있고, 상기 서버는 로컬 서버이거나, 클라우드 서버일 수 있으며; 또는, 상기 방법의 실행 주체는 단말 기기일 수 있고, 예를 들어, 상기 단말 기기는 핸드폰, 태블릿 PC, AR 안경 등일 수 있다.The execution subject of the key point detection method provided in the embodiment of the present invention may be a server, and the server may be a local server or a cloud server; Alternatively, the execution subject of the method may be a terminal device, for example, the terminal device may be a mobile phone, a tablet PC, AR glasses, or the like.

도 1은, 본 발명의 실시예에서 제공한 키 포인트 검출 방법의 프로세스 예시도이고, 상기 방법은 단계 S101 내지 단계 S104를 포함한다.1 is a process exemplification diagram of a key point detection method provided in an embodiment of the present invention, and the method includes steps S101 to S104.

단계 S101에 있어서, 검출될 이미지를 획득한다.In step S101, an image to be detected is acquired.

단계 S102에 있어서, 검출될 이미지에 기반하여, 이미지 특징맵 및 복수 개의 키 포인트 히트맵을 생성하고; 이미지 특징맵은 검출될 이미지 중 각 목표 대상 사이의 상대적 위치 관계를 나타내기 위한 것이며; 각 키 포인트 히트맵에는 검출될 이미지의 하나의 카테고리의 키 포인트를 포함하고, 상이한 카테고리의 키 포인트는 목표 대상의 상이한 부위에 대응된다.In step S102, based on the image to be detected, an image feature map and a plurality of key point heat maps are generated; The image feature map is for indicating the relative positional relationship between each target object in the image to be detected; Each key point heat map includes key points of one category of an image to be detected, and key points of different categories correspond to different parts of the target object.

단계 S103에 있어서, 이미지 특징맵 및 복수 개의 키 포인트 히트맵에 기반하여, 키 포인트 이미지 모델을 생성하고; 키 포인트 이미지 모델은 검출될 이미지 중 상이한 카테고리의 키 포인트의 정보 및 연결 가장자리의 정보를 포함하며, 각 연결 가장자리는 두 개의 상이한 카테고리의 키 포인트 사이의 가장자리이다.In step S103, based on the image feature map and the plurality of key point heat maps, a key point image model is generated; The key point image model includes information on key points of different categories and information on connecting edges among images to be detected, and each connecting edge is an edge between key points of two different categories.

단계 S104에 있어서, 키 포인트 이미지 모델에 기반하여, 동일한 목표 대상에 속하는 각 키 포인트를 결정한다.In step S104, each key point belonging to the same target object is determined based on the key point image model.

상기 방법에 있어서, 생성된 이미지 특징맵 및 복수 개의 키 포인트 히트맵에 기반하여, 검출될 이미지에 대응되는 키 포인트 이미지 모델을 생성할 수 있고, 키 포인트 이미지 모델은 이미지 특징맵 및 키 포인트 히트맵에서의 정보를 포함하고, 이미지 특징맵은 검출될 이미지 중 상이한 목표 대상 사이의 상대적 위치 관계를 나타낼 수 있으므로, 키 포인트 이미지 모델에 기반하여, 상이한 목표 대상의 키 포인트를 비교적 정확하게 구분할 수 있으므로, 키 포인트 클러스터링의 정확성을 향상할 수 있다.In the method, based on the generated image feature map and the plurality of key point heat maps, a key point image model corresponding to the image to be detected may be generated, and the key point image model is an image feature map and a key point heat map. Since the image feature map can indicate the relative positional relationship between different target objects in the image to be detected, based on the key point image model, key points of different target objects can be distinguished relatively accurately, so that the key The accuracy of point clustering can be improved.

단계 S101 및 단계 S102에 대해,For steps S101 and S102,

검출될 이미지는 목표 대상을 포함하는 임의의 이미지일 수 있다. 실시 과정에 있어서, 연결되어 있는 저장 장치로부터 검출될 이미지를 획득할 수 있고, 연결되어 있는 촬영 장치로부터 실시간으로 수집된 검출될 이미지를 획득할 수도 있다.The image to be detected may be any image including the target object. In the implementation process, an image to be detected may be acquired from a connected storage device, and an image to be detected may be acquired in real time from a connected photographing device.

실시 과정에 있어서, 획득된 검출될 이미지를 훈련된 키 포인트 검출 뉴럴 네트워크에 입력하여, 이미지 특징맵 및 복수 개의 키 포인트 히트맵을 생성하고; 이미지 특징맵, 복수 개의 키 포인트 히트맵, 및 훈련된 키 포인트 검출 뉴럴 네트워크에 기반하여, 각 목표 대상의 각 키 포인트를 결정할 수 있다.In the implementation process, input the obtained image to be detected into a trained key point detection neural network to generate an image feature map and a plurality of key point heat maps; Each key point of each target may be determined based on the image feature map, the plurality of key point heat maps, and the trained key point detection neural network.

여기서, 각 키 포인트 히트맵은 검출될 이미지의 하나의 카테고리의 키 포인트를 포함하고, 상이한 카테고리의 키 포인트는 목표 대상의 상이한 부위에 대응된다. 예를 들어, 키 포인트의 카테고리는 머리, 목, 손 등일 수 있고, 키 포인트 히트맵은 머리 키 포인트를 포함하는 이미지일 수 있으며, 또는, 키 포인트 히트맵은 목 키 포인트를 포함하는 이미지일 수 있고; 또는, 키 포인트의 카테고리는 설정된 제1 카테고리, 제2 카테고리 등일수 있으며, 여기서, 제1 카테고리 키 포인트는 엄지 손가락의 키 포인트일 수 있고, 제2 카테고리 키 포인트는 검지 손가락의 키 포인트 등일 수 있으며, 키 포인트 히트맵은 제1 카테고리 키 포인트의 이미지를 포함할 수 있고, 또는, 키 포인트 히트맵은 제2 카테고리 키 포인트의 이미지를 포함할 수 있다. 여기서, 키 포인트의 카테고리 및 카테고리의 개수는 실제 수요에 따라 설정될 수 있다. 각 목표 대상에 대응되는 키 포인트의 개수는 실제 수요에 따라 설정될 수 있고, 예를 들어, 각 목표 대상에 대응되는 키 포인트의 개수는 17 개, 105 개 등일 수 있다.Here, each key point heat map includes key points of one category of an image to be detected, and key points of different categories correspond to different parts of the target object. For example, a category of key points may be head, neck, hand, etc., a key point heatmap may be an image containing a head key point, or a key point heat map may be an image containing a neck key point. there is; Alternatively, the category of the key point may be a set first category, a second category, etc., where the first category key point may be a key point of a thumb, and the second category key point may be a key point of an index finger, etc. , the key point heat map may include an image of the first category key point, or the key point heat map may include an image of the second category key point. Here, the category of the key point and the number of categories may be set according to actual demand. The number of key points corresponding to each target may be set according to actual demand, for example, the number of key points corresponding to each target may be 17, 105, or the like.

여기서, 키 포인트 히트맵의 개수는 설정된 키 포인트 카테고리의 개수와 일치하고, 예를 들어, 만약 설정된 키 포인트의 카테고리 개수가 17 개 라면, 검출될 이미지에 기반하여 생성된 키 포인트 히트맵의 개수도 17 개이다. 여기서, 각 카테고리의 키 포인트의 개수는 하나 또는 복수 개일 수 있다.Here, the number of key point heatmaps coincides with the set number of key point categories. For example, if the set number of key point categories is 17, the number of key point heatmaps generated based on the image to be detected is also 17 pieces. Here, the number of key points in each category may be one or plural.

이미지 특징맵의 개수는 하나, 또는 복수 개일 수 있다. 여기서, 이미지 특징맵의 개수가 하나인 경우, 상기 이미지 특징맵은 검출될 이미지 중 각 목표 대상과 각 카테고리의 키 포인트에 대응되는 부위 사이의 상대적 위치 관계를 나타낼 수 있다. 이미지 특징맵의 개수가 복수 개인 경우, 이미지 특징맵의 개수와 키 포인트 히트맵의 개수가 동일할 수 있고, 각 이미지 특징맵은 검출될 이미지 중 각 목표 대상의 하나의 카테고리의 키 포인트에 대응되는 부위 사이의 상대적 위치 관계를 나타낼 수 있다. 여기서, 이미지 특징맵의 사이즈 및 키 포인트 히트맵의 사이즈는 일치하다.The number of image feature maps may be one or plural. Here, when the number of image feature maps is one, the image feature map may indicate a relative positional relationship between each target object and a portion corresponding to a key point of each category among images to be detected. When the number of image feature maps is plural, the number of image feature maps and the number of key point heatmaps may be the same, and each image feature map corresponds to a key point of one category of each target object among images to be detected. It can represent the relative positional relationship between the parts. Here, the size of the image feature map and the size of the key point heat map match.

실시 과정에 있어서, 키 포인트 검출 뉴럴 네트워크에서 상이한 손실 함수를 설정하여, 이미지 특징맵 및 복수 개의 키 포인트 히트맵을 획득할 수 있다.In the implementation process, by setting different loss functions in the key point detection neural network, an image feature map and a plurality of key point heat maps may be obtained.

단계 S103에 대해,For step S103,

여기서, 복수 개의 키 포인트 히트맵 및 이미지 특징맵으로부터 각 키 포인트의 정보를 추출하여 얻을 수 있고, 정보가 포함된 각 키 포인트를 노드로 취하고, 상이한 카테고리의 키 포인트 사이의 가장자리를 연결 가장자리로 하여, 키 포인트 이미지 모델을 구성한다.Here, information of each key point can be extracted and obtained from a plurality of key point heatmaps and image feature maps, and each key point containing information is taken as a node, and edges between key points of different categories are used as connection edges. , to construct the key point image model.

선택 가능한 실시형태에 있어서, 키 포인트의 정보는 위치 정보, 카테고리 정보, 및 픽셀 특징 정보를 포함할 수 있다. 여기서, 키 포인트 히트맵에 기반하여, 각 키 포인트의 위치 정보를 결정하는 단계; 및 각 키 포인트의 위치 정보에 기반하여, 이미지 특징맵에서 키 포인트의 픽셀 특징 정보를 추출하고, 키 포인트가 속하는 키 포인트 히트맵의 카테고리 라벨에 기반하여, 키 포인트에 대응되는 카테고리 정보를 결정하는 단계에 따라 상기 키 포인트 이미지 모델 중 각 키 포인트의 정보를 결정할 수 있다.In a selectable embodiment, the information of the key point may include location information, category information, and pixel characteristic information. Here, based on the key point heat map, determining the location information of each key point; and extracting pixel feature information of a key point from an image feature map based on the location information of each key point, and determining category information corresponding to the key point based on the category label of the key point heat map to which the key point belongs According to the steps, it is possible to determine the information of each key point in the key point image model.

실시 과정에 있어서, 키 포인트 히트맵 중 각 픽셀 포인트의 픽셀 값에 기반하여, 각 키 포인트의 위치 정보를 결정할 수 있다. 예시적으로, 각 키 포인트 히트맵에 대해, 픽셀 값이 최대값인 픽셀 포인트를 선택하여, 하나의 키 포인트로 결정하고, 선택된 상기 픽셀 포인트의 위치 정보를 키 포인트의 위치 정보로 결정할 수 있다. 여기서, 키 포인트 히트맵 중 특정 픽셀 포인트의 픽셀 값이 주변 픽셀 포인트의 픽셀 값보다 클 경우, 상기 픽셀 포인트의 픽셀 값을 최대값으로 간주하고, 상기 픽셀 포인트를 키 포인트로 간주한다.In the implementation process, location information of each key point may be determined based on the pixel value of each pixel point in the key point heat map. For example, for each key point heatmap, a pixel point having a maximum pixel value may be selected and determined as one key point, and location information of the selected pixel point may be determined as location information of the key point. Here, when the pixel value of a specific pixel point in the key point heatmap is greater than the pixel value of a neighboring pixel point, the pixel value of the pixel point is regarded as the maximum value, and the pixel point is regarded as the key point.

각 픽셀 포인트의 위치 정보를 획득한 후, 이미지 특징맵에서 상기 위치 정보에 대응되는 픽셀 포인트의 픽셀 값을 추출하여, 추출된 픽셀 값을 키 포인트의 픽셀 특징 정보로 결정할 수 있다.After obtaining the location information of each pixel point, the pixel value of the pixel point corresponding to the location information may be extracted from the image feature map, and the extracted pixel value may be determined as the pixel feature information of the key point.

또한, 각 키 포인트가 속하는 키 포인트 히트맵의 카테고리 라벨에 기반하여, 키 포인트에 대응되는 카테고리 정보를 결정할 수 있다. 예를 들어, 키 포인트 히트맵 A의 카테고리 라벨이 머리인 경우, 키 포인트 히트맵 A에 포함된 각 키 포인트의 카테고리 정보는 머리 키 포인트이고; 키 포인트 히트맵 B의 카테고리 라벨이 목인 경우, 키 포인트 히트맵 B에 포함된 각 키 포인트의 카테고리 정보는 목 키 포인트이다.Also, based on the category label of the key point heat map to which each key point belongs, category information corresponding to the key point may be determined. For example, if the category label of the key point heat map A is head, the category information of each key point included in the key point heat map A is the head key point; When the category label of the key point heat map B is a neck, the category information of each key point included in the key point heat map B is a mock key point.

선택 가능한 실시형태에 있어서, 아래의 두 가지 방식에 따라 키 포인트 이미지 모델에서의 각 연결 가장자리를 생성할수 있다.In a selectable embodiment, each connecting edge in the key point image model may be generated according to the following two methods.

방식 1에 있어서, 각 키 포인트에 대응되는 카테고리 정보에 기반하여, 각 키 포인트를 키 포인트가 속하는 카테고리와 상이한 다른 키 포인트와 연결하여, 키 포인트 이미지 모델에서의 연결 가장자리를 형성한다.In method 1, based on category information corresponding to each key point, each key point is linked with another key point different from the category to which the key point belongs to form a connection edge in the key point image model.

방식 2에 있어서, 각 키 포인트에 대응되는 카테고리, 및 기설정된 상이한 카테고리 사이의 매칭 관계에 기반하여, 각 키 포인트를 키 포인트가 속하는 카테고리와 매칭되는 목표 카테고리에 대응되는 키 포인트와 연결하여, 키 포인트 이미지 모델에서의 연결 가장자리를 형성한다.In method 2, based on a matching relationship between a category corresponding to each key point and a predetermined different category, each key point is linked with a key point corresponding to a target category matched with a category to which the key point belongs, It forms the connecting edge in the point image model.

방식 1에 있어서, 각 키 포인트에 대응되는 카테고리 정보에 기반하여, 동일한 카테고리 정보를 갖는 키 포인트를 연결하지 않고, 상이한 카테고리 정보를 갖는 키 포인트를 연결하여, 키 포인트 이미지 모델에서의 연결 가장자리를 형성할 수 있다.In method 1, based on the category information corresponding to each key point, key points having the same category information are not connected, but key points having different category information are connected to form a connection edge in the key point image model can do.

여기서, 각 키 포인트에 대응되는 카테고리 정보에 기반하여, 각 키 포인트 및 키 포인트가 속하는 카테고리와 상이한 다른 키 포인트와 연결하여, 키 포인트 이미지 모델에서의 연결 가장자리를 형성함으로써, 키 포인트 이미지 모델을 획득하여, 후속적으로 각 목표 대상의 각 키 포인트를 결정함에 있어서 데이터 지원을 제공한다.Here, based on the category information corresponding to each key point, each key point and other key points different from the category to which the key point belongs are connected to form a connection edge in the key point image model, thereby obtaining a key point image model Thus, it provides data assistance in subsequently determining each key point of each target object.

방식 2에 있어서, 인체 구조에 기반하여, 상이한 카테고리 사이의 매칭 관계를 미리 설정할 수 있으며, 예를 들어, 기설정된 상이한 카테고리 사이의 매칭 관계는 머리 카테고리와 목 카테고리의 매칭일 수 있고, 목 카테고리 포인트는 각각 왼쪽 어깨 카테고리, 오른쪽 어깨 카테고리, 및 머리 카테고리와 각각 매칭된다. 여기서, 상이한 카테고리 사이의 매칭 관계는 실제 수요에 따라 설정될 수 있다.In method 2, a matching relationship between different categories may be preset based on the human body structure, for example, the matching relationship between the different preset categories may be a head category and a neck category matching, and a neck category point are respectively matched with the left shoulder category, the right shoulder category, and the head category, respectively. Here, the matching relationship between different categories may be established according to actual demand.

나아가 각 키 포인트에 대해, 상기 키 포인트의 카테고리와 매칭되는 목표 카테고리를 결정하고, 상기 키 포인트를 목표 카테고리에 대응되는 키 포인트와 연결하여, 키 포인트 이미지 모델에서의 연결 가장자리를 형성할 수 있다.Furthermore, for each key point, a target category matching the category of the key point may be determined, and the key point may be connected with a key point corresponding to the target category to form a connection edge in the key point image model.

상기 실시 방식에 있어서, 머리 키 포인트가 목 키 포인트와 매칭되고, 발 키 포인트가 무릎 키 포인트와 매칭되도록 미리 설정하는 등과 같은 상이한 카테고리 사이의 매칭 관계를 미리 설정할 수 있으며, 각 키 포인트를 키 포인트가 속하는 카테고리와 매칭되는 목표 카테고리에 대응되는 키 포인트와 연결하므로, 각 키 포인트가 매칭되지 않는 키 포인트와 연결되지 않도록 하고, 상관관계 계산의 계산 량을 감소할 수 있으므로, 키 포인트 검출의 효율을 향상시킨다.In the above implementation manner, matching relationships between different categories may be preset, such as presetting that the head key point matches the neck key point, the foot key point matches the knee key point, etc. Since it connects with key points corresponding to the target category matching the category to which it belongs, it is possible to prevent each key point from being connected to an unmatched key point, and it is possible to reduce the amount of computation of correlation calculation, thereby improving the efficiency of key point detection. improve

도 2a에 도시된 키 포인트 이미지 모델의 예시도와 같이, 상기 이미지에는 방식 1에 기반하여 생성된 키 포인트 이미지 모델(21), 및 방식 2에 기반하여 생성된 키 포인트 이미지 모델(22)이 포함된다. 여기서, 이미지는 제1 카테고리의 키 포인트(201), 제2 카테고리의 키 포인트(202), 및 제3 카테고리의 키 포인트(203)를 포함하고, 상이한 키 포인트 사이의 연결 가장자리(204)를 더 포함한다. 도면에서 도시된 바와 같이, 키 포인트 이미지 모델(21)은, 상이한 카테고리의 각 키 포인트 사이의 연결 가장자리를 포함하고; 키 포인트 이미지 모델(22)은, 매칭 관계가 있는 상이한 키 포인트 사이의 연결 가장자리를 포함하며, 도면에서 도시된 바와 같이 기설정된 상이한 카테고리 사이의 매칭 관계는 제1 카테고리와 제2 카테고리의 매칭, 제2 카테고리와 제3 카테고리 매칭이다.As an exemplary view of the key point image model shown in FIG. 2A , the image includes a key point image model 21 generated based on method 1 and a key point image model 22 generated based on method 2 . Here, the image includes key points 201 of a first category, key points 202 of a second category, and key points 203 of a third category, further adding connecting edges 204 between different key points. include As shown in the figure, the key point image model 21 includes a connecting edge between each key point of a different category; The key point image model 22 includes a connecting edge between different key points having a matching relationship, and as shown in the figure, the preset matching relationship between different categories is the matching of the first category and the second category, the second It is category 2 and category 3 matching.

단계 S104에 대해,For step S104,

여기서, 키 포인트 이미지 모델에 기반하여, 키 포인트 이미지 모델에 포함된 각 키 포인트를 분할하고, 동일한 목표 대상에 속하는 복수 개의 키 포인트를 함께 분할할 수 있음으로써, 검출될 이미지에 포함된 각 목표 대상에 대응되는 복수 개의 키 포인트를 획득할 수 있다. 여기서, 각 목표 대상에 대응되는 키 포인트의 개수는 동일하다.Here, based on the key point image model, each key point included in the key point image model may be divided, and a plurality of key points belonging to the same target target may be divided together, so that each target target included in the image to be detected may be divided. A plurality of key points corresponding to can be obtained. Here, the number of key points corresponding to each target object is the same.

선택 가능한 실시형태에 있어서, 키 포인트 이미지 모델에 기반하여, 동일한 목표 대상에 속하는 각 키 포인트를 결정하는 단계는 A1 및 A2를 포함한다.In a selectable embodiment, determining, based on the key point image model, each key point belonging to the same target object comprises A1 and A2.

단계 A1에 있어서, 키 포인트 이미지 모델 중 각 키 포인트의 정보 및 연결 가장자리의 정보에 기반하여, 키 포인트 이미지 모델에서 연결 관계가 있는 두 개의 키 포인트 사이의 상관관계를 결정한다.In step A1, a correlation between two key points having a connection relationship in the key point image model is determined based on information on each key point in the key point image model and information on a connection edge.

단계 A2에 있어서, 결정된 상관관계에 기반하여, 동일한 목표 대상에 속하는 각 키 포인트를 결정한다.In step A2, each key point belonging to the same target object is determined based on the determined correlation.

상기 실시 방식 하에 있어서, 각 키 포인트의 정보 및 연결 가장자리의 정보에 기반하여, 키 포인트 이미지 모델에서 연결 관계가 있는 두 개의 키 포인트 사이의 상관관계를 결정할 수 있고, 상기 상관관계는 대응되는 두 개의 키 포인트 사이의 동일한 목표 대상에 속하는 확률을 나타낼 수 있으므로, 결정된 상관관계를 통해, 동일한 목표 대상에 속하는 각 키 포인트를 함께 클러스터링하여, 각 목표 대상에 대응되는 각 키 포인트를 획득할 수 있다.Under the above implementation method, a correlation between two key points having a connection relationship in the key point image model may be determined based on the information of each key point and the information of the connection edge, and the correlation is Since the probability of belonging to the same target between key points may be indicated, each key point belonging to the same target may be clustered together through the determined correlation to obtain each key point corresponding to each target.

단계 A1에 있어서, 키 포인트 이미지 모델에서의 각 연결 가장자리에 대해, 상기 연결 가장자리에 대응되는 두 개의 키 포인트 사이의 상관관계를 결정할 수 있고, 즉 각 연결 가장자리에 대응되는 상관관계를 획득할 수 있다.In step A1, for each connecting edge in the key point image model, a correlation between two key points corresponding to the connecting edge may be determined, that is, a correlation corresponding to each connecting edge may be obtained. .

선택 가능한 실시형태로서, 키 포인트 이미지 모델 중 각 키 포인트의 정보 및 연결 가장자리의 정보에 기반하여, 키 포인트 이미지 모델에서 연결 관계가 있는 두 개의 키 포인트 사이의 상관관계를 결정하는 단계는,As an optional embodiment, the determining of a correlation between two key points having a connection relationship in the key point image model based on information of each key point in the key point image model and information on a connection edge includes:

첫째, 각 키 포인트에 대해, 키 포인트의 정보, 및 키 포인트 이미지 모델 중 키 포인트와 연결 관계가 있는 다른 키 포인트의 정보에 기반하여, 키 포인트의 융합 특징을 결정한다.First, for each key point, the fusion characteristic of the key point is determined based on the key point information and information on other key points that have a connection relationship with the key point in the key point image model.

둘째, 각 키 포인트에 각각 대응하는 융합 특징에 기반하여, 상기 키 포인트 이미지 모델에서 연결 관계가 있는 두 개의 키 포인트 사이의 상관관계를 결정한다.Second, a correlation between two key points having a connection relationship in the key point image model is determined based on a fusion feature corresponding to each key point.

각 키 포인트의 융합 특징을 결정하는 경우, 상기 키 포인트의 정보, 및 키 포인트 이미지 모델 중 상기 키 포인트와 연결 관계가 있는 다른 키 포인트의 정보에 기반하여, 키 포인트의 융합 특징을 결정할 수 있고, 여기서, 다른 키 포인트는 키 포인트 이미지 모델 중 상기 키 포인트와의 사이에 연결 가장자리가 있는 키 포인트일 수 있다.When determining the fusion characteristic of each key point, the fusion characteristic of the key point can be determined based on the information on the key point and information on other key points that have a connection relationship with the key point in the key point image model, Here, the other key point may be a key point having a connecting edge between the key point and the key point in the key point image model.

여기서, 키 포인트 이미지 모델에서의 각 키 포인트를 위해 대응하는 융합 특징을 결정할 수 있고, 각 키 포인트에 각각 대응하는 융합 특징에 기반하여, 키 포인트 이미지 모델 중 각 연결 가장자리에 대응되는 두 개의 키 포인트 사이의 상관관계를 결정한다.Here, it is possible to determine a corresponding fusion feature for each key point in the key point image model, and based on the fusion feature respectively corresponding to each key point, two key points corresponding to each connection edge in the key point image model determine the correlation between

실시 과정에 있어서, 융합 특징은 각 키 포인트에 대응되는 특징 벡터일 수 있고, 각 연결 가장자리에 대응되는 두 개의 키 포인트의 융합 특징(특징 벡터) 사이의 유사도를 계산함으로써, 계산하여 획득된 유사도를 이 두 개의 키 포인트 사이의 상관관계로 결정한다.In the implementation process, the fusion feature may be a feature vector corresponding to each key point, and by calculating the similarity between the fusion feature (feature vector) of two key points corresponding to each connection edge, the degree of similarity obtained by calculating It is determined by the correlation between these two key points.

상기 실시 방식 하에 있어서, 각 키 포인트에 대해, 상기 키 포인트의 정보 및 상기 키 포인트와 연결 관계가 있는 다른 키 포인트의 정보에 기반하여, 상기 키 포인트의 융합 특징을 결정함으로써, 상기 키 포인트의 융합 특징은 상기 키 포인트의 특징을 나타낼 수 있을 뿐만 아니라, 상기 키 포인트와 다른 키 포인트 사이의 관련 관계를 나타낼 수도 있으므로, 각 키 포인트에 각각 대응하는 융합 특징에 기반하여, 연결 관계가 있는 두 개의 키 포인트 사이의 상관관계를 결정할 수 있어, 상기 상관관계에 기반하여, 각 목표 대상에 대응되는 각 키 포인트를 비교적 정확하게 결정할 수 있다.Under the above implementation manner, for each key point, the fusion of the key point is determined by determining the fusion characteristic of the key point based on the information of the key point and the information of other key points having a connection relationship with the key point. Since the feature may represent not only the characteristic of the key point, but also the related relationship between the key point and another key point, based on the fusion feature respectively corresponding to each key point, two keys having a connection relationship It is possible to determine a correlation between the points, and based on the correlation, each key point corresponding to each target object can be determined relatively accurately.

단계 A2에 있어서, 키 포인트 이미지 모델 중 각 연결 가장자리에 대응되는 두 개의 키 포인트의 상관관계를 획득한 것으로 결정한 후, 결정된 각 상관관계에 기반하여, 각 키 포인트를 분할하여, 동일한 목표 대상에 속하는 각 키 포인트를 결정할 수 있다.In step A2, after it is determined that the correlation between two key points corresponding to each connection edge in the key point image model is obtained, each key point is divided based on each determined correlation and belongs to the same target object Each key point can be determined.

선택 가능한 실시형태로서, 결정된 상관관계에 기반하여, 동일한 목표 대상에 속하는 각 키 포인트를 결정하는 단계는, 설정된 임계값보다 큰 대응되는 상관관계를 갖는 각 키 포인트를 동일한 목표 대상의 키 포인트로 취하는 단계를 포함할 수 있다.As a selectable embodiment, based on the determined correlation, determining each key point belonging to the same target object comprises: taking each key point having a corresponding correlation greater than a set threshold as a key point of the same target object may include steps.

실시 과정에 있어서, 상관관계의 임계값의 설정, 및 기설정된 각 카테고리의 키 포인트의 목표 개수에 기반하여, 설정된 임계값보다 큰 대응되는 상관관계를 갖는 각 키 포인트를 동일한 목표 대상의 키 포인트로 취한다. 여기서, 각 목표 대상의 키 포인트는 복수 개의 상이한 카테고리의 키 포인트를 포함하고, 각 카테고리의 키 포인트의 개수는 기설정된 목표 개수와 일치하다.In the implementation process, based on the setting of the threshold value of correlation, and the target number of key points of each predetermined category, each key point having a corresponding correlation greater than the set threshold is set as a key point of the same target target get drunk Here, the key points of each target object include a plurality of key points of different categories, and the number of key points in each category coincides with the preset number of targets.

선택 가능한 실시형태에 있어서, 키 포인트 이미지 모델에 기반하여, 동일한 목표 대상에 속하는 각 키 포인트를 결정한 후, 각 목표 대상에 대응되는 각 키 포인트의 정보에 기반하여, 상기 목표 대상의 행동 타입을 결정하는 단계를 더 포함할 수 있다.In a selectable embodiment, after determining each key point belonging to the same target target based on the key point image model, the behavior type of the target target is determined based on information on each key point corresponding to each target target It may further include the step of

여기서, 각 목표 대상의 각 키 포인트의 정보를 획득한 후, 각 목표 대상의 각 키 포인트의 정보를 행동 검출 뉴럴 네트워크에 입력하여, 상기 목표 대상의 행동 타입을 결정할 수 있고, 예를 들어, 상기 행동 타입은 달리기, 걷기, 두 팔 올리기 등일 수 있다.Here, after obtaining information on each key point of each target target, information on each key point of each target target may be input to a behavior detection neural network to determine a behavior type of the target target, for example, the The action type may be running, walking, raising both arms, and the like.

선택 가능한 실시형태에 있어서, 키 포인트 이미지 모델에 기반하여, 동일한 목표 대상에 속하는 각 키 포인트를 결정한 후, 각 목표 대상에 대응되는 각 키 포인트의 정보에 기반하여, 목표 대상에 대한 특수 효과 정보를 생성하는 단계를 더 포함할 수 있다.In a selectable embodiment, after determining each key point belonging to the same target object based on the key point image model, special effect information for the target object is generated based on the information of each key point corresponding to each target object It may further include the step of generating.

여기서, 목표 대상의 각 키 포인트의 정보에 대해, 목표 대상의 목표 부위의 위치를 결정하고, 기설정된 목표 부위 대응되는 특수 효과 정보에 기반하여, 목표 대상의 목표 부위의 위치에서 대응되는 특수 효과 정보를 생성할 수 있다. 여기서, 목표 부위는 팔, 머리, 손 등일 수 있다. 예를 들어, 목표 대상의 각 키 포인트의 정보에 대해, 목표 대상의 팔 위치를 결정하고, 기설정된 팔의 특수 효과 정보에 기반하여, 목표 대상의 팔 위치에서 대응되는 특수 효과 정보를 생성할 수 있다.Here, with respect to the information of each key point of the target target, the location of the target part of the target is determined, and based on the special effect information corresponding to the predetermined target part, special effect information corresponding to the location of the target part of the target target can create Here, the target region may be an arm, a head, a hand, or the like. For example, with respect to the information of each key point of the target target, the arm position of the target target can be determined, and based on the preset arm special effect information, corresponding special effect information can be generated from the arm position of the target target. have.

여기서, 각 목표 대상에 대응되는 각 키 포인트의 정보에 대해, 상기 목표 대상의 동작 타입을 결정하고, 기설정된 동작 타입과 특수 효과 정보 사이의 매핑 관계에 기반하여, 상기 목표 대상을 위해 대응되는 특수 효과 정보를 생성할 수 있다. 예를 들어, 목표 대상 A에 대응되는 각 키 포인트의 정보에 기반하여, 상기 목표 대상의 동작 타입이 하트 동작임을 결정하면, 상기 목표 대상을 위해 하트 모양의 특수 효과 정보를 생성할 수 있다.Here, with respect to the information of each key point corresponding to each target object, the operation type of the target object is determined, and based on the mapping relationship between the preset operation type and the special effect information, a special corresponding to the target object You can create effect information. For example, if it is determined that the operation type of the target object is a heart operation based on information on each key point corresponding to the target object A, heart-shaped special effect information may be generated for the target object.

관련 기술에 있어서, 목표 대상에 대한 키 포인트의 검출은 상향식의 복수 개의 목표 대상 키 포인트 검출 방법을 통해 구현된다. 이러한 방법은 먼저 이미지에서의 모든 가능한 목표 대상 키 포인트를 검출하고, 키 포인트의 임베딩 특징(Embedding feature)과 같은 각 키 포인트의 정보를 동시에 추출하며; 다음, 최적화 방정식을 해결하여, 이러한 키 포인트를 클러스터링하고 복수 개의 상이한 목표 대상에 할당한다. 이로부터 알다시피, 상향식의 방법은 동작 속도가 더 빠르고, 인접한 목표 또는 차단 상황에 대한 로버스트가 더 좋다.In the related art, detection of a key point for a target is implemented through a bottom-up method of detecting a plurality of target key points. This method first detects all possible target target key points in the image, and simultaneously extracts information of each key point, such as an embedding feature of the key point; Then, by solving the optimization equation, these key points are clustered and assigned to a plurality of different target objects. As can be seen from this, the bottom-up method is faster to operate and more robust to adjacent targets or blocking situations.

키 포인트의 임베딩 특징을 추출하는 방법은 방법 1 및 방법 2를 포함한다.A method of extracting an embedding feature of a key point includes method 1 and method 2 .

방법 1에 있어서, 키 포인트 임베딩 특징(Keypoint Embedding, KE)에 기반하여 구현한다.In method 1, it is implemented based on the key point embedding feature (Keypoint Embedding, KE).

방법 2에 있어서, 공간 인스턴스 임베딩 특징(Spatial Instance Embedding, SIE)의 키 포인트 검출 방법에 기반하여 구현한다.In method 2, it is implemented based on a key point detection method of spatial instance embedding features (SIE).

여기서, 방법 1의 훈련 과정에 있어서, KE 가 주로 키 포인트 근처의 픽셀의 겉보기 정보를 포함하므로, 동일한 목표 대상의 각 키 포인트의 임베딩 값을 좁히며, 상이한 목표 대상의 키 포인트의 임베딩 값을 넓히므로, 키 포인트의 임베딩 특징을 추출을 구현할 수 있다.Here, in the training process of method 1, since KE mainly includes the apparent information of pixels near the key point, the embedding value of each key point of the same target object is narrowed, and the embedding value of the key point of the different target object is widened. Therefore, it is possible to implement extraction of the embedding feature of the key point.

여기서, 방법 2의 훈련 과정에 있어서, SIE는 목표 대상 중심 위치 정보를 포함하므로, 각 픽셀 값이 목표 대상 중심으로 회귀하는 벡터를 통해, 키 포인트의 임베딩 특징의 추출을 구현할 수 있다.Here, in the training process of Method 2, since the SIE includes target target center position information, extraction of the embedding feature of the key point may be implemented through a vector in which each pixel value returns to the target target center.

이로부터 알다시피, KE는 주로 픽셀 특징 정보에 관한 것이고, 여기서, 상기 픽셀 특징 정보는 픽셀의 픽셀 값과 같은 픽셀의 겉보기 정보를 포함할 수 있고, 공간 위치 정보에 대해 예민하지 않으며, 긴 거리의 키 포인트 사이의 관계를 모델링할 수 있으나, 공간 제약이 부족하므로, KE에만 의존하면 먼 거리의 상이한 목표 대상의 키 포인트가 함께 모이는 문제가 존재할 수 있다.As can be seen from this, KE mainly relates to pixel characteristic information, where the pixel characteristic information may include apparent information of a pixel, such as a pixel value of a pixel, is not sensitive to spatial location information, and is Although it is possible to model the relationship between key points, there may be a problem in that the key points of different target objects at a long distance gather together if only relying on KE because of the lack of space constraints.

이로부터 알다시피, 관련 기술에서, 상향식의 키 포인트 검출 방법에 있어서, 키 포인트의 검출 및 키 포인트의 클러스터링은 두 개의 독립적인 단계이다. 키 포인트의 클러스터링은 미분할 수 없는 후처리 작업이다. 아래와 같은 몇 가지 문제가 존재하며, 첫째, 키 포인트의 클러스터링 과정은 고정된 후처리이고, 데이터에서 학습하여 최적화할 수 없으며; 둘째, 공동 최적화를 수행하지 않았으므로, 키 포인트의 클러스터링 정밀도가 높지 않고, 상이한 목표 대상의 키 포인트를, 동일한 목표 대상의 키 포인트로 간주하여 클러스터링하는 것과 같은 클러스터링 오류가 발생할 수 있으며; 셋째, 클러스터링 후처리 과정은, 추가 하이퍼 파라미터 설정이 필요하다.As can be seen from this, in the related art, in the bottom-up key point detection method, the key point detection and the key point clustering are two independent steps. Clustering of key points is an indivisible post-processing operation. There are several problems as follows, first, the clustering process of key points is a fixed post-processing, and cannot be optimized by learning from data; Second, since joint optimization is not performed, the clustering precision of key points is not high, and clustering errors such as clustering key points of different target objects as key points of the same target object may occur; Third, the clustering post-processing process requires additional hyperparameter settings.

상기 문제를 해결하기 위해, 본 발명의 실시예는 키 포인트 검출 방법을 제공하고, 상기 방법의 시스템 블록도는 도 2b에 도시된 바와 같으며,In order to solve the above problem, an embodiment of the present invention provides a key point detection method, the system block diagram of the method is as shown in Fig. 2b,

관련 기술에 있어서, 키 포인트의 검출 단계 S21 및 키 포인트의 클러스터링 단계 S22를 포함한다. 키 포인트의 클러스터링 단계는 키 포인트 검출 후의 후처리 작업이다.In the related art, it includes a key point detection step S21 and a key point clustering step S22. The clustering step of key points is a post-processing operation after key point detection.

본 발명의 실시예는 키 포인트 검출 방법을 제공하고, 키 포인트 후보 추출 모듈 G1(Keypoint Candidate Proposal) 및 이미지 클러스터링 모듈 G2(Graph Grouping)를 포함한다.An embodiment of the present invention provides a key point detection method, and includes a key point candidate extraction module G1 (Keypoint Candidate Proposal) and an image clustering module G2 (Graph Grouping).

키 포인트 후보 추출 모듈 G1은, 각 프레임의 이미지에 대해, 먼저 멀티 작업의 상향식의 목표 대상 키 포인트 모델을 통해, 목표 대상 키 포인트의 키 포인트 히트맵(Heatmaps), 및 이미지 특징맵 또는 공간 인스턴스 이미지 특징맵 (Feature Maps)을 직접 출력하기 위한 것이다. 키 포인트 히트맵을 통해 최대값(argmax)을 획득하므로, 키 포인트의 픽셀 좌표 위치 X를 획득할 수 있다. 동시에, 이미지 특징맵의 대응되는 위치 정보에서, 키 포인트의 특징 정보 F를 획득할 수 있다.Key point candidate extraction module G1, for the image of each frame, first through a multi-task bottom-up target target key point model, key point heatmaps of target target key points, and image feature maps or spatial instance images This is to directly output feature maps. Since the maximum value (argmax) is obtained through the key point heatmap, the pixel coordinate position X of the key point can be obtained. At the same time, it is possible to obtain the characteristic information F of the key point from the corresponding position information of the image characteristic map.

이미지 클러스터링 모듈 G2는, 두 개마다의 키 포인트가 동일한 목표 대상에 속하는지 여부를 판단하고, 동일한 목표 대상의 키 포인트를 클러스터링하기 위한 것이다.The image clustering module G2 is configured to determine whether every two key points belong to the same target object, and cluster the key points of the same target object.

예를 들어, 도 2c는 키 포인트 히트맵 P1 및 이미지 특징맵 P2로, 이미지 컨볼루션 네트워크를 입력하는 과정을 도시한다. 먼저, 키 포인트의 정보를 추출하고, 키 포인트 이미지 모델 G={V, E}를 구축한다. 키 포인트 이미지 모델 G는 키 포인트 V 및 가장자리 E 두 개의 부분으로 분할되고, 여기서, 키 포인트 V는 각 키 포인트의 정보이며, 즉 「키 포인트의 카테고리 T, 키 포인트의 좌표 X, 키 포인트의 특징 정보 F」를 포함한다. 가장자리 V는 키 포인트 사이의 관계 즉 동일한 목표 대상에 속하는지 여부를 표시한다. 키 포인트 이미지 모델을 구축한 후, 키 포인트 상관관계의 계산을 수행한다. 에지 컨볼루션 EdgeConv 구축맵 컨볼루션 뉴럴 네트워크 모델을 사용하여, 키 포인트 이미지 모델에 대해 컨볼루션을 수행하고, 키 포인트의 특징 정보를 지속적으로 업데이트한다. 이어서 가장자리 분류 기기를 훈련하고, 각 쌍의 키 포인트를 판별하여, 해당 쌍의 키 포인트가 동일한 목표 대상에 속하는지 여부를 판단한다. 키 포인트 후보 추출 모듈 G1, 이미지 클러스터링 모듈 G2, 가장자리 분류 기기를 통해 엔드 투 엔드 훈련 최적화를 수행할 수 있다.For example, FIG. 2C shows a process of inputting an image convolution network into a key point heat map P1 and an image feature map P2. First, the key point information is extracted, and the key point image model G={V, E} is constructed. The key point image model G is divided into two parts, the key point V and the edge E, where the key point V is the information of each key point, that is, "the category T of the key point, the coordinate X of the key point, and the characteristic of the key point. information F”. Edge V indicates the relationship between key points, i.e. whether they belong to the same target object. After building the key point image model, the calculation of key point correlation is performed. Edge Convolution EdgeConv Construction Map Using the convolutional neural network model, convolution is performed on the key point image model, and the feature information of the key point is continuously updated. Next, the edge classification device is trained, and each pair of key points is determined to determine whether the pair of key points belong to the same target object. End-to-end training optimization can be performed through the key point candidate extraction module G1, the image clustering module G2, and the edge classification device.

가능한 실시형태에 있어서, 본 발명의 실시예에서 제공하는 키 포인트 검출 방법의 일 측면에 있어서, 목표 대상 키 포인트의 위치 정보에 대해 예측을 수행할 수 있고; 다른 측면에 있어서, 상기 목표 대상의 행동 타입을 결정할 수 있으며; 또 다른 측면에 있어서, 목표 대상의 상이한 부위에서 실시간 특수 효과 정보를 증가할 수 있다.In a possible embodiment, in one aspect of the key point detection method provided in the embodiments of the present invention, prediction may be performed on the location information of the target key point; In another aspect, it is possible to determine the behavioral type of the target object; In another aspect, real-time special effect information can be increased in different parts of the target object.

상기 방법을 사용하여, 다중 목표 대상 포즈 추정을 이미지 클러스터링 문제로 변환할 수 있고; 목표 대상의 토폴로지 구조 정보를 유지하여, 다중 목표 대상 포즈 이미지에 대한 특징 정보 추출을 수행할 수 있으며; 하나의 딥 뉴럴 네트워크 모델을 통해, 다중 목표 대상 포즈의 클러스터링 결과를 획득할 수 있고; 관련 기술에 있어서, 키 포인트에 대한 클러스터링 등 후처리 작업을 예방할 수 있으므로, 작업 단계를 간소화할 수 있으며; 키 포인트의 검출 및 키 포인트의 클러스터링을 통합하여, 키 포인트의 클러스터링 정밀도를 향상시킬 수 있다.Using the above method, it is possible to transform the multi-target object pose estimation into an image clustering problem; by maintaining the topological structure information of the target target to perform feature information extraction for multiple target target pose images; Through one deep neural network model, it is possible to obtain clustering results of multiple target object poses; In the related art, it is possible to prevent post-processing operations such as clustering for key points, so that the operation steps can be simplified; By integrating the detection of key points and clustering of key points, it is possible to improve the clustering precision of key points.

본 분야의 기술자는 실시형태의 상기 방법에서, 각 단계의 기록 순서는 엄격한 실행 순서를 의미하지 않고 실시 과정에 대한 임의의 제한을 구성하며, 각 단계의 실행 순서는 그 기능 및 가능한 내부 논리에 의해 결정된다는 것을 이해할 수 있다.A person skilled in the art will know that in the above method of the embodiment, the recording order of each step does not imply a strict execution order but constitutes any limitation on the implementation process, and the execution order of each step is determined by its function and possible internal logic. can be understood to be determined.

동일한 구상에 기반하여, 본 발명의 실시예는 키 포인트 검출 장치를 제공하고, 도 3은, 본 발명의 실시예에서 제공한 키 포인트 검출 장치의 아키텍처 예시도이고, 획득 모듈(301), 제1 생성 모듈(302), 제2 생성 모듈(303), 결정 모듈(304), 행동 타입 결정 모듈(305), 및 특수 효과 정보 생성 모듈(306)을 포함하며, 여기서,Based on the same concept, an embodiment of the present invention provides a key point detection device, and Fig. 3 is an architectural illustration of the key point detection device provided in an embodiment of the present invention, the acquisition module 301, the first a generating module 302 , a second generating module 303 , a determining module 304 , a behavior type determining module 305 , and a special effect information generating module 306 , wherein:

검출될 이미지를 획득하도록 구성된 획득 모듈(301);an acquiring module 301, configured to acquire an image to be detected;

상기 검출될 이미지에 기반하여, 이미지 특징맵 및 복수 개의 키 포인트 히트맵을 생성하도록 구성된 제1 생성 모듈(302) - 상기 이미지 특징맵은 상기 검출될 이미지 중 각 목표 대상 사이의 상대적 위치 관계를 나타내기 위한 것이고, 각 상기 키 포인트 히트맵은 상기 검출될 이미지의 하나의 카테고리의 키 포인트를 포함하며, 상이한 카테고리의 키 포인트는 상기 목표 대상의 상이한 부위에 대응됨 - ;a first generating module 302, configured to generate, based on the image to be detected, an image feature map and a plurality of key point heat maps, wherein the image feature map indicates a relative positional relationship between each target object in the image to be detected wherein each said key point heatmap includes key points of one category of said to-be-detected image, wherein key points of different categories correspond to different parts of said target object;

상기 이미지 특징맵 및 복수 개의 상기 키 포인트 히트맵에 기반하여, 키 포인트 이미지 모델을 생성하도록 구성된 제2 생성 모듈(303) - 상기 키 포인트 이미지 모델은 상기 검출될 이미지 중 상이한 카테고리의 키 포인트의 정보 및 연결 가장자리의 정보를 포함하고, 각 연결 가장자리는 두 개의 상이한 카테고리의 키 포인트 사이의 가장자리임 - ; 및a second generating module 303, configured to generate a key point image model based on the image feature map and the plurality of key point heat maps, wherein the key point image model includes information of key points of different categories among the to-be-detected images and information of connecting edges, each connecting edge being an edge between key points of two different categories; and

상기 키 포인트 이미지 모델에 기반하여, 동일한 목표 대상에 속하는 각 키 포인트를 결정하도록 구성된 결정 모듈(304)을 포함한다.and a determining module 304, configured to determine, based on the key point image model, each key point belonging to the same target object.

가능한 실시형태에 있어서, 상기 키 포인트 이미지 모델에 기반하여, 동일한 목표 대상에 속하는 각 키 포인트를 결정하는 경우, 상기 결정 모듈(304)은,In a possible embodiment, when determining, based on the key point image model, each key point belonging to the same target object, the determining module 304 comprises:

상기 키 포인트 이미지 모델 중 각 키 포인트의 정보 및 상기 연결 가장자리의 정보에 기반하여, 상기 키 포인트 이미지 모델에서 연결 관계가 있는 두 개의 키 포인트 사이의 상관관계를 결정하고; determine a correlation between two key points having a connection relationship in the key point image model based on information on each key point in the key point image model and information on the connection edge;

결정된 상기 상관관계에 기반하여, 동일한 목표 대상에 속하는 각 키 포인트를 결정하도록 구성된다.and determine each key point belonging to the same target object based on the determined correlation.

가능한 실시형태에 있어서, 결정된 상기 상관관계에 기반하여, 동일한 목표 대상에 속하는 각 키 포인트를 결정하는 경우, 상기 결정 모듈(304)은,In a possible embodiment, when determining, based on the determined correlation, each key point belonging to the same target object, the determining module 304 is configured to:

상기 기설정된 임계값보다 큰 대응되는 상기 상관관계를 갖는 각 키 포인트를 동일한 목표 대상의 키 포인트로 하도록 구성된다.and make each key point having the corresponding correlation greater than the preset threshold as a key point of the same target object.

가능한 실시형태에 있어서, 상기 키 포인트 이미지 모델 중 각 키 포인트의 정보 및 상기 연결 가장자리의 정보에 기반하여, 상기 키 포인트 이미지 모델에서 연결 관계가 있는 두 개의 키 포인트 사이의 상관관계를 결정하는 경우, 상기 결정 모듈(304)은,In a possible embodiment, when determining a correlation between two key points having a connection relationship in the key point image model based on information on each key point in the key point image model and information on the connection edge, The determination module 304,

각 키 포인트에 대해, 상기 키 포인트의 정보, 및 상기 키 포인트 이미지 모델 중 상기 키 포인트와 연결 관계가 있는 다른 키 포인트의 정보에 기반하여, 상기 키 포인트의 융합 특징을 결정하고;for each key point, determine a fusion characteristic of the key point, based on the information of the key point and information on other key points having a connection relationship with the key point in the key point image model;

각 키 포인트에 각각 대응하는 융합 특징에 기반하여, 상기 키 포인트 이미지 모델에서 연결 관계가 있는 두 개의 키 포인트 사이의 상관관계를 결정하도록 구성된다.and determine a correlation between two key points having a connection relationship in the key point image model, based on the fusion feature respectively corresponding to each key point.

가능한 실시형태에 있어서, 상기 키 포인트의 정보는 위치 정보, 카테고리 정보, 및 픽셀 특징 정보를 포함하고;In a possible embodiment, the information of the key point includes location information, category information, and pixel characteristic information;

상기 제2 생성 모듈(303)은,The second generation module 303 is

상기 키 포인트 히트맵에 기반하여, 각 키 포인트의 위치 정보를 결정하는 단계; 및determining location information of each key point based on the key point heat map; and

각 상기 키 포인트의 위치 정보에 기반하여, 상기 이미지 특징맵에서 상기 키 포인트의 픽셀 특징 정보를 추출하고, 상기 키 포인트가 속하는 키 포인트 히트맵의 카테고리 라벨에 기반하여, 상기 키 포인트에 대응되는 카테고리 정보를 결정하는 단계에 따라 상기 키 포인트 이미지 모델 중 각 키 포인트의 정보를 결정하도록 구성된다.Based on the location information of each key point, pixel characteristic information of the key point is extracted from the image feature map, and a category corresponding to the key point is extracted based on the category label of the key point heat map to which the key point belongs. and determine information of each key point in the key point image model according to the determining information.

가능한 실시형태에 있어서, 상기 제2 생성 모듈(303)은,In a possible embodiment, the second generating module 303 comprises:

각 키 포인트에 대응되는 상기 카테고리 정보에 기반하여, 각 키 포인트를 상기 키 포인트가 속하는 카테고리와 상이한 다른 키 포인트와 연결하여, 상기 키 포인트 이미지 모델에서의 연결 가장자리를 형성하는 단계에 따라 상기 키 포인트 이미지 모델에서의 각 연결 가장자리를 생성하도록 구성된다.Based on the category information corresponding to each key point, each key point is linked with another key point different from the category to which the key point belongs to form a connection edge in the key point image model; It is configured to create each connecting edge in the image model.

각 키 포인트에 대응되는 상기 카테고리, 및 기설정된 상이한 카테고리 사이의 매칭 관계에 기반하여, 각 키 포인트를 상기 키 포인트가 속하는 카테고리와 매칭되는 목표 카테고리에 대응되는 키 포인트와 연결하여, 상기 키 포인트 이미지 모델에서의 연결 가장자리를 형성하는 단계에 따라 상기 키 포인트 이미지 모델에서의 각 연결 가장자리를 생성하도록 구성된다.Based on the category corresponding to each key point and a matching relationship between different preset categories, each key point is linked with a key point corresponding to a target category matching the category to which the key point belongs, and the key point image and generate each connecting edge in the key point image model according to the step of forming a connecting edge in the model.

가능한 실시형태에 있어서, 상기 키 포인트 이미지 모델에 기반하여, 동일한 목표 대상에 속하는 각 키 포인트를 결정한 후,In a possible embodiment, after determining, based on the key point image model, each key point belonging to the same target object,

각 목표 대상에 대응되는 각 키 포인트의 정보에 기반하여, 상기 목표 대상의 행동 타입을 결정하도록 구성된 행동 타입 결정 모듈(305)을 더 포함한다.and a behavior type determination module 305, configured to determine a behavior type of the target target based on the information of each key point corresponding to each target target.

각 목표 대상에 대응되는 각 키 포인트의 정보에 기반하여, 상기 목표 대상에 대한 특수 효과 정보를 생성하도록 구성된 특수 효과 정보 생성 모듈(306)을 더 포함한다.and a special effect information generating module 306, configured to generate special effect information for the target object based on the information of each key point corresponding to each target object.

일부 실시예에 있어서, 본 발명의 실시예에서 제공한 장치가 갖고 있는 기능 또는 포함하는 템플렛은 전술한 방법 실시예에서 설명한 방법을 실행하는데 사용될 수 있고, 그 구현은 전술한 방법 실시예의 설명을 참조할 수 있으며, 간결함을 위해, 여기서 더이상 반복하여 설명하지 않는다.In some embodiments, the functions possessed by the apparatus provided in the embodiments of the present invention or templates including the functions may be used to execute the methods described in the above-described method embodiments, and for implementation, refer to the description of the above-described method embodiments. and, for the sake of brevity, are not further repeated here.

동일한 기술적 사상에 기반하여, 본 발명의 실시예는 전자 기기를 더 제공한다. 도 4는, 본 발명의 실시예에서 제공한 전자 기기의 구조 예시도이고, 프로세서(401), 메모리(402) 및 버스(403)를 포함한다. 여기서, 메모리(402)는 실행 명령어를 저장하도록 구성되고, 내부 메모리(4021) 및 외부 메모리(4022)를 포함하며; 여기서 내부 메모리(4021)는 내부 저장소로도 지칭되며, 프로세서(401)에서의 연산 데이터 및 하드 디스크 등 외부 메모리(4022)와 교환되는 데이터를 임시 저장하도록 구성되며, 프로세서(401)는 내부 메모리(4021)와 외부 메모리(4022)를 통해 데이터 교환을 수행하고, 전자기기(400)이 동작하는 경우, 프로세서(401)와 메모리(402)사이는 버스(403)를 통해 통신하여, 프로세서(401)가 아래의 단계를 실행하도록 한다. 즉Based on the same technical idea, an embodiment of the present invention further provides an electronic device. 4 is an exemplary structural diagram of an electronic device provided in an embodiment of the present invention, and includes a processor 401 , a memory 402 and a bus 403 . Here, the memory 402 is configured to store an execution instruction, and includes an internal memory 4021 and an external memory 4022 ; Here, the internal memory 4021 is also referred to as an internal storage, and is configured to temporarily store data exchanged with the external memory 4022 such as arithmetic data in the processor 401 and a hard disk, and the processor 401 includes the internal memory ( 4021) and the external memory 4022 perform data exchange, and when the electronic device 400 operates, the processor 401 and the memory 402 communicate through the bus 403, and the processor 401 to execute the steps below. In other words

검출될 이미지를 획득하는 단계;acquiring an image to be detected;

상기 키 포인트 이미지 모델에 기반하여, 동일한 목표 대상에 속하는 각 키 포인트를 결정하는 단계를 실행한다.Based on the key point image model, determining each key point belonging to the same target object is performed.

또한, 본 발명의 실시예는 컴퓨터 판독 가능한 저장 매체를 더 제공하고, 상기 컴퓨터 판독 가능한 저장 매체에는 컴퓨터 프로그램이 저장되어 있으며, 상기 컴퓨터 프로그램이 프로세서에 의해 작동될 경우 상기 방법 실세예 중 상기 키포인트 검출 방법의 단계를 실행한다.In addition, an embodiment of the present invention further provides a computer readable storage medium, wherein the computer program is stored in the computer readable storage medium, and when the computer program is operated by a processor, detecting the keypoint in the embodiment of the method Execute the steps of the method.

본 발명의 실시예에서 제공한 키 포인트 검출 방법의 컴퓨터 프로그램 제품은, 프로그램 코드를 포함하는 컴퓨터 판독 가능한 저장 매체를 포함하고, 상기 프로그램 코드에 포함된 명령어는 상기 방법 실시예에 따른 키 포인트 검출 방법의 단계를 실행하는데 사용될 수 있으며, 상기 방법 실시예를 참조할 수 있고, 여기서 더이상 반복하여 설명하지 않는다.The computer program product of the key point detection method provided in the embodiment of the present invention includes a computer readable storage medium including a program code, and the instructions included in the program code are the key point detection method according to the embodiment of the method. may be used to perform the steps of , and reference may be made to the above method embodiments, which will not be repeated herein any longer.

본 기술분야의 통상의 기술자는 설명의 편의 및 간결함을 위해, 상기 설명된 시스템, 장치의 동작 과정이, 전술된 방법 실시예 중 대응되는 과정을 참조할 수 있음을 이해할 수 있으며, 여기서 반복적으로 설명하지 않는다. 본 발명에서 제공된 몇 개의 실시예에 있어서, 개시된 시스템, 장치 및 방법은 다른 방식으로 구현될 수 있음을 이해해야 한다. 전술된 장치 실시예는 다만 예시적이며, 예를 들어, 상기 유닛에 대한 분할은 다만 논리적 기능 분할이고, 실제로 구현될 경우 다른 분할 방식이 있을 수 있으며, 또 예를 들어, 복수 개의 유닛 또는 컴포넌트는 다른 시스템에 결합되거나 통합될 수 있거나, 일부 특징을 무시하거나 실행하지 않을 수 있다. 또한, 나타내거나 논의된 상호간의 결합 또는 직접 결합 또는 통신 연결은, 일부 통신 인터페이스를 통해 구현되며, 장치 또는 유닛을 통한 간접 결합 또는 통신 연결은, 전기, 기계 또는 다른 형태일 수 있다.A person skilled in the art may understand that, for convenience and brevity of description, the operation process of the above-described system and apparatus may refer to a corresponding process among the above-described method embodiments, which will be described repeatedly I never do that. In the several embodiments provided herein, it should be understood that the disclosed systems, apparatuses, and methods may be implemented in other ways. The above-described device embodiment is merely exemplary, for example, the division for the unit is merely logical function division, and when actually implemented, there may be other division methods, and for example, a plurality of units or components may be It may be combined or integrated into other systems, or some features may be ignored or not implemented. Further, any coupling or direct coupling or communication connection between each other shown or discussed may be implemented through some communication interface, and the indirect coupling or communication connection through a device or unit may be electrical, mechanical, or other form.

상기 분리 부재로서 설명된 유닛은, 물리적으로 분리된 것이거나 아닐 수 있고, 유닛으로서 나타낸 부재는 물리적 유닛이거나 아닐 수 있고, 즉 한 곳에 위치할 수 있거나, 복수 개의 네트워크 유닛에 분포될 수도 있다. 실제 필요에 따라 유닛의 일부 또는 전부를 선택하여 본 실시예 방안의 목적을 구현할 수 있다.The unit described as the separation member may or may not be physically separated, and the member shown as a unit may or may not be a physical unit, that is, it may be located in one place or may be distributed in a plurality of network units. According to actual needs, some or all of the units may be selected to implement the purpose of the present embodiment method.

또한, 본 발명의 각 실시예에서의 각 기능 유닛은 하나의 처리 유닛에 통합될 수 있고, 각 유닛이 독립적인 물리적 존재일 수도 있고, 두 개 또는 두 개 이상의 유닛이 한 유닛에 통합될 수도 있다.In addition, each functional unit in each embodiment of the present invention may be integrated into one processing unit, each unit may be an independent physical entity, and two or two or more units may be integrated into one unit. .

상기 기능이 소프트웨어 기능 유닛의 형태로 구현되고 단독적인 제품으로 판매되거나 사용될 경우, 하나의 프로세서가 실행 가능한 비휘발성 컴퓨터 판독 가능한 저장 매체에 저장될 수 있다. 이러한 이해에 기반하여, 본 발명의 기술 방안, 즉 종래 기술에 기여하는 부분 또는 상기 기술 방안의 일부는 소프트웨어 제품의 형태로 구현될 수 있고, 상기 컴퓨터 소프트웨어 제품은 하나의 저장 매체에 저장되며, 하나의 컴퓨터 기기(개인용 컴퓨터, 서버 또는 네트워크 기기 등일 수 있음)로 하여금 본 발명의 각 실시예의 상기 방법의 전부 또는 일부 단계를 실행하는데 사용되는 복수 개의 명령어를 포함한다. 전술한 저장 매체는 U 디스크, 모바일 디스크, 판독 전용 메모리(Read-Only Memory, ROM), 랜덤 액세스 메모리(Random Access Memory, RAM), 자기 디스크 또는 광 디스크 등 프로그램 코드를 저장할 수 있는 여러 가지 매체를 포함한다.When the above functions are implemented in the form of a software function unit and sold or used as a standalone product, one processor may be stored in an executable nonvolatile computer readable storage medium. Based on this understanding, the technical solution of the present invention, that is, a part contributing to the prior art or a part of the technical solution may be implemented in the form of a software product, the computer software product being stored in one storage medium, one It includes a plurality of instructions used to cause a computer device (which may be a personal computer, a server or a network device, etc.) of The above-mentioned storage medium includes a variety of media capable of storing a program code, such as a U disk, a mobile disk, a read-only memory (ROM), a random access memory (RAM), a magnetic disk or an optical disk. include

이상의 설명은 다만 본 발명의 구체적인 실시 형태일뿐이고, 본 발명의 보호 범위는 이에 한정되지 않는다. 본 출원이 속하는 기술 분야의 통상의 기술자라면, 본 발명에서 개시된 기술적 범위 내의 변화 또는 교체가 모두 본 발명의 보호 범위 내에 속해야 함을 쉽게 알 수 있을 것이다. 따라서, 본 발명의 보호 범위는 특허청구범위의 보호 범위를 기준으로 해야 한다.The above description is only specific embodiments of the present invention, and the protection scope of the present invention is not limited thereto. Those of ordinary skill in the art to which this application pertains will readily appreciate that all changes or replacements within the technical scope disclosed in the present invention must fall within the protection scope of the present invention. Accordingly, the protection scope of the present invention should be based on the protection scope of the claims.

본 발명은 생성한 이미지 특징맵 및 복수 개의 키 포인트 히트맵에 기반하여, 검출될 이미지에 대응되는 키 포인트 이미지 모델을 생성할 수 있고, 키 포인트 이미지 모델에는 이미지 특징맵 및 키 포인트 히트맵에서의 정보를 포함하고, 이미지 특징맵은 검출될 이미지 중 상이한 목표 대상 사이의 상대적 위치 관계를 나타낼 수 있으며, 따라서 키 포인트 이미지 모델에 기반하여, 상이한 목표 대상의 키 포인트에 대해 비교적 정확하게 구분할 수 있으므로, 키 포인트 클러스터링의 정확성을 향상시킬 수 있다.The present invention can generate a key point image model corresponding to an image to be detected based on the generated image feature map and a plurality of key point heat maps, and the key point image model includes an image feature map and a key point heat map. information, and the image feature map can indicate the relative positional relationship between different target objects in the image to be detected, and thus, based on the key point image model, can distinguish relatively accurately for key points of different target objects, so that the key The accuracy of point clustering can be improved.

Claims

A key point detection method comprising:
acquiring an image to be detected;
generating an image feature map and a plurality of key point heat maps based on the image to be detected, wherein the image feature map is for indicating a relative positional relationship between each target object in the image to be detected, each of the key points the point heat map includes key points of one category of the image to be detected, and key points of different categories correspond to different parts of the target object;
generating a key point image model based on the image feature map and the plurality of key point heat maps, wherein the key point image model includes information on key points of different categories among the images to be detected and information on connection edges , and each connecting edge is an edge between key points of two different categories; and
and determining each key point belonging to the same target object based on the key point image model.

According to claim 1,
Determining each key point belonging to the same target object based on the key point image model comprises:
determining a correlation between two key points having a connection relationship in the key point image model based on information on each key point in the key point image model and information on the connection edge; and
and determining each key point belonging to the same target object based on the determined correlation.

3. The method of claim 2,
Based on the determined correlation, determining each key point belonging to the same target includes:
and taking each key point having the corresponding correlation greater than the preset threshold as a key point of the same target object.

4. The method of claim 2 or 3,
Determining a correlation between two key points having a connection relationship in the key point image model based on information on each key point in the key point image model and information on the connection edge includes:
determining, for each key point, a fusion characteristic of the key point based on information on the key point and information on other key points having a connection relationship with the key point in the key point image model; and
and determining a correlation between two key points having a connection relationship in the key point image model based on a fusion feature corresponding to each key point, respectively.

5. The method according to any one of claims 1 to 4,
the key point information includes location information, category information, and pixel characteristic information;
determining location information of each key point based on the key point heat map; and
Based on the location information of each key point, pixel characteristic information of the key point is extracted from the image feature map, and a category corresponding to the key point is extracted based on the category label of the key point heat map to which the key point belongs. Determining information; according to the key point detection device, characterized in that the information of each key point in the key point image model is determined.

6. The method of claim 5,
Based on the category information corresponding to each key point, each key point is linked with another key point different from the category to which the key point belongs, and the key point according to the step of forming a connection edge in the key point image model A method for detecting key points, comprising generating each connecting edge in the image model.

6. The method of claim 5,
Based on the category corresponding to each key point and a matching relationship between different preset categories, each key point is linked with a key point corresponding to a target category matching the category to which the key point belongs, and the key point image A key point detection method, characterized in that generating each connecting edge in the key point image model according to the step of forming a connecting edge in the model.

8. The method according to any one of claims 1 to 7,
After determining each key point belonging to the same target object based on the key point image model,
The method of detecting a key point further comprising the step of determining a behavior type of the target target based on information on each key point corresponding to each target target.

8. The method according to any one of claims 1 to 7,
After determining each key point belonging to the same target object based on the key point image model,
The method of detecting a key point further comprising the step of generating special effect information on the target target based on the information of each key point corresponding to each target target.

A key point detection device comprising:
an acquiring module configured to acquire an image to be detected;
a first generating module, configured to generate, based on the image to be detected, an image feature map and a plurality of key point heat maps, wherein the image feature map is for indicating a relative positional relationship between each target object in the image to be detected , each of the key point heatmaps includes key points of one category of the image to be detected, and key points of different categories correspond to different parts of the target object;
a second generating module, configured to generate a key point image model based on the image feature map and the plurality of key point heat maps, wherein the key point image model includes information and connecting edges of key points of different categories among the to-be-detected images contains information from, each connecting edge being an edge between key points of two different categories; and
and a determining module, configured to determine, based on the key point image model, each key point belonging to the same target object.

11. The method of claim 10,
When determining each key point belonging to the same target object based on the key point image model, the determination module is configured to:
determine a correlation between two key points having a connection relationship in the key point image model based on information on each key point in the key point image model and information on the connection edge;
and determine each key point belonging to the same target object based on the determined correlation.

12. The method of claim 11,
When determining each key point belonging to the same target object based on the determined correlation, the determination module is configured to:
and each key point having the corresponding correlation greater than the preset threshold is configured to be a key point of the same target object.

13. The method of claim 11 or 12,
When determining a correlation between two key points having a connection relationship in the key point image model based on the information on each key point in the key point image model and the information on the connection edge, the determination module is configured to:
for each key point, determine a fusion characteristic of the key point, based on the information of the key point and information on other key points having a connection relationship with the key point in the key point image model;
and determine a correlation between two key points having a connection relationship in the key point image model, based on a fusion feature respectively corresponding to each key point.

14. The method according to any one of claims 10 to 13,
the key point information includes location information, category information, and pixel characteristic information;
The second generation module,
determine location information of each key point based on the key point heat map;
Based on the location information of each key point, pixel characteristic information of the key point is extracted from the image feature map, and a category corresponding to the key point is extracted based on the category label of the key point heat map to which the key point belongs. and determine information of each key point in the key point image model according to the step of determining the information.

15. The method of claim 14,
The second generation module,
Based on the category information corresponding to each key point, each key point is linked with another key point different from the category to which the key point belongs, and the key point according to the step of forming a connection edge in the key point image model A key point detection device configured to generate each connecting edge in the image model.

15. The method of claim 14,
The second generation module,
Based on the category corresponding to each key point and a matching relationship between different preset categories, each key point is linked with a key point corresponding to a target category matching the category to which the key point belongs, and the key point image and generate each connecting edge in the key point image model according to the step of forming a connecting edge in the model.

17. The method according to any one of claims 10 to 16,
After determining each key point belonging to the same target object based on the key point image model,
The device for detecting a key point according to claim 1, further comprising: a behavior type determination module configured to determine a behavior type of the target target based on information on each key point corresponding to each target target.

17. The method according to any one of claims 10 to 16,
After determining each key point belonging to the same target object based on the key point image model,
The key point detection apparatus according to claim 1, further comprising: a special effect information generating module configured to generate special effect information on the target object based on the information of each key point corresponding to each target object.

As an electronic device,
a processor, a memory and a bus, wherein the memory stores executable machine readable instructions, and when an electronic device is operated, the processor and the memory communicate via a bus, and the machine readable instructions 10. Computer device, characterized in that when executed by a processor, it executes the steps of the method for detecting a key point according to any one of claims 1 to 9.

A computer-readable storage medium having a computer program stored therein, comprising:
A computer-readable storage medium, characterized in that when the computer program is executed by a processor, the steps of the method for detecting a key point according to any one of claims 1 to 9 are executed.

A computer program product comprising computer readable code, comprising:
10. A computer program product, characterized in that when the computer readable code is run in an electronic device, a processor in the electronic device implements the step of the key point detection method according to any one of claims 1 to 9.