KR102382521B1

KR102382521B1 - User device, call relay server and method for enabling video call with avatar

Info

Publication number: KR102382521B1
Application number: KR1020210170106A
Authority: KR
Inventors: 김기환; 공상표; 오동률; 장우석; 김현중; 류도현; 임주영; 정수회
Original assignee: 주식회사 케이티
Priority date: 2019-03-13
Filing date: 2021-12-01
Publication date: 2022-04-04
Also published as: KR20200109634A; KR20210149674A

Abstract

아바타를 이용하여 영상 통화를 수행하는 사용자 단말은 상기 사용자 단말과 적어도 하나의 타사용자 단말 간에 영상 통화를 요청하는 영상 통화 요청부, 상기 사용자 단말과 상기 타사용자 단말 간의 영상 통화가 연결된 경우, 상기 사용자 단말과 연동하는 카메라를 이용하여 사용자 실사 영상을 촬영하는 촬영부, 상기 촬영된 사용자 실사 영상을 통화 중계 서버로 전송하고, 상기 통화 중계 서버로부터 상기 타사용자 단말에서 촬영된 타사용자 실사 영상을 수신하는 통신부, 상기 영상 통화에 대한 일반 통화 모드를 통해 상기 사용자 단말의 화면의 제 1 영역에 상기 사용자 실사 영상을 표시하고, 상기 화면의 제 2 영역에 상기 타사용자 실사 영상을 표시하는 표시부 및 상기 영상 통화에 대한 아바타 통화 모드를 요청받는 경우, 상기 일반 통화 모드를 상기 아바타 통화 모드로 변경하는 모드 변경부를 포함하되, 상기 표시부는 상기 아바타 통화 모드를 요청 받은 경우, 상기 화면의 제 1 영역에 표시된 상기 사용자 실사 영상을 기설정된 사용자 아바타로 대체하여 표시하되, 상기 사용자 단말의 사용자에 대한 얼굴 특징 및 기학습된 평균 얼굴 모델에 기초하여 사용자 얼굴 모델을 생성하는 아바타 생성부를 더 포함하고, 상기 아바타 생성부는 기모델링된 뷰티 모델에 포함된 복수의 버텍스 좌표로부터 상기 평균 얼굴 모델에 포함된 복수의 버텍스 좌표가 차감된 오프셋에 뷰티 가중치가 적용된 뷰티 오프셋 및 상기 사용자 얼굴 모델과의 합성을 통해 사용자 아바타를 생성한다.A user terminal performing a video call using an avatar includes a video call requesting unit for requesting a video call between the user terminal and at least one other user terminal, and when a video call between the user terminal and the other user terminal is connected, the user A photographing unit that shoots a user's actual image using a camera interlocking with the terminal, transmits the photographed user's actual image to a call relay server, and receives the other user's actual image taken in the other user terminal from the call relay server A communication unit, a display unit for displaying the actual image of the user on the first area of the screen of the user terminal through the general call mode for the video call, and the other user's actual image on the second area of the screen, and the video call and a mode change unit configured to change the normal call mode to the avatar call mode when receiving a request for an avatar call mode for The live-action image is replaced with a preset user avatar and displayed, and further comprising an avatar generator generating a user face model based on facial features of the user of the user terminal and a pre-learned average face model, wherein the avatar generator includes a A user avatar is generated by synthesizing a beauty offset in which a beauty weight is applied to an offset obtained by subtracting a plurality of vertex coordinates included in the average face model from a plurality of vertex coordinates included in the modeled beauty model and the user face model.

Description

A user terminal, call relay server and method for performing a video call using an avatar {USER DEVICE, CALL RELAY SERVER AND METHOD FOR ENABLING VIDEO CALL WITH AVATAR}

본 발명은 아바타를 이용하여 영상 통화를 수행하는 사용자 단말, 통화 중계 서버 및 방법에 관한 것이다. The present invention relates to a user terminal, a call relay server, and a method for performing a video call using an avatar.

스마트폰(Smart Phone)이란 컴퓨터로 할 수 있는 작업 중 일부를 휴대폰에서도 할 수 있도록 개발된 휴대 기기이다. 사람들은 스마트폰을 항상 들고 다니면서 인터넷을 검색하거나 메일을 송수신하고, 동영상 또는 사진을 촬영 및 편집할 수도 있다. 또한, 스마트폰 이용자들은 서로 간에 동영상, 음악, 사진 등의 콘텐츠를 주고받을 수도 있다.A smart phone is a mobile device developed so that some of the tasks that can be done with a computer can also be performed on a mobile phone. People carry their smartphones with them all the time to browse the Internet, send and receive mail, and shoot and edit videos or photos. In addition, smartphone users may exchange content such as videos, music, and photos with each other.

최근에는 스마트폰에 장착된 카메라를 이용하여 영상 통화가 가능해졌으며, 이러한 스마트폰에서 영상 통화를 수행하는 기술과 관련하여, 선행기술인 한국공개특허 제 2017-0110469호는 영상 통화를 제공하기 위한 방법 및 이를 위한 전자 장치를 개시하고 있다. Recently, video calls have been made possible using a camera mounted on a smartphone, and with respect to a technology for performing a video call in such a smart phone, Korea Patent Application Laid-Open No. 2017-0110469, a prior art, discloses a method for providing a video call and An electronic device for this purpose is disclosed.

그러나 일부 사용자들의 경우, 사용자의 현재 상태에 따라 불시에 걸려온 영상 통화를 받기 꺼려하기도 한다. 또한, 영상 통화를 이용하는 사용자들은 상대방에게 자신의 개성이 보여지길 원하거나, 상대방에게 좀더 예쁜 모습으로 비춰지길 원한다. However, some users are reluctant to answer an unexpected video call depending on the user's current status. In addition, users who use video calls want to show their individuality to the other party or to be seen as a more beautiful appearance to the other party.

사용자 얼굴 특징에 기반한 사용자 아바타를 생성하여, 영상 통화 중 촬영되는 사용자의 모습을 사용자 아바타로 변환하여 상대방과 영상 통화를 수행하도록 하는 사용자 단말, 통화 중계 서버 및 방법을 제공하고자 한다. An object of the present invention is to provide a user terminal, a call relay server, and a method for generating a user avatar based on a user's facial features, converting a user's image taken during a video call into a user avatar, and performing a video call with a counterpart.

영상 통화 중 촬영되는 사용자의 얼굴 표정에 따라 사용자 아바타의 표정이 변화되도록 하여, 사용자 아바타를 통해 상대방에게 사용자의 현재 얼굴 표정 및 감정이 전달되도록 하는 사용자 단말, 통화 중계 서버 및 방법을 제공하고자 한다. An object of the present invention is to provide a user terminal, a call relay server, and a method that allow the user's avatar's expression to change according to the user's facial expression captured during a video call, so that the user's current facial expression and emotions are transmitted to the other party through the user avatar.

다만, 본 실시예가 이루고자 하는 기술적 과제는 상기된 바와 같은 기술적 과제들로 한정되지 않으며, 또 다른 기술적 과제들이 존재할 수 있다. However, the technical problems to be achieved by the present embodiment are not limited to the technical problems described above, and other technical problems may exist.

상술한 기술적 과제를 달성하기 위한 수단으로서, 본 발명의 일 실시예는, 사용자 단말과 적어도 하나의 타사용자 단말 간에 영상 통화를 요청하는 영상 통화 요청부, 상기 사용자 단말과 상기 타사용자 단말 간의 영상 통화가 연결된 경우, 상기 사용자 단말과 연동하는 카메라를 이용하여 사용자 실사 영상을 촬영하는 촬영부, 상기 촬영된 사용자 실사 영상을 통화 중계 서버로 전송하고, 상기 통화 중계 서버로부터 상기 타사용자 단말에서 촬영된 타사용자 실사 영상을 수신하는 통신부, 상기 영상 통화에 대한 일반 통화 모드를 통해 상기 사용자 단말의 화면의 제 1 영역에 상기 사용자 실사 영상을 표시하고, 상기 화면의 제 2 영역에 상기 타사용자 실사 영상을 표시하는 표시부 및 상기 영상 통화에 대한 아바타 통화 모드를 요청받는 경우, 상기 일반 통화 모드를 상기 아바타 통화 모드로 변경하는 모드 변경부를 포함하되, 상기 표시부는 상기 아바타 통화 모드를 요청 받은 경우, 상기 화면의 제 1 영역에 표시된 상기 사용자 실사 영상을 기설정된 사용자 아바타로 대체하여 표시하되, 상기 사용자 단말의 사용자에 대한 얼굴 특징 및 기학습된 평균 얼굴 모델에 기초하여 사용자 얼굴 모델을 생성하는 아바타 생성부를 더 포함하고, 상기 아바타 생성부는 기모델링된 뷰티 모델에 포함된 복수의 버텍스 좌표로부터 상기 평균 얼굴 모델에 포함된 복수의 버텍스 좌표가 차감된 오프셋에 뷰티 가중치가 적용된 뷰티 오프셋 및 상기 사용자 얼굴 모델과의 합성을 통해 사용자 아바타를 생성하는 것인 사용자 단말을 제공할 수 있다. As a means for achieving the above-described technical problem, an embodiment of the present invention provides a video call request unit for requesting a video call between a user terminal and at least one other user terminal, and a video call between the user terminal and the other user terminal is connected, a photographing unit that shoots a user's actual image using a camera interlocked with the user terminal, transmits the photographed user's actual image to a call relay server, and the other photographed in the other user terminal from the call relay server A communication unit for receiving the user's live-action image, displays the user's live-action image in the first area of the screen of the user terminal through the general call mode for the video call, and displays the other user's live-action image in the second area of the screen and a mode change unit for changing the normal call mode to the avatar call mode when a request for an avatar call mode for the video call is received, wherein the display unit displays the second screen of the screen when the avatar call mode is requested. The user's live-action image displayed in area 1 is replaced with a preset user avatar and displayed, and further comprising an avatar generator for generating a user face model based on facial features of the user of the user terminal and a pre-learned average face model, , the avatar generator is a beauty offset in which a beauty weight is applied to an offset obtained by subtracting a plurality of vertex coordinates included in the average face model from a plurality of vertex coordinates included in the pre-modeled beauty model, and the user's face model through synthesis It is possible to provide a user terminal that generates a user avatar.

본 발명의 다른 실시예는, 사용자 단말로부터 적어도 하나의 타사용자 단말로의 영상 통화를 요청받는 요청부, 상기 타사용자 단말에서 상기 영상 통화를 수락한 경우, 상기 사용자 단말로부터 사용자 영상 데이터를 수신하여 상기 타사용자 단말로 전송하고, 상기 타사용자 단말로부터 타사용자 영상 데이터를 수신하여 상기 사용자 단말로 전송하는 통신부를 포함하되, 상기 영상 통화가 일반 통화 모드인 경우, 상기 사용자 영상 데이터는 상기 사용자 단말의 사용자 실사 영상을 포함하고, 상기 타사용자 영상 데이터는 상기 타사용자 단말의 타사용자 실사 영상을 포함하고, 상기 사용자 단말에서 상기 영상 통화가 상기 일반 통화 모드로부터 아바타 통화 모드로 변경된 경우, 상기 사용자 영상 데이터는 상기 사용자 실사 영상으로부터 대체된 사용자 아바타에 대한 영상을 포함하되, 상기 사용자 단말에 의해 상기 사용자 단말의 사용자에 대한 얼굴 특징 및 기학습된 평균 얼굴 모델에 기초하여 사용자 얼굴 모델이 생성되고, 상기 사용자 아바타는 기모델링 뷰티 모델에 포함된 복수의 버텍스 좌표로부터 상기 평균 얼굴 모델에 포함된 복수의 버텍스 좌표가 차감된 오프셋에 뷰티 가중치가 적용된 뷰티 오프셋 및 상기 사용자 얼굴 모델과의 합성을 통해 생성되는 것인 통화 중계 서버를 제공할 수 있다. Another embodiment of the present invention provides a requesting unit for receiving a request for a video call from a user terminal to at least one other user terminal, and when the video call is accepted by the other user terminal, by receiving user video data from the user terminal and a communication unit for transmitting to the other user terminal, receiving the other user's image data from the other user's terminal and transmitting it to the user terminal, wherein when the video call is in a normal call mode, the user image data is transmitted from the user terminal's When the video call is changed from the normal call mode to the avatar call mode in the user terminal, the user image data includes a live-action user image, and the other user image data includes a live-action image of another user of the other user terminal. includes an image of a user avatar replaced from the user's actual image, wherein a user face model is generated by the user terminal based on facial features of the user of the user terminal and a pre-learned average face model, the user The avatar is generated by synthesizing a beauty offset in which a beauty weight is applied to an offset obtained by subtracting a plurality of vertex coordinates included in the average face model from a plurality of vertex coordinates included in the pre-modeled beauty model and the user face model. A call relay server may be provided.

본 발명의 또 다른 실시예는, 사용자 단말로부터 적어도 하나의 타사용자 단말로의 영상 통화를 요청받는 단계, 상기 타사용자 단말에서 상기 영상 통화를 수락한 경우, 상기 사용자 단말로부터 사용자 영상 데이터를 수신하여 상기 타사용자 단말로 전송하는 단계, 상기 타사용자 단말로부터 타사용자 영상 데이터를 수신하여 상기 사용자 단말로 전송하는 단계를 포함하되, 상기 영상 통화가 일반 통화 모드인 경우, 상기 사용자 영상 데이터는 상기 사용자 단말의 사용자 실사 영상을 포함하고, 상기 타사용자 영상 데이터는 상기 타사용자 단말의 타사용자 실사 영상을 포함하고, 상기 사용자 단말에서 상기 영상 통화가 상기 일반 통화 모드로부터 아바타 통화 모드로 변경된 경우, 상기 사용자 영상 데이터는 상기 사용자 실사 영상으로부터 대체되어 표시된 사용자 아바타에 대한 영상을 포함하되, 상기 사용자 단말에 의해 상기 사용자 단말의 사용자에 대한 얼굴 특징 및 기학습된 평균 얼굴 모델에 기초하여 사용자 얼굴 모델이 생성되고, 상기 사용자 아바타는 기모델링 뷰티 모델에 포함된 복수의 버텍스 좌표로부터 상기 평균 얼굴 모델에 포함된 복수의 버텍스 좌표가 차감된 오프셋에 뷰티 가중치가 적용된 뷰티 오프셋 및 상기 사용자 얼굴 모델과의 합성을 통해 생성되는 것인 통화 중계 방법을 제공할 수 있다. Another embodiment of the present invention includes the steps of receiving a request for a video call from a user terminal to at least one other user terminal, and when the video call is accepted by the other user terminal, receiving user video data from the user terminal transmitting to the other user terminal; receiving the other user's image data from the other user's terminal and transmitting the other user's image data to the user terminal; of a user's live-action image, wherein the other user's image data includes another user's live-action image of the other user terminal, and when the video call is changed from the general call mode to the avatar call mode in the user terminal, the user image The data includes an image for a user avatar displayed by being replaced from the user's actual image, wherein a user face model is generated by the user terminal based on facial features of the user of the user terminal and a pre-learned average face model, The user avatar is generated by synthesizing the user face model and a beauty offset in which a beauty weight is applied to an offset obtained by subtracting a plurality of vertex coordinates included in the average face model from a plurality of vertex coordinates included in the pre-modeled beauty model. It is possible to provide a call relay method that is.

상술한 과제 해결 수단은 단지 예시적인 것으로서, 본 발명을 제한하려는 의도로 해석되지 않아야 한다. 상술한 예시적인 실시예 외에도, 도면 및 발명의 상세한 설명에 기재된 추가적인 실시예가 존재할 수 있다.The above-described problem solving means are merely exemplary, and should not be construed as limiting the present invention. In addition to the exemplary embodiments described above, there may be additional embodiments described in the drawings and detailed description.

전술한 본 발명의 과제 해결 수단 중 어느 하나에 의하면, 사용자 얼굴 특징에 기반한 사용자 아바타를 생성하여, 영상 통화 중 촬영되는 사용자의 모습을 사용자 아바타로 변환하여 상대방과 영상 통화를 수행하도록 하는 사용자 단말, 통화 중계 서버 및 방법을 제공할 수 있다. According to any one of the above-described problem solving means of the present invention, a user terminal that generates a user avatar based on the user's facial features, converts the image of the user photographed during a video call into a user avatar, and performs a video call with the other party; A call relay server and method may be provided.

영상 통화 중 촬영되는 사용자의 얼굴 표정에 따라 사용자 아바타의 표정이 변화되도록 하여, 사용자 아바타를 통해 상대방에게 사용자의 현재 얼굴 표정 및 감정이 전달되도록 하는 사용자 단말, 통화 중계 서버 및 방법을 제공할 수 있다. It is possible to provide a user terminal, a call relay server, and a method that allow the user's avatar's expression to change according to the user's facial expression captured during a video call, so that the user's current facial expression and emotions are transmitted to the other party through the user avatar. .

도 1은 본 발명의 일 실시예에 따른 영상 통화 시스템의 구성도이다.
도 2는 본 발명의 일 실시예에 따른 사용자 단말의 구성도이다.
도 3a 내지 도 3f는 본 발명의 일 실시예에 따른 사용자 단말에서 사용자 아바타를 생성하는 과정을 설명하기 위한 예시적인 도면이다.
도 4a 내지 도 4c는 본 발명의 일 실시예에 따른 사용자로부터 사용자 아바타의 얼굴에 포함된 부분 중 교체를 요청받은 부분을 교체 요소로 변경하는 과정을 설명하기 위한 예시적인 도면이다.
도 5a 내지 도 5e는 본 발명의 일 실시예에 따른 사용자 단말에서 생성된 아바타의 일부분을 수정하는 과정을 설명하기 위한 예시적인 도면이다.
도 6a 내지 도 6c는 본 발명의 일 실시예에 따른 사용자 단말에서 복수의 사용자 아바타 중 어느 하나를 대표 사용자 아바타로 선택하는 과정을 설명하기 위한 예시적인 도면이다.
도 7a 및 도 7b는 본 발명의 일 실시예에 따른 사용자 단말에서 타사용자 단말로의 영상 통화를 요청하는 과정을 설명하기 위한 예시적인 도면이다.
도 8a 내지 도 8d는 본 발명의 일 실시예에 따른 사용자 단말에서 타사용자 단말과 영상 통화를 수행하는 과정을 설명하기 위한 예시적인 도면이다.
도 9는 본 발명의 일 실시예에 따른 사용자 단말에서 아바타를 이용하여 영상 통화를 수행하는 방법의 순서도이다.
도 10은 본 발명의 일 실시예에 따른 통화 중계 서버의 구성도이다.
도 11은 본 발명의 일 실시예에 따른 통화 중계 서버를 통해 아바타를 이용하여 영상 통화를 수행하는 방법의 순서도이다. 1 is a block diagram of a video call system according to an embodiment of the present invention.
2 is a block diagram of a user terminal according to an embodiment of the present invention.
3A to 3F are exemplary views for explaining a process of generating a user avatar in a user terminal according to an embodiment of the present invention.
4A to 4C are exemplary views for explaining a process of changing a replacement requested part among parts included in a user's avatar's face by a user into a replacement element according to an embodiment of the present invention.
5A to 5E are exemplary diagrams for explaining a process of modifying a part of an avatar created in a user terminal according to an embodiment of the present invention.
6A to 6C are exemplary diagrams for explaining a process of selecting one of a plurality of user avatars as a representative user avatar in a user terminal according to an embodiment of the present invention.
7A and 7B are exemplary diagrams for explaining a process of requesting a video call from a user terminal to another user terminal according to an embodiment of the present invention.
8A to 8D are exemplary diagrams for explaining a process of performing a video call with another user terminal in a user terminal according to an embodiment of the present invention.
9 is a flowchart of a method of performing a video call using an avatar in a user terminal according to an embodiment of the present invention.
10 is a block diagram of a call relay server according to an embodiment of the present invention.
11 is a flowchart of a method of performing a video call using an avatar through a call relay server according to an embodiment of the present invention.

아래에서는 첨부한 도면을 참조하여 본 발명이 속하는 기술 분야에서 통상의 지식을 가진 자가 용이하게 실시할 수 있도록 본 발명의 실시예를 상세히 설명한다. 그러나 본 발명은 여러 가지 상이한 형태로 구현될 수 있으며 여기에서 설명하는 실시예에 한정되지 않는다. 그리고 도면에서 본 발명을 명확하게 설명하기 위해서 설명과 관계없는 부분은 생략하였으며, 명세서 전체를 통하여 유사한 부분에 대해서는 유사한 도면 부호를 붙였다. DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS Hereinafter, embodiments of the present invention will be described in detail with reference to the accompanying drawings so that those of ordinary skill in the art can easily implement them. However, the present invention may be embodied in several different forms and is not limited to the embodiments described herein. And in order to clearly explain the present invention in the drawings, parts irrelevant to the description are omitted, and similar reference numerals are attached to similar parts throughout the specification.

명세서 전체에서, 어떤 부분이 다른 부분과 "연결"되어 있다고 할 때, 이는 "직접적으로 연결"되어 있는 경우뿐 아니라, 그 중간에 다른 소자를 사이에 두고 "전기적으로 연결"되어 있는 경우도 포함한다. 또한 어떤 부분이 어떤 구성요소를 "포함"한다고 할 때, 이는 특별히 반대되는 기재가 없는 한 다른 구성요소를 제외하는 것이 아니라 다른 구성요소를 더 포함할 수 있는 것을 의미하며, 하나 또는 그 이상의 다른 특징이나 숫자, 단계, 동작, 구성요소, 부분품 또는 이들을 조합한 것들의 존재 또는 부가 가능성을 미리 배제하지 않는 것으로 이해되어야 한다. Throughout the specification, when a part is "connected" with another part, this includes not only the case of being "directly connected" but also the case of being "electrically connected" with another element interposed therebetween. . In addition, when a part "includes" a certain component, it means that other components may be further included, rather than excluding other components, unless otherwise stated, and one or more other features However, it is to be understood that the existence or addition of numbers, steps, operations, components, parts, or combinations thereof is not precluded in advance.

본 명세서에 있어서 '부(部)'란, 하드웨어에 의해 실현되는 유닛(unit), 소프트웨어에 의해 실현되는 유닛, 양방을 이용하여 실현되는 유닛을 포함한다. 또한, 1 개의 유닛이 2 개 이상의 하드웨어를 이용하여 실현되어도 되고, 2 개 이상의 유닛이 1 개의 하드웨어에 의해 실현되어도 된다.In this specification, a "part" includes a unit realized by hardware, a unit realized by software, and a unit realized using both. In addition, one unit may be implemented using two or more hardware, and two or more units may be implemented by one hardware.

본 명세서에 있어서 단말 또는 디바이스가 수행하는 것으로 기술된 동작이나 기능 중 일부는 해당 단말 또는 디바이스와 연결된 서버에서 대신 수행될 수도 있다. 이와 마찬가지로, 서버가 수행하는 것으로 기술된 동작이나 기능 중 일부도 해당 서버와 연결된 단말 또는 디바이스에서 수행될 수도 있다.Some of the operations or functions described as being performed by the terminal or device in this specification may be instead performed by a server connected to the terminal or device. Similarly, some of the operations or functions described as being performed by the server may also be performed in a terminal or device connected to the server.

이하 첨부된 도면을 참고하여 본 발명의 일 실시예를 상세히 설명하기로 한다. Hereinafter, an embodiment of the present invention will be described in detail with reference to the accompanying drawings.

도 1은 본 발명의 일 실시예에 따른 영상 통화 시스템의 구성도이다. 도 1을 참조하면, 영상 통화 시스템(1)은 사용자 단말(110), 타사용자 단말(120) 및 통화 중계 서버(130)를 포함할 수 있다. 사용자 단말(110), 타사용자 단말(120) 및 통화 중계 서버(130)는 영상 통화 시스템(1)에 의하여 제어될 수 있는 구성요소들을 예시적으로 도시한 것이다.1 is a block diagram of a video call system according to an embodiment of the present invention. Referring to FIG. 1 , a video call system 1 may include a user terminal 110 , another user terminal 120 , and a call relay server 130 . The user terminal 110 , the other user terminal 120 , and the call relay server 130 exemplarily illustrate components that can be controlled by the video call system 1 .

도 1의 영상 통화 시스템(1)의 각 구성요소들은 일반적으로 네트워크(network)를 통해 연결된다. 예를 들어, 도 1에 도시된 바와 같이, 사용자 단말(110)은 통화 중계 서버(130)와 동시에 또는 시간 간격을 두고 연결될 수 있다. Each component of the video telephony system 1 of FIG. 1 is generally connected through a network. For example, as shown in FIG. 1 , the user terminal 110 may be connected to the call relay server 130 at the same time or at a time interval.

네트워크는 단말들 및 서버들과 같은 각각의 노드 상호 간에 정보 교환이 가능한 연결 구조를 의미하는 것으로, 근거리 통신망(LAN: Local Area Network), 광역 통신망(WAN: Wide Area Network), 인터넷 (WWW: World Wide Web), 유무선 데이터 통신망, 전화망, 유무선 텔레비전 통신망 등을 포함한다. 무선 데이터 통신망의 일례에는 3G, 4G, 5G, 3GPP(3rd Generation Partnership Project), LTE(Long Term Evolution), WIMAX(World Interoperability for Microwave Access), 와이파이(Wi-Fi), 블루투스 통신, 적외선 통신, 초음파 통신, 가시광 통신(VLC: Visible Light Communication), 라이파이(LiFi) 등이 포함되나 이에 한정되지는 않는다. A network refers to a connection structure in which information can be exchanged between each node, such as terminals and servers, and includes a local area network (LAN), a wide area network (WAN), and the Internet (WWW: World). Wide Web), wired and wireless data communication networks, telephone networks, wired and wireless television networks, and the like. Examples of wireless data communication networks include 3G, 4G, 5G, 3rd Generation Partnership Project (3GPP), Long Term Evolution (LTE), World Interoperability for Microwave Access (WIMAX), Wi-Fi, Bluetooth communication, infrared communication, ultrasound Communication, Visible Light Communication (VLC), LiFi, and the like are included, but are not limited thereto.

사용자 단말(110)은 사용자 단말(110)의 사용자에 대한 얼굴 특징에 기초하여 사용자 아바타를 생성할 수 있다. 예를 들어, 사용자 단말(110)은 사용자 단말(110)의 사용자에 대한 촬영 이미지를 입력받고, 입력된 촬영 이미지의 크기 또는 해상도 중 적어도 하나를 조절하고, 조절된 촬영 이미지로부터 사용자의 특징점을 검출하고, 검출된 특징점 및 복수의 버텍스를 포함하는 기학습된 평균 얼굴 모델에 기초하여 사용자 얼굴 모델을 생성할 수 있다. 이 때, 사용자 단말(110)은 사용자 얼굴 모델 과 뷰티 오프셋을 이용하여 사용자 아바타를 생성할 수 있다. 예를 들어, 뷰티 오프셋은 기모델링된 뷰티 모델에 포함된 복수의 버텍스 좌표로부터 평균 얼굴 모델에 포함된 복수의 버텍스 좌표가 차감된 오프셋에 뷰티 가중치가 적용된 것일 수 있다. The user terminal 110 may generate a user avatar based on facial features of the user of the user terminal 110 . For example, the user terminal 110 receives a photographed image of the user of the user terminal 110 , adjusts at least one of a size or a resolution of the input photographed image, and detects a feature point of the user from the adjusted photographed image and a user face model may be generated based on the pre-learned average face model including the detected feature points and a plurality of vertices. In this case, the user terminal 110 may generate a user avatar using the user face model and the beauty offset. For example, the beauty offset may be a beauty weight applied to an offset obtained by subtracting a plurality of vertex coordinates included in the average face model from a plurality of vertex coordinates included in a pre-modeled beauty model.

사용자 단말(110)은 생성된 사용자 아바타의 얼굴에 포함된 적어도 하나의 부분에 대한 교체를 요청받고, 교체될 적어도 하나의 부분에 대한 교체 요소를 선택받을 수 있다. 예를 들어, 사용자 단말(110)은 교체를 요청받은 적어도 하나의 부분에 해당하는 버텍스 좌표를 선택된 교체 요소에 해당하는 버텍스 좌표로 변경할 수 있다. The user terminal 110 may be requested to replace at least one part included in the face of the generated user avatar, and may receive a replacement element for at least one part to be replaced. For example, the user terminal 110 may change vertex coordinates corresponding to at least one part requested to be replaced into vertex coordinates corresponding to the selected replacement element.

사용자 단말(110)은 생성된 사용자 아바타가 복수개인 경우, 복수개의 사용자 아바타 중 어느 하나를 대표 사용자 아바타로 설정받을 수 있다. When there are a plurality of generated user avatars, the user terminal 110 may receive any one of the plurality of user avatars set as the representative user avatar.

사용자 단말(110)은 사용자 단말(110)과 적어도 하나의 타사용자 단말(120) 간에 영상 통화를 요청할 수 있다. 예를 들어, 사용자 단말(110)은 사용자 단말(110)에 등록된 친구 리스트 중 적어도 하나의 친구를 선택받고, 선택된 친구에 대응하는 타사용자 단말(120)과의 영상 통화를 통화 중계 서버(130)로 요청할 수 있다. The user terminal 110 may request a video call between the user terminal 110 and at least one other user terminal 120 . For example, the user terminal 110 receives a selection of at least one friend from the list of friends registered in the user terminal 110 , and conducts a video call with another user terminal 120 corresponding to the selected friend to the call relay server 130 . ) can be requested.

사용자 단말(110)은 사용자 단말(110)과 타사용자 단말(120) 간의 영상 통화가 연결된 경우, 사용자 단말(110)과 연동하는 카메라를 이용하여 사용자 실사 영상을 촬영할 수 있다. When a video call is connected between the user terminal 110 and the other user terminal 120 , the user terminal 110 may use a camera interworking with the user terminal 110 to take a user's actual image.

사용자 단말(110)은 촬영된 사용자 실사 영상을 통화 중계 서버(130)로 전송하고, 통화 중계 서버(130)로부터 타사용자 단말(120)에서 촬영된 타사용자 실사 영상을 수신할 수 있다. 예를 들어, 사용자 단말(110)은 일반 통화 모드가 수행되는 경우, 촬영된 사용자 실사 영상을 인코딩하여 통화 중계 서버(130)로 전송할 수 있다. 다른 예를 들어, 사용자 단말(110)은 아바타 통화 모드가 수행되는 경우, 사용자 아바타에 대한 영상을 인코딩하여 통화 중계 서버(130)로 전송할 수 있다. The user terminal 110 may transmit the captured user's actual image to the call relay server 130 , and may receive the other user's actual image captured by the other user terminal 120 from the call relay server 130 . For example, when the general call mode is performed, the user terminal 110 may encode the captured user's live-action image and transmit it to the call relay server 130 . As another example, when the avatar call mode is performed, the user terminal 110 may encode an image of the user avatar and transmit it to the call relay server 130 .

사용자 단말(110)은 영상 통화에 대한 일반 통화 모드를 통해 사용자 단말(110)의 화면의 제 1 영역에 사용자 실사 영상을 표시하고, 화면의 제 2 영역에 타사용자 실사 영상을 표시할 수 있다. The user terminal 110 may display the user's live-action image in the first area of the screen of the user terminal 110 through the general call mode for the video call, and may display the other user's live-action image in the second area of the screen.

일 실시예에 따르면, 사용자 단말(110)은 사용자 실사 영상 기반의 일반 통화 모드로부터 사용자 아바타 기반의 아바타 통화 모드로 변경하여 영상 통화를 수행할 수 있다. 이 때, 사용자 단말(110)은 영상 통화에 대한 아바타 통화 모드를 요청받는 경우, 일반 통화 모드를 아바타 통화 모드로 변경할 수 있다. 예를 들어, 사용자 단말(110)은 화면의 제 1 영역에 표시된 사용자 실사 영상을 기설정된 사용자 아바타로 대체하여 표시할 수 있다. 이 때, 사용자 단말(110)은 촬영된 사용자 실사 영상으로부터 사용자 단말(110)의 사용자의 얼굴 표정을 트래킹하고, 트래킹된 얼굴 표정을 따라 기설정된 사용자 아바타의 표정을 변경할 수 있다. 여기서, 사용자 단말(110)은 아바타 통화 모드가 수행되는 경우, 대표 사용자 아바타로 설정된 사용자 아바타를 표시할 수 있다. According to an embodiment, the user terminal 110 may perform a video call by changing from the normal call mode based on the actual user image to the avatar call mode based on the user avatar. In this case, when the user terminal 110 receives a request for the avatar call mode for the video call, the user terminal 110 may change the normal call mode to the avatar call mode. For example, the user terminal 110 may display the user's live-action image displayed on the first area of the screen by replacing it with a preset user avatar. In this case, the user terminal 110 may track the facial expression of the user of the user terminal 110 from the captured user's actual image, and change the facial expression of a preset user avatar according to the tracked facial expression. Here, when the avatar call mode is performed, the user terminal 110 may display a user avatar set as a representative user avatar.

다른 실시예에 따르면, 사용자 단말(110)은 사용자 실사 영상 기반의 일반 통화 모드에서 뷰티 효과 모드를 적용하여 영상 통화를 수행할 수 있다. 예를 들어, 사용자 단말(110)은 영상 통화 중 뷰티 효과 모드를 요청받은 경우, 화면의 제 1 영역에 표시된 사용자 실사 영상에 뷰티 효과를 적용하여 표시할 수 있다. According to another embodiment, the user terminal 110 may perform a video call by applying the beauty effect mode in the normal call mode based on the actual user image. For example, when the user terminal 110 receives a request for a beauty effect mode during a video call, the user terminal 110 may apply and display the beauty effect to the user's actual image displayed in the first area of the screen.

타사용자 단말(120)은 통화 중계 서버(130)를 통해 사용자 단말(110)로부터 영상 통화를 수신할 수 있다. 여기서, 타사용자 단말(120)은 사용자 단말(110)과 동일한 과정으로 일반 통화 모드, 아바타 통화 모드, 뷰티 효과 모드를 통해 영상 통화를 수행할 수 있다. The other user terminal 120 may receive a video call from the user terminal 110 through the call relay server 130 . Here, the other user terminal 120 may perform a video call through the general call mode, the avatar call mode, and the beauty effect mode in the same process as the user terminal 110 .

통화 중계 서버(130)는 사용자 단말(110)로부터 적어도 하나의 타사용자 단말(120)로의 영상 통화를 요청받을 수 있다. 예를 들어, 통화 중계 서버(130)는 사용자 단말(110)로부터 사용자 단말(110)의 애플리케이션에 등록된 친구 리스트 중 적어도 하나의 친구를 선택받고, 선택된 친구에 대응하는 타사용자 단말(120)과의 영상 통화를 요청받을 수 있다.The call relay server 130 may receive a request for a video call from the user terminal 110 to at least one other user terminal 120 . For example, the call relay server 130 receives the selection of at least one friend from the list of friends registered in the application of the user terminal 110 from the user terminal 110, and the other user terminal 120 corresponding to the selected friend. You can request a video call from

통화 중계 서버(130)는 타사용자 단말(120)에서 영상 통화를 수락한 경우, 사용자 단말(110)로부터 사용자 영상 데이터를 수신하여 타사용자 단말(120)로 전송할 수 있다. 또한, 통화 중계 서버(130)는 타사용자 단말(120)로부터 타사용자 영상 데이터를 수신하여 사용자 단말(110)로 전송할 수 있다. When the video call is accepted by the other user terminal 120 , the call relay server 130 may receive user image data from the user terminal 110 and transmit it to the other user terminal 120 . Also, the call relay server 130 may receive the other user's image data from the other user terminal 120 and transmit it to the user terminal 110 .

통화 중계 서버(130)는 사용자 단말(110) 또는 타사용자 단말(120)에서 수행되는 통화 모드에 기초하여 영상 데이터를 수신할 수 있다. 예를 들어, 통화 중계 서버(130)는 사용자 단말(110)에서 일반 통화 모드가 수행되는 경우, 사용자 단말(110)로부터 사용자 실사 영상이 인코딩된 사용자 영상 데이터를 수신하고, 타사용자 단말(120)에서 일반 통화 모드가 수행되는 경우, 타사용자 단말(120)로부터 타사용자 실사 영상이 인코딩된 타사용자 영상 데이터를 수신할 수 있다. 다른 예를 들어, 통화 중계 서버(130)는 사용자 단말(110)에서 아바타 통화 모드가 수행되는 경우, 사용자 단말(110)로부터 사용자 아바타에 대한 영상이 인코딩된 사용자 영상 데이터를 수신하고, 타사용자 단말(120)에서 아바타 통화 모드가 수행되는 경우, 타사용자 단말(120)로부터 타사용자 아바타에 대한 영상이 인코딩된 타사용자 영상 데이터를 수신할 수 있다. The call relay server 130 may receive image data based on a call mode performed by the user terminal 110 or the other user terminal 120 . For example, when the general call mode is performed in the user terminal 110 , the call relay server 130 receives the user image data encoded with the actual user image from the user terminal 110 , and the other user terminal 120 . When the normal call mode is performed in , it is possible to receive the other user's image data encoded with the actual image of the other user from the other user terminal 120 . For another example, when the avatar call mode is performed in the user terminal 110 , the call relay server 130 receives user image data in which an image of the user avatar is encoded from the user terminal 110 , and another user terminal When the avatar call mode is performed in 120 , image data of another user in which an image of the other user's avatar is encoded may be received from the other user terminal 120 .

도 2는 본 발명의 일 실시예에 따른 사용자 단말의 구성도이다. 도 2를 참조하면, 사용자 단말(110)은 아바타 생성부(210), 영상 통화 요청부(220), 촬영부(230), 통신부(240), 표시부(250) 및 모드 변경부(260)를 포함할 수 있다. 여기서, 사용자 단말(110)에 포함된 구성은 타사용자 단말(120)에 포함된 구성과 동일하며, 타사용자 단말(120) 또한 사용자 단말(110)과 동일한 동작을 통해 영상 통화를 수행할 수 있다. 2 is a block diagram of a user terminal according to an embodiment of the present invention. Referring to FIG. 2 , the user terminal 110 includes an avatar generating unit 210 , a video call requesting unit 220 , a photographing unit 230 , a communication unit 240 , a display unit 250 , and a mode changing unit 260 . may include Here, the configuration included in the user terminal 110 is the same as the configuration included in the other user terminal 120 , and the other user terminal 120 may also perform a video call through the same operation as the user terminal 110 . .

아바타 생성부(210)는 사용자 단말(110)의 사용자에 대한 얼굴 특징에 기초하여 사용자 아바타를 생성할 수 있다. 사용자 아바타를 생성하는 과정에 대해서는 도 3a 내지 도 3f를 통해 상세히 설명하도록 한다. The avatar generator 210 may generate a user avatar based on facial features of the user of the user terminal 110 . The process of generating the user avatar will be described in detail with reference to FIGS. 3A to 3F .

도 3a 내지 도 3f는 본 발명의 일 실시예에 따른 사용자 단말에서 사용자 아바타를 생성하는 과정을 설명하기 위한 예시적인 도면이다. 3A to 3F are exemplary views for explaining a process of generating a user avatar in a user terminal according to an embodiment of the present invention.

도 3a를 참조하면, 사용자는 애플리케이션(300)을 통해 아바타 생성(301) 버튼을 입력받아 사용자 아바타를 생성하기 위한 프로세스로 진입할 수 있다. Referring to FIG. 3A , a user may enter a process for generating a user avatar by receiving an avatar creation 301 button through the application 300 .

도 3b를 참조하면, 아바타 생성부(210)는 사용자 단말(110)의 사용자에 대한 촬영 이미지(302)를 입력받을 수 있다. 예를 들어, 아바타 생성부(210)는 사용자 아바타를 생성하기 위해 사용자의 얼굴을 촬영하고, 촬영된 촬영 이미지(302)를 입력받을 수 있다. 다른 예를 들어, 아바타 생성부(210)는 사용자로부터 사용자 아바타를 생성하기 위해 사용자 단말(110)의 앨범에 저장된 복수의 기촬영된 이미지 중 어느 하나를 촬영 이미지(302)로 선택받아 촬영 이미지(302)로 입력받을 수 있다. Referring to FIG. 3B , the avatar generator 210 may receive a captured image 302 of the user of the user terminal 110 . For example, the avatar generator 210 may photograph a user's face to generate a user avatar, and may receive a photographed photographed image 302 . For another example, the avatar generator 210 selects one of a plurality of pre-photographed images stored in an album of the user terminal 110 as the photographed image 302 to generate a user avatar from the user, and selects the photographed image ( 302) can be entered.

아바타 생성부(210)는 입력된 촬영 이미지의 크기 또는 해상도 중 적어도 하나를 조절할 수 있다. 예를 들어, 아바타 생성부(210)는 사용자로부터 핀치-인(pinch-in) 입력을 통해 촬영 이미지(302)의 축소를 입력받고, 핀치-아웃(pinch-out)을 통해 촬영 이미지(302)의 확대를 입력받아 사용자의 얼굴 크기가 소정의 영역(303, 예를 들어, 원형 또는 사각형 등)의 크기에 대응되도록 할 수 있다. 다른 예를 들어, 아바타 생성부(210)는 사용자로부터 드래그 입력을 통해 촬영 이미지(302)에 포함된 사용자의 얼굴이 소정의 영역(303)의 중심에 위치하도록 입력받은 후, 예를 들어, 1024x1024 사이즈로 촬영 이미지(302)의 크기를 조절할 수 있다. The avatar generator 210 may adjust at least one of a size and a resolution of the input captured image. For example, the avatar generator 210 receives a reduction input of the captured image 302 through a pinch-in input from the user, and receives the captured image 302 through a pinch-out. By receiving the magnification of , the size of the user's face may correspond to the size of a predetermined area (eg, a circle or a rectangle). As another example, the avatar generator 210 receives, through a drag input, from the user so that the user's face included in the photographed image 302 is positioned at the center of the predetermined area 303 , for example, 1024x1024 The size of the photographed image 302 may be adjusted by the size.

도 3c를 참조하면, 아바타 생성부(210)는 사용자로부터 사용자의 성별(304)을 입력받고, 입력된 성별(304)에 기초하여 사용자 아바타가 생성되도록 할 수 있다. Referring to FIG. 3C , the avatar generator 210 may receive a user's gender 304 from the user, and generate a user avatar based on the input gender 304 .

도 3d를 참조하면, 아바타 생성부(210)는 조절된 촬영 이미지로부터 사용자의 특징점(310)을 검출할 수 있다. 여기서, 특징점은 사용자의 눈, 코, 입, 얼굴 윤곽과 같은 형태(shape) 정보일 수 있다. Referring to FIG. 3D , the avatar generator 210 may detect the feature point 310 of the user from the adjusted captured image. Here, the feature point may be shape information such as the user's eyes, nose, mouth, and face outline.

아바타 생성부(210)는 검출된 특징점(310) 및 복수의 버텍스를 포함하는 기학습된 평균 얼굴 모델(311)에 기초하여 사용자 얼굴 모델(312)을 생성할 수 있다. 예를 들어, 아바타 생성부(210)는 검출된 사용자의 특징점(310)에 기학습된 평균 얼굴 모델(311)을 투영시켜 사용자 얼굴 모델(312)을 생성할 수 있다. 여기서, 평균 얼굴 모델(311)은 복수의 얼굴 메쉬를 PCA(Principal Component Analysis) 기반으로 학습시켜 얼굴의 변화도를 나타내는 고유의 벡터값으로 구성된 것일 수 있다. The avatar generator 210 may generate the user face model 312 based on the pre-learned average face model 311 including the detected feature points 310 and a plurality of vertices. For example, the avatar generator 210 may generate the user face model 312 by projecting the previously learned average face model 311 on the detected feature points 310 of the user. Here, the average face model 311 may be formed of a unique vector value representing a degree of change of a face by learning a plurality of face meshes based on Principal Component Analysis (PCA).

도 3e를 참조하면, 아바타 생성부(210)는 사용자 얼굴 모델(312)을 뷰티 오프셋과 합성하여 사용자 아바타(314)를 생성할 수 있다. 여기서, 뷰티 오프셋은 기모델링된 뷰티 모델(313)에 포함된 복수의 버텍스 좌표로부터 평균 얼굴 모델(311)에 포함된 복수의 버텍스 좌표가 차감된 오프셋에 뷰티 가중치가 곱해짐으로써 도출될 수 있다. 뷰티 가중치는 뷰티의 강도를 나타내는 것일 수 있다. 예를 들어, 아바타 생성부(210)는 뷰티 오프셋을 사용자 얼굴 모델(312)에 합성하여 버텍스 좌표 간의 합을 통해 사용자 아바타(314)를 생성할 수 있다. Referring to FIG. 3E , the avatar generator 210 may generate a user avatar 314 by synthesizing a user face model 312 with a beauty offset. Here, the beauty offset may be derived by multiplying an offset obtained by subtracting a plurality of vertex coordinates included in the average face model 311 from a plurality of vertex coordinates included in the pre-modeled beauty model 313 by a beauty weight. The beauty weight may indicate the strength of beauty. For example, the avatar generator 210 may generate the user avatar 314 by combining the beauty offset with the user face model 312 and summing the vertex coordinates.

도 3f를 참조하면, 사용자는 생성이 완료된 사용자 아바타(315)를 확인한 후, 사용자 아바타(315)를 저장(316)하여 관리할 수 있다. Referring to FIG. 3F , after checking the user avatar 315 that has been created, the user can store (316) the user avatar 315 and manage it.

다시 도 2로 돌아와서, 아바타 생성부(210)는 사용자로부터 생성된 사용자 아바타의 수정을 요청받고, 수정 요청에 기초하여 사용자 아바타를 수정할 수 있다. 사용자 아바타를 수정하는 과정에 대해서는 도 4a 내지 도 5e를 통해 상세히 설명하도록 한다. Returning to FIG. 2 , the avatar generator 210 may receive a request to modify the generated user avatar from the user, and may modify the user avatar based on the modification request. The process of modifying the user avatar will be described in detail with reference to FIGS. 4A to 5E .

도 4a 내지 도 4c는 본 발명의 일 실시예에 따른 사용자로부터 사용자 아바타의 얼굴에 포함된 부분 중 교체를 요청받은 부분을 교체 요소로 변경하는 과정을 설명하기 위한 예시적인 도면이다. 4A to 4C are exemplary views for explaining a process of changing a replacement requested part among parts included in a user's avatar's face by a user into a replacement element according to an embodiment of the present invention.

도 4a를 참조하면, 아바타 생성부(210)는 생성된 사용자 아바타(400)의 얼굴에 포함된 적어도 하나의 부분에 대한 교체를 요청받고, 교체될 적어도 하나의 부분에 대한 교체 요소(410)를 선택받을 수 있다. Referring to FIG. 4A , the avatar generator 210 receives a request for replacement of at least one part included in the face of the generated user avatar 400 , and selects a replacement element 410 for at least one part to be replaced. can be chosen

도 4b를 참조하면, 사용자 아바타(400)의 얼굴에서 교체를 요청받은 부분을 교체 요소(410)로 단순 교체한 경우, 단순 교체로 인해 부자연스러움이 발생한다는 단점이 존재한다. Referring to FIG. 4B , when a replacement element 410 is simply replaced with a replacement element 410 for a part of the face of the user avatar 400 , there is a disadvantage in that unnaturalness occurs due to the simple replacement.

도 4c를 참조하면, 아바타 생성부(210)는 단순 교체를 통해 발생되는 부자연스러움을 방지하기 위해 교체를 요청받은 적어도 하나의 부분에 해당하는 버텍스 좌표를 선택된 교체 요소에 해당하는 버텍스 좌표로 변경할 수 있다. 이를 통해, 사용자 아바타(400)의 교체를 요청받은 부분의 모양이 교체 요소(410)의 모양으로 근사되도록 변경하여 부자연스러움을 방지할 수 있다. Referring to FIG. 4C , the avatar generator 210 may change the vertex coordinates corresponding to at least one part requested to be replaced into vertex coordinates corresponding to the selected replacement element in order to prevent unnaturalness caused by simple replacement. there is. Through this, the shape of the part requested to be replaced by the user avatar 400 is changed to approximate the shape of the replacement element 410 , thereby preventing unnaturalness.

도 5a 내지 도 5e는 본 발명의 일 실시예에 따른 사용자 단말에서 생성된 아바타의 일부분을 수정하는 과정을 설명하기 위한 예시적인 도면이다. 5A to 5E are exemplary diagrams for explaining a process of modifying a part of an avatar created in a user terminal according to an embodiment of the present invention.

도 5a를 참조하면, 아바타 생성부(210)는 사용자로부터 애플리케이션(500)의 아바타 수정 버튼(501)을 입력받을 수 있다. Referring to FIG. 5A , the avatar generator 210 may receive an avatar edit button 501 of the application 500 from the user.

도 5b를 참조하면, 아바타 생성부(210)는 생성된 사용자 아바타를 표시할 수 있다. 이 때, 아바타 생성부(210)는 아바타 얼굴 버튼(511), 아바타 바디 버튼, 아바타 액세서리 버튼, 아바타 배경 버튼 등과 같이 사용자 아바타를 수정하기 위한 다양한 버튼(510)을 표시할 수 있다. Referring to FIG. 5B , the avatar generator 210 may display the generated user avatar. In this case, the avatar generator 210 may display various buttons 510 for modifying the user avatar, such as an avatar face button 511 , an avatar body button, an avatar accessory button, and an avatar background button.

아바타 얼굴 버튼(511)은 헤어 스타일, 눈썹 형태 등과 같이 사용자 아바타의 얼굴 요소의 형태를 변경하는 메뉴를 호출할 수 있으며, 아바타 생성부(210)는 사용자로부터 아바타 얼굴 버튼(511)을 입력받으면 아바타 얼굴 메뉴를 표시할 수 있다. The avatar face button 511 may call a menu for changing the shape of the user's avatar's face element, such as a hairstyle and eyebrow shape, and the avatar generator 210 receives an avatar face button 511 from the user, You can display the face menu.

아바타 바디 버튼은 사용자 아바타가 현재 착용하고 있는 외형 요소를 변경하는 메뉴를 호출할 수 있으며, 아바타 생성부(210)는 사용자로부터 아바타 바디 버튼을 입력받으면 아바타 바디 메뉴를 표시할 수 있다. The avatar body button may call a menu for changing an appearance element currently worn by the user's avatar, and the avatar generator 210 may display an avatar body menu upon receiving an avatar body button input from the user.

아바타 액세서리 버튼은 사용자 아바타에 착용하는 안경 등의 액세서리를 선택하는 메뉴를 호출할 수 있으며, 아바타 생성부(210)는 사용자로부터 액세서리 버튼을 입력받으면 액세서리 메뉴를 표시할 수 있다. The avatar accessory button may call a menu for selecting accessories such as glasses to be worn on the user's avatar, and the avatar generator 210 may display an accessory menu when receiving an accessory button input from the user.

아바타 배경 버튼은 사용자 아바타의 배경 버튼을 변경하는 메뉴를 호출할 수 있으며, 아바타 생성부(210)는 사용자로부터 아바타 배경 버튼을 입력받으면 배경 메뉴를 표시할 수 있다. The avatar background button may call a menu for changing the background button of the user's avatar, and the avatar generating unit 210 may display a background menu when receiving an avatar background button from the user.

도 5c를 참조하면, 아바타 생성부(210)는 사용자로부터 아바타 얼굴 버튼(511)을 입력받은 경우, 헤어 카테고리, 얼굴형 카테고리, 눈썹 카테고리, 눈 카테고리, 코 카테고리, 입 카테고리, 수염 카테고리와 같이 얼굴과 관련된 다양한 카테고리(520)를 표시하도록 할 수 있다. Referring to FIG. 5C , when the avatar face button 511 is input by the user, the avatar generator 210 includes a face such as a hair category, a face type category, an eyebrow category, an eye category, a nose category, a mouth category, and a beard category. Various categories 520 related to can be displayed.

도 5d를 참조하면, 아바타 생성부(210)는 사용자로부터 눈썹 카테고리를 선택받은 경우, 다양한 눈썹 모양을 표시하고, 사용자로부터 다양한 눈썹 모양 중 어느 하나의 눈썹 모양(521)을 선택받을 수 있다. Referring to FIG. 5D , when an eyebrow category is selected by the user, the avatar generator 210 may display various eyebrow shapes and receive a selection of any one eyebrow shape 521 from among various eyebrow shapes.

또는, 아바타 생성부(210)는 사용자로부터 얼굴의 전체 형태를 조절하는 일종의 노드에 해당하는 조절점을 입력받고, 조절점에 기초하여 각 카테고리에 해당하는 부위의 모양이 변경되도록 할 수 있다. 조절점은 상/하 또는 좌/우로 조절 가능하며, 좌우 대칭으로 조절점이 배치되어 한 쪽의 조절점을 조절하면, 다른 조절점도 같이 조절될 수 있다. Alternatively, the avatar generator 210 may receive an adjustment point corresponding to a kind of node for adjusting the overall shape of the face from the user, and change the shape of a part corresponding to each category based on the adjustment point. The control points can be adjusted up/down or left/right, and if the control points are arranged symmetrically and one control point is adjusted, the other control points can also be adjusted.

도 5e를 참조하면, 아바타 생성부(210)는 선택된 눈썹 모양(521)이 적용된 사용자 아바타(530)를 표시할 수 있다. Referring to FIG. 5E , the avatar generator 210 may display the user avatar 530 to which the selected eyebrow shape 521 is applied.

다시 도 2로 돌아와서, 아바타 생성부(210)는 생성된 사용자 아바타가 복수개인 경우, 복수개의 사용자 아바타 중 어느 하나를 대표 사용자 아바타로 설정받을 수 있다. 대표 사용자 아바타를 설정하는 과정에 대해서는 도 6a 내지 도 6c를 통해 상세히 설명하도록 한다. Returning to FIG. 2 , when there are a plurality of generated user avatars, the avatar generator 210 may receive any one of the plurality of user avatars as the representative user avatar. The process of setting the representative user avatar will be described in detail with reference to FIGS. 6A to 6C .

도 6a 내지 도 6c는 본 발명의 일 실시예에 따른 사용자 단말에서 복수의 사용자 아바타 중 어느 하나를 대표 사용자 아바타로 선택하는 과정을 설명하기 위한 예시적인 도면이다. 6A to 6C are exemplary diagrams for explaining a process of selecting one of a plurality of user avatars as a representative user avatar in a user terminal according to an embodiment of the present invention.

*65도 6a를 참조하면, 아바타 생성부(210)는 사용자로부터 애플리케이션의 대표 아바타 변경 버튼(600)을 입력받아 대표 사용자 아바타를 설정하기 위한 프로세스로 진입할 수 있다. *65 Referring to FIG. 6A , the avatar generator 210 may receive a representative avatar change button 600 of the application from the user and enter a process for setting the representative user avatar.

도 6b를 참조하면, 아바타 생성부(210)는 생성된 사용자 아바타가 복수개인 경우, 사용자로부터 복수개의 사용자 아바타(610) 중 어느 하나의 사용자 아바타(611)를 선택받은 후 완료 버튼(612)을 입력받을 수 있다. Referring to FIG. 6B , when there are a plurality of generated user avatars, the avatar generator 210 selects any one user avatar 611 among the plurality of user avatars 610 from the user and then clicks the Done button 612 . can be input.

도 6c를 참조하면, 아바타 생성부(210)는 선택된 사용자 아바타(611)를 대표 사용자 아바타(620)로 설정할 수 있다. 여기서, 대표 사용자 아바타(620)는 사용자 단말(110)에서 타사용자 단말(120)과 아바타를 이용하여 영상 통화를 수행하는 경우에 이용될 수 있으며, 추후에 사용자의 선택에 의해 다른 사용자 아바타로 변경될 수 있다. Referring to FIG. 6C , the avatar generator 210 may set the selected user avatar 611 as the representative user avatar 620 . Here, the representative user avatar 620 may be used when the user terminal 110 performs a video call with the other user terminal 120 using the avatar, and is later changed to another user avatar by the user's selection. can be

*68다시 도 2로 돌아와서, 영상 통화 요청부(220)는 사용자 단말(110)과 적어도 하나의 타사용자 단말(120) 간에 영상 통화를 요청할 수 있다. 영상 통화를 요청하는 과정에 대해서는 도 7a 및 도 7b를 통해 상세히 설명하도록 한다. *68 Returning to FIG. 2 again, the video call request unit 220 may request a video call between the user terminal 110 and at least one other user terminal 120 . The process of requesting a video call will be described in detail with reference to FIGS. 7A and 7B .

도 7a 및 도 7b는 본 발명의 일 실시예에 따른 사용자 단말에서 타사용자 단말로의 영상 통화를 요청하는 과정을 설명하기 위한 예시적인 도면이다. 7A and 7B are exemplary diagrams for explaining a process of requesting a video call from a user terminal to another user terminal according to an embodiment of the present invention.

도 7a를 참조하면, 영상 통화 요청부(220)는 사용자로부터 애플리케이션의 화면(700)에 표시된 통화 버튼(701)을 입력받을 수 있다. Referring to FIG. 7A , the video call request unit 220 may receive a call button 701 displayed on the screen 700 of the application from the user.

도 7b를 참조하면, 영상 통화 요청부(220)는 사용자 단말(110)에 등록된 친구 리스트(710) 중 적어도 하나의 친구를 선택받고, 선택된 친구에 대응하는 타사용자 단말(120)과의 영상 통화를 통화 중계 서버(130)로 요청할 수 있다. 예를 들어, 영상 통화 요청부(220)는 영상 통화가 가능한 친구 리스트(710)를 표시하여 사용자로부터 친구를 선택(711)받은 후 통화 중계 서버(130)로 영상 통화를 요청(712)할 수 있다. 여기서, 영상 통화 요청부(220)는 하나의 친구와의 영상 통화를 요청할 수도 있고, 다수의 친구와의 영상 통화를 요청할 수도 있다. Referring to FIG. 7B , the video call request unit 220 receives a selection of at least one friend from the friend list 710 registered in the user terminal 110 , and performs an image with the other user terminal 120 corresponding to the selected friend. A call may be requested to the call relay server 130 . For example, the video call request unit 220 may display a list of friends capable of video calls (710), select a friend from the user (711), and then request (712) a video call to the call relay server 130. there is. Here, the video call request unit 220 may request a video call with one friend or a video call with a plurality of friends.

다시 도 2로 돌아와서, 촬영부(230)는 사용자 단말(110)과 타사용자 단말(120) 간의 영상 통화가 연결된 경우, 사용자 단말(110)과 연동하는 카메라를 이용하여 사용자 실사 영상을 촬영할 수 있다. 예를 들어, 촬영부(230)는 사용자 단말(110)의 전면 카메라를 구동하여 사용자 실사 영상을 촬영할 수 있다. Returning to FIG. 2 again, when the video call between the user terminal 110 and the other user terminal 120 is connected, the photographing unit 230 may take a user's actual image by using a camera interworking with the user terminal 110 . . For example, the photographing unit 230 may drive the front camera of the user terminal 110 to photograph the user's live-action image.

통신부(240)는 촬영된 사용자 실사 영상을 통화 중계 서버(130)로 전송하고, 통화 중계 서버(130)로부터 타사용자 단말(120)에서 촬영된 타사용자 실사 영상을 수신할 수 있다. 예를 들어, 통신부(240)는 일반 통화 모드가 수행되는 경우, 촬영된 사용자 실사 영상을 인코딩하여 통화 중계 서버(130)로 전송할 수 있다. 다른 예를 들어, 통신부(240)는 아바타 통화 모드가 수행되는 경우, 사용자 아바타에 대한 영상을 인코딩하여 통화 중계 서버(130)로 전송할 수 있다. The communication unit 240 may transmit the captured user's live-action image to the call relay server 130 , and may receive the other user's live-action image captured by the other user terminal 120 from the call relay server 130 . For example, when the normal call mode is performed, the communication unit 240 may encode the captured user's live-action image and transmit it to the call relay server 130 . For another example, when the avatar call mode is performed, the communication unit 240 may encode an image of the user avatar and transmit it to the call relay server 130 .

표시부(250)는 영상 통화에 대한 일반 통화 모드를 통해 사용자 단말(110)의 화면의 제 1 영역에 사용자 실사 영상을 표시하고, 화면의 제 2 영역에 타사용자 실사 영상을 표시할 수 있다. 예를 들어, 표시부(250)는 화면의 우측 상단에 위치한 제 1 영역에 사용자 실사 영상을 표시하고, 제 1 영역을 제외한 나머지 영역에 해당하는 제 2 영역에 타사용자 실사 영상을 표시할 수 있다. The display unit 250 may display the user's live-action image in the first area of the screen of the user terminal 110 through the general call mode for the video call, and may display the other user's live-action image in the second area of the screen. For example, the display unit 250 may display the user's live-action image in the first area located at the upper right of the screen, and display the other user's live-action image in the second area corresponding to the remaining area except for the first area.

일 실시예에 따르면, 사용자 단말(110)은 사용자 실사 영상 기반의 일반 통화 모드로부터 사용자 아바타 기반의 아바타 통화 모드로 변경하여 영상 통화를 수행할 수 있다. According to an embodiment, the user terminal 110 may perform a video call by changing from the normal call mode based on the actual user image to the avatar call mode based on the user avatar.

모드 변경부(260)는 영상 통화에 대한 아바타 통화 모드를 요청받는 경우, 일반 통화 모드를 아바타 통화 모드로 변경할 수 있다. When receiving a request for an avatar call mode for a video call, the mode change unit 260 may change the normal call mode to the avatar call mode.

*77표시부(250)는 아바타 통화 모드를 요청 받은 경우, 화면의 제 1 영역에 표시된 사용자 실사 영상을 기설정된 사용자 아바타로 대체 내지 변경하여 표시할 수 있다. 여기서, 표시부(250)는 아바타 통화 모드가 수행되는 경우, 대표 사용자 아바타로 설정된 사용자 아바타를 표시할 수 있다. *77 When the avatar call mode is requested, the display unit 250 may replace or change the user's live-action image displayed on the first area of the screen with a preset user avatar and display it. Here, when the avatar call mode is performed, the display unit 250 may display a user avatar set as a representative user avatar.

표시부(250)는 아바타 통화 모드가 수행되는 경우, 촬영된 사용자 실사 영상으로부터 사용자 단말(110)의 사용자의 얼굴 표정을 트래킹하고, 트래킹된 얼굴 표정에 따라 기설정된 사용자 아바타의 표정을 변경할 수 있다. When the avatar call mode is performed, the display unit 250 may track the facial expression of the user of the user terminal 110 from the captured user's live-action image, and may change the preset facial expression of the user avatar according to the tracked facial expression.

다른 실시예에 따르면, 사용자 단말(110)은 사용자 실사 영상 기반의 일반 통화 모드에서 뷰티 효과 모드를 적용하여 영상 통화를 수행할 수 있다. According to another embodiment, the user terminal 110 may perform a video call by applying the beauty effect mode in the normal call mode based on the actual user image.

모드 변경부(260)는 영상 통화 중 뷰티 효과 모드를 요청받을 수 있다. The mode change unit 260 may receive a request for a beauty effect mode during a video call.

표시부(250)는 화면의 제 1 영역에 표시된 사용자 실사 영상에 뷰티 효과를 적용하여 표시할 수 있다. The display unit 250 may display a beauty effect applied to the user's actual image displayed in the first area of the screen.

도 2에서 도시되지는 않았으나, 사용자 단말(110)은 타사용자 단말(120)과 일반 통화 모드, 아바타 통화 모드 및 뷰티 효과 모드를 이용한 영상 통화를 수행하는 동시에 동일 화면 상에서 채팅을 수행할 수도 있다.Although not shown in FIG. 2 , the user terminal 110 may perform a video call using the general call mode, the avatar call mode, and the beauty effect mode with the other user terminal 120 and chat on the same screen at the same time.

도 8a 내지 도 8d는 본 발명의 일 실시예에 따른 사용자 단말에서 타사용자 단말과 영상 통화를 수행하는 과정을 설명하기 위한 예시적인 도면이다. 8A to 8D are exemplary diagrams for explaining a process of performing a video call with another user terminal in a user terminal according to an embodiment of the present invention.

도 8a는 본 발명의 일 실시예에 따른 사용자 단말(110)에서 일반 통화 모드를 통해 타사용자 단말(120)과 영상 통화를 수행하는 과정을 도시한 예시적인 도면이다. 도 8a를 참조하면, 사용자 단말(110)은 일반 통화 모드를 통해 사용자(800)와 타사용자 간의 영상 통화를 수행할 수 있다. 이 때, 사용자 단말(110)은 사용자(800)와 한 명의 타사용자와 1:1 영상 통화를 수행할 수 있으며, 사용자(800)와 다수의 타사용자와 다자간 영상 통화를 수행할 수도 있다. 8A is an exemplary diagram illustrating a process of performing a video call with another user terminal 120 through a normal call mode in the user terminal 110 according to an embodiment of the present invention. Referring to FIG. 8A , the user terminal 110 may perform a video call between the user 800 and another user through a normal call mode. In this case, the user terminal 110 may perform a 1:1 video call with the user 800 and one other user, and may perform a multi-party video call with the user 800 and a plurality of other users.

도 8b 내지 도 8d는 본 발명의 일 실시예에 따른 사용자 단말에서 아바타 통화 모드를 통해 타사용자 단말과 영상 통화를 수행하는 과정을 도시한 예시적인 도면이다. 8B to 8D are exemplary diagrams illustrating a process of performing a video call with another user terminal through an avatar call mode in a user terminal according to an embodiment of the present invention.

도 8b를 참조하면, 사용자 단말(110)은 하나의 타사용자 단말(120)과 영상 통화를 진행할 수 있다. 예를 들어, 사용자 단말(110)에서 일반 통화 모드를 통해 영상 통화를 수행하고, 타사용자 단말(120)에서 일반 통화 모드로부터 아바타 통화 모드로 변경되어 영상 통화를 수행하는 경우, 사용자 단말(110)은 사용자 단말(110)의 화면의 제 2 영역(820)에 사용자 실사 영상을 표시하고, 화면의 제 1 영역(810)에 타사용자 아바타에 대한 영상을 표시할 수 있다. Referring to FIG. 8B , the user terminal 110 may conduct a video call with one other user terminal 120 . For example, when the user terminal 110 performs a video call through the normal call mode and the other user terminal 120 changes from the normal call mode to the avatar call mode to perform the video call, the user terminal 110 may display the user's live-action image in the second area 820 of the screen of the user terminal 110, and display the image of the other user's avatar in the first area 810 of the screen.

도 8c를 참조하면, 사용자 단말(110)은 다수의 타사용자 단말(120)과 영상 통화를 진행할 수 있다. 예를 들어, 사용자 단말(110)은 다수의 타사용자 단말(120)과 영상 통화를 진행하는 경우, 사용자 단말(110)은 영상 통화에 참여한 총 인원수에 기초하여 화면을 분할할 수 있다. 여기서, 분할된 화면의 크기는 모두 동일한 것일 수 있으나 이에 한정되지는 않는다.Referring to FIG. 8C , the user terminal 110 may conduct a video call with a plurality of other user terminals 120 . For example, when the user terminal 110 conducts a video call with a plurality of other user terminals 120 , the user terminal 110 may divide the screen based on the total number of people participating in the video call. Here, the sizes of the divided screens may all be the same, but are not limited thereto.

예를 들어, 사용자 단말(110)은 사용자에 의해 각 분할 화면의 위치의 변경을 입력받고, 변경된 위치에 기초하여 각 분할 화면을 표시할 수 있다. 다른 예를 들어, 사용자 단말(110)은 사용자로부터 복수의 분할 화면 중 어느 하나의 화면에 확대 입력을 받은 경우, 해당 화면을 확대하여 표시하고, 나머지 화면을 동일한 비율로 축소시켜 표시할 수 있다. For example, the user terminal 110 may receive a change in the location of each split screen by the user, and display each split screen based on the changed location. For another example, when an enlargement input is received from the user on one of the plurality of split screens, the user terminal 110 may enlarge and display the corresponding screen and display the remaining screens by reducing the screen at the same ratio.

사용자 및 다수의 타사용자는 각 단말에서 수행되는 일반 통화 모드 또는 아바타 통화 모드에 기초하여 각각의 영상이 표시될 수 있다. 예를 들어, 사용자 단말(110)에서 아바타 통화 모드를 통해 다수의 타사용자 단말(120)과 영상 통화를 수행하는 경우, 사용자 단말(110)은 분할된 다수의 영역 중 제 1 영역(830)에 사용자의 모습을 사용자 아바타로 대체하여 표시할 수 있다. The user and a plurality of other users may display respective images based on the general call mode or the avatar call mode performed in each terminal. For example, when the user terminal 110 performs a video call with a plurality of other user terminals 120 through the avatar call mode, the user terminal 110 moves to the first area 830 of the plurality of divided areas. The user's appearance may be replaced with a user avatar and displayed.

도 8d를 참조하면, 사용자 단말(110)은 아바타 통화 모드를 통해 영상 통화가 수행되는 경우, 촬영되는 사용자 실사 영상으로부터 사용자 단말(110)의 사용자의 얼굴 표정을 트래킹하고, 트래킹된 얼굴 표정을 따라 사용자 아바타의 표정을 변경시킬 수 있다. 예를 들어, 사용자가 윙크한 표정을 지은 경우, 사용자 아바타(840)도 사용자의 표정을 따라서 윙크 표정을 지을 수 있다. Referring to FIG. 8D , when a video call is performed through the avatar call mode, the user terminal 110 tracks the facial expression of the user of the user terminal 110 from the captured user's live-action image, and follows the tracked facial expression. It is possible to change the facial expression of the user avatar. For example, when the user makes a winking expression, the user avatar 840 may also make a wink expression according to the user's expression.

도 9는 본 발명의 일 실시예에 따른 사용자 단말에서 아바타를 이용하여 영상 통화를 수행하는 방법의 순서도이다. 도 9에 도시된 사용자 단말(110)에서 아바타를 이용하여 영상 통화를 수행하는 방법은 도 1 내지 도 8d에 도시된 실시에예 따라 영상 통화 시스템(1)에 의해 시계열적으로 처리되는 단계들을 포함한다. 따라서, 이하 생략된 내용이라고 하더라도 도 1 내지 도 8d에 도시된 실시예에 따른 사용자 단말(110)에서 아바타를 이용하여 영상 통화를 수행하는 방법에도 적용된다. 9 is a flowchart of a method of performing a video call using an avatar in a user terminal according to an embodiment of the present invention. The method of performing a video call using an avatar in the user terminal 110 shown in FIG. 9 includes steps processed in time series by the video call system 1 according to the embodiment shown in FIGS. 1 to 8D . do. Therefore, even if omitted below, it is also applied to the method of performing a video call using the avatar in the user terminal 110 according to the embodiment shown in FIGS. 1 to 8D .

단계 S910에서 사용자 단말(110)은 사용자 단말과 적어도 하나의 타사용자 단말(120) 간에 영상 통화를 요청할 수 있다. In step S910 , the user terminal 110 may request a video call between the user terminal and at least one other user terminal 120 .

단계 S920에서 사용자 단말(110)은 사용자 단말(110)과 타사용자 단말(120) 간의 영상 통화가 연결된 경우, 사용자 단말(110)과 연동하는 카메라를 이용하여 사용자 실사 영상을 촬영할 수 있다. In step S920 , when a video call is connected between the user terminal 110 and the other user terminal 120 , the user terminal 110 may use a camera interworking with the user terminal 110 to take a user's actual image.

단계 S930에서 사용자 단말(110)은 촬영된 사용자 실사 영상을 통화 중계 서버(130)로 전송하고, 통화 중계 서버(130)로부터 타사용자 단말(120)에서 촬영된 타사용자 실사 영상을 수신할 수 있다. In step S930, the user terminal 110 transmits the captured user live-action image to the call relay server 130, and may receive the other user's live-action image captured in the other user terminal 120 from the call relay server 130. .

단계 S940에서 사용자 단말(110)은 영상 통화에 대한 일반 통화 모드를 통해 사용자 단말(110)의 화면의 제 1 영역에 사용자 실사 영상을 표시하고, 화면의 제 2 영역에 타사용자 실사 영상을 표시할 수 있다. In step S940, the user terminal 110 displays the user's live-action image on the first area of the screen of the user terminal 110 through the general call mode for the video call, and displays the other user's live-action image on the second area of the screen. can

단계 S950에서 사용자 단말(110)은 영상 통화에 대한 아바타 통화 모드를 요청받는 경우, 일반 통화 모드를 아바타 통화 모드로 변경할 수 있다. In step S950 , when the user terminal 110 receives a request for the avatar call mode for the video call, the user terminal 110 may change the normal call mode to the avatar call mode.

단계 S960에서 사용자 단말(110)은 아바타 통화 모드를 요청 받은 경우, 화면의 제 1 영역에 표시된 사용자 실사 영상을 기설정된 사용자 아바타로 대체하여 표시할 수 있다. When the user terminal 110 receives a request for the avatar call mode in step S960, the user's live-action image displayed on the first area of the screen may be replaced with a preset user avatar and displayed.

상술한 설명에서, 단계 S910 내지 S960은 본 발명의 구현예에 따라서, 추가적인 단계들로 더 분할되거나, 더 적은 단계들로 조합될 수 있다. 또한, 일부 단계는 필요에 따라 생략될 수도 있고, 단계 간의 순서가 전환될 수도 있다.In the above description, steps S910 to S960 may be further divided into additional steps or combined into fewer steps, according to an embodiment of the present invention. In addition, some steps may be omitted as needed, and the order between the steps may be switched.

도 10은 본 발명의 일 실시예에 따른 통화 중계 서버의 구성도이다. 도 10을 참조하면, 통화 중계 서버(130)는 요청부(1010) 및 통신부(1020)를 포함할 수 있다. 10 is a block diagram of a call relay server according to an embodiment of the present invention. Referring to FIG. 10 , the call relay server 130 may include a request unit 1010 and a communication unit 1020 .

요청부(1010)는 사용자 단말(110)로부터 적어도 하나의 타사용자 단말(120)로의 영상 통화를 요청받을 수 있다. 예를 들어, 요청부(1010)는 사용자 단말(110)로부터 사용자 단말(110)의 애플리케이션에 등록된 친구 리스트 중 적어도 하나의 친구를 선택받고, 선택된 친구에 대응하는 타사용자 단말(120)과의 영상 통화를 요청받을 수 있다.The requester 1010 may receive a request for a video call from the user terminal 110 to at least one other user terminal 120 . For example, the requesting unit 1010 receives a selection of at least one friend from the list of friends registered in the application of the user terminal 110 from the user terminal 110, and communicates with the other user terminal 120 corresponding to the selected friend. You may be asked to make a video call.

통신부(1020)는 타사용자 단말(120)에서 영상 통화를 수락한 경우, 사용자 단말(110)로부터 사용자 영상 데이터를 수신하여 타사용자 단말(120)로 전송할 수 있다. 또한, 통신부(1020)는 타사용자 단말(120)로부터 타사용자 영상 데이터를 수신하여 사용자 단말(110)로 전송할 수 있다. When the video call is accepted by the other user terminal 120 , the communication unit 1020 may receive user image data from the user terminal 110 and transmit it to the other user terminal 120 . Also, the communication unit 1020 may receive other user image data from the other user terminal 120 and transmit it to the user terminal 110 .

통신부(1020)는 사용자 단말(110) 또는 타사용자 단말(120)에서 수행되는 통화 모드에 기초하여 영상 데이터를 수신할 수 있다. 예를 들어, 통신부(1020)는 사용자 단말(110)에서 일반 통화 모드가 수행되는 경우, 사용자 단말(110)로부터 사용자 실사 영상이 인코딩된 사용자 영상 데이터를 수신하고, 타사용자 단말(120)에서 일반 통화 모드가 수행되는 경우, 타사용자 단말(120)로부터 타사용자 실사 영상이 인코딩된 타사용자 영상 데이터를 수신할 수 있다. 다른 예를 들어, 통신부(1020)는 사용자 단말(110)에서 아바타 통화 모드가 수행되는 경우, 사용자 단말(110)로부터 사용자 아바타에 대한 영상이 인코딩된 사용자 영상 데이터를 수신하고, 타사용자 단말(120)에서 아바타 통화 모드가 수행되는 경우, 타사용자 단말(120)로부터 타사용자 아바타에 대한 영상이 인코딩된 타사용자 영상 데이터를 수신할 수 있다. The communication unit 1020 may receive image data based on a call mode performed by the user terminal 110 or the other user terminal 120 . For example, when the general call mode is performed in the user terminal 110 , the communication unit 1020 receives the user image data encoded with the actual user image from the user terminal 110 , and in the other user terminal 120 , the general call mode is performed. When the call mode is performed, the other user's image data in which the actual image of the other user is encoded may be received from the other user terminal 120 . For another example, when the avatar call mode is performed in the user terminal 110 , the communication unit 1020 receives user image data in which an image of the user avatar is encoded from the user terminal 110 , and the other user terminal 120 . ), when the avatar call mode is performed, the other user's image data in which an image of the other user's avatar is encoded may be received from the other user terminal 120 .

즉, 통화 중계 서버(130)는 사용자 단말(110) 및 타사용자 단말(120)으로부터 사용자 단말(110) 및 타사용자 단말(120) 각각에서 선택한 영상 통화 모드에 기초하여 일반 통화 모드의 경우 실사 영상이 인코딩된 영상 데이터를 수신하고, 아바타 통화 모드의 경우 아바타에 대한 영상이 인코딩된 영상 데이터를 수신할 수 있다. That is, the call relay server 130 based on the video call mode selected in each of the user terminal 110 and the other user terminal 120 from the user terminal 110 and the other user terminal 120, in the case of a normal call mode, the actual image The encoded image data may be received, and in the case of the avatar call mode, the encoded image data for the avatar may be received.

도 11은 본 발명의 일 실시예에 따른 통화 중계 서버에서 아바타를 이용하여 영상 통화를 수행하는 방법의 순서도이다. 도 11에 도시된 통화 중계 서버(130)에서 아바타를 이용하여 영상 통화를 수행하는 방법은 도 1 내지 도 10에 도시된 실시예에 따라 영상 통화 시스템(1)에 의해 시계열적으로 처리되는 단계들을 포함한다. 따라서, 이하 생략된 내용이라고 하더라도 도 1 내지 도 10에 도시된 실시예에 따른 통화 중계 서버(130)에서 아바타를 이용하여 영상 통화를 수행하는 방법에도 적용된다. 11 is a flowchart of a method of performing a video call using an avatar in a call relay server according to an embodiment of the present invention. The method of performing a video call using an avatar in the call relay server 130 shown in FIG. 11 includes steps processed in time series by the video call system 1 according to the embodiments shown in FIGS. 1 to 10 . include Therefore, even if omitted below, it is also applied to the method of performing a video call using the avatar in the call relay server 130 according to the embodiment shown in FIGS. 1 to 10 .

단계 S1110에서 통화 중계 서버(130)는 사용자 단말(110)로부터 적어도 하나의 타사용자 단말(120)로의 영상 통화를 요청받을 수 있다. In step S1110 , the call relay server 130 may receive a request for a video call from the user terminal 110 to at least one other user terminal 120 .

단계 S1120에서 통화 중계 서버(130)는 타사용자 단말(120)에서 영상 통화를 수락한 경우, 사용자 단말(110)로부터 사용자 영상 데이터를 수신하여 타사용자 단말(120)로 전송할 수 있다. In step S1120 , when the video call is accepted by the other user terminal 120 , the call relay server 130 may receive user image data from the user terminal 110 and transmit it to the other user terminal 120 .

단계 S1130에서 통화 중계 서버(130)는 타사용자 단말(120)로부터 타사용자 영상 데이터를 수신하여 사용자 단말(110)로 전송할 수 있다. In step S1130 , the call relay server 130 may receive the other user's image data from the other user terminal 120 and transmit it to the user terminal 110 .

상술한 설명에서, 단계 S1110 내지 S1130은 본 발명의 구현예에 따라서, 추가적인 단계들로 더 분할되거나, 더 적은 단계들로 조합될 수 있다. 또한, 일부 단계는 필요에 따라 생략될 수도 있고, 단계 간의 순서가 전환될 수도 있다.In the above description, steps S1110 to S1130 may be further divided into additional steps or combined into fewer steps, according to an embodiment of the present invention. In addition, some steps may be omitted as needed, and the order between the steps may be switched.

도 1 내지 도 11을 통해 설명된 사용자 단말 및 통화 중계 서버에서 아바타를 이용하여 영상 통화를 수행하는 방법은 컴퓨터에 의해 실행되는 매체에 저장된 컴퓨터 프로그램 또는 컴퓨터에 의해 실행 가능한 명령어를 포함하는 기록 매체의 형태로도 구현될 수 있다. 또한, 도 1 내지 도 11을 통해 설명된 사용자 단말 및 통화 중계 서버에서 아바타를 이용하여 영상 통화를 수행하는 방법은 컴퓨터에 의해 실행되는 매체에 저장된 컴퓨터 프로그램의 형태로도 구현될 수 있다. The method of performing a video call using an avatar in the user terminal and the call relay server described with reference to FIGS. 1 to 11 includes a computer program stored in a medium executed by a computer or a recording medium including instructions executable by the computer. It can also be implemented in a form. Also, the method of performing a video call using an avatar in the user terminal and the call relay server described with reference to FIGS. 1 to 11 may be implemented in the form of a computer program stored in a medium executed by a computer.

컴퓨터 판독 가능 매체는 컴퓨터에 의해 액세스될 수 있는 임의의 가용 매체일 수 있고, 휘발성 및 비휘발성 매체, 분리형 및 비분리형 매체를 모두 포함한다. 또한, 컴퓨터 판독가능 매체는 컴퓨터 저장 매체를 포함할 수 있다. 컴퓨터 저장 매체는 컴퓨터 판독가능 명령어, 데이터 구조, 프로그램 모듈 또는 기타 데이터와 같은 정보의 저장을 위한 임의의 방법 또는 기술로 구현된 휘발성 및 비휘발성, 분리형 및 비분리형 매체를 모두 포함한다. Computer-readable media can be any available media that can be accessed by a computer and includes both volatile and nonvolatile media, removable and non-removable media. Also, computer-readable media may include computer storage media. Computer storage media includes both volatile and nonvolatile, removable and non-removable media implemented in any method or technology for storage of information such as computer readable instructions, data structures, program modules or other data.

전술한 본 발명의 설명은 예시를 위한 것이며, 본 발명이 속하는 기술분야의 통상의 지식을 가진 자는 본 발명의 기술적 사상이나 필수적인 특징을 변경하지 않고서 다른 구체적인 형태로 쉽게 변형이 가능하다는 것을 이해할 수 있을 것이다. 그러므로 이상에서 기술한 실시예들은 모든 면에서 예시적인 것이며 한정적이 아닌 것으로 이해해야만 한다. 예를 들어, 단일형으로 설명되어 있는 각 구성 요소는 분산되어 실시될 수도 있으며, 마찬가지로 분산된 것으로 설명되어 있는 구성 요소들도 결합된 형태로 실시될 수 있다. The above description of the present invention is for illustration, and those of ordinary skill in the art to which the present invention pertains can understand that it can be easily modified into other specific forms without changing the technical spirit or essential features of the present invention. will be. Therefore, it should be understood that the embodiments described above are illustrative in all respects and not restrictive. For example, each component described as a single type may be implemented in a dispersed form, and likewise components described as distributed may be implemented in a combined form.

본 발명의 범위는 상기 상세한 설명보다는 후술하는 특허청구범위에 의하여 나타내어지며, 특허청구범위의 의미 및 범위 그리고 그 균등 개념으로부터 도출되는 모든 변경 또는 변형된 형태가 본 발명의 범위에 포함되는 것으로 해석되어야 한다.The scope of the present invention is indicated by the following claims rather than the above detailed description, and all changes or modifications derived from the meaning and scope of the claims and their equivalent concepts should be interpreted as being included in the scope of the present invention. do.

110: 사용자 단말
120: 타사용자 단말
130: 통화 중계 서버
210: 아바타 생성부
220: 영상 통화 요청부
230: 촬영부
240: 통신부
250: 표시부
260: 모드 변경부
1010: 요청부
1020: 통신부110: user terminal
120: other user terminal
130: call relay server
210: avatar generator
220: video call request unit
230: shooting department
240: communication department
250: display
260: mode change unit
1010: request
1020: communication department

Claims

In the user terminal performing a video call using an avatar,
a video call request unit for requesting a video call between the user terminal and at least one other user terminal;
When the video call is connected between the user terminal and the other user terminal, a photographing unit for photographing a user's live-action image using a camera interworking with the user terminal;
a communication unit for transmitting the captured user's actual image to a call relay server and receiving the other user's actual image taken from the other user's terminal from the call relay server;
a display unit for displaying the user's actual image on the first area of the screen of the user terminal through the general call mode for the video call, and displaying the other user's actual image on the second area of the screen; and
When receiving a request for an avatar call mode for the video call, a mode change unit for changing the normal call mode to the avatar call mode
including,
When the display unit receives a request for the avatar call mode, the user's live-action image displayed on the first area of the screen is replaced and displayed with a preset user avatar,
Further comprising an avatar generator for generating a user face model based on the facial features of the user of the user terminal and the pre-learned average face model,
The avatar generator generates a beauty offset in which a beauty weight is applied to an offset obtained by subtracting a plurality of vertex coordinates included in the average face model from a plurality of vertex coordinates included in the pre-modeled beauty model, and the user through synthesis with the user face model. A user terminal that generates an avatar.

The method of claim 1,
The communication unit, when the normal call mode is performed, the user terminal by encoding the captured user live-action image and transmitting it to the call relay server.

The method of claim 1,
When the avatar call mode is performed, the regular communication unit encodes an image of the user avatar and transmits the encoded image to the call relay server.

The method of claim 1,
When the avatar call mode is performed, the display unit tracks the facial expression of the user of the user terminal from the captured user live-action image, and changes the facial expression of the preset user avatar according to the tracked facial expression. , user terminal.

The method of claim 1,
The video call request unit receives a selection of at least one friend from a list of friends registered in the user terminal, and requests a video call with another user terminal corresponding to the selected friend to the call relay server.

The method of claim 1,
The avatar generating unit receives, when there are a plurality of generated user avatars, one of the plurality of user avatars set as a representative user avatar;
When the avatar call mode is performed, the display unit displays a user avatar set as the representative user avatar.

The method of claim 1,
The avatar generating unit,
receiving an input of a photographed image for a user of the user terminal;
Adjusting at least one of the size or resolution of the input captured image,
Detecting the feature point of the user from the adjusted photographed image,
generating the user face model based on the pre-learned average face model including the detected feature points and a plurality of vertices.

The method of claim 1,
wherein the avatar generator receives a request for replacement of at least one part included in the face of the generated user avatar, and receives a selection of a replacement element for the at least one part to be replaced.

9. The method of claim 8,
and the avatar generator changes vertex coordinates corresponding to the at least one part requested to be replaced into vertex coordinates corresponding to the selected replacement element.

The method of claim 1,
The mode change unit receives a request for a beauty effect mode during the video call,
The display unit, the user terminal by applying a beauty effect to the user's live-action image displayed in the first area of the screen.

In the call relay server for performing a video call using an avatar,
a request unit for receiving a request for a video call from the user terminal to at least one other user terminal; and
When the video call is accepted by the other user terminal, the communication unit receives the user image data from the user terminal and transmits it to the other user terminal, and receives the other user image data from the other user terminal and transmits it to the user terminal including,
When the video call is a normal call mode, the user image data includes a user live-action image of the user terminal, and the other user image data includes an actual image of another user of the other user terminal,
When the video call is changed from the normal call mode to the avatar call mode in the user terminal, the user image data includes an image of the user avatar replaced from the actual user image,
A user face model is generated by the user terminal based on facial features of the user of the user terminal and a pre-learned average face model,
The user avatar is generated by synthesizing the user face model and a beauty offset in which a beauty weight is applied to an offset obtained by subtracting a plurality of vertex coordinates included in the average face model from a plurality of vertex coordinates included in the pre-modeled beauty model. that is, a call relay server.

12. The method of claim 11,
When the general call mode is performed in the user terminal, the communication unit receives the user image data in which the actual user image is encoded from the user terminal,
When the normal call mode is performed in the other user terminal, the call relay server to receive the other user's image data encoded with the actual image of the other user from the other user terminal.

12. The method of claim 11,
When the avatar call mode is performed in the user terminal, the communication unit receives the user image data encoded with the image for the user avatar from the user terminal, and when the avatar call mode is performed in the other user terminal , A call relay server that receives the other user's image data encoded with an image of the other user's avatar from the other user's terminal.

A method of performing a video call using an avatar through a call relay server, the method comprising:
receiving a video call request from the user terminal to at least one other user terminal;
receiving user image data from the user terminal and transmitting the user image data to the other user terminal when the video call is accepted by the other user terminal; and
Receiving other user image data from the other user terminal and transmitting the image data to the user terminal,
When the video call is a normal call mode, the user image data includes a user live-action image of the user terminal, and the other user image data includes an actual image of another user of the other user terminal,
When the video call is changed from the normal call mode to the avatar call mode in the user terminal, the user image data includes an image for the user avatar displayed instead of the actual user image,
A user face model is generated by the user terminal based on facial features of the user of the user terminal and a pre-learned average face model,
The user avatar is generated by synthesizing the user face model and a beauty offset in which a beauty weight is applied to an offset obtained by subtracting a plurality of vertex coordinates included in the average face model from a plurality of vertex coordinates included in the pre-modeled beauty model. What is a call relay method.

15. The method of claim 14,
When the general call mode is performed in the user terminal, receiving from the user terminal the user image data encoded with the actual user image; and
When the normal call mode is performed in the other user terminal, the method further comprising the step of receiving the other user's image data encoded with the actual image of the other user from the other user terminal, the call relay method.

15. The method of claim 14,
receiving, from the user terminal, the user image data in which an image of the user avatar is encoded, when the avatar call mode is performed in the user terminal; and
When the avatar call mode is performed in the other user terminal, the method further comprising: receiving, from the other user terminal, the other user's image data encoded with an image of the other user's avatar.