KR102624635B1

KR102624635B1 - 3D data generation in messaging systems

Info

Publication number: KR102624635B1
Application number: KR1020227009689A
Authority: KR
Inventors: 카일 굿리치; 사무엘 에드워드 하레; 맥심 맥시모브 라자로브; 토니 매튜; 앤드류 제임스 맥피; 다니엘 모레노; 드리티만 사가르; 원타오 샹
Original assignee: 스냅 인코포레이티드
Priority date: 2019-08-28
Filing date: 2020-08-28
Publication date: 2024-01-15
Also published as: EP4022560A1; CN114730483A; KR20220051376A; WO2021042134A1

Abstract

대상 기술은 이미지 데이터 및 깊이 데이터를 수신한다. 대상 기술은 3차원(3D) 효과에 대응하는 증강 현실 콘텐츠 생성기를 선택한다. 대상 기술은 선택된 증강 현실 콘텐츠 생성기에 적어도 부분적으로 기초하여 이미지 데이터 및 깊이 데이터에 3D 효과를 적용한다. 대상 기술은, 프로세서를 사용하여, 적용된 3D 효과, 이미지 데이터, 및 깊이 데이터에 관련된 정보를 포함하는 메시지를 생성한다.The target technology receives image data and depth data. The target technology selects an augmented reality content generator that supports three-dimensional (3D) effects. The subject technology applies 3D effects to image data and depth data based at least in part on a selected augmented reality content generator. The target technology uses a processor to generate messages containing information related to the applied 3D effects, image data, and depth data.

Description

3D data generation in messaging systems

본 출원은 2019년 8월 28일자로 출원된, 발명의 명칭이 "GENERATING 3D DATA IN A MESSAGING SYSTEM"인 미국 가특허 출원 제62/893,037호, 2019년 8월 28일자로 출원된, 발명의 명칭이 "BEAUTIFICATION TECHNIQUES FOR 3D DATA IN A MESSAGING SYSTEM"인 미국 가특허 출원 제62/893,046호, 2019년 8월 28일자로 출원된, 발명의 명칭이 "EFFECTS FOR 3D DATA IN A MESSAGING SYSTEM"인 미국 가특허 출원 제62/893,048호, 2019년 8월 28일자로 출원된, 발명의 명칭이 "PROVIDING 3D DATA FOR MESSAGES IN A MESSAGING SYSTEM"인 미국 가특허 출원 제62/893,050호에 대한 우선권을 주장하며, 이러한 각각의 내용은 모든 목적들을 위해 그 전체가 본 명세서에 참고로 포함된다.This application is based on U.S. Provisional Patent Application No. 62/893,037, filed on August 28, 2019, titled “GENERATING 3D DATA IN A MESSAGING SYSTEM,” filed on August 28, 2019 This U.S. Provisional Patent Application No. 62/893,046, titled “BEAUTIFICATION TECHNIQUES FOR 3D DATA IN A MESSAGING SYSTEM,” filed on August 28, 2019, is entitled “EFFECTS FOR 3D DATA IN A MESSAGING SYSTEM.” Patent Application No. 62/893,048, claims priority to U.S. Provisional Patent Application No. 62/893,050, entitled “PROVIDING 3D DATA FOR MESSAGES IN A MESSAGING SYSTEM,” filed August 28, 2019; Each of these contents is incorporated herein by reference in its entirety for all purposes.

디지털 이미지들의 증가된 사용, 휴대용 컴퓨팅 디바이스들의 적절성, 디지털 저장 매체들의 증가된 용량의 가용성, 및 네트워크 접속들의 증가된 대역폭 및 액세스 가능성에 따라, 디지털 이미지들은 증가하는 수의 사람들에 대한 일상 생활의 일부가 되었다.With the increased use of digital images, the availability of portable computing devices, the availability of increased capacity in digital storage media, and the increased bandwidth and accessibility of network connections, digital images are a part of everyday life for an increasing number of people. It has become.

임의의 특정 엘리먼트 또는 행위의 논의를 용이하게 식별하기 위해, 참조 번호에서 최상위 숫자 또는 숫자들은 그 엘리먼트가 처음 도입되는 도면 번호를 지칭한다.
도 1은 일부 예시적인 실시예들에 따른, 본 개시내용이 배치될 수 있는 네트워크 환경의 도식적 표현이다.
도 2는 일부 예시적인 실시예들에 따른, 메시징 클라이언트 애플리케이션의 도식적 표현이다.
도 3은 일부 예시적인 실시예들에 따른, 데이터베이스에 유지되는 바와 같은 데이터 구조의 도식적 표현이다.
도 4는 일부 예시적인 실시예들에 따른, 메시지의 도식적 표현이다.
도 5는 일부 예시적인 실시예들에 따른 액세스 제한 프로세스에 대한 흐름도이다.
도 6은 일부 실시예들에 따른, 주어진 3D 메시지에 대응하는 부가 정보를 포함하는, 도 4에 기술된 바와 같은, 메시지 주석들의 구조를 예시하는 개략도이다.
도 7은 일부 예시적인 실시예들에 따른, 주석 시스템의 다양한 모듈들을 예시하는 블록도이다.
도 8은 일부 예시적인 실시예들에 따른, 3D 메시지를 생성하는 방법을 예시하는 흐름도이다.
도 9는 일부 예시적인 실시예들에 따라, 3D 메시지를 생성하기 위한 방법과 함께 수행될 수 있는 이미지 및 깊이 데이터를 처리하기 위한 변환 패스들을 수행하는 방법을 예시하는 흐름도이다.
도 10은 일부 예시적인 실시예들에 따라, 3D 메시지를 생성하기 위한 방법과 함께 수행될 수 있는 이미지 및 깊이 데이터의 미화를 수행하는 방법을 예시하는 흐름도이다.
도 11은 일부 예시적인 실시예들에 따라, 3D 메시지를 생성하기 위한 방법과 함께 수행될 수 있는 움직임 데이터에 응답하여 3D 메시지의 뷰를 업데이트하는 방법을 예시하는 흐름도이다.
도 12는 일부 실시예들에 따른, 증강 현실 콘텐츠 생성기를 선택하여 미디어 콘텐츠(예를 들어, 이미지 또는 비디오)에 적용하고, 적용된 증강 현실 콘텐츠 생성기를 메시징 클라이언트 애플리케이션 또는 메시징 시스템에 프레젠테이션하기 위한 캐러셀(carousel)을 도시하는 예시적인 사용자 인터페이스들을 예시한다.
도 13은 일부 예시적인 실시예들에 따른, 클라이언트 디바이스의 디스플레이에서 이미지 정보를 캡처하고 3D 메시지를 생성하는 것을 예시하는 예이다.
도 14는 일부 예시적인 실시예들에 따른, 원시 깊이 맵 및 패킹된 깊이 맵을 예시하는 예이다.
도 15는 일부 예시적인 실시예들에 따른, 깊이 인페인팅 마스크 및 깊이 인페인팅을 예시하는 예이다.
도 16은 일부 예시적인 실시예들에 따라, 입자들, 그래픽 객체(예를 들어, 안경) 상의 반사, 및 움직임 데이터(예를 들어, 자이로스코픽 센서로부터의 모션 데이터)에 응답하여 렌더링되는 3D 부착물을 예시하는 3D 효과들의 예, 및 움직임 데이터에 응답하여 렌더링되는 포스트 효과들 및 동적 3D 부착물을 예시하는 3D 효과들의 예이다.
도 17은 일부 예시적인 실시예들에 따른, 움직임 데이터에 응답하여 렌더링되는 동적 인공 조명을 예시하는 3D 효과의 예, 및 움직임 데이터에 응답하여 렌더링되는 안경, 3D 부착물, 및 애니메이션화된 스프라이트 배경 상의 반사/굴절을 예시하는 3D 효과들의 예이다.
도 18은 일부 예시적인 실시예들에 따른, 움직임 데이터에 응답하여 렌더링되는 제어형 입자 시스템(예를 들어, 애니메이션화된 발사체), 및 2D 및 3D 부착물들을 예시하는 3D 효과들의 예의 예, 및 움직임 데이터에 응답하여 렌더링되는 3D 부착물들(예를 들어, 토끼 귀(bunny ears))에 대한 공동 애니메이션을 예시하는 3D 효과들의 예이다.
도 19는 일부 예시적인 실시예들에 따른, 스프라이트들, 안경 상의 반사, 움직임 데이터에 응답하여 렌더링되는 2D 및 3D 부착물들을 예시하는 3D 효과들의 예, 및 움직임 데이터에 응답하여 렌더링되는 안경, 입자들, 및 애니메이션화된 배경 상의 반사/굴절을 예시하는 3D 효과들의 예이다.
도 20은 일부 예시적인 실시예들에 따른, 움직임 데이터에 응답하여 렌더링되는 사용자의 얼굴을 가리는 부착물 및 애니메이션화된 전경을 예시하는 3D 효과들의 예, 및 움직임 데이터에 응답하여 렌더링되는 동적 인공 조명, 입자들, 및 안경 상에서의 반사/굴절을 예시하는 3D 효과들의 예이다.
도 21은 일부 예시적인 실시예들에 따른, 움직임 데이터에 응답하여 렌더링되는 리터치, 포스트 효과, 3D 부착물 및 입자들을 예시하는 3D 효과의 예, 및 움직임 데이터에 응답하여 렌더링되는 3D 부착물, 스프라이트들 및 입자를 예시하는 3D 효과들의 예이다.
도 22는 일부 예시적인 실시예들에 따른, 본 개시내용이 구현될 수 있는 소프트웨어 아키텍처를 도시하는 블록도이다.
도 23은 일부 예시적인 실시예들에 따른, 머신으로 하여금 논의된 방법론들 중 임의의 하나 이상을 수행하게 야기하기 위해 명령어들의 세트가 그 내에서 실행될 수 있는 컴퓨터 시스템 형태의 머신의 도식적 표현이다.To facilitate the identification of discussion of any particular element or act, the most significant digit or digits in a reference number refer to the figure number in which the element is first introduced.
1 is a schematic representation of a network environment in which the present disclosure may be deployed, in accordance with some example embodiments.
2 is a schematic representation of a messaging client application, according to some example embodiments.
3 is a schematic representation of a data structure as maintained in a database, according to some example embodiments.
4 is a diagrammatic representation of a message, according to some example embodiments.
5 is a flow diagram of an access restriction process according to some example embodiments.
Figure 6 is a schematic diagram illustrating the structure of message annotations, as described in Figure 4, containing side information corresponding to a given 3D message, according to some embodiments.
7 is a block diagram illustrating various modules of an annotation system, according to some example embodiments.
8 is a flow diagram illustrating a method for generating a 3D message, according to some example embodiments.
9 is a flow diagram illustrating a method for performing transformation passes for processing image and depth data that may be performed in conjunction with a method for generating a 3D message, according to some example embodiments.
10 is a flow diagram illustrating a method for performing beautification of image and depth data that may be performed in conjunction with a method for generating a 3D message, according to some example embodiments.
11 is a flow diagram illustrating a method for updating a view of a 3D message in response to motion data that may be performed in conjunction with a method for generating a 3D message, according to some example embodiments.
12 illustrates a carousel for selecting and applying an augmented reality content generator to media content (e.g., an image or video) and presenting the applied augmented reality content generator to a messaging client application or messaging system, according to some embodiments. (carousel) illustrates example user interfaces.
13 is an example illustrating capturing image information and generating a 3D message on a display of a client device, according to some example embodiments.
Figure 14 is an example illustrating a raw depth map and a packed depth map, according to some example embodiments.
Figure 15 is an example illustrating a depth inpainting mask and depth inpainting, according to some example embodiments.
16 shows a 3D attachment rendered in response to particles, reflections on a graphical object (e.g., glasses), and motion data (e.g., motion data from a gyroscopic sensor), according to some example embodiments. and examples of 3D effects illustrating dynamic 3D attachments and post effects rendered in response to motion data.
17 is an example of a 3D effect illustrating dynamic artificial lighting rendered in response to motion data, and reflections on glasses, 3D attachments, and an animated sprite background rendered in response to motion data, according to some example embodiments. /This is an example of 3D effects that illustrate refraction.
18 shows an example of a controlled particle system (e.g., an animated projectile) rendered in response to motion data, and 3D effects illustrating 2D and 3D attachments, and to motion data, according to some example embodiments. Here is an example of 3D effects that illustrate joint animation for 3D attachments (eg, bunny ears) that are rendered in response.
19 is an example of 3D effects illustrating sprites, reflections on glasses, 2D and 3D attachments rendered in response to motion data, and glasses and particles rendered in response to motion data, according to some example embodiments. , and 3D effects illustrating reflection/refraction on an animated background.
20 is an example of 3D effects illustrating an animated foreground and an attachment occluding a user's face rendered in response to motion data, and dynamic artificial lighting, particles rendered in response to motion data, according to some example embodiments. , and 3D effects illustrating reflection/refraction on glasses.
21 is an example of a 3D effect illustrating retouches, post effects, 3D attachments and particles rendered in response to motion data, and 3D attachments, sprites and particles rendered in response to motion data, according to some example embodiments. This is an example of 3D effects illustrating particles.
Figure 22 is a block diagram illustrating a software architecture in which the present disclosure may be implemented, in accordance with some example embodiments.
Figure 23 is a schematic representation of a machine in the form of a computer system within which a set of instructions can be executed to cause the machine to perform any one or more of the methodologies discussed, according to some example embodiments.

다양한 위치들로부터 관심의 범위를 갖는 사용자들은 다양한 피사체들의 디지털 이미지들을 캡처하고 캡처된 이미지들을 인터넷과 같은 네트워크들을 통해 다른 사람들에게 이용가능하게 할 수 있다. 디지털 이미지들에 대한 사용자들의 경험들을 향상시키고 다양한 특징들을 제공하기 위해, 컴퓨팅 디바이스들이 광범위한 변화 조건들(예를 들어, 이미지 스케일들, 잡음들, 조명, 움직임, 또는 기하학적 왜곡의 변화들)에서 캡처되는 다양한 객체들 및/또는 특징들에 대해 이미지 처리 동작들을 수행할 수 있게 하는 것은 어렵고 계산 집약적일 수 있다.Users with a range of interests from various locations can capture digital images of various subjects and make the captured images available to others through networks such as the Internet. To enhance users' experiences with digital images and provide a variety of features, computing devices capture under a wide range of changing conditions (e.g., changes in image scales, noise, lighting, motion, or geometric distortion). Being able to perform image processing operations on a variety of objects and/or features can be difficult and computationally intensive.

본 명세서에서 논의된 바와 같이, 대상 인프라스트럭처는 메시징 시스템의 다양한 컴포넌트들 전체에 걸쳐, 본 명세서에서 3D 메시지들로 지칭되는 대화형 3D 미디어의 생성 및 공유를 지원한다. 본 명세서에 기술된 인프라스트럭처는 다른 형태의 3D 미디어가 대상 시스템에 걸쳐 제공될 수 있게 해주며, 이는 깊이 기반 미디어가 메시징 시스템에 걸쳐 그리고 사진 및 비디오 메시지와 함께 공유될 수 있게 해준다. 본 명세서에 설명된 예시적인 실시예들에서, 메시지들은 (예를 들어, 3D 메시지들 및/또는 다른 메시지들이 메모리 또는 데이터베이스에 저장되는 경우) 라이브 카메라로부터 또는 스토리지로부터 시스템에 진입할 수 있다. 대상 시스템은 모션 센서 입력을 지원하고, 깊이 데이터의 전송 및 저장, 및 외부 효과들 및 자산 데이터의 로딩을 관리한다.As discussed herein, the destination infrastructure supports the creation and sharing of interactive 3D media, referred to herein as 3D messages, across the various components of the messaging system. The infrastructure described herein allows other forms of 3D media to be presented across target systems, allowing depth-based media to be shared across messaging systems and along with photo and video messages. In example embodiments described herein, messages may enter the system from a live camera or from storage (e.g., if 3D messages and/or other messages are stored in memory or a database). The target system supports motion sensor input, manages transmission and storage of depth data, and loading of external effects and asset data.

본 명세서에 설명된 바와 같이, 3차원(3D) 메시지는 적어도 이미지 및 깊이 데이터를 포함하는 대화형 3D 이미지를 지칭한다. 예시적인 실시예에서, 전통적인 이미지 텍스처 이외에, 카메라가 보는 것의 공간 상세/지오메트리를 시각화하기 위해 대상 시스템을 사용하여 3D 메시지가 렌더링된다. 시청자가 클라이언트 디바이스를 이동시킴으로써 이 3D 메시지와 상호작용할 때, 움직임은 이미지 및 지오메트리가 시청자에게 렌더링되는 관점에서의 대응하는 변화를 트리거한다.As described herein, a three-dimensional (3D) message refers to an interactive 3D image that includes at least image and depth data. In an example embodiment, in addition to traditional image textures, the 3D message is rendered using an object system to visualize the spatial details/geometry of what the camera sees. When a viewer interacts with this 3D message by moving the client device, the movement triggers a corresponding change in perspective from which images and geometry are rendered to the viewer.

본 명세서에서 언급되는 바와 같이, "증강 현실 경험", "증강 현실 콘텐츠 항목", "증강 현실 콘텐츠 생성기"라는 문구는, 본 명세서에서 추가로 설명되는 바와 같이, 이미지 수정, 필터, 렌즈(LENSES), 미디어 오버레이, 변환 등에 대응하는 다양한 이미지 처리 동작들을 포함하거나 지칭한다.As referred to herein, the phrases “augmented reality experience,” “augmented reality content item,” and “augmented reality content generator” refer to image modifications, filters, and lenses (LENSES), as further described herein. , includes or refers to various image processing operations corresponding to media overlay, conversion, etc.

본 명세서에서 언급된 바와 같이, 3D 증강 현실 콘텐츠 생성기는 메시지에 추가될 수 있는 실시간 특수 효과 및/또는 사운드를 지칭하고, 3D 애니메이화된 그래픽 요소들 등과 같은 AR 효과들 및/또는 다른 3D 콘텐츠로 이미지 및/또는 깊이 데이터를 수정한다.As referred to herein, 3D augmented reality content generator refers to real-time special effects and/or sounds that can be added to messages, AR effects, such as 3D animated graphic elements, and/or other 3D content. Modify image and/or depth data.

도 1은 네트워크를 통해 데이터(예를 들어, 메시지들 및 연관된 콘텐츠)를 교환하기 위한 예시적인 메시징 시스템(100)을 도시하는 블록도이다. 메시징 시스템(100)은 클라이언트 디바이스(102)의 다수의 인스턴스를 포함하고, 이러한 각각은 메시징 클라이언트 애플리케이션(104)을 포함하는 다수의 애플리케이션을 호스팅한다. 각각의 메시징 클라이언트 애플리케이션(104)은 네트워크(106)(예를 들어, 인터넷)를 통해 메시징 클라이언트 애플리케이션(104) 및 메시징 서버 시스템(108)의 다른 인스턴스들에 통신가능하게 결합된다.1 is a block diagram illustrating an example messaging system 100 for exchanging data (e.g., messages and associated content) over a network. Messaging system 100 includes multiple instances of client devices 102, each of which hosts multiple applications, including messaging client applications 104. Each messaging client application 104 is communicatively coupled to other instances of messaging client application 104 and messaging server system 108 over a network 106 (e.g., the Internet).

메시징 클라이언트 애플리케이션(104)은 네트워크(106)를 통해 다른 메시징 클라이언트 애플리케이션(104)과 그리고 메시징 서버 시스템(108)과 통신하고 데이터를 교환할 수 있다. 메시징 클라이언트 애플리케이션(104) 사이에서 그리고 메시징 클라이언트 애플리케이션(104)과 메시징 서버 시스템(108) 사이에서 교환되는 데이터는, 기능들(예를 들어, 기능들을 기동시키는 명령들)뿐만 아니라, 페이로드 데이터(예를 들어, 텍스트, 오디오, 비디오 또는 다른 멀티미디어 데이터)를 포함한다.Messaging client applications 104 may communicate and exchange data with other messaging client applications 104 and with messaging server system 108 over network 106 . Data exchanged between messaging client applications 104 and between messaging client applications 104 and messaging server system 108 include functions (e.g., commands that invoke functions) as well as payload data ( For example, text, audio, video or other multimedia data).

메시징 서버 시스템(108)은 네트워크(106)를 통해 특정 메시징 클라이언트 애플리케이션(104)에 서버 측 기능성을 제공한다. 메시징 시스템(100)의 특정 기능들이 메시징 클라이언트 애플리케이션(104)에 의해 또는 메시징 서버 시스템(108)에 의해 수행되는 것으로 본 명세서에 설명되지만, 메시징 클라이언트 애플리케이션(104) 또는 메시징 서버 시스템(108) 내의 특정 기능성의 위치는 설계 선택사항이다. 예를 들어, 처음에는 특정 기술 및 기능성을 메시징 서버 시스템(108) 내에 배치하지만, 나중에 클라이언트 디바이스(102)가 충분한 처리 용량을 갖는 경우 이 기술 및 기능성을 메시징 클라이언트 애플리케이션(104)으로 이전시키는 것이 기술적으로 바람직할 수 있다.Messaging server system 108 provides server-side functionality to specific messaging client applications 104 over network 106. Although certain functions of messaging system 100 are described herein as being performed by messaging client application 104 or by messaging server system 108, certain functions within messaging client application 104 or messaging server system 108 The location of functionality is a design choice. For example, it may be technically feasible to initially place certain technology and functionality within the messaging server system 108, but later transfer this technology and functionality to the messaging client application 104 when the client device 102 has sufficient processing capacity. It may be desirable.

메시징 서버 시스템(108)은 메시징 클라이언트 애플리케이션(104)에 제공되는 다양한 서비스들 및 동작들을 지원한다. 그러한 동작들은 메시징 클라이언트 애플리케이션(104)에 데이터를 송신하고, 그로부터 데이터를 수신하고, 그에 의해 생성된 데이터를 처리하는 것을 포함한다. 이 데이터는, 예로서, 메시지 콘텐츠, 클라이언트 디바이스 정보, 지오로케이션 정보, 미디어 주석 및 오버레이, 메시지 콘텐츠 지속 조건, 소셜 네트워크 정보, 및 라이브 이벤트 정보를 포함할 수 있다. 메시징 시스템(100) 내의 데이터 교환은 메시징 클라이언트 애플리케이션(104)의 사용자 인터페이스(UI)들을 통해 이용 가능한 기능들을 통해 기동되고 제어된다.Messaging server system 108 supports various services and operations provided to messaging client application 104. Such operations include sending data to, receiving data from, and processing data generated by messaging client application 104. This data may include, for example, message content, client device information, geolocation information, media annotations and overlays, message content persistence conditions, social network information, and live event information. Data exchange within messaging system 100 is initiated and controlled through functions available through user interfaces (UIs) of messaging client application 104.

이제 구체적으로 메시징 서버 시스템(108)을 참조하면, 애플리케이션 프로그램 인터페이스(API) 서버(110)가 애플리케이션 서버(112)에 결합되어 프로그램 방식의 인터페이스(programmatic interface)를 제공한다. 애플리케이션 서버(112)는 데이터베이스 서버(118)에 통신가능하게 결합되고, 이는 애플리케이션 서버(112)에 의해 처리되는 메시지들과 연관된 데이터가 저장되는 데이터베이스(120)로의 액세스를 용이하게 한다.Referring now specifically to messaging server system 108, an application program interface (API) server 110 is coupled to application server 112 and provides a programmatic interface. Application server 112 is communicatively coupled to database server 118, which facilitates access to database 120 where data associated with messages processed by application server 112 is stored.

API(Application Program Interface) 서버(110)는 클라이언트 디바이스(102)와 애플리케이션 서버(112) 사이에서 메시지 데이터(예를 들어, 명령들 및 메시지 페이로드들)를 수신하고 송신한다. 구체적으로, 애플리케이션 프로그램 인터페이스(API) 서버(110)는 애플리케이션 서버(112)의 기능성을 기동시키기 위해 메시징 클라이언트 애플리케이션(104)에 의해 호출되거나 조회될 수 있는 인터페이스들(예를 들어, 루틴들 및 프로토콜들)의 세트를 제공한다. 애플리케이션 프로그램 인터페이스(API) 서버(110)는, 계정 등록, 로그인 기능성, 특정 메시징 클라이언트 애플리케이션(104)으로부터 다른 메시징 클라이언트 애플리케이션(104)으로의, 애플리케이션 서버(112)를 통한 메시지의 전송, 메시징 클라이언트 애플리케이션(104)으로부터 메시징 서버 애플리케이션(114)으로의 미디어 파일들(예를 들어, 이미지 또는 비디오)의 전송, 및 다른 메시징 클라이언트 애플리케이션(104)에 의한 가능한 액세스를 위해, 미디어 데이터의 컬렉션(예를 들어, 스토리)의 설정, 클라이언트 디바이스(102)의 사용자의 친구들의 리스트의 검색, 그러한 컬렉션들의 검색, 메시지 및 콘텐츠의 검색, 소셜 그래프로의 친구의 추가 및 삭제, 소셜 그래프 내의 친구들의 위치, 및 (예를 들어, 메시징 클라이언트 애플리케이션(104)에 관련된) 애플리케이션 이벤트를 오픈하는 것 등을 포함한, 애플리케이션 서버(112)에 의해 지원되는 다양한 기능들을 노출시킨다.The Application Program Interface (API) server 110 receives and transmits message data (e.g., commands and message payloads) between the client device 102 and the application server 112. Specifically, application program interface (API) server 110 provides interfaces (e.g., routines and protocols) that can be called or queried by messaging client application 104 to invoke the functionality of application server 112. s) is provided. The application program interface (API) server 110 provides account registration, login functionality, transfer of messages through the application server 112 from one messaging client application 104 to another messaging client application 104, and messaging client application 110. Transfer of media files (e.g., images or videos) from 104 to messaging server application 114, and collection of media data (e.g., for possible access by other messaging client applications 104). , Stories), retrieval of the user's list of friends on the client device 102, retrieval of such collections, retrieval of messages and content, addition and deletion of friends to the social graph, location of friends in the social graph, and ( It exposes various functions supported by the application server 112, including, for example, opening application events (related to the messaging client application 104).

애플리케이션 서버(112)는 메시징 서버 애플리케이션(114), 이미지 처리 시스템(116), 및 소셜 네트워크 시스템(122)을 포함하는 다수의 애플리케이션들 및 서브시스템들을 호스팅한다. 메시징 서버 애플리케이션(114)은, 특히 메시징 클라이언트 애플리케이션(104)의 다수의 인스턴스로부터 수신된 메시지들에 포함된 콘텐츠(예를 들어, 텍스트 및 멀티미디어 콘텐츠)의 집성 및 다른 처리에 관련된, 다수의 메시지 처리 기술들 및 기능들을 구현한다. 더 상세히 설명되는 바와 같이, 다수의 소스로부터의 텍스트 및 미디어 콘텐츠는, 콘텐츠의 컬렉션들(예를 들어, 스토리 또는 갤러리라고 불림)로 집성될 수 있다. 그 후, 이들 컬렉션은 메시징 서버 애플리케이션(114)에 의해, 메시징 클라이언트 애플리케이션(104)이 이용할 수 있게 된다. 다른 프로세서 및 메모리 집약적인 데이터의 처리는 또한, 그러한 처리를 위한 하드웨어 요건을 고려하여, 메시징 서버 애플리케이션(114)에 의해 서버 측에서 수행될 수 있다.Application server 112 hosts a number of applications and subsystems, including messaging server application 114, image processing system 116, and social networking system 122. Messaging server application 114 may perform multiple message processing, particularly related to aggregation and other processing of content (e.g., text and multimedia content) included in messages received from multiple instances of messaging client application 104. Implement technologies and functions. As will be described in greater detail, text and media content from multiple sources may be aggregated into collections of content (e.g., called stories or galleries). These collections are then made available by the messaging server application 114 to the messaging client application 104. Processing of other processor- and memory-intensive data may also be performed server-side by the messaging server application 114, taking into account hardware requirements for such processing.

애플리케이션 서버(112)는 전형적으로 메시징 서버 애플리케이션(114)에서 메시지의 페이로드 내에서 수신된 이미지 또는 비디오에 관하여, 다양한 이미지 처리 동작들을 수행하는 데 전용되는 이미지 처리 시스템(116)을 또한 포함한다.Application server 112 also typically includes an image processing system 116 dedicated to performing various image processing operations on images or video received within the payload of a message in messaging server application 114.

소셜 네트워크 시스템(122)은 다양한 소셜 네트워킹 기능들 서비스들을 지원하고, 이들 기능 및 서비스를 메시징 서버 애플리케이션(114)이 이용할 수 있게 한다. 이를 위해, 소셜 네트워크 시스템(122)은 데이터베이스(120) 내에 엔티티 그래프(304)(도 3에 도시됨)를 유지하고 액세스한다. 소셜 네트워크 시스템(122)에 의해 지원되는 기능들 및 서비스들의 예들은, 특정 사용자가 관계를 가지거나 "팔로우하는" 메시징 시스템(100)의 다른 사용자들의 식별, 및 또한 다른 엔티티들의 식별 및 특정 사용자의 관심사들을 포함한다.Social networking system 122 supports various social networking functions and services and makes these functions and services available to messaging server application 114. To this end, social network system 122 maintains and accesses entity graph 304 (shown in FIG. 3) within database 120. Examples of functions and services supported by social networking system 122 include, but are not limited to, identification of other users of messaging system 100 with which a particular user has relationships or “follows,” and also identification of other entities and functions of a particular user. Include interests.

애플리케이션 서버(112)는 데이터베이스 서버(118)에 통신가능하게 결합되고, 이는 메시징 서버 애플리케이션(114)에 의해 처리되는 메시지들과 연관된 데이터가 저장되는 데이터베이스(120)로의 액세스를 용이하게 한다.Application server 112 is communicatively coupled to database server 118, which facilitates access to database 120 where data associated with messages processed by messaging server application 114 is stored.

도 2는 예시적인 실시예들에 따른, 메시징 시스템(100)에 관한 추가 상세들을 예시하는 블록도이다. 구체적으로, 메시징 시스템(100)은 메시징 클라이언트 애플리케이션(104) 및 애플리케이션 서버(112)를 포함하는 것으로 도시되고, 이는 결국 다수의 일부 서브시스템, 즉, 단기적 타이머 시스템(202), 컬렉션 관리 시스템(204) 및 주석 시스템(206)을 구현한다.2 is a block diagram illustrating additional details regarding messaging system 100, according to example embodiments. Specifically, the messaging system 100 is shown as including a messaging client application 104 and an application server 112, which in turn may include a number of subsystems, namely a short-lived timer system 202, a collection management system 204 ) and annotation system 206.

단기적 타이머 시스템(202)은 메시징 클라이언트 애플리케이션(104) 및 메시징 서버 애플리케이션(114)에 의해 허용되는 콘텐츠에 대한 임시 액세스를 시행하는 것을 담당한다. 이를 위해, 단기적 타이머 시스템(202)은 메시지 또는 메시지들의 컬렉션(예를 들어, 스토리)과 연관된 지속기간 및 디스플레이 파라미터들에 기초하여, 메시징 클라이언트 애플리케이션(104)을 통해 메시지들 및 연관된 콘텐츠를 선택적으로 디스플레이하고 그에 대한 액세스를 가능하게 하는 다수의 타이머를 포함한다. 단기적 타이머 시스템(202)의 동작에 관한 추가의 상세사항들이 아래에 제공된다.The short-term timer system 202 is responsible for enforcing temporary access to content permitted by the messaging client application 104 and messaging server application 114. To this end, the short-lived timer system 202 selectively displays messages and associated content via the messaging client application 104 based on the duration and display parameters associated with the message or collection of messages (e.g., a story). Contains a number of timers that display and enable access to them. Additional details regarding the operation of short-term timer system 202 are provided below.

컬렉션 관리 시스템(204)은 미디어의 컬렉션들(예를 들어, 텍스트, 이미지 비디오 및 오디오 데이터의 컬렉션들)을 관리하는 것을 담당한다. 일부 예들에서, 콘텐츠의 컬렉션(예를 들어, 이미지들, 비디오, 텍스트, 및 오디오를 포함하는 메시지들)은 "이벤트 갤러리" 또는 "이벤트 스토리"로 조직될 수 있다. 그러한 컬렉션은 콘텐츠가 관련되는 이벤트의 지속기간과 같은 지정된 기간 동안 이용 가능하게 될 수 있다. 예를 들어, 음악 콘서트에 관한 콘텐츠는 그 음악 콘서트의 지속기간 동안 "스토리"로서 이용 가능하게 될 수 있다. 컬렉션 관리 시스템(204)은 또한 메시징 클라이언트 애플리케이션(104)의 사용자 인터페이스에 특정 컬렉션의 존재의 통지를 제공하는 아이콘을 게시하는 것을 담당할 수 있다.Collection management system 204 is responsible for managing collections of media (eg, collections of text, images video and audio data). In some examples, a collection of content (eg, messages containing images, video, text, and audio) may be organized into an “event gallery” or “event story.” Such collections may be made available for a specified period of time, such as the duration of the event to which the content relates. For example, content about a music concert may be made available as a “story” for the duration of the music concert. Collection management system 204 may also be responsible for posting icons in the user interface of messaging client application 104 that provide notification of the existence of a particular collection.

컬렉션 관리 시스템(204)은 더욱이 컬렉션 관리자가 콘텐츠의 특정 컬렉션을 관리 및 큐레이팅하는 것을 허용하는 큐레이션 인터페이스(208)를 포함한다. 예를 들어, 큐레이션 인터페이스(208)는 이벤트 조직자가 특정 이벤트에 관련된 콘텐츠의 컬렉션을 큐레이팅(예를 들어, 부적절한 콘텐츠 또는 중복 메시지들을 삭제)하는 것을 가능하게 한다. 추가적으로, 컬렉션 관리 시스템(204)은 머신 비전(또는 이미지 인식 기술) 및 콘텐츠 규칙들을 이용하여 콘텐츠 컬렉션을 자동으로 큐레이팅한다. 특정 실시예들에서, 사용자-생성 콘텐츠(user-generated content)를 컬렉션에 포함시키는 것에 대한 보상이 사용자에게 지불될 수 있다. 그러한 경우들에서, 큐레이션 인터페이스(208)는 그러한 사용자들에게 그들의 콘텐츠를 사용하는 것에 대해 자동으로 지불하도록 동작한다.Collection management system 204 further includes a curation interface 208 that allows collection managers to manage and curate specific collections of content. For example, curation interface 208 allows an event organizer to curate a collection of content related to a particular event (e.g., remove inappropriate content or duplicate messages). Additionally, collection management system 204 automatically curates content collections using machine vision (or image recognition technology) and content rules. In certain embodiments, a reward may be paid to a user for including user-generated content in a collection. In such cases, curation interface 208 operates to automatically pay such users for using their content.

주석 시스템(206)은 사용자가 메시지와 연관된 미디어 콘텐츠를 주석하거나 다른 방식으로 수정하거나 편집하는 것을 가능하게 하는 다양한 기능들을 제공한다. 예를 들어, 주석 시스템(206)은 메시징 시스템(100)에 의해 처리된 메시지들에 대한 미디어 오버레이들의 생성 및 게시와 관련된 기능들을 제공한다. 주석 시스템(206)은 클라이언트 디바이스(102)의 지오로케이션에 기초하여 메시징 클라이언트 애플리케이션(104)에게 미디어 오버레이 또는 보완물(예를 들어, 이미지 필터)을 동작적으로 공급한다. 다른 예에서, 주석 시스템(206)은 클라이언트 디바이스(102)의 사용자의 소셜 네트워크 정보와 같은 다른 정보에 기초하여 메시징 클라이언트 애플리케이션(104)에 미디어 오버레이를 동작적으로 공급한다. 미디어 오버레이는 오디오 및 시각 콘텐츠 및 시각 효과를 포함할 수 있다. 오디오 및 시각적 콘텐츠의 예는, 사진들, 텍스트, 로고, 애니메이션, 및 음향 효과를 포함한다. 시각적 효과의 예는 컬러 오버레잉을 포함한다. 오디오 및 시각 콘텐츠 또는 시각 효과는 클라이언트 디바이스(102)에 있는 미디어 콘텐츠 항목(예를 들어, 사진)에 적용될 수 있다. 예를 들어, 미디어 오버레이는 클라이언트 디바이스(102)에 의해 촬영된 사진 위에 오버레이될 수 있는 텍스트를 포함할 수 있다. 다른 예에서, 미디어 오버레이는, 위치 오버레이의 식별(예를 들어, Venice beach), 라이브 이벤트의 이름, 또는 상인 오버레이의 이름(예를 들어, Beach Coffee House)을 포함한다. 다른 예에서, 주석 시스템(206)은 클라이언트 디바이스(102)의 지오로케이션을 이용하여, 클라이언트 디바이스(102)의 지오로케이션에서의 상인의 이름을 포함하는 미디어 오버레이를 식별한다. 미디어 오버레이는 상인과 연관된 다른 표시들을 포함할 수 있다. 미디어 오버레이들은 데이터베이스(120)에 저장되고 데이터베이스 서버(118)를 통해 액세스될 수 있다.Annotation system 206 provides various functions that allow a user to annotate or otherwise modify or edit media content associated with a message. For example, annotation system 206 provides functionality related to the creation and posting of media overlays for messages processed by messaging system 100. The annotation system 206 operationally supplies media overlays or supplements (e.g., image filters) to the messaging client application 104 based on the geolocation of the client device 102. In another example, the annotation system 206 operationally supplies a media overlay to the messaging client application 104 based on other information, such as social network information of the user of the client device 102. Media overlays can include audio and visual content and visual effects. Examples of audio and visual content include photos, text, logos, animations, and sound effects. Examples of visual effects include color overlaying. Audio and visual content or visual effects may be applied to media content items (e.g., photos) on client device 102. For example, a media overlay may include text that may be overlaid over a photo taken by client device 102. In other examples, the media overlay includes the identification of a location overlay (eg, Venice beach), the name of a live event, or the name of a merchant overlay (eg, Beach Coffee House). In another example, annotation system 206 uses the geolocation of client device 102 to identify a media overlay that includes the name of a merchant at the geolocation of client device 102. The media overlay may include other indicia associated with the merchant. Media overlays may be stored in database 120 and accessed through database server 118.

하나의 예시적인 실시예에서, 주석 시스템(206)은 사용자들이 맵 상에서 지오로케이션을 선택하고, 선택된 지오로케이션과 연관된 콘텐츠를 업로드하는 것을 가능하게 하는 사용자-기반 게시 플랫폼(user-based publication platform)을 제공한다. 사용자는 또한 특정 미디어 오버레이가 다른 사용자들에게 제공되어야 하는 상황들을 지정할 수 있다. 주석 시스템(206)은 업로드된 콘텐츠를 포함하고 업로드된 콘텐츠를 선택된 지오로케이션과 연관시키는 미디어 오버레이를 생성한다.In one example embodiment, annotation system 206 provides a user-based publishing platform that allows users to select geolocations on a map and upload content associated with the selected geolocation. to provide. Users can also specify situations in which certain media overlays should be provided to other users. Annotation system 206 creates a media overlay that contains the uploaded content and associates the uploaded content with the selected geolocation.

다른 예시적인 실시예에서, 주석 시스템(206)은 상인들이 입찰 프로세스를 통해 지오로케이션과 연관된 특정 미디어 오버레이를 선택하는 것을 가능하게 하는 상인-기반 게시 플랫폼을 제공한다. 예를 들어, 주석 시스템(206)은 최고 입찰 상인의 미디어 오버레이를 미리 정의된 양의 시간 동안 대응하는 지오로케이션과 연관시킨다.In another example embodiment, annotation system 206 provides a merchant-based publishing platform that allows merchants to select specific media overlays associated with a geolocation through a bidding process. For example, annotation system 206 associates a media overlay of the highest bidding merchant with a corresponding geolocation for a predefined amount of time.

도 3은 일부 예시적인 실시예들에 따른, 메시징 서버 시스템(108)의 데이터베이스(120)에 저장될 수 있는 데이터 구조들(300)을 예시하는 개략도이다. 데이터베이스(120)의 콘텐츠가 다수의 테이블을 포함하는 것으로 도시되어 있지만, 데이터는 다른 타입의 데이터 구조에 (예를 들어, 객체-지향형 데이터베이스로서) 저장될 수 있다는 것을 인식할 것이다.FIG. 3 is a schematic diagram illustrating data structures 300 that may be stored in database 120 of messaging server system 108, according to some example embodiments. Although the contents of database 120 are shown as including a number of tables, it will be appreciated that data may be stored in other types of data structures (eg, as an object-oriented database).

데이터베이스(120)는 메시지 테이블(314) 내에 저장된 메시지 데이터를 포함한다. 엔티티 테이블(302)은 엔티티 그래프(304)를 포함하는 엔티티 데이터를 저장한다. 엔티티 테이블(302) 내에 레코드들이 유지되는 엔티티들은, 개인, 법인 엔티티, 조직, 객체, 장소, 이벤트 등을 포함할 수 있다. 타입에 관계없이, 그에 관해 메시징 서버 시스템(108)이 데이터를 저장하는 임의의 엔티티는 인식된 엔티티일 수 있다. 각각의 엔티티는 고유 식별자뿐만 아니라 엔티티 타입 식별자(도시되지 않음)를 구비한다.Database 120 contains message data stored in message table 314. The entity table 302 stores entity data including the entity graph 304. Entities for which records are maintained within entity table 302 may include individuals, corporate entities, organizations, objects, places, events, etc. Any entity, regardless of type, about which messaging server system 108 stores data may be a recognized entity. Each entity has a unique identifier as well as an entity type identifier (not shown).

엔티티 그래프(304)는 더욱이 엔티티들 사이의 관계 및 연관에 관한 정보를 저장한다. 그러한 관계들은, 단지 예를 들어, 사회의, 전문적(예를 들어, 일반 법인 또는 조직에서의 일), 관심 기반, 또는 활동 기반일 수 있다.The entity graph 304 further stores information regarding relationships and associations between entities. Such relationships may be, for example, social, professional (eg, work in a general corporation or organization), interest-based, or activity-based.

데이터베이스(120)는 또한 주석 데이터를 필터들의 예시적인 형태로 주석 테이블(312)에 저장한다. 주석 테이블(312) 내에 데이터가 저장되는 필터들은, 비디오들(비디오 테이블(310)에 데이터가 저장되는) 및/또는 이미지들(이미지 테이블(308)에 데이터가 저장되는)과 연관되고 이들에 적용된다. 한 예에서, 필터들은 수신자 사용자에의 프레젠테이션 동안 이미지 또는 비디오 상에 오버레이되어 디스플레이되는 오버레이들(overlays)이다. 필터들은, 전송측 사용자가 메시지를 작성하고 있을 때 메시징 클라이언트 애플리케이션(104)에 의해 전송측 사용자에게 프레젠테이션되는 필터들의 갤러리로부터의 사용자-선택된 필터들을 포함하여, 다양한 타입들일 수 있다. 다른 타입의 필터들은, 지리적 위치에 기초하여 전송측 사용자에게 프레젠테이션될 수 있는 지오로케이션 필터들(지오-필터들이라고도 함)을 포함한다. 예를 들어, 이웃 또는 특수한 위치에 특정한 지오로케이션 필터들이 클라이언트 디바이스(102)의 GPS 유닛에 의해 결정된 지오로케이션 정보에 기초하여 메시징 클라이언트 애플리케이션(104)에 의해 사용자 인터페이스 내에 프레젠테이션될 수 있다. 다른 타입의 필터는, 메시지 생성 프로세스 동안 클라이언트 디바이스(102)에 의해 수집된 정보 또는 다른 입력들에 기초하여, 메시징 클라이언트 애플리케이션(104)에 의해 전송측 사용자에게 선택적으로 프레젠테이션될 수 있는 데이터 필터이다. 데이터 필터들의 예는, 특정 위치에서의 현재 온도, 전송측 사용자가 이동하고 있는 현재 속도, 클라이언트 디바이스(102)에 대한 배터리 수명, 또는 현재 시간을 포함한다.Database 120 also stores annotation data in annotation table 312 in the example form of filters. Filters, for which data is stored in annotation table 312, are associated with and applied to videos (for which data is stored in video table 310) and/or images (for which data is stored in image table 308). do. In one example, filters are overlays that are displayed overlaid on an image or video during presentation to a recipient user. Filters can be of various types, including user-selected filters from a gallery of filters that are presented to the sending user by messaging client application 104 when the sending user is composing a message. Other types of filters include geolocation filters (also called geo-filters), which can be presented to the sending user based on geographic location. For example, neighborhood or specific location-specific geolocation filters may be presented within the user interface by messaging client application 104 based on geolocation information determined by the GPS unit of client device 102. Another type of filter is a data filter that can be selectively presented to the sending user by the messaging client application 104 based on information or other inputs collected by the client device 102 during the message creation process. Examples of data filters include the current temperature at a particular location, the current speed at which the sending user is traveling, battery life for the client device 102, or the current time.

이미지 테이블(308) 내에 저장될 수 있는 다른 주석 데이터는 증강 현실 콘텐츠 생성기들(예를 들어, 렌즈(LENSES), 증강 현실 경험들, 또는 증강 현실 콘텐츠 항목들을 적용하는 것에 대응함)이다. 증강 현실 콘텐츠 생성기는 이미지 또는 비디오에 추가될 수 있는 실시간 특수 효과 및 사운드일 수 있다.Other annotation data that may be stored within image table 308 are augmented reality content generators (eg, corresponding to applying LENSES, augmented reality experiences, or augmented reality content items). Augmented reality content generators can be real-time special effects and sounds that can be added to images or videos.

전술한 바와 같이, 증강 현실 콘텐츠 생성기들, 증강 현실 콘텐츠 항목들, 오버레이들, 이미지 변환들, AR 이미지들 및 유사한 용어들은 비디오들 또는 이미지들에 대해 이루어질 수 있는 수정들을 지칭한다. 이것은 이미지가 디바이스 센서를 사용하여 캡처되고 이어서 수정들을 갖는 디바이스의 스크린 상에 디스플레이될 때 이미지를 수정하는 실시간 수정을 포함한다. 이것은 또한 수정될 수 있는 갤러리 내의 비디오 클립들과 같은 저장된 콘텐츠에 대한 수정들을 포함한다. 예를 들어, 다수의 증강 현실 콘텐츠 생성기들에 액세스하는 디바이스에서, 사용자는 다수의 증강 현실 콘텐츠 생성기들을 갖는 단일 비디오 클립을 사용하여 상이한 증강 현실 콘텐츠 생성기들이 저장된 클립을 어떻게 수정할지를 알 수 있다. 예를 들어, 상이한 의사랜덤 움직임 모델들을 적용하는 다수의 증강 현실 콘텐츠 생성기들은 콘텐츠에 대한 상이한 증강 현실 콘텐츠 생성기들을 선택함으로써 동일한 콘텐츠에 적용될 수 있다. 유사하게, 디바이스의 센서들에 의해 현재 캡처되고 있는 비디오 이미지들이 어떻게 캡처된 데이터를 수정할 것인지를 보여주기 위해 도시된 수정과 함께 실시간 비디오 캡처가 사용될 수 있다. 그러한 데이터는 단순히 스크린 상에 디스플레이되고 메모리에 저장되어 있지 않을 수 있거나, 디바이스 센서들에 의해 캡처된 콘텐츠는 수정들과 함께 또는 수정들 없이(또는 둘 다) 메모리에 기록 및 저장될 수 있다. 일부 시스템들에서, 미리보기 특징은 상이한 증강 현실 콘텐츠 생성기들이 디스플레이 내의 상이한 윈도우들 내에서 동시에 어떻게 보일지를 보여줄 수 있다. 이것은, 예를 들어, 상이한 의사 랜덤 애니메이션들을 갖는 다수의 윈도우가 디스플레이 상에서 동시에 시청될 수 있게 할 수 있다.As described above, augmented reality content generators, augmented reality content items, overlays, image transformations, AR images, and similar terms refer to modifications that may be made to videos or images. This involves real-time modification of the image as it is captured using the device sensor and then displayed on the device's screen with the modifications. This includes modifications to stored content, such as video clips in a gallery that can also be modified. For example, on a device that accesses multiple augmented reality content generators, a user can use a single video clip with multiple augmented reality content generators to see how different augmented reality content generators will modify the stored clip. For example, multiple augmented reality content generators applying different pseudorandom motion models can be applied to the same content by selecting different augmented reality content generators for the content. Similarly, real-time video capture can be used with modifications shown to demonstrate how video images currently being captured by the device's sensors may modify the captured data. Such data may simply be displayed on a screen and not stored in memory, or content captured by device sensors may be recorded and stored in memory with or without modifications (or both). In some systems, a preview feature can show how different augmented reality content generators will look simultaneously within different windows within the display. This could, for example, allow multiple windows with different pseudo-random animations to be viewed simultaneously on the display.

따라서, 이 데이터를 사용하여 콘텐츠를 수정하기 위해 증강 현실 콘텐츠 생성기들 또는 다른 이러한 변환 시스템들을 사용하는 데이터 및 다양한 시스템들은 객체들(예를 들어, 얼굴들, 손들, 신체들, 고양이들, 개들, 표면들, 객체들 등)의 검출, 이러한 객체들이 비디오 프레임들에서 시야를 떠나고, 들어가고, 여기저기 이동할 때 이러한 객체들의 추적, 및 이러한 객체들이 추적될 때 이러한 객체들의 수정 또는 변환을 수반할 수 있다. 다양한 실시예들에서, 그러한 변환들을 달성하기 위한 상이한 방법들이 이용될 수 있다. 예를 들어, 일부 실시예들은 객체 또는 객체들의 3차원 메시 모델을 생성하는 것, 및 변환을 달성하기 위해 비디오 내에서의 모델의 변환 및 애니메이션화된 텍스처를 사용하는 것을 수반할 수 있다. 다른 실시예들에서, 객체 상의 포인트들의 추적을 이용하여, (2차원 또는 3차원일 수 있는) 이미지 또는 텍스처를 추적 포지션에 배치할 수 있다. 또 다른 실시예들에서, 비디오 프레임들의 신경망 분석이 콘텐츠(예를 들어, 비디오의 이미지들 또는 프레임들)에서 이미지들, 모델들, 또는 텍스처들을 배치하기 위해 사용될 수 있다. 따라서, 증강 현실 콘텐츠 생성기들은, 콘텐츠의 변환을 생성하는데 이용되는 이미지들, 모델들, 및 텍스처들 뿐만 아니라, 객체 검출, 추적, 및 배치로 이러한 변환을 달성하는데 필요한 추가 모델링 및 분석 정보 양쪽 모두를 지칭한다.Accordingly, data and various systems that use augmented reality content generators or other such transformation systems to modify content using this data can be used to create objects (e.g., faces, hands, bodies, cats, dogs, surfaces, objects, etc.), tracking these objects as they leave, enter, and move around the field of view in video frames, and modifying or transforming these objects as they are tracked. . In various embodiments, different methods may be used to achieve such transformations. For example, some embodiments may involve creating a three-dimensional mesh model of an object or objects, and using a transform and animated texture of the model within the video to accomplish the transformation. In other embodiments, tracking of points on an object may be used to place an image or texture (which may be two-dimensional or three-dimensional) at the tracking position. In still other embodiments, neural network analysis of video frames may be used to place images, models, or textures in content (e.g., images or frames of video). Accordingly, augmented reality content generators provide both the images, models, and textures used to create transformations of the content, as well as the additional modeling and analysis information needed to achieve these transformations with object detection, tracking, and placement. refers to

실시간 비디오 처리는 임의 종류의 컴퓨터화된 시스템의 메모리에 저장된 임의 종류의 비디오 데이터(예를 들어, 비디오 스트림, 비디오 파일 등)로 수행될 수 있다. 예를 들어, 사용자는 비디오 파일들을 로딩하고 이들을 디바이스의 메모리에 저장할 수 있거나, 또는 디바이스의 센서들을 사용하여 비디오 스트림을 생성할 수 있다. 그에 부가하여, 사람의 얼굴 및 인체의 부분, 동물, 또는 의자, 자동차, 또는 다른 객체와 같은 무생물과 같은, 임의의 객체들이 컴퓨터 애니메이션 모델을 사용하여 처리될 수 있다.Real-time video processing can be performed on any type of video data (eg, video stream, video file, etc.) stored in the memory of any type of computerized system. For example, a user can load video files and save them to the device's memory, or use the device's sensors to generate a video stream. In addition, any objects, such as human faces and parts of the human body, animals, or inanimate objects such as chairs, cars, or other objects, can be processed using computer animation models.

일부 실시예들에서, 변환될 콘텐츠와 함께 특정의 수정이 선택될 때, 변환될 요소가 컴퓨팅 디바이스에 의해 식별되고, 이어서 이들이 비디오의 프레임에 존재하는 경우 검출 및 추적된다. 객체의 요소들은 수정을 위한 요청에 따라 수정되고, 따라서 비디오 스트림의 프레임들을 변환한다. 비디오 스트림의 프레임들의 변환은 상이한 종류의 변환을 위한 상이한 방법들에 의해 수행될 수 있다. 예를 들어, 객체의 요소들의 형태들을 변경하는 것을 주로 가리키는 프레임들의 변환들에 대해, 객체의 요소 각각에 대한 특징적인 포인트들이 (예를 들어, ASM(Active Shape Model) 또는 다른 알려진 방법들을 사용하여) 계산된다. 이어서, 객체의 적어도 하나의 요소 각각에 대해 특징적인 포인트들에 기초한 메시가 생성된다. 이 메시는 비디오 스트림에서 객체의 요소들을 추적하는 다음 스테이지에서 사용된다. 추적 프로세스에서, 각각의 요소에 대한 언급된 메시는 각각의 요소의 포지션과 정렬된다. 그 다음, 메시 상에 추가 포인트들이 생성된다. 제1 포인트들의 제1 세트가 수정 요청에 기초하여 각각의 요소에 대해 생성되고, 제2 포인트들의 세트가 제1 포인트들의 세트 및 수정 요청에 기초하여 각각의 요소에 대해 생성된다. 이어서, 비디오 스트림의 프레임들은 제1 및 제2 포인트들의 세트들 및 메시에 기초하여 객체의 요소들을 수정함으로써 변환될 수 있다. 그러한 방법에서, 수정된 객체의 배경은 배경을 추적하고 수정함으로써 마찬가지로 변경 또는 왜곡될 수 있다. In some embodiments, when a particular modification is selected with content to be converted, the elements to be converted are identified by the computing device and then detected and tracked if they are present in a frame of video. The elements of the object are modified according to the request for modification, thus transforming the frames of the video stream. Transformation of frames of a video stream can be performed by different methods for different types of transformation. For example, for transformations of frames that primarily refer to changing the shapes of elements of an object, characteristic points for each of the elements of the object can be identified (e.g. using Active Shape Model (ASM) or other known methods). ) is calculated. A mesh is then created based on the characteristic points for each at least one element of the object. This mesh is used in the next stage to track the elements of the object in the video stream. In the tracking process, the referenced mesh for each element is aligned with the position of each element. Next, additional points are created on the mesh. A first set of points is created for each element based on the modification request, and a second set of points is created for each element based on the first set of points and the modification request. Frames of the video stream can then be transformed by modifying elements of the object based on the mesh and the sets of first and second points. In such a method, the background of the modified object can likewise be altered or distorted by tracking and modifying the background.

하나 이상의 실시예에서, 객체의 요소들을 사용하여 객체의 일부 영역들을 변경하는 변환들은 객체의 각각의 요소에 대한 특징적 포인트들을 계산하고 계산된 특징적 포인트들에 기초하여 메시를 생성하는 것에 의해 수행될 수 있다. 포인트들이 메시 상에서 생성된 다음, 포인트들에 기초한 다양한 영역들이 생성된다. 객체의 요소들은 이후 각각의 요소에 대한 영역을 적어도 하나의 요소 각각에 대한 포지션과 정렬시킴으로써 추적되고, 영역들의 속성들은 수정을 위한 요청에 기초하여 수정될 수 있고, 따라서 비디오 스트림의 프레임들을 변환한다. 수정에 대한 특정적 요청에 좌우되어 언급된 영역들의 속성들이 상이한 방식들로 변환될 수 있다. 이러한 수정들은 영역들의 컬러를 변경하는 것; 비디오 스트림의 프레임들로부터 영역들의 적어도 일부를 제거하는 것; 수정 요청에 기초하는 영역들에 하나 이상의 새로운 객체를 포함시키는 것; 및 영역 또는 객체의 요소들을 수정 또는 왜곡하는 것을 수반할 수 있다. 다양한 실시예들에서, 그러한 수정들 또는 다른 유사한 수정들의 임의의 조합이 사용될 수 있다. 애니메이션화될 특정 모델들에 대해, 일부 특성 포인트들은 모델 애니메이션에 대한 옵션들의 전체 상태-공간을 결정하기 위해 사용될 제어 포인트들로서 선택될 수 있다.In one or more embodiments, transformations that change some regions of an object using elements of the object may be performed by calculating characteristic points for each element of the object and generating a mesh based on the calculated characteristic points. there is. Points are created on the mesh, and then various areas are created based on the points. The elements of the object are then tracked by aligning the region for each element with the position for each of at least one element, and the properties of the regions can be modified based on the request for modification, thus transforming the frames of the video stream. . Depending on the specific request for modification, the properties of the mentioned areas may be transformed in different ways. These modifications include changing the color of areas; removing at least some of the regions from frames of the video stream; including one or more new objects in areas based on the modification request; and modifying or distorting elements of an area or object. In various embodiments, any combination of such modifications or other similar modifications may be used. For specific models to be animated, some characteristic points can be selected as control points to be used to determine the overall state-space of options for model animation.

얼굴 검출을 사용하여 이미지 데이터를 변환하는 컴퓨터 애니메이션 모델의 일부 실시예들에서, 얼굴은 특정 얼굴 검출 알고리즘(예를 들어, 비올라-존스(Viola-Jones))을 이용하여 이미지 상에서 검출된다. 그 후, 얼굴 특징 참조 포인트들을 검출하기 위해 이미지의 얼굴 영역에 ASM(Active Shape Model) 알고리즘이 적용된다. In some embodiments of a computer animation model that transforms image data using face detection, faces are detected on the image using a specific face detection algorithm (eg, Viola-Jones). Afterwards, the Active Shape Model (ASM) algorithm is applied to the facial area of the image to detect facial feature reference points.

다른 실시예들에서, 얼굴 검출에 적합한 다른 방법들 및 알고리즘들이 사용될 수 있다. 예를 들어, 일부 실시예들에서, 특징들은 고려 중인 이미지들의 대부분에 존재하는 구별가능한 포인트를 나타내는 랜드마크를 사용하여 위치된다. 예를 들어, 얼굴 랜드마크들에 대해, 좌측 눈 동공의 위치가 사용될 수 있다. 초기 랜드마크가 식별가능하지 않을 때(예를 들어, 사람이 안대를 갖는 경우), 보조 랜드마크들이 사용될 수 있다. 그러한 랜드마크 식별 절차들은 임의의 그러한 객체들에 대해 사용될 수 있다. 일부 실시예들에서, 랜드마크들의 세트는 형상을 형성한다. 형상들은 형상에서의 포인트들의 좌표들을 사용하여 벡터들로서 표현될 수 있다. 하나의 형상은 형상 포인트들 사이의 평균 유클리드 거리를 최소화하는 유사성(similarity) 변환(병진, 스케일링, 및 회전을 허용함)에 의해 또 다른 형상에 정렬된다. 평균 형상은 정렬된 트레이닝 형상들의 평균이다. In other embodiments, other methods and algorithms suitable for face detection may be used. For example, in some embodiments, features are located using landmarks that represent distinguishable points present in most of the images under consideration. For example, for facial landmarks, the location of the left eye pupil may be used. When the initial landmark is not identifiable (eg, if the person has an eye patch), auxiliary landmarks may be used. Such landmark identification procedures may be used for any such objects. In some embodiments, the set of landmarks forms a shape. Shapes can be expressed as vectors using the coordinates of points in the shape. One shape is aligned to another shape by a similarity transformation (allowing translation, scaling, and rotation) that minimizes the average Euclidean distance between shape points. The average shape is the average of the aligned training shapes.

일부 실시예들에서, 전역적 얼굴 검출기에 의해 결정된 얼굴의 포지션 및 크기에 정렬된 평균 형상으로부터의 랜드마크들에 대한 검색이 시작된다. 이어서 이러한 검색은 각각의 포인트 주변의 이미지 텍스처의 템플릿 매칭에 의해 형상 포인트들의 위치들을 조정함으로써 임시 형상을 제안하는 단계 및 이어서 수렴이 일어날 때까지 임시 형상을 전역적 형상 모델에 부합시키는 단계를 반복한다. 일부 시스템들에서, 개별 템플릿 매칭들은 신뢰할 수 없으며, 형상 모델은 약한 템플릿 매처(matcher)들의 결과들을 풀링(pool)하여 더 강한 전체 분류기를 형성한다. 조악한 해상도로부터 미세한 해상도까지, 이미지 피라미드에서의 각각의 레벨에서 탐색 전체가 반복된다.In some embodiments, a search begins for landmarks from an average shape aligned to the position and size of the face as determined by the global face detector. This search then repeats the steps of proposing a temporary shape by adjusting the positions of the shape points by template matching of the image texture around each point, followed by matching the temporary shape to the global shape model until convergence occurs. . In some systems, individual template matches are unreliable, and the shape model pools the results of weak template matchers to form a stronger overall classifier. The entire search is repeated at each level in the image pyramid, from coarse to fine resolution.

변환 시스템의 실시예들은, 적절한 사용자 경험, 계산 시간, 및 전력 소비를 유지하면서, 클라이언트 디바이스(예를 들어, 클라이언트 디바이스(102)) 상에서 이미지 또는 비디오 스트림을 캡처하고 클라이언트 디바이스(102) 상에서 국지적으로 복잡한 이미지 조작을 수행할 수 있다. 복잡한 이미지 조작은, 크기 및 형상 변화, 감정 전달(예를 들어, 찡그림으로부터 미소로 얼굴을 변화시킴), 상태 전달(예를 들어, 피사체의 노후화, 겉보기 나이 감소, 성별 변화), 스타일 전달, 그래픽 요소 애플리케이션, 및 클라이언트 디바이스(102) 상에서 효율적으로 실행되도록 구성된 컨볼루션 신경망에 의해 구현된 기타 임의의 적절한 이미지 또는 비디오 조작을 포함할 수 있다. Embodiments of the conversion system include capturing an image or video stream on a client device (e.g., client device 102) and converting it locally on the client device 102, while maintaining an appropriate user experience, computation time, and power consumption. Complex image manipulation can be performed. Complex image manipulation involves changing size and shape, conveying emotion (e.g., changing a face from a frown to a smile), conveying status (e.g., aging a subject, decreasing apparent age, changing gender), conveying style, and graphics. component applications, and any other suitable image or video manipulation implemented by a convolutional neural network configured to run efficiently on client device 102.

일부 예시적인 실시예들에서, 이미지 데이터를 변환하기 위한 컴퓨터 애니메이션 모델은, 사용자가 클라이언트 디바이스(102) 상에서 동작하는 메시징 클라이언트 애플리케이션(104)의 일부로서 동작하는 신경망을 갖는 클라이언트 디바이스(102)를 사용하여 사용자의 이미지 또는 비디오 스트림(예를 들어, 셀카)을 캡처할 수 있는 시스템에 의해 사용될 수 있다. 메시징 클라이언트 애플리케이션(104) 내에서 동작하는 변환 시스템은 이미지 또는 비디오 스트림 내의 얼굴의 존재를 결정하고, 이미지 데이터를 변환하기 위해 컴퓨터 애니메이션 모델과 연관된 수정 아이콘들을 제공하거나, 컴퓨터 애니메이션 모델은 본 명세서에 설명된 인터페이스와 연관된 것으로서 존재할 수 있다. 수정 아이콘들은 수정 동작의 일부로서 이미지 또는 비디오 스트림 내의 사용자의 얼굴을 수정하기 위한 기초일 수 있는 변경들을 포함한다. 일단 수정 아이콘이 선택되면, 변환 시스템은 선택된 수정 아이콘을 반영하기 위해 사용자의 이미지를 변환하는(예를 들어, 사용자 상에 미소짓은 얼굴을 생성하는) 프로세스를 개시한다. 일부 실시예들에서, 수정된 이미지 또는 비디오 스트림은 이미지 또는 비디오 스트림이 캡처되고 지정된 수정이 선택되자마자 모바일 클라이언트 디바이스 상에 디스플레이되는 그래픽 사용자 인터페이스에 프레젠테이션될 수 있다. 변환 시스템은 선택된 수정을 생성하고 적용하기 위해 이미지 또는 비디오 스트림의 일부에 대해 복소수 컨볼루션 신경망(complex convolutional neural network)을 구현할 수 있다. 즉, 사용자는 이미지 또는 비디오 스트림을 캡처하고, 일단 수정 아이콘이 선택되면 실시간으로 또는 거의 실시간으로 수정된 결과를 프레젠테이션받을 수 있다. 또한, 수정은 비디오 스트림이 캡처되고 있고 선택된 수정 아이콘이 토글링된 채로 남아 있는 동안 지속적일 수 있다. 이러한 수정들을 가능하게 하기 위해 머신 교육 신경망들이 사용될 수 있다.In some example embodiments, a computer animation model for converting image data is created when a user uses a client device 102 with a neural network running as part of a messaging client application 104 running on the client device 102. It can be used by a system to capture an image or video stream of a user (e.g., a selfie). A conversion system operating within messaging client application 104 determines the presence of a face in an image or video stream and provides modification icons associated with a computer-animated model for converting the image data, or a computer-animated model, as described herein. It can exist as something associated with an interface. Modification icons contain changes that may be the basis for modifying the user's face within an image or video stream as part of a modification operation. Once a modification icon is selected, the transformation system initiates the process of transforming the user's image (e.g., creating a smiling face on the user) to reflect the selected modification icon. In some embodiments, the modified image or video stream may be presented in a graphical user interface displayed on the mobile client device as soon as the image or video stream is captured and the specified modification is selected. The transformation system may implement a complex convolutional neural network on a portion of the image or video stream to generate and apply selected modifications. That is, users can capture an image or video stream and, once the edit icon is selected, receive a presentation of the modified results in real time or near real time. Additionally, modifications can be persistent as long as the video stream is being captured and the selected modification icon remains toggled. Machine training neural networks can be used to enable these modifications.

일부 실시예들에서, 변환 시스템에 의해 수행된 수정을 프레젠테이션하는 그래픽 사용자 인터페이스가 사용자에게 추가적인 상호작용 옵션들을 제공할 수 있다. 이러한 옵션들은 콘텐츠 캡처 및 특정 컴퓨터 애니메이션 모델의 선택(예를 들어, 콘텐츠 생성자 사용자 인터페이스로부터의 개시)을 개시하기 위해 사용되는 인터페이스에 기초할 수 있다. 다양한 실시예들에서, 수정 아이콘의 초기 선택 후에 수정이 지속될 수 있다. 사용자는 변환 시스템에 의해 수정되고 있는 얼굴을 탭핑하거나 다른 방식으로 선택함으로써 수정을 온 또는 오프로 토글링하고, 나중에 보거나 이미징 애플리케이션의 다른 영역들로 브라우징하기 위해 수정을 저장할 수 있다. 변환 시스템에 의해 다수의 얼굴이 수정되는 경우, 사용자는 그래픽 사용자 인터페이스 내에 수정되고 디스플레이되는 단일 얼굴을 태핑하거나 선택함으로써 전역적으로 수정을 토글링 온 또는 오프할 수 있다. 일부 실시예들에서, 개별 얼굴들은 다수의 얼굴의 그룹 중에서 개별적으로 수정될 수 있거나 또는 그러한 수정들은 그래픽 사용자 인터페이스 내에 디스플레이된 개별 얼굴 또는 일련의 개별 얼굴들을 태핑하거나 선택함으로써 개별적으로 토글링될 수 있다.In some embodiments, a graphical user interface presenting modifications performed by the conversion system may provide additional interaction options to the user. These options may be based on the interface used to initiate content capture and selection of a particular computer animation model (eg, launching from a content creator user interface). In various embodiments, editing may continue after the initial selection of the editing icon. The user can toggle the correction on or off by tapping or otherwise selecting the face being modified by the transformation system, and save the correction for later viewing or browsing to other areas of the imaging application. When multiple faces are modified by the transformation system, the user can toggle modification on or off globally by tapping or selecting a single face to be modified and displayed within the graphical user interface. In some embodiments, individual faces may be individually modified among a group of multiple faces or such modifications may be individually toggled by tapping or selecting an individual face or series of individual faces displayed within a graphical user interface. .

일부 예시적인 실시예들에서, 상이한 증강 현실 경험들(예를 들어, AR 콘텐츠 생성기들)이 대응하는 상이한 계층들에 적용될 수 있게 하는 그래픽 처리 파이프라인 아키텍처가 제공된다. 이러한 그래픽 처리 파이프라인은 메시징 클라이언트 애플리케이션(104)(또는 메시징 시스템(100))에 의한 렌더링을 위해 합성 미디어(예를 들어, 이미지 또는 비디오)에 포함되는 다수의 증강 현실 경험을 제공하기 위한 확장가능 렌더링 엔진을 제공한다.In some example embodiments, a graphics processing pipeline architecture is provided that allows different augmented reality experiences (e.g., AR content generators) to be applied to corresponding different layers. This graphics processing pipeline is extensible to provide multiple augmented reality experiences embedded in synthetic media (e.g., images or video) for rendering by messaging client application 104 (or messaging system 100). Provides a rendering engine.

위에 언급된 바와 같이, 비디오 테이블(310)은, 일 실시예에서, 그에 대해 레코드들이 메시지 테이블(314) 내에 유지되는 메시지들과 연관되는 비디오 데이터를 저장한다. 유사하게, 이미지 테이블(308)은 그에 대해 메시지 데이터가 엔티티 테이블(302)에 저장되는 메시지들과 연관된 이미지 데이터를 저장한다. 엔티티 테이블(302)은 주석 테이블(312)로부터의 다양한 주석들을 이미지 테이블(308) 및 비디오 테이블(310)에 저장된 다양한 이미지들 및 비디오들과 연관시킬 수 있다.As mentioned above, video table 310 stores video data associated with messages for which, in one embodiment, records are maintained in message table 314. Similarly, image table 308 stores image data associated with messages for which message data is stored in entity table 302. Entity table 302 may associate various annotations from annotation table 312 with various images and videos stored in image table 308 and video table 310.

스토리 테이블(306)은, 컬렉션(예를 들어, 스토리 또는 갤러리)으로 컴파일되는, 메시지들 및 연관된 이미지, 비디오 또는 오디오 데이터의 컬렉션들에 관한 데이터를 저장한다. 특정 컬렉션의 생성은 특정 사용자(예를 들어, 그에 대해 레코드가 엔티티 테이블(302)에서 유지되는 각각의 사용자)에 의해 개시될 수 있다. 사용자는 그 사용자에 의해 생성되고 전송/브로드캐스팅된 콘텐츠의 컬렉션의 형태로 "개인 스토리(personal story)"를 생성할 수 있다. 이를 위해, 메시징 클라이언트 애플리케이션(104)의 사용자 인터페이스는, 전송측 사용자가 자신의 개인 스토리에 특정 콘텐츠를 추가하는 것을 가능하게 하기 위해 사용자 선택가능한 아이콘을 포함할 수 있다.Story table 306 stores data about collections of messages and associated image, video or audio data, compiled into a collection (e.g., story or gallery). Creation of a particular collection may be initiated by a particular user (e.g., each user for whom a record is maintained in entity table 302). A user may create a “personal story” in the form of a collection of content created and transmitted/broadcast by that user. To this end, the user interface of the messaging client application 104 may include user-selectable icons to enable the sending user to add specific content to his or her personal story.

컬렉션은 또한, 수동으로, 자동으로 또는 수동 및 자동 기술의 조합을 이용하여 생성된 복수의 사용자로부터의 콘텐츠의 컬렉션인 "라이브 스토리"를 구성할 수 있다. 예를 들어, "라이브 스토리"는 다양한 로케이션 및 이벤트로부터의 사용자-제출 콘텐츠(user-submitted content)의 큐레이팅된 스트림(curated stream)을 구성할 수 있다. 그 클라이언트 디바이스들이 위치 서비스 가능하고 특정 시간에 공통 위치 이벤트에 있는 사용자들에게는, 예를 들어, 메시징 클라이언트 애플리케이션(104)의 사용자 인터페이스를 통해, 특정 라이브 스토리에 콘텐츠를 기여하는 옵션이 프레젠테이션될 수 있다. 라이브 스토리는 메시징 클라이언트 애플리케이션(104)에 의해 사용자에게, 자신의 위치에 기초하여 식별될 수 있다. 최종 결과는 커뮤니티 관점에서 말한 "라이브 스토리"이다.Collections may also constitute “live stories,” which are collections of content from multiple users created manually, automatically, or using a combination of manual and automated techniques. For example, “Live Stories” may constitute a curated stream of user-submitted content from various locations and events. Users whose client devices are location services enabled and who are at a common location event at a particular time may be presented with the option to contribute content to a particular live story, for example, via the user interface of the messaging client application 104. . Live stories may be identified to the user by messaging client application 104 based on their location. The end result is a “live story” told from a community perspective.

추가적인 타입의 콘텐츠 컬렉션은, 특정 지리적 위치(예를 들어, 단과대학 또는 대학 캠퍼스) 내에 위치하는 클라이언트 디바이스(102)를 갖는 사용자가 특정 컬렉션에 기여하는 것을 가능하게 하는, "위치 스토리(location story)"라고 알려져 있다. 일부 실시예들에서, 위치 스토리에 대한 기여는 최종 사용자가 특정 조직 또는 다른 엔티티에 속한다는(예를 들어, 대학 캠퍼스의 학생이라는) 것을 검증하기 위해 제2 인증 정도를 요구할 수 있다.An additional type of content collection is a “location story,” which allows users with client devices 102 located within a specific geographic location (e.g., a college or university campus) to contribute to a specific collection. "It is known. In some embodiments, contributions to location stories may require a second degree of authentication to verify that the end user belongs to a particular organization or other entity (e.g., is a student on a college campus).

도 4는 추가 메시징 클라이언트 애플리케이션(104) 또는 메시징 서버 애플리케이션(114)으로의 통신을 위해 메시징 클라이언트 애플리케이션(104)에 의해 생성된, 일부 실시예들에 따른, 메시지(400)의 구조를 예시하는 개략도이다. 특정 메시지(400)의 콘텐츠는 메시징 서버 애플리케이션(114)에 의해 액세스 가능한, 데이터베이스(120) 내에 저장된 메시지 테이블(314)을 채우는 데 사용된다. 유사하게, 메시지(400)의 콘텐츠는 클라이언트 디바이스(102) 또는 애플리케이션 서버(112)의 "수송중" 또는 "비행중" 데이터로서 메모리에 저장된다. 메시지(400)는 다음과 같은 컴포넌트들을 포함하는 것으로 도시된다:4 is a schematic diagram illustrating the structure of a message 400 generated by a messaging client application 104 for communication to an additional messaging client application 104 or a messaging server application 114, according to some embodiments. am. The content of a particular message 400 is used to populate a message table 314 stored within database 120, accessible by messaging server application 114. Similarly, the content of message 400 is stored in memory as “in-transit” or “in-flight” data of client device 102 or application server 112. Message 400 is shown as including the following components:

· 메시지 식별자(402): 메시지(400)를 식별하는 고유 식별자.· Message identifier 402: A unique identifier that identifies the message 400.

· 메시지 텍스트 페이로드(404): 클라이언트 디바이스(102)의 사용자 인터페이스를 통해 사용자에 의해 생성되고 메시지(400)에 포함되는 텍스트.· Message text payload 404: Text generated by the user through the user interface of the client device 102 and included in the message 400.

· 메시지 이미지 페이로드(406): 클라이언트 디바이스(102)의 카메라 컴포넌트에 의해 캡처되거나 클라이언트 디바이스(102)의 메모리 컴포넌트로부터 검색되고, 메시지(400)에 포함되는 이미지 데이터.· Message image payload 406: Image data captured by a camera component of client device 102 or retrieved from a memory component of client device 102 and included in message 400.

· 메시지 비디오 페이로드(408): 카메라 컴포넌트에 의해 캡처되거나 클라이언트 디바이스(102)의 메모리 컴포넌트로부터 검색되고 메시지(400)에 포함되는 비디오 데이터.· Message video payload 408: Video data captured by a camera component or retrieved from a memory component of client device 102 and included in message 400.

· 메시지 오디오 페이로드(410): 마이크로폰에 의해 캡처되거나 또는 클라이언트 디바이스(102)의 메모리 컴포넌트로부터 검색되고 메시지(400)에 포함되는 오디오 데이터.· Message audio payload 410: Audio data captured by a microphone or retrieved from a memory component of client device 102 and included in message 400.

· 메시지 주석(412): 메시지(400)의 메시지 이미지 페이로드(406), 메시지 비디오 페이로드(408), 또는 메시지 오디오 페이로드(410)에 적용될 주석을 나타내는 주석 데이터(예를 들어, 필터, 스티커 또는 다른 강화물).· Message annotation 412: Annotation data (e.g., filter, stickers or other reinforcers).

· 메시지 지속기간 파라미터(414): 메시지의 콘텐츠(예를 들어, 메시지 이미지 페이로드(406), 메시지 비디오 페이로드(408), 메시지 오디오 페이로드(410))가 메시징 클라이언트 애플리케이션(104)을 통해 사용자에게 프레젠테이션되거나 액세스 가능하게 되는 시간의 양을 초 단위로 표시하는 파라미터 값.· Message duration parameter 414: The content of the message (e.g., message image payload 406, message video payload 408, message audio payload 410) is transmitted via messaging client application 104. A parameter value that indicates the amount of time, in seconds, that will be presented or made accessible to the user.

· 메시지 지오로케이션 파라미터(416): 메시지의 콘텐츠 페이로드와 연관된 지오로케이션 데이터(예를 들어, 위도 및 경도 좌표). 다수의 메시지 지오로케이션 파라미터(416) 값이 페이로드에 포함될 수 있고, 이들 파라미터 값들 각각은 콘텐츠(예를 들어, 메시지 이미지 페이로드(406) 내의 특정 이미지, 또는 메시지 비디오 페이로드(408) 내의 특정 비디오)에 포함된 콘텐츠 항목들에 관하여 연관된다.· Message geolocation parameters 416: Geolocation data (e.g., latitude and longitude coordinates) associated with the content payload of the message. Multiple message geolocation parameter 416 values may be included in the payload, each of which may be associated with content (e.g., a specific image within the message image payload 406, or a specific image within the message video payload 408). related to the content items included in the video).

· 메시지 스토리 식별자(418): 그와 메시지(400)의 메시지 이미지 페이로드(406) 내의 특정 콘텐츠 항목이 연관되어 있는 하나 이상의 콘텐츠 컬렉션(예를 들어, "스토리")을 식별하는 식별자 값. 예를 들어, 메시지 이미지 페이로드(406) 내의 다수의 이미지는 각각 식별자 값들을 사용하여 다수의 콘텐츠 컬렉션과 연관될 수 있다.· Message Story Identifier 418: An identifier value that identifies one or more content collections (e.g., “stories”) with which a particular content item within the message image payload 406 of message 400 is associated. For example, multiple images within message image payload 406 may each be associated with multiple content collections using identifier values.

· 메시지 태그(420): 각각의 메시지(400)는 다수의 태그로 태깅될 수 있고, 그 각각은 메시지 페이로드에 포함된 콘텐츠의 주제를 나타낸다. 예를 들어, 메시지 이미지 페이로드(406)에 포함된 특정 이미지가 동물(예를 들어, 사자)을 묘사하는 경우, 관련 동물을 나타내는 태그 값이 메시지 태그(420) 내에 포함될 수 있다. 태그 값은, 사용자 입력에 기초하여 수동으로 생성되거나, 예를 들어, 이미지 인식을 사용하여 자동으로 생성될 수 있다.· Message tag 420: Each message 400 may be tagged with multiple tags, each of which represents the subject of the content included in the message payload. For example, if a particular image included in message image payload 406 depicts an animal (e.g., a lion), a tag value representing the relevant animal may be included within message tag 420. Tag values may be generated manually based on user input, or automatically using image recognition, for example.

· 메시지 발신자 식별자(422): 그를 통해 메시지(400)가 생성되었고 그로부터 메시지(400)가 전송된 클라이언트 디바이스(102)의 사용자를 나타내는 식별자(예를 들어, 메시징 시스템 식별자, 이메일 주소, 또는 디바이스 식별자).· Message sender identifier 422: An identifier representing the user of the client device 102 through which the message 400 was created and from which the message 400 was sent (e.g., a messaging system identifier, email address, or device identifier) ).

· 메시지 수신자 식별자(424): 메시지(400)가 어드레싱되는 클라이언트 디바이스(102)의 사용자를 나타내는 식별자(예를 들어, 메시징 시스템 식별자, 이메일 주소 또는 디바이스 식별자).· Message recipient identifier 424: An identifier representing the user of client device 102 to which message 400 is addressed (e.g., messaging system identifier, email address, or device identifier).

메시지(400)의 다양한 컴포넌트들의 콘텐츠(예를 들어 값들)는 그 안에 콘텐츠 데이터 값들이 저장되어 있는 테이블 내의 위치들에 대한 포인터들일 수 있다. 예를 들어, 메시지 이미지 페이로드(406) 내의 이미지 값은 이미지 테이블(308) 내의 위치에 대한 포인터(또는 그 주소)일 수 있다. 유사하게, 메시지 비디오 페이로드(408) 내의 값들은 비디오 테이블(310) 내에 저장된 데이터를 가리킬 수 있고, 메시지 주석(412) 내에 저장된 값들은 주석 테이블(312)에 저장된 데이터를 가리킬 수 있고, 메시지 스토리 식별자(418) 내에 저장된 값들은 스토리 테이블(306)에 저장된 데이터를 가리킬 수 있고, 메시지 발신자 식별자(422) 및 메시지 수신자 식별자(424) 내에 저장된 값들은 엔티티 테이블(302) 내에 저장된 사용자 레코드들을 가리킬 수 있다.The content (e.g., values) of the various components of message 400 may be pointers to locations within a table where the content data values are stored. For example, an image value in the message image payload 406 may be a pointer to a location (or its address) in the image table 308. Similarly, values in message video payload 408 can point to data stored in video table 310, values stored in message annotation 412 can point to data stored in annotation table 312, and message story Values stored in identifier 418 may point to data stored in story table 306, and values stored in message sender identifier 422 and message recipient identifier 424 may point to user records stored in entity table 302. there is.

도 5는 그에 관하여 콘텐츠(예를 들어, 단기적 메시지(502), 및 데이터의 연관된 멀티미디어 페이로드) 또는 콘텐츠 컬렉션(예를 들어, 단기적 메시지 그룹(504))에의 액세스가 시간-제한될(예를 들어, 단기적으로 될) 수 있는, 액세스-제한 프로세스(500)를 예시하는 개략도이다.5 illustrates an example in which access to content (e.g., a short-lived message 502, and an associated multimedia payload of data) or a collection of content (e.g., a short-lived message group 504) may be time-limited (e.g. Here is a schematic diagram illustrating an access-restriction process 500, which can be (e.g., short term).

단기적 메시지(502)는 메시지 지속기간 파라미터(506)와 연관되는 것으로 도시되어 있고, 그 값은 단기적 메시지(502)가 메시징 클라이언트 애플리케이션(104)에 의해 단기적 메시지(502)의 수신측 사용자에게 디스플레이될 시간의 양을 결정한다. 일 실시예에서, 전송측 사용자가 메시지 지속기간 파라미터(506)를 사용하여 지정하는 시간량에 좌우되어, 최대 10초 동안 수신측 사용자가 단기적 메시지(502)를 볼 수 있다.Short-term message 502 is shown to be associated with a message duration parameter 506, the value of which determines how short-term message 502 will be displayed by messaging client application 104 to the user receiving short-term message 502. Decide on the amount of time. In one embodiment, the short-lived message 502 may be visible to the receiving user for up to 10 seconds, depending on the amount of time the sending user specifies using the message duration parameter 506.

메시지 지속기간 파라미터(506) 및 메시지 수신자 식별자(424)는 메시지 타이머(512)에 대한 입력들인 것으로 도시되어 있으며, 메시지 타이머는 단기적 메시지(502)가 메시지 수신자 식별자(424)에 의해 식별된 특정 수신측 사용자에게 보여지는 시간의 양을 결정하는 것을 담당한다. 특히, 단기적 메시지(502)는 메시지 지속기간 파라미터(506)의 값에 의해 결정된 기간 동안 관련 수신측 사용자에게만 보여질 것이다. 메시지 타이머(512)는 수신측 사용자에게 콘텐츠(예를 들어, 단기적 메시지(502))의 디스플레이의 전체 타이밍을 담당하는 더 일반화된 단기적 타이머 시스템(202)에 출력을 제공하는 것으로 도시된다.The message duration parameter 506 and message recipient identifier 424 are shown as inputs to a message timer 512, which determines whether a short-lived message 502 will be received at a specific reception identified by message recipient identifier 424. Responsible for determining the amount of time a side is visible to the user. In particular, short-lived messages 502 will only be visible to the relevant recipient user for a period of time determined by the value of the message duration parameter 506. Message timer 512 is shown providing output to the more general short-term timer system 202, which is responsible for the overall timing of the display of content (e.g., short-term message 502) to a receiving user.

단기적 메시지(502)는 단기적 메시지 그룹(504)(예를 들어, 개인 스토리 또는 이벤트 스토리에서의 메시지들의 컬렉션) 내에 포함되는 것으로 도 5에 도시된다. 단기적 메시지 그룹(504)은 연관된 그룹 지속기간 파라미터(508)를 가지며, 그 값은 그에 대해 단기적 메시지 그룹(504)이 메시징 시스템(100)의 사용자들에게 프레젠테이션되고 액세스 가능한 시간 지속기간을 결정한다. 그룹 지속기간 파라미터(508)는, 예를 들어, 음악 콘서트의 지속기간일 수 있고, 여기서 단기적 메시지 그룹(504)은 그 콘서트에 관련된 콘텐츠의 컬렉션이다. 대안적으로, 사용자(소유 사용자 또는 큐레이터 사용자)는 단기적 메시지 그룹(504)의 셋업 및 생성을 수행할 때 그룹 지속기간 파라미터(508)에 대한 값을 지정할 수 있다.Short-term message 502 is shown in FIG. 5 as being included within a short-term message group 504 (e.g., a collection of messages in a personal story or event story). Short-term message group 504 has an associated group duration parameter 508, the value of which determines the time duration for which short-term message group 504 is presented and accessible to users of messaging system 100. Group duration parameter 508 may be, for example, the duration of a music concert, where short-lived message group 504 is a collection of content related to that concert. Alternatively, a user (either an owning user or a curator user) may specify a value for the group duration parameter 508 when performing setup and creation of a short-lived message group 504.

추가적으로, 단기적 메시지 그룹(504) 내의 각각의 단기적 메시지(502)는 연관된 그룹 참여 파라미터(510)를 가지며, 그 값은 그에 대해 단기적 메시지(502)가 단기적 메시지 그룹(504)의 컨텍스트 내에서 액세스 가능할 시간의 지속기간을 결정한다. 따라서, 특정 단기적 메시지 그룹(504)은, 단기적 메시지 그룹(504) 자체가 그룹 지속기간 파라미터(508)의 관점에서 만료되기 이전에, 단기적 메시지 그룹(504)의 컨텍스트 내에서 "만료"되고 액세스 불가능하게 될 수 있다. 그룹 지속기간 파라미터(508), 그룹 참여 파라미터(510), 및 메시지 수신자 식별자(424)는 각각 그룹 타이머(514)에 대한 입력을 제공하며, 이는 동작중에, 맨 먼저, 단기적 메시지 그룹(504)의 특정 단기적 메시지(502)가 특정 수신측 사용자에게 디스플레이될 것인지, 그리고 그렇다면, 얼마나 오랫동안 디스플레이될지를 결정한다. 단기적 메시지 그룹(504)은 또한 메시지 수신자 식별자(424)의 결과로서 특정 수신측 사용자의 아이덴티티를 인식한다는 점에 유의한다.Additionally, each short-term message 502 within a short-term message group 504 has an associated group participation parameter 510, the value of which determines whether the short-term message 502 will be accessible within the context of the short-term message group 504. Determines the duration of time. Accordingly, a particular short-term message group 504 is “expired” and inaccessible within the context of the short-term message group 504 before the short-term message group 504 itself expires in terms of the group duration parameter 508. It can be done. The group duration parameter 508, group participation parameter 510, and message recipient identifier 424 each provide inputs to the group timer 514, which, during operation, first sets Determines whether a particular short-term message 502 will be displayed to a particular recipient user and, if so, for how long. Note that the short-term message group 504 also recognizes the identity of a particular recipient user as a result of the message recipient identifier 424.

따라서, 그룹 타이머(514)는 연관된 단기적 메시지 그룹(504)뿐만 아니라, 단기적 메시지 그룹(504)에 포함된 개별 단기적 메시지(502)의 전체 수명을 동작적으로 제어한다. 일 실시예에서, 단기적 메시지 그룹(504) 내의 각각의 및 모든 단기적 메시지(502)는 그룹 지속기간 파라미터(508)에 의해 지정된 시간-기간 동안 볼 수 있고 액세스가능하게 유지된다. 추가 실시예에서, 특정 단기적 메시지(502)는, 그룹 참여 파라미터(510)에 기초하여, 단기적 메시지 그룹(504)의 컨텍스트 내에서 만료될 수 있다. 메시지 지속기간 파라미터(506)는 단기적 메시지 그룹(504)의 맥락 내에서도, 특정 단기적 메시지(502)가 수신측 사용자에게 디스플레이되는 시간의 지속기간을 여전히 결정할 수 있다는 점에 유의한다. 따라서, 메시지 지속기간 파라미터(506)는 수신측 사용자가 단기적 메시지 그룹(504)의 맥락 내부 또는 외부에서 단기적 메시지(502)를 시청하고 있는지에 관계없이, 특정 단기적 메시지(502)가 수신측 사용자에게 디스플레이되는 시간의 지속기간을 결정한다.Accordingly, the group timer 514 operationally controls the overall lifetime of the associated short-term message group 504, as well as the individual short-term messages 502 included in the short-term message group 504. In one embodiment, each and every short-term message 502 within a short-term message group 504 remains visible and accessible for a time-period specified by the group duration parameter 508. In a further embodiment, a particular short-term message 502 may expire within the context of a short-term message group 504 based on the group participation parameter 510. Note that the message duration parameter 506 may still determine the duration of time that a particular short-term message 502 is displayed to the receiving user, even within the context of a short-term message group 504. Accordingly, the message duration parameter 506 determines whether a particular short-term message 502 will be sent to the receiving user, regardless of whether the receiving user is viewing the short-term message 502 within or outside the context of the short-term message group 504. Determines the duration of time displayed.

단기적 타이머 시스템(202)은 더욱이 동작중에 그것이 연관된 스토리 참여 파라미터(510)를 초과했다는 결정에 기초하여 단기적 메시지 그룹(504)으로부터 특정 단기적 메시지(502)를 제거할 수 있다. 예를 들어, 전송측 사용자가 포스팅(posting)으로부터 24 시간의 그룹 참여 파라미터(510)를 확립했을 때, 단기적 타이머 시스템(202)은 지정된 24 시간 후에 단기적 메시지 그룹(504)으로부터 관련된 단기적 메시지(502)를 제거할 것이다. 단기적 타이머 시스템(202)은 또한 단기적 메시지 그룹(504) 내의 각각의 그리고 모든 단기적 메시지(502)에 대한 그룹 참여 파라미터(510)가 만료되었을 때, 또는 단기적 메시지 그룹(504) 자체가 그룹 지속기간 파라미터(508)에 관하여 만료될 때 단기적 메시지 그룹(504)을 제거하도록 동작한다.The short-lived timer system 202 may furthermore, during operation, remove a particular short-lived message 502 from the short-lived message group 504 based on a determination that it has exceeded the associated story participation parameter 510 . For example, when a sending-side user has established a group participation parameter 510 of 24 hours from posting, the short-term timer system 202 may transmit related short-term messages 502 from the short-term message group 504 after the specified 24 hours. ) will be removed. The short-lived timer system 202 may also determine when the group participation parameter 510 for each and every short-lived message 502 within the short-lived message group 504 has expired, or when the short-lived message group 504 itself has expired the group duration parameter. Operates to remove a short-lived message group 504 when it expires with respect to 508 .

특정 사용 사례들에서, 특정 단기적 메시지 그룹(504)의 생성자는 무한 그룹 지속기간 파라미터(508)를 지정할 수 있다. 이 경우, 단기적 메시지 그룹(504) 내의 마지막 나머지 단기적 메시지(502)에 대한 그룹 참여 파라미터(510)의 만료는 단기적 메시지 그룹(504) 자체가 만료될 때를 결정할 것이다. 이 경우, 단기적 메시지 그룹(504)에 추가된 새로운 단기적 메시지(502)는, 새로운 그룹 참여 파라미터(510)와 함께, 단기적 메시지 그룹(504)의 수명을 그룹 참여 파라미터(510)의 값과 동일하도록 효과적으로 연장한다.In certain use cases, the creator of a particular short-term message group 504 may specify an infinite group duration parameter 508. In this case, the expiration of the group participation parameter 510 for the last remaining short-term message 502 within the short-term message group 504 will determine when the short-term message group 504 itself expires. In this case, the new short-term message 502 added to the short-term message group 504, together with the new group participation parameter 510, causes the lifetime of the short-term message group 504 to be equal to the value of the group participation parameter 510. Extend effectively.

단기적 타이머 시스템(202)이 단기적 메시지 그룹(504)이 만료된 것으로(예를 들어, 더 이상 액세스 가능하지 않은 것으로) 결정하는 것에 응답하여, 단기적 타이머 시스템(202)은 메시징 시스템(100)(및 예를 들어, 구체적으로 메시징 클라이언트 애플리케이션(104))과 통신하여, 관련된 단기적 메시지 그룹(504)과 연관된 표시(예를 들어, 아이콘)가 더 이상 메시징 클라이언트 애플리케이션(104)의 사용자 인터페이스 내에 디스플레이되지 않게 한다. 유사하게, 단기적 타이머 시스템(202)이 특정 단기적 메시지(502)에 대한 메시지 지속기간 파라미터(506)가 만료된 것으로 결정할 때, 단기적 타이머 시스템(202)은 메시징 클라이언트 애플리케이션(104)으로 하여금 단기적 메시지(502)와 연관된 표시(예를 들어, 아이콘 또는 텍스트 식별)를 더 이상 디스플레이하지 않게 한다.In response to the short-term timer system 202 determining that the short-term message group 504 has expired (e.g., is no longer accessible), the short-term timer system 202 sends the messaging system 100 (and For example, specifically communicate with the messaging client application 104 so that an indication (e.g., an icon) associated with the relevant short-lived message group 504 is no longer displayed within the user interface of the messaging client application 104. do. Similarly, when the short-term timer system 202 determines that the message duration parameter 506 for a particular short-term message 502 has expired, the short-term timer system 202 causes the messaging client application 104 to send the short-term message ( 502) so as not to display any further indications (e.g., icons or text identifications) associated therewith.

본 명세서에서 논의된 바와 같이, 대상 인프라스트럭처는 메시징 시스템(100)의 다양한 컴포넌트들 전체에 걸쳐, 본 명세서에서 3D 메시지들로 지칭되는 대화형 3D 미디어의 생성 및 공유를 지원한다. 본 명세서에 기술된 인프라스트럭처는 다른 형태의 3D 미디어가 대상 시스템에 걸쳐 제공될 수 있게 하며, 이는 깊이 기반 미디어가 메시징 시스템(100)에 걸쳐 그리고 사진 및 비디오 메시지와 함께 공유될 수 있게 한다. 본 명세서에 설명된 예시적인 실시예들에서, 메시지들은 (예를 들어, 3D 메시지들 및/또는 다른 메시지들이 메모리 또는 데이터베이스에 저장되는 경우) 라이브 카메라로부터 또는 스토리지로부터 시스템에 진입할 수 있다. 대상 시스템은 모션 센서 입력을 지원하고, 깊이 데이터의 전송 및 저장, 및 외부 효과들 및 자산 데이터의 로딩을 관리한다.As discussed herein, the destination infrastructure supports the creation and sharing of interactive 3D media, referred to herein as 3D messages, across the various components of messaging system 100. The infrastructure described herein allows other forms of 3D media to be provided across target systems, allowing depth-based media to be shared across messaging system 100 and along with photo and video messages. In example embodiments described herein, messages may enter the system from a live camera or from storage (e.g., if 3D messages and/or other messages are stored in memory or a database). The target system supports motion sensor input, manages transmission and storage of depth data, and loading of external effects and asset data.

전술한 바와 같이, 3차원(3D) 메시지는 적어도 이미지 및 깊이 데이터를 포함하는 대화형 3D 이미지를 지칭한다. 예시적인 실시예에서, 전통적인 이미지 텍스처 이외에, 카메라가 보는 것의 공간 상세/지오메트리를 시각화하기 위해 대상 시스템을 사용하여 3D 메시지가 렌더링된다. 시청자가 클라이언트 디바이스를 이동시킴으로써 이 3D 메시지와 상호작용할 때, 움직임은 이미지 및 지오메트리가 시청자에게 렌더링되는 관점에서의 대응하는 변화를 트리거한다.As mentioned above, a three-dimensional (3D) message refers to an interactive 3D image that includes at least image and depth data. In an example embodiment, in addition to traditional image textures, the 3D message is rendered using an object system to visualize the spatial details/geometry of what the camera sees. When a viewer interacts with this 3D message by moving the client device, the movement triggers a corresponding change in perspective from which images and geometry are rendered to the viewer.

일 실시예에서, 대상 시스템은 깊이 데이터를 처리하기 위해 시스템의 다른 컴포넌트들과 함께 동작하는 3D 효과들을 제공하며, 이는 메시지들 내의 상이한 깊이 평면들에 서식할 수 있는 입자들, 셰이더들, 2D 자산들 및 3D 지오메트리를 제공한다. 예에서, 이것은 대상 시스템이 이러한 메시지들 내의 자산들의 물리적 배치 및 시각적 외관을 변경함으로써 (예를 들어, 모션 센서 데이터를 통해 검출되는 바와 같은) 사용자의 상호작용들을 폐색하고 응답하는 렌즈(LENSES) 및 효과들을 제공하는 것을 가능하게 한다. 예에서, 본 명세서에 설명된 바와 같은 3D 효과들은 사용자에 대해 실시간 방식으로 렌더링되며, 이는 또한 상이한 입자들 및 층들이 그러한 입자들 및 층들을 보는 각각의 사용자에 대해 상이한 포지션들에 배치되는 것을 가능하게 한다. In one embodiment, the target system provides 3D effects that work with other components of the system to process depth data, including particles, shaders, and 2D assets that can populate different depth planes within messages. and 3D geometry. In an example, this would allow the target system to occlude and respond to user interactions (e.g., as detected through motion sensor data) by changing the physical placement and visual appearance of assets within these messages, and It makes it possible to provide effects. In an example, 3D effects as described herein are rendered in real-time to the user, which also allows different particles and layers to be placed in different positions for each user viewing those particles and layers. Let it be done.

본 명세서에서 논의되는 바와 같이, 2D 부착물 또는 3D 부착물은 증강 현실 콘텐츠 생성기에 부착되는 것과 유사한 방식으로 3D 효과에 부착될 수 있는 이미지들(예를 들어, 스프라이트들) 또는 지오메트리(예를 들어, 3D 객체에 대응함)를 지칭한다. As discussed herein, 2D attachments or 3D attachments are images (e.g., sprites) or geometry (e.g., 3D attachments) that can be attached to a 3D effect in a similar manner to those attached to an augmented reality content generator. corresponds to an object).

본 명세서에 설명된 바와 같이, 얼굴 효과들은 증강 현실 콘텐츠 생성기 및/또는 다른 미디어 오버레이를 통해 얼굴을 변형 또는 미화하기 위해 얼굴을 포함하는 이미지에 적용될 수 있는 미화, 얼굴 리터치, 스트레칭 및 다른 효과들을 지칭한다. As described herein, facial effects refer to beautification, facial retouching, stretching and other effects that can be applied to an image containing a face to transform or beautify the face through an augmented reality content generator and/or other media overlay. do.

본원에서 언급된 바와 같이, 자이로-기반 상호작용은 주어진 클라이언트 디바이스의 회전이 효과의 양태를 변경하기 위한 입력으로서 사용되는 상호작용의 타입을 지칭한다(예를 들어, 장면에서의 광의 컬러를 변경하기 위하여 x-축을 따라 폰을 회전시킴).As referred to herein, gyro-based interaction refers to a type of interaction in which the rotation of a given client device is used as input to change aspects of an effect (e.g., changing the color of light in a scene). (rotate the phone along the x-axis).

본 명세서에서 언급된 바와 같이, 3D 증강 현실 콘텐츠 생성기는 3D 메시지에 추가될 수 있고 이미지 및/또는 깊이 데이터를 수정하는 실시간 특수 효과 및/또는 사운드를 지칭한다.As referred to herein, 3D augmented reality content generator refers to real-time special effects and/or sounds that can be added to 3D messages and modify image and/or depth data.

실시예에서, 사용자가 3D 메시지의 캡처를 개시할 때, 대상 시스템은 깊이 정보 및 카메라 이미지에 기초하여 생성된 메시를 사용하여 번들링된 또는 동적 깊이 효과 증강 현실 콘텐츠 생성기를 적용한다. 3D 메시지를 클라우드에(예를 들어, 메시징 서버 시스템(108)의 데이터베이스(120)에) 저장한 후에 동일한 효과를 재생성하기 위해, 원시 입력 데이터 또는 생성된 메시, 증강 현실 콘텐츠 생성기 등이 저장된다. 예에서, 저장 요건들을 감소시키기 위해, 카메라 이미지 이외에 원시 입력 데이터가 저장된다. 이하의 논의는 일부 실시예들에 따른 이러한 3D 메시지와 관련하여 저장되는 예시적인 데이터에 관한 것이다. In an embodiment, when a user initiates capture of a 3D message, the target system applies a bundled or dynamic depth effect augmented reality content generator using a mesh generated based on depth information and camera images. To recreate the same effect after storing the 3D message in the cloud (e.g., in database 120 of messaging server system 108), raw input data or generated meshes, augmented reality content generators, etc. are stored. In an example, raw input data is stored in addition to the camera image to reduce storage requirements. The discussion below relates to example data stored in connection with such 3D messages in accordance with some embodiments.

도 6은 도 4에서 전술한 바와 같이, 메시징 클라이언트 애플리케이션(104)에 의해 생성된, 일부 실시예들에 따른, 주어진 3D 메시지에 대응하는 추가 정보를 포함하는 메시지 주석들(412)의 구조를 예시하는 개략도이다. 실시예에서, 도 6에 도시된 추가 데이터를 포함하는 특정 메시지(400)의 콘텐츠는 주어진 3D 메시지에 대해 데이터베이스(120) 내에 저장된 메시지 테이블(314)을 채우기 위해 사용되며, 이는 이후 메시징 서버 애플리케이션(114) 및/또는 메시징 클라이언트 애플리케이션(104)에 의해 액세스 가능하다. 예시적인 실시예에 예시된 바와 같이, 메시지 주석(412)은 3D 메시지에 대한 데이터에 대응하는 다음의 컴포넌트들을 포함한다:FIG. 6 illustrates the structure of message annotations 412 containing additional information corresponding to a given 3D message, according to some embodiments, generated by messaging client application 104, as described above in FIG. 4. This is a schematic diagram. In an embodiment, the content of a particular message 400, including the additional data shown in Figure 6, is used to populate a message table 314 stored within database 120 for a given 3D message, which is then used by the messaging server application ( 114) and/or messaging client application 104. As illustrated in the example embodiment, message annotation 412 includes the following components corresponding to data for the 3D message:

· 증강 현실 콘텐츠 생성기 식별자(652): 메시지에서 이용되는 증강 현실 콘텐츠 생성기의 식별자(예를 들어, 3D 효과, 렌즈(LENSES), 필터 등을 포함하는 애니메이션 및/또는 효과)· Augmented reality content generator identifier 652: Identifier of the augmented reality content generator used in the message (e.g., animations and/or effects including 3D effects, LENSES, filters, etc.)

· 메시지 식별자(654): 메시지의 식별자· Message identifier (654): Identifier of the message

· 자산 식별자들(656): 메시지 내의 자산들에 대한 식별자들의 세트. 예를 들어, 특정의 3D 증강 현실 콘텐츠 발생기에 의해 결정되는 다수의 자산에 대해 각자의 자산 식별자들이 포함될 수 있다. 실시예에서, 이러한 자산들은 발신자 측에서 3D 증강 현실 콘텐츠 생성기에 의해 생성되고, 메시징 서버 애플리케이션(114)에 업로드되고, 메시지를 재생성하기 위해 수신자 측에서 3D 증강 현실 콘텐츠 생성기에 의해 이용된다. 전형적인 자산들의 예들은 다음을 포함한다: · Asset identifiers 656: A set of identifiers for assets in the message. For example, individual asset identifiers may be included for multiple assets as determined by a particular 3D augmented reality content generator. In an embodiment, these assets are created by a 3D augmented reality content generator on the sender side, uploaded to messaging server application 114, and used by a 3D augmented reality content generator on the recipient side to recreate the message. Examples of typical assets include:

o 카메라에 의해 캡처된 원래의 스틸 RGB 이미지 o Original still RGB image captured by the camera

o 배경으로부터 분리된 사용자의 3D 캡처를 제공하는 조합된 깊이 맵 및 인물사진 분할 마스크. 실시예에서 그리고 본 명세서에서 추가로 기술하면, 이것은 원시 깊이 데이터 및 인물사진 분할로부터 3D 증강 현실 콘텐츠 생성기에서의 렌더 패스들에 의해 생성되며, 인물사진 분할은 송신을 위해 멀티채널 이미지(예를 들어, 알파 채널을 갖는 RGB 채널들)로 패킹될 수 있다.o A combined depth map and portrait segmentation mask that provides a 3D capture of the user isolated from the background. In an embodiment and further described herein, this is generated by render passes in a 3D augmented reality content generator from raw depth data and portrait segmentation, where the portrait segmentation produces a multi-channel image for transmission (e.g. , RGB channels with an alpha channel).

o 3D 사용자 캡처 뒤에 배치할 블러링된 배경 이미지. 실시예에서, 이것은 사용자 뒤의 이미지 콘텐츠의 인페인팅을 수행하기 위해 인물사진 분할 마스크를 사용하는 증강 현실 콘텐츠 생성기에서의 렌더 패스들에 의해 생성된다. o Blurred background image to be placed behind the 3D user capture. In an embodiment, this is generated by render passes in an augmented reality content generator that uses a portrait segmentation mask to perform inpainting of the image content behind the user.

o 3D 깊이 데이터(이하에서 추가로 언급됨)o 3D depth data (further mentioned below)

o 인물사진 분할 마스크(이하에서 추가로 언급됨)o Portrait segmentation mask (further mentioned below)

· 깊이 데이터(658): 원시 깊이 데이터(예를 들어, 16 비트를 갖는 640x360 및/또는 깊이 맵· Depth data 658: Raw depth data (e.g. 640x360 with 16 bits and/or depth map

· 마스크 데이터(660): 원시 깊이 데이터 및/또는 깊이 맵에 기초한 인물사진 분할 마스크에 대응하는 데이터· Mask data 660: data corresponding to a portrait segmentation mask based on raw depth data and/or depth map.

· 다음을 포함하지만 이에 한정되지 않는 추가 메타데이터에 대응하는 메타데이터(662):· Metadata (662) corresponding to additional metadata, including but not limited to:

o 카메라 이미지 메타데이터에 첨부된 3D 메시지 메타데이터 o 3D message metadata attached to camera image metadata

· 카메라 고유 데이터 · Camera-specific data

· 초점 거리 · Focal distance

· 주요 포인트 · Main points

· 카메라 외부 데이터 · External camera data

· 2개의 카메라 간의 회전을 나타내는 4원수 · Quaternary number representing rotation between two cameras

· 2개의 카메라 사이의 변환 · Conversion between two cameras

· 다른 카메라 정보(예를 들어, 카메라 포지션)· Other camera information (e.g. camera position)

o 대응하는 증강 현실 콘텐츠 생성기에서의 3D 깊이 효과의 증강 현실 콘텐츠 생성기 ID o Augmented reality content generator ID of the 3D depth effect in the corresponding augmented reality content generator

o 메시지가 깊이를 가짐을 표시하는 미디어 속성 o Media properties that indicate that the message has depth

o 증강 현실 콘텐츠 생성기에 대응하는 증강 현실 콘텐츠 생성기 자산 메타데이터o Augmented Reality Content Generator asset metadata corresponding to the Augmented Reality Content Generator

비록 도 6에 도시되어 있지는 않지만, 예시적인 실시예에서, 주어진 3D 메시지는 또한 (예를 들어, 도 4와 관련하여 앞서 기술된 바와 같은) 다음과 같은 데이터를 포함한다: 1) 플레이스홀더 2D 이미지(예를 들어, 3D 효과에 대응하는 정보를 갖는 2D 사진), 및 2) 3D 메시지에 적용되는 표준 2D 오버레이(예를 들어, 지오로케이션, 스티커, 캡션 등에 기초한 필터). 따라서, 실시예에서, 3D 메시지는 증강 현실 콘텐츠 생성기에 대한 구성 데이터(예를 들어, 카메라 고유 데이터, 부착된 객체 포지션들)에 대응하는 메타데이터 및 3D 메시지와 관련하여 저장된 자산들에 대한 참조들을 포함하는 플레이스홀더 이미지를 포함한다.Although not shown in FIG. 6 , in an exemplary embodiment, a given 3D message also includes the following data (e.g., as previously described with respect to FIG. 4 ): 1) a placeholder 2D image; (e.g. 2D photos with information corresponding to 3D effects), and 2) standard 2D overlays applied to 3D messages (e.g. filters based on geolocation, stickers, captions, etc.). Accordingly, in an embodiment, the 3D message includes metadata corresponding to configuration data (e.g., camera specific data, attached object positions) for the augmented reality content generator and references to assets stored in association with the 3D message. Contains a placeholder image.

예에서, 사용자는 카메라에 의해 제공된 원시 데이터의 상이한 시각적 처리들을 초래하는 다수의 증강 현실 콘텐츠 생성기들로부터 선택할 수 있다. 예에서, (3D 데이터를 수반하는) 증강 현실 콘텐츠 생성기가 메시징 클라이언트 애플리케이션(104)에서 선택된 후에, 카메라는 원시 깊이 데이터, 및 카메라 이미지를 캡처한다. 예에서, 이 원시 데이터는 깊이 데이터를 포함하는 메시지의 뷰를 렌더링하기 위한 컴포넌트들에 제공된다. 또한, 실시예에서, 이 원시 데이터는 인물사진 분할 마스크를 생성하기 위해 메시지 클라이언트 애플리케이션(104)(또는 그의 컴포넌트)에 의해 이용된다. In an example, a user can select from multiple augmented reality content generators that result in different visual processing of the raw data provided by the camera. In the example, after an augmented reality content generator (involving 3D data) is selected in messaging client application 104, the camera captures raw depth data, and camera images. In the example, this raw data is provided to components to render a view of the message containing depth data. Additionally, in an embodiment, this raw data is used by message client application 104 (or components thereof) to generate a portrait segmentation mask.

실시예에서, 증강 현실 콘텐츠 생성기는 메시징 서버 애플리케이션(114)에 업로드되는 자산들, 및 메시지에 첨부되는 다른 데이터(효과를 재구축하기 위해 수신자가 필요로 하는 다른 임의의 것, 예를 들어, 시야 정보, 구성 파라미터들 등)를 생성하기 위한 로직을 포함한다.In an embodiment, the augmented reality content generator may be configured to store assets that are uploaded to the messaging server application 114, and other data attached to the message (any other data the recipient needs to reconstruct the effect, e.g., field of view). information, configuration parameters, etc.).

예에서, 발신자는 이후 3D 메시지를 재구성하기 위해 수신자에 의해 이용되는 정보를 포함하는 증강 현실 콘텐츠 생성기 메타데이터가 포함되는 표준 2D 이미지 메시지를 생성한다. 이것은 3D 메시지 증강 현실 콘텐츠 생성기의 ID(예를 들어, 수신자는 또한 발신자에 의해 사용된 것과 동일한 증강 현실 콘텐츠 생성기를 다운로드하고 실행함), 3D 메시지 ID(예를 들어, 모든 자산을 이 특정 3D 메시지와 연관시키기 위함), 및 메타데이터에 직접 내장된 임의의 구조화된 데이터(예를 들어, 숫자들, 텍스트, 벡터들 및 행렬들) 및 메시징 서버 애플리케이션(114)에 저장된 더 큰 자산들을 참조하는 임의의 수의 자산 ID들(예를 들어, 이미지들, 비디오들, 3D 메시들)을 포함하는, 3D 증강 현실 콘텐츠 생성기 자체에 의해 생성된 자산들 ID들 및 구성 데이터를 포함한다.In an example, the sender creates a standard 2D image message that includes augmented reality content generator metadata containing information that is later used by the recipient to reconstruct the 3D message. This is the ID of the 3D message augmented reality content generator (e.g. the recipient will also download and run the same augmented reality content generator used by the sender), the 3D message ID (e.g. all assets will be stored in this specific 3D message) (to associate with), and any structured data (e.g., numbers, text, vectors and matrices) embedded directly in the metadata and any referencing larger assets stored in the messaging server application 114. Contains asset IDs and configuration data generated by the 3D augmented reality content generator itself, including a number of asset IDs (e.g., images, videos, 3D meshes).

실시예에서, 얼굴 데이터 처리는 발신자 측에서만 발생한다. 그 후, 이 처리의 결과들은 3D 메시지 증강 현실 콘텐츠 생성기(예를 들어, 변환 행렬로서의 머리의 3D 변환)에 의해 구성 데이터로서 저장되고, 수신자 측에서 이 구성 데이터가 검색되고 사용된다(예를 들어, 수신자는 원래 이미지로부터 얼굴 데이터를 재처리하지 않는다). 이것은 수신측 디바이스가 #D를 디스플레이하기 위해 렌더링할 수 있게 해준다.In an embodiment, facial data processing occurs only on the sender side. The results of this processing are then stored as configuration data by the 3D message augmented reality content generator (e.g. 3D transformation of the head as a transformation matrix), and this configuration data is retrieved and used at the recipient side (e.g. , the recipient does not reprocess the face data from the original image). This allows the receiving device to render #D for display.

예로서, 수신자는 표준 2D 이미지 메시지를 수신하지만, 증강 현실 콘텐츠 생성기 메타데이터의 존재 및 콘텐츠 때문에, 수신자는 이러한 메타데이터에 기초하여 3D 메시지를 재구성한다. 이것은 먼저 그 ID를 사용하여 3D 메시지 증강 현실 콘텐츠 생성기를 페치한 다음, 3D 메시지 ID와 연관된 모든 자산을 페치하는 것을 수반한다. 수신자는 3D 메시지 증강 현실 콘텐츠 생성기를 로딩하고, 증강 현실 콘텐츠 생성기 자체는 적절한 자산들 및 데이터를 요청하고 3D 메시지를 재조립하기 위한 로직을 포함한다. 예에서, 3D 메시지 자산은 주어진 미디어 오버레이에 관하여 추가 처리를 수행하기 위한 정보를 포함하지 않을 것이므로, 예를 들어, 스티커(예를 들어, 오버레이된 이미지)가 3D 메시지의 상부에 적용되었다면, 수신자는 기본의 알려지지 않은 3D 메시지 자산을 수신할 것이고, 스티커는 수신자에 의해 미디어 오버레이로서 적용될 것이다.As an example, the recipient receives a standard 2D image message, but due to the presence and content of the augmented reality content generator metadata, the recipient reconstructs the 3D message based on this metadata. This entails first fetching the 3D message augmented reality content generator using that ID, and then fetching all assets associated with the 3D message ID. The recipient loads the 3D message augmented reality content generator, which itself includes logic to request appropriate assets and data and reassemble the 3D message. In the example, the 3D message asset will not contain information to perform further processing with respect to the given media overlay, so that, for example, if a sticker (e.g., an overlaid image) has been applied on top of the 3D message, the recipient may You will receive a basic unknown 3D message asset, and stickers will be applied as media overlays by the recipient.

도 7은 일부 예시적인 실시예들에 따른, 주석 시스템(206)의 다양한 모듈들을 예시하는 블록도(700)이다. 주석 시스템(206)은 이미지 및 깊이 데이터 수신 모듈(702), 센서 데이터 수신 모듈(704), 이미지 및 깊이 데이터 처리 모듈(706), 3D 효과 모듈(708), 렌더링 모듈(710), 공유 모듈(712), 및 증강 현실 콘텐츠 생성기 모듈(714)을 포함하는 것으로 도시된다. 주석 시스템(206)의 다양한 모듈들은 (예를 들어, 버스, 공유 메모리, 또는 스위치를 통해) 서로 통신하도록 구성된다. 이들 모듈 중 임의의 하나 이상은 하나 이상의 컴퓨터 프로세서(750)를 사용하여(예를 들어, 그 모듈에 대해 설명된 기능들을 수행하도록 이러한 하나 이상의 컴퓨터 프로세서를 구성함으로써) 구현될 수 있으므로, 컴퓨터 프로세서(750) 중 하나 이상(예를 들어, 클라이언트 디바이스(102)에 의해 제공된 프로세서 세트)을 포함할 수 있다. FIG. 7 is a block diagram 700 illustrating various modules of annotation system 206, according to some example embodiments. The annotation system 206 includes an image and depth data receiving module 702, a sensor data receiving module 704, an image and depth data processing module 706, a 3D effects module 708, a rendering module 710, and a sharing module ( 712), and an augmented reality content generator module 714. The various modules of annotation system 206 are configured to communicate with each other (e.g., via a bus, shared memory, or switch). Any one or more of these modules may be implemented using one or more computer processors 750 (e.g., by configuring such one or more computer processors to perform the functions described for those modules), so that the computer processor ( 750) (e.g., a processor set provided by client device 102).

설명된 모듈들 중 임의의 하나 이상은 하드웨어 단독(예를 들어, 머신(예를 들어, 머신(예를 들어, 머신(2300))의 컴퓨터 프로세서들(750) 중 하나 이상) 또는 하드웨어와 소프트웨어의 조합을 사용하여 구현될 수 있다. 예를 들어, 주석 시스템(206)의 임의의 설명된 모듈은 그 모듈에 대해 본 명세서에 설명된 동작들을 수행하도록 구성된 컴퓨터 프로세서들(750) 중 하나 이상(예를 들어, 머신(예를 들어, 머신(2300))의 하나 이상의 컴퓨터 프로세서의 서브세트 또는 그 중에서)의 배열을 물리적으로 포함할 수 있다. 다른 예로서, 주석 시스템(206)의 임의의 모듈은 그 모듈에 대해 본 명세서에 설명된 동작들을 수행하도록(예를 들어, 머신(예를 들어, 머신(2300))의 하나 이상의 컴퓨터 프로세서 중에서) 하나 이상의 컴퓨터 프로세서(750)의 배열을 구성하는 소프트웨어, 하드웨어, 또는 둘 다를 포함할 수 있다. 따라서, 주석 시스템(206)의 상이한 모듈들은 상이한 시점들에서 그러한 컴퓨터 프로세서들(750)의 상이한 배열들 또는 그러한 컴퓨터 프로세서들(750)의 단일 배열을 포함하고 구성할 수 있다. 더욱이, 주석 시스템(206)의 임의의 둘 이상의 모듈이 단일 모듈로 조합될 수 있고, 단일 모듈에 대해 본 명세서에 설명된 기능들은 다수의 모듈 간에 세분될 수 있다. 또한, 다양한 예시적인 실시예들에 따르면, 단일 머신, 데이터베이스, 또는 디바이스 내에 구현되는 것으로 본 명세서에서 설명되는 모듈들은 다수의 머신, 데이터베이스 또는 디바이스에 걸쳐 분산될 수 있다.Any one or more of the described modules may be hardware alone (e.g., one or more of the computer processors 750 of a machine (e.g., machine 2300)) or a combination of hardware and software. For example, any described module of annotation system 206 may be implemented using one or more of computer processors 750 (e.g., For example, it may physically comprise an array of, or a subset of, one or more computer processors of a machine (e.g., machine 2300). As another example, any module of annotation system 206 may include: software that configures an arrangement of one or more computer processors 750 (e.g., among one or more computer processors of a machine (e.g., machine 2300)) to perform the operations described herein for that module; hardware, or both. Accordingly, different modules of the annotation system 206 may include different arrangements of such computer processors 750 or a single arrangement of such computer processors 750 at different times. Moreover, any two or more modules of the annotation system 206 may be combined into a single module, and the functions described herein for a single module may be subdivided among multiple modules. Additionally, various According to example embodiments, modules described herein as being implemented within a single machine, database, or device may be distributed across multiple machines, databases, or devices.

이미지 및 깊이 데이터 수신 모듈(702)은 클라이언트 디바이스(102)에 의해 캡처된 이미지들 및 깊이 데이터를 수신한다. 예를 들어, 이미지는 클라이언트 디바이스(102)의 광학 센서(예를 들어, 카메라)에 의해 캡처된 사진이다. 이미지는 이미지에서 검출된 사용자의 얼굴 또는 실세계 객체(들)와 같은 하나 이상의 실세계 특징을 포함한다. 일부 실시예들에서, 이미지는 이미지를 기술하는 메타데이터를 포함한다. 예를 들어, 깊이 데이터는 상이한 깊이들(예를 들어, 눈, 귀, 코, 입술들 등)을 갖는 특징들을 갖는 객체(예를 들어, 사용자의 얼굴)로 향하는 발광 모듈로부터 방출되는 광선들에 기초한 깊이 정보를 포함하는 깊이 맵에 대응하는 데이터를 포함한다. 예로서, 깊이 맵은 이미지와 유사하지만, 컬러를 제공하는 각각의 픽셀 대신에, 깊이 맵은 카메라로부터 이미지의 그 부분까지의 거리를(예를 들어, 절대적으로, 또는 깊이 맵 내의 다른 픽셀들에 대해) 나타낸다.Image and depth data reception module 702 receives images and depth data captured by client device 102. For example, the image is a picture captured by an optical sensor (e.g., camera) of client device 102. The image includes one or more real-world features, such as a user's face or real-world object(s) detected in the image. In some embodiments, the image includes metadata that describes the image. For example, depth data may be derived from rays emitted from a light emitting module directed at an object (e.g., a user's face) with features having different depths (e.g., eyes, ears, nose, lips, etc.). Contains data corresponding to a depth map containing underlying depth information. For example, a depth map is similar to an image, but instead of each pixel providing a color, the depth map measures the distance of that part of the image from the camera (e.g., absolutely, or relative to other pixels in the depth map). About) indicates.

센서 데이터 수신 모듈(704)은 클라이언트 디바이스(102)로부터 센서 데이터를 수신한다. 센서 데이터는 클라이언트 디바이스(102)의 센서에 의해 캡처된 임의의 타입의 데이터이다. 예에서, 센서 데이터는 자이로스코프에 의해 수집된 클라이언트 디바이스(102)의 모션, 터치 센서(예를 들어, 터치스크린), GPS, 또는 클라이언트 디바이스(102)의 현재 지리적 위치 및/또는 움직임을 기술하는 클라이언트 디바이스(102)의 다른 센서로부터의 터치 입력들 또는 제스처 입력들을 포함할 수 있다. 다른 예로서, 센서 데이터는 클라이언트 디바이스(102)의 센서에 의해 검출된 현재 온도를 나타내는 온도 데이터를 포함할 수 있다. 다른 예로서, 센서 데이터는 클라이언트 디바이스(102)가 어두운 또는 밝은 환경에 있는지를 나타내는 광 센서 데이터를 포함할 수 있다. The sensor data receiving module 704 receives sensor data from the client device 102. Sensor data is any type of data captured by sensors on client device 102. In examples, sensor data may include motion of client device 102 collected by a gyroscope, a touch sensor (e.g., a touchscreen), GPS, or describing the current geographic location and/or movement of client device 102. It may include touch inputs or gesture inputs from other sensors of client device 102. As another example, sensor data may include temperature data representative of the current temperature detected by a sensor of client device 102. As another example, sensor data may include light sensor data that indicates whether client device 102 is in a dark or bright environment.

이미지 및 깊이 데이터 처리 모듈(706)은 수신된 이미지 및/또는 깊이 데이터에 대한 동작들을 수행한다. 예를 들어, 다양한 이미지 처리 및/또는 깊이 처리 동작들이 이미지 및 깊이 데이터 처리 모듈(706)에 의해 수행되며, 이는 본 명세서에서 추가로 논의된다.Image and depth data processing module 706 performs operations on received image and/or depth data. For example, various image processing and/or depth processing operations are performed by image and depth data processing module 706, which are discussed further herein.

3D 효과 모듈(708)은 애니메이션들에 대응하는 3D 알고리즘들 또는 기술들에 기초하여 다양한 동작들을 수행하고/하거나 수신된 이미지 및/또는 깊이 데이터에 시각적 및/또는 청각적 효과들을 제공하며, 이는 본 명세서에서 추가로 설명된다.The 3D effects module 708 performs various operations based on 3D algorithms or techniques corresponding to animations and/or provides visual and/or audio effects to the received image and/or depth data, which This is further explained in the specification.

렌더링 모듈(710)은 전술된 모듈들 중 적어도 하나에 의해 제공된 데이터에 기초하여 메시징 클라이언트 애플리케이션(104)에 의한 디스플레이를 위해 3D 메시지의 렌더링을 수행한다.Rendering module 710 performs rendering of a 3D message for display by messaging client application 104 based on data provided by at least one of the modules described above.

공유 모듈(712)은 메시징 서버 시스템(108)에 저장 및/또는 메시징 서버 시스템으로 전송하기 위한 3D 메시지를 생성한다. 공유 모듈(712)은 메시징 서버 시스템(108)의 다른 사용자들에게 3D 메시지들의 공유를 가능하게 한다.Sharing module 712 generates 3D messages for storage in and/or transmission to messaging server system 108. Sharing module 712 enables sharing of 3D messages with other users of messaging server system 108.

증강 현실 콘텐츠 생성기 모듈(714)은, 실시예에서, 캐러셀 배열로 프레젠테이션되는 선택가능한 그래픽 항목들의 디스플레이를 야기한다. 예로서, 사용자는 그래픽 항목들의 주기적인 뷰를 제공하는 캐러셀에 대응하는 방식으로 디스플레이 스크린 상으로 그리고 디스플레이 스크린으로부터 벗어나게 선택가능한 그래픽 항목들을 회전시키기 위해 다양한 입력들을 이용할 수 있다. 캐러셀 배열은 다수의 그래픽 항목이 디스플레이 스크린 상의 특정 그래픽 영역을 점유하게 한다. 예에서, 증강 현실 콘텐츠 생성기들은 캐러셀 배열 상에 포함시키기 위해 각각의 그룹들로 조직화될 수 있고, 그에 의해 그룹별로 증강 현실 콘텐츠 생성기들을 통해 회전하는 것을 가능하게 한다. Augmented reality content generator module 714, in embodiments, causes display of selectable graphical items presented in a carousel arrangement. By way of example, a user may use various inputs to rotate selectable graphical items on and off the display screen in a manner that corresponds to a carousel that provides a periodic view of graphical items. A carousel arrangement allows multiple graphical items to occupy a specific graphical area on the display screen. In an example, the augmented reality content generators may be organized into respective groups for inclusion on a carousel arrangement, thereby enabling rotating through the augmented reality content generators by group.

주어진 3D 메시지에서, 피사체 또는 장면의 3D 모델이 본 명세서에 기술된 실시예들을 사용하여 캡처될 수 있다. 이러한 3D 모델은 증강 현실 콘텐츠 생성기(들), 예를 들어, (렌즈(LENSES) 및 AR 효과들) 및 3D 효과들과 결합될 수 있고, 대상 시스템 내에서 공유되어, 시청자에 대한 상호작용의 추가적인 요소들을 제공할 수 있다. Given a 3D message, a 3D model of the subject or scene may be captured using the embodiments described herein. These 3D models can be combined with augmented reality content generator(s), such as (LENSES and AR effects) and 3D effects, and shared within the target system to provide additional interactivity for the viewer. elements can be provided.

본 명세서에 설명된 실시예들에서, 깊이 및 이미지 데이터를 사용함으로써, 종래의 2D 사진들(예를 들어, X-축 및 Y-축 치수들)에 Z-축 치수(예를 들어, 깊이 치수)를 추가하는 3D 얼굴 및 장면 재구성이 수행될 수 있다. 이 포맷은 시청자가 3D 메시지와 상호작용할 수 있게 해주고, 3D 메시지가 대상 시스템에 의해 렌더링되는 각도/시점을 변경하며, 3D 메시지를 렌더링하는 데 이용되는 입자 및 셰이더에 영향을 미친다. In embodiments described herein, depth and image data can be used to add Z-axis dimensions (e.g., depth dimensions) to conventional 2D photographs (e.g., X-axis and Y-axis dimensions). ) 3D face and scene reconstruction can be performed. This format allows the viewer to interact with the 3D message, changes the angle/viewpoint at which the 3D message is rendered by the target system, and affects the particles and shaders used to render the 3D message.

예에서, 시청자 상호작용 입력은, 3D 메시지를 시청하는 동안(예를 들어, 3D 메시지를 디스플레이하는 디바이스의 움직임 센서로부터 시청자로의) 움직임으로부터 나오고, 이것은 차례로, 콘텐츠, 입자 및 셰이더가 렌더링되는 방법에 대한 관점에서의 변화로 변환된다. 상호작용은 또한 온스크린 터치 제스처들 및 다른 디바이스 모션으로부터 올 수 있다.In an example, viewer interaction input comes from movement while viewing a 3D message (e.g., from a motion sensor of the device displaying the 3D message to the viewer), which in turn determines how content, particles, and shaders are rendered. It is converted into a change in perspective. Interaction can also come from on-screen touch gestures and other device motion.

도 8은 일부 예시적인 실시예들에 따른, 3D 메시지를 생성하는 방법(800)을 예시하는 흐름도이다. 방법(800)은, 방법(800)의 동작들이, 특히 도 7에서 전술한 주석 시스템(206)의 각각의 컴포넌트들과 관련하여, 메시징 클라이언트 애플리케이션(104)에 의해 부분적으로 또는 전체적으로 수행될 수 있도록 하나 이상의 컴퓨터 프로세서들에 의한 실행을 위한 컴퓨터 판독가능 명령어들로 구현될 수 있으며; 따라서, 방법(800)은 이를 참조하여 예로서 아래에 설명된다. 그러나, 방법(800)의 동작들 중 적어도 일부가 다양한 다른 하드웨어 구성들 상에 배치될 수 있고 방법(800)은 메시징 클라이언트 애플리케이션(104)으로 제한되는 것을 의도하지 않는 것이 인식될 것이다. FIG. 8 is a flow diagram illustrating a method 800 of generating a 3D message, according to some example embodiments. Method 800 is such that the operations of method 800 may be performed, in part or in full, by messaging client application 104, particularly with respect to each of the components of annotation system 206 described above in FIG. 7. may be implemented as computer readable instructions for execution by one or more computer processors; Accordingly, method 800 is described below by way of example and with reference thereto. However, it will be appreciated that at least some of the operations of method 800 may be deployed on a variety of other hardware configurations and method 800 is not intended to be limited to messaging client application 104.

동작(802)에서, 이미지 및 깊이 데이터 수신 모듈(702)은 클라이언트 디바이스(102)의 광학 센서(예를 들어, 카메라)에 의해 캡처된 이미지 데이터 및 깊이 데이터를 수신한다. 예에서, 3D 메시지를 생성하기 위해, 사용자는 메시징 클라이언트 애플리케이션(104)에서 3D 메시지 카메라 모드를 선택하고, 이는 카메라로 하여금 카메라 이미지와 함께 원시 깊이 데이터 및 인물사진 분할 마스크를 캡처하게 한다.At operation 802, the image and depth data receiving module 702 receives image data and depth data captured by an optical sensor (e.g., a camera) of the client device 102. In an example, to create a 3D message, the user selects the 3D message camera mode in the messaging client application 104, which causes the camera to capture raw depth data and a portrait segmentation mask along with the camera image.

동작(804)에서, 3D 효과 모듈(708)은 3D 효과를 선택한다. 예에서, 3D 효과는, 예를 들어, 메시징 클라이언트 애플리케이션(104)의 사용자 인터페이스에서 제공되는 바와 같은 3D 증강 현실 콘텐츠 생성기의 선택에 대응하는 사용자 입력에 기초하여 선택될 수 있다.In operation 804, 3D effects module 708 selects a 3D effect. In an example, a 3D effect may be selected based on user input corresponding to selection of a 3D augmented reality content generator, for example, as provided in the user interface of messaging client application 104.

동작(806)에서, 3D 효과 모듈(708)은 선택된 3D 효과를 이미지 데이터 및/또는 깊이 데이터에 적용한다. 예에서, 선택된 3D 효과는 이미지 데이터 및/또는 깊이 데이터를 처리하는 것을 가능하게 하는 로직을 포함한다.In operation 806, 3D effects module 708 applies the selected 3D effect to the image data and/or depth data. In an example, the selected 3D effect includes logic that enables processing image data and/or depth data.

동작(808)에서, 렌더링 모듈(710)은 적용된 3D 효과를 사용하여 3D 메시지의 뷰를 렌더링한다. 예에서, 렌더링 모듈은 메시징 클라이언트 애플리케이션(104)에 의해 디스플레이되는 적용된 3D 효과에 기초하여 3D 메시지의 뷰를 제공한다. 본 명세서에서 추가로 설명되는 바와 같이, 3D 메시지의 시청자는 이러한 입력들에 응답하여 3D 메시지가 업데이트되고 재렌더링되게 하는 추가적인 입력들(예를 들어, 움직임 데이터 및/또는 터치 입력들)을 제공할 수 있다.In operation 808, rendering module 710 renders a view of the 3D message using the applied 3D effect. In an example, the rendering module provides a view of the 3D message based on the applied 3D effect displayed by messaging client application 104. As further described herein, a viewer of a 3D message may provide additional inputs (e.g., movement data and/or touch inputs) that cause the 3D message to be updated and re-rendered in response to such inputs. You can.

동작(810)에서, 공유 모듈(712)은 3D 효과 데이터를 포함하는 3D 메시지를 생성한다. 앞서 논의된 바와 같이, 3D 메시지는 도 6 및/또는 도 4에 기술된 정보를 포함할 수 있고, 이는 3D 메시지가 3D 메시지의 수신 시에 3D 메시지의 시청자에 의해 재구성되고 렌더링될 수 있게 한다.In operation 810, sharing module 712 generates a 3D message containing 3D effect data. As previously discussed, the 3D message may include the information described in Figures 6 and/or 4, which allows the 3D message to be reconstructed and rendered by a viewer of the 3D message upon receipt of the 3D message.

동작(812)에서, 공유 모듈(712)은 생성된 3D 메시지를 메시징 서버 시스템(108)에 저장하거나 메시징 서버 시스템으로 전송한다. 예에서, 메시징 클라이언트 애플리케이션(104)은 3D 메시지를 메시징 서버 시스템(108)에 전송하며, 이는 3D 메시지가 3D 메시지의 특정 수신자 또는 시청자에 의해 나중에 저장 및/또는 시청될 수 있게 한다.In operation 812, sharing module 712 stores or transmits the generated 3D message to messaging server system 108. In an example, messaging client application 104 transmits a 3D message to messaging server system 108 so that the 3D message can be stored and/or viewed at a later time by a particular recipient or viewer of the 3D message.

실시예에서, 주어진 3D 메시지가 (예를 들어, 3D 메시지를 수신자 클라이언트 디바이스에 공유하는 발신자 클라이언트 디바이스에 의해) 수신되는 시나리오에서, 수신된 3D 메시지의 뷰를 렌더링하기 위해 동작(802)에서 동작(808)에 대해 기술된 유사한 동작들이 수행될 수 있다(예를 들어, 그로써 3D 메시지를 생성하고/하거나 3D 메시지를 저장 또는 전송하기 위해 동작(810) 및 동작(812)에 앞서 있음).In an embodiment, in a scenario where a given 3D message is received (e.g., by a sender client device sharing the 3D message to a recipient client device), at operation 802 to render a view of the received 3D message ( Similar operations described for 808 may be performed (e.g., prior to operations 810 and 812 to thereby generate a 3D message and/or store or transmit a 3D message).

이하의 논의는 3D 메시지들을 공유하고/하거나 이러한 3D 메시지들을 영구 스토리지(예를 들어, 데이터베이스(120))에 저장하기 위한 예시적인 실시예들에 관한 것이다.The discussion below relates to example embodiments for sharing 3D messages and/or storing such 3D messages in persistent storage (e.g., database 120).

실시예에서, 사용자는 3D 메시지를 메시징 서버 시스템(108)의 데이터베이스(120)에 저장하기 위한 프로세스를 (예를 들어, 메시징 클라이언트 애플리케이션(104)의 사용자 인터페이스에 제공되는 명령을 선택함으로써) 개시할 수 있다. 이 예에서, 이미지 및 깊이 데이터는 물론, 도 6에서 이전에 기술된 바와 같은 정보(예를 들어, 3D 메시지를 로딩하기 위한 3D 증강 현실 콘텐츠 발생기)가 저장된다. In embodiments, a user may initiate a process (e.g., by selecting a command provided in the user interface of messaging client application 104) to store a 3D message in database 120 of messaging server system 108. You can. In this example, image and depth data are stored, as well as information as previously described in Figure 6 (e.g., a 3D augmented reality content generator for loading 3D messages).

실시예에서, 사용자(예를 들어, 3D 메시지의 발신자)는 3D 메시지들을 수신자들의 세트(예를 들어, 3D 메시지의 하나 이상의 수신자)에 전송하기 위해 (예를 들어, 메시징 클라이언트 애플리케이션(104)의 사용자 인터페이스에 제공되는 명령을 선택함으로써) 프로세스를 개시할 수 있다. 실시예에서, 메시징 클라이언트 애플리케이션(104)은 3D 메시지들 및 그들이 사진들 및 비디오들과 어떻게 상이한지(예를 들어, 2D 메시지들)에 관해 사용자에게 알리는 프롬프트 및/또는 메시지를 제공할 수 있다. In an embodiment, a user (e.g., the sender of a 3D message) may use (e.g., the messaging client application 104) to send 3D messages to a set of recipients (e.g., one or more recipients of the 3D message). You can initiate the process (by selecting a command provided in the user interface). In an embodiment, messaging client application 104 may provide prompts and/or messages informing users about 3D messages and how they differ from photos and videos (eg, 2D messages).

실시예에서, 주어진 3D 메시지는 메시징 서버 시스템(108)에 저장된 후에 내보내질 수 있다. 예를 들어, 사용자가 선택된 3D 메시지에 대한 내보내기 명령을 선택할 때, 3D 메시지와 연관된 3D 효과에 대응하는 각각의 증강 현실 콘텐츠 생성기가 검색되고, 3D 효과는 루프화될 수 있는 비디오를 생성하기 위해 루프를 통해 이미지 데이터에 적용된다. 이 비디오에서, 3D 메시는 360도로 회전되어 특정 기간 동안 루프를 완료할 수 있다. In embodiments, a given 3D message may be stored on messaging server system 108 and then exported. For example, when a user selects an export command for a selected 3D message, each augmented reality content generator corresponding to the 3D effect associated with the 3D message is retrieved, and the 3D effect is looped to create a video that can be looped. It is applied to image data through . In this video, the 3D mesh can be rotated 360 degrees to complete the loop over a certain period of time.

도 9는 일부 예시적인 실시예들에 따른, 전술한 바와 같은 3D 메시지를 생성하기 위한 방법(800)과 함께 수행될 수 있는 이미지 및 깊이 데이터를 처리하기 위한 변환 패스들을 수행하는 방법(900)을 예시하는 흐름도이다. 방법(900)은, 방법(900)의 동작들이, 특히 도 7에서 전술한 주석 시스템(206)의 각각의 컴포넌트들과 관련하여, 메시징 클라이언트 애플리케이션(104)에 의해 부분적으로 또는 전체적으로 수행될 수 있도록 하나 이상의 컴퓨터 프로세서들에 의한 실행을 위한 컴퓨터 판독가능 명령어들로 구현될 수 있으며; 따라서, 방법(900)은 이를 참조하여 예로서 아래에 설명된다. 그러나, 방법(900)의 동작들 중 적어도 일부가 다양한 다른 하드웨어 구성들 상에 배치될 수 있고 방법(900)은 메시징 클라이언트 애플리케이션(104)으로 제한되는 것을 의도하지 않는 것이 인식될 것이다. 9 illustrates a method 900 of performing transformation passes for processing image and depth data that may be performed in conjunction with the method 800 for generating a 3D message as described above, according to some example embodiments. This is an illustrative flow chart. Method 900 is such that the operations of method 900 may be performed, in part or in full, by messaging client application 104, particularly with respect to each of the components of annotation system 206 described above in FIG. 7. may be implemented as computer readable instructions for execution by one or more computer processors; Accordingly, method 900 is described below by way of example and with reference thereto. However, it will be appreciated that at least some of the operations of method 900 may be deployed on a variety of other hardware configurations and method 900 is not intended to be limited to messaging client application 104.

동작(902)에서, 이미지 및 깊이 데이터 수신 모듈(702)은 클라이언트 디바이스(102)의 광학 센서(예를 들어, 카메라)에 의해 캡처된 이미지 데이터 및 깊이 데이터를 수신한다. 예에서, 깊이 데이터는 이미지 데이터에 대응하는 깊이 맵을 포함한다. 실시예에서, 이미지 데이터(예를 들어, 컬러 프레임)는 이미지 데이터의 높이가 특정 크기(예를 들어, 2048 픽셀)를 초과하는 경우 다시 크기 조정되어(예를 들어, 본 명세서에서 추가로 설명되는 이미지 처리 동작들에 대한) 처리 이용을 개선하고, 더 다양한 클라이언트 디바이스들과의 호환성을 더 양호하게 보장한다.At operation 902, the image and depth data reception module 702 receives image data and depth data captured by an optical sensor (e.g., a camera) of the client device 102. In an example, the depth data includes a depth map corresponding to the image data. In embodiments, image data (e.g., a color frame) is resized (e.g., as described further herein) if the height of the image data exceeds a certain size (e.g., 2048 pixels). improves processing utilization (for image processing operations) and ensures better compatibility with a wider variety of client devices.

예시적인 실시예에서, 머신 학습 기술들 및 휴리스틱스(heuristics)는 주어진 클라이언트 디바이스가 깊이 정보를 캡처할 수 있게 하는 적절한 하드웨어(예를 들어, 깊이 감지 카메라)를 포함하지 않는 인스턴스들에서 깊이 맵들을 생성하는데 이용된다. 이러한 머신 학습 기술들은 예에서 공유된 3D 메시지들로부터의 트레이닝 데이터(또는 다른 이미지 데이터)로부터 머신 학습 모델을 트레이닝할 수 있다. 이러한 휴리스틱스는 얼굴 추적 및 인물사진 분할을 사용하여 사람의 깊이 맵을 생성하는 것을 포함한다.In an example embodiment, machine learning techniques and heuristics may be used to generate depth maps in instances where a given client device does not include appropriate hardware (e.g., a depth-sensing camera) to enable capturing depth information. It is used to create These machine learning techniques can train a machine learning model from training data from shared 3D messages (or other image data) in the example. These heuristics include generating a depth map of a person using face tracking and portrait segmentation.

예시적인 실시예에서, 전술된 머신 학습 기술들은 신경망 모델을 이용하여 깊이 맵을 생성할 수 있다. 깊이 추정을 위해, 신경망으로의 입력은 RGB 이미지이고, 출력은 깊이 이미지이다. 예에서, 신경망은 RGB 이미지보다 해상도가 낮은 깊이 맵을 생성한다. 이러한 깊이 맵을 사용하여 렌더링되는 3D 효과는, 예에서, 깊이 맵의 더 낮은 해상도에 의해 제한될 수 있다. 특히, 미세한 상세(예를 들어, 머리카락)는 이 깊이 맵의 더 낮은 해상도에서 보존하기 어려울 수 있다. 따라서, 본 명세서에서 추가로 논의되는 바와 같이, 대상 기술은 3D 메시지의 시청 사용자에게 렌더링되고 프레젠테이션될 때 더 자연스럽고 덜 인위적인 것처럼 보이는 더 많은 3D 효과들을 생성하기 위해 깊이 맵과 관련된 이러한 잠재적인 단점을 해결하기 위한 다양한 기술들을 제공한다.In an example embodiment, the machine learning techniques described above may use a neural network model to generate a depth map. For depth estimation, the input to the neural network is an RGB image and the output is a depth image. In the example, the neural network generates a depth map with lower resolution than the RGB image. 3D effects rendered using such a depth map may, in examples, be limited by the lower resolution of the depth map. In particular, fine details (e.g., hair) may be difficult to preserve at the lower resolution of this depth map. Therefore, as discussed further herein, the subject technology takes advantage of these potential drawbacks associated with depth maps to create more 3D effects that appear more natural and less artificial when rendered and presented to a viewing user of a 3D message. It provides various technologies to solve the problem.

예시적인 실시예에서, 멀티-뷰 스테레오 컴퓨터 비전 기술들은 사용자가 장면에 대해 카메라를 이동시키는 다수의 이미지 또는 비디오로부터 깊이 맵들을 생성하기 위해 이용된다. In an example embodiment, multi-view stereo computer vision techniques are used to generate depth maps from multiple images or video as a user moves a camera about the scene.

다른 실시예에서, 신경 네트워크 모델은 클라이언트 디바이스에 의해 분할 마스크(들)를 생성하기 위해 이용될 수 있고, 분할 마스크(들)는 그 후 주어진 이미지의 배경 및 대응하는 깊이 맵의 인페인팅을 수행하기 위해 사용되며, 이는 본 명세서에서 추가로 논의된다.In another embodiment, a neural network model may be utilized by a client device to generate segmentation mask(s), which then perform inpainting of the background and corresponding depth map of a given image. It is used for, and is further discussed herein.

동작(904)에서, 이미지 및 깊이 데이터 처리 모듈(706)은 적어도 깊이 데이터를 사용하여 깊이 맵을 생성한다. 이하에서 추가로 논의되는 바와 같이, 패킹된 깊이 맵이라고 하는 제2 깊이 맵은, 추가의 기술적 이점을 위해 깊이 맵에 적어도 부분적으로 기초하여 생성될 수 있다.At operation 904, image and depth data processing module 706 generates a depth map using at least the depth data. As discussed further below, a second depth map, referred to as a packed depth map, can be generated based at least in part on the depth map for additional technical advantages.

동작(906)에서, 이미지 및 깊이 데이터 처리 모듈(706)은 이미지 데이터에 적어도 부분적으로 기초하여 분할 마스크를 생성한다. 실시예에서, 이미지 및 깊이 데이터 처리 모듈(706)은 컨볼루션 신경망을 사용하여 분할 마스크를 결정하여, 특정 객체 클래스(예를 들어, 얼굴/인물사진 또는 배경)에 픽셀을 할당하기 위해 모든 픽셀에 대해 예측이 이루어지는 밀집 예측 작업들을 수행하고, 분할 마스크는 분류된 픽셀들(예를 들어, 얼굴/인물사진 또는 배경)의 그룹화들에 기초하여 결정된다. 대안적으로, 분할 마스크는 클라이언트 디바이스(102)의 하드웨어(예를 들어, 신경망 프로세서 또는 다른 머신 학습 지향 프로세서)에 의해 생성된 후에 원시 입력 데이터에 포함된 것으로서 수신될 수 있다. At operation 906, image and depth data processing module 706 generates a segmentation mask based at least in part on the image data. In an embodiment, the image and depth data processing module 706 uses a convolutional neural network to determine a segmentation mask to apply to all pixels to assign the pixel to a particular object class (e.g., face/portrait or background). Dense prediction operations are performed, where prediction is made for the segmentation mask, and a segmentation mask is determined based on groupings of the classified pixels (e.g., face/portrait or background). Alternatively, the segmentation mask may be generated by hardware of client device 102 (e.g., a neural network processor or other machine learning oriented processor) and then received as included in the raw input data.

동작(908)에서, 이미지 및 깊이 데이터 처리 모듈(706)은 배경 인페인팅된 이미지 데이터를 생성하기 위해 적어도 분할 마스크를 사용하여 수신된 이미지 데이터의 배경 인페인팅 및 블러링을 수행한다. 예에서, 이미지 및 깊이 데이터 처리 모듈(706)은 배경으로부터 인물사진(예를 들어, 사용자의 얼굴을 포함함)을 제거하고 프레임 내의 사람에 초점을 맞추기 위해 배경을 블러링하는 배경 인페인팅 기술을 수행한다. 실시예에서, 전술한 처리(예로서, 변환 패스들) 중 일부는 깊이 및 컬러 텍스처들에 대해 이용되는 반면, 다른 이미지 처리는 컬러 텍스처에 대해 이용된다(예를 들어, 배경을 블러링함). 실시예에서, 처리(예를 들어, 변환 패스들)는 타깃으로의 렌더링을 위해 체인화되고, 처리된 깊이 맵 및 컬러 텍스처는 효과(들)에 의해 소비되는 방식으로 렌더링된다.At operation 908, the image and depth data processing module 706 performs background inpainting and blurring of the received image data using at least a segmentation mask to generate background inpainted image data. In an example, image and depth data processing module 706 may use a background inpainting technique to remove a portrait (e.g., containing a user's face) from the background and blur the background to focus on the person in the frame. Perform. In an embodiment, some of the processing described above (eg, transform passes) is used for depth and color textures, while other image processing is used for color texture (eg, blurring the background). In an embodiment, processing (e.g., transformation passes) is chained for rendering to a target, and the processed depth map and color texture are rendered in such a way that they are consumed by the effect(s).

동작(910)에서, 이미지 및 깊이 데이터 처리 모듈(706)은 깊이 맵에 적어도 부분적으로 기초하여 깊이 인페인팅 마스크를 생성한다. 예에서, 깊이 맵은 이전에 언급된 패킹된 깊이 맵에 대응할 수 있다. 예에서, 이미지 및 깊이 데이터 처리 모듈(706)은 깊이 맵에서 아티팩트를 클린업하기 위해 깊이 인페인팅 마스크를 사용한다. 대안적으로, 이미지 및 깊이 데이터 처리 모듈(706)은 대신에 깊이 맵을 인페인팅하기 위해 위에서 언급된 분할 마스크를 이용할 수 있다(예를 들어, 깊이 인페인팅 마스크를 생성하는 것에 앞서 있음).At operation 910, image and depth data processing module 706 generates a depth inpainting mask based at least in part on the depth map. In an example, the depth map may correspond to the previously mentioned packed depth map. In an example, the image and depth data processing module 706 uses a depth inpainting mask to clean up artifacts in the depth map. Alternatively, the image and depth data processing module 706 may instead use the segmentation mask mentioned above to inpaint the depth map (e.g., prior to generating the depth inpainting mask).

동작(912)에서, 이미지 및 깊이 데이터 처리 모듈(706)은 깊이 인페인팅 마스크를 사용하여 깊이 맵의 인페인팅을 수행하여 인페인팅된 깊이 맵을 생성한다. 앞서 언급한 바와 같이, 인페인팅된 깊이 맵은 아티팩트가, 있다면, 원래의 깊이 맵으로부터 제거된 후처리된 깊이 맵에 대응한다. 예에서, 후처리된 깊이 맵은 깊이 맵의 알파 채널(예를 들어, 이미지 내의 픽셀들에 대한 컬러 값들을 정의하는 채널들 이외의 채널)에 적용되는 분할을 포함한다.At operation 912, image and depth data processing module 706 performs inpainting of the depth map using a depth inpainting mask to generate an inpainted depth map. As previously mentioned, the inpainted depth map corresponds to a post-processed depth map in which artifacts, if any, have been removed from the original depth map. In an example, the post-processed depth map includes segmentation applied to the alpha channel of the depth map (e.g., a channel other than the channels that define color values for pixels in the image).

동작(914)에서, 이미지 및 깊이 데이터 처리 모듈(706)은 깊이 맵에 적어도 부분적으로 기초하여 깊이 법선 맵을 생성한다. 실시예에서, 이 동작에서의 깊이 맵은 이전에 언급된 패킹된 깊이 맵에 대응할 수 있다. 예에서, 이미지 및 깊이 데이터 처리 모듈(706)은, 깊이 법선 맵을 사용하여, 이미지 데이터의 전경 영역에 3D 효과(들)를 적용함으로써 후처리 전경 이미지를 제공한다.At operation 914, image and depth data processing module 706 generates a depth normal map based at least in part on the depth map. In an embodiment, the depth map in this operation may correspond to the previously mentioned packed depth map. In an example, the image and depth data processing module 706 provides a post-processed foreground image by applying 3D effect(s) to the foreground region of the image data using the depth normal map.

동작(916)에서, 렌더링 모듈(710)은, 생성된 3D 메시지에 포함된 자산인, 적어도 배경 인페인팅된 이미지, 인페인팅된 깊이 맵, 및 후처리된 전경 이미지를 사용하여 3D 메시지의 뷰를 생성한다. 예에서, 렌더링 모듈(710)은 메시징 클라이언트 애플리케이션(104)에 의한 디스플레이를 위해 3D 메시지의 뷰를 렌더링한다. At operation 916, the rendering module 710 creates a view of the 3D message using at least the background inpainted image, the inpainted depth map, and the post-processed foreground image, which are assets included in the generated 3D message. Create. In an example, rendering module 710 renders a view of a 3D message for display by messaging client application 104.

실시예에서, 클라이언트 디바이스(예를 들어, 클라이언트 디바이스(102))는 복수의 선택가능한 그래픽 항목으로부터 선택가능한 그래픽 항목의 선택을 수신하고, 선택가능한 그래픽 항목은 3D 효과를 포함하는 증강 현실 콘텐츠 생성기에 대응한다. 클라이언트 디바이스는 클라이언트 디바이스의 적어도 하나의 카메라를 사용하여 이미지 데이터를 캡처한다. 클라이언트 디바이스는 캡처된 이미지 데이터에 적어도 부분적으로 기초하여 머신 학습 모델을 사용하여 깊이 데이터를 생성한다. 클라이언트 디바이스는 증강 현실 콘텐츠 생성기에 적어도 부분적으로 기초하여 3D 효과를 이미지 데이터 및 깊이 데이터에 적용한다. In an embodiment, a client device (e.g., client device 102) receives a selection of a selectable graphical item from a plurality of selectable graphical items, the selectable graphical item being sent to an augmented reality content generator that includes a 3D effect. Respond. The client device captures image data using at least one camera of the client device. The client device generates depth data using a machine learning model based at least in part on the captured image data. The client device applies 3D effects to the image data and depth data based at least in part on the augmented reality content generator.

예에서, 이미지 데이터는 둘 이상의 카메라로 캡처된다. 대안적으로, In an example, image data is captured with two or more cameras. Alternatively,

이미지 데이터는 단일 카메라로부터의 듀얼 픽셀 오토포커스를 사용하여 캡처된다(깊이 정보는 듀얼 픽셀 오토포커스에 의해 캡처된 다수의 이미지를 사용하여 도출될 수 있다). 이전에 논의된 바와 같이, 머신 학습 모델은 캡처된 이미지 데이터에 기초하여 깊이 데이터의 예측을 제공하는 심층 신경망 또는 컨볼루션 신경망일 수 있고, 머신 학습 모델은 캡처된 이미지 데이터를 입력으로서 수신하고, 깊이 맵을 출력으로서 생성한다. 일부 구현들에서, 머신 학습 모델은 클라이언트 디바이스의 신경망 프로세서 또는 그래픽 처리 유닛 상에서 실행된다.Image data is captured using dual pixel autofocus from a single camera (depth information can be derived using multiple images captured by dual pixel autofocus). As previously discussed, the machine learning model can be a deep neural network or a convolutional neural network that provides predictions of depth data based on the captured image data, and the machine learning model receives the captured image data as input and Produces a map as output. In some implementations, the machine learning model runs on a neural network processor or graphics processing unit of the client device.

이하의 논의는, 실시예에서, 이미지 및 깊이 데이터 처리 모듈(706), 3D 효과 모듈(708) 및/또는 렌더링 모듈(710)과 같은 주석 시스템(206)의 컴포넌트들로서 포함되고, 렌더링 및/또는 3D 메시지의 생성과 함께 이미지 및/또는 깊이 데이터를 처리하기 위한 다양한 동작들을 수행하는 다양한 "카메라들"에 관한 것이다.The discussion below includes, in embodiments, components of the annotation system 206, such as image and depth data processing module 706, 3D effects module 708, and/or rendering module 710, and perform rendering and/or It relates to various “cameras” that perform various operations for processing image and/or depth data together with the generation of 3D messages.

실시예에서, 장면 카메라는 효과(들)의 대부분을 포함한다. 이는 모든 3D 또는 그래픽 부착물, 자이로 기반 상호작용 및 입자가 추가 및/또는 구성되는 경우이다. 예에서, 부착물들은 영구 저장을 통해 포지션들, 회전들 등을 저장함으로써 전송 및 수신 측에서 구성된다. In an embodiment, the scene camera contains most of the effect(s). This is where all 3D or graphical attachments, gyro-based interactions and particles are added and/or configured. In an example, attachments are configured on the transmitting and receiving side by storing positions, rotations, etc. through persistent storage.

실시예에서, 얼굴 효과 카메라는 컬러 및 깊이 텍스처들 둘 다에 또는 컬러 텍스처에 적용될 수 있는 얼굴 효과들을 포함한다. 컬러 및 깊이 맵 텍스처들 모두에 영향을 미치는 효과들에 대해, 얼굴 효과들은 층을 인페인팅하는 깊이에 렌더링된다. 예에서, 이것은 얼굴 신장 효과를 위해 이용된다.In an embodiment, the facial effects camera includes facial effects that can be applied to both color and depth textures or to a color texture. For effects that affect both color and depth map textures, facial effects are rendered at the depth inpainting layer. In an example, this is used for a facial stretching effect.

실시예에서, 컬러 텍스처에만 영향을 미치는 얼굴 효과들에 대해, 그러한 얼굴 효과들은 그들이 깊이 맵 상에 적용되는 것을 방지하기 위해 동일한 카메라 내의 개별 층에 배치된다. 이와 같은 효과의 예는 얼굴 리터치 효과 및 얼굴 마스크 효과이다.In an embodiment, for facial effects that only affect the color texture, those facial effects are placed on a separate layer within the same camera to prevent them from being applied on the depth map. Examples of such effects are the face retouch effect and the face mask effect.

실시예에서, 합성 카메라는 장면 카메라의 출력을 렌더링하고, 파이프라인의 끝에서 전체 장면에 적용되는 임의의 효과들(예를 들어, 컬러 필터 또는 스크린 공간 입자들)을 적용한다.In an embodiment, the composite camera renders the output of the scene camera and applies any effects (e.g., color filters or screen space particles) that apply to the entire scene at the end of the pipeline.

미화 기술은, 신체 세계에서의 성형 수술 또는 메이크업과 유사한 결과를 달성할 수 있는, 얼굴(예를 들어, 얼굴 이미지 데이터)의 표현에 대응하는 이미지 데이터의 영역(들)을 포함한, 이미지 데이터의 리터치와 관련된 이미지 처리 조작(예를 들어, "미화 조작")을 말한다. 예를 들어, 이러한 미화 기술은, 볼을 슬림하게 하는 것, 눈을 확대하는 것, 피부를 매끈하게 하는 것, 치아 또는 피부를 밝게 하는 것, 흠 또는 주름을 제거하는 것, 눈 색상을 변경하는 것, 처진 피부를 축소하는 것, 피부 색상을 향상시키는 것, 얼굴 문신 또는 마킹을 추가하는 것 등과 같은, 디지털 도메인에서의 얼굴 이미지 데이터를 수정할 수 있다. 따라서, 주어진 미화 기술은 얼굴 이미지의 심미적 매력을 향상시킬 수 있다. 사용자로부터의 지루한(예를 들어, 수동으로 선택되거나 사용자에 의해 수행되는) 상호작용들을 회피하기 위해 자동화된 방식으로 얼굴 이미지 데이터의 미화를 제공하는 것이 유용하며, 이에 의해 얼굴 미화의 렌더링을 프레젠테이션하기 위한 더 편리하고 효율적인 프로세스로 이어진다.Beautification techniques include the retouching of image data, including region(s) of the image data corresponding to a representation of the face (e.g., facial image data), which can achieve results similar to plastic surgery or makeup in the physical world. refers to image processing manipulations (e.g., “beautification manipulations”) related to For example, these beautification techniques include slimming the cheeks, enlarging the eyes, smoothing the skin, brightening teeth or skin, removing blemishes or wrinkles, and changing eye color. It is possible to modify facial image data in the digital domain, such as reducing sagging skin, enhancing skin color, adding facial tattoos or markings, etc. Therefore, a given beautification technique can improve the aesthetic appeal of facial images. It is useful to provide beautification of facial image data in an automated manner to avoid tedious (e.g. manually selected or performed by the user) interactions from the user, thereby presenting a rendering of the facial beautification. This leads to a more convenient and efficient process.

또한, 얼굴 이미지 데이터가 수정되어야 할 때, 미화 기술의 더 낮은 품질 또는 미적으로 만족스럽지 않은 적용을 초래할 수 있는 시각적 아티팩트를 피하기 위해 정확한 방식으로 수정을 위해 이미지 데이터의 영역이 선택되어야 한다. 따라서, 얼굴 이미지 데이터에 주어진 미화 기술을 더 정확하게 적용하기 위해 본 명세서에서 추가로 논의되는 인물사진 분할 마스크를 이용하는 것이 유리하다.Additionally, when facial image data must be modified, regions of the image data must be selected for modification in a precise manner to avoid visual artifacts that may result in lower quality or aesthetically unpleasing applications of beautification techniques. Therefore, it is advantageous to use portrait segmentation masks, discussed further herein, to more accurately apply a given beautification technique to facial image data.

대상 시스템에서, 선택된 AR 콘텐츠 생성기는 이미지 데이터 및 깊이 데이터에 적용되어, 생성된 3D 메시지의 디스플레이(예를 들어, 렌더링)에 제공될 수 있는 미화 효과를 야기하는 적어도 하나의 미화 기술을 포함할 수 있다.In the target system, the selected AR content generator may include at least one beautification technique applied to the image data and depth data to cause a beautification effect that may be provided to a display (e.g., rendering) of the generated 3D message. there is.

도 10은 일부 예시적인 실시예들에 따른, 전술한 바와 같은 3D 메시지를 생성하기 위한 방법(900)과 함께 수행될 수 있는 이미지 및 깊이 데이터의 미화를 수행하는 방법(1000)을 예시하는 흐름도이다. 방법(1000)은, 방법(1000)의 동작들이, 특히 도 7에서 전술한 주석 시스템(206)의 각각의 컴포넌트들과 관련하여, 메시징 클라이언트 애플리케이션(104)에 의해 부분적으로 또는 전체적으로 수행될 수 있도록 하나 이상의 컴퓨터 프로세서들에 의한 실행을 위한 컴퓨터 판독가능 명령어들로 구현될 수 있으며; 따라서, 방법(1000)은 이를 참조하여 예로서 아래에 설명된다. 그러나, 방법(1000)의 동작들 중 적어도 일부는 다양한 다른 하드웨어 구성들에 배치될 수 있고, 방법(1000)은 메시징 클라이언트 애플리케이션(104)에 제한되도록 의도되지 않는다는 점이 이해될 것이다. 10 is a flow diagram illustrating a method 1000 of performing beautification of image and depth data that may be performed in conjunction with the method 900 for generating a 3D message as described above, according to some example embodiments. . Method 1000 may be performed in part or in whole by messaging client application 104, particularly with respect to each of the components of annotation system 206 described above in Figure 7. may be implemented as computer readable instructions for execution by one or more computer processors; Accordingly, method 1000 is described below by way of example and with reference thereto. However, it will be appreciated that at least some of the operations of method 1000 may be deployed on a variety of other hardware configurations, and method 1000 is not intended to be limited to messaging client application 104.

전술한 바와 같이, 대상 시스템은 증강 현실 콘텐츠 생성기 및/또는 다른 이미지 처리 동작(들)을 통해 얼굴을 변형 또는 미화하기 위해 얼굴을 포함하는 이미지에 적용될 수 있는 얼굴 효과들(예를 들어, 미화, 얼굴 리터치, 스트레칭 및 다른 효과들)의 적용을 가능하게 한다. 본 명세서에서 논의되는 바와 같이, "미화"는 사용자 제공 기준에 따라 이미지들을 분석하여 기준들을 충족시키도록 이미지들을 수정하는 것을 지칭한다. 그러한 기준은 이미지 내의 픽셀들의 컬러 및 투명도와 연관된 X, Y 및 Z 값들을 포함할 수 있다. "미화 조작"은 얼굴 이미지 데이터의 미화를 수행하기 위한 이미지 처리 조작들의 세트를 지칭한다. As described above, the subject system may be configured to perform facial effects (e.g., beautification, enables the application of facial retouching, stretching and other effects). As discussed herein, “beautification” refers to analyzing images according to user-provided criteria and modifying the images to meet the criteria. Such criteria may include X, Y and Z values associated with the color and transparency of pixels within the image. “Beautification operations” refers to a set of image processing operations to perform beautification of facial image data.

동작(1002)에서, 3D 효과 모듈(708)은 클라이언트 디바이스(예를 들어, 클라이언트 디바이스(102))에서, (예를 들어, 아래의 도 12에서 추가로 논의되는 바와 같은 인터페이스에서) 복수의 선택가능한 그래픽 항목으로부터 선택가능한 그래픽 항목의 선택을 수신한다. 예에서, 선택가능한 그래픽 항목은 3D 효과를 적용하기 위한 증강 현실 콘텐츠 생성기이거나 이에 대응하고, 3D 효과는 3D 효과와 함께 수행될 적어도 하나의 미화 조작을 포함한다.At operation 1002, 3D effects module 708 selects, at a client device (e.g., client device 102), a plurality of selections (e.g., in an interface as further discussed in Figure 12 below). Receives a selection of selectable graphic items from available graphic items. In an example, the selectable graphical item is or corresponds to an augmented reality content generator for applying a 3D effect, and the 3D effect includes at least one beautification operation to be performed in conjunction with the 3D effect.

그러한 미화 조작의 일례는 부드러운 피부, 치아 미백, 눈 선명화 및 눈 미백과 같은 사용자의 얼굴을 리터치하기 위한 다수의 특징을 포함하는 얼굴 리터치를 포함한다. 미화 조작의 다른 예는 사용자의 얼굴의 스트레칭 포인트들을 가능하게 하는 얼굴 스트레치 효과를 포함한다. 미화 조작의 또 다른 예는 사용자의 눈의 색을 변경하는 것 및/또는 눈 반사를 생성하는 것을 포함한다. 미화 조작의 다른 예는 얼굴을 구형으로 휘게 하는 얼굴 유동화 효과를 포함한다. 미화 조작의 다른 예는 얼굴의 특징(예를 들어, 눈)을 얼굴의 다른 영역들에 매핑하는 얼굴 삽입 효과를 포함한다. 다른 타입의 미화 기술이 고려되고 대상 시스템의 범위 내에 있다는 것이 이해된다.Examples of such beautification operations include facial retouching, which includes a number of features to retouch the user's face, such as smoothing skin, whitening teeth, sharpening eyes, and brightening eyes. Another example of a beautification manipulation includes a facial stretching effect that enables stretching points of the user's face. Another example of beautification manipulation includes changing the color of the user's eyes and/or creating eye reflections. Other examples of beautification manipulation include facial liquification effects that warp the face into a sphere. Other examples of beautification manipulations include face insertion effects that map facial features (e.g., eyes) to other areas of the face. It is understood that other types of beautification techniques are contemplated and within the scope of the target system.

미화 조작은 피부를 매끈하게 하는 것을 증가시키고, 조명을 조정하고, 얼굴 이미지 데이터의 컬러를 수정하기 위해 이미지 데이터를 수정할 수 있다. 전술한 이미지 효과들을 달성하기 위한 예시적인 접근법들은 인물사진 분할, 인물사진 융합, 컬러 정정, GMM(Gaussian mixture model), 가우시안 필터, 베이지안 분할, 피부 컬러 검출, 양방향 필터, HSV 컬러 디스크립터, 웨이블릿 변환, 그래디언트 도메인 이미지 처리, 푸아송 이미지 클로닝, 리 필터(Lee filter), 에지 보존 평활화 필터, 블러링, 잡음 감소, 흠 제거, 특징 검출 및 추출 등을 사용하는 것을 포함한다. 주어진 미화 조작을 수행하기 위해 대상 기술에 의해 다른 접근법들이 이용될 수 있다. 예를 들어, 머신 학습 모델들은 컨볼루션 신경망, 생성 적대 네트워크 등과 같은 미화 조작에 적용될 수 있다. 이러한 머신 학습 모델들은 얼굴 특징 구조들을 보존하거나, 흠을 매끈하게 하거나, 주름들을 제거하거나, 얼굴 이미지 데이터에서 얼굴 피부 텍스처를 보존하기 위해 이용될 수 있다.Beautification manipulations may modify image data to increase skin smoothness, adjust lighting, and modify the color of facial image data. Exemplary approaches for achieving the aforementioned image effects include portrait segmentation, portrait fusion, color correction, Gaussian mixture model (GMM), Gaussian filter, Bayesian segmentation, skin color detection, bidirectional filter, HSV color descriptor, wavelet transform, It includes using gradient domain image processing, Poisson image cloning, Lee filter, edge-preserving smoothing filter, blurring, noise reduction, blemish removal, feature detection and extraction, etc. Different approaches may be used depending on the target technology to perform a given beautification operation. For example, machine learning models can be applied to beautification operations such as convolutional neural networks, generative adversarial networks, etc. These machine learning models can be used to preserve facial feature structures, smooth blemishes, remove wrinkles, or preserve facial skin texture in facial image data.

동작(1004)에서, 이미지 및 깊이 데이터 수신 모듈(702)은 클라이언트 디바이스(예를 들어, 클라이언트 디바이스(102))의 광학 센서로부터 이미지 데이터 및 깊이 데이터를 캡처한다. 실시예에서, 특정 3D 효과에 대응하는 선택가능한 그래픽 시간의 선택에 응답하여, 클라이언트 디바이스(102)는 캡처된 이미지 데이터 및 깊이 데이터를 수신하기 위해 이미지 및 깊이 데이터 수신 모듈(702)에서 동작들을 개시할 수 있다. 본 명세서에서 논의된 바와 같이, 이러한 카메라로부터의 원시 입력 데이터는 캡처된 이미지 데이터 및 깊이 데이터를 포함할 수 있고, 일부 실시예들에서, 클라이언트 디바이스의 하드웨어 능력들(예를 들어, GPU 또는 신경망 프로세서 등)을 사용하여 생성되는 인물사진 분할 마스크를 또한 포함할 수 있다.At operation 1004, image and depth data reception module 702 captures image data and depth data from an optical sensor of a client device (e.g., client device 102). In an embodiment, in response to selection of a selectable graphics time corresponding to a particular 3D effect, client device 102 initiates operations at image and depth data reception module 702 to receive captured image data and depth data. can do. As discussed herein, raw input data from such cameras may include captured image data and depth data and, in some embodiments, may utilize the hardware capabilities of the client device (e.g., a GPU or neural network processor). etc.) may also include a portrait segmentation mask created using.

동작(1006)에서, 3D 효과 모듈(708)은 적어도 하나의 미화 조작을 포함하는 3D 효과를 이미지 데이터 및 깊이 데이터에 적용한다. 실시예에서, 3D 효과를 적용하는 것의 일부로서, 3D 효과 모듈(708)은 얼굴 이미지 데이터를 포함하는 적어도 이미지 데이터의 영역에 대해 미화 조작을 수행하고, 미화 조작은 영역 내의 픽셀들의 매끈하게 하는 것, 조명 조정, 또는 컬러 수정 중 적어도 하나를 포함한다. 또한, 미화 조작은 캡처된 이미지 데이터에 포함된 얼굴 이미지 데이터에서 얼굴 특징 구조들을 보존하거나, 흠을 매끈하게 하거나, 주름들을 제거하거나, 얼굴 피부 텍스처를 보존하기 위한 머신 학습 모델의 이용을 포함한다.At operation 1006, 3D effects module 708 applies a 3D effect, including at least one beautification manipulation, to image data and depth data. In an embodiment, as part of applying a 3D effect, 3D effects module 708 performs a beautification operation on at least a region of image data containing facial image data, wherein the beautification operation includes smoothing pixels within the region. , lighting adjustments, or color modifications. Beautification manipulation also includes the use of machine learning models to preserve facial feature structures, smooth blemishes, remove wrinkles, or preserve facial skin texture in facial image data included in the captured image data.

동작(1008)에서, 공유 모듈(712)은 적어도 하나의 미화 조작을 포함하는 적용된 3D 효과에 적어도 부분적으로 기초하여 3D 메시지를 생성한다. 실시예에서, 3D 메시지가 메시징 서버 시스템(108)의 데이터(120)에 저장될 때, 수신자에 의한 후속 시청 시에, 3D 메시지가 적용된 미화 조작으로 렌더링되도록, 본 명세서에서 논의된 다른 자산들 중에서, 적용된 미화 기술 및 후처리된 이미지 데이터에 대응하는 정보(예를 들어, 메타데이터)가 3D 메시지에 포함된다.At operation 1008, sharing module 712 generates a 3D message based at least in part on an applied 3D effect that includes at least one beautification manipulation. In embodiments, when a 3D message is stored in data 120 of messaging server system 108, upon subsequent viewing by a recipient, the 3D message is rendered with an applied beautification manipulation, among other assets discussed herein. , Information (e.g., metadata) corresponding to the applied beautification technique and post-processed image data is included in the 3D message.

동작(1010)에서, 렌더링 모듈(710)은 적어도 하나의 미화 조작을 포함하는 적용된 3D 효과에 적어도 부분적으로 기초하여 3D 메시지의 뷰를 렌더링하는 3D 메시지의 뷰를 렌더링한다. 대안적인 실시예에서, 동작(1008 및 1010)은 3D 메시지의 뷰가 (예를 들어, 3D 메시지의 프리뷰를 제공하기 위해) 초기에 수행되고, 이후 3D 메시지가 본 명세서에서 추가로 설명되는 바와 같이 포함된 자산들 및 메타데이터로 생성되도록 상이한 순서로 수행될 수 있다는 점이 인식된다.At operation 1010, rendering module 710 renders a view of the 3D message that renders a view of the 3D message based at least in part on an applied 3D effect that includes at least one beautification operation. In an alternative embodiment, operations 1008 and 1010 are performed where a view of the 3D message is initially displayed (e.g., to provide a preview of the 3D message) and then the 3D message is displayed as further described herein. It is recognized that this may be performed in a different order to produce the included assets and metadata.

도 11은 일부 예시적인 실시예들에 따라, 전술된 바와 같은 3D 메시지를 생성하기 위한 방법(800)과 함께 수행될 수 있는 움직임 데이터에 응답하여 3D 메시지의 뷰를 업데이트하는 방법(1100)을 예시하는 흐름도이다. 방법(1100)은, 방법(1100)의 동작들이, 특히 도 7에서 위에 설명된 주석 시스템(206)의 각각의 컴포넌트들에 대해, 메시징 클라이언트 애플리케이션(104)에 의해 부분적으로 또는 전체적으로 수행될 수 있도록 하나 이상의 컴퓨터 프로세서들에 의한 실행을 위한 컴퓨터 판독가능 명령어들로 구현될 수 있으며; 따라서, 방법(1100)은 이를 참조하여 예로서 아래에 설명된다. 그러나, 방법(1100)의 동작들 중 적어도 일부는 다양한 다른 하드웨어 구성들에 배치될 수 있고, 방법(1100)은 메시징 클라이언트 애플리케이션(104)에 제한되도록 의도되지 않는다는 점이 이해될 것이다. 11 illustrates a method 1100 for updating a view of a 3D message in response to motion data that may be performed in conjunction with the method 800 for generating a 3D message as described above, according to some example embodiments. This is a flow chart. Method 1100 may be performed in part or in full by messaging client application 104, particularly for each of the components of annotation system 206 described above in FIG. 7. may be implemented as computer readable instructions for execution by one or more computer processors; Accordingly, method 1100 is described below by way of example and with reference thereto. However, it will be appreciated that at least some of the operations of method 1100 may be deployed on a variety of other hardware configurations, and method 1100 is not intended to be limited to messaging client application 104.

동작(1102)에서, 센서 데이터 수신 모듈(704)은 움직임 센서(예를 들어, 자이로스코프, 모션 센서, 터치스크린 등)로부터 움직임 데이터를 수신한다. 실시예에서, 메시징 클라이언트 애플리케이션(104)은 위치 또는 움직임 센서와 같은, 클라이언트 디바이스(102)의 센서에 의해 캡처된 센서 데이터를 수신한다. In operation 1102, sensor data reception module 704 receives motion data from a motion sensor (e.g., gyroscope, motion sensor, touch screen, etc.). In an embodiment, messaging client application 104 receives sensor data captured by sensors on client device 102, such as location or motion sensors.

동작(1104)에서, 3D 효과 모듈(708)은 움직임 데이터에 기초하여 3D 메시지의 뷰를 업데이트한다. 실시예에서, 클라이언트 디바이스의 롤/요/피치 배향의 변화에 대응하는 움직임 데이터에 응답하여, 3D 메시지는 3D 효과 모듈(708)에 의해 어떻게 렌더링되는지에 대한 대응하는 변화를 갖는다(예를 들어, -10도 롤의 입력은 콘텐츠의 관점을 +10도 롤로 시프트시킬 것이다). 실시예에서, 특정 기간(예를 들어, 3초) 동안 움직임 데이터(예를 들어, 롤/요/피치)를 수신하지 않는 것에 응답하여, 3D 효과 모듈(708)은 깊이 및 시차를 나타내기 위해 피치, 롤 및 요에 대한 미묘한 시프트를 갖는 애니메이션을 보여줌으로써 3D 메시지의 뷰를 업데이트한다. 또한, 움직임에 응답하여, 전술한 애니메이션은 정지할 것이고 입력에 대한 응답은 3D 효과 모듈(708)에 의해 처리된다. At operation 1104, 3D effects module 708 updates the view of the 3D message based on the motion data. In an embodiment, in response to motion data corresponding to a change in the roll/yaw/pitch orientation of the client device, the 3D message has a corresponding change in how it is rendered by the 3D effects module 708 (e.g. An input of -10 degrees roll will shift the perspective of the content to +10 degrees roll). In an embodiment, in response to not receiving motion data (e.g., roll/yaw/pitch) for a certain period of time (e.g., 3 seconds), 3D effects module 708 configures the 3D effects module 708 to indicate depth and parallax. Updates the view of the 3D message by showing an animation with subtle shifts in pitch, roll, and yaw. Additionally, in response to movement, the above-described animation will stop and the response to input is processed by the 3D effects module 708.

실시예에서, 앞서 기술된 바와 같이, 부가의 3D 효과들 및/또는 증강 현실 콘텐츠 발생기들(예를 들어, 미디어 오버레이들)이 이미지 및/또는 깊이 데이터에 적용될 수 있고, 이 깊이 데이터는 이미지 텍스처, 깊이 맵의 지오메트리는 물론 재구성된 모델의 전방에서의 깊이 치수 둘 다의 특성을 변화시킨다.In an embodiment, as described above, additional 3D effects and/or augmented reality content generators (e.g., media overlays) may be applied to the image and/or depth data, which depth data may be combined with an image texture. , changing the properties of both the geometry of the depth map as well as the depth dimension at the front of the reconstructed model.

동작(1106)에서, 렌더링 모듈(710)은 3D 메시지의 업데이트된 뷰를 렌더링한다. 3D의 업데이트된 뷰는 클라이언트 디바이스(예를 들어, 클라이언트 디바이스(102))의 디스플레이 상에 디스플레이하기 위해 제공된다.At operation 1106, rendering module 710 renders an updated view of the 3D message. The updated view in 3D is provided for display on a display of a client device (e.g., client device 102).

도 12는 일부 실시예들에 따른, 증강 현실 콘텐츠 생성기를 선택하여 미디어 콘텐츠(예를 들어, 이미지 또는 비디오)에 적용하고, 적용된 증강 현실 콘텐츠 생성기를 메시징 클라이언트 애플리케이션(104)(또는 메시징 시스템(100))에 프레젠테이션하기 위한 캐러셀을 도시하는 예시적인 사용자 인터페이스들을 예시한다. 12 illustrates selecting an augmented reality content generator and applying it to media content (e.g., an image or video) and sending the applied augmented reality content generator to a messaging client application 104 (or messaging system 100), according to some embodiments. )) illustrate example user interfaces showing a carousel for presentation.

이러한 사용자 인터페이스의 실시예들에서, 선택가능한 그래픽 항목(1250)은, 선택가능한 그래픽 항목(1250)의 일부 또는 서브셋이 주어진 컴퓨팅 디바이스(예를 들어, 클라이언트 디바이스(102))의 디스플레이 스크린 상에서 보이는 캐러셀 배열로 프레젠테이션될 수 있다. 예로서, 사용자는 그래픽 항목들의 주기적인 뷰를 제공하는 캐러셀에 대응하는 방식으로 디스플레이 스크린 상으로 그리고 디스플레이 스크린으로부터 벗어나게 선택가능한 그래픽 항목들을 회전시키기 위해 다양한 입력들을 이용할 수 있다. 따라서, 사용자 인터페이스에 제공된 캐러셀 배열은 다수의 그래픽 항목이 디스플레이 스크린 상의 특정 그래픽 영역을 점유할 수 있게 한다.In embodiments of such a user interface, selectable graphical items 1250 may be displayed on a display screen of a given computing device (e.g., client device 102) for which a portion or subset of selectable graphical items 1250 are displayed. It can be presented as a Russell array. By way of example, a user may use various inputs to rotate selectable graphical items on and off the display screen in a manner that corresponds to a carousel that provides a periodic view of graphical items. Accordingly, the carousel arrangement provided in the user interface allows multiple graphical items to occupy a particular graphical area on the display screen.

예에서, 상이한 AR 콘텐츠 생성기들에 대응하는 각각의 AR 경험들은 캐러셀 배열 상에 포함시키기 위한 각각의 그룹들로 조직화될 수 있고, 이에 의해 미디어 오버레이들을 통해 그룹별로 회전하는 것을 가능하게 한다. 캐러셀 인터페이스가 예로서 제공되지만, 다른 그래픽 인터페이스들이 이용될 수 있다는 것이 이해된다. 예를 들어, 증강 현실 콘텐츠 생성기들의 세트는 그래픽 리스트, 스크롤 리스트, 스크롤 그래픽, 또는 선택을 위한 다양한 그래픽 항목들을 통한 내비게이션을 가능하게 하는 다른 그래픽 인터페이스 등을 포함할 수 있다. 본 명세서에서 사용되는 바와 같이, 캐러셀 인터페이스는 원형 리스트와 유사한 배열로 그래픽 항목들을 디스플레이하는 것을 지칭하며, 따라서 사용자 입력들(예를 들면, 터치 또는 제스처들)에 기초하여 원형 리스트를 통해 내비게이션하여 그래픽 항목들을 선택하거나 스크롤하는 것을 가능하게 한다. 예에서, 그래픽 항목들의 세트는 수평(또는 수직) 라인 또는 축 상에 프레젠테이션될 수 있으며, 각각의 그래픽 항목은 특정 섬네일 이미지(또는 아이콘, 아바타 등)로서 표현된다. 임의의 한 시점에, 캐러셀 인터페이스 내의 그래픽 항목들 중 일부는 숨겨질 수 있다. 사용자가 숨겨진 그래픽 항목들을 보기를 원하면, 예에서, 사용자는 특정 방향(예를 들어, 왼쪽, 오른쪽, 위, 또는 아래 등)으로 그래픽 항목들을 스크롤하기 위한 사용자 입력(예를 들어, 터치, 제스처 등)을 제공할 수 있다. 그 후, 캐러셀 인터페이스의 후속 뷰가 디스플레이되고, 여기서, 인터페이스 상에 포함시키기 위한 하나 이상의 추가 그래픽 항목을 프레젠테이션하기 위해 애니메이션이 제공되거나 렌더링되고, 이전에 프레젠테이션된 그래픽 항목들 중 일부는 이 후속 뷰에 숨져질 수 있다. 실시예에서, 이러한 방식으로, 사용자는 그래픽 항목들의 세트를 순환 방식으로 앞뒤로 내비게이션할 수 있다. 따라서, 캐러셀 인터페이스는 그래픽 항목들의 세트로부터의 이미지들의 서브세트만을 순환 뷰로 디스플레이함으로써 스크린 공간을 최적화할 수 있다는 것을 알 것이다. In an example, respective AR experiences corresponding to different AR content generators may be organized into respective groups for inclusion on a carousel arrangement, thereby enabling rotation by group through media overlays. Although the carousel interface is provided as an example, it is understood that other graphical interfaces may be used. For example, a set of augmented reality content generators may include a graphical list, scrolling list, scrolling graphics, or other graphical interface that allows navigation through various graphical items for selection, etc. As used herein, a carousel interface refers to the display of graphical items in an arrangement similar to a circular list, thus allowing navigation through the circular list based on user inputs (e.g., touch or gestures). Allows selection and scrolling of graphical items. In an example, a set of graphical items may be presented on a horizontal (or vertical) line or axis, with each graphical item represented as a specific thumbnail image (or icon, avatar, etc.). At any one time, some of the graphical items within the carousel interface may be hidden. If the user wishes to view hidden graphical items, in an example, the user may input user input (e.g., touch, gesture, etc.) to scroll the graphical items in a specific direction (e.g., left, right, up, or down, etc.). ) can be provided. Subsequent views of the carousel interface are then displayed, where animations are provided or rendered to present one or more additional graphical items for inclusion on the interface, some of the previously presented graphical items being displayed in this subsequent view. You can die in In embodiments, in this manner, a user may navigate back and forth through a set of graphical items in a circular manner. Accordingly, it will be appreciated that a carousel interface can optimize screen space by displaying only a subset of images from a set of graphical items in a circular view.

본원에 기술되는 바와 같이, 증강 현실 콘텐츠 생성기들은 캐러셀 배열(또는 앞서 논의된 바와 같은 다른 인터페이스) 상에 포함될 수 있고, 그로써 증강 현실 콘텐츠 생성기들을 통한 회전을 가능하게 한다. 또한, 증강 현실 콘텐츠 생성기는, 예를 들어, 시간, 날짜, 지오로케이션, 미디어 콘텐츠와 연관된 메타데이터 등을 포함한 다양한 신호에 기초하여 포함되도록 선택될 수 있다. 도 12의 사용자 인터페이스 예들의 캐러셀 배열에서, 각각의 증강 현실 콘텐츠 생성기들은 대상 시스템에 의해 제공되는 이용가능한 증강 현실 콘텐츠 생성기들로부터 선택된다.As described herein, augmented reality content generators may be included on a carousel arrangement (or other interface as previously discussed), thereby enabling rotation through the augmented reality content generators. Additionally, augmented reality content generators may be selected for inclusion based on various signals including, for example, time, date, geolocation, metadata associated with media content, etc. In the carousel arrangement of user interface examples in Figure 12, each augmented reality content generator is selected from the available augmented reality content generators provided by the target system.

이하의 설명에서, 선택가능한 그래픽 항목은 미디어 콘텐츠에 적용되는 각각의 증강 현실 콘텐츠 생성기에 대응한다. 사용자 인터페이스(1200)에 나타낸 바와 같이, 캐러셀 배열에 대응하는 선택가능한 그래픽 항목(1250)은 전자 디바이스(예를 들어, 클라이언트 디바이스(102))의 디스플레이 스크린 내의 선택가능한 그래픽 항목(1251)을 포함한다. 예를 들어, 스와이프 제스처는 클라이언트 디바이스(102)의 터치 스크린을 통해 수신되고, 스와이프 제스처를 수신하는 것에 응답하여, 특정 증강 현실 콘텐츠 생성기의 선택을 용이하게 하기 위해 선택가능한 그래픽 항목들을 통한 내비게이션이 가능하게 된다. 선택가능한 그래픽 항목(1251)은 사용자에 의한 터치 입력(예를 들어, 탭핑, 또는 제스처의 끝에서의 터치 해제)을 통해 선택된다. 이 예에서, 선택가능한 그래픽 항목(1251)은 3D 효과들을 포함하는 특정의 증강 현실 콘텐츠 발생기에 대응한다. In the description below, selectable graphical items correspond to respective augmented reality content generators applied to media content. As shown in user interface 1200, selectable graphical items 1250 corresponding to the carousel arrangement include selectable graphical items 1251 in the display screen of the electronic device (e.g., client device 102). do. For example, a swipe gesture may be received via a touch screen of client device 102 and, in response to receiving the swipe gesture, navigation through selectable graphical items to facilitate selection of a particular augmented reality content generator. This becomes possible. Selectable graphical items 1251 are selected through touch input by the user (e.g., tapping, or releasing the touch at the end of a gesture). In this example, selectable graphical item 1251 corresponds to a specific augmented reality content generator that includes 3D effects.

도 12의 제2 예에서, 선택가능한 그래픽 항목(1251)의 선택시에, 3D 효과들(1260), 3D 효과들(1262) 및 3D 효과들(1264)이 클라이언트 디바이스(102) 상의 디스플레이를 위해 렌더링된다. 이 예에서, 3D 효과들(1260)은 공간적으로 렌더링되고 시청자의 전자 디바이스(예를 들어, 클라이언트 디바이스(102)) 상의 센서 정보(예를 들어, 자이로스코픽 데이터 등)에 응답하여 이동하고 있는 입자-기반 효과들이다. 3D 효과들(1262)은 컬러 필터링 및 셰이더 효과들을 포함할 수 있으며, 이들은 또한 센서 정보에 응답하여 이동할 수 있다. 3D 효과들(1264)은, 일부 예에서, 일부 타입, 형상(들), 컬러, 텍스처 등의 웨어러블 3D 객체 또는 모델을 지칭하는 3D 부착물(예를 들어, 장미의 헤드밴드)을 포함한다. 따라서, 대응하는 증강 현실 콘텐츠 생성기는 이미지 데이터로부터 얼굴 이미지 데이터에 근접하게 렌더링된 3D 객체를 포함한다. 컬러 필터링의 예들은 (예를 들어, 메시지와 함께 포함된 위치 메타데이터에 기초하여) 메시지가 생성되는 위치에 대응하는 위치에 대한 시각과 매칭하는 일광 효과를 포함한다. 셰이더 효과들의 예들은: 스크린 주위를 이동하는 액체, 글림머 효과들, 블룸 효과들, 무지개색 효과들, 및 움직임에 기초하여 배경을 변경하는 것을 포함하지만, 이에 제한되지 않는다.In the second example of FIG. 12 , upon selection of selectable graphics item 1251, 3D effects 1260, 3D effects 1262 and 3D effects 1264 are displayed for display on client device 102. It is rendered. In this example, 3D effects 1260 include particles that are spatially rendered and moving in response to sensor information (e.g., gyroscopic data, etc.) on the viewer's electronic device (e.g., client device 102). -These are based effects. 3D effects 1262 may include color filtering and shader effects, which may also move in response to sensor information. 3D effects 1264 include, in some examples, a 3D attachment (e.g., a headband of roses) that refers to a wearable 3D object or model of some type, shape(s), color, texture, etc. Accordingly, the corresponding augmented reality content generator includes a 3D object rendered approximating the facial image data from the image data. Examples of color filtering include a daylight effect that matches the time of day for a location that corresponds to the location where the message was created (eg, based on location metadata included with the message). Examples of shader effects include, but are not limited to: liquid moving around the screen, glitter effects, bloom effects, rainbow effects, and changing background based on movement.

도 12의 제3 예에서, 사용자는 사용자 인터페이스(1200)에서 3D 효과들(1270) 및 3D 효과들(1272)의 디스플레이에서 클라이언트 디바이스(102)의 움직임을 제공한다. 이 예에서, 3D 효과들(1270), 3D 효과들(1272) 및 3D 효과들(1274)은 각각 제2 예에서 논의된 3D 효과들(1260), 3D 효과들(1262) 및 3D 효과들(1264)의 상이한 버전들이다. 3D 효과들(1270), 3D 효과들(1272) 및 3D 효과들(1274)은 클라이언트 디바이스(102)의 움직임(예를 들어, 자이로스코픽 센서로부터의 모션 데이터)에 응답하여 디스플레이를 위해 렌더링되었고, 전술한 3D 효과들의 뷰는 시청자에 의해 시청되고 있는 장면의 관점을 변경할 수 있는 새롭게 수신된 움직임 데이터에 응답하여 업데이트(예를 들어, 재렌더링)된다. 예를 들어, 입자들, 컬러 필터링, 및/또는 3D 부착물은 움직임 데이터에 응답하여 변경된다. In the third example of FIG. 12 , the user provides movement of the client device 102 in the display of 3D effects 1270 and 3D effects 1272 in user interface 1200 . In this example, 3D effects 1270, 3D effects 1272, and 3D effects 1274 are respectively the 3D effects 1260, 3D effects 1262, and 3D effects discussed in the second example ( 1264) are different versions. 3D effects 1270, 3D effects 1272, and 3D effects 1274 were rendered for display in response to movement of the client device 102 (e.g., motion data from a gyroscopic sensor), The view of the 3D effects described above is updated (e.g., re-rendered) in response to newly received motion data, which may change the perspective of the scene being viewed by the viewer. For example, particles, color filtering, and/or 3D attachments change in response to motion data.

실시예에서, 주어진 클라이언트 디바이스(예를 들어, 클라이언트 디바이스(102))는 각각의 증강 현실 콘텐츠 생성기와 연관된 메타데이터에 기초하여 이용가능한 증강 현실 콘텐츠 생성기로부터 증강 현실 콘텐츠 생성기들의 세트를 선택하고, 메타데이터는 대응하는 증강 현실 콘텐츠 생성기가 적어도 3D 효과를 포함한다는 것을 나타내는 정보를 포함하고, 증강 현실 콘텐츠 생성기들의 세트는 3D 효과를 갖지 않는 적어도 하나의 증강 현실 콘텐츠 생성기 및 3D 효과를 갖는 적어도 하나의 증강 현실 콘텐츠 생성기를 포함한다. 클라이언트 디바이스는 선택가능한 그래픽 항목들로부터 선택가능한 그래픽 항목의 선택을 수신하고, 선택가능한 그래픽 항목은 3D 효과를 포함하는 증강 현실 콘텐츠 생성기를 포함한다. 클라이언트 디바이스는 클라이언트 디바이스의 적어도 하나의 카메라를 사용하여 이미지 데이터 및 깊이 데이터를 캡처한다. 적어도 하나의 카메라는 제1 카메라 및 제2 카메라를 포함하고, 제1 카메라는 제1 초점 거리를 갖고, 제2 카메라는 제2 초점 거리를 갖고, 제1 초점 거리와 제2 초점 거리는 상이하다. 또한, 클라이언트 디바이스는 증강 현실 콘텐츠 생성기에 적어도 부분적으로 기초하여 3D 효과를 이미지 데이터 및 깊이 데이터에 적용한다.In an embodiment, a given client device (e.g., client device 102) selects a set of augmented reality content generators from available augmented reality content generators based on metadata associated with each augmented reality content generator and metadata The data includes information indicating that the corresponding augmented reality content generator includes at least a 3D effect, and the set of augmented reality content generators includes at least one augmented reality content generator without a 3D effect and at least one augmented reality content generator with a 3D effect. Includes a real-life content generator. The client device receives a selection of a selectable graphical item from a list of selectable graphical items, the selectable graphical item including an augmented reality content generator that includes a 3D effect. The client device captures image data and depth data using at least one camera of the client device. The at least one camera includes a first camera and a second camera, the first camera has a first focal length, the second camera has a second focal length, and the first focal length and the second focal length are different. Additionally, the client device applies 3D effects to the image data and depth data based at least in part on the augmented reality content generator.

실시예에서, 디스패리티 맵(disparity map)은 제1 카메라에 의해 캡처된 제1 이미지로부터의 제1 픽셀과 제2 카메라에 의해 캡처된 제2 이미지로부터의 제2 픽셀 사이의 거리에 적어도 부분적으로 기초하여 생성되고, 제1 픽셀과 제2 픽셀은 동일한 객체에 대응한다. 디스패리티 맵은 각각의 픽셀이 제1 이미지로부터의 픽셀과 제2 이미지로부터의 대응하는 픽셀 사이의 거리 값을 포함하는 이미지이다. 디스패리티 맵 내의 제1 객체의 제1 픽셀들은 디스패리티 맵 내의 제2 객체의 제2 픽셀보다 큰 밝기를 갖고, 제1 픽셀은 제2 픽셀의 제2 깊이 값들보다 작은 깊이 값들을 갖는다. 또한, 예에서, 깊이 맵은 디스패리티 맵에 적어도 부분적으로 기초하여 생성된다.In an embodiment, the disparity map is determined at least in part by the distance between the first pixel from the first image captured by the first camera and the second pixel from the second image captured by the second camera. is generated based on the first pixel and the second pixel corresponding to the same object. A disparity map is an image where each pixel contains a distance value between a pixel from a first image and a corresponding pixel from a second image. First pixels of a first object in the disparity map have a brightness greater than second pixels of a second object in the disparity map, and the first pixel has depth values that are less than second depth values of the second pixel. Also, in the example, the depth map is generated based at least in part on the disparity map.

구현에서, 전술한 3D 효과들 및 대응하는 메타데이터는 메시징 시스템(100)의 다른 사용자와 제공(예를 들어, 공유)될 수 있는 메시지에 포함된다. 이 다른 사용자는 메시지를 수신할 수 있고, 액세스될 때, 수신측 클라이언트 디바이스 상에 디스플레이하기 위해 메시지를 볼 수 있다. 수신측 클라이언트 디바이스는, 상기 도 7에 기술된 것과 유사하거나 동일한 컴포넌트를 사용하여, 수신된 메시지에서 지정된 바와 같이 디스플레이를 위해 3D 효과를 렌더링한다. 또한, 이 다른 사용자는, 이에 응답하여, 제공된 움직임에 기초하여 시청자에 의해 시청되고 있는 장면의 관점이 변경되는 3D 효과의 재렌더링을 개시하는 움직임을 수신측 클라이언트 디바이스에 제공할 수 있다. In an implementation, the above-described 3D effects and corresponding metadata are included in a message that can be provided (e.g., shared) with other users of messaging system 100. This other user can receive the message and, when accessed, view the message for display on the receiving client device. The receiving client device renders the 3D effect for display as specified in the received message, using similar or identical components as described in Figure 7 above. Additionally, this other user may, in response, provide a movement to the receiving client device that initiates a re-rendering of the 3D effect in which the perspective of the scene being viewed by the viewer changes based on the provided movement.

도 13은 일부 예시적인 실시예들에 따른, 클라이언트 디바이스의 디스플레이에서 이미지 정보를 캡처하고 3D 메시지를 생성하는 것을 예시하는 예이다.13 is an example illustrating capturing image information and generating a 3D message on a display of a client device, according to some example embodiments.

제1 예에서, 뷰(1300)는 클라이언트 디바이스(예를 들어, 클라이언트 디바이스(102))의 디스플레이 상에 디스플레이하기 위해 제공된다. 뷰(1300)는 (예를 들어, 얼굴을 포함하는) 사용자의 인물사진의 표현의 이미지를 포함한다. 선택가능한 그래픽 요소(1305)가 뷰(1300)에서의 디스플레이를 위해 제공된다. 실시예에서, 선택가능한 그래픽 요소(1305)는 3D 메시지를 생성하고 본 명세서에서 추가로 논의되는 바와 같이 3D 효과들 및 다른 이미지 처리 동작들을 적용하기 위한 증강 현실 콘텐츠 생성기에 대응한다. 선택가능한 그래픽 요소(1305)의 선택 시에, 제2 인터페이스는 디스플레이를 위한 3D 메시지를 생성하기 위해 (본 명세서의 다른 곳에서 설명된 바와 같은) 동작들을 개시하는(예를 들어, 깊이 캡처링 능력들을 갖는 클라이언트 디바이스(102) 상의 전면 카메라 렌즈를 사용하여) 이미지를 캡처하기 위한 인터페이스를 포함하여 프레젠테이션될 수 있다.In a first example, view 1300 is provided for display on a display of a client device (eg, client device 102). View 1300 includes an image of a portrait representation of the user (e.g., including a face). Selectable graphical elements 1305 are provided for display in view 1300. In an embodiment, selectable graphical element 1305 corresponds to an augmented reality content generator for creating 3D messages and applying 3D effects and other image processing operations as discussed further herein. Upon selection of a selectable graphical element 1305, the second interface initiates operations (as described elsewhere herein) to generate a 3D message for display (e.g., depth capturing capabilities). may be presented including an interface for capturing images (using a front-facing camera lens on a client device 102 having an image).

제2 예에서, 뷰(1350)는 이미지의 배경 영역(예를 들어, 사용자의 인물사진 뒤)에 블러링을 도입하는 깊이 효과를 갖는 뷰(1300) 내의 제1 예에서 캡처된 3D 메시지의 디스플레이를 포함한다. 뷰(1350) 내의 이 디스플레이는, 사용자가 클라이언트 디바이스를 움직이고 있는 센서 데이터(예를 들어, 움직임 데이터, 자이로스코픽 센서 데이터 등)의 수신에 응답하여 3D 메시지와 연관된 3D 효과를 렌더링하도록 업데이트될 수 있다. 예에서, 시청 사용자에 대한 클라이언트 디바이스의 상대적 포지션에 따라, 3D 효과는 포지션 변화를 고려하여 클라이언트 디바이스의 디스플레이 상의 프레젠테이션을 위해 업데이트될 수 있다. 예를 들어, 클라이언트 디바이스의 디스플레이가 특정 방식으로 제1 포지션으로 기울어지면, 한 세트의 3D 효과들이 렌더링되어 디스플레이를 위해 제공될 수 있고, 클라이언트 디바이스가 상이한 포지션으로 이동될 때, 제2 세트의 3D 효과들이 렌더링되어 이미지를 업데이트하고 시청 관점에서의 변화를 나타낼 수 있고, 이것은 시청 사용자에게 더 많은 3D 시청 경험을 제공한다.In a second example, view 1350 displays a 3D message captured in the first example within view 1300 with a depth effect that introduces blurring in background areas of the image (e.g., behind a portrait of the user). Includes. This display in view 1350 may be updated to render 3D effects associated with 3D messages in response to receiving sensor data (e.g., motion data, gyroscopic sensor data, etc.) where the user is moving the client device. . In an example, depending on the relative position of the client device to the viewing user, the 3D effect may be updated for presentation on the client device's display to take into account the position change. For example, when the client device's display is tilted in a particular way to a first position, one set of 3D effects may be rendered and presented for display, and when the client device is moved to a different position, a second set of 3D effects may be rendered and presented for display. Effects can be rendered to update the image and indicate changes in the viewing perspective, providing a more 3D viewing experience for the viewing user.

이하의 논의는, 일부 실시예에 따라 (예를 들어, 클라이언트 디바이스 상의 프리뷰로서, 또는 클라이언트 디바이스와 상이한 수신측 디바이스에서) 렌더링하기 위한 주어진 3D 메시지를 (예를 들어, 뷰(1350)에 나타낸 바와 같이) 생성하는데 이용되는 다양한 기술에 관한 것이다.The discussion below describes a given 3D message (e.g., as shown in view 1350) for rendering (e.g., as a preview on a client device, or on a receiving device that is different from the client device), according to some embodiments. It is about the various technologies used to create.

도 14는 일부 예시적인 실시예들에 따른, 원시 깊이 맵 및 패킹된 깊이 맵을 예시하는 예이다. 이하의 예들은 클라이언트 디바이스의 카메라에 의해 제공되는 적어도 원시 입력 데이터(예를 들어, 이미지 데이터 및 깊이 데이터)를 사용하여 주어진 3D 메시지를 발생시키는 것의 일부로서 주어진 클라이언트 디바이스에 의해 수행된다.Figure 14 is an example illustrating a raw depth map and a packed depth map, according to some example embodiments. The examples below are performed by a given client device as part of generating a given 3D message using at least raw input data (e.g., image data and depth data) provided by the client device's camera.

제1 예에서, 원시 깊이 맵(1400)의 예는 클라이언트 디바이스의 카메라에 의해 캡처된 원시 데이터에 기초하여 주어진 클라이언트 디바이스(예를 들어, 클라이언트 디바이스(102))에 의해 생성된다. 이러한 원시 데이터는 이미지 데이터(예를 들어, 사진 이미지) 및 카메라로부터의 깊이 데이터를 포함할 수 있다. 예에서, 클라이언트 디바이스는 단일 채널 부동 소수점 텍스처를 정밀도를 손실하지 않고 처리의 다수의 패스를 가능하게 하는 원시 깊이 맵으로 변환한다. 클라이언트 디바이스는 단일 채널 부동 소수점 텍스처를 패킹된 깊이 맵(1450)으로서 제2 예에 예시된 다수의 더 낮은 정밀도 채널로 확산(예를 들어, 데이터의 부분들을 전송 또는 변환)할 수 있다. 실시예에서, 원시 깊이 맵(1400) 및 패킹된 깊이 맵(1450)은 클라이언트 디바이스의 카메라에 의해 캡처된 원시 이미지 데이터보다 더 낮은 해상도(예를 들어, 더 낮은 수의 전체 픽셀)를 갖는다.In a first example, an example raw depth map 1400 is generated by a given client device (e.g., client device 102) based on raw data captured by a camera of the client device. This raw data may include image data (eg, photographic images) and depth data from a camera. In an example, a client device converts a single channel floating point texture to a raw depth map enabling multiple passes of processing without loss of precision. A client device can spread (e.g., transmit or transform portions of data) a single channel floating point texture to multiple lower precision channels, illustrated in the second example as packed depth map 1450. In an embodiment, raw depth map 1400 and packed depth map 1450 have a lower resolution (e.g., a lower number of total pixels) than the raw image data captured by the client device's camera.

클라이언트 디바이스는 주어진 이미지 내의 주어진 대상(예를 들어, 사용자의 인물사진)을 갖는 전경을 동일한 이미지 내의 배경으로부터 분리하는 동작들을 수행한다. 실시예에서, 클라이언트 디바이스는 적어도 전술한 원시 깊이 맵(1400) 또는 패킹된 깊이 맵(1450)을 사용하여 분할 마스크를 생성한다. 대안적으로, 예에서, 분할 마스크는 클라이언트 디바이스의 능력들이 이미지 캡처링 프로세스의 일부로서 분할 마스크를 생성하는 것을 포함할 때 카메라에 의해 캡처된 원시 데이터에 포함될 수 있다. The client device performs operations to separate a foreground with a given object (e.g., a portrait of a user) within a given image from a background within the same image. In an embodiment, the client device generates a segmentation mask using at least the raw depth map 1400 or the packed depth map 1450 described above. Alternatively, in an example, a segmentation mask may be included in raw data captured by a camera when the capabilities of the client device include generating a segmentation mask as part of the image capturing process.

분할 마스크를 사용하여, 클라이언트 디바이스는 확산 기반 인페인팅 기술을 수행하여 이미지 내의 배경으로부터 전경 피사체를 제거함으로써, (예를 들어, 전경 피사체가 없는) 배경 인페인팅 이미지를 생성한다. 예에서, 확산 기반 인페인팅 기술은 경계로부터 누락 영역의 내부로 이미지 콘텐츠를 전파함으로써 누락 영역(예를 들어, 전경 피사체)을 채우려고 시도한다. 이러한 방식으로 전경 피사체를 제거하는 것이 유리한데, 그 이유는 적어도, 3D 메시지를 렌더링한 후에, 클라이언트 디바이스의 카메라가 이동될 때, 전경 피사체가 제거되지 않을 때(예를 들어, 상기 동작들을 수행하지 않을 때) 피사체의 이미지(의 일부분)의 "고스트(ghost)"가 배경에서 보여질 수 있기 때문이다(예를 들어, 바람직하지 않은 시각적 효과를 가져올 수 있음). Using the segmentation mask, the client device performs a diffusion-based inpainting technique to remove foreground objects from the background in the image, thereby creating a background inpainted image (e.g., without foreground objects). In an example, diffusion-based inpainting techniques attempt to fill missing areas (e.g., foreground objects) by propagating image content from the boundary into the interior of the missing area. Removing the foreground object in this way is advantageous, at least because, after rendering the 3D message, when the camera of the client device is moved, the foreground object is not removed (e.g., when the above operations are not performed). otherwise, a "ghost" of (part of) the image of the subject may be visible in the background (e.g., resulting in an undesirable visual effect).

또한, 3D 메시지를 렌더링한 후에, 클라이언트 디바이스가 이동되고(큰) 깊이 변화(예를 들어, 사용자의 얼굴의 측면의 일부에 대응하는 전경과 배경 사이)를 갖는 이미지의 영역들에서, 분할 마스크 및 인페인팅 기술이 본 명세서에 설명된 바와 같이 수행되지 않으면, 스트레칭 아티팩트는 사용자의 얼굴을 갖는 이미지의 부분에 나타날 수 있고, 사용자의 얼굴 또는 머리의 경계는 이미지의 전경과 배경 사이에서 스미어링(smear)된 것으로 나타날 수 있다. 또한, 본 명세서에 설명된 기술들을 수행하지 않고, 이미지의 전경과 배경 사이의 단단한(예를 들어, 시각적으로 뚜렷한) 경계는 클라이언트 디바이스가 이동될 때 원하지 않는 시각적 효과로서 나타날 수 있어, 이미지가 더 인공적이고, 비현실적이고, 왜곡되고, 과장되게 보이게 한다.Additionally, after rendering the 3D message, the client device is moved and in areas of the image that have (large) depth changes (e.g., between foreground and background corresponding to part of the side of the user's face), a segmentation mask and If the inpainting technique is not performed as described herein, stretching artifacts may appear in the portion of the image that contains the user's face, and the boundaries of the user's face or head may appear smeared between the foreground and background of the image. ) may appear as such. Additionally, without performing the techniques described herein, a hard (e.g., visually distinct) boundary between the foreground and background of an image may appear as an undesirable visual effect when the client device is moved, causing the image to become more distorted. It makes it look artificial, unrealistic, distorted, and exaggerated.

도 15는 일부 예시적인 실시예들에 따른, 깊이 인페인팅 마스크 및 깊이 인페인팅을 예시하는 예이다. 도 15의 예들은 (예를 들어, 도 14의 동작들이 수행된 후에) 도 14의 예들과 함께 수행될 수 있다.Figure 15 is an example illustrating a depth inpainting mask and depth inpainting, according to some example embodiments. The examples of FIG. 15 may be performed in conjunction with the examples of FIG. 14 (e.g., after the operations of FIG. 14 are performed).

실시예에서, 클라이언트 디바이스는, 깊이 인페인팅 마스크를 사용하여, (예를 들어, 배경 인페인팅에 관하여 전술된 바와 같은) 동일한 확산 기반의 인페인팅 기술을 수행하여 깊이 맵(예를 들어, 패킹된 깊이 맵(1450))의 전경 경계를 확장한다. 제1 예에 도시된 바와 같이, 깊이 인페인팅 마스크(1500)는 적어도 깊이 맵을 사용하여 생성된다. 예에서, 깊이 인페인팅 마스크(1500)는, 경계 검출, 및 깊이 맵 내의 각각의 픽셀의 분류를 수행하는 심층 컨볼루션 신경망, 분할을 위한 인코더-디코더 아키텍처, 완전 컨볼루션 네트워크, 피쳐 맵, 디컨볼루션 네트워크, 무감독 피쳐 학습 등의 머신 학습 기술을 포함한 깊이 맵에 적용되는 접근법을 사용하여 결정될 수 있다.In an embodiment, a client device performs the same diffusion-based inpainting technique (e.g., as described above for background inpainting), using a depth inpainting mask to create a depth map (e.g., a packed Extend the foreground boundary of the depth map (1450). As shown in the first example, the depth inpainting mask 1500 is created using at least a depth map. In an example, the depth inpainting mask 1500 may be a deep convolutional neural network that performs edge detection and classification of each pixel in the depth map, an encoder-decoder architecture for segmentation, a fully convolutional network, a feature map, and a deconvolution network. It can be determined using approaches applied to depth maps, including machine learning techniques such as solution networks and unsupervised feature learning.

제2 예에서, 이미지 데이터(1550)(예를 들어, 인페인팅된 깊이 맵)는 깊이 인페인팅 마스크(1500)를 사용한 깊이 맵의 인페인팅의 결과를 도시한다. 이것은, 예에서, 머리카락, 귀, 또는 어깨의 외관을 개선(예를 들어, 더 정확하게 렌더링)하기 위해 수행된다. 앞서 언급된 바와 같이, 깊이 맵은 이미지 데이터(예를 들어, RBG 이미지)보다 낮은 해상도일 수 있고, 깊이 맵에 적용되는 3D 효과 또는 이미지 처리 동작은 낮은 해상도에 의해 제한될 수 있다. 머리카락과 같은 미세한 세부사항들을 더 잘 보존하기 위해, 전술한 깊이 맵 인페인팅 기술이 대상 기술에 의해 제공된다. 깊이 맵 및 깊이 인페인팅 마스크(1500)를 사용함으로써, 클라이언트 디바이스는 이미지 데이터(1550)에 도시된 바와 같이 이미지의 특정 영역들(예를 들어, 누락된 또는 불량 데이터를 갖는 영역들)을 결정하고 채울 수 있다.In a second example, image data 1550 (e.g., an inpainted depth map) shows the result of inpainting a depth map using a depth inpainting mask 1500. This is done, in examples, to improve the appearance (e.g., render more accurately) of hair, ears, or shoulders. As previously mentioned, the depth map may be of lower resolution than the image data (e.g., RBG image), and 3D effects or image processing operations applied to the depth map may be limited by the low resolution. To better preserve fine details such as hair, the depth map inpainting technique described above is provided by Target Technology. By using the depth map and depth inpainting mask 1500, the client device determines specific areas of the image (e.g., areas with missing or bad data) as shown in image data 1550 and It can be filled.

예에서, 클라이언트 디바이스는 깊이 데이터에 적어도 부분적으로 기초하여 깊이 맵을 결정하고, 얼굴 깊이 데이터를 포함하는 깊이 맵의 영역에 대응하는 깊이 인페인팅 마스크를 생성하고; 적어도 생성된 깊이 인페인팅 마스크를 사용하여 깊이 맵의 깊이 맵 인페인팅을 수행한다. 깊이 맵은 실시예에서 패킹된 깊이 맵일 수 있다.In an example, the client device determines a depth map based at least in part on the depth data and generates a depth inpainting mask corresponding to an area of the depth map that includes facial depth data; At least perform depth map inpainting of the depth map using the generated depth inpainting mask. The depth map may be a packed depth map in an embodiment.

실시예에서, 클라이언트 디바이스는 깊이 맵에서 각각의 깊이 픽셀의 이웃 주위에 맞는 평면을 결정함으로써 깊이 법선 맵을 생성한다. 이것은 예를 들어, 흥미로운 조명 효과들 및 미화 효과들을 달성하기 위해 광이 표면과 어떻게 상호작용해야 하는지를 결정하는 데 유리하다. 예에서, 생성된 깊이 법선 맵은 저해상도 이미지이지만, 3D 메시지에서 이러한 효과들을 제공하기 위해 효과적으로 이용될 수 있다. 예에서, 법선 맵은 (예를 들어, 3D 공간에서 X, Y 및 Z 축에 대응하는) RGB 정보를 사용하고, RGB 정보는 클라이언트 디바이스에 의해 이용되어 각각의 다각형에 대해 표면 법선(또는 "법선")이 배향되는 방향을 결정할 수 있고, 여기서, 클라이언트 디바이스는 표면 법선의 결정된 배향을 사용하여 다각형을 음영처리하는 방법을 결정한다. 달리 말하면, 법선 맵은 각각의 픽셀에서 방향을 저장하는 이미지이고, 방향들은 법선들로 불린다. 이미지의 적색, 녹색, 및 청색 채널들은 각각의 픽셀의 법선의 방향을 제어하기 위해 클라이언트 디바이스에 의해 사용될 수 있고, 법선 맵은 저해상도 이미지에 대한 고해상도 상세사항을 모방하는데 사용될 수 있다.In an embodiment, a client device generates a depth normal map by determining a plane that fits around the neighborhood of each depth pixel in the depth map. This is advantageous for determining how light should interact with the surface to achieve interesting lighting effects and beautification effects, for example. In the example, the generated depth normal map is a low resolution image, but can be used effectively to provide these effects in a 3D message. In the example, the normal map uses RGB information (e.g., corresponding to the ") can determine the direction in which the polygon is oriented, where the client device uses the determined orientation of the surface normal to determine how to shade the polygon. In other words, a normal map is an image that stores the direction at each pixel, and the directions are called normals. The red, green, and blue channels of the image can be used by the client device to control the direction of each pixel's normal, and the normal map can be used to mimic high-resolution details for a low-resolution image.

예에서, 클라이언트 디바이스는 이미지 데이터의 얼굴 이미지 데이터에 조명 효과를 적용하기 위한 깊이 맵의 법선 맵을 생성하고, 법선 맵에 적어도 부분적으로 기초하여 얼굴 이미지 데이터에 조명 효과를 적용한다. 예에서, 조명 효과는 적어도 2개의 상이한 컬러를 포함하고, 2개의 상이한 컬러로부터의 제1 컬러는 얼굴 이미지 데이터의 제1 부분에 적용되고, 2개의 상이한 컬러로부터의 제2 컬러는 얼굴 이미지 데이터의 제2 이미지에 적용된다.In an example, the client device generates a normal map of the depth map for applying a lighting effect to facial image data in the image data and applies a lighting effect to the facial image data based at least in part on the normal map. In an example, the lighting effect includes at least two different colors, a first color from the two different colors is applied to a first portion of the facial image data, and a second color from the two different colors is applied to the facial image data. Applies to the second image.

생성된 법선 맵을 사용하여, 클라이언트 디바이스는 미화 기술들(본 명세서에서 추가로 논의됨), 조명 효과들, 및 다른 이미지 처리 기술들을 적용하여 3D 메시지의 시청 사용자에게 설득력 있고 자연스럽게 보이는 3D 효과들을 생성할 수 있다. 클라이언트 디바이스는 생성된 정상 맵을 수반하는 전술한 기술들에 적어도 기초하여 후처리된 전경 이미지를 생성한다.Using the generated normal map, the client device applies beautification techniques (discussed further herein), lighting effects, and other image processing techniques to create 3D effects that appear convincing and natural to the viewing user of the 3D message. can do. The client device generates a post-processed foreground image based at least on the techniques described above involving the generated normal map.

클라이언트 디바이스는 (예를 들어, 전술한 바와 같이) 후처리된 전경 이미지, 후처리된 깊이 맵, 인페이팅된 배경 이미지, 및 포함된 다른 메타데이터와 같은 다양한 자산들을 포함하는 3D 메시지를 생성한다. 실시예에서, 3D 메시지의 수신측 디바이스는 전경 메시 및 배경 메시를 생성함으로써 3D 메시지의 뷰를 렌더링하기 위해 포함된 자산들을 이용할 수 있다. 전경 메시는 후처리된 깊이 맵 및 카메라 고유 메타데이터(예를 들어, 앞서 논의된 렌즈 정보 등)에 관련된 메타데이터를 사용하여 생성될 수 있다. 배경 메시는 적어도 인페이팅된 배경 이미지를 사용하여 생성될 수 있다.The client device generates a 3D message that includes various assets such as a post-processed foreground image, a post-processed depth map, an inpainted background image, and other metadata included (e.g., as described above). . In embodiments, a device receiving a 3D message may use the included assets to render a view of the 3D message by creating a foreground mesh and a background mesh. The foreground mesh may be generated using metadata related to the post-processed depth map and camera-specific metadata (e.g., lens information discussed earlier, etc.). The background mesh can be created using at least an inpainted background image.

도 16 내지 도 21의 이하의 논의는 전술한 기법들 중 적어도 일부를 이용하여 주어진 클라이언트 디바이스(예를 들어, 클라이언트 디바이스(102)) 상에 디스플레이하기 위해 프레젠테이션되는 3D 효과 및 기타 그래픽 효과의 예이다.The following discussion of Figures 16-21 are examples of 3D effects and other graphical effects presented for display on a given client device (e.g., client device 102) using at least some of the techniques described above. .

도 16은 일부 예시적인 실시예들에 따라, 입자들, 그래픽 객체(예를 들어, 안경) 상의 반사, 및 움직임 데이터(예를 들어, 자이로스코픽 센서로부터의 모션 데이터)에 응답하여 렌더링되는 3D 부착물을 예시하는 3D 효과들의 예, 및 움직임 데이터에 응답하여 렌더링되는 포스트 효과들 및 동적 3D 부착물을 예시하는 3D 효과들의 예이다.16 shows a 3D attachment rendered in response to particles, reflections on a graphical object (e.g., glasses), and motion data (e.g., motion data from a gyroscopic sensor), according to some example embodiments. and examples of 3D effects illustrating dynamic 3D attachments and post effects rendered in response to motion data.

도 16의 제1 예에서, 3D 효과들의 뷰(1600)는 시청자에 의해 시청되고 있는 장면의 관점을 변경할 수 있는 새롭게 수신된 움직임 데이터에 응답하여 업데이트(예를 들어, 재렌더링)된다. 예를 들어, 입자들, 반사 및/또는 3D 부착물은 움직임 데이터에 응답하여 뷰(1600)에서 변경된다. In the first example of Figure 16, the view 1600 of 3D effects is updated (e.g., re-rendered) in response to newly received motion data that may change the perspective of the scene being viewed by the viewer. For example, particles, reflections and/or 3D attachments change in view 1600 in response to motion data.

추가로 도시된 바와 같이, 3D 효과들의 제2 예는 움직임 데이터(예를 들어, 자이로스코픽 센서로부터의 모션 데이터)에 응답하여 렌더링되는 뷰(1650) 포스트 효과들 및 동적 3D 부착물을 예시한다. 도 13의 이 제2 예에서, 3D 효과들의 뷰는 시청자에 의해 시청되고 있는 장면의 관점을 변경할 수 있는 새롭게 수신된 움직임 데이터에 응답하여 업데이트(예를 들어, 재렌더링)된다. 예를 들어, 3D 텍스트(예를 들어, "산타 모니카(Santa Monica)")는 움직임 데이터에 응답하여 포지션들을 변경한다.As further shown, a second example of 3D effects illustrates dynamic 3D attachments and view 1650 post effects that are rendered in response to motion data (e.g., motion data from a gyroscopic sensor). In this second example of Figure 13, the view of 3D effects is updated (e.g., re-rendered) in response to newly received motion data that may change the perspective of the scene being viewed by the viewer. For example, 3D text (eg, “Santa Monica”) changes positions in response to motion data.

도 17은 일부 예시적인 실시예들에 따른, 움직임 데이터에 응답하여 렌더링되는 동적 인공 조명을 예시하는 3D 효과의 예, 및 움직임 데이터에 응답하여 렌더링되는 안경, 3D 부착물, 및 애니메이션화된 스프라이트 배경 상의 반사/굴절을 예시하는 3D 효과들의 예이다.17 is an example of a 3D effect illustrating dynamic artificial lighting rendered in response to motion data, and reflections on glasses, 3D attachments, and an animated sprite background rendered in response to motion data, according to some example embodiments. /This is an example of 3D effects that illustrate refraction.

도 17의 제1 예에서, 3D 효과들의 뷰(1700)는 시청자에 의해 시청되고 있는 장면의 관점을 변경할 수 있는 새롭게 수신된 움직임 데이터에 응답하여 업데이트(예를 들어, 재렌더링)된다. 예를 들어, 얼굴 상의 인공 조명은 움직임 데이터에 응답하여 변한다.In the first example of Figure 17, the view 1700 of 3D effects is updated (e.g., re-rendered) in response to newly received motion data that may change the perspective of the scene being viewed by the viewer. For example, artificial lighting on the face changes in response to motion data.

도 17의 3D 효과의 제2 예는, 움직임 데이터(예를 들어, 자이로스코픽 센서로부터의 모션 데이터)에 응답하여 렌더링되는 안경 상의 반사/굴절, 3D 부착물, 및 애니메이션화된 스프라이트 배경을 보여주는 뷰(1750)를 예시한다. 도 17의 제2 예에서, 3D 효과들의 뷰는 시청자에 의해 시청되고 있는 장면의 관점을 변경할 수 있는 새롭게 수신된 움직임 데이터에 응답하여 업데이트(예를 들어, 재렌더링)된다. 예를 들어, 안경 상의 반사/굴절 및 애니메이션화된 스프라이트 배경은 움직임 데이터에 응답하여 변한다.A second example of a 3D effect in FIG. 17 is a view 1750 showing reflections/refractions on glasses, 3D attachments, and animated sprite backgrounds rendered in response to motion data (e.g., motion data from a gyroscopic sensor). ) is an example. In the second example of Figure 17, the view of 3D effects is updated (e.g., re-rendered) in response to newly received motion data that may change the perspective of the scene being viewed by the viewer. For example, reflections/refractions on glasses and animated sprite backgrounds change in response to motion data.

도 18은 일부 예시적인 실시예들에 따른, 제어된 입자 시스템(예를 들어, 애니메이션화된 발사체), 및 움직임 데이터에 응답하여 렌더링되는 2D 및 3D 부착물들을 예시하는 3D 효과들의 예의 예, 및 움직임 데이터(예를 들어, 자이로스코픽 센서로부터의 모션 데이터)에 응답하여 렌더링되는 3D 부착물들(예를 들어, 토끼 귀)에 대한 공동 애니메이션을 예시하는 3D 효과들의 예이다.18 is an example of 3D effects illustrating a controlled particle system (e.g., an animated projectile), and 2D and 3D attachments rendered in response to motion data, according to some example embodiments. This is an example of 3D effects that illustrate joint animation for 3D attachments (e.g., bunny ears) rendered in response to (e.g., motion data from a gyroscopic sensor).

도 18의 제1 예에서, 3D 효과들의 뷰(1800)는 시청자에 의해 시청되고 있는 장면의 관점을 변경할 수 있는 새롭게 수신된 움직임 데이터에 응답하여 업데이트(예를 들어, 재렌더링)된다. 예를 들어, 제어된 입자 시스템의 애니메이션이 변하고 부착물들은 움직임 데이터에 응답하여 이동된다.In the first example of Figure 18, the view 1800 of 3D effects is updated (e.g., re-rendered) in response to newly received motion data that may change the perspective of the scene being viewed by the viewer. For example, the animation of a controlled particle system changes and attachments move in response to motion data.

도 18의 제2 예에서, 3D 효과들의 뷰(1850)는 시청자에 의해 시청되고 있는 장면의 관점을 변경할 수 있는 새롭게 수신된 움직임 데이터에 응답하여 업데이트(예를 들어, 재렌더링)된다. 예를 들어, 3D 부착물의 애니메이션은 움직임 데이터에 응답하여 변한다.In the second example of Figure 18, the view 1850 of 3D effects is updated (e.g., re-rendered) in response to newly received motion data that may change the perspective of the scene being viewed by the viewer. For example, the animation of a 3D attachment changes in response to movement data.

도 19는 일부 예시적인 실시예들에 따른, 스프라이트들, 안경 상에서의 반사, 움직임 데이터(예를 들어, 자이로스코픽 센서로부터의 모션 데이터)에 응답하여 렌더링되는 2D 및 3D 부착물들을 예시하는 3D 효과들의 예, 및 움직임 데이터에 응답하여 렌더링되는 안경, 입자들, 및 애니메이션화된 배경 상에서의 반사/굴절을 예시하는 3D 효과들의 예이다.19 shows 3D effects illustrating sprites, reflections on glasses, and 2D and 3D attachments rendered in response to motion data (e.g., motion data from a gyroscopic sensor), according to some example embodiments. Examples of 3D effects illustrating glasses, particles, and reflections/refractions on an animated background that are rendered in response to motion data.

도 19의 제1 예에서, 3D 효과들의 뷰(1900)는 시청자에 의해 시청되고 있는 장면의 관점을 변경할 수 있는 새롭게 수신된 움직임 데이터에 응답하여 업데이트(예를 들어, 재렌더링)된다. 예를 들어, 안경, 스프라이트 및 부착물에 대한 반사는 움직임 데이터에 응답하여 변한다.In the first example of Figure 19, the view 1900 of 3D effects is updated (e.g., re-rendered) in response to newly received motion data that may change the perspective of the scene being viewed by the viewer. For example, reflections on glasses, sprites, and attachments change in response to movement data.

도 19의 제2 예에서, 3D 효과들의 뷰(1950)는 움직임 데이터에 응답하여 변하는 안경, 입자들 및 배경 상에서의 반사/굴절을 포함한다.In the second example of Figure 19, the view 1950 of 3D effects includes reflections/refractions on the glasses, particles, and background that change in response to motion data.

도 20은 일부 예시적인 실시예들에 따른, 움직임 데이터(예를 들어, 자이로스코픽 센서로부터의 모션 데이터)에 응답하여 렌더링되는 사용자의 얼굴을 가리는 부착물 및 애니메이션화된 전경을 예시하는 3D 효과들의 예, 및 움직임 데이터에 응답하여 렌더링되는 동적 인공 조명, 입자들, 및 안경 상에서의 반사/굴절을 예시하는 3D 효과들의 예이다.20 is an example of 3D effects illustrating an animated foreground and an attachment obscuring a user's face rendered in response to motion data (e.g., motion data from a gyroscopic sensor), according to some example embodiments. and examples of 3D effects illustrating dynamic artificial lighting, particles, and reflections/refractions on glasses rendered in response to motion data.

도 20의 제1 예에서, 뷰(2000)는 움직임 데이터에 응답하여 전경 변화에 대한 폐색 효과(예를 들어, 얼음 또는 동결 효과)를 도시한다.In the first example of Figure 20, view 2000 shows an occlusion effect (eg, an ice or freeze effect) on a foreground change in response to motion data.

도 20의 제2 예에서, 뷰(2050)는 움직임 데이터(예를 들어, 자이로스코픽 센서로부터의 모션 데이터)에 응답하여 렌더링되는 안경 상에서의 동적 인공 조명, 입자들, 및 반사/굴절을 예시하는 3D 효과들을 도시한다. In the second example of Figure 20, view 2050 illustrates dynamic artificial lighting, particles, and reflection/refraction on glasses rendered in response to motion data (e.g., motion data from a gyroscopic sensor). Shows 3D effects.

도 21은 일부 예시적인 실시예들에 따른, 움직임 데이터에 응답하여 렌더링되는 리터치, 포스트 효과, 3D 부착물 및 입자들을 예시하는 3D 효과의 예, 및 움직임 데이터에 응답하여 렌더링되는 3D 부착물, 스프라이트들 및 입자를 예시하는 3D 효과들의 예이다.21 is an example of a 3D effect illustrating retouches, post effects, 3D attachments and particles rendered in response to motion data, and 3D attachments, sprites and particles rendered in response to motion data, according to some example embodiments. This is an example of 3D effects illustrating particles.

도 21의 제1 예에서, 뷰(2100)는 스프라이트들(예를 들어, 꽃들로부터의 꽃잎)이 애니메이션화되고, 입자들이 움직임 데이터에 응답하여 변경되는 것을 도시한다.In the first example of Figure 21, view 2100 shows sprites (e.g., petals from flowers) being animated and particles changing in response to motion data.

도 21의 제2 예에서, 뷰(2150)는 애니메이션화되고 포지션을 변경하고, 스프라이트들이 애니메이션화되고, 입자들이 움직임 데이터에 응답하여 변경되는 3D 부착물(예를 들어, 마스크)을 도시한다.In the second example of Figure 21, view 2150 shows a 3D attachment (e.g., a mask) that is animated and changes position, sprites are animated, and particles change in response to motion data.

도 22는 본 명세서에 설명된 다양한 하드웨어 아키텍처들과 함께 사용될 수 있는 예시적인 소프트웨어 아키텍처(2206)를 예시하는 블록도이다. 도 22는 소프트웨어 아키텍처의 비제한적 예이고, 본 명세서에 설명된 기능성을 용이하게 하기 위해 많은 다른 아키텍처들이 구현될 수 있다는 것이 인식될 것이다. 소프트웨어 아키텍처(2206)는, 특히, 프로세서들(2304), 메모리(2314), 및 (입력/출력) I/O 컴포넌트들(2318)을 포함하는 도 23의 머신(2300)과 같은 하드웨어 상에서 실행될 수 있다. 대표적인 하드웨어 계층(2252)이 예시되어 있고, 예를 들어, 도 23의 머신(2300)을 나타낼 수 있다. 대표적인 하드웨어 계층(2252)은 연관된 실행가능 명령어들(2204)을 갖는 처리 유닛(2254)을 포함한다. 실행가능 명령어들(2204)은 본 명세서에 설명된 방법들, 컴포넌트들 등의 구현을 포함하는, 소프트웨어 아키텍처(2206)의 실행가능 명령어들을 나타낸다. 하드웨어 계층(2252)은 메모리 및/또는 스토리지 모듈들인 메모리/스토리지(2256)를 또한 포함하고, 이들도 실행가능 명령어들(2204)을 갖는다. 하드웨어 계층(2252)은 또한 다른 하드웨어(2258)를 포함할 수 있다.Figure 22 is a block diagram illustrating an example software architecture 2206 that may be used with the various hardware architectures described herein. 22 is a non-limiting example of a software architecture, and it will be appreciated that many other architectures may be implemented to facilitate the functionality described herein. Software architecture 2206 may be executed on hardware, such as machine 2300 of FIG. 23, which includes, inter alia, processors 2304, memory 2314, and (input/output) I/O components 2318. there is. A representative hardware layer 2252 is illustrated and may represent, for example, machine 2300 of FIG. 23. Representative hardware layer 2252 includes processing unit 2254 with associated executable instructions 2204. Executable instructions 2204 represent executable instructions of software architecture 2206, including implementation of methods, components, etc. described herein. Hardware layer 2252 also includes memory/storage 2256, which are memory and/or storage modules, which also have executable instructions 2204. Hardware layer 2252 may also include other hardware 2258.

도 22의 예시적인 아키텍처에서, 소프트웨어 아키텍처(2206)는, 각각의 계층이 특정 기능성을 제공하는, 계층들의 스택으로서 개념화될 수 있다. 예를 들어, 소프트웨어 아키텍처(2206)는 운영 체제(2202), 라이브러리들(2220), 프레임워크들/미들웨어(2218), 애플리케이션들(2216) 및 프레젠테이션 계층(2214)과 같은 계층들을 포함할 수 있다. 동작중에, 애플리케이션들(2216) 및/또는 계층들 내의 다른 컴포넌트들은 소프트웨어 스택을 통해 API 호출들(2208)을 기동시키고, API 호출들(2208)에 응답하여 하나 이상의 메시지(2212)로서 응답을 수신할 수 있다. 예시된 계층들은 본질적으로 대표적인 것이며 소프트웨어 아키텍처들 모두가 모든 계층들을 갖는 것은 아니다. 예를 들어, 일부 모바일 또는 특수 목적 운영 체제들은 프레임워크들/미들웨어(2218)를 제공하지 않을 수도 있지만, 다른 것들은 그러한 계층을 제공할 수도 있다. 다른 소프트웨어 아키텍처들은 추가적인 또는 상이한 계층들을 포함할 수 있다.In the example architecture of Figure 22, software architecture 2206 can be conceptualized as a stack of layers, with each layer providing specific functionality. For example, software architecture 2206 may include layers such as operating system 2202, libraries 2220, frameworks/middleware 2218, applications 2216, and presentation layer 2214. . In operation, applications 2216 and/or other components within the layers invoke API calls 2208 through the software stack and receive responses as one or more messages 2212 in response to the API calls 2208. can do. The layers illustrated are representative in nature and not all software architectures have all layers. For example, some mobile or special purpose operating systems may not provide frameworks/middleware 2218, while others may provide such a layer. Other software architectures may include additional or different layers.

운영 체제(2202)는 하드웨어 리소스들을 관리하고 공통 서비스들을 제공할 수도 있다. 운영 체제(2202)는, 예를 들어, 커널(2222), 서비스들(2224), 및 드라이버들(2226)을 포함할 수 있다. 커널(2222)은 하드웨어와 다른 소프트웨어 계층들 사이에서 추상화 계층(abstraction layer)으로서 역할을 할 수 있다. 예를 들어, 커널(2222)은 메모리 관리, 프로세서 관리(예를 들어, 스케줄링), 컴포넌트 관리, 네트워킹, 보안 설정 등을 담당할 수 있다. 서비스들(2224)은 다른 소프트웨어 계층들을 위한 다른 공통 서비스들을 제공할 수 있다. 드라이버들(2226)은 기본 하드웨어(underlying hardware)를 제어하거나 그와 인터페이싱하는 것을 담당한다. 예를 들어, 드라이버들(2226)은 하드웨어 구성에 따라 디스플레이 드라이버, 카메라 드라이버, Bluetooth® 드라이버, 플래시 메모리 드라이버, 직렬 통신 드라이버(예를 들어, 범용 직렬 버스(USB) 드라이버), Wi-Fi® 드라이버, 오디오 드라이버, 전력 관리 드라이버 등을 포함한다.Operating system 2202 may manage hardware resources and provide common services. Operating system 2202 may include, for example, a kernel 2222, services 2224, and drivers 2226. Kernel 2222 may serve as an abstraction layer between hardware and other software layers. For example, the kernel 2222 may be responsible for memory management, processor management (e.g., scheduling), component management, networking, security settings, etc. Services 2224 may provide other common services for different software layers. Drivers 2226 are responsible for controlling or interfacing with the underlying hardware. For example, drivers 2226 may include a display driver, camera driver, Bluetooth® driver, flash memory driver, serial communication driver (e.g., Universal Serial Bus (USB) driver), Wi-Fi® driver, depending on the hardware configuration. , audio drivers, power management drivers, etc.

라이브러리들(2220)은 애플리케이션들(2216) 및/또는 다른 컴포넌트들 및/또는 계층들에 의해 사용되는 공통 인프라스트럭처를 제공한다. 라이브러리들(2220)은 다른 소프트웨어 컴포넌트들이 기본 운영 체제(2202) 기능성(예를 들어, 커널(2222), 서비스들(2224) 및/또는 드라이버들(2226))과 직접 인터페이스하는 것보다 더 쉬운 방식으로 작업들을 수행할 수 있게 하는 기능성을 제공한다. 라이브러리들(2220)은 메모리 할당 기능들, 문자열 조작 기능들, 수학 기능들 등과 같은 기능들을 제공할 수 있는 시스템 라이브러리들(2244)(예를 들어, C 표준 라이브러리)를 포함할 수 있다. 게다가, 라이브러리들(2220)은 미디어 라이브러리들(예를 들어, MPREG4, H.264, MP3, AAC, AMR, JPG, PNG와 같은 다양한 미디어 포맷의 프레젠테이션 및 조작을 지원하기 위한 라이브러리들), 그래픽 라이브러리들(예를 들어, 디스플레이 상의 그래픽 콘텐츠에서 2D 및 3D를 렌더링하기 위해 사용될 수 있는 OpenGL 프레임워크), 데이터베이스 라이브러리들(예를 들어, 다양한 관계형 데이터베이스 기능들을 제공할 수 있는 SQLite), 웹 라이브러리들(예를 들어, 웹 브라우징 기능성을 제공할 수 있는 WebKit) 등과 같은 API 라이브러리들(2246)을 포함할 수 있다. 라이브러리들(2220)은 또한 많은 다른 API들을 애플리케이션들(2216) 및 다른 소프트웨어 컴포넌트들/모듈들에 제공하는 매우 다양한 다른 라이브러리들(2248)을 포함할 수 있다.Libraries 2220 provide common infrastructure used by applications 2216 and/or other components and/or layers. Libraries 2220 may be used in an easier manner than for other software components to interface directly with base operating system 2202 functionality (e.g., kernel 2222, services 2224, and/or drivers 2226). It provides functionality that allows you to perform tasks. Libraries 2220 may include system libraries 2244 (e.g., C standard library) that may provide functions such as memory allocation functions, string manipulation functions, math functions, etc. In addition, libraries 2220 include media libraries (e.g., libraries to support presentation and manipulation of various media formats such as MPREG4, H.264, MP3, AAC, AMR, JPG, PNG), graphics libraries (e.g., the OpenGL framework, which can be used to render 2D and 3D in graphical content on a display), database libraries (e.g., SQLite, which can provide various relational database functions), web libraries (e.g., For example, it may include API libraries 2246 such as WebKit), which can provide web browsing functionality. Libraries 2220 may also include a wide variety of other libraries 2248 that provide many different APIs to applications 2216 and other software components/modules.

프레임워크들/미들웨어(2218)(때때로 미들웨어라고도 지칭됨)는 애플리케이션들(2216) 및/또는 다른 소프트웨어 컴포넌트들/모듈들에 의해 사용될 수 있는 더 하이-레벨의 공통 인프라스트럭처를 제공한다. 예를 들어, 프레임워크들/미들웨어(2218)는 다양한 그래픽 사용자 인터페이스(GUI) 기능들, 하이-레벨 리소스 관리, 하이-레벨 위치 서비스들 등을 제공할 수 있다. 프레임워크들/미들웨어(2218)는 애플리케이션들(2216) 및/또는 다른 소프트웨어 컴포넌트들/모듈들에 의해 사용될 수 있는 광범위한 스펙트럼의 다른 API들을 제공할 수 있으며, 그 중 일부는 특정 운영 체제(2202) 또는 플랫폼에 특정할 수 있다.Frameworks/middleware 2218 (sometimes referred to as middleware) provides a higher-level, common infrastructure that can be used by applications 2216 and/or other software components/modules. For example, frameworks/middleware 2218 may provide various graphical user interface (GUI) functions, high-level resource management, high-level location services, etc. Frameworks/middleware 2218 may provide a broad spectrum of other APIs that can be used by applications 2216 and/or other software components/modules, some of which may be specific to operating system 2202 Or it can be platform specific.

애플리케이션들(2216)은 빌트인 애플리케이션들(2238) 및/또는 제3자 애플리케이션들(2240)을 포함한다. 대표적인 빌트인 애플리케이션들(2238)의 예들은, 연락처 애플리케이션, 브라우저 애플리케이션, 북 리더 애플리케이션, 위치 애플리케이션, 미디어 애플리케이션, 메시징 애플리케이션, 및/또는 게임 애플리케이션을 포함할 수 있지만, 이들로 제한되지 않는다. 제3자 애플리케이션들(2240)은 특정 플랫폼의 벤더 이외의 엔티티에 의해 ANDROID^TM 또는 IOS^TM 소프트웨어 개발 키트(SDK)를 사용하여 개발된 애플리케이션을 포함할 수 있고, IOS^TM, ANDROID^TM, WINDOWS® Phone, 또는 다른 모바일 운영 체제들과 같은 모바일 운영 체제 상에서 실행되는 모바일 소프트웨어일 수 있다. 제3자 애플리케이션들(2240)은 본 명세서에 설명된 기능성을 용이하게 하기 위해 모바일 운영 체제(예를 들어 운영 체제(2202))에 의해 제공되는 API 호출들(2208)을 기동시킬 수 있다.Applications 2216 include built-in applications 2238 and/or third-party applications 2240. Examples of representative built-in applications 2238 may include, but are not limited to, contact applications, browser applications, book reader applications, location applications, media applications, messaging applications, and/or gaming applications. Third-party applications 2240 may include applications developed using the ANDROID ^™ or IOS ^™ software development kit (SDK) by an entity other than the vendor of a particular platform, such as IOS ^™ , ANDROID ^™ , WINDOWS® Phone. It may be mobile software that runs on a mobile operating system, such as , or other mobile operating systems. Third-party applications 2240 may invoke API calls 2208 provided by a mobile operating system (e.g., operating system 2202) to facilitate the functionality described herein.

애플리케이션들(2216)은 시스템의 사용자들과 상호작용하기 위한 사용자 인터페이스들을 생성하기 위해 빌트인 운영 체제 기능들(예를 들어, 커널(2222), 서비스들(2224) 및/또는 드라이버들(2226)), 라이브러리들(2220), 및 프레임워크들/미들웨어(2218)를 사용할 수 있다. 대안적으로 또는 추가적으로, 일부 시스템들에서, 사용자와의 상호작용은 프레젠테이션 계층(2214)과 같은 프레젠테이션 계층을 통해 발생할 수 있다. 이들 시스템에서, 애플리케이션/컴포넌트 "로직"은 사용자와 상호작용하는 애플리케이션/컴포넌트의 양태들로부터 분리될 수 있다.Applications 2216 may utilize built-in operating system functions (e.g., kernel 2222, services 2224, and/or drivers 2226) to create user interfaces for interacting with users of the system. , libraries 2220, and frameworks/middleware 2218 may be used. Alternatively or additionally, in some systems, interaction with the user may occur through a presentation layer, such as presentation layer 2214. In these systems, the application/component “logic” can be separated from aspects of the application/component that interact with the user.

도 23은 머신-판독가능 매체(예를 들어, 머신-판독가능 스토리지 매체)로부터의 명령어들을 판독하고 본 명세서에서 논의된 방법론들 중 임의의 하나 이상을 수행할 수 있는, 일부 예시적인 실시예들에 따른, 머신(2300)의 컴포넌트들을 예시하는 블록도이다. 구체적으로, 도 23은 컴퓨터 시스템의 예시적인 형태의 머신(2300)의 도식적 표현을 나타내는 것으로, 그 안에서 머신(2300)으로 하여금 본 명세서에서 논의된 방법론들 중 임의의 하나 이상을 수행하게 하기 위한 명령어들(2310)(예를 들어, 소프트웨어, 프로그램, 애플리케이션, 애플릿(applet), 앱, 또는 다른 실행가능 코드)이 실행될 수 있다. 그에 따라, 명령어들(2310)은 본 명세서에 설명된 모듈들 또는 컴포넌트들을 구현하기 위해 사용될 수 있다. 명령어들(2310)은, 일반적인 비-프로그래밍된 머신(2300)을, 설명되고 예시된 기능들을 설명된 방식으로 수행하도록 프로그래밍된 특정 머신(2300)으로 변환한다. 대안적인 실시예들에서, 머신(2300)은 독립형 디바이스로서 동작하거나 다른 머신들에 결합(예를 들어, 네트워킹)될 수 있다. 네트워킹된 배치에서, 머신(2300)은 서버-클라이언트 네트워크 환경에서 서버 머신 또는 클라이언트 머신의 자격으로 동작하거나, 피어-투-피어(또는 분산형) 네트워크 환경에서 피어 머신으로서 동작할 수 있다. 머신(2300)은, 서버 컴퓨터, 클라이언트 컴퓨터, 개인용 컴퓨터(PC), 태블릿 컴퓨터, 랩톱 컴퓨터, 넷북, 셋톱 박스(STB), 개인 휴대 정보 단말(PDA), 엔터테인먼트 미디어 시스템, 셀룰러 폰, 스마트폰, 모바일 디바이스, 웨어러블 디바이스(예를 들어, 스마트 시계), 스마트 홈 디바이스(예를 들어, 스마트 어플라이언스), 다른 스마트 디바이스들, 웹 어플라이언스, 네트워크 라우터, 네트워크 스위치, 네트워크 브리지, 또는 머신(2300)에 의해 취해질 액션들을 지정하는 명령어들(2310)을 순차적으로 또는 다른 방식으로 실행할 수 있는 임의의 머신을 포함할 수 있지만, 이들로 제한되지 않는다. 또한, 단일 머신(2300)만이 예시되어 있지만, "머신"이라는 용어는 또한 본 명세서에서 논의된 방법론들 중 어느 하나 이상을 수행하기 위해 명령어들(2310)을 개별적으로 또는 공동으로 실행하는 머신들의 컬렉션을 포함하는 것으로 간주되어야 한다.23 illustrates some example embodiments that may read instructions from a machine-readable medium (e.g., a machine-readable storage medium) and perform any one or more of the methodologies discussed herein. A block diagram illustrating components of machine 2300, according to . Specifically, Figure 23 shows a schematic representation of a machine 2300 in an example form of a computer system, within which instructions are provided to cause the machine 2300 to perform any one or more of the methodologies discussed herein. 2310 (e.g., software, program, application, applet, app, or other executable code) may be executed. Accordingly, instructions 2310 may be used to implement modules or components described herein. Instructions 2310 transform a general, non-programmed machine 2300 into a specific machine 2300 that is programmed to perform the described and illustrated functions in the described manner. In alternative embodiments, machine 2300 may operate as a standalone device or be coupled (e.g., networked) to other machines. In a networked deployment, machine 2300 may operate in the capacity of a server machine or a client machine in a server-client network environment, or as a peer machine in a peer-to-peer (or distributed) network environment. The machine 2300 is a server computer, a client computer, a personal computer (PC), a tablet computer, a laptop computer, a netbook, a set-top box (STB), a personal digital assistant (PDA), an entertainment media system, a cellular phone, a smartphone, A mobile device, a wearable device (e.g., a smart watch), a smart home device (e.g., a smart appliance), other smart devices, a web appliance, a network router, a network switch, a network bridge, or by machine 2300. It may include, but is not limited to, any machine capable of executing instructions 2310 sequentially or otherwise specifying the actions to be taken. Additionally, although only a single machine 2300 is illustrated, the term “machine” can also refer to a collection of machines that individually or jointly execute instructions 2310 to perform any one or more of the methodologies discussed herein. should be considered to include.

머신(2300)은, 예컨대 버스(2302)를 통해 서로 통신하도록 구성될 수 있는, 프로세서들(2304)(예를 들어, 프로세서(2308) 내지 프로세서(2312)), 메모리/스토리지(2306), 및 I/O 컴포넌트들(2318)을 포함할 수 있다. 메모리/스토리지(2306)는 메인 메모리, 또는 다른 메모리 스토리지와 같은 메모리(2314), 및 스토리지 유닛(2316)을 포함할 수 있고, 이 둘 다 예컨대 버스(2302)를 통해 프로세서들(2304)에 액세스할 수 있다. 스토리지 유닛(2316) 및 메모리(2314)는 본 명세서에 설명된 방법론들 또는 기능들 중 임의의 하나 이상을 구현하는 명령어들(2310)을 저장한다. 명령어들(2310)은 또한, 머신(2300)에 의한 그의 실행 동안, 완전히 또는 부분적으로, 메모리(2314) 내에, 스토리지 유닛(2316) 내에, 프로세서들(2304) 중 적어도 하나 내에(예를 들어, 프로세서의 캐시 메모리 내에), 또는 이들의 임의의 적합한 조합으로 존재할 수 있다. 따라서, 메모리(2314), 스토리지 유닛(2316), 및 프로세서들(2304)의 메모리는 머신-판독가능 매체의 예들이다.Machine 2300 includes processors 2304 (e.g., processors 2308 through processor 2312), memory/storage 2306, and May include I/O components 2318. Memory/storage 2306 may include memory 2314, such as main memory, or other memory storage, and a storage unit 2316, both of which are accessible to processors 2304, e.g., via bus 2302. can do. Storage unit 2316 and memory 2314 store instructions 2310 implementing any one or more of the methodologies or functions described herein. Instructions 2310 may also, during their execution by machine 2300, fully or partially, within memory 2314, within storage unit 2316, within at least one of processors 2304 (e.g., within the processor's cache memory), or any suitable combination thereof. Accordingly, memory 2314, storage unit 2316, and memory of processors 2304 are examples of machine-readable media.

I/O 컴포넌트들(2318)은 입력을 수신하고, 출력을 제공하고, 출력을 생성하고, 정보를 송신하고, 정보를 교환하고, 측정들을 캡처하는 등을 위한 매우 다양한 컴포넌트들을 포함할 수 있다. 특정 머신(2300)에 포함되는 특정 I/O 컴포넌트들(2318)은 머신의 타입에 의존할 것이다. 예를 들어, 모바일 폰과 같은 휴대용 머신은 아마 터치 입력 디바이스 또는 다른 그러한 입력 메커니즘을 포함할 것인 반면, 헤드리스 서버 머신(headless server machine)은 아마 그러한 터치 입력 디바이스를 포함하지 않을 것이다. I/O 컴포넌트들(2318)은 도 23에 도시되지 않은 많은 다른 컴포넌트들을 포함할 수 있다는 것을 인식할 것이다. I/O 컴포넌트들(2318)은 단지 이하의 논의를 간소화하기 위해 기능성에 따라 그룹화되어 있고 이러한 그룹화는 결코 제한적인 것이 아니다. 다양한 예시적인 실시예들에서, I/O 컴포넌트들(2318)은 출력 컴포넌트들(2326) 및 입력 컴포넌트들(2328)을 포함할 수 있다. 출력 컴포넌트들(2326)은, 시각 컴포넌트들(예를 들어, 플라즈마 디스플레이 패널(PDP), 발광 다이오드(LED) 디스플레이, 액정 디스플레이(LCD), 프로젝터, 또는 음극선관(CRT)과 같은 디스플레이), 음향 컴포넌트들(예를 들어, 스피커), 햅틱 컴포넌트들(예를 들어, 진동 모터, 저항 메커니즘), 다른 신호 생성기 등을 포함할 수 있다. 입력 컴포넌트들(2328)은, 영숫자 입력 컴포넌트들(예를 들어, 키보드, 영숫자 입력을 수신하도록 구성된 터치 스크린, 포토-광학 키보드, 또는 다른 영숫자 입력 컴포넌트), 포인트 기반 입력 컴포넌트들(예를 들어, 마우스, 터치패드, 트랙볼, 조이스틱, 모션 센서, 또는 다른 포인팅 기구), 촉각 입력 컴포넌트들(예를 들어, 물리적 버튼, 터치 또는 터치 제스처의 위치 및/또는 힘을 제공하는 터치 스크린, 또는 다른 촉각 입력 컴포넌트), 오디오 입력 컴포넌트들(예를 들어, 마이크로폰) 등을 포함할 수 있다.I/O components 2318 may include a wide variety of components for receiving input, providing output, generating output, transmitting information, exchanging information, capturing measurements, etc. The specific I/O components 2318 included in a particular machine 2300 will depend on the type of machine. For example, a portable machine, such as a mobile phone, will likely include a touch input device or other such input mechanism, whereas a headless server machine will probably not include such a touch input device. It will be appreciated that I/O components 2318 may include many other components not shown in FIG. 23 . I/O components 2318 are grouped according to functionality merely to simplify the discussion below and this grouping is in no way limiting. In various example embodiments, I/O components 2318 may include output components 2326 and input components 2328. Output components 2326 may include visual components (e.g., a display such as a plasma display panel (PDP), light emitting diode (LED) display, liquid crystal display (LCD), projector, or cathode ray tube (CRT)), acoustics, etc. Components (e.g., speakers), haptic components (e.g., vibration motors, resistance mechanisms), other signal generators, etc. Input components 2328 may include alphanumeric input components (e.g., a keyboard, a touch screen configured to receive alphanumeric input, a photo-optical keyboard, or other alphanumeric input component), point-based input components (e.g., a mouse, touchpad, trackball, joystick, motion sensor, or other pointing device), tactile input components (e.g., a physical button, a touch screen that provides the position and/or force of a touch or touch gesture, or other tactile input) component), audio input components (e.g., microphone), etc.

추가의 예시적인 실시예들에서, I/O 컴포넌트들(2318)은, 광범위한 다른 컴포넌트들 중에서도, 바이오메트릭 컴포넌트들(2330), 모션 컴포넌트들(2334), 환경 환경 컴포넌트들(2336), 또는 포지션 컴포넌트들(2338)을 포함할 수 있다. 예를 들어, 바이오메트릭 컴포넌트들(2330)은, 표현들(예를 들어, 손 표현, 얼굴 표정, 음성 표현, 신체 제스처, 또는 시선 추적)을 검출하고, 생체신호들(예를 들어, 혈압, 심박수, 체온, 땀 또는 뇌파)을 측정하고, 사람을 식별(예를 들어, 음성 식별, 망막 식별, 얼굴 식별, 지문 식별, 또는 뇌파계 기반 식별)하는 등의 컴포넌트들을 포함할 수 있다. 모션 컴포넌트들(2334)은, 가속 센서 컴포넌트들(예를 들어, 가속도계), 중력 센서 컴포넌트들, 회전 센서 컴포넌트들(예를 들어, 자이로스코프) 등을 포함할 수 있다. 환경 컴포넌트들(2336)은, 예를 들어, 조명 센서 컴포넌트들(예를 들어, 광도계), 온도 센서 컴포넌트들(예를 들어, 주위 온도를 검출하는 하나 이상의 온도계), 습도 센서 컴포넌트들, 압력 센서 컴포넌트들(예를 들어, 기압계), 음향 센서 컴포넌트들(예를 들어, 배경 노이즈를 검출하는 하나 이상의 마이크로폰), 근접 센서 컴포넌트들(예를 들어, 인근 객체들을 검출하는 적외선 센서들), 가스 센서들(예를 들어, 안전을 위해 유해성 가스들의 농도들을 검출하거나 대기 내의 오염물질들을 측정하기 위한 가스 검출 센서들), 또는 주변 물리적 환경에 대응하는 지시들, 측정들, 또는 신호들을 제공할 수 있는 다른 컴포넌트들을 포함할 수 있다. 포지션 컴포넌트들(2338)은, 위치 센서 컴포넌트들(예를 들어, GPS 수신기 컴포넌트), 고도 센서 컴포넌트들(예를 들어, 고도계 또는 고도가 도출될 수 있는 기압을 검출하는 기압계), 방위 센서 컴포넌트들(예를 들어, 자력계) 등을 포함할 수 있다.In further example embodiments, I/O components 2318 may include biometric components 2330, motion components 2334, environmental environmental components 2336, or positional components, among a wide range of other components. May include components 2338. For example, biometric components 2330 may detect expressions (e.g., hand expressions, facial expressions, vocal expressions, body gestures, or eye tracking) and biometric signals (e.g., blood pressure, It may include components such as measuring heart rate, body temperature, sweat, or brain waves) and identifying a person (e.g., voice identification, retinal identification, facial identification, fingerprint identification, or electroencephalography-based identification). Motion components 2334 may include acceleration sensor components (e.g., accelerometer), gravity sensor components, rotation sensor components (e.g., gyroscope), etc. Environmental components 2336 include, for example, light sensor components (e.g., a photometer), temperature sensor components (e.g., one or more thermometers that detect ambient temperature), humidity sensor components, and a pressure sensor. Components (e.g., a barometer), acoustic sensor components (e.g., one or more microphones to detect background noise), proximity sensor components (e.g., infrared sensors to detect nearby objects), gas sensor (e.g., gas detection sensors to detect concentrations of hazardous gases for safety or measure pollutants in the atmosphere), or provide indications, measurements, or signals corresponding to the surrounding physical environment. May contain other components. Position components 2338 include position sensor components (e.g., a GPS receiver component), altitude sensor components (e.g., an altimeter or a barometer that detects barometric pressure from which altitude can be derived), and orientation sensor components. (e.g., magnetometer), etc. may be included.

통신은 매우 다양한 기술들을 사용하여 구현될 수 있다. I/O 컴포넌트들(2318)은 머신(2300)을 결합(2324) 및 결합(2322)을 통해 각각 네트워크(2332) 또는 디바이스들(2320)에 결합하도록 동작가능한 통신 컴포넌트들(2340)을 포함할 수 있다. 예를 들어, 통신 컴포넌트(2340)는, 네트워크 인터페이스 컴포넌트, 또는 네트워크(2332)와 인터페이스하기 위한 다른 적합한 디바이스를 포함할 수 있다. 추가 예들에서, 통신 컴포넌트들(2340)은 유선 통신 컴포넌트들, 무선 통신 컴포넌트들, 셀룰러 통신 컴포넌트들, 근거리 무선 통신(NFC) 컴포넌트들, Bluetooth® 컴포넌트들(예를 들어, Bluetooth® Low Energy), Wi-Fi® 컴포넌트들, 및 다른 양상들을 통해 통신을 제공하는 다른 통신 컴포넌트들을 포함할 수 있다. 디바이스들(2320)은, 다른 머신 또는 임의의 매우 다양한 주변 디바이스들(예를 들어, USB를 통해 결합된 주변 디바이스)일 수 있다.Communication can be implemented using a wide variety of technologies. I/O components 2318 may include communication components 2340 operable to couple machine 2300 to network 2332 or devices 2320 via coupling 2324 and coupling 2322, respectively. You can. For example, communication component 2340 may include a network interface component, or other suitable device for interfacing with network 2332. In further examples, communication components 2340 include wired communication components, wireless communication components, cellular communication components, near field communication (NFC) components, Bluetooth® components (e.g., Bluetooth® Low Energy), May include Wi-Fi® components, and other communication components that provide communication via other modalities. Devices 2320 may be other machines or any of a wide variety of peripheral devices (eg, peripheral devices coupled via USB).

더욱이, 통신 컴포넌트들(2340)은 식별자들을 검출할 수 있거나 식별자들을 검출하도록 동작가능한 컴포넌트들을 포함할 수 있다. 예를 들어, 통신 컴포넌트들(2340)은 RFID(Radio Frequency Identification) 태그 판독기 컴포넌트들, NFC 스마트 태그 검출 컴포넌트들, 광학 판독기 컴포넌트들(예를 들어, UPC(Universal Product Code) 바 코드와 같은 1-차원 바 코드들, QR(Quick Response) 코드와 같은 다-차원 바 코드들, Aztec 코드, Data Matrix, Dataglyph, MaxiCode, PDF417, Ultra Code, UCC RSS-2D 바 코드, 및 다른 광학 코드들을 검출하기 위한 광학 센서), 또는 음향 검출 컴포넌트들(예를 들어, 태깅된 오디오 신호들을 식별하기 위한 마이크로폰들)을 포함할 수 있다. 게다가, 인터넷 프로토콜(IP) 지오-로케이션을 통한 위치, Wi-Fi® 신호 삼각측량을 통한 위치, 특정 위치를 지시할 수 있는 NFC 비컨 신호 검출을 통한 위치 등과 같은, 다양한 정보가 통신 컴포넌트들(2340)을 통해 도출될 수 있다.Moreover, communication components 2340 can detect identifiers or include components operable to detect identifiers. For example, communication components 2340 may include Radio Frequency Identification (RFID) tag reader components, NFC smart tag detection components, optical reader components (e.g., 1-1, such as a Universal Product Code (UPC) bar code) For detecting dimensional bar codes, multi-dimensional bar codes such as Quick Response (QR) codes, Aztec Code, Data Matrix, Dataglyph, MaxiCode, PDF417, Ultra Code, UCC RSS-2D bar code, and other optical codes. optical sensors), or acoustic detection components (e.g., microphones to identify tagged audio signals). Additionally, a variety of information, such as location via Internet Protocol (IP) geo-location, location via Wi-Fi® signal triangulation, location via NFC beacon signal detection that can indicate a specific location, etc., can be transmitted to the communicating components (2340). ) can be derived through.

하기 논의는 대상 개시내용 전반에 걸쳐 언급된 다양한 용어 또는 어구에 관한 것이다. The following discussion concerns various terms or phrases mentioned throughout the subject disclosure.

"신호 매체"는 머신에 의한 실행을 위한 명령어들을 저장, 인코딩 또는 운반할 수 있는 임의의 무형 매체를 지칭하며, 소프트웨어 또는 데이터의 통신을 용이하게 하기 위해 디지털 또는 아날로그 통신 신호들 또는 다른 무형 매체를 포함한다. 용어 "신호 매체"는 임의의 형태의 변조된 데이터 신호, 반송파 등을 포함하는 것으로 간주되어야 한다. "변조된 데이터 신호"라는 용어는 신호 내의 정보를 인코딩하기 위해 그런 상황에서 설정 또는 변경된 자신의 특성 중 하나 이상을 갖는 신호를 의미한다. "송신 매체", 및 "신호 매체"라는 용어들은 동일한 것을 의미하고 본 개시내용에서 상호교환가능하게 사용될 수 있다.“Signal media” refers to any intangible medium capable of storing, encoding or carrying instructions for execution by a machine, including digital or analog communication signals or other intangible media to facilitate communication of software or data. Includes. The term “signal medium” should be considered to include any form of modulated data signal, carrier wave, etc. The term “modulated data signal” refers to a signal that has one or more of its characteristics set or changed in such circumstances to encode information within the signal. The terms “transmission medium” and “signal medium” mean the same thing and may be used interchangeably in this disclosure.

"통신 네트워크(Communication Network)"는 애드 혹 네트워크, 인트라넷, 엑스트라넷, VPN(virtual private network), LAN(local area network), 무선 LAN(WLAN), WAN(wide area network), 무선 WAN(WWAN), MAN(metropolitan area network), 인터넷, 인터넷의 일부, PSTN(Public Switched Telephone Network)의 일부, POTS(plain old telephone service) 네트워크, 셀룰러 전화 네트워크, 무선 네트워크, Wi-Fi® 네트워크, 다른 유형의 네트워크, 또는 둘 이상의 그러한 네트워크의 조합일 수 있는, 네트워크의 하나 이상의 부분을 지칭한다. 예를 들어, 네트워크 또는 네트워크의 부분은 무선 또는 셀룰러 네트워크를 포함할 수 있고 결합은 CDMA(Code Division Multiple Access) 연결, GSM(Global System for Mobile communications) 연결, 또는 다른 타입의 셀룰러 또는 무선 결합일 수 있다. 이 예에서, 결합은 1xRTT(Single Carrier Radio Transmission Technology), EVDO(Evolution-Data Optimized) 기술, GPRS(General Packet Radio Service) 기술, EDGE(Enhanced Data rates for GSM Evolution) 기술, 3G를 포함하는 3GPP(third Generation Partnership Project), 4G(fourth generation wireless) 네트워크들, UMTS(Universal Mobile Telecommunications System), HSPA(High Speed Packet Access), WiMAX(Worldwide Interoperability for Microwave Access), LTE(Long Term Evolution) 표준, 다양한 표준 설정 조직들에 의해 정의된 다른 것들, 다른 장거리 프로토콜들, 또는 다른 데이터 송신 기술과 같은 임의의 다양한 유형의 데이터 송신 기술을 구현할 수 있다.“Communication Network” refers to an ad hoc network, intranet, extranet, virtual private network (VPN), local area network (LAN), wireless LAN (WLAN), wide area network (WAN), and wireless WAN (WWAN). , metropolitan area network (MAN), Internet, part of the Internet, part of the Public Switched Telephone Network (PSTN), plain old telephone service (POTS) network, cellular telephone network, wireless network, Wi-Fi® network, other types of networks , or a combination of two or more such networks. For example, a network or portion of a network may include a wireless or cellular network and the combination may be a Code Division Multiple Access (CDMA) connection, a Global System for Mobile communications (GSM) connection, or another type of cellular or wireless combination. there is. In this example, the combination is 3GPP (which includes Single Carrier Radio Transmission Technology (1xRTT), Evolution-Data Optimized (EVDO) technology, General Packet Radio Service (GPRS) technology, Enhanced Data rates for GSM Evolution (EDGE) technology, and 3G. third Generation Partnership Project), fourth generation wireless (4G) networks, Universal Mobile Telecommunications System (UMTS), High Speed Packet Access (HSPA), Worldwide Interoperability for Microwave Access (WiMAX), Long Term Evolution (LTE) standards, various standards It may implement any of a variety of types of data transmission technologies, such as other long-distance protocols, or other data transmission technologies, as defined by configuration organizations.

"프로세서(Processor)"는 제어 신호들(예를 들어, "명령들", "op 코드들", "머신 코드" 등)에 따라 데이터 값들을 조작하고 머신을 동작시키기 위해 인가되는 대응하는 출력 신호들을 생성하는 임의의 회로 또는 가상 회로(실제 프로세서 상에서 실행되는 로직에 의해 에뮬레이트되는 물리 회로)를 지칭한다. 프로세서는, 예를 들어, CPU(Central Processing Unit), RISC(Reduced Instruction Set Computing) 프로세서, CISC(Complex Instruction Set Computing) 프로세서, GPU(Graphics Processing Unit), DSP(Digital Signal Processor), ASIC(Application Specific Integrated Circuit), RFIC(Radio-Frequency Integrated Circuit), 또는 이들의 임의의 조합일 수 있다. 프로세서는 또한, 명령어들을 동시에 실행할 수 있는 둘 이상의 독립 프로세서(때때로 "코어"라고도 지칭됨)를 갖는 멀티-코어 프로세서일 수 있다.“Processor” refers to control signals (e.g., “instructions,” “op codes,” “machine code,” etc.) that manipulate data values and corresponding output signals applied to operate the machine. Refers to any circuit or virtual circuit (a physical circuit emulated by logic running on an actual processor) that generates Processors include, for example, a Central Processing Unit (CPU), Reduced Instruction Set Computing (RISC) processor, Complex Instruction Set Computing (CISC) processor, Graphics Processing Unit (GPU), Digital Signal Processor (DSP), and Application Specification (ASIC). It may be an integrated circuit (RFIC), a radio-frequency integrated circuit (RFIC), or any combination thereof. A processor may also be a multi-core processor, having two or more independent processors (sometimes referred to as “cores”) that can execute instructions simultaneously.

"머신-스토리지 매체"는 실행가능 명령어들, 루틴들 및/또는 데이터를 저장하는 단일의 또는 다수의 스토리지 디바이스들 및/또는 매체들(예를 들어, 중앙집중형 또는 분산형 데이터베이스, 및/또는 연관된 캐시들 및 서버들)을 지칭한다. 따라서, 용어는 프로세서들 내부 또는 외부의 메모리를 포함하는 고체-상태 메모리들, 및 광학 및 자기 매체를 포함하지만 이에 제한되지 않는 것으로 이해될 수 있다. 머신-저장 매체, 컴퓨터-저장 매체 및/또는 디바이스-저장 매체의 특정 예들은 예로서 반도체 메모리 디바이스들, 예를 들어, EPROM(erasable programmable read-only memory), EEPROM(electrically erasable programmable read-only memory), FPGA, 및 플래시 메모리 디바이스들을 포함하는 비휘발성 메모리; 내부 하드 디스크 및 이동식 디스크와 같은 자기 디스크; 광자기 디스크; 및 CD-ROM 및 DVD-ROM 디스크를 포함한다. "머신-저장 매체", "디바이스-저장 매체", "컴퓨터-저장 매체"라는 용어들은 동일한 것을 의미하며, 본 개시내용에서 상호교환가능하게 사용될 수 있다. "머신-저장 매체", "컴퓨터-저장 매체", 및 "디바이스-저장 매체"라는 용어들은 구체적으로 반송파들, 변조된 데이터 신호들, 및 다른 그런 매체를 제외하고, 이들 중 적어도 일부는 "신호 매체"라는 용어 하에 포함된다.“Machine-storage medium” means single or multiple storage devices and/or media (e.g., a centralized or distributed database, and/or associated caches and servers). Accordingly, the term may be understood to include, but is not limited to, solid-state memories, including memory internal or external to processors, and optical and magnetic media. Specific examples of machine-storage media, computer-storage media, and/or device-storage media include, for example, semiconductor memory devices, such as erasable programmable read-only memory (EPROM), electrically erasable programmable read-only memory (EEPROM), ), FPGA, and non-volatile memory, including flash memory devices; Magnetic disks such as internal hard disks and removable disks; magneto-optical disk; and CD-ROM and DVD-ROM disks. The terms “machine-storage media”, “device-storage media”, and “computer-storage media” mean the same thing and may be used interchangeably in this disclosure. The terms “machine-storage media,” “computer-storage media,” and “device-storage media” specifically refer to “signals,” at least some of which exclude carrier waves, modulated data signals, and other such media. Included under the term “medium”.

"컴포넌트(Component)"는 기능 또는 서브루틴 호출, 분기 포인트, API, 또는 특정한 처리 또는 제어 기능들의 분할 또는 모듈화를 제공하는 다른 기술들에 의해 정의된 경계들을 갖는 디바이스, 물리적 엔티티, 또는 로직을 지칭한다. 컴포넌트들은 그들의 인터페이스를 통해 다른 컴포넌트들과 결합되어 머신 프로세스를 실행할 수 있다. 컴포넌트는, 보통 관련된 기능들 중 특정 기능을 수행하는 프로그램의 일부 및 다른 컴포넌트들과 함께 사용되도록 설계된 패키징된 기능 하드웨어 유닛일 수 있다. 컴포넌트들은 소프트웨어 컴포넌트들(예를 들어, 머신-판독가능 매체 상에 구현된 코드) 또는 하드웨어 컴포넌트들 중 어느 하나를 구성할 수 있다. "하드웨어 컴포넌트"는 특정 동작들을 수행할 수 있는 유형 유닛(tangible unit)이고, 특정 물리적 방식으로 구성되거나 배열될 수 있다. 다양한 예시적인 실시예들에서, 하나 이상의 컴퓨터 시스템(예를 들어, 독립형 컴퓨터 시스템, 클라이언트 컴퓨터 시스템, 또는 서버 컴퓨터 시스템) 또는 컴퓨터 시스템의 하나 이상의 하드웨어 컴포넌트(예를 들어, 프로세서 또는 프로세서들의 그룹)는 본 명세서에 설명된 바와 같이 특정 동작들을 수행하도록 동작하는 하드웨어 컴포넌트로서 소프트웨어(예를 들어, 애플리케이션 또는 애플리케이션 부분)에 의해 구성될 수 있다. 하드웨어 컴포넌트는 또한, 기계적으로, 전자적으로, 또는 이들의 임의의 적합한 조합으로 구현될 수 있다. 예를 들어, 하드웨어 컴포넌트는 특정 동작들을 수행하도록 영구적으로 구성된 전용 회로 또는 로직을 포함할 수 있다. 하드웨어 컴포넌트는, FPGA(field-programmable gate array) 또는 ASIC(application specific integrated circuit)과 같은 특수 목적 프로세서일 수 있다. 하드웨어 컴포넌트는 특정 동작들을 수행하기 위해 소프트웨어에 의해 일시적으로 구성되는 프로그래밍 가능한 로직 또는 회로를 또한 포함할 수 있다. 예를 들어, 하드웨어 컴포넌트는 범용 프로세서 또는 다른 프로그래밍 가능한 프로세서에 의해 실행되는 소프트웨어를 포함할 수 있다. 일단 그러한 소프트웨어에 의해 구성되면, 하드웨어 컴포넌트들은 구성된 기능들을 수행하도록 고유하게 맞춤화된 특정 머신들(또는 머신의 특정 컴포넌트들)이 되고 더 이상 범용 프로세서들이 아니다. 하드웨어 컴포넌트를 기계적으로, 전용의 영구적으로 구성된 회로에, 또는 일시적으로 구성된 회로(예를 들어, 소프트웨어에 의해 구성됨)에 구현하기로 하는 결정은 비용 및 시간 고려사항들에 의해 주도될 수 있다는 것을 인식할 것이다. 따라서, "하드웨어 컴포넌트"(또는 "하드웨어-구현된 컴포넌트")라는 구문은, 유형 엔티티, 즉, 특정 방식으로 동작하거나 본 명세서에 설명된 특정 동작들을 수행하도록 물리적으로 구성되거나, 영구적으로 구성되거나(예를 들어, 하드와이어드) 또는 일시적으로 구성되는(예를 들어, 프로그래밍되는) 엔티티를 포괄하는 것으로 이해해야 한다. 하드웨어 컴포넌트들이 일시적으로 구성되는(예를 들어, 프로그래밍되는) 실시예들을 고려할 때, 하드웨어 컴포넌트들 각각이 임의의 하나의 시간 인스턴스에서 구성 또는 인스턴스화될 필요는 없다. 예를 들어, 하드웨어 컴포넌트가 특수 목적 프로세서가 되도록 소프트웨어에 의해 구성된 범용 프로세서를 포함하는 경우에, 범용 프로세서는 상이한 시간들에서 각각 상이한 특수 목적 프로세서들(예를 들어, 상이한 하드웨어 컴포넌트들을 포함함)로서 구성될 수 있다. 따라서 소프트웨어는 예를 들어, 하나의 시간 인스턴스에서는 특정 하드웨어 컴포넌트를 구성하고 상이한 시간 인스턴스에서는 상이한 하드웨어 컴포넌트를 구성하도록 특정 프로세서 또는 프로세서들을 구성한다. 하드웨어 컴포넌트들은 다른 하드웨어 컴포넌트들에 정보를 제공하고 그들로부터 정보를 수신할 수 있다. 따라서, 설명된 하드웨어 컴포넌트들은 통신가능하게 결합되어 있는 것으로 간주될 수 있다. 다수의 하드웨어 컴포넌트가 동시에 존재하는 경우에, 하드웨어 컴포넌트들 중 둘 이상 사이의 또는 그들 사이의(예를 들어, 적절한 회로들 및 버스들을 통한) 신호 송신을 통해 통신이 달성될 수 있다. 다수의 하드웨어 컴포넌트가 상이한 시간들에서 구성되거나 인스턴스화되는 실시예들에서, 그러한 하드웨어 컴포넌트들 사이의 통신은, 예를 들어, 다수의 하드웨어 컴포넌트가 액세스할 수 있는 메모리 구조들 내의 정보의 스토리지 및 검색을 통해 달성될 수 있다. 예를 들어, 하나의 하드웨어 컴포넌트는 동작을 수행하고, 그에 통신가능하게 결합되는 메모리 디바이스에 그 동작의 출력을 저장할 수 있다. 그 후 추가의 하드웨어 컴포넌트가, 나중에, 저장된 출력을 검색 및 처리하기 위해 메모리 디바이스에 액세스할 수 있다. 하드웨어 컴포넌트들은 또한 입력 또는 출력 디바이스들과 통신을 개시할 수 있고, 리소스(예를 들어, 정보의 컬렉션)를 조작할 수 있다. 본 명세서에 설명된 예시적인 방법들의 다양한 동작은 관련 동작들을 수행하도록 일시적으로 구성되거나(예를 들어, 소프트웨어에 의해) 영구적으로 구성되는 하나 이상의 프로세서에 의해 적어도 부분적으로 수행될 수 있다. 일시적으로 구성되는 영구적으로 구성되든 간에, 그러한 프로세서들은 본 명세서에 설명된 하나 이상의 동작 또는 기능을 수행하도록 동작하는 프로세서-구현된 컴포넌트들을 구성할 수 있다. 본 명세서에서 사용된, "프로세서-구현된 컴포넌트"는 하나 이상의 프로세서를 사용하여 구현된 하드웨어 컴포넌트를 지칭한다. 유사하게, 본 명세서에 설명된 방법들은 적어도 부분적으로 프로세서-구현될 수 있고, 특정 프로세서 또는 프로세서들은 하드웨어의 예이다. 예를 들어, 방법의 동작들 중 적어도 일부가 하나 이상의 프로세서 또는 프로세서-구현된 컴포넌트에 의해 수행될 수 있다. 더욱이, 하나 이상의 프로세서는 또한 "클라우드 컴퓨팅" 환경에서 또는 "서비스로서의 소프트웨어(software as a service)"(SaaS)로서 관련 동작들의 수행을 지원하도록 동작할 수 있다. 예를 들어, 동작들 중 적어도 일부는 (프로세서들을 포함하는 머신들의 예들로서) 컴퓨터들의 그룹에 의해 수행될 수 있고, 이들 동작들은 네트워크(예를 들어, 인터넷)를 통해 그리고 하나 이상의 적절한 인터페이스(예를 들어, API)을 통해 액세스 가능하다. 동작들 중 특정 동작의 수행은 단일 머신 내에 존재할 뿐만 아니라, 다수의 머신에 걸쳐 배치되는, 프로세서들 사이에 분산될 수 있다. 일부 예시적인 실시예들에서, 프로세서들 또는 프로세서에 의해 구현되는 컴포넌트들은 단일의 지리적 위치에(예를 들어, 가정 환경, 사무실 환경, 또는 서버 팜(server farm) 내에) 위치할 수 있다. 다른 예시적인 실시예들에서, 프로세서들 또는 프로세서에 의해 구현되는 컴포넌트들은 다수의 지리적 위치에 걸쳐 분산될 수 있다.“Component” refers to a device, physical entity, or logic with boundaries defined by function or subroutine calls, branch points, APIs, or other technologies that provide partitioning or modularization of specific processing or control functions. do. Components can be combined with other components through their interfaces to execute machine processes. A component may be a packaged functional hardware unit designed to be used with other components and usually a portion of a program that performs a specific among related functions. Components may constitute either software components (eg, code implemented on a machine-readable medium) or hardware components. A “hardware component” is a tangible unit that can perform certain operations and be configured or arranged in a certain physical way. In various example embodiments, one or more computer systems (e.g., a stand-alone computer system, a client computer system, or a server computer system) or one or more hardware components of a computer system (e.g., a processor or group of processors) A hardware component that operates to perform specific operations as described herein may be configured by software (e.g., an application or application portion). Hardware components may also be implemented mechanically, electronically, or in any suitable combination thereof. For example, a hardware component may include dedicated circuitry or logic that is permanently configured to perform specific operations. The hardware component may be a special-purpose processor, such as a field-programmable gate array (FPGA) or an application specific integrated circuit (ASIC). A hardware component may also include programmable logic or circuitry that is temporarily configured by software to perform specific operations. For example, a hardware component may include software executed by a general-purpose processor or other programmable processor. Once configured by such software, the hardware components become specific machines (or specific components of a machine) uniquely tailored to perform the configured functions and are no longer general-purpose processors. Recognizing that the decision to implement a hardware component mechanically, in a dedicated, permanently configured circuit, or in a temporarily configured circuit (e.g., configured by software) may be driven by cost and time considerations. something to do. Accordingly, the phrase "hardware component" (or "hardware-implemented component") refers to a tangible entity, i.e., physically constructed, permanently configured (or configured) to operate in a particular way or perform the particular operations described herein. It should be understood to encompass entities that are either hardwired (e.g., hardwired) or temporarily constructed (e.g., programmed). Given embodiments in which hardware components are temporarily configured (eg, programmed), there is no need for each of the hardware components to be configured or instantiated at any one instance of time. For example, if a hardware component includes a general-purpose processor configured by software to be a special-purpose processor, the general-purpose processor may be configured at different times by each of the different special-purpose processors (e.g., comprising different hardware components). It can be configured. The software thus configures a particular processor or processors, for example, to configure a particular hardware component at one instance of time and a different hardware component at a different instance of time. Hardware components can provide information to and receive information from other hardware components. Accordingly, the described hardware components may be considered to be communicatively coupled. In cases where multiple hardware components exist simultaneously, communication may be accomplished through signal transmission between or between two or more of the hardware components (e.g., via appropriate circuits and buses). In embodiments where multiple hardware components are configured or instantiated at different times, communication between such hardware components may include, for example, storage and retrieval of information within memory structures that are accessible to the multiple hardware components. It can be achieved through For example, a hardware component may perform an operation and store the output of that operation in a memory device that is communicatively coupled thereto. Additional hardware components can then access the memory device to retrieve and process the stored output. Hardware components can also initiate communication with input or output devices and manipulate resources (eg, collections of information). The various operations of the example methods described herein may be performed, at least in part, by one or more processors that are temporarily configured (e.g., by software) or permanently configured to perform the relevant operations. Whether temporarily or permanently configured, such processors may constitute processor-implemented components operative to perform one or more operations or functions described herein. As used herein, “processor-implemented component” refers to a hardware component implemented using one or more processors. Similarly, the methods described herein may be at least partially processor-implemented, with a particular processor or processors being an example of hardware. For example, at least some of the operations of the method may be performed by one or more processors or processor-implemented components. Moreover, the one or more processors may also operate to support performance of related operations in a “cloud computing” environment or as “software as a service” (SaaS). For example, at least some of the operations may be performed by a group of computers (as examples of machines comprising processors), which may be performed over a network (e.g. the Internet) and through one or more suitable interfaces (e.g. For example, it can be accessed via API). Performance of certain of the operations may reside within a single machine, as well as be distributed among processors deployed across multiple machines. In some example embodiments, processors or components implemented by a processor may be located in a single geographic location (eg, within a home environment, an office environment, or a server farm). In other example embodiments, processors or components implemented by a processor may be distributed across multiple geographic locations.

"캐리어 신호"는 머신에 의한 실행을 위한 명령어들을 저장, 인코딩, 또는 운반할 수 있는 임의의 무형 매체를 지칭하고, 그러한 명령어들의 통신을 용이하게 하기 위한 디지털 또는 아날로그 통신 신호들 또는 다른 무형 매체를 포함한다. 명령어들은 네트워크 인터페이스 디바이스를 통해 송신 매체를 사용하여 네트워크를 통해 송신 또는 수신될 수 있다.“Carrier signal” refers to any intangible medium capable of storing, encoding, or carrying instructions for execution by a machine, including digital or analog communication signals or other intangible media to facilitate communication of such instructions. Includes. Instructions may be transmitted or received over a network using a transmission medium via a network interface device.

"컴퓨터 판독가능 매체"는 머신-저장 매체 및 송신 매체 모두를 지칭한다. 따라서, 용어들은 저장 디바이스들/매체 및 반송파들/변조된 데이터 신호들 양쪽 모두를 포함한다. "머신-판독가능 매체", "컴퓨터 판독가능 매체", 및 "디바이스-판독가능 매체"라는 용어들은 동일한 것을 의미하고 본 개시내용에서 상호교환가능하게 사용될 수 있다.“Computer-readable media” refers to both machine-storage media and transmission media. Accordingly, the terms include both storage devices/media and carrier waves/modulated data signals. The terms “machine-readable medium,” “computer-readable medium,” and “device-readable medium” mean the same thing and may be used interchangeably in this disclosure.

"클라이언트 디바이스(Client Device)"는 하나 이상의 서버 시스템 또는 다른 클라이언트 디바이스들로부터 리소스들을 획득하기 위해 통신 네트워크에 인터페이스하는 임의의 머신을 지칭한다. 클라이언트 디바이스는, 모바일 폰, 데스크톱 컴퓨터, 랩톱, 휴대 정보 단말(PDA), 스마트폰, 태블릿, 울트라북, 넷북, 랩톱, 멀티-프로세서 시스템, 마이크로프로세서-기반 또는 프로그래밍 가능한 가전 제품, 게임 콘솔, 셋톱 박스, 또는 사용자가 네트워크에 액세스하기 위해 사용할 수 있는 임의의 다른 통신 디바이스일 수 있지만, 이것으로 제한되지 않는다. 대상 개시내용에서, 클라이언트 디바이스는 "전자 디바이스"로도 지칭된다.“Client Device” refers to any machine that interfaces to a communications network to obtain resources from one or more server systems or other client devices. Client devices include mobile phones, desktop computers, laptops, personal digital assistants (PDAs), smartphones, tablets, ultrabooks, netbooks, laptops, multi-processor systems, microprocessor-based or programmable consumer electronics, gaming consoles, and set-tops. It may be, but is not limited to, a box, or any other communication device that a user may use to access the network. In the subject disclosure, a client device is also referred to as an “electronic device.”

"단기적 메시지"는 시간-제한된 지속기간 동안 액세스가능한 메시지를 지칭한다. 단기적 메시지는 텍스트, 이미지, 비디오 등일 수 있다. 단기적 메시지에 대한 액세스 시간은 메시지 발신자에 의해 설정될 수 있다. 대안적으로, 액세스 시간은 디폴트 설정 또는 수신자에 의해 지정된 설정일 수 있다. 설정 기법에 관계없이, 메시지는 일시적이다.“Short-term message” refers to a message that is accessible for a time-limited duration. Short-term messages can be text, images, videos, etc. Access times for short-lived messages may be set by the message sender. Alternatively, the access time may be a default setting or a setting specified by the recipient. Regardless of the configuration technique, messages are temporary.

Claims

As a method,
Using a processor, receiving image data and depth data;
selecting, using the processor, an augmented reality content generator corresponding to a three-dimensional (3D) effect;
Using the processor, applying the 3D effect to the image data and the depth data based at least in part on the selected augmented reality content generator, applying the 3D effect to the image data and the depth data comprising: :
generating a depth map using at least the received depth data;
generating a packed depth map based at least in part on the depth map;
generating a segmentation mask based at least on the received image data, and
performing background inpainting and blurring of the received image data using at least the segmentation mask to generate background inpainted image data; and
Using a processor, generating a message containing information about the applied 3D effect, the image data, and the depth data,
The steps for generating the packed depth map are:
Converting a single-channel floating point texture to a raw depth map; and
generating a plurality of channels based at least in part on the raw depth map,
The performing the background inpainting includes performing a diffusion-based inpainting technique to fill the missing area by propagating image content from the boundary between the missing area and the background area to the interior of the missing area,
The method of claim 1, wherein the background area includes a specific area of the image data without a foreground subject, and the missing area includes the foreground subject.

delete

According to paragraph 1,
The multiple channels have lower precision than the single channel floating point texture, and the method undergoes multiple image processing operations without losing additional precision.

According to paragraph 1,
generating a depth inpainting mask based at least in part on the depth map; and
The method further comprising performing inpainting of the depth map using the depth inpainting mask to generate an inpainted depth map.

According to clause 6,
The inpainted depth map expands a foreground region of the depth map, and the expanded foreground region corresponds to at least one region of the image data corresponding to a representation of a portion of the face corresponding to hair, ears, or shoulders. Method used to perform image processing operations.

According to paragraph 1,
generating a depth normal map based at least in part on the depth map; and
The method further comprising providing a post-processed foreground image by applying the 3D effect to a foreground region of the image data using the depth normal map.

According to paragraph 1,
further comprising transmitting the message to a messaging server for storing the message, wherein the message includes a set of assets, the set of assets being a post-processed foreground image, a post-processed depth map, and a background. How to include inpainted images.

According to clause 9,
Receiving a command to initiate sharing of the stored message on at least one receiving device; and
The method further comprising transmitting the stored message at at least one receiving device.

As a system,
processor; and
Contains memory containing instructions,
The instructions, when executed by the processor, cause the processor to:
Receiving, using a processor, image data and depth data;
selecting, using the processor, an augmented reality content generator corresponding to a three-dimensional (3D) effect;
Using the processor, applying the 3D effect to the image data and the depth data based at least in part on the selected augmented reality content generator, wherein applying the 3D effect to the image data and the depth data comprises: :
generating a depth map using at least the received depth data;
generating a packed depth map based at least in part on the depth map;
generating a segmentation mask based at least on the received image data, and
At least performing background inpainting and blurring of the received image data using the segmentation mask to generate background inpainted image data; and
cause, using a processor, to perform operations including generating a message containing information regarding the applied 3D effect, the image data, and the depth data;
The operation of generating the packed depth map is:
Converting a single-channel floating point texture to a raw depth map; and
generating a plurality of channels based at least in part on the raw depth map,
The operation of performing the background inpainting includes performing a diffusion-based inpainting technique to fill the missing area by propagating image content from the boundary between the missing area and the background area to the interior of the missing area,
The system of claim 1, wherein the background area includes a specific area of the image data without a foreground subject, and the missing area includes the foreground subject.

A non-transitory computer-readable medium containing instructions, comprising:
The instructions, when executed by a computing device, cause the computing device to:
Receiving, using a processor, image data and depth data;
selecting, using the processor, an augmented reality content generator corresponding to a three-dimensional (3D) effect;
Using the processor, applying the 3D effect to the image data and the depth data based at least in part on the selected augmented reality content generator, wherein applying the 3D effect to the image data and the depth data comprises: :
generating a depth map using at least the received depth data;
generating a packed depth map based at least in part on the depth map;
generating a segmentation mask based at least on the received image data, and
At least performing background inpainting and blurring of the received image data using the segmentation mask to generate background inpainted image data; and
cause, using a processor, to perform operations including generating a message containing information regarding the applied 3D effect, the image data, and the depth data;
The operation of generating the packed depth map is:
Converting a single-channel floating point texture to a raw depth map; and
generating a plurality of channels based at least in part on the raw depth map,
The operation of performing the background inpainting includes performing a diffusion-based inpainting technique to fill the missing area by propagating image content from the boundary between the missing area and the background area to the interior of the missing area,
wherein the background area includes a specific area of the image data without a foreground subject, and the missing area includes the foreground subject.

As a method,
Receiving, at a client device, a selection of a selectable graphical item from a plurality of selectable graphical items, the selectable graphical item comprising an augmented reality content generator for applying a 3D effect, the 3D effect comprising at least one Contains US dollar manipulation -;
capturing image data and depth data using a camera of the client device;
Applying the 3D effect to the image data and the depth data, including the at least one beautification operation based at least in part on the augmented reality content generator, wherein the beautification operation is as part of applying the 3D effect. The steps to be performed and apply the 3D effect are:
generating a depth map using at least the depth data;
generating a packed depth map based at least in part on the depth map;
generating a segmentation mask based at least on the image data, and
performing background inpainting and blurring of the image data using at least the segmentation mask to generate background inpainted image data;
generating a 3D message based at least in part on the applied 3D effect including the at least one beautification manipulation; and
rendering a view of the 3D message based at least in part on the applied 3D effect including the at least one beautification manipulation,
The steps for generating the packed depth map are:
Converting a single-channel floating point texture to a raw depth map; and
generating a plurality of channels based at least in part on the raw depth map,
The performing the background inpainting includes performing a diffusion-based inpainting technique to fill the missing area by propagating image content from the boundary between the missing area and the background area to the interior of the missing area,
The method of claim 1, wherein the background area includes a specific area of the image data without a foreground subject, and the missing area includes the foreground subject.

According to clause 13,
The steps of applying the 3D effect to the image data and the depth data are:
performing the beautification operation on at least a region of the image data containing facial image data, wherein the beautification operation includes at least one of smoothing, lighting adjustment, or color modification. method.

According to clause 13,
The method of claim 1, wherein the beautification manipulation further comprises using a machine learning model to preserve facial feature structures, smooth blemishes, remove wrinkles, or preserve facial skin texture.

According to clause 13,
generating a depth inpainting mask corresponding to a region of the depth map containing facial depth data; and
The method further comprising performing depth map inpainting of the depth map using at least the generated depth inpainting mask.

delete

According to clause 13,
The packed depth map has a lower resolution than the image data.

According to clause 16,
generating a normal map of the depth map to apply a lighting effect to facial image data of the image data; and
The method further comprising applying the lighting effect to the facial image data based at least in part on the normal map.

According to clause 19,
The lighting effect includes at least two different colors, a first color from the two different colors is applied to a first portion of the facial image data, and a second color from the two different colors is applied to the facial image data. Method applied to a second image of data.

According to clause 13,
The method further comprising causing display of an interface comprising the plurality of selectable graphical items, each selectable graphical item corresponding to a respective augmented reality content generator of the set of augmented reality content generators.

According to clause 21,
Receiving a swipe gesture through a touch screen of the client device; and
In response to receiving the swipe gesture, the method further includes causing navigation through the plurality of selectable graphical items.

As a system,
processor; and
Contains memory containing instructions,
The instructions, when executed by the processor, cause the processor to:
Receiving, at a client device, a selection of a selectable graphical item from a plurality of selectable graphical items, the selectable graphical item comprising an augmented reality content generator for applying a 3D effect, the 3D effect comprising at least one Contains US dollar manipulation -;
capturing image data and depth data using a camera of the client device;
Applying the 3D effect to the image data and the depth data, including the at least one beautification operation based at least in part on the augmented reality content generator, wherein the beautification operation is as part of applying the 3D effect. The operations performed and applying the 3D effect are:
generating a depth map using at least the depth data;
generating a packed depth map based at least in part on the depth map;
generating a segmentation mask based at least on the image data, and
At least performing background inpainting and blurring of the image data using the segmentation mask to generate background inpainted image data;
generating a 3D message based at least in part on the applied 3D effect including the at least one beautification manipulation; and
perform operations including rendering a view of the 3D message based at least in part on the applied 3D effect including the at least one beautification operation,
The operation of generating the packed depth map is:
Converting a single-channel floating point texture to a raw depth map; and
generating a plurality of channels based at least in part on the raw depth map,
The operation of performing the background inpainting includes performing a diffusion-based inpainting technique to fill the missing area by propagating image content from the boundary between the missing area and the background area to the interior of the missing area,
The system of claim 1, wherein the background area includes a specific area of the image data without a foreground subject, and the missing area includes the foreground subject.

A non-transitory computer-readable medium containing instructions, comprising:
The instructions, when executed by a computing device, cause the computing device to:
Receiving, at a client device, a selection of a selectable graphical item from a plurality of selectable graphical items, the selectable graphical item comprising an augmented reality content generator for applying a 3D effect, the 3D effect comprising at least one Contains US dollar manipulation -;
capturing image data and depth data using a camera of the client device;
Applying the 3D effect to the image data and the depth data, including the at least one beautification operation based at least in part on the augmented reality content generator, wherein the beautification operation is as part of applying the 3D effect. The operations performed and applying the 3D effect are:
generating a depth map using at least the depth data;
generating a packed depth map based at least in part on the depth map;
generating a segmentation mask based at least on the image data, and
At least performing background inpainting and blurring of the image data using the segmentation mask to generate background inpainted image data;
generating a 3D message based at least in part on the applied 3D effect including the at least one beautification manipulation; and
perform operations including rendering a view of the 3D message based at least in part on the applied 3D effect including the at least one beautification operation,
The operation of generating the packed depth map is:
Converting a single-channel floating point texture to a raw depth map; and
generating a plurality of channels based at least in part on the raw depth map,
The operation of performing the background inpainting includes performing a diffusion-based inpainting technique to fill the missing area by propagating image content from the boundary between the missing area and the background area to the interior of the missing area,
wherein the background area includes a specific area of the image data without a foreground subject, and the missing area includes the foreground subject.

As a method,
Selecting a set of augmented reality content generators from a plurality of available augmented reality content generators based on metadata associated with each augmented reality content generator, wherein the metadata indicates that the corresponding augmented reality content generator includes at least a 3D effect. and information indicating that the set of augmented reality content generators includes at least one augmented reality content generator without a 3D effect and at least one augmented reality content generator with a 3D effect;
Receiving, at a client device, a selection of a selectable graphical item from a plurality of selectable graphical items, the selectable graphical item comprising an augmented reality content generator that includes a 3D effect;
capturing image data and depth data using at least one camera of the client device; and
Applying the 3D effect to the image data and the depth data based at least in part on the augmented reality content generator, wherein applying the 3D effect includes:
generating a depth map using at least the depth data;
generating a packed depth map based at least in part on the depth map;
generating a segmentation mask based at least on the image data, and
At least performing background inpainting and blurring of the image data using the segmentation mask to generate background inpainted image data,
The steps for generating the packed depth map are:
Converting a single-channel floating point texture to a raw depth map; and
generating a plurality of channels based at least in part on the raw depth map,
The performing the background inpainting includes performing a diffusion-based inpainting technique to fill the missing area by propagating image content from the boundary between the missing area and the background area to the interior of the missing area,
The method of claim 1, wherein the background area includes a specific area of the image data without a foreground subject, and the missing area includes the foreground subject.

According to clause 25,
generating a 3D message based at least in part on the applied 3D effect; and
The method further comprising rendering a view of the 3D message based at least in part on the applied 3D effect.

According to clause 25,
The at least one camera includes a first camera and a second camera, the first camera has a first focal length, the second camera has a second focal length, the first focal length and the second camera Focal distances vary in different ways.

According to clause 27,
A disparity map is based at least in part on the distance between a first pixel from a first image captured by the first camera and a second pixel from a second image captured by the second camera. Generated, wherein the first pixel and the second pixel correspond to the same object.

According to clause 28,
The disparity map includes an image where each pixel includes a distance value between a pixel from the first image and a corresponding pixel from the second image.

According to clause 29,
First pixels of a first object in the disparity map have a brightness greater than second pixels of a second object in the disparity map, and the first pixels have a depth value less than second depth values of the second pixels. How to have them.

According to clause 28,
A method wherein a depth map is generated based at least in part on the disparity map.

According to clause 25,
A method wherein the augmented reality content generator includes a beautification operation.

According to clause 32,
The method of claim 1, wherein the augmented reality content generator includes a 3D object rendered to approximate facial image data from the image data.

According to clause 25,
The method further comprising causing display of an interface comprising a plurality of selectable graphical items, each selectable graphical item corresponding to a respective augmented reality content generator of the set of augmented reality content generators.

As a system,
processor; and
Contains memory containing instructions,
The instructions, when executed by the processor, cause the processor to:
Selecting a set of augmented reality content generators from a plurality of available augmented reality content generators based on metadata associated with each augmented reality content generator, wherein the metadata specifies that the corresponding augmented reality content generator includes at least a 3D effect. and information indicating that the set of augmented reality content generators includes at least one augmented reality content generator without a 3D effect and at least one augmented reality content generator with a 3D effect;
At a client device, receiving a selection of a selectable graphical item from a plurality of selectable graphical items, the selectable graphical item comprising an augmented reality content generator that includes a 3D effect;
capturing image data and depth data using at least one camera of the client device; and
Applying the 3D effect to the image data and the depth data based at least in part on the augmented reality content generator, wherein applying the 3D effect includes:
generating a depth map using at least the depth data;
generating a packed depth map based at least in part on the depth map;
generating a segmentation mask based at least on the image data, and
At least performing background inpainting and blurring of the image data using the segmentation mask to generate background inpainted image data -
Perform actions including,
The operation of generating the packed depth map is:
Converting a single-channel floating point texture to a raw depth map; and
generating a plurality of channels based at least in part on the raw depth map,
The operation of performing the background inpainting includes performing a diffusion-based inpainting technique to fill the missing area by propagating image content from the boundary between the missing area and the background area to the interior of the missing area,
The system of claim 1, wherein the background area includes a specific area of the image data without a foreground subject, and the missing area includes the foreground subject.

A non-transitory computer-readable medium containing instructions, comprising:
The instructions, when executed by a computing device, cause the computing device to:
Selecting a set of augmented reality content generators from a plurality of available augmented reality content generators based on metadata associated with each augmented reality content generator, wherein the metadata specifies that the corresponding augmented reality content generator includes at least a 3D effect. and information indicating that the set of augmented reality content generators includes at least one augmented reality content generator without a 3D effect and at least one augmented reality content generator with a 3D effect;
At a client device, receiving a selection of a selectable graphical item from a plurality of selectable graphical items, the selectable graphical item comprising an augmented reality content generator that includes a 3D effect;
capturing image data and depth data using at least one camera of the client device; and
Applying the 3D effect to the image data and the depth data based at least in part on the augmented reality content generator, wherein applying the 3D effect includes:
generating a depth map using at least the depth data;
generating a packed depth map based at least in part on the depth map;
generating a segmentation mask based at least on the image data, and
At least performing background inpainting and blurring of the image data using the segmentation mask to generate background inpainted image data -
Perform actions including,
The operation of generating the packed depth map is:
Converting a single-channel floating point texture to a raw depth map; and
generating a plurality of channels based at least in part on the raw depth map,
The operation of performing the background inpainting includes performing a diffusion-based inpainting technique to fill the missing area by propagating image content from the boundary between the missing area and the background area to the interior of the missing area,
wherein the background area includes a specific area of the image data without a foreground subject, and the missing area includes the foreground subject.

As a method,
Receiving, at a client device, a selection of a selectable graphical item from a plurality of selectable graphical items, the selectable graphical item comprising an augmented reality content generator that includes a 3D effect;
capturing image data using at least one camera of the client device;
generating depth data using a machine learning model based at least in part on the captured image data; and
Applying the 3D effect to the image data and the depth data based at least in part on the augmented reality content generator, wherein applying the 3D effect includes:
generating a depth map using at least the depth data;
generating a packed depth map based at least in part on the depth map;
generating a segmentation mask based at least on the image data, and
At least performing background inpainting and blurring of the image data using the segmentation mask to generate background inpainted image data,
The steps for generating the packed depth map are:
Converting a single-channel floating point texture to a raw depth map; and
generating a plurality of channels based at least in part on the raw depth map,
The performing the background inpainting includes performing a diffusion-based inpainting technique to fill the missing area by propagating image content from the boundary between the missing area and the background area to the interior of the missing area,
The method of claim 1, wherein the background area includes a specific area of the image data without a foreground subject, and the missing area includes the foreground subject.

According to clause 37,
A method wherein the image data is captured by two or more cameras.

According to clause 37,
Wherein the image data is captured using dual pixel autofocus from a single camera.

According to clause 37,
The method of claim 1, wherein the machine learning model includes a deep neural network or a convolutional neural network that provides prediction of depth data based on the captured image data.

According to clause 40,
The method of claim 1, wherein the machine learning model receives the captured image data as input and generates a depth map as output.

According to clause 40,
The method of claim 1, wherein the machine learning model is executed on a neural network processor or graphics processing unit of the client device.

According to clause 37,
generating a 3D message based at least in part on the applied 3D effect;
rendering a view of the 3D message based at least in part on the applied 3D effect;
Receiving motion data from a motion sensor of the client device;
updating a view of the 3D message based at least in part on the received motion data; and
The method further comprising rendering an updated view of the 3D message.

According to clause 37,
The at least one camera includes a first camera and a second camera, the first camera has a first focal length, the second camera has a second focal length, the first focal length and the second camera Focal distances vary in different ways.

According to clause 37,
further comprising selecting a set of augmented reality content generators from the plurality of available augmented reality content generators based on metadata associated with each augmented reality content generator, wherein the metadata determines that the corresponding augmented reality content generator is at least A method comprising information indicating that it includes a 3D effect, wherein the set of augmented reality content generators includes at least one augmented reality content generator without a 3D effect and at least one augmented reality content generator with a 3D effect.

According to clause 45,
The method further comprising causing display of an interface comprising a plurality of selectable graphical items, each selectable graphical item corresponding to a respective augmented reality content generator of the set of augmented reality content generators.

As a system,
processor; and
Contains memory containing instructions,
The instructions, when executed by the processor, cause the processor to:
At a client device, receiving a selection of a selectable graphical item from a plurality of selectable graphical items, the selectable graphical item comprising an augmented reality content generator that includes a 3D effect;
capturing image data using at least one camera of the client device;
generating depth data using a machine learning model based at least in part on the captured image data; and
Applying the 3D effect to the image data and the depth data based at least in part on the augmented reality content generator, wherein applying the 3D effect includes:
generating a depth map using at least the depth data;
generating a packed depth map based at least in part on the depth map;
generating a segmentation mask based at least on the image data, and
At least performing background inpainting and blurring of the image data using the segmentation mask to generate background inpainted image data -
Perform actions including,
The operation of generating the packed depth map is:
Converting a single-channel floating point texture to a raw depth map; and
generating a plurality of channels based at least in part on the raw depth map,
The operation of performing the background inpainting includes performing a diffusion-based inpainting technique to fill the missing area by propagating image content from the boundary between the missing area and the background area to the interior of the missing area,
The system of claim 1, wherein the background area includes a specific area of the image data without a foreground subject, and the missing area includes the foreground subject.

A non-transitory computer-readable medium containing instructions, comprising:
The instructions, when executed by a computing device, cause the computing device to:
At a client device, receiving a selection of a selectable graphical item from a plurality of selectable graphical items, the selectable graphical item comprising an augmented reality content generator that includes a 3D effect;
capturing image data using at least one camera of the client device;
generating depth data using a machine learning model based at least in part on the captured image data; and
Applying the 3D effect to the image data and the depth data based at least in part on the augmented reality content generator, wherein applying the 3D effect includes:
generating a depth map using at least the depth data;
generating a packed depth map based at least in part on the depth map;
generating a segmentation mask based at least on the image data, and
At least performing background inpainting and blurring of the image data using the segmentation mask to generate background inpainted image data -
Perform actions including,
The operation of generating the packed depth map is:
Converting a single-channel floating point texture to a raw depth map; and
generating a plurality of channels based at least in part on the raw depth map,
The operation of performing the background inpainting includes performing a diffusion-based inpainting technique to fill the missing area by propagating image content from the boundary between the missing area and the background area to the interior of the missing area,
wherein the background area includes a specific area of the image data without a foreground subject, and the missing area includes the foreground subject.

delete