KR20240142448A - 방법, 장치 및 컴퓨터 프로그램 - Google Patents

방법, 장치 및 컴퓨터 프로그램 Download PDF

Info

Publication number
KR20240142448A
KR20240142448A KR1020247025950A KR20247025950A KR20240142448A KR 20240142448 A KR20240142448 A KR 20240142448A KR 1020247025950 A KR1020247025950 A KR 1020247025950A KR 20247025950 A KR20247025950 A KR 20247025950A KR 20240142448 A KR20240142448 A KR 20240142448A
Authority
KR
South Korea
Prior art keywords
user
machine learning
learning model
avatar
movements
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
KR1020247025950A
Other languages
English (en)
Korean (ko)
Inventor
파슈미나 조나단 카메론
세실리 페레그린 보르가티 모리슨
마틴 필립 그레이슨
다니엘라 마시세티
매튜 알라스테어 존슨
에드워드 숀 로이드 린텔
마르케스 리타 파이아
Original Assignee
마이크로소프트 테크놀로지 라이센싱, 엘엘씨
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 마이크로소프트 테크놀로지 라이센싱, 엘엘씨 filed Critical 마이크로소프트 테크놀로지 라이센싱, 엘엘씨
Publication of KR20240142448A publication Critical patent/KR20240142448A/ko
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T13/00Animation
    • G06T13/20Three-dimensional [3D] animation
    • G06T13/205Three-dimensional [3D] animation driven by audio data
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T13/00Animation
    • G06T13/20Three-dimensional [3D] animation
    • G06T13/40Three-dimensional [3D] animation of characters, e.g. humans, animals or virtual beings
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/06Transformation of speech into a non-audible representation, e.g. speech visualisation or speech processing for tactile aids
    • G10L21/10Transforming into visible information
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/27Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the analysis technique
    • G10L25/30Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the analysis technique using neural networks
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/06Transformation of speech into a non-audible representation, e.g. speech visualisation or speech processing for tactile aids
    • G10L21/10Transforming into visible information
    • G10L2021/105Synthesis of the lips movements from speech, e.g. for talking heads

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Multimedia (AREA)
  • General Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Signal Processing (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Acoustics & Sound (AREA)
  • Evolutionary Computation (AREA)
  • Artificial Intelligence (AREA)
  • Data Mining & Analysis (AREA)
  • Quality & Reliability (AREA)
  • Processing Or Creating Images (AREA)
  • Two-Way Televisions, Distribution Of Moving Picture Or The Like (AREA)
KR1020247025950A 2022-01-31 2023-01-06 방법, 장치 및 컴퓨터 프로그램 Pending KR20240142448A (ko)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
EP22154373.9A EP4220565A1 (en) 2022-01-31 2022-01-31 Method, apparatus and computer program
EP22154373.9 2022-01-31
PCT/US2023/010261 WO2023146741A1 (en) 2022-01-31 2023-01-06 Method, apparatus and computer program

Publications (1)

Publication Number Publication Date
KR20240142448A true KR20240142448A (ko) 2024-09-30

Family

ID=80119033

Family Applications (1)

Application Number Title Priority Date Filing Date
KR1020247025950A Pending KR20240142448A (ko) 2022-01-31 2023-01-06 방법, 장치 및 컴퓨터 프로그램

Country Status (6)

Country Link
US (1) US20250069308A1 (https=)
EP (2) EP4220565A1 (https=)
JP (1) JP2025505340A (https=)
KR (1) KR20240142448A (https=)
CN (1) CN118648026A (https=)
WO (1) WO2023146741A1 (https=)

Families Citing this family (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US12518482B2 (en) * 2023-08-10 2026-01-06 Qualcomm Incorporated Virtual representative conditioning system
US12367425B1 (en) 2024-01-12 2025-07-22 THIA ST Co. Copilot customization with data producer(s)
US12242503B1 (en) 2024-01-12 2025-03-04 THIA ST Co. Copilot architecture: network of microservices including specialized machine learning tools
US12536045B2 (en) 2024-01-12 2026-01-27 THIA ST Co. Distribution of tasks among microservices in a copilot
US12367426B1 (en) * 2024-01-12 2025-07-22 THIA ST Co. Customization of machine learning tools with occupation training
US20250267239A1 (en) * 2024-02-15 2025-08-21 Microsoft Technology Licensing, Llc Generative communication session event effects

Family Cites Families (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20160134840A1 (en) * 2014-07-28 2016-05-12 Alexa Margaret McCulloch Avatar-Mediated Telepresence Systems with Enhanced Filtering
US10559111B2 (en) * 2016-06-23 2020-02-11 LoomAi, Inc. Systems and methods for generating computer ready animation models of a human head from captured data images
EP3797404A4 (en) * 2018-05-22 2022-02-16 Magic Leap, Inc. SKELETAL SYSTEMS FOR ANIMATION OF VIRTUAL AVATARS
US10755463B1 (en) * 2018-07-20 2020-08-25 Facebook Technologies, Llc Audio-based face tracking and lip syncing for natural facial animation and lip movement
US11568645B2 (en) * 2019-03-21 2023-01-31 Samsung Electronics Co., Ltd. Electronic device and controlling method thereof
US10949715B1 (en) * 2019-08-19 2021-03-16 Neon Evolution Inc. Methods and systems for image and voice processing
EP4081985A1 (en) * 2020-01-29 2022-11-02 Google LLC Photorealistic talking faces from audio
US11127225B1 (en) 2020-06-01 2021-09-21 Microsoft Technology Licensing, Llc Fitting 3D models of composite objects
WO2022103877A1 (en) * 2020-11-13 2022-05-19 Innopeak Technology, Inc. Realistic audio driven 3d avatar generation
US11734888B2 (en) * 2021-04-23 2023-08-22 Meta Platforms Technologies, Llc Real-time 3D facial animation from binocular video

Also Published As

Publication number Publication date
CN118648026A (zh) 2024-09-13
US20250069308A1 (en) 2025-02-27
EP4220565A1 (en) 2023-08-02
EP4473490A1 (en) 2024-12-11
WO2023146741A1 (en) 2023-08-03
JP2025505340A (ja) 2025-02-26

Similar Documents

Publication Publication Date Title
KR20240142448A (ko) 방법, 장치 및 컴퓨터 프로그램
US12277640B2 (en) Photorealistic real-time portrait animation
KR102758381B1 (ko) 3차원(3d) 환경에 대한 통합된 입/출력
US11062494B2 (en) Electronic messaging utilizing animatable 3D models
US12517697B2 (en) Communication assistance program, communication assistance method, communication assistance system, terminal device, and non-verbal expression program
US11983808B2 (en) Conversation-driven character animation
KR20210119441A (ko) 텍스트 및 오디오 기반 실시간 얼굴 재연
US11005796B2 (en) Animated delivery of electronic messages
US11741650B2 (en) Advanced electronic messaging utilizing animatable 3D models
WO2008087621A1 (en) An apparatus and method for animating emotionally driven virtual objects
KR20250088614A (ko) 사람의 전체-신체를 스타일화함
KR20250067167A (ko) 이미지 워핑을 사용한 실세계 객체 변형
KR20250105409A (ko) 비디오 및 오디오로부터의 로버스트 얼굴 애니메이션
Chandrasiri et al. Communication over the Internet using a 3D agent with real-time facial expression analysis, synthesis and text to speech capabilities
Barakonyi et al. Communicating Multimodal information on the WWW using a lifelike, animated 3D agent
Ishizuka et al. ICCS 2002

Legal Events

Date Code Title Description
PA0105 International application

St.27 status event code: A-0-1-A10-A15-nap-PA0105

PG1501 Laying open of application

St.27 status event code: A-1-1-Q10-Q12-nap-PG1501

P11 Amendment of application requested

Free format text: ST27 STATUS EVENT CODE: A-2-2-P10-P11-NAP-X000 (AS PROVIDED BY THE NATIONAL OFFICE)

P11-X000 Amendment of application requested

St.27 status event code: A-2-2-P10-P11-nap-X000