CN117120963A - Apparatus and method for generating an audio signal - Google Patents

Apparatus and method for generating an audio signal

Info

Publication number
CN117120963A
CN117120963A CN202280027110.XA CN202280027110A CN117120963A CN 117120963 A CN117120963 A CN 117120963A CN 202280027110 A CN202280027110 A CN 202280027110A CN 117120963 A CN117120963 A CN 117120963A
Authority
CN
China
Prior art keywords
audio
objects
image
real world
real
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202280027110.XA
Other languages
English (en)
Chinese (zh)
Inventor
C·韦雷坎普
J·G·H·科庞
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Koninklijke Philips NV
Original Assignee
Koninklijke Philips NV
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Koninklijke Philips NV
Publication of CN117120963A
Legal status: Pending

Classifications

    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00 Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/16 Sound input; Sound output
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00 Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/01 Input arrangements or combined input and output arrangements for interaction between user and computer
    • G06F3/011 Arrangements for interaction with the human body, e.g. for user immersion in virtual reality
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00 Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/16 Sound input; Sound output
    • G06F3/167 Audio in a user interface, e.g. using voice commands for navigating, audio feedback
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06T IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T19/00 Manipulating three-dimensional [3D] models or images for computer graphics
    • G06T19/006 Mixed reality
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06T IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00 Image analysis
    • G06T7/70 Determining position or orientation of objects or cameras
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00 Arrangements for image or video recognition or understanding
    • G06V10/20 Image preprocessing
    • G06V10/22 Image preprocessing by selection of a specific region containing or referencing a pattern; Locating or processing of specific regions to guide the detection or recognition
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V20/00 Scenes; Scene-specific elements
    • G06V20/20 Scenes; Scene-specific elements in augmented reality scenes
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V20/00 Scenes; Scene-specific elements
    • G06V20/40 Scenes; Scene-specific elements in video content
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06T IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00 Indexing scheme for image analysis or image enhancement
    • G06T2207/10 Image acquisition modality
    • G06T2207/10016 Video; Image sequence

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • Human Computer Interaction (AREA)
  • Multimedia (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Health & Medical Sciences (AREA)
  • General Health & Medical Sciences (AREA)
  • Software Systems (AREA)
  • Computer Hardware Design (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Computer Graphics (AREA)
  • Processing Or Creating Images (AREA)
  • User Interface Of Digital Computer (AREA)
CN202280027110.XA 2021-04-08 2022-03-29 Apparatus and method for generating an audio signal Pending CN117120963A (zh)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
EP21167514.5 2021-04-08
EP21167514.5A EP4071585A1 (en) 2021-04-08 2021-04-08 Apparatus and method for generating an audio signal
PCT/EP2022/058273 WO2022214357A1 (en) 2021-04-08 2022-03-29 Apparatus and method for generating an audio signal

Publications (1)

Publication Number Publication Date
CN117120963A (zh) 2023-11-24

Family

ID=75438686

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202280027110.XA Pending CN117120963A (zh) 2021-04-08 2022-03-29 Apparatus and method for generating an audio signal

Country Status (9)

Country Link
US (1) US20240370223A1
EP (2) EP4071585A1
JP (1) JP2024513082A
KR (1) KR20230164187A
CN (1) CN117120963A
BR (1) BR112023020318A2
ES (1) ES3013574T3
PL (1) PL4320498T3
WO (1) WO2022214357A1

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20240242380A1 (en) * 2023-01-13 2024-07-18 Maya Heat Transfer Technologies Ltd. System for generating an image dataset for training an artificial intelligence model for object recognition, and method of use thereof
WO2025263770A1 (ko) * 2024-06-17 2025-12-26 Samsung Electronics Co., Ltd. Electronic device and method for processing video

Family Cites Families (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8831255B2 (en) * 2012-03-08 2014-09-09 Disney Enterprises, Inc. Augmented reality (AR) audio with position and action triggered virtual sound effects
US10395435B2 (en) * 2016-04-04 2019-08-27 Occipital, Inc. System for multimedia spatial annotation, visualization, and recommendation
GB201800920D0 (en) * 2018-01-19 2018-03-07 Nokia Technologies Oy Associated spatial audio playback
IL305389B2 (en) * 2018-02-15 2024-09-01 Magic Leap Inc Musical instruments in mixed reality
US10565797B2 (en) * 2018-02-17 2020-02-18 Varjo Technologies Oy System and method of enhancing user's immersion in mixed reality mode of display apparatus
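CN112514398B (zh) * 2018-06-01 2023-07-14 Nokia Technologies Oy Method and apparatus for signaling user interactions on overlays and grouping overlays to background for omnidirectional content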
US11651567B2 (en) * 2018-12-13 2023-05-16 Maxell, Ltd. Display terminal, display control system and display control method
EP4158908A4 (en) * 2020-05-29 2023-11-29 Magic Leap, Inc. SURFACE COLLISIONS
US20210405743A1 (en) * 2020-06-26 2021-12-30 Apple Inc. Dynamic media item delivery

Also Published As

Publication number Publication date
JP2024513082A (ja) 2024-03-21
BR112023020318A2 (pt) 2023-11-21
KR20230164187A (ko) 2023-12-01
ES3013574T3 (en) 2025-04-14
EP4071585A1 (en) 2022-10-12
EP4320498A1 (en) 2024-02-14
WO2022214357A1 (en) 2022-10-13
EP4320498C0 (en) 2025-01-29
US20240370223A1 (en) 2024-11-07
EP4320498B1 (en) 2025-01-29
PL4320498T3 (pl) 2025-04-14

Similar Documents

Publication Publication Date Title
US11217006B2 (en) Methods and systems for performing 3D simulation based on a 2D video image
Darrell et al. Integrated person tracking using stereo, color, and pattern detection
US10582191B1 (en) Dynamic angle viewing system
CN114631127B (zh) Few-shot synthesis of talking heads
US10777016B2 (en) System and method of enhancing user's immersion in mixed reality mode of display apparatus
US11200745B2 (en) Systems, methods, and media for automatically triggering real-time visualization of physical environment in artificial reality
CN109345556B (zh) Neural network foreground separation for mixed reality
US8878846B1 (en) Superimposing virtual views of 3D objects with live images
US8624962B2 (en) Systems and methods for simulating three-dimensional virtual interactions from two-dimensional camera images
JP7527351B2 (ja) Apparatus and method for evaluating the quality of image capture of a scene
US20130207962A1 (en) User interactive kiosk with three-dimensional display
US20110164032A1 (en) Three-Dimensional User Interface
CN112581627A (zh) System and apparatus for user-controlled virtual camera for volumetric video
KR20210134956A (ko) Processing of depth maps for images
JP7787238B2 (ja) Image processing apparatus, image processing method, and image processing system
EP4320498B1 (en) Apparatus and method for generating an audio signal
US11710273B2 (en) Image processing
CN110809751B (zh) Methods, apparatus, systems, and computer programs for enabling consumption of mediated reality virtual content
CN116778058B (zh) Intelligent interaction system for an intelligent exhibition hall
US20250218137A1 (en) Adaptive model updates for dynamic and static scenes
TW202239201A (zh) Image synthesis system and method thereof
CN117616760A (zh) Image generation
HK1174113B (en) Using a three-dimensional environment model in gameplay
HK1174113A1 (en) Using a three-dimensional environment model in gameplay

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination