JP2016513410A5 - - Google Patents

Download PDF

Info

Publication number
JP2016513410A5
JP2016513410A5 JP2015558105A JP2015558105A JP2016513410A5 JP 2016513410 A5 JP2016513410 A5 JP 2016513410A5 JP 2015558105 A JP2015558105 A JP 2015558105A JP 2015558105 A JP2015558105 A JP 2015558105A JP 2016513410 A5 JP2016513410 A5 JP 2016513410A5
Authority
JP
Japan
Prior art keywords
audio
video
objects
data
metadata
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
JP2015558105A
Other languages
English (en)
Japanese (ja)
Other versions
JP6039111B2 (ja
JP2016513410A (ja
Filing date
Publication date
Priority claimed from US13/831,018 external-priority patent/US9338420B2/en
Application filed filed Critical
Publication of JP2016513410A publication Critical patent/JP2016513410A/ja
Publication of JP2016513410A5 publication Critical patent/JP2016513410A5/ja
Application granted granted Critical
Publication of JP6039111B2 publication Critical patent/JP6039111B2/ja
Expired - Fee Related legal-status Critical Current
Anticipated expiration legal-status Critical

Links

JP2015558105A 2013-02-15 2014-02-12 マルチチャネルオーディオデータのビデオ解析支援生成 Expired - Fee Related JP6039111B2 (ja)

Applications Claiming Priority (5)

Application Number Priority Date Filing Date Title
US201361765556P 2013-02-15 2013-02-15
US61/765,556 2013-02-15
US13/831,018 US9338420B2 (en) 2013-02-15 2013-03-14 Video analysis assisted generation of multi-channel audio data
US13/831,018 2013-03-14
PCT/US2014/016059 WO2014127019A1 (en) 2013-02-15 2014-02-12 Video analysis assisted generation of multi-channel audio data

Publications (3)

Publication Number Publication Date
JP2016513410A JP2016513410A (ja) 2016-05-12
JP2016513410A5 true JP2016513410A5 (enrdf_load_stackoverflow) 2016-08-12
JP6039111B2 JP6039111B2 (ja) 2016-12-07

Family

ID=51351238

Family Applications (1)

Application Number Title Priority Date Filing Date
JP2015558105A Expired - Fee Related JP6039111B2 (ja) 2013-02-15 2014-02-12 マルチチャネルオーディオデータのビデオ解析支援生成

Country Status (6)

Country Link
US (1) US9338420B2 (enrdf_load_stackoverflow)
EP (1) EP2956941A1 (enrdf_load_stackoverflow)
JP (1) JP6039111B2 (enrdf_load_stackoverflow)
KR (1) KR101761039B1 (enrdf_load_stackoverflow)
CN (1) CN104995681B (enrdf_load_stackoverflow)
WO (1) WO2014127019A1 (enrdf_load_stackoverflow)

Families Citing this family (80)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102804686B (zh) * 2010-03-16 2016-08-24 三星电子株式会社 内容输出系统及其编解码器信息共享方法
US10326978B2 (en) 2010-06-30 2019-06-18 Warner Bros. Entertainment Inc. Method and apparatus for generating virtual or augmented reality presentations with 3D audio positioning
BR122022005104B1 (pt) 2013-03-28 2022-09-13 Dolby Laboratories Licensing Corporation Método para renderizar um áudio de entrada, aparelho para renderizar um áudio de entrada e meio não transitório
US9716959B2 (en) 2013-05-29 2017-07-25 Qualcomm Incorporated Compensating for error in decomposed representations of sound fields
US9466305B2 (en) * 2013-05-29 2016-10-11 Qualcomm Incorporated Performing positional analysis to code spherical harmonic coefficients
KR101681529B1 (ko) 2013-07-31 2016-12-01 돌비 레버러토리즈 라이쎈싱 코오포레이션 공간적으로 분산된 또는 큰 오디오 오브젝트들의 프로세싱
US9137232B2 (en) * 2014-01-14 2015-09-15 Xerox Corporation Method and system for controlling access to document data using augmented reality marker
US9922656B2 (en) 2014-01-30 2018-03-20 Qualcomm Incorporated Transitioning of ambient higher-order ambisonic coefficients
US9502045B2 (en) 2014-01-30 2016-11-22 Qualcomm Incorporated Coding independent frames of ambient higher-order ambisonic coefficients
US9620137B2 (en) 2014-05-16 2017-04-11 Qualcomm Incorporated Determining between scalar and vector quantization in higher order ambisonic coefficients
US9852737B2 (en) 2014-05-16 2017-12-26 Qualcomm Incorporated Coding vectors decomposed from higher-order ambisonics audio signals
US10770087B2 (en) 2014-05-16 2020-09-08 Qualcomm Incorporated Selecting codebooks for coding vectors decomposed from higher-order ambisonic audio signals
US9747910B2 (en) 2014-09-26 2017-08-29 Qualcomm Incorporated Switching between predictive and non-predictive quantization techniques in a higher order ambisonics (HOA) framework
US20160179803A1 (en) * 2014-12-22 2016-06-23 Rovi Guides, Inc. Augmenting metadata using commonly available visual elements associated with media content
CN107409264B (zh) * 2015-01-16 2021-02-05 三星电子株式会社 基于图像信息处理声音的方法和对应设备
CN105989845B (zh) * 2015-02-25 2020-12-08 杜比实验室特许公司 视频内容协助的音频对象提取
US9609383B1 (en) * 2015-03-23 2017-03-28 Amazon Technologies, Inc. Directional audio for virtual environments
US10176644B2 (en) * 2015-06-07 2019-01-08 Apple Inc. Automatic rendering of 3D sound
TWI736542B (zh) * 2015-08-06 2021-08-21 日商新力股份有限公司 資訊處理裝置、資料配訊伺服器及資訊處理方法、以及非暫時性電腦可讀取之記錄媒體
US10762911B2 (en) 2015-12-01 2020-09-01 Ati Technologies Ulc Audio encoding using video information
GB2545275A (en) * 2015-12-11 2017-06-14 Nokia Technologies Oy Causing provision of virtual reality content
KR20170106063A (ko) * 2016-03-11 2017-09-20 가우디오디오랩 주식회사 오디오 신호 처리 방법 및 장치
US10979843B2 (en) * 2016-04-08 2021-04-13 Qualcomm Incorporated Spatialized audio output based on predicted position data
JP6959943B2 (ja) * 2016-05-25 2021-11-05 ワーナー ブラザーズ エンターテイメント インコーポレイテッド 3d音声ポジショニングを用いて仮想現実又は拡張現実のプレゼンテーションを生成するための方法及び装置
JP6984596B2 (ja) * 2016-05-30 2021-12-22 ソニーグループ株式会社 映像音響処理装置および方法、並びにプログラム
US10074012B2 (en) 2016-06-17 2018-09-11 Dolby Laboratories Licensing Corporation Sound and video object tracking
CN106162447A (zh) * 2016-06-24 2016-11-23 维沃移动通信有限公司 一种音频播放的方法和终端
US10445936B1 (en) * 2016-08-01 2019-10-15 Snap Inc. Audio responsive augmented reality
EP3324407A1 (en) 2016-11-17 2018-05-23 Fraunhofer Gesellschaft zur Förderung der Angewand Apparatus and method for decomposing an audio signal using a ratio as a separation characteristic
EP3324406A1 (en) 2016-11-17 2018-05-23 Fraunhofer Gesellschaft zur Förderung der Angewand Apparatus and method for decomposing an audio signal using a variable threshold
GB2557241A (en) * 2016-12-01 2018-06-20 Nokia Technologies Oy Audio processing
EP3343483A1 (en) 2016-12-30 2018-07-04 Spotify AB System and method for providing a video with lyrics overlay for use in a social messaging environment
EP3343347A1 (en) * 2016-12-30 2018-07-04 Nokia Technologies Oy Audio processing
EP3343957B1 (en) * 2016-12-30 2022-07-06 Nokia Technologies Oy Multimedia content
US10659906B2 (en) 2017-01-13 2020-05-19 Qualcomm Incorporated Audio parallax for virtual reality, augmented reality, and mixed reality
CN108632551A (zh) * 2017-03-16 2018-10-09 南昌黑鲨科技有限公司 基于深度学习的视频录摄方法、装置及终端
JP7143843B2 (ja) * 2017-04-13 2022-09-29 ソニーグループ株式会社 信号処理装置および方法、並びにプログラム
US11574644B2 (en) * 2017-04-26 2023-02-07 Sony Corporation Signal processing device and method, and program
EP3399398B1 (en) * 2017-05-02 2022-04-13 Nokia Technologies Oy An apparatus and associated methods for presentation of spatial audio
TWI687919B (zh) * 2017-06-15 2020-03-11 宏達國際電子股份有限公司 音頻訊號處理方法、音頻定位系統以及非暫態電腦可讀取媒體
US10178490B1 (en) * 2017-06-30 2019-01-08 Apple Inc. Intelligent audio rendering for video recording
US11164606B2 (en) * 2017-06-30 2021-11-02 Qualcomm Incorporated Audio-driven viewport selection
US10224074B2 (en) * 2017-07-12 2019-03-05 Karl Storz Imaging, Inc. Apparatus and methods for improving video quality from a digital video signal including replicated image frames
CN111052770B (zh) 2017-09-29 2021-12-03 苹果公司 空间音频下混频的方法及系统
WO2019067469A1 (en) * 2017-09-29 2019-04-04 Zermatt Technologies Llc FILE FORMAT FOR SPACE
US10469968B2 (en) 2017-10-12 2019-11-05 Qualcomm Incorporated Rendering for computer-mediated reality systems
US10714144B2 (en) 2017-11-06 2020-07-14 International Business Machines Corporation Corroborating video data with audio data from video content to create section tagging
US11003676B2 (en) * 2018-02-27 2021-05-11 Sap Se Software integration object linking data structures
US11847773B1 (en) 2018-04-27 2023-12-19 Splunk Inc. Geofence-based object identification in an extended reality environment
US11145123B1 (en) 2018-04-27 2021-10-12 Splunk Inc. Generating extended reality overlays in an industrial environment
US11450071B2 (en) * 2018-05-23 2022-09-20 Koninklijke Kpn N.V. Adapting acoustic rendering to image-based object
US11715302B2 (en) * 2018-08-21 2023-08-01 Streem, Llc Automatic tagging of images using speech recognition
US11012774B2 (en) 2018-10-29 2021-05-18 Apple Inc. Spatially biased sound pickup for binaural video recording
GB201818959D0 (en) 2018-11-21 2019-01-09 Nokia Technologies Oy Ambience audio representation and associated rendering
US11259134B2 (en) 2018-11-26 2022-02-22 Raytheon Bbn Technologies Corp. Systems and methods for enhancing attitude awareness in telepresence applications
KR102737006B1 (ko) * 2019-03-08 2024-12-02 엘지전자 주식회사 음향 객체 추종을 위한 방법 및 이를 위한 장치
CN111757240B (zh) * 2019-03-26 2021-08-20 瑞昱半导体股份有限公司 音频处理方法与音频处理系统
CN111757239B (zh) * 2019-03-28 2021-11-19 瑞昱半导体股份有限公司 音频处理方法与音频处理系统
US11030479B2 (en) * 2019-04-30 2021-06-08 Sony Interactive Entertainment Inc. Mapping visual tags to sound tags using text similarity
US10869152B1 (en) 2019-05-31 2020-12-15 Dts, Inc. Foveated audio rendering
CN110381336B (zh) * 2019-07-24 2021-07-16 广州飞达音响股份有限公司 基于5.1声道的视频片段情感判定方法、装置和计算机设备
US11276419B2 (en) 2019-07-30 2022-03-15 International Business Machines Corporation Synchronized sound generation from videos
US11356796B2 (en) 2019-11-22 2022-06-07 Qualcomm Incorporated Priority-based soundfield coding for virtual reality audio
US12094476B2 (en) 2019-12-02 2024-09-17 Dolby Laboratories Licensing Corporation Systems, methods and apparatus for conversion from channel-based audio to object-based audio
KR102712458B1 (ko) 2019-12-09 2024-10-04 삼성전자주식회사 오디오 출력 장치 및 오디오 출력 장치의 제어 방법
US11823698B2 (en) * 2020-01-17 2023-11-21 Audiotelligence Limited Audio cropping
US11704087B2 (en) * 2020-02-03 2023-07-18 Google Llc Video-informed spatial audio expansion
US11694084B2 (en) 2020-04-14 2023-07-04 Sony Interactive Entertainment Inc. Self-supervised AI-assisted sound effect recommendation for silent video
US11755275B2 (en) * 2020-06-29 2023-09-12 Meta Platforms Technologies, Llc Generating augmented reality experiences utilizing physical objects to represent analogous virtual objects
CN111863002A (zh) * 2020-07-06 2020-10-30 Oppo广东移动通信有限公司 处理方法、处理装置、电子设备
CN111787464B (zh) * 2020-07-31 2022-06-14 Oppo广东移动通信有限公司 一种信息处理方法、装置、电子设备和存储介质
US11546692B1 (en) 2020-08-19 2023-01-03 Apple Inc. Audio renderer based on audiovisual information
US11521623B2 (en) 2021-01-11 2022-12-06 Bank Of America Corporation System and method for single-speaker identification in a multi-speaker environment on a low-frequency audio recording
US12192738B2 (en) 2021-04-23 2025-01-07 Samsung Electronics Co., Ltd. Electronic apparatus for audio signal processing and operating method thereof
KR102437760B1 (ko) * 2021-05-27 2022-08-29 이충열 컴퓨팅 장치에 의한 음향의 처리 방법, 영상 및 음향의 처리 방법 및 이를 이용한 시스템들
CN113316078B (zh) * 2021-07-30 2021-10-29 腾讯科技(深圳)有限公司 数据处理方法、装置、计算机设备及存储介质
US12039793B2 (en) 2021-11-10 2024-07-16 Meta Platforms Technologies, Llc Automatic artificial reality world creation
CN114842877A (zh) * 2022-03-21 2022-08-02 南京惠积信息科技有限公司 基于人员隐私保护的视频水声检测方法及装置
US20240056761A1 (en) * 2022-08-10 2024-02-15 Samsung Electronics Co., Ltd. Three-dimensional (3d) sound rendering with multi-channel audio based on mono audio input
EP4588247A1 (en) * 2022-09-13 2025-07-23 Dolby Laboratories Licensing Corporation Audio-visual analytic for object rendering in capture

Family Cites Families (17)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6829018B2 (en) * 2001-09-17 2004-12-07 Koninklijke Philips Electronics N.V. Three-dimensional sound creation assisted by visual information
DK2215858T4 (da) * 2007-11-14 2020-09-28 Sonova Ag Metode og anordning til justering af et høresystem
US20100098258A1 (en) 2008-10-22 2010-04-22 Karl Ola Thorn System and method for generating multichannel audio with a portable electronic device
EP2380033B1 (en) 2008-12-16 2017-05-17 Koninklijke Philips N.V. Estimating a sound source location using particle filtering
WO2010140254A1 (ja) 2009-06-05 2010-12-09 パイオニア株式会社 映像音声出力装置及び音声定位方法
US8984501B2 (en) * 2009-06-19 2015-03-17 Dolby Laboratories Licensing Corporation Hierarchy and processing order control of downloadable and upgradeable media processing applications
WO2011011737A1 (en) 2009-07-24 2011-01-27 Digimarc Corporation Improved audio/video methods and systems
US8963987B2 (en) * 2010-05-27 2015-02-24 Microsoft Corporation Non-linguistic signal detection and feedback
US8755432B2 (en) * 2010-06-30 2014-06-17 Warner Bros. Entertainment Inc. Method and apparatus for generating 3D audio positioning using dynamically optimized audio 3D space perception cues
US8638951B2 (en) 2010-07-15 2014-01-28 Motorola Mobility Llc Electronic apparatus for generating modified wideband audio signals based on two or more wideband microphone signals
US8433076B2 (en) 2010-07-26 2013-04-30 Motorola Mobility Llc Electronic apparatus for generating beamformed audio signals with steerable nulls
US9031256B2 (en) 2010-10-25 2015-05-12 Qualcomm Incorporated Systems, methods, apparatus, and computer-readable media for orientation-sensitive recording control
US8855341B2 (en) * 2010-10-25 2014-10-07 Qualcomm Incorporated Systems, methods, apparatus, and computer-readable media for head tracking based on recorded sound signals
US9552840B2 (en) 2010-10-25 2017-01-24 Qualcomm Incorporated Three-dimensional sound capturing and reproducing with multi-microphones
US11120818B2 (en) 2010-11-12 2021-09-14 Nokia Technologies Oy Processing audio with a visual representation of an audio source
FR2974097B1 (fr) 2011-04-14 2013-04-19 Michelin Soc Tech Composition de caoutchouc comprenant un derive de la thiazoline
US20130162752A1 (en) * 2011-12-22 2013-06-27 Advanced Micro Devices, Inc. Audio and Video Teleconferencing Using Voiceprints and Face Prints

Similar Documents

Publication Publication Date Title
JP2016513410A5 (enrdf_load_stackoverflow)
WO2017009851A3 (en) Coordinating communication and/or storage based on image analysis
WO2016025623A3 (en) Image linking and sharing
MX2017012505A (es) Configuracion de diferentes sensibilidades de modelos de fondo mediante regiones definidas por el usuario y filtros de fondo.
JP2017505475A5 (enrdf_load_stackoverflow)
MY192140A (en) Information processing method, terminal, and computer storage medium
WO2014155130A3 (en) Method, system and computer program for comparing images
EP3009959A3 (en) Identifying content of interest
JP2016506669A5 (enrdf_load_stackoverflow)
WO2016174524A3 (en) Data processing systems
WO2016106383A3 (en) First-person camera based visual context aware system
JP2019504379A (ja) 煙検出装置、方法及び画像処理装置
RU2017143920A (ru) Устройство, способ и программа аудиообработки
MX364283B (es) Compartir fotos sugeridas.
WO2016050347A3 (en) Audio identification device, audio identification method and audio identification system
JP2015508205A5 (enrdf_load_stackoverflow)
JP2017144521A5 (enrdf_load_stackoverflow)
JP2013161405A5 (ja) 被写体判定装置、被写体判定方法及びプログラム
US20200293179A1 (en) Prioritization for presentation of media based on sensor data collected by wearable sensor devices
EP2809062A3 (en) Image processor, image processing method and program, and recording medium
GB2571686A (en) System and method for analyzing and associating elements of a computer system by shared characteristics
JP2018530821A5 (enrdf_load_stackoverflow)
US9508386B2 (en) Method and apparatus for synchronizing audio and video signals
JP2016517062A5 (enrdf_load_stackoverflow)
JP2017117408A5 (enrdf_load_stackoverflow)