JP2016513410A5 - - Google Patents

Download PDF

Info

Publication number
JP2016513410A5
JP2016513410A5 JP2015558105A JP2015558105A JP2016513410A5 JP 2016513410 A5 JP2016513410 A5 JP 2016513410A5 JP 2015558105 A JP2015558105 A JP 2015558105A JP 2015558105 A JP2015558105 A JP 2015558105A JP 2016513410 A5 JP2016513410 A5 JP 2016513410A5
Authority
JP
Japan
Prior art keywords
audio
video
objects
data
metadata
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
JP2015558105A
Other languages
English (en)
Japanese (ja)
Other versions
JP2016513410A (ja
JP6039111B2 (ja
Filing date
Publication date
Priority claimed from US13/831,018 external-priority patent/US9338420B2/en
Application filed filed Critical
Publication of JP2016513410A publication Critical patent/JP2016513410A/ja
Publication of JP2016513410A5 publication Critical patent/JP2016513410A5/ja
Application granted granted Critical
Publication of JP6039111B2 publication Critical patent/JP6039111B2/ja
Expired - Fee Related legal-status Critical Current
Anticipated expiration legal-status Critical

Links

JP2015558105A 2013-02-15 2014-02-12 マルチチャネルオーディオデータのビデオ解析支援生成 Expired - Fee Related JP6039111B2 (ja)

Applications Claiming Priority (5)

Application Number Priority Date Filing Date Title
US201361765556P 2013-02-15 2013-02-15
US61/765,556 2013-02-15
US13/831,018 2013-03-14
US13/831,018 US9338420B2 (en) 2013-02-15 2013-03-14 Video analysis assisted generation of multi-channel audio data
PCT/US2014/016059 WO2014127019A1 (en) 2013-02-15 2014-02-12 Video analysis assisted generation of multi-channel audio data

Publications (3)

Publication Number Publication Date
JP2016513410A JP2016513410A (ja) 2016-05-12
JP2016513410A5 true JP2016513410A5 (enExample) 2016-08-12
JP6039111B2 JP6039111B2 (ja) 2016-12-07

Family

ID=51351238

Family Applications (1)

Application Number Title Priority Date Filing Date
JP2015558105A Expired - Fee Related JP6039111B2 (ja) 2013-02-15 2014-02-12 マルチチャネルオーディオデータのビデオ解析支援生成

Country Status (6)

Country Link
US (1) US9338420B2 (enExample)
EP (1) EP2956941A1 (enExample)
JP (1) JP6039111B2 (enExample)
KR (1) KR101761039B1 (enExample)
CN (1) CN104995681B (enExample)
WO (1) WO2014127019A1 (enExample)

Families Citing this family (83)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102804686B (zh) * 2010-03-16 2016-08-24 三星电子株式会社 内容输出系统及其编解码器信息共享方法
US10326978B2 (en) 2010-06-30 2019-06-18 Warner Bros. Entertainment Inc. Method and apparatus for generating virtual or augmented reality presentations with 3D audio positioning
AU2014241011B2 (en) * 2013-03-28 2016-01-28 Dolby International Ab Rendering of audio objects with apparent size to arbitrary loudspeaker layouts
US9763019B2 (en) 2013-05-29 2017-09-12 Qualcomm Incorporated Analysis of decomposed representations of a sound field
US9466305B2 (en) * 2013-05-29 2016-10-11 Qualcomm Incorporated Performing positional analysis to code spherical harmonic coefficients
KR102327504B1 (ko) * 2013-07-31 2021-11-17 돌비 레버러토리즈 라이쎈싱 코오포레이션 공간적으로 분산된 또는 큰 오디오 오브젝트들의 프로세싱
US9137232B2 (en) * 2014-01-14 2015-09-15 Xerox Corporation Method and system for controlling access to document data using augmented reality marker
US9922656B2 (en) 2014-01-30 2018-03-20 Qualcomm Incorporated Transitioning of ambient higher-order ambisonic coefficients
US9489955B2 (en) 2014-01-30 2016-11-08 Qualcomm Incorporated Indicating frame parameter reusability for coding vectors
US9620137B2 (en) 2014-05-16 2017-04-11 Qualcomm Incorporated Determining between scalar and vector quantization in higher order ambisonic coefficients
US10770087B2 (en) 2014-05-16 2020-09-08 Qualcomm Incorporated Selecting codebooks for coding vectors decomposed from higher-order ambisonic audio signals
US9852737B2 (en) 2014-05-16 2017-12-26 Qualcomm Incorporated Coding vectors decomposed from higher-order ambisonics audio signals
US9747910B2 (en) 2014-09-26 2017-08-29 Qualcomm Incorporated Switching between predictive and non-predictive quantization techniques in a higher order ambisonics (HOA) framework
US20160179803A1 (en) * 2014-12-22 2016-06-23 Rovi Guides, Inc. Augmenting metadata using commonly available visual elements associated with media content
US10187737B2 (en) 2015-01-16 2019-01-22 Samsung Electronics Co., Ltd. Method for processing sound on basis of image information, and corresponding device
CN105989845B (zh) * 2015-02-25 2020-12-08 杜比实验室特许公司 视频内容协助的音频对象提取
US9609383B1 (en) 2015-03-23 2017-03-28 Amazon Technologies, Inc. Directional audio for virtual environments
US10176644B2 (en) * 2015-06-07 2019-01-08 Apple Inc. Automatic rendering of 3D sound
TWI736542B (zh) * 2015-08-06 2021-08-21 日商新力股份有限公司 資訊處理裝置、資料配訊伺服器及資訊處理方法、以及非暫時性電腦可讀取之記錄媒體
US10762911B2 (en) * 2015-12-01 2020-09-01 Ati Technologies Ulc Audio encoding using video information
GB2545275A (en) * 2015-12-11 2017-06-14 Nokia Technologies Oy Causing provision of virtual reality content
KR20170106063A (ko) * 2016-03-11 2017-09-20 가우디오디오랩 주식회사 오디오 신호 처리 방법 및 장치
US10979843B2 (en) * 2016-04-08 2021-04-13 Qualcomm Incorporated Spatialized audio output based on predicted position data
WO2017205637A1 (en) * 2016-05-25 2017-11-30 Warner Bros. Entertainment Inc. Method and apparatus for generating virtual or augmented reality presentations with 3d audio positioning
WO2017208820A1 (ja) * 2016-05-30 2017-12-07 ソニー株式会社 映像音響処理装置および方法、並びにプログラム
US10074012B2 (en) 2016-06-17 2018-09-11 Dolby Laboratories Licensing Corporation Sound and video object tracking
CN106162447A (zh) * 2016-06-24 2016-11-23 维沃移动通信有限公司 一种音频播放的方法和终端
US10445936B1 (en) 2016-08-01 2019-10-15 Snap Inc. Audio responsive augmented reality
EP3324406A1 (en) * 2016-11-17 2018-05-23 Fraunhofer Gesellschaft zur Förderung der Angewand Apparatus and method for decomposing an audio signal using a variable threshold
EP3324407A1 (en) 2016-11-17 2018-05-23 Fraunhofer Gesellschaft zur Förderung der Angewand Apparatus and method for decomposing an audio signal using a ratio as a separation characteristic
GB2557241A (en) * 2016-12-01 2018-06-20 Nokia Technologies Oy Audio processing
EP3343483A1 (en) 2016-12-30 2018-07-04 Spotify AB System and method for providing a video with lyrics overlay for use in a social messaging environment
EP3343957B1 (en) 2016-12-30 2022-07-06 Nokia Technologies Oy Multimedia content
EP3343347A1 (en) 2016-12-30 2018-07-04 Nokia Technologies Oy Audio processing
US10659906B2 (en) 2017-01-13 2020-05-19 Qualcomm Incorporated Audio parallax for virtual reality, augmented reality, and mixed reality
CN108632551A (zh) * 2017-03-16 2018-10-09 南昌黑鲨科技有限公司 基于深度学习的视频录摄方法、装置及终端
RU2763391C2 (ru) * 2017-04-13 2021-12-28 Сони Корпорейшн Устройство, способ и постоянный считываемый компьютером носитель для обработки сигналов
KR102759041B1 (ko) * 2017-04-26 2025-01-24 소니그룹주식회사 신호 처리 장치 및 방법, 및 프로그램
EP3399398B1 (en) * 2017-05-02 2022-04-13 Nokia Technologies Oy An apparatus and associated methods for presentation of spatial audio
US20180367935A1 (en) * 2017-06-15 2018-12-20 Htc Corporation Audio signal processing method, audio positional system and non-transitory computer-readable medium
US10178490B1 (en) * 2017-06-30 2019-01-08 Apple Inc. Intelligent audio rendering for video recording
US11164606B2 (en) * 2017-06-30 2021-11-02 Qualcomm Incorporated Audio-driven viewport selection
US10224074B2 (en) * 2017-07-12 2019-03-05 Karl Storz Imaging, Inc. Apparatus and methods for improving video quality from a digital video signal including replicated image frames
US11128977B2 (en) 2017-09-29 2021-09-21 Apple Inc. Spatial audio downmixing
WO2019067469A1 (en) * 2017-09-29 2019-04-04 Zermatt Technologies Llc FILE FORMAT FOR SPACE
US10469968B2 (en) 2017-10-12 2019-11-05 Qualcomm Incorporated Rendering for computer-mediated reality systems
US10714144B2 (en) 2017-11-06 2020-07-14 International Business Machines Corporation Corroborating video data with audio data from video content to create section tagging
JP7252965B2 (ja) * 2018-02-15 2023-04-05 マジック リープ, インコーポレイテッド 複合現実のための二重聴取者位置
US11003676B2 (en) * 2018-02-27 2021-05-11 Sap Se Software integration object linking data structures
US11145123B1 (en) 2018-04-27 2021-10-12 Splunk Inc. Generating extended reality overlays in an industrial environment
US11847773B1 (en) 2018-04-27 2023-12-19 Splunk Inc. Geofence-based object identification in an extended reality environment
US11450071B2 (en) * 2018-05-23 2022-09-20 Koninklijke Kpn N.V. Adapting acoustic rendering to image-based object
US11715302B2 (en) * 2018-08-21 2023-08-01 Streem, Llc Automatic tagging of images using speech recognition
US11012774B2 (en) 2018-10-29 2021-05-18 Apple Inc. Spatially biased sound pickup for binaural video recording
GB201818959D0 (en) 2018-11-21 2019-01-09 Nokia Technologies Oy Ambience audio representation and associated rendering
US11115769B2 (en) 2018-11-26 2021-09-07 Raytheon Bbn Technologies Corp. Systems and methods for providing a user with enhanced attitude awareness
KR102737006B1 (ko) * 2019-03-08 2024-12-02 엘지전자 주식회사 음향 객체 추종을 위한 방법 및 이를 위한 장치
CN111757240B (zh) * 2019-03-26 2021-08-20 瑞昱半导体股份有限公司 音频处理方法与音频处理系统
CN111757239B (zh) * 2019-03-28 2021-11-19 瑞昱半导体股份有限公司 音频处理方法与音频处理系统
US11030479B2 (en) * 2019-04-30 2021-06-08 Sony Interactive Entertainment Inc. Mapping visual tags to sound tags using text similarity
CN113950845B (zh) 2019-05-31 2023-08-04 Dts公司 凹式音频渲染
CN110381336B (zh) * 2019-07-24 2021-07-16 广州飞达音响股份有限公司 基于5.1声道的视频片段情感判定方法、装置和计算机设备
US11276419B2 (en) 2019-07-30 2022-03-15 International Business Machines Corporation Synchronized sound generation from videos
US11356796B2 (en) 2019-11-22 2022-06-07 Qualcomm Incorporated Priority-based soundfield coding for virtual reality audio
JP7182751B6 (ja) 2019-12-02 2022-12-20 ドルビー ラボラトリーズ ライセンシング コーポレイション チャネルベースオーディオからオブジェクトベースオーディオへの変換のためのシステム、方法、及び機器
KR102712458B1 (ko) 2019-12-09 2024-10-04 삼성전자주식회사 오디오 출력 장치 및 오디오 출력 장치의 제어 방법
US11823698B2 (en) * 2020-01-17 2023-11-21 Audiotelligence Limited Audio cropping
US11704087B2 (en) * 2020-02-03 2023-07-18 Google Llc Video-informed spatial audio expansion
US11694084B2 (en) 2020-04-14 2023-07-04 Sony Interactive Entertainment Inc. Self-supervised AI-assisted sound effect recommendation for silent video
US11755275B2 (en) * 2020-06-29 2023-09-12 Meta Platforms Technologies, Llc Generating augmented reality experiences utilizing physical objects to represent analogous virtual objects
CN111863002A (zh) * 2020-07-06 2020-10-30 Oppo广东移动通信有限公司 处理方法、处理装置、电子设备
CN111787464B (zh) * 2020-07-31 2022-06-14 Oppo广东移动通信有限公司 一种信息处理方法、装置、电子设备和存储介质
US11546692B1 (en) * 2020-08-19 2023-01-03 Apple Inc. Audio renderer based on audiovisual information
US11521623B2 (en) 2021-01-11 2022-12-06 Bank Of America Corporation System and method for single-speaker identification in a multi-speaker environment on a low-frequency audio recording
US12192738B2 (en) 2021-04-23 2025-01-07 Samsung Electronics Co., Ltd. Electronic apparatus for audio signal processing and operating method thereof
KR102437760B1 (ko) * 2021-05-27 2022-08-29 이충열 컴퓨팅 장치에 의한 음향의 처리 방법, 영상 및 음향의 처리 방법 및 이를 이용한 시스템들
CN113316078B (zh) * 2021-07-30 2021-10-29 腾讯科技(深圳)有限公司 数据处理方法、装置、计算机设备及存储介质
US12039793B2 (en) 2021-11-10 2024-07-16 Meta Platforms Technologies, Llc Automatic artificial reality world creation
TW202324172A (zh) 2021-11-10 2023-06-16 美商元平台技術有限公司 自動建立人工實境世界
CN114842877A (zh) * 2022-03-21 2022-08-02 南京惠积信息科技有限公司 基于人员隐私保护的视频水声检测方法及装置
US12425797B2 (en) * 2022-08-10 2025-09-23 Samsung Electronics Co., Ltd. Three-dimensional (3D) sound rendering with multi-channel audio based on mono audio input
CN119769109A (zh) * 2022-08-24 2025-04-04 杜比实验室特许公司 渲染用多个设备捕获的音频
CN119856498A (zh) * 2022-09-13 2025-04-18 杜比实验室特许公司 用于在捕获时进行对象渲染的视听分析

Family Cites Families (17)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6829018B2 (en) * 2001-09-17 2004-12-07 Koninklijke Philips Electronics N.V. Three-dimensional sound creation assisted by visual information
DK2215858T4 (da) * 2007-11-14 2020-09-28 Sonova Ag Metode og anordning til justering af et høresystem
US20100098258A1 (en) 2008-10-22 2010-04-22 Karl Ola Thorn System and method for generating multichannel audio with a portable electronic device
US8403105B2 (en) 2008-12-16 2013-03-26 Koninklijke Philips Electronics N.V. Estimating a sound source location using particle filtering
WO2010140254A1 (ja) 2009-06-05 2010-12-09 パイオニア株式会社 映像音声出力装置及び音声定位方法
US8914137B2 (en) * 2009-06-19 2014-12-16 Dolby Laboratories Licensing Corporation Upgradeable engine framework for audio and video
WO2011011737A1 (en) 2009-07-24 2011-01-27 Digimarc Corporation Improved audio/video methods and systems
US8963987B2 (en) * 2010-05-27 2015-02-24 Microsoft Corporation Non-linguistic signal detection and feedback
US8755432B2 (en) * 2010-06-30 2014-06-17 Warner Bros. Entertainment Inc. Method and apparatus for generating 3D audio positioning using dynamically optimized audio 3D space perception cues
US8638951B2 (en) 2010-07-15 2014-01-28 Motorola Mobility Llc Electronic apparatus for generating modified wideband audio signals based on two or more wideband microphone signals
US8433076B2 (en) 2010-07-26 2013-04-30 Motorola Mobility Llc Electronic apparatus for generating beamformed audio signals with steerable nulls
US9552840B2 (en) 2010-10-25 2017-01-24 Qualcomm Incorporated Three-dimensional sound capturing and reproducing with multi-microphones
US8855341B2 (en) * 2010-10-25 2014-10-07 Qualcomm Incorporated Systems, methods, apparatus, and computer-readable media for head tracking based on recorded sound signals
US9031256B2 (en) 2010-10-25 2015-05-12 Qualcomm Incorporated Systems, methods, apparatus, and computer-readable media for orientation-sensitive recording control
EP2638694A4 (en) 2010-11-12 2017-05-03 Nokia Technologies Oy An Audio Processing Apparatus
FR2974097B1 (fr) 2011-04-14 2013-04-19 Michelin Soc Tech Composition de caoutchouc comprenant un derive de la thiazoline
US20130162752A1 (en) * 2011-12-22 2013-06-27 Advanced Micro Devices, Inc. Audio and Video Teleconferencing Using Voiceprints and Face Prints

Similar Documents

Publication Publication Date Title
JP2016513410A5 (enExample)
WO2017009851A3 (en) Coordinating communication and/or storage based on image analysis
MX2017012505A (es) Configuracion de diferentes sensibilidades de modelos de fondo mediante regiones definidas por el usuario y filtros de fondo.
WO2016025623A3 (en) Image linking and sharing
JP2017505475A5 (enExample)
WO2014155130A3 (en) Method, system and computer program for comparing images
JP2016506669A5 (enExample)
MY192140A (en) Information processing method, terminal, and computer storage medium
MX373029B (es) Metodo para la reconstruccion en 3d de un ambiente de un dispositivo movil, que corresponde a un producto de programa de computadora y dispositivo.
EP3009959A3 (en) Identifying content of interest
WO2016174524A3 (en) Data processing systems
WO2016106383A3 (en) First-person camera based visual context aware system
JP2019504379A (ja) 煙検出装置、方法及び画像処理装置
WO2016050347A3 (en) Audio identification device, audio identification method and audio identification system
JP2015508205A5 (enExample)
US11782572B2 (en) Prioritization for presentation of media based on sensor data collected by wearable sensor devices
JP2016536715A5 (enExample)
JP2013161405A5 (ja) 被写体判定装置、被写体判定方法及びプログラム
JP2016164748A5 (enExample)
EP2809062A3 (en) Image processor, image processing method and program, and recording medium
US9508386B2 (en) Method and apparatus for synchronizing audio and video signals
GB2571686A (en) System and method for analyzing and associating elements of a computer system by shared characteristics
RU2017105533A (ru) Обнаружение вредоносного программного обеспечения с перекрестным обзором
JP2016517062A5 (enExample)
EP2811456A3 (en) Filtering method and device in image processing