JP2016513410A5 - Google Patents

Publication number
JP2016513410A5
Authority
JP
Japan
Prior art keywords
audio
video
objects
data
metadata
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
JP2015558105A
Other languages
English (en)
Japanese (ja)
Other versions
JP6039111B2 (ja)
JP2016513410A (ja)
Filing date
Publication date
Priority claimed from US13/831,018 (US9338420B2)
Application filed
Publication of JP2016513410A
Publication of JP2016513410A5
Application granted
Publication of JP6039111B2
Expired - Fee Related (current legal status)
Anticipated expiration

JP2015558105A 2013-02-15 2014-02-12 Video analysis assisted generation of multi-channel audio data Expired - Fee Related JP6039111B2 (ja)

Applications Claiming Priority (5)

Application Number Priority Date Filing Date Title
US201361765556P 2013-02-15 2013-02-15
US61/765,556 2013-02-15
US13/831,018 2013-03-14
US13/831,018 US9338420B2 (en) 2013-02-15 2013-03-14 Video analysis assisted generation of multi-channel audio data
PCT/US2014/016059 WO2014127019A1 (en) 2013-02-15 2014-02-12 Video analysis assisted generation of multi-channel audio data

Publications (3)

Publication Number Publication Date
JP2016513410A 2016-05-12
JP2016513410A5 2016-08-12
JP6039111B2 2016-12-07

Family

ID=51351238

Family Applications (1)

Application Number Title Priority Date Filing Date
JP2015558105A Expired - Fee Related JP6039111B2 (ja) 2013-02-15 2014-02-12 Video analysis assisted generation of multi-channel audio data

Country Status (6)

Country Link
US (1) US9338420B2
EP (1) EP2956941A1
JP (1) JP6039111B2
KR (1) KR101761039B1
CN (1) CN104995681B
WO (1) WO2014127019A1

Families Citing this family (82)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2011115424A2 (ko) * 2010-03-16 2011-09-22 삼성전자 주식회사 Content output system and method for sharing codec information in the same system
US10326978B2 (en) 2010-06-30 2019-06-18 Warner Bros. Entertainment Inc. Method and apparatus for generating virtual or augmented reality presentations with 3D audio positioning
EP2926571B1 (en) * 2013-03-28 2017-10-18 Dolby Laboratories Licensing Corporation Rendering of audio objects with apparent size to arbitrary loudspeaker layouts
US9466305B2 (en) * 2013-05-29 2016-10-11 Qualcomm Incorporated Performing positional analysis to code spherical harmonic coefficients
US20140358565A1 (en) 2013-05-29 2014-12-04 Qualcomm Incorporated Compression of decomposed representations of a sound field
CN119049486A (zh) * 2013-07-31 2024-11-29 杜比实验室特许公司 Method and apparatus, medium, and device for processing audio data
US9137232B2 (en) * 2014-01-14 2015-09-15 Xerox Corporation Method and system for controlling access to document data using augmented reality marker
US9489955B2 (en) 2014-01-30 2016-11-08 Qualcomm Incorporated Indicating frame parameter reusability for coding vectors
US9922656B2 (en) 2014-01-30 2018-03-20 Qualcomm Incorporated Transitioning of ambient higher-order ambisonic coefficients
US9620137B2 (en) 2014-05-16 2017-04-11 Qualcomm Incorporated Determining between scalar and vector quantization in higher order ambisonic coefficients
US10770087B2 (en) 2014-05-16 2020-09-08 Qualcomm Incorporated Selecting codebooks for coding vectors decomposed from higher-order ambisonic audio signals
US9852737B2 (en) 2014-05-16 2017-12-26 Qualcomm Incorporated Coding vectors decomposed from higher-order ambisonics audio signals
US9747910B2 (en) 2014-09-26 2017-08-29 Qualcomm Incorporated Switching between predictive and non-predictive quantization techniques in a higher order ambisonics (HOA) framework
US20160179803A1 (en) * 2014-12-22 2016-06-23 Rovi Guides, Inc. Augmenting metadata using commonly available visual elements associated with media content
CN107409264B (zh) 2015-01-16 2021-02-05 三星电子株式会社 Method for processing sound based on image information, and corresponding device
CN105989845B (zh) * 2015-02-25 2020-12-08 杜比实验室特许公司 Video content assisted audio object extraction
US9609383B1 (en) * 2015-03-23 2017-03-28 Amazon Technologies, Inc. Directional audio for virtual environments
US10176644B2 (en) * 2015-06-07 2019-01-08 Apple Inc. Automatic rendering of 3D sound
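TWI736542B (zh) * 2015-08-06 2021-08-21 日商新力股份有限公司 Information processing device, data distribution server, information processing method, and non-transitory computer-readable recording medium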
US10762911B2 (en) 2015-12-01 2020-09-01 Ati Technologies Ulc Audio encoding using video information
GB2545275A (en) * 2015-12-11 2017-06-14 Nokia Technologies Oy Causing provision of virtual reality content
KR20170106063A (ko) * 2016-03-11 2017-09-20 가우디오디오랩 주식회사 Method and apparatus for processing audio signal
US10979843B2 (en) * 2016-04-08 2021-04-13 Qualcomm Incorporated Spatialized audio output based on predicted position data
KR20190013900A (ko) * 2016-05-25 2019-02-11 워너 브로스. 엔터테인먼트 인크. Method and apparatus for generating virtual or augmented reality presentations with 3D audio positioning
CN117612539A (zh) * 2016-05-30 2024-02-27 索尼公司 Video and audio processing device, video and audio processing method, and storage medium
US10074012B2 (en) 2016-06-17 2018-09-11 Dolby Laboratories Licensing Corporation Sound and video object tracking
CN106162447A (zh) * 2016-06-24 2016-11-23 维沃移动通信有限公司 Audio playback method and terminal
US10445936B1 (en) * 2016-08-01 2019-10-15 Snap Inc. Audio responsive augmented reality
EP3324407A1 (en) 2016-11-17 2018-05-23 Fraunhofer Gesellschaft zur Förderung der Angewand Apparatus and method for decomposing an audio signal using a ratio as a separation characteristic
EP3324406A1 (en) 2016-11-17 2018-05-23 Fraunhofer Gesellschaft zur Förderung der Angewand Apparatus and method for decomposing an audio signal using a variable threshold
GB2557241A (en) * 2016-12-01 2018-06-20 Nokia Technologies Oy Audio processing
EP3343957B1 (en) * 2016-12-30 2022-07-06 Nokia Technologies Oy Multimedia content
EP3343483A1 (en) 2016-12-30 2018-07-04 Spotify AB System and method for providing a video with lyrics overlay for use in a social messaging environment
EP3343347A1 (en) * 2016-12-30 2018-07-04 Nokia Technologies Oy Audio processing
US10659906B2 (en) 2017-01-13 2020-05-19 Qualcomm Incorporated Audio parallax for virtual reality, augmented reality, and mixed reality
CN108632551A (zh) * 2017-03-16 2018-10-09 南昌黑鲨科技有限公司 Deep learning based video recording method, apparatus, and terminal
EP3624116B1 (en) * 2017-04-13 2022-05-04 Sony Group Corporation Signal processing device, method, and program
KR20240042125A (ko) * 2017-04-26 2024-04-01 소니그룹주식회사 Signal processing device and method, and program
EP3399398B1 (en) * 2017-05-02 2022-04-13 Nokia Technologies Oy An apparatus and associated methods for presentation of spatial audio
CN109151704B (zh) * 2017-06-15 2020-05-19 宏达国际电子股份有限公司 Audio processing method, audio positioning system, and non-transitory computer-readable medium
US11164606B2 (en) * 2017-06-30 2021-11-02 Qualcomm Incorporated Audio-driven viewport selection
US10178490B1 (en) * 2017-06-30 2019-01-08 Apple Inc. Intelligent audio rendering for video recording
US10224074B2 (en) * 2017-07-12 2019-03-05 Karl Storz Imaging, Inc. Apparatus and methods for improving video quality from a digital video signal including replicated image frames
CN111052770B (zh) * 2017-09-29 2021-12-03 苹果公司 Method and system for spatial audio downmixing
WO2019067469A1 (en) 2017-09-29 2019-04-04 Zermatt Technologies Llc FILE FORMAT FOR SPACE
US10469968B2 (en) 2017-10-12 2019-11-05 Qualcomm Incorporated Rendering for computer-mediated reality systems
US10714144B2 (en) 2017-11-06 2020-07-14 International Business Machines Corporation Corroborating video data with audio data from video content to create section tagging
JP7252965B2 (ja) * 2018-02-15 2023-04-05 マジック リープ, インコーポレイテッド Dual listener positions for mixed reality
US11003676B2 (en) * 2018-02-27 2021-05-11 Sap Se Software integration object linking data structures
US11145123B1 (en) 2018-04-27 2021-10-12 Splunk Inc. Generating extended reality overlays in an industrial environment
US11847773B1 (en) 2018-04-27 2023-12-19 Splunk Inc. Geofence-based object identification in an extended reality environment
US11450071B2 (en) * 2018-05-23 2022-09-20 Koninklijke Kpn N.V. Adapting acoustic rendering to image-based object
US11715302B2 (en) * 2018-08-21 2023-08-01 Streem, Llc Automatic tagging of images using speech recognition
US11012774B2 (en) 2018-10-29 2021-05-18 Apple Inc. Spatially biased sound pickup for binaural video recording
GB201818959D0 (en) * 2018-11-21 2019-01-09 Nokia Technologies Oy Ambience audio representation and associated rendering
US11115769B2 (en) 2018-11-26 2021-09-07 Raytheon Bbn Technologies Corp. Systems and methods for providing a user with enhanced attitude awareness
KR102758939B1 (ko) 2019-03-08 2025-01-23 엘지전자 주식회사 Method for tracking sound object and apparatus therefor
CN111757240B (zh) * 2019-03-26 2021-08-20 瑞昱半导体股份有限公司 Audio processing method and audio processing system
CN111757239B (zh) * 2019-03-28 2021-11-19 瑞昱半导体股份有限公司 Audio processing method and audio processing system
US11030479B2 (en) * 2019-04-30 2021-06-08 Sony Interactive Entertainment Inc. Mapping visual tags to sound tags using text similarity
JP7285967B2 (ja) * 2019-05-31 2023-06-02 ディーティーエス・インコーポレイテッド Foveated audio rendering
CN110381336B (zh) * 2019-07-24 2021-07-16 广州飞达音响股份有限公司 5.1-channel based method, apparatus, and computer device for determining the emotion of a video clip
US11276419B2 (en) 2019-07-30 2022-03-15 International Business Machines Corporation Synchronized sound generation from videos
US11356796B2 (en) 2019-11-22 2022-06-07 Qualcomm Incorporated Priority-based soundfield coding for virtual reality audio
US12094476B2 (en) 2019-12-02 2024-09-17 Dolby Laboratories Licensing Corporation Systems, methods and apparatus for conversion from channel-based audio to object-based audio
KR102712458B1 (ko) 2019-12-09 2024-10-04 삼성전자주식회사 Audio output device and method for controlling the audio output device
US11823698B2 (en) * 2020-01-17 2023-11-21 Audiotelligence Limited Audio cropping
US11704087B2 (en) * 2020-02-03 2023-07-18 Google Llc Video-informed spatial audio expansion
US11694084B2 (en) 2020-04-14 2023-07-04 Sony Interactive Entertainment Inc. Self-supervised AI-assisted sound effect recommendation for silent video
US11755275B2 (en) * 2020-06-29 2023-09-12 Meta Platforms Technologies, Llc Generating augmented reality experiences utilizing physical objects to represent analogous virtual objects
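CN111863002A (zh) * 2020-07-06 2020-10-30 Oppo广东移动通信有限公司 Processing method, processing apparatus, and electronic device
CN111787464B (zh) * 2020-07-31 2022-06-14 Oppo广东移动通信有限公司 Information processing method and apparatus, electronic device, and storage medium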
US11546692B1 (en) 2020-08-19 2023-01-03 Apple Inc. Audio renderer based on audiovisual information
US11521623B2 (en) 2021-01-11 2022-12-06 Bank Of America Corporation System and method for single-speaker identification in a multi-speaker environment on a low-frequency audio recording
US12192738B2 (en) 2021-04-23 2025-01-07 Samsung Electronics Co., Ltd. Electronic apparatus for audio signal processing and operating method thereof
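KR102437760B1 (ko) * 2021-05-27 2022-08-29 이충열 Method for processing sound by a computing device, method for processing video and sound, and systems using the same
CN113316078B (zh) * 2021-07-30 2021-10-29 腾讯科技(深圳)有限公司 Data processing method and apparatus, computer device, and storage medium
TW202324172A (zh) 2021-11-10 2023-06-16 美商元平台技術有限公司 Automatic artificial reality world creation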
US12039793B2 (en) 2021-11-10 2024-07-16 Meta Platforms Technologies, Llc Automatic artificial reality world creation
CN114842877A (zh) * 2022-03-21 2022-08-02 南京惠积信息科技有限公司 Video and water-sound detection method and apparatus based on personal privacy protection
US12425797B2 (en) * 2022-08-10 2025-09-23 Samsung Electronics Co., Ltd. Three-dimensional (3D) sound rendering with multi-channel audio based on mono audio input
WO2024059536A1 (en) * 2022-09-13 2024-03-21 Dolby Laboratories Licensing Corporation Audio-visual analytic for object rendering in capture

Family Cites Families (17)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6829018B2 (en) * 2001-09-17 2004-12-07 Koninklijke Philips Electronics N.V. Three-dimensional sound creation assisted by visual information
DK2215858T4 (da) * 2007-11-14 2020-09-28 Sonova Ag Method and device for adjusting a hearing system
US20100098258A1 (en) 2008-10-22 2010-04-22 Karl Ola Thorn System and method for generating multichannel audio with a portable electronic device
JP5608678B2 (ja) * 2008-12-16 2014-10-15 コーニンクレッカ フィリップス エヌ ヴェ Estimating a sound source location using particle filtering
WO2010140254A1 (ja) 2009-06-05 2010-12-09 パイオニア株式会社 Video and audio output device and sound localization method
WO2010148227A1 (en) * 2009-06-19 2010-12-23 Dolby Laboratories Licensing Corporation Upgradable engine framework for audio and video
JP2013500544A (ja) 2009-07-24 2013-01-07 ディジマーク コーポレイション Improved audio/video methods and systems
US8963987B2 (en) * 2010-05-27 2015-02-24 Microsoft Corporation Non-linguistic signal detection and feedback
US8755432B2 (en) * 2010-06-30 2014-06-17 Warner Bros. Entertainment Inc. Method and apparatus for generating 3D audio positioning using dynamically optimized audio 3D space perception cues
US8638951B2 (en) 2010-07-15 2014-01-28 Motorola Mobility Llc Electronic apparatus for generating modified wideband audio signals based on two or more wideband microphone signals
US8433076B2 (en) 2010-07-26 2013-04-30 Motorola Mobility Llc Electronic apparatus for generating beamformed audio signals with steerable nulls
US9552840B2 (en) 2010-10-25 2017-01-24 Qualcomm Incorporated Three-dimensional sound capturing and reproducing with multi-microphones
US8855341B2 (en) * 2010-10-25 2014-10-07 Qualcomm Incorporated Systems, methods, apparatus, and computer-readable media for head tracking based on recorded sound signals
US9031256B2 (en) 2010-10-25 2015-05-12 Qualcomm Incorporated Systems, methods, apparatus, and computer-readable media for orientation-sensitive recording control
US11120818B2 (en) 2010-11-12 2021-09-14 Nokia Technologies Oy Processing audio with a visual representation of an audio source
FR2974097B1 (fr) 2011-04-14 2013-04-19 Michelin Soc Tech Rubber composition comprising a thiazoline derivative
US20130162752A1 (en) * 2011-12-22 2013-06-27 Advanced Micro Devices, Inc. Audio and Video Teleconferencing Using Voiceprints and Face Prints

Similar Documents

Publication Publication Date Title
JP2016513410A5
WO2017009851A3 (en) Coordinating communication and/or storage based on image analysis
MX2017012505A (es) Setting different background model sensitivities by user-defined regions and background filters
WO2016025623A3 (en) Image linking and sharing
JP2017505475A5
WO2014155130A3 (en) Method, system and computer program for comparing images
JP2016506669A5
MY192140A (en) Information processing method, terminal, and computer storage medium
EP3009959A3 (en) Identifying content of interest
WO2016174524A3 (en) Data processing systems
WO2016106383A3 (en) First-person camera based visual context aware system
JP2019504379A (ja) Smoke detection device, method, and image processing device
WO2016050347A3 (en) Audio identification device, audio identification method and audio identification system
JP2016536715A5
JP2013161405A5 (ja) Subject determination device, subject determination method, and program
JP2017144521A5
JP2016164748A5
US10678398B2 (en) Prioritization for presentation of media based on sensor data collected by wearable sensor devices
EP2809062A3 (en) Image processor, image processing method and program, and recording medium
US9508386B2 (en) Method and apparatus for synchronizing audio and video signals
CN109478329A (zh) Image processing method and apparatus
GB2571686A (en) System and method for analyzing and associating elements of a computer system by shared characteristics
JP2018530821A5
RU2017105533A (ru) Cross-view malware detection
JP2016517062A5