JP6518254B2 - オーディオ・コンテンツの空間的誤差メトリック - Google Patents
オーディオ・コンテンツの空間的誤差メトリック Download PDFInfo
- Publication number
- JP6518254B2 JP6518254B2 JP2016544661A JP2016544661A JP6518254B2 JP 6518254 B2 JP6518254 B2 JP 6518254B2 JP 2016544661 A JP2016544661 A JP 2016544661A JP 2016544661 A JP2016544661 A JP 2016544661A JP 6518254 B2 JP6518254 B2 JP 6518254B2
- Authority
- JP
- Japan
- Prior art keywords
- audio
- output
- clusters
- spatial
- objects
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04S—STEREOPHONIC SYSTEMS
- H04S7/00—Indicating arrangements; Control arrangements, e.g. balance control
- H04S7/30—Control circuits for electronic adaptation of the sound field
-
- F—MECHANICAL ENGINEERING; LIGHTING; HEATING; WEAPONS; BLASTING
- F24—HEATING; RANGES; VENTILATING
- F24C—DOMESTIC STOVES OR RANGES ; DETAILS OF DOMESTIC STOVES OR RANGES, OF GENERAL APPLICATION
- F24C15/00—Details
- F24C15/20—Removing cooking fumes
- F24C15/2028—Removing cooking fumes using an air curtain
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/008—Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/48—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04R—LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
- H04R29/00—Monitoring arrangements; Testing arrangements
- H04R29/008—Visual indication of individual signal levels
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04S—STEREOPHONIC SYSTEMS
- H04S3/00—Systems employing more than two channels, e.g. quadraphonic
- H04S3/008—Systems employing more than two channels, e.g. quadraphonic in which the audio signals are in digital form, i.e. employing more than two discrete digital channels
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04S—STEREOPHONIC SYSTEMS
- H04S2400/00—Details of stereophonic systems covered by H04S but not provided for in its groups
- H04S2400/13—Aspects of volume control, not necessarily automatic, in stereophonic sound systems
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Signal Processing (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Computational Linguistics (AREA)
- Human Computer Interaction (AREA)
- Chemical & Material Sciences (AREA)
- General Engineering & Computer Science (AREA)
- Mechanical Engineering (AREA)
- Combustion & Propulsion (AREA)
- Mathematical Physics (AREA)
- General Health & Medical Sciences (AREA)
- Otolaryngology (AREA)
- Stereophonic System (AREA)
- Two-Way Televisions, Distribution Of Moving Picture Or The Like (AREA)
Applications Claiming Priority (5)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| ESP201430016 | 2014-01-09 | ||
| ES201430016 | 2014-01-09 | ||
| US201461951048P | 2014-03-11 | 2014-03-11 | |
| US61/951,048 | 2014-03-11 | ||
| PCT/US2015/010126 WO2015105748A1 (en) | 2014-01-09 | 2015-01-05 | Spatial error metrics of audio content |
Publications (3)
| Publication Number | Publication Date |
|---|---|
| JP2017508175A JP2017508175A (ja) | 2017-03-23 |
| JP2017508175A5 JP2017508175A5 (enExample) | 2018-02-15 |
| JP6518254B2 true JP6518254B2 (ja) | 2019-05-22 |
Family
ID=52469071
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| JP2016544661A Active JP6518254B2 (ja) | 2014-01-09 | 2015-01-05 | オーディオ・コンテンツの空間的誤差メトリック |
Country Status (5)
| Country | Link |
|---|---|
| US (1) | US10492014B2 (enExample) |
| EP (1) | EP3092642B1 (enExample) |
| JP (1) | JP6518254B2 (enExample) |
| CN (1) | CN105900169B (enExample) |
| WO (1) | WO2015105748A1 (enExample) |
Families Citing this family (31)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN105432098B (zh) | 2013-07-30 | 2017-08-29 | 杜比国际公司 | 针对任意扬声器布局的音频对象的平移 |
| CN105336335B (zh) | 2014-07-25 | 2020-12-08 | 杜比实验室特许公司 | 利用子带对象概率估计的音频对象提取 |
| CN112802496B (zh) | 2014-12-11 | 2025-01-24 | 杜比实验室特许公司 | 元数据保留的音频对象聚类 |
| EP4576074A3 (en) | 2015-06-17 | 2025-08-27 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Loudness control for user interactivity in audio coding systems |
| CN106385660B (zh) * | 2015-08-07 | 2020-10-16 | 杜比实验室特许公司 | 处理基于对象的音频信号 |
| EP3332557B1 (en) | 2015-08-07 | 2019-06-19 | Dolby Laboratories Licensing Corporation | Processing object-based audio signals |
| US10278000B2 (en) | 2015-12-14 | 2019-04-30 | Dolby Laboratories Licensing Corporation | Audio object clustering with single channel quality preservation |
| US9949052B2 (en) | 2016-03-22 | 2018-04-17 | Dolby Laboratories Licensing Corporation | Adaptive panner of audio objects |
| EP3488623B1 (en) * | 2016-07-20 | 2020-12-02 | Dolby Laboratories Licensing Corporation | Audio object clustering based on renderer-aware perceptual difference |
| WO2018017394A1 (en) * | 2016-07-20 | 2018-01-25 | Dolby Laboratories Licensing Corporation | Audio object clustering based on renderer-aware perceptual difference |
| US11715459B2 (en) | 2016-08-24 | 2023-08-01 | Gridspace Inc. | Alert generator for adaptive closed loop communication system |
| US11601552B2 (en) | 2016-08-24 | 2023-03-07 | Gridspace Inc. | Hierarchical interface for adaptive closed loop communication system |
| US11721356B2 (en) | 2016-08-24 | 2023-08-08 | Gridspace Inc. | Adaptive closed loop communication system |
| US12132866B2 (en) | 2016-08-24 | 2024-10-29 | Gridspace Inc. | Configurable dynamic call routing and matching system |
| US10861436B1 (en) * | 2016-08-24 | 2020-12-08 | Gridspace Inc. | Audio call classification and survey system |
| CN110537373B (zh) * | 2017-04-25 | 2021-09-28 | 索尼公司 | 信号处理装置和方法以及存储介质 |
| CN118248153A (zh) * | 2017-04-26 | 2024-06-25 | 索尼公司 | 信号处理设备和方法及程序 |
| JP7224302B2 (ja) * | 2017-05-09 | 2023-02-17 | ドルビー ラボラトリーズ ライセンシング コーポレイション | マルチチャネル空間的オーディオ・フォーマット入力信号の処理 |
| WO2019067620A1 (en) * | 2017-09-29 | 2019-04-04 | Zermatt Technologies Llc | SPEECH REDUCTION AUDIO MIXING |
| US10628486B2 (en) * | 2017-11-15 | 2020-04-21 | Google Llc | Partitioning videos |
| WO2019106221A1 (en) * | 2017-11-28 | 2019-06-06 | Nokia Technologies Oy | Processing of spatial audio parameters |
| CN108984628B (zh) * | 2018-06-20 | 2020-01-24 | 北京达佳互联信息技术有限公司 | 内容描述生成模型的损失值获取方法及装置 |
| BR112021008089A2 (pt) * | 2018-11-02 | 2021-08-03 | Dolby International Ab | codificador de áudio e decodificador de áudio |
| US12400666B2 (en) | 2019-03-29 | 2025-08-26 | Telefonaktiebolaget Lm Ericsson (Publ) | Method and apparatus for low cost error recovery in predictive coding |
| KR20240152948A (ko) | 2019-03-29 | 2024-10-22 | 텔레폰악티에볼라겟엘엠에릭슨(펍) | 멀티 채널 오디오 프레임에서 예측적인 코딩에서 에러 복구를 위한 방법 및 장치 |
| CN110493649B (zh) * | 2019-09-12 | 2021-08-20 | 重庆市群众艺术馆 | 基于群众满意度的文化馆数字资源加工方法 |
| US20230010466A1 (en) | 2019-12-09 | 2023-01-12 | Dolby Laboratories Licensing Corporation | Adjusting audio and non-audio features based on noise metrics and speech intelligibility metrics |
| CN113096671B (zh) * | 2020-01-09 | 2022-05-13 | 齐鲁工业大学 | 一种大容量音频文件可逆信息隐藏方法及系统 |
| US11704087B2 (en) * | 2020-02-03 | 2023-07-18 | Google Llc | Video-informed spatial audio expansion |
| WO2025199350A1 (en) * | 2024-03-22 | 2025-09-25 | Dolby Laboratories Licensing Corporation | Low-latency gain interpolation for audio object clustering |
| WO2026006172A1 (en) * | 2024-06-25 | 2026-01-02 | Dolby Laboratories Licensing Corporation | Audio object clustering system |
Family Cites Families (24)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US7617099B2 (en) * | 2001-02-12 | 2009-11-10 | FortMedia Inc. | Noise suppression by two-channel tandem spectrum modification for speech signal in an automobile |
| US20040162721A1 (en) * | 2001-06-08 | 2004-08-19 | Oomen Arnoldus Werner Johannes | Editing of audio signals |
| KR100479478B1 (ko) | 2002-07-26 | 2005-03-31 | 연세대학교 산학협력단 | 객체별 중요도를 고려한 객체 기반의 트랜스코딩 방법 및그 장치 |
| FR2862799B1 (fr) * | 2003-11-26 | 2006-02-24 | Inst Nat Rech Inf Automat | Dispositif et methode perfectionnes de spatialisation du son |
| US8363865B1 (en) | 2004-05-24 | 2013-01-29 | Heather Bottum | Multiple channel sound system using multi-speaker arrays |
| CN101485202B (zh) * | 2005-05-11 | 2013-10-30 | 高通股份有限公司 | 一种用于统一的错误隐匿框架的方法及设备 |
| US8509313B2 (en) | 2006-10-10 | 2013-08-13 | Texas Instruments Incorporated | Video error concealment |
| ATE536612T1 (de) | 2006-10-16 | 2011-12-15 | Dolby Int Ab | Verbesserte kodierungs- und parameterdarstellung von mehrkanaliger abwärtsgemischter objektkodierung |
| BRPI0711094A2 (pt) | 2006-11-24 | 2011-08-23 | Lg Eletronics Inc | método para codificação e decodificação de sinal de áudio baseado em objeto e aparelho deste |
| JP2010515392A (ja) | 2007-01-04 | 2010-05-06 | ブリティッシュ・テレコミュニケーションズ・パブリック・リミテッド・カンパニー | 映像信号符号化 |
| BRPI0802613A2 (pt) | 2007-02-14 | 2011-08-30 | Lg Electronics Inc | métodos e aparelhos para codificação e decodificação de sinais de áudio baseados em objeto |
| US7945119B2 (en) | 2007-06-26 | 2011-05-17 | Microsoft Corporation | Optimizing character rendering |
| US8295494B2 (en) | 2007-08-13 | 2012-10-23 | Lg Electronics Inc. | Enhancing audio with remixing capability |
| KR101303441B1 (ko) | 2007-10-17 | 2013-09-10 | 프라운호퍼 게젤샤프트 쭈르 푀르데룽 데어 안겐반텐 포르슝 에. 베. | 다운믹스를 이용한 오디오 코딩 |
| GB2459012A (en) * | 2008-03-20 | 2009-10-14 | Univ Surrey | Predicting the perceived spatial quality of sound processing and reproducing equipment |
| MX2011011399A (es) * | 2008-10-17 | 2012-06-27 | Univ Friedrich Alexander Er | Aparato para suministrar uno o más parámetros ajustados para un suministro de una representación de señal de mezcla ascendente sobre la base de una representación de señal de mezcla descendete, decodificador de señal de audio, transcodificador de señal de audio, codificador de señal de audio, flujo de bits de audio, método y programa de computación que utiliza información paramétrica relacionada con el objeto. |
| US8189799B2 (en) | 2009-04-09 | 2012-05-29 | Harman International Industries, Incorporated | System for active noise control based on audio system output |
| CN101547000B (zh) * | 2009-05-08 | 2011-05-04 | 炬力集成电路设计有限公司 | 一种信号转换电路、数模转换装置和音频输出设备 |
| CN101582262B (zh) * | 2009-06-16 | 2011-12-28 | 武汉大学 | 一种空间音频参数帧间预测编解码方法 |
| JP5604933B2 (ja) * | 2010-03-30 | 2014-10-15 | 富士通株式会社 | ダウンミクス装置およびダウンミクス方法 |
| WO2013006325A1 (en) | 2011-07-01 | 2013-01-10 | Dolby Laboratories Licensing Corporation | Upmixing object based audio |
| US9479886B2 (en) * | 2012-07-20 | 2016-10-25 | Qualcomm Incorporated | Scalable downmix design with feedback for object-based surround codec |
| EP2883366B8 (en) | 2012-08-07 | 2016-12-14 | Dolby Laboratories Licensing Corporation | Encoding and rendering of object based audio indicative of game audio content |
| JP6012884B2 (ja) | 2012-12-21 | 2016-10-25 | ドルビー ラボラトリーズ ライセンシング コーポレイション | 知覚的基準に基づいてオブジェクト・ベースのオーディオ・コンテンツをレンダリングするためのオブジェクト・クラスタリング |
-
2015
- 2015-01-05 WO PCT/US2015/010126 patent/WO2015105748A1/en not_active Ceased
- 2015-01-05 JP JP2016544661A patent/JP6518254B2/ja active Active
- 2015-01-05 CN CN201580004002.0A patent/CN105900169B/zh active Active
- 2015-01-05 EP EP15700522.4A patent/EP3092642B1/en active Active
- 2015-01-05 US US15/110,371 patent/US10492014B2/en active Active
Also Published As
| Publication number | Publication date |
|---|---|
| US20160337776A1 (en) | 2016-11-17 |
| EP3092642A1 (en) | 2016-11-16 |
| US10492014B2 (en) | 2019-11-26 |
| JP2017508175A (ja) | 2017-03-23 |
| WO2015105748A1 (en) | 2015-07-16 |
| CN105900169B (zh) | 2020-01-03 |
| CN105900169A (zh) | 2016-08-24 |
| EP3092642B1 (en) | 2018-05-16 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| JP6518254B2 (ja) | オーディオ・コンテンツの空間的誤差メトリック | |
| Cuevas-Rodríguez et al. | 3D Tune-In Toolkit: An open-source library for real-time binaural spatialisation | |
| US11190898B2 (en) | Rendering scene-aware audio using neural network-based acoustic analysis | |
| CN103403800B (zh) | 确定多声道音频信号的声道间时间差 | |
| US9479886B2 (en) | Scalable downmix design with feedback for object-based surround codec | |
| US10580424B2 (en) | Perceptual audio coding as sequential decision-making problems | |
| US9761229B2 (en) | Systems, methods, apparatus, and computer-readable media for audio object clustering | |
| US11138989B2 (en) | Sound quality prediction and interface to facilitate high-quality voice recordings | |
| KR102132500B1 (ko) | 조화성 기반 단일 채널 음성 품질 추정 기법 | |
| TW201801067A (zh) | 用以估計通道間時間差的裝置及方法 | |
| MX2013013261A (es) | Asignacion de bits, codificacion y decodificacion de audio. | |
| CN105874533A (zh) | 音频对象提取 | |
| US11269589B2 (en) | Inter-channel audio feature measurement and usages | |
| US20200402519A1 (en) | Coding scaled spatial components | |
| CN108780648A (zh) | 用于在时间上失配的信号的音频处理 | |
| US10734006B2 (en) | Audio coding based on audio pattern recognition | |
| JP6442037B2 (ja) | 室内インパルス応答の少なくとも第1のペアに基づいて総ミキシング時間を推定する装置および方法、ならびに対応するコンピュータプログラム | |
| JP2019204097A (ja) | 音声符号化方法および関連装置 | |
| KR20170035781A (ko) | 사운드를 합성하는 방법 및 디바이스 | |
| JP6235725B2 (ja) | マルチ・チャンネル・オーディオ信号分類器 | |
| JP2025540764A (ja) | パラメトリック空間オーディオ符号化 | |
| Lu et al. | An MELP Vocoder Based on UVS and MVF |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| A521 | Request for written amendment filed |
Free format text: JAPANESE INTERMEDIATE CODE: A523 Effective date: 20171226 |
|
| A621 | Written request for application examination |
Free format text: JAPANESE INTERMEDIATE CODE: A621 Effective date: 20171226 |
|
| A977 | Report on retrieval |
Free format text: JAPANESE INTERMEDIATE CODE: A971007 Effective date: 20181126 |
|
| A131 | Notification of reasons for refusal |
Free format text: JAPANESE INTERMEDIATE CODE: A131 Effective date: 20181218 |
|
| A521 | Request for written amendment filed |
Free format text: JAPANESE INTERMEDIATE CODE: A523 Effective date: 20190311 |
|
| TRDD | Decision of grant or rejection written | ||
| A01 | Written decision to grant a patent or to grant a registration (utility model) |
Free format text: JAPANESE INTERMEDIATE CODE: A01 Effective date: 20190326 |
|
| A61 | First payment of annual fees (during grant procedure) |
Free format text: JAPANESE INTERMEDIATE CODE: A61 Effective date: 20190419 |
|
| R150 | Certificate of patent or registration of utility model |
Ref document number: 6518254 Country of ref document: JP Free format text: JAPANESE INTERMEDIATE CODE: R150 |
|
| R250 | Receipt of annual fees |
Free format text: JAPANESE INTERMEDIATE CODE: R250 |
|
| R250 | Receipt of annual fees |
Free format text: JAPANESE INTERMEDIATE CODE: R250 |
|
| R250 | Receipt of annual fees |
Free format text: JAPANESE INTERMEDIATE CODE: R250 |