CN105900169B - 音频内容的空间误差度量 - Google Patents

音频内容的空间误差度量 Download PDF

Info

Publication number
CN105900169B
CN105900169B CN201580004002.0A CN201580004002A CN105900169B CN 105900169 B CN105900169 B CN 105900169B CN 201580004002 A CN201580004002 A CN 201580004002A CN 105900169 B CN105900169 B CN 105900169B
Authority
CN
China
Prior art keywords
audio
output
clusters
objects
spatial error
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201580004002.0A
Other languages
English (en)
Chinese (zh)
Other versions
CN105900169A (zh
Inventor
D·J·布瑞巴特
陈联武
芦烈
A·M·索尔
N·R·特斯恩高斯
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Dolby International AB
Dolby Laboratories Licensing Corp
Original Assignee
Dolby International AB
Dolby Laboratories Licensing Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Dolby International AB, Dolby Laboratories Licensing Corp filed Critical Dolby International AB
Publication of CN105900169A publication Critical patent/CN105900169A/zh
Application granted granted Critical
Publication of CN105900169B publication Critical patent/CN105900169B/zh
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04SSTEREOPHONIC SYSTEMS 
    • H04S7/00Indicating arrangements; Control arrangements, e.g. balance control
    • H04S7/30Control circuits for electronic adaptation of the sound field
    • FMECHANICAL ENGINEERING; LIGHTING; HEATING; WEAPONS; BLASTING
    • F24HEATING; RANGES; VENTILATING
    • F24CDOMESTIC STOVES OR RANGES ; DETAILS OF DOMESTIC STOVES OR RANGES, OF GENERAL APPLICATION
    • F24C15/00Details
    • F24C15/20Removing cooking fumes
    • F24C15/2028Removing cooking fumes using an air curtain
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/008Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/48Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04RLOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
    • H04R29/00Monitoring arrangements; Testing arrangements
    • H04R29/008Visual indication of individual signal levels
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04SSTEREOPHONIC SYSTEMS 
    • H04S3/00Systems employing more than two channels, e.g. quadraphonic
    • H04S3/008Systems employing more than two channels, e.g. quadraphonic in which the audio signals are in digital form, i.e. employing more than two discrete digital channels
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04SSTEREOPHONIC SYSTEMS 
    • H04S2400/00Details of stereophonic systems covered by H04S but not provided for in its groups
    • H04S2400/13Aspects of volume control, not necessarily automatic, in stereophonic sound systems

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Signal Processing (AREA)
  • Multimedia (AREA)
  • Health & Medical Sciences (AREA)
  • Human Computer Interaction (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Computational Linguistics (AREA)
  • Mathematical Physics (AREA)
  • Chemical & Material Sciences (AREA)
  • Combustion & Propulsion (AREA)
  • Mechanical Engineering (AREA)
  • General Engineering & Computer Science (AREA)
  • General Health & Medical Sciences (AREA)
  • Otolaryngology (AREA)
  • Stereophonic System (AREA)
  • Two-Way Televisions, Distribution Of Moving Picture Or The Like (AREA)
CN201580004002.0A 2014-01-09 2015-01-05 音频内容的空间误差度量 Active CN105900169B (zh)

Applications Claiming Priority (5)

Application Number Priority Date Filing Date Title
ESP201430016 2014-01-09
ES201430016 2014-01-09
US201461951048P 2014-03-11 2014-03-11
US61/951,048 2014-03-11
PCT/US2015/010126 WO2015105748A1 (en) 2014-01-09 2015-01-05 Spatial error metrics of audio content

Publications (2)

Publication Number Publication Date
CN105900169A CN105900169A (zh) 2016-08-24
CN105900169B true CN105900169B (zh) 2020-01-03

Family

ID=52469071

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201580004002.0A Active CN105900169B (zh) 2014-01-09 2015-01-05 音频内容的空间误差度量

Country Status (5)

Country Link
US (1) US10492014B2 (enExample)
EP (1) EP3092642B1 (enExample)
JP (1) JP6518254B2 (enExample)
CN (1) CN105900169B (enExample)
WO (1) WO2015105748A1 (enExample)

Families Citing this family (30)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2015017037A1 (en) 2013-07-30 2015-02-05 Dolby International Ab Panning of audio objects to arbitrary speaker layouts
CN105336335B (zh) * 2014-07-25 2020-12-08 杜比实验室特许公司 利用子带对象概率估计的音频对象提取
CN105895086B (zh) 2014-12-11 2021-01-12 杜比实验室特许公司 元数据保留的音频对象聚类
EP3311379B1 (en) * 2015-06-17 2022-11-09 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Loudness control for user interactivity in audio coding systems
EP3332557B1 (en) 2015-08-07 2019-06-19 Dolby Laboratories Licensing Corporation Processing object-based audio signals
CN106385660B (zh) * 2015-08-07 2020-10-16 杜比实验室特许公司 处理基于对象的音频信号
US10278000B2 (en) 2015-12-14 2019-04-30 Dolby Laboratories Licensing Corporation Audio object clustering with single channel quality preservation
US9949052B2 (en) 2016-03-22 2018-04-17 Dolby Laboratories Licensing Corporation Adaptive panner of audio objects
WO2018017394A1 (en) * 2016-07-20 2018-01-25 Dolby Laboratories Licensing Corporation Audio object clustering based on renderer-aware perceptual difference
US10779106B2 (en) 2016-07-20 2020-09-15 Dolby Laboratories Licensing Corporation Audio object clustering based on renderer-aware perceptual difference
US12132866B2 (en) 2016-08-24 2024-10-29 Gridspace Inc. Configurable dynamic call routing and matching system
US10861436B1 (en) * 2016-08-24 2020-12-08 Gridspace Inc. Audio call classification and survey system
US11715459B2 (en) 2016-08-24 2023-08-01 Gridspace Inc. Alert generator for adaptive closed loop communication system
US11601552B2 (en) 2016-08-24 2023-03-07 Gridspace Inc. Hierarchical interface for adaptive closed loop communication system
US11721356B2 (en) 2016-08-24 2023-08-08 Gridspace Inc. Adaptive closed loop communication system
RU2763785C2 (ru) * 2017-04-25 2022-01-11 Сони Корпорейшн Способ и устройство обработки сигнала
CN110537220B (zh) * 2017-04-26 2024-04-16 索尼公司 信号处理设备和方法及程序
JP7224302B2 (ja) * 2017-05-09 2023-02-17 ドルビー ラボラトリーズ ライセンシング コーポレイション マルチチャネル空間的オーディオ・フォーマット入力信号の処理
CN111052770B (zh) * 2017-09-29 2021-12-03 苹果公司 空间音频下混频的方法及系统
US10628486B2 (en) * 2017-11-15 2020-04-21 Google Llc Partitioning videos
WO2019106221A1 (en) * 2017-11-28 2019-06-06 Nokia Technologies Oy Processing of spatial audio parameters
CN108984628B (zh) * 2018-06-20 2020-01-24 北京达佳互联信息技术有限公司 内容描述生成模型的损失值获取方法及装置
EP3874491B1 (en) * 2018-11-02 2024-05-01 Dolby International AB Audio encoder and audio decoder
KR102717379B1 (ko) * 2019-03-29 2024-10-15 텔레폰악티에볼라겟엘엠에릭슨(펍) 멀티 채널 오디오 프레임에서 예측적인 코딩에서 에러 복구를 위한 방법 및 장치
WO2020201039A1 (en) * 2019-03-29 2020-10-08 Telefonaktiebolaget Lm Ericsson (Publ) Method and apparatus for low cost error recovery in predictive coding
CN110493649B (zh) * 2019-09-12 2021-08-20 重庆市群众艺术馆 基于群众满意度的文化馆数字资源加工方法
US20230010466A1 (en) * 2019-12-09 2023-01-12 Dolby Laboratories Licensing Corporation Adjusting audio and non-audio features based on noise metrics and speech intelligibility metrics
CN113096671B (zh) * 2020-01-09 2022-05-13 齐鲁工业大学 一种大容量音频文件可逆信息隐藏方法及系统
US11704087B2 (en) * 2020-02-03 2023-07-18 Google Llc Video-informed spatial audio expansion
WO2025199350A1 (en) * 2024-03-22 2025-09-25 Dolby Laboratories Licensing Corporation Low-latency gain interpolation for audio object clustering

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101485202A (zh) * 2005-05-11 2009-07-15 高通股份有限公司 一种用于统一的错误隐匿框架的方法及设备
CN101547000A (zh) * 2009-05-08 2009-09-30 炬力集成电路设计有限公司 一种信号转换电路、数模转换装置和音频输出设备
GB2459012A (en) * 2008-03-20 2009-10-14 Univ Surrey Predicting the perceived spatial quality of sound processing and reproducing equipment
CN101582262A (zh) * 2009-06-16 2009-11-18 武汉大学 一种空间音频参数帧间预测编解码方法
CN101859563A (zh) * 2009-04-09 2010-10-13 哈曼国际工业有限公司 基于音频系统输出的有源噪声控制系统

Family Cites Families (19)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7617099B2 (en) * 2001-02-12 2009-11-10 FortMedia Inc. Noise suppression by two-channel tandem spectrum modification for speech signal in an automobile
BR0205527A (pt) * 2001-06-08 2003-07-08 Koninkl Philips Electronics Nv Métodos para editar um sinal de áudio original, e para decodificar um fluxo de áudio, editor de áudio, reprodutor de áudio, sistema de áudio, fluxo de áudio, e, meio de armazenagem
KR100479478B1 (ko) 2002-07-26 2005-03-31 연세대학교 산학협력단 객체별 중요도를 고려한 객체 기반의 트랜스코딩 방법 및그 장치
FR2862799B1 (fr) * 2003-11-26 2006-02-24 Inst Nat Rech Inf Automat Dispositif et methode perfectionnes de spatialisation du son
US8363865B1 (en) 2004-05-24 2013-01-29 Heather Bottum Multiple channel sound system using multi-speaker arrays
US8509313B2 (en) 2006-10-10 2013-08-13 Texas Instruments Incorporated Video error concealment
UA94117C2 (ru) 2006-10-16 2011-04-11 Долби Свиден Ав Усовершенстованное кодирование и отображение параметров многоканального кодирования микшированных объектов
KR101102401B1 (ko) 2006-11-24 2012-01-05 엘지전자 주식회사 오브젝트 기반 오디오 신호의 부호화 및 복호화 방법과 그 장치
CN101578875A (zh) 2007-01-04 2009-11-11 英国电讯有限公司 利用迭代重新编码的视频信号编码
AU2008215230B2 (en) 2007-02-14 2010-03-04 Lg Electronics Inc. Methods and apparatuses for encoding and decoding object-based audio signals
US7945119B2 (en) 2007-06-26 2011-05-17 Microsoft Corporation Optimizing character rendering
US8295494B2 (en) 2007-08-13 2012-10-23 Lg Electronics Inc. Enhancing audio with remixing capability
CA2702986C (en) 2007-10-17 2016-08-16 Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. Audio coding using downmix
MX2011011399A (es) 2008-10-17 2012-06-27 Univ Friedrich Alexander Er Aparato para suministrar uno o más parámetros ajustados para un suministro de una representación de señal de mezcla ascendente sobre la base de una representación de señal de mezcla descendete, decodificador de señal de audio, transcodificador de señal de audio, codificador de señal de audio, flujo de bits de audio, método y programa de computación que utiliza información paramétrica relacionada con el objeto.
JP5604933B2 (ja) 2010-03-30 2014-10-15 富士通株式会社 ダウンミクス装置およびダウンミクス方法
EP2727380B1 (en) 2011-07-01 2020-03-11 Dolby Laboratories Licensing Corporation Upmixing object based audio
US9479886B2 (en) * 2012-07-20 2016-10-25 Qualcomm Incorporated Scalable downmix design with feedback for object-based surround codec
EP2883366B8 (en) 2012-08-07 2016-12-14 Dolby Laboratories Licensing Corporation Encoding and rendering of object based audio indicative of game audio content
CN104885151B (zh) 2012-12-21 2017-12-22 杜比实验室特许公司 用于基于感知准则呈现基于对象的音频内容的对象群集

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101485202A (zh) * 2005-05-11 2009-07-15 高通股份有限公司 一种用于统一的错误隐匿框架的方法及设备
GB2459012A (en) * 2008-03-20 2009-10-14 Univ Surrey Predicting the perceived spatial quality of sound processing and reproducing equipment
CN101859563A (zh) * 2009-04-09 2010-10-13 哈曼国际工业有限公司 基于音频系统输出的有源噪声控制系统
CN101547000A (zh) * 2009-05-08 2009-09-30 炬力集成电路设计有限公司 一种信号转换电路、数模转换装置和音频输出设备
CN101582262A (zh) * 2009-06-16 2009-11-18 武汉大学 一种空间音频参数帧间预测编解码方法

Also Published As

Publication number Publication date
EP3092642B1 (en) 2018-05-16
WO2015105748A1 (en) 2015-07-16
JP2017508175A (ja) 2017-03-23
CN105900169A (zh) 2016-08-24
JP6518254B2 (ja) 2019-05-22
US10492014B2 (en) 2019-11-26
US20160337776A1 (en) 2016-11-17
EP3092642A1 (en) 2016-11-16

Similar Documents

Publication Publication Date Title
CN105900169B (zh) 音频内容的空间误差度量
US9479886B2 (en) Scalable downmix design with feedback for object-based surround codec
US9761229B2 (en) Systems, methods, apparatus, and computer-readable media for audio object clustering
CN103403800B (zh) 确定多声道音频信号的声道间时间差
Manocha et al. Speech quality assessment through MOS using non-matching references
US11138989B2 (en) Sound quality prediction and interface to facilitate high-quality voice recordings
US9451304B2 (en) Sound feature priority alignment
US20240249737A1 (en) Audio encoding and decoding method and related product
MX2013013261A (es) Asignacion de bits, codificacion y decodificacion de audio.
CN105874533A (zh) 音频对象提取
CN102165519A (zh) 处理信号的方法和装置
US11269589B2 (en) Inter-channel audio feature measurement and usages
US9936328B2 (en) Apparatus and method for estimating an overall mixing time based on at least a first pair of room impulse responses, as well as corresponding computer program
CN104900236A (zh) 音频信号处理
US10734006B2 (en) Audio coding based on audio pattern recognition
US10984811B2 (en) Audio coding method and related apparatus
US12424225B2 (en) Lecturer speech signal processing
EP3843428A1 (en) Inter-channel audio feature measurement and display on graphical user interface
JP2025540764A (ja) パラメトリック空間オーディオ符号化
CN116978360A (zh) 语音端点检测方法、装置和计算机设备
CN102760442B (zh) 一种3d音频中水平方位参数量化方法
CN117321680A (zh) 用于处理多声道音频信号的装置和方法
HK1220803A1 (en) Adaptive audio content generation
HK1220803B (en) Adaptive audio content generation

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant