CN106688251B - Audio processing systems and methods - Google Patents

Audio processing systems and methods

Info

Publication number
CN106688251B
Authority
CN
China
Prior art keywords
audio
metadata
renderer
channel
decoder
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201580045969.3A
Other languages
English (en)
Chinese (zh)
Other versions
CN106688251A (zh)
Inventor
T·J·埃格尔丁格
C·沃尔夫
A·C·诺埃尔
D·M·费舍尔
S·马蒂奈茨
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Dolby Laboratories Licensing Corp
Original Assignee
Dolby Laboratories Licensing Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Dolby Laboratories Licensing Corp
Publication of CN106688251A
Application granted
Publication of CN106688251B
Active
Anticipated expiration

Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/16 Vocoder architecture
    • G10L19/18 Vocoders using multiple modes
    • G10L19/20 Vocoders using multiple modes using sound class specific coding, hybrid encoders or object based coding
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/008 Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/018 Audio watermarking, i.e. embedding inaudible data in the audio signal
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/16 Vocoder architecture
    • G10L19/167 Audio streaming, i.e. formatting and decoding of an encoded audio signal representation into a data stream for transmission or storage purposes
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04S STEREOPHONIC SYSTEMS
    • H04S3/00 Systems employing more than two channels, e.g. quadraphonic
    • H04S3/008 Systems employing more than two channels, e.g. quadraphonic in which the audio signals are in digital form, i.e. employing more than two discrete digital channels
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04S STEREOPHONIC SYSTEMS
    • H04S5/00 Pseudo-stereo systems, e.g. in which additional channel signals are derived from monophonic signals by means of phase shifting, time delay or reverberation
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04S STEREOPHONIC SYSTEMS
    • H04S2400/00 Details of stereophonic systems covered by H04S but not provided for in its groups
    • H04S2400/11 Positioning of individual sound objects, e.g. moving airplane, within a sound field

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Signal Processing (AREA)
  • Multimedia (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Mathematical Physics (AREA)
  • Stereophonic System (AREA)
CN201580045969.3A 2014-07-31 2015-07-27 Audio processing systems and methods Active CN106688251B (zh)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
US201462031723P 2014-07-31 2014-07-31
US62/031,723 2014-07-31
PCT/US2015/042190 WO2016018787A1 (en) 2014-07-31 2015-07-27 Audio processing systems and methods

Publications (2)

Publication Number Publication Date
CN106688251A (zh) 2017-05-17
CN106688251B (zh) 2019-10-01

Family

ID=53784010

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201580045969.3A Active CN106688251B (zh) 2014-07-31 2015-07-27 Audio processing systems and methods

Country Status (5)

Country Link
US (1) US9875751B2
EP (1) EP3175446B1
JP (1) JP6710675B2
CN (1) CN106688251B
WO (1) WO2016018787A1

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US12094476B2 (en) 2019-12-02 2024-09-17 Dolby Laboratories Licensing Corporation Systems, methods and apparatus for conversion from channel-based audio to object-based audio

Families Citing this family (26)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20160315722A1 (en) * 2015-04-22 2016-10-27 Apple Inc. Audio stem delivery and control
CN107615767B (zh) 2015-06-02 2021-05-25 Sony Corporation Transmitting device, transmitting method, media processing device, media processing method, and receiving device
CA3281204A1 (en) * 2015-06-17 2025-10-31 Sony Corporation Transmitting device, transmitting method, receiving device, and receiving method
US10325610B2 (en) 2016-03-30 2019-06-18 Microsoft Technology Licensing, Llc Adaptive audio rendering
US10863297B2 (en) 2016-06-01 2020-12-08 Dolby International Ab Method converting multichannel audio content into object-based audio content and a method for processing audio content having a spatial position
EP3337066B1 (en) 2016-12-14 2020-09-23 Nokia Technologies Oy Distributed audio mixing
EP3566473B8 (en) 2017-03-06 2022-06-15 Dolby International AB Integrated reconstruction and rendering of audio signals
US11303689B2 (en) 2017-06-06 2022-04-12 Nokia Technologies Oy Method and apparatus for updating streamed content
GB2563635A (en) 2017-06-21 2018-12-26 Nokia Technologies Oy Recording and rendering audio signals
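KR102483470B1 (ko) * 2018-02-13 2023-01-02 Electronics and Telecommunications Research Institute Apparatus and method for generating stereophonic sound using multiple rendering schemes, and apparatus and method for reproducing stereophonic sound
CN108854062B (zh) * 2018-06-24 2019-08-09 Guangzhou Yinhan Technology Co., Ltd. Voice chat module for a mobile game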
WO2020072364A1 (en) * 2018-10-01 2020-04-09 Dolby Laboratories Licensing Corporation Creative intent scalability via physiological monitoring
RU2768224C1 (ru) * 2018-12-13 2022-03-23 Dolby Laboratories Licensing Corporation Two-sided media analytics
US11544032B2 (en) * 2019-01-24 2023-01-03 Dolby Laboratories Licensing Corporation Audio connection and transmission device
US11432097B2 (en) * 2019-07-03 2022-08-30 Qualcomm Incorporated User interface for controlling audio rendering for extended reality experiences
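CN112399189B (zh) * 2019-08-19 2022-05-17 Tencent Technology (Shenzhen) Co., Ltd. Delayed output control method, apparatus, system, device, and medium
JP7174755B2 (ja) 2019-11-26 2022-11-17 Google LLC Dynamic insertion of supplemental audio content into audio recordings at request time
KR102874344B1 (ko) 2019-12-02 2025-10-22 Samsung Electronics Co., Ltd. Electronic device and control method thereof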
US12101619B2 (en) 2021-03-08 2024-09-24 Exarion Inc. Sound tracing method and device to improve sound propagation performance
JP7753511B2 (ja) * 2021-07-29 2025-10-14 Dolby International AB Method and apparatus for processing object-based audio and channel-based audio
CN113938811A (zh) * 2021-09-01 2022-01-14 赛因芯微(北京)电子科技有限公司 Bed-based audio channel metadata and generation method, device, and storage medium
CN113905322A (zh) * 2021-09-01 2022-01-07 赛因芯微(北京)电子科技有限公司 Binaural-based audio channel metadata and generation method, device, and storage medium
CN113963725A (zh) * 2021-09-18 2022-01-21 赛因芯微(北京)电子科技有限公司 Audio object metadata and generation method, electronic device, and storage medium
CN114363790A (zh) * 2021-11-26 2022-04-15 赛因芯微(北京)电子科技有限公司 Serial audio block format metadata generation method, apparatus, device, and medium
FR3131058B1 (fr) * 2021-12-21 2024-08-09 Sagemcom Broadband Sas Set-top box for rendering an additional audio track
WO2025237967A1 (en) * 2024-05-13 2025-11-20 L-Acoustics Spatial audio performance system

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
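CN103354630A (zh) * 2008-07-17 2013-10-16 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Apparatus and method for generating audio output signals using object-based metadata
CN103650539A (zh) * 2011-07-01 2014-03-19 Dolby Laboratories Licensing Corporation System and method for adaptive audio signal generation, coding and rendering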

Family Cites Families (38)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5949410A (en) 1996-10-18 1999-09-07 Samsung Electronics Company, Ltd. Apparatus and method for synchronizing audio and video frames in an MPEG presentation system
JP3159098B2 (ja) 1997-01-13 2001-04-23 NEC Corporation Synchronized playback apparatus for image and audio
US7319703B2 (en) 2001-09-04 2008-01-15 Nokia Corporation Method and apparatus for reducing synchronization delay in packet-based voice terminals by resynchronizing during talk spurts
EP1343162A3 (en) 2002-03-05 2006-06-28 D&M Holdings, Inc. Audio reproducing apparatus
JP2004004274A (ja) * 2002-05-31 2004-01-08 Matsushita Electric Ind Co Ltd Audio signal processing switching device
EP1427252A1 (en) * 2002-12-02 2004-06-09 Deutsche Thomson-Brandt Gmbh Method and apparatus for processing audio signals from a bitstream
US7167108B2 (en) 2002-12-04 2007-01-23 Koninklijke Philips Electronics N.V. Method and apparatus for selecting particular decoder based on bitstream format detection
WO2005109403A1 (en) 2004-04-21 2005-11-17 Dolby Laboratories Licensing Corporation Audio bitstream format in which the bitstream syntax is described by an ordered transveral of a tree hierarchy data structure
US20070199043A1 (en) 2006-02-06 2007-08-23 Morris Richard M Multi-channel high-bandwidth media network
US7965771B2 (en) 2006-02-27 2011-06-21 Cisco Technology, Inc. Method and apparatus for immediate display of multicast IPTV over a bandwidth constrained network
US8190441B2 (en) 2006-09-11 2012-05-29 Apple Inc. Playback of compressed media files without quantization gaps
US8254248B2 (en) 2007-03-20 2012-08-28 Broadcom Corporation Method and system for implementing redundancy for streaming data in audio video bridging networks
EP2048890A1 (en) 2007-10-11 2009-04-15 Thomson Licensing System and method for an early start of audio-video rendering
US20090100493A1 (en) 2007-10-16 2009-04-16 At&T Knowledge Ventures, Lp. System and Method for Display Format Detection at Set Top Box Device
US8170226B2 (en) 2008-06-20 2012-05-01 Microsoft Corporation Acoustic echo cancellation and adaptive filters
US8639368B2 (en) * 2008-07-15 2014-01-28 Lg Electronics Inc. Method and an apparatus for processing an audio signal
JP2010098460A (ja) 2008-10-15 2010-04-30 Yamaha Corp Audio signal processing device
GB0820920D0 (en) 2008-11-14 2008-12-24 Wolfson Microelectronics Plc Codec apparatus
WO2010076770A2 (en) 2008-12-31 2010-07-08 France Telecom Communication system incorporating collaborative information exchange and method of operation thereof
FR2942096B1 (fr) 2009-02-11 2016-09-02 Arkamys Method for positioning a sound object in a 3D sound environment, audio medium implementing the method, and associated test platform
US20100223552A1 (en) 2009-03-02 2010-09-02 Metcalf Randall B Playback Device For Generating Sound Events
WO2011095913A1 (en) 2010-02-02 2011-08-11 Koninklijke Philips Electronics N.V. Spatial sound reproduction
FR2959037A1 (fr) 2010-04-14 2011-10-21 Orange Vallee Method for creating a media sequence from coherent groups of media files
US20120089390A1 (en) 2010-08-27 2012-04-12 Smule, Inc. Pitch corrected vocal capture for telephony targets
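CN103477651B (zh) 2011-03-25 2017-10-13 Ericsson (China) Communications Co., Ltd. Hybrid media receiver, middleware server and corresponding methods, computer programs and computer program products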
WO2013006342A1 (en) 2011-07-01 2013-01-10 Dolby Laboratories Licensing Corporation Synchronization and switchover methods and systems for an adaptive audio system
WO2013023287A1 (en) 2011-08-16 2013-02-21 Destiny Software Productions Inc. Script-based video rendering
WO2013117806A2 (en) 2012-02-07 2013-08-15 Nokia Corporation Visual spatial audio
US20130238992A1 (en) 2012-03-08 2013-09-12 Motorola Mobility, Inc. Method and Device for Content Control Based on Data Link Context
EP2873073A1 (en) 2012-07-12 2015-05-20 Dolby Laboratories Licensing Corporation Embedding data in stereo audio using saturation parameter modulation
KR102429953B1 (ko) 2012-07-19 2022-08-08 Dolby International AB Method and device for improving the rendering of multi-channel audio signals
US9532158B2 (en) * 2012-08-31 2016-12-27 Dolby Laboratories Licensing Corporation Reflected and direct rendering of upmixed content to individually addressable drivers
EP2898506B1 (en) 2012-09-21 2018-01-17 Dolby Laboratories Licensing Corporation Layered approach to spatial audio coding
EP2733964A1 (en) * 2012-11-15 2014-05-21 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Segment-wise adjustment of spatial audio signal to different playback loudspeaker setup
US9805725B2 (en) 2012-12-21 2017-10-31 Dolby Laboratories Licensing Corporation Object clustering for rendering object-based audio content based on perceptual criteria
GB2501150B (en) 2013-01-14 2014-07-23 Oxalis Group Ltd An audio amplifier
US8751832B2 (en) 2013-09-27 2014-06-10 James A Cashin Secure system and method for audio processing
ES2772851T3 (es) * 2013-11-27 2020-07-08 Dts Inc Multiplet-based matrix mixing for high-channel-count multichannel audio

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
Achim Kuntz et al. Delay Handling in MPEG-H 3D audio. 109. MPEG MEETING (MOTION PICTURE EXPERT GROUP OR ISO/IEC JTC1/SC29/WG11). 2014. *

Also Published As

Publication number Publication date
JP6710675B2 (ja) 2020-06-17
WO2016018787A1 (en) 2016-02-04
US9875751B2 (en) 2018-01-23
CN106688251A (zh) 2017-05-17
JP2017526264A (ja) 2017-09-07
US20170243596A1 (en) 2017-08-24
EP3175446A1 (en) 2017-06-07
EP3175446B1 (en) 2019-06-19

Similar Documents

Publication Publication Date Title
CN106688251B (zh) Audio processing systems and methods
RU2741738C1 (ru) System, method and non-transitory computer-readable data medium for generating, coding and rendering adaptive audio signal data
EP2727369B1 (en) Synchronization and switchover methods and systems for an adaptive audio system
JP5156110B2 (ja) Method for providing real-time multichannel interactive digital audio
EP2451196A1 (en) Method and apparatus for generating and for decoding sound field data including ambisonics sound field data of an order higher than three
AU2012279357A1 (en) System and method for adaptive audio signal generation, coding and rendering
JP2019165494A (ja) Screen-relative rendering of audio and encoding and decoding of audio for such rendering
RU2820838C2 (ru) System, method and non-transitory computer-readable data medium for generating, coding and rendering adaptive audio signal data
HK1226887A1 (en) System and method for adaptive audio signal generation, coding and rendering
HK1226887A (en) System and method for adaptive audio signal generation, coding and rendering

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant