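CN104995926B - Method and apparatus for determining directions of uncorrelated sound sources in a higher order ambisonics representation of a sound field

Method and apparatus for determining directions of uncorrelated sound sources in a higher order ambisonics representation of a sound field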

Info

Publication number
CN104995926B
Authority
CN
China
Prior art keywords
sound source
dominant
hoa
time frame
source
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201480008017.XA
Other languages
English (en)
Chinese (zh)
Other versions
CN104995926A (zh)
Inventor
Alexander Krueger
Sven Kordon
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Dolby International AB
Original Assignee
Thomson Licensing SAS
Dolby International AB
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Thomson Licensing SAS, Dolby International AB
Publication of CN104995926A
Application granted
Publication of CN104995926B
Active
Anticipated expiration

Classifications

    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04S STEREOPHONIC SYSTEMS
    • H04S3/00 Systems employing more than two channels, e.g. quadraphonic
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/008 Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00 Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02 Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0272 Voice signal separating
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04S STEREOPHONIC SYSTEMS
    • H04S2420/00 Techniques used stereophonic systems covered by H04S but not provided for in its groups
    • H04S2420/11 Application of ambisonics in stereophonic audio systems

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Signal Processing (AREA)
  • Multimedia (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Computational Linguistics (AREA)
  • Mathematical Physics (AREA)
  • Quality & Reliability (AREA)
  • Stereophonic System (AREA)
  • Measurement Of Mechanical Vibrations Or Ultrasonic Waves (AREA)
  • General Physics & Mathematics (AREA)
  • Radar, Positioning & Navigation (AREA)
  • Remote Sensing (AREA)
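CN201480008017.XA 2013-02-08 2014-02-07 Method and apparatus for determining directions of uncorrelated sound sources in a higher order ambisonics representation of a sound field Active CN104995926B (zh)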

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
EP20130305156 EP2765791A1 (en) 2013-02-08 2013-02-08 Method and apparatus for determining directions of uncorrelated sound sources in a higher order ambisonics representation of a sound field
EP13305156.5 2013-02-08
PCT/EP2014/052479 WO2014122287A1 (en) 2013-02-08 2014-02-07 Method and apparatus for determining directions of uncorrelated sound sources in a higher order ambisonics representation of a sound field

Publications (2)

Publication Number Publication Date
CN104995926A (zh) 2015-10-21
CN104995926B (zh) 2017-12-26

Family

ID=47780000

Family Applications (1)

Application Number Title Priority Date Filing Date
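CN201480008017.XA Active CN104995926B (zh) 2013-02-08 2014-02-07 Method and apparatus for determining directions of uncorrelated sound sources in a higher order ambisonics representation of a sound field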

Country Status (7)

Country Link
US (1) US9622008B2
EP (2) EP2765791A1
JP (1) JP6374882B2
KR (1) KR102220187B1
CN (1) CN104995926B
TW (1) TWI647961B
WO (1) WO2014122287A1

Families Citing this family (25)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP2665208A1 (en) * 2012-05-14 2013-11-20 Thomson Licensing Method and apparatus for compressing and decompressing a Higher Order Ambisonics signal representation
EP2743922A1 (en) 2012-12-12 2014-06-18 Thomson Licensing Method and apparatus for compressing and decompressing a higher order ambisonics representation for a sound field
EP2800401A1 (en) 2013-04-29 2014-11-05 Thomson Licensing Method and Apparatus for compressing and decompressing a Higher Order Ambisonics representation
US9466305B2 (en) 2013-05-29 2016-10-11 Qualcomm Incorporated Performing positional analysis to code spherical harmonic coefficients
US20140355769A1 (en) 2013-05-29 2014-12-04 Qualcomm Incorporated Energy preservation for decomposed representations of a sound field
US9502045B2 (en) 2014-01-30 2016-11-22 Qualcomm Incorporated Coding independent frames of ambient higher-order ambisonic coefficients
US9922656B2 (en) 2014-01-30 2018-03-20 Qualcomm Incorporated Transitioning of ambient higher-order ambisonic coefficients
US9852737B2 (en) 2014-05-16 2017-12-26 Qualcomm Incorporated Coding vectors decomposed from higher-order ambisonics audio signals
US9620137B2 (en) 2014-05-16 2017-04-11 Qualcomm Incorporated Determining between scalar and vector quantization in higher order ambisonic coefficients
US10770087B2 (en) 2014-05-16 2020-09-08 Qualcomm Incorporated Selecting codebooks for coding vectors decomposed from higher-order ambisonic audio signals
US9747910B2 (en) 2014-09-26 2017-08-29 Qualcomm Incorporated Switching between predictive and non-predictive quantization techniques in a higher order ambisonics (HOA) framework
EP3357259B1 (en) * 2015-09-30 2020-09-23 Dolby International AB Method and apparatus for generating 3d audio content from two-channel stereo content
CN105516875B (zh) * 2015-12-02 2020-03-06 Shanghai Aviation Electric Co., Ltd. Device for rapidly measuring the spatial angular resolution of a virtual sound generating device
GR1008860B (el) * 2015-12-29 2016-09-27 Konstantinos Dimitriou Spyropoulos System for separating speakers from audiovisual data
US10089063B2 (en) 2016-08-10 2018-10-02 Qualcomm Incorporated Multimedia device for processing spatialized audio based on movement
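JP6723120B2 (ja) * 2016-09-05 2020-07-15 Honda Motor Co., Ltd. Acoustic processing device and acoustic processing method
CN107147975B (zh) * 2017-04-26 2019-05-14 Peking University Ambisonics matching-projection decoding method for irregular loudspeaker placement
CN110800048B (zh) 2017-05-09 2023-07-28 Dolby Laboratories Licensing Corporation Processing of a multi-channel spatial audio format input signal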
US10405126B2 (en) * 2017-06-30 2019-09-03 Qualcomm Incorporated Mixed-order ambisonics (MOA) audio data for computer-mediated reality systems
FR3074584A1 (fr) * 2017-12-05 2019-06-07 Orange Processing of data from a video sequence to zoom in on a speaker detected in the sequence
CN110751956B (zh) * 2019-09-17 2022-04-26 北京时代拓灵科技有限公司 Immersive audio rendering method and system
CN111933182B (zh) * 2020-08-07 2024-04-19 Douyin Vision Co., Ltd. Sound source tracking method, apparatus, device and storage medium
CN112019971B (zh) * 2020-08-21 2022-03-22 安声(重庆)电子科技有限公司 Sound field construction method and apparatus, electronic device and computer-readable storage medium
US11743670B2 (en) 2020-12-18 2023-08-29 Qualcomm Incorporated Correlation-based rendering with multiple distributed streams accounting for an occlusion for six degree of freedom applications
CN117041856A (zh) * 2021-03-05 2023-11-10 Huawei Technologies Co., Ltd. Method and apparatus for obtaining HOA coefficients

Citations (3)

Publication number Priority date Publication date Assignee Title
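CN1659926A (zh) * 2002-05-07 2005-08-24 Rémy Bruno Method and system for representing a sound field
CN1849844A (zh) * 2003-07-31 2006-10-18 Trinnov Audio System and method for determining a representation of a sound field
CN102089634A (zh) * 2008-07-08 2011-06-08 Brüel & Kjær Sound & Vibration Measurement A/S Reconstructing an acoustic field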

Family Cites Families (12)

Publication number Priority date Publication date Assignee Title
GB9915398D0 (en) 1999-07-02 1999-09-01 Baker Matthew J Magnetic particles
FR2801108B1 (fr) 1999-11-16 2002-03-01 Maxmat S A Chemical or biochemical analyser with regulation of the reaction temperature
EP2285139B1 (en) * 2009-06-25 2018-08-08 Harpex Ltd. Device and method for converting spatial audio signal
EP2486561B1 (en) * 2009-10-07 2016-03-30 The University Of Sydney Reconstruction of a recorded sound field
KR102294460B1 (ko) 2010-03-26 2021-08-27 Dolby International AB Method and apparatus for decoding an audio soundfield representation for audio playback
WO2012025580A1 (en) * 2010-08-27 2012-03-01 Sonicemotion Ag Method and device for enhanced sound field reproduction of spatially encoded audio input signals
EP2450880A1 (en) * 2010-11-05 2012-05-09 Thomson Licensing Data structure for Higher Order Ambisonics audio data
EP2469741A1 (en) * 2010-12-21 2012-06-27 Thomson Licensing Method and apparatus for encoding and decoding successive frames of an ambisonics representation of a 2- or 3-dimensional sound field
EP2541547A1 (en) * 2011-06-30 2013-01-02 Thomson Licensing Method and apparatus for changing the relative positions of sound objects contained within a higher-order ambisonics representation
EP2665208A1 (en) 2012-05-14 2013-11-20 Thomson Licensing Method and apparatus for compressing and decompressing a Higher Order Ambisonics signal representation
EP2738962A1 (en) 2012-11-29 2014-06-04 Thomson Licensing Method and apparatus for determining dominant sound source directions in a higher order ambisonics representation of a sound field
US9913064B2 (en) * 2013-02-07 2018-03-06 Qualcomm Incorporated Mapping virtual speakers to physical speakers

Also Published As

Publication number Publication date
WO2014122287A1 (en) 2014-08-14
JP2016509812A (ja) 2016-03-31
KR20150115779A (ko) 2015-10-14
JP6374882B2 (ja) 2018-08-15
EP2765791A1 (en) 2014-08-13
US20150373471A1 (en) 2015-12-24
US9622008B2 (en) 2017-04-11
EP2954700A1 (en) 2015-12-16
EP2954700B1 (en) 2018-03-07
TWI647961B (zh) 2019-01-11
TW201448616A (zh) 2014-12-16
CN104995926A (zh) 2015-10-21
KR102220187B1 (ko) 2021-02-25

Similar Documents

Publication Publication Date Title
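CN104995926B (zh) Method and apparatus for determining directions of uncorrelated sound sources in a higher order ambisonics representation of a sound field
JP7276470B2 (ja) Direction-of-arrival estimation device, model learning device, direction-of-arrival estimation method, model learning method, and program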
Li et al. Online localization and tracking of multiple moving speakers in reverberant environments
US20080247274A1 (en) Sensor array post-filter for tracking spatial distributions of signals and noise
CN107219512B (zh) Sound source localization method based on the acoustic transfer function
Lima et al. A volumetric SRP with refinement step for sound source localization
EP2926482A1 (en) Method and apparatus for determining dominant sound source directions in a higher order ambisonics representation of a sound field
WO2016119388A1 (zh) Method and apparatus for constructing a focused covariance matrix based on a speech signal
Hosseini et al. Time difference of arrival estimation of sound source using cross correlation and modified maximum likelihood weighting function
Christensen Multi-channel maximum likelihood pitch estimation
Tengan et al. Multi-source direction-of-arrival estimation using steered response power and group-sparse optimization
Jia et al. Multi-source DOA estimation in reverberant environments using potential single-source points enhancement
Krause et al. Data diversity for improving DNN-based localization of concurrent sound events
JP5986966B2 (ja) Sound field recording and reproduction device, method and program
Günther et al. Microphone utility estimation in acoustic sensor networks using single-channel signal features
CN115497495B (zh) Method and apparatus for detecting or estimating a target sound source among multiple sound sources
Dehghan Firoozabadi et al. A novel nested circular microphone array and subband processing-based system for counting and DOA estimation of multiple simultaneous speakers
JP6650245B2 (ja) Impulse response generation device and program
CN116609726A (zh) Sound source localization method and apparatus
Wu et al. Acoustic source tracking in reverberant environment using regional steered response power measurement
Kienegger et al. Adaptive Rotary Steering with Joint Autoregression for Robust Extraction of Closely Moving Speakers in Dynamic Scenarios
CN118671700B (zh) Fused localization method, apparatus, device, storage medium and product for multiple sound sources
Wei et al. Dynamic blind source separation based on source-direction prediction
Wang et al. IPDnet2: an efficient and improved inter-channel phase difference estimation network for sound source localization
Mosayyebpour et al. Time delay estimation via minimum-phase and all-pass component processing

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C41 Transfer of patent application or patent right or utility model
TA01 Transfer of patent application right

Effective date of registration: 20160713

Address after: Amsterdam

Applicant after: Dolby International AB

Address before: Issy-les-Moulineaux, France

Applicant before: Thomson Licensing SA

GR01 Patent grant