CN104995926B - 用于确定在声场的高阶高保真立体声表示中不相关的声源的方向的方法和装置 - Google Patents

用于确定在声场的高阶高保真立体声表示中不相关的声源的方向的方法和装置 Download PDF

Info

Publication number
CN104995926B
CN104995926B CN201480008017.XA CN201480008017A CN104995926B CN 104995926 B CN104995926 B CN 104995926B CN 201480008017 A CN201480008017 A CN 201480008017A CN 104995926 B CN104995926 B CN 104995926B
Authority
CN
China
Prior art keywords
dominant
sound source
hoa
time frame
source
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201480008017.XA
Other languages
English (en)
Chinese (zh)
Other versions
CN104995926A (zh
Inventor
亚历山大·克鲁格
斯文·科尔东
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Dolby International AB
Original Assignee
Thomson Licensing SAS
Dolby International AB
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Thomson Licensing SAS, Dolby International AB filed Critical Thomson Licensing SAS
Publication of CN104995926A publication Critical patent/CN104995926A/zh
Application granted granted Critical
Publication of CN104995926B publication Critical patent/CN104995926B/zh
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04SSTEREOPHONIC SYSTEMS 
    • H04S3/00Systems employing more than two channels, e.g. quadraphonic
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/008Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0272Voice signal separating
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04SSTEREOPHONIC SYSTEMS 
    • H04S2420/00Techniques used stereophonic systems covered by H04S but not provided for in its groups
    • H04S2420/11Application of ambisonics in stereophonic audio systems

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Signal Processing (AREA)
  • Multimedia (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Computational Linguistics (AREA)
  • Mathematical Physics (AREA)
  • Quality & Reliability (AREA)
  • Stereophonic System (AREA)
  • General Physics & Mathematics (AREA)
  • Radar, Positioning & Navigation (AREA)
  • Remote Sensing (AREA)
  • Measurement Of Mechanical Vibrations Or Ultrasonic Waves (AREA)
CN201480008017.XA 2013-02-08 2014-02-07 用于确定在声场的高阶高保真立体声表示中不相关的声源的方向的方法和装置 Active CN104995926B (zh)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
EP13305156.5 2013-02-08
EP20130305156 EP2765791A1 (en) 2013-02-08 2013-02-08 Method and apparatus for determining directions of uncorrelated sound sources in a higher order ambisonics representation of a sound field
PCT/EP2014/052479 WO2014122287A1 (en) 2013-02-08 2014-02-07 Method and apparatus for determining directions of uncorrelated sound sources in a higher order ambisonics representation of a sound field

Publications (2)

Publication Number Publication Date
CN104995926A CN104995926A (zh) 2015-10-21
CN104995926B true CN104995926B (zh) 2017-12-26

Family

ID=47780000

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201480008017.XA Active CN104995926B (zh) 2013-02-08 2014-02-07 用于确定在声场的高阶高保真立体声表示中不相关的声源的方向的方法和装置

Country Status (7)

Country Link
US (1) US9622008B2 (https=)
EP (2) EP2765791A1 (https=)
JP (1) JP6374882B2 (https=)
KR (1) KR102220187B1 (https=)
CN (1) CN104995926B (https=)
TW (1) TWI647961B (https=)
WO (1) WO2014122287A1 (https=)

Families Citing this family (25)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP2665208A1 (en) * 2012-05-14 2013-11-20 Thomson Licensing Method and apparatus for compressing and decompressing a Higher Order Ambisonics signal representation
EP2743922A1 (en) 2012-12-12 2014-06-18 Thomson Licensing Method and apparatus for compressing and decompressing a higher order ambisonics representation for a sound field
EP2800401A1 (en) 2013-04-29 2014-11-05 Thomson Licensing Method and Apparatus for compressing and decompressing a Higher Order Ambisonics representation
US9495968B2 (en) 2013-05-29 2016-11-15 Qualcomm Incorporated Identifying sources from which higher order ambisonic audio data is generated
US9466305B2 (en) 2013-05-29 2016-10-11 Qualcomm Incorporated Performing positional analysis to code spherical harmonic coefficients
US9502045B2 (en) 2014-01-30 2016-11-22 Qualcomm Incorporated Coding independent frames of ambient higher-order ambisonic coefficients
US9922656B2 (en) 2014-01-30 2018-03-20 Qualcomm Incorporated Transitioning of ambient higher-order ambisonic coefficients
US9852737B2 (en) 2014-05-16 2017-12-26 Qualcomm Incorporated Coding vectors decomposed from higher-order ambisonics audio signals
US9620137B2 (en) 2014-05-16 2017-04-11 Qualcomm Incorporated Determining between scalar and vector quantization in higher order ambisonic coefficients
US10770087B2 (en) 2014-05-16 2020-09-08 Qualcomm Incorporated Selecting codebooks for coding vectors decomposed from higher-order ambisonic audio signals
US9747910B2 (en) 2014-09-26 2017-08-29 Qualcomm Incorporated Switching between predictive and non-predictive quantization techniques in a higher order ambisonics (HOA) framework
EP3357259B1 (en) 2015-09-30 2020-09-23 Dolby International AB Method and apparatus for generating 3d audio content from two-channel stereo content
CN105516875B (zh) * 2015-12-02 2020-03-06 上海航空电器有限公司 用于快速测量虚拟声音产生设备空间角度分辨率的装置
GR1008860B (el) * 2015-12-29 2016-09-27 Κωνσταντινος Δημητριου Σπυροπουλος Συστημα διαχωρισμου ομιλητων απο οπτικοακουστικα δεδομενα
US10089063B2 (en) 2016-08-10 2018-10-02 Qualcomm Incorporated Multimedia device for processing spatialized audio based on movement
JP6723120B2 (ja) * 2016-09-05 2020-07-15 本田技研工業株式会社 音響処理装置および音響処理方法
CN107147975B (zh) * 2017-04-26 2019-05-14 北京大学 一种面向不规则扬声器摆放的Ambisonics匹配投影解码方法
JP7224302B2 (ja) 2017-05-09 2023-02-17 ドルビー ラボラトリーズ ライセンシング コーポレイション マルチチャネル空間的オーディオ・フォーマット入力信号の処理
US10405126B2 (en) 2017-06-30 2019-09-03 Qualcomm Incorporated Mixed-order ambisonics (MOA) audio data for computer-mediated reality systems
FR3074584A1 (fr) * 2017-12-05 2019-06-07 Orange Traitement de donnees d'une sequence video pour un zoom sur un locuteur detecte dans la sequence
CN110751956B (zh) * 2019-09-17 2022-04-26 北京时代拓灵科技有限公司 一种沉浸式音频渲染方法及系统
CN111933182B (zh) * 2020-08-07 2024-04-19 抖音视界有限公司 声源跟踪方法、装置、设备和存储介质
CN112019971B (zh) * 2020-08-21 2022-03-22 安声(重庆)电子科技有限公司 声场构建方法、装置、电子设备及计算机可读存储介质
US11743670B2 (en) 2020-12-18 2023-08-29 Qualcomm Incorporated Correlation-based rendering with multiple distributed streams accounting for an occlusion for six degree of freedom applications
CN115038027B (zh) * 2021-03-05 2023-07-07 华为技术有限公司 Hoa系数的获取方法和装置

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1659926A (zh) * 2002-05-07 2005-08-24 雷米·布鲁诺 表示声场的方法和系统
CN1849844A (zh) * 2003-07-31 2006-10-18 特因诺夫音频公司 确定声场的表示的系统和方法
CN102089634A (zh) * 2008-07-08 2011-06-08 布鲁尔及凯尔声音及振动测量公司 重建声学场

Family Cites Families (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
GB9915398D0 (en) 1999-07-02 1999-09-01 Baker Matthew J Magnetic particles
FR2801108B1 (fr) 1999-11-16 2002-03-01 Maxmat S A Analyseur chimique ou biochimique a regulation de la temperature reactionnelle
ES2690164T3 (es) * 2009-06-25 2018-11-19 Dts Licensing Limited Dispositivo y método para convertir una señal de audio espacial
EP2486561B1 (en) * 2009-10-07 2016-03-30 The University Of Sydney Reconstruction of a recorded sound field
ES2472456T3 (es) 2010-03-26 2014-07-01 Thomson Licensing Método y dispositivo para decodificar una representación de un campo ac�stico de audio para reproducción de audio
WO2012025580A1 (en) * 2010-08-27 2012-03-01 Sonicemotion Ag Method and device for enhanced sound field reproduction of spatially encoded audio input signals
EP2450880A1 (en) * 2010-11-05 2012-05-09 Thomson Licensing Data structure for Higher Order Ambisonics audio data
EP2469741A1 (en) * 2010-12-21 2012-06-27 Thomson Licensing Method and apparatus for encoding and decoding successive frames of an ambisonics representation of a 2- or 3-dimensional sound field
EP2541547A1 (en) * 2011-06-30 2013-01-02 Thomson Licensing Method and apparatus for changing the relative positions of sound objects contained within a higher-order ambisonics representation
EP2665208A1 (en) 2012-05-14 2013-11-20 Thomson Licensing Method and apparatus for compressing and decompressing a Higher Order Ambisonics signal representation
EP2738962A1 (en) 2012-11-29 2014-06-04 Thomson Licensing Method and apparatus for determining dominant sound source directions in a higher order ambisonics representation of a sound field
US9736609B2 (en) * 2013-02-07 2017-08-15 Qualcomm Incorporated Determining renderers for spherical harmonic coefficients

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1659926A (zh) * 2002-05-07 2005-08-24 雷米·布鲁诺 表示声场的方法和系统
CN1849844A (zh) * 2003-07-31 2006-10-18 特因诺夫音频公司 确定声场的表示的系统和方法
CN102089634A (zh) * 2008-07-08 2011-06-08 布鲁尔及凯尔声音及振动测量公司 重建声学场

Also Published As

Publication number Publication date
JP2016509812A (ja) 2016-03-31
EP2765791A1 (en) 2014-08-13
WO2014122287A1 (en) 2014-08-14
TW201448616A (zh) 2014-12-16
KR102220187B1 (ko) 2021-02-25
EP2954700B1 (en) 2018-03-07
JP6374882B2 (ja) 2018-08-15
KR20150115779A (ko) 2015-10-14
CN104995926A (zh) 2015-10-21
TWI647961B (zh) 2019-01-11
US20150373471A1 (en) 2015-12-24
EP2954700A1 (en) 2015-12-16
US9622008B2 (en) 2017-04-11

Similar Documents

Publication Publication Date Title
CN104995926B (zh) 用于确定在声场的高阶高保真立体声表示中不相关的声源的方向的方法和装置
US11922965B2 (en) Direction of arrival estimation apparatus, model learning apparatus, direction of arrival estimation method, model learning method, and program
US9689959B2 (en) Method, apparatus and computer program product for determining the location of a plurality of speech sources
Li et al. Online localization and tracking of multiple moving speakers in reverberant environments
Lima et al. A volumetric SRP with refinement step for sound source localization
JP2007523514A (ja) 適応ビームフォーマ、サイドローブキャンセラー、方法、装置、及びコンピュータープログラム
JP2017503388A (ja) マイクロホンアレイを使用した残響音の抽出
WO2016119388A1 (zh) 一种基于语音信号构造聚焦协方差矩阵的方法及装置
Hosseini et al. Time difference of arrival estimation of sound source using cross correlation and modified maximum likelihood weighting function
Dang et al. An iteratively reweighted steered response power approach to multisource localization using a distributed microphone network
Tengan et al. Multi-source direction-of-arrival estimation using steered response power and group-sparse optimization
Krause et al. Data diversity for improving DNN-based localization of concurrent sound events
JP2019054344A (ja) フィルタ係数算出装置、収音装置、その方法、及びプログラム
Toma et al. Efficient Detection and Localization of Acoustic Sources with a low complexity CNN network and the Diagonal Unloading Beamforming
Dehghan Firoozabadi et al. A novel nested circular microphone array and subband processing-based system for counting and DOA estimation of multiple simultaneous speakers
Günther et al. Microphone utility estimation in acoustic sensor networks using single-channel signal features
CN109901113B (zh) 一种基于复杂环境的语音信号定位方法、装置及系统
Kienegger et al. Adaptive Rotary Steering with Joint Autoregression for Robust Extraction of Closely Moving Speakers in Dynamic Scenarios
Li et al. A cascaded multiple-speaker localization and tracking system
Firoozabadi et al. Multi-speaker localization by central and lateral microphone arrays based on the combination of 2D-SRP and subband GEVD algorithms
Mosayyebpour et al. Time delay estimation via minimum-phase and all-pass component processing
Ayllón et al. Real-time multiple doa estimation of speech sources in wireless acoustic sensor networks
Wang et al. IPDnet2: an efficient and improved inter-channel phase difference estimation network for sound source localization
de Groot et al. Loudspeaker Beamforming to Enhance Speech Recognition Performance of Voice Driven Applications
Mitchell et al. Improved direction of arrival estimations with a wearable microphone array for dynamic environments by reliability weighting

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C41 Transfer of patent application or patent right or utility model
TA01 Transfer of patent application right

Effective date of registration: 20160713

Address after: Amsterdam

Applicant after: Dolby International AB

Address before: The French Yixilaimu Leo City

Applicant before: Thomson Licensing SA

GR01 Patent grant