TWI647961B - 聲場的高階保真立體音響表示法中不相關聲源方向之決定方法及裝置 - Google Patents

聲場的高階保真立體音響表示法中不相關聲源方向之決定方法及裝置 Download PDF

Info

Publication number
TWI647961B
TWI647961B TW103104224A TW103104224A TWI647961B TW I647961 B TWI647961 B TW I647961B TW 103104224 A TW103104224 A TW 103104224A TW 103104224 A TW103104224 A TW 103104224A TW I647961 B TWI647961 B TW I647961B
Authority
TW
Taiwan
Prior art keywords
sound source
time frame
dominant
hoa
dominant sound
Prior art date
Application number
TW103104224A
Other languages
English (en)
Chinese (zh)
Other versions
TW201448616A (zh
Inventor
亞歷山大 克魯格
斯凡 科登
Original Assignee
瑞典商杜比國際公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 瑞典商杜比國際公司 filed Critical 瑞典商杜比國際公司
Publication of TW201448616A publication Critical patent/TW201448616A/zh
Application granted granted Critical
Publication of TWI647961B publication Critical patent/TWI647961B/zh

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04SSTEREOPHONIC SYSTEMS 
    • H04S3/00Systems employing more than two channels, e.g. quadraphonic
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/008Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0272Voice signal separating
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04SSTEREOPHONIC SYSTEMS 
    • H04S2420/00Techniques used stereophonic systems covered by H04S but not provided for in its groups
    • H04S2420/11Application of ambisonics in stereophonic audio systems

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Signal Processing (AREA)
  • Multimedia (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Computational Linguistics (AREA)
  • Mathematical Physics (AREA)
  • Quality & Reliability (AREA)
  • Stereophonic System (AREA)
  • General Physics & Mathematics (AREA)
  • Radar, Positioning & Navigation (AREA)
  • Remote Sensing (AREA)
  • Measurement Of Mechanical Vibrations Or Ultrasonic Waves (AREA)
TW103104224A 2013-02-08 2014-02-10 聲場的高階保真立體音響表示法中不相關聲源方向之決定方法及裝置 TWI647961B (zh)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
??13305156.5 2013-02-08
EP20130305156 EP2765791A1 (en) 2013-02-08 2013-02-08 Method and apparatus for determining directions of uncorrelated sound sources in a higher order ambisonics representation of a sound field

Publications (2)

Publication Number Publication Date
TW201448616A TW201448616A (zh) 2014-12-16
TWI647961B true TWI647961B (zh) 2019-01-11

Family

ID=47780000

Family Applications (1)

Application Number Title Priority Date Filing Date
TW103104224A TWI647961B (zh) 2013-02-08 2014-02-10 聲場的高階保真立體音響表示法中不相關聲源方向之決定方法及裝置

Country Status (7)

Country Link
US (1) US9622008B2 (https=)
EP (2) EP2765791A1 (https=)
JP (1) JP6374882B2 (https=)
KR (1) KR102220187B1 (https=)
CN (1) CN104995926B (https=)
TW (1) TWI647961B (https=)
WO (1) WO2014122287A1 (https=)

Families Citing this family (25)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP2665208A1 (en) * 2012-05-14 2013-11-20 Thomson Licensing Method and apparatus for compressing and decompressing a Higher Order Ambisonics signal representation
EP2743922A1 (en) 2012-12-12 2014-06-18 Thomson Licensing Method and apparatus for compressing and decompressing a higher order ambisonics representation for a sound field
EP2800401A1 (en) 2013-04-29 2014-11-05 Thomson Licensing Method and Apparatus for compressing and decompressing a Higher Order Ambisonics representation
US9495968B2 (en) 2013-05-29 2016-11-15 Qualcomm Incorporated Identifying sources from which higher order ambisonic audio data is generated
US9466305B2 (en) 2013-05-29 2016-10-11 Qualcomm Incorporated Performing positional analysis to code spherical harmonic coefficients
US9502045B2 (en) 2014-01-30 2016-11-22 Qualcomm Incorporated Coding independent frames of ambient higher-order ambisonic coefficients
US9922656B2 (en) 2014-01-30 2018-03-20 Qualcomm Incorporated Transitioning of ambient higher-order ambisonic coefficients
US9852737B2 (en) 2014-05-16 2017-12-26 Qualcomm Incorporated Coding vectors decomposed from higher-order ambisonics audio signals
US9620137B2 (en) 2014-05-16 2017-04-11 Qualcomm Incorporated Determining between scalar and vector quantization in higher order ambisonic coefficients
US10770087B2 (en) 2014-05-16 2020-09-08 Qualcomm Incorporated Selecting codebooks for coding vectors decomposed from higher-order ambisonic audio signals
US9747910B2 (en) 2014-09-26 2017-08-29 Qualcomm Incorporated Switching between predictive and non-predictive quantization techniques in a higher order ambisonics (HOA) framework
EP3357259B1 (en) 2015-09-30 2020-09-23 Dolby International AB Method and apparatus for generating 3d audio content from two-channel stereo content
CN105516875B (zh) * 2015-12-02 2020-03-06 上海航空电器有限公司 用于快速测量虚拟声音产生设备空间角度分辨率的装置
GR1008860B (el) * 2015-12-29 2016-09-27 Κωνσταντινος Δημητριου Σπυροπουλος Συστημα διαχωρισμου ομιλητων απο οπτικοακουστικα δεδομενα
US10089063B2 (en) 2016-08-10 2018-10-02 Qualcomm Incorporated Multimedia device for processing spatialized audio based on movement
JP6723120B2 (ja) * 2016-09-05 2020-07-15 本田技研工業株式会社 音響処理装置および音響処理方法
CN107147975B (zh) * 2017-04-26 2019-05-14 北京大学 一种面向不规则扬声器摆放的Ambisonics匹配投影解码方法
JP7224302B2 (ja) 2017-05-09 2023-02-17 ドルビー ラボラトリーズ ライセンシング コーポレイション マルチチャネル空間的オーディオ・フォーマット入力信号の処理
US10405126B2 (en) 2017-06-30 2019-09-03 Qualcomm Incorporated Mixed-order ambisonics (MOA) audio data for computer-mediated reality systems
FR3074584A1 (fr) * 2017-12-05 2019-06-07 Orange Traitement de donnees d'une sequence video pour un zoom sur un locuteur detecte dans la sequence
CN110751956B (zh) * 2019-09-17 2022-04-26 北京时代拓灵科技有限公司 一种沉浸式音频渲染方法及系统
CN111933182B (zh) * 2020-08-07 2024-04-19 抖音视界有限公司 声源跟踪方法、装置、设备和存储介质
CN112019971B (zh) * 2020-08-21 2022-03-22 安声(重庆)电子科技有限公司 声场构建方法、装置、电子设备及计算机可读存储介质
US11743670B2 (en) 2020-12-18 2023-08-29 Qualcomm Incorporated Correlation-based rendering with multiple distributed streams accounting for an occlusion for six degree of freedom applications
CN115038027B (zh) * 2021-03-05 2023-07-07 华为技术有限公司 Hoa系数的获取方法和装置

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20100329466A1 (en) * 2009-06-25 2010-12-30 Berges Allmenndigitale Radgivningstjeneste Device and method for converting spatial audio signal
WO2011041834A1 (en) * 2009-10-07 2011-04-14 The University Of Sydney Reconstruction of a recorded sound field
EP2469741A1 (en) * 2010-12-21 2012-06-27 Thomson Licensing Method and apparatus for encoding and decoding successive frames of an ambisonics representation of a 2- or 3-dimensional sound field

Family Cites Families (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
GB9915398D0 (en) 1999-07-02 1999-09-01 Baker Matthew J Magnetic particles
FR2801108B1 (fr) 1999-11-16 2002-03-01 Maxmat S A Analyseur chimique ou biochimique a regulation de la temperature reactionnelle
FR2839565B1 (fr) * 2002-05-07 2004-11-19 Remy Henri Denis Bruno Procede et systeme de representation d'un champ acoustique
FR2858403B1 (fr) * 2003-07-31 2005-11-18 Remy Henri Denis Bruno Systeme et procede de determination d'une representation d'un champ acoustique
EP2297557B1 (en) * 2008-07-08 2013-10-30 Brüel & Kjaer Sound & Vibration Measurement A/S Reconstructing an acoustic field
ES2472456T3 (es) 2010-03-26 2014-07-01 Thomson Licensing Método y dispositivo para decodificar una representación de un campo ac�stico de audio para reproducción de audio
WO2012025580A1 (en) * 2010-08-27 2012-03-01 Sonicemotion Ag Method and device for enhanced sound field reproduction of spatially encoded audio input signals
EP2450880A1 (en) * 2010-11-05 2012-05-09 Thomson Licensing Data structure for Higher Order Ambisonics audio data
EP2541547A1 (en) * 2011-06-30 2013-01-02 Thomson Licensing Method and apparatus for changing the relative positions of sound objects contained within a higher-order ambisonics representation
EP2665208A1 (en) 2012-05-14 2013-11-20 Thomson Licensing Method and apparatus for compressing and decompressing a Higher Order Ambisonics signal representation
EP2738962A1 (en) 2012-11-29 2014-06-04 Thomson Licensing Method and apparatus for determining dominant sound source directions in a higher order ambisonics representation of a sound field
US9736609B2 (en) * 2013-02-07 2017-08-15 Qualcomm Incorporated Determining renderers for spherical harmonic coefficients

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20100329466A1 (en) * 2009-06-25 2010-12-30 Berges Allmenndigitale Radgivningstjeneste Device and method for converting spatial audio signal
WO2011041834A1 (en) * 2009-10-07 2011-04-14 The University Of Sydney Reconstruction of a recorded sound field
EP2469741A1 (en) * 2010-12-21 2012-06-27 Thomson Licensing Method and apparatus for encoding and decoding successive frames of an ambisonics representation of a 2- or 3-dimensional sound field
EP2469742A2 (en) * 2010-12-21 2012-06-27 Thomson Licensing Method and apparatus for encoding and decoding successive frames of an ambisonics representation of a 2- or 3-dimensional sound field

Also Published As

Publication number Publication date
JP2016509812A (ja) 2016-03-31
EP2765791A1 (en) 2014-08-13
WO2014122287A1 (en) 2014-08-14
TW201448616A (zh) 2014-12-16
KR102220187B1 (ko) 2021-02-25
EP2954700B1 (en) 2018-03-07
JP6374882B2 (ja) 2018-08-15
KR20150115779A (ko) 2015-10-14
CN104995926A (zh) 2015-10-21
US20150373471A1 (en) 2015-12-24
EP2954700A1 (en) 2015-12-16
US9622008B2 (en) 2017-04-11
CN104995926B (zh) 2017-12-26

Similar Documents

Publication Publication Date Title
TWI647961B (zh) 聲場的高階保真立體音響表示法中不相關聲源方向之決定方法及裝置
JP7158806B2 (ja) オーディオ認識方法、ターゲットオーディオを位置決める方法、それらの装置、およびデバイスとコンピュータプログラム
EP2530484B1 (en) Sound source localization apparatus and method
Yang et al. SRP-DNN: Learning direct-path phase difference for multiple moving sound source localization
US20040190730A1 (en) System and process for time delay estimation in the presence of correlated noise and reverberation
KR20180069879A (ko) 음성 강화를 위해 전역적으로 최적화된 최소 제곱 포스트 필터링
Traa et al. Multichannel source separation and tracking with RANSAC and directional statistics
Hosseini et al. Time difference of arrival estimation of sound source using cross correlation and modified maximum likelihood weighting function
Sharma et al. Development of a speech separation system using frequency domain blind source separation technique
EP2745293B1 (en) Signal noise attenuation
Krause et al. Data diversity for improving DNN-based localization of concurrent sound events
Pirhosseinloo et al. A new feature set for masking-based monaural speech separation
Zhang et al. Multi-Target Ensemble Learning for Monaural Speech Separation.
Athanasopoulos et al. Robust speaker localization for real-world robots
Tourbabin et al. Direction of arrival estimation in highly reverberant environments using soft time-frequency mask
Yang et al. A stacked self-attention network for two-dimensional direction-of-arrival estimation in hands-free speech communication
Dehghan Firoozabadi et al. A novel nested circular microphone array and subband processing-based system for counting and DOA estimation of multiple simultaneous speakers
Wu et al. Acoustic source tracking in reverberant environment using regional steered response power measurement
CN110838303A (zh) 一种利用传声器阵列的语音声源定位方法
Toma et al. Efficient Detection and Localization of Acoustic Sources with a low complexity CNN network and the Diagonal Unloading Beamforming
Taseska et al. Minimum Bayes risk signal detection for speech enhancement based on a narrowband DOA model
Firoozabadi et al. Combination of nested microphone array and subband processing for multiple simultaneous speaker localization
Weisman et al. Spatial Covariance Matrix Estimation for Reverberant Speech with Application to Speech Enhancement.
Mosayyebpour et al. Time delay estimation via minimum-phase and all-pass component processing
MESSANA CNN-based estimation of dereverberated relative harmonics coefficients for localization of acoustic sources