TWI763717B - Apparatus, method, and non-transitory computer-readable medium for parametric audio decoding - Google Patents

Apparatus, method, and non-transitory computer-readable medium for parametric audio decoding

Info

Publication number
TWI763717B
Authority
TW
Taiwan
Prior art keywords
value
frequency
stereo parameter
output signal
signal
Prior art date
Application number
TW106132782A
Other languages
English (en)
Chinese (zh)
Other versions
TW201816775A (zh)
Inventor
Venkata Subrahmanyam Chandra Sekhar Chebiyyam
Venkatraman Atti
Original Assignee
Qualcomm Incorporated
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Qualcomm Incorporated
Publication of TW201816775A
Application granted
Publication of TWI763717B

Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/008 Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/02 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
    • G10L19/022 Blocking, i.e. grouping of samples in time; Choice of analysis windows; Overlap factoring
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04S STEREOPHONIC SYSTEMS
    • H04S1/00 Two-channel systems
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04S STEREOPHONIC SYSTEMS
    • H04S1/00 Two-channel systems
    • H04S1/007 Two-channel systems in which the audio signals are in digital form
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04S STEREOPHONIC SYSTEMS
    • H04S3/00 Systems employing more than two channels, e.g. quadraphonic
    • H04S3/008 Systems employing more than two channels, e.g. quadraphonic in which the audio signals are in digital form, i.e. employing more than two discrete digital channels
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/02 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/02 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
    • G10L19/0204 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders using subband decomposition
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04S STEREOPHONIC SYSTEMS
    • H04S2400/00 Details of stereophonic systems covered by H04S but not provided for in its groups
    • H04S2400/01 Multi-channel, i.e. more than two input channels, sound reproduction with two speakers wherein the multi-channel information is substantially preserved
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04S STEREOPHONIC SYSTEMS
    • H04S2420/00 Techniques used stereophonic systems covered by H04S but not provided for in its groups
    • H04S2420/03 Application of parametric coding in stereophonic audio systems
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04S STEREOPHONIC SYSTEMS
    • H04S2420/00 Techniques used stereophonic systems covered by H04S but not provided for in its groups
    • H04S2420/07 Synergistic effects of band splitting and sub-band processing

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Signal Processing (AREA)
  • Multimedia (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Mathematical Physics (AREA)
  • Spectroscopy & Molecular Physics (AREA)
  • Stereophonic System (AREA)

TW106132782A 2016-10-13 2017-09-25 Apparatus, method, and non-transitory computer-readable medium for parametric audio decoding TWI763717B (zh)

Applications Claiming Priority (4)

Application Number Priority Date Filing Date Title
US201662407843P 2016-10-13 2016-10-13
US62/407,843 2016-10-13
US15/708,717 2017-09-19
US15/708,717 US10362423B2 (en) 2016-10-13 2017-09-19 Parametric audio decoding

Publications (2)

Publication Number Publication Date
TW201816775A (zh) 2018-05-01
TWI763717B (zh) 2022-05-11

Family

ID=61902837

Family Applications (1)

Application Number Title Priority Date Filing Date
TW106132782A TWI763717B (zh) 2016-10-13 2017-09-25 Apparatus, method, and non-transitory computer-readable medium for parametric audio decoding

Country Status (10)

Country Link
US (5) US10362423B2 (en)
EP (1) EP3526791B1 (en)
JP (1) JP6987856B2 (en)
KR (2) KR102503904B1 (en)
CN (2) CN109804430B (en)
AU (1) AU2017342737B2 (en)
BR (1) BR112019007240A2 (en)
ES (1) ES2846281T3 (en)
TW (1) TWI763717B (en)
WO (1) WO2018071150A1 (en)

Families Citing this family (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
USRE48462E1 (en) * 2009-07-29 2021-03-09 Northwestern University Systems, methods, and apparatus for equalization preference learning
US10362423B2 (en) 2016-10-13 2019-07-23 Qualcomm Incorporated Parametric audio decoding
US11514921B2 (en) * 2019-09-26 2022-11-29 Apple Inc. Audio return channel data loopback
WO2022051076A1 (en) * 2020-09-01 2022-03-10 Sterling Labs Llc. Dynamically changing audio properties
US20250140271A1 (en) * 2021-08-30 2025-05-01 Nokia Technologies Oy Silence descriptor using spatial parameters
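CN115277592B (zh) * 2022-07-20 2023-04-11 哈尔滨市科佳通用机电股份有限公司 A decoding method for locomotive signal equipment during signal switching
CN119580735B (zh) * 2025-02-06 2025-04-25 泉州市二谷电子科技有限公司 A remote voice control method and system for a numerical control system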

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20150213806A1 (en) * 2012-10-05 2015-07-30 Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. Encoder, decoder and methods for backward compatible multi-resolution spatial-audio-object-coding

Family Cites Families (18)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7583805B2 (en) * 2004-02-12 2009-09-01 Agere Systems Inc. Late reverberation-based synthesis of auditory scenes
WO2004080125A1 (en) * 2003-03-04 2004-09-16 Nokia Corporation Support of a multichannel audio extension
EP2224430B1 (en) 2004-03-01 2011-10-05 Dolby Laboratories Licensing Corporation Multichannel audio decoding
JP5106115B2 (ja) * 2004-11-30 2012-12-26 Agere Systems Inc. Parametric coding of spatial audio using object-based side information
US7983922B2 (en) * 2005-04-15 2011-07-19 Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. Apparatus and method for generating multi-channel synthesizer control signal and apparatus and method for multi-channel synthesizing
US8379868B2 (en) * 2006-05-17 2013-02-19 Creative Technology Ltd Spatial audio coding based on universal spatial cues
US8103005B2 (en) 2008-02-04 2012-01-24 Creative Technology Ltd Primary-ambient decomposition of stereo audio signals using a complex similarity index
MX2011003824A (es) * 2008-10-08 2011-05-02 Fraunhofer Ges Forschung Multi-resolution switched audio encoding/decoding scheme.
CA2966469C (en) 2009-01-28 2020-05-05 Dolby International Ab Improved harmonic transposition
US8457975B2 (en) * 2009-01-28 2013-06-04 Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. Audio decoder, audio encoder, methods for decoding and encoding an audio signal and computer program
WO2011107951A1 (en) 2010-03-02 2011-09-09 Nokia Corporation Method and apparatus for upmixing a two-channel audio signal
EP2720222A1 (en) 2012-10-10 2014-04-16 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Apparatus and method for efficient synthesis of sinusoids and sweeps by employing spectral patterns
RU2665214C1 (ru) 2013-04-05 2018-08-28 Dolby International AB Stereo audio signal encoder and decoder
EP2838086A1 (en) * 2013-07-22 2015-02-18 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. In an reduction of comb filter artifacts in multi-channel downmix with adaptive phase alignment
EP2830059A1 (en) * 2013-07-22 2015-01-28 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Noise filling energy adjustment
US9293143B2 (en) * 2013-12-11 2016-03-22 Qualcomm Incorporated Bandwidth extension mode selection
US10163447B2 (en) 2013-12-16 2018-12-25 Qualcomm Incorporated High-band signal modeling
US10362423B2 (en) 2016-10-13 2019-07-23 Qualcomm Incorporated Parametric audio decoding

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
Journal: D. Mauler et al., "A low delay, variable resolution, perfect reconstruction spectral analysis-synthesis system for speech enhancement," 2007 15th European Signal Processing Conference, ISBN 978-83-921340-4-6, IEEE, 2007-09-03, https://www.eurasip.org/Proceedings/Eusipco/Eusipco2007/Papers/A2L-D02.pdf *

Also Published As

Publication number Publication date
KR20190064584A (ko) 2019-06-10
US20200336853A1 (en) 2020-10-22
AU2017342737A1 (en) 2019-03-28
CN109804430B (zh) 2023-05-12
JP2019535207A (ja) 2019-12-05
US10362423B2 (en) 2019-07-23
EP3526791A1 (en) 2019-08-21
BR112019007240A2 (pt) 2019-07-02
US20180109896A1 (en) 2018-04-19
US20190297444A1 (en) 2019-09-26
AU2017342737B2 (en) 2022-01-20
US20240031755A1 (en) 2024-01-25
ES2846281T3 (es) 2021-07-28
US20210385601A1 (en) 2021-12-09
US12022274B2 (en) 2024-06-25
WO2018071150A1 (en) 2018-04-19
KR102761057B1 (ko) 2025-01-31
TW201816775A (zh) 2018-05-01
EP3526791B1 (en) 2020-10-21
US10757521B2 (en) 2020-08-25
JP6987856B2 (ja) 2022-01-05
KR102503904B1 (ko) 2023-02-24
US11716584B2 (en) 2023-08-01
CN109804430A (zh) 2019-05-24
US11102600B2 (en) 2021-08-24
KR20230030055A (ko) 2023-03-03
CN116453528A (zh) 2023-07-18

Similar Documents

Publication Publication Date Title
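TWI763717B (zh) Apparatus, method, and non-transitory computer-readable medium for parametric audio decoding
KR102019617B1 (ko) Channel adjustment for inter-frame temporal shift variations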
US20200335114A1 (en) Stereo parameters for stereo decoding
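TWI778073B (zh) Audio signal coding device, method, non-transitory computer-readable medium comprising instructions, and apparatus for high-band residual prediction with time-domain inter-channel bandwidth extension
CN111095403A (zh) Selecting a channel adjustment method for inter-frame temporal shift variations
KR102264105B1 (ko) Multi-channel decoding
HK40020715B (zh) Selecting a channel adjustment method for inter-frame temporal shift variations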