TWI664624B - Device for encoding multiple audio signals, method and apparatus for communication, and computer-readable storage device - Google Patents

Info

Publication number
TWI664624B
Authority
TW
Taiwan
Prior art keywords
channel
value
signal
audio
shift value
Prior art date
Application number
TW105133088A
Other languages
English (en)
Chinese (zh)
Other versions
TW201719634A (zh)
Inventor
Venkatraman Atti (凡卡特拉曼 阿堤)
Venkata Subrahmanyam Chandra Sekhar Chebiyyam (文卡塔 薩伯拉曼亞姆 強卓 賽克哈爾 奇比亞姆)
Daniel Jared Sinder (丹尼爾 賈瑞德 辛德)
Original Assignee
Qualcomm Incorporated (美商高通公司)
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Qualcomm Incorporated (美商高通公司)
Publication of TW201719634A
Application granted
Publication of TWI664624B

Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/005 Correction of errors induced by the transmission channel, if related to the coding algorithm
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/008 Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00 Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/03 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
    • G10L25/06 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being correlation coefficients
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/48 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use
    • G10L25/51 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for comparison or discrimination
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04R LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
    • H04R2499/00 Aspects covered by H04R or H04S not otherwise provided for in their subgroups
    • H04R2499/10 General applications
    • H04R2499/11 Transducers incorporated or for use in hand-held devices, e.g. mobile phones, PDA's, camera's

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Health & Medical Sciences (AREA)
  • Signal Processing (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Computational Linguistics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Mathematical Physics (AREA)
  • Quality & Reliability (AREA)
  • Stereophonic System (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
TW105133088A 2015-11-20 2016-10-13 Device for encoding multiple audio signals, method and apparatus for communication, and computer-readable storage device TWI664624B (zh)

Applications Claiming Priority (4)

Application Number Priority Date Filing Date Title
US201562258369P 2015-11-20 2015-11-20
US62/258,369 2015-11-20
US15/274,041 US10152977B2 (en) 2015-11-20 2016-09-23 Encoding of multiple audio signals
US15/274,041 2016-09-23

Publications (2)

Publication Number Publication Date
TW201719634A (zh) 2017-06-01
TWI664624B (zh) 2019-07-01

Family

ID=57137264

Family Applications (2)

Application Number Title Priority Date Filing Date
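TW105133088A TWI664624B (zh) 2015-11-20 2016-10-13 Device for encoding multiple audio signals, method and apparatus for communication, and computer-readable storage device
TW108117949A TWI689917B (zh) 2015-11-20 2016-10-13 Device for encoding multiple audio signals, method and apparatus for communication, and computer-readable storage device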

Family Applications After (1)

Application Number Title Priority Date Filing Date
TW108117949A TWI689917B (zh) 2015-11-20 2016-10-13 Device for encoding multiple audio signals, method and apparatus for communication, and computer-readable storage device

Country Status (10)

Country Link
US (3) US10152977B2
EP (2) EP3378064B1
JP (2) JP6571281B2
KR (2) KR102391271B1
CN (2) CN108292505B
CA (1) CA3001579C
ES (1) ES3014625T3
PL (1) PL3378064T3
TW (2) TWI664624B
WO (1) WO2017087073A1

Families Citing this family (16)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9407989B1 (en) 2015-06-30 2016-08-02 Arthur Woodrow Closed audio circuit
US10152977B2 (en) 2015-11-20 2018-12-11 Qualcomm Incorporated Encoding of multiple audio signals
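CN109074812B (zh) * 2016-01-22 2023-11-17 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Apparatus and method for MDCT M/S stereo with global ILD and improved mid/side decision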
US10304468B2 (en) * 2017-03-20 2019-05-28 Qualcomm Incorporated Target sample generation
WO2018203471A1 (ja) * 2017-05-01 2018-11-08 Panasonic Intellectual Property Corporation of America Encoding device and encoding method
CN108877815B (zh) * 2017-05-16 2021-02-23 Huawei Technologies Co., Ltd. Stereo signal processing method and apparatus
US10885921B2 (en) * 2017-07-07 2021-01-05 Qualcomm Incorporated Multi-stream audio coding
CN109389987B (zh) 2017-08-10 2022-05-10 Huawei Technologies Co., Ltd. Audio coding and decoding mode determination method and related product
US10891960B2 (en) * 2017-09-11 2021-01-12 Qualcomm Incorporated Temporal offset estimation
US10872611B2 (en) * 2017-09-12 2020-12-22 Qualcomm Incorporated Selecting channel adjustment method for inter-frame temporal shift variations
US10839814B2 (en) * 2017-10-05 2020-11-17 Qualcomm Incorporated Encoding or decoding of audio signals
CN108428457B (zh) * 2018-02-12 2021-03-23 Beijing Baidu Netcom Science and Technology Co., Ltd. Audio deduplication method and apparatus
US11545165B2 (en) * 2018-07-03 2023-01-03 Panasonic Intellectual Property Corporation Of America Encoding device and encoding method using a determined prediction parameter based on an energy difference between channels
US11295726B2 (en) * 2019-04-08 2022-04-05 International Business Machines Corporation Synthetic narrowband data generation for narrowband automatic speech recognition systems
CN113870881B (zh) * 2021-09-26 2024-04-26 Southwest Petroleum University Robust Hammerstein subband spline adaptive echo cancellation method
US11900961B2 (en) * 2022-05-31 2024-02-13 Microsoft Technology Licensing, Llc Multichannel audio speech classification

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20030220783A1 (en) * 2002-03-12 2003-11-27 Sebastian Streich Efficiency improvements in scalable audio coding
US20120232912A1 (en) * 2009-09-11 2012-09-13 Mikko Tammi Method, Apparatus and Computer Program Product for Audio Coding
US20130304481A1 (en) * 2011-02-03 2013-11-14 Telefonaktiebolaget L M Ericsson (Publ) Determining the Inter-Channel Time Difference of a Multi-Channel Audio Signal

Family Cites Families (26)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6317703B1 (en) * 1996-11-12 2001-11-13 International Business Machines Corporation Separation of a mixture of acoustic sources into its components
JP4137202B2 (ja) * 1997-10-17 2008-08-20 Hitachi Medical Corporation Ultrasonic diagnostic apparatus
US7240001B2 (en) * 2001-12-14 2007-07-03 Microsoft Corporation Quality improvement techniques in an audio encoder
WO2006004048A1 (ja) * 2004-07-06 2006-01-12 Matsushita Electric Industrial Co., Ltd. Audio signal encoding device, audio signal decoding device, method, and program
US7716043B2 (en) * 2005-10-24 2010-05-11 Lg Electronics Inc. Removing time delays in signal paths
WO2007080211A1 (en) * 2006-01-09 2007-07-19 Nokia Corporation Decoding of binaural audio signals
EP1853092B1 (en) * 2006-05-04 2011-10-05 LG Electronics, Inc. Enhancing stereo audio with remix capability
GB2453117B (en) * 2007-09-25 2012-05-23 Motorola Mobility Inc Apparatus and method for encoding a multi channel audio signal
US8175291B2 (en) * 2007-12-19 2012-05-08 Qualcomm Incorporated Systems, methods, and apparatus for multi-microphone based speech enhancement
US20100290629A1 (en) 2007-12-21 2010-11-18 Panasonic Corporation Stereo signal converter, stereo signal inverter, and method therefor
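WO2009142017A1 (ja) * 2008-05-22 2009-11-26 Panasonic Corporation Stereo signal conversion device, stereo signal inverse conversion device, and methods therefor
CN102160113B (zh) 2008-08-11 2013-05-08 Nokia Corporation Multichannel audio coder and decoder
CN101673545B (zh) * 2008-09-12 2011-11-16 Huawei Technologies Co., Ltd. Encoding and decoding method and apparatus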
EP2209328B1 (en) * 2009-01-20 2013-10-23 Lg Electronics Inc. An apparatus for processing an audio signal and method thereof
US20100331048A1 (en) * 2009-06-25 2010-12-30 Qualcomm Incorporated M-s stereo reproduction at a device
US8463414B2 (en) 2010-08-09 2013-06-11 Motorola Mobility Llc Method and apparatus for estimating a parameter for low bit rate stereo transmission
US9552840B2 (en) * 2010-10-25 2017-01-24 Qualcomm Incorporated Three-dimensional sound capturing and reproducing with multi-microphones
US9767822B2 (en) * 2011-02-07 2017-09-19 Qualcomm Incorporated Devices for encoding and decoding a watermarked signal
EP2702776B1 (en) * 2012-02-17 2015-09-23 Huawei Technologies Co., Ltd. Parametric encoder for encoding a multi-channel audio signal
EP2839460A4 (en) * 2012-04-18 2015-12-30 Nokia Technologies Oy Stereo audio signal encoder
CN104641414A (zh) * 2012-07-19 2015-05-20 Nokia Corporation Stereo audio signal encoder
US9479886B2 (en) * 2012-07-20 2016-10-25 Qualcomm Incorporated Scalable downmix design with feedback for object-based surround codec
WO2014018950A1 (en) 2012-07-27 2014-01-30 Thorlabs, Inc. Agile imaging system
WO2015077641A1 (en) * 2013-11-22 2015-05-28 Qualcomm Incorporated Selective phase compensation in high band coding
CN104700839B (zh) * 2015-02-26 2016-03-23 Shenzhen ZTE Mobile Telecom Co., Ltd. Method, apparatus, mobile phone, and system for multichannel sound acquisition
US10152977B2 (en) 2015-11-20 2018-12-11 Qualcomm Incorporated Encoding of multiple audio signals

Also Published As

Publication number Publication date
KR20190137181A (ko) 2019-12-10
CN112951249A (zh) 2021-06-11
JP2019207430A (ja) 2019-12-05
ES3014625T3 (en) 2025-04-23
EP3378064A1 (en) 2018-09-26
US11094330B2 (en) 2021-08-17
US20170148447A1 (en) 2017-05-25
EP3378064B1 (en) 2025-02-19
TW201935465A (zh) 2019-09-01
WO2017087073A1 (en) 2017-05-26
KR20180084789A (ko) 2018-07-25
PL3378064T3 (pl) 2025-04-22
CN108292505B (zh) 2022-05-13
US10586544B2 (en) 2020-03-10
TWI689917B (zh) 2020-04-01
KR102391271B1 (ko) 2022-04-26
CN108292505A (zh) 2018-07-17
JP2018534625A (ja) 2018-11-22
BR112018010305A2 (pt) 2018-12-04
JP6571281B2 (ja) 2019-09-04
KR102054606B1 (ko) 2019-12-10
CA3001579A1 (en) 2017-05-26
JP6786679B2 (ja) 2020-11-18
US10152977B2 (en) 2018-12-11
TW201719634A (zh) 2017-06-01
US20190035409A1 (en) 2019-01-31
EP3378064C0 (en) 2025-02-19
CA3001579C (en) 2021-01-12
US20200202873A1 (en) 2020-06-25
EP4075428A1 (en) 2022-10-19

Similar Documents

Publication Publication Date Title
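TWI664624B (zh) Device for encoding multiple audio signals, method and apparatus for communication, and computer-readable storage device
TWI688243B (zh) Temporal offset estimation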
US10714101B2 (en) Target sample generation
US10115403B2 (en) Encoding of multiple audio signals
HK40010036A (en) Target sample generation