TWI689917B - Device for encoding multiple audio signals, method and apparatus of communication, and computer-readable storage device - Google Patents

Device for encoding multiple audio signals, method and apparatus of communication, and computer-readable storage device

Publication number
TWI689917B
Authority
TW
Taiwan
Prior art keywords
audio signal
signal
value
shift value
shift
Prior art date
Application number
TW108117949A
Other languages
English (en)
Chinese (zh)
Other versions
TW201935465A (zh)
Inventor
Venkatraman Atti
Venkata Subrahmanyam Chandra Sekhar Chebiyyam
Daniel Jared Sinder
Original Assignee
Qualcomm Incorporated
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Qualcomm Incorporated
Publication of TW201935465A
Application granted
Publication of TWI689917B

Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/005 Correction of errors induced by the transmission channel, if related to the coding algorithm
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/008 Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00 Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/03 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
    • G10L25/06 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being correlation coefficients
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/48 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use
    • G10L25/51 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for comparison or discrimination
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04R LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; ELECTRIC HEARING AIDS; PUBLIC ADDRESS SYSTEMS
    • H04R2499/00 Aspects covered by H04R or H04S not otherwise provided for in their subgroups
    • H04R2499/10 General applications
    • H04R2499/11 Transducers incorporated or for use in hand-held devices, e.g. mobile phones, PDA's, camera's

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Health & Medical Sciences (AREA)
  • Signal Processing (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Computational Linguistics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Mathematical Physics (AREA)
  • Quality & Reliability (AREA)
  • Stereophonic System (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
TW108117949A 2015-11-20 2016-10-13 Device for encoding multiple audio signals, method and apparatus of communication, and computer-readable storage device TWI689917B (zh)

Applications Claiming Priority (4)

Application Number Priority Date Filing Date Title
US201562258369P 2015-11-20 2015-11-20
US62/258,369 2015-11-20
US15/274,041 2016-09-23
US15/274,041 US10152977B2 (en) 2015-11-20 2016-09-23 Encoding of multiple audio signals

Publications (2)

Publication Number Publication Date
TW201935465A (zh) 2019-09-01
TWI689917B (zh) 2020-04-01

Family

ID=57137264

Family Applications (2)

Application Number Title Priority Date Filing Date
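TW108117949A TWI689917B (zh) 2015-11-20 2016-10-13 Device for encoding multiple audio signals, method and apparatus of communication, and computer-readable storage device
TW105133088A TWI664624B (zh) 2015-11-20 2016-10-13 Device for encoding multiple audio signals, method and apparatus of communication, and computer-readable storage device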

Family Applications After (1)

Application Number Title Priority Date Filing Date
TW105133088A TWI664624B (zh) 2015-11-20 2016-10-13 Device for encoding multiple audio signals, method and apparatus of communication, and computer-readable storage device

Country Status (10)

Country Link
US (3) US10152977B2
EP (2) EP3378064B1
JP (2) JP6571281B2
KR (2) KR102054606B1
CN (2) CN108292505B
CA (1) CA3001579C
ES (1) ES3014625T3
PL (1) PL3378064T3
TW (2) TWI689917B
WO (1) WO2017087073A1

Families Citing this family (17)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9407989B1 (en) 2015-06-30 2016-08-02 Arthur Woodrow Closed audio circuit
US10152977B2 (en) 2015-11-20 2018-12-11 Qualcomm Incorporated Encoding of multiple audio signals
WO2017125544A1 (en) 2016-01-22 2017-07-27 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Apparatus and method for MDCT M/S stereo with global ILD with improved mid/side decision
US10304468B2 (en) * 2017-03-20 2019-05-28 Qualcomm Incorporated Target sample generation
JP6811312B2 (ja) * 2017-05-01 2021-01-13 Panasonic Intellectual Property Corporation of America Encoding device and encoding method
CN108877815B (zh) * 2017-05-16 2021-02-23 Huawei Technologies Co., Ltd. Stereo signal processing method and apparatus
US10885921B2 (en) * 2017-07-07 2021-01-05 Qualcomm Incorporated Multi-stream audio coding
CN114898761A (zh) 2017-08-10 2022-08-12 Huawei Technologies Co., Ltd. Stereo signal encoding and decoding method and apparatus
US10891960B2 (en) * 2017-09-11 2021-01-12 Qualcomm Incorporated Temporal offset estimation
US10872611B2 (en) * 2017-09-12 2020-12-22 Qualcomm Incorporated Selecting channel adjustment method for inter-frame temporal shift variations
US10839814B2 (en) * 2017-10-05 2020-11-17 Qualcomm Incorporated Encoding or decoding of audio signals
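CN108428457B (zh) * 2018-02-12 2021-03-23 Beijing Baidu Netcom Science and Technology Co., Ltd. Audio deduplication method and apparatus
CN112352277B (zh) * 2018-07-03 2024-05-31 Panasonic Intellectual Property Corporation of America Encoding device and encoding method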
US11295726B2 (en) * 2019-04-08 2022-04-05 International Business Machines Corporation Synthetic narrowband data generation for narrowband automatic speech recognition systems
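KR20220058236A (ko) 2020-10-30 2022-05-09 Samsung Electronics Co., Ltd. Audio data processing method and apparatus therefor
CN113870881B (zh) * 2021-09-26 2024-04-26 Southwest Petroleum University Robust Hammerstein subband spline adaptive echo cancellation method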
US11900961B2 (en) * 2022-05-31 2024-02-13 Microsoft Technology Licensing, Llc Multichannel audio speech classification

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20030220783A1 (en) * 2002-03-12 2003-11-27 Sebastian Streich Efficiency improvements in scalable audio coding
US20120232912A1 (en) * 2009-09-11 2012-09-13 Mikko Tammi Method, Apparatus and Computer Program Product for Audio Coding
US20130304481A1 (en) * 2011-02-03 2013-11-14 Telefonaktiebolaget L M Ericsson (Publ) Determining the Inter-Channel Time Difference of a Multi-Channel Audio Signal

Family Cites Families (26)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6317703B1 (en) * 1996-11-12 2001-11-13 International Business Machines Corporation Separation of a mixture of acoustic sources into its components
JP4137202B2 (ja) * 1997-10-17 2008-08-20 Hitachi Medical Corporation Ultrasonic diagnostic apparatus
US7240001B2 (en) * 2001-12-14 2007-07-03 Microsoft Corporation Quality improvement techniques in an audio encoder
CN1922655A (zh) * 2004-07-06 2007-02-28 Matsushita Electric Industrial Co., Ltd. Audio signal encoding device, audio signal decoding device, method, and program
US7761289B2 (en) * 2005-10-24 2010-07-20 Lg Electronics Inc. Removing time delays in signal paths
WO2007080211A1 (en) * 2006-01-09 2007-07-19 Nokia Corporation Decoding of binaural audio signals
ATE527833T1 (de) * 2006-05-04 2011-10-15 Lg Electronics Inc Improvement of stereo audio signals by means of remixing
GB2453117B (en) * 2007-09-25 2012-05-23 Motorola Mobility Inc Apparatus and method for encoding a multi channel audio signal
US8175291B2 (en) * 2007-12-19 2012-05-08 Qualcomm Incorporated Systems, methods, and apparatus for multi-microphone based speech enhancement
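JPWO2009081567A1 (ja) 2007-12-21 2011-05-06 Panasonic Corporation Stereo signal conversion device, stereo signal inverse conversion device, and methods therefor
WO2009142017A1 (ja) * 2008-05-22 2009-11-26 Panasonic Corporation Stereo signal conversion device, stereo signal inverse conversion device, and methods therefor
CN102160113B (zh) 2008-08-11 2013-05-08 Nokia Corporation Multichannel audio encoder and decoder
CN101673545B (zh) * 2008-09-12 2011-11-16 Huawei Technologies Co., Ltd. Encoding and decoding method and apparatus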
WO2010085083A2 (en) * 2009-01-20 2010-07-29 Lg Electronics Inc. An apparatus for processing an audio signal and method thereof
US20100331048A1 (en) * 2009-06-25 2010-12-30 Qualcomm Incorporated M-s stereo reproduction at a device
US8463414B2 (en) 2010-08-09 2013-06-11 Motorola Mobility Llc Method and apparatus for estimating a parameter for low bit rate stereo transmission
US9552840B2 (en) * 2010-10-25 2017-01-24 Qualcomm Incorporated Three-dimensional sound capturing and reproducing with multi-microphones
US9767822B2 (en) * 2011-02-07 2017-09-19 Qualcomm Incorporated Devices for encoding and decoding a watermarked signal
JP5724044B2 (ja) * 2012-02-17 2015-05-27 Huawei Technologies Co., Ltd. Parametric encoder for encoding a multichannel audio signal
EP2839460A4 (en) 2012-04-18 2015-12-30 Nokia Technologies Oy Stereo audio signal encoder
US9865269B2 (en) * 2012-07-19 2018-01-09 Nokia Technologies Oy Stereo audio signal encoder
US9479886B2 (en) * 2012-07-20 2016-10-25 Qualcomm Incorporated Scalable downmix design with feedback for object-based surround codec
US10215551B2 (en) * 2012-07-27 2019-02-26 Praevium Research, Inc. Agile imaging system
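KR20160087827A (ko) * 2013-11-22 2016-07-22 Qualcomm Incorporated Selective phase compensation in high band coding
CN104700839B (zh) * 2015-02-26 2016-03-23 Shenzhen ZTE Mobile Telecom Co., Ltd. Method, apparatus, mobile phone, and system for multichannel sound acquisition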
US10152977B2 (en) 2015-11-20 2018-12-11 Qualcomm Incorporated Encoding of multiple audio signals

Also Published As

Publication number Publication date
TWI664624B (zh) 2019-07-01
EP3378064A1 (en) 2018-09-26
JP2018534625A (ja) 2018-11-22
WO2017087073A1 (en) 2017-05-26
JP2019207430A (ja) 2019-12-05
CN108292505A (zh) 2018-07-17
KR20190137181A (ko) 2019-12-10
KR102054606B1 (ko) 2019-12-10
CN108292505B (zh) 2022-05-13
US10586544B2 (en) 2020-03-10
US11094330B2 (en) 2021-08-17
EP3378064C0 (en) 2025-02-19
KR102391271B1 (ko) 2022-04-26
TW201719634A (zh) 2017-06-01
CA3001579C (en) 2021-01-12
US20170148447A1 (en) 2017-05-25
CN112951249A (zh) 2021-06-11
ES3014625T3 (en) 2025-04-23
CA3001579A1 (en) 2017-05-26
TW201935465A (zh) 2019-09-01
EP4075428A1 (en) 2022-10-19
EP3378064B1 (en) 2025-02-19
US20190035409A1 (en) 2019-01-31
KR20180084789A (ko) 2018-07-25
US20200202873A1 (en) 2020-06-25
BR112018010305A2 (pt) 2018-12-04
JP6786679B2 (ja) 2020-11-18
US10152977B2 (en) 2018-12-11
PL3378064T3 (pl) 2025-04-22
JP6571281B2 (ja) 2019-09-04

Similar Documents

Publication Publication Date Title
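TWI689917B (zh) Device for encoding multiple audio signals, method and apparatus of communication, and computer-readable storage device
TWI688243B (zh) Temporal offset estimation
TWI781140B (zh) Apparatus and method for target sample generation for encoding audio channels, non-transitory computer-readable medium comprising instructions, and device
TWI696172B (zh) Encoding of multiple audio signals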
HK40010036A (en) Target sample generation