CN108292505B - 多重音频信号的编码 - Google Patents

多重音频信号的编码 Download PDF

Info

Publication number
CN108292505B
CN108292505B CN201680066902.2A CN201680066902A CN108292505B CN 108292505 B CN108292505 B CN 108292505B CN 201680066902 A CN201680066902 A CN 201680066902A CN 108292505 B CN108292505 B CN 108292505B
Authority
CN
China
Prior art keywords
audio signal
signal
value
shift value
shift
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201680066902.2A
Other languages
English (en)
Chinese (zh)
Other versions
CN108292505A (zh
Inventor
文卡特拉曼·阿蒂
文卡塔·萨伯拉曼亚姆·强卓·赛克哈尔·奇比亚姆
丹尼尔·贾里德·辛德尔
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Qualcomm Inc
Original Assignee
Qualcomm Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Qualcomm Inc filed Critical Qualcomm Inc
Priority to CN202110193366.5A priority Critical patent/CN112951249A/zh
Publication of CN108292505A publication Critical patent/CN108292505A/zh
Application granted granted Critical
Publication of CN108292505B publication Critical patent/CN108292505B/zh
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/005Correction of errors induced by the transmission channel, if related to the coding algorithm
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/008Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/03Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
    • G10L25/06Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being correlation coefficients
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/48Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use
    • G10L25/51Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for comparison or discrimination
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04RLOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; ELECTRIC HEARING AIDS; PUBLIC ADDRESS SYSTEMS
    • H04R2499/00Aspects covered by H04R or H04S not otherwise provided for in their subgroups
    • H04R2499/10General applications
    • H04R2499/11Transducers incorporated or for use in hand-held devices, e.g. mobile phones, PDA's, camera's

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Health & Medical Sciences (AREA)
  • Signal Processing (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Computational Linguistics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Mathematical Physics (AREA)
  • Quality & Reliability (AREA)
  • Stereophonic System (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
CN201680066902.2A 2015-11-20 2016-09-26 多重音频信号的编码 Active CN108292505B (zh)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202110193366.5A CN112951249A (zh) 2015-11-20 2016-09-26 多重音频信号的编码

Applications Claiming Priority (5)

Application Number Priority Date Filing Date Title
US201562258369P 2015-11-20 2015-11-20
US62/258,369 2015-11-20
US15/274,041 2016-09-23
US15/274,041 US10152977B2 (en) 2015-11-20 2016-09-23 Encoding of multiple audio signals
PCT/US2016/053799 WO2017087073A1 (en) 2015-11-20 2016-09-26 Encoding of multiple audio signals

Related Child Applications (1)

Application Number Title Priority Date Filing Date
CN202110193366.5A Division CN112951249A (zh) 2015-11-20 2016-09-26 多重音频信号的编码

Publications (2)

Publication Number Publication Date
CN108292505A CN108292505A (zh) 2018-07-17
CN108292505B true CN108292505B (zh) 2022-05-13

Family

ID=57137264

Family Applications (2)

Application Number Title Priority Date Filing Date
CN201680066902.2A Active CN108292505B (zh) 2015-11-20 2016-09-26 多重音频信号的编码
CN202110193366.5A Pending CN112951249A (zh) 2015-11-20 2016-09-26 多重音频信号的编码

Family Applications After (1)

Application Number Title Priority Date Filing Date
CN202110193366.5A Pending CN112951249A (zh) 2015-11-20 2016-09-26 多重音频信号的编码

Country Status (10)

Country Link
US (3) US10152977B2 (https=)
EP (2) EP3378064B1 (https=)
JP (2) JP6571281B2 (https=)
KR (2) KR102054606B1 (https=)
CN (2) CN108292505B (https=)
CA (1) CA3001579C (https=)
ES (1) ES3014625T3 (https=)
PL (1) PL3378064T3 (https=)
TW (2) TWI689917B (https=)
WO (1) WO2017087073A1 (https=)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112951249A (zh) * 2015-11-20 2021-06-11 高通股份有限公司 多重音频信号的编码

Families Citing this family (16)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9407989B1 (en) 2015-06-30 2016-08-02 Arthur Woodrow Closed audio circuit
WO2017125544A1 (en) 2016-01-22 2017-07-27 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Apparatus and method for mdct m/s stereo with global ild with improved mid/side decision
US10304468B2 (en) * 2017-03-20 2019-05-28 Qualcomm Incorporated Target sample generation
JP6811312B2 (ja) * 2017-05-01 2021-01-13 パナソニック インテレクチュアル プロパティ コーポレーション オブ アメリカPanasonic Intellectual Property Corporation of America 符号化装置及び符号化方法
CN108877815B (zh) * 2017-05-16 2021-02-23 华为技术有限公司 一种立体声信号处理方法及装置
US10885921B2 (en) * 2017-07-07 2021-01-05 Qualcomm Incorporated Multi-stream audio coding
CN114898761A (zh) 2017-08-10 2022-08-12 华为技术有限公司 立体声信号编解码方法及装置
US10891960B2 (en) * 2017-09-11 2021-01-12 Qualcomm Incorproated Temporal offset estimation
US10872611B2 (en) * 2017-09-12 2020-12-22 Qualcomm Incorporated Selecting channel adjustment method for inter-frame temporal shift variations
US10839814B2 (en) * 2017-10-05 2020-11-17 Qualcomm Incorporated Encoding or decoding of audio signals
CN108428457B (zh) * 2018-02-12 2021-03-23 北京百度网讯科技有限公司 音频去重方法及装置
CN112352277B (zh) * 2018-07-03 2024-05-31 松下电器(美国)知识产权公司 编码装置及编码方法
US11295726B2 (en) * 2019-04-08 2022-04-05 International Business Machines Corporation Synthetic narrowband data generation for narrowband automatic speech recognition systems
KR20220058236A (ko) 2020-10-30 2022-05-09 삼성전자주식회사 오디오 데이터 처리 방법 및 그 장치
CN113870881B (zh) * 2021-09-26 2024-04-26 西南石油大学 一种鲁棒哈默斯坦子带样条自适应回声消除方法
US11900961B2 (en) * 2022-05-31 2024-02-13 Microsoft Technology Licensing, Llc Multichannel audio speech classification

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101297594A (zh) * 2005-10-24 2008-10-29 Lg电子株式会社 消除信号路径中的时间延迟
CN101690270A (zh) * 2006-05-04 2010-03-31 Lg电子株式会社 采用再混音能力增强音频
CN103181192A (zh) * 2010-10-25 2013-06-26 高通股份有限公司 利用多麦克风的三维声音捕获和再现
CN104246873A (zh) * 2012-02-17 2014-12-24 华为技术有限公司 用于编码多声道音频信号的参数编码器
CN104700839A (zh) * 2015-02-26 2015-06-10 深圳市中兴移动通信有限公司 多声道声音采集的方法、装置、手机及系统

Family Cites Families (24)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6317703B1 (en) * 1996-11-12 2001-11-13 International Business Machines Corporation Separation of a mixture of acoustic sources into its components
JP4137202B2 (ja) * 1997-10-17 2008-08-20 株式会社日立メディコ 超音波診断装置
US7240001B2 (en) * 2001-12-14 2007-07-03 Microsoft Corporation Quality improvement techniques in an audio encoder
KR100711989B1 (ko) * 2002-03-12 2007-05-02 노키아 코포레이션 효율적으로 개선된 스케일러블 오디오 부호화
CN1922655A (zh) * 2004-07-06 2007-02-28 松下电器产业株式会社 音频信号编码装置、音频信号解码装置、方法及程序
WO2007080211A1 (en) * 2006-01-09 2007-07-19 Nokia Corporation Decoding of binaural audio signals
GB2453117B (en) * 2007-09-25 2012-05-23 Motorola Mobility Inc Apparatus and method for encoding a multi channel audio signal
US8175291B2 (en) * 2007-12-19 2012-05-08 Qualcomm Incorporated Systems, methods, and apparatus for multi-microphone based speech enhancement
JPWO2009081567A1 (ja) 2007-12-21 2011-05-06 パナソニック株式会社 ステレオ信号変換装置、ステレオ信号逆変換装置およびこれらの方法
WO2009142017A1 (ja) * 2008-05-22 2009-11-26 パナソニック株式会社 ステレオ信号変換装置、ステレオ信号逆変換装置およびこれらの方法
CN102160113B (zh) 2008-08-11 2013-05-08 诺基亚公司 多声道音频编码器和解码器
CN101673545B (zh) * 2008-09-12 2011-11-16 华为技术有限公司 一种编解码方法及装置
WO2010085083A2 (en) * 2009-01-20 2010-07-29 Lg Electronics Inc. An apparatus for processing an audio signal and method thereof
US20100331048A1 (en) * 2009-06-25 2010-12-30 Qualcomm Incorporated M-s stereo reproduction at a device
US8848925B2 (en) * 2009-09-11 2014-09-30 Nokia Corporation Method, apparatus and computer program product for audio coding
US8463414B2 (en) 2010-08-09 2013-06-11 Motorola Mobility Llc Method and apparatus for estimating a parameter for low bit rate stereo transmission
EP2671221B1 (en) * 2011-02-03 2017-02-01 Telefonaktiebolaget LM Ericsson (publ) Determining the inter-channel time difference of a multi-channel audio signal
US9767822B2 (en) * 2011-02-07 2017-09-19 Qualcomm Incorporated Devices for encoding and decoding a watermarked signal
EP2839460A4 (en) 2012-04-18 2015-12-30 Nokia Technologies Oy STEREOTONSIGNALCODIERER
US9865269B2 (en) * 2012-07-19 2018-01-09 Nokia Technologies Oy Stereo audio signal encoder
US9479886B2 (en) * 2012-07-20 2016-10-25 Qualcomm Incorporated Scalable downmix design with feedback for object-based surround codec
US10215551B2 (en) * 2012-07-27 2019-02-26 Praevium Research, Inc. Agile imaging system
KR20160087827A (ko) * 2013-11-22 2016-07-22 퀄컴 인코포레이티드 고대역 코딩에서의 선택적 위상 보상
US10152977B2 (en) 2015-11-20 2018-12-11 Qualcomm Incorporated Encoding of multiple audio signals

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101297594A (zh) * 2005-10-24 2008-10-29 Lg电子株式会社 消除信号路径中的时间延迟
CN101690270A (zh) * 2006-05-04 2010-03-31 Lg电子株式会社 采用再混音能力增强音频
CN103181192A (zh) * 2010-10-25 2013-06-26 高通股份有限公司 利用多麦克风的三维声音捕获和再现
CN104246873A (zh) * 2012-02-17 2014-12-24 华为技术有限公司 用于编码多声道音频信号的参数编码器
CN104700839A (zh) * 2015-02-26 2015-06-10 深圳市中兴移动通信有限公司 多声道声音采集的方法、装置、手机及系统

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112951249A (zh) * 2015-11-20 2021-06-11 高通股份有限公司 多重音频信号的编码

Also Published As

Publication number Publication date
TWI664624B (zh) 2019-07-01
EP3378064A1 (en) 2018-09-26
TWI689917B (zh) 2020-04-01
JP2018534625A (ja) 2018-11-22
WO2017087073A1 (en) 2017-05-26
JP2019207430A (ja) 2019-12-05
CN108292505A (zh) 2018-07-17
KR20190137181A (ko) 2019-12-10
KR102054606B1 (ko) 2019-12-10
US10586544B2 (en) 2020-03-10
US11094330B2 (en) 2021-08-17
EP3378064C0 (en) 2025-02-19
KR102391271B1 (ko) 2022-04-26
TW201719634A (zh) 2017-06-01
CA3001579C (en) 2021-01-12
US20170148447A1 (en) 2017-05-25
CN112951249A (zh) 2021-06-11
ES3014625T3 (en) 2025-04-23
CA3001579A1 (en) 2017-05-26
TW201935465A (zh) 2019-09-01
EP4075428A1 (en) 2022-10-19
EP3378064B1 (en) 2025-02-19
US20190035409A1 (en) 2019-01-31
KR20180084789A (ko) 2018-07-25
US20200202873A1 (en) 2020-06-25
BR112018010305A2 (pt) 2018-12-04
JP6786679B2 (ja) 2020-11-18
US10152977B2 (en) 2018-12-11
PL3378064T3 (pl) 2025-04-22
JP6571281B2 (ja) 2019-09-04

Similar Documents

Publication Publication Date Title
CN108292505B (zh) 多重音频信号的编码
TWI688243B (zh) 時間性偏移估計
US10714101B2 (en) Target sample generation
CN108431890B (zh) 多音频信号的编码
HK40010036A (en) Target sample generation

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant