CA3001579C - Encoding of multiple audio signals - Google Patents
Encoding of multiple audio signals
Info
- Publication number
- CA3001579C
- Authority
- CA
- Canada
- Prior art keywords
- channel
- audio
- signal
- value
- shift value
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- 230000005236 sound signal Effects 0.000 title description 649
- 230000002123 temporal effect Effects 0.000 claims abstract description 185
- 238000000034 method Methods 0.000 claims description 126
- 230000004044 response Effects 0.000 claims description 119
- 230000001364 causal effect Effects 0.000 claims description 92
- 230000008859 change Effects 0.000 claims description 61
- 230000003111 delayed effect Effects 0.000 claims description 34
- 238000004891 communication Methods 0.000 claims description 11
- 230000000875 corresponding effect Effects 0.000 description 84
- 238000012952 Resampling Methods 0.000 description 39
- 238000010586 diagram Methods 0.000 description 30
- 230000005540 biological transmission Effects 0.000 description 24
- 230000006870 function Effects 0.000 description 15
- 238000012545 processing Methods 0.000 description 11
- 238000001914 filtration Methods 0.000 description 8
- 230000008569 process Effects 0.000 description 8
- 230000002441 reversible effect Effects 0.000 description 6
- 238000009499 grossing Methods 0.000 description 5
- 238000007670 refining Methods 0.000 description 5
- 238000013459 approach Methods 0.000 description 4
- 230000000694 effects Effects 0.000 description 4
- 239000000872 buffer Substances 0.000 description 3
- 238000005516 engineering process Methods 0.000 description 3
- 230000010363 phase shift Effects 0.000 description 3
- 230000007704 transition Effects 0.000 description 3
- 230000001413 cellular effect Effects 0.000 description 2
- 230000002596 correlated effect Effects 0.000 description 2
- 230000007774 longterm Effects 0.000 description 2
- 238000010295 mobile communication Methods 0.000 description 2
- 238000005070 sampling Methods 0.000 description 2
- 238000012546 transfer Methods 0.000 description 2
- 230000001131 transforming effect Effects 0.000 description 2
- 101100454739 Arabidopsis thaliana LUG gene Proteins 0.000 description 1
- 101100305998 Toxoplasma gondii (strain ATCC 50611 / Me49) RON2 gene Proteins 0.000 description 1
- 230000006978 adaptation Effects 0.000 description 1
- 230000003044 adaptive effect Effects 0.000 description 1
- 230000015572 biosynthetic process Effects 0.000 description 1
- 230000003139 buffering effect Effects 0.000 description 1
- 238000006243 chemical reaction Methods 0.000 description 1
- 230000001143 conditioned effect Effects 0.000 description 1
- 230000003750 conditioning effect Effects 0.000 description 1
- 230000001934 delay Effects 0.000 description 1
- 238000013461 design Methods 0.000 description 1
- 230000005284 excitation Effects 0.000 description 1
- 238000009432 framing Methods 0.000 description 1
- 230000007274 generation of a signal involved in cell-cell signaling Effects 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 238000004321 preservation Methods 0.000 description 1
- 230000009467 reduction Effects 0.000 description 1
- 238000012552 review Methods 0.000 description 1
- 230000001629 suppression Effects 0.000 description 1
- 230000001360 synchronised effect Effects 0.000 description 1
- 238000003786 synthesis reaction Methods 0.000 description 1
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/005—Correction of errors induced by the transmission channel, if related to the coding algorithm
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/008—Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/03—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
- G10L25/06—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being correlation coefficients
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/48—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use
- G10L25/51—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for comparison or discrimination
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04R—LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
- H04R2499/00—Aspects covered by H04R or H04S not otherwise provided for in their subgroups
- H04R2499/10—General applications
- H04R2499/11—Transducers incorporated or for use in hand-held devices, e.g. mobile phones, PDA's, camera's
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Health & Medical Sciences (AREA)
- Signal Processing (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Computational Linguistics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Mathematical Physics (AREA)
- Quality & Reliability (AREA)
- Stereophonic System (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
Applications Claiming Priority (5)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US201562258369P | 2015-11-20 | 2015-11-20 | |
| US62/258,369 | 2015-11-20 | | |
| US15/274,041 US10152977B2 (en) | 2015-11-20 | 2016-09-23 | Encoding of multiple audio signals |
| US15/274,041 | 2016-09-23 | | |
| PCT/US2016/053799 WO2017087073A1 (en) | 2015-11-20 | 2016-09-26 | Encoding of multiple audio signals |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| CA3001579A1 (en) | 2017-05-26 |
| CA3001579C (en) | 2021-01-12 |
Family
ID=57137264
Family Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CA3001579A Active CA3001579C (en) | 2015-11-20 | 2016-09-26 | Encoding of multiple audio signals |
Country Status (10)
| Country | Link |
|---|---|
| US (3) | US10152977B2 (en) |
| EP (2) | EP3378064B1 (en) |
| JP (2) | JP6571281B2 (en) |
| KR (2) | KR102391271B1 (en) |
| CN (2) | CN108292505B (en) |
| CA (1) | CA3001579C (en) |
| ES (1) | ES3014625T3 (en) |
| PL (1) | PL3378064T3 (en) |
| TW (2) | TWI664624B (en) |
| WO (1) | WO2017087073A1 (en) |
Families Citing this family (16)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US9407989B1 (en) | 2015-06-30 | 2016-08-02 | Arthur Woodrow | Closed audio circuit |
| US10152977B2 (en) | 2015-11-20 | 2018-12-11 | Qualcomm Incorporated | Encoding of multiple audio signals |
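| CN109074812B (zh) * | 2016-01-22 | 2023-11-17 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Apparatus and method for MDCT M/S stereo with global ILD and improved mid/side decision |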
| US10304468B2 (en) * | 2017-03-20 | 2019-05-28 | Qualcomm Incorporated | Target sample generation |
| WO2018203471A1 (ja) * | 2017-05-01 | 2018-11-08 | Panasonic Intellectual Property Corporation of America | Encoding device and encoding method |
| CN108877815B (zh) * | 2017-05-16 | 2021-02-23 | Huawei Technologies Co., Ltd. | Stereo signal processing method and device |
| US10885921B2 (en) * | 2017-07-07 | 2021-01-05 | Qualcomm Incorporated | Multi-stream audio coding |
| CN109389987B (zh) | 2017-08-10 | 2022-05-10 | Huawei Technologies Co., Ltd. | Audio encoding/decoding mode determination method and related product |
| US10891960B2 (en) * | 2017-09-11 | 2021-01-12 | Qualcomm Incorporated | Temporal offset estimation |
| US10872611B2 (en) * | 2017-09-12 | 2020-12-22 | Qualcomm Incorporated | Selecting channel adjustment method for inter-frame temporal shift variations |
| US10839814B2 (en) * | 2017-10-05 | 2020-11-17 | Qualcomm Incorporated | Encoding or decoding of audio signals |
| CN108428457B (zh) * | 2018-02-12 | 2021-03-23 | Beijing Baidu Netcom Science and Technology Co., Ltd. | Audio deduplication method and device |
| US11545165B2 (en) * | 2018-07-03 | 2023-01-03 | Panasonic Intellectual Property Corporation Of America | Encoding device and encoding method using a determined prediction parameter based on an energy difference between channels |
| US11295726B2 (en) * | 2019-04-08 | 2022-04-05 | International Business Machines Corporation | Synthetic narrowband data generation for narrowband automatic speech recognition systems |
| CN113870881B (zh) * | 2021-09-26 | 2024-04-26 | Southwest Petroleum University | Robust Hammerstein subband spline adaptive echo cancellation method |
| US11900961B2 (en) * | 2022-05-31 | 2024-02-13 | Microsoft Technology Licensing, Llc | Multichannel audio speech classification |
Family Cites Families (29)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US6317703B1 (en) * | 1996-11-12 | 2001-11-13 | International Business Machines Corporation | Separation of a mixture of acoustic sources into its components |
| JP4137202B2 (ja) * | 1997-10-17 | 2008-08-20 | Hitachi Medical Corporation | Ultrasonic diagnostic apparatus |
| US7240001B2 (en) * | 2001-12-14 | 2007-07-03 | Microsoft Corporation | Quality improvement techniques in an audio encoder |
| DE60214599T2 (de) * | 2002-03-12 | 2007-09-13 | Nokia Corp. | Scalable audio coding |
| WO2006004048A1 (ja) * | 2004-07-06 | 2006-01-12 | Matsushita Electric Industrial Co., Ltd. | Audio signal encoding device, audio signal decoding device, method, and program |
| US7716043B2 (en) * | 2005-10-24 | 2010-05-11 | Lg Electronics Inc. | Removing time delays in signal paths |
| WO2007080211A1 (en) * | 2006-01-09 | 2007-07-19 | Nokia Corporation | Decoding of binaural audio signals |
| EP1853092B1 (en) * | 2006-05-04 | 2011-10-05 | LG Electronics, Inc. | Enhancing stereo audio with remix capability |
| GB2453117B (en) * | 2007-09-25 | 2012-05-23 | Motorola Mobility Inc | Apparatus and method for encoding a multi channel audio signal |
| US8175291B2 (en) * | 2007-12-19 | 2012-05-08 | Qualcomm Incorporated | Systems, methods, and apparatus for multi-microphone based speech enhancement |
| US20100290629A1 (en) | 2007-12-21 | 2010-11-18 | Panasonic Corporation | Stereo signal converter, stereo signal inverter, and method therefor |
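| WO2009142017A1 (ja) * | 2008-05-22 | 2009-11-26 | Panasonic Corporation | Stereo signal conversion device, stereo signal inverse conversion device, and methods therefor |
| CN102160113B (zh) | 2008-08-11 | 2013-05-08 | Nokia Corporation | Multichannel audio encoder and decoder |
| CN101673545B (zh) * | 2008-09-12 | 2011-11-16 | Huawei Technologies Co., Ltd. | Encoding and decoding method and device |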
| EP2209328B1 (en) * | 2009-01-20 | 2013-10-23 | Lg Electronics Inc. | An apparatus for processing an audio signal and method thereof |
| US20100331048A1 (en) * | 2009-06-25 | 2010-12-30 | Qualcomm Incorporated | M-s stereo reproduction at a device |
| EP2476113B1 (en) * | 2009-09-11 | 2014-08-13 | Nokia Corporation | Method, apparatus and computer program product for audio coding |
| US8463414B2 (en) | 2010-08-09 | 2013-06-11 | Motorola Mobility Llc | Method and apparatus for estimating a parameter for low bit rate stereo transmission |
| US9552840B2 (en) * | 2010-10-25 | 2017-01-24 | Qualcomm Incorporated | Three-dimensional sound capturing and reproducing with multi-microphones |
| EP3182409B1 (en) * | 2011-02-03 | 2018-03-14 | Telefonaktiebolaget LM Ericsson (publ) | Determining the inter-channel time difference of a multi-channel audio signal |
| US9767822B2 (en) * | 2011-02-07 | 2017-09-19 | Qualcomm Incorporated | Devices for encoding and decoding a watermarked signal |
| EP2702776B1 (en) * | 2012-02-17 | 2015-09-23 | Huawei Technologies Co., Ltd. | Parametric encoder for encoding a multi-channel audio signal |
| EP2839460A4 (en) * | 2012-04-18 | 2015-12-30 | Nokia Technologies Oy | Stereo audio signal encoder |
| CN104641414A (zh) * | 2012-07-19 | 2015-05-20 | Nokia Corporation | Stereo audio signal encoder |
| US9479886B2 (en) * | 2012-07-20 | 2016-10-25 | Qualcomm Incorporated | Scalable downmix design with feedback for object-based surround codec |
| WO2014018950A1 (en) | 2012-07-27 | 2014-01-30 | Thorlabs, Inc. | Agile imaging system |
| WO2015077641A1 (en) * | 2013-11-22 | 2015-05-28 | Qualcomm Incorporated | Selective phase compensation in high band coding |
| CN104700839B (zh) * | 2015-02-26 | 2016-03-23 | Shenzhen ZTE Mobile Telecom Co., Ltd. | Method, device, mobile phone, and system for multichannel sound collection |
| US10152977B2 (en) | 2015-11-20 | 2018-12-11 | Qualcomm Incorporated | Encoding of multiple audio signals |
-
2016
- 2016-09-23 US US15/274,041 patent/US10152977B2/en active Active
- 2016-09-26 EP EP16781923.4A patent/EP3378064B1/en active Active
- 2016-09-26 KR KR1020197035702A patent/KR102391271B1/ko active Active
- 2016-09-26 CA CA3001579A patent/CA3001579C/en active Active
- 2016-09-26 EP EP22167183.7A patent/EP4075428A1/en active Pending
- 2016-09-26 PL PL16781923.4T patent/PL3378064T3/pl unknown
- 2016-09-26 CN CN201680066902.2A patent/CN108292505B/zh active Active
- 2016-09-26 WO PCT/US2016/053799 patent/WO2017087073A1/en not_active Ceased
- 2016-09-26 ES ES16781923T patent/ES3014625T3/es active Active
- 2016-09-26 KR KR1020187013766A patent/KR102054606B1/ko active Active
- 2016-09-26 CN CN202110193366.5A patent/CN112951249A/zh active Pending
- 2016-09-26 JP JP2018525569A patent/JP6571281B2/ja active Active
- 2016-10-13 TW TW105133088A patent/TWI664624B/zh active
- 2016-10-13 TW TW108117949A patent/TWI689917B/zh active
-
2018
- 2018-10-04 US US16/152,357 patent/US10586544B2/en active Active
-
2019
- 2019-08-07 JP JP2019145550A patent/JP6786679B2/ja active Active
-
2020
- 2020-02-28 US US16/805,289 patent/US11094330B2/en active Active
Also Published As
| Publication number | Publication date |
|---|---|
| KR20190137181A (ko) | 2019-12-10 |
| TWI664624B (zh) | 2019-07-01 |
| CN112951249A (zh) | 2021-06-11 |
| JP2019207430A (ja) | 2019-12-05 |
| ES3014625T3 (en) | 2025-04-23 |
| EP3378064A1 (en) | 2018-09-26 |
| US11094330B2 (en) | 2021-08-17 |
| US20170148447A1 (en) | 2017-05-25 |
| EP3378064B1 (en) | 2025-02-19 |
| TW201935465A (zh) | 2019-09-01 |
| WO2017087073A1 (en) | 2017-05-26 |
| KR20180084789A (ko) | 2018-07-25 |
| PL3378064T3 (pl) | 2025-04-22 |
| CN108292505B (zh) | 2022-05-13 |
| US10586544B2 (en) | 2020-03-10 |
| TWI689917B (zh) | 2020-04-01 |
| KR102391271B1 (ko) | 2022-04-26 |
| CN108292505A (zh) | 2018-07-17 |
| JP2018534625A (ja) | 2018-11-22 |
| BR112018010305A2 (pt) | 2018-12-04 |
| JP6571281B2 (ja) | 2019-09-04 |
| KR102054606B1 (ko) | 2019-12-10 |
| CA3001579A1 (en) | 2017-05-26 |
| JP6786679B2 (ja) | 2020-11-18 |
| US10152977B2 (en) | 2018-12-11 |
| TW201719634A (zh) | 2017-06-01 |
| US20190035409A1 (en) | 2019-01-31 |
| EP3378064C0 (en) | 2025-02-19 |
| US20200202873A1 (en) | 2020-06-25 |
| EP4075428A1 (en) | 2022-10-19 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| CA3001579C (en) | | Encoding of multiple audio signals |
| CA3004770C (en) | | Temporal offset estimation |
| AU2018237285B2 (en) | | Target sample generation |
| AU2016370363B2 (en) | | Encoding of multiple audio signals |
| HK40010036A (en) | | Target sample generation |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| | EEER | Examination request | Effective date: 20190123 |