TWI664624B - 編碼多重音訊信號之器件,通信之方法及裝置及電腦可讀儲存器件 - Google Patents
編碼多重音訊信號之器件,通信之方法及裝置及電腦可讀儲存器件 Download PDFInfo
- Publication number
- TWI664624B TWI664624B TW105133088A TW105133088A TWI664624B TW I664624 B TWI664624 B TW I664624B TW 105133088 A TW105133088 A TW 105133088A TW 105133088 A TW105133088 A TW 105133088A TW I664624 B TWI664624 B TW I664624B
- Authority
- TW
- Taiwan
- Prior art keywords
- channel
- value
- signal
- audio
- shift value
- Prior art date
Links
- 230000005236 sound signal Effects 0.000 title claims description 649
- 238000000034 method Methods 0.000 title claims description 136
- 238000004891 communication Methods 0.000 title claims description 13
- 230000004044 response Effects 0.000 claims description 99
- 230000001364 causal effect Effects 0.000 claims description 93
- 230000008859 change Effects 0.000 claims description 70
- 230000003111 delayed effect Effects 0.000 claims description 26
- 230000000875 corresponding effect Effects 0.000 description 91
- 238000012952 Resampling Methods 0.000 description 38
- 238000010586 diagram Methods 0.000 description 33
- 230000005540 biological transmission Effects 0.000 description 23
- 238000005070 sampling Methods 0.000 description 22
- 230000006870 function Effects 0.000 description 15
- 238000012545 processing Methods 0.000 description 10
- 238000004458 analytical method Methods 0.000 description 9
- 238000001914 filtration Methods 0.000 description 8
- 230000009977 dual effect Effects 0.000 description 7
- 230000008569 process Effects 0.000 description 7
- 238000005516 engineering process Methods 0.000 description 6
- 238000009499 grossing Methods 0.000 description 5
- 238000004364 calculation method Methods 0.000 description 4
- 230000000694 effects Effects 0.000 description 4
- 239000000872 buffer Substances 0.000 description 3
- 238000006243 chemical reaction Methods 0.000 description 3
- 230000037433 frameshift Effects 0.000 description 3
- 230000010363 phase shift Effects 0.000 description 3
- 230000002441 reversible effect Effects 0.000 description 3
- 230000002123 temporal effect Effects 0.000 description 3
- 238000012546 transfer Methods 0.000 description 3
- 230000007704 transition Effects 0.000 description 3
- 230000001413 cellular effect Effects 0.000 description 2
- 239000002131 composite material Substances 0.000 description 2
- 230000007774 longterm Effects 0.000 description 2
- 239000000203 mixture Substances 0.000 description 2
- 238000010295 mobile communication Methods 0.000 description 2
- 238000005457 optimization Methods 0.000 description 2
- 230000006978 adaptation Effects 0.000 description 1
- 230000003044 adaptive effect Effects 0.000 description 1
- 230000008901 benefit Effects 0.000 description 1
- 230000015572 biosynthetic process Effects 0.000 description 1
- 210000004556 brain Anatomy 0.000 description 1
- 230000003139 buffering effect Effects 0.000 description 1
- 230000003750 conditioning effect Effects 0.000 description 1
- 230000002596 correlated effect Effects 0.000 description 1
- 230000008878 coupling Effects 0.000 description 1
- 238000010168 coupling process Methods 0.000 description 1
- 238000005859 coupling reaction Methods 0.000 description 1
- 230000001934 delay Effects 0.000 description 1
- 238000013461 design Methods 0.000 description 1
- 238000006073 displacement reaction Methods 0.000 description 1
- 230000007613 environmental effect Effects 0.000 description 1
- 230000005284 excitation Effects 0.000 description 1
- 238000009432 framing Methods 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 230000008447 perception Effects 0.000 description 1
- 238000004321 preservation Methods 0.000 description 1
- 230000009467 reduction Effects 0.000 description 1
- 230000001629 suppression Effects 0.000 description 1
- 230000001360 synchronised effect Effects 0.000 description 1
- 238000003786 synthesis reaction Methods 0.000 description 1
- 230000008685 targeting Effects 0.000 description 1
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/005—Correction of errors induced by the transmission channel, if related to the coding algorithm
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/008—Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/03—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
- G10L25/06—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being correlation coefficients
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/48—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use
- G10L25/51—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for comparison or discrimination
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04R—LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
- H04R2499/00—Aspects covered by H04R or H04S not otherwise provided for in their subgroups
- H04R2499/10—General applications
- H04R2499/11—Transducers incorporated or for use in hand-held devices, e.g. mobile phones, PDA's, camera's
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Health & Medical Sciences (AREA)
- Signal Processing (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Computational Linguistics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Mathematical Physics (AREA)
- Quality & Reliability (AREA)
- Stereophonic System (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
Applications Claiming Priority (4)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US201562258369P | 2015-11-20 | 2015-11-20 | |
| US62/258,369 | 2015-11-20 | ||
| US15/274,041 US10152977B2 (en) | 2015-11-20 | 2016-09-23 | Encoding of multiple audio signals |
| US15/274,041 | 2016-09-23 |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| TW201719634A TW201719634A (zh) | 2017-06-01 |
| TWI664624B true TWI664624B (zh) | 2019-07-01 |
Family
ID=57137264
Family Applications (2)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| TW105133088A TWI664624B (zh) | 2015-11-20 | 2016-10-13 | 編碼多重音訊信號之器件,通信之方法及裝置及電腦可讀儲存器件 |
| TW108117949A TWI689917B (zh) | 2015-11-20 | 2016-10-13 | 編碼多重音訊信號之器件,通信之方法及裝置及電腦可讀儲存器件 |
Family Applications After (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| TW108117949A TWI689917B (zh) | 2015-11-20 | 2016-10-13 | 編碼多重音訊信號之器件,通信之方法及裝置及電腦可讀儲存器件 |
Country Status (10)
| Country | Link |
|---|---|
| US (3) | US10152977B2 (enExample) |
| EP (2) | EP3378064B1 (enExample) |
| JP (2) | JP6571281B2 (enExample) |
| KR (2) | KR102391271B1 (enExample) |
| CN (2) | CN108292505B (enExample) |
| CA (1) | CA3001579C (enExample) |
| ES (1) | ES3014625T3 (enExample) |
| PL (1) | PL3378064T3 (enExample) |
| TW (2) | TWI664624B (enExample) |
| WO (1) | WO2017087073A1 (enExample) |
Families Citing this family (16)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US9407989B1 (en) | 2015-06-30 | 2016-08-02 | Arthur Woodrow | Closed audio circuit |
| US10152977B2 (en) | 2015-11-20 | 2018-12-11 | Qualcomm Incorporated | Encoding of multiple audio signals |
| CN109074812B (zh) * | 2016-01-22 | 2023-11-17 | 弗劳恩霍夫应用研究促进协会 | 用于具有全局ild和改进的中/侧决策的mdct m/s立体声的装置和方法 |
| US10304468B2 (en) * | 2017-03-20 | 2019-05-28 | Qualcomm Incorporated | Target sample generation |
| WO2018203471A1 (ja) * | 2017-05-01 | 2018-11-08 | パナソニック インテレクチュアル プロパティ コーポレーション オブ アメリカ | 符号化装置及び符号化方法 |
| CN108877815B (zh) * | 2017-05-16 | 2021-02-23 | 华为技术有限公司 | 一种立体声信号处理方法及装置 |
| US10885921B2 (en) * | 2017-07-07 | 2021-01-05 | Qualcomm Incorporated | Multi-stream audio coding |
| CN109389987B (zh) | 2017-08-10 | 2022-05-10 | 华为技术有限公司 | 音频编解码模式确定方法和相关产品 |
| US10891960B2 (en) * | 2017-09-11 | 2021-01-12 | Qualcomm Incorproated | Temporal offset estimation |
| US10872611B2 (en) * | 2017-09-12 | 2020-12-22 | Qualcomm Incorporated | Selecting channel adjustment method for inter-frame temporal shift variations |
| US10839814B2 (en) * | 2017-10-05 | 2020-11-17 | Qualcomm Incorporated | Encoding or decoding of audio signals |
| CN108428457B (zh) * | 2018-02-12 | 2021-03-23 | 北京百度网讯科技有限公司 | 音频去重方法及装置 |
| US11545165B2 (en) * | 2018-07-03 | 2023-01-03 | Panasonic Intellectual Property Corporation Of America | Encoding device and encoding method using a determined prediction parameter based on an energy difference between channels |
| US11295726B2 (en) * | 2019-04-08 | 2022-04-05 | International Business Machines Corporation | Synthetic narrowband data generation for narrowband automatic speech recognition systems |
| CN113870881B (zh) * | 2021-09-26 | 2024-04-26 | 西南石油大学 | 一种鲁棒哈默斯坦子带样条自适应回声消除方法 |
| US11900961B2 (en) * | 2022-05-31 | 2024-02-13 | Microsoft Technology Licensing, Llc | Multichannel audio speech classification |
Citations (3)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20030220783A1 (en) * | 2002-03-12 | 2003-11-27 | Sebastian Streich | Efficiency improvements in scalable audio coding |
| US20120232912A1 (en) * | 2009-09-11 | 2012-09-13 | Mikko Tammi | Method, Apparatus and Computer Program Product for Audio Coding |
| US20130304481A1 (en) * | 2011-02-03 | 2013-11-14 | Telefonaktiebolaget L M Ericsson (Publ) | Determining the Inter-Channel Time Difference of a Multi-Channel Audio Signal |
Family Cites Families (26)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US6317703B1 (en) * | 1996-11-12 | 2001-11-13 | International Business Machines Corporation | Separation of a mixture of acoustic sources into its components |
| JP4137202B2 (ja) * | 1997-10-17 | 2008-08-20 | 株式会社日立メディコ | 超音波診断装置 |
| US7240001B2 (en) * | 2001-12-14 | 2007-07-03 | Microsoft Corporation | Quality improvement techniques in an audio encoder |
| WO2006004048A1 (ja) * | 2004-07-06 | 2006-01-12 | Matsushita Electric Industrial Co., Ltd. | オーディオ信号符号化装置、オーディオ信号復号化装置、方法、及びプログラム |
| US7716043B2 (en) * | 2005-10-24 | 2010-05-11 | Lg Electronics Inc. | Removing time delays in signal paths |
| WO2007080211A1 (en) * | 2006-01-09 | 2007-07-19 | Nokia Corporation | Decoding of binaural audio signals |
| EP1853092B1 (en) * | 2006-05-04 | 2011-10-05 | LG Electronics, Inc. | Enhancing stereo audio with remix capability |
| GB2453117B (en) * | 2007-09-25 | 2012-05-23 | Motorola Mobility Inc | Apparatus and method for encoding a multi channel audio signal |
| US8175291B2 (en) * | 2007-12-19 | 2012-05-08 | Qualcomm Incorporated | Systems, methods, and apparatus for multi-microphone based speech enhancement |
| US20100290629A1 (en) | 2007-12-21 | 2010-11-18 | Panasonic Corporation | Stereo signal converter, stereo signal inverter, and method therefor |
| WO2009142017A1 (ja) * | 2008-05-22 | 2009-11-26 | パナソニック株式会社 | ステレオ信号変換装置、ステレオ信号逆変換装置およびこれらの方法 |
| CN102160113B (zh) | 2008-08-11 | 2013-05-08 | 诺基亚公司 | 多声道音频编码器和解码器 |
| CN101673545B (zh) * | 2008-09-12 | 2011-11-16 | 华为技术有限公司 | 一种编解码方法及装置 |
| EP2209328B1 (en) * | 2009-01-20 | 2013-10-23 | Lg Electronics Inc. | An apparatus for processing an audio signal and method thereof |
| US20100331048A1 (en) * | 2009-06-25 | 2010-12-30 | Qualcomm Incorporated | M-s stereo reproduction at a device |
| US8463414B2 (en) | 2010-08-09 | 2013-06-11 | Motorola Mobility Llc | Method and apparatus for estimating a parameter for low bit rate stereo transmission |
| US9552840B2 (en) * | 2010-10-25 | 2017-01-24 | Qualcomm Incorporated | Three-dimensional sound capturing and reproducing with multi-microphones |
| US9767822B2 (en) * | 2011-02-07 | 2017-09-19 | Qualcomm Incorporated | Devices for encoding and decoding a watermarked signal |
| EP2702776B1 (en) * | 2012-02-17 | 2015-09-23 | Huawei Technologies Co., Ltd. | Parametric encoder for encoding a multi-channel audio signal |
| EP2839460A4 (en) * | 2012-04-18 | 2015-12-30 | Nokia Technologies Oy | STEREOTONSIGNALCODIERER |
| CN104641414A (zh) * | 2012-07-19 | 2015-05-20 | 诺基亚公司 | 立体声音频信号编码器 |
| US9479886B2 (en) * | 2012-07-20 | 2016-10-25 | Qualcomm Incorporated | Scalable downmix design with feedback for object-based surround codec |
| WO2014018950A1 (en) | 2012-07-27 | 2014-01-30 | Thorlabs, Inc. | Agile imaging system |
| WO2015077641A1 (en) * | 2013-11-22 | 2015-05-28 | Qualcomm Incorporated | Selective phase compensation in high band coding |
| CN104700839B (zh) * | 2015-02-26 | 2016-03-23 | 深圳市中兴移动通信有限公司 | 多声道声音采集的方法、装置、手机及系统 |
| US10152977B2 (en) | 2015-11-20 | 2018-12-11 | Qualcomm Incorporated | Encoding of multiple audio signals |
-
2016
- 2016-09-23 US US15/274,041 patent/US10152977B2/en active Active
- 2016-09-26 EP EP16781923.4A patent/EP3378064B1/en active Active
- 2016-09-26 KR KR1020197035702A patent/KR102391271B1/ko active Active
- 2016-09-26 CA CA3001579A patent/CA3001579C/en active Active
- 2016-09-26 EP EP22167183.7A patent/EP4075428A1/en active Pending
- 2016-09-26 PL PL16781923.4T patent/PL3378064T3/pl unknown
- 2016-09-26 CN CN201680066902.2A patent/CN108292505B/zh active Active
- 2016-09-26 WO PCT/US2016/053799 patent/WO2017087073A1/en not_active Ceased
- 2016-09-26 ES ES16781923T patent/ES3014625T3/es active Active
- 2016-09-26 KR KR1020187013766A patent/KR102054606B1/ko active Active
- 2016-09-26 CN CN202110193366.5A patent/CN112951249A/zh active Pending
- 2016-09-26 JP JP2018525569A patent/JP6571281B2/ja active Active
- 2016-10-13 TW TW105133088A patent/TWI664624B/zh active
- 2016-10-13 TW TW108117949A patent/TWI689917B/zh active
-
2018
- 2018-10-04 US US16/152,357 patent/US10586544B2/en active Active
-
2019
- 2019-08-07 JP JP2019145550A patent/JP6786679B2/ja active Active
-
2020
- 2020-02-28 US US16/805,289 patent/US11094330B2/en active Active
Patent Citations (3)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20030220783A1 (en) * | 2002-03-12 | 2003-11-27 | Sebastian Streich | Efficiency improvements in scalable audio coding |
| US20120232912A1 (en) * | 2009-09-11 | 2012-09-13 | Mikko Tammi | Method, Apparatus and Computer Program Product for Audio Coding |
| US20130304481A1 (en) * | 2011-02-03 | 2013-11-14 | Telefonaktiebolaget L M Ericsson (Publ) | Determining the Inter-Channel Time Difference of a Multi-Channel Audio Signal |
Also Published As
| Publication number | Publication date |
|---|---|
| KR20190137181A (ko) | 2019-12-10 |
| CN112951249A (zh) | 2021-06-11 |
| JP2019207430A (ja) | 2019-12-05 |
| ES3014625T3 (en) | 2025-04-23 |
| EP3378064A1 (en) | 2018-09-26 |
| US11094330B2 (en) | 2021-08-17 |
| US20170148447A1 (en) | 2017-05-25 |
| EP3378064B1 (en) | 2025-02-19 |
| TW201935465A (zh) | 2019-09-01 |
| WO2017087073A1 (en) | 2017-05-26 |
| KR20180084789A (ko) | 2018-07-25 |
| PL3378064T3 (pl) | 2025-04-22 |
| CN108292505B (zh) | 2022-05-13 |
| US10586544B2 (en) | 2020-03-10 |
| TWI689917B (zh) | 2020-04-01 |
| KR102391271B1 (ko) | 2022-04-26 |
| CN108292505A (zh) | 2018-07-17 |
| JP2018534625A (ja) | 2018-11-22 |
| BR112018010305A2 (pt) | 2018-12-04 |
| JP6571281B2 (ja) | 2019-09-04 |
| KR102054606B1 (ko) | 2019-12-10 |
| CA3001579A1 (en) | 2017-05-26 |
| JP6786679B2 (ja) | 2020-11-18 |
| US10152977B2 (en) | 2018-12-11 |
| TW201719634A (zh) | 2017-06-01 |
| US20190035409A1 (en) | 2019-01-31 |
| EP3378064C0 (en) | 2025-02-19 |
| CA3001579C (en) | 2021-01-12 |
| US20200202873A1 (en) | 2020-06-25 |
| EP4075428A1 (en) | 2022-10-19 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| TWI664624B (zh) | 編碼多重音訊信號之器件,通信之方法及裝置及電腦可讀儲存器件 | |
| TWI688243B (zh) | 時間性偏移估計 | |
| US10714101B2 (en) | Target sample generation | |
| US10115403B2 (en) | Encoding of multiple audio signals | |
| HK40010036A (en) | Target sample generation |