CN107851441B - 用于对输入音频信号进行编码的方法和设备 - Google Patents
用于对输入音频信号进行编码的方法和设备 Download PDFInfo
- Publication number
- CN107851441B CN107851441B CN201680045819.7A CN201680045819A CN107851441B CN 107851441 B CN107851441 B CN 107851441B CN 201680045819 A CN201680045819 A CN 201680045819A CN 107851441 B CN107851441 B CN 107851441B
- Authority
- CN
- China
- Prior art keywords
- signal
- band
- input audio
- audio signal
- scaling
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
Images
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/16—Vocoder architecture
- G10L19/167—Audio streaming, i.e. formatting and decoding of an encoded audio signal representation into a data stream for transmission or storage purposes
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/02—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
- G10L19/0204—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders using subband decomposition
- G10L19/0208—Subband vocoders
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/16—Vocoder architecture
- G10L19/173—Transcoding, i.e. converting between two coded representations avoiding cascaded coding-decoding
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/16—Vocoder architecture
- G10L19/18—Vocoders using multiple modes
- G10L19/24—Variable rate codecs, e.g. for generating different qualities using a scalable representation such as hierarchical encoding or layered encoding
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/03—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
- G10L25/06—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being correlation coefficients
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Computational Linguistics (AREA)
- Signal Processing (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Spectroscopy & Molecular Physics (AREA)
- Quality & Reliability (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
Applications Claiming Priority (5)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US201562206197P | 2015-08-17 | 2015-08-17 | |
| US62/206,197 | 2015-08-17 | ||
| US15/169,633 US9830921B2 (en) | 2015-08-17 | 2016-05-31 | High-band target signal control |
| US15/169,633 | 2016-05-31 | ||
| PCT/US2016/042648 WO2017030705A1 (en) | 2015-08-17 | 2016-07-15 | High-band target signal control |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| CN107851441A CN107851441A (zh) | 2018-03-27 |
| CN107851441B true CN107851441B (zh) | 2021-09-14 |
Family
ID=56618240
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| CN201680045819.7A Active CN107851441B (zh) | 2015-08-17 | 2016-07-15 | 用于对输入音频信号进行编码的方法和设备 |
Country Status (10)
| Country | Link |
|---|---|
| US (1) | US9830921B2 (enExample) |
| EP (1) | EP3338282B1 (enExample) |
| JP (1) | JP6779280B2 (enExample) |
| KR (1) | KR102612134B1 (enExample) |
| CN (1) | CN107851441B (enExample) |
| BR (1) | BR112018002979B1 (enExample) |
| CA (1) | CA2993004C (enExample) |
| ES (1) | ES2842175T3 (enExample) |
| TW (1) | TWI642052B (enExample) |
| WO (1) | WO2017030705A1 (enExample) |
Families Citing this family (14)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| MY190424A (en) * | 2016-04-12 | 2022-04-21 | Fraunhofer Ges Forschung | Audio encoder for encoding an audio signal, method for encoding an audio signal and computer program under consideration of a detected peak spectral region in an upper frequency band |
| US10431231B2 (en) * | 2017-06-29 | 2019-10-01 | Qualcomm Incorporated | High-band residual prediction with time-domain inter-channel bandwidth extension |
| EP3483882A1 (en) | 2017-11-10 | 2019-05-15 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Controlling bandwidth in encoders and/or decoders |
| WO2019091576A1 (en) | 2017-11-10 | 2019-05-16 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Audio encoders, audio decoders, methods and computer programs adapting an encoding and decoding of least significant bits |
| EP3483878A1 (en) | 2017-11-10 | 2019-05-15 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Audio decoder supporting a set of different loss concealment tools |
| EP3483880A1 (en) | 2017-11-10 | 2019-05-15 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Temporal noise shaping |
| EP3483879A1 (en) | 2017-11-10 | 2019-05-15 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Analysis/synthesis windowing function for modulated lapped transformation |
| EP3483883A1 (en) | 2017-11-10 | 2019-05-15 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Audio coding and decoding with selective postfiltering |
| WO2019091573A1 (en) | 2017-11-10 | 2019-05-16 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Apparatus and method for encoding and decoding an audio signal using downsampling or interpolation of scale parameters |
| EP3483886A1 (en) | 2017-11-10 | 2019-05-15 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Selecting pitch lag |
| EP3483884A1 (en) | 2017-11-10 | 2019-05-15 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Signal filtering |
| KR102271357B1 (ko) * | 2019-06-28 | 2021-07-01 | 국방과학연구소 | 보코더 유형 판별 방법 및 장치 |
| WO2021032719A1 (en) * | 2019-08-20 | 2021-02-25 | Dolby International Ab | Multi-lag format for audio coding |
| TWI835350B (zh) * | 2022-10-14 | 2024-03-11 | 智原科技股份有限公司 | 運用於乙太網路的斷線偵測器與斷線偵測方法 |
Citations (5)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| MXPA04011751A (es) * | 2002-05-31 | 2005-06-08 | Voiceage Corp | Metodo y dispositivo para ocultamiento de borrado adecuado eficiente en codecs de habla de base predictiva lineal. |
| CN101183526A (zh) * | 2006-11-14 | 2008-05-21 | 中兴通讯股份有限公司 | 一种检测语音信号基音周期的方法 |
| CN101228576A (zh) * | 2005-07-21 | 2008-07-23 | 皇家飞利浦电子股份有限公司 | 音频信号修改 |
| CN101379551A (zh) * | 2005-12-28 | 2009-03-04 | 沃伊斯亚吉公司 | 在语音编解码器中用于有效帧擦除隐蔽的方法和装置 |
| CA2917795A1 (en) * | 2013-07-12 | 2015-01-15 | Orange | Optimized scale factor for frequency band extension in an audio frequency signal decoder |
Family Cites Families (5)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| DE3166082D1 (en) * | 1980-12-09 | 1984-10-18 | Secretary Industry Brit | Speech recognition systems |
| US7092881B1 (en) * | 1999-07-26 | 2006-08-15 | Lucent Technologies Inc. | Parametric speech codec for representing synthetic speech in the presence of background noise |
| ES2636443T3 (es) * | 2005-04-01 | 2017-10-05 | Qualcomm Incorporated | Sistemas, procedimientos y aparatos para codificación de voz de banda ancha |
| EP1901281B1 (en) * | 2005-06-09 | 2013-03-20 | AGI Inc. | Speech analyzer detecting pitch frequency, speech analyzing method, and speech analyzing program |
| AU2014211479B2 (en) * | 2013-01-29 | 2017-02-23 | Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. | Audio encoder, audio decoder, method for providing an encoded audio information, method for providing a decoded audio information, computer program and encoded representation using a signal-adaptive bandwidth extension |
-
2016
- 2016-05-31 US US15/169,633 patent/US9830921B2/en active Active
- 2016-07-15 WO PCT/US2016/042648 patent/WO2017030705A1/en not_active Ceased
- 2016-07-15 ES ES16750298T patent/ES2842175T3/es active Active
- 2016-07-15 CA CA2993004A patent/CA2993004C/en active Active
- 2016-07-15 CN CN201680045819.7A patent/CN107851441B/zh active Active
- 2016-07-15 KR KR1020187004516A patent/KR102612134B1/ko active Active
- 2016-07-15 BR BR112018002979-3A patent/BR112018002979B1/pt active IP Right Grant
- 2016-07-15 EP EP16750298.8A patent/EP3338282B1/en active Active
- 2016-07-15 JP JP2018507733A patent/JP6779280B2/ja active Active
- 2016-08-15 TW TW105125969A patent/TWI642052B/zh active
Patent Citations (5)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| MXPA04011751A (es) * | 2002-05-31 | 2005-06-08 | Voiceage Corp | Metodo y dispositivo para ocultamiento de borrado adecuado eficiente en codecs de habla de base predictiva lineal. |
| CN101228576A (zh) * | 2005-07-21 | 2008-07-23 | 皇家飞利浦电子股份有限公司 | 音频信号修改 |
| CN101379551A (zh) * | 2005-12-28 | 2009-03-04 | 沃伊斯亚吉公司 | 在语音编解码器中用于有效帧擦除隐蔽的方法和装置 |
| CN101183526A (zh) * | 2006-11-14 | 2008-05-21 | 中兴通讯股份有限公司 | 一种检测语音信号基音周期的方法 |
| CA2917795A1 (en) * | 2013-07-12 | 2015-01-15 | Orange | Optimized scale factor for frequency band extension in an audio frequency signal decoder |
Also Published As
| Publication number | Publication date |
|---|---|
| TWI642052B (zh) | 2018-11-21 |
| ES2842175T3 (es) | 2021-07-13 |
| EP3338282B1 (en) | 2020-09-23 |
| US9830921B2 (en) | 2017-11-28 |
| US20170053658A1 (en) | 2017-02-23 |
| CA2993004C (en) | 2023-05-02 |
| BR112018002979B1 (pt) | 2024-03-12 |
| TW201713061A (zh) | 2017-04-01 |
| WO2017030705A1 (en) | 2017-02-23 |
| CN107851441A (zh) | 2018-03-27 |
| JP6779280B2 (ja) | 2020-11-04 |
| EP3338282A1 (en) | 2018-06-27 |
| CA2993004A1 (en) | 2017-02-23 |
| KR102612134B1 (ko) | 2023-12-08 |
| KR20180041131A (ko) | 2018-04-23 |
| BR112018002979A2 (pt) | 2018-09-25 |
| JP2018528464A (ja) | 2018-09-27 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| CN107851441B (zh) | 用于对输入音频信号进行编码的方法和设备 | |
| CA2952214C (en) | Temporal gain adjustment based on high-band signal characteristic | |
| CN107851439B (zh) | 在带宽变换周期期间的信号再使用 | |
| US9818419B2 (en) | High-band signal coding using multiple sub-bands | |
| CN106133832A (zh) | 在装置处切换译码技术的设备及方法 |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| PB01 | Publication | ||
| PB01 | Publication | ||
| SE01 | Entry into force of request for substantive examination | ||
| SE01 | Entry into force of request for substantive examination | ||
| GR01 | Patent grant | ||
| GR01 | Patent grant |