DK3460794T3 - Fremgangsmåde og apparat til lydindkodning - Google Patents
Fremgangsmåde og apparat til lydindkodning Download PDFInfo
- Publication number
- DK3460794T3 DK3460794T3 DK18167140.5T DK18167140T DK3460794T3 DK 3460794 T3 DK3460794 T3 DK 3460794T3 DK 18167140 T DK18167140 T DK 18167140T DK 3460794 T3 DK3460794 T3 DK 3460794T3
- Authority
- DK
- Denmark
- Prior art keywords
- sound encoding
- encoding
- sound
- Prior art date
Links
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/16—Vocoder architecture
- G10L19/18—Vocoders using multiple modes
- G10L19/22—Mode decision, i.e. based on audio signal content versus external parameters
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/02—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/02—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
- G10L19/0204—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders using subband decomposition
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/02—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
- G10L19/032—Quantisation or dequantisation of spectral components
- G10L19/035—Scalar quantisation
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/06—Determination or coding of the spectral characteristics, e.g. of the short-term prediction coefficients
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/06—Determination or coding of the spectral characteristics, e.g. of the short-term prediction coefficients
- G10L19/07—Line spectrum pair [LSP] vocoders
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/16—Vocoder architecture
- G10L19/18—Vocoders using multiple modes
- G10L19/20—Vocoders using multiple modes using sound class specific coding, hybrid encoders or object based coding
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/03—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
- G10L25/18—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being spectral information of each sub-band
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/03—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
- G10L25/21—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being power information
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/03—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201410288983.3A CN105336338B (zh) | 2014-06-24 | 2014-06-24 | 音频编码方法和装置 |
EP15811228.4A EP3144933B1 (en) | 2014-06-24 | 2015-06-23 | Audio coding method and apparatus |
Publications (1)
Publication Number | Publication Date |
---|---|
DK3460794T3 true DK3460794T3 (da) | 2021-08-16 |
Family
ID=54936800
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
DK18167140.5T DK3460794T3 (da) | 2014-06-24 | 2015-06-23 | Fremgangsmåde og apparat til lydindkodning |
Country Status (17)
Country | Link |
---|---|
US (3) | US9761239B2 (da) |
EP (2) | EP3144933B1 (da) |
JP (1) | JP6426211B2 (da) |
KR (2) | KR101960152B1 (da) |
CN (3) | CN107424622B (da) |
AU (2) | AU2015281506B2 (da) |
BR (1) | BR112016029380B1 (da) |
CA (1) | CA2951593C (da) |
DK (1) | DK3460794T3 (da) |
ES (2) | ES2883685T3 (da) |
HK (1) | HK1220542A1 (da) |
MX (1) | MX361248B (da) |
MY (1) | MY173129A (da) |
PT (1) | PT3144933T (da) |
RU (1) | RU2667380C2 (da) |
SG (1) | SG11201610302TA (da) |
WO (1) | WO2015196968A1 (da) |
Families Citing this family (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN107424622B (zh) | 2014-06-24 | 2020-12-25 | 华为技术有限公司 | 音频编码方法和装置 |
CN111739543B (zh) * | 2020-05-25 | 2023-05-23 | 杭州涂鸦信息技术有限公司 | 音频编码方法的调试方法及其相关装置 |
CN113948085B (zh) * | 2021-12-22 | 2022-03-25 | 中国科学院自动化研究所 | 语音识别方法、系统、电子设备和存储介质 |
Family Cites Families (47)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
FI101439B1 (fi) * | 1995-04-13 | 1998-06-15 | Nokia Telecommunications Oy | Transkooderi, jossa on tandem-koodauksen esto |
US6134518A (en) * | 1997-03-04 | 2000-10-17 | International Business Machines Corporation | Digital audio signal coding using a CELP coder and a transform coder |
ES2247741T3 (es) * | 1998-01-22 | 2006-03-01 | Deutsche Telekom Ag | Metodo para conmutacion controlada por señales entre esquemas de codificacion de audio. |
US7139700B1 (en) * | 1999-09-22 | 2006-11-21 | Texas Instruments Incorporated | Hybrid speech coding and system |
US6901362B1 (en) * | 2000-04-19 | 2005-05-31 | Microsoft Corporation | Audio segmentation and classification |
US6658383B2 (en) * | 2001-06-26 | 2003-12-02 | Microsoft Corporation | Method for coding speech and music signals |
US6647366B2 (en) * | 2001-12-28 | 2003-11-11 | Microsoft Corporation | Rate control strategies for speech and music coding |
WO2004082288A1 (en) * | 2003-03-11 | 2004-09-23 | Nokia Corporation | Switching between coding schemes |
US20050096898A1 (en) * | 2003-10-29 | 2005-05-05 | Manoj Singhal | Classification of speech and music using sub-band energy |
FI118834B (fi) * | 2004-02-23 | 2008-03-31 | Nokia Corp | Audiosignaalien luokittelu |
FI118835B (fi) | 2004-02-23 | 2008-03-31 | Nokia Corp | Koodausmallin valinta |
GB0408856D0 (en) | 2004-04-21 | 2004-05-26 | Nokia Corp | Signal encoding |
US7739120B2 (en) * | 2004-05-17 | 2010-06-15 | Nokia Corporation | Selection of coding models for encoding an audio signal |
NZ562188A (en) * | 2005-04-01 | 2010-05-28 | Qualcomm Inc | Methods and apparatus for encoding and decoding an highband portion of a speech signal |
US8892448B2 (en) | 2005-04-22 | 2014-11-18 | Qualcomm Incorporated | Systems, methods, and apparatus for gain factor smoothing |
DE102005046993B3 (de) | 2005-09-30 | 2007-02-22 | Infineon Technologies Ag | Vorrichtung und Verfahren zum Erzeugen eines Leistungssignals aus einem Laststrom |
US8015000B2 (en) * | 2006-08-03 | 2011-09-06 | Broadcom Corporation | Classification-based frame loss concealment for audio signals |
RU2426179C2 (ru) | 2006-10-10 | 2011-08-10 | Квэлкомм Инкорпорейтед | Способ и устройство для кодирования и декодирования аудиосигналов |
KR100964402B1 (ko) * | 2006-12-14 | 2010-06-17 | 삼성전자주식회사 | 오디오 신호의 부호화 모드 결정 방법 및 장치와 이를 이용한 오디오 신호의 부호화/복호화 방법 및 장치 |
CN101025918B (zh) * | 2007-01-19 | 2011-06-29 | 清华大学 | 一种语音/音乐双模编解码无缝切换方法 |
KR101149449B1 (ko) * | 2007-03-20 | 2012-05-25 | 삼성전자주식회사 | 오디오 신호의 인코딩 방법 및 장치, 그리고 오디오 신호의디코딩 방법 및 장치 |
JP5156260B2 (ja) * | 2007-04-27 | 2013-03-06 | ニュアンス コミュニケーションズ,インコーポレイテッド | 雑音を除去して目的音を抽出する方法、前処理部、音声認識システムおよびプログラム |
KR100925256B1 (ko) * | 2007-05-03 | 2009-11-05 | 인하대학교 산학협력단 | 음성 및 음악을 실시간으로 분류하는 방법 |
WO2009110751A2 (ko) * | 2008-03-04 | 2009-09-11 | Lg Electronics Inc. | 오디오 신호 처리 방법 및 장치 |
EP2139000B1 (en) * | 2008-06-25 | 2011-05-25 | Thomson Licensing | Method and apparatus for encoding or decoding a speech and/or non-speech audio input signal |
WO2010005224A2 (en) * | 2008-07-07 | 2010-01-14 | Lg Electronics Inc. | A method and an apparatus for processing an audio signal |
EP2144230A1 (en) * | 2008-07-11 | 2010-01-13 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Low bitrate audio encoding/decoding scheme having cascaded switches |
WO2010003521A1 (en) * | 2008-07-11 | 2010-01-14 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Method and discriminator for classifying different segments of a signal |
US9037474B2 (en) * | 2008-09-06 | 2015-05-19 | Huawei Technologies Co., Ltd. | Method for classifying audio signal into fast signal or slow signal |
CN101615910B (zh) | 2009-05-31 | 2010-12-22 | 华为技术有限公司 | 压缩编码的方法、装置和设备以及压缩解码方法 |
US8606569B2 (en) * | 2009-07-02 | 2013-12-10 | Alon Konchitsky | Automatic determination of multimedia and voice signals |
CN102044244B (zh) * | 2009-10-15 | 2011-11-16 | 华为技术有限公司 | 信号分类方法和装置 |
CN101800050B (zh) * | 2010-02-03 | 2012-10-10 | 武汉大学 | 基于感知自适应比特分配的音频精细分级编码方法及系统 |
US20130114733A1 (en) | 2010-07-05 | 2013-05-09 | Nippon Telegraph And Telephone Corporation | Encoding method, decoding method, device, program, and recording medium |
US9208792B2 (en) * | 2010-08-17 | 2015-12-08 | Qualcomm Incorporated | Systems, methods, apparatus, and computer-readable media for noise injection |
US8484023B2 (en) | 2010-09-24 | 2013-07-09 | Nuance Communications, Inc. | Sparse representation features for speech recognition |
US9111526B2 (en) | 2010-10-25 | 2015-08-18 | Qualcomm Incorporated | Systems, method, apparatus, and computer-readable media for decomposition of a multichannel music signal |
US9240191B2 (en) * | 2011-04-28 | 2016-01-19 | Telefonaktiebolaget L M Ericsson (Publ) | Frame based audio signal classification |
EP2770506A4 (en) | 2011-10-19 | 2015-02-25 | Panasonic Ip Corp America | CODING DEVICE AND CODING METHOD |
US9111531B2 (en) * | 2012-01-13 | 2015-08-18 | Qualcomm Incorporated | Multiple coding mode signal classification |
CN102737647A (zh) * | 2012-07-23 | 2012-10-17 | 武汉大学 | 双声道音频音质增强编解码方法及装置 |
CN105976824B (zh) * | 2012-12-06 | 2021-06-08 | 华为技术有限公司 | 信号解码的方法和设备 |
CN103747237B (zh) | 2013-02-06 | 2015-04-29 | 华为技术有限公司 | 视频编码质量的评估方法及设备 |
CN103280221B (zh) | 2013-05-09 | 2015-07-29 | 北京大学 | 一种基于基追踪的音频无损压缩编码、解码方法及系统 |
CN103778919B (zh) * | 2014-01-21 | 2016-08-17 | 南京邮电大学 | 基于压缩感知和稀疏表示的语音编码方法 |
CN107424622B (zh) | 2014-06-24 | 2020-12-25 | 华为技术有限公司 | 音频编码方法和装置 |
CN104217730B (zh) * | 2014-08-18 | 2017-07-21 | 大连理工大学 | 一种基于k‑svd的人工语音带宽扩展方法及装置 |
-
2014
- 2014-06-24 CN CN201710188023.3A patent/CN107424622B/zh active Active
- 2014-06-24 CN CN201410288983.3A patent/CN105336338B/zh active Active
- 2014-06-24 CN CN201710188022.9A patent/CN107424621B/zh active Active
-
2015
- 2015-06-23 SG SG11201610302TA patent/SG11201610302TA/en unknown
- 2015-06-23 AU AU2015281506A patent/AU2015281506B2/en active Active
- 2015-06-23 ES ES18167140T patent/ES2883685T3/es active Active
- 2015-06-23 KR KR1020167036467A patent/KR101960152B1/ko active IP Right Grant
- 2015-06-23 CA CA2951593A patent/CA2951593C/en active Active
- 2015-06-23 MY MYPI2016704527A patent/MY173129A/en unknown
- 2015-06-23 EP EP15811228.4A patent/EP3144933B1/en active Active
- 2015-06-23 BR BR112016029380-0A patent/BR112016029380B1/pt active IP Right Grant
- 2015-06-23 KR KR1020197007222A patent/KR102051928B1/ko active IP Right Grant
- 2015-06-23 DK DK18167140.5T patent/DK3460794T3/da active
- 2015-06-23 MX MX2016016564A patent/MX361248B/es active IP Right Grant
- 2015-06-23 EP EP18167140.5A patent/EP3460794B1/en active Active
- 2015-06-23 RU RU2017101813A patent/RU2667380C2/ru active
- 2015-06-23 WO PCT/CN2015/082076 patent/WO2015196968A1/zh active Application Filing
- 2015-06-23 JP JP2016574980A patent/JP6426211B2/ja active Active
- 2015-06-23 PT PT15811228T patent/PT3144933T/pt unknown
- 2015-06-23 ES ES15811228T patent/ES2703199T3/es active Active
-
2016
- 2016-07-15 HK HK16108373.2A patent/HK1220542A1/zh unknown
- 2016-12-21 US US15/386,246 patent/US9761239B2/en active Active
-
2017
- 2017-08-21 US US15/682,097 patent/US10347267B2/en active Active
-
2018
- 2018-05-22 AU AU2018203619A patent/AU2018203619B2/en active Active
-
2019
- 2019-06-13 US US16/439,954 patent/US11074922B2/en active Active
Also Published As
Similar Documents
Publication | Publication Date | Title |
---|---|---|
DK3677032T3 (da) | Fremgangsmåde og apparat til kodning | |
DK3220676T3 (da) | Fremgangsmåde og apparat til afsendelse af informationer | |
DK3094357T3 (da) | Objektdekontamineringsapparat og metode | |
DK3265757T3 (da) | Fremgangsmåde og apparat til optisk detektion | |
DK3108319T3 (da) | Fremgangsmåde og apparat til togstyringssystem | |
DK3339479T3 (da) | Elektrolyseindretning og elektrolysefremgangsmåde | |
DK3347113T3 (da) | Ekstraktionsapparat og metode dertil | |
DK3358338T3 (da) | Billeddannelsesfremgangsmåde og billeddannelsesapparat | |
DK3177876T3 (da) | Fremgangsmåde og indretning til affugtning | |
BR112016027102A2 (pt) | método e aparelho | |
DK3166492T3 (da) | Apparat til optagelse og behandling af billeder | |
DK3304007T3 (da) | Forbedret flowmåleapparat og fremgangsmåde til brug | |
PL3668125T3 (pl) | Sposób i urządzenie do renderowania sygnału akustycznego | |
IL268543B (en) | Method and device for audio coding | |
DK3256300T3 (da) | Apparat og fremgangsmåde til pultrusion | |
DK3621073T3 (da) | Lydkodningsindretning og lydkodningsfremgangsmåde | |
DK3397919T3 (da) | Fremgangsmåde og anordning til tårnsimulation | |
DK2952248T3 (da) | Apparat og metode til fremstilling af skum | |
NO345038B1 (no) | Apparat og framgangsmåte for detektering av korrosjon | |
DK3259543T3 (da) | Apparat og fremgangsmåde til generering af smådråber | |
DK3330724T3 (da) | Fremgangsmåde og apparat til simultan impedansprøvning | |
DK3037905T3 (da) | Apparat og fremgangsmåde til optagelse af positioner | |
HUE054555T2 (hu) | Audió kódoló eljárás és berendezés | |
DK3310093T3 (da) | Trafikstyringsfremgangsmåde og apparat | |
DK3365658T3 (da) | Prøvetestapparat og -fremgangsmåde |