PL3499504T3 - Improving classification between time-domain coding and frequency-domain coding - Google Patents
Improving classification between time-domain coding and frequency-domain coding
- Publication number: PL3499504T3
- Authority: PL (Poland)
- Prior art keywords: domain coding, time, frequency domain, improving classification, coding
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/002—Dynamic bit allocation
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/08—Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters
- G10L19/12—Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters the excitation function being a code excitation, e.g. in code excited linear prediction [CELP] vocoders
- G10L19/125—Pitch excitation, e.g. pitch synchronous innovation CELP [PSI-CELP]
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/16—Vocoder architecture
- G10L19/18—Vocoders using multiple modes
- G10L19/20—Vocoders using multiple modes using sound class specific coding, hybrid encoders or object based coding
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/16—Vocoder architecture
- G10L19/18—Vocoders using multiple modes
- G10L19/22—Mode decision, i.e. based on audio signal content versus external parameters
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L2019/0001—Codebooks
- G10L2019/0002—Codebook adaptations
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L2019/0001—Codebooks
- G10L2019/0011—Long term prediction filters, i.e. pitch estimation
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L2019/0001—Codebooks
- G10L2019/0016—Codebook for LPC parameters
Landscapes
- Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Signal Processing (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
- Transmission Systems Not Characterized By The Medium Used For Transmission (AREA)
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US201462029437P | 2014-07-26 | 2014-07-26 | |
US14/511,943 US9685166B2 (en) | 2014-07-26 | 2014-10-10 | Classification between time-domain coding and frequency domain coding |
Publications (1)
Publication Number | Publication Date |
---|---|
PL3499504T3 (pl) | 2023-08-14 |
Family
ID=55167212
Family Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
PL18214327.1T PL3499504T3 (pl) | 2014-07-26 | 2015-07-23 | Improving classification between time-domain coding and frequency-domain coding |
Country Status (18)
Country | Link |
---|---|
US (4) | US9685166B2 (pl) |
EP (2) | EP3152755B1 (pl) |
JP (1) | JP6334808B2 (pl) |
KR (2) | KR101960198B1 (pl) |
CN (2) | CN109545236B (pl) |
AU (2) | AU2015296315A1 (pl) |
BR (1) | BR112016030056B1 (pl) |
CA (1) | CA2952888C (pl) |
ES (2) | ES2938668T3 (pl) |
FI (1) | FI3499504T3 (pl) |
HK (1) | HK1232336A1 (pl) |
MX (1) | MX358252B (pl) |
MY (1) | MY192074A (pl) |
PL (1) | PL3499504T3 (pl) |
PT (2) | PT3152755T (pl) |
RU (1) | RU2667382C2 (pl) |
SG (1) | SG11201610552SA (pl) |
WO (1) | WO2016015591A1 (pl) |
Families Citing this family (21)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US9589570B2 (en) | 2012-09-18 | 2017-03-07 | Huawei Technologies Co., Ltd. | Audio classification based on perceptual quality for low or medium bit rates |
KR101621774B1 (ko) * | 2014-01-24 | 2016-05-19 | 숭실대학교산학협력단 | Method for determining alcohol consumption, and recording medium and terminal for performing the same |
BR112020004909A2 (pt) * | 2017-09-20 | 2020-09-15 | Voiceage Corporation | Method and device for efficiently distributing a bit-budget in a CELP codec |
WO2019091576A1 (en) | 2017-11-10 | 2019-05-16 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Audio encoders, audio decoders, methods and computer programs adapting an encoding and decoding of least significant bits |
EP3483883A1 (en) | 2017-11-10 | 2019-05-15 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Audio coding and decoding with selective postfiltering |
EP3483886A1 (en) | 2017-11-10 | 2019-05-15 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Selecting pitch lag |
WO2019091573A1 (en) | 2017-11-10 | 2019-05-16 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Apparatus and method for encoding and decoding an audio signal using downsampling or interpolation of scale parameters |
EP3483878A1 (en) | 2017-11-10 | 2019-05-15 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Audio decoder supporting a set of different loss concealment tools |
EP3483880A1 (en) | 2017-11-10 | 2019-05-15 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Temporal noise shaping |
EP3483882A1 (en) | 2017-11-10 | 2019-05-15 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Controlling bandwidth in encoders and/or decoders |
EP3483879A1 (en) | 2017-11-10 | 2019-05-15 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Analysis/synthesis windowing function for modulated lapped transformation |
EP3483884A1 (en) | 2017-11-10 | 2019-05-15 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Signal filtering |
US11270721B2 (en) * | 2018-05-21 | 2022-03-08 | Plantronics, Inc. | Systems and methods of pre-processing of speech signals for improved speech recognition |
USD901798S1 (en) | 2018-08-16 | 2020-11-10 | Samsung Electronics Co., Ltd. | Rack for clothing care machine |
JP7130878B2 (ja) * | 2019-01-13 | 2022-09-05 | 華為技術有限公司 | High resolution audio coding |
JP7266689B2 (ja) * | 2019-01-13 | 2023-04-28 | 華為技術有限公司 | High resolution audio coding |
US11367437B2 (en) * | 2019-05-30 | 2022-06-21 | Nuance Communications, Inc. | Multi-microphone speech dialog system for multiple spatial zones |
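CN110992963B (zh) * | 2019-12-10 | 2023-09-29 | 腾讯科技(深圳)有限公司 | Network call method and apparatus, computer device, and storage medium |
CN113129910B (zh) * | 2019-12-31 | 2024-07-30 | 华为技术有限公司 | Audio signal encoding and decoding method and encoding and decoding apparatus |
CN113132765A (zh) * | 2020-01-16 | 2021-07-16 | 北京达佳互联信息技术有限公司 | Bit rate decision model training method and apparatus, electronic device, and storage medium |
CN118414662A (zh) * | 2021-12-15 | 2024-07-30 | 瑞典爱立信有限公司 | Adaptive predictive coding |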
Family Cites Families (20)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5504834A (en) | 1993-05-28 | 1996-04-02 | Motorola, Inc. | Pitch epoch synchronous linear predictive coding vocoder and method |
ES2269112T3 (es) | 2000-02-29 | 2007-04-01 | Qualcomm Incorporated | Closed-loop multimode mixed-domain speech coder. |
US7185082B1 (en) * | 2000-08-09 | 2007-02-27 | Microsoft Corporation | Fast dynamic measurement of connection bandwidth using at least a pair of non-compressible packets having measurable characteristics |
US7630396B2 (en) | 2004-08-26 | 2009-12-08 | Panasonic Corporation | Multichannel signal coding equipment and multichannel signal decoding equipment |
KR20060119743A (ko) | 2005-05-18 | 2006-11-24 | 엘지전자 주식회사 | Method and apparatus for providing prediction information on section speed and using the same |
ES2478004T3 (es) * | 2005-10-05 | 2014-07-18 | Lg Electronics Inc. | Method and apparatus for decoding an audio signal |
KR100647336B1 (ko) * | 2005-11-08 | 2006-11-23 | 삼성전자주식회사 | Adaptive time/frequency-based audio encoding/decoding apparatus and method |
KR101149449B1 (ko) * | 2007-03-20 | 2012-05-25 | 삼성전자주식회사 | Method and apparatus for encoding an audio signal, and method and apparatus for decoding an audio signal |
EP4407610A1 (en) | 2008-07-11 | 2024-07-31 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Audio encoder, audio decoder, methods for encoding and decoding an audio signal, audio stream and computer program |
CN102089814B (zh) * | 2008-07-11 | 2012-11-21 | 弗劳恩霍夫应用研究促进协会 | Apparatus and method for decoding an encoded audio signal |
KR101756834B1 (ko) * | 2008-07-14 | 2017-07-12 | 삼성전자주식회사 | Method and apparatus for encoding and decoding an audio/speech signal |
US9037474B2 (en) * | 2008-09-06 | 2015-05-19 | Huawei Technologies Co., Ltd. | Method for classifying audio signal into fast signal or slow signal |
WO2010031003A1 (en) | 2008-09-15 | 2010-03-18 | Huawei Technologies Co., Ltd. | Adding second enhancement layer to celp based core layer |
US8577673B2 (en) * | 2008-09-15 | 2013-11-05 | Huawei Technologies Co., Ltd. | CELP post-processing for music signals |
JP5519230B2 (ja) * | 2009-09-30 | 2014-06-11 | パナソニック株式会社 | Audio encoder and sound signal processing system |
WO2012000882A1 (en) * | 2010-07-02 | 2012-01-05 | Dolby International Ab | Selective bass post filter |
EP3301677B1 (en) | 2011-12-21 | 2019-08-28 | Huawei Technologies Co., Ltd. | Very short pitch detection and coding |
US9015039B2 (en) | 2011-12-21 | 2015-04-21 | Huawei Technologies Co., Ltd. | Adaptive encoding pitch lag for voiced speech |
US9589570B2 (en) | 2012-09-18 | 2017-03-07 | Huawei Technologies Co., Ltd. | Audio classification based on perceptual quality for low or medium bit rates |
CN103915100B (zh) | 2013-01-07 | 2019-02-15 | 中兴通讯股份有限公司 | Encoding mode switching method and apparatus, and decoding mode switching method and apparatus |
-
2014
- 2014-10-10 US US14/511,943 patent/US9685166B2/en active Active
-
2015
- 2015-07-23 PT PT15828041T patent/PT3152755T/pt unknown
- 2015-07-23 CN CN201811099395.XA patent/CN109545236B/zh active Active
- 2015-07-23 AU AU2015296315A patent/AU2015296315A1/en not_active Abandoned
- 2015-07-23 WO PCT/CN2015/084931 patent/WO2016015591A1/en active Application Filing
- 2015-07-23 KR KR1020177000714A patent/KR101960198B1/ko active IP Right Grant
- 2015-07-23 EP EP15828041.2A patent/EP3152755B1/en active Active
- 2015-07-23 FI FIEP18214327.1T patent/FI3499504T3/fi active
- 2015-07-23 EP EP18214327.1A patent/EP3499504B1/en active Active
- 2015-07-23 RU RU2017103905A patent/RU2667382C2/ru active
- 2015-07-23 MY MYPI2016704691A patent/MY192074A/en unknown
- 2015-07-23 CN CN201580031783.2A patent/CN106663441B/zh active Active
- 2015-07-23 PT PT182143271T patent/PT3499504T/pt unknown
- 2015-07-23 MX MX2017001045A patent/MX358252B/es active IP Right Grant
- 2015-07-23 KR KR1020197007223A patent/KR102039399B1/ko active IP Right Grant
- 2015-07-23 BR BR112016030056-4A patent/BR112016030056B1/pt active IP Right Grant
- 2015-07-23 CA CA2952888A patent/CA2952888C/en active Active
- 2015-07-23 ES ES18214327T patent/ES2938668T3/es active Active
- 2015-07-23 SG SG11201610552SA patent/SG11201610552SA/en unknown
- 2015-07-23 JP JP2017503873A patent/JP6334808B2/ja active Active
- 2015-07-23 PL PL18214327.1T patent/PL3499504T3/pl unknown
- 2015-07-23 ES ES15828041T patent/ES2721789T3/es active Active
-
2017
- 2017-05-11 US US15/592,573 patent/US9837092B2/en active Active
- 2017-06-15 HK HK17105970.4A patent/HK1232336A1/zh unknown
- 2017-10-16 US US15/784,802 patent/US10586547B2/en active Active
-
2018
- 2018-08-16 AU AU2018217299A patent/AU2018217299B2/en active Active
-
2020
- 2020-01-22 US US16/749,755 patent/US10885926B2/en active Active
Similar Documents
Publication | Publication Date | Title |
---|---|---|
HK1232336A1 (zh) | | Improving classification between time-domain coding and frequency-domain coding |
IL284727B (en) | | Noise reduction methods and devices |
IL252216B (en) | | Difluoromethyl-aminopyridines and difluoromethyl-aminopyrimidines |
GB201621542D0 (en) | | Continuous prediction domain |
PL3511936T3 (pl) | | Audio coding using a frequency-domain processor and a time-domain processor |
HK1221209A1 (zh) | | Potassium-doped hexagonal ferrite with increased resonant frequency |
AP2016009592A0 (en) | | Injection molded screening apparatuses and methods |
PL3152226T3 (pl) | | Modified cyclopentapeptides and uses thereof |
GB201401617D0 (en) | | Novel combination and use |
HK1231555A1 (zh) | | Biomarkers and uses thereof |
HK1243950A1 (zh) | | Novel compositions and uses |
SG11201704473WA (en) | | New methods and uses |
GB201408075D0 (en) | | Closure and latching mechanisms |
GB201411913D0 (en) | | Cosmetic methods and products |
GB201603311D0 (en) | | New uses and methods |
GB2514927B (en) | | Thiamethoxam and uses thereof |
GB2548839B (en) | | New uses and methods |
ZA201703336B (en) | | Novel products and methods |
GB2532562B (en) | | Multi-element comparison and multi-element addition |
GB201408662D0 (en) | | Internet Domain categorization |
GB201408091D0 (en) | | Methods and uses |
GB2527643B (en) | | Security domain prediction |
IL252932A0 (en) | | Anti-cxcl12 antibody molecules and their uses |
GB201813934D0 (en) | | Thiamethoxam and uses thereof |
GB201416086D0 (en) | | Methods and uses |