HK1143237A1 - Improved transform coding of speech and audio signals - Google Patents
Improved transform coding of speech and audio signalsInfo
- Publication number
- HK1143237A1 HK1143237A1 HK10109570.7A HK10109570A HK1143237A1 HK 1143237 A1 HK1143237 A1 HK 1143237A1 HK 10109570 A HK10109570 A HK 10109570A HK 1143237 A1 HK1143237 A1 HK 1143237A1
- Authority
- HK
- Hong Kong
- Prior art keywords
- sub
- determining
- determined
- audio signals
- transform coding
- Prior art date
Links
- 230000005236 sound signal Effects 0.000 title abstract 4
- 230000000873 masking effect Effects 0.000 abstract 2
- 238000001228 spectrum Methods 0.000 abstract 2
- 238000000034 method Methods 0.000 abstract 1
- 230000009466 transformation Effects 0.000 abstract 1
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/02—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
- G10L19/0204—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders using subband decomposition
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/02—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
- G10L19/0212—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders using orthogonal transformation
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/02—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
- G10L19/032—Quantisation or dequantisation of spectral components
- G10L19/035—Scalar quantisation
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Spectroscopy & Molecular Physics (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Signal Processing (AREA)
- Health & Medical Sciences (AREA)
- Computational Linguistics (AREA)
- Human Computer Interaction (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
- Transmission Systems Not Characterized By The Medium Used For Transmission (AREA)
- Reduction Or Emphasis Of Bandwidth Of Signals (AREA)
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US96815907P | 2007-08-27 | 2007-08-27 | |
US4424808P | 2008-04-11 | 2008-04-11 | |
PCT/SE2008/050967 WO2009029035A1 (en) | 2007-08-27 | 2008-08-26 | Improved transform coding of speech and audio signals |
Publications (1)
Publication Number | Publication Date |
---|---|
HK1143237A1 true HK1143237A1 (en) | 2010-12-24 |
Family
ID=40387559
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
HK10109570.7A HK1143237A1 (en) | 2007-08-27 | 2010-10-07 | Improved transform coding of speech and audio signals |
Country Status (8)
Country | Link |
---|---|
US (2) | US20110035212A1 (xx) |
EP (1) | EP2186087B1 (xx) |
JP (1) | JP5539203B2 (xx) |
CN (1) | CN101790757B (xx) |
AT (1) | ATE535904T1 (xx) |
ES (1) | ES2375192T3 (xx) |
HK (1) | HK1143237A1 (xx) |
WO (1) | WO2009029035A1 (xx) |
Families Citing this family (32)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
PT2186090T (pt) * | 2007-08-27 | 2017-03-07 | ERICSSON TELEFON AB L M (publ) | Detetor de transitórios e método para suportar codificação de um sinal de áudio |
ATE535904T1 (de) * | 2007-08-27 | 2011-12-15 | Ericsson Telefon Ab L M | Verbesserte transformationskodierung von sprach- und audiosignalen |
US20100324913A1 (en) * | 2009-06-18 | 2010-12-23 | Jacek Piotr Stachurski | Method and System for Block Adaptive Fractional-Bit Per Sample Encoding |
US8498874B2 (en) * | 2009-09-11 | 2013-07-30 | Sling Media Pvt Ltd | Audio signal encoding employing interchannel and temporal redundancy reduction |
KR101483179B1 (ko) * | 2010-10-06 | 2015-01-19 | 에스케이 텔레콤주식회사 | 주파수 마스크 테이블을 이용한 주파수변환 블록 부호화 방법 및 장치와 그를 이용한 영상 부호화/복호화 방법 및 장치 |
GB2487399B (en) * | 2011-01-20 | 2014-06-11 | Canon Kk | Acoustical synthesis |
PL2908313T3 (pl) * | 2011-04-15 | 2019-11-29 | Ericsson Telefon Ab L M | Adaptacyjny podział współczynnika kształt - wzmocnienie |
JP6189831B2 (ja) * | 2011-05-13 | 2017-08-30 | サムスン エレクトロニクス カンパニー リミテッド | ビット割り当て方法及び記録媒体 |
CN102800317B (zh) * | 2011-05-25 | 2014-09-17 | 华为技术有限公司 | 信号分类方法及设备、编解码方法及设备 |
CN102208188B (zh) | 2011-07-13 | 2013-04-17 | 华为技术有限公司 | 音频信号编解码方法和设备 |
EP2898506B1 (en) | 2012-09-21 | 2018-01-17 | Dolby Laboratories Licensing Corporation | Layered approach to spatial audio coding |
CN103778918B (zh) * | 2012-10-26 | 2016-09-07 | 华为技术有限公司 | 音频信号的比特分配的方法和装置 |
CN103854653B (zh) | 2012-12-06 | 2016-12-28 | 华为技术有限公司 | 信号解码的方法和设备 |
IL294836B1 (en) * | 2013-04-05 | 2024-06-01 | Dolby Int Ab | Audio encoder and decoder |
EP3014609B1 (en) | 2013-06-27 | 2017-09-27 | Dolby Laboratories Licensing Corporation | Bitstream syntax for spatial voice coding |
FR3017484A1 (fr) * | 2014-02-07 | 2015-08-14 | Orange | Extension amelioree de bande de frequence dans un decodeur de signaux audiofrequences |
CN105225671B (zh) | 2014-06-26 | 2016-10-26 | 华为技术有限公司 | 编解码方法、装置及系统 |
US10146500B2 (en) * | 2016-08-31 | 2018-12-04 | Dts, Inc. | Transform-based audio codec and method with subband energy smoothing |
EP3483878A1 (en) | 2017-11-10 | 2019-05-15 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Audio decoder supporting a set of different loss concealment tools |
EP3483886A1 (en) | 2017-11-10 | 2019-05-15 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Selecting pitch lag |
EP3483884A1 (en) | 2017-11-10 | 2019-05-15 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Signal filtering |
EP3483883A1 (en) | 2017-11-10 | 2019-05-15 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Audio coding and decoding with selective postfiltering |
WO2019091576A1 (en) | 2017-11-10 | 2019-05-16 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Audio encoders, audio decoders, methods and computer programs adapting an encoding and decoding of least significant bits |
EP3483882A1 (en) | 2017-11-10 | 2019-05-15 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Controlling bandwidth in encoders and/or decoders |
EP3483879A1 (en) | 2017-11-10 | 2019-05-15 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Analysis/synthesis windowing function for modulated lapped transformation |
EP3483880A1 (en) | 2017-11-10 | 2019-05-15 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Temporal noise shaping |
WO2019091573A1 (en) * | 2017-11-10 | 2019-05-16 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Apparatus and method for encoding and decoding an audio signal using downsampling or interpolation of scale parameters |
CN112105902B (zh) * | 2018-04-11 | 2022-07-22 | 杜比实验室特许公司 | 基于机器学习的用于音频编码和解码的基于感知的损失函数 |
US10455335B1 (en) * | 2018-07-20 | 2019-10-22 | Mimi Hearing Technologies GmbH | Systems and methods for modifying an audio signal using custom psychoacoustic models |
EP3598441B1 (en) * | 2018-07-20 | 2020-11-04 | Mimi Hearing Technologies GmbH | Systems and methods for modifying an audio signal using custom psychoacoustic models |
US10966033B2 (en) * | 2018-07-20 | 2021-03-30 | Mimi Hearing Technologies GmbH | Systems and methods for modifying an audio signal using custom psychoacoustic models |
EP3614380B1 (en) | 2018-08-22 | 2022-04-13 | Mimi Hearing Technologies GmbH | Systems and methods for sound enhancement in audio systems |
Family Cites Families (36)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
USRE40280E1 (en) * | 1988-12-30 | 2008-04-29 | Lucent Technologies Inc. | Rate loop processor for perceptual encoder/decoder |
US5752225A (en) * | 1989-01-27 | 1998-05-12 | Dolby Laboratories Licensing Corporation | Method and apparatus for split-band encoding and split-band decoding of audio information using adaptive bit allocation to adjacent subbands |
NL9000338A (nl) * | 1989-06-02 | 1991-01-02 | Koninkl Philips Electronics Nv | Digitaal transmissiesysteem, zender en ontvanger te gebruiken in het transmissiesysteem en registratiedrager verkregen met de zender in de vorm van een optekeninrichting. |
JP2560873B2 (ja) * | 1990-02-28 | 1996-12-04 | 日本ビクター株式会社 | 直交変換符号化復号化方法 |
JP3134363B2 (ja) * | 1991-07-16 | 2001-02-13 | ソニー株式会社 | 量子化方法 |
EP0559348A3 (en) * | 1992-03-02 | 1993-11-03 | AT&T Corp. | Rate control loop processor for perceptual encoder/decoder |
JP3150475B2 (ja) * | 1993-02-19 | 2001-03-26 | 松下電器産業株式会社 | 量子化方法 |
JP3123290B2 (ja) * | 1993-03-09 | 2001-01-09 | ソニー株式会社 | 圧縮データ記録装置及び方法、圧縮データ再生方法、記録媒体 |
US5508949A (en) * | 1993-12-29 | 1996-04-16 | Hewlett-Packard Company | Fast subband filtering in digital signal coding |
JP3334419B2 (ja) * | 1995-04-20 | 2002-10-15 | ソニー株式会社 | ノイズ低減方法及びノイズ低減装置 |
SE512719C2 (sv) * | 1997-06-10 | 2000-05-02 | Lars Gustaf Liljeryd | En metod och anordning för reduktion av dataflöde baserad på harmonisk bandbreddsexpansion |
JP3784993B2 (ja) * | 1998-06-26 | 2006-06-14 | 株式会社リコー | 音響信号の符号化・量子化方法 |
CN1065400C (zh) * | 1998-09-01 | 2001-05-02 | 国家科学技术委员会高技术研究发展中心 | 兼容ac-3和mpeg-2的音频编解码器 |
US6704705B1 (en) * | 1998-09-04 | 2004-03-09 | Nortel Networks Limited | Perceptual audio coding |
US6578162B1 (en) * | 1999-01-20 | 2003-06-10 | Skyworks Solutions, Inc. | Error recovery method and apparatus for ADPCM encoded speech |
DE19947877C2 (de) * | 1999-10-05 | 2001-09-13 | Fraunhofer Ges Forschung | Verfahren und Vorrichtung zum Einbringen von Informationen in einen Datenstrom sowie Verfahren und Vorrichtung zum Codieren eines Audiosignals |
EP1139336A3 (en) * | 2000-03-30 | 2004-01-02 | Matsushita Electric Industrial Co., Ltd. | Determination of quantizaion coefficients for a subband audio encoder |
JP4021124B2 (ja) * | 2000-05-30 | 2007-12-12 | 株式会社リコー | デジタル音響信号符号化装置、方法及び記録媒体 |
JP2002268693A (ja) * | 2001-03-12 | 2002-09-20 | Mitsubishi Electric Corp | オーディオ符号化装置 |
AU2003213149A1 (en) * | 2002-02-21 | 2003-09-09 | The Regents Of The University Of California | Scalable compression of audio and other signals |
JP2003280695A (ja) * | 2002-03-19 | 2003-10-02 | Sanyo Electric Co Ltd | 音声圧縮方法および音声圧縮装置 |
JP2003280691A (ja) * | 2002-03-19 | 2003-10-02 | Sanyo Electric Co Ltd | 音声処理方法および音声処理装置 |
JP3881946B2 (ja) * | 2002-09-12 | 2007-02-14 | 松下電器産業株式会社 | 音響符号化装置及び音響符号化方法 |
US7272566B2 (en) * | 2003-01-02 | 2007-09-18 | Dolby Laboratories Licensing Corporation | Reducing scale factor transmission cost for MPEG-2 advanced audio coding (AAC) using a lattice based post processing technique |
JP4293833B2 (ja) * | 2003-05-19 | 2009-07-08 | シャープ株式会社 | ディジタル信号記録再生装置及びその制御プログラム |
JP4212591B2 (ja) * | 2003-06-30 | 2009-01-21 | 富士通株式会社 | オーディオ符号化装置 |
KR100595202B1 (ko) * | 2003-12-27 | 2006-06-30 | 엘지전자 주식회사 | 디지털 오디오 워터마크 삽입/검출 장치 및 방법 |
JP2006018023A (ja) * | 2004-07-01 | 2006-01-19 | Fujitsu Ltd | オーディオ信号符号化装置、および符号化プログラム |
US7668715B1 (en) * | 2004-11-30 | 2010-02-23 | Cirrus Logic, Inc. | Methods for selecting an initial quantization step size in audio encoders and systems using the same |
US7539612B2 (en) * | 2005-07-15 | 2009-05-26 | Microsoft Corporation | Coding and decoding scale factor information |
CN1909066B (zh) * | 2005-08-03 | 2011-02-09 | 昆山杰得微电子有限公司 | 音频编码码量控制和调整的方法 |
US8332216B2 (en) * | 2006-01-12 | 2012-12-11 | Stmicroelectronics Asia Pacific Pte., Ltd. | System and method for low power stereo perceptual audio coding using adaptive masking threshold |
JP4350718B2 (ja) * | 2006-03-22 | 2009-10-21 | 富士通株式会社 | 音声符号化装置 |
KR100943606B1 (ko) * | 2006-03-30 | 2010-02-24 | 삼성전자주식회사 | 디지털 통신 시스템에서 양자화 장치 및 방법 |
SG136836A1 (en) * | 2006-04-28 | 2007-11-29 | St Microelectronics Asia | Adaptive rate control algorithm for low complexity aac encoding |
ATE535904T1 (de) * | 2007-08-27 | 2011-12-15 | Ericsson Telefon Ab L M | Verbesserte transformationskodierung von sprach- und audiosignalen |
-
2008
- 2008-08-26 AT AT08828229T patent/ATE535904T1/de active
- 2008-08-26 US US12/674,117 patent/US20110035212A1/en not_active Abandoned
- 2008-08-26 EP EP08828229A patent/EP2186087B1/en active Active
- 2008-08-26 CN CN200880104834XA patent/CN101790757B/zh active Active
- 2008-08-26 WO PCT/SE2008/050967 patent/WO2009029035A1/en active Application Filing
- 2008-08-26 JP JP2010522867A patent/JP5539203B2/ja active Active
- 2008-08-26 ES ES08828229T patent/ES2375192T3/es active Active
-
2010
- 2010-10-07 HK HK10109570.7A patent/HK1143237A1/xx unknown
-
2013
- 2013-07-11 US US13/939,931 patent/US9153240B2/en active Active
Also Published As
Publication number | Publication date |
---|---|
ES2375192T3 (es) | 2012-02-27 |
ATE535904T1 (de) | 2011-12-15 |
US9153240B2 (en) | 2015-10-06 |
JP2010538316A (ja) | 2010-12-09 |
WO2009029035A1 (en) | 2009-03-05 |
US20140142956A1 (en) | 2014-05-22 |
US20110035212A1 (en) | 2011-02-10 |
EP2186087B1 (en) | 2011-11-30 |
CN101790757A (zh) | 2010-07-28 |
CN101790757B (zh) | 2012-05-30 |
EP2186087A1 (en) | 2010-05-19 |
EP2186087A4 (en) | 2010-11-24 |
JP5539203B2 (ja) | 2014-07-02 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
HK1143237A1 (en) | Improved transform coding of speech and audio signals | |
KR102367538B1 (ko) | 다중 채널 신호 인코딩 방법 및 인코더 | |
AU2009267529B2 (en) | Apparatus and method for calculating bandwidth extension data using a spectral tilt controlling framing | |
Mitra et al. | Normalized amplitude modulation features for large vocabulary noise-robust speech recognition | |
CN101521014B (zh) | 音频带宽扩展编解码装置 | |
KR102248008B1 (ko) | 향상된 스펙트럼 확장을 사용하여 양자화 잡음을 감소시키기 위한 압신 장치 및 방법 | |
CN101443842B (zh) | 信息信号编码 | |
CN101183527B (zh) | 用于对高频信号进行编码和解码的方法和设备 | |
CN101770779B (zh) | 嘈杂的声学信号中的噪声频谱跟踪 | |
WO2007093726A3 (fr) | Dispositif de ponderation perceptuelle en codage/decodage audio | |
CN1938758B (zh) | 确定估计值的方法和装置 | |
EP3457402B1 (en) | Noise-adaptive voice signal processing method and terminal device employing said method | |
WO2009128667A3 (ko) | 오디오 시맨틱 정보를 이용한 오디오 신호의 부호화/복호화 방법 및 그 장치 | |
DK1509906T3 (da) | Fremgangsmåde og anordning til tonehöjdeforbedring af et dekodet talesignal | |
WO2007111646A3 (en) | Speech post-processing using mdct coefficients | |
EP2933799A1 (en) | Voice audio encoding device, voice audio decoding device, voice audio encoding method, and voice audio decoding method | |
IL186436A0 (en) | Method and apparatus for split-band encoding of speech signals | |
CN101161033A (zh) | 编码音频的节约式响度测量 | |
US11694701B2 (en) | Low-complexity tonality-adaptive audio signal quantization | |
WO2011024198A3 (en) | Frequency band scale factor determination in audio encoding based upon frequency band signal energy | |
MX2012002741A (es) | Codificacion de señales de audio utilizando reduccion de redundancia entre caales y temporal. | |
CN102314883A (zh) | 一种判断音乐噪声的方法以及语音消噪方法 | |
CN102332266A (zh) | 一种音频数据的编码方法及装置 | |
ATE450034T1 (de) | Wahrnehmungsbezogene normierung digitaler audiosignale | |
MX359502B (es) | Metodos y dispositivos de codificacion y decodificacion de señal. |