HK1143237A1 - Improved transform coding of speech and audio signals - Google Patents

Improved transform coding of speech and audio signals

Info

Publication number
HK1143237A1
HK1143237A1 HK10109570.7A HK10109570A HK1143237A1 HK 1143237 A1 HK1143237 A1 HK 1143237A1 HK 10109570 A HK10109570 A HK 10109570A HK 1143237 A1 HK1143237 A1 HK 1143237A1
Authority
HK
Hong Kong
Prior art keywords
sub
determining
determined
audio signals
transform coding
Prior art date
Application number
HK10109570.7A
Other languages
English (en)
Inventor
Manuel Briand
Anisse Taleb
Original Assignee
Ericsson Telefon Ab L M
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Ericsson Telefon Ab L M filed Critical Ericsson Telefon Ab L M
Publication of HK1143237A1 publication Critical patent/HK1143237A1/xx

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/02Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
    • G10L19/0204Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders using subband decomposition
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/02Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
    • G10L19/0212Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders using orthogonal transformation
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/02Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
    • G10L19/032Quantisation or dequantisation of spectral components
    • G10L19/035Scalar quantisation

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Spectroscopy & Molecular Physics (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Computational Linguistics (AREA)
  • Human Computer Interaction (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
  • Transmission Systems Not Characterized By The Medium Used For Transmission (AREA)
  • Reduction Or Emphasis Of Bandwidth Of Signals (AREA)
HK10109570.7A 2007-08-27 2010-10-07 Improved transform coding of speech and audio signals HK1143237A1 (en)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
US96815907P 2007-08-27 2007-08-27
US4424808P 2008-04-11 2008-04-11
PCT/SE2008/050967 WO2009029035A1 (en) 2007-08-27 2008-08-26 Improved transform coding of speech and audio signals

Publications (1)

Publication Number Publication Date
HK1143237A1 true HK1143237A1 (en) 2010-12-24

Family

ID=40387559

Family Applications (1)

Application Number Title Priority Date Filing Date
HK10109570.7A HK1143237A1 (en) 2007-08-27 2010-10-07 Improved transform coding of speech and audio signals

Country Status (8)

Country Link
US (2) US20110035212A1 (xx)
EP (1) EP2186087B1 (xx)
JP (1) JP5539203B2 (xx)
CN (1) CN101790757B (xx)
AT (1) ATE535904T1 (xx)
ES (1) ES2375192T3 (xx)
HK (1) HK1143237A1 (xx)
WO (1) WO2009029035A1 (xx)

Families Citing this family (32)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
PT2186090T (pt) * 2007-08-27 2017-03-07 ERICSSON TELEFON AB L M (publ) Detetor de transitórios e método para suportar codificação de um sinal de áudio
ATE535904T1 (de) * 2007-08-27 2011-12-15 Ericsson Telefon Ab L M Verbesserte transformationskodierung von sprach- und audiosignalen
US20100324913A1 (en) * 2009-06-18 2010-12-23 Jacek Piotr Stachurski Method and System for Block Adaptive Fractional-Bit Per Sample Encoding
US8498874B2 (en) * 2009-09-11 2013-07-30 Sling Media Pvt Ltd Audio signal encoding employing interchannel and temporal redundancy reduction
KR101483179B1 (ko) * 2010-10-06 2015-01-19 에스케이 텔레콤주식회사 주파수 마스크 테이블을 이용한 주파수변환 블록 부호화 방법 및 장치와 그를 이용한 영상 부호화/복호화 방법 및 장치
GB2487399B (en) * 2011-01-20 2014-06-11 Canon Kk Acoustical synthesis
PL2908313T3 (pl) * 2011-04-15 2019-11-29 Ericsson Telefon Ab L M Adaptacyjny podział współczynnika kształt - wzmocnienie
JP6189831B2 (ja) * 2011-05-13 2017-08-30 サムスン エレクトロニクス カンパニー リミテッド ビット割り当て方法及び記録媒体
CN102800317B (zh) * 2011-05-25 2014-09-17 华为技术有限公司 信号分类方法及设备、编解码方法及设备
CN102208188B (zh) 2011-07-13 2013-04-17 华为技术有限公司 音频信号编解码方法和设备
EP2898506B1 (en) 2012-09-21 2018-01-17 Dolby Laboratories Licensing Corporation Layered approach to spatial audio coding
CN103778918B (zh) * 2012-10-26 2016-09-07 华为技术有限公司 音频信号的比特分配的方法和装置
CN103854653B (zh) 2012-12-06 2016-12-28 华为技术有限公司 信号解码的方法和设备
IL294836B1 (en) * 2013-04-05 2024-06-01 Dolby Int Ab Audio encoder and decoder
EP3014609B1 (en) 2013-06-27 2017-09-27 Dolby Laboratories Licensing Corporation Bitstream syntax for spatial voice coding
FR3017484A1 (fr) * 2014-02-07 2015-08-14 Orange Extension amelioree de bande de frequence dans un decodeur de signaux audiofrequences
CN105225671B (zh) 2014-06-26 2016-10-26 华为技术有限公司 编解码方法、装置及系统
US10146500B2 (en) * 2016-08-31 2018-12-04 Dts, Inc. Transform-based audio codec and method with subband energy smoothing
EP3483878A1 (en) 2017-11-10 2019-05-15 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Audio decoder supporting a set of different loss concealment tools
EP3483886A1 (en) 2017-11-10 2019-05-15 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Selecting pitch lag
EP3483884A1 (en) 2017-11-10 2019-05-15 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Signal filtering
EP3483883A1 (en) 2017-11-10 2019-05-15 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Audio coding and decoding with selective postfiltering
WO2019091576A1 (en) 2017-11-10 2019-05-16 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Audio encoders, audio decoders, methods and computer programs adapting an encoding and decoding of least significant bits
EP3483882A1 (en) 2017-11-10 2019-05-15 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Controlling bandwidth in encoders and/or decoders
EP3483879A1 (en) 2017-11-10 2019-05-15 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Analysis/synthesis windowing function for modulated lapped transformation
EP3483880A1 (en) 2017-11-10 2019-05-15 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Temporal noise shaping
WO2019091573A1 (en) * 2017-11-10 2019-05-16 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Apparatus and method for encoding and decoding an audio signal using downsampling or interpolation of scale parameters
CN112105902B (zh) * 2018-04-11 2022-07-22 杜比实验室特许公司 基于机器学习的用于音频编码和解码的基于感知的损失函数
US10455335B1 (en) * 2018-07-20 2019-10-22 Mimi Hearing Technologies GmbH Systems and methods for modifying an audio signal using custom psychoacoustic models
EP3598441B1 (en) * 2018-07-20 2020-11-04 Mimi Hearing Technologies GmbH Systems and methods for modifying an audio signal using custom psychoacoustic models
US10966033B2 (en) * 2018-07-20 2021-03-30 Mimi Hearing Technologies GmbH Systems and methods for modifying an audio signal using custom psychoacoustic models
EP3614380B1 (en) 2018-08-22 2022-04-13 Mimi Hearing Technologies GmbH Systems and methods for sound enhancement in audio systems

Family Cites Families (36)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
USRE40280E1 (en) * 1988-12-30 2008-04-29 Lucent Technologies Inc. Rate loop processor for perceptual encoder/decoder
US5752225A (en) * 1989-01-27 1998-05-12 Dolby Laboratories Licensing Corporation Method and apparatus for split-band encoding and split-band decoding of audio information using adaptive bit allocation to adjacent subbands
NL9000338A (nl) * 1989-06-02 1991-01-02 Koninkl Philips Electronics Nv Digitaal transmissiesysteem, zender en ontvanger te gebruiken in het transmissiesysteem en registratiedrager verkregen met de zender in de vorm van een optekeninrichting.
JP2560873B2 (ja) * 1990-02-28 1996-12-04 日本ビクター株式会社 直交変換符号化復号化方法
JP3134363B2 (ja) * 1991-07-16 2001-02-13 ソニー株式会社 量子化方法
EP0559348A3 (en) * 1992-03-02 1993-11-03 AT&T Corp. Rate control loop processor for perceptual encoder/decoder
JP3150475B2 (ja) * 1993-02-19 2001-03-26 松下電器産業株式会社 量子化方法
JP3123290B2 (ja) * 1993-03-09 2001-01-09 ソニー株式会社 圧縮データ記録装置及び方法、圧縮データ再生方法、記録媒体
US5508949A (en) * 1993-12-29 1996-04-16 Hewlett-Packard Company Fast subband filtering in digital signal coding
JP3334419B2 (ja) * 1995-04-20 2002-10-15 ソニー株式会社 ノイズ低減方法及びノイズ低減装置
SE512719C2 (sv) * 1997-06-10 2000-05-02 Lars Gustaf Liljeryd En metod och anordning för reduktion av dataflöde baserad på harmonisk bandbreddsexpansion
JP3784993B2 (ja) * 1998-06-26 2006-06-14 株式会社リコー 音響信号の符号化・量子化方法
CN1065400C (zh) * 1998-09-01 2001-05-02 国家科学技术委员会高技术研究发展中心 兼容ac-3和mpeg-2的音频编解码器
US6704705B1 (en) * 1998-09-04 2004-03-09 Nortel Networks Limited Perceptual audio coding
US6578162B1 (en) * 1999-01-20 2003-06-10 Skyworks Solutions, Inc. Error recovery method and apparatus for ADPCM encoded speech
DE19947877C2 (de) * 1999-10-05 2001-09-13 Fraunhofer Ges Forschung Verfahren und Vorrichtung zum Einbringen von Informationen in einen Datenstrom sowie Verfahren und Vorrichtung zum Codieren eines Audiosignals
EP1139336A3 (en) * 2000-03-30 2004-01-02 Matsushita Electric Industrial Co., Ltd. Determination of quantizaion coefficients for a subband audio encoder
JP4021124B2 (ja) * 2000-05-30 2007-12-12 株式会社リコー デジタル音響信号符号化装置、方法及び記録媒体
JP2002268693A (ja) * 2001-03-12 2002-09-20 Mitsubishi Electric Corp オーディオ符号化装置
AU2003213149A1 (en) * 2002-02-21 2003-09-09 The Regents Of The University Of California Scalable compression of audio and other signals
JP2003280695A (ja) * 2002-03-19 2003-10-02 Sanyo Electric Co Ltd 音声圧縮方法および音声圧縮装置
JP2003280691A (ja) * 2002-03-19 2003-10-02 Sanyo Electric Co Ltd 音声処理方法および音声処理装置
JP3881946B2 (ja) * 2002-09-12 2007-02-14 松下電器産業株式会社 音響符号化装置及び音響符号化方法
US7272566B2 (en) * 2003-01-02 2007-09-18 Dolby Laboratories Licensing Corporation Reducing scale factor transmission cost for MPEG-2 advanced audio coding (AAC) using a lattice based post processing technique
JP4293833B2 (ja) * 2003-05-19 2009-07-08 シャープ株式会社 ディジタル信号記録再生装置及びその制御プログラム
JP4212591B2 (ja) * 2003-06-30 2009-01-21 富士通株式会社 オーディオ符号化装置
KR100595202B1 (ko) * 2003-12-27 2006-06-30 엘지전자 주식회사 디지털 오디오 워터마크 삽입/검출 장치 및 방법
JP2006018023A (ja) * 2004-07-01 2006-01-19 Fujitsu Ltd オーディオ信号符号化装置、および符号化プログラム
US7668715B1 (en) * 2004-11-30 2010-02-23 Cirrus Logic, Inc. Methods for selecting an initial quantization step size in audio encoders and systems using the same
US7539612B2 (en) * 2005-07-15 2009-05-26 Microsoft Corporation Coding and decoding scale factor information
CN1909066B (zh) * 2005-08-03 2011-02-09 昆山杰得微电子有限公司 音频编码码量控制和调整的方法
US8332216B2 (en) * 2006-01-12 2012-12-11 Stmicroelectronics Asia Pacific Pte., Ltd. System and method for low power stereo perceptual audio coding using adaptive masking threshold
JP4350718B2 (ja) * 2006-03-22 2009-10-21 富士通株式会社 音声符号化装置
KR100943606B1 (ko) * 2006-03-30 2010-02-24 삼성전자주식회사 디지털 통신 시스템에서 양자화 장치 및 방법
SG136836A1 (en) * 2006-04-28 2007-11-29 St Microelectronics Asia Adaptive rate control algorithm for low complexity aac encoding
ATE535904T1 (de) * 2007-08-27 2011-12-15 Ericsson Telefon Ab L M Verbesserte transformationskodierung von sprach- und audiosignalen

Also Published As

Publication number Publication date
ES2375192T3 (es) 2012-02-27
ATE535904T1 (de) 2011-12-15
US9153240B2 (en) 2015-10-06
JP2010538316A (ja) 2010-12-09
WO2009029035A1 (en) 2009-03-05
US20140142956A1 (en) 2014-05-22
US20110035212A1 (en) 2011-02-10
EP2186087B1 (en) 2011-11-30
CN101790757A (zh) 2010-07-28
CN101790757B (zh) 2012-05-30
EP2186087A1 (en) 2010-05-19
EP2186087A4 (en) 2010-11-24
JP5539203B2 (ja) 2014-07-02

Similar Documents

Publication Publication Date Title
HK1143237A1 (en) Improved transform coding of speech and audio signals
KR102367538B1 (ko) 다중 채널 신호 인코딩 방법 및 인코더
AU2009267529B2 (en) Apparatus and method for calculating bandwidth extension data using a spectral tilt controlling framing
Mitra et al. Normalized amplitude modulation features for large vocabulary noise-robust speech recognition
CN101521014B (zh) 音频带宽扩展编解码装置
KR102248008B1 (ko) 향상된 스펙트럼 확장을 사용하여 양자화 잡음을 감소시키기 위한 압신 장치 및 방법
CN101443842B (zh) 信息信号编码
CN101183527B (zh) 用于对高频信号进行编码和解码的方法和设备
CN101770779B (zh) 嘈杂的声学信号中的噪声频谱跟踪
WO2007093726A3 (fr) Dispositif de ponderation perceptuelle en codage/decodage audio
CN1938758B (zh) 确定估计值的方法和装置
EP3457402B1 (en) Noise-adaptive voice signal processing method and terminal device employing said method
WO2009128667A3 (ko) 오디오 시맨틱 정보를 이용한 오디오 신호의 부호화/복호화 방법 및 그 장치
DK1509906T3 (da) Fremgangsmåde og anordning til tonehöjdeforbedring af et dekodet talesignal
WO2007111646A3 (en) Speech post-processing using mdct coefficients
EP2933799A1 (en) Voice audio encoding device, voice audio decoding device, voice audio encoding method, and voice audio decoding method
IL186436A0 (en) Method and apparatus for split-band encoding of speech signals
CN101161033A (zh) 编码音频的节约式响度测量
US11694701B2 (en) Low-complexity tonality-adaptive audio signal quantization
WO2011024198A3 (en) Frequency band scale factor determination in audio encoding based upon frequency band signal energy
MX2012002741A (es) Codificacion de señales de audio utilizando reduccion de redundancia entre caales y temporal.
CN102314883A (zh) 一种判断音乐噪声的方法以及语音消噪方法
CN102332266A (zh) 一种音频数据的编码方法及装置
ATE450034T1 (de) Wahrnehmungsbezogene normierung digitaler audiosignale
MX359502B (es) Metodos y dispositivos de codificacion y decodificacion de señal.