HK1143237A1 - Improved transform coding of speech and audio signals

Improved transform coding of speech and audio signals

Info

Publication number
HK1143237A1
Authority
HK
Hong Kong
Prior art keywords
sub
determining
determined
audio signals
transform coding
Prior art date
Application number
HK10109570.7A
Other languages
English (en)
Inventor
Manuel Briand
Anisse Taleb
Original Assignee
Ericsson Telefon Ab L M
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
2007-08-27
Filing date
2010-10-07
Publication date
2010-12-24
Application filed by Ericsson Telefon Ab L M
Publication of HK1143237A1

Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/02 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
    • G10L19/0204 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders using subband decomposition
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/02 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
    • G10L19/0212 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders using orthogonal transformation
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/02 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
    • G10L19/032 Quantisation or dequantisation of spectral components
    • G10L19/035 Scalar quantisation

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Spectroscopy & Molecular Physics (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Computational Linguistics (AREA)
  • Human Computer Interaction (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
  • Transmission Systems Not Characterized By The Medium Used For Transmission (AREA)
  • Reduction Or Emphasis Of Bandwidth Of Signals (AREA)
HK10109570.7A 2007-08-27 2010-10-07 Improved transform coding of speech and audio signals HK1143237A1 (en)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
US96815907P 2007-08-27 2007-08-27
US4424808P 2008-04-11 2008-04-11
PCT/SE2008/050967 WO2009029035A1 (en) 2007-08-27 2008-08-26 Improved transform coding of speech and audio signals

Publications (1)

Publication Number Publication Date
HK1143237A1 (en) 2010-12-24

Family

ID=40387559

Family Applications (1)

Application Number Title Priority Date Filing Date
HK10109570.7A HK1143237A1 (en) 2007-08-27 2010-10-07 Improved transform coding of speech and audio signals

Country Status (8)

Country Link
US (2) US20110035212A1 (de)
EP (1) EP2186087B1 (de)
JP (1) JP5539203B2 (de)
CN (1) CN101790757B (de)
AT (1) ATE535904T1 (de)
ES (1) ES2375192T3 (de)
HK (1) HK1143237A1 (de)
WO (1) WO2009029035A1 (de)

Families Citing this family (32)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2009029035A1 (en) * 2007-08-27 2009-03-05 Telefonaktiebolaget Lm Ericsson (Publ) Improved transform coding of speech and audio signals
CN101790756B (zh) * 2007-08-27 2012-09-05 爱立信电话股份有限公司 Transient detector and method for supporting encoding of an audio signal
US9245529B2 (en) * 2009-06-18 2016-01-26 Texas Instruments Incorporated Adaptive encoding of a digital signal with one or more missing values
US8498874B2 (en) * 2009-09-11 2013-07-30 Sling Media Pvt Ltd Audio signal encoding employing interchannel and temporal redundancy reduction
KR101483179B1 (ko) * 2010-10-06 2015-01-19 에스케이 텔레콤주식회사 Method and apparatus for encoding a frequency-transformed block using a frequency mask table, and method and apparatus for video encoding/decoding using the same
GB2487399B (en) * 2011-01-20 2014-06-11 Canon Kk Acoustical synthesis
EP2697795B1 (de) * 2011-04-15 2015-06-17 Telefonaktiebolaget L M Ericsson (PUBL) Adaptive gain-shape rate sharing
MY164164A (en) * 2011-05-13 2017-11-30 Samsung Electronics Co Ltd Bit allocating, audio encoding and decoding
CN102800317B (zh) * 2011-05-25 2014-09-17 华为技术有限公司 Signal classification method and device, and encoding/decoding method and device
CN102208188B (zh) * 2011-07-13 2013-04-17 华为技术有限公司 Audio signal encoding/decoding method and device
WO2014046916A1 (en) 2012-09-21 2014-03-27 Dolby Laboratories Licensing Corporation Layered approach to spatial audio coding
CN103778918B (zh) * 2012-10-26 2016-09-07 华为技术有限公司 Method and apparatus for bit allocation of an audio signal
CN105976824B (zh) 2012-12-06 2021-06-08 华为技术有限公司 Method and device for signal decoding
EP3671738B1 (de) 2013-04-05 2024-06-05 Dolby International AB Audio encoder and audio decoder
US9530422B2 (en) 2013-06-27 2016-12-27 Dolby Laboratories Licensing Corporation Bitstream syntax for spatial voice coding
FR3017484A1 (fr) * 2014-02-07 2015-08-14 Orange Improved frequency band extension in an audio frequency signal decoder
CN105225671B (zh) 2014-06-26 2016-10-26 华为技术有限公司 Encoding/decoding method, apparatus, and system
US10146500B2 (en) * 2016-08-31 2018-12-04 Dts, Inc. Transform-based audio codec and method with subband energy smoothing
EP3483886A1 (de) 2017-11-10 2019-05-15 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Selection of a fundamental frequency
EP3483879A1 (de) 2017-11-10 2019-05-15 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Analysis/synthesis window function for modulated lapped transformation
WO2019091573A1 (en) * 2017-11-10 2019-05-16 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Apparatus and method for encoding and decoding an audio signal using downsampling or interpolation of scale parameters
EP3483882A1 (de) 2017-11-10 2019-05-15 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Controlling bandwidth in encoders and/or decoders
EP3483884A1 (de) 2017-11-10 2019-05-15 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Signal filtering
EP3483878A1 (de) 2017-11-10 2019-05-15 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Audio decoder with a selection function for different loss concealment tools
WO2019091576A1 (en) 2017-11-10 2019-05-16 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Audio encoders, audio decoders, methods and computer programs adapting an encoding and decoding of least significant bits
EP3483883A1 (de) 2017-11-10 2019-05-15 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Audio coding and decoding with selective postfiltering
EP3483880A1 (de) 2017-11-10 2019-05-15 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Temporal noise shaping
EP3775821A1 (de) * 2018-04-11 2021-02-17 Dolby Laboratories Licensing Corporation Perceptually-based loss functions for audio encoding and decoding based on machine learning
US10455335B1 (en) * 2018-07-20 2019-10-22 Mimi Hearing Technologies GmbH Systems and methods for modifying an audio signal using custom psychoacoustic models
US10966033B2 (en) * 2018-07-20 2021-03-30 Mimi Hearing Technologies GmbH Systems and methods for modifying an audio signal using custom psychoacoustic models
EP3598441B1 (de) * 2018-07-20 2020-11-04 Mimi Hearing Technologies GmbH Systems and methods for modifying an audio signal using custom psychoacoustic models
EP3614380B1 (de) 2018-08-22 2022-04-13 Mimi Hearing Technologies GmbH Systems and methods for sound enhancement in audio systems

Family Cites Families (36)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
USRE40280E1 (en) * 1988-12-30 2008-04-29 Lucent Technologies Inc. Rate loop processor for perceptual encoder/decoder
US5752225A (en) * 1989-01-27 1998-05-12 Dolby Laboratories Licensing Corporation Method and apparatus for split-band encoding and split-band decoding of audio information using adaptive bit allocation to adjacent subbands
NL9000338A (nl) * 1989-06-02 1991-01-02 Koninkl Philips Electronics Nv Digital transmission system, transmitter and receiver for use in the transmission system, and record carrier obtained with the transmitter in the form of a recording device.
JP2560873B2 (ja) * 1990-02-28 1996-12-04 日本ビクター株式会社 Orthogonal transform coding and decoding method
JP3134363B2 (ja) * 1991-07-16 2001-02-13 ソニー株式会社 Quantization method
EP0559348A3 (de) * 1992-03-02 1993-11-03 AT&T Corp. Rate control loop processor for a perceptual encoder/decoder
JP3150475B2 (ja) * 1993-02-19 2001-03-26 松下電器産業株式会社 Quantization method
JP3123290B2 (ja) * 1993-03-09 2001-01-09 ソニー株式会社 Compressed data recording apparatus and method, compressed data reproducing method, and recording medium
US5508949A (en) * 1993-12-29 1996-04-16 Hewlett-Packard Company Fast subband filtering in digital signal coding
JP3334419B2 (ja) * 1995-04-20 2002-10-15 ソニー株式会社 Noise reduction method and noise reduction apparatus
SE512719C2 (sv) * 1997-06-10 2000-05-02 Lars Gustaf Liljeryd A method and device for reducing data flow based on harmonic bandwidth expansion
JP3784993B2 (ja) * 1998-06-26 2006-06-14 株式会社リコー Acoustic signal encoding and quantization method
CN1065400C (zh) * 1998-09-01 2001-05-02 国家科学技术委员会高技术研究发展中心 Audio codec compatible with AC-3 and MPEG-2
US6704705B1 (en) * 1998-09-04 2004-03-09 Nortel Networks Limited Perceptual audio coding
US6578162B1 (en) * 1999-01-20 2003-06-10 Skyworks Solutions, Inc. Error recovery method and apparatus for ADPCM encoded speech
DE19947877C2 (de) * 1999-10-05 2001-09-13 Fraunhofer Ges Forschung Method and device for introducing information into a data stream, and method and device for encoding an audio signal
EP1139336A3 (de) * 2000-03-30 2004-01-02 Matsushita Electric Industrial Co., Ltd. Determination of quantization factors for an audio subband coder
JP4021124B2 (ja) * 2000-05-30 2007-12-12 株式会社リコー Digital acoustic signal encoding apparatus, method, and recording medium
JP2002268693A (ja) * 2001-03-12 2002-09-20 Mitsubishi Electric Corp Audio encoding device
AU2003213149A1 (en) * 2002-02-21 2003-09-09 The Regents Of The University Of California Scalable compression of audio and other signals
JP2003280691A (ja) * 2002-03-19 2003-10-02 Sanyo Electric Co Ltd Audio processing method and audio processing device
JP2003280695A (ja) * 2002-03-19 2003-10-02 Sanyo Electric Co Ltd Audio compression method and audio compression device
JP3881946B2 (ja) * 2002-09-12 2007-02-14 松下電器産業株式会社 Acoustic encoding apparatus and acoustic encoding method
US7272566B2 (en) * 2003-01-02 2007-09-18 Dolby Laboratories Licensing Corporation Reducing scale factor transmission cost for MPEG-2 advanced audio coding (AAC) using a lattice based post processing technique
JP4293833B2 (ja) * 2003-05-19 2009-07-08 シャープ株式会社 Digital signal recording/reproducing apparatus and control program therefor
JP4212591B2 (ja) * 2003-06-30 2009-01-21 富士通株式会社 Audio encoding device
KR100595202B1 (ko) * 2003-12-27 2006-06-30 엘지전자 주식회사 Apparatus and method for inserting/detecting a digital audio watermark
JP2006018023A (ja) * 2004-07-01 2006-01-19 Fujitsu Ltd Audio signal encoding device and encoding program
US7668715B1 (en) * 2004-11-30 2010-02-23 Cirrus Logic, Inc. Methods for selecting an initial quantization step size in audio encoders and systems using the same
US7539612B2 (en) * 2005-07-15 2009-05-26 Microsoft Corporation Coding and decoding scale factor information
CN1909066B (zh) * 2005-08-03 2011-02-09 昆山杰得微电子有限公司 Method for controlling and adjusting the code amount in audio encoding
US8332216B2 (en) * 2006-01-12 2012-12-11 Stmicroelectronics Asia Pacific Pte., Ltd. System and method for low power stereo perceptual audio coding using adaptive masking threshold
JP4350718B2 (ja) * 2006-03-22 2009-10-21 富士通株式会社 Audio encoding device
KR100943606B1 (ko) * 2006-03-30 2010-02-24 삼성전자주식회사 Quantization apparatus and method in a digital communication system
SG136836A1 (en) * 2006-04-28 2007-11-29 St Microelectronics Asia Adaptive rate control algorithm for low complexity aac encoding
WO2009029035A1 (en) * 2007-08-27 2009-03-05 Telefonaktiebolaget Lm Ericsson (Publ) Improved transform coding of speech and audio signals

Also Published As

Publication number Publication date
WO2009029035A1 (en) 2009-03-05
CN101790757B (zh) 2012-05-30
US9153240B2 (en) 2015-10-06
JP5539203B2 (ja) 2014-07-02
JP2010538316A (ja) 2010-12-09
EP2186087B1 (de) 2011-11-30
ES2375192T3 (es) 2012-02-27
EP2186087A1 (de) 2010-05-19
US20140142956A1 (en) 2014-05-22
US20110035212A1 (en) 2011-02-10
CN101790757A (zh) 2010-07-28
ATE535904T1 (de) 2011-12-15
EP2186087A4 (de) 2010-11-24

Similar Documents

Publication Publication Date Title
HK1143237A1 (en) Improved transform coding of speech and audio signals
AU2009267529B2 (en) Apparatus and method for calculating bandwidth extension data using a spectral tilt controlling framing
KR102367538B1 (ko) Multi-channel signal encoding method and encoder
Mitra et al. Normalized amplitude modulation features for large vocabulary noise-robust speech recognition
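CN101521014B (zh) Audio bandwidth extension encoding/decoding device
CN101183527B (zh) Method and apparatus for encoding and decoding a high-frequency signal
KR102248008B1 (ko) Companding apparatus and method for reducing quantization noise using advanced spectral extension
CN101770779B (zh) Noise spectrum tracking in noisy acoustic signals
WO2007093726A3 (fr) Device for perceptual weighting in audio encoding/decoding
CN1938758B (zh) Method and apparatus for determining an estimate
WO2009128667A3 (ko) Method and apparatus for encoding/decoding an audio signal using audio semantic information
DK1509906T3 (da) Method and device for pitch enhancement of a decoded speech signal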
WO2007111646A3 (en) Speech post-processing using mdct coefficients
EP3457402B1 (de) Noise-adaptive voice signal processing method and terminal device employing the method
IL186436A0 (en) Method and apparatus for split-band encoding of speech signals
EP2933799A1 (de) Voice audio encoding device, voice audio decoding device, voice audio encoding method, and voice audio decoding method
MX2008013078A (es) Methods and apparatuses for encoding and decoding object-based audio signals.
CN102194457A (zh) Audio encoding/decoding method and system, and noise level estimation method
US11094332B2 (en) Low-complexity tonality-adaptive audio signal quantization
TW200705385A (en) Audio encoder and method thereof
CN109074812A (zh) Apparatus and method for MDCT M/S stereo with global ILD and improved mid/side decision
WO2011030354A3 (en) Audio signal encoding employing interchannel and temporal redundancy reduction
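CN102314883A (zh) Method for determining musical noise and method for speech noise elimination
CN102169694A (zh) Method and device for generating a psychoacoustic model
CN102332266A (zh) Method and device for encoding audio data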