DK3067888T3 - Dekoder til dæmpning af signalområder rekonstrueret med lav nøjagtighed - Google Patents

Dekoder til dæmpning af signalområder rekonstrueret med lav nøjagtighed Download PDF

Info

Publication number
DK3067888T3
DK3067888T3 DK16167229.0T DK16167229T DK3067888T3 DK 3067888 T3 DK3067888 T3 DK 3067888T3 DK 16167229 T DK16167229 T DK 16167229T DK 3067888 T3 DK3067888 T3 DK 3067888T3
Authority
DK
Denmark
Prior art keywords
attenuation
spectral
decoder
reconstructed
bits
Prior art date
Application number
DK16167229.0T
Other languages
English (en)
Inventor
Sebastian Näslund
Erik Norvell
Volodya Grancharov
Original Assignee
ERICSSON TELEFON AB L M (publ)
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by ERICSSON TELEFON AB L M (publ) filed Critical ERICSSON TELEFON AB L M (publ)
Application granted granted Critical
Publication of DK3067888T3 publication Critical patent/DK3067888T3/da

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/02Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/02Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
    • G10L19/0212Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders using orthogonal transformation
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/02Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
    • G10L19/032Quantisation or dequantisation of spectral components
    • G10L19/035Scalar quantisation
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/02Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
    • G10L19/032Quantisation or dequantisation of spectral components
    • G10L19/038Vector quantisation, e.g. TwinVQ audio
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/08Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters
    • G10L19/10Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters the excitation function being a multipulse excitation

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Spectroscopy & Molecular Physics (AREA)
  • Computational Linguistics (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Quality & Reliability (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)

Claims (9)

1. Dekoder til bestemmelse af en dæmpning, der skal anvendes på et audio-signal, omfattende en identifikatorenhed (703), der er konfigureret til at identificere spektralområder, der skal dæmpes, hvilken dekoder er kendetegnet ved yderligere at omfatte: en grupperingsenhed (704), der er konfigureret til at gruppere efterfølgende identificerede spektralområder til dannelse af et kontinuerligt spektralområde, en bestemmelsesenhed (705), der er konfigureret til at bestemme en bredde af det kontinuerlige spektralområde, og en applikationsenhed (706), der er konfigureret til at anvende en dæmpning af det kontinuerlige spektralområde, adaptiv til bredden, således at en øget bredde reducerer dæmpningen af det kontinuerlige spektralområde.
2. Dekoder ifølge krav 1, hvor de spektralområder, der skal dæmpes, er kodet med enten et lavt antal bit eller uden tildelte bit.
3. Dekoder ifølge krav 2, hvor identifikatorenheden (703), der er konfigureret til at identificere spektralområder, der skal dæmpes, yderligere er konfigureret til at undersøge rekonstruerede subvektorer.
4. Dekoder ifølge krav 3, hvor et spektralområde siges at være repræsenteret med lav præcision, når det tildelte antal bit til den rekonstruerede subvektor er under en forudbestemt tærskelværdi.
5. Dekoder ifølge krav 3, hvor der anvendes et pulskodningsskema til at kode de spektrale subvektorer, og et spektralområde siges at være repræsenteret med lav præcision, hvis det består af en eller flere konsekutive subvektorer, hvor antallet af pulser P(b) er under en forudbestemt tærskeværdi.
6. Dekoder ifølge et hvilket som helst af kravene 1-5, hvor der identificeres spektralområder, der er kodet uden bit.
7. Dekoder ifølge et hvilket som helst af kravene 1-5, hvor der identificeres spektralområder, der er kodet med et lavt antal bit.
8. Dekoder ifølge et hvilket som helst af kravene 1-7, hvor det rekonstruerede spektrum også indbefatter et område, der er rekonstrueret under anvendelse af en båndbreddeudvidelsesalgoritme.
9. Dekoder ifølge krav 1 eller 8, hvor den omfatter en input-enhed (710), der er konfigureret til at modtage en analyse fra koderen, og hvor indentifikato-renheden (703) yderligere er konfigureret til at identificere de spektralområder, der skal dæmpes, på basis af den modtagne analyse, hvor et afstandsmål mellem et rekonstrueret syntesesignal og et input-målsignal anvendes af koderen, hvis afstandsmålet i et vist frekvensområde er over en vis tærskelværdi, er spektralområdet en potential kandidat til dæmpning.
DK16167229.0T 2011-04-15 2011-12-15 Dekoder til dæmpning af signalområder rekonstrueret med lav nøjagtighed DK3067888T3 (da)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US201161475711P 2011-04-15 2011-04-15
EP14184428.2A EP2816556B1 (en) 2011-04-15 2011-12-15 Method and a decoder for attenuation of signal regions reconstructed with low accuracy

Publications (1)

Publication Number Publication Date
DK3067888T3 true DK3067888T3 (da) 2017-07-10

Family

ID=45406733

Family Applications (1)

Application Number Title Priority Date Filing Date
DK16167229.0T DK3067888T3 (da) 2011-04-15 2011-12-15 Dekoder til dæmpning af signalområder rekonstrueret med lav nøjagtighed

Country Status (7)

Country Link
US (4) US8706509B2 (da)
EP (3) EP2816556B1 (da)
KR (1) KR101520212B1 (da)
CN (1) CN103503065B (da)
DK (1) DK3067888T3 (da)
ES (2) ES2540051T3 (da)
WO (1) WO2012139668A1 (da)

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104221082B (zh) * 2012-03-29 2017-03-08 瑞典爱立信有限公司 谐波音频信号的带宽扩展
SG11201510353RA (en) * 2013-06-21 2016-01-28 Fraunhofer Ges Forschung Apparatus and method realizing a fading of an mdct spectrum to white noise prior to fdns application
EP2980792A1 (en) * 2014-07-28 2016-02-03 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Apparatus and method for generating an enhanced signal using independent noise-filling

Family Cites Families (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4617676A (en) * 1984-09-04 1986-10-14 At&T Bell Laboratories Predictive communication system filtering arrangement
KR940001817B1 (ko) * 1991-06-14 1994-03-09 삼성전자 주식회사 노이즈 감소회로에 사용되는 액티브 필터용 고역 가중치 회로
JPH08223049A (ja) * 1995-02-14 1996-08-30 Sony Corp 信号符号化方法及び装置、信号復号化方法及び装置、情報記録媒体並びに情報伝送方法
JPH08328599A (ja) * 1995-06-01 1996-12-13 Mitsubishi Electric Corp Mpegオーディオ復号器
GB9512284D0 (en) * 1995-06-16 1995-08-16 Nokia Mobile Phones Ltd Speech Synthesiser
SE9903553D0 (sv) * 1999-01-27 1999-10-01 Lars Liljeryd Enhancing percepptual performance of SBR and related coding methods by adaptive noise addition (ANA) and noise substitution limiting (NSL)
US7447631B2 (en) * 2002-06-17 2008-11-04 Dolby Laboratories Licensing Corporation Audio coding system using spectral hole filling
EP1611772A1 (en) * 2003-03-04 2006-01-04 Nokia Corporation Support of a multichannel audio extension
EP2118885B1 (en) * 2007-02-26 2012-07-11 Dolby Laboratories Licensing Corporation Speech enhancement in entertainment audio
ES2858423T3 (es) * 2007-08-27 2021-09-30 Ericsson Telefon Ab L M Método y dispositivo para el llenado de huecos espectrales
US8326617B2 (en) * 2007-10-24 2012-12-04 Qnx Software Systems Limited Speech enhancement with minimum gating

Also Published As

Publication number Publication date
US9691398B2 (en) 2017-06-27
ES2540051T3 (es) 2015-07-08
EP2697796B1 (en) 2015-05-06
KR20140035900A (ko) 2014-03-24
ES2637031T3 (es) 2017-10-10
CN103503065B (zh) 2015-08-05
EP2816556A1 (en) 2014-12-24
EP3067888B1 (en) 2017-05-31
US20160240201A1 (en) 2016-08-18
KR101520212B1 (ko) 2015-05-13
EP2816556B1 (en) 2016-05-04
US9595268B2 (en) 2017-03-14
US8706509B2 (en) 2014-04-22
WO2012139668A1 (en) 2012-10-18
US20120278085A1 (en) 2012-11-01
US20170061977A1 (en) 2017-03-02
US9349379B2 (en) 2016-05-24
US20140081646A1 (en) 2014-03-20
CN103503065A (zh) 2014-01-08
EP3067888A1 (en) 2016-09-14
EP2697796A1 (en) 2014-02-19

Similar Documents

Publication Publication Date Title
JP5608660B2 (ja) エネルギ保存型マルチチャネルオーディオ符号化
CN110197667B (zh) 对音频信号的频谱执行噪声填充的装置
JP5539203B2 (ja) 改良された音声及びオーディオ信号の変換符号化
RU2502138C2 (ru) Кодирующее устройство, декодирующее устройство и способ
US8972270B2 (en) Method and an apparatus for processing an audio signal
US9966082B2 (en) Filling of non-coded sub-vectors in transform coded audio signals
EP1806737A1 (en) Sound encoder and sound encoding method
KR20080049085A (ko) 음성 부호화 장치 및 음성 부호화 방법
US20170330573A1 (en) Post-Quantization Gain Correction in Audio Coding
RU2505921C2 (ru) Способ и устройство кодирования и декодирования аудиосигналов (варианты)
EP3014609B1 (en) Bitstream syntax for spatial voice coding
JP2004512560A (ja) 符号化音響信号の知覚的に改善した増強
US9691398B2 (en) Method and a decoder for attenuation of signal regions reconstructed with low accuracy
JP5172965B2 (ja) 知覚モデルの適応的調整
EP3550563B1 (en) Encoder, decoder, encoding method, decoding method, and associated programs
US8010370B2 (en) Bitrate control for perceptual coding
KR20130047630A (ko) 통신 시스템에서 신호 부호화 장치 및 방법