DK3067888T3 - Dekoder til dæmpning af signalområder rekonstrueret med lav nøjagtighed - Google Patents
Dekoder til dæmpning af signalområder rekonstrueret med lav nøjagtighed Download PDFInfo
- Publication number
- DK3067888T3 DK3067888T3 DK16167229.0T DK16167229T DK3067888T3 DK 3067888 T3 DK3067888 T3 DK 3067888T3 DK 16167229 T DK16167229 T DK 16167229T DK 3067888 T3 DK3067888 T3 DK 3067888T3
- Authority
- DK
- Denmark
- Prior art keywords
- attenuation
- spectral
- decoder
- reconstructed
- bits
- Prior art date
Links
- 230000003595 spectral effect Effects 0.000 claims description 78
- 238000001228 spectrum Methods 0.000 claims description 16
- 230000003044 adaptive effect Effects 0.000 claims description 13
- 230000002238 attenuated effect Effects 0.000 claims description 11
- 230000015572 biosynthetic process Effects 0.000 claims description 5
- 230000005236 sound signal Effects 0.000 claims description 5
- 238000003786 synthesis reaction Methods 0.000 claims description 5
- 239000013598 vector Substances 0.000 description 8
- 230000006870 function Effects 0.000 description 6
- 238000013139 quantization Methods 0.000 description 6
- 238000000034 method Methods 0.000 description 5
- 230000008859 change Effects 0.000 description 4
- 230000007423 decrease Effects 0.000 description 4
- 230000001419 dependent effect Effects 0.000 description 3
- 230000008901 benefit Effects 0.000 description 2
- 238000004891 communication Methods 0.000 description 2
- 230000003247 decreasing effect Effects 0.000 description 2
- 230000004048 modification Effects 0.000 description 2
- 238000012986 modification Methods 0.000 description 2
- 230000008447 perception Effects 0.000 description 2
- 230000035945 sensitivity Effects 0.000 description 2
- 230000008685 targeting Effects 0.000 description 2
- 230000006835 compression Effects 0.000 description 1
- 238000007906 compression Methods 0.000 description 1
- 230000008569 process Effects 0.000 description 1
- 230000009467 reduction Effects 0.000 description 1
- 238000007493 shaping process Methods 0.000 description 1
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/02—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/02—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
- G10L19/0212—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders using orthogonal transformation
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/02—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
- G10L19/032—Quantisation or dequantisation of spectral components
- G10L19/035—Scalar quantisation
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/02—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
- G10L19/032—Quantisation or dequantisation of spectral components
- G10L19/038—Vector quantisation, e.g. TwinVQ audio
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/08—Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters
- G10L19/10—Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters the excitation function being a multipulse excitation
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Spectroscopy & Molecular Physics (AREA)
- Computational Linguistics (AREA)
- Signal Processing (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Quality & Reliability (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
Claims (9)
1. Dekoder til bestemmelse af en dæmpning, der skal anvendes på et audio-signal, omfattende en identifikatorenhed (703), der er konfigureret til at identificere spektralområder, der skal dæmpes, hvilken dekoder er kendetegnet ved yderligere at omfatte: en grupperingsenhed (704), der er konfigureret til at gruppere efterfølgende identificerede spektralområder til dannelse af et kontinuerligt spektralområde, en bestemmelsesenhed (705), der er konfigureret til at bestemme en bredde af det kontinuerlige spektralområde, og en applikationsenhed (706), der er konfigureret til at anvende en dæmpning af det kontinuerlige spektralområde, adaptiv til bredden, således at en øget bredde reducerer dæmpningen af det kontinuerlige spektralområde.
2. Dekoder ifølge krav 1, hvor de spektralområder, der skal dæmpes, er kodet med enten et lavt antal bit eller uden tildelte bit.
3. Dekoder ifølge krav 2, hvor identifikatorenheden (703), der er konfigureret til at identificere spektralområder, der skal dæmpes, yderligere er konfigureret til at undersøge rekonstruerede subvektorer.
4. Dekoder ifølge krav 3, hvor et spektralområde siges at være repræsenteret med lav præcision, når det tildelte antal bit til den rekonstruerede subvektor er under en forudbestemt tærskelværdi.
5. Dekoder ifølge krav 3, hvor der anvendes et pulskodningsskema til at kode de spektrale subvektorer, og et spektralområde siges at være repræsenteret med lav præcision, hvis det består af en eller flere konsekutive subvektorer, hvor antallet af pulser P(b) er under en forudbestemt tærskeværdi.
6. Dekoder ifølge et hvilket som helst af kravene 1-5, hvor der identificeres spektralområder, der er kodet uden bit.
7. Dekoder ifølge et hvilket som helst af kravene 1-5, hvor der identificeres spektralområder, der er kodet med et lavt antal bit.
8. Dekoder ifølge et hvilket som helst af kravene 1-7, hvor det rekonstruerede spektrum også indbefatter et område, der er rekonstrueret under anvendelse af en båndbreddeudvidelsesalgoritme.
9. Dekoder ifølge krav 1 eller 8, hvor den omfatter en input-enhed (710), der er konfigureret til at modtage en analyse fra koderen, og hvor indentifikato-renheden (703) yderligere er konfigureret til at identificere de spektralområder, der skal dæmpes, på basis af den modtagne analyse, hvor et afstandsmål mellem et rekonstrueret syntesesignal og et input-målsignal anvendes af koderen, hvis afstandsmålet i et vist frekvensområde er over en vis tærskelværdi, er spektralområdet en potential kandidat til dæmpning.
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US201161475711P | 2011-04-15 | 2011-04-15 | |
EP14184428.2A EP2816556B1 (en) | 2011-04-15 | 2011-12-15 | Method and a decoder for attenuation of signal regions reconstructed with low accuracy |
Publications (1)
Publication Number | Publication Date |
---|---|
DK3067888T3 true DK3067888T3 (da) | 2017-07-10 |
Family
ID=45406733
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
DK16167229.0T DK3067888T3 (da) | 2011-04-15 | 2011-12-15 | Dekoder til dæmpning af signalområder rekonstrueret med lav nøjagtighed |
Country Status (7)
Country | Link |
---|---|
US (4) | US8706509B2 (da) |
EP (3) | EP2816556B1 (da) |
KR (1) | KR101520212B1 (da) |
CN (1) | CN103503065B (da) |
DK (1) | DK3067888T3 (da) |
ES (2) | ES2540051T3 (da) |
WO (1) | WO2012139668A1 (da) |
Families Citing this family (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN104221082B (zh) * | 2012-03-29 | 2017-03-08 | 瑞典爱立信有限公司 | 谐波音频信号的带宽扩展 |
SG11201510353RA (en) * | 2013-06-21 | 2016-01-28 | Fraunhofer Ges Forschung | Apparatus and method realizing a fading of an mdct spectrum to white noise prior to fdns application |
EP2980792A1 (en) * | 2014-07-28 | 2016-02-03 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Apparatus and method for generating an enhanced signal using independent noise-filling |
Family Cites Families (11)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US4617676A (en) * | 1984-09-04 | 1986-10-14 | At&T Bell Laboratories | Predictive communication system filtering arrangement |
KR940001817B1 (ko) * | 1991-06-14 | 1994-03-09 | 삼성전자 주식회사 | 노이즈 감소회로에 사용되는 액티브 필터용 고역 가중치 회로 |
JPH08223049A (ja) * | 1995-02-14 | 1996-08-30 | Sony Corp | 信号符号化方法及び装置、信号復号化方法及び装置、情報記録媒体並びに情報伝送方法 |
JPH08328599A (ja) * | 1995-06-01 | 1996-12-13 | Mitsubishi Electric Corp | Mpegオーディオ復号器 |
GB9512284D0 (en) * | 1995-06-16 | 1995-08-16 | Nokia Mobile Phones Ltd | Speech Synthesiser |
SE9903553D0 (sv) * | 1999-01-27 | 1999-10-01 | Lars Liljeryd | Enhancing percepptual performance of SBR and related coding methods by adaptive noise addition (ANA) and noise substitution limiting (NSL) |
US7447631B2 (en) * | 2002-06-17 | 2008-11-04 | Dolby Laboratories Licensing Corporation | Audio coding system using spectral hole filling |
EP1611772A1 (en) * | 2003-03-04 | 2006-01-04 | Nokia Corporation | Support of a multichannel audio extension |
EP2118885B1 (en) * | 2007-02-26 | 2012-07-11 | Dolby Laboratories Licensing Corporation | Speech enhancement in entertainment audio |
ES2858423T3 (es) * | 2007-08-27 | 2021-09-30 | Ericsson Telefon Ab L M | Método y dispositivo para el llenado de huecos espectrales |
US8326617B2 (en) * | 2007-10-24 | 2012-12-04 | Qnx Software Systems Limited | Speech enhancement with minimum gating |
-
2011
- 2011-12-15 EP EP14184428.2A patent/EP2816556B1/en active Active
- 2011-12-15 KR KR1020137029473A patent/KR101520212B1/ko active IP Right Grant
- 2011-12-15 EP EP11801709.4A patent/EP2697796B1/en active Active
- 2011-12-15 DK DK16167229.0T patent/DK3067888T3/da active
- 2011-12-15 CN CN201180070142.XA patent/CN103503065B/zh active Active
- 2011-12-15 WO PCT/EP2011/072963 patent/WO2012139668A1/en active Application Filing
- 2011-12-15 EP EP16167229.0A patent/EP3067888B1/en active Active
- 2011-12-15 ES ES11801709.4T patent/ES2540051T3/es active Active
- 2011-12-15 US US13/379,054 patent/US8706509B2/en active Active
- 2011-12-15 ES ES16167229.0T patent/ES2637031T3/es active Active
-
2013
- 2013-11-20 US US14/085,082 patent/US9349379B2/en active Active
-
2016
- 2016-04-26 US US15/138,530 patent/US9595268B2/en active Active
- 2016-11-16 US US15/352,729 patent/US9691398B2/en active Active
Also Published As
Publication number | Publication date |
---|---|
US9691398B2 (en) | 2017-06-27 |
ES2540051T3 (es) | 2015-07-08 |
EP2697796B1 (en) | 2015-05-06 |
KR20140035900A (ko) | 2014-03-24 |
ES2637031T3 (es) | 2017-10-10 |
CN103503065B (zh) | 2015-08-05 |
EP2816556A1 (en) | 2014-12-24 |
EP3067888B1 (en) | 2017-05-31 |
US20160240201A1 (en) | 2016-08-18 |
KR101520212B1 (ko) | 2015-05-13 |
EP2816556B1 (en) | 2016-05-04 |
US9595268B2 (en) | 2017-03-14 |
US8706509B2 (en) | 2014-04-22 |
WO2012139668A1 (en) | 2012-10-18 |
US20120278085A1 (en) | 2012-11-01 |
US20170061977A1 (en) | 2017-03-02 |
US9349379B2 (en) | 2016-05-24 |
US20140081646A1 (en) | 2014-03-20 |
CN103503065A (zh) | 2014-01-08 |
EP3067888A1 (en) | 2016-09-14 |
EP2697796A1 (en) | 2014-02-19 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
JP5608660B2 (ja) | エネルギ保存型マルチチャネルオーディオ符号化 | |
CN110197667B (zh) | 对音频信号的频谱执行噪声填充的装置 | |
JP5539203B2 (ja) | 改良された音声及びオーディオ信号の変換符号化 | |
RU2502138C2 (ru) | Кодирующее устройство, декодирующее устройство и способ | |
US8972270B2 (en) | Method and an apparatus for processing an audio signal | |
US9966082B2 (en) | Filling of non-coded sub-vectors in transform coded audio signals | |
EP1806737A1 (en) | Sound encoder and sound encoding method | |
KR20080049085A (ko) | 음성 부호화 장치 및 음성 부호화 방법 | |
US20170330573A1 (en) | Post-Quantization Gain Correction in Audio Coding | |
RU2505921C2 (ru) | Способ и устройство кодирования и декодирования аудиосигналов (варианты) | |
EP3014609B1 (en) | Bitstream syntax for spatial voice coding | |
JP2004512560A (ja) | 符号化音響信号の知覚的に改善した増強 | |
US9691398B2 (en) | Method and a decoder for attenuation of signal regions reconstructed with low accuracy | |
JP5172965B2 (ja) | 知覚モデルの適応的調整 | |
EP3550563B1 (en) | Encoder, decoder, encoding method, decoding method, and associated programs | |
US8010370B2 (en) | Bitrate control for perceptual coding | |
KR20130047630A (ko) | 통신 시스템에서 신호 부호화 장치 및 방법 |