DK3203470T3 - Fremgangsmåde til afkodning af tale og taleafkodningsapparat - Google Patents

Fremgangsmåde til afkodning af tale og taleafkodningsapparat Download PDF

Info

Publication number
DK3203470T3
DK3203470T3 DK16193849.3T DK16193849T DK3203470T3 DK 3203470 T3 DK3203470 T3 DK 3203470T3 DK 16193849 T DK16193849 T DK 16193849T DK 3203470 T3 DK3203470 T3 DK 3203470T3
Authority
DK
Denmark
Prior art keywords
speech
discussing
procedure
cutting device
speech cutting
Prior art date
Application number
DK16193849.3T
Other languages
English (en)
Inventor
Bin Wang
Zexin Liu
Lei Miao
Original Assignee
Huawei Tech Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Huawei Tech Co Ltd filed Critical Huawei Tech Co Ltd
Application granted granted Critical
Publication of DK3203470T3 publication Critical patent/DK3203470T3/da

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/02Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
    • G10L19/03Spectral prediction for preventing pre-echo; Temporary noise shaping [TNS], e.g. in MPEG2 or MPEG4
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/08Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters
    • G10L19/12Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters the excitation function being a code excitation, e.g. in code excited linear prediction [CELP] vocoders
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/02Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/16Vocoder architecture
    • G10L19/18Vocoders using multiple modes
    • G10L19/24Variable rate codecs, e.g. for generating different qualities using a scalable representation such as hierarchical encoding or layered encoding
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/26Pre-filtering or post-filtering
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/26Pre-filtering or post-filtering
    • G10L19/265Pre-filtering, e.g. high frequency emphasis prior to encoding
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/038Speech enhancement, e.g. noise reduction or echo cancellation using band spreading techniques
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/02Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
    • G10L19/0204Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders using subband decomposition
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L2019/0001Codebooks
    • G10L2019/0016Codebook for LPC parameters

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Computational Linguistics (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Quality & Reliability (AREA)
  • Spectroscopy & Molecular Physics (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
DK16193849.3T 2013-01-15 2013-07-25 Fremgangsmåde til afkodning af tale og taleafkodningsapparat DK3203470T3 (da)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201310014342.4A CN103928031B (zh) 2013-01-15 2013-01-15 编码方法、解码方法、编码装置和解码装置
EP13872123.8A EP2905777B1 (en) 2013-01-15 2013-07-25 Encoding method, decoding method, encoding device, and decoding device

Publications (1)

Publication Number Publication Date
DK3203470T3 true DK3203470T3 (da) 2019-05-27

Family

ID=51146229

Family Applications (3)

Application Number Title Priority Date Filing Date
DK13872123.8T DK2905777T3 (da) 2013-01-15 2013-07-25 Fremgangsmåde til kodning, fremgangsmåde til afkodning, kodningsanordning og afkodningsanordning
DK18182328.7T DK3486905T3 (da) 2013-01-15 2013-07-25 Kodningsfremgangsmåde, dekodningsfremgangsmåde, kodningsanordning og dekodningsanordning
DK16193849.3T DK3203470T3 (da) 2013-01-15 2013-07-25 Fremgangsmåde til afkodning af tale og taleafkodningsapparat

Family Applications Before (2)

Application Number Title Priority Date Filing Date
DK13872123.8T DK2905777T3 (da) 2013-01-15 2013-07-25 Fremgangsmåde til kodning, fremgangsmåde til afkodning, kodningsanordning og afkodningsanordning
DK18182328.7T DK3486905T3 (da) 2013-01-15 2013-07-25 Kodningsfremgangsmåde, dekodningsfremgangsmåde, kodningsanordning og dekodningsanordning

Country Status (17)

Country Link
US (6) US9761235B2 (da)
EP (4) EP3764355B1 (da)
JP (3) JP6141443B2 (da)
KR (2) KR101748303B1 (da)
CN (2) CN105551497B (da)
BR (1) BR112015013088B1 (da)
DK (3) DK2905777T3 (da)
ES (3) ES2637741T3 (da)
HK (1) HK1199541A1 (da)
HU (3) HUE043649T2 (da)
NO (1) NO2905777T3 (da)
PL (3) PL3486905T3 (da)
PT (3) PT3203470T (da)
SG (1) SG11201503772RA (da)
SI (3) SI2905777T1 (da)
TR (1) TR201907656T4 (da)
WO (1) WO2014110895A1 (da)

Families Citing this family (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104517610B (zh) 2013-09-26 2018-03-06 华为技术有限公司 频带扩展的方法及装置
CN105225671B (zh) * 2014-06-26 2016-10-26 华为技术有限公司 编解码方法、装置及系统
US10475457B2 (en) 2017-07-03 2019-11-12 Qualcomm Incorporated Time-domain inter-channel prediction
JP7362320B2 (ja) * 2019-07-04 2023-10-17 フォルシアクラリオン・エレクトロニクス株式会社 オーディオ信号処理装置、オーディオ信号処理方法及びオーディオ信号処理プログラム
US10978083B1 (en) * 2019-11-13 2021-04-13 Shure Acquisition Holdings, Inc. Time domain spectral bandwidth replication
CN113079378B (zh) * 2021-04-15 2022-08-16 杭州海康威视数字技术股份有限公司 图像处理方法、装置和电子设备

Family Cites Families (38)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4969192A (en) 1987-04-06 1990-11-06 Voicecraft, Inc. Vector adaptive predictive coder for speech and audio
US5307441A (en) 1989-11-29 1994-04-26 Comsat Corporation Wear-toll quality 4.8 kbps speech codec
US5495555A (en) 1992-06-01 1996-02-27 Hughes Aircraft Company High quality low bit rate celp-based speech codec
FR2720850B1 (fr) * 1994-06-03 1996-08-14 Matra Communication Procédé de codage de parole à prédiction linéaire.
JPH08160996A (ja) * 1994-12-05 1996-06-21 Hitachi Ltd 音声符号化装置
EP0763818B1 (en) * 1995-09-14 2003-05-14 Kabushiki Kaisha Toshiba Formant emphasis method and formant emphasis filter device
US5864798A (en) * 1995-09-18 1999-01-26 Kabushiki Kaisha Toshiba Method and apparatus for adjusting a spectrum shape of a speech signal
DE19643900C1 (de) * 1996-10-30 1998-02-12 Ericsson Telefon Ab L M Nachfiltern von Hörsignalen, speziell von Sprachsignalen
FR2783651A1 (fr) * 1998-09-22 2000-03-24 Koninkl Philips Electronics Nv Dispositif et procede de filtrage d'un signal de parole, recepteur et systeme de communications telephonique
US6377915B1 (en) * 1999-03-17 2002-04-23 Yrp Advanced Mobile Communication Systems Research Laboratories Co., Ltd. Speech decoding using mix ratio table
US6510407B1 (en) * 1999-10-19 2003-01-21 Atmel Corporation Method and apparatus for variable rate coding of speech
DE10041512B4 (de) 2000-08-24 2005-05-04 Infineon Technologies Ag Verfahren und Vorrichtung zur künstlichen Erweiterung der Bandbreite von Sprachsignalen
EP1440433B1 (en) 2001-11-02 2005-05-04 Matsushita Electric Industrial Co., Ltd. Audio encoding and decoding device
ES2237706T3 (es) 2001-11-29 2005-08-01 Coding Technologies Ab Reconstruccion de componentes de alta frecuencia.
CA2415105A1 (en) * 2002-12-24 2004-06-24 Voiceage Corporation A method and device for robust predictive vector quantization of linear prediction parameters in variable bit rate speech coding
US20050004793A1 (en) 2003-07-03 2005-01-06 Pasi Ojala Signal adaptation for higher band coding in a codec utilizing band split coding
CA2457988A1 (en) * 2004-02-18 2005-08-18 Voiceage Corporation Methods and devices for audio compression based on acelp/tcx coding and multi-rate lattice vector quantization
WO2006103488A1 (en) * 2005-03-30 2006-10-05 Nokia Corporation Source coding and/or decoding
UA92341C2 (ru) * 2005-04-01 2010-10-25 Квелкомм Инкорпорейтед Системы, способы и устройство широкополосного речевого кодирования
NZ562183A (en) 2005-04-01 2010-09-30 Qualcomm Inc Systems, methods, and apparatus for highband excitation generation
WO2006116024A2 (en) * 2005-04-22 2006-11-02 Qualcomm Incorporated Systems, methods, and apparatus for gain factor attenuation
US7707034B2 (en) 2005-05-31 2010-04-27 Microsoft Corporation Audio codec post-filter
KR100795727B1 (ko) * 2005-12-08 2008-01-21 한국전자통신연구원 Celp기반의 음성 코더에서 고정 코드북 검색 장치 및방법
KR20070115637A (ko) 2006-06-03 2007-12-06 삼성전자주식회사 대역폭 확장 부호화 및 복호화 방법 및 장치
US8135047B2 (en) 2006-07-31 2012-03-13 Qualcomm Incorporated Systems and methods for including an identifier with a packet associated with a speech signal
US9454974B2 (en) 2006-07-31 2016-09-27 Qualcomm Incorporated Systems, methods, and apparatus for gain factor limiting
WO2008022181A2 (en) * 2006-08-15 2008-02-21 Broadcom Corporation Updating of decoder states after packet loss concealment
CN101140759B (zh) * 2006-09-08 2010-05-12 华为技术有限公司 语音或音频信号的带宽扩展方法及系统
JP5061111B2 (ja) * 2006-09-15 2012-10-31 パナソニック株式会社 音声符号化装置および音声符号化方法
US20100332223A1 (en) 2006-12-13 2010-12-30 Panasonic Corporation Audio decoding device and power adjusting method
JP4984983B2 (ja) * 2007-03-09 2012-07-25 富士通株式会社 符号化装置および符号化方法
EP2051245A3 (en) * 2007-10-17 2013-07-10 Gwangju Institute of Science and Technology Wideband audio signal coding/decoding device and method
KR101452722B1 (ko) * 2008-02-19 2014-10-23 삼성전자주식회사 신호 부호화 및 복호화 방법 및 장치
JP4932917B2 (ja) 2009-04-03 2012-05-16 株式会社エヌ・ティ・ティ・ドコモ 音声復号装置、音声復号方法、及び音声復号プログラム
WO2011062538A1 (en) 2009-11-19 2011-05-26 Telefonaktiebolaget Lm Ericsson (Publ) Bandwidth extension of a low band audio signal
US8886523B2 (en) * 2010-04-14 2014-11-11 Huawei Technologies Co., Ltd. Audio decoding based on audio class with control code for post-processing modes
US8600737B2 (en) * 2010-06-01 2013-12-03 Qualcomm Incorporated Systems, methods, apparatus, and computer program products for wideband speech coding
WO2013066238A2 (en) * 2011-11-02 2013-05-10 Telefonaktiebolaget L M Ericsson (Publ) Generation of a high band extension of a bandwidth extended audio signal

Also Published As

Publication number Publication date
ES2728000T3 (es) 2019-10-21
JP2017151466A (ja) 2017-08-31
CN103928031B (zh) 2016-03-30
BR112015013088A2 (pt) 2017-07-11
EP3486905B1 (en) 2020-09-09
EP2905777A4 (en) 2015-09-23
EP3203470A1 (en) 2017-08-09
SI2905777T1 (sl) 2017-11-30
US20150255080A1 (en) 2015-09-10
EP2905777B1 (en) 2017-07-19
PL2905777T3 (pl) 2017-12-29
SI3486905T1 (sl) 2020-12-31
EP3203470B1 (en) 2019-03-13
CN105551497A (zh) 2016-05-04
KR101748303B1 (ko) 2017-06-16
EP3764355B1 (en) 2024-05-01
EP3486905A1 (en) 2019-05-22
KR20160090400A (ko) 2016-07-29
KR20150082530A (ko) 2015-07-15
DK3486905T3 (da) 2020-11-23
JP2018200488A (ja) 2018-12-20
DK2905777T3 (da) 2017-11-06
US10210880B2 (en) 2019-02-19
ES2828004T3 (es) 2021-05-25
BR112015013088B1 (pt) 2020-01-28
CN103928031A (zh) 2014-07-16
US11430456B2 (en) 2022-08-30
EP3764355A1 (en) 2021-01-13
US20170372713A1 (en) 2017-12-28
CN105551497B (zh) 2019-03-19
HUE051171T2 (hu) 2021-03-01
WO2014110895A1 (zh) 2014-07-24
US9761235B2 (en) 2017-09-12
PT3486905T (pt) 2020-10-19
JP6397082B2 (ja) 2018-09-26
US10770085B2 (en) 2020-09-08
TR201907656T4 (tr) 2019-06-21
EP2905777A1 (en) 2015-08-12
KR101966265B1 (ko) 2019-04-05
JP6616470B2 (ja) 2019-12-04
SI3203470T1 (sl) 2019-06-28
US20200381000A1 (en) 2020-12-03
PT2905777T (pt) 2017-08-30
JP6141443B2 (ja) 2017-06-07
US11869520B2 (en) 2024-01-09
HK1199541A1 (zh) 2015-07-03
SG11201503772RA (en) 2015-06-29
US20220366922A1 (en) 2022-11-17
US20190139560A1 (en) 2019-05-09
US20240177722A1 (en) 2024-05-30
ES2637741T3 (es) 2017-10-16
PT3203470T (pt) 2019-06-04
HUE043649T2 (hu) 2019-08-28
JP2015537254A (ja) 2015-12-24
NO2905777T3 (da) 2017-12-16
HUE036710T2 (hu) 2018-07-30
PL3203470T3 (pl) 2019-09-30
PL3486905T3 (pl) 2021-03-08

Similar Documents

Publication Publication Date Title
DK2962637T3 (da) Fremgangsmåde og anordning til overvågning af menneskelig bevægelsesstatus
DK3089994T3 (da) Fabs-in-tandem-immunglobulin og anvendelser deraf
DK3008168T3 (da) Sc-celler og sammensætninger og fremgangsmåde til dannelse deraf
DK3013283T3 (da) Enheder, systemer og metoder til overvågning af knæudskiftninger
DK3071704T3 (da) System og fremgangsmåde til sortering af sperm
DK3036976T3 (da) Programmerbar belysningsindretning og fremgangsmåde og system til programmering af belysningsindretning
DK3073831T3 (da) Skæreapparat og fremgangsmåde til skæring af fødevareprodukter i mindre fødevareprodukter
DK2796762T4 (da) Fremgangsmåde og system til temperaturstyret dispensering af gas
DK2999103T3 (da) Konverter og fremgangsmåde til styring deraf
DK3125239T3 (da) Fremgangsmåde og indretning til styring af maskering af audiorammetab
DK3219707T3 (da) Fremgangsmåde til fremstilling af substituerede oxiraner og triazoler
DK3079719T3 (da) ANTI-SIGLEC-8-antistoffer og fremgangsmåder til anvendelse deraf
DK2961531T3 (da) Anordning, fremgangsmåde og system til overvågning af udvikling af dyrkede prøver
DK2811761T3 (da) Antenneindretning til høreinstrumenter
DK2692668T3 (da) Indretning og fremgangsmåde til palletering af laster med flere referencer
DK3083680T3 (da) Humaniserede anti-Tau(pS422)-antistoffer og fremgangsmåder til anvendelse
DK2979399T3 (da) Fremgangsmåde og anordning til justering af latenstid
DK3078724T3 (da) Fremgangsmåde til fremstilling af biodiesel og relaterede produkter
DK3540429T3 (da) Indretning og fremgangsmåder til anvendelse af indretning til detektion af hyperammoniæmi
DK3008852T3 (da) System og fremgangsmåde til kryptering
DK2964767T3 (da) Toksingener og fremgangsmåder til anvendelse deraf
DK3080302T3 (da) Metoder og sonder til identifikation af genalleler
DK2964374T3 (da) Synteseanordning og fremgangsmåde
DK3057873T3 (da) Tømningsindretning, anordning, og fremgangsmåde til tømning af sugepose
DK2900349T3 (da) Devolatiliseringsapparat og fremgangsmåde til anvendelse deraf