DK2242045T3 - Talesyntese og kodningsfremgangsmåder - Google Patents

Talesyntese og kodningsfremgangsmåder

Info

Publication number
DK2242045T3
DK2242045T3 DK09158056.3T DK09158056T DK2242045T3 DK 2242045 T3 DK2242045 T3 DK 2242045T3 DK 09158056 T DK09158056 T DK 09158056T DK 2242045 T3 DK2242045 T3 DK 2242045T3
Authority
DK
Denmark
Prior art keywords
target
frames
normalised
residual frames
gci
Prior art date
Application number
DK09158056.3T
Other languages
English (en)
Inventor
Thomas Drugman
Geoffrey Wilfart
Thierry Dutoit
Original Assignee
Univ Mons
Acapela Group S A
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Univ Mons, Acapela Group S A filed Critical Univ Mons
Application granted granted Critical
Publication of DK2242045T3 publication Critical patent/DK2242045T3/da

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/08Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters
    • G10L19/12Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters the excitation function being a code excitation, e.g. in code excited linear prediction [CELP] vocoders
    • G10L19/125Pitch excitation, e.g. pitch synchronous innovation CELP [PSI-CELP]
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L13/00Speech synthesis; Text to speech systems
    • G10L13/02Methods for producing synthetic speech; Speech synthesisers
    • G10L13/033Voice editing, e.g. manipulating the voice of the synthesiser
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L13/00Speech synthesis; Text to speech systems
    • G10L13/02Methods for producing synthetic speech; Speech synthesisers
    • G10L13/04Details of speech synthesis systems, e.g. synthesiser structure or memory management
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L13/00Speech synthesis; Text to speech systems
    • G10L13/06Elementary speech units used in speech synthesisers; Concatenation rules
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/08Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters
    • G10L19/12Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters the excitation function being a code excitation, e.g. in code excited linear prediction [CELP] vocoders

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
DK09158056.3T 2009-04-16 2009-04-16 Talesyntese og kodningsfremgangsmåder DK2242045T3 (da)

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
EP09158056A EP2242045B1 (en) 2009-04-16 2009-04-16 Speech synthesis and coding methods

Publications (1)

Publication Number Publication Date
DK2242045T3 true DK2242045T3 (da) 2012-09-24

Family

ID=40846430

Family Applications (1)

Application Number Title Priority Date Filing Date
DK09158056.3T DK2242045T3 (da) 2009-04-16 2009-04-16 Talesyntese og kodningsfremgangsmåder

Country Status (10)

Country Link
US (1) US8862472B2 (da)
EP (1) EP2242045B1 (da)
JP (1) JP5581377B2 (da)
KR (1) KR101678544B1 (da)
CA (1) CA2757142C (da)
DK (1) DK2242045T3 (da)
IL (1) IL215628A (da)
PL (1) PL2242045T3 (da)
RU (1) RU2557469C2 (da)
WO (1) WO2010118953A1 (da)

Families Citing this family (17)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2011066844A1 (en) * 2009-12-02 2011-06-09 Agnitio, S.L. Obfuscated speech synthesis
JP5591080B2 (ja) * 2010-11-26 2014-09-17 三菱電機株式会社 データ圧縮装置及びデータ処理システム及びコンピュータプログラム及びデータ圧縮方法
KR101402805B1 (ko) * 2012-03-27 2014-06-03 광주과학기술원 음성분석장치, 음성합성장치, 및 음성분석합성시스템
US9978359B1 (en) * 2013-12-06 2018-05-22 Amazon Technologies, Inc. Iterative text-to-speech with user feedback
WO2015183254A1 (en) * 2014-05-28 2015-12-03 Interactive Intelligence, Inc. Method for forming the excitation signal for a glottal pulse model based parametric speech synthesis system
US10255903B2 (en) 2014-05-28 2019-04-09 Interactive Intelligence Group, Inc. Method for forming the excitation signal for a glottal pulse model based parametric speech synthesis system
US10014007B2 (en) 2014-05-28 2018-07-03 Interactive Intelligence, Inc. Method for forming the excitation signal for a glottal pulse model based parametric speech synthesis system
US9607610B2 (en) * 2014-07-03 2017-03-28 Google Inc. Devices and methods for noise modulation in a universal vocoder synthesizer
JP6293912B2 (ja) * 2014-09-19 2018-03-14 株式会社東芝 音声合成装置、音声合成方法およびプログラム
WO2017061985A1 (en) * 2015-10-06 2017-04-13 Interactive Intelligence Group, Inc. Method for forming the excitation signal for a glottal pulse model based parametric speech synthesis system
US10140089B1 (en) 2017-08-09 2018-11-27 2236008 Ontario Inc. Synthetic speech for in vehicle communication
US10347238B2 (en) 2017-10-27 2019-07-09 Adobe Inc. Text-based insertion and replacement in audio narration
CN108281150B (zh) * 2018-01-29 2020-11-17 上海泰亿格康复医疗科技股份有限公司 一种基于微分声门波模型的语音变调变嗓音方法
US10770063B2 (en) 2018-04-13 2020-09-08 Adobe Inc. Real-time speaker-dependent neural vocoder
CN109036375B (zh) * 2018-07-25 2023-03-24 腾讯科技(深圳)有限公司 语音合成方法、模型训练方法、装置和计算机设备
CN112634914B (zh) * 2020-12-15 2024-03-29 中国科学技术大学 基于短时谱一致性的神经网络声码器训练方法
CN113539231B (zh) * 2020-12-30 2024-06-18 腾讯科技(深圳)有限公司 音频处理方法、声码器、装置、设备及存储介质

Family Cites Families (16)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPS6423300A (en) * 1987-07-17 1989-01-25 Ricoh Kk Spectrum generation system
US5754976A (en) * 1990-02-23 1998-05-19 Universite De Sherbrooke Algebraic codebook with signal-selected pulse amplitude/position combinations for fast coding of speech
EP0481107B1 (en) * 1990-10-16 1995-09-06 International Business Machines Corporation A phonetic Hidden Markov Model speech synthesizer
DE69203186T2 (de) * 1991-09-20 1996-02-01 Philips Electronics Nv Verarbeitungsgerät für die menschliche Sprache zum Detektieren des Schliessens der Stimmritze.
JPH06250690A (ja) * 1993-02-26 1994-09-09 N T T Data Tsushin Kk 振幅特徴抽出装置及び合成音声振幅制御装置
JP3093113B2 (ja) * 1994-09-21 2000-10-03 日本アイ・ビー・エム株式会社 音声合成方法及びシステム
JP3747492B2 (ja) * 1995-06-20 2006-02-22 ソニー株式会社 音声信号の再生方法及び再生装置
US6304846B1 (en) * 1997-10-22 2001-10-16 Texas Instruments Incorporated Singing voice synthesis
JP3268750B2 (ja) * 1998-01-30 2002-03-25 株式会社東芝 音声合成方法及びシステム
US6631363B1 (en) * 1999-10-11 2003-10-07 I2 Technologies Us, Inc. Rules-based notification system
DE10041512B4 (de) * 2000-08-24 2005-05-04 Infineon Technologies Ag Verfahren und Vorrichtung zur künstlichen Erweiterung der Bandbreite von Sprachsignalen
DE60127274T2 (de) * 2000-09-15 2007-12-20 Lernout & Hauspie Speech Products N.V. Schnelle wellenformsynchronisation für die verkettung und zeitskalenmodifikation von sprachsignalen
JP2004117662A (ja) * 2002-09-25 2004-04-15 Matsushita Electric Ind Co Ltd 音声合成システム
AU2003284654A1 (en) * 2002-11-25 2004-06-18 Matsushita Electric Industrial Co., Ltd. Speech synthesis method and speech synthesis device
US7842874B2 (en) * 2006-06-15 2010-11-30 Massachusetts Institute Of Technology Creating music by concatenative synthesis
US8140326B2 (en) * 2008-06-06 2012-03-20 Fuji Xerox Co., Ltd. Systems and methods for reducing speech intelligibility while preserving environmental sounds

Also Published As

Publication number Publication date
IL215628A0 (en) 2012-01-31
US20120123782A1 (en) 2012-05-17
EP2242045B1 (en) 2012-06-27
EP2242045A1 (en) 2010-10-20
RU2557469C2 (ru) 2015-07-20
KR101678544B1 (ko) 2016-11-22
CA2757142A1 (en) 2010-10-21
PL2242045T3 (pl) 2013-02-28
JP2012524288A (ja) 2012-10-11
RU2011145669A (ru) 2013-05-27
WO2010118953A1 (en) 2010-10-21
US8862472B2 (en) 2014-10-14
CA2757142C (en) 2017-11-07
KR20120040136A (ko) 2012-04-26
IL215628A (en) 2013-11-28
JP5581377B2 (ja) 2014-08-27

Similar Documents

Publication Publication Date Title
DK2242045T3 (da) Talesyntese og kodningsfremgangsmåder
MY175978A (en) Apparatus and method for decoding and encoding an audio signal using adaptive spectral tile selection
ATE527433T1 (de) Abschlussanordnung des toten strangs mit einspritzsystem und verfahren
WO2010087614A3 (ko) 오디오 신호의 부호화 및 복호화 방법 및 그 장치
WO2006060563A3 (en) Apparatus and method for producing chlorine dioxide
WO2011059254A3 (en) An apparatus for processing a signal and method thereof
MX356036B (es) Decodificador de audio y método para proveer una información de audio decodificada usando un ocultamiento de error que modifica una señal de excitación de dominio de tiempo.
ZA201006403B (en) Apparatus and method for converting an audio signal into a parameterized representaion,apparatus and method for modifying a paramerized representation,apparatus and mrthod for synthesizing a parameterized representation o an audio signal
MY157499A (en) Method and apparatus for encoding and decoding image by using large transformation unit
DK3288268T3 (da) Fremgangsmåde og apparat til at signalere intraforudsigelse til store blokke til videokodere og afkodere
MY178139A (en) Audio decoder and method for providing a decoded audio information using an errorconcealment based on a time domain excitation signal
BR112012011084A2 (pt) decodificador para gerar um sinal de áudio multicanal, codificador para gerar uma representação codificada de um sinal de áudio multicanal, método de gerar um sinal de áudio multicanal, método de gerar uma representação codificada de um sinal de áudio multicanal, produto de programa de computador, fluxo de bits de áudio para um sinal de áudio multicanal e meio de armazenamento
ATE489015T1 (de) Verfahren zur herstellung von verschlüssen
MY178026A (en) Methods, encoder and decoder for linear predictive encoding and decoding of sound signals upon transition between frames having different sampling rates
MY180722A (en) Concept for encoding an audio signal and decoding an audio signal using speech related spectral shaping information
MX2015009747A (es) Decodificador para generar una señal de audio mejorada en frecuencia, metodo de decodificacion, codificador para generar una señal codificada y metodo de codificacion utilizando informacion secundaria de seleccion compacta.
ATE502380T1 (de) Verfahren, vorrichtung und programmcode zur umwandlung von stimmen
MX2016002561A (es) Decision sorda/sonora para procesamiento de voz.
EP2450881A4 (en) DEVICE FOR CODING AND DECODING AN AUDIO SIGNAL USING A WEIGHTED LINEAR PROGNOSIS TRANSFORM AND METHOD THEREFOR
WO2010090427A3 (ko) 오디오 신호의 부호화 및 복호화 방법 및 그 장치
ATE478417T1 (de) Verfahren und vorrichtung zum verarbeiten codierter audiodaten
MX2016004922A (es) Concepto para codificar una señal de audio y decodificar una señal de audio usando informacion determinista y de tipo ruido.
WO2013048171A3 (ko) 음성 신호 부호화 방법 및 음성 신호 복호화 방법 그리고 이를 이용하는 장치
CL2011002407A1 (es) Un metodo de codificacion, modificacion y sintesis de segmentos de voz
TW200620239A (en) Speech synthesis method capable of adjust prosody, apparatus, and its dialogue system