DK2242045T3 - Talesyntese og kodningsfremgangsmåder - Google Patents
Talesyntese og kodningsfremgangsmåderInfo
- Publication number
- DK2242045T3 DK2242045T3 DK09158056.3T DK09158056T DK2242045T3 DK 2242045 T3 DK2242045 T3 DK 2242045T3 DK 09158056 T DK09158056 T DK 09158056T DK 2242045 T3 DK2242045 T3 DK 2242045T3
- Authority
- DK
- Denmark
- Prior art keywords
- target
- frames
- normalised
- residual frames
- gci
- Prior art date
Links
- 238000000034 method Methods 0.000 title abstract 2
- 230000015572 biosynthetic process Effects 0.000 title 1
- 238000003786 synthesis reaction Methods 0.000 title 1
- 230000001360 synchronised effect Effects 0.000 abstract 4
- 230000005284 excitation Effects 0.000 abstract 3
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/08—Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters
- G10L19/12—Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters the excitation function being a code excitation, e.g. in code excited linear prediction [CELP] vocoders
- G10L19/125—Pitch excitation, e.g. pitch synchronous innovation CELP [PSI-CELP]
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
- G10L13/02—Methods for producing synthetic speech; Speech synthesisers
- G10L13/033—Voice editing, e.g. manipulating the voice of the synthesiser
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
- G10L13/02—Methods for producing synthetic speech; Speech synthesisers
- G10L13/04—Details of speech synthesis systems, e.g. synthesiser structure or memory management
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
- G10L13/06—Elementary speech units used in speech synthesisers; Concatenation rules
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/08—Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters
- G10L19/12—Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters the excitation function being a code excitation, e.g. in code excited linear prediction [CELP] vocoders
Landscapes
- Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Signal Processing (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
EP09158056A EP2242045B1 (en) | 2009-04-16 | 2009-04-16 | Speech synthesis and coding methods |
Publications (1)
Publication Number | Publication Date |
---|---|
DK2242045T3 true DK2242045T3 (da) | 2012-09-24 |
Family
ID=40846430
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
DK09158056.3T DK2242045T3 (da) | 2009-04-16 | 2009-04-16 | Talesyntese og kodningsfremgangsmåder |
Country Status (10)
Country | Link |
---|---|
US (1) | US8862472B2 (da) |
EP (1) | EP2242045B1 (da) |
JP (1) | JP5581377B2 (da) |
KR (1) | KR101678544B1 (da) |
CA (1) | CA2757142C (da) |
DK (1) | DK2242045T3 (da) |
IL (1) | IL215628A (da) |
PL (1) | PL2242045T3 (da) |
RU (1) | RU2557469C2 (da) |
WO (1) | WO2010118953A1 (da) |
Families Citing this family (17)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2011066844A1 (en) * | 2009-12-02 | 2011-06-09 | Agnitio, S.L. | Obfuscated speech synthesis |
JP5591080B2 (ja) * | 2010-11-26 | 2014-09-17 | 三菱電機株式会社 | データ圧縮装置及びデータ処理システム及びコンピュータプログラム及びデータ圧縮方法 |
KR101402805B1 (ko) * | 2012-03-27 | 2014-06-03 | 광주과학기술원 | 음성분석장치, 음성합성장치, 및 음성분석합성시스템 |
US9978359B1 (en) * | 2013-12-06 | 2018-05-22 | Amazon Technologies, Inc. | Iterative text-to-speech with user feedback |
WO2015183254A1 (en) * | 2014-05-28 | 2015-12-03 | Interactive Intelligence, Inc. | Method for forming the excitation signal for a glottal pulse model based parametric speech synthesis system |
US10255903B2 (en) | 2014-05-28 | 2019-04-09 | Interactive Intelligence Group, Inc. | Method for forming the excitation signal for a glottal pulse model based parametric speech synthesis system |
US10014007B2 (en) | 2014-05-28 | 2018-07-03 | Interactive Intelligence, Inc. | Method for forming the excitation signal for a glottal pulse model based parametric speech synthesis system |
US9607610B2 (en) * | 2014-07-03 | 2017-03-28 | Google Inc. | Devices and methods for noise modulation in a universal vocoder synthesizer |
JP6293912B2 (ja) * | 2014-09-19 | 2018-03-14 | 株式会社東芝 | 音声合成装置、音声合成方法およびプログラム |
WO2017061985A1 (en) * | 2015-10-06 | 2017-04-13 | Interactive Intelligence Group, Inc. | Method for forming the excitation signal for a glottal pulse model based parametric speech synthesis system |
US10140089B1 (en) | 2017-08-09 | 2018-11-27 | 2236008 Ontario Inc. | Synthetic speech for in vehicle communication |
US10347238B2 (en) | 2017-10-27 | 2019-07-09 | Adobe Inc. | Text-based insertion and replacement in audio narration |
CN108281150B (zh) * | 2018-01-29 | 2020-11-17 | 上海泰亿格康复医疗科技股份有限公司 | 一种基于微分声门波模型的语音变调变嗓音方法 |
US10770063B2 (en) | 2018-04-13 | 2020-09-08 | Adobe Inc. | Real-time speaker-dependent neural vocoder |
CN109036375B (zh) * | 2018-07-25 | 2023-03-24 | 腾讯科技(深圳)有限公司 | 语音合成方法、模型训练方法、装置和计算机设备 |
CN112634914B (zh) * | 2020-12-15 | 2024-03-29 | 中国科学技术大学 | 基于短时谱一致性的神经网络声码器训练方法 |
CN113539231B (zh) * | 2020-12-30 | 2024-06-18 | 腾讯科技(深圳)有限公司 | 音频处理方法、声码器、装置、设备及存储介质 |
Family Cites Families (16)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPS6423300A (en) * | 1987-07-17 | 1989-01-25 | Ricoh Kk | Spectrum generation system |
US5754976A (en) * | 1990-02-23 | 1998-05-19 | Universite De Sherbrooke | Algebraic codebook with signal-selected pulse amplitude/position combinations for fast coding of speech |
EP0481107B1 (en) * | 1990-10-16 | 1995-09-06 | International Business Machines Corporation | A phonetic Hidden Markov Model speech synthesizer |
DE69203186T2 (de) * | 1991-09-20 | 1996-02-01 | Philips Electronics Nv | Verarbeitungsgerät für die menschliche Sprache zum Detektieren des Schliessens der Stimmritze. |
JPH06250690A (ja) * | 1993-02-26 | 1994-09-09 | N T T Data Tsushin Kk | 振幅特徴抽出装置及び合成音声振幅制御装置 |
JP3093113B2 (ja) * | 1994-09-21 | 2000-10-03 | 日本アイ・ビー・エム株式会社 | 音声合成方法及びシステム |
JP3747492B2 (ja) * | 1995-06-20 | 2006-02-22 | ソニー株式会社 | 音声信号の再生方法及び再生装置 |
US6304846B1 (en) * | 1997-10-22 | 2001-10-16 | Texas Instruments Incorporated | Singing voice synthesis |
JP3268750B2 (ja) * | 1998-01-30 | 2002-03-25 | 株式会社東芝 | 音声合成方法及びシステム |
US6631363B1 (en) * | 1999-10-11 | 2003-10-07 | I2 Technologies Us, Inc. | Rules-based notification system |
DE10041512B4 (de) * | 2000-08-24 | 2005-05-04 | Infineon Technologies Ag | Verfahren und Vorrichtung zur künstlichen Erweiterung der Bandbreite von Sprachsignalen |
DE60127274T2 (de) * | 2000-09-15 | 2007-12-20 | Lernout & Hauspie Speech Products N.V. | Schnelle wellenformsynchronisation für die verkettung und zeitskalenmodifikation von sprachsignalen |
JP2004117662A (ja) * | 2002-09-25 | 2004-04-15 | Matsushita Electric Ind Co Ltd | 音声合成システム |
AU2003284654A1 (en) * | 2002-11-25 | 2004-06-18 | Matsushita Electric Industrial Co., Ltd. | Speech synthesis method and speech synthesis device |
US7842874B2 (en) * | 2006-06-15 | 2010-11-30 | Massachusetts Institute Of Technology | Creating music by concatenative synthesis |
US8140326B2 (en) * | 2008-06-06 | 2012-03-20 | Fuji Xerox Co., Ltd. | Systems and methods for reducing speech intelligibility while preserving environmental sounds |
-
2009
- 2009-04-16 DK DK09158056.3T patent/DK2242045T3/da active
- 2009-04-16 PL PL09158056T patent/PL2242045T3/pl unknown
- 2009-04-16 EP EP09158056A patent/EP2242045B1/en not_active Not-in-force
-
2010
- 2010-03-30 KR KR1020117027296A patent/KR101678544B1/ko active IP Right Grant
- 2010-03-30 RU RU2011145669/08A patent/RU2557469C2/ru not_active IP Right Cessation
- 2010-03-30 US US13/264,571 patent/US8862472B2/en not_active Expired - Fee Related
- 2010-03-30 CA CA2757142A patent/CA2757142C/en not_active Expired - Fee Related
- 2010-03-30 WO PCT/EP2010/054244 patent/WO2010118953A1/en active Application Filing
- 2010-03-30 JP JP2012505115A patent/JP5581377B2/ja not_active Expired - Fee Related
-
2011
- 2011-10-09 IL IL215628A patent/IL215628A/en not_active IP Right Cessation
Also Published As
Publication number | Publication date |
---|---|
IL215628A0 (en) | 2012-01-31 |
US20120123782A1 (en) | 2012-05-17 |
EP2242045B1 (en) | 2012-06-27 |
EP2242045A1 (en) | 2010-10-20 |
RU2557469C2 (ru) | 2015-07-20 |
KR101678544B1 (ko) | 2016-11-22 |
CA2757142A1 (en) | 2010-10-21 |
PL2242045T3 (pl) | 2013-02-28 |
JP2012524288A (ja) | 2012-10-11 |
RU2011145669A (ru) | 2013-05-27 |
WO2010118953A1 (en) | 2010-10-21 |
US8862472B2 (en) | 2014-10-14 |
CA2757142C (en) | 2017-11-07 |
KR20120040136A (ko) | 2012-04-26 |
IL215628A (en) | 2013-11-28 |
JP5581377B2 (ja) | 2014-08-27 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
DK2242045T3 (da) | Talesyntese og kodningsfremgangsmåder | |
MY175978A (en) | Apparatus and method for decoding and encoding an audio signal using adaptive spectral tile selection | |
ATE527433T1 (de) | Abschlussanordnung des toten strangs mit einspritzsystem und verfahren | |
WO2010087614A3 (ko) | 오디오 신호의 부호화 및 복호화 방법 및 그 장치 | |
WO2006060563A3 (en) | Apparatus and method for producing chlorine dioxide | |
WO2011059254A3 (en) | An apparatus for processing a signal and method thereof | |
MX356036B (es) | Decodificador de audio y método para proveer una información de audio decodificada usando un ocultamiento de error que modifica una señal de excitación de dominio de tiempo. | |
ZA201006403B (en) | Apparatus and method for converting an audio signal into a parameterized representaion,apparatus and method for modifying a paramerized representation,apparatus and mrthod for synthesizing a parameterized representation o an audio signal | |
MY157499A (en) | Method and apparatus for encoding and decoding image by using large transformation unit | |
DK3288268T3 (da) | Fremgangsmåde og apparat til at signalere intraforudsigelse til store blokke til videokodere og afkodere | |
MY178139A (en) | Audio decoder and method for providing a decoded audio information using an errorconcealment based on a time domain excitation signal | |
BR112012011084A2 (pt) | decodificador para gerar um sinal de áudio multicanal, codificador para gerar uma representação codificada de um sinal de áudio multicanal, método de gerar um sinal de áudio multicanal, método de gerar uma representação codificada de um sinal de áudio multicanal, produto de programa de computador, fluxo de bits de áudio para um sinal de áudio multicanal e meio de armazenamento | |
ATE489015T1 (de) | Verfahren zur herstellung von verschlüssen | |
MY178026A (en) | Methods, encoder and decoder for linear predictive encoding and decoding of sound signals upon transition between frames having different sampling rates | |
MY180722A (en) | Concept for encoding an audio signal and decoding an audio signal using speech related spectral shaping information | |
MX2015009747A (es) | Decodificador para generar una señal de audio mejorada en frecuencia, metodo de decodificacion, codificador para generar una señal codificada y metodo de codificacion utilizando informacion secundaria de seleccion compacta. | |
ATE502380T1 (de) | Verfahren, vorrichtung und programmcode zur umwandlung von stimmen | |
MX2016002561A (es) | Decision sorda/sonora para procesamiento de voz. | |
EP2450881A4 (en) | DEVICE FOR CODING AND DECODING AN AUDIO SIGNAL USING A WEIGHTED LINEAR PROGNOSIS TRANSFORM AND METHOD THEREFOR | |
WO2010090427A3 (ko) | 오디오 신호의 부호화 및 복호화 방법 및 그 장치 | |
ATE478417T1 (de) | Verfahren und vorrichtung zum verarbeiten codierter audiodaten | |
MX2016004922A (es) | Concepto para codificar una señal de audio y decodificar una señal de audio usando informacion determinista y de tipo ruido. | |
WO2013048171A3 (ko) | 음성 신호 부호화 방법 및 음성 신호 복호화 방법 그리고 이를 이용하는 장치 | |
CL2011002407A1 (es) | Un metodo de codificacion, modificacion y sintesis de segmentos de voz | |
TW200620239A (en) | Speech synthesis method capable of adjust prosody, apparatus, and its dialogue system |