MX2011009873A - Coding, modification and synthesis of speech segments - Google Patents
Coding, modification and synthesis of speech segments
- Publication number
- MX2011009873A
- Authority
- MX
- Mexico
- Prior art keywords
- synthesis
- phase
- analysis
- modification
- frames
- Prior art date
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
- G10L13/02—Methods for producing synthetic speech; Speech synthesisers
- G10L13/033—Voice editing, e.g. manipulating the voice of the synthesiser
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
- G10L13/06—Elementary speech units used in speech synthesisers; Concatenation rules
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/08—Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters
- G10L19/093—Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters using sinusoidal excitation models
Landscapes
- Engineering & Computer Science (AREA)
- Acoustics & Sound (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Computational Linguistics (AREA)
- Multimedia (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
- Measurement Of Mechanical Vibrations Or Ultrasonic Waves (AREA)
- Electrophonic Musical Instruments (AREA)
- Transmission Systems Not Characterized By The Medium Used For Transmission (AREA)
- Complex Calculations (AREA)
- Stereophonic System (AREA)
Abstract
Method for the analysis, modification and synthesis of a speech signal, comprising: a stage of locating analysis windows by means of an iterative process that determines the phase of the first sinusoidal component and compares the phase value of that component with a predetermined value; a stage of selecting the analysis frames corresponding to an allophone and readjusting the duration and the fundamental frequency according to certain thresholds; and a stage of generating synthetic speech from synthesis frames, taking as the spectral information of each synthesis frame the information of the nearest analysis frame, and taking as many synthesis frames as the synthetic signal has periods. The method allows the analysis windows to be located coherently within the periods of the signal, and the synthesis instants to be generated exactly and synchronously with the fundamental period.
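The window-localization step in the abstract can be illustrated with a minimal Python sketch. It is not the patented procedure: the abstract only states that the phase of the first sinusoidal component is determined iteratively and compared with a predetermined value. Everything else here is an assumption made for illustration, including the function names `first_component_phase` and `locate_analysis_window`, the Hann-weighted correlation used to measure the phase, the two-period window span, the target phase of 0 radians, the tolerance, and the shift rule.

```python
import cmath
import math


def first_component_phase(signal, center, f0, fs):
    """Phase (radians) of the f0 component, measured relative to `center`.

    Assumption: a Hann-weighted correlation over roughly two periods.
    """
    half_width = int(round(fs / f0))
    acc = 0j
    for n in range(center - half_width, center + half_width + 1):
        if 0 <= n < len(signal):
            m = n - center
            weight = 0.5 * (1.0 + math.cos(math.pi * m / (half_width + 1)))
            acc += weight * signal[n] * cmath.exp(-2j * math.pi * f0 * m / fs)
    return cmath.phase(acc)


def locate_analysis_window(signal, start, f0, fs, target_phase=0.0,
                           tolerance=0.1, max_iterations=20):
    """Shift the window center until the measured phase is near target_phase."""
    period = fs / f0  # fundamental period in samples
    center = start
    for _ in range(max_iterations):
        phase = first_component_phase(signal, center, f0, fs)
        # Wrap the phase error into [-pi, pi).
        error = (phase - target_phase + math.pi) % (2.0 * math.pi) - math.pi
        if abs(error) <= tolerance:
            break
        # An error of `error` radians corresponds to error / (2*pi) periods.
        shift = int(round(-error / (2.0 * math.pi) * period))
        if shift == 0:
            break  # cannot get closer at integer-sample resolution
        center += shift
    return center


# Hypothetical test signal: a 100 Hz cosine whose peaks fall at 37 + 80*k.
fs, f0, peak = 8000, 100.0, 37
signal = [math.cos(2 * math.pi * f0 * (n - peak) / fs) for n in range(800)]
center = locate_analysis_window(signal, start=200, f0=f0, fs=fs)
print(center, (center - peak) % int(fs / f0))
```

With a target phase of 0, the located center falls on a peak of the first component, so every window lands at the same position within its period. That is the kind of coherent placement the abstract describes.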
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
ES200931212A ES2374008B1 (es) | 2009-12-21 | 2009-12-21 | Coding, modification and synthesis of speech segments |
PCT/EP2010/070353 WO2011076779A1 (en) | 2009-12-21 | 2010-12-21 | Coding, modification and synthesis of speech segments |
Publications (1)
Publication Number | Publication Date |
---|---|
MX2011009873A (es) | 2011-09-30 |
Family
ID=43735039
Family Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
MX2011009873A (es) | 2009-12-21 | 2010-12-21 | Coding, modification and synthesis of speech segments |
Country Status (10)
Country | Link |
---|---|
US (1) | US8812324B2 (es) |
EP (1) | EP2517197B1 (es) |
AR (1) | AR079623A1 (es) |
BR (1) | BR112012015144A2 (es) |
CL (1) | CL2011002407A1 (es) |
CO (1) | CO6362071A2 (es) |
ES (2) | ES2374008B1 (es) |
MX (1) | MX2011009873A (es) |
PE (1) | PE20121044A1 (es) |
WO (1) | WO2011076779A1 (es) |
Families Citing this family (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
FR2961938B1 (fr) * | 2010-06-25 | 2013-03-01 | Inst Nat Rech Inf Automat | Improved digital audio synthesizer |
ES2401014B1 (es) * | 2011-09-28 | 2014-07-01 | Telefónica, S.A. | Method and system for the synthesis of speech segments |
EP4372602A3 (en) | 2013-01-08 | 2024-07-10 | Dolby International AB | Model based prediction in a critically sampled filterbank |
ES2664968T3 (es) * | 2013-02-05 | 2018-04-24 | Telefonaktiebolaget Lm Ericsson (Publ) | Audio frame loss concealment |
JP6733644B2 (ja) * | 2017-11-29 | 2020-08-05 | ヤマハ株式会社 | Speech synthesis method, speech synthesis system, and program |
KR102108906B1 (ko) * | 2018-06-18 | 2020-05-12 | 엘지전자 주식회사 | Speech synthesis device |
Family Cites Families (10)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPH05307399A (ja) * | 1992-05-01 | 1993-11-19 | Sony Corp | Speech analysis method |
US5577160A (en) * | 1992-06-24 | 1996-11-19 | Sumitomo Electric Industries, Inc. | Speech analysis apparatus for extracting glottal source parameters and formant parameters |
US6064960A (en) * | 1997-12-18 | 2000-05-16 | Apple Computer, Inc. | Method and apparatus for improved duration modeling of phonemes |
US6449592B1 (en) * | 1999-02-26 | 2002-09-10 | Qualcomm Incorporated | Method and apparatus for tracking the phase of a quasi-periodic signal |
US7315815B1 (en) * | 1999-09-22 | 2008-01-01 | Microsoft Corporation | LPC-harmonic vocoder with superframe structure |
US20030158734A1 (en) * | 1999-12-16 | 2003-08-21 | Brian Cruickshank | Text to speech conversion using word concatenation |
EP1256931A1 (en) * | 2001-05-11 | 2002-11-13 | Sony France S.A. | Method and apparatus for voice synthesis and robot apparatus |
US7822599B2 (en) * | 2002-04-19 | 2010-10-26 | Koninklijke Philips Electronics N.V. | Method for synthesizing speech |
JP4179268B2 (ja) * | 2004-11-25 | 2008-11-12 | カシオ計算機株式会社 | Data synthesis apparatus and program for data synthesis processing |
ATE443318T1 (de) * | 2005-07-14 | 2009-10-15 | Koninkl Philips Electronics Nv | Audio signal synthesis |
2009
- 2009-12-21 ES ES200931212A, publication ES2374008B1 (es): not active, Expired - Fee Related
2010
- 2010-12-16 AR ARP100104683A, publication AR079623A1 (es): status unknown
- 2010-12-21 US US13/254,479, publication US8812324B2 (en): not active, Expired - Fee Related
- 2010-12-21 EP EP10801161.0A, publication EP2517197B1 (en): not active, Not-in-force
- 2010-12-21 PE PE2011001989A, publication PE20121044A1 (es): not active, Application Discontinuation
- 2010-12-21 WO PCT/EP2010/070353, publication WO2011076779A1 (en): active, Application Filing
- 2010-12-21 MX MX2011009873A, publication MX2011009873A (es): active, IP Right Grant
- 2010-12-21 BR BR112012015144A, publication BR112012015144A2 (pt): not active, IP Right Cessation
- 2010-12-21 ES ES10801161.0T, publication ES2532887T3 (es): active
2011
- 2011-09-12 CO CO11117745A, publication CO6362071A2 (es): not active, Application Discontinuation
- 2011-09-29 CL CL2011002407A, publication CL2011002407A1 (es): status unknown
Also Published As
Publication number | Publication date |
---|---|
ES2532887T3 (es) | 2015-04-01 |
US20110320207A1 (en) | 2011-12-29 |
CO6362071A2 (es) | 2012-01-20 |
WO2011076779A1 (en) | 2011-06-30 |
EP2517197A1 (en) | 2012-10-31 |
ES2374008A1 (es) | 2012-02-13 |
AR079623A1 (es) | 2012-02-08 |
BR112012015144A2 (pt) | 2019-09-24 |
CL2011002407A1 (es) | 2012-03-16 |
US8812324B2 (en) | 2014-08-19 |
PE20121044A1 (es) | 2012-08-30 |
ES2374008B1 (es) | 2012-12-28 |
EP2517197B1 (en) | 2014-12-17 |
Similar Documents
Publication | Title | Publication Date |
---|---|---|
MX2011009873A (es) | Coding, modification and synthesis of speech segments. | |
MX2015016892A (es) | Apparatus and method for fading an MDCT spectrum to white noise prior to applying FDNS. | |
WO2009096713A3 (ko) | Method and apparatus for encoding and decoding an audio signal using adaptive LPC coefficient interpolation | |
MX2015009964A (es) | Improved frame loss correction when decoding a signal. | |
CN107533847B (zh) | Audio encoder and audio decoder and corresponding methods | |
MY175978A (en) | Apparatus and method for decoding and encoding an audio signal using adaptive spectral tile selection | |
WO2011059255A3 (en) | An apparatus for processing an audio signal and method thereof | |
EP2460158A4 (en) | METHOD AND APPARATUS FOR PROCESSING AUDIO SIGNAL | |
MY169354A (en) | Method and apparatus for compressing and decompressing a higher order ambisonics representation for a sound field | |
CA2998689C (en) | Encoder and method for encoding an audio signal with reduced background noise using linear predictive coding | |
MX356036B (es) | Audio decoder and method for providing decoded audio information using an error concealment that modifies a time-domain excitation signal. | |
MX350691B (es) | Encoder, decoder and methods for backward-compatible dynamic adaptation of time/frequency resolution in spatial audio object coding. | |
MX2016002561A (es) | Unvoiced/voiced decision for speech processing. | |
MY160265A (en) | Apparatus and Method for Encoding and Decoding an Audio Signal Using an Aligned Look-Ahead Portion | |
MY172752A (en) | Decoder for generating a frequency enhanced audio signal, method of decoding encoder for generating an encoded signal and method of encoding using compact selection side information | |
MY179139A (en) | Noise filling in multichannel audio coding | |
EP4336500A3 (en) | Methods, encoder and decoder for linear predictive encoding and decoding of sound signals upon transition between frames having different sampling rates | |
AU2012272779A8 (en) | Method and apparatus for motion compensation prediction | |
MX355091B (es) | Concept for encoding an audio signal and decoding an audio signal using speech-related spectral shaping information. | |
MY166226A (en) | Method and apparatus for predicting high band excitation signal | |
MX2016004922A (es) | Concept for encoding an audio signal and decoding an audio signal using deterministic and noise-like information. | |
EP4235661A3 (en) | Comfort noise generation method and device | |
MY178143A (en) | Audio decoder, method and computer program using a zero-input-response to obtain a smooth transition | |
ATE443318T1 (de) | Audio signal synthesis | |
MX368973B (es) | Improved frame loss correction with voice information. |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
FG | Grant or registration |