MX2011009873A - Codificacion, modificacion y sintesis de segmentos de voz. - Google Patents

Codificacion, modificacion y sintesis de segmentos de voz.

Info

Publication number
MX2011009873A
MX2011009873A MX2011009873A MX2011009873A MX2011009873A MX 2011009873 A MX2011009873 A MX 2011009873A MX 2011009873 A MX2011009873 A MX 2011009873A MX 2011009873 A MX2011009873 A MX 2011009873A MX 2011009873 A MX2011009873 A MX 2011009873A
Authority
MX
Mexico
Prior art keywords
synthesis
phase
analysis
modification
frames
Prior art date
Application number
MX2011009873A
Other languages
English (en)
Inventor
Crespo Miguel Angel Rodriguez
Sardina Jose Gregorio Escalada
Lopez De Vicuna Ana Armenta
Original Assignee
Telefonica Sa
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Telefonica Sa filed Critical Telefonica Sa
Publication of MX2011009873A publication Critical patent/MX2011009873A/es

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L13/00Speech synthesis; Text to speech systems
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L13/00Speech synthesis; Text to speech systems
    • G10L13/02Methods for producing synthetic speech; Speech synthesisers
    • G10L13/033Voice editing, e.g. manipulating the voice of the synthesiser
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L13/00Speech synthesis; Text to speech systems
    • G10L13/06Elementary speech units used in speech synthesisers; Concatenation rules
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/08Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters
    • G10L19/093Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters using sinusoidal excitation models

Landscapes

  • Engineering & Computer Science (AREA)
  • Acoustics & Sound (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Computational Linguistics (AREA)
  • Multimedia (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
  • Measurement Of Mechanical Vibrations Or Ultrasonic Waves (AREA)
  • Electrophonic Musical Instruments (AREA)
  • Transmission Systems Not Characterized By The Medium Used For Transmission (AREA)
  • Complex Calculations (AREA)
  • Stereophonic System (AREA)

Abstract

Método de análisis, modificación y síntesis de señal de voz que comprende una fase de localización de ventanas de análisis mediante un proceso iterativo de determinación de la fase de la primera componente sinusoidal y comparación entre el valor de fase de dicha componente y un valor predeterminado, una fase de selección de tramas de análisis correspondientes a un alófono y reajuste de la duración y la frecuencia fundamental según unos umbrales y una fase de generación de voz sintética a partir de las tramas de síntesis tomando como información espectral de la trama de síntesis la información de la trama de análisis más cercana y tomando tantas tramas de síntesis como periodos tenga la señal sintética. El método permite una localización coherente de las ventanas de análisis dentro de los periodos de la señal y generar de forma exacta los instantes de síntesis de manera síncrona con el periodo fundamental.
MX2011009873A 2009-12-21 2010-12-21 Codificacion, modificacion y sintesis de segmentos de voz. MX2011009873A (es)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
ES200931212A ES2374008B1 (es) 2009-12-21 2009-12-21 Codificación, modificación y síntesis de segmentos de voz.
PCT/EP2010/070353 WO2011076779A1 (en) 2009-12-21 2010-12-21 Coding, modification and synthesis of speech segments

Publications (1)

Publication Number Publication Date
MX2011009873A true MX2011009873A (es) 2011-09-30

Family

ID=43735039

Family Applications (1)

Application Number Title Priority Date Filing Date
MX2011009873A MX2011009873A (es) 2009-12-21 2010-12-21 Codificacion, modificacion y sintesis de segmentos de voz.

Country Status (10)

Country Link
US (1) US8812324B2 (es)
EP (1) EP2517197B1 (es)
AR (1) AR079623A1 (es)
BR (1) BR112012015144A2 (es)
CL (1) CL2011002407A1 (es)
CO (1) CO6362071A2 (es)
ES (2) ES2374008B1 (es)
MX (1) MX2011009873A (es)
PE (1) PE20121044A1 (es)
WO (1) WO2011076779A1 (es)

Families Citing this family (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
FR2961938B1 (fr) * 2010-06-25 2013-03-01 Inst Nat Rech Inf Automat Synthetiseur numerique audio ameliore
ES2401014B1 (es) * 2011-09-28 2014-07-01 Telef�Nica, S.A. Método y sistema para la síntesis de segmentos de voz
EP4372602A3 (en) 2013-01-08 2024-07-10 Dolby International AB Model based prediction in a critically sampled filterbank
ES2664968T3 (es) * 2013-02-05 2018-04-24 Telefonaktiebolaget Lm Ericsson (Publ) Encubrimiento de pérdida de trama de audio
JP6733644B2 (ja) * 2017-11-29 2020-08-05 ヤマハ株式会社 音声合成方法、音声合成システムおよびプログラム
KR102108906B1 (ko) * 2018-06-18 2020-05-12 엘지전자 주식회사 음성 합성 장치

Family Cites Families (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH05307399A (ja) * 1992-05-01 1993-11-19 Sony Corp 音声分析方式
US5577160A (en) * 1992-06-24 1996-11-19 Sumitomo Electric Industries, Inc. Speech analysis apparatus for extracting glottal source parameters and formant parameters
US6064960A (en) * 1997-12-18 2000-05-16 Apple Computer, Inc. Method and apparatus for improved duration modeling of phonemes
US6449592B1 (en) * 1999-02-26 2002-09-10 Qualcomm Incorporated Method and apparatus for tracking the phase of a quasi-periodic signal
US7315815B1 (en) * 1999-09-22 2008-01-01 Microsoft Corporation LPC-harmonic vocoder with superframe structure
US20030158734A1 (en) * 1999-12-16 2003-08-21 Brian Cruickshank Text to speech conversion using word concatenation
EP1256931A1 (en) * 2001-05-11 2002-11-13 Sony France S.A. Method and apparatus for voice synthesis and robot apparatus
US7822599B2 (en) * 2002-04-19 2010-10-26 Koninklijke Philips Electronics N.V. Method for synthesizing speech
JP4179268B2 (ja) * 2004-11-25 2008-11-12 カシオ計算機株式会社 データ合成装置およびデータ合成処理のプログラム
ATE443318T1 (de) * 2005-07-14 2009-10-15 Koninkl Philips Electronics Nv Audiosignalsynthese

Also Published As

Publication number Publication date
ES2532887T3 (es) 2015-04-01
US20110320207A1 (en) 2011-12-29
CO6362071A2 (es) 2012-01-20
WO2011076779A1 (en) 2011-06-30
EP2517197A1 (en) 2012-10-31
ES2374008A1 (es) 2012-02-13
AR079623A1 (es) 2012-02-08
BR112012015144A2 (pt) 2019-09-24
CL2011002407A1 (es) 2012-03-16
US8812324B2 (en) 2014-08-19
PE20121044A1 (es) 2012-08-30
ES2374008B1 (es) 2012-12-28
EP2517197B1 (en) 2014-12-17

Similar Documents

Publication Publication Date Title
MX2011009873A (es) Codificacion, modificacion y sintesis de segmentos de voz.
MX2015016892A (es) Aparato y metodo para realizar un desvanecimiento de un espectro mdct a ruido blanco antes de aplicar fdns.
WO2009096713A3 (ko) 적응적 lpc 계수 보간을 이용한 오디오 신호의 부호화, 복호화 방법 및 장치
MX2015009964A (es) Corrección mejorada de pérdida de bloqueo cuando se decodifica una señal.
CN107533847B (zh) 音频编码器和音频解码器及对应的方法
MY175978A (en) Apparatus and method for decoding and encoding an audio signal using adaptive spectral tile selection
WO2011059255A3 (en) An apparatus for processing an audio signal and method thereof
EP2460158A4 (en) METHOD AND APPARATUS FOR PROCESSING AUDIO SIGNAL
MY169354A (en) Method and apparatus for compressing and decompressing a higher order ambisonics representation for a sound field
CA2998689C (en) Encoder and method for encoding an audio signal with reduced background noise using linear predictive coding
MX356036B (es) Decodificador de audio y método para proveer una información de audio decodificada usando un ocultamiento de error que modifica una señal de excitación de dominio de tiempo.
MX350691B (es) Codificador, decodificador y métodos para la adaptación dinámica inversa compatible de la resolución en tiempo/frecuencia en la codificación espacial de objetos de audio.
MX2016002561A (es) Decision sorda/sonora para procesamiento de voz.
MY160265A (en) Apparatus and Method for Encoding and Decoding an Audio Signal Using an Aligned Look-Ahead Portion
MY172752A (en) Decoder for generating a frequency enhanced audio signal, method of decoding encoder for generating an encoded signal and method of encoding using compact selection side information
MY179139A (en) Noise filling in multichannel audio coding
EP4336500A3 (en) Methods, encoder and decoder for linear predictive encoding and decoding of sound signals upon transition between frames having different sampling rates
AU2012272779A8 (en) Method and apparatus for motion compensation prediction
MX355091B (es) Concepto para codificar una señal de audio y decodificar una señal de audio usando información de conformación espectral relacionada con la voz.
MY166226A (en) Method and apparatus for predicting high band excitation signal
MX2016004922A (es) Concepto para codificar una señal de audio y decodificar una señal de audio usando informacion determinista y de tipo ruido.
EP4235661A3 (en) Comfort noise generation method and device
MY178143A (en) Audio decoder, method and computer program using a zero-input-response to obtain a smooth transition
ATE443318T1 (de) Audiosignalsynthese
MX368973B (es) Corrección de pérdida de trama mejorada con información de voz.

Legal Events

Date Code Title Description
FG Grant or registration