DE602008005641D1 - METHOD, DEVICE AND PROGRAM CODE FOR CONVERTING VOTES - Google Patents

METHOD, DEVICE AND PROGRAM CODE FOR CONVERTING VOTES

Info

Publication number
DE602008005641D1
DE602008005641D1 DE602008005641T DE602008005641T DE602008005641D1 DE 602008005641 D1 DE602008005641 D1 DE 602008005641D1 DE 602008005641 T DE602008005641 T DE 602008005641T DE 602008005641 T DE602008005641 T DE 602008005641T DE 602008005641 D1 DE602008005641 D1 DE 602008005641D1
Authority
DE
Germany
Prior art keywords
modelling
stage
glottal
vocal tract
converted
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
DE602008005641T
Other languages
German (de)
Inventor
Pozo Echezarreta Maria Arantzazu Del
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Fundacion Centro de Tecnologias de Interaccion Visual y Comunicaciones Vicomtech
Original Assignee
Fundacion Centro de Tecnologias de Interaccion Visual y Comunicaciones Vicomtech
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Fundacion Centro de Tecnologias de Interaccion Visual y Comunicaciones Vicomtech filed Critical Fundacion Centro de Tecnologias de Interaccion Visual y Comunicaciones Vicomtech
Publication of DE602008005641D1 publication Critical patent/DE602008005641D1/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0316Speech enhancement, e.g. noise reduction or echo cancellation by changing the amplitude
    • G10L21/0364Speech enhancement, e.g. noise reduction or echo cancellation by changing the amplitude for improving intelligibility
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/003Changing voice quality, e.g. pitch or formants
    • G10L21/007Changing voice quality, e.g. pitch or formants characterised by the process used
    • G10L21/013Adapting to target pitch
    • G10L2021/0135Voice conversion or morphing
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/04Time compression or expansion
    • G10L21/057Time compression or expansion for improving intelligibility
    • G10L2021/0575Aids for the handicapped in speaking

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Quality & Reliability (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
  • Measurement Of Mechanical Vibrations Or Ultrasonic Waves (AREA)
  • Numerical Control (AREA)
  • Auxiliary Devices For Music (AREA)
  • Circuit For Audible Band Transducer (AREA)

Abstract

A method of converting a source speakers speech signal into a converted speech signal, which comprises a stage of training using a given database of parallel source and target data. For each pitch period modelling a glottal waveform and a vocal tract filter to obtain a set of parameters comprising an excitation strength, parameters modelling a glottal waveform, and all-pole vocal tract filter coefficients. Defining a glottal vector to be converted; defining a vocal tract vector to be converted, obtaining an estimate of a glottal aspiration noise and estimating a vocal tract transformation function. The stage of modelling comprises: modelling said aspiration noise estimate by- modulating Gaussian noise with the said modelled glottal waveform and adjusting its energy to match that of the said aspiration noise estimate. The method further comprises a stage of conversion and a stage of synthesis.
DE602008005641T 2008-09-19 2008-09-19 METHOD, DEVICE AND PROGRAM CODE FOR CONVERTING VOTES Active DE602008005641D1 (en)

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/EP2008/062502 WO2010031437A1 (en) 2008-09-19 2008-09-19 Method and system of voice conversion

Publications (1)

Publication Number Publication Date
DE602008005641D1 true DE602008005641D1 (en) 2011-04-28

Family

ID=40277465

Family Applications (1)

Application Number Title Priority Date Filing Date
DE602008005641T Active DE602008005641D1 (en) 2008-09-19 2008-09-19 METHOD, DEVICE AND PROGRAM CODE FOR CONVERTING VOTES

Country Status (5)

Country Link
EP (1) EP2215632B1 (en)
AT (1) ATE502380T1 (en)
DE (1) DE602008005641D1 (en)
ES (1) ES2364005T3 (en)
WO (1) WO2010031437A1 (en)

Families Citing this family (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101901598A (en) * 2010-06-30 2010-12-01 北京捷通华声语音技术有限公司 Humming synthesis method and system
ES2364401B2 (en) * 2011-06-27 2011-12-23 Universidad Politécnica de Madrid METHOD AND SYSTEM FOR ESTIMATING PHYSIOLOGICAL PARAMETERS OF THE FONATION.
RU2510954C2 (en) * 2012-05-18 2014-04-10 Александр Юрьевич Бредихин Method of re-sounding audio materials and apparatus for realising said method
US9607610B2 (en) 2014-07-03 2017-03-28 Google Inc. Devices and methods for noise modulation in a universal vocoder synthesizer
WO2020062217A1 (en) * 2018-09-30 2020-04-02 Microsoft Technology Licensing, Llc Speech waveform generation
WO2020174356A1 (en) * 2019-02-25 2020-09-03 Technologies Of Voice Interface Ltd Speech interpretation device and system
EP3839947A1 (en) 2019-12-20 2021-06-23 SoundHound, Inc. Training a voice morphing apparatus
US11600284B2 (en) 2020-01-11 2023-03-07 Soundhound, Inc. Voice morphing apparatus having adjustable parameters
CN113780107B (en) * 2021-08-24 2024-03-01 电信科学技术第五研究所有限公司 Radio signal detection method based on deep learning dual-input network model

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR100809368B1 (en) * 2006-08-09 2008-03-05 한국과학기술원 Voice Color Conversion System using Glottal waveform

Also Published As

Publication number Publication date
EP2215632A1 (en) 2010-08-11
ES2364005T3 (en) 2011-08-22
WO2010031437A1 (en) 2010-03-25
EP2215632B1 (en) 2011-03-16
ATE502380T1 (en) 2011-04-15

Similar Documents

Publication Publication Date Title
DE602008005641D1 (en) METHOD, DEVICE AND PROGRAM CODE FOR CONVERTING VOTES
DE602006017673D1 (en) METHOD AND DEVICE FOR VECTOR-QUANTIZING A SPEKTRALENVELOP REPRESENTATION
ATE542191T1 (en) METHOD FOR IDENTIFYING A PERSON BY HIS IRIS
EP4246516A3 (en) Device and method for reducing quantization noise in a time-domain decoder
ATE549670T1 (en) ELECTRONIC DEVICE AND METHOD FOR PROVIDING AN IMPROVED SLEEP OPERATION MODE
ATE403928T1 (en) VOICE DIALOGUE CONTROL BASED ON SIGNAL PREPROCESSING
ATE492873T1 (en) APPARATUS AND PROGRAM FOR SOUND ANALYSIS
WO2011133766A3 (en) Methods and systems for training dictation-based speech-to-text systems using recorded samples
DE602006010395D1 (en) Use of child-oriented language to automatically generate speech segmentation and a model-based speech recognition system
WO2010148141A3 (en) Apparatus and method for speech analysis
EP2244140A3 (en) Process Simulation Utilizing Component-Specific Consumption Data
ATE524028T1 (en) METHOD FOR FINE ADJUSTMENT OF A HEARING AID AND HEARING AID
EP2301013A4 (en) Systems and methods of improving automated speech recognition accuracy using statistical analysis of search terms
WO2010117712A3 (en) Systems and methods for measuring speech intelligibility
DE602005025057D1 (en) Method for parameterizing a mesh lattice e
EP2214123A3 (en) Model-based comparative measure for vector sequences and word spotting using same
WO2014182453A3 (en) Method and apparatus for training a voice recognition model database
CN104091592A (en) Voice conversion system based on hidden Gaussian random field
JP2019101093A5 (en) Speech synthesis method, speech synthesis system and program
CN102426834A (en) Method for testing rhythm level of spoken English
EP3121808A3 (en) System and method of modeling characteristics of a musical instrument
WO2010123483A3 (en) Analyzing the prosody of speech
DE502007001672D1 (en) Hearing aid adaptation method
ATE329347T1 (en) METHOD FOR MODELING AMOUNTS OF HARMONICS IN SPEECH
WO2012070866A3 (en) Speech signal encoding method and speech signal decoding method