DE602008005641D1 - METHOD, DEVICE AND PROGRAM CODE FOR CONVERTING VOTES - Google Patents

METHOD, DEVICE AND PROGRAM CODE FOR CONVERTING VOTES

Info

Publication number
DE602008005641D1
DE602008005641D1 DE602008005641T DE602008005641T DE602008005641D1 DE 602008005641 D1 DE602008005641 D1 DE 602008005641D1 DE 602008005641 T DE602008005641 T DE 602008005641T DE 602008005641 T DE602008005641 T DE 602008005641T DE 602008005641 D1 DE602008005641 D1 DE 602008005641D1
Authority
DE
Germany
Prior art keywords
modelling
stage
glottal
vocal tract
converted
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
DE602008005641T
Other languages
German (de)
Inventor
Pozo Echezarreta Maria Arantzazu Del
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Fundacion Centro de Tecnologias de Interaccion Visual y Comunicaciones Vicomtech
Original Assignee
Fundacion Centro de Tecnologias de Interaccion Visual y Comunicaciones Vicomtech
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Fundacion Centro de Tecnologias de Interaccion Visual y Comunicaciones Vicomtech filed Critical Fundacion Centro de Tecnologias de Interaccion Visual y Comunicaciones Vicomtech
Publication of DE602008005641D1 publication Critical patent/DE602008005641D1/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0316Speech enhancement, e.g. noise reduction or echo cancellation by changing the amplitude
    • G10L21/0364Speech enhancement, e.g. noise reduction or echo cancellation by changing the amplitude for improving intelligibility
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/003Changing voice quality, e.g. pitch or formants
    • G10L21/007Changing voice quality, e.g. pitch or formants characterised by the process used
    • G10L21/013Adapting to target pitch
    • G10L2021/0135Voice conversion or morphing
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/04Time compression or expansion
    • G10L21/057Time compression or expansion for improving intelligibility
    • G10L2021/0575Aids for the handicapped in speaking

Landscapes

  • Engineering & Computer Science (AREA)
  • Human Computer Interaction (AREA)
  • Acoustics & Sound (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Computational Linguistics (AREA)
  • Physics & Mathematics (AREA)
  • Quality & Reliability (AREA)
  • Multimedia (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
  • Circuit For Audible Band Transducer (AREA)
  • Auxiliary Devices For Music (AREA)
  • Numerical Control (AREA)
  • Measurement Of Mechanical Vibrations Or Ultrasonic Waves (AREA)

Abstract

A method of converting a source speakers speech signal into a converted speech signal, which comprises a stage of training using a given database of parallel source and target data. For each pitch period modelling a glottal waveform and a vocal tract filter to obtain a set of parameters comprising an excitation strength, parameters modelling a glottal waveform, and all-pole vocal tract filter coefficients. Defining a glottal vector to be converted; defining a vocal tract vector to be converted, obtaining an estimate of a glottal aspiration noise and estimating a vocal tract transformation function. The stage of modelling comprises: modelling said aspiration noise estimate by- modulating Gaussian noise with the said modelled glottal waveform and adjusting its energy to match that of the said aspiration noise estimate. The method further comprises a stage of conversion and a stage of synthesis.
DE602008005641T 2008-09-19 2008-09-19 METHOD, DEVICE AND PROGRAM CODE FOR CONVERTING VOTES Active DE602008005641D1 (en)

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/EP2008/062502 WO2010031437A1 (en) 2008-09-19 2008-09-19 Method and system of voice conversion

Publications (1)

Publication Number Publication Date
DE602008005641D1 true DE602008005641D1 (en) 2011-04-28

Family

ID=40277465

Family Applications (1)

Application Number Title Priority Date Filing Date
DE602008005641T Active DE602008005641D1 (en) 2008-09-19 2008-09-19 METHOD, DEVICE AND PROGRAM CODE FOR CONVERTING VOTES

Country Status (5)

Country Link
EP (1) EP2215632B1 (en)
AT (1) ATE502380T1 (en)
DE (1) DE602008005641D1 (en)
ES (1) ES2364005T3 (en)
WO (1) WO2010031437A1 (en)

Families Citing this family (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101901598A (en) * 2010-06-30 2010-12-01 北京捷通华声语音技术有限公司 Humming synthesis method and system
ES2364401B2 (en) * 2011-06-27 2011-12-23 Universidad Politécnica de Madrid METHOD AND SYSTEM FOR ESTIMATING PHYSIOLOGICAL PARAMETERS OF THE FONATION.
RU2510954C2 (en) * 2012-05-18 2014-04-10 Александр Юрьевич Бредихин Method of re-sounding audio materials and apparatus for realising said method
US9607610B2 (en) 2014-07-03 2017-03-28 Google Inc. Devices and methods for noise modulation in a universal vocoder synthesizer
EP3857541B1 (en) * 2018-09-30 2023-07-19 Microsoft Technology Licensing, LLC Speech waveform generation
WO2020174356A1 (en) * 2019-02-25 2020-09-03 Technologies Of Voice Interface Ltd Speech interpretation device and system
EP3839947A1 (en) 2019-12-20 2021-06-23 SoundHound, Inc. Training a voice morphing apparatus
US11600284B2 (en) 2020-01-11 2023-03-07 Soundhound, Inc. Voice morphing apparatus having adjustable parameters
CN113780107B (en) * 2021-08-24 2024-03-01 电信科学技术第五研究所有限公司 Radio signal detection method based on deep learning dual-input network model

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR100809368B1 (en) * 2006-08-09 2008-03-05 한국과학기술원 Voice Color Conversion System using Glottal waveform

Also Published As

Publication number Publication date
EP2215632B1 (en) 2011-03-16
ES2364005T3 (en) 2011-08-22
EP2215632A1 (en) 2010-08-11
WO2010031437A1 (en) 2010-03-25
ATE502380T1 (en) 2011-04-15

Similar Documents

Publication Publication Date Title
DE602008005641D1 (en) METHOD, DEVICE AND PROGRAM CODE FOR CONVERTING VOTES
WO2011005863A3 (en) Technique for determining and reporting reduction in emissions of greenhouse gases at a site
DE602006017673D1 (en) METHOD AND DEVICE FOR VECTOR-QUANTIZING A SPEKTRALENVELOP REPRESENTATION
ATE542191T1 (en) METHOD FOR IDENTIFYING A PERSON BY HIS IRIS
ATE549670T1 (en) ELECTRONIC DEVICE AND METHOD FOR PROVIDING AN IMPROVED SLEEP OPERATION MODE
EP4246516A3 (en) Device and method for reducing quantization noise in a time-domain decoder
ATE403928T1 (en) VOICE DIALOGUE CONTROL BASED ON SIGNAL PREPROCESSING
ATE551692T1 (en) METHOD FOR REDUCING NOISE IN AN INPUT SIGNAL OF A HEARING AID AND A HEARING AID
ATE492873T1 (en) APPARATUS AND PROGRAM FOR SOUND ANALYSIS
DE602006010395D1 (en) Use of child-oriented language to automatically generate speech segmentation and a model-based speech recognition system
WO2010148141A3 (en) Apparatus and method for speech analysis
ATE524028T1 (en) METHOD FOR FINE ADJUSTMENT OF A HEARING AID AND HEARING AID
DE602005025057D1 (en) Method for parameterizing a mesh lattice e
WO2014182453A3 (en) Method and apparatus for training a voice recognition model database
JP2019101093A5 (en) Speech synthesis method, speech synthesis system and program
CN104091592A (en) Voice conversion system based on hidden Gaussian random field
ATE533146T1 (en) METHOD AND DEVICE FOR SEARCHING A BASE FREQUENCY
ATE521062T1 (en) METHOD AND DEVICE FOR MEASURING THE UNDERSTANDABILITY OF A SOUND EDITION DEVICE
CN102426834A (en) Method for testing rhythm level of spoken English
DE502007002566D1 (en) METHOD AND DEVICE FOR PRODUCING A GAS GENERATOR AND GAS GENERATOR PRODUCED BY THE METHOD
DE502007001672D1 (en) Hearing aid adaptation method
ATE456845T1 (en) LANGUAGE DIFFERENTIATION
ATE329347T1 (en) METHOD FOR MODELING AMOUNTS OF HARMONICS IN SPEECH
WO2012070866A3 (en) Speech signal encoding method and speech signal decoding method
DE102005025040B4 (en) Method and device for producing methane-rich biogas