DE602008005641D1 - METHOD, DEVICE AND PROGRAM CODE FOR CONVERTING VOTES - Google Patents

METHOD, DEVICE AND PROGRAM CODE FOR CONVERTING VOTES

Info

Publication number
DE602008005641D1
DE602008005641D1 DE602008005641T DE602008005641T DE602008005641D1 DE 602008005641 D1 DE602008005641 D1 DE 602008005641D1 DE 602008005641 T DE602008005641 T DE 602008005641T DE 602008005641 T DE602008005641 T DE 602008005641T DE 602008005641 D1 DE602008005641 D1 DE 602008005641D1
Authority
DE
Germany
Prior art keywords
modelling
stage
glottal
vocal tract
converted
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
DE602008005641T
Other languages
German (de)
Inventor
Pozo Echezarreta Maria Arantzazu Del
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Fundacion Centro de Tecnologias de Interaccion Visual y Comunicaciones Vicomtech
Original Assignee
Fundacion Centro de Tecnologias de Interaccion Visual y Comunicaciones Vicomtech
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Fundacion Centro de Tecnologias de Interaccion Visual y Comunicaciones Vicomtech filed Critical Fundacion Centro de Tecnologias de Interaccion Visual y Comunicaciones Vicomtech
Publication of DE602008005641D1 publication Critical patent/DE602008005641D1/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0316Speech enhancement, e.g. noise reduction or echo cancellation by changing the amplitude
    • G10L21/0364Speech enhancement, e.g. noise reduction or echo cancellation by changing the amplitude for improving intelligibility
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/003Changing voice quality, e.g. pitch or formants
    • G10L21/007Changing voice quality, e.g. pitch or formants characterised by the process used
    • G10L21/013Adapting to target pitch
    • G10L2021/0135Voice conversion or morphing
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/04Time compression or expansion
    • G10L21/057Time compression or expansion for improving intelligibility
    • G10L2021/0575Aids for the handicapped in speaking

Landscapes

  • Engineering & Computer Science (AREA)
  • Human Computer Interaction (AREA)
  • Acoustics & Sound (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Computational Linguistics (AREA)
  • Physics & Mathematics (AREA)
  • Quality & Reliability (AREA)
  • Multimedia (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
  • Circuit For Audible Band Transducer (AREA)
  • Measurement Of Mechanical Vibrations Or Ultrasonic Waves (AREA)
  • Auxiliary Devices For Music (AREA)
  • Numerical Control (AREA)

Abstract

A method of converting a source speakers speech signal into a converted speech signal, which comprises a stage of training using a given database of parallel source and target data. For each pitch period modelling a glottal waveform and a vocal tract filter to obtain a set of parameters comprising an excitation strength, parameters modelling a glottal waveform, and all-pole vocal tract filter coefficients. Defining a glottal vector to be converted; defining a vocal tract vector to be converted, obtaining an estimate of a glottal aspiration noise and estimating a vocal tract transformation function. The stage of modelling comprises: modelling said aspiration noise estimate by- modulating Gaussian noise with the said modelled glottal waveform and adjusting its energy to match that of the said aspiration noise estimate. The method further comprises a stage of conversion and a stage of synthesis.
DE602008005641T 2008-09-19 2008-09-19 METHOD, DEVICE AND PROGRAM CODE FOR CONVERTING VOTES Active DE602008005641D1 (en)

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/EP2008/062502 WO2010031437A1 (en) 2008-09-19 2008-09-19 Method and system of voice conversion

Publications (1)

Publication Number Publication Date
DE602008005641D1 true DE602008005641D1 (en) 2011-04-28

Family

ID=40277465

Family Applications (1)

Application Number Title Priority Date Filing Date
DE602008005641T Active DE602008005641D1 (en) 2008-09-19 2008-09-19 METHOD, DEVICE AND PROGRAM CODE FOR CONVERTING VOTES

Country Status (5)

Country Link
EP (1) EP2215632B1 (en)
AT (1) ATE502380T1 (en)
DE (1) DE602008005641D1 (en)
ES (1) ES2364005T3 (en)
WO (1) WO2010031437A1 (en)

Families Citing this family (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101901598A (en) * 2010-06-30 2010-12-01 北京捷通华声语音技术有限公司 Humming synthesis method and system
ES2364401B2 (en) * 2011-06-27 2011-12-23 Universidad Politécnica de Madrid METHOD AND SYSTEM FOR ESTIMATING PHYSIOLOGICAL PARAMETERS OF THE FONATION.
RU2510954C2 (en) * 2012-05-18 2014-04-10 Александр Юрьевич Бредихин Method of re-sounding audio materials and apparatus for realising said method
US9607610B2 (en) 2014-07-03 2017-03-28 Google Inc. Devices and methods for noise modulation in a universal vocoder synthesizer
EP3857541B1 (en) 2018-09-30 2023-07-19 Microsoft Technology Licensing, LLC Speech waveform generation
US20220148570A1 (en) * 2019-02-25 2022-05-12 Technologies Of Voice Interface Ltd. Speech interpretation device and system
EP3839947A1 (en) 2019-12-20 2021-06-23 SoundHound, Inc. Training a voice morphing apparatus
US11600284B2 (en) 2020-01-11 2023-03-07 Soundhound, Inc. Voice morphing apparatus having adjustable parameters
CN113780107B (en) * 2021-08-24 2024-03-01 电信科学技术第五研究所有限公司 Radio signal detection method based on deep learning dual-input network model

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR100809368B1 (en) * 2006-08-09 2008-03-05 한국과학기술원 Voice conversion system using vocal cords

Also Published As

Publication number Publication date
WO2010031437A1 (en) 2010-03-25
EP2215632B1 (en) 2011-03-16
EP2215632A1 (en) 2010-08-11
ATE502380T1 (en) 2011-04-15
ES2364005T3 (en) 2011-08-22

Similar Documents

Publication Publication Date Title
DE602008005641D1 (en) METHOD, DEVICE AND PROGRAM CODE FOR CONVERTING VOTES
DE602006017673D1 (en) METHOD AND DEVICE FOR VECTOR-QUANTIZING A SPEKTRALENVELOP REPRESENTATION
ATE542191T1 (en) METHOD FOR IDENTIFYING A PERSON BY HIS IRIS
ATE549670T1 (en) ELECTRONIC DEVICE AND METHOD FOR PROVIDING AN IMPROVED SLEEP OPERATION MODE
ATE403928T1 (en) VOICE DIALOGUE CONTROL BASED ON SIGNAL PREPROCESSING
ATE492873T1 (en) APPARATUS AND PROGRAM FOR SOUND ANALYSIS
WO2010148141A3 (en) Apparatus and method for speech analysis
ATE524028T1 (en) METHOD FOR FINE ADJUSTMENT OF A HEARING AID AND HEARING AID
CN104766603A (en) Method and device for building personalized singing style spectrum synthesis model
EP2301013A4 (en) Systems and methods of improving automated speech recognition accuracy using statistical analysis of search terms
EP2214123A3 (en) Model-based comparative measure for vector sequences and word spotting using same
ATE521062T1 (en) METHOD AND DEVICE FOR MEASURING THE UNDERSTANDABILITY OF A SOUND EDITION DEVICE
JP2019101093A5 (en) Speech synthesis method, speech synthesis system and program
ATE533146T1 (en) METHOD AND DEVICE FOR SEARCHING A BASE FREQUENCY
CN102426834A (en) Method for testing rhythm level of spoken English
EP3121808A3 (en) System and method of modeling characteristics of a musical instrument
WO2010123483A3 (en) Analyzing the prosody of speech
DE602005025057D1 (en) Method for parameterizing a mesh lattice e
DE502007002566D1 (en) METHOD AND DEVICE FOR PRODUCING A GAS GENERATOR AND GAS GENERATOR PRODUCED BY THE METHOD
DE502007001672D1 (en) Hearing aid adaptation method
ATE456845T1 (en) LANGUAGE DIFFERENTIATION
ATE329347T1 (en) METHOD FOR MODELING AMOUNTS OF HARMONICS IN SPEECH
ATE540369T1 (en) REAL-TIME MEASUREMENT OF A POPULATION OF NUCLEIC ACIDS, ESPECIALLY BY PCR
WO2012070866A3 (en) Speech signal encoding method and speech signal decoding method
DE602004016533D1 (en) METHOD AND DEVICE FOR USING A MULTICHANNEL MEASUREMENT SIGNAL IN DETERMINING THE POWER DISTRIBUTION OF A SUBJECT