ATE277405T1 - Stimmumwandlung - Google Patents

Stimmumwandlung

Info

Publication number
ATE277405T1
ATE277405T1 AT98903756T AT98903756T ATE277405T1 AT E277405 T1 ATE277405 T1 AT E277405T1 AT 98903756 T AT98903756 T AT 98903756T AT 98903756 T AT98903756 T AT 98903756T AT E277405 T1 ATE277405 T1 AT E277405T1
Authority
AT
Austria
Prior art keywords
represented
speech frame
voice
voice conversion
average
Prior art date
Application number
AT98903756T
Other languages
English (en)
Inventor
Levent M Arslan
David Talkin
Original Assignee
Microsoft Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Microsoft Corp filed Critical Microsoft Corp
Application granted granted Critical
Publication of ATE277405T1 publication Critical patent/ATE277405T1/de

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L13/00Speech synthesis; Text to speech systems
    • G10L13/02Methods for producing synthetic speech; Speech synthesisers
    • G10L13/033Voice editing, e.g. manipulating the voice of the synthesiser
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L2019/0001Codebooks
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L2019/0001Codebooks
    • G10L2019/0007Codebook element generation
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/003Changing voice quality, e.g. pitch or formants
    • G10L21/007Changing voice quality, e.g. pitch or formants characterised by the process used
    • G10L21/013Adapting to target pitch
    • G10L2021/0135Voice conversion or morphing
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/03Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
    • G10L25/24Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being the cepstrum

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Acoustics & Sound (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Quality & Reliability (AREA)
  • Signal Processing (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
  • Measuring Pulse, Heart Rate, Blood Pressure Or Blood Flow (AREA)
  • Reduction Or Emphasis Of Bandwidth Of Signals (AREA)
  • Audible-Bandwidth Dynamoelectric Transducers Other Than Pickups (AREA)
  • Amplifiers (AREA)
AT98903756T 1997-01-27 1998-01-27 Stimmumwandlung ATE277405T1 (de)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US3622797P 1997-01-27 1997-01-27
PCT/US1998/001538 WO1998035340A2 (en) 1997-01-27 1998-01-27 Voice conversion system and methodology

Publications (1)

Publication Number Publication Date
ATE277405T1 true ATE277405T1 (de) 2004-10-15

Family

ID=21887401

Family Applications (1)

Application Number Title Priority Date Filing Date
AT98903756T ATE277405T1 (de) 1997-01-27 1998-01-27 Stimmumwandlung

Country Status (6)

Country Link
US (1) US6615174B1 (de)
EP (1) EP0970466B1 (de)
AT (1) ATE277405T1 (de)
AU (1) AU6044298A (de)
DE (1) DE69826446T2 (de)
WO (1) WO1998035340A2 (de)

Families Citing this family (54)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR100464310B1 (ko) * 1999-03-13 2004-12-31 삼성전자주식회사 선 스펙트럼 쌍을 이용한 패턴 정합 방법
JP2001117576A (ja) 1999-10-15 2001-04-27 Pioneer Electronic Corp 音声合成方法
US6973575B2 (en) * 2001-04-05 2005-12-06 International Business Machines Corporation System and method for voice recognition password reset
JP3709817B2 (ja) * 2001-09-03 2005-10-26 ヤマハ株式会社 音声合成装置、方法、及びプログラム
JP2003248488A (ja) * 2002-02-22 2003-09-05 Ricoh Co Ltd 情報処理システム、情報処理装置、情報処理方法、及びプログラム
US7191134B2 (en) * 2002-03-25 2007-03-13 Nunally Patrick O'neal Audio psychological stress indicator alteration method and apparatus
GB0209770D0 (en) * 2002-04-29 2002-06-05 Mindweavers Ltd Synthetic speech sound
FR2839836B1 (fr) 2002-05-16 2004-09-10 Cit Alcatel Terminal de telecommunication permettant de modifier la voix transmise lors d'une communication telephonique
FR2843479B1 (fr) * 2002-08-07 2004-10-22 Smart Inf Sa Procede de calibrage d'audio-intonation
KR100499047B1 (ko) * 2002-11-25 2005-07-04 한국전자통신연구원 서로 다른 대역폭을 갖는 켈프 방식 코덱들 간의 상호부호화 장치 및 그 방법
KR20040058855A (ko) * 2002-12-27 2004-07-05 엘지전자 주식회사 음성 변조 장치 및 방법
FR2853125A1 (fr) * 2003-03-27 2004-10-01 France Telecom Procede d'analyse d'informations de frequence fondamentale et procede et systeme de conversion de voix mettant en oeuvre un tel procede d'analyse.
US20050123886A1 (en) * 2003-11-26 2005-06-09 Xian-Sheng Hua Systems and methods for personalized karaoke
US7454348B1 (en) * 2004-01-08 2008-11-18 At&T Intellectual Property Ii, L.P. System and method for blending synthetic voices
FR2868586A1 (fr) * 2004-03-31 2005-10-07 France Telecom Procede et systeme ameliores de conversion d'un signal vocal
FR2868587A1 (fr) * 2004-03-31 2005-10-07 France Telecom Procede et systeme de conversion rapides d'un signal vocal
DE102004048707B3 (de) * 2004-10-06 2005-12-29 Siemens Ag Verfahren zur Stimmenkonversion für ein Sprachsynthesesystem
US20060129399A1 (en) * 2004-11-10 2006-06-15 Voxonic, Inc. Speech conversion system and method
US20070027687A1 (en) * 2005-03-14 2007-02-01 Voxonic, Inc. Automatic donor ranking and selection system and method for voice conversion
US20060235685A1 (en) * 2005-04-15 2006-10-19 Nokia Corporation Framework for voice conversion
US20080161057A1 (en) * 2005-04-15 2008-07-03 Nokia Corporation Voice conversion in ring tones and other features for a communication device
EP1955319B1 (de) * 2005-11-15 2016-04-13 Samsung Electronics Co., Ltd. Verfahren zum quantisieren und entquantisieren eines linear-prädiktiven kodierungskoeffizienten
US8417185B2 (en) 2005-12-16 2013-04-09 Vocollect, Inc. Wireless headset and method for robust voice data communication
JP4241736B2 (ja) * 2006-01-19 2009-03-18 株式会社東芝 音声処理装置及びその方法
US7773767B2 (en) 2006-02-06 2010-08-10 Vocollect, Inc. Headset terminal with rear stability strap
US7885419B2 (en) 2006-02-06 2011-02-08 Vocollect, Inc. Headset terminal with speech functionality
US20070213987A1 (en) * 2006-03-08 2007-09-13 Voxonic, Inc. Codebook-less speech conversion method and system
TWI312501B (en) * 2006-03-13 2009-07-21 Asustek Comp Inc Audio processing system capable of comparing audio signals of different sources and method thereof
KR100809368B1 (ko) 2006-08-09 2008-03-05 한국과학기술원 성대파를 이용한 음색 변환 시스템
US8694318B2 (en) * 2006-09-19 2014-04-08 At&T Intellectual Property I, L. P. Methods, systems, and products for indexing content
US7996222B2 (en) * 2006-09-29 2011-08-09 Nokia Corporation Prosody conversion
US20080147385A1 (en) * 2006-12-15 2008-06-19 Nokia Corporation Memory-efficient method for high-quality codebook based voice conversion
JP4966048B2 (ja) * 2007-02-20 2012-07-04 株式会社東芝 声質変換装置及び音声合成装置
US8131549B2 (en) 2007-05-24 2012-03-06 Microsoft Corporation Personality-based device
JP2009020291A (ja) * 2007-07-11 2009-01-29 Yamaha Corp 音声処理装置および通信端末装置
WO2009022454A1 (ja) * 2007-08-10 2009-02-19 Panasonic Corporation 音声分離装置、音声合成装置および声質変換装置
JP4469883B2 (ja) * 2007-08-17 2010-06-02 株式会社東芝 音声合成方法及びその装置
US8706496B2 (en) * 2007-09-13 2014-04-22 Universitat Pompeu Fabra Audio signal transforming by utilizing a computational cost function
JP4445536B2 (ja) * 2007-09-21 2010-04-07 株式会社東芝 移動無線端末装置、音声変換方法およびプログラム
CN101399044B (zh) * 2007-09-29 2013-09-04 纽奥斯通讯有限公司 语音转换方法和系统
US8131550B2 (en) * 2007-10-04 2012-03-06 Nokia Corporation Method, apparatus and computer program product for providing improved voice conversion
JP5038995B2 (ja) * 2008-08-25 2012-10-03 株式会社東芝 声質変換装置及び方法、音声合成装置及び方法
USD605629S1 (en) 2008-09-29 2009-12-08 Vocollect, Inc. Headset
US8401849B2 (en) * 2008-12-18 2013-03-19 Lessac Technologies, Inc. Methods employing phase state analysis for use in speech synthesis and recognition
US8160287B2 (en) 2009-05-22 2012-04-17 Vocollect, Inc. Headset with adjustable headband
US8438659B2 (en) 2009-11-05 2013-05-07 Vocollect, Inc. Portable computing device and headset interface
US10453479B2 (en) 2011-09-23 2019-10-22 Lessac Technologies, Inc. Methods for aligning expressive speech utterances with text and systems therefor
RU2510954C2 (ru) * 2012-05-18 2014-04-10 Александр Юрьевич Бредихин Способ переозвучивания аудиоматериалов и устройство для его осуществления
GB201315142D0 (en) * 2013-08-23 2013-10-09 Ucl Business Plc Audio-Visual Dialogue System and Method
US9613620B2 (en) * 2014-07-03 2017-04-04 Google Inc. Methods and systems for voice conversion
US9659564B2 (en) * 2014-10-24 2017-05-23 Sestek Ses Ve Iletisim Bilgisayar Teknolojileri Sanayi Ticaret Anonim Sirketi Speaker verification based on acoustic behavioral characteristics of the speaker
DK3217399T3 (en) 2016-03-11 2019-02-25 Gn Hearing As Kalman filtering based speech enhancement using a codebook based approach
JP7334942B2 (ja) * 2019-08-19 2023-08-29 国立大学法人 東京大学 音声変換装置、音声変換方法及び音声変換プログラム
US11848005B2 (en) 2022-04-28 2023-12-19 Meaning.Team, Inc Voice attribute conversion using speech to speech

Family Cites Families (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5113449A (en) * 1982-08-16 1992-05-12 Texas Instruments Incorporated Method and apparatus for altering voice characteristics of synthesized speech
WO1993018505A1 (en) * 1992-03-02 1993-09-16 The Walt Disney Company Voice transformation system
US5793891A (en) * 1994-07-07 1998-08-11 Nippon Telegraph And Telephone Corporation Adaptive training method for pattern recognition
JP3536996B2 (ja) * 1994-09-13 2004-06-14 ソニー株式会社 パラメータ変換方法及び音声合成方法
JPH10260692A (ja) * 1997-03-18 1998-09-29 Toshiba Corp 音声の認識合成符号化/復号化方法及び音声符号化/復号化システム

Also Published As

Publication number Publication date
EP0970466A2 (de) 2000-01-12
US6615174B1 (en) 2003-09-02
DE69826446T2 (de) 2005-01-20
DE69826446D1 (de) 2004-10-28
WO1998035340A2 (en) 1998-08-13
EP0970466A4 (de) 2000-05-31
AU6044298A (en) 1998-08-26
WO1998035340A3 (en) 1998-11-19
EP0970466B1 (de) 2004-09-22

Similar Documents

Publication Publication Date Title
ATE277405T1 (de) Stimmumwandlung
Yoshimura et al. Mixed excitation for HMM-based speech synthesis.
CA2323421A1 (en) Face synthesis system and methodology
Stylianou et al. Continuous probabilistic transform for voice conversion
AU725140B2 (en) Speech encoding method and apparatus and speech decoding method and apparatus
JP3446764B2 (ja) 音声合成システム及び音声合成サーバ
MX9703138A (es) Reconocimiento de lenguaje.
TW487902B (en) Method and apparatus for mandarin Chinese speech recognition by using initial/final phoneme similarity vector
CN101901598A (zh) 一种哼唱合成方法和系统
So et al. Efficient product code vector quantisation using the switched split vector quantiser
Akinbo Representation of Yorùbá Tones by a Talking Drum. An Acoustic Analysis
Matsumoto et al. Evaluation of Mel-LPC cepstrum in a large vocabulary continuous speech recognition
EP1672619A3 (de) Vorrichtung und Verfahren zur Sprachkodierung
CN105765653A (zh) 自适应高通后滤波器
Dehé et al. Voice quality and speaking rate in Icelandic rhetorical questions
JP2001034280A (ja) 電子メール受信装置および電子メールシステム
KR940002437B1 (ko) 음성 인식방법 및 장치
Benesty et al. Introduction to speech processing
Mizuno et al. Voice conversion based on piecewise linear conversion rules of formant frequency and spectrum tilt
Koolagudi et al. Spectral features for emotion classification
Mathew et al. Analysis of LD-CELP coder output with Sound eXchange and Praat software
KR100269357B1 (ko) 음성 인식 방법
Kim et al. On a speech multiple system implementation for speech synthesis
Lee et al. Enhancement of hearing-impaired Mandarin speech.
KR960038733A (ko) 임의의 화자음성을 타화자의 억양으로 변환하는 합성방법 및 컴퓨터실현장치

Legal Events

Date Code Title Description
RER Ceased as to paragraph 5 lit. 3 law introducing patent treaties