ATE277405T1 - Stimmumwandlung - Google Patents
StimmumwandlungInfo
- Publication number
- ATE277405T1 ATE277405T1 AT98903756T AT98903756T ATE277405T1 AT E277405 T1 ATE277405 T1 AT E277405T1 AT 98903756 T AT98903756 T AT 98903756T AT 98903756 T AT98903756 T AT 98903756T AT E277405 T1 ATE277405 T1 AT E277405T1
- Authority
- AT
- Austria
- Prior art keywords
- represented
- speech frame
- voice
- voice conversion
- average
- Prior art date
Links
- 238000006243 chemical reaction Methods 0.000 title abstract 2
- 230000005284 excitation Effects 0.000 abstract 1
- 238000013507 mapping Methods 0.000 abstract 1
- 230000003595 spectral effect Effects 0.000 abstract 1
- 230000001131 transforming effect Effects 0.000 abstract 1
- 230000001755 vocal effect Effects 0.000 abstract 1
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
- G10L13/02—Methods for producing synthetic speech; Speech synthesisers
- G10L13/033—Voice editing, e.g. manipulating the voice of the synthesiser
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L2019/0001—Codebooks
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L2019/0001—Codebooks
- G10L2019/0007—Codebook element generation
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/003—Changing voice quality, e.g. pitch or formants
- G10L21/007—Changing voice quality, e.g. pitch or formants characterised by the process used
- G10L21/013—Adapting to target pitch
- G10L2021/0135—Voice conversion or morphing
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/03—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
- G10L25/24—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being the cepstrum
Landscapes
- Engineering & Computer Science (AREA)
- Multimedia (AREA)
- Acoustics & Sound (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Quality & Reliability (AREA)
- Signal Processing (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
- Measuring Pulse, Heart Rate, Blood Pressure Or Blood Flow (AREA)
- Reduction Or Emphasis Of Bandwidth Of Signals (AREA)
- Audible-Bandwidth Dynamoelectric Transducers Other Than Pickups (AREA)
- Amplifiers (AREA)
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US3622797P | 1997-01-27 | 1997-01-27 | |
PCT/US1998/001538 WO1998035340A2 (en) | 1997-01-27 | 1998-01-27 | Voice conversion system and methodology |
Publications (1)
Publication Number | Publication Date |
---|---|
ATE277405T1 true ATE277405T1 (de) | 2004-10-15 |
Family
ID=21887401
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
AT98903756T ATE277405T1 (de) | 1997-01-27 | 1998-01-27 | Stimmumwandlung |
Country Status (6)
Country | Link |
---|---|
US (1) | US6615174B1 (de) |
EP (1) | EP0970466B1 (de) |
AT (1) | ATE277405T1 (de) |
AU (1) | AU6044298A (de) |
DE (1) | DE69826446T2 (de) |
WO (1) | WO1998035340A2 (de) |
Families Citing this family (54)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
KR100464310B1 (ko) * | 1999-03-13 | 2004-12-31 | 삼성전자주식회사 | 선 스펙트럼 쌍을 이용한 패턴 정합 방법 |
JP2001117576A (ja) | 1999-10-15 | 2001-04-27 | Pioneer Electronic Corp | 音声合成方法 |
US6973575B2 (en) * | 2001-04-05 | 2005-12-06 | International Business Machines Corporation | System and method for voice recognition password reset |
JP3709817B2 (ja) * | 2001-09-03 | 2005-10-26 | ヤマハ株式会社 | 音声合成装置、方法、及びプログラム |
JP2003248488A (ja) * | 2002-02-22 | 2003-09-05 | Ricoh Co Ltd | 情報処理システム、情報処理装置、情報処理方法、及びプログラム |
US7191134B2 (en) * | 2002-03-25 | 2007-03-13 | Nunally Patrick O'neal | Audio psychological stress indicator alteration method and apparatus |
GB0209770D0 (en) * | 2002-04-29 | 2002-06-05 | Mindweavers Ltd | Synthetic speech sound |
FR2839836B1 (fr) | 2002-05-16 | 2004-09-10 | Cit Alcatel | Terminal de telecommunication permettant de modifier la voix transmise lors d'une communication telephonique |
FR2843479B1 (fr) * | 2002-08-07 | 2004-10-22 | Smart Inf Sa | Procede de calibrage d'audio-intonation |
KR100499047B1 (ko) * | 2002-11-25 | 2005-07-04 | 한국전자통신연구원 | 서로 다른 대역폭을 갖는 켈프 방식 코덱들 간의 상호부호화 장치 및 그 방법 |
KR20040058855A (ko) * | 2002-12-27 | 2004-07-05 | 엘지전자 주식회사 | 음성 변조 장치 및 방법 |
FR2853125A1 (fr) * | 2003-03-27 | 2004-10-01 | France Telecom | Procede d'analyse d'informations de frequence fondamentale et procede et systeme de conversion de voix mettant en oeuvre un tel procede d'analyse. |
US20050123886A1 (en) * | 2003-11-26 | 2005-06-09 | Xian-Sheng Hua | Systems and methods for personalized karaoke |
US7454348B1 (en) * | 2004-01-08 | 2008-11-18 | At&T Intellectual Property Ii, L.P. | System and method for blending synthetic voices |
FR2868586A1 (fr) * | 2004-03-31 | 2005-10-07 | France Telecom | Procede et systeme ameliores de conversion d'un signal vocal |
FR2868587A1 (fr) * | 2004-03-31 | 2005-10-07 | France Telecom | Procede et systeme de conversion rapides d'un signal vocal |
DE102004048707B3 (de) * | 2004-10-06 | 2005-12-29 | Siemens Ag | Verfahren zur Stimmenkonversion für ein Sprachsynthesesystem |
US20060129399A1 (en) * | 2004-11-10 | 2006-06-15 | Voxonic, Inc. | Speech conversion system and method |
US20070027687A1 (en) * | 2005-03-14 | 2007-02-01 | Voxonic, Inc. | Automatic donor ranking and selection system and method for voice conversion |
US20060235685A1 (en) * | 2005-04-15 | 2006-10-19 | Nokia Corporation | Framework for voice conversion |
US20080161057A1 (en) * | 2005-04-15 | 2008-07-03 | Nokia Corporation | Voice conversion in ring tones and other features for a communication device |
EP1955319B1 (de) * | 2005-11-15 | 2016-04-13 | Samsung Electronics Co., Ltd. | Verfahren zum quantisieren und entquantisieren eines linear-prädiktiven kodierungskoeffizienten |
US8417185B2 (en) | 2005-12-16 | 2013-04-09 | Vocollect, Inc. | Wireless headset and method for robust voice data communication |
JP4241736B2 (ja) * | 2006-01-19 | 2009-03-18 | 株式会社東芝 | 音声処理装置及びその方法 |
US7773767B2 (en) | 2006-02-06 | 2010-08-10 | Vocollect, Inc. | Headset terminal with rear stability strap |
US7885419B2 (en) | 2006-02-06 | 2011-02-08 | Vocollect, Inc. | Headset terminal with speech functionality |
US20070213987A1 (en) * | 2006-03-08 | 2007-09-13 | Voxonic, Inc. | Codebook-less speech conversion method and system |
TWI312501B (en) * | 2006-03-13 | 2009-07-21 | Asustek Comp Inc | Audio processing system capable of comparing audio signals of different sources and method thereof |
KR100809368B1 (ko) | 2006-08-09 | 2008-03-05 | 한국과학기술원 | 성대파를 이용한 음색 변환 시스템 |
US8694318B2 (en) * | 2006-09-19 | 2014-04-08 | At&T Intellectual Property I, L. P. | Methods, systems, and products for indexing content |
US7996222B2 (en) * | 2006-09-29 | 2011-08-09 | Nokia Corporation | Prosody conversion |
US20080147385A1 (en) * | 2006-12-15 | 2008-06-19 | Nokia Corporation | Memory-efficient method for high-quality codebook based voice conversion |
JP4966048B2 (ja) * | 2007-02-20 | 2012-07-04 | 株式会社東芝 | 声質変換装置及び音声合成装置 |
US8131549B2 (en) | 2007-05-24 | 2012-03-06 | Microsoft Corporation | Personality-based device |
JP2009020291A (ja) * | 2007-07-11 | 2009-01-29 | Yamaha Corp | 音声処理装置および通信端末装置 |
WO2009022454A1 (ja) * | 2007-08-10 | 2009-02-19 | Panasonic Corporation | 音声分離装置、音声合成装置および声質変換装置 |
JP4469883B2 (ja) * | 2007-08-17 | 2010-06-02 | 株式会社東芝 | 音声合成方法及びその装置 |
US8706496B2 (en) * | 2007-09-13 | 2014-04-22 | Universitat Pompeu Fabra | Audio signal transforming by utilizing a computational cost function |
JP4445536B2 (ja) * | 2007-09-21 | 2010-04-07 | 株式会社東芝 | 移動無線端末装置、音声変換方法およびプログラム |
CN101399044B (zh) * | 2007-09-29 | 2013-09-04 | 纽奥斯通讯有限公司 | 语音转换方法和系统 |
US8131550B2 (en) * | 2007-10-04 | 2012-03-06 | Nokia Corporation | Method, apparatus and computer program product for providing improved voice conversion |
JP5038995B2 (ja) * | 2008-08-25 | 2012-10-03 | 株式会社東芝 | 声質変換装置及び方法、音声合成装置及び方法 |
USD605629S1 (en) | 2008-09-29 | 2009-12-08 | Vocollect, Inc. | Headset |
US8401849B2 (en) * | 2008-12-18 | 2013-03-19 | Lessac Technologies, Inc. | Methods employing phase state analysis for use in speech synthesis and recognition |
US8160287B2 (en) | 2009-05-22 | 2012-04-17 | Vocollect, Inc. | Headset with adjustable headband |
US8438659B2 (en) | 2009-11-05 | 2013-05-07 | Vocollect, Inc. | Portable computing device and headset interface |
US10453479B2 (en) | 2011-09-23 | 2019-10-22 | Lessac Technologies, Inc. | Methods for aligning expressive speech utterances with text and systems therefor |
RU2510954C2 (ru) * | 2012-05-18 | 2014-04-10 | Александр Юрьевич Бредихин | Способ переозвучивания аудиоматериалов и устройство для его осуществления |
GB201315142D0 (en) * | 2013-08-23 | 2013-10-09 | Ucl Business Plc | Audio-Visual Dialogue System and Method |
US9613620B2 (en) * | 2014-07-03 | 2017-04-04 | Google Inc. | Methods and systems for voice conversion |
US9659564B2 (en) * | 2014-10-24 | 2017-05-23 | Sestek Ses Ve Iletisim Bilgisayar Teknolojileri Sanayi Ticaret Anonim Sirketi | Speaker verification based on acoustic behavioral characteristics of the speaker |
DK3217399T3 (en) | 2016-03-11 | 2019-02-25 | Gn Hearing As | Kalman filtering based speech enhancement using a codebook based approach |
JP7334942B2 (ja) * | 2019-08-19 | 2023-08-29 | 国立大学法人 東京大学 | 音声変換装置、音声変換方法及び音声変換プログラム |
US11848005B2 (en) | 2022-04-28 | 2023-12-19 | Meaning.Team, Inc | Voice attribute conversion using speech to speech |
Family Cites Families (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5113449A (en) * | 1982-08-16 | 1992-05-12 | Texas Instruments Incorporated | Method and apparatus for altering voice characteristics of synthesized speech |
WO1993018505A1 (en) * | 1992-03-02 | 1993-09-16 | The Walt Disney Company | Voice transformation system |
US5793891A (en) * | 1994-07-07 | 1998-08-11 | Nippon Telegraph And Telephone Corporation | Adaptive training method for pattern recognition |
JP3536996B2 (ja) * | 1994-09-13 | 2004-06-14 | ソニー株式会社 | パラメータ変換方法及び音声合成方法 |
JPH10260692A (ja) * | 1997-03-18 | 1998-09-29 | Toshiba Corp | 音声の認識合成符号化/復号化方法及び音声符号化/復号化システム |
-
1998
- 1998-01-27 AT AT98903756T patent/ATE277405T1/de not_active IP Right Cessation
- 1998-01-27 EP EP98903756A patent/EP0970466B1/de not_active Expired - Lifetime
- 1998-01-27 WO PCT/US1998/001538 patent/WO1998035340A2/en active IP Right Grant
- 1998-01-27 DE DE69826446T patent/DE69826446T2/de not_active Expired - Lifetime
- 1998-01-27 AU AU60442/98A patent/AU6044298A/en not_active Abandoned
- 1998-01-27 US US09/355,267 patent/US6615174B1/en not_active Expired - Fee Related
Also Published As
Publication number | Publication date |
---|---|
EP0970466A2 (de) | 2000-01-12 |
US6615174B1 (en) | 2003-09-02 |
DE69826446T2 (de) | 2005-01-20 |
DE69826446D1 (de) | 2004-10-28 |
WO1998035340A2 (en) | 1998-08-13 |
EP0970466A4 (de) | 2000-05-31 |
AU6044298A (en) | 1998-08-26 |
WO1998035340A3 (en) | 1998-11-19 |
EP0970466B1 (de) | 2004-09-22 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
ATE277405T1 (de) | Stimmumwandlung | |
Yoshimura et al. | Mixed excitation for HMM-based speech synthesis. | |
CA2323421A1 (en) | Face synthesis system and methodology | |
Stylianou et al. | Continuous probabilistic transform for voice conversion | |
AU725140B2 (en) | Speech encoding method and apparatus and speech decoding method and apparatus | |
JP3446764B2 (ja) | 音声合成システム及び音声合成サーバ | |
MX9703138A (es) | Reconocimiento de lenguaje. | |
TW487902B (en) | Method and apparatus for mandarin Chinese speech recognition by using initial/final phoneme similarity vector | |
CN101901598A (zh) | 一种哼唱合成方法和系统 | |
So et al. | Efficient product code vector quantisation using the switched split vector quantiser | |
Akinbo | Representation of Yorùbá Tones by a Talking Drum. An Acoustic Analysis | |
Matsumoto et al. | Evaluation of Mel-LPC cepstrum in a large vocabulary continuous speech recognition | |
EP1672619A3 (de) | Vorrichtung und Verfahren zur Sprachkodierung | |
CN105765653A (zh) | 自适应高通后滤波器 | |
Dehé et al. | Voice quality and speaking rate in Icelandic rhetorical questions | |
JP2001034280A (ja) | 電子メール受信装置および電子メールシステム | |
KR940002437B1 (ko) | 음성 인식방법 및 장치 | |
Benesty et al. | Introduction to speech processing | |
Mizuno et al. | Voice conversion based on piecewise linear conversion rules of formant frequency and spectrum tilt | |
Koolagudi et al. | Spectral features for emotion classification | |
Mathew et al. | Analysis of LD-CELP coder output with Sound eXchange and Praat software | |
KR100269357B1 (ko) | 음성 인식 방법 | |
Kim et al. | On a speech multiple system implementation for speech synthesis | |
Lee et al. | Enhancement of hearing-impaired Mandarin speech. | |
KR960038733A (ko) | 임의의 화자음성을 타화자의 억양으로 변환하는 합성방법 및 컴퓨터실현장치 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
RER | Ceased as to paragraph 5 lit. 3 law introducing patent treaties |