FR2868587A1 - Procede et systeme de conversion rapides d'un signal vocal - Google Patents

Procede et systeme de conversion rapides d'un signal vocal

Info

Publication number
FR2868587A1
FR2868587A1 FR0403405A FR0403405A FR2868587A1 FR 2868587 A1 FR2868587 A1 FR 2868587A1 FR 0403405 A FR0403405 A FR 0403405A FR 0403405 A FR0403405 A FR 0403405A FR 2868587 A1 FR2868587 A1 FR 2868587A1
Authority
FR
France
Prior art keywords
voice signal
speaker
transformation
converted
source
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
FR0403405A
Other languages
English (en)
Inventor
Olivier Rosec
Najjary Taoufik En
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Orange SA
Original Assignee
France Telecom SA
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by France Telecom SA filed Critical France Telecom SA
Priority to FR0403405A priority Critical patent/FR2868587A1/fr
Priority to PCT/FR2005/000607 priority patent/WO2005106853A1/fr
Priority to US10/591,599 priority patent/US7792672B2/en
Priority to EP05735426A priority patent/EP1730728A1/fr
Publication of FR2868587A1 publication Critical patent/FR2868587A1/fr
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/003Changing voice quality, e.g. pitch or formants
    • G10L21/007Changing voice quality, e.g. pitch or formants characterised by the process used
    • G10L21/013Adapting to target pitch
    • G10L2021/0135Voice conversion or morphing

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Quality & Reliability (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Stereophonic System (AREA)
  • Circuit For Audible Band Transducer (AREA)

Abstract

Ce procédé de conversion d'un signal vocal prononcé par un locuteur source en un signal vocal converti dont les caractéristiques acoustiques ressemblent à celles d'un locuteur cible, comprend :- la détermination (1) d'au moins une fonction de transformation de caractéristiques acoustiques du locuteur source en caractéristiques acoustiques proches de celles du locuteur cible, à partir d'échantillons vocaux des locuteurs source et cible ; et- la transformation de caractéristiques acoustiques du signal vocal à convertir du locuteur source, par l'application de ladite au moins une fonction de transformation.Il est caractérisé en ce que ladite transformation (2) comprend une étape (44) d'application uniquement d'une partie déterminée d'au moins une fonction de transformation sur ledit signal à convertir.
FR0403405A 2004-03-31 2004-03-31 Procede et systeme de conversion rapides d'un signal vocal Pending FR2868587A1 (fr)

Priority Applications (4)

Application Number Priority Date Filing Date Title
FR0403405A FR2868587A1 (fr) 2004-03-31 2004-03-31 Procede et systeme de conversion rapides d'un signal vocal
PCT/FR2005/000607 WO2005106853A1 (fr) 2004-03-31 2005-03-14 Procede et systeme de conversion rapides d'un signal vocal
US10/591,599 US7792672B2 (en) 2004-03-31 2005-03-14 Method and system for the quick conversion of a voice signal
EP05735426A EP1730728A1 (fr) 2004-03-31 2005-03-14 Procede et systeme de conversion rapides d'un signal vocal

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
FR0403405A FR2868587A1 (fr) 2004-03-31 2004-03-31 Procede et systeme de conversion rapides d'un signal vocal

Publications (1)

Publication Number Publication Date
FR2868587A1 true FR2868587A1 (fr) 2005-10-07

Family

ID=34944345

Family Applications (1)

Application Number Title Priority Date Filing Date
FR0403405A Pending FR2868587A1 (fr) 2004-03-31 2004-03-31 Procede et systeme de conversion rapides d'un signal vocal

Country Status (4)

Country Link
US (1) US7792672B2 (fr)
EP (1) EP1730728A1 (fr)
FR (1) FR2868587A1 (fr)
WO (1) WO2005106853A1 (fr)

Families Citing this family (19)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101375329A (zh) * 2005-03-14 2009-02-25 沃克索尼克股份有限公司 用于语音转换的自动施主分级和选择系统及方法
JP4928465B2 (ja) * 2005-12-02 2012-05-09 旭化成株式会社 声質変換システム
US20070213987A1 (en) * 2006-03-08 2007-09-13 Voxonic, Inc. Codebook-less speech conversion method and system
JP4966048B2 (ja) * 2007-02-20 2012-07-04 株式会社東芝 声質変換装置及び音声合成装置
EP1970894A1 (fr) * 2007-03-12 2008-09-17 France Télécom Procédé et dispositif de modification d'un signal audio
EP3273442B1 (fr) * 2008-03-20 2021-10-20 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Appareil et procédé pour synthétiser une représentation paramétrée d'un signal audio
JP5038995B2 (ja) * 2008-08-25 2012-10-03 株式会社東芝 声質変換装置及び方法、音声合成装置及び方法
CN102257566A (zh) * 2008-12-19 2011-11-23 皇家飞利浦电子股份有限公司 用于适配通信的方法和系统
TWI391876B (zh) * 2009-02-16 2013-04-01 Inst Information Industry 利用多重模組混合圖形切割之前景偵測方法、系統以及電腦程式產品
DE102009013020A1 (de) * 2009-03-16 2010-09-23 Hayo Becks Vorrichtung und Verfahren zur Anpassung von Klangbildern
US8321209B2 (en) * 2009-11-10 2012-11-27 Research In Motion Limited System and method for low overhead frequency domain voice authentication
JP5961950B2 (ja) * 2010-09-15 2016-08-03 ヤマハ株式会社 音声処理装置
US8620646B2 (en) * 2011-08-08 2013-12-31 The Intellisis Corporation System and method for tracking sound pitch across an audio signal using harmonic envelope
US9520138B2 (en) * 2013-03-15 2016-12-13 Broadcom Corporation Adaptive modulation filtering for spectral feature enhancement
WO2016042626A1 (fr) 2014-09-17 2016-03-24 株式会社東芝 Appareil de traitement de la parole, procédé de traitement de la parole, et programme
US20190019500A1 (en) * 2017-07-13 2019-01-17 Electronics And Telecommunications Research Institute Apparatus for deep learning based text-to-speech synthesizing by using multi-speaker data and method for the same
US20190362737A1 (en) * 2018-05-25 2019-11-28 i2x GmbH Modifying voice data of a conversation to achieve a desired outcome
US11380345B2 (en) * 2020-10-15 2022-07-05 Agora Lab, Inc. Real-time voice timbre style transform
CN112750446B (zh) * 2020-12-30 2024-05-24 标贝(青岛)科技有限公司 语音转换方法、装置和系统及存储介质

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2002067245A1 (fr) * 2001-02-16 2002-08-29 Imagination Technologies Limited Verification de haut-parleurs

Family Cites Families (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO1993018505A1 (fr) * 1992-03-02 1993-09-16 The Walt Disney Company Systeme de transformation vocale
US5572624A (en) * 1994-01-24 1996-11-05 Kurzweil Applied Intelligence, Inc. Speech recognition system accommodating different sources
ATE277405T1 (de) * 1997-01-27 2004-10-15 Microsoft Corp Stimmumwandlung
US6029124A (en) * 1997-02-21 2000-02-22 Dragon Systems, Inc. Sequential, nonparametric speech recognition and speaker identification
US6336092B1 (en) * 1997-04-28 2002-01-01 Ivl Technologies Ltd Targeted vocal transformation
US6317710B1 (en) * 1998-08-13 2001-11-13 At&T Corp. Multimedia search apparatus and method for searching multimedia content using speaker detection by audio data
US6879952B2 (en) * 2000-04-26 2005-04-12 Microsoft Corporation Sound source separation using convolutional mixing and a priori sound source knowledge
US7412377B2 (en) * 2003-12-19 2008-08-12 International Business Machines Corporation Voice model for speech processing based on ordered average ranks of spectral features

Patent Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2002067245A1 (fr) * 2001-02-16 2002-08-29 Imagination Technologies Limited Verification de haut-parleurs

Non-Patent Citations (5)

* Cited by examiner, † Cited by third party
Title
BANDOIN G ET AL: "On the transformation of the speech spectrum for voice conversion", SPOKEN LANGUAGE, 1996. ICSLP 96. PROCEEDINGS., FOURTH INTERNATIONAL CONFERENCE ON PHILADELPHIA, PA, USA 3-6 OCT. 1996, NEW YORK, NY, USA,IEEE, US, 3 October 1996 (1996-10-03), pages 1405 - 1408, XP010237945, ISBN: 0-7803-3555-4 *
HELENCA DUXANS AND ANTONIO BONAFONTE ET AL: "Estimation of GMM in voice conversion including unaligned data", PROCEEDINGS OF THE EUROSPEECH 2003 CONFERENCE, September 2003 (2003-09-01), pages 861 - 864, XP007007125 *
LAROCHE J ET AL: "HNM: a simple, efficient harmonic+noise model for speech", APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS, 1993. FINAL PROGRAM AND PAPER SUMMARIES., 1993 IEEE WORKSHOP ON NEW PALTZ, NY, USA 17-20 OCT. 1993, NEW YORK, NY, USA,IEEE, 17 October 1993 (1993-10-17), pages 169 - 172, XP010130052, ISBN: 0-7803-2078-6 *
STYLIANOU Y ET AL: "STATISTICAL METHODS FOR VOICE QUALITY TRANSFORMATION", 4TH EUROPEAN CONFERENCE ON SPEECH COMMUNICATION AND TECHNOLOGY. EUROSPEECH '95. MADRID, SPAIN, SEPT. 18 - 21, 1995, EUROPEAN CONFERENCE ON SPEECH COMMUNICATION AND TECHNOLOGY. (EUROSPEECH), MADRID : GRAFICAS BRENS, ES, vol. VOL. 1 CONF. 4, 18 September 1995 (1995-09-18), pages 447 - 450, XP000854745 *
YINING CHEN1 ET AL: "Voice Conversion with Smoothed GMM and MAP Adaptation", PROCEEDINGS OF THE EUROSPEECH 2003 CONFERENCE, September 2003 (2003-09-01), pages 2413 - 2416, XP007006960 *

Also Published As

Publication number Publication date
US20070192100A1 (en) 2007-08-16
EP1730728A1 (fr) 2006-12-13
US7792672B2 (en) 2010-09-07
WO2005106853A1 (fr) 2005-11-10

Similar Documents

Publication Publication Date Title
FR2868587A1 (fr) Procede et systeme de conversion rapides d'un signal vocal
FR2868586A1 (fr) Procede et systeme ameliores de conversion d'un signal vocal
EP2492912B1 (fr) Appareil de traitement du son, procédé de traitement du son et prothèse auditive
DE60325826D1 (de) Audiovisuelle sprachaktivitätsdetektion für ein spracherkennungssystem
DE602005001142D1 (de) Nachrichtenübertragungsgerät
FR2898209B1 (fr) Procede de debruitage d'un signal audio
ATE336775T1 (de) Intelligente text-sprache-umsetzung
Stern et al. Signal processing for robust speech recognition
AU2001275991A1 (en) System and method for voice recognition with a plurality of voice recognition engines
JPWO2019106517A5 (fr)
CN103124165A (zh) 自动增益控制
ATE400047T1 (de) Verfahren und system zum automatischen bereitstellen linguistischer formulierungen, die ausserhalb einer erkennungsdomäne eines automatischen spracherkennungssystems liegen
KR101889465B1 (ko) 음성인식장치와, 음성인식장치가 구비된 조명등기구와, 이를 이용한 조명시스템
US11270691B2 (en) Voice interaction system, its processing method, and program therefor
DE602004023134D1 (de) Spracherkennungsverfahren und -system, das an die eigenschaften von nichtmuttersprachlern angepasst ist
US20210050029A1 (en) Methods and Apparatus for Reducing Stuttering
CN1645363A (zh) 便携式即时方言互译装置及其方法
WO2004068893A3 (fr) Procede et appareil d'elimination du bruit dans un systeme de reconnaissance vocale reparti
JP2005227512A (ja) 音信号処理方法及びその装置、音声認識装置並びにプログラム
Laskowski et al. Crosscorrelation-based multispeaker speech activity detection.
KR20190032557A (ko) 음성 기반 통신
Cook et al. Transcription of broadcast television and radio news: The 1996 ABBOT system
Kleban et al. HMM adaptation and microphone array processing for distant speech recognition
Vacher et al. Speech recognition in a smart home: some experiments for telemonitoring
Kingsbury et al. Toward domain-independent conversational speech recognition.