FR2868587A1 - Method and system for the quick conversion of a voice signal (Procede et systeme de conversion rapides d'un signal vocal) - Google Patents
Info
- Publication number
- FR2868587A1
- Authority
- FR
- France
- Prior art keywords
- voice signal
- speaker
- transformation
- converted
- source
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/003—Changing voice quality, e.g. pitch or formants
- G10L21/007—Changing voice quality, e.g. pitch or formants characterised by the process used
- G10L21/013—Adapting to target pitch
- G10L2021/0135—Voice conversion or morphing
Landscapes
- Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Quality & Reliability (AREA)
- Signal Processing (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Stereophonic System (AREA)
- Circuit For Audible Band Transducer (AREA)
Abstract
This method for converting a voice signal spoken by a source speaker into a converted voice signal whose acoustic characteristics resemble those of a target speaker comprises:
- determining (1) at least one function for transforming acoustic characteristics of the source speaker into acoustic characteristics close to those of the target speaker, from voice samples of the source and target speakers; and
- transforming acoustic characteristics of the source speaker's voice signal to be converted, by applying said at least one transformation function.

It is characterized in that said transformation (2) comprises a step (44) of applying only a determined part of at least one transformation function to said signal to be converted.
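The idea of applying only part of a transformation function can be illustrated with a minimal Python sketch. The abstract does not state what form the transformation function takes or which part is selected, so everything here is an assumption: the function is modelled as a Gaussian-mixture-weighted sum of per-component linear maps with diagonal covariances (in the spirit of the GMM-based voice conversion papers listed under Non-Patent Citations), and the "determined part" is taken to be the `top_k` most probable components. The names `posteriors`, `convert_frame`, `offsets`, `slopes`, and `top_k` are hypothetical and do not come from the patent.

```python
import numpy as np


def posteriors(x, weights, means, variances):
    """Posterior probability of each mixture component given one frame x.

    Assumes diagonal covariances: means and variances have shape (m, d).
    """
    # Log-density of x under each diagonal Gaussian component.
    log_lik = -0.5 * np.sum(
        (x - means) ** 2 / variances + np.log(2 * np.pi * variances), axis=1
    )
    log_post = np.log(weights) + log_lik
    log_post -= log_post.max()  # shift for numerical stability
    post = np.exp(log_post)
    return post / post.sum()


def convert_frame(x, weights, means, variances, offsets, slopes, top_k=None):
    """Map a source frame x toward the target speaker (illustrative only).

    top_k=None applies every component of the function.
    top_k=k applies only the k most probable components (assumed "part").
    """
    post = posteriors(x, weights, means, variances)
    if top_k is not None:
        keep = np.argsort(post)[-top_k:]  # indices of the k largest posteriors
        reduced = np.zeros_like(post)
        reduced[keep] = post[keep]
        post = reduced / reduced.sum()    # renormalise over the kept components
    # Per-component linear map: offsets[i] + slopes[i] * (x - means[i]).
    mapped = offsets + slopes * (x - means)
    return post @ mapped


# Toy two-component model over 2-dimensional frames (made-up values).
weights = np.array([0.5, 0.5])
means = np.array([[0.0, 0.0], [10.0, 10.0]])
variances = np.ones((2, 2))
offsets = np.array([[1.0, 1.0], [5.0, 5.0]])
slopes = np.ones((2, 2))

frame = np.array([0.1, -0.1])
full = convert_frame(frame, weights, means, variances, offsets, slopes)
fast = convert_frame(frame, weights, means, variances, offsets, slopes, top_k=1)
print(full, fast)
```

When the components are well separated, as in this toy model, one component dominates the posterior, so the reduced evaluation gives nearly the same result as the full sum while skipping the per-component work for the others. Whether this matches the selection rule actually claimed in the patent cannot be determined from the abstract alone.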
Priority Applications (4)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
FR0403405A FR2868587A1 (fr) | 2004-03-31 | 2004-03-31 | Procede et systeme de conversion rapides d'un signal vocal |
PCT/FR2005/000607 WO2005106853A1 (fr) | 2004-03-31 | 2005-03-14 | Procede et systeme de conversion rapides d'un signal vocal |
EP05735426A EP1730728A1 (fr) | 2004-03-31 | 2005-03-14 | Procede et systeme de conversion rapides d'un signal vocal |
US10/591,599 US7792672B2 (en) | 2004-03-31 | 2005-03-14 | Method and system for the quick conversion of a voice signal |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
FR0403405A FR2868587A1 (fr) | 2004-03-31 | 2004-03-31 | Procede et systeme de conversion rapides d'un signal vocal |
Publications (1)
Publication Number | Publication Date |
---|---|
FR2868587A1 (fr) | 2005-10-07 |
Family
ID=34944345
Family Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
FR0403405A Pending FR2868587A1 (fr) | 2004-03-31 | 2004-03-31 | Procede et systeme de conversion rapides d'un signal vocal |
Country Status (4)
Country | Link |
---|---|
US (1) | US7792672B2 (fr) |
EP (1) | EP1730728A1 (fr) |
FR (1) | FR2868587A1 (fr) |
WO (1) | WO2005106853A1 (fr) |
Families Citing this family (19)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20070027687A1 (en) * | 2005-03-14 | 2007-02-01 | Voxonic, Inc. | Automatic donor ranking and selection system and method for voice conversion |
US8099282B2 (en) * | 2005-12-02 | 2012-01-17 | Asahi Kasei Kabushiki Kaisha | Voice conversion system |
US20070213987A1 (en) * | 2006-03-08 | 2007-09-13 | Voxonic, Inc. | Codebook-less speech conversion method and system |
JP4966048B2 (ja) * | 2007-02-20 | 2012-07-04 | 株式会社東芝 | 声質変換装置及び音声合成装置 |
EP1970894A1 (fr) * | 2007-03-12 | 2008-09-17 | France Télécom | Procédé et dispositif de modification d'un signal audio |
ES2796493T3 (es) * | 2008-03-20 | 2020-11-27 | Fraunhofer Ges Forschung | Aparato y método para convertir una señal de audio en una representación parametrizada, aparato y método para modificar una representación parametrizada, aparato y método para sintetizar una representación parametrizada de una señal de audio |
JP5038995B2 (ja) * | 2008-08-25 | 2012-10-03 | 株式会社東芝 | 声質変換装置及び方法、音声合成装置及び方法 |
US20110264453A1 (en) * | 2008-12-19 | 2011-10-27 | Koninklijke Philips Electronics N.V. | Method and system for adapting communications |
TWI391876B (zh) * | 2009-02-16 | 2013-04-01 | Inst Information Industry | 利用多重模組混合圖形切割之前景偵測方法、系統以及電腦程式產品 |
DE102009013020A1 (de) * | 2009-03-16 | 2010-09-23 | Hayo Becks | Vorrichtung und Verfahren zur Anpassung von Klangbildern |
US8321209B2 (en) * | 2009-11-10 | 2012-11-27 | Research In Motion Limited | System and method for low overhead frequency domain voice authentication |
JP5961950B2 (ja) * | 2010-09-15 | 2016-08-03 | ヤマハ株式会社 | 音声処理装置 |
US8620646B2 (en) * | 2011-08-08 | 2013-12-31 | The Intellisis Corporation | System and method for tracking sound pitch across an audio signal using harmonic envelope |
US9520138B2 (en) * | 2013-03-15 | 2016-12-13 | Broadcom Corporation | Adaptive modulation filtering for spectral feature enhancement |
WO2016042626A1 (fr) | 2014-09-17 | 2016-03-24 | 株式会社東芝 | Appareil de traitement de la parole, procédé de traitement de la parole, et programme |
US20190019500A1 (en) * | 2017-07-13 | 2019-01-17 | Electronics And Telecommunications Research Institute | Apparatus for deep learning based text-to-speech synthesizing by using multi-speaker data and method for the same |
US20190362737A1 (en) * | 2018-05-25 | 2019-11-28 | i2x GmbH | Modifying voice data of a conversation to achieve a desired outcome |
US11380345B2 (en) * | 2020-10-15 | 2022-07-05 | Agora Lab, Inc. | Real-time voice timbre style transform |
CN112750446B (zh) * | 2020-12-30 | 2024-05-24 | 标贝(青岛)科技有限公司 | 语音转换方法、装置和系统及存储介质 |
Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2002067245A1 (fr) * | 2001-02-16 | 2002-08-29 | Imagination Technologies Limited | Verification de haut-parleurs |
Family Cites Families (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO1993018505A1 (fr) * | 1992-03-02 | 1993-09-16 | The Walt Disney Company | Systeme de transformation vocale |
US5572624A (en) * | 1994-01-24 | 1996-11-05 | Kurzweil Applied Intelligence, Inc. | Speech recognition system accommodating different sources |
ATE277405T1 (de) * | 1997-01-27 | 2004-10-15 | Microsoft Corp | Stimmumwandlung |
US6029124A (en) * | 1997-02-21 | 2000-02-22 | Dragon Systems, Inc. | Sequential, nonparametric speech recognition and speaker identification |
US6336092B1 (en) * | 1997-04-28 | 2002-01-01 | Ivl Technologies Ltd | Targeted vocal transformation |
US6317710B1 (en) * | 1998-08-13 | 2001-11-13 | At&T Corp. | Multimedia search apparatus and method for searching multimedia content using speaker detection by audio data |
US6879952B2 (en) * | 2000-04-26 | 2005-04-12 | Microsoft Corporation | Sound source separation using convolutional mixing and a priori sound source knowledge |
US7412377B2 (en) * | 2003-12-19 | 2008-08-12 | International Business Machines Corporation | Voice model for speech processing based on ordered average ranks of spectral features |
- 2004-03-31: FR application FR0403405A, published as FR2868587A1 (status: Pending)
- 2005-03-14: EP application EP05735426A, published as EP1730728A1 (status: Withdrawn)
- 2005-03-14: WO application PCT/FR2005/000607, published as WO2005106853A1 (status: Application Discontinuation)
- 2005-03-14: US application US10/591,599, published as US7792672B2 (status: Expired - Fee Related)
Non-Patent Citations (5)
Title |
---|
BAUDOIN G ET AL: "On the transformation of the speech spectrum for voice conversion", SPOKEN LANGUAGE, 1996. ICSLP 96. PROCEEDINGS., FOURTH INTERNATIONAL CONFERENCE ON PHILADELPHIA, PA, USA 3-6 OCT. 1996, NEW YORK, NY, USA, IEEE, US, 3 October 1996 (1996-10-03), pages 1405 - 1408, XP010237945, ISBN: 0-7803-3555-4 * |
HELENCA DUXANS AND ANTONIO BONAFONTE ET AL: "Estimation of GMM in voice conversion including unaligned data", PROCEEDINGS OF THE EUROSPEECH 2003 CONFERENCE, September 2003 (2003-09-01), pages 861 - 864, XP007007125 * |
LAROCHE J ET AL: "HNM: a simple, efficient harmonic+noise model for speech", APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS, 1993. FINAL PROGRAM AND PAPER SUMMARIES., 1993 IEEE WORKSHOP ON NEW PALTZ, NY, USA 17-20 OCT. 1993, NEW YORK, NY, USA,IEEE, 17 October 1993 (1993-10-17), pages 169 - 172, XP010130052, ISBN: 0-7803-2078-6 * |
STYLIANOU Y ET AL: "STATISTICAL METHODS FOR VOICE QUALITY TRANSFORMATION", 4TH EUROPEAN CONFERENCE ON SPEECH COMMUNICATION AND TECHNOLOGY. EUROSPEECH '95. MADRID, SPAIN, SEPT. 18 - 21, 1995, EUROPEAN CONFERENCE ON SPEECH COMMUNICATION AND TECHNOLOGY. (EUROSPEECH), MADRID : GRAFICAS BRENS, ES, vol. VOL. 1 CONF. 4, 18 September 1995 (1995-09-18), pages 447 - 450, XP000854745 * |
YINING CHEN ET AL: "Voice Conversion with Smoothed GMM and MAP Adaptation", PROCEEDINGS OF THE EUROSPEECH 2003 CONFERENCE, September 2003 (2003-09-01), pages 2413 - 2416, XP007006960 * |
Also Published As
Publication number | Publication date |
---|---|
US7792672B2 (en) | 2010-09-07 |
US20070192100A1 (en) | 2007-08-16 |
EP1730728A1 (fr) | 2006-12-13 |
WO2005106853A1 (fr) | 2005-11-10 |
Similar Documents
Publication | Title | Publication Date |
---|---|---|
FR2868587A1 (fr) | Procede et systeme de conversion rapides d'un signal vocal | |
FR2868586A1 (fr) | Procede et systeme ameliores de conversion d'un signal vocal | |
DE602005001142D1 (de) | Nachrichtenübertragungsgerät | |
DE60325826D1 (de) | Audiovisuelle sprachaktivitätsdetektion für ein spracherkennungssystem | |
FR2898209B1 (fr) | Procede de debruitage d'un signal audio | |
DE602005007939D1 (de) | Verfahren und system zum automatischen bereitstellen linguistischer formulierungen, die ausserhalb ekennungssystems liegen | |
ATE336775T1 (de) | Intelligente text-sprache-umsetzung | |
Stern et al. | Signal processing for robust speech recognition | |
AU2001275991A1 (en) | System and method for voice recognition with a plurality of voice recognition engines | |
JPWO2019106517A5 (fr) | ||
CN103124165A (zh) | 自动增益控制 | |
CN103137137A (zh) | 一种会议音频中的精彩说话人发现方法 | |
KR101889465B1 (ko) | 음성인식장치와, 음성인식장치가 구비된 조명등기구와, 이를 이용한 조명시스템 | |
DE602004023134D1 (de) | Spracherkennungsverfahren und -system, das an die eigenschaften von nichtmuttersprachlern angepasst ist | |
US11270691B2 (en) | Voice interaction system, its processing method, and program therefor | |
US20210050029A1 (en) | Methods and Apparatus for Reducing Stuttering | |
CN1645363A (zh) | 便携式即时方言互译装置及其方法 | |
WO2004068893A3 (fr) | Procede et appareil d'elimination du bruit dans un systeme de reconnaissance vocale reparti | |
JP2005227512A (ja) | 音信号処理方法及びその装置、音声認識装置並びにプログラム | |
Laskowski et al. | Crosscorrelation-based multispeaker speech activity detection. | |
KR20190032557A (ko) | 음성 기반 통신 | |
Pardo et al. | Speaker diarization for multi-microphone meetings using only between-channel differences | |
Kleban et al. | HMM adaptation and microphone array processing for distant speech recognition | |
Zhang et al. | Effective segmentation based on vocal effort change point detection | |
Vacher et al. | Speech recognition in a smart home: some experiments for telemonitoring |