FR2853125A1 - Method for analyzing fundamental frequency information and voice conversion method and system implementing said analysis method
- Publication number
- FR2853125A1 (application FR0303790A)
- Authority
- FR
- France
- Prior art keywords
- fundamental frequency
- samples
- spectrum
- determining
- information
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
| ID | Concept | Category | Sections | Count |
|---|---|---|---|---|
| 238000000034 | method | Methods | title, claims, abstract, description | 54 |
| 238000004458 | analytical method | Methods | title, claims, description | 37 |
| 238000006243 | chemical reaction | Methods | title, claims, description | 15 |
| 238000001228 | spectrum | Methods | claims, abstract, description | 56 |
| 230000003595 | spectral effect | Effects | claims, description | 47 |
| 230000009466 | transformation | Effects | claims, description | 30 |
| 239000000203 | mixture | Substances | claims, description | 10 |
| 230000001131 | transforming effect | Effects | claims, description | 9 |
| 230000015572 | biosynthetic process | Effects | claims, description | 7 |
| 238000003786 | synthesis reaction | Methods | claims, description | 7 |
| 230000001755 | vocal effect | Effects | claims, description | 7 |
| 239000000523 | sample | Substances | claims, description | 4 |
| 230000001360 | synchronised effect | Effects | claims, description | 4 |
| 238000007476 | Maximum Likelihood | Methods | claims, description | 3 |
| 238000004364 | calculation method | Methods | claims, description | 3 |
| 238000010606 | normalization | Methods | claims, description | 3 |
| 238000012512 | characterization method | Methods | claims, description | 2 |
| 230000006870 | function | Effects | description | 47 |
| 239000013598 | vectors | Substances | description | 7 |
| 238000004422 | calculation algorithm | Methods | description | 4 |
| 230000004048 | modification | Effects | description | 4 |
| 238000012986 | modification | Methods | description | 4 |
| 238000000354 | decomposition reaction | Methods | description | 3 |
| 238000010586 | diagram | Methods | description | 3 |
| 239000011159 | matrix material | Substances | description | 2 |
| 230000000737 | periodic effect | Effects | description | 2 |
| 238000013139 | quantization | Methods | description | 2 |
| 210000001260 | vocal cord | Anatomy | description | 2 |
| 238000004590 | computer program | Methods | description | 1 |
| 230000005284 | excitation | Effects | description | 1 |
| 238000004519 | manufacturing process | Methods | description | 1 |
| 239000000463 | material | Substances | description | 1 |
| 238000009877 | rendering | Methods | description | 1 |
| 230000005236 | sound signal | Effects | description | 1 |
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/90—Pitch determination of speech signals
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/003—Changing voice quality, e.g. pitch or formants
- G10L21/007—Changing voice quality, e.g. pitch or formants characterised by the process used
- G10L21/013—Adapting to target pitch
- G10L2021/0135—Voice conversion or morphing
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/03—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
- G10L25/24—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being the cepstrum
Landscapes
- Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Signal Processing (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
- Machine Translation (AREA)
Priority Applications (8)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| FR0303790A FR2853125A1 (fr) | 2003-03-27 | 2003-03-27 | Method for analyzing fundamental frequency information and voice conversion method and system implementing said analysis method |
| US10/551,224 US7643988B2 (en) | 2003-03-27 | 2004-03-02 | Method for analyzing fundamental frequency information and voice conversion method and system implementing said analysis method |
| JP2006505682A JP4382808B2 (ja) | 2003-03-27 | 2004-03-02 | Method for analyzing fundamental frequency information, and voice conversion method and system implementing this analysis method |
| CN200480014488.8A CN100583235C (zh) | 2003-03-27 | 2004-03-02 | Method for analyzing fundamental frequency information, and voice conversion method and system implementing said analysis method |
| EP04716265A EP1606792B1 (fr) | 2003-03-27 | 2004-03-02 | Method for analyzing fundamental frequency information and voice conversion method and system implementing said analysis method |
| PCT/FR2004/000483 WO2004088633A1 (fr) | 2003-03-27 | 2004-03-02 | Method for analyzing fundamental frequency information and voice conversion method and system implementing said analysis method |
| DE602004013747T DE602004013747D1 (de) | 2003-03-27 | 2004-03-02 | Method for analyzing the fundamental frequency, and method and device for voice conversion using the same |
| AT04716265T ATE395684T1 (de) | 2003-03-27 | 2004-03-02 | Method for analyzing the fundamental frequency, and method and device for voice conversion using the same |
Applications Claiming Priority (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| FR0303790A FR2853125A1 (fr) | 2003-03-27 | 2003-03-27 | Method for analyzing fundamental frequency information and voice conversion method and system implementing said analysis method |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| FR2853125A1 (fr) | 2004-10-01 |
Family
ID=32947218
Family Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| FR0303790A Pending FR2853125A1 (fr) | 2003-03-27 | 2003-03-27 | Method for analyzing fundamental frequency information and voice conversion method and system implementing said analysis method |
Country Status (8)
| Country | Link |
|---|---|
| US (1) | US7643988B2 |
| EP (1) | EP1606792B1 |
| JP (1) | JP4382808B2 |
| CN (1) | CN100583235C |
| AT (1) | ATE395684T1 |
| DE (1) | DE602004013747D1 |
| FR (1) | FR2853125A1 |
| WO (1) | WO2004088633A1 |
Families Citing this family (17)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| JP4241736B2 (ja) * | 2006-01-19 | 2009-03-18 | 株式会社東芝 | Speech processing apparatus and method thereof |
| CN101064104B (zh) * | 2006-04-24 | 2011-02-02 | 中国科学院自动化研究所 | Method for generating emotional speech based on voice conversion |
| US20080167862A1 (en) * | 2007-01-09 | 2008-07-10 | Melodis Corporation | Pitch Dependent Speech Recognition Engine |
| JP4966048B2 (ja) * | 2007-02-20 | 2012-07-04 | 株式会社東芝 | Voice quality conversion apparatus and speech synthesis apparatus |
| US8131550B2 (en) * | 2007-10-04 | 2012-03-06 | Nokia Corporation | Method, apparatus and computer program product for providing improved voice conversion |
| JP4577409B2 (ja) * | 2008-06-10 | 2010-11-10 | ソニー株式会社 | Playback apparatus, playback method, program, and data structure |
| CN102063899B (zh) * | 2010-10-27 | 2012-05-23 | 南京邮电大学 | Voice conversion method under non-parallel text conditions |
| CN102664003B (zh) * | 2012-04-24 | 2013-12-04 | 南京邮电大学 | Residual excitation signal synthesis and voice conversion method based on a harmonic plus noise model |
| ES2432480B2 (es) * | 2012-06-01 | 2015-02-10 | Universidad De Las Palmas De Gran Canaria | Method for the clinical evaluation of the phonatory system of patients with laryngeal pathologies through an acoustic evaluation of voice quality |
| US9570087B2 (en) * | 2013-03-15 | 2017-02-14 | Broadcom Corporation | Single channel suppression of interfering sources |
| CN109493880A (zh) * | 2016-01-22 | 2019-03-19 | 大连民族大学 | Method for preliminary screening of the fundamental frequency of a harmonic signal |
| WO2018138543A1 (en) * | 2017-01-24 | 2018-08-02 | Hua Kanru | Probabilistic method for fundamental frequency estimation |
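| CN108766450B (zh) * | 2018-04-16 | 2023-02-17 | 杭州电子科技大学 | Voice conversion method based on harmonic impulse decomposition |
| CN108922516B (zh) * | 2018-06-29 | 2020-11-06 | 北京语言大学 | Method and apparatus for detecting pitch range values |
| CN111179902B (zh) * | 2020-01-06 | 2022-10-28 | 厦门快商通科技股份有限公司 | Speech synthesis method, device, and medium based on simulating a resonance cavity with a Gaussian model |
| CN112750446B (zh) * | 2020-12-30 | 2024-05-24 | 标贝(青岛)科技有限公司 | Voice conversion method, apparatus, and system, and storage medium |
| CN115148225B (zh) * | 2021-03-30 | 2024-09-03 | 北京猿力未来科技有限公司 | Intonation scoring method, intonation scoring system, computing device, and storage medium |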
Family Cites Families (4)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| WO1993018505A1 (en) * | 1992-03-02 | 1993-09-16 | The Walt Disney Company | Voice transformation system |
| DE69826446T2 (de) * | 1997-01-27 | 2005-01-20 | Microsoft Corp., Redmond | Voice conversion |
| WO1999003095A1 (en) * | 1997-07-11 | 1999-01-21 | Koninklijke Philips Electronics N.V. | Transmitter with an improved harmonic speech encoder |
| CN1151490C (zh) * | 2000-09-13 | 2004-05-26 | 中国科学院自动化研究所 | High-precision, high-resolution fundamental frequency extraction method for speech recognition |
2003
- 2003-03-27: FR, application FR0303790A, publication FR2853125A1 (fr); status: Pending (active)

2004
- 2004-03-02: WO, application PCT/FR2004/000483, publication WO2004088633A1 (fr); status: Ceased (not active)
- 2004-03-02: DE, application DE602004013747T, publication DE602004013747D1 (de); status: Expired - Lifetime (not active)
- 2004-03-02: CN, application CN200480014488.8A, publication CN100583235C (zh); status: Expired - Fee Related (not active)
- 2004-03-02: US, application US10/551,224, publication US7643988B2 (en); status: Expired - Fee Related (not active)
- 2004-03-02: EP, application EP04716265A, publication EP1606792B1 (fr); status: Expired - Lifetime (not active)
- 2004-03-02: JP, application JP2006505682A, publication JP4382808B2 (ja); status: Expired - Fee Related (not active)
- 2004-03-02: AT, application AT04716265T, publication ATE395684T1 (de); status: IP Right Cessation (not active)
Non-Patent Citations (3)
| Title |
|---|
| DOVAL B ET AL: "Fundamental frequency estimation and tracking using maximum likelihood harmonic matching and HMMs", STATISTICAL SIGNAL AND ARRAY PROCESSING. MINNEAPOLIS, APR. 27 - 30, 1993, PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), NEW YORK, IEEE, US, vol. 4, 27 April 1993 (1993-04-27), pages 221 - 224, XP010110214, ISBN: 0-7803-0946-4 * |
| KAIN A ET AL: "Stochastic modeling of spectral adjustment for high quality pitch modification", ICASSP 2000, vol. 2, 5 June 2000 (2000-06-05), pages 949 - 952, XP010504881 * |
| STYLIANOU Y ET AL: "A SYSTEM FOR VOICE CONVERSION BASED ON PROBABILISTIC CLASSIFICATION AND A HARMONIC PLUS NOISE MODEL", PROCEEDINGS OF THE 1998 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING. ICASSP '98. SEATTLE, WA, MAY 12 - 15, 1998, IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, NEW YORK, NY: IEEE, US, vol. 1 CONF. 23, 12 May 1998 (1998-05-12), pages 281 - 284, XP000854570, ISBN: 0-7803-4429-4 * |
Also Published As
| Publication number | Publication date |
|---|---|
| JP4382808B2 (ja) | 2009-12-16 |
| JP2006521576A (ja) | 2006-09-21 |
| CN1795491A (zh) | 2006-06-28 |
| EP1606792A1 (fr) | 2005-12-21 |
| CN100583235C (zh) | 2010-01-20 |
| US7643988B2 (en) | 2010-01-05 |
| ATE395684T1 (de) | 2008-05-15 |
| EP1606792B1 (fr) | 2008-05-14 |
| DE602004013747D1 (de) | 2008-06-26 |
| WO2004088633A1 (fr) | 2004-10-14 |
| US20060178874A1 (en) | 2006-08-10 |
Similar Documents
| Publication | Title | Publication Date |
|---|---|---|
| Helander et al. | Voice conversion using dynamic kernel partial least squares regression | |
| EP1730729A1 (fr) | Improved method and system for converting a voice signal | |
| McLoughlin | Line spectral pairs | |
| EP1606792B1 (fr) | Method for analyzing fundamental frequency information and voice conversion method and system implementing said analysis method | |
| US9343060B2 (en) | Voice processing using conversion function based on respective statistics of a first and a second probability distribution | |
| Prasad et al. | Bandwidth extension of speech signals: A comprehensive review | |
| EP0759201A1 (en) | Audio analysis/synthesis system | |
| EP1730728A1 (fr) | Method and system for fast conversion of a voice signal | |
| Bhatt | Simulation and overall comparative evaluation of performance between different techniques for high band feature extraction based on artificial bandwidth extension of speech over proposed global system for mobile full rate narrow band coder | |
| Bansal et al. | Low bit-rate speech coding based on multicomponent AFM signal model | |
| Grumiaux et al. | Efficient bandwidth extension of musical signals using a differentiable harmonic plus noise model | |
| Liu et al. | Audio bandwidth extension based on temporal smoothing cepstral coefficients | |
| Al-Radhi et al. | Continuous vocoder applied in deep neural network based voice conversion | |
| Srivastava | Fundamentals of linear prediction | |
| Xiao et al. | Speech intelligibility enhancement by non-parallel speech style conversion using CWT and iMetricGAN based CycleGAN | |
| Liu et al. | Audio bandwidth extension based on ensemble echo state networks with temporal evolution | |
| Berisha et al. | Bandwidth extension of speech using perceptual criteria | |
| Gupta et al. | A new framework for artificial bandwidth extension using H∞ filtering | |
| JP7750250B2 (ja) | Audio transposition | |
| Orphanidou et al. | Voice morphing using the generative topographic mapping | |
| EP1846918A1 (fr) | Method for estimating a voice conversion function | |
| Gowriprasad et al. | Linear prediction on Cent scale for fundamental frequency analysis | |
| Jinachitra | Robust structured voice extraction for flexible expressive resynthesis | |
| Johansen | Bandwidth Extension of Telephony Speech | |
| Li et al. | Variable bit-rate sinusoidal transform coding using variable order spectral estimation. | |