CA2185134C - Apparatus for synthesizing speech by varying pitch
- Publication number
- CA2185134C
- Authority
- CA
- Canada
- Prior art keywords
- pitch
- speech
- excitation
- synthesis apparatus
- windows
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Fee Related
Links
- 230000002194 synthesizing effect Effects 0.000 title description 2
- 230000005284 excitation Effects 0.000 claims abstract description 35
- 230000003595 spectral effect Effects 0.000 claims abstract description 18
- 230000006870 function Effects 0.000 claims abstract description 9
- 230000001755 vocal effect Effects 0.000 claims abstract description 8
- 230000001360 synchronised effect Effects 0.000 claims abstract description 6
- 238000003786 synthesis reaction Methods 0.000 claims description 31
- 230000015572 biosynthetic process Effects 0.000 claims description 29
- 238000004458 analytical method Methods 0.000 claims description 23
- 230000004044 response Effects 0.000 claims description 6
- 230000006835 compression Effects 0.000 claims description 5
- 238000007906 compression Methods 0.000 claims description 5
- 230000001419 dependent effect Effects 0.000 claims description 3
- 238000000034 method Methods 0.000 description 27
- 230000008569 process Effects 0.000 description 11
- 238000012952 Resampling Methods 0.000 description 10
- 238000001914 filtration Methods 0.000 description 6
- 230000002123 temporal effect Effects 0.000 description 6
- 238000012545 processing Methods 0.000 description 5
- 238000005070 sampling Methods 0.000 description 5
- 238000001228 spectrum Methods 0.000 description 4
- 238000010586 diagram Methods 0.000 description 3
- 230000000694 effects Effects 0.000 description 3
- 238000013507 mapping Methods 0.000 description 3
- 238000012986 modification Methods 0.000 description 3
- 230000004048 modification Effects 0.000 description 3
- 230000008859 change Effects 0.000 description 2
- 238000005516 engineering process Methods 0.000 description 2
- 238000009499 grossing Methods 0.000 description 2
- 238000007781 pre-processing Methods 0.000 description 2
- 230000009286 beneficial effect Effects 0.000 description 1
- 230000008901 benefit Effects 0.000 description 1
- 238000004891 communication Methods 0.000 description 1
- 238000007796 conventional method Methods 0.000 description 1
- 238000013461 design Methods 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 230000005279 excitation period Effects 0.000 description 1
- 238000000926 separation method Methods 0.000 description 1
- 238000010183 spectrum analysis Methods 0.000 description 1
- 238000001308 synthesis method Methods 0.000 description 1
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
- G10L13/08—Text analysis or generation of parameters for speech synthesis out of text, e.g. grapheme to phoneme translation, prosody generation or stress or intonation determination
- G10L13/10—Prosody rules derived from text; Stress or intonation
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
- G10L13/02—Methods for producing synthetic speech; Speech synthesisers
- G10L13/04—Details of speech synthesis systems, e.g. synthesiser structure or memory management
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0208—Noise filtering
- G10L21/0264—Noise filtering characterised by the type of parameter measurement, e.g. correlation techniques, zero crossing techniques or predictive techniques
Landscapes
- Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
- Signal Processing Not Specific To The Method Of Recording And Reproducing (AREA)
- Electrophonic Musical Instruments (AREA)
- Measurement Of Mechanical Vibrations Or Ultrasonic Waves (AREA)
Applications Claiming Priority (4)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
EP94301953.9 | 1994-03-18 | ||
EP94301953 | 1994-03-18 | ||
SG1996003308A SG43076A1 (en) | 1994-03-18 | 1994-03-18 | Speech synthesis |
PCT/GB1995/000588 WO1995026024A1 (en) | 1994-03-18 | 1995-03-17 | Speech synthesis |
Publications (2)
Publication Number | Publication Date |
---|---|
CA2185134A1 (en) | 1995-09-28 |
CA2185134C (en) | 2001-04-24 |
Family
ID=26136991
Family Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CA002185134A Expired - Fee Related CA2185134C (en) | 1994-03-18 | 1995-03-17 | Apparatus for synthesizing speech by varying pitch |
Country Status (10)
Country | Link |
---|---|
EP (1) | EP0750778B1 (en) |
JP (1) | JPH09510554A (ja) |
CN (1) | CN1144008A (zh) |
AU (1) | AU692238B2 (en) |
CA (1) | CA2185134C (en) |
DE (1) | DE69519086T2 (de) |
ES (1) | ES2152390T3 (es) |
NZ (1) | NZ282012A (en) |
SG (1) | SG43076A1 (en) |
WO (1) | WO1995026024A1 (en) |
Families Citing this family (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP3093113B2 (ja) * | 1994-09-21 | 2000-10-03 | 日本アイ・ビー・エム株式会社 | Speech synthesis method and system |
DE69509555T2 (de) * | 1994-11-25 | 1999-09-02 | Fink | Method for modifying a speech signal by means of fundamental frequency manipulation |
EP1019906B1 (en) * | 1997-01-27 | 2004-06-16 | Entropic Research Laboratory Inc. | A system and methodology for prosody modification |
CN104205213B (zh) * | 2012-03-23 | 2018-01-05 | 西门子公司 | Speech signal processing method and device, and hearing aid using the same |
JP6446993B2 (ja) * | 2014-10-20 | 2019-01-09 | ヤマハ株式会社 | Voice control device and program |
Family Cites Families (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5163110A (en) * | 1990-08-13 | 1992-11-10 | First Byte | Pitch control in artificial speech |
-
1994
- 1994-03-18 SG SG1996003308A patent/SG43076A1/en unknown
-
1995
- 1995-03-17 ES ES95911420T patent/ES2152390T3/es not_active Expired - Lifetime
- 1995-03-17 AU AU18995/95A patent/AU692238B2/en not_active Ceased
- 1995-03-17 WO PCT/GB1995/000588 patent/WO1995026024A1/en active IP Right Grant
- 1995-03-17 NZ NZ282012A patent/NZ282012A/en not_active IP Right Cessation
- 1995-03-17 EP EP95911420A patent/EP0750778B1/en not_active Expired - Lifetime
- 1995-03-17 DE DE69519086T patent/DE69519086T2/de not_active Expired - Lifetime
- 1995-03-17 CN CN95192141A patent/CN1144008A/zh active Pending
- 1995-03-17 CA CA002185134A patent/CA2185134C/en not_active Expired - Fee Related
- 1995-03-17 JP JP7524461A patent/JPH09510554A/ja not_active Ceased
Also Published As
Publication number | Publication date |
---|---|
DE69519086T2 (de) | 2001-05-10 |
DE69519086D1 (de) | 2000-11-16 |
EP0750778A1 (en) | 1997-01-02 |
JPH09510554A (ja) | 1997-10-21 |
CN1144008A (zh) | 1997-02-26 |
WO1995026024A1 (en) | 1995-09-28 |
ES2152390T3 (es) | 2001-02-01 |
AU1899595A (en) | 1995-10-09 |
AU692238B2 (en) | 1998-06-04 |
SG43076A1 (en) | 1997-10-17 |
EP0750778B1 (en) | 2000-10-11 |
NZ282012A (en) | 1997-05-26 |
CA2185134A1 (en) | 1995-09-28 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Charpentier et al. | Pitch-synchronous waveform processing techniques for text-to-speech synthesis using diphones. | |
Moulines et al. | Pitch-synchronous waveform processing techniques for text-to-speech synthesis using diphones | |
US5787398A (en) | Apparatus for synthesizing speech by varying pitch | |
Laroche | Time and pitch scale modification of audio signals | |
US8706496B2 (en) | Audio signal transforming by utilizing a computational cost function | |
Moulines et al. | Non-parametric techniques for pitch-scale and time-scale modification of speech | |
Stylianou | Applying the harmonic plus noise model in concatenative speech synthesis | |
Moulines et al. | Time-domain and frequency-domain techniques for prosodic modification of speech | |
JP4641620B2 (ja) | Refinement of pitch detection | |
JPH03501896A (ja) | Processing device for speech synthesis by overlap-adding waveforms | |
US5987413A (en) | Envelope-invariant analytical speech resynthesis using periodic signals derived from reharmonized frame spectrum | |
Stylianou et al. | Diphone concatenation using a harmonic plus noise model of speech. | |
Stylianou | Concatenative speech synthesis using a harmonic plus noise model | |
JPH08254993A (ja) | Speech synthesis device | |
O'Brien et al. | Concatenative synthesis based on a harmonic model | |
KR100457414B1 (ko) | Speech synthesis method, speech synthesis apparatus, and recording medium | |
CA2185134C (en) | Apparatus for synthesizing speech by varying pitch | |
Bonada | High quality voice transformations based on modeling radiated voice pulses in frequency domain | |
EP1500080B1 (en) | Method for synthesizing speech | |
Edgington et al. | Residual-based speech modification algorithms for text-to-speech synthesis | |
Acero | Source-filter models for time-scale pitch-scale modification of speech | |
JP3089940B2 (ja) | Speech synthesis device | |
KR100417092B1 (ko) | Speech synthesis method | |
Fries | Hybrid time-and frequency-domain speech synthesis with extended glottal source generation | |
Gigi et al. | A mixed-excitation vocoder based on exact analysis of harmonic components |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
EEER | Examination request | ||
MKLA | Lapsed |