CA2259374A1 - Speech synthesis system - Google Patents
Speech synthesis system Download PDFInfo
- Publication number
- CA2259374A1 CA2259374A1 CA002259374A CA2259374A CA2259374A1 CA 2259374 A1 CA2259374 A1 CA 2259374A1 CA 002259374 A CA002259374 A CA 002259374A CA 2259374 A CA2259374 A CA 2259374A CA 2259374 A1 CA2259374 A1 CA 2259374A1
- Authority
- CA
- Canada
- Prior art keywords
- frame
- voiced
- pitch
- speech
- lpc
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Abandoned
Links
- 230000015572 biosynthetic process Effects 0.000 title claims abstract description 64
- 238000003786 synthesis reaction Methods 0.000 title claims abstract description 64
- 238000005314 correlation function Methods 0.000 claims abstract description 7
- 239000013074 reference sample Substances 0.000 claims abstract description 6
- 238000000034 method Methods 0.000 claims description 141
- 239000013598 vector Substances 0.000 claims description 136
- 230000008569 process Effects 0.000 claims description 101
- 230000003595 spectral effect Effects 0.000 claims description 64
- 230000005284 excitation Effects 0.000 claims description 59
- 238000001228 spectrum Methods 0.000 claims description 38
- 238000012549 training Methods 0.000 claims description 21
- 238000013139 quantization Methods 0.000 claims description 18
- 230000004044 response Effects 0.000 claims description 7
- 238000005070 sampling Methods 0.000 claims description 4
- 230000001419 dependent effect Effects 0.000 claims description 3
- 238000010606 normalization Methods 0.000 claims description 3
- 230000003247 decreasing effect Effects 0.000 claims description 2
- 238000012804 iterative process Methods 0.000 claims description 2
- 230000006870 function Effects 0.000 description 28
- 238000010586 diagram Methods 0.000 description 18
- 150000002500 ions Chemical class 0.000 description 16
- 238000013459 approach Methods 0.000 description 11
- 230000009466 transformation Effects 0.000 description 10
- 230000000737 periodic effect Effects 0.000 description 8
- 239000011159 matrix material Substances 0.000 description 6
- 101150105814 ERMN gene Proteins 0.000 description 5
- 230000003044 adaptive effect Effects 0.000 description 5
- 238000004364 calculation method Methods 0.000 description 5
- 238000012938 design process Methods 0.000 description 5
- 238000013461 design Methods 0.000 description 4
- 238000011161 development Methods 0.000 description 4
- 230000018109 developmental process Effects 0.000 description 4
- 239000000523 sample Substances 0.000 description 4
- 230000008901 benefit Effects 0.000 description 3
- 238000001914 filtration Methods 0.000 description 3
- 230000001755 vocal effect Effects 0.000 description 3
- 241000283986 Lepus Species 0.000 description 2
- 230000005540 biological transmission Effects 0.000 description 2
- 230000008859 change Effects 0.000 description 2
- 238000004891 communication Methods 0.000 description 2
- 238000000354 decomposition reaction Methods 0.000 description 2
- 230000000694 effects Effects 0.000 description 2
- 230000006872 improvement Effects 0.000 description 2
- 239000002674 ointment Substances 0.000 description 2
- 238000012545 processing Methods 0.000 description 2
- 230000001360 synchronised effect Effects 0.000 description 2
- 230000007704 transition Effects 0.000 description 2
- DHSSDEDRBUKTQY-UHFFFAOYSA-N 6-prop-2-enyl-4,5,7,8-tetrahydrothiazolo[4,5-d]azepin-2-amine Chemical compound C1CN(CC=C)CCC2=C1N=C(N)S2 DHSSDEDRBUKTQY-UHFFFAOYSA-N 0.000 description 1
- 101150071228 Lifr gene Proteins 0.000 description 1
- 241000282320 Panthera leo Species 0.000 description 1
- 230000009471 action Effects 0.000 description 1
- 230000002411 adverse Effects 0.000 description 1
- 230000003190 augmentative effect Effects 0.000 description 1
- 230000015556 catabolic process Effects 0.000 description 1
- 150000001768 cations Chemical class 0.000 description 1
- 230000001427 coherent effect Effects 0.000 description 1
- 238000007906 compression Methods 0.000 description 1
- 230000006835 compression Effects 0.000 description 1
- 239000013256 coordination polymer Substances 0.000 description 1
- 238000013144 data compression Methods 0.000 description 1
- 238000006731 degradation reaction Methods 0.000 description 1
- 230000001934 delay Effects 0.000 description 1
- 238000001514 detection method Methods 0.000 description 1
- 239000006185 dispersion Substances 0.000 description 1
- 238000006073 displacement reaction Methods 0.000 description 1
- 230000009977 dual effect Effects 0.000 description 1
- 238000004870 electrical engineering Methods 0.000 description 1
- SEACYXSIPDVVMV-UHFFFAOYSA-L eosin Y Chemical compound [Na+].[Na+].[O-]C(=O)C1=CC=CC=C1C1=C2C=C(Br)C(=O)C(Br)=C2OC2=C(Br)C([O-])=C(Br)C=C21 SEACYXSIPDVVMV-UHFFFAOYSA-L 0.000 description 1
- 230000007774 longterm Effects 0.000 description 1
- 238000005065 mining Methods 0.000 description 1
- 238000002156 mixing Methods 0.000 description 1
- 238000012805 post-processing Methods 0.000 description 1
- 238000011084 recovery Methods 0.000 description 1
- 238000007789 sealing Methods 0.000 description 1
- 101150115956 slc25a26 gene Proteins 0.000 description 1
- 229950008418 talipexole Drugs 0.000 description 1
- 238000012360 testing method Methods 0.000 description 1
- 238000012546 transfer Methods 0.000 description 1
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/93—Discriminating between voiced and unvoiced parts of speech signals
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/90—Pitch determination of speech signals
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/08—Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters
- G10L19/10—Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters the excitation function being a multipulse excitation
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/93—Discriminating between voiced and unvoiced parts of speech signals
- G10L2025/937—Signal energy in various frequency bands
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/27—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the analysis technique
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Multimedia (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Computational Linguistics (AREA)
- Acoustics & Sound (AREA)
- Signal Processing (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
- Transmission Systems Not Characterized By The Medium Used For Transmission (AREA)
- Reduction Or Emphasis Of Bandwidth Of Signals (AREA)
- Telephonic Communication Services (AREA)
- Aerials With Secondary Devices (AREA)
- Optical Communication System (AREA)
Applications Claiming Priority (4)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
GB9614209.6 | 1996-07-05 | ||
GBGB9614209.6A GB9614209D0 (en) | 1996-07-05 | 1996-07-05 | Speech synthesis system |
US2181596P | 1996-07-16 | 1996-07-16 | |
US021,815 | 1996-07-16 |
Publications (1)
Publication Number | Publication Date |
---|---|
CA2259374A1 true CA2259374A1 (en) | 1998-01-15 |
Family
ID=26309651
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CA002259374A Abandoned CA2259374A1 (en) | 1996-07-05 | 1997-07-07 | Speech synthesis system |
Country Status (7)
Country | Link |
---|---|
EP (1) | EP0950238B1 (de) |
JP (1) | JP2000514207A (de) |
AT (1) | ATE249672T1 (de) |
AU (1) | AU3452397A (de) |
CA (1) | CA2259374A1 (de) |
DE (1) | DE69724819D1 (de) |
WO (1) | WO1998001848A1 (de) |
Families Citing this family (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
FR2784218B1 (fr) * | 1998-10-06 | 2000-12-08 | Thomson Csf | Procede de codage de la parole a bas debit |
GB2357683A (en) * | 1999-12-24 | 2001-06-27 | Nokia Mobile Phones Ltd | Voiced/unvoiced determination for speech coding |
GB2398981B (en) * | 2003-02-27 | 2005-09-14 | Motorola Inc | Speech communication unit and method for synthesising speech therein |
DE102004007184B3 (de) | 2004-02-13 | 2005-09-22 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Verfahren und Vorrichtung zum Quantisieren eines Informationssignals |
DE102004007191B3 (de) | 2004-02-13 | 2005-09-01 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Audiocodierung |
DE102004007200B3 (de) | 2004-02-13 | 2005-08-11 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Audiocodierung |
CN114519996B (zh) * | 2022-04-20 | 2022-07-08 | 北京远鉴信息技术有限公司 | 一种语音合成类型的确定方法、装置、设备以及存储介质 |
Family Cites Families (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
FR2670313A1 (fr) * | 1990-12-11 | 1992-06-12 | Thomson Csf | Procede et dispositif pour l'evaluation de la periodicite et du voisement du signal de parole dans les vocodeurs a tres bas debit. |
JP3093113B2 (ja) * | 1994-09-21 | 2000-10-03 | 日本アイ・ビー・エム株式会社 | 音声合成方法及びシステム |
KR19980702608A (ko) * | 1995-03-07 | 1998-08-05 | 에버쉐드마이클 | 음성 합성기 |
-
1997
- 1997-07-07 AT AT97930643T patent/ATE249672T1/de not_active IP Right Cessation
- 1997-07-07 AU AU34523/97A patent/AU3452397A/en not_active Abandoned
- 1997-07-07 DE DE69724819T patent/DE69724819D1/de not_active Expired - Lifetime
- 1997-07-07 CA CA002259374A patent/CA2259374A1/en not_active Abandoned
- 1997-07-07 JP JP10504943A patent/JP2000514207A/ja active Pending
- 1997-07-07 EP EP97930643A patent/EP0950238B1/de not_active Expired - Lifetime
- 1997-07-07 WO PCT/GB1997/001831 patent/WO1998001848A1/en active IP Right Grant
Also Published As
Publication number | Publication date |
---|---|
AU3452397A (en) | 1998-02-02 |
EP0950238A1 (de) | 1999-10-20 |
JP2000514207A (ja) | 2000-10-24 |
DE69724819D1 (de) | 2003-10-16 |
WO1998001848A1 (en) | 1998-01-15 |
ATE249672T1 (de) | 2003-09-15 |
EP0950238B1 (de) | 2003-09-10 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
EP0422232B1 (de) | Stimmenkodierer | |
Sugamura et al. | Speech analysis and synthesis methods developed at ECL in NTT—From LPC to LSP— | |
US6871176B2 (en) | Phase excited linear prediction encoder | |
EP1222659B1 (de) | Lpc-harmonischer sprachkodierer mit überrahmenformat | |
EP0981816B9 (de) | Systeme und verfahren zur audio-kodierung | |
EP1145228B1 (de) | Kodierung periodischer sprache | |
US20020099548A1 (en) | Variable rate speech coding | |
KR20010022092A (ko) | 이격 대역 선형 예상 보코더 | |
JPH01221800A (ja) | 音響波形のコード化方式 | |
CA2259374A1 (en) | Speech synthesis system | |
US6199040B1 (en) | System and method for communicating a perceptually encoded speech spectrum signal | |
US5822721A (en) | Method and apparatus for fractal-excited linear predictive coding of digital signals | |
CA2177226C (en) | Method of and apparatus for coding speech signal | |
US5937374A (en) | System and method for improved pitch estimation which performs first formant energy removal for a frame using coefficients from a prior frame | |
US20050065782A1 (en) | Hybrid speech coding and system | |
Gottesman et al. | High quality enhanced waveform interpolative coding at 2.8 kbps | |
Erzin et al. | Interframe differential coding of line spectrum frequencies | |
Xydeas et al. | A long history quantization approach to scalar and vector quantization of LSP coefficients | |
Srinonchat | New technique to reduce bit rate of LPC-10 speech coder | |
US20050065786A1 (en) | Hybrid speech coding and system | |
Copperi | Rule-based speech analysis and application of CELP coding | |
Taniguchi et al. | Principal axis extracting vector excitation coding: high quality speech at 8 kb/s | |
Viswanathan et al. | A harmonic deviations linear prediction vocoder for improved narrowband speech transmission | |
Chen et al. | Subframe Interpolation Optimized Coding of LSF Parameters | |
So et al. | Empirical lower bound on the bitrate for the transparent memoryless coding of wideband LPC parameters |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
EEER | Examination request | ||
FZDE | Discontinued |