AU3452397A - Speech synthesis system - Google Patents
Speech synthesis systemInfo
- Publication number
- AU3452397A AU3452397A AU34523/97A AU3452397A AU3452397A AU 3452397 A AU3452397 A AU 3452397A AU 34523/97 A AU34523/97 A AU 34523/97A AU 3452397 A AU3452397 A AU 3452397A AU 3452397 A AU3452397 A AU 3452397A
- Authority
- AU
- Australia
- Prior art keywords
- speech
- synthesis system
- series
- frame
- reference sample
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Abandoned
Links
- 230000015572 biosynthetic process Effects 0.000 title abstract 2
- 238000003786 synthesis reaction Methods 0.000 title abstract 2
- 239000013074 reference sample Substances 0.000 abstract 2
- 238000005314 correlation function Methods 0.000 abstract 1
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/93—Discriminating between voiced and unvoiced parts of speech signals
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/90—Pitch determination of speech signals
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/08—Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters
- G10L19/10—Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters the excitation function being a multipulse excitation
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/93—Discriminating between voiced and unvoiced parts of speech signals
- G10L2025/937—Signal energy in various frequency bands
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/27—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the analysis technique
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Multimedia (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Computational Linguistics (AREA)
- Acoustics & Sound (AREA)
- Signal Processing (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
- Transmission Systems Not Characterized By The Medium Used For Transmission (AREA)
- Reduction Or Emphasis Of Bandwidth Of Signals (AREA)
- Telephonic Communication Services (AREA)
- Aerials With Secondary Devices (AREA)
- Optical Communication System (AREA)
Abstract
A speech synthesis system in which a speech signal is divided into a series of frames, and each frame is converted into a coded signal including a voiced/unvoiced classification and a pitch estimate, wherein a low pass filtered speech segment centred about a reference sample is defined in each frame, a correlation value is calculated for each of a series of candidate pitch estimates as the maximum of multiple crosscorrelation values obtained from variable length speech segments centred about the reference sample, the correlation values are used to form a correlation function defining peaks, and the locations of the peaks are determined and used to define a pitch estimate.
Applications Claiming Priority (5)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
GB9614209 | 1996-07-05 | ||
GBGB9614209.6A GB9614209D0 (en) | 1996-07-05 | 1996-07-05 | Speech synthesis system |
US2181596P | 1996-07-16 | 1996-07-16 | |
US021815 | 1996-07-16 | ||
PCT/GB1997/001831 WO1998001848A1 (en) | 1996-07-05 | 1997-07-07 | Speech synthesis system |
Publications (1)
Publication Number | Publication Date |
---|---|
AU3452397A true AU3452397A (en) | 1998-02-02 |
Family
ID=26309651
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
AU34523/97A Abandoned AU3452397A (en) | 1996-07-05 | 1997-07-07 | Speech synthesis system |
Country Status (7)
Country | Link |
---|---|
EP (1) | EP0950238B1 (en) |
JP (1) | JP2000514207A (en) |
AT (1) | ATE249672T1 (en) |
AU (1) | AU3452397A (en) |
CA (1) | CA2259374A1 (en) |
DE (1) | DE69724819D1 (en) |
WO (1) | WO1998001848A1 (en) |
Families Citing this family (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
FR2784218B1 (en) * | 1998-10-06 | 2000-12-08 | Thomson Csf | LOW-SPEED SPEECH CODING METHOD |
GB2357683A (en) * | 1999-12-24 | 2001-06-27 | Nokia Mobile Phones Ltd | Voiced/unvoiced determination for speech coding |
GB2398981B (en) * | 2003-02-27 | 2005-09-14 | Motorola Inc | Speech communication unit and method for synthesising speech therein |
DE102004007184B3 (en) | 2004-02-13 | 2005-09-22 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Method and apparatus for quantizing an information signal |
DE102004007191B3 (en) | 2004-02-13 | 2005-09-01 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Audio coding |
DE102004007200B3 (en) | 2004-02-13 | 2005-08-11 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Device for audio encoding has device for using filter to obtain scaled, filtered audio value, device for quantizing it to obtain block of quantized, scaled, filtered audio values and device for including information in coded signal |
CN114519996B (en) * | 2022-04-20 | 2022-07-08 | 北京远鉴信息技术有限公司 | Method, device and equipment for determining voice synthesis type and storage medium |
Family Cites Families (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
FR2670313A1 (en) * | 1990-12-11 | 1992-06-12 | Thomson Csf | METHOD AND DEVICE FOR EVALUATING THE PERIODICITY AND VOICE SIGNAL VOICE IN VOCODERS AT VERY LOW SPEED. |
JP3093113B2 (en) * | 1994-09-21 | 2000-10-03 | 日本アイ・ビー・エム株式会社 | Speech synthesis method and system |
KR19980702608A (en) * | 1995-03-07 | 1998-08-05 | 에버쉐드마이클 | Speech synthesizer |
-
1997
- 1997-07-07 AT AT97930643T patent/ATE249672T1/en not_active IP Right Cessation
- 1997-07-07 AU AU34523/97A patent/AU3452397A/en not_active Abandoned
- 1997-07-07 DE DE69724819T patent/DE69724819D1/en not_active Expired - Lifetime
- 1997-07-07 CA CA002259374A patent/CA2259374A1/en not_active Abandoned
- 1997-07-07 JP JP10504943A patent/JP2000514207A/en active Pending
- 1997-07-07 EP EP97930643A patent/EP0950238B1/en not_active Expired - Lifetime
- 1997-07-07 WO PCT/GB1997/001831 patent/WO1998001848A1/en active IP Right Grant
Also Published As
Publication number | Publication date |
---|---|
EP0950238A1 (en) | 1999-10-20 |
JP2000514207A (en) | 2000-10-24 |
DE69724819D1 (en) | 2003-10-16 |
WO1998001848A1 (en) | 1998-01-15 |
ATE249672T1 (en) | 2003-09-15 |
CA2259374A1 (en) | 1998-01-15 |
EP0950238B1 (en) | 2003-09-10 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US4516259A (en) | Speech analysis-synthesis system | |
EP0732687A3 (en) | Apparatus for expanding speech bandwidth | |
EP0838804A3 (en) | Audio bandwidth extending system and method | |
EP1164578A3 (en) | Speech decoding method and apparatus | |
EP0731449A3 (en) | Method for the modification of PLC coefficients of acoustic signals | |
CA2144823A1 (en) | Estimation of excitation parameters | |
EP0788091A3 (en) | Speech encoding and decoding method and apparatus therefor | |
CA2176665A1 (en) | Method of adapting the noise masking level in an analysis-by-synthesis speech coder employing a short-term perceptual weighting filter | |
CA1321646C (en) | Coded speech communication system having code books for synthesizing small-amplitude components | |
EP1391879A3 (en) | Speech coding apparatus, linear prediction coefficient analyzing apparatus and noise reducing apparatus | |
EP0911807A3 (en) | Sound synthesizing method and apparatus, and sound band expanding method and apparatus | |
EP0714089A3 (en) | Code-excited linear predictive coder and decoder with conversion filter for converting stochastic and impulse excitation signals | |
DE69421354D1 (en) | Data compression for speech recognition | |
EP0932141A3 (en) | Method for signal controlled switching between different audio coding schemes | |
EP0831460A3 (en) | Speech synthesis method utilizing auxiliary information | |
CA2150614A1 (en) | Method of Speech Synthesis by Means of Concatenation and Partial Overlapping of Waveforms | |
EP0762386A3 (en) | Method and apparatus for CELP coding an audio signal while distinguishing speech periods and non-speech periods | |
CA2455059A1 (en) | Speech bandwidth extension apparatus and speech bandwidth extension method | |
CA2309921A1 (en) | Method and apparatus for pitch estimation using perception based analysis by synthesis | |
AU697892B2 (en) | Analysis-by-synthesis speech coding method | |
EP0726560A3 (en) | Variable speed playback system | |
AU3452397A (en) | Speech synthesis system | |
FR2861491B1 (en) | METHOD FOR SELECTING SYNTHESIS UNITS | |
TW334557B (en) | Speech synthesis system | |
TW353748B (en) | Speech encoding method and apparatus and pitch detection method and apparatus |