NZ725925A - Method for forming the excitation signal for a glottal pulse model based parametric speech synthesis system - Google Patents
Method for forming the excitation signal for a glottal pulse model based parametric speech synthesis systemInfo
- Publication number
- NZ725925A NZ725925A NZ725925A NZ72592514A NZ725925A NZ 725925 A NZ725925 A NZ 725925A NZ 725925 A NZ725925 A NZ 725925A NZ 72592514 A NZ72592514 A NZ 72592514A NZ 725925 A NZ725925 A NZ 725925A
- Authority
- NZ
- New Zealand
- Prior art keywords
- excitation signal
- model based
- glottal
- forming
- synthesis system
- Prior art date
Links
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/90—Pitch determination of speech signals
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
- G10L13/02—Methods for producing synthetic speech; Speech synthesisers
Abstract
A method is presented for forming the excitation signal for a glottal pulse model based parametric speech synthesis system. In one embodiment, fundamental frequency values are used to form the excitation signal. The excitation is modeled using a voice source pulse selected from a database of a given speaker. The voice source signal is segmented into glottal segments, which are used in vector representation to identify the glottal pulse used for formation of the excitation signal. Use of a novel distance metric and preserving the original signals extracted from the speakers voice samples helps capture low frequency information of the excitation signal. In addition, segment edge artifacts are removed by applying a unique segment joining method to improve the quality of synthetic speech while creating a true representation of the voice quality of a speaker.
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
PCT/US2014/039722 WO2015183254A1 (en) | 2014-05-28 | 2014-05-28 | Method for forming the excitation signal for a glottal pulse model based parametric speech synthesis system |
Publications (1)
Publication Number | Publication Date |
---|---|
NZ725925A true NZ725925A (en) | 2020-04-24 |
Family
ID=54699420
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
NZ725925A NZ725925A (en) | 2014-05-28 | 2014-05-28 | Method for forming the excitation signal for a glottal pulse model based parametric speech synthesis system |
Country Status (8)
Country | Link |
---|---|
EP (1) | EP3149727B1 (en) |
JP (1) | JP6449331B2 (en) |
AU (2) | AU2014395554B2 (en) |
BR (1) | BR112016027537B1 (en) |
CA (2) | CA2947957C (en) |
NZ (1) | NZ725925A (en) |
WO (1) | WO2015183254A1 (en) |
ZA (1) | ZA201607696B (en) |
Families Citing this family (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US10255903B2 (en) | 2014-05-28 | 2019-04-09 | Interactive Intelligence Group, Inc. | Method for forming the excitation signal for a glottal pulse model based parametric speech synthesis system |
US10014007B2 (en) | 2014-05-28 | 2018-07-03 | Interactive Intelligence, Inc. | Method for forming the excitation signal for a glottal pulse model based parametric speech synthesis system |
NZ749370A (en) * | 2016-06-02 | 2020-03-27 | Genesys Telecommunications Laboratories Inc | Technologies for authenticating a speaker using voice biometrics |
JP2018040838A (en) * | 2016-09-05 | 2018-03-15 | 国立研究開発法人情報通信研究機構 | Method for extracting intonation structure of voice and computer program therefor |
Family Cites Families (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5400434A (en) * | 1990-09-04 | 1995-03-21 | Matsushita Electric Industrial Co., Ltd. | Voice source for synthetic speech system |
US6795807B1 (en) * | 1999-08-17 | 2004-09-21 | David R. Baraff | Method and means for creating prosody in speech regeneration for laryngectomees |
JP2002244689A (en) * | 2001-02-22 | 2002-08-30 | Rikogaku Shinkokai | Synthesizing method for averaged voice and method for synthesizing arbitrary-speaker's voice from averaged voice |
CN102047321A (en) * | 2008-05-30 | 2011-05-04 | 诺基亚公司 | Method, apparatus and computer program product for providing improved speech synthesis |
JP5075865B2 (en) * | 2009-03-25 | 2012-11-21 | 株式会社東芝 | Audio processing apparatus, method, and program |
PL2242045T3 (en) * | 2009-04-16 | 2013-02-28 | Univ Mons | Speech synthesis and coding methods |
JP5085700B2 (en) * | 2010-08-30 | 2012-11-28 | 株式会社東芝 | Speech synthesis apparatus, speech synthesis method and program |
US8744854B1 (en) * | 2012-09-24 | 2014-06-03 | Chengjun Julian Chen | System and method for voice transformation |
-
2014
- 2014-05-28 BR BR112016027537-3A patent/BR112016027537B1/en active IP Right Grant
- 2014-05-28 JP JP2016567717A patent/JP6449331B2/en active Active
- 2014-05-28 EP EP14893138.9A patent/EP3149727B1/en active Active
- 2014-05-28 CA CA2947957A patent/CA2947957C/en active Active
- 2014-05-28 CA CA3178027A patent/CA3178027A1/en active Pending
- 2014-05-28 WO PCT/US2014/039722 patent/WO2015183254A1/en active Application Filing
- 2014-05-28 NZ NZ725925A patent/NZ725925A/en unknown
- 2014-05-28 AU AU2014395554A patent/AU2014395554B2/en active Active
-
2016
- 2016-11-08 ZA ZA2016/07696A patent/ZA201607696B/en unknown
-
2020
- 2020-09-03 AU AU2020227065A patent/AU2020227065B2/en active Active
Also Published As
Publication number | Publication date |
---|---|
ZA201607696B (en) | 2019-03-27 |
CA2947957A1 (en) | 2015-12-03 |
EP3149727A1 (en) | 2017-04-05 |
AU2014395554A1 (en) | 2016-11-24 |
JP6449331B2 (en) | 2019-01-09 |
EP3149727A4 (en) | 2018-01-24 |
CA2947957C (en) | 2023-01-03 |
EP3149727B1 (en) | 2021-01-27 |
JP2017520016A (en) | 2017-07-20 |
WO2015183254A1 (en) | 2015-12-03 |
AU2020227065B2 (en) | 2021-11-18 |
BR112016027537B1 (en) | 2022-05-10 |
CA3178027A1 (en) | 2015-12-03 |
AU2020227065A1 (en) | 2020-09-24 |
AU2014395554B2 (en) | 2020-09-24 |
BR112016027537A2 (en) | 2017-08-15 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
MX2018004828A (en) | Apparatus and method for generating a filtered audio signal realizing elevation rendering. | |
MX364461B (en) | Method and apparatus for implementing recording of object audio, and electronic device. | |
MX2016012317A (en) | Apparatus and method for audio rendering employing a geometric distance definition. | |
WO2014145960A3 (en) | Method and system for generating advanced feature discrimination vectors for use in speech recognition | |
WO2014133756A3 (en) | Method and apparatus for learning-enhanced altas-based auto-segmentation | |
MY173561A (en) | Audio signal classification method and apparatus | |
NZ725925A (en) | Method for forming the excitation signal for a glottal pulse model based parametric speech synthesis system | |
WO2014025682A3 (en) | Acoustic data selection for training the parameters of an acoustic model | |
WO2012129255A3 (en) | Systems and methods for segmenting and/or classifying an audio signal from transformed audio information | |
MX352154B (en) | Unvoiced/voiced decision for speech processing. | |
EP2960866A3 (en) | Method and apparatus for creating curved surface model | |
TW201614642A (en) | Method and apparatus for separating speech data from background data in audio communication | |
JP2016071029A5 (en) | ||
GB2539592A (en) | Subsurface formation modeling with integrated stress profiles | |
MY172710A (en) | Apparatus and method for generating a frequency enhancement signal using an energy limitation operation | |
PH12016500600B1 (en) | Method, apparatus, device, computer-readable medium for bandwidth extension of an audio signal using a scaled high-band excitation | |
EP3363015A4 (en) | Method for forming the excitation signal for a glottal pulse model based parametric speech synthesis system | |
GB2565744A (en) | Full waveform inversion of vertical seismic profile data for anisotropic velocities using pseudo-acoustic wave equations | |
SG10201805102PA (en) | Audio coding method and related apparatus | |
MX2016012004A (en) | Apparatus, method and corresponding computer program for generating an error concealment signal using an adaptive noise estimation. | |
TW201612894A (en) | Synthesis method of audio files and synthesis system of audio files using same | |
GB2557056A (en) | Acoustic anisotrophy log visualization | |
GB2574164A (en) | Sound identification utilizing periodic indications | |
Sahidullah | Enhancement of speaker recognition performance using block level, relative and temporal information of subband energies | |
WO2016036163A3 (en) | Method and apparatus for learning and recognizing audio signal |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PSEA | Patent sealed | ||
RENW | Renewal (renewal fees accepted) |
Free format text: PATENT RENEWED FOR 1 YEAR UNTIL 28 MAY 2022 BY FRKELLY Effective date: 20210528 |
|
RENW | Renewal (renewal fees accepted) |
Free format text: PATENT RENEWED FOR 1 YEAR UNTIL 28 MAY 2023 BY FRKELLY Effective date: 20220506 |
|
RENW | Renewal (renewal fees accepted) |
Free format text: PATENT RENEWED FOR 1 YEAR UNTIL 28 MAY 2024 BY FRKELLY Effective date: 20230505 |