CA2251502A1 - Digital speech acquisition, transmission, storage and search system and method - Google Patents
Digital speech acquisition, transmission, storage and search system and method Download PDFInfo
- Publication number
- CA2251502A1 CA2251502A1 CA002251502A CA2251502A CA2251502A1 CA 2251502 A1 CA2251502 A1 CA 2251502A1 CA 002251502 A CA002251502 A CA 002251502A CA 2251502 A CA2251502 A CA 2251502A CA 2251502 A1 CA2251502 A1 CA 2251502A1
- Authority
- CA
- Canada
- Prior art keywords
- speech
- information
- determined
- prosody
- digital
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Abandoned
Links
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/0018—Speech coding using phonetic or linguistical decoding of the source; Reconstruction using text-to-speech synthesis
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
- G10L13/02—Methods for producing synthetic speech; Speech synthesisers
- G10L13/04—Details of speech synthesis systems, e.g. synthesiser structure or memory management
Abstract
A digital speech system processes acquired speech to determine speech element information, such as the phonemes in the speech, and associated the prosody information. This determined information is then encoded for transmission and/or storage.
The encoded information is decoded to recover the speech element information and the prosody information which can be provided to a speech generator to construct a facsimile of the original speech. Also, the speech element and prosody information can be searched to locate words or phrases of interest in the speech.
The encoded information is decoded to recover the speech element information and the prosody information which can be provided to a speech generator to construct a facsimile of the original speech. Also, the speech element and prosody information can be searched to locate words or phrases of interest in the speech.
Claims (9)
1. A digital speech system, comprising:
speech element determination means operable on speech in a digital format to determine speech element information from said speech;
speech prosody determination means operable on said speech to determine prosody information from said speech;
encoding means to encode speech information comprising said determined speech element information, said determined speech prosody information and timing information relating thereto in a digital form;
decoding means to decode said encoded speech information to obtain said determined speech element information, said determined speech prosody information and said timing information;
comparison means to compare said determined speech element information and said determined speech prosody information to a database of speech elements to select speech sound elements which correspond thereto; and speech generating means to assemble said selected speech sound elements to construct a facsimile of said speech.
speech element determination means operable on speech in a digital format to determine speech element information from said speech;
speech prosody determination means operable on said speech to determine prosody information from said speech;
encoding means to encode speech information comprising said determined speech element information, said determined speech prosody information and timing information relating thereto in a digital form;
decoding means to decode said encoded speech information to obtain said determined speech element information, said determined speech prosody information and said timing information;
comparison means to compare said determined speech element information and said determined speech prosody information to a database of speech elements to select speech sound elements which correspond thereto; and speech generating means to assemble said selected speech sound elements to construct a facsimile of said speech.
2. The digital speech system of claim 1 further comprising:
speech acquisition means to acquire an analog electronic representation of said speech; and digitization means to convert said analog representation of said speech to said digital format.
speech acquisition means to acquire an analog electronic representation of said speech; and digitization means to convert said analog representation of said speech to said digital format.
The digital speech system of claim 1 further comprising:
an output means to produce an output of said facsimile in a manner audible to a user.
an output means to produce an output of said facsimile in a manner audible to a user.
4. The digital speech system of claim 1 wherein said comparison means further comprises recognition means to identify undesired speech characteristics, said selection of speech sound elements being performed to reduce the presence of said identified undesired speech characteristics in said facsimile.
5. The digital speech system of claim 1 further comprising search means operable to receive an input representing a word or phrase of interest and to examine said determined speech element information to locate occurrences of said word or phrase therein.
6. A method of acquiring and constructing digital speech, comprising the steps of:
(i) examining digitized speech to determine speech element information relating to said speech;
(ii) examining said digitized speech to determine prosody information relating to said speech;
(iii) encoding speech information corresponding to said determined speech element information, said determined prosody information and timing information relating thereto;
(iv) receiving and decoding said encoded speech information to obtain said timing information, said determined speech element information and said determined prosody information;
(v) comparing said decoded speech element information and prosody information to a database to select corresponding speech sound elements; and (vi) assembling said selected speech sound elements to construct a facsimile of said speech.
(i) examining digitized speech to determine speech element information relating to said speech;
(ii) examining said digitized speech to determine prosody information relating to said speech;
(iii) encoding speech information corresponding to said determined speech element information, said determined prosody information and timing information relating thereto;
(iv) receiving and decoding said encoded speech information to obtain said timing information, said determined speech element information and said determined prosody information;
(v) comparing said decoded speech element information and prosody information to a database to select corresponding speech sound elements; and (vi) assembling said selected speech sound elements to construct a facsimile of said speech.
7. The method of claim 6 further comprising the steps of:
acquiring an electronic representation of speech in an analog form; and digitizing said analog representation of speech to obtain said digitized speech for step (i).
acquiring an electronic representation of speech in an analog form; and digitizing said analog representation of speech to obtain said digitized speech for step (i).
8. The method of claim 6 wherein step (v) further comprises comparing said decoded speech element information to a predefined database of undesired speech characteristics and selecting speech sound elements which reduce said undesired speech characteristics is said facsimile.
9. The method of claim 6 further comprising the step of receiving from a user a word or phrase of interest and search said determined speech element information and said determined prosody information to locate occurrences of said received word or phrase and to identify said locations to said user.
Priority Applications (4)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CA002251502A CA2251502A1 (en) | 1998-10-26 | 1998-10-26 | Digital speech acquisition, transmission, storage and search system and method |
JP33648199A JP2001154686A (en) | 1998-10-26 | 1999-11-26 | Digital audio system, method of acquisition and construction of digital audio signal |
EP99203979A EP1103954A1 (en) | 1998-10-26 | 1999-11-26 | Digital speech acquisition, transmission, storage and search system and method |
AU61769/99A AU6176999A (en) | 1998-10-26 | 1999-11-29 | digital speech acquisition, transmission, storage and search system and method |
Applications Claiming Priority (4)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CA002251502A CA2251502A1 (en) | 1998-10-26 | 1998-10-26 | Digital speech acquisition, transmission, storage and search system and method |
JP33648199A JP2001154686A (en) | 1998-10-26 | 1999-11-26 | Digital audio system, method of acquisition and construction of digital audio signal |
EP99203979A EP1103954A1 (en) | 1998-10-26 | 1999-11-26 | Digital speech acquisition, transmission, storage and search system and method |
AU61769/99A AU6176999A (en) | 1998-10-26 | 1999-11-29 | digital speech acquisition, transmission, storage and search system and method |
Publications (1)
Publication Number | Publication Date |
---|---|
CA2251502A1 true CA2251502A1 (en) | 2000-04-26 |
Family
ID=31999270
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CA002251502A Abandoned CA2251502A1 (en) | 1998-10-26 | 1998-10-26 | Digital speech acquisition, transmission, storage and search system and method |
Country Status (4)
Country | Link |
---|---|
EP (1) | EP1103954A1 (en) |
JP (1) | JP2001154686A (en) |
AU (1) | AU6176999A (en) |
CA (1) | CA2251502A1 (en) |
Family Cites Families (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2986345B2 (en) * | 1993-10-18 | 1999-12-06 | インターナショナル・ビジネス・マシーンズ・コーポレイション | Voice recording indexing apparatus and method |
US5933805A (en) * | 1996-12-13 | 1999-08-03 | Intel Corporation | Retaining prosody during speech analysis for later playback |
GB2332841B (en) * | 1997-12-24 | 2002-10-30 | Motorola Ltd | Speech communication systems |
-
1998
- 1998-10-26 CA CA002251502A patent/CA2251502A1/en not_active Abandoned
-
1999
- 1999-11-26 EP EP99203979A patent/EP1103954A1/en not_active Withdrawn
- 1999-11-26 JP JP33648199A patent/JP2001154686A/en active Pending
- 1999-11-29 AU AU61769/99A patent/AU6176999A/en not_active Abandoned
Also Published As
Publication number | Publication date |
---|---|
EP1103954A1 (en) | 2001-05-30 |
AU6176999A (en) | 2001-05-31 |
JP2001154686A (en) | 2001-06-08 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US5963892A (en) | Translation apparatus and method for facilitating speech input operation and obtaining correct translation thereof | |
CA2366057C (en) | Database annotation and retrieval | |
US4975957A (en) | Character voice communication system | |
CN1742321B (en) | Prosodic mimic method and apparatus | |
JP2954588B2 (en) | Audio encoding device, decoding device, and encoding / decoding system | |
CN101447187A (en) | Apparatus and method for recognizing speech | |
US5673364A (en) | System and method for compression and decompression of audio signals | |
US6611797B1 (en) | Speech coding/decoding method and apparatus | |
JP2003036097A (en) | Device and method for detecting and retrieving information | |
US6269332B1 (en) | Method of encoding a speech signal | |
US5933802A (en) | Speech reproducing system with efficient speech-rate converter | |
CA2251502A1 (en) | Digital speech acquisition, transmission, storage and search system and method | |
US5970454A (en) | Synthesizing speech by converting phonemes to digital waveforms | |
US5987412A (en) | Synthesising speech by converting phonemes to digital waveforms | |
US7346508B2 (en) | Information retrieving method and apparatus | |
AU709376B2 (en) | Automatic speech recognition | |
US5774856A (en) | User-Customized, low bit-rate speech vocoding method and communication unit for use therewith | |
WO2000057401A1 (en) | Computation and quantization of voiced excitation pulse shapes in linear predictive coding of speech | |
AU674246B2 (en) | Synthesising speech by converting phonemes to digital waveforms | |
US6134519A (en) | Voice encoder for generating natural background noise | |
JP3431655B2 (en) | Encoding device and decoding device | |
CN1629933B (en) | Device, method and converter for speech synthesis | |
KR100827074B1 (en) | Apparatus and method for automatic dialling in a mobile portable telephone | |
JPH09179593A (en) | Speech encoding device | |
JPH064598A (en) | Information storage retriever |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
FZDE | Discontinued |