CA2251502A1 - Digital speech acquisition, transmission, storage and search system and method - Google Patents

Digital speech acquisition, transmission, storage and search system and method Download PDF

Info

Publication number
CA2251502A1
CA2251502A1 CA002251502A CA2251502A CA2251502A1 CA 2251502 A1 CA2251502 A1 CA 2251502A1 CA 002251502 A CA002251502 A CA 002251502A CA 2251502 A CA2251502 A CA 2251502A CA 2251502 A1 CA2251502 A1 CA 2251502A1
Authority
CA
Canada
Prior art keywords
speech
information
determined
prosody
digital
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Abandoned
Application number
CA002251502A
Other languages
French (fr)
Inventor
Marc Lutz
Mark Vange
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
LANCASTER EQUITIES Ltd
Original Assignee
LANCASTER EQUITIES Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by LANCASTER EQUITIES Ltd filed Critical LANCASTER EQUITIES Ltd
Priority to CA002251502A priority Critical patent/CA2251502A1/en
Priority to JP33648199A priority patent/JP2001154686A/en
Priority to EP99203979A priority patent/EP1103954A1/en
Priority to AU61769/99A priority patent/AU6176999A/en
Publication of CA2251502A1 publication Critical patent/CA2251502A1/en
Abandoned legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/0018Speech coding using phonetic or linguistical decoding of the source; Reconstruction using text-to-speech synthesis
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L13/00Speech synthesis; Text to speech systems
    • G10L13/02Methods for producing synthetic speech; Speech synthesisers
    • G10L13/04Details of speech synthesis systems, e.g. synthesiser structure or memory management

Abstract

A digital speech system processes acquired speech to determine speech element information, such as the phonemes in the speech, and associated the prosody information. This determined information is then encoded for transmission and/or storage.
The encoded information is decoded to recover the speech element information and the prosody information which can be provided to a speech generator to construct a facsimile of the original speech. Also, the speech element and prosody information can be searched to locate words or phrases of interest in the speech.

Claims (9)

We claim:
1. A digital speech system, comprising:
speech element determination means operable on speech in a digital format to determine speech element information from said speech;
speech prosody determination means operable on said speech to determine prosody information from said speech;
encoding means to encode speech information comprising said determined speech element information, said determined speech prosody information and timing information relating thereto in a digital form;
decoding means to decode said encoded speech information to obtain said determined speech element information, said determined speech prosody information and said timing information;
comparison means to compare said determined speech element information and said determined speech prosody information to a database of speech elements to select speech sound elements which correspond thereto; and speech generating means to assemble said selected speech sound elements to construct a facsimile of said speech.
2. The digital speech system of claim 1 further comprising:
speech acquisition means to acquire an analog electronic representation of said speech; and digitization means to convert said analog representation of said speech to said digital format.
The digital speech system of claim 1 further comprising:
an output means to produce an output of said facsimile in a manner audible to a user.
4. The digital speech system of claim 1 wherein said comparison means further comprises recognition means to identify undesired speech characteristics, said selection of speech sound elements being performed to reduce the presence of said identified undesired speech characteristics in said facsimile.
5. The digital speech system of claim 1 further comprising search means operable to receive an input representing a word or phrase of interest and to examine said determined speech element information to locate occurrences of said word or phrase therein.
6. A method of acquiring and constructing digital speech, comprising the steps of:
(i) examining digitized speech to determine speech element information relating to said speech;
(ii) examining said digitized speech to determine prosody information relating to said speech;
(iii) encoding speech information corresponding to said determined speech element information, said determined prosody information and timing information relating thereto;
(iv) receiving and decoding said encoded speech information to obtain said timing information, said determined speech element information and said determined prosody information;
(v) comparing said decoded speech element information and prosody information to a database to select corresponding speech sound elements; and (vi) assembling said selected speech sound elements to construct a facsimile of said speech.
7. The method of claim 6 further comprising the steps of:
acquiring an electronic representation of speech in an analog form; and digitizing said analog representation of speech to obtain said digitized speech for step (i).
8. The method of claim 6 wherein step (v) further comprises comparing said decoded speech element information to a predefined database of undesired speech characteristics and selecting speech sound elements which reduce said undesired speech characteristics is said facsimile.
9. The method of claim 6 further comprising the step of receiving from a user a word or phrase of interest and search said determined speech element information and said determined prosody information to locate occurrences of said received word or phrase and to identify said locations to said user.
CA002251502A 1998-10-26 1998-10-26 Digital speech acquisition, transmission, storage and search system and method Abandoned CA2251502A1 (en)

Priority Applications (4)

Application Number Priority Date Filing Date Title
CA002251502A CA2251502A1 (en) 1998-10-26 1998-10-26 Digital speech acquisition, transmission, storage and search system and method
JP33648199A JP2001154686A (en) 1998-10-26 1999-11-26 Digital audio system, method of acquisition and construction of digital audio signal
EP99203979A EP1103954A1 (en) 1998-10-26 1999-11-26 Digital speech acquisition, transmission, storage and search system and method
AU61769/99A AU6176999A (en) 1998-10-26 1999-11-29 digital speech acquisition, transmission, storage and search system and method

Applications Claiming Priority (4)

Application Number Priority Date Filing Date Title
CA002251502A CA2251502A1 (en) 1998-10-26 1998-10-26 Digital speech acquisition, transmission, storage and search system and method
JP33648199A JP2001154686A (en) 1998-10-26 1999-11-26 Digital audio system, method of acquisition and construction of digital audio signal
EP99203979A EP1103954A1 (en) 1998-10-26 1999-11-26 Digital speech acquisition, transmission, storage and search system and method
AU61769/99A AU6176999A (en) 1998-10-26 1999-11-29 digital speech acquisition, transmission, storage and search system and method

Publications (1)

Publication Number Publication Date
CA2251502A1 true CA2251502A1 (en) 2000-04-26

Family

ID=31999270

Family Applications (1)

Application Number Title Priority Date Filing Date
CA002251502A Abandoned CA2251502A1 (en) 1998-10-26 1998-10-26 Digital speech acquisition, transmission, storage and search system and method

Country Status (4)

Country Link
EP (1) EP1103954A1 (en)
JP (1) JP2001154686A (en)
AU (1) AU6176999A (en)
CA (1) CA2251502A1 (en)

Family Cites Families (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2986345B2 (en) * 1993-10-18 1999-12-06 インターナショナル・ビジネス・マシーンズ・コーポレイション Voice recording indexing apparatus and method
US5933805A (en) * 1996-12-13 1999-08-03 Intel Corporation Retaining prosody during speech analysis for later playback
GB2332841B (en) * 1997-12-24 2002-10-30 Motorola Ltd Speech communication systems

Also Published As

Publication number Publication date
EP1103954A1 (en) 2001-05-30
AU6176999A (en) 2001-05-31
JP2001154686A (en) 2001-06-08

Similar Documents

Publication Publication Date Title
US5963892A (en) Translation apparatus and method for facilitating speech input operation and obtaining correct translation thereof
CA2366057C (en) Database annotation and retrieval
US4975957A (en) Character voice communication system
CN1742321B (en) Prosodic mimic method and apparatus
JP2954588B2 (en) Audio encoding device, decoding device, and encoding / decoding system
CN101447187A (en) Apparatus and method for recognizing speech
US5673364A (en) System and method for compression and decompression of audio signals
US6611797B1 (en) Speech coding/decoding method and apparatus
JP2003036097A (en) Device and method for detecting and retrieving information
US6269332B1 (en) Method of encoding a speech signal
US5933802A (en) Speech reproducing system with efficient speech-rate converter
CA2251502A1 (en) Digital speech acquisition, transmission, storage and search system and method
US5970454A (en) Synthesizing speech by converting phonemes to digital waveforms
US5987412A (en) Synthesising speech by converting phonemes to digital waveforms
US7346508B2 (en) Information retrieving method and apparatus
AU709376B2 (en) Automatic speech recognition
US5774856A (en) User-Customized, low bit-rate speech vocoding method and communication unit for use therewith
WO2000057401A1 (en) Computation and quantization of voiced excitation pulse shapes in linear predictive coding of speech
AU674246B2 (en) Synthesising speech by converting phonemes to digital waveforms
US6134519A (en) Voice encoder for generating natural background noise
JP3431655B2 (en) Encoding device and decoding device
CN1629933B (en) Device, method and converter for speech synthesis
KR100827074B1 (en) Apparatus and method for automatic dialling in a mobile portable telephone
JPH09179593A (en) Speech encoding device
JPH064598A (en) Information storage retriever

Legal Events

Date Code Title Description
FZDE Discontinued