GB2258936A - Voice recognition apparatus - Google Patents
- Publication number
- GB2258936A
- Authority
- GB
- United Kingdom
- Prior art keywords
- voice
- pattern data
- pattern
- recognition apparatus
- recognition
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/06—Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04M—TELEPHONIC COMMUNICATION
- H04M1/00—Substation equipment, e.g. for use by subscribers
- H04M1/26—Devices for calling a subscriber
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04M—TELEPHONIC COMMUNICATION
- H04M1/00—Substation equipment, e.g. for use by subscribers
- H04M1/26—Devices for calling a subscriber
- H04M1/27—Devices whereby a plurality of signals may be stored simultaneously
- H04M1/271—Devices whereby a plurality of signals may be stored simultaneously controlled by voice recognition
Landscapes
- Engineering & Computer Science (AREA)
- Human Computer Interaction (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Signal Processing (AREA)
- Artificial Intelligence (AREA)
- Telephonic Communication Services (AREA)
- Telephone Function (AREA)
Description
VOICE RECOGNITION APPARATUS
The present invention relates to voice recognition apparatus and to such apparatus when combined with a telephone set.
In recent years, telephone sets have come into practical use which recognise a spoken subscriber name and dial the telephone number stored in advance for that name.
It is an object of the present invention to provide voice recognition apparatus which is improved over such previously known apparatus.
The present invention provides voice recognition apparatus for recognising voice by comparing a voice with reference to standard voice pattern data previously produced from voice pattern data corresponding to voices with a same meaning spoken by a plurality of persons and registered therein and outputting a confirmation voice corresponding to the standard voice pattern data, wherein the apparatus comprises voice recorder means for recording a voice either associated with the registered voice pattern data or independently thereof as a confirmation voice to be output corresponding to the result of recognition of the standard voice pattern data by input of the voices.
In order that the invention may be more readily understood, it will now be described, by way of example, with reference to the accompanying drawings, in which: Figure 1 is a block diagram showing the configuration of voice recognition apparatus according to the present invention; and Figure 2 is a plan view showing the exterior of a telephone set equipped with the voice recognition apparatus shown in Figure 1.
Referring now to the drawings, and more particularly to Fig. 1, there is shown an arrangement of the voice recognition apparatus according to the invention which is applied to a telephone set for automatically dialing a telephone number corresponding to a result of recognition of a voice by which personal names, corporation names, etc. desired to be called are input.
In Fig. 1, a switch 21 selects one of four mode states of the voice recognition apparatus in this embodiment.
When contact b of the switch 21 is closed, the input voice pattern is stored and the voice from which the voice pattern is derived is recorded at the same time (Mode 2). When contact c is closed, only the input voice pattern is stored and the corresponding voice is not recorded (Mode 3). When contact a is closed, only the voice is recorded and no voice pattern is stored (Mode 1). When contact d is closed, the apparatus enters the voice recognition state (Mode 4).
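The four switch positions can be summarised as a small lookup table. The following is only an illustrative Python sketch: the names `Mode`, `ModeBehavior`, `BEHAVIOR` and `mode_for_contact` are hypothetical and do not appear in the specification, and the flags simply restate which memories are written in each mode as described in the text.

```python
from enum import Enum
from typing import NamedTuple


class ModeBehavior(NamedTuple):
    store_pattern: bool  # write a voice pattern to the voice pattern memory 26
    record_voice: bool   # write the voice itself to the voice memory 24
    recognize: bool      # perform recognition instead of registration


class Mode(Enum):
    # Each value is the contact of switch 21 that selects the mode.
    MODE_1 = "a"
    MODE_2 = "b"
    MODE_3 = "c"
    MODE_4 = "d"


# Hypothetical table restating the behaviour described for each mode.
BEHAVIOR = {
    Mode.MODE_1: ModeBehavior(store_pattern=False, record_voice=True, recognize=False),
    Mode.MODE_2: ModeBehavior(store_pattern=True, record_voice=True, recognize=False),
    Mode.MODE_3: ModeBehavior(store_pattern=True, record_voice=False, recognize=False),
    Mode.MODE_4: ModeBehavior(store_pattern=False, record_voice=False, recognize=True),
}


def mode_for_contact(contact: str) -> Mode:
    """Map a closed contact ('a' to 'd') to its mode; raises ValueError otherwise."""
    return Mode(contact)
```

For example, `BEHAVIOR[mode_for_contact("c")]` reports that a pattern is stored while no voice is recorded, which corresponds to Mode 3.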
Next, the state where the contact b of the switch 21 is closed will be explained in detail.
When a voice is input through the voice input portion 22, the input voice is applied to a voice section detector 23 and, at the same time, is stored or recorded in a voice memory 24.
A threshold value for the level of the input voice is set in the voice section detector 23. The voice section detector 23 detects a voice section by discriminating it from silence and noise according to the threshold value.
The voice memory 24 selectively stores only the voice signal of the voice section, based on data on the voice section detected by the voice section detector 23. This extracted voice signal of the voice section is stored in the voice memory 24 under control of the detector 23, in correspondence with the voice pattern obtained by the sound analyzer 25.
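Threshold-based detection of a voice section, followed by storing only that section, might be sketched as follows. This is a minimal illustration under stated assumptions, not the detector of the embodiment: the input is assumed to be one level value per frame, a frame counts as voice when its level is at or above the threshold, and the function names `detect_voice_section` and `extract_voice_section` are hypothetical.

```python
from typing import List, Optional, Sequence, Tuple


def detect_voice_section(levels: Sequence[float],
                         threshold: float) -> Optional[Tuple[int, int]]:
    """Return (start, end) frame indices, end exclusive, spanning the first
    to the last frame whose level reaches the threshold, or None if no
    frame does."""
    active = [i for i, level in enumerate(levels) if level >= threshold]
    if not active:
        return None
    return active[0], active[-1] + 1


def extract_voice_section(frames: Sequence[str],
                          levels: Sequence[float],
                          threshold: float) -> List[str]:
    """Keep only the frames inside the detected voice section (assumption:
    one level value per frame)."""
    section = detect_voice_section(levels, threshold)
    if section is None:
        return []
    start, end = section
    return list(frames[start:end])
```

Note that a short dip below the threshold inside the utterance stays part of the section in this sketch, because only the first and last active frames define its boundaries.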
More particularly, the detector 23 also gates the detected voice section to the sound analyzer 25. The sound analyzer 25 derives a voice characteristic parameter sequence by filtering the detected voice section, and this sequence is stored in the voice pattern memory 26 as a standard voice pattern.
After the standard voice pattern has been stored in the voice pattern memory 26 as described above, and Mode 4 is selected for voice recognition, the voice section detector 23 detects the voice section of a voice input through the voice input portion 22. The voice recognizer 27 calculates the similarity between the voice pattern obtained by the sound analyzer 25 for the detected voice section and each standard pattern stored in the voice pattern memory 26, and the result of recognition for the input voice is obtained by comparing the similarity values with one another.
Besides the similarity calculation, any previously proposed voice recognition algorithm may be adopted as appropriate as the recognition method used in the voice recognizer 27.
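The recognition step, in which a similarity is computed against every stored standard pattern and the values are compared, can be illustrated with a short Python sketch. The specification does not state which similarity measure is used; cosine similarity over fixed-length feature vectors is an assumption made here purely for illustration, and `cosine_similarity` and `recognize` are hypothetical names.

```python
import math
from typing import Dict, Optional, Sequence, Tuple


def cosine_similarity(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine of the angle between two equal-length feature vectors
    (an assumed measure; 0.0 is returned if either vector is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def recognize(input_pattern: Sequence[float],
              standard_patterns: Dict[str, Sequence[float]]
              ) -> Optional[Tuple[str, float]]:
    """Compare the input pattern with every stored standard pattern and
    return the label with the highest similarity, or None if none is stored."""
    best: Optional[Tuple[str, float]] = None
    for label, pattern in standard_patterns.items():
        score = cosine_similarity(input_pattern, pattern)
        if best is None or score > best[1]:
            best = (label, score)
    return best
```

With two stored patterns labelled "Tanaka" and a hypothetical second name, an input vector close to the first pattern would be reported as "Tanaka". A practical recognizer would also need to handle utterances of different lengths, which this sketch deliberately ignores.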
The voice signal in the voice memory 24 corresponding to the result of recognition obtained as described above is given to the voice reproducer 28, and a voice for confirming the result of recognition is reproduced and output.
Next, a case where the contact c of the switch 21 is closed will be explained in detail.
In this case, a voice input through the voice input portion 22 is given to the voice section detector 23 only and is not given to the voice memory 24; therefore, no voice signal is stored in the voice memory 24. The voice applied to the voice section detector 23 is processed in the same manner as in the case where the contact b of the switch 21 is closed, and the voice pattern is stored in the voice pattern memory 26.
Lastly, a case where the contact a is closed will be explained in detail.
In this case, a voice input through the voice input portion 22 is applied only to the voice memory 24 and not to the voice section detector 23; therefore, no voice pattern is stored in the voice pattern memory 26.
The voice memory 24 stores all sounds (including noise, silent sections, etc.) input through the voice input portion 22 for the entire period of the voice storage operation, without reference to detection of the input voice section. As no voice section detection is performed, it becomes possible to output a voice that includes silent sections, such as a sentence, as a response confirming the result of voice recognition. Here, a voice-to-voice-pattern correspondence designator 29 designates which voice pattern stored in the voice pattern memory 26 a voice stored in the voice memory 24 corresponds to.
Thus, by designating the correspondence of the voice to the voice pattern, it is possible to output a verbal confirmation other than the "word" as a response giving the result of recognition.
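The role of the correspondence designator 29 can be pictured as a mapping from stored voice patterns to independently recorded voices. The sketch below is a toy model under assumptions: voices and patterns are identified by string ids, audio is represented as `bytes`, and the class and method names are hypothetical rather than taken from the specification.

```python
from typing import Dict, Optional


class CorrespondenceDesignator:
    """Toy model linking recorded voices to stored voice patterns."""

    def __init__(self) -> None:
        self.voice_memory: Dict[str, bytes] = {}     # voice id -> recorded audio
        self.voice_for_pattern: Dict[str, str] = {}  # pattern id -> voice id

    def record_voice(self, voice_id: str, audio: bytes) -> None:
        """Record a voice without storing any pattern (as in Mode 1)."""
        self.voice_memory[voice_id] = audio

    def designate(self, voice_id: str, pattern_id: str) -> None:
        """Declare that a recorded voice confirms the given pattern."""
        if voice_id not in self.voice_memory:
            raise KeyError(f"no recorded voice with id {voice_id!r}")
        self.voice_for_pattern[pattern_id] = voice_id

    def confirmation_for(self, pattern_id: str) -> Optional[bytes]:
        """Return the audio to reproduce when the pattern is recognised,
        or None if no voice has been designated for it."""
        voice_id = self.voice_for_pattern.get(pattern_id)
        if voice_id is None:
            return None
        return self.voice_memory[voice_id]
```

Because the recorded audio is independent of the pattern, the confirmation can be a whole sentence rather than the recognised word itself.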
Fig. 2 shows a telephone set equipped with the voice recognition apparatus shown in Fig. 1.
In Fig. 2, numeral 30 designates a telephone casing, 31 a handset, and 32 a dial keyboard provided on the surface of the telephone casing 30. Further, numeral 33 designates a mode selector switch including four pushbuttons corresponding to the switch 21 in Fig. 1. These four pushbutton switches are structured such that two or more buttons cannot be pushed simultaneously. Numeral 34 designates a key which, together with the dial keyboard 32, designates the correspondence of a voice with a voice pattern, and 35 designates a function selector key used for various telephone services.
Next explained is a case where two users (A and B) each store a voice pattern of a voice pronouncing the personal name "Tanaka" in the voice recognition apparatus, and the apparatus produces a response as a result of recognition by means of the voice of "Tanaka" registered by the user.
When the user A is to register his voice pattern, he sets the mode selector switch 33 to Mode 2, speaks "Tanaka" and registers his voice pattern and the voice. Then, assume that the user B is to register his voice pattern: he sets the mode selector switch 33 to Mode 3, speaks "Tanaka" and registers his voice pattern. At this time, the voice "Tanaka" spoken by the user B will not be recorded.
Thus, in this example, when the voice patterns of two users are registered, the verbal confirmation for the result of recognition is always output in the voice of the user A, whose voice and voice pattern were entered and stored in Mode 2.
Further, if the user B registers his voice pattern in Mode 2, the voice of the user B overwrites the voice of the user A and the verbal confirmation for the result of recognition is output in the voice of the user B.
When it is desired to provide a confirmation response to the users A and B in the voice of another user C, it is necessary to select Mode 1 and register the voice of the user C.
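The registration scenario with users A, B and C can be replayed with a few lines of Python. This is a simplified sketch under assumptions: the class `Registry` and its methods are hypothetical, each name keeps a single confirmation voice that later recordings overwrite, and a Mode 1 recording is assumed to have already been linked to the name by the correspondence designator.

```python
from typing import Dict, Optional, Set


class Registry:
    """Toy model of which users' patterns and whose voice are held per name."""

    def __init__(self) -> None:
        self.pattern_owners: Dict[str, Set[str]] = {}  # name -> users with a stored pattern
        self.confirmation_voice: Dict[str, str] = {}   # name -> user whose voice is recorded

    def register(self, mode: int, name: str, user: str) -> None:
        if mode not in (1, 2, 3):
            raise ValueError("registration uses Mode 1, 2 or 3")
        if mode in (2, 3):
            # Modes 2 and 3 store the user's voice pattern.
            self.pattern_owners.setdefault(name, set()).add(user)
        if mode in (1, 2):
            # Modes 1 and 2 record the voice, overwriting any earlier recording.
            self.confirmation_voice[name] = user

    def confirmation_speaker(self, name: str) -> Optional[str]:
        """User whose recorded voice is reproduced when the name is recognised."""
        return self.confirmation_voice.get(name)
```

Registering A in Mode 2 and B in Mode 3 leaves A as the confirmation speaker; B in Mode 2 then replaces A; and C in Mode 1 replaces B without adding a pattern for C.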
Thus, according to the voice recognition apparatus in this embodiment, whether a voice spoken when storing the voice pattern is to be recorded can be set by the mode selector switch 33, and whose voice is to be used to output the verbal confirmation for the result of recognition can also be decided optionally. In addition, it is possible to return a response by a "word" other than the "word" to be recognized, by recording a confirming response voice independently of storage of a voice pattern.
As described above, according to the present invention, merely by designating a desired name when it is reproduced, on hearing the voice recorded for that name, it is possible to perform prescribed processing such as deletion of the voice and the voice patterns corresponding to the name, and to reduce significantly the burden on users.
In addition, it is also possible to set and output any voice as a confirming response voice for the result of recognition, as desired.
As described above, the present invention can provide an extremely preferable voice recognition apparatus and a telephone set equipped with the apparatus.
Claims (4)
1. Voice recognition apparatus for recognising voice by comparing a voice with reference to standard voice pattern data previously produced from voice pattern data corresponding to voices with a same meaning spoken by a plurality of persons and registered therein and outputting a confirmation voice corresponding to the standard voice pattern data, wherein the apparatus comprises voice recorder means for recording a voice either associated with the registered voice pattern data or independently thereof as a confirmation voice to be output corresponding to the result of recognition of the standard voice pattern data by input of the voices.
2. Voice recognition apparatus as claimed in claim 1, which apparatus additionally comprises switching means for turning ON or OFF the recording of the verbal confirmation as occasion demands.
3. Voice recognition apparatus as claimed in claim 1 or 2, which apparatus responds in a prescribed manner in response to voice pattern data.
4. Voice recognition apparatus as claimed in any preceding claim, wherein the apparatus is in combination with a telephone set.
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP63330905A JP2793213B2 (en) | 1988-12-29 | 1988-12-29 | Speech recognition device and telephone using the same |
Publications (3)
Publication Number | Publication Date |
---|---|
GB9221767D0 GB9221767D0 (en) | 1992-12-02 |
GB2258936A true GB2258936A (en) | 1993-02-24 |
GB2258936B GB2258936B (en) | 1993-07-21 |
Family
ID=18237804
Family Applications (2)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
GB8929267A Expired - Fee Related GB2226675B (en) | 1988-12-29 | 1989-12-28 | Voice recognition apparatus |
GB9221767A Expired - Fee Related GB2258936B (en) | 1988-12-29 | 1992-10-15 | Voice recognition apparatus |
Country Status (3)
Country | Link |
---|---|
JP (1) | JP2793213B2 (en) |
KR (1) | KR930005223B1 (en) |
GB (2) | GB2226675B (en) |
Families Citing this family (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
GB2307137B (en) * | 1995-11-04 | 2000-03-22 | Motorola Ltd | A communications addressing network and terminal therefor |
KR100229864B1 (en) * | 1996-12-27 | 1999-11-15 | 윤종용 | Method for recognizing recoder in voice mail system |
GB9806401D0 (en) * | 1998-03-25 | 1998-05-20 | Domain Dynamics Ltd | Improvements in voice operated mobile communications |
KR100378439B1 (en) * | 2000-12-14 | 2003-03-29 | 주식회사 티엘아이 | Telephone capable of rejecting a call demand and method using the same |
JP4240807B2 (en) * | 2000-12-25 | 2009-03-18 | 日本電気株式会社 | Mobile communication terminal device, voice recognition method, and recording medium recording the program |
Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
GB974850A (en) * | 1963-06-12 | 1964-11-11 | Standard Telephones Cables Ltd | Speech recognition system |
GB1055371A (en) * | 1964-03-06 | 1967-01-18 | Standard Telephones Cables Ltd | Apparatus for the recognition of speech |
- 1988
  - 1988-12-29 JP JP63330905A patent/JP2793213B2/en not_active Expired - Lifetime
- 1989
  - 1989-12-28 GB GB8929267A patent/GB2226675B/en not_active Expired - Fee Related
  - 1989-12-29 KR KR1019890020068A patent/KR930005223B1/en not_active IP Right Cessation
- 1992
  - 1992-10-15 GB GB9221767A patent/GB2258936B/en not_active Expired - Fee Related
Cited By (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP0627727A1 (en) * | 1993-06-02 | 1994-12-07 | Telia Ab | Process for evaluating speech quality in speech synthesis |
US5664050A (en) * | 1993-06-02 | 1997-09-02 | Telia Ab | Process for evaluating speech quality in speech synthesis |
WO1997032430A1 (en) * | 1996-02-29 | 1997-09-04 | British Telecommunications Public Limited Company | Telecommunications system |
US6044147A (en) * | 1996-05-16 | 2000-03-28 | British Telecommunications Public Limited Company | Telecommunications system |
Also Published As
Publication number | Publication date |
---|---|
GB9221767D0 (en) | 1992-12-02 |
GB2226675A (en) | 1990-07-04 |
GB8929267D0 (en) | 1990-02-28 |
GB2258936B (en) | 1993-07-21 |
KR930005223B1 (en) | 1993-06-16 |
KR900010649A (en) | 1990-07-09 |
JPH02178698A (en) | 1990-07-11 |
GB2226675B (en) | 1993-07-21 |
JP2793213B2 (en) | 1998-09-03 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US5007081A (en) | Speech activated telephone | |
CA1294079C (en) | Voice controlled dialer having memories for full-digit dialing for any users and abbreviated dialing for authorized users | |
US5960393A (en) | User selectable multiple threshold criteria for voice recognition | |
US4624008A (en) | Apparatus for automatic speech recognition | |
EP0307137B1 (en) | Multiple language telephone answering machine | |
GB2258936A (en) | Voice recognition apparatus | |
US5499318A (en) | Method and apparatus for access control based on an audible uttering and timing of the audible uttering | |
JPS6126079B2 (en) | ||
WO1990008439A2 (en) | A speech processing apparatus and method therefor | |
EP1315146A2 (en) | Method and apparatus for improving access to numerical information in voice messages | |
JP3592415B2 (en) | Speaker recognition system | |
JPS6132679B2 (en) | ||
JP2656234B2 (en) | Conversation voice understanding method | |
JPH0432900A (en) | Sound recognizing device | |
JP2563624B2 (en) | Answering machine | |
JPH01114898A (en) | Data searcher | |
JPH02184900A (en) | Voice dial device | |
KR100395222B1 (en) | Voice Recognition System for Voice Mail Service (VMS) | |
JPH03173248A (en) | Voice dialing device | |
KR950009425B1 (en) | The phonetic dialing phone | |
JPH02136898A (en) | Voice dialing device | |
JPH09244684A (en) | Person authentication device | |
JPH1063295A (en) | Word voice recognition method for automatically correcting recognition result and device for executing the method | |
KR0154936B1 (en) | Door vision system | |
JPH01284197A (en) | Push-button dial signal detecting system |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
| | PCNP | Patent ceased through non-payment of renewal fee | Effective date: 19981228 |