CA2149012C - Voice activated telephone set - Google Patents

Voice activated telephone set

Info

Publication number
CA2149012C
CA2149012C CA002149012A CA2149012A CA2149012C CA 2149012 C CA2149012 C CA 2149012C CA 002149012 A CA002149012 A CA 002149012A CA 2149012 A CA2149012 A CA 2149012A CA 2149012 C CA2149012 C CA 2149012C
Authority
CA
Canada
Prior art keywords
user
commands
identification
group
decompressed
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Lifetime
Application number
CA002149012A
Other languages
French (fr)
Other versions
CA2149012A1 (en
Inventor
Peter A. Tinsley
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Mitel Networks Corp
Original Assignee
Mitel Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Mitel Corp filed Critical Mitel Corp
Priority to CA002149012A priority Critical patent/CA2149012C/en
Publication of CA2149012A1 publication Critical patent/CA2149012A1/en
Application granted granted Critical
Publication of CA2149012C publication Critical patent/CA2149012C/en
Anticipated expiration legal-status Critical
Expired - Lifetime legal-status Critical Current

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M3/00Automatic or semi-automatic exchanges
    • H04M3/42Systems providing special services or facilities to subscribers
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M3/00Automatic or semi-automatic exchanges
    • H04M3/42Systems providing special services or facilities to subscribers
    • H04M3/42204Arrangements at the exchange for service or number selection by voice
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M2201/00Electronic components, circuits, software, systems or apparatus used in telephone systems
    • H04M2201/40Electronic components, circuits, software, systems or apparatus used in telephone systems using speech recognition

Landscapes

  • Engineering & Computer Science (AREA)
  • Signal Processing (AREA)
  • Telephonic Communication Services (AREA)
  • Mobile Radio Communication Systems (AREA)
  • Exchange Systems With Centralized Control (AREA)

Abstract

A method of operating a central call control apparatus by plural users is comprised of storing an identification of each user in a directory, storing in compressed form a plurality of predetermined commands spoken by the users, grouped by user, each group with a pointer to a corresponding identification in the directory, indicating an identification in the directory of a service demanding user wishing to initiate a command, upon receipt of a service demanding signal, retrieving a corresponding group of commands linked by the pointer to the identification, and decompressing and storing the group of commands, receiving spoken command words from the service demanding user, comparing the spoken command words with the decompressed group of commands to find decompressed command words, and invoking commands for which command words are found corresponding to the decompressed commands in the decompressed group of commands.

Description

2~l~9~12 This is a divisional application of Canadian serial number 2,058,644 filed December 31, l991.
FIELD OF THE INVENTION:
This invention relates to the field of voice activated systems, and in particular to voice activation of call control systems such as telephone switching equipment.
BACKGROUND TO THE INVENTION:
Voice recognition systems which could be used to activate switching equipment or to operate station equipment such as computers and telephone sets have been known for some time. One technique to enable operation of the equipment is to perform an analysis of the words spoken, and to compare the results of the analysis with a standardized database stored in mass storage memory.
In another technique, a learning process is used, in which successive correct comparison (hits) reinforce the detection of common sounds of various users which are spoken to generate a particular response. As the system learns, the speed of locating the particular "hit"
increases.
Both of the above techniques, used separately or in combination, require sophisticated analysis of the spoken words, and are th~refore costly in terms of equipment and analysis time. Consequently they have been used in centralized systems, where they can be ~-shared amongst many users. The complexity and cost of such systems have been found to be very high.
Another form of system which has been used is associated with each telephone set, with voice signals stored at each telephone set. Users speak commands, which are recognized in the telephone set based voice recognition system by comparison of the voice command signal with the stored audio signal. ~ ;

2~901~ ~

Such systems are required to be provided for each telephone set, which makes the telephone set very costly. ~ -SUMMARY OF THE PRESENT INVENTION:
The present invention is a centralized system, in which the voices of designated users are digitized and stored at a central location. Particular words are stored of a limited command vocabulary. Each voice signal is digitized, the digitized commands are grouped, linked by a code associated with a user's telephone, compressed and stored as data in a centralized mass storage device.
When a user activates a telephone, the associated data group is retrieved from storage, decompressed and placed in a virtual memory. When the user speaks one of the system commands, the spoken command is digitized and compared against the stored commands in the virtual memory. When a match is found the associated system function is initiated. Typically -up to twenty system commands are used.
The present invention has several advantages over the noted prior art systems. The equipment is centralized, used in common with all users, therefore avoiding the requirement that each telephone set should have a recognition system and voice storage device built into or connected to it. However because only a particularly designated (predetermined) group of co- ~ndc, associated with particular users are stored, sophisticated voice recognition and analysis equipment for all possible voices and all possible words is not required. Once the identity of a voice has been determined, it is comparable against only a small number of commands, e.g. twenty, and therefore determination of words from a large dictionary for reproduction on display or printing equipment is not required. The 21~0i2 establishment of a command "hit" of a small number of commands causes the equipment to execute the command.
One may consider the comparison between a voice recognition system which controls automatic typing S or display of words, with the present invention. In the former system, analysis must be made of each word, a comparison is made with a dictionary stored in a mass storage device, and the word is displayed on a CRT
display or printer. In such a system the equipment must be capable of discerning the words spoken for all potential users (who may have different voices, different accents, etc.). This could involve every potential speaker of the language, e.g. millions of persons. Clearly such equipment must be very sophisticated and is costly to provide.
On the other hand, in the present invention, certain specific commands are stored associated with a single user. The comparison equipment need only find a correspondence between the command spoken by that same user and his previously stored command. Clearly the process of comparison of the spoken cu ~n~ and the ~
stored command, to find a hit, is vastly simpler. The ~;
equipment is used in common with all potential users of their respective telephones, reducing the cost per user even further. --Once a single user has been identified as wanting to use the system, all of his or her potential commands are placed in virtual memory, ready for ;~
comparison of the cl ~n~. In accordance with the present invention, there are two steps to the retrieval process, the recognition of a particular user, which causes retrieval of all of that person's stored commands to virtual memory, and the subsequent recognition of the particular word sound corresponding to a command.
Consequently the process of determining the particular ~ ' 2149~19 commands spoken is vastly simpler than in the prior art system described above.
In accordance with an embodiment of the invention, a method of operating a central call control apparatus by plural users is comprised of storing an identification of each user in a directory, storing in compressed form a plurality of predetermined commands spoken by the users, grouped by user, each group with a pointer to a corresponding identification in the directory, indicating an identification in the directory of a service demanding user wishing to initiate a command, upon receipt of a service demanding signal, retrieving a corresponding group of commands linked by the pointer to the identification, and decompressiny and storing the group of commands, receiving spoken command words from the service demanding user, comparing the spoken command words with the decompressed group of commands to find decompressed command words, and invoking commands for which command words are found corresponding to the decompressed commands in the decompressed group of commands.
BRIEF INTRODUCTION TO THE DRAWINGS:
A better understanding of the invention will be obtained by reference to the detailed description below, in conjunction with the following drawings, in which:
Figure 1 is a block diagram of the present invention, Figure 2 is a block diagram used to illustrate the process and hardware of an embodiment of the present invention, Figure 3 is a block diagram illustrating a process of establishing the system for a particular user, 21~9012 s Figure 4 is similar to Figure 3 but used to illustrate another aspect of the process, Figure 5 is a block diagram used to illustrate operation of the system in dialing a call, and S Figure 6 is a block diagram used to illustrate operation of the system to implement a special feature call.
DETAILED DESCRIPTION OF THE INVENTION: :~
Figure 1 illustrates in block diagram apparatus which can be used to implement an embodiment ;
of the invention. A representative telephone set 1 is connected via a subscriber's line to a switching machine 3, which is controlled by a processor 5. Various trunks and other peripherals are connected to the switching machine 3 in a well known manner.
A peripheral connected to switching machine 3 in a manner in which it can be connected by the switching machine to any of the telephone sets is a ~
digitizer 11. The output of a voice synthesizer 9 is .
connected to the same peripheral terminal as the digitizer 11. A memory is connected to the output of the digitizer 11 containing a directory 13. A memory 15 is also connected to the output of digitizer 11, as well ~-as a compressor/decompressor 17. Additional memories, referred to as voice boxes l9A-19N are connected to compressor/decompressor 17. The input to the voice synthesizer 9 is connected to the memory 15 for receiving signals which can be converted to synthesized ~-~
speech, and applied to the peripheral terminal on the switching machine 3.
' Each of the voice synthesizer and memories is connected for control to processor 5. Alternatively a -separate processor may be used.
The switching machine 3 may be implemented using for example the Mitel SX2000TM, a system as ~ `'''' described in U.S. Patent 4,615,028. In such systems, any of the peripherals can have access to other peripherals for translation of circuit signals or message signals, i.e. information including signalling and control signals. Switching systems other than the above-mentioned may be used.
operation of the invention will be described below with further reference to Figures 2-6.
In order to establish the operation of the invention, a user alerts the system by pressing a predetermined key or his telephone set 1. The switching system, under control of processor 5, establishes a connection between telephone set 1 and digitizer 11, through to directory 13. Under control of the processor, the identity of the telephone set (user) is stored in the directory 13.
Turning now to Figure 3, it may be seen that when the system is alerted as described above, it establishes in the directory the identity of the particular user. The identity of three users, identified as User Code A, User Code B and User Code C
are shown as stored. As shown, when user B alerts the system, under control of the microprocessor a "voice box" (memory record space) for user B is established in memory 15. Once this is established, a sequence of stored prompts generate code which is passed to voice synthesizer 9, which generates speech which is passed to the telephone set, through the switching machine. The prompts request the user to repeat the prompts, e.g~ as shown in Figure 3, to repeat the words "dial", "1", "2", ..., and a sequence of offered feature names, such as "call forward".
The user repeats the prompts, which repeated prompts are passed through digitizer 11, and are stored as digital signals in the voice box for user B in the 21~0~2 system memory. The voice box of user B also contains, with each digitized repeated prompt, a pointer to a system command signal corresponding to each word, e.g. a "dial" pointer, a "1" pointer, a "2" pointer, a S particular feature pointer, etc. These pointers are used to initiate the generation of the command signal, ;
e.g. the generation of the digit 1, etc.
As shown in Figure 4, once the repeated prompts have been stored in the system, the digitized voice and pointers (the voice box of the user) are compressed in compressor/decompressor 17, and stored as --~
a group in a mass storage device with a link to the system directory. This is repeated for each of the users, as they come on stream.
Therefore in the system memory, a directory 13 contains a link by means of pointers to each user. In the mass storage, there is a stored voice box for each of the users, in which each voice box contains compressed prompts of the voice of the corresponding user, for each command. Each voice box contains a link ~;
to the directory entry for the particular user.
With reference to Figure 5, in order to utilize the system, the user, e.g. user B, alerts the -system by pressing a telephone set key to generate a multi-frequency tone or tones or by saying the user's ;~ -name, after the switching system 3 connects the -~-telephone set 1 to the particular peripheral line to which digitizer 11 is connected. The signal resulting from pressing the set key or saying the name is digitized in digitizer 11, and is compared under control of or by processor 5 with the data stored in the system directory 13. The task is not onerous, since only those names, in the voices of particular users are stored in directory 13. On finding a match, the link to the voice -~
box corresponding to the matched voice causes the voice ", 21~9012 :

box to be retrieved. The voice box signals are decompressed and stored in memory 15, preferably in virtual memory in order to minimize the memory requirements. Once the voice box has been retrieved, S the system is set for receiving spoken commands, and microprocessor 5, recognizing that the user's voice box has been stored in memory 15 causes dial tone to be passed to the telephone set.
The user then speaks the required words or phrases, e.g. "dial", "7", "2", ... i.e. a command to dial a particular telephone number. The voice signals are digitized in digitizer 11, and are compared, by means of microprocessor 5, with the data stored in the decompressed voice box in memory 15 for that particular user, e.g. in the example shown, user B. It will be recognized that since in the voice of the particular user particular -ommands have been prestored in the voice box, the task of comparing the digitized command signals with those in the voice box is significantly easier and quicker than if an expected comparison of any voice with any word were to be contemplated. The only voice to be compared with is the actual voice giving the commands.
With the finding of a match of a command, the pointer corresponding to the matched words, e.g. to begin dialing followed by the designation of a particular number to be dialed, is obtained. The command pointers stored with the voice box commands are retrieved by microprocessor 5, and are used to control the switching system to execute the desired operation.
Figure 6 illustrates operation of the invention when invoking a feature, and corresponds to the lower portion of Figure 5. The figure illustrates that a dialed tone has been provided back to the user, as described above.

- - ~ - - .............................. . . . - , . -:.. , , - . . ~, . i: . - . . .
" ~

21~9~12 g : ~.
However in this case, rather than saying "dial" or a numeral, the user says the name of a feature, e.g. "call forward". After digitizing, a comparison is carried out in which the "call forward" -~
feature stored in the voice box of user B is found. In this case the pointer points to a feature operation and description list 21 in the mass storage, and retrieves corresponding call forward command data signals stored therein. The corresponding data signals pass into the memory 150 Under control of microprocessor 5 accessing the call forward operation and description data signals, the user is interactively stepped through the required feature operation steps. ~or example, the words "state the number that you wish calls to be forwarded to and ~hen say the word TRANSFER" can be retrieved from memory 15, and passed to voice synthesizer 9, which translates the sentence into analog which is passed via the ~
subscriber's line to the user telephone 1. The user ~ -then states the numbers, which are compared with the stored numbers in the voice box stored in memory 15.
The resulting pointers, and the working of the call forwarding feature as described above, are used by the -;;
system to control the switching ma~hine forward future calls to the designated telephone line.
Thus the telephone system in effect mimics the user friendly operation of the old fashioned telephone ~`~
operator, in which the user can speak and have the system reach and respond to his or her commands. Where sophisticated features are to be used, the user hears, ~
in a friendly and understandable voice, instructions on -how to invoke the feature which instructions can be carried out in a conversational procedure.
Because the system has stored predetermined command words from particular users by voice prompts, it is a relatively simple task to compare the spoken ~ .:

21~9012 " . ,. . I o -commands with the stored voice signals relating to those commands. The system can therefore operate at high speed, and a centralized system can be used for a large number of users. With the stored signals being located S in the described voice boxes in a compressed form, such a system can be provided with relatively low cost to a large number of users.
A person understanding this invention may now conceive of alternative structures and embodiments or variations of the above. All of those which fall within the scope of the claims appended hereto are considered to be part of the present invention. -~

Claims (7)

I claim:
1. A method of operating a central call control apparatus by plural users comprising:
(a) storing an identification of each user in a directory, (b) storing in compressed form a plurality of predetermined commands spoken by the users, grouped by user, each group with a pointer to a corresponding identification in the directory, (c) indicating an identification in said directory of a service demanding user wishing to initiate a command, upon receipt of a service demanding signal, (d) retrieving a corresponding group of commands linked by said pointer to said identification, and decompressing and storing said group of commands, (e) receiving spoken command words from said service demanding user, (f) comparing said spoken command words with said decompressed group of commands to find decompressed command words, and (g) invoking commands for which command words are found corresponding to said decompressed commands in said decompressed group of commands.
2. A method as defined in claim 1, in which the identification of each user is a code corresponding to one or more multi-frequency tones.
3. A method as defined in claim 1, in which the identification of each user is a digitized form of a predetermined word.
4. A method as defined in claim 1 in which the step of indicating an identification of a service demanding user is comprised of receiving a predetermined service demanding signal from a user, comparing it with said user identifications in said directory, and providing said identification upon finding correspondence therebetween.
5. A method as defined in claim 4, in which the commands invoked comprise retrieving signals from a storage device corresponding to a feature, stepping through said signals and thereby providing voice prompts to said user to perform a predetermined procedure, receiving procedure signals from said user resulting from his carrying out of said procedure, and carrying out a command sequence resulting from receipt of said procedure signals.
6. A method as defined in claim 5 in which said procedure signals include words spoken by the user, and the method further includes the steps of comparing said words with said decompressed group of commands, and in response to finding correspondence therebetween, carrying out said command sequence.
7. A method as defined in claim 6 in which said groups of commands are stored in a mass storage memory.
CA002149012A 1991-12-31 1991-12-31 Voice activated telephone set Expired - Lifetime CA2149012C (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CA002149012A CA2149012C (en) 1991-12-31 1991-12-31 Voice activated telephone set

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CA002058644A CA2058644C (en) 1991-12-31 1991-12-31 Voice activated telephone set
CA002149012A CA2149012C (en) 1991-12-31 1991-12-31 Voice activated telephone set

Related Parent Applications (1)

Application Number Title Priority Date Filing Date
CA002058644A Division CA2058644C (en) 1991-12-31 1991-12-31 Voice activated telephone set

Publications (2)

Publication Number Publication Date
CA2149012A1 CA2149012A1 (en) 1993-07-01
CA2149012C true CA2149012C (en) 1997-09-02

Family

ID=4149012

Family Applications (2)

Application Number Title Priority Date Filing Date
CA002058644A Expired - Lifetime CA2058644C (en) 1991-12-31 1991-12-31 Voice activated telephone set
CA002149012A Expired - Lifetime CA2149012C (en) 1991-12-31 1991-12-31 Voice activated telephone set

Family Applications Before (1)

Application Number Title Priority Date Filing Date
CA002058644A Expired - Lifetime CA2058644C (en) 1991-12-31 1991-12-31 Voice activated telephone set

Country Status (5)

Country Link
CA (2) CA2058644C (en)
DE (1) DE4243181C2 (en)
GB (1) GB2263042B (en)
IT (1) IT1256746B (en)
MX (1) MX9207608A (en)

Families Citing this family (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
DE4315947C2 (en) * 1993-05-12 1995-11-16 Siemens Ag Arrangement for signal transmission between an audio device and a telephone device
JPH08274858A (en) * 1995-03-30 1996-10-18 Kokusai Denshin Denwa Co Ltd <Kdd> Dialless call origination equipment
SE513993C2 (en) * 1995-05-22 2000-12-11 Telia Ab Device and method of voice control of functions in a telecommunication system
US5960395A (en) * 1996-02-09 1999-09-28 Canon Kabushiki Kaisha Pattern matching method, apparatus and computer readable memory medium for speech recognition using dynamic programming
DE19730920A1 (en) * 1997-07-18 1999-01-21 Ise Interactive Systems Entwic Computer system adapted for hands-free speech communications
US9232037B2 (en) 1999-10-20 2016-01-05 Curo Interactive Incorporated Single action sensory prompt interface utilising binary state time domain selection protocol
US6804539B2 (en) 1999-10-20 2004-10-12 Curo Interactive Incorporated Single action audio prompt interface utilizing binary state time domain multiple selection protocol
US6771982B1 (en) 1999-10-20 2004-08-03 Curo Interactive Incorporated Single action audio prompt interface utlizing binary state time domain multiple selection protocol
DE10143292B4 (en) * 2001-09-04 2015-06-03 Deutsche Telekom Ag language window

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4827520A (en) * 1987-01-16 1989-05-02 Prince Corporation Voice actuated control system for use in a vehicle

Also Published As

Publication number Publication date
IT1256746B (en) 1995-12-15
DE4243181A1 (en) 1993-07-01
MX9207608A (en) 1994-05-31
CA2058644A1 (en) 1993-07-01
ITMI922888A1 (en) 1994-06-18
CA2149012A1 (en) 1993-07-01
DE4243181C2 (en) 1996-12-19
GB2263042A (en) 1993-07-07
CA2058644C (en) 1996-06-18
GB2263042B (en) 1995-06-28
GB9227115D0 (en) 1993-02-24
ITMI922888A0 (en) 1992-12-18

Similar Documents

Publication Publication Date Title
EP0477688B1 (en) Voice recognition telephone dialing
US5912949A (en) Voice-dialing system using both spoken names and initials in recognition
JP3561076B2 (en) Automatic call recognition method for arbitrarily spoken words
EP0890249B1 (en) Apparatus and method for reducing speech recognition vocabulary perplexity and dynamically selecting acoustic models
US6393403B1 (en) Mobile communication devices having speech recognition functionality
US6462616B1 (en) Embedded phonetic support and TTS play button in a contacts database
US6687673B2 (en) Speech recognition system
EP0780829B1 (en) Method for automatic speech recognition in telephony
US5832063A (en) Methods and apparatus for performing speaker independent recognition of commands in parallel with speaker dependent recognition of names, words or phrases
AU598999B2 (en) Voice controlled dialer with separate memories for any users and authorized users
US5651055A (en) Digital secretary
US5752230A (en) Method and apparatus for identifying names with a speech recognition program
US4461023A (en) Registration method of registered words for use in a speech recognition system
CA2149012C (en) Voice activated telephone set
US7844459B2 (en) Method for creating a speech database for a target vocabulary in order to train a speech recognition system
US20010056345A1 (en) Method and system for speech recognition of the alphabet
US4831653A (en) System for registering speech information to make a voice dictionary
US20050049858A1 (en) Methods and systems for improving alphabetic speech recognition accuracy
US20010025240A1 (en) Speech recognition device with reference transformation means
US6845356B1 (en) Processing dual tone multi-frequency signals for use with a natural language understanding system
EP0175503A1 (en) Method and apparatus for use in interactive dialogue
EP1056265A2 (en) Voice message search system and method
Immencloerfer Recent achievements in speech processing technologies and their applications

Legal Events

Date Code Title Description
EEER Examination request
MKEX Expiry