EP0840488A2 - Voice-dialling system using both spoken names and initial letters in recognition - Google Patents

Voice-dialling system using both spoken names and initial letters in recognition

Info

Publication number
EP0840488A2
Authority
EP
European Patent Office
Prior art keywords
directory
name
names
speech pattern
user
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Withdrawn
Application number
EP97308792A
Other languages
German (de)
French (fr)
Other versions
EP0840488A3 (en)
Inventor
Conway Chan
Craig Alexander Will
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Nortel Networks Ltd
Original Assignee
Nortel Networks Ltd
Northern Telecom Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Nortel Networks Ltd, Northern Telecom Ltd
Publication of EP0840488A2
Publication of EP0840488A3

Classifications

    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04M TELEPHONIC COMMUNICATION
    • H04M1/00 Substation equipment, e.g. for use by subscribers
    • H04M1/26 Devices for calling a subscriber
    • H04M1/27 Devices whereby a plurality of signals may be stored simultaneously
    • H04M1/271 Devices whereby a plurality of signals may be stored simultaneously controlled by voice recognition
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04M TELEPHONIC COMMUNICATION
    • H04M3/00 Automatic or semi-automatic exchanges
    • H04M3/42 Systems providing special services or facilities to subscribers
    • H04M3/42204 Arrangements at the exchange for service or number selection by voice
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04M TELEPHONIC COMMUNICATION
    • H04M3/00 Automatic or semi-automatic exchanges
    • H04M3/42 Systems providing special services or facilities to subscribers
    • H04M3/487 Arrangements for providing information services, e.g. recorded voice services or time announcements
    • H04M3/493 Interactive information services, e.g. directory enquiries; Arrangements therefor, e.g. interactive voice response [IVR] systems or voice portals
    • H04M3/4931 Directory assistance systems
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04Q SELECTING
    • H04Q3/00 Selecting arrangements
    • H04Q3/58 Arrangements providing connection between main exchange and sub-exchange or satellite
    • H04Q3/62 Arrangements providing connection between main exchange and sub-exchange or satellite for connecting to private branch exchanges
    • H04Q3/625 Arrangements in the private branch exchange
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04M TELEPHONIC COMMUNICATION
    • H04M2201/00 Electronic components, circuits, software, systems or apparatus used in telephone systems
    • H04M2201/40 Electronic components, circuits, software, systems or apparatus used in telephone systems using speech recognition
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04Q SELECTING
    • H04Q2213/00 Indexing scheme relating to selecting arrangements in general and for multiplex systems
    • H04Q2213/1307 Call setup
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04Q SELECTING
    • H04Q2213/00 Indexing scheme relating to selecting arrangements in general and for multiplex systems
    • H04Q2213/13093 Personal computer, PC
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04Q SELECTING
    • H04Q2213/00 Indexing scheme relating to selecting arrangements in general and for multiplex systems
    • H04Q2213/13096 Digital apparatus individually associated with a subscriber line, digital line circuits
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04Q SELECTING
    • H04Q2213/00 Indexing scheme relating to selecting arrangements in general and for multiplex systems
    • H04Q2213/13103 Memory
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04Q SELECTING
    • H04Q2213/00 Indexing scheme relating to selecting arrangements in general and for multiplex systems
    • H04Q2213/13106 Microprocessor, CPU
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04Q SELECTING
    • H04Q2213/00 Indexing scheme relating to selecting arrangements in general and for multiplex systems
    • H04Q2213/1322 PBX
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04Q SELECTING
    • H04Q2213/00 Indexing scheme relating to selecting arrangements in general and for multiplex systems
    • H04Q2213/13377 Recorded announcement
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04Q SELECTING
    • H04Q2213/00 Indexing scheme relating to selecting arrangements in general and for multiplex systems
    • H04Q2213/13378 Speech recognition, speech analysis

Definitions

  • This invention relates generally to systems for telephonic communications with audio message storage and retrieval and, more particularly, to telephonic communications involving repertory or abbreviated call signal generation and abbreviated dialing.
  • Voice-dialing systems enable telephone users to speak the name of an individual or destination into the microphone of a telephone handset to initiate a telephone call. Voice-dialing thus allows a connection to be made directly, and avoids the necessity of dialing telephone numbers or looking up names to locate corresponding telephone numbers and then dialing the numbers.
  • performance and ease of use of a voice-dialing system can be improved by providing a selection procedure that enables users to select a stored name corresponding to a spoken name by inputting one or more spoken letters associated with the spoken name when the system indicates that the spoken name alone is insufficient to select a name to initiate a telephone call.
  • a method for dialing a telephone by voice comprises the steps of (a) receiving from a user a speech pattern corresponding to a name in a directory the user intends to call and at least one spoken letter associated with the name, and (b) retrieving a telephone number corresponding to a name associated with the speech pattern.
  • the names in the directory may be represented by a sequence of orthographic letters, in which case the retrieving step may include the substeps of converting sequences of orthographic letters corresponding to the names in the directory into sequences of phonemes, and comparing the sequences of phonemes to the speech pattern to identify a sequence of phonemes for a name in the directory that best matches the speech pattern.
  • the names in the directory may be represented by sound patterns, in which case the retrieving step may include the substeps of converting the sound patterns for the names in the directory to orthographic letters, and comparing the orthographic letters for the names in the directory with an orthographic representation of the spoken letter.
  • the retrieving step may include the substep of comparing the names in the directory with the speech pattern using a phoneme-level representation for the names as an intermediary.
  • a method for dialing a telephone by voice comprises providing a directory of different names represented by phoneme strings and corresponding telephone numbers, said phoneme strings including initials for each of the directory names, and providing a user with access to the directory to initiate a telephone call by inputting a speech pattern corresponding to a name in the directory and at least one letter for the name.
  • the input speech pattern and letter are compared with the phoneme strings of the directory to select from the directory a telephone number for one of the directory names that best matches the name of the input speech pattern.
  • a method for dialing a telephone by voice comprises receiving from a user a speech pattern, the speech pattern indicating a name corresponding to a telephone number that the user intends to call.
  • the speech pattern includes a spoken name and at least one letter corresponding to the spoken name.
  • the method includes steps of utilizing the speech pattern to identify a portion of a directory containing different names and corresponding telephone numbers, providing to the user a selection of names from the directory determined to best match the speech pattern, and initiating a telephone call to one of the telephone numbers in accordance with the user's selection of a name.
  • a method comprises the steps of receiving from a user a speech pattern corresponding to a name in a directory the user intends to call, presenting the user with a name determined to correspond to the speech pattern, and receiving from the user an indication as to whether the presented name correctly matches the name the user intends to call.
  • the indication includes at least one spoken letter associated with the name the user intends to call.
  • This method may further include a step of retrieving a telephone number corresponding to a name associated with the speech pattern and spoken letter.
  • the present invention may include apparatus having components configured to perform functions similar to those performed in the methods summarized herein.
  • a voice-actuated dialing system is built around a directory stored in the memory of a computer that holds names and associated telephone numbers.
  • a person can use the system either locally, by picking up a telephone and speaking the name associated with the desired number, or by connecting from a remote location and speaking the name.
  • the invention may be implemented in a personal computer having a telephone interface card and software to perform speech recognition and speech synthesis, to dial a telephone number, and to control the voice-dialing system. It may also be used to provide automatic directory assistance by speaking the number aloud rather than dialing it.
  • the architecture of the system consists of a speech recognition component, a speech synthesizer, and a controller.
  • the first two components may use conventional techniques, with the speech recognition component recognizing input speech patterns representing names and comparing those patterns against stored names, and the speech synthesizer generating and outputting spoken phrases, including the stored names.
  • the controller uses a unique procedure to control the selection of stored names.
  • the controller uses a procedure in which the speech recognition component matches a spoken name against representations of different names in the directory to produce the name of the person that, based on a comparison of speech patterns for the spoken name with the speech patterns for stored names, the user most likely desires to call.
  • the controller also engages the synthesizer to present the selected name to the user for verification. If the name presented is correct, the controller initiates another procedure to dial the corresponding telephone number in the stored directory.
  • the controller permits the user to input individual spoken letters associated with the spoken name to facilitate the name selection process when the initial selection fails.
  • FIG. 1 shows a flow chart 100 of a voice-dialing procedure 100 that the controller uses to initiate telephone dialing.
  • the steps of procedure 100 are preferably implemented in software.
  • Flow chart 100 assumes that a user has previously created and stored, such as on a hard disk, a directory of names and associated telephone numbers.
  • One conventional software package that may be used to create such a directory is Microsoft Schedule+®, developed by Microsoft Corporation. This package includes a "contacts" capability for entering names and telephone numbers, the database of which can be accessed remotely by other application programs running under the Windows 95® operating system.
  • the speech recognition component processes the input speech data and attempts to match it against the set of stored representations corresponding to each name in the directory.
  • these representations are orthographic, consisting of sequences of letters spelling the names.
  • a speech recognition system typically processes incoming speech in terms of the phoneme representations that correspond to the stored orthographic representations by means of rules for converting between phonemic and orthographic representations.
  • An example of a speech recognition system with the desired capabilities is the "Model asr1500/M" speech engine from Lernout & Hauspie Speech Products N.V., Ieper, Belgium. This speech recognition system can run on a personal computer with a Pentium® microprocessor in close to real time without needing an additional coprocessor.
  • the speech recognition component can also use speaker-dependent technology, with names stored with phonemic representations. Such systems also require rules for converting phonemic representations to orthographic representations to process input letter sequences in selecting names.
  • the controller engages the speech recognition component to receive speech input for a name (step 120), and the speech recognition component processes the speech input by comparing it to the corresponding representations in the directory for each name.
  • the component determines the name in memory that best matches the speech input (step 130), and instructs the speech synthesizer to play a combination of recorded and synthesized speech, "Did you say <name>?", where <name> is the synthesized speech from the stored representations corresponding to the best matching name (step 140).
  • the controller then waits to receive a response from the user (step 150). If the speech recognition component determines that the user said "Yes," the controller looks up the appropriate telephone number (step 170) and proceeds to dial it (step 180). Control procedure 100 is then finished.
  • If the user does not recognize the name as being the name of the person he intended to call, he can respond in one of two proper ways (step 160): by saying "No" or by speaking one or more initials as input. Although new users may simply say "No," experienced users will know to speak the initials corresponding to the first and last names, or to the first, middle, and last names. Alternatively, users may spell the full first and/or last name. In all cases, initial input accelerates the voice recognition process.
  • the speech recognition component integrates the resulting information together with the name previously spoken to determine the name in the directory that is the best match (step 130).
  • the controller engages the speech synthesizer to play "Did you say <name>?" with the new name (step 140), and the process continues.
  • If the user replies "No," either because the user is new to the system and does not know about entering initials, or has unsuccessfully tried to use initials (step 160: "No"), the system will test whether the user has already used initials (step 190). If so, the controller will instruct the speech recognition component to obtain the next best matching name (step 220) and the speech synthesizer to play "Did you say <name>?" (step 140).
  • If the user has not already used initials (step 190), the controller directs the speech synthesizer to play "Please enter the initials of the person you want to call" (step 200), and then waits to receive the initials (step 210). Once the initials are received, control passes to select a stored name that best matches the spoken name and letter sequence (step 130), and the process continues in the manner explained above.
  • If the user fails to respond or responds in a manner that is not recognizable by the system when requested to confirm a match (step 160: other), the controller instructs the speech synthesizer to play "I don't understand you" (step 230). Subsequently, process flow continues with step 110.
  • procedure 100 may ask the user directly for both the name and the initials of the desired name before any attempt to recognize the name is made.
  • other letter sequences may be used, particularly spelling out part or all of the first, last, or both names.
  • the determination of the best matching name using both the spoken name and spoken letter initials uses an "N best" matching algorithm in which the speech recognition algorithm provides possible matches in a list ordered from best to worst match, together with a measure of the quality of each match. This is done for the name and for each of the letters. A confidence level is calculated for each of the matches, and an overall estimate of the best N matching names is determined on the basis of all sources of information. This list of overall N best matching names is used to provide to the user the synthetically spoken name for verification, continuing to next best matches should the user respond to the verification request with a "No."
  • the voice-dialing system according to the present invention is particularly suitable for use with PBX-based systems. Such systems control calls from multiple telephones at a physical or virtual site.
  • FIG. 2 illustrates an exemplary PBX-based voice-dialing system 300.
  • System 300 includes PBX system 310, random access memory (RAM) 320, hard disk 330, and microprocessor 340.
  • PBX system 310 also connects telephones 352, 354, 356, and 358 to a public switched telephone network. A typical PBX would have tens to hundreds of these lines.
  • PBX system 310 may be a Northern Telecom Meridian 1® PBX system, with a T1 digital connection between microprocessor 340 and PBX system 310.
  • Microprocessor 340 may be a conventional microprocessor such as a Pentium processor.
  • RAM 320 and hard disk 330 may also be conventional. In operation, however, they store the programs for the speech recognition component, speech synthesizer, and controller for voice-dialing. They also store the directory of names and corresponding telephone numbers that is available to users for purposes of implementing voice-dialing according to the present invention.
  • the directory of names stored on hard disk 330 is updated from a directory maintained in PBX system 310.
  • microprocessor 340 executes software for control procedure 100 and the functions of the speech recognition component and speech synthesizer.
  • microprocessor 340 instructs PBX system 310 to place a call to the stored telephone number for the selected name.
  • FIG. 3 shows another architecture in which the voice-dialing system according to the present invention may be implemented.
  • Personal directory system 400 includes hardware for a standard personal computer (for example, an IBM compatible personal computer), together with some additions related to telephony, and an ordinary telephone 490.
  • System 400 consists of RAM 410, hard disk 420, telephone port 430, microprocessor 440, mouse 450, keyboard 460, video display 470, and telephone port 480. These components may be standard off-the-shelf hardware.
  • microprocessor 440 may be a Pentium processor and video display 470 may be a NEC MultiSync 3V monitor.
  • Telephone port 430 connects microprocessor 440 to a public switched telephone network, and telephone port 480 connects microprocessor 440 to telephone 490.
  • the input/output devices i.e., mouse 450, keyboard 460, and monitor 470, may be used to create a directory of names and telephone numbers used for voice-dialing.
  • Telephone 490 may be used for the user to interface with the speech recognition component to create the stored representations for the names in the directory.
  • a standard graphical user interface for a conventional database application may be used for this function.
  • the conventional database application must interface with both the speech recognition component and speech synthesizer in the manner described above.
  • telephone port 480 and telephone 490 may be replaced by a microphone and speaker connected directly to microprocessor 440 via appropriate digital-to-analog and analog-to-digital converters and amplifiers.
  • the microphone and speaker would be used for voice-dialing and data input.
  • microprocessor 440 executes software for control procedure 100 and the functions of the speech recognition component and speech synthesizer.
  • microprocessor 440 places a call to the stored telephone number for the selected name.
  • Performance of voice-dialing systems can be improved by providing a selection procedure that enables users to select a stored name corresponding to a spoken name by inputting one or more spoken letters associated with the spoken name. This increases the accuracy of the automatic speech recognition component in matching of incoming spoken names with names stored in the directory. It also makes voice-dialing systems easier to use.
  • the present invention also facilitates fast and accurate voice-dialing within a site using a PBX system.
  • a site-wide directory permits all users connected to the PBX system to use voice-dialing quickly, easily, and efficiently to make telephone calls.
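
The "N best" combination of name and letter evidence described in the list above can be illustrated with a short Python sketch. The text does not say how the confidence levels are calculated or combined, so the multiplication of per-source scores, the floor score for letters the recognizer did not propose, and every function and variable name below are assumptions made purely for illustration.

```python
from typing import Dict, List, Tuple


def initials_of(name: str) -> str:
    """Return the upper-case initials of a directory name."""
    return "".join(part[0].upper() for part in name.split())


def fuse_n_best(
    name_scores: Dict[str, float],
    letter_scores: List[Dict[str, float]],
    n: int = 3,
    floor: float = 0.01,
) -> List[Tuple[str, float]]:
    """Combine name-level and letter-level match scores into one ranked list.

    name_scores   -- hypothetical recognizer output: directory name -> score in (0, 1]
    letter_scores -- one dict per spoken letter position: letter -> score in (0, 1]
    floor         -- assumed score for a letter the recognizer did not propose
    """
    ranked = []
    for name, score in name_scores.items():
        initials = initials_of(name)
        combined = score
        for position, candidates in enumerate(letter_scores):
            if position < len(initials):
                combined *= candidates.get(initials[position], floor)
            else:
                combined *= floor  # more letters spoken than the name has initials
        ranked.append((name, combined))
    # Best overall match first; the caller walks down the list on each "No".
    ranked.sort(key=lambda item: item[1], reverse=True)
    return ranked[:n]


# The name alone slightly favors the wrong entry; the spoken initials "A", "G" correct it.
name_scores = {"Amanda Graham": 0.50, "Miranda Graham": 0.55, "Amanda Grant": 0.40}
letter_scores = [{"A": 0.8, "M": 0.1}, {"G": 0.9, "J": 0.1}]
best = fuse_n_best(name_scores, letter_scores)
```

In this toy input the spoken name on its own would rank "Miranda Graham" first, while the fused list puts "Amanda Graham" first, which is the behavior the spoken initials are meant to produce.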

Landscapes

  • Engineering & Computer Science (AREA)
  • Signal Processing (AREA)
  • Physics & Mathematics (AREA)
  • Astronomy & Astrophysics (AREA)
  • General Physics & Mathematics (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Human Computer Interaction (AREA)
  • Telephonic Communication Services (AREA)
  • Telephone Function (AREA)

Abstract

A system for dialing a telephone by voice receives from a user a spoken name corresponding to a telephone number that the user desires to call and at least one initial. The system uses both forms of speech information to retrieve a stored telephone number that corresponds to a stored name that best matches the spoken name.

Description

Background Art
This application relates to U.S.S.N. 08/726,604, entitled "Voice-Dialing System Using An Adaptive Model of Calling Behavior," filed October 7, 1996 and incorporated herein by reference.
Technical Field
This invention relates generally to systems for telephonic communications with audio message storage and retrieval and, more particularly, to telephonic communications involving repertory or abbreviated call signal generation and abbreviated dialing.
Voice-dialing systems enable telephone users to speak the name of an individual or destination into the microphone of a telephone handset to initiate a telephone call. Voice-dialing thus allows a connection to be made directly, and avoids the necessity of dialing telephone numbers or looking up names to locate corresponding telephone numbers and then dialing the numbers.
Examples of experimental voice-dialing systems appear in L. R. Rabiner, J. G. Wilpon, and A. E. Rosenberg, "A voice-controlled, repertory-dialer system," Bell System Technical Journal, Vol. 59, No. 7 (September, 1980), and U.S. Patent No. 4,348,550 to Pirz et al. Longstanding problems with such systems, however, limited their performance in terms of both accuracy and computational speed.
Recent advances in automatic speech recognition have improved performance dramatically, particularly for systems that are not trained to a particular speaker, which have, until recently, performed much worse than systems trained to particular speakers. In addition, the increasing computational and memory capacity and decreasing cost of computing hardware have significantly improved the commercial viability for the simpler applications of speech recognition such as voice-dialing.
Limitations on the performance of voice-dialing systems, however, still significantly reduce their commercial applicability. Such systems frequently make mistakes, the rate of error increasing with increasing vocabulary size, changes in environment, unusual accents, and the use of foreign or unusual names that might be difficult to pronounce. This limited accuracy restricts the possible range of applications for conventional systems to those with limited vocabularies, tightly controlled environments, and small user populations. There are also restrictions placed on the hardware platforms on which the systems can run.
It is therefore desirable to seek techniques that will improve the accuracy, speed, and ease of use of voice-dialing systems. A number of alternative techniques have been used in the past. One approach uses an interactive scheme in which the system asks the user to verify the name before dialing (e.g., "Did you say Amanda Graham?") and presents a different name if the user says "No." See, for example, U.S. Patent No. 5,222,121 to Shimada, and U.S. Patent No. 5,301,227 to Kamei et al.
Disclosure of the Invention
There is, therefore, a need to improve the accuracy of voice-dialing systems. In accordance with the present invention, performance and ease of use of a voice-dialing system can be improved by providing a selection procedure that enables users to select a stored name corresponding to a spoken name by inputting one or more spoken letters associated with the spoken name when the system indicates that the spoken name alone is insufficient to select a name to initiate a telephone call.
In accordance with the present invention, as embodied and broadly described herein, a method for dialing a telephone by voice comprises the steps of (a) receiving from a user a speech pattern corresponding to a name in a directory the user intends to call and at least one spoken letter associated with the name, and (b) retrieving a telephone number corresponding to a name associated with the speech pattern. The names in the directory may be represented by a sequence of orthographic letters, in which case the retrieving step may include the substeps of converting sequences of orthographic letters corresponding to the names in the directory into sequences of phonemes, and comparing the sequences of phonemes to the speech pattern to identify a sequence of phonemes for a name in the directory that best matches the speech pattern. Alternatively, the names in the directory may be represented by sound patterns, in which case the retrieving step may include the substeps of converting the sound patterns for the names in the directory to orthographic letters, and comparing the orthographic letters for the names in the directory with an orthographic representation of the spoken letter. In another alternative, when the names in the directory are represented by a sequence of orthographic letters, the retrieving step may include the substep of comparing the names in the directory with the speech pattern using a phoneme-level representation for the names as an intermediary.
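The first alternative above (convert each orthographic name into a phoneme sequence, then pick the name whose sequence best matches the speech pattern) can be sketched in Python as follows. This is a toy under stated assumptions, not the conversion rules of any real recognizer: the rule table `RULES` is invented and far too small for real names, the speech pattern is assumed to arrive already decoded into a phoneme list, and plain edit distance stands in for the acoustic comparison.

```python
from typing import Dict, List

# Hypothetical, heavily simplified letter-to-phoneme rules (two-letter rules tried first).
RULES: Dict[str, List[str]] = {
    "ph": ["F"], "ch": ["CH"], "sh": ["SH"], "th": ["TH"],
    "a": ["AE"], "e": ["EH"], "i": ["IH"], "o": ["OW"], "u": ["AH"],
}


def to_phonemes(name: str) -> List[str]:
    """Convert an orthographic name into a toy phoneme sequence."""
    text = name.lower().replace(" ", "")
    phonemes: List[str] = []
    i = 0
    while i < len(text):
        pair = text[i:i + 2]
        if pair in RULES:
            phonemes.extend(RULES[pair])
            i += 2
        else:
            # Letters without a rule map to themselves in upper case.
            phonemes.extend(RULES.get(text[i], [text[i].upper()]))
            i += 1
    return phonemes


def edit_distance(a: List[str], b: List[str]) -> int:
    """Levenshtein distance between two phoneme sequences."""
    previous = list(range(len(b) + 1))
    for i, x in enumerate(a, start=1):
        current = [i]
        for j, y in enumerate(b, start=1):
            cost = 0 if x == y else 1
            current.append(min(previous[j] + 1, current[j - 1] + 1, previous[j - 1] + cost))
        previous = current
    return previous[-1]


def best_match(heard: List[str], directory: Dict[str, str]) -> str:
    """Return the directory name whose phoneme sequence is closest to what was heard."""
    return min(directory, key=lambda name: edit_distance(heard, to_phonemes(name)))


directory = {"Phil Chan": "555-0101", "Bill Chen": "555-0102"}
heard = ["F", "IH", "L", "CH", "AE", "N"]  # assumed recognizer output for the spoken name
match = best_match(heard, directory)
number = directory[match]
```

Keeping the directory orthographic and converting on demand means new names can be added as text, without recording a spoken sample for each entry.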
In accordance with another aspect of the present invention, as embodied and broadly described herein, a method for dialing a telephone by voice comprises providing a directory of different names represented by phoneme strings and corresponding telephone numbers, said phoneme strings including initials for each of the directory names, and providing a user with access to the directory to initiate a telephone call by inputting a speech pattern corresponding to a name in the directory and at least one letter for the name. The input speech pattern and letter are compared with the phoneme strings of the directory to select from the directory a telephone number for one of the directory names that best matches the name of the input speech pattern.
In accordance with yet another aspect of the present invention, as embodied and broadly described herein, a method for dialing a telephone by voice comprises receiving from a user a speech pattern, the speech pattern indicating a name corresponding to a telephone number that the user intends to call. The speech pattern includes a spoken name and at least one letter corresponding to the spoken name. The method includes steps of utilizing the speech pattern to identify a portion of a directory containing different names and corresponding telephone numbers, providing to the user a selection of names from the directory determined to best match the speech pattern, and initiating a telephone call to one of the telephone numbers in accordance with the user's selection of a name.
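One way to picture the step of identifying a portion of the directory and offering a selection of names is an index keyed by initials, as in the Python sketch below. The index structure, the function names, and the sample telephone numbers are assumptions for illustration; the text does not prescribe how the portion of the directory is identified.

```python
from collections import defaultdict
from typing import Dict, List


def initials_of(name: str) -> str:
    """Return the upper-case initials of a directory name."""
    return "".join(word[0].upper() for word in name.split())


def build_initials_index(directory: Dict[str, str]) -> Dict[str, List[str]]:
    """Group directory names by their initials."""
    index: Dict[str, List[str]] = defaultdict(list)
    for name in directory:
        index[initials_of(name)].append(name)
    return index


def candidates_for(
    index: Dict[str, List[str]],
    spoken_letters: str,
    ranked_names: List[str],
    limit: int = 3,
) -> List[str]:
    """Keep the recognizer's ranking, restricted to names with the spoken initials."""
    portion = set(index.get(spoken_letters.upper(), []))
    return [name for name in ranked_names if name in portion][:limit]


directory = {
    "Amanda Graham": "555-0100",
    "Miranda Graham": "555-0111",
    "Amanda Grant": "555-0122",
    "Alan Gray": "555-0133",
}
index = build_initials_index(directory)
# Assumed recognizer ranking for the spoken name, best match first.
ranked_names = ["Miranda Graham", "Amanda Graham", "Amanda Grant", "Alan Gray"]
choices = candidates_for(index, "AG", ranked_names)
selection = choices[0]          # stands in for the user's pick from the offered names
number = directory[selection]   # the number the call would be placed to
```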
In accordance with still another aspect of the present invention, as embodied and broadly described herein, a method comprises the steps of receiving from a user a speech pattern corresponding to a name in a directory the user intends to call, presenting the user with a name determined to correspond to the speech pattern, and receiving from the user an indication as to whether the presented name correctly matches the name the user intends to call. The indication includes at least one spoken letter associated with the name the user intends to call. This method may further include a step of retrieving a telephone number corresponding to a name associated with the speech pattern and spoken letter.
The present invention, as embodied and broadly described herein, may include apparatus having components configured to perform functions similar to those performed in the methods summarized herein.
Brief Description of the Drawings
The accompanying drawings, which are incorporated in and constitute a part of this specification, illustrate an implementation of the invention and, together with the description, explain the goals, advantages, and principles of the invention. In the drawings,
  • FIG. 1 is a flow chart of a procedure used to initiate telephone calls according to a preferred implementation of the voice-dialing system of the present invention;
  • FIG. 2 is a block diagram of a PBX-based system in which the present invention may be implemented; and
  • FIG. 3 is a block diagram of a personal directory system in which the present invention may be implemented.
Best Mode for Carrying Out the Invention
    Reference will now be made in detail to an implementation of the present invention as illustrated in the accompanying drawings. Wherever possible, the reference numbers used in the drawings will appear in the following description to refer to the same or like parts.
    A. Introduction
    A voice-actuated dialing system according to the present invention is built around a directory stored in the memory of a computer that holds names and associated telephone numbers. A person can use the system either locally, by picking up a telephone and speaking the name associated with the desired number, or by connecting from a remote location and speaking the name. The invention may be implemented in a personal computer having a telephone interface card and software to perform speech recognition and speech synthesis, to dial a telephone number, and to control the voice-dialing system. It may also be used to provide automatic directory assistance by speaking the number aloud rather than dialing it.
    The architecture of the system consists of a speech recognition component, a speech synthesizer, and a controller. The first two components may use conventional techniques, with the speech recognition component recognizing input speech patterns representing names and comparing those patterns against stored names, and the speech synthesizer generating and outputting spoken phrases, including the stored names. The controller, however, uses a unique procedure to control the selection of stored names.
    In particular, the controller uses a procedure in which the speech recognition component matches a spoken name against representations of different names in the directory to produce the name of the person that, based on a comparison of speech patterns for the spoken name with the speech patterns for stored names, the user most likely desires to call. The controller also engages the synthesizer to present the selected name to the user for verification. If the name presented is correct, the controller initiates another procedure to dial the corresponding telephone number in the stored directory. Alternatively, the controller permits the user to input individual spoken letters associated with the spoken name to facilitate the name selection process when the initial selection fails.
    B. Voice-Dialing Controller Procedure
    FIG. 1 shows a flow chart 100 of a voice-dialing procedure 100 that the controller uses to initiate telephone dialing. The steps of procedure 100 are preferably implemented in software.
    Flow chart 100 assumes that a user has previously created and stored, such as on a hard disk, a directory of names and associated telephone numbers. One conventional software package that may be used to create such a directory is Microsoft Schedule+®, developed by Microsoft Corporation. This package includes a "contacts" capability for entering names and telephone numbers, the database of which can be accessed remotely by other application programs running under the Windows 95® operating system.
    The speech recognition component processes the input speech data and attempts to match it against the set of stored representations corresponding to each name in the directory. In a speaker-independent recognition system, these representations are orthographic, consisting of sequences of letters spelling the names. A speech recognition system typically processes incoming speech in terms of the phoneme representations that correspond to the stored orthographic representations by means of rules for converting between phonemic and orthographic representations. An example of a speech recognition system with the desired capabilities is the "Model asr1500/M" speech engine from Lernout & Hauspie Speech Products N.V., Ieper, Belgium. This speech recognition system can run on a personal computer with a Pentium® microprocessor in close to real time without needing an additional coprocessor.
    The speech recognition component can also use speaker-dependent technology, with names stored with phonemic representations. Such systems also require rules for converting phonemic representations to orthographic representations to process input letter sequences in selecting names.
    Users wanting to place calls using the voice-dialing capability press a preset button on their telephone instrument or dial an extension that connects to the controller for the voice-dialing system. The controller then invokes the speech synthesizer to play "Who do you want to call?" to the user via synthesized or recorded speech (step 110). The system then enters a wait state during which it waits for speech input from the user. If the user does not speak a name within some predetermined period, control may pass back to step 110 to ask the user again who he or she wants to call.
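    By way of illustration only, the prompt and wait state of step 110 might be sketched as follows. The function name `prompt_for_name`, the `say` and `listen` callables, the five-second timeout, and the limit of three prompts are all hypothetical; the description specifies only "some predetermined period."

```python
from typing import Callable, Optional


def prompt_for_name(
    say: Callable[[str], None],
    listen: Callable[[float], Optional[str]],
    timeout_s: float = 5.0,   # assumed value; the text gives no figure
    max_prompts: int = 3,     # assumed limit; the text gives no limit
) -> Optional[str]:
    """Play the step 110 prompt and wait for speech, re-prompting on silence.

    `listen(timeout_s)` is assumed to return the captured utterance, or
    None (or an empty string) when nothing was spoken before the timeout.
    """
    for _ in range(max_prompts):
        say("Who do you want to call?")   # step 110
        utterance = listen(timeout_s)     # wait state
        if utterance:
            return utterance              # hand off to step 120
    return None                           # caller never spoke
```

    Bounding the number of prompts is a design choice of this sketch so that an abandoned line does not loop forever.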
    The controller then engages the speech recognition component to receive speech input for a name (step 120), and the speech recognition component processes the speech input by comparing it to the corresponding representations in the directory for each name. The component determines the name in memory that best matches the speech input (step 130), and instructs the speech synthesizer to play a combination of recorded and synthesized speech, "Did you say <name>?", where <name> is the synthesized speech from the stored representations corresponding to the best matching name (step 140).
    The controller then waits to receive a response from the user (step 150). If the speech recognition component determines that the user said "Yes," the controller looks up the appropriate telephone number (step 170) and proceeds to dial it (step 180). Control procedure 100 is then finished.
    If the user does not recognize the name as being the name of the person he intended to call, he can respond in one of two proper ways (step 160): by saying "No" or by speaking one or more initials as input. Although new users may simply say "No," experienced users will know to speak the initials corresponding to the first and last names, or to the first, middle, and last names. Alternatively, users may spell the full first and/or last name. In all cases, input of initials accelerates the voice recognition process.
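    As a minimal sketch of the letter sequences just described, the following assumes the directory is a plain mapping from orthographic names to telephone numbers and that the spoken letters have already been recognized as text. The function names and the directory entries are hypothetical and serve only to illustrate the first-and-last, first-middle-last, and spelled-name cases.

```python
from __future__ import annotations


def letter_sequences(full_name: str) -> set[str]:
    """Letter sequences a caller might speak for a (non-empty) name.

    Covers the cases named in the text: all initials, first and last
    initials only, and a fully spelled first or last name.
    """
    parts = [p.upper() for p in full_name.split()]
    sequences = {"".join(p[0] for p in parts)}       # e.g. first, middle, last
    if len(parts) > 2:
        sequences.add(parts[0][0] + parts[-1][0])    # first and last only
    sequences.add(parts[0])                          # spelled first name
    sequences.add(parts[-1])                         # spelled last name
    return sequences


def names_matching_letters(directory: dict[str, str], spoken: str) -> list[str]:
    """Directory names consistent with the recognized letter sequence."""
    letters = spoken.upper().replace(" ", "")
    return sorted(n for n in directory if letters in letter_sequences(n))


# Hypothetical directory used only for illustration.
EXAMPLE_DIRECTORY = {
    "John Smith": "555-0100",
    "Joan Smythe": "555-0101",
    "Mary Ann Jones": "555-0102",
}
```

    With this example directory, the initials "J S" remain consistent with two entries, which is why the letters are combined with the spoken name rather than used alone.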
    If the user enters such initials (step 160: initial input), the speech recognition component integrates the resulting information together with the name previously spoken to determine the name in the directory that is the best match (step 130). The controller then engages the speech synthesizer to play "Did you say <name>?" with the new name (step 140), and the process continues.
    If the user replies "No" either because the user is new to the system and does not know about entering initials, or has unsuccessfully tried to use initials (step 160: "No"), the system will test whether the user has already used initials (step 190). If so, the controller will instruct the speech recognition component to obtain the next best matching name (step 220) and the speech synthesizer to play "Did you say <name>?" (step 140).
    If the initials have not been entered (step 190), the controller directs the speech synthesizer to play "Please enter the initials of the person you want to call" (step 200), and then waits to receive the initials (step 210). Once the initials are received, control passes to select a stored name that best matches the spoken name and letter sequence (step 130), and the process continues in the manner explained above.
    If the user fails to respond or responds in a manner that is not recognizable by the system when requested to confirm a match (step 160: other), the controller instructs the speech synthesizer to play "I don't understand you" (step 230). Subsequently, process flow continues with step 110.
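    The control flow of steps 110 through 230 described above can be sketched as follows. This is an illustration under stated assumptions, not the patented implementation: the recognizer is reduced to a hypothetical `rank` callable returning directory names in best-first order, `hear` is assumed to return already-recognized text, spoken initials are assumed to arrive as single letters separated by spaces, and the behavior when the candidate list is exhausted (restart at step 110) and the `max_restarts` limit are assumptions the description does not address.

```python
from typing import Callable, Dict, List, Optional


def is_letter_sequence(reply: str) -> bool:
    """True when the reply consists only of single spoken letters, e.g. 'j s'."""
    tokens = reply.split()
    return bool(tokens) and all(len(t) == 1 and t.isalpha() for t in tokens)


def run_procedure_100(
    directory: Dict[str, str],
    rank: Callable[[str, Optional[str]], List[str]],  # hypothetical recognizer
    hear: Callable[[], str],
    say: Callable[[str], None],
    max_restarts: int = 3,                            # assumed safety limit
) -> Optional[str]:
    """Return the telephone number to dial, or None if nothing was confirmed."""
    for _ in range(max_restarts):
        say("Who do you want to call?")                   # step 110
        spoken_name = hear()                              # step 120
        letters: Optional[str] = None
        candidates = rank(spoken_name, letters)           # step 130
        position = 0
        while position < len(candidates):
            name = candidates[position]
            say(f"Did you say {name}?")                   # step 140
            reply = hear().strip().lower()                # step 150
            if reply == "yes":                            # step 160: "Yes"
                return directory[name]                    # step 170; caller dials (180)
            if reply == "no":                             # step 160: "No"
                if letters is not None:                   # step 190: initials used
                    position += 1                         # step 220: next best
                    continue
                say("Please enter the initials of the person you want to call")  # step 200
                reply = hear().strip().lower()            # step 210
            if is_letter_sequence(reply):                 # step 160: initial input
                letters = "".join(reply.split()).upper()
                candidates = rank(spoken_name, letters)   # back to step 130
                position = 0
                continue
            say("I don't understand you")                 # step 230
            break                                         # back to step 110
    return None
```

    Passing `rank`, `hear`, and `say` in as callables keeps the controller logic separate from the recognition and synthesis components, mirroring the three-part architecture described in the introduction.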
    A number of variations on procedure 100 are also possible. For example, the system may ask the user directly for both the name and the initials of the desired name before any attempt to recognize the name is made. Also, other letter sequences may be used, particularly spelling out part or all of the first, last, or both names.
    The determination of the best matching name using both the spoken name and spoken letter initials uses an "N best" matching algorithm in which possible matches are provided by the speech recognition algorithm in a list in decreasing best match together with a measure of the quality of that match. This is done for the name and for each of the letters. A calculation is made of the confidence level for each of the matches, and an overall estimate is determined of the best N matching names on the basis of all sources of information. This list of overall N best matching names is used to provide to the user the synthetically spoken name for verification, continuing to next best matches should the user respond to the verification request with a "No."
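    One way such a combination might be sketched is shown below. The description does not give the formula used to compute the overall estimate, so this example assumes, purely for illustration, that match qualities lie between 0 and 1, that the name and each letter are independent sources whose scores are multiplied, that each spoken letter is compared with the corresponding initial of a directory name, and that any hypothesis missing from a list receives a small floor score. The names `combine_n_best` and `FLOOR` are hypothetical.

```python
from typing import List, Sequence, Tuple

FLOOR = 0.01  # assumed score for a hypothesis the recognizer did not list

Hypotheses = Sequence[Tuple[str, float]]  # (candidate, match quality), best first


def combine_n_best(
    name_hyps: Hypotheses,
    letter_hyps: Sequence[Hypotheses],
    directory_names: Sequence[str],
    n: int = 3,
) -> List[Tuple[str, float]]:
    """Rank directory names using the spoken name and each spoken letter.

    `name_hyps` is the N-best list for the spoken name; `letter_hyps` holds
    one N-best list per spoken letter, in the order the letters were spoken.
    """
    name_score = dict(name_hyps)
    ranked = []
    for name in directory_names:
        score = name_score.get(name, FLOOR)
        name_initials = [word[0].upper() for word in name.split()]
        if letter_hyps:
            if len(name_initials) != len(letter_hyps):
                score *= FLOOR                      # wrong number of initials
            else:
                for initial, hyps in zip(name_initials, letter_hyps):
                    score *= dict(hyps).get(initial, FLOOR)
        ranked.append((name, score))
    ranked.sort(key=lambda item: item[1], reverse=True)
    return ranked[:n]
```

    Under these assumptions, a name that scored best on the spoken name alone can drop below a lower-ranked name once the letters disagree with its initials, which is the effect the verification loop relies on when it moves to the next best match.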
    C. PBX-Based System Architecture
    The voice-dialing system according to the present invention is particularly suitable for use with PBX-based systems. Such systems control calls from multiple telephones at a physical or virtual site.
    FIG. 2 illustrates an exemplary PBX-based voice-dialing system 300. System 300 includes PBX system 310, random access memory (RAM) 320, hard disk 330, and microprocessor 340. PBX system 310 also connects telephones 352, 354, 356, and 358 to a public switched telephone network. A typical PBX would have tens to hundreds of these lines. PBX system 310 may be a Northern Telecom Meridian 1® PBX system, with a T1 digital connection between microprocessor 340 and PBX system 310.
    Microprocessor 340 may be a conventional microprocessor such as a Pentium processor. RAM 320 and hard disk 330 may also be conventional. In operation, however, they store the programs for the speech recognition component, speech synthesizer, and controller for voice-dialing. They also store the directory of names and corresponding telephone numbers that is available to users for purposes of implementing voice-dialing according to the present invention. The directory of names stored on hard disk 330 is updated from a directory maintained in PBX system 310.
    When a user picks up the handset of one of the telephones 352-358 and initiates voice-dialing, the controller begins operation and microprocessor 340 executes software for control procedure 100 and the functions of the speech recognition component and speech synthesizer. When the user confirms the selection of a name from the stored directory, microprocessor 340 instructs PBX system 310 to place a call to the stored telephone number for the selected name.
    D. Personal Voice-Dialing System
    FIG. 3 shows another architecture in which the voice-dialing system according to the present invention may be implemented. Personal directory system 400 includes hardware for a standard personal computer (for example, an IBM compatible personal computer), together with some additions related to telephony, and an ordinary telephone 490.
    System 400 consists of RAM 410, hard disk 420, telephone port 430, microprocessor 440, mouse 450, keyboard 460, video display 470, and telephone port 480. These components may be standard off-the-shelf hardware. For example, microprocessor 440 may be a Pentium processor and video display 470 may be a NEC MultiSync 3V monitor. Telephone port 430 connects microprocessor 440 to a public switched telephone network, and telephone port 480 connects microprocessor 440 to telephone 490.
    The input/output devices, i.e., mouse 450, keyboard 460, and video display 470, may be used to create a directory of names and telephone numbers used for voice-dialing. Telephone 490 may be used for the user to interface with the speech recognition component to create the stored representations for the names in the directory.
    A standard graphical user interface for a conventional database application may be used for this function. The conventional database application, however, must interface with both the speech recognition component and speech synthesizer in the manner described above.
    Alternatively, telephone port 480 and telephone 490 may be replaced by a microphone and speaker connected directly to microprocessor 440 via appropriate digital-to-analog and analog-to-digital converters and amplifiers. In this configuration, the microphone and speaker would be used for voice-dialing and data input.
    When a user picks up the handset of telephone 490 and initiates voice-dialing, the controller begins operation and microprocessor 440 executes software for control procedure 100 and the functions of the speech recognition component and speech synthesizer. When the user confirms the selection of a name from the stored directory, microprocessor 440 places a call to the stored telephone number for the selected name.
    E. Conclusion
    Performance of voice-dialing systems can be improved by providing a selection procedure that enables users to select a stored name corresponding to a spoken name by inputting one or more spoken letters associated with the spoken name. This increases the accuracy of the automatic speech recognition component in matching of incoming spoken names with names stored in the directory. It also makes voice-dialing systems easier to use.
    The present invention also facilitates fast and accurate voice-dialing within a site using a PBX system. According to this approach, a site-wide directory permits all users connected to the PBX system to use voice-dialing quickly, easily, and efficiently to make telephone calls.
    The foregoing description of an implementation of the invention has been presented for purposes of illustration and description. It is not intended to be exhaustive or to limit the invention to the precise form disclosed. Modifications and variations are possible in light of the above teachings or may be acquired from practice of the invention. For example, the above description relates to voice-dialing systems, whereas the present invention may be implemented in connection with other types of systems that use a directory including speech patterns and employ voice input to select names or other identifiers from the directory. Voice-mail systems are an example of such other systems. The scope of the invention is defined by the claims and their equivalents.

    Claims (16)

    1. A method for dialing a telephone by voice, comprising the steps of:
      receiving from a user a speech pattern corresponding to a name in a directory the user intends to call and at least one spoken letter associated with the name; and
      retrieving a telephone number corresponding to a name associated with the speech pattern.
    2. The method of claim 1, wherein the names in the directory are represented by a sequence of orthographic letters, and wherein the retrieving step includes the substeps of:
      converting sequences of orthographic letters corresponding to the names in the directory into sequences of phonemes; and
      comparing the sequences of phonemes to the speech pattern to identify a sequence of phonemes for a name in the directory that best matches the speech pattern.
    3. The method of claim 1, wherein the names in the directory are represented by sound patterns, and wherein the retrieving step includes the substeps of:
      converting the sound patterns for the names in the directory to orthographic letters; and
      comparing the orthographic letters for the names in the directory with an orthographic representation of the spoken letter.
    4. The method of claim 1, wherein the names in the directory are represented by a sequence of orthographic letters, and wherein the retrieving step includes the substep of:
         comparing the names in the directory with the speech pattern using a phoneme-level representation for the names as an intermediary.
    5. A method for providing voice-dialing assistance to a user, comprising the steps of:
      providing a directory of different names represented by phoneme strings and corresponding telephone numbers, said phoneme strings including initials for each of the directory names; and
      providing a user with access to the directory to initiate a telephone call by inputting a speech pattern corresponding to a name in the directory and at least one letter for the name, the input speech pattern and letter being compared with the phoneme strings of the directory to select from the directory a telephone number for one of the directory names that best matches the name of the input speech pattern.
    6. A method for providing voice-dialing to users, comprising the steps of:
      receiving from a user a speech pattern, the speech pattern indicating a name corresponding to a telephone number that the user intends to call, said speech pattern including a spoken name and at least one letter corresponding to the spoken name;
      utilizing the speech pattern to identify a portion of a directory containing different names and corresponding telephone numbers;
      providing to the user a selection of names from the directory determined to best match the speech pattern; and
      initiating a telephone call to one of the telephone numbers in accordance with the user's selection of a name.
    7. A method comprising the steps of:
      receiving from a user a speech pattern corresponding to a name in a directory the user intends to call;
      presenting the user with a name determined to correspond to the speech pattern; and
      receiving from the user an indication as to whether the presented name correctly matches the name the user intends to call, said indication including at least one spoken letter associated with the name the user intends to call.
    8. The method of claim 7 further comprising the step of:
         retrieving a telephone number corresponding to a name associated with the speech pattern and spoken letter.
    9. Apparatus for dialing a telephone by voice, comprising:
      a receiver configured to receive from a user a speech pattern corresponding to a name in a directory the user intends to call and at least one spoken letter associated with the name; and
      retrieving mechanism configured to retrieve a telephone number corresponding to a name associated with the speech pattern.
    10. The apparatus of claim 9, wherein the names in the directory are represented by a sequence of orthographic letters, and wherein the retrieving mechanism includes:
      a converter configured to convert sequences of orthographic letters corresponding to the names in the directory into sequences of phonemes; and
      a comparator configured to compare the sequences of phonemes to the speech pattern to identify a sequence of phonemes for a name in the directory that best matches the speech pattern.
    11. The apparatus of claim 10, wherein the names in the directory are represented by sound patterns, and wherein the retrieving mechanism includes:
      a converter configured to convert the sound patterns for the names in the directory to orthographic letters; and
      a comparator configured to compare the orthographic letters for the names in the directory with an orthographic representation of the spoken letter.
    12. The apparatus of claim 10, wherein the names in the directory are represented by a sequence of orthographic letters, and wherein the retrieving mechanism includes:
         a comparator configured to compare the names in the directory with the speech pattern using a phoneme-level representation for the names as an intermediary.
    13. Apparatus for providing voice-dialing assistance to a user, comprising:
      means for providing a directory of different names represented by phoneme strings and corresponding telephone numbers, said phoneme strings including initials for each of the directory names; and
      means for providing a user with access to the directory to initiate a telephone call by inputting a speech pattern corresponding to a name in the directory and at least one letter for the name, the input speech pattern and letter being compared with the phoneme strings of the directory to select from the directory a telephone number for one of the directory names that best matches the name of the input speech pattern.
    14. Apparatus for providing voice-dialing to users, comprising:
      a receiver configured to receive from a user a speech pattern, the speech pattern indicating a name corresponding to a telephone number that the user intends to call, said speech pattern including a spoken name and at least one letter corresponding to the spoken name;
      identifying mechanism configured to utilize the speech pattern to identify a portion of a directory containing different names and corresponding telephone numbers;
      selection mechanism configured to provide to the user a selection of names from the directory determined to best match the speech pattern; and
      an initiator configured to initiate a telephone call to one of the telephone numbers in accordance with the user's selection of a name.
    15. Apparatus comprising:
      a receiver configured to receive from a user a speech pattern corresponding to a name in a directory the user intends to call;
      presenting mechanism configured to present the user with a name determined to correspond to the speech pattern;
      said receiver configured to receive from the user an indication as to whether the presented name correctly matches the name the user intends to call, said indication including at least one spoken letter associated with the name the user intends to call; and
      retrieving mechanism to retrieve a telephone number corresponding to a name associated with the speech pattern and spoken letter.
    16. The apparatus of claim 15 further comprising:
         retrieving mechanism configured to retrieve a telephone number corresponding to a name associated with the speech pattern and spoken letter.
    EP97308792A 1996-11-05 1997-11-03 Voice-dialling system using both spoken names and initial letters in recognition Withdrawn EP0840488A3 (en)

    Applications Claiming Priority (2)

    Application Number Priority Date Filing Date Title
    US743933 1996-11-05
    US08/743,933 US5912949A (en) 1996-11-05 1996-11-05 Voice-dialing system using both spoken names and initials in recognition

    Publications (2)

    Publication Number Publication Date
    EP0840488A2 (en) 1998-05-06
    EP0840488A3 EP0840488A3 (en) 2000-03-15

    Family

    ID=24990765

    Family Applications (1)

    Application Number Title Priority Date Filing Date
    EP97308792A Withdrawn EP0840488A3 (en) 1996-11-05 1997-11-03 Voice-dialling system using both spoken names and initial letters in recognition

    Country Status (4)

    Country Link
    US (1) US5912949A (en)
    EP (1) EP0840488A3 (en)
    JP (1) JPH10215319A (en)
    CA (1) CA2220256C (en)

    Cited By (5)

    * Cited by examiner, † Cited by third party
    Publication number Priority date Publication date Assignee Title
    WO2001035620A1 (en) * 1999-11-12 2001-05-17 British Telecommunications Public Limited Company Voice activated dialling
    WO2001095600A1 (en) * 2000-06-09 2001-12-13 France Telecom Sa Method and device for connection without telephone number disclosure
    WO2002009395A2 (en) * 2000-07-07 2002-01-31 Science Applications International Corporation A system or method for calling a vanity number using speech recognition
    WO2014019036A1 (en) * 2012-07-31 2014-02-06 Boris Ivanov Tsigov System for facilitated telephone dialing and connection of telephone subscribers through module-directory and voice recognition module
    WO2019023763A1 (en) * 2017-08-04 2019-02-07 Tsigov Boris Method and system for rewarding when charging

    Families Citing this family (41)

    * Cited by examiner, † Cited by third party
    Publication number Priority date Publication date Assignee Title
    US6208713B1 (en) 1996-12-05 2001-03-27 Nortel Networks Limited Method and apparatus for locating a desired record in a plurality of records in an input recognizing telephone directory
    FR2761848B1 (en) * 1997-04-04 2004-09-17 Parrot Sa RADIOTELEPHONE VOICE COMMAND DEVICE, ESPECIALLY FOR USE IN A MOTOR VEHICLE
    US6236715B1 (en) * 1997-04-15 2001-05-22 Nortel Networks Corporation Method and apparatus for using the control channel in telecommunications systems for voice dialing
    JPH1117796A (en) * 1997-06-19 1999-01-22 Matsushita Electric Ind Co Ltd Voice recognition telephony equipment
    CA2219008C (en) * 1997-10-21 2002-11-19 Bell Canada A method and apparatus for improving the utility of speech recognition
    US6327346B1 (en) * 1998-09-01 2001-12-04 At&T Corp. Method and apparatus for setting user communication parameters based on voice identification of users
    US6839410B2 (en) 1998-09-01 2005-01-04 At&T Corp. Method and apparatus for setting user communication parameters based on voice identification of users
    US6128482A (en) * 1998-12-22 2000-10-03 General Motors Corporation Providing mobile application services with download of speaker independent voice model
    US6438520B1 (en) * 1999-01-20 2002-08-20 Lucent Technologies Inc. Apparatus, method and system for cross-speaker speech recognition for telecommunication applications
    US6574596B2 (en) * 1999-02-08 2003-06-03 Qualcomm Incorporated Voice recognition rejection scheme
    DE19914631A1 (en) * 1999-03-31 2000-10-12 Bosch Gmbh Robert Input procedure in a driver information system
    US7260187B1 (en) * 1999-05-11 2007-08-21 Verizon Services Corp. Voice response apparatus and method of providing automated voice responses with silent prompting
    US6690772B1 (en) * 2000-02-07 2004-02-10 Verizon Services Corp. Voice dialing using speech models generated from text and/or speech
    US6862610B2 (en) * 2000-05-08 2005-03-01 Ideaflood, Inc. Method and apparatus for verifying the identity of individuals
    US6865403B1 (en) * 2000-11-28 2005-03-08 Sprint Spectrum L.P. Method and system for simplified control of a subscriber terminal
    US6845251B2 (en) 2000-11-29 2005-01-18 Visteon Global Technologies, Inc. Advanced voice recognition phone interface for in-vehicle speech recognition applications
    US6671354B2 (en) * 2001-01-23 2003-12-30 Ivoice.Com, Inc. Speech enabled, automatic telephone dialer using names, including seamless interface with computer-based address book programs, for telephones without private branch exchanges
    US6940951B2 (en) * 2001-01-23 2005-09-06 Ivoice, Inc. Telephone application programming interface-based, speech enabled automatic telephone dialer using names
    US7177402B2 (en) * 2001-03-01 2007-02-13 Applied Voice & Speech Technologies, Inc. Voice-activated interactive multimedia information processing system
    KR100396817B1 (en) * 2001-06-04 2003-09-02 (주)씨에스테크놀로지 Voice Cognition Call Exchange System and The Method for Voice Cognition Exchange using the same thereof
    US7483520B2 (en) * 2001-08-06 2009-01-27 Qualcomm Incorporated Method and apparatus for prompting a cellular telephone user with instructions
    US7124085B2 (en) * 2001-12-13 2006-10-17 Matsushita Electric Industrial Co., Ltd. Constraint-based speech recognition system and method
    KR100433550B1 (en) * 2002-05-25 2004-05-31 삼성전자주식회사 Apparatus and method for speedy voice dialing
    GB0312271D0 (en) * 2003-05-29 2003-07-02 Ibm A voice operated directory dialler
    US6983244B2 (en) * 2003-08-29 2006-01-03 Matsushita Electric Industrial Co., Ltd. Method and apparatus for improved speech recognition with supplementary information
    GB0327416D0 (en) * 2003-11-26 2003-12-31 Ibm Directory dialler name recognition
    GB0328035D0 (en) * 2003-12-03 2004-01-07 British Telecomm Communications method and system
    CN100419751C (en) * 2004-03-11 2008-09-17 台达电子工业股份有限公司 Query pattern employing voice input and mobile electronic device employing voice input
    KR100827074B1 (en) 2004-04-06 2008-05-02 삼성전자주식회사 Apparatus and method for automatic dialling in a mobile portable telephone
    US7110949B2 (en) * 2004-09-13 2006-09-19 At&T Knowledge Ventures, L.P. System and method for analysis and adjustment of speech-enabled systems
    WO2007019307A2 (en) 2005-08-03 2007-02-15 Somatic Technologies, Inc. Somatic, auditory and cochlear communication system and method
    US20070217396A1 (en) * 2006-03-14 2007-09-20 Aibelive Co., Ltd. Method and apparatus for making VoIP connection through network
    US20070286398A1 (en) * 2006-06-07 2007-12-13 Venkatesan Ramamoorthy Voice Recognition Dialing For Alphabetic Phone Numbers
    US20080045256A1 (en) * 2006-08-16 2008-02-21 Microsoft Corporation Eyes-free push-to-talk communication
    US7599921B2 (en) * 2007-03-02 2009-10-06 International Business Machines Corporation System and method for improved name matching using regularized name forms
    KR100883105B1 (en) 2007-03-30 2009-02-11 삼성전자주식회사 Method and apparatus for dialing voice recognition in a portable terminal
    US8484034B2 (en) * 2008-03-31 2013-07-09 Avaya Inc. Arrangement for creating and using a phonetic-alphabet representation of a name of a party to a call
    US8787977B2 (en) * 2010-04-08 2014-07-22 General Motors Llc Method of controlling dialing modes in a vehicle
    US9691377B2 (en) 2013-07-23 2017-06-27 Google Technology Holdings LLC Method and device for voice recognition training
    US9275638B2 (en) 2013-03-12 2016-03-01 Google Technology Holdings LLC Method and apparatus for training a voice recognition model database
    US9548047B2 (en) 2013-07-31 2017-01-17 Google Technology Holdings LLC Method and apparatus for evaluating trigger phrase enrollment

    Citations (3)

    * Cited by examiner, † Cited by third party
    Publication number Priority date Publication date Assignee Title
    US5007081A (en) * 1989-01-05 1991-04-09 Origin Technology, Inc. Speech activated telephone
    US5369685A (en) * 1991-03-07 1994-11-29 Sprint Communications Company L.P. Voice-activated telephone directory and call placement system
    US5371779A (en) * 1992-03-13 1994-12-06 Nec Corporation Call initiating system for mobile telephone units

    Family Cites Families (91)

    * Cited by examiner, † Cited by third party
    Publication number Priority date Publication date Assignee Title
    US3742143A (en) * 1971-03-01 1973-06-26 Bell Telephone Labor Inc Limited vocabulary speech recognition circuit for machine and telephone control
    US4313035A (en) * 1980-01-18 1982-01-26 Bell Telephone Laboratories, Incorporated Method of providing person locator service
    US4348550A (en) * 1980-06-09 1982-09-07 Bell Telephone Laboratories, Incorporated Spoken word controlled automatic dialer
    US4593157A (en) * 1984-09-04 1986-06-03 Usdan Myron S Directory interface and dialer
    FR2571191B1 (en) * 1984-10-02 1986-12-26 Renault RADIOTELEPHONE SYSTEM, PARTICULARLY FOR MOTOR VEHICLE
    ATE43467T1 (en) * 1985-09-03 1989-06-15 Motorola Inc HANDS-FREE RADIO TELEPHONE.
    US5182765A (en) * 1985-11-26 1993-01-26 Kabushiki Kaisha Toshiba Speech recognition system with an accurate recognition function
    US4959855A (en) * 1986-10-08 1990-09-25 At&T Bell Laboratories Directory assistance call processing and calling customer remote signal monitoring arrangements
    US4829576A (en) * 1986-10-21 1989-05-09 Dragon Systems, Inc. Voice recognition system
    JP2584249B2 (en) * 1986-10-31 1997-02-26 三洋電機株式会社 Voice recognition phone
    US4862498A (en) * 1986-11-28 1989-08-29 At&T Information Systems, Inc. Method and apparatus for automatically selecting system commands for display
    US4827500A (en) * 1987-01-30 1989-05-02 American Telephone And Telegraph Company, At&T Bell Laboratories Automatic speech recognition to select among call destinations
    US4959850A (en) * 1987-05-29 1990-09-25 Kabushiki Kaisha Toshiba Radio telephone apparatus
    EP0293259A3 (en) * 1987-05-29 1990-03-07 Kabushiki Kaisha Toshiba Voice recognition system used in telephone apparatus
    DE3819538C3 (en) * 1987-06-08 1996-08-14 Ricoh Kk Voice activated dialer
    US4979206A (en) * 1987-07-10 1990-12-18 At&T Bell Laboratories Directory assistance systems
    US4754951A (en) * 1987-08-14 1988-07-05 Union Carbide Corporation Tuyere assembly and positioning method
    EP0307193B1 (en) * 1987-09-11 1993-11-18 Kabushiki Kaisha Toshiba Telephone apparatus
    EP0311414B2 (en) * 1987-10-08 1997-03-12 Nec Corporation Voice controlled dialer having memories for full-digit dialing for any users and abbreviated dialing for authorized users
    US4928302A (en) * 1987-11-06 1990-05-22 Ricoh Company, Ltd. Voice actuated dialing apparatus
    JPH01167898A (en) * 1987-12-04 1989-07-03 Internatl Business Mach Corp <Ibm> Voice recognition equipment
    US4924496A (en) * 1988-05-12 1990-05-08 Romek Figa D/B/A Abraham & Sons Automatic incoming telephone call originating number and party display system
    JPH02209055A (en) * 1989-02-09 1990-08-20 Toshiba Corp Telephone set
    US5301227A (en) * 1989-04-17 1994-04-05 Sanyo Electic Co., Ltd. Automatic dial telephone
    JP2927891B2 (en) * 1989-06-19 1999-07-28 日本電気株式会社 Voice dialing device
    US5121423A (en) * 1989-07-13 1992-06-09 Sharp Kabushiki Kaisha Communication unit comprising caller identification function and caller identifying method in a digital communication network
    JP3045510B2 (en) * 1989-12-06 2000-05-29 富士通株式会社 Speech recognition processor
    JPH03270453A (en) * 1990-03-20 1991-12-02 Fujitsu Ltd Automatic follow-up telephony device
    US5187735A (en) * 1990-05-01 1993-02-16 Tele Guia Talking Yellow Pages, Inc. Integrated voice-mail based voice and information processing system
    US5168548A (en) * 1990-05-17 1992-12-01 Kurzweil Applied Intelligence, Inc. Integrated voice controlled report generating and communicating system
    US5313516A (en) * 1990-05-31 1994-05-17 Phonemate Inc. Telephone answering device with automatic function
    FI89652C (en) * 1990-09-27 1993-10-25 Nokia Mobile Phones Ltd Procedure for speed dialing on a telephone set
    US5165095A (en) * 1990-09-28 1992-11-17 Texas Instruments Incorporated Voice telephone dialing
    US5181237A (en) * 1990-10-12 1993-01-19 At&T Bell Laboratories Automation of telephone operator assistance calls
    US5185781A (en) * 1990-10-12 1993-02-09 At&T Bell Laboratories Automation of telephone operator assistance calls
    US5243645A (en) * 1990-11-01 1993-09-07 At&T Bell Laboratories Automatic system for forwarding of calls
    US5163081A (en) * 1990-11-05 1992-11-10 At&T Bell Laboratories Automated dual-party-relay telephone system
    US5204894A (en) * 1990-11-09 1993-04-20 Bell Atlantic Network Services, Inc. Personal electronic directory
    JPH04207341A (en) * 1990-11-30 1992-07-29 Sony Corp Radio telephone system
    US5155763A (en) * 1990-12-11 1992-10-13 International Business Machines Corp. Look ahead method and apparatus for predictive dialing using a neural network
    GB2251763B (en) * 1991-01-11 1995-06-21 Technophone Ltd Telephone apparatus with calling line identification
    US5553125A (en) * 1991-01-11 1996-09-03 Nokia Mobile Phones (U.K.) Limited Telephone apparatus with calling line identification
    JP2707854B2 (en) * 1991-02-06 1998-02-04 日本電気株式会社 Mobile phone
    US5230017A (en) * 1991-11-08 1993-07-20 British Technology Group Usa Communication line monitoring system
    EP0543329B1 (en) * 1991-11-18 2002-02-06 Kabushiki Kaisha Toshiba Speech dialogue system for facilitating human-computer interaction
    JP3064627B2 (en) * 1992-01-28 2000-07-12 富士通株式会社 Service control device
    US5315649A (en) * 1992-04-15 1994-05-24 Vcs Industries, Inc. Toll call telephone service center
    US5333184A (en) * 1992-05-06 1994-07-26 At&T Bell Laboratories Call message recording for telephone systems
    US5329578A (en) * 1992-05-26 1994-07-12 Northern Telecom Limited Personal communication service with mobility manager
    JPH0614098A (en) * 1992-06-26 1994-01-21 Sharp Corp Telephone set with abbreviation dial list generation aid tool
    US5274699A (en) * 1992-07-24 1993-12-28 Motorola, Inc. Method for providing caller identification to a call recipient
    US5353336A (en) * 1992-08-24 1994-10-04 At&T Bell Laboratories Voice directed communications system archetecture
    US5325421A (en) * 1992-08-24 1994-06-28 At&T Bell Laboratories Voice directed communications system platform
    CA2078045C (en) * 1992-09-11 1999-11-16 Mark R. Sestak Global management of telephone directory
    JPH06121014A (en) * 1992-10-06 1994-04-28 Kyocera Corp Communication terminal equipment
    US5452397A (en) * 1992-12-11 1995-09-19 Texas Instruments Incorporated Method and system for preventing entry of confusingly similar phases in a voice recognition system vocabulary list
    US5465401A (en) * 1992-12-15 1995-11-07 Texas Instruments Incorporated Communication system and methods for enhanced information transfer
    AU5803394A (en) * 1992-12-17 1994-07-04 Bell Atlantic Network Services, Inc. Mechanized directory assistance
    US5717738A (en) * 1993-01-11 1998-02-10 Texas Instruments Incorporated Method and device for generating user defined spoken speed dial directories
    US5483579A (en) * 1993-02-25 1996-01-09 Digital Acoustics, Inc. Voice recognition dialing system
    US5430791A (en) * 1993-02-26 1995-07-04 At&T Corp. Technique for administering personal telephone numbers
    CA2091658A1 (en) * 1993-03-15 1994-09-16 Matthew Lennig Method and apparatus for automation of directory assistance using speech recognition
    US5452340A (en) * 1993-04-01 1995-09-19 Us West Advanced Technologies, Inc. Method of voice activated telephone dialing
    DE69402716T2 (en) * 1993-06-11 1997-12-11 Northern Telecom Ltd., Montreal, Quebec METHOD FOR SUPPLYING CALL MANAGEMENT SERVICES CONTROLLED BY THE USER
    US5487111A (en) * 1993-07-29 1996-01-23 At&T Ipm Corp. Telecommunications system sequence calling
    JPH0795279A (en) * 1993-09-20 1995-04-07 Fujitsu Ltd Memory dialing control system
    US5371781A (en) * 1993-09-30 1994-12-06 At&T Corp. System and method for identifying the incoming directory number when multiple directory numbers are assigned to one wireless device
    US5392342A (en) * 1993-10-27 1995-02-21 At&T Corp. Technique for use in sequentially routing personal telephone calls
    CA2136796C (en) * 1993-11-29 1998-11-24 Shinichi Urasaka Cordless telephone apparatus
    US5535503A (en) 1993-12-03 1996-07-16 Globe Products Inc. Stator lead wire connection method and apparatus
    US5394464A (en) * 1994-03-01 1995-02-28 At&T Corp. Progressive automatic activation and automatic resetting of call coverage
    JPH07283858A (en) * 1994-04-06 1995-10-27 Nippon Telegr & Teleph Corp <Ntt> Talking opposite party automatic registration type voice dialer
    JPH07282203A (en) * 1994-04-07 1995-10-27 Hitachi Ltd Character input device
    US5488652A (en) * 1994-04-14 1996-01-30 Northern Telecom Limited Method and apparatus for training speech recognition algorithms for directory assistance applications
    US5642411A (en) * 1994-04-25 1997-06-24 Illinois Technology Transfer Llc Anticipatory call distributor
    US5509103A (en) * 1994-06-03 1996-04-16 Motorola, Inc. Method of training neural networks used for speech recognition
    JPH07336426A (en) * 1994-06-08 1995-12-22 Sanyo Electric Co Ltd Communication equipment
    JPH07332001A (en) * 1994-06-13 1995-12-19 Kazuo Kimiwada Sealed type gas pressure engine
    JP2776400B2 (en) * 1994-08-04 1998-07-16 日本電気株式会社 Phone number display
    US5600704A (en) * 1994-08-30 1997-02-04 Ericsson Inc. Systems and methods for prioritized routing of telephone calls to a subscriber
    CA2132610C (en) * 1994-09-21 1998-04-28 Deborah L. Pinard Delayed seizure on associated devices
    US5568546A (en) * 1994-10-31 1996-10-22 Lucent Technologies, Inc. Method and apparatus for dynamic abbreviated dialing assignment
    US5479489A (en) * 1994-11-28 1995-12-26 At&T Corp. Voice telephone dialing architecture
    US5706339A (en) * 1994-11-30 1998-01-06 At&T Technique for use in processing personal telephone calls
    US5724411A (en) * 1995-03-22 1998-03-03 At&T Corp. Method for selectively alerting multiple telephones of an incoming call
    US5524145A (en) * 1995-04-06 1996-06-04 Bell Atlantic Network Services, Inc. Incoming call completion threshold restriction
    US5583564A (en) * 1995-04-24 1996-12-10 Lucent Technologies Inc. Intelligent call forwarding with videophone display of forwarding destination
    US5712957A (en) * 1995-09-08 1998-01-27 Carnegie Mellon University Locating and correcting erroneously recognized portions of utterances by rescoring based on two n-best lists
    US5742674A (en) * 1995-12-22 1998-04-21 At&T Corp. Automatic call-back system and method using data indicating best time to call
    US5719921A (en) * 1996-02-29 1998-02-17 Nynex Science & Technology Methods and apparatus for activating telephone services in response to speech
    CA2180684C (en) * 1996-07-08 2001-08-21 Paul Erb Automatic call forwarding

    Patent Citations (3)

    * Cited by examiner, † Cited by third party
    Publication number Priority date Publication date Assignee Title
    US5007081A (en) * 1989-01-05 1991-04-09 Origin Technology, Inc. Speech activated telephone
    US5369685A (en) * 1991-03-07 1994-11-29 Sprint Communications Company L.P. Voice-activated telephone directory and call placement system
    US5371779A (en) * 1992-03-13 1994-12-06 Nec Corporation Call initiating system for mobile telephone units

    Non-Patent Citations (1)

    Title
    ATTWATER D J ET AL: "ISSUES IN LARGE-VOCABULARY INTERACTIVE SPEECH SYSTEMS", BT TECHNOLOGY JOURNAL, GB, BT LABORATORIES, vol. 14, no. 1, pages 177-186, XP000554647, ISSN: 1358-3948 *

    Cited By (6)

    Publication number Priority date Publication date Assignee Title
    WO2001035620A1 (en) * 1999-11-12 2001-05-17 British Telecommunications Public Limited Company Voice activated dialling
    WO2001095600A1 (en) * 2000-06-09 2001-12-13 France Telecom Sa Method and device for connection without telephone number disclosure
    WO2002009395A2 (en) * 2000-07-07 2002-01-31 Science Applications International Corporation A system or method for calling a vanity number using speech recognition
    WO2002009395A3 (en) * 2000-07-07 2003-01-09 Science Applic Int Corp A system or method for calling a vanity number using speech recognition
    WO2014019036A1 (en) * 2012-07-31 2014-02-06 Boris Ivanov Tsigov System for facilitated telephone dialing and connection of telephone subscribers through module-directory and voice recognition module
    WO2019023763A1 (en) * 2017-08-04 2019-02-07 Tsigov Boris Method and system for rewarding when charging

    Also Published As

    Publication number Publication date
    EP0840488A3 (en) 2000-03-15
    JPH10215319A (en) 1998-08-11
    CA2220256C (en) 2003-06-17
    US5912949A (en) 1999-06-15
    CA2220256A1 (en) 1998-05-05

    Similar Documents

    Publication Publication Date Title
    US5912949A (en) Voice-dialing system using both spoken names and initials in recognition
    US5917891A (en) Voice-dialing system using adaptive model of calling behavior
    EP0780829B1 (en) Method for automatic speech recognition in telephony
    US5930336A (en) Voice dialing server for branch exchange telephone systems
    US7062435B2 (en) Apparatus, method and computer readable memory medium for speech recognition using dynamic programming
    US6766295B1 (en) Adaptation of a speech recognition system across multiple remote sessions with a speaker
    US6167117A (en) Voice-dialing system using model of calling behavior
    US5615296A (en) Continuous speech recognition and voice response system and method to enable conversational dialogues with microprocessors
    US6462616B1 (en) Embedded phonetic support and TTS play button in a contacts database
    US5752232A (en) Voice activated device and method for providing access to remotely retrieved data
    US5651055A (en) Digital secretary
    EP0890249B1 (en) Apparatus and method for reducing speech recognition vocabulary perplexity and dynamically selecting acoustic models
    EP0804850B1 (en) Automatic vocabulary generation for telecommunications network-based voice-dialing
    US6873951B1 (en) Speech recognition system and method permitting user customization
    US8694316B2 (en) Methods, apparatus and computer programs for automatic speech recognition
    JP3561076B2 (en) Automatic call recognition method for arbitrarily spoken words
    US7318029B2 (en) Method and apparatus for a interactive voice response system
    US5752230A (en) Method and apparatus for identifying names with a speech recognition program
    US6671354B2 (en) Speech enabled, automatic telephone dialer using names, including seamless interface with computer-based address book programs, for telephones without private branch exchanges
    US20050049858A1 (en) Methods and systems for improving alphabetic speech recognition accuracy
    Smith et al. Voice activated automated telephone call routing
    CA2256781A1 (en) Method and apparatus for automatically dialling a desired telephone number using speech commands
    EP1213707B1 (en) Pattern matching method and apparatus and telephony system
    JPS61143798A (en) Voice dialing apparatus
    JPS5860863A (en) Transmission system for tone for indicating permission to voice

    Legal Events

    Date Code Title Description
    PUAI Public reference made under Article 153(3) EPC to a published international application that has entered the European phase

    Free format text: ORIGINAL CODE: 0009012

    AK Designated contracting states

    Kind code of ref document: A2

    Designated state(s): DE FR GB

    RAP3 Party data changed (applicant data changed or rights of an application transferred)

    Owner name: NORTEL NETWORKS CORPORATION

    PUAL Search report despatched

    Free format text: ORIGINAL CODE: 0009013

    AK Designated contracting states

    Kind code of ref document: A3

    Designated state(s): AT BE CH DE DK ES FI FR GB GR IE IT LI LU MC NL PT SE

    RAP1 Party data changed (applicant data changed or rights of an application transferred)

    Owner name: NORTEL NETWORKS LIMITED

    17P Request for examination filed

    Effective date: 20000915

    AKX Designation fees paid

    Free format text: DE FR GB

    RAP1 Party data changed (applicant data changed or rights of an application transferred)

    Owner name: NORTEL NETWORKS LIMITED

    17Q First examination report despatched

    Effective date: 20041104

    GRAP Despatch of communication of intention to grant a patent

    Free format text: ORIGINAL CODE: EPIDOSNIGR1

    STAA Information on the status of an EP patent application or granted EP patent

    Free format text: STATUS: THE APPLICATION IS DEEMED TO BE WITHDRAWN

    18D Application deemed to be withdrawn

    Effective date: 20070303