USH1646H - Speech recognition adapter for telephone system - Google Patents

Speech recognition adapter for telephone system Download PDF

Info

Publication number
USH1646H
USH1646H US08/059,035 US5903593A USH1646H US H1646 H USH1646 H US H1646H US 5903593 A US5903593 A US 5903593A US H1646 H USH1646 H US H1646H
Authority
US
United States
Prior art keywords
voice sound
telephone
speech recognition
telephone set
memory
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Abandoned
Application number
US08/059,035
Inventor
Yasuhiko Kato
Masao Watari
Miyuki Tanaka
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Sony Corp
Original Assignee
Sony Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Sony Corp filed Critical Sony Corp
Assigned to SONY CORPORATION reassignment SONY CORPORATION ASSIGNMENT OF ASSIGNORS INTEREST (SEE DOCUMENT FOR DETAILS). Assignors: KATO, YASUHIKO, TANAKA, MIYUKI, WATARI, MASAO
Application granted granted Critical
Publication of USH1646H publication Critical patent/USH1646H/en
Abandoned legal-status Critical Current

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M1/00Substation equipment, e.g. for use by subscribers
    • H04M1/64Automatic arrangements for answering calls; Automatic arrangements for recording messages for absent subscribers; Arrangements for recording conversations
    • H04M1/65Recording arrangements for recording a message from the calling party
    • H04M1/652Means for playing back the recorded messages by remote control over a telephone line
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M1/00Substation equipment, e.g. for use by subscribers
    • H04M1/26Devices for calling a subscriber
    • H04M1/27Devices whereby a plurality of signals may be stored simultaneously
    • H04M1/271Devices whereby a plurality of signals may be stored simultaneously controlled by voice recognition
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M1/00Substation equipment, e.g. for use by subscribers
    • H04M1/26Devices for calling a subscriber
    • H04M1/27Devices whereby a plurality of signals may be stored simultaneously
    • H04M1/274Devices whereby a plurality of signals may be stored simultaneously with provision for storing more than one subscriber number at a time, e.g. using toothed disc
    • H04M1/2745Devices whereby a plurality of signals may be stored simultaneously with provision for storing more than one subscriber number at a time, e.g. using toothed disc using static electronic memories, e.g. chips
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M2201/00Electronic components, circuits, software, systems or apparatus used in telephone systems
    • H04M2201/40Electronic components, circuits, software, systems or apparatus used in telephone systems using speech recognition
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M2250/00Details of telephonic subscriber devices
    • H04M2250/74Details of telephonic subscriber devices with voice recognition means

Definitions

  • the present invention relates to a speech recognition adapter for use with one such as a telephone set with a telephone answering function or capabilities.
  • Conventional telephone sets with a telephone answering function generally record messages from incoming calls that arrive while a called party is out or otherwise unavailable to answer the telephone, on magnetic tape or on some type of electronic memory device.
  • a called party can call his or her own telephone set from a remote telephone set, such as a telephone set outside his or her home, and press pushbuttons of the keypad of the remote telephone set to transmit a signal representing a pre-registered password as well as a predetermined signal to reproduce and listen to any recorded incoming call messages that may have been recorded on the tape or memory.
  • a telephone set having a telephone answering function can be remotely controlled via commands dialed or keyed in through a remote telephone keypad, or dial, to perform various functions, e.g., reproduce and erase recorded messages.
  • pushbuttons thereof are generally depressed to dial a desired telephone number of a person to whom the telephone call is to be made. Since different telephone numbers are used to call different persons, those different telephone numbers generally also have to be memorized by a user.
  • a voice activated telephone set having a memory, for example, which stores the names of persons and their telephone numbers.
  • the telephone set transmits a pulse or dial tone signal corresponding to the stored telephone number that is associated with the uttered name.
  • Wide spread use of this type of telephone set is currently impractical due to the relatively high cost.
  • a speech recognition adapter for use in a telephone system having a telephone set and an exchange office or central switching office, adapted to be connected between a modular jack coupled to the telephone set and a modular jack coupled to the exchange office, the speech recognition adapter incorporates a speech recognizing unit for recognizing a voice sound, or voice sound pattern, a storage control unit for storing a voice sound recognized by the speech recognizing unit in association with particular command information such as, for example, telephone numbers, names or telephone set operational functions and a transmitting unit for transmitting or dialing a telephone number or issuing a signal representing telephone set operational functions in response to a recognized voice sound pattern.
  • the speech recognizing unit recognizes or detects a voice sound representing the name of a person to be called via the telephone set. If the detected voice sound corresponds to a name stored in the storage control unit, the transmitting unit transmits or dials the telephone number associated with the name stored in memory to which the detected voice sound corresponds.
  • the speech recognizing unit may alternatively recognize a voice sound transmitted from the exchange office representing or corresponding to a function of the auxiliary answer machine of the telephone set.
  • the speech recognizing adapter may be responsive to the voice sound transmitted from the exchange office and recognized by the speech recognizing unit for transmitting a message or performing an operation associated with a command stored in the storage control means which corresponds to a detected voice sound.
  • the speech recognition adapter is connected between the modular jack coupled to the telephone set and the modular jack coupled to the exchange office.
  • the speech recognition adapter recognizes a voice sound corresponding to or the name of a person to be called or a function of the telephone set.
  • the voice sound is stored in memory in association with a telephone number transmitted from the telephone set.
  • the speech recognition adapter transmits or dials a telephone number.
  • the speech recognition adapter can dial a telephone number or operate the telephone set under the control of a voice sound. The speech recognition adapter can thus operate the telephone set with ease at low cost.
  • the speech recognizing unit recognizes a voice sound transmitted from the telephone set, which represents the name of a person to be called, and the transmitting unit transmits or dials a telephone number representing the name of a person to be called which is stored in the storage control means to the exchange office in response to the voice sound from the telephone set and recognized by the speech recognizing unit.
  • the telephone set can be operated with ease because the user can make a telephone call simply by pronouncing or uttering the name of the person to be called.
  • the speech recognizing unit recognizes a voice sound from the exchange office, which represents a function of the telephone set.
  • the transmitting means transmits a telephone number stored in the storage control unit to the telephone set in response to the voice sound representing the function of the telephone set from the exchange office and recognized by the speech recognizing means.
  • the user is not required to memorize how to operate the telephone set, and the telephone set can be operated with ease because the user can control the telephone set simply by pronouncing or uttering the desired function of the telephone set.
  • FIG. 1 is a block diagram of a speech recognition adapter according to the present invention which is directly connected to modular jacks on telephone set and exchange office sides;
  • FIG. 2 is a flowchart of an operation sequence of the speech recognition adapter shown in FIG. 1 when it operates in a name registration mode;
  • FIG. 3 is a flowchart of an operation sequence of the speech recognition adapter shown in FIG. 1 when it operates in a command registration mode;
  • FIG. 4 is an illustration showing the relation of FIG. 4A and 4B to each other;
  • FIG. 4A and FIG. 4B are a flowchart of an operation sequence of the speech recognition adapter shown in FIG. 1 when it operates in a speech registration mode;
  • FIG. 5 is a block diagram of the speech recognition adapter that is connected to modular jacks on telephone set and exchange office sides through a three-way modular jack.
  • a speech recognition adapter As shown in FIG. 1, a speech recognition adapter, generally designated by the reference numeral 1, according to the present invention is connected between a telephone set 10 and an exchange office.
  • the telephone set 10 has a telephone answering function, and is of a known nature capable of performing various functions, e.g., reproducing and erasing messages that have been recorded on a built-in tape or electronic memory device (not shown) while the user is away, under the control of dial numbers transmitted from a remote telephone set (not shown).
  • the speech recognition adapter 1 incorporates a tone signal generator 2, a noise filter 3, a speech recognition unit 4, a line sensor 5, a controller 6, a registration switch 7, and a name/command selector switch 8.
  • the speech recognition adapter 1 is connected to a modular jack 11 which is coupled to the telephone set 10 and a modular jack 12 which is coupled to the exchange office.
  • the tone signal generator 2 is controlled by the controller 6 to transmit a pushbutton or touchtone signal or a dial pulse signal corresponding to a dialed number to the telephone set 10 or the exchange office.
  • the noise filter 3 serves to remove noise that may be present on the telephone transmission line and also a dial tone transmitted from the exchange office (switchboard, not shown) when the handset (not shown) of the telephone set 10 is off-hook.
  • the speech recognition unit 4 recognizes or detects, through the noise filter 3, a speech or voice signal transmitted from the telephone set 10 or the exchange office, and supplies a recognized result output signal to the controller 6.
  • the line sensor 5 monitors the condition of the telephone transmission line for a change in the line voltage, for example, it detects whether the handset of the telephone set 10 is off-hook to make a telephone call from the telephone set 10 or to answer a telephone call from the exchange office, and outputs a recognized result output signal to the controller 6.
  • the line sensor 5 also detects which keys have been activated or dialed from the telephone set 10 or the exchange office, to the telephone transmission line. The information or commands represented by the activated keys is output from line sensor 5 to controller 6.
  • the controller 6 When registration switch 7 is turned on, i.e., when the speech recognition adapter 1 is in a registration mode, the controller 6 associates an uttered voice sound, which has been outputted from the telephone set 10 and recognized by the speech recognition unit 4, indicative of the name of a person to be called or desired function of the telephone set 10, with a dial number entered into memory by keys of actuating the keypad or dial (not shown) of telephone set 10.
  • the dialed number or command is stored in memory in association with the appropriate voice or verbal sound pattern information (not shown).
  • the controller 6 When the registration switch 7 is turned off, i.e., when the speech recognition adapter 1 is in a speech recognition mode, the controller 6 reads, from the built-in memory, a dial number associated with a pronounced uttered voice sound indicative of the name of a person to be called or a function of the telephone set 10. A voice sound is outputted from the telephone set 10 or the exchange office and recognized or detected by the speech recognition unit 4. A control signal is sent to tone signal generator 2 to transmit the dial number to the telephone set 10 or the exchange office.
  • the registration switch 7 serves to switch the operation mode of the speech recognition adapter 1 between a registration mode and a speech recognition mode.
  • the name/command selector switch 8 is effective or active when the registration switch 7 is turned on.
  • the name/command selector switch 8 is shifted to select a name mode when a dial number to be stored in the built-in memory of the controller 6 is a telephone number associated with the name of a person to be called.
  • the name/command selector switch 8 is shifted to select a command mode when a dial number to be stored in the built-in memory of the controller 6 is a command associated with a function of the telephone set 10.
  • the registration switch 7 is turned on and the name/command selector switch 8 is shifted to select the name mode, i.e., the speech recognition adapter 1 is in a name registration mode.
  • the line sensor 5 detects the off-hook of the handset, and indicates the off-hook condition to the controller 6.
  • the controller 6 then applies a control signal to the speech recognition unit 4 to register the voice sound (indicating the name of the person) that is supplied from the telephone set 10 (handset of the telephone set 10) through the modular jack 11 and the noise filter 3.
  • the speech recognition unit 4 Upon detection of the control signal from the controller 6, the speech recognition unit 4 registers the voice sound (indicating the name of the person) that is supplied from the telephone set 10 (handset of the telephone set 10) through the modular jack 11 and the noise filter 3, as a standard pattern in a built-in standard pattern memory.
  • the speech recognition unit 4 uses one of the two uttered names of a person, which has been uttered twice while the handset of the telephone unit 10 is lifted off-hook, for registration, and the other utterance for confirmation. Specifically, only when the voice sound patterns of the two utterances of the name agree with each other, will the speech recognition unit 4 store either one of the utterances of the name in the built-in standard pattern memory.
  • the telephone number entered is supplied through the line sensor 5 to the controller 6.
  • the telephone number is associated with the registered name stored in the built-in standard pattern memory of the speech recognition unit 4, and stored in the built-in memory of the controller 6.
  • the registration switch 7 is turned on and the name/command selector switch 8 is shifted to select the command mode, i.e., the speech recognition adapter 1 is in a command (function of the telephone set 10) registration mode.
  • a password number is first registered. Specifically, the handset of the telephone set 10 is lifted off-hook and a multi-digit dial number, for example, a four-digit dial number, is input as a password number via pushbuttons of the keypad of the telephone set 10, and then the handset of the telephone set 10 is placed on-hook.
  • the password number is supplied through the line sensor 5 to the controller 6, and stored in the built-in memory of the controller 6.
  • a voice sound representing a command for executing a function of the telephone set 10 for example, a voice sound of "PLAYBACK" representing a command to play back messages that have been recorded on the tape, is uttered twice by a party.
  • the line sensor 5 detects the on-hook of the handset, and indicates the on-hook condition to the controller 6.
  • the controller 6 then applies a control signal to the speech recognition unit 4 to register the voice sound (indicating the command "PLAYBACK") that is supplied from the telephone set 10 (handset of the telephone set 10) through the modular jack 11 and the noise filter 3.
  • the speech recognition unit 4 Upon detection of the control signal from the controller 6, the speech recognition unit 4 registers the voice sound (indicating the command "PLAYBACK") that is supplied from the telephone set 10 (handset of the telephone set 10) through the modular jack 11 and the noise filter 3, as a standard pattern in a built-in standard pattern memory.
  • the speech recognition unit 4 uses one of the two utterances of the name of the person, which has been uttered while the handset of the telephone unit 10 is lifted off-hook, for registration, and the other utterance is used for confirmation.
  • dial number for performing the function of the telephone set 10 which corresponds to the previously uttered command "PLAYBACK", i.e., a dial number (e.g., "#1") for playing back messages that have been recorded on the tape, and the handset of the telephone set 10 is placed on-hook
  • the dial number is supplied through the line sensor 5 to the controller 6.
  • the dial number is associated with the registered voice sound (command "PLAYBACK") stored in the standard pattern memory of the speech recognition unit 4, and stored in the built-in memory of the controller 6.
  • the registration switch 7 is turned off, i.e., the speech recognition adapter 1 is in a speech recognition mode.
  • the line sensor 5 detects the off-hook of the handset of the telephone set 10, and indicates the off-hook condition to the controller 6.
  • the controller 6 applies a control signal to the speech recognition unit 4 to recognize a voice sound (representing the name of a person to be called) output or transmitted from the telephone set 10.
  • the speech recognition unit 4 starts sampling or recognizing the voice sound which represents the name of a person to be called.
  • the line sensor 5 detects the dial numbers forwards information represented by the detected dial numbers to the controller 6.
  • the controller 6 outputs a control signal to the speech recognition unit 4 to cause the speech recognition unit 4 to stop the speech recognition process.
  • the speech recognition adapter 1 establishes a through connection allowing a signal from the modular jack 11 to be outputted to the modular jack 12 and also allowing a signal from the modular jack 12 to be outputted to the modular jack 11.
  • the uttered voice sound (representing the name of the person) is supplied from the telephone set 10 through the modular jack 11 and the noise filter 3 to the speech recognition unit 4.
  • the speech recognition unit 4 refers to the standard patterns stored in the built-in standard pattern memory. If the uttered voice sound corresponds to a voice sound pattern stored in the standard pattern memory, a signal is sent to the controller 6. The controller 6 then reads dial numbers (representing the telephone number) which are associated with the name of the person from the speech recognition unit 4 from the built-in memory, and supplies the dial numbers to the tone signal generator 2.
  • the tone signal generator 2 supplies a touch-tone or dial pulse signal corresponding to the dial numbers (representing the telephone number) supplied from the controller 6, through the modular jack 12 to the exchange office to dial the telephone number associated with the name of the person whose name was pronounced or uttered.
  • the user can make a telephone call to a person simply by uttering the name of the person.
  • a message of the telephone answering mode for example, "I AM OUT NOW. PLEASE LEAVE A MESSAGE.”, is transmitted from the telephone set 10. If any dial numbers indicating a password number are not immediately thereafter transmitted from the calling party's telephone set, the speech recognition adapter 1 sets up a through connection, and the telephone set 10 operates in the telephone answering mode.
  • the line sensor 5 detects the password number and supplies it to the controller 6.
  • the controller 6 determines whether or not the password number from the line sensor 5 agrees or is identical to a password number that has been registered in a memory. If the password number detected by the line sensor 5 agrees with a password number that has been registered in memory, then the controller 6 outputs a control signal to the speech recognition unit 4 to recognize a voice sound (representing a command) transmitted from a calling party's telephone set. Upon detection of the control signal from the controller 6, the speech recognition unit 4 is activated to receive the voice sound representing the command.
  • dial numbers indicative of a command are detected by the line sensor 5, which outputs a signal indicating the reception of a command from a calling party's telephone set, to the controller 6.
  • the controller 6 outputs a control signal to the speech recognition unit 4 to cause the speech recognition unit 4 to stop the speech recognition process.
  • the speech recognition adapter 1 establishes a through connection between telephone set 10 and the telephone exchange (i.e. calling party).
  • the voice sound indicating the command "PLAYBACK” is supplied from the calling party's telephone set through the modular jack 12 and the noise filter 3 to the speech recognition unit 4.
  • the speech recognition unit 4 scans the standard patterns stored in the standard pattern memory. If the voice sound command corresponds to a standard pattern stored in memory, the command associated with the standard pattern is output by speech recognition unit 4 to controller 6 (i.e. "PLAYBACK").
  • the controller 6 then reads a dial number, for example, "#1", which is associated with the command "PLAYBACK” from memory, and supplies the dial number to tone signal generator 2.
  • Tone signal generator 2 supplies a touchtone or a dial pulse signal corresponding to the dial number "#1" through the modular jack 11 to the telephone set 10.
  • the telephone set 10 now performs the function corresponding to or associated with the dial number "#1", for example, playback of messages that have been recorded on the tape or electronic or optical memory.
  • a user can control functions of the telephone set 10 (i.e., operate the telephone set 10) simply by uttering or verbally pronouncing a command from a remote telephone set.
  • the registration switch 7 is turned on in step S1, the name/command selector switch 8 is shifted to select name in step S2.
  • the speech recognition adapter 1 enters the name registration mode (for registering the name of a person to be called) in the steps S1, S2.
  • step S3 the handset of the telephone set 10 is lifted off-hook, and the name of a person to be called is uttered or verbally pronounced twice by the user.
  • Voice pattern information representing the verbalized name is stored in the standard pattern memory of the speech recognition unit 4.
  • step S4 the telephone number of the person whose name has been uttered or verbally pronounced in step S3 is input, associated with the name of the person, and stored in the memory of the controller 6. The name registration mode is now completed.
  • step S3 If there is a plurality of persons whose names have to be registered, the above steps S3 and S4 are repeated. With each name registration, one of the two utterances of the name of the person which has been uttered twice in step S3, is used for registration while the other utterance is used for confirmation.
  • the registration switch 7 is turned on in step S11, then a password number is input in step S12, and the name/command selector switch 8 is shifted to select command in step S13.
  • the speech recognition adapter 1 enters the command registration mode (for registering a function of the telephone set 10) in the steps S11,S12, S13.
  • step S14 the handset of the telephone set 10 is lifted off-hook, and a verbal command is uttered twice by a user and stored in standard pattern memory of the speech recognition unit 4.
  • step S15 a dial number corresponding to the command which has been uttered in step S14 is input, associated with the command, and stored in memory of the controller 6. The command registration mode is now complete.
  • step S3 If there is a plurality of persons whose names have to be registered, the above steps S3 and S4 are repeated. With each name registration, one of the two utterances of the name of the person which has been uttered twice in step S3, is used for registration, and the other utterance is used for confirmation.
  • FIG. 4A and FIG. 4B show the speech recognition mode of the speech recognition adapter 1.
  • step S21 determines whether the handset of the telephone set 10 is lifted off-hook or not. If the handset of the telephone set 10 is lifted off-hook, then the process proceeds to step S22 to start a speech recognition process for recognizing the name of a person to be called.
  • step S23 determines whether pushbuttons of the keypad of telephone set 10 are pressed or activated to input dial numbers (representing a telephone number). If dial numbers (representing a telephone number) are input, the speech recognition process is brought to an end because a signal is transmitted using the pushbuttons of the telephone set 10.
  • step S23 determines whether a recognized voice signal is input or received. If no recognized voice signal is received during the speech recognition process started in the step S22, then control goes back to step S23. The steps S23, S24 are repeated until a recognized voice signal is produced or received during the speech recognition process started in the step S22.
  • step S22 If a recognized voice signal is produced or received during the speech recognition process started in the step S22, i.e., if a recognized name of a person to be called is uttered in step S24 after the handset is lifted off-hook in step S21, control goes to step S25 in which dial numbers (representing telephone number) associated with the recognized name of step S24 are transmitted from the telephone set 10. Thereafter, the speech recognition process is complete.
  • step S21 determines whether a telephone call from another telephone set is received by the telephone set 10. If not, then control returns to step S21. Steps S21, S26 are repeated until a telephone call from a remote telephone set is received by telephone set 10.
  • step S26 If a telephone call from a remote telephone set is received by the telephone set 10 in the step S26, control proceeds to step S27 which starts the telephone answering mode of telephone set 10.
  • step S28 it is determined whether the telephone call received from a remote telephone, or calling party, is cut off or not. If cut off, the speech recognition mode comes to an end. If not, control goes to step S29 .which determines whether a password number is transmitted, or dialed, from the remote telephone set. If a password number is not transmitted or dialed, in step S29, control returns to step S28. Steps S28, S29 are repeated until a password number is transmitted or dialed, from the other telephone set.
  • step S30 If a password number is transmitted from the other telephone set in step S29, then control proceeds to step S30 which starts a speech recognition process for recognizing a command. Then, control goes to step S31 to determine whether pushbuttons of the remote telephone set from which the telephone call has been initiated, are pressed or actuated to transmit or dial a dial number. If a dial number is transmitted from the remote telephone set in step S31, the speech recognition mode is ended since a command has been transmitted via the pushbuttons of the keypad of the remote telephone set.
  • step S31 determines whether the telephone call from the remote telephone set is cut off or not. If cut off, the speech recognition mode comes to an end. If not cut off, control goes to step S33 which determines whether a recognized voice signal is received in step S30. If no recognized voice signal is received during the speech recognition process started in step S30, control goes back to step S31. Steps S31, S32, S33 are repeated until a recognized voice signal is received during the speech recognition process started in step S30.
  • step S30 If a recognized voice signal is received during the speech recognition process started in step S30, control goes to step S34 in which a dial number, associated with the command whose voice sound or signal has been received in step S33, is transmitted to telephone set 10. Thereafter, the speech recognition process is complete.
  • step S21 After the speech recognition process is complete, it is started again from step S21 by a timer-initiated interrupt signal.
  • the telephone set 10 can be controlled (operated) by voice sounds or signals, simply by connecting the speech recognition adapter 1 between the modular jack 11 coupled to the telephone set 10 and the modular jack 12 coupled to the exchange office.
  • the telephone set 10 can be operated with ease and at low cost.
  • the speech recognition adapter 1 is connected in series between the modular jack 11 coupled to the telephone set 10 and the modular jack 12 coupled to the exchange office.
  • the modular jack 11 coupled to the telephone set 10, the modular jack 12 coupled to the exchange office, and the speech recognition adapter 1 may be connected by a three-way modular jack 21.

Abstract

A voice responsive telephone adapter, or speech recognition adapter, for use with a telephone set having for example, telephone answering capabilities. The voice responsive telephone adapter may be used to dial telephone numbers or control auxiliary functional capabilities relating the message playback, for example in response to recognized verbal commands, such as, for example, names or operational functions.

Description

BACKGROUND OF THE INVENTION
1. Field of the Invention
The present invention relates to a speech recognition adapter for use with one such as a telephone set with a telephone answering function or capabilities.
2. Description of the Related Art
Conventional telephone sets with a telephone answering function generally record messages from incoming calls that arrive while a called party is out or otherwise unavailable to answer the telephone, on magnetic tape or on some type of electronic memory device. A called party can call his or her own telephone set from a remote telephone set, such as a telephone set outside his or her home, and press pushbuttons of the keypad of the remote telephone set to transmit a signal representing a pre-registered password as well as a predetermined signal to reproduce and listen to any recorded incoming call messages that may have been recorded on the tape or memory.
A telephone set having a telephone answering function can be remotely controlled via commands dialed or keyed in through a remote telephone keypad, or dial, to perform various functions, e.g., reproduce and erase recorded messages.
Conventional telephone sets with telephone answering functions are generally, however, not so simple to use because the pushbuttons of a remote telephone set must generally be pressed in various different sequential combinations to perform the respective various functions, e.g., to reproduce and erase recorded messages, and such different pushbutton combinations must generally be memorized by a user.
To place a telephone call from a telephone set with a telephone answering function, pushbuttons thereof are generally depressed to dial a desired telephone number of a person to whom the telephone call is to be made. Since different telephone numbers are used to call different persons, those different telephone numbers generally also have to be memorized by a user.
There also exists a voice activated telephone set having a memory, for example, which stores the names of persons and their telephone numbers. When a user verbally pronounces or utters the name of a person to be called, the telephone set transmits a pulse or dial tone signal corresponding to the stored telephone number that is associated with the uttered name. Wide spread use of this type of telephone set, however, is currently impractical due to the relatively high cost.
SUMMARY OF THE INVENTION
It is therefore an object of the present invention to provide a speech recognition adapter for use in a telephone system, which is of low cost and allows a telephone set to be operated with ease.
According to the present invention, there is provided a speech recognition adapter for use in a telephone system having a telephone set and an exchange office or central switching office, adapted to be connected between a modular jack coupled to the telephone set and a modular jack coupled to the exchange office, the speech recognition adapter incorporates a speech recognizing unit for recognizing a voice sound, or voice sound pattern, a storage control unit for storing a voice sound recognized by the speech recognizing unit in association with particular command information such as, for example, telephone numbers, names or telephone set operational functions and a transmitting unit for transmitting or dialing a telephone number or issuing a signal representing telephone set operational functions in response to a recognized voice sound pattern.
The speech recognizing unit recognizes or detects a voice sound representing the name of a person to be called via the telephone set. If the detected voice sound corresponds to a name stored in the storage control unit, the transmitting unit transmits or dials the telephone number associated with the name stored in memory to which the detected voice sound corresponds.
The speech recognizing unit may alternatively recognize a voice sound transmitted from the exchange office representing or corresponding to a function of the auxiliary answer machine of the telephone set. The speech recognizing adapter may be responsive to the voice sound transmitted from the exchange office and recognized by the speech recognizing unit for transmitting a message or performing an operation associated with a command stored in the storage control means which corresponds to a detected voice sound.
The speech recognition adapter is connected between the modular jack coupled to the telephone set and the modular jack coupled to the exchange office. The speech recognition adapter recognizes a voice sound corresponding to or the name of a person to be called or a function of the telephone set. The voice sound is stored in memory in association with a telephone number transmitted from the telephone set. Then, in response to a voice sound representing the name of a person to be called or a function of the telephone set, the speech recognition adapter transmits or dials a telephone number. Simply by being connected to the existing telephone set, the speech recognition adapter can dial a telephone number or operate the telephone set under the control of a voice sound. The speech recognition adapter can thus operate the telephone set with ease at low cost.
In one mode, the speech recognizing unit recognizes a voice sound transmitted from the telephone set, which represents the name of a person to be called, and the transmitting unit transmits or dials a telephone number representing the name of a person to be called which is stored in the storage control means to the exchange office in response to the voice sound from the telephone set and recognized by the speech recognizing unit. In this mode, the telephone set can be operated with ease because the user can make a telephone call simply by pronouncing or uttering the name of the person to be called.
In another mode, the speech recognizing unit recognizes a voice sound from the exchange office, which represents a function of the telephone set. The transmitting means transmits a telephone number stored in the storage control unit to the telephone set in response to the voice sound representing the function of the telephone set from the exchange office and recognized by the speech recognizing means. In this mode, the user is not required to memorize how to operate the telephone set, and the telephone set can be operated with ease because the user can control the telephone set simply by pronouncing or uttering the desired function of the telephone set.
The above and other objects, features, and advantages of the present invention will become apparent from the following description when taken in conjunction with the accompanying drawings which illustrate a preferred embodiment of the present invention by way of example.
BRIEF DESCRIPTION OF THE DRAWINGS
FIG. 1 is a block diagram of a speech recognition adapter according to the present invention which is directly connected to modular jacks on telephone set and exchange office sides;
FIG. 2 is a flowchart of an operation sequence of the speech recognition adapter shown in FIG. 1 when it operates in a name registration mode;
FIG. 3 is a flowchart of an operation sequence of the speech recognition adapter shown in FIG. 1 when it operates in a command registration mode;
FIG. 4 is an illustration showing the relation of FIG. 4A and 4B to each other;
FIG. 4A and FIG. 4B are a flowchart of an operation sequence of the speech recognition adapter shown in FIG. 1 when it operates in a speech registration mode; and
FIG. 5 is a block diagram of the speech recognition adapter that is connected to modular jacks on telephone set and exchange office sides through a three-way modular jack.
DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENT
As shown in FIG. 1, a speech recognition adapter, generally designated by the reference numeral 1, according to the present invention is connected between a telephone set 10 and an exchange office. The telephone set 10 has a telephone answering function, and is of a known nature capable of performing various functions, e.g., reproducing and erasing messages that have been recorded on a built-in tape or electronic memory device (not shown) while the user is away, under the control of dial numbers transmitted from a remote telephone set (not shown).
The speech recognition adapter 1 incorporates a tone signal generator 2, a noise filter 3, a speech recognition unit 4, a line sensor 5, a controller 6, a registration switch 7, and a name/command selector switch 8. The speech recognition adapter 1 is connected to a modular jack 11 which is coupled to the telephone set 10 and a modular jack 12 which is coupled to the exchange office. The tone signal generator 2 is controlled by the controller 6 to transmit a pushbutton or touchtone signal or a dial pulse signal corresponding to a dialed number to the telephone set 10 or the exchange office.
The noise filter 3 serves to remove noise that may be present on the telephone transmission line and also a dial tone transmitted from the exchange office (switchboard, not shown) when the handset (not shown) of the telephone set 10 is off-hook. The speech recognition unit 4 recognizes or detects, through the noise filter 3, a speech or voice signal transmitted from the telephone set 10 or the exchange office, and supplies a recognized result output signal to the controller 6. The line sensor 5 monitors the condition of the telephone transmission line for a change in the line voltage, for example, it detects whether the handset of the telephone set 10 is off-hook to make a telephone call from the telephone set 10 or to answer a telephone call from the exchange office, and outputs a recognized result output signal to the controller 6. The line sensor 5 also detects which keys have been activated or dialed from the telephone set 10 or the exchange office, to the telephone transmission line. The information or commands represented by the activated keys is output from line sensor 5 to controller 6.
When registration switch 7 is turned on, i.e., when the speech recognition adapter 1 is in a registration mode, the controller 6 associates an uttered voice sound, which has been outputted from the telephone set 10 and recognized by the speech recognition unit 4, indicative of the name of a person to be called or desired function of the telephone set 10, with a dial number entered into memory by keys of actuating the keypad or dial (not shown) of telephone set 10. The dialed number or command is stored in memory in association with the appropriate voice or verbal sound pattern information (not shown).
When the registration switch 7 is turned off, i.e., when the speech recognition adapter 1 is in a speech recognition mode, the controller 6 reads, from the built-in memory, a dial number associated with a pronounced uttered voice sound indicative of the name of a person to be called or a function of the telephone set 10. A voice sound is outputted from the telephone set 10 or the exchange office and recognized or detected by the speech recognition unit 4. A control signal is sent to tone signal generator 2 to transmit the dial number to the telephone set 10 or the exchange office.
The registration switch 7 serves to switch the operation mode of the speech recognition adapter 1 between a registration mode and a speech recognition mode. The name/command selector switch 8 is effective or active when the registration switch 7 is turned on. The name/command selector switch 8 is shifted to select a name mode when a dial number to be stored in the built-in memory of the controller 6 is a telephone number associated with the name of a person to be called. The name/command selector switch 8 is shifted to select a command mode when a dial number to be stored in the built-in memory of the controller 6 is a command associated with a function of the telephone set 10.
Operation of the speech recognition adapter 1 will be described below. It is assumed that the registration switch 7 is turned on and the name/command selector switch 8 is shifted to select the name mode, i.e., the speech recognition adapter 1 is in a name registration mode. When the handset of the telephone set 10 is lifted off-hook and the name of a person to be registered is pronounced or uttered twice, the line sensor 5 detects the off-hook of the handset, and indicates the off-hook condition to the controller 6. The controller 6 then applies a control signal to the speech recognition unit 4 to register the voice sound (indicating the name of the person) that is supplied from the telephone set 10 (handset of the telephone set 10) through the modular jack 11 and the noise filter 3.
Upon detection of the control signal from the controller 6, the speech recognition unit 4 registers the voice sound (indicating the name of the person) that is supplied from the telephone set 10 (handset of the telephone set 10) through the modular jack 11 and the noise filter 3, as a standard pattern in a built-in standard pattern memory.
The speech recognition unit 4 uses one of the two uttered names of a person, which has been uttered twice while the handset of the telephone unit 10 is lifted off-hook, for registration, and the other utterance for confirmation. Specifically, only when the voice sound patterns of the two utterances of the name agree with each other, will the speech recognition unit 4 store either one of the utterances of the name in the built-in standard pattern memory.
Immediately thereafter, when pushbuttons of the keypad of telephone set 10 are pressed to input the telephone number of the person whose name has been uttered and the handset of the telephone set 10 is placed on-hook, the telephone number entered is supplied through the line sensor 5 to the controller 6. The telephone number is associated with the registered name stored in the built-in standard pattern memory of the speech recognition unit 4, and stored in the built-in memory of the controller 6.
If there are a plurality of persons whose names have to be registered, the above process is repeated.
It is assumed that the registration switch 7 is turned on and the name/command selector switch 8 is shifted to select the command mode, i.e., the speech recognition adapter 1 is in a command (function of the telephone set 10) registration mode. In the command registration mode, a password number is first registered. Specifically, the handset of the telephone set 10 is lifted off-hook and a multi-digit dial number, for example, a four-digit dial number, is input as a password number via pushbuttons of the keypad of the telephone set 10, and then the handset of the telephone set 10 is placed on-hook. The password number is supplied through the line sensor 5 to the controller 6, and stored in the built-in memory of the controller 6.
Then, the handset of the telephone set 10 is lifted off-hook, and a voice sound representing a command for executing a function of the telephone set 10, for example, a voice sound of "PLAYBACK" representing a command to play back messages that have been recorded on the tape, is uttered twice by a party. The line sensor 5 detects the on-hook of the handset, and indicates the on-hook condition to the controller 6. The controller 6 then applies a control signal to the speech recognition unit 4 to register the voice sound (indicating the command "PLAYBACK") that is supplied from the telephone set 10 (handset of the telephone set 10) through the modular jack 11 and the noise filter 3.
Upon detection of the control signal from the controller 6, the speech recognition unit 4 registers the voice sound (indicating the command "PLAYBACK") that is supplied from the telephone set 10 (handset of the telephone set 10) through the modular jack 11 and the noise filter 3, as a standard pattern in a built-in standard pattern memory.
As in the name registration mode, the speech recognition unit 4 uses one of the two utterances of the name of the person, which has been uttered while the handset of the telephone unit 10 is lifted off-hook, for registration, and the other utterance is used for confirmation.
Immediately thereafter, when pushbuttons of the keypad of telephone set 10 are activated to input a dial number for performing the function of the telephone set 10 which corresponds to the previously uttered command "PLAYBACK", i.e., a dial number (e.g., "#1") for playing back messages that have been recorded on the tape, and the handset of the telephone set 10 is placed on-hook, the dial number is supplied through the line sensor 5 to the controller 6. The dial number is associated with the registered voice sound (command "PLAYBACK") stored in the standard pattern memory of the speech recognition unit 4, and stored in the built-in memory of the controller 6.
If there are a plurality of commands (functions of the telephone set 10) which have to be registered, the above process except the registration of the password number is repeated.
It is assumed that the registration switch 7 is turned off, i.e., the speech recognition adapter 1 is in a speech recognition mode. When the handset of the telephone set 10 is lifted off-hook in order to transmit a signal from the telephone set 10 (i.e., to make a telephone call from the telephone set 10, the line sensor 5 detects the off-hook of the handset of the telephone set 10, and indicates the off-hook condition to the controller 6. The controller 6 then applies a control signal to the speech recognition unit 4 to recognize a voice sound (representing the name of a person to be called) output or transmitted from the telephone set 10. Upon detection of the control signal from the controller 6, the speech recognition unit 4 starts sampling or recognizing the voice sound which represents the name of a person to be called.
Immediately thereafter, when pushbuttons of the keypad of telephone set 10 are pressed to transmit dial numbers indicative of a telephone number, the line sensor 5 detects the dial numbers forwards information represented by the detected dial numbers to the controller 6. The controller 6 outputs a control signal to the speech recognition unit 4 to cause the speech recognition unit 4 to stop the speech recognition process. Specifically, when pushbuttons of the keypad of telephone set 10 are pressed to transmit dial numbers indicative of a telephone number, the speech recognition adapter 1 establishes a through connection allowing a signal from the modular jack 11 to be outputted to the modular jack 12 and also allowing a signal from the modular jack 12 to be outputted to the modular jack 11.
If no pushbuttons of the keypad of telephone unit 10 are activated and the name of a person to be called is uttered, then the uttered voice sound (representing the name of the person) is supplied from the telephone set 10 through the modular jack 11 and the noise filter 3 to the speech recognition unit 4. The speech recognition unit 4 refers to the standard patterns stored in the built-in standard pattern memory. If the uttered voice sound corresponds to a voice sound pattern stored in the standard pattern memory, a signal is sent to the controller 6. The controller 6 then reads dial numbers (representing the telephone number) which are associated with the name of the person from the speech recognition unit 4 from the built-in memory, and supplies the dial numbers to the tone signal generator 2. The tone signal generator 2 supplies a touch-tone or dial pulse signal corresponding to the dial numbers (representing the telephone number) supplied from the controller 6, through the modular jack 12 to the exchange office to dial the telephone number associated with the name of the person whose name was pronounced or uttered.
As described above, the user can make a telephone call to a person simply by uttering the name of the person.
If a call arrives from another telephone set while the telephone set 10 is in a telephone answering mode, a message of the telephone answering mode, for example, "I AM OUT NOW. PLEASE LEAVE A MESSAGE.", is transmitted from the telephone set 10. If any dial numbers indicating a password number are not immediately thereafter transmitted from the calling party's telephone set, the speech recognition adapter 1 sets up a through connection, and the telephone set 10 operates in the telephone answering mode.
If dial numbers indicating a password number are transmitted from the calling party's telephone set after the outgoing message of the telephone answering mode has been transmitted from the telephone set 10, then the line sensor 5 detects the password number and supplies it to the controller 6. The controller 6 determines whether or not the password number from the line sensor 5 agrees or is identical to a password number that has been registered in a memory. If the password number detected by the line sensor 5 agrees with a password number that has been registered in memory, then the controller 6 outputs a control signal to the speech recognition unit 4 to recognize a voice sound (representing a command) transmitted from a calling party's telephone set. Upon detection of the control signal from the controller 6, the speech recognition unit 4 is activated to receive the voice sound representing the command.
If pushbuttons of the keypad of the calling party's telephone set are thereafter pressed to transmit dial numbers indicative of a command, the dial numbers are detected by the line sensor 5, which outputs a signal indicating the reception of a command from a calling party's telephone set, to the controller 6. The controller 6 outputs a control signal to the speech recognition unit 4 to cause the speech recognition unit 4 to stop the speech recognition process. Where no dial numbers indicative of a command are transmitted from a calling party's telephone set, the speech recognition adapter 1 establishes a through connection between telephone set 10 and the telephone exchange (i.e. calling party).
If no pushbuttons of the keypad of a calling party's telephone set are pressed and a command "PLAYBACK", for example, is uttered by a calling party via their telephone set, then the voice sound indicating the command "PLAYBACK" is supplied from the calling party's telephone set through the modular jack 12 and the noise filter 3 to the speech recognition unit 4. The speech recognition unit 4 scans the standard patterns stored in the standard pattern memory. If the voice sound command corresponds to a standard pattern stored in memory, the command associated with the standard pattern is output by speech recognition unit 4 to controller 6 (i.e. "PLAYBACK"). The controller 6 then reads a dial number, for example, "#1", which is associated with the command "PLAYBACK" from memory, and supplies the dial number to tone signal generator 2. Tone signal generator 2 supplies a touchtone or a dial pulse signal corresponding to the dial number "#1" through the modular jack 11 to the telephone set 10. The telephone set 10 now performs the function corresponding to or associated with the dial number "#1", for example, playback of messages that have been recorded on the tape or electronic or optical memory.
As described above, a user can control functions of the telephone set 10 (i.e., operate the telephone set 10) simply by uttering or verbally pronouncing a command from a remote telephone set.
Operation of the speech recognition adapter 1 in the name registration mode (for registering the name of a person to be called), the command registration mode (for registering a function of the telephone set 10), the speech recognition mode will be described below with reference to the flowcharts of FIGS. 2, 3, and 4.
In FIG. 2, the registration switch 7 is turned on in step S1, the name/command selector switch 8 is shifted to select name in step S2. The speech recognition adapter 1 enters the name registration mode (for registering the name of a person to be called) in the steps S1, S2.
In step S3, the handset of the telephone set 10 is lifted off-hook, and the name of a person to be called is uttered or verbally pronounced twice by the user. Voice pattern information representing the verbalized name is stored in the standard pattern memory of the speech recognition unit 4. In a step S4, the telephone number of the person whose name has been uttered or verbally pronounced in step S3 is input, associated with the name of the person, and stored in the memory of the controller 6. The name registration mode is now completed.
If there is a plurality of persons whose names have to be registered, the above steps S3 and S4 are repeated. With each name registration, one of the two utterances of the name of the person which has been uttered twice in step S3, is used for registration while the other utterance is used for confirmation.
In FIG. 3, the registration switch 7 is turned on in step S11, then a password number is input in step S12, and the name/command selector switch 8 is shifted to select command in step S13. The speech recognition adapter 1 enters the command registration mode (for registering a function of the telephone set 10) in the steps S11,S12, S13.
In step S14 the handset of the telephone set 10 is lifted off-hook, and a verbal command is uttered twice by a user and stored in standard pattern memory of the speech recognition unit 4. In step S15, a dial number corresponding to the command which has been uttered in step S14 is input, associated with the command, and stored in memory of the controller 6. The command registration mode is now complete.
If there is a plurality of persons whose names have to be registered, the above steps S3 and S4 are repeated. With each name registration, one of the two utterances of the name of the person which has been uttered twice in step S3, is used for registration, and the other utterance is used for confirmation.
FIG. 4A and FIG. 4B show the speech recognition mode of the speech recognition adapter 1. In FIG. 4A, step S21 determines whether the handset of the telephone set 10 is lifted off-hook or not. If the handset of the telephone set 10 is lifted off-hook, then the process proceeds to step S22 to start a speech recognition process for recognizing the name of a person to be called. Step S23 determines whether pushbuttons of the keypad of telephone set 10 are pressed or activated to input dial numbers (representing a telephone number). If dial numbers (representing a telephone number) are input, the speech recognition process is brought to an end because a signal is transmitted using the pushbuttons of the telephone set 10.
If dial numbers (representing a telephone number) are not input in step S23, then control proceeds to step S24 which determine whether a recognized voice signal is input or received. If no recognized voice signal is received during the speech recognition process started in the step S22, then control goes back to step S23. The steps S23, S24 are repeated until a recognized voice signal is produced or received during the speech recognition process started in the step S22.
If a recognized voice signal is produced or received during the speech recognition process started in the step S22, i.e., if a recognized name of a person to be called is uttered in step S24 after the handset is lifted off-hook in step S21, control goes to step S25 in which dial numbers (representing telephone number) associated with the recognized name of step S24 are transmitted from the telephone set 10. Thereafter, the speech recognition process is complete.
If the handset of the telephone set 10 is not lifted off-hook in the step S21, then control proceeds to step S26 to determine whether a telephone call from another telephone set is received by the telephone set 10. If not, then control returns to step S21. Steps S21, S26 are repeated until a telephone call from a remote telephone set is received by telephone set 10.
If a telephone call from a remote telephone set is received by the telephone set 10 in the step S26, control proceeds to step S27 which starts the telephone answering mode of telephone set 10. In step S28, it is determined whether the telephone call received from a remote telephone, or calling party, is cut off or not. If cut off, the speech recognition mode comes to an end. If not, control goes to step S29 .which determines whether a password number is transmitted, or dialed, from the remote telephone set. If a password number is not transmitted or dialed, in step S29, control returns to step S28. Steps S28, S29 are repeated until a password number is transmitted or dialed, from the other telephone set.
If a password number is transmitted from the other telephone set in step S29, then control proceeds to step S30 which starts a speech recognition process for recognizing a command. Then, control goes to step S31 to determine whether pushbuttons of the remote telephone set from which the telephone call has been initiated, are pressed or actuated to transmit or dial a dial number. If a dial number is transmitted from the remote telephone set in step S31, the speech recognition mode is ended since a command has been transmitted via the pushbuttons of the keypad of the remote telephone set.
If no dial number is transmitted from the remote telephone set in step S31, then control proceeds to step S32 which determines whether the telephone call from the remote telephone set is cut off or not. If cut off, the speech recognition mode comes to an end. If not cut off, control goes to step S33 which determines whether a recognized voice signal is received in step S30. If no recognized voice signal is received during the speech recognition process started in step S30, control goes back to step S31. Steps S31, S32, S33 are repeated until a recognized voice signal is received during the speech recognition process started in step S30.
If a recognized voice signal is received during the speech recognition process started in step S30, control goes to step S34 in which a dial number, associated with the command whose voice sound or signal has been received in step S33, is transmitted to telephone set 10. Thereafter, the speech recognition process is complete.
After the speech recognition process is complete, it is started again from step S21 by a timer-initiated interrupt signal.
As described above, the telephone set 10 can be controlled (operated) by voice sounds or signals, simply by connecting the speech recognition adapter 1 between the modular jack 11 coupled to the telephone set 10 and the modular jack 12 coupled to the exchange office. The telephone set 10 can be operated with ease and at low cost.
In the above embodiment, the speech recognition adapter 1 is connected in series between the modular jack 11 coupled to the telephone set 10 and the modular jack 12 coupled to the exchange office. However, as shown in FIG. 5, the modular jack 11 coupled to the telephone set 10, the modular jack 12 coupled to the exchange office, and the speech recognition adapter 1 may be connected by a three-way modular jack 21.
Although the foregoing description relates to embodiment of the present invention, it will be understood by those skilled in the art that various changes and modifications may be made therein without departing from the scope and spirit of the claimed invention.

Claims (16)

What is claimed is:
1. A speech recognition adapter for use in a telephone system having a telephone set, said speech recognition adapter comprising:
speech recognizing means including a first memory for performing a voice recognition process on a voice sound received from the telephone set, said voice sound including the name of a person to be called, and for storing said voice sound as a voice sound pattern in said first memory;
storage control means including a second memory and a controller for controlling said speech recognizing means to store said voice sound pattern in said first memory in response to receiving repeated utterances of said voice sound only when said voice sound patterns of said repeated utterances of said voice sound agree with each other, said storage control means storing said voice sound in said second memory in association with at least one of a plurality of telephone numbers transmitted from the telephone set and,
means for entering said at least one of a plurality of telephone numbers.
2. A speech recognition adapter according to claim 1, wherein said storage control means is capable of selecting a registration mode, said controller being operable to store the voice sound pattern of said voice sound recognized by said speech recognizing means and the telephone number associated with said voice sound pattern in said second memory in said registration mode.
3. A speech recognition adapter according to claim 2 wherein said storage control means is capable of selecting a speech recognition mode, said controller being operable to output the telephone number and said associated voice sound pattern from said second memory in response to said voice sound recognized by said speech recognizing means in said registration mode.
4. A speech recognition adapter according to claim 1, wherein said storage control means is capable of selecting a name registration mode, and said control is operable to store the voice sound pattern of said voice sound recognized by said speech recognizing means as name information in said first memory in said name registration mode.
5. A speech recognition adapter according to claim 1, wherein said storage control means is capable of selecting a command registration mode, said controller being operable to store the voice sound pattern of said voice sound recognized by said speech recognizing means as command information in said first memory in said command registration mode.
6. A speech recognition adapter according to claim 1, further comprising a noise filter connected to an input terminal of said speech recognizing means, for removing noise from said voice sound.
7. A speech recognition adapter according to claim 1, wherein the telephone set has a telephone answering function.
8. A speech recognition adapter for use with a multi-function telephone set which receives telephone signals from an exchange office, comprising:
data entry means for providing command information and for selectively providing telephone number signals comprising telephone numbers;
detection means for detecting voice sounds having voice sound patterns;
first memory means for storing voice sound pattern information; and,
controller means including second memory means for controlling said detection means to repeatedly detect said voice sounds and to store said voice sound pattern information in said first memory means in response to said repeated detections of said voice sounds only when said voice sound pattern information of said repeated detections of said voice sound agree with each other, said controller means further storing said voice sound pattern information in association with a telephone number provided by said data entry means.
9. A speech recognition adapter for use in a telephone system having a telephone set and a central switching exchange according to claim 8 wherein said telephone set comprises a multifunctional telephone answering device.
10. A speech recognition adapter for use in a telephone system having a telephone set and a central switching exchange, according to claim 8 wherein said command information further comprises a telephone number.
11. A speech recognition adapter for use in a telephone system having a telephone set and a central switching exchange, according to claim 8 wherein said command information comprises telephone set function commands.
12. A speech recognition adapter for use in a telephone system having a telephone set and a central switching exchange, according to claim 8 wherein said detected voice sound patterns correspond to names of individual persons.
13. A speech recognition adapter for use in a telephone system having a telephone set and a central switching exchange, according to claim 8 wherein said voice sound patterns correspond to operational functions of said telephone set.
14. A speech recognition adapter for use in a telephone system having a telephone set and a central switching exchange, according to claim 8 wherein said data entry means further comprises a keypad of said telephone set.
15. A speech recognition adapter for use in a telephone system having a telephone set, said speech recognition adapter comprising:
name registration means, including a first memory, for receiving from said telephone set a voice sound including a name of a person to be called, and for storing said voice sound as a voice sound pattern in said first memory;
control means, including a second memory, for controlling said name registration means to store said voice sound pattern in said first memory in response to receiving repeated utterances of said voice sound when said voice sound patterns of said repeated utterances of said voice sound agree with each other; and,
mean for entering a telephone number, wherein said control means stores in said second memory said telephone number in association with said stored voice sound pattern.
16. A speech recognition adapter for use in a telephone system having a telephone set, said speech recognition adapter comprising:
a name registration unit, including a first memory, for receiving from said telephone set a voice sound including a name of a person to be called, and for storing said voice sound as a voice sound pattern in said first memory;
a controller, including a second memory, for controlling said name registration unit to store said voice pattern in response to successive utterances of said voice sounds, when said voice sound patterns of said successive utterances of said voice sounds agree with each other; have the same voice sound patterns; and,
a keypad for entering a telephone number, wherein said controller stores in said second memory said telephone number in association with said stored voice sound pattern.
US08/059,035 1992-05-08 1993-05-07 Speech recognition adapter for telephone system Abandoned USH1646H (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
JP14342992A JP3144595B2 (en) 1992-05-08 1992-05-08 Voice Adapter
JP4-143429 1992-05-08

Publications (1)

Publication Number Publication Date
USH1646H true USH1646H (en) 1997-05-06

Family

ID=15338519

Family Applications (1)

Application Number Title Priority Date Filing Date
US08/059,035 Abandoned USH1646H (en) 1992-05-08 1993-05-07 Speech recognition adapter for telephone system

Country Status (4)

Country Link
US (1) USH1646H (en)
EP (1) EP0568979A1 (en)
JP (1) JP3144595B2 (en)
KR (1) KR930024375A (en)

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5930336A (en) * 1996-09-30 1999-07-27 Matsushita Electric Industrial Co., Ltd. Voice dialing server for branch exchange telephone systems
US6041300A (en) * 1997-03-21 2000-03-21 International Business Machines Corporation System and method of using pre-enrolled speech sub-units for efficient speech synthesis
US20050100141A1 (en) * 2000-11-17 2005-05-12 Hiroyuki Otagaki Telephone device having operation function by voice input
US20070121814A1 (en) * 2005-11-30 2007-05-31 Mypeople, L.L.C. Speech recognition based computer telephony system

Families Citing this family (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5822727A (en) 1995-03-30 1998-10-13 At&T Corp Method for automatic speech recognition in telephony
EP0778689A1 (en) * 1995-12-06 1997-06-11 WILHELM, Siegfried E Telecommunication end user device
US5905773A (en) * 1996-03-28 1999-05-18 Northern Telecom Limited Apparatus and method for reducing speech recognition vocabulary perplexity and dynamically selecting acoustic models
JP3684669B2 (en) * 1996-04-26 2005-08-17 松下電工株式会社 Loudspeaker
US6314165B1 (en) * 1998-04-30 2001-11-06 Matsushita Electric Industrial Co., Ltd. Automated hotel attendant using speech recognition
US6493670B1 (en) 1999-10-14 2002-12-10 Ericsson Inc. Method and apparatus for transmitting DTMF signals employing local speech recognition
DE10114997A1 (en) * 2001-03-26 2002-10-17 Elchanan Tanzer Recording system for cordless telephone, files administrative information enabling identification of at least one subscriber
CN102917143B (en) * 2012-10-09 2016-01-27 广东欧珀移动通信有限公司 A kind of call method of special number and device
US10326869B2 (en) 2016-08-19 2019-06-18 Amazon Technologies, Inc. Enabling voice control of telephone device
US10194023B1 (en) 2017-08-31 2019-01-29 Amazon Technologies, Inc. Voice user interface for wired communications system
US10911596B1 (en) 2017-08-31 2021-02-02 Amazon Technologies, Inc. Voice user interface for wired communications system
US10326886B1 (en) 2017-08-31 2019-06-18 Amazon Technologies, Inc. Enabling additional endpoints to connect to audio mixing device

Citations (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP0168039A2 (en) * 1984-07-13 1986-01-15 Alcatel N.V. Circuit arrangement for connecting a voice-controlled auxiliary device assigned to a telephone set to a telephone line
EP0194387A2 (en) * 1984-12-28 1986-09-17 Tomy Kogyo Co. Inc. A voice-controlled telephone and method for controlling same
GB2188812A (en) * 1986-04-01 1987-10-07 Leonard Rubenstein Telephone answering device
DE3837385A1 (en) * 1987-11-06 1989-05-24 Ricoh Kk VOICE ACTIVATED DIALING DEVICE
US4850005A (en) * 1986-08-06 1989-07-18 Hashimoto Corporation Telephone answering device with artificial intelligence
US4883713A (en) * 1986-04-28 1989-11-28 Eurosteel S.A. Moldable material reinforcement fibers with hydraulic or non-hydraulic binder and manufacturing thereof
US5111501A (en) * 1989-02-09 1992-05-05 Kabushiki Kaisha Toshiba Telephone terminal device having speech recognition unit
US5125024A (en) * 1990-03-28 1992-06-23 At&T Bell Laboratories Voice response unit
US5220595A (en) * 1989-05-17 1993-06-15 Kabushiki Kaisha Toshiba Voice-controlled apparatus using telephone and voice-control method
US5222121A (en) * 1989-06-19 1993-06-22 Nec Corporation Voice recognition dialing unit

Patent Citations (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP0168039A2 (en) * 1984-07-13 1986-01-15 Alcatel N.V. Circuit arrangement for connecting a voice-controlled auxiliary device assigned to a telephone set to a telephone line
US4837808A (en) * 1984-07-13 1989-06-06 Alcatel N.V. Circuit connecting a voice-controlled device to a telephone station and line
EP0194387A2 (en) * 1984-12-28 1986-09-17 Tomy Kogyo Co. Inc. A voice-controlled telephone and method for controlling same
GB2188812A (en) * 1986-04-01 1987-10-07 Leonard Rubenstein Telephone answering device
US4883713A (en) * 1986-04-28 1989-11-28 Eurosteel S.A. Moldable material reinforcement fibers with hydraulic or non-hydraulic binder and manufacturing thereof
US4850005A (en) * 1986-08-06 1989-07-18 Hashimoto Corporation Telephone answering device with artificial intelligence
DE3837385A1 (en) * 1987-11-06 1989-05-24 Ricoh Kk VOICE ACTIVATED DIALING DEVICE
US5111501A (en) * 1989-02-09 1992-05-05 Kabushiki Kaisha Toshiba Telephone terminal device having speech recognition unit
US5220595A (en) * 1989-05-17 1993-06-15 Kabushiki Kaisha Toshiba Voice-controlled apparatus using telephone and voice-control method
US5222121A (en) * 1989-06-19 1993-06-22 Nec Corporation Voice recognition dialing unit
US5125024A (en) * 1990-03-28 1992-06-23 At&T Bell Laboratories Voice response unit

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
Electrical Communication vol. 59, No. 3, 1985, Harlow GB, pp. 281 285 Immendorfer Voice Dialer . *
Electrical Communication vol. 59, No. 3, 1985, Harlow GB, pp. 281-285 Immendorfer "Voice Dialer".

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5930336A (en) * 1996-09-30 1999-07-27 Matsushita Electric Industrial Co., Ltd. Voice dialing server for branch exchange telephone systems
US6041300A (en) * 1997-03-21 2000-03-21 International Business Machines Corporation System and method of using pre-enrolled speech sub-units for efficient speech synthesis
US20050100141A1 (en) * 2000-11-17 2005-05-12 Hiroyuki Otagaki Telephone device having operation function by voice input
US7340041B2 (en) * 2000-11-17 2008-03-04 Sanyo Electric Co., Ltd. Telephone device having operation function by voice input
US20070121814A1 (en) * 2005-11-30 2007-05-31 Mypeople, L.L.C. Speech recognition based computer telephony system

Also Published As

Publication number Publication date
JPH05316189A (en) 1993-11-26
JP3144595B2 (en) 2001-03-12
EP0568979A1 (en) 1993-11-10
KR930024375A (en) 1993-12-22

Similar Documents

Publication Publication Date Title
USH1646H (en) Speech recognition adapter for telephone system
US4945557A (en) Voice activated dialing apparatus
US5007081A (en) Speech activated telephone
US5953656A (en) Method and apparatus for remotely accessing a telephone answering device
JPH02209055A (en) Telephone set
JP2001308970A (en) Speech recognition operation method and system for portable telephone
EP0893901B1 (en) Method for controlling a telecommunication service and a terminal
US4907258A (en) Key telephone apparatus
US5638437A (en) Telecommunication system and method enabling a user to get access to an automated call processing from a central station operating on pulse dialling mode
JP2528345B2 (en) Multifunction telephone device
JP3341365B2 (en) Voice adapter
JPH098956A (en) Hand-free device
JP2937968B2 (en) Voice response telephone
WO1989012937A1 (en) Telephone answering apparatus for mobile telephone systems
KR930009849B1 (en) Memorized phone number displaying and automatic dialing method
JPH08163228A (en) Voice remote control automatic answering telephone system and operation instruction receiving method by voice
AU635210B2 (en) Telephone answering apparatus for mobile telephone systems
JPS6251861A (en) Automatic answering telephone set
JPH0637912A (en) Remote controller
JPH04192852A (en) Dial number outgoing method
KR100674799B1 (en) Auto-dialing method for speaker-dependant and word-isolated recognition 114 telephone number directory
KR20000067091A (en) Method and system for operating a portable phone by voice recognition technique
JPH03123258A (en) Automatic answering telephone set
JPH0582104B2 (en)
JPH03139945A (en) Telephone set

Legal Events

Date Code Title Description
AS Assignment

Owner name: SONY CORPORATION, JAPAN

Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNORS:KATO, YASUHIKO;WATARI, MASAO;TANAKA, MIYUKI;REEL/FRAME:006681/0916

Effective date: 19930720

STCF Information on status: patent grant

Free format text: PATENTED CASE