WO2002086865A1 - Speaker verification in a spoken dialogue system - Google Patents

Publication number
WO2002086865A1
Authority
WO
WIPO (PCT)
Prior art keywords
user
computer
target device
communication
database
Prior art date
Application number
PCT/IB2002/001280
Other languages
French (fr)
Inventor
Martin Holley
Karin Huber
Original Assignee
Koninklijke Philips Electronics N.V.
Application filed by Koninklijke Philips Electronics N.V.
Priority to EP02720373A (EP1382033A1)
Priority to JP2002584300A (JP2004533752A)
Publication of WO2002086865A1

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06Q INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q30/00 Commerce
    • G06Q30/06 Buying, selling or leasing transactions
    • G PHYSICS
    • G07 CHECKING-DEVICES
    • G07C TIME OR ATTENDANCE REGISTERS; REGISTERING OR INDICATING THE WORKING OF MACHINES; GENERATING RANDOM NUMBERS; VOTING OR LOTTERY APPARATUS; ARRANGEMENTS, SYSTEMS OR APPARATUS FOR CHECKING NOT PROVIDED FOR ELSEWHERE
    • G07C9/00 Individual registration on entry or exit
    • G07C9/30 Individual registration on entry or exit not involving the use of a pass
    • G07C9/32 Individual registration on entry or exit not involving the use of a pass in combination with an identity check
    • G07C9/33 Individual registration on entry or exit not involving the use of a pass in combination with an identity check by means of a password
    • G PHYSICS
    • G07 CHECKING-DEVICES
    • G07C TIME OR ATTENDANCE REGISTERS; REGISTERING OR INDICATING THE WORKING OF MACHINES; GENERATING RANDOM NUMBERS; VOTING OR LOTTERY APPARATUS; ARRANGEMENTS, SYSTEMS OR APPARATUS FOR CHECKING NOT PROVIDED FOR ELSEWHERE
    • G07C9/00 Individual registration on entry or exit
    • G07C9/30 Individual registration on entry or exit not involving the use of a pass
    • G07C9/32 Individual registration on entry or exit not involving the use of a pass in combination with an identity check
    • G07C9/37 Individual registration on entry or exit not involving the use of a pass in combination with an identity check using biometric data, e.g. fingerprints, iris scans or voice recognition
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/22 Procedures used during a speech recognition process, e.g. man-machine dialogue
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L17/00 Speaker identification or verification techniques

Definitions

  • the invention relates to a method of supporting the dialogue between a user and a target device.
  • a target device is understood to mean, for example, a computer of a provider on the Internet, via which the user or customer can acquire a certain product or a certain service.
  • the term target device also covers household appliances, such as video recorders, kitchen appliances or heating systems which also require input from a user to activate or control them. Apart from these devices in the private sector, industrial equipment can also be included under the term target device.
  • the invention also relates to a computer for editing the information for a target device to support the communication between a user and the target device.
  • the invention further relates to a computer program product which can be loaded directly into the internal memory of a digital computer and comprises sections of software code.
  • Neural networks provide an adaptive system.
  • WO 00/51050 A1 describes a method that supports finding the correct address on the Internet in electronic trade.
  • the personal needs of the user or customer are taken into account when corresponding homepages are searched for. This takes place by storing a number of products in a database along with at least one preference criterion for each product as well as the storage of information about the user, such as size of clothes, certain tastes in music, sport, entertainment, films or books, or date of birth.
  • the system provides a recommendation for certain products or similar generated in accordance with the user profile.
  • US 5 970 469 A describes a method of supporting Internet sales in which the purchasing behavior of the purchaser in the past can be used in processing. With this system the information on the purchaser is combined with other data and a corresponding proposal made to the customer.
  • the known method has proven to have the disadvantage that communication with the Internet is supported only inadequately or the dialog with other target devices, such as household appliances or similar, is not supported at all. Furthermore, there is no method by which, for example, electronic sales procedures can be supported in such a way that input is simplified and thus also with, for example, mobile telephones or other hand-held devices, such as palm-top computers, an order can be placed on the Internet in a fast and simple manner.
  • the object of the invention is to provide a method of supporting the dialog between a user and a target device as well as a computer for editing the information for a target device to support communications between a user and the target device, through which a simpler dialog between the user and a target device and an extensive application, that is not just restricted to computers on the Internet, can be achieved.
  • the method or the computer must be adaptive, which means that for recurring dialog between user and target device it can learn the corresponding steps of the method or the preconditions over time and apply them as necessary, so that the necessary steps for performing the dialog between user and target device are simplified.
  • the user will be identified and user-specific data stored in a database, which data will be called up when the information for the target device is edited.
  • a user accesses the system for the first time, the latter detects this and stores certain user-specific data in the database.
  • the data so stored may be that which normally occurs during a dialog between the user and the target device or also certain user-specific data, such as name or address can be established for the user and stored in the database.
  • the information necessary for the target device is edited, such as the data necessary for completing an order form on a computer on the
  • the user is identified by his speech input.
  • This requires no manual input from the user, which in particular with small operating devices, such as mobile telephones, represents a considerable simplification. Thus it is not necessary to enter certain passwords or similar for identification purposes by laborious use of the keys.
  • the devices required for the speech analysis can, for example, be provided in the actual communication means of the user or in the computer for the editing of the information for a target device. Identification of the user can take place by analysis of any speech input or by analysis of specified speech input such as code words or the like.
  • the former can also be automatically identified by means of his mobile telephone number.
  • GSM Global System for Mobile Communication
  • functions are implemented as standard, allowing display of the number of the calling subscriber at the called subscriber. With this function, an additional or alternative identification of the user can therefore take place when the mobile telephone is used.
  • an identification can also take place by entering a password, an identifier, a PIN code or the like.
  • a credit card number, social security number or other clear identifier of the user can be used.
  • this information is established through a dialog with the user. To this end a communication is established by means of a computer for example, and the user is asked a corresponding question that the user answers preferably by speech input by means of his communication means, such as a mobile telephone. Depending upon the communication means used, manual input via keys or similar can also take place.
  • the user-specific data are preferably updated and expanded regularly, while a confirmation from the user is preferably requested before any updating so as to avoid or at least reduce incorrect entries.
  • synthetic speech output can be provided that returns the information from the target device to the user in acoustic form. This possibility is particularly advantageous when a mobile telephone is used as the communication means between the user and the target device.
  • a computer for editing information for a target device to support communication between a user and the target device, comprising a communication means for communication between the user and the target device, an interface between the computer and the target device and a link between the computer and the communication means, with a database linked to the computer for storage of user-specific data and identification means for the identification of the user.
  • the interface between the computer and the target device may be, for example, a respective link to a data network such as the Internet, or also a standardized or individually designed link to a device such as a video recorder, a heating system or a kitchen appliance.
  • the communication or dialog, respectively, between a user and the target device is interrupted by the computer and through regular querying of the database the dialog between the user and the target is supported in that data which is present and which the target device needs, is taken from the database and thus need not be entered by the user via the communication means.
  • inventive method is built up as an adaptive system, in that user-specific data are regularly updated and expanded in the database and thus the data file of the user is continually updated and expanded.
  • User-specific data such as name, address, date of birth, but also certain preferences, can be called up so as to support the dialog with a target unit.
  • the identification means are preferably in the form of a speech recognition unit. With the appropriate speech input by the user, it is immediately assigned to the corresponding user-specific data in the database and the further dialog with the target unit is thereby supported.
  • encryption and decryption means for encrypting and decrypting communications between the user and the computer and/or communication between the computer and the target device are provided.
  • Such encryption is important to secure personal data and thus protect the privacy of the user. For financial transactions in particular, such encryption is also a protection against abuse by others.
  • acoustic references for the speech recognition and/or information on the purchasing behavior of the user or similar are provided in the database, the corresponding support for the dialog and the identification of the user is further enhanced.
  • a recognition device for recognition of the communication means can also be provided.
  • This recognition device may be effected, for example, in the event of use of a mobile telephone as the communication means, by means of the mobile telephone number that always accompanies a call.
  • the interface between the computer and the target device is formed by a data network, in particular the Internet.
  • the communication means can be integrated with the computer.
  • a home computer may serve both as a communication means and a computer for editing the information for the target device.
  • a voice synthesis device may be provided for acoustic output of the information to the user. Through acoustic output the dialog between the user and the target device is further enhanced, since reading of a display or similar is unnecessary.
  • a corresponding device or entry in the database can be provided.
  • the communication means may be in the form of a mobile telephone through which the target device can be reached from virtually anywhere.
  • a computer program product is used that can be loaded directly into the internal memory of a digital computer and comprises sections of software code, in which the computer is used to process the steps of the method described above if the product is running on the computer.
  • the computer program product is preferably stored on a medium that can be read by a computer.
  • Fig. 1 shows in a schematic manner the components for executing the method in accordance with the invention during dialog between a user and a computer on the Internet.
  • Fig. 2 shows the components for executing the method in accordance with the invention to support the dialog between a user and a household appliance.
  • Fig. 3 shows a flow chart to illustrate the functional sequence of the method in accordance with the invention.
  • Fig. 1 shows a communication means 1 in the form of a mobile telephone, with which a user establishes a dialog with a target device 2, which target device 2 in the present case is comprised of a computer that is connected to a data network, in particular the Internet 3.
  • a mobile telephone as the communication means 1 a personal computer, a palm-top computer or the like may be provided.
  • the target device 2 in the form of a computer can for example be a server of a provider of certain products on the Internet 3.
  • a computer 4 is provided that serves to edit the information for the target device 2 to support the communication between the user and the target device 2.
  • the computer 4 has an interface 5 with the target device 2, that can be comprised of a corresponding link to the Internet 3, for example by means of a modem link.
  • the interface 5 can also be comprised of a standard interface of the computer 4.
  • the communication means 1 which by way of example can be comprised of the corresponding mobile radio network and a corresponding receiver unit on the computer 4 (not shown).
  • a database 6 is also provided for storage of user-specific data, which is preferably integrated with the computer 4.
  • the computer 4 and the database 6 can also be combined to a single device.
  • user-specific data are searched for in the database 6 and these data are used for the information for the target device 2 as required.
  • Identification means 7 are used to identify the user and can, for example, be comprised of a speech recognition unit, with which automatic assignment to the respective user takes place through the corresponding speech input of the user via the communication means 1.
  • the identification can also be extended to the entry of a password, an identifier, a PIN code or similar or take place automatically through the mobile telephone number of a mobile telephone as the communication means 1.
  • the communication between the mobile telephone 1 and the computer 4 and/or the computer 4 and the target device 2 can also take place via corresponding encryption and decryption devices 8, 9.
  • These encryption and decryption devices 8, 9 are naturally preferably integrated with the computer 4 or with the target device 2.
  • a speech synthesis device 10 can also be provided for output of the data transferred from the target device 2 or computer 4 to the user or communication means 1 in acoustic form.
  • Fig. 2 shows a realization of the method in accordance with the invention in the support of the dialog between a user and a target device 2 in the form of a domestic appliance, for example a video recorder.
  • the communication means 1 of the user is formed by a personal computer that also contains the function of the computer 4.
  • the target device or video recorder is connected to the computer 4.
  • the user-specific information stored in the database 6 is used for programming the video recorder and thus supports the programming process. In this way different behaviors of family members can be taken into account and used when programming the video recorder.
  • Fig. 3 shows a flow chart of the most important functional sequences of the method in accordance with the invention.
  • the method in accordance with the invention begins at step 101.
  • identification of the user takes place, for example by analysis of speech input.
  • a query takes place on whether data on the identified user is present in the database.
  • If the user is a new user and, accordingly, no user data is present in the database, certain user data is requested from the user and stored in the database in accordance with step 104.
  • In step 105 the target device is asked for the desired data, whereupon, in accordance with step 106, a search is made to see whether these data are present in the database. If the data desired by the target device are stored in the database, they are called up from the database in step 107 and transferred to the target device. If the desired data are not stored in the database, they are established from the user in step 108, passed to the target device and, in accordance with step 109, stored in the database.
  • The procedure continues with a query in accordance with step 110 on whether further data are required for the target device; in the affirmative, it continues with step 105. This loop between step 110 and step 105 is repeated as often as necessary. If all the data required for the target device are present, the procedure is ended in accordance with step 111.
  • the computer recognizes the content of the question and converts this into a question that can be answered by the target device, which answers the question to the computer. With the speech synthesis means of the computer the computer answers the question of the user.

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Human Computer Interaction (AREA)
  • Business, Economics & Management (AREA)
  • Finance (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Accounting & Taxation (AREA)
  • Computational Linguistics (AREA)
  • Development Economics (AREA)
  • Economics (AREA)
  • Marketing (AREA)
  • Strategic Management (AREA)
  • General Business, Economics & Management (AREA)
  • Theoretical Computer Science (AREA)
  • Telephonic Communication Services (AREA)

Abstract

The invention relates to a method of supporting the dialog between a user and a target device and a computer for editing the information for a target device to support the communication between a user and the target device and also to a computer program product with which the steps in accordance with the method can be executed. In order to provide such a method and such a computer, by which a simple dialog between the user and a target device and a wide application, that is not just restricted to computers on the Internet, can be achieved, it is envisaged that the user is identified and user-specific data, which are used when editing the information for the target device are stored in a database. A computer (4) for editing the information for a target device (2) to support the communication between a user and the target device (2) has a communication means (1) for communication between the user and the target device (2), an interface (5) between the computer (4) and the target device (2) and a link between the computer (4) and the communication means (1) with a database (6) linked to the computer (4) for storage of user-specific data as well as identification means (7) for identification of the user.

Description

SPEAKER VERIFICATION IN A SPOKEN DIALOGUE SYSTEM
The invention relates to a method of supporting the dialogue between a user and a target device. A target device is understood to mean, for example, a computer of a provider on the Internet, via which the user or customer can acquire a certain product or a certain service. The term target device also covers household appliances, such as video recorders, kitchen appliances or heating systems which also require input from a user to activate or control them. Apart from these devices in the private sector, industrial equipment can also be included under the term target device.
The invention also relates to a computer for editing the information for a target device to support the communication between a user and the target device. The invention further relates to a computer program product which can be loaded directly into the internal memory of a digital computer and comprises sections of software code.
The importance of electronic trade is increasing at a tremendous rate due to worldwide data networks, in particular the Internet, or similar communications media. Modern electronic trade, known by the term e-commerce, is increasingly changing the behavior of consumers. Since the consumer no longer needs to buy or receive the goods or services from a business or a service company in person, the number of goods and services on offer can be increased considerably. At the press of a button on a computer terminal suppliers of the most varied of products or services from around the world can be brought to the customer. Because of this abundance of offerings, however, there is also a difficulty in finding the correct address on the Internet, for example.
In other areas of daily life as well, such as the operation of household appliances or industrial equipment, rapid changes in technology constantly give rise to problems, which are further amplified by the pace of modern life. Furthermore, modern communications media such as mobile radio networks or data networks such as the Internet open up possibilities of operating such devices from virtually anywhere using simple communication means, such as mobile telephones.
So there is a great need for methods or devices to support the dialog between a user and a target device. A method of the type in question is described for example in WO 00/63837 A1, in which user-specific data are evaluated for more efficiently searching for websites on the Internet. Operation is simplified by a speech processor. Neural networks provide an adaptive system.
WO 00/51050 A1 describes a method that supports finding the correct address on the Internet in electronic trade. Here the personal needs of the user or customer are taken into account when corresponding homepages are searched for. This takes place by storing a number of products in a database along with at least one preference criterion for each product as well as the storage of information about the user, such as size of clothes, certain tastes in music, sport, entertainment, films or books, or date of birth. The system provides a recommendation for certain products or similar generated in accordance with the user profile. US 5 970 469 A describes a method of supporting Internet sales in which the purchasing behavior of the purchaser in the past can be used in processing. With this system the information on the purchaser is combined with other data and a corresponding proposal is made to the customer.
The known methods have the disadvantage that communication with the Internet is supported only inadequately and that the dialog with other target devices, such as household appliances or similar, is not supported at all. Furthermore, there is no method that supports, for example, electronic sales procedures in such a way that input is simplified, so that an order can be placed on the Internet in a fast and simple manner even with mobile telephones or other hand-held devices, such as palm-top computers.
Accordingly, the object of the invention is to provide a method of supporting the dialog between a user and a target device as well as a computer for editing the information for a target device to support communications between a user and the target device, through which a simpler dialog between the user and a target device and an extensive application, that is not just restricted to computers on the Internet, can be achieved. In particular, the method or the computer must be adaptive, which means that for recurring dialog between user and target device it can learn the corresponding steps of the method or the preconditions over time and apply them as necessary, so that the necessary steps for performing the dialog between user and target device are simplified.
To achieve this object in respect of the method, it is provided that the user is identified and user-specific data are stored in a database, which data are called up when the information for the target device is edited. When a user accesses the system for the first time, the system detects this and stores certain user-specific data in the database. The data so stored may be data that normally occur during a dialog between the user and the target device; alternatively, certain user-specific data, such as name or address, can be established for the user and stored in the database. When the information necessary for the target device is edited, such as the data necessary for completing an order form on a computer on the Internet, the user-specific data stored in the database are used where available. Any missing data are established through dialog with the user, stored in the database and also passed on to the target device.
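By way of illustration only, the editing step described above can be sketched in Python as follows. The sketch is not taken from the application: the function name `edit_information`, the dictionary used in place of the database and the `ask_user` callback standing in for the dialog with the user are all assumptions made for the example.

```python
from typing import Callable, Dict, List

# Hypothetical in-memory stand-in for the database of user-specific data:
# user id -> {field name -> stored value}.
Database = Dict[str, Dict[str, str]]


def edit_information(
    user_id: str,
    required_fields: List[str],
    database: Database,
    ask_user: Callable[[str], str],
) -> Dict[str, str]:
    """Collect the data a target device needs, reusing stored data first."""
    # A first-time user has no record yet, so an empty one is created.
    profile = database.setdefault(user_id, {})
    result: Dict[str, str] = {}
    for field in required_fields:
        if field not in profile:
            # Missing data is established through dialog and then stored.
            profile[field] = ask_user(field)
        result[field] = profile[field]
    return result
```

In this sketch the returned dictionary is what would be passed on to the target device, for example to complete an order form; on a later call for the same user, only fields not yet stored trigger a question.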
Advantageously, the user is identified by his speech input. This requires no manual input from the user, which in particular with small operating devices, such as mobile telephones, represents a considerable simplification. Thus it is not necessary to enter certain passwords or similar for identification purposes by laborious use of the keys. The devices required for the speech analysis can, for example, be provided in the actual communication means of the user or in the computer for the editing of the information for a target device. Identification of the user can take place by analysis of any speech input or by analysis of specified speech input such as code words or the like.
Alternatively or additionally for the identification by means of speech input, in the case of communication between the user and the target device via a mobile telephone the former can also be automatically identified by means of his mobile telephone number. In the GSM (Global System for Mobile Communication) mobile telephone network such functions are implemented as standard, allowing display of the number of the calling subscriber at the called subscriber. With this function, an additional or alternative identification of the user can therefore take place when the mobile telephone is used.
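The combination of identification by speech input with identification by the calling number can be pictured, purely as a sketch, as a simple fallback. The function `identify_user`, the `speech_match` callback (a placeholder for whatever speaker-matching component is used) and the number-to-user table are hypothetical names introduced here, not elements of the application.

```python
from typing import Callable, Dict, Optional


def identify_user(
    speech_match: Callable[[bytes], Optional[str]],
    audio: bytes,
    caller_number: Optional[str],
    number_to_user: Dict[str, str],
) -> Optional[str]:
    """Return a user id from speech input, else from the calling number."""
    # Hypothetical speaker-matching function: returns a user id or None.
    user_id = speech_match(audio)
    if user_id is not None:
        return user_id
    # Fall back to the calling number, if the network transmitted one.
    if caller_number is not None:
        return number_to_user.get(caller_number)
    return None
```

The order of the two checks is a design choice of the sketch; the text allows the calling number to be used either in addition to or instead of the speech input.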
Furthermore or as an alternative to the possibilities stated, an identification can also take place by entering a password, an identifier, a PIN code or the like. To this end a credit card number, social security number or other clear identifier of the user can be used. Depending on the application it may be an advantage for the dialog between the user and the target device to be encrypted. For this purpose normal encryption and decryption methods can be used. In the event that the information required by the target device is not all available in the database, this information is established through a dialog with the user. To this end a communication is established by means of a computer for example, and the user is asked a corresponding question that the user answers preferably by speech input by means of his communication means, such as a mobile telephone. Depending upon the communication means used, manual input via keys or similar can also take place.
The user-specific data are preferably updated and expanded regularly, while a confirmation from the user is preferably requested before any updating so as to avoid or at least reduce incorrect entries. To simplify communications between the target device and the user, synthetic speech output can be provided that returns the information from the target device to the user in acoustic form. This possibility is particularly advantageous when a mobile telephone is used as the communication means between the user and the target device.
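The request for confirmation before any update of the user-specific data could, as one possible sketch, look like the following. The names `update_with_confirmation` and `confirm` are assumptions for illustration; `confirm` stands in for a question put to the user, for example by synthetic speech output.

```python
from typing import Callable, Dict


def update_with_confirmation(
    profile: Dict[str, str],
    field: str,
    new_value: str,
    confirm: Callable[[str, str, str], bool],
) -> bool:
    """Store new_value only if it differs and the user confirms the change."""
    old_value = profile.get(field, "")
    if old_value == new_value:
        return False  # nothing to update
    if not confirm(field, old_value, new_value):
        return False  # user rejected the change; the stored value is kept
    profile[field] = new_value
    return True
```

The return value indicates whether the stored data were actually changed, so that a caller can tell a rejected or unnecessary update from a successful one.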
If the information transferred by the user to the target device is restricted as a function of the user, a restriction of the possible transactions can be achieved, which is for example advisable when the method in accordance with the invention is used by children, but also in other areas.
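A user-dependent restriction of this kind can be sketched as a filter applied before information is passed to the target device. The function `filter_for_target` and the `allowed_fields` table are hypothetical; in particular, treating users without an entry as unrestricted is an assumption of the sketch, not something the text specifies.

```python
from typing import Dict, Set


def filter_for_target(
    user_id: str,
    information: Dict[str, str],
    allowed_fields: Dict[str, Set[str]],
) -> Dict[str, str]:
    """Pass on only the fields this user may send to the target device."""
    # Assumption: users without an explicit entry are unrestricted.
    if user_id not in allowed_fields:
        return dict(information)
    permitted = allowed_fields[user_id]
    return {key: value for key, value in information.items() if key in permitted}
```

With an entry such as `{"child": {"title"}}`, a payment detail supplied by that user would not reach the target device, which corresponds to limiting the possible transactions.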
To achieve the object according to the invention, a computer is also used for editing information for a target device to support communication between a user and the target device, comprising a communication means for communication between the user and the target device, an interface between the computer and the target device and a link between the computer and the communication means, with a database linked to the computer for storage of user-specific data and identification means for the identification of the user. The interface between the computer and the target device may be, for example, a respective link to a data network such as the Internet, or also a standardized or individually designed link to a device such as a video recorder, a heating system or a kitchen appliance. The communication or dialog, respectively, between a user and the target device is interrupted by the computer and, through regular querying of the database, the dialog between the user and the target device is supported in that data which is present and which the target device needs is taken from the database and thus need not be entered by the user via the communication means. Furthermore, the inventive method is built up as an adaptive system, in that user-specific data are regularly updated and expanded in the database and thus the data file of the user is continually updated and expanded. User-specific data, such as name, address, date of birth, but also certain preferences, can be called up so as to support the dialog with a target unit.
The identification means are preferably in the form of a speech recognition unit. With the appropriate speech input, the user is immediately assigned to the corresponding user-specific data in the database, and the further dialog with the target unit is thereby supported.
In accordance with a further characteristic of the invention, encryption and decryption means for encrypting and decrypting communications between the user and the computer and/or communication between the computer and the target device are provided. Such encryption is important to secure personal data and thus protect the privacy of the user. For financial transactions in particular, such encryption is also a protection against abuse by others.
If acoustic references for the speech recognition and/or information on the purchasing behavior of the user or similar are provided in the database, the corresponding support for the dialog and the identification of the user is further enhanced.
Furthermore, a recognition device for recognition of the communication means can also be provided. When a mobile telephone is used as the communication means, for example, this recognition may be effected by means of the mobile telephone number that always accompanies a call.
In accordance with another characteristic of the invention, the interface between the computer and the target device is formed by a data network, in particular the Internet.
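Purely by way of illustration, and not as part of the application as filed, recognition of the communication means by the calling number could be sketched as a lookup from a normalized telephone number to a user identifier. The names `KNOWN_NUMBERS`, `normalize_number` and `recognize_caller`, as well as the sample number, are hypothetical assumptions.

```python
from typing import Optional

# Hypothetical table mapping a calling number to a user identifier.
KNOWN_NUMBERS = {"+436641234567": "user-1"}


def normalize_number(raw: str) -> str:
    """Keep only digits and '+' so that formatting variants compare equal."""
    return "".join(ch for ch in raw if ch.isdigit() or ch == "+")


def recognize_caller(raw_number: str) -> Optional[str]:
    """Return the user id registered for the calling number, or None if unknown."""
    return KNOWN_NUMBERS.get(normalize_number(raw_number))
```

An unknown number yields `None`, in which case another form of identification, such as speech input or a PIN code, would still be needed.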
In respective fields of application the communication means can be integrated with the computer. For example, a home computer may serve both as a communication means and a computer for editing the information for the target device.
This also applies to the database for the user-specific data that can be integrated with the computer.
For acoustic output of the information to the user a voice synthesis device may be provided. Through acoustic output the dialog between the user and the target device is further enhanced, since reading of a display or similar is unnecessary.
For user-specific restriction of the information transferred by a user to the target device a corresponding device or entry in the database can be provided. In this way, by way of example, a parental control or other access restriction can be created. The communication means may be in the form of a mobile telephone through which the target device can be reached from virtually anywhere.
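The user-specific restriction mentioned above can be pictured, under stated assumptions, as a filter applied before information is passed to the target device. The sketch below is illustrative only; the rule layout (`blocked_fields`, `max_order_value`) and the function name `restrict` are hypothetical and are not taken from the application.

```python
# Hypothetical restriction entries, as they might be held in the database.
RESTRICTIONS = {
    "child": {"blocked_fields": {"credit_card"}, "max_order_value": 20.0},
}


def restrict(user_id: str, info: dict) -> dict:
    """Return the information the given user is allowed to send to the target device.

    Users without an entry are unrestricted. An order above the user's limit
    raises PermissionError; blocked fields are silently removed.
    """
    rule = RESTRICTIONS.get(user_id)
    if rule is None:
        return dict(info)
    limit = rule.get("max_order_value")
    if limit is not None and info.get("order_value", 0.0) > limit:
        raise PermissionError(f"order value exceeds the limit for user {user_id!r}")
    blocked = rule.get("blocked_fields", set())
    return {key: value for key, value in info.items() if key not in blocked}
```

A parental control would then amount to one such entry per child; whether blocked data are dropped or the whole request is refused is a design choice this sketch does not settle.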
To achieve the object according to the invention also a computer program product is used that can be loaded directly into the internal memory of a digital computer and comprises sections of software code, in which the computer is used to process the steps of the method described above if the product is running on the computer.
For this purpose the computer program product is preferably stored on a medium that can be read by a computer.
The present invention is further explained using preferred examples of embodiment and with reference to the drawing, in which
Fig. 1 shows in a schematic manner the components for executing the method in accordance with the invention during dialog between a user and a computer on the Internet.
Fig. 2 shows the components for executing the method in accordance with the invention to support the dialog between a user and a household appliance.
Fig. 3 shows a flow chart to illustrate the functional sequence of the method in accordance with the invention.
Fig. 1 shows a communication means 1 in the form of a mobile telephone, with which a user establishes a dialog with a target device 2, which target device 2 in the present case is comprised of a computer that is connected to a data network, in particular the Internet 3. Instead of the use of a mobile telephone as the communication means 1 a personal computer, a palm-top computer or the like may be provided. The target device 2 in the form of a computer can for example be a server of a provider of certain products on the Internet 3. In accordance with the invention a computer 4 is provided that serves to edit the information for the target device 2 to support the communication between the user and the target device 2. The computer 4 has an interface 5 with the target device 2, that can be comprised of a corresponding link to the Internet 3, for example by means of a modem link. The interface 5 can also be comprised of a standard interface of the computer 4. Likewise there is a link between the computer 4 and the communication means 1, which by way of example can be comprised of the corresponding mobile radio network and a corresponding receiver unit on the computer 4 (not shown).
In accordance with the invention a database 6 is also provided for storage of user-specific data, which is preferably integrated with the computer 4. Depending on the application the communication means 1, the computer 4 and the database 6 can also be combined into a single device. During the dialog between the user and the target device 2 according to the invention user-specific data are searched for in the database 6 and these data are used for the information for the target device 2 as required. In the case of the first communication from the user the most important user-specific data are entered by the user when requested via the communication means 1 and stored in the database 6 via the computer 4. Identification means 7 are used to identify the user and can, for example, be comprised of a speech recognition unit, with which automatic assignment to the respective user takes place through the corresponding speech input of the user via the communication means 1. The identification can also be extended to the entry of a password, an identifier, a PIN code or similar or take place automatically through the mobile telephone number of a mobile telephone as the communication means 1. In order to prevent abuse and to ensure data protection, the communication between the mobile telephone 1 and the computer 4 and/or between the computer 4 and the target device 2 can also take place via corresponding encryption and decryption devices 8, 9. These encryption and decryption devices 8, 9 are preferably integrated with the computer 4 or with the target device 2. For output of the data transferred from the target device 2 or computer 4 to the user or communication means 1 in acoustic form a speech synthesis device 10 can also be provided.
Fig. 2 shows a realization of the method in accordance with the invention in the support of the dialog between a user and a target device 2 in the form of a domestic appliance, for example a video recorder. In this case the communication means 1 of the user is formed by a personal computer that also contains the function of the computer 4. The target device or video recorder is connected to the computer 4 via a corresponding interface 5. Once the user has been identified, for example by the entry of a password, the user-specific information stored in the database 6 is used for programming the video recorder and thus supports the programming process. In this way the different habits of family members can be taken into account and used when programming the video recorder.
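As a hedged sketch of how stored user-specific information might support the programming of a video recorder, the example below merges general defaults, the identified user's stored preferences and any values entered for the current recording. All names (`DEFAULTS`, `PREFERENCES`, `build_recording_request`) and all sample values are hypothetical; the application does not specify a data layout.

```python
from typing import Optional

# Hypothetical defaults and per-user preferences, as they might be held in the database.
DEFAULTS = {"channel": 1, "quality": "SP"}
PREFERENCES = {
    "anna": {"channel": 5},
    "ben": {"quality": "LP"},
}


def build_recording_request(user_id: str, start: str, end: str,
                            overrides: Optional[dict] = None) -> dict:
    """Combine defaults, stored preferences and explicit input, in that order.

    Later sources win, so a value entered for this recording overrides a
    stored preference, which in turn overrides the general default.
    """
    settings = {**DEFAULTS, **PREFERENCES.get(user_id, {}), **(overrides or {})}
    return {"start": start, "end": end, **settings}
```

With this ordering, a family member only has to enter the start and end times; everything else is filled in from the database unless it is explicitly changed.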
The application in accordance with the invention of the method or computer or computer program product in accordance with the invention is, however, not restricted to the two examples described. Rather, the invention allows a very wide range of applications in the most varied of fields. For example, the dialog of a user with a heating system or kitchen appliances can be supported and simplified. Furthermore, it is conceivable for the method in accordance with the invention to also support the completion of forms, for example from authorities over the Internet.
Fig. 3 shows a flow chart of the most important functional sequences of the method in accordance with the invention. The method in accordance with the invention begins at step 101. At step 102 identification of the user takes place, for example by analysis of speech input. At step 103 a query takes place on whether data on the identified user is present in the database. If this is the case, the sequence continues from step 105. If the user is a new user and if, accordingly, no user data is present in the database, in accordance with step 104, certain user data is demanded from the user and stored in the database. In accordance with step 105 the target device is asked for desired data, whereupon, in accordance with step 106, a search is made to see if these data are present in the database. If the data desired by the target device are stored in the database, these are called up from the database in step 107 and transferred to the target device. In the event of the desired data not being stored in the database, they are established from the user in step 108 and passed to the target device and, in accordance with step 109, stored in the database. The procedure continues with a query in accordance with step 110 on whether further data are required for the target device and in the affirmative it continues with step 105. This loop between step 110 and step 105 is repeated as often as necessary. If all the data required for the target device are present, the procedure is ended in accordance with step 111.
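The sequence of steps 103 to 111 can be sketched as follows. This is a minimal illustration under several assumptions, not the implementation of the application: the database is modeled as a nested dictionary, the data requested by the target device are given as a list of field names, `ask_user` is a hypothetical callback standing in for the dialog with the user, identification (step 102) is assumed to have already produced `user_id`, and step 104 is simplified to creating an empty record.

```python
from typing import Callable, Dict, List


def support_dialog(user_id: str,
                   required_fields: List[str],
                   database: Dict[str, Dict[str, str]],
                   ask_user: Callable[[str], str]) -> Dict[str, str]:
    """Collect the data a target device requests, asking the user only for what is missing."""
    # Steps 103/104: use the stored record, or create one for a new user
    # (simplified: the initial data entry of step 104 is left to the loop below).
    record = database.setdefault(user_id, {})
    sent: Dict[str, str] = {}
    # Steps 105/110: one pass per item the target device asks for.
    for field in required_fields:
        if field in record:              # step 106: is the item in the database?
            sent[field] = record[field]  # step 107: take it from the database
        else:
            value = ask_user(field)      # step 108: obtain it from the user
            sent[field] = value
            record[field] = value        # step 109: store it for later dialogs
    return sent                          # step 111: all required data are present
```

On a second dialog with the same user, fields that were entered once are served from the database, so `ask_user` is no longer called for them.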
Mention may be made of the fact that the user identified by the computer can by speech input also ask quite general questions or also questions in a purchasing procedure. For example, the user can ask when the book ordered from the target device will be delivered. The computer recognizes the content of the question and converts this into a question that can be answered by the target device, which answers the question to the computer. With the speech synthesis means of the computer the computer answers the question of the user.

CLAIMS:
1. A method of supporting the dialog between a user and a target device (2), in which the user is identified and user-specific data are stored in a database (6) that are used when the information for the target device (2) is edited.
2. A method as claimed in claim 1, in which the user is identified on the basis of his speech input.
3. A method as claimed in claim 1, in which the user communicates with the target device (2) via a mobile telephone (1) and the user is identified on the basis of his mobile telephone number.
4. A method as claimed in claim 1, in which the user is identified on the basis of the input of a password or an identifier, a PIN code or similar.
5. A method as claimed in claim 1, in which the dialog between the user and the target device (2) is encrypted.
6. A method as claimed in claim 1, in which information that is missing for the target device (2) is determined through a dialog with the user.
7. A method as claimed in claim 1, in which the user-specific data are updated and extended.
8. A method as claimed in claim 1, in which in the case of different user-specific data entered by the user and stored in the database (6), confirmation of the user is required for updating of the user-specific data.
9. A method as claimed in claim 1, in which the communication between the target device (2) and the user takes place via synthetic voice output.
10. A method as claimed in claim 1, in which the information transferred by the user to the target device is restricted in dependence on the user.
11. A computer (4) for editing information for a target device (2) to support communication between a user and the target device (2), comprising a communication means (1) for communication between the user and the target device (2), an interface (5) between the computer (4) and the target device (2) and a link between the computer (4) and the communication means (1), with a database (6) linked to the computer (4) for storage of user-specific data, and identification means (7) for the identification of the user.
12. A computer as claimed in claim 11, in which the identification means are formed by a speech recognition unit.
13. A computer as claimed in claim 11, in which encryption and decryption means (8, 9) for encrypting and decrypting the communication between the user and the computer (4) and/or the communication between the computer (4) and the target device (2) are provided.
14. A computer as claimed in claim 11, in which acoustic references for the speech recognition and/or information about the purchasing behavior of the user or similar are contained in the database (6).
15. A computer as claimed in claim 11, in which a recognition device for recognition of the communication means is provided.
16. A computer as claimed in claim 11, in which the interface (5) between the computer (4) and the target device (2) is formed by a data network (3), in particular the Internet.
17. A computer as claimed in claim 11, in which the communication means (1) are integrated with the computer (4).
18. A computer as claimed in claim 11, in which the database (6) for the user-specific data is integrated with the computer (4).
19. A computer as claimed in claim 11, in which a speech synthesis device (10) is provided for acoustic output of the information.
20. A computer as claimed in claim 11, in which a device is provided for restricting the information transferred by the user to the target device in dependence on the user.
21. A computer as claimed in claim 11, in which the communication means (1) are formed by a mobile telephone.
22. A computer program product, which can be loaded directly into the internal memory of a digital computer and comprises sections of software code, in which the steps of the method in accordance with any of claims 1 to 10 are executed with the computer if the product is running on the computer.
23. A computer program product as claimed in claim 22, in which it is stored on a medium that can be read by a computer.
PCT/IB2002/001280 2001-04-13 2002-04-09 Speaker verification in a spoken dialogue system WO2002086865A1 (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
EP02720373A EP1382033A1 (en) 2001-04-13 2002-04-09 Speaker verification in a spoken dialogue system
JP2002584300A JP2004533752A (en) 2001-04-13 2002-04-09 Speaker authentication in dialog systems

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
EP01890115 2001-04-13
EP01890115.7 2001-04-13

Publications (1)

Publication Number Publication Date
WO2002086865A1 (en) 2002-10-31

Family

ID=8185107

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/IB2002/001280 WO2002086865A1 (en) 2001-04-13 2002-04-09 Speaker verification in a spoken dialogue system

Country Status (6)

Country Link
US (1) US20020152300A1 (en)
EP (1) EP1382033A1 (en)
JP (1) JP2004533752A (en)
KR (1) KR20030012877A (en)
CN (1) CN1302455C (en)
WO (1) WO2002086865A1 (en)

Families Citing this family (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR20050023941A (en) * 2003-09-03 2005-03-10 삼성전자주식회사 Audio/video apparatus and method for providing personalized services through voice recognition and speaker recognition
CN104601832A (en) * 2008-04-29 2015-05-06 台达电子工业股份有限公司 Conversation system and speech conversation processing method
CN102479396A (en) * 2010-11-25 2012-05-30 王正伟 Target device selection method, system and facility
US20130066634A1 (en) * 2011-03-16 2013-03-14 Qualcomm Incorporated Automated Conversation Assistance
CN103738295B (en) * 2013-12-25 2016-03-02 科大讯飞股份有限公司 A kind of active fire alarm of the stolen power actuated vehicle based on speech recognition and track channel and method
CN105489218A (en) * 2015-11-24 2016-04-13 江苏惠通集团有限责任公司 Speech control system, remote control and server

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5629981A (en) * 1994-07-29 1997-05-13 Texas Instruments Incorporated Information management and security system
WO1999000719A1 (en) * 1997-06-27 1999-01-07 Lernout & Hauspie Speech Products N.V. Access-controlled computer system with automatic speech recognition
EP0890167A2 (en) * 1996-09-09 1999-01-13 VCS INDUSTRIES, Inc. d.b.a. VOICE CONTROL SYSTEMS Speech recognition and verification system enabling authorized data transmission over networked computer systems
US6138100A (en) * 1998-04-14 2000-10-24 At&T Corp. Interface for a voice-activated connection system
WO2000065814A1 (en) * 1999-04-23 2000-11-02 Nuance Communications Object-orientated framework for interactive voice response applications
EP1074974A2 (en) * 1999-06-07 2001-02-07 Nokia Mobile Phones Ltd. Secure wireless communication user identification by voice recognition

Family Cites Families (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5517558A (en) * 1990-05-15 1996-05-14 Voice Control Systems, Inc. Voice-controlled account access over a telephone network
US6304864B1 (en) * 1999-04-20 2001-10-16 Textwise Llc System for retrieving multimedia information from the internet using multiple evolving intelligent agents
US7146505B1 (en) * 1999-06-01 2006-12-05 America Online, Inc. Secure data exchange between date processing systems
US20010049636A1 (en) * 2000-04-17 2001-12-06 Amir Hudda System and method for wireless purchases of goods and services
US20040078276A1 (en) * 2000-12-22 2004-04-22 Kotaro Shimogori System for electronic merchandising and shopping

Patent Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5629981A (en) * 1994-07-29 1997-05-13 Texas Instruments Incorporated Information management and security system
EP0890167A2 (en) * 1996-09-09 1999-01-13 VCS INDUSTRIES, Inc. d.b.a. VOICE CONTROL SYSTEMS Speech recognition and verification system enabling authorized data transmission over networked computer systems
US6292782B1 (en) * 1996-09-09 2001-09-18 Philips Electronics North America Corp. Speech recognition and verification system enabling authorized data transmission over networked computer systems
WO1999000719A1 (en) * 1997-06-27 1999-01-07 Lernout & Hauspie Speech Products N.V. Access-controlled computer system with automatic speech recognition
US6138100A (en) * 1998-04-14 2000-10-24 At&T Corp. Interface for a voice-activated connection system
WO2000065814A1 (en) * 1999-04-23 2000-11-02 Nuance Communications Object-orientated framework for interactive voice response applications
US6314402B1 (en) * 1999-04-23 2001-11-06 Nuance Communications Method and apparatus for creating modifiable and combinable speech objects for acquiring information from a speaker in an interactive voice response system
EP1074974A2 (en) * 1999-06-07 2001-02-07 Nokia Mobile Phones Ltd. Secure wireless communication user identification by voice recognition

Also Published As

Publication number Publication date
EP1382033A1 (en) 2004-01-21
US20020152300A1 (en) 2002-10-17
JP2004533752A (en) 2004-11-04
KR20030012877A (en) 2003-02-12
CN1302455C (en) 2007-02-28
CN1461465A (en) 2003-12-10


Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A1

Designated state(s): CN IN JP KR

AL Designated countries for regional patents

Kind code of ref document: A1

Designated state(s): AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE TR

WWE Wipo information: entry into national phase

Ref document number: 2002720373

Country of ref document: EP

WWE Wipo information: entry into national phase

Ref document number: 1020027016825

Country of ref document: KR

WWE Wipo information: entry into national phase

Ref document number: 02801202X

Country of ref document: CN

Ref document number: IN/PCT/2002/2071/CHE

Country of ref document: IN

121 Ep: the epo has been informed by wipo that ep was designated in this application
WWP Wipo information: published in national office

Ref document number: 1020027016825

Country of ref document: KR

WWE Wipo information: entry into national phase

Ref document number: 2002584300

Country of ref document: JP

WWP Wipo information: published in national office

Ref document number: 2002720373

Country of ref document: EP

WWR Wipo information: refused in national office

Ref document number: 2002720373

Country of ref document: EP

WWW Wipo information: withdrawn in national office

Ref document number: 2002720373

Country of ref document: EP