DE10043271A1

DE10043271A1 - Method for voice recognition picks up a spoken word/word component to be processed electronically and converted to digital code before being compared with a data file of stored references.

Info

Publication number: DE10043271A1
Application number: DE2000143271
Authority: DE
Inventors: Marian Trinkel; Christel Mueller; Detlef Hardt; Heinrich-Helmut Brenig
Original assignee: Deutsche Telekom AG
Current assignee: Deutsche Telekom AG
Priority date: 2000-09-02
Filing date: 2000-09-02
Publication date: 2002-10-02

Abstract

A word spoken by a person in conversation or a word component is picked up, electronically processed and converted to digital code before it is compared with a data file of stored references. A reference matching a word that has been spoken is processed further. The person speaking the words is identified in respect of certain features by using speech characterizing that person.

Description

Die vorliegende Erfindung betrifft ein Verfahren zur Spracherkennung, bei dem ein von einer sprechenden Person gesprochenes Wort oder ein Wortbestandteil aufgenommen, elektronisch verarbeitet und in einen digitalen Code umgesetzt wird, bevor es mit einer Datei enthaltend gespeicherte Bedeutungen verglichen wird, wobei die dem gesprochen Wort zuzuordnende Bedeutung weiterverarbeitet wird. Die Erfindung betrifft gleichfalls ein System zur Durchführung des Verfahrens.The present invention relates to a method for speech recognition, in which a Word or part of a word spoken by a speaking person recorded, electronically processed and converted into a digital code is compared before it is saved with a file containing stored meanings the meaning of the spoken word is further processed becomes. The invention also relates to a system for performing the Process.

Spracherkennungsysteme sind generell in verschiedenen Anwendungen bekannt. So werden heute beispielsweise Mobiltelephone mit speicherbaren Telephonbuch angeboten, wobei der Inhalt des Telephonbuches vom Benutzer über das Sprechen eines Namens abgerufen werden kann ("Name Dailling"). Nach der Vorgabe des Namens wählt das Telephon die Nummer des ausgewählten Teilnehmers automatisch an. Ein solches System ist für die Benutzung von Mobiltelephonen während anderer Beschäftigungen, beispielsweise während des Autofahrens, vorteilhaft.Speech recognition systems are generally known in various applications. So today, for example, cell phones with storable phone book offered, the content of the phone book by the user on the Speaking a name can be retrieved ("Name Dailling"). After Given the name, the telephone dials the number of the selected one Participant automatically. Such a system is for the use of Mobile phones during other activities, for example during the Driving, advantageous.

Nachteil der bekannten Systeme ist generell, daß sie vergleichsweise wenig intelligent sind und die sprachlichen Vorgaben ohne weiter "nachzudenken" weiterverarbeiten. So kommt es jedoch gezwungenermaßen zu Problemen, wenn beispielsweise der Benutzer das Wort "Mama" vorgibt, um die Mutter anzuwählen. The disadvantage of the known systems is generally that they are comparatively little are intelligent and the language requirements without "thinking" processed. However, this inevitably leads to problems if for example, the user specifies the word "mom" to dial the mother.

Gibt nämlich der Vater den Adressaten "Mama" vor, so will er die Oma sprechen, während das Kind seine Mutter wünscht.If the father specifies the addressee "mom", he wants to speak to the grandma, while the child wants his mother.

Aufgabe der vorliegenden Erfindung ist es, ein Verfahren zur Spracherkennung zu schaffen, das mit einfachen Mitteln umsetzbar ist und das mit hoher Zuverlässigkeit dem gesprochenen Wort die vom Nutzer gewollte Bedeutung zuordnet. Die Aufgabe ist es auch, ein System zur Durchführung des Verfahrens zu schaffen.The object of the present invention is to provide a method for speech recognition create that can be implemented with simple means and with high Reliability of the spoken word is the meaning the user wants assigns. The task is also to create a system to carry out the procedure to accomplish.

Diese Aufgaben werden durch das Verfahren nach Anspruch 1 und das System nach Anspruch 4 gelöst.These tasks are achieved by the method according to claim 1 and the system solved according to claim 4.

Der erfindungsgemäße Gedanke liegt darin, bei der Zuordnung der Bedeutung zu dem entsprechenden über Sprache vorgegebenen Wort die Individualität des Sprechers zu erkennen, um damit mögliche individuelle Unterschiede in den Bedeutungen einzelner Worte stärker berücksichtigen zu können. Das Verfahren identifiziert die sprechende Person anhand der die Person charakterisierenden Aussprache einzelner Worte oder bestimmter systematischer Charakteristika. Dabei reicht es unter Umständen aus, die Person zumindest bezüglich eines oder einiger Merkmale, wie beispielsweise Alter, Geschlecht oder Gemütszustand zu identifizieren, um eine vernünftige Zuordnung der Bedeutungen zu gewährleisten. Bei der Zuordnung der Bedeutung zum vorgegebenen Wort oder zum Wortbestandteil berücksichtigt das Verfahren erfindungsgemäß eine der Person oder der Personengruppe individuell zuzuordnende Korrektur, die zu einer individualisierten Bedeutung des vorgegebenen Wortes führt.The idea of the invention is to assign the meaning to the corresponding word given by language the individuality of the Recognizing the speaker in order to identify possible individual differences in the To be able to take the meanings of individual words more into account. The procedure identifies the speaking person based on the characterizing person Pronunciation of individual words or certain systematic characteristics. It may be sufficient to at least refer to one or the person some characteristics, such as age, gender or mood identify in order to ensure a reasonable assignment of the meanings. When assigning the meaning to the given word or to According to the invention, the word component takes into account one of the person or the correction to be individually assigned to the group of people, which leads to a individualized meaning of the given word.

Das erfindungsgemäße Verfahren läßt sich generell zur Korrektur eines jeden Systems zur Spracherkennung einsetzen, da eine Individualisierung von Bedeutungen in jedem Fall zu einer höheren Treffsicherheit bei der Zuordnung führt. Beispielsweise kann jedes Diktierprogramm von der Erfindung profitieren, indem es zunächst den Nutzer identifiziert, bevor es die individuelle Zuordnung von Bedeutungen vornimmt. Besonders vorteilhaft ist es jedoch der Einsatz der individualisierenden Spracherkennung als Feature bei (Mobil)-Telephonen. Bei diesen kann der Name des anzurufenden Teilnehmers gesprochen werden (name dailling), und das erfindungsgemäße Verfahren setzt daraufhin den Namen in die entsprechende Netzkennung um. So wird mittels der Erfindung dem Wort "Mama" von einem Kind ausgesprochen die Bedeutung "Mutter" zugeordnet, während die Oma angewählt wird, wenn der Vater "Mama" vorgibt.The method according to the invention can generally be used to correct everyone Use systems for speech recognition because an individualization of Meanings in any case to a higher accuracy in the assignment leads. For example, any dictation program can benefit from the invention, by first identifying the user before making the individual assignment of meanings. However, it is particularly advantageous to use the individualizing voice recognition as a feature on (mobile) telephones. at the name of the subscriber to be called can be spoken to them (name dailling), and the inventive method then sets the name in the corresponding network identifier. So the word "mom" pronounced by a child the meaning "mother" while the Grandma is chosen when the father pretends "mom".

Als System wird der erfindungsgemäße Gedanke dadurch realisiert, daß die Spracherkennungseinheit ein Verifikationsmodul aufweist, das anhand des gesprochenen Wortes zumindest ein Merkmal des anrufenden Teilnehmers, wie beispielsweise sein Alter und/oder sein Geschlecht, registriert und dieses als Korrektur berücksichtigt bei der Umsetzung des Wortes in die Netzkennung. Dabei umfaßt das System ein Datenleitungsnetz, insbesondere das Internet oder ein Telephonnetz, mit einer Vielzahl individueller Anschlüsse, wobei ein Teilnehmer sich über einen Anschluß in das Netz einwählen und vermittels einer Netzkennung eine Verbindung zum Anschluß eines anderen Teilnehmers herstellen kann. Es ist besonders vorteilhaft, das Verifikationsmodul mit einer intelligenten Elektronik eines Telephones, wie sie in den modernen Telephonen üblich ist, zu realisieren. In einer anderen vorteilhaften Ausführungsform ist die Spracherkennung und das Verifikationsmodul nicht im Telephon dezentral, sondern zentral von einem Server realisiert sind, der an das Datenleitungsnetz angeschlossen ist.As a system, the inventive idea is realized in that the Speech recognition unit has a verification module, which is based on the spoken word at least one feature of the calling party, such as for example, its age and / or gender, registered and this as Correction taken into account when converting the word into the network identifier. there the system comprises a data line network, in particular the Internet or a Telephone network, with a large number of individual connections, one subscriber dial into the network via a connection and use a network identifier can establish a connection to the connection of another subscriber. It is the verification module with intelligent electronics is particularly advantageous a telephone, as is common in modern telephones. In another advantageous embodiment, the speech recognition and that Verification module not decentralized in the telephone, but centrally from a server are realized, which is connected to the data line network.

Neben der verbesserten Treffsicherheit der gewünschten Bedeutung hat die Erfindung folgende weitere Vorteile: Über die Identifizierung des anrufenden Teilnehmers ist es möglich, für spezielle Personengruppen (Jugendliche oder Rentner) besondere Tarife einzuführen. Bestimmte Netzkennungen, z. B. 0190- Hotlines, können über die Identifizierung auch gesperrt werden. Vorteilhaft ist, daß der Nutzer jederzeit über seine Sprache einen präzisen Zugriff auf seine im Telephonbuch gespeicherten Nummern hat.In addition to the improved accuracy of the desired meaning, the Invention following further advantages: On the identification of the calling Participant, it is possible for special groups of people (young people or Pensioners) to introduce special tariffs. Certain network identifiers, e.g. B. 0190- Hotlines can also be blocked using identification. It is advantageous that the user has precise access to his im at any time via his language Phone book has stored numbers.

Claims

1. A method for speech recognition, in which a word or a part of a word spoken by a speaking person is recorded, electronically processed and converted into a digital code before it is compared with a file of stored meanings, the meaning corresponding to the spoken word being processed further, characterized in that the speaking person is identified on the basis of the language characterizing them at least with regard to some features and when assigning the spoken word or the word component to the meaning, a correction which can be individually assigned to the person and which leads to an individualized meaning is taken into account.

2. The method according to claim 1, characterized in that an individualization with regard to the characteristic "age" and / or "gender" becomes.

3. The method according to claim 1 or 2, characterized in that the individualizing Speech recognition for the spoken selection of network identifiers (name dailling), especially in telephone networks.

4. System comprising a data line network, in particular the Internet or a telephone network, with a large number of individual connections, wherein a subscriber can dial into the network via a connection and can use a network identifier to establish a connection to the connection of another subscriber, with a speech recognition unit converts the word spoken by the calling subscriber into an electronic and system-compatible network identifier, characterized in that the speech recognition unit has a verification module which uses the spoken word to register at least one characteristic of the calling subscriber, such as its anus, and when the word is converted into the network identifier considered as a correction.

-

5. System according to claim 4, characterized in that the verification module of intelligent electronics of a telephone is implemented.

6. System according to claim 4, characterized in that the speech recognition and the verification module are implemented by a central server that is on the data line network is connected.