KR100414064B1

KR100414064B1 - Mobile communication device control system and method using voice recognition

Info

Publication number: KR100414064B1
Application number: KR10-2001-0019584A
Authority: KR
Inventors: 오종인
Original assignee: 엘지전자 주식회사
Priority date: 2001-04-12
Filing date: 2001-04-12
Publication date: 2004-01-07
Also published as: KR20020080174A

Abstract

The present invention relates to a system and method for controlling a mobile communication terminal by voice recognition. The terminal is further provided with a voice feature extraction unit that extracts voice features from a voice command spoken by the user so that they can be transmitted to the base station. The base station system is provided with a voice recognition server comprising: a vocabulary registration unit that searches a phoneme database using the voice features transmitted from the terminal and recognizes and registers the corresponding vocabulary; a database that stores the recognized vocabulary for each user; and a voice recognition processing unit that, after vocabulary registration is complete, recognizes a voice command issued by the user by comparing it with the registered vocabulary. Because the voice recognition server of the base station system and the terminal recognize speech by communicating with each other, the user no longer needs to train a voice command by pronouncing it several times, as was conventionally required, which makes voice commands more convenient to use. Voice recognition can also be implemented simply by adding a small program to an existing terminal, which minimizes the increase in manufacturing cost.

Description

Mobile communication terminal control system and method by voice recognition {MOBILE COMMUNICATION DEVICE CONTROL SYSTEM AND METHOD USING VOICE RECOGNITION}

The present invention relates to technology for controlling a mobile communication terminal by voice recognition, and more particularly to a system and method in which the mobile communication terminal extracts only the voice features and transmits them to a voice recognition server at the base station, and the voice recognition server receives the voice features and either registers them as a voice command or transmits a previously registered voice command code to the terminal, so that the terminal can be controlled by voice recognition.

In general, several voice recognition methods can be applied to a mobile communication terminal. Among them, the user-dependent, vocabulary-dependent method can recognize only the vocabulary it has been trained on. To recognize additional vocabulary, training must be repeated, and recognition works only for the user who performed the training.

Next, user-independent, vocabulary-dependent voice recognition omits the per-user training step required by the user-dependent, vocabulary-dependent method, so it is convenient in that no additional work is needed before voice recognition can be used. However, the vocabulary that can be registered is limited, which greatly reduces its usefulness.

Next, user-independent, vocabulary-independent voice recognition not only omits per-user training but also places no limit on the number of words that can be registered, so it has many applications. However, it requires a large amount of computation for the probability calculation that follows feature extraction and a large amount of memory for unlimited vocabulary registration, so implementing it is not realistic.

Meanwhile, to use the voice command function of a conventional mobile communication terminal, a registration process first had to be carried out through the voice command menu.

In the voice command registration process, the user is asked to pronounce the word to be recognized two or three times, and is asked to pronounce it again if any of the utterances is of poor quality.

The voice recognition unit (not shown) of the mobile communication terminal then extracts the start and end points of the spoken utterance, extracts features using a voice feature extraction algorithm, and stores them in a non-volatile memory (not shown) inside the terminal.

Next, once the voice command has been stored as described above and the user issues a voice command in the terminal's voice recognition mode, the voice features of that command are extracted through the same process used during registration, and the most similar word among the previously registered voice features is found and output as the recognition result.
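The conventional on-terminal matching described above can be sketched as a nearest-template search. This is a minimal illustration only: the Euclidean distance, the fixed-length feature vectors, and the names `distance`, `recognize`, and `templates` are assumptions, since the text does not say which similarity measure conventional terminals used.

```python
import math

def distance(a, b):
    """Euclidean distance between two equal-length feature vectors (assumed metric)."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def recognize(feature, templates):
    """Return the registered word whose stored feature vector is closest to `feature`."""
    return min(templates, key=lambda word: distance(feature, templates[word]))

# Hypothetical templates, standing in for features stored in non-volatile memory.
templates = {"home": [0.9, 0.1, 0.3], "office": [0.2, 0.8, 0.5]}
result = recognize([0.85, 0.15, 0.35], templates)
```

Because every template lives on the terminal, both the vocabulary size and the cost of the search are bounded by the terminal's memory and processor.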

That is, the user interface of the mobile communication terminal uses the result data output by the voice recognition unit to perform the corresponding operation, such as placing a call.

In addition, recent mobile communication terminals make heavy use of the Short Message Service (SMS), a service in which a subscriber sends a short message to a specific subscriber using a terminal or the Internet. Because it places less load on the system than ordinary voice call or data call processing, messages can be sent and received at relatively low cost, and because it supports Layer-2 ACK and Layer-3 ACK, it is a convenient service that ensures reliable sending and receiving.

That is, when a sender composes a message and attempts to send it, SMS first stores the message in the base station system and returns an ACK signal to the sender to indicate that transmission is complete. The system then transmits the message to the receiving terminal.

If no ACK signal arrives from the receiving terminal after the message is transmitted, the message is retransmitted. If the receiving terminal is powered off or in a coverage dead zone, transmission is attempted when the terminal is powered on or enters the service area.
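The store-and-forward behaviour described in the two paragraphs above can be sketched as follows. This is a toy model under stated assumptions: the class `MessageCenter`, its methods, and the `reachable` set (terminals that are powered on and in coverage) are hypothetical names, and real SMS signalling is far more involved.

```python
from collections import deque

class MessageCenter:
    """Toy store-and-forward relay: store, ACK the sender, then deliver with retries."""

    def __init__(self):
        # Messages stored but not yet acknowledged by the recipient.
        self.pending = deque()

    def submit(self, recipient, text):
        """Store the message first, then acknowledge the sender."""
        self.pending.append((recipient, text))
        return "ACK"

    def deliver(self, reachable):
        """Try each stored message once; keep those whose recipient did not ACK."""
        delivered = []
        still_pending = deque()
        for recipient, text in self.pending:
            if recipient in reachable:
                # Recipient is powered on and in the service area, so it ACKs.
                delivered.append((recipient, text))
            else:
                # No ACK received: hold the message for a later attempt.
                still_pending.append((recipient, text))
        self.pending = still_pending
        return delivered
```

Calling `deliver` again once the recipient appears in `reachable` models the retry that happens when a terminal is switched on or re-enters the service area.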

As described above, conventional mobile communication terminals are equipped with user-dependent, vocabulary-dependent voice recognition technology. The user must go through two or three training passes before using a voice command, and the number of usable words is limited, so the technology has been used only for limited functions such as dialing by name.

In addition, because the central processing unit built into a mobile communication terminal is slow, it has not been possible to modify the algorithm or add voice databases to raise the recognition rate, so the recognition rate has been low.

Accordingly, the present invention was devised to solve the conventional problems described above. Its object is to provide a system and method for controlling a mobile communication terminal by voice recognition in which the terminal extracts only the voice features and transmits them to a voice recognition server at the base station, and the voice recognition server receives the voice features and either registers them as a voice command or transmits a previously registered voice command code to the terminal, so that the terminal can be controlled by voice recognition.

FIG. 1 is an exemplary view showing the overall configuration of a system according to the present invention.

FIG. 2 is an exemplary view showing the signal flow of the voice command registration process according to the present invention.

FIG. 3 is an exemplary view showing the signal flow of the voice command recognition process according to the present invention.

*** Description of reference numerals for the main parts of the drawings ***

10: voice feature extraction unit    20: vocabulary registration unit

30: database    40: voice recognition processing unit

To achieve this object, the present invention provides a voice recognition apparatus using a mobile communication terminal, characterized in that the terminal is further provided with a voice feature extraction unit that extracts voice features from a voice command spoken by the user so that they can be transmitted to the base station, and the base station system is provided with a voice recognition server comprising: a vocabulary registration unit that searches a phoneme database using the voice features transmitted from the terminal and recognizes and registers the corresponding vocabulary; a database that stores the recognized vocabulary for each user; and a voice recognition processing unit that, after vocabulary registration is complete, recognizes a voice command issued by the user by comparing it with the registered vocabulary.

The present invention configured as described above also provides a method of controlling a mobile communication terminal by voice recognition, characterized by comprising the steps of: extracting voice features from the user's voice input through the terminal; determining whether the extracted voice features belong to a voice command registration process or a voice command recognition process, further including, in the case of registration, a teleservice number, a command number, and the like preset by the operator, and transmitting them to the voice recognition server of the base station system using a short message or a data service protocol; when the transmission is a voice command registration, the voice recognition server searching its built-in phoneme database for the voice features of the voice command transmitted from the terminal, extracting the matching phonemes to compose a character string, storing the recognized character string in the database of the respective user, and transmitting the result to the terminal; and when the transmission is a voice command recognition, the voice recognition server searching the database in which each user's registered vocabulary is stored for the voice features of the voice command transmitted from the terminal, extracting the matching word, and transmitting to the terminal a result code containing the teleservice number, command number, and the like registered together with that word.

Hereinafter, an embodiment of the present invention is described in detail with reference to the accompanying drawings.

FIG. 1 is an exemplary view showing the overall configuration of a system according to the present invention. The terminal is provided with a voice feature extraction unit (10) that extracts voice features from the user's speech so that they can be transmitted to the base station. In addition to the existing exchange server, the base station system is further provided with a dedicated voice recognition server (SRS), which comprises: a vocabulary registration unit (20) that searches a phoneme database (TDB) using the voice features transmitted from the terminal and recognizes and registers the corresponding vocabulary; a database (30) that stores the recognized vocabulary for each user; and a voice recognition processing unit (40) that, after vocabulary registration is complete, recognizes a voice command issued by the user by comparing it with the registered vocabulary.

Of course, descriptions of general components of the mobile communication terminal and the base station system that would be known to a person of ordinary skill in the art, such as the wireless data transceiver and the voice input/output unit, are omitted.

The operation of the present invention configured as described above is outlined below.

First, to register a voice command, the mobile communication terminal sends to the voice recognition server (SRS) of the base station system a preset teleservice number, the extracted voice features, the command number to be returned after voice recognition, and the like.

The voice recognition server (SRS) of the base station system, having received the voice registration message, registers the recognized word together with the teleservice number, command number, and the like in the database (30) of the corresponding user, and transmits an ACK message indicating whether registration succeeded or failed.

Once voice registration is complete through the above process, whenever a voice command is subsequently input to the terminal, the terminal extracts and transmits the voice features of that command, and the voice recognition server (SRS) compares them with the vocabulary already registered in its database (30) and transmits the corresponding teleservice number, command number, and the like to the terminal.

The terminal that receives this performs the operation corresponding to the command number.
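The message contents and the terminal-side dispatch outlined above might look like the sketch below. The field names, the `COMMANDS` table, and the numeric values are hypothetical; the description only states that a teleservice number, the voice features, and a command number are exchanged, and that the terminal acts on the command number it receives.

```python
from dataclasses import dataclass
from typing import List

@dataclass
class RegistrationMessage:
    teleservice_id: int      # operator-preset teleservice number (value is hypothetical)
    features: List[float]    # voice features extracted on the terminal
    command_id: int          # command number to be returned after recognition

@dataclass
class ResultCode:
    teleservice_id: int
    command_id: int

# Hypothetical mapping from command numbers to terminal operations.
COMMANDS = {1: "dial_home", 2: "open_phonebook"}

def execute(result: ResultCode) -> str:
    """Look up the operation for the command number returned by the server."""
    return COMMANDS.get(result.command_id, "unknown_command")
```

Keeping the command table on the terminal means the server only has to return a small numeric code, which fits comfortably in a short message.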

The voice registration process and the voice recognition process using the apparatus configured as described above are now described in more detail.

First, when voice registration or voice recognition processing is started on the terminal through the voice recognition command menu or an assigned button, the voice feature extraction unit (10) of the mobile communication terminal receives the user's voice from the terminal's microphone or the like, extracts the start and end points of the utterance, and then extracts the voice features using an LPC (Linear Predictive Coding) algorithm.
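The two terminal-side steps above, endpoint detection followed by LPC analysis, can be sketched in plain Python. This is one common way to do it and not necessarily the patent's: the energy threshold, frame length, and model order are assumptions, and the autocorrelation method with the Levinson-Durbin recursion is a standard technique chosen here for illustration.

```python
def find_endpoints(samples, frame_len=4, threshold=0.01):
    """Return (start, end) sample indices of the span whose frame energy exceeds threshold."""
    active = []
    for i in range(0, len(samples) - frame_len + 1, frame_len):
        frame = samples[i:i + frame_len]
        energy = sum(s * s for s in frame) / frame_len
        if energy > threshold:
            active.append(i)
    if not active:
        return None  # no speech found
    return active[0], active[-1] + frame_len

def autocorrelation(frame, max_lag):
    """Autocorrelation r[0..max_lag] of one frame."""
    return [sum(frame[n] * frame[n - lag] for n in range(lag, len(frame)))
            for lag in range(max_lag + 1)]

def lpc(frame, order):
    """Levinson-Durbin recursion.

    Returns predictor coefficients a[1..order] such that x[n] is
    approximated by sum over k of a[k] * x[n - k].
    """
    r = autocorrelation(frame, order)
    a = [0.0] * (order + 1)
    error = r[0]
    for i in range(1, order + 1):
        if error == 0:
            break  # silent frame: leave remaining coefficients at zero
        acc = r[i] - sum(a[j] * r[i - j] for j in range(1, i))
        k = acc / error  # reflection coefficient for this order
        new_a = a[:]
        new_a[i] = k
        for j in range(1, i):
            new_a[j] = a[j] - k * a[i - j]
        a = new_a
        error *= (1 - k * k)
    return a[1:]
```

In use, the terminal would call `find_endpoints` on the recorded samples, split the detected span into frames, and run `lpc` on each frame to obtain the feature vectors that are sent to the server.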

Next, the terminal transmits the voice features extracted by the voice feature extraction unit (10) to the voice recognition server (SRS) of the base station system using a short message or a data service protocol.

In the case of voice registration, the voice recognition server (SRS) of the base station system then searches the phoneme database (TDB) for phonemes that match the received voice features and registers the matching word.

In other words, the vocabulary registration unit (20) preprocesses the voice registration message received through the base station system, processes it according to Korean phonological phenomena such as vowel harmony and palatalization, searches its built-in phoneme database (TDB), and extracts the matching phonemes to generate a character string (word). The word generated in this way is stored in the database (30) of the respective user.

The voice registration message may include the voice features of the voice command spoken by the user as well as a teleservice number, a command number, and the like preset by the operator.
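The server-side registration step can be sketched as a nearest-phoneme lookup followed by a per-user store. Everything specific here is an assumption: the phoneme database is modelled as one reference vector per label, matching uses squared Euclidean distance, and the Korean phonological processing mentioned above (vowel harmony, palatalization) is not modelled at all.

```python
def nearest_phoneme(frame, phoneme_db):
    """Return the phoneme label whose reference vector is closest to the frame."""
    def sq_dist(ref):
        return sum((x - y) ** 2 for x, y in zip(frame, ref))
    return min(phoneme_db, key=lambda label: sq_dist(phoneme_db[label]))

def register(user_db, user_id, frames, phoneme_db, teleservice_id, command_id):
    """Decode frames to a phoneme string and store it with its command numbers."""
    labels = [nearest_phoneme(f, phoneme_db) for f in frames]
    # Collapse consecutive duplicates so a sustained sound yields one phoneme.
    collapsed = [lab for i, lab in enumerate(labels) if i == 0 or lab != labels[i - 1]]
    word = "".join(collapsed)
    # Each user gets a separate vocabulary, keyed by the decoded word.
    user_db.setdefault(user_id, {})[word] = (teleservice_id, command_id)
    return word
```

Because the stored entry is a character string and not a recording of one speaker, a single utterance is enough to register a command, which is the convenience the description emphasizes.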

Next, when voice recognition is performed after voice registration has been completed as described above, the server searches the database (30) of the corresponding user for the word that matches the received voice features and transmits the result code back to the terminal using a short message or a data service protocol.
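A recognition lookup against one user's registered vocabulary might be sketched as below. The approach is an assumption made for illustration: the incoming features are taken to be already decoded into a phoneme string, and candidates are ranked by Levenshtein distance with a hypothetical `max_distance` cutoff. The description itself only says that the matching word is searched for and a result code is returned.

```python
def edit_distance(a, b):
    """Levenshtein distance between two strings."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        cur = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            cur.append(min(prev[j] + 1, cur[j - 1] + 1, prev[j - 1] + cost))
        prev = cur
    return prev[-1]

def recognize(user_db, user_id, decoded, max_distance=1):
    """Return (teleservice_id, command_id) for the closest registered word, or None."""
    entries = user_db.get(user_id, {})
    if not entries:
        return None  # this user has registered nothing
    best = min(entries, key=lambda word: edit_distance(decoded, word))
    if edit_distance(decoded, best) > max_distance:
        return None  # nothing close enough to count as a match
    return entries[best]
```

The returned pair plays the role of the result code: it is small enough to send back in a short message, and the terminal only needs the command number to act.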

FIGS. 2 and 3 are exemplary views showing the signal flow of the voice command registration and recognition processes according to the present invention. First, in FIG. 2, when a voice command registration message is transmitted to the base station system (S11), the base station system receives it, transmits an acknowledgment (ACK) signal (S12), and forwards the voice command registration message to the voice recognition server (S13).

The voice recognition server then registers the voice command through its internal components and transmits the result back to the terminal through the base station system (S14, S15).

Next, once the voice command has been registered in the voice recognition server of the base station system as described above, when a voice command is subsequently input to the terminal, the terminal transmits to the base station system a voice recognition message generated through the processing it performs, such as voice feature extraction (S21).

On receiving this, the base station system transmits an acknowledgment (ACK) to the terminal (S22) and forwards the voice command recognition message to the voice recognition server (S23).

The voice recognition server then, as described above, searches the database in which each user's vocabulary is stored for the matching word and the command number and other data registered with it, and transmits the result code to the terminal through the base station system (S24, S25).
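The two signal flows share the same shape, which the following sketch traces step by step. The function `run_exchange` and the `server` callback are hypothetical; only the step labels S11 to S15 and S21 to S25 and the direction of each message come from the description.

```python
def run_exchange(kind, server):
    """Trace one terminal -> base station -> server round trip.

    kind is 'register' (steps S11-S15) or 'recognize' (steps S21-S25).
    server is a callable that handles the message and returns a result.
    """
    base = {"register": 10, "recognize": 20}[kind]
    trace = [
        (f"S{base + 1}", "terminal -> base station", kind),
        (f"S{base + 2}", "base station -> terminal", "ACK"),
        (f"S{base + 3}", "base station -> server", kind),
    ]
    result = server(kind)
    trace.append((f"S{base + 4}", "server -> base station", result))
    trace.append((f"S{base + 5}", "base station -> terminal", result))
    return trace
```

Note that the base station acknowledges receipt (S12, S22) before the server has produced a result, so the terminal learns the outcome only from the final message (S15, S25).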

As described above, the present invention places a voice recognition server in the base station system and uses the terminal as a client, so that speech is recognized through communication between the two. This makes possible user-independent, vocabulary-independent voice recognition, which could not previously be implemented because of the limits of the terminal's memory and processor. Because the user no longer needs to train a voice command by pronouncing it several times, voice commands become more convenient to use. In addition, voice recognition with a high recognition rate is implemented simply by adding a small program that extracts voice features to an existing terminal, which minimizes the increase in manufacturing cost.

Claims

A system for controlling a mobile communication terminal by voice recognition, characterized in that the terminal further comprises a voice feature extraction unit that extracts a voice feature from a voice command spoken by a user so that it can be transmitted to a base station, and the base station system comprises a voice recognition server comprising: a vocabulary registration unit that searches a phoneme database using the voice feature transmitted from the terminal and recognizes and registers the corresponding vocabulary; a database that stores the recognized vocabulary for each user; and a voice recognition processing unit that, after vocabulary registration is complete, recognizes a voice command issued by the user by comparing it with the registered vocabulary.

A method of controlling a mobile communication terminal using voice recognition, the method comprising: extracting a voice feature from a voice of a user input through a terminal;

determining whether the extracted voice feature belongs to a voice command registration process or a voice command recognition process, further including, in the case of the voice command registration process, a teleservice number, a command number, and the like preset by an operator, and transmitting them to a voice recognition server of a base station system using a short message or a data service protocol;

when the voice feature transmission is a voice command registration process, the voice recognition server searching its built-in phoneme database for the voice feature of the voice command transmitted from the terminal, extracting the matching phonemes to compose a character string, storing the recognized character string in the database of the respective user, and transmitting the result to the terminal; and

when the voice feature transmission is a voice command recognition process, the voice recognition server searching the database in which each user's registered vocabulary is stored for the voice feature of the voice command transmitted from the terminal, extracting the matching word, and transmitting to the terminal a result code including the teleservice number, command number, and the like registered together with the extracted word.

delete