KR20040107232A

KR20040107232A - Speech recognition method

Info

Publication number: KR20040107232A
Application number: KR1020030038242A
Authority: KR
Inventors: 임규형
Original assignee: 현대자동차주식회사
Priority date: 2003-06-13
Filing date: 2003-06-13
Publication date: 2004-12-20

Abstract

PURPOSE: A speech recognition method is provided to improve a speech recognition rate and simplify hardware configuration for speech recognition by performing speech recognition based on a command utilizing pattern of a user. CONSTITUTION: An instruction database is searched for an instruction word similar to an instruction when a user gives the instruction using his/her voice(S4). When the user gives the instruction using his/her voice, a user database is searched for an instruction word frequently used in the corresponding situation(S5). One of the instruction word searched in the instruction database and the instruction word searched in the user database is recognized as the voice instruction of the user(S6).

Description

Speech recognition method {Speech recognition method}

본 발명은 음성 인식 방법에 관한 것으로, 좀더 상세하게는 입력 음성에 포함되는 언어의 정보를 기계적으로 추출하여 그 의미 내용을 인식하는 음성 인식 방법에 관한 것이다.The present invention relates to a speech recognition method, and more particularly, to a speech recognition method for mechanically extracting information of a language included in an input speech and recognizing its meaning.

최근 들어, 음성 인식 기술이 급격히 발전함에 따라, 자동차에도 음성 인식 기술을 적용시키고자 하는 다양한 노력이 이루어지고 있다.Recently, with the rapid development of speech recognition technology, various efforts have been made to apply the speech recognition technology to automobiles.

상기한 바와 같은 음성 인식 기술은 인식해야할 언어들을 미리 가지고 있다가 사용자가 하는 말과 가장 근접한 언어를 선택하여 출력하는 방법을 사용하고 있는데, 이러한 기술을 자동차에 적용하기 위해서는 자동차에서 사용 가능한 명령어들을 선정한 후 음성 인식 시스템에 저장하고, 미리 저장되어 있는 명령어들 가운데에서 사용자가 발성한 명령어와 가장 근접한 명령어를 찾아서 이 명령어에 따라 자동차 제어 동작을 수행하면 된다.The speech recognition technology described above uses a method of having languages to be recognized in advance and selecting and outputting a language that is closest to the user's words. After that, it is stored in the voice recognition system, and among the pre-stored commands, the command closest to the user uttered command is found and the vehicle control operation is performed according to the command.

그러나, 상기한 바와 같은 음성 인식 기술은 아직까지 음성 인식률이 사용자의 기대에 미치지 못하므로 상기한 음성 인식 기술을 자동차에 적용시키기에는 무리가 있다.However, the speech recognition technology as described above is still difficult to apply the speech recognition technology to automobiles because the speech recognition rate does not meet the expectations of the user.

자동차 환경에서는 음성 인식 기술의 최대 걸림돌인 극한의 노이즈가 발생되기 쉬우므로 음성 인식률을 향상시키기 어려운 것이다. 즉, 음성 인식 엔진 자체의 성능 향상에 의한 음성 인식률 향상에도 한계가 있고, 이와 같은 방법으로 자동차 환경의 다양한 노이즈 발생에 모두 대처한다는 것 또한 매우 어려운 과제이다.In the automotive environment, it is difficult to improve the speech recognition rate because it is easy to generate extreme noise, which is the biggest obstacle to the speech recognition technology. In other words, there is a limit to the improvement of the speech recognition rate by the performance of the speech recognition engine itself, and it is also very difficult to cope with the occurrence of various noises in the vehicle environment in this way.

이에, 본 발명은 상기한 바와 같은 과제를 해소하기 위해 발명된 것으로, 사용자의 명령어 사용 패턴을 기반으로 하여 음성 인식 판정을 함으로써, 음성 인식률을 향상시킬 수 있을 뿐만 아니라 음성 인식을 위한 하드웨어 구성이 매우 간단해지도록 하는 음성 인식 방법을 제공하는데 그 목적이 있다.Accordingly, the present invention has been invented to solve the above problems, and by determining the speech recognition based on the user's command usage pattern, not only can the speech recognition rate be improved, but also the hardware configuration for speech recognition is very high. It is an object of the present invention to provide a speech recognition method that is simplified.

상기한 바와 같은 목적을 달성하기 위한 본 발명에 따른 음성 인식 방법은, 사용자가 음성으로 명령을 하면 이에 근접한 명령어를 명령어데이터베이스에서 검색하는 단계와, 사용자가 음성으로 명령을 하면 이 상황에서 많이 사용되는 명령어를 사용자데이터베이스에서 검색하는 단계와, 상기 명령어데이터베이스에서 검색된 명령어와 상기 사용자데이터베이스에서 검색된 명령어 중에서 어느 하나의 명령어를 사용자의 음성 명령으로 인식 판정하는 단계를 포함하여 구성된 것을 특징으로 한다.Speech recognition method according to the present invention for achieving the object as described above, the step of searching for a command in close proximity to the command database when the user commands by voice, and when the user commands by voice is used a lot in this situation And retrieving a command from a user database, and recognizing and determining one of the commands retrieved from the command database and the commands retrieved from the user database as a voice command of a user.

상기 사용자데이터베이스 검색단계는, 사용자가 음성으로 명령을 하면 이 상황을 나타내는 특정인자 또는 입력요소에 근거하여 많이 사용된 명령어를 사용자데이터베이스에서 검색하는 것을 특징으로 한다.The user database search step is characterized in that when a user commands by voice, the user database searches for a command frequently used based on a specific factor or an input element representing this situation.

상기 사용자데이터베이스 검색단계는, 사용자가 음성으로 명령을 하면 이 시점에 해당하는 계절, 날짜, 시간대에 많이 사용된 명령어를 사용자데이터베이스에서 검색하는 것을 특징으로 한다.The user database search step is characterized in that when the user commands by voice, the user database searches for a command frequently used in seasons, dates, and time zones corresponding to this point in time.

상기 음성 인식 판정 단계는, 상기 명령어데이터베이스에서 검색된 명령어와 상기 사용자데이터베이스에서 검색된 명령어가 서로 다르면, 명령어데이터베이스에서 검색된 명령어의 유사정도(%)와 사용자데이터베이스에서 검색된 명령어의 사용빈도(%)중에서 큰 값을 가지는 명령어를 사용자의 음성 명령으로 인식 판정하는 것을 특징으로 한다.In the voice recognition determination step, if the command retrieved from the command database and the command retrieved from the user database are different from each other, a larger value is used among the similarity (%) of the command retrieved from the command database and the usage frequency (%) of the command retrieved from the user database. Characterized in that it is determined to recognize the command having a voice command of the user.

상기 인식 판정된 명령어가 맞는지를 사용자가 확인하는 단계와, 상기 사용자에 의해 맞다고 확인된 명령어에 따라 명령을 수행하는 단계와, 상기 인식 판정된 명령어에 대해 사용자가 맞다고 확인하면 상기 사용자데이터베이스에서 상기 명령어의 사용빈도를 업데이트하는 단계를 더 포함하여 구성된 것을 특징으로 한다.A user confirming whether the recognition command is correct, performing a command according to the command confirmed to be correct by the user, and if the user confirms that the command is correct for the recognition command, the user database checks the command Characterized in that it further comprises the step of updating the frequency of use.

도 1은 본 발명에 따른 음성 인식 방법의 순서도이고,1 is a flowchart of a speech recognition method according to the present invention,

도 2는 본 발명에 따른 음성 인식 방법을 수행하는 시스템의 구성도이다.2 is a block diagram of a system for performing a voice recognition method according to the present invention.

<도면의 주요 부분에 대한 부호의 설명><Explanation of symbols for the main parts of the drawings>

10 : 음성입력부 20 : 음성처리부10: voice input unit 20: voice processing unit

30 : 명령어데이터베이스 40 : 사용자데이터베이스30: command database 40: user database

50 : 엑추에이터50: actuator

이하, 첨부된 도면을 참조하여 본 발명의 바람직한 실시예를 설명한다.Hereinafter, exemplary embodiments of the present invention will be described with reference to the accompanying drawings.

도 1은 본 발명에 따른 음성 인식 방법의 순서도이고, 도 2는 본 발명에 따른 음성 인식 방법을 수행하는 시스템의 구성도이다.1 is a flow chart of a speech recognition method according to the present invention, Figure 2 is a block diagram of a system for performing the speech recognition method according to the present invention.

본 발명에 따른 음성 인식 방법은 도 1에 도시된 바와 같이, 단계(S1)에서 사용자가 음성으로 명령을 하면 단계(S2, S3)에서 미도시된 음성 인식 시스템이 음성 인식을 시작하여 명령어를 분석하기 시작한다. 즉, 음성입력부(10)를 통해 사용자의 음성이 음성인식부(20)로 입력되고, 상기 음성인식부(20)는 명령어데이터베이스(30)와 사용자데이터베이스(40)를 근거로 사용자의 음성 명령을 분석하는 것이다.In the voice recognition method according to the present invention, as shown in FIG. 1, when a user commands a voice in step S1, a voice recognition system not shown in steps S2 and S3 starts voice recognition to analyze a command. To start. That is, the voice of the user is input to the voice recognition unit 20 through the voice input unit 10, and the voice recognition unit 20 receives the voice command of the user based on the command database 30 and the user database 40. To analyze.

이어서, 단계(S4)에서는 음성 인식 시스템이 사용자의 음성 명령에 가장 근접한 명령어를 명령어데이터베이스(30)에서 검색하고, 단계(S5)에서는 음성 인식 시스템이 사용자가 음성으로 명령한 상황에서 가장 많이 사용되는 명령어를 사용자데이터베이스(40)에서 검색한다.Subsequently, in step S4, the voice recognition system searches for a command closest to the user's voice command in the command database 30, and in step S5, the voice recognition system is most frequently used in a situation where the user commands by voice. The command is retrieved from the user database 40.

이때, 상기 사용자데이터베이스(40)에는 계절, 날짜, 시간대 등과 같은 여러 가지 상황에 따라 많이 사용되는 명령어의 사용빈도를 저장하고 있고, 사용자가 음성으로 명령을 한 시점에 해당하는 계절, 날짜, 시간대 등에서 많이 사용된 명령어를 사용자데이터베이스에서 검색하는 것이다.In this case, the user database 40 stores the frequency of use of commands that are frequently used according to various situations such as seasons, dates, time zones, and the like. The most common command is to search the user database.

이어서, 단계(S6)에서는 음성 인식 시스템이 상기 명령어데이터베이스(30)에서 검색된 명령어와 상기 사용자데이터베이스(40)에서 검색된 명령어 중에서 어느 하나의 명령어를 사용자의 음성 명령으로 인식 판정한다.Subsequently, in step S6, the voice recognition system recognizes one of the commands retrieved from the command database 30 and the commands retrieved from the user database 40 as a voice command of the user.

예를 들어, 상기 명령어데이터베이스(30)에서 검색된 명령어와 상기 사용자데이터베이스(40)에서 검색된 명령어가 서로 같으면, 이 명령어를 사용자의 음성 명령으로 인식 판정하고, 상기 명령어데이터베이스(30)에서 검색된 명령어와 상기 사용자데이터베이스(40)에서 검색된 명령어가 서로 다르면, 명령어데이터베이스(30)에서 검색된 명령어의 유사정도(%)와 사용자데이터베이스(40)에서 검색된 명령어의 사용빈도(%)중에서 큰 값을 가지는 명령어를 사용자의 음성 명령으로 인식 판정한다.For example, if the command retrieved from the command database 30 and the command retrieved from the user database 40 are the same as each other, it is determined that the command is recognized as a voice command of the user, and the command retrieved from the command database 30 and the If the commands retrieved from the user database 40 are different from each other, a command having a larger value among the similarity (%) of the commands retrieved from the command database 30 and the frequency of use (%) of the commands retrieved from the user database 40 is determined. The voice command determines the recognition.

즉, 2개의 데이터베이스에서 검색한 결과를 비교하여 사용자의 음성 명령을 인식 판정함에 따라 사용자의 음성 명령에 대한 음성 인식률을 향상시킬 수 있는 것이다.That is, the voice recognition rate of the user's voice command can be improved by comparing the search results of the two databases and determining the user's voice command.

이어서, 단계(S7)에서는 상기 단계(S6)에서 인식 판정된 명령어를 사용자에게 들려주고 사용자가 맞는지 또는 틀린지를 확인하여 틀리다고 하면 상기 단계(S1)를 반복 수행하고, 맞다고 하면 단계(S8, S9)를 수행한다.Subsequently, in step S7, if the command determined in step S6 is notified to the user and the user is correct or wrong, the step S1 is repeated, and if it is correct, steps S8 and S9. Perform

상기 단계(S8)에서는 음성 인식 시스템의 제어에 따라 각종 액추에이터들(50)이 작동하여 사용자의 명령을 수행하고, 단계(S9)에서는 상기 단계(S6)에서 인식 판정된 명령어의 사용빈도를 상기 사용자데이터베이스에 업데이트시켜서 다음 음성 인식시에 근거자료로 사용한다.In the step S8, various actuators 50 operate under the control of the speech recognition system to perform a user's command. In step S9, the user's frequency of use of the command determined and recognized in the step S6 is determined. Update it to the database and use it as the basis for the next speech recognition.

이상에서 설명한 바와 같이 본 발명에 따르면, 사용자의 명령어 사용 패턴을 기반으로 하여 음성 인식 판정을 함으로써, 음성 인식률을 향상시킬 수 있을 뿐만 아니라 음성 인식을 위한 하드웨어 구성이 매우 간단해지도록 하는 효과가 있다.As described above, according to the present invention, by determining the speech recognition based on the user's command usage pattern, not only can the speech recognition rate be improved, but also the hardware configuration for the speech recognition becomes very simple.

Claims

Retrieving a command adjacent to the command from the command database when the user performs a voice command; retrieving a command frequently used in this situation from the user database when the user commands by the voice; And recognizing and determining any one command among the commands searched in the user database as the voice command of the user.

The method of claim 1, wherein the searching of the user database comprises searching a user database for a command frequently used based on a specific factor or an input element indicating a situation when the user makes a voice command.

The voice recognition method of claim 2, wherein the searching of the user database comprises searching a user database for a command frequently used in seasons, dates, and time zones corresponding to the time point when the user makes a voice command.

The method of claim 1, wherein the determining of the voice recognition comprises: when the command retrieved from the command database and the command retrieved from the user database are different from each other, the degree of similarity (%) of the command retrieved from the command database and the frequency of use of the command retrieved from the user database. A voice recognition method, characterized in that for determining a command having a large value among (%) as a voice command of the user.

The speech recognition method of claim 1, further comprising: checking, by a user, whether the command determined to be recognized is correct, and performing a command according to the command confirmed as being correct by the user.

The voice recognition method of claim 5, further comprising updating a frequency of use of the command in the user database when the user determines that the command is correct.