KR20190061705A

KR20190061705A - Apparatus and method for recognizing voice using manual operation

Info

Publication number: KR20190061705A
Application number: KR1020170160366A
Authority: KR
Inventors: 신용진
Original assignee: 현대자동차주식회사; 기아자동차주식회사
Priority date: 2017-11-28
Filing date: 2017-11-28
Publication date: 2019-06-05
Also published as: KR102417901B1

Abstract

According to the present invention, an apparatus for recognizing voice using manual operation comprises: an input unit receiving a command from a user; and a control unit recognizing manual operation corresponding to the command when the command received by the input unit is misrecognized, determining the command as a candidate command by tagging the received command and manual operation, and determining the candidate command as a formal command when the candidate command is learned. The apparatus may store a term desired by a user as the formal command and allow the term to be recognized as voice. By using the term desired by the user as a voice command, the apparatus may perform an operation like control of an in-vehicle device even if the voice command is not a voice recognition command corresponding to a pre-stored command.

Description

[0001] APPARATUS AND METHOD FOR RECOGNIZING VOICE USING MANUAL OPERATION [0002]

본 발명은 수동조작을 이용한 음성인식 장치 및 방법에 관한 것으로, 보다 자세하게는 명령어의 음성인식이 실패한 경우, 이후 명령어에 대응하는 수동조작을 통하여 기능을 동작시키는 것을 기반으로 해당 명령어가 인식되도록 하는 음성인식 장치 및 방법에 관한 것이다. The present invention relates to a voice recognition apparatus and method using manual operation, and more particularly, to a voice recognition apparatus and a voice recognition method for recognizing a command based on operating a function through a manual operation corresponding to a subsequent command And to a method and apparatus for recognition.

음성인식이란 음성에 포함된 음향 신호, 즉 음향학적 정보로부터 음운, 즉 언어적 정보를 추출하여 이를 기계적으로 인식하는 기술이다. 인간 대 기계 인터페이스 방법론 중 음성인식은 가장 편리한 방법으로 여겨지지만 음성인식을 통해 인간이 기계를 제어하기 위해서는 인간의 음성을 기계가 처리할 수 있는 정보로 변환하여야 하며, 이러한 변환 과정이 음성인식 기술이다.Speech recognition is a technology that extracts phonemes, that is, linguistic information from acoustic signals included in speech, that is, acoustic information, and mechanically recognizes them. Speech recognition is considered to be the most convenient method among human-machine interface methodologies. However, in order to control a machine through speech recognition, a human voice must be converted into information that can be processed by a machine. .

최근에는 음성인식 기술을 차량에 적용하여 차량의 편의 장치, 예컨대 윈도우의 승강, 와이퍼의 작동 및 정지, 에어컨의 작동, 전조등의 점등 및 소등의 또는 멀티미디어 기기의 작동을 사용자의 음성 명령에 따라 제어하는 기술이 사용되고 있다. In recent years, speech recognition technology has been applied to a vehicle to control the convenience of the vehicle, such as elevating and lowering the window, activating and deactivating the wiper, operating the air conditioner, lighting and turning off the headlight, Technology is being used.

그러나 차량에서 사용자의 음성 명령에 대해 음성인식이 이루어지지 않는 경우, 전통적인 헤드유닛을 통해 버튼 등을 조작하여 차량 내 기기의 작동을 제어하는 방법 및 터치스크린을 통해 차량 내 기기를 제어하는 수동제어 방법으로 멀티미디어 기기의 작동을 수행하여야 한다. However, in the case where speech recognition is not performed on the voice command of the user in the vehicle, a method of controlling the operation of the in-vehicle apparatus by operating a button or the like through a conventional head unit and a manual control method of controlling the in- The operation of the multimedia device should be performed.

따라서, 이러한 수동제어를 통한 차량 내 기기의 제어와 같은 동작을 수행하는 음성인식 명령어가 차량 내 기기의 음성인식을 통한 제어에 사용되게 되며, 사용자는 수동제어 방법을 통해 동작을 제어하던 기능을 음성인식을 통해 제어하기 위하여 수동 명령어에 해당하는 음성인식 명령어를 기억하여야 할 필요가 있다.Therefore, the voice recognition command for performing the same operation as the control of the in-vehicle apparatus through the manual control is used for the control through the voice recognition of the in-vehicle apparatus. It is necessary to memorize the speech recognition command corresponding to the manual command in order to control through recognition.

그러나, 음성인식 명령어를 기억하지 못하는 경우, 사용자가 음성인식을 통하여 차량 내 기기의 제어와 같은 동작을 수행하기 어려운 문제점이 있다. 또한, 음성인식 명령어 이외의 사용자가 직관적으로 의도하고자 하는 명령어를 사용하기 어려운 한계가 있다.However, when the voice recognition command can not be stored, it is difficult for the user to perform the same operation as the control of the in-vehicle apparatus through voice recognition. In addition, there is a limit in which it is difficult to use a command other than a speech recognition command intuitively intended by a user.

본 발명은 상술한 한계점을 극복하기 위한 것으로, 사용자가 명령어를 발화한 이후, 수행된 수동동작과 명령어를 연결하여, 사용자로부터 발화된 명령어를 정식명령어로 추가함으로써, 사용자가 발화한 명령어의 음성인식을 용이하게 하도록 하는 수동조작을 이용한 음성인식 장치 및 방법을 제공하는데 목적이 있다.SUMMARY OF THE INVENTION The present invention has been made in order to overcome the above-described problems, and it is an object of the present invention to provide a method and apparatus for connecting a manual operation and a command, which are performed after a user has uttered a command, And to provide a voice recognition apparatus and method using a manual operation.

본 발명의 수동조작을 이용한 음성인식 장치는 사용자로부터 명령어를 수신하는 입력부 및 상기 입력부에 수신된 명령어가 오인식되면 상기 명령어에 대응하는 상기 사용자의 수동조작을 인식하고, 상기 수신된 명령어 및 상기 수동조작을 태깅하여 상기 명령어를 후보명령어로 판단하고 상기 후보명령어가 학습되면 상기 후보명령어를 정식명령어로 판단하는 제어부를 포함하는 것을 특징으로 한다.A speech recognition apparatus using a manual operation of the present invention includes an input unit for receiving a command from a user and a manual operation of the user corresponding to the command if the command received at the input unit is misrecognized, And judges the candidate instruction as a formal instruction if the candidate instruction is learned.

그리고, 상기 입력부는 자연어를 수신하는 것을 특징으로 한다.The input unit receives the natural language.

그리고, 상기 제어부는 상기 수신된 명령어가 기 저장되어 학습되어진 정식명령어가 아닌 경우, 상기 수신된 명령어가 오인식된 것으로 판단하는 것을 특징으로 한다.The controller determines that the received command is misrecognized if the received command is not a formal command that has been stored and learned.

그리고, 상기 제어부는 상기 사용자가 상기 명령어가 오인식된 후 소정시간 내에 상기 수신된 명령어를 실행하기 위해 수행되는 상기 수동조작을 인식하는 것을 특징으로 한다.The control unit recognizes the manual operation performed by the user to execute the received instruction word within a predetermined time after the instruction word is misrecognized.

그리고, 상기 제어부는 상기 수동조작의 도메인을 판단하면, 상기 도메인 내에서 상기 수신된 명령어와 상기 수동조작을 태깅(tagging)하는 것을 특징으로 한다.The controller may tag the received command and the manual operation in the domain when determining the domain of the manual operation.

그리고, 상기 제어부는 상기 수동조작의 도메인이 판단되지 않으면, 신규 도메인을 생성하는 것을 특징으로 한다.If the domain of the manual operation is not determined, the controller creates a new domain.

그리고, 상기 제어부는 상기 후보명령어가 소정횟수 재수신되어 상기 재수신된 후보명령어와 상기 수동조작을 태깅하는 동작이 소정횟수 반복수행되면, 상기 후보명령어가 학습된 것으로 판단하는 것을 특징으로 한다.The controller determines that the candidate command is learned when the candidate command is re-received a predetermined number of times and the re-received candidate command and the manual operation are tagged repeatedly a predetermined number of times.

그리고, 상기 제어부는 상기 후보명령어가 학습되면, 상기 후보명령어를 정식명령어로 저장할 지 여부를 상기 운전자에게 선택하도록 하는 메세지를 출력하도록 하고, 상기 운전자가 선택하면 상기 후보명령어를 정식명령어로 판단하는 것을 특징으로 한다.When the candidate instruction is learned, the control unit outputs a message to the driver to select whether to store the candidate instruction in the formal instruction word, and if the driver selects the candidate instruction, the control unit determines the candidate instruction as a formal instruction word .

본 발명의 수동조작을 이용한 음성인식 방법은 사용자로부터 명령어를 수신하는 단계와, 상기 수신된 명령어가 오인식되면, 상기 명령어에 대응하는 수동조작을 인식하는 단계와, 상기 수신된 명령어를 후보명령어로 판단하는 단계와, 상기 후보명령어를 학습하는 단계 및 상기 후보명령어가 학습되면, 상기 후보명령어를 정식명령어로 판단하는 단계를 포함하는 것을 특징으로 한다.According to another aspect of the present invention, there is provided a speech recognition method comprising: receiving a command from a user; recognizing a manual operation corresponding to the command if the received command is misrecognized; Determining the candidate instruction as a formal instruction if the candidate instruction is learned; and determining the candidate instruction as a formal instruction if the candidate instruction is learned.

그리고, 사용자로부터 명령어를 수신하는 단계는 자연어를 수신하는 것을 특징으로 한다.The step of receiving a command from a user is characterized by receiving a natural language.

그리고, 상기 수신된 명령어가 오인식되면, 상기 명령어에 대응하는 수동조작을 인식하는 단계에서, 상기 수신된 명령어가 기 저장되어 학습되어진 정식명령어가 아닌 경우, 상기 수신된 명령어를 오인식된 것으로 판단하는 것을 특징으로 한다.If the received instruction word is erroneously recognized, it is determined that the received instruction word is misrecognized when the received instruction word is not a formal instruction that has been learned and stored in the step of recognizing a manual operation corresponding to the instruction word .

그리고, 상기 수신된 명령어가 오인식되면, 상기 명령어에 대응하는 수동조작을 인식하는 단계는 상기 사용자가 상기 명령어가 오인식된 후 소정시간 내에 상기 수신된 명령어를 실행하기 위해 수행되는 상기 수동조작을 인식하는 것을 특징으로 한다.If the received instruction is misrecognized, the step of recognizing the manual operation corresponding to the instruction recognizes the manual operation performed to execute the received instruction within a predetermined time after the user is misidentified .

그리고, 상기 수신된 명령어가 오인식되면, 상기 명령어에 대응하는 수동조작을 인식하는 단계 이후, 상기 수동조작의 도메인을 판단하는 단계를 더 수행하는 것을 특징으로 한다.If the received instruction word is misrecognized, the step of recognizing the manual operation corresponding to the instruction word may further include the step of determining the domain of the manual operation.

그리고, 상기 수신된 명령어를 후보명령어로 판단하는 단계는, 상기 수동조작의 도메인이 판단되면, 상기 도메인 내에서 상기 수신된 명령어와 상기 수동조작을 태깅(tagging)하는 것을 특징으로 한다.The step of determining the received command as a candidate command may tag the received command and the manual operation in the domain if the domain of the manual operation is determined.

그리고, 상기 수신된 명령어를 후보명령어로 판단하는 단계는, 상기 수동조작의 도메인이 판단되지 않으면, 신규 도메인을 생성하여 상기 신규 도메인 내에서 상기 수신된 명령어와 상기 수동조작을 태깅하는 것을 특징으로 한다.The step of determining the received command as a candidate command may include generating a new domain and tagging the received command and the manual operation in the new domain if the domain of the manual operation is not determined .

그리고, 상기 후보명령어를 학습하는 단계는 상기 후보명령어가 소정횟수 재수신되어 상기 후보명령어와 상기 수동조작을 태깅하는 동작을 소정횟수 반복수행하는 것을 특징으로 한다.The learning of the candidate command may include repeating the operation of the candidate command and the manual operation by a predetermined number of times.

그리고, 상기 후보명령어가 학습되면, 상기 후보명령어를 정식명령어로 판단하는 단계는 상기 후보명령어를 정식명령어로 저장할 지 여부를 상기 운전자에게 선택하도록 하는 메세지를 출력하고, 상기 운전자가 선택하면 상기 후보명령어를 정식명령어로 판단하는 것을 특징으로 한다.When the candidate instruction is learned, the step of judging the candidate instruction as a formal instruction may output a message for the driver to select whether to store the candidate instruction as a formal instruction word. When the driver selects the candidate instruction, As a formal command.

본 발명은 사용자가 명령어를 발화한 후, 사용자가 수행한 수동조작을 명령어와 연결하여, 사용자가 발화한 명령어를 정식명령어로 저장하여 음성인식 되도록 할 수 있다. 사용자가 발화한 명령어를 정식명령어로 사용함으로써, 기 저장된 명령어에 해당하는 음성인식 명령어가 아니더라도 차량 내 기기의 제어와 같은 동작을 용이하게 수행할 수 있다. In the present invention, after a user utters a command, a manual operation performed by the user is connected to the command, and the command uttered by the user is stored as a formal command word so as to be recognized. By using a command issued by the user as a formal command, it is possible to easily perform the same operation as the control of the in-vehicle apparatus even if the command is not the speech recognition command corresponding to the pre-stored command.

도 1은 본 발명의 수동조작을 이용한 음성인식 장치이다.
도 2 내지 도 6은 본 발명의 실시예를 나타낸 도면이다.
도 7은 본 발명의 수동조작을 이용한 음성인식 방법을 나타낸 순서도이다.1 is a speech recognition apparatus using manual operation of the present invention.
2 to 6 are views showing an embodiment of the present invention.
7 is a flowchart showing a speech recognition method using manual operation of the present invention.

이하, 본 발명의 일부 실시예들을 예시적인 도면을 통해 상세하게 설명한다. 각 도면의 구성요소들에 참조부호를 부가함에 있어서, 동일한 구성요소들에 대해서는 비록 다른 도면상에 표시되더라도 가능한 한 동일한 부호를 가지도록 하고 있음에 유의해야 한다. 또한, 본 발명의 실시예를 설명함에 있어, 관련된 공지 구성 또는 기능에 대한 구체적인 설명이 본 발명의 실시예에 대한 이해를 방해한다고 판단되는 경우에는 그 상세한 설명은 생략한다.Hereinafter, some embodiments of the present invention will be described in detail with reference to exemplary drawings. It should be noted that, in adding reference numerals to the constituent elements of the drawings, the same constituent elements are denoted by the same reference symbols as possible even if they are shown in different drawings. In the following description of the embodiments of the present invention, a detailed description of known functions and configurations incorporated herein will be omitted when it may make the difference that the embodiments of the present invention are not conclusive.

본 발명의 실시예의 구성 요소를 설명하는 데 있어서, 제 1, 제 2, A, B, (a), (b) 등의 용어를 사용할 수 있다. 이러한 용어는 그 구성 요소를 다른 구성 요소와 구별하기 위한 것일 뿐, 그 용어에 의해 해당 구성 요소의 본질이나 차례 또는 순서 등이 한정되지 않는다. 또한, 다르게 정의되지 않는 한, 기술적이거나 과학적인 용어를 포함해서 여기서 사용되는 모든 용어들은 본 발명이 속하는 기술 분야에서 통상의 지식을 가진 자에 의해 일반적으로 이해되는 것과 동일한 의미를 가진다. 일반적으로 사용되는 사전에 정의되어 있는 것과 같은 용어들은 관련 기술의 문맥상 가지는 의미와 일치하는 의미를 가진 것으로 해석되어야 하며, 본 출원에서 명백하게 정의하지 않는 한, 이상적이거나 과도하게 형식적인 의미로 해석되지 않는다.In describing the components of the embodiment of the present invention, terms such as first, second, A, B, (a), and (b) may be used. These terms are intended to distinguish the constituent elements from other constituent elements, and the terms do not limit the nature, order or order of the constituent elements. Also, unless otherwise defined, all terms used herein, including technical or scientific terms, have the same meaning as commonly understood by one of ordinary skill in the art to which this invention belongs. Terms such as those defined in commonly used dictionaries should be interpreted as having a meaning consistent with the meaning in the context of the relevant art and are to be interpreted in an ideal or overly formal sense unless explicitly defined in the present application Do not.

본 발명은 사용자가 발화한 명령어에 대하여 음성인식이 되지 않는 경우, 사용자가 명령어에 해당하는 기능을 직접 수동으로 조작하는데서 착안된 것이다. 본 발명은 발화한 명령어와 수동조작된 기능을 연결하고 해당 명령어를 정식명령어로판단함으로써, 추후에 사용자가 동일한 명령어를 발화한 경우 음성이 인식되도록 하여 원하는 기능을 수행되도록 할 수 있다. 여기서, 정식명령어는 학습이 완료되어 음성인식이 가능한 명령어를 의미할 수 있다. The present invention is based on the fact that the user manually manipulates a function corresponding to a command in the case where speech recognition is not possible with respect to a command uttered by the user. The present invention can connect a spoken command and a manually operated function and judge the command as a formal command so that the user can recognize the voice when the same command is uttered later so that a desired function can be performed. Herein, the formal instruction may mean a command which is learned and speech recognition is possible.

도 1은 본 발명의 수동조작을 이용한 음성인식 장치이다. 도 1에 도시된 바와 같이, 본 발명의 수동조작을 이용한 음성인식 장치는 입력부(10), 저장부(20), 제어부(30) 및 출력부(40)를 포함할 수 있다. 1 is a speech recognition apparatus using manual operation of the present invention. 1, the speech recognition apparatus using the manual operation of the present invention may include an input unit 10, a storage unit 20, a control unit 30, and an output unit 40.

입력부(10)는 사용자가 발화한 명령어를 수신할 수 있다. 명령어는 '아이폰 뮤직', '화면밝기 좀 올려줘', '홍길동 책임에게 전화걸어줘'등의 자연어를 포함할 수 있다. The input unit 10 can receive a command that the user has uttered. The commands can include natural language such as 'iPhone music', 'Raise the screen brightness', 'Give me a call to Hong Gil-dong'.

입력부(10)는 외부의 음향신호를 입력받는 과정에서 발생 되는 잡음(noise)을 제거하기 위한 다양한 잡음 제거 알고리즘에 기초한 동작을 수행하도록 구성될 수 있다. 입력부(10)는 헤드유닛(head unit, H/U) 내에 구비된 마이크로 폰으로 구성될 수 있다. The input unit 10 may be configured to perform an operation based on various noise reduction algorithms for eliminating noise generated in receiving an external sound signal. The input unit 10 may be a microphone provided in a head unit (H / U).

저장부(20)는 사용자로부터 최초로 발화된 명령어를 도메인과 연결하여 저장할 수 있다. 예를들면, 저장부(20)는 '아이폰 뮤직'라는 명령어를 미디어의 도메인 내에 추가하여 저장할 수 있다. 또한, 저장부(20)는 '화면밝기 좀 올려줘'라는 명령어를 셋업의 도메인 내에 추가하여 저장할 수 있다. 또한, 저장부(20)는 '홍길도 책임에게 전화걸어줘'라는 명령어를 블루투스의 도메인 내에서 추가하여 저장할 수 있다. The storage unit 20 may store the first uttered command from the user in connection with the domain. For example, the storage unit 20 may store the 'iPhone music' command in the domain of the media. In addition, the storage unit 20 may store the 'increase brightness of the screen' command in the domain of the setup. In addition, the storage unit 20 can add and store the command 'Give a phone call to the responsible person' in the Bluetooth domain.

저장부(20)는 플래시 메모리 타입(flash memory type), 하드디스크 타입(hard disk type), 멀티미디어 카드 마이크로 타입(multimedia card micro type), 카드 타입의 메모리(예를 들어 SD 또는 XD 메모리 등), 램(RAM, Random Access Memory), SRAM(Static Random Access Memory), 롬(ROM, Read-Only Memory), EEPROM(Electrically Erasable Programmable Read-Only Memory), PROM(Programmable Read-Only Memory), 자기메모리, 자기 디스크, 또는 광디스크 타입의 저장매체를 포함할 수 있다.The storage unit 20 may be a flash memory type, a hard disk type, a multimedia card micro type, a card type memory (for example, SD or XD memory) (Random Access Memory), SRAM (Static Random Access Memory), ROM (Read Only Memory), EEPROM (Electrically Erasable Programmable Read-Only Memory), PROM (Programmable Read- Magnetic disk, or optical disk type storage media.

제어부(30)는 음성인식모드로 설정하고, 사용자가 의도하고자 하는 기능에 대한 명령어가 입력부(10)에 수신되는 경우, 명령어에 대한 음성신호를 서버로 전송한다. 서버는 수신된 음성신호에 대하여 받아쓰기(DICTATION)의 결과를 차량으로 전송할 수 있다. The control unit 30 sets the voice recognition mode, and when a command for a function intended by the user is received at the input unit 10, the control unit 30 transmits a voice signal for the command to the server. The server can transmit the result of the DICTATION to the vehicle with respect to the received voice signal.

제어부(30)는 입력부(10)에 수신된 명령어가 음성인식 명령어 즉, 기 설정되어 학습된 명령어일 경우, 음성인식이 된 것으로 판단할 수 있다. 또한, 입력부(10)에 수신된 명령어가 음성인식 명령어가 아닌경우 즉, 기 설정된 명령어가 아닌 명령어일 경우 음성이 오인식된 것으로 판단할 수 있다. The control unit 30 can determine that the speech recognition is performed when the command received in the input unit 10 is a speech recognition command, that is, a previously learned command. In addition, if the command received by the input unit 10 is not a voice recognition command, that is, a command other than a predetermined command, it can be determined that the voice is misrecognized.

음성이 오인식된 것으로 판단하는 경우, 제어부(30)는 음성인식모드를 종료할 수 있다. 제어부(30)는 음성인식모드가 종료되면, 사용자가 의도하고자 하는 기능을 직접 조작하도록 수동입력모드로 전환할 수 있다. If it is determined that the voice is misrecognized, the control unit 30 can end the voice recognition mode. When the voice recognition mode is ended, the control unit 30 can switch to the manual input mode so that the user directly manipulates the intended function.

제어부(30)는 수동입력모드로 전환되는 경우, 사용자가 직접 수행한 수동조작을 인식할 수 있다. 여기서, 수동조작은 소정시간 내에 수신된 명령어를 실행하기 위해 헤드유닛 내에 구비된 소정버튼이나 디스플레이를 조작하는 것을 의미할 수 있다. When the control unit 30 is switched to the manual input mode, the control unit 30 can recognize the manual operation performed by the user. Here, the manual operation may mean manipulating a certain button or display provided in the head unit to execute the command received within a predetermined time.

그리고, 수동조작에 따른 동작이 포함되는 도메인이 존재하는지 판단한다. 여기서 도메인은 사용자가 발화한 명령어가 사용될 수 있는 상황을 분류하는 개념으로 이해될 수 있다. 예를들면 미디어, 내비게이션, 블루투스 등으로 분류될 수 있다. Then, it is determined whether a domain including an operation according to a manual operation exists. Here, the domain can be understood as a concept of classifying a situation in which a command uttered by the user can be used. For example, media, navigation, Bluetooth, and the like.

그리고, 각각의 도메인은 상황에 따른 복수의 명령어를 포함할 수 있다. 여기서, 도메인 내에 포함된 복수의 명령어는 기 설정되어 학습된 명령어일 수 있다. 예를들면, 미디어의 도메인은 '음악검색','USB','CD'등의 명령어를 포함할 수 있다. 그리고, 내비게이션의 도메인은 '목적지 설정', '주변 검색' 등의 명령어를 포함할 수 있다. 그리고, 블루투스의 도메인은 '..에게 전화', '전화걸기', '블루투스 기기변경'등의 명령어를 포함할 수 있다. Each domain may include a plurality of instructions depending on the situation. Here, a plurality of commands included in the domain may be pre-set and learned commands. For example, the domain of the media may include commands such as " music search ", " USB ", and " CD ". The domain of the navigation may include commands such as 'destination setting', 'search around', and the like. The domain of Bluetooth may include commands such as 'call to', 'dial up', or 'change Bluetooth device'.

제어부(30)는 인식된 수동조작에 따른 동작이 소정의 도메인이 존재하는 것으로 판단되면, 해당 도메인 내에서 입력부(10)에 수신된 음성신호의 받아쓰기 결과에 수동조작에 따른 기능을 태깅(tagging)할 수 있다. If it is determined that a predetermined domain exists in the operation according to the recognized manual operation, the control unit 30 performs tagging of the function according to the manual operation in the result of dictation of the voice signal received in the input unit 10 in the corresponding domain, can do.

제어부(30)는 수신된 명령와 수동조작이 태깅되면, 수신된 명령어를 후보명령어로 판단하고, 저장부(20)에 임시저장되도록 할 수 있다. 그리고, 후보명령어가 소정횟수 이상 반복되고, 후보명령어와 해당 수동조작이 소정횟수 이상 태깅되면, 제어부(30)는 후보명령어를 정식명령어로 판단할 수 있다. 보다 자세한 설명은 도 2 내지 도 5를 참조한다. When the received command and the manual operation are tagged, the control unit 30 may determine that the received command is a candidate command, and temporarily store the received command in the storage unit 20. [ If the candidate command is repeated a predetermined number of times or more and the candidate command and the corresponding manual operation are tagged more than a predetermined number of times, the control unit 30 can determine the candidate command as a formal command. Refer to Figures 2 to 5 for a more detailed description.

제어부(30)는 사용자로부터 발화된 명령어에 따라 도 2에 도시된 바와 같이, 도메인 내에서 신규 명령어를 추가할 수 있고, 도 3에 도시된 바와 같이, 신규 도메인을 추가할 수 있고, 도 4 및 도 5에 도시된 바와 같이, 통합 명령어를 추가할 수 있다.The control unit 30 may add new commands in the domain as shown in Fig. 2, add new domains as shown in Fig. 3, As shown in FIG. 5, an integrated instruction can be added.

일 실시예에 따르면, 도 2에 도시된 바와 같이, 사용자는 '아이폰 뮤직'이라는 명령어를 입력할 수 있다. 본 발명에서 사용자가 입력한 명령어는 기 저장되어 학습되어진 정식명령어가 아닐 수 있다. 즉, 사용자가 직관적으로 의도하고자 하는 자연어를 포함할 수 있다. According to one embodiment, as shown in FIG. 2, a user may input an instruction 'iPhone music'. In the present invention, a command input by a user may not be a formal command which has been stored and learned. That is, it may include a natural language that a user intends to intend to intend.

제어부(30)는 사용자가 발화한 명령어 '아이폰 뮤직'의 음성신호를 서버로 송신할 수 있고, 서버는 '아이폰 뮤직'에 대한 받아쓰기 결과를 수신할 수 있다. 제어부(30)는 사용자가 입력한 명령어가 기 저장되어 학습되어진 정식명령어가 아니기 때문에 음성신호를 오인식하고 음성인식모드를 종료할 수 있다. The control unit 30 can transmit the voice signal of the command 'iPhone music' uttered by the user to the server, and the server can receive the dictation result of 'iPhone music'. The control unit 30 can recognize the voice signal and terminate the voice recognition mode because the command inputted by the user is not stored and learned as a formal command.

제어부(30)는 음성인식모드가 종료된 경우, 사용자가 직접 조작한 수동조작을 인식할 수 있다. 여기서, 사용자가 직접 조작한 수동조작은 사용자가 입력한 명령어를 실행하기 위해 수행되는 조작일 수 있다. 즉, '아이폰 뮤직'에 대한 수동조작일 수 있다. 예를들면, 수동조작은 아이폰의 음악을 재생하기 위하여 블루투스 오디오로 전환하는 동작을 포함할 수 있으며, 제어부(30)는 수동조작을 통하여 블루투스 오디오 모드로 전환된 것을 인지할 수 있다. When the voice recognition mode is ended, the control unit 30 can recognize the manual operation directly operated by the user. Here, the manual operation directly operated by the user may be an operation performed to execute the command inputted by the user. That is, it may be a manual operation for 'iPhone music'. For example, the manual operation may include switching to Bluetooth audio to reproduce the music of the iPhone, and the controller 30 may recognize that the Bluetooth audio mode has been switched through the manual operation.

제어부(30)는 수동조작 즉, 블루투스 오디오로 전환하는 동작의 도메인을 판단할 수 있다. 실시예에 따르면, 미디어의 도메인으로 판단할 수 있으며, 도메인이 판단되면 해당 도메인 내에서 입력부(10)에 수신된 음성신호의 받아쓰기 결과에 수동조작에 따른 기능을 태깅(tagging)하도록 할 수 있다. 여기서 태깅은 받아쓰기 결과에 키 워드 처리를 해 주는 것으로 이해되는 것이 바람직하고, 수신된 음성신호의 받아쓰기 결과와 수동조작에 따른 기능이 연결되도록 하는 것을 의미할 수 있다. The control unit 30 can determine the domain of the manual operation, that is, the operation of switching to the Bluetooth audio. According to the embodiment, it is possible to determine the domain as the domain of the media. When the domain is determined, the function of the manual operation can be tagged in the result of dictation of the voice signal received in the input unit 10 in the domain. Here, tagging is preferably understood to perform key word processing on a dictation result, and it may mean that a dictation result of a received speech signal and a function according to a manual operation are linked.

제어부(30)는 수신된 명령어 '아이폰 뮤직'과 블루투스 오디오를 전환하는 기능이 태깅되면, 도 6에 도시된 바와 같이, '아이폰 뮤직'을 후보명령어로 판단할 수 있으며, 저장부(20)에 임시저장할 수 있다. 그리고, '아이폰 뮤직'이 소정횟수 이상 수신되고, 아이폰 뮤직과 블루투스 오디오를 전환하는 기능을 태깅하는 것이 소정횟수 이상 수행되면 학습된 것으로 판단할 수 있다. 이는, 후보명령어 '아이폰 뮤직'이 정식명령어로 판단될 수 있는 조건을 만족한 것으로 판단할 수 있다.6, the control unit 30 can determine 'iPhone music' as a candidate command, and when the function to switch the received command 'iPhone music' and Bluetooth audio is tagged, You can save it temporarily. If it is determined that the 'iPhone music' has been received a predetermined number of times or more and tagging the function of switching between the iPhone music and the Bluetooth audio has been performed a predetermined number of times or more, it can be determined that it has been learned. This can determine that the candidate command 'iPhone music' satisfies a condition that can be determined as a formal command.

후보명령어의 학습이 완료된 후, 제어부(30)는 '아이폰 뮤직'이 재수신되는 경우, 태깅된 기능을 수행할지 여부를 사용자에게 선택하도록 하기 위해, 출력부(40)를 통하여 메세지를 음성 또는 이미지로 출력할 수 있다. 즉, '아이폰 뮤직'의 명령어가 수신되면 예를들어 '블루투스 오디오로 전환할까요?'라는 메세지를 출력하고, 사용자에게 선택하도록 할 수 있다. After completion of the learning of the candidate command, the control unit 30 transmits a message through the output unit 40 to the user through the output unit 40 in order to allow the user to select whether or not to perform the tagged function, . That is, when the command of 'iPhone Music' is received, for example, a message 'Would you like to switch to Bluetooth audio?' Is displayed and the user can select it.

사용자가 해당기능을 선택하는 경우, 수신된 명령어를 정식명령어로 판단하고, 저장부(20)에 정식명령어로서 저장될 수 있다. 그러나 사용자가 해당기능을 선택하지 않는 경우 수신된 명령어는 삭제될 수 있다. When the user selects the function, the received command may be determined as a formal command and stored as a formal command in the storage unit 20. [ However, if the user does not select the function, the received command can be deleted.

다른 실시예에 따르면, 도 3에 도시된 바와 같이, 사용자는 '화면밝기 좀 올려줘'라는 명령어를 입력할 수 있다. 본 발명에서 사용자가 입력한 명령어는 기 저장되어 학습되어진 정식명령어가 아닐 수 있다. 즉, 사용자가 직관적으로 의도하고자 하는 명령어를 포함할 수 있다. According to another embodiment, as shown in FIG. 3, the user can input a command 'Raise the screen brightness'. In the present invention, a command input by a user may not be a formal command which has been stored and learned. That is, the user may intuitively intend to include the instruction.

제어부(30)는 사용자가 발화한 명령어 '화면밝기 좀 올려줘'의 음성신호를 서버로 송신할 수 있고, 서버는 '화면밝기 좀 올려줘'에 대한 받아쓰기 결과를 수신할 수 있다. 제어부(30)는 사용자가 입력한 명령어가 기 저장되어 학습되어진 정식명령어가 아니기 때문에 음성신호를 오인식하고 음성인식모드를 종료할 수 있다. The control unit 30 can transmit a voice signal of the command 'raise brightness of the screen' to the server, and the server can receive the result of the dictation of 'raise the brightness of the screen'. The control unit 30 can recognize the voice signal and terminate the voice recognition mode because the command inputted by the user is not stored and learned as a formal command.

제어부(30)는 음성인식모드가 종료된 경우, 사용자가 직접 조작한 수동조작을 인식할 수 있다. 여기서, 사용자가 직접 조작한 수동조작은 사용자가 입력한 명령어에 대한 동작일 수 있다. 즉, '화면밝기 좀 올려줘'에 대한 수동조작일 수 있다. 예를들면, 수동조작은 화면밝기를 밝게하기 위한 소정의 동작, 예를들면 차량 내 구비된 스크롤 휠, 버튼 하우징을 조작하거나 터치스크린의 화면을 터치하는 동작을 포함할 수 있으며, 제어부(30)는 수동조작을 통하여 화면밝기가 제어되는 것을 인지할 수 있다. When the voice recognition mode is ended, the control unit 30 can recognize the manual operation directly operated by the user. Here, the manual operation directly operated by the user may be an operation for the command inputted by the user. That is, it may be a manual operation for 'Raise the screen brightness'. For example, the manual operation may include a predetermined operation for brightening the screen brightness, for example, an operation of operating a scroll wheel, a button housing, or a touch screen of the touch screen, It is possible to recognize that the screen brightness is controlled through manual operation.

제어부(30)는 화면밝기가 제어되도록 하는 동작의 도메인을 판단할 수 있다. 여기서, 도메인이 판단되지 않는 경우, 신규 도메인을 생성할 수 있다. 그리고, 신규 도메인 내에서 입력부(10)에 수신된 음성신호의 받아쓰기 결과에 수동조작에 따른 기능을 태깅(tagging)할 수 있다. 여기서 태깅은 받아쓰기 결과에 키 워드 처리를 해 주는 것으로 이해되는 것이 바람직하고, 수신된 음성신호의 받아쓰기 결과와 수동조작에 따른 기능이 연결되도록 할 수 있다. The control unit 30 can determine the domain of the operation in which the screen brightness is controlled. Here, if the domain is not determined, a new domain can be created. In addition, the function of the manual operation can be tagged in the result of the voice signal received in the input unit 10 in the new domain. It is preferable that the tagging is understood to perform key word processing on the dictation result, and the dictation result of the received voice signal and the function according to the manual operation can be linked.

제어부(30)는 수신된 명령어 '화면밝기 좀 올려줘'와 화면밝기를 제어하는 기능이 태깅되면, '화면밝기 좀 올려줘'을 후보명령어로 판단할 수 있으며, 저장부(20)에 임시저장할 수 있다. 그리고, '화면밝기 좀 올려줘'가 소정횟수 이상 수신되고, 후보명령어와 화면밝기를 제어하는 기능을 태깅하는 것이 소정횟수 이상 수행되면 학습된 것으로 판단할 수 있다. 이는, 후보명령어 '화면밝기 좀 올려줘'가 정식명령어로 판단될 수 있는 조건을 만족한 것으로 판단할 수 있다.The control unit 30 can judge the 'increase brightness of the screen' as a candidate command when the received command 'raise the brightness of the screen' and the function of controlling the brightness of the screen are tagged, and can temporarily store it in the storage unit 20 . If it is determined that a predetermined number of times or more of "increase screen brightness" is received and tagging of the function of controlling the candidate command and screen brightness is performed a predetermined number of times or more, it can be determined that the learning has been performed. This can be determined to satisfy the condition that the candidate command 'raise screen brightness' can be determined as a formal command.

후보 명령어의 학습이 완료된 후, 제어부(30)는 '화면밝기 좀 올려줘'가 재수신되는 경우, 태깅된 기능을 수행할지 여부를 사용자에게 선택하도록 하기 위해, 출력부(40)를 통하여 메세지를 음성 또는 이미지로 출력할 수 있다. 즉, '화면밝기 좀 올려줘'의 명령어가 수신되면 화면밝기를 제어하는 셋팅화면으로 전환할지 여부를 출력하고, 사용자에게 선택하도록 할 수 있다. After the learning of the candidate command is completed, the controller 30 transmits a message through the output unit 40 in order to allow the user to select whether or not to perform the tagged function, Or an image. That is, when the command 'Raise the screen brightness' is received, it is outputted to the setting screen for controlling the brightness of the screen, and the user can select the screen.

또 다른 실시예에 따르면, 도 4에 도시된 바와 같이, 사용자는 '홍길동 책임에게 전화걸어줘'라는 명령어를 입력할 수 있다. 본 발명에서 사용자가 입력한 명령어는 기 저장된 수동 명령어에 해당하고 학습되어진 고정의 음성인식 명령어가 아닐 수 있다. 즉, 사용자가 직관적으로 의도하고자 하는 명령어를 포함할 수 있다. According to another embodiment, as shown in FIG. 4, the user can input an instruction 'call to charge Mr. Hong Gil-dong'. In the present invention, the command input by the user may not correspond to the previously stored manual command and the learned fixed speech recognition command. That is, the user may intuitively intend to include the instruction.

제어부(30)는 사용자가 발화한 명령어 '홍길동 책임에게 전화걸어줘'의 음성신호를 서버로 송신할 수 있고, 서버는 '홍길동 책임에게 전화걸어줘'에 대한 받아쓰기 결과를 수신할 수 있다. 제어부(30)는 사용자가 입력한 명령어가 기 저장된 수동 명령어에 해당하고, 학습되어진 고정의 음성인식 명령어가 아니기 때문에 음성신호를 오인식하고 음성인식모드를 종료할 수 있다. The control unit 30 can transmit a voice signal of 'Call to responsible Hong Gil-dong', which is uttered by the user, to the server, and the server can receive the result of the dictation about 'Call to the responsible Mr. Hong Gil.' The control unit 30 may recognize the voice signal and terminate the voice recognition mode because the command inputted by the user corresponds to the manually stored command and is not the fixed voice recognition command that has been learned.

제어부(30)는 음성인식모드가 종료된 경우, 사용자가 직접 조작한 수동조작을 인식할 수 있다. 여기서, 사용자가 직접 조작한 수동조작은 사용자가 입력한 명령어에 대한 동작일 수 있다. 즉, '홍길동 책임에게 전화걸어줘'에 대한 수동조작일 수 있다. 예를들면, 수동조작은 홍길동 책임에게 전화를 걸기위한 소정의 동작, 예를들면 차량 내 구비된 스크롤 휠, 버튼 하우징을 조작하거나 터치스크린의 화면을 터치하는 동작을 포함을 포함할 수 있으며, 제어부(30)는 수동조작을 통하여 전화걸기로 전환되는 것을 인지할 수 있다.When the voice recognition mode is ended, the control unit 30 can recognize the manual operation directly operated by the user. Here, the manual operation directly operated by the user may be an operation for the command inputted by the user. In other words, it can be a manual operation for 'Call me responsible'. For example, the manual operation may include a predetermined operation for making a call to Hong Gil-dong, for example, a scroll wheel provided in the vehicle, an operation of operating a button housing, or an operation of touching a screen of a touch screen, (30) can be recognized as being switched to dialing through manual operation.

제어부(30)는 홍길동 책임에게 전화거는 것의 도메인을 판단할 수 있다. 실시예에 따르면, '블루투스' 도메인으로 판단할 수 있으며, 도메인이 판단되면 해당 도메인 내에서 입력부(10)에 수신된 음성신호의 받아쓰기 결과에 수동조작에 따른 기능을 태깅(tagging)할 수 있다. 여기서 태깅은 받아쓰기 결과에 키 워드 처리를 해 주는 것으로 이해되는 것이 바람직하고, 수신된 음성신호의 받아쓰기 결과와 수동조작이 연결되도록 할 수 있다. The control unit 30 can determine the domain of the call to Hong Kil Dong. According to the embodiment, it can be determined that the domain is a 'Bluetooth' domain. If the domain is determined, a function according to a manual operation can be tagged in a result of dictation of a voice signal received in the input unit 10 in the domain. Here, it is preferable that tagging is understood to perform key word processing on the dictation result, and the dictation result of the received voice signal and the manual operation can be linked.

제어부(30)는 수신된 명령어 '홍길동 책임에게 전화걸어줘'와 홍길동 책임에게 전화걸기 동작이 태깅되면, '홍길동 책임에게 전화걸어줘'를 후보명령어로 판단할 수 있으며, 이를 저장부(20)에 임시저장할 수 있다. 그리고, '홍길동 책임에게 전화걸어줘'가 소정횟수 이상 수신되고, '홍길동 책임에게 전화걸어줘'와 홍길동 책임에게 전화걸기 동작을 태깅하는 것이 소정횟수 이상 수행되면 학습된 것으로 판단할 수 있다. 이는, 후보명령어 '홍길동 책임에게 전화걸어줘'가 정식명령어로 판단될 수 있는 조건을 만족한 것으로 판단할 수 있다.The control unit 30 can determine the 'command to call the responsible person' by the command command 'Give me a call to Mr. Hong Gil-dong' and the call-to-charge action to Mr. Hong Gil- As shown in FIG. Then, it can be judged that the mobile phone has been learned when a predetermined number of times or more are received from the 'Hong Gil-dong Responsibility to the responsible person', and 'Tagging of Hong Gil-dong' This can be judged as satisfying the condition that the candidate command " Let's call Mr. Hong Gil-dong 'can be judged as a formal command.

후보 명령어의 학습이 완료된 후, 제어부(30)는 '홍길동 책임에게 전화걸어줘'가 재수신되는 경우, 태깅된 기능을 수행할지 여부를 사용자에게 선택하도록 하기 위해, 출력부(40)를 통하여 음성 또는 이미지로 출력할 수 있다. 즉, '홍길동 책임에게 전화걸어줘'의 명령어가 수신되면 홍길동 책임에게 전화걸기를 수행할지 여부를 출력하고, 사용자에게 선택하도록 할 수 있다. After completion of the learning of the candidate command, the control unit 30 instructs the user to select whether or not to perform the tagged function in the case where ' Or an image. In other words, when the command 'Call me responsible' is received, the operator can output whether or not to perform telephone call to the responsible person, and let the user select the command.

한편, 전화걸기의 대상은 변화될 수 있다. 즉, 홍길동 책임 이외에도 도 4에 도시된 바와 같이 '김삿갓 책임에게 전화걸어줘', '김현대 책임에게 전화걸어줘' 등으로 사용자는 전화걸기의 대상을 달리하여 발화할 수 있다. On the other hand, the object of dialing can be changed. That is, in addition to the responsibilities of Hong Gil-dong, as shown in Fig. 4, the user can make a call by changing the object of telephone dialing, for example, "Give me a call to Kim Sang-gap," or "Call me Kim Hyun-

따라서, 제어부(30)는 '~에게 전화걸어줘'와 같이 유사한 명령어가 반복 입력되는 경우, 통합 명령어가 입력된 것으로 판단할 수 있다. 예를들면, 도 5에 도시된 바와 같이, '홍길동 책임에게 전화걸어줘'라고 수신된 명령어에 대하여 '~에게 전화걸어줘'라는 통합 명령어가 수신된 것으로 판단할 수 있다. Accordingly, when a similar command is repeatedly input, such as 'dial to ~', the control unit 30 can determine that the integrated command is input. For example, as shown in FIG. 5, it can be determined that a combined command 'Give a phone call to ~' to a command received as 'Call me by responsible phone call' is received.

제어부(30)는 '~에게 전화걸어줘'에 대한 동작의 도메인을 판단할 수 있다. 실시예에 따라, 제어부(30)는 '~에게 전화걸어줘'에 대한 수동조작의 도메인을 블루투스의 도메인으로 판단할 수 있으며, 통합 명령어에 대한 음성신호의 받아쓰기 결과에 전화걸기 기능을 태깅할 수 있다. The control unit 30 can determine the domain of the operation for 'dial to ~'. According to the embodiment, the control unit 30 can determine the domain of the manual operation for 'give a phone call to' as a domain of Bluetooth, and can tag the dialing function in the dictation result of the voice signal for the integrated command have.

제어부(30)는 수신된 명령어 '~에게 전화걸어줘'와 화면밝기를 제어하는 기능이 태깅되면, '~에게 전화걸어줘'을 후보명령어로 판단할 수 있으며, 저장부(20)에 임시저장할 수 있다. 그리고, '~에게 전화걸어줘'가 소정횟수 이상 수신되고, 후보명령어와 화면밝기를 제어하는 기능을 태깅하는 것이 소정횟수 이상 수행되면 학습된 것으로 판단할 수 있다. 이는, 후보명령어 '~에게 전화걸어줘'가 정식명령어로 판단될 수 있는 조건을 만족한 것으로 판단할 수 있다.The control unit 30 can determine the 'call to ~' as a candidate command when the received command 'dial call' and the function to control the screen brightness are tagged, and temporarily store it in the storage unit 20 . It can be determined that the learning has been performed when a predetermined number of times or more of "call to" is received and the function of controlling the candidate command and the screen brightness is performed a predetermined number of times or more. This can be judged as satisfying the condition that the candidate command 'call to ~' can be determined as a formal command.

후보 명령어의 학습이 완료된 후, 제어부(30)는 '~에게 전화걸어줘'가 재수신되는 경우, 태깅된 기능을 수행할지 여부를 사용자에게 선택하도록 하기 위해, 출력부(40)를 통하여 메세지를 음성 또는 이미지로 출력할 수 있다. 즉, '~에게 전화걸어줘'의 명령어가 수신되면 화면밝기를 제어하는 셋팅화면으로 전환할지 여부를 출력하고, 사용자에게 선택하도록 할 수 있다. After completion of the learning of the candidate command, the control unit 30 transmits a message through the output unit 40 to the user in order to allow the user to select whether or not to perform the tagged function, Audio or image. That is, when the command 'call to ~' is received, it is outputted whether or not to switch to the setting screen for controlling the brightness of the screen, so that the user can select the screen.

도 7은 본 발명의 수동조작을 이용한 음성인식 방법을 나타낸 순서도이다.7 is a flowchart showing a speech recognition method using manual operation of the present invention.

먼저, 사용자가 발화한 명령어를 수신할 수 있다(S100). S100에서 사용자가 발화한 명령어는 기 저장된 수동 명령어에 해당하고 학습되어진 고정의 음성인식 명령어가 아닐 수 있다. 즉, 사용자가 직관적으로 의도하고자 하는 자연어를 포함할 수 있다. 실시예에 따르면, '아이폰 뮤직', '화면밝기 좀 올려줘', '홍길동 책임에게 전화걸어줘'등을 포함할 수 있다. First, the user can receive a command that has been uttered (S100). The command issued by the user in S100 corresponds to the previously stored manual command and may not be the learned fixed speech recognition command. That is, it may include a natural language that a user intends to intend to intend. According to the embodiment, it may include 'iPhone music', 'raise the brightness of the screen', 'give a phone call to Hong Gil-dong' and the like.

명령어에 대한 음성신호를 서버로 전송한다(S110). 그리고, 서버는 수신된 음성신호에 대하여 받아쓰기(DICTATION)의 결과를 차량으로 전송할 수 있다(S120).The voice signal for the command is transmitted to the server (S110). Then, the server can transmit the result of the DICTATION to the vehicle with respect to the received voice signal (S120).

그리고, 수신된 명령어가 인식되었는지 판단한다(S130). S130는 수신된 명령어가 기 설정된 명령어인지 여부를 판단한다. 수신된 명령어가 기 설정된 명령어인 경우(YES) 음성이 인식된 것으로 판단하고, 인식결과에 따라 기능을 동작시킬 수 있다(S140). 수신된 명령어가 기 설정된 명령어가 아닌 경우(NO), 음성은 오인식된 것으로 판단하고, 음성인식모드를 종료할 수 있다. Then, it is determined whether the received command word is recognized (S130). S130 determines whether the received command is a predetermined command. If the received command is a predetermined command (YES), it is determined that the voice is recognized, and the function is operated according to the recognition result (S140). If the received command is not a predetermined command (NO), it is determined that the voice is erroneous and the voice recognition mode can be terminated.

음성인식모드가 종료된 후, 사용자가 의도하고자 하는 기능을 직접 조작하도록 수동입력모드로 전환하고, 소정시간 이내에 사용자가 수동으로 조작하는 기능을 인식할 수 있다(S150). After the voice recognition mode is terminated, the user manually switches the mode to the manual input mode so that the user can manually operate the desired function (S150).

S150은 일 실시예에 따르면, S100에서 수신된 명령어 '아이폰 뮤직'에 대한 수동조작은 아이폰의 음악을 재생하기 위하여 블루투스 오디오로 전환하는 동작을 포함할 수 있으며, 수동조작을 통하여 블루투스 오디오 모드로 전환된 것을 인지할 수 있다.In operation S150, the manual operation for the command 'iPhone music' received in S100 may include switching to Bluetooth audio to reproduce the music of the iPhone, and switching to the Bluetooth audio mode through manual operation Can be recognized.

S150은 다른 실시예에 따르면, S100에서 수신된 명령어 '화면밝기 좀 올려줘'에 대한 수동조작은 화면밝기를 밝게하기 위한 소정의 동작, 예를들면 차량 내 구비된 스크롤 휠, 버튼 하우징을 조작하거나 터치스크린의 화면을 터치하는 동작을 포함할 수 있으며, 수동조작을 통하여 화면밝기가 제어되는 것을 인지할 수 있다. In step S150, according to another embodiment, the manual operation on the command 'Raise the screen brightness' received in step S100 may be a predetermined operation for brightening the screen brightness, for example, a scroll wheel provided in the vehicle, May include an operation of touching the screen of the screen, and it may be recognized that the screen brightness is controlled through manual operation.

S150은 또 다른 실시예에 따르면, S100에서 입력된 명령어 '홍길동 책임에게 전화걸어줘'에 대한 수동조작은 홍길동 책임에게 전화를 걸기위한 소정의 동작, 예를들면 차량 내 구비된 스크롤 휠, 버튼 하우징을 조작하거나 터치스크린의 화면을 터치하는 동작을 포함을 포함할 수 있으며, 수동조작을 통하여 전화걸기로 전환되는 것을 인지할 수 있다.In step S150, according to another embodiment, the manual operation of 'command to call Mr. Hong Gil-dong' inputted in S100 is to perform a predetermined operation for making a call to Mr. Hong's office, for example, a scroll wheel, Or touching the screen of the touch screen, and recognizing that the user is switched to dialing through manual operation.

수동조작에 따른 동작이 포함되는 도메인이 존재하는지 여부를 판단한다(S160). 여기서 도메인은 사용자가 발화한 명령어가 사용될 수 있는 상황을 분류하는 개념으로 이해될 수 있다. 예를들면 도메인은 미디어, 내비게이션, 블루투스 등으로 분류될 수 있다. It is determined whether a domain including an operation according to manual operation exists (S160). Here, the domain can be understood as a concept of classifying a situation in which a command uttered by the user can be used. For example, a domain can be classified into media, navigation, Bluetooth, and the like.

S160에서 수동조작에 대한 도메인이 존재하는 것으로 판단되면, 해당 수동조작에 대응하는 명령어를 후보명령어로 판단하고 임시저장한다(S170). S170은 판단된 도메인 내에서 S100에서 수신된 음성신호의 받아쓰기 결과에 수동조작에 따른 기능을 태깅(tagging)하도록 하고, 태깅되면 수신된 명령어를 후보명령어로 판단할 수 있다. 여기서 태깅은 받아쓰기 결과에 키 워드 처리를 해 주는 것으로 이해되는 것이 바람직하고, 수신된 음성신호의 받아쓰기 결과와 수동조작에 따른 기능이 연결되도록 하는 것을 의미할 수 있다. If it is determined in S160 that the domain for the manual operation exists, the instruction corresponding to the manual operation is determined as a candidate instruction word and temporarily stored (S170). In step S170, the function corresponding to the manual operation is tagged in the result of dictation of the voice signal received in step S100 in the determined domain, and when the tagging is performed, the received command can be determined as the candidate command. Here, tagging is preferably understood to perform key word processing on a dictation result, and it may mean that a dictation result of a received speech signal and a function according to a manual operation are linked.

S170은 일 실시예에 따르면 S150에서 수행된 블루투스 오디오로 전환하는 동작에 대하여 S160에서 도메인이 존재하는 것으로 판단(YES)되면, 판단된 도메인 내에서 '아이폰 뮤직'의 명령어와 블루투스 오디오를 전환하는 기능을 태깅하고, '아이폰 뮤직'을 후보명령어로 판단하고 임시저장할 수 있다. In operation S170, if it is determined in operation S160 that the domain exists in operation S150, the operation of switching the Bluetooth audio from the command " iPhone Music " Tag ', and judge' iPhone music 'as a candidate command and temporarily store it.

S170은 다른 실시예에 따르면 S150에서 수행된 전화걸기 동작에 대하여 도메인이 존재하는 것으로 판단(YES)되면, 판단된 도메인 내에서 '홍길동 책임에게 전화걸어줘'의 명령어와 전화걸기 기능을 태깅하고, '홍길동 책임에게 전화걸어줘'를 후보명령어로 판단하고 임시저장할 수 있다. In step S170, if it is determined that the domain exists in the dialing operation performed in step S150 according to another embodiment (YES in step S170), the controller 150 may tag the command " It is possible to judge 'Call Gimhyeon-dong' as a candidate command and save it temporarily.

한편, S160에서 수동조작에 대한 도메인이 존재하지 않는 것으로 판단되면, 신규 도메인을 생성한다(S180). On the other hand, if it is determined in S160 that the domain for the manual operation does not exist, a new domain is created (S180).

S180은 실시예에 따르면, S160에서 수행된 화면밝기를 제어하는 동작에 대하여 도메인이 존재하지 않는 것으로 판단(NO)되면, '셋업' 도메인을 신규로 생성할 수 할 수 있다. 그리고, 신규 도메인에서 '화면밝기 좀 올려줘'의 명령어와 화면밝기를 제어하는 기능을 태깅하고 '화면밝기 좀 올려줘'를 후보명령어로 판단하고 임시저장할 수 있다.In operation S180, if it is determined that the domain does not exist in the operation of controlling the screen brightness performed in S160 (NO), a 'setup' domain may be newly generated. In the new domain, it is possible to tag the function of 'increase the brightness of the screen' and the function of controlling the brightness of the screen and to judge the 'increase brightness of the screen' as a candidate command and temporarily store it.

후보명령어의 학습 여부를 판단한다(S190). S190은 후보명령어가 소정횟수 이상 수신되고, 후보명령어와 수동조작을 태깅하는 것이 소정횟수 이상 수행되면, 후보명령어가 학습된 것으로 판단할 수 있다. 이는 후보명령어가 정식명령어로 판단될 수 있는 조건을 만족하는지 여부를 판단하는 것으로 이해될 수 있다.It is determined whether the candidate command is learned (S190). S190 may determine that the candidate instruction is learned if the candidate instruction is received more than a predetermined number of times and tagging of the candidate instruction and the manual operation is performed a predetermined number of times or more. This can be understood as judging whether the candidate instruction satisfies a condition that can be judged as a formal instruction.

후보명령어가 학습된 것으로 판단(YES)되면, 후보명령어가 재수신 되었을때, 태깅된 기능을 수행할지 여부를 사용자에게 선택하도록 한다(S200). 이를 위하여 S200에서 출력부(40)를 통하여 메세지를 음성 또는 이미지로 출력할 수 있다. 한편, 후보명령어가 학습되지 않은 것으로 판단(No)되면, 후보명령어를 정식명령어로 판단 불가한 것으로 판단한다(S220).If it is determined that the candidate command is learned (YES), the user is prompted to select whether to perform the tagged function when the candidate command is received again (S200). For this purpose, a message may be outputted as speech or image through the output unit 40 in S200. On the other hand, if it is determined that the candidate instruction is not learned (No), it is determined that the candidate instruction can not be judged as a formal instruction (S220).

실시예에 따르면, '아이폰 뮤직'의 명령어가 수신되면 블루투스 오디오로 전환할지 여부를 출력하고 사용자에게 선택하도록 할 수 있다. 또한, '화면밝기 좀 올려줘'의 명령어가 수신되면 화면밝기를 제어하는 셋팅화면으로 전환할지 여부를 출력하고, 사용자에게 선택하도록 할 수 있다.According to the embodiment, when the 'iPhone music' command is received, it is displayed whether or not to switch to Bluetooth audio, and the user can select the switch. In addition, when the command 'Raise the screen brightness' is received, it is outputted to the setting screen for controlling the brightness of the screen, and the user can select the screen.

사용자가 해당기능을 선택하면(YES), 후보명령어를 정식명령어로 판단하고, 저장한다(S210). 한편, 사용자가 해당기능을 선택하지 않으면(NO), 후보명령어를 정식명령어로 저장 불가한 것으로 판단한다(S220). S220에서 후보명령어는 삭제될 수 있다. When the user selects the corresponding function (YES), the candidate command is determined as a formal command and stored (S210). On the other hand, if the user does not select the corresponding function (NO), it is determined that the candidate command can not be stored with the formal command (S220). The candidate instruction may be deleted in S220.

이상의 설명은 본 발명의 기술 사상을 예시적으로 설명한 것에 불과한 것으로서, 본 발명이 속하는 기술 분야에서 통상의 지식을 가진 자라면 본 발명의 본질적인 특성에서 벗어나지 않는 범위에서 다양한 수정 및 변형이 가능할 것이다. The foregoing description is merely illustrative of the technical idea of the present invention, and various changes and modifications may be made by those skilled in the art without departing from the essential characteristics of the present invention.

따라서, 본 발명에 개시된 실시예들은 본 발명의 기술 사상을 한정하기 위한 것이 아니라 설명하기 위한 것이고, 이러한 실시예에 의하여 본 발명의 기술 사상의 범위가 한정되는 것은 아니다. 본 발명의 보호 범위는 아래의 청구범위에 의하여 해석되어야 하며, 그와 동등한 범위 내에 있는 모든 기술 사상은 본 발명의 권리범위에 포함되는 것으로 해석되어야 할 것이다.Therefore, the embodiments disclosed in the present invention are intended to illustrate rather than limit the scope of the present invention, and the scope of the technical idea of the present invention is not limited by these embodiments. The scope of protection of the present invention should be construed according to the following claims, and all technical ideas within the scope of equivalents should be construed as falling within the scope of the present invention.

입력부 10
저장부 20
제어부 30
출력부 40Input section 10
The storage unit 20
The control unit 30
Output section 40

Claims

An input unit for receiving a command from a user; And
Wherein the control unit recognizes the manual operation of the user corresponding to the instruction if the instruction received by the input unit is misrecognized, judges the instruction as a candidate instruction by tagging the received instruction and the manual operation, And a controller for determining the command as a formal command.

The method according to claim 1,
The input unit
And a natural language is received.

The method according to claim 1,
The control unit
Wherein the controller determines that the received command is misrecognized if the received command is not a formal command that has been stored and learned.

The method according to claim 1,
The control unit
Wherein the user recognizes the manual operation performed to execute the received instruction word within a predetermined time after the user is mistaken for the instruction word.

The method according to claim 1,
The control unit
Wherein the controller identifies the domain of the manual operation, and tagging the received instruction and the manual operation in the domain.

The method according to claim 1,
The control unit
And if the domain of the manual operation is not determined, a new domain is created.

The method according to claim 1,
The control unit
Wherein when the candidate instruction is re-received a predetermined number of times and the operation of tagging the re-received candidate instruction and the manual operation is repeated a predetermined number of times, it is determined that the candidate instruction is learned. .

The method according to claim 1,
The control unit
Wherein when the candidate instruction is learned, a message is output to the driver to select whether to store the candidate instruction in a formal instruction word, and when the driver selects the candidate instruction, the candidate instruction is determined to be a formal instruction word .

Receiving a command from a user;
Recognizing a manual operation corresponding to the instruction if the received instruction is misrecognized;
Determining the received command as a candidate command;
Learning the candidate instruction; And
And if the candidate instruction is learned, determining the candidate instruction as a formal instruction word.

The method of claim 9,
The step of receiving a command from a user
And a natural language is received.

The method of claim 9,
If the received instruction is misrecognized, recognizing a manual operation corresponding to the instruction,
And if the received instruction word is not a formal instruction that has been stored and learned, it is determined that the received instruction word is erroneous.

The method of claim 9,
If the received command is misrecognized, recognizing the manual operation corresponding to the command
Wherein the user recognizes the manual operation performed in order to execute the received instruction word within a predetermined time after the user recognizes the instruction word.

The method of claim 9,
If the received instruction is misidentified, recognizing a manual operation corresponding to the instruction,
Further comprising the step of determining a domain of the manual operation.

14. The method of claim 13,
Wherein the step of determining the received instruction as a candidate instruction comprises:
And if the domain of the manual operation is determined, tagging the received instruction and the manual operation in the domain.

14. The method of claim 13,
Wherein the step of determining the received instruction as a candidate instruction comprises:
If the domain of the manual operation is not determined, a new domain is created and the received command and the manual operation are tagged in the new domain.

The method of claim 9,
The step of learning the candidate instruction
Wherein the candidate command is re-received a predetermined number of times and the candidate command and the manual operation are tagged repeatedly a predetermined number of times.

The method of claim 9,
Wherein if the candidate instruction is learned, the step of determining the candidate instruction as a formal instruction
And outputting a message to the driver to select whether to store the candidate command as a formal command, and when the driver selects the candidate command, the command is determined as a formal command.