KR102582360B1

KR102582360B1 - Apparatus, device and method for gernerating customised language model

Info

Publication number: KR102582360B1
Application number: KR1020180015769A
Authority: KR
Inventors: 이성재; 나민수; 김대현; 유승우
Original assignee: 주식회사 케이티
Priority date: 2018-02-08
Filing date: 2018-02-08
Publication date: 2023-09-25
Also published as: KR20190096159A

Abstract

사용자 맞춤형 언어 모델을 생성하는 장치는 사용자 단말로부터 신규 어휘를 입력받는 신규 어휘 입력부; 상기 입력된 신규 어휘에 대한 검증을 수행하는 신규 어휘 검증부; 상기 검증이 완료된 신규 어휘로부터 언어 모델의 구성 요소를 학습하여 상기 신규 어휘에 대한 언어 모델을 구축하는 신규 어휘 언어 모델 구축부; 및 상기 구축된 언어 모델을 상기 사용자 단말로 제공하는 언어 모델 제공부를 포함할 수 있다. An apparatus for generating a user-customized language model includes a new vocabulary input unit that receives new vocabulary from a user terminal; a new vocabulary verification unit that verifies the input new vocabulary; a new vocabulary language model construction unit that constructs a language model for the new vocabulary by learning components of the language model from the verified new vocabulary; and a language model providing unit that provides the constructed language model to the user terminal.

Description

Apparatus, terminal, and method for generating a user customized language model {APPARATUS, DEVICE AND METHOD FOR GERNERATING CUSTOMISED LANGUAGE MODEL}

본 발명은 사용자 맞춤형 언어 모델을 생성하는 장치, 단말 및 방법에 관한 것이다. The present invention relates to an apparatus, terminal, and method for generating a user-customized language model.

최근 들어, 음성 인식 기술의 발전으로 다양한 분야(예컨대, 홈 IoT, 의료, 금융, 전화망 콜센터 등)에서 음성 인식 기술을 자사의 시스템 혹은 장치에 적용하려는 시도가 증가하고 있다. Recently, with the development of voice recognition technology, attempts to apply voice recognition technology to their systems or devices are increasing in various fields (e.g., home IoT, medical, finance, telephone network call centers, etc.).

종래의 음성 인식 플랫폼은 획일화된 음향 모델 및 언어 모델을 통해 음성 서비스를 제공하고 있으므로 금융, 의료와 같은 전문적인 음성인식 분야에 음성 인식 서비스를 적용하기 어렵다. Conventional voice recognition platforms provide voice services through standardized acoustic models and language models, so it is difficult to apply voice recognition services to professional voice recognition fields such as finance and medicine.

좋은 성능을 갖는 음성 인식 플랫폼이라 하더라도, 새로운 어휘에 대한 학습모델이 부재하다면 좋은 음성인식률을 기대할 수 없다. 예를 들어 의료분야의 특정 질병 또는 처방약 이름과 같은 전문어휘에 대한 언어모델이 존재하지 않는다면, 음성 인식 플랫폼은 특정 질병(예를 들면, 인티푸라민) 또는 처방약 이름과 가장 유사한 단어를 검색하여 인식 결과(예를 들면, 안티 프라자)로 제공하기 때문에 오인식을 유발할 수 있다. Even if it is a speech recognition platform with good performance, a good speech recognition rate cannot be expected if there is no learning model for new vocabulary. For example, if there is no language model for specialized vocabulary, such as the name of a specific disease or prescription drug in the medical field, the speech recognition platform searches for and recognizes words that are most similar to the name of the specific disease (e.g., intipuramine) or prescription drug. Because it is provided as a result (for example, anti-plaza), it may cause misperception.

한국등록특허공보 제10-1626382호 (2011.06.10. 공개)Korean Patent Publication No. 10-1626382 (published on June 10, 2011)

본 발명은 전술한 종래 기술의 문제점을 해결하기 위한 것으로서, 사용자 단말로부터 입력받은 신규 어휘에 대한 검증을 수행하고, 신규 어휘에 대한 언어 모델을 구축하고자 한다. 또한, 본 발명은 구축된 신규 어휘에 대한 언어 모델을 이용한 음성 인식 서비스를 사용자 단말에게 제공하고자 한다. 다만, 본 실시예가 이루고자 하는 기술적 과제는 상기된 바와 같은 기술적 과제들로 한정되지 않으며, 또 다른 기술적 과제들이 존재할 수 있다. The present invention is intended to solve the problems of the prior art described above, and seeks to verify new vocabulary input from a user terminal and build a language model for the new vocabulary. In addition, the present invention seeks to provide a voice recognition service using a language model for a new vocabulary constructed to a user terminal. However, the technical challenges that this embodiment aims to achieve are not limited to the technical challenges described above, and other technical challenges may exist.

상술한 기술적 과제를 달성하기 위한 기술적 수단으로서, 본 발명의 제 1 측면에 따른 사용자 맞춤형 언어 모델을 생성하는 장치는 사용자 단말로부터 신규 어휘를 입력받는 신규 어휘 입력부; 상기 입력된 신규 어휘에 대한 검증을 수행하는 신규 어휘 검증부; 상기 검증이 완료된 신규 어휘로부터 언어 모델의 구성 요소를 학습하여 상기 신규 어휘에 대한 언어 모델을 구축하는 신규 어휘 언어 모델 구축부; 및 상기 구축된 언어 모델을 상기 사용자 단말로 제공하는 언어 모델 제공부를 포함할 수 있다. As a technical means for achieving the above-described technical problem, an apparatus for generating a user-customized language model according to the first aspect of the present invention includes a new vocabulary input unit that receives a new vocabulary from a user terminal; a new vocabulary verification unit that verifies the input new vocabulary; a new vocabulary language model construction unit that constructs a language model for the new vocabulary by learning components of the language model from the verified new vocabulary; and a language model providing unit that provides the constructed language model to the user terminal.

본 발명의 제 2 측면에 따른 사용자 맞춤형 언어 모델을 제공하는 사용자 단말은 사용자로부터 입력받은 신규 어휘를 언어 모델 생성 장치로 전송하는 전송부; 및 상기 신규 어휘에 대한 언어 모델을 상기 언어 모델 생성 장치로부터 수신하는 언어 모델 수신부를 포함하고, 상기 언어 모델 생성 장치에 의해 상기 신규 어휘에 대한 검증이 수행되고, 검증이 완료된 신규 어휘로부터 학습된 구성 요소에 기초하여 상기 신규 어휘에 대한 언어 모델이 구축될 수 있다. A user terminal that provides a user-customized language model according to the second aspect of the present invention includes a transmission unit that transmits a new vocabulary input from the user to a language model generating device; and a language model receiver that receives a language model for the new vocabulary from the language model generating device, wherein verification of the new vocabulary is performed by the language model generating device, and a configuration learned from the verified new vocabulary. Based on the elements, a language model for the new vocabulary can be built.

본 발명의 제 3 측면에 따른 사용자 맞춤형 언어 모델을 생성하는 방법은 사용자 단말로부터 신규 어휘를 입력받는 단계; 상기 입력된 신규 어휘에 대한 검증을 수행하는 단계; 상기 검증이 완료된 신규 어휘로부터 언어 모델의 구성 요소를 학습하여 상기 신규 어휘에 대한 언어 모델을 구축하는 단계; 및 상기 구축된 언어 모델을 상기 사용자 단말로 제공하는 단계를 포함할 수 있다.A method of generating a user-customized language model according to a third aspect of the present invention includes receiving a new vocabulary input from a user terminal; Verifying the input new vocabulary; constructing a language model for the new vocabulary by learning components of a language model from the verified new vocabulary; And it may include providing the constructed language model to the user terminal.

상술한 과제 해결 수단은 단지 예시적인 것으로서, 본 발명을 제한하려는 의도로 해석되지 않아야 한다. 상술한 예시적인 실시예 외에도, 도면 및 발명의 상세한 설명에 기재된 추가적인 실시예가 존재할 수 있다.The above-described means for solving the problem are merely illustrative and should not be construed as limiting the present invention. In addition to the exemplary embodiments described above, there may be additional embodiments described in the drawings and detailed description of the invention.

전술한 본 발명의 과제 해결 수단 중 어느 하나에 의하면, 본 발명은 사용자 단말로부터 입력받은 신규 어휘에 대한 검증을 수행하고, 신규 어휘에 대한 언어 모델을 구축할 수 있다. 또한, 본 발명은 구축된 신규 어휘에 대한 언어 모델을 이용한 음성 인식 서비스를 사용자 단말에게 제공할 수 있다. According to one of the means for solving the problems of the present invention described above, the present invention can perform verification of a new vocabulary input from a user terminal and build a language model for the new vocabulary. Additionally, the present invention can provide a voice recognition service using a language model for a new vocabulary constructed to a user terminal.

또한, 본 발명은 사용자가 음성인식에 대한 전문적인 지식을 갖고 있지 않더라도 신규 어휘에 대한 언어 모델을 구축할 수 있다. 또한, 본 발명은 사용자가 신규 어휘를 등록하는 것만으로 해당 신규 어휘가 반영된 독자적인 음성인식 엔진을 사용자에게 제공할 수 있다. Additionally, the present invention can build a language model for new vocabulary even if the user does not have expert knowledge of speech recognition. Additionally, the present invention can provide the user with a unique voice recognition engine that reflects the new vocabulary simply by the user registering the new vocabulary.

도 1은 본 발명의 일 실시예에 따른, 사용자 맞춤형 언어 모델 생성 시스템의 구성도이다.
도 2는 본 발명의 일 실시예에 따른, 도 1에 도시된 언어 모델 생성 장치의 블록도이다.
도 3a 내지 3c는 본 발명의 일 실시예에 따른, 사용자 단말로부터 입력받는 신규 어휘를 설명하기 위한 도면이다.
도 4는 본 발명의 일 실시예에 따른, 신규 어휘에 대한 검증을 수행하는 방법을 설명하기 위한 도면이다.
도 5a 내지 5b는 본 발명의 일 실시예에 따른, 문장 리스트의 생성 및 신규 어휘의 발음열을 생성하는 방법을 설명하기 위한 도면이다.
도 6은 본 발명의 일 실시예에 따른, 언어 모델 생성 장치에서 사용자 맞춤형 언어 모델을 생성하는 방법을 나타낸 흐름도이다.
도 7은 본 발명의 일 실시예에 따른, 도 1에 도시된 사용자 단말의 블록도이다.
도 8은 본 발명의 일 실시예에 따른, 사용자 단말에서 신규 어휘를 입력하는 사용자 인터페이스의 예시 도면이다.
도 9는 본 발명의 일 실시예에 따른, 사용자 단말에서 사용자 맞춤형 언어 모델을 제공하는 방법을 나타낸 흐름도이다. Figure 1 is a configuration diagram of a user-customized language model generation system according to an embodiment of the present invention.
FIG. 2 is a block diagram of the language model generating device shown in FIG. 1 according to an embodiment of the present invention.
3A to 3C are diagrams for explaining a new vocabulary input from a user terminal according to an embodiment of the present invention.
Figure 4 is a diagram illustrating a method of performing verification on a new vocabulary according to an embodiment of the present invention.
Figures 5A and 5B are diagrams for explaining a method of generating a sentence list and a pronunciation sequence of a new vocabulary, according to an embodiment of the present invention.
Figure 6 is a flowchart showing a method of generating a user-customized language model in a language model generating device according to an embodiment of the present invention.
Figure 7 is a block diagram of the user terminal shown in Figure 1, according to an embodiment of the present invention.
Figure 8 is an example diagram of a user interface for entering a new vocabulary in a user terminal, according to an embodiment of the present invention.
Figure 9 is a flowchart showing a method of providing a user-customized language model in a user terminal, according to an embodiment of the present invention.

아래에서는 첨부한 도면을 참조하여 본 발명이 속하는 기술 분야에서 통상의 지식을 가진 자가 용이하게 실시할 수 있도록 본 발명의 실시예를 상세히 설명한다. 그러나 본 발명은 여러 가지 상이한 형태로 구현될 수 있으며 여기에서 설명하는 실시예에 한정되지 않는다. 그리고 도면에서 본 발명을 명확하게 설명하기 위해서 설명과 관계없는 부분은 생략하였으며, 명세서 전체를 통하여 유사한 부분에 대해서는 유사한 도면 부호를 붙였다. Below, with reference to the attached drawings, embodiments of the present invention will be described in detail so that those skilled in the art can easily implement the present invention. However, the present invention may be implemented in many different forms and is not limited to the embodiments described herein. In order to clearly explain the present invention in the drawings, parts that are not related to the description are omitted, and similar parts are given similar reference numerals throughout the specification.

명세서 전체에서, 어떤 부분이 다른 부분과 "연결"되어 있다고 할 때, 이는 "직접적으로 연결"되어 있는 경우뿐 아니라, 그 중간에 다른 소자를 사이에 두고 "전기적으로 연결"되어 있는 경우도 포함한다. 또한 어떤 부분이 어떤 구성요소를 "포함"한다고 할 때, 이는 특별히 반대되는 기재가 없는 한 다른 구성요소를 제외하는 것이 아니라 다른 구성요소를 더 포함할 수 있는 것을 의미한다. Throughout the specification, when a part is said to be "connected" to another part, this includes not only the case where it is "directly connected," but also the case where it is "electrically connected" with another element in between. . Additionally, when a part "includes" a certain component, this means that it may further include other components rather than excluding other components, unless specifically stated to the contrary.

본 명세서에 있어서 '부(部)'란, 하드웨어에 의해 실현되는 유닛(unit), 소프트웨어에 의해 실현되는 유닛, 양방을 이용하여 실현되는 유닛을 포함한다. 또한, 1 개의 유닛이 2 개 이상의 하드웨어를 이용하여 실현되어도 되고, 2 개 이상의 유닛이 1 개의 하드웨어에 의해 실현되어도 된다. In this specification, 'part' includes a unit realized by hardware, a unit realized by software, and a unit realized using both. Additionally, one unit may be realized using two or more pieces of hardware, and two or more units may be realized using one piece of hardware.

본 명세서에 있어서 단말 또는 디바이스가 수행하는 것으로 기술된 동작이나 기능 중 일부는 해당 단말 또는 디바이스와 연결된 서버에서 대신 수행될 수도 있다. 이와 마찬가지로, 서버가 수행하는 것으로 기술된 동작이나 기능 중 일부도 해당 서버와 연결된 단말 또는 디바이스에서 수행될 수도 있다. In this specification, some of the operations or functions described as being performed by a terminal or device may instead be performed on a server connected to the terminal or device. Likewise, some of the operations or functions described as being performed by the server may also be performed on a terminal or device connected to the server.

이하, 첨부된 구성도 또는 처리 흐름도를 참고하여, 본 발명의 실시를 위한 구체적인 내용을 설명하도록 한다. Hereinafter, specific details for implementing the present invention will be described with reference to the attached configuration diagram or processing flow diagram.

도 1은 본 발명의 일 실시예에 따른, 사용자 맞춤형 언어 모델 생성 시스템의 구성도이다. Figure 1 is a configuration diagram of a user-customized language model generation system according to an embodiment of the present invention.

도 1을 참조하면, 사용자 맞춤형 언어 모델 생성 시스템은 사용자 단말(100) 및 언어 모델 생성 장치(110)를 포함할 수 있다. 다만, 이러한 도 1의 사용자 맞춤형 언어 모델 생성 시스템은 본 발명의 일 실시예에 불과하므로 도 1을 통해 본 발명이 한정 해석되는 것은 아니며, 본 발명의 다양한 실시예들에 따라 도 1과 다르게 구성될 수도 있다. Referring to FIG. 1, a user-customized language model generation system may include a user terminal 100 and a language model generation device 110. However, since this user-customized language model generation system of FIG. 1 is only one embodiment of the present invention, the present invention is not limited to FIG. 1, and may be configured differently from FIG. 1 according to various embodiments of the present invention. It may be possible.

일반적으로, 도 1의 사용자 맞춤형 언어 모델 생성 시스템의 각 구성요소들은 네트워크(미도시)를 통해 연결된다. 네트워크는 단말들 및 서버들과 같은 각각의 노드 상호 간에 정보 교환이 가능한 연결 구조를 의미하는 것으로, 근거리 통신망(LAN: Local Area Network), 광역 통신망(WAN: Wide Area Network), 인터넷 (WWW: World Wide Web), 유무선 데이터 통신망, 전화망, 유무선 텔레비전 통신망 등을 포함한다. 무선 데이터 통신망의 일례에는 3G, 4G, 5G, 3GPP(3rd Generation Partnership Project), LTE(Long Term Evolution), WIMAX(World Interoperability for Microwave Access), 와이파이(Wi-Fi), 블루투스 통신, 적외선 통신, 초음파 통신, 가시광 통신(VLC: Visible Light Communication), 라이파이(LiFi) 등이 포함되나 이에 한정되지는 않는다. In general, each component of the user-customized language model generation system of FIG. 1 is connected through a network (not shown). Network refers to a connection structure that allows information exchange between nodes such as terminals and servers, including Local Area Network (LAN), Wide Area Network (WAN), and World Wide Area Network (WWW). Wide Web), wired and wireless data communication networks, telephone networks, wired and wireless television communication networks, etc. Examples of wireless data communication networks include 3G, 4G, 5G, 3GPP (3rd Generation Partnership Project), LTE (Long Term Evolution), WIMAX (World Interoperability for Microwave Access), Wi-Fi, Bluetooth communication, infrared communication, and ultrasound. This includes, but is not limited to, communication, Visible Light Communication (VLC), LiFi, etc.

사용자 단말(100)은 사용자로부터 입력받은 신규 어휘를 언어 모델 생성 장치(110)에게 전송할 수 있다. The user terminal 100 may transmit a new vocabulary input from the user to the language model generating device 110.

언어 모델 생성 장치(110)는 사용자 단말(100)로부터 수신된 신규 어휘에 대한 검증을 수행하고, 검증이 완료된 신규 어휘로부터 신규 어휘에 대한 언어 모델의 구성 요소를 학습할 수 있다. The language model generating device 110 may perform verification of the new vocabulary received from the user terminal 100 and learn components of a language model for the new vocabulary from the new vocabulary for which verification has been completed.

언어 모델 생성 장치(110)는 학습된 신규 어휘에 대한 언어 모델의 구성 요소에 기초하여 신규어휘에 대한 언어 모델을 구축할 수 있다. The language model generating device 110 may build a language model for the new vocabulary based on the components of the language model for the learned new vocabulary.

이후, 사용자 단말(100)로부터 발화 음성이 수신된 경우, 언어 모델 생성 장치(110)는 구축된 신규 어휘의 언어 모델을 이용하여 해당 발화 음성에 대한 음성 인식 서비스를 사용자 단말(100)에게 제공할 수 있다. Thereafter, when a spoken voice is received from the user terminal 100, the language model generating device 110 provides a voice recognition service for the spoken voice to the user terminal 100 using the language model of the new vocabulary constructed. You can.

이하에서는 도 1의 사용자 맞춤형 언어 모델 생성 시스템의 각 구성요소의 동작에 대해 보다 구체적으로 설명한다. Hereinafter, the operation of each component of the user-customized language model generation system of FIG. 1 will be described in more detail.

도 2는 본 발명의 일 실시예에 따른, 도 1에 도시된 언어 모델 생성 장치(110)의 블록도이다. FIG. 2 is a block diagram of the language model generating device 110 shown in FIG. 1 according to an embodiment of the present invention.

도 2를 참조하면, 언어 모델 생성 장치(110)는 신규 어휘 입력부(200), 신규 어휘 검증부(210), 신규 어휘 언어 모델 구축부(220), 음성 인식 수행부(230), 및 오류 메시지 전송부(240)를 포함할 수 있다. 다만, 도 2에 도시된 언어 모델 생성 장치(110)는 본 발명의 하나의 구현 예에 불과하며, 도 2에 도시된 구성요소들을 기초로 하여 여러 가지 변형이 가능하다. Referring to FIG. 2, the language model generating device 110 includes a new vocabulary input unit 200, a new vocabulary verification unit 210, a new vocabulary language model construction unit 220, a voice recognition performance unit 230, and an error message. It may include a transmission unit 240. However, the language model generating device 110 shown in FIG. 2 is only one implementation example of the present invention, and various modifications are possible based on the components shown in FIG. 2.

이하에서는 도 2와 함께 3a 내지 5b를 참조하여 설명하기로 한다. Hereinafter, it will be described with reference to 3a to 5b along with FIG. 2.

신규 어휘 입력부(200)는 사용자 단말(100)로부터 신규 어휘를 입력받을 수 있다. 예를 들면, 신규 어휘 입력부(200)는 사용자 단말(100)에 설치된 음성 인식 개발 어플리케이션 또는 사용자 단말(100)이 접속한 웹 기반의 음성 인식 개발 유저 인터페이스를 통해 사용자 단말(100)이 입력한 신규 어휘를 사용자 단말(100)로부터 수신할 수 있다. The new vocabulary input unit 200 can receive new vocabulary input from the user terminal 100. For example, the new vocabulary input unit 200 is a voice recognition development application installed on the user terminal 100 or a new vocabulary input by the user terminal 100 through a web-based voice recognition development user interface accessed by the user terminal 100. Vocabulary can be received from the user terminal 100.

구체적으로, 신규 어휘 입력부(200)는 사용자 단말(100)로부터 신규 어휘에 대한 인식 문장 리스트, 클래스 리스트 및 수동 발음 사전 중 적어도 하나를 입력받을 수 있다. Specifically, the new vocabulary input unit 200 may receive at least one of a recognition sentence list, a class list, and a manual pronunciation dictionary for the new vocabulary from the user terminal 100.

도 3a를 참조하면, 신규 어휘 입력부(200)는 단어 또는 복수의 단어를 포함하는 클래스의 조합으로 이루어진 신규 어휘에 대한 인식 문장 리스트를 사용자 단말(100)로부터 입력받을 수 있다. 여기서, 클래스는 복수의 단어를 대표하는 집단명이다. 예를 들면, '서울, 경기, 강원 등'을 포괄하는 클래스는 <지역명>이 될 수 있고, '개천절, 삼일절, 광복절 등'을 포괄하는 클래스는 <공휴일>이 될 수 있다. Referring to FIG. 3A, the new vocabulary input unit 200 may receive a recognition sentence list for a new vocabulary consisting of a word or a combination of classes including a plurality of words from the user terminal 100. Here, the class is a group name representing multiple words. For example, a class encompassing ‘Seoul, Gyeonggi, Gangwon, etc.’ could be <region name>, and a class encompassing ‘National Foundation Day, Independence Day, Liberation Day, etc.’ could be <public holiday>.

신규 어휘 입력부(200)는 사용자에 의해 정의된 인식 문장 리스트로서 고립어 또는 연속어(질의어 또는 명령어로 구성된 문장)를 사용자 단말(100)로부터 선택 입력받을 수 있다. 인식 문장 리스트는 예를 들면 '오늘 <지역명> 날씨는', '오늘 <지역명> 날씨 알려 줘', '오늘 <지역명> 날씨 알려 줄래', '<공휴일>에 <지역명> 날씨는' 등과 같이 복수의 단어를 대표하는 클래스의 조합으로 이루어진 문장을 포함할 수 있다. The new vocabulary input unit 200 is a recognition sentence list defined by the user and can receive input from the user terminal 100 by selecting an isolated word or continuous word (a sentence consisting of a query word or a command). The list of recognition sentences is, for example, ‘What is the weather for <region name> today’, ‘Tell me the weather for <region name> today’, ‘Can you tell me the weather for <region name> today’, ‘What is the weather for <region name> on <public holiday>? It may include sentences consisting of a combination of classes representing multiple words, such as '.

도 3b를 참조하면, 신규 어휘 입력부(200)는 인식 문장 리스트에 포함된 클래스에 대한 정의를 포함하는 클래스 리스트를 사용자 단말(100)로부터 입력받을 수 있다. 신규 어휘 입력부(200)는 상위 클래스에 대한 적어도 하나의 하위 클래스에 대한 정의를 사용자 단말(100)로부터 입력받을 수 있다. 예를 들면, 상위 클래스가 '<지역명>'인 경우, 하위 클래스에는 '<국내_도>', '<국내-도시>'가 포함될 수 있다. Referring to FIG. 3B, the new vocabulary input unit 200 may receive a class list including definitions for classes included in the recognition sentence list from the user terminal 100. The new vocabulary input unit 200 may receive a definition of at least one lower class for a higher class from the user terminal 100. For example, if the upper class is '<region name>', the lower class may include '<domestic_province>' and '<domestic-city>'.

신규 어휘 입력부(200)는 각 하위 클래스에 포함되는 단어셋에 대한 정의를 사용자 단말(100)로부터 입력받을 수 있다. 예를 들면, 제 1 하위 클래스가 '<국내_도>'인 경우, 제 1 하위 클래스에는 <국내_도>에 속하는 '경기도, 전라북도 등'의 단어셋이 포함될 수 있다. 제 2 하위 클래스가 '<국내_도시>'인 경우, 제 2 하위 클래스에는 <국내_도시>에 속하는 '서울시, 부산시 등'의 단어셋이 포함될 수 있다. The new vocabulary input unit 200 can receive definitions for word sets included in each subclass from the user terminal 100. For example, if the first subclass is '<domestic_province>', the first subclass may include a word set such as 'Gyeonggi-do, Jeollabuk-do, etc.' belonging to <domestic_province>. When the second subclass is '<domestic_city>', the second subclass may include a word set such as 'Seoul, Busan, etc.' belonging to <domestic_city>.

신규 어휘 입력부(200)는 적어도 하나의 하위 클래스가 존재하지 않는 개별 클래스에 포함되는 단어셋에 대한 정의를 사용자 단말(100)로부터 입력받을 수 있다. 예를 들면, 개별 클래스가 '<공휴일>'인 경우, <공휴일>에 포함된 단어의 셋에는 '개천절, 삼일절, 광복절 등'이 포함될 수 있다. The new vocabulary input unit 200 may receive from the user terminal 100 a definition for a word set included in an individual class in which at least one subclass does not exist. For example, if the individual class is '<public holiday>', the set of words included in <public holiday> may include 'National Foundation Day, Independence Day, Liberation Day, etc.'

도 3c를 참조하면, 신규 어휘 입력부(200)는 사용자 단말(100)의 사용자에 의해 정의된 특수 단어셋에 대한 수동 발음 사전을 사용자 단말(100)로부터 입력받을 수 있다. 여기서, 수동 발음 사전은 특수 단어셋에 포함된 단어 하나당 발음될 수 있는 복수의 경우에 대한 발음을 포함할 수 있다. 예를 들면, 사용자에 의해 정의된 특수 단어셋이 'CES 2017(국제 행사명)'인 경우, 'CES 2017'에 대한 수동 발음 사전에는 '씨이에스 이공일칠', '씨이에스 이천십칠'이 사용자에 의해 입력될 수 있다. Referring to FIG. 3C, the new vocabulary input unit 200 may receive a manual pronunciation dictionary for a special word set defined by the user of the user terminal 100 from the user terminal 100. Here, the manual pronunciation dictionary may include pronunciations for a plurality of cases that can be pronounced for each word included in the special word set. For example, if the special word set defined by the user is 'CES 2017 (international event name)', the manual pronunciation dictionary for 'CES 2017' includes 'CS Lee Gong Il Seven' and 'CS Two Thousand Seven Seven'. It can be entered by .

신규 어휘 검증부(210)는 사용자 단말(100)로부터 입력된 신규 어휘에 대한 검증을 수행할 수 있다. 구체적으로, 신규 어휘 검증부(210)는 인코딩 오류 검사, 부호 오류 검사 및 음절 오류 검사 중 적어도 하나를 이용하여 신규 어휘에 대한 오류 검증을 수행할 수 있다. The new vocabulary verification unit 210 may perform verification of the new vocabulary input from the user terminal 100. Specifically, the new vocabulary verification unit 210 may perform error verification on the new vocabulary using at least one of an encoding error check, a sign error check, and a syllable error check.

도 4를 참조하면, 신규 어휘 검증부(210)는 사용자 단말(100)의 운영체제에서 사용하는 문자 인코딩 방식(예컨대, UTF-8, Unicode 등)을 이용하여 신규 어휘에 대한 인식 문장 리스트 및 클래스 리스트가 포함된 파일(400)에 대한 인코딩 오류 검사(402)를 수행할 수 있다. Referring to FIG. 4, the new vocabulary verification unit 210 uses the character encoding method (e.g., UTF-8, Unicode, etc.) used in the operating system of the user terminal 100 to create a recognition sentence list and class list for the new vocabulary. An encoding error check (402) can be performed on the file (400) included.

이후, 신규 어휘 검증부(210)는 신규 어휘에 대한 인식 문장 리스트 및 클래스 리스트의 부호 오류 검사(404)를 수행할 수 있다. 구체적으로, 신규 어휘 검증부(210)는 언어 모델에서 사용되지 못하는 특수 기호(예컨대, '~!@#' 등)가 신규 어휘에 사용되고 있는지 여부를 신규 어휘의 음절 단위 마다 검사하고, 신규 어휘에 특수 기호가 포함되어 있는 경우, 특수 기호가 포함된 문자를 검출할 수 있다. Thereafter, the new vocabulary verification unit 210 may perform a sign error check 404 on the recognition sentence list and class list for the new vocabulary. Specifically, the new vocabulary verification unit 210 checks for each syllable of the new vocabulary whether special symbols (e.g., '~!@#', etc.) that cannot be used in the language model are used in the new vocabulary, and If special symbols are included, characters containing the special symbols can be detected.

이후, 신규 어휘 검증부(210)는 신규 어휘에 대한 인식 문장 리스트 및 클래스 리스트의 음절 오류 검사(406)를 수행할 수 있다. 구체적으로, 한국어 음성인식기의 경우, 표준 음절 규격(예컨대, KS X 1001)에 속하지 않는 비정상 음절은 언어 모델 훈련시 오류가 발생할 수 있으므로 신규 어휘 검증부(210)는 신규 어휘에 대한 인식 문장 리스트 및 클래스 리스트가 포함된 파일(400) 전체를 음절 단위 마다 검사를 실시하여 오류가 발생하는 음절이 포함된 문자를 검출할 수 있다. Afterwards, the new vocabulary verification unit 210 may perform a syllable error check 406 of the recognition sentence list and class list for the new vocabulary. Specifically, in the case of the Korean speech recognizer, abnormal syllables that do not belong to the standard syllable standard (e.g., KS By inspecting the entire file 400 containing the class list for each syllable, characters containing error-producing syllables can be detected.

만일, 신규 어휘에 대한 검증(인코딩 오류 검사(402), 부호 오류 검사(404) 및 음절 오류 검사(406))이 통과되지 못한 경우, 신규 어휘 검증부(210)는 검출된 오류에 대한 검증 보고서(408)를 작성하고, 오류 메시지 전송부(240)는 사용자 단말(100)에게 검증 보고서(408)와 함께 신규 어휘에 대한 재입력을 요청하는 오류 메시지를 사용자 단말(100)에게 전송할 수 있다. If the verification of the new vocabulary (encoding error check 402, sign error check 404, and syllable error check 406) does not pass, the new vocabulary verification unit 210 provides a verification report for the detected error. After writing (408), the error message transmission unit 240 may transmit to the user terminal 100 an error message requesting re-input of the new vocabulary along with the verification report 408.

만일, 신규 어휘에 대한 검증이 통과되면 신규 어휘 언어 모델 구축부(220)에서 신규 어휘에 대한 언어 모델의 구성 요소(410)를 학습할 수 있다. If the verification of the new vocabulary passes, the new vocabulary language model construction unit 220 can learn the components 410 of the language model for the new vocabulary.

신규 어휘 언어 모델 구축부(220)는 검증이 완료된 신규 어휘로부터 언어 모델의 구성 요소를 학습하여 신규 어휘에 대한 언어 모델을 구축할 수 있다. The new vocabulary language model construction unit 220 may build a language model for the new vocabulary by learning the components of the language model from the new vocabulary that has been verified.

신규 어휘 언어 모델 구축부(220)는 검증이 완료된 신규 어휘에 대한 인식 문장 리스트 및 클래스 리스트를 이용하여 문장 리스트를 생성할 수 있다. The new vocabulary language model construction unit 220 may generate a sentence list using the recognition sentence list and class list for the new vocabulary for which verification has been completed.

도 5a를 참조하면, 신규 어휘 언어 모델 구축부(220)는 신규 어휘에 대한 인식 문장 리스트에 포함된 적어도 하나의 문장의 어절을 분석하고, 분석된 문장의 어절마다 클래스 리스트에 포함된 적어도 하나의 클래스와 일치하는 어절이 있는지를 판단할 수 있다. 이후, 신규 어휘 언어 모델 구축부(220)는 클래스가 포함된 문장의 어절이 있는 경우, 문장의 어절에 클래스에 포함된 복수의 단어를 이용하여 문장 리스트를 생성할 수 있다. Referring to FIG. 5A, the new vocabulary language model construction unit 220 analyzes the word of at least one sentence included in the recognition sentence list for the new vocabulary, and at least one word included in the class list for each word of the analyzed sentence. You can determine whether there is a word that matches the class. Afterwards, if there is a word of the sentence that includes a class, the new vocabulary language model building unit 220 may generate a sentence list using a plurality of words included in the class in the word of the sentence.

구체적으로, 신규 어휘 언어 모델 구축부(220)는 인식 문장 리스트로부터 개별 문장을 추출하여 제 1 버퍼에 저장하고, 클래스 리스트로부터 클래스-클래스에 포함된 단어셋을 추출하여 제 2 버퍼에 저장할 수 있다. Specifically, the new vocabulary language model construction unit 220 may extract individual sentences from the recognized sentence list and store them in the first buffer, and extract the word set included in the class-class from the class list and store them in the second buffer. .

이후, 신규 어휘 언어 모델 구축부(220)는 제 1 버퍼에 저장된 각 개별 문장마다 문장의 어절을 분석하고, 분석된 문장의 어절이 제 2 버퍼에 저장된 클래스에 대응되는지 여부를 판단할 수 있다. Thereafter, the new vocabulary language model construction unit 220 may analyze the word phrase of each individual sentence stored in the first buffer and determine whether the word phrase of the analyzed sentence corresponds to the class stored in the second buffer.

만일, 분석된 문장의 특정 어절이 제 2 버퍼에 저장된 하나의 클래스에 대응되는 경우, 신규 어휘 언어 모델 구축부(220)는 해당 특정 어절을 해당 특정 어절에 대응되는 클래스에 포함된 단어셋으로 교체하여 제 1 버퍼에 다시 저장할 수 있다. 제 1 버퍼에 저장된 문장의 모든 어절이 클래스가 아닌 단어로만 구성된 경우, 신규 어휘 언어 모델 구축부(220)는 해당 문장을 제 3 버퍼에 추가 저장할 수 있다. If a specific word in the analyzed sentence corresponds to one class stored in the second buffer, the new vocabulary language model construction unit 220 replaces the specific word with a word set included in the class corresponding to the specific word. Thus, it can be stored again in the first buffer. If all words in the sentence stored in the first buffer consist only of words and not classes, the new vocabulary language model construction unit 220 may additionally store the corresponding sentence in the third buffer.

예를 들면, 인식 문장 리스트에 "오늘 <지역명> 날씨는"이라는 제 1 문장이 있고 클래스 리스트가 도 3b와 같이 정의된다고 가정하면, 신규 어휘 언어 모델 구축부(220)는 제 1 문장을 제 1 버퍼(도 5a의 Q1에 해당)에 넣고, 제 1 문장의 첫번째 어절부터 클래스에 대응되는지 여부를 판단할 수 있다. 이 때, 도 3b에 도시된 클래스 리스트를 살펴보면, <지역명>의 클래스는 <국내_도>와 <국내_도시> 등의 단어셋을 포함하고, <국내_도>의 클래스는 <경기도>와 <전라북도> 등의 단어셋을 포함하고, <국내_도시>의 클래스는 <서울시>와 <부산시> 등의 단어셋을 포함한다. For example, assuming that the recognition sentence list includes the first sentence “Today’s <area name> weather is” and the class list is defined as shown in Figure 3b, the new vocabulary language model construction unit 220 creates the first sentence. 1 buffer (corresponding to Q1 in FIG. 5A), and it can be determined whether the first word of the first sentence corresponds to a class. At this time, looking at the class list shown in Figure 3b, the class of <region name> includes word sets such as <domestic_province> and <domestic_city>, and the class of <domestic_province> is <Gyeonggi-do> It includes word sets such as <Jeollabuk-do> and <Domestic_City>, and the class of <Domestic_City> includes word sets such as <Seoul> and <Busan>.

따라서, 신규 어휘 언어 모델 구축부(220)는 제 1 문장의 <지역명>을 <국내_도>로 교체하고, 교체된 제 2 문장인 "오늘 <국내_도> 날씨는"을 제 1 버퍼에 추가 저장하고, 제 1 문장의 <지역명>을 <국내_도시>로 교체한 제 3 문장인 "오늘 <국내_도시> 날씨는"을 제 1 버퍼에 추가 저장한다. 이와 같은 방식으로 신규 어휘 언어 모델 구축부(220)는 제 1 버퍼에서 제 2 문장인 "오늘 <국내_도> 날씨는"을 꺼내 <국내_도>를 <경기도>, <전라북도>로 교체하여 "오늘 경기도 날씨는"과 "오늘 전라북도 날씨는" 등의 문장들로 확장하고 확장된 문장(클래스가 포함되지 않은 문장)들을 제 2 버퍼(도 5a의 Q2에 해당)에 저장한다. Therefore, the new vocabulary language model construction unit 220 replaces <region name> in the first sentence with <domestic_province>, and replaces the replaced second sentence, “What is the weather in <domestic_province> today” in the first buffer. , and the third sentence, "What is the weather in <domestic_city> today", in which <region name> in the first sentence is replaced with <domestic_city>, is additionally stored in the first buffer. In this way, the new vocabulary language model construction unit 220 takes out the second sentence, “What is the weather in <Korea_Province> today” from the first buffer, and replaces <Korea_Province> with <Gyeonggi-do> and <Jeollabuk-do> It is expanded into sentences such as “What is the weather in Gyeonggi-do today” and “What is the weather in Jeollabuk-do today”, and the expanded sentences (sentences that do not include classes) are stored in the second buffer (corresponding to Q2 in Figure 5a).

신규 어휘 언어 모델 구축부(220)는 제 3 문장인 "오늘 <국내_도시> 날씨는"를 "오늘 서울 날씨는", "오늘 부산시 날씨는" 등으로 확장하고 해당 확장된 문장들을 제 2 버퍼에 저장한다. 이를 통해 신규 어휘 언어 모델 구축부(220)는 총 4개의 문장("오늘 경기도 날씨는", "오늘 전라북도 날씨는", "오늘 서울 날씨는", "오늘 부산시 날씨는")을 포함하는 문장 리스트를 생성할 수 있다. The new vocabulary language model construction unit 220 expands the third sentence, “Today’s weather in <domestic_city>” to “Today’s weather in Seoul,” “Today’s weather in Busan,” etc., and stores the expanded sentences in the second buffer. Save it to Through this, the new vocabulary language model construction unit 220 creates a sentence list containing a total of four sentences (“Today’s weather in Gyeonggi-do”, “Today’s weather in Jeollabuk-do”, “Today’s weather in Seoul”, “Today’s weather in Busan”). can be created.

신규 어휘 언어 모델 구축부(220)는 검증이 완료된 신규 어휘에 대한 수동 발음 사전 및 신규 어휘의 발음열을 병합하여 통합 발음 사전을 구축할 수 있다. 여기서, 통합 발음 사전은 문장을 구성하는 모든 단어에 대한 음소 단위의 발음열 집합을 의미한다. 수동 발음 사전은 사용자에 의해 작성된 신규 어휘에 대한 발음열을 포함하고, 신규 어휘의 발음열은 모든 단어에 대해 기설정된 규칙에 의해 생성되는 발음열을 포함한다. The new vocabulary language model construction unit 220 may build an integrated pronunciation dictionary by merging the manual pronunciation dictionary for the new vocabulary that has been verified and the pronunciation string of the new vocabulary. Here, the integrated pronunciation dictionary refers to a set of pronunciation strings in phoneme units for all words constituting a sentence. The manual pronunciation dictionary includes a pronunciation sequence for new vocabulary created by the user, and the pronunciation sequence of the new vocabulary includes a pronunciation sequence generated by preset rules for all words.

도 5b를 참조하면, 신규 어휘의 발음열은 신규 어휘가 자소 단위로 분할되고, 분할된 자소 단위에 음운 규칙이 적용되어 생성된 자소 단위의 발음열일 수 있다. 예를 들면, 신규 어휘가 '광복절'인 경우, 신규 어휘 언어 모델 구축부(220)는 '광복절'을 자소 단위(ㄱ1 ㅘ2 ㅇ3 ㅂ1 ㅗ2 ㄱ3 ㅈ1 ㅓ2 ㄹ3)로 분할할 수 있다. 이후, 신규 어휘 언어 모델 구축부(220)는 분할된 신규 어휘의 자소 단위에 음운 규칙(ㄱ3ㅈ1 -> ㄱ3ㅉ1)을 적용하여 자소 단위에 대한 발음 정보인 음소열(ㄱ1 ㅘ2 ㅇ3 ㅂ1 ㅗ2 ㄱ3 ㅉ1 ㅓ2 ㄹ3)을 생성할 수 있다. 이후, 신규 어휘 언어 모델 구축부(220)는 생성된 음소열을 '광복절'에 대한 발음열로 통합 발음 사전에 추가할 수 있다. Referring to Figure 5b, the pronunciation sequence of the new vocabulary may be a pronunciation sequence of grapheme units created by dividing the new vocabulary into grapheme units and applying phonological rules to the divided grapheme units. For example, if the new vocabulary is 'Liberation Day', the new vocabulary language model construction unit 220 converts 'Liberation Day' into grapheme units (ㄱ1 ㅘ2 ㅇ3 ㅂ1 ㅗ2 ㄱ3 ㅈ1 ㅓ2 ㄹ3). It can be divided. Afterwards, the new vocabulary language model construction unit 220 applies the phoneme rule (ㄱ3ㅈ1 -> ㄱ3ㅉ1) to the grapheme unit of the divided new vocabulary to create a phoneme string (ㄱ1 ㅘ), which is pronunciation information for the grapheme unit. 2 ㅇ3 ㅂ1 ㅗ2 ㄱ3 ㅉ1 ㅓ2 ㄹ3) can be created. Afterwards, the new vocabulary language model construction unit 220 may add the generated phoneme string to the integrated pronunciation dictionary as a pronunciation string for 'Liberation Day'.

신규 어휘 언어 모델 구축부(220)는 생성된 문장 리스트로부터 n-gram 확률 및 back-off 확률을 계산하여 신규 어휘에 대한 언어 모델을 구축할 수 있다. The new vocabulary language model construction unit 220 may build a language model for the new vocabulary by calculating the n-gram probability and back-off probability from the generated sentence list.

음성 인식 수행부(230)는 구축된 언어 모델을 이용한 음성 인식 서비스를 사용자 단말(100)에게 제공할 수 있다. 구체적으로, 음성 인식 수행부(230)는 신규 어휘에 대한 언어 모델을 이용하여 사용자 단말(100)에서 입력된 발화 음성에 대한 음성 인식을 수행하고, 수행된 음성 인식에 대한 결과를 사용자 단말(100)에게 제공할 수 있다. The voice recognition performing unit 230 may provide the user terminal 100 with a voice recognition service using the constructed language model. Specifically, the voice recognition performing unit 230 performs voice recognition on the spoken voice input from the user terminal 100 using a language model for the new vocabulary, and reports the results of the performed voice recognition to the user terminal 100. ) can be provided to.

한편, 당업자라면, 신규 어휘 입력부(200), 신규 어휘 검증부(210), 신규 어휘 언어 모델 구축부(220), 음성 인식 수행부(230) 및 오류 메시지 전송부(240) 각각이 분리되어 구현되거나, 이 중 하나 이상이 통합되어 구현될 수 있음을 충분히 이해할 것이다. Meanwhile, those skilled in the art will know that the new vocabulary input unit 200, the new vocabulary verification unit 210, the new vocabulary language model construction unit 220, the voice recognition performance unit 230, and the error message transmission unit 240 are implemented separately. Alternatively, it will be fully understood that one or more of these may be integrated and implemented.

도 6은 본 발명의 일 실시예에 따른, 언어 모델 생성 장치(110)에서 사용자 맞춤형 언어 모델을 생성하는 방법을 나타낸 흐름도이다. Figure 6 is a flowchart showing a method of generating a user-customized language model in the language model generating device 110 according to an embodiment of the present invention.

도 6에 도시된 실시예에 따른 사용자 맞춤형 언어 모델 생성 방법은 도 1 내지 도 5b에 도시된 실시예에 따른 사용자 단말(100) 및 언어 모델 생성 장치(110)에서 시계열적으로 처리되는 단계들을 포함한다. 따라서, 이하 생략된 내용이라고 하더라도 도 1 내지 도 5b의 사용자 단말(100) 및 언어 모델 생성 장치(110)에 관하여 기술된 내용은 도 6에 도시된 실시예에 따른 사용자 맞춤형 언어 모델 생성 방법에도 적용될 수 있다. The method for generating a user-customized language model according to the embodiment shown in FIG. 6 includes steps processed in time series in the user terminal 100 and the language model generating device 110 according to the embodiment shown in FIGS. 1 to 5B. do. Therefore, even if the content is omitted below, the content described regarding the user terminal 100 and the language model generating device 110 of FIGS. 1 to 5B can also be applied to the user-customized language model generation method according to the embodiment shown in FIG. 6. You can.

도 6을 참조하면, 단계 S601에서 언어 모델 생성 장치(110)는 사용자 단말(100)로부터 신규 어휘를 입력받을 수 있다. Referring to FIG. 6, in step S601, the language model generating device 110 may receive a new vocabulary input from the user terminal 100.

단계 S603에서 언어 모델 생성 장치(110)는 사용자 단말(100)로부터 입력된 신규 어휘에 대한 검증을 수행할 수 있다. In step S603, the language model generating device 110 may verify the new vocabulary input from the user terminal 100.

단계 S605에서 언어 모델 생성 장치(110)는 검증이 완료된 신규 어휘로부터 언어 모델의 구성 요소를 학습하여 신규 어휘에 대한 언어 모델을 구축할 수 있다. In step S605, the language model generating device 110 may build a language model for the new vocabulary by learning components of the language model from the new vocabulary that has been verified.

단계 S607에서 언어 모델 생성 장치(110)는 구축된 언어 모델을 이용한 음성 인식 서비스를 사용자 단말(100)에게 제공할 수 있다. In step S607, the language model generating device 110 may provide a voice recognition service using the constructed language model to the user terminal 100.

도 6에는 도시되지 않았으나, 단계 S603에서 언어 모델 생성 장치(110)는 인코딩 오류 검사, 부호 오류 검사 및 음절 오류 검사 중 적어도 하나를 이용하여 신규 어휘에 대한 오류 검증을 수행할 수 있다. Although not shown in FIG. 6, in step S603, the language model generating device 110 may perform error verification on the new vocabulary using at least one of encoding error check, sign error check, and syllable error check.

도 6에는 도시되지 않았으나, 단계 S603 이후에 언어 모델 생성 장치(110)는 신규 어휘에 대한 검증이 통과되지 못한 경우, 사용자 단말(100)에게 신규 어휘에 대한 재입력을 요청하는 오류 메시지를 사용자 단말(100)에게 전송할 수 있다. Although not shown in FIG. 6, if verification of the new vocabulary does not pass after step S603, the language model generating device 110 sends an error message to the user terminal 100 requesting re-input of the new vocabulary. It can be sent to (100).

상술한 설명에서, 단계 S601 내지 S607은 본 발명의 구현예에 따라서, 추가적인 단계들로 더 분할되거나, 더 적은 단계들로 조합될 수 있다. 또한, 일부 단계는 필요에 따라 생략될 수도 있고, 단계 간의 순서가 변경될 수도 있다. In the above description, steps S601 to S607 may be further divided into additional steps or combined into fewer steps, depending on the implementation of the present invention. Additionally, some steps may be omitted or the order between steps may be changed as needed.

도 7은 본 발명의 일 실시예에 따른, 도 1에 도시된 사용자 단말(100)의 블록도이다. FIG. 7 is a block diagram of the user terminal 100 shown in FIG. 1 according to an embodiment of the present invention.

도 7을 참조하면, 사용자 단말(100)은 신규 어휘 입력부(700), 전송부(710), 음성 인식 제공부(720), 발화 음성 입력부(730) 및 오류 메시지 출력부(740)를 포함할 수 있다. 다만, 도 7에 도시된 사용자 단말(100)은 본 발명의 하나의 구현 예에 불과하며, 도 7에 도시된 구성요소들을 기초로 하여 여러 가지 변형이 가능하다. Referring to FIG. 7, the user terminal 100 may include a new vocabulary input unit 700, a transmission unit 710, a voice recognition provider 720, a spoken voice input unit 730, and an error message output unit 740. You can. However, the user terminal 100 shown in FIG. 7 is only one implementation example of the present invention, and various modifications are possible based on the components shown in FIG. 7.

신규 어휘 입력부(700)는 사용자로부터 신규 언어 모델 생성을 위한 신규 어휘를 입력받을 수 있다. 예를 들면, 도 8을 참조하면, 신규 어휘 입력부(700)는 신규 어휘에 대한 인식 문장 리스트(800), 클래스 리스트(802) 및 사용자의 수동 발음 사전(804)을 사용자로부터 입력받을 수 있다. 여기서, 인식 문장 리스트는 단어 또는 복수의 단어를 포함하는 클래스의 조합을 포함하고, 클래스는 복수의 단어를 대표하는 집단명일 수 있다. 클래스 리스트는 인식 문장 리스트에 포함된 클래스에 대한 정의를 포함할 수 있다. 수동 발음 사전은 사용자 단말(100)의 사용자에 의해 정의된 특수 단어셋에 대한 발음 사전이고, 특수 단어셋에 포함된 단어 하나당 발음될 수 있는 복수의 경우에 대한 발음을 포함할 수 있다. The new vocabulary input unit 700 can receive input of a new vocabulary for creating a new language model from the user. For example, referring to FIG. 8 , the new vocabulary input unit 700 may receive a recognition sentence list 800, a class list 802, and the user's manual pronunciation dictionary 804 as input for the new vocabulary from the user. Here, the recognition sentence list includes a word or a combination of classes including a plurality of words, and the class may be a group name representing a plurality of words. The class list may include definitions for classes included in the recognition sentence list. The manual pronunciation dictionary is a pronunciation dictionary for a special word set defined by the user of the user terminal 100, and may include pronunciations for a plurality of cases that can be pronounced for each word included in the special word set.

전송부(710)는 사용자로부터 입력받은 신규 어휘를 언어 모델 생성 장치(110)에게 전송할 수 있다. 구체적으로, 전송부(710)는 신규 어휘에 대한 인식 문장 리스트(800), 클래스 리스트(802) 및 사용자의 수동 발음 사전(804)을 언어 모델 생성 장치(110)에게 전송할 수 있다.The transmission unit 710 may transmit the new vocabulary input from the user to the language model generating device 110. Specifically, the transmission unit 710 may transmit the recognition sentence list 800 for the new vocabulary, the class list 802, and the user's manual pronunciation dictionary 804 to the language model generating device 110.

이 후, 음성 인식 제공부(720)는 신규 어휘에 대한 언어 모델을 이용한 음성 인식 서비스를 언어 모델 생성 장치(110)로부터 제공받을 수 있다. 여기서, 신규 어휘에 대한 언어 모델은 언어 모델 생성 장치(110)에 의해 검증된 신규 어휘로부터 학습된 구성 요소에 기초하여 구축된 모델일 수 있다. Afterwards, the voice recognition provider 720 may receive a voice recognition service using a language model for the new vocabulary from the language model generation device 110. Here, the language model for the new vocabulary may be a model built based on components learned from the new vocabulary verified by the language model generating device 110.

신규 어휘에 대한 언어 모델이 구축된 이후, 발화 음성 입력부(730)는 사용자로부터 발화 음성을 입력받고, 전송부(710)는 사용자로부터 입력된 발화 음성을 언어 모델 생성 장치(110)에게 전송할 수 있다. After the language model for the new vocabulary is built, the spoken voice input unit 730 receives the spoken voice input from the user, and the transmission unit 710 can transmit the spoken voice input from the user to the language model generating device 110. .

이후, 음성 인식 제공부(720)는 신규 어휘에 대한 모델 및 발화 음성에 기초하여 수행된 음성 인식에 대한 결과를 언어 모델 생성 장치(110)로부터 수신하고, 음성 인식에 대한 결과를 사용자 단말(100)의 화면에 출력할 수 있다. Afterwards, the speech recognition provider 720 receives the results of speech recognition performed based on the model for the new vocabulary and the speech voice from the language model generating device 110, and sends the results of the speech recognition to the user terminal 100. ) can be output on the screen.

오류 메시지 출력부(740)는 언어 모델 생성 장치(110)에서 신규 어휘에 대한 검증이 통과되지 못한 경우, 언어 모델 생성 장치(110)로부터 신규 어휘에 대한 재입력을 요청하는 오류 메시지를 수신하여 사용자 단말(100)에 출력할 수 있다. If the verification of the new vocabulary in the language model generating device 110 fails, the error message output unit 740 receives an error message requesting re-input of the new vocabulary from the language model generating device 110 and It can be output to the terminal 100.

전송부(710)는 사용자로부터 재입력받은 신규 어휘를 언어 모델 생성 장치(110)에게 재전송할 수 있다.The transmission unit 710 may retransmit the new vocabulary re-entered by the user to the language model generating device 110.

한편, 당업자라면, 신규 어휘 입력부(700), 전송부(710), 음성 인식 제공부(720), 발화 음성 입력부(730) 및 오류 메시지 출력부(740) 각각이 분리되어 구현되거나, 이 중 하나 이상이 통합되어 구현될 수 있음을 충분히 이해할 것이다. Meanwhile, those skilled in the art will know that the new vocabulary input unit 700, transmission unit 710, voice recognition provider 720, spoken voice input unit 730, and error message output unit 740 are each implemented separately, or one of them. It will be fully understood that the above can be integrated and implemented.

도 9는 본 발명의 일 실시예에 따른, 사용자 단말(100)에서 사용자 맞춤형 언어 모델을 제공하는 방법을 나타낸 흐름도이다.Figure 9 is a flowchart showing a method of providing a user-customized language model in the user terminal 100, according to an embodiment of the present invention.

도 9에 도시된 실시예에 따른 사용자 맞춤형 언어 모델 제공 방법은 도 1 내지 도 8에 도시된 실시예에 따른 사용자 단말(100) 및 언어 모델 생성 장치(110)에서 시계열적으로 처리되는 단계들을 포함한다. 따라서, 이하 생략된 내용이라고 하더라도 도 1 내지 도 8의 사용자 단말(100) 및 언어 모델 생성 장치(110)에 관하여 기술된 내용은 도 9에 도시된 실시예에 따른 사용자 맞춤형 언어 모델 제공 방법에도 적용될 수 있다. The method for providing a user-customized language model according to the embodiment shown in FIG. 9 includes steps processed in time series in the user terminal 100 and the language model generating device 110 according to the embodiment shown in FIGS. 1 to 8. do. Therefore, even if the content is omitted below, the content described regarding the user terminal 100 and the language model generating device 110 of FIGS. 1 to 8 can also be applied to the method of providing a user customized language model according to the embodiment shown in FIG. 9. You can.

도 9를 참조하면, 단계 S901에서 사용자 단말(100)은 사용자로부터 신규 어휘를 입력받을 수 있다. Referring to FIG. 9, in step S901, the user terminal 100 may receive a new vocabulary input from the user.

단계 S903에서 사용자 단말(100)은 사용자로부터 입력받은 신규 어휘를 언어 모델 생성 장치(110)로 전송할 수 있다. In step S903, the user terminal 100 may transmit the new vocabulary input from the user to the language model generating device 110.

단계 S905에서 사용자 단말(100)은 신규 어휘에 대한 언어 모델을 통한 음성 인식 서비스를 언어 모델 생성 장치(110)로부터 제공받을 수 있다. 여기서, 신규 어휘에 대한 언어 모델은 언어 모델 생성 장치(110)에 의해 검증된 신규 어휘로부터 학습된 구성 요소에 기초하여 구축된 모델일 수 있다. In step S905, the user terminal 100 may receive a voice recognition service through a language model for a new vocabulary from the language model generating device 110. Here, the language model for the new vocabulary may be a model built based on components learned from the new vocabulary verified by the language model generating device 110.

상술한 설명에서, 단계 S901 내지 S905는 본 발명의 구현예에 따라서, 추가적인 단계들로 더 분할되거나, 더 적은 단계들로 조합될 수 있다. 또한, 일부 단계는 필요에 따라 생략될 수도 있고, 단계 간의 순서가 변경될 수도 있다. In the above description, steps S901 to S905 may be further divided into additional steps or combined into fewer steps, depending on the implementation of the present invention. Additionally, some steps may be omitted or the order between steps may be changed as needed.

본 발명의 일 실시예는 컴퓨터에 의해 실행되는 프로그램 모듈과 같은 컴퓨터에 의해 실행 가능한 명령어를 포함하는 기록 매체의 형태로도 구현될 수 있다. 컴퓨터 판독 가능 매체는 컴퓨터에 의해 액세스될 수 있는 임의의 가용 매체일 수 있고, 휘발성 및 비휘발성 매체, 분리형 및 비분리형 매체를 모두 포함한다. 또한, 컴퓨터 판독가능 매체는 컴퓨터 저장 매체를 모두 포함할 수 있다. 컴퓨터 저장 매체는 컴퓨터 판독가능 명령어, 데이터 구조, 프로그램 모듈 또는 기타 데이터와 같은 정보의 저장을 위한 임의의 방법 또는 기술로 구현된 휘발성 및 비휘발성, 분리형 및 비분리형 매체를 모두 포함한다. One embodiment of the present invention may also be implemented in the form of a recording medium containing instructions executable by a computer, such as program modules executed by a computer. Computer-readable media can be any available media that can be accessed by a computer and includes both volatile and non-volatile media, removable and non-removable media. Additionally, computer-readable media may include all computer storage media. Computer storage media includes both volatile and non-volatile, removable and non-removable media implemented in any method or technology for storage of information such as computer-readable instructions, data structures, program modules or other data.

전술한 본 발명의 설명은 예시를 위한 것이며, 본 발명이 속하는 기술분야의 통상의 지식을 가진 자는 본 발명의 기술적 사상이나 필수적인 특징을 변경하지 않고서 다른 구체적인 형태로 쉽게 변형이 가능하다는 것을 이해할 수 있을 것이다. 그러므로 이상에서 기술한 실시예들은 모든 면에서 예시적인 것이며 한정적이 아닌 것으로 이해해야만 한다. 예를 들어, 단일형으로 설명되어 있는 각 구성 요소는 분산되어 실시될 수도 있으며, 마찬가지로 분산된 것으로 설명되어 있는 구성 요소들도 결합된 형태로 실시될 수 있다. The description of the present invention described above is for illustrative purposes, and those skilled in the art will understand that the present invention can be easily modified into other specific forms without changing the technical idea or essential features of the present invention. will be. Therefore, the embodiments described above should be understood in all respects as illustrative and not restrictive. For example, each component described as single may be implemented in a distributed manner, and similarly, components described as distributed may also be implemented in a combined form.

본 발명의 범위는 상세한 설명보다는 후술하는 특허청구범위에 의하여 나타내어지며, 특허청구범위의 의미 및 범위 그리고 그 균등 개념으로부터 도출되는 모든 변경 또는 변형된 형태가 본 발명의 범위에 포함되는 것으로 해석되어야 한다. The scope of the present invention is indicated by the claims described below rather than the detailed description, and all changes or modified forms derived from the meaning and scope of the claims and their equivalent concepts should be construed as being included in the scope of the present invention. .

100: 사용자 단말
110: 언어 모델 생성 장치
200: 신규 어휘 입력부
210: 신규 어휘 검증부
220: 신규 어휘 언어 모델 구축부
230: 음성 인식 수행부
240: 오류 메시지 전송부100: user terminal
110: Language model generation device
200: New vocabulary input unit
210: New vocabulary verification unit
220: New vocabulary language model construction unit
230: Voice recognition performance unit
240: Error message transmission unit

Claims

In a device for generating a user-customized language model,
A new vocabulary input unit that receives new vocabulary from the user terminal;
a new vocabulary verification unit that verifies the input new vocabulary;
a new vocabulary language model construction unit that constructs a language model for the new vocabulary by learning components of the language model from the verified new vocabulary; and
A voice recognition performing unit that provides a voice recognition service related to the constructed language model to the user terminal,
The new vocabulary language model construction unit generates a sentence list based on at least one word of the sentence included in the recognition sentence list for the new vocabulary and a word of the sentence that matches at least one class included in the class list. A language model generation device that does this.

According to claim 1,
The speech recognition performing unit performs speech recognition on the spoken voice input from the user terminal using a language model for the new vocabulary, and provides results of the performed speech recognition to the user terminal, a language model. Generating device.

According to claim 1,
The new vocabulary input unit
A language model generating device that receives at least one of a recognition sentence list, a class list, and a manual pronunciation dictionary for the new vocabulary from the user terminal.

According to claim 3,
The recognition sentence list includes a word or a combination of classes containing a plurality of words,
The language model generating device, wherein the class is a group name representing the plurality of words.

According to claim 4,
The class list includes definitions for classes included in the recognition sentence list.

According to claim 3,
The manual pronunciation dictionary is a pronunciation dictionary for a special word set defined by the user of the user terminal,
A language model generating device that includes pronunciations for a plurality of cases that can be pronounced for each word included in the special word set.

According to claim 1,
The new vocabulary verification unit performs error verification on the new vocabulary using at least one of an encoding error check, a sign error check, and a syllable error check.

According to claim 1,
If the verification of the new vocabulary does not pass, an error message transmission unit that transmits an error message to the user terminal requesting the user terminal to re-enter the new vocabulary.
A language model generation device further comprising:

According to claim 3,
The new vocabulary language model building unit generates a sentence list using a recognition sentence list and a class list for the new vocabulary for which the verification has been completed.

According to clause 9,
The new vocabulary language model construction unit
A language model generating device that generates a sentence list using a plurality of words included in the class in the word clause of the sentence when there is a word clause in the sentence containing the class.

According to clause 9,
The new vocabulary language model building unit constructs an integrated pronunciation dictionary by merging a manual pronunciation dictionary for the new vocabulary for which the verification has been completed and a pronunciation sequence of the new vocabulary.

According to claim 11,
The pronunciation sequence of the new vocabulary is a pronunciation sequence of grapheme units generated by dividing the new vocabulary into grapheme units and applying phonological rules to the divided grapheme units.

According to claim 12,
The new vocabulary language model building unit constructs a language model for the new vocabulary by calculating n-gram probability and back-off probability from the generated sentence list.

In a user terminal that provides a user-customized language model,
A transmission unit that transmits the new vocabulary input from the user to the language model creation device; and
A voice recognition provider that receives a voice recognition service through a language model for the new vocabulary from the language model generating device.
Including,
Verification of the new vocabulary is performed by the language model generation device, and a language model for the new vocabulary is constructed based on components learned from the verified new vocabulary,
The language model is
It is constructed using a sentence list generated based on a word of at least one sentence included in the recognition sentence list for the new vocabulary and a word of the sentence that matches at least one class included in the class list. , user terminal.

According to claim 14,
The transmitting unit transmits the speech voice input from the user to the language model generating device.

According to claim 15,
The voice recognition provider receives a result of voice recognition performed based on the model for the new vocabulary and the spoken voice from the language model generating device and outputs the results on the screen of the user terminal.

According to claim 14,
If the verification of the new vocabulary does not pass, an error message output unit receives an error message requesting re-input of the new vocabulary from the language model generation device and outputs it to the user terminal.
A user terminal further comprising:

In a method of generating a user-customized language model by a language model generation device,
Receiving a new vocabulary input from a user terminal;
Verifying the input new vocabulary;
constructing a language model for the new vocabulary by learning components of a language model from the verified new vocabulary; and
Providing a speech recognition service for the constructed language model to the user terminal,
The step of building a language model for the new vocabulary is
Generating a sentence list based on at least one word of the sentence included in the recognition sentence list for the new vocabulary and a word of the at least one sentence that matches at least one class included in the class list. How to create a language model.