KR20200041199A

KR20200041199A - Method, apparatus and computer-readable medium for operating chatbot

Info

Publication number: KR20200041199A
Application number: KR1020180121338A
Authority: KR
Inventors: 김종모; 오경모; 김종범; 박리; 박철민; 정덕산; 정미라
Original assignee: 삼성생명보험주식회사
Priority date: 2018-10-11
Filing date: 2018-10-11
Publication date: 2020-04-21

Abstract

The present disclosure is to provide a method for enhancing the accuracy of a chatbot′s answer, to correspond to a chatbot driving technique. According to an embodiment of the present disclosure, disclosed is a method for driving a chatbot. The operations stored in a computer program to perform the method include: an operation of receiving a first input sentence from a user terminal; an operation of determining the degree of sentence completion of the first input sentence; an operation of transmitting an incomplete sentence standard answer to the user terminal in a case where it is determined that the degree of sentence completion of the first input sentence is lower than a predetermined reference; an operation of receiving a second input sentence from the user terminal; an operation of generating a third input sentence in which the first input sentence and the second input sentence are combined; an operation of determining the degree of sentence completion of the third input sentence; and an operation of transmitting an answer based on the third input sentence to the user terminal in a case where it is determined that the degree of sentence completion of the third input sentence is higher than the predetermined reference. The second input sentence is an input sentence received from the user terminal after the first input sentence.

Description

METHOD, APPARATUS AND COMPUTER-READABLE MEDIUM FOR OPERATING CHATBOT

본 개시는 챗봇을 구동하는 기술에 관한 것으로서, 보다 구체적으로 챗봇의 답변 정확성을 높이기 위한 방법에 관한 것이다.The present disclosure relates to a technology for driving a chatbot, and more particularly, to a method for improving the response accuracy of the chatbot.

최근 들어, 사회가 복잡해지고, 미래에 대한 불안감이 고조됨에 따라, 보험을 가입하는 사람이 크게 늘어가고 있다. 이와 같은 수요에 맞춰 다양한 금융 상품이 개발되고 있으며, 주5일제 근무 등 여가 활동 시간이 늘어나면서 보험가입자의 여가 활동이나 레저 활동 등을 타겟으로 하는 금융 상품들이 제시되고 있다.In recent years, as the society has become complicated and the anxiety about the future has increased, the number of people insured has increased significantly. Various financial products have been developed to meet such demands, and financial products targeting leisure activities or leisure activities of insurance subscribers have been proposed as leisure time such as five-day work week increases.

이와 관련하여, 여러 보험사에서 다양한 종류의 보험 상품이 제공되고 있어서, 보험가입자는 자신에게 유리한 조건을 제시하는 보험사 및 상품을 선택해서 계약을 맺을 수 있다. 또한, 보험사는 보험가입자에게 맞는 상품을 추천할 수도 있다.In this regard, various types of insurance products are provided by various insurance companies, and the insurer can select an insurance company and products that offer favorable conditions to the contract. In addition, the insurance company may recommend products suitable for the insurance subscriber.

대한민국 특허공개 공보 KR20000030728A에는 양음, 사주, 팔자 및 오행에 따른 보험 상품을 사용자에게 권유할 수 있는 보험 추천 및 제공 방법을 개시하고 있다.Korean Patent Publication No. KR20000030728A discloses a method of recommending and providing insurance that can incentivize a user to insure insurance products in accordance with positive, negative, negative, and five elements.

한편, 보험 회사는 보험 설계사 및 상담사를 통해 고객의 상담 및 문의를 접수 받고, 그에 따라 대응하고 있다. 예컨대, 자동차 보험에 관심있는 잠재 고객의 경우, 보험 가입을 위해 보험 회사의 상담사, 또는 보험 설계사에게 연락을 한 뒤, 상담을 통해 잠재 고객에게 알맞은 보험을 추천하는 시스템으로 이루어져 있다.Meanwhile, insurance companies receive customer consultations and inquiries through insurance designers and counselors, and respond accordingly. For example, in the case of a potential customer interested in auto insurance, the system consists of a system that recommends insurance suitable for a potential customer through consultation after contacting an insurance company counselor or an insurance architect for insurance.

하지만, 설계사, 또는 상담사를 통한 보험 문의는 인력 문제 및 상담 시간의 제약 때문에 고객이 원하는 시간에 바로 처리되기 어려운 문제가 있다. 또한, 이를 전문적으로 하는 보험 설계사 및 상담사 역시 간단한 업무 조차도 일일이 대응해야 하여 업무가 불필요하게 늘어나는 불편함이 있었다.However, insurance inquiries through a designer or a counselor have a problem that is difficult to be processed immediately at a time desired by a customer because of manpower issues and limitations in consultation time. In addition, insurance architects and counselors who specialize in this also had the inconvenience of unnecessarily increasing work because they had to deal with even simple tasks one by one.

또한, 챗봇은 고객의 질문에 기초하여 예상되는 답변을 실시간으로 검색하여 고객에게 답변을 하는 방식으로, 고객들은 실제 상담사로부터 상담을 받는 느낌을 받을 수 없다는 한계가 있다.In addition, the chatbot is a method of real-time searching for an expected answer based on a customer's question and responding to the customer, and there is a limitation that customers cannot receive the feeling of being consulted by an actual counselor.

따라서, 챗봇을 통해 고객이 실제 상담사에게 상담을 받는 느낌을 받을 수 있도록 하며, 실제 상담사가 답변하는 것과 같은 정확한 답변을 제공하는 방법에 대한 당업계의 수요가 있을 수 있다.Therefore, there may be a demand in the art for a method of providing a customer with a feeling of being counseled by an actual counselor through a chatbot, and providing an accurate answer as the real counselor answers.

본 개시는 전술한 배경 기술에 대응하여 안출된 것으로, 챗봇의 답변 정확도를 높이기 위한 방법을 제공하기 위함이다.The present disclosure has been devised in response to the above-described background technology, and is intended to provide a method for improving the response accuracy of a chatbot.

전술한 바와 같은 과제를 실현하기 위한 본 개시의 일 실시예에 따라 컴퓨터 판독가능 저장 매체에 저장된 컴퓨터 프로그램이 개시된다. 상기 컴퓨터 프로그램은 컴퓨팅 장치의 하나 이상의 프로세서에서 실행되는 경우 챗봇의 구동을 위한 이하의 동작들을 수행하도록 하며 상기 동작들은: 사용자 단말로부터 제 1 입력 문장을 수신하는 동작; 상기 제 1 입력 문장의 문장 완성도를 판단하는 동작; 상기 제 1 입력 문장의 문장 완성도가 사전결정된 기준보다 낮은 것으로 판단되는 경우, 미완성 문장 표준 답변을 상기 사용자 단말로 전송하는 동작; 상기 사용자 단말로부터 제 2 입력 문장을 수신하는 동작; 상기 제 1 입력 문장 및 상기 제 2 입력 문장을 결합한 제 3 입력 문장을 생성하는 동작; 상기 제 3 입력 문장의 문장 완성도를 판단하는 동작; 및 상기 제 3 입력 문장의 문장 완성도가 상기 사전결정된 기준보다 높은 것으로 판단되는 경우, 상기 제 3 입력 문장에 기초한 답변을 상기 사용자 단말로 전송하는 동작을 포함하고, 상기 제 2 입력 문장은 상기 제 1 입력 문장 이후에 상기 사용자 단말로부터 수신된 입력 문장일 수 있다.Disclosed is a computer program stored in a computer-readable storage medium in accordance with one embodiment of the present disclosure for realizing the above-described problems. The computer program, when executed on one or more processors of the computing device, performs the following operations for driving the chatbot, the operations comprising: receiving a first input sentence from a user terminal; Determining a sentence completion level of the first input sentence; When it is determined that the sentence completion level of the first input sentence is lower than a predetermined criterion, transmitting an incomplete sentence standard answer to the user terminal; Receiving a second input sentence from the user terminal; Generating a third input sentence combining the first input sentence and the second input sentence; Determining a sentence completion level of the third input sentence; And when it is determined that the sentence completeness of the third input sentence is higher than the predetermined criterion, transmitting an answer based on the third input sentence to the user terminal, wherein the second input sentence is the first It may be an input sentence received from the user terminal after the input sentence.

챗봇의 구동을 위한 이하의 동작들을 수행하도록 하는 컴퓨터 프로그램 동작들의 대안적인 실시예에서 상기 제 2 입력 문장은, 상기 제 1 입력 문장을 상기 사용자 단말로부터 수신한 시점부터 사전결정된 시간 범위 이내에 수신되거나 또는, 상기 제 1 입력 문장을 상기 사용자 단말로부터 수신한 이후 상기 사용자 단말로부터 수신되는 사전결정된 개수 범위 내의 입력 문장일 수 있다.In an alternative embodiment of computer program operations to perform the following operations for driving the chatbot, the second input sentence is received within a predetermined time range from the time when the first input sentence is received from the user terminal, or , After receiving the first input sentence from the user terminal, it may be an input sentence within a predetermined number range received from the user terminal.

챗봇의 구동을 위한 이하의 동작들을 수행하도록 하는 컴퓨터 프로그램 동작들의 대안적인 실시예에서 상기 제 1 입력 문장의 문장 완성도를 판단하는 동작은, 상기 제 1 입력 문장에 대한 형태소 분석을 수행하는 동작; 상기 형태소 분석을 통해 상기 제 1 입력 문장에 포함된 문장의 구성 요소를 획득하는 동작; 및 상기 문장의 구성요소에 기초하여 상기 제 1 입력 문장의 문장 완성도를 판단하는 동작을 포함할 수 있다.In an alternative embodiment of computer program operations to perform the following operations for driving the chatbot, the operation of determining the sentence completion of the first input sentence may include: performing a morpheme analysis on the first input sentence; Obtaining a component of a sentence included in the first input sentence through the morpheme analysis; And determining a sentence completion level of the first input sentence based on the components of the sentence.

챗봇의 구동을 위한 이하의 동작들을 수행하도록 하는 컴퓨터 프로그램 동작들의 대안적인 실시예에서 상기 제 1 입력 문장의 문장 완성도를 판단하는 동작은, 상기 제 1 입력 문장을 하나 이상의 네트워크 함수를 포함하는 전처리 모델을 통해 연산함으로써 입력 문장 벡터를 획득하는 동작; 상기 입력 문장 벡터와 하나 이상의 기준 문장에 대한 기준 문장 벡터 각각에 대한 유사도를 연산하는 동작; 및 상기 유사도가 임계 값 이상인 기준 문장이 있는 경우, 상기 문장 완성도가 상기 사전결정된 기준보다 높은 것으로 판단하는 동작을 포함할 수 있다.In an alternative embodiment of computer program operations to perform the following operations for driving the chatbot, the operation of determining the sentence completeness of the first input sentence includes a preprocessing model including the first input sentence including one or more network functions. Obtaining an input sentence vector by calculating through; Calculating similarity for each of the input sentence vector and the reference sentence vector for one or more reference sentences; And when there is a reference sentence having the similarity level equal to or greater than a threshold value, determining that the sentence completion level is higher than the predetermined criterion.

챗봇의 구동을 위한 이하의 동작들을 수행하도록 하는 컴퓨터 프로그램 동작들의 대안적인 실시예에서 상기 제 1 입력 문장의 문장 완성도를 판단하는 동작은, 상기 제 1 입력 문장을 문장 주제 판단 모델을 통해 연산하는 동작; 상기 연산을 통해 상기 제 1 입력 문장의 주제 및 서브 주제를 모두 획득하는 경우, 문장 완성도가 상기 사전결정된 기준보다 높은 것으로 판단하는 동작; 및 상기 연산을 통해 상기 주제 및 상기 서브 주제 중 적어도 하나를 획득하지 못하는 경우, 문장 완성도가 상기 사전결정된 기준보다 낮은 것으로 판단하는 동작을 포함할 수 있다.In an alternative embodiment of computer program operations to perform the following operations for driving the chatbot, the operation of determining the sentence completeness of the first input sentence is an operation of calculating the first input sentence through a sentence subject determination model ; Determining that sentence completion is higher than the predetermined criterion when both the subject and the sub-subject of the first input sentence are obtained through the operation; And when at least one of the subject and the sub-subject is not obtained through the operation, determining that sentence completion is lower than the predetermined criterion.

챗봇의 구동을 위한 이하의 동작들을 수행하도록 하는 컴퓨터 프로그램 동작들의 대안적인 실시예에서 상기 문장 주제 판단 모델은, 하나 이상의 기준 문장 및 기준 답변 중 적어도 하나를 임의의 길이로 분할하여 생성된 학습 입력 문장 및 상기 학습 입력 문장에 라벨링 된 주제 및 서브 주제 중 적어도 하나 포함하는 학습 데이터 세트에 기초하여 상기 학습 입력 문장을 입력하는 경우 상기 주제 및 상기 서브 주제 중 적어도 하나를 출력하도록 학습되는 모델일 수 있다.In an alternative embodiment of computer program operations to perform the following operations for driving a chatbot, the sentence subject judgment model is a learning input sentence generated by dividing at least one of one or more reference sentences and reference responses to an arbitrary length. And a model learned to output at least one of the subject and the sub-subject when the learning input sentence is input based on a learning data set including at least one of a subject and a sub-subject labeled in the learning input sentence.

챗봇의 구동을 위한 이하의 동작들을 수행하도록 하는 컴퓨터 프로그램 동작들의 대안적인 실시예에서 상기 제 1 입력 문장의 문장 완성도를 판단하는 동작은, 상기 제 1 입력 문장을 전처리 모델을 통해 연산함으로써 제 1 입력 문장 벡터를 획득하는 동작; 상기 제 1 입력 문장 벡터가 상기 하나 이상의 완성 문장 벡터 중 적어도 하나와 유사도가 임계값 이상인 경우, 문장 완성도가 상기 사전결정된 기준보다 높은 것으로 판단하는 동작; 및 상기 제 1 입력 문장 벡터가 상기 하나 이상의 미완성 문장 벡터 중 적어도 하나와 유사도가 임계값 이상인 경우, 문장 완성도가 상기 사전결정된 기준보다 낮은 것으로 판단하는 동작을 포함할 수 있고, 상기 완성 문장 벡터는, 상기 하나 이상의 기준 문장 또는 기준 답변을 상기 전처리 모델을 통해 연산함으로써 획득되고, 그리고 상기 미완성 문장 벡터는, 하나 이상의 기준 문장 또는 기준 답변 각각을 임의의 길이로 분할한 분할된 기준 문장 또는 분할된 기준 답변을 상기 전처리 모델을 통해 연산함으로써 획득될 수 있다.In an alternative embodiment of the computer program operations to perform the following operations for driving the chatbot, the operation of determining the sentence completion of the first input sentence is the first input by calculating the first input sentence through a pre-processing model. Obtaining a sentence vector; Determining that sentence completion is higher than the predetermined criterion when the first input sentence vector has a similarity to a threshold value of at least one of the one or more completion sentence vectors; And when the first input sentence vector has a similarity to a threshold value of at least one of the one or more incomplete sentence vectors, determining that sentence completion is lower than the predetermined criterion. The one or more reference sentences or reference answers are obtained by calculating through the pre-processing model, and the incomplete sentence vector is divided into one or more reference sentences or reference answers, each of which is divided into arbitrary lengths or divided reference answers Can be obtained by calculating through the pre-processing model.

챗봇의 구동을 위한 이하의 동작들을 수행하도록 하는 컴퓨터 프로그램 동작들의 대안적인 실시예에서 상기 제 1 입력 문장을 문장 주제 판단 모델을 통해 연산하여 상기 제 1 입력 문장에 대한 주제 및 서브 주제 중 적어도 하나를 획득하는 동작; 및 상기 사용자 단말로부터 상기 제 1 입력 문장 이후에 수신한 하나 이상의 입력 문장 중 상기 획득된 주제 및 서브 주제 중 적어도 하나에 기초하여 제 2 입력 문장을 결정하는 동작을 더 포함할 수 있다.In an alternative embodiment of computer program operations to perform the following operations for driving the chatbot, the first input sentence is computed through a sentence subject judgment model to at least one of the subject and sub-themes for the first input sentence. Obtaining an action; And determining a second input sentence based on at least one of the obtained subject and sub-subjects from one or more input sentences received after the first input sentence from the user terminal.

챗봇의 구동을 위한 이하의 동작들을 수행하도록 하는 컴퓨터 프로그램 동작들의 대안적인 실시예에서 상기 제 2 입력 문장은, 상기 제 1 입력 문장 이후에 수신된 하나 이상의 입력 문장 중 문장 기호를 포함하는 입력 문장일 수 있다.In an alternative embodiment of computer program operations to perform the following operations for driving the chatbot, the second input sentence is an input sentence including a sentence symbol among one or more input sentences received after the first input sentence You can.

챗봇의 구동을 위한 이하의 동작들을 수행하도록 하는 컴퓨터 프로그램 동작들의 대안적인 실시예에서 상기 제 3 입력 문장의 문장 완성도가 상기 사전결정된 기준보다 높은 것으로 판단되는 경우, 상기 제 3 입력 문장에 기초한 답변을 상기 사용자 단말로 전송하는 동작은, 상기 제 3 입력 문장을 하나 이상의 네트워크 함수를 포함하는 전처리 모델을 통해 연산함으로써 입력 문장 벡터를 획득하는 동작; 상기 입력 문장 벡터와 상기 하나 이상의 기준 문장 벡터 각각에 대한 유사도를 연산하는 동작; 상기 유사도가 임계 값 이상인 기준 문장이 있는 경우, 상기 유사도가 임계 값 이상인 기준 문장에 기초한 답변을 상기 사용자 단말로 전송하는 동작; 및 상기 유사도가 임계 값 이상인 기준 문장이 없는 경우, 미분류 문장 표준 답변을 상기 사용자 단말로 전송하는 동작을 포함할 수 있다.In an alternative embodiment of computer program operations to perform the following operations for driving the chatbot, when it is determined that the sentence completion of the third input sentence is higher than the predetermined criterion, an answer based on the third input sentence is answered. The transmitting to the user terminal may include: obtaining an input sentence vector by calculating the third input sentence through a pre-processing model including one or more network functions; Calculating similarity for each of the input sentence vector and the one or more reference sentence vectors; When there is a reference sentence having the similarity of a threshold value or higher, transmitting an answer based on the reference sentence having the similarity of a threshold value or higher to the user terminal; And when there is no reference sentence having a similarity or higher than a threshold, transmitting an unclassified sentence standard answer to the user terminal.

챗봇의 구동을 위한 이하의 동작들을 수행하도록 하는 컴퓨터 프로그램 동작들의 대안적인 실시예에서, 본 개시의 다른 일 실시예에 따라 챗봇의 구동 방법이 개시된다. 상기 방법은, 사용자 단말로부터 제 1 입력 문장을 수신하는 단계; 상기 제 1 입력 문장의 문장 완성도가 사전결정된 기준보다 낮은 것으로 판단되는 경우, 미완성 문장 표준 답변을 상기 사용자 단말로 전송하는 단계; 상기 사용자 단말로부터 제 2 입력 문장을 수신하는 단계; 상기 제 1 입력 문장 및 상기 제 2 입력 문장을 결합한 제 3 입력 문장을 생성하는 단계; 및 상기 제 3 입력 문장의 문장 완성도가 상기 사전결정된 기준보다 높은 것으로 판단되는 경우, 상기 제 3 입력 문장에 기초한 답변을 상기 사용자 단말로 전송하는 단계를 포함할 수 있고, 상기 제 2 입력 문장은 상기 제 1 입력 문장 이후에 상기 사용자 단말로부터 수신된 입력 문장일 수 있다.In an alternative embodiment of computer program operations to perform the following operations for driving the chatbot, a method of driving the chatbot according to another embodiment of the present disclosure is disclosed. The method comprises: receiving a first input sentence from a user terminal; When it is determined that the sentence completion level of the first input sentence is lower than a predetermined criterion, transmitting an incomplete sentence standard answer to the user terminal; Receiving a second input sentence from the user terminal; Generating a third input sentence combining the first input sentence and the second input sentence; And when it is determined that the sentence completion degree of the third input sentence is higher than the predetermined criterion, transmitting an answer based on the third input sentence to the user terminal, wherein the second input sentence is the It may be an input sentence received from the user terminal after the first input sentence.

본 개시의 다른 일 실시예에 따라 컴퓨팅 장치가 개시된다. 상기 컴퓨팅 장치는, 하나 이상의 코어를 포함하는 프로세서; 사용자 단말로부터 제 1 입력 문장 및 제 2 입력 문장 중 적어도 하나를 수신하고, 그리고 상기 사용자 단말에 상기 프로세서에 의하여 결정되는 답변을 전송하는 네트워크부; 및 메모리를 포함하고, 상기 프로세서는, 상기 제 1 입력 문장, 상기 제 2 입력 문장 및 상기 제 1 입력 문장과 상기 제 2 입력 문장을 결합한 제 3 입력 문장의 문장 완성도를 판단하고, 상기 제 1 입력 문장의 문장 완성도가 사전결정된 기준보다 낮은 것으로 판단되는 경우, 미완성 문장 표준 답변을 상기 사용자 단말로 전송할 것을 결정하고, 그리고 상기 제 3 입력 문장의 문장 완성도가 상기 사전결정된 기준보다 높은 것으로 판단되는 경우, 상기 제 3 입력 문장에 기초한 답변을 상기 사용자 단말로 전송할 것을 결정할 수 있다.A computing device is disclosed in accordance with another embodiment of the present disclosure. The computing device includes a processor including one or more cores; A network unit that receives at least one of a first input sentence and a second input sentence from a user terminal, and transmits an answer determined by the processor to the user terminal; And a memory, wherein the processor determines sentence completion degree of the first input sentence, the second input sentence, and the third input sentence combining the first input sentence and the second input sentence, and the first input If it is determined that the sentence completeness of a sentence is lower than a predetermined criterion, it is determined to transmit an incomplete sentence standard answer to the user terminal, and when it is determined that the sentence completeness of the third input sentence is higher than the predetermined criterion, It may be determined to transmit an answer based on the third input sentence to the user terminal.

본 개시는 챗봇의 구동 방법을 제공할 수 있다.The present disclosure can provide a method of driving a chatbot.

도 1은 본 개시의 일 실시예에 따른 챗봇의 구동을 위한 동작을 수행하는 컴퓨팅 장치의 블록 구성도를 도시한 도면이다.
도 2는 본 개시의 일 실시예에 따른 서버와 사용자 단말 간의 흐름도를 도시한 도면이다.
도 3은 본 개시의 일 실시예에 따른 챗봇의 동작 인터페이스의 예시를 도시한 도면이다.
도 4는 본 개시의 일 실시예에 따른 챗봇의 구동 방법의 순서도이다.
도 5는 본 개시의 일 실시예에 따른 챗봇의 구동 방법을 구현하기 위한 로직을 도시한 블록 구성도이다.
도 6은 본 개시의 일 실시예에 따른 컴퓨팅 장치의 블록 구성도이다.1 is a block diagram of a computing device performing an operation for driving a chatbot according to an embodiment of the present disclosure.
2 is a diagram illustrating a flowchart between a server and a user terminal according to an embodiment of the present disclosure.
3 is a diagram illustrating an example of an operation interface of a chatbot according to an embodiment of the present disclosure.
4 is a flowchart of a method for driving a chatbot according to an embodiment of the present disclosure.
5 is a block diagram illustrating logic for implementing a method of driving a chatbot according to an embodiment of the present disclosure.
6 is a block diagram of a computing device according to an embodiment of the present disclosure.

다양한 실시예들이 이제 도면을 참조하여 설명되며, 전체 도면에서 걸쳐 유사한 도면번호는 유사한 구성요소를 나타내기 위해서 사용된다. 본 명세서에서, 다양한 설명들이 본 개시의 이해를 제공하기 위해서 제시된다. 그러나 이러한 실시예들은 이러한 구체적인 설명 없이도 실행될 수 있음이 명백하다. 다른 예들에서, 공지된 구조 및 장치들은 실시예들의 설명을 용이하게 하기 위해서 블록 다이어그램 형태로 제공된다.Various embodiments are now described with reference to the drawings, and like reference numbers throughout the drawings are used to indicate similar elements. In this specification, various descriptions are presented to provide an understanding of the present disclosure. However, it is clear that these embodiments can be practiced without these specific details. In other examples, well-known structures and devices are provided in block diagram form in order to facilitate describing the embodiments.

본 명세서에서 사용되는 용어 "컴포넌트", "모듈", "시스템" 등은 컴퓨터-관련 엔티티, 하드웨어, 펌웨어, 소프트웨어, 소프트웨어 및 하드웨어의 조합, 또는 소프트웨어의 실행을 지칭한다. 예를 들어, 컴포넌트는 프로세서상에서 실행되는 처리과정, 프로세서, 객체, 실행 스레드, 프로그램, 및/또는 컴퓨터일 수 있지만, 이들로 제한되는 것은 아니다. 예를 들어, 컴퓨팅 장치에서 실행되는 애플리케이션 및 컴퓨팅 장치 모두 컴포넌트일 수 있다. 하나 이상의 컴포넌트는 프로세서 및/또는 실행 스레드 내에 상주할 수 있고, 일 컴포넌트는 하나의 컴퓨터 내에 로컬화될 수 있고, 또는 2개 이상의 컴퓨터들 사이에 분배될 수 있다. 또한, 이러한 컴포넌트들은 그 내부에 저장된 다양한 데이터 구조들을 갖는 다양한 컴퓨터 판독가능한 매체로부터 실행할 수 있다. 컴포넌트들은 예를 들어 하나 이상의 데이터 패킷들을 갖는 신호(예를 들면, 로컬 시스템, 분산 시스템에서 다른 컴포넌트와 상호작용하는 하나의 컴포넌트로부터 데이터 및/또는 신호를 통해 다른 시스템과 인터넷과 같은 네트워크를 통한 데이터)에 따라 로컬 및/또는 원격 처리들을 통해 통신할 수 있다. As used herein, the terms “component”, “module”, “system” and the like refer to computer-related entities, hardware, firmware, software, combinations of software and hardware, or execution of software. For example, a component can be, but is not limited to, a process running on a processor, a processor, an object, a thread of execution, a program, and / or a computer. For example, both the application and the computing device running on the computing device can be components. One or more components can reside within a processor and / or thread of execution, and one component can be localized within one computer, or distributed between two or more computers. In addition, these components can execute from various computer readable media having various data structures stored therein. The components are, for example, signals having one or more data packets (e.g., data from one component interacting with another component in a local system, distributed system, and / or data via another system and a network such as the Internet via a signal). ) Through local and / or remote processes.

제시된 실시예들에 대한 설명은 본 개시의 기술 분야에서 통상의 지식을 가진 자가 본 개시를 이용하거나 또는 실시할 수 있도록 제공된다. 이러한 실시예들에 대한 다양한 변형들은 본 개시의 기술 분야에서 통상의 지식을 가진 자에게 명백할 것이며, 여기에 정의된 일반적인 원리들은 본 개시의 범위를 벗어남이 없이 다른 실시예들에 적용될 수 있다. 그리하여, 본 개시는 여기에 제시된 실시예들로 한정되는 것이 아니라, 여기에 제시된 원리들 및 신규한 특징들과 일관되는 최광의의 범위에서 해석되어야 할 것이다. Descriptions of the presented embodiments are provided to enable any person skilled in the art to use or practice the present disclosure. Various modifications to these embodiments will be apparent to those skilled in the art of the present disclosure, and the general principles defined herein can be applied to other embodiments without departing from the scope of the present disclosure. Thus, the present disclosure should not be limited to the embodiments presented herein, but should be interpreted in the broadest scope consistent with the principles and novel features presented herein.

본 개시의 일 실시예에서 서버는 서버의 서버 환경을 수행하기 위한 다른 구성들이 포함될 수도 있다. 서버는 임의의 형태의 장치는 모두 포함할 수 있다. 서버는 디지털 기기로서, 랩탑 컴퓨터, 노트북 컴퓨터, 데스크톱 컴퓨터, 웹 패드, 이동 전화기와 같이 프로세서를 탑재하고 메모리를 구비한 연산 능력을 갖춘 디지털 기기일 수 있다. 서버는 서비스를 처리하는 웹 서버일 수 있다. 전술한 서버의 종류는 예시일 뿐이며 본 개시는 이에 제한되지 않는다. In one embodiment of the present disclosure, the server may include other configurations for performing the server environment of the server. The server can include any type of device. The server is a digital device, and may be a digital device equipped with a computing power equipped with a processor and having a memory such as a laptop computer, a notebook computer, a desktop computer, a web pad, and a mobile phone. The server may be a web server that processes services. The above-mentioned types of servers are only examples, and the present disclosure is not limited thereto.

본 명세서에 걸쳐, 모델, 신경망, 네트워크 함수, 뉴럴 네트워크(neural network)는 동일한 의미로 사용될 수 있다. 신경망은 일반적으로 "노드"라 지칭될 수 있는 상호 연결된 계산 단위들의 집합으로 구성될 수 있다. 이러한 "노드"들은 "뉴런(neuron)"들로 지칭될 수도 있다. 신경망은 적어도 하나 이상의 노드들을 포함하여 구성된다. 신경망들을 구성하는 노드(또는 뉴런)들은 하나 이상의 "링크"에 의해 상호 연결될 수 있다.Throughout this specification, models, neural networks, network functions, and neural networks may be used in the same sense. A neural network may consist of a set of interconnected computational units, which may generally be referred to as "nodes." These “nodes” may also be referred to as “neurons”. The neural network comprises at least one node. The nodes (or neurons) that make up the neural networks can be interconnected by one or more “links”.

본 명세서에 걸쳐, 모델은 하나 이상의 네트워크 함수를 포함하는 의미로 사용될 수 있다. 모델은 다른 모델과 함께 데이터의 처리를 수행할 수 있다. 모델은 다른 모델과 직렬 또는 병렬로 상호 연결될 수 있다.Throughout this specification, a model can be used to mean including one or more network functions. The model can perform data processing along with other models. The models can be interconnected in series or in parallel with other models.

도 1은 본 개시의 일 실시예에 따른 챗봇의 구동을 위한 동작을 수행하는 컴퓨팅 장치의 블록 구성도를 도시한 도면이다.1 is a block diagram of a computing device performing an operation for driving a chatbot according to an embodiment of the present disclosure.

본 개시의 일 실시예에 따른 챗봇의 구동 방법을 위한 컴퓨팅 장치(100)는 챗봇의 상담을 이용하는 사용자의 입력에 기초하여 정확한 답변을 제공하기 위해 사용될 수 있다. 답변은, 사용자의 입력에 기초하여 결정된 금융 상품에 관련한 상담, 문의 또는 고객 서비스 중 적어도 하나에 대한 답변일 수 있다.The computing device 100 for a method of driving a chatbot according to an embodiment of the present disclosure may be used to provide an accurate answer based on a user input using a chatbot consultation. The answer may be an answer to at least one of consultation, inquiry, or customer service related to the financial product determined based on user input.

본 개시의 일 실시예에 따른 챗봇의 구동 방법을 위한 컴퓨팅 장치(100)는 네트워크부(110), 프로세서(120) 및 메모리(130)를 포함할 수 있다.The computing device 100 for a method of driving a chatbot according to an embodiment of the present disclosure may include a network unit 110, a processor 120, and a memory 130.

네트워크부(110)는 사용자 단말(200)로부터 하나 이상의 단어를 포함하는 입력 문장을 수신할 수 있다. 여기서 입력 문장은 컴퓨팅 장치(100)가 사용자 단말(200)로부터 수신하는 문장을 포함할 수 있다. 입력 문장은, 컴퓨팅 장치(100)가 사용자 단말(200)로부터 수신하는 첫번째 문장일 수도 있고, 첫번째 문장 이후의 문장일 수도 있다. 또한, 상기 입력 문장은 컴퓨팅 장치(100)가 사용자 단말(200)에 대해 임의의 동작을 수행하기 전까지 수신되는 하나 이상의 문장일 수 있다. 예를 들어, 네트워크부(110)는 사용자 단말(200)을 통해 수신되는 첫 입력 문장으로 "안녕 챗봇아"를 수신할 수 있고, 이후 "보험 상담을 하고 싶어"라는 두번째 입력 문장을 수신할 수 있다. 여기서 컴퓨팅 장치(100)가 상기 첫 입력 문장 및 두번째 입력 문장에 대한 답변, 또는 결과값 출력 동작을 수행하지 않은 경우, 상기 첫 입력 문장 및 두번째 입력 문장을 하나의 입력 문장으로 판단할 수 있다. 전술한 입력 문장의 자세한 기재는 예시일 뿐, 본 개시는 이에 제한되지 않는다.The network unit 110 may receive an input sentence including one or more words from the user terminal 200. Here, the input sentence may include a sentence received by the computing device 100 from the user terminal 200. The input sentence may be the first sentence that the computing device 100 receives from the user terminal 200 or may be a sentence after the first sentence. Further, the input sentence may be one or more sentences received until the computing device 100 performs an arbitrary operation on the user terminal 200. For example, the network unit 110 may receive the “Hello Chatbot” as the first input sentence received through the user terminal 200, and then receive the second input sentence “I want to have insurance consultation”. have. Here, when the computing device 100 does not perform the response to the first input sentence and the second input sentence or outputting a result value, the first input sentence and the second input sentence may be determined as one input sentence. The detailed description of the aforementioned input sentence is only an example, and the present disclosure is not limited thereto.

네트워크부(110)는 답변을 사용자 단말(200)로 송신할 수 있다. 답변은 사용자 단말(200)로부터 수신한 입력 문장에 기초하여 결정된 답변일 수 있다. 답변은 사용자 단말(200)로부터 수신한 하나 이상의 입력 문장을 결합한 입력 문장에 기초하여 결정된 답변일 수 있다. 보다 구체적으로, 네트워크부(110)는 프로세서(120)가 생성한 답변을 획득할 수 있다. 추가적으로, 네트워크부(110)는 금융사 컴퓨팅 장치 그룹 중 적어도 하나로부터 답변을 획득할 수도 있다. 그리고 네트워크부(110)는 답변을 사용자 단말(200)과 연결되어 있는 챗봇을 통해 인터페이스와 함께 사용자 단말(200)로 송신할 수 있다. 상기 과정에서 네트워크부(110)는 유/무선 네트워크를 통하여 사용자 단말(200)과 통신할 수 있다.The network unit 110 may transmit an answer to the user terminal 200. The answer may be an answer determined based on the input sentence received from the user terminal 200. The answer may be an answer determined based on an input sentence combining one or more input sentences received from the user terminal 200. More specifically, the network unit 110 may obtain an answer generated by the processor 120. Additionally, the network unit 110 may obtain an answer from at least one of the financial company computing device groups. In addition, the network unit 110 may transmit the response to the user terminal 200 together with the interface through a chatbot connected to the user terminal 200. In the above process, the network unit 110 may communicate with the user terminal 200 through a wired / wireless network.

추가적으로, 네트워크부(110)는 메모리(130)상에 저장된 입력 문장에 매칭되는 답변을 사용자 단말(200)로 전송할 수 있다.Additionally, the network unit 110 may transmit an answer matching the input sentence stored in the memory 130 to the user terminal 200.

사용자 단말(200)은 PC(personal computer), 노트북(note book), 모바일 단말기(mobile terminal), 스마트 폰(smart phone), 태블릿 PC(tablet pc), VR(Virtual Reality) 기기 등을 포함할 수 있으며, 유/무선 네트워크에 접속할 수 있는 모든 종류의 단말을 포함할 수 있다. 또한, 네트워크부(110)는 네트워크 접속을 위한 유/무선 인터넷 모듈을 포함할 수 있다. 무선 인터넷 기술로는 WLAN(Wireless LAN)(Wi-Fi), Wibro(Wireless broadband), Wimax(World Interoperability for Microwave Access), HSDPA(High Speed Downlink Packet Access) 등이 이용될 수 있다. 유선 인터넷 기술로는 XDSL(Digital Subscriber Line), FTTH(Fibers to the home), PLC(Power Line Communication) 등이 이용될 수 있다.The user terminal 200 may include a personal computer (PC), a notebook (note book), a mobile terminal (smart phone), a smart phone (smart phone), a tablet PC (tablet pc), VR (Virtual Reality) device, etc. It can include all types of terminals that can access the wired / wireless network. In addition, the network unit 110 may include a wired / wireless Internet module for network access. Wireless Internet technology (Wireless LAN) (Wi-Fi), Wibro (Wireless broadband), Wimax (World Interoperability for Microwave Access), HSDPA (High Speed Downlink Packet Access) may be used. As a wired Internet technology, digital subscriber line (XDSL), fibers to the home (FTTH), and power line communication (PLC) may be used.

그리고, 네트워크부(110)는 근거리 통신 모듈을 포함하여, 비교적 근거리에 위치하고 근거리 통신 모듈을 포함한 전자 장치(예컨대, 외부 단말)와 데이터를 송수신할 수 있다. 근거리 통신(short range communication) 기술로 블루투스(Bluetooth), RFID(Radio Frequency Identification), 적외선 통신(IrDA, infrared Data Association), UWB(Ultra Wideband), ZigBee 등이 이용될 수 있다.In addition, the network unit 110 includes a short-range communication module, and is located at a relatively short distance and transmits and receives data to and from an electronic device (eg, an external terminal) including the short-range communication module. Bluetooth, radio frequency identification (RFID), infrared data association (IrDA), ultra wideband (UWB), ZigBee, etc. may be used as short range communication technology.

프로세서(120)는 하나 이상의 코어로 구성될 수 있으며, 컴퓨팅 장치의 중앙 처리 장치(CPU: central processing unit), 범용 그래픽 처리 장치 (GPGPU: general purpose graphics processing unit), 텐서 처리 장치(TPU: tensor processing unit) 등의 챗봇의 구동을 위한 프로세서를 포함할 수 있다. 프로세서(120)는 메모리(130)에 저장된 컴퓨터 프로그램을 판독하여 본 개시의 일 실시예에 따른 챗봇의 구동 방법을 수행할 수 있다. 본 개시의 일 실시예에 따라 프로세서(120)는 챗봇의 구동 방법을 위한 계산을 수행할 수 있다.The processor 120 may include one or more cores, a central processing unit (CPU) of a computing device, a general purpose graphics processing unit (GPGPU), and a tensor processing unit (TPU). unit), and a processor for driving a chatbot. The processor 120 may read a computer program stored in the memory 130 and perform a method of driving a chatbot according to an embodiment of the present disclosure. According to an embodiment of the present disclosure, the processor 120 may perform calculation for a method of driving a chatbot.

프로세서(120)의 CPU, GPGPU, 및 TPU 중 적어도 하나가 챗봇의 구동 방법을 위한 입력 문장에 대한 문장 완성도 연산, 입력 문장 및 기준 문장에 대한 유사도 연산 등을 처리할 수 있다. 예를 들어, CPU 와 GPGPU가 함께 챗봇의 구동 방법을 위하여 하나 이상의 입력 문장을 결합한 문장에 대한 문장 완성도 연산 등을 처리할 수 있다. 또한, 본 개시의 일 실시예에서 복수의 컴퓨팅 장치의 프로세서를 함께 사용하여 챗봇의 구동 방법을 위한 스코어링 값에 기초한 유사도 판단 등을 처리할 수 있다. 또한, 본 개시의 일 실시예에 따른 컴퓨팅 장치에서 수행되는 컴퓨터 프로그램은 CPU, GPGPU 또는 TPU 실행가능 프로그램일 수 있다.At least one of the CPU, GPGPU, and TPU of the processor 120 may process a sentence completeness operation for an input sentence for a method of driving a chatbot, and a similarity operation for an input sentence and a reference sentence. For example, for the method of driving the chatbot, the CPU and GPGPU may process a sentence completion operation for a sentence in which one or more input sentences are combined. In addition, in an embodiment of the present disclosure, similarity determination based on a scoring value for a method of driving a chatbot may be processed by using processors of a plurality of computing devices together. Further, the computer program executed in the computing device according to an embodiment of the present disclosure may be a CPU, GPGPU, or TPU executable program.

프로세서(120)는 사용자 단말(200)로부터 입력 문장을 수신할 수 있다. 프로세서(120)는 사용자 단말(200)로부터 수신한 입력 문장에 대하여 문장 완성도를 판단할 수 있다. 프로세서(120)는 사용자 단말(200)로부터 입력 문장을 수신할 때 마다, 각각의 입력 문장에 대하여 문장 완성도를 판단할 수 있다. 예를 들어, 프로세서(120)는 네트워크부(110)를 통해 제 1 입력 문장을 수신하는 경우, 제 1 입력 문장에 대한 문장 완성도를 판단할 수 있다. 예를 들어, 프로세서(120) 네트워크부(110)를 통해 제 1 입력 문장을 수신한 이후에 제 2 입력 문장을 수신하는 경우, 제 2 입력 문장에 대한 문장 완성도를 판단할 수 있다. 프로세서(120)는 사용자 단말(200)로부터 수신한 하나 이상의 입력 문장에 대하여 하나 이상의 입력 문장을 결합하여 문장 완성도를 판단할 수 있다. 문장 완성도는, 사용자의 질의의 목적을 파악할 수 있는지 여부를 나타내는 척도일 수 있다. 문장 완성도는, 사용자 단말(200)로부터 수신한 입력 문장에 기초하여 답변을 할 수 있는 정도의 문장을 수신하였는지 여부를 의미할 수 있다.The processor 120 may receive an input sentence from the user terminal 200. The processor 120 may determine sentence completion with respect to the input sentence received from the user terminal 200. Whenever the input sentence is received from the user terminal 200, the processor 120 may determine sentence completion for each input sentence. For example, when the first input sentence is received through the network unit 110, the processor 120 may determine sentence completion for the first input sentence. For example, when the second input sentence is received after the first input sentence is received through the network unit 110 of the processor 120, the sentence completeness of the second input sentence may be determined. The processor 120 may combine one or more input sentences with one or more input sentences received from the user terminal 200 to determine sentence completion. The sentence completion level may be a measure indicating whether a user's query purpose can be grasped. The sentence completion level may indicate whether a sentence of a degree capable of answering is received based on an input sentence received from the user terminal 200.

본 개시의 일 실시예에 따라, 프로세서(120)는 사용자 단말(200)로부터 수신한 입력 문장에 대하여 문장 완성도를 판단하고, 해당 입력 문장의 문장 완성도가 사전 결정된 기준보다 낮은 경우, 해당 입력 문장을 메모리(130)에 저장하고, 해당 입력 문장 이후에 사용자 단말(200)로부터 수신된 다른 입력 문장과 메모리(130)에 저장했던 입력 문장을 결합하여 문장 완성도를 판단할 수 있다. 예를 들어, 프로세서(120)는 사용자 단말(200)로부터 수신한 제 1 입력 문장에 대한 문장 완성도 판단 결과, 제 1 입력 문장에 대한 문장 완성도가 사전결정된 기준보다 낮다고 판단되는 경우, 프로세서(120)는 제 1 입력 문장을 메모리(130)에 저장할 수 있다. 프로세서(120)는 제 1 입력 문장 이후에 수신되는 제 2 입력 문장을, 메모리(130)에 저장된 제 1 입력 문장과 결합하여, 제 3 입력 문장을 생성하고, 제 3 입력 문장에 대한 문장 완성도를 판단할 수 있다.According to an embodiment of the present disclosure, the processor 120 determines sentence completion with respect to the input sentence received from the user terminal 200, and if the sentence completion degree of the corresponding input sentence is lower than a predetermined criterion, the processor 120 The completion of the sentence may be determined by combining the other input sentence stored in the memory 130 with the other input sentence received from the user terminal 200 and the input sentence stored in the memory 130 after the corresponding input sentence. For example, when the processor 120 determines that the sentence completeness of the first input sentence received from the user terminal 200 is lower than the predetermined criterion, the processor 120 determines that the processor 120 is the processor 120. May store the first input sentence in the memory 130. The processor 120 combines the second input sentence received after the first input sentence with the first input sentence stored in the memory 130 to generate a third input sentence, and completes sentence completion for the third input sentence. I can judge.

본 개시의 일 실시예에 따라, 프로세서(120)는 사용자 단말(200)로부터 수신한 입력 문장에 대하여 문장 완성도를 판단하고, 해당 입력 문장의 문장 완성도가 사전 결정된 기준보다 낮은 경우, 해당 입력 문장 전에 수신한 입력 문장과 결합하여, 문장 완성도를 다시 판단할 수 있다. 예를 들어, 프로세서(120)는 사용자 단말(200)로부터 제 2 입력 문장을 수신하고, 제 2 입력 문장에 대한 문자 완성도를 판단할 수 있다. 프로세서(120)는 제 2 입력 문장에 대한 문장 완성도가 사전결정된 기준보다 낮다고 판단하는 경우, 제 2 입력 문장 이전에 사용자 단말(200)로부터 수신한 제 1 입력 문장과 제 2 입력 문장을 결합하여 제 3 입력 문장을 생성하고, 제 3 입력 문장에 대한 문장 완성도를 판단할 수 있다.According to an embodiment of the present disclosure, the processor 120 determines the sentence completion level with respect to the input sentence received from the user terminal 200, and if the sentence completion level of the corresponding input sentence is lower than a predetermined criterion, before the corresponding input sentence In combination with the received input sentence, the sentence completion can be determined again. For example, the processor 120 may receive the second input sentence from the user terminal 200 and determine the character completeness of the second input sentence. When the processor 120 determines that the sentence completion level for the second input sentence is lower than a predetermined criterion, the processor 120 combines the first input sentence and the second input sentence received from the user terminal 200 before the second input sentence. It is possible to generate 3 input sentences and determine sentence completion for the 3rd input sentence.

문장 완성도가 사전결정된 기준 이상인 경우, 사용자 단말(200)로부터 수신한 입력 문장에 기초하여 상기 사용자의 질의의 목적을 파악할 수 있는 경우 일 수 있다. 문장 완성도가 사전결정된 기준 미만인 경우, 사용자 단말(200)로부터 수신한 입력 문장에 기초하여 상기 사용자의 질의의 목적을 파악할 수 없어, 사용자의 질의의 목적을 파악하기 위해서는 추가적인 입력 문장을 사용자 단말(200)로부터 수신할 필요가 있는 경우일 수 있다.When the sentence completeness is higher than a predetermined criterion, it may be a case in which the purpose of the user's query can be grasped based on the input sentence received from the user terminal 200. When the sentence completion is less than a predetermined criterion, the purpose of the user's query cannot be grasped based on the input sentence received from the user terminal 200. ).

프로세서(120)는 사용자 단말(200)로부터 수신한 입력 문장에 대하여 문장 완성도를 판단하고, 상기 문장 완성도가 사전결정된 기준보다 낮은 것으로 판단되는 경우, 미완성 문장 표준 답변을 상기 사용자 단말(200)로 전송할 수 있다. 미완성 문장 표준 답변은, 사용자 단말(200)로부터 수신한 입력 문장이 완성되지 않은 문장이라고 판단한 것에 기초하여, 사용자 단말(200)로부터 추가적인 입력 문장을 수신하기 위한 대기 답변을 의미할 수 있다. 프로세서(120)는 미완성 문장 표준 답변을 사용자 단말(200)로 전송하는 경우, 입력 문장에 기초한 답변을 전송하기 위한 연산을 수행하지 않을 수 있다. 예를 들어, 미완성 표준 문장 답변은 “네~ 고객님”, “네~”, “예” 등을 포함할 수 있다. 프로세서(120)는 메모리(130)에 저장된 하나 이상의 미완성 표준 문장 답변 중 하나의 미완성 표준 문장 답변을 사용자 단말(200)에 네트워크부(110)를 통해 전송할 수 있다. 전술한 미완성 표준 문장 답변에 대한 기재는 예시일 뿐이며 본 개시는 이에 제한되지 않는다.The processor 120 determines the sentence completion level with respect to the input sentence received from the user terminal 200, and when it is determined that the sentence completion level is lower than a predetermined criterion, transmits an incomplete sentence standard answer to the user terminal 200 You can. The incomplete sentence standard answer may mean a waiting response for receiving an additional input sentence from the user terminal 200 based on the determination that the input sentence received from the user terminal 200 is an incomplete sentence. When the incomplete sentence standard answer is transmitted to the user terminal 200, the processor 120 may not perform an operation for transmitting an answer based on the input sentence. For example, an uncompleted standard sentence answer may include “yes ~ customer”, “yes ~”, “yes”, and the like. The processor 120 may transmit one unfinished standard sentence answer of one or more incomplete standard sentence answers stored in the memory 130 to the user terminal 200 through the network unit 110. The description of the above-mentioned incomplete standard sentence answer is merely an example, and the present disclosure is not limited thereto.

프로세서(120)는 사용자 단말(200)로부터 수신한 제 1 입력 문장의 문장 완성도가 사전결정된 기준보다 낮은 것으로 판단되는 경우, 미완성 문장 표준 답변을 상기 사용자 단말로 전송할 수 있다. 제 1 입력 문장은, 사용자 단말(200)로부터 수신한 첫번째 입력 문장일 수 있다. 제 1 입력 문장은, 사용자 단말(200)로부터 수신한 하나 이상의 입력 문장 중 해당 입력 문장에 기초한 답변을 사용자 단말(200)로 전송하지 않은 문장 중 가장 먼저 수신한 입력 문장일 수 있다. 프로세서(120)는 네트워크부(110)를 통해 사용자 단말로부터 제 2 입력 문장을 수신할 수 있다. 제 2 입력 문장은 제 1 입력 문장 이후에 사용자 단말(200)로부터 수신된 입력 문장일 수 있다. 프로세서(120)는 제 2 입력 문장에 대한 문장 완성도를 판단할 수 있다. 프로세서(120)는 제 2 입력 문장에 대한 문장 완성도가 사전결정된 기준보다 낮은 것으로 판단하는 경우, 프로세서(120)는 제 1 입력 문장 및 제 2 입력 문장을 결합한 제 3 입력 문장을 생성할 수 있다. 프로세서(120)는 제 3 입력 문장의 문장 완성도를 판단할 수 있다.When it is determined that the sentence completion level of the first input sentence received from the user terminal 200 is lower than a predetermined criterion, the processor 120 may transmit an incomplete sentence standard answer to the user terminal. The first input sentence may be the first input sentence received from the user terminal 200. The first input sentence may be an input sentence received first among sentences that do not transmit an answer based on the corresponding input sentence among the one or more input sentences received from the user terminal 200 to the user terminal 200. The processor 120 may receive the second input sentence from the user terminal through the network unit 110. The second input sentence may be an input sentence received from the user terminal 200 after the first input sentence. The processor 120 may determine sentence completion for the second input sentence. When the processor 120 determines that sentence completion for the second input sentence is lower than a predetermined criterion, the processor 120 may generate a third input sentence combining the first input sentence and the second input sentence. The processor 120 may determine sentence completion of the third input sentence.

제 2 입력 문장은, 제 1 입력 문장을 상기 사용자 단말(200)로부터 수신한 시점부터 사전결정된 시간 범위 이내에 수신된 입력 문장일 수 있다. 프로세서(120)는 제 1 입력 문장을 사용자 단말(200)로부터 수신한 시점에서 사전결정된 시간 범위 이후에 수신한 입력 문장은, 제 1 입력 문장과 다른 주제(예를 들어, 각각의 입력 문장이 약관 대출에 관한 주제, 암 보험 가입에 관한 주제로 상이한 주제)일 수도 있다고 판단하여, 해당 입력 문장은 제 2 입력 문장에서 제외할 수 있다. 예를 들어, 프로세서(120)가 네트워크부(110)를 통해 제 1 입력 문장인 “암 보험은”에 대한 입력 문장을 사용자 단말(200)로부터 수신하고, 사전결정된 시간인 30분 후에 “건강검진을 받아야”라는 입력 문장을 수신하는 경우, 상기 30분 후에 수신한 입력 문장은 제 2 입력 문장으로 결정하지 않을 수 있다. 제 2 입력 문장은, 제 1 입력 문장을 사용자 단말(200)로부터 수신한 이후 사용자 단말(200)로부터 수신되는 사전결정된 개수 범위 내의 입력 문장일 수 있다. 프로세서(120)는, 제 1 입력 문장을 수신한 후 사전 결정된 입력 문장의 개수 범위 이후에 사용자 단말(200)로부터 수신되는 입력 문장에 관하여는, 제 2 입력 문장으로 결정하지 않을 수 있다. 예를 들어, 프로세서(120)가 제 1 입력 문장을 수신한 이후에 10개의 입력 문장을 사용자 단말(200)로부터 추가로 수신하는 경우, 그 이후의 입력 문장에 관하여는 제 2 입력 문장으로 결정하지 않을 수 있다. 전술한 제 2 입력 문장에 관한 구체적인 기재는 예시일 뿐이며 본 개시는 이에 제한되지 않는다. 제 2 입력 문장으로 결정한 입력 문장에 대해서는, 제 1 입력 문장과 결합하여 문장 완성도를 판단하는데, 제 1 입력 문장 이후에 수신되는 모든 입력 문장에 대해서 제 1 입력 문장과 결합하여 문장 완성도를 판단하게 된다면, 프로세서(120)의 연산에 부담을 줄 수 있으므로, 제 2 입력 문장 범위에 대한 제한을 둠으로써 컴퓨팅 장치의 처리 성능 및 속도를 향상시키기 위함이다.The second input sentence may be an input sentence received within a predetermined time range from the time when the first input sentence is received from the user terminal 200. The processor 120 receives the first input sentence from the user terminal 200 at a point in time after the predetermined time range, and the input sentence is different from the first input sentence (for example, each input sentence is in terms of terms and conditions) It may be determined that the topic may be a loan topic, a topic related to cancer insurance, or a different topic), and the corresponding input sentence may be excluded from the second input sentence. For example, the processor 120 receives an input sentence for “cancer insurance”, which is the first input sentence, from the user terminal 200 through the network unit 110, and after 30 minutes which is a predetermined time, a “health checkup” is performed. In case of receiving the input sentence “to be received”, the input sentence received after the 30 minutes may not be determined as the second input sentence. The second input sentence may be an input sentence within a predetermined number range received from the user terminal 200 after receiving the first input sentence from the user terminal 200. The processor 120 may not determine the input sentence received from the user terminal 200 after the predetermined number of input sentences after receiving the first input sentence as the second input sentence. For example, if the processor 120 additionally receives 10 input sentences from the user terminal 200 after receiving the first input sentence, the subsequent input sentence is not determined as the second input sentence. It may not. The specific description of the second input sentence described above is merely an example, and the present disclosure is not limited thereto. For the input sentence determined as the second input sentence, the sentence completion is determined by combining with the first input sentence. If all input sentences received after the first input sentence are combined with the first input sentence, the sentence completeness is determined. In order to improve the processing performance and speed of the computing device by limiting the range of the second input sentence, since it may burden the operation of the processor 120.

형태소 분석은 사용자 단말(200)로부터 수신한 입력 문장을 형태소 사전에 기초하여 입력 문장의 형태소를 분석하는 것일 수 있다. 프로세서(120)는 하나 이상의 단어를 포함하는 입력 문장을 의미를 가질 수 있는 최소 단위로 쪼갤 수 있다. 프로세서(120)는 쪼개진 입력 문장을 형태소 사전이 포함하는 각각의 품사와 매칭시킬 수 있다. 프로세서(120)는 메모리(130)에 저장된 형태소 사전에 기초하여 입력 문장을 형태소 단위로 분석하고 품사 태깅(Tagging)을 수행할 수 있다. 여기서, 형태소 단위는 일반 명사/NNG, 주격 조사/JKS, 형용사/VA, 종결어미/EF, 마침표, 물음표, 느낌표/SF 등이 있다. 예를 들어, 입력 문장이 "약관 대출은 어때?"인 경우, 프로세서(120)는 상기 입력 문장을 "약관 /NNG + 대출/NNG + 은/JKS + 어떻/VA + 어/EF + ?/SF" 와 같은 형태소 단위로 분석 하고 품사 태깅을 수행할 수 있다.The morpheme analysis may be to analyze the morpheme of the input sentence based on the morpheme dictionary based on the input sentence received from the user terminal 200. The processor 120 may split an input sentence including one or more words into a minimum unit that can have meaning. The processor 120 may match the split input sentence with each part of speech included in the morpheme dictionary. The processor 120 may analyze the input sentence on a morpheme basis based on a morpheme dictionary stored in the memory 130 and perform part-of-speech tagging. Here, the morpheme unit includes general noun / NNG, pronoun / JKS, adjective / VA, closing ending / EF, period, question mark, exclamation mark / SF, and the like. For example, if the input sentence is "How about the terms and conditions loan?", The processor 120 may read the input sentence as "Terms / NNG + Loans / NNG + Silver / JKS + How / VA + Er / EF +? / SF It is possible to analyze and perform part-of-speech tagging in units such as ".

프로세서(120)는 사전 정의된 룰(rule) 및 형태소 단위로 분석된 입력 문장에 기초하여 입력 문장을 구문 단위로 분석할 수 있다. 여기서 구문은 상기 입력 문장이 형태소 단위로 분석되고 품사 태깅이 수행된 후, 형태소 단위를 보다 큰 단위인 명사구, 동사구, 형용사구 등으로 묶은 덩어리를 의미할 수 있다. 예를 들어, "약관/NNG", "대출/NNG + 은/JKS", 어떻/VA + 어/EF + ?/SF"로 사용자의 메시지의 구문을 분석할 수 있다. 프로세서(120)는 명사구, 동사구, 형용사구 등으로 묶은 덩어리에 기초하여 문장의 구성 요소 중 주어 및 술어를 식별할 수 있다. 문장의 주어는, 명사구를 포함할 수 있고, 문장의 술어는 동사구 및 형용사구 중 적어도 하나를 포함할 수 있다.The processor 120 may analyze the input sentence in the syntax unit based on the predefined rule and the input sentence analyzed in the morpheme unit. Here, the phrase may refer to a lump in which the morpheme unit is grouped into larger units such as noun phrases, verb phrases, and adjective phrases after the input sentence is analyzed in morpheme units and part-of-speech tagging is performed. For example, the user's message can be parsed with "Terms / NNG", "Loan / NNG + Silver / JKS", How / VA + Uh / EF +? / SF. Processor 120 is a noun phrase , Subjects and predicates among the components of a sentence may be identified based on a chunk grouped with a verb phrase, an adjective phrase, etc. The subject of the sentence may include a noun phrase, and the predicate of the sentence may include at least one of a verb phrase and an adjective phrase. You can.

프로세서(120)는 형태소, 또는 구문 단위로 분석된 입력 문장으로부터 개체명을 인식할 수 있다. 여기서 개체명은 문서에서 나타나는 고유한 의미를 가지는 명사나 숫자 표현과 같이 고유한 성질의 표현을 말하며 인명(Person), 지명(Location), 기관명(Organization)과 같은 이름 표현, 날짜나 시간과 같은 시간 표현, 금액이나 퍼센트와 같은 수치 표현으로 구분할 수 있다. 프로세서(120)는 데이터 베이스에 저장된 개체명 사전에 기초하여 형태소, 또는 구문 단위로 분석된 입력 문장으로부터 개체명을 인식할 수 있다. 예를 들어, "약관/NNG + 대출/NNG + 은/JKS + 어떻/VA + 어/EF + ?/SF"에서 "약관" 및 "대출"를 각각 개체명으로 인식할 수 있다. 또한, "약관 대출"을 하나의 개체명으로 인식할 수도 있다.The processor 120 may recognize an object name from an input sentence analyzed in a morpheme or syntax unit. Here, the object name refers to expressions of unique properties such as nouns or numeric expressions that have a unique meaning in the document, and name expressions such as Person, Location, and Organization, and time expressions such as date and time. It can be distinguished by numerical expressions such as, amount or percent. The processor 120 may recognize an entity name from an input sentence analyzed in a morpheme or syntax unit based on the dictionary of entity names stored in the database. For example, in "Terms / NNG + Loan / NNG + Silver / JKS + How / VA + Er / EF +? / SF", "Terms" and "Loans" may be recognized as individual names, respectively. It is also possible to recognize "term loan" as a single entity name.

전술한 형태소 분석을 위한 프로세서(120)의 구체적인 동작은 예시일 뿐, 본 개시는 이에 제한되지 않는다. The specific operation of the processor 120 for morphological analysis described above is only an example, and the present disclosure is not limited thereto.

제 2 입력 문장은, 상기 제 1 입력 문장 이후에 수신된 하나 이상의 입력 문장 중 문장 기호를 포함하는 입력 문장일 수 있다. 프로세서(120)는 입력 문장에 대한 형태소 분석을 수행할 수 있다. The second input sentence may be an input sentence including a sentence symbol among one or more input sentences received after the first input sentence. The processor 120 may perform morpheme analysis on the input sentence.

프로세서(120)는 입력 문장에 대한 형태소 분석에 기초하여 문장의 구성 요소를 획득할 수 있다. 프로세서(120)는 입력 문장에 대한 문장의 구성 요소 중 문장 기호를 포함하는 경우, 해당 입력 문장을 제 2 입력 문장으로 결정할 수 있다. 예를 들어, 문장 기호는 물음표, 온점 등을 포함할 수 있다. 프로세서(120)는 문장 기호를 포함하는 입력 문장의 경우, 사용자의 질의 사항을 포함하는 마지막 입력 문장인 것으로 판단할 수 있다. 예를 들어, 컴퓨팅 장치(100)가 사용자 단말(200)로부터 “얼마야?”라는 입력 문장을 수신하는 경우, 상기 입력 문장에 대한 형태소 분석 결과, 문장 기호인 “?” 를 포함하고 있으므로, 프로세서(120)는 “얼마야?”에 대한 입력 문장을 제 2 입력 문장으로 결정할 수 있다. 예를 들어, 프로세서(120)는 “얼마야?”에 대한 제 2 입력 문장을 제 2 입력 문장 전에 수신된 제 1 입력 문장인 “약관 대출 이자율은”과 결합하여, 문장 완성도를 판단할 수 있다. 전술한 제 2 입력 문장에 대한 구체적인 기재는 예시일 뿐이며 본 개시는 이에 제한되지 않는다.The processor 120 may acquire components of the sentence based on the morpheme analysis of the input sentence. The processor 120 may determine a corresponding input sentence as the second input sentence when a sentence symbol is included among the constituent elements of the sentence for the input sentence. For example, the punctuation marks may include question marks, hot points, and the like. The processor 120 may determine that the input sentence including the sentence symbol is the last input sentence including the user's query. For example, when the computing device 100 receives the input sentence “How long?” From the user terminal 200, as a result of the morpheme analysis on the input sentence, the sentence symbol “?” Since it includes, the processor 120 may determine the input sentence for “How long?” As the second input sentence. For example, the processor 120 may combine the second input sentence for “How long?” With the first input sentence “Contract Loan Interest Rate” received before the second input sentence to determine sentence completion. . The specific description of the above-described second input sentence is only an example, and the present disclosure is not limited thereto.

본 개시의 일 실시예에 따라, 프로세서(120)는 입력 문장의 문장 완성도를 판단할 수 있다. 예를 들어, 프로세서(120)는 입력 문장의 구성 요소에 기초하여 문장 완성도를 판단할 수 있다. According to an embodiment of the present disclosure, the processor 120 may determine sentence completion of the input sentence. For example, the processor 120 may determine sentence completion based on the components of the input sentence.

프로세서(120)는 입력 문장에 대한 형태소 분석을 수행할 수 있다. 프로세서(120)는 사용자 단말(200)로부터 입력 문장을 수신하는 경우, 각 입력 문장에 대하여 형태소 분석을 수행하고, 입력 문장의 구성 요소에 기초하여 문장 완성도를 판단할 수 있다. 프로세서(120)는 사용자 단말(200)로부터 수신한 하나 이상의 입력 문장을 결합한 입력 문장에 대하여 형태소 분석을 수행하고, 입력 문장의 구성 요소에 기초하여 문장 완성도를 판단할 수 있다. 프로세서(120)는 하나의 입력 문장에 대한 형태소 분석을 수행할 수도 있고, 하나 이상의 입력 문장을 결합한 문장에 대한 형태소 분석을 수행할 수도 있다. 프로세서(120)는 형태소 분석을 통해 입력 문장에 포함된 문장의 구성 요소를 획득할 수 있다. 문장의 구성 요소는, 입력 문장에 포함된 하나 이상의 글자 중 의미를 가지는 글자들의 조합을 표시한 것일 수 있다. 예를 들어, 문장의 구성 요소는, 명사구, 동사구, 형용사구, 부사구 등을 포함할 수 있다. 프로세서(120)는 문장의 구성 요소에 기초하여 입력 문장의 문장 완성도를 판단할 수 있다. 프로세서(120)는 입력 문장에 대한 문장의 구성 요소에 주어 및 술어가 포함되어 있는 경우, 문장 완성도가 사전결정된 기준보다 높은 것으로 판단할 수 있다. 프로세서(120)는 입력 문장에 대한 문장의 구성 요소에 주어 및 술어 중 적어도 하나가 포함되어 있지 않은 경우, 문장 완성도가 사전결정된 기준보다 낮은 것으로 판단할 수 있다. 예를 들어, 프로세서(120)는 하나의 입력 문장인 “암보험이”에 대한 형태소 분석을 수행하는 경우, 해당 입력 문장에 대한 문장의 구성 요소는 주어 및 조사로 판단할 수 있고, 해당 입력 문장의 문장 완성도가 사전결정된 기준보다 낮은 것으로 판단할 수 있다. 예를 들어, 프로세서(120)는 제 1 입력 문장인 “암보험이”와 제 2 입력 문장인 “뭐야”를 결합한 제 3 입력 문장(본 예시에서 “암보험이 뭐야”)에 대한 형태소 분석을 수행하는 경우, 제 3 입력 문장에 대한 구성 요소는 주어, 조사 및 술어로 판단할 수 있고, 주어 및 술어를 모두 포함하는 제 3 입력 문장에 대하여 해당 입력 문장의 문장 완성도가 사전결정된 기준보다 높은 것으로 판단할 수 있다. 전술한 문장 완성도 판단 방법에 대한 구체적인 기재는 예시일 뿐이며 본 개시는 이에 제한되지 않는다.The processor 120 may perform morpheme analysis on the input sentence. When receiving an input sentence from the user terminal 200, the processor 120 may perform a morpheme analysis on each input sentence and determine sentence completion based on the components of the input sentence. The processor 120 may perform morpheme analysis on an input sentence combining one or more input sentences received from the user terminal 200 and determine sentence completion based on the components of the input sentence. The processor 120 may perform morpheme analysis on one input sentence or may perform morpheme analysis on a sentence combining one or more input sentences. The processor 120 may acquire components of sentences included in the input sentence through morpheme analysis. The component of the sentence may be a combination of letters having meaning among one or more letters included in the input sentence. For example, the components of the sentence may include noun phrases, verb phrases, adjective phrases, adverb phrases, and the like. The processor 120 may determine sentence completion of the input sentence based on the components of the sentence. When a subject and a predicate are included in a component of a sentence for the input sentence, the processor 120 may determine that the sentence completeness is higher than a predetermined criterion. When at least one of a subject and a predicate is not included in a component of a sentence for an input sentence, the processor 120 may determine that the sentence completion is lower than a predetermined criterion. For example, when performing the morpheme analysis of the “cancer insurance” which is one input sentence, the processor 120 may determine a component of the sentence for the corresponding input sentence as a subject and an investigation, and the corresponding input sentence Can be judged to be lower than the predetermined criterion. For example, the processor 120 performs morphological analysis on the third input sentence (“what is cancer insurance” in this example) combining the first input sentence “cancer insurance” and the second input sentence “what is”? When performing, the component for the third input sentence can be determined as a subject, an investigation, and a predicate, and for the third input sentence including both the subject and the predicate, the sentence completion degree of the corresponding input sentence is higher than a predetermined criterion. I can judge. The detailed description of the method for determining sentence completion is merely an example, and the present disclosure is not limited thereto.

본 개시의 일 실시예에 따라, 프로세서(120)는 입력 문장과 기준 문장의 유사도가 임계 값 이상인지 여부에 기초하여 문장 완성도를 판단할 수 있다.According to an embodiment of the present disclosure, the processor 120 may determine sentence completion based on whether the similarity between the input sentence and the reference sentence is greater than or equal to a threshold value.

프로세서(120)는 입력 문장을 하나 이상의 네트워크 함수를 포함하는 전처리 모델을 통해 연산함으로써 입력 문장 벡터를 획득할 수 있다. 전처리 모델을 통해 연산하는 입력 문장은, 하나의 입력 문장일 수도 있고, 하나 이상의 입력 문장을 결합한 입력 문장일 수도 있다. 프로세서(120)는 사용자 단말(200)로부터 입력 문장을 수신하는 경우, 각 입력 문장에 대하여 기준 문장과의 유사도를 비교하여, 상기 유사도가 임계 값 이상인지 여부에 기초하여 문장 완성도를 판단할 수 있다. 프로세서(120)는 사용자 단말(200)로부터 수신한 하나 이상의 입력 문장을 결합한 입력 문장에 대하여 기준 문장과의 유사도를 비교하여, 상기 유사도가 임계 값 이상인지 여부에 기초하여 문장 완성도를 판단할 수 있다.본 개시의 일 실시예에 따른 전처리 모델은 RNN(Recurrent Neural Network)을 이용하여 자연어 처리를 수행할 수 있다. 여기서 RNN은 하나 이상의 단어를 포함하는 문장의 데이터에 기초하여 패턴을 학습하기 위한 딥 러닝 모델을 의미할 수 있다. 즉, 프로세서(120)는 RNN에 기초하여 입력 문장을 입력 데이터, 그리고 상기 입력 문장에 대한 입력 문장 벡터를 출력 데이터로써 반복 학습할 수 있다. 이에 따라, 프로세서(120)는 네트워크부(110)를 통해 수신되는 입력 문장이 축적되고, 상기 입력 문장에 대한 입력 문장 벡터 정보가 축적될수록 보다 정확한 입력 문장의 벡터화를 수행할 수 있다. The processor 120 may obtain an input sentence vector by calculating an input sentence through a pre-processing model including one or more network functions. The input sentence calculated through the pre-processing model may be one input sentence or an input sentence combining one or more input sentences. When receiving the input sentence from the user terminal 200, the processor 120 may compare the similarity with the reference sentence for each input sentence, and determine sentence completion based on whether the similarity is greater than or equal to a threshold. . The processor 120 may compare the similarity with the reference sentence with respect to the input sentence combining one or more input sentences received from the user terminal 200 to determine sentence completion based on whether the similarity is greater than or equal to a threshold. The pre-processing model according to an embodiment of the present disclosure may perform natural language processing using a Recurrent Neural Network (RNN). Here, RNN may mean a deep learning model for learning a pattern based on data of a sentence including one or more words. That is, the processor 120 may repeatedly learn an input sentence based on an RNN as input data and an input sentence vector for the input sentence as output data. Accordingly, the processor 120 may perform more accurate vectorization of input sentences as input sentences received through the network unit 110 are accumulated and input sentence vector information for the input sentences is accumulated.

그리고, 본 개시의 일 실시예에 따른 RNN은 LSTM(Long Short-Term Memory Units) 방식 및 GRU(Gated Recurrent Units) 방식 중 적어도 하나를 통해 연속 학습으로 인한 그라디언트 소실(Vanishing Gradient) 문제를 해결할 수 있다. 보다 구체적으로, 프로세서(120)는 기본적인 RNN 구조에 각각의 가중치를 지니고 있는 입력, 출력 및 망각 게이트를 추가하는 LSTM 방식을 사용하여 그라디언트 소실을 방지할 수 있다. 또한, 다른 일 실시예에서 프로세서(120)는 RNN 구조에 입력 및 망각 게이트만 추가적으로 포함하는 GRU 방식(예컨대, 다이나믹 GRU(Dynamic Gated Recurrent Units) 방식)으로 그라디언트 소실을 방지할 수 있다. 전술한 RNN의 구조에 대한 자세한 기재는 예시일 뿐, 본 개시는 이에 제한되지 않는다.In addition, the RNN according to an embodiment of the present disclosure may solve a problem of vanishing gradient due to continuous learning through at least one of a Long Short-Term Memory Units (LSTM) method and a Gated Recurrent Units (GRU) method. . More specifically, the processor 120 may prevent gradient loss by using an LSTM method that adds input, output, and forget gates having respective weights to the basic RNN structure. In addition, in another embodiment, the processor 120 may prevent gradient loss by a GRU method (eg, a dynamic Gated Recurrent Units (GRU) method) that additionally includes only input and forget gates in the RNN structure. The detailed description of the structure of the RNN described above is merely an example, and the present disclosure is not limited thereto.

보다 구체적으로, 프로세서(120)는 입력 문장을 하나 이상의 네트워크 함수를 포함하는 전처리 모델을 통해 연산함으로써 입력 문장 벡터 형태로 환산할 수 있다. 여기서 전처리 모델은 형태소 분석을 통해 입력 문장을 하나 이상의 형태소들의 집합으로 변환하기 위한 함수를 포함할 수 있다. 그리고, 전처리 모델은 하나 이상의 형태소 각각을 사전 설정된 수의 차원(예컨대, 100차원)의 벡터로 환산하기 위한 함수를 포함할 수 있다. 예를 들어, 전처리 모델은 "Word2Vec"함수를 포함할 수 있다. 그리고, 전처리 모델은 형태소 단위로 환산된 벡터 간의 거리를 산출하기 위한 함수를 이용하여 가중치 행렬의 형태로 산출할 수 있다. 그리고, 전처리 모델은 입력 문장에 대한 가중치에 기초하여 입력 문장 자체의 벡터를 산출하기 위한 함수를 포함할 수 있다. 전술한 전처리 모델에 대한 자세한 기재는 예시일 뿐, 본 개시는 이에 제한되지 않는다.More specifically, the processor 120 may convert the input sentence into an input sentence vector form by calculating through a pre-processing model including one or more network functions. Here, the pre-processing model may include a function for transforming an input sentence into a set of one or more morphemes through morpheme analysis. In addition, the pre-processing model may include a function for converting each of one or more morphemes into a vector of a predetermined number of dimensions (eg, 100 dimensions). For example, the pre-processing model may include a "Word2Vec" function. In addition, the pre-processing model may be calculated in the form of a weighting matrix using a function for calculating a distance between vectors converted in morpheme units. In addition, the pre-processing model may include a function for calculating a vector of the input sentence itself based on the weight of the input sentence. The detailed description of the above-described pretreatment model is merely an example, and the present disclosure is not limited thereto.

프로세서(120)는 하나 이상의 기준 문장 각각을 전처리 모델을 통해 연산함으로써 하나 이상의 기준 문장 벡터를 획득할 수 있다.The processor 120 may obtain one or more reference sentence vectors by calculating each of the one or more reference sentences through a pre-processing model.

기준 문장은 사용자의 입력 문장을 분류하기 위한 주제 및 서브 주제 중 적어도 하나를 대표하는 문장일 수 있다. 주제는, 챗봇이 사용자와 대화를 이어 나가는데 있어 중심이 되는 문제일 수 있다. 하나의 주제는 하나 이상의 서브 주제를 포함할 수 있다. 서브 주제는, 사용자와 대화를 이어 나가는데 있어 중심이 되는 문제에 속하는 하위 문제일 수 있다. 하나의 서브 주제는 하나 이상의 기준 문장을 포함할 수 있다. 예를 들어, 신용대출 안내 및 약관대출 안내는 각각 주제일 수 있다. 약관대출 안내의 주제는 하나 이상의 서브 주제를 포함할 수 있다. 약관대출 안내의 주제는 하나 이상의 서브 주제를 포함할 수 있다. 약관대출 안내의 주제는 약관대출의 정의 안내, 약관 대출 이자율 안내, 로그인 요청, 실제 적용 이자율 안내 및 약관대출 실행에 대한 서브 주제 각각을 포함할 수 있다. 그리고, 기준 문장은 데이터베이스, 또는 메모리(130) 상에 사전 저장될 수 있다. 그리고, 각각의 기준 문장은 주제 및 하나 이상의 서브 주제를 포함하고 있을 수 있다. 예를 들어, "약관 대출은 무엇인가요?"라는 기준 문장은 "약관 대출" 주제의 "정의"라는 서브 주제로 메모리(130) 상에 사전에 저장될 수 있다. 또한, "약관 대출은 무엇인가요?"라는 기준 문장은 기준 문장 벡터와 연관되어 저장될 수 있다. 다른 실시예에서, "약대가 뭐야?"라는 기준 문장은 메모리(130)에 저장되어 있고, 전처리 모델(300)을 통해 기준 문장 벡터로 연산 될 수 있다. 또한, "보험"주제에 대한 "문의"서브 주제를 가진 기준 문장은 "보험에 대해 물어보고 싶어", "보험 좀 알아보고 싶습니다" 및 "보험 관련 문의 드립니다"를 각각 포함할 수도 있다. 전술한 기준 문장의 자세한 기재는 예시일 뿐, 본 개시는 이에 제한되지 않는다.The reference sentence may be a sentence representing at least one of a subject and a sub-subject to classify a user's input sentence. The topic may be a central problem in the chatbot's conversation with the user. One subject may include one or more sub-themes. The sub topic may be a sub-problem belonging to a central problem in continuing a conversation with a user. One sub-subject may include one or more reference sentences. For example, the credit loan guide and the terms loan guide may each be topics. The subject of the terms and conditions loan guide may include one or more sub-topics. The subject of the terms and conditions loan guide may include one or more sub-topics. The subject of the terms and conditions loan guide may include a definition of terms and conditions loan, a terms loan interest rate guide, a login request, an actual applied interest rate guide, and sub-subjects for executing the terms loan. In addition, the reference sentence may be pre-stored in the database or the memory 130. In addition, each reference sentence may include a subject and one or more sub-subjects. For example, the reference sentence “What is the terms and conditions of the loan?” May be stored in advance on the memory 130 as a sub-subject of “Definition” of the subject of the terms and conditions. In addition, the reference sentence "What are the terms and conditions of loan?" May be stored in association with the reference sentence vector. In another embodiment, the reference sentence "What is the drugstore?" Is stored in the memory 130 and may be calculated as a reference sentence vector through the pre-processing model 300. In addition, a reference sentence with a "Contact" sub-subject to the "Insurance" topic may include "I want to ask about insurance", "I want to know some insurance" and "I would like to inquire about insurance" respectively. The detailed description of the above-mentioned reference sentence is only an example, and the present disclosure is not limited thereto.

프로세서(120)는 입력 문장 벡터와 상기 하나 이상의 기준 문장 벡터 각각에 대한 유사도가 임계 값 이상인 기준 문장이 있는지 여부를 결정할 수 있다. 프로세서(120)는 입력 문장 벡터와 상기 하나 이상의 기준 문장 벡터 각각에 대한 유사 정도를 나타내는 하나 이상의 스코어링 값을 출력할 수 있다. 프로세서(120)는 상기 하나 이상의 스코어링 값 중 임계 값 이상의 스코어링 값의 클래스에 기초하여 기준 문장을 결정할 수 있다. 프로세서(120)는 입력 문장과 기준 문장이 포함하고 있는 하나 이상의 단어의 의미, 배열 및 각 단어 간의 관계에 기초한 분석을 수행하여 유사도를 산출할 수 있다. 프로세서(120)는 전처리 모델에서 입력 문장 벡터 형태로 환산된 입력 문장과, 사전에 저장된 기준 문장 벡터의 간의 유사도를 산출할 수 있다. 여기서 유사도는 문장 단위로 스코어링 된 점수 형태일 수 있다.The processor 120 may determine whether there is a reference sentence having a similarity level equal to or greater than a threshold value for each of the input sentence vector and the one or more reference sentence vectors. The processor 120 may output an input sentence vector and one or more scoring values indicating the degree of similarity to each of the one or more reference sentence vectors. The processor 120 may determine a reference sentence based on a class of scoring values of a threshold value or more among the one or more scoring values. The processor 120 may calculate similarity by performing analysis based on the meaning, arrangement, and relationship between each word of one or more words included in the input sentence and the reference sentence. The processor 120 may calculate the similarity between an input sentence converted into an input sentence vector form in a preprocessing model and a reference sentence vector stored in advance. Here, the similarity may be a score form scored in units of sentences.

예를 들어, 프로세서(120)는 입력 문장 "약관 대출 문의를 하고 싶다"와 "약관 대출"주제의 "문의"서브 주제에 대한 기준 문장인 "약관 대출을 원합니다"의 유사도를 분석할 수 있다. 그리고, 프로세서(120)는 입력 문장과 기준 문장이 포함하는 "약관" 및 "대출"이라는 단어의 일치, 그리고 "문의"와 "원합니다"의 의미 유사도에 기초하여, 입력 문장 벡터와 기준 문장 벡터와의 유사도가 "80" 이라고 산출할 수 있다. 또한, 프로세서(120)는 상기 입력 문장 "약관 대출 문의를 하고 싶다"와 "보험 상품"주제의 "추천"서브 주제에 대한 기준 문장인 "보험 상품을 추천해줘"의 유사도를 분석할 수도 있다. 그리고, 프로세서(120)는 상기 입력 문장이 포함하는 "약관 대출"과 "문의"의 관계가 기준 문장이 포함하는 "보험"과 "추천"의 관계와 일치하는 정도를 분석하여, 입력 문장 벡터와 기준 문장 벡터와의 유사도가 "35"라고 산출할 수 있다. 전술한 프로세서(120)의 자세한 동작 기재와 수치는 예시일뿐, 본 개시는 이에 제한되지 않는다.For example, the processor 120 may analyze the similarity between the input sentences "I want to inquire about the terms of the loan" and "I want the terms of the loan", which are the reference sentences on the "Contact" sub topic of the "Terms of Loans" topic. Then, the processor 120 based on the matching of the terms "term" and "loan" included in the input sentence and the reference sentence, and the semantic similarity between "inquiry" and "want", the input sentence vector and the reference sentence vector The similarity of can be calculated as "80". In addition, the processor 120 may analyze the similarity between the input sentences "I want to inquire about a loan agreement" and "Recommend an insurance product" which is a reference sentence on a "recommended" sub theme of the "Insurance product" topic. Then, the processor 120 analyzes the degree to which the relationship between the terms & conditions and "inquiry" included in the input sentence matches the relationship between "insurance" and "recommended" included in the reference sentence, and the input sentence vector and The similarity with the reference sentence vector can be calculated as "35". The detailed operation description and numerical values of the above-described processor 120 are merely examples, and the present disclosure is not limited thereto.

다른 예시에 있어서, 입력 문장이 "내가 든 상품에서 골절이 보장될까?"이고, "보험 상품"주제의 "보장 범위"서브 주제에 대한 제 1 기준 문장이 "가입한 보험 상품의 보장 정보는?", 그리고 제 2 기준 문장이 "이것도 보험금 청구할 수 있어?"일 수 있다. 이러한 경우, 프로세서(120)는 입력 문장 벡터와 제 1 및 2 기준 문장 벡터와의 유사도를 각각 산출하여 제 1 및 2 유사도를 생성할 수 있다. 보다 구체적으로, 프로세서(120)는 입력 문장의 "내가 든 상품" 및 "보장"과 제 1 기준 문장의 "가입한 보험 상품" 및 "보장 정보"의 의미 유사도에 기초하여, 입력 문장 벡터와 제 1 기준 문장 벡터의 제 1 유사도를 "95"라고 산출할 수 있다. 그리고 프로세서(120)는 입력 문장의 "가입한 상품" 및 "보장 정보"와 제 2 기준 문장의 "보험금 청구"의 의미 유사도에 기초하여, 입력 문장 벡터와 제 2 기준 문장 벡터의 제 2 유사도를 "61"라고 산출할 수 있다. 이에 따라, 프로세서(120)는 제 1 기준 문장에 기초하여 유사도가 임계 값 이상인 기준 문장이 있는 것으로 결정할 수 있다. 프로세서(120)는 유사도가 임계 값 이상인 기준 문장이 있는 경우, 상기 문장 완성도가 상기 사전결정된 기준보다 높은 것으로 판단할 수 있다. 전술한 프로세서(120)의 동작은 예시일 뿐, 본 개시는 이에 제한되지 않는다.In another example, the input sentence is “Are fractures guaranteed in my product?”, And the first reference sentence on the “coverage coverage” sub-topic of the “Insurance product” topic is “What is the coverage information of the insurance product that is subscribed? And, the second standard sentence may be "Can I claim insurance for this too?" In this case, the processor 120 may generate first and second similarities by calculating similarities between the input sentence vector and the first and second reference sentence vectors, respectively. More specifically, the processor 120 is based on the semantic similarity of the "I'm the product" and "Guarantee" of the input sentence and the "Subscribed insurance product" and the "Guarantee information" of the first reference sentence, and The first similarity of the 1 reference sentence vector can be calculated as “95”. And the processor 120 based on the semantic similarity of the "subscribed product" and "guaranteed information" of the input sentence and "insurance claim" of the second reference sentence, the second similarity of the input sentence vector and the second reference sentence vector It can be calculated as "61". Accordingly, the processor 120 may determine that there is a reference sentence having similarity or higher than a threshold value based on the first reference sentence. When there is a reference sentence having similarity or higher than a threshold, the processor 120 may determine that the sentence completion is higher than the predetermined reference. The above-described operation of the processor 120 is merely an example, and the present disclosure is not limited thereto.

프로세서(120)는 유사도가 임계 값 이상인 기준 문장이 없는 경우, 상기 문장 완성도가 상기 사전결정된 기준보다 낮은 것으로 판단할 수 있다. 또는, 프로세서(120)는 유사도가 임계 값 이상인 기준 문장이 없는 경우, 입력 문장의 구성 요소에 기초하여 문장 완성도를 판단할 수 있다. 즉, 프로세서(120)는 입력 문장을 기준 문장에 기초하여 분류할 수 없는 경우 문장의 완성도가 사전결정된 기준보다 낮은 것으로 판단할 수 있으며, 이 경우 프로세서(120)는 문장의 구성 요소에 기초하여 입력 문장의 문장 완성도를 추가적으로 판단할 수도 있다. When there is no reference sentence having a similarity level higher than a threshold, the processor 120 may determine that the sentence completion level is lower than the predetermined reference level. Alternatively, when there is no reference sentence having a similarity level greater than or equal to a threshold, the processor 120 may determine sentence completion based on the components of the input sentence. That is, when the input sentence cannot be classified based on the reference sentence, the processor 120 may determine that the sentence completeness is lower than a predetermined criterion. In this case, the processor 120 may input based on the sentence component. It is also possible to additionally judge the sentence completion level.

본 개시의 일 실시예에 따라, 프로세서(120)는 입력 문장의 주제 및 서브 주제에 기초하여 문장 완성도를 판단할 수 있다.According to an embodiment of the present disclosure, the processor 120 may determine sentence completion based on the subject and sub-subject of the input sentence.

프로세서(120)는 입력 문장을 문장 주제 판단 모델을 통해 연산할 수 있다. 입력 문장은, 사용자 단말(200)로부터 수신한 하나의 입력 문장을 포함할 수도 있고, 하나 이상의 입력 문장을 결합한 문장을 포함할 수도 있다. 프로세서(120)는 사용자 단말(200)로부터 입력 문장을 수신하는 경우, 각 입력 문장의 주제 및 서브 주제에 기초하여 문장 완성도를 판단할 수 있다. 프로세서(120)는 사용자 단말(200)로부터 수신한 하나 이상의 입력 문장을 결합한 입력 문장의 주제 및 서브 주제에 기초하여 문장 완성도를 판단할 수 있다. 프로세서(120)는 상기 연산을 통해 상기 입력 문장의 주제 및 서브 주제를 모두 획득하는 경우, 문장 완성도가 상기 사전결정된 기준보다 높은 것으로 판단할 수 있다. 프로세서(120)는 상기 연산을 통해 상기 주제 및 상기 서브 주제 중 적어도 하나를 획득하지 못하는 경우, 문장 완성도가 상기 사전결정된 기준보다 낮은 것으로 판단할 수 있다.The processor 120 may calculate an input sentence through a sentence subject judgment model. The input sentence may include one input sentence received from the user terminal 200, or may include a sentence combining one or more input sentences. When receiving an input sentence from the user terminal 200, the processor 120 may determine sentence completion based on the subject and sub-subject of each input sentence. The processor 120 may determine sentence completion based on the subject and sub-subject of the input sentence combining one or more input sentences received from the user terminal 200. When the processor 120 acquires both the subject and the sub-subject of the input sentence through the operation, it may determine that the sentence completion is higher than the predetermined criterion. When the processor 120 does not acquire at least one of the subject and the sub-subject through the operation, it may determine that sentence completion is lower than the predetermined criterion.

문장 주제 판단 모델은, 하나 이상의 기준 문장 및 기준 답변 중 적어도 하나를 임의의 길이로 분할하여 생성된 학습 입력 문장 및 상기 학습 입력 문장에 라벨링 된 주제 및 서브 주제 중 적어도 하나 포함하는 학습 데이터 세트에 기초하여 상기 학습 입력 문장을 입력하는 경우 상기 주제 및 상기 서브 주제 중 적어도 하나를 출력하도록 학습된 모델일 수 있다.The sentence subject judgment model is based on a learning data set including at least one of a learning input sentence generated by dividing at least one of one or more reference sentences and a reference answer into an arbitrary length, and at least one of topics and sub-topics labeled in the learning input sentence. In the case of inputting the learning input sentence, it may be a model trained to output at least one of the subject and the sub subject.

본 개시의 일 실시예에 따른 문장 주제 판단 모델은 학습 데이터를 포함하는 학습 데이터 세트를 통해 학습되어 생성될 수 있다. 보다 구체적으로, 프로세서(120)는 네트워크부(110)를 이용하여 학습 데이터 세트를 획득할 수 있다. 학습 데이터 세트는 학습 입력 문장 및 주제 및 서브 주제를 포함할 수 있다. 학습 입력 문장은, 하나 이상의 기준 문장 및 기준 답변 중 적어도 하나를 임의의 길이로 분할하여 생성한 것일 수 있다. 기준 답변은, 메모리(130)에 기준 문장에 매칭되어 저장된 기준 문장에 대한 답변일 수 있다. 예를 들어, 하나 이상의 기준 문장 중 하나의 기준 문장이 "약관대출이 뭔지 알려줘"인 경우, 기준 문장을 임의의 길이로 분할한 "약관대출이 뭔지 알려줘", "약관대출이 뭔", "뭔지 알려줘" 각각이 학습 입력 문장일 수 있다. The sentence subject judgment model according to an embodiment of the present disclosure may be learned and generated through a learning data set including learning data. More specifically, the processor 120 may acquire a training data set using the network unit 110. The learning data set may include learning input sentences and subjects and sub subjects. The learning input sentence may be generated by dividing at least one of the one or more reference sentences and the reference answer into an arbitrary length. The reference answer may be an answer to the reference sentence stored in the memory 130 matching the reference sentence. For example, if one of the reference sentences of one or more of the reference sentences is "tell me what the terms and conditions are", divide the reference sentence into arbitrary lengths, "tell me what the terms and conditions are," and "what is the terms and conditions of the loan?" Each can be a learning input sentence.

프로세서(120)는 학습 데이터 세트에 포함된 학습 입력 문장에 주제 및 서브 주제 중 적어도 하나를 라벨링하여 학습 데이터를 생성할 수 있다. 예를 들어, 학습 입력 문장 "약관대출이 뭔"에 대해, "약관 대출"에 대한 주제를 라벨로 하여 학습 데이터를 생성할 수 있다. 예를 들어, 학습 입력 문장 "뭔지 알려줘"에 대해, "정의"에 대한 서브 주제를 라벨로 하여 학습 데이터를 생성할 수 있다. 전술한 학습 데이터의 생성에 대한 기재는 예시일 뿐, 본 개시는 이에 제한되지 않는다.The processor 120 may generate learning data by labeling at least one of a subject and a sub-subject to a learning input sentence included in the learning data set. For example, for the learning input sentence "What is a loan agreement", the subject for "a loan agreement" may be labeled to generate training data. For example, for the learning input sentence "tell me what it is", the sub-topic for "definition" can be labeled to generate learning data. The description of the generation of the learning data described above is merely an example, and the present disclosure is not limited thereto.

프로세서(120)는 하나 이상의 네트워크 함수를 포함하는 주제 판단 모델을 생성할 수 있다. 프로세서(120) 입력 레이어, 하나 이상의 히든 레이어 및 출력 레이어 중 적어도 하나를 포함하는 하나 이상의 네트워크 함수로 이루어진 주제 판단 모델을 생성할 수 있다. 프로세서(120)는 하나 이상의 입력 노드를 포함하는 입력 레이어로 구성된 네트워크 함수를 생성할 수 있다. 프로세서(120)는 네트워크 함수에 포함된 히든 레이어는 하나 이상의 히든 노드를 포함하도록 네트워크 함수를 생성할 수 있다. 프로세서(120)는 네트워크 함수에 포함된 출력 레이어는 하나 이상의 출력 노드를 포함하도록 네트워크 함수를 생성할 수 있다. 프로세서(120)는 네트워크 함수의 레이어에 포함된 각각의 노드는 다른 레이어의 하나 이상의 노드와 링크를 통해 각각 연결되도록 생성할 수 있다. 각각의 링크에는 각각의 가중치가 설정될 수 있다.The processor 120 may generate a subject judgment model including one or more network functions. The processor 120 may generate a subject determination model composed of one or more network functions including at least one of an input layer, one or more hidden layers, and an output layer. The processor 120 may generate a network function composed of an input layer including one or more input nodes. The processor 120 may generate a network function such that the hidden layer included in the network function includes one or more hidden nodes. The processor 120 may generate a network function such that the output layer included in the network function includes one or more output nodes. The processor 120 may generate each node included in the layer of the network function to be connected to one or more nodes of the other layer through links. Each link may have a respective weight.

프로세서(120)는 주제 판단 모델의 입력으로 학습 데이터의 학습 입력 문장을 입력할 수 있다. 프로세서(120)는 주제 판단 모델의 입력 레이어에 포함된 하나 이상의 입력 노드에 학습 입력 문장을 입력할 수 있다. 프로세서(120)는 입력 레이어, 하나 이상의 히든 레이어 및 출력 레이어를 포함하는 문장 주제 판단 모델을 생성할 수 있다. 레이어는 하나 이상의 노드를 포함할 수 있다. 노드는 다른 노드와 링크를 통해 연결될 수 있다. 프로세서(120)는 문장 주제 판단 모델의 입력 레이어의 입력 노드에 입력된 학습 입력 문장을 상기 입력 노드와 연결된 링크를 통해 연산하여 히든 레이어로 전파할 수 있다. 연산은 임의의 수학적 연산을 포함할 수 있다. 예를 들어, 연산은 곱 또는 합성 곱일 수 있으나, 전술한 기재는 예시일 뿐이며 본 개시는 이에 제한되지 않는다. 프로세서(120)는 문장 주제 판단 모델의 입력 노드에 입력된 학습 데이터의 입력을 상기 입력 노드와 연결된 링크를 통해 연산하여 하나 이상의 히든 레이어를 거쳐 출력 레이어로 전파할 수 있다. 프로세서(120)는 문장 주제 판단 모델의 출력 레이어로 전파된 값에 기초하여 주제 및 서브 주제 중 적어도 하나를 생성할 수 있다.The processor 120 may input a learning input sentence of learning data as an input of the subject judgment model. The processor 120 may input a learning input sentence to one or more input nodes included in the input layer of the subject judgment model. The processor 120 may generate a sentence subject judgment model including an input layer, one or more hidden layers, and an output layer. The layer may include one or more nodes. Nodes can be connected through links with other nodes. The processor 120 may compute a learning input sentence input to an input node of the input layer of the sentence subject determination model through a link connected to the input node and propagate it to the hidden layer. Operations can include any mathematical operation. For example, the operation may be a product or a product of synthesis, but the above description is only examples and the present disclosure is not limited thereto. The processor 120 may compute input of learning data input to an input node of the sentence subject judgment model through a link connected to the input node and propagate it to an output layer through one or more hidden layers. The processor 120 may generate at least one of the subject and sub-subjects based on the value propagated to the output layer of the sentence subject determination model.

프로세서(120)는 문장 주제 판단 모델의 제 1 노드의 제 1 노드 값을, 상기 제 1 노드와 연결된 이전 레이어에 포함된 제 2 노드의 제 2 노드 값과 상기 이전 레이어에 포함된 제 2 노드와 상기 제 1 노드를 연결하는 링크에 설정된 링크의 제 1 링크 가중치로 연산하여 도출할 수 있다. 프로세서(120)는 문장 주제 판단 모델의 제 1 노드의 제 1 노드 값을, 제 1 노드와 연결된 다음 레이어에 포함된 제 3 노드를 연결하는 링크에 설정된 제 2 링크 가중치로 연산하여 제 3 노드에 전파할 수 있다.The processor 120 may determine the first node value of the first node of the sentence subject determination model, the second node value of the second node included in the previous layer connected to the first node, and the second node included in the previous layer. It can be derived by calculating the first link weight of the link established in the link connecting the first node. The processor 120 calculates the value of the first node of the first node of the sentence subject determination model as the second link weight set in the link connecting the third node included in the next layer connected to the first node to the third node. Can propagate.

프로세서(120)는 문장 주제 판단 모델을 생성하기 위하여 학습 데이터의 학습 입력 문장을 피드백 평가 모델(400)의 입력 레이어에 포함된 하나 이상의 입력 노드에 입력시키고, 피드백 평가 모델(400)의 출력 레이어에서 연산한 주제 및 서브 주제 중 적어도 하나(즉, 출력값)와 학습 데이터에 라벨링된 주제 및 서브 주제(즉, 정답)를 비교하여 오차를 계산할 수 있다. 프로세서(120)는 오차에 기초하여 문장 주제 판단 모델의 가중치를 조정할 수 있다. 프로세서(120)는 상기 오차에 기초하여 문장 주제 판단 모델의 하나 이상의 네트워크 함수에 포함된 출력 레이어로부터 하나 이상의 히든 레이어를 거쳐 입력 레이어로 전파함으로써, 각각의 링크에 설정된 가중치를 업데이트할 수 있다. 또한, 프로세서(120)는 문장 주제 판단 모델을 생성하는데 있어, 과적합(overfitting)을 방지하기 위하여 히든 노드의 출력 일부가 다음 히든 노드에 전달되지 못하도록 드롭 아웃을 설정할 수 있다.The processor 120 inputs the learning input sentence of the training data to one or more input nodes included in the input layer of the feedback evaluation model 400 to generate the sentence subject judgment model, and in the output layer of the feedback evaluation model 400. An error may be calculated by comparing at least one of the calculated subject and sub-subject (ie, output value) with the subject and sub-subject (ie, correct answer) labeled in the training data. The processor 120 may adjust the weight of the sentence subject judgment model based on the error. The processor 120 may update the weight set for each link by propagating from the output layer included in one or more network functions of the sentence subject judgment model to the input layer through one or more hidden layers based on the error. In addition, in generating the sentence subject judgment model, the processor 120 may set a dropout so that a part of the output of the hidden node is not transmitted to the next hidden node in order to prevent overfitting.

학습 에폭(epoch)은, 학습 데이터 세트에 포함된 모든 학습 데이터에 관하여 학습 피드백 데이터를 문장 주제 판단 모델의 하나 이상의 네트워크 함수의 입력 레이어에 포함된 하나 이상의 입력 노드 각각에 입력하고, 학습 입력 문장에 라벨링 된 주제 및 서브 주제 중 적어도 하나(즉, 정답)와 문장 주제 판단 모델의 주제 및 서브 주제(즉, 출력값)를 비교하여 오차를 도출하고, 상기 도출된 오차를 문장 주제 판단 모델의 하나 이상의 네트워크 함수의 출력 레이어로부터 하나 이상의 히든 레이어를 거쳐 입력 레이어로 전파함으로써, 각각의 링크에 설정된 가중치를 업데이트하는 동작일 수 있다. 즉, 학습 데이터 세트에 포함된 모든 학습 데이터에 대하여 문장 주제 판단 모델을 이용한 연산과 문장 주제 판단 모델에 대한 가중치 업데이트 과정을 수행한 경우 1 에폭일 수 있다. The learning epoch inputs learning feedback data to each of the one or more input nodes included in the input layer of one or more network functions of the sentence subject judgment model with respect to all the learning data included in the learning data set, and to the learning input sentences. At least one of the labeled subject and sub-subjects (ie, the correct answer) is compared with the subject and sub-subjects (ie, output value) of the sentence subject judgment model to derive errors, and the derived errors are one or more networks of the sentence subject judgment model It may be an operation of updating the weight set in each link by propagating from the output layer of the function to the input layer through one or more hidden layers. That is, when all the learning data included in the learning data set are performed using the sentence subject judgment model and the weight update process for the sentence subject judgment model may be 1 epoch.

프로세서(120)는 문장 주제 판단 모델을 생성하는데 있어, 상기 문장 주제 판단 모델을 학습시키기 위한 학습 에폭이 사전결정된 에폭 이하인 경우, 상기 문장 주제 판단 모델의 학습률을 사전 결정된 수치 이상으로 설정할 수 있다. 프로세서(120)는 문장 주제 판단 모델을 생성하는데 있어, 상기 문장 주제 판단 모델을 학습시키기 위한 학습 에폭이 사전결정된 에폭 이상인 경우, 상기 문장 주제 판단 모델의 학습률을 사전 결정된 수치 이하로 설정할 수 있다. 상기 학습률은, 가중치의 업데이트 정도를 의미할 수 있다. 예를 들어, 학습 초반에는 학습률을 빠르게 하여(즉, 가중치의 업데이트 정도를 큰 폭으로 하여), 학습 데이터(정답으로 라벨링 된 학습 경험 서비스)와 출력 데이터(모델에서 출력되는 경험 서비스)가 동일한 값에 수렴하도록 할 수 있다. 예를 들어, 학습 후반에는 학습률을 낮게 하여(즉, 가중치의 업데이트 정도를 작은 폭으로 하여), 학습 데이터와 출력 데이터의 오차를 줄이도록(즉, 정확성을 높이도록)할 수 있다. 전술한 학습률에 대한 개시는 예시일 뿐이며, 본 개시는 이에 제한되지 않는다.In generating the sentence subject judgment model, the processor 120 may set the learning rate of the sentence subject judgment model to a predetermined value or more when the learning epoch for learning the sentence subject judgment model is equal to or less than a predetermined epoch. In generating the sentence subject judgment model, the processor 120 may set the learning rate of the sentence subject judgment model to a predetermined value or less when the learning epoch for learning the sentence subject judgment model is greater than or equal to a predetermined epoch. The learning rate may mean an update degree of weights. For example, at the beginning of learning, the learning rate is fast (i.e., the weight is updated significantly), so that the learning data (the learning experience service labeled as the correct answer) and the output data (the experience service output from the model) are the same value. Can converge to For example, in the second half of learning, the learning rate may be lowered (that is, the update degree of the weight may be made smaller), and the error between the learning data and the output data may be reduced (ie, to increase accuracy). The disclosure on the learning rate described above is only an example, and the present disclosure is not limited thereto.

프로세서(120)는 문장 주제 판단 모델의 학습을 사전결정된 에폭 이상 수행한 후, 검증 데이터 세트를 이용하여 학습의 중단 여부를 결정할 수 있다. 사전 결정된 에폭은 전체 학습 목표 에폭의 일부일 수 있다. 프로세서(120)는 학습 데이터 세트의 일부를 검증 데이터 세트로 할 수 있다. 검증 데이터는 학습 데이터에 대응되는 데이터로, 학습의 중단 여부를 결정하기 위한 데이터일 수 있다. 프로세서(120)는 문장 주제 판단 모델의 학습이 사전결정된 에폭 이상 반복된 후, 검증 데이터를 이용하여 문장 주제 판단 모델의 학습 효과가 사전결정된 수준 이상인지 판단할 수 있다. 예를 들어, 프로세서(120)는 100만개의 학습 데이터를 이용하여 목표 반복 학습 횟수가 10만회인 학습을 수행하는 경우, 사전결정된 에폭인 10000회의 반복 학습을 수행한 후, 1000개의 검증 데이터를 이용하여 10회의 반복 학습(즉, 10 에폭)을 수행하여, 10회의 반복 학습 동안 신경망 출력의 변화가 사전결정된 수준 이하인 경우 더 이상의 학습이 무의미한 것으로 판단하고 학습을 종료할 수 있다. 즉, 검증 데이터는 신경망의 반복 학습에서 에폭별 학습의 효과가 일정 이상인지 이하인지 여부에 기초하여 학습의 완료를 결정하는 데 이용될 수 있다. 전술한 학습 데이터, 검증 데이터의 수 및 반복 횟수는 예시일 뿐이며 본 개시는 이에 제한되지 않는다.The processor 120 may perform learning of the sentence subject judgment model more than a predetermined epoch, and then determine whether to stop learning using the verification data set. The predetermined epoch may be part of the overall learning target epoch. The processor 120 may use a part of the training data set as a verification data set. The verification data is data corresponding to the learning data, and may be data for determining whether to stop learning. The processor 120 may determine whether the learning effect of the sentence subject judgment model is greater than or equal to a predetermined level using the verification data after the learning of the sentence subject judgment model is repeated more than a predetermined epoch. For example, if the processor 120 performs learning with a target repetition number of times of 100,000 using 1 million pieces of learning data, after performing 10000 iterations of a predetermined epoch, and using 1000 pieces of verification data By performing 10 iterative learning (ie, 10 epochs), if the change in neural network output is less than or equal to a predetermined level during 10 iterative learning, it may be determined that further learning is meaningless and the learning may be terminated. That is, the verification data may be used to determine completion of learning based on whether the effect of learning by epoch in a repetitive learning of a neural network is greater than or equal to a certain level. The above-described learning data, the number of verification data, and the number of repetitions are only examples, and the present disclosure is not limited thereto.

본 개시의 일 실시예에 따라, 입력 문장의 벡터와 문장 완성도가 결정된 문장 벡터 각각과의 유사도에 기초하여 문장 완성도를 판단할 수 있다.According to an embodiment of the present disclosure, sentence completion may be determined based on the similarity between the vector of the input sentence and each of the sentence vectors whose sentence completion is determined.

프로세서(120)는 입력 문장을 전처리 모델을 통해 연산함으로써 입력 문장 벡터를 획득할 수 있다. 프로세서(120)는 입력 문장 벡터를 하나 이상의 완성 문장 벡터 및 하나 이상의 미완성 문장 벡터 각각과 비교하여 유사도를 획득할 수 있다. 프로세서(120)는 상기 비교한 유사도 값에 기초하여 입력 문장이 완성된 문장인지 여부를 판단할 수 있다. 프로세서(120)는 사용자 단말(200)로부터 입력 문장을 수신하는 경우, 각 입력 문장을 전처리 모델을 통해 연산하여 입력 문장 벡터를 획득하고, 각각의 입력 문장 벡터를 하나 이상의 완성 문장 벡터 및 하나 이상의 미완성 문장 벡터 각각과 비교한 유사도에 기초하여 문장 완성도를 판단할 수 있다. 프로세서(120)는 사용자 단말(200)로부터 수신한 하나 이상의 입력 문장을 결합한 입력 문장을 전처리 모델을 통해 연산하여 입력 문장 벡터를 획득하고, 하나 이상의 입력 문장을 결합한 입력 문장에 대한 입력 문장 벡터를 하나 이상의 완성 문장 벡터 및 하나 이상의 미완성 문장 벡터 각각과 비교한 유사도에 기초하여 문장 완성도를 판단할 수 있다. 프로세서(120)는 입력 문장 벡터가 완성 문장 벡터와 유사도가 높은 경우, 완성된 문장으로 판단할 수 있다. 프로세서(120)는 입력 문장 벡터가 미완성 문장 벡터와 유사도가 높은 경우, 미완성 문장으로 판단할 수 있다. 프로세서(120)는 입력 문장 벡터가 상기 하나 이상의 완성 문장 벡터 중 적어도 하나와 유사도가 임계값 이상인 경우, 문장 완성도가 상기 사전결정된 기준보다 높은 것으로 판단할 수 있다. 프로세서(120)는 입력 문장 벡터가 상기 하나 이상의 미완성 문장 벡터 중 적어도 하나와 유사도가 임계값 이상인 경우, 문장 완성도가 상기 사전결정된 기준보다 낮은 것으로 판단할 수 있다.The processor 120 may obtain an input sentence vector by calculating the input sentence through a pre-processing model. The processor 120 may obtain similarity by comparing the input sentence vector with each of one or more complete sentence vectors and one or more incomplete sentence vectors. The processor 120 may determine whether the input sentence is a completed sentence based on the comparison similarity value. When receiving the input sentence from the user terminal 200, the processor 120 calculates each input sentence through a pre-processing model to obtain an input sentence vector, and each input sentence vector is one or more complete sentence vectors and one or more incompletes. Sentence completion can be determined based on the similarity compared to each sentence vector. The processor 120 obtains an input sentence vector by computing an input sentence combining one or more input sentences received from the user terminal 200 through a pre-processing model, and obtains one input sentence vector for the input sentence combining one or more input sentences. The sentence completeness may be determined based on the similarity compared with each of the above completed sentence vectors and one or more incomplete sentence vectors. When the input sentence vector has a high degree of similarity with the completed sentence vector, the processor 120 may determine the completed sentence vector. When the input sentence vector has a high degree of similarity to the incomplete sentence vector, the processor 120 may determine the incomplete sentence. When the similarity between the input sentence vector and at least one of the one or more completion sentence vectors is equal to or greater than a threshold, the processor 120 may determine that the sentence completion is higher than the predetermined criterion. When the similarity between the input sentence vector and at least one of the one or more incomplete sentence vectors is greater than or equal to a threshold, the processor 120 may determine that sentence completion is lower than the predetermined criterion.

완성 문장 벡터는, 하나 이상의 기준 문장 또는 기준 답변을 상기 전처리 모델을 통해 연산함으로써 획득될 수 있다. 프로세서(120)는 메모리(130)에 저장된 하나 이상의 기준 문장 또는 기준 답변은, 완성된 문장인 것으로 판단할 수 있고, 상기 기준 문장 또는 기준 답변을 전처리 모델을 통해 연산한 벡터는 완성 문장 벡터로 결정할 수 있다. 예를 들어, 프로세서(120)는 메모리(130)에 저장된 기준 문장인 “암보험의 가입조건을 알려줘”는 완성된 문장인 것으로 판단하고, “암보험의 가입조건을 알려줘”을 전처리 모델을 통해 연산함으로써 획득한 벡터는 완성 문장 벡터로 결정할 수 있다. 전술한 완성 문장 벡터에 관한 기재는 예시일 뿐이며 본 개시는 이에 제한되지 않는다.The complete sentence vector may be obtained by calculating one or more reference sentences or reference answers through the pre-processing model. The processor 120 may determine that one or more reference sentences or reference answers stored in the memory 130 are completed sentences, and the vector that computes the reference sentences or reference answers through a pre-processing model is determined as a complete sentence vector. You can. For example, the processor 120 determines that the standard sentence stored in the memory 130, “tell me about the condition of cancer insurance”, is a completed sentence, and “tell me about the condition of cancer insurance” through the pre-processing model. The vector obtained by calculation can be determined as a complete sentence vector. The above description of the completed sentence vector is merely an example, and the present disclosure is not limited thereto.

미완성 문장 벡터는, 하나 이상의 기준 문장 또는 기준 답변 각각을 임의의 길이로 분할한 분할된 기준 문장 또는 분할된 기준 답변을 상기 전처리 모델을 통해 연산함으로써 획득될 수 있다. 프로세서(120)는 메모리(130)에 저장된 하나 이상의 기준 문장 또는 기준 답변을 임의의 길이로 분할한 문장은, 미완성 문장인 것으로 판단할 수 있고, 상기 임의의 길이로 분할한 문장을 전처리 모델을 통해 연산한 벡터는 미완성 문장 벡터로 결정할 수 있다. 예를 들어, 프로세서(120)는 메모리(130)에 저장된 기준 문장인 “암보험의 가입조건을 알려줘”를 임의의 길이로 분할한 “조건을 알려줘”에 대한 문장은 미완성 문장으로 판단하고, “조건을 알려줘”를 전처리 모델을 통해 연산함으로써 획득한 벡터는 미완성 문장 벡터로 결정할 수 있다. 전술한 완성 문장 벡터에 관한 기재는 예시일 뿐이며 본 개시는 이에 제한되지 않는다. The incomplete sentence vector may be obtained by calculating a divided reference sentence or a divided reference answer in which each of one or more reference sentences or reference answers is divided into arbitrary lengths through the pre-processing model. The processor 120 may determine that one or more reference sentences stored in the memory 130 or a sentence in which a reference answer is divided into an arbitrary length is an incomplete sentence, and the sentence divided into the arbitrary length through a pre-processing model The computed vector can be determined as an incomplete sentence vector. For example, the processor 120 determines that the sentence for “tell me the condition”, which is the reference sentence stored in the memory 130, “tell me about the condition of cancer insurance” divided by an arbitrary length, is an incomplete sentence, and the “ The vector obtained by calculating the conditional condition ”through the pre-processing model may be determined as an incomplete sentence vector. The above description of the completed sentence vector is merely an example, and the present disclosure is not limited thereto.

본 개시의 일 실시예에 따라, 제 2 입력 문장은, 제 1 입력 문장의 주제 및 서브 주제 중 적어도 하나에 기초하여 결정될 수 있다.According to an embodiment of the present disclosure, the second input sentence may be determined based on at least one of a subject and a sub subject of the first input sentence.

프로세서(120)는 제 1 입력 문장을 문장 주제 판단 모델을 통해 연산하여 상기 제 1 입력 문장에 대한 주제 및 서브 주제 중 적어도 하나를 획득할 수 있다. 프로세서(120)는 제 1 입력 문장에 대한 주제 및 서브 주제 중 적어도 하나를 메모리(130)에 저장할 수 있다. 프로세서(120)는 제 1 입력 문장 이후에 사용자 단말(200)로부터 수신한 입력 문장을 문장 주제 판단 모델을 통해 연산하여 주제 및 서브 주제 중 적어도 하나를 획득할 수 있다. 프로세서(120)는 제 1 입력 문장 이후에 수신한 입력 문장 각각에 대한 주제 및 서브 주제 중 적어도 하나를 메모리(130)에 저장된 제 1 입력 문장에 대한 주제 및 서브 주제 중 적어도 하나와 비교하여, 제 2 입력 문장을 결정할 수 있다.The processor 120 may obtain at least one of a subject and a sub-subject for the first input sentence by calculating a first input sentence through a sentence subject determination model. The processor 120 may store at least one of a subject and a sub subject for the first input sentence in the memory 130. The processor 120 may obtain at least one of a subject and a sub-subject by calculating an input sentence received from the user terminal 200 after the first input sentence through a sentence subject determination model. The processor 120 compares at least one of a subject and a sub-subject with respect to each input sentence received after the first input sentence with at least one of a subject and a sub-subject with respect to the first input sentence stored in the memory 130. 2 You can determine the input sentence.

프로세서(120)는 제 1 입력 문장을 문장 주제 판단 모델을 통해 연산하여 주제를 획득할 수 있다. 프로세서(120)는 제 1 입력 문장 이후에 수신한 입력 문장을 문장 주제 판단 모델을 통해 연산하여 주제 또는 서브 주제를 획득할 수 있다. 프로세서(120)는, 제 1 입력 문장 이후에 수신한 입력 문장을 문장 주제 판단 모델을 통해 연산하여 서브 주제를 획득하는 경우, 상기 제 1 입력 문장에 기초한 주제에 속하는 하나 이상의 서브 주제 중 하나의 서브 주제와 동일한 서브 주제인 경우, 해당 입력 문장을 제 2 입력 문장으로 결정할 수 있다.The processor 120 may obtain the subject by calculating the first input sentence through the sentence subject judgment model. The processor 120 may obtain a subject or sub-subject by calculating an input sentence received after the first input sentence through a sentence subject determination model. When the processor 120 obtains a sub-topic by calculating an input sentence received after the first input sentence through a sentence subject determination model, one of one or more sub-subjects belonging to the subject based on the first input sentence If the sub-subject is the same as the subject, the corresponding input sentence may be determined as the second input sentence.

예를 들어, 제 1 입력 문장이 “약관 대출은”인 경우, 프로세서(120)는 “약관 대출은”을 문장 주제 판단 모델을 통해 연산하여 “약관 대출” 주제를 획득할 수 있다. “약관 대출” 주제는, “정의”, “조건”, “계약 문서 작성” 등의 서브 주제를 포함할 수 있다. 프로세서(120)는 제 1 입력 문장 이후에 수신된 제 2 입력 문장인 “어느 경우에 받을 수 있어?”를 문장 주제 판단 모델을 통해 연산하여 “조건” 서브 주제를 획득할 수 있다. 상기 “조건” 서브 주제는, 제 1 입력 문장에 기초한 “약관 대출” 주제에 포함되는 하나 이상의 서브 주제 중 하나의 서브 주제와 동일하므로, 상기 입력 문장을 제 2 입력 문장으로 결정할 수 있다. 전술한 제 2 입력 문장을 결정하는 구체적인 기재는 예시일 뿐이며 본 개시는 이에 제한되지 않는다.For example, when the first input sentence is “contract loan”, the processor 120 may obtain “contract loan” subject by calculating “contract loan” through the sentence subject judgment model. The “Terms and Conditions” subject may include sub-themes such as “Definition”, “Conditions”, and “Creating Contract Documentation”. The processor 120 may obtain a “condition” sub-topic by calculating the second input sentence “in which case?” Received after the first input sentence through the sentence subject judgment model. Since the “condition” sub-subject is the same as one sub-subject among one or more sub-subjects included in the “term loan” subject based on the first input sentence, the input sentence can be determined as the second input sentence. The specific description for determining the second input sentence described above is merely an example, and the present disclosure is not limited thereto.

본 개시의 일 실시예에 따라, 문장 완성도가 사전결정된 기준 이상인 것으로 판단되는 입력 문장의 입력 문장 벡터와 기준 문장 벡터의 유사도에 기초하여 답변을 결정할 수 있다.According to an embodiment of the present disclosure, an answer may be determined based on the similarity between the input sentence vector of the input sentence and the reference sentence vector, which is determined to be more than a predetermined criterion.

프로세서(120)는 완성된 문장으로 판단되는 입력 문장에 기초하여 네트워크부(110)를 통해 답변을 사용자 단말(200)로 전송할 수 있다. 프로세서(120)는 사용자 단말(200)로부터 수신한 하나의 입력 문장이 완성된 문장으로 판단되는 경우, 상기 입력 문장에 기초한 답변을 네트워크부(110)를 통해 사용자 단말(200)로 전송할 수 있다. 프로세서(120)는 사용자 단말(200)로부터 수신한 하나 이상의 입력 문장을 결합한 입력 문장이 완성된 문장으로 판단되는 경우, 상기 결합한 입력 문장에 기초한 답변을 네트워크부(110)를 통해 사용자 단말(200)로 전송할 수 있다. 프로세서(120)는 입력 문장을 하나 이상의 네트워크 함수를 포함하는 전처리 모델을 통해 연산함으로써 입력 문장 벡터를 획득할 수 있다. 여기서 입력 문장은, 문장 완성도가 사전결정된 기준 이상인 것으로 판단되는 입력 문장일 수 있다. 프로세서(120)는 입력 문장 벡터와 상기 하나 이상의 기준 문장 벡터 각각에 대한 유사도를 연산할 수 있다. 프로세서(120)는 유사도가 임계 값 이상인 기준 문장이 있는 경우, 상기 유사도가 임계 값 이상인 기준 문장에 기초한 답변을 네트워크부(110)를 통해 사용자 단말로 전송할 수 있다. 기준 문장에 기초한 답변은, 메모리(130)에 기준 문장에 매칭되어 저장된 기준 문장의 질의 사항에 기초한 답변일 수 있다.The processor 120 may transmit an answer to the user terminal 200 through the network unit 110 based on the input sentence determined as the completed sentence. When it is determined that one input sentence received from the user terminal 200 is the completed sentence, the processor 120 may transmit an answer based on the input sentence to the user terminal 200 through the network unit 110. When the input sentence combining one or more input sentences received from the user terminal 200 is determined to be a complete sentence, the processor 120 answers the answer based on the combined input sentence through the network unit 110 to the user terminal 200 Can be transferred to. The processor 120 may obtain an input sentence vector by calculating an input sentence through a pre-processing model including one or more network functions. Here, the input sentence may be an input sentence judged to have a sentence completion level higher than a predetermined criterion. The processor 120 may calculate the similarity of each of the input sentence vector and the one or more reference sentence vectors. The processor 120 may transmit an answer based on the reference sentence having the similarity threshold or higher to the user terminal through the network unit 110 when there is a reference sentence having the similarity threshold or higher. The answer based on the reference sentence may be an answer based on the question of the reference sentence stored in the memory 130 by matching the reference sentence.

프로세서(120)는 상기 유사도가 임계 값 이상인 기준 문장이 없는 경우, 미분류 문장 표준 답변을 상기 사용자 단말로 전송할 수 있다. 프로세서(120)는, 사용자 단말(200)로부터 수신한 입력 문장이 완성된 문장으로 판단되나, 해당 입력 문장에 대한 답변을 찾을 수 없는 경우, 네트워크부(110)를 통해 사용자 단말(200)로 미분류 문장 표준 답변을 전송할 수 있다. 미분류 문장 표준 답변은, 사용자 단말(200)로부터 수신한 입력 문장에 대한 적절한 답변을 찾을 수 없다는 답변일 수 있다. 예를 들어, 미분류 문장 표준 답변은, “죄송하지만 질문하신 내용은 아직 학습 중입니다.”일 수 있다. 미분류 문장 표준 답변은, 사용자 단말(200)로부터 수신한 입력 문장에 대한 적절한 답변을 찾을 수 없어, 콜센터 또는 챗봇이 아닌 상담원과의 채팅으로 연결하는 링크를 포함하는 답변일 수 있다. 전술한 미분류 문장 표준 답변에 대한 기재는 예시일 뿐이며 본 개시는 이에 제한되지 않는다.When there is no reference sentence having the similarity level equal to or greater than the threshold, the processor 120 may transmit an unclassified sentence standard answer to the user terminal. The processor 120 determines that the input sentence received from the user terminal 200 is a completed sentence, but if an answer to the input sentence cannot be found, it is unclassified to the user terminal 200 through the network unit 110. Sentence standard responses can be sent. The unclassified sentence standard answer may be an answer that an appropriate answer to an input sentence received from the user terminal 200 cannot be found. For example, the standard answer to an unclassified sentence may be, "Sorry, but the question you are questioning is still learning." The unclassified sentence standard answer may be an answer including a link to a chat with an agent other than a call center or chatbot because an appropriate answer to the input sentence received from the user terminal 200 cannot be found. The description of the above-mentioned unclassified sentence standard answer is merely an example, and the present disclosure is not limited thereto.

본 개시의 일 실시예에 따라, 컴퓨팅 장치(100)가 사용자 단말(200)로부터 수신한 입력 문장의 문장 완성도가 사전결정된 기준 이상이 되기 전까지는, 입력 문장에 대한 답변을 하지 않고, 답변에 대한 검색을 보류함으로써, 사용자 단말(200)에 전송하는 답변의 정확도를 증가시킬 수 있다.According to an embodiment of the present disclosure, until the sentence completion degree of the input sentence received from the user terminal 200 is greater than or equal to a predetermined criterion, the computing device 100 does not respond to the input sentence and answers By holding the search, the accuracy of the response transmitted to the user terminal 200 can be increased.

메모리(130)는 본 개시의 일 실시예에 따른 챗봇의 구동 방법을 수행하기 위한 컴퓨터 프로그램을 저장할 수 있으며, 저장된 컴퓨터 프로그램은 프로세서(120)에 의하여 판독되어 구동될 수 있다.The memory 130 may store a computer program for performing a method of driving a chatbot according to an embodiment of the present disclosure, and the stored computer program may be read and driven by the processor 120.

본 개시의 실시예들에 따른 메모리(130)는 프로세서(120)의 동작을 위한 프로그램을 저장할 수 있고, 입/출력되는 데이터들(예를 들어, 입력 문장, 기준 문장, 답변 등)을 임시 또는 영구 저장할 수도 있다. 상기 메모리(130)는 디스플레이 및 음향에 관한 데이터를 저장할 수 있다. 메모리(130)는 플래시 메모리 타입(flash memory type), 하드디스크 타입(hard disk type), 멀티미디어 카드 마이크로 타입(multimedia card micro type), 카드 타입의 메모리(예를 들어 SD 또는 XD 메모리 등), 램(Random Access Memory, RAM), SRAM(Static Random Access Memory), 롬(Read-Only Memory, ROM), EEPROM(Electrically Erasable Programmable Read-Only Memory), PROM(Programmable Read-Only Memory), 자기 메모리, 자기 디스크, 광디스크 중 적 어도 하나의 타입의 저장매체를 포함할 수 있다.The memory 130 according to embodiments of the present disclosure may store a program for the operation of the processor 120, and temporarily input / output data (eg, input sentences, reference sentences, answers, etc.) or It can also be stored permanently. The memory 130 may store data related to a display and sound. The memory 130 is a flash memory type, a hard disk type, a multimedia card micro type, a card type memory (for example, SD or XD memory, etc.), RAM (Random Access Memory, RAM), Static Random Access Memory (SRAM), Read-Only Memory (ROM), Electrically Erasable Programmable Read-Only Memory (EEPROM), Programmable Read-Only Memory (PROM), magnetic memory, magnetic It may include at least one type of storage medium among disks and optical disks.

도 2는 본 개시의 일 실시예에 따른 서버와 사용자 단말 간의 흐름도를 도시한 도면이다.2 is a diagram illustrating a flowchart between a server and a user terminal according to an embodiment of the present disclosure.

컴퓨팅 장치(100)는 사용자 단말(200)로부터 제 1 입력 문장(1)을 수신할 수 있다. 제 1 입력 문장(1)은, 사용자 단말(200)로부터 수신한 첫번째 입력 문장일 수 있다. 제 1 입력 문장(1)은, 사용자 단말(200)로부터 수신한 하나 이상의 입력 문장 중 해당 입력 문장에 기초한 답변을 사용자 단말(200)로 전송하지 않은 문장 중 가장 먼저 수신한 입력 문장일 수 있다.The computing device 100 may receive the first input sentence 1 from the user terminal 200. The first input sentence 1 may be the first input sentence received from the user terminal 200. The first input sentence 1 may be an input sentence received first among sentences that do not transmit an answer based on the corresponding input sentence among the one or more input sentences received from the user terminal 200 to the user terminal 200.

컴퓨팅 장치(100)는 제 1 입력 문장에 대한 문장 완성도가 사전 결정된 기준 이상인지 여부를 결정(2)할 수 있다.The computing device 100 may determine (2) whether the sentence completion level for the first input sentence is greater than or equal to a predetermined criterion.

컴퓨팅 장치(100)는 제 1 입력 문장(1)에 대한 문장 완성도가 사전 결정된 기준 이상인 경우, 제 1 입력 문장에 기초한 답변(3)을 사용자 단말(200)에 전송할 수 있다. 제 1 입력 문장에 기초한 답변(3)은, 제 1 입력 문장(1)의 입력 문장 벡터와 하나 이상의 기준 문장 벡터의 유사도에 기초하여, 상기 유사도가 임계값 이상인 기준 문장 벡터에 기초하여 결정된 답변일 수 있다.The computing device 100 may transmit an answer 3 based on the first input sentence to the user terminal 200 when the sentence completion level for the first input sentence 1 is greater than or equal to a predetermined criterion. The answer (3) based on the first input sentence is an answer date determined based on the similarity between the input sentence vector of the first input sentence (1) and one or more reference sentence vectors, based on a reference sentence vector having the similarity of a threshold or higher. You can.

컴퓨팅 장치(100)는 제 1 입력 문장(1)에 대한 문장 완성도가 사전 결정된 기준 미만인 경우, 미완성 문장 표준 답변(4)을 사용자 단말(200)로 전송할 수 있다. 미완성 문장 표준 답변(4,11)은, 사용자 단말(200)로부터 수신한 입력 문장이 완성되지 않은 문장이라고 판단한 것에 기초하여, 사용자 단말(200)로부터 추가적인 입력 문장을 수신하기 위한 대기 답변을 의미할 수 있다.The computing device 100 may transmit the incomplete sentence standard answer 4 to the user terminal 200 when the sentence completion level for the first input sentence 1 is less than a predetermined criterion. The incomplete sentence standard answer (4, 11) means a waiting response for receiving an additional input sentence from the user terminal 200, based on the determination that the input sentence received from the user terminal 200 is an incomplete sentence. You can.

컴퓨팅 장치(100)는 사용자 단말(200)로부터 제 2 입력 문장(5)을 수신할 수 있다. 컴퓨팅 장치(100)는 제 2 입력 문장(5)의 문장 완성도를 판단할 수 있다. 컴퓨팅 장치(100)는 제 2 입력 문장에 대한 문장 완성도가 사전 결정된 기준 이상인지 여부를 결정(6)할 수 있다. 컴퓨팅 장치(100)는 제 2 입력 문장(5)의 문장 완성도가 사전결정된 기준보다 높은 경우, 사용자 단말(200)에 제 2 입력 문장에 기초한 답변(7)을 전송할 수 있다.The computing device 100 may receive the second input sentence 5 from the user terminal 200. The computing device 100 may determine sentence completion of the second input sentence 5. The computing device 100 may determine (6) whether sentence completion for the second input sentence is greater than or equal to a predetermined criterion. The computing device 100 may transmit the answer 7 based on the second input sentence to the user terminal 200 when the sentence completion level of the second input sentence 5 is higher than a predetermined criterion.

컴퓨팅 장치(100)는 제 2 입력 문장(5)의 문장 완성도가 사전결정된 기준보다 낮은 경우, 제 2 입력 문장(5)에 기초하여 사용자 단말(200)에 답변을 전송할 수 없음을 결정할 수 있다. 컴퓨팅 장치(100)는 제 2 입력 문장(5) 이전에 사용자 단말(200)로부터 수신한 다른 입력 문장(본 예시에서 제 1 입력 문장(1))과 결합하여 문장 완성도를 판단할 것을 결정할 수 있다. 컴퓨팅 장치(100)는 제 2 입력 문장의 문장 완성도가 사전결정된 기준 이하인 경우, 제 1 입력 문장 및 제 2 입력 문장을 결합한 제 3 입력 문장을 생성(8)할 수 있다. 컴퓨팅 장치(100)는 제 2 입력 문장(5)을 사용자 단말(200)로부터 수신하는 경우, 제 2 입력 문장에 대한 문장 완성도를 판단하는 과정 없이, 제 1 입력 문장 및 제 2 입력 문장을 결합한 제 3 입력 문장을 생성(8)할 수도 있다.The computing device 100 may determine that an answer cannot be transmitted to the user terminal 200 based on the second input sentence 5 when the sentence completion level of the second input sentence 5 is lower than a predetermined criterion. The computing device 100 may determine to determine sentence completion by combining with other input sentences (the first input sentence 1 in this example) received from the user terminal 200 before the second input sentence 5. . The computing device 100 may generate (8) a third input sentence combining the first input sentence and the second input sentence when the sentence completion level of the second input sentence is below a predetermined criterion. When the second input sentence 5 is received from the user terminal 200, the computing device 100 combines the first input sentence and the second input sentence without determining a sentence completion level for the second input sentence. 3 You can also create (8) input sentences.

컴퓨팅 장치(100)는 제 3 입력 문장에 대한 문장 완성도가 사전 결정된 기준 이상인지 여부를 결정(9)할 수 있다.The computing device 100 may determine whether the sentence completion level for the third input sentence is greater than or equal to a predetermined criterion (9).

컴퓨팅 장치(100)는 제 3 입력 문장(201)에 대한 문장 완성도가 사전 결정된 기준 이상인 경우, 제 3 입력 문장에 기초한 답변(10)을 사용자 단말(200)에 전송할 수 있다. 제 3 입력 문장에 기초한 답변(10)은, 제 3 입력 문장의 입력 문장 벡터와 하나 이상의 기준 문장 벡터의 유사도에 기초하여, 상기 유사도가 임계값 이상인 기준 문장 벡터에 기초하여 결정된 답변일 수 있다.The computing device 100 may transmit an answer 10 based on the third input sentence to the user terminal 200 when the sentence completion level for the third input sentence 201 is greater than or equal to a predetermined criterion. The answer 10 based on the third input sentence may be an answer determined based on the similarity between the input sentence vector of the third input sentence and one or more reference sentence vectors, based on the reference sentence vector having the similarity of a threshold or higher.

컴퓨팅 장치(100)는 제 3 입력 문장에 대한 문장 완성도가 사전 결정된 기준 미만인 경우, 미완성 문장 표준 답변(11)을 사용자 단말(200)로 전송할 수 있다. The computing device 100 may transmit the incomplete sentence standard answer 11 to the user terminal 200 when the sentence completion level for the third input sentence is less than a predetermined criterion.

도 3은 본 개시의 일 실시예에 따른 챗봇의 동작 인터페이스의 예시를 도시한 도면이다.3 is a diagram illustrating an example of an operation interface of a chatbot according to an embodiment of the present disclosure.

컴퓨팅 장치(100)는 사용자 단말(200)로부터 챗봇에 관련된 세션을 개설하기 위한 요청을 수신할 수 있다. 그리고, 컴퓨팅 장치(100)는 상기 요청에 기초하여 챗봇과 사용자 단말(200) 간의 채팅 세션을 개설할 수 있다. 또한, 컴퓨팅 장치(100)는 사용자 단말(200)로 사용자 단말(200)이 입력한 입력 문장을 출력하거나, 답변을 출력하기 위한 인터페이스를 전송할 수 있다.The computing device 100 may receive a request for establishing a session related to a chatbot from the user terminal 200. Then, the computing device 100 may establish a chat session between the chatbot and the user terminal 200 based on the request. Also, the computing device 100 may output an input sentence input by the user terminal 200 to the user terminal 200 or transmit an interface for outputting an answer.

컴퓨팅 장치(100)는 사용자 단말(200)로부터 입력 문장(311) "저기"를 수신할 수 있다. 여기서 입력 문장(311)은 인터페이스의 일 측(예컨대, 도 3에 도시된 바와 같이 좌측)에 인접하게 배치되고, 사용자 단말(200)을 식별할 수 있는 사용자 단말 아이콘(310)이 태그 될 수 있다. 컴퓨팅 장치(100)는 입력 문장(311)의 문장 완성도를 판단할 수 있다.The computing device 100 may receive the input sentence 311 "over there" from the user terminal 200. Here, the input sentence 311 is disposed adjacent to one side of the interface (for example, the left side as illustrated in FIG. 3), and a user terminal icon 310 capable of identifying the user terminal 200 may be tagged. . The computing device 100 may determine sentence completion of the input sentence 311.

컴퓨팅 장치(100)는 입력 문장(311)의 문장 완성도가 사전결정된 기준 보다 낮은 것으로 판단하는 경우, 미완성 표준 문장 답변(321) "네~ 고객님"을 사용자 단말(200)로 전송할 수 있다. 여기서 미완성 표준 문장 답변(321) "네~ 고객님"은 인터페이스의 반대편 일 측(예컨대, 도 3에 도시된 바와 같이 우측)에 인접하게 배치되고, 챗봇을 식별할 수 있는 챗봇 아이콘(320)이 태그 될 수 있다. If it is determined that the sentence completion level of the input sentence 311 is lower than a predetermined criterion, the computing device 100 may transmit the unfinished standard sentence answer 321 "yes ~ customer" to the user terminal 200. Here, the uncompleted standard sentence answer 321 "Yes ~ Customer" is placed adjacent to one side of the opposite side of the interface (for example, the right side as shown in FIG. 3), and the chatbot icon 320 that can identify the chatbot is tagged. Can be.

컴퓨팅 장치(100)는 사용자 단말(200)로부터 입력 문장(312) “암 보험 가입하려면”을 수신할 수 있다. 컴퓨팅 장치(100)는 입력 문장(312)의 문장 완성도가 사전결정된 기준 보다 낮은 것으로 판단하는 경우, 미완성 표준 문장 답변(322) "네~ 고객님"을 사용자 단말(200)로 전송할 수 있다.The computing device 100 may receive the input sentence 312 "To join cancer insurance" from the user terminal 200. If it is determined that the sentence completion level of the input sentence 312 is lower than a predetermined criterion, the computing device 100 may transmit the unfinished standard sentence answer 322 "yes ~ customer" to the user terminal 200.

컴퓨팅 장치(100)는 사용자 단말(200)로부터 입력 문장(313) “건강검진 받아야 돼?”을 수신할 수 있다. 컴퓨팅 장치(100)는 입력 문장(312)을 제 1 입력 문장으로 하고, 입력 문장(313)을 제 2 입력 문장으로 하여, 제 3 입력 문장 “암 보험 가입하려면 건강검진 받아야 돼?”을 생성할 수 있다.The computing device 100 may receive the input sentence 313 “Do you need to undergo a medical examination?” From the user terminal 200. The computing device 100 may generate an input sentence 312 as a first input sentence and an input sentence 313 as a second input sentence, and generate a third input sentence “Do I have to undergo a medical examination to get cancer insurance?” You can.

컴퓨팅 장치(100)는 제 3 입력 문장 “암 보험 가입하려면 건강검진 받아야 돼?”에 기초하여 결정된 답변(323) “회사에서 정한 기준에 따라, 그 대상자에 한해서~”을 사용자 단말(200)로 전송할 수 있다.The computing device 100 answers the answer 323 determined based on the third input sentence “Do I need to undergo a medical examination to get cancer insurance?” “According to the criteria set by the company, only those who are subject to the user terminal 200” Can transmit.

도 4는 본 개시의 일 실시예에 따른 챗봇의 구동 방법의 순서도이다.4 is a flowchart of a method for driving a chatbot according to an embodiment of the present disclosure.

컴퓨팅 장치(100)는 사용자 단말로부터 제 1 입력 문장을 수신(410)할 수 있다. 입력 문장은 컴퓨팅 장치(100)가 사용자 단말(200)로부터 수신하는 문장을 포함할 수 있다.The computing device 100 may receive 410 a first input sentence from the user terminal. The input sentence may include a sentence received by the computing device 100 from the user terminal 200.

컴퓨팅 장치(100)는 제 1 입력 문장의 문장 완성도를 판단(420)할 수 있다. 문장 완성도는, 사용자의 질의의 목적을 파악할 수 있는지 여부를 나타내는 척도일 수 있다.The computing device 100 may determine 420 the sentence completion level of the first input sentence. The sentence completion level may be a measure indicating whether a user's query purpose can be grasped.

컴퓨팅 장치(100)는 제 1 입력 문장에 대한 형태소 분석을 수행할 수 있다. 컴퓨팅 장치(100)는 상기 형태소 분석을 통해 상기 제 1 입력 문장에 포함된 문장의 구성 요소를 획득할 수 있다. 형태소 분석은 사용자 단말(200)로부터 수신한 입력 문장을 형태소 사전에 기초하여 입력 문장의 형태소를 분석하는 것일 수 있다. 컴퓨팅 장치(100)는 상기 문장의 구성요소에 기초하여 상기 제 1 입력 문장의 문장 완성도를 판단할 수 있다.The computing device 100 may perform morpheme analysis on the first input sentence. The computing device 100 may acquire a component of a sentence included in the first input sentence through the morpheme analysis. The morpheme analysis may be to analyze the morpheme of the input sentence based on the morpheme dictionary based on the input sentence received from the user terminal 200. The computing device 100 may determine the sentence completion degree of the first input sentence based on the components of the sentence.

컴퓨팅 장치(100)는 상기 제 1 입력 문장을 하나 이상의 네트워크 함수를 포함하는 전처리 모델을 통해 연산함으로써 입력 문장 벡터를 획득할 수 있다. 컴퓨팅 장치(100)는 상기 입력 문장 벡터와 하나 이상의 기준 문장에 대한 기준 문장 벡터 각각에 대한 유사도를 연산할 수 있다. 컴퓨팅 장치(100)는 상기 유사도가 임계 값 이상인 기준 문장이 있는 경우, 상기 문장 완성도가 상기 사전결정된 기준보다 높은 것으로 판단할 수 있다.The computing device 100 may obtain an input sentence vector by calculating the first input sentence through a pre-processing model including one or more network functions. The computing device 100 may calculate similarity for each of the input sentence vector and the reference sentence vector for one or more reference sentences. When there is a reference sentence in which the similarity is greater than or equal to a threshold, the computing device 100 may determine that the sentence completion is higher than the predetermined reference.

컴퓨팅 장치(100)는 상기 제 1 입력 문장을 문장 주제 판단 모델을 통해 연산할 수 있다. 컴퓨팅 장치(100)는 상기 연산을 통해 상기 제 1 입력 문장의 주제 및 서브 주제를 모두 획득하는 경우, 문장 완성도가 상기 사전결정된 기준보다 높은 것으로 판단할 수 있다. 컴퓨팅 장치(100)는 상기 연산을 통해 상기 주제 및 상기 서브 주제 중 적어도 하나를 획득하지 못하는 경우, 문장 완성도가 상기 사전결정된 기준보다 낮은 것으로 판단할 수 있다. 상기 문장 주제 판단 모델은, 하나 이상의 기준 문장 및 기준 답변 중 적어도 하나를 임의의 길이로 분할하여 생성된 학습 입력 문장 및 상기 학습 입력 문장에 라벨링 된 주제 및 서브 주제 중 적어도 하나 포함하는 학습 데이터 세트에 기초하여 상기 학습 입력 문장을 입력하는 경우 상기 주제 및 상기 서브 주제 중 적어도 하나를 출력하도록 학습된 모델일 수 있다.The computing device 100 may calculate the first input sentence through a sentence subject judgment model. When the subject and sub-subjects of the first input sentence are obtained through the operation, the computing device 100 may determine that the sentence completion is higher than the predetermined criterion. When the computing device 100 does not acquire at least one of the subject and the sub-subject through the operation, it may determine that sentence completion is lower than the predetermined criterion. The sentence subject determination model includes a learning data set generated by dividing at least one of the one or more reference sentences and the reference answers into an arbitrary length, and a learning data set including at least one of a learning input sentence and a subject and sub-themes labeled in the learning input sentence. On the basis of the input of the learning input sentence, it may be a model trained to output at least one of the subject and the sub subject.

컴퓨팅 장치(100)는 상기 제 1 입력 문장을 전처리 모델을 통해 연산함으로써 제 1 입력 문장 벡터를 획득할 수 있다. 컴퓨팅 장치(100)는 상기 제 1 입력 문장 벡터가 상기 하나 이상의 완성 문장 벡터 중 적어도 하나와 유사도가 임계값 이상인 경우, 문장 완성도가 상기 사전결정된 기준보다 높은 것으로 판단할 수 있다. 컴퓨팅 장치(100)는 상기 제 1 입력 문장 벡터가 상기 하나 이상의 미완성 문장 벡터 중 적어도 하나와 유사도가 임계값 이상인 경우, 문장 완성도가 상기 사전결정된 기준보다 낮은 것으로 판단할 수 있다. 상기 완성 문장 벡터는, 상기 하나 이상의 기준 문장 또는 기준 답변을 상기 전처리 모델을 통해 연산함으로써 획득될 수 있고, 그리고 상기 미완성 문장 벡터는, 하나 이상의 기준 문장 또는 기준 답변 각각을 임의의 길이로 분할한 분할된 기준 문장 또는 분할된 기준 답변을 상기 전처리 모델을 통해 연산함으로써 획득될 수 있다.The computing device 100 may obtain a first input sentence vector by calculating the first input sentence through a pre-processing model. When the similarity between the first input sentence vector and at least one of the one or more completion sentence vectors is equal to or greater than a threshold, the computing device 100 may determine that the sentence completion is higher than the predetermined criterion. When the similarity between the first input sentence vector and at least one of the one or more incomplete sentence vectors is equal to or greater than a threshold, the computing device 100 may determine that the sentence completion is lower than the predetermined criterion. The completed sentence vector may be obtained by calculating the one or more reference sentences or reference answers through the pre-processing model, and the incomplete sentence vector may be divided by dividing each of the one or more reference sentences or reference answers into arbitrary lengths. It can be obtained by calculating a reference sentence or a divided reference answer through the pre-processing model.

컴퓨팅 장치(100)는 상기 제 1 입력 문장의 문장 완성도가 사전결정된 기준보다 낮은 것으로 판단되는 경우, 미완성 문장 표준 답변을 상기 사용자 단말로 전송(430)할 수 있다. 미완성 문장 표준 답변은, 사용자 단말(200)로부터 수신한 입력 문장이 완성되지 않은 문장이라고 판단한 것에 기초하여, 사용자 단말(200)로부터 추가적인 입력 문장을 수신하기 위한 대기 답변을 의미할 수 있다.When it is determined that the sentence completion level of the first input sentence is lower than a predetermined criterion, the computing device 100 may transmit an unfinished sentence standard answer to the user terminal (430). The incomplete sentence standard answer may mean a waiting response for receiving an additional input sentence from the user terminal 200 based on the determination that the input sentence received from the user terminal 200 is an incomplete sentence.

컴퓨팅 장치(100)는 사용자 단말로부터 제 2 입력 문장을 수신(440)할 수 있다. 상기 제 2 입력 문장은, 상기 제 1 입력 문장을 상기 사용자 단말로부터 수신한 시점부터 사전결정된 시간 범위 이내에 수신되거나 또는, 상기 제 1 입력 문장을 상기 사용자 단말로부터 수신한 이후 상기 사용자 단말로부터 수신되는 사전결정된 개수 범위 내의 입력 문장일 수 있다.The computing device 100 may receive 440 a second input sentence from the user terminal. The second input sentence is received within a predetermined time range from the time when the first input sentence is received from the user terminal, or a dictionary received from the user terminal after receiving the first input sentence from the user terminal It may be an input sentence within the determined number range.

컴퓨팅 장치(100)는 상기 제 1 입력 문장을 문장 주제 판단 모델을 통해 연산하여 상기 제 1 입력 문장에 대한 주제 및 서브 주제 중 적어도 하나를 획득할 수 있다. 컴퓨팅 장치(100)는 상기 사용자 단말로부터 상기 제 1 입력 문장 이후에 수신한 하나 이상의 입력 문장 중 상기 획득된 주제 및 서브 주제 중 적어도 하나에 기초하여 제 2 입력 문장을 결정할 수 있다.The computing device 100 may obtain at least one of a subject and a sub-subject for the first input sentence by calculating the first input sentence through a sentence subject determination model. The computing device 100 may determine a second input sentence based on at least one of the acquired subject and sub-subject among one or more input sentences received after the first input sentence from the user terminal.

상기 제 2 입력 문장은, 상기 제 1 입력 문장 이후에 수신된 하나 이상의 입력 문장 중 문장 기호를 포함하는 입력 문장일 수 있다.The second input sentence may be an input sentence including a sentence symbol among one or more input sentences received after the first input sentence.

컴퓨팅 장치(100)는 상기 제 1 입력 문장 및 상기 제 2 입력 문장을 결합한 제 3 입력 문장을 생성(450)할 수 있다. The computing device 100 may generate 450 a third input sentence combining the first input sentence and the second input sentence.

컴퓨팅 장치(100)는 제 3 입력 문장의 문장 완성도를 판단(460)할 수 있다. The computing device 100 may determine 460 the sentence completion level of the third input sentence.

컴퓨팅 장치(100)는 제 3 입력 문장의 문장 완성도가 상기 사전결정된 기준보다 높은 것으로 판단되는 경우, 상기 제 3 입력 문장에 기초한 답변을 상기 사용자 단말로 전송(470)할 수 있다. 컴퓨팅 장치(100)는 상기 제 3 입력 문장을 하나 이상의 네트워크 함수를 포함하는 전처리 모델을 통해 연산함으로써 입력 문장 벡터를 획득할 수 있다. 컴퓨팅 장치(100)는 상기 입력 문장 벡터와 상기 하나 이상의 기준 문장 벡터 각각에 대한 유사도를 연산할 수 있다. 컴퓨팅 장치(100)는 상기 유사도가 임계 값 이상인 기준 문장이 있는 경우, 상기 유사도가 임계 값 이상인 기준 문장에 기초한 답변을 상기 사용자 단말로 전송할 수 있다. 컴퓨팅 장치(100)는 상기 유사도가 임계 값 이상인 기준 문장이 없는 경우, 미분류 문장 표준 답변을 상기 사용자 단말로 전송할 수 있다. 미분류 문장 표준 답변은, 사용자 단말(200)로부터 수신한 입력 문장에 대한 적절한 답변을 찾을 수 없다는 답변일 수 있다.When it is determined that the sentence completion degree of the third input sentence is higher than the predetermined criterion, the computing device 100 may transmit 470 an answer based on the third input sentence to the user terminal. The computing device 100 may obtain an input sentence vector by calculating the third input sentence through a pre-processing model including one or more network functions. The computing device 100 may calculate similarity between each of the input sentence vector and the one or more reference sentence vectors. The computing device 100 may transmit an answer based on the reference sentence having the similarity of the threshold value or higher to the user terminal when there is a reference sentence having the similarity of the threshold value or higher. The computing device 100 may transmit the unclassified sentence standard answer to the user terminal when there is no reference sentence having the similarity level greater than or equal to the threshold. The unclassified sentence standard answer may be an answer that an appropriate answer to an input sentence received from the user terminal 200 cannot be found.

도 5는 본 개시의 일 실시예에 따른 챗봇의 구동 방법을 구현하기 위한 로직을 도시한 블록 구성도이다.5 is a block diagram illustrating logic for implementing a method of driving a chatbot according to an embodiment of the present disclosure.

본 개시의 일 실시예에 따라 챗봇의 구동 방법은 다음과 같은 로직에 의해 구현될 수 있다.According to an embodiment of the present disclosure, a method of driving a chatbot may be implemented by the following logic.

본 개시의 일 실시예에 따라 사용자 단말로부터 제 1 입력 문장을 수신하기 위한 로직(510); 상기 제 1 입력 문장의 문장 완성도를 판단하기 위한 로직(520); 상기 제 1 입력 문장의 문장 완성도가 사전결정된 기준보다 낮은 것으로 판단되는 경우, 미완성 문장 표준 답변을 상기 사용자 단말로 전송하기 위한 로직(530); 상기 사용자 단말로부터 제 2 입력 문장을 수신하기 위한 로직(540); 상기 제 1 입력 문장 및 상기 제 2 입력 문장을 결합한 제 3 입력 문장을 생성하기 위한 로직(550); 상기 제 3 입력 문장의 문장 완성도를 판단하기 위한 로직(560); 상기 제 3 입력 문장의 문장 완성도가 상기 사전결정된 기준보다 높은 것으로 판단되는 경우, 상기 제 3 입력 문장에 기초한 답변을 상기 사용자 단말로 전송하기 위한 로직(570)에 의하여 구현될 수 있다.A logic 510 for receiving a first input sentence from a user terminal according to an embodiment of the present disclosure; Logic (520) for determining sentence completion of the first input sentence; A logic 530 for transmitting an incomplete sentence standard answer to the user terminal when it is determined that the sentence completion level of the first input sentence is lower than a predetermined criterion; Logic 540 for receiving a second input sentence from the user terminal; A logic 550 for generating a third input sentence combining the first input sentence and the second input sentence; Logic (560) for determining sentence completion of the third input sentence; When it is determined that the sentence completion degree of the third input sentence is higher than the predetermined criterion, it may be implemented by logic 570 for transmitting an answer based on the third input sentence to the user terminal.

본 개시의 일 실시예에 따라 상기 제 1 입력 문장의 문장 완성도를 판단하기 위한 로직(520)은, 상기 제 1 입력 문장에 대한 형태소 분석을 수행하기 위한 로직; 상기 형태소 분석을 통해 상기 제 1 입력 문장에 포함된 문장의 구성 요소를 획득하기 위한 로직; 및 상기 문장의 구성요소에 기초하여 상기 제 1 입력 문장의 문장 완성도를 판단하기 위한 로직에 의하여 구현될 수도 있다.According to an embodiment of the present disclosure, the logic 520 for determining sentence completion of the first input sentence includes: logic for performing a morpheme analysis on the first input sentence; Logic for obtaining a component of a sentence included in the first input sentence through the morpheme analysis; And logic for determining sentence completion of the first input sentence based on the components of the sentence.

본 개시의 일 실시예에 따라 상기 제 1 입력 문장의 문장 완성도를 판단하기 위한 로직(520)은, 상기 제 1 입력 문장을 하나 이상의 네트워크 함수를 포함하는 전처리 모델을 통해 연산함으로써 입력 문장 벡터를 획득하기 위한 로직; 상기 입력 문장 벡터와 하나 이상의 기준 문장에 대한 기준 문장 벡터 각각에 대한 유사도를 연산하기 위한 로직; 및 상기 유사도가 임계 값 이상인 기준 문장이 있는 경우, 상기 문장 완성도가 상기 사전결정된 기준보다 높은 것으로 판단하기 위한 로직에 의하여 구현될 수도 있다.According to an embodiment of the present disclosure, logic 520 for determining sentence completion of the first input sentence obtains an input sentence vector by calculating the first input sentence through a pre-processing model including one or more network functions. Logic to do; Logic for calculating similarity to each of the input sentence vector and the reference sentence vector for one or more reference sentences; And when there is a reference sentence having the similarity level equal to or greater than a threshold value, the sentence completion level may be implemented by logic for determining that it is higher than the predetermined criterion.

본 개시의 일 실시예에 따라 상기 제 1 입력 문장의 문장 완성도를 판단하기 위한 로직(520)은, 상기 제 1 입력 문장을 문장 주제 판단 모델을 통해 연산하기 위한 로직; 상기 연산을 통해 상기 제 1 입력 문장의 주제 및 서브 주제를 모두 획득하는 경우, 문장 완성도가 상기 사전결정된 기준보다 높은 것으로 판단하기 위한 로직; 및 상기 연산을 통해 상기 주제 및 상기 서브 주제 중 적어도 하나를 획득하지 못하는 경우, 문장 완성도가 상기 사전결정된 기준보다 낮은 것으로 판단하기 위한 로직에 의하여 구현될 수도 있다.According to an embodiment of the present disclosure, logic 520 for determining sentence completion of the first input sentence includes: logic for calculating the first input sentence through a sentence subject determination model; Logic for determining that sentence completion is higher than the predetermined criterion when both the subject and the sub-subject of the first input sentence are obtained through the operation; And when at least one of the subject and the sub-subject is not obtained through the operation, the sentence completion level may be implemented by logic for determining that the predetermined level is lower than the predetermined criterion.

본 개시의 일 실시예에 따라 상기 제 1 입력 문장의 문장 완성도를 판단하기 위한 로직(520)은, 상기 제 1 입력 문장을 전처리 모델을 통해 연산함으로써 제 1 입력 문장 벡터를 획득하기 위한 로직; 상기 제 1 입력 문장 벡터가 상기 하나 이상의 완성 문장 벡터 중 적어도 하나와 유사도가 임계값 이상인 경우, 문장 완성도가 상기 사전결정된 기준보다 높은 것으로 판단하기 위한 로직; 및 상기 제 1 입력 문장 벡터가 상기 하나 이상의 미완성 문장 벡터 중 적어도 하나와 유사도가 임계값 이상인 경우, 문장 완성도가 상기 사전결정된 기준보다 낮은 것으로 판단하기 위한 로직에 의하여 구현될 수도 있다.According to an embodiment of the present disclosure, logic 520 for determining sentence completion of the first input sentence includes: logic for obtaining a first input sentence vector by calculating the first input sentence through a pre-processing model; Logic for determining that sentence completion is higher than the predetermined criterion when the first input sentence vector has a similarity to a threshold value of at least one of the one or more completion sentence vectors; And when the similarity between the first input sentence vector and at least one of the one or more incomplete sentence vectors is greater than or equal to a threshold, the sentence completion level may be implemented by logic for determining that the sentence completion level is lower than the predetermined criterion.

본 개시의 일 실시예에 따라 상기 제 1 입력 문장을 문장 주제 판단 모델을 통해 연산하여 상기 제 1 입력 문장에 대한 주제 및 서브 주제 중 적어도 하나를 획득하기 위한 로직; 및 상기 사용자 단말로부터 상기 제 1 입력 문장 이후에 수신한 하나 이상의 입력 문장 중 상기 획득된 주제 및 서브 주제 중 적어도 하나에 기초하여 제 2 입력 문장을 결정하기 위한 로직에 의하여 구현될 수도 있다.Logic for acquiring at least one of a subject and a sub-subject for the first input sentence by calculating the first input sentence through a sentence subject judgment model according to an embodiment of the present disclosure; And a logic for determining a second input sentence based on at least one of the obtained subject and sub-subjects from one or more input sentences received after the first input sentence from the user terminal.

본 개시의 일 실시예에 따라 상기 제 3 입력 문장의 문장 완성도가 상기 사전결정된 기준보다 높은 것으로 판단되는 경우, 상기 제 3 입력 문장에 기초한 답변을 상기 사용자 단말로 전송하기 위한 로직(570)은, 상기 제 3 입력 문장을 하나 이상의 네트워크 함수를 포함하는 전처리 모델을 통해 연산함으로써 입력 문장 벡터를 획득하기 위한 로직; 상기 입력 문장 벡터와 상기 하나 이상의 기준 문장 벡터 각각에 대한 유사도를 연산하기 위한 로직; 상기 유사도가 임계 값 이상인 기준 문장이 있는 경우, 상기 유사도가 임계 값 이상인 기준 문장에 기초한 답변을 상기 사용자 단말로 전송하기 위한 로직; 및 상기 유사도가 임계 값 이상인 기준 문장이 없는 경우, 미분류 문장 표준 답변을 상기 사용자 단말로 전송하기 위한 로직에 의하여 구현될 수도 있다.According to an embodiment of the present disclosure, when it is determined that the sentence completion degree of the third input sentence is higher than the predetermined criterion, the logic 570 for transmitting an answer based on the third input sentence to the user terminal, Logic for obtaining an input sentence vector by computing the third input sentence through a pre-processing model including one or more network functions; Logic for calculating similarity to each of the input sentence vector and the one or more reference sentence vectors; Logic for transmitting an answer based on the reference sentence having the similarity of the threshold value or higher to the user terminal when there is a reference sentence having the similarity of the threshold value or higher; And when there is no reference sentence having a similarity or higher than a threshold value, it may be implemented by logic for transmitting an unclassified sentence standard answer to the user terminal.

본 개시의 일 실시예에 따른 챗봇의 구동 방법을 구현하기 위한 로직은, 챗봇의 구동 방법을 구현하기 위한 모듈, 회로 또는 수단에 의하여 구현될 수도 있다.Logic for implementing a method for driving a chatbot according to an embodiment of the present disclosure may be implemented by a module, circuit, or means for implementing a method for driving a chatbot.

도 6은 본 개시의 일 실시예에 따른 컴퓨팅 장치의 블록 구성도이다.6 is a block diagram of a computing device according to an embodiment of the present disclosure.

도 6은 본 개시의 실시예들이 구현될 수 있는 예시적인 컴퓨팅 환경에 대한 간략하고 일반적인 개략도를 도시한다.6 shows a simplified and general schematic diagram of an exemplary computing environment in which embodiments of the present disclosure can be implemented.

본 개시가 일반적으로 하나 이상의 컴퓨터 상에서 실행될 수 있는 컴퓨터 실행가능 명령어와 관련하여 전술되었지만, 당업자라면 본 개시가 기타 프로그램 모듈들과 결합되어 및/또는 하드웨어와 소프트웨어의 조합으로써 구현될 수 있다는 것을 잘 알 것이다.Although the present disclosure has been described above in general with respect to computer-executable instructions that can be executed on one or more computers, those skilled in the art will appreciate that the present disclosure can be implemented in combination with other program modules and / or as a combination of hardware and software. will be.

일반적으로, 프로그램 모듈은 특정의 태스크를 수행하거나 특정의 추상 데이터 유형을 구현하는 루틴, 프로그램, 컴포넌트, 데이터 구조, 기타 등등을 포함한다. 또한, 당업자라면 본 개시의 방법이 단일-프로세서 또는 멀티프로세서 컴퓨터 시스템, 미니컴퓨터, 메인 프레임 컴퓨터는 물론 퍼스널 컴퓨터, 핸드헬드 컴퓨팅 장치, 마이크로프로세서-기반 또는 프로그램가능 가전 제품, 기타 등등(이들 각각은 하나 이상의 연관된 장치와 연결되어 동작할 수 있음)을 비롯한 다른 컴퓨터 시스템 구성으로 실시될 수 있다는 것을 잘 알 것이다.Generally, program modules include routines, programs, components, data structures, etc. that perform particular tasks or implement particular abstract data types. In addition, those of ordinary skill in the art may find that the methods of the present disclosure include single-processor or multiprocessor computer systems, minicomputers, mainframe computers, as well as personal computers, handheld computing devices, microprocessor-based or programmable consumer electronics, and the like (each of which is). It will be appreciated that it may be implemented in other computer system configurations, including one that may operate in conjunction with one or more associated devices.

본 개시의 설명된 실시예들은 또한 어떤 태스크들이 통신 네트워크를 통해 연결되어 있는 원격 처리 장치들에 의해 수행되는 분산 컴퓨팅 환경에서 실시될 수 있다. 분산 컴퓨팅 환경에서, 프로그램 모듈은 로컬 및 원격 메모리 저장 장치 둘 다에 위치할 수 있다.The described embodiments of the present disclosure may also be practiced in distributed computing environments where certain tasks are performed by remote processing devices that are linked through a communications network. In a distributed computing environment, program modules may be located in both local and remote memory storage devices.

컴퓨터는 통상적으로 다양한 컴퓨터 판독가능 매체를 포함한다. 컴퓨터에 의해 액세스 가능한 매체는 그 어떤 것이든지 컴퓨터 판독가능 매체가 될 수 있다. 컴퓨터 판독가능 매체는 휘발성 및 비 휘발성 매체, 일시적(transitory) 및 비일시적(non-transitory) 매체, 이동식 및 비-이동식 매체를 포함한다. 제한이 아닌 예로서, 컴퓨터 판독가능 매체는 컴퓨터 판독가능 저장 매체 및 컴퓨터 판독가능 전송 매체를 포함할 수 있다. 컴퓨터 판독가능 저장 매체는 컴퓨터 판독가능 명령어, 데이터 구조, 프로그램 모듈 또는 기타 데이터와 같은 정보를 저장하는 임의의 방법 또는 기술로 구현되는 휘발성 및 비 휘발성 매체, 일시적 및 비-일시적 매체, 이동식 및 비이동식 매체를 포함한다. 컴퓨터 저장 매체는 RAM, ROM, EEPROM, 플래시 메모리 또는 기타 메모리 기술, CD-ROM, DVD(digital video disk) 또는 기타 광 디스크 저장 장치, 자기 카세트, 자기 테이프, 자기 디스크 저장 장치 또는 기타 자기 저장 장치, 또는 컴퓨터에 의해 액세스될 수 있고 원하는 정보를 저장하는 데 사용될 수 있는 임의의 기타 매체를 포함하지만, 이에 한정되지 않는다.Computers typically include a variety of computer readable media. Any computer-accessible medium can be a computer-readable medium. Computer-readable media includes volatile and non-volatile media, transitory and non-transitory media, removable and non-removable media. By way of example, and not limitation, computer readable media may include computer readable storage media and computer readable transmission media. Computer-readable storage media are volatile and non-volatile media, temporary and non-transitory media, removable and non-removable media implemented in any method or technology for storing information such as computer readable instructions, data structures, program modules or other data Includes media. Computer storage media includes RAM, ROM, EEPROM, flash memory or other memory technology, CD-ROM, digital video disk (DVD) or other optical disk storage devices, magnetic cassettes, magnetic tapes, magnetic disk storage devices or other magnetic storage devices, Or any other medium that can be accessed by a computer and used to store desired information.

컴퓨터 판독가능 전송 매체는 통상적으로 반송파(carrier wave) 또는 기타 전송 메커니즘(transport mechanism)과 같은 피 변조 데이터 신호(modulated data signal)에 컴퓨터 판독가능 명령어, 데이터 구조, 프로그램 모듈 또는 기타 데이터 등을 구현하고 모든 정보 전달 매체를 포함한다. 피 변조 데이터 신호라는 용어는 신호 내에 정보를 인코딩하도록 그 신호의 특성들 중 하나 이상을 설정 또는 변경시킨 신호를 의미한다. 제한이 아닌 예로서, 컴퓨터 판독가능 전송 매체는 유선 네트워크 또는 직접 배선 접속(direct-wired connection)과 같은 유선 매체, 그리고 음향, RF, 적외선, 기타 무선 매체와 같은 무선 매체를 포함한다. 상술된 매체들 중 임의의 것의 조합도 역시 컴퓨터 판독가능 전송 매체의 범위 안에 포함되는 것으로 한다.Computer readable transmission media typically embodies computer readable instructions, data structures, program modules or other data in a modulated data signal, such as a carrier wave or other transport mechanism, and Includes all information delivery media. The term modulated data signal means a signal in which one or more of the characteristics of the signal are set or changed to encode information in the signal. By way of example, and not limitation, computer readable transmission media includes wired media such as a wired network or direct-wired connection, and wireless media such as acoustic, RF, infrared, and other wireless media. Combinations of any of the above-described media are also intended to be included within the scope of computer-readable transmission media.

컴퓨터(1102)를 포함하는 본 개시의 여러가지 측면들을 구현하는 예시적인 환경(1100)이 나타내어져 있으며, 컴퓨터(1102)는 처리 장치(1104), 시스템 메모리(1106) 및 시스템 버스(1108)를 포함한다. 시스템 버스(1108)는 시스템 메모리(1106)(이에 한정되지 않음)를 비롯한 시스템 컴포넌트들을 처리 장치(1104)에 연결시킨다. 처리 장치(1104)는 다양한 상용 프로세서들 중 임의의 프로세서일 수 있다. 듀얼 프로세서 및 기타 멀티프로세서 아키텍처도 역시 처리 장치(1104)로서 이용될 수 있다.An exemplary environment 1100 is shown that implements various aspects of the present disclosure, including a computer 1102, and the computer 1102 includes a processing device 1104, a system memory 1106, and a system bus 1108. do. System bus 1108 connects system components, including, but not limited to, system memory 1106 to processing device 1104. The processing device 1104 can be any of a variety of commercial processors. Dual processor and other multiprocessor architectures may also be used as the processing unit 1104.

시스템 버스(1108)는 메모리 버스, 주변장치 버스, 및 다양한 상용 버스 아키텍처 중 임의의 것을 사용하는 로컬 버스에 추가적으로 상호 연결될 수 있는 몇 가지 유형의 버스 구조 중 임의의 것일 수 있다. 시스템 메모리(1106)는 판독 전용 메모리(ROM)(1110) 및 랜덤 액세스 메모리(RAM)(1112)를 포함한다. 기본 입/출력 시스템(BIOS)은 ROM, EPROM, EEPROM 등의 비 휘발성 메모리(1110)에 저장되며, 이 BIOS는 시동 중과 같은 때에 컴퓨터(1102) 내의 구성요소들 간에 정보를 전송하는 일을 돕는 기본적인 루틴을 포함한다. RAM(1112)은 또한 데이터를 캐싱하기 위한 정적 RAM 등의 고속 RAM을 포함할 수 있다.The system bus 1108 can be any of several types of bus structures that can be further interconnected to a memory bus, peripheral bus, and local bus using any of a variety of commercial bus architectures. System memory 1106 includes read-only memory (ROM) 1110 and random access memory (RAM) 1112. The basic input / output system (BIOS) is stored in a non-volatile memory 1110 such as ROM, EPROM, EEPROM, etc., and this BIOS is basic to help transfer information between components in the computer 1102 at the same time as during startup. Contains routines. The RAM 1112 may also include high-speed RAM, such as static RAM for caching data.

컴퓨터(1102)는 또한 내장형 하드 디스크 드라이브(HDD)(1114)(예를 들어, EIDE, SATA)-이 내장형 하드 디스크 드라이브(1114)는 또한 적당한 섀시(도시 생략) 내에서 외장형 용도로 구성될 수 있음-, 자기 플로피 디스크 드라이브(FDD)(1116)(예를 들어, 이동식 디스켓(1118)으로부터 판독을 하거나 그에 기록을 하기 위한 것임), 및 광 디스크 드라이브(1120)(예를 들어, CD-ROM 디스크(1122)를 판독하거나 DVD 등의 기타 고용량 광 매체로부터 판독을 하거나 그에 기록을 하기 위한 것임)를 포함한다. 하드 디스크 드라이브(1114), 자기 디스크 드라이브(1116) 및 광 디스크 드라이브(1120)는 각각 하드 디스크 드라이브 인터페이스(1124), 자기 디스크 드라이브 인터페이스(1126) 및 광 드라이브 인터페이스(1128)에 의해 시스템 버스(1108)에 연결될 수 있다. 외장형 드라이브 구현을 위한 인터페이스(1124)는 USB(Universal Serial Bus) 및 IEEE 1394 인터페이스 기술 중 적어도 하나 또는 그 둘 다를 포함한다.Computer 1102 is also an internal hard disk drive (HDD) 1114 (eg, EIDE, SATA)-this internal hard disk drive 1114 can also be configured for external use within a suitable chassis (not shown). Yes, magnetic floppy disk drive (FDD) 1116 (e.g., for reading from or writing to removable diskette 1118), and optical disk drive 1120 (e.g., CD-ROM Disc 1122, or for reading from or writing to other high-capacity optical media such as a DVD). The hard disk drive 1114, the magnetic disk drive 1116, and the optical disk drive 1120 are respectively connected to the system bus 1108 by the hard disk drive interface 1124, the magnetic disk drive interface 1126, and the optical drive interface 1128. ). The interface 1124 for implementing an external drive includes at least one or both of Universal Serial Bus (USB) and IEEE 1394 interface technologies.

이들 드라이브 및 그와 연관된 컴퓨터 판독가능 매체는 데이터, 데이터 구조, 컴퓨터 실행가능 명령어, 기타 등등의 비휘발성 저장을 제공한다. 컴퓨터(1102)의 경우, 드라이브 및 매체는 임의의 데이터를 적당한 디지털 형식으로 저장하는 것에 대응한다. 상기에서의 컴퓨터 판독가능 매체에 대한 설명이 HDD, 이동식 자기 디스크, 및 CD 또는 DVD 등의 이동식 광 매체를 언급하고 있지만, 당업자라면 집 드라이브(zip drive), 자기 카세트, 플래쉬 메모리 카드, 카트리지, 기타 등등의 컴퓨터에 의해 판독가능한 다른 유형의 매체도 역시 예시적인 운영 환경에서 사용될 수 있으며 또 임의의 이러한 매체가 본 개시의 방법들을 수행하기 위한 컴퓨터 실행가능 명령어를 포함할 수 있다는 것을 잘 알 것이다.These drives and their associated computer-readable media provide non-volatile storage of data, data structures, computer-executable instructions, and the like. In the case of computer 1102, drives and media correspond to storing any data in a suitable digital format. Although the above description of computer readable media refers to HDDs, removable magnetic disks, and removable optical media such as CDs or DVDs, those of ordinary skill in the art can use a zip drive, magnetic cassette, flash memory card, cartridge, etc. It will be appreciated that other types of media readable by a computer, etc., may also be used in the exemplary operating environment and any such media may include computer-executable instructions for performing the methods of the present disclosure.

운영 체제(1130), 하나 이상의 애플리케이션 프로그램(1132), 기타 프로그램 모듈(1134) 및 프로그램 데이터(1136)를 비롯한 다수의 프로그램 모듈이 드라이브 및 RAM(1112)에 저장될 수 있다. 운영 체제, 애플리케이션, 모듈 및/또는 데이터의 전부 또는 그 일부분이 또한 RAM(1112)에 캐싱될 수 있다. 본 개시가 여러가지 상업적으로 이용가능한 운영 체제 또는 운영 체제들의 조합에서 구현될 수 있다는 것을 잘 알 것이다.A number of program modules may be stored in the drive and RAM 1112, including the operating system 1130, one or more application programs 1132, other program modules 1134, and program data 1136. All or part of the operating system, applications, modules and / or data may also be cached in RAM 1112. It will be appreciated that the present disclosure can be implemented in various commercially available operating systems or combinations of operating systems.

사용자는 하나 이상의 유선/무선 입력 장치, 예를 들어, 키보드(1138) 및 마우스(1140) 등의 포인팅 장치를 통해 컴퓨터(1102)에 명령 및 정보를 입력할 수 있다. 기타 입력 장치(도시 생략)로는 마이크, IR 리모콘, 조이스틱, 게임 패드, 스타일러스 펜, 터치 스크린, 기타 등등이 있을 수 있다. 이들 및 기타 입력 장치가 종종 시스템 버스(1108)에 연결되어 있는 입력 장치 인터페이스(1142)를 통해 처리 장치(1104)에 연결되지만, 병렬 포트, IEEE 1394 직렬 포트, 게임 포트, USB 포트, IR 인터페이스, 기타 등등의 기타 인터페이스에 의해 연결될 수 있다.The user may input commands and information to the computer 1102 through one or more wired / wireless input devices, for example, pointing devices such as a keyboard 1138 and a mouse 1140. Other input devices (not shown) may include a microphone, IR remote control, joystick, game pad, stylus pen, touch screen, etc. These and other input devices are often connected to the processing unit 1104 through an input device interface 1142 connected to the system bus 1108, but the parallel port, IEEE 1394 serial port, game port, USB port, IR interface, And other interfaces.

모니터(1144) 또는 다른 유형의 디스플레이 장치도 역시 비디오 어댑터(1146) 등의 인터페이스를 통해 시스템 버스(1108)에 연결된다. 모니터(1144)에 부가하여, 컴퓨터는 일반적으로 스피커, 프린터, 기타 등등의 기타 주변 출력 장치(도시 생략)를 포함한다.The monitor 1144 or other type of display device is also connected to the system bus 1108 through an interface such as a video adapter 1146. In addition to the monitor 1144, the computer generally includes other peripheral output devices (not shown) such as speakers, printers, and the like.

컴퓨터(1102)는 유선 및/또는 무선 통신을 통한 원격 컴퓨터(들)(1148) 등의 하나 이상의 원격 컴퓨터로의 논리적 연결을 사용하여 네트워크화된 환경에서 동작할 수 있다. 원격 컴퓨터(들)(1148)는 워크스테이션, 컴퓨팅 디바이스 컴퓨터, 라우터, 퍼스널 컴퓨터, 휴대용 컴퓨터, 마이크로프로세서-기반 오락 기기, 피어 장치 또는 기타 통상의 네트워크 노드일 수 있으며, 일반적으로 컴퓨터(1102)에 대해 기술된 구성요소들 중 다수 또는 그 전부를 포함하지만, 간략함을 위해, 메모리 저장 장치(1150)만이 도시되어 있다. 도시되어 있는 논리적 연결은 근거리 통신망(LAN)(1152) 및/또는 더 큰 네트워크, 예를 들어, 원거리 통신망(WAN)(1154)에의 유선/무선 연결을 포함한다. 이러한 LAN 및 WAN 네트워킹 환경은 사무실 및 회사에서 일반적인 것이며, 인트라넷 등의 전사적 컴퓨터 네트워크(enterprise-wide computer network)를 용이하게 해주며, 이들 모두는 전세계 컴퓨터 네트워크, 예를 들어, 인터넷에 연결될 수 있다.The computer 1102 can operate in a networked environment using logical connections to one or more remote computers, such as remote computer (s) 1148 via wired and / or wireless communication. The remote computer (s) 1148 may be a workstation, computing device computer, router, personal computer, portable computer, microprocessor-based entertainment device, peer device, or other conventional network node, typically in computer 1102. It includes many or all of the components described with respect to, but for simplicity, only the memory storage device 1150 is shown. The illustrated logical connections include wired / wireless connections to a local area network (LAN) 1152 and / or a larger network, such as a wide area network (WAN) 1154. Such LAN and WAN networking environments are common in offices and companies, and facilitate enterprise-wide computer networks such as intranets, all of which can be connected to computer networks around the world, for example, the Internet.

LAN 네트워킹 환경에서 사용될 때, 컴퓨터(1102)는 유선 및/또는 무선 통신 네트워크 인터페이스 또는 어댑터(1156)를 통해 로컬 네트워크(1152)에 연결된다. 어댑터(1156)는 LAN(1152)에의 유선 또는 무선 통신을 용이하게 해줄 수 있으며, 이 LAN(1152)은 또한 무선 어댑터(1156)와 통신하기 위해 그에 설치되어 있는 무선 액세스 포인트를 포함하고 있다. WAN 네트워킹 환경에서 사용될 때, 컴퓨터(1102)는 모뎀(1158)을 포함할 수 있거나, WAN(1154) 상의 통신 컴퓨팅 디바이스에 연결되거나, 또는 인터넷을 통하는 등, WAN(1154)을 통해 통신을 설정하는 기타 수단을 갖는다. 내장형 또는 외장형 및 유선 또는 무선 장치일 수 있는 모뎀(1158)은 직렬 포트 인터페이스(1142)를 통해 시스템 버스(1108)에 연결된다. 네트워크화된 환경에서, 컴퓨터(1102)에 대해 설명된 프로그램 모듈들 또는 그의 일부분이 원격 메모리/저장 장치(1150)에 저장될 수 있다. 도시된 네트워크 연결이 예시적인 것이며 컴퓨터들 사이에 통신 링크를 설정하는 기타 수단이 사용될 수 있다는 것을 잘 알 것이다.When used in a LAN networking environment, the computer 1102 is connected to the local network 1152 via a wired and / or wireless communication network interface or adapter 1156. The adapter 1156 may facilitate wired or wireless communication to the LAN 1152, which also includes a wireless access point installed therein to communicate with the wireless adapter 1156. When used in a WAN networking environment, the computer 1102 may include a modem 1158, connect to a communication computing device on the WAN 1154, or establish communication over the WAN 1154, such as through the Internet. Have other means. The modem 1158, which may be an internal or external and wired or wireless device, is connected to the system bus 1108 through a serial port interface 1142. In a networked environment, program modules described for the computer 1102 or portions thereof may be stored in the remote memory / storage device 1150. It will be appreciated that the network connections shown are exemplary and other means of establishing a communication link between computers may be used.

컴퓨터(1102)는 무선 통신으로 배치되어 동작하는 임의의 무선 장치 또는 개체, 예를 들어, 프린터, 스캐너, 데스크톱 및/또는 휴대용 컴퓨터, PDA(portable data assistant), 통신 위성, 무선 검출가능 태그와 연관된 임의의 장비 또는 장소, 및 전화와 통신을 하는 동작을 한다. 이것은 적어도 Wi-Fi 및 블루투스 무선 기술을 포함한다. 따라서, 통신은 종래의 네트워크에서와 같이 미리 정의된 구조이거나 단순하게 적어도 2개의 장치 사이의 애드혹 통신(ad hoc communication)일 수 있다.The computer 1102 is associated with any wireless device or object that is deployed and operates in wireless communication, such as a printer, scanner, desktop and / or portable computer, a portable data assistant (PDA), communication satellite, or wireless detectable tag. It operates to communicate with any equipment or place and telephone. This includes at least Wi-Fi and Bluetooth wireless technology. Accordingly, the communication may be a predefined structure as in a conventional network or simply ad hoc communication between at least two devices.

Wi-Fi(Wireless Fidelity)는 유선 없이도 인터넷 등으로의 연결을 가능하게 해준다. Wi-Fi는 이러한 장치, 예를 들어, 컴퓨터가 실내에서 및 실외에서, 즉 기지국의 통화권 내의 아무 곳에서나 데이터를 전송 및 수신할 수 있게 해주는 셀 전화와 같은 무선 기술이다. Wi-Fi 네트워크는 안전하고 신뢰성 있으며 고속인 무선 연결을 제공하기 위해 IEEE 802.11(a,b,g, 기타)이라고 하는 무선 기술을 사용한다. 컴퓨터를 서로에, 인터넷에 및 유선 네트워크(IEEE 802.3 또는 이더넷을 사용함)에 연결시키기 위해 Wi-Fi가 사용될 수 있다. Wi-Fi 네트워크는 비인가 2.4 및 5 GHz 무선 대역에서, 예를 들어, 11Mbps(802.11a) 또는 54 Mbps(802.11b) 데이터 레이트로 동작하거나, 양 대역(듀얼 대역)을 포함하는 제품에서 동작할 수 있다.Wi-Fi (Wireless Fidelity) enables a connection to the Internet or the like without a wired connection. Wi-Fi is a wireless technology such as a cell phone that allows a device, for example, a computer to transmit and receive data indoors and outdoors, ie anywhere within the base station's coverage area. Wi-Fi networks use a wireless technology called IEEE 802.11 (a, b, g, etc.) to provide a secure, reliable and high-speed wireless connection. Wi-Fi can be used to connect computers to each other, to the Internet, and to a wired network (using IEEE 802.3 or Ethernet). Wi-Fi networks can operate in unlicensed 2.4 and 5 GHz radio bands, for example, at 11 Mbps (802.11a) or 54 Mbps (802.11 b) data rates, or in products that include both bands (dual band). have.

본 개시의 기술 분야에서 통상의 지식을 가진 자는 정보 및 신호들이 임의의 다양한 상이한 기술들 및 기법들을 이용하여 표현될 수 있다는 것을 이해할 것이다. 예를 들어, 위의 설명에서 참조될 수 있는 데이터, 지시들, 명령들, 정보, 신호들, 비트들, 심볼들 및 칩들은 전압들, 전류들, 전자기파들, 자기장들 또는 입자들, 광학장들 또는 입자들, 또는 이들의 임의의 결합에 의해 표현될 수 있다.One of ordinary skill in the art of the present disclosure will understand that information and signals can be represented using any of a variety of different technologies and techniques. For example, data, instructions, instructions, information, signals, bits, symbols and chips that can be referenced in the above description are voltages, currents, electromagnetic waves, magnetic fields or particles, optical fields Or particles, or any combination thereof.

본 개시의 기술 분야에서 통상의 지식을 가진 자는 여기에 개시된 실시예들과 관련하여 설명된 다양한 예시적인 논리 블록들, 모듈들, 프로세서들, 수단들, 회로들 및 알고리즘 단계들이 전자 하드웨어, (편의를 위해, 여기에서 "소프트웨어"로 지칭되는) 다양한 형태들의 프로그램 또는 설계 코드 또는 이들 모두의 결합에 의해 구현될 수 있다는 것을 이해할 것이다. 하드웨어 및 소프트웨어의 이러한 상호 호환성을 명확하게 설명하기 위해, 다양한 예시적인 컴포넌트들, 블록들, 모듈들, 회로들 및 단계들이 이들의 기능과 관련하여 위에서 일반적으로 설명되었다. 이러한 기능이 하드웨어 또는 소프트웨어로서 구현되는지 여부는 특정한 애플리케이션 및 전체 시스템에 대하여 부과되는 설계 제약들에 따라 좌우된다. 본 개시의 기술 분야에서 통상의 지식을 가진 자는 각각의 특정한 애플리케이션에 대하여 다양한 방식들로 설명된 기능을 구현할 수 있으나, 이러한 구현 결정들은 본 개시의 범위를 벗어나는 것으로 해석되어서는 안 될 것이다.Those of ordinary skill in the art of the present disclosure may use various examples of logical blocks, modules, processors, means, circuits, and algorithm steps described in connection with the embodiments disclosed herein in electronic hardware, (convenience For the sake of understanding, it may be implemented by various forms of program or design code (referred to herein as “software”) or a combination of both. To clearly illustrate this interoperability of hardware and software, various illustrative components, blocks, modules, circuits, and steps have been described above generally in terms of their functionality. Whether such functionality is implemented as hardware or software depends on the particular application and design constraints imposed on the overall system. Those skilled in the art of the present disclosure may implement the functions described in various ways for each particular application, but such implementation decisions should not be interpreted as being outside the scope of the present disclosure.

여기서 제시된 다양한 실시예들은 방법, 장치, 또는 표준 프로그래밍 및/또는 엔지니어링 기술을 사용한 제조 물품(article)으로 구현될 수 있다. 용어 "제조 물품"은 임의의 컴퓨터-판독가능 장치로부터 액세스 가능한 컴퓨터 프로그램, 캐리어, 또는 매체(media)를 포함한다. 예를 들어, 컴퓨터-판독가능 매체는 자기 저장 장치(예를 들면, 하드 디스크, 플로피 디스크, 자기 스트립, 등), 광학 디스크(예를 들면, CD, DVD, 등), 스마트 카드, 및 플래쉬 메모리 장치(예를 들면, EEPROM, 카드, 스틱, 키 드라이브, 등)를 포함하지만, 이들로 제한되는 것은 아니다. 또한, 여기서 제시되는 다양한 저장 매체는 정보를 저장하기 위한 하나 이상의 장치 및/또는 다른 기계-판독가능한 매체를 포함한다. Various embodiments presented herein may be implemented as a method, apparatus, or article of manufacture using standard programming and / or engineering techniques. The term "manufactured article" includes a computer program, carrier, or media accessible from any computer-readable device. For example, computer-readable media include magnetic storage devices (eg, hard disks, floppy disks, magnetic strips, etc.), optical disks (eg, CDs, DVDs, etc.), smart cards, and flash memory Devices (eg, EEPROMs, cards, sticks, key drives, etc.), but are not limited to these. Also, various storage media presented herein include one or more devices and / or other machine-readable media for storing information.

제시된 프로세스들에 있는 단계들의 특정한 순서 또는 계층 구조는 예시적인 접근들의 일례임을 이해하도록 한다. 설계 우선순위들에 기반하여, 본 개시의 범위 내에서 프로세스들에 있는 단계들의 특정한 순서 또는 계층 구조가 재배열될 수 있다는 것을 이해하도록 한다. 첨부된 방법 청구항들은 샘플 순서로 다양한 단계들의 엘리먼트들을 제공하지만 제시된 특정한 순서 또는 계층 구조에 한정되는 것을 의미하지는 않는다.It is understood that the specific order or hierarchy of steps in the processes presented is an example of exemplary approaches. Based on design priorities, it is understood that within the scope of the present disclosure, a specific order or hierarchy of steps in the processes may be rearranged. The accompanying method claims provide elements of the various steps in a sample order, but are not meant to be limited to the specific order or hierarchy presented.

제시된 실시예들에 대한 설명은 임의의 본 개시의 기술 분야에서 통상의 지식을 가진 자가 본 개시를 이용하거나 또는 실시할 수 있도록 제공된다. 이러한 실시예들에 대한 다양한 변형들은 본 개시의 기술 분야에서 통상의 지식을 가진 자에게 명백할 것이다. 여기에 정의된 일반적인 원리들은 본 개시의 범위를 벗어남이 없이 다른 실시예들에 적용될 수 있다. 그리하여, 본 개시는 여기에 제시된 실시예들로 한정되는 것이 아니라, 여기에 제시된 원리들 및 신규한 특징들과 일관되는 최광의의 범위에서 해석되어야 할 것이다.Descriptions of the presented embodiments are provided to enable any person of ordinary skill in the art to use or practice the present disclosure. Various modifications to these embodiments will be apparent to those skilled in the art of the present disclosure. The general principles defined herein can be applied to other embodiments without departing from the scope of the present disclosure. Thus, the present disclosure should not be limited to the embodiments presented herein, but should be interpreted in the broadest scope consistent with the principles and novel features presented herein.

Claims

A computer program stored in a computer-readable storage medium, wherein the computer program, when executed on one or more processors, performs the following operations for driving the chatbot, the operations being:
Receiving a first input sentence from a user terminal;
Determining a sentence completion level of the first input sentence;
When it is determined that the sentence completion level of the first input sentence is lower than a predetermined criterion, transmitting an incomplete sentence standard answer to the user terminal;
Receiving a second input sentence from the user terminal;
Generating a third input sentence combining the first input sentence and the second input sentence;
Determining a sentence completion level of the third input sentence; And
Transmitting a response based on the third input sentence to the user terminal when it is determined that the sentence completion degree of the third input sentence is higher than the predetermined criterion;
Including,
The second input sentence is an input sentence received from the user terminal after the first input sentence,
A computer program stored on a computer readable storage medium.

According to claim 1,
The second input sentence,
The input sentence within a predetermined number range received from the user terminal after receiving the first input sentence from the user terminal or within a predetermined time range, or after receiving the first input sentence from the user terminal ,
A computer program stored on a computer readable storage medium.

According to claim 1,
The operation of determining the sentence completion of the first input sentence is
Performing a morpheme analysis on the first input sentence;
Obtaining a component of a sentence included in the first input sentence through the morpheme analysis; And
Determining a sentence completion level of the first input sentence based on a component of the sentence;
Containing,
A computer program stored on a computer readable storage medium.

According to claim 1,
The operation of determining the sentence completion of the first input sentence is
Obtaining an input sentence vector by computing the first input sentence through a pre-processing model including one or more network functions;
Calculating similarity for each of the input sentence vector and the reference sentence vector for one or more reference sentences; And
Determining that the sentence completion is higher than the predetermined criterion when there is a reference sentence having the similarity of a threshold value or higher;
Containing,
A computer program stored on a computer readable storage medium.

According to claim 1,
The operation of determining the sentence completion of the first input sentence is
Calculating the first input sentence through a sentence subject judgment model;
Determining that sentence completion is higher than the predetermined criterion when both the subject and the sub-subject of the first input sentence are obtained through the operation; And
Determining that sentence completion is lower than the predetermined criterion when at least one of the subject and the sub-subject is not obtained through the operation;
Containing,
A computer program stored on a computer readable storage medium.

The method of claim 5,
The sentence subject judgment model,
The learning input sentence is based on a learning data set including at least one of a learning input sentence generated by dividing at least one of the one or more reference sentences and a reference answer into an arbitrary length, and at least one of a subject and a sub-subject labeled on the learning input sentence. Learning to output at least one of the subject and the sub subject when inputting,
A computer program stored on a computer readable storage medium.

According to claim 1,
The operation of determining the sentence completion of the first input sentence is
Obtaining a first input sentence vector by calculating the first input sentence through a pre-processing model;
Determining that sentence completion is higher than the predetermined criterion when the first input sentence vector has a similarity to a threshold value of at least one of the one or more completion sentence vectors; And
Determining that sentence completion is lower than the predetermined criterion when the first input sentence vector has a similarity to a threshold value of at least one of the one or more incomplete sentence vectors;
Including,
The completion sentence vector,
Is obtained by calculating the one or more reference sentences or reference answers through the pre-processing model, and
The incomplete sentence vector,
It is obtained by calculating a divided reference sentence or a divided reference answer in which each of the one or more reference sentences or reference answers is divided into arbitrary lengths through the pre-processing model.
A computer program stored on a computer readable storage medium.

According to claim 1,
Calculating at least one of a subject and a sub-subject for the first input sentence by calculating the first input sentence through a sentence subject judgment model; And
Determining a second input sentence based on at least one of the obtained subject and sub-subjects from one or more input sentences received after the first input sentence from the user terminal;
Containing more,
A computer program stored on a computer readable storage medium.

According to claim 1,
The second input sentence,
An input sentence including a sentence symbol among one or more input sentences received after the first input sentence,
A computer program stored on a computer readable storage medium.

According to claim 1,
When it is determined that the sentence completion degree of the third input sentence is higher than the predetermined criterion, the operation of transmitting an answer based on the third input sentence to the user terminal is
Obtaining an input sentence vector by computing the third input sentence through a pre-processing model including one or more network functions;
Calculating similarity for each of the input sentence vector and the one or more reference sentence vectors;
When there is a reference sentence having the similarity of a threshold value or higher, transmitting an answer based on the reference sentence having the similarity of a threshold value or higher to the user terminal; And
If there is no reference sentence having a similarity or higher than a threshold value, transmitting an unclassified sentence standard answer to the user terminal;
Containing,
A computer program stored on a computer readable storage medium.

A method for driving a chatbot implemented in one or more processors of a computing device,
Receiving a first input sentence from a user terminal;
When it is determined that the sentence completion level of the first input sentence is lower than a predetermined criterion, transmitting an incomplete sentence standard answer to the user terminal;
Receiving a second input sentence from the user terminal;
Generating a third input sentence combining the first input sentence and the second input sentence; And
When it is determined that the sentence completion level of the third input sentence is higher than the predetermined criterion, transmitting an answer based on the third input sentence to the user terminal;
Including,
The second input sentence is an input sentence received from the user terminal after the first input sentence,
A method of driving a chatbot implemented on one or more processors of a computing device.

As a computing device for driving the chatbot,
A processor comprising one or more cores;
A network unit that receives at least one of a first input sentence and a second input sentence from a user terminal, and transmits an answer determined by the processor to the user terminal; And
Memory;
Including,
The processor,
The sentence completion degree of the first input sentence, the second input sentence, and the third input sentence combining the first input sentence and the second input sentence is determined, and the sentence completion degree of the first input sentence is lower than a predetermined criterion. If it is judged that, it is determined to transmit an incomplete sentence standard answer to the user terminal, and when it is determined that the sentence completion degree of the third input sentence is higher than the predetermined criterion, the answer based on the third input sentence is recalled. Deciding to transmit to the user terminal,
A computing device for driving the chatbot.