KR101624184B1

KR101624184B1 - Method and apparatus of intention-response interface of behavior learning model for an angent system

Info

Publication number: KR101624184B1
Application number: KR1020150033839A
Authority: KR
Inventors: 조성배
Original assignee: 연세대학교 산학협력단
Priority date: 2015-03-11
Filing date: 2015-03-11
Publication date: 2016-06-07

Abstract

The present invention relates to an intention-response interface device of a behavior learning model for an agent system and a method thereof. The intention-response interface device comprises: an information input unit configured to receive user input information and surrounding environment information; a user intention analyzing unit; a simple response behavior generating unit; and a complex response behavior generating unit. Therefore, unnecessary operations can be reduced.

Description

Technical Field [0001] The present invention relates to an intention-correspondence interface apparatus and method for a behavior learning model for an agent system,

본 발명은 주변 환경에 따라 사용자의 명령에 적합한 대응행동을 생성하는 사용자 에이전트 장치와 그 방법에 관한 것이다.The present invention relates to a user agent apparatus and a method thereof for generating a response action suitable for a command of a user according to a surrounding environment.

사용자의 일정관리와 필요한 정보를 검색해주는 비서 에이전트, 사용자를 대신하여 심부름 또는 청소 업무를 수행하는 로봇 에이전트, 가정환경에서 가전기기를 제어하는 홈 에이전트 등 다양한 에이전트 시스템이 개발되고 있다. 특히 사용자와 에이전트 간의 상호작용과 사용자 맞춤형의 개인화 서비스 제공에 대한 많은 관심이 기울여지고 있다.A variety of agent systems are being developed, including a secretarial agent that retrieves the user's schedule and information, a robot agent that performs errands or cleaning tasks on behalf of the user, and a home agent that controls home appliances in the home environment. In particular, much attention has been paid to the interaction between users and agents and the provision of customized personalization services.

이와 같은 사용자 맞춤형 에이전트 서비스를 제공하기 위하여는 사용자 중심의 에이전트 기술 개발과 범용적으로 적용될 수 있는 에이전트 인터페이스 기술개발이 필요하다. 그리고 이를 위한 종래의 기술로 행동 학습 모델이 있다. 행동 학습 모델은 다양한 에이전트의 행동 조합에 따라 적응적으로 에이전트가 대응행동을 생성하고 이를 수행할 수 있도록 행동을 학습하고 학습된 모델을 이용하여 사용자의 의도에 대응하는 행동을 생성하는 에이전트 시스템에 관한 기술이다.In order to provide such a customized agent service, it is necessary to develop a user - oriented agent technology and develop an agent interface technology which can be applied universally. And there is a behavior learning model as a conventional technology for this. Behavioral learning model is an agent system that learns behavior so that agent can generate adaptive behavior according to behavioral combination of various agents and can perform it and generates behavior corresponding to user 's intention using learned model Technology.

이와 같이 에이전트의 행동을 학습한 모델을 설계하고 획득하기 위해 사용되는 대표적인 기준은 에이전트가 최종 목적을 달성하는데 소비한 비용에 기반하는 것과, 대응 행동의 결과에 대한 사용자 만족도에 기반하는 것이다. 이러한 기준들을 이용하여 에이전트의 행동을 학습하는 방법으로는 행동 간의 연결관계를 변환하는 방식과 규칙을 기반으로 다음에 수행할 행동의 조합을 생성하는 방식이 있다. The representative criteria used to design and acquire a model that learns agent behavior are based on the cost spent by the agent to achieve the end goal and based on user satisfaction with the outcome of the response behavior. There are two ways of learning agent behavior using these criteria: a method of converting the connection relationship between actions and a method of generating a combination of actions to be performed next based on the rule.

그러나 위와 같은 기존의 방법들은 에이전트의 대응행동에 대한 효율성 문제에만 집중했기 때문에 사용자의 의도를 충분히 고려하지 못하였고, 그에 따라 사용자가 원하는 대응행동을 생성함에 있어서 한계를 갖고 있다. 또한 사용자의 의도를 고려하는 대응행동 생성 시스템의 경우에도 에이전트 시스템에 정의된 전체 행동을 학습함으로써 환경변화 요소와 해결해야 하는 문제의 복잡도가 증가할 때, 대응 및 학습시간이 급격히 증가하는 문제점이 있었다.However, since the existing methods concentrate only on the efficiency problem of the agent 's response behavior, the user' s intention is not fully taken into consideration and accordingly, the user has a limitation in generating the desired response behavior. Also, even in the case of the response behavior generating system considering the user's intention, learning of the entire behavior defined in the agent system has a problem that the correspondence and the learning time are rapidly increased when the environment change factor and the complexity of the problem to be solved increase .

공개특허공보 제10-2013-0083099호 (2013.07.22.)Published Japanese Patent Application No. 10-2013-0083099 (Feb.

본 발명이 해결하고자 하는 과제는 다양한 에이전트 시스템에 적용하기 위한 사용자 의도에 기반한 대응행동을 생성하는 장치 및 그에 관한 방법을 제공하는 것이다.SUMMARY OF THE INVENTION It is an object of the present invention to provide an apparatus and a method for generating a response based on user intention for applying to various agent systems.

본 발명은 행동학습 모델기반의 홈 에이전트, 모바일 에이전트, 로봇 에이전트 등 다양한 에이전트 시스템에 적용 가능한 대응행동 생성 방법을 제공한다. 이를 위하여 본 발명은 사람이 타인의 의도를 이해하고 이에 대응하는 과정을 뇌 과학적으로 접근하여 뇌의 학습 메커니즘과 대응행동 메커니즘을 모방하여, 그에 따른 대응행동 생성 방법을 제공한다.The present invention provides a corresponding behavior generation method applicable to various agent systems such as a home agent, a mobile agent, and a robot agent based on a behavior learning model. To this end, the present invention provides a method of generating a corresponding behavior by imitating a brain learning mechanism and a corresponding behavioral mechanism by approaching a process in which a person understands and responds to the intention of the other person in a brain scientific manner.

더욱 상세하게는 낮은 수준의 대응행동을 필요로 하는 단순의도에는 빠르고 직관적인 대응행동을 생성하고, 상위 수준의 대응행동을 필요로 하는 복합의도에는 사용자 의도에 기반하여 학습된 대응기법을 적용하여 장기적인 대응행동을 생성하는 방법을 제공한다.More precisely, it generates quick and intuitive response behaviors for simple intentions that require low-level response behaviors, and applies the learned response techniques to complex intentions that require high-level response behaviors based on user intent Thereby generating a long-term response behavior.

이를 위하여 본 발명에 따른 사용자 에이전트 장치는, 사용자 입력 정보와 주변 환경정보를 입력받는 정보 입력부, 상기 사용자 입력 정보를 분석하여, 하나의 중간 목적이 획득된 경우 사용자의 의도를 단순의도로 분류하고, 두 개 이상의 상기 중간 목적을 포함하는 복합 목적이 획득된 경우 상기 사용자의 의도를 복합의도로 분류하는 사용자 의도 분석부, 상기 사용자의 의도가 상기 단순의도로 분류된 경우, 목적 노드와 환경 노드와 행동 노드로 이루어진 행동 선택 네트워크를 이용하여 상기 단순의도에 대응하는 행동을 생성하는 단순 대응행동 생성부, 및 상기 사용자의 의도가 상기 복합의도로 분류된 경우, 상기 행동 선택 네트워크를 이용하여 상기 복합의도에 대응하는 행동을 생성하는 복합 대응행동 생성부를 포함한다. To this end, the user agent apparatus according to the present invention includes an information input unit for receiving user input information and surrounding environment information, analyzing the user input information, A user intention analysis unit that classifies the user's intention into a plurality of roads when the combined purpose including the at least two intermediate objectives is obtained; a case where the user's intention is classified as the simple road, A simple response behavior generating unit for generating a behavior corresponding to the simple intention by using a behavior selection network made up of nodes and a simple response behavior generating unit for, when the intention of the user is classified as the compound road, And a complex response behavior generating unit for generating a behavior corresponding to the degree of the response.

상기 과제를 해결하기 위해, 본 발명의 일 유형에 따른 사용자 에이전트 장치는, 사용자 입력 정보와 주변 환경정보를 입력받는 정보 입력부; 상기 사용자 입력 정보를 분석하여, 하나의 중간 목적이 획득된 경우 사용자의 의도를 단순의도로 분류하고, 두 개 이상의 상기 중간 목적을 포함하는 복합 목적이 획득된 경우 상기 사용자의 의도를 복합의도로 분류하는 사용자 의도 분석부; 상기 사용자의 의도가 상기 단순의도로 분류된 경우, 목적 노드와 환경 노드와 행동 노드로 이루어진 행동 선택 네트워크를 이용하여 상기 단순의도에 대응하는 행동을 생성하는 단순 대응행동 생성부; 및 상기 사용자의 의도가 상기 복합의도로 분류된 경우, 상기 행동 선택 네트워크를 이용하여 상기 복합의도에 대응하는 행동을 생성하는 복합 대응행동 생성부를 포함할 수 있다.According to an aspect of the present invention, there is provided a user agent apparatus comprising: an information input unit receiving user input information and surrounding environment information; Analyzing the user input information to classify the intention of the user when the one intermediate object is obtained and the intention of the user when the compound object including two or more intermediate objects is obtained; User intention analysis unit; A simple response behavior generating unit for generating a behavior corresponding to the simple intention using a behavior selection network composed of a target node, an environment node, and a behavior node when the intention of the user is classified as the simple road; And a complex response behavior generating unit for generating a behavior corresponding to the compound intention using the behavior selection network when the intention of the user is classified as the compound road.

여기서 상기 사용자 에이전트 장치는 상기 행동 선택 네트워크를 저장하는 행동 선택 네트워크 저장부를 더 포함하는 것을 특징으로 할 수 있다.Here, the user agent apparatus may further include a behavior selection network storage unit for storing the behavior selection network.

여기서 상기 정보 입력부는 영상 센서 또는 음성 센서 또는 터치 센서 중 적어도 어느 하나를 이용하여 사용자의 행위에 따른 상기 사용자 입력 정보를 입력받고, 상기 사용자 에이전트 장치가 속한 환경에 관한 상기 환경정보를 입력받는 것을 특징으로 할 수 있다.Here, the information input unit receives the user input information according to a user's action using at least one of an image sensor, a voice sensor, and a touch sensor, and receives the environment information related to the environment to which the user agent apparatus belongs .

여기서 상기 행동 선택 네트워크는 적어도 하나 이상의 행동 네트워크 모듈을 포함하고, 상기 행동 네트워크 모듈은 하나의 상기 중간 목적에 관한 상기 목적 노드에 대하여 적어도 하나 이상의 상기 환경 노드와 적어도 하나 이상의 상기 행동 노드들 간의 연결로 이루어지는 것을 특징으로 할 수 있다.Wherein the behavioral selection network comprises at least one behavioral network module and wherein the behavioral network module is operable to establish a connection between the at least one environment node and at least one or more of the behavioral nodes for the one or more intermediate nodes, .

여기서 상기 사용자 의도 분석부는, 상기 사용자 입력 정보를 분석하여 사용자 명령 구문을 획득하고, 상기 사용자 명령 구문을 미리 정해진 의도 분류 온톨로지를 이용하여 분석하여 상기 사용자 명령 구문에 포함된 상기 중간 목적 또는 상기 복합 목적을 획득하고, 상기 하나의 중간 목적이 획득된 경우는 상기 사용자 명령 구문에 대응하는 상기 사용자 의도를 상기 단순의도로, 상기 복합 목적이 획득된 경우는 상기 사용자 명령 구문에 대응하는 상기 사용자 의도를 상기 복합의도로, 각 분류하는 것을 특징으로 할 수 있다.Herein, the user's intention analysis unit analyzes the user input information to obtain a user command syntax, analyzes the user command syntax using a predetermined intention classification ontology, and analyzes the user's command syntax using the intention classification ontology, The user intention corresponding to the user command syntax may be referred to as the simple road if the one intermediate goal is obtained and the user intention corresponding to the user command syntax if the combined purpose is obtained, Multi-way roads, each classified.

여기서, 상기 단순 대응행동 생성부는, 상기 단순의도에 대응하는 상기 중간 목적을 가지는 상기 행동 네트워크 모듈을 상기 행동 선택 네트워크에서 검색하여 선택하고, 상기 선택한 행동 네트워크 모듈에 포함된 상기 행동 노드를 이용하여 단순 대응행동을 생성하고, 상기 단순 대응행동은 일정한 순서로 배열된 적어도 하나 이상의 단위 행동을 포함하는 것을 특징으로 할 수 있다.Here, the simple corresponding behavior generating unit searches for and selects the behavior network module having the intermediate purpose corresponding to the simple intention in the behavior selection network, and uses the behavior node included in the selected behavior network module A simple corresponding action is generated, and the simple corresponding action includes at least one or more unit actions arranged in a predetermined order.

여기서, 상기 단순 대응행동 생성부는, 상기 선택한 행동 네트워크 모듈에 포함된 상기 행동 노드 각각에 대하여 활성화 에너지를 산출하고, 상기 활성화 에너지가 가장 큰 상기 행동 노드를 선택하고, 상기 선택된 행동 노드를 이용하여 상기 단순 대응행동을 생성하는 것을 특징으로 할 수 있다.Here, the simple corresponding behavior generating unit may be configured to calculate activation energy for each of the behavior nodes included in the selected behavior network module, to select the behavior node having the largest activation energy, And a simple response action is generated.

여기서, 상기 단순 대응행동 생성부는 일정한 시간 간격으로 상기 활성화 에너지를 산출하고, 각 시간 별로 가장 큰 상기 활성화 에너지를 가지는 상기 행동 노드를 선택하고, 상기 선택한 상기 행동 노드에 대응하는 상기 단위 행동을 생성하여, 상기 단순 대응행동을 생성하는 것을 특징으로 할 수 있다.Here, the simple corresponding behavior generating unit may calculate the activation energy at a predetermined time interval, select the behavior node having the largest activation energy for each time, generate the unit behavior corresponding to the selected behavior node , And generates the simple corresponding action.

여기서, 상기 단순 대응행동 생성부는, 특정 시간에서의 해당 행동 노드에서의 상기 활성화 에너지를, 상기 특정 시간 이전 시간의 상기 해당 행동 노드의 상기 활성화 에너지, 상기 해당 행동 노드와 연결된 상기 환경 노드에 따른 에너지 값의 가중합, 상기 해당 행동 노드에 연결된 상기 목적 노드에 따른 에너지 값에 가중치를 적용한 값, 상기 해당 행동 노드와 다른 노드들 간의 연결 링크에 따른 에너지 값들의 가중합 중 적어도 어느 하나 이상을 이용하여 산출하는 것을 특징으로 할 수 있다.Here, the simple corresponding behavior generating unit may be configured to calculate the activation energy at the corresponding behavior node at a specific time, the activation energy of the corresponding behavior node at the time before the specific time, the energy Using a weighted sum of energy values according to a connection link between the corresponding behavior node and other nodes, a value obtained by applying a weight to an energy value according to the destination node connected to the corresponding behavior node, .

여기서 상기 복합 대응행동 생성부는, 상기 복합 목적을 계층 분할하여 일정한 순서를 가지는 상기 중간 목적들의 시퀀스를 획득하고, 상기 획득한 중간 목적들의 시퀀스에 포함된 각 상기 중간 목적에 대하여, 상기 행동 네트워크를 이용하여 상기 중간 목적에 대응하는 단위 행동의 시퀀스를 생성하고, 상기 중간 목적들의 시퀀스의 순서 대로 상기 중간 목적 별 상기 단위 행동의 시퀀스를 배열하여 이루어지는 복합 대응행동을 생성하는 것을 특징으로 할 수 있다.Here, the compound response generating unit may divide the compound object into layers to obtain the sequence of intermediate objects having a predetermined order, and for each intermediate object included in the acquired sequence of intermediate objects, Generating a sequence of unit behaviors corresponding to the intermediate purpose and arranging sequences of the unit behaviors for the intermediate purpose in the sequence of the intermediate objectives to generate a composite corresponding behavior.

여기서 상기 복합 대응행동 생성부는, 상기 중간 목적 별로, 상기 행동 선택 네트워크에서 상기 중간 목적을 가지는 상기 행동 네트워크 모듈을 검색하여 선택하고, 상기 선택한 행동 네트워크 모듈에 포함된 상기 행동 노드를 이용하여 상기 단위행동의 시퀀스를 생성하는 것을 특징으로 할 수 있다.Here, the complex corresponding behavior generating unit searches for and selects the behavior network module having the intermediate purpose in the behavior selection network for each intermediate purpose, and selects the behavior node using the behavior node included in the selected behavior network module And a sequence of the first and second sequences are generated.

여기서 상기 복합 대응행동 생성부는, 상기 선택한 행동 네트워크 모듈에 포함된 상기 행동 노드에 대하여 활성화 에너지를 산출하고, 상기 활성화 에너지가 가장 큰 상기 행동 노드를 선택하고, 상기 선택된 행동 노드를 이용하여 상기 단위 행동의 시퀀스를 생성하는 것을 특징으로 할 수 있다.Here, the complex corresponding behavior generating unit may calculate an activation energy for the behavior node included in the selected behavior network module, select the behavior node having the largest activation energy, and use the selected behavior node to calculate the unit behavior And a sequence of the first and second sequences are generated.

여기서 상기 복합 대응행동 생성부는 일정한 시간 간격으로 상기 활성화 에너지를 산출하고, 각 시간 별로 가장 큰 상기 활성화 에너지를 가지는 상기 행동 노드를 선택하고, 상기 선택한 상기 행동 노드에 대응하는 단위 행동을 생성하여, 상기 선택한 상기 행동 노드의 순서에 따라 상기 단위 행동의 시퀀스를 생성하는 것을 특징으로 할 수 있다.Wherein the complex corresponding behavior generating unit calculates the activation energy at a predetermined time interval, selects the behavior node having the largest activation energy for each time, generates a unit behavior corresponding to the selected behavior node, And generates the sequence of unit behaviors according to the order of the selected action nodes.

여기서 상기 사용자 에이전트 장치는 상기 행동 선택 네트워크에 포함된 상기 행동 네트워크 모듈의 배열의 집합을, 상기 사용자 입력 정보와 상기 환경정보와 사용자의 피드백 정보를 이용하여 학습하는 학습부를 더 포함하는 것을 특징으로 할 수 있다.Here, the user agent apparatus may further include a learning unit that learns a set of the behavior network modules included in the behavior selection network using the user input information, the environment information, and feedback information of the user .

여기서 상기 학습부는 행동 네트워크 모듈들이 일정한 순서대로 배열된 모듈 배열의 집합을 적어도 하나 이상 포함하고, 상기 모듈 배열의 집합에 대한 효용성 점수를 상기 사용자의 피드백 정보를 이용하여 산출하는 것을 특징으로 할 수 있다.Here, the learning unit may include at least one set of module arrays in which the behavioral network modules are arranged in a predetermined order, and calculate the utility score for the set of module arrays using the feedback information of the user .

여기서 상기 복합 대응행동 생성부는 상기 학습부에서 산출된 상기 모듈 배열의 집합에 대한 효용성 점수를 기준으로 상기 중간 목적들의 배열 순서를 정하고, 상기 배열 순서에 따른 상기 중간 목적들의 시퀀스를 획득하는 것을 특징으로 할 수 있다.Wherein the complex corresponding behavior generating unit determines an arrangement order of the intermediate objectives on the basis of the utility score for the set of module arrays calculated by the learning unit and acquires the sequence of intermediate goals according to the arrangement order can do.

상기 과제를 해결하기 위해, 본 발명의 또 다른 유형에 따른 사용자 의도 대응 행동 생성 방법은, 사용자 입력 정보와 주변 환경정보를 입력받는 정보 입력 단계; 상기 사용자 입력 정보를 분석하여, 하나의 중간 목적이 획득된 경우 사용자의 의도를 단순의도로 분류하고, 두 개 이상의 상기 중간 목적을 포함하는 복합 목적이 획득된 경우 상기 사용자의 의도를 복합의도로 분류하는 사용자 의도 분석 단계; 상기 사용자의 의도가 상기 단순의도로 분류된 경우, 목적 노드와 환경 노드와 행동 노드로 이루어진 행동 선택 네트워크를 이용하여 상기 단순의도에 대응하는 행동을 생성하는 단순 대응행동 생성 단계; 및 상기 사용자의 의도가 상기 복합의도로 분류된 경우, 상기 행동 선택 네트워크를 이용하여 상기 복합의도에 대응하는 행동을 생성하는 복합 대응행동 생성 단계를 포함하는 것을 특징으로 할 수 있다.According to another aspect of the present invention, there is provided a method for generating a user's intent response behavior comprising: inputting user input information and surrounding environment information; Analyzing the user input information to classify the intention of the user when the one intermediate object is obtained and the intention of the user when the compound object including two or more intermediate objects is obtained; User intention analysis step; A simple response behavior generating step of generating a behavior corresponding to the simple intention by using a behavior selection network composed of a destination node, an environment node and a behavior node when the intention of the user is classified into the simple road; And a complex response behavior generating step of, when the user's intention is classified as the compound road, using the behavior selection network to generate a behavior corresponding to the compound intention.

여기서, 상기 사용자 의도 분석 단계는, 상기 사용자 입력 정보를 분석하여 사용자 명령 구문을 획득하고, 상기 사용자 명령 구문을 미리 정해진 의도 분류 온톨로지를 이용하여 분석하여 상기 사용자 명령 구문에 포함된 상기 중간 목적 또는 상기 복합 목적을 획득하고, 상기 하나의 중간 목적이 획득된 경우는 상기 사용자 명령 구문에 대응하는 상기 사용자 의도를 상기 단순의도로, 상기 복합 목적이 획득된 경우는 상기 사용자 명령 구문에 대응하는 상기 사용자 의도를 상기 복합의도로, 각 분류하는 것을 특징으로 할 수 있다.Here, the user's intent analysis step may include analyzing the user input information to obtain a user command syntax, analyzing the user command syntax using a predetermined intention classification ontology, The user intention corresponding to the user command syntax is referred to as the simple road, when the combined purpose is obtained, the user intention corresponding to the user command syntax corresponding to the user command syntax Are classified into the composite roads, respectively.

여기서, 상기 단순 대응행동 생성 단계는, 상기 단순의도에 대응하는 상기 중간 목적을 가지는 상기 행동 네트워크 모듈을 상기 행동 선택 네트워크에서 검색하여 선택하고, 상기 선택한 행동 네트워크 모듈에 포함된 상기 행동 노드를 이용하여 단순 대응행동을 생성하고, 상기 단순 대응행동은 일정한 순서로 배열된 적어도 하나 이상의 단위 행동을 포함하는 것을 특징으로 할 수 있다.Here, the simple corresponding behavior generating step may include searching and selecting the behavior network module having the intermediate purpose corresponding to the simple intention in the behavior selection network, and using the behavior node included in the selected behavior network module And generating a simple response action, and the simple response action includes at least one or more unit actions arranged in a predetermined order.

여기서 상기 단순 대응행동 생성 단계는, 상기 선택한 행동 네트워크 모듈에 포함된 상기 행동 노드 각각에 대하여 활성화 에너지를 산출하고, 상기 활성화 에너지가 가장 큰 상기 행동 노드를 선택하고, 상기 선택된 행동 노드를 이용하여 상기 단순 대응행동을 생성하는 것을 특징으로 할 수 있다.Here, the simple corresponding behavior generating step may include calculating an activation energy for each of the behavior nodes included in the selected behavior network module, selecting the behavior node having the largest activation energy, And a simple response action is generated.

여기서, 상기 복합 대응행동 생성 단계는, 상기 복합 목적을 계층 분할하여 일정한 순서를 가지는 상기 중간 목적들의 시퀀스를 획득하고, 상기 획득한 중간 목적들의 시퀀스에 포함된 각 상기 중간 목적에 대하여, 상기 행동 네트워크를 이용하여 상기 중간 목적에 대응하는 단위 행동의 시퀀스를 생성하고, 상기 중간 목적들의 시퀀스의 순서 대로 상기 중간 목적 별 상기 단위 행동의 시퀀스를 배열하여 이루어지는 복합 대응행동을 생성하는 것을 특징으로 할 수 있다.Here, the complex corresponding behavior generating step may include hierarchically dividing the combined object to obtain a sequence of intermediate objectives having a predetermined order, and for each intermediate purpose included in the obtained sequence of intermediate objectives, Generating a sequence of unit behaviors corresponding to the intermediate purpose and arranging sequences of the unit behaviors for the intermediate purpose in the sequence of the intermediate objectives to generate a composite corresponding behavior .

본 발명에 따른 사용자 에이전트 장치에 의하면, 사용자의 의도의 복잡도에 따라 최적화된 대응행동을 생성할 수 있는 효과가 있다. 즉 인식된 사용자의 의도에 따라 대응행동 기법을 달리하여 대응행동을 생성함으로써, 불필요한 연산과 대응 시간을 줄일 수 있다.According to the user agent apparatus of the present invention, it is possible to generate optimized response actions according to the complexity of the user's intention. That is, unnecessary calculation and response time can be reduced by generating the corresponding behavior by changing the corresponding behavior technique according to the intention of the recognized user.

또한 학습에 있어서도 사용자 의도에 따라 대응기법을 달리하여 의도를 중심으로 대응행동을 학습함으로써, 구체적인 명령이나 도메인 환경에 독립적으로 사용자가 원하는 최적의 서비스를 제공하는 동시에 효율적인 서비스를 제공하는 효과가 있다.Also, in learning, by learning the response behavior based on the intention by varying the corresponding technique according to the user's intention, it is possible to provide an efficient service while providing the optimal service desired by the user independent of the specific command or domain environment.

도 1은 본 발명의 일 실시예에 따른 사용자 에이전트 장치의 블록도이다.
도 2는 본 발명의 또 다른 실시예에 따른 사용자 에이전트 장치의 블록도이다.
도 3은 사용자 의도 분석부의 동작을 설명하기 위한 참고도이다.
도 4는 사용자 의도 분석부가 의미 분류 온톨로지를 이용하여 하나의 복합 목적을 계층적으로 분석하는 동작에 관한 참고도이다.
도 5는 행동 네트워크 모듈을 설명하기 위한 참고도이다.
도 6은 단순 대응행동을 설명하기 위한 참고도이다.
도 7은 복합 대응행동 생성부의 동작을 설명하기 위한 참고도이다.
도 8은 복합 대응행동을 설명하기 위한 참고도이다.
도 9는 본 발명의 또 다른 실시예에 따른 사용자 에이전트 장치의 블록도이다.
도 10은 학습부의 동작을 보다 상세히 설명하기 위한 참고도이다.
도 11은 본 발명의 또 다른 실시예에 따른 사용자 의도 대응 행동 생성 방법에 관한 흐름도이다.1 is a block diagram of a user agent device in accordance with an embodiment of the present invention.
2 is a block diagram of a user agent device in accordance with another embodiment of the present invention.
3 is a reference diagram for explaining the operation of the user's intention analysis unit.
FIG. 4 is a reference diagram for an operation of hierarchically analyzing a compound object using a user intention analysis part semantic classification ontology.
5 is a reference diagram for explaining a behavioral network module.
6 is a reference diagram for explaining a simple response action.
7 is a reference diagram for explaining the operation of the complex response behavior generating unit.
Fig. 8 is a reference diagram for explaining the complex response behavior.
9 is a block diagram of a user agent device in accordance with another embodiment of the present invention.
10 is a reference diagram for explaining the operation of the learning unit in more detail.
11 is a flowchart illustrating a method for generating a user's intention response behavior according to another embodiment of the present invention.

이하, 본 발명의 바람직한 실시예를 첨부된 도면들을 참조하여 상세히 설명한다. 우선 각 도면의 구성요소들에 참조 부호를 부가함에 있어서, 동일한 구성요소들에 대해서는 비록 다른 도면상에 표시되더라도 가능한 동일한 부호를 가지도록 하고 있음에 유의해야 한다. 또한, 본 발명을 설명함에 있어, 관련된 공지 구성 또는 기능에 대한 구체적인 설명이 본 발명의 요지를 흐릴 수 있다고 판단되는 경우에는 그 상세한 설명은 생략한다. 또한, 이하에서 본 발명의 바람직한 실시예를 설명할 것이나, 본 발명의 기술적 사상은 이에 한정하거나 제한되지 않고 당업자에 의해 변형되어 다양하게 실시될 수 있음은 물론이다.Hereinafter, preferred embodiments of the present invention will be described in detail with reference to the accompanying drawings. In the drawings, the same reference numerals are used to designate the same or similar components throughout the drawings. In the following description of the present invention, a detailed description of known functions and configurations incorporated herein will be omitted when it may make the subject matter of the present invention rather unclear. In addition, the preferred embodiments of the present invention will be described below, but it is needless to say that the technical idea of the present invention is not limited thereto and can be variously modified by those skilled in the art.

본 발명은 행동학습 모델을 이용한 에이전트 대응행동 생성 기술분야에 있어 이전까지 도메인 지식을 기반으로 경험적 행동에 대한 비용을 고려하여 새로운 대응행동을 규칙기반으로 생성하거나, 행동간의 연결구조를 변환하여 대응행동을 생성하는 것과는 달리 도메인 지식과는 독립적으로 인식된 사용자의 의도에 따라 대응행동 기법을 달리하고, 사용자 의도에 대한 학습을 필요로 하는 복합-의도대응 과정에만 행동학습 모델을 적용하여 효율적인 대응행동을 생성하는 의도-대응 인터페이스 장치 및 그 방법에 관한 것이다. 더욱 상세하게는 행동학습 모델을 통해 복합-의도 대응에 필요한 행동 네트워크 모듈을 구성하여 사용자의 의도에 부합하는 대응 행동을 생성하고, 불필요한 연산과 대응 시간을 줄이는 것이다. The present invention relates to a technology for generating agent-responsive behavior using a behavioral learning model, which has previously generated a new response action based on domain knowledge, taking into account the cost of empirical action, , The behavior learning model is applied only to the complex-intention correspondence process that requires learning about the user's intention, and the effective response behavior And to an intention-response interface device and method thereof. More specifically, a behavioral learning model is used to construct a behavioral network module necessary for complex-intent correspondence to generate a behavior corresponding to a user's intention and to reduce unnecessary operations and response time.

사용자의 의도에 반응하여 적절한 대응행동을 생성하는 에이전트 장치는 자동화된 서비스를 제공함에 있어서 필수적인 구성이다. 업무를 돕는 비서 에이전트, 심부름 또는 청소 업무를 수행하는 로봇 에이전트, 가정에서 가전기기를 비롯한 전자 장비들을 관리하는 홈 에이전트 등 다양한 에이전트 시스템이 존재하고 있다. 그리고 이들 사용자 에이전트 시스템을 사용자가 이용하는 방식은 기존의 직접적인 명령 입력 방식에서, 보다 고차원적이고 상호작용을 지원하며 사용자에게 최적화된 인터페이스를 제공하는 방식으로 발전하고 있다.An agent device that generates an appropriate response in response to a user's intention is an essential configuration in providing automated services. There are various agent systems such as a secretary agent for helping work, a robot agent for performing errand or cleaning work, and a home agent for managing electronic equipment including household appliances at home. And, the way that users use these user agent systems is evolving from the existing direct command input method to a method of providing higher-dimensional, interactive and user-optimized interfaces.

이와 같은 사용자 에이전트 시스템은 사용자의 의도 또는 목적을 파악하는 수단과 파악된 의도 또는 목적에 대응하는 행동을 생성하는 수단을 포함할 수 있다. 그리고 의도에 대응하는 행동을 생성하기 위하여는 미리 정의된 규칙에 따라 대응행동을 생성하거나 또는 사용자와의 상호 작용을 통해 학습된 행동 학습 모델을 이용하여 대응행동을 생성할 수도 있다.Such a user agent system may include means for grasping the intention or purpose of the user and means for generating actions corresponding to the intent or purpose of the grasp. In order to generate a behavior corresponding to the intention, a corresponding behavior may be generated according to a predefined rule, or a corresponding behavior may be generated using a learned behavior learning model through interaction with a user.

따라서 사용자의 의도 또는 목적을 보다 효율적이고 정확하게 파악하는 것, 보다 신뢰도 있게 사용자 의도 또는 목적에 대응하는 행동을 생성하는 것이 사용자 에이전트 시스템에 있어서 해결해야 할 중요한 과제이다. 또한 사용자 에이전트 시스템이 실시간으로 사용자와 상호 작용을 하면서 동작하기 위하여는 대응행동을 생성하는 계산 속도가 일정 기준 보다 신속하게 이루어져야 할 필요가 있다.Therefore, it is an important task to be solved in the user agent system to more efficiently and accurately grasp the intention or purpose of the user, and more reliably to create an action corresponding to the user's intention or purpose. Also, in order for the user agent system to operate in real time while interacting with the user, it is necessary that the calculation speed for generating the corresponding action should be made faster than a certain standard.

본 발명에서는 사람이 타인의 의도를 이해하고 이에 대응하는 뇌의 학습 및 대응 매커니즘을 모방하여, 사용자 에이전트 장치가 사용자의 의도를 그 추상성 정도에 따라 적절히 이해하고, 이에 대응하는 행동을 생성하는 방법을 개시한다. In the present invention, a method for a user agent device to appropriately understand a user's intention according to the degree of abstraction and to generate a corresponding action by imitating a learning and response mechanism of a brain that understands a person's intention of a person and copes with it .

인간의 뇌는 입력받은 정보의 난이도와 추상성 정도에 따라 상이한 방식으로 작용하여 입력받은 정보에 따른 대응행동을 생성한다. 즉 예를 들면 '컵을 집어'라는 단순한 명령에 대하여는 뇌의 영역 중 aIPS(anterior IntraParietal Sulcus), PMC(PreMotor Cortex), pSTS(posterior Superior Temporal Sulcus)와 같은 영역이 활성화되어 단순한 명령에 대한 대응 행동을 생성한다. 반면 '로멘틱한 저녁을 준비해'라는 보다 추상적이고 고차원적인 명령에 대하여는 뇌의 영역 중 TPJ(Temporo Parietal Junctino), mPFC(medial PreFrontal Cortex)와 같은 영역이 활성화되어 고차원적인 명령에 대한 대응행동을 생성한다.The human brain works in a different way depending on the degree of difficulty and abstraction of the input information, and generates a corresponding behavior according to the input information. For example, for a simple command to 'pick a cup', areas such as aIPS (anterior IntraParietal Sulcus), PMC (PreMotor Cortex), and pSTS (posterior Superior Temporal Sulcus) . On the other hand, for more abstract and higher order commands such as 'prepare a romantic evening', areas such as TPJ (Temporo Parietal Junctino) and mPFC (medial PreFrontal Cortex) are activated in the brain area to generate a response action to a higher order command .

본 발명은 위와 같이 인간이 입력받은 정보 또는 명령의 복잡도 또는 난이도 또는 추상성의 정도에 따라 뇌가 작용하는 매카니즘이 상이한 점에 착안하여, 사용자 에이전트 장치에 있어서도 사용자의 의도의 수준에 따라 서로 다른 방식으로 대응행동을 생성하는 방법을 제안한다.The present invention is based on the fact that the mechanism in which the brain acts differs depending on the complexity of the information or command inputted by the human or the degree of difficulty or abstraction, We propose a method to generate the corresponding action.

이를 위하여 본 발명은 낮은 수준의 대응행동을 필요로 하는 단순의도에는 빠르고 직관적인 대응행동을 생성하고, 상위 수준의 대응행동을 필요로 하는 복합의도에는 사용자 의도에 기반하여 학습된 대응기법을 적용하여 장기적인 대응행동을 생성한다. 또한 행동 네트워크 모듈로 구성되는 행동 선택 네트워크를 이용하여 사용자의 의도에 대한 대응행동을 빠르게 생성한다.To this end, the present invention creates quick and intuitive response behaviors for simple intentions that require low-level response behaviors, and uses the learned correspondence techniques for complex intentions that require high-level response behaviors To generate long-term response behavior. In addition, a behavioral selection network composed of behavioral network modules is used to quickly generate a response behavior to a user's intention.

이해를 돕기 위하여 본 발명에 따른 사용자 에이전트 장치의 간단한 동작 예를 들면, 사용자가 '창문을 열어'라는 문장을 이야기한 경우, 사용자 에이전트 장치는 마이크 센서를 이용하여 상기 사용자의 말을 입력받고, 입력받은 음성 신호로부터 '창문을 열어'라는 사용자 명령 문장을 획득하고, 그에 대응하는 사용자의 의도 또는 목적을 '창문을 연다'로 파악할 수 있다. 그리고 파악된 사용자의 의도에 대응하는 행동으로, '창문으로 이동', '창문을 열기'라는 대응행동을 생성할 수 있다.For the sake of understanding, when the user tells a statement 'open a window', for example, a simple operation of the user agent device according to the present invention, the user agent device inputs the user's word using a microphone sensor, A user command sentence 'Open a window' is obtained from a received voice signal, and a user's intention or purpose corresponding to the obtained user command sentence can be identified as 'Open a window'. Then, in response to the identified user's intention, a corresponding action 'move to a window' or 'open a window' can be generated.

여기서 대응행동이라 함은 파악된 사용자의 의도에 따라 사용자 에이전트 시스템이 수행하거나 또는 외부의 장치를 이용하여 수행을 지시하여야 하는 행동을 의미한다. 이와 같은 대응행동은 미리 정해진 어느 하나의 행동을 서술하는 문구로 표현될 수 있고, 필요에 따라 프로그래밍 언어 또는 인스트럭션 셋 등 다양한 형식으로 표현될 수 있다. 예를 들면 대응행동은 '물체를 잡는다'와 같은 단위 행동이 될 수 있다. 또한 여기서 대응행동은 '물체를 잡는다'와 '물체를 든다'와 같이 하나 이상의 일련의 단위 행동으로 이루어질 수도 있다.Here, the 'corresponding action' means an action to be performed by the user agent system according to the intention of the user or to instruct execution using an external device. Such a corresponding action may be expressed as a phrase describing a predetermined action, and may be expressed in various forms such as a programming language or an instruction set as needed. For example, a corresponding action can be a unit action such as 'catch an object'. Also, the corresponding action may consist of one or more series of unit behaviors such as 'catch an object' and 'pick up an object'.

이하에서는 본 발명에 따른 사용자 에이전트 장치의 세부 구성과 그 동작에 대하여 보다 상세히 설명한다.Hereinafter, the detailed configuration and operation of the user agent apparatus according to the present invention will be described in detail.

도 1은 본 발명의 일 실시예에 따른 사용자 에이전트 장치의 블록도이다.1 is a block diagram of a user agent device in accordance with an embodiment of the present invention.

상기 본 발명에 따른 사용자 에이전트 장치는 정보 입력부(100), 사용자 의도 분석부(200), 단순 대응행동 생성부(300), 복합 대응행동 생성부(400)를 포함한다.The user agent apparatus according to the present invention includes an information input unit 100, a user's intention analysis unit 200, a simple response behavior generation unit 300, and a complex response behavior generation unit 400.

정보 입력부(100)는 사용자 입력 정보와 주변 환경정보를 입력받는다.The information input unit 100 receives user input information and surrounding environment information.

사용자 의도 분석부(200)는 상기 사용자 입력 정보를 분석하여, 하나의 중간 목적이 획득된 경우 사용자의 의도를 단순의도로 분류하고, 두 개 이상의 상기 중간 목적을 포함하는 복합 목적이 획득된 경우 상기 사용자의 의도를 복합의도로 분류한다.The user's intention analysis unit 200 analyzes the user input information to classify the intention of the user when the one intermediate goal is acquired and the intention of the user if the two or more intermediate goals are acquired. Classify users' intentions into multiple roads.

단순 대응행동 생성부(300)는 상기 사용자의 의도가 상기 단순의도로 분류된 경우, 목적 노드와 환경 노드와 행동 노드로 이루어진 행동 선택 네트워크를 이용하여 상기 단순의도에 대응하는 행동을 생성한다.The simple response behavior generating unit 300 generates a behavior corresponding to the simple intention by using a behavior selection network including a destination node, an environment node, and a behavior node when the intention of the user is classified as the simple road.

복합 대응행동 생성부(400)는 상기 사용자의 의도가 상기 복합의도로 분류된 경우, 상기 행동 선택 네트워크를 이용하여 상기 복합의도에 대응하는 행동을 생성한다.The complex response behavior generating unit 400 generates a behavior corresponding to the compound intention using the behavior selection network when the user's intention is classified as the compound road.

도 2는 본 발명의 또 다른 실시예에 따른 사용자 에이전트 장치의 블록도이다.2 is a block diagram of a user agent device in accordance with another embodiment of the present invention.

상기 본 발명에 따른 사용자 에이전트 장치는 정보 입력부(100), 사용자 의도 분석부(200), 단순 대응행동 생성부(300), 복합 대응행동 생성부(400)와 함께 행동 선택 네트워크 저장부(500)를 더 포함할 수 있다.The user agent apparatus according to the present invention includes an action selection network storage unit 500 together with an information input unit 100, a user's intention analysis unit 200, a simple response action generation unit 300, As shown in FIG.

행동 선택 네트워크 저장부(500)는 상기 행동 선택 네트워크를 저장한다.The behavior selection network storage unit 500 stores the behavior selection network.

이하에서는 본 발명에 따른 사용자 에이전트 장치의 각 부분에 대하여 보다 상세히 설명한다.Hereinafter, each part of the user agent apparatus according to the present invention will be described in detail.

여기서 상기 사용자 입력 정보는 다양한 종류의 센서를 이용하여 사용자의 행위를 인지한 정보를 의미한다. 예를 들면 사용자 입력 정보는 마이크를 이용하여 녹음한 사용자의 음성 신호가 될 수 있고, 또는 카메라를 이용하여 녹화한 사용자의 동작에 관한 영상 신호가 될 수도 있다. 또는 터치 센서를 이용하여 감지한 사용자의 센서 터치 정보가 될 수도 있다.Herein, the user input information is information perceived by a user using various kinds of sensors. For example, the user input information may be a voice signal of a user recorded using a microphone, or may be a video signal related to a motion of a user recorded using the camera. Or the sensor touch information of the user sensed by using the touch sensor.

여기서 상기 환경정보는 다양한 종류의 센서를 이용하여 상기 에이전트 장치가 속한 환경을 인지한 정보를 의미한다. 예를 들면 환경정보는 온도 센서를 이용하여 획득한 온도 정보가 될 수 있고, 조도 센서를 이용하여 획득한 밝기 센서가 될 수도 있다.Here, the environment information indicates information on the environment of the agent apparatus using various sensors. For example, the environmental information may be temperature information acquired using a temperature sensor, or may be a brightness sensor acquired using an illuminance sensor.

정보 입력부(100)는 영상 센서 또는 음성 센서 또는 터치 센서 중 적어도 어느 하나를 이용하여 사용자의 행위에 따른 상기 사용자 입력 정보를 입력받고, 상기 사용자 에이전트 장치가 속한 환경에 관한 상기 환경정보를 입력받을 수 있다.The information input unit 100 receives the user input information according to a user's action using at least one of an image sensor, a voice sensor, and a touch sensor and receives the environment information related to the environment to which the user agent apparatus belongs have.

여기서 사용자의 의도는 복잡도 또는 추상성의 정도에 따라 단순의도와 복합의도 중 어느 하나로 분류된다. 예를 들어 단순의도는 '컵을 잡다' 또는 '컵을 놓다'와 같이 복잡도가 낮은 수준의 의도를 의미하고, 복합의도는 '목이 마르다'와 같은 복잡도가 높은 수준의 의도를 의미하는 것으로, 복합의도는 단순의도에 비해 상대적으로 추상적인 특성을 갖는다. Here, the intention of the user is classified into one of simplicity and complexity according to the degree of complexity or abstraction. For example, simple intent refers to intention with low complexity such as 'catch cup' or 'put cup', and composite intent refers to intention with high complexity such as 'thirsty' , Complexity has relatively abstract characteristics compared to simple intention.

여기서 사용자 의도 분석부(200)는 사용자 입력 정보를 분석하여 사용자 입력 정보에 포함된 사용자의 목적을 획득한다. 여기서 사용자의 목적은 하나의 중간 목적이 되거나, 또는 두 개 이상의 중간 목적을 포함하는 복합 목적이 될 수 있다. 또한 여기서 사용자 의도 분석부(200)는 획득된 사용자의 목적이 하나의 중간 목적만을 포함하는 경우, 사용자의 의도는 단순의도로 분류하고, 두 개 이상의 상기 중간 목적을 포함하는 복합 목적이 획득된 경우 상기 사용자의 의도를 복합의도로 분류할 수 있다.Here, the user's intention analysis unit 200 analyzes the user input information to obtain the purpose of the user included in the user input information. Here, the user's purpose may be one intermediate purpose, or a combination of two or more intermediate purposes. Here, the user's intention analysis unit 200 may classify the intention of the user as simple if the acquired user's purpose includes only one intermediate purpose, and if the combined purpose including two or more of the intermediate purposes is obtained The intention of the user can be classified into a compound road.

사용자 의도 분석부(200)는, 상기 사용자 입력 정보를 분석하여 사용자 명령 구문을 획득하고, 상기 사용자 명령 구문을 미리 정해진 의도 분류 온톨로지를 이용하여 분석하여 상기 사용자 명령 구문에 포함된 상기 중간 목적 또는 상기 복합 목적을 획득한다.The user intention analysis unit 200 analyzes the user input information to obtain a user command syntax, analyzes the user command syntax using a predetermined intention classification ontology, Obtain multiple objectives.

여기서 사용자 입력 정보는 사용자의 행위 자체에 관한 정보이고, 사용자 명령 구문은 사용자 입력 정보를 분석하여 획득한 사용자의 행위에 따른 특정 명령에 관한 정보를 포함하는 구문 정보이다. 예를 들면 사용자 입력 정보는 사용자가 '창문을 열어'라고 말한 것을 녹음한 음성 신호라면, 사용자 명령 구문은 위 음성 신호를 분석하여 획득한 '창문을 열어'라는 명령 구문이 될 수 있다. Herein, the user input information is information on a user's action itself, and the user command syntax is syntax information including information on a specific command according to a user's behavior acquired by analyzing user input information. For example, if the user input information is a voice signal in which the user has said 'open a window', the user command syntax may be a command phrase 'open a window' obtained by analyzing the above voice signal.

또한 사용자의 특정 동작에 대하여 미리 정해진 명령 구문이 있다면, 해당 특정 동작에 관한 정보가 사용자 입력 정보로 입력되었을 때 이를 분석하여 그에 대응하는 특정 명령 구문을 사용자 명령 구문으로 획득할 수도 있다. 예를 들면 사용자가 손을 왼쪽으로 이동하는 동작에 대하여 물체를 왼쪽으로 옮기는 명령 구문을 미리 정해놓았다면, 사용자 입력 정보가 사용자가 손을 왼쪽으로 이동시키는 동작을 촬영한 영상일 때, 이를 분석하여 물체를 왼쪽으로 옮기는 명령 구문을 사용자 명령 구문으로 획득할 수 있다.Also, if there is a predetermined command syntax for a specific operation of the user, information on the specific operation may be analyzed as input when the user input information is input, and a specific command syntax corresponding thereto may be acquired as a user command syntax. For example, if a command syntax for moving an object to the left with respect to an operation in which a user moves his / her hand to the left is predetermined, if the user input information is an image captured by a user moving a hand to the left, You can acquire the command syntax to move objects to the left with the user command syntax.

여기서 사용자 의도 분석부(200)는 상기 하나의 중간 목적이 획득된 경우는 상기 사용자 명령 구문에 대응하는 상기 사용자 의도를 상기 단순의도로, 상기 복합 목적이 획득된 경우는 상기 사용자 명령 구문에 대응하는 상기 사용자 의도를 상기 복합의도로, 각 분류한다.Here, the user's intention analysis unit 200 may determine whether the user intention corresponding to the user command syntax is the simple road, if the one intermediate goal is obtained, The user's intention is classified into the composite roads.

즉 사용자 의도 분석부(200)는 상기 사용자 입력 정보를 의도 분류 온톨로지(Ontology)를 이용하여 분석하여, 상기 사용자 입력 정보에 대응하는 사용자의 의도가 단순의도인지 복합의도인지 여부를 판단할 수 있다.That is, the user's intention analysis unit 200 analyzes the user input information using an intention classification ontology to determine whether the intention of the user corresponding to the user input information is simple or complex have.

예를 들어 사용자 의도 분석부(200)는 '창문을 열어줘' 또는 '상자를 왼쪽으로 옮김'과 같은 명령에 관한 사용자 입력 정보가 입력된 경우, 이에 포함된 사용자의 목적이 하나의 중간 목적을 가지는 것으로 판단하여, 이에 대응하는 사용자의 의도를 단순의도로 분류할 수 있다. 또는 사용자 의도 분석부(200)는 '덥다' 또는 '상자를 분류하여 쌓음'과 같은 명령에 관한 사용자 입력 정보가 입력된 경우, 이에 포함된 사용자의 목적이 복합 목적을 가지는 것으로 판단하여, 이에 대응하는 사용자의 의도를 복합의도로 분류할 수 있다.For example, if the user intention analysis unit 200 inputs user input information related to a command such as 'open a window' or 'move a box to the left' It is possible to classify the intention of the user corresponding thereto in a simple manner. Or the user's intention analyzing unit 200 determines that the user's intent of the user included in the command is a combined purpose when the user input information related to the command such as 'hot' or 'box classified and accumulated' is input, The user's intention can be classified into a plurality of roads.

여기서 사용자 의도 분석부(200)가 상기 사용자 입력 정보를 의도 분류 온톨로지를 이용하여 분석한 결과, 미리 정해진 일정한 기준 이상의 추상성을 가지는 사용자 의도가 도출된 경우 이를 복합의도로 판단할 수 있다. 아니면 미리 정해진 일정한 기준 이하의 추상성을 가지는 사용자 의도가 도출된 경우 이를 단순의도로 판단할 수 있다.Here, if the user's intention analysis unit 200 analyzes the user input information using the intention classification ontology, if the intention of the user having an abstraction more than a predetermined standard is derived, it can be determined as a composite. Or if the intention of the user having the abstraction below a predetermined standard is derived, it can be determined simply.

여기서 의도 분류 온톨로지는 언어들 간의 관계를 사용자의 의도를 중심으로 정의한 미리 정해진 온톨로지이다. Here, the ontology of ontology classification is a predefined ontology that defines the relationship between languages based on user's intention.

여기서 사용자 의도의 추상성을 판단하는 기준은 중간 목적의 수가 될 수 있다. Here, the criterion for determining the abstraction of the user's intention can be a number of intermediate objectives.

여기서 중간 목적은 사용자 에이전트 장치가 수행 또는 지시하는 특정 단위행동에 대응하는 목적으로 특정 주체 또는 객체와 그에 대한 동작 또는 상태에 관한 정보를 포함할 수 있다.Here, the intermediate object may include information about a specific subject or object and an operation or a state thereof for the purpose of corresponding to a specific unit behavior performed or instructed by the user agent apparatus.

여기서 사용자 의도 분석부(200)는 사용자 입력 정보를 의도 분류 온톨로지를 이용하여 분석하여 사용자 입력 정보에 포함된 사용자의 목적을 획득한다. 여기서 사용자의 목적은 하나의 중간 목적이 되거나, 또는 두 개 이상의 중간 목적을 포함하는 복합 목적이 될 수 있다. 또한 여기서 사용자 의도 분석부(200)는 획득된 사용자의 목적이 하나의 중간 목적만을 포함하는 경우 사용자의 의도를 단순의도로 분류하고, 두 개 이상의 상기 중간 목적을 포함하는 복합 목적이 획득된 경우 상기 사용자의 의도를 복합의도로 분류할 수 있다. Here, the user's intention analysis unit 200 analyzes the user input information using the intention classification ontology and obtains the purpose of the user included in the user input information. Here, the user's purpose may be one intermediate purpose, or a combination of two or more intermediate purposes. Here, the user's intention analysis unit 200 classifies the intention of the user into a simple one if the acquired user's purpose includes only one intermediate purpose, and if the combined purpose including two or more of the above- The user's intention can be classified into a plurality of roads.

도 3은 사용자 의도 분석부(200)의 동작을 설명하기 위한 참고도이다. FIG. 3 is a reference diagram for explaining the operation of the user's intention analysis unit 200. FIG.

도 3과 같이 사용자 입력 정보를 의도 분류 온톨로지를 이용하여 트리 구조로 의미를 분석할 수 있다. 도 3과 같이 '쌀쌀한데'라는 음성 신호인 사용자 입력 정보가 입력된 경우 사용자 의도 분석부(200)는 사용자 입력 정보를 분석하여 '쌀쌀한데'에 해당하는 사용자 명령 구문을 획득하고, 이를 의미 분류 온톨로지를 이용하여 분류하여 적어도 2개 이상의 중간 목적(에어컨 온도 조정, 선풍기 제어, 창문 제어 등)을 포함하는 복합 목적(적정 실내 온도 맞추기)를 획득할 수 있다. 이 경우 사용자의 의도는 복합의도가 된다. 또는 도 3과 같이 '창문 좀'이라는 사용자 입력 정보가 입력된 경우 사용자 의도 분석부(200)는 이를 분석하여 '창문 제어'라는 하나의 중간 목적을 획득할 수도 있다. 이 경우 사용자의 의도는 단순의도가 된다.As shown in FIG. 3, the user input information can be analyzed in a tree structure using an ontology for classifying an intention. 3, the user's intention analysis unit 200 analyzes the user's input information to obtain a user command syntax corresponding to 'chilly', and assigns meaning to the ' It is possible to classify them using an ontology to obtain a combined purpose (proper indoor temperature matching) including at least two intermediate purposes (air conditioner temperature control, fan control, window control, etc.). In this case, the intention of the user is a composite intention. 3, the user's intention analysis unit 200 may analyze the user input information to acquire one intermediate purpose of 'window control'. In this case, the intention of the user is a simple intention.

여기서 복합목적은 다시 계층적으로 분할되어 하나 이상의 복합 목적을 포함할 수 있다. Where the composite object may be further hierarchically segmented to include one or more composite objects.

도 4는 사용자 의도 분석부(200)가 의미 분류 온톨로지를 이용하여 하나의 복합 목적을 계층적으로 분석하는 동작에 관한 참고도이다. 도 4와 같이 복합목적 #1은 계층적으로 분할되어 하위 계층의 복합 목적 #2들을 포함할 수 있고, 복합목적 #2가 다시 분할되어 하나 이상의 중간 목적들을 포함할 수 있다. 이하 상술할 바와 같이 하나의 중간 목적에 대하여는 그에 대응하는 대응행동이 생성된다.4 is a reference diagram for an operation in which the user's intention analysis unit 200 hierarchically analyzes one combined object using a semantic classification ontology. As shown in FIG. 4, the composite object # 1 may be hierarchically divided to include the composite objects # 2 of the lower layer, and the composite object # 2 may be divided again to include one or more intermediate objects. As will be described below, corresponding action is generated for one intermediate purpose.

다음으로 단순 대응행동 생성부(300)는 상기 사용자의 의도가 상기 단순의도로 분류된 경우, 상기 행동 선택 네트워크를 이용하여 상기 단순의도에 대응하는 행동을 생성한다.Next, the simple response behavior generating unit 300 generates a behavior corresponding to the simple intention using the behavior selection network when the intention of the user is classified as the simple road.

여기서 행동 선택 네트워크는 목적 노드와 환경 노드와 행동 노드로 이루어진 노드들 간의 연결로 이루어진 네트워크이다. Here, the behavior selection network is a network consisting of connections between destination nodes, environment nodes, and action nodes.

여기서 상기 행동 선택 네트워크는 적어도 하나 이상의 행동 네트워크 모듈을 포함한다.Wherein the behavior selection network comprises at least one behavioral network module.

여기서 상기 행동 네트워크 모듈은 하나의 상기 중간 목적에 관한 상기 목적 노드에 대하여 적어도 하나 이상의 상기 환경 노드와 적어도 하나 이상의 상기 행동 노드들 간의 연결로 이루어지는 것을 특징으로 한다.Wherein the behavior network module comprises a connection between at least one of the environment node and at least one of the behavior nodes with respect to the target node for one intermediate purpose.

도 5는 행동 네트워크 모듈을 설명하기 위한 참고도이다. 5 is a reference diagram for explaining a behavioral network module.

도 5와 같이 하나의 행동 네트워크 모듈은 하나의 중간 목적에 대응하는 목적 노드를 가지고, 하나 이상의 환경 노드와 행동 노드들을 가진다. 여기서 환경 노드는 상기 환경정보에 대응하여 사용자 에이전트가 속한 환경에 관한 정보를 나타내는 노드이고, 상기 행동 노드는 사용자 에이전트가 수행하거나 또는 지시하는 행동에 관한 정보를 나타내는 노드이다. 여기서 하나의 행동 노드는 하나의 단위 행동을 대표한다. 따라서 일련의 행동 노드들의 연결을 통해 일련의 단위 행동들을 나타낼 수 있다.As shown in FIG. 5, one behavioral network module has one or more environment nodes and action nodes having a corresponding object node for one intermediate purpose. Here, the environment node is a node representing information on the environment to which the user agent belongs in response to the environment information, and the behavior node is a node representing information about actions performed or indicated by the user agent. Here, one action node represents one unit action. Thus, a set of behavioral nodes can represent a series of unit behaviors through connections.

단순 대응행동 생성부(300)에서는 낮은 수준의 의도인 단순의도에 빠르게 대응행동을 생성하기 위한 것으로, 하나의 중간목적을 달성하기 위한 행동을 생성한다. 예를 들어 단순의도로 분류된 "창문을 열어줘"라는 사용자 입력 정보에 대응하는 '창문 열기'라는 중간 목적이 획득된 경우, 이를 달성하기 위해 '창문 열기'라는 중간 목적을 가지는 행동 네트워크 모듈을 행동 선택 네트워크에서 검색하여 찾고, 선택된 행동 네트워크 모듈을 이용하여 상기 단순의도에 대응하는 행동을 생성한다.The simple response behavior generating unit 300 generates a response to achieve a single intermediate purpose in order to quickly generate a response response to a simple intention which is a low level intention. For example, in the case where an intermediate purpose of 'opening a window' corresponding to user input information of a simple classified 'open window' is obtained, in order to achieve this, a behavior network module having an intermediate purpose of 'opening a window' The behavior selection network is searched and found, and a behavior corresponding to the simple intention is generated using the selected behavior network module.

여기서 단순 대응행동 생성부(300)는, 상기 단순의도에 대응하는 상기 중간 목적을 가지는 상기 행동 네트워크 모듈을 상기 행동 선택 네트워크에서 검색하여 선택하고, 상기 선택한 행동 네트워크 모듈에 포함된 상기 행동 노드를 이용하여 단순 대응행동을 생성한다.Here, the simple response behavior generating unit 300 searches for and selects the behavior network module having the intermediate purpose corresponding to the simple intention in the behavior selection network, and selects the behavior node included in the selected behavior network module To generate a simple response action.

여기서 상기 단순 대응행동은 일정한 순서로 배열된 적어도 하나 이상의 단위 행동을 포함한다.Here, the simple corresponding action includes at least one or more unit actions arranged in a predetermined order.

도 5를 참조하면 단순 대응행동 생성부(300)는 중간 목적(Sgi)에 대응하는 행동 네트워크 모듈의 행동 노드들(b1, b2, b3)의 연결을 생성하여, 일정한 순서로 배열된 단위 행동들로 구성되는 단순 대응행동을 생성할 수 있다. 5, the simple corresponding behavior generating unit 300 generates a connection of the behavior nodes b1, b2, and b3 of the behavior network module corresponding to the intermediate goal Sgi, Can be generated.

도 6은 단순 대응행동을 설명하기 위한 참고도이다.6 is a reference diagram for explaining a simple response action.

예를 들면 도 6과 같이 (b2 -> b1 -> b3)으로 연결되어 중간 목적 (Sgi)를 달성하는 단순 대응행동을 생성할 수 있다. 여기서 단순 대응행동은 각 행동 노드에 대응하는 일련의 단위 행동들이 된다. 예를 들어 b1가 물체를 이동함, b2가 물체를 듬, b3이 물체를 내림이라는 단위 행동에 대응하는 행동 노드라면, 상기 단순 대응행동은 '물체를 듬', '물체를 이동함', '물체를 내림'이라는 일련의 단위 행동들의 집합이 될 수 있다.For example, as shown in FIG. 6, it is possible to generate a simple response action that is connected to (b2 -> b1 -> b3) to achieve the intermediate goal Sgi. Here, the simple response behavior is a series of unit behaviors corresponding to each action node. For example, if b1 is a behavior node corresponding to a unit behavior in which an object moves, b2 is an object, and b3 is an object, the simple corresponding action may be 'move object', 'move object' It can be a set of a set of unit behaviors called 'lowering an object'.

다음으로는 단순 대응행동 생성부(300)가 상기 선택한 행동 네트워크 모듈에 포함된 행동 노드들을 이용하여 중간 목적을 달성하기 위해 일정한 순서를 가지는 행동 노드들의 집합을 획득하는 과정을 보다 상세히 설명한다.Next, a process of acquiring a set of action nodes having a certain order to achieve an intermediate goal using the action nodes included in the selected action network module will be described in more detail.

여기서 단순 대응행동 생성부(300)는, 상기 선택한 행동 네트워크 모듈에 포함된 상기 행동 노드 각각에 대하여 활성화 에너지를 산출하고, 상기 활성화 에너지가 가장 큰 상기 행동 노드를 선택하고, 상기 선택된 행동 노드를 이용하여 상기 단순 대응행동을 생성하는 것이 바람직하다.Here, the simple corresponding behavior generating unit 300 may calculate the activation energy for each of the behavior nodes included in the selected behavior network module, select the behavior node having the largest activation energy, and use the selected behavior node Thereby generating the simple corresponding action.

여기서 단순 대응행동 생성부(300)는 일정한 시간 간격으로 상기 활성화 에너지를 산출하고, 각 시간 별로 가장 큰 상기 활성화 에너지를 가지는 상기 행동 노드를 선택하고, 상기 선택한 상기 행동 노드에 대응하는 상기 단위 행동을 생성하여, 상기 단순 대응행동을 생성하는 것이 바람직하다.Here, the simple corresponding behavior generating unit 300 may calculate the activation energy at a predetermined time interval, select the behavior node having the largest activation energy for each time, and calculate the unit behavior corresponding to the selected behavior node And generates the simple corresponding action.

즉 단순 대응행동 생성부(300)는 각 시간 별로 최대의 활성화 에너지를 가지는 행동 노드를 선택하여 이들을 차례로 연결함으로써, 일정한 순서를 가지고 연결된 행동 노드들의 집합을 획득할 수 있고, 이에 대응하는 일정한 순서를 가지는 단위 행동들의 집합을 생성할 수 있다. 그리고 이와 같이 생성된 단위 행동들의 집합이 상기 단순 대응행동이 된다.That is, the simple response behavior generating unit 300 can select the action nodes having the maximum activation energies for each time and connect them in order to acquire a set of connected action nodes in a predetermined order, Can generate a set of unit behaviors. And the set of unit behaviors generated in this way becomes the simple corresponding action.

여기서 활성화 에너지는 각 행동 노드에 대하여 산출하고, 해당 행동 노드에서의 활성화 에너지는 그 이전 시간의 해당 행동 노드의 활성화 에너지와, 해당 행동 노드와 연결된 환경 노드에 따른 에너지 값들의 가중합과, 해당 행동 노드에 연결된 목적 노드에 따른 에너지 값에 가중치를 적용한 값과, 해당 행동 노드와 타 노드들 간의 연결 링크에 따른 에너지 값들의 가중합을 이용하여 산출할 수 있다.Here, the activation energy is calculated for each action node, and the activation energy at the action node is calculated by adding the weighted sum of the activation energy of the corresponding action node at the previous time and the energy value according to the environmental node connected to the action node, A weighted sum of the energy value according to the destination node connected to the node and a weighted sum of energy values according to the connection link between the behavior node and the other nodes can be calculated.

여기서 단순 대응행동 생성부(300)는 하기 수학식 1과 같이 특정 노드에서의 활성화 에너지를 산출할 숭 있다.Here, the simple response behavior generating unit 300 may calculate the activation energy at a specific node as shown in the following Equation (1).

(여기서 x는 해당 행동 노드이고, a(t)는 t 시간에서의 활성화 에너지이고, G는 목적 노드에 따른 에너지 값이고, L은 해당 행동 노드와 타 노드 간의 연결 링크에 따른 에너지 값이고, E는 해당 행동 노드와 연결된 환경 노드에 따른 에너지 값이고, W는 각 에너지 값들에 적용되는 가중치이다.)(Where x is the corresponding action node, a (t) is the activation energy at time t, G is the energy value according to the destination node, L is the energy value according to the connection link between the behavior node and the other node, Is the energy value according to the environment node connected to the action node, and W is a weight applied to each energy value.)

필요에 따라 해당 행동 노드에서의 활성화 에너지는 그 이전 시간의 해당 행동 노드의 활성화 에너지, 해당 행동 노드와 연결된 환경 노드에 따른 에너지 값들의 가중합, 해당 행동 노드에 연결된 목적 노드에 따른 에너지 값에 가중치를 적용한 값, 해당 행동 노드와 타 노드들 간의 연결 링크에 따른 에너지 값들의 가중합 중 적어도 어느 하나 이상을 이용하여 산출할 수 있다.If necessary, the activation energy of the corresponding action node is a weighted sum of the activation energy of the corresponding action node of the previous time, the energy value of the environment node connected to the action node, and the energy value of the target node connected to the action node, And a weighted sum of energy values according to connection links between the behavior node and other nodes.

예를 들면 단순 대응행동 생성부(300)는, 특정 시간에서의 상기 활성화 에너지를, 상기 특정 시간에서의 상기 환경정보와 상기 사용자 에이전트 장치의 상태 정보에 따라 설정되는 상기 환경 노드를 이용하여 산출한 에너지 값과 상기 특정 시간에 인접한 이전 시간의 상기 활성화 에너지를 이용하여 산출할 수도 있다. 여기서 해당 행동 노드에 연결된 목적 노드에 따른 에너지 값에 가중치를 적용한 값을 더 이용할 수 있고, 나아가 해당 행동 노드와 타 노드들 간의 연결 링크에 따른 에너지 값들의 가중합을 더 이용하여 활성화 에너지를 산출할 수도 있다.For example, the simple response behavior generating unit 300 may calculate the activation energy at a specific time using the environment node set according to the environment information at the specific time and the state information of the user agent apparatus Energy value and the activation energy of the previous time adjacent to the specific time. Here, we can use the weighted value of the energy value according to the destination node connected to the corresponding action node, and further calculate the activation energy using the weighted sum of the energy values according to the connection link between the behavior node and the other nodes It is possible.

또한 단순 대응행동 생성부(300)는 위와 같이 산출한 활성화 에너지를 이용하여 각 시간 별로 최대의 활성화 에너지를 가지는 행동 노드를 선택함에 있어서, 산출된 활성화 에너지가 미리 정해진 일정한 기준의 문턱값 이상이어야 한다는 부가 조건을 충족하는 경우에만 해당 행동 노드를 선택할 수도 있다. Also, in selecting the behavior node having the maximum activation energy for each time using the activation energy calculated as described above, the simple response behavior generation unit 300 must determine that the calculated activation energy should be equal to or greater than a predetermined threshold value The behavior node may be selected only when the additional condition is satisfied.

만일 위와 같은 문턱값 조건이 충족되지 않는 경우 단순 대응행동 생성부(300)는 문턱값을 조절한 다음 활성화 에너지를 산출하는 위 과정을 반복하여 조건을 만족하는 행동 노드를 선택할 수도 있다.If the above threshold condition is not satisfied, the simple corresponding behavior generating unit 300 may select a behavior node satisfying the condition by repeating the above process of adjusting the threshold value and then calculating the activation energy.

다음으로 복합 대응행동 생성부(400)는 상기 사용자의 의도가 상기 복합의도로 분류된 경우, 상기 행동 선택 네트워크를 이용하여 상기 복합의도에 대응하는 행동을 생성한다.Next, the complex response behavior generating unit 400 generates a behavior corresponding to the compound intention using the behavior selection network when the intention of the user is classified as the compound road.

복합 대응행동 생성부(400)는, 상기 복합 목적을 계층 분할하여 일정한 순서를 가지는 상기 중간 목적들의 시퀀스를 획득하고, 상기 획득한 중간 목적들의 시퀀스에 포함된 각 상기 중간 목적에 대하여, 상기 행동 네트워크를 이용하여 상기 중간 목적에 대응하는 단위 행동의 시퀀스를 생성한다.The complex response behavior generating unit 400 divides the compound object into hierarchies to obtain a sequence of intermediate objectives having a predetermined order, and for each intermediate purpose included in the obtained sequence of intermediate objectives, To generate a sequence of unit behaviors corresponding to the intermediate purpose.

사용자 의도 분석부(200)에서 상기 사용자 입력 정보를 분석하여 두 개 이상의 상기 중간 목적을 포함하는 복합 목적이 획득된 경우, 복합 대응행동 생성부(400)는 상기 복합 목적을 계층 분할하여 먼저 일정한 순서를 가지는 상기 중간 목적들의 시퀀스를 획득한다.When the user intention analysis unit 200 analyzes the user input information and obtains a compound object including two or more intermediate objects, the composite action behavior generating unit 400 divides the object of the compound object into a predetermined sequence To obtain the sequence of intermediate goals having the intermediate goals.

도 7은 복합 대응행동 생성부(400)의 동작을 설명하기 위한 참고도이다.FIG. 7 is a reference diagram for explaining the operation of the complex response behavior generating unit 400. FIG.

복합 목적 gi가 사용자 의도 분석부(200)에서 획득된 경우, 먼저 복합 대응행동 생성부(400)는 도 7과 같이 복합 목적 gi를 계층 분할하여 적어도 2개 이상의 중간 목적들(sga, sgb, sgc, sgd)을 획득할 수 있다. 그리고 상기 계층 분할된 중간 목적들에 대하여 일정한 순서를 부여하여 상기 중간 목적들의 시퀀스(sga -> sgb -> sgc -> sgd)를 획득할 수 있다.When the compound object gi is obtained by the user's intention analysis unit 200, the compound response action generation unit 400 first divides the compound object gi into hierarchies and generates at least two intermediate objects sga, sgb, sgc , sgd). The sequence of intermediate goals (sga - > sgb - > sgc -> sgd) may be obtained by assigning a constant order to the hierarchical intermediate objectives.

도 8 (a)는 위와 같이 획득된 상기 중간 목적들의 시퀀스를 나타내는 참고도이다.FIG. 8 (a) is a reference diagram showing the sequence of the intermediate objectives obtained as described above.

다음으로 복합 대응행동 생성부(400)는, 상기 중간 목적 별로, 상기 행동 선택 네트워크에서 상기 중간 목적을 가지는 상기 행동 네트워크 모듈을 검색하여 선택하고, 상기 선택한 행동 네트워크 모듈에 포함된 상기 행동 노드를 이용하여 상기 단위행동의 시퀀스를 생성한다.Next, the complex response behavior generation unit 400 searches for and selects the behavior network module having the intermediate purpose in the behavior selection network for each intermediate purpose, and uses the behavior node included in the selected behavior network module To generate the sequence of unit behaviors.

복합 대응행동 생성부(400)는 상기 중간 목적들의 시퀀스에 포함된 각각의 중간 목적들에 대하여 일정한 순서를 가지는 단위 행동들의 집합인 단위 행동의 시퀀스를 획득할 수 있다. 여기서 복합 대응행동 생성부(400)는 단순 대응행동 생성부(300)가 하나의 중간 목적에 대하여 일련의 단위 행동들의 집합을 획득하는 방식과 동일한 방식으로, 각각의 중간 목적들에 대하여 상기 단위 행동의 시퀀스를 획득할 수 있다.The complex response behavior generating unit 400 may acquire a sequence of the unit behaviors, which is a set of the unit behaviors having a predetermined order with respect to each intermediate purpose included in the sequence of the intermediate objectives. Here, the complex response behavior generation unit 400 may generate the response action for each of the intermediate objectives in the same manner as the simple response behavior generation unit 300 acquires a set of the unit actions for one intermediate purpose Can be obtained.

도 8 (b)는 각각의 중간 목적들에 대하여 획득한 상기 단위 행동의 시퀀스를 나타내는 참고도이다. 도 8 (b)와 같이 sga에 대하여는 (ba2 -> ba1)의 단위 행동 시퀀스를, sgb에 대하여는 (bb7)의 단위 행동 시퀀스를, sgc에 대하여는 (bc3 -> bc5)의 단위 행동 시퀀스를, sgd에 대하여는 (bd4)의 단위 행동 시퀀스를 각각 획득할 수 있다.Fig. 8 (b) is a reference diagram showing the sequence of the unit behaviors obtained for each intermediate purpose. The unit action sequence of (ba2 - > ba1) for sga, the unit action sequence of (bb7) for sgb and the unit action sequence of (bc3 - > bc5) The unit action sequence of (bd4) can be obtained, respectively.

그리고 복합 대응행동 생성부(400)는 상기 중간 목적들의 시퀀스의 순서 대로 상기 중간 목적 별 상기 단위 행동의 시퀀스를 배열하여 이루어지는 복합 대응행동을 생성한다. 즉 도 8 (b)를 참조하면 각각의 중간 목적들에 대하여 획득된 단위 행동의 시퀀스가, 중간 목적들의 시퀀스의 순서대로 배열하여 (ba2 -> ba1) -> (bb7) -> (bc3 -> bc5) -> (bd4)와 같은 배열이 획득됨을 확인할 수 있다. 위와 같이 획득된 배열이 상기 복합 대응행동을 나타내는 단위 행동들의 배열이 된다. 여기서 행동 네트워크 모듈의 행동 노드는 각각의 단위 행동들을 나타내는 관계로, 도 8 (b)의 행동 노드들의 연결을 위에서는 단위 행동의 연결로 이루어지는 단위 행동의 시퀀스라고 설명하였다.The complex response behavior generating unit 400 generates a complex response action by arranging the sequence of the unit actions according to the intermediate purpose in the sequence of the intermediate objectives. That is, referring to FIG. 8 (b), a sequence of unit behaviors obtained for each intermediate purpose is arranged in the sequence of intermediate goals (ba2 -> ba1) -> (bb7) -> (bc3 -> bc5) -> (bd4) are obtained. The array obtained as described above becomes an array of unit behaviors representing the complex response behavior. Here, the behavior node of the behavior network module is a relation representing each unit behavior, and the connection of the behavior nodes in FIG. 8 (b) is described as a sequence of unit behavior consisting of a connection of unit behavior.

상술한 바와 같이 복합 대응행동 생성부(400)는 단순 대응행동 생성부(300)가 하나의 중간 목적에 대하여 일련의 단위 행동들의 집합을 획득하는 방식과 동일한 방식으로, 각각의 중간 목적들에 대하여 상기 단위 행동의 시퀀스를 획득할 수 있다. 이에 중간 목적에 대한 단위 행동의 시퀀스를 획득하는 방법에 대하여는 간략하게 서술한다.As described above, the complex response behavior generating unit 400 generates a response response for each of the intermediate objectives in the same manner as the simple response behavior generating unit 300 acquires a set of a plurality of unit behaviors for one intermediate purpose A sequence of unit behaviors can be obtained. A method of acquiring a sequence of unit behaviors for intermediate purposes will be briefly described.

여기서 복합 대응행동 생성부(400)는, 상기 선택한 행동 네트워크 모듈에 포함된 상기 행동 노드에 대하여 활성화 에너지를 산출하고, 상기 활성화 에너지가 가장 큰 상기 행동 노드를 선택하고, 상기 선택된 행동 노드를 이용하여 상기 단위 행동의 시퀀스를 생성하는 것이 바람직하다.Here, the complex response behavior generating unit 400 may calculate the activation energy for the behavior node included in the selected behavior network module, select the behavior node having the largest activation energy, and use the selected behavior node It is preferable to generate the sequence of unit behaviors.

여기서 활성화 에너지는 단순 대응행동 생성부(300)와 동일한 방식으로 산출할 수 있다.Here, the activation energy can be calculated in the same manner as the simple response behavior generating unit 300.

여기서 복합 대응행동 생성부(400)는 일정한 시간 간격으로 상기 활성화 에너지를 산출하고, 각 시간 별로 가장 큰 상기 활성화 에너지를 가지는 상기 행동 노드를 선택하고, 상기 선택한 상기 행동 노드에 대응하는 단위 행동을 생성하여, 상기 선택한 상기 행동 노드의 순서에 따라 상기 단위 행동의 시퀀스를 생성하는 것이 바람직하다.Here, the complex response behavior generating unit 400 may calculate the activation energy at a predetermined time interval, select the action node having the largest activation energy for each time, generate a unit action corresponding to the selected action node And generate the sequence of unit behaviors according to the order of the selected behavior nodes.

도 9는 본 발명의 또 다른 실시예에 따른 사용자 에이전트 장치의 블록도이다.9 is a block diagram of a user agent device in accordance with another embodiment of the present invention.

상기 본 발명에 따른 사용자 에이전트 장치는 정보 입력부(100), 사용자 의도 분석부(200), 단순 대응행동 생성부(300), 복합 대응행동 생성부(400), 행동 선택 네트워크 저장부(500)와 함께 학습부(600)를 더 포함할 수 있다.The user agent apparatus according to the present invention includes an information input unit 100, a user's intention analysis unit 200, a simple response behavior generation unit 300, a complex response behavior generation unit 400, a behavior selection network storage unit 500, The learning unit 600 may be further included.

학습부(600)는 상기 행동 선택 네트워크에 포함된 행동 네트워크 모듈의 배열의 집합을, 상기 사용자 입력 정보와 상기 환경정보와 사용자의 피드백 정보를 이용하여 학습한다.The learning unit 600 learns the set of the behavior network modules included in the behavior selection network using the user input information, the environment information, and the feedback information of the user.

학습부(600)는 적어도 하나 이상의 행동 네트워크 모듈들로 이루어지는 모듈 배열의 집합을 포함할 수 있다. 여기서 각각의 모듈 배열의 집합은 일정한 순서를 가지고 배열된 행동 네트워크 모듈들을 포함하고 그에 대하여 산출된 효용성 점수 정보를 포함하는 것이 바람직하다.The learning unit 600 may include a set of modules arranged by at least one behavioral network module. Here, each of the sets of module arrays preferably includes behavior network modules arranged in a predetermined order and includes the calculated utility score information.

도 10은 학습부(600)의 동작을 보다 상세히 설명하기 위한 참고도이다.10 is a reference diagram for explaining the operation of the learning unit 600 in more detail.

먼저 학습부(600)는 사용자 입력 정보를 분석하고 그에 대응하는 복합 목적들을 이루는 중간 목적들을 획득한다(S10). 다음으로는 행동 선택 네트워크 저장소(500)를 검색하여 상기 획득한 중간 목적들에 대응하는 행동 네트워크 모듈들을 선택한다(S20). 다음으로는 각 중간 목적들에 대응하는 행동 네트워크 모듈들을 가능한 순서대로 배열하여 여러개의 모듈 배열의 집합을 생성한다(S30). 다음으로는 상기 생성한 모듈 배열의 집합 중 하나의 모듈 배열을 선택하고(S40), 선택된 모듈 배열에 대하여 사용자의 피드백 정보를 입력 받는다(S50). 만일 사용자의 피드백 정보가 상기 선택된 모듈 배열로 표현되는 중간 목적들의 시퀀스가 사용자의 의도에 부합하는 것으로 판단되면 학습 과정을 종료하고, 그렇지 않으면 다시 학습 과정을 진행한다(S60). 즉 사용자의 의도에 부합되지 않는 것으로 판단되면 사용자 피드백 결과를 갱신하고(S70), 모듈 배열의 집합을 조정한 다음(S80), 다시 모듈 배열의 집합 중 하나의 모듈 배열을 선택하고(S40) 위 과정을 반복한다.First, the learning unit 600 analyzes the user input information and acquires intermediate objectives that form the corresponding combined purposes (S10). Next, the behavior selection network storage 500 is searched to select the behavior network modules corresponding to the obtained intermediate objectives (S20). Next, the behavior network modules corresponding to the respective intermediate purposes are arranged in the order as possible to generate a plurality of module array sets (S30). Next, one of the generated module arrays is selected (S40), and the user's feedback information is input to the selected module array (S50). If it is determined that the sequence of the intermediate objectives in which the feedback information of the user is expressed by the selected module array satisfies the user's intention, the learning process is terminated. Otherwise, the learning process is performed again (S60). The user feedback result is updated (S70), the set of module arrays is adjusted (S80), the module array of one of the sets of module arrays is selected again (S40) Repeat the process.

여기서 상기 S60 단계에서 사용자 피드백 정보에 따라 평가 대상이 된 모듈 배열에 대한 효용성 점수를 산출하거나 부여할 수 있다. 학습부(600)는 이상과 같은 과정을 통하여 모듈 배열에 대하여 효용성 점수가 부여된 모듈 배열의 집합을 생성할 수 있다.In step S60, the utility score for the module array to be evaluated can be calculated or given according to the user feedback information. The learning unit 600 can generate a set of module arrays to which the utility score is assigned to the module array through the above process.

이상과 같이 학습부(600)가 포함되는 경우 복합 대응행동 생성부(400)는 상기 학습부(600)에서 학습된 상기 모듈 구성 집합을 이용하여 상기 복합 대응행동을 생성할 수 있다. 즉 복합 대응행동 생성부(400)는, 상기 복합 목적을 계층 분할하여 일정한 순서를 가지는 상기 중간 목적들의 시퀀스를 획득함에 있어서, 상기 학습부에서 학습된 상기 모듈 구성 집합 데이터베이스를 이용하여, 효용성 점수 정보를 기준으로 상기 중간 목적들의 배열 순서를 정하고, 상기 배열 순서에 따른 상기 중간 목적들의 시퀀스를 획득할 수 있다.When the learning unit 600 is included as described above, the complex response behavior generation unit 400 can generate the complex response action using the module configuration set learned in the learning unit 600. [ In other words, the complex response behavior generating unit 400 may be configured to classify the complex objectives by hierarchically dividing the complex objectives into a sequence of intermediate objectives having a predetermined order, using the module configuration set database learned in the learning unit, Order of the intermediate destinations on the basis of the order of the intermediate destinations, and obtain the sequence of the intermediate destinations according to the arrangement order.

예를 들면 복합 대응행동 생성부(400)가 상기 복합 목적을 계층 분할하여 sga, sgb, sgc, sgd의 중간 목적들이 획득된 경우, 위 중간 목적들을 배열하는 순서를 결정하기 위하여, 각 중간 목적에 대응하는 행동 네트워크 모듈의 배열들에 대하여 학습부(600)에서 산출된 효용성 점수 정보를 이용하여 가장 높은 효용성 점수 정보를 가지는 행동 네트워크 모듈의 배열을 가지도록, 상기 중간 목적들을 배열하여, 상기 중간 목적들의 시퀀스를 획득할 수 있다.For example, in order to determine the order of arranging the above intermediate objectives when the complex response action generating unit 400 divides the compound object and obtains intermediate objectives of sga, sgb, sgc, and sgd, Arranging the intermediate objectives so as to have an array of behavioral network modules having the highest utility score information using the utility score information calculated by the learning unit 600 with respect to the arrays of corresponding behavioral network modules, Can be obtained.

도 11은 본 발명의 또 다른 실시예에 따른 사용자 의도 대응 행동 생성 방법에 관한 흐름도이다.11 is a flowchart illustrating a method for generating a user's intention response behavior according to another embodiment of the present invention.

상기 본 발명에 따른 사용자 의도 대응 행동 생성 방법은 정보 입력 단계(S100), 사용자 의도 분석 단계(S200), 단순 대응행동 생성 단계(S300), 복합 대응행동 생성 단계(S400)를 포함한다. 여기서 상기 본 발명에 따른 사용자 의도 대응 행동 생성 방법은 도 1 내지 도 10을 참조하면서 설명한 본 발명에 따른 사용자 에이전트 장치와 동일한 방식으로 동작할 수 있다. 이에 중복되는 부분은 생략하고 간략히 서술한다.The method for generating a user's intention response behavior according to the present invention includes an information input step S100, a user's intention analysis step S200, a simple response behavior generation step S300, and a complex response behavior generation step S400. Herein, the method for generating a user's intention response behavior according to the present invention can operate in the same manner as the user agent apparatus according to the present invention described with reference to FIG. 1 to FIG. The overlapping portions will be omitted and briefly described.

정보 입력 단계(S100)는 사용자 입력 정보와 주변 환경정보를 입력받는다.The information input step S100 receives user input information and surrounding environment information.

사용자 의도 분석 단계(S200)는 상기 사용자 입력 정보를 분석하여, 하나의 중간 목적이 획득된 경우 사용자의 의도를 단순의도로 분류하고, 두 개 이상의 상기 중간 목적을 포함하는 복합 목적이 획득된 경우 상기 사용자의 의도를 복합의도로 분류한다.The user intention analysis step (S200) analyzes the user input information, classifies the intention of the user when the one intermediate goal is obtained, and if the combined goal including the two or more intermediate goals is obtained, Classify users' intentions into multiple roads.

단순 대응행동 생성 단계(S300)는 상기 사용자의 의도가 상기 단순의도로 분류된 경우, 목적 노드와 환경 노드와 행동 노드로 이루어진 행동 선택 네트워크를 이용하여 상기 단순의도에 대응하는 행동을 생성한다.In the simple response behavior generating step S300, when the intention of the user is classified as the simple road, a behavior corresponding to the simple intention is generated using a behavior selection network including a destination node, an environment node, and a behavior node.

복합 대응행동 생성 단계(S400)는 상기 사용자의 의도가 상기 복합의도로 분류된 경우, 상기 행동 선택 네트워크를 이용하여 상기 복합의도에 대응하는 행동을 생성한다.In the complex response behavior generating step S400, when the intention of the user is classified into the compound road, the behavior corresponding to the compound intention is generated using the behavior selection network.

또한 여기서 상기 행동 네트워크 모듈은 하나의 상기 중간 목적에 관한 상기 목적 노드에 대하여 적어도 하나 이상의 상기 환경 노드와 적어도 하나 이상의 상기 행동 노드들 간의 연결로 이루어지는 것이 바람직하다.And wherein the behavioral network module comprises a connection between at least one of the environment node and at least one of the behavior nodes for the target node for one intermediate purpose.

여기서 사용자 의도 분석 단계(S200)는, 상기 사용자 입력 정보를 분석하여 사용자 명령 구문을 획득하고, 상기 사용자 명령 구문을 미리 정해진 의도 분류 온톨로지를 이용하여 분석하여 상기 사용자 명령 구문에 포함된 상기 중간 목적 또는 상기 복합 목적을 획득한다.Here, the user's intention analysis step (S200) may include analyzing the user input information to obtain a user command syntax, analyzing the user command syntax using a predetermined intention classification ontology, And obtains the combined purpose.

여기서 상기 하나의 중간 목적이 획득된 경우는 상기 사용자 명령 구문에 대응하는 상기 사용자 의도를 상기 단순의도로, 상기 복합 목적이 획득된 경우는 상기 사용자 명령 구문에 대응하는 상기 사용자 의도를 상기 복합의도로, 각 분류하는 것이 바람직하다.Wherein if the one intermediate goal is obtained, the user intention corresponding to the user command syntax is referred to as the simple road, and if the combined purpose is obtained, the user intention corresponding to the user command syntax is referred to as the composite road , Respectively.

단순 대응행동 생성 단계(S300)는, 상기 단순의도에 대응하는 상기 중간 목적을 가지는 상기 행동 네트워크 모듈을 상기 행동 선택 네트워크에서 검색하여 선택하고, 상기 선택한 행동 네트워크 모듈에 포함된 상기 행동 노드를 이용하여 단순 대응행동을 생성한다.In the simple response behavior generating step S300, the behavior network module having the intermediate purpose corresponding to the simple intention is searched and selected in the behavior selection network, and the behavior node included in the selected behavior network module is used To generate a simple response action.

여기서 상기 단순 대응행동은 일정한 순서로 배열된 적어도 하나 이상의 단위 행동을 포함하는 것이 바람직하다.Here, the simple corresponding action may include at least one unit action arranged in a predetermined order.

여기서 단순 대응행동 생성 단계(S300)는, 상기 선택한 행동 네트워크 모듈에 포함된 상기 행동 노드 각각에 대하여 활성화 에너지를 산출하고, 상기 활성화 에너지가 가장 큰 상기 행동 노드를 선택하고, 상기 선택된 행동 노드를 이용하여 상기 단순 대응행동을 생성하는 것이 바람직하다.Here, the simple corresponding behavior generating step S300 may include calculating activation energy for each of the behavior nodes included in the selected behavior network module, selecting the behavior node having the largest activation energy, and using the selected behavior node Thereby generating the simple corresponding action.

복합 대응행동 생성 단계(S400)는, 상기 복합 목적을 계층 분할하여 일정한 순서를 가지는 상기 중간 목적들의 시퀀스를 획득하고, 상기 획득한 중간 목적들의 시퀀스에 포함된 각 상기 중간 목적에 대하여, 상기 행동 네트워크를 이용하여 상기 중간 목적에 대응하는 단위 행동의 시퀀스를 생성한다.The complex response behavior generating step S400 includes hierarchically dividing the compound object to obtain a sequence of the intermediate objects having a predetermined order, and for each intermediate object included in the obtained sequence of intermediate objects, To generate a sequence of unit behaviors corresponding to the intermediate purpose.

다음으로 복합 대응행동 생성 단계(S400)는 상기 중간 목적들의 시퀀스의 순서 대로 상기 중간 목적 별 상기 단위 행동의 시퀀스를 배열하여 이루어지는 복합 대응행동을 생성할 수 있다.Next, the complex response behavior generating step S400 may generate a complex response action by arranging the sequence of the unit behaviors for the intermediate purpose in the sequence of the intermediate objectives.

또한 학습 단계(미도시)는 상기 행동 선택 네트워크에 포함된 행동 네트워크 모듈의 배열의 집합을, 상기 사용자 입력 정보와 상기 환경정보와 사용자의 피드백 정보를 이용하여 학습하는 학습 단계를 더 포함할 수 있다. 이 경우 복합 대응행동 생성 단계는 학습 단계(미도시)에서 학습된 상기 모듈 구성 집합을 이용하여 상기 복합 대응행동을 생성할 수 있다.Further, the learning step (not shown) may further include a learning step of learning the set of the behavior network modules included in the behavior selection network using the user input information, the environment information, and the feedback information of the user . In this case, the complex corresponding behavior generating step may generate the complex corresponding behavior using the learned module configuration set in the learning step (not shown).

이상에서 설명한 본 발명의 실시예를 구성하는 모든 구성요소들이 하나로 결합하거나 결합하여 동작하는 것으로 기재되어 있다고 해서, 본 발명이 반드시 이러한 실시예에 한정되는 것은 아니다. 즉, 본 발명의 목적 범위 안에서라면, 그 모든 구성요소들이 하나 이상으로 선택적으로 결합하여 동작할 수도 있다. It is to be understood that the present invention is not limited to these embodiments, and all elements constituting the embodiment of the present invention described above are described as being combined or operated in one operation. That is, within the scope of the present invention, all of the components may be selectively coupled to one or more of them.

또한, 그 모든 구성요소들이 각각 하나의 독립적인 하드웨어로 구현될 수 있지만, 각 구성요소들의 그 일부 또는 전부가 선택적으로 조합되어 하나 또는 복수개의 하드웨어에서 조합된 일부 또는 전부의 기능을 수행하는 프로그램 모듈을 갖는 컴퓨터 프로그램으로서 구현될 수도 있다. 또한, 이와 같은 컴퓨터 프로그램은 USB 메모리, CD 디스크, 플래쉬 메모리 등과 같은 컴퓨터가 읽을 수 있는 기록매체(Computer Readable Media)에 저장되어 컴퓨터에 의하여 읽혀지고 실행됨으로써, 본 발명의 실시예를 구현할 수 있다. 컴퓨터 프로그램의 기록매체로서는 자기 기록매체, 광 기록매체, 캐리어 웨이브 매체 등이 포함될 수 있다.In addition, although all of the components may be implemented as one independent hardware, some or all of the components may be selectively combined to perform a part or all of the functions in one or a plurality of hardware. As shown in FIG. In addition, such a computer program may be stored in a computer readable medium such as a USB memory, a CD disk, a flash memory, etc., and read and executed by a computer to implement an embodiment of the present invention. As the recording medium of the computer program, a magnetic recording medium, an optical recording medium, a carrier wave medium, and the like can be included.

또한, 기술적이거나 과학적인 용어를 포함한 모든 용어들은, 상세한 설명에서 다르게 정의되지 않는 한, 본 발명이 속하는 기술 분야에서 통상의 지식을 가진 자에 의해 일반적으로 이해되는 것과 동일한 의미를 갖는다. 사전에 정의된 용어와 같이 일반적으로 사용되는 용어들은 관련 기술의 문맥상의 의미와 일치하는 것으로 해석되어야 하며, 본 발명에서 명백하게 정의하지 않는 한, 이상적이거나 과도하게 형식적인 의미로 해석되지 않는다.Furthermore, all terms including technical or scientific terms have the same meaning as commonly understood by one of ordinary skill in the art to which this invention belongs, unless otherwise defined in the Detailed Description. Commonly used terms, such as predefined terms, should be interpreted to be consistent with the contextual meanings of the related art, and are not to be construed as ideal or overly formal, unless expressly defined to the contrary.

이상의 설명은 본 발명의 기술 사상을 예시적으로 설명한 것에 불과한 것으로서, 본 발명이 속하는 기술 분야에서 통상의 지식을 가진 자라면 본 발명의 본질적인 특성에서 벗어나지 않는 범위 내에서 다양한 수정, 변경 및 치환이 가능할 것이다. 따라서, 본 발명에 개시된 실시예 및 첨부된 도면들은 본 발명의 기술 사상을 한정하기 위한 것이 아니라 설명하기 위한 것이고, 이러한 실시예 및 첨부된 도면에 의하여 본 발명의 기술 사상의 범위가 한정되는 것은 아니다. 본 발명의 보호 범위는 아래의 청구 범위에 의하여 해석되어야 하며, 그와 동등한 범위 내에 있는 모든 기술 사상은 본 발명의 권리 범위에 포함되는 것으로 해석되어야 할 것이다.It will be apparent to those skilled in the art that various modifications, substitutions and substitutions are possible, without departing from the scope and spirit of the invention as disclosed in the accompanying claims. will be. Therefore, the embodiments disclosed in the present invention and the accompanying drawings are intended to illustrate and not to limit the technical spirit of the present invention, and the scope of the technical idea of the present invention is not limited by these embodiments and the accompanying drawings . The scope of protection of the present invention should be construed according to the following claims, and all technical ideas within the scope of equivalents should be construed as falling within the scope of the present invention.

100 : 정보입력부
200 : 사용자 의도 분석부
300 : 단순 대응행동 생성부
400 : 복합 대응행동 생성부
500 : 행동 선택 네트워크 저장부
600 : 학습부100: Information input unit
200: user intention analysis unit
300: Simple response action generating unit
400: a complex response behavior generating unit
500: action selection network storage unit
600: Learning department

Claims

In the user agent device,
An information input unit for receiving user input information and surrounding environment information;
Analyzing the user input information to classify the intention of the user when the one intermediate object is obtained and the intention of the user when the compound object including two or more intermediate objects is obtained; User intention analysis unit;
A simple response behavior generating unit for generating a behavior corresponding to the simple intention using a behavior selection network composed of a target node, an environment node, and a behavior node when the intention of the user is classified as the simple road; And
And a complex response behavior generating unit for generating a behavior corresponding to the compound intention using the behavior selection network when the intention of the user is classified as the compound road,
Wherein the behavioral selection network comprises at least one behavioral network module and wherein the behavioral network module comprises a connection between at least one of the environment node and at least one of the behavioral nodes for the one or more intermediate nodes under,
The user intention analysis unit may analyze the user input information to obtain a user command syntax, analyze the user command syntax using a predetermined intention classification ontology, and analyze the intermediate purpose or the combined purpose included in the user command syntax The user intention corresponding to the user command syntax is referred to as the simple road in the case where the one intermediate goal is obtained and the user intention corresponding to the user command syntax in the case where the combined purpose is obtained, Intentionally classify each,
Wherein the simple action corresponding to the behavioral network module includes a step of searching for and selecting the behavioral network module having the intermediate purpose corresponding to the simple intention in the behavioral selection network, &Lt; / RTI > wherein the user agent generates an action.

The method according to claim 1,
Further comprising a behavior selection network storage for storing the behavior selection network,

The method according to claim 1,
The information input unit receives the user input information according to a user's action using at least one of a video sensor, a voice sensor, and a touch sensor,
Wherein the environment information related to the environment to which the user agent device belongs is input.

delete

The method according to claim 1,
Wherein the simple response action comprises at least one or more unit actions arranged in a predetermined order.

The method according to claim 6,
Wherein the simple action corresponding to the selected action node includes a step of calculating an activation energy for each action node included in the selected action network module, selecting the action node having the largest activation energy, &Lt; / RTI > wherein the user agent generates an action.

8. The method of claim 7,
Wherein the simple action generating unit calculates the activation energy at a predetermined time interval, selects the action node having the largest activation energy for each time, generates the unit action corresponding to the selected action node, And generates a simple response action.

9. The method of claim 8,
Wherein the simple response behavior generating unit generates the simple action corresponding to the activation energy of the corresponding behavior node in the specific time before the specific time and the activation energy of the environment node connected to the behavior node A weighted sum, a value obtained by applying a weight to an energy value according to the destination node connected to the corresponding action node, and a weighted sum of energy values according to a connection link between the corresponding action node and other nodes Lt; / RTI > user agent device.

The apparatus according to claim 1,
Dividing the composite object into hierarchies to obtain a sequence of intermediate goals having a predetermined order,
Generating a sequence of unit behaviors corresponding to the intermediate object using the behavior network for each intermediate object included in the acquired sequence of intermediate objectives,
And a sequence of the unit behaviors for the intermediate purpose is arranged in the order of the sequence of the intermediate objectives.

11. The method of claim 10,
Wherein the complex response behavior generation unit searches for and selects the behavior network module having the intermediate purpose in the behavior selection network for each intermediate purpose and selects the action node of the unit behavior using the behavior node included in the selected behavior network module, Wherein the user agent generates a sequence.

12. The method of claim 11,
Wherein the complex corresponding behavior generating unit calculates activation energy for the behavior node included in the selected behavior network module, selects the behavior node having the largest activation energy, Wherein the user agent generates a sequence.

13. The method of claim 12,
Wherein the complex response behavior generating unit calculates the activation energy at a predetermined time interval, selects the action node having the largest activation energy for each time, generates a unit action corresponding to the selected action node, And generates the sequence of unit behaviors according to the order of the action nodes.

12. The method of claim 11,
Further comprising a learning unit for learning a set of the behavior network modules included in the behavior selection network using the user input information, the environment information, and feedback information of the user,

15. The method of claim 14,
Wherein the learning unit includes at least one set of module arrays in which the behavioral network modules are arranged in a predetermined order, calculates the utility score for the set of module arrays using the feedback information of the user,
Wherein the complex response behavior generation unit determines an arrangement order of the intermediate destinations based on the utility score for the set of module arrays calculated by the learning unit and acquires the sequence of intermediate objectives according to the arrangement order , A user agent device.

A method for generating a user's intent response behavior,
An information input step of receiving user input information and surrounding environment information;
Analyzing the user input information to classify the intention of the user when the one intermediate object is obtained and the intention of the user when the compound object including two or more intermediate objects is obtained; User intention analysis step;
A simple response behavior generating step of generating a behavior corresponding to the simple intention by using a behavior selection network composed of a destination node, an environment node and a behavior node when the intention of the user is classified into the simple road; And
And generating a behavior corresponding to the compound intention using the behavior selection network when the intention of the user is classified as the compound road,
Wherein the behavioral selection network comprises at least one behavioral network module and wherein the behavioral network module comprises a connection between at least one of the environment node and at least one of the behavioral nodes for the one or more intermediate nodes under,
The user's intention analysis step may include analyzing the user input information to obtain a user command syntax, analyzing the user command syntax using a predetermined intention classification ontology, analyzing the user command syntax using the intention classification ontology, The user intention corresponding to the user command syntax may be referred to as the simple road if the one intermediate goal is obtained and the user intention corresponding to the user command syntax if the combined purpose is obtained, Compound roads, respectively,
The simple action corresponding to the behavioral network module may include searching for and selecting the behavioral network module having the intermediate purpose corresponding to the simple intention in the behavioral selection network, And generating a corresponding behavior,

delete

17. The method of claim 16,
Wherein the simple response action includes at least one or more unit actions arranged in a predetermined order,

20. The method of claim 19,
Wherein the simple action generating step includes the steps of: calculating activation energy for each of the action nodes included in the selected action network module, selecting the action node having the largest activation energy, And generating a corresponding behavior,

17. The method of claim 16,
Dividing the composite object into hierarchies to obtain a sequence of intermediate goals having a predetermined order,
Generating a sequence of unit behaviors corresponding to the intermediate object using the behavior network for each intermediate object included in the acquired sequence of intermediate objectives,
And generating a compound corresponding behavior by arranging the sequence of the unit behaviors for the intermediate purpose in the order of the sequence of the intermediate objectives