KR102568143B1

KR102568143B1 - Method and device for providing customized service mode

Info

Publication number: KR102568143B1
Application number: KR1020160122296A
Authority: KR
Inventors: 김미희; 권순종
Original assignee: 주식회사 케이티
Priority date: 2016-09-23
Filing date: 2016-09-23
Publication date: 2023-08-18
Also published as: KR20180032946A

Abstract

음성 인식 단말에 의해 수행되는 맞춤형 서비스 모드 제공 방법은 음성 인식 단말에 설치된 어플리케이션에 대한 음성 명령을 입력받는 단계, 어플리케이션의 복수의 서비스 모드 중 제 1 서비스 모드를 통해 음성 서비스를 제공하는 단계, 제 1 서비스 모드를 통한 음성 서비스의 제공 중에, 사용자의 리액션을 인식하는 단계, 제 1 서비스 모드를 제 2 서비스 모드로 변경하는 단계, 제 2 서비스 모드를 통한 음성 서비스의 제공 중에, 사용자의 리액션을 인식하는 단계, 각 서비스 모드의 사용 이력 및 각 서비스 모드에 대한 상기 리액션중 하나 이상에 기초하여 각 서비스 모드의 선호도를 분석하는 단계 및 분석된 선호도에 기초하여 복수의 서비스 모드 중 선호 서비스 모드를 결정하는 단계를 포함할 수 있다. A method for providing a customized service mode performed by a voice recognition terminal includes the steps of receiving a voice command for an application installed in a voice recognition terminal, providing a voice service through a first service mode among a plurality of service modes of an application, a first Recognizing a user's reaction while providing a voice service through a service mode, changing a first service mode into a second service mode, recognizing a user's reaction while providing a voice service through the second service mode Analyzing a preference for each service mode based on a history of using each service mode and one or more of the reactions for each service mode, and determining a preferred service mode among a plurality of service modes based on the analyzed preference. can include

Description

Method and terminal for providing customized service mode {METHOD AND DEVICE FOR PROVIDING CUSTOMIZED SERVICE MODE}

본 발명은 맞춤형 서비스 모드를 제공하는 방법 및 단말에 관한 것이다. The present invention relates to a method and terminal for providing a customized service mode.

스마트 단말(예컨대, 스마트폰, IPTV 등)에서 실행되는 서비스에는 복수의 서비스 모드(예컨대, 테마)를 포함하고 있다. 기존에는 사용자가 특정 어플리케이션에서 특정 서비스 모드를 통해 서비스를 제공받고자 하는 경우, 수동으로 기본 서비스 모드에서 특정 서비스 모드로 변경해야 하는 번거로움이 있었다. A service executed in a smart terminal (eg, smart phone, IPTV, etc.) includes a plurality of service modes (eg, themes). Conventionally, when a user wants to be provided with a service through a specific service mode in a specific application, it is inconvenient to manually change the basic service mode to the specific service mode.

선행기술인 한국공개특허 제2013-0005656호에는 휴대단말의 카메라에 의해 촬영된 영상으로부터 검출된 얼굴에서 사용자 시선을 판별하고, 휴대단말의 자세와 사용자 시선에 따라 휴대단말의 보기 모드를 결정하여 결정된 보기 모드로 화면 데이터를 표시하는 구성이 개시되어 있다. Korean Patent Publication No. 2013-0005656, which is a prior art, identifies a user's gaze from a face detected from an image captured by a camera of a mobile terminal, and determines a viewing mode of the mobile terminal according to the posture of the mobile terminal and the user's gaze. A configuration for displaying screen data in a mode is disclosed.

음성 인식 단말에 설치된 어플리케이션에 대한 음성 명령을 입력받은 경우, 디폴트 서비스 모드를 통해 음성 서비스를 제공하고, 디폴트 서비스 모드의 음성 서비스에 대한 사용자의 리액션을 인식하여 디폴트 서비스 모드에서 다른 서비스 모드로 변경하고자 한다. 또한, 다른 서비스 모드를 통한 음성 서비스 제공 중에 사용자의 리액션을 인식하여 어플리케이션의 복수의 서비스 모드 중 해당 다른 서비스 모드를 제외한 다른 서비스 모드로 변경하고자 한다. 또한, 어플리케이션의 복수의 서비스 모드 각각을 통한 음성 서비스에 대한 사용자의 리액션 및 각 서비스 모드의 사용 이력을 저장하고자 한다. 또한, 어플리케이션에 대한 음성 명령을 입력받게 되면, 각 서비스 모드의 사용 이력 및 각 서비스 모드의 리액션에 기초하여 서비스 모드의 선호도를 분석함으로써 어플리케이션의 복수의 서비스 모드 중 선호 서비스 모드를 결정하고, 선호 서비스 모드를 통해 음성 서비스를 제공하고자 한다. 다만, 본 실시예가 이루고자 하는 기술적 과제는 상기된 바와 같은 기술적 과제들로 한정되지 않으며, 또 다른 기술적 과제들이 존재할 수 있다. When a voice command for an application installed in the voice recognition terminal is received, the voice service is provided through the default service mode, and the user's reaction to the voice service in the default service mode is recognized to change the default service mode to another service mode. do. In addition, while a voice service is provided through another service mode, a user's reaction is recognized to change to another service mode excluding the corresponding other service mode among a plurality of service modes of the application. In addition, the user's reaction to the voice service through each of the plurality of service modes of the application and the use history of each service mode are intended to be stored. In addition, when a voice command for an application is received, a preferred service mode is determined among a plurality of service modes of the application by analyzing the preference of the service mode based on the use history of each service mode and the reaction of each service mode, and the preferred service We want to provide voice service through the mode. However, the technical problem to be achieved by the present embodiment is not limited to the technical problems described above, and other technical problems may exist.

상술한 기술적 과제를 달성하기 위한 기술적 수단으로서, 본 발명의 제 1 측면에 따른 음성 인식 단말에 의해 수행되는 맞춤형 서비스 모드 제공 방법은 상기 음성 인식 단말에 설치된 어플리케이션에 대한 음성 명령을 입력받는 단계; 상기 어플리케이션의 복수의 서비스 모드 중 제 1 서비스 모드를 통해 음성 서비스를 제공하는 단계; 상기 제 1 서비스 모드를 통한 음성 서비스의 제공 중에, 사용자의 리액션을 인식하는 단계; 상기 제 1 서비스 모드를 제 2 서비스 모드로 변경하는 단계; 상기 제 2 서비스 모드를 통한 음성 서비스의 제공 중에, 상기 사용자의 리액션을 인식하는 단계; 상기 각 서비스 모드의 사용 이력 및 상기 각 서비스 모드에 대한 상기 리액션중 하나 이상에 기초하여 상기 각 서비스 모드의 선호도를 분석하는 단계; 및 상기 분석된 선호도에 기초하여 상기 복수의 서비스 모드 중 선호 서비스 모드를 결정하는 단계를 포함할 수 있다. 상기 리액션은 상기 사용자의 음성, 표정 및 제스쳐 중 적어도 하나일 수 있다.As a technical means for achieving the above technical problem, a method for providing a customized service mode performed by a voice recognition terminal according to a first aspect of the present invention includes receiving a voice command for an application installed in the voice recognition terminal; providing a voice service through a first service mode among a plurality of service modes of the application; Recognizing a user's reaction while providing a voice service through the first service mode; changing the first service mode to a second service mode; Recognizing the user's reaction while providing the voice service through the second service mode; Analyzing a preference for each service mode based on at least one of the use history of each service mode and the reaction for each service mode; and determining a preferred service mode among the plurality of service modes based on the analyzed preference. The reaction may be at least one of the user's voice, facial expression, and gesture.

일예에 있어서, 상기 선호 서비스 모드를 결정하는 단계 이후에, 상기 어플리케이션에 대한 음성 명령을 입력받는 단계; 및 상기 결정된 선호 서비스 모드를 통해 음성 서비스를 제공하는 단계를 더 포함할 수 있다. In one example, after the step of determining the preferred service mode, receiving a voice command for the application; and providing a voice service through the determined preferred service mode.

일예에 있어서, 상기 제 1 서비스 모드를 통해 음성 서비스를 제공하는 단계는 상기 음성 명령에 대한 음성 인식 및 상기 사용자에 대한 영상 인식 중 적어도 하나를 수행하여 상기 사용자의 연령, 성별 및 인원수 중 적어도 하나를 식별하는 단계; 및 상기 식별된 사용자의 연령, 성별 및 인원수에 기초하여 상기 제 1 서비스 모드를 결정하는 단계를 포함할 수 있다. In one example, the providing of the voice service through the first service mode may include performing at least one of voice recognition for the voice command and image recognition for the user to determine at least one of age, gender, and number of users of the user. identifying; and determining the first service mode based on the age, gender, and number of the identified users.

일예에 있어서, 상기 사용자의 리액션을 인식하는 단계는 상기 사용자의 음성을 인식하여 리액션 음성을 획득하는 단계를 포함하고, 상기 각 서비스 모드의 선호도를 분석하는 단계는 상기 리액션 음성에 포함된 어휘가 상기 각 서비스 모드에 대한 긍정 평가 또는 부정 평가에 해당하는 어휘인지 판단하는 단계를 포함할 수 있다. 상기 사용자의 리액션을 인식하는 단계는 상기 사용자의 영상을 인식하여 리액션 표정 또는 리액션 제스쳐를 획득하는 단계를 포함하고, 상기 각 서비스 모드의 선호도를 분석하는 단계는 상기 리액션 표정 또는 리액션 제스쳐가 상기 각 서비스 모드에 대한 긍정 평가 또는 부정 평가에 해당하는 표정인지 판단하는 단계를 더 포함할 수 있다. In one example, the recognizing the user's reaction includes recognizing the user's voice and obtaining a reaction voice, and the analyzing the preference of each service mode includes the vocabulary included in the reaction voice. A step of determining whether a word corresponds to a positive evaluation or a negative evaluation for each service mode may be included. The step of recognizing the user's reaction includes the step of recognizing the user's image and obtaining a reaction expression or reaction gesture, and the step of analyzing the preference of each service mode is such that the reaction expression or reaction gesture is used for each service. A step of determining whether the facial expression corresponds to a positive evaluation or a negative evaluation of the mode may be further included.

일예에 있어서, 상기 판단에 기초하여 상기 각 서비스 모드에 대하여 상기 긍정 평가 및 상기 부정 평가에 대한 점수를 누적하는 단계; 및 상기 누적된 점수에 기초하여 상기 각 서비스 모드의 선호도를 분석하는 단계를 포함할 수 있다. In one example, accumulating scores for the positive evaluation and the negative evaluation for each service mode based on the determination; and analyzing the preference of each service mode based on the accumulated score.

일예에 있어서, 상기 어플리케이션의 설치 데이터 및 상기 복수의 서비스 모드에 대한 정보를 서버로부터 수신하는 단계; 및 상기 어플리케이션을 설치하는 단계를 더 포함할 수 있다. In one example, receiving installation data of the application and information about the plurality of service modes from the server; and installing the application.

일예에 있어서, 상기 복수의 서비스 모드에 대한 정보는 상기 각 서비스 모드의 선호 나이 정보, 선호 성별 정보를 포함할 수 있다. In one example, the information on the plurality of service modes may include preferred age information and preferred gender information of each service mode.

일예에 있어서, 상기 사용자의 리액션을 인식하는 단계는 상기 리액션 음성이 긍정 평가에 해당하는 어휘를 포함할 때 인식된 사용자의 표정을 긍정 평가에 해당하는 표정으로 저장하는 단계; 및 상기 리액션 음성이 부정 평가에 해당하는 어휘를 포함할 때 인식된 사용자의 표정을 부정 평가에 해당하는 표정으로 저장하는 단계를 포함할 수 있다. In one example, recognizing the user's reaction may include: storing a recognized facial expression of the user as a facial expression corresponding to a positive evaluation when the reaction voice includes a vocabulary corresponding to a positive evaluation; and storing a recognized facial expression of the user as a facial expression corresponding to a negative evaluation when the reaction voice includes a vocabulary corresponding to a negative evaluation.

일예에 있어서, 상기 제 1 서비스 모드를 제 2 서비스 모드로 변경하는 단계는 상기 제 2 서비스 모드로 변경하기 위해 기설정된 조건이 충족되었는지 여부를 판단하는 단계를 포함하고, 상기 기설정된 조건은 기설정된 시간의 경과 또는 상기 리액션의 입력 횟수일 수 있다. In one example, the changing of the first service mode to the second service mode includes determining whether a preset condition for changing to the second service mode is met, and the preset condition is a preset condition. It may be the passage of time or the number of inputs of the reaction.

일예에 있어서, 상기 제 1 서비스 모드를 제 2 서비스 모드로 변경하는 단계는 상기 제 2 서비스 모드로 변경하는 요청이 입력되었는지를 판단하는 단계를 포함할 수 있다. In one example, the changing of the first service mode to the second service mode may include determining whether a request to change to the second service mode is input.

일예에 있어서, 상기 선호 서비스 모드를 통한 음성 서비스의 제공 중에, 상기 사용자의 리액션을 인식하는 단계; 및 상기 리액션이 부정 평가에 해당하는 경우, 상기 선호 서비스 모드를 변경하는 단계를 더 포함할 수 있다. In one example, recognizing the user's reaction while providing the voice service through the preferred service mode; and changing the preferred service mode when the reaction corresponds to a negative evaluation.

일예에 있어서, 서버로부터 신규 서비스 모드에 대한 데이터를 수신하는 단계; 및 상기 선호 서비스 모드를 상기 신규 서비스 모드로 변경하는 단계를 더 포함할 수 있다. In one example, receiving data on a new service mode from the server; and changing the preferred service mode to the new service mode.

본 발명의 제 2 측면에 따른 음성 인식 단말은 상기 음성 인식 단말에 설치된 어플리케이션에 대한 음성 명령을 입력받는 음성 입력부; 기설정된 조건 동안 상기 어플리케이션의 서비스 모드를 제 1 서비스 모드로 설정하고, 상기 기설정된 조건 이후에 상기 어플리케이션의 서비스 모드를 제 2 서비스 모드로 변경하는 제어부; 상기 각 서비스 모드를 통한 음성 서비스의 제공 중에, 사용자의 리액션 을 인식하는 리액션 인식부; 상기 각 서비스 모드의 사용 이력 및 상기 각 서비스 모드에 대한 상기 리액션 중 하나 이상에 기초하여 상기 각 서비스 모드의 선호도를 분석하는 선호도 분석부; 및 상기 분석된 선호도에 기초하여 상기 복수의 서비스 모드 중 선호 서비스 모드를 결정하는 선호 서비스 모드 결정부를 포함할 수 있다. 상기 리액션은 상기 사용자의 음성, 표정 및 제스쳐 중 적어도 하나일 수 있다. A voice recognition terminal according to a second aspect of the present invention includes a voice input unit for receiving a voice command for an application installed in the voice recognition terminal; a controller that sets the service mode of the application to a first service mode during a preset condition and changes the service mode of the application to a second service mode after the preset condition; A reaction recognition unit for recognizing a user's reaction while providing voice service through each of the service modes; a preference analyzer configured to analyze a preference of each service mode based on at least one of a use history of each service mode and the reaction for each service mode; and a preferred service mode determination unit configured to determine a preferred service mode among the plurality of service modes based on the analyzed preference. The reaction may be at least one of the user's voice, facial expression, and gesture.

일예에 있어서, 상기 음성 입력부는 상기 선호 서비스 모드가 결정된 이후에, 상기 어플리케이션에 대한 음성 명령을 입력받고, 상기 제어부는 상기 어플리케이션의 서비스 모드를 상기 결정된 선호 서비스 모드로 변경할 수 있다. In one example, the voice input unit may receive a voice command for the application after the preferred service mode is determined, and the control unit may change the service mode of the application to the determined preferred service mode.

일예에 있어서, 상기 리액션 인식부는 상기 사용자의 음성을 인식하여 리액션 음성을 획득하는 음성 인식부; 및 상기 사용자의 영상을 인식하여 리액션 표정 또는 리액션 제스쳐를 획득하는 영상 인식부를 포함할 수 있다. In one example, the reaction recognition unit may include a voice recognition unit for acquiring a reaction voice by recognizing the user's voice; and an image recognition unit that recognizes the user's image and obtains a reaction expression or reaction gesture.

일예에 있어서, 상기 선호도 분석부는 상기 리액션 음성에 포함된 어휘가 상기 각 서비스 모드에 대한 긍정 평가 또는 부정 평가에 해당하는 어휘인지 판단할 수 있다. 상기 선호도 분석부는 상기 리액션 표정 또는 리액션 제스쳐가 상기 각 서비스 모드에 대한 긍정 평가 또는 부정 평가에 해당하는 표정인지 판단할 수 있다. 상기 선호도 분석부는 상기 판단에 기초하여 상기 각 서비스 모드에 대하여 상기 긍정 평가 및 상기 부정 평가에 대한 점수를 누적하고, 상기 누적된 점수에 기초하여 상기 각 서비스 모드의 선호도를 분석할 수 있다. In one example, the preference analysis unit may determine whether a vocabulary included in the reaction voice corresponds to a positive evaluation or a negative evaluation for each service mode. The preference analysis unit may determine whether the reaction expression or reaction gesture corresponds to a positive evaluation or a negative evaluation for each service mode. The preference analysis unit may accumulate scores for the positive evaluation and the negative evaluation for each service mode based on the determination, and analyze the preference for each service mode based on the accumulated score.

일예에 있어서, 상기 어플리케이션의 설치 데이터 및 상기 복수의 서비스 모드에 대한 정보를 서버로부터 수신하는 수신부를 더 포함하고, 상기 복수의 서비스 모드에 대한 정보는 상기 각 서비스 모드의 선호 나이 정보, 선호 성별 정보를 포함할 수 있다. In one example, the method further includes a receiver configured to receive installation data of the application and information about the plurality of service modes from a server, wherein the information about the plurality of service modes includes preferred age information and preferred gender information of each service mode. can include

일예에 있어서, 상기 리액션 인식부는 상기 리액션 음성이 긍정 평가에 해당하는 어휘를 포함할 때 인식된 사용자의 표정을 긍정 평가에 해당하는 표정으로 판단하고, 상기 리액션 음성이 부정 평가에 해당하는 어휘를 포함할 때 인식된 사용자의 표정을 부정 평가에 해당하는 표정으로 판단할 수 있다. In one example, the reaction recognition unit determines a recognized user's facial expression as a facial expression corresponding to a positive evaluation when the reaction voice includes a vocabulary corresponding to a positive evaluation, and the reaction voice includes a vocabulary corresponding to a negative evaluation. When performing the task, the recognized user's facial expression may be determined as a facial expression corresponding to negative evaluation.

상술한 과제 해결 수단은 단지 예시적인 것으로서, 본 발명을 제한하려는 의도로 해석되지 않아야 한다. 상술한 예시적인 실시예 외에도, 도면 및 발명의 상세한 설명에 기재된 추가적인 실시예가 존재할 수 있다.The above-described means for solving the problems is only illustrative and should not be construed as limiting the present invention. In addition to the exemplary embodiments described above, there may be additional embodiments described in the drawings and detailed description.

전술한 본 발명의 과제 해결 수단 중 어느 하나에 의하면, 음성 인식 단말에 설치된 어플리케이션에 대한 음성 명령을 입력받은 경우, 디폴트 서비스 모드를 통해 음성 서비스를 제공하고, 디폴트 서비스 모드의 음성 서비스에 대한 사용자의 리액션을 인식하여 디폴트 서비스 모드에서 다른 서비스 모드로 변경할 수 있다. 또한, 다른 서비스 모드를 통한 음성 서비스 제공 중에 사용자의 리액션을 인식하여 어플리케이션의 복수의 서비스 모드 중 해당 다른 서비스 모드를 제외한 다른 서비스 모드로 변경할 수 있다. 또한, 어플리케이션의 복수의 서비스 모드 각각을 통한 음성 서비스에 대한 사용자의 리액션 및 각 서비스 모드의 사용 이력을 저장할 수 있다. 또한, 어플리케이션에 대한 음성 명령을 입력받게 되면, 각 서비스 모드의 사용 이력 및 각 서비스 모드의 리액션에 기초하여 서비스 모드의 선호도를 분석함으로써 어플리케이션의 복수의 서비스 모드 중 선호 서비스 모드를 결정하고, 선호 서비스 모드를 통해 음성 서비스를 제공할 수 있다.According to any one of the above-described problem solving means of the present invention, when a voice command for an application installed in a voice recognition terminal is input, a voice service is provided through a default service mode, and a user's response to the voice service in the default service mode is provided. By recognizing the reaction, you can change from the default service mode to another service mode. In addition, a user's reaction may be recognized while a voice service is provided through another service mode, and the service mode may be changed to another service mode excluding the corresponding other service mode among a plurality of service modes of the application. In addition, the user's reaction to the voice service through each of the plurality of service modes of the application and the use history of each service mode may be stored. In addition, when a voice command for an application is received, a preferred service mode is determined among a plurality of service modes of the application by analyzing the preference of the service mode based on the use history of each service mode and the reaction of each service mode, and the preferred service A voice service may be provided through a mode.

도 1은 본 발명의 일 실시예에 따른, 맞춤형 서비스 모드 제공 시스템의 구성도이다.
도 2는 본 발명의 일 실시예에 따른, 도 1에 도시된 음성 인식 단말의 블록도이다.
도 3은 본 발명의 일 실시예에 따른, 맞춤형 서비스 모드 제공 방법을 나타낸 동작 흐름도이다.
도 4는 본 발명의 일 실시예에 따른, 디폴트 서비스 모드를 통한 음성 서비스 제공 방법을 나타낸 동작 흐름도이다.
도 5는 본 발명의 일 실시예에 따른, 사용자의 리액션을 인식하는 방법을 나타낸 동작 흐름도이다.
도 6은 본 발명의 일 실시예에 따른, 서비스 모드의 선호도를 분석하는 방법을 나타낸 동작 흐름도이다.
도 7은 본 발명의 다른 실시예에 따른, 서비스 모드의 선호도를 분석하는 방법을 나타낸 동작 흐름도이다.
도 8은 본 발명의 다른 실시예에 따른, 맞춤형 서비스 모드 제공 방법을 나타낸 동작 흐름도이다.
도 9는 본 발명의 다른 실시예에 따른, 맞춤형 서비스 모드 제공 방법을 나타낸 동작 흐름도이다.
도 10은 본 발명의 일 실시예에 따른, 어플리케이션을 등록 및 관리하는 방법을 설명하기 위한 도면이다.
도 11은 본 발명의 일 실시예에 따른, 사용자가 선호하는 어플리케이션의 선호 서비스 모드를 결정하는 방법을 설명하기 위한 도면이다.1 is a configuration diagram of a customized service mode providing system according to an embodiment of the present invention.
2 is a block diagram of a voice recognition terminal shown in FIG. 1 according to an embodiment of the present invention.
3 is an operational flowchart illustrating a method for providing a customized service mode according to an embodiment of the present invention.
4 is an operational flowchart illustrating a method of providing a voice service through a default service mode according to an embodiment of the present invention.
5 is an operational flowchart illustrating a method of recognizing a user's reaction according to an embodiment of the present invention.
6 is an operational flowchart illustrating a method of analyzing a preference of a service mode according to an embodiment of the present invention.
7 is an operational flowchart illustrating a method of analyzing service mode preference according to another embodiment of the present invention.
8 is an operational flowchart illustrating a method for providing a customized service mode according to another embodiment of the present invention.
9 is an operational flowchart illustrating a method for providing a customized service mode according to another embodiment of the present invention.
10 is a diagram for explaining a method of registering and managing an application according to an embodiment of the present invention.
11 is a diagram for explaining a method of determining a preferred service mode of an application preferred by a user according to an embodiment of the present invention.

아래에서는 첨부한 도면을 참조하여 본 발명이 속하는 기술 분야에서 통상의 지식을 가진 자가 용이하게 실시할 수 있도록 본 발명의 실시예를 상세히 설명한다. 그러나 본 발명은 여러 가지 상이한 형태로 구현될 수 있으며 여기에서 설명하는 실시예에 한정되지 않는다. 그리고 도면에서 본 발명을 명확하게 설명하기 위해서 설명과 관계없는 부분은 생략하였으며, 명세서 전체를 통하여 유사한 부분에 대해서는 유사한 도면 부호를 붙였다.Hereinafter, embodiments of the present invention will be described in detail so that those skilled in the art can easily practice the present invention with reference to the accompanying drawings. However, the present invention may be embodied in many different forms and is not limited to the embodiments described herein. And in order to clearly explain the present invention in the drawings, parts irrelevant to the description are omitted, and similar reference numerals are attached to similar parts throughout the specification.

명세서 전체에서, 어떤 부분이 다른 부분과 "연결"되어 있다고 할 때, 이는 "직접적으로 연결"되어 있는 경우뿐 아니라, 그 중간에 다른 소자를 사이에 두고 "전기적으로 연결"되어 있는 경우도 포함한다. 또한 어떤 부분이 어떤 구성요소를 "포함"한다고 할 때, 이는 특별히 반대되는 기재가 없는 한 다른 구성요소를 제외하는 것이 아니라 다른 구성요소를 더 포함할 수 있는 것을 의미한다.Throughout the specification, when a part is said to be "connected" to another part, this includes not only the case where it is "directly connected" but also the case where it is "electrically connected" with another element interposed therebetween. . In addition, when a certain component is said to "include", this means that it may further include other components without excluding other components unless otherwise stated.

본 명세서에 있어서 '부(部)'란, 하드웨어에 의해 실현되는 유닛(unit), 소프트웨어에 의해 실현되는 유닛, 양방을 이용하여 실현되는 유닛을 포함한다. 또한, 1 개의 유닛이 2 개 이상의 하드웨어를 이용하여 실현되어도 되고, 2 개 이상의 유닛이 1 개의 하드웨어에 의해 실현되어도 된다.In this specification, a "unit" includes a unit realized by hardware, a unit realized by software, and a unit realized using both. Further, one unit may be realized using two or more hardware, and two or more units may be realized by one hardware.

본 명세서에 있어서 단말 또는 디바이스가 수행하는 것으로 기술된 동작이나 기능 중 일부는 해당 단말 또는 디바이스와 연결된 서버에서 대신 수행될 수도 있다. 이와 마찬가지로, 서버가 수행하는 것으로 기술된 동작이나 기능 중 일부도 해당 서버와 연결된 단말 또는 디바이스에서 수행될 수도 있다.In this specification, some of the operations or functions described as being performed by a terminal or device may be performed instead by a server connected to the terminal or device. Likewise, some of the operations or functions described as being performed by the server may also be performed in a terminal or device connected to the corresponding server.

이하, 첨부된 구성도 또는 처리 흐름도를 참고하여, 본 발명의 실시를 위한 구체적인 내용을 설명하도록 한다.Hereinafter, specific details for the implementation of the present invention will be described with reference to the accompanying configuration diagram or process flow chart.

도 1은 본 발명의 일 실시예에 따른, 맞춤형 서비스 모드 제공 시스템의 구성도이다.1 is a configuration diagram of a customized service mode providing system according to an embodiment of the present invention.

도 1을 참조하면, 맞춤형 서비스 모드 제공 시스템은 음성 인식 단말(100) 및 서버(110)를 포함할 수 있다. 다만, 이러한 도 1의 맞춤형 서비스 모드 제공 시스템은 본 발명의 일 실시예에 불과하므로 도 1을 통해 본 발명이 한정 해석되는 것은 아니며, 본 발명의 다양한 실시예들에 따라 도 1과 다르게 구성될 수도 있다. Referring to FIG. 1 , a system for providing a customized service mode may include a voice recognition terminal 100 and a server 110 . However, since the customized service mode providing system of FIG. 1 is only one embodiment of the present invention, the present invention is not limitedly interpreted through FIG. 1, and may be configured differently from FIG. 1 according to various embodiments of the present invention. there is.

일반적으로, 도 1의 맞춤형 서비스 모드 제공 시스템의 각 구성요소들은 네트워크(120)를 통해 연결된다. 네트워크는 단말들 및 서버들과 같은 각각의 노드 상호 간에 정보 교환이 가능한 연결 구조를 의미하는 것으로, 이러한 네트워크(network)의 일례에는 3G, 4G, 5G, 3GPP(3rd Generation Partnership Project), LTE(Long Term Evolution), WIMAX(World Interoperability for Microwave Access), 와이파이(Wi-Fi) 등이 포함되나 이에 한정되지는 않는다.In general, each component of the customized service mode providing system of FIG. 1 is connected through a network 120 . A network refers to a connection structure capable of exchanging information between nodes such as terminals and servers, and examples of such networks include 3G, 4G, 5G, 3GPP (3rd Generation Partnership Project), LTE (Long Term Evolution), WIMAX (World Interoperability for Microwave Access), Wi-Fi, etc. are included, but are not limited thereto.

음성 인식 단말(100)은 사물인터넷(IoT) 장치와 연동되어 사용자의 음성 명령에 따라 사물인터넷 장치를 제어할 수 있는 단말일 수 있다. 또한, 음성 인식 단말(100)에는 복수의 음성 서비스 각각에 대응하는 복수의 어플리케이션이 설치되어 있다.The voice recognition terminal 100 may be a terminal capable of interworking with an Internet of Things (IoT) device and controlling the IoT device according to a user's voice command. In addition, a plurality of applications corresponding to each of a plurality of voice services are installed in the voice recognition terminal 100 .

음성 인식 단말(100)에는 사용자의 음성을 인식하는 음성 인식 장치가 탑재되어 있고, 사용자를 촬영하여 사용자의 영상을 인식하는 영상 인식 장치가 탑재되어 있다.The voice recognition terminal 100 includes a voice recognition device for recognizing a user's voice and an image recognition device for photographing a user and recognizing a user's image.

음성 인식 단말(100)은 음성 인식 단말(100)에 설치된 어플리케이션에 대한 음성 명령을 입력받아 해당 어플리케이션의 복수의 서비스 모드 중 사용자가 선호하는 선호 서비스 모드를 결정하여 결정된 선호 서비스 모드로 음성 서비스를 제공할 수 있다.The voice recognition terminal 100 receives a voice command for an application installed in the voice recognition terminal 100, determines a user's preferred service mode among a plurality of service modes of the corresponding application, and provides a voice service in the determined preferred service mode. can do.

구체적으로, 음성 인식 단말(100)은 어플리케이션의 각 서비스 모드를 통한 음성 서비스의 제공 중에 음성 인식 장치 또는 영상 인식 장치를 통해 사용자의 리액션을 인식하고, 각 서비스 모드의 사용 이력 및 각 서비스 모드에 대한 리액션 중 적어도 하나 이상에 기초하여 각 서비스 모드의 선호도를 분석할 수 있다. 음성 인식 단말(100)은 분석된 선호도에 기초하여 복수의 서비스 모드 중 선호 서비스 모드를 결정하고, 결정된 선호 서비스 모드로 음성 서비스를 제공할 수 있다. 여기서, 리액션은 사용자의 음성, 표정 및 제스쳐 중 적어도 하나를 포함할 수 있다.Specifically, the voice recognition terminal 100 recognizes a user's reaction through a voice recognition device or an image recognition device while providing a voice service through each service mode of an application, and provides information on the use history of each service mode and each service mode. Preference of each service mode may be analyzed based on at least one of the reactions. The voice recognition terminal 100 may determine a preferred service mode among a plurality of service modes based on the analyzed preference and provide a voice service in the determined preferred service mode. Here, the reaction may include at least one of a user's voice, facial expression, and gesture.

서버(110)는 음성 인식 단말(100)로부터 요청받은 어플리케이션의 설치 데이터 및 복수의 서비스 모드에 대한 정보를 음성 인식 단말(100)로 전송하고, 신규 서비스 모드에 대한 데이터를 음성 인식 단말(100)에게 제공할 수 있다. 여기서, 복수의 서비스 모드에 대한 정보는 각 서비스 모드의 선호 나이 정보, 선호 성별 정보를 포함할 수 있다.The server 110 transmits application installation data and information on a plurality of service modes requested from the voice recognition terminal 100 to the voice recognition terminal 100, and transmits data on a new service mode to the voice recognition terminal 100. can be provided to Here, information on a plurality of service modes may include preferred age information and preferred gender information of each service mode.

이하에서는 도 1의 맞춤형 서비스 모드 제공 시스템의 각 구성요소의 동작에 대해 보다 구체적으로 설명한다.Hereinafter, the operation of each component of the customized service mode providing system of FIG. 1 will be described in more detail.

도 2는 본 발명의 일 실시예에 따른, 도 1에 도시된 음성 인식 단말(100)의 블록도이다.FIG. 2 is a block diagram of the voice recognition terminal 100 shown in FIG. 1 according to an embodiment of the present invention.

도 2를 참조하면, 음성 인식 단말(100)은 음성 입력부(200), 제어부(210), 리액션 인식부(220), 선호도 분석부(230), 선호도 서비스 모드 결정부(240), 수신부(250) 및 메모리(260)를 포함할 수 있다. 여기서, 리액션 인식부(220)는 음성 인식부(222) 및 영상 인식부(224)를 포함할 수 있다. 다만, 도 2에 도시된 음성 인식 단말(100)은 본 발명의 하나의 구현 예에 불과하며, 도 2에 도시된 구성요소들을 기초로 하여 여러 가지 변형이 가능하다.Referring to FIG. 2 , the voice recognition terminal 100 includes a voice input unit 200, a controller 210, a reaction recognizer 220, a preference analyzer 230, a preference service mode determiner 240, and a receiver 250. ) and a memory 260 . Here, the reaction recognizer 220 may include a voice recognizer 222 and an image recognizer 224 . However, the voice recognition terminal 100 shown in FIG. 2 is only one implementation example of the present invention, and various modifications are possible based on the components shown in FIG. 2 .

수신부(250)는 어플리케이션의 설치 데이터 및 어플리케이션에서 제공하는 복수의 서비스 모드에 대한 정보를 서버(110)로부터 수신할 수 있다. 여기서, 복수의 서비스 모드에 대한 정보는 각 서비스 모드의 선호 나이 정보, 선호 성별 정보를 포함할 수 있다. 예를 들면, 수신부(250)는 어플리케이션이 오디오 어플리케이션인 경우, 복수의 음향 모드(예컨대, 재즈 버전 모드, 락 버전 모드, 클래식 버전 모드, 발라드 버전 모드 등)에 대한 정보를 서버(110)로부터 수신할 수 있다. 수신부(250)는 어플리케이션이 TV 어플리케이션인 경우, 복수의 런처 화면에 관한 서비스 모드에 대한 정보를 서버(110)로부터 수신할 수 있다. 수신부(250)는 어플리케이션이 음성을 통한 정보 제공 서비스 어플리케이션인 경우, 목소리 성별, 목소리의 톤, 발화 속도 각각에 대한 서비스 모드의 정보를 서버(110)로부터 수신할 수 있다.The receiving unit 250 may receive installation data of the application and information about a plurality of service modes provided by the application from the server 110 . Here, information on a plurality of service modes may include preferred age information and preferred gender information of each service mode. For example, when the application is an audio application, the receiving unit 250 receives information about a plurality of sound modes (eg, jazz version mode, rock version mode, classical version mode, ballad version mode, etc.) from the server 110. can do. When the application is a TV application, the receiving unit 250 may receive information about a service mode related to a plurality of launcher screens from the server 110 . When the application is an information providing service application through voice, the receiver 250 may receive information on a service mode for each voice gender, voice tone, and speech speed from the server 110 .

잠시 도 10을 참조하여 어플리케이션을 등록 및 관리하는 방법을 설명하기로 한다.A method of registering and managing applications will be described with reference to FIG. 10 for a moment.

도 10은 본 발명의 일 실시예에 따른, 어플리케이션을 등록 및 관리하는 방법을 설명하기 위한 도면이다.10 is a diagram for explaining a method of registering and managing an application according to an embodiment of the present invention.

도 10을 참조하면, 음성 인식 단말(100)은 서버(110)로부터 어플리케이션과 어플리케이션에서 제공하는 복수의 서비스 모드에 대한 xml 파일(1000)을 수신할 수 있다. 음성 인식 단말(100)의 서비스 프로비져닝(Service Provisioning) 모듈(1010)은 해당 어플리케이션을 설치하고, 설치된 어플리케이션의 복수의 서비스 모드에 대한 정보를 분석할 수 있다. 또한, 서비스 프로비져닝 모듈(1010)은 분석된 각 서비스 모드에 대한 정보를 서비스 앱 모드 셋팅 테이블(1020)과 모드별 인터페이스 규격 테이블(1030)에 등록 및 저장할 수 있다.Referring to FIG. 10 , the voice recognition terminal 100 may receive an xml file 1000 for an application and a plurality of service modes provided by the application from the server 110 . The service provisioning module 1010 of the voice recognition terminal 100 may install a corresponding application and analyze information about a plurality of service modes of the installed application. In addition, the service provisioning module 1010 may register and store information on each analyzed service mode in the service app mode setting table 1020 and the interface specification table 1030 for each mode.

여기서, 서비스 앱 모드 셋팅 테이블(1020)에는 각 서비스 모드 별 선호 나이 정보, 선호 성별 정보가 포함될 수 있다. 어플리케이션이 추가되는 경우 또는 어플리케이션 별로 선택된 서비스 모드에 대한 정보가 변경되는 경우에 모드별 인터페이스 규격 테이블(1030)은 서비스 컨트롤러의 메모리에 업로드되어 관리되며, 이를 통해 음성 인식 단말(100)의 데이터베이스 접근 시간을 줄일 수 있다.Here, the service app mode setting table 1020 may include preferred age information and preferred gender information for each service mode. When an application is added or when information on the service mode selected for each application is changed, the interface specification table 1030 for each mode is uploaded to the memory of the service controller and managed, and through this, the database access time of the voice recognition terminal 100 can reduce

음성 인식 단말(100)은 최초로 어플리케이션을 구동할 시, 디폴트 서비스 모드에 대한 정보와 관련된 인터페이스 규격에 기초하여 어플리케이션을 구동시킬 수 있다. 여기서, 디폴트 서비스 모드는 제 1 서비스 모드일 수 있다. When the voice recognition terminal 100 initially drives an application, it may drive the application based on an interface standard related to information on a default service mode. Here, the default service mode may be the first service mode.

다시 도 2를 참조하면, 음성 입력부(200)는 음성 인식 단말(100)에 설치된 어플리케이션에 대한 음성 명령을 입력받을 수 있다. 여기서, 어플리케이션은 복수의 서비스 모드를 포함하고 있고, 디폴트 서비스 모드가 기본 모드로 셋팅되어 있을 수 있다.Referring back to FIG. 2 , the voice input unit 200 may receive a voice command for an application installed in the voice recognition terminal 100 . Here, the application may include a plurality of service modes, and a default service mode may be set as a basic mode.

제어부(210)는 기설정된 조건 동안 어플리케이션의 서비스 모드를 디폴트 서비스 모드로 설정하고, 기설정된 조건 이후에 어플리케이션의 서비스 모드를 다른 서비스 모드로 변경할 수 있다. 디폴트 서비스 모드는 제 1 서비스 모드이고, 다른 서비스 모드는 제 2 서비스 모드일 수 있다. The controller 210 may set the service mode of the application as the default service mode during a preset condition and change the service mode of the application to another service mode after the preset condition. The default service mode may be the first service mode, and another service mode may be the second service mode.

제어부(210)는 최초로 사용자로부터 어플리케이션에 대한 음성 명령을 입력받은 경우, 기설정된 조건 동안 디폴트 서비스 모드를 통해 음성 서비스를 제공할 수 있다. 예를 들면, 음성 인식 단말(100)이 음성 인식 단말(100)과 연동된 IPTV(미도시)에 대한 VOD 컨텐츠 검색 명령을 입력받은 경우, 제어부(210)는 디폴트 모드인 여성 음성 모드를 통해 VOD 컨텐츠 검색 결과를 안내할 수 있다.When receiving a voice command for an application from a user for the first time, the controller 210 may provide a voice service through a default service mode during a preset condition. For example, when the voice recognition terminal 100 receives a VOD content search command for an IPTV (not shown) interlocked with the voice recognition terminal 100, the controller 210 uses the default mode, the female voice mode, for VOD Content search results may be guided.

제어부(210)는 사용자의 리액션(사용자의 음성, 표정 및 제스쳐 중 적어도 하나)을 인식하기 전까지 디폴트 서비스 모드를 통해 음성 서비스를 제공할 수 있다. 제어부(210)는 기설정된 시간 동안 디폴트 서비스 모드를 통해 음성 서비스를 제공할 수 있다.The controller 210 may provide a voice service through a default service mode until recognizing the user's reaction (at least one of the user's voice, facial expression, and gesture). The controller 210 may provide a voice service through a default service mode for a preset time.

제어부(210)는 음성 인식부(222)에 의한 사용자의 음성 명령에 대한 음성 인식 및 영상 인식부(224)에 의한 사용자에 대한 영상 인식으로부터 사용자의 연령, 성별 및 인원수 중 적어도 하나를 식별하고, 식별된 사용자의 연령, 성별 및 인원수에 기초하여 디폴트 서비스 모드를 결정하고, 디폴트 서비스 모드를 통해 음성 서비스를 제공할 수 있다.The control unit 210 identifies at least one of the user's age, gender, and number of people from the voice recognition of the user's voice command by the voice recognition unit 222 and the image recognition of the user by the image recognition unit 224, A default service mode may be determined based on the age, gender, and number of identified users, and a voice service may be provided through the default service mode.

리액션 인식부(220)는 어플리케이션의 각 서비스 모드를 통한 음성 서비스의 제공 중에, 사용자의 리액션을 인식할 수 있다.The reaction recognition unit 220 may recognize a user's reaction while providing a voice service through each service mode of an application.

음성 인식부(222)는 사용자의 음성을 인식하여 음성으로부터 리액션 음성을 획득할 수 있다. 영상 인식부(224)는 사용자의 영상을 인식하여 영상으로부터 리액션 표정 또는 리액션 제스쳐를 획득할 수 있다. The voice recognition unit 222 may recognize the user's voice and obtain a reaction voice from the voice. The image recognition unit 224 may recognize a user's image and obtain a reaction expression or reaction gesture from the image.

리액션 인식부(220)는 디폴트 서비스 모드를 통한 음성 서비스의 제공 중에 사용자의 리액션 음성을 획득한 경우, 획득된 리액션 음성을 긍정 평가 또는 부정 평가 각각으로 분류하여 메모리(260)에 저장할 수 있다. 예를 들면, 리액션 인식부(220)는 리액션 음성이 긍정 평가에 해당하는 어휘를 포함할 때, 함께 인식된 사용자의 표정을 긍정 평가에 해당하는 표정으로 판단하여 저장할 수 있다. 리액션 인식부(220)는 리액션 음성이 부정 평가에 해당하는 어휘를 포함할 때, 함께 인식된 사용자의 표정을 부정 평가에 해당하는 표정으로 판단하여 저장할 수 있다. 리액션 인식부(220)는 긍정 평가로 저장한 표정 및 부정 평가로 저장한 표정을 다음에 입력된 리액션 표정 또는 리액션 제스쳐가 긍정 평가 또는 부정 평가 중 어느 것에 해당하는 표정인지 분류하는데 이용할 수 있다.When a user's reaction voice is acquired while providing a voice service through a default service mode, the reaction recognition unit 220 may classify the obtained reaction voice into a positive evaluation or a negative evaluation and store it in the memory 260 . For example, when the reaction voice includes a vocabulary corresponding to a positive evaluation, the reaction recognition unit 220 may determine and store the recognized user's facial expression as a facial expression corresponding to the positive evaluation. When the reaction voice includes a vocabulary corresponding to a negative evaluation, the reaction recognition unit 220 may determine and store the recognized user's facial expression as a facial expression corresponding to a negative evaluation. The reaction recognition unit 220 may use facial expressions stored as positive evaluations and facial expressions stored as negative evaluations to classify whether a reaction expression or a reaction gesture input next corresponds to a positive evaluation or a negative evaluation.

리액션 인식부(220)는 디폴트 서비스 모드를 통한 음성 서비스의 제공 중에 사용자의 리액션 표정 또는 리액션 제스쳐를 획득한 경우, 획득된 리액션 표정 또는 리액션 제스쳐가 긍정 평가 또는 부정 평가에 해당하는 표정인지 분류하여 메모리(260)에 저장할 수 있다.When the reaction recognition unit 220 acquires a user's reaction expression or reaction gesture while providing a voice service through the default service mode, the reaction recognition unit 220 classifies whether the acquired reaction expression or reaction gesture corresponds to a positive evaluation or a negative evaluation, and stores the memory. (260).

제어부(210)는 기설정된 조건이 충족되면, 디폴트 서비스 모드를 제 1 서비스 모드로 변경할 수 있다. The controller 210 may change the default service mode to the first service mode when a preset condition is satisfied.

예를 들면, 제어부(210)는 기설정된 시간이 경과되면, 디폴트 서비스 모드를 다른 서비스 모드로 변경할 수 있다. 예를 들면, 제어부(210)는 서비스의 이용 시간이 20분 경과할 때마다 서비스 모드를 변경할 수 있다.For example, the controller 210 may change the default service mode to another service mode when a predetermined time elapses. For example, the controller 210 may change the service mode whenever 20 minutes of service use time elapse.

다른 예를 들어, 제어부(210)는 사용자의 리액션 입력 횟수에 기초하여 디폴트 서비스 모드를 제 1 서비스 모드로 변경할 수 있다. 예를 들면, 제어부(210)는 사용자의 리액션 입력 횟수가 5회 입력될 때마다 서비스 모드를 변경할 수 있다.For another example, the controller 210 may change the default service mode to the first service mode based on the number of times the user inputs a reaction. For example, the controller 210 may change the service mode every time the user's reaction is input 5 times.

제어부(210)는 부정 평가에 해당하는 표정이 기설정된 횟수 이상 입력된 경우, 디폴트 서비스 모드를 제 1 서비스 모드로 변경할 수 있다. 또는, 제어부(210)는 사용자가 부정 평가에 해당하는 어휘를 발화한 경우, 디폴트 서비스 모드를 다른 서비스 모드로 변경할 수 있다. 예를 들면, IPTV에서의 VOD 컨텐츠 검색 결과에 대하여 여성 음성 모드로 검색 결과를 안내한 후, 부정 평가에 해당하는 표정을 짓고 있는 경우, 여성 음성 모드를 남성 음성 모드로 변경할 수 있다.The controller 210 may change the default service mode to the first service mode when expressions corresponding to negative evaluations are input more than a preset number of times. Alternatively, the controller 210 may change the default service mode to another service mode when the user utters a vocabulary corresponding to negative evaluation. For example, when a search result for a VOD content search in IPTV is provided in a female voice mode, and a facial expression corresponding to a negative evaluation is made, the female voice mode may be changed to a male voice mode.

제어부(210)는 사용자로부터 제 1 서비스 모드로 변경하는 요청이 입력된 경우, 디폴트 서비스 모드를 제 1 서비스 모드로 변경할 수 있다. The controller 210 may change the default service mode to the first service mode when a request to change to the first service mode is input from the user.

리액션 인식부(220)는 제 1 서비스 모드를 통한 음성 서비스의 제공 중에 사용자의 리액션을 인식할 수 있다. 음성 인식부(222)는 제 1 서비스 모드를 통한 음성 서비스의 제공 중에, 사용자의 음성을 인식하여 리액션 음성을 획득할 수 있다. 리액션 인식부(220)는 제 1 서비스 모드를 통한 음성 서비스의 제공 중에 사용자의 리액션 음성을 획득한 경우, 획득된 리액션 음성을 긍정 평가 또는 부정 평가 각각으로 분류하여 메모리(260)에 저장할 수 있다.The reaction recognition unit 220 may recognize a user's reaction while providing a voice service through the first service mode. The voice recognition unit 222 may acquire a reaction voice by recognizing the user's voice while providing the voice service through the first service mode. When a user's reaction voice is obtained while providing a voice service through the first service mode, the reaction recognition unit 220 may classify the obtained reaction voice into a positive evaluation or a negative evaluation and store it in the memory 260 .

리액션 인식부(220)는 제 1 서비스 모드를 통한 음성 서비스의 제공 중에 사용자의 리액션 표정 또는 리액션 제스쳐를 획득한 경우, 획득된 리액션 표정 또는 리액션 제스쳐가 긍정 평가 또는 부정 평가에 해당하는 표정인지 분류하여 메모리(260)에 저장할 수 있다.When the reaction recognition unit 220 acquires a user's reaction expression or reaction gesture while providing a voice service through the first service mode, the reaction recognition unit 220 classifies whether the obtained reaction expression or reaction gesture corresponds to a positive evaluation or a negative evaluation, may be stored in the memory 260 .

선호도 분석부(230)는 리액션 음성에 포함된 어휘가 각 서비스 모드에 대한 긍정 평가 또는 부정 평가에 해당하는 어휘인지 판단하고, 리액션 표정 또는 리액션 제스쳐가 각 서비스 모드에 대한 긍정 평가 또는 부정 평가에 해당하는 표정 또는 제스처인지 판단한 후, 각 서비스 모드에 대한 긍정 평가 및 부정 평가에 대한 점수를 누적할 수 있다.The preference analysis unit 230 determines whether the vocabulary included in the reaction voice corresponds to a positive evaluation or negative evaluation for each service mode, and the reaction expression or reaction gesture corresponds to a positive evaluation or negative evaluation for each service mode. After determining whether it is a facial expression or gesture, it is possible to accumulate scores for positive and negative evaluations for each service mode.

선호도 분석부(230)는 제 1 서비스 모드를 통한 음성 서비스의 제공 중에 획득된 리액션 음성에 포함된 어휘가, 제 1 서비스 모드에 대한 긍정 평가이면 긍정 평가에 대한 점수를 누적하고, 제 1 서비스 모드에 대한 부정 평가이면 부정 평가에 대한 점수를 누적할 수 있다.The preference analyzer 230 accumulates scores for positive evaluations when the vocabulary included in the reaction voice acquired while providing the voice service through the first service mode is positively evaluated for the first service mode, and If it is a negative evaluation for , scores for negative evaluations can be accumulated.

선호도 분석부(230)는 제 1 서비스 모드를 통한 음성 서비스의 제공 중에 획득된 리액션 표정 또는 리액션 제스쳐가 제 1 서비스 모드에 대한 긍정 평가인 경우, 긍정 평가에 대한 점수를 누적하고, 제 1 서비스 모드에 대한 부정 평가인 경우, 부정 평가에 대한 점수를 누적할 수 있다.The preference analyzer 230 accumulates scores for the positive evaluation when the reaction expression or reaction gesture obtained during the provision of the voice service through the first service mode is a positive evaluation for the first service mode, and In the case of a negative evaluation for , scores for the negative evaluation may be accumulated.

선호도 분석부(230)는 각 서비스 모드의 사용 이력 및 각 서비스 모드에 대한 리액션 중 적어도 하나 이상에 기초하여 각 서비스 모드의 선호도를 분석할 수 있다. 선호도 분석부(230)는 각 서비스 모드에 대하여 누적된 점수에 기초하여 각 서비스 모드의 선호도를 분석할 수 있다.The preference analysis unit 230 may analyze the preference of each service mode based on at least one of a use history of each service mode and a reaction to each service mode. The preference analyzer 230 may analyze the preference of each service mode based on the accumulated scores for each service mode.

선호도 서비스 모드 결정부(240)는 분석된 선호도에 기초하여 복수의 서비스 모드 중 선호 서비스 모드를 결정할 수 있다.The preferred service mode determination unit 240 may determine a preferred service mode among a plurality of service modes based on the analyzed preference.

잠시 도 11을 참조하여 어플리케이션의 선호 서비스 모드를 결정하는 방법을 설명하기로 한다. 도 11은 본 발명의 일 실시예에 따른, 사용자가 선호하는 어플리케이션의 선호 서비스 모드를 결정하는 방법을 설명하기 위한 도면이다. A method of determining the preferred service mode of an application will be described with reference to FIG. 11 for a moment. 11 is a diagram for explaining a method of determining a preferred service mode of an application preferred by a user according to an embodiment of the present invention.

도 11을 참조하면, 음성 인식 단말(100)은 사용자로부터 음성 명령을 통해 어플리케이션의 특정 서비스 모드의 실행 요청을 수신한 경우, 어플리케이션의 디폴트 서비스 모드에 대한 정보를 확인하고, 디폴트 서비스 모드로의 동작을 위한 인터페이스 규격 정보에 기초하여 특정 서비스 모드를 호출할 수 있다. 디폴트 서비스 모드 구동 시, 음성 인식 단말(100)은 사용자의 리액션(음성, 표정 및 제스쳐 중 하나)을 분석하여 분석된 리액션을 리액션 테이블(1100)에 저장하고, 리액션 상태(1102)와 히트수(1104)에 따라 점수(1106)를 누적할 수 있다. Referring to FIG. 11 , when the voice recognition terminal 100 receives a request for execution of a specific service mode of an application from a user through a voice command, the voice recognition terminal 100 checks information on the default service mode of the application and operates in the default service mode. A specific service mode may be called based on interface standard information for When the default service mode is running, the voice recognition terminal 100 analyzes the user's reaction (one of voice, facial expression, and gesture), stores the analyzed reaction in the reaction table 1100, and displays the reaction state 1102 and the number of hits ( Scores 1106 may be accumulated according to 1104).

음성 인식 단말(100)은 리액션 테이블(1100)을 감시하는 와치 도그(Watch Dog) 모듈(1110)을 포함하고, 와치 도그 모듈(1110)에 의해 특정 서비스 모드의 점수가 변경되면 서비스 앱 모드 셋팅 테이블(1020)의 결정 상태(1108)를 변경하여 다음 서비스 구동 요청 때 결정된 서비스 모드로 구동되도록 할 수 있다. The voice recognition terminal 100 includes a watch dog module 1110 that monitors the reaction table 1100, and when the score of a specific service mode is changed by the watch dog module 1110, the service app mode setting table The decision state 1108 of 1020 may be changed so that the service mode is driven in the determined service mode at the next service driving request.

다시 도 2를 참조하면, 음성 입력부(200)는 선호 서비스 모드로 결정된 이후, 어플리케이션이 비활성화된 상태에서 사용자로부터 어플리케이션에 대한 음성 명령을 입력받을 수 있다. 이 경우, 제어부(210)는 어플리케이션의 서비스 모드를 결정된 선호 서비스 모드를 통해 음성 서비스를 제공할 수 있다.Referring back to FIG. 2 , after the preferred service mode is determined, the voice input unit 200 may receive a voice command for the application from the user in a state in which the application is deactivated. In this case, the controller 210 may provide a voice service through the preferred service mode determined as the service mode of the application.

리액션 인식부(220)는 결정된 선호 서비스 모드를 통한 음성 서비스의 제공 중에, 사용자의 리액션을 인식할 수 있다.The reaction recognition unit 220 may recognize a user's reaction while providing a voice service through the determined preferred service mode.

제어부(210)는 인식된 사용자의 리액션이 부정 평가에 해당하는 경우, 선호 서비스 모드를 다른 서비스 모드로 변경할 수 있다.When the recognized user's reaction corresponds to a negative evaluation, the controller 210 may change the preferred service mode to another service mode.

제어부(210)는 서버(110)로부터 수신한 신규 서비스 모드에 대한 데이터가 존재하는 경우, 선호 서비스 모드를 일정 기간 동안 신규 서비스 모드로 변경할 수도 있다.When data on the new service mode received from the server 110 exists, the controller 210 may change the preferred service mode to the new service mode for a certain period of time.

메모리(260)는 음성 인식 단말(100) 내부의 각 구성요소들 간에 입력 및 출력되는 데이터를 저장하고, 음성 인식 단말(100)와 음성 인식 단말(100) 외부의 구성요소들 간에 입력 및 출력되는 데이터를 저장한다. The memory 260 stores data input and output between components inside the voice recognition terminal 100, and is input and output between the voice recognition terminal 100 and components outside the voice recognition terminal 100. save the data

한편, 당업자라면, 음성 입력부(200), 제어부(210), 리액션 인식부(220), 음성 인식부(222), 영상 인식부(224), 선호도 분석부(230), 선호도 서비스 모드 결정부(240), 수신부(250) 및 메모리(260) 각각이 분리되어 구현되거나, 이 중 하나 이상이 통합되어 구현될 수 있음을 충분히 이해할 것이다. On the other hand, those skilled in the art, voice input unit 200, control unit 210, reaction recognition unit 220, voice recognition unit 222, image recognition unit 224, preference analysis unit 230, preference service mode determination unit ( 240), the receiver 250, and the memory 260 may be separately implemented, or one or more of them may be integrated and implemented.

도 3은 본 발명의 일 실시예에 따른, 맞춤형 서비스 모드 제공 방법을 나타낸 동작 흐름도이다.3 is an operational flowchart illustrating a method for providing a customized service mode according to an embodiment of the present invention.

도 3에 도시된 실시예에 따른 맞춤형 서비스 모드 제공 방법은 도 1 내지 도 2에 도시된 실시예에 음성 인식 단말(100) 및 서버(110)에서 시계열적으로 처리되는 단계들을 포함한다. 따라서, 이하 생략된 내용이라고 하더라도 도 1 내지 도 2의 음성 인식 단말(100) 및 서버(110)에 관하여 기술된 내용은 도 3에 도시된 실시예에 따른 맞춤형 서비스 모드 제공 방법에도 적용될 수 있다.The method for providing a customized service mode according to the embodiment shown in FIG. 3 includes steps sequentially processed by the voice recognition terminal 100 and the server 110 in the embodiment shown in FIGS. 1 and 2 . Therefore, even if omitted below, the description of the voice recognition terminal 100 and server 110 of FIGS. 1 and 2 can be applied to the method for providing a customized service mode according to the embodiment shown in FIG. 3 .

도 3을 참조하면, 단계 S301에서 음성 인식 단말(100)은 사용자로부터 음성 인식 단말(100)에 설치된 어플리케이션에 대한 음성 명령을 입력받을 수 있다. Referring to FIG. 3 , in step S301, the voice recognition terminal 100 may receive a voice command for an application installed in the voice recognition terminal 100 from the user.

단계 S303에서 음성 인식 단말(100)은 어플리케이션의 복수의 서비스 모드 중 디폴트 서비스 모드를 통해 음성 서비스를 제공할 수 있다.In step S303, the voice recognition terminal 100 may provide a voice service through a default service mode among a plurality of service modes of an application.

단계 S305에서 음성 인식 단말(100)은 디폴트 서비스 모드를 통한 음성 서비스의 제공 중에, 사용자의 리액션을 인식할 수 있다. 여기서, 리액션은 사용자의 음성, 표정 및 제스쳐 중 적어도 하나일 수 있다.In step S305, the voice recognition terminal 100 may recognize a user's reaction while providing a voice service through a default service mode. Here, the reaction may be at least one of a user's voice, facial expression, and gesture.

단계 S307에서 음성 인식 단말(100)은 디폴트 서비스 모드를 다른 서비스 모드로 변경할 수 있다.In step S307, the voice recognition terminal 100 may change the default service mode to another service mode.

단계 S309에서 음성 인식 단말(100)은 다른 서비스 모드를 통한 음성 서비스의 제공 중에, 사용자의 리액션을 인식할 수 있다. 여기서, 리액션은 사용자의 음성, 표정 및 제스쳐 중 적어도 하나일 수 있다.In step S309, the voice recognition terminal 100 may recognize a user's reaction while providing a voice service through another service mode. Here, the reaction may be at least one of a user's voice, facial expression, and gesture.

단계 S311에서 음성 인식 단말(100)은 각 서비스 모드의 사용 이력 및 각 서비스 모드에 대한 리액션 중 하나 이상에 기초하여 각 서비스 모드의 선호도를 분석할 수 있다.In step S311, the voice recognition terminal 100 may analyze the preference of each service mode based on at least one of a use history of each service mode and a reaction to each service mode.

단계 S313에서 음성 인식 단말(100)은 분석된 선호도에 기초하여 복수의 서비스 모드 중 선호 서비스 모드를 결정할 수 있다.In step S313, the voice recognition terminal 100 may determine a preferred service mode among a plurality of service modes based on the analyzed preference.

도 3에는 도시되지 않았으나 단계 S307에서 음성 인식 단말(100)은 다른 서비스 모드로 변경하기 위해 기설정된 조건이 충족되었는지 여부를 판단할 수 있다. 여기서, 기설정된 조건은 기설정된 시간의 경과 또는 리액션의 입력 횟수일 수 있다. 음성 인식 단말(100)은 다른 서비스 모드로 변경하기 위해 기설정된 조건이 충족된 경우, 디폴트 서비스 모드를 다른 서비스 모드로 변경할 수 있다. Although not shown in FIG. 3 , in step S307 , the voice recognition terminal 100 may determine whether or not a preset condition for changing to another service mode is met. Here, the preset condition may be the lapse of a preset time or the number of inputs of a reaction. The voice recognition terminal 100 may change the default service mode to another service mode when a preset condition for changing to another service mode is met.

도 3에는 도시되지 않았으나 단계 S307에서 음성 인식 단말(100)은 다른 서비스 모드로 변경하는 요청이 입력되었는지를 판단할 수 있다. 음성 인식 단말(100)은 사용자로부터 다른 서비스 모드로의 변경 요청이 수신되면, 디폴트 서비스 모드를 다른 서비스 모드로 변경할 수 있다.Although not shown in FIG. 3 , in step S307 , the voice recognition terminal 100 may determine whether a request to change to another service mode is input. The voice recognition terminal 100 may change the default service mode to another service mode when a request to change to another service mode is received from the user.

도 3에는 도시되지 않았으나, S313 이후에 음성 인식 단말(100)은 서버(100)로부터 어플리케이션의 설치 데이터 및 복수의 서비스 모드에 대한 정보를 수신하고, 어플리케이션을 설치할 수 있다. 여기서, 복수의 서비스 모드에 대한 정보는 각 서비스 모드의 선호 나이 정보, 선호 성별 정보를 포함할 수 있다. Although not shown in FIG. 3 , after S313 , the voice recognition terminal 100 may receive application installation data and information about a plurality of service modes from the server 100 and install the application. Here, information on a plurality of service modes may include preferred age information and preferred gender information of each service mode.

상술한 설명에서, 단계 S301 내지 S313은 본 발명의 구현예에 따라서, 추가적인 단계들로 더 분할되거나, 더 적은 단계들로 조합될 수 있다. 또한, 일부 단계는 필요에 따라 생략될 수도 있고, 단계 간의 순서가 변경될 수도 있다. In the above description, steps S301 to S313 may be further divided into additional steps or combined into fewer steps, depending on the implementation of the present invention. Also, some steps may be omitted if necessary, and the order of steps may be changed.

도 4는 본 발명의 일 실시예에 따른, 디폴트 서비스 모드를 통한 음성 서비스 제공 방법을 나타낸 동작 흐름도이다.4 is an operational flowchart illustrating a method of providing a voice service through a default service mode according to an embodiment of the present invention.

도 4에 도시된 실시예에 따른 디폴트 서비스 모드를 통한 음성 서비스 제공 방법은 도 1 내지 도 3에 도시된 실시예에 음성 인식 단말(100)에서 시계열적으로 처리되는 단계들을 포함한다. 따라서, 이하 생략된 내용이라고 하더라도 도 1 내지 도 3의 음성 인식 단말(100) 및 서버(110)에 관하여 기술된 내용은 도 4에 도시된 실시예에 따른 디폴트 서비스 모드를 통한 음성 서비스 제공 방법에도 적용될 수 있다.The method for providing a voice service through the default service mode according to the embodiment shown in FIG. 4 includes steps processed time-sequentially by the voice recognition terminal 100 in the embodiment shown in FIGS. 1 to 3 . Therefore, even if the content is omitted below, the description of the voice recognition terminal 100 and the server 110 of FIGS. 1 to 3 is also applied to the method of providing a voice service through the default service mode according to the embodiment shown in FIG. 4. can be applied

도 4를 참조하면, 단계 S401에서 음성 인식 단말(100)은 사용자의 음성 명령에 대한 음성 인식 및 사용자에 대한 영상 인식 중 적어도 하나를 수행하여 사용자의 연령, 성별 및 인원수 중 적어도 하나를 식별할 수 있다.Referring to FIG. 4 , in step S401, the voice recognition terminal 100 may perform at least one of voice recognition of a user's voice command and image recognition of the user to identify at least one of the user's age, gender, and number of people. there is.

단계 S403에서 음성 인식 단말(100)은 식별된 사용자의 연령, 성별 및 인원수에 기초하여 디폴트 서비스 모드를 결정할 수 있다.In step S403, the voice recognition terminal 100 may determine a default service mode based on the age, gender, and number of identified users.

상술한 설명에서, 단계 S401 내지 S403은 본 발명의 구현예에 따라서, 추가적인 단계들로 더 분할되거나, 더 적은 단계들로 조합될 수 있다. 또한, 일부 단계는 필요에 따라 생략될 수도 있고, 단계 간의 순서가 변경될 수도 있다. In the above description, steps S401 to S403 may be further divided into additional steps or combined into fewer steps, depending on the implementation of the present invention. Also, some steps may be omitted if necessary, and the order of steps may be changed.

도 5는 본 발명의 일 실시예에 따른, 사용자의 리액션을 인식하는 방법을 나타낸 동작 흐름도이다.5 is an operational flowchart illustrating a method of recognizing a user's reaction according to an embodiment of the present invention.

도 5에 도시된 실시예에 따른 사용자의 리액션을 인식하는 방법은 도 1 내지 도 4에 도시된 실시예에 음성 인식 단말(100)에서 시계열적으로 처리되는 단계들을 포함한다. 따라서, 이하 생략된 내용이라고 하더라도 도 1 내지 도 4의 음성 인식 단말(100) 및 서버(110)에 관하여 기술된 내용은 도 5에 도시된 실시예에 따른 사용자의 리액션을 인식하는 방법에도 적용될 수 있다.The method for recognizing a user's reaction according to the embodiment shown in FIG. 5 includes steps processed time-sequentially by the voice recognition terminal 100 in the embodiment shown in FIGS. 1 to 4 . Therefore, even if the content is omitted below, the description of the voice recognition terminal 100 and the server 110 of FIGS. 1 to 4 can be applied to the method for recognizing a user's reaction according to the embodiment shown in FIG. 5. there is.

도 5를 참조하면, 단계 S501에서 음성 인식 단말(100)은 디폴트 서비스 모드 또는 다른 서비스 모드를 통한 음성 서비스의 제공 중에, 사용자의 음성을 인식하여 리액션 음성을 획득할 수 있다.Referring to FIG. 5 , in step S501 , the voice recognition terminal 100 may acquire a reaction voice by recognizing a user's voice while providing a voice service through a default service mode or another service mode.

단계 S503에서 음성 인식 단말(100)은 획득된 리액션 음성이 긍정 평가에 해당하는 어휘를 포함할 때 인식된 사용자의 표정을 긍정 평가에 해당하는 표정으로 저장할 수 있다.In step S503, the voice recognition terminal 100 may store the recognized facial expression of the user as a facial expression corresponding to the positive evaluation when the acquired reaction voice includes a vocabulary corresponding to the positive evaluation.

단계 S505에서 음성 인식 단말(100)은 리액션 음성이 부정 평가에 해당하는 어휘를 포함할 때 인식된 사용자의 표정을 부정 평가에 해당하는 표정으로 저장할 수 있다.In step S505, the voice recognition terminal 100 may store the recognized facial expression of the user as a facial expression corresponding to the negative evaluation when the reaction voice includes a vocabulary corresponding to the negative evaluation.

상술한 설명에서, 단계 S501 내지 S505는 본 발명의 구현예에 따라서, 추가적인 단계들로 더 분할되거나, 더 적은 단계들로 조합될 수 있다. 또한, 일부 단계는 필요에 따라 생략될 수도 있고, 단계 간의 순서가 변경될 수도 있다. In the above description, steps S501 to S505 may be further divided into additional steps or combined into fewer steps, depending on the implementation of the present invention. Also, some steps may be omitted if necessary, and the order of steps may be changed.

도 6은 본 발명의 일 실시예에 따른, 서비스 모드의 선호도를 분석하는 방법을 나타낸 동작 흐름도이다.6 is an operational flowchart illustrating a method of analyzing a preference of a service mode according to an embodiment of the present invention.

도 6에 도시된 실시예에 따른 서비스 모드의 선호도 분석 방법은 도 1 내지 도 5에 도시된 실시예에 음성 인식 단말(100)에서 시계열적으로 처리되는 단계들을 포함한다. 따라서, 이하 생략된 내용이라고 하더라도 도 1 내지 도 5의 음성 인식 단말(100) 및 서버(110)에 관하여 기술된 내용은 도 6에 도시된 실시예에 따른 서비스 모드의 선호도 분석 방법에도 적용될 수 있다.The service mode preference analysis method according to the embodiment shown in FIG. 6 includes steps processed time-sequentially in the voice recognition terminal 100 in the embodiment shown in FIGS. 1 to 5 . Therefore, even if the content is omitted below, the description of the voice recognition terminal 100 and the server 110 of FIGS. 1 to 5 can be applied to the service mode preference analysis method according to the embodiment shown in FIG. 6. .

도 6을 참조하면, 단계 S601에서 음성 인식 단말(100)은 사용자의 음성을 인식하여 리액션 음성을 획득할 수 있다.Referring to FIG. 6 , in step S601, the voice recognition terminal 100 may acquire a reaction voice by recognizing the user's voice.

단계 S603에서 음성 인식 단말(100)은 획득된 리액션 음성에 포함된 어휘가 각 서비스 모드에 대한 긍정 평가 또는 부정 평가에 해당하는 어휘인지 판단할 수 있다.In step S603, the voice recognition terminal 100 may determine whether a vocabulary included in the obtained reaction voice corresponds to a positive evaluation or a negative evaluation for each service mode.

단계 S605에서 음성 인식 단말(100)은 단계 S603에서의 판단에 기초하여 각 서비스 모드에 대하여 긍정 평가 및 부정 평가에 대한 점수를 누적할 수 있다.In step S605, the voice recognition terminal 100 may accumulate scores for positive evaluation and negative evaluation for each service mode based on the determination in step S603.

단계 S607에서 음성 인식 단말(100)은 누적된 점수에 기초하여 각 서비스 모드의 선호도를 분석할 수 있다.In step S607, the voice recognition terminal 100 may analyze the preference of each service mode based on the accumulated scores.

상술한 설명에서, 단계 S601 내지 S607은 본 발명의 구현예에 따라서, 추가적인 단계들로 더 분할되거나, 더 적은 단계들로 조합될 수 있다. 또한, 일부 단계는 필요에 따라 생략될 수도 있고, 단계 간의 순서가 변경될 수도 있다.In the foregoing description, steps S601 to S607 may be further divided into additional steps or combined into fewer steps, depending on an embodiment of the present invention. Also, some steps may be omitted if necessary, and the order of steps may be changed.

도 7은 본 발명의 다른 실시예에 따른, 서비스 모드의 선호도를 분석하는 방법을 나타낸 동작 흐름도이다.7 is an operational flowchart illustrating a method of analyzing service mode preference according to another embodiment of the present invention.

도 7에 도시된 실시예에 따른 서비스 모드의 선호도 분석 방법은 도 1 내지 도 6에 도시된 실시예에 음성 인식 단말(100)에서 시계열적으로 처리되는 단계들을 포함한다. 따라서, 이하 생략된 내용이라고 하더라도 도 1 내지 도 6의 음성 인식 단말(100) 및 서버(110)에 관하여 기술된 내용은 도 7에 도시된 실시예에 따른 서비스 모드의 선호도 분석 방법에도 적용될 수 있다.The service mode preference analysis method according to the embodiment shown in FIG. 7 includes steps processed time-sequentially in the voice recognition terminal 100 in the embodiment shown in FIGS. 1 to 6 . Therefore, even if omitted below, the description of the voice recognition terminal 100 and the server 110 of FIGS. 1 to 6 can be applied to the service mode preference analysis method according to the embodiment shown in FIG. 7 . .

도 7을 참조하면, 단계 S701에서 음성 인식 단말(100)은 사용자의 영상을 인식하여 리액션 표정 또는 리액션 제스쳐를 획득할 수 있다.Referring to FIG. 7 , in step S701, the voice recognition terminal 100 may acquire a reaction expression or a reaction gesture by recognizing a user's image.

단계 S703에서 음성 인식 단말(100)은 획득된 리액션 표정 또는 리액션 제스쳐가 각 서비스 모드에 대한 긍정 평가 또는 부정 평가에 해당하는 표정인지 판단할 수 있다.In step S703, the voice recognition terminal 100 may determine whether the acquired reaction expression or reaction gesture corresponds to a positive evaluation or a negative evaluation for each service mode.

단계 S705에서 음성 인식 단말(100)은 단계 S703에서의 판단에 기초하여 각 서비스 모드에 대하여 긍정 평가 및 부정 평가에 대한 점수를 누적할 수 있다.In step S705, the voice recognition terminal 100 may accumulate scores for positive evaluation and negative evaluation for each service mode based on the determination in step S703.

단계 S707에서 음성 인식 단말(100)은 누적된 점수에 기초하여 각 서비스 모드의 선호도를 분석할 수 있다.In step S707, the voice recognition terminal 100 may analyze the preference of each service mode based on the accumulated scores.

상술한 설명에서, 단계 S701 내지 S707은 본 발명의 구현예에 따라서, 추가적인 단계들로 더 분할되거나, 더 적은 단계들로 조합될 수 있다. 또한, 일부 단계는 필요에 따라 생략될 수도 있고, 단계 간의 순서가 변경될 수도 있다. In the foregoing description, steps S701 to S707 may be further divided into additional steps or combined into fewer steps, depending on an embodiment of the present invention. Also, some steps may be omitted if necessary, and the order of steps may be changed.

도 8은 본 발명의 다른 실시예에 따른, 맞춤형 서비스 모드 제공 방법을 나타낸 동작 흐름도이다.8 is an operational flowchart illustrating a method for providing a customized service mode according to another embodiment of the present invention.

도 8에 도시된 실시예에 따른 맞춤형 서비스 모드 제공 방법은 도 1 내지 도 7에 도시된 실시예에 음성 인식 단말(100)에서 시계열적으로 처리되는 단계들을 포함한다. 따라서, 이하 생략된 내용이라고 하더라도 도 1 내지 도 7의 음성 인식 단말(100) 및 서버(110)에 관하여 기술된 내용은 도 8에 도시된 실시예에 따른 맞춤형 서비스 모드 제공 방법에도 적용될 수 있다.The method for providing a customized service mode according to the embodiment shown in FIG. 8 includes steps sequentially processed by the voice recognition terminal 100 in the embodiment shown in FIGS. 1 to 7 . Therefore, even if the content is omitted below, the description of the voice recognition terminal 100 and the server 110 of FIGS. 1 to 7 can be applied to the method for providing a customized service mode according to the embodiment shown in FIG. 8 .

도 8을 참조하면, 단계 S801에서 음성 인식 단말(100)은 사용자로부터 어플리케이션에 대한 음성 명령을 입력받을 수 있다.Referring to FIG. 8 , in step S801, the voice recognition terminal 100 may receive a voice command for an application from a user.

단계 S803에서 음성 인식 단말(100)은 각 서비스 모드의 선호도 분석을 통해 기결정된 선호 서비스 모드로 음성 서비스를 제공할 수 있다.In step S803, the voice recognition terminal 100 may provide a voice service in a preferred service mode predetermined through preference analysis of each service mode.

단계 S805에서 음성 인식 단말(100)은 기결정된 선호 서비스 모드를 통한 음성 서비스의 제공 중에, 사용자의 리액션을 인식할 수 있다. 여기서, 리액션은 사용자의 음성, 표정 및 제스쳐 중 적어도 하나일 수 있다.In step S805, the voice recognition terminal 100 may recognize a user's reaction while providing a voice service through a predetermined preferred service mode. Here, the reaction may be at least one of a user's voice, facial expression, and gesture.

단계 S807에서 음성 인식 단말(100)은 인식된 리액션이 부정 평가에 해당하는 경우, 선호 서비스 모드를 변경할 수 있다.In step S807, the voice recognition terminal 100 may change the preferred service mode when the recognized reaction corresponds to a negative evaluation.

상술한 설명에서, 단계 S801 내지 S807은 본 발명의 구현예에 따라서, 추가적인 단계들로 더 분할되거나, 더 적은 단계들로 조합될 수 있다. 또한, 일부 단계는 필요에 따라 생략될 수도 있고, 단계 간의 순서가 변경될 수도 있다.In the above description, steps S801 to S807 may be further divided into additional steps or combined into fewer steps, depending on an embodiment of the present invention. Also, some steps may be omitted if necessary, and the order of steps may be changed.

도 9는 본 발명의 다른 실시예에 따른, 맞춤형 서비스 모드 제공 방법을 나타낸 동작 흐름도이다.9 is an operational flowchart illustrating a method for providing a customized service mode according to another embodiment of the present invention.

도 9에 도시된 실시예에 따른 맞춤형 서비스 모드 제공 방법은 도 1 내지 도 8에 도시된 실시예에 음성 인식 단말(100) 및 서버(110)에서 시계열적으로 처리되는 단계들을 포함한다. 따라서, 이하 생략된 내용이라고 하더라도 도 1 내지 도 8의 음성 인식 단말(100) 및 서버(110)에 관하여 기술된 내용은 도 9에 도시된 실시예에 따른 맞춤형 서비스 모드 제공 방법에도 적용될 수 있다.The method for providing a customized service mode according to the embodiment shown in FIG. 9 includes steps sequentially processed by the voice recognition terminal 100 and the server 110 in the embodiment shown in FIGS. 1 to 8 . Therefore, even if omitted below, the description of the voice recognition terminal 100 and server 110 of FIGS. 1 to 8 can be applied to the method for providing a customized service mode according to the embodiment shown in FIG. 9 .

도 9를 참조하면, 단계 S901에서 음성 인식 단말(100)은 사용자로부터 어플리케이션에 대한 음성 명령을 입력받을 수 있다.Referring to FIG. 9 , in step S901, the voice recognition terminal 100 may receive a voice command for an application from a user.

단계 S903에서 음성 인식 단말(100)은 각 서비스 모드의 선호도 분석을 통해 기결정된 선호 서비스 모드로 음성 서비스를 제공할 수 있다. In step S903, the voice recognition terminal 100 may provide a voice service in a predetermined preferred service mode through preference analysis of each service mode.

단계 S905에서 음성 인식 단말(100)은 서버(110)로부터 신규 서비스 모드에 대한 데이터를 수신할 수 있다.In step S905, the voice recognition terminal 100 may receive data on a new service mode from the server 110.

단계 S907에서 음성 인식 단말(100)은 기결정된 선호 서비스 모드를 신규 서비스 모드로 변경할 수 있다.In step S907, the voice recognition terminal 100 may change the predetermined preferred service mode to a new service mode.

상술한 설명에서, 단계 S901 내지 S907은 본 발명의 구현예에 따라서, 추가적인 단계들로 더 분할되거나, 더 적은 단계들로 조합될 수 있다. 또한, 일부 단계는 필요에 따라 생략될 수도 있고, 단계 간의 순서가 변경될 수도 있다. In the above description, steps S901 to S907 may be further divided into additional steps or combined into fewer steps, depending on an embodiment of the present invention. Also, some steps may be omitted if necessary, and the order of steps may be changed.

본 발명의 일 실시예는 컴퓨터에 의해 실행되는 프로그램 모듈과 같은 컴퓨터에 의해 실행 가능한 명령어를 포함하는 기록 매체의 형태로도 구현될 수 있다. 컴퓨터 판독 가능 매체는 컴퓨터에 의해 액세스될 수 있는 임의의 가용 매체일 수 있고, 휘발성 및 비휘발성 매체, 분리형 및 비분리형 매체를 모두 포함한다. 또한, 컴퓨터 판독가능 매체는 컴퓨터 저장 매체 및 통신 매체를 모두 포함할 수 있다. 컴퓨터 저장 매체는 컴퓨터 판독가능 명령어, 데이터 구조, 프로그램 모듈 또는 기타 데이터와 같은 정보의 저장을 위한 임의의 방법 또는 기술로 구현된 휘발성 및 비휘발성, 분리형 및 비분리형 매체를 모두 포함한다. 통신 매체는 전형적으로 컴퓨터 판독가능 명령어, 데이터 구조, 프로그램 모듈, 또는 반송파와 같은 변조된 데이터 신호의 기타 데이터, 또는 기타 전송 메커니즘을 포함하며, 임의의 정보 전달 매체를 포함한다. An embodiment of the present invention may be implemented in the form of a recording medium including instructions executable by a computer, such as program modules executed by a computer. Computer readable media can be any available media that can be accessed by a computer and includes both volatile and nonvolatile media, removable and non-removable media. Also, computer readable media may include both computer storage media and communication media. Computer storage media includes both volatile and nonvolatile, removable and non-removable media implemented in any method or technology for storage of information such as computer readable instructions, data structures, program modules or other data. Communication media typically includes computer readable instructions, data structures, program modules, or other data in a modulated data signal such as a carrier wave, or other transport mechanism, and includes any information delivery media.

전술한 본 발명의 설명은 예시를 위한 것이며, 본 발명이 속하는 기술분야의 통상의 지식을 가진 자는 본 발명의 기술적 사상이나 필수적인 특징을 변경하지 않고서 다른 구체적인 형태로 쉽게 변형이 가능하다는 것을 이해할 수 있을 것이다. 그러므로 이상에서 기술한 실시예들은 모든 면에서 예시적인 것이며 한정적이 아닌 것으로 이해해야만 한다. 예를 들어, 단일형으로 설명되어 있는 각 구성 요소는 분산되어 실시될 수도 있으며, 마찬가지로 분산된 것으로 설명되어 있는 구성 요소들도 결합된 형태로 실시될 수 있다. The above description of the present invention is for illustrative purposes, and those skilled in the art can understand that it can be easily modified into other specific forms without changing the technical spirit or essential features of the present invention. will be. Therefore, the embodiments described above should be understood as illustrative in all respects and not limiting. For example, each component described as a single type may be implemented in a distributed manner, and similarly, components described as distributed may be implemented in a combined form.

본 발명의 범위는 상세한 설명보다는 후술하는 특허청구범위에 의하여 나타내어지며, 특허청구범위의 의미 및 범위 그리고 그 균등 개념으로부터 도출되는 모든 변경 또는 변형된 형태가 본 발명의 범위에 포함되는 것으로 해석되어야 한다. The scope of the present invention is indicated by the claims to be described later rather than the detailed description, and all changes or modifications derived from the meaning and scope of the claims and equivalent concepts thereof should be construed as being included in the scope of the present invention. .

100: 음성 인식 단말
110: 서버
200: 음성 입력부
210: 제어부
220: 리액션 인식부
222: 음성 인식부
224: 영상 인식부
230: 선호도 분석부
240: 선호도 서비스 모드 결정부
250: 수신부
260: 메모리100: voice recognition terminal
110: server
200: voice input unit
210: control unit
220: reaction recognition unit
222: voice recognition unit
224: image recognition unit
230: preference analysis unit
240: Preference service mode determining unit
250: receiver
260: memory

Claims

A method for providing a customized service mode performed by a voice recognition terminal,
receiving a voice command for an application installed in the voice recognition terminal;
providing a voice service through a first service mode among a plurality of service modes of the application;
Recognizing a user's reaction while providing a voice service through the first service mode;
changing the first service mode to a second service mode;
Recognizing the user's reaction while providing the voice service through the second service mode;
Analyzing a preference for each service mode based on at least one of a use history of each service mode and the reaction for each service mode; and
Determining a preferred service mode among the plurality of service modes based on the analyzed preference
including,
The reaction includes the user's voice and expression,
Recognizing the user's reaction
storing a facial expression of a recognized user as a facial expression corresponding to a positive evaluation when the reaction voice obtained from the user's voice includes a vocabulary corresponding to a positive evaluation; and
When the reaction voice includes a vocabulary corresponding to negative evaluation, storing the recognized user's facial expression as a facial expression corresponding to negative evaluation,
The facial expression stored as the positive evaluation and the facial expression stored as the negative evaluation are used to classify whether a user's reaction expression, which is input next, corresponds to either a positive evaluation or a negative evaluation.

According to claim 1,
receiving a voice command for the application after the step of determining the preferred service mode; and
providing a voice service through the determined preferred service mode;
Further comprising, a method for providing a customized service mode.

According to claim 1,
Providing a voice service through the first service mode
identifying at least one of age, gender, and number of users of the user by performing at least one of voice recognition of the voice command and image recognition of the user; and
determining the first service mode based on the age, gender, and number of the identified users;
To include, a method for providing a customized service mode.

According to claim 1,
Wherein the reaction further comprises a gesture of the user, a method for providing a customized service mode.

According to claim 4,
Recognizing the user's reaction
Acquiring the reaction voice by recognizing the user's voice;
Analyzing the preference of each service mode
Determining whether the vocabulary included in the reaction voice corresponds to a positive evaluation or a negative evaluation for each service mode
To include, a method for providing a customized service mode.

According to claim 4,
Recognizing the user's reaction
Acquiring a reaction expression or a reaction gesture by recognizing the user's image;
Analyzing the preference of each service mode
The method of providing a customized service mode further comprising determining whether the reaction expression or reaction gesture corresponds to a positive evaluation or a negative evaluation for each of the service modes.

According to claim 5 or 6,
accumulating scores for the positive evaluation and the negative evaluation for each service mode based on the determination; and
Analyzing the preference of each service mode based on the accumulated score
To include, a method for providing a customized service mode.

According to claim 1,
receiving installation data of the application and information about the plurality of service modes from a server; and
installing the application;
Further comprising, a method for providing a customized service mode.

According to claim 8,
Wherein the information on the plurality of service modes includes preferred age information and preferred gender information of each service mode.

delete

According to claim 1,
Changing the first service mode to the second service mode
Determining whether a predetermined condition for changing to the second service mode is satisfied;
Wherein the predetermined condition is the lapse of a predetermined time or the number of inputs of the reaction, a method for providing a customized service mode.

According to claim 1,
Changing the first service mode to the second service mode
and determining whether a request to change to the second service mode is input.

According to claim 2,
Recognizing the user's reaction while providing the voice service through the preferred service mode; and
The method of providing a customized service mode further comprising the step of changing the preferred service mode when the reaction corresponds to a negative evaluation.

According to claim 2,
Receiving data on a new service mode from a server; and
Changing the preferred service mode to the new service mode
Further comprising, a method for providing a customized service mode.

In the voice recognition terminal,
a voice input unit that receives a voice command for an application installed in the voice recognition terminal;
a controller that sets the service mode of the application to a first service mode during a preset condition and changes the service mode of the application to a second service mode after the preset condition;
a reaction recognition unit for recognizing a user's reaction while providing voice service through each of the service modes;
a preference analyzer configured to analyze a preference of each service mode based on at least one of a use history of each service mode and the reaction for each service mode; and
A preferred service mode determining unit determining a preferred service mode from among the plurality of service modes based on the analyzed preference.
including,
The reaction includes the user's voice and expression,
The reaction recognition unit
When the reaction voice obtained from the user's voice includes a vocabulary corresponding to a positive evaluation, the recognized user's facial expression is stored as a facial expression corresponding to a positive evaluation, and the reaction voice includes a vocabulary corresponding to a negative evaluation. At the same time, the user's facial expression recognized together is stored as a facial expression corresponding to negative evaluation,
The reaction recognition unit
Based on the facial expression stored as the positive evaluation and the facial expression stored as the negative evaluation, a reaction expression of a user input next is classified as a facial expression corresponding to a positive evaluation or a negative evaluation.

According to claim 15,
The voice input unit receives a voice command for the application after the preferred service mode is determined;
Wherein the control unit changes the service mode of the application to the determined preferred service mode.

According to claim 15,
The reaction further comprises a gesture of the user, the voice recognition terminal.

18. The method of claim 17,
The reaction recognition unit
a voice recognition unit that recognizes the user's voice and obtains the reaction voice; and
An image recognition unit that recognizes the user's image and obtains a reaction expression or a reaction gesture.
To include, a voice recognition terminal.

According to claim 18,
The voice recognition terminal, wherein the preference analysis unit is configured to determine whether the vocabulary included in the reaction voice corresponds to a positive evaluation or a negative evaluation for each of the service modes.

According to claim 18,
The voice recognition terminal, wherein the preference analysis unit is configured to determine whether the reaction expression or reaction gesture corresponds to a positive evaluation or a negative evaluation for each service mode.

According to claim 19 or 20,
Wherein the preference analyzer accumulates scores for the positive evaluation and the negative evaluation for each service mode based on the determination, and analyzes the preference for each service mode based on the accumulated score. Terminal.

According to claim 15,
Further comprising a receiving unit for receiving installation data of the application and information on the plurality of service modes from a server,
The information on the plurality of service modes includes preferred age information and preferred gender information of each of the service modes.

According to claim 18,
The reaction recognizing unit judges a recognized user's facial expression as a facial expression corresponding to a positive evaluation when the reaction voice includes a vocabulary corresponding to a positive evaluation, and recognizes a recognized expression when the reaction voice includes a vocabulary corresponding to a negative evaluation. A voice recognition terminal that determines a user's facial expression as a facial expression corresponding to negative evaluation.