KR20220053795A

KR20220053795A - System and method for providing artificial intelligence secretary service

Info

Publication number: KR20220053795A
Application number: KR1020200137945A
Authority: KR
Inventors: 당유상; 이양규
Original assignee: 주식회사 동영엠텍
Priority date: 2020-10-23
Filing date: 2020-10-23
Publication date: 2022-05-02

Abstract

The present invention relates to a system and method for providing an artificial intelligence secretary service in which a service function and an artificial intelligence secretary function interwork with each other, in addition to a content (particularly, multimedia content) provision function. According to the present invention, a content server provides content; a service server provides a service; and an artificial intelligence server manages the content provided by the content server and the service provided by the service server so that they interwork with the artificial intelligence secretary function, and recognizes a user's voice so that the content can be searched for and the service used. A user device receives the user's voice, transmits it to the artificial intelligence server, and then searches for the content the user wants and uses the service. A network connects the content server, the service server, the artificial intelligence server, and the user device so that they can exchange data with one another.

Description

System and method for providing artificial intelligence secretary service

The technical field of the present invention relates to a system and method for providing an artificial intelligence assistant service. More specifically, it relates to a system and method in which an assistant service with a visible screen can be provided with a simple structure, the screen output unit can receive the screen simply from data obtained through scanning, a large amount of content can be used at low cost because the system is used in a cloud manner, and a service function and an artificial intelligence assistant function interwork with each other in addition to a content (in particular, multimedia content) provision function.

Artificial intelligence assistant services based on voice recognition technology are being researched and developed in various ways, and they typically use an artificial intelligence speaker for voice recognition. To switch from the standby state to the activated (wake-up) mode, such a speaker requires the user to utter a predetermined call word. Once the speaker has been activated by recognizing the call word, it recognizes any subsequent service request voice from the user and provides the corresponding service.

The call word is not required only once. Even the same user, when requesting additional artificial intelligence services at intervals, must utter the call word before every request to switch the speaker back to the active state, which is a cumbersome procedure.

Moreover, once the call word has been recognized, such a speaker provides the service requested by any subsequent service request voice without separately authenticating the user. Consequently, when several users (for example, A, B, C, and D) are in the space where the speaker is installed and user A inputs the call word, the speaker treats a sound that user B in the same space utters without any intention of requesting a service as user A's service request voice, and malfunctions.

In addition, because such a speaker cannot distinguish the voices of multiple users, when a service request from user A, a service request from user B, and another service request from user A are made in sequence, it cannot link the requests by user and can only process each request separately, in parallel.

Korean Patent Registration No. 10-2087202 (registered on March 4, 2020) discloses a method for providing an artificial intelligence assistant service and voice recognition equipment used therefor. The method includes: receiving, by the voice recognition equipment, a call word voice from a user; determining, by the voice recognition equipment, whether the call word input by the user matches a preset call word; when the call word is determined to match the preset call word, authenticating the speaker, by the voice recognition equipment, by comparing a voiceprint-analysis parameter value in the first service request voice signal input after the call word voice with the voiceprint-analysis parameter value in the call word voice signal input by the user before the first service request voice; when the user utters a second service request voice without uttering the call word after the first service request, authenticating the speaker by comparing a component-analysis parameter value in the second service request voice signal with that in the first service request voice signal; and, when the user utters a third service request voice without uttering the call word after the second service request, authenticating the speaker by comparing a component-analysis parameter value in the third service request voice signal with that in the first or second service request voice signal. For each ID of a plurality of users, the voice recognition equipment cumulatively stores the component-analysis parameter value of the call word voice, and the component-analysis parameter values and service request contents of the first, second, and third service request voice signals. According to the disclosed technology, a user can use the artificial intelligence assistant service continuously without repeatedly inputting the call word; malfunctions caused by the voice of an unauthorized third party are prevented because a speaker authentication procedure is executed separately for each service request; and when service request voices from multiple legitimately authorized users accumulate, the requests can be distinguished by user and processed in a linked manner.

Korean Patent Registration No. 10-1976355 (registered on May 1, 2019) discloses an AI speaker device of the set-top-box external-connection type, and an AI speaker system using it, in which an AI speaker is connected through a digital interface (for example, USB) to the outside of a set-top box that lacks an artificial intelligence (AI) function and operates in conjunction with it, so that the whole behaves as if the AI function were integrated into the set-top box. According to the disclosed technology, the AI speaker device, which is externally connected to the set-top box through the digital interface and provides an AI speaker system through interworking, includes: a microphone voice input unit that collects and inputs ambient voice signals for the AI speaker; a digital external connection unit for external connection to the set-top box through the digital interface; a set-top box interworking unit for operating in conjunction with the set-top box through the digital interface; a playback audio buffer unit that receives the playback audio of multimedia content provided from the set-top box as an echo reference signal and temporarily stores it, and that comprises a buffer memory storing the received playback audio in order and a clock controller that controls the operation clock of the buffer memory in proportion to the occupancy of the buffer memory; an echo cancellation unit that, with reference to the echo reference signal, removes the playback audio echo component originating from the set-top box content from the ambient voice signal collected by the microphone voice input unit; a user voice processing unit that preprocesses the user's voice using the ambient voice signal from which the playback audio echo component has been removed and delivers it to the set-top box through the digital external connection unit; and a speaker voice output unit that outputs, in the voice band, artificial intelligence response data obtained through the set-top box.

The conventional technologies described above do not provide a high-performance system in which a service function and an artificial intelligence assistant function interwork with a content (in particular, multimedia content) provision function. As a result, they cannot recognize a user's voice to let the user easily search for desired content or conveniently use a service.

Korean Patent Registration No. 10-2087202
Korean Patent Registration No. 10-1976355

The problem to be solved by the present invention is to overcome the disadvantages described above by providing a system and method for providing an artificial intelligence assistant service in which an assistant service with a visible screen can be provided with a simple structure, the screen output unit can receive the screen simply from data obtained through scanning, a large amount of content can be used at low cost because the system is used in a cloud manner, and a service function and an artificial intelligence assistant function interwork with each other in addition to a content (in particular, multimedia content) provision function.

As a means for solving the above problem, according to one aspect of the present invention, there is provided a system for providing an artificial intelligence assistant service, including: a content server for providing content; a service server for providing a service; an artificial intelligence server that manages the content provided by the content server and the service provided by the service server so that they interwork with an artificial intelligence assistant function, and that recognizes a user's voice so that content can be searched for and the service used; a user device that receives the user's voice, transmits it to the artificial intelligence server, and then searches for the content the user wants and uses the service; and a network that connects the content server, the service server, the artificial intelligence server, and the user device so that they can transmit and receive data among one another.

In an embodiment, the content server transmits, to the artificial intelligence server, content information corresponding to a content name received from the artificial intelligence server.

In an embodiment, the content server searches for the content corresponding to a content selection notified from the user device and transmits that content to the user device.

In an embodiment, the content server transmits a content file containing the retrieved content to the user device in real time so that the user can view it on a screen.

In an embodiment, the content server matches content data with detailed item information for each piece of content and transmits the result to the user device.

In an embodiment, in order to provide content corresponding to a personal service platform, the content server classifies the content usage history by user ID for a plurality of user IDs preset and registered for each user device, and transmits content information to the user device.

In an embodiment, the service server matches service data with detailed item information for each service and transmits the result to the user device.

In an embodiment, in order to provide a service corresponding to a personal service platform, the service server classifies the service usage history by user ID for a plurality of user IDs preset and registered for each user device, and transmits the service to the user device.
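
The per-user-ID classification of usage history described for the content server and the service server can be pictured with a minimal sketch. The record layout, the device and user names, and the function `classify_usage_by_user` are illustrative assumptions, not part of the disclosure.

```python
from collections import defaultdict


def classify_usage_by_user(records):
    """Group usage items by (device ID, user ID).

    `records` is a hypothetical list of (device_id, user_id, item) tuples;
    the disclosure does not specify a record format.
    """
    history = defaultdict(list)
    for device_id, user_id, item in records:
        history[(device_id, user_id)].append(item)
    return dict(history)


# Hypothetical usage records for two user IDs registered on one device.
records = [
    ("tv-01", "mom", "cooking show"),
    ("tv-01", "kid", "cartoon"),
    ("tv-01", "mom", "news"),
]
usage = classify_usage_by_user(records)
```

Keying on the device and user ID together reflects the statement that several user IDs may be registered per user device; a real server would presumably persist this history rather than rebuild it in memory.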

In an embodiment, the artificial intelligence server distributes the content provided from the content server and transmits the distributed content to the user device.

In an embodiment, the artificial intelligence server processes a user response signal input from the user device.

In an embodiment, the artificial intelligence server looks up, in a database in which they have been preset and registered, the device information and personal service platform corresponding to a user ID received from the user device, and then activates the personal service platform corresponding to the user ID on the user device corresponding to the user ID.
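
As a rough illustration of the lookup-then-activate step above, the following sketch assumes a simple in-memory registry keyed by user ID. The `Registration` fields, the registry contents, and `activate_platform` are hypothetical names chosen for the example; the disclosure only says that device information and a personal service platform are pre-registered per user ID.

```python
from dataclasses import dataclass


@dataclass
class Registration:
    device_info: str  # hypothetical device descriptor
    platform: str     # hypothetical personal service platform name


# Hypothetical pre-registered database keyed by user ID.
REGISTRY = {
    "user-a": Registration(device_info="desk-view-01", platform="platform-a"),
}


def activate_platform(user_id, registry=REGISTRY):
    """Look up the user ID and return an activation instruction, or None."""
    reg = registry.get(user_id)
    if reg is None:
        return None  # unknown user ID: nothing to activate
    return {"target_device": reg.device_info, "activate": reg.platform}
```

Calling `activate_platform("user-a")` yields the device to address and the platform to activate, while an unregistered ID yields `None`.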

In an embodiment, the artificial intelligence server analyzes the user's voice received from the user device, extracts the name of the content the user wants to search for, and transmits the extracted content name to the content server.

In an embodiment, the artificial intelligence server includes a platform that provides an interface for interworking with user devices that are to provide the artificial intelligence assistant service.

In an embodiment, the artificial intelligence server follows an open standard for access delegation, that is, a protocol by which Internet users grant other web services or applications permission to access their user accounts, and this protocol is used when a client obtains an artificial intelligence assistant access token on the artificial intelligence assistant platform or when a user links his or her account in order to use a specific extension.

In an embodiment, the artificial intelligence server causes client credentials, which are the authentication information obtained by registering a client through the artificial intelligence assistant developer console, to be used to obtain the artificial intelligence assistant access token.
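
The "open standard for access delegation" described above reads like OAuth 2.0, although the text does not name it. Assuming OAuth 2.0 and its client-credentials grant, a token request could be assembled as in the sketch below. The client ID, client secret, and token URL are placeholders, and the request is only built, not sent; the actual endpoint and grant type used by the platform are not stated in the disclosure.

```python
import base64
from urllib.parse import urlencode


def build_token_request(client_id, client_secret, token_url):
    """Build (but do not send) an OAuth 2.0 client-credentials token request.

    Assumption: the platform accepts HTTP Basic client authentication.
    """
    credentials = f"{client_id}:{client_secret}".encode("utf-8")
    headers = {
        "Authorization": "Basic " + base64.b64encode(credentials).decode("ascii"),
        "Content-Type": "application/x-www-form-urlencoded",
    }
    body = urlencode({"grant_type": "client_credentials"})
    return {"method": "POST", "url": token_url, "headers": headers, "body": body}


# Placeholder credentials, standing in for those issued by a developer console.
request = build_token_request("my-client", "my-secret", "https://auth.example.com/token")
```

The account-linking case mentioned in the same passage would normally use a different, user-interactive grant, which this sketch does not cover.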

In an embodiment, the user device receives the distributed content from the artificial intelligence server and provides it to the user on a screen, and receives the service from the artificial intelligence server and provides it to the user on the screen.

In an embodiment, the user device transmits a user ID to the artificial intelligence server and then activates the personal service platform provided by the artificial intelligence server.

In an embodiment, the user ID is an identifier for distinguishing individual messages, and event messages and directive messages each carry their own user ID.

In an embodiment, the event message is a message sent from a client to the artificial intelligence server, and it is sent to convey a user request or to report that a state value of the client has changed.

In an embodiment, the directive message is a message in which the artificial intelligence server specifies how the client's behavior is to be controlled, and it is used to respond to an event message requested by the client or to deliver information to the client under a specific condition.
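
The two message kinds described above, event messages from client to server and directive (instruction) messages from server to client, can be modeled as in this sketch. The field names, the message names `UserRequest` and `ShowResults`, and the toy `handle_event` server are assumptions for illustration; the field called `message_id` here plays the role of the per-message identifier that the text calls a user ID.

```python
import uuid
from dataclasses import dataclass, field


def new_id():
    """Return a fresh identifier for one message."""
    return uuid.uuid4().hex


@dataclass
class EventMessage:
    """Client -> server: a user request or a client state change."""
    name: str
    payload: dict
    message_id: str = field(default_factory=new_id)


@dataclass
class DirectiveMessage:
    """Server -> client: tells the client what to do."""
    name: str
    payload: dict
    message_id: str = field(default_factory=new_id)


def handle_event(event):
    """Toy server: answer a user request with a directive."""
    if event.name == "UserRequest":
        return DirectiveMessage("ShowResults", {"query": event.payload["text"]})
    return None  # state-change notifications get no reply in this sketch


event = EventMessage("UserRequest", {"text": "play jazz"})
directive = handle_event(event)
```

Each message receives its own identifier on creation, so the event and the directive that answers it can be told apart.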

In an embodiment, the user device provides the content information received from the content server to the user on a screen, notifies the content server of the content selection when the user selects content, and provides the content received from the content server to the user on the screen.

In an embodiment, the user device includes a terrestrial TV that outputs in-home over-the-air broadcast content and transmits a user response signal received through an input means to the artificial intelligence server.

In an embodiment, the user device includes: an application layer that supports multiple codecs and includes a voice agent processing unit for selecting a service; a middleware layer equipped with a Java virtual machine and a streaming protocol; a system software layer including device drivers and the system software of an operating system; and a hardware layer composed of hardware such as a CPU, a media processor, flash RAM, and an Ethernet module.

In an embodiment, the user device is assigned an IPv4 address or an IPv6 address.

In an embodiment, the user device uses a digital broadcast receiving device for receiving content to receive video data and program-related digital information and provide them to the user.

In an embodiment, the user device outputs and displays the information received from the artificial intelligence server on a screen so that the user can search for desired digital information.

In an embodiment, the user device includes a desk view equipped with a touch-screen monitor on which an artificial intelligence assistant platform is installed.

In an embodiment, the user device generates a dialog ID each time the user starts a new utterance, and includes the dialog ID when the client delivers a recognition event message to the artificial intelligence server.

In an embodiment, the dialog ID is used, when the artificial intelligence server sends down a response, to link the response to the event message it answers.

In an embodiment, the dialog ID is included in the directive message so that the client can determine, from the dialog ID in the directive message, which event message it responds to, and can ignore a received directive message whose dialog ID differs from the dialog ID the client currently holds.
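
The dialog ID rule above (a new ID per utterance, with directives whose ID does not match the current one being ignored) can be sketched as follows. The `Client` class, its method names, and the dictionary message layout are hypothetical.

```python
import uuid


class Client:
    """Hypothetical client that tracks the dialog ID of the latest utterance."""

    def __init__(self):
        self.dialog_id = None

    def start_utterance(self, text):
        """Create a fresh dialog ID and return the recognition event to send."""
        self.dialog_id = uuid.uuid4().hex
        return {"type": "event", "dialog_id": self.dialog_id, "text": text}

    def accept(self, directive):
        """Return True only if the directive answers the current utterance."""
        return directive.get("dialog_id") == self.dialog_id


client = Client()
first = client.start_utterance("weather today")
second = client.start_utterance("play music")

# A late reply to the first utterance is stale; a reply to the second is current.
stale = {"type": "directive", "dialog_id": first["dialog_id"]}
fresh = {"type": "directive", "dialog_id": second["dialog_id"]}
```

Here `client.accept(stale)` is false because the user has already started a new utterance, which is the situation the rule is meant to handle.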

In an embodiment, the network is an IP network that uses a protocol for transmitting the content provided by the content server to the user device and delivering the user response signal transmitted by the user device to the content server, and a protocol for transmitting the service provided by the service server to the user device and delivering the user response signal transmitted by the user device to the service server.

In an embodiment, the user device includes: an external server interworking unit that transmits and receives data in conjunction with the artificial intelligence server, the content server, or the service server; a control unit that is connected to the external server interworking unit and controls the operation of the user device; a storage unit that is connected to the control unit and stores programs and data needed for control by the control unit; a video signal processing unit that is connected to the control unit and performs signal processing on video data received from the artificial intelligence server; a screen output unit that outputs and displays on a screen the video data processed by the video signal processing unit; a voice agent processing unit that is connected to the control unit, supports multiple codecs, and allows a service to be selected; an audio processing unit that is connected to the control unit, receives the user's voice through a microphone, processes the audio, and has it transmitted to the artificial intelligence server; and a speaker output unit that is connected to the control unit and outputs audio data received from the artificial intelligence server.

In an embodiment, the video signal processing unit performs signal processing on the content received from the artificial intelligence server and provides it to the screen output unit.

In an embodiment, the screen output unit outputs and displays on the screen the content processed by the video signal processing unit, receives a user response signal, and transmits it to the artificial intelligence server through a return channel of the user device.

In an embodiment, the voice agent processing unit includes: a content playback processing module for playing back content; a service playback processing module for playing back a service; and an external IoT processing module for processing data of an external IoT device in conjunction with that device.
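
One way to picture the three modules of the voice agent processing unit is as handlers behind a small dispatcher. The request format, the `kind` values, and all function names below are assumptions made for this sketch; the disclosure does not describe how requests are routed.

```python
def play_content(request):
    """Stand-in for the content playback processing module."""
    return f"content:{request['target']}"


def play_service(request):
    """Stand-in for the service playback processing module."""
    return f"service:{request['target']}"


def handle_iot(request):
    """Stand-in for the external IoT processing module."""
    return f"iot:{request['target']}"


# Hypothetical routing table from request kind to module.
MODULES = {"content": play_content, "service": play_service, "iot": handle_iot}


def dispatch(request):
    """Route a request to the module registered for its kind."""
    handler = MODULES.get(request["kind"])
    if handler is None:
        raise ValueError(f"unknown request kind: {request['kind']}")
    return handler(request)
```

For example, `dispatch({"kind": "iot", "target": "lamp"})` is routed to the IoT handler, and an unrecognized kind raises `ValueError`.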

In an embodiment, the audio processing unit includes: an echo cancellation module that identifies audio generated by the user device itself and removes the audio of that internal sound source on the principle that a sound is cancelled when combined with its phase-inverted copy; a noise cancellation module that removes ambient noise picked up by the microphone; and a microphone voice input module that receives audio through the microphone.
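
The phase-inversion principle behind the echo cancellation module can be shown with an idealized sketch. It assumes the device knows the exact samples it played and that they reach the microphone unchanged and perfectly aligned; a practical echo canceller would also have to estimate delay and room response, which is not covered here. The function name and sample values are illustrative.

```python
def cancel_echo(mic, reference):
    """Add the phase-inverted reference to the microphone signal.

    Idealized: `reference` is assumed to be sample-aligned with `mic`
    and to arrive at the microphone unmodified.
    """
    inverted = [-sample for sample in reference]  # phase inversion
    return [m + r for m, r in zip(mic, inverted)]


# Hypothetical integer samples: the user's voice and the device's own playback.
voice = [3, -1, 4, 0]
playback = [10, 20, -5, 7]

# The microphone hears both at once.
mic = [v + p for v, p in zip(voice, playback)]

cleaned = cancel_echo(mic, playback)
```

Under these assumptions the inverted playback cancels the playback component exactly, leaving only the voice samples.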

As a means for solving the above problem, according to another aspect of the present invention, there is provided a method for providing an artificial intelligence assistant service, including: providing content, by a content server; providing a service, by a service server; managing, by an artificial intelligence server, the content provided by the content server and the service provided by the service server so that they interwork with an artificial intelligence assistant function; receiving, by a user device, a user's voice and transmitting it to the artificial intelligence server; recognizing, by the artificial intelligence server, the user's voice so that content can be searched for and the service used; and searching, by the user device, for the content the user wants and using the service.
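
The content-search part of the method steps above can be traced end to end with a toy sketch. Speech recognition is replaced by plain text, the catalog is an in-memory dictionary, and every function name, the `find ` command prefix, and the catalog entry are assumptions for illustration only.

```python
# Hypothetical content-server data: content name -> content information.
CATALOG = {"jazz concert": {"title": "jazz concert", "length_min": 90}}


def content_server_lookup(name):
    """Content server: return content information for a content name."""
    return CATALOG.get(name)


def ai_server_extract_name(utterance):
    """AI server stand-in: the utterance is already text; strip a command prefix."""
    prefix = "find "
    return utterance[len(prefix):] if utterance.startswith(prefix) else utterance


def user_device_search(utterance):
    """User device: pass the utterance on and return what comes back."""
    name = ai_server_extract_name(utterance)  # device -> AI server
    return content_server_lookup(name)        # AI server -> content server
```

With this setup, `user_device_search("find jazz concert")` returns the catalog entry, and a request for content that is not in the catalog returns `None`.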

As an effect of the present invention, by providing a system and method for providing an artificial intelligence assistant service in which an assistant service with a visible screen can be provided with a simple structure, the screen output unit can receive the screen simply from data obtained through scanning, a large amount of content can be used at low cost because the system is used in a cloud manner, and a service function and an artificial intelligence assistant function interwork with each other in addition to a content (in particular, multimedia content) provision function, the user's voice can be recognized so that the user can easily search for desired content and conveniently use the service.

FIG. 1 is a diagram illustrating an artificial intelligence assistant service providing system according to an embodiment of the present invention.
FIG. 2 is a diagram illustrating the user device shown in FIG. 1.
FIG. 3 is a diagram illustrating a method for providing an artificial intelligence assistant service according to an embodiment of the present invention.
FIG. 4 is a schematic explanatory diagram showing an actual use state of the artificial intelligence assistant service providing system according to a first embodiment of the present invention.
FIG. 5 shows the initial start screen of the desk view of the artificial intelligence assistant service providing system according to the first embodiment of the present invention.
FIGS. 6A, 6B, and 6C are diagrams illustrating the operation of the desk view of the artificial intelligence assistant service providing system according to the first embodiment of the present invention.

Hereinafter, embodiments of the present invention are described in detail with reference to the accompanying drawings so that those of ordinary skill in the art to which the present invention pertains can easily practice them. However, because the description of the present invention is merely an embodiment for structural or functional explanation, the scope of the present invention should not be construed as limited by the embodiments described herein. That is, because the embodiments can be modified in various ways and can take various forms, the scope of the present invention should be understood to include equivalents capable of realizing its technical idea. In addition, the objects or effects presented in the present invention do not mean that a specific embodiment must include all of them or only such effects, so the scope of the present invention should not be understood as limited thereby.

The meanings of the terms used in the present invention should be understood as follows.

Terms such as "first" and "second" are used to distinguish one component from another, and the scope of rights should not be limited by these terms. For example, a first component may be termed a second component, and similarly, a second component may be termed a first component. When a component is referred to as being "connected to" another component, it may be directly connected to that other component, but it should be understood that another component may exist in between. In contrast, when a component is referred to as being "directly connected to" another component, it should be understood that no other component exists in between. Other expressions describing the relationship between components, such as "between" and "immediately between" or "adjacent to" and "directly adjacent to", should be interpreted in the same way.

A singular expression should be understood to include the plural unless the context clearly indicates otherwise. Terms such as "comprise" or "have" are intended to specify that a stated feature, number, step, operation, component, part, or combination thereof exists, and should be understood not to preclude the existence or possible addition of one or more other features, numbers, steps, operations, components, parts, or combinations thereof.

Unless otherwise defined, all terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which the present invention belongs. Terms defined in commonly used dictionaries should be interpreted as having a meaning consistent with their meaning in the context of the related art, and should not be interpreted as having an ideal or excessively formal meaning unless explicitly defined in the present invention.

A system and method for providing an artificial intelligence assistant service according to an embodiment of the present invention will now be described in detail with reference to the drawings.

FIG. 1 is a diagram illustrating an artificial intelligence assistant service providing system according to an embodiment of the present invention.

Referring to FIG. 1, the artificial intelligence assistant service providing system 100 includes a content server 110, a service server 120, an artificial intelligence server 130, a user device 140, and a network 150.

The content server 110 provides various kinds of content (in particular, multimedia content) to the artificial intelligence server 130 through the network 150.

In an embodiment, the content server 110 may receive a content name transmitted from the artificial intelligence server 130 and transmit content information corresponding to the received content name to the user device 140 (or via the artificial intelligence server 130) so that the user can check it on the screen. The content server 110 may then be notified of a content selection from the user device 140 (or via the artificial intelligence server 130), search for the corresponding content, and transmit it to the user device 140 (or via the artificial intelligence server 130).

In an embodiment, the content server 110 may search for the content corresponding to the content selection notified from the user device 140, and transmit a content file containing the retrieved content to the user device 140 in real time so that the user can view it on the screen.

In an embodiment, the content server 110 may match data for each kind of content with detailed item information for that content and transmit it to the user device 140. For example, in the case of music content, the data may be matched with detailed item information such as the content name, singer, and playback time before being transmitted to the user device 140. Other content includes a variety of items such as cooking, restaurants, TV schedules, stories, sports games, stock quotes, real estate prices, and radio reception.
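
The matching of content data with detailed item information described above can be pictured with a small sketch. This is only an illustration under stated assumptions: the class `MusicItem`, the function `build_payload`, the field names, and the `content/0001` reference are all hypothetical and do not come from the specification.

```python
from dataclasses import dataclass, asdict


@dataclass
class MusicItem:
    """Detailed item information for one piece of music content (hypothetical fields)."""
    content_name: str
    singer: str
    playback_seconds: int


def build_payload(item: MusicItem, data_ref: str) -> dict:
    """Pair a reference to the content data with its detailed item information."""
    return {"category": "music", "data": data_ref, "details": asdict(item)}


# Hypothetical example values, not taken from the specification.
payload = build_payload(MusicItem("Example Song", "Example Singer", 215), "content/0001")
print(payload["details"])
```

Other content categories (cooking, TV schedules, stock quotes, and so on) would presumably carry their own sets of detail fields in the same way.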

In an embodiment, in order to provide content corresponding to an individual service platform, the content server 110 may classify the content usage history by user ID for the plurality of user IDs registered in advance for each user device 140, and transmit the content information to the user device 140.
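
Classifying usage history by user ID amounts to grouping usage records under each registered ID. A minimal sketch follows, assuming usage is logged as `(user_id, content_name)` pairs; the function name `classify_usage` and the sample IDs are hypothetical.

```python
from collections import defaultdict


def classify_usage(events):
    """Group (user_id, content_name) usage events by user ID, keeping arrival order."""
    history = defaultdict(list)
    for user_id, content_name in events:
        history[user_id].append(content_name)
    return dict(history)


# Hypothetical usage log for two user IDs registered on the same device.
events = [("mom", "recipe video"), ("kid", "nursery rhyme"), ("mom", "stock quotes")]
usage = classify_usage(events)
print(usage)
```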

The service server 120 provides various services to the artificial intelligence server 130 through the network 150.

The various services of the service server 120 include, for example, the following: a smart home service, in which synchronized IoT devices such as digital door locks and lighting can be controlled as a group or individually; a music and content service, which provides rich content such as customized recommended music, audiobooks, and audio clips; a children and learning service, which provides children's songs, fairy tales, language-learning content, and childcare content in Korean, English, Chinese, and other foreign languages; a life information service, which provides searches for briefings, weather, news, finance, knowledge, people, sports, and the like; a schedule management service, which provides personal assistant functions such as schedules, alarms, timers, briefings, and memos; and a cooking encyclopedia service, which supports voice search for more than about 220 dishes and main ingredients.
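
The group-or-individual control of IoT devices mentioned for the smart home service can be sketched as below. The classes `Device` and `DeviceGroup` and the device names are hypothetical; the specification does not describe how the control is implemented.

```python
class Device:
    """A synchronized IoT device with a simple on/off state (hypothetical model)."""

    def __init__(self, name):
        self.name = name
        self.on = False


class DeviceGroup:
    """Controls registered devices either all together or one at a time."""

    def __init__(self, devices):
        self.devices = {d.name: d for d in devices}

    def set_all(self, on):
        """Group control: apply the same state to every device."""
        for device in self.devices.values():
            device.on = on

    def set_one(self, name, on):
        """Individual control: change only the named device."""
        self.devices[name].on = on


group = DeviceGroup([Device("door_lock"), Device("living_room_light")])
group.set_all(True)
group.set_one("living_room_light", False)
states = {name: d.on for name, d in group.devices.items()}
print(states)
```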

In an embodiment, the service server 120 may match data for each service with detailed item information for that service and transmit it to the user device 140 (or via the artificial intelligence server 130). For example, in the case of weather, the data may be matched with detailed item information such as weather information by period for each region before being transmitted to the user device 140.

In an embodiment, in order to provide a service corresponding to an individual service platform (for example, a calendar), the service server 120 may classify the service usage history by user ID for the plurality of user IDs registered in advance for each user device 140, and transmit the service to the user device 140.

The artificial intelligence server 130 manages the various content provided from the content server 110 through the network 150 and the various services provided from the service server 120 through the network 150 by interworking them with the artificial intelligence assistant function, and recognizes the user's voice transmitted from the user device 140 so that the user can search for the desired content and use the services.

In an embodiment, the artificial intelligence server 130 may distribute the various content provided from the content server 110 and transmit the distributed content to the user device 140.

In an embodiment, the artificial intelligence server 130 may process a user response signal input from a voice user interface (VUI).

In an embodiment, the artificial intelligence server 130 may receive a user ID transmitted from the user device 140, look up the device information and individual service platform corresponding to the received user ID in a database in which they have been registered in advance, and then activate the individual service platform corresponding to the received user ID on the user device 140 corresponding to that user ID.
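
The lookup-then-activate step can be sketched as a dictionary query. The in-memory `REGISTRY`, its field names, and the function `activate_platform` are hypothetical stand-ins for the pre-registered database, whose actual schema the specification does not give.

```python
# Hypothetical pre-registered database: user ID -> device information and platform.
REGISTRY = {
    "user-001": {"device": "deskview-01", "platform": "family-calendar"},
}


def activate_platform(user_id, registry):
    """Look up a user ID and return an activation instruction, or None if unregistered."""
    entry = registry.get(user_id)
    if entry is None:
        return None
    return {"device": entry["device"], "activate": entry["platform"]}


result = activate_platform("user-001", REGISTRY)
print(result)
```

Returning `None` for an unknown ID is a design choice of this sketch; the specification does not say how unregistered IDs are handled.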

In an embodiment, the artificial intelligence server 130 may analyze the user's voice received from the user device 140, extract the name of the content the user wants to find, and transmit the extracted content name to the content server 110.

In an embodiment, the artificial intelligence server 130 may include, as an artificial intelligence assistant interface connect (AIC), a platform that provides a user device 140 intending to offer the artificial intelligence assistant service with an interface for interworking with the artificial intelligence server 130.

In an embodiment, the artificial intelligence server 130 may follow a protocol (for example, OAuth 2.0) that is an open standard for delegating access rights, by which an Internet user grants another web service or application permission to access the user's account. On the artificial intelligence assistant platform, this protocol may be used when a client obtains an artificial intelligence assistant access token or when a user links his or her account in order to use a specific extension. In this case, the client authentication information, which is the credential obtained by registering the client through the artificial intelligence assistant developer console, may be used to obtain the artificial intelligence assistant access token.
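
As an illustration of how registered client credentials can be used to request an access token, the sketch below assembles the headers and form body of an OAuth 2.0 token request. It assumes the client-credentials grant with HTTP Basic client authentication; the specification names OAuth 2.0 only as an example and does not state which grant type, endpoint, or parameters the platform uses, so the function `build_token_request` and the sample credentials are hypothetical. No network call is made.

```python
import base64
from urllib.parse import urlencode


def build_token_request(client_id, client_secret):
    """Build headers and body for a token request (assumed client-credentials grant)."""
    raw = f"{client_id}:{client_secret}".encode("utf-8")
    headers = {
        # HTTP Basic client authentication: base64("client_id:client_secret").
        "Authorization": "Basic " + base64.b64encode(raw).decode("ascii"),
        "Content-Type": "application/x-www-form-urlencoded",
    }
    body = urlencode({"grant_type": "client_credentials"})
    return headers, body


# Hypothetical credentials, standing in for those issued by a developer console.
headers, body = build_token_request("demo-client", "demo-secret")
print(body)
```

A real client would send this request to the platform's token endpoint over TLS and read the access token from the response. For simplicity, the sketch assumes the credentials contain no characters that need form-encoding before Basic encoding.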

The user device 140 receives the user's voice, transmits the received voice to the artificial intelligence server 130 through the network 150, and then, through the artificial intelligence server 130, searches for the content the user wants to find and uses the service.

In an embodiment, the user device 140 is a device for using content and services. It may receive the distributed content transmitted from the artificial intelligence server 130 and present it to the user on the screen, and may likewise receive a service transmitted from the artificial intelligence server 130 and present it to the user on the screen.

In an embodiment, the user device 140 may transmit the user's ID to the artificial intelligence server 130 and then activate the individual service platform provided by the artificial intelligence server 130. Here, the user ID is an identifier for distinguishing individual messages, and event messages and directive messages each carry their own user ID. An event message is a message sent from the client to the artificial intelligence server 130; it is sent when delivering a user request (voice input) or when reporting that a state value of the client has changed. A directive message is a message carrying an instruction by which the artificial intelligence server 130 controls the behavior of the client; it is used to respond to an event message sent by the client or to deliver information to the client under a specific condition.

In an embodiment, the user device 140 may receive content information transmitted from the content server 110 and present it to the user on the screen. When the user selects content, the user device 140 may notify the content server 110 of the selection, and then receive the content transmitted from the content server 110 and present it to the user on the screen.

In an embodiment, the user device 140 may include a terrestrial TV that outputs in-home over-the-air broadcast content and transmits a user response signal, received through an input means such as a remote control or a touch screen, to the artificial intelligence server 130 through the VA.

In an embodiment, the user device 140 may include four layers and may be assigned an IPv4 address or an IPv6 address. The four layers are: an application layer that supports various multi-codecs such as MPEG2, MPEG4, MPEG7, H.264, and WMV-9 and includes a voice agent processing unit 146 (see FIG. 2) for selecting various services; a middleware layer equipped with a Java Virtual Machine (JVM) and streaming protocols (RTP, RTSP); a system software layer including system software such as device drivers and an operating system; and a hardware layer composed of hardware such as a CPU, a media processor, flash RAM, and an Ethernet module.

The user device 140 includes all four layers: as a first layer, a hardware layer composed of hardware such as a CPU, a media processor, flash RAM, and an Ethernet module; as a second layer, a system software layer including system software such as device drivers and an operating system; as a third layer, a middleware layer equipped with a Java Virtual Machine (JVM) and streaming protocols (RTP, RTSP); and as a fourth layer, an application layer that supports various multi-codecs such as MPEG2, MPEG4, MPEG7, H.264, and WMV-9 and includes a voice agent processing unit for selecting various services.
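
The four-layer ordering described above can be written down as a small data structure. The enum `Layer`, the mapping `STACK`, and the helper `layers_bottom_up` are hypothetical names; the component lists simply restate the examples given in the text.

```python
from enum import IntEnum


class Layer(IntEnum):
    """The four layers of the user device, numbered as in the description."""
    HARDWARE = 1
    SYSTEM_SOFTWARE = 2
    MIDDLEWARE = 3
    APPLICATION = 4


# Example components per layer, restated from the description.
STACK = {
    Layer.APPLICATION: ["multi-codec support", "voice agent processing unit"],
    Layer.MIDDLEWARE: ["Java Virtual Machine", "RTP", "RTSP"],
    Layer.SYSTEM_SOFTWARE: ["device drivers", "operating system"],
    Layer.HARDWARE: ["CPU", "media processor", "flash RAM", "Ethernet module"],
}


def layers_bottom_up():
    """Return layer names ordered from the first (hardware) to the fourth (application)."""
    return [layer.name for layer in sorted(STACK)]


print(layers_bottom_up())
```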

In an embodiment, the user device 140 may include a digital broadcast receiving device for receiving the various content transmitted through the network 150, and may use that device to receive not only video data but also the digital information related to every program and provide it to the user.

In an embodiment, after receiving information transmitted from the artificial intelligence server 130, the user device 140 may display the received information on the screen of the screen output unit 145 (see FIG. 2) so that the user can easily search for the desired digital information.

In an embodiment, the user device 140 may include a device equipped with a touch-screen monitor and carrying the artificial intelligence assistant platform (a desk view).

In an embodiment, the user device 140 may generate a dialog ID each time the user starts a new utterance, and may include the generated dialog ID when the client sends a recognize event message to the artificial intelligence server 130. Here, the dialog ID is used to link a response from the artificial intelligence server 130 to the event message it answers, and may also be included in directive messages. The client can determine which event message a directive message responds to by checking the dialog ID included in the directive message. If the dialog ID currently held by the client differs from the dialog ID of a directive message, the client may ignore the received directive message.
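
The dialog ID handling above can be sketched as follows. The class `Client`, the message field names, and the use of random UUIDs as dialog IDs are assumptions made for illustration. The sketch also assumes that a directive carrying no dialog ID is accepted, a case the specification leaves open.

```python
import uuid


class Client:
    """Tracks the dialog ID of the most recent utterance (hypothetical client model)."""

    def __init__(self):
        self.dialog_id = None

    def start_utterance(self, audio_ref):
        """Generate a new dialog ID and return a recognize event message that carries it."""
        self.dialog_id = uuid.uuid4().hex
        return {"type": "event", "name": "recognize",
                "dialog_id": self.dialog_id, "audio": audio_ref}

    def handle_directive(self, directive):
        """Return True if the directive is accepted, False if it is ignored as stale."""
        directive_id = directive.get("dialog_id")
        # Assumption: directives without a dialog ID are accepted.
        if directive_id is not None and directive_id != self.dialog_id:
            return False
        return True


client = Client()
first = client.start_utterance("utterance-1")
second = client.start_utterance("utterance-2")  # the user speaks again before a reply

stale = client.handle_directive({"type": "directive", "dialog_id": first["dialog_id"]})
fresh = client.handle_directive({"type": "directive", "dialog_id": second["dialog_id"]})
print(stale, fresh)
```

Ignoring stale directives in this way keeps a late reply to an earlier utterance from overriding the response to the utterance the user has just made.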

The network 150 includes a wired or wireless communication network and connects the content server 110, the service server 120, the artificial intelligence server 130, and the user device 140 to one another so that they can exchange data over wired or wireless links.

In an embodiment, the network 150 may be an IP (Internet Protocol) network that uses: a protocol for transmitting the various content provided from the content server 110 (or via the artificial intelligence server 130) to the user device 140 and for delivering a user response signal transmitted from the user device 140 (or via the artificial intelligence server 130) to the content server 110; and a protocol for transmitting the various services provided from the service server 120 (or via the artificial intelligence server 130) to the user device 140 and for delivering a user response signal transmitted from the user device 140 (or via the artificial intelligence server 130) to the service server 120.

The artificial intelligence assistant service providing system 100 configured as described above is implemented so that not only the content (in particular, multimedia content) provision function but also the service function and the artificial intelligence assistant function interwork with each other. As a result, it recognizes the user's voice so that the user can easily search for the desired content and conveniently use the service.

FIG. 2 is a diagram illustrating the user device shown in FIG. 1.

Referring to FIG. 2, the user device 140 includes an external server interworking unit 141, a control unit 142, a storage unit 143, a video signal processing unit 144, a screen output unit 145, a voice agent processing unit 146, an audio processing unit 147, and a speaker output unit 148.

More specifically, the user device 140 includes: an external server interworking unit 141 that interworks with the artificial intelligence server 130, the content server 110, or the service server 120 to transmit and receive data; a control unit 142 that is connected to the external server interworking unit 141 and controls the operation of the user device 140; a storage unit 143 that is connected to the control unit 142 and stores the programs and data required for control by the control unit; a video signal processing unit 144 that is connected to the control unit 142 and performs signal processing on video data received from the artificial intelligence server 130; a screen output unit 145 that displays the video data processed by the video signal processing unit 144 on a screen; a voice agent processing unit 146 that is connected to the control unit 142, supports multiple codecs, and selects a service; an audio processing unit 147 that is connected to the control unit 142, receives the user's voice through a microphone, processes the audio, and has it transmitted to the artificial intelligence server 130; and a speaker output unit 148 that is connected to the control unit 142 and outputs audio data received from the artificial intelligence server 130.

The external server interworking unit 141 interworks with the artificial intelligence server 130 (or the content server 110 or the service server 120) under the control of the control unit 142 to transmit and receive data.

The control unit 142 controls the operation of the user device 140, that is, of its components (the external server interworking unit 141, the video signal processing unit 144, the screen output unit 145, the voice agent processing unit 146, the audio processing unit 147, and the speaker output unit 148).

The storage unit 143 stores the programs and data required for control by the control unit 142.

The video signal processing unit 144 receives, through the control unit 142, the video data received from the artificial intelligence server 130, performs signal processing on that video data, and provides it to the screen output unit 145.

In an embodiment, the video signal processing unit 144 may receive, through the control unit 142, the content received from the artificial intelligence server 130, perform signal processing on that content, and provide it to the screen output unit 145.

The screen output unit 145 receives the signal-processed video data from the video signal processing unit 144 and displays it on the screen.

The screen output unit 145 may be a kitchen TV used in a kitchen, a desk view used in a living room or bedroom, or any other viewing means with a visible screen.

In an embodiment, the screen output unit 145 may receive the signal-processed content from the video signal processing unit 144 and display it on the screen, and may have a user response signal input through the VA transmitted to the artificial intelligence server 130 through the return channel of the user device 140.

The voice agent processing unit 146 supports various multi-codecs such as MPEG2, MPEG4, MPEG7, H.264, and WMV-9 and selects among the various services.

In an embodiment, the voice agent processing unit 146 is responsible for the voice agent area of the application layer and may be applied using about 20 software templates.

In an embodiment, the voice agent processing unit 146 may include a content playback processing module 461 for playing back content, a service playback processing module 462 for playing back services, and an external IoT processing module 463 for interworking with external IoT devices and processing their data.
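
One way to picture the three modules of the voice agent processing unit is as handlers selected by request kind. The handler functions, the `HANDLERS` table, and the request strings below are hypothetical; the specification does not describe how the unit routes requests.

```python
def play_content(request):
    """Stand-in for the content playback processing module 461."""
    return f"content:{request}"


def play_service(request):
    """Stand-in for the service playback processing module 462."""
    return f"service:{request}"


def handle_iot(request):
    """Stand-in for the external IoT processing module 463."""
    return f"iot:{request}"


HANDLERS = {"content": play_content, "service": play_service, "iot": handle_iot}


def dispatch(kind, request):
    """Route a request to the module registered for its kind."""
    handler = HANDLERS.get(kind)
    if handler is None:
        raise ValueError(f"unknown request kind: {kind}")
    return handler(request)


print(dispatch("iot", "door_lock/lock"))
```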

The audio processing unit 147 receives the user's voice through a microphone, processes the received voice as audio, and has it transmitted to the artificial intelligence server 130 through the control unit 142.

In an embodiment, the audio processing unit 147 may include: an echo cancellation module 471 that identifies the audio generated by the user device 140 itself and removes the audio of this internal sound source, based on the principle that a sound is cancelled when combined with its phase-inverted copy; a noise cancellation module 472 that removes ambient noise picked up by the microphone, such as the sound of an air conditioner, to keep the voice clearer and cleaner; and a microphone voice input module 473 that receives audio through a high-performance microphone.

The echo cancellation module 471 recognizes the audio generated by the device itself and removes the audio of the internal sound source using the principle that a sound is cancelled when its phase is inverted.
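
The phase-inversion principle can be illustrated with a minimal Python sketch. It assumes an idealized case in which the microphone picks up the device's own playback exactly as it was emitted (no delay, gain change, or room reverberation). The function name `cancel_echo` and all sample values are hypothetical and do not come from the specification.

```python
def cancel_echo(mic_samples, playback_samples):
    """Remove the device's own playback from the microphone signal.

    Idealized assumption: the playback reaches the microphone unchanged,
    so adding its phase-inverted copy cancels it.
    """
    if len(mic_samples) != len(playback_samples):
        raise ValueError("signals must have the same length")
    inverted = [-p for p in playback_samples]  # phase inversion
    return [m + i for m, i in zip(mic_samples, inverted)]


# Hypothetical samples: the user's voice mixed with the device's own audio.
voice = [0.10, -0.20, 0.05, 0.30]
playback = [0.50, 0.25, -0.40, 0.10]
mic = [v + p for v, p in zip(voice, playback)]

cleaned = cancel_echo(mic, playback)  # approximately equal to `voice`
```

In a real device the playback arrives at the microphone delayed and filtered by the room, so a practical echo canceller typically estimates that path with an adaptive filter rather than subtracting the raw playback.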

The noise cancellation module 472 removes ambient noise picked up by the microphone, such as air conditioner sound, so that the voice stays clearer and cleaner.

The microphone voice input module 473 receives audio through a high-performance microphone.

The speaker output unit 148 receives, through the control unit 142, the audio data received from the artificial intelligence server 130 and outputs that audio data.

FIG. 3 is a diagram illustrating a method of providing an artificial intelligence assistant service according to an embodiment of the present invention.

Referring to FIG. 3, the content server 110 provides various content (in particular, multimedia content) to the artificial intelligence server 130 through the network 150 (S301).

In providing the content in step S301, the content server 110 may receive a content name transmitted from the artificial intelligence server 130 and transmit the content information corresponding to that name to the user device 140 (directly or via the artificial intelligence server 130) so that the user can check it on the screen. The content server 110 may then be notified of a content selection by the user device 140 (directly or via the artificial intelligence server 130), retrieve the corresponding content, and transmit it to the user device 140 (directly or via the artificial intelligence server 130).

In providing the content in step S301, the content server 110 may retrieve the content corresponding to the content selection notified by the user device 140 and transmit a content file containing the retrieved content to the user device 140 in real time so that the user can view it on the screen.

In providing the content in step S301, the content server 110 may match the data of each content item with detailed item information for that content and transmit it to the user device 140. For example, for music content, the data may be matched with detailed item information such as the content name, singer, and playback time before being transmitted to the user device 140.
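
As a minimal sketch of this matching, the Python fragment below pairs content data with its detailed item information for the music example. The class `MusicContentInfo`, the function `attach_detail_info`, the field names, and the sample values are all hypothetical; the specification does not define a message format.

```python
from dataclasses import dataclass, asdict


@dataclass
class MusicContentInfo:
    """Detailed item information for one music content item (assumed fields)."""
    content_name: str
    singer: str
    playback_seconds: int


def attach_detail_info(content_data: bytes, info: MusicContentInfo) -> dict:
    """Pair raw content data with its detailed item information."""
    return {"data": content_data, "detail": asdict(info)}


# Hypothetical content item as it might be sent to the user device.
message = attach_detail_info(
    b"<audio bytes>",
    MusicContentInfo("Example Song", "Example Singer", 215),
)
```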

In providing the content in step S301, in order to provide content corresponding to a personal service platform, the content server 110 may classify the content usage history by user ID for the plurality of user IDs registered in advance for each user device 140, and transmit content information to the user device 140.
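
The per-user classification can be sketched as a two-level grouping: first by device, then by the user IDs registered on that device. This is only an illustration under assumed data shapes; `classify_usage`, the event tuples, and the IDs are hypothetical.

```python
from collections import defaultdict


def classify_usage(events):
    """Group content usage events by device, then by user ID.

    Each event is assumed to be a (device_id, user_id, content_name) tuple.
    """
    history = defaultdict(lambda: defaultdict(list))
    for device_id, user_id, content_name in events:
        history[device_id][user_id].append(content_name)
    # Convert to plain dicts so the result is easy to inspect or serialize.
    return {device: dict(users) for device, users in history.items()}


# Hypothetical usage log: two user IDs share one device.
events = [
    ("device-1", "alice", "Song A"),
    ("device-1", "bob", "Movie B"),
    ("device-1", "alice", "Song C"),
]
history = classify_usage(events)
```

With the history split this way, the content information sent to a device can be limited to the entries of whichever user ID is currently active.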

While the content is provided in step S301, the service server 120 provides various services to the artificial intelligence server 130 through the network 150 (S302).

In providing the service in step S302, the service server 120 may match the data of each service with detailed item information for that service and transmit it to the user device 140 (directly or via the artificial intelligence server 130). For example, for weather, the data may be matched with detailed item information such as weather information by region and period before being transmitted to the user device 140.

In providing the service in step S302, in order to provide a service (for example, a calendar) corresponding to a personal service platform, the service server 120 may classify the service usage history by user ID for the plurality of user IDs registered in advance for each user device 140, and transmit the service to the user device 140.

Once the service is provided in step S302, the artificial intelligence server 130 manages the various content provided by the content server 110 through the network 150 and the various services provided by the service server 120 through the network 150 by interworking them with the artificial intelligence assistant function. It also recognizes the user's voice transmitted from the user device 140 so that the user can search for the desired content and use the services (S303).

In providing content search and service use in step S303, the artificial intelligence server 130 may distribute the various content provided by the content server 110 and transmit the distributed content to the user device 140.

In providing content search and service use in step S303, the artificial intelligence server 130 may process a user response signal input from the voice user interface (VUI).

In providing content search and service use in step S303, the artificial intelligence server 130 may receive a user ID transmitted from the user device 140, look up the device information and personal service platform corresponding to that user ID in a database where they were registered in advance, and then activate the personal service platform corresponding to that user ID on the user device 140 corresponding to that user ID.

In providing content search and service use in step S303, the artificial intelligence server 130 may analyze the user's voice received from the user device 140 to extract the name of the content the user wants to search for, and then transmit the extracted content name to the content server 110.

As content search and service use are provided in step S303, the user device 140 receives the user's voice, transmits it to the artificial intelligence server 130 through the network 150, and then searches for the content the user wants and uses the service through the artificial intelligence server 130 (S304).

In performing content search and service use in step S304, the user device 140, which is the device for using content and services, may receive the distributed content transmitted from the artificial intelligence server 130 and present it to the user on the screen. It may likewise receive a service transmitted from the artificial intelligence server 130 and present it to the user on the screen.

In performing content search and service use in step S304, the user device 140 may transmit the user's ID to the artificial intelligence server 130 and then activate the personal service platform provided by the artificial intelligence server 130.

In performing content search and service use in step S304, the user device 140 may receive the content information transmitted from the content server 110 and present it to the user on the screen. When the user selects content, the user device 140 may notify the content server 110 of the selection, and may then receive the content transmitted from the content server 110 and present it to the user on the screen.

In performing content search and service use in step S304, a user device 140 equipped with a terrestrial TV may use the terrestrial TV to output over-the-air broadcast content in the home, and may transmit a user response signal, entered through an input means such as a remote control or screen touch, to the artificial intelligence server 130 through the VA.

In performing content search and service use in step S304, the user device 140 may include four layers: an application layer including the voice agent processing unit 146 (see FIG. 2), which supports multiple codecs (multi-CODEC) such as MPEG2, MPEG4, MPEG7, H.264, and WMV-9 and selects among the various services; a middleware layer carrying a Java Virtual Machine (JVM) and streaming protocols (RTP, RTSP); a system software layer including system software such as device drivers and an operating system; and a hardware layer composed of hardware such as a CPU, a media processor, flash RAM, and an Ethernet module. An IPv4 or IPv6 address may be assigned to the user device 140.

In performing content search and service use in step S304, a user device 140 that includes a digital broadcast receiving device for receiving the various content transmitted through the network 150 may use that device to receive not only video data but also digital information related to all programs, and provide it to the user.

In performing content search and service use in step S304, the user device 140 may receive information transmitted from the artificial intelligence server 130 and display it on the screen of the screen output unit 145 (see FIG. 2) so that the user can easily search for the desired digital information.

The method of providing an artificial intelligence assistant service according to an embodiment of the present invention, configured as described above, may broadly include a device authentication step, a content use step, and a service use step.

The first step, device authentication, authenticates the user device 140. When the user enters a user ID on the user device 140, the user device 140 connects to the artificial intelligence server 130 using that user ID. On receiving the user ID, the artificial intelligence server 130 looks up the device information corresponding to it and then determines whether the received user ID is registered as an ID that the artificial intelligence server 130 can control.

If the determination shows that the ID is not registered, the artificial intelligence server 130 sends an unavailable message to the user device 140.

If, on the other hand, the received ID is registered, the process continues: the artificial intelligence server 130 looks up the personal service platform corresponding to the received ID and activates it.
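
The device authentication step described above can be sketched as a registry lookup with two outcomes. This is a minimal illustration, not the patented implementation: the registry contents, the function `authenticate`, and the shape of the returned messages are assumptions.

```python
# Hypothetical registry: user ID -> device information and personal platform.
REGISTERED_IDS = {
    "user-001": {"device": "desk-view", "platform": "personal-platform-001"},
}


def authenticate(user_id, registry=REGISTERED_IDS):
    """Return an activation result for a registered ID, else an unavailable message."""
    record = registry.get(user_id)
    if record is None:
        # Unregistered ID: the server answers with an unavailable message.
        return {"status": "unavailable", "message": "This ID cannot use the service."}
    # Registered ID: look up and activate the matching personal service platform.
    return {
        "status": "activated",
        "device": record["device"],
        "platform": record["platform"],
    }


ok = authenticate("user-001")
rejected = authenticate("unknown-user")
```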

The second step, content use, is the step in which the user uses content through the user device 140. When the user gives a voice input to the user device 140 (that is, utters a specific word), the user device 140 activates the VA and switches to a standby state in which it can accept the user's voice. When the user then utters the name of the content to be searched for, the user device 140 transmits the uttered content name to the artificial intelligence server 130.

The artificial intelligence server 130 recognizes the voice uttered by the user through the user device 140, analyzes the recognized voice to extract the content name, and forwards the extracted content name to the content server 110.

The content server 110 transmits the content information corresponding to the content name received from the artificial intelligence server 130 (for example, the content name, singer, and playback time) to the user device 140.

When the analysis yields several content names, the artificial intelligence server 130 may transmit the list of matching content and its related information to the user device 140.

The user device 140 displays the content information received from the content server 110 on the screen of the screen output unit 145 and then starts playing the content.
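
The content use step can be sketched end to end as follows. The sketch assumes speech recognition has already turned the utterance into text, and it substitutes a simple substring match for whatever analysis the artificial intelligence server actually performs. The catalog entries, the "play " prefix, and all function names are hypothetical.

```python
# Hypothetical catalog held by the content server.
CATALOG = [
    {"content_name": "Spring Day", "singer": "Singer A", "playback_time": "3:35"},
    {"content_name": "Spring Rain", "singer": "Singer B", "playback_time": "4:02"},
    {"content_name": "Autumn Leaves", "singer": "Singer C", "playback_time": "2:58"},
]
PLAY_PREFIX = "play "


def extract_content_name(recognized_text):
    """AI server side: strip an assumed command prefix to get the content name."""
    text = recognized_text.strip()
    if text.lower().startswith(PLAY_PREFIX):
        text = text[len(PLAY_PREFIX):]
    return text.strip()


def find_content(content_name, catalog=CATALOG):
    """Content server side: return every entry whose name contains the query."""
    query = content_name.lower()
    return [item for item in catalog if query in item["content_name"].lower()]


def handle_utterance(recognized_text):
    """Decide what the user device should do with the search result."""
    name = extract_content_name(recognized_text)
    matches = find_content(name)
    if len(matches) == 1:
        return {"action": "play", "content": matches[0]}
    if matches:
        # Several names matched: send the list so the user can choose.
        return {"action": "choose", "candidates": matches}
    return {"action": "not_found", "query": name}


single = handle_utterance("play Autumn Leaves")
several = handle_utterance("play spring")
missing = handle_utterance("play winter")
```

The three calls exercise the three branches: a single match is played directly, multiple matches are returned as a list, and no match is reported back with the extracted query.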

The third step, service use, is the step in which the user uses a service through the user device 140. When the user gives a voice input to the user device 140 (that is, utters a specific word), the user device 140 activates the VA and switches to a standby state in which it can accept the user's voice. When the user then utters the name of the service to be searched for, the user device 140 transmits the uttered service name to the artificial intelligence server 130.

The artificial intelligence server 130 recognizes the voice uttered by the user through the user device 140, analyzes the recognized voice to extract the service name, and forwards the extracted service name to the service server 120.

The service server 120 transmits the service information corresponding to the service name received from the artificial intelligence server 130 (for example, the region name and hourly conditions for today's weather) to the user device 140.

When the analysis yields several service names, the artificial intelligence server 130 may transmit the list of matching services and its related information to the user device 140.

The user device 140 displays the service information received from the service server 120 on the screen of the screen output unit 145 and then starts running the service.

A concrete example of actual use of the artificial intelligence assistant service providing system according to the present invention, configured as above, is described with reference to FIG. 4. In this example, a desk view 145a used in the living room and a kitchen TV 145b used in the kitchen, connected by wire through a telecommunications carrier's IoT platform, serve as the screen output unit 145.

As shown in FIG. 5, the initial start screen of the desk view 145a displays a number of icons, including Live TV, with a microphone icon 451 at the lower right.

The desk view 145a operates as follows. As shown in FIG. 6a, after the user connects with a Naver ID and says 'Hey Clova', the desk view 145a shows the caption 'Yes, please tell me' on its screen and speaks the same prompt. When the user then says 'Play Naver NOW', the desk view 145a plays the 'Naver NOW' screen.

The kitchen TV 145b operates in the same way. After the user connects with a Naver ID and says 'Hey Clova', the kitchen TV 145b shows the caption 'Yes, please tell me' on its screen and speaks the same prompt. When the user then says 'Find a salad recipe', the kitchen TV 145b displays a screen listing multiple salad recipes, as shown in FIG. 6b, and the user can select the desired recipe and get ready to make the salad.

Likewise, using a screen output unit 145 such as the desk view 145a or the kitchen TV 145b, the user can conveniently use a variety of content such as recommended restaurants and weather information, as shown in FIG. 6c.

In addition, as shown in FIG. 6d, after the user connects with a Naver ID and says 'Hey Clova', voice commands can change the TV channel ('Play MBC'), change the volume ('Volume up' or 'Volume down'), select the radio ('Run the radio') and choose specifically between AM and FM, and also carry out functions such as 'Front door view' and 'Security office call'.
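
A minimal sketch of how such recognized commands could be routed to device actions is shown below. It uses the English renderings of the example phrases and exact or prefix matching, both of which are assumptions made for illustration; `route_command` and the action names are hypothetical and do not reflect the Clova platform's API.

```python
def route_command(utterance):
    """Map a recognized utterance to an (action, argument) pair."""
    text = utterance.strip()
    lowered = text.lower()
    if lowered.startswith("play "):
        # 'Play MBC' -> change to the named channel.
        return ("change_channel", text[len("play "):])
    if lowered == "volume up":
        return ("volume", 1)
    if lowered == "volume down":
        return ("volume", -1)
    if lowered == "run the radio":
        return ("radio", None)
    return ("unknown", text)


channel = route_command("Play MBC")
louder = route_command("Volume up")
radio = route_command("Run the radio")
other = route_command("Open the window")
```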

The embodiments of the present invention are not implemented only through the apparatus and/or operating method described above. They may also be implemented through a program that realizes functions corresponding to the configuration of the embodiments, or through a recording medium on which that program is recorded, and a person skilled in the art to which the present invention pertains can readily do so from the description of the embodiments above. Although the embodiments of the present invention have been described in detail, the scope of the present invention is not limited to them; various modifications and improvements made by those skilled in the art using the basic concept of the present invention as defined in the following claims also fall within its scope.

100: artificial intelligence assistant service providing system
110: content server
120: service server
130: artificial intelligence server
140: user device
141: external server interworking unit
142: control unit
143: storage unit
144: video signal processing unit
145: screen output unit
146: voice agent processing unit
461: content reproduction processing module
462: service reproduction processing module
463: external IoT processing module
147: audio processing unit
471: echo cancellation module
472: noise cancellation module
473: microphone voice input module
148: speaker output unit
150: network

Claims

An artificial intelligence assistant service providing system 100, comprising:
a content server 110 for providing content;
a service server 120 for providing a service;
an artificial intelligence server 130 for managing the content provided by the content server and the service provided by the service server by interworking them with an artificial intelligence assistant function, and for recognizing a user's voice so that content can be searched for and the service used;
a user device 140 for receiving the user's voice, transmitting it to the artificial intelligence server, searching for the content the user wants, and using the service; and
a network 150 for connecting the content server, the service server, the artificial intelligence server, and the user device to one another to transmit and receive data.

According to claim 1,
The content server 110,
Transmitting content information corresponding to the content name received from the artificial intelligence server 130 to the artificial intelligence server 130,
According to the content selection notified from the user device 140, the corresponding content is retrieved and transmitted to the user device 140,
The artificial intelligence assistant service providing system, characterized in that the content file including the searched content is transmitted to the user device (140) in real time so that the user can view it through the screen.

According to claim 1,
The user device 140 may receive the distributed content transmitted from the artificial intelligence server 130 and provide it to the user through the screen, and may receive the service transmitted from the artificial intelligence server 130 and provide it to the user through the screen,
The user device 140 includes an external server interworking unit 141 for transmitting and receiving data in conjunction with the artificial intelligence server 130 or the content server 110 or the service server 120;
a control unit 142 connected to the external server interworking unit 141 and controlling the operation of the user device 140; a storage unit 143 connected to the control unit 142 to store programs or data necessary for the control of the control unit;
an image signal processing unit 144 connected to the control unit 142 and performing signal processing on the image data received from the artificial intelligence server 130;
a screen output unit 145 for outputting and displaying the image data signal processed by the image signal processing unit 144 through a screen;
a voice agent processing unit 146 connected to the control unit 142, supporting multiple codecs, and selecting a service;
an audio processing unit 147 connected to the control unit 142, receiving a user's voice through a microphone, processing the audio, and transmitting the audio to the artificial intelligence server 130;
and a speaker output unit (148) connected to the control unit (142) to output audio data received from the artificial intelligence server (130).

According to claim 3,
The voice agent processing unit 146 includes a content reproduction processing module 461 for reproducing content, a service reproduction processing module 462 for reproducing services, and an external IoT processing module 463 for interworking with external IoT devices and processing their data,
The artificial intelligence assistant service providing system, characterized in that the audio processing unit 147 includes an echo cancellation module 471 for recognizing the audio generated by the user device 140 itself and removing the audio of the internal sound source using the principle that a sound is cancelled when its phase is inverted, a noise cancellation module 472 for keeping the voice clearer and cleaner by removing ambient noise picked up by the microphone, such as air conditioner sound, and a microphone voice input module 473 for receiving audio through a high-performance microphone.

According to claim 1,
The artificial intelligence server 130 may process a user response signal input from a voice user interface (VUI),
The artificial intelligence server 130 is provided with an artificial intelligence assistant platform,
The artificial intelligence assistant service providing system, characterized in that, in the artificial intelligence assistant platform, client authentication information, which is authentication information obtained by registering a client through the artificial intelligence assistant developer console, can be used when the client acquires an artificial intelligence assistant access token or when a user links his or her account while using a specific extension.

In the method of providing an artificial intelligence assistant service,
The method of providing the artificial intelligence assistant service,
providing content by a content server;
providing a service by the service server;
managing, by an artificial intelligence server, the content provided by the content server and the service provided by the service server by interworking with an artificial intelligence assistant function;
receiving, by the user device, the user's voice and transmitting it to the artificial intelligence server;
recognizing, by the artificial intelligence server, the user's voice so that content can be searched for and the service used; and
searching, by the user device, for the content the user wants and using the service.