KR20210015417A

KR20210015417A - System and method for providing service based on object detection

Info

Publication number: KR20210015417A
Application number: KR1020190094219A
Authority: KR
Inventors: 이채영
Original assignee: 주식회사 케이티
Priority date: 2019-08-02
Filing date: 2019-08-02
Publication date: 2021-02-10

Abstract

The present invention provides a system for providing a service based on object recognition to provide the convenience to a user. The system comprises: a secretary terminal receiving a voice command of a user and outputting a response to the voice command; a server receiving the voice command from the secretary terminal, detecting food from video content that a user is watching according to the voice command, and searching food name of the food and product information related to food ingredients to output the food name and product information to the secretary terminal; and a display device receiving the food name and product information from the secretary terminal and displaying the food name and product information.

Description

Object recognition-based service provision system and method {SYSTEM AND METHOD FOR PROVIDING SERVICE BASED ON OBJECT DETECTION}

본 발명은 객체 인식 기반 서비스 제공 시스템 및 방법에 관한 것으로, 더욱 상세하게는 사용자 맞춤별 상품 연계 서비스를 제공하는 시스템 및 방법에 관한 것이다.The present invention relates to a system and method for providing service based on object recognition, and more particularly, to a system and method for providing a product link service for each user.

최근 음성 인식 기술이 빠른 속도로 발전하고 있다. 이러한 음성 인식 기술을 통하여 방송, 영화, 음악, 인터넷, 쇼핑 등 다양한 분야에 걸쳐 사용자에게 많은 정보와 콘텐트 제공 등의 서비스가 시행되고 있다. Recently, speech recognition technology is developing at a rapid pace. Through such speech recognition technology, services such as providing a lot of information and contents to users in various fields such as broadcasting, movies, music, internet, and shopping are being implemented.

일 예로, 음성 인식 서비스를 이용하여 사용자는 쇼핑 의사를 표현하는 발화를 통하여 원하는 상품을 추천 받거나 구매할 수 있다.For example, using a voice recognition service, a user may receive or purchase a desired product through speech expressing a shopping intention.

그러나, 기존의 음성 인식 기반 서비스는, 각 상품마다 검색하여 쇼핑을 수행해야 하기 때문에, 관련된 상품들을 일괄적으로 구매하고자 하는 사용자에게 불편함을 주고 있는 실정이다. 또한, 사용자 마다 선호하는 상품이 다름에도 불구하고 획일적으로 상품을 추천하기 때문에 서비스 이용 효용성이 떨어지고 있는 실정이다.However, since the existing voice recognition-based service has to perform shopping by searching for each product, it is inconvenient to users who want to collectively purchase related products. In addition, even though the products they prefer are different for each user, the utility of service use is deteriorating because products are uniformly recommended.

본 발명은, 사용자 맞춤별로 상품을 추천하고, 사용자에게 편의성을 제공할 수 있는 객체 인식 기반 서비스 제공 시스템 및 방법을 제공하는 것을 목적으로 한다.An object of the present invention is to provide a system and method for providing an object recognition-based service capable of recommending products for each user and providing convenience to users.

본 발명에서 이루고자 하는 기술적 과제들은 이상에서 언급한 기술적 과제로 제한되지 않으며, 언급하지 않은 또 다른 기술적 과제들은 아래의 기재로부터 본 발명이 속하는 기술분야에서 통상의 지식을 가진 자에게 명확하게 이해될 수 있을 것이다.The technical problems to be achieved in the present invention are not limited to the technical problems mentioned above, and other technical problems that are not mentioned can be clearly understood by those of ordinary skill in the technical field to which the present invention belongs from the following description. There will be.

상술한 바와 같은 과제를 해결하기 위하여, 본 발명은, 사용자의 음성 명령을 입력 받고, 음성 명령에 대한 응답을 출력하는 비서 단말과, 비서 단말로부터 음성 명령을 입력 받으며, 음성 명령에 따라 사용자가 시청 중인 영상 컨텐츠에서 음식 객체를 검출하고, 음식 객체의 음식명 및 음식 재료와 관련된 상품 정보를 검색하여 이를 비서 단말에 출력하는 서버와, 비서 단말로부터 음식명 및 상품 정보를 입력 받아 이를 표시하는 디스플레이 장치를 포함하는 객체 인식 기반 서비스 제공 시스템을 제공한다.In order to solve the above-described problems, the present invention provides a secretary terminal that receives a user's voice command and outputs a response to the voice command, and receives a voice command from the secretary terminal, and is viewed by the user according to the voice command. A server that detects a food object in the video content being processed, searches for product information related to the food name and food material of the food object, and outputs it to the secretary terminal, and a display device that receives food name and product information from the secretary terminal and displays it It provides an object recognition-based service providing system including.

여기서, 서버는 사용자의 상품 구매 이력 또는 상품 검색 이력을 기반으로 상품 정보를 검색할 수 있다.Here, the server may search for product information based on the user's product purchase history or product search history.

또한, 상품 정보는 최저가 상품, 유기농 상품 및 레토르트 상품 정보를 포함하여 복수 개로 분류될 수 있다.In addition, product information may be classified into a plurality of items including information on the lowest price product, organic product, and retort product.

또한, 음성 명령은, 음식 객체 검출 명령과, 상품 정보에 포함된 상품 구매 또는 장바구니 담기 명령과, 음식 레시피 검출 명령 및 음식 재료 검출 명령 중 적어도 하나를 포함할 수 있다.In addition, the voice command may include at least one of a food object detection command, a product purchase or shopping cart addition command included in the product information, a food recipe detection command, and a food ingredient detection command.

또한, 서버는 음성 명령에 따라 상품 구매, 상품 장바구니 담기, 음식 레시피 검색 및 음식 재료 검색 중 적어도 하나를 수행하고 그 수행 결과를 비서 단말에 출력할 수 있다.In addition, the server may perform at least one of a product purchase, a product shopping cart, a food recipe search, and a food material search according to a voice command, and output the execution result to the secretary terminal.

또한, 디스플레이 장치는 비서 단말로부터 음성 명령에 따른 수행 결과를 입력 받아 이를 표시할 수 있다.Also, the display device may receive an execution result according to the voice command from the secretary terminal and display it.

또한, 서버는, 음식 영상 데이터를 학습하여 학습 모델을 생성하는 학습 모델 생성부와, 영상 컨텐츠에서 음식 객체를 검출하는 음식 객체 검출부와, 학습 모델에 음식 객체를 입력하여 음식 객체의 음식명을 검색하는 음식명 검색부를 포함할 수 있다.In addition, the server includes a learning model generation unit that generates a learning model by learning food image data, a food object detection unit that detects a food object from image content, and a food name of the food object by inputting the food object into the learning model. It may include a food name search unit.

여기서, 음식명 검색부는 음식명이 복수 개로 검색되는 경우 음식 객체 및 음식명의 매칭도에 따라 적어도 하나의 음식명을 비서 단말에 출력할 수 있다.Here, when a plurality of food names are searched, the food name search unit may output at least one food name to the secretary terminal according to a matching degree of the food object and the food name.

또한, 서버는 음성 명령에 따라, 음식 객체를 포함하는 음식 영상을 저장하고, 영상 컨텐츠 시청 중 또는 종료 후 음식 영상을 비서 단말에 출력할 수 있다.In addition, the server may store a food image including a food object according to a voice command, and output the food image to the secretary terminal during or after viewing the video content.

또한, 서버는 사용자의 상품 구매 이력 또는 상품 검색 이력을 저장하고, 상품 정보를 저장하는 데이터 베이스를 포함할 수 있다.In addition, the server may include a database for storing product purchase history or product search history of the user and storing product information.

또한, 본 발명은, 사용자가 디스플레이 장치를 통해 영상 컨테츠를 시청 중 비서 단말에 음성 명령을 발화하는 단계와, 비서 단말이 음성 명령을 입력 받아 이를 서버로 출력하는 단계와, 서버가 음성 명령에 따라 영상 컨텐츠에서 음식 객체를 검출하고, 음식 객체의 음식명 및 음식 재료와 관련된 상품 정보를 사용자의 상품 구매 이력 또는 상품 검색 이력을 기반으로 검색하여 이를 비서 단말에 출력하는 단계와, 디스플레이 장치가 비서 단말로부터 음식명 및 상품 정보를 입력 받아 이를 표시하는 단계와, 비서 단말이 음성 명령에 대한 응답을 출력하는 단계를 포함하는 객체 인식 기반 서비스 제공 방법을 제공한다.In addition, the present invention provides a step of uttering a voice command to a secretary terminal while a user is watching video content through a display device, a step of receiving the voice command from the secretary terminal and outputting the voice command to the server, and the server according to the voice command. Detecting a food object from the video content, searching for product information related to the food name and food material of the food object based on the user's product purchase history or product search history, and outputting it to the secretary terminal, and the display device is the secretary terminal Provides an object recognition-based service providing method comprising the steps of receiving food name and product information from and displaying them, and outputting a response to a voice command by a secretary terminal.

또한, 본 발명의 객체 인식 기반 서비스 제공 방법은, 사용자가 상품 정보에 포함된 상품 구매 또는 장바구니 담기 명령을 비서 단말에 발화하는 단계와, 비서 단말이 상품 정보에 포함된 상품 구매 또는 장바구니 담기 명령을 입력 받아 이를 서버로 출력하는 단계와, 서버가 상품 구매 또는 상품 장바구니 담기를 수행하고 그 수행 결과를 비서 단말에 출력하는 단계와, 디스플레이 장치가 비서 단말로부터 수행 결과를 입력 받아 이를 표시하는 단계를 더 포함할 수 있다.In addition, the object recognition-based service providing method of the present invention includes the steps of a user firing a command to purchase a product or add a shopping cart included in product information to a secretary terminal, and a command to purchase a product or add a shopping cart included in the product information by the secretary terminal. Receiving the input and outputting it to the server, the server performing a product purchase or adding a product shopping cart and outputting the execution result to the secretary terminal, and the display device receiving the execution result from the secretary terminal and displaying it. Can include.

본 발명에 따르면, 특정 음식과 관련된 상품들을 일괄적으로 한번에 구매할 수 있어 사용자에게 편의성을 제공할 수 있고, 사용자의 상품 선호 성향을 분석하여 사용자 맞춤별로 상품을 추천할 수 있는 효과가 있다.According to the present invention, it is possible to collectively purchase products related to a specific food at a time, thereby providing convenience to a user, and analyzing a user's product preference tendency to recommend products for each user.

본 발명에서 얻을 수 있는 효과는 이상에서 언급한 효과들로 제한되지 않으며, 언급하지 않은 또 다른 효과들은 아래의 기재로부터 본 발명이 속하는 기술분야에서 통상의 지식을 가진 자에게 명확하게 이해될 수 있을 것이다.The effects obtainable in the present invention are not limited to the above-mentioned effects, and other effects not mentioned can be clearly understood by those of ordinary skill in the art from the following description. will be.

도 1은 본 발명의 실시예에 따른 객체 인식 기반 서비스 제공 시스템의 블록도이다.
도 2는 본 발명의 실시예에 따른 비서 단말의 구체적인 블록도이다.
도 3은 본 발명의 실시예에 따른 서버의 구체적인 블록도이다.
도 4는 도 3의 제어부의 구체적인 블록도이다.
도 5는 영상 컨텐츠에서 음식 객체를 검출하는 방법을 설명하기 위한 도면이다.
도 6 및 도 7은 본 발명의 실시예에 따른 디스플레이 장치에 음식명 및 상품 정보가 표시되는 일례를 도시한 도면이다.
도 8은 본 발명의 실시예에 따른 디스플레이 장치에 음식명, 음식 레시피 및 상품 정보가 표시되는 일례를 도시한 도면이다.
도 9는 본 발명의 실시예에 따른 객체 인식 기반 서비스 제공 방법의 흐름도이다.1 is a block diagram of an object recognition-based service providing system according to an embodiment of the present invention.
2 is a detailed block diagram of a secretary terminal according to an embodiment of the present invention.
3 is a detailed block diagram of a server according to an embodiment of the present invention.
4 is a detailed block diagram of the control unit of FIG. 3.
5 is a diagram for describing a method of detecting a food object in video content.
6 and 7 are diagrams illustrating an example of displaying food names and product information on a display device according to an exemplary embodiment of the present invention.
8 is a diagram illustrating an example in which food name, food recipe, and product information are displayed on a display device according to an embodiment of the present invention.
9 is a flowchart of an object recognition-based service providing method according to an embodiment of the present invention.

이하, 본 발명에 따른 바람직한 실시 형태를 첨부된 도면을 참조하여 상세하게 설명한다. 첨부된 도면과 함께 이하에 개시될 상세한 설명은 본 발명의 예시적인 실시형태를 설명하고자 하는 것이며, 본 발명이 실시될 수 있는 유일한 실시형태를 나타내고자 하는 것이 아니다. 도면에서 본 발명을 명확하게 설명하기 위해서 설명과 관계없는 부분은 생략할 수 있고, 명세서 전체를 통하여 동일 또는 유사한 구성 요소에 대해서는 동일한 참조 부호를 사용할 수 있다.Hereinafter, preferred embodiments of the present invention will be described in detail with reference to the accompanying drawings. The detailed description to be disclosed hereinafter together with the accompanying drawings is intended to describe exemplary embodiments of the present invention, and is not intended to represent the only embodiments in which the present invention may be practiced. In the drawings, parts irrelevant to the description may be omitted in order to clearly describe the present invention, and the same reference numerals may be used for the same or similar components throughout the specification.

본 발명의 일 실시 예에서, “또는”, “적어도 하나” 등의 표현은 함께 나열된 단어들 중 하나를 나타내거나, 또는 둘 이상의 조합을 나타낼 수 있다. 예를 들어, “A 또는 B”, “A 및 B 중 적어도 하나”는 A 또는 B 중 하나만을 포함할 수 있고, A와 B를 모두 포함할 수도 있다.In an embodiment of the present invention, expressions such as "or" and "at least one" may represent one of words listed together, or a combination of two or more. For example, “A or B” and “at least one of A and B” may include only one of A or B, and may include both A and B.

도 1은 본 발명의 실시예에 따른 객체 인식 기반 서비스 제공 시스템의 블록도이다.1 is a block diagram of an object recognition-based service providing system according to an embodiment of the present invention.

도 1을 참조하면, 본 발명의 실시예에 따른 객체 인식 기반 서비스 제공 시스템은 디스플레이 장치(100), 비서 단말(200) 및 서버(300)를 포함할 수 있다.Referring to FIG. 1, an object recognition-based service providing system according to an embodiment of the present invention may include a display device 100, a secretary terminal 200, and a server 300.

디스플레이 장치(100)는, 비서 단말(200)을 통해 서버(300)로부터 영상 컨텐츠를 제공 받아 이를 표시할 수 있을 뿐만 아니라, 영상 컨텐츠 표시 중 시각 기반 서비스를 제공할 수 있다.The display apparatus 100 may receive and display image content from the server 300 through the secretary terminal 200 and may provide a time-based service during image content display.

이와 같은 디스플레이 장치(100)는, 액정 디스플레이(LCD; liquid crystal display), 발광 다이오드(LED; light emitting diode) 디스플레이, 유기 발광 다이오드(OLED; organic LED) 디스플레이, 마이크로 전자기계 시스템(MEMS; micro electro mechanical systems) 디스플레이 및 전자 종이(electronic paper) 디스플레이 등을 포함하나 이에 한정되지는 않는다.Such a display device 100 includes a liquid crystal display (LCD), a light emitting diode (LED) display, an organic light emitting diode (OLED) display, and a micro electromechanical system (MEMS). mechanical systems) displays and electronic paper displays, and the like.

비서 단말(200)은 네트워크(Network)를 통해 서버(300) 또는 디스플레이 장치(100)와 동시에 또는 시간 간격을 두고 연결될 수 있다.The assistant terminal 200 may be connected to the server 300 or the display device 100 at the same time or at a time interval through a network.

여기서, 네트워크는 근거리 통신망(LAN: Local Area Network), 광역 통신망(WAN: Wide Area Network), 인터넷 (WWW: World Wide Web), 유무선 데이터 통신망, 전화망, 유무선 텔레비전 통신망 등을 포함한다. 무선 데이터 통신망의 일례에는 3G, 4G, 5G, 3GPP(3rd Generation Partnership Project), LTE(Long Term Evolution), WIMAX(World Interoperability for Microwave Access), 와이파이(Wi-Fi), 블루투스 통신, 적외선 통신, 초음파 통신, 가시광 통신(VLC: Visible Light Communication), 라이파이(LiFi) 등을 포함하나 이에 한정되지는 않는다.Here, the network includes a local area network (LAN), a wide area network (WAN), the Internet (WWW), a wired/wireless data communication network, a telephone network, a wired/wireless television communication network, and the like. Examples of wireless data networks include 3G, 4G, 5G, 3rd Generation Partnership Project (3GPP), Long Term Evolution (LTE), World Interoperability for Microwave Access (WIMAX), Wi-Fi, Bluetooth communication, infrared communication, and ultrasound. Communication, visible light communication (VLC), LiFi, and the like, but are not limited thereto.

비서 단말(200)은 사용자의 음성 명령을 입력 받고, 입력 받은 음성 명령에 대한 응답을 출력한다. 예를 들어, 사용자가 "기가지니"라는 음성을 발화하면, 비서 단말(200)은 "기가지니"라는 호출어를 인식하고, 인식 결과에 따라 명령어 대기 UI를 노출시킬 수 있다. 그 후, 사용자가 "티비켜"라는 음성을 발화하면, 비서 단말(200)은 "티비켜"라는 음성 명령을 인식하고, 서버(300)로 음성 명령에 대한 분석을 요청할 수 있다. 그리고, 비서 단말(200)은 서버(300)로부터 음성 명령에 대한 분석을 수신하고, 분석 결과에 따라 디스플레이 장치(100)의 전원을 켜도록 제어할 수 있다.The secretary terminal 200 receives a user's voice command and outputs a response to the received voice command. For example, when the user utters the voice "Gi Genie", the secretary terminal 200 may recognize the call word "Gi Genie" and expose a command waiting UI according to the recognition result. Thereafter, when the user utters a voice “Tove on”, the secretary terminal 200 may recognize the voice command “Tove on” and request the server 300 to analyze the voice command. In addition, the secretary terminal 200 may receive an analysis of a voice command from the server 300 and control the display apparatus 100 to be powered on according to the analysis result.

다른 예를 들어, 사용자가 "홍길동이 누구야"라는 음성을 발화하면, 비서 단말(200)은 "홍길동이 누구야?"라는 음성 명령을 인식하고, 서버(300)로 음성 명령에 대한 분석을 요청할 수 있다. 그리고, 비서 단말(200)은 서버(300)로부터 음성 명령에 대한 분석을 수신하고, 분석 결과에 따라 "홍길동은 OO입니다."라는 발화를 할 수 있고, 디스플레이 장치(100)를 통해 "홍길동은 OO입니다."라는 문자를 표시할 수 있다.For another example, when the user utters the voice "Who is Hong Gil-dong", the secretary terminal 200 recognizes the voice command "Who is Hong Gil-dong?", and requests the server 300 to analyze the voice command. have. And, the secretary terminal 200 receives the analysis of the voice command from the server 300, according to the analysis result, "Kil-dong Hong is OO" can utter, through the display device 100, "Kil-dong Hong is OO” can be displayed.

이와 같이 비서 단말(200)은, 비서 단말(200)과 연결된 디스플레이 장치(100)를 통해 시각 기반 서비스를 제공하고, 자체 음성 신호 입출력부를 통해 음성 기반 서비스를 제공할 수 있다. In this way, the secretary terminal 200 may provide a time-based service through the display device 100 connected to the secretary terminal 200 and may provide a voice-based service through its own voice signal input/output unit.

또한, 비서 단말(200)은 셋탑 박스를 포함할 수 있으며, 이 셋탑 박스를 통해 서버(300)로부터 영상 컨텐츠를 제공받을 수 있고, 제공 받은 영상 컨텐츠를 디스플레이 장치(100)에 출력하여 사용자가 디스플레이 장치(100)를 통해 영상 컨테츠를 시청할 수 있도록 할 수 있다.In addition, the assistant terminal 200 may include a set-top box, through which video content can be provided from the server 300, and the received video content is output to the display device 100 to be displayed by the user. The video content can be viewed through the device 100.

도 2는 본 발명의 실시예에 따른 비서 단말의 구체적인 블록도이다.2 is a detailed block diagram of a secretary terminal according to an embodiment of the present invention.

도 2를 참조하면, 비서 단말(200)은 통신부(210), 출력부(220), 입력부(230), 메모리(240) 및 제어부(250)를 포함할 수 있다.Referring to FIG. 2, the secretary terminal 200 may include a communication unit 210, an output unit 220, an input unit 230, a memory 240, and a control unit 250.

통신부(210)는 근거리 통신 모듈 또는 유무선 통신 모듈을 포함할 수 있다. 여기서, 근거리 통신 모듈은 근거리 통신(Short range communication)을 위한 것으로서, 블루투스(Bluetooth™), RFID(Radio Frequency Identification), 적외선 통신(Infrared Data Association; IrDA), UWB(Ultra Wideband), ZigBee, NFC(Near Field Communication), Wi-Fi(Wireless-Fidelity), Wi-Fi Direct, Wireless USB(Wireless Universal Serial Bus) 기술 중 적어도 하나를 이용하여, 근거리 통신을 지원할 수 있다.The communication unit 210 may include a short-range communication module or a wired/wireless communication module. Here, the short range communication module is for short range communication, and includes Bluetooth™, Radio Frequency Identification (RFID), Infrared Data Association (IrDA), Ultra Wideband (UWB), ZigBee, and NFC ( Near Field Communication), Wi-Fi (Wireless-Fidelity), Wi-Fi Direct, and Wireless Universal Serial Bus (USB) technologies may be used to support short-range communication.

출력부(220)는 시각, 청각 또는 촉각 등과 관련된 출력을 발생시키기 위한 것으로, 디스플레이부, 음향 출력부, 햅팁 모듈, 광 출력부 중 적어도 하나를 포함할 수 있다.The output unit 220 is for generating an output related to visual, auditory or tactile sense, and may include at least one of a display unit, an audio output unit, a hap tip module, and a light output unit.

음향 출력부는 메모리(240)에 저장된 오디오 데이터를 출력할 수 있다. 음향 출력부는 비서 단말(200)에서 수행되는 기능과 관련된 음향 신호를 출력하기도 한다. 이러한 음향 출력부에는 스피커(speaker) 및 버저(buzzer) 등이 포함될 수 있다.The sound output unit may output audio data stored in the memory 240. The sound output unit also outputs sound signals related to functions performed by the secretary terminal 200. The sound output unit may include a speaker and a buzzer.

입력부(230)는 오디오 신호 입력을 위한 마이크로폰(microphone)을 포함할 수 있다. 마이크로폰은 외부의 음향 신호를 전기적인 음성 데이터로 처리한다. 처리된 음성 데이터는 비서 단말(200)에서 수행 중인 기능(또는 실행 중인 응용 프로그램)에 따라 다양하게 활용될 수 있다. 한편, 마이크로폰에는 외부의 음향 신호를 입력 받는 과정에서 발생되는 잡음(noise)을 제거하기 위한 다양한 잡음 제거 알고리즘이 구현될 수 있다.The input unit 230 may include a microphone for inputting an audio signal. The microphone processes external sound signals into electrical voice data. The processed voice data may be used in various ways according to a function (or an application program being executed) being executed by the secretary terminal 200. Meanwhile, various noise removal algorithms may be implemented in the microphone to remove noise generated in the process of receiving an external sound signal.

메모리(240)는 비서 단말(200)의 다양한 기능을 지원하는 데이터를 저장한다. 메모리(240)는 비서 단말(200)에서 구동되는 응용 프로그램(application program 또는 애플리케이션(application)), 비서 단말(200)의 동작을 위한 데이터들, 명령어들을 저장할 수 있다.The memory 240 stores data supporting various functions of the secretary terminal 200. The memory 240 may store an application program or application driven by the assistant terminal 200, data for the operation of the assistant terminal 200, and instructions.

제어부(250)는 메모리(240)에 저장된 응용 프로그램과 관련된 동작과, 통상적으로 비서 단말(200)의 전반적인 동작을 제어한다. 나아가 제어부(250)는 이하에서 설명되는 다양한 실시 예들을 본 발명에 따른 비서 단말(200) 상에서 구현하기 위하여, 위에서 살펴본 구성 요소들을 중 적어도 하나를 조합하여 제어할 수 있다.The controller 250 controls an operation related to an application program stored in the memory 240 and, in general, an overall operation of the secretary terminal 200. Furthermore, in order to implement various embodiments described below on the secretary terminal 200 according to the present invention, the controller 250 may control by combining at least one of the above-described components.

제어부(250)는 음성 인식 모듈(255)을 더 포함할 수 있다. 음성 인식 모듈(255)은 음성 인식 알고리즘이 적용된 음성 인식 엔진을 구동하여 마이크로폰을 통해 입력된 외부 음성을 인식한다.The control unit 250 may further include a voice recognition module 255. The voice recognition module 255 drives a voice recognition engine to which a voice recognition algorithm is applied to recognize an external voice input through a microphone.

즉, 음성 인식 모듈(255)은 마이크로폰을 통해 입력되는 외부 음성을 디지털 데이터로 변환하고, 상기 변환된 디지털 데이터를 증폭(Pre-emphasis)한 후, 디지털 변환된 음성의 시작 지점과 끝 지점을 검출한다. 이어서, 음성 인식 모듈(255)은 검출한 시작 지점과 끝 지점 사이의 음성에 대한 음성 특징값들을 추출하여 고유의 음성 또는 음색을 인식한다.That is, the voice recognition module 255 converts the external voice input through the microphone into digital data, amplifies the converted digital data (Pre-emphasis), and then detects the start and end points of the digitally converted voice. do. Subsequently, the voice recognition module 255 extracts voice feature values for the voice between the detected start point and the end point to recognize a unique voice or tone.

한편, 본 실시 예에서는, 음성 인식 모듈(255)이 제어부(250) 내에 구현되는 것을 예시하고 있으나 이를 제한하지는 않으며, 제어부(250)와 독립적으로 구성될 수 있음은 당업자에게 자명할 것이다.Meanwhile, in the present embodiment, although the speech recognition module 255 is exemplified in the controller 250, it is not limited thereto, and it will be apparent to those skilled in the art that the voice recognition module 255 may be configured independently of the controller 250.

서버(300)는, 비서 단말(200)에 영상 컨텐츠를 제공할 수 있다. 또한, 서버(300)는 비서 단말(200)로부터 사용자의 음성 명령을 입력 받고 이 음성 명령에 따른 응답을 비서 단말(200)로 출력할 수 있다.The server 300 may provide video content to the secretary terminal 200. In addition, the server 300 may receive a user's voice command from the secretary terminal 200 and output a response according to the voice command to the secretary terminal 200.

도 3은 본 발명의 실시예에 따른 서버의 구체적인 블록도이다.3 is a detailed block diagram of a server according to an embodiment of the present invention.

도 3을 참조하면, 본 발명의 실시예에 따른 서버(300)는 통신부(310), 제어부(320) 및 데이터 베이스(330)를 포함할 수 있다.Referring to FIG. 3, a server 300 according to an embodiment of the present invention may include a communication unit 310, a control unit 320, and a database 330.

통신부(310)는, 비서 단말(200)과 통신을 수행하기 위한 구성으로서, 유선 통신을 지원하기 위한 통신 모듈과, 무선 통신을 지원하기 위한 이동 통신 모듈을 포함할 수 있다. 여기서, 이동 통신 모듈은, 이동 통신을 위한 기술 표준들 또는 통신 방식(예를 들어, GSM(Global System for Mobile communication), CDMA(Code Division Multi Access), CDMA2000(Code Division Multi Access 2000), EVDO(Enhanced Voice-Data Optimized or Enhanced Voice-Data Only), WCDMA(Wideband CDMA), HSDPA(High Speed Downlink Packet Access), HSUPA(High Speed Uplink Packet Access), LTE(Long Term Evolution), LTE-A(Long Term Evolution-Advanced) 등)에 따라 구축된 이동 통신망 상에서 기지국 및 외부의 단말 중 적어도 하나와 무선 신호를 송수신한다.The communication unit 310 is a component for performing communication with the secretary terminal 200, and may include a communication module for supporting wired communication and a mobile communication module for supporting wireless communication. Here, the mobile communication module includes technical standards or communication methods for mobile communication (eg, Global System for Mobile Communication (GSM), Code Division Multi Access (CDMA)), Code Division Multi Access 2000 (CDMA2000), and EVDO ( Enhanced Voice-Data Optimized or Enhanced Voice-Data Only), WCDMA (Wideband CDMA), HSDPA (High Speed Downlink Packet Access), HSUPA (High Speed Uplink Packet Access), LTE (Long Term Evolution), LTE-A (Long Term) Evolution-Advanced), etc.), transmits and receives radio signals with at least one of a base station and an external terminal on a mobile communication network.

데이터 베이스(330)는, 사용자의 상품 구매 이력 또는 상품 검색 이력을 저장하고, 음식 재료와 관련된 상품 정보를 저장한다. 여기서, 상품 구매 이력은 비서 단말(200)을 이용하여 상품을 구매한 이력이고, 상품 검색 이력은 비서 단말(200)을 이용하여 상품을 검색한 이력일 수 있다.The database 330 stores a user's product purchase history or product search history, and stores product information related to food ingredients. Here, the product purchase history may be a history of purchasing a product using the secretary terminal 200, and the product search history may be a history of searching for a product using the secretary terminal 200.

제어부(320)는, 서버(300)의 전반적인 동작을 제어하며, 비서 단말(200)로부터 입력 받은 사용자의 음성 명령에 따라 사용자가 시청 중인 영상 컨텐츠에서 음식 객체를 검출하고, 음식 객체의 음식명 및 음식 재료와 관련된 상품 정보를 검색한다. 이 때, 제어부(320)는 사용자의 상품 구매 이력 또는 상품 검색 이력을 기반으로 상품 정보를 검색할 수 있다.The controller 320 controls the overall operation of the server 300, detects a food object from the video content being viewed by the user according to the user's voice command input from the secretary terminal 200, and detects the food name of the food object and Search for product information related to food ingredients. In this case, the controller 320 may search for product information based on the user's product purchase history or product search history.

이와 같이, 제어부(320)에 의해 검색된 상품 정보는 통신부(310)를 통해 비서 단말(200)로 출력된다.In this way, product information searched by the control unit 320 is output to the secretary terminal 200 through the communication unit 310.

도 4는 도 3의 제어부의 구체적인 블록도이고, 도 5는 영상 컨텐츠에서 음식 객체를 검출하는 방법을 설명하기 위한 도면이다.FIG. 4 is a detailed block diagram of the controller of FIG. 3, and FIG. 5 is a diagram illustrating a method of detecting a food object from image content.

도 4를 참조하면, 제어부(320)는, 학습 모델 생성부(321), 음식 객체 검출부(322), 음식명 검색부(323), 상품 분류부(324) 및 상품 정보 검색부(325)를 포함할 수 있다.4, the controller 320 includes a learning model generation unit 321, a food object detection unit 322, a food name search unit 323, a product classification unit 324, and a product information search unit 325. Can include.

학습 모델 생성부(321)는 음식 영상 학습 데이터를 학습하여 학습 모델(예컨대, Darknet YOLO 모델)을 생성한다. 여기서, 음식 영상 학습 데이터는 Food 101 dataset과, google 및 naver와 같은 각종 포털 사이트에서 이미지 크롤링하여 생성될 수 있다.The learning model generation unit 321 generates a learning model (eg, a Darknet YOLO model) by learning food image learning data. Here, the food image learning data may be generated by image crawling on the Food 101 dataset and various portal sites such as google and naver.

도 5를 참조하면, 음식 객체 검출부(322)는, 사용자가 디스플레이 장치(100)를 통해 영상 컨텐츠 시청 중 비서 단말(200)에 음성 명령(예컨대, TV 속 음식이 뭐야?)을 하게 되면, 시청 중인 영상 컨텐츠(10)에서 음식 객체(10a)를 검출한다.Referring to FIG. 5, when a user makes a voice command (eg, what food is on TV?) to the secretary terminal 200 while a user is watching video content through the display device 100, the food object detection unit 322 is viewed, and the food object detection unit 322 is viewed. The food object 10a is detected from the video content 10 being processed.

음식명 검색부(323)은 학습 모델에 음식 객체 검출부(322)에 의해 검출된 음식 객체(10a)를 입력하여 음식 객체(10a)의 음식명(예컨대, 제육볶음)을 검색한다.The food name search unit 323 searches for a food name (eg, fried pork chop) of the food object 10a by inputting the food object 10a detected by the food object detection unit 322 into the learning model.

한편, 음식명 검색부(323)는, 음식명이 복수 개로 검색되는 경우 음식 객체 및 음식명의 매칭도에 따라 적어도 하나의 음식명을 비서 단말(200)에 출력할 수 있다. 예를 들어, 시각적으로 유사한 음식이 복수 개가 있어 음식 객체에 대한 정확한 음식명 검색이 어려울 경우(예컨대, 제육 볶음, 순대 볶음 및 낙지 볶음), 그 매칭도가 높은 상위 3개의 음식명을 비서 단말(200)에 출력할 수 있고, 사용자는 이들 음식명 중 하나를 음성 명령으로 선택할 수 있다.Meanwhile, when a plurality of food names are searched, the food name search unit 323 may output at least one food name to the secretary terminal 200 according to a matching degree of the food object and the food name. For example, when there are a plurality of visually similar foods and it is difficult to search for an accurate food name for a food object (e.g., stir-fried jeyuk, stir-fried sundae, and stir-fried octopus), the names of the top three foods with high matching degree are selected from the secretary terminal ( 200), and the user can select one of these food names by voice command.

도 6 및 도 7은 본 발명의 실시예에 따른 디스플레이 장치에 음식명 및 상품 정보가 표시되는 일례를 도시한 도면이고, 도 8은 본 발명의 실시예에 따른 디스플레이 장치에 음식명, 음식 레시피 및 상품 정보가 표시되는 일례를 도시한 도면이다.6 and 7 are views showing an example in which food names and product information are displayed on a display device according to an embodiment of the present invention, and FIG. 8 is a diagram illustrating a food name, food recipe, and information on a display device according to an embodiment of the present invention. It is a diagram showing an example in which product information is displayed.

상품 분류부(324)는 음식의 재료와 관련된 상품 정보를 최저가 상품, 유기농 상품 및 레토르트 상품 정보를 포함하여 복수 개로 분류할 수 있다.The product classification unit 324 may classify product information related to food ingredients into a plurality of items, including information on the lowest price product, organic product, and retort product.

상품 정보 검색부(325)는 음식명 검색부(323)가 검색한 음식의 재료와 관련된 상품 정보를 검색한다. 그리고, 상품 정보 검색부(325)가 검색한 상품 정보는 통신부(310)를 통해 비서 단말(200)로 출력한다.The product information search unit 325 searches for product information related to the ingredients of the food searched by the food name search unit 323. Then, the product information searched by the product information search unit 325 is output to the secretary terminal 200 through the communication unit 310.

도 6 및 도 7을 참조하면, 디스플레이 장치(100)는 비서 단말(200)로부터 음식명(예컨대, 부대찌개) 및 상품 정보를 입력 받아 이를 표시한다. 그리고, 사용자는 디스플레이 장치(100)에 표시된 상품에 대한 구매 명령(일괄 구매 또는 개별 구매)을 비서 단말(200)에 발화하면, 비서 단말(200)은 상품 구매 명령을 서버(300)에 출력하고, 서버(300)는 해당 상품 구매를 수행하고, 그 수행 결과를 비서 단말(200)에 출력한다. 그리고, 디스플레이 장치(100)는 비서 단말(200)로부터 상품 구매 명령에 따른 수행 결과를 입력 받아 이를 표시한다.6 and 7, the display device 100 receives food name (eg, bag stew) and product information from the secretary terminal 200 and displays it. In addition, when the user utters a purchase command (batch purchase or individual purchase) for a product displayed on the display device 100 to the secretary terminal 200, the secretary terminal 200 outputs the product purchase command to the server 300 , The server 300 purchases a corresponding product and outputs the result of the execution to the secretary terminal 200. In addition, the display apparatus 100 receives the execution result according to the product purchase command from the secretary terminal 200 and displays it.

이와 같이, 본 발명의 실시예에 따른 객체 인식 기반 서비스 제공 시스템은, 특정 음식과 관련된 상품들을 일괄적으로 한번에 구매할 수 있어 사용자에게 편의성을 제공할 수 있다.As described above, the object recognition-based service providing system according to an exemplary embodiment of the present invention can provide convenience to a user since products related to a specific food can be collectively purchased at a time.

특히, 상품 정보 검색부(325)는 사용자의 상품 구매 이력 또는 상품 검색 이력을 기반으로 상품 정보를 검색할 수 있다. 예를 들어, 도 6을 참조하면, 비서 단말(200)을 통해 유기농 상품을 주로 구매한 사용자의 경우 상품 분류부(324)에 의해 분류된 상품 중 유기농 상품 정보를 검색하여 출력하고, 도 7을 참조하면, 레토르트 상품을 주로 구매한 사용자의 경우 상품 분류부(324)에 의해 분류된 상품 중 레토르트 상품 정보를 검색하여 출력할 수 있다. 그리고, 최저가 상품을 주로 구매한 사용자의 경우 상품 분류부(324)에 의해 분류된 상품 중 최저가 상품 정보를 검색하여 출력할 수 있다.In particular, the product information search unit 325 may search for product information based on the user's product purchase history or product search history. For example, referring to FIG. 6, in the case of a user who mainly purchases organic products through the secretary terminal 200, organic product information among products classified by the product classification unit 324 is searched and output, and FIG. 7 For reference, in the case of a user who mainly purchases a retort product, retort product information may be searched and output from among the products classified by the product classification unit 324. In the case of a user who mainly purchases the lowest price product, information about the lowest price product among the products classified by the product classification unit 324 may be searched and output.

좀 더 구체적으로, 상품 정보 검색부(325)는 사용자가 유기농 상품을 선호하는 것으로 판단되면, 음식 조리에 필요한 모든 재료에 대해 사용자가 자주 구매한 상표가 부착된 유기농 상품을 우선적으로 검색하여 출력할 수 있다. More specifically, when it is determined that the user prefers organic products, the product information search unit 325 first searches and outputs organic products with trademarks frequently purchased by the user for all ingredients required for food cooking. I can.

또한, 상품 정보 검색부(325)는 사용자가 유기농 최저가 상품을 선호하는 것으로 판단되면, 음식 조리에 필요한 모든 재료에 대해 최저가 유기농 상품 우선적으로 검색하여 출력할 수 있고, 이 때, 동일가 상품이 2개 이상인 경우 더 자주 구매한 상표가 부착된 상품을 우선적으로 검색하여 출력할 수 있다.In addition, when it is determined that the user prefers the lowest organic product, the product information search unit 325 may preferentially search and output the lowest organic product for all ingredients required for food cooking, and at this time, two products of the same price If this is the case, products with trademarks purchased more often can be searched and printed first.

또한, 상품 정보 검색부(325)는 사용자가가 레토르트 상품을 선호하는 것으로 판단되면, 사용자가 자주 구매한 상표가 부착된 레토르트 상품을 우선적으로 검색하여 출력할 수 있다.In addition, when it is determined that the user prefers the retort product, the product information search unit 325 may preferentially search and output a retort product with a trademark that the user has frequently purchased.

또한, 상품 정보 검색부(325)는 사용자가 레토르트 최저가 상품을 선호하는 것으로 판단되면, 최저가 레토르트 상품을 우선적으로 검색하여 출력할 수 있고, 이 때, 동일가 상품이 2개 이상인 경우 더 자주 구매한 상표가 부착된 상품을 우선적으로 검색하여 출력할 수 있다.In addition, when it is determined that the user prefers the lowest price retort product, the product information search unit 325 may preferentially search and output the lowest priced retort product. In this case, when there are two or more products of the same price, the trademark purchased more often You can preferentially search and print the products with is attached.

또한, 상품 정보 검색부(325)는 사용자가 최저가 상품을 선호하는 것으로 판단되면, 음식 조리에 필요한 모든 재료에 대해 최저가 상품을 우선적으로 검색하여 출력할 수 있고, 이 때, 동일가 상품이 2개 이상인 경우 더 자주 구매한 상표가 부착된 상품을 우선적으로 검색하여 출력할 수 있다.In addition, when it is determined that the user prefers the lowest price product, the product information search unit 325 may preferentially search and output the lowest price product for all ingredients required for food cooking. In this case, products with trademarks purchased more often can be searched and printed first.

이와 같이, 본 발명에 실시예에 따른 객체 인식 기반 서비스 제공 시스템은, 사용자의 상품 선호 성향을 분석하여 사용자 맞춤별로 상품을 추천할 수 있다.As described above, the object recognition-based service providing system according to an embodiment of the present invention may analyze a user's product preference tendency to recommend a product for each user.

데이터 베이스(330)는 사용자의 상품 구매 이력 또는 상품 검색 이력을 저장하고, 음식 재료와 관련된 상품 정보를 저장한다. 이와 같이, 데이터 베이스(330)에 저장된 정보들은, 제어부(320)가 음식 객체의 음식명 및 상품 정보를 검색하는데 이용된다.The database 330 stores a user's product purchase history or product search history, and stores product information related to food ingredients. In this way, the information stored in the database 330 is used by the control unit 320 to search food names and product information of the food object.

본 발명의 일 실시예로서, 사용자는 디스플레이 장치(100)에 표시된 상품에 대한 장바구니 담기 명령(일괄 담기 또는 개별 담기)을 비서 단말(200)에 발화할 수 있다. 이 경우, 비서 단말(200)은 상품 장바구니 담기 명령을 서버(300)에 출력하고, 서버(300)는 해당 상품 장바구니 담기를 수행하고, 그 수행 결과를 비서 단말(200)에 출력한다. 그리고, 디스플레이 장치(100)는 비서 단말(200)로부터 상품 장바구니 담기 명령에 따른 수행 결과를 입력 받아 이를 표시한다. 이 후, 사용자의 선택에 따라 장바구니에 담긴 상품 구매가 수행될 수 있다.As an embodiment of the present invention, the user may utter a command to add a shopping cart (add collectively or individually) to the product displayed on the display device 100 to the secretary terminal 200. In this case, the secretary terminal 200 outputs a command to add a product shopping cart to the server 300, the server 300 performs the product shopping cart addition, and outputs the execution result to the secretary terminal 200. In addition, the display device 100 receives the execution result according to the command to add the product shopping cart from the secretary terminal 200 and displays the result. Thereafter, the purchase of products in the shopping cart may be performed according to the user's selection.

도 8을 참조하면, 본 발명의 다른 실시예로서, 사용자는 디스플레이 장치(100)에 표시된 음식명에 해당하는 음식 레시피 검출 명령 및 음식 재료 검출 명령을 비서 단말(200)에 발화할 수 있다. 이 경우, 비서 단말(200)은 음식 레시피 검출 명령 및 음식 재료 검출 명령을 서버(300)에 출력하고, 서버(300)는 음식 레시피 검색 및 음식 재료 검색을 수행하고, 그 수행 결과를 비서 단말(200)에 출력한다. 그리고, 디스플레이 장치(100)는 비서 단말(200)로부터 음식 레시피 검출 명령 및 음식 재료 검출 명령에 따른 수행 결과 즉, 해당 음식의 레시피 및 재료를 입력 받아 이를 표시한다.Referring to FIG. 8, as another embodiment of the present invention, a user may ignite a food recipe detection command and a food ingredient detection command corresponding to a food name displayed on the display device 100 to the secretary terminal 200. In this case, the secretary terminal 200 outputs a food recipe detection command and a food material detection command to the server 300, and the server 300 performs a food recipe search and food material search, and the result of the execution is sent to the secretary terminal ( 200). In addition, the display apparatus 100 receives the food recipe detection command and the execution result according to the food material detection command from the secretary terminal 200, that is, the recipe and ingredients of the food and displays them.

사용자가 디스플레이 장치(100)를 통해 영상 컨텐츠 시청 중 관심있는 음식 객체가 표시되면 해당 영상에 대한 저장 명령을 비서 단말(200)에 발화할 수 있다. 이 경우, 비서 단말(200)은 영상 저장 명령을 서버(300)에 출력하고, 서버(300)는 사용자의 음성 명령에 따라, 음식 객체를 포함하는 음식 영상을 저장하고, 영상 컨텐츠 시청 중 또는 종료 후 사용자의 음성 명령에 따라 저장된 음식 영상을 비서 단말(200)에 출력할 수 있다. 그리고, 비서 단말(200)은 입력 받은 음식 영상을 디스플레이 장치(100)에 출력하고, 디스플레이 장치(100)는 음식 영상을 표시할 수 있다.When a food object of interest is displayed while the user is watching video content through the display device 100, a storage command for the video may be issued to the secretary terminal 200. In this case, the secretary terminal 200 outputs an image storage command to the server 300, and the server 300 stores a food image including a food object according to the user's voice command, and while viewing or ending the video content Then, the stored food image may be output to the secretary terminal 200 according to the user's voice command. In addition, the secretary terminal 200 may output the received food image to the display device 100, and the display device 100 may display the food image.

아래의 표 1은 비서 단말이 사용자에게 서비스를 제공하는 여러 상황들을 도시한 표이다.Table 1 below is a table showing various situations in which a secretary terminal provides a service to a user.

서비스 내용Service contents 대화 예시Conversation example 상품 구매Product purchase 사용자: TV 속 저 음식이 뭐야?
비서 단말: 제육볶음입니다.
사용자: 제육볶음 관련 상품 정보를 보여줘.
비서 단말: 제육볶음 관련 상품입니다. 일괄 구매를 원하시면 일괄 구매를, 개별 구매를 원하시면 *번 구매해줘라고 말씀해 주세요.
사용자: 일괄 구매해죠.
비서 단말: 제육볶음 관련 상품을 일괄 구매했습니다. User : What is that food on the TV?
Secretary's Terminal : This is Jeyuk-bokkeum.
User : Show product information related to Jeyuk-bokkeum.
Secretarial Terminal : This is a product related to Jeyuk-bokkeum. If you want to purchase in bulk, please tell us to purchase in bulk, if you want to purchase individually, please purchase *times.
User : I buy in bulk.
Secretary's Terminal : I bought pork-bokkeum related products in bulk. 상품 장바구니 담기Product Add to Cart 사용자: TV 속 저 음식이 뭐야?비서 단말: 제육볶음입니다.
사용자: 제육볶음 관련 상품 정보를 보여줘.
비서 단말: 제육볶음 관련 상품입니다.
사용자: 장바구니 담아죠.
비서 단말: 제육볶음 관련 상품을 장바구니에 담았습니다. User : What is that food on the TV? Secretary's Terminal : This is Jeyuk-bokkeum.
User : Show product information related to Jeyuk-bokkeum.
Secretarial Terminal : This is a product related to Jeyuk-bokkeum.
User : Add to cart.
Secretary's Terminal : The product related to stir-fried pork is put in the shopping cart. 레시피 및 재료 안내Recipe and Ingredient Guide 사용자: TV 속 저 음식이 뭐야?비서 단말: 제육볶음입니다.
사용자: 제육볶음 레시피 및 재료 보여줘.
비서 단말: 제육볶음 레시피 및 재료입니다. User : What is that food on the TV? Secretary's Terminal : This is Jeyuk-bokkeum.
User : Show me the recipe and ingredients for stir-fried pork.
Secretarial Terminal : Recipes and ingredients for pork stir-fry. 음식 영상 메모Food video memo 사용자: TV 속 저 음식이 뭐야?비서 단말: 제육볶음입니다.
사용자: 제육볶음 영상 메모해줘.
비서 단말: 제육볶음 영상을 저장합니다.
사용자: 제육볶음 영상 장면으로 이동해줘.
비서 단말: 제육볶음 영상입니다. User : What is that food on the TV? Secretary's Terminal : This is Jeyuk-bokkeum.
User : Please take note of the video of Jeyuk-bokkeum.
Secretary terminal : Save the video of stir-fried pork.
User : Go to the video scene of Jeyuk-bokkeum.
Secretary's Terminal : This is a video of Jeyuk-bokkeum.

상기 표 1에 도시한 바와 같이, 사용자와 비서 단말(200) 간 대화를 통해 사용자에게 객체 인식 기반으로 다양한 서비스를 제공할 수 있다.As shown in Table 1, various services may be provided to the user based on object recognition through a conversation between the user and the assistant terminal 200.

도 9는 본 발명의 실시예에 따른 객체 인식 기반 서비스 제공 방법의 흐름도이다.9 is a flowchart of an object recognition-based service providing method according to an embodiment of the present invention.

이하, 도 1 내지 도 9를 참조하여 본 발명의 실시예에 따른 객체 인식 기반 서비스 제공 방법을 설명하되, 전술한 본 발명의 실시예에 따른 객체 인식 기반 서비스 제공 시스템과 동일한 내용은 생략하겠다.Hereinafter, a method for providing an object recognition-based service according to an embodiment of the present invention will be described with reference to FIGS. 1 to 9, but the same contents as in the object recognition-based service providing system according to the embodiment of the present invention will be omitted.

먼저, 디스플레이 장치(100)가 비서 단말(200)을 통해 서버(300)로부터 영상 컨테츠를 제공받아 이를 표시한다.First, the display device 100 receives image content from the server 300 through the secretary terminal 200 and displays it.

다음, 사용자가 디스플레이 장치(100)를 통해 영상 컨테츠를 시청 중 비서 단말(200)에 음성 명령을 발화한다(S10). 여기서, 음성 명령은 현재 시청 중인 영상 컨테츠에서의 음식 객체 검출 명령일 수 있다.Next, while the user is watching video content through the display device 100, the user speaks a voice command to the secretary terminal 200 (S10). Here, the voice command may be a command for detecting food objects in the video content currently being viewed.

다음, 비서 단말(200)이 음성 명령을 입력 받아 이를 서버(300)로 출력한다(S20).Next, the secretary terminal 200 receives a voice command and outputs it to the server 300 (S20).

다음, 서버(300)가 음성 명령에 따라 영상 컨텐츠에서 음식 객체를 검출하고(S31), 음식 객체의 음식명 및 음식 재료와 관련된 상품 정보를 사용자의 상품 구매 이력 또는 상품 검색 이력을 기반으로 검색하여(S32) 이를 비서 단말(200)에 출력한다(S33).Next, the server 300 detects a food object from the video content according to a voice command (S31), and searches for product information related to the food name and food ingredient of the food object based on the user's product purchase history or product search history. (S32) This is output to the secretary terminal 200 (S33).

다음, 디스플레이 장치(100)가 비서 단말(200)로부터 음식명 및 상품 정보를 입력 받아(S41) 이를 표시한다(S42).Next, the display apparatus 100 receives food name and product information from the secretary terminal 200 (S41) and displays them (S42).

다음, 비서 단말(200)이 음성 명령에 대한 응답을 사용자에게 출력한다(S50).Next, the secretary terminal 200 outputs a response to the voice command to the user (S50).

다음, 사용자가 상품 정보에 포함된 상품 구매 또는 장바구니 담기 명령을 비서 단말(200)에 발화한다(S60).Next, the user ignites a command to purchase a product or add a shopping cart included in the product information to the secretary terminal 200 (S60).

다음, 비서 단말(200)이 상품 정보에 포함된 상품 구매 또는 장바구니 담기 명령을 입력 받아 이를 서버(300)로 출력한다(S70).Next, the secretary terminal 200 receives a command to purchase a product or add a shopping cart included in the product information and outputs it to the server 300 (S70).

다음, 서버(300)가 상품 구매 또는 상품 장바구니 담기를 수행하고(S81), 그 수행 결과를 비서 단말(200)에 출력한다(S82).Next, the server 300 performs product purchase or product shopping cart addition (S81), and outputs the execution result to the secretary terminal 200 (S82).

다음, 디스플레이 장치(100)가 비서 단말(200)로부터 수행 결과를 입력 받아(S91), 이를 표시한다(S92).Next, the display apparatus 100 receives the execution result from the secretary terminal 200 (S91) and displays it (S92).

이와 같이, 본 발명에 실시예에 따른 객체 인식 기반 서비스 제공 방법은, 특정 음식과 관련된 상품들을 일괄적으로 한번에 구매할 수 있어 사용자에게 편의성을 제공할 수 있고, 사용자의 상품 선호 성향을 분석하여 사용자 맞춤별로 상품을 추천할 수 있다.As described above, the object recognition-based service providing method according to an embodiment of the present invention can provide convenience to a user because it is possible to purchase products related to a specific food at once, and analyze the user's product preference tendency to customize the user. You can recommend products for each.

본 명세서와 도면에 개시된 본 발명의 실시 예들은 본 발명의 기술 내용을 쉽게 설명하고 본 발명의 이해를 돕기 위해 특정 예를 제시한 것일 뿐이며, 본 발명의 범위를 한정하고자 하는 것은 아니다. 따라서 본 발명의 범위는 여기에 개시된 실시 예들 이외에도 본 발명의 기술적 사상을 바탕으로 도출되는 모든 변경 또는 변형된 형태가 본 발명의 범위에 포함되는 것으로 해석되어야 한다.The embodiments of the present invention disclosed in the present specification and drawings are only provided for specific examples to easily explain the technical content of the present invention and to aid understanding of the present invention, and are not intended to limit the scope of the present invention. Therefore, the scope of the present invention should be construed that all changes or modified forms derived based on the technical idea of the present invention in addition to the embodiments disclosed herein are included in the scope of the present invention.

100: 디스플레이 장치
200: 비서 단말
300: 서버100: display device
200: secretary terminal
300: server

Claims

A secretary terminal receiving a user's voice command and outputting a response to the voice command;
The secretary terminal receives the voice command from the secretary terminal, detects a food object in the video content being viewed by the user according to the voice command, searches for product information related to the food name and food material of the food object, and obtains it from the secretary terminal. Output to the server; And
Display device that receives the food name and product information from the secretary terminal and displays it
Object recognition-based service providing system comprising a.

The method of claim 1,
The server is
Searching for the product information based on the user's product purchase history or product search history
Object recognition-based service provision system.

The method of claim 2,
The above product information
Including the lowest price product, organic product and retort product information,
Object recognition-based service provision system.

The method of claim 1,
The voice command is
Including at least one of the food object detection command, a product purchase or shopping cart addition command included in the product information, the food recipe detection command, and a food ingredient detection command
Object recognition-based service provision system.

The method of claim 4,
The server is
Performing at least one of the product purchase, product shopping cart, food recipe search, and food material search according to the voice command, and outputting the execution result to the secretary terminal
Object recognition-based service provision system.

The method of claim 5,
The display device
Receives an execution result according to the voice command from the secretary terminal and displays it
Object recognition-based service provision system.

The method of claim 1,
The server is
A learning model generation unit that generates a learning model by learning food image data;
A food object detection unit that detects the food object in the video content; And
A food name search unit that searches for a food name of the food object by inputting the food object into the learning model
Object recognition-based service providing system comprising a.

The method of claim 7,
The food name search unit
When a plurality of food names are searched, outputting at least one food name to the secretary terminal according to the matching degree of the food object and food name
Object recognition-based service provision system.

The method of claim 1,
The server is
According to the voice command, storing a food image including the food object, and outputting the food image to the secretary terminal during or after viewing the video content
Object recognition-based service provision system.

The method of claim 2,
The server is
Database for storing the product purchase history or product search history of the user and storing the product information
Object recognition-based service providing system comprising a.

Uttering a voice command to the secretary terminal while the user is watching the video content through the display device;
Receiving, by the secretary terminal, the voice command and outputting it to a server;
The server detects a food object in the video content according to the voice command, and searches for product information related to the food name and food ingredient of the food object based on the user's product purchase history or product search history, and the secretary Outputting to the terminal;
Receiving, by the display device, the food name and product information from the secretary terminal and displaying the information; And
Outputting, by the secretary terminal, a response to the voice command
Object recognition-based service providing method comprising a.

The method of claim 11,
Igniting a command to purchase a product or add a shopping cart included in the product information to the secretary terminal by the user;
Receiving, by the secretary terminal, a command to purchase a product or add a shopping cart included in the product information and output it to the server;
Performing, by the server, purchasing the product or adding a product shopping cart and outputting a result of the execution to the secretary terminal; And
Receiving, by the display device, the execution result from the secretary terminal and displaying it
Object recognition-based service providing method further comprising.