KR20120111855A

KR20120111855A - Apparatus and method for generating story according to user information

Info

Publication number: KR20120111855A
Application number: KR1020110055102A
Authority: KR
Inventors: 정희석; 박영희; 차정원
Original assignee: 삼성전자주식회사; 창원대학교 산학협력단
Priority date: 2011-03-31
Filing date: 2011-06-08
Publication date: 2012-10-11
Also published as: KR101804889B1

Abstract

PURPOSE: An apparatus and a method for generating a story according to user information collected from at least one electronic device are provided to easily recognize a life pattern and life information by configuring user life information according to user information collected from at least one electronic device. CONSTITUTION: An information collecting unit(110) collects log information including life information of a user from an electronic device. An analyzing unit(120) analyzes collected log information. The analyzing unit determines at least one topic. A story generating unit(130) generates one sentence which indicates the life information by using one topic. [Reference numerals] (100) Controlling unit; (110) Information collecting unit; (120) Analyzing unit; (130) Story generating unit; (140) Storing unit; (150) Displaying unit; (160) Input unit; (170) Communication module

Description

Apparatus and method for generating story according to user information {APPARATUS AND METHOD FOR GENERATING STORY ACCORDING TO USER INFORMATION}

본 발명은 사용자 정보에 따른 스토리를 생성하기 위한 장치 및 방법에 관한 것으로서, 특히, 적어도 하나의 전자 장치로부터 수집한 사용자 정보에 따른 사용자의 생활 정보를 스토리를 요약하여 사용자에게 제공하기 위한 장치 및 방법에 관한 것이다.
The present invention relates to an apparatus and method for generating a story according to user information, and more particularly, to an apparatus and method for providing a user by summarizing a story of user's living information according to user information collected from at least one electronic device. It is about.

유무선 기술의 발전으로 사용자의 일상과 관련된 정보를 지속적으로 수집할 수 있게 되었다. 예를 들어, 사용자는 이동 통신 기술의 발전으로 이동장치(mobile device)를 항상 휴대한다. 이에 따라, 상기 이동 장치는 사용자의 통화 기록, 사진, 음악 파일 재생, 위치 정보 등의 다양한 정보를 수집할 수 있다.With the development of wired and wireless technology, it is possible to continuously collect information related to users' daily lives. For example, users always carry a mobile device with the development of mobile communication technology. Accordingly, the mobile device may collect various information such as a call log, a photo, a music file, and location information of the user.

상술한 바와 같이 사용자가 항상 휴대하는 이동 장치는 사용자의 일상 정보를 수집할 수 있다. 이 경우, 상기 이동 장치는 상기 수집한 사용자의 일상 정보를 토대로 사용자에게 다양한 서비스를 제공할 수 있다. As described above, the mobile device which is always carried by the user may collect daily information of the user. In this case, the mobile device may provide various services to the user based on the collected daily information of the user.

하지만, 현재 이동 장치는 사용자의 일상 정보를 토대로 사용자의 편의를 고려한 서비스를 제공하기 않는다. 이에 따라, 이동 장치에서 사용자의 일상 정보를 토대로 상기 사용자의 생활 정보를 스토리를 요약하여 사용자에게 제공할 수 있는 기술이 필요하다.However, the current mobile device does not provide a service considering the user's convenience based on the user's daily information. Accordingly, there is a need for a technology in which a mobile device can provide the user with the user's life information by summarizing the story based on the user's daily information.

따라서, 본 발명의 목적은 적어도 하나의 전자 장치에서 수집한 사용자 정보에 따라 스토리를 생성하기 위한 장치 및 방법을 제공함에 있다.Accordingly, an object of the present invention is to provide an apparatus and method for generating a story according to user information collected by at least one electronic device.

본 발명의 다른 목적은 적어도 하나의 전자 장치에서 수집한 사용자 정보에 따라 스토리를 생성하여 사용자에게 제공하기 위한 장치 및 방법을 제공함에 있다.Another object of the present invention is to provide an apparatus and method for generating and providing a story to a user according to user information collected by at least one electronic device.

본 발명의 또 다른 목적은 적어도 하나의 전자 장치에서 수집한 사용자 정보에 따라 시간별 스토리를 생성하기 위한 장치 및 방법을 제공함에 있다.Another object of the present invention is to provide an apparatus and method for generating an hourly story based on user information collected by at least one electronic device.

본 발명의 또 다른 목적은 적어도 하나의 전자 장치에서 수집한 사용자 정보에 따라 주제별 스토리를 생성하기 위한 장치 및 방법을 제공함에 있다.
Another object of the present invention is to provide an apparatus and method for generating a story by theme based on user information collected by at least one electronic device.

본 발명의 목적들을 달성하기 위한 본 발명의 제 1 견지에 따르면, 사용자의 일상 정보를 요약하기 위한 장치는, 적어도 하나의 전자 기기로부터 사용자의 일상 정보를 포함하는 로그 정보를 수집하는 정보 수집부와, 상기 정보 수집부에서 수집한 로그 정보를 분석하여 사용자의 일상 정보를 나타내는 적어도 하나의 토픽을 결정하는 분석부와, 상기 분석부에서 결정한 적어도 하나의 토픽을 이용하여 사용자의 일상 정보를 나타내는 적어도 하나의 문장을 생성하는 스토리 생성부를 포함하여 구성되는 것을 특징으로 한다.According to a first aspect of the present invention for achieving the objects of the present invention, an apparatus for summarizing daily information of a user comprises: an information collecting unit for collecting log information including daily information of the user from at least one electronic device; An analysis unit determining at least one topic representing daily information of a user by analyzing log information collected by the information collecting unit, and at least one representing daily information of a user using at least one topic determined by the analysis unit Characterized in that it comprises a story generating unit for generating a sentence.

본 발명의 제 2 견지에 따르면, 사용자의 일상 정보를 요약하기 위한 방법은, 적어도 하나의 전자 기기로부터 사용자의 일상 정보를 포함하는 로그 정보를 수집하는 과정과, 상기 정보 수집부에서 수집한 로그 정보를 분석하여 사용자의 일상 정보를 나타내는 적어도 하나의 토픽을 결정하는 과정과, 상기 분석부에서 결정한 적어도 하나의 토픽을 이용하여 사용자의 일상 정보를 나타내는 적어도 하나의 문장을 생성하는 과정을 포함하는 것을 특징으로 한다.
According to a second aspect of the present invention, a method for summarizing daily information of a user includes: collecting log information including daily information of a user from at least one electronic device, and log information collected by the information collecting unit; Determining at least one topic representing the user's daily information by analyzing the information, and generating at least one sentence representing the user's daily information using the at least one topic determined by the analyzer. It is done.

상술한 바와 같이 적어도 하나의 전자 장치를 통해 수집한 사용자 정보에 따라 사용자 생활 정보를 스토리로 구성하여 사용자에게 제공함으로써, 사용자가 자신의 생활 정보 및 생활 패턴을 손쉽게 인식할 수 있는 이점이 있다.
As described above, the user's life information is composed as a story and provided to the user according to the user information collected through the at least one electronic device, so that the user can easily recognize his / her life information and life pattern.

도 1은 본 발명의 실시 예에 따른 사용자 정보에 따른 스토리 생성 장치의 구성을 도시하는 도면,
도 2는 본 발명의 실시 예에 따른 스토리 생성 장치의 분석부의 상세 구성을 도시하는 도면,
도 3은 본 발명의 실시 예에 따른 스토리 생성 장치의 자질 추출부의 상세 구성을 도시하는 도면,
도 4는 본 발명의 실시 예에 따른 사용자 정보에 따른 스토리 생성 절차를 도시하는 도면,
도 5는 본 발명의 실시 예에 따른 토픽 세그먼트를 구성하기 위한 절차를 도시하는 도면,
도 6은 본 발명의 실시 예에 따른 개체 인식 절차를 도시하는 도면,
도 7은 본 발명의 실시 예에 따른 시간별로 스토리를 생성하기 위한 절차를 도시하는 도면,
도 8은 본 발명의 실시 예에 따른 주제별로 스토리를 생성하기 위한 절차를 도시하는 도면, 및
도 9는 본 발명의 실시 예에 따른 스토리의 표시 화면을 도시하는 도면. 1 is a view showing the configuration of a story generating apparatus according to user information according to an embodiment of the present invention;
2 is a diagram illustrating a detailed configuration of an analysis unit of a story generating device according to an embodiment of the present invention;
3 is a diagram illustrating a detailed configuration of a feature extraction unit of a story generating device according to an embodiment of the present invention;
4 is a diagram illustrating a story generation procedure according to user information according to an embodiment of the present invention;
5 is a diagram illustrating a procedure for configuring a topic segment according to an embodiment of the present invention;
6 is a diagram illustrating an object recognition procedure according to an embodiment of the present invention;
7 is a diagram illustrating a procedure for generating a story by time according to an embodiment of the present invention;
8 is a diagram illustrating a procedure for generating a story for each topic according to an embodiment of the present invention; and
9 is a diagram illustrating a display screen of a story according to an embodiment of the present invention.

이하 본 발명의 바람직한 실시 예를 첨부된 도면을 참조하여 상세히 설명한다. 그리고, 본 발명을 설명함에 있어서, 관련된 공지기능 혹은 구성에 대한 구체적인 설명이 본 발명의 요지를 불필요하게 흐릴 수 있다고 판단된 경우 그 상세한 설명은 생략한다. 그리고 후술되는 용어들은 본 발명에서의 기능을 고려하여 정의된 용어들로서 이는 사용자, 운용자의 의도 또는 관례 등에 따라 달라질 수 있다. 그러므로 그 정의는 본 명세서 전반에 걸친 내용을 토대로 내려져야 할 것이다.Hereinafter, exemplary embodiments of the present invention will be described in detail with reference to the accompanying drawings. In the following description, a detailed description of known functions and configurations incorporated herein will be omitted when it may make the subject matter of the present invention rather unclear. The following terms are defined in consideration of the functions of the present invention, and may be changed according to the intentions or customs of the user, the operator, and the like. Therefore, the definition should be based on the contents throughout this specification.

이하 본 발명은 적어도 하나의 전자 장치로부터 수집한 사용자 정보에 따른 사용자의 생활 정보를 스토리로 요약하여 사용자에게 제공하기 위한 기술에 대해 설명한다.Hereinafter, a description will be given of a technology for providing a user by summarizing a user's life information according to user information collected from at least one electronic device into a story.

이하 설명에서 스토리 생성 장치는 사용자의 일상 정보를 수집하고, 수집한 일상 정보를 기반으로 사용자의 생활 정보를 요약한 스토리를 생성한다. In the following description, the story generating device collects daily information of the user and generates a story summarizing the living information of the user based on the collected daily information.

이하 설명에서 스토리 생성 장치는 휴대 전화로 구성되는 것으로 가정한다. 하지만, 상기 스토리 생성 장치는 데이터를 처리할 수 있는 컴퓨터, 디지털 카메라, 디지털 가전기기, 디지털 전화, 디지털 프로젝트, 홈서버, 디지털 비디오 레코더, 디지털 위성 방송 수신기, 셋탑 박스 및 디지털 TV 방송 수신기 중 어느 하나로 구성될 수 있다. In the following description, it is assumed that the story generating device is composed of a mobile phone. However, the story generating device may be any one of a computer, a digital camera, a digital home appliance, a digital telephone, a digital project, a home server, a digital video recorder, a digital satellite broadcast receiver, a set-top box, and a digital TV broadcast receiver capable of processing data. Can be configured.

도 1은 본 발명의 실시 예에 따른 사용자 정보에 따른 스토리 생성 장치의 구성을 도시하고 있다.1 is a block diagram of a story generating apparatus based on user information according to an exemplary embodiment.

상기 도 1에 도시된 바와 같이 상기 스토리 생성 장치는 제어부(100), 정보 수집부(110), 분석부(120), 스토리 생성부(130), 저장부(140), 표시부(150) 및 입력부(160)를 포함하여 구성된다.As shown in FIG. 1, the story generating apparatus includes a controller 100, an information collector 110, an analyzer 120, a story generator 130, a storage 140, a display 150, and an input unit. And 160.

상기 제어부(100)는 상기 스토리 생성 장치의 전반적인 동작을 제어한다. 상기 제어부(100)는 상기 스토리 생성부(130)로부터 제공받은 스토리를 상기 표시부(150)에 표시하도록 제어한다.The controller 100 controls the overall operation of the story generating device. The controller 100 controls the display unit 150 to display the story provided from the story generation unit 130.

상기 정보 수집부(110)는 적어도 하나의 전자 기기의 로그 데이터를 활용하여 사용자의 일상 정보를 수집한다. 예를 들어, 상기 정보 수집부(110)는 외부로부터 수신되는 데이터, 내부에서 생성되는 데이터 및 외부 저장소에 저장된 데이터를 이용하여 사용자의 일상 정보를 수집한다. 여기서, 상기 외부로부터 수신되는 데이터는 날씨, 위경도 정보, 블로그 정보 및 소셜 데이터 등을 포함한다. 상기 내부에서 생성되는 데이터는 통화기록, 단문 메시지(SMS: Short Message Service), 멀티미디어 메시지(MMS: Multi-media Message Service), 위성 항법 장비(GPS: Global Positioning System)에서 측정된 위경도 정보, 일정 및 주소록 등을 포함한다. 상기 외부 저장소에 저장된 데이터는 이미지 촬영 영상, 재생한 음악 파일 목록, TV 시청 정보 등을 포함한다.The information collecting unit 110 collects daily information of a user by using log data of at least one electronic device. For example, the information collecting unit 110 collects daily information of a user using data received from the outside, data generated inside, and data stored in an external storage. Here, the data received from the outside includes weather, latitude and longitude information, blog information, social data, and the like. The internally generated data includes call history, short message service (SMS), multi-media message service (MMS), and latitude and longitude information measured by a global positioning system (GPS) and a schedule. And address book and the like. Data stored in the external storage includes an image photographed image, a list of played music files, TV viewing information, and the like.

상기 분석부(120)는 상기 정부 수집부(110)에 의해 수집된 정보를 분석하여 사용자의 일상 정보를 나타내기 위한 에피소드를 추정한다. 예를 들어, 상기 분석부(120)는 하기 도 2에 도시된 바와 같이 구성된다.The analyzer 120 estimates an episode for representing the user's daily information by analyzing the information collected by the government collector 110. For example, the analyzer 120 is configured as shown in FIG. 2.

상기 스토리 생성부(130)는 상기 분석부(120)에서 추정한 에피소드를 이용하여 사용자의 일상 정보를 요약하여 표현한 스토리를 생성한다. 예를 들어, 상기 스토리 생성부(130)는 상기 분석부(120)에서 추정한 에피소드를 이용하여 시간별로 스토리를 생성한다. 다른 예를 들어, 상기 스토리 생성부(130)는 상기 분석부(120)에서 추정한 에피소드를 이용하여 주제별로 스토리를 생성할 수도 있다. 이때, 상기 스토리 생성부(130)는 유의 동사를 이용하여 추출한 자질과 다른 형식의 스토리를 생성할 수 있다. 또한, 상기 스토리 생성부(130)는 사용자의 연령 및 지역 등을 고려하여 스토리를 생성할 수도 있다.The story generator 130 generates a story summarizing and expressing daily information of the user by using the episode estimated by the analyzer 120. For example, the story generator 130 generates a story by time using the episode estimated by the analyzer 120. For another example, the story generator 130 may generate a story for each topic by using an episode estimated by the analyzer 120. In this case, the story generator 130 may generate a story having a format different from that of the feature extracted using the significant verb. In addition, the story generator 130 may generate a story in consideration of the age and region of the user.

상기 저장부(140)는 상기 스토리 생성 장치의 동작을 제어하기 위한 프로그램을 저장하는 프로그램 저장부 및 프로그램 수행 중에 발생되는 데이터를 저장하는 데이터 저장부로 구성될 수 있다. 예를 들어, 상기 저장부(140)는 상기 정부 수집부(110)에서 수집한 사용자 일정 정보를 하기 <표 1>와 같은 형태로 저장한다.The storage unit 140 may include a program storage unit for storing a program for controlling the operation of the story generating device and a data storage unit for storing data generated during program execution. For example, the storage unit 140 stores the user schedule information collected by the government collecting unit 110 in the form as shown in Table 1 below.

Data1Data1 Data2Data2 TypeType Type_detailType_detail Data1Data1 Data2Data2 Data3Data3 Data4Data4 Data5Data5 Data6Data6 13188132000001318813200000 13188168000001318816800000 AA 00 OS 수업OS class 창원대학교 55호관 110호Changwon National University Building 55 55110호실에서 11시까지 OS 수업 듣기Listen to OS lessons from room 55110 to 11:00 13188250800001318825080000 13188250800001318825080000 TT 00 학교 앞 롯데리아. 친구를 기다리며.... 햄버거 맛있다~ ㅎㅎLotteria in front of the school. Waiting for a friend .... Hamburger is delicious ~ 0.0 0.00.0 0.0 한국 창원시 사림동Salim-dong, Changwon-si, Korea 13188243600001318824360000 13188243600001318824360000 SS 22 0105114141501051141415 심심하면 옆에 롯데리아에 들어가 있어라ㅋ 안에서 기다려.If you're bored, stay inside Lotteria. Wait inside. 12898097572491289809757249 12898097572491289809757249 WW 00 맑음Sunny Changwon-siChangwon-si 12897809780001289780978000 12897809780001289780978000 PP 00 35.25032446 128.6007371235.25032446 128.60073712 대한민국 마산시 구암2동Guam 2-dong, Masan, South Korea 12898313480001289831348000 12898313480001289831348000 GG 00 35.2397167 128.6962380135.2397167 128.69623801 대한민국 창원시 용동Yong-dong, Changwon-si, Korea /sdcard/DCIM/Camera/2010-11-17 22.59.30.jpg/ sdcard / DCIM / Camera / 2010-11-17 22.59.30.jpg …... …... …... …... …... …... …... …... …... …...

상기 표시부(150)는 상기 제어부(100)의 제어에 따라 상기 스토리 생성 장치의 상태 정보, 메뉴 화면 및 스토리 정보를 표시한다. 예를 들어, 상기 표시부(150)는 상기 제어부(100)의 제어에 따라 상기 스토리 생성부(130)에서 생성한 스토리를 사용자가 확인할 수 있도록 표시한다.The display unit 150 displays the state information, the menu screen, and the story information of the story generating apparatus under the control of the controller 100. For example, the display unit 150 displays the story generated by the story generator 130 under the control of the controller 100 so that the user can check the story.

상기 입력부(160)는 사용자의 선택에 의해 발생하는 입력 데이터를 상기 제어부(100)로 제공한다.The input unit 160 provides input data generated by a user's selection to the controller 100.

상기 통신 모듈(170)은 음성 및 데이터 통신을 위해 안테나를 통해 송수신되는 신호를 처리한다.The communication module 170 processes signals transmitted and received through an antenna for voice and data communication.

상술한 구성에서, 상기 제어부(100)는 상기 정보 수집부(110), 상기 분석부(120) 및 상기 스토리 생성부(130)의 기능을 수행할 수 있다. 본 발명에서 이를 별도로 구성한 것은 각 기능들을 구별하여 설명하기 위함이다. 따라서, 실제로 구현하는 경우, 상기 정보 수집부(110), 상기 분석부(120) 및 상기 스토리 생성부(130)의 모든 기능 또는 일부 기능을 상기 제어부(100)에서 처리하도록 구성할 수 있다.In the above configuration, the control unit 100 may perform the functions of the information collecting unit 110, the analyzing unit 120, and the story generating unit 130. In the present invention, it is separately configured to describe each function separately. Therefore, in actual implementation, the controller 100 may be configured to process all or some functions of the information collecting unit 110, the analyzing unit 120, and the story generating unit 130.

도 2는 본 발명의 실시 예에 따른 스토리 생성 장치의 분석부의 상세 구성을 도시하고 있다.2 illustrates a detailed configuration of an analysis unit of a story generating device according to an embodiment of the present invention.

상기 도 2에 도시된 바와 같이 상기 분석부(120)는 토픽 세그먼트 구성부(201), 자질 추출부(203), 토픽 결정부(205) 및 에피소드 결정부(207)를 포함하여 구성된다.As illustrated in FIG. 2, the analysis unit 120 includes a topic segment configuration unit 201, a feature extraction unit 203, a topic determination unit 205, and an episode determination unit 207.

상기 토픽 세그먼트 구성부(201)는 상기 정보 수집부(110)에서 수집한 사용자 일정 정보를 이용하여 스토리를 구성하기 위한 후보 토픽들의 집합인 토픽 세그먼트를 구성한다. 예를 들어, 상기 토픽 세그먼트 구성부(201)는 주사건들을 시간 구간으로 분리하여 각각의 시간 구간에 포함되는 주사건으로 토픽 세그먼트를 구성한다. 만일, 일정 정보가 주사건인 경우, 상기 토픽 세그먼트 구성부(201)는 각각의 일정 정보를 각각의 토픽 세그먼트로 구성한다. 이후, 상기 토픽 세그먼트 구성부(201)는 각각의 보조 사건의 발생 시간이 토픽 세그먼트의 시간 구간에 포함되는 보조 사건을 해당 토픽 세그먼트에 매핑한다. 만일, 보조 사건이 매핑될 토픽 세그먼트가 없는 경우, 상기 토픽 세그먼트 구성부(201)는 토픽 세그먼트에 매핑되지 않는 보조 사건을 이용하여 더미 세그먼트를 구성한다. 이때, 상기 더미 세그먼크의 시간 구간은 기 설정된다. 예를 들어, 상기 더미 세그먼트는 1시간 간격으로 구성된다. 여기서, 상기 주사건은 상기 정보 수집부(110)에서 수집한 사용자 일정 정보 중 스토리 구성하는데 주체가 되는 일정 정보를 나타낸다. 상기 보조 사건은 상기 정보 수집부(110)에서 수집한 사용자 일정 정보에서 상기 주사건을 제외한 정보를 나타낸다.The topic segment configuration unit 201 configures a topic segment that is a set of candidate topics for composing a story using user schedule information collected by the information collection unit 110. For example, the topic segment configuration unit 201 divides the scan guns into time intervals and configures the topic segment with the scan guns included in each time interval. If the schedule information is a scan gun, the topic segment configuration unit 201 configures each schedule information into each topic segment. Subsequently, the topic segment configuration unit 201 maps an auxiliary event whose occurrence time of each auxiliary event is included in a time interval of the topic segment to the corresponding topic segment. If there is no topic segment to which an auxiliary event is mapped, the topic segment constructer 201 configures a dummy segment using an auxiliary event not mapped to the topic segment. In this case, a time interval of the dummy segment is preset. For example, the dummy segments are configured at one hour intervals. In this case, the scan gun represents schedule information that is mainly used to construct a story among user schedule information collected by the information collection unit 110. The incident event represents information excluding the scan gun from user schedule information collected by the information collecting unit 110.

상기 자질 추출부(203)는 상기 토픽 세그먼트 구성부(201)에서 구성한 토픽 세그먼트에 포함되는 주사건 및 보조 사건들로부터 자질을 추출한다. 즉, 상기 자질 추출부(203)는 각 토픽 세그먼트에서 자질을 추출한다. 예를 들어, 상기 자질 추출부(203)는 하기 도 3에 도시된 바와 같이 구성되어 자질을 추출한다. 여기서, 자질은 문장 구성 요소가 될 수 있는 데이터를 나타낸다. 예를 들어, 상기 자질은 문장 구성 요소가 될 수 있는 동사, 개체명, 문장 연결 정보, 개체명과 연관된 토픽 정보 및 부가 정보를 포함한다. 이때, 상기 개체명은 분류, 개체명 인식기 및 문장 템플릿 중 적어도 하나에 따라 달라질 수 있지만, 하기 <표 2>와 같이 환경 설정에 정의된 개체명만을 이용할 수 있다. The feature extracting unit 203 extracts a feature from scan guns and auxiliary events included in the topic segment configured in the topic segment constructing unit 201. That is, the feature extraction unit 203 extracts features from each topic segment. For example, the feature extraction unit 203 is configured as shown in Figure 3 below to extract the feature. Here, the qualities represent data that may be sentence components. For example, the qualities include verbs, entity names, sentence association information, topic information associated with entity names, and additional information that may be sentence elements. In this case, the entity name may vary depending on at least one of a classification, an entity name recognizer, and a sentence template. However, only the entity name defined in the environment setting may be used as shown in Table 2 below.

개체명 이름Object name name 태그 정보Tag information 세부정보Details 부가자질Additional features EXT_NONEEXT_NONE 부가자질 인식 태그, 실제 개체명으로 사용되지 않고 시스템 내부적으로 이용Additional feature recognition tag, not used as actual object name but used internally in system 사람Person PERSONPERSON 단체명Group name ORGORG 날짜date DAYDAY 시간time TIMETIME 지명nomination LOCATIONLOCATION 물건stuff OBJECTOBJECT 행사명, 물건Event name, thing 사진 파일명Photo file name JPEG-FILENAMEJPEG-FILENAME JPEG 형식의 그림 파일명Picture file name in JPEG format

상기 토픽 결정부(205)는 상기 자질 추출부(203)가 각 토픽 세그먼트에서 추출한 자질들을 이용하여 토픽을 결정한다. 예를 들어, 상기 토픽 결정부(205)는 하기 <수학식 1>과 같이 토픽의 발생 확률을 고려하여 토픽을 결정한다. 이때, 상기 토픽 결정부(205)는 각 토픽 세그먼트에서 토픽을 결정한다.The topic determiner 205 determines a topic by using the features extracted by the feature extractor 203 in each topic segment. For example, the topic determination unit 205 determines a topic in consideration of the occurrence probability of the topic as shown in Equation 1 below. In this case, the topic determiner 205 determines a topic in each topic segment.

여기서, 상기 T^*는 토픽 세그먼트 내에서 최적의 토픽을 나타내고, 상기 T는 전체 토픽의 집합을 나타내며, 상기 S_t는 토픽 t에 대한 토픽 스코어를 나타낸다. 여기서, 상기 S_t는 하기 <수학식 2>와 같이 나타낼 수 있다.Here, T ^* denotes an optimal topic in a topic segment, T denotes a set of all topics, and S _t denotes a topic score for topic t. Here, S _t may be represented by Equation 2 below.

여기서, 상기 S_t는 토픽 t에 대한 토픽 스코어를 나타내고, 상기 L은 토픽 세그먼트에 포함되는 로그들의 종류를 나타내며, 상기 Wx는 해당 로그 타입에 대한 가중치를 나타내고, 상기 P(word)는 해당 토픽과 관련된 단어의 발생 확률을 나타내며, 상기 P(exts)는 부가 자질의 특성을 나타낸다.Here, S _t denotes a topic score for topic t, L denotes a type of logs included in a topic segment, Wx denotes a weight for the corresponding log type, and P (word) corresponds to a corresponding topic. The probability of occurrence of a related word is represented, and P (exts) represents a characteristic of an additional feature.

상기 에피소드 결정부(207)는 시간 대별로 최적의 토픽 세그먼트를 선택한다. 이때, 상기 에피소드 결정부(207)는 상기 최적의 토픽 세그먼트의 토픽을 해당 시간의 에피스드로 인식한다. 상기 에피소드 결정부(207)는 동일한 시간 구간에 다수 개의 토픽 세그먼트들이 존재하는 경우, 상기 토픽 결정부(205)에서 결정한 각 토픽 세그먼트에 대한 토픽의 토픽 스코어를 비교하여 해당 시간 구간의 최적 토픽 세그먼트를 선택한다. 이후, 연속된 토픽 세그먼트의 토픽이 동일한 경우, 상기 에피소드 결정부(207)는 상기 연속된 토픽 세그먼트들을 하나로 토픽 세그먼트로 병합한다.The episode determiner 207 selects an optimal topic segment for each time period. In this case, the episode determination unit 207 recognizes the topic of the optimal topic segment as an epid of the corresponding time. When there are a plurality of topic segments in the same time interval, the episode determiner 207 compares the topic scores of the topics for each topic segment determined by the topic determiner 205 and selects an optimal topic segment in the corresponding time interval. Choose. Then, when the topics of the consecutive topic segments are the same, the episode determiner 207 merges the consecutive topic segments into one topic segment.

또한, 상기 에피소드 결정부(207)는 각 시간 구간의 에피스드를 고려하여 사용자 일정 정보의 가장 중요한 특징을 나타낼 수 있는 주토픽을 결정한다. 예를 들어, 상기 에피소드 결정부(207)는 하기 <수학식 3>과 같이 주토픽을 결정한다.In addition, the episode determination unit 207 determines the main topic that can represent the most important feature of the user schedule information in consideration of the episodes of each time interval. For example, the episode determination unit 207 determines the main topic as shown in Equation 3 below.

여기서, 상기 T^**는 주토픽을 나타내고, 상기 T는 전체 토픽의 집합을 나타내며, 상기 EP는 각 시간 구간의 에피소드를 나타내고, 상기 d는 에피소드의 길이를 나타내며, 상기 S_t는 토픽 t에 대한 토픽 스코어를 나타낸다. Where T ^** represents the main topic, T represents a set of all topics, EP represents an episode of each time interval, d represents the length of the episode, and S _t represents a topic t Indicates a topic score.

또한, 상기 에피소드 결정부(207)는 각 시간 구간의 에피스드를 결정하는 과정에서 확인된 감정 단어의 발생 확률을 고려하여 사용자의 감정을 추정할 수도 있다.In addition, the episode determiner 207 may estimate the user's emotion in consideration of the occurrence probability of the emotion word identified in the process of determining the episode of each time interval.

도 3은 본 발명의 실시 예에 따른 스토리 생성 장치의 자질 추출부의 상세 구성을 도시하고 있다. 3 illustrates a detailed configuration of a feature extraction unit of a story generating device according to an embodiment of the present invention.

상기 도 3에 도시된 바와 같이 상기 자질 추출부(203)는 소셜 정보 추출부(301), 위치 정보 추출부(303), 텍스트 추출부(305) 및 개체명 설정부(307)를 포함하여 구성된다.As shown in FIG. 3, the feature extracting unit 203 includes a social information extracting unit 301, a location information extracting unit 303, a text extracting unit 305, and an object name setting unit 307. do.

상기 소셜 정보 추출부(301)는 상기 토픽 세그먼크(201)에서 구성된 각각의 토픽 세그먼트에 포함되는 주사건 및 보조사건에서 휴대폰 정보, 그룹 정보 등과 같은 소셜정보를 추출한다. The social information extracting unit 301 extracts social information such as mobile phone information, group information, etc. from the scan and auxiliary events included in each topic segment configured in the topic segment monk 201.

상기 위치 정보 추출부(303)는 상기 토픽 세그먼크(201)에서 구성된 각각의 토픽 세그먼트에 포함되는 주사건 및 보조사건에서 위경도, 주소 및 지명과 같은 위치 정보를 추출한다. 예를 들어, 상기 위치정보 추출부(303)는 추출한 주소 정보를 LOCATION이란 개체명으로 설정한다. The location information extractor 303 extracts location information, such as latitude, longitude, address, and place name, from scans and auxiliary events included in each topic segment configured in the topic segment monk 201. For example, the location information extraction unit 303 sets the extracted address information to an entity name of LOCATION.

또한, 상기 위치 정보 추출부(303)는 거리 정보를 추출한다. 여기서, 거리 정보는 상기 스토리 생성 장치의 이동 거리, 이동하기 위한 목적지까지의 거리 등을 포함한다.In addition, the location information extractor 303 extracts distance information. Here, the distance information includes a moving distance of the story generating device, a distance to a destination for moving, and the like.

상기 텍스트 추출부(305)는 상기 토픽 세그먼크(201)에서 구성된 각각의 토픽 세그먼트에 포함되는 주사건 및 보조사건의 텍스트 정보에서 자질을 추출한다. 예를 들어, 상기 텍스트 추출부(305)는 개체명 인식기를 이용하여 동사 포함 텍스트 추출, 제목 텍스트 추출, 정형화된 텍스트 추출, 파일명 추출, 문장 연결 정보 추출 및 감정 정보를 추출한다.The text extractor 305 extracts a feature from text information of scan and auxiliary events included in each topic segment configured in the topic segment 201. For example, the text extractor 305 extracts a verb-containing text, extracts a title text, extracts a formatted text, extracts a file name, extracts sentence information, and emotions using an entity name recognizer.

상기 동사 포함 텍스트 추출의 경우, 상기 텍스트 추출부(305)는 텍스트에 포함되는 단어가 명사 사전에 포함되는지 확인한다. 이때, 상기 텍스트 추출부(305)는 해당 단어에 포함된 조사를 제거한 후, 명사 사전에 포함되는지 확인한다. In the case of extracting the verb-containing text, the text extracting unit 305 checks whether a word included in the text is included in the noun dictionary. In this case, the text extracting unit 305 removes the survey included in the word and checks whether it is included in the noun dictionary.

상기 단어가 명사 사전에 포함되는 경우, 상기 텍스트 추출부(305)는 상기 단어의 개체명을 확인한다. 예를 들어, 상기 텍스트 추출부(305)는 하기 <표 3>와 같은 개체명 사전을 이용하여 인식된 단어의 개체명을 확인한다.When the word is included in the noun dictionary, the text extracting unit 305 checks the entity name of the word. For example, the text extraction unit 305 checks the entity name of the recognized word using the entity name dictionary as shown in Table 3 below.

명사noun 개체명Object name 토픽Topic 자전거bicycle OBJECTOBJECT 축제festival OBJECTOBJECT 관람, 축제Visit, festival 역station LOCATIONLOCATION 여행, 업무Travel, work 친구friend PERSONPERSON 생활, 소풍Life, picnic 칠판blackboard 수업class 집House LOCATIONLOCATION …... …... …...

상기 텍스트 추출부(305)는 명사 사전의 크기를 줄이기 위해 일정한 패턴을 갖는 명사들을 포함하는 개체명 패턴 사전을 하기 <표 4>와 같이 별도로 구성할 수 있다.The text extractor 305 may separately configure an entity name pattern dictionary including nouns having a predetermined pattern to reduce the size of the noun dictionary as shown in Table 4 below.

명사noun 개체명Object name 토픽Topic ~도서관~ Library LOCATIONLOCATION 공부 시험/자격증Study Exam / Certificate ~독서실~ Reading room LOCATIONLOCATION 공부 시험/자격증Study Exam / Certificate ~학원School LOCATIONLOCATION 공부 시험/자격증Study Exam / Certificate ~미술관Museum of Art LOCATIONLOCATION 관람 그림Spectator painting ~전시장~ Exhibition Hall LOCATIONLOCATION 관람 그림Spectator painting ~학교~ School LOCATIONLOCATION 수업 축제 발표회Class festival presentation ~대학University LOCATIONLOCATION 수업 축제 발표회Class festival presentation ~대학교University LOCATIONLOCATION 수업 축제 발표회Class festival presentation ~경기~ Match OBJECTOBJECT 체육대회Athletic Competition ~수목원Arboretum LOCATIONLOCATION 여행Travel ~영화관Movie theater LOCATIONLOCATION 관람Preview ~화실Fire LOCATIONLOCATION 그림Drawing ~음식점~ Restaurant LOCATIONLOCATION 기념일Anniversary ~케익Cake OBJECTOBJECT 기념일Anniversary ~케이크Cake OBJECTOBJECT 기념일Anniversary ~보험Insurance OBJECTOBJECT 금융finance ~적금Savings OBJECTOBJECT 금융finance ~증권Securities OBJECTOBJECT 금융finance ~은행~ Bank LOCATIONLOCATION 금융finance ~보험사Insurer LOCATIONLOCATION 금융finance 구운~Baked ~ OBJECTOBJECT 요리Cooking 튀긴~Fried ~ OBJECTOBJECT 요리Cooking 절은~Clause ~ OBJECTOBJECT 요리Cooking 볶은~Roasted ~ OBJECTOBJECT 요리Cooking 조린~Chorin ~ OBJECTOBJECT 요리Cooking 양념~Seasoning ~ OBJECTOBJECT 요리Cooking 다진~Minced ~ OBJECTOBJECT 요리Cooking 비빈~Bibin ~ OBJECTOBJECT 요리Cooking 뭍친~Chinchin ~ OBJECTOBJECT 요리Cooking 채썬~Chasun ~ OBJECTOBJECT 요리Cooking 썰어둔~Sliced ~ OBJECTOBJECT 요리Cooking 버무린~Toss ~ OBJECTOBJECT 요리Cooking 냄비~Pot ~ OBJECTOBJECT 요리Cooking ~그릇~ Bowl OBJECTOBJECT 요리Cooking ~썰기Cutting OBJECTOBJECT 요리Cooking ~덮밥Rice bowls OBJECTOBJECT 요리Cooking ~요리Cooking OBJECTOBJECT 요리Cooking ~찌개To stew OBJECTOBJECT 요리Cooking ~고기Meat OBJECTOBJECT 요리Cooking ~갈비Ribs OBJECTOBJECT 요리Cooking ~레시피Recipe OBJECTOBJECT 요리Cooking ~스튜디오Studio LOCATIONLOCATION 사진Picture ~사진사Photographer OBJECTOBJECT 사진Picture ~렌즈... lens OBJECTOBJECT 사진Picture ~동물Animals LOCATIONLOCATION 애완동물Pets 애견~Dog ~ LOCATIONLOCATION 에완동물Ewan ~손가락Finger OBJECTOBJECT 아픔pain ~인대Ligament OBJECTOBJECT 아픔pain

상기 <표 4>와 같은 개체명 패턴 사전을 사용하는 경우, 상기 텍스트 추출부(305)는 상기 단어가 명사 사전에 포함되지 않으면 상기 개체명 패턴 사전에 포함되는지 확인한다. When using the entity name pattern dictionary as shown in Table 4, the text extracting unit 305 checks whether the word is included in the entity name pattern dictionary if the word is not included in the noun dictionary.

상기 단어가 명사 사전과 개체명 패턴 사전에 포함되지 않는 경우, 상기 텍스츠 추출부(305)는 상기 단어가 동사 사전 또는 동사 활용형 패턴 사전에 포함되는지 확인한다. 예를 들어, 상기 동사 사전은 하기 <표 5>와 같이 구성된다.When the word is not included in the noun dictionary and the entity name pattern dictionary, the text extractor 305 checks whether the word is included in the verb dictionary or the verb utilization pattern dictionary. For example, the verb dictionary is configured as shown in Table 5 below.

topictopic verbverb synonymssynonyms tmplKeytmplKey 공부study 전해주Convey 건내주Counting 1One 공부study 건내주Counting 전해주Convey 22 입시입사Entrance examination 집합하Gathering 소집하Convene 33 여행Travel 집합하Gathering 소집하Convene 33 체육Athletic 집합하Gathering 소집하Convene 33 입시입사Entrance examination 통과하Through 붙, 합격하Stuck, passing 44 …... …... …... …...

다른 예를 들어, 상기 동사 활용형 패턴 사전은 하기 <표 6>과 같이 구성된다.For another example, the verb utilization pattern dictionary is configured as shown in Table 6 below.

활용형 패턴Conjugation Pattern 원형 패턴Circular pattern ~가To ~|~그~ | ~ That ~갔었어I went ~가To ~시켰나Did you ~시키Shiki ~울까To cry ~우~ Woo ~왔지To ~오Oh ~혔다는Pulled ~히To …... …...

여기서, 상기 "~|"는 "~그"가 여러 개의 패턴들로 사용될 수 있음을 나타냅니다.Here, "~ |" indicates that "~" can be used in multiple patterns.

상기 제목 텍스트 추출의 경우, 상기 텍스트 추출부(305)는 상기 제목 텍스트에 포함되는 단어가 명사 사전에 포함되는지 확인한다. 상기 단어가 명사 사전에 포함되는 경우, 상기 텍스트 추출부(305)는 상기 단어의 개체명을 확인한다. In the case of extracting the title text, the text extracting unit 305 checks whether a word included in the title text is included in the noun dictionary. When the word is included in the noun dictionary, the text extracting unit 305 checks the entity name of the word.

상술한 바와 같이 동사 포함 텍스트와 제목 텍스트를 추출하는 경우, 상기 텍스트 추출부(305)는 인식한 개체명에 동사가 포함되는지 여부를 확인한다. 만일, 인식한 개체명에 동사가 포함되지 않은 경우, 상기 텍스트 추출부(305)는 상기 개체명에 기본 동사를 추가한다.As described above, when extracting the verb-containing text and the title text, the text extraction unit 305 checks whether or not the verb is included in the recognized entity name. If the recognized entity name does not include a verb, the text extractor 305 adds a basic verb to the entity name.

상기 문장 연결 정보 추출의 경우, 상기 텍스트 추출부(305)는 상기 토픽 세그먼크(201)에서 구성된 각각의 토픽 세그먼트에 포함되는 주사건 및 보조사건에서 문장 연결 정보를 추출한다. 이때, 상기 텍스트 추출부(305)는 하기 <표 7>과 같은 문장 연결어미 사전을 이용하여 문장 연결 정보를 추출한다.In the case of the sentence linkage information extraction, the text extractor 305 extracts the sentence linkage information from the scan cases and auxiliary events included in each topic segment configured in the topic segment 201. In this case, the text extracting unit 305 extracts sentence linking information using a sentence linking ending dictionary as shown in Table 7 below.

키key 연결명Connection name ~고, ~며, ~고서, ~다/~다가,...... high, ..., old book, ... 합동형Conjoint ~거나, ~든지, ~든가, ~나, ...Or, or, or, me, ... 분리형Detachable ~나, ~건만, ~나마, ~되, ...I, I, I, I, I, ... 대립Opposition ~므로, ~더니, ~나니, ...So, it's ... 원인종속Cause ~면, ~더라도, ~거든, ~았자, ...If, if, ... 조건종속 Condition ~려, ~고저, ...I'm going to ... 목적/의도 종속Purpose / Intention Dependency ~고저, ~면서, ...... high, low, ... 방식/정도 종속Method / degree dependence

예를 들어, "철수와 영희가 집에 갔고, 영민이도 같이 집에 갔다."의 문장이 존재하는 경우, 상기 텍스트 추출부(305)는 "철수와 영희가 집에 갔다"와 "영민이도 같이 집에 갔다."라는 문장이 어미 "고"에 의해 결합된 형태의 복문이다. 이때, 상기 텍스트 추출부(305)는 "갔고(원형 "가")"와 "갔다(원형 "가")"라는 키에 대해 합동형이라는 정보가 삽입된다.For example, if the sentence "Cheol and Young-hee went home, and Yeong-min went home," the text extractor 305, "Cheol and Young-hee went home" and "Young-min" Also went home together. " At this time, the text extraction unit 305 is inserted into the information "joint (circular" ga ")" and "go (circular" ga ")" is a joint type.

상기 감정 정보 추출의 경우, 상기 텍스트 추출부(305)는 상기 토픽 세그먼크(201)에서 구성된 각각의 토픽 세그먼트에 포함되는 주사건 및 보조사건에서 감정 정보를 추출한다. 이때, 상기 텍스트 추출부(305)는 하기 <표 8>과 같은 감정 사전을 이용하여 감정 정보를 추출한다.In the case of extracting the emotion information, the text extractor 305 extracts the emotion information from the scan cases and the auxiliary events included in each topic segment configured in the topic segment 201. In this case, the text extractor 305 extracts emotion information using an emotion dictionary as shown in Table 8 below.

감정관련 명사/동사Emotional nouns / verbs 감정 정보Emotion information 기쁘Glad PLEASUREPLEASURE 즐겁Happy PLEASUREPLEASURE 화나mad ANGERANGER 분하Split ANGERANGER ㅠㅠㅠㅠ SADNESSSADNESS 울fence SADNESSSADNESS …... …...

상기 정형화된 텍스트를 추출하는 경우, 상기 텍스트 추출부(305)는 상기 토픽 세그먼크(201)에서 구성된 각각의 토픽 세그먼트에 포함되는 주사건 및 보조사건에 포함된 정형화된 형식의 데이터에서 자질을 추출한다. 이때, 상기 텍스트 추출부(305)는 하기 <표 9>와 같은 정형화 패턴 사전을 이용하여 정형화된 형식의 데이터에서 자질을 추출한다.When extracting the standardized text, the text extraction unit 305 extracts a feature from data of a standardized form included in an injection case and an auxiliary event included in each topic segment configured in the topic segment 201. do. In this case, the text extracting unit 305 extracts a feature from the data in the standardized format using a standardized pattern dictionary as shown in Table 9 below.

카드사Card company 기본동사Basic verb 패턴pattern KB카드KB Card 지불하다Pay 1234|M월d일|h:m::\\[KB(?:체크|카드)\\](?:[^ ]+?)님 (?:.*?)카드 ([0-9]?[0-9]월[0-9]?[0-9]일)([0-9]?[0-9]\\:[0-9]?[0-9]) ([0-9,]+)원 ([^ ]+) 사용1234 | Mdd | H: m :: \\ [KB (?: Check | card) \\] (?: [^] +?) 'S card (?:. *?) ? [0-9] month [0-9]? [0-9] day) ([0-9]? [0-9] \\: [0-9]? [0-9]) ([0 -9,] +) using circle ([^] +) BC/NH카드BC / NH card 지불하다Pay 3124|M/d|h:m::\\[(?:.*?)승인\\] ([0-9,]+)원 [^ ]+ (?:[^ ]+?)님 ([0-9]?[0-9]\\/[0-9]?[0-9]) ([0-9]?[0-9]\\:[0-9]?[0-9]) ([^ ]+)$3124 | M / d | h: m :: \\ [(?:. *?) Approval \\] ([0-9,] +) 원 [^] + (?: [^] +?) [0-9]? [0-9] \\ / [0-9]? [0-9]) ([0-9]? [0-9] \\: [0-9]? [0- 9]) ([^] +) $ 신한카드Shinhan Card 지불하다Pay 1243|M월 d일|h:m::신한카드 (?:[^ ]+?) ([0-9]?[0-9]월 [0-9]?[0-9]일) ([0-9]?[0-9]\\:[0-9]?[0-9]) ([^ ]+) ([0-9,]+)원 (?:.*?)사용1243 | Md d | h: m :: Shinhan Card (?: [^] +?) ([0-9]? [0-9] Month [0-9]? [0-9])) ( [0-9]? [0-9] \\: [0-9]? [0-9]) ([^] +) ([0-9,] +) Using circle (?:. *?) 삼성카드Samsung Card 지불하다Pay 1243|M/d|h:m::\\[삼성카드 사용\\] ([0-9]?[0-9]\\/[0-9]?[0-9]) ([0-9]?[0-9]\\:[0-9]?[0-9]) ([^ ]+) ([0-9,]+) ([^ ]+)$1243 | M / d | h: m :: \\ [Use Samsung Card \\] ([0-9]? [0-9] \\ / [0-9]? [0-9]) ([0 -9]? [0-9] \\: [0-9]? [0-9]) ([^] +) ([0-9,] +) ([^] +) $ 롯데카드Lotte Card 지불하다Pay 3124|M/d|h:m::롯데카드 (?:[^ ]+?)님 ([0-9,]+)원 ([^ ]+) ([0-9]?[0-9]\\/[0-9]?[0-9]) ([0-9]?[0-9]\\:[0-9]?[0-9]) ([^ ]+)$3124 | M / d | h: m :: Lotte Card (?: [^] +?) Won ([0-9,] +) won ([^] +) ([0-9]? [0-9 ] \\ / [0-9]? [0-9]) ([0-9]? [0-9] \\: [0-9]? [0-9]) ([^] +) $ 하나SK카드Hana SK Card 지불하다Pay 1234|M/d|h:m::\\[하나SK카드\\]\$[0-9*]+\$ (?:[^ ]+?)님 ([0-9]?[0-9]\\/[0-9]?[0-9]) ([0-9]?[0-9]\\:[0-9]?[0-9]) ([^/]+)\\/([0-9,]+)원\\/승인\\/([^ ]+)$1234 | M / d | h: m :: \\ [Hana SK Card \\] \\ ([0-9 *] + \\) (?: [^] +?) [0-9] \\ / [0-9]? [0-9]) ([0-9]? [0-9] \\: [0-9]? [0-9]) ([^ /] +) \\ / ([0-9,] +) won \\ / approved \\ / ([^] +) $ 현대카드Hyundai Card 지불하다Pay 234||h:m::\\[현대카드[^]]*\\]\\-승인 (?:[^ ]+?)님 ([0-9]?[0-9]\\:[0-9]?[0-9]) ([0-9,]+)원\$[^)]+\$ ([^ ]+)$234 || h: m :: \\ [Hyundai Card [^]] * \\] \\-Approval (?: [^] +?) ([0-9]? [0-9] \\: [0-9]? [0-9]) ([0-9,] +) one \\ ([^)] + \\) ([^] +) $ …... …... …...

상기 사진 파일명을 추출하는 경우, 상기 텍스트 추출부(305)는 상기 토픽 세그먼크(201)에서 구성된 각각의 토픽 세그먼트에 포함되는 주사건 및 보조사건에 포함된 사진 관련 파일의 경로 정보와 태그 정보를 이용하여 개체명을 추출한다. When extracting the photo file name, the text extracting unit 305 extracts the path information and tag information of the photo-related file included in the scan gun and the sub-event included in each topic segment configured in the topic segment monk 201. Extract the entity name by using

상기 개체명 설정부(307)는 입력 데이터를 미리 정해진 개체명으로 설정한다. 예를 들어, 사진 로그 중 파일명 컬럼(column)의 데이터를 OBJECT라는 개체명으로 설정한다.The entity name setting unit 307 sets input data to a predetermined entity name. For example, set the data of the file name column in the photo log to the object name OBJECT.

상기 자질 추출부(203)는 추출한 자질 정보를 이용하여 각 토픽의 발생 빈도를 확인한다. 예를 들어, 상기 자질 추출부(203)에서 추출한 자질 중 개체명은 상기 <표 2>와 같이 해당 토픽 정보를 포함한다. 이에 따라, 상기 자질 추출부(203)는 추출한 개체명의 토픽 정보에 따라 토픽 발생 빈도를 갱신한다. 예를 들어, 상기 위치 정보 추출부(301)에서 추출한 주소가 "대한민국 제주도 서귀포시"이고, "제주도"와 "서귀포시"가 명사 사전에 여행이란 토픽으로 설정된 경우, 상기 자질 추출부(203)는 여행 토픽의 발생 빈도를 2로 설정한다.The feature extracting unit 203 checks occurrence frequency of each topic by using extracted feature information. For example, the entity name among the features extracted by the feature extraction unit 203 includes corresponding topic information as shown in Table 2 above. Accordingly, the feature extraction unit 203 updates the frequency of occurrence of the topic according to the extracted topic information. For example, when the address extracted by the location information extractor 301 is "Seogwipo-si, Jeju-do, Korea," and "Jeju-do" and "Seogwipo-si" are set to the topic of travel in the noun dictionary, the feature extractor 203 may travel. Set the frequency of occurrences of the topic to 2.

상술한 바와 같이 구성되는 자질 추출부(203)는 로그 데이터의 형식에 따라 구조가 달라질 수 있다. 이에 따라, 상기 도 3에 도시된 자질 추출부(203)의 구성은 로그 데이터의 형식에 따라 변경될 수 있다.The feature extraction unit 203 configured as described above may have a different structure depending on the format of log data. Accordingly, the configuration of the feature extraction unit 203 shown in FIG. 3 may be changed according to the format of log data.

상술한 바와 같이 구성되는 자질 추출부(203)에서 추출한 자질은 하기 <표 10>과 같은 형태로 관리될 수 있다.The features extracted by the feature extracting unit 203 configured as described above may be managed in the form as shown in Table 10 below.

구조체 이름Structure name 세부 정부Cebu Government 주요 자질 관리Main quality management 문장 구성 요소가 될 수 있는 동사와 개체명 관리Manage verbs and entity names that can be sentence components 부가 자질 관리Additional feature management 문장 구성 요소가 될 수는 없으나 토픽 결정에 영향을 주는 자질 관리A feature that cannot be a sentence component but affects topic decisions 문장 연결 어미 관리Sentence linking ending management 동사:동사 간 연결 관계 정보Verbs: linkage information between verbs 로그별 토픽 빈도 정보Topic frequency information by log 토픽 결정에 필요한 빈도 정보/데이터를 관리Manage frequency information / data required for topic determination

상기 <표 3>과 같이 상기 자질 추출부(203)에서 추출한 자질 중 문장 연결 어미와 로그별 토픽의 빈도 정보를 제외한 다른 자질 정보는 주요 자질과 부가 자질로 구분되어 관리된다. 예를 들어, 상기 소셜 정보 추출부(301), 상기 위치 정보 추출부(303), 상기 텍스트 추출부(305) 및 상기 개체명 설정부(307)는 각각 추출한 자질을 주요 자질 또는 부가 자질로 구분한다. 다른 예를 들어, 상기 자질 추출부(203)는 상기 소셜 정보 추출부(301), 상기 위치 정보 추출부(303), 상기 텍스트 추출부(305) 및 상기 개체명 설정부(307)에서 추출한 자질을 주요 자질 또는 부가 자질로 구분하는 자질 추출 후처리기를 포함하여 구성될 수도 있다.As shown in Table 3, among the features extracted by the feature extraction unit 203, other feature information except for the frequency information of the sentence connection ending and the log-specific topic are classified and managed as the main feature and the additional feature. For example, the social information extracting unit 301, the location information extracting unit 303, the text extracting unit 305, and the entity name setting unit 307 respectively classify the extracted features into main or additional features. do. In another example, the feature extracting unit 203 may extract features extracted from the social information extracting unit 301, the location information extracting unit 303, the text extracting unit 305, and the entity name setting unit 307. It may be configured to include a feature extraction post-processor to classify the main feature or additional features.

상술한 바와 같이 스토리 생성 장치는 적어도 하나의 전자기기로부터 수집한 로그 정보를 분석하여 사용자 일상 정보에 따른 에피소드를 결정한다. 이때, 상기 스토리 생성 장치는 하기 도 4에 도시된 바와 같이 사용자의 일상 정보에 따른 스토리를 생성한다.As described above, the story generating device determines the episode according to the user's daily information by analyzing log information collected from at least one electronic device. At this time, the story generating device generates a story according to the user's daily information, as shown in FIG.

도 4는 본 발명의 실시 예에 따른 사용자 정보에 따른 스토리 생성 절차를 도시하고 있다.4 illustrates a story generation procedure based on user information according to an exemplary embodiment of the present invention.

상기 도 4를 참조하면 상기 스토리 생성 장치는 401단계에서 적어도 하나의 전자 기기로부터 로그 정보를 수집한다. 예를 들어, 상기 스토리 생성 장치는 외부로부터 수신되는 데이터, 내부에서 생성되는 데이터 및 외부 저장소에 저장된 데이터를 이용하여 사용자의 일상 정보를 수집한다. 여기서, 상기 외부로부터 수신되는 데이터는 날씨, 위경도 정보, 블로그 정보 및 소셜 데이터 등을 포함한다. 상기 내부에서 생성되는 데이터는 통화기록, 단문 메시지(SMS), 멀티미디어 메시지(MMS), 위성 항법 장비(GPS)에서 측정된 위경도 정보, 일정 및 주소록 등을 포함한다. 상기 외부 저장소에 저장된 데이터는 이미지 촬영 영상, 재생한 음악 파일 목록, TV 시청 정보 등을 포함한다.Referring to FIG. 4, in operation 401, the story generating device collects log information from at least one electronic device. For example, the story generating device collects daily information of the user using data received from the outside, data generated inside, and data stored in an external storage. Here, the data received from the outside includes weather, latitude and longitude information, blog information, social data, and the like. The internally generated data includes call logs, short messages (SMS), multimedia messages (MMS), latitude and longitude information measured by satellite navigation equipment (GPS), schedules and address books. Data stored in the external storage includes an image photographed image, a list of played music files, TV viewing information, and the like.

이후, 상기 스토리 생성 장치는 403단계로 진행하여 상기 수집한 로그 정보를 이용하여 스토리를 구성하기 위한 후보 토픽들의 집합인 토픽 세그먼트를 구성한다. 예를 들어, 상기 스토리 생성 장치는 하기 도 5에 도시된 바와 같이 토픽 세그먼트를 구성한다. In operation 403, the story generating apparatus constructs a topic segment that is a set of candidate topics for composing a story using the collected log information. For example, the story generating device configures a topic segment as shown in FIG. 5.

상기 403단계에서 토픽 세그먼트를 구성한 후, 상기 스토리 생성 장치는 405단계로 진행하여 각각의 토픽 세그먼트의 주사건 및 보조 사건에서 자질을 추출한다. 예를 들어, 상기 스토리 생성 장치는 상기 도 3과 같이 구성되는 자질 추출부(203)를 이용하여 하기 도 6에 도시된 바와 같이 각 토픽 세그먼트에서 자질을 추출한다. 여기서, 자질은 문장 구성 요소가 될 수 있는 데이터를 나타낸다. 예를 들어, 상기 자질은 문장 구성 요소가 될 수 있는 동사, 개체명, 문장 연결 정보, 개체명과 연관된 토픽 정보 및 부가 정보를 포함한다After configuring the topic segment in step 403, the story generating apparatus proceeds to step 405 to extract the features from the scan gun and the auxiliary event of each topic segment. For example, the story generating apparatus extracts a feature from each topic segment as shown in FIG. 6 using the feature extracting unit 203 configured as shown in FIG. 3. Here, the qualities represent data that may be sentence components. For example, the qualities include verbs, entity names, sentence association information, topic information associated with entity names, and additional information which may be sentence elements.

각 토픽 세그먼트에서 자질을 추출한 후, 상기 스토리 생성 장치는 407단계로 진행하여 로그들의 토픽 발생 빈도를 고려하여 각각의 토픽 세그먼트에 대한 토픽을 결정한다. 예를 들어, 상기 스토리 생성 장치는 상기 <수학식 1>을 이용하여 각 토픽 세그먼트의 토픽을 결정한다.After extracting the feature from each topic segment, the story generating apparatus proceeds to step 407 to determine a topic for each topic segment in consideration of the frequency of occurrence of the topic in the logs. For example, the story generating apparatus determines a topic of each topic segment by using Equation 1.

각각의 토픽 세그먼트에 대한 토픽을 결정한 후, 상기 스토리 생성 장치는 409단계로 진행하여 각 시간 구간에서 최적의 토픽 세그먼트를 결정한다. 예를 들어, 동일한 시간 구간에 다수 개의 토픽 세그먼트들이 존재하는 경우, 상기 스토리 생성 장치는 각 토픽 세그먼트에 포함되는 토픽의 토픽 스코어를 비교하여 해당 시간 구간의 최적 토픽 세그먼트를 선택한다. 이때, 상기 스토리 생성 장치는 상기 최적의 토픽 세그먼트의 토픽을 해당 시간의 에피스드로 인식한다.After determining the topic for each topic segment, the story generating apparatus proceeds to step 409 to determine the optimal topic segment in each time interval. For example, when there are a plurality of topic segments in the same time interval, the story generating apparatus selects an optimal topic segment of the corresponding time interval by comparing topic scores of topics included in each topic segment. In this case, the story generating apparatus recognizes the topic of the optimal topic segment as an epid of the corresponding time.

이후, 상기 스토리 생성 장치는 411단계로 진행하여 동일한 토픽을 갖는 토픽 세그먼트를 하나의 토픽 세그먼트로 병합한다. 예를 들어, 연속된 토픽 세그먼트들의 토픽이 동일한 경우, 상기 스토리 생성 장치는 상기 연속된 토픽 세그먼트들을 하나의 토픽 세그먼트로 병합한다. 다른 예를 들어, 동일한 시간 구간의 토픽 세그먼트들의 토픽이 동일한 경우, 상기 스토리 생성 장치는 상기 토픽 세그먼트들을 하나의 토픽 세그먼트로 병합한다.In operation 411, the story generating apparatus merges the topic segments having the same topic into one topic segment. For example, if the topics of consecutive topic segments are the same, the story generating device merges the consecutive topic segments into one topic segment. In another example, when the topics of the same segment of the same time interval are the same, the story generating apparatus merges the topic segments into one topic segment.

상기 동일한 토픽을 갖는 토픽 세그먼트를 병합한 후, 상기 스토리 생성 장치는 413단계로 진행하여 시간별 스토리를 생성할 것인지 결정한다.After merging the topic segments having the same topic, the story generating apparatus proceeds to step 413 and determines whether to generate a story over time.

시간별 스토리를 생성하는 것으로 결정한 경우, 상기 스토리 생성 장치는 415단계로 진행하여 상기 411단계에서 결정한 에피소드를 이용하여 시간별로 각각의 스토리를 생성한다. 예를 들어, 상기 스토리 생성 장치는 하기 도 7에 도시된 바와 같이 시간별 스토리를 생성한다.If it is determined to generate the hourly story, the story generating apparatus proceeds to step 415 to generate each story by time using the episode determined in step 411. For example, the story generating device generates a story by time as shown in FIG. 7.

이후, 상기 스토리 생성 장치는 417단계로 진행하여 상기 시간별로 생성한 스토리를 표시부에 표시한다. 예를 들어, 상기 스토리 생성 장치는 도 9의 (a)에 도시된 바와 같이 시간별로 생성한 스토리를 표시부에 표시한다. 이때, 상기 스토리 생성 장치는 텍스트 형식의 문장뿐만 아니라 문장과 관련된 멀티미디어 정보를 함께 출력할 수도 있다.In operation 417, the story generating apparatus displays the story generated by the time on the display unit. For example, the story generating apparatus displays a story generated by time as shown in FIG. 9A on the display unit. In this case, the story generating device may output not only a text-type sentence but also multimedia information related to the sentence.

한편, 시간별 스토리를 생성하지 않는 것으로 결정한 경우, 상기 스토리 생성 장치는 주제별 스토리를 생성하는 것으로 인식한다. 이에 따라, 상기 스토리 생성 장치는 419단계로 진행하여 주토픽을 결정한다. 즉, 상기 스토리 생성 장치는 각 시간 구간의 에피스드를 고려하여 사용자 일정 정보의 가장 중요한 특징을 나타낼 수 있는 주토픽을 결정한다. 예를 들어, 상기 스토리 생성장치는 상기 <수학식 3>을 이용하여 주토픽을 결정한다.On the other hand, if it is decided not to generate a story by time, the story generating device recognizes that the story by theme is generated. Accordingly, the story generating apparatus proceeds to step 419 to determine the main topic. That is, the story generation device determines the main topic that can represent the most important feature of the user schedule information in consideration of the episodes of each time section. For example, the story generating apparatus determines a main topic by using Equation 3.

상기 주토픽을 결정한 후, 상기 스토리 생성 장치는 421단계로 진행하여 주토픽에 관련된 에피소드 내 자질을 병합하여 하나의 주토픽에 관련된 스토리를 생성한다. 이때, 상기 스토리 생성 장치는 주토픽뿐만 아니라 날씨, 사용자의 감정에 관련된 스토리를 생성할 수도 있다. 예를 들어, 상기 스토리 생성 장치는 하기 도 8에 도시된 바와 같이 주제별 스토리를 생성한다.After determining the main topic, the story generating apparatus proceeds to step 421 to merge the qualities in the episode related to the main topic to generate a story related to one main topic. In this case, the story generating device may generate a story related to the weather, the user's emotion as well as the main topic. For example, the story generating device generates a story by theme as shown in FIG. 8.

이후, 상기 스토리 생성 장치는 상기 417단계로 진행하여 상기 주제별로 생성한 스토리를 표시부에 표시한다. 예를 들어, 상기 스토리 생성 장치는 도 9의 (b)에 도시된 바와 같이 주제별로 생성한 스토리를 표시부에 표시한다. 이때, 상기 스토리 생성 장치는 텍스트 형식의 문장뿐만 아니라 문장과 관련된 멀티미디어 정보를 함께 출력할 수도 있다.In operation 417, the story generating apparatus displays the story generated for each subject on the display unit. For example, the story generating apparatus displays a story generated for each subject on the display unit as illustrated in FIG. 9B. In this case, the story generating device may output not only a text-type sentence but also multimedia information related to the sentence.

이후, 상기 스토리 생성 장치는 본 알고리즘을 종료한다.Thereafter, the story generating device ends the present algorithm.

도 5는 본 발명의 실시 예에 따른 토픽 세그먼트를 구성하기 위한 절차를 도시하고 있다.5 illustrates a procedure for configuring a topic segment according to an embodiment of the present invention.

상기 도 5를 참조하면, 상기 도 4의 401단계에서 로그 정보를 수집한 후, 상기 스토리 생성 장치는 501단계로 진행하여 주 사건이 존재하는지 확인한다. 여기서, 상기 주사건은 상기 수집한 사용자 일정 정보 중 스토리 구성하는데 주체가 되는 일정 정보를 나타낸다.Referring to FIG. 5, after collecting log information in step 401 of FIG. 4, the story generating apparatus checks whether a main event exists in step 501. In this case, the scan gun represents schedule information that is mainly used to construct a story among the collected user schedule information.

주사건이 존재하지 않는 경우, 상기 스토리 생성 장치는 507단계로 진행하여 더미 세그먼트를 생성할 것인지 확인한다.If there is no scan gun, the story generating apparatus proceeds to step 507 and determines whether to generate a dummy segment.

한편, 주 사건이 존재하는 경우, 상기 스토리 생성 장치는 503단계로 진행하여 주 사건으로 토픽 세그먼트를 구성한다. 예를 들어, 일정 정보가 주 사건인 경우, 상기 스토리 생성 장치는 일정 정보를 시간 순으로 나열하고 각각의 일정 정보를 토픽 세그먼트로 구성한다. 만일, 주 사건의 발생 시간이 중복되는 경우, 상기 스토리 생성 장치는 짧은 시간 구간의 주 사건을 기준으로 토픽 세그먼트를 분할한다. 예를 들어, 13시부터 14시까지 일정 A가 있고, 13시부터 15시까지 일정 B가 있는 경우, 상기 스토리 생성 장치는 13시부터 14시까지 일정 A와 일정 B 각각에 대한 토픽 세그먼트를 구성한다. 또한, 상기 스토리 생성 장치는 14시부터 15시까지 일정 B에 대한 토픽 세그먼트를 구성한다.On the other hand, if there is a main event, the story generating device proceeds to step 503 to form a topic segment with the main event. For example, when the schedule information is a main event, the story generating device lists the schedule information in chronological order and organizes each schedule information into topic segments. If the occurrence time of the main event overlaps, the story generating device divides the topic segment based on the main event of the short time interval. For example, if there is a schedule A from 13:00 to 14:00 and a schedule B from 13:00 to 15:00, the story generating device constructs a topic segment for each of schedules A and B from 13:00 to 14:00. do. In addition, the story generating device configures a topic segment for schedule B from 14:00 to 15:00.

토픽 세그먼트를 구성한 후, 상기 스토리 생성 장치는 505단계로 진행하여 보조 사건의 발생 시간이 토픽 세그먼트의 시간 구간에 포함되는 경우, 해당 보조 사건을 상기 토픽 세그먼트에 매핑한다. 여기서, 상기 보조 사건은 상기 수집한 사용자 일정 정보에서 상기 주사건을 제외한 정보를 나타낸다. After configuring the topic segment, the story generating apparatus proceeds to step 505 and, when the occurrence time of the auxiliary event is included in the time segment of the topic segment, maps the corresponding auxiliary event to the topic segment. Here, the auxiliary event represents information excluding the scan gun from the collected user schedule information.

이후, 상기 스토리 생성 장치는 상기 507단계로 진행하여 더미 세그먼트를 생성할 것인지 확인한다. 예를 들어, 상기 스토리 생성 장치는 토픽 세그먼트에 매핑되지 않은 보조 사건을 이용하여 더미 세그먼트를 생성한다. 이에 따라, 상기 스토리 생성장치는 토픽 세그먼트에 매핑되지 않은 보조 사건이 존재하는지 확인한다.In operation 507, the story generating apparatus determines whether to generate a dummy segment. For example, the story generating device generates a dummy segment using an auxiliary event not mapped to a topic segment. Accordingly, the story generating apparatus checks whether there is an auxiliary event not mapped to the topic segment.

토픽 세그먼트에 매핑되지 않은 보조 사건이 존재하지 않는 경우, 상기 스토리 생성 장치는 더미 세그먼트를 생성하지 않는 것으로 인식한다. 이에 따라 상기 스토리 생성 장치는 405단계로 진행하여 각각의 토픽 세그먼트에서 자질을 추출한다.If there is no auxiliary event not mapped to the topic segment, the story generating device recognizes that it does not generate a dummy segment. Accordingly, the story generating apparatus proceeds to step 405 to extract the features from each topic segment.

한편, 토픽 세그먼트에 매핑되지 않은 보조 사건이 존재하는 경우, 상기 스토리 생성 장치는 509단계로 진행하여 토픽 세그먼트에 매핑되지 않은 보조 사건을 이용하여 더미 세그먼트를 생성한다. 이때, 상기 더미 세그먼크는 기설정된 시간 구간으로 설정된다. On the other hand, if there is an auxiliary event not mapped to the topic segment, the story generating device proceeds to step 509 to generate a dummy segment using the auxiliary event not mapped to the topic segment. In this case, the dummy segment is set to a predetermined time interval.

이후, 상기 스토리 생성 장치는 405단계로 진행하여 각각의 토픽 세그먼트에서 자질을 추출한다.In operation 405, the story generating apparatus extracts a feature from each topic segment.

이하 설명은 상기 텍스트 추출부(305)에서 텍스트 형식의 데이터에서 자질을 추출하기 위한 방법에 대해 설명한다. Hereinafter, a description will be given of a method for extracting a feature from text data in the text extractor 305.

도 6은 본 발명의 실시 예에 따른 개체 인식 절차를 도시하고 있다.6 illustrates an object recognition procedure according to an embodiment of the present invention.

상기 도 6을 참조하면 상기 도 4의 403단계에서 토픽 세그먼트를 구성한 후, 상기 스토리 생성 장치는 601단계로 진행하여 개체명 인식기를 이용하여 개체명 인식을 수행할 것인지 결정한다. 예를 들어, 상기 스토리 생성 장치는 상기 개채명 인식기를 통한 개체명 인식이 필요한 후보 단어가 존재하는지 확인한다. Referring to FIG. 6, after configuring a topic segment in step 403 of FIG. 4, the story generating apparatus determines in step 601 whether to perform object name recognition using an object name recognizer. For example, the story generating apparatus checks whether there is a candidate word for entity name recognition through the individual name recognizer.

상기 개체명 인식기를 이용하여 개체명 인식을 수행하지 않는 경우, 상기 스토리 생성 장치는 개체명 인식이 필요한 후보 단말이 존재하지 않는 것으로 인식한다. 이에 따라, 상기 스토리 생성 장치는 본 알고리즘을 종료한다. 즉, 상기 스토리 생성 장치는 407단계로 진행하여 각 토픽 세그먼트의 토픽을 결정한다.If the entity name recognition is not performed using the entity name recognizer, the story generating apparatus recognizes that there is no candidate terminal for entity name recognition. Accordingly, the story generating device ends the present algorithm. In other words, the story generating apparatus proceeds to step 407 to determine the topic of each topic segment.

한편, 개체명 인식을 수행하는 경우, 상기 스토리 생성 장치는 개체명 인식이 필요한 후보 단말이 존재하는 것으로 인식한다. 이에 따라, 상기 스토리 생성 장치는 603단계로 진행하여 개체명 인식을 수행할 단어를 확인한다. 예를 들어, "철수와 영희가 집에 갔다"의 텍스트에서 자질을 추출하는 경우, 상기 스토리 생성 장치는 개체명 인식을 위해 "철수와"를 확인한다.Meanwhile, when performing entity name recognition, the story generating device recognizes that there is a candidate terminal requiring entity name recognition. In step 603, the story generating apparatus checks a word for object name recognition. For example, when extracting the qualities from the text of "Cheol and Alice went home," the story generating apparatus checks "Cheolwa" for object name recognition.

이후, 상기 스토리 생성 장치는 605단계로 진행하여 상기 603단계에서 확인한 단어의 개체명을 확인한다. 예를 들어, "철수와"의 개체명을 확인하는 경우, 상기 스토리 생성 장치는 "철수와"에서 조사인 "와"를 제거한 후, 상기 "철수"가 명사 사전에 포함되는지 확인한다. 만일, 상기 "철수"가 명사 사전에 포함되지 않는 경우, 상기 스토리 생성 장치는 상기 <표 4>와 같이 구성되는 개체명 패턴 사전에 상기 "철수"가 포함되는지 확인할 수도 있다. In operation 605, the story generating apparatus checks the entity name of the word identified in operation 603. For example, in the case of confirming the entity name of "heolhwa", the story generating apparatus checks whether "heol" is included in the noun dictionary after removing "he" which is a survey from "heolhwa". If the "withdrawal" is not included in the noun dictionary, the story generation device may check whether the "withdrawal" is included in the entity name pattern dictionary configured as shown in Table 4 below.

상기 603단계에서 확인한 단어의 개체명을 확인한 경우, 상기 스토리 생성 장치는 611단계로 진행하여 상기 인식한 개체명의 토픽의 발생 빈도를 갱신한다. 예를 들어, "친구"라는 단어의 PERSON 개체명을 확인한 경우, 상기 스토리 생성 장치는 PERSON의 토픽인 "생활"과 "소풍"의 발생 빈도를 하나 추가한다.When the entity name of the word checked in step 603 is confirmed, the story generating apparatus proceeds to step 611 to update a frequency of occurrence of the topic of the recognized entity name. For example, when the PERSON entity name of the word "friend" is confirmed, the story generating device adds one occurrence frequency of "life" and "excursion" which are topics of PERSON.

한편, 단어의 개체명을 확인할 수 없는 경우, 상기 스토리 생성 장치는 607단계로 진행하여 상기 단어가 동사 사전 또는 동사 활용형 패턴 사전에 포함되는지 확인한다.On the other hand, if the entity name of the word cannot be confirmed, the story generating apparatus proceeds to step 607 to check whether the word is included in the verb dictionary or verb utilization pattern dictionary.

상기 단어가 동사 사전 또는 동사 활용형 패턴 사전에 포함되지 않는 경우, 상기 스토리 생성 장치는 상기 단어에서 자질을 추출할 수 없는 것으로 인식한다. 이에 따라, 상기 스토리 생성 장치는 609단계로 진행하여 다음 단어가 존재하는지 확인한다.If the word is not included in the verb dictionary or the verb utilization pattern dictionary, the story generating device recognizes that the feature cannot be extracted from the word. Accordingly, the story generating apparatus proceeds to step 609 to check whether the next word exists.

상기 607단계에서 단어가 동사 사전 또는 동사 활용형 패턴 사전에 포함되는 경우, 상기 스토리 생성 장치는 동사 추정을 성공한 것으로 인식한다. 이에 따라, 상기 스토리 생성 장치는 611단계로 진행하여 상기 동사 정보에 따른 토픽의 발생 빈도를 갱신한다.When the word is included in the verb dictionary or the verb utilization pattern dictionary in step 607, the story generating device recognizes that the verb estimation is successful. Accordingly, the story generating apparatus proceeds to step 611 to update the frequency of occurrence of the topic according to the verb information.

이후, 상기 스토리 생성 장치는 609단계로 진행하여 다음 단어가 존재하는지 확인한다.In operation 609, the story generating apparatus determines whether a next word exists.

다음 단어가 존재하지 않는 경우, 상기 스토리 생성 장치는 모든 자질을 추출한 것으로 인식한다. 이에 따라, 상기 스토리 생성 장치는 상기 407단계로 진행하여 각 토픽 세그먼트의 토픽을 결정한다.If the next word does not exist, the story generating device recognizes that all the qualities are extracted. Accordingly, the story generating apparatus proceeds to step 407 to determine the topic of each topic segment.

한편, 다음 단어가 존재하는 경우, 상기 스토리 생성 장치는 상기 603단계로 진행하여 자질을 추출할 다음 단어를 확인한다.On the other hand, if there is a next word, the story generating apparatus proceeds to step 603 to identify the next word to extract the feature.

미 도시되었지만, 상기 스토리 생성 장치는 인식한 개체명에 동사가 포함되는지 여부를 확인한다. 예를 들어, "철수와 영희가 집에 갔다"의 텍스트에서 "철수", "영희", "집"의 명사와 "갔다"의 동사를 추출한 경우, 상기 스토리 생성 장치는 "철수", "영희", "집"의 명사는 "갔다"라는 동사를 포함하는 것으로 인식한다. 하지만, 인식한 개체명에 동사가 포함되지 않은 경우, 상기 스토리 생성 장치는 기본 동사 사전에서 상기 개체명에 추가할 동사를 추정한다.Although not shown, the story generating apparatus checks whether the verb is included in the recognized entity name. For example, in the case of extracting the nouns of "Abstract", "Younghee", "House", and the verb "Go" from the text of "Cheon and Young-Hee went home," the story generating device is "Hole", "Young-Hee". The noun of "house" is understood to include the verb "go". However, when no verb is included in the recognized entity name, the story generating device estimates a verb to be added to the entity name in the basic verb dictionary.

도 7은 본 발명의 실시 예에 따른 시간별로 스토리를 생성하기 위한 절차를 도시하고 있다.7 illustrates a procedure for generating a story by time according to an embodiment of the present invention.

상기 도 7을 참조하면 상기 419단계에서 시간별로 스토리를 생성하는 것으로 결정한 경우, 상기 스토리 생성 장치는 701단계에서 각 에피소드의 주요 자질을 이용하여 각 에피소드에 대한 중간 문장을 생성한다. 예를 들어, 상기 스토리 생성 장치는 에피소드의 주요 자질에서 토픽과 동사를 확인한다. 이후, 상기 스토리 생성 장치는 하기 <표 11>에서 상기 토픽과 동사에 해당하는 문장 템플릿을 선택한다. 여기서, 상기 문장 템플릿은 특정 토픽과 동사에 의해 구성되는 문장 형식을 나타낸다. Referring to FIG. 7, if it is determined in step 419 that the story is generated by time, the story generating device generates an intermediate sentence for each episode using the main features of each episode in step 701. For example, the story generating device identifies topics and verbs in the main qualities of the episode. Subsequently, the story generating device selects a sentence template corresponding to the topic and a verb in Table 11 below. Here, the sentence template indicates a sentence format constituted by a specific topic and a verb.

토픽 Topic 동사verb 유의동사A significant verb 문장 템플릿Heraldry template 필수격Required 공부study 전해주Convey 건내주Counting <DAY> <TIME>에 <LOCATION>에서 <OBJECT><을>/전해줬You gave <OBJECT> <> from <LOCATION> on <DAY> <TIME> OBJECTOBJECT 공부study 건내주Counting 전해주Convey <DAY> <TIME>에 <LOCATION>에서 <OBJECT><을>/건내줬Passed <OBJECT> <> from <LOCATION> on <DAY> <TIME> OBJECTOBJECT 입시입사, 여행,체육Admission, Travel, Sports 집합하Gathering 소집하Convene <ORG> <LOCATION>로/집합했<ORG> to <LOCATION> / set LOCATIONLOCATION 입시입사Entrance examination 통과하Through 붙, 합격하Stuck, passing <DAY> <TIME>에 <PERSON><이> <OBJECT><을>/통과했<PERSON> <this> <OBJECT> <> passed on <DAY> <TIME> OBJECTOBJECT 다이어트Diet 운동하Exercise <DAY> <TIME>에 <LOCATION>에서 <PERSON><와> <OBJECT><으로>/운동했Worked from <LOCATION> to <PERSON> <and> <OBJECT> <> on <DAY> <TIME> $$ 휴일holiday 휴일에 여자친구와 데이트 약속을 잡았다I have a date with my girlfriend on a holiday. $$ 분노anger 오늘은 화가 나는 일이 있었다I was angry today …... …... …... …... …...

상기 <표 11>에서 첫 번째 열(colunm)은 토픽을 나타낸다. 이때, ＄는 기본 문장 생성에 이용되는 문장 템플릿을 나타낸다. 두 번째 열은 동사를 나타내며, 세 번째 열은 두 번째 열의 동사와 동일한 의미를 가지면서 표현 방식만 다른 유사 동사를 나타낸다. 네 번째 열은 문장 템플릿으로 동사와 토픽에 따라 구성된다. 다섯 번째 열은 문장 템플릿이 사용될 때 반드시 필요한 개체명 정보를 나타낸다. 예를 들어, 에피소드의 토픽이 "공부"이고 동사가 "전해주다"인 경우, 상기 스토리 생성 장치는 상기 <표 11>에 따라 "<DAY> <TIME>에 <LOCATION>에서 <OBJECT><을>/전해줬"과 같은 문장 템플릿을 해당 에피소드의 중간 문장으로 인식한다. 이때, 상기 스토리 생성 장치는 상기 <DAY>, <TIME>, <LOCATION> 및 <OBJECT> 중 어느 하나를 필수적으로 필요로 한다. 예를 들어, <OBJECT>를 필수적으로 가져야 하는 경우, 필수적으로 <OBJECT> 개체명을 포함하여 중간 문장을 생성한다. 만일, 필수적으로 필요한 <OBJECT> 개체명이 없는 경우, 상기 스토리 생성 장치는 문장 템플릿 생성 조건을 만족하지 않으므로 에피소드를 구성하지 않는다. In Table 11, the first column indicates a topic. In this case, ＄ represents a sentence template used for generating a basic sentence. The second column represents verbs, and the third column represents similar verbs with the same meaning as the verbs in the second column, but with different expressions. The fourth column is a sentence template, organized by verb and topic. The fifth column shows entity name information that is essential when a sentence template is used. For example, if the topic of the episode is "study" and the verb is "tell", the story generating device may change <OBJECT> <from <LOCATION> to "<DAY> <TIME> according to <Table 11>. Recognize a sentence template, such as> / ", as the middle sentence of the episode. In this case, the story generating device essentially requires any one of the <DAY>, <TIME>, <LOCATION>, and <OBJECT>. For example, if it is necessary to have an <OBJECT>, then an intermediate statement is created, including the <OBJECT> object name. If there is no required <OBJECT> entity name, the story generating device does not form an episode because the story template generation condition is not satisfied.

추가적으로, 상기 스토리 생성 장치는 문장 생성의 다양성을 제공하기 위해 에피소드의 동사가 아닌 유사 동사로 중간 문장을 생성할 수도 있다. 예를 들어, 에피소드의 토픽이 "공부"이고 동사가 "전해주다"인 경우, 상기 스토리 생성 장치는 상기 "전해주다"의 유의 동사인 "건내주"를 이용하여 "<DAY> <TIME>에 <LOCATION>에서 <OBJECT><을>/건네줬"과 같은 문장 템플릿을 해당 에피소드의 중간 문장으로 인식할 수도 있다.Additionally, the story generation device may generate intermediate sentences with similar verbs rather than verbs of episodes to provide diversity of sentence generation. For example, if the topic of the episode is "study" and the verb is "tell", the story generating device uses "Kun-na-ju", which is a significant verb of "tell", to "<DAY> <TIME>". Sentence templates such as <OBJECT> <and / handed> in <LOCATION> may be recognized as intermediate sentences of the episode.

에피소드에 대한 중간 문장을 생성한 후, 상기 스토리 생성 장치는 703단계로 진행하여 중복되는 문장을 제거한다. 예를 들어, “학교에/갔/가”과 “오늘 영희랑 학교에/갔/가"의 중간 문장을 생성한 경우, 상기 스토리 생성 장치는 “오늘 영희랑 학교에/갔/가"의 중간 문장이 “학교에/갔/가"의 중간 문장을 포괄하기 때문에 “학교에/갔/가"를 삭제한다. After generating the intermediate sentence for the episode, the story generating apparatus proceeds to step 703 to remove the duplicate sentence. For example, if a middle sentence between “going to / from school” and “to go to / from school today” is generated, the story generating device may be in the middle of “to / from / going to school with today”. Delete "go to school" because the sentence covers the middle sentence of "go / go".

중복되는 문장을 제거한 후, 상기 스토리 생성 장치는 705단계로 진행하여 중간 문장을 완전한 형태의 문장으로 변환한다. 즉, 상기 스토리 생성 장치는 어미 정보와 문장 연결 정보를 이용하여 중간 문장을 완전한 형태의 문장으로 변환한다. 예를 들어, “오늘 학교에서/시험쳤/시험치”와 “철수와/농구했/농구하”라는 중간 문장이 존재하고, 문장 연결 정보가 합동인 경우, 상기 스토리 생성 장치는 상기 두 문장을 하나의 문장으로 연결한다. 이때, 상기 스토리 생성 장치는 상기 <표 7>의 문장 연결어미 사전을 이용하여 두 문장을 연결할 문장 연결 어미를 선택한다. 이후, 상기 스토리 생성 장치는 종결 어미를 선택하여 "오늘 학교에서 시험쳤고, 철수와 농구했다"라는 완전한 문장을 생성한다. 이때, 상기 스토리 생성 장치는 지역, 성별 및 나이 중 적어도 하나를 고려하여 종결 어미를 선택할 수 있다. 만일, 연령별로 종결 어미를 다르게 설정하는 경우, 상기 스토리 생성 장치는 하기 <표 12>를 이용하여 종결 어미를 선택할 수 있다.After removing the duplicate sentences, the story generating apparatus proceeds to step 705 to convert the intermediate sentences into complete sentences. That is, the story generating device converts the intermediate sentence into a complete sentence using the mother information and the sentence connection information. For example, if there is an intermediate sentence of “today at school / tested / tested” and “withdrawn / basketballed / basketballed” and sentence linking information is conjoined, the story generating device may interpret the two sentences. Concatenate one sentence. In this case, the story generating apparatus selects a sentence connecting ending to connect two sentences using the sentence connecting ending dictionary of <Table 7>. The story generating device then selects the ending mother to generate a complete sentence, "tested at school today, withdrawn and basketball." In this case, the story generating device may select the ending ending in consideration of at least one of region, gender and age. If the ending ending is set differently for each age, the story generating device may select the ending ending using the following Table 12.

ageage sexsex eomieomi 30하30 ha 남south 삼 어요 엉 쪄요 어 당 음 습니다 답니다I'm eating it. 30하30 ha 여female 쪄요 엉 어요 당 삼 어 징 답니다 습니다 어용 네요I'm using it. 30상30 phases 남south 더라구 네 답니다 구만요 네 음 습니다 다우 니라Yes, yes, yes, yes. 30상30 phases 여female 더라구 다우 구나 어요 어야지요 사옵니다 네요I'm buying it.

한편, 지역별로 종결 어미를 다르게 설정하는 경우, 상기 스토리 생성 장치는 하기 <표 13>을 이용하여 종결 어미를 선택할 수 있다.On the other hand, when the ending ending is set differently for each region, the story generating device may select the ending ending using the following <Table 13>.

locloc eomieomi 강원Gangwon 다All 경기game 다All 전라Jeolla 다 어라 당께All done 충청Chungcheong 다 슈 어유Dashu fish oil 경상Ordinary 다 다아임니꺼 어라예It's all daimyeo 제주Jeju 다All

상기 705단계에서 중간 문장을 완전한 문장으로 변환한 후, 상기 스토리 생성 장치는 상기 417단계로 진행하여 시간별로 생성한 문장을 사용자가 확인할 수 있도록 표시부에 표시한다.After converting the intermediate sentence into a complete sentence in step 705, the story generating apparatus proceeds to step 417 to display the sentence generated by the user on the display unit for checking.

도 8은 본 발명의 실시 예에 따른 주제별로 스토리를 생성하기 위한 절차를 도시하고 있다.8 illustrates a procedure for generating a story for each topic according to an embodiment of the present invention.

상기 도 8을 참조하면 상기 419단계에서 주제별로 스토리를 생성하는 것으로 결정한 경우, 상기 스토리 생성 장치는 801단계에서 주 토픽에 관련된 에피소드들의 자질을 수집한다. 이 경우, 상기 스토리 생성 장치는 동일한 토픽이지만 발생 시간이 다른 에피소드들을 하나의 에피소드로 병합한다.Referring to FIG. 8, if it is determined in step 419 that a story is generated for each topic, the story generating device collects the quality of episodes related to the main topic in step 801. In this case, the story generating apparatus merges episodes of the same topic but having different occurrence times into one episode.

이후, 상기 스토리 생성 장치는 803단계로 진행하여 상기 수집한 자질을 이용하여 중간 문장을 생성한다. 예를 들어, 상기 스토리 생성 장치는 상기 수집한 자질에서 토픽과 동사를 확인한다. 이후, 상기 스토리 생성 장치는 상기 <표 11>에서 상기 토픽과 동사에 해당하는 문장 템플릿을 선택한다. 만일, 에피소드의 토픽이 "공부"이고 동사가 "전해주다"인 경우, 상기 스토리 생성 장치는 상기 <표 11>에 따라 "<DAY> <TIME>에 <LOCATION>에서 <OBJECT><을>/전해줬"과 같은 문장 템플릿을 해당 에피소드의 중간 문장으로 인식한다. 이때, 상기 스토리 생성 장치는 상기 <DAY>, <TIME>, <LOCATION> 및 <OBJECT> 중 어느 하나를 필수적으로 필요로 한다. 예를 들어, <OBJECT>를 필수적으로 가져야 하는 경우, 필수적으로 <OBJECT> 개체명을 포함하여 중간 문장을 생성한다. 만일, 필수적으로 필요한 <OBJECT> 개체명이 없는 경우, 상기 스토리 생성 장치는 문장 템플릿 생성 조건을 만족하지 않으므로 에피소드를 구성하지 않는다. In operation 803, the story generating apparatus generates an intermediate sentence using the collected features. For example, the story generating device identifies a topic and a verb from the collected qualities. Subsequently, the story generating device selects a sentence template corresponding to the topic and a verb from the <Table 11>. If the topic of the episode is "Study" and the verb is "tell", the story generating device uses <OBJECT> <<> in <LOCATION> at << DAY> <TIME> according to <Table 11>. Sentence template, such as "received" as the middle sentence of the episode. In this case, the story generating device essentially requires any one of the <DAY>, <TIME>, <LOCATION>, and <OBJECT>. For example, if it is necessary to have an <OBJECT>, then an intermediate statement is created, including the <OBJECT> object name. If there is no required <OBJECT> entity name, the story generating device does not form an episode because the story template generation condition is not satisfied.

에피소드에 대한 중간 문장을 생성한 후, 상기 스토리 생성 장치는 805단계로 진행하여 중복되는 문장을 제거한다. 예를 들어, “학교에/갔/가”과 “오늘 영희랑 학교에/갔/가"의 중간 문장을 생성한 경우, 상기 스토리 생성 장치는 “오늘 영희랑 학교에/갔/가"의 중간 문장이 “학교에/갔/가"의 중간 문장을 포괄하기 때문에 “학교에/갔/가"를 삭제한다. After generating the intermediate sentence for the episode, the story generating device proceeds to step 805 to remove the duplicate sentence. For example, if a middle sentence between “going to / from school” and “to go to / from school today” is generated, the story generating device may be in the middle of “to / from / going to school with today”. Delete "go to school" because the sentence covers the middle sentence of "go / go".

중복되는 문장을 제거한 후, 상기 스토리 생성 장치는 807단계로 진행하여 중간 문장을 완전한 형태의 문장으로 변환한다. 즉, 상기 스토리 생성 장치는 어미 정보와 문장 연결 정보를 이용하여 중간 문장을 완전한 형태의 문장으로 변환한다. 예를 들어, “오늘 학교에서/시험쳤/시험치”와 “철수와/농구했/농구하”라는 중간 문장이 존재하고, 문장 연결 정보가 합동인 경우, 상기 스토리 생성 장치는 상기 두 문장을 하나의 문장으로 연결한다. 이때, 상기 스토리 생성 장치는 상기 <표 7>의 문장 연결어미 사전을 이용하여 두 문장을 연결할 문장 연결 어미를 선택한다. 이후, 상기 스토리 생성 장치는 종결 어미를 선택하여 "오늘 학교에서 시험쳤고, 철수와 농구했다"라는 완전한 문장을 생성한다. 이때, 상기 스토리 생성 장치는 지역, 성별 및 나이 중 적어도 하나를 고려하여 종결 어미를 선택할 수 있다. 만일, 연령별로 종결 어미를 다르게 설정하는 경우, 상기 스토리 생성 장치는 하기 <표 12>를 이용하여 종결 어미를 선택할 수 있다.After removing the duplicate sentences, the story generating apparatus proceeds to step 807 and converts the intermediate sentences into complete sentences. That is, the story generating device converts the intermediate sentence into a complete sentence using the mother information and the sentence connection information. For example, if there is an intermediate sentence of “today at school / tested / tested” and “withdrawn / basketballed / basketballed” and sentence linking information is conjoined, the story generating device may interpret the two sentences. Concatenate one sentence. In this case, the story generating apparatus selects a sentence connecting ending to connect two sentences using the sentence connecting ending dictionary of <Table 7>. The story generating device then selects the ending mother to generate a complete sentence, "tested at school today, withdrawn and basketball." In this case, the story generating device may select the ending ending in consideration of at least one of region, gender and age. If the ending ending is set differently for each age, the story generating device may select the ending ending using the following Table 12.

상기 807단계에서 중간 문장을 완전한 문장으로 변환한 후, 상기 스토리 생성 장치는 상기 417단계로 진행하여 주제별로 생성한 문장을 사용자가 확인할 수 있도록 표시부에 표시한다. After converting the intermediate sentence into a complete sentence in step 807, the story generating apparatus proceeds to step 417 and displays the sentence generated by the subject on the display unit for the user to check.

상술한 바와 같이 주제별 스토리를 생성하는 경우, 상기 스토리 생성 장치는 주토픽뿐만 아니라 날씨, 사용자의 감정에 관련된 스토리를 함께 생성할 수도 있다.As described above, when generating a story by theme, the story generating device may generate a story related to weather and user emotion as well as a main topic.

상술한 실시 예에서 이동 장치는 사용자 정보 수집, 사용자 정보 분석 및 스토리 생성을 모두 수행한다. 이때, 상기 이동 장치는 사용자 정보를 축척하여 상기 사용자의 행동 패턴을 추정할 수 있다. 이에 따라, 상기 이동장치는 사용자의 행동 패턴을 요약할 수 있는 토픽 정보를 토대로 스토리를 생성할 수 있다.In the above-described embodiment, the mobile device performs user information collection, user information analysis, and story generation. In this case, the mobile device may estimate user's behavior pattern by accumulating user information. Accordingly, the mobile device can generate a story based on topic information that can summarize a user's behavior pattern.

다른 실시 예에서 상기 이동 장치에서 사용자 정보를 수집하고, 정보 분석 서버에서 정보 분석 및 스토리를 생성할 수 있다. 예를 들어, 상기 이동 장치는 수집한 사용자 정보를 상기 정보 분석 서버로 제공한다. 상기 정보 분석 서버는 상기 이동 장치로부터 제공받은 사용자 정보를 분석하여 스토리를 생성한다. 이후, 상기 정보 분석 서버는 상기 생성한 스토리를 상기 이동 장치로 제공한다. 상기 이동 장치는 상기 정보 분석 서버에서 생성한 스토리를 사용자가 확인할 수 있도록 표시부에 표시한다. 즉, 상기 정보 분석 서버는 상기 도 1의 분석부(120) 및 스토리 생성부(130)와 동일한 동작을 수행한다. 이때, 상기 정보 분석 서버는 사용자 정보를 축척하여 상기 사용자의 행동 패턴을 추정할 수 있다. 이에 따라, 상기 정보 분석 서버는 상기 사용자의 행동 패턴을 요약할 수 있는 토픽 정보를 토대로 스토리를 생성할 수 있다.In another embodiment, user information may be collected by the mobile device, and information analysis and stories may be generated by an information analysis server. For example, the mobile device provides the collected user information to the information analysis server. The information analysis server analyzes user information provided from the mobile device to generate a story. Thereafter, the information analysis server provides the generated story to the mobile device. The mobile device displays a story generated by the information analysis server on a display unit for the user to check. That is, the information analysis server performs the same operations as the analysis unit 120 and the story generation unit 130 of FIG. 1. In this case, the information analysis server may estimate user's behavior pattern by accumulating user information. Accordingly, the information analysis server may generate a story based on topic information that can summarize the behavior pattern of the user.

또 다른 실시 예에서 상기 이동 장치는 사용자 정보를 수집하고, 정보 분석 서버에서 정보를 분석하며, 스토리 생성 서버에서 스토리를 생성할 수도 있다. 예를 들어, 상기 이동 장치는 수집한 사용자 정보를 상기 정보 분석 서버로 제공한다. 상기 정보 분석 서버는 상기 이동 장치로부터 제공받은 사용자 정보를 분석하여 스토리 생성 서버로 제공한다. 상기 스토리 생성 서버는 상기 정보 분석 서버로부터 제공받은 사용자 정보에 대한 분석 정보를 이용하여 스토리를 생성하고, 상기 생성한 스토리를 상기 이동 장치로 제공한다. 상기 이동 장치는 상기 정보 분석 서버에서 생성한 스토리를 사용자가 확인할 수 있도록 표시부에 표시한다. 즉, 상기 정보 분석 서버는 상기 도 1의 분석부(120)와 동일한 동작을 수행하고, 상기 스토리 생성 서버는 상기 도 1의 스토리 생성부(130)와 동일한 동작을 수행한다. 이때, 상기 정보 분석 서버는 사용자 정보를 축척하여 상기 사용자의 행동 패턴을 추정할 수 있다. 이에 따라, 상기 스토리 생성 서버는 상기 정보 분석 서버로부터 제공받은 사용자의 행동 패턴을 요약할 수 있는 토픽 정보를 토대로 스토리를 생성할 수 있다.In another embodiment, the mobile device may collect user information, analyze information in an information analysis server, and generate a story in a story generation server. For example, the mobile device provides the collected user information to the information analysis server. The information analysis server analyzes the user information provided from the mobile device and provides it to the story generation server. The story generation server generates a story using the analysis information on the user information provided from the information analysis server, and provides the generated story to the mobile device. The mobile device displays a story generated by the information analysis server on a display unit for the user to check. That is, the information analysis server performs the same operation as that of the analysis unit 120 of FIG. 1, and the story generation server performs the same operation as the story generation unit 130 of FIG. 1. In this case, the information analysis server may estimate user's behavior pattern by accumulating user information. Accordingly, the story generation server may generate a story based on topic information that can summarize the behavior pattern of the user provided from the information analysis server.

또 다른 실시 예에서 상기 이동 장치는 사용자 정보의 수집 및 스토리 생성을 수행하고, 정보 분석 서버에서 정보를 분석할 수도 있다. 예를 들어, 상기 이동 장치는 수집한 사용자 정보를 상기 정보 분석 서버로 제공한다. 상기 정보 분석 서버는 상기 이동 장치로부터 제공받은 사용자 정보를 분석하여 상기 이동 장치로 제공한다. 상기 이동 장치는 상기 정보 분석 서버로부터 제공받은 사용자 정보에 대한 분석 정보를 이용하여 스토리를 생성하고, 상기 스토리를 사용자가 확인할 수 있도록 표시부에 표시한다. 즉, 상기 정보 분석 서버는 상기 도 1의 분석부(120)와 동일한 동작을 수행한다. 이때, 상기 정보 분석 서버는 사용자 정보를 축척하여 상기 사용자의 행동 패턴을 추정할 수 있다. 이에 따라, 상기 이동 장치는 상기 정보 분석 서버로부터 제공받은 사용자의 행동 패턴을 요약할 수 있는 토픽 정보를 토대로 스토리를 생성할 수 있다. In another embodiment, the mobile device may collect user information and generate a story, and analyze the information in the information analysis server. For example, the mobile device provides the collected user information to the information analysis server. The information analysis server analyzes the user information provided from the mobile device and provides it to the mobile device. The mobile device generates a story using analysis information on user information provided from the information analysis server, and displays the story on a display unit for the user to check. That is, the information analysis server performs the same operation as the analysis unit 120 of FIG. In this case, the information analysis server may estimate user's behavior pattern by accumulating user information. Accordingly, the mobile device can generate a story based on topic information that can summarize the behavior pattern of the user provided from the information analysis server.

상술한 실시 예에서 스토리 생성 장치는 사용자 일상 정보와 관련된 토픽을 선정하여 스토리를 생성한다. 이때, 상기 스토리 생성 장치는 하기 <표 14>와 같은 토픽 분류 목록을 이용하여 토픽을 선정할 수 있다.In the above-described embodiment, the story generating device generates a story by selecting a topic related to user daily information. In this case, the story generating device may select a topic using a topic classification list as shown in Table 14 below.

대분류Main Category 소분류Small classification 상황 정의Situation Definition 모든 토픽에 포함Include in all topics 신용카드Credit card 신용카드 사용 내역Credit card usage history 통화currency 휴대폰의 통화Call on your phone 편지letter 일상적인 편지, 휴대폰의 문자, 이메일Everyday letters, text on your phone, email 생활life 종교religion 종교와 관련된 활동Religious Activities 금융finance 주식투자, 금융과 관련된 활동Activities related to equity investment and finance 계획/약속Plan / Promise 약속을 잡거나 계획을 잡는 것이 주 된 일인 경우When making appointments or planning is the main thing 집안 일housework 집안에 관련된 일을 돕거나 수행하는 것Helping or carrying out household work 다이어트Diet 다이어트 활동과 관련된 일이 주 된 일의 경우When work related to diet activity is main 아픔pain 아파서 병원에 가거나 치료하는 것이 주 된 일인 경우If you get sick and go to the hospital or care 기념일Anniversary 결혼기념일, 백일과 같은 일상생활에서 생기는 개인적인 기념일들Personal holidays from everyday life such as wedding anniversary and hundred days 특별한 휴일Special holiday 크리스마스, 새해, 설날, 추석과 같이 달력상에 휴일이지만 특수한 속성을 띄는 것들Holidays on the calendar with special attributes, such as Christmas, New Year, New Year's Day, and Chuseok 대인관계human relationship 대인관계human relationship 회식, 외식 등 가족이나 친구, 직장 직원들끼리에 일어나는 활동이면서 다른 어떠한 토픽에 속하지 않는 경우An activity that occurs between family, friends, or employees, such as a meal or a meal, and does not belong to any other topic. 취미활동Hobby 운동Exercise 축구, 야구와 같이 다이어트 목적이 아닌 운동을 즐기는 목적으로 수행하는 경우When performing for the purpose of enjoying sports, not for diet, such as soccer or baseball 독서Reading 도서관 등지에서 책을 빌리거나 책을 읽는 행위Borrow books or read books in libraries 낚시fishing 음악music 음악 감상과 관련된 활동Activities related to listening to music 게임game 컴퓨터 게임, 보드 게임Computer games, board games 그림Drawing 그림을 그리거나 감상하는 일련의 활동A series of activities to draw or watch 사진Picture 사진을 찍거나 사진과 관련된 활동Taking photos or activities related to photography 파티party 생일 파티, 축하 파티Birthday party, celebration party 요리Cooking 요리를 만들거나 하는 행위Making or cooking 애완동물Pets 애완견을 씻기거나 같이 놀아주는 행위Washing or playing with dogs TV시청TV TV를 보는 행위Watching TV 야외활동Outdoor activities 여행Travel 동물원, 식물원, 놀이동산 등에서의 일
해외 여행, 국내 여행Work in zoo, botanical garden, amusement park
Overseas travel, domestic travel 관람Preview 연극, 영화 관람
뮤지컬
콘서트Theater
musical
concert 쇼핑shopping 무언가를 사는 행위와 아이쇼핑Buying something and shopping 학교생활school life 수업class 공부외 수업과 관련된 행위Acts related to classes other than study 공부study 자기 개발, 공부와 관련된 행위Self-development, study related activities 체육대회Athletic Competition 운동회, 체육대회Athletic meet, athletic meet 축제festival 학교 축제School festival 시험exam 중간고사, 기말고사, 쪽지시험Midterm exam, Final exam, Note test 발표회Presentation 발표회Presentation 세미나, 강연회, 학술회Seminars, lectures, academic conferences 시험, 고시Exam, Notice 시험/자격증Exam / Certification 중간, 기말고사, 각종 고시, 자격증 취득과 관련된 활동Activities related to midterm, final exams, various examinations, and certification 입시, 입사Entrance examination 대학에서의 면접, 회사 입사와 관련된 활동Interview with a university, activities related to joining a company 직업job 접대Entertainment 계약과 관련된 접대 업무Hospitality Related to the Contract 회의conference 주간회의, 월간회의, 업무 관련 회의Weekly meetings, monthly meetings, business meetings 출장Business trip 업무task 다른 직업과 관련된 토픽과 연관없는 일상적으로 직장에서 일어나는 일What happens in the workplace on a daily basis, not related to topics related to other jobs

또한, 상기 스토리 생성 장치는 하기 <표 15>와 같이 기 설정된 환경 설정 파일을 토대로 로그 정보를 저장 및 분석할 수 있다.In addition, the story generating apparatus may store and analyze log information based on a preset environment setting file as shown in Table 15 below.

라인번호Line number 구성정보Configuration Information 구성 예Configuration example 1라인1 line 토픽 정보Topic Information 생활, 수업, 공부, 체육대회, ..., 소풍, ...Life, class, study, athletic meet, ..., picnic, ... 2라인2 lines 토픽별 거리 정보Distance Information by Topic 0, 0, 0, 0, ..., 100, ...0, 0, 0, 0, ..., 100, ... 3라인3 lines 개체명 정보Entity Name Information PERSON, ORG, DAY, TIME, LOCATION, OBJECTPERSON, ORG, DAY, TIME, LOCATION, OBJECT 4라인
...
n라인4 lines
...
n lines 로그별 설정 정보Log-specific setting information SMS\tS\t\t0.2\t000010\t010000
일정\t
GPS\tG\t1:걷|산책하|여행하:0|100|50000\t0.1\t100100\t100200
통화기록\tC\t2:통화하:20\t0.1\t020000\t100002SMS \ tS \ t \ t0.2 \ t000010 \ t010000
Event \ t
GPS \ tG \ t1: Walking | Walking | Travel: 0 | 100 | 50000 \ t0.1 \ t100100 \ t100200
Call Log \ tC \ t2: Call To: 20 \ t0.1 \ t020000 \ t100002

상기 <표 15>를 이용하여 구성한 GPS의 환경 설정 정보는 하기 <표 16>과 같이 구성될 수 있다.The environment setting information of the GPS configured using the <Table 15> may be configured as shown in the following <Table 16>.

항목Item 구성정보Configuration Information 구성 예Configuration example 항목 1Item 1 로그 이름Log name GPSGPS 항목 2Item 2 로그 코드Log code GG 항목 3Item 3 기본 동사 정보Basic verb information 1:걷|산책하|여행하:0|100|500001: Walking | Walking | Travel: 0 | 100 | 50000 항목 4Item 4 로그 가중치Log weight 0.10.1 항목 5
...
항목 nItem 5
...
Item n 로그 데이터
처리 방법 정보Log data
Processing method information 100100\t100200100100 \ t100200

상기 <표 16>에서 기본 동사 정보는 거리에 따라 사용되는 기본 동사를 정의한다. 즉, 거리가 0m 이상 100m미만인 경우, 상기 스토리 생성 장치는 기본동사로 “걷다”를 선택하고, 거리가 100m 이상 50km미만인 경우, 기본 동사로 “산책하다”를 선택한다. In Table 16, basic verb information defines basic verbs used according to distance. That is, when the distance is 0m or more and less than 100m, the story generating device selects "walk" as the basic verb, and when the distance is 100m or more and less than 50km, the "verb" is selected as the basic verb.

한편 본 발명의 상세한 설명에서는 구체적인 실시 예에 관해 설명하였으나, 본 발명의 범위에서 벗어나지 않는 한도 내에서 여러 가지 변형이 가능하다. 그러므로 본 발명의 범위는 설명된 실시 예에 국한되어 정해져서는 아니 되며 후술하는 특허청구의 범위뿐만 아니라 이 특허청구의 범위와 균등한 것들에 의해 정해져야 한다.Meanwhile, in the detailed description of the present invention, specific embodiments have been described, but various modifications may be made without departing from the scope of the present invention. Therefore, the scope of the present invention should not be limited to the described embodiments, but should be determined not only by the scope of the following claims, but also by the equivalents of the claims.

Claims

An apparatus for summarizing daily information of a user,
An information collecting unit collecting log information including daily information of a user from at least one electronic device;
An analysis unit for analyzing at least one log information collected by the information collection unit to determine at least one topic representing daily information of a user;
And a story generator for generating at least one sentence representing daily information of a user using at least one topic determined by the analyzer.

The method of claim 1,
The analysis unit,
A topic segment configuration unit constituting at least one topic segment including at least one candidate topic using log information collected by the information collection unit;
An extraction unit for extracting sentence components for constructing sentences from each topic segment;
A topic determiner which determines a topic for each topic segment in consideration of the frequency of occurrence of the topic according to the sentence component extracted by the extractor;
And an episode determination unit for selecting one topic segment for a reference time in consideration of the frequency of occurrence of the topic included in each topic segment.

The method of claim 2,
The topic segment configuration unit configures at least one topic segment using any one of the plurality of log information using reference log information, and maps auxiliary log information to each topic segment in consideration of an occurrence time of the auxiliary log information. Dummy segments are constructed using secondary log information that is not mapped to topic segments.
The auxiliary log information, characterized in that the log information of the plurality of log information except the reference log information, characterized in that the log information.

The method of claim 2,
The extractor is characterized in that it comprises an entity name recognizer.

The method of claim 2,
The extractor, characterized in that for updating the frequency of occurrence of the topic for the sentence component extracted from each topic segment.

The method of claim 2,
The episode extraction unit, characterized in that for selecting any one of the reference topic in consideration of the frequency of occurrence of the topic of the selected topic segment at each reference time.

The method according to claim 6,
The episode determiner, when the reference topics of the consecutive topic segments are the same, characterized in that the consolidation of consecutive topic segments including the same reference topic into one topic segment.

The method of claim 1,
The analysis unit, characterized in that for analyzing the log information collected by the information collecting unit according to the preset environment setting information.

The method of claim 1,
The story generator comprises at least one intermediate sentence using sentence components included in an episode set at a reference time, removes an intermediate sentence having the same meaning, and adds a sentence linking ending and ending ending information to the intermediate sentence. Device for generating a sentence for the episode.

The method of claim 9,
The story generation unit, characterized in that for setting the ending ending in consideration of at least one of gender, region and age.

The method of claim 1,
The story generation unit, characterized in that for generating at least one sentence representing the user's daily information in consideration of at least one of the reference topic, weather, and emotion that is representative of the user's daily information.

The method of claim 1,
And a display unit for displaying at least one sentence generated by the story generator.

In a method for summarizing daily information of a user,
Collecting log information including daily information of a user from at least one electronic device;
Determining at least one topic representing daily information of a user by analyzing log information collected by the information collector;
And generating at least one sentence representing daily information of a user using at least one topic determined by the analysis unit.

The method of claim 13,
The process of determining the topic,
Constructing at least one topic segment including at least one candidate topic using the collected log information;
Extracting sentence components for constructing sentences from each topic segment;
Determining a topic for each topic segment in consideration of the frequency of occurrence of the topic according to the extracted sentence component;
Selecting one topic segment for a reference time in consideration of the frequency of occurrence of the topic included in each topic segment.

The method of claim 14,
The process of configuring the topic segment,
Configuring at least one topic segment by using any one of the plurality of log information based on the reference log information;
Mapping the auxiliary log information to each topic segment in consideration of the occurrence time of the auxiliary log information;
If there is auxiliary log information not mapped to the topic segment, including a step of configuring a dummy segment by using the auxiliary log information not mapped to the topic segment;
And the auxiliary log information indicates log information excluding the reference log information among the plurality of log information.

The method of claim 14,
Extracting the sentence component; extracting the sentence component from textual data using an entity name recognizer.

The method of claim 14,
And after extracting the sentence component, updating the frequency of occurrence of the topic for the sentence component extracted from each topic segment.

The method of claim 14,
The process of selecting the topic segment,
And selecting one reference topic in consideration of the frequency of occurrence of the topic of the selected topic segment at each reference time.

19. The method of claim 18,
Selecting the reference topic and checking whether the reference topics of consecutive topic segments are the same;
If the reference topics of consecutive segments are the same, further comprising merging consecutive topic segments comprising the same reference topic into one topic segment.

The method of claim 13,
Determining the at least one topic,
And determining at least one topic by analyzing the collected log information according to preset environment setting information.

The method of claim 13,
The process of generating the sentence,
Constructing at least one intermediate sentence using sentence components included in the episode set at the reference time;
Removing the middle sentence of the same meaning,
And adding a sentence linking ending and ending ending information to the middle sentence to generate a sentence for the episode.

22. The method of claim 21,
The ending ending is set in consideration of at least one of gender, region and age.

The method of claim 13,
The process of generating the sentence,
And generating at least one sentence representing the user's daily information in consideration of at least one of reference topics, weather, and emotions that are representative of the user's daily information.

The method of claim 13,
And displaying the generated at least one sentence.