KR20160000132A

KR20160000132A - Data processing system and method of operating data processing system

Info

Publication number: KR20160000132A
Application number: KR1020140077069A
Authority: KR
Inventors: 안태훈
Original assignee: 주식회사 아이디인큐
Priority date: 2014-06-24
Filing date: 2014-06-24
Publication date: 2016-01-04
Also published as: KR101586012B1

Abstract

A data processing system comprises a terminal, a data sorter, a first processing unit, a second processing unit, a third processing unit, and a result unit. The terminal provides a survey result. The data sorter divides the survey result into structured data and unstructured data. The first processing unit determines a process reference inquiry group based on personal information of a surveyee. The first processing unit provides first processed data by correcting unstructured data according to previous and post accumulated processed data corresponding to the process reference inquiry group. The second processing unit corrects a syntax error fort the first processed data and provides second processed data. The third processing unit groups the second processed data as a set of representative data based on synonym information and provides representative data as third processed data. A result unit analyzes structured data and third processed data. When the data processing system including first to third processing units, and a method for operating the data processing system according to the present invention are used, a past database of first to third processing units is accumulated, so unstructured data may be processed. Accordingly, the accuracy and operation speed of the data processing system may be increased.

Description

TECHNICAL FIELD [0001] The present invention relates to a data processing system and a data processing system,

본 발명은 데이터 처리에 관한 것으로서, 보다 상세하게는 데이터 처리 시스템 및 데이터 처리 시스템의 동작 방법에 관한 것이다. The present invention relates to data processing, and more particularly, to a data processing system and a method of operating the data processing system.

설문 조사 참여자의 의견 및 생각을 고정된 형식이 아닌 자유로운 방식으로 수집하는 주관식 응답 수집 방식의 경우, 수집된 응답 결과는 비정형 데이터일 수 있다. 비정형 데이터를 분석하기 위해서는 최초 수집된 응답 결과에 대한 보정 처리가 필요하다. In the case of the questionnaire response collection method, which collects opinions and thoughts of survey participants in a free way rather than a fixed form, the collected response result may be unstructured data. In order to analyze unstructured data, it is necessary to correct the result of the first collected response.

기존의 경우 비정형 데이터를 분석하기 위해서 별도의 인력을 고용하여 처리하거나, 일부 자동처리 기술을 활용하여 단어 검색 및 변환 처리를 적용하고 있으나 최종적인 보정을 위해서 인력이 요구된다. In the conventional case, a separate manpower is employed to analyze unstructured data, or some automatic processing technology is applied to word search and conversion processing, but manpower is required for final correction.

상기와 같은 문제점을 해결하기 위한 본 발명의 일 목적은 비정형 데이터에 대한 가공 과정을 거친 후 보정 결과를 제공함으로써 비정형 데이터의 처리 속도를 증가시킬 수 있는 데이터 처리 시스템을 제공하는 것이다. SUMMARY OF THE INVENTION An object of the present invention is to provide a data processing system capable of increasing the processing speed of unstructured data by providing a correction result after being processed for unstructured data.

상기와 같은 문제점을 해결하기 위한 본 발명의 일 목적은 비정형 데이터에 대한 가공 과정을 거친 후 보정 결과를 제공함으로써 비정형 데이터의 처리 속도를 증가시킬 수 있는 데이터 처리 시스템의 동작 방법을 제공하는 것이다.An object of the present invention is to provide a method of operating a data processing system capable of increasing the processing speed of an unstructured data by providing a correction result after processing the unstructured data.

본 발명의 일 목적을 달성하기 위하여 본 발명의 실시예들에 따른 데이터 처리 시스템은 단말기, 데이터 분류기, 제1 가공부, 제2 가공부, 제3 가공부 및 결과부를 포함한다. 상기 단말기는 설문 조사 결과를 제공한다. 상기 데이터 분류기는 상기 설문 조사 결과를 정형 데이터 및 비정형 데이터로 구분한다. 상기 제1 가공부는 설문 조사 대상자의 개인 정보에 기초하여 가공 기준 조회 그룹을 결정한다. 상기 제1 가공부는 상기 가공 기준 조회 그룹에 상응하는 누적 가공 전후 데이터에 따라 상기 비정형 데이터를 보정하여 제1 가공 데이터를 제공한다. 상기 제2 가공부는 상기 제1 가공 데이터에 대한 문법적 오류를 수정하여 제2 가공 데이터를 제공한다. 상기 제3 가공부는 동의어 정보에 기초하여 상기 제2 가공 데이터를 하나의 대표 데이터로 그룹화하여 상기 대표 데이터를 제3 가공 데이터로서 제공한다. 상기 결과부는 상기 정형 데이터 및 상기 제3 가공 데이터를 분석한다. In order to accomplish one object of the present invention, a data processing system according to embodiments of the present invention includes a terminal, a data classifier, a first processing unit, a second processing unit, a third processing unit, and a result unit. The terminal provides a survey result. The data classifier divides the results of the survey into regular data and atypical data. The first processing section determines a machining criterion inquiry group based on the personal information of the person to be surveyed. The first machining unit corrects the irregular data according to pre-cumulative machining data corresponding to the machining reference inquiry group to provide first machining data. The second processing unit corrects a grammatical error of the first processing data to provide second processing data. The third processing section groups the second processed data into one representative data based on synonym information and provides the representative data as third processed data. The result section analyzes the form data and the third processed data.

예시적인 실시예에 있어서, 상기 가공 기준 조회 그룹은 상기 설문 조사 대상자의 나이, 성별 또는 직업에 따라 결정될 수 있다. In an exemplary embodiment, the processing criterion inquiry group may be determined according to the age, sex, or occupation of the person to be surveyed.

예시적인 실시예에 있어서, 상기 누적 가공 전후 데이터는 누적 가공 전 데이터 및 누적 가공 후 데이터를 포함할 수 있다. 상기 비정형 데이터가 상기 누적 가공 전 데이터 중 하나와 일치하는 경우, 상기 제1 가공부는 상응하는 상기 누적 가공 후 데이터를 제공할 수 있다. In an exemplary embodiment, the pre-cumulative machining data may include pre-cumulative machining data and cumulative machining data. If the atypical data matches one of the pre-cumulative data, the first processing unit may provide the corresponding post-cumulative data.

예시적인 실시예에 있어서, 상기 제2 가공부는 상기 제1 가공 데이터에 포함되는 맞춤법 오류 및 표준어 오류를 수정하여 제2 가공 데이터를 제공할 수 있다. In an exemplary embodiment, the second processing section may provide second processing data by correcting a spelling error and a standard word error contained in the first processing data.

예시적인 실시예에 있어서, 상기 동의어 정보는 동의어 그룹 및 대표 동의어를 포함하는 동의어 데이터 베이스로부터 제공될 수 있다. In an exemplary embodiment, the synonym information may be provided from a synonym database that includes a synonym group and a representative synonym.

예시적인 실시예에 있어서, 상기 제1 가공부가 상기 비정형 데이터를 보정하여 상기 제1 가공 데이터를 생성하는 경우, 상기 비정형 데이터 및 상기 제1 가공 데이터는 상기 누적 가공 전후 데이터에 추가될 수 있다. In an exemplary embodiment, when the first machining part corrects the irregular data to generate the first machining data, the atypical data and the first machining data may be added to the data before and after the cumulative machining.

예시적인 실시예에 있어서, 상기 제1 가공부는 상기 누적 가공 전후 데이터를 저장하는 누적 가공 전후 데이터 저장부를 포함할 수 있다. 상기 누적 가공 전후 데이터 저장부는 상기 비정형 데이터를 상기 누적 가공 전 데이터로서 저장하고, 상기 제1 가공 데이터를 상기 누적 가공 후 데이터로서 저장할 수 있다. In an exemplary embodiment, the first processing section may include a pre-cumulative data storage section for storing data before and after the cumulative processing. The data storage section before and after the cumulative machining may store the unstructured data as the pre-cumulative data and store the first processed data as the post-cumulative data.

예시적인 실시예에 있어서, 상기 제2 가공부는 상기 제1 가공 데이터에 대한 문법적 오류를 수정하여 상기 제2 가공 데이터를 생성하는 과정에서 발생되는 문법적 오류 데이터를 저장하는 문법적 오류 데이터 저장부를 포함할 수 있다. In an exemplary embodiment, the second processing unit may include a grammatical error data storage unit for storing grammatical error data generated in the process of generating the second processed data by correcting a grammatical error in the first processed data have.

예시적인 실시예에 있어서, 상기 제2 가공부는 상기 문법적 오류 데이터 저장부에 포함되는 상기 문법적 오류 데이터에 기초하여 상기 제2 가공 데이터를 제공할 수 있다. In an exemplary embodiment, the second processing unit may provide the second processing data based on the grammatical error data contained in the grammatical error data storage.

예시적인 실시예에 있어서, 상기 제3 가공부가 상기 제2 가공 데이터를 하나의 상기 대표 데이터로 그룹화하여 상기 대표 데이터를 제3 가공 데이터로서 제공하는 과정에서 발생되는 정보들은 상기 동의어 정보에 추가될 수 있다. 상기 제3 가공부는 상기 추가된 동의어 정보에 기초하여 상기 제3 가공 데이터를 제공할 수 있다. In an exemplary embodiment, information generated in the process of grouping the second processed data into one representative data and providing the representative data as third processed data may be added to the synonym information have. And the third processing section may provide the third processing data based on the added synonym information.

본 발명의 일 목적을 달성하기 위하여 본 발명의 실시예들에 따른 데이터 처리 시스템의 동작 방법은 설문 조사 결과를 정형 데이터 및 비정형 데이터로 구분하는 단계, 설문 조사 대상자의 개인 정보에 기초하여 가공 기준 조회 그룹을 결정하고, 상기 가공 기준 조회 그룹에 상응하는 누적 가공 전후 데이터에 따라 상기 비정형 데이터를 보정하여 제1 가공 데이터를 제공하는 단계, 상기 제1 가공 데이터에 대한 문법적 오류를 수정하여 제2 가공 데이터를 제공하는 단계, 동의어 정보에 기초하여 상기 제2 가공 데이터를 하나의 대표 데이터로 그룹화하여 상기 대표 데이터를 제3 가공 데이터로서 제공하는 단계를 포함한다. In order to accomplish one object of the present invention, a method of operating a data processing system according to embodiments of the present invention includes dividing a survey result into regular data and unstructured data, Group, and correcting the irregular data according to pre-cumulative machining data corresponding to the machining-criterion inquiry group to provide first machining data, correcting a grammatical error of the first machining data, And grouping the second processed data into one representative data based on synonym information and providing the representative data as third processed data.

예시적인 실시예에 있어서, 상기 제3 가공 데이터를 단어 별로 분리하여 라벨리스트를 생성하고, 보정 데이터 베이스를 검색하여 상기 라벨리스트와 일치하는 보정 결과를 출력하는 단계를 더 포함할 수 있다. In an exemplary embodiment, the method may further include generating a label list by separating the third processed data for each word, searching the correction database, and outputting a correction result matching the label list.

본 발명의 실시예들에 따른 데이터 처리 시스템은 비정형 데이터에 대한 가공 과정을 거친 후 보정 결과를 제공함으로써 비정형 데이터의 처리 속도를 증가시킬 수 있어 데이터 처리 시스템을 포함하는 설문 응답 수집 시스템의 성능을 높일 수 있다. The data processing system according to the embodiments of the present invention can increase the processing speed of the unstructured data by providing the correction result after processing the unstructured data to increase the performance of the questionnaire response collection system including the data processing system .

도 1은 본 발명의 실시예들에 따른 데이터 처리 시스템을 나타내는 블록도이다.
도 2는 도 1의 데이터 처리 시스템에 포함되는 제1 가공부의 일 동작 예를 설명하기 위한 도면이다.
도 3은 도 1의 데이터 처리 시스템에 포함되는 제1 가공부의 다른 동작 예를 설명하기 위한 도면이다.
도 4는 도 1의 데이터 처리 시스템에 포함되는 제2 가공부의 일 동작 예를 설명하기 위한 도면이다.
도 5는 도 1의 데이터 처리 시스템에 포함되는 제3 가공부의 일 동작 예를 설명하기 위한 도면이다.
도 6은 도 1의 데이터 처리 시스템에서 사용되는 개인 정보의 일 예를 나타내는 도면이다.
도 7은 도 1의 제1 가공부에 포함되는 누적 가공 전후 데이터 저장부를 나타내는 도면이다.
도 8은 도 1의 제2 가공부에 포함되는 문법적 오류 데이터 저장부를 나타내는 도면이다.
도 9는 도 1의 제3 가공부에 포함되는 동의어 데이터 베이스의 일 예를 나타내는 도면이다.
도 10은 도 1의 제3 가공부에 포함되는 동의어 데이터 베이스의 다른 예를 나타내는 도면이다.
도 11은 본 발명의 실시예들에 따른 데이터 처리 시스템의 동작 방법을 나타내는 순서도이다.
도 12는 본 발명의 일 실시예에 따른 데이터 처리 시스템의 동작 방법을 나타내는 순서도이다.
도 13은 라벨리스트 동작의 일 예를 나타내는 도면이다.
도 14는 보정 데이터 베이스 검색 결과 및 보정 결과를 출력하는 과정을 설명하기 위한 도면이다. 1 is a block diagram illustrating a data processing system in accordance with embodiments of the present invention.
FIG. 2 is a view for explaining an example of operation of the first processing section included in the data processing system of FIG. 1;
FIG. 3 is a diagram for explaining another operation example of the first processing section included in the data processing system of FIG. 1. FIG.
4 is a view for explaining an example of operation of the second processing unit included in the data processing system of FIG.
5 is a diagram for explaining an example of operation of the third processing unit included in the data processing system of FIG.
6 is a diagram showing an example of personal information used in the data processing system of FIG.
7 is a view showing a data storage unit before and after the cumulative processing included in the first processing unit of FIG.
8 is a diagram showing a grammatical error data storage unit included in the second processing unit of FIG.
9 is a diagram showing an example of a synonym database included in the third processed portion of FIG.
10 is a diagram showing another example of the synonym database included in the third processed portion in FIG.
11 is a flowchart illustrating an operation method of a data processing system according to embodiments of the present invention.
12 is a flowchart illustrating an operation method of a data processing system according to an embodiment of the present invention.
13 is a diagram showing an example of the label list operation.
14 is a diagram for explaining a process of outputting a correction database search result and a correction result.

본문에 개시되어 있는 본 발명의 실시예들에 대해서, 특정한 구조적 내지 기능적 설명들은 단지 본 발명의 실시예를 설명하기 위한 목적으로 예시된 것으로, 본 발명의 실시예들은 다양한 형태로 실시될 수 있으며 본문에 설명된 실시예들에 한정되는 것으로 해석되지 않는다.For the embodiments of the invention disclosed herein, specific structural and functional descriptions are set forth for the purpose of describing an embodiment of the invention only, and it is to be understood that the embodiments of the invention may be practiced in various forms, And is not to be construed as limited to the embodiments described in Figs.

본 발명은 다양한 변경을 가할 수 있고 여러 가지 형태를 가질 수 있는바, 특정 실시예들을 도면에 예시하고 본문에 상세하게 설명하고자 한다. 그러나 이는 본 발명을 특정한 개시 형태에 대해 한정하려는 것이 아니며, 본 발명의 사상 및 기술 범위에 포함되는 모든 변경, 균등물 내지 대체물을 포함하는 것으로 이해되어야 한다.The present invention is capable of various modifications and various forms, and specific embodiments are illustrated in the drawings and described in detail in the text. It is to be understood, however, that the invention is not intended to be limited to the particular forms disclosed, but on the contrary, is intended to cover all modifications, equivalents, and alternatives falling within the spirit and scope of the invention.

제1, 제2 등의 용어는 다양한 구성요소들을 설명하는데 사용될 수 있지만, 상기 구성요소들은 상기 용어들에 의해 한정되어서는 안 된다. 상기 용어들은 하나의 구성요소를 다른 구성요소로부터 구별하는 목적으로 사용될 수 있다. 예를 들어, 본 발명의 권리 범위로부터 이탈되지 않은 채 제1 구성요소는 제2 구성요소로 명명될 수 있고, 유사하게 제2 구성요소도 제1 구성요소로 명명될 수 있다.The terms first, second, etc. may be used to describe various components, but the components should not be limited by the terms. The terms may be used for the purpose of distinguishing one component from another. For example, without departing from the scope of the present invention, the first component may be referred to as a second component, and similarly, the second component may also be referred to as a first component.

어떤 구성요소가 다른 구성요소에 "연결되어" 있다거나 "접속되어" 있다고 언급된 때에는, 그 다른 구성요소에 직접적으로 연결되어 있거나 또는 접속되어 있을 수도 있지만, 중간에 다른 구성요소가 존재할 수도 있다고 이해되어야 할 것이다. 반면에, 어떤 구성요소가 다른 구성요소에 "직접 연결되어" 있다거나 "직접 접속되어" 있다고 언급된 때에는, 중간에 다른 구성요소가 존재하지 않는 것으로 이해되어야 할 것이다. 구성요소들 간의 관계를 설명하는 다른 표현들, 즉 "~사이에"와 "바로 ~사이에" 또는 "~에 이웃하는"과 "~에 직접 이웃하는" 등도 마찬가지로 해석되어야 한다.It is to be understood that when an element is referred to as being "connected" or "connected" to another element, it may be directly connected or connected to the other element, . On the other hand, when an element is referred to as being "directly connected" or "directly connected" to another element, it should be understood that there are no other elements in between. Other expressions that describe the relationship between components, such as "between" and "between" or "neighboring to" and "directly adjacent to" should be interpreted as well.

본 출원에서 사용한 용어는 단지 특정한 실시예를 설명하기 위해 사용된 것으로, 본 발명을 한정하려는 의도가 아니다. 단수의 표현은 문맥상 명백하게 다르게 뜻하지 않는 한, 복수의 표현을 포함한다. 본 출원에서, "포함하다" 또는 "가지다" 등의 용어는 설시된 특징, 숫자, 단계, 동작, 구성요소, 부분품 또는 이들을 조합한 것이 존재함을 지정하려는 것이지, 하나 또는 그 이상의 다른 특징들이나 숫자, 단계, 동작, 구성요소, 부분품 또는 이들을 조합한 것들의 존재 또는 부가 가능성을 미리 배제하지 않는 것으로 이해되어야 한다.The terminology used in this application is used only to describe a specific embodiment and is not intended to limit the invention. The singular expressions include plural expressions unless the context clearly dictates otherwise. In the present application, the terms "comprise", "having", and the like are intended to specify the presence of stated features, integers, steps, operations, elements, components, or combinations thereof, , Steps, operations, components, parts, or combinations thereof, as a matter of principle.

다르게 정의되지 않는 한, 기술적이거나 과학적인 용어를 포함해서 여기서 사용되는 모든 용어들은 본 발명이 속하는 기술분야에서 통상의 지식을 가진 자에 의해 일반적으로 이해되는 것과 동일한 의미이다. 일반적으로 사용되는 사전에 정의되어 있는 것과 같은 용어들은 관련 기술의 문맥상 가지는 의미와 일치하는 의미인 것으로 해석되어야 하며, 본 출원에서 명백하게 정의하지 않는 한, 이상적이거나 과도하게 형식적인 의미로 해석되지 않는다.Unless otherwise defined, all terms used herein, including technical or scientific terms, have the same meaning as commonly understood by one of ordinary skill in the art to which this invention belongs. Terms such as those defined in commonly used dictionaries should be construed as meaning consistent with meaning in the context of the relevant art and are not to be construed as ideal or overly formal in meaning unless expressly defined in the present application .

이하, 첨부한 도면들을 참조하여, 본 발명의 바람직한 실시예를 보다 상세하게 설명하고자 한다. 도면상의 동일한 구성요소에 대해서는 동일한 참조부호를 사용하고 동일한 구성요소에 대해서 중복된 설명은 생략한다.Hereinafter, preferred embodiments of the present invention will be described in detail with reference to the accompanying drawings. The same reference numerals are used for the same constituent elements in the drawings and redundant explanations for the same constituent elements are omitted.

도 1은 본 발명의 실시예들에 따른 데이터 처리 시스템을 나타내는 블록도이고, 도 2는 도 1의 데이터 처리 시스템에 포함되는 제1 가공부의 일 동작 예를 설명하기 위한 도면이다.FIG. 1 is a block diagram showing a data processing system according to an embodiment of the present invention, and FIG. 2 is a view for explaining an example of operation of a first processing part included in the data processing system of FIG.

도 1 및 도 2를 참조하면, 데이터 처리 시스템(10)은 단말기(100), 데이터 분류기(300), 가공부(500) 및 결과부(700)를 포함한다. 가공부(500)는 제1 가공부(510), 제2 가공부(530), 제3 가공부(550)를 포함할 수 있다. Referring to FIGS. 1 and 2, a data processing system 10 includes a terminal 100, a data classifier 300, a processing unit 500, and a result unit 700. The processing unit 500 may include a first processing unit 510, a second processing unit 530, and a third processing unit 550.

단말기(100)는 설문 조사 결과(SR)를 제공한다. 예를 들어 단말기(100)는 설문 조사 대상자에 의해서 제어될 수 있다. 설문 조사 결과(SR)는 복수의 설문 조사 대상자로부터 제공될 수 있다. 단말기(100)는 컴퓨터, 노트북, 스마트 폰 및 휴대형 단말기를 포함할 수 있다. The terminal 100 provides a survey result (SR). For example, the terminal 100 can be controlled by the person to be surveyed. Survey results (SR) can be provided from multiple survey subjects. The terminal 100 may include a computer, a notebook, a smart phone, and a portable terminal.

데이터 분류기(300)는 설문 조사 결과(SR)를 정형 데이터(SD) 및 비정형 데이터(NSD)로 구분한다. 예를 들어 데이터 분류기(300)는 설문 조사 결과(SR)를 전달받을 수 있다. 설문 조사는 객관식 설문 조사 및 주관식 설문 조사를 포함할 수 있다. 객관식 설문 조사의 경우, 설문 조사의 결과는 숫자일 수 있다. 주관식 설문 조사의 경우, 설문 조사의 결과는 숫자, 단어 또는 문장일 수 있다. 데이터는 정형 데이터(SD) 및 비정형 데이터(NSD)를 포함할 수 있다. 정형 데이터(SD)는 객관식 설문 조사의 결과인 숫자일 수 있다. 또한 정형 데이터(SD)는 주관식 설문 조사의 결과 중 숫자일 수 있다. 정형 데이터(SD)는 데이터 분류기(300)로부터 결과부(700)로 전달될 수 있다. 비정형 데이터(NSD)는 주관식 설문 조사의 결과 중 단어 또는 문장일 수 있다. 비정형 데이터(NSD)는 데이터 분류기(300)로부터 제1 가공부(510)로 전달될 수 있다. The data classifier 300 divides the survey result (SR) into the form data (SD) and the atypical data (NSD). For example, the data classifier 300 may receive the survey result (SR). The questionnaire can include an objective questionnaire and a questionnaire survey. For multiple-choice surveys, the results of the survey may be numbers. For a questionnaire survey, the results of the survey can be numbers, words, or sentences. The data may include regular data (SD) and unstructured data (NSD). Formal data (SD) may be a number that is the result of a multiple-choice survey. Also, the formatted data (SD) can be a number among the results of the questionnaire survey. The format data SD may be transferred from the data classifier 300 to the result portion 700. Unstructured data (NSD) can be words or sentences of the results of the questionnaire survey. The atypical data (NSD) may be transferred from the data classifier 300 to the first processing unit 510.

제1 가공부(510)는 설문 조사 대상자의 개인 정보(PI)에 기초하여 가공 기준 조회 그룹(PRG)을 결정한다. 설문 조사 대상자의 개인 정보(PI)는 나이 정보일 수 있다. 이 경우, 가공 기준 조회 그룹(PRG)은 나이에 따라 결정될 수 있다. 예를 들어, 설문 조사 대상자의 나이가 10대인 경우, 가공 기준 조회 그룹(PRG)은 제1 가공 기준 조회 그룹(PRG1)일 수 있다. 설문 조사 대상자의 나이가 20대인 경우, 가공 기준 조회 그룹(PRG)은 제2 가공 기준 조회 그룹(PRG2)일 수 있다. 설문 조사 대상자의 나이가 90대인 경우, 가공 기준 조회 그룹(PRG)은 제9 가공 기준 조회 그룹(PRG9)일 수 있다.The first processing unit 510 determines the machining criterion group PRG based on the personal information PI of the person to be surveyed. The personal information (PI) of the survey subject may be age information. In this case, the machining criterion inquiry group PRG can be determined according to age. For example, if the age of the survey subject is ten, the machining criterion group PRG may be the first machining criterion group PRG1. When the age of the survey subject is twenties, the machining criterion group PRG may be the second machining criterion group PRG2. If the age of the questionnaire is 90 years old, the machining criterion group (PRG) may be the ninth machining criterion group (PRG9).

도 3은 도 1의 데이터 처리 시스템에 포함되는 제1 가공부의 다른 동작 예를 설명하기 위한 도면이다.FIG. 3 is a diagram for explaining another operation example of the first processing section included in the data processing system of FIG. 1. FIG.

도 3을 참조하면, 제1 가공부(510)는 가공 기준 조회 그룹(PRG)에 상응하는 누적 가공 전후 데이터에 따라 비정형 데이터(NSD)를 보정하여 제1 가공 데이터(PD1)를 제공한다. 누적 가공 전후 데이터는 누적 가공 전 데이터(PRE_PD) 및 누적 가공 후 데이터(POST_PD)를 포함할 수 있다. Referring to FIG. 3, the first machining unit 510 corrects the irregular data NSD according to the pre-cumulative machining data corresponding to the machining criterion inquiry group PRG to provide the first machining data PD1. The pre-cumulative machining data may include pre-cumulative machining data PRE_PD and cumulative machining data POST_PD.

예를 들어 설문 조사 대상자의 나이가 10대일 수 있다. 설문 조사 대상자의 나이가 10대인 경우, 가공 기준 조회 그룹(PRG)은 제1 가공 기준 조회 그룹(PRG1)일 수 있다. 제1 가공 기준 조회 그룹(PRG1)에 상응하는 누적 가공 전 데이터(PRE_PD)는 "젤" 및 "짱"일 수 있다. 누적 가공 전 데이터(PRE_PD)는 제1 가공 기준 조회 그룹(PRG1)에 해당하는 10대들이 자주 사용하는 줄임말들일 수 있다. 제1 가공 기준 조회 그룹(PRG1)에 상응하는 누적 가공 후 데이터(POST_PD)는 "제일"일 수 있다. 누적 가공 후 데이터(POST_PD)는 누적 가공 전 데이터(PRE_PD)를 표준화시킨 단어 및 단어들의 조합일 수 있다. 예를 들어, 제1 가공 기준 조회 그룹(PRG1)에 상응하는 누적 가공 전 데이터(PRE_PD)는 "맛난다" 및 "맛좋다"일 수 있다. 제1 가공 기준 조회 그룹(PRG1)에 상응하는 누적 가공 후 데이터(POST_PD)는 "맛있다"일 수 있다.For example, the age of the survey subject may be teenagers. If the age of the survey subject is ten, the processing criterion inquiry group (PRG) may be the first processing criterion inquiry group (PRG1). The accumulated pre-processing data PRE_PD corresponding to the first machining reference query group PRG1 may be "gel" and " The pre-accumulated data PRE_PD may be abbreviations frequently used by the teenagers corresponding to the first machining criterion group PRG1. The post-cumulative post-processing data POST_PD corresponding to the first machining criterion inquiry group PRG1 may be "best ". The post-cumulative post-processing data POST_PD may be a word and a combination of words that standardize the pre-cumulative pre-processing data PRE_PD. For example, the accumulated pre-processing data PRE_PD corresponding to the first machining reference query group PRG1 may be "delicious" and "tastier". The accumulated post-processing data POST_PD corresponding to the first machining criterion inquiry group PRG1 may be "delicious ".

제1 가공부(510)는 가공 기준 조회 그룹(PRG)에 상응하는 누적 가공 전후 데이터에 따라 비정형 데이터(NSD)를 보정하여 제1 가공 데이터(PD1)를 제공한다. The first machining unit 510 corrects the irregular data NSD according to the pre-cumulative machining data corresponding to the machining criterion inquiry group PRG to provide the first machining data PD1.

예를 들어, 제1 가공부(510)에 제공되는 비정형 데이터(NSD)는 "콜드스톤아이스클임이 젤 맛난다"일 수 있다. 설문 조사 대상자의 나이가 10대인 경우, 가공 기준 조회 그룹(PRG)은 제1 가공 기준 조회 그룹(PRG1)일 수 있다. 제1 가공 기준 조회 그룹(PRG1)에 상응하는 누적 가공 전 데이터(PRE_PD)는 "젤", "짱", "맛난다" 및 " 맛좋다"를 포함할 수 있다. 누적 가공 전 데이터(PRE_PD) "젤" 및 "짱"에 상응하는 누적 가공 후 데이터(POST_PD)는 "제일" 일 수 있다. 누적 가공 전 데이터(PRE_PD) "맛난다" 및 "맛좋다"에 상응하는 누적 가공 후 데이터(POST_PD)는 "맛있다" 일 수 있다. 제1 가공부(510)에 제공되는 비정형 데이터(NSD)는 "젤"이라는 단어를 포함한다. 따라서 제1 가공부(510)는 "젤"이라는 단어를 "제일"이라는 단어로 변환하여 제1 가공 데이터(PD1)를 제공할 수 있다. 또한 제1 가공부(510)에 제공되는 비정형 데이터(NSD)는 "맛난다"라는 단어를 포함한다. 따라서 제1 가공부(510)는 "맛난다"이라는 단어를 "맛있다"이라는 단어로 변환하여 제1 가공 데이터(PD1)를 제공할 수 있다. 제1 가공 데이터(PD1)는 "콜드스톤아이스클임이 제일 맛있다"일 수 있다. For example, the atypical data (NSD) provided to the first processing portion 510 may be "cold stone ice cream is gel-like ". If the age of the survey subject is ten, the processing criterion inquiry group (PRG) may be the first processing criterion inquiry group (PRG1). The accumulated pre-processing data PRE_PD corresponding to the first machining criterion query group PRG1 may include "gel", "chan", "delicious" and "tastier". The accumulated post-processing data POST_PD corresponding to the pre-cumulative data PRE_PD "gel" and "chan" may be "best". Cumulative post-processing data PRE_PD The accumulated post-processing data POST_PD corresponding to "delicious" and "delicious" may be "delicious". The atypical data NSD provided to the first processing unit 510 includes the word "gel ". Therefore, the first processing section 510 can convert the word "gel" into the word " first "to provide the first processed data PD1. Further, the atypical data NSD provided to the first processing unit 510 includes the word "delicious ". Therefore, the first processing section 510 can convert the word " delicious "into the word " delicious" to provide the first processed data PD1. The first processed data PD1 may be "the coldest ice ice is the most delicious ".

예시적인 실시예에 있어서, 제1 가공부(510)가 제1 가공 데이터(PD1)를 생성하는 과정에서 발생하는 자료들은 누적 가공 전후 데이터에 추가될 수 있다. 예를 들어 10대들이 자주 사용하는 말 중에 하나가 "??오"라는 단어일 수 있다. "??오"는 "최고"라는 의미를 나타낼 수 있다. "최고"는 "제일"이라는 단어와 동일한 의미일 수 있다. 따라서 제1 가공부(510)에 제공되는 비정형 데이터(NSD)에 "??오"라는 단어가 포함되는 경우, "??오"라는 표현은 "제일"이라는 단어로 변환될 수 있다. 이 경우, "??오"라는 단어는 누적 가공 전 데이터(PRE_PD)에 포함될 수 있다. 누적 가공 전 데이터(PRE_PD) "??오"의 누적 가공 후 데이터(POST_PD)는 "제일"일 수 있다. 제1 가공부(510)의 동작이 반복됨에 따라 누적 가공 전후 데이터는 증가할 수 있다. 누적 가공 전후 데이터가 증가함에 따라 제1 가공부(510)를 포함하는 전체 데이터 처리 시스템(10)의 정확도를 높이면서 동작 속도도 증가할 수 있다. In the exemplary embodiment, data generated in the process of generating the first processing data PD1 by the first processing unit 510 may be added to data before and after the cumulative processing. For example, one of the words that teens often use can be the word "oh". "Oh" can mean "best". The word "best" may have the same meaning as the word "first. &Quot; Therefore, when the word "??" is included in the atypical data NSD provided to the first processing unit 510, the expression ?? ?? can be converted into the word ??? best`. In this case, the word " ?? "may be included in the accumulated pre-processing data PRE_PD. The accumulated post-processing data POST_PD of the pre-cumulative pre-processing data PRE_PD "??" may be "best". As the operation of the first machining unit 510 is repeated, data before and after the cumulative machining can be increased. As the data before and after the cumulative machining operation increases, the operating speed can be increased while increasing the accuracy of the entire data processing system 10 including the first machining portion 510.

도 4는 도 1의 데이터 처리 시스템에 포함되는 제2 가공부의 일 동작 예를 설명하기 위한 도면이다.4 is a view for explaining an example of operation of the second processing unit included in the data processing system of FIG.

도 4를 참조하면, 제2 가공부(530)는 제1 가공 데이터(PD1)에 대한 문법적 오류를 수정하여 제2 가공 데이터(PD2)를 제공한다. 제2 가공부(530)로 제공되는 제1 가공 데이터(PD1)는 문법적 오류를 포함할 수 있다. 예를 들어, 제1 가공 데이터(PD1)는 "콜드스톤아이스클임이 제일 맛있다"일 수 있다. 제1 가공 데이터(PD1)에 포함되는"콜드스톤아이스클임"은 문법적 오류를 포함한다. "콜드스톤아이스클임"에 포함되는 문법적 오류는 띄어쓰기와 맞춤법 오류일 수 있다. "콜드스톤"과 "아이스클임"사이에는 띄어쓰기를 하면 문법적 오류 중 띄어쓰기 오류를 수정할 수 있다. 또한"아이스클임"을 "아이스크림"으로 수정하면 문법적 오류 중 맞춤법 오류를 수정할 수 있다. 제2 가공부(530)는 제1 가공 데이터(PD1)에 해당하는 "콜드스톤아이스클임"대한 문법적 오류인 띄어쓰기 및 맞춤법 오류를 수정하여 제2 가공 데이터(PD2)를 제공할 수 있다. 제2 가공 데이터(PD2)는 "콜드스톤 아이스크림이 제일 맛있다"일 수 있다. Referring to FIG. 4, the second machining unit 530 corrects the grammatical error of the first machining data PD1 to provide the second machining data PD2. The first processing data PD1 provided to the second processing unit 530 may include a grammatical error. For example, the first processed data PD1 may be "the coldest ice ice is the most delicious ". The "cold stone ice block" included in the first processing data PD1 includes a grammatical error. The grammatical errors included in "Cold Stone Ice Cream" can be spacing and spelling errors. Spacing between "Cold Stone" and "Ice Cream" can correct the spacing error of grammatical errors. You can also fix "spelling errors" in grammatical errors by modifying "ice creams" to "ice cream". The second machining unit 530 may provide the second machining data PD2 by correcting the spacing and the spelling error which are grammatical errors for the "cold stone ice chime" corresponding to the first machining data PD1. The second processed data PD2 may be "cold stone ice cream is the most delicious ".

예시적인 실시예에 있어서, 제2 가공부(530)가 제2 가공 데이터(PD2)를 생성하는 과정에서 발생하는 자료들은 문법적 오류 데이터 또는 누적 가공 전후 데이터에 추가될 수 있다. 제2 가공부(530)의 동작이 반복됨에 따라 문법적 오류 데이터 및 누적 가공 전후 데이터는 증가할 수 있다. 문법적 오류 데이터 및 누적 가공 전후 데이터가 증가함에 따라 제1 가공부(510) 및 제2 가공부(530)를 포함하는 전체 데이터 처리 시스템(10)의 정확도를 높이면서 동작 속도도 증가할 수 있다.In the exemplary embodiment, the data generated in the process of generating the second processed data PD2 by the second processed portion 530 may be added to the grammatical error data or the data before and after the cumulative process. As the operation of the second processing unit 530 is repeated, the grammatical error data and the data before and after the cumulative processing can be increased. As the grammatical error data and cumulative machining data increase, the operating speed can be increased while increasing the accuracy of the entire data processing system 10 including the first machining portion 510 and the second machining portion 530.

도 5는 도 1의 데이터 처리 시스템에 포함되는 제3 가공부의 일 동작 예를 설명하기 위한 도면이다.5 is a diagram for explaining an example of operation of the third processing unit included in the data processing system of FIG.

도 5를 참조하면, 제3 가공부(550)는 동의어 정보에 기초하여 제2 가공 데이터(PD2)를 하나의 대표 데이터로 그룹화하여 대표 데이터를 제3 가공 데이터(PD3)로서 제공한다. 예를 들어 제1 가공부(510) 및 제2 가공부(530)를 통해서 생성되는 제2 가공 데이터(PD2)는 "콜드스톤이 맛있다"일 수 있다. 또한 제1 가공부(510) 및 제2 가공부(530)를 통해서 생성되는 제2 가공 데이터(PD2)는 "콜드스톤이 제일 맛있다"일 수 있다. 제1 가공부(510) 및 제2 가공부(530)를 통해서 생성되는 제2 가공 데이터(PD2)는 "콜드스톤이 제일 낫다"일 수 있다. "콜드스톤이 맛있다", "콜드스톤이 제일 맛있다" 및 "콜드스톤이 제일 낫다"는 "콜드스톤 맛있다"와 동의어일 수 있다. 이 경우, "콜드스톤 맛있다"는 제2 가공 데이터(PD2)들을 하나로 그룹화할 수 있는 대표 데이터일 수 있다. 제3 가공부(550)는 동의어 정보에 기초하여 제2 가공 데이터(PD2)에 해당하는 "콜드스톤이 맛있다", "콜드스톤이 제일 맛있다" 및 "콜드스톤이 제일 낫다"를 하나의 대표 데이터"콜드스톤 맛있다"로 그룹화하여 대표 데이터를 제3 가공 데이터(PD3)로서 제공한다. 제3 가공 데이터(PD3)는 "콜드스톤 맛있다"일 수 있다. Referring to FIG. 5, the third processing section 550 groups the second processed data PD2 into one representative data based on synonym information, and provides representative data as the third processed data PD3. For example, the second machining data PD2 generated through the first machining portion 510 and the second machining portion 530 may be "cold stone is good ". The second processed data PD2 generated through the first machining portion 510 and the second machining portion 530 may be "the cold stone is the most delicious ". The second processed data PD2 generated through the first machining portion 510 and the second machining portion 530 may be "the cold stone is the best." "Cold stone is delicious", "Cold stone is the best" and "Cold stone is the best" can be synonymous with "Cold stone is good". In this case, "cold stone delicious" may be representative data capable of grouping the second processed data PD2 into one. The third processing section 550 determines whether or not the "cold stone is delicious", "cold stone is the most delicious" and "cold stone is the best" corresponding to the second processing data PD2 based on the synonym information, And "cold stone delicious" to provide representative data as the third processed data PD3. The third processed data PD3 may be "cold stone delicious ".

예시적인 실시예에 있어서, 제3 가공부(550)가 제3 가공 데이터(PD3)를 생성하는 과정에서 발생하는 자료들은 동의어 정보에 추가될 수 있다. 제3 가공부(550)의 동작이 반복됨에 따라 동의어 정보는 증가할 수 있다. 동의어 정보가 증가함에 따라 제3 가공부(550)를 포함하는 전체 데이터 처리 시스템(10)의 정확도를 높이면서 동작 속도도 증가할 수 있다.In the exemplary embodiment, data generated in the process of generating the third processed data PD3 by the third processed portion 550 may be added to the synonym information. As the operation of the third processing unit 550 is repeated, the synonym information can be increased. As the synonym information increases, the operating speed can be increased while increasing the accuracy of the entire data processing system 10 including the third processing unit 550.

결과부(700)는 정형 데이터(SD) 및 제3 가공 데이터(PD3)를 분석한다. 예를 들어. 정형 데이터(SD)는 객관식 설문 조사의 결과인 숫자일 수 있다. 또한 정형 데이터(SD)는 주관식 설문 조사의 결과 중 숫자일 수 있다. 제3 가공 데이터(PD3)는 "콜드스톤 맛있다"일 수 있다. 결과부(700)는 정형 데이터(SD)에 해당하는 숫자 및 비정형 데이터(NSD)에 해당하는 "콜드스톤 맛있다"를 이용하여 데이터 처리 결과를 제공할 수 있다. The resultant portion 700 analyzes the shaping data SD and the third processed data PD3. E.g. Formal data (SD) may be a number that is the result of a multiple-choice survey. Also, the formatted data (SD) can be a number among the results of the questionnaire survey. The third processed data PD3 may be "cold stone delicious ". The result unit 700 can provide the data processing result using the number corresponding to the formatted data SD and the "cold stone delicious" corresponding to the irregular data NSD.

도 6은 도 1의 데이터 처리 시스템에서 사용되는 개인 정보의 일 예를 나타내는 도면이다.6 is a diagram showing an example of personal information used in the data processing system of FIG.

도 6을 참조하면, 가공 기준 조회 그룹(PRG)은 설문 조사 대상자의 나이, 성별 또는 직업에 따라 결정될 수 있다. 예를 들어, 가공 기준 조회 그룹(PRG)은 성별에 따라 결정될 수 있다. 설문 조사 대상자의 성별이 남성인 경우, 가공 기준 조회 그룹(PRG)은 제1 가공 기준 조회 그룹(PRG1)일 수 있다. 설문 조사 대상자의 성별이 여성인 경우, 가공 기준 조회 그룹(PRG)은 제2 가공 기준 조회 그룹(PRG2)일 수 있다. 예를 들어, 가공 기준 조회 그룹(PRG)은 직업에 따라 결정될 수 있다. 설문 조사 대상자의 직업이 회사원인 경우, 가공 기준 조회 그룹(PRG)은 제1 가공 기준 조회 그룹(PRG1)일 수 있다. 설문 조사 대상자의 직업이 학생인 경우, 가공 기준 조회 그룹(PRG)은 제2 가공 기준 조회 그룹(PRG2)일 수 있다. 설문 조사 대상자의 직업이 주부인 경우, 가공 기준 조회 그룹(PRG)은 제3 가공 기준 조회 그룹(PRG3)일 수 있다.Referring to FIG. 6, the processing reference group (PRG) may be determined according to the age, sex, or occupation of the surveyee. For example, a machining criterion query group (PRG) may be determined according to gender. If the sex of the survey subject is male, the machining criterion group PRG may be the first machining criterion group PRG1. If the gender of the survey subject is female, the machining criterion group PRG may be the second machining criterion group PRG2. For example, a processing reference group (PRG) can be determined by job. If the job of the survey subject is a company, the processing reference group (PRG) may be the first processing reference group (PRG1). If the subject of the questionnaire is a student, the machining criterion group PRG may be the second machining criterion group PRG2. If the subject of the questionnaire is a housewife, the processing criterion inquiry group (PRG) may be the third processing criterion inquiry group (PRG3).

가공 기준 조회 그룹(PRG)은 설문 조사 대상자의 나이, 성별 또는 직업의 조합에 따라 결정될 수 있다. 예를 들어 가공 기준 조회 그룹(PRG)은 나이 및 성별에 따라 결정될 수 있다. 설문 조사 대상자의 성별이 남성이고, 나이가 10대인 경우, 가공 기준 조회 그룹(PRG)은 제1 가공 기준 조회 그룹(PRG1)일 수 있다. 설문 조사 대상자의 성별이 남성이고, 나이가 20대인 경우, 가공 기준 조회 그룹(PRG)은 제2 가공 기준 조회 그룹(PRG2)일 수 있다. 설문 조사 대상자의 성별이 남성이고, 나이가 90대인 경우, 가공 기준 조회 그룹(PRG)은 제9 가공 기준 조회 그룹(PRG9)일 수 있다. 설문 조사 대상자의 성별이 여성이고, 나이가 10대인 경우, 가공 기준 조회 그룹(PRG)은 제10 가공 기준 조회 그룹(PRG10)일 수 있다. 설문 조사 대상자의 성별이 여성이고, 나이가 90대인 경우, 가공 기준 조회 그룹(PRG)은 제18 가공 기준 조회 그룹(PRG18)일 수 있다.The processing reference group (PRG) may be determined by the age, gender, or combination of occupations of the surveyee. For example, the processing reference group (PRG) can be determined according to age and gender. If the sex of the survey subject is male and the age is ten, the machining criterion group PRG may be the first machining criterion group PRG1. If the sex of the survey subject is male and the age is twenties, the machining criterion group PRG may be the second machining criterion group PRG2. If the gender of the survey subject is male and the age is 90 years, the processing reference group (PRG) may be the ninth processing reference group (PRG9). If the gender of the survey subject is female and the age is 10, the processing standard reference group (PRG) can be the 10th processing standard reference group (PRG10). If the gender of the questionnaire is female and the age is 90 years, the Processing Criteria Inquiry Group (PRG) may be the 18th Processing Criteria Inquiry Group (PRG18).

도 7은 도 1의 제1 가공부에 포함되는 누적 가공 전후 데이터 저장부를 나타내는 도면이다.7 is a view showing a data storage unit before and after the cumulative processing included in the first processing unit of FIG.

도 7을 참조하면, 제1 가공부(510)는 누적 가공 전후 데이터 저장부(511)를 포함할 수 있다. 누적 가공 전후 데이터 저장부(511)는 누적 가공 전후 데이터를 저장할 수 있다. 누적 가공 전후 데이터는 누적 가공 전 데이터(PRE_PD) 및 누적 가공 후 데이터(POST_PD)를 포함할 수 있다. 예를 들어, 누적 가공 전 데이터(PRE_PD)는 "젤" 및 "짱"을 포함할 수 있다. 누적 가공 후 데이터(POST_PD)는 "제일"을 포함할 수 있다. "젤", "짱" 및 "제일"은 누적 가공 전후 데이터 저장부(511)에 저장될 수 있다. Referring to FIG. 7, the first processing unit 510 may include a data storage unit 511 before and after the cumulative processing. The cumulative machining data storage unit 511 can store data before and after the cumulative machining. The pre-cumulative machining data may include pre-cumulative machining data PRE_PD and cumulative machining data POST_PD. For example, the accumulated pre-processing data PRE_PD may include "gel" and "chan". The post-cumulative post-processing data POST_PD may include "first ". The "gel", "chan" and "first" can be stored in the data storage unit 511 before and after the cumulative processing.

비정형 데이터(NSD)가 누적 가공 전 데이터(PRE_PD) 중 하나와 일치하는 경우, 제1 가공부(510)는 상응하는 누적 가공 후 데이터(POST_PD)를 제공할 수 있다. 예를 들어, 제1 가공부(510)에 제공되는 비정형 데이터(NSD)는 "콜드스톤아이스클임이 젤 맛난다"일 수 있다. 설문 조사 대상자의 나이가 10대인 경우, 가공 기준 조회 그룹(PRG)은 제1 가공 기준 조회 그룹(PRG1)일 수 있다. 제1 가공 기준 조회 그룹(PRG1)에 상응하는 누적 가공 전 데이터(PRE_PD)는 "젤" 및 "짱"을 포함할 수 있다. 비정형 데이터(NSD) 중 "젤"은 누적 가공 전 데이터(PRE_PD) 중 "젤"과 일치할 수 있다. 이 경우, 제1 가공부(510)는 "젤"에 상응하는 누적 가공 후 데이터(POST_PD) "제일"을 제공할 수 있다. If the atypical data NSD coincides with one of the accumulated pre-processing data PRE_PD, the first processing unit 510 can provide the corresponding accumulated post-processing data POST_PD. For example, the atypical data (NSD) provided to the first processing portion 510 may be "cold stone ice cream is gel-like ". If the age of the survey subject is ten, the processing criterion inquiry group (PRG) may be the first processing criterion inquiry group (PRG1). The accumulated pre-processing data PRE_PD corresponding to the first machining reference query group PRG1 may include "gel" and "chan". "Gel" in the non-regular data (NSD) may coincide with "gel" in the pre-cumulative data (PRE_PD). In this case, the first processed portion 510 can provide post-cumulative post-processing data POST_PD corresponding to "gel ".

누적 가공 전후 데이터 저장부(511)는 제1 가공부(510) 내부에 표시되어 있으나, 제1 가공부(510)의 구현 방법에 따라서 누적 가공 전후 데이터 저장부(511)는 제1 가공부(510) 외부에 구현될 수 있다. The cumulative machining data storage unit 511 is displayed in the first machining unit 510 but the cumulative machining data storage unit 511 stores the data before and after the cumulative machining process in the first machining unit 510 510).

예시적인 실시예에 있어서, 제1 가공부(510)가 비정형 데이터(NSD)를 보정하여 제1 가공 데이터(PD1)를 생성하는 경우, 비정형 데이터(NSD) 및 제1 가공 데이터(PD1)는 누적 가공 전후 데이터에 추가될 수 있다. In the exemplary embodiment, when the first processing portion 510 generates the first processing data PD1 by correcting the irregular data NSD, the irregular data NSD and the first processing data PD1 are accumulated Can be added to the data before and after machining.

예시적인 실시예에 있어서, 제1 가공부(510)는 누적 가공 전후 데이터를 저장하는 누적 가공 전후 데이터 저장부(511)를 포함할 수 있다. 누적 가공 전후 데이터 저장부(511)는 비정형 데이터(NSD)를 누적 가공 전 데이터(PRE_PD)로서 저장하고, 제1 가공 데이터(PD1)를 누적 가공 후 데이터(POST_PD)로서 저장할 수 있다.In the exemplary embodiment, the first processing unit 510 may include a pre-cumulative data storage unit 511 that stores data before and after the cumulative processing. The cumulative machining data storage unit 511 stores the irregular data NSD as the pre-cumulative data PRE_PD and the first machining data PD1 as the post-cumulative data POST_PD.

예시적인 실시예에 있어서, 제1 가공부(510)가 제1 가공 데이터(PD1)를 생성하는 과정에서 발생하는 자료들은 누적 가공 전후 데이터에 추가될 수 있다. 제1 가공부(510)의 동작이 반복됨에 따라 누적 가공 전후 데이터는 증가할 수 있다. 누적 가공 전후 데이터가 증가함에 따라 제1 가공부(510)를 포함하는 전체 데이터 처리 시스템(10)의 정확도를 높이면서 동작 속도도 증가할 수 있다.In the exemplary embodiment, data generated in the process of generating the first processing data PD1 by the first processing unit 510 may be added to data before and after the cumulative processing. As the operation of the first machining unit 510 is repeated, data before and after the cumulative machining can be increased. As the data before and after the cumulative machining operation increases, the operating speed can be increased while increasing the accuracy of the entire data processing system 10 including the first machining portion 510.

도 8은 도 1의 제2 가공부에 포함되는 문법적 오류 데이터 저장부를 나타내는 도면이다.8 is a diagram showing a grammatical error data storage unit included in the second processing unit of FIG.

도 8을 참조하면, 제2 가공부는 문법적 오류 데이터 저장부(531)를 포함할 수 있다. Referring to FIG. 8, the second processing unit may include a grammatical error data storage unit 531.

예시적인 실시예에 있어서, 제2 가공부(530)는 제1 가공 데이터(PD1)에 포함되는 맞춤법 오류 및 표준어 오류를 수정하여 제2 가공 데이터(PD2)를 제공할 수 있다. 예를 들어, 제1 가공 데이터(PD1)는 "콜드스톤아이스클임이 제일 맛있다"일 수 있다. 제1 가공 데이터(PD1)에 포함되는"콜드스톤아이스클임"은 문법적 오류를 포함한다. "콜드스톤아이스클임"에 포함되는 문법적 오류는 띄어쓰기와 맞춤법 오류일 수 있다. "콜드스톤아이스클임"에 포함되는 문법적 오류 데이터는 제2 가공부(530)에 포함되는 문법적 오류 데이터 저장부(531)에 저장될 수 있다. 제2 가공부(530)는 문법적 오류 데이터 저장부(531)에 저장되는 문법적 오류 데이터에 기초하여"콜드스톤"과 "아이스클임"사이의 띄어쓰기 오류를 수정할 수 있다. 또한 맞춤법 오류에 해당하는 "아이스클임"을 "아이스크림"으로 수정할 수 있다. 제2 가공부(530)는 제1 가공 데이터(PD1)에 해당하는 "콜드스톤아이스클임"대한 문법적 오류인 띄어쓰기 및 맞춤법 오류를 수정하여 제2 가공 데이터(PD2)를 제공할 수 있다. 제2 가공 데이터(PD2)는 "콜드스톤 아이스크림이 제일 맛있다"일 수 있다.In the exemplary embodiment, the second machining unit 530 may provide the second machining data PD2 by correcting a spelling error and a standardized word error contained in the first machining data PD1. For example, the first processed data PD1 may be "the coldest ice ice is the most delicious ". The "cold stone ice block" included in the first processing data PD1 includes a grammatical error. The grammatical errors included in "Cold Stone Ice Cream" can be spacing and spelling errors. The grammatical error data included in the " cold stone ice block "can be stored in the grammatical error data storage unit 531 included in the second processing unit 530. [ The second processing unit 530 can correct a spacing error between "cold stone" and "ice clock" based on the grammatical error data stored in the grammatical error data storage unit 531. [ You can also modify the "Ice Cream" that corresponds to the spelling error to "Ice Cream". The second machining unit 530 may provide the second machining data PD2 by correcting the spacing and the spelling error which are grammatical errors for the "cold stone ice chime" corresponding to the first machining data PD1. The second processed data PD2 may be "cold stone ice cream is the most delicious ".

문법적 오류 데이터 저장부(531)는 제2 가공부(530) 내부에 표시되어 있으나, 제2 가공부(530)의 구현 방법에 따라서 문법적 오류 데이터 저장부(531)는 제2 가공부(530) 외부에 구현될 수 있다.The grammatical error data storage unit 531 is displayed in the second processing unit 530 according to the implementation method of the second processing unit 530, Can be implemented externally.

예시적인 실시예에 있어서, 제2 가공부(530)는 제1 가공 데이터(PD1)에 대한 문법적 오류를 수정하여 제2 가공 데이터(PD2)를 생성하는 과정에서 발생되는 문법적 오류 데이터를 저장하는 문법적 오류 데이터 저장부(531)를 포함할 수 있다.In the exemplary embodiment, the second processing unit 530 includes a grammatical error storing grammatical error data generated in the process of generating the second processed data PD2 by correcting a grammatical error in the first processed data PD1 And an error data storage unit 531.

예시적인 실시예에 있어서, 제2 가공부(530)는 문법적 오류 데이터 저장부(531)에 포함되는 문법적 오류 데이터에 기초하여 제2 가공 데이터(PD2)를 제공할 수 있다.In the exemplary embodiment, the second processing unit 530 may provide the second processing data PD2 based on the grammatical error data included in the grammatical error data storage unit 531. [

예시적인 실시예에 있어서, 제2 가공부(530)가 제2 가공 데이터(PD2)를 생성하는 과정에서 발생하는 자료들은 문법적 오류 데이터 저장부(531) 또는 누적 가공 전후 데이터 저장부(511)에 추가될 수 있다. 제2 가공부(530)의 동작이 반복됨에 따라 문법적 오류 데이터 및 누적 가공 전후 데이터는 증가할 수 있다. 문법적 오류 데이터 및 누적 가공 전후 데이터가 증가함에 따라 제1 가공부(510) 및 제2 가공부(530)를 포함하는 전체 데이터 처리 시스템(10)의 정확도를 높이면서 동작 속도도 증가할 수 있다.In the exemplary embodiment, data generated in the process of generating the second processed data PD2 by the second processed portion 530 may be stored in the grammatical error data storage unit 531 or in the pre-cumulative data storage unit 511 Can be added. As the operation of the second processing unit 530 is repeated, the grammatical error data and the data before and after the cumulative processing can be increased. As the grammatical error data and cumulative machining data increase, the operating speed can be increased while increasing the accuracy of the entire data processing system 10 including the first machining portion 510 and the second machining portion 530.

도 9는 도 1의 제3 가공부에 포함되는 동의어 데이터 베이스의 일 예를 나타내는 도면이고, 도 10은 도 1의 제3 가공부에 포함되는 동의어 데이터 베이스의 다른 예를 나타내는 도면이다.FIG. 9 is a view showing an example of a synonym database included in the third processed portion in FIG. 1, and FIG. 10 is a view showing another example of the synonym database included in the third processed portion in FIG.

도 9를 참조하면, 제3 가공부(550)는 동의어 데이터 베이스(551)를 포함할 수 있다. 동의어 정보는 동의어 그룹 및 대표 동의어를 포함하는 동의어 데이터 베이스(551)로부터 제공될 수 있다. 예를 들어 동의어 그룹은 "콜드스톤이 맛있다" 및 "콜드스톤이 제일 맛있다"일 수 있다. 대표 동의어는 "콜드스톤 맛있다"일 수 있다. 동의어 정보는 동의어 그룹에 해당하는 "콜드스톤이 맛있다" 및 "콜드스톤이 제일 맛있다"와 대표 동의어에 해당하는 "콜드스톤 맛있다"일 수 있다.Referring to FIG. 9, the third processing unit 550 may include a synonym database 551. The synonym information may be provided from a synonym database 551 that includes a synonym group and a representative synonym. For example, a synonym group could be "cold stone is good" and "cold stone is the best." Representative synonyms can be "cold stone delicious". The synonym information can be "cold stone good" and "cold stone good" which correspond to the synonym group and "cold stone good" corresponding to the representative synonyms.

예시적인 실시예에 있어서, 제3 가공부(550)가 제3 가공 데이터(PD3)를 생성하는 과정에서 발생하는 자료들은 동의어 정보에 추가될 수 있다. 예를 들어 "콜드스톤 맛있다"와 동의어에 해당하는 "콜드스톤이 제일 낫다"는 동의어 정보에 추가될 수 있다. In the exemplary embodiment, data generated in the process of generating the third processed data PD3 by the third processed portion 550 may be added to the synonym information. For example, "cold stone is the best" and synonyms "cold stone is the best" can be added to the synonym information.

예시적인 실시예에 있어서, 제3 가공부(550)가 제2 가공 데이터(PD2)를 하나의 대표 데이터로 그룹화하여 대표 데이터를 제3 가공 데이터(PD3)로서 제공하는 과정에서 발생되는 정보들은 동의어 정보에 추가될 수 있다. 제3 가공부(550)는 추가된 동의어 정보에 기초하여 제3 가공 데이터(PD3)를 제공할 수 있다.In the exemplary embodiment, the information generated in the process of grouping the second processed data PD2 into one representative data and providing the representative data as the third processed data PD3 in the third processed portion 550 is a synonym Can be added to the information. The third processing section 550 can provide the third processed data PD3 based on the added synonym information.

도 11은 본 발명의 실시예들에 따른 데이터 처리 시스템의 동작 방법을 나타내는 순서도이다. 11 is a flowchart illustrating an operation method of a data processing system according to embodiments of the present invention.

도 1 및 도 11을 참조하면, 데이터 처리 시스템(10)은 단말기(100), 데이터 분류기(300), 제1 가공부(510), 제2 가공부(530), 제3 가공부(550) 및 결과부(700)를 포함한다. 단말기(100)는 설문 조사 결과(SR)를 제공한다. 데이터 분류기(300)는 설문 조사 결과(SR)를 정형 데이터(SD) 및 비정형 데이터(NSD)로 구분한다. 제1 가공부(510)는 설문 조사 대상자의 개인 정보(PI)에 기초하여 가공 기준 조회 그룹(PRG)을 결정한다. 제1 가공부(510)는 가공 기준 조회 그룹(PRG)에 상응하는 누적 가공 전후 데이터에 따라 비정형 데이터(NSD)를 보정하여 제1 가공 데이터(PD1)를 제공한다. 제2 가공부(530)는 제1 가공 데이터(PD1)에 대한 문법적 오류를 수정하여 제2 가공 데이터(PD2)를 제공한다. 제3 가공부(550)는 동의어 정보에 기초하여 제2 가공 데이터(PD2)를 하나의 대표 데이터로 그룹화하여 대표 데이터를 제3 가공 데이터(PD3)로서 제공한다. 결과부(700)는 정형 데이터(SD) 및 제3 가공 데이터(PD3)를 분석한다.1 and 11, the data processing system 10 includes a terminal 100, a data classifier 300, a first processing unit 510, a second processing unit 530, a third processing unit 550, And a result portion 700. The terminal 100 provides a survey result (SR). The data classifier 300 divides the survey result (SR) into the form data (SD) and the atypical data (NSD). The first processing unit 510 determines the machining criterion group PRG based on the personal information PI of the person to be surveyed. The first machining unit 510 corrects the irregular data NSD according to the pre-cumulative machining data corresponding to the machining criterion inquiry group PRG to provide the first machining data PD1. The second machining unit 530 corrects the grammatical error of the first machining data PD1 to provide the second machining data PD2. The third processing section 550 groups the second processed data PD2 into one representative data based on the synonym information, and provides the representative data as the third processed data PD3. The resultant portion 700 analyzes the shaping data SD and the third processed data PD3.

데이터 처리 시스템(10)의 동작 방법은 설문 조사 결과(SR)를 정형 데이터(SD) 및 비정형 데이터(NSD)로 구분한다(S100). 설문 조사 대상자의 개인 정보(PI)에 기초하여 가공 기준 조회 그룹(PRG)을 결정하고, 가공 기준 조회 그룹(PRG)에 상응하는 누적 가공 전후 데이터에 따라 비정형 데이터(NSD)를 보정하여 제1 가공 데이터(PD1)를 제공한다(S101). 제1 가공 데이터(PD1)에 대한 문법적 오류를 수정하여 제2 가공 데이터(PD2)를 제공한다(S102). 동의어 정보에 기초하여 제2 가공 데이터(PD2)를 하나의 대표 데이터로 그룹화하여 대표 데이터를 제3 가공 데이터(PD3)로서 제공한다(S103).The operation method of the data processing system 10 divides a survey result (SR) into regular data (SD) and unstructured data (NSD) (SlOO). (PRG) on the basis of the personal information (PI) of the person to be surveyed, and corrects the irregular data (NSD) according to the accumulated before and after data corresponding to the machining criterion group (PRG) And provides data PD1 (S101). The second machining data PD2 is provided by correcting the grammatical error of the first machining data PD1 (S102). The second processed data PD2 is grouped into one representative data based on the synonym information and the representative data is provided as the third processed data PD3 (S103).

도 12는 본 발명의 일 실시예에 따른 데이터 처리 시스템의 동작 방법을 나타내는 순서도이고, 도 13은 라벨리스트 동작의 일 예를 나타내는 도면이고, 도 14는 보정 데이터 베이스 검색 결과 및 보정 결과를 출력하는 과정을 설명하기 위한 도면이다.FIG. 12 is a flowchart showing an operation method of a data processing system according to an embodiment of the present invention, FIG. 13 is a diagram illustrating an example of a label list operation, FIG. 14 is a flowchart illustrating a method of outputting a correction database search result and a correction result Fig.

도12 내지 14를 참조하면, 제3 가공 데이터(PD3)를 단어 별로 분리하여 라벨리스트를 생성하고, 보정 데이터 베이스를 검색하여 라벨리스트와 일치하는 보정 결과를 출력할 수 있다(S104). 예를 들어 제3 가공 데이터(PD3)는 "삼일전자가 복지혜택과 연봉수준이 제일 좋다"인 경우, 제3 가공 데이터(PD3)를 단어별로 분리하여 라벨리스트를 생성하면 {삼일전자, 복지, 혜택, 연봉, 수준, 제일, 좋다}일 수 있다. 라벨리스트 {삼일전자, 복지, 혜택, 연봉, 수준, 제일, 좋다}을 보정 데이터 베이스를 통해서 검색하면, 검색 결과는 {삼일전자, 복지, 연봉, 좋다}일 수 있다. 이 경우, 보정 결과는 "삼일전자가 복지와 연봉이 좋다"일 수 있다. 라벨리스트를 생성하고 보정결과를 출력하는 동작은 결과부(700)에서 수행될 수 있다. Referring to FIGS. 12 to 14, the third processed data PD3 is divided into words to generate a label list, and the correction database is searched to output a correction result matching the label list (S104). For example, in the case where the third processed data PD3 is "good for the welfare benefit and salary level" of the third processed data PD3, the third processed data PD3 is divided into words to generate a label list { Benefit, salary, level, best, good}. If you search through the calibration database for the label list {Samil E, Welfare, Benefit, Salary, Level, Best, Good}, the search results can be {Samil E, Welfare, Salary, Good}. In this case, the result of the correction may be "Samil E < RTI ID = 0.0 > The operation of generating the label list and outputting the correction result may be performed in the result portion 700.

본 발명에 따른 제1 내지 제3 가공부(510, 530, 550)를 포함하는 데이터 처리 시스템(10) 및 데이터 처리 시스템(10)의 동작 방법을 사용하면, 제1 내지 제3 가공부(510, 530, 550)의 과거 데이터 베이스를 누적하여 비정형 데이터(NSD)를 처리할 수 있어 데이터 처리 시스템(10)의 정확도 및 동작 속도를 증가시킬 수 있다. Using the operation method of the data processing system 10 and the data processing system 10 including the first through third processing parts 510, 530 and 550 according to the present invention, the first through third processing parts 510 , 530, and 550 can be accumulated to process the atypical data (NSD), thereby increasing the accuracy and the operation speed of the data processing system 10.

본 발명의 실시예들에 따른 데이터 처리 시스템은 제1 내지 제3 가공부의 과거 데이터 베이스를 누적하여 비정형 데이터를 처리함으로써 데이터 처리 시스템의 정확도 및 동작 속도를 증가 시킬 수 있어 데이터 처리 시스템을 사용하는 설문 조사 시스템에 적용될 수 있다. The data processing system according to the embodiments of the present invention can increase the accuracy and operation speed of the data processing system by accumulating past data bases of the first to third processing units to process irregular data, Can be applied to an inspection system.

상기에서는 본 발명이 바람직한 실시예를 참조하여 설명하였지만, 해당 기술분야의 숙련된 당업자는 하기의 특허청구범위에 기재된 본 발명의 사상 및 영역으로부터 벗어나지 않는 범위 내에서 본 발명을 다양하게 수정 및 변경시킬 수 있음을 이해할 것이다.While the present invention has been described with reference to the preferred embodiments thereof, it will be apparent to those skilled in the art that various modifications and variations can be made in the present invention without departing from the spirit and scope of the invention as defined in the appended claims. It will be understood.

Claims

A terminal for providing a survey result;
A data classifier for classifying the result of the survey into regular data and unstructured data;
A first processing unit for determining a machining criterion inquiry group on the basis of personal information of a survey subject and correcting the irregular data according to the cumulative machining data corresponding to the machining criterion inquiry group to provide first machining data;
A second processing unit for correcting a grammatical error of the first processed data to provide second processed data;
A third processing unit for grouping the second processed data into one representative data based on synonym information and providing the representative data as third processed data; And
And a result portion for analyzing the shape data and the third processed data.

The method according to claim 1,
Wherein the processing criterion inquiry group is determined according to the age, sex or occupation of the survey subject.

3. The method of claim 2,
Wherein the pre-cumulative machining data includes cumulative machining data and cumulative machining data,
And if the atypical data matches one of the pre-cumulative data, the first processing unit provides the corresponding post-cumulative data.

The method of claim 3,
Wherein the second processing unit corrects a spelling error and a standard word error included in the first processing data to provide second processing data.

5. The method of claim 4,
Wherein the synonym information is provided from a synonym database including a synonym group and a representative synonym.

The method according to claim 1,
When the first machining unit corrects the atypical data to generate the first machining data,
Wherein the atypical data and the first processed data are added to the before and after accumulated processing data.

7. The image forming apparatus according to claim 6,
And a cumulative machining data storage section for storing the data before and after the cumulative machining,
Wherein the data storage section before and after the cumulative processing stores the irregular data as the pre-cumulative data, and stores the first processed data as the cumulative post-processing data.

The apparatus according to claim 1,
And a grammatical error data storage unit for storing grammatical error data generated in the process of generating the second processed data by correcting a grammatical error of the first processed data.

9. The method of claim 8,
And the second processing section provides the second processing data based on the grammatical error data included in the grammatical error data storage section.

The method according to claim 1,
Wherein information generated in the process of grouping the second processed data into one representative data and providing the representative data as third processed data is added to the synonym information,
And the third processing unit provides the third processed data based on the added synonym information.

Dividing the survey result into the form data and the atypical data;
Determining a machining criterion inquiry group on the basis of personal information of a surveyee and providing the first machining data by correcting the atypical data according to pre-cumulative machining data corresponding to the machining criterion inquiry group;
Correcting a grammatical error for the first processed data to provide second processed data;
Grouping said second processed data into one representative data based on synonym information and providing said representative data as third processed data.

12. The method of claim 11,
Generating a label list by separating the third processed data by word, searching the correction database, and outputting a correction result matching the label list.