KR102125056B1

KR102125056B1 - System and Method for Generating Character and Book for Mixed Character Automatic Recognition and System and Method for Searching using same

Info

Publication number: KR102125056B1
Application number: KR1020190034627A
Authority: KR
Inventors: 황재준; 이지은; 신우선; 윤지훈; 최윤호
Original assignee: 부산대학교 산학협력단
Priority date: 2019-03-26
Filing date: 2019-03-26
Publication date: 2020-06-19

Abstract

The present invention relates to a character and book generation system for mixed character automatic recognition which increases the efficiency of mixed character recognition and search through deep learning including the formation of individual characters variation and virtual page formation for automatic recognition of mixed characters, to a method thereof, and to a search system and method using the same. The system comprises: a mutation forming unit which learns a variation of individual characters or one side and learns a combination of one side based on an actual letter, and generates all characters which can be combined with the combination of the side; a virtual page forming unit randomly filling in pages of different sizes and shapes with various languages and types of characters and noise generated by the mutation forming unit and generating a book having virtual pages; and a network learning unit learning text regions of pages generated by the virtual page forming unit, classifying and learning what language the characters generated in the mutation forming unit, and learning a location and type of lower components.

Description

System and Method for Generating Characters and Books for Automatic Recognition of Mixed Characters and Searching System and Method Using the Same

본 발명은 혼합문자 자동인식에 관한 것으로, 구체적으로 개별 문자의 변이 형성 및 가상 페이지 형성을 포함하는 딥러닝으로 혼합문자 인식 및 검색의 효율성을 높일 수 있도록 한 혼합문자 자동인식을 위한 문자 및 서적 생성 시스템 및 방법 그리고 이를 이용한 검색 시스템 및 방법에 관한 것이다.The present invention relates to automatic recognition of mixed characters, and specifically, deep learning that includes variation of individual characters and formation of virtual pages to generate characters and books for automatic recognition of mixed characters to increase the efficiency of recognition and search of mixed characters. The present invention relates to a system and method and a search system and method using the same.

현재 다양한 산업분야에서 문자인식기술의 향상이 요구되고 있다.Currently, it is required to improve the character recognition technology in various industries.

예를들어, 은행, 카드회사, 관공서 등 대량의 서류를 신속하고 정확하게 처리하거나, 로봇 응용기술에서 자동제어 능력을 향상시키기 위해 더욱 향상된 문자인식 기술이 요구되고 있다.For example, more advanced character recognition technology is required to rapidly and accurately process large amounts of documents, such as banks, card companies, government offices, or to improve automatic control capabilities in robot application technology.

특히, 이미지 데이터화된 고문서의 전문 검색서비스 실현을 위한 문서인식시스템 구축을 위한 정밀한 문자인식 기술의 개발이 이루어지고 있다.In particular, the development of a precise character recognition technology for the construction of a document recognition system for realization of a specialized search service for image documents and high documents has been developed.

이와 같이 이미지 데이터화된 고문서의 전문검색서비스 실현을 위한 문서인식시스템은 문자인식 기술과의 결합이 필요한데, 문자인식 기술은 입력 문자데이터로부터 특징을 추출하고, 미리 정해진 여러 문자모델과의 매칭을 통해 한 문자 모델로 분류하는 과정으로 구성된다.The document recognition system for realizing a professional search service for high-document documented image data needs to be combined with character recognition technology. Character recognition technology extracts features from input text data and matches them with several predefined character models. It consists of a process of classifying into a character model.

학습(training)과정에서 미리 구축되는 문자모델은 인식을 통해 하나의 개념으로 대응시키는 기준과 같아서, 문자모델의 성질에 따라 인식 대상 자료가 제한되거나 문자인식의 성능에 영향을 미친다.The character model that is pre-built in the training process is the same as the criterion that corresponds to a single concept through recognition, and thus the data to be recognized is limited or affects the performance of character recognition depending on the nature of the character model.

이와 같은 문자인식 관련기술로는 패턴매칭, 코릴레이션 계산기법, 특징점 추출 및 매칭, 뉴럴네트워크 기반 딥러닝 등 다양한 방법이 알려져 있다.Various methods, such as pattern matching, correlation calculation, feature point extraction and matching, and neural network-based deep learning, are known as such character recognition-related technologies.

이러한 기술들은 기준 영상 대비 테스트 영상의 스케일 또는 오리엔테이션 변화에 따라 문자 인식 성능이 크게 좌우된다. 특히 오리엔테이션 보정은 문자 인식률 개선에 지대한 영향을 미친다.In these technologies, character recognition performance greatly depends on a scale or orientation change of a test image compared to a reference image. In particular, orientation correction greatly affects the improvement of character recognition rate.

종래 기술의 일 예로 문자의 경사도와 특징의 상대적인 위치를 바탕으로 한 통계적인 패턴 인식에 관한 기술은 다음과 같은 문제가 있다.(대한민국 등록특허 제10-1900903호)As an example of the prior art, the technique for statistical pattern recognition based on the relative position of the inclination and the character of the character has the following problems. (Registration No. 10-1900903, Republic of Korea)

먼저, 문자가 차치하고 있는 영역 분할 오류는 문자 인식의 오류로 직결되고, 개인의 습관이나 특성의 차이로 인한 수많은 변형을 알아낼 수 없는 문제가 있다.First, an area division error in which characters are placed is directly related to an error in character recognition, and there is a problem in that numerous variations due to differences in personal habits or characteristics cannot be identified.

경사도와 상대적인 위치를 바탕으로 한 패턴인식으로는 약 6만자 가까이 되는 한자나 16만자를 넘는 옛한글(자음과 모음 조합 시)의 경우, 내부의 비슷한 글자들과 개인 특성의 조합의 경우의 수가 너무 많아 특정 글자를 인식하는 데 정확성이 저하되는 문제가 있다.For pattern recognition based on the inclination and relative position, in the case of Chinese characters that are close to 60,000 or more than 160,000 Korean characters (when combined consonants and vowels), the number of similar characters and personal characteristics in the interior is too large. There are many problems in that accuracy is deteriorated in recognizing a specific character.

또한, 여러 문자가 함께 섞여 있는 고문이나 서적의 경우 이런 경우의 수가 거의 무한대로 많아져 정확한 문자 인식이 어렵다.Also, in the case of tortures or books in which several characters are mixed together, the number of cases in this case is almost unlimited, making it difficult to accurately recognize characters.

그리고 종래 기술의 다른 일 예로 딥러닝을 이용한 문자 인식 시스템의 경우에는 특정한 책에서 문자를 일일이 분할하거나, 필기체를 다수의 사람이 작성하는 등 시간이 많이 걸리는 제한된 표본 수집 문제가 있고, 학습 표본수의 제한으로 인한 인식률 저하 문제가 있다.(대한민국 공개특허 제10-2011-0039900호)(대한민국 공개특허 제10-2008-0076433호)Also, as another example of the prior art, in the case of a character recognition system using deep learning, there is a limited sample collection problem that takes a lot of time, such as dividing characters individually in a specific book or writing a number of handwritten characters, and limiting the number of learning samples There is a problem that the recognition rate decreases due to (Republic of Korea Patent Publication No. 10-2011-0039900) (Republic of Korea Patent Publication No. 10-2008-0076433)

이와 같이 종래 기술의 문서인식 시스템을 구축하기 위한 문자인식 기술은 사람의 정보처리 능력을 완벽하게 알고리즘화하지 못해 문자의 난이도에 따라 질적인 측면과 양적인 측면의 어려움이 있다.As described above, the character recognition technology for constructing the document recognition system of the prior art has difficulty in qualitative and quantitative aspects depending on the difficulty of the character because it cannot fully algorithmize the human information processing ability.

예를 들어, 인쇄체 문자의 경우 다양한 활자체(font) 형태나 크기(size), 기울어짐(slant) 등의 불일치로 인하여 어려움이 발생하고, 문자인식의 양적인 측면의 어려움은 다양한 문자의 방대한 양으로 인하여 공학적으로 방대한 기억용량을 갖춘 시스템 구축이 어려운 문제가 있고, 방대한 기억용량을 갖춘 시스템 상에서 특정의 문자를 인식하기 위하여 인식소요시간이 오래 걸리는 문제가 발생한다.For example, in the case of printed characters, difficulties arise due to inconsistencies such as various font shapes, sizes, slants, and the difficulty in quantitative aspects of character recognition due to the vast amount of various characters There is a problem in that it is difficult to construct a system having a large storage capacity engineeringly, and a problem takes a long time to recognize a specific character on a system having a large storage capacity.

이와 같은 문자인식의 질적이고 양적인 난이도는 모든 데이터에 일정한 성능을 가지는 범용의 문자인식 구현을 어렵게 하는 요인이다.The qualitative and quantitative difficulty of such character recognition is a factor that makes it difficult to implement general-purpose character recognition with constant performance for all data.

따라서, 방대한 이미지 데이터화된 문서의 효율적인 텍스트 데이터화 문제를 해결하기 위해서, 문자인식의 난이도를 최소화하기 위한 기술 및 텍스트화가 필요한 문서의 분야에 따라서 각 분야에 해당하는 전문적인 데이터베이스를 구축하여 보다 정확성이 뛰어나고 효율적인 시스템 구축이 요구된다.Therefore, in order to solve the problem of efficient text dataization of large image data documents, it is more accurate by constructing a specialized database for each field according to the field of the document that requires technology and text to minimize the difficulty of character recognition. Efficient system construction is required.

특히, 혼합 문자를 갖거나 고문서와 같이 처리해야 하는 문서의 이미지가 소장 기간이 오래되어 문서 이미지 품질이 낮고 다양한 문서구조를 포함하고, 다양한 폰트에 의해 생성된 대용량 데이터인 경우에는 문자인식의 성능은 더욱 떨어지기 때문에 최적의 전문검색서비스 구현을 위해서는 문자인식 기술에 관한 보다 새로운 기술의 개발이 요구되고 있다.In particular, if the image of a document that has mixed characters or needs to be processed, such as an old document, has a long collection period, the document image quality is low and includes various document structures, and the performance of character recognition is high for large data generated by various fonts. Because of the further decline, the development of new technologies for character recognition technology is required to realize the optimal professional search service.

대한민국 등록특허 제10-1900903호Republic of Korea Registered Patent No. 10-1900903 대한민국 공개특허 제10-2011-0039900호Republic of Korea Patent Publication No. 10-2011-0039900 대한민국 공개특허 제10-2008-0076433호Republic of Korea Patent Publication No. 10-2008-0076433

본 발명은 종래 기술의 혼합문자 자동인식 기술의 문제점을 해결하기 위한 것으로, 개별 문자의 변이 형성 및 가상 페이지 형성을 포함하는 딥러닝으로 혼합문자 인식 및 검색의 효율성을 높일 수 있도록 한 혼합문자 자동인식을 위한 문자 및 서적 생성 시스템 및 방법 그리고 이를 이용한 검색 시스템 및 방법을 제공하는데 그 목적이 있다.The present invention is to solve the problems of the prior art mixed character automatic recognition technology, deep character recognition, including the formation of variations and virtual page formation of individual characters to improve the efficiency of recognition and search of mixed characters automatic recognition of mixed characters The purpose is to provide a system and method for generating text and books, and a search system and method using the same.

본 발명은 문자를 이루는 각 요소의 구성과 위치를 데이터 베이스로 정리되어 디지털화된 각 개별 문자 혹은 편방의 다양한 변이를 VAE(Variational AutoEncoder)나 GAE(Generative Adversal Network) 등으로 학습하여 새로운 이미지를 생성할 수 있도록 하여 혼합문자 인식 및 검색의 효율성을 높일 수 있도록 한 혼합문자 자동인식을 위한 문자 및 서적 생성 시스템 및 방법 그리고 이를 이용한 검색 시스템 및 방법을 제공하는데 그 목적이 있다.According to the present invention, the composition and position of each element constituting the characters are organized into a database, and various variations of each digitized individual character or one side are learned with VAE (Variational AutoEncoder) or GAE (Generative Adversal Network) to generate a new image. The purpose of the present invention is to provide a system and method for generating text and books for automatic recognition of mixed text, and a search system and method using the same, so as to increase the efficiency of searching and searching for mixed text.

본 발명은 생성된 여러 언어 및 종류의 문자들과 노이즈 등을 다양한 크기와 형태의 페이지 안에 랜덤하게 채워 넣어 실제 책과 구별하기 힘든 무한대에 가까운 페이지를 가지는 책을 생성하고, 각 문자의 영역, 형태 및 뜻 등 답을 미리 아는 상태에서 딥러닝으로 학습하여 문자 인식을 할 수 있도록 한 혼합문자 자동인식을 위한 문자 및 서적 생성 시스템 및 방법 그리고 이를 이용한 검색 시스템 및 방법을 제공하는데 그 목적이 있다.The present invention randomly fills the generated languages and types of characters and noise into pages of various sizes and shapes to create a book with pages close to infinity that are difficult to distinguish from actual books, and the area and shape of each character The purpose is to provide a system and method for generating text and books for automatic recognition of mixed characters, and a search system and method using the same.

본 발명은 하부 구성요소(한자의 부수, 한글의 음운)가 있는 문자일 경우 분류 네트워크(classification network)를 이용하여 하부 구성요소의 포함여부를 학습하는 네트워크를 이용하거나, region proposal network가 있는 faster-RCNN, mask-RCNN 등의 딥러닝 네트워크를 이용하여 하부 구성요소의 위치를 학습하여 정확도를 높일 수 있도록 한 혼합문자 자동인식을 위한 문자 및 서적 생성 시스템 및 방법 그리고 이를 이용한 검색 시스템 및 방법을 제공하는데 그 목적이 있다.The present invention uses a network that learns whether to include sub-components by using a classification network in case of characters with sub-components (number of Chinese characters, phonology of Hangul), or faster- with region proposal network. Provides a system and method for generating text and books for automatic recognition of mixed characters to improve the accuracy by learning the location of sub-components using deep learning networks such as RCNN, mask-RCNN, etc., and provides a search system and method using the same It has a purpose.

본 발명은 하부 구성 요소가 있는 문서일 경우, 각 문자의 데이터 베이스에서 하부(상위) 구성요소의 조합과 구조 관계(위치)로 검색하여 각 문자를 인식하고, 구성요소들과 그 관계를 파악한 후 데이터베이스에서 구성요소를 먼저 검색한 후 그 안에서 구성요소 간의 관계를 재검색하여 빠르게 문자를 인식 및 검색할 수 있도록 한 혼합문자 자동인식을 위한 문자 및 서적 생성 시스템 및 방법 그리고 이를 이용한 검색 시스템 및 방법을 제공하는데 그 목적이 있다.In the present invention, in the case of a document having sub-components, each character is recognized by retrieving the combination (structure) and structure (location) of the sub- (higher) components in the database of each character, and after identifying the components and their relations Provides a system and method for generating texts and books for automatic recognition of mixed characters, and a search system and method using the same, which enables quick recognition and search of characters by re-searching the relations between the components in the database after first searching for the component There is a purpose.

본 발명은 변이 형성 및 가상 페이지 형성 과정에서 흔히 사용되지 않는 벽자 입력 시스템을 구축하여 고문서 인식 및 검색의 정확성 및 효율성을 높일 수 있도록 한 혼합문자 자동인식을 위한 문자 및 서적 생성 시스템 및 방법 그리고 이를 이용한 검색 시스템 및 방법을 제공하는데 그 목적이 있다.The present invention is a system and method for generating text and books for automatic recognition of mixed texts to improve the accuracy and efficiency of high document recognition and search by constructing a wall input system that is not commonly used in the process of forming variations and forming virtual pages, and using the same The aim is to provide a search system and method.

본 발명의 다른 목적들은 이상에서 언급한 목적으로 제한되지 않으며, 언급되지 않은 또 다른 목적들은 아래의 기재로부터 당업자에게 명확하게 이해될 수 있을 것이다.Other objects of the present invention are not limited to those mentioned above, and other objects not mentioned will be clearly understood by those skilled in the art from the following description.

상기와 같은 목적을 달성하기 위한 본 발명에 따른 혼합문자 자동인식을 위한 문자 및 서적 생성 시스템은 개별 문자 또는 편방의 변이를 학습하여 편방의 조합을 실제 문자를 기준으로 학습시켜, 편방의 조합으로 조합 가능한 모든 문자를 생성할 수 있도록 하는 변이 형성부;상기 변이 형성부에서 생성된 여러 언어 및 종류의 문자들과 노이즈를 서로 다른 크기와 형태의 페이지 안에 랜덤하게 채워 넣어 가상 페이지를 갖는 책을 생성하는 가상 페이지 형성부;상기 가상 페이지 형성부에서 생성된 페이지들의 문자 영역을 학습하고 변이 형성부에서 생성된 문자들이 어떤 언어인지 분류 학습을 하고, 하부 구성요소 위치 및 종류를 학습하는 네트워크 학습부;를 포함하는 것을 특징으로 한다.The character and book generation system for automatic recognition of mixed characters according to the present invention for achieving the above object is to learn the variation of individual characters or one side to learn the combination of one side based on the actual character, and to combine the combinations of one side A variation forming unit that allows all possible characters to be generated; randomly filling characters and noise of various languages and types generated in the variation forming unit into pages of different sizes and shapes to generate a book having virtual pages A virtual page forming unit; a network learning unit learning a character area of the pages generated by the virtual page forming unit, classifying and learning what language the characters generated by the variation forming unit are, and learning the location and type of sub-components; It is characterized by including.

여기서, 상기 변이 형성부는, 개별 문자 또는 편방의 변이를 학습하여 편방의 조합을 실제 문자를 기준으로 학습시켜, 편방의 조합으로 조합 가능한 모든 문자를 생성하여 하부 구성요소 변이를 형성하는 하부 구성요소 변이 형성부와, 다른 크기와 형태, 굵기, 기울기, 서체의 가능한 변이들이 포함되도록 테스트 이미지들을 학습해서 글자 변이를 형성하는 글자 변이 형성부를 포함하는 것을 특징으로 한다.Here, the variation forming unit learns the variation of individual characters or one side to learn the combination of one side based on the actual character, and generates all the characters that can be combined with the combination of one side to form the lower component variation to form the lower component variation. Characterized in that it comprises a character variation forming unit that forms a character variation by learning the test images to include the formation unit and possible variations of different sizes and shapes, thickness, slope, and typeface.

그리고 상기 가상 페이지 형성부는, 문자 크기와 옆 문자와의 간격들을 일정 범위 내에서 랜덤하게 변화시켜 실제 문서와 유사한 변이를 생성하는 크기 및 간격 변이 생성부와, 페이지 자체에 실제 문서에서 뽑아낸 노이즈 및 블록(block), 그림(figure), 표(table)를 포함하는 문자가 아닌 영역을 바탕으로 딥러닝 네트워크로 여러 변이의 노이즈, 블록, 그림 및 표를 생성하여 실제 환경과 유사한 문서를 생성하는 영역 기반 문서 생성부를 포함하는 것을 특징으로 한다.And the virtual page forming unit, the size and spacing variation generating unit to randomly change the space between the character size and the next character within a certain range to generate a variation similar to the actual document, and the noise extracted from the actual document on the page itself and An area that generates documents similar to the real environment by generating noise, blocks, pictures, and tables of various variations with a deep learning network based on non-character areas including blocks, figures, and tables. It characterized in that it comprises a base document generator.

그리고 상기 네트워크 학습부는, 상기 가상 페이지 형성부에서 생성된 페이지들의 문자 영역을 학습하여 문자 영역을 검출하는 문자 영역 검출부와,상기 변이 형성부에서 생성된 문자들이 어떤 언어인지 분류 학습을 하는 언어 종류 분류부와,하부 구성요소가 있는 문자일 경우 분류 네트워크(classification network)를 이용하여 하부 구성요소의 위치를 학습하고, 하부 구성요소의 종류 및 포함 여부를 학습하는 하부 구성요소 위치 학습부 및 하부 구성요소 종류 학습부를 포함하는 것을 특징으로 한다.In addition, the network learning unit, a character area detection unit that detects a character area by learning a character area of pages generated by the virtual page formation unit, and a language type classification that learns which language is generated by the variation formation unit In case of characters with sub- and sub-components, the location of the sub-components is studied using a classification network to learn the location of the sub-components, and to learn the types and inclusions of the sub-components. Characterized in that it comprises a kind learning unit.

그리고 하부 구성 요소가 없는 언어이면 생성된 페이지들의 문자 영역을 직접 학습하여 개별 문자로 검출(detection) 및 분할(segmentation)하는 것을 특징으로 한다.In addition, if the language has no sub-elements, it is characterized by directly learning the character areas of the generated pages and detecting and segmenting them into individual characters.

다른 목적을 달성하기 위한 본 발명에 따른 혼합문자 자동인식을 위한 문자 및 서적 생성 방법은 개별 문자 또는 편방의 변이를 학습하여 편방의 조합을 실제 문자를 기준으로 학습시켜, 편방의 조합으로 조합 가능한 모든 문자를 생성하여 하부 구성요소 변이를 형성하는 단계;다른 크기와 형태, 굵기, 기울기, 서체의 가능한 변이들이 포함되도록 테스트 이미지들을 학습해서 글자 변이를 형성하는 단계;변이 형성부에서 생성된 여러 언어 및 종류의 문자들과 노이즈를 서로 다른 크기와 형태의 페이지 안에 랜덤하게 채워 넣어 가상 페이지를 갖는 책을 생성하는 가상 페이지 형성 단계;상기 가상 페이지 형성 단계에서 생성된 페이지들의 문자 영역을 검출하고, 변이 형성에 의해 생성된 문자들이 어떤 언어인지 분류 학습을 하고, 하부 구성요소 위치 및 종류를 학습하는 네트워크 학습 단계;를 포함하는 것을 특징으로 한다.The method for generating characters and books for automatic recognition of mixed characters according to the present invention for achieving other purposes is to learn variations of individual characters or one side and learn combinations of one side based on real characters, so that all combinations of one side can be combined. Forming a character component variation by generating letters and learning the test images to include possible variations of different sizes, shapes, thicknesses, inclinations, and fonts; forming character variations; multiple languages created in the variation formation unit, and A virtual page forming step of randomly filling characters of different types and noise into pages of different sizes and shapes to generate a book having virtual pages; detecting a character area of pages generated in the virtual page forming step, and forming variations Characterized in that it comprises; a network learning step of learning the classification of the characters generated by the language, and learning the location and type of sub-components.

여기서, 상기 가상 페이지 형성 단계는, 문자 크기와 옆 문자와의 간격들을 일정 범위 내에서 랜덤하게 변화시켜 실제 문서와 유사한 변이를 생성하는 크기 및 간격 변이 생성 단계와,페이지 자체에 실제 문서에서 뽑아낸 여러 노이즈 및 블록(block), 그림(figure), 표(table)를 갖는 문자가 아닌 영역을 바탕으로 딥러닝 네트워크로 여러 변이의 노이즈, 블록, 그림 및 표를 생성하여 실제 환경과 유사한 문서를 생성하는 영역 기반 문서 생성 단계를 포함하는 것을 특징으로 한다.Here, the step of forming the virtual page includes a step of generating a size and spacing variation to generate a variation similar to the actual document by randomly changing the character size and the spacing between the characters next to the character, and the page itself extracted from the actual document Create a document similar to the real environment by generating noise, blocks, pictures, and tables of various variations with a deep learning network based on non-character areas with multiple noises, blocks, figures, and tables It characterized in that it comprises a step of generating an area-based document.

그리고 상기 네트워크 학습 단계는, 가상 페이지 형성부에서 생성된 페이지들의 문자 영역을 학습하여 문자 영역을 검출하는 문자 영역 검출 단계와,변이 형성부에서 생성된 문자들이 어떤 언어인지 분류 학습을 하는 언어 종류 분류 단계와,하부 구성요소가 있는 문자일 경우 분류 네트워크(classification network)를 이용하여 하부 구성요소의 위치를 학습하는 단계와,하부 구성요소의 종류 및 포함 여부를 학습하는 단계를 포함하는 것을 특징으로 한다.In the network learning step, a text area detection step of learning a text area of pages generated by the virtual page forming unit to detect a text area, and a language type classification for classifying and learning what language the characters generated in the variation forming unit are. And a step of learning the position of the lower component using a classification network in the case of a character having a lower component, and learning the type and inclusion of the lower component. .

다른 목적을 달성하기 위한 본 발명에 따른 혼합문자 자동인식을 위한 문자 및 서적 생성을 이용한 검색 시스템은 개별 문자 또는 편방의 변이를 학습하여 편방의 조합을 실제 문자를 기준으로 학습시켜, 편방의 조합으로 조합 가능한 모든 문자를 생성할 수 있도록 하는 변이 형성부;상기 변이 형성부에서 생성된 여러 언어 및 종류의 문자들과 노이즈를 서로 다른 크기와 형태의 페이지 안에 랜덤하게 채워 넣어 가상 페이지를 갖는 책을 생성하는 가상 페이지 형성부;상기 가상 페이지 형성부에서 생성된 페이지들의 문자 영역을 학습하고, 문자 영역을 검출하여 변이 형성부에서 생성된 문자들이 어떤 언어인지 분류 학습을 하고, 하부 구성요소 위치 및 종류를 학습하는 네트워크 학습부;상기 네트워크 학습부에서 학습된 결과를 이용하여 생성된 페이지들의 문자 영역을 학습하여 개별 문자를 분할하고, 어떤 언어인지 확인하여 하부 구성 요소의 구성을 인식하거나 그 위치를 파악하여 문자를 인식 및 검색하는 혼합문자 인식 및 검색부;를 포함하는 것을 특징으로 한다.The search system using text and book generation for automatic recognition of mixed characters according to the present invention for achieving other objects learns variations of individual characters or one side to learn combinations of one side based on real characters, and thus to a combination of one side. A variation forming unit that allows all combinable characters to be generated; randomly filling the characters and noise of various languages and types generated in the variation forming unit into pages of different sizes and shapes to create a book with virtual pages The virtual page forming unit; learning the character area of the pages generated by the virtual page forming unit, detecting the character area, and learning the classification of the characters generated by the variation forming unit, and determining the location and type of sub-components. A network learning unit that learns; By learning the character area of the pages generated using the results learned from the network learning unit, divides individual characters, recognizes the language, recognizes the configuration of sub-components, or locates them Characterized in that it comprises; a mixed character recognition and search unit for recognizing and searching for characters.

여기서, 상기 혼합문자 인식 및 검색부는, 생성된 페이지들의 문자 영역을 학습하여 개별 문자로 구분하는 문자 영역 구분부와,문자 영역 구분부에서 구분된 문자 영역을 검출하는 문자 영역 검출부와,네트워크를 이용하여 어떤 언어인지 확인하는 언어 종류 분류부와,하부 구성요소가 있는 언어라면 네트워크를 이용하여 하부 구성 요소의 구성을 인식하거나 그 위치를 파악하는 하부 구성요소 위치 인식부와,하부 구성 요소가 있는 문서일 경우, 각 문자의 데이터 베이스에서 하부 구성요소의 조합과 구조 관계로 검색하여 각 문자를 인식하는 하부 구성요소 종류 인식부와,구성요소들과 그 관계를 파악한 후 데이터베이스에서 구성요소를 먼저 검색한 후 그 안에서 구성요소 간의 관계를 재검색하여 문자를 인식 및 검색할 수 있도록 하는 구성요소 기반 검색부를 포함하는 것을 특징으로 한다.Here, the mixed character recognition and retrieval unit uses a character region detection unit for learning the character regions of the generated pages and classifying them into individual characters, a character region detection unit for detecting the character regions separated by the character region division units, and a network. A language type classification unit to check which language is used, and if there is a language with sub-components, a sub-element location recognition unit that recognizes or recognizes the configuration of sub-components using a network, and a document with sub-components. In one case, the sub-component type recognition unit for recognizing each character by retrieving the combination and structure of sub-components from the database of each character is identified, and the components and their relations are searched first. Then, there is characterized in that it includes a component-based retrieval unit that enables recognizing and retrieving characters by re-searching the relationships between the components therein.

그리고 하부 구성요소가 없는 문자의 경우 하부 구성요소의 포함여부를 학습하는 네트워크 또는 region proposal network가 있는 faster-RCNN, mask-RCNN 들을 포함하는 딥러닝 네트워크의 어느 하나를 이용하여 직접 문자를 인식하는 것을 특징으로 한다.In the case of characters without sub-components, it is recommended to recognize the characters directly using either a network learning whether the sub-components are included or a deep learning network including a faster-RCNN or a mask-RCNN with a region proposal network. It is characterized by.

그리고 하부 구성요소가 합쳐져 하나의 요소로 쓰이는 상위 요소인 경우에는, 문자에서 상위 구성 요소를 먼저 인식하고 구성 요소끼리의 관계를 기준으로 분류를 하는 것을 특징으로 한다.In addition, when the lower component is the upper element used as one element, the upper component is recognized first in the text and classified based on the relationship between the components.

다른 목적을 달성하기 위한 본 발명에 따른 혼합문자 자동인식을 위한 문자 및 서적 생성을 이용한 검색 방법은 변이 형성 단계 및 가상 페이지 형성 단계, 네트워크 학습 단계를 수행하여 혼합 문자 자동인식을 위한 문자 및 서적 생성을 하는 단계;네트워크 학습 단계에서 학습된 결과를 이용하여 생성된 페이지들의 문자 영역을 학습하여 개별 문자를 구분하는 단계;하부 구성요소가 없는 문자의 경우 직접 문자를 인식하고, 하부 구성요소가 있는 언어라면 네트워크를 이용하여 하부 구성 요소의 구성을 인식하거나 그 위치를 파악하는 단계;하부 구성요소가 합쳐져 하나의 요소로 쓰이는 상위 요소인 경우에는 문자에서 상위 구성 요소를 먼저 인식하고 구성 요소끼리의 관계를 기준으로 분류하는 단계;하부 구성 요소가 있는 문서일 경우, 각 문자의 데이터 베이스에서 하부 구성요소의 조합과 구조 관계로 검색하여 각 문자를 인식하여 검색 결과를 출력하는 단계;를 포함하는 것을 특징으로 한다.The search method using the generation of characters and books for automatic recognition of mixed characters according to the present invention for achieving other objects is performed by performing a variation formation step, a virtual page formation step, and a network learning step to generate characters and books for automatic recognition of mixed characters. A step of separating individual characters by learning a character area of pages generated using the results learned in the network learning step; in the case of a character without a sub-component, recognizes the character directly, and the language with the sub-component Recognizing the configuration of the lower component or determining its location using a ramen network; if the lower component is a parent element that is used as one element, the upper component is recognized first in the text and the relationship between the components is established. It characterized in that it comprises; sorting by criteria; in the case of a document with sub-components, retrieving each character by retrieving the combination and structure of sub-components from the database of each character and outputting a search result; do.

이상에서 설명한 바와 같은 본 발명에 따른 혼합문자 자동인식을 위한 문자 및 서적 생성 시스템 및 방법 그리고 이를 이용한 검색 시스템 및 방법은 다음과 같은 효과가 있다.The character and book generation system and method for automatic recognition of mixed characters according to the present invention as described above, and the search system and method using the same have the following effects.

첫째, 개별 문자의 변이 형성 및 가상 페이지 형성을 포함하는 딥러닝으로 혼합문자 인식 및 검색의 효율성을 높일 수 있다.First, it is possible to increase the efficiency of mixed character recognition and search by deep learning including variation of individual characters and virtual page formation.

둘째, 문자를 이루는 각 요소의 구성과 위치를 데이터 베이스로 정리되어 디지털화된 각 개별 문자 혹은 편방의 다양한 변이를 VAE(Variational AutoEncoder)나 GAE(Generative Adversal Network) 등으로 학습하여 새로운 이미지를 생성할 수 있도록 하여 정확성이 뛰어나고 효율적인 혼합 문자 검색 시스템 구축이 가능하다.Second, the composition and position of each element constituting the characters are organized into a database, and various variations of each digitized individual character or one side can be learned with VAE (Variational AutoEncoder) or GAE (Generative Adversal Network) to create a new image. By doing so, it is possible to build a highly accurate and efficient mixed character search system.

셋째, 생성된 여러 언어 및 종류의 문자들과 노이즈 등을 다양한 크기와 형태의 페이지 안에 랜덤하게 채워 넣어 실제 책과 구별하기 힘든 무한대에 가까운 페이지를 가지는 책을 생성하고, 각 문자의 영역, 형태 및 뜻 등 답을 미리 아는 상태에서 딥러닝으로 학습하여 문자 인식을 할 수 있도록 하여 최적의 전문검색서비스 구현이 가능하다.Third, randomly fill the generated languages and types of characters and noise into pages of various sizes and shapes to create a book with pages close to infinity that are difficult to distinguish from the actual books. It is possible to realize the optimal professional search service by learning deep learning and recognizing characters while knowing the answers such as meaning.

넷째, 하부 구성요소(한자의 부수, 한글의 음운)가 있는 문자일 경우 분류 네트워크(classification network)를 이용하여 하부 구성요소의 포함여부를 학습하는 네트워크를 이용하거나, region proposal network가 있는 faster-RCNN, mask-RCNN 들을 포함하는 딥러닝 네트워크의 어느 하나를 이용하여 하부 구성요소의 위치를 학습하여 정확도를 높일 수 있도록 한다.Fourth, in the case of characters with sub-elements (number of Chinese characters, phonology of Hangul), use a network that learns whether to include sub-components using a classification network, or faster-RCNN with a region proposal network. In order to improve accuracy by learning the location of the lower component using any one of deep learning networks including mask-RCNNs.

다섯째, 하부 구성 요소가 있는 문서일 경우, 각 문자의 데이터 베이스에서 하부(상위) 구성요소의 조합과 구조 관계(위치)로 검색하여 각 문자를 인식하고, 구성요소들과 그 관계를 파악한 후 데이터베이스에서 구성요소를 먼저 검색한 후 그 안에서 구성요소 간의 관계를 재검색하여 빠르게 문자를 인식 및 검색할 수 있도록 한다.Fifth, in the case of a document with sub-components, each character is recognized by retrieving the combination (structure) and structure (location) of the sub- (parent) components from the database of each character, and then recognizing the relations with the components and then database To search the component first, and then re-search the relationship between the components in it so that characters can be quickly recognized and searched.

여섯째, 변이 형성 및 가상 페이지 형성 과정에서 흔히 사용되지 않는 벽자 입력 시스템을 구축하여 고문서 인식 및 검색의 정확성 및 효율성을 높일 수 있도록 한다.Sixth, it is possible to increase the accuracy and efficiency of high document recognition and retrieval by constructing a wall entry system that is not commonly used in the process of forming variations and forming virtual pages.

도 1은 본 발명에 따른 혼합문자 자동인식을 위한 문자 및 서적 생성 시스템의 구성도
도 2는 본 발명에 따른 혼합문자 자동인식을 위한 문자 및 서적 생성 방법을 나타낸 플로우 차트
도 3a와 도 3b는 본 발명에 따른 혼합문자 자동인식을 위한 문자 및 서적 생성 기술에 의해 처리되는 문자 데이터의 일 예를 나타낸 구성도
도 4a와 도 4b는 본 발명에 따른 혼합문자 자동인식을 위한 문자 및 서적 생성을 위한 분류 결과 특성 그래프 및 분류 결과 문자 데이터의 일 예를 나타낸 구성도
도 5는 본 발명에 따른 혼합문자 자동인식을 위한 문자 및 서적 생성의 생성된 가상 페이지의 구성도
도 6은 본 발명에 따른 혼합문자 자동인식을 위한 문자 및 서적 생성을 위한 네트워크 학습 툴의 일 예를 나타낸 구성도
도 7a와 도 7b는 본 발명에 따른 혼합문자 자동인식을 위한 문자 및 서적 생성에 따른 검출 결과 구성도
도 8은 본 발명에 따른 동국정운에서의 문자 인식 결과를 나타낸 구성도
도 9는 본 발명에 따른 SSD(Single Shot Detector)의 개념을 나타낸 구성도
도 10a와 도 10b는 본 발명에 따른 벽자 입력 처리를 위한 벽자 데이터 및 변이 구성도
도 11은 본 발명에 따른 혼합문자 자동인식을 위한 문자 및 서적 생성을 이용한 검색 시스템의 구성도
도 12는 본 발명에 따른 혼합문자 자동인식을 위한 문자 및 서적 생성을 이용한 검색 방법을 나타낸 플로우 차트1 is a block diagram of a system for generating characters and books for automatic recognition of mixed characters according to the present invention
Figure 2 is a flow chart showing a method for generating text and books for automatic recognition of mixed characters according to the present invention
3A and 3B are diagrams illustrating an example of character data processed by a character and book generation technique for automatic recognition of mixed characters according to the present invention.
4A and 4B are diagrams illustrating an example of classification result characteristic graph and classification result character data for generating characters and books for automatic recognition of mixed characters according to the present invention.
5 is a configuration diagram of a generated virtual page of text and book generation for automatic recognition of mixed characters according to the present invention
6 is a block diagram showing an example of a network learning tool for generating characters and books for automatic recognition of mixed characters according to the present invention
7A and 7B are configuration diagrams of detection results according to the generation of characters and books for automatic recognition of mixed characters according to the present invention.
8 is a block diagram showing the result of character recognition in Dongguk Jungwoon according to the present invention
9 is a block diagram showing the concept of a single shot detector (SSD) according to the present invention
10A and 10B are wall data and variation configuration diagrams for wall input processing according to the present invention.
11 is a block diagram of a search system using text and book generation for automatic recognition of mixed characters according to the present invention.
12 is a flow chart showing a search method using text and book generation for automatic recognition of mixed characters according to the present invention

이하, 본 발명에 따른 혼합문자 자동인식을 위한 문자 및 서적 생성 시스템 및 방법 그리고 이를 이용한 검색 시스템 및 방법의 바람직한 실시 예에 관하여 상세히 설명하면 다음과 같다.Hereinafter, a description will be given of a preferred embodiment of a system and method for generating text and books for automatic recognition of mixed characters and a search system and method using the same, as follows.

본 발명에 따른 혼합문자 자동인식을 위한 문자 및 서적 생성 시스템 및 방법 그리고 이를 이용한 검색 시스템 및 방법의 특징 및 이점들은 이하에서의 각 실시 예에 대한 상세한 설명을 통해 명백해질 것이다.Characters and advantages of a character and book generation system and method for automatic recognition of mixed characters according to the present invention and a search system and method using the same will become apparent through detailed description of each embodiment below.

도 1은 본 발명에 따른 혼합문자 자동인식을 위한 문자 및 서적 생성 시스템의 구성도이다.1 is a block diagram of a system for generating characters and books for automatic recognition of mixed characters according to the present invention.

본 발명에 따른 혼합문자 자동인식을 위한 문자 및 서적 생성 시스템 및 방법 그리고 이를 이용한 검색 시스템 및 방법은 문자를 이루는 각 요소의 구성과 위치를 데이터 베이스로 정리되어 디지털화된 각 개별 문자 혹은 편방의 다양한 변이를 VAE(Variational AutoEncoder)나 GAE(Generative Adversal Network) 등으로 학습하여 새로운 이미지를 생성할 수 있도록 하여 정확성이 뛰어나고 효율적인 혼합 문자 검색 시스템 구축이 가능하도록 한 것이다.The system and method for generating texts and books for automatic recognition of mixed characters according to the present invention and the search system and method using the same are organized into a database for the composition and location of each element constituting characters, and various variations of each individual character or one side digitized. It is possible to build a highly accurate and efficient mixed character search system by learning a VAE (Variational AutoEncoder) or a GAE (Generative Adversal Network) to create new images.

이를 위하여, 본 발명은 편방의 조합을 실제 문자를 기준으로 학습시켜, 다수의 편방의 조합으로 조합 가능한 모든 문자를 생성할 수 있도록 하고, 다양한 크기와 형태, 굵기, 기울기, 서체 등의 가능한 변이들이 포함되도록 다양한 형태의 테스트 이미지들을 학습해서 생성시키는 구성을 포함할 수 있다.To this end, the present invention learns a combination of squares based on actual characters, so that all combinations of characters can be generated with a combination of multiple squares, and possible variations of various sizes and shapes, thickness, slope, typeface, etc. It may include a configuration to learn and generate various types of test images to be included.

본 발명은 생성된 여러 언어 및 종류의 문자들과 노이즈 등을 다양한 크기와 형태의 페이지 안에 랜덤하게 채워 넣어 실제 책과 구별하기 힘든 무한대에 가까운 페이지를 가지는 책을 생성하고, 각 문자의 영역, 형태 및 뜻 등 답을 미리 아는 상태에서 딥러닝으로 학습하여 문자 인식을 할 수 있도록 하는 구성을 포함할 수 있다.The present invention randomly fills the generated languages and types of characters and noise into pages of various sizes and shapes to create a book with pages close to infinity that are difficult to distinguish from actual books, and the area and shape of each character And learning to perform deep learning while knowing the answer such as the meaning in advance so as to enable character recognition.

본 발명은 하부 구성요소(한자의 부수, 한글의 음운)가 있는 문자일 경우 분류 네트워크(classification network)를 이용하여 하부 구성요소의 포함여부를 학습하는 네트워크를 이용하거나, region proposal network가 있는 faster-RCNN, mask-RCNN 들을 포함하는 딥러닝 네트워크의 어느 하나를 이용하여 하부 구성요소의 위치를 학습하는 구성을 포함할 수 있다.The present invention uses a network that learns whether to include sub-components by using a classification network in case of characters with sub-components (number of Chinese characters, phonology of Hangul), or faster- with region proposal network. It may include a configuration for learning the location of the sub-component using any one of the deep learning network including RCNN, mask-RCNN.

본 발명은 하부 구성 요소가 있는 문서일 경우, 각 문자의 데이터 베이스에서 하부(상위) 구성요소의 조합과 구조 관계(위치)로 검색하여 각 문자를 인식하고, 구성요소들과 그 관계를 파악한 후 데이터베이스에서 구성요소를 먼저 검색한 후 그 안에서 구성요소 간의 관계를 재검색하여 빠르게 문자를 인식 및 검색할 수 있도록 하는 구성을 포함할 수 있다.In the present invention, in the case of a document having sub-components, each character is recognized by retrieving the combination (structure) and structure (location) of the sub- (higher) components in the database of each character, and after identifying the components and their relations It may include a component that searches for a component in the database first, and then re-searches the relationships between the components in it to quickly recognize and search characters.

본 발명은 변이 형성 및 가상 페이지 형성 과정에서 흔히 사용되지 않는 벽자 입력 시스템을 구축하여 고문서 인식 및 검색의 정확성 및 효율성을 높이기 위한 구성을 포함할 수 있다.The present invention may include a configuration for increasing the accuracy and efficiency of high document recognition and retrieval by constructing a wall entry system that is not commonly used in the process of forming variations and forming virtual pages.

본 발명에 따른 혼합문자 자동인식을 위한 문자 및 서적 생성 시스템은 도 1에서와 같이, 개별 문자 또는 편방의 변이를 학습하여 편방의 조합을 실제 문자를 기준으로 학습시켜, 편방의 조합으로 조합 가능한 모든 문자를 생성할 수 있도록 하는 변이 형성부(100)와, 변이 형성부(100)에서 생성된 여러 언어 및 종류의 문자들과 노이즈를 서로 다른 크기와 형태의 페이지 안에 랜덤하게 채워 넣어 가상 페이지를 갖는 책을 생성하는 가상 페이지 형성부(200)와, 상기 가상 페이지 형성부(200)에서 생성된 페이지들의 문자 영역을 학습하고, 문자 영역을 검출하여 변이 형성부(100)에서 생성된 문자들이 어떤 언어인지 분류 학습을 하고, 하부 구성요소 위치 및 종류를 학습하는 네트워크 학습부(300)를 포함한다.The character and book generation system for automatic recognition of mixed characters according to the present invention, as shown in FIG. 1, learns the variation of individual characters or one side to learn the combination of one side based on the actual character, and can combine all of the combinations in one side. Having a virtual page by randomly filling characters and noise of various languages and types generated in the variation forming unit 100 and the variation forming unit 100 to generate characters in pages of different sizes and shapes A virtual page forming unit 200 for generating a book, a character area of pages generated by the virtual page forming unit 200, a character area is detected to detect the characters generated in the variation forming unit 100. And a network learning unit 300 that performs cognitive classification learning and learns the location and type of sub-components.

여기서, 변이 형성부(100)는 개별 문자 또는 편방의 변이를 학습하여 편방의 조합을 실제 문자를 기준으로 학습시켜, 편방의 조합으로 조합 가능한 모든 문자를 생성하여 하부 구성요소 변이를 형성하는 하부 구성요소 변이 형성부(10)와, 다른 크기와 형태, 굵기, 기울기, 서체의 가능한 변이들이 포함되도록 테스트 이미지들을 학습해서 글자 변이를 형성하는 글자 변이 형성부(11)를 포함한다.Here, the variation forming unit 100 learns the variation of individual characters or one side to learn the combination of one side based on the actual character, generates all the characters that can be combined with the combination of one side to form a lower component variation. It includes an element variation forming unit 10 and a character variation forming unit 11 that learns test images so as to include possible variations of different sizes, shapes, thicknesses, inclinations, and fonts to form character variations.

그리고 가상 페이지 형성부(200)는 문자 크기와 옆 문자와의 간격들을 일정 범위 내에서 랜덤하게 변화시켜 실제 문서와 유사한 변이를 생성하는 크기 및 간격 변이 생성부(20)와, 페이지 자체에 실제 문서에서 뽑아낸 여러 노이즈 및 블록(block), 그림(figure), 표(table) 등 문자가 아닌 영역을 바탕으로 딥러닝 네트워크로 여러 변이의 노이즈, 블록, 그림 및 표 등을 생성하여 실제 환경과 유사한 문서를 생성하는 영역 기반 문서 생성부(21)를 포함한다.In addition, the virtual page forming unit 200 randomly changes the character size and the spacing between the next characters within a certain range to generate a variation similar to the actual document, and a variation document generating unit 20 that generates a variation similar to the actual document, and the actual document on the page itself. Based on non-character areas such as noise, blocks, figures, and tables extracted from the deep learning network, it generates noise, blocks, pictures, and tables of various variations, similar to the real environment. And an area-based document generation unit 21 for generating a document.

그리고 네트워크 학습부(300)는 가상 페이지 형성부(200)에서 생성된 페이지들의 문자 영역을 학습하여 문자 영역을 검출하는 문자 영역 검출부(31)와, 변이 형성부(100)에서 생성된 문자들이 어떤 언어인지 분류 학습을 하는 언어 종류 분류부(32)와, 하부 구성요소(한자의 부수, 한글의 음운)가 있는 문자일 경우 분류 네트워크(classification network)를 이용하여 하부 구성요소의 위치를 학습하고, 하부 구성요소의 종류 및 포함 여부를 학습하는 하부 구성요소 위치 학습부(33) 및 하부 구성요소 종류 학습부(34)를 포함한다.In addition, the network learning unit 300 learns the character area of the pages generated by the virtual page forming unit 200 to detect the character area, and the characters generated by the variation forming unit 100 are used. In the case of a character having a language type classification unit 32 for language recognition classification learning and a sub-element (the number of Chinese characters, the phonology of Hangul), the location of the sub-elements is learned by using a classification network, It includes a lower component position learning unit 33 and a lower component type learning unit 34 for learning the type and inclusion of the lower component.

여기서, 네트워크 학습부(300)는 하부 구성 요소가 없는 언어이면 생성된 페이지들의 문자 영역을 직접 학습하여 개별 문자로 검출(detection) 및 분할(segmentation)하는 것이다.Here, the network learning unit 300 directly detects the character area of the generated pages in a language without a sub-component, and detects and segments into individual characters.

본 발명에 따른 혼합문자 자동인식을 위한 문자 및 서적 생성 방법을 구체적으로 설명하면 다음과 같다.The method of generating characters and books for automatic recognition of mixed characters according to the present invention is as follows.

도 2는 본 발명에 따른 혼합문자 자동인식을 위한 문자 및 서적 생성 방법을 나타낸 플로우 차트이다.2 is a flow chart showing a method for generating text and books for automatic recognition of mixed characters according to the present invention.

먼저, 개별 문자 또는 편방의 변이를 학습하여 편방의 조합을 실제 문자를 기준으로 학습시켜, 편방의 조합으로 조합 가능한 모든 문자를 생성하여 하부 구성요소 변이를 형성한다.(S201)First, by learning the variation of individual characters or one side, the combination of the one side is learned based on the actual character, and all the characters that can be combined with the combination of one side are generated to form the lower component variation (S201).

이어, 다른 크기와 형태, 굵기, 기울기, 서체의 가능한 변이들이 포함되도록 테스트 이미지들을 학습해서 글자 변이를 형성한다.(S202)Subsequently, the test images are trained to form the character variations so that possible variations of different sizes and shapes, thickness, slope, and typeface are included (S202).

도 3a와 도 3b는 본 발명에 따른 혼합문자 자동인식을 위한 문자 및 서적 생성 기술에 의해 처리되는 문자 데이터의 일 예를 나타낸 구성도이다.3A and 3B are diagrams illustrating an example of character data processed by a character and book generation technique for automatic recognition of mixed characters according to the present invention.

그리고 변이 형성부(100)에서 생성된 여러 언어 및 종류의 문자들과 노이즈를 서로 다른 크기와 형태의 페이지 안에 랜덤하게 채워 넣어 가상 페이지를 갖는 책을 생성하는 가상 페이지 형성 단계를 수행한다.(S203)Then, the virtual page forming step of generating a book having a virtual page is performed by randomly filling characters and noise of various languages and types generated in the variation forming unit 100 into pages of different sizes and shapes. (S203 )

각 문자의 영역, 형태 및 뜻 등 답을 미리 아는 상태에서 딥러닝으로 학습하여 문자 인식을 하고(S204), 실제 환경과 유사한 문서를 생성한다.(S205)Learning by deep learning while knowing the answer, such as the area, shape, and meaning of each character in advance (S204), generates a document similar to the real environment (S205).

이와 같은 가상 페이지 형성 단계는 문자 크기와 옆 문자와의 간격들을 일정 범위 내에서 랜덤하게 변화시켜 실제 문서와 유사한 변이를 생성하는 크기 및 간격 변이 생성 단계와, 페이지 자체에 실제 문서에서 뽑아낸 여러 노이즈 및 블록(block), 그림(figure), 표(table) 등 문자가 아닌 영역을 바탕으로 딥러닝 네트워크로 여러 변이의 노이즈, 블록, 그림 및 표 등을 생성하여 실제 환경과 유사한 문서를 생성하는 영역 기반 문서 생성 단계를 포함한다.Such a virtual page forming step is a step of generating a size and spacing variation that randomly changes the character size and the spacing between adjacent characters within a certain range to generate a variation similar to a real document, and various noises extracted from the actual document on the page itself. And an area that generates documents similar to the real environment by generating noise, blocks, pictures, and tables of various variations with a deep learning network based on non-character areas such as blocks, figures, and tables. It includes the steps of generating the base document.

도 5는 본 발명에 따른 혼합문자 자동인식을 위한 문자 및 서적 생성의 생성된 가상 페이지의 구성도이고, 도 6은 본 발명에 따른 혼합문자 자동인식을 위한 문자 및 서적 생성을 위한 네트워크 학습 툴의 일 예를 나타낸 구성도이다.5 is a configuration diagram of a generated virtual page of text and book generation for automatic recognition of mixed text according to the present invention, and FIG. 6 is a network learning tool for text and book generation for automatic recognition of mixed text according to the present invention. It is a block diagram showing an example.

그리고 문자 영역 검출 및 생성된 문자들이 어떤 언어인지 분류 학습을 하고, 하부 구성요소 위치 및 종류를 학습하는 네트워크 학습 단계를 수행한다.(S206)Then, the character area detection and classification learning of the generated characters are performed, and a network learning step of learning the location and type of the sub-elements is performed (S206).

도 4a와 도 4b는 본 발명에 따른 혼합문자 자동인식을 위한 문자 및 서적 생성을 위한 분류 결과 특성 그래프 및 분류 결과 문자 데이터의 일 예를 나타낸 구성도이다.4A and 4B are diagrams illustrating an example of classification result characteristic graphs and classification result character data for generating characters and books for automatic recognition of mixed characters according to the present invention.

여기서, 네트워크 학습 단계는 가상 페이지 형성부(200)에서 생성된 페이지들의 문자 영역을 학습하여 문자 영역을 검출하는 문자 영역 검출 단계와, 변이 형성부(100)에서 생성된 문자들이 어떤 언어인지 분류 학습을 하는 언어 종류 분류 단계와, 하부 구성요소(한자의 부수, 한글의 음운)가 있는 문자일 경우 분류 네트워크(classification network)를 이용하여 하부 구성요소의 위치를 학습하는 단계와, 하부 구성요소의 종류 및 포함 여부를 학습하는 단계를 포함한다.Here, the network learning step is a character area detection step of learning a character area by learning a character area of pages generated by the virtual page forming unit 200, and classifying and learning what language the characters generated by the variation forming unit 100 are. Classifying the type of language, and learning the position of the sub-element using a classification network in case of characters with sub-elements (the number of Chinese characters, the phonology of Hangul), and the type of the sub-elements And learning whether to include.

이와 같은 네트워크 학습 단계는 생성된 문자들을 어떤 언어인지 구분하는 분류 네트워크(classification network) 학습을 하는 것이다.Such a network learning step is to perform a classification network classifying the generated characters into which language.

하부 구성요소 (한자의 부수, 한글의 음운)가 있는 문자일 경우 classification network를 이용하여 하부 구성요소의 포함여부를 학습하는 네트워크를 이용 혹은 region proposal network가 있는 faster-RCNN, mask-RCNN 등의 딥러닝 네트워크를 이용하여 하부 구성요소의 위치 학습을 하고, 하부 구성 요소가 없는 언어이면 동일한 방식으로 각 문자를 직접 학습을 한다.In the case of characters with sub-components (number of Chinese characters, phonology of Hangul), use a network to learn whether the sub-components are included by using a classification network, or deeper such as faster-RCNN or mask-RCNN with region proposal network. Learning the location of sub-elements using the learning network, and in the case of a language without sub-elements, learn each character directly in the same way.

하부 구성요소들로 거대한 수의 문자를 생성하여 학습 정확도를 높일 수 있다.Sub-components can generate a large number of characters to improve learning accuracy.

문자 영역을 먼저 학습할 수 있는 region proposal network가 있는 RCNN 등의 딥러닝 네트워크를 이용하여 가상 페이지 형성 단계에서 생성된 페이지들의 문자 영역을 학습하여 개별 문자로 검출(detection) 및 분할(segmentation)하는 네트워크 학습을 한다.A network that learns the character area of pages created in the virtual page formation step by using a deep learning network such as RCNN, which has a region proposal network that can learn the text area first, and detects and segments into individual characters. Learn.

도 7a와 도 7b는 본 발명에 따른 혼합문자 자동인식을 위한 문자 및 서적 생성에 따른 검출 결과 구성도이고, 도 8은 본 발명에 따른 동국정운에서의 문자 인식 결과를 나타낸 구성도이다.7A and 7B are configuration diagrams of detection results according to the generation of characters and books for automatic recognition of mixed characters according to the present invention, and FIG. 8 is a diagram illustrating the results of character recognition in Dongguk Jungwoon according to the present invention.

그리고 도 9는 본 발명에 따른 SSD(Single Shot Detector)의 개념을 나타낸 구성도이다.And Figure 9 is a block diagram showing the concept of a single shot detector (SSD) according to the present invention.

이와 같은 본 발명은 변이 형성 및 가상 페이지 형성 과정에서 흔히 사용되지 않는 벽자 입력 시스템을 구축하여 고문서 인식 및 검색의 정확성 및 효율성을 높이기 위한 구성을 포함할 수 있다.The present invention may include a configuration for increasing the accuracy and efficiency of high document recognition and retrieval by constructing a wall input system that is not commonly used in the process of forming variations and forming virtual pages.

도 10a와 도 10b는 본 발명에 따른 벽자 입력 처리를 위한 벽자 데이터 및 변이 구성도이다.10A and 10B are wall data and variation configuration diagrams for wall input processing according to the present invention.

본 발명에 따른 혼합문자 자동인식을 위한 문자 및 서적 생성을 이용한 검색 시스템의 구성을 구체적으로 설명하면 다음과 같다.The construction of a search system using text and book generation for automatic recognition of mixed characters according to the present invention will be described in detail as follows.

도 11은 본 발명에 따른 혼합문자 자동인식을 위한 문자 및 서적 생성을 이용한 검색 시스템의 구성도이다.11 is a block diagram of a search system using text and book generation for automatic recognition of mixed characters according to the present invention.

본 발명에 따른 혼합문자 자동인식을 위한 문자 및 서적 생성을 이용한 검색 시스템은 도 11에서와 같이, 개별 문자 또는 편방의 변이를 학습하여 편방의 조합을 실제 문자를 기준으로 학습시켜, 편방의 조합으로 조합 가능한 모든 문자를 생성할 수 있도록 하는 변이 형성부(100)와, 변이 형성부(100)에서 생성된 여러 언어 및 종류의 문자들과 노이즈를 서로 다른 크기와 형태의 페이지 안에 랜덤하게 채워 넣어 가상 페이지를 갖는 책을 생성하는 가상 페이지 형성부(200)와, 상기 가상 페이지 형성부(200)에서 생성된 페이지들의 문자 영역을 학습하고, 문자 영역을 검출하여 변이 형성부(100)에서 생성된 문자들이 어떤 언어인지 분류 학습을 하고, 하부 구성요소 위치 및 종류를 학습하는 네트워크 학습부(300)와, 네트워크 학습부(300)에서 학습된 결과를 이용하여 생성된 페이지들의 문자 영역을 학습하여 개별 문자를 구분하고, 어떤 언어인지 확인하여 하부 구성 요소의 구성을 인식하거나 그 위치를 파악하여 문자를 인식 및 검색하는 혼합문자 인식 및 검색부(400)를 포함한다.The search system using text and book generation for automatic recognition of mixed characters according to the present invention, as shown in FIG. 11, learns variations of individual characters or one side to learn combinations of one side based on actual characters, and thus to a combination of one side. It is possible to randomly fill in various languages and types of characters and noise generated by the variation forming unit 100 and the variation forming unit 100 to generate all combinable characters in pages of different sizes and shapes. A virtual page forming unit 200 for generating a book with pages, a character area of pages generated by the virtual page forming unit 200, and a character area detected to detect a character area. The network learning unit 300 for classifying and learning the location and type of sub-components, and learning the character areas of the pages generated using the results learned by the network learning unit 300, thereby learning individual characters. And a mixed character recognition and search unit 400 for recognizing and retrieving characters by recognizing the location of the sub-components by identifying the language, and recognizing the location of the sub-components.

여기서, 혼합문자 인식 및 검색부(400)는 생성된 페이지들의 문자 영역을 학습하여 개별 문자로 구분하는 문자 영역 구분부(41)와, 문자 영역 구분부(41)에서 구분된 문자 영역을 검출하는 문자 영역 검출부(42)와, 네트워크를 이용하여 어떤 언어인지 확인하는 언어 종류 분류부(43)와, 하부 구성요소가 있는 언어라면 네트워크를 이용하여 하부 구성 요소의 구성을 인식하거나 그 위치를 파악하는 하부 구성요소 위치 인식부(44)와, 하부 구성 요소가 있는 문서일 경우, 각 문자의 데이터 베이스에서 하부 구성요소의 조합과 구조 관계로 검색하여 각 문자를 인식하는 하부 구성요소 종류 인식부(45)와, 구성요소들과 그 관계를 파악한 후 데이터베이스에서 구성요소를 먼저 검색한 후 그 안에서 구성요소 간의 관계를 재검색하여 빠르게 문자를 인식 및 검색할 수 있도록 하는 구성요소 기반 검색부(46)를 포함한다.Here, the mixed character recognition and search unit 400 learns the character region of the generated pages and detects the character region divided by the character region division unit 41 and the character region division unit 41. The character area detection unit 42, a language type classification unit 43 for confirming which language is used by using a network, and, if a language having a sub-element, uses a network to recognize the configuration of the sub-elements or determine its location The sub-component position recognition unit 44 and the sub-component type recognition unit 45 for retrieving a combination of sub-components and structure relations in a database of each character to recognize each character in the case of a document having sub-components. ), and the component-based search unit 46 for recognizing and retrieving characters quickly by retrieving the relations between the components after retrieving the components from the database after recognizing the components and their relations. do.

여기서, 하부 구성요소가 없는 문자의 경우 하부 구성요소의 포함여부를 학습하는 네트워크를 이용 혹은 region proposal network가 있는 faster-RCNN, mask-RCNN 등의 딥러닝 네트워크를 이용하여 직접 문자가 인식된다.Here, in the case of a character without a sub-component, the character is directly recognized using a network learning whether the sub-component is included or a deep learning network such as faster-RCNN or mask-RCNN with a region proposal network.

그리고 하부 구성요소가 합쳐져 하나의 요소로 쓰이는 상위 요소인 경우에는 문자에서 상위 구성 요소를 먼저 인식하고 구성 요소끼리의 관계를 구조 관계(4개~20개) 중 하나로 분류를 한다.In the case of a parent element that is used as a single element by combining lower components, the upper component is recognized first in the text and the relationship between the components is classified as one of the structural relations (4-20).

그리고 하부 구성 요소가 있는 문서일 경우, 각 문자의 데이터 베이스에서 하부(상위) 구성요소의 조합과 구조 관계(위치)로 검색하여 각 문자를 인식하고, 이때 image captioning network (deep learning)를 이용하여 구성요소들과 그 관계를 파악한 후 데이터베이스에서 구성요소를 먼저 검색한 후 그 안에서 구성요소 간의 관계를 재검색하여 빠르게 문자를 인식 및 검색할 수 있도록 하는 것이다.And, in the case of a document with sub-components, each character is recognized by searching for the combination of the sub-(high) components and the structure relation (location) in the database of each character, and using image captioning network (deep learning) After grasping the components and their relationships, the database is searched for the components first, and then the relationships between the components are re-searched so that characters can be quickly recognized and searched.

본 발명에 따른 혼합문자 자동인식을 위한 문자 및 서적 생성을 이용한 검색 방법을 구체적으로 설명하면 다음과 같다.The search method using text and book generation for automatic recognition of mixed characters according to the present invention will be described in detail as follows.

도 12는 본 발명에 따른 혼합문자 자동인식을 위한 문자 및 서적 생성을 이용한 검색 방법을 나타낸 플로우 차트이다.12 is a flow chart showing a search method using text and book generation for automatic recognition of mixed characters according to the present invention.

먼저, 변이 형성 단계 및 가상 페이지 형성 단계, 네트워크 학습 단계를 수행하여 혼합 문자 자동인식을 위한 문자 및 서적 생성을 한다.(S1201)First, characters and books are generated for automatic recognition of mixed characters by performing a variation formation step, a virtual page formation step, and a network learning step. (S1201)

이어, 네트워크 학습 단계에서 학습된 결과를 이용하여 생성된 페이지들의 문자 영역을 학습하여 개별 문자를 구분한다.(S1202)Subsequently, the character area of the pages generated using the result learned in the network learning step is learned to distinguish individual characters. (S1202)

그리고 하부 구성요소가 없는 문자의 경우 하부 구성요소의 포함여부를 학습하는 네트워크를 이용 혹은 region proposal network가 있는 faster-RCNN, mask-RCNN 등의 딥러닝 네트워크를 이용하여 직접 문자를 인식한다.(S1203)In the case of characters without sub-components, the characters are directly recognized using a network learning whether the sub-components are included or using a deep learning network such as faster-RCNN or mask-RCNN with a region proposal network (S1203). )

이어, 하부 구성요소가 있는 언어라면 네트워크를 이용하여 하부 구성 요소의 구성을 인식하거나 그 위치를 파악한다.(S1204)Subsequently, if the language has the sub-components, the configuration of the sub-components is recognized by using a network or its location is determined (S1204).

그리고 하부 구성요소가 합쳐져 하나의 요소로 쓰이는 상위 요소인 경우에는 문자에서 상위 구성 요소를 먼저 인식하고 구성 요소끼리의 관계를 구조 관계(4개~20개) 중 하나로 분류를 한다.(S1205)In the case of a higher element used as a single element by combining lower components, the upper component is recognized first in the text and the relationship between the components is classified as one of the structural relations (4-20) (S1205).

이어, 하부 구성 요소가 있는 문서일 경우, 각 문자의 데이터 베이스에서 하부(상위) 구성요소의 조합과 구조 관계(위치)로 검색하여 각 문자를 인식하여 검색 결과를 출력한다.(S1206)Subsequently, in the case of a document with sub-components, the combination of the sub-(high) components and the structure relation (location) of each character database is searched to recognize each character and output the search result. (S1206)

이때 image captioning network(deep learning)를 이용하여 구성요소들과 그 관계를 파악한 후 데이터베이스에서 구성요소를 먼저 검색한 후 그 안에서 구성요소 간의 관계를 재검색하여 빠르게 문자를 인식 및 검색할 수 있도록 한다.At this time, using the image captioning network (deep learning), the components and their relationships are identified, and then the components are first searched in the database and then the relationships between the components are re-discovered to quickly recognize and search characters.

이상에서 설명한 본 발명에 따른 혼합문자 자동인식을 위한 문자 및 서적 생성 시스템 및 방법 그리고 이를 이용한 검색 시스템 및 방법은 문자를 이루는 각 요소의 구성과 위치를 데이터 베이스로 정리되어 디지털화된 각 개별 문자 혹은 편방의 다양한 변이를 VAE(Variational AutoEncoder)나 GAE(Generative Adversal Network) 등으로 학습하여 새로운 이미지를 생성할 수 있도록 하여 정확성이 뛰어나고 효율적인 혼합 문자 검색 시스템 구축이 가능하도록 한 것이다.The character and book generation system and method for automatic recognition of mixed characters according to the present invention described above, and the search system and method using the same are organized into a database for each element constituting characters and digitized for each individual character or edition. It is possible to build a highly accurate and efficient mixed character search system by learning various variations of the room with VAE (Variational AutoEncoder) or GAE (Generative Adversal Network) to create new images.

이상에서의 설명에서와 같이 본 발명의 본질적인 특성에서 벗어나지 않는 범위에서 변형된 형태로 본 발명이 구현되어 있음을 이해할 수 있을 것이다.It will be understood that the present invention is implemented in a modified form without departing from the essential characteristics of the present invention as described above.

그러므로 명시된 실시 예들은 한정적인 관점이 아니라 설명적인 관점에서 고려되어야 하고, 본 발명의 범위는 전술한 설명이 아니라 특허청구 범위에 나타나 있으며, 그와 동등한 범위 내에 있는 모든 차이점은 본 발명에 포함된 것으로 해석되어야 할 것이다.Therefore, the specified embodiments should be considered in terms of explanation rather than limitation, and the scope of the present invention is shown in the claims rather than the foregoing description, and all differences within the equivalent range are included in the present invention. Should be interpreted.

100. 변이 형성부
200. 가상 페이지 형성부
300. 네트워크 학습부
400. 혼합문자 인식 및 검색부100. mutation formation
200. Virtual page forming unit
300. Network Learning Department
400. Mixed character recognition and search unit

Claims

A variation forming unit configured to learn individual characters or variations of one side and learn combinations of one side based on actual characters to generate all characters that can be combined with a combination of sides;
A virtual page forming unit that randomly fills various languages and types of characters and noise generated in the variation forming unit into pages of different sizes and shapes to generate a book having virtual pages;
And a network learning unit that learns a character region of pages generated by the virtual page forming unit, classifies and learns what language the characters generated by the variation forming unit are, and learns the location and type of sub-components. Text and book generation system for automatic recognition of mixed characters.

The method of claim 1, wherein the mutation forming unit,
A lower component variation forming unit that learns individual characters or variations of one side to learn combinations of one side based on actual characters to generate all characters that can be combined with a combination of one side to form lower component variations;
Character and book generation system for automatic recognition of mixed characters, characterized in that it comprises a character variation forming unit to form a character variation by learning test images to include possible variations of different sizes and shapes, thickness, slope, and typeface.

According to claim 1, The virtual page forming unit,
A size and spacing variation generation unit that randomly changes the spacing between the character size and the next character within a certain range to generate a variation similar to a real document;
Deep learning network generates noise, blocks, pictures, and tables of various variations based on non-character areas including noise, blocks, figures, and tables extracted from actual documents on the page itself Characters and books generation system for automatic recognition of mixed characters, characterized in that it comprises an area-based document generation unit for generating a document similar to the real environment.

According to claim 1, The network learning unit,
A text area detection unit for learning a text area of pages generated by the virtual page forming unit to detect a text area;
A language type classification unit that classifies and learns what language the characters generated in the mutation forming unit are,
In the case of characters with sub-components, the sub-component location learning unit and the sub-component type learning unit for learning the location of the sub-components using a classification network and learning the types and inclusions of the sub-components. Character and book generation system for automatic recognition of mixed characters, characterized in that it comprises.

[5] The method of claim 4, wherein in the case of a language without sub-components, characters and books are generated for automatic recognition of mixed characters, characterized in that the character areas of the generated pages are directly learned and detected and segmented as individual characters. system.

Learning individual characters or variations of one side to learn a combination of sides based on actual characters, thereby generating all characters that can be combined with a combination of sides to form lower component variations;
Forming text variations by learning test images to include possible variations of different sizes and shapes, thickness, slope, and typeface;
A virtual page forming step of randomly filling characters and noise of various languages and types generated in the variation forming unit into pages of different sizes and shapes to generate a book having virtual pages;
And a network learning step of detecting a character area of pages generated in the virtual page forming step, classifying and learning what language the characters generated by variation formation are, and learning the location and type of sub-components. How to generate texts and books for automatic recognition of mixed texts.

The method of claim 6, wherein the virtual page forming step,
A size and spacing variation generation step of randomly changing the size of the text and the spacing between the adjacent characters within a predetermined range to generate a variation similar to a real document;
Generates noise, blocks, pictures and tables of multiple variations with a deep learning network based on areas of non-characters with various noises, blocks, figures, and tables extracted from the actual document on the page itself Characterized in that it comprises a region-based document generation step for generating a document similar to the real environment Character and book generation method for automatic recognition of mixed characters.

The method of claim 6, wherein the network learning step,
A text area detection step of learning a text area of pages generated by the virtual page forming unit to detect the text area;
A language type classification step of classifying and learning what language the characters generated in the mutation forming unit are,
Learning the location of the sub-element using a classification network if the character has the sub-element;
And learning the type and inclusion of sub-components.

A variation forming unit that learns individual characters or variations of one side to learn combinations of one side based on actual characters, thereby generating all characters that can be combined with a combination of sides;
A virtual page forming unit for randomly filling characters and noise of various languages and types generated in the variation forming unit into pages of different sizes and shapes to generate a book having virtual pages;
A network learning unit that learns a character area of pages generated by the virtual page forming unit, detects a character area, classifies and learns what language the characters generated in the variation forming unit are, and learns the location and type of sub-components;
By learning the character area of the pages generated by using the results learned from the network learning unit, the individual characters are classified, and in which language, the composition of the lower component is recognized or the location is recognized to recognize and search the characters. A search system using text and book generation for automatic recognition of mixed characters, comprising: a mixed character recognition and search unit.

The method of claim 9, wherein the mixed character recognition and search unit,
A character area divider for learning the character areas of the generated pages and classifying them into individual characters,
A character area detection unit that detects a character area separated by the character area division unit;
Language type classification unit to check the language using the network,
If the language with the sub-components, the sub-component location recognition unit for recognizing the location of the sub-elements using a network or determining its location,
In the case of a document having sub-components, a sub-component type recognition unit for recognizing each character by searching in a combination and structure relation of sub-components in the database of each character;
A character-based search unit that includes a component-based search unit that recognizes and searches characters by retrieving the relations between the components after retrieving the components from the database after recognizing the components and their relationships. Search system using text and book generation for recognition.

10. The method according to claim 9, in the case of a character without a sub-component, directly using a network learning whether the sub-component is included or a deep learning network including a faster-RCNN with a region proposal network or a deep learning network with mask-RCNNs. A search system using text and book generation for automatic recognition of mixed characters, which recognizes characters.

10. The method of claim 9, If the lower component is combined to be used as one element, the upper element,
A search system using text and book generation for automatic recognition of mixed characters, characterized by first recognizing the upper components from the characters and classifying them based on the relationship between the components.

Generating characters and books for automatic recognition of mixed characters by performing a variation forming step, a virtual page forming step, and a network learning step;
Classifying individual characters by learning a character region of pages generated using the results learned in the network learning stage;
In the case of a character without a sub-component, directly recognizing a character, and in a language having a sub-component, recognizing the configuration of the sub-component or using a network to recognize a location thereof;
If the lower component is the upper element that is used as one element by combining, recognizing the upper component in the text first and classifying it based on the relationship between the components;
In the case of a document having sub-components, searching for a combination and structure relation of sub-components in the database of each character, recognizing each character, and outputting a search result. Search method using text and book generation.

The method of claim 13, wherein the virtual page forming step,
A size and spacing variation generation step of randomly changing the size of the text and the spacing between the adjacent characters within a predetermined range to generate a variation similar to a real document;
Generates noise, blocks, pictures and tables of multiple variations with a deep learning network based on areas of non-characters with various noises, blocks, figures, and tables extracted from the actual document on the page itself A search method using text and book generation for automatic recognition of mixed characters, characterized by including an area-based document generation step of generating a document similar to a real environment.

The method of claim 13, wherein the network learning step,
A text area detection step of learning a text area of pages generated by the virtual page forming unit to detect the text area;
A language type classification step of classifying and learning what language the characters generated in the mutation forming unit are,
Learning the location of the sub-element using a classification network if the character has the sub-element;
And learning the type and inclusion of sub-components. A search method using text and book generation for automatic recognition of mixed characters.