KR102516990B1 - Device and method for processing text indexing through multi-image segmentation

Info

Publication number: KR102516990B1
Application number: KR1020220030374A
Authority: KR
Inventors: 김지성; 채종은
Original assignee: 주식회사 아키브소프트
Priority date: 2021-06-14
Filing date: 2022-03-11
Publication date: 2023-04-03
Also published as: KR102570866B1; KR20230033694A; KR20220167741A; KR102374797B1

Abstract

The present invention relates to a multi-image text processing apparatus and, more particularly, to an apparatus and method for text index processing through multi-image segmentation, capable of extracting and processing text from a multi-image in which a plurality of scanned documents are merged into a single file.
According to the present invention, page information is generated for each single image divided from a multi-image, and the generated page information is added to the index value so that the location and information of the extracted text are page-indexed for each single image. The location and information of text extracted from the multi-image can thus be provided in units of pages, giving the user accurate search results.

Description

Apparatus and method for processing text indexing through multi-image segmentation

The present invention relates to a multi-image text processing apparatus and, more particularly, to an apparatus and method for text index processing through multi-image segmentation, capable of extracting and processing text from a multi-image in which a plurality of scanned documents are merged into a single file.

In general, with the development and spread of the Internet, a variety of Internet-based services are provided, a representative example of which is the search service.

Such a search service is one in which, when a user enters a word or a combination of words as a query, a search engine returns search results corresponding to that query.

To show users the content they are looking for, a search service provider collects various electronic documents, extracts the document text from them, indexes the text, and stores the index separately so that results for a user's query can be returned quickly. Documents that exist only as images, however, could not be extracted and searched in the same way as electronic documents.

To address this inconvenience, text in an image can be extracted and indexed using an OCR function. However, position values for the extracted text can be generated only for single images, that is, documents scanned one sheet at a time, so the search function could be provided only for single images.

However, when text is extracted from a multi-image, that is, a single file created by scanning a number of documents and merging them, the images have no page divisions, and the location of the text cannot be provided when the extracted text is indexed. Search efficiency is reduced as a result, and a search function for multi-images could not be provided.

For example, when text extracted from a 1000-page multi-image is indexed, it is difficult to tell where in the image a given piece of text is located, which reduces the efficiency of the search engine.

Accordingly, there is a need to develop a multi-image text processing apparatus that can give users accurate search results by providing the location and information of text extracted from a multi-image in units of pages.

Korean Patent Laid-Open Publication No. 10-2019-0123790 (2019. 11. 01)

An object of the present invention, devised to solve the problems described above, is to provide a multi-image text processing apparatus and method that generate page information for each single image divided from a multi-image, add the generated page information to the index value, and page-index the location and information of the extracted text for each single image, thereby providing the location and information of text extracted from the multi-image in units of pages and giving the user accurate search results.

The problems to be solved by the present invention are not limited to those mentioned above, and other problems not mentioned will be clearly understood by those skilled in the art from the description below.

To solve the above problems, an apparatus for text index processing through multi-image segmentation according to an embodiment of the present invention includes: a conversion unit that converts a multi-image into individual single images; an extraction unit that extracts text from each single image; a storage unit that stores the extracted text; and a control unit that generates page information for each single image, adds the generated page information to an index value to page-index the location and information of the extracted text for each single image, and stores the page-indexed location and information of the text in the storage unit. When an acquired image is a multi-image in which a plurality of scanned documents have been merged and rescanned into a single file, the control unit converts the multi-image into the single images through the conversion unit and, when processing the page index for each single image, indexes the location and information of the extracted text as duplicate pages if the same location and information are present in a plurality of the single images.

Here, the conversion unit may set division regions in the multi-image, divide the multi-image according to the set division regions, and convert each division region of the multi-image into a single image.

When setting the division regions in the multi-image, the conversion unit may set them according to a preset setting value. Alternatively, the conversion unit may set them according to a user input value.

The extraction unit may include an optical recognition module that performs an OCR (Optical Character Recognition) function and, when extracting text from each converted single image, may extract at least one of the letters, symbols, and numbers contained in that single image. In this case, the control unit may generate page information for each single image once the text of each single image has been extracted. The control unit may also control the extraction unit so that the extraction unit generates page information for each single image while extracting its text.

In addition, when processing the page index for each single image, the control unit may identify the plurality of single images in which the same location and information of the extracted text are present, select one of the identified single images based on a preset priority, and page-index the location and information of the extracted text using the page information of the selected single image.

Meanwhile, a method for text index processing through multi-image segmentation according to an embodiment of the present invention includes: converting a multi-image into individual single images; extracting text from each single image; generating page information for each single image; adding the generated page information to an index value to page-index the location and information of the extracted text for each single image; and storing the page-indexed location and information of the text. In the page indexing step, when an acquired image is a multi-image in which a plurality of scanned documents have been merged and rescanned into a single file, the multi-image is converted into the single images, and, when the page index is processed for each single image, the location and information of the extracted text are indexed as duplicate pages if the same location and information are present in a plurality of the single images.

In addition, a computer program that provides the method for text index processing through multi-image segmentation according to an embodiment of the present invention is combined with a computer, which is hardware, and stored in a computer-readable recording medium in order to perform any one of the methods described above.

In addition, other methods and other systems for implementing the present invention, and a computer-readable recording medium recording a computer program for executing the method, may be further provided.

As described above, according to the present invention, page information is generated for each single image divided from a multi-image, and the generated page information is added to the index value so that the location and information of the extracted text are page-indexed for each single image. The location and information of text extracted from the multi-image can thus be provided in units of pages, giving the user accurate search results.

The effects of the present invention are not limited to those mentioned above, and other effects not mentioned will be clearly understood by those skilled in the art from the description below.

FIG. 1 is a block diagram illustrating a multi-image text processing apparatus according to an embodiment of the present invention.
FIG. 2 is a diagram illustrating a multi-image.
FIG. 3 is a diagram illustrating a single image.
FIG. 4 is a flowchart illustrating a multi-image text processing method of a multi-image text processing apparatus according to an embodiment of the present invention.

The advantages and features of the present invention, and the methods of achieving them, will become clear from the embodiments described in detail below together with the accompanying drawings. The present invention is not, however, limited to the embodiments disclosed below and may be implemented in various different forms. These embodiments are provided only to make the disclosure of the present invention complete and to fully convey the scope of the invention to those skilled in the art to which the invention belongs; the invention is defined only by the scope of the claims.

The terminology used herein is for describing the embodiments and is not intended to limit the present invention. In this specification, singular forms also include plural forms unless the context specifically states otherwise. As used herein, "comprises" and/or "comprising" does not exclude the presence or addition of one or more elements other than those recited. Like reference numerals refer to like elements throughout the specification, and "and/or" includes each of the recited elements and every combination of one or more of them. Although "first", "second", and the like are used to describe various elements, these elements are not limited by these terms. The terms are used only to distinguish one element from another. Accordingly, a first element mentioned below may also be a second element within the technical spirit of the present invention.

Unless otherwise defined, all terms (including technical and scientific terms) used in this specification have the meanings commonly understood by those skilled in the art to which the present invention belongs. Terms defined in commonly used dictionaries are not to be interpreted ideally or excessively unless they are explicitly and specifically defined.

Hereinafter, embodiments of the present invention will be described in detail with reference to the accompanying drawings.

Before the description, the meaning of the terms used in this specification is briefly explained. Note, however, that the explanation of terms is intended to aid understanding of this specification and does not limit the technical spirit of the present invention unless it is explicitly stated as a limitation.

FIG. 1 is a block diagram illustrating a multi-image text processing apparatus according to an embodiment of the present invention, FIG. 2 is a diagram illustrating a multi-image, and FIG. 3 is a diagram illustrating a single image.

As shown in FIG. 1, the multi-image text processing apparatus of the present invention may include an image acquisition unit 110 that acquires a multi-image, an image conversion unit 120 that converts the acquired multi-image into a plurality of single images, a text extraction unit 130 that extracts text from the converted single images, a text storage unit 140 that stores the extracted text, and a control unit 150 that controls the image conversion unit 120, the text extraction unit 130, and the text storage unit 140.

Here, the control unit 150 may generate page information for each single image, add the generated page information to the index value to page-index the location and information of the extracted text for each single image, and control the text storage unit 140 to store the page-indexed location and information of the text.

In the present invention, as shown in FIG. 2, a multi-image may refer to an image file obtained by scanning a number of documents, merging the scanned documents, and rescanning them into a single file.

For example, a multi-image may include a plurality of image files in formats such as jpg, tif, and png.

Also, as shown in FIG. 3, a single image may be one image file divided from a multi-image.

The image conversion unit 120 may set division regions in the multi-image, divide the multi-image according to the set division regions, and convert each division region of the multi-image into one single image.

For example, when setting the division regions in the multi-image, the image conversion unit 120 may set them according to a preset setting value.

Here, the preset setting value may include the size and number of the division regions, but this is only an example and the invention is not limited thereto.

As another example, when setting the division regions in the multi-image, the image conversion unit 120 may set them according to a user input value.

Here, the user input value may include the size and number of the division regions, but this is only an example and the invention is not limited thereto.
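
As an illustration only, setting division regions from a region count could be sketched as follows. The function name `split_regions`, the bounding-box representation, and the assumption that the pages are stacked vertically in the multi-image are all hypothetical; the specification does not state how the regions are laid out.

```python
from typing import List, Tuple

# Hypothetical box layout: (left, top, right, bottom) in pixels.
Box = Tuple[int, int, int, int]


def split_regions(width: int, height: int, count: int) -> List[Box]:
    """Divide a width x height multi-image into `count` horizontal bands.

    `count` stands in for the preset setting value or the user input value.
    Integer arithmetic keeps the bands contiguous with no gaps or overlap.
    """
    if count <= 0:
        raise ValueError("count must be positive")
    boxes = []
    for i in range(count):
        top = i * height // count
        bottom = (i + 1) * height // count
        boxes.append((0, top, width, bottom))
    return boxes


# Three equal bands for a 100 x 300 multi-image.
regions = split_regions(100, 300, 3)
```

Each returned box could then be cropped out and saved as one single image by whatever imaging library is in use.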

Next, the text extraction unit 130 may include an optical recognition module that performs an OCR (Optical Character Recognition) function, but this is only an example and the invention is not limited thereto.

When extracting text from a converted single image, the text extraction unit 130 may extract at least one of the letters, symbols, and numbers contained in the single image.

Next, the control unit 150 may generate page information for each single image, add the generated page information to the index value to page-index the location and information of the extracted text for each single image, and control the text storage unit 140 to store the page-indexed location and information of the text.
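
A minimal sketch of adding page information to the index value is given below. It assumes that the text of each single image is already available as a string, that page numbers start at 1 in the order of the single images, and that the "location" is the token position within the page; none of these details are fixed by the specification, and `build_page_index` is a hypothetical name.

```python
from collections import defaultdict
from typing import Dict, List, Sequence, Tuple

# Hypothetical index value: (page number, token position within that page).
Posting = Tuple[int, int]


def build_page_index(page_texts: Sequence[str]) -> Dict[str, List[Posting]]:
    """Build an inverted index whose entries carry the page number."""
    index: Dict[str, List[Posting]] = defaultdict(list)
    for page_no, text in enumerate(page_texts, start=1):
        for position, token in enumerate(text.split()):
            index[token].append((page_no, position))
    return dict(index)


index = build_page_index(["alpha beta", "beta gamma"])
# index["beta"] lists one posting per page on which the token occurs.
```

Because every posting carries its page number, a query for a token can report which single image, and where on it, the text was found.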

Here, when the image acquisition unit 110 acquires a multi-image, the control unit 150 may control the image conversion unit 120 to convert the acquired multi-image into a plurality of single images.

In some cases, when the image acquisition unit 110 acquires an image, the control unit 150 may check whether the acquired image is a multi-image. If it is a multi-image, the control unit 150 may control the image conversion unit 120 to convert it into a plurality of single images; if it is not, the control unit 150 may control the text extraction unit 130 to extract text from the image directly.
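
The branch described above can be sketched as follows. The dictionary representation of an image, the frame-count test in `is_multi_image`, and the `ocr` callable are assumptions made for illustration; the specification does not say how a multi-image is detected.

```python
from typing import Callable, Dict, List

# Hypothetical image representation: {"frames": [frame, frame, ...]}.
Image = Dict[str, list]


def is_multi_image(image: Image) -> bool:
    """Assumed test: more than one frame means a multi-image."""
    return len(image["frames"]) > 1


def split_frames(image: Image) -> List[Image]:
    """Stand-in for the image conversion unit: one single image per frame."""
    return [{"frames": [frame]} for frame in image["frames"]]


def extract_texts(image: Image, ocr: Callable[[Image], str]) -> List[str]:
    """Convert only when needed, then extract text from each single image."""
    singles = split_frames(image) if is_multi_image(image) else [image]
    return [ocr(single) for single in singles]


# A fake OCR function used only to exercise the control flow.
def fake_ocr(single: Image) -> str:
    return str(single["frames"][0]).upper()


texts = extract_texts({"frames": ["a", "b"]}, fake_ocr)
```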

In addition, when the multi-image has been converted into a plurality of single images by the image conversion unit 120, the control unit 150 may control the text extraction unit 130 to extract text from the converted single images.

When the text of a single image has been extracted, the control unit 150 may generate page information for that single image.

Here, the control unit 150 may generate the page information for each single image once the text of all single images has been extracted.

In some cases, the control unit 150 may generate the page information for the corresponding single image each time the text of a single image is extracted.

In other cases, the control unit 150 may control the text extraction unit 130 so that it generates the page information for a single image while extracting the text of that single image.

Thus, in the present invention, the control unit 150 may control the generation of page information for each single image following text extraction by the text extraction unit 130, or the text extraction unit 130 itself may generate the page information for each single image while performing text extraction.

Accordingly, various design variations are possible in how the page information for each single image is generated.

Next, when page-indexing for each single image, the control unit 150 may index the location and information of the extracted text as duplicate pages if that location and information are duplicated across a plurality of single images.

As another embodiment, when page-indexing for each single image, if the location and information of the extracted text are duplicated across a plurality of single images, the control unit 150 may select one of the duplicated single images and page-index the location and information of the extracted text using the page information of the selected single image.

Here, when selecting one of the duplicated single images, the control unit 150 may make the selection according to a preset priority.
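
The two ways of handling duplicates described above can be sketched together. The key format, the `duplicate` flag, and the use of a `choose` callable to express the preset priority are hypothetical; the specification does not define what the priority is, so the lowest page number is used here purely as an example.

```python
from typing import Callable, Dict, List, Optional, Tuple

# Hypothetical key: (extracted text, position within the page).
Key = Tuple[str, int]


def index_with_duplicates(
    hits: Dict[Key, List[int]],
    choose: Optional[Callable[[List[int]], int]] = None,
) -> Dict[Key, dict]:
    """Resolve text that appears at the same position on several pages.

    Without `choose`, every page is kept and the entry is flagged as a
    duplicate. With `choose`, a single page is selected by the given rule.
    """
    result: Dict[Key, dict] = {}
    for key, pages in hits.items():
        if len(pages) > 1 and choose is not None:
            result[key] = {"pages": [choose(pages)], "duplicate": False}
        else:
            result[key] = {"pages": sorted(pages), "duplicate": len(pages) > 1}
    return result


hits = {("total", 0): [3, 1], ("name", 2): [2]}
as_duplicates = index_with_duplicates(hits)            # keep all pages
by_priority = index_with_duplicates(hits, choose=min)  # example priority rule
```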

Thus, when the location and information of the extracted text are duplicated across a plurality of single images, the present invention can use various methods to keep indexing fast.

In addition, by selecting one of the duplicated single images according to priority and page-indexing the location and information of the extracted text using the page information of the selected single image, the present invention can also provide the user with search results quickly.

Next, when storing the page-indexed location and information of the text, the control unit 150 may group the location and information of the text extracted from the single images by multi-image and store each group together, but this is only an example and the invention is not limited thereto.
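
One possible way to group stored entries by multi-image is sketched below, using an in-memory dictionary as a stand-in for the text storage unit. The entry layout, the file names, and the helper names `store_grouped` and `entries_for` are assumptions for illustration.

```python
from typing import Dict, List, Tuple

# Hypothetical entry: (page number, position within the page, text).
Entry = Tuple[int, int, str]


def store_grouped(
    store: Dict[str, List[Entry]], multi_image_id: str, entries: List[Entry]
) -> None:
    """Append entries to the group that belongs to one multi-image."""
    store.setdefault(multi_image_id, []).extend(entries)


def entries_for(
    store: Dict[str, List[Entry]], multi_image_id: str, page: int
) -> List[Entry]:
    """Return the stored entries for one page of one multi-image."""
    return [e for e in store.get(multi_image_id, []) if e[0] == page]


store: Dict[str, List[Entry]] = {}
store_grouped(store, "a.tif", [(1, 0, "x"), (2, 0, "y")])
store_grouped(store, "b.tif", [(1, 0, "z")])
page_two = entries_for(store, "a.tif", 2)
```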

In this way, the present invention generates page information for each single image divided from a multi-image, adds the generated page information to the index value, and page-indexes the location and information of the extracted text for each single image, so that the location and information of text extracted from the multi-image can be provided in units of pages and the user can be given accurate search results.

FIG. 4 is a flowchart illustrating a multi-image text processing method of a multi-image text processing apparatus according to an embodiment of the present invention.

As shown in FIG. 4, the multi-image text processing apparatus of the present invention may first acquire a multi-image and convert the acquired multi-image into a plurality of single images (S10).

Here, when converting the multi-image into a plurality of single images, the present invention may set division regions in the multi-image, divide the multi-image according to the set division regions, and convert each division region of the multi-image into one single image.

When setting the division regions in the multi-image, the present invention may set them according to a preset setting value or, in some cases, according to a user input value.

The present invention may then extract text from the converted single images (S20).

Next, the present invention may generate page information for each single image (S30).

Here, the present invention may generate the page information for each single image once the text of all single images has been extracted or, in some cases, may generate the page information for the corresponding single image each time the text of a single image is extracted.

Subsequently, the present invention may add the generated page information to the index value and page-index the location and information of the extracted text for each single image (S40).

Here, when page-indexing for each single image, the present invention may index the location and information of the extracted text as duplicate pages if that location and information are duplicated across a plurality of single images.

In some cases, when page-indexing for each single image, if the location and information of the extracted text are duplicated across a plurality of single images, the present invention may select one of the duplicated single images and page-index the location and information of the extracted text using the page information of the selected single image.

Here, when selecting one of the duplicated single images, the present invention may make the selection according to a preset priority.

The present invention may then store the page-indexed location and information of the text (S50).

Here, when storing the page-indexed location and information of the text, the present invention may group the location and information of the text extracted from the single images by multi-image and store each group together, but this is only an example and the invention is not limited thereto.
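
Steps S10 to S50 can be tied together in a short sketch. It assumes the multi-image is already available as a sequence of frames, that `ocr` is a caller-supplied function returning the text of one frame, and that a plain dictionary stands in for storage; these, and the name `run_pipeline`, are illustrative assumptions rather than the method as claimed.

```python
from collections import defaultdict
from typing import Callable, Dict, List, Sequence, Tuple

Posting = Tuple[int, int]  # hypothetical: (page number, token position)


def run_pipeline(
    frames: Sequence[str],
    ocr: Callable[[str], str],
    store: Dict[str, Dict[str, List[Posting]]],
    file_id: str,
) -> None:
    singles = list(frames)                       # S10: multi-image -> single images
    texts = [ocr(single) for single in singles]  # S20: extract text per single image
    pages = range(1, len(singles) + 1)           # S30: page information per single image
    index: Dict[str, List[Posting]] = defaultdict(list)
    for page, text in zip(pages, texts):         # S40: page-index the extracted text
        for position, token in enumerate(text.split()):
            index[token].append((page, position))
    store[file_id] = dict(index)                 # S50: store, grouped by multi-image


store: Dict[str, Dict[str, List[Posting]]] = {}
# The identity function stands in for OCR so the sketch runs without an OCR engine.
run_pipeline(["a b", "b c"], lambda frame: frame, store, "scan-001")
```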

The method according to an embodiment of the present invention described above may be implemented as a program (or application) to be executed in combination with a server, which is hardware, and may be stored in a medium.

So that the computer can read the program and execute the methods implemented as the program, the program may include code written in a computer language such as C, C++, JAVA, or machine language that the processor (CPU) of the computer can read through the device interface of the computer. Such code may include functional code related to functions that define the operations needed to execute the methods, and may include control code related to the execution procedure that the processor of the computer needs in order to execute those functions in a predetermined order. Such code may further include memory-reference code indicating which location (address) in the internal or external memory of the computer should be referenced for any additional information or media that the processor needs in order to execute the functions. In addition, when the processor of the computer needs to communicate with another remote computer or server in order to execute the functions, the code may further include communication-related code specifying how to communicate with the remote computer or server using the communication module of the computer, and what information or media should be transmitted and received during the communication.

The storage medium refers not to a medium that stores data only for a short moment, such as a register, cache, or memory, but to a medium that stores data semi-permanently and can be read by a device. Specific examples of the storage medium include, but are not limited to, ROM, RAM, CD-ROM, magnetic tape, floppy disk, and optical data storage devices. That is, the program may be stored in various recording media on various servers that the computer can access, or in various recording media on the user's computer. In addition, the medium may be distributed across computer systems connected through a network, so that computer-readable code is stored in a distributed manner.

The steps of a method or algorithm described in connection with an embodiment of the present invention may be implemented directly in hardware, in a software module executed by hardware, or in a combination of the two. A software module may reside in random access memory (RAM), read only memory (ROM), erasable programmable ROM (EPROM), electrically erasable programmable ROM (EEPROM), flash memory, a hard disk, a removable disk, a CD-ROM, or any other form of computer-readable recording medium well known in the art to which the present invention pertains.

Although embodiments of the present invention have been described above with reference to the accompanying drawings, those skilled in the art to which the present invention pertains will understand that the present invention may be implemented in other specific forms without changing its technical spirit or essential features. Therefore, the embodiments described above should be understood as illustrative in all respects and not restrictive.

Claims

A text indexing processing apparatus through multi-image segmentation, comprising:
a conversion unit that converts a multi-image into single images;
an extraction unit that extracts text from each single image;
a storage unit that stores the extracted text; and
a control unit that generates page information for each single image, page-indexes the location and information of the extracted text for each single image by adding the generated page information to an index value, and stores the location and information of the page-indexed text in the storage unit,
wherein the control unit,
when an acquired image is the multi-image obtained by merging a plurality of scanned documents and rescanning them into a single file, converts the multi-image into the single images through the conversion unit, and
when performing the page indexing for each single image, if the location and information of the extracted text are identically present in a plurality of the single images, indexes the location and information of the extracted text as duplicate pages.

The text indexing processing apparatus through multi-image segmentation according to claim 1,
wherein the conversion unit
sets a division area in the multi-image, divides the multi-image according to the set division area, and converts each division area of the multi-image into a single image.

The text indexing processing apparatus through multi-image segmentation according to claim 2,
wherein the conversion unit,
when setting the division area in the multi-image, sets the division area according to a preset setting value.

The text indexing processing apparatus through multi-image segmentation according to claim 2,
wherein the conversion unit,
when setting the division area in the multi-image, sets the division area according to a user input value.

The text indexing processing apparatus through multi-image segmentation according to claim 1,
wherein the extraction unit
includes an optical recognition module that performs an OCR (Optical Character Recognition) function, and,
when extracting text from each converted single image, extracts at least one of letters, symbols, and numbers included in each single image.

The text indexing processing apparatus through multi-image segmentation according to claim 5,
wherein the control unit,
when the text of each single image has been extracted, generates the page information for each single image.

The text indexing processing apparatus through multi-image segmentation according to claim 5,
wherein the control unit,
when the extraction unit extracts the text of each single image, controls the extraction unit to generate the page information for each single image.

The text indexing processing apparatus through multi-image segmentation according to claim 1,
wherein the control unit,
when performing the page indexing for each single image, identifies a plurality of single images in which the location and information of the extracted text are identical,
selects one of the identified single images based on a preset priority, and
page-indexes the location and information of the extracted text using the page information of the selected single image.

A text index processing method through multi-image segmentation, performed by a text index processing apparatus through multi-image segmentation, the method comprising:
converting a multi-image into single images;
extracting text from each single image;
generating page information for each single image;
page-indexing the location and information of the extracted text for each single image by adding the generated page information to an index value; and
storing the location and information of the page-indexed text,
wherein, in the page indexing step,
when an acquired image is the multi-image obtained by merging a plurality of scanned documents and rescanning them into a single file, the multi-image is converted into the single images, and
when the page indexing is performed for each single image, if the location and information of the extracted text are identically present in a plurality of the single images, the location and information of the extracted text are indexed as duplicate pages.

(Deleted)