KR102374797B1

KR102374797B1 - Apparatus and method for processing text of multiimage

Info

Publication number: KR102374797B1
Application number: KR1020210076424A
Authority: KR
Inventors: 김지성; 채종은
Original assignee: 주식회사 아키브소프트
Priority date: 2021-06-14
Filing date: 2021-06-14
Publication date: 2022-03-16
Also published as: KR20230033694A; KR20220167741A; KR102516990B1; KR102570866B1

Abstract

The present invention relates to an apparatus and method for processing multi-image text, which extract and process text from a multi-image made by merging a large number of scanned documents and re-scanning them into a single file. According to the present invention, the apparatus for processing multi-image text comprises: an image acquisition unit acquiring a multi-image; an image conversion unit converting the acquired multi-image into a plurality of single images; a text extraction unit extracting text from the converted single images; a text storage unit storing the extracted text; and a control unit controlling the image conversion unit, the text extraction unit, and the text storage unit. The control unit generates page information for each of the single images, adds the generated page information to an index value, page-indexes the position and information of the extracted text for each of the single images, and controls the text storage unit to store the page-indexed position and information of the text. The present invention thereby aims to provide a user with accurate search results.

Description

Multi-image text processing apparatus and method {APPARATUS AND METHOD FOR PROCESSING TEXT OF MULTIIMAGE}

The present invention relates to a multi-image text processing apparatus and, more particularly, to a multi-image text processing apparatus and method capable of extracting and processing text from a multi-image in which a plurality of scanned documents are merged into a single file.

In general, the development and spread of the Internet have led to a variety of Internet-based services, a representative example of which is the search service.

In a search service, a user enters a word or combination of words as a query, and a search engine returns search results corresponding to that query.

To show users the content they are looking for, a search service provider collects various electronic documents, extracts the document text from them, indexes that text, and stores the index separately so that results for a user's query can be returned quickly. Documents that exist only as images, however, could not be extracted and searched in the same way as electronic documents.

To address this inconvenience, text in an image can be extracted and indexed using an OCR function. However, position values for the extracted text could be generated only for single images, that is, documents scanned one sheet at a time, so search functions could be provided only for single images.

When text is extracted from a multi-image created by scanning a plurality of documents and merging them into a single file, the images have no page divisions, and position information for the text cannot be provided when the extracted text is indexed. Search efficiency therefore deteriorates, and a search function for multi-images could not be provided.

For example, when text extracted from a 1000-page multi-image is indexed, it is difficult to determine where the text is located in the image, which reduces the efficiency of the search engine.

Accordingly, there is a need to develop a multi-image text processing apparatus that can provide accurate search results to a user by providing the position and information of text extracted from a multi-image on a page-by-page basis.

Korean Patent Application Publication No. 10-2019-0123790 (2019-11-01)

An object of the present invention, devised to solve the problems described above, is to provide a multi-image text processing apparatus and method that generate page information for each single image divided from a multi-image, add the generated page information to an index value, and page-index the position and information of the extracted text for each single image, thereby providing the position and information of text extracted from the multi-image on a page-by-page basis and giving the user accurate search results.

The problems to be solved by the present invention are not limited to those mentioned above, and other problems not mentioned will be clearly understood by those skilled in the art from the following description.

A multi-image text processing apparatus according to an embodiment of the present invention for solving the above problems includes an image acquisition unit that acquires a multi-image, an image conversion unit that converts the acquired multi-image into a plurality of single images, a text extraction unit that extracts text from the converted single images, a text storage unit that stores the extracted text, and a control unit that controls the image conversion unit, the text extraction unit, and the text storage unit. The control unit generates page information for each single image, adds the generated page information to an index value to page-index the position and information of the extracted text for each single image, and controls the text storage unit to store the page-indexed position and information of the text.

In an embodiment, the image conversion unit sets division areas in the multi-image, divides the multi-image according to the set division areas, and converts each division area of the multi-image into one single image.

In an embodiment, when setting the division areas in the multi-image, the image conversion unit sets the division areas according to a preset setting value.

In an embodiment, when setting the division areas in the multi-image, the image conversion unit sets the division areas according to a user input value.

In an embodiment, the text extraction unit includes an optical recognition module that performs an OCR (Optical Character Recognition) function.

In an embodiment, when extracting text from the converted single image, the text extraction unit extracts at least one of letters, symbols, and numbers contained in the single image.

In an embodiment, the control unit generates page information for the single image once the text of the single image has been extracted.

In an embodiment, the control unit controls the text extraction unit so that the text extraction unit generates page information for the single image while extracting the text of the single image.

Meanwhile, a multi-image text processing method of a multi-image text processing apparatus according to an embodiment of the present invention includes: acquiring a multi-image; converting the acquired multi-image into a plurality of single images; extracting text from the converted single images; generating page information for each single image; adding the generated page information to an index value to page-index the position and information of the extracted text for each single image; and storing the page-indexed position and information of the text.

A computer program providing a multi-image text processing method according to another embodiment of the present invention for solving the above problems is combined with a computer, which is hardware, and stored in a medium in order to perform any one of the methods described above.

In addition, other methods and other systems for implementing the present invention, and a computer-readable recording medium on which a computer program for executing the method is recorded, may be further provided.

As described above, according to the present invention, page information is generated for each single image divided from a multi-image, and the generated page information is added to an index value so that the position and information of the extracted text are page-indexed for each single image. The position and information of text extracted from the multi-image can thus be provided on a page-by-page basis, giving the user accurate search results.

The effects of the present invention are not limited to those mentioned above, and other effects not mentioned will be clearly understood by those skilled in the art from the following description.

FIG. 1 is a block diagram illustrating a multi-image text processing apparatus according to an embodiment of the present invention.
FIG. 2 is a diagram illustrating a multi-image.
FIG. 3 is a diagram illustrating a single image.
FIG. 4 is a flowchart illustrating a multi-image text processing method of the multi-image text processing apparatus according to an embodiment of the present invention.

The advantages and features of the present invention, and the methods of achieving them, will become apparent from the embodiments described in detail below together with the accompanying drawings. The present invention is not, however, limited to the embodiments disclosed below and may be implemented in various different forms. The embodiments are provided only to make the disclosure of the present invention complete and to fully convey the scope of the invention to those of ordinary skill in the art to which it pertains; the present invention is defined only by the scope of the claims.

The terminology used herein is for describing the embodiments and is not intended to limit the present invention. In this specification, the singular also includes the plural unless the context specifically states otherwise. As used herein, "comprises" and/or "comprising" do not exclude the presence or addition of one or more components other than those stated. Like reference numerals refer to like components throughout the specification, and "and/or" includes each of the stated components and every combination of one or more of them. Although "first", "second", and similar terms are used to describe various components, the components are not limited by these terms. The terms are used only to distinguish one component from another. Accordingly, a first component mentioned below may be a second component within the technical idea of the present invention.

Unless otherwise defined, all terms (including technical and scientific terms) used herein have the meaning commonly understood by those of ordinary skill in the art to which this invention belongs. Terms defined in commonly used dictionaries are not to be interpreted ideally or excessively unless explicitly and specifically defined.

Hereinafter, embodiments of the present invention will be described in detail with reference to the accompanying drawings.

Before the description, the meaning of the terms used in this specification is briefly explained. These explanations are intended to aid understanding of the specification, and unless explicitly stated as limiting the present invention, they are not used to limit its technical idea.

FIG. 1 is a block diagram illustrating a multi-image text processing apparatus according to an embodiment of the present invention, FIG. 2 is a diagram illustrating a multi-image, and FIG. 3 is a diagram illustrating a single image.

As shown in FIG. 1, the multi-image text processing apparatus of the present invention may include an image acquisition unit 110 that acquires a multi-image, an image conversion unit 120 that converts the acquired multi-image into a plurality of single images, a text extraction unit 130 that extracts text from the converted single images, a text storage unit 140 that stores the extracted text, and a control unit 150 that controls the image conversion unit 120, the text extraction unit 130, and the text storage unit 140.

Here, the control unit 150 may generate page information for each single image, add the generated page information to an index value to page-index the position and information of the extracted text for each single image, and control the text storage unit 140 to store the page-indexed position and information of the text.

In the present invention, as shown in FIG. 2, a multi-image may refer to an image file produced by scanning a plurality of documents, merging the scanned documents, and re-scanning them into a single file.

For example, a multi-image may include a plurality of image files in formats such as jpg, tif, and png.

As shown in FIG. 3, a single image may include one image file divided from a multi-image.

The image conversion unit 120 may set division areas in the multi-image, divide the multi-image according to the set division areas, and convert each division area of the multi-image into one single image.

As one example, when setting the division areas in the multi-image, the image conversion unit 120 may set the division areas according to a preset setting value.

Here, the preset setting value may include the size and number of the division areas, but this is only one embodiment and the invention is not limited thereto.

As another example, when setting the division areas in the multi-image, the image conversion unit 120 may set the division areas according to a user input value.

Here, the user input value may include the size and number of the division areas, but this is only one embodiment and the invention is not limited thereto.
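The division-area step can be sketched as follows. This is a minimal illustration only: it assumes the multi-image is a vertical strip of pixel rows that is cut into a given number of horizontal bands, which the specification does not state. `Region` and `split_regions` are hypothetical names, and `count` stands in for either the preset setting value or the user input value.

```python
from dataclasses import dataclass
from typing import List


@dataclass(frozen=True)
class Region:
    """One division area (hypothetical structure, not from the specification)."""
    page: int    # 1-based page number the area will become
    top: int     # first pixel row, inclusive
    bottom: int  # last pixel row, exclusive


def split_regions(total_height: int, count: int) -> List[Region]:
    """Divide `total_height` pixel rows into `count` bands of near-equal size."""
    if total_height <= 0 or count <= 0:
        raise ValueError("total_height and count must be positive")
    base, extra = divmod(total_height, count)
    regions: List[Region] = []
    top = 0
    for page in range(1, count + 1):
        # Spread any remainder rows over the first `extra` bands.
        height = base + (1 if page <= extra else 0)
        regions.append(Region(page, top, top + height))
        top += height
    return regions
```

Each returned `Region` would then be cropped out and saved as one single image; the cropping itself depends on the imaging library in use and is left out here.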

Next, the text extraction unit 130 may include an optical recognition module that performs an OCR (Optical Character Recognition) function, but this is only one embodiment and the invention is not limited thereto.

When extracting text from a converted single image, the text extraction unit 130 may extract at least one of letters, symbols, and numbers contained in the single image.

Next, the control unit 150 may generate page information for each single image, add the generated page information to an index value to page-index the position and information of the extracted text for each single image, and control the text storage unit 140 to store the page-indexed position and information of the text.

Here, when the image acquisition unit 110 acquires a multi-image, the control unit 150 may control the image conversion unit 120 to convert the acquired multi-image into a plurality of single images.

In some cases, when the image acquisition unit 110 acquires an image, the control unit 150 may check whether the acquired image is a multi-image. If it is a multi-image, the control unit 150 may control the image conversion unit 120 to convert the multi-image into a plurality of single images; if it is not, the control unit 150 may control the text extraction unit 130 to extract text directly from the image.
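The branching described above can be sketched as a small dispatch function. This is an illustrative sketch, not the patented implementation: `extract_all` and its three callable parameters (`is_multi_image`, `split`, `ocr`) are hypothetical stand-ins for the check performed by the control unit, the image conversion unit, and the text extraction unit.

```python
from typing import Any, Callable, List


def extract_all(
    image: Any,
    is_multi_image: Callable[[Any], bool],
    split: Callable[[Any], List[Any]],
    ocr: Callable[[Any], str],
) -> List[str]:
    """Split only when the acquired image is a multi-image, then run OCR."""
    # A non-multi-image is treated as a one-element list of single images.
    singles = split(image) if is_multi_image(image) else [image]
    return [ocr(single) for single in singles]
```

Passing the three behaviours in as callables keeps the routing logic independent of any particular image or OCR library.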

In addition, when the multi-image has been converted into a plurality of single images by the image conversion unit 120, the control unit 150 may control the text extraction unit 130 to extract text from the converted single images.

When the text of a single image has been extracted, the control unit 150 may generate page information for that single image.

Here, the control unit 150 may generate the page information for each single image after the text of all single images has been extracted.

In some cases, the control unit 150 may instead generate the page information for a single image each time the text of that single image is extracted.

As another case, the control unit 150 may control the text extraction unit so that, when the text extraction unit 130 extracts the text of a single image, it also generates the page information for that single image.

In this way, in the present invention, the control unit 150 may control the generation of page information for each single image in response to text extraction by the text extraction unit 130, or the text extraction unit 130 may itself generate the page information for each single image while performing text extraction.

Accordingly, the present invention allows various design modifications in how the page information for each single image is generated.
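The two timing variants (page information generated after all text is extracted, or as each single image is processed) can be contrasted in a short sketch. It assumes that page information is simply a 1-based page number in the order the single images were produced, which the specification does not spell out; `pages_after_all`, `pages_as_extracted`, and `ocr` are hypothetical names.

```python
from typing import Any, Callable, Iterator, List, Tuple


def pages_after_all(singles: List[Any], ocr: Callable[[Any], str]) -> List[Tuple[int, str]]:
    """Extract every text first, then attach page numbers in one pass."""
    texts = [ocr(single) for single in singles]
    return [(page, text) for page, text in enumerate(texts, start=1)]


def pages_as_extracted(singles: List[Any], ocr: Callable[[Any], str]) -> Iterator[Tuple[int, str]]:
    """Attach the page number at the moment each single image is processed."""
    for page, single in enumerate(singles, start=1):
        yield page, ocr(single)
```

Both variants produce the same pairs; the generator version makes results available page by page, which may matter for very large multi-images.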

Next, when page-indexing for each single image, if the position and information of the extracted text are identically present in a plurality of single images, the control unit 150 may index the position and information of the extracted text as duplicate pages.

As another embodiment, when page-indexing for each single image, if the position and information of the extracted text are identically present in a plurality of single images, the control unit 150 may select one of those single images and page-index the position and information of the extracted text using the page information of the selected single image.

Here, when selecting one of the single images in which the position and information of the extracted text are identically present, the control unit 150 may make the selection according to a preset priority.

In this way, when the position and information of the extracted text are identically present in a plurality of single images, the present invention can use various methods to speed up indexing.

In addition, by selecting one of those single images according to priority and page-indexing the position and information of the extracted text using the page information of the selected single image, the present invention can also provide the user with quick search results.
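The two ways of handling text that appears identically on several pages can be sketched as follows. The entry layout `(page, text, position)` and the function names are assumptions made for illustration. The preset priority is not defined in the specification, so the sketch uses "lowest page number" (`min`) purely as a placeholder.

```python
from collections import defaultdict
from typing import Callable, Dict, Iterable, List, Tuple

Position = Tuple[int, int]           # hypothetical (x, y) of the text
Entry = Tuple[int, str, Position]    # hypothetical (page, text, position)
Key = Tuple[str, Position]


def index_duplicates(entries: Iterable[Entry]) -> Dict[Key, List[int]]:
    """Duplicate-page indexing: keep every page on which the same text and position occur."""
    index: Dict[Key, List[int]] = defaultdict(list)
    for page, text, position in entries:
        index[(text, position)].append(page)
    return dict(index)


def index_by_priority(
    entries: Iterable[Entry],
    priority: Callable[[List[int]], int] = min,  # placeholder priority rule
) -> Dict[Key, int]:
    """Single-page indexing: keep only the page chosen by the priority rule."""
    return {key: priority(pages) for key, pages in index_duplicates(entries).items()}
```

With a repeated stamp such as a header on pages 1 and 2, the first function records both pages while the second records only page 1.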

Next, when storing the page-indexed position and information of the text, the control unit 150 may group the position and information of the text extracted from the single images by multi-image and store each group together, but this is only one embodiment and the invention is not limited thereto.
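Grouped storage might look like the following in-memory sketch. `TextStore`, its methods, and the string identifier for each multi-image are all hypothetical; a real text storage unit would presumably sit on a database or a search engine index.

```python
from collections import defaultdict
from typing import Dict, List, Tuple

Position = Tuple[int, int]          # hypothetical (x, y) of the text
Entry = Tuple[int, str, Position]   # hypothetical (page, text, position)


class TextStore:
    """Keeps page-indexed entries grouped under the multi-image they came from."""

    def __init__(self) -> None:
        self._groups: Dict[str, List[Entry]] = defaultdict(list)

    def save(self, multi_image_id: str, entries: List[Entry]) -> None:
        """Append entries to the group of one multi-image."""
        self._groups[multi_image_id].extend(entries)

    def search(self, query: str) -> List[Tuple[str, int, Position]]:
        """Return (multi-image id, page, position) for every text containing the query."""
        return [
            (multi_image_id, page, position)
            for multi_image_id, entries in self._groups.items()
            for page, text, position in entries
            if query in text
        ]
```

Because each hit carries both the multi-image identifier and the page number, a search result can point the user to the exact page inside a merged scan.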

As described, the present invention generates page information for each single image divided from a multi-image and adds the generated page information to an index value so that the position and information of the extracted text are page-indexed for each single image. The position and information of text extracted from the multi-image can thus be provided on a page-by-page basis, giving the user accurate search results.

FIG. 4 is a flowchart illustrating a multi-image text processing method of the multi-image text processing apparatus according to an embodiment of the present invention.

As shown in FIG. 4, the multi-image text processing apparatus of the present invention may first acquire a multi-image and convert the acquired multi-image into a plurality of single images (S10).

Here, when converting the multi-image into a plurality of single images, the present invention may set division areas in the multi-image, divide the multi-image according to the set division areas, and convert each division area of the multi-image into one single image.

When setting the division areas in the multi-image, the present invention may set them according to a preset setting value or, in some cases, according to a user input value.

The present invention may then extract text from the converted single images (S20).

Next, the present invention may generate page information for each single image (S30).

Here, the present invention may generate the page information for each single image after the text of all single images has been extracted or, in some cases, may generate the page information for a single image each time the text of that single image is extracted.

Next, the present invention may add the generated page information to an index value to page-index the position and information of the extracted text for each single image (S40).

Here, when page-indexing for each single image, if the position and information of the extracted text are identically present in a plurality of single images, the present invention may index the position and information of the extracted text as duplicate pages.

In some cases, when page-indexing for each single image, if the position and information of the extracted text are identically present in a plurality of single images, the present invention may select one of those single images and page-index the position and information of the extracted text using the page information of the selected single image.

Here, when selecting one of the single images in which the position and information of the extracted text are identically present, the present invention may make the selection according to a preset priority.

The present invention may then store the page-indexed position and information of the text (S50).

Here, when storing the page-indexed position and information of the text, the present invention may group the position and information of the text extracted from the single images by multi-image and store each group together, but this is only one embodiment and the invention is not limited thereto.
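Steps S10 to S50 can be strung together in a compact sketch. Everything here is an assumption made for illustration: `run_pipeline` is a hypothetical name, `split` and `ocr` are caller-supplied stand-ins for the image conversion and text extraction units, `ocr` is assumed to return `(text, position)` pairs, and a plain list plays the role of the text storage unit.

```python
from typing import Any, Callable, Dict, List, Tuple


def run_pipeline(
    multi_image: Any,
    split: Callable[[Any], List[Any]],
    ocr: Callable[[Any], List[Tuple[str, Any]]],
    store: List[Dict[str, Any]],
) -> List[Dict[str, Any]]:
    """Run S10 to S50 on one multi-image and return the page-indexed entries."""
    singles = split(multi_image)                          # S10: convert to single images
    index: List[Dict[str, Any]] = []
    for page, single in enumerate(singles, start=1):      # S30: page information
        for text, position in ocr(single):                # S20: extract text
            # S40: add the page information to the index value
            index.append({"page": page, "text": text, "position": position})
    store.extend(index)                                   # S50: store the indexed entries
    return index
```

In this sketch the page number is generated as each single image is processed, which corresponds to the per-extraction variant described above.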

이상에서 전술한 본 발명의 일 실시예에 따른 방법은, 하드웨어인 서버와 결합되어 실행되기 위해 프로그램(또는 어플리케이션)으로 구현되어 매체에 저장될 수 있다.The method according to an embodiment of the present invention described above may be implemented as a program (or application) to be executed in combination with a server, which is hardware, and stored in a medium.

상기 전술한 프로그램은, 상기 컴퓨터가 프로그램을 읽어 들여 프로그램으로 구현된 상기 방법들을 실행시키기 위하여, 상기 컴퓨터의 프로세서(CPU)가 상기 컴퓨터의 장치 인터페이스를 통해 읽힐 수 있는 C, C++, JAVA, 기계어 등의 컴퓨터 언어로 코드화된 코드(Code)를 포함할 수 있다. 이러한 코드는 상기 방법들을 실행하는 필요한 기능들을 정의한 함수 등과 관련된 기능적인 코드(Functional Code)를 포함할 수 있고, 상기 기능들을 상기 컴퓨터의 프로세서가 소정의 절차대로 실행시키는데 필요한 실행 절차 관련 제어 코드를 포함할 수 있다. 또한, 이러한 코드는 상기 기능들을 상기 컴퓨터의 프로세서가 실행시키는데 필요한 추가 정보나 미디어가 상기 컴퓨터의 내부 또는 외부 메모리의 어느 위치(주소 번지)에서 참조되어야 하는지에 대한 메모리 참조관련 코드를 더 포함할 수 있다. 또한, 상기 컴퓨터의 프로세서가 상기 기능들을 실행시키기 위하여 원격(Remote)에 있는 어떠한 다른 컴퓨터나 서버 등과 통신이 필요한 경우, 코드는 상기 컴퓨터의 통신 모듈을 이용하여 원격에 있는 어떠한 다른 컴퓨터나 서버 등과 어떻게 통신해야 하는지, 통신 시 어떠한 정보나 미디어를 송수신해야 하는지 등에 대한 통신 관련 코드를 더 포함할 수 있다.The above-described program is C, C++, JAVA, machine language, etc. that a processor (CPU) of the computer can read through a device interface of the computer in order for the computer to read the program and execute the methods implemented as a program It may include code (Code) coded in the computer language of Such code may include functional code related to a function defining functions necessary for executing the methods, etc., and includes an execution procedure related control code necessary for the processor of the computer to execute the functions according to a predetermined procedure. can do. In addition, this code may further include additional information necessary for the processor of the computer to execute the functions or code related to memory reference for which location (address address) in the internal or external memory of the computer should be referenced. there is. In addition, when the processor of the computer needs to communicate with any other computer or server located remotely in order to execute the functions, the code uses the communication module of the computer to determine how to communicate with any other computer or server remotely. It may further include a communication-related code for whether to communicate and what information or media to transmit and receive during communication.

The storage medium refers not to a medium that stores data for a short moment, such as a register, cache, or memory, but to a medium that stores data semi-permanently and can be read by a device. Specific examples of the storage medium include, but are not limited to, ROM, RAM, CD-ROM, magnetic tape, floppy disk, and optical data storage devices. That is, the program may be stored in various recording media on various servers that the computer can access, or in various recording media on the user's computer. The medium may also be distributed over computer systems connected by a network, so that computer-readable code is stored in a distributed manner.

The steps of a method or algorithm described in connection with an embodiment of the present invention may be implemented directly in hardware, as a software module executed by hardware, or as a combination of the two. A software module may reside in random access memory (RAM), read only memory (ROM), erasable programmable ROM (EPROM), electrically erasable programmable ROM (EEPROM), flash memory, a hard disk, a removable disk, a CD-ROM, or any other form of computer-readable recording medium well known in the art to which the present invention pertains.

Although embodiments of the present invention have been described above with reference to the accompanying drawings, those skilled in the art to which the present invention pertains will understand that the present invention may be embodied in other specific forms without changing its technical spirit or essential features. The embodiments described above should therefore be understood as illustrative in all respects and not restrictive.

Claims

A multi-image text processing apparatus comprising:
an image acquisition unit for acquiring a multi-image;
an image conversion unit for converting the acquired multi-image into a plurality of single images;
a text extraction unit for extracting text from the converted single images;
a text storage unit for storing the extracted text; and
a control unit for controlling the image conversion unit, the text extraction unit, and the text storage unit,
wherein the control unit generates page information for each single image, adds the generated page information to an index value, performs page index processing on the location and information of the extracted text for each single image, and controls the text storage unit to store the location and information of the page-indexed text,
wherein, when performing page index processing for each single image, if the location and information of the extracted text are the same in a plurality of single images, the control unit selects any one of the plurality of single images in which the location and information of the extracted text are the same, and page-indexes the location and information of the extracted text as page information for the selected single image, and
wherein, when selecting any one of the plurality of single images in which the location and information of the extracted text are the same, the control unit makes the selection according to a preset priority.
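
The page index processing and priority-based selection recited above can be illustrated with a minimal sketch. The names `TextHit` and `build_page_index` are hypothetical and do not appear in the specification, and the rule "lowest page number wins" is only an assumed example of a preset priority, which the claim does not define.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class TextHit:
    page: int    # page information generated for the single image
    text: str    # extracted text (the "information")
    box: tuple   # location of the text as (x, y, width, height)


def build_page_index(hits, priority=min):
    """Map each (text, location) pair to exactly one page.

    When the same text appears at the same location in several single
    images, `priority` chooses which page to keep. Defaulting to `min`
    (lowest page number) is an assumption made for illustration.
    """
    candidates = {}
    for hit in hits:
        candidates.setdefault((hit.text, hit.box), []).append(hit.page)
    return {key: priority(pages) for key, pages in candidates.items()}


hits = [
    TextHit(page=1, text="Invoice", box=(10, 10, 80, 12)),
    TextHit(page=2, text="Invoice", box=(10, 10, 80, 12)),  # same text, same location
    TextHit(page=2, text="Total", box=(10, 200, 40, 12)),
]
index = build_page_index(hits)
```

Passing a different callable as `priority` (for example `max`) swaps in another selection rule without changing the indexing logic.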

The apparatus of claim 1,
wherein the image conversion unit sets division areas in the multi-image, divides the multi-image according to the set division areas, and converts each division area of the multi-image into a single image.
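
As a rough illustration of dividing a multi-image by division areas, the sketch below tiles the image with a regular grid and cuts out one single image per tile. The grid layout, the function names `division_areas` and `split_multi_image`, and the representation of an image as a list of pixel rows are all assumptions made for this example; the claim does not specify how the areas are laid out.

```python
def division_areas(width, height, rows, cols):
    """Return (left, top, right, bottom) boxes that tile the multi-image in a grid."""
    boxes = []
    for r in range(rows):
        for c in range(cols):
            left, right = c * width // cols, (c + 1) * width // cols
            top, bottom = r * height // rows, (r + 1) * height // rows
            boxes.append((left, top, right, bottom))
    return boxes


def split_multi_image(pixels, rows, cols):
    """Cut a multi-image (a list of pixel rows) into single images, one per area."""
    height, width = len(pixels), len(pixels[0])
    return [
        [row[left:right] for row in pixels[top:bottom]]
        for left, top, right, bottom in division_areas(width, height, rows, cols)
    ]


# A 4x4 "multi-image" made of four 2x2 pages, each filled with its own value.
pixels = [
    [0, 0, 1, 1],
    [0, 0, 1, 1],
    [2, 2, 3, 3],
    [2, 2, 3, 3],
]
parts = split_multi_image(pixels, rows=2, cols=2)
```

Here `rows` and `cols` play the role of the value that determines the division areas; they could equally come from a stored default or from user input.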

The apparatus of claim 2,
wherein, when setting the division area in the multi-image, the image conversion unit sets the division area according to a preset value.

The apparatus of claim 2,
wherein, when setting the division area in the multi-image, the image conversion unit sets the division area according to a user input value.

The apparatus of claim 1,
wherein the text extraction unit comprises an optical recognition module that performs an OCR (Optical Character Recognition) function.

The apparatus of claim 2,
wherein, when extracting text from the converted single image, the text extraction unit extracts at least one of letters, symbols, and numbers included in the single image.

The apparatus of claim 6,
wherein the control unit generates page information for the single image when the text of the single image is extracted.

The apparatus of claim 6,
wherein, when the text extraction unit extracts the text of the single image, the control unit controls the text extraction unit to generate page information for the single image.

A multi-image text processing method of a multi-image text processing apparatus, the method comprising:
acquiring a multi-image;
converting the acquired multi-image into a plurality of single images;
extracting text from the converted single images;
generating page information for each single image;
adding the generated page information to an index value and performing page index processing on the location and information of the extracted text for each single image; and
storing the location and information of the page-indexed text,
wherein the step of performing page index processing for each single image comprises, if the location and information of the extracted text are the same in a plurality of single images, selecting any one of the plurality of single images in which the location and information of the extracted text are the same, and page-indexing the location and information of the extracted text as page information for the selected single image, and
wherein, when selecting any one of the plurality of single images in which the location and information of the extracted text are the same, the selection is made according to a preset priority.
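
The order of the method steps can be sketched as a single function. This is an illustrative outline only: `process_multi_image` is a hypothetical name, the `split` and `ocr` callables stand in for the image conversion and text extraction steps, a plain dictionary stands in for text storage, and keeping the first page seen is an assumed example of a preset priority.

```python
def process_multi_image(multi_image, split, ocr):
    """Run the claimed steps in order and return the stored page index.

    `split` and `ocr` are hypothetical callables supplied by the caller:
      split(multi_image) -> list of single images
      ocr(single_image)  -> list of (text, location) pairs
    """
    index = {}                          # stands in for the text storage
    single_images = split(multi_image)  # convert into single images
    # Page information is the 1-based position of each single image.
    for page, image in enumerate(single_images, start=1):
        for text, location in ocr(image):  # extract text and its location
            # Identical text at an identical location keeps the first page
            # seen, an assumed priority of "lowest page number".
            index.setdefault((text, location), page)
    return index


# Stub inputs: each "single image" is already a list of (text, location) pairs.
multi = [
    [("A", (0, 0))],
    [("A", (0, 0)), ("B", (1, 1))],
]
result = process_multi_image(multi, split=lambda m: m, ocr=lambda image: image)
```

In a real system the stubs would be replaced by an actual image splitter and OCR engine, and the dictionary by persistent storage.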

A computer program stored in a medium, which is combined with a computer as hardware, to perform the multi-image text processing method of claim 9.