KR20200021398A

KR20200021398A - Image processing apparatus and method

Info

Publication number: KR20200021398A
Application number: KR1020190086021A
Authority: KR
Inventors: 김원태; 강신욱; 이명재; 김동민; 김필수
Original assignee: (주)제이엘케이인스펙션
Priority date: 2019-07-16
Filing date: 2019-07-16
Publication date: 2020-02-28
Also published as: KR102188649B1

Abstract

Provided are an image processing method and apparatus for determining whether an object corresponds to a search target object. The image processing method of the present disclosure comprises: a step of receiving an analysis target image including at least one object; a step of acquiring numerical information regarding whether the at least one object included in the analysis target image corresponds to a detection target object by using an artificial intelligence model; a primary detection step of determining the at least one object as the detection target object when the acquired numerical information is greater than or equal to a first threshold value; and a step of outputting the determination result, wherein sensitivity in the primary detection step is adjusted by adjusting a parameter in the artificial intelligence model or the first threshold value related to the acquisition of the numerical information.

Description

Image processing apparatus and method {IMAGE PROCESSING APPARATUS AND METHOD}

본 개시는 객체 검출을 위한 영상 처리 장치 및 방법에 관한 것이다. 보다 구체적으로, 본 개시는 인공지능 모델을 이용하여 분석 대상 영상에 포함된 객체가 검출 대상 객체에 해당하는지를 판단하는 영상 처리 장치, 방법 및 본 개시의 영상 처리 방법을 실행하기 위한 프로그램을 기록한 컴퓨터로 판독 가능한 기록 매체에 관한 것이다.The present disclosure relates to an image processing apparatus and a method for detecting an object. More specifically, the present disclosure provides an image processing apparatus, a method for determining whether an object included in an analysis target image corresponds to a detection target object using an artificial intelligence model, and a computer for recording a program for executing the image processing method of the present disclosure. A readable recording medium.

객체 인식은 임의의 영상 내에서 객체로 인식된 영역을 기설정된 복수의 부류(class) 중 하나로 인식하는 프로세싱으로서, 객체란 영상 내의 특정한 물체를 의미할 수 있다.Object recognition is a process of recognizing an area recognized as an object in an image as one of a plurality of preset classes. An object may mean a specific object in an image.

최근 인공지능 모델을 이용하여 영상에 포함되어 있는 객체를 인식하거나 분류하는 연구가 늘어나고 있는 추세이다.Recently, researches for recognizing or classifying objects included in images using artificial intelligence models are increasing.

한편, 딥러닝(deep learning)은 매우 방대한 양의 데이터를 학습하여, 새로운 데이터가 입력될 경우 학습 결과를 바탕으로 확률적으로 가장 높은 답을 선택하는 인공지능 모델로서, 영상에 따라 적응적으로 동작할 수 있으며, 데이터에 기초하여 모델을 학습하는 과정에서 특성인자를 자동으로 찾아낼 수 있다.On the other hand, deep learning is an artificial intelligence model that learns a very large amount of data and selects the highest answer probability based on the learning result when new data is input. It can automatically find feature factors in the course of training the model based on the data.

본 개시의 기술적 과제는, 영상에 포함된 객체가 검출 대상 객체에 해당하는지를 판단하는 영상 처리 장치 및 방법을 제공하는 것이다.An object of the present disclosure is to provide an image processing apparatus and method for determining whether an object included in an image corresponds to a detection target object.

본 개시의 다른 기술적 과제는, 인공지능 모델에 기초하여 객체 검출의 민감도를 조정함으로써 검출 객체가 검출 대상 객체에 해당하는지를 판단하는 영상 처리 장치 및 방법을 제공하는 것이다.Another technical problem of the present disclosure is to provide an image processing apparatus and method for determining whether a detection object corresponds to a detection object by adjusting a sensitivity of object detection based on an artificial intelligence model.

본 개시에서 이루고자 하는 기술적 과제들은 이상에서 언급한 기술적 과제들로 제한되지 않으며, 언급하지 않은 또 다른 기술적 과제들은 아래의 기재로부터 본 개시가 속하는 기술분야에서 통상의 지식을 가진 자에게 명확하게 이해될 수 있을 것이다.Technical problems to be achieved in the present disclosure are not limited to the above-mentioned technical problems, and other technical problems not mentioned above will be clearly understood by those skilled in the art from the following description. Could be.

본 개시의 일 양상에 따르면, 하나 이상의 객체를 포함하는 분석 대상 영상을 수신하는 단계; 인공지능 모델을 이용하여 상기 분석 대상 영상에 포함된 상기 하나 이상의 객체가 검출 대상 객체에 해당하는지에 관한 수치 정보를 획득하는 단계; 상기 획득된 수치 정보가 제1 임계값 이상인 경우, 상기 하나 이상의 객체를 상기 검출 대상 객체로 판단하는 1차 검출 단계; 및 상기 판단한 결과를 출력하는 단계를 포함하고, 상기 수치 정보의 획득과 관련된 상기 인공지능 모델 내부의 파라미터 또는 상기 제1 임계값을 조정함으로써, 상기 1차 검출 단계의 민감도를 조정하는 것을 특징으로 하는 영상 처리 방법이 제공될 수 있다.According to an aspect of the present disclosure, a method of receiving an analysis target image including one or more objects; Obtaining numerical information about whether the one or more objects included in the analysis target image correspond to the detection target object using an artificial intelligence model; A first detection step of determining the at least one object as the detection target object when the obtained numerical information is equal to or greater than a first threshold value; And outputting the determined result, and adjusting the sensitivity of the first detection step by adjusting a parameter or the first threshold value in the artificial intelligence model related to the acquisition of the numerical information. An image processing method may be provided.

본 개시의 다른 양상에 따르면, 하나 이상의 객체를 포함하는 분석 대상 영상을 수신하는 영상 수신부; 인공지능 모델을 이용하여 상기 분석 대상 영상에 포함된 상기 하나 이상의 객체가 검출 대상 객체에 해당하는지에 관한 수치 정보를 획득하고, 상기 획득된 수치 정보가 제1 임계값 이상인 경우, 상기 하나 이상의 객체를 상기 검출 대상 객체로 판단하는 1차 검출 단계를 수행하는 객체 검출부; 및 상기 판단한 결과를 출력하는 출력부를 포함하고, 상기 수치 정보의 획득과 관련된 상기 인공지능 모델 내부의 파라미터 또는 상기 제1 임계값을 조정함으로써, 상기 1차 검출 단계의 민감도를 조정하는 것을 특징으로 하는 영상 처리 장치가 제공될 수 있다.According to another aspect of the present disclosure, an image receiving unit receiving an analysis target image including one or more objects; Acquiring numerical information about whether the one or more objects included in the analysis target image corresponds to a detection object using an artificial intelligence model, and when the acquired numerical information is equal to or greater than a first threshold value, the one or more objects are selected. An object detector configured to perform a first detection step of determining the detection target object; And an output unit for outputting the determined result, wherein the sensitivity of the first detection step is adjusted by adjusting a parameter in the artificial intelligence model related to the acquisition of the numerical information or the first threshold value. An image processing apparatus may be provided.

본 개시의 또 다른 양상에 따르면, 본 개시의 영상 처리 방법을 실행하기 위한 프로그램을 기록한 컴퓨터로 판독 가능한 기록 매체가 제공될 수 있다.According to another aspect of the present disclosure, a computer-readable recording medium having recorded a program for executing the image processing method of the present disclosure may be provided.

본 개시에 대하여 위에서 간략하게 요약된 특징들은 후술하는 본 개시의 상세한 설명의 예시적인 양상일 뿐이며, 본 개시의 범위를 제한하는 것은 아니다.The features briefly summarized above with respect to the present disclosure are merely exemplary aspects of the detailed description of the present disclosure described below, and do not limit the scope of the present disclosure.

본 개시에 따르면, 영상에 포함된 객체가 검출 대상 객체에 해당하는지를 판단하는 영상 처리 장치 및 방법이 제공될 수 있다.According to the present disclosure, an image processing apparatus and method for determining whether an object included in an image corresponds to a detection object may be provided.

본 개시에 따르면, 인공지능 모델에 기초하여 객체 검출의 민감도를 조정함으로써 검출 객체가 검출 대상 객체에 해당하는지를 판단하는 영상 처리 장치 및 방법이 제공될 수 있다.According to the present disclosure, an image processing apparatus and method for determining whether a detection object corresponds to a detection target object by adjusting the sensitivity of object detection based on an artificial intelligence model may be provided.

본 개시에서 얻을 수 있는 효과는 이상에서 언급한 효과들로 제한되지 않으며, 언급하지 않은 또 다른 효과들은 아래의 기재로부터 본 개시가 속하는 기술분야에서 통상의 지식을 가진 자에게 명확하게 이해될 수 있을 것이다.Effects obtained in the present disclosure are not limited to the above-mentioned effects, and other effects not mentioned above may be clearly understood by those skilled in the art from the following description. will be.

도 1은 본 개시의 일 실시 예에 따른 영상 처리 장치의 구성을 나타내는 블록도이다.
도 2는 본 개시의 일 실시 예에 따른 영상에 포함된 객체가 검출 대상 객체에 해당하는지를 판단하는 과정을 나타내는 도면이다.
도 3은 본 개시의 일 실시 예에 따른 영상의 맥락 정보를 생성하고 분석하는 과정을 나타내는 도면이다.
도 4는 본 개시의 일 실시 예에 따른 영상 처리 장치가 영상을 분석하여 객체를 검출하는 과정을 나타내는 도면이다.
도 5는 본 개시의 일 실시 예에 따른 객체의 물성에 기초하여 색상이 표현된 영상을 나타내는 도면이다.
도 6은 본 개시의 일 실시 예에 따른 영상 처리 방법을 나타내는 도면이다.1 is a block diagram illustrating a configuration of an image processing apparatus according to an exemplary embodiment.
2 is a diagram illustrating a process of determining whether an object included in an image corresponds to a detection object according to an exemplary embodiment.
3 is a diagram illustrating a process of generating and analyzing context information of an image according to an exemplary embodiment.
4 is a diagram illustrating a process of detecting an object by analyzing an image according to an embodiment of the present disclosure.
5 is a diagram illustrating an image in which colors are expressed based on physical properties of an object according to an exemplary embodiment.
6 is a diagram illustrating an image processing method according to an exemplary embodiment.

이하에서는 첨부한 도면을 참고로 하여 본 개시의 실시 예에 대하여 본 개시가 속하는 기술 분야에서 통상의 지식을 가진 자가 용이하게 실시할 수 있도록 상세히 설명한다. 그러나, 본 개시는 여러 가지 상이한 형태로 구현될 수 있으며 여기에서 설명하는 실시 예에 한정되지 않는다. Hereinafter, exemplary embodiments of the present disclosure will be described in detail with reference to the accompanying drawings so that those skilled in the art may easily implement the present disclosure. However, the present disclosure may be embodied in many different forms and should not be construed as limited to the embodiments set forth herein.

본 개시의 실시 예를 설명함에 있어서 공지 구성 또는 기능에 대한 구체적인 설명이 본 개시의 요지를 흐릴 수 있다고 판단되는 경우에는 그에 대한 상세한 설명은 생략한다. 그리고, 도면에서 본 개시에 대한 설명과 관계없는 부분은 생략하였으며, 유사한 부분에 대해서는 유사한 도면 부호를 붙였다.In describing the embodiments of the present disclosure, when it is determined that a detailed description of a known structure or function may obscure the gist of the present disclosure, a detailed description thereof will be omitted. In the drawings, parts irrelevant to the description of the present disclosure are omitted, and like reference numerals designate like parts.

본 개시에 있어서, 어떤 구성요소가 다른 구성요소와 "연결", "결합" 또는 "접속"되어 있다고 할 때, 이는 직접적인 연결관계뿐만 아니라, 그 중간에 또 다른 구성요소가 존재하는 간접적인 연결관계도 포함할 수 있다. 또한 어떤 구성요소가 다른 구성요소를 "포함한다" 또는 "가진다"고 할 때, 이는 특별히 반대되는 기재가 없는 한 다른 구성요소를 배제하는 것이 아니라 또 다른 구성요소를 더 포함할 수 있는 것을 의미한다.In the present disclosure, when a component is "connected", "coupled" or "connected" with another component, it is not only a direct connection, but also an indirect connection in which another component exists in the middle of the connection. It may also include. In addition, when a component "includes" or "having" another component, it means that it may further include another component, without excluding the other component unless otherwise stated. .

본 개시에 있어서, 제1, 제2 등의 용어는 하나의 구성요소를 다른 구성요소로부터 구별하는 목적으로만 사용되며, 특별히 언급되지 않는 한 구성요소들간의 순서 또는 중요도 등을 한정하지 않는다. 따라서, 본 개시의 범위 내에서 일 실시 예에서의 제1 구성요소는 다른 실시 예에서 제2 구성요소라고 칭할 수도 있고, 마찬가지로 일 실시 예에서의 제2 구성요소를 다른 실시 예에서 제1 구성요소라고 칭할 수도 있다. In the present disclosure, the terms "first" and "second" are used only for the purpose of distinguishing one component from other components, and do not limit the order or importance between the components unless specifically mentioned. Accordingly, within the scope of the present disclosure, a first component in one embodiment may be referred to as a second component in another embodiment, and likewise, a second component in one embodiment may be referred to as a first component in another embodiment. It may also be called.

본 개시에 있어서, 서로 구별되는 구성요소들은 각각의 특징을 명확하게 설명하기 위함이며, 구성요소들이 반드시 분리되는 것을 의미하지는 않는다. 즉, 복수의 구성요소가 통합되어 하나의 하드웨어 또는 소프트웨어 단위로 이루어질 수도 있고, 하나의 구성요소가 분산되어 복수의 하드웨어 또는 소프트웨어 단위로 이루어질 수도 있다. 따라서, 별도로 언급하지 않더라도 이와 같이 통합된 또는 분산된 실시 예도 본 개시의 범위에 포함된다. In the present disclosure, the components distinguished from each other are for clearly describing each feature, and do not necessarily mean that the components are separated. That is, a plurality of components may be integrated into one hardware or software unit, or one component may be distributed into a plurality of hardware or software units. Therefore, even if not mentioned otherwise, such integrated or distributed embodiments are included in the scope of the present disclosure.

본 개시에 있어서, 다양한 실시 예에서 설명하는 구성요소들이 반드시 필수적인 구성요소들은 의미하는 것은 아니며, 일부는 선택적인 구성요소일 수 있다. 따라서, 일 실시 예에서 설명하는 구성요소들의 부분집합으로 구성되는 실시 예도 본 개시의 범위에 포함된다. 또한, 다양한 실시 예에서 설명하는 구성요소들에 추가적으로 다른 구성요소를 포함하는 실시 예도 본 개시의 범위에 포함된다. In the present disclosure, components described in various embodiments are not necessarily required components, and some may be optional components. Therefore, an embodiment composed of a subset of components described in an embodiment is also included in the scope of the present disclosure. In addition, embodiments including other components in addition to the components described in the various embodiments are included in the scope of the present disclosure.

이하, 첨부한 도면을 참조하여 본 개시의 실시 예들에 대해서 설명한다.Hereinafter, embodiments of the present disclosure will be described with reference to the accompanying drawings.

도 1은 본 개시의 일 실시 예에 따른 영상 처리 장치의 구성을 나타내는 블록도이다.1 is a block diagram illustrating a configuration of an image processing apparatus according to an exemplary embodiment.

도 1을 참조하면, 영상 처리 장치(100)는 영상 수신부(110), 객체 검출부(120), 객체 분석부(130) 및/또는 출력부(미도시)를 포함할 수 있다. 다만, 이는 본 실시예를 설명하기 위해 필요한 일부 구성요소만을 도시한 것일 뿐, 영상 처리 장치(100)에 포함된 구성요소가 전술한 예에 한정되는 것은 아니다. 예컨대, 둘 이상의 구성부가 하나의 구성부 내에서 구현될 수도 있고, 하나의 구성부에서 실행되는 동작이 분할되어 둘 이상의 구성부에서 실행되도록 구현될 수도 있다. 또한, 일부 구성부가 생략되거나 부가적인 구성부가 추가될 수도 있다.Referring to FIG. 1, the image processing apparatus 100 may include an image receiver 110, an object detector 120, an object analyzer 130, and / or an output unit (not shown). However, this only illustrates some components necessary to describe the present embodiment, and the components included in the image processing apparatus 100 are not limited to the above-described example. For example, two or more components may be implemented in one component, or an operation executed in one component may be divided and implemented to be executed in two or more components. In addition, some components may be omitted or additional components may be added.

일 실시 예에 따른 영상 처리 장치(100)는 분석 대상 영상(140)을 수신하고, 인공지능 모델을 이용하여 수신된 분석 대상 영상(140)에 포함되어 있는 하나 이상의 객체를 검출하고, 검출된 객체가 검출 대상 객체에 해당하는지를 판단할 수 있다.The image processing apparatus 100 according to an embodiment receives the analysis target image 140, detects one or more objects included in the received analysis target image 140 using an artificial intelligence model, and detects the detected object. It may be determined whether the corresponds to the detection target object.

영상 수신부(110)는 분석 대상 영상(140)을 수신할 수 있다. 분석 대상 영상(140)은 적어도 하나 이상의 객체를 포함할 수 있다. 예컨대, 분석 대상 영상(140)은 적어도 하나 이상의 객체를 포함하는 물품에 관한 영상일 수 있다. 또한 예컨대, 분석 대상 영상(140)은 X-Ray 판독 기기가 촬영한 물품에 관한 X-Ray 영상일 수 있다. 상기 영상은 X-Ray 영상 기기가 촬영한 로(raw) 이미지이거나 상기 로 이미지를 저장 또는 전송하기 위한 임의의 형태(포맷)의 이미지일 수 있다. 상기 영상은 X-Ray 판독 기기가 촬영하여 모니터와 같은 출력 장치로 전송하는 영상 정보를 캡쳐하여 데이터화함으로써 획득될 수도 있다.The image receiver 110 may receive the analysis target image 140. The analysis target image 140 may include at least one object. For example, the analysis target image 140 may be an image regarding an article including at least one object. Also, for example, the analysis target image 140 may be an X-ray image of an article photographed by the X-ray reader. The image may be a raw image captured by an X-ray imaging device or an image of any form (format) for storing or transmitting the raw image. The image may be obtained by capturing and data-forming image information that is taken by an X-ray reading device and transmitted to an output device such as a monitor.

객체 검출부(120)는 영상 수신부(110)에서 수신된 분석 대상 영상에 포함되어 있는 객체를 검출할 수 있다. 예컨대, 객체 검출부(120)는 인공지능 기술을 이용하여 분석 대상 영상에 포함되어 있는 객체를 검출할 수 있다. 구체적으로, 인공지능 모델은 객체 검출을 위한 딥러닝 모델일 수 있다.The object detector 120 may detect an object included in the analysis target image received by the image receiver 110. For example, the object detector 120 may detect an object included in the analysis target image by using artificial intelligence technology. In detail, the artificial intelligence model may be a deep learning model for object detection.

인공지능 기술은 컴퓨터에게 데이터를 학습시켜 마치 사람처럼 스스로 의사결정을 할 수 있게 하는데, 인공 신경망(artificial neural network)은 생물학의 신경망에서 영감을 얻은 수학적 모델로서, 시냅스의 결합으로 네트워크를 형성한 인공 뉴런이 학습을 통해 시냅스의 결합 세기를 변화시킴으로써 문제 해결 능력을 가지는 모델 전반을 의미할 수 있다. 인공 신경망은 일반적으로 입력층(input layer), 은닉층(hidden layer) 및 출력층(output layer)로 구성되어 있으며 각 층에 포함된 뉴런들이 가중치를 통해 연결되어 있으며, 가중치와 뉴런값의 선형 결합과 비선형 활성화 함수를 통해 인공 신경망은 복잡한 함수를 근사화할 수 있는 형태를 가질 수 있다. 인공 신경망 학습의 목적은 출력층에서 계산된 출력과 실제 출력의 값 차이를 최소화 시키는 가중치를 찾는데 있다.AI technology allows computers to learn data and make decisions as if they were humans. Artificial neural networks are mathematical models inspired by biological neural networks. Neurons can refer to models that have problem-solving abilities by changing the strength of synapses through learning. Artificial neural networks generally consist of an input layer, a hidden layer, and an output layer, and neurons included in each layer are connected by weight, and a linear combination of the weight and neuron values and a nonlinear Through the activation function, the artificial neural network can be shaped to approximate complex functions. The purpose of artificial neural network learning is to find weights that minimize the difference between the output computed at the output layer and the actual output.

심층 신경망(deep neural network)은 입력층과 출력층 사이에 여러 개의 은닉층들로 이루어진 인공 신경망으로서, 많은 은닉층을 통해 복잡한 비선형 관계들을 모델링할 수 있으며, 이처럼 층의 개수를 늘림으로써 고도화된 추상화가 가능한 신경망 구조를 딥러닝(deep learning)이라고 부른다. 딥러닝은 매우 방대한 양의 데이터를 학습하여, 새로운 데이터가 입력될 경우 학습 결과를 바탕으로 확률적으로 가장 높은 답을 선택하기 때문에 영상에 따라 적응적으로 동작할 수 있으며, 데이터에 기초하여 모델을 학습하는 과정에서 특성인자를 자동으로 찾아낼 수 있다.Deep neural networks are artificial neural networks made up of multiple hidden layers between the input and output layers, and many hidden layers can model complex nonlinear relationships. The structure is called deep learning. Deep learning learns a very large amount of data, and when new data is input, it can select adaptively the highest answer based on the learning results, so it can operate adaptively according to the image. As you learn, you can automatically find the feature factors.

본 개시의 딥러닝 기반의 모델은 완전 합성곱 신경망(완전 컨볼루션 뉴럴 네트워크, fully convolutional neural network), 합성곱 신경망(컨볼루션 뉴럴 네트워크, convolutional neural network), 순환 신경망(회귀 뉴럴 네트워크, recurrent neural network), 제한 볼츠만 머신(restricted Boltzmann machine, RBM) 및 심층 신뢰 신경망(deep belief neural network, DBN) 중 적어도 하나를 포함할 수 있으나, 이에 제한되지 않는다. 또는, 딥러닝 이외의 머신 러닝 방법도 포함할 수 있다. 또는 딥러닝과 머신 러닝을 결합한 하이브리드 형태의 모델도 포함할 수 있다. 예컨대, 딥러닝 기반의 모델을 적용하여 영상의 특징을 추출하고, 상기 추출된 특징에 기초하여 영상을 분류하거나 인식할 때는 머신 러닝 기반의 모델을 적용할 수도 있다. 머신 러닝 기반의 모델은 서포트 벡터 머신(Support Vector Machine, SVM), 에이다부스트(AdaBoost) 등을 포함할 수 있으나, 이에 한정되지 않는다.Deep learning-based models of the present disclosure can be used for a full convolutional neural network (fully convolutional neural network), a convolutional neural network (convolutional neural network), a cyclic neural network (recurrent neural network). ), But is not limited to at least one of a restricted Boltzmann machine (RBM) and a deep belief neural network (DBN). Alternatively, the method may include machine learning methods other than deep learning. Or it could include a hybrid model that combines deep learning and machine learning. For example, a feature of an image may be extracted by applying a deep learning based model, and a machine learning based model may be applied when classifying or recognizing an image based on the extracted feature. The machine learning based model may include a support vector machine (SVM), AdaBoost, and the like, but is not limited thereto.

또한, 본 개시의 딥러닝 기반의 모델을 학습하는 방법은 지도 학습(supervised learning), 비지도 학습(unsupervised learning) 또는 강화 학습(reinforcement learning) 중 적어도 하나를 포함할 수 있으나, 이에 제한되지 않는다. 지도 학습은 일련의 학습 데이터와 그에 상응하는 레이블(label, 목표 출력값)을 이용하여 학습이 이루어지며, 지도 학습에 기초한 신경망 모델은 훈련용 데이터(training data)로부터 함수를 추론해내는 형태의 모델일 수 있다. 지도 학습은 일련의 학습 데이터와 그에 상응하는 목표 출력 값을 수신하고, 입력되는 데이터에 대한 실제 출력 값과 목표 출력 값을 비교하는 학습을 통해 오류를 찾아내고, 해당 결과를 근거로 모델을 수정하게 된다. 지도 학습은 결과물의 형태에 따라 다시 회귀(regression), 분류(classification), 검출(detection), 시멘틱 세그멘테이션(semantic segmentation) 등으로 구분될 수 있다. 지도 학습을 통해 도출된 함수는 다시 새로운 결과값을 예측하는데 사용될 수 있다. 이처럼, 지도 학습에 기초한 신경망 모델은 수많은 학습 데이터의 학습을 통해, 신경망 모델의 파라미터를 최적화하게 된다.In addition, the method of learning a deep learning-based model of the present disclosure may include at least one of supervised learning, unsupervised learning, or reinforcement learning, but is not limited thereto. Supervised learning is trained using a series of training data and a corresponding label (target output value). A neural network model based on supervised learning is a model that infers a function from training data. Can be. Supervised learning receives a series of training data and corresponding target output values, and learns to find errors by modifying the model based on the results by learning to compare the actual output values with the target output values. do. Supervised learning can be further divided into regression, classification, detection, semantic segmentation, etc., depending on the type of the result. Functions derived from supervised learning can then be used to predict new outcomes. As such, the neural network model based on supervised learning optimizes the parameters of the neural network model through learning a large number of training data.

객체 검출부(120)는 인공지능 모델을 이용하여 분석 대상 영상에 포함된 하나 이상의 객체가 검출 대상 객체에 해당하는지에 관한 수치 정보를 획득할 수 있다. 검출 대상 객체는 미리 설정될 수 있으며, 인공지능 모델은 상기 검출 대상 객체에 대해 미리 학습될 수 있다. 또한, 검출 대상 객체는 적용되는 분야에 따라 달라질 수 있다. 예컨대, 반입 금지 물품인지를 판단하는 분야, 기업 보안상 침해 물품인지를 판단하는 분야, 의료 분야 등 다양한 종류의 영상을 처리하고 분석하는 분야에 적용될 수 있다. 구체적으로, 검출 대상 객체는 USB, 노트북, 하드 드라이브 등의 저장 매체, 칼, 라이터, 총 등을 포함할 수 있다. 따라서, 본 개시의 영상 처리 장치 및/또는 영상 처리 방법은 입력되는 영상을 분석해서 특정한 객체를 검출할 수 있으며, 기업 보안 시스템 등의 다양한 분야에 적용될 수 있다.The object detector 120 may acquire numerical information regarding whether one or more objects included in the analysis target image correspond to the detection target object by using the artificial intelligence model. The object to be detected may be set in advance, and the artificial intelligence model may be pre-learned with respect to the object to be detected. Also, the object to be detected may vary depending on the field of application. For example, the present invention may be applied to a field for processing and analyzing various kinds of images, such as a field for determining whether a product is prohibited from entering, a field for determining whether a product is infringing on a company security, or a medical field. In detail, the object to be detected may include a storage medium such as a USB, a notebook, a hard drive, a knife, a lighter, a gun, and the like. Accordingly, the image processing apparatus and / or image processing method of the present disclosure may detect a specific object by analyzing an input image and may be applied to various fields such as an enterprise security system.

수치 정보는 하나 이상의 객체를 포함하는 영상에 대해 인공지능 모델을 적용하여 객체를 검출하는 경우, 검출된 객체가 상기 검출 대상 객체에 해당하는지를 나타내는 확률, 점수(score) 또는 분포 등을 의미할 수 있다. 예컨대, 검출 대상 객체가 USB, 칼, 라이터라고 하면 상기 수치 정보는 분석 대상 영상에 포함된 객체가 USB에 해당하는 확률, 칼에 해당하는 확률 및/또는 라이터에 해당하는 확률을 포함할 수 있다.The numerical information may mean a probability, score, or distribution indicating whether the detected object corresponds to the object to be detected when the object is detected by applying an artificial intelligence model to an image including one or more objects. . For example, if the detection target object is a USB, a knife, a lighter, the numerical information may include a probability that the object included in the analysis image corresponds to a USB, a probability corresponding to a knife, and / or a probability corresponding to a lighter.

상기 수치 정보는 인공지능 모델을 이용하여 검출된 객체가 어떤 검출 대상 객체에 해당하는지에 관한 신뢰도 또는 판단 기준을 제공할 수 있다. 예컨대, 객체 검출부(120)는 인공지능 모델의 적용 결과, 상기 객체에 관한 상기 수치 정보가 높은 검출 대상 객체 순서대로 검출 객체가 검출 대상 객체에 해당한다고 판단할 수 있다. 또한, 객체 검출부(120)는 상기 획득된 수치 정보가 제1 임계값 이상인 경우, 상기 하나 이상의 객체를 상기 검출 대상 객체로 판단할 수 있다. 제1 임계값은 검출 대상 객체에 따라 다를 수 있다. 제1 임계값은 기설정되거나 사용자에 의해 입력될 수 있다. 상기 제1 임계값을 이용함으로써 본 개시의 객체 판단 장치는 보다 더 신뢰성을 확보할 수 있다. 또한, 예컨대 제1 임계값은 객체 검출에 관련된 매개변수일 수 있다.The numerical information may provide a reliability or a criterion for determining which detection target object the object detected using the artificial intelligence model corresponds to. For example, as a result of applying the artificial intelligence model, the object detector 120 may determine that the detection object corresponds to the detection object in order of the detection object having the high numerical information about the object. In addition, the object detector 120 may determine the at least one object as the detection target object when the obtained numerical information is equal to or greater than a first threshold value. The first threshold value may vary depending on the object to be detected. The first threshold may be preset or input by a user. By using the first threshold value, the object determining apparatus of the present disclosure can secure more reliability. Also, for example, the first threshold may be a parameter related to object detection.

예컨대, 분석 대상 영상에 포함된 객체가 USB에 해당하는 확률, 칼에 해당하는 확률 및/또는 라이터에 해당하는 확률이 수치 정보에 포함되는 경우, 검출 객체가 USB에 해당하는 확률이 제1 임계값 이상이면 검출된 객체는 USB에 해당한다고 판단될 수 있다. 또한 예컨대, 상기 수치 정보에서 검출 객체가 USB에 해당하는 확률 및 칼에 해당하는 확률이 각각 제1 임계값 이상이고 검출 객체가 USB에 해당하는 확률이 칼에 해당하는 확률보다 큰 경우, 검출된 객체는 우선적으로 USB에 해당한다고 판단될 수 있다.For example, when the numerical information includes the probability that the object included in the analysis target image corresponds to the USB, the probability corresponding to the knife, and / or the probability corresponding to the lighter, the probability that the detected object corresponds to the USB is the first threshold value. If abnormal, the detected object may be determined to correspond to USB. Further, for example, when the probability that the detection object corresponds to the USB and the probability that the knife corresponds in the numerical information is greater than or equal to the first threshold value and the probability that the detection object corresponds to the USB is greater than the probability that the knife corresponds, respectively, the detected object May be determined to correspond to USB first.

또한, 예컨대, 복수의 검출 대상 객체들이 있는 경우, 적어도 일부의 검출 대상 객체는 나머지 다른 검출 대상 객체와 다른 임계값이 적용될 수 있다. 그런데 이 경우, 검출해야 되는 객체들의 수가 많아지면 그에 따라 임계값들의 수도 증가하게 될 것이다. 또한, 적어도 일부의 검출 대상 객체에 적용되는 임계값은 나머지 다른 검출 대상 객체에 적용되는 임계값과의 차이가 클 수도 있다. 이와 같이, 임계값들의 수가 증가하거나 임계값간의 차이가 크면, 임계값들 각각의 값을 조정하거나 관리하는데 어려움이 있을 수 있다. 따라서, 복수의 검출 대상 객체들이 있는 경우에, 예컨대, 하나의 대표적인 임계값이 설정되고, 또한 검출 대상 객체의 수만큼 객체 검출과 관련된 민감도 계수가 각각 설정될 수 있다. 그리고, 상기 설정된 임계값과 각각의 민감도 계수를 곱해서 검출 대상 객체 별 최종 임계값이 설정될 수 있다. 상기 최종 임계값이 검출 대상 객체마다의 제1 임계값이 될 수 있다. 결국, 본 개시의 영상 처리 장치 및/또는 영상 처리 방법은 다중 목적물, 즉 복수의 검출 대상 객체들에 대해 객체 검출의 민감도를 개별적으로 조정할 수 있다. 다른 실시예로서, 분석 대상 영상에 포함된 하나 이상의 객체가 검출 대상 객체에 해당하는지에 관한 상기 수치 정보를 획득할 때, 검출 대상 객체 별로 민감도에 관한 내부 파라미터(예컨대, 계수)를 다르게 설정할 수 있다. 예컨대, USB에 대해서는 내부 파라미터를 2로 하고, 칼에 대해서는 내부 파라미터를 1로 할 수 있다. 그리고, 실제 획득된 수치 정보에 상기 내부 파라미터를 적용함으로써, (예컨대, 곱함으로써) 획득된 수치 정보를 보정할 수 있다. 이와 같이 하면, 동일한 수치 정보가 획득되더라도 최종적으로는 칼보다 USB에 대해서 높은 보정 수치 정보가 획득될 수 있으므로, USB와 칼에 대해 동일한 임계값을 이용하더라도, USB에 대한 검출 민감도를 높일 수 있다.Also, for example, when there are a plurality of detection target objects, at least some of the detection target objects may have different thresholds than the other detection target objects. In this case, however, as the number of objects to be detected increases, the number of thresholds increases accordingly. In addition, the threshold value applied to at least some of the detection target objects may be different from the threshold value applied to the other detection target objects. As such, when the number of thresholds increases or the difference between the thresholds is large, it may be difficult to adjust or manage the value of each of the thresholds. Therefore, when there are a plurality of objects to be detected, for example, one representative threshold value may be set, and the sensitivity coefficients related to object detection may be set by the number of objects to be detected, respectively. The final threshold for each object to be detected may be set by multiplying the set threshold by the sensitivity coefficient. The final threshold may be a first threshold for each object to be detected. As a result, the image processing apparatus and / or the image processing method of the present disclosure may individually adjust the sensitivity of object detection with respect to multiple objects, that is, a plurality of detection target objects. As another embodiment, when acquiring the numerical information on whether one or more objects included in the analysis target image correspond to the detection target object, an internal parameter (eg, coefficient) related to sensitivity may be differently set for each detection target object. . For example, an internal parameter can be set to 2 for USB and an internal parameter can be set to 1 for a knife. In addition, by applying the internal parameter to the actually obtained numerical information, the obtained numerical information may be corrected (eg, by multiplying). In this way, even if the same numerical information is obtained, the higher correction numerical information can be finally obtained for the USB than the knife, so that even if the same threshold value is used for the USB and the knife, the detection sensitivity for the USB can be increased.

또한, 객체 검출부(120)는 상기 검출된 객체가 검출 대상 객체에 해당하는지에 관한 판단 결과를 출력할 수 있다.In addition, the object detector 120 may output a determination result regarding whether the detected object corresponds to a detection target object.

또한, 객체 검출부(120)는 상기 제1 임계값을 조정할 수 있다. 상기 조정은 기설정되어 있거나 사용자에 의해 입력될 수 있다. 예컨대, 상기 제1 임계값을 조정함으로써, 영상에 포함된 객체가 검출 대상 객체에 해당한다고 판단될 가능성, 즉, 객체 검출의 민감도를 조정할 수 있다. In addition, the object detector 120 may adjust the first threshold value. The adjustment may be preset or entered by the user. For example, by adjusting the first threshold value, it is possible to adjust the possibility that the object included in the image corresponds to the detection target object, that is, the sensitivity of the object detection.

또는, 객체 검출부(120)는 상기 수치 정보의 획득과 관련된 상기 인공지능 모델 내부의 파라미터를 조정함으로써, 객체 검출의 민감도를 조정할 수 있다. 상기 인공지능 모델 내부의 파라미터를 조정하면 하나의 입력 영상에 대해 복수의 서로 다른 수치 정보를 획득할 수 있다. 상기 조정은 기설정되어 있거나 사용자에 의해 입력될 수 있다. 예컨대, 인공지능 모델 내부의 파라미터는 합성곱 신경망에서 합성곱 계층 및/또는 풀링 계층의 필터 커널들 내부의 파라미터일 수 있다. Alternatively, the object detector 120 may adjust the sensitivity of the object detection by adjusting a parameter inside the artificial intelligence model related to the acquisition of the numerical information. By adjusting parameters inside the artificial intelligence model, a plurality of different numerical information may be obtained with respect to one input image. The adjustment may be preset or entered by the user. For example, the parameter inside the artificial intelligence model may be a parameter inside filter kernels of the convolutional layer and / or the pooling layer in the convolutional neural network.

전술한 바와 같이, 인공지능 모델 내부의 파라미터를 조정함으로써 수치 정보를 조정하거나 또는 제1 임계값을 조정함으로써 검출된 객체가 검출 대상 객체에 해당하는지에 관한 판단 정도(민감도)를 조정할 수 있다. 결국, 본 개시의 영상 처리 장치는 객체 검출 과정에서의 검출 민감도를 조정하기 위해 인공지능 모델 내부의 파라미터를 조정하거나 제1 임계값을 조정할 수 있다.As described above, it is possible to adjust the degree of determination (sensitivity) as to whether the detected object corresponds to the object to be detected by adjusting numerical information by adjusting parameters inside the artificial intelligence model or adjusting the first threshold value. As a result, the image processing apparatus of the present disclosure may adjust a parameter inside the artificial intelligence model or adjust the first threshold in order to adjust the detection sensitivity in the object detection process.

한편, 의료 영상 분야의 경우, 일반적으로 의사들은 X-Ray 의료 영상에 나타난 여러 가지 특징들에 기초하여 환자들에 대한 진단을 한다. 예컨대, 폐 영상에 나타난 내부 기관들의 여러 가지 특징들에 기초하여 환자의 폐 병변 여부가 진단될 수 있다. 의료 영상에 나타난 영역들을 명확하게 구분할 수 있다면 보다 정확한 진단을 내릴 수 있을 것이다. 따라서, 상기 의료 영상에서 영역들을 구분하고 검출할 필요가 있다. 즉, 의료 영상에는 일반적으로 USB, 칼 등의 별개의 객체들이 포함되어 있는 것은 아니나, 내부 장기나 조직, 여러 기관들이 혼재해 있기 때문에 이들을 명확하게 구분할 필요가 있다. 예컨대, 폐 영상의 경우 폐, 에어(air), 흉막, 횡경막 등의 기관들을 보다 명확하게 검출할 필요가 있다. 또는, 폐 영상에는 경화(consolidation), 간질혼탁(interstitial), 혹(nodule), 덩어리(mass), 폐확장부전(atelectasis) 등으로 판단될 수 있는 다양한 영상 내 객체가 존재할 수 있는데, 본 개시에 따라, 이러한 폐 영상 내 객체 검출의 민감도를 조절할 수도 있다.On the other hand, in the medical imaging field, doctors generally diagnose patients based on various features shown in X-ray medical images. For example, the patient may be diagnosed as a lung lesion based on various features of internal organs shown in the lung image. If you can clearly distinguish the areas on the medical image, you will be able to make a more accurate diagnosis. Therefore, it is necessary to distinguish and detect regions in the medical image. In other words, medical images do not generally include separate objects such as USB and knife, but internal organs, tissues, and organs are mixed, so it is necessary to clearly distinguish them. For example, in the case of lung imaging, it is necessary to more clearly detect organs such as lungs, air, pleura, and diaphragm. Alternatively, lung images may include objects in various images that can be determined as consolidation, interstitial, nodule, mass, pulmonary dilatation, and the like. Accordingly, the sensitivity of object detection in the lung image may be adjusted.

상기에서는 폐 영상을 예로 들었으나, 이에 한정되지 않는다. 즉, 영상에 기초하여 진단하거나 병변을 검출하는 다양한 의료 영상을 입력으로 하여, 본 개시의 발명을 적용함으로써, 의료 영상 내 객체 검출의 민감도를 조절하는 것도 가능하다. 나아가, 영상이 이용되는 분야와 용도를 불문하고, 입력 영상으로부터 복수의 객체를 검출하고자 하는 모든 분야에 본 개시의 발명이 적용될 수도 있다.In the above, the lung image is taken as an example, but is not limited thereto. That is, it is also possible to adjust the sensitivity of object detection in the medical image by applying the invention of the present disclosure by inputting various medical images for diagnosis or detection of lesions based on the image. Furthermore, the invention of the present disclosure may be applied to any field in which a plurality of objects are to be detected from an input image, regardless of the field and use of the image.

본 개시의 영상 처리 장치 및/또는 영상 처리 방법은 인공지능 모델을 이용하여 의료 영상에 포함된 하나 이상의 영역이 검출 대상 영역에 해당하는지에 관한 수치 정보를 획득할 수 있다. 검출 대상 영역은 미리 설정될 수 있으며, 인공지능 모델은 상기 검출 대상 영역에 대해 미리 학습될 수 있다. 또한, 본 개시의 영상 처리 장치 및/또는 영상 처리 방법은 인공지능 모델 내부의 파라미터를 조정함으로써 수치 정보를 조정하거나 또는 제1 임계값을 조정함으로써 검출된 영역이 검출 대상 영역에 해당하는지에 관한 판단 정도(민감도)를 조정할 수 있다. 검출 대상 영역은 의료 영상의 종류에 따라 달라질 수 있다. 검출 대상 영역은 병변이 발생하였는지를 판단하는 것과 관련된 영역일 수 있다. 예컨대, 폐 영상의 경우 검출 대상 영역은 폐, 에어, 흉막, 횡경막 등의 기관들을 포함할 수 있다. The image processing apparatus and / or the image processing method of the present disclosure may acquire numerical information regarding whether one or more regions included in the medical image correspond to a detection target region by using an artificial intelligence model. The detection target region may be set in advance, and the artificial intelligence model may be trained on the detection target region in advance. Also, the image processing apparatus and / or the image processing method of the present disclosure may determine whether the detected region corresponds to the detection target region by adjusting numerical information by adjusting a parameter inside the artificial intelligence model or by adjusting a first threshold value. You can adjust the degree of sensitivity. The detection target area may vary depending on the type of medical image. The detection subject area may be an area related to determining whether a lesion has occurred. For example, in the case of a lung image, the detection target area may include organs such as lung, air, pleura, and diaphragm.

상기 객체 검출부의 동작은 입력 영상이 의료 영상인 경우에도 적용될 수 있다. 객체 분석부(130)는, 객체 검출부(120)에서 검출 대상 객체로 판단된 하나 이상의 객체에 대해 해당 객체가 상기 검출 대상 객체에 해당하는지의 여부를 2차적으로 판단할 수 있다.The operation of the object detector may be applied even when the input image is a medical image. The object analyzer 130 may secondarily determine whether the corresponding object corresponds to the detection target object with respect to one or more objects determined by the object detection unit 120 as the detection target object.

예컨대, 상기 제1 임계값이 상대적으로 큰 경우일 때의 출력 결과는 상기 제1 임계값이 그보다 작은 경우일 때의 출력 결과보다 신뢰성 있는 검출 결과라고 판단될 수 있다. 따라서, 상기 제1 임계값이 상대적으로 작은 경우일 때의 검출된 객체는, 검출 대상 객체에 해당하는지 아니면 상기 검출 대상 객체와 형태적 유사성 등의 이유로 검출된 것으로서 상기 검출 대상 객체에 해당하지 않는지에 관하여 이차적으로 판단할 필요가 있다. 즉, 출력 결과에 대한 과검출 여부 판단이 요구될 수 있다.For example, the output result when the first threshold is relatively large may be determined to be a more reliable detection result than the output result when the first threshold is smaller. Therefore, whether the detected object when the first threshold value is relatively small corresponds to the object to be detected or is detected because of a form similarity to the object to be detected or does not correspond to the object to be detected. We need to make a second judgment about that. That is, it may be required to determine whether or not overdetect the output result.

예컨대, 상기 2차 검출 단계는, 객체 검출부(120)에서 상기 검출 대상 객체로 판단된 하나 이상의 객체로서, 제2 임계값 이하의 수치 정보를 갖는 객체에 대해서만 수행될 수 있다. 제2 임계값은 검출 대상 객체에 따라 다를 수 있다. 제2 임계값은 기설정되거나 사용자에 의해 입력될 수 있다.For example, the secondary detection step may be performed only on an object having one or more objects determined as the detection target object by the object detection unit 120 and having numerical information less than or equal to a second threshold. The second threshold may vary depending on the object to be detected. The second threshold may be preset or input by the user.

객체 분석부(130)는 영상 매칭 기법 및/또는 영상 인식 기법을 이용할 수 있다. 영상 매칭 기법은 입력 영상과 참조 영상간의 특징점을 추출하여 영상간 유사도를 판단하는 방법일 수 있다. 예컨대, 기업 보안상 반입 금지 물품에 관한 X-ray 영상이 있는 경우처럼 검출 대상 객체에 관한 영상을 미리 확보하고 있는 경우, 상기 검출 대상 객체에 관한 기확보된 영상이 참조 영상으로서 이용될 수 있다. 한편, 영상 인식 기법은 인식(recognition) 또는 분류(classification) 분야에서 일반적으로 이용되는 다양한 방법들이 이용될 수 있다.The object analyzer 130 may use an image matching technique and / or an image recognition technique. The image matching technique may be a method of determining similarity between images by extracting feature points between the input image and the reference image. For example, when an image of the object to be detected is secured in advance, such as when there is an X-ray image of an item forbidden to enter the company security, the previously obtained image of the object to be detected may be used as a reference image. Meanwhile, the image recognition technique may use various methods generally used in the field of recognition or classification.

객체 분석부(130)는 참조 영상의 유무에 따라 적용 기법을 다르게 적용할 수 있다. 예컨대, 참조 영상이 있는 경우 영상 매칭 기법이나 영상 인식 기법이 적용될 수 있을 것이다. 반면, 참조 영상이 없는 경우에는 입력 영상과 참조 영상간 특징점을 추출 및 비교할 수 없기 때문에 영상 매칭 기법이 적용되기 어려울 수 있다. 또한, 참조 영상이 없는 경우 영상 인식 기법이 적용되기 위해서는 입력 영상을 인식하고 분류할 수 있는 학습된 인공지능 모델이 필요할 수 있다. 예컨대, 상기 인공지능 모델은 객체 검출부(120)에서 이용되는 모델과는 다른 모델일 수 있다.The object analyzer 130 may apply an application technique differently according to the presence or absence of a reference image. For example, if there is a reference image, an image matching technique or an image recognition technique may be applied. On the other hand, in the absence of a reference picture, it may be difficult to apply an image matching technique because feature points between the input picture and the reference picture cannot be extracted and compared. In addition, when there is no reference image, in order to apply an image recognition technique, a trained artificial intelligence model capable of recognizing and classifying an input image may be required. For example, the AI model may be a different model from the model used in the object detector 120.

객체 분석부(130)에서의 이차적인 판단 결과, 객체 검출부(120)에서 판단된 결과와 다른 경우, 본 개시의 영상 처리 장치는 상기 수치 정보에 기초하여 검출 객체가 다른 검출 대상 객체에 해당하는지에 관하여 다시 판단할 수 있다.If the result of the secondary determination by the object analyzer 130 is different from the result determined by the object detector 120, the image processing apparatus of the present disclosure may determine whether the detected object corresponds to another detected object based on the numerical information. You can judge again.

예컨대, 검출 객체가 USB에 해당하는 확률, 칼에 해당하는 확률 및/또는 라이터에 해당하는 확률이 수치 정보에 포함되고, 상기 기재된 순서대로 확률값을 가지는 경우, 객체 검출부(120)는 검출 객체를 USB에 해당한다고 일차적으로 판단할 수 있다. 또한, 객체 분석부(130)는 검출 객체가 USB에 해당하는지에 관하여 이차적으로 판단할 수 있다. 객체 분석부(130)에서의 이차적인 판단 결과, 검출 객체가 USB에 해당하지 않은 것으로 판단이 된 경우, 객체 분석부(130)는 이차적인 판단 결과를 최종 결과로서 출력할 수도 있고, 수치 정보에 기초하여 검출 객체가 USB 다음으로 높은 확률값을 가지는(즉, USB를 제외하고 가장 높은 확률값을 가지는) 칼에 해당하는지를 추가적으로 판단할 수 있다. 상기의 추가적인 판단을 함으로써 검출 객체가 검출 대상 객체에 해당하는지 보다 정확하게 판단될 수 있다.For example, when a detection object has a probability corresponding to USB, a probability corresponding to a knife, and / or a probability corresponding to a writer in numerical information, and has a probability value in the order described above, the object detection unit 120 may detect the detected object as USB. It can be determined primarily that In addition, the object analyzer 130 may secondarily determine whether the detected object corresponds to the USB. As a result of the secondary determination by the object analyzer 130, when it is determined that the detected object does not correspond to the USB, the object analyzer 130 may output the secondary determination result as a final result, Based on this, it may be further determined whether the detection object corresponds to a knife having the next highest probability value after the USB (that is, having the highest probability value except USB). By performing the above additional determination, it may be determined more accurately whether the detection object corresponds to the detection object.

또한, 상기 객체 분석부의 동작은 입력 영상이 의료 영상인 경우에도 적용될 수 있다. 이 경우, 검출 대상 객체는 검출 대상 영역을 의미할 수 있다. 예컨대, 본 개시의 영상 처리 장치 및/또는 영상 처리 방법은 검출 대상 영역으로 판단된 하나 이상의 영역에 대해 해당 영역이 상기 검출 대상 영역에 해당하는지의 여부를 2차적으로 판단할 수 있다.In addition, the operation of the object analyzer may be applied when the input image is a medical image. In this case, the detection target object may mean a detection target region. For example, the image processing apparatus and / or the image processing method of the present disclosure may secondly determine whether the corresponding area corresponds to the detection target area with respect to one or more areas determined as the detection target area.

도 2는 본 개시의 일 실시 예에 따른 영상에 포함된 객체가 검출 대상 객체에 해당하는지를 판단하는 과정을 나타내는 도면이다.2 is a diagram illustrating a process of determining whether an object included in an image corresponds to a detection object according to an exemplary embodiment.

입력 영상(210)은 임의의 물체를 X-Ray 검색 장치에 통과시킴으로써 획득된 X-ray 영상일 수 있다. 예컨대, 입력 영상(210)은 적어도 하나 이상의 검출 대상 객체를 포함하는 영상일 수 있다. 예컨대, 검출 대상 객체는 칼, 라이터, USB, 총 등의 기업 보안상 반입 금지 물품을 포함할 수 있다.The input image 210 may be an X-ray image obtained by passing an arbitrary object through the X-ray search apparatus. For example, the input image 210 may be an image including at least one or more detection target objects. For example, the object to be detected may include a prohibited item for corporate security, such as a knife, a lighter, a USB, a gun, or the like.

본 개시의 영상 처리 장치는 입력 영상(210)에 대해 인공지능 모델을 이용하여 객체 검출을 수행할 수 있다. 또한, 본 개시의 영상 처리 장치는 상기 검출된 객체가 검출 대상 객체에 해당하는지를 판단할 수 있다. 또한, 본 개시의 영상 처리 장치는 상기 입력 영상에서 검출된 객체의 상기 수치 정보가 제1 임계값 이상인 경우, 상기 검출 객체를 상기 검출 대상 객체로 판단할 수 있다. 또한, 본 개시의 영상 처리 장치는 상기 검출 대상 객체로 판단된 객체를 출력할 수 있다.The image processing apparatus of the present disclosure may perform object detection on an input image 210 using an artificial intelligence model. Also, the image processing apparatus of the present disclosure may determine whether the detected object corresponds to a detection object. The image processing apparatus of the present disclosure may determine the detection object as the detection target object when the numerical information of the object detected in the input image is equal to or greater than a first threshold value. Also, the image processing apparatus of the present disclosure may output an object determined as the detection target object.

예컨대, 본 개시의 영상 처리 장치는 상기 검출 대상 객체로 판단된 객체 영역을 입력 영상에 표시하여, 결과 영상을 생성할 수 있다. 제1 결과 영상(220), 제2 결과 영상(230) 및 제3 결과 영상(240)은 각각 제1 임계값 이상을 가지는 객체 영역들을 입력 영상(210)에 사각형 박스로 표시한 영상일 수 있다. 여기서, 제1 결과 영상(220), 제2 결과 영상(230) 및 제3 결과 영상(240)에 적용된 제1 임계값은 각각 다를 수 있다. 예컨대, 제1 임계값의 크기는 제1 결과 영상(220), 제2 결과 영상(230) 및 제3 결과 영상(240) 순일 수 있다. 도 2를 참조하면, 제1 결과 영상(220)에는 칼에 해당되는 것으로 검출된 제1 영역(222)이 표시될 수 있다. 또한, 제2 결과 영상(230)에는 제1 영역(222) 이외에 USB에 해당되는 것으로 검출된 제2 영역(224)이 추가적으로 표시될 수 있다. 또한, 제3 결과 영상(240)에는 제1 영역(222) 및 제2 영역(224) 이외에 USB에 해당되는 것으로 검출된 제3 영역(226)이 추가적으로 표시될 수 있다.For example, the image processing apparatus of the present disclosure may display an object region determined as the detection target object on an input image, and generate a resultant image. The first result image 220, the second result image 230, and the third result image 240 may be images in which object areas having a first threshold value or more are displayed on the input image 210 in a rectangular box. . Here, the first threshold values applied to the first result image 220, the second result image 230, and the third result image 240 may be different. For example, the magnitude of the first threshold value may be in order of the first result image 220, the second result image 230, and the third result image 240. Referring to FIG. 2, the first result image 220 may display a first area 222 detected as a knife. In addition, the second result image 230 may additionally display a second area 224 detected to correspond to the USB in addition to the first area 222. In addition, in addition to the first area 222 and the second area 224, the third result image 240 may further display a third area 226 detected as being corresponding to the USB.

또한, 본 개시의 영상 처리 장치는 상기 객체 영역을 영상에 표시하기 위해 우선 검출 객체의 위치를 결정할 수 있다. 구체적으로, 본 개시의 영상 처리 장치는 객체 영역을 둘러싸는 사각형 박스(bounding box)를 특정하고, 특정된 사각형 박스에 기초하여, 검출된 객체의 위치 정보를 생성할 수 있다. 객체의 위치 정보는 사각형 박스를 형성하는 네 개의 꼭지점의 위치 정보일 수 있으나, 이에 한정되지 않는다. 예컨대, 위치 정보는 사각형 박스의 하나의 꼭지점의 좌표 (x, y) 및 사각형 박스의 가로 길이(width), 세로 길이(height)에 의해 표현될 수도 있다. 상기 하나의 꼭지점의 좌표 (x, y)는 사각형 박스(262)의 좌측 상단 꼭지점의 좌표일 수 있다. 상기 꼭지점의 좌표 (x, y)는 입력 영상의 좌측 상단 꼭지점의 좌표 (0, 0)을 기준으로 특정될 수 있다.In addition, the image processing apparatus of the present disclosure may first determine the position of the detection object in order to display the object area on the image. In detail, the image processing apparatus of the present disclosure may specify a bounding box surrounding the object area and generate location information of the detected object based on the specified rectangular box. The location information of the object may be location information of four vertices forming a rectangular box, but is not limited thereto. For example, the location information may be represented by the coordinates (x, y) of one vertex of the rectangle box and the width and height of the rectangle box. The coordinate (x, y) of the one vertex may be the coordinate of the upper left corner of the rectangular box 262. The coordinate (x, y) of the vertex may be specified based on the coordinate (0, 0) of the upper left corner of the input image.

상술한 바와 같이 수치 정보는 검출 객체가 검출 대상 객체에 해당하는지를 나타내는 확률일 수 있다. 인공지능 모델의 성능이 보장될 수 있는 수준이라고 할 때, 제1 임계값이 클수록 본 개시의 영상 처리 장치에서 검출 대상 객체라고 판단된 객체는 올바르게 판단된 것이라고 볼 수 있다. 즉, 객체 검출에 관한 신뢰도의 관점에서 상기 제1 임계값이 상대적으로 작을 때 검출된 객체의 경우에는 상기 객체가 검출 대상 객체에 부합하는지에 대해 이차적으로 판단될 필요가 있다.As described above, the numerical information may be a probability indicating whether the detection object corresponds to the detection target object. When the performance of the artificial intelligence model is guaranteed, it can be seen that the object determined as the detection target object in the image processing apparatus of the present disclosure is correctly determined as the first threshold value increases. That is, in the case of an object detected when the first threshold is relatively small in terms of reliability regarding object detection, it is necessary to secondarily determine whether the object corresponds to the object to be detected.

예컨대, 도 2를 참조하면, 제1 임계값의 크기가 가장 작은 경우일 때의 결과 영상인 제3 결과 영상(240)에 표시된 제1 영역(222), 제2 영역(224) 및 제3 영역(226)은 객체 검출에 관한 신뢰도의 관점에서 상기 객체가 검출 대상 객체에 부합하는지에 대해 이차적으로 판단될 필요가 있다. 그런데, 제1 영역(222) 및 제2 영역(224)은 제1 결과 영상(220) 및/또는 제2 결과 영상(230)에 표시된 영역들이고, 제3 영역(226)은 제1 결과 영상(220) 또는 제2 결과 영상(230)에는 표시되지 않은 영역이다. 따라서, 제1 영역(222) 및 제2 영역(224)은 제3 영역(226)과 비교하여 상대적으로 올바르게 판단된 영역이라 볼 수 있다.For example, referring to FIG. 2, the first region 222, the second region 224, and the third region displayed on the third result image 240, which is the result image when the first threshold value is the smallest. 226 needs to be secondarily determined whether the object corresponds to the object to be detected in view of the reliability regarding object detection. However, the first area 222 and the second area 224 are areas displayed on the first result image 220 and / or the second result image 230, and the third area 226 is the first result image ( 220 or the second result image 230 is not displayed. Accordingly, the first region 222 and the second region 224 may be regarded as relatively correct regions compared to the third region 226.

따라서, 본 개시의 영상 처리 장치는 상기 제3 영역(226)에 대해 제3 영역(226) 상의 객체가 검출 대상 객체에 해당하는지에 관하여 추가적인 분석을 수행할 수 있다. 예컨대, 본 개시의 영상 처리 장치는 제3 결과 영상(240)에서 추가적인 분석이 필요한 제3 영역(226)을 자르고(crop), 상기 자른 영역(250)에 대해 다양한 형태의 영상 매칭 기법이나 영상 인식 기법들을 적용할 수 있다. 한편, 분석 기법에 따라 검출된 객체를 분석할 수 있는 미리 학습된 인공지능 모델이 필요할 수 있다. 예컨대, 검출 대상 영상이 없는 경우, 딥러닝 모델을 이용하여 검출된 객체를 특정 객체라고 인식하거나 특정 객체로 분류하기 위해서는, 상기 특정 객체를 인식하거나 분류할 수 있는 미리 학습된 딥러닝 모델이 필요할 수 있다.Therefore, the image processing apparatus of the present disclosure may perform additional analysis on the third region 226 as to whether the object on the third region 226 corresponds to the detection target object. For example, the image processing apparatus of the present disclosure crops the third region 226 that requires further analysis in the third result image 240, and recognizes various types of image matching techniques or images for the cropped region 250. The techniques can be applied. Meanwhile, a pre-learned AI model capable of analyzing the detected object may be needed according to an analysis technique. For example, when there is no image to be detected, a pre-learned deep learning model capable of recognizing or classifying the specific object may be required to recognize or classify the detected object as a specific object using the deep learning model. have.

본 개시의 영상 처리 장치는 제3 영역(226)에 대해 추가적인 분석을 통해 상기 영역 상의 객체가 USB가 아니라고 최종적으로 판단할 수 있다. 또한, 본 개시의 영상 처리 장치는 상기 판단 결과를 출력할 수 있다. 예컨대, 본 개시의 영상 처리 장치는 최종 판단 결과, 검출 대상 객체로 판단된 객체 영역을 입력 영상에 표시할 수 있다. 제4 결과 영상(260)은 제3 영역(226)은 제외하고 제1 영역(222) 및 제2 영역(224)을 입력 영상(210)에 사각형 박스로 표시한 영상이다.The image processing apparatus of the present disclosure may finally determine that an object on the region is not USB through additional analysis of the third region 226. In addition, the image processing apparatus of the present disclosure may output the determination result. For example, the image processing apparatus of the present disclosure may display an object region determined as the detection target object on the input image as a result of the final determination. The fourth resultant image 260 is an image in which the first region 222 and the second region 224 are displayed in a rectangular box on the input image 210 except for the third region 226.

다른 실시 예로서, 본 개시의 영상 처리 장치는, 객체 검출의 민감도를 조정하기 위해, 상기 수치 정보의 획득과 관련된 상기 인공지능 모델 내부의 파라미터를 조정할 수 있다. 상기 인공지능 모델 내부의 파라미터를 조정하면 하나의 입력 영상에 대해 복수의 서로 다른 수치 정보를 획득할 수 있다. 상기 조정은 기설정되어 있거나 사용자에 의해 입력될 수 있다. 한편, 도 2에서 제1 결과 영상(220), 제2 결과 영상(230) 및 제3 결과 영상(240)은 제1 임계값 대신 상기 수치 정보를 조정함으로써 획득될 수 있다.As another embodiment, the image processing apparatus of the present disclosure may adjust a parameter inside the artificial intelligence model related to the acquisition of the numerical information in order to adjust the sensitivity of the object detection. By adjusting parameters inside the artificial intelligence model, a plurality of different numerical information may be obtained with respect to one input image. The adjustment may be preset or entered by the user. Meanwhile, in FIG. 2, the first result image 220, the second result image 230, and the third result image 240 may be obtained by adjusting the numerical information instead of the first threshold value.

본 개시의 영상 처리 장치는 객체 검출 시 제1 임계값 및/또는 수치 정보의 획득과 관련된 상기 인공지능 모델 내부의 파라미터를 조정할 수 있다. 또한, 본 개시의 제1 임계값 및/또는 수치 정보의 획득과 관련된 상기 인공지능 모델 내부의 파라미터는 자동적으로 조정될 수 있다. 상기 조정은 기설정되어 있거나 사용자에 의해 입력될 수 있다. 이와 같이, 자동적으로 제1 임계값 및/또는 수치 정보의 획득과 관련된 상기 인공지능 모델 내부의 파라미터를 조절함으로써 그에 대응되는 검출 결과를 생성할 수 있으며, 생성된 검출 결과에 대해 추가적인 객체 분석을 수행할 수 있다. 따라서, 본 개시의 영상 처리 장치는 단순히 일차적인 객체 검출 결과만으로 객체 분석을 완료하는 것이 아니라, 해당 결과를 선별하고, 선별된 결과에 대해 추가적인 객체 인식 기법 등을 적용함으로써 보다 안정적인 인공지능의 성능을 보장할 수 있다.The image processing apparatus of the present disclosure may adjust a parameter inside the artificial intelligence model related to the acquisition of the first threshold value and / or numerical information when the object is detected. In addition, the parameters within the AI model related to the acquisition of the first threshold and / or numerical information of the present disclosure may be automatically adjusted. The adjustment may be preset or entered by the user. As such, by automatically adjusting a parameter in the artificial intelligence model related to the acquisition of the first threshold value and / or numerical information, a detection result corresponding thereto may be generated, and further object analysis may be performed on the generated detection result. can do. Therefore, the image processing apparatus of the present disclosure does not simply complete the object analysis based on the primary object detection result, but selects the result and applies an additional object recognition technique to the selected result to provide more stable performance of artificial intelligence. I can guarantee it.

또한, 본 개시의 영상 처리 장치는 객체를 포함하는 영상으로부터 다중 클래스에 대한 검출 및/또는 2차적인 검증을 수행하는 인공지능 모델을 제공할 수 있다. 예컨대, 다중 클래스에 대한 검출은 인공지능 모델 내부의 파라미터를 조정하거나 제1 임계값을 조정하여 복수의 검출 대상 객체들을 검출하는 것을 의미할 수 있다.In addition, the image processing apparatus of the present disclosure may provide an artificial intelligence model that performs detection and / or secondary verification for multiple classes from an image including an object. For example, detection of multiple classes may mean adjusting a parameter inside the AI model or adjusting a first threshold to detect a plurality of detection target objects.

한편, 본 개시의 영상 처리 장치는 영상에 포함된 객체를 검출하거나, 분류하거나 관심 객체의 위치를 찾아내기 위해 분석 대상 영상의 특징을 추출하고, 추출된 특징에 기초하여 맥락 정보를 생성하고, 추출된 특징 및 생성된 맥락 정보를 이용할 수 있다.Meanwhile, the image processing apparatus of the present disclosure extracts a feature of an analysis target image to detect, classify, or find a location of an object of interest, generates context information based on the extracted feature, and extracts an object included in the image. Features and generated contextual information can be used.

도 3은 본 개시의 일 실시 예에 따른 영상의 맥락 정보를 생성하고 분석하는 과정을 나타내는 도면으로서, 본 개시의 영상 처리 장치는 특징 추출부(310), 맥락 생성부(320) 및/또는 특징 및 맥락 분석부(330)를 포함할 수 있다. 3 is a diagram illustrating a process of generating and analyzing context information of an image according to an embodiment of the present disclosure. The image processing apparatus of the present disclosure may include a feature extractor 310, a context generator 320, and / or a feature. And a context analyzer 330.

구체적으로, 특징 추출부(310)는 입력 영상(312)을 이용하여 입력 영상(312)으로부터 특징을 추출하고, 추출된 특징 정보를 포함하는 특징 영상(314)을 생성할 수 있다. 상기 추출된 특징은 입력 영상의 국소 영역에 대한 특징일 수 있다. 상기 입력 영상(312)은 영상 분석 장치의 입력 영상 또는 합성곱 신경망 모델 내의 각 계층에서의 특징맵을 포함할 수 있다. 또한 상기 특징 영상(314)은 입력 영상(312)에 대해 합성곱 신경망 기법 및/또는 풀링 기법을 적용하여 획득된 특징맵 및/또는 특징 벡터를 포함할 수 있다. 맥락 생성부(320)는 특징 추출부(310)에서 추출된 특징 영상(314)에 대해 합성곱 신경망 기법 및/또는 풀링 기법을 적용하여 맥락 정보를 생성할 수 있다. 예컨대, 상기 맥락 정보는 분석 대상 영상의 전체 또는 일부 영역을 나타내는 대표값일 수 있다. 또한 상기 맥락 정보는 입력 영상의 전역적인 맥락 정보일 수 있다. 예컨대, 맥락 생성부(320)는 풀링의 간격(stride)을 다양하게 조절함으로써 영상 전체, 4등분 영역, 9등분 영역 등의 다양한 크기(scale)의 맥락 정보를 생성할 수 있다. 예컨대, 영상 전체 크기의 영상에 대한 맥락 정보를 포함하는 전체 맥락 정보 영상(322), 영상 전체를 4등분한 크기의 4등분 영상에 대한 맥락 정보를 포함하는 4등분 맥락 정보 영상(324) 및 영상 전체를 9등분한 크기의 9등분 영상에 대한 맥락 정보를 포함하는 9등분 맥락 정보 영상(326)이 획득될 수 있다. 특징 및 맥락 분석부(330)는 상기 특징 영상(314)과 상기 맥락 정보 영상(322, 324, 326)을 모두 이용하여 분석 대상 영상의 특정 영역에 대한 분석을 보다 정확히 수행할 수 있다.In detail, the feature extractor 310 may extract a feature from the input image 312 using the input image 312 and generate a feature image 314 including the extracted feature information. The extracted feature may be a feature of a local region of the input image. The input image 312 may include a feature map of each layer in the input image or the composite product neural network model of the image analyzing apparatus. Also, the feature image 314 may include a feature map and / or a feature vector obtained by applying a convolutional neural network technique and / or a pooling technique to the input image 312. The context generator 320 may generate context information by applying a convolutional neural network technique and / or a pooling technique to the feature image 314 extracted by the feature extractor 310. For example, the context information may be a representative value representing all or a partial region of the analysis target image. In addition, the context information may be global context information of the input image. For example, the context generator 320 may generate context information of various scales, such as the entire image, the quadrant region, and the ninth region, by variously adjusting the pooling interval. For example, a full contextual information image 322 including contextual information about an image of an image full size, a quadrant contextual information image 324 and an image including contextual information about a quadrant image having a size that is divided into quarters of the entire image. A ninth contextual information image 326 including contextual information about a ninth-segment image of a total size divided by nine may be obtained. The feature and context analyzer 330 may more accurately analyze a specific region of the analysis target image using both the feature image 314 and the context information image 322, 324, and 326.

예컨대, 자동차(car)와 유사한 형태를 갖는 보트(boat)가 포함된 영상이 입력 영상인 경우, 특징 추출부(310)가 추출한 국소적인 특징을 포함하는 특징 영상(314)으로부터는 상기 식별된 객체가 자동차인지 보트인지 정확히 판단할 수 없다. 즉, 특징 추출부(310)는 국소적인 특징에 기초하여 객체의 형상을 인식할 수 있으나, 해당 객체의 형상만 가지고는 정확히 객체를 식별하고 분류할 수 없는 경우가 있다. 맥락 생성부(320)는 상기 분석 대상 영상 또는 상기 특징 영상(214)에 기초하여 맥락 정보(322, 324, 326)를 생성함으로써, 보다 정확히 객체를 식별하고 분류할 수 있다. 예컨대, 전체 영상에 대해 추출된 특징이 “자연 경관”으로 인식 또는 분류되고, 4등분 영상에 대해 추출된 특징이 “호수”로 인식 또는 분류되고, 9등분 영상에 대해 추출된 특징이 “물”로 인식 또는 분류되는 경우, 상기 추출된 특징인 “자연 경관”, “호수”, “물”을 맥락 정보로서 생성하고 활용할 수 있다. 특징 및 맥락 분석부(330)는 상기 맥락 정보를 활용함으로써, 상기 보트 또는 자동차의 형상을 갖는 객체를 “보트”로 식별할 수 있다.For example, when an image including a boat having a form similar to a car is an input image, the identified object is selected from the feature image 314 including a local feature extracted by the feature extractor 310. I can't tell if it's a car or a boat. That is, the feature extractor 310 may recognize the shape of the object based on the local feature, but there may be cases in which the object may not be correctly identified and classified only with the shape of the object. The context generator 320 generates context information 322, 324, and 326 based on the analysis target image or the feature image 214, thereby more accurately identifying and classifying objects. For example, the feature extracted for the whole image is recognized or classified as "natural landscape", the feature extracted for the quadrant image is recognized or classified as "lake", and the feature extracted for the ninth image is "water". If it is recognized or classified as, the extracted features "natural landscape", "lake", "water" can be generated and used as the context information. The feature and context analyzer 330 may identify an object having the shape of the boat or automobile as a boat by using the context information.

도 3을 참조하여 설명한 실시 예에서는 전체 영상에 대한 맥락 정보, 4등분 영상에 대한 맥락 정보, 9등분 영상에 대한 맥락 정보를 생성하고 활용하는 것에 대해 설명하였으나, 맥락 정보를 추출하는 영상의 크기는 이에 한정되지 않는다. 예컨대, 전술한 크기의 영상 이외의 크기를 갖는 영상에 대한 맥락 정보를 생성하고 활용할 수도 있다. In the embodiment described with reference to FIG. 3, the context information for the entire image, the context information for the quadrant image, and the context information for the ninth segment have been described. However, the size of the image from which the context information is extracted is described. It is not limited to this. For example, context information about an image having a size other than the above-described image may be generated and utilized.

도 4는 본 개시의 일 실시 예에 따른 영상 처리 장치가 영상을 분석하여 객체를 검출하는 과정을 나타내는 도면이다.4 is a diagram illustrating a process of detecting an object by analyzing an image according to an embodiment of the present disclosure.

예컨대, 영상 처리 장치(400)는 영상(410)을 입력 받고, 다양한 크기의 영상 영역에 대한 정보를 생성함으로써, 영상(410)에 포함된 객체를 정확히 검출 및/또는 분류할 수 있다. 입력 영상(410)은 예컨대, 가방을 포함하는 X-ray 영상일 수 있다. 영상 처리 장치(400)는 전술한 바에 따라 입력 영상(410)을 분석하여, 영상 전체에 대한 특징, 영상의 일부 영역에 대한 특징을 추출하고 이를 이용하여 영상(410)에 포함된 객체를 정확히 식별할 수 있다. 상기 영상 전체에 대한 특징(422)은 예컨대, 가방의 형상에 대한 특징일 수 있다. 상기 영상의 일부 영역에 대한 특징은 예컨대, 손잡이에 대한 특징(424), 지퍼에 대한 특징(426, 고리에 대한 특징(428) 등을 포함할 수 있다. 영상 처리 장치(400)는 상기 생성된 특징들(422, 424, 426, 428)을 맥락 정보로서 활용함으로써, 상기 영상(410)에 포함된 객체가 “가방”이라는 것을 정확히 식별할 수 있다. 만약 상기 생성된 특징들 중 일부가 “가방”과 관련이 없는 특징이라면, 영상 처리 장치(400)는 상기 영상(410)에 포함된 객체가 “가방”이라고 식별할 수 없거나 또는 상기 영상(410)에 포함된 객체를 “가방”으로 식별할 수 없다는 분석 결과를 제공할 수 있다.For example, the image processing apparatus 400 may correctly receive and / or classify an object included in the image 410 by receiving the image 410 and generating information on image areas of various sizes. The input image 410 may be, for example, an X-ray image including a bag. The image processing apparatus 400 analyzes the input image 410 as described above, extracts the feature of the entire image and the feature of the partial region of the image, and accurately identifies the object included in the image 410 by using the same. can do. The feature 422 for the entire image may be, for example, a feature of a bag shape. Features for some regions of the image may include, for example, features 424 for handles, features 426 for zippers, features 428 for rings, etc. The image processing apparatus 400 may generate the generated image. By utilizing features 422, 424, 426, 428 as contextual information, one can accurately identify that an object included in the image 410 is a “bag.” If some of the generated features are “bags” The image processing apparatus 400 cannot identify that an object included in the image 410 is a “bag” or identify an object included in the image 410 as a “bag”. Can provide analytical results.

또한, 본 개시의 영상 처리 장치는 입력 영상에 포함된 하나 이상의 객체를 검출하거나, 최종 출력 영상을 생성하기 위해 객체의 물성에 따라 색상 표현 범위가 상이하다는 특징을 이용할 수 있다. In addition, the image processing apparatus of the present disclosure may use a feature that the range of color expression is different according to the physical properties of the object in order to detect one or more objects included in the input image or generate the final output image.

도 5는 본 개시의 일 실시 예에 따른 객체의 물성에 기초하여 색상이 표현된 영상을 나타내는 도면으로서, X-Ray 판독 기기에 의해 촬영된 가방 영상(500), 의약품용기 영상(510) 및 캐리어 가방 영상(520)을 나타낸다. 가방 고리(502), 가방 지퍼(504), 의약품(512) 및 병(522)의 경우 각각 객체의 물성에 따라 색상 표현 범위가 상이함을 확인할 수 있다. 한편, 가방 고리(502), 가방 지퍼(504), 의약품(512) 및 병(522)은 다른 객체들과 구분될 수 있도록 비교적 선명하게 색상이 표현되어 있는 반면에, 캐리어 가방 내의 임의의 내용물(524)의 경우에는 캐리어 가방 영상(520)에서 그 임의의 내용물(524)이 무엇인지도 확인하기 어렵고 다른 객체들과 구분하기에도 용이하지 않음을 알 수 있다. 따라서, 본 개시의 영상 처리 장치는 색상 표현 범위에 따라 영상의 강화 정도를 다르게 할 수 있다. 이를 위해, 분할된 영역들 각각에 대한 색상 분포를 분석하여 적어도 일부의 영역에 대해 가중치를 적용할 수 있다.FIG. 5 is a view illustrating an image in which colors are expressed based on physical properties of an object according to an exemplary embodiment of the present disclosure, wherein a bag image 500, a medicine container image 510, and a carrier photographed by an X-ray reading apparatus are illustrated. A bag image 520 is shown. In the case of the bag loop 502, the bag zipper 504, the medicine 512, and the bottle 522, the range of color expression may be different depending on the physical properties of the object. Meanwhile, the bag loop 502, the bag zipper 504, the medicine 512, and the bottle 522 are relatively vividly colored so that they can be distinguished from other objects, while any contents in the carrier bag ( In the case of 524, it may be difficult to identify what the arbitrary contents 524 are in the carrier bag image 520 and may not be easily distinguished from other objects. Therefore, the image processing apparatus of the present disclosure may vary the degree of enhancement of the image according to the color expression range. To this end, the color distribution of each of the divided regions may be analyzed and weighted to at least some of the regions.

상기 하나 이상의 가중치는 n개의 색상 표현 범위 중 적어도 일부에 대한 가중치를 포함할 수 있다. 예컨대, 하나의 영역이 n개의 색상 표현 범위를 가진다고 하면, 해당 영역에서의 가중치의 개수는 1부터 n개를 가질 수 있다.The one or more weights may include weights for at least some of the n color representation ranges. For example, if one region has n color ranges, the number of weights in the region may range from 1 to n.

예컨대, 하나의 영역에 대해 하나의 가중치가 결정되는 경우, 상기 하나의 영역에 포함된 모든 색상 표현 범위에 대해 상기 결정된 가중치를 적용할 수 있다. 또는 상기 하나의 영역에 포함된 모든 색상 표현 범위 중 적어도 일부에 대해 상기 결정된 가중치를 적용할 수도 있다. 예컨대, 영상 강화의 대상이 되는 소정의 색상 표현 범위에 대해서만 상기 결정된 가중치를 적용할 수 있다.For example, when one weight is determined for one region, the determined weight may be applied to all color expression ranges included in the one region. Alternatively, the determined weight may be applied to at least a portion of all color expression ranges included in the one region. For example, the determined weight may be applied only to a predetermined color expression range that is an object of image enhancement.

또는 예컨대, n개의 색상 표현 범위의 각각에 대해 가중치가 결정될 수 있다. 즉, 하나의 영역에 대한 가중치의 개수는 n일 수 있다. 이 경우, 상기 영역에 포함된 모든 색상 표현 범위의 각각에 대응되는 가중치를 해당하는 색상 표현 범위에 적용할 수 있다. 가중치는 영상 강화의 대상이 되는 소정의 색상 표현 범위에 대해 상대적으로 높은 가중치가 부여될 수 있다.Or, for example, a weight may be determined for each of the n color representation ranges. That is, the number of weights for one region may be n. In this case, a weight corresponding to each of all the color expression ranges included in the area may be applied to the corresponding color expression range. The weight may be given a relatively high weight for a predetermined color expression range that is an object of image enhancement.

또는 예컨대, 1보다 크고 n보다 작은 m개의 색상 표현 범위의 각각에 대해 가중치가 결정될 수 있다. 즉, 하나의 영역에 대한 가중치의 개수는 m일 수 있다. 이 경우, 상기 영역에 포함된 색상 표현 범위 중 가중치가 부여된 색상 표현 범위에 대해서만 상기 부여된 가중치를 적용할 수 있다. 영상 강화의 대상이 되는 소정의 색상 표현 범위에 대해 상대적으로 높은 가중치가 부여되는 것은 전술한 바와 같다.Or, for example, a weight may be determined for each of m color representation ranges greater than 1 and less than n. That is, the number of weights for one region may be m. In this case, the weighted weight may be applied only to the color weighted range of the color color ranges included in the area. As described above, a relatively high weight is given to a predetermined color expression range that is an object of image enhancement.

전술한 바와 같이, n개의 색상 표현 범위 중 소정의 색상 표현 범위에 대해서는 가중치를 상대적으로 높게 결정할 수 있다. 예컨대, X-Ray 영상에 포함된 객체가 유기물인 경우 다른 물성(금속, 무기물 등)을 가진 객체에 비해 상대적으로 경계가 덜 선명하게 영상에 표현되는 경우가 많다. 따라서, 분할 영역 중 유기물을 나타내는 색상 표현 범위에 해당하는 부분에 대해서 상대적으로 높은 가중치가 부여될 수 있다.As described above, the weight may be determined relatively high for the predetermined color expression range among the n color expression ranges. For example, when the object included in the X-Ray image is an organic material, the boundary is often expressed more clearly than the object having other physical properties (metal, inorganic, etc.). Therefore, a relatively high weight may be given to a portion of the divided region corresponding to the color expression range representing the organic material.

상대적으로 높은 가중치가 부여되는 상기 소정의 색상 표현 범위는 하나 이상일 수 있다. 예컨대, 전체 색상 표현 범위가 n개일 때, 상대적으로 높은 가중치가 부여되는 상기 소정의 색상 표현 범위는 1 내지 n-1개일 수 있다. 상기 소정의 색상 표현 범위가 복수 개일 때, 각각에 대해 요구되는 영상 강화의 정도는 상이할 수 있으며, 그에 따라 가중치가 부여될 수 있다. 예컨대, 금속->무기물->유기물의 순서로 영상이 선명하게 표현될 때, 유기물에 대한 색상 표현 범위에 대해서만 상대적으로 높은 가중치를 부여할 수도 있으나, 무기물과 유기물에 대해 금속보다 상대적으로 높은 가중치를 부여할 수도 있다. 이때, 무기물보다는 유기물에 상대적으로 높은 가중치가 부여될 수 있다.The predetermined color representation range to which the relatively high weight is given may be one or more. For example, when the total color representation range is n, the predetermined color representation range to which a relatively high weight is given may be 1 to n-1. When there are a plurality of predetermined color representation ranges, the degree of image enhancement required for each may be different, and thus weighted. For example, when an image is clearly displayed in the order of metal-> inorganic-> organic, a relatively high weight may be given only to the range of color expression for organic matter, but a weight that is relatively higher than metal for inorganic and organic matter may be given. You can also grant. In this case, a relatively high weight may be given to the organic material rather than the inorganic material.

예컨대, 각각 0 내지 255의 값을 갖는 Red, Green, Blue의 조합에 의해 색상이 표현될 수 있다. 이 경우, 예컨대, Red로 표현되는 색상 표현 범위는 (R, G, B) = (10~255, 0~10, 0~10)일 수 있다. 또한, 예컨대, Green으로 표현되는 색상 표현 범위는 (R, G, B) = (0~10, 10~255, 0~10)일 수 있다. 또한, 예컨대, Blue로 표현되는 색상 표현 범위는 (R, G, B) = (0~10, 0~10, 10~255)일 수 있다. 예컨대, 적색인 색상 표현 범위가 강화의 대상일 수 있으며, 이 경우, 적색의 색상 표현 범위에 높은 가중치가 부여될 수 있다. 예컨대, 소정의 영역의 색상이 (R, G, B) = (50, 8, 3)으로 표현되는 경우, 해당 색상은 상기 적색의 색상 표현 범위에 포함되므로, 강화의 대상이 된다. 만약 적색의 색상 표현 범위에 부여되는 가중치가 2라면, 해당 영역의 색상은 강화에 의해, (R, G, B) = (50x2, 8x2, 3x2)으로 강화될 수 있다. 또는 특정 요소 색상에만 가중치가 부여되어 (R, G, B) = (50x2, 8, 3)으로 강화될 수도 있다. 상기 가중치는 강화의 정도에 따라 상이하게 결정될 수 있다.For example, the color may be expressed by a combination of red, green, and blue having values of 0 to 255, respectively. In this case, for example, the color expression range represented by Red may be (R, G, B) = (10 to 255, 0 to 10, 0 to 10). Also, for example, the color expression range represented by Green may be (R, G, B) = (0-10, 10-255, 0-10). Also, for example, the color expression range represented by Blue may be (R, G, B) = (0-10, 0-10, 10-255). For example, a color representation range that is red may be an object of enhancement, in which case, a high weight may be given to the color representation range of red. For example, when a color of a predetermined region is represented by (R, G, B) = (50, 8, 3), the color is included in the red color expression range, and thus is subject to enhancement. If the weight assigned to the red color representation range is 2, the color of the corresponding region may be enhanced by (R, G, B) = (50x2, 8x2, 3x2) by enhancement. Alternatively, only specific element colors may be weighted and enhanced to (R, G, B) = (50x2, 8, 3). The weight may be determined differently according to the degree of strengthening.

상기 색상의 강화는 Green으로 표현되는 색상 표현 범위 및/또는 Blue로 표현되는 색상 표현 범위에 적응적으로 적용될 수 있다.The enhancement of the color may be adaptively applied to the color expression range represented by Green and / or the color expression range represented by Blue.

각각의 색상 표현 범위에 대해서는 동일하거나 상이한 가중치가 적용될 수 있다. 또는 일부의 색상 표현 범위에 대해서는 가중치가 적용되지 않을 수 있다.The same or different weights may be applied to each color representation range. Alternatively, the weight may not be applied to some color representation ranges.

도 6은 본 개시의 일 실시 예에 따른 영상 처리 방법을 설명하기 위한 도면이다.6 is a diagram for describing an image processing method according to an exemplary embodiment.

S600 단계에서 하나 이상의 객체를 포함하는 분석 대상 영상을 수신할 수 있다.In operation S600, an analysis target image including one or more objects may be received.

분석 대상 영상은 적어도 하나 이상의 객체를 포함할 수 있다. 예컨대, 영상은 X-Ray 영상 기기가 촬영한 로(raw) 이미지이거나 상기 로 이미지를 저장 또는 전송하기 위한 임의의 형태(포맷)의 이미지일 수 있다. 상기 영상은 X-Ray 판독 기기가 촬영하여 모니터와 같은 출력 장치로 전송하는 영상 정보를 캡쳐하여 데이터화함으로써 획득될 수도 있다.The analysis target image may include at least one object. For example, the image may be a raw image captured by an X-ray imaging device or an image of any form (format) for storing or transmitting the raw image. The image may be obtained by capturing and data-forming image information that is taken by an X-ray reading device and transmitted to an output device such as a monitor.

S610 단계에서 인공지능 모델을 이용하여 상기 분석 대상 영상에 포함된 상기 하나 이상의 객체가 검출 대상 객체에 해당하는지에 관한 수치 정보를 획득할 수 있다.In operation S610, numerical information regarding whether the one or more objects included in the analysis target image correspond to the detection target object may be obtained using the artificial intelligence model.

예컨대, 인공지능 모델은 객체 검출을 위한 딥러닝 모델일 수 있다. 검출 대상 객체는 미리 설정될 수 있으며, 인공지능 모델은 상기 검출 대상 객체에 대해 미리 학습될 수 있다. 또한, 검출 대상 객체는 적용되는 분야에 따라 달라질 수 있다. 예컨대, USB, 노트북, 하드 드라이브 등의 저장 매체, 칼, 라이터, 총 등을 포함할 수 있다. 따라서, 본 개시의 영상 처리 장치 및/또는 영상 처리 방법은 입력되는 영상을 분석해서 특정한 객체를 검출할 수 있으며, 기업 보안 시스템 등의 다양한 분야에 적용될 수 있다. For example, the artificial intelligence model may be a deep learning model for object detection. The object to be detected may be set in advance, and the artificial intelligence model may be pre-learned with respect to the object to be detected. Also, the object to be detected may vary depending on the field of application. For example, the storage medium may include a storage medium such as a USB, a notebook, a hard drive, a knife, a lighter, a gun, and the like. Accordingly, the image processing apparatus and / or image processing method of the present disclosure may detect a specific object by analyzing an input image and may be applied to various fields such as an enterprise security system.

수치 정보는 하나 이상의 객체를 포함하는 분석 대상 영상에 대해 인공지능 모델을 적용하여 객체를 검출하는 경우, 검출된 객체가 상기 검출 대상 객체에 해당하는지를 나타내는 확률, 점수 또는 분포 등을 의미할 수 있다. 예컨대, 검출 대상 객체가 USB, 칼, 라이터라고 하면 상기 수치 정보는 분석 대상 영상에 포함된 객체가 USB에 해당되는 확률, 칼에 해당되는 확률 및/또는 라이터에 해당되는 확률을 포함할 수 있다.The numerical information may mean a probability, a score, or a distribution indicating whether the detected object corresponds to the detection target object when the object is detected by applying an artificial intelligence model to the analysis target image including one or more objects. For example, if the detection target object is a USB, a knife, a lighter, the numerical information may include a probability that the object included in the analysis image corresponds to a USB, a probability that corresponds to a knife, and / or a probability that corresponds to a lighter.

S620 단계에서 획득된 수치 정보가 제1 임계값 이상인 경우, 1차 검출 단계로서 하나 이상의 객체를 검출 대상 객체로 판단할 수 있다.When the numerical information obtained in step S620 is equal to or greater than the first threshold value, one or more objects may be determined as the detection target object as the first detection step.

제1 임계값은 검출 대상 객체에 따라 다를 수 있다. 제1 임계값은 기설정되거나 사용자에 의해 입력될 수 있다. 상기 제1 임계값을 이용함으로써 본 개시의 객체 판단 방법은 보다 더 신뢰성을 확보할 수 있다. 또한, 예컨대 제1 임계값은 객체 검출에 관련된 매개변수일 수 있다.The first threshold value may vary depending on the object to be detected. The first threshold may be preset or input by a user. By using the first threshold value, the object determination method of the present disclosure can ensure more reliability. Also, for example, the first threshold may be a parameter related to object detection.

한편, 상기 1차 검출 단계의 민감도는 수치 정보의 획득과 관련된 인공지능 모델 내부의 파라미터 또는 제1 임계값을 이용하여 조정할 수 있다. 상기 인공지능 모델 내부의 파라미터를 조정하면 하나의 입력 영상에 대해 복수의 서로 다른 수치 정보를 획득할 수 있다. 상기 조정은 기설정되어 있거나 사용자에 의해 입력될 수 있다. 예컨대, 인공지능 모델 내부의 파라미터는 합성곱 신경망에서 합성곱 계층 및/또는 풀링 계층의 필터 커널들을 포함할 수 있다. 따라서, 인공지능 모델 내부의 파라미터를 조정하여 수치 정보를 조정하거나 또는 제1 임계값을 조정함으로써 검출된 객체가 검출 대상 객체에 해당하는지에 관한 판단 정도를 조정할 수 있다.Meanwhile, the sensitivity of the first detection step may be adjusted by using a first threshold value or a parameter inside an artificial intelligence model related to acquisition of numerical information. By adjusting parameters inside the artificial intelligence model, a plurality of different numerical information may be obtained with respect to one input image. The adjustment may be preset or entered by the user. For example, the parameter inside the artificial intelligence model may include filter kernels of the convolutional layer and / or the pooling layer in the convolutional neural network. Accordingly, the degree of determination as to whether the detected object corresponds to the object to be detected may be adjusted by adjusting numerical information by adjusting parameters in the artificial intelligence model or adjusting the first threshold value.

S630 단계에서 S620 단계에서 검출 대상 객체로 판단된 하나 이상의 객체에 대해, 2차 검출 단계로서 영상 매칭 기법 또는 영상 인식 기법을 적용함으로써 하나 이상의 객체가 검출 대상 객체에 해당하는지의 여부를 판단할 수 있다.In operation S630, whether the at least one object corresponds to the detection target object may be determined by applying an image matching technique or an image recognition technique to the at least one object determined as the detection target object in operation S620. .

예컨대, 상기 2차 검출 단계는, S620 단계에서 상기 검출 대상 객체로 판단된 하나 이상의 객체로서, 제2 임계값 이하의 수치 정보를 갖는 객체에 대해서만 수행될 수 있다. 제2 임계값은 검출 대상 객체에 따라 다를 수 있다. 제2 임계값은 기설정되거나 사용자에 의해 입력될 수 있다.For example, the secondary detection step may be performed only on an object having one or more objects determined as the detection target object in operation S620 and having numerical information less than or equal to a second threshold. The second threshold may vary depending on the object to be detected. The second threshold may be preset or input by the user.

한편, 검출 대상 객체에 관한 영상이 있는 경우(예를 들어, 기업 보안상 반입 금지 물품에 관한 X-ray 영상이 있는 경우), 상기 영상을 이용한 영상 매칭 기법이 적용될 수 있다. 여기서 영상 매칭 기법은, 예컨대, 입력 영상들간의 특징점을 추출하여 영상간 유사도를 판단하는 방법일 수 있다. 또한, 검출 대상 객체에 관한 영상이 없는 경우에는, 인식(recognition) 또는 분류(classification) 분야에서 일반적으로 이용되는 다양한 영상 인식 방법들이 적용될 수 있다.On the other hand, if there is an image of the object to be detected (for example, if there is an X-ray image of the prohibited items to enter the company security), an image matching technique using the image may be applied. The image matching technique may be, for example, a method of determining similarity between images by extracting feature points between input images. In addition, when there is no image of the object to be detected, various image recognition methods generally used in the field of recognition or classification may be applied.

또한, 상기 이차적인 판단 결과, S620 단계에서 판단된 결과와 다른 경우, 본 개시의 방법은 상기 수치 정보에 기초하여 검출 객체가 다른 검출 대상 객체에 해당하는지에 관하여 다시 판단할 수 있다. 예컨대, 상기 수치 정보에 기초하여 S630 단계에서 판단된 검출 대상 객체를 제외하고 그 다음의 확률값을 가지는 검색 대상 객체에 상기 객체가 해당하는지를 판단할 수 있다.In addition, when the second determination result is different from the result determined in operation S620, the method of the present disclosure may again determine whether the detection object corresponds to another detection object based on the numerical information. For example, based on the numerical information, it may be determined whether the object corresponds to a search target object having a next probability value except the detection target object determined in operation S630.

본 개시의 예시적인 방법들은 설명의 명확성을 위해서 동작의 시리즈로 표현되어 있지만, 이는 단계가 수행되는 순서를 제한하기 위한 것은 아니며, 필요한 경우에는 각각의 단계가 동시에 또는 상이한 순서로 수행될 수도 있다. 본 개시에 따른 방법을 구현하기 위해서, 예시하는 단계에 추가적으로 다른 단계를 포함하거나, 일부의 단계를 제외하고 나머지 단계를 포함하거나, 또는 일부의 단계를 제외하고 추가적인 다른 단계를 포함할 수도 있다.Exemplary methods of the present disclosure are represented as a series of operations for clarity of description, but are not intended to limit the order in which the steps are performed, and each step may be performed simultaneously or in a different order as necessary. In order to implement the method according to the present disclosure, the illustrated step may further include other steps, may include remaining steps except for some steps, or may include additional other steps except for some steps.

본 개시의 다양한 실시 예는 모든 가능한 조합을 나열한 것이 아니고 본 개시의 대표적인 양상을 설명하기 위한 것이며, 다양한 실시 예에서 설명하는 사항들은 독립적으로 적용되거나 또는 둘 이상의 조합으로 적용될 수도 있다.The various embodiments of the present disclosure are not an exhaustive list of all possible combinations and are intended to describe representative aspects of the present disclosure, and the matters described in the various embodiments may be applied independently or in combination of two or more.

또한, 본 개시의 다양한 실시 예는 하드웨어, 펌웨어(firmware), 소프트웨어, 또는 그들의 결합 등에 의해 구현될 수 있다. 하드웨어에 의한 구현의 경우, 하나 또는 그 이상의 ASICs(Application Specific Integrated Circuits), DSPs(Digital Signal Processors), DSPDs(Digital Signal Processing Devices), PLDs(Programmable Logic Devices), FPGAs(Field Programmable Gate Arrays), 범용 프로세서(general processor), 컨트롤러, 마이크로 컨트롤러, 마이크로 프로세서 등에 의해 구현될 수 있다. In addition, various embodiments of the present disclosure may be implemented by hardware, firmware, software, or a combination thereof. In hardware implementations, one or more Application Specific Integrated Circuits (ASICs), Digital Signal Processors (DSPs), Digital Signal Processing Devices (DSPDs), Programmable Logic Devices (PLDs), Field Programmable Gate Arrays (FPGAs), General Purpose It may be implemented by a general processor, a controller, a microcontroller, a microprocessor, and the like.

본 개시의 범위는 다양한 실시 예의 방법에 따른 동작이 장치 또는 컴퓨터 상에서 실행되도록 하는 소프트웨어 또는 머신-실행가능한 명령들(예를 들어, 운영체제, 애플리케이션, 펌웨어(firmware), 프로그램 등), 및 이러한 소프트웨어 또는 명령 등이 저장되어 장치 또는 컴퓨터 상에서 실행 가능한 비-일시적 컴퓨터-판독가능 매체(non-transitory computer-readable medium)를 포함한다.It is intended that the scope of the disclosure include software or machine-executable instructions (eg, an operating system, an application, firmware, a program, etc.) to cause an operation in accordance with various embodiments of the method to be executed on an apparatus or a computer, and such software or Instructions, and the like, including non-transitory computer-readable media that are stored and executable on a device or computer.

Claims

Receiving an analysis target image including an object;
Obtaining numerical information on whether the object included in the analysis target image corresponds to a detection target object using an artificial intelligence model;
A primary detection step of determining whether the numerical information is equal to or greater than a first threshold value;
A second detection step of determining whether the numerical information is equal to or greater than a second threshold value; And
Determining whether the object corresponds to the object to be detected based on results of the first detection step and the second detection step,
The second threshold is greater than the first threshold,
And the second detecting step is performed only when the numerical information is equal to or greater than a first threshold value.

The method of claim 1,
And adjusting the sensitivity of the first detection step or the second detection step by adjusting the first threshold value or the second threshold value.

The method of claim 2,
The first threshold value or the second threshold value, characterized in that determined based on the learning of the artificial intelligence model, image processing method.

The method of claim 1,
And adjusting the sensitivity of the first detection step or the second detection step by adjusting a parameter inside the artificial intelligence model.

The method of claim 1,
The first threshold value and the second threshold value is determined differently according to the type of the object to be detected, image processing method.

The method of claim 1,
And when the numerical information is equal to or greater than a first threshold and less than or equal to the second threshold, the object is determined not to be the detection target object.

The method of claim 6,
If it is determined that the object is not the detection target object, changing the type of the detection target object.

The method of claim 1,
Acquiring the numerical information,
A feature extraction step of extracting a feature of the object;
Generating context information based on the extracted feature; And
And obtaining the numerical information based on the extracted feature and the generated context information.

The method of claim 8,
The generating of the context information may include generating the context information on all or a portion of the analysis target image.
The context information may be an entire or partial region of the analysis target image.

The method of claim 9,
The generating of the context information may include generating the context information by applying at least one of a convolutional neural network technique and a pooling technique to the extracted features.

The method of claim 10,
The generating of the context information may include generating the context information on all or a portion of the analysis target image by adjusting the pooling interval.

An image receiver configured to receive an analysis target image including an object;
An output unit configured to output an analysis result of the analysis target image; And
A processor for controlling the receiving unit and the output unit,
The processor,
Acquires numerical information about whether the object included in the analysis target image corresponds to a detection target object using an artificial intelligence model,
Perform first detection to determine whether the numerical information is equal to or greater than a first threshold value,
Perform second detection to determine whether the numerical information is equal to or greater than a second threshold value,
It is determined whether the object corresponds to the object to be detected based on the results of the first detection and the second detection,
The second threshold is greater than the first threshold,
And the second detection is performed only when the numerical information is equal to or greater than a first threshold value.

The method of claim 12,
The processor,
And adjusting the sensitivity of the first detection or the second detection by adjusting the first threshold value or the second threshold value.

The method of claim 13,
The first threshold value or the second threshold value, characterized in that determined based on the learning of the artificial intelligence model, the image processing apparatus.

The method of claim 1,
The processor is
And adjusting the sensitivity of the first detection or the second detection by adjusting a parameter inside the artificial intelligence model.

The method of claim 12,
And the first threshold value and the second threshold value are differently determined according to the type of the object to be detected.

The method of claim 12,
And when the numerical information is equal to or greater than a first threshold and less than or equal to the second threshold, the object is determined not to be the detection target object.

The method of claim 17,
The processor,
And when it is determined that the object is not the detection target object, changing the type of the detection target object.

The method of claim 12,
The processor,
Extract features of the object,
Generate context information based on the extracted feature,
And the numerical information is obtained based on the extracted feature and the generated context information.

The method of claim 19,
The context information is generated for all or part of the region of the analysis target image.
The context information may be an entire or partial region of the analysis target image.

The method of claim 20,
The contextual information is generated by applying at least one of a convolutional neural network technique and a pooling technique to the extracted feature.

The method of claim 21,
And the context information is generated for all or a portion of the analysis target image by adjusting the pooling interval.

A computer-readable recording medium having recorded a program,
The program,
Receiving an analysis target image including an object;
Obtaining numerical information on whether the object included in the analysis target image corresponds to a detection target object using an artificial intelligence model;
A primary detection step of determining whether the numerical information is equal to or greater than a first threshold value;
A second detection step of determining whether the numerical information is equal to or greater than a second threshold value; And
Determining whether the object corresponds to the object to be detected based on results of the first detection step and the second detection step,
The second threshold is greater than the first threshold,
And said secondary detecting step is performed only when said numerical information is equal to or greater than a first threshold value.