KR20230091736A

KR20230091736A - Apparatus and method for providing intelligent image analysis

Info

Publication number: KR20230091736A
Application number: KR1020210193295A
Authority: KR
Inventors: 원동일; 신호환; 이지훈; 이진형; 최인명
Original assignee: 삼성전자주식회사
Priority date: 2021-12-16
Filing date: 2021-12-30
Publication date: 2023-06-23

Abstract

According to various embodiments, a device for intelligent image analysis includes at least one processor and memory. Upon execution, the at least one processor may be configured to store instructions for obtaining at least one component by analyzing at least one input image, identifying a plurality of sensing conditions based on the at least one component, generating a plurality of sensing condition sets based on the plurality of sensing conditions, and detecting abnormalities in a target image based on at least one of the plurality of detection condition sets. Various other embodiments may be provided. According to various embodiments, an artificial intelligence system can generate various yet highly accurate rules that serve as a standard for the verification of a verification target.

Description

Apparatus and method for providing an intelligent image analysis method {APPARATUS AND METHOD FOR PROVIDING INTELLIGENT IMAGE ANALYSIS}

본 문서에 개시된 다양한 실시 예는 지능형 영상 분석 방법을 제공하기 위한 장치 및 방법에 관한 것이다.Various embodiments disclosed in this document relate to an apparatus and method for providing an intelligent image analysis method.

스마트 폰(smart phone)과 같은 전자 장치가 점차 고성능화됨에 따라 전자 장치를 이용한 다양한 서비스들이 제공되고 있다. 예를 들어, 전자 장치를 통해 빅스비와 같이 AI(artificial intelligence) 기술을 이용한 고차원 서비스를 제공하는 어플리케이션들이 점차 확대 적용되고 있다. 이러한 고차원 서비스를 제공하기 위한 기술은 스스로 학습하고 판단하며 사용할수록 인식률이 향상되는 기술로, 입력된 사용자 데이터들의 특징을 스스로 분류/학습하는 알고리즘을 이용하는 기계학습(딥러닝) 기술 및 기계학습 알고리즘을 활용하여 인간 두뇌의 인지, 판단과 같은 기능을 모사하는 요소 기술들로 구성될 수 있다.BACKGROUND ART As electronic devices such as smart phones have gradually improved in performance, various services using the electronic devices are being provided. For example, applications that provide high-level services using artificial intelligence (AI) technology, such as Bixby, are gradually being applied through electronic devices. The technology for providing such a high-level service is a technology that self-learns and judges, and the recognition rate improves as it is used. It can be composed of element technologies that mimic the functions of the human brain, such as cognition and judgment.

이러한 인공 지능 기술은 다양한 분야에 이용되고 있다. 예를 들어, 시각적 이해는 인간의 시각처럼 인식하여 처리하는 기술로서, 객체 인식, 객체 추적, 영상 검색, 사람 인식, 장면 이해, 또는 공간 이해를 포함할 수 있다.Such artificial intelligence technology is used in various fields. For example, visual understanding is a technology that recognizes and processes like human vision, and may include object recognition, object tracking, image search, person recognition, scene understanding, or spatial understanding.

예를 들어, 비대면 방식의 온라인 시험의 경우 온라인 시험의 특성 상 실시간으로 응시자에 대한 관리가 이루어져야 하며, 응시자에 대한 평가 공정성이 요구될 수 있다. 하지만, 현재 인공 지능 시스템은 입력되는 영상에서 특정 모션만을 감지할 수 있다. 특정 모션만을 감지하는 방법은 응시자가 특정 행동을 해야 한다는 경우만을 전제로 하므로, 응시자가 다양한 행동을 한다거나 응시자의 주변 환경이 달라질 경우에는 자동화된 검증이 어려울 수 있다. For example, in the case of a non-face-to-face online test, due to the nature of the online test, real-time management of test takers may be required, and fairness in the evaluation of test takers may be required. However, current artificial intelligence systems can only detect specific motions in input images. Since the method of detecting only a specific motion assumes only a case in which the candidate has to perform a specific action, automated verification may be difficult when the candidate performs various actions or the surrounding environment of the candidate changes.

따라서 인공 지능 기술이 적용된 인공 지능 시스템이 정상적으로 동작하기 위해서는 검증 대상(예: 응시자)에 대한 검증의 기준이 되는 룰(rule)들이 다양하면서도 정확도가 높을 필요가 있다. Therefore, in order for an artificial intelligence system to which artificial intelligence technology is applied to operate normally, it is necessary to have various rules that are standards for verification of a verification target (eg, a candidate) and have high accuracy.

본 개시의 다양한 실시예들은, 대상의 데이터를 획득, 모니터링 및 분석을 통해 검증 기준인 룰을 생성할 수 있는, 지능형 영상 분석 방법을 제공하기 위한 장치 및 방법에 관한 것이다.Various embodiments of the present disclosure relate to an apparatus and method for providing an intelligent video analysis method capable of generating a rule, which is a verification criterion, through acquisition, monitoring, and analysis of target data.

다양한 실시 예에 따르면, 지능형 영상 분석을 위한 장치는, 적어도 하나의 프로세서 및 메모리를 포함하며, 상기 메모리는, 실행 시에, 상기 적어도 하나의 프로세서가, 적어도 하나의 입력 이미지를 분석함으로써 적어도 하나의 구성 요소를 획득하고, 상기 적어도 하나의 구성 요소에 기반하여 복수의 감지 조건들을 식별하고, 상기 복수의 감지 조건들에 기반하여, 복수의 감지 조건 집합들을 생성하고, 상기 복수의 감지 조건 집합들 중 적어도 하나에 근거하여, 대상 이미지에서 이상 유무를 검출하도록 하는 인스트럭션들을 저장하도록 설정될 수 있다. According to various embodiments of the present disclosure, an apparatus for intelligent video analysis includes at least one processor and a memory, and when the memory is executed, the at least one processor analyzes at least one input image, thereby generating at least one image. Acquire a component, identify a plurality of sensing conditions based on the at least one component, generate a plurality of sensing condition sets based on the plurality of sensing conditions, and among the plurality of sensing condition sets Based on at least one, it may be set to store instructions for detecting the presence or absence of abnormalities in the target image.

다양한 실시 예에 따르면, 지능형 영상 분석을 위한 방법은, 적어도 하나의 입력 이미지를 분석함으로써 적어도 하나의 구성 요소를 획득하는 동작, 상기 적어도 하나의 구성 요소에 기반하여 복수의 감지 조건들을 식별하는 동작, 상기 복수의 감지 조건들에 기반하여, 복수의 감지 조건 집합들을 생성하는 동작 및 상기 복수의 감지 조건 집합들 중 적어도 하나에 근거하여, 대상 이미지에서 이상 유무를 검출하는 동작을 포함할 수 있다. According to various embodiments, a method for intelligent video analysis includes obtaining at least one component by analyzing at least one input image, identifying a plurality of sensing conditions based on the at least one component, An operation of generating a plurality of detection condition sets based on the plurality of detection conditions, and an operation of detecting whether there is an abnormality in the target image based on at least one of the plurality of detection condition sets.

다양한 실시 예에 따르면, 인공 지능 시스템에서 검증 대상에 대한 검증의 기준이 되는 다양하면서도 정확도가 높은 룰(rule)들을 생성할 수 있다. According to various embodiments, it is possible to create various and highly accurate rules that are standards for verification of a verification target in an artificial intelligence system.

다양한 실시 예에 따르면, 대상의 데이터를 획득, 모니터링 및 분석을 통해 최소한의 비용으로 상황에 맞는 검증 기준인 룰들을 생성할 수 있어, 생성된 룰들을 기반으로 대상의 다양한 모션을 감지할 수 있다. According to various embodiments, it is possible to create rules that are verification criteria suitable for situations at a minimum cost through acquisition, monitoring, and analysis of object data, and various motions of the object can be detected based on the generated rules.

다양한 실시 예에 따르면, 온라인 시험의 경우 응시자에 대한 실시간 영상, 음성과 같은 데이터를 분석하여, 응시자의 정해진 규정 위반 여부를 확인할 수 있고, 효과적으로 부정행위를 방지할 수 있다. According to various embodiments of the present disclosure, in the case of an online test, it is possible to check whether the test taker has violated a predetermined rule by analyzing data such as real-time video and audio of the test taker, and cheating may be effectively prevented.

본 개시에서 얻을 수 있는 효과는 이상에서 언급한 효과들로 제한되지 않으며, 언급하지 않은 또 다른 효과들은 아래의 기재로부터 본 개시가 속하는 기술 분야에서 통상의 지식을 가진 자에게 명확하게 이해될 수 있을 것이다.Effects obtainable in the present disclosure are not limited to the effects mentioned above, and other effects not mentioned may be clearly understood by those skilled in the art from the description below. will be.

도 1은 다양한 실시 예에 따른 네트워크 환경 내의 전자 장치의 블럭도이다.
도 2는 다양한 실시 예에 따른 대상 데이터 수집 및 처리를 위한 데이터 처리 시스템의 구성도이다.
도 3은 다양한 실시 예에 따른 지능형 영상 분석 방법을 제공하기 위한 장치 의 내부 블록 구성도이다.
도 4는 다양한 실시 예에 따른 분석 엔진의 동작을 설명하기 위한 도면이다.
도 5는 다양한 실시 예에 따른 지능형 영상 분석 방법을 제공하기 위한 장치 에서의 동작 흐름도이다.
도 6은 다양한 실시 예에 따른 룰셋 생성을 위한 동작 흐름도이다.
도 7은 다양한 실시 예에 따른 온라인 시험 방식을 나타낸 예시도이다.
도 8은 다양한 실시 예에 따른 실시간 응시자 영상을 나타낸 예시도이다.
도 9는 다양한 실시 예에 따른 룰셋 생성 및 감지 목록 생성 동작을 설명하기 위한 예시도이다.
도 10a은 다양한 실시 예에 따른 입력 이미지 기반의 룰셋 생성 방법을 설명하기 위한 예시도이다.
도 10b는 다양한 실시 예에 따른 룰셋 추천 방법을 나타낸 예시도이다.
도 11은 다양한 실시 예에 따른 온라인 시험에서의 동작 흐름도이다.
도 12는 다양한 실시 예에 따른 실시간 응시자 영상들을 나타낸 예시도이다.
도면의 설명과 관련하여, 동일 또는 유사한 구성 요소에 대해서는 동일 또는 유사한 참조 부호가 사용될 수 있다.1 is a block diagram of an electronic device in a network environment according to various embodiments.
2 is a configuration diagram of a data processing system for collecting and processing target data according to various embodiments.
3 is an internal block diagram of an apparatus for providing an intelligent video analysis method according to various embodiments.
4 is a diagram for explaining an operation of an analysis engine according to various embodiments.
5 is a flowchart illustrating an operation of an apparatus for providing an intelligent video analysis method according to various embodiments.
6 is a flowchart of an operation for generating a ruleset according to various embodiments.
7 is an exemplary diagram illustrating an online test method according to various embodiments.
8 is an exemplary view illustrating a real-time viewer image according to various embodiments.
9 is an exemplary diagram for explaining ruleset creation and detection list creation operations according to various embodiments.
10A is an exemplary diagram for explaining a method of generating a ruleset based on an input image according to various embodiments.
10B is an exemplary view illustrating a rule set recommendation method according to various embodiments.
11 is a flowchart illustrating an operation in an online test according to various embodiments.
12 is an exemplary diagram illustrating real-time gazer images according to various embodiments.
In connection with the description of the drawings, the same or similar reference numerals may be used for the same or similar elements.

도 1은, 다양한 실시예들에 따른, 네트워크 환경(100) 내의 전자 장치(101)의 블록도이다. 도 1을 참조하면, 네트워크 환경(100)에서 전자 장치(101)는 제 1 네트워크(198)(예: 근거리 무선 통신 네트워크)를 통하여 전자 장치(102)와 통신하거나, 또는 제 2 네트워크(199)(예: 원거리 무선 통신 네트워크)를 통하여 전자 장치(104) 또는 서버(108) 중 적어도 하나와 통신할 수 있다. 일실시예에 따르면, 전자 장치(101)는 서버(108)를 통하여 전자 장치(104)와 통신할 수 있다. 일실시예에 따르면, 전자 장치(101)는 프로세서(120), 메모리(130), 입력 모듈(150), 음향 출력 모듈(155), 디스플레이 모듈(160), 오디오 모듈(170), 센서 모듈(176), 인터페이스(177), 연결 단자(178), 햅틱 모듈(179), 카메라 모듈(180), 전력 관리 모듈(188), 배터리(189), 통신 모듈(190), 가입자 식별 모듈(196), 또는 안테나 모듈(197)을 포함할 수 있다. 어떤 실시예에서는, 전자 장치(101)에는, 이 구성요소들 중 적어도 하나(예: 연결 단자(178))가 생략되거나, 하나 이상의 다른 구성요소가 추가될 수 있다. 어떤 실시예에서는, 이 구성요소들 중 일부들(예: 센서 모듈(176), 카메라 모듈(180), 또는 안테나 모듈(197))은 하나의 구성요소(예: 디스플레이 모듈(160))로 통합될 수 있다.1 is a block diagram of an electronic device 101 within a network environment 100, according to various embodiments. Referring to FIG. 1 , in a network environment 100, an electronic device 101 communicates with an electronic device 102 through a first network 198 (eg, a short-range wireless communication network) or through a second network 199. It may communicate with at least one of the electronic device 104 or the server 108 through (eg, a long-distance wireless communication network). According to one embodiment, the electronic device 101 may communicate with the electronic device 104 through the server 108 . According to an embodiment, the electronic device 101 includes a processor 120, a memory 130, an input module 150, an audio output module 155, a display module 160, an audio module 170, a sensor module ( 176), interface 177, connection terminal 178, haptic module 179, camera module 180, power management module 188, battery 189, communication module 190, subscriber identification module 196 , or the antenna module 197 may be included. In some embodiments, in the electronic device 101, at least one of these components (eg, the connection terminal 178) may be omitted or one or more other components may be added. In some embodiments, some of these components (eg, sensor module 176, camera module 180, or antenna module 197) are integrated into a single component (eg, display module 160). It can be.

프로세서(120)는, 예를 들면, 소프트웨어(예: 프로그램(140))를 실행하여 프로세서(120)에 연결된 전자 장치(101)의 적어도 하나의 다른 구성요소(예: 하드웨어 또는 소프트웨어 구성요소)를 제어할 수 있고, 다양한 데이터 처리 또는 연산을 수행할 수 있다. 일실시예에 따르면, 데이터 처리 또는 연산의 적어도 일부로서, 프로세서(120)는 다른 구성요소(예: 센서 모듈(176) 또는 통신 모듈(190))로부터 수신된 명령 또는 데이터를 휘발성 메모리(132)에 저장하고, 휘발성 메모리(132)에 저장된 명령 또는 데이터를 처리하고, 결과 데이터를 비휘발성 메모리(134)에 저장할 수 있다. 일실시예에 따르면, 프로세서(120)는 메인 프로세서(121)(예: 중앙 처리 장치 또는 어플리케이션 프로세서) 또는 이와는 독립적으로 또는 함께 운영 가능한 보조 프로세서(123)(예: 그래픽 처리 장치, 신경망 처리 장치(NPU: neural processing unit), 이미지 시그널 프로세서, 센서 허브 프로세서, 또는 커뮤니케이션 프로세서)를 포함할 수 있다. 예를 들어, 전자 장치(101)가 메인 프로세서(121) 및 보조 프로세서(123)를 포함하는 경우, 보조 프로세서(123)는 메인 프로세서(121)보다 저전력을 사용하거나, 지정된 기능에 특화되도록 설정될 수 있다. 보조 프로세서(123)는 메인 프로세서(121)와 별개로, 또는 그 일부로서 구현될 수 있다.The processor 120, for example, executes software (eg, the program 140) to cause at least one other component (eg, hardware or software component) of the electronic device 101 connected to the processor 120. It can control and perform various data processing or calculations. According to one embodiment, as at least part of data processing or operation, the processor 120 transfers instructions or data received from other components (e.g., sensor module 176 or communication module 190) to volatile memory 132. , processing commands or data stored in the volatile memory 132 , and storing resultant data in the non-volatile memory 134 . According to one embodiment, the processor 120 may include a main processor 121 (eg, a central processing unit or an application processor) or a secondary processor 123 (eg, a graphic processing unit, a neural network processing unit ( NPU: neural processing unit (NPU), image signal processor, sensor hub processor, or communication processor). For example, when the electronic device 101 includes the main processor 121 and the auxiliary processor 123, the auxiliary processor 123 may use less power than the main processor 121 or be set to be specialized for a designated function. can The secondary processor 123 may be implemented separately from or as part of the main processor 121 .

보조 프로세서(123)는, 예를 들면, 메인 프로세서(121)가 인액티브(예: 슬립) 상태에 있는 동안 메인 프로세서(121)를 대신하여, 또는 메인 프로세서(121)가 액티브(예: 어플리케이션 실행) 상태에 있는 동안 메인 프로세서(121)와 함께, 전자 장치(101)의 구성요소들 중 적어도 하나의 구성요소(예: 디스플레이 모듈(160), 센서 모듈(176), 또는 통신 모듈(190))와 관련된 기능 또는 상태들의 적어도 일부를 제어할 수 있다. 일실시예에 따르면, 보조 프로세서(123)(예: 이미지 시그널 프로세서 또는 커뮤니케이션 프로세서)는 기능적으로 관련 있는 다른 구성요소(예: 카메라 모듈(180) 또는 통신 모듈(190))의 일부로서 구현될 수 있다. 일실시예에 따르면, 보조 프로세서(123)(예: 신경망 처리 장치)는 인공지능 모델의 처리에 특화된 하드웨어 구조를 포함할 수 있다. 인공지능 모델은 기계 학습을 통해 생성될 수 있다. 이러한 학습은, 예를 들어, 인공지능 모델이 수행되는 전자 장치(101) 자체에서 수행될 수 있고, 별도의 서버(예: 서버(108))를 통해 수행될 수도 있다. 학습 알고리즘은, 예를 들어, 지도형 학습(supervised learning), 비지도형 학습(unsupervised learning), 준지도형 학습(semi-supervised learning) 또는 강화 학습(reinforcement learning)을 포함할 수 있으나, 전술한 예에 한정되지 않는다. 인공지능 모델은, 복수의 인공 신경망 레이어들을 포함할 수 있다. 인공 신경망은 심층 신경망(DNN: deep neural network), CNN(convolutional neural network), RNN(recurrent neural network), RBM(restricted boltzmann machine), DBN(deep belief network), BRDNN(bidirectional recurrent deep neural network), 심층 Q-네트워크(deep Q-networks) 또는 상기 중 둘 이상의 조합 중 하나일 수 있으나, 전술한 예에 한정되지 않는다. 인공지능 모델은 하드웨어 구조 이외에, 추가적으로 또는 대체적으로, 소프트웨어 구조를 포함할 수 있다.The secondary processor 123 may, for example, take the place of the main processor 121 while the main processor 121 is in an inactive (eg, sleep) state, or the main processor 121 is active (eg, running an application). ) state, together with the main processor 121, at least one of the components of the electronic device 101 (eg, the display module 160, the sensor module 176, or the communication module 190) It is possible to control at least some of the related functions or states. According to one embodiment, the auxiliary processor 123 (eg, image signal processor or communication processor) may be implemented as part of other functionally related components (eg, camera module 180 or communication module 190). there is. According to an embodiment, the auxiliary processor 123 (eg, a neural network processing device) may include a hardware structure specialized for processing an artificial intelligence model. AI models can be created through machine learning. Such learning may be performed, for example, in the electronic device 101 itself where the artificial intelligence model is performed, or may be performed through a separate server (eg, the server 108). The learning algorithm may include, for example, supervised learning, unsupervised learning, semi-supervised learning or reinforcement learning, but in the above example Not limited. The artificial intelligence model may include a plurality of artificial neural network layers. Artificial neural networks include deep neural networks (DNNs), convolutional neural networks (CNNs), recurrent neural networks (RNNs), restricted boltzmann machines (RBMs), deep belief networks (DBNs), bidirectional recurrent deep neural networks (BRDNNs), It may be one of deep Q-networks or a combination of two or more of the foregoing, but is not limited to the foregoing examples. The artificial intelligence model may include, in addition or alternatively, software structures in addition to hardware structures.

메모리(130)는, 전자 장치(101)의 적어도 하나의 구성요소(예: 프로세서(120) 또는 센서 모듈(176))에 의해 사용되는 다양한 데이터를 저장할 수 있다. 데이터는, 예를 들어, 소프트웨어(예: 프로그램(140)) 및, 이와 관련된 명령에 대한 입력 데이터 또는 출력 데이터를 포함할 수 있다. 메모리(130)는, 휘발성 메모리(132) 또는 비휘발성 메모리(134)를 포함할 수 있다. The memory 130 may store various data used by at least one component (eg, the processor 120 or the sensor module 176) of the electronic device 101 . The data may include, for example, input data or output data for software (eg, program 140) and commands related thereto. The memory 130 may include volatile memory 132 or non-volatile memory 134 .

프로그램(140)은 메모리(130)에 소프트웨어로서 저장될 수 있으며, 예를 들면, 운영 체제(142), 미들 웨어(144) 또는 어플리케이션(146)을 포함할 수 있다. The program 140 may be stored as software in the memory 130 and may include, for example, an operating system 142 , middleware 144 , or an application 146 .

입력 모듈(150)은, 전자 장치(101)의 구성요소(예: 프로세서(120))에 사용될 명령 또는 데이터를 전자 장치(101)의 외부(예: 사용자)로부터 수신할 수 있다. 입력 모듈(150)은, 예를 들면, 마이크, 마우스, 키보드, 키(예: 버튼), 또는 디지털 펜(예: 스타일러스 펜)을 포함할 수 있다. The input module 150 may receive a command or data to be used by a component (eg, the processor 120) of the electronic device 101 from the outside of the electronic device 101 (eg, a user). The input module 150 may include, for example, a microphone, a mouse, a keyboard, a key (eg, a button), or a digital pen (eg, a stylus pen).

음향 출력 모듈(155)은 음향 신호를 전자 장치(101)의 외부로 출력할 수 있다. 음향 출력 모듈(155)은, 예를 들면, 스피커 또는 리시버를 포함할 수 있다. 스피커는 멀티미디어 재생 또는 녹음 재생과 같이 일반적인 용도로 사용될 수 있다. 리시버는 착신 전화를 수신하기 위해 사용될 수 있다. 일실시예에 따르면, 리시버는 스피커와 별개로, 또는 그 일부로서 구현될 수 있다.The sound output module 155 may output sound signals to the outside of the electronic device 101 . The sound output module 155 may include, for example, a speaker or a receiver. The speaker can be used for general purposes such as multimedia playback or recording playback. A receiver may be used to receive an incoming call. According to one embodiment, the receiver may be implemented separately from the speaker or as part of it.

디스플레이 모듈(160)은 전자 장치(101)의 외부(예: 사용자)로 정보를 시각적으로 제공할 수 있다. 디스플레이 모듈(160)은, 예를 들면, 디스플레이, 홀로그램 장치, 또는 프로젝터 및 해당 장치를 제어하기 위한 제어 회로를 포함할 수 있다. 일실시예에 따르면, 디스플레이 모듈(160)은 터치를 감지하도록 설정된 터치 센서, 또는 상기 터치에 의해 발생되는 힘의 세기를 측정하도록 설정된 압력 센서를 포함할 수 있다. The display module 160 may visually provide information to the outside of the electronic device 101 (eg, a user). The display module 160 may include, for example, a display, a hologram device, or a projector and a control circuit for controlling the device. According to one embodiment, the display module 160 may include a touch sensor set to detect a touch or a pressure sensor set to measure the intensity of force generated by the touch.

오디오 모듈(170)은 소리를 전기 신호로 변환시키거나, 반대로 전기 신호를 소리로 변환시킬 수 있다. 일실시예에 따르면, 오디오 모듈(170)은, 입력 모듈(150)을 통해 소리를 획득하거나, 음향 출력 모듈(155), 또는 전자 장치(101)와 직접 또는 무선으로 연결된 외부 전자 장치(예: 전자 장치(102))(예: 스피커 또는 헤드폰)를 통해 소리를 출력할 수 있다.The audio module 170 may convert sound into an electrical signal or vice versa. According to one embodiment, the audio module 170 acquires sound through the input module 150, the sound output module 155, or an external electronic device connected directly or wirelessly to the electronic device 101 (eg: Sound may be output through the electronic device 102 (eg, a speaker or a headphone).

센서 모듈(176)은 전자 장치(101)의 작동 상태(예: 전력 또는 온도), 또는 외부의 환경 상태(예: 사용자 상태)를 감지하고, 감지된 상태에 대응하는 전기 신호 또는 데이터 값을 생성할 수 있다. 일실시예에 따르면, 센서 모듈(176)은, 예를 들면, 제스처 센서, 자이로 센서, 기압 센서, 마그네틱 센서, 가속도 센서, 그립 센서, 근접 센서, 컬러 센서, IR(infrared) 센서, 생체 센서, 온도 센서, 습도 센서, 또는 조도 센서를 포함할 수 있다. The sensor module 176 detects an operating state (eg, power or temperature) of the electronic device 101 or an external environmental state (eg, a user state), and generates an electrical signal or data value corresponding to the detected state. can do. According to one embodiment, the sensor module 176 may include, for example, a gesture sensor, a gyro sensor, an air pressure sensor, a magnetic sensor, an acceleration sensor, a grip sensor, a proximity sensor, a color sensor, an IR (infrared) sensor, a bio sensor, It may include a temperature sensor, humidity sensor, or light sensor.

인터페이스(177)는 전자 장치(101)가 외부 전자 장치(예: 전자 장치(102))와 직접 또는 무선으로 연결되기 위해 사용될 수 있는 하나 이상의 지정된 프로토콜들을 지원할 수 있다. 일실시예에 따르면, 인터페이스(177)는, 예를 들면, HDMI(high definition multimedia interface), USB(universal serial bus) 인터페이스, SD카드 인터페이스, 또는 오디오 인터페이스를 포함할 수 있다.The interface 177 may support one or more designated protocols that may be used to directly or wirelessly connect the electronic device 101 to an external electronic device (eg, the electronic device 102). According to one embodiment, the interface 177 may include, for example, a high definition multimedia interface (HDMI), a universal serial bus (USB) interface, an SD card interface, or an audio interface.

연결 단자(178)는, 그를 통해서 전자 장치(101)가 외부 전자 장치(예: 전자 장치(102))와 물리적으로 연결될 수 있는 커넥터를 포함할 수 있다. 일실시예에 따르면, 연결 단자(178)는, 예를 들면, HDMI 커넥터, USB 커넥터, SD 카드 커넥터, 또는 오디오 커넥터(예: 헤드폰 커넥터)를 포함할 수 있다.The connection terminal 178 may include a connector through which the electronic device 101 may be physically connected to an external electronic device (eg, the electronic device 102). According to one embodiment, the connection terminal 178 may include, for example, an HDMI connector, a USB connector, an SD card connector, or an audio connector (eg, a headphone connector).

햅틱 모듈(179)은 전기적 신호를 사용자가 촉각 또는 운동 감각을 통해서 인지할 수 있는 기계적인 자극(예: 진동 또는 움직임) 또는 전기적인 자극으로 변환할 수 있다. 일실시예에 따르면, 햅틱 모듈(179)은, 예를 들면, 모터, 압전 소자, 또는 전기 자극 장치를 포함할 수 있다.The haptic module 179 may convert electrical signals into mechanical stimuli (eg, vibration or motion) or electrical stimuli that a user may perceive through tactile or kinesthetic senses. According to one embodiment, the haptic module 179 may include, for example, a motor, a piezoelectric element, or an electrical stimulation device.

카메라 모듈(180)은 정지 영상 및 동영상을 촬영할 수 있다. 일실시예에 따르면, 카메라 모듈(180)은 하나 이상의 렌즈들, 이미지 센서들, 이미지 시그널 프로세서들, 또는 플래시들을 포함할 수 있다.The camera module 180 may capture still images and moving images. According to one embodiment, the camera module 180 may include one or more lenses, image sensors, image signal processors, or flashes.

전력 관리 모듈(188)은 전자 장치(101)에 공급되는 전력을 관리할 수 있다. 일실시예에 따르면, 전력 관리 모듈(188)은, 예를 들면, PMIC(power management integrated circuit)의 적어도 일부로서 구현될 수 있다.The power management module 188 may manage power supplied to the electronic device 101 . According to one embodiment, the power management module 188 may be implemented as at least part of a power management integrated circuit (PMIC), for example.

배터리(189)는 전자 장치(101)의 적어도 하나의 구성요소에 전력을 공급할 수 있다. 일실시예에 따르면, 배터리(189)는, 예를 들면, 재충전 불가능한 1차 전지, 재충전 가능한 2차 전지 또는 연료 전지를 포함할 수 있다.The battery 189 may supply power to at least one component of the electronic device 101 . According to one embodiment, the battery 189 may include, for example, a non-rechargeable primary cell, a rechargeable secondary cell, or a fuel cell.

통신 모듈(190)은 전자 장치(101)와 외부 전자 장치(예: 전자 장치(102), 전자 장치(104), 또는 서버(108)) 간의 직접(예: 유선) 통신 채널 또는 무선 통신 채널의 수립, 및 수립된 통신 채널을 통한 통신 수행을 지원할 수 있다. 통신 모듈(190)은 프로세서(120)(예: 어플리케이션 프로세서)와 독립적으로 운영되고, 직접(예: 유선) 통신 또는 무선 통신을 지원하는 하나 이상의 커뮤니케이션 프로세서를 포함할 수 있다. 일실시예에 따르면, 통신 모듈(190)은 무선 통신 모듈(192)(예: 셀룰러 통신 모듈, 근거리 무선 통신 모듈, 또는 GNSS(global navigation satellite system) 통신 모듈) 또는 유선 통신 모듈(194)(예: LAN(local area network) 통신 모듈, 또는 전력선 통신 모듈)을 포함할 수 있다. 이들 통신 모듈 중 해당하는 통신 모듈은 제 1 네트워크(198)(예: 블루투스, WiFi(wireless fidelity) direct 또는 IrDA(infrared data association)와 같은 근거리 통신 네트워크) 또는 제 2 네트워크(199)(예: 레거시 셀룰러 네트워크, 5G 네트워크, 차세대 통신 네트워크, 인터넷, 또는 컴퓨터 네트워크(예: LAN 또는 WAN)와 같은 원거리 통신 네트워크)를 통하여 외부의 전자 장치(104)와 통신할 수 있다. 이런 여러 종류의 통신 모듈들은 하나의 구성요소(예: 단일 칩)로 통합되거나, 또는 서로 별도의 복수의 구성요소들(예: 복수 칩들)로 구현될 수 있다. 무선 통신 모듈(192)은 가입자 식별 모듈(196)에 저장된 가입자 정보(예: 국제 모바일 가입자 식별자(IMSI))를 이용하여 제 1 네트워크(198) 또는 제 2 네트워크(199)와 같은 통신 네트워크 내에서 전자 장치(101)를 확인 또는 인증할 수 있다. The communication module 190 is a direct (eg, wired) communication channel or a wireless communication channel between the electronic device 101 and an external electronic device (eg, the electronic device 102, the electronic device 104, or the server 108). Establishment and communication through the established communication channel may be supported. The communication module 190 may include one or more communication processors that operate independently of the processor 120 (eg, an application processor) and support direct (eg, wired) communication or wireless communication. According to one embodiment, the communication module 190 is a wireless communication module 192 (eg, a cellular communication module, a short-range wireless communication module, or a global navigation satellite system (GNSS) communication module) or a wired communication module 194 (eg, : a local area network (LAN) communication module or a power line communication module). Among these communication modules, a corresponding communication module is a first network 198 (eg, a short-range communication network such as Bluetooth, wireless fidelity (WiFi) direct, or infrared data association (IrDA)) or a second network 199 (eg, legacy It may communicate with the external electronic device 104 through a cellular network, a 5G network, a next-generation communication network, the Internet, or a telecommunications network such as a computer network (eg, a LAN or a WAN). These various types of communication modules may be integrated as one component (eg, a single chip) or implemented as a plurality of separate components (eg, multiple chips). The wireless communication module 192 uses subscriber information (eg, International Mobile Subscriber Identifier (IMSI)) stored in the subscriber identification module 196 within a communication network such as the first network 198 or the second network 199. The electronic device 101 may be identified or authenticated.

무선 통신 모듈(192)은 4G 네트워크 이후의 5G 네트워크 및 차세대 통신 기술, 예를 들어, NR 접속 기술(new radio access technology)을 지원할 수 있다. NR 접속 기술은 고용량 데이터의 고속 전송(eMBB(enhanced mobile broadband)), 단말 전력 최소화와 다수 단말의 접속(mMTC(massive machine type communications)), 또는 고신뢰도와 저지연(URLLC(ultra-reliable and low-latency communications))을 지원할 수 있다. 무선 통신 모듈(192)은, 예를 들어, 높은 데이터 전송률 달성을 위해, 고주파 대역(예: mmWave 대역)을 지원할 수 있다. 무선 통신 모듈(192)은 고주파 대역에서의 성능 확보를 위한 다양한 기술들, 예를 들어, 빔포밍(beamforming), 거대 배열 다중 입출력(massive MIMO(multiple-input and multiple-output)), 전차원 다중입출력(FD-MIMO: full dimensional MIMO), 어레이 안테나(array antenna), 아날로그 빔형성(analog beam-forming), 또는 대규모 안테나(large scale antenna)와 같은 기술들을 지원할 수 있다. 무선 통신 모듈(192)은 전자 장치(101), 외부 전자 장치(예: 전자 장치(104)) 또는 네트워크 시스템(예: 제 2 네트워크(199))에 규정되는 다양한 요구사항을 지원할 수 있다. 일실시예에 따르면, 무선 통신 모듈(192)은 eMBB 실현을 위한 Peak data rate(예: 20Gbps 이상), mMTC 실현을 위한 손실 Coverage(예: 164dB 이하), 또는 URLLC 실현을 위한 U-plane latency(예: 다운링크(DL) 및 업링크(UL) 각각 0.5ms 이하, 또는 라운드 트립 1ms 이하)를 지원할 수 있다.The wireless communication module 192 may support a 5G network after a 4G network and a next-generation communication technology, for example, NR access technology (new radio access technology). NR access technologies include high-speed transmission of high-capacity data (enhanced mobile broadband (eMBB)), minimization of terminal power and access of multiple terminals (massive machine type communications (mMTC)), or high reliability and low latency (ultra-reliable and low latency (URLLC)). -latency communications)) can be supported. The wireless communication module 192 may support a high frequency band (eg, mmWave band) to achieve a high data rate, for example. The wireless communication module 192 uses various technologies for securing performance in a high frequency band, such as beamforming, massive multiple-input and multiple-output (MIMO), and full-dimensional multiplexing. Technologies such as input/output (FD-MIMO: full dimensional MIMO), array antenna, analog beam-forming, or large scale antenna may be supported. The wireless communication module 192 may support various requirements defined for the electronic device 101, an external electronic device (eg, the electronic device 104), or a network system (eg, the second network 199). According to one embodiment, the wireless communication module 192 is a peak data rate for eMBB realization (eg, 20 Gbps or more), a loss coverage for mMTC realization (eg, 164 dB or less), or a U-plane latency for URLLC realization (eg, Example: downlink (DL) and uplink (UL) each of 0.5 ms or less, or round trip 1 ms or less) may be supported.

안테나 모듈(197)은 신호 또는 전력을 외부(예: 외부의 전자 장치)로 송신하거나 외부로부터 수신할 수 있다. 일실시예에 따르면, 안테나 모듈(197)은 서브스트레이트(예: PCB) 위에 형성된 도전체 또는 도전성 패턴으로 이루어진 방사체를 포함하는 안테나를 포함할 수 있다. 일실시예에 따르면, 안테나 모듈(197)은 복수의 안테나들(예: 어레이 안테나)을 포함할 수 있다. 이런 경우, 제 1 네트워크(198) 또는 제 2 네트워크(199)와 같은 통신 네트워크에서 사용되는 통신 방식에 적합한 적어도 하나의 안테나가, 예를 들면, 통신 모듈(190)에 의하여 상기 복수의 안테나들로부터 선택될 수 있다. 신호 또는 전력은 상기 선택된 적어도 하나의 안테나를 통하여 통신 모듈(190)과 외부의 전자 장치 간에 송신되거나 수신될 수 있다. 어떤 실시예에 따르면, 방사체 이외에 다른 부품(예: RFIC(radio frequency integrated circuit))이 추가로 안테나 모듈(197)의 일부로 형성될 수 있다. The antenna module 197 may transmit or receive signals or power to the outside (eg, an external electronic device). According to one embodiment, the antenna module 197 may include an antenna including a radiator formed of a conductor or a conductive pattern formed on a substrate (eg, PCB). According to one embodiment, the antenna module 197 may include a plurality of antennas (eg, an array antenna). In this case, at least one antenna suitable for a communication method used in a communication network such as the first network 198 or the second network 199 is selected from the plurality of antennas by the communication module 190, for example. can be chosen A signal or power may be transmitted or received between the communication module 190 and an external electronic device through the selected at least one antenna. According to some embodiments, other components (eg, a radio frequency integrated circuit (RFIC)) may be additionally formed as a part of the antenna module 197 in addition to the radiator.

다양한 실시예에 따르면, 안테나 모듈(197)은 mmWave 안테나 모듈을 형성할 수 있다. 일실시예에 따르면, mmWave 안테나 모듈은 인쇄 회로 기판, 상기 인쇄 회로 기판의 제 1 면(예: 아래 면)에 또는 그에 인접하여 배치되고 지정된 고주파 대역(예: mmWave 대역)을 지원할 수 있는 RFIC, 및 상기 인쇄 회로 기판의 제 2 면(예: 윗 면 또는 측 면)에 또는 그에 인접하여 배치되고 상기 지정된 고주파 대역의 신호를 송신 또는 수신할 수 있는 복수의 안테나들(예: 어레이 안테나)을 포함할 수 있다.According to various embodiments, the antenna module 197 may form a mmWave antenna module. According to one embodiment, the mmWave antenna module includes a printed circuit board, an RFIC disposed on or adjacent to a first surface (eg, a lower surface) of the printed circuit board and capable of supporting a designated high frequency band (eg, mmWave band); and a plurality of antennas (eg, array antennas) disposed on or adjacent to a second surface (eg, a top surface or a side surface) of the printed circuit board and capable of transmitting or receiving signals of the designated high frequency band. can do.

상기 구성요소들 중 적어도 일부는 주변 기기들간 통신 방식(예: 버스, GPIO(general purpose input and output), SPI(serial peripheral interface), 또는 MIPI(mobile industry processor interface))을 통해 서로 연결되고 신호(예: 명령 또는 데이터)를 상호간에 교환할 수 있다.At least some of the components are connected to each other through a communication method between peripheral devices (eg, a bus, general purpose input and output (GPIO), serial peripheral interface (SPI), or mobile industry processor interface (MIPI)) and signal ( e.g. commands or data) can be exchanged with each other.

일실시예에 따르면, 명령 또는 데이터는 제 2 네트워크(199)에 연결된 서버(108)를 통해서 전자 장치(101)와 외부의 전자 장치(104)간에 송신 또는 수신될 수 있다. 외부의 전자 장치(102, 또는 104) 각각은 전자 장치(101)와 동일한 또는 다른 종류의 장치일 수 있다. 일실시예에 따르면, 전자 장치(101)에서 실행되는 동작들의 전부 또는 일부는 외부의 전자 장치들(102, 104, 또는 108) 중 하나 이상의 외부의 전자 장치들에서 실행될 수 있다. 예를 들면, 전자 장치(101)가 어떤 기능이나 서비스를 자동으로, 또는 사용자 또는 다른 장치로부터의 요청에 반응하여 수행해야 할 경우에, 전자 장치(101)는 기능 또는 서비스를 자체적으로 실행시키는 대신에 또는 추가적으로, 하나 이상의 외부의 전자 장치들에게 그 기능 또는 그 서비스의 적어도 일부를 수행하라고 요청할 수 있다. 상기 요청을 수신한 하나 이상의 외부의 전자 장치들은 요청된 기능 또는 서비스의 적어도 일부, 또는 상기 요청과 관련된 추가 기능 또는 서비스를 실행하고, 그 실행의 결과를 전자 장치(101)로 전달할 수 있다. 전자 장치(101)는 상기 결과를, 그대로 또는 추가적으로 처리하여, 상기 요청에 대한 응답의 적어도 일부로서 제공할 수 있다. 이를 위하여, 예를 들면, 클라우드 컴퓨팅, 분산 컴퓨팅, 모바일 에지 컴퓨팅(MEC: mobile edge computing), 또는 클라이언트-서버 컴퓨팅 기술이 이용될 수 있다. 전자 장치(101)는, 예를 들어, 분산 컴퓨팅 또는 모바일 에지 컴퓨팅을 이용하여 초저지연 서비스를 제공할 수 있다. 다른 실시예에 있어서, 외부의 전자 장치(104)는 IoT(internet of things) 기기를 포함할 수 있다. 서버(108)는 기계 학습 및/또는 신경망을 이용한 지능형 서버일 수 있다. 일실시예에 따르면, 외부의 전자 장치(104) 또는 서버(108)는 제 2 네트워크(199) 내에 포함될 수 있다. 전자 장치(101)는 5G 통신 기술 및 IoT 관련 기술을 기반으로 지능형 서비스(예: 스마트 홈, 스마트 시티, 스마트 카, 또는 헬스 케어)에 적용될 수 있다. According to an embodiment, commands or data may be transmitted or received between the electronic device 101 and the external electronic device 104 through the server 108 connected to the second network 199 . Each of the external electronic devices 102 or 104 may be the same as or different from the electronic device 101 . According to an embodiment, all or part of operations executed in the electronic device 101 may be executed in one or more external electronic devices among the external electronic devices 102 , 104 , or 108 . For example, when the electronic device 101 needs to perform a certain function or service automatically or in response to a request from a user or another device, the electronic device 101 instead of executing the function or service by itself. Alternatively or additionally, one or more external electronic devices may be requested to perform the function or at least part of the service. One or more external electronic devices receiving the request may execute at least a part of the requested function or service or an additional function or service related to the request, and deliver the execution result to the electronic device 101 . The electronic device 101 may provide the result as at least part of a response to the request as it is or additionally processed. To this end, for example, cloud computing, distributed computing, mobile edge computing (MEC), or client-server computing technology may be used. The electronic device 101 may provide an ultra-low latency service using, for example, distributed computing or mobile edge computing. In another embodiment, the external electronic device 104 may include an internet of things (IoT) device. Server 108 may be an intelligent server using machine learning and/or neural networks. According to one embodiment, the external electronic device 104 or server 108 may be included in the second network 199 . The electronic device 101 may be applied to intelligent services (eg, smart home, smart city, smart car, or health care) based on 5G communication technology and IoT-related technology.

도 2는 다양한 실시 예에 따른 대상 데이터 수집 및 처리를 위한 데이터 처리 시스템의 구성도(200)이다. 2 is a configuration diagram 200 of a data processing system for collecting and processing target data according to various embodiments.

도 2를 참조하면, 데이터 처리 시스템(200)은 전자 장치(101), 외부 전자 장치(102), 지능형 서버(250) 및 하나 이상의 서비스 사업자(260)를 포함할 수 있다. Referring to FIG. 2 , the data processing system 200 may include an electronic device 101 , an external electronic device 102 , an intelligent server 250 and one or more service providers 260 .

다양한 실시 예에 따르면, 지능형 서버(250)는 네트워크(240)를 통해 전자 장치(101), 외부 전자 장치(102) 및/또는 하나 이상의 서비스 사업자(260)와 통신을 수행할 수 있다. According to various embodiments, the intelligent server 250 may communicate with the electronic device 101, the external electronic device 102, and/or one or more service providers 260 through the network 240.

지능형 서버(250)는 다양한 형태의 입력 데이터를 네트워크(240)를 통해 수집할 수 있으며, 이를 데이터베이스 형태로 축적 및 관리할 수 있다. 일 실시 예에 따르면, 지능형 서버(250)는 입력 데이터를 수신할 수 있는데, 예를 들어, 훈련 데이터(training data) 또는 가공되지 않은 데이터를 수신할 수 있다. 지능형 서버(250)는 입력 데이터에 기초하여 감지 조건을 생성할 수 있다. 이러한 입력 데이터는 지능형 서버(250)에 탑재된 학습 모델을 학습시키는데 이용될 수 있다.The intelligent server 250 may collect various types of input data through the network 240, and may accumulate and manage them in the form of a database. According to one embodiment, the intelligent server 250 may receive input data, for example, may receive training data or raw data. Intelligent server 250 may generate a sensing condition based on the input data. This input data can be used to learn the learning model loaded in the intelligent server 250.

본 개시의 '룰(rule)'이라는 용어는 일반적으로, 인공 지능 모델에 기반하여 다양한 종류의 입력 데이터를 처리하기 위한 감지 조건을 의미할 수 있다. 인공 지능 모델에 기반하여 처리될 다양한 종류의 입력 데이터(또는 컨텐트)들은 이미지, 영상, 및 오디오 데이터와 같은 미디어 데이터, 전자적인 문서를 포함할 수 있으며, 이에 제한되지 않고 인공 지능 모델에 의해 전자적으로 분석 가능한 종류의 전자적인 데이터들(예: 소프트웨어, 센서의 값들)을 더 포함할 수 있다.The term 'rule' of the present disclosure may generally mean a sensing condition for processing various types of input data based on an artificial intelligence model. Various types of input data (or content) to be processed based on the artificial intelligence model may include media data such as images, video, and audio data, and electronic documents, but are not limited thereto, and may be electronically processed by the artificial intelligence model. It may further include electronic data of a kind that can be analyzed (eg, values of software and sensors).

예를 들어, 카메라 어플리케이션이 실행 및/또는 구동되는 경우, 학습 모델에 의해, 입력 데이터로서 이미지 및/또는 영상을 획득할 수 있는데, 지능형 서버(250)는 이러한 입력 데이터로부터 객체 인식, 객체 추적, 영상 검색, 사람 인식, 장면 이해, 또는 공간 이해를 통해 감지 조건에 대응하는 다양한 모션을 검출하거나 주변 상황을 검출할 수 있다. 여기서, 입력 데이터로부터 다양한 모션을 검출하거나 주변 상황을 검출하기 위해서는 복수의 감지 조건들이 요구되며, 이러한 복수의 감지 조건들의 조합을 룰셋(rule set)이라고 칭할 수 있다. 또한, 이러한 조합들의 저장 형태를 룰셋 테이블(table)이라고 칭할 수 있다. For example, when a camera application is executed and/or driven, an image and/or video may be acquired as input data by a learning model, and the intelligent server 250 may perform object recognition, object tracking, Various motions corresponding to detection conditions or surrounding situations may be detected through image search, person recognition, scene understanding, or spatial understanding. Here, a plurality of sensing conditions are required to detect various motions from input data or to detect a surrounding situation, and a combination of the plurality of sensing conditions may be referred to as a rule set. In addition, the storage form of these combinations may be referred to as a ruleset table.

일 실시 예에 따르면, 룰셋은 학습 모델 또는 인공 지능 시스템 예컨대, 지능형 서버(250)에 의해 제공되거나, 생성될 수 있다. 룰셋은 미리 정의된 감지 조건들의 집합(또는 룰셋 테이블)에서 선택될 수 있거나, 사용자 요청에 응답하여 실시간으로 생성될 수 있다. 또한, 상기 미리 정의된 감지 조건들의 집합은 학습 모델에 기반하여 생성되거나 갱신될 수 있다. 예를 들어, 지능형 서버(250)는 미리 정의된 복수의 감지 조건 집합들 중 적어도 하나의 감지 조건 집합(예: 룰셋)을 선택하거나, 동적으로 또는 실시간으로 감지 조건들을 생성할 수도 있다. 학습된 감지 조건들은 다양한 형태의 저장 포맷으로 저장될 수 있으며, 예를 들어 데이터베이스 테이블(table), 스프레드시트 파일, 또는 텍스트 파일 형태로 저장될 수 있다. 여기서, 감지 조건들의 집합은 단순히 감지 조건들이 나열된 것이 아니라 감지 조건들을 구성하는 구성 요소들을 특정 조건에 따라 조합하거나 분류함으로써 묶음 처리한 것일 수 있다. According to an embodiment, the ruleset may be provided or generated by a learning model or an artificial intelligence system, for example, the intelligent server 250 . The rule set may be selected from a set of predefined detection conditions (or a rule set table) or may be generated in real time in response to a user request. Also, the set of predefined sensing conditions may be generated or updated based on a learning model. For example, the intelligent server 250 may select at least one sensing condition set (eg, a rule set) from among a plurality of predefined sensing condition sets, or may dynamically or in real time generate sensing conditions. The learned sensing conditions may be stored in various types of storage formats, such as database tables, spreadsheet files, or text files. Here, the set of sensing conditions may not simply be a list of sensing conditions, but may be grouped by combining or classifying components constituting the sensing conditions according to a specific condition.

일 실시 예에 따르면, 인공 지능 모델은 지정된 종류의 학습 알고리즘을 기반으로 학습되는 모델로서, 다양한 종류의 데이터를 입력받아 연산하여 결과 데이터를 출력(또는 획득)하도록 구현된 인공 지능 모델들일 수 있다. 이러한 입력 데이터를 기반으로 학습이 수행되어 학습 모델(예: 머신 러닝 모델, 및 딥 러닝 모델)이 생성됨으로써 지능형 서버(250)에 저장되거나, 또는 전자 장치(101)로 전달되어 저장될 수 있다. 예를 들어, 학습 모델은 적어도 하나의 하드웨어 칩 형태로 제적되어 지능형 서버(250)에 탑재될 수 있다. 학습 모델은 인공 지능을 위한 전용 하드웨어 칩 형태로 제적될 수 있고, 또는 기존의 범용 프로세서(예: CPU, AP)의 일부로 제작되어 전술한 각종 전자 장치에 탑재될 수도 있다. 상기 지능형 서버(250)에 탑재되는 학습 모델은 전자 장치(101) 또는 외부 전자 장치(102)에 탑재될 수 있으며, 또는 서비스 사업자(260)에도 탑재될 수 있다. 일 실시 예에서 지능형 서버(250)로부터 학습 모델이 전자 장치(101)로 수신(또는 다운로드)될 수 있으나, 이에 제한되지 않는다. According to an embodiment, the artificial intelligence model is a model learned based on a designated type of learning algorithm, and may be artificial intelligence models implemented to output (or obtain) result data by receiving and calculating various types of data. Learning is performed based on these input data, and learning models (eg, machine learning models and deep learning models) are generated and stored in the intelligent server 250 or transferred to the electronic device 101 and stored therein. For example, the learning model may be manufactured in the form of at least one hardware chip and loaded into the intelligent server 250 . The learning model may be manufactured in the form of a dedicated hardware chip for artificial intelligence, or may be manufactured as a part of an existing general-purpose processor (eg, CPU, AP) and installed in various electronic devices described above. The learning model loaded in the intelligent server 250 may be loaded in the electronic device 101 or the external electronic device 102, or may be loaded in the service provider 260 as well. In one embodiment, the learning model may be received (or downloaded) from the intelligent server 250 to the electronic device 101, but is not limited thereto.

일 실시 예에 따르면, 입력 데이터에 기반하여 생성된 감지 조건들(또는 룰셋) 또는 훈련 데이터에 기반하여 학습된 감지 조건들은 서비스 사업자(260)가 액세스 가능한 저장 시스템에 저장될 수 있다. 여기서, 서비스 사업자(260)가 액세스 가능한 저장 시스템은 지능형 서버(250) 내에 구현되거나 지능형 서버(250)와는 독립적으로 구현되어 지능형 서버(250)에 의해 관리될 수 있다. According to an embodiment, sensing conditions (or rule sets) generated based on input data or sensing conditions learned based on training data may be stored in a storage system accessible to the service provider 260 . Here, the storage system accessible to the service provider 260 may be implemented within the intelligent server 250 or implemented independently of the intelligent server 250 and managed by the intelligent server 250 .

지능형 서버(250)는 입력 데이터에 기반하여 생성된 또는 학습된 감지 조건들을 온라인 시험 테스트에 활용될 수 있도록 예컨대, 개발 업체와 같은 서비스 사업자(260)에 제공할 수 있다. The intelligent server 250 may provide the sensing conditions generated or learned based on the input data to the service provider 260, such as a development company, so that they may be used for online testing.

서비스 사업자(260)는 지능형 서버(250)로부터 제공되는 룰셋에 기반하여, 실시간으로 입력되는 데이터를 분석하고 분석 결과에 따라 사용자에게 다양한 서비스를 제공할 수 있다. 또한, 서비스 사업자(260)는 분석된 데이터와 관련된 서비스를 생성하거나 분석된 데이터를 전자 장치(101) 또는 외부 전자 장치(102)로 제공할 수도 있다. 예를 들어, 서비스 사업자(260)는 온라인 시험 서비스를 전자 장치(101)로 제공할 수 있다. 이러한 서비스 사업자(260)는 서버 형태로 구현될 수 있으며, 분석 결과에 따라 사용자에게 다양한 서비스를 제공할 수 있으며, 서비스의 종류는 이에 한정되지 않을 수 있다.The service provider 260 may analyze input data in real time based on a rule set provided from the intelligent server 250 and provide various services to users according to the analysis result. In addition, the service provider 260 may create a service related to the analyzed data or provide the analyzed data to the electronic device 101 or the external electronic device 102 . For example, the service provider 260 may provide an online test service to the electronic device 101 . The service provider 260 may be implemented in the form of a server, and may provide various services to users according to analysis results, and the types of services may not be limited thereto.

일 실시 예에 따르면, 서비스 사업자(260)는 응시자가 시험을 수행하는 데에 필요한 정보를 제공하는 응시 서버로 동작할 수 있으며, 또한 응시자의 응시 상황을 실시간으로 감독하는 감독 서버로도 동작할 수 있다. According to an embodiment, the service provider 260 may operate as a test-taking server that provides information necessary for the test taker to take the test, and may also operate as a supervising server that supervises the test taker's test-taking situation in real time. there is.

다양한 실시 예에 따르면, 전자 장치(101)는 지능형 서버(250)에서와 같이 동작할 수 있다. 일 실시 예에 따르면, 전자 장치(101)는 입력 데이터에 기반하여 감지 조건들을 생성하거나 학습시킬 수 있다. 예를 들어, 전자 장치(101)의 카메라 어플리케이션이 실행 및/또는 구동되는 경우, 학습 모델에 의해, 입력 데이터로부터 감지 조건들에 기반하여 감지 조건에 대응하는 다양한 모션을 검출하거나 주변 상황을 검출할 수 있다. 다른 실시 예에 따르면, 전자 장치(101)는 응시자의 전자 장치일 수 있다. 예를 들어, 전자 장치(101)는 응시자가 시험을 응시하기 위하여 사용하는 전자 장치일 수 있다. 예를 들어, 전자 장치(101)는 서비스 사업자(260)로부터 시험 문제를 포함하는 시험 응시와 관련된 정보를 수신할 수 있고, 응시자로부터 시험 응시에 필요한 정보(예: 응시자 정보)를 서비스 사업자(260)로 전송할 수 있다. According to various embodiments, the electronic device 101 may operate as the intelligent server 250. According to an embodiment, the electronic device 101 may generate or learn sensing conditions based on input data. For example, when a camera application of the electronic device 101 is executed and/or driven, a learning model may detect various motions corresponding to detection conditions or detect surrounding situations based on detection conditions from input data. can According to another embodiment, the electronic device 101 may be an electronic device of a test taker. For example, the electronic device 101 may be an electronic device used by a test taker to take an exam. For example, the electronic device 101 may receive information related to taking a test including test questions from the service provider 260, and may transmit information necessary for taking the test (eg, test taker information) from the service provider 260. ) can be transmitted.

다양한 실시 예에 따르면, 외부 전자 장치(102)는 응시자를 원격으로 감독하기 위한 장치로, 실시간으로 응시자의 응시 영상을 촬영하는 장치일 수 있다. 외부 전자 장치(102)는 응시자 및 주변 환경에 대한 영상을 포함하는 시험 감독에 필요한 정보(예: 실시간 응시자 영상)를 실시간으로 서비스 사업자(260)에게 전송할 수 있다. 이에 따라 서비스 사업자(260)는 수신한 정보를 분석할 수 있으며, 분석된 정보를 이용하여 응시자의 응시 상황을 실시간으로 감독하는 감독 서버로 동작할 수 있다. According to various embodiments, the external electronic device 102 is a device for remotely supervising a test taker, and may be a device that captures a gaze image of the test taker in real time. The external electronic device 102 may transmit to the service provider 260 information necessary for test supervision including images of the candidate and the surrounding environment (eg, real-time video of the candidate) to the service provider 260 . Accordingly, the service provider 260 can analyze the received information and operate as a supervision server that supervises the gaze of the candidate in real time using the analyzed information.

전술한 바와 같이 시험 응시용 서버와 시험 감독용 서버는 각각 독립적인 장치로서 동작할 수 있으나, 이에 한정되지 않고 하나의 서버(예: 서비스 사업자(260))에 포함되어 동작하는 것도 가능할 수 있다. 다른 실시 예에 따르면, 시험 응시용 전자 장치와 시험 감독용 전자 장치는 하나의 전자 장치(예: 전자 장치(101))에 포함되어 동작하는 것도 가능할 수 있다. As described above, the server for taking the test and the server for supervising the test may operate as independent devices, but are not limited thereto and may be included in one server (eg, the service provider 260) and operated. According to another embodiment, the electronic device for taking a test and the electronic device for supervising a test may be included in and operated as one electronic device (eg, the electronic device 101).

다양한 실시 예에 따르면, 서비스 사업자(260)는 부정 행위를 방지하기 위해 지능형 서버(250)로부터 제공된 감지 조건들에 기반하여 실시간으로 시험 응시자를 감독하는 서비스를 제공할 수 있다. 이러한 감지 조건들은 입력 데이터로부터 응시자의 부정 행위를 검출하기 위한 규정들일 수 있다. 지능형 서버(250)는 실시간으로 감독에 필요한 감지 조건들을 서비스 사업자(260)에게 제공할 수 있다. 이에 따라 서비스 사업자(260)는 실시간 응시자 영상 데이터를 인공지능 알고리즘을 이용하여 분석하고, 분석 결과 이상이 있는 경우 이상 상태를 알릴 수 있다. According to various embodiments, the service provider 260 may provide a service for supervising test takers in real time based on detection conditions provided from the intelligent server 250 in order to prevent cheating. These detection conditions may be rules for detecting cheating of the test taker from input data. The intelligent server 250 may provide the service provider 260 with sensing conditions necessary for supervision in real time. Accordingly, the service provider 260 may analyze the real-time candidate image data using an artificial intelligence algorithm, and notify an abnormal state if there is an abnormality as a result of the analysis.

전술한 전자 장치(101), 외부 전자 장치(102) 및/또는 서비스 사업자(260)의 각 동작은 지능형 서버(250)의 동작으로 대체될 수 있다. 예를 들어, 지능형 서버(250)에서 실시간 응시자 영상 데이터를 학습 모델을 이용하여 분석하고, 룰셋에 기반하여 분석 결과 이상이 있는 경우 감독 주체한테 이상 상태를 알릴 수 있다. Each operation of the aforementioned electronic device 101, external electronic device 102, and/or service provider 260 may be replaced with an operation of the intelligent server 250. For example, the intelligent server 250 analyzes real-time candidate image data using a learning model, and if there is an abnormality as a result of the analysis based on a rule set, the supervising subject may be notified of the abnormal state.

다양한 실시 예에 따르면, 검증 대상(예: 응시자)에 대한 검증의 기준이 되는 감지 조건들의 집합(예: 룰셋)이 다양하면서도 정확도가 높을 필요가 있는데, 응시자에 대한 입력 데이터를 분석하는 데에 이용되는 룰셋의 생성, 편집, 또는 학습에 대한 설명은 후술하기로 한다. According to various embodiments, a set of detection conditions (eg, a rule set) serving as a criterion for verification of a verification target (eg, a candidate) needs to be diverse and high in accuracy. Used to analyze input data for a candidate A description of creation, editing, or learning of the ruleset to be used will be described later.

도 3은 다양한 실시 예에 따른 지능형 영상 분석 방법을 제공하기 위한 장치의 내부 블록 구성도이다. 이하에서는 도 4를 참조하여 도 3에 대해서 설명한다. 도 4는 다양한 실시 예에 따른 분석 엔진의 동작을 설명하기 위한 도면이다. 일 실시 예에 따르면, 지능형 영상 분석 방법을 제공하기 위한 장치(이하, 지능형 영상 분석 장치)는 스마트 폰, 태블릿 PC와 같은 다양한 전자 장치로 구현될 수 있다.3 is an internal block diagram of an apparatus for providing an intelligent video analysis method according to various embodiments. Hereinafter, FIG. 3 will be described with reference to FIG. 4 . 4 is a diagram for explaining an operation of an analysis engine according to various embodiments. According to an embodiment, an apparatus for providing an intelligent video analysis method (hereinafter referred to as an intelligent video analysis apparatus) may be implemented in various electronic devices such as a smart phone and a tablet PC.

도 3을 참조하면, 지능형 영상 분석 장치(300)는 프로세서(320), 메모리(330) 및/또는 통신 인터페이스(390)를 포함할 수 있다. 도 3에서는 지능형 영상 분석 장치(300)의 구성의 예를 도시하나, 도 3에 도시되는 구성들에 제한되지 않고 도 3에 도시되는 구성들 보다 더 많은 구성들을 포함하도록 구현될 수 있다. 도 3의 지능형 영상 분석 장치(300)는 도 2의 전자 장치(101) 또는 지능형 서버(250)일 수 있다. 따라서 지능형 서버(250)의 동작은 전자 장치(101)에서의 동작으로 대체될 수 있다. 예를 들어, 지능형 서버(250)가 미리 저장된 학습 모델을 이용하여 동작할 수 있으나, 이에 제한되지 않고 다양한 실시 예에 따르면, 전자 장치(101)가 지능형 서버(250)로부터 상기 학습 모델을 수신(또는 다운로드)하여 입력 데이터를 처리하기 위한 룰셋을 생성, 편집, 또는 학습하는 동작을 수행할 수 있다. Referring to FIG. 3 , the intelligent video analysis device 300 may include a processor 320 , a memory 330 and/or a communication interface 390 . 3 shows an example of the configuration of the intelligent video analysis device 300, but is not limited to the configurations shown in FIG. 3 and may be implemented to include more configurations than those shown in FIG. The intelligent video analysis device 300 of FIG. 3 may be the electronic device 101 or the intelligent server 250 of FIG. 2 . Accordingly, the operation of the intelligent server 250 may be replaced with the operation of the electronic device 101 . For example, the intelligent server 250 may operate using a pre-stored learning model, but is not limited thereto, and according to various embodiments, the electronic device 101 receives the learning model from the intelligent server 250 ( or download) to generate, edit, or learn a rule set for processing input data.

도 3에서는 설명의 편의를 위해 지능형 영상 분석 장치(300)가 전자 장치(101)인 경우를 예로 들어 설명하나, 지능형 서버(250)로 대체될 수 있음은 물론이다. In FIG. 3 , for convenience of description, the case where the intelligent video analysis device 300 is the electronic device 101 is described as an example, but it can be replaced with the intelligent server 250 as a matter of course.

다양한 실시 예에 따르면, 통신 인터페이스(390)는 네트워크 또는 서버(예: 지능형 서버(250), 서비스 사업자(260))와 유선/무선으로 통신하기 위한 소프트웨어 및/또는 하드웨어 모듈을 포함할 수 있다. 통신 인터페이스(390)를 통해 외부로부터 입력 데이터 또는 학습된 감지 조건들을 제공받을 수 있으며, 다르게는 학습 모델을 제공받을 수도 있다. According to various embodiments, the communication interface 390 may include software and/or hardware modules for wired/wireless communication with a network or server (eg, the intelligent server 250 or the service provider 260). Input data or learned sensing conditions may be provided from the outside through the communication interface 390, or a learning model may be provided.

다양한 실시 예에 따르면, 메모리(330)는 인공지능 알고리즘을 이용하여 입력 데이터(예: 실시간 응시자 이미지 및/또는 영상)를 분석하는 동작에 대한 인스트럭션을 저장할 수 있다. 또한, 메모리(330)는 입력 데이터에 기반하여 생성된 감지 조건들(또는 룰셋) 또는 훈련 데이터에 기반하여 학습된 감지 조건들을 테이블 형태로 저장하는 데이터베이스(335)를 포함할 수 있다. According to various embodiments, the memory 330 may store instructions for an operation of analyzing input data (eg, a real-time candidate image and/or video) using an artificial intelligence algorithm. In addition, the memory 330 may include a database 335 that stores sensing conditions (or rule sets) generated based on input data or sensing conditions learned based on training data in a table form.

다양한 실시 예에 따르면, 프로세서(320)는 입력 데이터 획득 시 입력 데이터를 분석함으로써 감지 조건들을 구성하는 구성 요소들을 검출할 수 있다. 예를 들어, 프로세서(320)는 입력 데이터의 분석을 통해 입력 데이터로부터 신체 부위, 동작, 사물 또는 지속 시간 정보와 같은 구성 요소들을 획득할 수 있다. 이때, 프로세서(320)는 사용자 설정에 따른 각 구성 요소들을 조합함으로써 감지 조건들을 생성할 수 있으며, 다르게는 반복 학습을 통해 입력 데이터로부터 해당 구성 요소들을 획득하여 획득된 구성 요소들에 기반하여 자동으로 감지 조건들을 생성할 수도 있다. According to various embodiments, the processor 320 may detect components constituting the sensing conditions by analyzing the input data when obtaining the input data. For example, the processor 320 may obtain elements such as body parts, motions, objects, or duration information from the input data through analysis of the input data. At this time, the processor 320 may generate detection conditions by combining each component according to user settings, and otherwise obtain corresponding components from input data through repetitive learning and automatically based on the acquired components. You can also create detection conditions.

또한, 프로세서(320)는 학습된 감지 조건들에서 노이즈를 제거하고 정규화하는 동작을 수행할 수 있다. 이러한 반복 학습을 통해 응시자의 부정 행위를 감지하기 위한 기준이 되는 감지 조건들의 정확도가 높아질 수 있다. Also, the processor 320 may perform an operation of removing and normalizing noise from the learned sensing conditions. Through such repetitive learning, accuracy of detection conditions serving as criteria for detecting cheating of the test taker may be increased.

프로세서(320)는 입력 데이터에 기반하여 생성된 감지 조건들(또는 룰셋) 또는 훈련 데이터에 기반하여 학습된 감지 조건들의 조합 중에서 적어도 하나의 감지 조건에 기반하여 대상 이미지의 이상 유무를 검출할 수 있다. 여기서, 감지 조건들의 조합을 이용하여 대상 이미지의 이상 유무를 검출하기 위한 감지 목록을 생성할 수 있는데, 감지 목록은 학습된 감지 조건들의 조합 중에서 추천된 감지 조건을 포함하여 생성되거나 사용자 예컨대, 감독관의 선택에 대응하는 감지 조건을 포함하여 생성될 수 있다. 이와 같이 감지 목록은 기본적으로 제공되는 감지 목록 이외에 사용자 설정에 따라 원하는 감지 조건들을 선택으로써 수정될 수 있다. 예를 들어, 데이터베이스(335)에 저장된 학습된 감지 조건들의 조합 중에서 사용자가 사용하고 싶은 감지 조건의 조합을 선택할 수 있으며, 만일 학습된 감지 조건들의 조합 중에서 다시 사용하고 싶은 감지 조건이 없는 경우에는, 프로세서(320)는 입력 데이터에 기반하여 신규 감지 조건들을 생성할 수 있다. 이와 같이 사용자 설정에 따라 또는 자동으로 생성된 감지 조건들은 '나만의 감지 목록' 기능을 통해 별도로 관리될 수 있다.The processor 320 may detect whether the target image is abnormal based on at least one sensing condition among sensing conditions (or rule sets) generated based on input data or a combination of sensing conditions learned based on training data. . Here, a detection list for detecting anomalies in the target image may be generated using a combination of detection conditions. The detection list is generated by including a recommended detection condition among learned combinations of detection conditions or by a user, for example, a supervisor. It may be generated by including a detection condition corresponding to the selection. In this way, the detection list may be modified by selecting desired detection conditions according to user settings in addition to the detection list provided by default. For example, the user may select a combination of sensing conditions that the user wants to use from combinations of learned sensing conditions stored in the database 335, and if there is no sensing condition that the user wants to use again among the combinations of learned sensing conditions, The processor 320 may generate new sensing conditions based on input data. In this way, detection conditions generated automatically or according to user settings can be separately managed through the 'my detection list' function.

한편, 프로세서(320)는 입력 데이터로부터 감지 조건을 구성하기 위한 구성 요소들을 획득하기 위해 상기 입력 데이터에 대한 분석을 수행할 수 있다. 이러한 입력 데이터에 대한 분석을 구체적으로 설명하기 위해 도 4를 참조하기로 한다. Meanwhile, the processor 320 may perform analysis on the input data in order to obtain components for constructing a sensing condition from the input data. Reference will be made to FIG. 4 to describe the analysis of such input data in detail.

도 4는 다양한 실시 예에 따른 분석 엔진의 동작을 설명하기 위한 도면이다.4 is a diagram for explaining an operation of an analysis engine according to various embodiments.

도 4를 참조하면, 분석 엔진(400)은 신체 인식 엔진(410), 동작 분석 엔진(420), 사물 인식 엔진(430), 시간 분석 엔진(440) 및/또는 소리 분석 엔진(450)을 포함할 수 있다. 분석 엔진(400)은 도 3의 프로세서(320)에 탑재되거나 독립적인 형태로 구현된 모듈일 수 있다. Referring to FIG. 4 , the analysis engine 400 includes a body recognition engine 410, a motion analysis engine 420, an object recognition engine 430, a time analysis engine 440, and/or a sound analysis engine 450. can do. The analysis engine 400 may be a module mounted on the processor 320 of FIG. 3 or implemented in an independent form.

일 실시 예에 따르면, 신체 인식 엔진(410)은 입력 데이터(예: 이미지 및/또는 영상)를 획득 시 입력 데이터를 분석할 수 있다. 예를 들어, 신체 인식 엔진(410)은 입력 이미지에 포함된 객체들(예: 머리, 손, 몸, 팔과 같은 신체 부위들)을 인식 후 부위별로 검출된 객체들을 분류할 수 있다. 또한, 신체 인식 엔진(410)은 눈, 코, 입, 귀와 같은 얼굴 부위들을 인식할 수 있으며, 페이스 메쉬(face mesh)를 이용하여 얼굴 각도와 시선 방향을 식별할 수 있다. 따라서 신체 인식 엔진(410)을 통해 입력 데이터와 관련된 구성 요소는 신체 부위 별로 분류될 수 있다.According to an embodiment, the body recognition engine 410 may analyze input data (eg, images and/or videos) when acquiring them. For example, the body recognition engine 410 may recognize objects included in the input image (eg, body parts such as head, hand, body, and arm) and then classify the detected objects for each part. In addition, the body recognition engine 410 may recognize face parts such as eyes, nose, mouth, and ears, and may identify a face angle and a gaze direction using a face mesh. Therefore, through the body recognition engine 410, components related to the input data may be classified for each body part.

일 실시 예에 따르면, 동작 분석 엔진(420)은 입력 데이터(예: 이미지 및/또는 영상)에 포함된 객체의 상태(또는 동작)을 분석할 수 있다. 예를 들어, 동작 분석 엔진(420)은 포즈넷(posenet) 알고리즘 또는 바디픽스(body pix) 알고리즘을 이용하여 입력 이미지로부터 실시간으로 자세 추정(또는 골격 인식)을 통해 대상의 움직임, 누움, 달림, 팔 들기, 점프, 굽힘, 사라짐과 같은 행동을 식별할 수 있다. 따라서 동작 분석 엔진(420)을 통해 입력 데이터와 관련된 구성 요소는 행동 별로 분류될 수 있다. According to an embodiment, the motion analysis engine 420 may analyze a state (or motion) of an object included in input data (eg, an image and/or video). For example, the motion analysis engine 420 uses a posenet algorithm or a body pix algorithm to estimate motion, lying down, running, Behaviors such as raising arms, jumping, bending, and disappearing can be identified. Accordingly, components related to the input data may be classified according to behaviors through the motion analysis engine 420 .

일 실시 예에 따르면, 사물 인식 엔진(430)은 객체 검출(object detection) 기술을 이용하여 입력 데이터(예: 이미지 및/또는 영상)에 포함된 객체의 종류(예: 모니터, 휴대폰, 키보드, 책과 같은 사물)를 식별할 수 있다. 따라서 사물 인식 엔진(430)을 통해 입력 데이터와 관련된 구성 요소는 사물 별로 분류될 수 있다.According to an embodiment, the object recognition engine 430 uses object detection technology to determine the type of object (eg, a monitor, a mobile phone, a keyboard, a book) included in input data (eg, an image and/or video). objects) can be identified. Accordingly, components related to the input data may be classified for each object through the object recognition engine 430 .

일 실시 예에 따르면, 시간 분석 엔진(440)은 신체 인식 엔진(410), 동작 분석 엔진(420) 또는 사물 인식 엔진(430) 중 적어도 하나에 의해 검출되는 결과의 지속 시간을 분석할 수 있다. 예를 들어, 시간 분석 엔진(440)을 통해 입력 데이터와 관련된 구성 요소는 행동에 따른 소요 시간, 유지 시간, 또는 노출 시간 별로 분류될 수 있다. According to an embodiment, the time analysis engine 440 may analyze the duration of a result detected by at least one of the body recognition engine 410, the motion analysis engine 420, and the object recognition engine 430. For example, components related to the input data through the time analysis engine 440 may be classified according to required time, maintenance time, or exposure time according to the action.

일 실시 예에 따르면, 소리 분석 엔진(450)은 마이크를 통해 입력 데이터로서 발화 및/또는 주변 소리들과 같은 오디오 데이터를 획득하여 이를 분석할 수 있다. 외부에서 발생되는 사운드는 화자들(예: 사용자 및/또는 다른 화자(또는 타인))의 음성들(또는 발화들), 생활 노이즈, 주변(또는 배경) 노이즈들을 포함할 수 있다. 예를 들어, 소리 분석 엔진(450)은 db 크기를 분석하거나 언어를 인식하는 신호 처리를 통해 오디오 데이터와 관련된 구성 요소로 음성과 노이즈를 분류할 수 있다. According to an embodiment, the sound analysis engine 450 may acquire audio data such as speech and/or ambient sounds as input data through a microphone and analyze them. Externally generated sounds may include voices (or utterances) of speakers (eg, a user and/or other speakers (or others)), life noise, and ambient (or background) noises. For example, the sound analysis engine 450 may classify voice and noise as components related to audio data through signal processing that analyzes the size of a db or recognizes a language.

한편, 전술한 바에서는 입력 데이터로부터 감지 조건들의 집합(예: 룰셋)을 생성하기 위한 다양한 구성 요소들을 분류하는 경우를 설명하였으며, 구성 요소들의 일 예로 신체 부위, 동작, 사물, 시간, 또는 소리를 예시하나, 전술한 예에 한정되지 않는다. 예를 들어, 입력 데이터로부터 인공 지능 모델에 의해 전자적으로 분석 가능한 종류의 전자적인 데이터들(예: 소프트웨어, 센서의 값들)과 같이 독립적으로 추출이 가능한 구성 요소이면 모두 가능할 수 있다. Meanwhile, in the foregoing, a case of classifying various components for generating a set of detection conditions (eg, a rule set) from input data has been described, and body parts, motions, objects, time, or sounds are examples of the components. Illustrative, but not limited to the above examples. For example, any component that can be independently extracted, such as electronic data (eg, software, sensor values) that can be electronically analyzed by an artificial intelligence model from input data, may be used.

다양한 실시 예에 따르면, 지능형 영상 분석을 위한 장치(300)는, 적어도 하나의 프로세서(320) 및 메모리(330)를 포함하며, 상기 메모리(330)는, 실행 시에, 상기 적어도 하나의 프로세서(320)가, 적어도 하나의 입력 이미지를 분석함으로써 적어도 하나의 구성 요소를 획득하고, 상기 적어도 하나의 구성 요소에 기반하여 복수의 감지 조건들을 식별하고, 상기 복수의 감지 조건들에 기반하여, 복수의 감지 조건 집합들을 생성하고, 상기 복수의 감지 조건 집합들 중 적어도 하나에 근거하여, 대상 이미지에서 이상 유무를 검출하도록 하는 인스트럭션들을 저장하도록 설정될 수 있다.According to various embodiments, the apparatus 300 for intelligent image analysis includes at least one processor 320 and a memory 330, and the memory 330, when executed, the at least one processor ( 320) acquires at least one component by analyzing at least one input image, identifies a plurality of sensing conditions based on the at least one component, and performs a plurality of sensing conditions based on the plurality of sensing conditions. It may be configured to generate sensing condition sets and store instructions for detecting whether or not there is an abnormality in the target image based on at least one of the plurality of sensing condition sets.

다양한 실시 예에 따르면, 상기 인스트럭션들은, 상기 적어도 하나의 프로세서가, 상기 복수의 감지 조건들을 기반으로 한 반복 학습을 통해 상기 복수의 감지 조건 집합들을 생성하도록 설정될 수 있다. According to various embodiments of the present disclosure, the at least one processor may be set to generate the plurality of sensing condition sets through repetitive learning based on the plurality of sensing conditions.

다양한 실시 예에 따르면, 상기 적어도 하나의 구성 요소는, 상기 복수의 입력 이미지들에 포함된 신체 부위를 나타내는 제1 데이터, 객체의 동작을 나타내는 제2 데이터, 사물을 나타내는 제3 데이터, 또는 상기 객체의 동작과 관련된 시간을 나타내는 제4 데이터 중 적어도 하나를 포함할 수 있다. According to various embodiments, the at least one component may include first data representing a body part included in the plurality of input images, second data representing an operation of an object, third data representing an object, or the object. It may include at least one of the fourth data indicating the time related to the operation of the .

다양한 실시 예에 따르면, 상기 인스트럭션들은, 상기 적어도 하나의 프로세서가, 상기 적어도 하나의 구성 요소에 포함된 상기 제1 데이터 내지 상기 제4 데이터를 조합하고, 상기 조합을 통해 상기 복수의 감지 조건들을 생성하도록 설정될 수 있다. According to various embodiments of the present disclosure, in the instructions, the at least one processor combines the first to fourth data included in the at least one component, and generates the plurality of sensing conditions through the combination. can be set to

다양한 실시 예에 따르면, 상기 인스트럭션들은, 상기 적어도 하나의 프로세서가, 상기 조합을 통해 중복되는 감지 조건을 고려하여 상기 복수의 감지 조건들을 생성하도록 설정될 수 있다.According to various embodiments of the present disclosure, the at least one processor may be set to generate the plurality of sensing conditions in consideration of overlapping sensing conditions through the combination.

다양한 실시 예에 따르면, 상기 인스트럭션들은, 상기 적어도 하나의 프로세서가, 상기 복수의 감지 조건 집합들 중에서 상기 생성된 복수의 감지 조건들과 유사도 비교 결과 스코어가 가장 높은 감지 조건 집합을 추천하도록 설정될 수 있다.According to various embodiments of the present disclosure, the at least one processor may be configured to recommend a detection condition set having the highest similarity comparison result score with the generated plurality of detection conditions among the plurality of detection condition sets. there is.

다양한 실시 예에 따르면, 상기 인스트럭션들은, 상기 적어도 하나의 프로세서가, 상기 생성된 복수의 감지 조건들과 연관되는 신규 감지 조건을 생성하고, 상기 신규 감지 조건을 추천하도록 설정될 수 있다.According to various embodiments of the present disclosure, the instructions may be set such that the at least one processor generates a new sensing condition associated with the plurality of generated sensing conditions and recommends the new sensing condition.

다양한 실시 예에 따르면, 상기 인스트럭션들은, 상기 적어도 하나의 프로세서가, 상기 복수의 감지 조건 집합들 중에서 사용 빈도에 기반하여 감지 조건 집합을 추천하도록 설정될 수 있다.According to various embodiments of the present disclosure, the at least one processor may be configured to recommend a sensing condition set among the plurality of sensing condition sets based on a frequency of use.

다양한 실시 예에 따르면, 상기 인스트럭션들은, 상기 적어도 하나의 프로세서가, 상기 복수의 감지 조건 집합들 중에서 사용 빈도 및 추천 횟수에 기반하여 감지 조건 집합을 추천하도록 설정될 수 있다.According to various embodiments of the present disclosure, the at least one processor may be configured to recommend a sensing condition set based on a frequency of use and a number of recommendations among the plurality of sensing condition sets.

다양한 실시 예에 따르면, 상기 인스트럭션들은, 상기 적어도 하나의 프로세서가, 상기 복수의 감지 조건 집합들 중 추천된 감지 조건 집합에 근거하여, 상기 대상 이미지에서 이상 유무를 검출하도록 설정될 수 있다.According to various embodiments of the present disclosure, the at least one processor may be configured to detect whether or not there is an abnormality in the target image based on a recommended sensing condition set among the plurality of sensing condition sets.

도 5는 다양한 실시 예에 따른 지능형 영상 분석 장치(300)에서의 동작 흐름도(500)이다. 5 is a flowchart 500 of operations in the intelligent video analysis device 300 according to various embodiments.

도 5를 참조하면, 지능형 영상 분석 장치(300)는 505 동작에서 적어도 하나의 입력 이미지들을 분석함으로써 적어도 하나의 구성 요소를 획득할 수 있다. 일 실시 예에 따르면, 상기 적어도 하나의 구성 요소는, 상기 적어도 하나의 입력 이미지들에 포함된 신체 부위를 나타내는 제1 데이터, 객체의 동작을 나타내는 제2 데이터, 사물을 나타내는 제3 데이터, 또는 상기 객체의 동작과 관련된 시간을 나타내는 제4 데이터 중 적어도 하나를 포함할 수 있다. Referring to FIG. 5 , the intelligent video analysis device 300 may acquire at least one component by analyzing at least one input image in operation 505 . According to an embodiment, the at least one component may include first data representing a body part included in the at least one input image, second data representing an operation of an object, third data representing an object, or the It may include at least one of fourth data indicating time related to the operation of the object.

510 동작에서, 지능형 영상 분석 장치(300)는 상기 적어도 하나의 구성 요소에 기반하여 복수의 감지 조건들을 식별할 수 있다. 일 실시 예에 따르면, 지능형 영상 분석 장치(300)는 상기 적어도 하나의 구성 요소에 포함된 상기 제1 데이터 내지 상기 제4 데이터를 조합하고, 상기 조합을 통해 상기 복수의 감지 조건들을 생성할 수 있다. 예를 들어, 제1 데이터와 제2 데이터를 조합하여 제1 감지 조건을 생성할 수 있으며, 제2 데이터와 제4 데이터를 조합하여 제2 감지 조건을 생성할 수 있으며, 제1 내지 제3 데이터 모두를 조합하여 제3 감지 조건을 생성할 수 있다. 이와 같이 지능형 영상 분석 장치(300)는 구성 요소들 간의 다양한 조합을 통해 서로 다른 복수의 감지 조건들을 생성할 수 있다. In operation 510, the intelligent video analysis device 300 may identify a plurality of sensing conditions based on the at least one component. According to an embodiment, the intelligent video analysis device 300 may combine the first to fourth data included in the at least one component, and generate the plurality of detection conditions through the combination. . For example, the first sensing condition may be generated by combining the first data and the second data, the second sensing condition may be generated by combining the second data and the fourth data, and the first to third data A third sensing condition may be created by combining all of them. As such, the intelligent video analysis device 300 may generate a plurality of different sensing conditions through various combinations of components.

다른 실시 예에 따르면, 지능형 영상 분석 장치(300)는 상기 조합을 통해 중복되는 감지 조건을 고려하여 상기 복수의 감지 조건들을 생성할 수 있다. According to another embodiment, the intelligent video analysis device 300 may generate the plurality of sensing conditions in consideration of overlapping sensing conditions through the combination.

515 동작에서, 지능형 영상 분석 장치(300)는 상기 복수의 감지 조건들에 기반하여, 복수의 감지 조건 집합들을 생성할 수 있다. 일 실시 예에 따르면, 지능형 영상 분석 장치(300)는 상기 복수의 감지 조건들을 기반으로 한 반복 학습(예: 머신 러닝)을 통해 상기 복수의 감지 조건 집합들을 생성할 수 있다. In operation 515, the intelligent video analysis apparatus 300 may generate a plurality of detection condition sets based on the plurality of detection conditions. According to an embodiment, the intelligent video analysis device 300 may generate the plurality of detection condition sets through iterative learning (eg, machine learning) based on the plurality of detection conditions.

520 동작에서, 지능형 영상 분석 장치(300)는 상기 복수의 감지 조건 집합들 중 적어도 하나에 근거하여 대상 이미지에서 이상 유무를 검출할 수 있다. In operation 520, the intelligent video analysis device 300 may detect whether there is an abnormality in the target image based on at least one of the plurality of detection condition sets.

일 실시 예에 따르면, 지능형 영상 분석 장치(300)는 상기 복수의 감지 조건 집합들 중 추천된 감지 조건 집합에 근거하여, 상기 대상 이미지에서 이상 유무를 검출할 수 있다. 여기서, 감지 조건의 집합은 하기와 같은 방식에 기반하여 추천될 수 있다. According to an embodiment, the intelligent video analysis device 300 may detect whether or not there is an abnormality in the target image based on a recommended sensing condition set among the plurality of sensing condition sets. Here, a set of detection conditions may be recommended based on the following method.

일 실시 예에 따르면, 지능형 영상 분석 장치(300)는 상기 복수의 감지 조건 집합들 중에서 상기 생성된 복수의 감지 조건들과 유사도 비교 결과 스코어가 가장 높은 감지 조건 집합을 추천할 수 있다. 예를 들어, 지능형 영상 분석 장치(300)는 생성된 복수의 감지 조건(예: 룰셋)과 기존 감지 조건 집합들(예: 룰셋 테이블) 간의 유사도를 식별할 수 있다. 만일 생성된 복수의 감지 조건이 '화면에 사람 없음', '화면에 여러 사람 있음', '책'인 조건을 포함하는 경우, 이러한 조건들과 유사한 감지 조건 집합을 추천할 수 있다. According to an embodiment, the intelligent video analysis device 300 may recommend a sensing condition set having the highest similarity comparison result score with the generated plurality of sensing conditions among the plurality of sensing condition sets. For example, the intelligent video analysis apparatus 300 may identify a similarity between a plurality of generated detection conditions (eg, rulesets) and existing detection condition sets (eg, ruleset tables). If the plurality of generated sensing conditions include conditions such as 'there is no person on the screen', 'there are many people on the screen', and 'book', a sensing condition set similar to these conditions may be recommended.

일 실시 예에 따르면, 지능형 영상 분석 장치(300)는 상기 생성된 복수의 감지 조건들과 연관되는 신규 감지 조건을 생성하고, 상기 생성된 신규 감지 조건을 추천할 수 있다. 예를 들어, 지능형 영상 분석 장치(300)는 상기 생성된 복수의 감지 조건들 간의 연계성을 고려하여, 만일 생성된 복수의 감지 조건에 '책'인 조건이 포함되어 있는 경우, 일반적으로 책을 볼 때 손을 올리는 동작이 예측 가능하므로, '손 올림'인 조건을 신규 감지 조건으로 추천할 수 있다. According to an embodiment, the intelligent video analysis device 300 may create a new sensing condition associated with the plurality of generated sensing conditions and recommend the created new sensing condition. For example, the intelligent video analysis device 300 considers connectivity between the plurality of generated detection conditions and, if the generated plurality of detection conditions include a condition of 'book', generally reads a book. Since the hand-raising motion is predictable, the 'hand-raising' condition can be recommended as a new detection condition.

일 실시 예에 따르면, 지능형 영상 분석 장치(300)는 상기 복수의 감지 조건 집합들 중에서 사용 빈도에 기반하여 감지 조건 집합을 추천할 수 있다. 예를 들어, 지능형 영상 분석 장치(300)는 이전 시험들을 통해 축적된 데이터에 기반하여, 사용 빈도가 높은 감지 조건 집합을 추천할 수 있다. 이전 시험들에서 특정 감지 조건 집합이 많이 사용되었다면 상기 특정 감지 조건 집합에 대한 스코어는 반복 학습을 통해 높아진 상태일 수 있다. 따라서 지능형 영상 분석 장치(300)는 스코어가 가장 높은 감지 조건 집합을 추천할 수 있다.According to an embodiment, the intelligent video analysis device 300 may recommend a detection condition set based on a frequency of use among the plurality of detection condition sets. For example, the intelligent video analysis device 300 may recommend a detection condition set with a high frequency of use based on data accumulated through previous tests. If a specific sensing condition set was frequently used in previous tests, the score for the specific sensing condition set may be in a state of being increased through repetitive learning. Accordingly, the intelligent video analysis device 300 may recommend a detection condition set having the highest score.

일 실시 예에 따르면, 지능형 영상 분석 장치(300)는 상기 복수의 감지 조건 집합들 중에서 사용 빈도 및 추천 횟수에 기반하여 감지 조건 집합을 추천할 수 있다. 예를 들어, 지능형 영상 분석 장치(300)는 이전 시험들에서'화면에 사람 없음'인 감지 조건이 많이 충족된 경우, 해당 감지 조건을 추천할 수 있다. 즉, '화면에 사람 없음'인 감지 조건을 충족하는 상황이 감독 관리시 많이 발생한 경우이면서 상기 '화면에 사람 없음'인 감지 조건이 많이 추천된 경우에 상기 감지 조건을 추천할 수 있다. According to an embodiment, the intelligent video analysis device 300 may recommend a sensing condition set based on the frequency of use and the number of recommendations among the plurality of sensing condition sets. For example, the intelligent video analysis device 300 may recommend a corresponding detection condition when many of the detection conditions of 'there is no person on the screen' are satisfied in previous tests. That is, the detection condition may be recommended when situations satisfying the detection condition of 'no person on the screen' occur frequently during supervision and management, and when the detection condition of 'no person on the screen' is recommended a lot.

도 6은 다양한 실시 예에 따른 룰셋 생성을 위한 동작 흐름도(600)이다. 6 is a flowchart 600 of an operation for generating a ruleset according to various embodiments.

605 동작에서, 지능형 영상 분석 장치(300)는 입력 이미지를 획득할 수 있다. 예를 들어, 입력 이미지는 훈련 대상자의 이미지 및/또는 영상을 의미하는 것으로, 테스트 시간 동안에 촬영되는 복수의 이미지들일 수 있다. In operation 605, the intelligent video analysis device 300 may obtain an input image. For example, the input image denotes an image and/or video of a trainee, and may be a plurality of images captured during the test time.

610 동작에서, 지능형 영상 분석 장치(300)는 입력 이미지를 분석함으로써 룰(또는 감지 조건)의 구성 요소들에 대응하는 신체 부위, 동작, 사물, 또는 지속 시간 정보를 검출할 수 있다. 615 동작에서, 지능형 영상 분석 장치(300)는 신체 부위, 동작, 사물, 또는 지속 시간 정보의 조합을 통해 복수의 룰셋을 생성할 수 있다. In operation 610, the intelligent video analysis device 300 may detect body parts, motions, objects, or duration information corresponding to components of the rule (or detection condition) by analyzing the input image. In operation 615, the intelligent video analysis apparatus 300 may generate a plurality of rule sets through a combination of body parts, motions, objects, or duration information.

620 동작에서, 지능형 영상 분석 장치(300)는 학습 알고리즘에 기반하여 룰셋 테이블 생성 및/또는 업데이트를 수행할 수 있다. 625 동작에서, 지능형 영상 분석 장치(300)는 룰셋 테이블을 이용한 검증 목록을 생성할 수 있다. In operation 620, the intelligent video analysis device 300 may generate and/or update a rule set table based on the learning algorithm. In operation 625, the intelligent video analysis device 300 may generate a verification list using a rule set table.

이후, 630 동작에서, 지능형 영상 분석 장치(300)는 실제 시험이 시작됨에 따라 감독관으로부터 검증 요청이 수신되는지를 식별할 수 있다. 635 동작에서, 상기 검증 요청의 수신에 대응하여, 지능형 영상 분석 장치(300)는 검증 목록을 이용하여 대상 이미지(예: 실시간 응시자의 이미지 및/또는 영상)를 분석할 수 있다. 640 동작에서, 지능형 영상 분석 장치(300)는 분석 결과를 출력할 수 있다. 예를 들어, 실시간으로 응시자를 촬영한 입력 데이터에서 신체 부위, 사물과 같은 객체, 객체의 동작과 같은 구성 요소들이 추출되는 경우 그 구성 요소들의 조합이 상기 검증 목록에 포함된 감지 조건들 조합에 대응하는지를 식별할 수 있다. 이러한 구성 요소들의 조합이 상기 검증 목록 상의 감지 조건들 조합에 대응할 경우, 응시자의 행위는 부정 행위로서 간주될 수 있다. 이러한 부정 행위를 감지하기 위한 감지 조건들에 대한 구체적인 예들에 대해서는 도 12를 참조하여 후술하기로 한다. Thereafter, in operation 630, the intelligent video analysis device 300 may identify whether a verification request is received from a supervisor as the actual exam begins. In operation 635, in response to receiving the verification request, the intelligent video analysis device 300 may analyze a target image (eg, a real-time candidate's image and/or video) using the verification list. In operation 640, the intelligent video analysis device 300 may output an analysis result. For example, when components such as body parts, objects such as objects, and motions of objects are extracted from input data obtained by photographing a test taker in real time, a combination of the components corresponds to a combination of detection conditions included in the verification list. can identify what it does. If a combination of these components corresponds to a combination of detection conditions on the verification list, the candidate's action may be regarded as a cheating action. Specific examples of detection conditions for detecting such fraudulent behavior will be described later with reference to FIG. 12 .

도 7은 다양한 실시 예에 따른 온라인 시험 방식을 나타낸 예시도이다. 도 7의 설명의 이해를 돕기 위해 도 8을 참조하기로 한다. 도 8은 다양한 실시 예에 따른 실시간 응시자 영상을 나타낸 예시도이다. 7 is an exemplary diagram illustrating an online test method according to various embodiments. 8 will be referred to for better understanding of the description of FIG. 7 . 8 is an exemplary view illustrating a real-time viewer image according to various embodiments.

도 7을 참조하면, 지능형 서버(250)는 전자 장치(101)와의 통신을 통해, 전자 장치(101)로부터 실시간으로 응시자(710)의 영상을 수신할 수 있으며, 이를 온라인 시험을 관리 및 감독하는 서비스를 제공하는 서비스 사업자(260)로 전송할 수 있다. Referring to FIG. 7 , an intelligent server 250 may receive an image of a candidate 710 from the electronic device 101 in real time through communication with the electronic device 101, and manage and supervise the online test. It can be transmitted to the service provider 260 that provides the service.

일 실시 예에 따르면, 전자 장치(101)는 응시자(710)에 대한 영상 및 주변 환경을 실시간으로 촬영하는 장치일 수 있다. 다른 실시 예에 따르면, 지능형 서버(250)는 다른 전자 장치(예: 전자 장치(101) 또는 외부 전자 장치(102))를 이용하여 실시간으로 응시자(710)의 응시 영상을 촬영할 수 있다. According to an embodiment, the electronic device 101 may be a device that captures an image of the candidate 710 and a surrounding environment in real time. According to another embodiment, the intelligent server 250 may capture a gaze image of the candidate 710 in real time using another electronic device (eg, the electronic device 101 or the external electronic device 102).

일 실시 예에 따르면, 지능형 서버(250)는 응시자(710)를 원격으로 감독하기 위한 실시간 응시 영상을 서비스 사업자(260)로 제공함과 동시에 응시자의 다양한 모션 또는 주변 상황을 검출하는데 이용되는 감지 조건들을 생성, 편집 및/또는 학습시킬 수 있다. 지능형 서버(250)는 실시간으로 입력되는 데이터(예: 응시자 이미지 및/또는 영상, 소리)를 처리하고 분석하여 감독에 필요한 정보로서 활용할 수 있다. According to an embodiment, the intelligent server 250 provides a real-time gaze image for remotely supervising the candidate 710 to the service provider 260 and at the same time detects various motions of the candidate 710 or detection conditions used to detect surrounding situations. can be created, edited and/or taught. The intelligent server 250 may process and analyze real-time input data (eg, candidate image and/or video, sound) and use it as information necessary for supervision.

도 8을 참조하면, 지능형 서버(250)는 실시간 응시자와 관련된 입력 데이터를 분석하고, 입력 데이터로부터 감독에 필요한 특정 객체를 인식할 수 있다. 예를 들면, 지능형 서버(250)는 응시자의 특정 신체 부위(810) 및 얼굴 부위(820)(예: 응시자의 얼굴, 양 손), 주변 물체(예: 응시자가 소지한 전자 장치(824), 응시에 필요한 장치들(830, 840))를 특정 객체로서 인식할 수 있다. 또한, 지능형 서버(250)는 특정 객체 이외에 주변 물체를 나타내는 데이터를 실시간으로 추출함으로써 객체의 동작을 나타내는 데이터를 획득할 수 있으며, 해당 객체의 동작에 따른 소요 시간, 유지 시간 또는 노출 시간과 같은 객체의 동작과 관련된 시간을 나타내는 데이터를 획득할 수도 있다. 이에 따라 지능형 서버(250)는 실시간으로 응시자의 신체 일부 및/또는 응시자 주변 물체를 인식 대상 객체로 하여 응시자 행동 및 응시자 주변 환경을 모니터링 할 수 있다. Referring to FIG. 8 , the intelligent server 250 may analyze input data related to a test taker in real time and recognize a specific object required for supervision from the input data. For example, the intelligent server 250 may include a specific body part 810 and face part 820 (eg, the candidate's face, both hands), surrounding objects (eg, the electronic device 824 possessed by the candidate), Devices 830 and 840 required for gaze may be recognized as specific objects. In addition, the intelligent server 250 may obtain data representing the operation of the object by extracting data representing surrounding objects in addition to the specific object in real time, and objects such as required time, retention time, or exposure time according to the operation of the object Data representing the time associated with the operation of may be obtained. Accordingly, the intelligent server 250 may monitor the candidate's behavior and the surrounding environment of the candidate in real time by using a part of the candidate's body and/or an object around the candidate as a recognition target object.

지능형 서버(250)는 모니터링하는 응시자 행동 및 응시자 주변 환경이 감지 조건들의 집합에 의해 규정된 감지 범위에 해당하는지를 식별할 수 있다. 예를 들면, 지능형 서버(250)에서 감지 조건들의 집합을 이용하여 감지 목록을 설정할 수 있으며, 상기 감지 목록은 사용자 설정에 따라 선택 또는 변경 가능할 수 있다. 따라서 지능형 서버(250)는 응시자 행동 및 응시자 주변 환경이 상기 감지 목록에 해당하는 경우 예컨대, 응시 조건 범위를 벗어나는 경우, 응시자에 대한 이상이 발생했다고 판단하여, 이상 상태를 알리는 알림을 출력할 수 있다. 예를 들어, 지능형 서버(250)는 이상 상태를 감독관에게 알리기 위해 알림 신호를 경고창을 통해 표시하거나 알림음을 통해 출력할 수 있다.The intelligent server 250 may identify whether the monitored behavior of the candidate and the surrounding environment of the candidate fall within a sensing range defined by a set of sensing conditions. For example, the intelligent server 250 may set a detection list using a set of detection conditions, and the detection list may be selectable or changeable according to user settings. Accordingly, the intelligent server 250 determines that an abnormality has occurred for the candidate when the candidate's behavior and the candidate's surrounding environment correspond to the detection list, for example, when the candidate's surrounding environment is out of the range of the gaze condition, and outputs a notification informing of the abnormal state. . For example, the intelligent server 250 may display a notification signal through a warning window or output a notification sound to inform a supervisor of an abnormal condition.

도 9는 다양한 실시 예에 따른 룰셋 생성 및 감지 목록 생성 동작을 설명하기 위한 예시도이다. 9 is an exemplary diagram for explaining ruleset creation and detection list creation operations according to various embodiments.

도 9를 참조하면, 룰셋 구성 요소(905)는 입력 데이터로부터 감지 조건들의 집합(예: 룰셋)을 생성하기 위한 다양한 구성 요소를 나타내는 것으로, 구성 요소들의 일 예로 신체 부위, 동작, 또는 시간을 예시하고 있다. 예를 들어, 부위별 데이터(예: 얼굴, 손, 몸, 팔, …), 동작별 데이터(예: 사라짐, 움직임,…), 시간별 데이터(예: 0.1초, 1초, 4초, …)와 같이 구성 요소별로 데이터들이 분류될 수 있다. 도 9에서는 입력 데이터로부터 검출 가능한 구성 요소들의 일 예를 예시하나, 그 종류는 이에 한정되지 않는다. 예를 들어, 입력 데이터로부터 인공 지능 모델에 의해 전자적으로 분석 가능한 종류의 전자적인 데이터들(예: 소프트웨어, 센서의 값들)과 같이 독립적으로 추출이 가능한 구성 요소이면 모두 가능할 수 있다.Referring to FIG. 9, a rule set component 905 represents various components for generating a set of detection conditions (eg, a rule set) from input data, and body parts, motions, or time are examples of the components. are doing For example, data by part (e.g. face, hand, body, arm, …), data by motion (e.g. disappear, movement,…), data by time (e.g. 0.1 second, 1 second, 4 second, …) Data can be classified by component as in 9 illustrates an example of components detectable from input data, but the types are not limited thereto. For example, any component that can be independently extracted, such as electronic data (eg, software, sensor values) that can be electronically analyzed by an artificial intelligence model from input data, may be used.

지능형 영상 분석 장치(300)는 룰셋 구성 요소(905)에서의 분류된 각각의 데이터들 간의 조합(또는 정의)을 통해 복수의 감지 조건들(911, 912)을 생성할 수 있다. 일 실시 예에 따르면, 초기 학습 모델을 학습시키기 위해서는 분류된 데이터들을 미리 디폴트로 설정해 놓은 정의 방식에 따라 조합함으로써 복수의 감지 조건들을 생성할 수 있다. 이후, 반복 학습을 통해 훈련 데이터가 많이 누적되게 되면, 예컨대, 사용 빈도와 같은 다양한 조건에 따라 데이터 조합이 달라질 수 있으며, 이에 따라 다양한 조합의 감지 조건들(또는 룰셋)(911, 912)이 생성될 수 있다. 예를 들어, 분류된 각 데이터의 조합을 통해 제1 룰셋(911), 제2 룰셋(912)은 테이블 형태(910)로 저장될 수 있다. The intelligent video analysis device 300 may generate a plurality of detection conditions 911 and 912 through a combination (or definition) of each data classified in the rule set component 905 . According to an embodiment, in order to learn an initial learning model, a plurality of sensing conditions may be generated by combining classified data according to a definition method set as a default in advance. Then, when a lot of training data is accumulated through repetitive learning, data combinations may be changed according to various conditions such as frequency of use, and accordingly, various combinations of detection conditions (or rule sets) 911 and 912 are generated. It can be. For example, the first rule set 911 and the second rule set 912 may be stored in a table form 910 through a combination of each classified data.

한편, 지능형 영상 분석 장치(300)는 반복 학습을 통해 생성된 복수의 감지 조건 집합들 중 감독에 필요한 감지 목록들(920)을 구성할 수 있다. 이때, 각각의 감지 목록(921, 922)은 사용자 선택에 따라 생성될 수 있으며, 다르게는 추천 방식을 통해 생성될 수 있다. 예를 들어, 감지 목록 1(921)은 제1 룰셋(911) 및 제2 룰셋(912)을 포함하여 구성될 수 있으며, 감지 목록 2(922)는 제1 룰셋(911)과 다른 룰셋들을 포함하여 구성될 수 있다. 이러한 감지 목록은 응시자에 대한 입력 데이터로부터 이상 여부를 검출하기 위한 감지 조건들의 조합을 나타내는 것으로, 예를 들어, 실제 시험 응시 감독관이 선택할 수 있다. 다양한 실시 예에 따르면, 지능형 영상 분석 장치(300)는 학습 모델에 의해, 적어도 하나의 감지 조건 집합(예: 룰셋)을 선택하거나, 동적으로 또는 실시간으로 감지 조건들을 생성할 수도 있다.Meanwhile, the intelligent video analysis device 300 may configure detection lists 920 necessary for supervision among a plurality of detection condition sets generated through repetitive learning. At this time, each of the detection lists 921 and 922 may be generated according to user selection, or may be generated through a recommendation method. For example, detection list 1 921 may include a first rule set 911 and a second rule set 912, and detection list 2 922 may include rulesets different from the first rule set 911. can be configured. This detection list represents a combination of detection conditions for detecting abnormality from the input data of the candidate, and may be selected by, for example, an actual test-taking supervisor. According to various embodiments, the intelligent video analysis device 300 may select at least one detection condition set (eg, a rule set) or dynamically or in real time generate detection conditions by a learning model.

일 실시 예에 따르면, 지능형 영상 분석 장치(300)는 감지 목록에 의해 검출해야 하는 항목들을 분류(930)할 수 있다. 예를 들어, 감지 목록에 포함된 감지 조건들에 해당하는 내용을 분류할 경우 일부 중복되는 내용이 있을 수 있으므로, 지능형 영상 분석 장치(300)는 중복되는 감지 조건을 제외하기 위해 분류(930) 동작을 수행할 수 있다. 다른 실시 예에 따르면, 지능형 영상 분석 장치(300)는 입력 데이터의 분석을 통해 획득한 구성 요소들의 조합을 통해 감지 조건들을 생성 시 중복 여부에 기반하여 감지 조건들을 생성할 수 있다. 예를 들어, 복수의 감지 조건들의 경우 감지 조건들을 구성하는 적어도 일부가 중복될 수 있는데, 지능형 영상 분석 장치(300)에서 이미지 분석을 통해 A 이미지에서 a 사물을 나타내는 데이터(예: '책'이 감지되었음을 나타내는 데이터) 및 b 사물을 나타내는 데이터를 획득하고, B 이미지에서는 a 사물을 나타내는 데이터(예: '책'이 감지되었음을 나타내는 데이터) 및 c 사물을 나타내는 데이터를 획득한 경우, 공통되는 a 사물(예: '책'이 감지되었음을 나타내는 데이터)을 나타내는 데이터를 유효한(또는 신뢰성이 높은) 감지 조건으로 남긴 후 나머지 다른 데이터들을 정리할 수 있다. 예컨대, 감지 조건들이 중복되는 경우에는 가장 효율적인 적어도 하나의 감지 조건만을 남기고 나머지는 폐기 처리할 수 있다. According to an embodiment, the intelligent video analysis device 300 may classify (930) items to be detected using a detection list. For example, when classifying content corresponding to detection conditions included in the detection list, there may be some overlapping content, so the intelligent video analysis device 300 performs a classification operation 930 to exclude overlapping detection conditions. can be performed. According to another embodiment, the intelligent video analysis device 300 may generate sensing conditions based on overlapping or not when generating sensing conditions through a combination of components obtained through analysis of input data. For example, in the case of a plurality of detection conditions, at least some constituting the detection conditions may overlap. Data representing object a in image A through image analysis in the intelligent video analysis device 300 (eg, 'book' is When data representing object a) and data representing object b are obtained, and data representing object a (e.g., data indicating that 'book' has been detected) and data representing object c are acquired in image B, common object a (eg, data indicating that 'book' has been detected) is left as a valid (or highly reliable) detection condition, and then other data can be organized. For example, when sensing conditions overlap, only at least one most efficient sensing condition may be left and the rest may be discarded.

지능형 영상 분석 장치(300)는 대상 이미지 입력(940) 시, 대상 이미지에 대한 분석(950)을 통해 분류(930)된 조건들에 대응하는 이상 여부(960)를 판단할 수 있다. 여기서, 대상 이미지는 학습 모델에서 자동으로 감지 조건들을 생성하기 위한 데이터로서 활용될 수 있다. When the target image is input 940 , the intelligent video analysis device 300 may determine whether there is an abnormality 960 corresponding to the conditions classified 930 through analysis 950 of the target image. Here, the target image may be used as data for automatically generating detection conditions in the learning model.

예를 들어, 응시자 감독을 위해 선택된 감지 목록 즉, 감지 조건 집합으로서, 왼손의 경우에는 사라짐 2초, 손올림 1초인 조건, 사물의 경우 사람 2초, 책 2초인 조건, 얼굴의 경우 사라짐 2초, 움직임 2초인 조건을 포함하는 경우를 가정해보기로 한다. 만일 대상 이미지에 대한 분석 결과 왼손 사라짐 2초, 사람 1초, 얼굴 움직임 2초의 경우 검출되는 경우, 지능형 영상 분석 장치(300)는 상기 감지 조건 집합의 범위에 속하는지를 식별할 수 있다. 지능형 영상 분석 장치(300)는 대상 이미지의 분석 결과에 따른 왼손 사라짐 2초와 얼굴 움직임 2초인 조건이 상기 감지 조건 집합에 대응하는 것으로 식별할 수 있다. 이에 따라 지능형 영상 분석 장치(300)는 해당 응시자를 부정 행위자로 간주하여 이상 상태를 감독 주체한테 보고할 수 있다. For example, a list of detections selected for candidate supervision, i.e. a set of detection conditions, conditions for left hand: disappearing 2 seconds, hand raised 1 second, for object: person 2 seconds, book 2 seconds, face for disappearing 2 seconds. , let's assume a case including a condition of movement of 2 seconds. If, as a result of the analysis of the target image, 2-second disappearance of the left hand, 1-second person, and 2-second facial movement are detected, the intelligent video analysis device 300 may identify whether the image falls within the range of the set of detection conditions. The intelligent video analysis device 300 may identify that the condition of left hand disappearing for 2 seconds and face moving for 2 seconds according to the analysis result of the target image corresponds to the detection condition set. Accordingly, the intelligent video analysis device 300 may regard the candidate as a cheater and report the abnormal state to the supervisor.

상기한 바와 같이 다양한 실시 예에 따르면, 미리 감지 조건들을 설정하여 생성하는 별도의 작업 없이 입력 데이터(예: 입력 이미지 및/또는 영상)를 입력하기만 하면, 지능형 영상 분석 장치(300)는 입력 데이터를 분석하여 분석 결과에 기반하여 각 구성 요소들의 조합을 통해 자동으로 감지 조건들을 생성할 수 있다. 이러한 감지 조건들은 학습 모델에 의해 응시자의 다양한 행동을 비롯하여 주변 상황과 관련된 입력 데이터를 이용하여 학습되기 때문에 응시자의 부정 행위 유무를 손쉽게 모니터링할 수 있다. As described above, according to various embodiments, if input data (eg, an input image and/or video) is input without a separate operation of setting and generating detection conditions in advance, the intelligent video analysis device 300 performs the input data It is possible to automatically generate detection conditions through a combination of each component based on the analysis result by analyzing . Since these detection conditions are learned by the learning model using input data related to various behaviors of the candidate and surrounding situations, it is possible to easily monitor whether or not the candidate cheated.

도 10a은 다양한 실시 예에 따른 입력 이미지 기반의 룰셋 생성 방법을 설명하기 위한 예시도이며, 도 10b는 상기 도 10a에 이어지는 도면으로, 다양한 실시 예에 따른 룰셋 추천 방법을 나타낸 예시도이다. FIG. 10A is an exemplary diagram for explaining a method of generating a ruleset based on an input image according to various embodiments, and FIG. 10B is a diagram continuing from FIG. 10A and illustrating a method of recommending a ruleset according to various embodiments.

도 10a를 참조하면, 지능형 영상 분석 장치(300)는 복수의 이미지들(1000)을 획득할 수 있다. 여기서, 복수의 이미지들(1000)은 부정 행위를 감지하기 위한 기준이 되는 훈련 이미지들(또는 이상 동작 이미지)(1001, 1002, 1003)을 포함할 수 있다. 또한 복수의 이미지들(1000)은 실시간으로 촬영되는 응시자의 입력 이미지들을 더 포함할 수 있다. Referring to FIG. 10A , the intelligent video analysis device 300 may obtain a plurality of images 1000. Here, the plurality of images 1000 may include training images (or abnormal motion images) 1001 , 1002 , and 1003 serving as standards for detecting cheating. In addition, the plurality of images 1000 may further include input images of a viewer captured in real time.

지능형 영상 분석 장치(300)는 복수의 이미지들(1000)을 분석할 수 있으며, 분석을 통해 상기 복수의 이미지들 각각으로부터 구성 요소들을 획득할 수 있다. 지능형 영상 분석 장치(300)는 이러한 구성 요소들의 조합을 통해 복수의 감지 조건들(예: 룰셋)을 생성(1010)할 수 있다. 예를 들어, 제1 룰셋(1012)은 제1 이미지(1001)의 분석 결과, '동작'으로 분류되는 '화면에 사람 없음'인 조건을 포함하며, 제2 룰셋(1013)은 제1 이미지(1001)의 분석 결과, '동작'으로 분류되는 '화면에 여러 사람 있음'인 조건을 포함하며, 제3 룰셋(1014)는 '사물'로 분류되는 '책'인 조건을 포함하는 경우를 예시하고 있다. The intelligent video analysis device 300 may analyze the plurality of images 1000 and obtain components from each of the plurality of images through analysis. The intelligent video analysis device 300 may generate (1010) a plurality of detection conditions (eg, a rule set) through a combination of these components. For example, as a result of analyzing the first image 1001, the first rule set 1012 includes a condition of 'no person on the screen' classified as 'motion', and the second rule set 1013 includes the first image ( 1001), exemplifies a case in which the condition of 'there are many people on the screen' classified as 'action' and the third rule set 1014 includes the condition of 'book' classified as 'object' there is.

일 실시 예에 따르면, 지능형 영상 분석 장치(300)는 새롭게 생성된 감지 조건들(예: 제1 룰셋(1012), 제2 룰셋(1013), 제3 룰셋(1014))을 복수의 감지 조건 집합들(예: 학습된 룰셋 집합 테이블(1020)) 내의 감지 조건들 집합과 비교할 수 있다. 비교 결과, 지능형 영상 분석 장치(300)는 상기 복수의 감지 조건 집합들 중에서 상기 생성된 복수의 감지 조건들과 유사도 비교 결과 스코어가 가장 높은 감지 조건 집합을 추천할 수 있다. 예를 들어, 복수의 감지 조건 집합들(예: 학습된 룰셋 집합 테이블(1020)) 중에서 새롭게 생성된 감지 조건들(예: 제1 룰셋(1012), 제2 룰셋(1013), 제3 룰셋(1014))과 가장 유사한 감지 조건 조합(예: 룰셋 집합 #4)(1060)을 추천할 수 있다. According to an embodiment, the intelligent video analysis apparatus 300 sets newly generated detection conditions (eg, the first rule set 1012, the second rule set 1013, and the third rule set 1014) into a plurality of detection condition sets. (eg, the learned ruleset set table 1020). As a result of the comparison, the intelligent video analysis apparatus 300 may recommend a sensing condition set having the highest score as a result of similarity comparison with the generated sensing conditions among the plurality of sensing condition sets. For example, newly generated detection conditions (eg, the first rule set 1012, the second rule set 1013, and the third rule set (eg, the learned rule set table 1020) among a plurality of detection condition sets (eg, the learned rule set table 1020) 1014)) and the most similar detection condition combination (eg, rule set #4) 1060 may be recommended.

일 실시 예에 따르면, 지능형 영상 분석 장치(300)는 새롭게 생성된 감지 조건들(예: 제1 룰셋(1012), 제2 룰셋(1013), 제3 룰셋(1014))과 연관되는 신규 감지 조건을 생성할 수 있다. 따라서, 지능형 영상 분석 장치(300)는 복수의 감지 조건 집합들(예: 학습된 룰셋 집합 테이블(1020))에 신규 감지 조건과 함께 상기 생성된 감지 조건들을 추가할 수 있다. 예를 들어, 제1 룰셋(1012)과 연관된 신규 감지 조건을 포함하는 감지 조건 조합(예: 룰셋 집합 #1, 룰셋 집합 #n)(1030)(1070)을 생성하여 상기 테이블(1020) 내에 추가할 수 있다. 이와 유사한 방식으로, 감지 조건 조합(예: 룰셋 집합 #1)(1040)은 제2 룰셋(1013)과 연관된 신규 감지 조건을 포함할 수 있다. According to an embodiment, the intelligent video analysis apparatus 300 determines a new detection condition related to newly created detection conditions (eg, the first rule set 1012, the second rule set 1013, and the third rule set 1014). can create Accordingly, the intelligent video analysis apparatus 300 may add the generated detection conditions together with a new detection condition to a plurality of detection condition sets (eg, the learned rule set table 1020). For example, a detection condition combination (eg, rule set #1, rule set #n) 1030 and 1070 including a new detection condition associated with the first rule set 1012 is created and added to the table 1020 can do. In a similar way, the detection condition combination (eg, rule set #1) 1040 may include a new detection condition related to the second rule set 1013 .

일 실시 예에 따르면, 지능형 영상 분석 장치(300)는 복수의 감지 조건 집합들(예: 학습된 룰셋 집합 테이블(1020)) 중에서 사용 빈도에 기반하여 적어도 하나의 감지 조건 집합을 추천할 수도 있다. According to an embodiment, the intelligent video analysis apparatus 300 may recommend at least one detection condition set based on frequency of use among a plurality of detection condition sets (eg, the learned rule set table 1020).

일 실시 예에 따르면, 지능형 영상 분석 장치(300)는 상기 사용 빈도 이외에 추천 횟수도 고려하여 복수의 감지 조건 집합들(예: 학습된 룰셋 집합 테이블(1020)) 중에서 적어도 하나의 감지 조건 집합을 추천할 수도 있다. According to an embodiment, the intelligent video analysis apparatus 300 recommends at least one detection condition set among a plurality of detection condition sets (eg, the learned rule set table 1020) in consideration of the number of recommendations in addition to the frequency of use. You may.

상기한 바와 같이 입력 데이터에 기반하여 생성된 감지 조건들(또는 룰셋) 또는 훈련 데이터에 기반하여 학습된 감지 조건들의 조합(1030, 1040, 1050, 1060, 1070)을 테이블 형태로 저장 및 관리할 수 있다. As described above, combinations (1030, 1040, 1050, 1060, 1070) of detection conditions (or rule sets) generated based on input data or learned detection conditions based on training data can be stored and managed in a table form. there is.

예를 들어, 입력 데이터로부터 하기 표 1에 해당하는 구성 요소들을 추출한 경우, 상기 구성 요소들은 이전 감지 조건들과 비교 분석되어, 학습된 감지 조건들이 생성될 수 있다. For example, when components corresponding to Table 1 below are extracted from input data, the components are compared and analyzed with previous sensing conditions, and learned sensing conditions may be generated.

구성 요소 추출component extraction 이전 감지 조건들(또는 룰셋)Previous detection conditions (or ruleset) 학습된 감지 조건들(또는 룰셋)Learned Detection Conditions (or Ruleset) Parts: 오른손Parts: right hand ID: 10134ID: 10134 ID: 10135ID: 10135 Pose: 사라짐Pose: disappear Name: 오른손 사라짐(N초)Name: Right hand disappears (N seconds) Name: 오른손 사라짐(책/N초)Name: Right hand disappears (books/N seconds) Duration: 5초Duration: 5 seconds Parts: 오른손Parts: right hand Parts: 오른손Parts: right hand Object: 책Object: book Pose: 사라짐Pose: disappear Object: 책Object: book Duration: 4초Duration: 4 seconds Pose: 사라짐Pose: disappear Duration: 4~5초Duration: 4-5 seconds …… …… ……

따라서, 입력 데이터 획득 시 반복 학습을 통해 이러한 감지 조건들의 조합(1030, 1040, 1050, 1060, 1070)에 새로운 감지 조건들의 조합이 추가될 수 있으며, 매 시험 마다 편집 및 업데이트될 수 있다. Therefore, new combinations of sensing conditions can be added to these combinations of sensing conditions (1030, 1040, 1050, 1060, and 1070) through repetitive learning when acquiring input data, and can be edited and updated for each test.

도 11은 다양한 실시 예에 따른 온라인 시험에서의 동작 흐름도이다.11 is a flowchart illustrating an operation in an online test according to various embodiments.

도 11을 참조하면, 지능형 영상 분석 장치(300)는 1105 동작에서 온라인 시험이 생성되면, 1110 동작에서 시험 감지 목록을 선택할 수 있다. 시험 감지 목록은 기본 감지 목록 중에서 감독관 선택에 따라 결정될 수 있다. 예를 들어, 측면 카메라를 이용한 감지 목록의 경우, 측면 카메라 룰셋 감지 목록(1115)은 신체 부위, 카메라 주시, 팔 올림, 책, 컵과 같은 구성 요소들의 조합을 포함할 수 있다. 온라인 시험의 경우 온라인 시험 형식마다 대응되는 기본 감지 목록이 미리 구성되어 있을 수 있는데, 다양한 실시 예에서는 현재의 온라인 시험 형식에 맞게 감지 목록을 추가로 수정하는 기능을 제공할 수 있다.Referring to FIG. 11 , when an online test is generated in operation 1105, the intelligent video analysis device 300 may select a test detection list in operation 1110. The test detection list may be determined by the supervisor's selection from among the basic detection lists. For example, in the case of a detection list using a side camera, the side camera rule set detection list 1115 may include a combination of components such as body part, camera gaze, arm raised, book, and cup. In the case of an online test, a basic detection list corresponding to each online test format may be pre-configured. In various embodiments, a function of additionally modifying the detection list to suit the current online test format may be provided.

따라서 1120 동작에서, 지능형 영상 분석 장치(300)는 사용자 입력에 따라 선택한 목록을 추가 수정할 수 있다. 예를 들어, 기본 감지 목록은 지속적으로 학습된 감지 조건들을 이용하여 업데이트될 수 있다. 이후, 1125 동작에서 온라인 시험이 시작되면, 1130 동작에서 실시간으로 응시자 영상이 입력될 수 있으며, 지능형 영상 분석 장치(300)는 1135 동작에서 수정된 감지 목록의 감지 조건들에 기반하여, 이상 상태를 감지할 수 있다. 만일 이상 상태의 감지 시 1140 동작에서 감독관에게 이상 상태를 알릴 수 있다. Accordingly, in operation 1120, the intelligent video analysis device 300 may further modify the selected list according to user input. For example, the basic detection list may be continuously updated using learned detection conditions. Then, when the online test starts in operation 1125, the candidate's image may be input in real time in operation 1130, and the intelligent video analysis device 300 detects an abnormal state based on the detection conditions of the detection list modified in operation 1135. can detect If an abnormal state is detected, the supervisor may be notified of the abnormal state in operation 1140 .

도 12는 다양한 실시 예에 따른 실시간 응시자 영상들을 나타낸 예시도이다. 12 is an exemplary diagram illustrating real-time gazer images according to various embodiments.

도 12를 참조하면, 지능형 영상 분석 장치(300)는 입력 데이터(예: 실시간 응시자 이미지 및/또는 영상)를 분석할 수 있다. 지능형 영상 분석 장치(300)는 입력 데이터 분석을 통해 시험 감독에 필요한 검증 대상(예: 응시자)에 대한 검증의 기준이 되는 감지 조건들을 구성하기 위한 구성 요소들을 획득할 수 있다. 지능형 영상 분석 장치(300)는 이러한 구성 요소들의 조합에 따른 감지 조건들을 기반으로 반복 학습을 통해 다양한 형태의 감지 조건들의 집합을 생성할 수 있다. 지능형 영상 분석 장치(300)는 감독자의 선택에 대응하여, 감지 조건들의 집합 중에서 부정 행위를 감지하기 위한 감지 목록을 생성할 수 있다. Referring to FIG. 12 , the intelligent video analysis device 300 may analyze input data (eg, a real-time viewer image and/or video). The intelligent video analysis device 300 may acquire components for configuring detection conditions that are standards for verification of a verification target (eg, a candidate) required for test supervision through input data analysis. The intelligent video analysis device 300 may generate a set of various types of sensing conditions through repetitive learning based on the sensing conditions according to the combination of these components. The intelligent video analysis device 300 may generate a detection list for detecting cheating from a set of detection conditions in response to a supervisor's selection.

부정 행위를 감지(또는 판단)하기 위한 감지 목록을 예시하면 표 2와 같다. An example of a detection list for detecting (or judging) cheating is shown in Table 2.

유형 category 부정행위(의심행동)cheating (suspicious behavior) 세부요인detailed factor 응시자 행동Candidate Behavior 불필요한 얼굴 움직임(또는 불필요한 곳 응시)Unnecessary facial movements (or unnecessary gaze) 시간(예: 1~4초)time (e.g. 1-4 seconds) 손올림(또는 들기, 뻗기, 손바닥 뒤집기 등)Raising hands (or lifting, reaching, palm flipping, etc.) 시간(예: 1~4초)time (e.g. 1-4 seconds) 얼굴에 손을 댐(또는 어루만짐, 턱 굄 등)Touching the face (or caressing, chin chin, etc.) 카메라를 응시함staring at the camera 시간(예: 1~4초)time (e.g. 1-4 seconds) 생리 현상(예: 재채기, 하품, 스트레칭 등)Physiological phenomena (e.g. sneezing, yawning, stretching, etc.) 지침 위반violation of guidelines 특정 부위 미감지(예: 양손, 얼굴, 몸, 귀 등)Certain parts not detected (e.g. both hands, face, body, ears, etc.) 신체부위, 시간(예: 1~10초)body part, time (e.g. 1-10 seconds) 의심 물품(또는 금지 물품) 인식Suspicious (or Prohibited) Recognition 객체 종류, 시간(예: 1~4초)Object type, time (e.g. 1-4 seconds) 시험 중 자리 이탈Leaving seat during exam 시간(예: 1~4초)time (e.g. 1-4 seconds) 공조cooperate 응시자 외의 제3자 포착Capture a third party other than the test taker 시간(예: 1~4초)time (e.g. 1-4 seconds) 평균 이상의 소음 감지Above average noise detection 소음Db, 시간(예: 1~4초)Noise Db, time (e.g. 1-4 seconds)

지능형 영상 분석 장치(300)는 상기 표 2에서와 같은 감지 목록에 기반하여, 부정 행위를 감지할 수 있다. 예를 들면, 지능형 영상 분석 장치(300)는 입력 데이터(예: 실시간 응시자 이미지 및/또는 영상) 중 분석 결과가 부정 행위에 해당하는 이미지들(1210, 1215, 1220, 1225, 1230, 1235, 1240, 1245, 1250)로부터 추출된 데이터와 일치하는 경우, 부정 행위라고 간주할 수 있다. The intelligent video analysis device 300 may detect fraud based on the detection list as shown in Table 2 above. For example, the intelligent video analysis device 300 selects images (1210, 1215, 1220, 1225, 1230, 1235, and 1240) for which analysis results are misconduct among input data (eg, real-time image and/or video). , 1245, 1250), it can be regarded as cheating.

이와 같이 생성된 감지 조건들의 집합은 입력 데이터에 기반한 반복 학습(예: 머신 러닝)을 통해 지속적으로 업데이트될 수 있다. 따라서 기존의 감지 조건들의 집합도 편집이 가능할 뿐만 아니라 연관된 신규 감지 조건의 추가도 가능하기 때문에, 응시자의 행동 및 주변 환경을 고려하여 다양하면서도 정확도가 높은 감지 조건들을 제공할 수 있다. The set of detection conditions generated in this way may be continuously updated through repetitive learning (eg, machine learning) based on input data. Therefore, since a set of existing sensing conditions can be edited and a new sensing condition related to it can be added, various sensing conditions with high accuracy can be provided in consideration of the candidate's behavior and the surrounding environment.

본 문서에 개시된 다양한 실시예들에 따른 전자 장치는 다양한 형태의 장치가 될 수 있다. 전자 장치는, 예를 들면, 휴대용 통신 장치(예: 스마트폰), 컴퓨터 장치, 휴대용 멀티미디어 장치, 휴대용 의료 기기, 카메라, 웨어러블 장치, 또는 가전 장치를 포함할 수 있다. 본 문서의 실시예에 따른 전자 장치는 전술한 기기들에 한정되지 않는다.Electronic devices according to various embodiments disclosed in this document may be devices of various types. The electronic device may include, for example, a portable communication device (eg, a smart phone), a computer device, a portable multimedia device, a portable medical device, a camera, a wearable device, or a home appliance. An electronic device according to an embodiment of the present document is not limited to the aforementioned devices.

본 문서의 다양한 실시예들 및 이에 사용된 용어들은 본 문서에 기재된 기술적 특징들을 특정한 실시예들로 한정하려는 것이 아니며, 해당 실시예의 다양한 변경, 균등물, 또는 대체물을 포함하는 것으로 이해되어야 한다. 도면의 설명과 관련하여, 유사한 또는 관련된 구성요소에 대해서는 유사한 참조 부호가 사용될 수 있다. 아이템에 대응하는 명사의 단수 형은 관련된 문맥상 명백하게 다르게 지시하지 않는 한, 상기 아이템 한 개 또는 복수 개를 포함할 수 있다. 본 문서에서, "A 또는 B", "A 및 B 중 적어도 하나", "A 또는 B 중 적어도 하나", "A, B 또는 C", "A, B 및 C 중 적어도 하나", 및 "A, B, 또는 C 중 적어도 하나"와 같은 문구들 각각은 그 문구들 중 해당하는 문구에 함께 나열된 항목들 중 어느 하나, 또는 그들의 모든 가능한 조합을 포함할 수 있다. "제 1", "제 2", 또는 "첫째" 또는 "둘째"와 같은 용어들은 단순히 해당 구성요소를 다른 해당 구성요소와 구분하기 위해 사용될 수 있으며, 해당 구성요소들을 다른 측면(예: 중요성 또는 순서)에서 한정하지 않는다. 어떤(예: 제 1) 구성요소가 다른(예: 제 2) 구성요소에, "기능적으로" 또는 "통신적으로"라는 용어와 함께 또는 이런 용어 없이, "커플드" 또는 "커넥티드"라고 언급된 경우, 그것은 상기 어떤 구성요소가 상기 다른 구성요소에 직접적으로(예: 유선으로), 무선으로, 또는 제 3 구성요소를 통하여 연결될 수 있다는 것을 의미한다.Various embodiments of this document and terms used therein are not intended to limit the technical features described in this document to specific embodiments, but should be understood to include various modifications, equivalents, or substitutes of the embodiments. In connection with the description of the drawings, like reference numerals may be used for like or related elements. The singular form of a noun corresponding to an item may include one item or a plurality of items, unless the relevant context clearly dictates otherwise. In this document, "A or B", "at least one of A and B", "at least one of A or B", "A, B or C", "at least one of A, B and C", and "A Each of the phrases such as "at least one of , B, or C" may include any one of the items listed together in that phrase, or all possible combinations thereof. Terms such as "first", "second", or "first" or "secondary" may simply be used to distinguish a given component from other corresponding components, and may be used to refer to a given component in another aspect (eg, importance or order) is not limited. A (e.g., first) component is said to be "coupled" or "connected" to another (e.g., second) component, with or without the terms "functionally" or "communicatively." When mentioned, it means that the certain component may be connected to the other component directly (eg by wire), wirelessly, or through a third component.

본 문서의 다양한 실시예들에서 사용된 용어 "모듈"은 하드웨어, 소프트웨어 또는 펌웨어로 구현된 유닛을 포함할 수 있으며, 예를 들면, 로직, 논리 블록, 부품, 또는 회로와 같은 용어와 상호 호환적으로 사용될 수 있다. 모듈은, 일체로 구성된 부품 또는 하나 또는 그 이상의 기능을 수행하는, 상기 부품의 최소 단위 또는 그 일부가 될 수 있다. 예를 들면, 일실시예에 따르면, 모듈은 ASIC(application-specific integrated circuit)의 형태로 구현될 수 있다. The term "module" used in various embodiments of this document may include a unit implemented in hardware, software, or firmware, and is interchangeable with terms such as, for example, logic, logical blocks, parts, or circuits. can be used as A module may be an integrally constructed component or a minimal unit of components or a portion thereof that performs one or more functions. For example, according to one embodiment, the module may be implemented in the form of an application-specific integrated circuit (ASIC).

본 문서의 다양한 실시예들은 기기(machine)(예: 전자 장치(101))에 의해 읽을 수 있는 저장 매체(storage medium)(예: 내장 메모리(136) 또는 외장 메모리(138))에 저장된 하나 이상의 명령어들을 포함하는 소프트웨어(예: 프로그램(140))로서 구현될 수 있다. 예를 들면, 기기(예: 전자 장치(101))의 프로세서(예: 프로세서(120))는, 저장 매체로부터 저장된 하나 이상의 명령어들 중 적어도 하나의 명령을 호출하고, 그것을 실행할 수 있다. 이것은 기기가 상기 호출된 적어도 하나의 명령어에 따라 적어도 하나의 기능을 수행하도록 운영되는 것을 가능하게 한다. 상기 하나 이상의 명령어들은 컴파일러에 의해 생성된 코드 또는 인터프리터에 의해 실행될 수 있는 코드를 포함할 수 있다. 기기로 읽을 수 있는 저장 매체는, 비일시적(non-transitory) 저장 매체의 형태로 제공될 수 있다. 여기서, ‘비일시적’은 저장 매체가 실재(tangible)하는 장치이고, 신호(signal)(예: 전자기파)를 포함하지 않는다는 것을 의미할 뿐이며, 이 용어는 데이터가 저장 매체에 반영구적으로 저장되는 경우와 임시적으로 저장되는 경우를 구분하지 않는다.Various embodiments of this document describe one or more stored in a storage medium (eg, internal memory 136 or external memory 138) readable by a machine (eg, electronic device 101). It may be implemented as software (eg, program 140) containing instructions. For example, a processor (eg, the processor 120 ) of a device (eg, the electronic device 101 ) may call at least one command among one or more instructions stored from a storage medium and execute it. This enables the device to be operated to perform at least one function according to the at least one command invoked. The one or more instructions may include code generated by a compiler or code executable by an interpreter. The device-readable storage medium may be provided in the form of a non-transitory storage medium. Here, 'non-temporary' only means that the storage medium is a tangible device and does not contain a signal (e.g. electromagnetic wave), and this term refers to the case where data is stored semi-permanently in the storage medium. It does not discriminate when it is temporarily stored.

일실시예에 따르면, 본 문서에 개시된 다양한 실시예들에 따른 방법은 컴퓨터 프로그램 제품(computer program product)에 포함되어 제공될 수 있다. 컴퓨터 프로그램 제품은 상품으로서 판매자 및 구매자 간에 거래될 수 있다. 컴퓨터 프로그램 제품은 기기로 읽을 수 있는 저장 매체(예: compact disc read only memory(CD-ROM))의 형태로 배포되거나, 또는 어플리케이션 스토어(예: 플레이 스토어^TM)를 통해 또는 두 개의 사용자 장치들(예: 스마트 폰들) 간에 직접, 온라인으로 배포(예: 다운로드 또는 업로드)될 수 있다. 온라인 배포의 경우에, 컴퓨터 프로그램 제품의 적어도 일부는 제조사의 서버, 어플리케이션 스토어의 서버, 또는 중계 서버의 메모리와 같은 기기로 읽을 수 있는 저장 매체에 적어도 일시 저장되거나, 임시적으로 생성될 수 있다.According to one embodiment, the method according to various embodiments disclosed in this document may be included and provided in a computer program product. Computer program products may be traded between sellers and buyers as commodities. A computer program product is distributed in the form of a device-readable storage medium (eg compact disc read only memory (CD-ROM)), or through an application store (eg Play Store ^TM ) or on two user devices ( It can be distributed (eg downloaded or uploaded) online, directly between smart phones. In the case of online distribution, at least part of the computer program product may be temporarily stored or temporarily created in a device-readable storage medium such as a manufacturer's server, an application store server, or a relay server's memory.

다양한 실시예들에 따르면, 상기 기술한 구성요소들의 각각의 구성요소(예: 모듈 또는 프로그램)는 단수 또는 복수의 개체를 포함할 수 있으며, 복수의 개체 중 일부는 다른 구성요소에 분리 배치될 수도 있다. 다양한 실시예들에 따르면, 전술한 해당 구성요소들 중 하나 이상의 구성요소들 또는 동작들이 생략되거나, 또는 하나 이상의 다른 구성요소들 또는 동작들이 추가될 수 있다. 대체적으로 또는 추가적으로, 복수의 구성요소들(예: 모듈 또는 프로그램)은 하나의 구성요소로 통합될 수 있다. 이런 경우, 통합된 구성요소는 상기 복수의 구성요소들 각각의 구성요소의 하나 이상의 기능들을 상기 통합 이전에 상기 복수의 구성요소들 중 해당 구성요소에 의해 수행되는 것과 동일 또는 유사하게 수행할 수 있다. 다양한 실시예들에 따르면, 모듈, 프로그램 또는 다른 구성요소에 의해 수행되는 동작들은 순차적으로, 병렬적으로, 반복적으로, 또는 휴리스틱하게 실행되거나, 상기 동작들 중 하나 이상이 다른 순서로 실행되거나, 생략되거나, 또는 하나 이상의 다른 동작들이 추가될 수 있다.According to various embodiments, each component (eg, module or program) of the above-described components may include a single object or a plurality of entities, and some of the plurality of entities may be separately disposed in other components. there is. According to various embodiments, one or more components or operations among the aforementioned corresponding components may be omitted, or one or more other components or operations may be added. Alternatively or additionally, a plurality of components (eg modules or programs) may be integrated into a single component. In this case, the integrated component may perform one or more functions of each of the plurality of components identically or similarly to those performed by a corresponding component of the plurality of components prior to the integration. . According to various embodiments, the actions performed by a module, program, or other component are executed sequentially, in parallel, iteratively, or heuristically, or one or more of the actions are executed in a different order, or omitted. or one or more other actions may be added.

Claims

In the apparatus for intelligent image analysis,
at least one processor; and
contains memory;
The memory, when executed, the at least one processor,
obtaining at least one component by analyzing at least one input image;
identify a plurality of sensing conditions based on the at least one component;
Based on the plurality of sensing conditions, generating a plurality of sensing condition sets;
An apparatus for intelligent video analysis, which stores instructions for detecting whether or not there is an anomaly in a target image based on at least one of the plurality of detection condition sets.

The method of claim 1, wherein the instructions, the at least one processor,
An apparatus for intelligent image analysis configured to generate the plurality of detection condition sets through iterative learning based on the plurality of detection conditions.

The method of claim 2, wherein the at least one component,
At least one of first data representing a body part included in the plurality of input images, second data representing a motion of an object, third data representing an object, or fourth data representing a time related to the motion of the object Including, a device for intelligent video analysis.

The method of claim 3, wherein the instructions, the at least one processor,
Combining the first data to the fourth data included in the at least one component,
An apparatus for intelligent image analysis, configured to generate the plurality of detection conditions through the combination.

The method of claim 4, wherein the instructions, the at least one processor,
Apparatus for intelligent image analysis, configured to generate the plurality of detection conditions in consideration of overlapping detection conditions through the combination.

The method of claim 4, wherein the instructions, the at least one processor,
The apparatus for intelligent image analysis, configured to recommend a detection condition set having the highest similarity comparison result score with the generated plurality of detection conditions among the plurality of detection condition sets.

The method of claim 4, wherein the instructions, the at least one processor,
Creating a new sensing condition associated with the plurality of sensing conditions,
An apparatus for intelligent video analysis, set to recommend the new detection condition.

The method of claim 4, wherein the instructions, the at least one processor,
An apparatus for intelligent video analysis configured to recommend a detection condition set based on a frequency of use among the plurality of detection condition sets.

The method of claim 6, wherein the instructions, the at least one processor,
An apparatus for intelligent video analysis configured to recommend a detection condition set based on a frequency of use and a number of recommendations among the plurality of detection condition sets.

The method of claim 1, wherein the instructions, the at least one processor,
The apparatus for intelligent video analysis configured to detect the presence or absence of abnormalities in the target image based on a recommended detection condition set among the plurality of detection condition sets.

In the method for intelligent image analysis,
obtaining at least one component by analyzing at least one input image;
identifying a plurality of sensing conditions based on the at least one component;
generating a plurality of sensing condition sets based on the plurality of sensing conditions; and
The method for intelligent video analysis comprising an operation of detecting whether there is an abnormality in the target image based on at least one of the plurality of detection condition sets.

The method of claim 11, wherein the generating of the plurality of detection condition sets comprises:
And generating the plurality of detection condition sets through iterative learning based on the plurality of detection conditions.

The method of claim 11, wherein the at least one component,
At least one of first data representing a body part included in the plurality of input images, second data representing a motion of an object, third data representing an object, or fourth data representing a time related to the motion of the object Including, a method for intelligent video analysis.

According to claim 13,
combining the first data to the fourth data included in the at least one element; and
Further comprising generating the plurality of detection conditions through the combination, the method for intelligent image analysis.

15. The method of claim 14, wherein the operation of generating the plurality of sensing conditions,
And generating the plurality of detection conditions in consideration of overlapping detection conditions through the combination.

According to claim 14,
and recommending, among the plurality of sensing condition sets, a sensing condition set having the highest similarity comparison result score with the generated plurality of sensing conditions.

According to claim 14,
generating a new sensing condition associated with the plurality of sensing conditions; and
Method for intelligent image analysis, further comprising the operation of recommending the new detection condition.

According to claim 14,
The method for intelligent video analysis further comprising an operation of recommending a detection condition set based on a frequency of use from among the plurality of detection condition sets.

The method of claim 16 , further comprising recommending a sensing condition set based on a frequency of use and a number of recommendations among the plurality of sensing condition sets.

The method of claim 11, wherein detecting whether or not there is an abnormality in the target image comprises:
and detecting whether there is an abnormality in the target image based on a recommended sensing condition set among the plurality of sensing condition sets.