KR102331453B1

KR102331453B1 - System for vehicle number plate recognition using neural network and its method

Info

Publication number: KR102331453B1
Application number: KR1020200023407A
Authority: KR
Inventors: 고재필
Original assignee: 금오공과대학교 산학협력단
Priority date: 2020-02-26
Filing date: 2020-02-26
Publication date: 2021-11-25
Also published as: KR20210108615A

Abstract

본 발명은 신경망을 이용한 번호판 인식 시스템 및 그 방법에 관한 것으로서, a) 차량 영상이 획득되면, 상기 차량 영상에서 번호판 영역을 포함하는 번호판 영상을 검출하는 단계; b)상기 검출된 번호판 영상을 이미지 변환 알고리즘을 이용하여 기설정된 폰트로 합성하여 번호판 합성 영상을 생성하는 단계; c) 상기 번호판 합성 영상과 목표 차량번호를 쌍으로 하는 학습 데이터를 이용하여 번호판 인식 모델을 학습하는 단계; 및 d) 적어도 하나 이상의 문자와 숫자로 이루어진 차량 영상이 입력되면, 상기 이미지 변환 알고리즘을 통해 생성된 번호판 합성 영상이 상기 학습된 번호판 인식 모델에 입력되어 차량 번호를 인식한 후 상기 인식된 차량 번호를 제공하는 단계를 포함하는 방법일 수 있다. The present invention relates to a license plate recognition system and method using a neural network, comprising: a) detecting a license plate image including a license plate area from the vehicle image when a vehicle image is obtained; b) generating a license plate composite image by synthesizing the detected license plate image into a preset font using an image conversion algorithm; c) learning the license plate recognition model using the learning data pairing the license plate composite image and the target vehicle number; and d) when a vehicle image composed of at least one or more letters and numbers is input, the license plate composite image generated through the image conversion algorithm is input to the learned license plate recognition model to recognize the vehicle number and then the recognized vehicle number It may be a method comprising the step of providing.

Description

License plate recognition system and method using neural network { SYSTEM FOR VEHICLE NUMBER PLATE RECOGNITION USING NEURAL NETWORK AND ITS METHOD }

본 발명은 이미지 변환 알고리즘을 이용하여 단일 폰트로 합성된 번호판 합성 영상을 이용하여 번호판 인식 모델을 학습시킬 수 있는 신경망을 이용한 텍스트 인식 시스템 및 그 방법에 관한 것이다.The present invention relates to a text recognition system and method using a neural network capable of learning a license plate recognition model using a license plate composite image synthesized into a single font using an image conversion algorithm.

자동차 번호판 인식 기술은 무인 주차 관리 시스템, 불법 주차 차량 적발 시스템, 고속도로에서의 속도 위반 자동 과태료 시스템, 도난 차량의 불법 번호판 적발 시스템과 같은 곳에 응용되고 있으며, 보다 처리 속도가 빠르고 정확하게 인식하는 시스템을 구성하기 위해서는 필수적인 기술이다. License plate recognition technology is being applied in places such as unmanned parking management system, illegally parked vehicle detection system, automatic speeding fine system on highway, and illegal license plate detection system of stolen vehicle, and constitutes a system that recognizes more quickly and accurately. This is an essential skill to do.

최근에는 재난, 치안용 무인 환경 시스템을 활용하여 교통 정보 확인 및 감시, 추적 임무를 수행함에 따라 무인 환경에서 번호판 검출 및 인식이 가능한 시스템이 요구되고 있다.Recently, as an unmanned environment system for disaster and security is used to check, monitor, and track traffic information, a system capable of detecting and recognizing license plates in an unmanned environment is required.

종래의 차량 번호판 인식 시스템은 카메라가 고정된 환경에서 제한적인 각도 내의 번호판 영역을 검출한다는 한계를 가지며, 딥러닝 알고리즘을 이용하는 경우에도 최적의 정확도를 위한 다수의 샘플데이터를 생성하는 데 어려움이 있다. The conventional license plate recognition system has a limitation in that the camera detects a license plate area within a limited angle in a fixed environment, and it is difficult to generate a large number of sample data for optimal accuracy even when using a deep learning algorithm.

일반적으로 차량 번호판 인식 시스템은 영상획득, 번호판 영역 검출, 문자인식 등 3단계 과정으로 진행된다. 이때, 번호판 인식 기술에는 원형 정합 방법, 구문론적 방법, 신경망을 이용한 방법 등을 사용할 수 있다. In general, the vehicle license plate recognition system proceeds in three steps: image acquisition, license plate area detection, and character recognition. In this case, the license plate recognition technology may use a circular matching method, a syntactic method, a method using a neural network, and the like.

특히, 신경망을 이용한 번호판 인식 방법은 문자 영상의 화소 값을 그대로 입력 패턴으로 정하는 비특징 추출 방법과 문자 영상으로부터 특징을 추출하여 사용하는 특징 추출 방법이 있다. 특징 추출 방법에 비해 비특징 추출 방법은 특징을 추출하는 시간이 소모되지 않기 때문에 처리 시간이 짧지만, 문자를 이루는 영상 정보와 인식 대상의 패턴 수가 많은 한글 문자 인식에서는 인식 수행을 위한 계산량이 많아지고 입력 노드의 수가 많아지게 된다.In particular, the license plate recognition method using a neural network includes a non-feature extraction method in which a pixel value of a text image is determined as an input pattern as it is, and a feature extraction method in which a feature is extracted from the text image and used. Compared to the feature extraction method, the non-feature extraction method takes less time to process because it does not consume the time to extract the feature, but in the Korean character recognition with a large number of image information constituting the text and the number of patterns to be recognized, the amount of calculation required to perform recognition increases. The number of input nodes increases.

한편, 종래 기술로서 문자 인식의 효율을 높이고자 자소 분리 방법을 이용한 차량 번호판의 문자 인식 연구가 있다. 그러나 이는 실제 번호판의 훼손, 카메라의 성능, 기타 여러 가지 조건에 의해서 번호판 영상에 많은 잡음이 포함되므로 한글 문자를 자소 분리하는 것이 사실상 어렵다는 문제점이 있다. On the other hand, as a prior art, there is a study on character recognition of a license plate using a grapheme separation method in order to increase the efficiency of character recognition. However, there is a problem in that it is practically difficult to separate Hangul characters from alphabets because a lot of noise is included in the license plate image due to damage to the actual license plate, camera performance, and various other conditions.

또한, 종래의 신경망을 이용한 번호판 인식 방법은 검출된 번호판 영역에 대해 문자를 인식하는 과정에서 다양한 환경에서의 번호판 인식을 위해 다양한 폰트로 번호판 이미지를 학습하게 된다. In addition, the license plate recognition method using a conventional neural network learns license plate images with various fonts for license plate recognition in various environments in the process of recognizing characters for the detected license plate area.

이때, 하나의 번호판 이미지에 대해 조도 변화, 픽셀 이동 변화, 문자의 두께와 크기 변화, 번호판의 배경이나 문자의 색상 변화 등을 통해 대량의 학습 데이터를 생성할 수 있으므로 필요한 학습 데이터가 급격히 증가하게 되고, 대량의 학습 데이터를 학습하기 위한 비용이나 시간이 많이 소요되고, 그로 인해 새로운 폰트를 추가하기 매우 어렵다는 문제점이 있다. At this time, as a large amount of learning data can be generated for one license plate image through changes in illuminance, pixel movement, thickness and size changes of characters, and color changes of license plate backgrounds or characters, the required learning data increases rapidly. , there is a problem in that it takes a lot of money or time to learn a large amount of learning data, and therefore it is very difficult to add a new font.

대한민국 공개특허공보 제10-2019-0095571호(발명의 명칭: 차량번호판 자동인식 방법)Republic of Korea Patent Publication No. 10-2019-0095571 (Title of the invention: vehicle license plate automatic recognition method)

본 발명은 전술한 문제점을 해결하기 위하여, 본 발명의 일 실시예에 따라 다양한 폰트의 번호판 영상을 단일 폰트로 합성한 번호판 합성 영상을 이용하여 번호판 인식 모델을 학습함으로써 기존의 번호판 인식 시스템에 비해 적은 수의 학습 데이터를 이용하여 인식 성능을 높일 수 있도록 하는 것에 목적이 있다.In order to solve the above problems, the present invention learns a license plate recognition model using a license plate composite image that synthesizes license plate images of various fonts into a single font according to an embodiment of the present invention, thereby reducing the number of plates compared to the existing license plate recognition system. The purpose is to improve recognition performance by using numerical training data.

다만, 본 실시예가 이루고자 하는 기술적 과제는 상기된 바와 같은 기술적 과제로 한정되지 않으며, 또 다른 기술적 과제들이 존재할 수 있다.However, the technical task to be achieved by the present embodiment is not limited to the technical task as described above, and other technical tasks may exist.

상기한 기술적 과제를 달성하기 위한 기술적 수단으로서 본 발명의 일 실시예에 따른 차량 번호판 인식 시스템에 의해 수행되는 신경망을 이용한 번호판 인식 방법에 있어서, a) 차량 영상이 획득되면, 상기 차량 영상에서 번호판 영역을 포함하는 번호판 영상을 검출하는 단계; b)상기 검출된 번호판 영상을 이미지 변환 알고리즘을 이용하여 기설정된 폰트로 합성하여 번호판 합성 영상을 생성하는 단계; c) 상기 번호판 합성 영상과 목표 차량번호를 쌍으로 하는 학습 데이터를 이용하여 번호판 인식 모델을 학습하는 단계; 및 d) 적어도 하나 이상의 문자와 숫자로 이루어진 차량 영상이 입력되면, 상기 이미지 변환 알고리즘을 통해 생성된 번호판 합성 영상이 상기 학습된 번호판 인식 모델에 입력되어 차량 번호를 인식한 후 상기 인식된 차량 번호를 제공하는 단계를 포함하는 방법일 수 있다. In the license plate recognition method using a neural network performed by the vehicle license plate recognition system according to an embodiment of the present invention as a technical means for achieving the above technical problem, a) when a vehicle image is obtained, the license plate area in the vehicle image Detecting a license plate image comprising a; b) generating a license plate composite image by synthesizing the detected license plate image into a preset font using an image conversion algorithm; c) learning the license plate recognition model using the learning data pairing the license plate composite image and the target vehicle number; and d) when a vehicle image composed of at least one or more letters and numbers is input, the license plate composite image generated through the image conversion algorithm is input to the learned license plate recognition model to recognize the vehicle number and then the recognized vehicle number It may be a method comprising the step of providing.

상기 이미지 변환 알고리즘은 이미지 투 이미지 변환(Image to Image Translation, Pix2Pix) 알고리즘을 사용할 수 있다. The image conversion algorithm may use an Image to Image Translation (Pix2Pix) algorithm.

상기 번호판 영상을 상기 이미지 변환 알고리즘을 통해 기설정된 두께와 크기를 갖는 폰트로 변환하고, 어텐션 알고리즘(attention algorithm) 기반의 양방향 LSTM(Bidirectional Long Short-Term Memory, BLSTM )을 적용하여 글자 또는 숫자의 위치를 결정하여 번호판 합성 영상을 생성하는 단계를 더 포함하는 할 수 있다. The license plate image is converted to a font having a preset thickness and size through the image conversion algorithm, and Bidirectional Long Short-Term Memory (BLSTM) based on an attention algorithm is applied to position letters or numbers It may further include the step of generating a license plate composite image by determining the.

상기 번호판 영상에서 배경 영역과 숫자 또는 글자를 포함하는 문자 영역을 구분하여 검출하고, 상기 배역 영역을 기설정된 색상으로 변환하고, 상기 문자 영역은 상기 배경 영역과 다른 색상을 갖도록 변환하는 것이다. In the license plate image, a background area and a text area including numbers or letters are distinguished and detected, the casting area is converted to a preset color, and the text area is converted to have a color different from the background area.

이때, 상기 결정된 글자 또는 숫자의 위치마다 구분자로 표시하고, 적어도 하나 이상의 글자 또는 숫자가 조합된 차량 번호가 완성되면 종료 구분자로 표시할 수 있다. 상기 구분자 또는 종료 구분자는 특수 문자나 특수 기호 중 어느 하나 이상을 사용할 수 있다. In this case, each determined position of the letter or number may be displayed as a separator, and when the vehicle number in which at least one or more letters or numbers are combined is completed, it may be displayed as an end separator. The delimiter or the end delimiter may use any one or more of a special character or a special symbol.

상기 번호판 인식 모델은, 상기 번호판 합성 영상에서 특징을 추출하고, 상기 추출한 특징을 순차적으로 통합하여 사용하는 CNN-LSTM(convolutional neural network- Long Short-Term Memory) 복합모델을 이용할 수 있다.The license plate recognition model may use a convolutional neural network-Long Short-Term Memory (CNN-LSTM) complex model that extracts features from the license plate composite image, and sequentially integrates and uses the extracted features.

전술한 본 발명의 과제 해결 수단에 의하면, 다양한 폰트, 배경색상, 문자 색상을 가지는 번호판 영상을 이미지 변환 알고리즘을 통해 단일 폰트, 단일 배경색상과 문자 색상을 가지도록 합성한 번호판 합성 영상을 이용하여 번호판 인식 모델을 학습함으로써 기존의 번호판 인식 시스템에 비해 학습 데이터 생성을 위한 시간 및 노력이 최솨될 수 있고, 새로은 폰트를 추가하여 학습할 필요가 없을 뿐만 아니라 훨씬 적은 학습 데이터를 이용하여 레이블링이 가능하고, 학습량이 감소할수록 학습 데이터 구성에 필요한 비용이 감소할 수 있다.According to the problem solving means of the present invention described above, using a license plate composite image synthesized to have a single font, a single background color and a character color through an image conversion algorithm of license plate images having various fonts, background colors, and character colors By learning the recognition model, the time and effort for generating learning data can be minimized compared to the existing license plate recognition system, and there is no need to learn by adding a new font, and labeling is possible using much less learning data, As the learning amount decreases, the cost required to construct the training data may decrease.

도 1은 본 발명의 일 실시예에 따른 차량 번호판 인식 시스템의 구성을 나타낸 도면이다.
도 2는 본 발명의 일 실시예에 따른 신경망을 이용한 번호판 인식 방법을 설명하는 순서도이다.
도 3은 본 발명의 일 실시예에 따른 번호판 인식 모델의 학습 과정을 설명하는 도면이다.
도 4는 본 발명의 일 실시예에 따른 번호판 인식 모델에 의한 텍스트 인식 과정을 설명하는 예시도이다.1 is a view showing the configuration of a vehicle license plate recognition system according to an embodiment of the present invention.
2 is a flowchart illustrating a license plate recognition method using a neural network according to an embodiment of the present invention.
3 is a view for explaining a learning process of the license plate recognition model according to an embodiment of the present invention.
4 is an exemplary diagram illustrating a text recognition process by a license plate recognition model according to an embodiment of the present invention.

아래에서는 첨부한 도면을 참조하여 본 발명이 속하는 기술 분야에서 통상의 지식을 가진 자가 용이하게 실시할 수 있도록 본 발명의 실시예를 상세히 설명한다. 그러나 본 발명은 여러 가지 상이한 형태로 구현될 수 있으며 여기에서 설명하는 실시예에 한정되지 않는다. 그리고 도면에서 본 발명을 명확하게 설명하기 위해서 설명과 관계없는 부분은 생략하였으며, 명세서 전체를 통하여 유사한 부분에 대해서는 유사한 도면 부호를 붙였다.DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS Hereinafter, embodiments of the present invention will be described in detail with reference to the accompanying drawings so that those of ordinary skill in the art can easily implement them. However, the present invention may be embodied in many different forms and is not limited to the embodiments described herein. And in order to clearly explain the present invention in the drawings, parts irrelevant to the description are omitted, and similar reference numerals are attached to similar parts throughout the specification.

명세서 전체에서, 어떤 부분이 다른 부분과 "연결"되어 있다고 할 때, 이는 "직접적으로 연결"되어 있는 경우뿐 아니라, 그 중간에 다른 소자를 사이에 두고 "전기적으로 연결"되어 있는 경우도 포함한다. 또한 어떤 부분이 어떤 구성요소를 "포함"한다고 할 때, 이는 특별히 반대되는 기재가 없는 한 다른 구성요소를 제외하는 것이 아니라 다른 구성요소를 더 포함할 수 있는 것을 의미하며, 하나 또는 그 이상의 다른 특징이나 숫자, 단계, 동작, 구성요소, 부분품 또는 이들을 조합한 것들의 존재 또는 부가 가능성을 미리 배제하지 않는 것으로 이해되어야 한다.Throughout the specification, when a part is "connected" with another part, this includes not only the case of being "directly connected" but also the case of being "electrically connected" with another element interposed therebetween. . Also, when a part "includes" a component, it means that other components may be further included, rather than excluding other components, unless otherwise stated, and one or more other features However, it is to be understood that the existence or addition of numbers, steps, operations, components, parts, or combinations thereof is not precluded in advance.

본 명세서에서 ‘단말’은 휴대성 및 이동성이 보장된 무선 통신 장치일 수 있으며, 예를 들어 스마트 폰, 태블릿 PC 또는 노트북 등과 같은 모든 종류의 핸드헬드(Handheld) 기반의 무선 통신 장치일 수 있다. 또한, ‘단말’은 네트워크를 통해 다른 단말 또는 서버 등에 접속할 수 있는 PC 등의 유선 통신 장치인 것도 가능하다. 또한, 네트워크는 단말들 및 서버들과 같은 각각의 노드 상호 간에 정보 교환이 가능한 연결 구조를 의미하는 것으로, 근거리 통신망(LAN: Local Area Network), 광역 통신망(WAN: Wide Area Network), 인터넷 (WWW: World Wide Web), 유무선 데이터 통신망, 전화망, 유무선 텔레비전 통신망 등을 포함한다. In the present specification, a 'terminal' may be a wireless communication device with guaranteed portability and mobility, for example, any type of handheld-based wireless communication device such as a smart phone, a tablet PC, or a notebook computer. In addition, the 'terminal' may be a wired communication device such as a PC that can connect to another terminal or server through a network. In addition, the network refers to a connection structure capable of exchanging information between each node, such as terminals and servers, and includes a local area network (LAN), a wide area network (WAN), and the Internet (WWW). : World Wide Web), wired and wireless data networks, telephone networks, and wired and wireless television networks.

무선 데이터 통신망의 일례에는 3G, 4G, 5G, 3GPP(3rd Generation Partnership Project), LTE(Long Term Evolution), WIMAX(World Interoperability for Microwave Access), 와이파이(Wi-Fi), 블루투스 통신, 적외선 통신, 초음파 통신, 가시광 통신(VLC: Visible Light Communication), 라이파이(LiFi) 등이 포함되나 이에 한정되지는 않는다.Examples of wireless data communication networks include 3G, 4G, 5G, 3rd Generation Partnership Project (3GPP), Long Term Evolution (LTE), World Interoperability for Microwave Access (WIMAX), Wi-Fi, Bluetooth communication, infrared communication, ultrasound Communication, Visible Light Communication (VLC), LiFi, etc. are included, but are not limited thereto.

이하의 실시예는 본 발명의 이해를 돕기 위한 상세한 설명이며, 본 발명의 권리 범위를 제한하는 것이 아니다. 따라서 본 발명과 동일한 기능을 수행하는 동일 범위의 발명 역시 본 발명의 권리 범위에 속할 것이다.The following examples are detailed descriptions to help the understanding of the present invention, and do not limit the scope of the present invention. Accordingly, an invention of the same scope performing the same function as the present invention will also fall within the scope of the present invention.

이하 첨부된 도면을 참고하여 본 발명의 일 실시예를 상세히 설명하기로 한다.Hereinafter, an embodiment of the present invention will be described in detail with reference to the accompanying drawings.

도 1은 본 발명의 일 실시예에 따른 차량 번호판 인식 시스템의 구성을 나타낸 도면이다.1 is a view showing the configuration of a vehicle license plate recognition system according to an embodiment of the present invention.

도 1을 참조하면, 차량 번호판 인식 시스템(100)은 통신 모듈(110), 메모리(120), 프로세서(130) 및 데이터베이스(140)를 포함한다.Referring to FIG. 1 , the license plate recognition system 100 includes a communication module 110 , a memory 120 , a processor 130 , and a database 140 .

상세히, 통신 모듈(110)은 통신망(300)과 연동하여 차량 번호판 인식 시스템(100)으로 송수신되는 신호를 패킷 데이터 형태로 제공하는 데 필요한 통신 인터페이스를 제공한다. 여기서, 통신 모듈(110)은 다른 네트워크 장치와 유무선 연결을 통해 제어 신호 또는 데이터 신호와 같은 신호를 송수신하기 위해 필요한 하드웨어 및 소프트웨어를 포함하는 장치일 수 있다.In detail, the communication module 110 provides a communication interface necessary to provide a signal transmitted and received to the license plate recognition system 100 in the form of packet data in conjunction with the communication network 300 . Here, the communication module 110 may be a device including hardware and software necessary for transmitting and receiving signals such as control signals or data signals through wired/wireless connection with other network devices.

메모리(120)는 신경망을 이용한 번호판 인식 방법을 수행하기 위한 프로그램이 기록된다. 또한, 프로세서(130)가 처리하는 데이터를 일시적 또는 영구적으로 저장하는 기능을 수행한다. 여기서, 메모리(120)는 휘발성 저장 매체(volatile storage media) 또는 비휘발성 저장 매체(non-volatile storage media)를 포함할 수 있으나, 본 발명의 범위가 이에 한정되는 것은 아니다.In the memory 120, a program for performing a license plate recognition method using a neural network is recorded. In addition, the processor 130 performs a function of temporarily or permanently storing the processed data. Here, the memory 120 may include a volatile storage medium or a non-volatile storage medium, but the scope of the present invention is not limited thereto.

프로세서(130)는 번호판 인식 방법을 실행하기 위한 전체 과정을 제어하는 것으로서, 번호판 영상을 단일 폰트를 갖는 번호판 합성 영상으로 이미지 변환하고, 번호판 합성 영상과 목표 차량 번호의 쌍으로 이루어진 학습 데이터를 이용하여 번호판 인식 모델을 학습한 후 학습된 번호판 인식 모델을 통해 차량 영상에서 번호판 영역의 챠량 번호를 인식할 수 있다. 이러한 프로세서(130)가 수행하는 각 단계에 대해서는 도 2을 참조하여 후술하기로 한다.The processor 130 controls the entire process for executing the license plate recognition method, and converts the license plate image to a license plate composite image having a single font, and uses the learning data consisting of a pair of the license plate composite image and the target vehicle number. After learning the license plate recognition model, it is possible to recognize the vehicle number in the license plate area from the vehicle image through the learned license plate recognition model. Each step performed by the processor 130 will be described later with reference to FIG. 2 .

여기서, 프로세서(130)는 데이터를 처리할 수 있는 모든 종류의 장치를 포함할 수 있다. 여기서, '프로세서(processor)'는, 예를 들어 프로그램 내에 포함된 코드 또는 명령으로 표현된 기능을 수행하기 위해 물리적으로 구조화된 회로를 갖는, 하드웨어에 내장된 데이터 처리 장치를 의미할 수 있다. 이와 같이 하드웨어에 내장된 데이터 처리 장치의 일 예로써, 마이크로프로세서(microprocessor), 중앙처리장치(central processing unit: CPU), 프로세서 코어(processor core), 멀티프로세서(multiprocessor), ASIC(application-specific integrated circuit), FPGA(field programmable gate array) 등의 처리 장치를 망라할 수 있으나, 본 발명의 범위가 이에 한정되는 것은 아니다.Here, the processor 130 may include all kinds of devices capable of processing data. Here, the 'processor' may refer to a data processing device embedded in hardware, for example, having a physically structured circuit to perform a function expressed as a code or an instruction included in a program. As an example of the data processing device embedded in the hardware as described above, a microprocessor, a central processing unit (CPU), a processor core, a multiprocessor, an application-specific integrated (ASIC) circuit) and a processing device such as a field programmable gate array (FPGA), but the scope of the present invention is not limited thereto.

데이터베이스(140)는 번호판 인식 방법을 위한 프로그램을 실행하면서 누적되는 데이터가 저장된다. 예컨대, 데이터베이스(140)에는 차량 영상, 번호판 합성 영상, 학습 데이터, 새로 인식된 차량 번호 정보 등이 저장될 수 있다.The database 140 stores data accumulated while executing a program for a license plate recognition method. For example, the database 140 may store a vehicle image, a license plate composite image, learning data, and newly recognized vehicle number information.

도 2는 본 발명의 일 실시예에 따른 신경망을 이용한 번호판 인식 방법을 설명하는 순서도이고, 도 3은 본 발명의 일 실시예에 따른 번호판 인식 모델의 학습 과정을 설명하는 도면이며, 도 4는 본 발명의 일 실시예에 따른 번호판 인식 모델에 의한 텍스트 인식 과정을 설명하는 예시도이다.Figure 2 is a flowchart illustrating a license plate recognition method using a neural network according to an embodiment of the present invention, Figure 3 is a view for explaining a learning process of a license plate recognition model according to an embodiment of the present invention, Figure 4 is this It is an exemplary diagram illustrating a text recognition process by a license plate recognition model according to an embodiment of the present invention.

도 2 내지 도 4를 참조하면, 신경망을 이용한 번호판 인식 방법은, 프로세서(130)에서 샘플 차량 영상이 입력되면 차량 영상에서 번호판 영역이 포함된 번호판 영상을 검출한다(S1). 프로세서(130)는 번호판 영상을 이미지 변환 알고리즘을 이용하여 기설정된 폰트로 합성하여 번호판 합성 영상을 생성한다(S2). 이때, 이미지 변환 알고리즘은 이미지 투 이미지 변환(Image to Image Translation, Pix2Pix) 알고리즘을 사용한다. 2 to 4 , the license plate recognition method using a neural network detects a license plate image including a license plate area from the vehicle image when a sample vehicle image is input from the processor 130 (S1). The processor 130 generates a license plate composite image by synthesizing the license plate image into a preset font using an image conversion algorithm (S2). In this case, the image conversion algorithm uses an Image to Image Translation (Pix2Pix) algorithm.

도 3에 도시된 바와 같이, 프로세서(130)는 샘플 차량 영상을 이미지 변환 알고리즘을 통해 기설정된 폰트로 변환하여 번호판 합성 영상을 생성하고, 어텐션 알고리즘(attention algorithm) 기반의 양방향 LSTM(Bidirectional Long Short-Term Memory, BLSTM )을 적용하여 글자와 숫자의 위치를 결정하여 번호판을 인식한다. 3, the processor 130 converts the sample vehicle image into a preset font through an image conversion algorithm to generate a license plate composite image, and an attention algorithm-based bidirectional Long Short- Term Memory, BLSTM ) is applied to determine the position of letters and numbers to recognize the license plate.

이러한 어텐션 알고리즘 기반의 BLSTM은 문장의 분류나 번역에 단어의 위치를 결정하는데 유용하게 사용되고 있는 것으로서, 번호판 인식 모델(200)에서는 글자와 숫자의 위치를 확실하게 파악하기 위해 시계열 데이터의 현재 입력의 다음 방향만을 향하는 기존의 LSTM에서 입력의 이전 방향까지 확인할 수 있도록 더욱 개선된 양방향 LSTM(Bidirectional Long Short-Term Memory, BLSTM)을 사용한다. This attention algorithm-based BLSTM is usefully used to determine the position of a word in the classification or translation of a sentence, and in the license plate recognition model 200, A more improved bidirectional long short-term memory (BLSTM) is used to check the previous direction of the input from the existing LSTM that only faces one direction.

번호판 합성 영상은 도 4에 도시된 바와 같이, 결정된 글자 또는 숫자의 위치마다 구분자로 표시하고, 적어도 하나 이상의 글자와 숫자가 조합된 차량 번호가 완성되면 종료 구분자로 표시한다. 구분자 또는 종료 구분자는 공백 문자(SP(Space), EOS(End Of Space)), '*, #, ;, .' 와 같은 특수 문자나 특수 기호 중 어느 하나 이상을 사용할 수 있고, 특수 문자나 특수 기호의 색상을 다르게 설정하여 사용할 수도 있다. As shown in FIG. 4, the license plate composite image is displayed as a separator for each position of the determined letter or number, and when the vehicle number in which at least one or more letters and numbers are combined is completed, it is displayed as an end separator. The delimiter or end delimiter is a space character (SP (Space), EOS (End Of Space)), '*, #, ;, .' Any one or more of special characters or special symbols such as .

번호판 인식 모델(200)은 번호판 합성 영상에서 특징을 추출하는 CNN 모델과, CNN 모델에서 추출한 특징을 순차적으로 통합하여 사용하는 LSTM 모델을 사용하는 CNN-LSTM(convolutional neural network- Long Short-Term Memory) 복합 모델이 될 수 있다. 번호판 인식 모델(200)은 CNN-LSTM 복합 모델 뿐만 아니라 R-CNN(Region-Convolutional Neural Network) 방식의 YOLO 알고리즘을 사용하여 무인기 환경에서도 번호판 검출이 가능하도록 할 수 있다. 즉, 다양한 각도로 진입하는 샘플 차량 영상이 입력되면 YOLO 알고리즘으로 학습하여 번호판 검출을 위한 신경망을 생성할 수 있다. The license plate recognition model 200 is CNN-LSTM (convolutional neural network- Long Short-Term Memory) using a CNN model for extracting features from a license plate composite image, and an LSTM model that sequentially integrates and uses features extracted from the CNN model. It can be a composite model. The license plate recognition model 200 may enable license plate detection even in an unmanned aerial vehicle environment using the YOLO algorithm of the R-CNN (Region-Convolutional Neural Network) method as well as the CNN-LSTM complex model. That is, when a sample vehicle image entering from various angles is input, it is possible to generate a neural network for license plate detection by learning with the YOLO algorithm.

프로세서(130)는 번호판 합성 영상과 목표 차량 번호를 쌍으로 하는 학습 데이터를 생성하고(S3), 생성된 학습 데이터를 이용하여 번호판 인식 모델을 학습한다(S4). 이때, 목표 차량 번호는 한글, 숫자, 특수 기호, 영어를 포함한 외국어 등이 다양할 수 있다. The processor 130 generates training data that pairs the license plate composite image and the target vehicle number (S3), and learns the license plate recognition model using the generated training data (S4). In this case, the target vehicle number may be various in Korean, numbers, special symbols, foreign languages including English, and the like.

프로세서(130)는 차량 영상이 입력된 학습된 번호판 인식 모델에 입력하여 차량 영상에 해당하는 차량 번호를 인식하고, 인식된 결과를 사용자 단말에 제공한다(S5, S6). 이때, 프로세서(130)는 새롭게 인식된 차량 번호와 차량 영상을 학습데이터로 활용하여 번호판 인식 모델을 학습시킬 수 있다.The processor 130 recognizes the vehicle number corresponding to the vehicle image by inputting the vehicle image into the learned license plate recognition model input, and provides the recognized result to the user terminal (S5, S6). In this case, the processor 130 may learn the license plate recognition model by using the newly recognized vehicle number and vehicle image as learning data.

도 4에 도시된 바와 같이, 번호판 영상이 'YZ1547'인 경우에, 번호판의 문자는 나라 또는 지역마다 폰트가 다르며, 간혹 손글씨로 기재하는 경우도 있다. 따라서, 프로세서(130)는 Pix2Pix 알고리즘을 이용하여 단일 폰트, 단일 배경색, 단일 문자 색상으로 번호판 영상을 합성하여 번호판 합성 영상으로 번호판 인식 모델(200)에 제공한다. 이때 번호판 합성 영상은 각 글자와 글자, 글자와 숫자, 숫자와 숫자 사이에 구분자와 종료 구분자로 표시되며, 어텐션 알고리즘 기반의 BLSTM을 적용하여 글자 또는 숫자의 위치를 결정한 후 차량 번호를 인식하여 인식 결과를 제공할 수 있다.As shown in Figure 4, when the license plate image is 'YZ1547', the characters of the license plate have different fonts for each country or region, and sometimes they are written in handwriting. Accordingly, the processor 130 uses the Pix2Pix algorithm to synthesize a license plate image with a single font, a single background color, and a single character color, and provides it to the license plate recognition model 200 as a license plate composite image. At this time, the license plate composite image is displayed as a separator and an end separator between each letter and letter, letter and number, and number and number. can provide

이와 같이, 번호판 인식 모델은 문자 인식이 아니라 문자 이미지를 변환하는 신경망으로 구성됨으로써 문자 인식을 위한 신경망에 비해 학습이 수월해진다. 즉, 번호판 인식 모델(200)이 번호판 영상을 변환하여 단일 폰트로 합성된 번호판 합성 영상을 이용하여 학습되기 때문에 기존에 다양한 폰트로 학습된 번호판 인식 모델에 비해 학습이 상대적으로 쉽고, 필요한 학습 데이터가 적을 뿐만 아니라 새로은 폰트를 추가하여 학습할 필요가 없다. As such, the license plate recognition model is composed of a neural network that converts a character image rather than character recognition, thereby making learning easier compared to a neural network for character recognition. That is, since the license plate recognition model 200 is learned using the license plate composite image synthesized into a single font by converting the license plate image, learning is relatively easy compared to the license plate recognition model learned with various fonts in the past, and the necessary learning data is Not only is it small, but there is no need to learn by adding new fonts.

이상에서 설명한 본 발명의 실시예에 따른 신경망을 이용한 번호판 인식 방법은, 컴퓨터에 의해 실행되는 프로그램 모듈과 같은 컴퓨터에 의해 실행 가능한 명령어를 포함하는 기록 매체의 형태로도 구현될 수 있다. 이러한 기록 매체는 컴퓨터 판독 가능 매체를 포함하며, 컴퓨터 판독 가능 매체는 컴퓨터에 의해 액세스될 수 있는 임의의 가용 매체일 수 있고, 휘발성 및 비휘발성 매체, 분리형 및 비분리형 매체를 모두 포함한다. 또한, 컴퓨터 판독가능 매체는 컴퓨터 저장 매체를 포함하며, 컴퓨터 저장 매체는 컴퓨터 판독가능 명령어, 데이터 구조, 프로그램 모듈 또는 기타 데이터와 같은 정보의 저장을 위한 임의의 방법 또는 기술로 구현된 휘발성 및 비휘발성, 분리형 및 비분리형 매체를 모두 포함한다.The license plate recognition method using a neural network according to an embodiment of the present invention described above may be implemented in the form of a recording medium including instructions executable by a computer, such as a program module executed by a computer. Such recording media includes computer-readable media, and computer-readable media can be any available media that can be accessed by a computer, and includes both volatile and nonvolatile media, removable and non-removable media. Computer readable media also includes computer storage media, which include volatile and nonvolatile embodied in any method or technology for storage of information, such as computer readable instructions, data structures, program modules, or other data. , both removable and non-removable media.

전술한 본 발명의 설명은 예시를 위한 것이며, 본 발명이 속하는 기술분야의 통상의 지식을 가진 자는 본 발명의 기술적 사상이나 필수적인 특징을 변경하지 않고서 다른 구체적인 형태로 쉽게 변형이 가능하다는 것을 이해할 수 있을 것이다. 그러므로 이상에서 기술한 실시예들은 모든 면에서 예시적인 것이며 한정적이 아닌 것으로 이해해야만 한다. 예를 들어, 단일형으로 설명되어 있는 각 구성 요소는 분산되어 실시될 수도 있으며, 마찬가지로 분산된 것으로 설명되어 있는 구성 요소들도 결합된 형태로 실시될 수 있다.The above description of the present invention is for illustration, and those of ordinary skill in the art to which the present invention pertains can understand that it can be easily modified into other specific forms without changing the technical spirit or essential features of the present invention. will be. Therefore, it should be understood that the embodiments described above are illustrative in all respects and not restrictive. For example, each component described as a single type may be implemented in a dispersed form, and likewise components described as distributed may be implemented in a combined form.

본 발명의 범위는 상기 상세한 설명보다는 후술하는 특허청구범위에 의하여 나타내어지며, 특허청구범위의 의미 및 범위 그리고 그 균등 개념으로부터 도출되는 모든 변경 또는 변형된 형태가 본 발명의 범위에 포함되는 것으로 해석되어야 한다.The scope of the present invention is indicated by the following claims rather than the above detailed description, and all changes or modifications derived from the meaning and scope of the claims and their equivalent concepts should be interpreted as being included in the scope of the present invention. do.

100: 차량 번호판 인식 시스템
110: 통신 모듈 120: 메모리
130: 프로세서 140: 데이터베이스100: license plate recognition system
110: communication module 120: memory
130: processor 140: database

Claims

In the license plate recognition method using a neural network performed by the vehicle license plate recognition system,
a) when a vehicle image is obtained, detecting a license plate image including a license plate area from the vehicle image;
b) generating a license plate composite image by synthesizing the detected license plate image into a preset font using an image conversion algorithm;
c) learning the license plate recognition model using the learning data pairing the license plate composite image and the target vehicle number; and
d) When a vehicle image consisting of at least one or more letters and numbers is input, the license plate composite image generated through the image conversion algorithm is input to the learned license plate recognition model to recognize the vehicle number and then provide the recognized vehicle number comprising the steps of
The image conversion algorithm uses an Image to Image Translation (Pix2Pix) algorithm to synthesize a license plate image with a single font, a single background color, and a single character color, and provides a neural network to the license plate recognition model as a license plate composite image. How to recognize license plates.

delete

The method of claim 1,
Step b) is,
The license plate image is converted to a font having a preset thickness and size through the image conversion algorithm, and Bidirectional Long Short-Term Memory (BLSTM) based on an attention algorithm is applied to position letters or numbers Which further comprises the step of generating a license plate composite image by determining the, license plate recognition method using a neural network.

5. The method of claim 4,
Step b) is,
In the license plate image, a background area and a text area including numbers or letters are distinguished and detected, the background area is converted into a preset color, and the text area is converted to have a color different from the background area, a neural network license plate recognition method using

5. The method of claim 4,
A method for recognizing a license plate using a neural network, wherein each position of the determined letter or number is displayed as a separator, and when the vehicle number in which at least one or more letters or numbers are combined is completed, the vehicle number is displayed as an end separator.

7. The method of claim 6,
The delimiter or the end delimiter is to use any one or more of a special character or a special symbol, a license plate recognition method using a neural network.

The method of claim 1,
The license plate recognition model is,
Extracting features from the license plate composite image, and using a CNN-LSTM (convolutional neural network-Long Short-Term Memory) complex model that sequentially integrates and uses the extracted features, a license plate recognition method using a neural network.

A computer-readable recording medium in which a program for performing a license plate recognition method using a neural network according to any one of claims 1 to 8 is recorded.