CN102541347A

CN102541347A - Automatic identification system and method of handwriting Chinese character

Info

Publication number: CN102541347A
Application number: CN2011103604419A
Authority: CN
Inventors: 薛雷; 何金胜; 袁苑; 何星剑; 张丽
Original assignee: University of Shanghai for Science and Technology
Current assignee: University of Shanghai for Science and Technology
Priority date: 2011-11-15
Filing date: 2011-11-15
Publication date: 2012-07-04

Abstract

The invention relates to a system and method for automatic recognition of handwriting. The system includes the EMB-4650 main board, and its extended sub-interfaces include LVDS interface, DVI interface, USB interface and COM interface; these interfaces are respectively connected with an AUO display screen, a DVI-LVDS signal conversion board, a USB controller and a GSM module The DVI-LVDS signal conversion board is connected with another AUO display, and the USB controller is connected with the touch screen. The operation steps of the recognition method are (1) input of handwriting data, (2) preprocessing of handwriting data, (3) shape feature extraction of handwriting data, and (4) recognition of handwriting data. The invention adopts a new method of polar coordinate feature recognition with font centroid as the reference point. Users can use handwriting to record, save, and send important information in real time; to achieve matching from handwriting to handwriting. The system integrates high confidentiality and high-speed transmission, and is suitable for intelligence personnel to collect and screen important information.

Description

System and method for automatic handwriting recognition

技术领域 technical field

本发明涉及一种利用计算机自动提取笔记特征的方法，开发出一种手写笔体自动识别系统和方法，利用图像分割算法和基于统计特征的判别算法来查找和提取笔迹，此笔迹鉴别方法可广泛应用于金融、法律等领域。 The invention relates to a method for automatically extracting note features by using a computer, and develops a handwriting automatic recognition system and method, which uses an image segmentation algorithm and a discriminant algorithm based on statistical features to search and extract handwriting. This handwriting identification method can be widely used. Applied in finance, law and other fields.

背景技术 Background technique

笔迹是一种相当稳定的行为特征，不同的人有不同的笔迹，且手写笔迹易于获取。随着信息数字化的飞速发展，手写体笔迹鉴别成为计算机视觉和模式识别领域中活跃的研究课题。众所周知，文献专家可以鉴别出笔迹的真伪，但计算机自动提取笔迹特征，并鉴别其真伪，特别是对少量的笔迹仍然有一定的难度。 Handwriting is a fairly stable behavioral characteristic, different people have different handwriting, and handwriting is easy to obtain. With the rapid development of information digitization, handwriting recognition has become an active research topic in the fields of computer vision and pattern recognition. As we all know, literature experts can identify the authenticity of handwriting, but it is still difficult for a computer to automatically extract handwriting features and identify its authenticity, especially for a small amount of handwriting.

计算机笔迹鉴别的研究最早始于上世纪 60 年代。近年来，由于计算机笔迹鉴别技术在公安、金融、教育测评等领域得到广泛的应用，使它正逐步成为计算机视觉和模式识别领域中一个十分活跃但难度较高的研究课题，中外学者对此进行了广泛的研究，但至今未能取得令人满意的解决方案。 The research on computer handwriting identification first began in the 1960s. In recent years, due to the wide application of computer handwriting identification technology in public security, finance, education evaluation and other fields, it is gradually becoming a very active but difficult research topic in the field of computer vision and pattern recognition. Extensive research has been carried out, but no satisfactory solution has been achieved so far.

由于发达国家的信息通讯技术起步早、发展快，其普及和使用程度已达到较高的水平，而且他们的计算机管理与自动化技术亦有相当的普及率和较高的现代化程度，因而计算机身份认证技术已逐步受到广泛的注意和研究。目前在国外市场上已有几种签名鉴别的有关产品出现，但是由于汉字构造的特殊性，针对汉字笔迹鉴别的研究工作进展缓慢。又由于文化背景等的差异，西方国家对于东方文字鉴别问题研究较少，因此相关问题的研究机构大多分布在亚洲，目前主要研究机构包括韩国釜山大学，中国科学院自动化所模式识别实验室，中国科学院计算所。此外，目前国内有很多的公司都在致力于笔迹鉴别系统方面的开发，并取得了一些成果，但真正投入使用的到现在还没有。 Due to the early start and rapid development of information and communication technology in developed countries, its popularity and use have reached a relatively high level, and their computer management and automation technology also has a considerable penetration rate and a high degree of modernization, so computer identity authentication Technology has gradually received extensive attention and research. At present, there have been several related products for signature identification in foreign markets, but due to the particularity of Chinese character structure, research on Chinese handwriting identification has progressed slowly. Due to differences in cultural background, etc., Western countries have less research on the identification of oriental characters. Therefore, most of the research institutions on related issues are located in Asia. At present, the main research institutions include Pusan University in South Korea, the Pattern Recognition Laboratory of the Institute of Automation of the Chinese Academy of Sciences, and the Chinese Academy of Sciences. Computing office. In addition, many companies in China are currently devoting themselves to the development of handwriting identification systems and have achieved some results, but none have been put into use.

发明内容 Contents of the invention

本发明的目的在于针对已有技术存在的缺陷，提供一种手写笔体自动识别系统和方法，以日常生活工作中的日记本为设计雏形，结合个人双页笔记书写习惯，利用计算机技术和创新的查找算法，针对个人手写笔体实现信息查找。系统从汉字本身的诸多结构特点出发，综合分析了手写体的特点，舍弃了庞大的标准字库，直接由“笔迹到笔迹”检索的便捷，既注意从个性中提取出共性，又强调对于个性的把握。 The purpose of the present invention is to provide a system and method for automatic recognition of handwriting in view of the defects in the prior art. It takes the diary in daily life as the prototype of the design, combines personal double-page note writing habits, and uses computer technology and innovation Search algorithm for personal handwriting to achieve information search. Starting from the many structural characteristics of Chinese characters, the system comprehensively analyzes the characteristics of handwriting, abandons the huge standard font library, and directly searches from "handwriting to handwriting" for convenience. It not only pays attention to extracting commonness from individuality, but also emphasizes the grasp of individuality .

为了实现上述目的，本发明的构思如下：本发明是基于图像的手写字体识别，实现查找功能的。此技术属于脱机字符识别范畴，系统查找功能只需在使用者所写的库中找到与其相同或相近的字符即可，并不需要判定使用者所写的字符具体对应于汉字中的哪个字。本系统所要处理的对象是二维图像，将所写汉字与背景分离后，通过一系列的预处理，查找出与所写汉字相同或者是相近字符的图片文件所在的位置同时将其显示出来即可。整个查找过程可分为以下几个部分：信息获取（手写字符输入）、预处理（包括去除背景、字符分割等）、单个字符特征提取、比较选择。 In order to achieve the above object, the idea of the present invention is as follows: the present invention is based on image-based handwriting recognition to realize the search function. This technology belongs to the category of offline character recognition. The system search function only needs to find the same or similar characters in the library written by the user, and does not need to determine which character in the Chinese characters the character written by the user corresponds to. . The object to be processed by this system is a two-dimensional image. After separating the written Chinese characters from the background, through a series of preprocessing, find out the location of the picture file with the same or similar characters as the written Chinese characters and display them at the same time. Can. The entire search process can be divided into the following parts: information acquisition (handwritten character input), preprocessing (including background removal, character segmentation, etc.), single character feature extraction, comparison and selection.

根据上述发明构思，本发明采用下述技术方案： According to above-mentioned inventive concept, the present invention adopts following technical scheme:

一种手写笔体自动识别系统，包括一块由Intel® Atom Z510P芯片主控制作的一块EMB-4650主板，其特征在于所述主板扩展的子接口有LVDS接口、DVI接口、USB接口和COM接口；这些接口分别与AUO显示屏、DVI-LVDS信号转换板、USB控制器、GSM模块相连接；由此DVI-LVDS信号转换板与另外AUO显示屏相连接，USB控制器与触摸屏相连接。当用户在触摸屏上写入数据时，这些数据通过USB控制器传送到EMB-4650主板，同时EMB-4650主板把数据传送到显示屏，实现实时显示；同时EMB-4650主板对这些数据进行识别算法处理，然后把处理后的结果送到显示屏显示，达到简易的人机互动实时操作效果。如果有需要，可以把编辑好的字体数据通过COM口发送到GSM模块，实现数据的实时发送。 A stylus automatic recognition system comprises an EMB-4650 main board made by Intel® Atom Z510P chip master control, it is characterized in that the sub-interfaces of said main board expansion have LVDS interface, DVI interface, USB interface and COM interface; These interfaces are respectively connected with AUO display, DVI-LVDS signal conversion board, USB controller, and GSM module; thus the DVI-LVDS signal conversion board is connected with another AUO display, and the USB controller is connected with the touch screen. When the user writes data on the touch screen, the data is transmitted to the EMB-4650 main board through the USB controller, and the EMB-4650 main board transmits the data to the display screen to realize real-time display; at the same time, the EMB-4650 main board performs a recognition algorithm on these data processing, and then send the processed results to the display screen to achieve the effect of simple human-computer interaction and real-time operation. If necessary, the edited font data can be sent to the GSM module through the COM port to realize real-time data transmission.

由用户在触摸屏上书写字体，字体数据输入到存储设备，经过处理器预处理，然后使用本文申请保护的极坐标方法提取单个字符的形状特征后，选择确定所输入的字符。同时，用户的输入和处理之后的字符结果在液晶显示屏上进行显示。如果有需要，可以利用GSM模块对输入信息进行无线传输，使之更便捷及时的传输信息。 The user writes the font on the touch screen, the font data is input to the storage device, and after preprocessing by the processor, the shape feature of a single character is extracted by using the polar coordinate method for protection in this paper, and the input character is selected and determined. At the same time, the user's input and processed character results are displayed on the liquid crystal display. If necessary, the GSM module can be used to wirelessly transmit the input information, making it more convenient and timely to transmit information.

上述系统中，系统的电源采用外接电源的方式供电。通过一个电源滤波器和一个变压器从市电220V50Hz获得所述18V交流电源，将其接入所述电压变换电路，所述电压变换电路将18V交流电转成12V、5V、3.3V和1.8V电源，为系统各个模块提供工作电源。 In the above system, the power supply of the system is powered by an external power supply. Obtain the 18V AC power from the mains 220V50Hz through a power filter and a transformer, connect it to the voltage conversion circuit, and the voltage conversion circuit converts the 18V AC power into 12V, 5V, 3.3V and 1.8V power supplies, Provide working power for each module of the system.

上述系统中，触摸屏则是通过转换板连接到主板的USB接口。该转换板完成将输入的信号转换成USB信号与系统主机相连接，并通过相应的软件来控制触摸屏的性能。 In the above system, the touch screen is connected to the USB interface of the main board through the conversion board. The conversion board converts the input signal into a USB signal and connects with the system host, and controls the performance of the touch screen through the corresponding software.

上述系统中，B089AW01 V1是一款由TFT LCD平板、一套驱动电路和LED背光系统组成的彩色点阵液晶显示屏。屏幕显示支持WSVGA (1024(H) x 600(V))格式和配备了LED背光驱动回路的262k色数据驱动。所有输入信号都是兼容LVDS接口的。这两块显示屏其中一块连接到主板的LVDS接口，另一块通过DVI-LVDS的转换板连接到主板的DVI接口。DVI-LVDS完成了将输入的DVI信号转换成适合液晶显示屏的LVDS信号。 Among the above systems, B089AW01 V1 is a color dot-matrix liquid crystal display composed of a TFT LCD panel, a set of drive circuits and an LED backlight system. The screen display supports WSVGA (1024(H) x 600(V)) format and is equipped with 262k color data drive with LED backlight drive circuit. All input signals are compatible with LVDS interface. One of the two display screens is connected to the LVDS interface of the motherboard, and the other is connected to the DVI interface of the motherboard through a DVI-LVDS conversion board. DVI-LVDS completes the conversion of the input DVI signal into an LVDS signal suitable for the LCD screen.

上述系统中，两个GSM模块是泽汉T-1008系列的GSM/GPRS无线收发器。模块主要由Modem主体部分（包括芯片、外围电路等），稳压电源，以及天线三个部分组成。GPRS模通过芯片COM口连接到主板。 In the above system, the two GSM modules are Zehan T-1008 series GSM/GPRS wireless transceivers. The module is mainly composed of the main part of Modem (including chip, peripheral circuit, etc.), regulated power supply, and antenna. The GPRS module is connected to the motherboard through the COM port of the chip.

一种手写笔体自动识别方法，采用上述系统进行识别，其特征在于具体操作步骤如下：步骤1：笔体数据的输入；步骤1：笔体数据的预处理；步骤3：笔体数据的形状特征提取；步骤4：笔体数据识别。 A method for automatic handwriting recognition, using the above-mentioned system for recognition, characterized in that the specific steps are as follows: Step 1: input of handwriting data; Step 1: preprocessing of handwriting data; Step 3: shape of handwriting data Feature extraction; step 4: handwriting data recognition.

本发明与已有技术相比较，具有如下显而易见的突出实质性特点和显著优点： Compared with the prior art, the present invention has the following obvious outstanding substantive features and significant advantages:

本手写笔体自动识别方法在于识别方法在于笔迹查找，建立以笔迹字体质心为基准点的极坐标特征识别新方法，识别准确率极高；并且方便查找书写信息，免去繁杂纸质查找过程，实现笔记图片形式实时保存，保存时间长久。 The handwriting automatic recognition method is that the recognition method is the handwriting search, and a new method of polar coordinate feature recognition with the handwriting font centroid as the reference point is established, and the recognition accuracy is extremely high; and it is convenient to find writing information, eliminating the complicated paper search process, Real-time storage of notes in the form of pictures is realized, and the storage time is long.

本发明区别于之前的单屏显示，采用双屏显示方式，在软件进行相关设置，可以设置为双屏独立显示模式和联合显示模式，双屏独立显示模式时主显示器可以设置为数字显示器，次显示器设置为MID，同时还可以进行屏幕旋转的设置。这种可变换的显示模式能满足各种场合的要求，既不损失视觉效果又携带方便。 The present invention is different from the previous single-screen display, and adopts a dual-screen display mode, and can be set to a dual-screen independent display mode and a combined display mode in the software. In the dual-screen independent display mode, the main display can be set to a digital display, and the secondary The display is set to MID, and the screen rotation can also be set. This convertible display mode can meet the requirements of various occasions, and it is easy to carry without losing the visual effect.

本发明的字体形状特征提取算法先进，与传统的X-Y轴坐标识别方法不同，创新的提出了以字体的质心为圆心的极坐标识别方法，解决了对于左右结构或者左中右结构的汉字，分割算法阈值的设置与个人的书写习惯是一对比较难以解决的矛盾，提高字体的识别率。 The font shape feature extraction algorithm of the present invention is advanced, and is different from the traditional X-Y axis coordinate recognition method. It innovatively proposes a polar coordinate recognition method with the center of mass of the font as the center of the circle, which solves the problem of segmentation for Chinese characters with left-right structure or left-middle-right structure. The setting of the algorithm threshold and personal writing habits are a pair of contradictions that are difficult to solve, so as to improve the recognition rate of fonts.

附图说明 Description of drawings

图1是本发明手写笔体自动识别系统的结构框图。 Fig. 1 is a structural block diagram of the automatic handwriting recognition system of the present invention.

图2是本发明手写笔体自动识别流程图。 Fig. 2 is a flow chart of automatic recognition of stylus in the present invention.

图3是图2 中的笔体数据预处理流程图。 Fig. 3 is a flowchart of the preprocessing of the handwriting data in Fig. 2 .

图4是图3中的图像二值化流程图。 FIG. 4 is a flow chart of image binarization in FIG. 3 .

图5是图1中的字体识别流程图。 FIG. 5 is a flowchart of font recognition in FIG. 1 .

具体实施方式：Detailed ways:

本发明的优选实施案例结合附图详述如下： Preferred implementation cases of the present invention are described in detail as follows in conjunction with accompanying drawings:

实施例一：参见图1，本手写笔体自动识别系统包括一块由Intel® Atom Z510P芯片主控制作的一块EMB-4650主板，其扩展的子接口有LVDS接口、DVI接口、USB接口和COM接口；这些接口分别与AUO显示屏、DVI-LVDS信号转换板、USB控制器、GSM模块相连接；由此DVI-LVDS信号转换板与另外AUO显示屏相连接，USB控制器与触摸屏相连接。当用户在触摸屏上写入数据时，这些数据通过USB控制器传送到EMB-4650主板，同时EMB-4650主板把数据传送到显示屏，实现实时显示；同时EMB-4650主板对这些数据进行识别算法处理，然后把处理后的结果送到显示屏显示，达到简易的人机互动实时操作效果。如果有需要，可以把编辑好的字体数据通过COM口发送到GSM模块，实现数据的实时发送。下面结合本发明的优选实施例及附图对本发明做进一步详细说明。 Embodiment one: referring to Fig. 1, this stylus automatic recognition system comprises an EMB-4650 main board that is made by Intel® Atom Z510P chip master control, and the sub-interface of its expansion has LVDS interface, DVI interface, USB interface and COM interface ; These interfaces are respectively connected with the AUO display screen, DVI-LVDS signal conversion board, USB controller, and GSM module; thus the DVI-LVDS signal conversion board is connected with another AUO display screen, and the USB controller is connected with the touch screen. When the user writes data on the touch screen, the data is transmitted to the EMB-4650 main board through the USB controller, and the EMB-4650 main board transmits the data to the display screen to realize real-time display; at the same time, the EMB-4650 main board performs a recognition algorithm on these data processing, and then send the processed results to the display screen to achieve the effect of simple human-computer interaction and real-time operation. If necessary, the edited font data can be sent to the GSM module through the COM port to realize real-time data transmission. The present invention will be described in further detail below in conjunction with preferred embodiments of the present invention and accompanying drawings.

实施例二：参见图2，自动手写笔体自动识别系统实现方法采用上述系统进行识别，具体操作包括以下步骤： Embodiment two: referring to Fig. 2, the realization method of the automatic handwriting recognition system adopts the above-mentioned system to identify, and the specific operations include the following steps:

步骤1：笔体数据的输入：按着触摸屏工作原理的不同分为表面声波式触摸屏、电阻式触摸屏、电容式触摸屏、红外线式触摸屏等几种。本发明所用的触摸屏为电阻式触摸屏。电阻式触摸屏安装在显示器表面，主要有两层透明导电层组成，两导电层之间绝缘。从电气特性上看，每一导电层相当于一个电阻元件，在导电层两端加电压，就可以在平面位置与压降之间建立起对应关系。当被触摸时，两导电层在触点处接触，借助于触点，把反映平面位置的电压量引出，测量后获得触点位置。触摸屏把把这些触点数据以图像数据的形式储存起来，供主机读取。 Step 1: Input of handwriting data: According to the different working principles of the touch screen, it can be divided into surface acoustic wave touch screen, resistive touch screen, capacitive touch screen, infrared touch screen and so on. The touch screen used in the present invention is a resistive touch screen. The resistive touch screen is installed on the surface of the display, mainly composed of two transparent conductive layers, and the two conductive layers are insulated. From the perspective of electrical characteristics, each conductive layer is equivalent to a resistance element, and a corresponding relationship between the plane position and the voltage drop can be established by applying a voltage across the conductive layer. When touched, the two conductive layers are in contact at the contact, and the voltage reflecting the plane position is drawn out by means of the contact, and the contact position is obtained after measurement. The touch screen stores these contact data in the form of image data for the host to read.

步骤2：笔体数据的预处理，参见图3。本发明数据预处理流程图如图4所示。主板（1）从触摸屏（7）读入已存的图像数据，输入到主板（1）寄存器中。 Step 2: Preprocessing of stroke data, see Figure 3. The data preprocessing flow chart of the present invention is shown in FIG. 4 . The main board (1) reads the stored image data from the touch screen (7), and inputs it into the register of the main board (1).

步骤2-1：256色位图转灰度图。 Step 2-1: Convert 256-color bitmap to grayscale.

a) 计算灰度映射表 a) Calculate the grayscale mapping table

由亮度信号Y与三基色R、G、B之间的关系式Y=0.299R+0.587G+0.114B可得 bMap[i] = (BYTE)(0.299 * lpbmi->bmiColors[i].rgbRed From the relationship between the brightness signal Y and the three primary colors R, G, B, Y=0.299R+0.587G+0.114B can be obtained bMap[i] = (BYTE)(0.299 * lpbmi->bmiColors[i].rgbRed

+0.587 * lpbmi->bmiColors[i].rgbGreen +0.587* lpbmi->bmiColors[i].rgbGreen

+0.114 * lpbmi->bmiColors[i].rgbBlue + 0.5); +0.114* lpbmi->bmiColors[i].rgbBlue + 0.5);

其中lpbmi为指向BITMAPINFO结构的指针 Where lpbmi is a pointer to the BITMAPINFO structure

b) 更新DIB调色板三基色分量 b) Update the three primary color components of the DIB palette

lpbmi->bmiColors[i].rgbRed = i; lpbmi->bmiColors[i].rgbRed = i;

lpbmi->bmiColors[i].rgbGreen = i; lpbmi->bmiColors[i].rgbGreen = i;

lpbmi->bmiColors[i].rgbBlue = i; lpbmi->bmiColors[i].rgbBlue = i;

c) 更换每个象素的颜色索引（即按照灰度映射表换成灰度值） c) Replace the color index of each pixel (that is, replace it with a grayscale value according to the grayscale mapping table)

for(i = 0; i < lHeight; i++) for(i = 0; i < lHeight; i++)

{ for(j = 0; j < lWidth; j++) { for(j = 0; j < lWidth; j++)

{ lpSrc = (unsigned char*)lpDIBBits + lLineBytes * (lHeight - 1 - i) + j; { lpSrc = (unsigned char*)lpDIBBits + lLineBytes * (lHeight - 1 - i) + j;

*lpSrc = bMap[*lpSrc]; *lpSrc = bMap[*lpSrc];

} }

（以上程序段中lHeight和lWidth分别为整幅图像的高度和宽度） (lHeight and lWidth in the above program segment are the height and width of the entire image respectively)

步骤2-2：二值化：本发明中采用硬阈值的方式对灰度图实现二值化，图像二值化流程图程序流程图如图4所示。设定某一阈值Threshold,利用Threshold 将灰度图像的数据分成两部分:大于等于阈值的像素群和小于阈值的像素群。例如，输入灰度图像函数为f(x,y),输出二值图像函数为g(x,y),则

,通过确定阈值Threshold 来确定整幅图像的黑白分布。需要强调的是至于将特征物与背景分开这一目标我们则编写了专门的去背景函数以从根本上将彩色背景与文字分成两幅“互相独立”的位图，而非单纯地将二者分成两个区域，这是我们处理方式上有别于传统二值化思想之处。 Step 2-2: Binarization: In the present invention, the method of hard threshold is used to realize binarization on the gray scale image, and the program flow chart of image binarization is shown in Fig. 4 . Set a certain threshold Threshold, and use Threshold to divide the grayscale image data into two parts: the pixel group greater than or equal to the threshold and the pixel group smaller than the threshold. For example, the input grayscale image function is f(x,y), and the output binary image function is g(x,y), then

, by determining the threshold Threshold to determine the black and white distribution of the entire image. It should be emphasized that as for the goal of separating the features from the background, we have written a special de-background function to fundamentally divide the colored background and text into two "mutually independent" bitmaps, rather than simply separating the two Divided into two areas, this is what we do differently from the traditional binarization thinking.

步骤2-3：去离散噪声：在书写的过程中尤其是酝酿一篇好文章需要字斟句酌的时候，人们往往有不时地将笔往纸上点触的习惯，这就使得我们实时保存的信息中混入了一定量的噪声，这也就给之后的准确查找笔迹带来了附加的不利因素，所以必须首先进行去噪处理。 Step 2-3: Discrete noise removal: In the process of writing, especially when brewing a good article, people often have the habit of touching the pen to the paper from time to time, which makes the information we save in real time A certain amount of noise is mixed in, which brings additional unfavorable factors to the subsequent accurate search for handwriting, so denoising must be performed first.

基本思想是扫描整幅图像，将判定为离散点的像素点置为白色。在完成去离散噪声的操作中使用了两个函数的嵌套，一个函数是void RemoveScatterNoise(HDIB hDIB) ，该函数是外层函数，主要完成的功能是判断像素点是否是离散点，若是则将其置为白色，而在进行离散性判断时调用了DeleteScaterJudge()函数；另外一个函数是bool DeleteScaterJudge()，该函数采用了递归的思想，考察上下左右以及左上、右上、左下、右下八个方向，如果是黑色点，则调用自身进行递归，由于其本身是bool类型，所以返回值为TRUE或FALSE，在void RemoveScatterNoise(HDIB hDIB)中得以获取，若为FALSE则说明像素点为离散点，便将其置为白色，从而实现了去除离散噪声效果。 The basic idea is to scan the entire image and set the pixels judged as discrete points to white. In the operation of de-discrete noise, the nesting of two functions is used, one function is void RemoveScatterNoise(HDIB hDIB), this function is an outer layer function, the main function is to judge whether the pixel is a discrete point, if so, set it to white, and call the DeleteScaterJudge() function when making a discrete judgment; the other The function is bool DeleteScaterJudge(). This function adopts the recursive idea to examine the eight directions of up, down, left, right, upper left, upper right, lower left, and lower right. If it is a black point, it will call itself for recursion. Since it is of type bool, so The return value is TRUE or FALSE, in void It can be obtained from RemoveScatterNoise (HDIB hDIB). If it is FALSE, it means that the pixel point is a discrete point, and it is set to white, thereby realizing the effect of removing discrete noise.

步骤2-4：梯度锐化：图像锐化处理的目的是使模糊的图像变得更加清晰，图像锐化一般有两种方法：一种是微分法，另一种是高通滤波器法。本发明所采用的梯度锐化就是微分法中的一种比较常用的方法。 Step 2-4: Gradient sharpening: The purpose of image sharpening is to make the blurred image clearer. There are generally two methods for image sharpening: one is the differential method, and the other is the high-pass filter method. The gradient sharpening adopted in the present invention is a commonly used method in the differential method.

设图像为

Figure DEST_PATH_RE-686803DEST_PATH_IMAGE002

，定义

Figure DEST_PATH_RE-124738DEST_PATH_IMAGE002

在点

处的梯度矢量为： Let the image be

,definition

at point

Gradient vector at for:

，梯度有两个重要性质：梯度的方向在函数

Figure DEST_PATH_RE-955607DEST_PATH_IMAGE002

最大变化率方向上和梯度的幅度用

Figure DEST_PATH_RE-69057DEST_PATH_IMAGE004

表示，且其值为：

Figure DEST_PATH_RE-931971DEST_PATH_IMAGE006

, the gradient has two important properties: the direction of the gradient is in the function

The direction of the maximum rate of change and the magnitude of the gradient are given by

represents, and its value is:

由此式可得这样的结论：梯度的数值就是

Figure DEST_PATH_RE-156279DEST_PATH_IMAGE002

在其最大变化率方向上的单位距离所增加的量。 From this formula, it can be concluded that the value of the gradient is

The amount of increase per unit distance in the direction of its maximum rate of change.

对于离散的数字图像，上式可以改写成：

For discrete digital images, the above formula can be rewritten as:

为了计算方便，也可以采用下面的近似计算公式(1)： For the convenience of calculation, the following approximate calculation formula (1) can also be used:

Figure DEST_PATH_RE-421038DEST_PATH_IMAGE008

如果直接采用梯度值

Figure DEST_PATH_RE-439809DEST_PATH_IMAGE004

来表示图像，即令

=

Figure DEST_PATH_RE-852336DEST_PATH_IMAGE010

，则有上述公式(1)可见：在图像变换缓慢的地方其值很小（对应于图像较暗）；而在线条轮廓等变换较快的地方的值很大。图像在经过梯度运算后变得清晰从而达到锐化的目的。 If the gradient value is directly used

to represent the image, that is,

=

, then it can be seen from the above formula (1): the value is small in places where the image changes slowly (corresponding to darker images); and the value is large in places where the line contour changes quickly. The image becomes clear after the gradient operation to achieve the purpose of sharpening.

对于图像变换缓慢的地方梯度很小而导致图像较暗这一不利因素，我们采用的解决方法是给一个阈值

，如果小于该阈值

，则保持原灰度值不变；如果大于或等于阈值，则赋值为：

Figure DEST_PATH_RE-528245DEST_PATH_IMAGE012

For the unfavorable factor that the image is dark because the gradient is small in places where the image changes slowly, the solution we adopt is to give a threshold

, if less than the threshold

, keep the original gray value unchanged; if greater than or equal to the threshold , then the assignment is :

步骤2-5：细化：细化是指求一副图像中央骨骼部分过程，中央骨骼部分是描述图像几何及拓扑性质的重要特征之一，对被处理的图像进行细化有助于突出形状特点和减少冗余的信息量。 Step 2-5: Thinning: Thinning refers to the process of finding the central bone part of an image. The central bone part is one of the important features describing the geometric and topological properties of the image. Thinning the processed image helps to highlight the shape features and reduce the amount of redundant information.

本发明采用的细化算法具有的特性：一幅图像中的一个

区域，对各点标记名称P1，P2，…P8，其中P1位于中心。如果P1=1（即黑点），下面4个条件如果同时满足，则删除P1（P1=0）。①2

NZ(P1)

6；②

=1；③P2*P4*P8=0或者

1；④P2*P4*P6=0或者

1；对于图像中的每一个点重复这一步骤，直到所有的点都不可删除为止。 The thinning algorithm adopted in the present invention has the characteristics: one image in one image

region, label the points with names P1, P2, ... P8, where P1 is at the center. If P1=1 (that is, black dot), if the following four conditions are met at the same time, delete P1 (P1=0). ①2

NZ(P1)

6;②

=1; ③P2*P4*P8=0 or

1; ④P2*P4*P6=0 or

1; Repeat this step for each point in the image until all points cannot be deleted.

步骤3：笔体数据的形状特征提取：手写体汉字的形状特征包括倾斜度、重心偏移、字符伸展等，这些特征物理意义明确，反映了书写人的一部分书写风格。通常情况下，这种特征可以用矩来描述。 Step 3: Shape feature extraction of handwriting data: The shape features of handwritten Chinese characters include inclination, center of gravity offset, character stretching, etc. These features have clear physical meanings and reflect part of the writing style of the writer. Usually, this feature can be described by moments.

对于一幅

Figure DEST_PATH_RE-545059DEST_PATH_IMAGE018

的字符图像

Figure DEST_PATH_RE-342114DEST_PATH_IMAGE002

，其p+q阶矩定义为： for a picture

character image for

, its p+q order moment is defined as:

其中，

Figure DEST_PATH_RE-387168DEST_PATH_IMAGE020

，各阶矩的物理意义说明如下：

表示图像在水平方向上的伸展度；

Figure DEST_PATH_RE-232764DEST_PATH_IMAGE022

表示图像在垂直方向上的伸展度；

表示图像的倾斜度（>0表示图像向左上倾斜，

Figure DEST_PATH_RE-211402DEST_PATH_IMAGE023

<0表示图像向右上倾斜）；

Figure DEST_PATH_RE-245217DEST_PATH_IMAGE024

表示图像在水平方向上的重心偏移度（

Figure DEST_PATH_RE-956821DEST_PATH_IMAGE024

>0表示重心偏左，<0表示重心偏右）；

表示图像在垂直方向上的重心偏移度（>0表示重心偏上，<0表示重心偏下）；

Figure DEST_PATH_RE-997272DEST_PATH_IMAGE026

表示图像水平伸展的均衡程度（

Figure DEST_PATH_RE-150255DEST_PATH_IMAGE026

>0表示图像下部的水平伸展度比图像上部大，

Figure DEST_PATH_RE-460014DEST_PATH_IMAGE026

<0表示图像下部的水平伸展度比图像上部小）；

表示图像垂直伸展的均衡程度（

Figure DEST_PATH_RE-530738DEST_PATH_IMAGE027

>0表示图像右部的垂直伸展度比图像左部大，

Figure DEST_PATH_RE-521827DEST_PATH_IMAGE027

<0表示图像右部的垂直伸展度比图像左部小）。 in,

, the physical meaning of each order moment is explained as follows:

Indicates the stretch of the image in the horizontal direction;

Indicates the stretch of the image in the vertical direction;

Indicates the slope of the image ( >0 indicates that the image is tilted up to the left,

<0 means the image is tilted up to the right);

Indicates the offset of the center of gravity of the image in the horizontal direction (

>0 means the center of gravity is left, <0 means the center of gravity is to the right);

Indicates the offset of the center of gravity of the image in the vertical direction ( >0 means the center of gravity is on the upper side, <0 means the center of gravity is lower);

Indicates how evenly the image is stretched horizontally (

>0 indicates that the horizontal stretch of the lower part of the image is larger than that of the upper part of the image,

<0 means the horizontal stretch of the lower part of the image is smaller than that of the upper part of the image);

Indicates the balance of the vertical stretch of the image (

>0 indicates that the vertical stretch of the right part of the image is larger than that of the left part of the image,

<0 means the right part of the image has less vertical stretch than the left part of the image).

由上面计算出的中心矩可以导出以下的一些归一化的形状特征： From the central moments calculated above, the following normalized shape features can be derived:

a) 长宽比： a) Aspect ratio:

b) 字形倾斜度：

；其中

Figure DEST_PATH_RE-842267DEST_PATH_IMAGE030

b) Font slope:

;in

c) 拉长度：

；其中

Figure DEST_PATH_RE-337971DEST_PATH_IMAGE032

的计算式如下：

c) Pull length:

;in

The calculation formula is as follows:

d) 伸展度：

Figure DEST_PATH_RE-929489DEST_PATH_IMAGE034

；其中size表示字符点阵的大小，即 d) Stretch:

; where size represents the size of the character lattice, namely

三阶的中心矩由于其值可能为正或为负，实际上，它们都是由基函数为正和为负的两部分所组成的。 The third-order central moment may be positive or negative because of its value, in fact, they are composed of two parts whose basis function is positive and negative.

Figure DEST_PATH_RE-561459DEST_PATH_IMAGE036

通过三阶中心矩提取的归一化特征如下： The normalized features extracted by the third-order central moment are as follows:

①水平偏移度： ①Horizontal offset:

②垂直偏移度： ②Vertical offset:

③水平伸展均衡度：

Figure DEST_PATH_RE-694555DEST_PATH_IMAGE038

③Horizontal stretch balance:

④垂直伸展均衡度：

④ Vertical stretch balance:

对于对应的特征字，求出它们的特征矢量

Figure DEST_PATH_RE-89764DEST_PATH_IMAGE040

，定义的距离如下： For the corresponding feature words, find their feature vectors

, the distance is defined as follows:

；其中r为：

Figure DEST_PATH_RE-841819DEST_PATH_IMAGE042

; where r is:

步骤4：笔体数据识别：以文字质心为原点、归一化后的文字高度的一半为半径，建立极坐标系，对整个圆周进行扇形平均分块，块数n可变。最终判断两幅位图是否相似即当前所写的汉字与以往保存的笔记中出现过的字是否具有一定程度上的相似性遵循同一条准则：只有当两幅位图相对应的n块区域内每块区域像素点的总数的两两差值均小于预先设置的阈值时，才将二者视为相似。而将n定格为可变等效于将查找的精度设置为可调。显然，n=12时我们认为查找被视作模糊查找，而n=360时则认为是精确查找。图5字体数据识别流程图。在完成字体数据识别之后，计算机把识别到的字体在屏幕上显示出来。在完成字体编辑后，通过软件控制，可以把字体数据通过无线GSM网络发送到接收站，实现实时传输。 Step 4: Handwriting data recognition: take the center of mass of the text as the origin, and half of the normalized text height as the radius, establish a polar coordinate system, and divide the entire circumference into fan-shaped average blocks, and the number of blocks n is variable. Finally, judge whether the two bitmaps are similar, that is, whether the currently written Chinese characters have a certain degree of similarity with the words that appeared in the previously saved notes, and follow the same criterion: only when the two bitmaps correspond to n blocks When the pairwise differences of the total number of pixels in each area are smaller than the preset threshold, the two are considered similar. And setting n as variable is equivalent to setting the precision of the lookup as adjustable. Obviously, when n=12 we consider the lookup to be regarded as a fuzzy lookup, and when n=360, we consider it to be an exact lookup. Fig. 5 Flowchart of font data identification. After the font data recognition is completed, the computer displays the recognized fonts on the screen. After the font editing is completed, the font data can be sent to the receiving station through the wireless GSM network through software control to realize real-time transmission.

Claims

1. An automatic handwriting pen body recognition system comprises an EMB-4650 mainboard (1) which is made by controlling Intel Atom Z510P chips, and is characterized in that the expanded subinterfaces of the mainboard (1) comprise an LVDS interface, a DVI interface, a USB interface and a COM interface; the interfaces are respectively connected with an AUO display screen (2), a DVI-LVDS signal conversion board (3), a USB controller (4) and a GSM module (5); the DVI-LVDS signal conversion board (3) is connected with another AUO display screen (6), and the USB controller (4) is connected with the touch screen (7); when a user writes data in the touch screen, the data are transmitted to the EMB-4650 mainboard through the USB controller, and meanwhile, the EMB-4650 mainboard transmits the data to the display screen to realize real-time display; meanwhile, the EMB-4650 mainboard performs recognition algorithm processing on the data, and then sends the processed result to a display screen for displaying, so that a simple human-computer interaction real-time operation effect is achieved; the edited font data can be sent to the GSM module through the COM port, and real-time data sending is achieved.

2. An automatic handwriting recognition method, which is recognized by the automatic handwriting recognition system of claim 1, and is characterized by comprising the following specific operation steps:

step 1: inputting the data of the pen body: when the touch screen is touched, the touch screen stores the data in the form of image data for a host to read;

step 2: preprocessing the pen body data: the main board 1 reads in stored image data from the touch screen (7) and inputs the image data into a register of the main board (1);

2-1, converting the 256-color bitmap of the data input by the touch screen (7) into a gray scale map according to a gray scale mapping table method;

step 2-2: binarization: realizing binarization on the gray level image by adopting a hard threshold value mode;

step 2-3: and (3) removing discrete noise: denoising noise mixed by touch;

step 2-4: gradient sharpening: carrying out image sharpening by adopting a differential method:

let the image be

Definition of

At the point of

Gradient vector of (2)

Comprises the following steps:

the gradient has two important properties: direction of gradient as a function

In the direction of maximum rate of change and the magnitude of the gradient

And its value is:

the following results are obtained: the value of the gradient is

The amount by which the unit distance in the direction of its maximum rate of change increases;

for discrete digital images, the above equation is rewritten as:

for convenience of calculation, the following approximate calculation formula (1) may also be employed:

if gradient values are directly adoptedTo represent an image, i.e. order

=

Then, the above formula (1) shows that: where the image transitions slowly its value is small (corresponding to the image being darker); and the values are large at places where the transformation is fast, such as line profiles; the image becomes clear after gradient operation so as to achieve the aim of sharpening;

for the adverse factor that the image is dark due to small gradient at the place where the image transformation is slow, the solution is to give a threshold value

If less than the threshold value

Keeping the original gray value unchanged; if greater than or equal to the threshold valueThen assign a value of

：

Step 2-5: thinning: the process of finding the central skeleton part of one image is specified, the central skeleton part is one of important features for describing the geometric and topological properties of the image, and the thinning of the processed image is helpful for highlighting the shape characteristics and reducing the redundant information content;

the adopted refinement algorithm has the characteristics that: one of an image

Areas, for each point, are labeled the names P1, P2, … P8, where P1 is located at the center; if P1=1, i.e., a black dot, the following 4 conditions, if satisfied simultaneously, delete P1 (P1 = 0); (ii) 2NZ(P1)

6；②= 1; ③ P2P 4P 8=0 or

1; (iv) P2 × P4 × P6=0 or

1; this step is repeated for each point in the image until all points are undeletable;

and step 3: shape feature extraction of the pen body data: the shape characteristics of the handwritten Chinese characters comprise inclination, gravity center offset and character extension, and the characteristics are described by moments;

for a frame

Character image ofThe p + q order moment is defined as:

wherein,each ofThe physical meaning of the order moments is explained below:

representing the degree of extension of the image in the horizontal direction;

representing the degree of extension of the image in the vertical direction;representing the inclination of an image

>0 means that the image is tilted up to the left,

<0 represents the image tilting up to the right;

representing the degree of shift of the center of gravity of an image in the horizontal direction

>0 indicates that the center of gravity is to the left,<0 indicates the center of gravity is to the right;

representing the degree of shift of the center of gravity of an image in the vertical direction

>0 means that the center of gravity is on the upper side,

<0 represents a center of gravity off;

indicating the degree of equalization of the horizontal stretching of the image,

>0 means that the horizontal extension in the lower part of the image is larger than in the upper part of the image,

<0 indicates that the horizontal stretch in the lower part of the image is smaller than in the upper part of the image;

indicating the degree of equalization of the vertical stretching of the image,>0 means that the vertical extension of the right part of the image is greater than the left part of the image,

<0 indicates that the vertical extension of the right part of the image is smaller than the left part of the image;

from the central moments calculated above, some normalized shape features are derived:

aspect ratio:

font gradient:

(ii) a Wherein

Drawing length:(ii) a Wherein

Is calculated as follows:

the stretching degree:

(ii) a Wherein size represents the size of the character lattice, i.e.

The central moments of the third order, since their values may be positive or negative, are in fact composed of two parts, the basis functions of which are positive and negative;

the normalized features extracted by the third-order central moment are as follows:

horizontal offset degree:

vertical offset degree:

③ degree of horizontal stretching and balancing:

vertical stretching equilibrium degree:

for corresponding feature words, a feature vector is obtained

The defined distances are as follows:

(ii) a Wherein r is:

and 4, step 4: and (3) identifying pen body data: establishing a polar coordinate system by taking the centroid of the character as an origin and half of the normalized height of the character as a radius, and performing sector average partitioning on the whole circumference, wherein the number n of blocks is variable;

finally, whether the two bitmaps are similar or not is judged, namely whether the current written Chinese character and the character appearing in the past stored notes have certain similarity or not follows the same criterion: only when the difference value of every two pixel points of each block area in n block areas corresponding to the two bitmaps is smaller than a preset threshold value, the two bitmaps are regarded as similar; freezing n to be variable is equivalent to setting the search precision to be adjustable;

obviously, we consider a search to be a fuzzy search when n =12, and a precise search when n = 360; after the font data recognition is completed, the computer displays the recognized font on the screen.