WO2021184692A1 - Document classification collaborative robot and image character recognition method based thereon - Google Patents

Document classification collaborative robot and image character recognition method based thereon Download PDF

Info

Publication number
WO2021184692A1
WO2021184692A1 PCT/CN2020/112598 CN2020112598W WO2021184692A1 WO 2021184692 A1 WO2021184692 A1 WO 2021184692A1 CN 2020112598 W CN2020112598 W CN 2020112598W WO 2021184692 A1 WO2021184692 A1 WO 2021184692A1
Authority
WO
WIPO (PCT)
Prior art keywords
picture
text
image
text line
character recognition
Prior art date
Application number
PCT/CN2020/112598
Other languages
French (fr)
Chinese (zh)
Inventor
邓辅秦
李伟科
林淮荣
黄永深
冯华
岳洪伟
丁毅
龙佳乐
张建民
王栋
钟东洲
李澄非
习江涛
Original Assignee
五邑大学
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 五邑大学 filed Critical 五邑大学
Publication of WO2021184692A1 publication Critical patent/WO2021184692A1/en

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V20/00Scenes; Scene-specific elements
    • G06V20/60Type of objects
    • G06V20/62Text, e.g. of license plates, overlay texts or captions on TV images
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/21Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
    • G06F18/214Generating training patterns; Bootstrap methods, e.g. bagging or boosting
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/24Classification techniques
    • G06F18/241Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V20/00Scenes; Scene-specific elements
    • G06V20/10Terrestrial scenes
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/10Character recognition

Definitions

  • the invention relates to the field of file management, in particular to a file classification collaborative robot and an image character recognition method based thereon.
  • the purpose of the present invention is to provide a document classification collaborative robot and an image and text recognition method based on it, which can sort and organize documents manually, which is beneficial to reduce the workload of office workers.
  • a collaborative robot for document classification including:
  • the camera device is used to take images of files
  • the main body of the robot which is used to put the file in the position designated by the user;
  • the upper computer is used for recognizing and outputting the characters in the text picture and correspondingly overall controlling the movement of the robot main body, which is respectively connected with the camera device and the robot main body.
  • An image and text recognition method of a document classification collaborative robot including:
  • the characters in the text picture to be recognized are output.
  • the one or more technical solutions provided in the embodiments of the present invention have at least the following beneficial effects: based on the cooperation of the camera device, the main body of the robot, and the host computer, it can take a picture of a document and recognize the text in it, and then realize the realization based on the recognized text
  • For document classification management there is no need to manually assist the entire process, and the degree of intelligent control is high.
  • the construction of a relevant recognition model for the taken pictures can systematically recognize the text in the document, and the recognition effect is more prominent and not easy
  • the recognition error rate is greatly reduced. Therefore, the present invention is ingenious in design, intelligent and efficient in classification and recognition, and can manually sort and organize files, which is beneficial to reducing the work burden of office workers.
  • said converting the obtained text line picture into a standard text line picture includes:
  • the text line binarized picture and the background picture are synthesized to obtain a standard text line picture.
  • said forming a preset text line binarization picture according to the obtained text line picture includes:
  • a preset text line binary picture is formed according to the text image.
  • said determining the preset background picture according to the text picture to be recognized includes:
  • a preset background picture is formed according to the background area without text.
  • construction of a picture character recognition model based on the standard text line picture includes:
  • a picture text recognition model is constructed through a deep neural network based on the training sample set, wherein the deep neural network is designed to use the sample picture as training data and the text content of the sample picture as a label.
  • the obtaining the corresponding sample picture according to the standard text line picture includes:
  • the output of the text in the text picture to be recognized based on the picture text recognition model includes:
  • the new entry is converted into corresponding text content according to the preset font type.
  • the method further includes:
  • FIG. 1 is a schematic block diagram of the structure of a file classification collaborative robot according to an embodiment of the present invention
  • FIG. 2 is a schematic block diagram of the overall steps of an image character recognition method according to an embodiment of the present invention.
  • FIG. 3 is a schematic flowchart of the step of "converting the obtained text line picture into a standard text line picture" in the image character recognition method of the embodiment of the present invention
  • FIG. 4 is a schematic flowchart of the step of "building a picture character recognition model based on the standard text line picture" in the image character recognition method of the embodiment of the present invention
  • FIG. 5 is a schematic flowchart of the step of "outputting text in a text picture to be recognized based on the picture text recognition model" in the image text recognition method according to an embodiment of the present invention
  • Fig. 6 is a schematic flowchart of steps of an image character recognition method according to an embodiment of the present invention.
  • a document classification collaborative robot includes:
  • the camera device is used to take images of files
  • the main body of the robot which is used to put the file in the position designated by the user;
  • the upper computer is used for recognizing and outputting the characters in the text picture and correspondingly overall controlling the movement of the robot main body, which is respectively connected with the camera device and the robot main body.
  • an image and text recognition method of a document classification collaborative robot includes:
  • the characters in the text picture to be recognized are output.
  • the present invention is ingenious in design, intelligent and efficient in classification and recognition, and can manually sort and organize files, which is beneficial to reducing the work burden of office workers.
  • the function of the robot body is not limited to the above-mentioned content.
  • the recognition and shooting of the camera device when blocked, it can feed back to the host computer, and then control the robot body to move the obstacle, which plays a good auxiliary shooting role;
  • the upper computer stores a database of standard text line pictures, so it can easily construct a corresponding picture character recognition model based on any standard text line picture; and, when the camera device recognizes, it is classified based on the paper document input by the user According to the rules, the document is generally photographed and identified from near to far, that is, the documents are photographed one by one, and the documents are identified separately to avoid errors and omissions.
  • this step output the text in the text picture to be recognized based on the picture text recognition model, only need to obtain the text picture to be recognized, and then input it into the picture text recognition model, then the text to be recognized can be output It is very convenient and reliable to recognize the text in the text picture.
  • the conversion of the obtained text line picture into a standard text line picture includes:
  • the text line binarized picture and the background picture are synthesized to obtain a standard text line picture.
  • the obtained text line picture is converted into a standard text line picture, which can be matched with a database of standard text line pictures in the host computer, so as to facilitate its construction of a relevant picture character recognition model, wherein the obtained text line Binarization of the picture rendering can make its brightness more clearly visible and improve its recognition.
  • the background of the text picture to be recognized can be extracted, which can determine the application range of the text picture to be recognized, and then make the text line binary. By synthesizing the chemical picture and the background picture, the standard can be unified, and the standard text line picture can be obtained to facilitate the construction of the recognition model.
  • said forming a preset text line binarization picture according to the obtained text line picture includes:
  • a preset text line binary picture is formed according to the text image.
  • the text characteristics of the text line picture can be reflected, and then the text characteristics are output in the form of a text image, and finally converted into a text line binarized picture It can be seen that, through the steps of this embodiment, the acquired text characteristics of the text line picture can be extracted and presented in the text line binarized picture, which is beneficial to the subsequent text recognition processing.
  • the said determining the preset background picture according to the text picture to be recognized includes:
  • a preset background picture is formed according to the background area without text.
  • the standard template image is compared to the text image to be recognized, and the text image to be recognized can be mapped to the standard template image. Then only the background area without text in the standard template image is obtained, which is equivalent to also from the text to be recognized In addition to the corresponding background area extracted from the picture, the conversion can be used to conveniently and effectively generate a background picture for the text picture to be recognized.
  • the construction of a picture character recognition model based on the standard text line picture includes:
  • a picture text recognition model is constructed through a deep neural network based on the training sample set, wherein the deep neural network is designed to use the sample picture as training data and the text content of the sample picture as a label.
  • training based on sample pictures and their text content can reflect the characteristics of standard text line pictures in the training set, so that the recognition of the constructed picture text recognition model is more accurate, and a deep neural network is used.
  • the image data can be integrated and processed more stably, making the construction of the recognition model more convenient and effective.
  • set the CRNN model as a deep neural network model.
  • the CRNN model includes a convolutional layer using CNN, a recurrent layer using BiLSTM, and a transcription layer using CTC.
  • I i is the i-th sample picture
  • L I is the text content in the i-th sample picture
  • Y I is the i-th sample picture
  • the subscript i is the sequence number of the training data in the training sample set.
  • said obtaining the corresponding sample picture according to the standard text line picture includes:
  • the standard text line picture is subjected to one or more processing of expansion and variable transformation, tone transformation, adding shadow effect, adding highlight effect, adding noise, cropping, scaling, and compression.
  • the expansion and change processing of the standard text line picture can eliminate and reduce its own picture defects, so as to obtain a more stable and rich sample picture.
  • the above expansion and change processing is obtained by the inventor based on experiments and experience. of.
  • the output of the text in the text picture to be recognized based on the picture text recognition model includes:
  • the new entry is converted into corresponding text content according to the preset font type.
  • the combined expansion of the text in the text image can generate more new entries and match them with the text content, so this is equivalent to expanding the scope of its application on the original basis.
  • the method further includes:
  • the user classification rule determined in the host computer is that the user decides the classification method, such as mathematical characters, subject differences, etc., which can facilitate the user to participate in the formulation of the classification; because there is likely to be more than one document processed in practice, Therefore, the above steps can also be repeated until it is judged whether it is the last file. If it is, the classification is ended. Otherwise, continue to classify according to the above steps. It can be seen that through comparison, it can be verified whether the text in the text image to be recognized meets the user's classification. Demands in order to find errors and reduce the error rate.
  • the classification method such as mathematical characters, subject differences, etc.

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • General Physics & Mathematics (AREA)
  • Artificial Intelligence (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Evolutionary Biology (AREA)
  • Evolutionary Computation (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • General Engineering & Computer Science (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Multimedia (AREA)
  • Character Discrimination (AREA)

Abstract

A document classification collaborative robot and an image character recognition method based thereon, wherein the document classification collaborative robot comprises: a camera device for capturing an image of a document; a robot main body for placing the document at a location specified by a user; and an upper computer for recognizing and outputting characters in a text picture and correspondingly performing overall control on the robot main body to make same move, the upper computer being connected to the camera device and the robot main body, respectively. The image character recognition method for the robot comprises: converting an acquired text line picture into a standard text line picture (S100); constructing a picture character recognition model according to the standard text line picture (S200); on the basis of the picture character recognition model, outputting characters in a text picture to be subjected to recognition (S300). The method is cleverly designed, realizes intelligent and efficient classification and recognition, and can classify and sort documents in a non-manual manner, thereby facilitating the reduction in the workload of office workers.

Description

一种文件分类协作机器人及基于其的图像文字识别方法Document classification collaborative robot and image character recognition method based on it 技术领域Technical field
本发明涉及文件管理领域,尤其是一种文件分类协作机器人及基于其的图像文字识别方法。The invention relates to the field of file management, in particular to a file classification collaborative robot and an image character recognition method based thereon.
背景技术Background technique
现在的都市人群工作压力大,需要处理大量的纸质文件,而纸质文件的整理耗费的时间在效率至上的当今显得十分宝贵,鉴于目前现有的文件分类方法大多是采用人工手动分类,分类效率低,并容易出现分类错误的情况,市面上相应推出了一系列设备来执行机器分类,以减轻人工负担,但无法实现完全的去人工化,比如在中国专利CN110369296A中,提出了一种办公文件整理系统及其提示方法,其采用RFID电子标签的方式对纸质文件进行存储分类,但仍需要人工对待分类文件进行初步的鉴别并对待分类份文件安装电子标签,当待分类文件数量较大,或待分类文件需分类数目众多时,仍需要人工安装大量的电子标签,效率仍然较低,且有一定的出错率。Nowadays, urban people are under great work pressure and need to process a large number of paper documents. The time spent on paper document sorting is very precious today when efficiency is paramount. In view of the current existing document classification methods, most of them use manual manual classification. The efficiency is low and classification errors are prone to occur. A series of equipment has been launched on the market to perform machine classification to reduce the labor burden, but it is impossible to achieve complete de-manualization. For example, in the Chinese patent CN110369296A, an office is proposed. The file sorting system and its prompting method use RFID electronic tags to store and classify paper documents, but it still needs to manually perform preliminary identification of classified files and install electronic tags on the classified files. When the number of files to be classified is large , Or when a large number of files to be classified need to be classified, a large number of electronic tags still need to be manually installed, the efficiency is still low, and there is a certain error rate.
发明内容Summary of the invention
为了解决上述问题,本发明的目的是提供一种文件分类协作机器人及基于其的图像文字识别方法,能够去人工化地对文件进行分类整理,有利于降低办公人群的工作负担。In order to solve the above-mentioned problems, the purpose of the present invention is to provide a document classification collaborative robot and an image and text recognition method based on it, which can sort and organize documents manually, which is beneficial to reduce the workload of office workers.
为了弥补现有技术的不足,本发明实施例采用的技术方案是:In order to make up for the deficiencies of the prior art, the technical solutions adopted in the embodiments of the present invention are:
一种文件分类协作机器人,包括:A collaborative robot for document classification, including:
摄像装置,用于对文件进行图像拍摄;The camera device is used to take images of files;
机器人主体,用于将所述文件放至用户指定位置;The main body of the robot, which is used to put the file in the position designated by the user;
上位机,用于对文本图片中文字进行识别输出并相应地统筹控制所述机器人主体进行移动,分别与所述摄像装置和所述机器人主体相连接。The upper computer is used for recognizing and outputting the characters in the text picture and correspondingly overall controlling the movement of the robot main body, which is respectively connected with the camera device and the robot main body.
一种文件分类协作机器人的图像文字识别方法,包括:An image and text recognition method of a document classification collaborative robot, including:
将所获取的文本行图片转化为标准文本行图片;Convert the obtained text line picture into a standard text line picture;
根据所述标准文本行图片构建图片文字识别模型;Constructing a picture character recognition model according to the standard text line picture;
基于所述图片文字识别模型输出待识别文本图片中的文字。Based on the picture character recognition model, the characters in the text picture to be recognized are output.
本发明实施例中提供的一个或多个技术方案,至少具有如下有益效果:基于摄像装置、机器人主体和上位机的配合,能够拍摄文件图片并针对其中的文字进行识别,进而根据识别的文字实现对于文件的分类管理,无需人工协助整个流程,智能化控制程度高,尤其是,针对所拍摄的图片构建相关识别模型,能够系统化地对文件中的文字进行识别,识别效果较为突出,不容易出现误识别的情况,大大降低了识别出错率。因此,本发明设计巧妙,分类识别智能高效,能够去人工化地对文件进行分类整理,有利于降低办公人群的工作负担。The one or more technical solutions provided in the embodiments of the present invention have at least the following beneficial effects: based on the cooperation of the camera device, the main body of the robot, and the host computer, it can take a picture of a document and recognize the text in it, and then realize the realization based on the recognized text For document classification management, there is no need to manually assist the entire process, and the degree of intelligent control is high. In particular, the construction of a relevant recognition model for the taken pictures can systematically recognize the text in the document, and the recognition effect is more prominent and not easy In the case of misrecognition, the recognition error rate is greatly reduced. Therefore, the present invention is ingenious in design, intelligent and efficient in classification and recognition, and can manually sort and organize files, which is beneficial to reducing the work burden of office workers.
进一步地,所述的将所获取的文本行图片转化为标准文本行图片,包括:Further, said converting the obtained text line picture into a standard text line picture includes:
根据所获取的文本行图片形成预设的文本行二值化图片;Form a preset text line binarization picture according to the obtained text line picture;
根据所述待识别文本图片确定预设的背景图片;Determining a preset background picture according to the text picture to be recognized;
将所述文本行二值化图片和所述背景图片进行合成处理,得到标准文本行图片。The text line binarized picture and the background picture are synthesized to obtain a standard text line picture.
进一步地,所述的根据所获取的文本行图片形成预设的文本行二值化图片,包括:Further, said forming a preset text line binarization picture according to the obtained text line picture includes:
从所获取的文本行图片中提取若干相关的文本内容;Extract a number of relevant text content from the obtained text line picture;
将所述文本内容处理生成对应的文本图像;Processing the text content to generate a corresponding text image;
根据所述文本图像形成预设的文本行二值化图片。A preset text line binary picture is formed according to the text image.
进一步地,所述的根据待识别文本图片确定预设的背景图片,包括:Further, said determining the preset background picture according to the text picture to be recognized includes:
根据所述待识别文本图片确定相关的标准模板图片;Determine the relevant standard template picture according to the text picture to be recognized;
从所述标准模板图片中获取无文字的背景区域;Obtain a background area without text from the standard template picture;
根据所述无文字的背景区域形成预设的背景图片。A preset background picture is formed according to the background area without text.
进一步地,所述的根据所述标准文本行图片构建图片文字识别模型,包括:Further, the construction of a picture character recognition model based on the standard text line picture includes:
根据所述标准文本行图片得到对应的样本图片;Obtain a corresponding sample picture according to the standard text line picture;
整合所述样本图片以及所述样本图片中的文本内容,形成训练样本集;Integrating the sample picture and the text content in the sample picture to form a training sample set;
基于所述训练样本集通过深度神经网络构建图片文字识别模型,其中,所述深度神经网络被设计为以所述样本图片为训练数据,以所述样本图片的文本内容为标签。A picture text recognition model is constructed through a deep neural network based on the training sample set, wherein the deep neural network is designed to use the sample picture as training data and the text content of the sample picture as a label.
进一步地,所述的根据所述标准文本行图片得到对应的样本图片,包括:Further, the obtaining the corresponding sample picture according to the standard text line picture includes:
对所述标准文本行图片进行扩充变化处理,得到对应的样本图片,其中, 所述的扩充变化处理包括透视变换、色调变换、添加阴影效果、添加高光效果、添加噪点、裁剪、缩放以及压缩中的一种处理或多种处理。Perform expansion and change processing on the standard text line picture to obtain a corresponding sample picture, where the expansion and change processing includes perspective transformation, tone transformation, adding shadow effects, adding highlight effects, adding noise, cropping, scaling, and compression. One treatment or multiple treatments.
进一步地,所述的基于所述图片文字识别模型输出待识别文本图片中的文字,包括:Further, the output of the text in the text picture to be recognized based on the picture text recognition model includes:
从所述待识别文本图片中的文字中获取若干相关词条;Acquiring a number of related entries from the text in the text picture to be recognized;
对若干所述词条进行拆分组合,生成新词条;Split and combine a number of said entries to generate new entries;
将所述新词条按照预设的字体类型转换为对应的文本内容。The new entry is converted into corresponding text content according to the preset font type.
进一步地,当所述的基于所述图片文字识别模型输出所述待识别文本图片中的文字之后,还包括:Further, after the output of the text in the to-be-recognized text image based on the image text recognition model, the method further includes:
将输出的所述待识别文本图片中的文字与所述上位机中所确定的用户分类规则进行比较,若两者保持一致,则通过上位机发送指令使所述机器人主体将所述文件放至用户指定位置,否则,返回获取文本行图片并执行所述的图像文字识别方法。Compare the output text in the to-be-recognized text picture with the user classification rules determined in the host computer. If the two are consistent, the host computer sends instructions to the robot body to put the file in The user specifies the location, otherwise, return to get the text line picture and execute the image text recognition method.
本发明的附加方面和优点将在下面的描述中部分给出,部分将从下面的描述中变得明显,或通过本发明的实践了解到。The additional aspects and advantages of the present invention will be partly given in the following description, and partly will become obvious from the following description, or be understood through the practice of the present invention.
附图说明Description of the drawings
下面结合附图给出本发明较佳实施例,以详细说明本发明的实施方案。Hereinafter, preferred embodiments of the present invention are given in conjunction with the drawings to illustrate the implementation of the present invention in detail.
图1是本发明实施例的一种文件分类协作机器人的结构示意框图;FIG. 1 is a schematic block diagram of the structure of a file classification collaborative robot according to an embodiment of the present invention;
图2是本发明实施例的图像文字识别方法的整体步骤示意框图;2 is a schematic block diagram of the overall steps of an image character recognition method according to an embodiment of the present invention;
图3是本发明实施例的图像文字识别方法中步骤“将所获取的文本行图片转化为标准文本行图片”的流程示意图;3 is a schematic flowchart of the step of "converting the obtained text line picture into a standard text line picture" in the image character recognition method of the embodiment of the present invention;
图4是本发明实施例的图像文字识别方法中步骤“根据所述标准文本行图片构建图片文字识别模型”的流程示意图;4 is a schematic flowchart of the step of "building a picture character recognition model based on the standard text line picture" in the image character recognition method of the embodiment of the present invention;
图5是本发明实施例的图像文字识别方法中步骤“基于所述图片文字识别模型输出待识别文本图片中的文字”的流程示意图;5 is a schematic flowchart of the step of "outputting text in a text picture to be recognized based on the picture text recognition model" in the image text recognition method according to an embodiment of the present invention;
图6是本发明实施例的图像文字识别方法的步骤流程示意图。Fig. 6 is a schematic flowchart of steps of an image character recognition method according to an embodiment of the present invention.
具体实施方式Detailed ways
为了使本发明的目的、技术方案及优点更加清楚明白,以下结合附图及实施例,对本发明进行进一步详细说明。应当理解,此处所描述的具体实施例仅用以 解释本发明,并不用于限定本发明。In order to make the objectives, technical solutions, and advantages of the present invention clearer, the following further describes the present invention in detail with reference to the accompanying drawings and embodiments. It should be understood that the specific embodiments described here are only used to explain the present invention, but not used to limit the present invention.
需要说明的是,如果不冲突,本发明实施例中的各个特征可以相互结合,均在本发明的保护范围之内。另外,虽然在系统示意图中进行了功能模块划分,在流程图中示出了逻辑顺序,但是在某些情况下,可以以不同于系统中的模块划分,或流程图中的顺序执行所示出或描述的步骤。It should be noted that if there is no conflict, the various features in the embodiments of the present invention can be combined with each other, and all fall within the protection scope of the present invention. In addition, although functional modules are divided in the system schematic diagram, and the logical sequence is shown in the flowchart, in some cases, the module division in the system may be different from the module division in the system, or the sequence shown in the flowchart may be executed. Or the steps described.
下面结合附图,对本发明实施例作进一步阐述。The embodiments of the present invention will be further described below in conjunction with the accompanying drawings.
参照图1,本发明实施例的一种文件分类协作机器人,包括:1, a document classification collaborative robot according to an embodiment of the present invention includes:
摄像装置,用于对文件进行图像拍摄;The camera device is used to take images of files;
机器人主体,用于将所述文件放至用户指定位置;The main body of the robot, which is used to put the file in the position designated by the user;
上位机,用于对文本图片中文字进行识别输出并相应地统筹控制所述机器人主体进行移动,分别与所述摄像装置和所述机器人主体相连接。The upper computer is used for recognizing and outputting the characters in the text picture and correspondingly overall controlling the movement of the robot main body, which is respectively connected with the camera device and the robot main body.
参照图2和图6,本发明实施例的一种文件分类协作机器人的图像文字识别方法,包括:2 and 6, an image and text recognition method of a document classification collaborative robot according to an embodiment of the present invention includes:
将所获取的文本行图片转化为标准文本行图片;Convert the obtained text line picture into a standard text line picture;
根据所述标准文本行图片构建图片文字识别模型;Constructing a picture character recognition model according to the standard text line picture;
基于所述图片文字识别模型输出待识别文本图片中的文字。Based on the picture character recognition model, the characters in the text picture to be recognized are output.
具体地,基于摄像装置、机器人主体和上位机的配合,能够拍摄文件图片并针对其中的文字进行识别,进而根据识别的文字实现对于文件的分类管理,无需人工协助整个流程,智能化控制程度高,尤其是,针对所拍摄的图片构建相关识别模型,能够系统化地对文件中的文字进行识别,识别效果较为突出,不容易出现误识别的情况,大大降低了识别出错率。因此,本发明设计巧妙,分类识别智能高效,能够去人工化地对文件进行分类整理,有利于降低办公人群的工作负担。Specifically, based on the cooperation of the camera, the main body of the robot, and the host computer, it can take a picture of the document and recognize the text in it, and then realize the classification and management of the document according to the recognized text, without manual assistance in the entire process, and a high degree of intelligent control In particular, the construction of a relevant recognition model for the taken pictures can systematically recognize the text in the document, the recognition effect is more prominent, the situation of misrecognition is not easy to occur, and the recognition error rate is greatly reduced. Therefore, the present invention is ingenious in design, intelligent and efficient in classification and recognition, and can manually sort and organize files, which is beneficial to reducing the work burden of office workers.
其中,机器人主体的功能不限于上述所述内容,实际上,当摄像装置的识别拍摄受阻时,其能够反馈给上位机,进而控制机器人主体将阻挡物进行腾挪,起到良好的辅助拍摄作用;并且,上位机中储存有标准文本行图片的数据库,因此能够轻松地根据任一标准文本行图片来构建相应的图片文字识别模型;并且,在摄像装置识别时,基于用户输入的纸质文件分类规则,一般采用由近及远的方式对文件进行拍摄识别,即逐份地对文件一一拍摄,分别进行识别,以免发生错漏。参照图5,该步骤:基于所述图片文字识别模型输出待识别文本图片中的文字, 只需将待识别文本图片获取得到,然后输入至所述图片文字识别模型中即可,则能够输出待识别文本图片中的文字,非常方便可靠。Among them, the function of the robot body is not limited to the above-mentioned content. In fact, when the recognition and shooting of the camera device is blocked, it can feed back to the host computer, and then control the robot body to move the obstacle, which plays a good auxiliary shooting role; In addition, the upper computer stores a database of standard text line pictures, so it can easily construct a corresponding picture character recognition model based on any standard text line picture; and, when the camera device recognizes, it is classified based on the paper document input by the user According to the rules, the document is generally photographed and identified from near to far, that is, the documents are photographed one by one, and the documents are identified separately to avoid errors and omissions. Referring to Figure 5, this step: output the text in the text picture to be recognized based on the picture text recognition model, only need to obtain the text picture to be recognized, and then input it into the picture text recognition model, then the text to be recognized can be output It is very convenient and reliable to recognize the text in the text picture.
更进一步地,参照图3,所述的将所获取的文本行图片转化为标准文本行图片,包括:Furthermore, referring to FIG. 3, the conversion of the obtained text line picture into a standard text line picture includes:
根据所获取的文本行图片形成预设的文本行二值化图片;Form a preset text line binarization picture according to the obtained text line picture;
根据所述待识别文本图片确定预设的背景图片;Determining a preset background picture according to the text picture to be recognized;
将所述文本行二值化图片和所述背景图片进行合成处理,得到标准文本行图片。The text line binarized picture and the background picture are synthesized to obtain a standard text line picture.
具体地,将所获取的文本行图片转化为标准文本行图片,能够与上位机中的标准文本行图片的数据库进行匹配,从而方便其构建相关图片文字识别模型,其中,使所获取的文本行图片呈现二值化,能够使其亮度更加清楚可见,提高了其识别度,另一方面,使待识别文本图片的背景被提取,能够确定待识别文本图片的应用范围,进而使文本行二值化图片和背景图片进行合成,就能够统一标准,得到标准文本行图片,以便于识别模型的构建。Specifically, the obtained text line picture is converted into a standard text line picture, which can be matched with a database of standard text line pictures in the host computer, so as to facilitate its construction of a relevant picture character recognition model, wherein the obtained text line Binarization of the picture rendering can make its brightness more clearly visible and improve its recognition. On the other hand, the background of the text picture to be recognized can be extracted, which can determine the application range of the text picture to be recognized, and then make the text line binary. By synthesizing the chemical picture and the background picture, the standard can be unified, and the standard text line picture can be obtained to facilitate the construction of the recognition model.
更进一步地,所述的根据所获取的文本行图片形成预设的文本行二值化图片,包括:Furthermore, said forming a preset text line binarization picture according to the obtained text line picture includes:
从所获取的文本行图片中提取若干相关的文本内容;Extract a number of relevant text content from the obtained text line picture;
将所述文本内容处理生成对应的文本图像;Processing the text content to generate a corresponding text image;
根据所述文本图像形成预设的文本行二值化图片。A preset text line binary picture is formed according to the text image.
在本实施例中,通过提取所获取的文本行图片中的文本内容,能够体现文本行图片的文字特征,进而将该文字特征以文本图像的形式进行输出,最终转化为文本行二值化图片,可见,通过本实施例的步骤能够将所获取的文本行图片的文字特征提取并呈现在文本行二值化图片之中,有利于后续的文字识别处理。In this embodiment, by extracting the text content of the obtained text line picture, the text characteristics of the text line picture can be reflected, and then the text characteristics are output in the form of a text image, and finally converted into a text line binarized picture It can be seen that, through the steps of this embodiment, the acquired text characteristics of the text line picture can be extracted and presented in the text line binarized picture, which is beneficial to the subsequent text recognition processing.
更进一步地,所述的根据待识别文本图片确定预设的背景图片,包括:Furthermore, the said determining the preset background picture according to the text picture to be recognized includes:
根据所述待识别文本图片确定相关的标准模板图片;Determine the relevant standard template picture according to the text picture to be recognized;
从所述标准模板图片中获取无文字的背景区域;Obtain a background area without text from the standard template picture;
根据所述无文字的背景区域形成预设的背景图片。A preset background picture is formed according to the background area without text.
具体地,以标准模板图片来比对待识别文本图片,能够将待识别文本图片映射于标准模板图片上,那么只需在标准模板图片中获取无文字的背景区域,即相 当于也从待识别文本图片中提取除了相应的背景区域,因此,通过该转化能够方便有效地为待识别文本图片生成背景图片。Specifically, the standard template image is compared to the text image to be recognized, and the text image to be recognized can be mapped to the standard template image. Then only the background area without text in the standard template image is obtained, which is equivalent to also from the text to be recognized In addition to the corresponding background area extracted from the picture, the conversion can be used to conveniently and effectively generate a background picture for the text picture to be recognized.
更进一步地,参照图4,所述的根据所述标准文本行图片构建图片文字识别模型,包括:Furthermore, referring to FIG. 4, the construction of a picture character recognition model based on the standard text line picture includes:
根据所述标准文本行图片得到对应的样本图片;Obtain a corresponding sample picture according to the standard text line picture;
整合所述样本图片以及所述样本图片中的文本内容,形成训练样本集;Integrating the sample picture and the text content in the sample picture to form a training sample set;
基于所述训练样本集通过深度神经网络构建图片文字识别模型,其中,所述深度神经网络被设计为以所述样本图片为训练数据,以所述样本图片的文本内容为标签。A picture text recognition model is constructed through a deep neural network based on the training sample set, wherein the deep neural network is designed to use the sample picture as training data and the text content of the sample picture as a label.
在本实施例中,基于样本图片及其文本内容进行训练,能够将标准文本行图片的特征反映在训练集中,使得所构建形成的图片文字识别模型的识别度更加精准,并且,采用深度神经网络进行构建,可以将图片数据进行更加稳定地整合处理,使得识别模型构建更加方便有效。具体地,设置CRNN模型为深度神经网络的网络模型,CRNN模型包括使用CNN的卷积层,使用BiLSTM的循环层和使用CTC的转录层,公式表达为
Figure PCTCN2020112598-appb-000001
其中,χ={I i,,L I},i表示训练样本集,I i,为第i个样本图片,L I为第i个样本图片中的文本内容,Y I为第i个样本图片对应的预测文本内容,下标i为训练样本集中训练数据的序号。
In this embodiment, training based on sample pictures and their text content can reflect the characteristics of standard text line pictures in the training set, so that the recognition of the constructed picture text recognition model is more accurate, and a deep neural network is used. With the construction, the image data can be integrated and processed more stably, making the construction of the recognition model more convenient and effective. Specifically, set the CRNN model as a deep neural network model. The CRNN model includes a convolutional layer using CNN, a recurrent layer using BiLSTM, and a transcription layer using CTC. The formula is expressed as
Figure PCTCN2020112598-appb-000001
Among them, χ={I i, ,L I }, i represents the training sample set, I i, is the i-th sample picture, L I is the text content in the i-th sample picture, Y I is the i-th sample picture Corresponding to the predicted text content, the subscript i is the sequence number of the training data in the training sample set.
更进一步地,所述的根据所述标准文本行图片得到对应的样本图片,包括:Furthermore, said obtaining the corresponding sample picture according to the standard text line picture includes:
对所述标准文本行图片进行扩充变视变换、色调变换、添加阴影效果、添加高光效果、添加噪点、裁剪、缩放以及压缩中的一种处理或多种处理。具体地,对标准文本行图片进行扩充变化处理,能够消除减小其本身存在的图片缺陷,从而可获得性质更加稳定丰富的样本图片,其中,上述扩充变化处理是发明人根据实验和经验所得到的。The standard text line picture is subjected to one or more processing of expansion and variable transformation, tone transformation, adding shadow effect, adding highlight effect, adding noise, cropping, scaling, and compression. Specifically, the expansion and change processing of the standard text line picture can eliminate and reduce its own picture defects, so as to obtain a more stable and rich sample picture. The above expansion and change processing is obtained by the inventor based on experiments and experience. of.
进一步地,所述的基于所述图片文字识别模型输出待识别文本图片中的文字,包括:Further, the output of the text in the text picture to be recognized based on the picture text recognition model includes:
从所述待识别文本图片中的文字中获取若干相关词条;Acquiring a number of related entries from the text in the text picture to be recognized;
对若干所述词条进行拆分组合,生成新词条;Split and combine a number of said entries to generate new entries;
将所述新词条按照预设的字体类型转换为对应的文本内容。The new entry is converted into corresponding text content according to the preset font type.
在本实施例中,通过对文本图片中的文字进行组合式扩展,能够产生更多 的新词条并实现其与文本内容之间的匹配,因此这相当于在原有基础上扩大了其应用范围,能够适配于所述的图片文字识别模型输出待识别文本图片中的文字。In this embodiment, the combined expansion of the text in the text image can generate more new entries and match them with the text content, so this is equivalent to expanding the scope of its application on the original basis. , Can be adapted to the picture character recognition model to output the characters in the text picture to be recognized.
进一步地,当所述的基于所述图片文字识别模型输出所述待识别文本图片中的文字之后,还包括:Further, after the output of the text in the to-be-recognized text image based on the image text recognition model, the method further includes:
将输出的所述待识别文本图片中的文字与所述上位机中所确定的用户分类规则进行比较,若两者保持一致,则通过上位机发送指令使所述机器人主体将所述文件放至用户指定位置,否则,返回获取文本行图片并执行所述的图像文字识别方法。Compare the output text in the to-be-recognized text picture with the user classification rules determined in the host computer. If the two are consistent, the host computer sends instructions to the robot body to put the file in The user specifies the location, otherwise, return to get the text line picture and execute the image text recognition method.
具体地,上位机中所确定的用户分类规则即由用户决定分类方法,比如按照数学字符、科目差别等,可以方便用户参与制订该项分类;由于实际中所处理的文件很可能不止一件,因此上述步骤也是可以重复执行的,直至判断是否为最后一份文件,若是,则结束分类,否则继续按照上面步骤进行分类,可见,通过比较可以验证待识别文本图片中的文字是否满足用户的分类需求,以便于找出错误,降低错误率。Specifically, the user classification rule determined in the host computer is that the user decides the classification method, such as mathematical characters, subject differences, etc., which can facilitate the user to participate in the formulation of the classification; because there is likely to be more than one document processed in practice, Therefore, the above steps can also be repeated until it is judged whether it is the last file. If it is, the classification is ended. Otherwise, continue to classify according to the above steps. It can be seen that through comparison, it can be verified whether the text in the text image to be recognized meets the user's classification. Demands in order to find errors and reduce the error rate.
以上内容对本发明的较佳实施例和基本原理作了详细论述,但本发明并不局限于上述实施方式,熟悉本领域的技术人员应该了解在不违背本发明精神的前提下还会有各种等同变形和替换,这些等同变形和替换都落入要求保护的本发明范围内。The above content has described the preferred embodiments and basic principles of the present invention in detail, but the present invention is not limited to the above-mentioned embodiments. Those skilled in the art should understand that there will be various types without departing from the spirit of the present invention. Equivalent deformations and replacements, these equivalent deformations and replacements all fall within the scope of the claimed invention.

Claims (9)

  1. 一种文件分类协作机器人,其特征在于,包括:A document classification collaborative robot, which is characterized in that it includes:
    摄像装置,用于对文件进行图像拍摄;The camera device is used to take images of files;
    机器人主体,用于将所述文件放至用户指定位置;The main body of the robot, which is used to put the file in the position designated by the user;
    上位机,用于对文本图片中文字进行识别输出并相应地统筹控制所述机器人主体进行移动,分别与所述摄像装置和所述机器人主体相连接。The upper computer is used for recognizing and outputting the characters in the text picture and correspondingly overall controlling the movement of the robot main body, which is respectively connected with the camera device and the robot main body.
  2. 基于权利要求1所述的一种文件分类协作机器人的图像文字识别方法,其特征在于,包括:An image character recognition method based on a document classification collaborative robot according to claim 1, characterized in that it comprises:
    将所获取的文本行图片转化为标准文本行图片;Convert the obtained text line picture into a standard text line picture;
    根据所述标准文本行图片构建图片文字识别模型;Constructing a picture character recognition model according to the standard text line picture;
    基于所述图片文字识别模型输出待识别文本图片中的文字。Based on the picture character recognition model, the characters in the text picture to be recognized are output.
  3. 根据权利要求2所述的基于一种文件分类协作机器人的图像文字识别方法,其特征在于,所述的将所获取的文本行图片转化为标准文本行图片,包括:The image text recognition method based on a document classification collaborative robot according to claim 2, wherein said converting the obtained text line picture into a standard text line picture comprises:
    根据所获取的文本行图片形成预设的文本行二值化图片;Form a preset text line binarization picture according to the obtained text line picture;
    根据所述待识别文本图片确定预设的背景图片;Determining a preset background picture according to the text picture to be recognized;
    将所述文本行二值化图片和所述背景图片进行合成处理,得到标准文本行图片。The text line binarized picture and the background picture are synthesized to obtain a standard text line picture.
  4. 根据权利要求3所述的基于一种文件分类协作机器人的图像文字识别方法,其特征在于,所述的根据所获取的文本行图片形成预设的文本行二值化图片,包括:The method for image text recognition based on a document classification collaborative robot according to claim 3, wherein said forming a preset text line binarization picture according to the obtained text line picture comprises:
    从所获取的文本行图片中提取若干相关的文本内容;Extract a number of relevant text content from the obtained text line picture;
    将所述文本内容处理生成对应的文本图像;Processing the text content to generate a corresponding text image;
    根据所述文本图像形成预设的文本行二值化图片。A preset text line binary picture is formed according to the text image.
  5. 根据权利要求3所述的基于一种文件分类协作机器人的图像文字识别方法,其特征在于,所述的根据待识别文本图片确定预设的背景图片,包括:The image and text recognition method based on a document classification collaborative robot according to claim 3, wherein said determining a preset background picture according to a text picture to be recognized comprises:
    根据所述待识别文本图片确定相关的标准模板图片;Determine the relevant standard template picture according to the text picture to be recognized;
    从所述标准模板图片中获取无文字的背景区域;Obtain a background area without text from the standard template picture;
    根据所述无文字的背景区域形成预设的背景图片。A preset background picture is formed according to the background area without text.
  6. 根据权利要求2所述的基于一种文件分类协作机器人的图像文字识别方法,其特征在于,所述的根据所述标准文本行图片构建图片文字识别模型,包括:The image character recognition method based on a document classification collaborative robot according to claim 2, wherein said constructing an image character recognition model based on the standard text line image comprises:
    根据所述标准文本行图片得到对应的样本图片;Obtain a corresponding sample picture according to the standard text line picture;
    整合所述样本图片以及所述样本图片中的文本内容,形成训练样本集;Integrating the sample picture and the text content in the sample picture to form a training sample set;
    基于所述训练样本集通过深度神经网络构建图片文字识别模型,其中,所述深度神经网络被设计为以所述样本图片为训练数据,以所述样本图片的文本内容为标签。A picture text recognition model is constructed through a deep neural network based on the training sample set, wherein the deep neural network is designed to use the sample picture as training data and the text content of the sample picture as a label.
  7. 根据权利要求6所述的基于一种文件分类协作机器人的图像文字识别方法,其特征在于,所述的根据所述标准文本行图片得到对应的样本图片,包括:The method for image character recognition based on a document classification collaborative robot according to claim 6, wherein said obtaining corresponding sample pictures according to said standard text line pictures comprises:
    对所述标准文本行图片进行扩充变化处理,得到对应的样本图片,其中,所述的扩充变化处理包括透视变换、色调变换、添加阴影效果、添加高光效果、添加噪点、裁剪、缩放以及压缩中的一种处理或多种处理。Perform expansion and change processing on the standard text line picture to obtain a corresponding sample picture, where the expansion and change processing includes perspective transformation, tone transformation, adding shadow effects, adding highlight effects, adding noise, cropping, scaling, and compression. One treatment or multiple treatments.
  8. 根据权利要求2所述的基于一种文件分类协作机器人的图像文字识别方法,其特征在于,所述的基于所述图片文字识别模型输出待识别文本图片中的文字,包括:The image character recognition method based on a document classification collaborative robot according to claim 2, wherein said outputting the characters in the text picture to be recognized based on the picture character recognition model comprises:
    从所述待识别文本图片中的文字中获取若干相关词条;Acquiring a number of related entries from the text in the text picture to be recognized;
    对若干所述词条进行拆分组合,生成新词条;Split and combine a number of said entries to generate new entries;
    将所述新词条按照预设的字体类型转换为对应的文本内容。The new entry is converted into corresponding text content according to the preset font type.
  9. 根据权利要求2或8所述的基于一种文件分类协作机器人的图像文字识别方法,其特征在于,当所述的基于所述图片文字识别模型输出所述待识别文本图片中的文字之后,还包括:The image text recognition method based on a document classification collaborative robot according to claim 2 or 8, wherein after the text in the text image to be recognized is output based on the image text recognition model, further include:
    将输出的所述待识别文本图片中的文字与所述上位机中所确定的用户分类规则进行比较,若两者保持一致,则通过上位机发送指令使所述机器人主体将所述文件放至用户指定位置,否则,返回获取文本行图片并执行所述的图像文字识别方法。Compare the output text in the to-be-recognized text picture with the user classification rule determined in the host computer. If the two are consistent, the host computer sends an instruction to the robot body to put the file in The user specifies the location, otherwise, return to get the text line picture and execute the image text recognition method.
PCT/CN2020/112598 2020-03-16 2020-08-31 Document classification collaborative robot and image character recognition method based thereon WO2021184692A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202010182139.8 2020-03-16
CN202010182139.8A CN111428710A (en) 2020-03-16 2020-03-16 File classification collaboration robot and image character recognition method based on same

Publications (1)

Publication Number Publication Date
WO2021184692A1 true WO2021184692A1 (en) 2021-09-23

Family

ID=71547960

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2020/112598 WO2021184692A1 (en) 2020-03-16 2020-08-31 Document classification collaborative robot and image character recognition method based thereon

Country Status (2)

Country Link
CN (1) CN111428710A (en)
WO (1) WO2021184692A1 (en)

Families Citing this family (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111428710A (en) * 2020-03-16 2020-07-17 五邑大学 File classification collaboration robot and image character recognition method based on same
CN112686243A (en) * 2020-12-29 2021-04-20 平安普惠企业管理有限公司 Method and device for intelligently identifying picture characters, computer equipment and storage medium
CN112906693B (en) * 2021-03-05 2022-06-24 杭州费尔斯通科技有限公司 Method for identifying subscript character and subscript character
CN113807416B (en) * 2021-08-30 2024-04-05 国泰新点软件股份有限公司 Model training method and device, electronic equipment and storage medium
CN116912852A (en) * 2023-07-25 2023-10-20 京东方科技集团股份有限公司 Method, device and storage medium for identifying text of business card

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20150023590A1 (en) * 2013-07-16 2015-01-22 National Taiwan University Of Science And Technology Method and system for human action recognition
CN109325493A (en) * 2018-08-23 2019-02-12 厦门理工学院 A kind of character recognition method and anthropomorphic robot based on anthropomorphic robot
CN109650021A (en) * 2018-12-27 2019-04-19 安徽清芝商用机器有限公司 A kind of slim file accessing transportation robot device and its application method
CN109807883A (en) * 2018-12-17 2019-05-28 广州正瀚材料科技有限公司 A kind of archiving method of file
CN110369296A (en) * 2019-06-25 2019-10-25 安徽永顺信息科技有限公司 A kind of office document clearing system and its reminding method
CN110414519A (en) * 2019-06-27 2019-11-05 众安信息技术服务有限公司 A kind of recognition methods of picture character and its identification device
CN111428710A (en) * 2020-03-16 2020-07-17 五邑大学 File classification collaboration robot and image character recognition method based on same

Family Cites Families (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104827459B (en) * 2015-04-07 2017-03-01 济南大学 A kind of intelligent finishing book robot and its method for sorting
CN206318532U (en) * 2016-11-02 2017-07-11 孔庆周 A kind of file ordering collating unit in bulk
CN107609549B (en) * 2017-09-20 2021-01-08 北京工业大学 Text detection method for certificate image in natural scene
CN108000525B (en) * 2017-10-23 2021-10-15 广东数相智能科技有限公司 Book management robot and book misplacement recognition method based on robot
CN110378350A (en) * 2019-07-23 2019-10-25 中国工商银行股份有限公司 A kind of method, apparatus and system of Text region

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20150023590A1 (en) * 2013-07-16 2015-01-22 National Taiwan University Of Science And Technology Method and system for human action recognition
CN109325493A (en) * 2018-08-23 2019-02-12 厦门理工学院 A kind of character recognition method and anthropomorphic robot based on anthropomorphic robot
CN109807883A (en) * 2018-12-17 2019-05-28 广州正瀚材料科技有限公司 A kind of archiving method of file
CN109650021A (en) * 2018-12-27 2019-04-19 安徽清芝商用机器有限公司 A kind of slim file accessing transportation robot device and its application method
CN110369296A (en) * 2019-06-25 2019-10-25 安徽永顺信息科技有限公司 A kind of office document clearing system and its reminding method
CN110414519A (en) * 2019-06-27 2019-11-05 众安信息技术服务有限公司 A kind of recognition methods of picture character and its identification device
CN111428710A (en) * 2020-03-16 2020-07-17 五邑大学 File classification collaboration robot and image character recognition method based on same

Also Published As

Publication number Publication date
CN111428710A (en) 2020-07-17

Similar Documents

Publication Publication Date Title
WO2021184692A1 (en) Document classification collaborative robot and image character recognition method based thereon
CN109993160B (en) Image correction and text and position identification method and system
US8494273B2 (en) Adaptive optical character recognition on a document with distorted characters
WO2018233038A1 (en) Deep learning-based method, apparatus and device for recognizing license plate, and storage medium
Singh et al. Offline script identification from multilingual indic-script documents: a state-of-the-art
CN111695392B (en) Face recognition method and system based on cascade deep convolutional neural network
CN110008909A (en) A kind of real-time audit system of system of real name business based on AI
WO2021208617A1 (en) Method and apparatus for recognizing station entering and exiting, terminal, and storage medium
CN105426884A (en) Fast document type recognition method based on full-sized feature extraction
CN112036406B (en) Text extraction method and device for image document and electronic equipment
CN112507800A (en) Pedestrian multi-attribute cooperative identification method based on channel attention mechanism and light convolutional neural network
CN112883980B (en) Data processing method and system
CN113221735A (en) Multimodal-based scanned part paragraph structure restoration method and device and related equipment
Panda et al. Odia offline typewritten character recognition using template matching with unicode mapping
WO2020019457A1 (en) User instruction matching method and apparatus, computer device, and storage medium
CN113780276A (en) Text detection and identification method and system combined with text classification
CN109388935A (en) Document verification method and device, electronic equipment and readable storage medium storing program for executing
CN110991279B (en) Document Image Analysis and Recognition Method and System
CN115830620B (en) Archive text data processing method and system based on OCR
Ovodov Optical Braille recognition using object detection CNN
Ajao et al. Yoruba handwriting word recognition quality evaluation of preprocessing attributes using information theory approach
Zhi et al. A Hybrid Framework for Text Recognition Used in Commodity Futures Document Verification
WO2021068682A1 (en) Method and apparatus for intelligently filtering table text, and computer-readable storage medium
CN106780637A (en) A kind of fast parallel image thinning algorithm based on pulse nerve membranous system
CN112329695A (en) Dynamic handwriting recognition method based on intelligent blackboard

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 20925471

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 20925471

Country of ref document: EP

Kind code of ref document: A1