CN110858307B - Character recognition model training method and device and character recognition method and device



Publication number
CN110858307B
Authority
CN
China
Prior art keywords
content
sample
character recognition
model
recognition
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201810973521.3A
Other languages
Chinese (zh)
Other versions
CN110858307A (en)
Inventor
江建军
郑凯
段立新
李建丽
Current Assignee
Guoxin Youe Data Co Ltd
Original Assignee
Guoxin Youe Data Co Ltd
Priority date
Filing date
Publication date
Application filed by Guoxin Youe Data Co Ltd
Priority to CN201810973521.3A
Publication of CN110858307A
Application granted
Publication of CN110858307B

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/10Character recognition
    • G06V30/14Image acquisition
    • G06V30/148Segmentation of character regions
    • G06V30/153Segmentation of character regions using recognition of characters or words
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/21Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
    • G06F18/214Generating training patterns; Bootstrap methods, e.g. bagging or boosting
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/10Character recognition

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Data Mining & Analysis (AREA)
  • General Physics & Mathematics (AREA)
  • Physics & Mathematics (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Evolutionary Biology (AREA)
  • Evolutionary Computation (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • General Engineering & Computer Science (AREA)
  • Artificial Intelligence (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Multimedia (AREA)
  • Character Discrimination (AREA)
  • Image Analysis (AREA)

Abstract

The application provides a character recognition model training method and device and a character recognition method and device. The training method comprises the following steps: acquiring a sample image, wherein the sample image comprises a plurality of sample contents; determining content feature information of each sample content and a relationship probability matrix between the sample characters contained in that sample content; and training the character recognition model by taking the content feature information of each sample content and the relationship probability matrix between the sample characters contained in the sample content as input features of the character recognition model to be trained, and taking the sample characters contained in the sample content as its output results. With the method and device, the target characters in a target image can be recognized by utilizing the relationship probability matrix between the target characters contained in the target content, with high recognition accuracy and efficiency.

Description

Character recognition model training method and device and character recognition method and device
Technical Field
The application relates to the technical field of image-text processing, in particular to a character recognition model training method and device and a character recognition method and device.
Background
Optical character recognition (OCR) is a common image-based character recognition technology that recognizes optical characters in pictures and translates them into computer text through image processing and pattern recognition techniques.
In the related art, after a picture to be recognized is acquired, it can be recognized through an OCR recognition model, and the OCR recognition result of the acquired image is directly used as the final recognition result. However, learning character relationships directly from images is difficult, which results in low recognition accuracy and recognition efficiency for related character recognition schemes based on an OCR recognition model and, to a certain extent, limits the wide application of character recognition.
Disclosure of Invention
In view of the above, an object of the present application is to provide a method and an apparatus for training a character recognition model, and a method and an apparatus for recognizing characters, so as to improve accuracy and efficiency of character recognition.
In a first aspect, an embodiment of the present application provides a method for training a character recognition model, including:
acquiring a sample image; wherein the sample image comprises a plurality of sample contents;
confirming content characteristic information of each sample content and a relation probability matrix between sample characters contained in the sample content;
and taking the content characteristic information of each sample content and a relation probability matrix between sample characters contained in the sample content as input characteristics of the character recognition model to be trained, taking the sample characters contained in the sample content as output results of the character recognition model to be trained, and training to obtain the character recognition model.
With reference to the first aspect, the present application provides a first possible implementation manner of the first aspect, where the training of the character recognition model by using the content feature information of each sample content and the relationship probability matrix between the sample characters included in the sample content as the input features of the character recognition model to be trained and using the sample characters included in the sample content as the output result of the character recognition model to be trained includes:
sequentially taking the determined content characteristic information of each sample content as the input characteristic of a first character recognition submodel, taking the corresponding sample content as the output result of the first character recognition submodel, and training the first character recognition submodel to obtain an initial training first character recognition submodel;
for each sample content in the sample image, taking content feature information of the sample content as input of the initial training first character recognition submodel to obtain recognition content, taking the recognition content as input feature of a second character recognition submodel, taking a relation probability matrix between recognition characters contained in the recognition content as an output result of the second character recognition submodel, and training the second character recognition submodel;
for each sample content in the sample image, taking the content characteristic information of the sample content and a relation probability matrix obtained by identifying the identification content corresponding to the sample content by the trained second character identification submodel as the input characteristic of the initially trained first character identification submodel, and retraining the initially trained first character identification submodel again; the character recognition model comprises a retrained initial training first character recognition submodel and a trained second character recognition submodel.
With reference to the first possible implementation manner of the first aspect, the present application provides a second possible implementation manner of the first aspect, where the taking the recognition content as an input feature of a second character recognition submodel and taking a relationship probability matrix between recognition characters included in the recognition content as an output result of the second character recognition submodel includes:
for each sample content in the sample image, extracting a character coding matrix of a recognition character contained in the recognition content from the recognition content obtained by recognizing the sample content by the initial training first character recognition sub-model;
and taking the character encoding matrix of the extracted recognition character as the input characteristic of a second character recognition submodel, and taking the relation probability matrix between the recognition characters contained in the recognition content as the output result of the second character recognition submodel.
With reference to the second possible implementation manner of the first aspect, the present application provides a third possible implementation manner of the first aspect, where after training the second character recognition submodel and before retraining the initially trained first character recognition submodel, the method further includes:
for each sample content in the sample image, determining an image area of the sample image corresponding to the sample content;
based on the size of the determined image area, expanding a relation probability matrix between identification characters contained in identification content corresponding to the sample content to obtain an expanded relation probability matrix;
retraining the initially trained first character recognition submodel, including:
and, for each sample content in the sample image, retraining the initially trained first character recognition submodel by taking, as its input features, the content feature information of the sample content and the expanded relationship probability matrix obtained by expanding the relationship probability matrix produced by the trained second character recognition submodel for the recognition content corresponding to the sample content.
In a second aspect, the present application further provides a method for recognizing a character based on a character recognition model trained in any one of the first aspect and the first possible implementation manner to the third possible implementation manner of the first aspect, including:
acquiring a target image; wherein the target image comprises a plurality of target contents;
confirming content characteristic information of each target content and a relation probability matrix between target characters contained in the target content;
and inputting the content characteristic information of each target content and a relation probability matrix between target characters contained in the target content into the character recognition model, and recognizing to obtain the target characters contained in the target content.
In a third aspect, the present application further provides a character recognition model training apparatus, including:
the image acquisition module is used for acquiring a sample image; wherein the sample image comprises a plurality of sample contents;
the information confirming module is used for confirming the content characteristic information of each sample content and a relation probability matrix between sample characters contained in the sample content;
and the model training module is used for taking the content characteristic information of each sample content and a relation probability matrix between sample characters contained in the sample content as input characteristics of the character recognition model to be trained, taking the sample characters contained in the sample content as output results of the character recognition model to be trained, and training to obtain the character recognition model.
With reference to the third aspect, the present application provides a first possible implementation manner of the third aspect, wherein the model training module includes:
the first sub-model training unit is used for sequentially taking the determined content characteristic information of each sample content as the input characteristic of a first character recognition sub-model, taking the corresponding sample content as the output result of the first character recognition sub-model, and training the first character recognition sub-model to obtain an initial training first character recognition sub-model;
a second sub-model training unit, configured to, for each sample content in the sample image, use content feature information of the sample content as an input of the initial training first character recognition sub-model to obtain recognition content, use the recognition content as an input feature of a second character recognition sub-model, use a relationship probability matrix between recognition characters included in the recognition content as an output result of the second character recognition sub-model, and train the second character recognition sub-model;
the first sub-model training unit is further used for retraining the initial training first character recognition sub-model by taking the content feature information of the sample content and a relation probability matrix obtained by recognizing the recognition content corresponding to the sample content by the trained second character recognition sub-model as the input feature of the initial training first character recognition sub-model aiming at each sample content in the sample image; the character recognition model comprises a retrained initial training first character recognition submodel and a trained second character recognition submodel.
With reference to the first possible implementation manner of the third aspect, the present application provides a second possible implementation manner of the third aspect, wherein the second submodel training unit is specifically configured to:
for each sample content in the sample image, extracting a character coding matrix of a recognition character contained in the recognition content from the recognition content obtained by recognizing the sample content by the initial training first character recognition sub-model;
and taking the character coding matrix of the extracted recognition characters as the input characteristic of a second character recognition submodel, and taking a relation probability matrix between the recognition characters contained in the recognition content as the output result of the second character recognition submodel.
With reference to the second possible implementation manner of the third aspect, the present application provides a third possible implementation manner of the third aspect, where the method further includes:
the matrix expansion module is used for, for each sample content in the sample image, determining the image area of the sample image corresponding to the sample content, and expanding, based on the size of the determined image area, the relationship probability matrix between the recognition characters contained in the recognition content corresponding to the sample content, to obtain an expanded relationship probability matrix;
the first sub-model training unit is specifically configured to, for each sample content in the sample image, retrain the initially trained first character recognition submodel by taking, as its input features, the content feature information of the sample content and the expanded relationship probability matrix obtained by expanding the relationship probability matrix produced by the trained second character recognition submodel for the recognition content corresponding to the sample content.
In a fourth aspect, an embodiment of the present application further provides an apparatus for recognizing a character based on a character recognition model trained by any one of the third aspect and the first possible implementation manner to the third possible implementation manner of the third aspect, including:
the image acquisition module is used for acquiring a target image; wherein the target image comprises a plurality of target contents;
the information confirmation module is used for confirming the content characteristic information of each target content and a relation probability matrix between target characters contained in the target content;
and the character recognition module is used for inputting the content characteristic information of each target content and a relation probability matrix between target characters contained in the target content into the character recognition model, and recognizing to obtain the target characters contained in the target content.
In the above scheme provided by the embodiment of the application, the content feature information of the sample content included in the sample image and the relationship probability matrix between the sample characters included in the sample content are used as the input features of the character recognition model to be trained, and the sample characters included in the sample content are used as the output results of the character recognition model to be trained, so as to train and obtain the character recognition model. The character recognition model trained by the scheme can recognize the target characters in the target image by utilizing the relation probability matrix between the target characters contained in the target content, and the recognition accuracy and efficiency are high.
In order to make the aforementioned objects, features and advantages of the present application more comprehensible, preferred embodiments accompanied with figures are described in detail below.
Drawings
In order to more clearly illustrate the technical solutions of the embodiments of the present application, the drawings that are required to be used in the embodiments will be briefly described below, it should be understood that the following drawings only illustrate some embodiments of the present application and therefore should not be considered as limiting the scope, and for those skilled in the art, other related drawings can be obtained from the drawings without inventive effort.
FIG. 1 is a flow chart illustrating a method for training a character recognition model according to an embodiment of the present disclosure;
FIG. 2 is a flow chart illustrating another method for training a character recognition model provided by an embodiment of the present application;
FIG. 3 is a flow chart illustrating a method for recognizing characters provided by an embodiment of the present application;
FIG. 4 is a schematic structural diagram illustrating a training apparatus for a character recognition model according to an embodiment of the present application;
fig. 5 is a schematic structural diagram illustrating an apparatus for recognizing characters according to an embodiment of the present application;
FIG. 6 is a schematic structural diagram of a computer device provided in an embodiment of the present application;
fig. 7 shows a schematic structural diagram of another computer device provided in an embodiment of the present application.
Detailed Description
In order to make the objects, technical solutions and advantages of the embodiments of the present application clearer, the technical solutions in the embodiments of the present application will be clearly and completely described below with reference to the drawings in the embodiments of the present application, and it is obvious that the described embodiments are only a part of the embodiments of the present application, and not all the embodiments. The components of the embodiments of the present application, generally described and illustrated in the figures herein, can be arranged and designed in a wide variety of different configurations. Thus, the following detailed description of the embodiments of the present application, presented in the accompanying drawings, is not intended to limit the scope of the claimed application, but is merely representative of selected embodiments of the application. All other embodiments, which can be derived by a person skilled in the art from the embodiments of the present application without making any creative effort, shall fall within the protection scope of the present application.
Because learning character relationships from images is difficult, related technical schemes performing character recognition with an OCR recognition model have low recognition accuracy and recognition efficiency. In view of this, an embodiment of the present application provides a character recognition model training method to improve the accuracy and efficiency of character recognition, as described in the following embodiments.
As shown in fig. 1, a flowchart of a character recognition model training method provided in an embodiment of the present application is provided, where an execution subject of the method may be a computer device, and the training method includes the following steps:
s101, obtaining a sample image; wherein, the sample image comprises a plurality of sample contents.
Here, a sample image needs to be acquired in advance. The sample image may be in a format such as JPG, PNG, GIF, BMP, or DOC, and may include a plurality of sample contents. A sample content may be a word (such as a character or a phrase), a number, a mathematical formula, or the like, which is not limited in the embodiment of the present application.
S102, confirming content characteristic information of each sample content and a relation probability matrix between sample characters contained in the sample content.
Before identifying the sample content in the sample image, the sample content (for example, the area where the text content is located) may first be found in the sample image, and the corresponding text area separated from it, so that feature extraction can be performed on the image area containing only the sample content to obtain the corresponding content feature information. Considering that a related technical scheme performing character recognition with an OCR recognition model is limited by the difficulty of learning character relationships from images, the embodiment of the present application performs recognition by combining the content feature information with a previously determined relationship probability matrix between the sample characters contained in the sample content. That is, the character recognition model training method provided by the embodiment of the present application learns the character relationships through the relationship probability matrix corresponding to the sample content, avoiding learning them directly from the image, and thereby improves recognition efficiency while ensuring recognition accuracy.
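As an illustrative sketch of the idea above (the bigram estimator and the alphabet are assumptions, since the application does not specify how the relationship probability matrix is computed), such a matrix can be estimated from labelled sample text:

```python
def relation_probability_matrix(text, alphabet):
    """Estimate P(next char | current char) from a labelled sample string.

    A hypothetical stand-in for the relationship probability matrix;
    bigram frequencies are used purely for illustration.
    """
    idx = {ch: i for i, ch in enumerate(alphabet)}
    n = len(alphabet)
    counts = [[0.0] * n for _ in range(n)]
    for a, b in zip(text, text[1:]):
        if a in idx and b in idx:
            counts[idx[a]][idx[b]] += 1.0
    # Row-normalise; fall back to a uniform row for unseen characters.
    for row in counts:
        total = sum(row)
        row[:] = [v / total for v in row] if total else [1.0 / n] * n
    return counts

matrix = relation_probability_matrix("learning", "aegilnr")
```

Each row of the result is a probability distribution over the character that follows, which is the form of pairwise association the training step below consumes.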
S103, taking the content characteristic information of each sample content and the relation probability matrix between sample characters contained in the sample content as input characteristics of the character recognition model to be trained, taking the sample characters contained in the sample content as output results of the character recognition model to be trained, and training to obtain the character recognition model.
In the character recognition model training stage, the content feature information of each sample content confirmed in S102 and the relationship probability matrix between the sample characters included in the sample content are used as the input features of the character recognition model to be trained, and the sample characters included in the sample content are used as the output results of the character recognition model to be trained, so as to obtain the parameter information of the character recognition model through training, that is, obtain the trained character recognition model. Thus, the target characters in the target image can be recognized through the trained character recognition model.
In a specific implementation, the character recognition model may be implemented by a combination of a first character recognition submodel and a second character recognition submodel. As shown in fig. 2, the training process of the character recognition model specifically includes the following steps:
s201, sequentially taking the determined content characteristic information of each sample content as an input characteristic of a first character recognition submodel, taking the corresponding sample content as an output result of the first character recognition submodel, and training the first character recognition submodel to obtain an initial training first character recognition submodel;
s202, aiming at each sample content in a sample image, taking content characteristic information of the sample content as input of an initial training first character recognition submodel to obtain recognition content, taking the recognition content as input characteristics of a second character recognition submodel, taking a relation probability matrix between recognition characters contained in the recognition content as an output result of the second character recognition submodel, and training the second character recognition submodel;
s203, aiming at each sample content in the sample image, taking the content characteristic information of the sample content and a relation probability matrix obtained by identifying the identification content corresponding to the sample content by the trained second character identification submodel as the input characteristic of the initially trained first character identification submodel, and retraining the initially trained first character identification submodel again; the character recognition model comprises a retrained initial training first character recognition submodel and a trained second character recognition submodel.
Here, the first character recognition submodel is configured to map content feature information of each sample content to corresponding sample content, the second character recognition submodel is configured to map recognition content output by the first character recognition submodel on the sample content to a relationship probability matrix between recognition characters included in the recognition content, and the relationship probability matrix recognized by the second character recognition submodel may be combined with the content feature information of the sample content to serve as an input feature of the first character recognition submodel to train the first character recognition submodel again, so as to improve accuracy of recognition of sample characters included in the sample content.
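The three phases S201 to S203 (initial training, relationship-matrix training, retraining with augmented features) can be orchestrated as in the following sketch. `MemorisingModel` is a deliberately trivial stand-in for the OCR and LSTM submodels, and the sample structure and `fit`/`predict` interface are assumptions, not part of the application:

```python
class MemorisingModel:
    """Trivial stand-in for a submodel: memorises its training pairs.
    Real submodels would be OCR / LSTM networks."""
    def fit(self, inputs, outputs):
        self.table = {repr(i): o for i, o in zip(inputs, outputs)}

    def predict(self, x):
        return self.table[repr(x)]


def train_character_recognition_model(samples, first_model, second_model):
    """Orchestrates training phases S201-S203 (sketch)."""
    # S201: initial training of the first submodel on content features.
    first_model.fit([s["features"] for s in samples],
                    [s["content"] for s in samples])
    # S202: train the second submodel to map recognised content to a
    # relationship probability matrix.
    recognised = [first_model.predict(s["features"]) for s in samples]
    second_model.fit(recognised, [s["relation_matrix"] for s in samples])
    # S203: retrain the first submodel with the relationship probability
    # matrix appended to the content feature information.
    for s, r in zip(samples, recognised):
        s["augmented"] = list(s["features"]) + list(second_model.predict(r))
    first_model.fit([s["augmented"] for s in samples],
                    [s["content"] for s in samples])
    return first_model, second_model


samples = [{"features": [0.1, 0.9], "content": "hi",
            "relation_matrix": [0.5, 0.5]}]
first, second = train_character_recognition_model(
    samples, MemorisingModel(), MemorisingModel())
```

The point of the sketch is the data flow: the second submodel's output re-enters the first submodel's input features in the final phase.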
For the initial training of the first character recognition submodel, an existing optical character recognition (OCR) model, such as a random forest, a support vector machine (SVM), or a neural network (NN) model, may be used. In this way, based on the trained first character recognition submodel, the recognition content corresponding to each sample content (i.e., each image region) in the sample image can be obtained.
For training the second character recognition submodel, it mainly relies on the above-mentioned recognition content to directly train a relationship probability matrix between the recognition characters contained in the recognition content. The relationship probability matrix characterizes the degree of association between characters: for example, for a character sequence such as "lea?ning", the second character recognition submodel of the embodiment of the present application should, as far as possible, recognize the uncertain character '?' as 'r' rather than 'a' or other letters.
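A minimal sketch of this disambiguation step (the alphabet, the probability values, and the helper function are illustrative assumptions): given a row-stochastic relationship probability matrix, an uncertain character can be resolved from its neighbour by taking the most probable successor.

```python
def most_likely_next(matrix, alphabet, prev_char):
    """Return the character most strongly associated with prev_char
    according to a row-stochastic relationship probability matrix."""
    row = matrix[alphabet.index(prev_char)]
    return alphabet[row.index(max(row))]

alphabet = "aeglnr"
# Illustrative P(next | prev) rows, as might be estimated from text
# such as "learning"; only the 'a' row matters for this example.
matrix = [
    [0.05, 0.05, 0.05, 0.05, 0.10, 0.70],  # 'a' -> most likely 'r'
    [0.80, 0.05, 0.05, 0.05, 0.03, 0.02],  # 'e' -> most likely 'a'
    [0.20, 0.20, 0.15, 0.15, 0.15, 0.15],
    [0.20, 0.30, 0.10, 0.10, 0.20, 0.10],
    [0.10, 0.60, 0.10, 0.05, 0.05, 0.10],
    [0.10, 0.10, 0.10, 0.10, 0.60, 0.00],  # 'r' -> most likely 'n'
]
guess = most_likely_next(matrix, alphabet, "a")  # resolves "lea?ning" to 'r'
```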
The embodiment of the application aims to directly attach the relation probability matrix of the recognition characters included in the recognition content to the input characteristics of the first character recognition submodel for retraining, so that the problems of low efficiency and strong limitation caused by directly learning the image-character relation are avoided as much as possible, and the recognition accuracy and efficiency are high.
In a specific implementation, the second character recognition submodel maps an input sequence (i.e., the recognition content obtained by the initially trained first character recognition submodel from the sample content) to an output matrix (i.e., a relationship probability matrix between the recognition characters contained in the recognition content). The embodiment of the present application can adopt a special type of recurrent neural network (RNN), the long short-term memory (LSTM) network, for model training. That is, in the embodiment of the present application, the LSTM network gradually masters the required knowledge through repeated iterative learning and finally learns how to generate a relationship probability matrix meeting the requirements from the recognition content.
Here, the LSTM network described above can be described by the following equations (1) to (5):

$i_t = \sigma(W_{xi} x_t + W_{hi} h_{t-1} + w_{ci} \odot c_{t-1} + b_i)$ (1)

$f_t = \sigma(W_{xf} x_t + W_{hf} h_{t-1} + w_{cf} \odot c_{t-1} + b_f)$ (2)

$c_t = f_t \odot c_{t-1} + i_t \odot \tanh(W_{xc} x_t + W_{hc} h_{t-1} + b_c)$ (3)

$o_t = \sigma(W_{xo} x_t + W_{ho} h_{t-1} + w_{co} \odot c_{t-1} + b_o)$ (4)

$h_t = o_t \odot \tanh(c_t)$ (5)

where $W_{xi}, W_{hi}, w_{ci}, b_i$ are the input gate parameters; $W_{xo}, W_{ho}, w_{co}, b_o$ are the output gate parameters; $W_{xf}, W_{hf}, w_{cf}, b_f$ are the forget gate parameters; and $W_{xc}, W_{hc}, b_c$ are the parameters associated with the input and the cell state, which can directly modify the memory cell. The symbol $\odot$ denotes element-wise multiplication. The gating units are implemented by multipliers, so their range is $[0, 1]$, corresponding to sigmoid nonlinear functions. We define $p_{lstm} = \{W_{xi}, W_{hi}, w_{ci}, b_i, W_{xo}, W_{ho}, w_{co}, b_o, W_{xf}, W_{hf}, w_{cf}, b_f, W_{xc}, W_{hc}, b_c\}$ as the merged parameter set of the LSTM network.
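Equations (1) to (5) above can be sketched directly in code; the hidden/input sizes and zero initialisation below are illustrative only, not prescribed by the application:

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def lstm_step(x_t, h_prev, c_prev, p):
    """One step of the peephole LSTM of equations (1)-(5); `p` holds
    the parameter set p_lstm. Shapes are illustrative."""
    i_t = sigmoid(p["W_xi"] @ x_t + p["W_hi"] @ h_prev + p["w_ci"] * c_prev + p["b_i"])  # (1)
    f_t = sigmoid(p["W_xf"] @ x_t + p["W_hf"] @ h_prev + p["w_cf"] * c_prev + p["b_f"])  # (2)
    c_t = f_t * c_prev + i_t * np.tanh(p["W_xc"] @ x_t + p["W_hc"] @ h_prev + p["b_c"])  # (3)
    o_t = sigmoid(p["W_xo"] @ x_t + p["W_ho"] @ h_prev + p["w_co"] * c_prev + p["b_o"])  # (4)
    h_t = o_t * np.tanh(c_t)                                                             # (5)
    return h_t, c_t

# Zero-initialised parameters for hidden size 2 and input size 3.
d_h, d_x = 2, 3
p_lstm = {}
for gate in "ifco":
    p_lstm[f"W_x{gate}"] = np.zeros((d_h, d_x))
    p_lstm[f"W_h{gate}"] = np.zeros((d_h, d_h))
    p_lstm[f"b_{gate}"] = np.zeros(d_h)
for gate in "ifo":
    p_lstm[f"w_c{gate}"] = np.zeros(d_h)

h, c = lstm_step(np.ones(d_x), np.zeros(d_h), np.zeros(d_h), p_lstm)
```

With all parameters zero, the gates evaluate to 0.5 and the candidate cell input to zero, so both the new cell state and hidden state stay zero.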
In addition, the LSTM network may further include a fully connected layer, which adjusts the relationship probability matrix. The transformation can be computed as

y_i = α(W h_i + b)

where W ∈ R^{n×d} is a weight matrix, b ∈ R^{n×d} is the offset, and α is the activation function (softmax). The output is expressed as:

p_{i,j} = exp(x_{i,j}) / Σ_j exp(x_{i,j})

where x_{i,j} is the element at index (i, j) of the matrix produced by the linear transformation, and the output of the current layer can be viewed as a probability matrix over the input sentence.
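The fully connected layer followed by a row-wise softmax can be sketched as below; shapes and inputs are illustrative assumptions. Each row of the result is a probability distribution, matching the description of the output as a probability matrix.

```python
import numpy as np

def fully_connected_softmax(H, W, b):
    """Linear transformation followed by a row-wise softmax: each row of
    the LSTM output H becomes a probability distribution (one row of the
    relationship probability matrix). Shapes are illustrative assumptions."""
    X = H @ W + b                            # linear transformation
    X = X - X.max(axis=1, keepdims=True)     # subtract row max for numerical stability
    E = np.exp(X)
    return E / E.sum(axis=1, keepdims=True)  # softmax per row

rng = np.random.default_rng(1)
P = fully_connected_softmax(rng.standard_normal((5, 8)),
                            rng.standard_normal((8, 10)),
                            np.zeros(10))
```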
In addition, after the recognition content is obtained by recognizing the sample content with the initially trained first character recognition submodel, a word-representation method such as word2vec can be used to convert the recognition content, as natural language, into digital information in vector form for machine recognition; this process is called encoding (Encoder). That is, a semantic vector is used to represent each word, and the semantic vector is then used as an input feature of the second character recognition submodel. The semantic vectors can be obtained by using a One-hot Representation word representation model. That is, in the embodiment of the present application, a very long vector may be used to represent a word: the length of the vector is the vocabulary size N of the dictionary, each vector has exactly one dimension equal to 1 with the remaining dimensions all 0, and the position of the 1 represents the position of the word in the dictionary. In other words, the word representation model stores word information in a sparse manner (each word is assigned a numeric identifier), and the representation form is relatively simple. Thus, a character encoding matrix can be associated with each sample content. According to the embodiment of the application, the character encoding matrix of the extracted recognition characters can be used as the input feature of the second character recognition submodel, and the relationship probability matrix between the recognition characters contained in the recognition content can be used as the output result of the second character recognition submodel, to train the second character recognition submodel.
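The one-hot character encoding matrix described above can be sketched as follows; the four-character dictionary is a toy assumption for illustration.

```python
def one_hot_encode(text, dictionary):
    """Build a character encoding matrix: one row per character, with a
    single 1 at the character's position in the dictionary and 0 elsewhere.
    The dictionary here is a toy example, not one defined by the patent."""
    index = {ch: i for i, ch in enumerate(dictionary)}
    n = len(dictionary)
    matrix = []
    for ch in text:
        row = [0] * n          # all dimensions 0 ...
        row[index[ch]] = 1     # ... except the character's dictionary position
        matrix.append(row)
    return matrix

m = one_hot_encode("but", "abtu")
```

With the dictionary "abtu", the character 'b' maps to row [0, 1, 0, 0], 'u' to [0, 0, 0, 1], and 't' to [0, 0, 1, 0]: a sparse representation in which each row sums to 1.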
According to the embodiment of the application, after the second character recognition submodel is trained and before the initially trained first character recognition submodel is retrained, the relationship probability matrix obtained by recognizing the recognition content corresponding to the sample content with the trained second character recognition submodel can be expanded to obtain an expanded relationship probability matrix. Both the expanded relationship probability matrix and the content feature information of the sample content are then used as input features of the initially trained first character recognition submodel, and the initially trained first character recognition submodel is retrained, which ensures the robustness of model training.
When the relationship probability matrix corresponding to any sample content is expanded, the expansion can be performed based on the size of the image area of the sample image corresponding to that sample content. In the embodiment of the application, small, meaningless labels can be inserted into the relationship probability matrices until they are expanded to the same width (in pixels) as the image area; the meaningless labels can later be filtered out, which weakens the length difference between the input and the output. For example, the sequence '-bb-u-tt' would be converted to 'but', where '-' is the meaningless label.
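Both directions of this process, expanding a label sequence to the pixel width of the image area and filtering the meaningless labels back out, can be sketched as below. The interleaving-and-padding rule is a simplified assumption; the collapse step reproduces the '-bb-u-tt' to 'but' example from the text.

```python
BLANK = "-"  # the meaningless (blank) label from the text

def expand_to_width(labels, width):
    """Pad a label sequence with blank labels until its length matches the
    pixel width of the image area. Interleaving blanks between labels first
    is a simplified sketch, not the patent's exact expansion rule."""
    out = []
    for ch in labels:
        out.extend([BLANK, ch])
    out.append(BLANK)
    while len(out) < width:
        out.append(BLANK)
    return "".join(out[:width])

def collapse(sequence):
    """Filter the meaningless labels back out: merge consecutive repeats,
    then drop the blanks, e.g. '-bb-u-tt' -> 'but'."""
    result = []
    prev = None
    for ch in sequence:
        if ch != prev and ch != BLANK:
            result.append(ch)
        prev = ch
    return "".join(result)
```

Collapsing is the inverse of expansion here: expanding 'but' to any width of at least 7 and collapsing the result recovers 'but'.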
Based on the character recognition model obtained by training in the above embodiment, an embodiment of the present application further provides a method for recognizing a character, as shown in fig. 3, which is a flowchart of the method for recognizing a character provided in the embodiment of the present application, and is applied to a computer device, where the method for recognizing a character includes the following steps:
s301, acquiring a target image; wherein the target image comprises a plurality of target contents;
s302, confirming content characteristic information of each target content and a relation probability matrix between target characters contained in the target content;
and S303, inputting the content characteristic information of each target content and the relation probability matrix between the target characters contained in the target content into a character recognition model, and recognizing to obtain the target characters contained in the target content.
Here, in the embodiment of the present application, the content feature information of each target content and the relationship probability matrix between the target characters included in the target content are input to the trained character recognition model, so that the target characters included in the target content can be recognized and obtained. The process of using the character recognition model is similar to the training process, the first character recognition submodel can be used for generating the relation probability matrix, and then the second character recognition submodel is used for recognizing the characters, so that the recognition accuracy and efficiency are ensured.
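Steps S301 to S303 can be sketched end to end as follows. Every function argument is a hypothetical callable standing in for a component the patent does not define in code (content localization, feature extraction, and the two trained submodels); this is an illustrative pipeline, not the patent's implementation.

```python
def recognize(target_image, first_submodel, second_submodel,
              extract_features, locate_contents):
    """Hypothetical sketch of steps S301-S303: all arguments are assumed
    callables, not APIs defined by the patent."""
    results = []
    for content in locate_contents(target_image):   # S301: target contents in the image
        features = extract_features(content)        # S302: content feature information
        draft = first_submodel(features)            # first-pass recognition content
        relation_matrix = second_submodel(draft)    # S302: relationship probability matrix
        # S303: final recognition conditioned on both input features
        results.append(first_submodel(features, relation_matrix))
    return results
```

With stub callables (e.g. a first submodel that ignores the optional relation matrix), the pipeline simply threads each located content through both submodels and returns one result per content.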
Based on the same inventive concept, the embodiment of the present application further provides a character recognition model training device corresponding to the character recognition model training method. Because the principle by which the device solves the problem is similar to that of the character recognition model training method in the embodiment of the present application, the implementation of the device can refer to the implementation of the method, and repeated details are not described again.
As shown in fig. 4, which is a schematic structural diagram of a character recognition model training apparatus provided in an embodiment of the present application, the character recognition model training apparatus includes:
an image acquisition module 401, configured to acquire a sample image; wherein, the sample image comprises a plurality of sample contents;
an information confirming module 402, configured to confirm content feature information of each sample content and a relationship probability matrix between sample characters included in the sample content;
the model training module 403 is configured to use the content feature information of each sample content and the relationship probability matrix between the sample characters included in the sample content as input features of the character recognition model to be trained, and use the sample characters included in the sample content as output results of the character recognition model to be trained, so as to obtain the character recognition model through training.
In one embodiment, model training module 403 includes:
the first sub-model training unit is used for sequentially taking the determined content characteristic information of each sample content as the input characteristic of the first character recognition sub-model, taking the corresponding sample content as the output result of the first character recognition sub-model, and training the first character recognition sub-model to obtain an initial training first character recognition sub-model;
the second submodel training unit is used for taking the content characteristic information of the sample content as the input of an initial training first character recognition submodel to obtain recognition content aiming at each sample content in a sample image, taking the recognition content as the input characteristic of a second character recognition submodel, taking a relation probability matrix between recognition characters contained in the recognition content as the output result of the second character recognition submodel, and training the second character recognition submodel;
the first sub-model training unit is further used for retraining the initially trained first character recognition sub-model by taking the content characteristic information of the sample content and a relation probability matrix obtained by recognizing the recognition content corresponding to the sample content by the trained second character recognition sub-model as the input characteristic of the initially trained first character recognition sub-model aiming at each sample content in the sample image; the character recognition model comprises a retrained initial training first character recognition submodel and a trained second character recognition submodel.
In another embodiment, the second sub-model training unit is specifically configured to:
aiming at each sample content in the sample image, extracting a character coding matrix of a recognition character contained in the recognition content from the recognition content obtained by recognizing the sample content through an initial training first character recognition sub-model;
and taking the character coding matrix of the extracted recognition characters as the input characteristic of the second character recognition submodel, and taking the relation probability matrix between the recognition characters contained in the recognition content as the output result of the second character recognition submodel.
In another embodiment, the character recognition model training apparatus further includes:
a matrix expansion module 404, configured to determine, for each sample content in the sample image, an image area of the sample image corresponding to the sample content; based on the size of the determined image area, expanding a relation probability matrix between identification characters contained in identification content corresponding to the sample content to obtain an expanded relation probability matrix;
and the first sub-model training unit is specifically configured to retrain the initially trained first character recognition sub-model by taking, for each sample content in the sample image, the content feature information of the sample content and the expanded relationship probability matrix, obtained by expanding the relationship probability matrix produced when the trained second character recognition sub-model recognizes the recognition content corresponding to the sample content, as the input features of the initially trained first character recognition sub-model.
Based on the same application concept, the embodiment of the present application further provides a device for recognizing characters corresponding to the method for recognizing characters. Because the principle by which the device solves the problem is similar to that of the method for recognizing characters in the embodiment of the present application, the implementation of the device can refer to the implementation of the method, and repeated details are not described again.
As shown in fig. 5, which is a schematic structural diagram of an apparatus for recognizing characters provided in an embodiment of the present application, the apparatus for recognizing characters includes:
an image obtaining module 501, configured to obtain a target image; the target image comprises a plurality of target contents;
an information confirming module 502, configured to confirm content feature information of each target content and a relationship probability matrix between target characters included in the target content;
the character recognition module 503 is configured to input the content feature information of each target content and the relationship probability matrix between the target characters included in the target content into the character recognition model, and recognize to obtain the target characters included in the target content.
As shown in fig. 6, a schematic structural diagram of a computer device provided in an embodiment of the present application is shown, where the computer device includes: a processor 601, a memory 602 and a bus 603, the memory 602 storing machine-readable instructions executable by the processor 601, the processor 601 and the memory 602 communicating via the bus 603 when the computer device is running, the machine-readable instructions when executed by the processor 601 performing the following:
acquiring a sample image; wherein, the sample image comprises a plurality of sample contents;
confirming content characteristic information of each sample content and a relation probability matrix between sample characters contained in the sample content;
and taking the content characteristic information of each sample content and a relation probability matrix between sample characters contained in the sample content as input characteristics of the character recognition model to be trained, taking the sample characters contained in the sample content as output results of the character recognition model to be trained, and training to obtain the character recognition model.
In one embodiment, in the processing executed by the processor 601, taking the content feature information of each sample content and the relationship probability matrix between the sample characters included in the sample content as the input features of the character recognition model to be trained, and taking the sample characters included in the sample content as the output result of the character recognition model to be trained, training the character recognition model to obtain a character recognition model, including:
sequentially taking the determined content characteristic information of each sample content as the input characteristic of the first character recognition submodel, taking the corresponding sample content as the output result of the first character recognition submodel, and training the first character recognition submodel to obtain an initial training first character recognition submodel;
aiming at each sample content in the sample image, taking the content characteristic information of the sample content as the input of an initial training first character recognition submodel to obtain recognition content, taking the recognition content as the input characteristic of a second character recognition submodel, taking a relation probability matrix between recognition characters contained in the recognition content as the output result of the second character recognition submodel, and training the second character recognition submodel;
for each sample content in the sample image, taking a relation probability matrix obtained by identifying the identification content corresponding to the sample content by the content characteristic information of the sample content and the trained second character identification submodel as the input characteristic of the initially trained first character identification submodel, and retraining the initially trained first character identification submodel again; the character recognition model comprises a retrained initial training first character recognition submodel and a trained second character recognition submodel.
In another embodiment, the processing executed by the processor 601, taking the recognition content as the input feature of the second character recognition submodel, and taking the relationship probability matrix between the recognition characters contained in the recognition content as the output result of the second character recognition submodel, includes:
aiming at each sample content in the sample image, extracting a character coding matrix of a recognition character contained in the recognition content from the recognition content obtained by recognizing the sample content through an initial training first character recognition sub-model;
and taking the character coding matrix of the extracted recognition characters as the input characteristic of the second character recognition submodel, and taking the relation probability matrix between the recognition characters contained in the recognition content as the output result of the second character recognition submodel.
In another embodiment, the processing executed by the processor 601, after the training of the second character recognition submodel, and before the retraining of the initially trained first character recognition submodel, further includes:
for each sample content in the sample image, determining an image area of the sample image corresponding to the sample content;
based on the size of the determined image area, expanding a relation probability matrix between identification characters contained in identification content corresponding to the sample content to obtain an expanded relation probability matrix;
the above-mentioned processing executed by the processor 601 is to train the initially trained first character recognition submodel again, and includes:
and for each sample content in the sample image, taking the content feature information of the sample content, together with the expanded relationship probability matrix obtained by expanding the relationship probability matrix produced when the trained second character recognition submodel recognizes the recognition content corresponding to the sample content, as the input features of the initially trained first character recognition submodel, and retraining the initially trained first character recognition submodel.
Fig. 7 is a schematic structural diagram of another computer device provided in an embodiment of the present application, where the computer device includes: a processor 701, a memory 702 and a bus 703, the memory 702 storing machine-readable instructions executable by the processor 701, the processor 701 and the memory 702 communicating via the bus 703 when the computer device is operating, the machine-readable instructions when executed by the processor 701 performing the following:
acquiring a target image; wherein the target image comprises a plurality of target contents;
confirming content characteristic information of each target content and a relation probability matrix between target characters contained in the target content;
and inputting the content characteristic information of each target content and a relation probability matrix between target characters contained in the target content into a character recognition model, and recognizing to obtain the target characters contained in the target content.
An embodiment of the present application further provides a computer-readable storage medium, where a computer program is stored on the computer-readable storage medium, and when the computer program is executed by the processor 601, the steps of the character recognition model training method are performed.
Specifically, the storage medium can be a general storage medium, such as a mobile disk, a hard disk, and the like, and when a computer program on the storage medium is run, the character recognition model training method can be executed, so that the problem that the recognition accuracy and the recognition efficiency of the related technical scheme for performing character recognition by adopting an OCR recognition model are low is solved, and the effect of improving the accuracy and the efficiency of character recognition is achieved.
An embodiment of the present application further provides a computer-readable storage medium, where a computer program is stored on the computer-readable storage medium, and when the computer program is executed by the processor 701, the steps of the method for recognizing a character are performed.
Specifically, the storage medium can be a general storage medium, such as a mobile disk, a hard disk, and the like, and when a computer program on the storage medium is run, the method for recognizing characters can be executed, so that the problem that the recognition accuracy and the recognition efficiency of the related technical scheme for performing character recognition by using an OCR recognition model are low is solved, and the effect of improving the accuracy and the efficiency of character recognition is achieved.
The computer program product of the character recognition model training method and the character recognition method provided in the embodiments of the present application includes a computer readable storage medium storing a program code, and instructions included in the program code may be used to execute the methods in the foregoing method embodiments.
It is clear to those skilled in the art that, for convenience and brevity of description, the specific working processes of the system and the apparatus described above may refer to the corresponding processes in the foregoing method embodiments, and are not described herein again.
The functions, if implemented in the form of software functional units and sold or used as a stand-alone product, may be stored in a computer readable storage medium. Based on such understanding, the technical solutions of the present application or portions thereof that substantially contribute to the prior art may be embodied in the form of a software product, which is stored in a storage medium and includes several instructions for causing a computer device (which may be a personal computer, a server, or a network device) to execute all or part of the steps of the methods according to the embodiments of the present application. And the aforementioned storage medium includes: a U-disk, a removable hard disk, a Read-Only Memory (ROM), a Random Access Memory (RAM), a magnetic disk or an optical disk, and other various media capable of storing program codes.
The above description is only for the specific embodiments of the present application, but the scope of the present application is not limited thereto, and any person skilled in the art can easily conceive of the changes or substitutions within the technical scope of the present application, and shall be covered by the scope of the present application. Therefore, the protection scope of the present application shall be subject to the protection scope of the claims.

Claims (10)

1. A character recognition model training method is characterized by comprising the following steps:
acquiring a sample image; wherein the sample image comprises a plurality of sample contents;
confirming content characteristic information of each sample content and a relation probability matrix between sample characters contained in the sample content; the content feature information of the sample content is obtained by feature extraction of an image area of the sample content, and a relationship probability matrix between sample characters contained in the sample content is used for representing the degree of association between the sample characters contained in the sample content;
and taking the content characteristic information of each sample content and a relation probability matrix between sample characters contained in the sample content as input characteristics of the character recognition model to be trained, taking the sample characters contained in the sample content as output results of the character recognition model to be trained, and training to obtain the character recognition model.
2. The method according to claim 1, wherein the training of the character recognition model by using the content feature information of each sample content and the relationship probability matrix between the sample characters contained in the sample content as the input features of the character recognition model to be trained and using the sample characters contained in the sample content as the output result of the character recognition model to be trained comprises:
sequentially taking the determined content characteristic information of each sample content as the input characteristic of a first character recognition submodel, taking the corresponding sample content as the output result of the first character recognition submodel, and training the first character recognition submodel to obtain an initial training first character recognition submodel;
for each sample content in the sample image, taking content feature information of the sample content as input of the initial training first character recognition submodel to obtain recognition content, taking the recognition content as input feature of a second character recognition submodel, taking a relation probability matrix between recognition characters contained in the recognition content as an output result of the second character recognition submodel, and training the second character recognition submodel;
for each sample content in the sample image, taking the content characteristic information of the sample content and a relation probability matrix obtained by identifying the identification content corresponding to the sample content by the trained second character identification submodel as the input characteristic of the initially trained first character identification submodel, and retraining the initially trained first character identification submodel again; the character recognition model comprises a retrained initial training first character recognition submodel and a trained second character recognition submodel.
3. The method according to claim 2, wherein the using the recognition content as an input feature of a second character recognition submodel and using a relationship probability matrix between recognition characters included in the recognition content as an output result of the second character recognition submodel comprises:
for each sample content in the sample image, extracting a character coding matrix of a recognition character contained in the recognition content from the recognition content obtained by recognizing the sample content by the initial training first character recognition sub-model;
and taking the character coding matrix of the extracted recognition characters as the input characteristic of a second character recognition submodel, and taking a relation probability matrix between the recognition characters contained in the recognition content as the output result of the second character recognition submodel.
4. The method of claim 3, wherein after training the second character recognition submodel and before retraining the initially trained first character recognition submodel, further comprising:
for each sample content in the sample image, determining an image area of the sample image corresponding to the sample content;
based on the size of the determined image area, expanding a relation probability matrix between identification characters contained in identification content corresponding to the sample content to obtain an expanded relation probability matrix;
retraining the initially trained first character recognition submodel, including:
and for each sample content in the sample image, taking the content feature information of the sample content, together with the expanded relationship probability matrix obtained by expanding the relationship probability matrix produced when the trained second character recognition submodel recognizes the recognition content corresponding to the sample content, as the input features of the initially trained first character recognition submodel, and re-training the initially trained first character recognition submodel.
5. A method for recognizing characters based on a character recognition model trained according to any one of claims 1 to 4, comprising:
acquiring a target image; wherein the target image comprises a plurality of target contents;
confirming content characteristic information of each target content and a relation probability matrix between target characters contained in the target content; the content feature information of the target content is obtained by feature extraction of an image area of the target content, and a relation probability matrix between target characters contained in the target content is used for representing the degree of association between the target characters contained in the target content;
and inputting the content characteristic information of each target content and a relation probability matrix between target characters contained in the target content into the character recognition model, and recognizing to obtain the target characters contained in the target content.
6. A character recognition model training apparatus, comprising:
the image acquisition module is used for acquiring a sample image; wherein the sample image comprises a plurality of sample contents;
the information confirming module is used for confirming the content characteristic information of each sample content and a relation probability matrix between sample characters contained in the sample content; the content feature information of the sample content is obtained by performing feature extraction on an image area of the sample content, and a relationship probability matrix between sample characters contained in the sample content is used for representing the degree of association between the sample characters contained in the sample content;
and the model training module is used for taking the content characteristic information of each sample content and a relation probability matrix between sample characters contained in the sample content as input characteristics of the character recognition model to be trained, taking the sample characters contained in the sample content as output results of the character recognition model to be trained, and training to obtain the character recognition model.
7. The apparatus of claim 6, wherein the model training module comprises:
the first sub-model training unit is used for sequentially taking the determined content characteristic information of each sample content as the input characteristic of a first character recognition sub-model, taking the corresponding sample content as the output result of the first character recognition sub-model, and training the first character recognition sub-model to obtain an initial training first character recognition sub-model;
a second sub-model training unit, configured to, for each sample content in the sample image, use content feature information of the sample content as an input of the initial training first character recognition sub-model to obtain recognition content, use the recognition content as an input feature of a second character recognition sub-model, use a relationship probability matrix between recognition characters included in the recognition content as an output result of the second character recognition sub-model, and train the second character recognition sub-model;
the first sub-model training unit is also used for retraining the initial training first character recognition sub-model by taking the content characteristic information of the sample content and a relation probability matrix obtained by recognizing the recognition content corresponding to the sample content by the trained second character recognition sub-model as the input characteristic of the initial training first character recognition sub-model aiming at each sample content in the sample image; the character recognition model comprises a retrained initial training first character recognition submodel and a trained second character recognition submodel.
8. The apparatus of claim 7, wherein the second submodel training unit is specifically configured to:
for each sample content in the sample image, extracting a character coding matrix of a recognition character contained in the recognition content from the recognition content obtained by recognizing the sample content by the initial training first character recognition sub-model;
and taking the character coding matrix of the extracted recognition characters as the input characteristic of a second character recognition submodel, and taking a relation probability matrix between the recognition characters contained in the recognition content as the output result of the second character recognition submodel.
9. The apparatus of claim 8, further comprising:
the matrix expansion module is used for determining, for each sample content in the sample image, the image area of the sample image corresponding to the sample content; and expanding, based on the size of the determined image area, a relationship probability matrix between recognition characters contained in the recognition content corresponding to the sample content to obtain an expanded relationship probability matrix;
the first sub-model training unit is specifically configured to retrain, for each sample content in the sample image, the initial training first character recognition sub-model again by using content feature information of the sample content and an extended relationship probability matrix obtained by extending an extended relationship probability matrix obtained by identifying, by the trained second character recognition sub-model, the recognition content corresponding to the sample content as input features of the initial training first character recognition sub-model.
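Claim 9 leaves the expansion rule unspecified; one plausible reading, used here purely as an illustration, is to repeat entries of the n-by-n relation matrix until it matches the height and width of the sample content's image area, so it can be stacked with image-sized features:

```python
import numpy as np

def expand_relation_matrix(rel, region_h, region_w):
    """Expand an n-by-n relation probability matrix to the size of the
    sample content's image area by nearest-neighbour repetition
    (an assumed rule; the patent only requires matching the area size)."""
    n = rel.shape[0]
    rows = np.repeat(np.arange(n), -(-region_h // n))[:region_h]  # ceil-div tiling
    cols = np.repeat(np.arange(n), -(-region_w // n))[:region_w]
    return rel[np.ix_(rows, cols)]

rel = np.array([[0.7, 0.3],
                [0.4, 0.6]])
expanded = expand_relation_matrix(rel, region_h=4, region_w=6)
```

With a 2-by-2 input and a 4-by-6 target area, each entry is duplicated into a 2-by-3 tile, giving an expanded matrix the same size as the image area.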
10. An apparatus for recognizing characters based on a character recognition model trained by the apparatus of any one of claims 6 to 9, comprising:
an image acquisition module, configured to acquire a target image, wherein the target image comprises a plurality of target contents;
an information determination module, configured to determine, for each target content, the content feature information of the target content and a relation probability matrix between the target characters contained in the target content; wherein the content feature information of the target content is obtained by performing feature extraction on the image area of the target content, and the relation probability matrix between the target characters contained in the target content represents the degree of association between those target characters;
and a character recognition module, configured to input the content feature information of each target content and the relation probability matrix between the target characters contained in the target content into the character recognition model, and recognize the target characters contained in the target content.
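The inference pipeline of claim 10 (acquire image, derive per-content features and a relation matrix, feed both to the trained model) can be sketched as follows. The mean-pooled features, uniform relation matrix, and bucketing "model" are all hypothetical placeholders standing in for the trained components:

```python
import numpy as np

rng = np.random.default_rng(1)

def extract_features(region):
    """Content feature information: here simply column-wise mean pooling
    of the target content's image area (a real system would use a CNN)."""
    return region.mean(axis=0)

def relation_matrix(n_chars):
    """Placeholder relation probability matrix between target characters;
    uniform association, since the true estimator is the trained model's."""
    return np.full((n_chars, n_chars), 1.0 / n_chars)

def recognize(region, n_chars, model):
    """Run the claim-10 pipeline for one target content."""
    feats = extract_features(region)
    rel = relation_matrix(n_chars)
    return model(feats, rel)

# Toy "character recognition model": buckets feature values into class ids.
toy_model = lambda feats, rel: (feats[:rel.shape[0]] * rel.shape[0]).astype(int)

region = rng.uniform(size=(32, 8))   # one target content's image area
chars = recognize(region, n_chars=4, model=toy_model)
```

The structure mirrors the three claimed modules: `extract_features` plays the information determination role, and `recognize` hands both inputs to the character recognition model.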
CN201810973521.3A 2018-08-24 2018-08-24 Character recognition model training method and device and character recognition method and device Active CN110858307B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201810973521.3A CN110858307B (en) 2018-08-24 2018-08-24 Character recognition model training method and device and character recognition method and device

Publications (2)

Publication Number Publication Date
CN110858307A CN110858307A (en) 2020-03-03
CN110858307B true CN110858307B (en) 2022-09-13

Family

ID=69636220

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201810973521.3A Active CN110858307B (en) 2018-08-24 2018-08-24 Character recognition model training method and device and character recognition method and device

Country Status (1)

Country Link
CN (1) CN110858307B (en)

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113496227A (en) * 2020-04-08 2021-10-12 顺丰科技有限公司 Training method and device of character recognition model, server and storage medium
CN111667066B (en) * 2020-04-23 2024-06-11 北京旷视科技有限公司 Training method and device of network model, character recognition method and device and electronic equipment
CN113885711A (en) * 2021-09-28 2022-01-04 济南大学 Character input method and device

Citations (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
AU2007291884A1 (en) * 2006-09-01 2008-03-06 Sensen Networks Group Pty Ltd Method and system of identifying one or more features represented in a plurality of sensor acquired data sets
CN102982330A (en) * 2012-11-21 2013-03-20 新浪网技术(中国)有限公司 Method and device recognizing characters in character images
CN103077389A (en) * 2013-01-07 2013-05-01 华中科技大学 Text detection and recognition method combining character level classification and character string level classification
CN105430021A (en) * 2015-12-31 2016-03-23 中国人民解放军国防科学技术大学 Encrypted traffic identification method based on load adjacent probability model
CN106778887A (en) * 2016-12-27 2017-05-31 努比亚技术有限公司 The terminal and method of sentence flag sequence are determined based on condition random field
CN106960206A (en) * 2017-02-08 2017-07-18 北京捷通华声科技股份有限公司 Character identifying method and character recognition system
CN106980856A (en) * 2016-01-15 2017-07-25 上海谦问万答吧云计算科技有限公司 Formula identification method and system and symbolic reasoning computational methods and system
WO2018024243A1 (en) * 2016-08-05 2018-02-08 腾讯科技(深圳)有限公司 Method and device for verifying recognition result in character recognition
CN108182437A (en) * 2017-12-29 2018-06-19 北京金堤科技有限公司 One kind clicks method for recognizing verification code, device and user terminal
CN108200034A (en) * 2017-12-27 2018-06-22 新华三信息安全技术有限公司 A kind of method and device for identifying domain name
CN108288078A (en) * 2017-12-07 2018-07-17 腾讯科技(深圳)有限公司 Character identifying method, device and medium in a kind of image
CN108345880A (en) * 2018-01-26 2018-07-31 金蝶软件(中国)有限公司 Invoice recognition methods, device, computer equipment and storage medium

Non-Patent Citations (4)

* Cited by examiner, † Cited by third party
Title
Recognition-Based Approach of Numeral Extraction in Handwritten Chemistry Documents Using Contextual Knowledge; Nabil Ghanmi et al.; 2016 12th IAPR Workshop on Document Analysis Systems (DAS); 2016-06-13; pp. 251-256 *
Statistical Structure Modeling and Optimal Combined Strategy Based Chinese Components Recognition; Bowen Yu et al.; 2012 Eighth International Conference on Signal Image Technology and Internet Based Systems; 2013-01-11; pp. 238-245 *
Research on Text Recognition Technology for Natural Scenes; Huang Shuxiao; China Masters' Theses Full-text Database, Information Science and Technology; 2018-04-15; vol. 2018, no. 4; I138-2502 *
Research on Segmentation and Recognition Algorithms for Bank Card Number Characters; Tu Yafei; China Masters' Theses Full-text Database, Information Science and Technology; 2018-01-15; vol. 2018, no. 1; I138-1694 *

Also Published As

Publication number Publication date
CN110858307A (en) 2020-03-03

Similar Documents

Publication Publication Date Title
CN112232149B (en) Document multimode information and relation extraction method and system
CN110110323B (en) Text emotion classification method and device and computer readable storage medium
CN110858307B (en) Character recognition model training method and device and character recognition method and device
CN110532381A (en) A kind of text vector acquisition methods, device, computer equipment and storage medium
CN113705313A (en) Text recognition method, device, equipment and medium
CN113255331B (en) Text error correction method, device and storage medium
CN111680684B (en) Spine text recognition method, device and storage medium based on deep learning
CN110738238A (en) certificate information classification positioning method and device
US20200279079A1 (en) Predicting probability of occurrence of a string using sequence of vectors
CN110851597A (en) Method and device for sentence annotation based on similar entity replacement
CN114897060A (en) Training method and device of sample classification model, and sample classification method and device
CN112434686B (en) End-to-end misplaced text classification identifier for OCR (optical character) pictures
CN112861864A (en) Topic entry method, topic entry device, electronic device and computer-readable storage medium
CN114092931B (en) Scene character recognition method and device, electronic equipment and storage medium
CN114492661A (en) Text data classification method and device, computer equipment and storage medium
CN110889276B (en) Method, system and computer medium for extracting pointer type extraction triplet information by complex fusion characteristics
CN112307749A (en) Text error detection method and device, computer equipment and storage medium
CN111126059A (en) Method and device for generating short text and readable storage medium
CN114003708B (en) Automatic question-answering method and device based on artificial intelligence, storage medium and server
CN115796141A (en) Text data enhancement method and device, electronic equipment and storage medium
CN116226450A (en) Video representation method and device based on unsupervised pre-training model
CN115017906A (en) Method, device and storage medium for identifying entities in text
CN115130475A (en) Extensible universal end-to-end named entity identification method
CN114782958A (en) Text error detection model training method, text error detection method and text error detection device
Idziak et al. Scalable handwritten text recognition system for lexicographic sources of under-resourced languages and alphabets

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
CB02 Change of applicant information

Address after: 101-8, 1st floor, building 31, area 1, 188 South Fourth Ring Road West, Fengtai District, Beijing

Applicant after: Guoxin Youyi Data Co.,Ltd.

Address before: 100070, No. 188, building 31, headquarters square, South Fourth Ring Road West, Fengtai District, Beijing

Applicant before: SIC YOUE DATA Co.,Ltd.

GR01 Patent grant