CN114580429A

CN114580429A - Artificial intelligence-based language and image understanding integrated service system

Info

Publication number: CN114580429A
Application number: CN202210092447.0A
Authority: CN
Inventors: 陈慧
Original assignee: Yunjie Computer Software Jiangsu Co ltd
Current assignee: Yunjie Computer Software Jiangsu Co ltd
Priority date: 2022-01-26
Filing date: 2022-01-26
Publication date: 2022-06-03

Abstract

The invention discloses a language and image understanding integrated service system based on artificial intelligence, which comprises a preprocessing module, a calculation and analysis module and a judgment and display module, wherein the preprocessing module is used for analyzing and preprocessing the content of characters input by scanning, the calculation and analysis module is used for identifying semantic information of the scanned characters and calculating the actual distance of characteristic vectors of the scanned characters, the judgment and display module is used for judging and confirming the comprehensive calculation result of the characters and displaying the comprehensive calculation result, the preprocessing module is electrically connected with the calculation and analysis module, the calculation and analysis module is electrically connected with the judgment and display module, firstly, character texts identified by users are preprocessed, a similar character template library is synchronously established, the distance of the characteristic vectors of the character texts is calculated through the characteristic vectors obtained in the preprocessing, and finally, the character semantic values are comprehensively calculated according to the distance of the characteristic vectors and the character currency values related to word frequency, the method has the characteristics of accurate identification and strong practicability.

Description

Artificial intelligence-based language and image understanding integrated service system

Technical Field

The invention relates to the technical field of character and image recognition, in particular to a language and image understanding integrated service system based on artificial intelligence.

Background

With the popularization of electronic office work, the application of picture character recognition software is also popularized, and a plurality of units and individuals regard the picture character recognition software as necessary software, so that a large number of picture characters can be rapidly recognized into editable text characters, and the characters can be conveniently processed.

The recognition accuracy in the picture character recognition process is always a key concern of people, but the recognition accuracy of the existing character recognition technology in some application scenes is relatively low, and the identification technology is mainly embodied in the recognition application of multiple texts and long texts with a certain standard legend in a unit, and some minor errors which are troublesome to process often occur in the recognition process, for example, one section of the character which should be recognized originally is recognized as two or even multiple sections of split characters, or a square legend in the character is recognized as a similar character, a Chinese character 'one' is recognized as a horizontal bar or is recognized reversely, and a continuous writing number 13 is recognized as an uppercase English letter 'B', and the like, so that the occurrence of the problems causes that the recognized text still needs a lot of time and energy for removing kernel and modifying, and even can be directly printed and used after being recognized in some scenes, and the error information can cause errors and misunderstanding in the later use process, so that it is necessary to design an artificial intelligence-based language and image understanding integrated service system with strong practicability and high accuracy.

Disclosure of Invention

The invention aims to provide an artificial intelligence-based language and image understanding integrated service system to solve the problems in the background art.

In order to solve the technical problems, the invention provides the following technical scheme: the utility model provides a language and image understanding integration service system based on artificial intelligence, includes preprocessing module, calculates analysis module and judges display module, its characterized in that: the device comprises a preprocessing module, a calculation and analysis module, a judgment display module, a calculation and analysis module and a display module, wherein the preprocessing module is used for analyzing and preprocessing the scanned and input character content, the calculation and analysis module is used for identifying semantic information of scanned characters and calculating the actual distance of characteristic vectors of the scanned characters, the judgment display module is used for judging and confirming the comprehensive calculation result of the characters and displaying the comprehensive calculation result, the preprocessing module is electrically connected with the calculation and analysis module, and the calculation and analysis module is electrically connected with the judgment display module.

According to the technical scheme, the preprocessing module comprises a scanning input module, a character image preprocessing module and a similar character template establishing module, the scanning input module is used for scanning and extracting characters on the image and the file to be recognized, the character image preprocessing module is used for carrying out preprocessing operations such as segmentation, binaryzation, smooth denoising, thinning and normalization on the character text image recorded by scanning, and the similar character template establishing module is used for establishing a sample template with multiple text dimensions such as similar characters, symbols, numbers and letters.

According to the technical scheme, the calculation and analysis module comprises a feature vector distance module, a semantic analysis module and a character comprehensive calculation module, the feature vector distance module is used for calculating the feature vector distance of a single divided character, the analysis and judgment module is used for judging whether the character belongs to the character needing to be added for semantic understanding to carry out comprehensive calculation, the semantic analysis module is used for analyzing the meaning of the character in combination with the meanings of the front character and the rear character, the character comprehensive calculation module is used for carrying out comprehensive calculation according to the semantics and the vector distance to confirm the character, the feature vector distance module is electrically connected with the analysis and judgment module, and the analysis and judgment module is electrically connected with the character comprehensive calculation module.

According to the technical scheme, the feature vector distance module comprises an actual character distance calculation submodule and a similar character distance calculation submodule, the actual character distance calculation submodule is used for calculating a standard distance value between a feature vector of a current character and the character in a template database, the similar character distance calculation submodule is used for calculating the distance between the character and the similar character in the template database to carry out distance matching calculation, the semantic analysis module comprises a single-character semantic analysis submodule and a front-and-back character semantic analysis submodule, the single-character semantic analysis submodule is used for analyzing the analysis meaning of the character in the database template corresponding to the current recognized character, and the front-and-back character semantic analysis submodule is used for analyzing semantic information of front-and-back adjacent characters of the current character in the database.

According to the technical scheme, the judgment display module comprises a judgment confirmation module and a character presentation module, the judgment confirmation module is used for confirming the accurate characters in the database corresponding to the current characters according to the calculated comprehensive character information, and the character presentation module is used for presenting and displaying the accurate characters.

According to the technical scheme, the operation method of the language and image understanding integrated service system mainly comprises the following steps:

step S1: a user puts a picture or a file to be identified into an identification position to click an identification function, a preprocessing module analyzes character and character scanning and inputting, and preprocessing steps such as image area division, characteristic vector extraction and the like are carried out;

step S2: synchronously establishing a similar character template module in an existing reference character template, adjusting the character position in the character template according to the font similarity of the preprocessed character, and establishing a character morphological characteristic multi-template library when the character with larger similarity is at an adjacent position;

step S3: carrying out feature vector distance calculation on the preprocessed characters, comparing and judging the calculated distance with a character form feature template library, and judging the character accuracy and fuzziness;

step S4: the semantic analysis module performs single character semantic analysis and front and back character semantic analysis on the characters with ambiguity, and the character comprehensive calculation module performs comprehensive calculation on the characters with ambiguity according to the similarity value and the semantic analysis value;

step S5: and judging the final character corresponding to the comprehensive calculation result, and displaying.

According to the above technical solution, the step S2 further includes the following steps:

step S21: the character image preprocessing module extracts feature vectors of each character to be recognized in the horizontal direction, the vertical direction, the left falling direction and the right falling direction by using a skeleton extraction technology to obtain a four-dimensional feature vector X (X) of the character to be recognized₁,x₂,x₃,x₄)^T；

Step S22: according to the set existing m-class template mode W of the reference character₁，W₂，W₃，W_mEstablishing a four-dimensional feature vector Y_i＝(y_i1,y_i2,y_i3,y_i4) Class i schema W of the representation_iThe reference template of (1).

According to the above technical solution, the step S3 further includes the following steps:

step S31: four-dimensional characteristic vector X ═ X (X) of N characters to be recognized extracted from character image preprocessing module₁,x₂,x₃,x₄)^TThe sample is automatically identified to a class template W matched with the reference template by using a clustering algorithm_iThe clustering algorithm is to compare and analyze the recognized character feature vectors of the same category with the character templates of the category in the template library;

step S32: when the clustering algorithm automatically identifies more than one type of matchable reference template, the best matched first type of template W is removed_iAnd other templates W with higher similarity_m；

Step S33: the similar character distance calculation submodule calculates the distance between the similar character and the W by utilizing a four-dimensional characteristic vector calculation formula_iDistance d (X, Y) between quasi-reference templates_i) And similarity, and other kinds of templates W_mD (X, Y) of the two_m) And similarity;

step S34: the actual character distance calculation submodule calculates the distance between the actual character distance and the W according to a four-dimensional feature vector calculation formula_iDistance d (X, Y) between quasi-reference templates_i)，：

In the formula, x₁,x₂,x₃,x₄Feature vector y representing four directions of horizontal, vertical, left-falling and right-falling of character actually measured₁,y₂,y₃,y₄Represents W_iThe standard horizontal, vertical, left-falling and right-falling characteristic vectors of the characters in the class reference template, and the larger the difference of the dimensional values of the two characteristic vectors is, the larger the distance is;

step S35: calculating the similarity between the character and the standard character by distance conversion of the calculated feature vector

Similarity S (X, Y)_i) In the range of [0,1]The closer the distance, the greater the similarity.

According to the above technical solution, the step S4 further includes the following steps:

step S41: when the clustering algorithm automatically identifies and matches a unique reference template, the analysis and judgment module judges the identification as accurate identification, and directly displays the identification without semantic analysis;

step S42: when the clustering algorithm automatically identifies and matches a plurality of types of reference templates, analyzing and judging the calculated similarity, when the similarity is large, using the template with the large similarity as the accurate identification and not performing semantic analysis, and when the calculated similarity is small, judging the character as fuzzy identification;

step S43: the semantic analysis module carries out semantic analysis on the judged fuzzy recognized characters, and the matched characters in the multi-class templates are respectively entered into a standard character semantic database to carry out single character semantic analysis;

step S44: combining the analyzed single characters with front and rear characters to form word frequency, analyzing and judging the use frequency of the word frequency by using a word frequency database, wherein the higher the use frequency of the word frequency is, the larger the sentence currency value in the sentence pattern is, so that the sentence currency value K of the sentence pattern is analyzed by placing the word frequency in a complete sentence pattern, and the range of the sentence currency value K is [0,1 ];

step S45: the character comprehensive calculation module calculates the comprehensive degree of character semantics according to the calculated character similarity and sentence currency value, and the calculation formula is as follows:

in the formula, Q is the semantic integration degree of the character, S is the similarity of the character, and K is the sentence currency value in the complete sentence pattern formed by the characters before and after the character is put in, i.e. when the similarity S of the character is close to or unchanged, the sentence currency value K is larger, the semantic integration degree Q is also larger.

According to the above technical solution, the step S5 further includes the following steps:

step S51: and the judgment confirming module judges the standard character in the reference module corresponding to the confirmed character according to the comprehensive degree calculated by the character comprehensive calculating module.

Compared with the prior art, the invention has the following beneficial effects: the invention can synchronously establish a similar character template library besides the basic preprocessing work of the character text recognized by a user by arranging a preprocessing module, a calculation and analysis module and a judgment and display module, carry out the calculation and judgment of the characteristic vector distance of the character characteristic vector obtained in the preprocessing, judge whether the character belongs to the accurate recognition or the fuzzy recognition in the character template library, carry out the word frequency semantic value calculation of the character recognized in a fuzzy manner, and finally comprehensively calculate the accurate semantic value of the character according to the characteristic vector distance and the character currency value related to the word frequency, on the basis of original character recognition, fuzzy characters are added to combine into word frequency, the currency value of the sentence is obtained through the use frequency of the word frequency in a word frequency database and the complete sentence pattern formed by combining front and rear characters, and finally comprehensive calculation and judgment are carried out according to the eigenvector distance and the sentence currency value.

Drawings

The accompanying drawings, which are included to provide a further understanding of the invention and are incorporated in and constitute a part of this specification, illustrate embodiments of the invention and together with the description serve to explain the principles of the invention and not to limit the invention. In the drawings:

FIG. 1 is a schematic diagram of the system module composition of the present invention.

Detailed Description

The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are only a part of the embodiments of the present invention, and not all of the embodiments. All other embodiments, which can be obtained by a person skilled in the art without making any creative effort based on the embodiments in the present invention, belong to the protection scope of the present invention.

Referring to fig. 1, the present invention provides a technical solution: an artificial intelligence-based language and image understanding integrated service system comprises a preprocessing module, a calculation and analysis module and a judgment and display module, wherein the preprocessing module is used for analyzing and preprocessing scanned and input character contents, the calculation and analysis module is used for identifying semantic information of scanned characters and calculating actual distances of characteristic vectors of the scanned characters, the judgment and display module is used for judging and confirming comprehensive calculation results of characters and displaying the comprehensive calculation results, the preprocessing module is electrically connected with the calculation and analysis module, the calculation and analysis module is electrically connected with the judgment and display module, preprocessing work is carried out on character texts identified by users, a similar character template library is synchronously established, the characteristic vector distances of the character texts are calculated through character characteristic vectors obtained in preprocessing, and finally, character semantic values are comprehensively calculated according to the characteristic vector distances and character currency values related to word frequency.

The preprocessing module comprises a scanning input module, a character image preprocessing module and a similar character template establishing module, wherein the scanning input module is used for scanning and extracting characters on pictures and files to be recognized, the character image preprocessing module is used for carrying out preprocessing work such as segmentation, binaryzation, smooth denoising, thinning and normalization on character text images recorded by scanning, the similar character template establishing module is used for establishing sample templates with multiple text dimensions such as similar characters, symbols, numbers and letters, the preprocessing work is used for preparing subsequent calculation of feature vector distances and is the basis of all calculation recognition, a template base is established by dividing similar character dimensions into a class of templates and establishing the templates in an adjacent mode, and when fuzzy characters are recognized in the later stage, the similar characters can be accurately calculated and judged according to feature vectors and word frequencies.

The calculation and analysis module comprises a characteristic vector distance module, a semantic analysis module and a character comprehensive calculation module, wherein the characteristic vector distance module is used for calculating the characteristic vector distance of a single divided character, the analysis and judgment module is used for judging whether the character belongs to the condition that semantic comprehension needs to be added for comprehensive calculation, the semantic analysis module is used for analyzing the meaning of the character in combination with the meanings of front and back characters, the character comprehensive calculation module is used for comprehensively calculating and confirming the character according to the semantics and the vector distance, the characteristic vector distance module is electrically connected with the analysis and judgment module, the analysis and judgment module is electrically connected with the character comprehensive calculation module, the character which can be accurately identified through the characteristic vector distance of the character is directly output, the character with ambiguity is added into the semantic comprehension for comprehensive calculation, the semantic comprehension is that the single character and the front and back characters are combined into word frequency, the use frequency of the word frequency database is analyzed, and the currency value of a sentence is further brought into a complete sentence formula, the higher the frequency of use the greater the compliance value.

The characteristic vector distance module comprises an actual character distance calculation submodule and a similar character distance calculation submodule, the actual character distance calculation submodule is used for calculating a standard distance value between a characteristic vector of a current character and the character in a template database, the similar character distance calculation submodule is used for calculating the distance matching between the character and the similar character in the template database, the semantic analysis module comprises a single-character semantic analysis submodule and a front-and-back character semantic analysis submodule, the single-character semantic analysis submodule is used for analyzing the analysis meaning of the character in the database template corresponding to the current recognized character, and the front-and-back character semantic analysis submodule is used for analyzing semantic information of front-and-back adjacent characters of the current character in the database.

The judgment and display module comprises a judgment and confirmation module and a character presentation module, the judgment and confirmation module is used for confirming the accurate characters in the database corresponding to the current characters according to the calculated comprehensive character information, the character presentation module is used for presenting and displaying the correct characters, finally, the characters accurately identified through the characteristic vector distance are presented and displayed, and finally, the characters accurately identified by the fuzzy characters are presented and displayed by utilizing semantic analysis.

The operation method of the language and image understanding integrated service system mainly comprises the following steps:

step S1: the method comprises the steps that a user puts a picture or a file to be recognized into a recognition position to click a recognition function, a preprocessing module analyzes character scanning input, and preprocessing steps such as image area division and characteristic vector extraction are further carried out;

step S2: synchronously establishing a similar character template module in an existing reference character template, adjusting the character position in the character template according to the font similarity of the preprocessed characters, establishing a character morphological characteristic multi-template library when the characters with larger similarity are in the adjacent position, establishing a large number of character template libraries in the recognition system, subsequently adjusting and dividing the positions of the similar characters recognized in the recognition process through unsupervised learning, and establishing a similar character template;

step S3: carrying out feature vector distance calculation on the preprocessed characters, carrying out comparison judgment according to the calculated distance and a character form feature template library, judging character accuracy and ambiguity, carrying out comparison judgment on the calculated distance, wherein most characters can be accurately identified and matched, but uncertainty exists in the case of aiming at partial extremely similar characters, namely the identified characters have certain ambiguity;

step S4: the semantic analysis module performs single-character semantic analysis and front-and-back character semantic analysis on the characters with ambiguity, and the character comprehensive calculation module performs comprehensive calculation on the characters with ambiguity according to the similarity value and the semantic analysis value;

step S5: and judging the final character corresponding to the comprehensive calculation result, and displaying the final character.

Step S2 further includes the steps of:

step S21: the character image preprocessing module extracts feature vectors of each character to be recognized in the horizontal direction, the vertical direction, the left falling direction and the right falling direction by using a skeleton extraction technology to obtain a four-dimensional feature vector X (X) of the character to be recognized₁,x₂,x₃,x₄)^TThe character feature vector can be selected in multiple dimensions, and the feature vectors in the horizontal, vertical, left-falling and right-falling directions of the most basic character are selected for subsequent vector distance calculation;

Step S3 further includes the steps of:

step S31: four-dimensional characteristic vector X ═ X (X) of N characters to be recognized extracted from character image preprocessing module₁,x₂,x₃,x₄)^TAutomatically identifying class template W matched into reference template by using clustering algorithm_iThe clustering algorithm automatically divides the recognized character texts, specifically belongs to a character class, a number class, a letter class and the like, and finds character templates corresponding to different base classes in the database;

step S32: when the clustering algorithm automatically identifies more than one type of matchable reference template, the best matched first type of template W is removed_iAnd other templates W with higher similarity_mMost of characters with ambiguity are uncertainty of similar base classes, and the other part has ambiguity of crossing base classes;

In the formula, x₁,x₂,x₃,x₄Feature vector y representing four directions of horizontal, vertical, left-falling and right-falling of character actually measured₁,y₂,y₃,y₄Represents W_iThe standard horizontal, vertical, left-falling and right-falling characteristic vectors of the characters in the class reference template, and the larger the difference of the dimension values of the two characteristic vectors is, the larger the difference is, the characteristic vectors areThe greater the distance;

Step S4 further includes the steps of:

step S42: when the clustering algorithm automatically identifies and matches a plurality of types of reference templates, analyzing and judging the calculated similarity, when the similarity is large, using a type of template with large similarity as accurate identification to not perform semantic analysis, when the calculated similarity is small, judging the character to be recognized as fuzzy identification, after the distance of a feature vector is obtained, obtaining a plurality of types of reference templates according to distance matching, and in order to further screen and calculate the similarity of the character matched with the template, wherein the similarity in the plurality of types of templates is different, selecting a type with high similarity on the calculated distance of the feature vector, and performing further character calculation when the similarity is similar;

step S44: combining the analyzed single characters with front and rear characters to form word frequency, analyzing and judging the use frequency of the word frequency by using a word frequency database, wherein the higher the use frequency of the word frequency is, the larger the sentence currency value in a sentence pattern is, so that the word frequency is put into the complete sentence pattern to analyze the sentence currency value K of the sentence pattern, the range of the sentence currency value K is [0,1], the word frequency is formed by combining the characters and the characters, the correct word frequency frequently used by people is stored in the database, and the word frequency is higher and the currency value is larger when the sentence is put into the sentence according to the use rate of the word frequency;

step S45: the character comprehensive calculation module calculates the character semantic comprehensive degree according to the calculated character similarity and the sentence currency value, and the calculation formula is as follows:

Step S5 further includes the steps of:

It is noted that, herein, relational terms such as first and second, and the like may be used solely to distinguish one entity or action from another entity or action without necessarily requiring or implying any actual such relationship or order between such entities or actions. Also, the terms "comprises," "comprising," or any other variation thereof, are intended to cover a non-exclusive inclusion, such that a process, method, article, or apparatus that comprises a list of elements does not include only those elements but may include other elements not expressly listed or inherent to such process, method, article, or apparatus.

Finally, it should be noted that: although the present invention has been described in detail with reference to the foregoing embodiments, it will be apparent to those skilled in the art that changes may be made in the embodiments and/or equivalents thereof without departing from the spirit and scope of the invention. Any modification, equivalent replacement, or improvement made within the spirit and principle of the present invention should be included in the protection scope of the present invention.

Claims

1. The utility model provides a language and image understanding integration service system based on artificial intelligence, includes preprocessing module, calculates analysis module and judges display module, its characterized in that: the device comprises a preprocessing module, a calculation and analysis module, a judgment display module, a calculation and analysis module and a display module, wherein the preprocessing module is used for analyzing and preprocessing the scanned and input character content, the calculation and analysis module is used for identifying semantic information of scanned characters and calculating the actual distance of characteristic vectors of the scanned characters, the judgment display module is used for judging and confirming the comprehensive calculation result of the characters and displaying the comprehensive calculation result, the preprocessing module is electrically connected with the calculation and analysis module, and the calculation and analysis module is electrically connected with the judgment display module.

2. The system of claim 1, wherein the service system comprises: the preprocessing module comprises a scanning input module, a character image preprocessing module and a similar character template establishing module, wherein the scanning input module is used for scanning and extracting characters on pictures and files to be recognized, the character image preprocessing module is used for carrying out preprocessing work such as segmentation, binaryzation, smooth denoising, thinning and normalization on character text images recorded by scanning, and the similar character template establishing module is used for establishing sample templates with multiple text dimensions such as similar characters, symbols, numbers, letters and the like.

3. The system of claim 1, wherein the service system comprises: the calculation and analysis module comprises a feature vector distance module, a semantic analysis module and a character comprehensive calculation module, the feature vector distance module is used for calculating the feature vector distance of a single divided character, the analysis and judgment module is used for judging whether the character belongs to the character needing to be added with semantic understanding to carry out comprehensive calculation, the semantic analysis module is used for analyzing the meaning of the character in combination with the meaning of the front character and the meaning of the rear character, the character comprehensive calculation module is used for carrying out comprehensive calculation according to the semantics and the vector distance to confirm the character, the feature vector distance module is electrically connected with the analysis and judgment module, and the analysis and judgment module is electrically connected with the character comprehensive calculation module.

4. The artificial intelligence based language and image understanding integrated service system according to claim 3, wherein: the system comprises a characteristic vector distance module, a template database, a character distance module, a semantic analysis module and a front and back character semantic analysis module, wherein the characteristic vector distance module comprises an actual character distance calculation submodule and a similar character distance calculation submodule, the actual character distance calculation submodule is used for calculating a standard distance value between a characteristic vector of a current character and the character in the template database, the similar character distance calculation submodule is used for calculating the distance matching calculation between the character and the similar character in the template database, the semantic analysis module comprises a single-character semantic analysis submodule and a front and back character semantic analysis submodule, the single-character semantic analysis submodule is used for analyzing the analysis meaning of the character in the database template corresponding to the current recognized character, and the front and back character semantic analysis submodule is used for analyzing semantic information of front and back adjacent characters of the current character in the database.

5. The integrated service system for language and image understanding based on artificial intelligence of claim 4, wherein: the judgment display module comprises a judgment confirmation module and a character presentation module, the judgment confirmation module is used for confirming the accurate characters in the database corresponding to the current characters according to the calculated comprehensive character information, and the character presentation module is used for presenting and displaying the correct characters.

6. The artificial intelligence based language and image understanding integrated service system according to claim 5, wherein: the operation method of the language and image understanding integrated service system mainly comprises the following steps:

step S3: carrying out feature vector distance calculation on the preprocessed characters, comparing and judging the calculated distance with a character form feature template library, and judging character accuracy and fuzziness;

7. The integrated service system for speech and image understanding based on artificial intelligence of claim 6, wherein: the step S2 further includes the steps of:

step S21: the character image preprocessing module extracts feature vectors of each character to be recognized in the horizontal direction, the vertical direction, the left falling direction and the right falling direction by using a skeleton extraction technology to obtain a four-dimensional feature vector X (X) of the character to be recognized₁，x₂，x₃，x₄)^T；

Step S22: according to the set existing m-class template mode W of the reference character₁，W₂，W₃，W_mEstablishing a four-dimensional feature vector Y_i＝(y_i1，y_i2，y_i3，y_i4) Class i schema W of the representation_iThe reference template of (1).

8. The artificial intelligence based language and image understanding integrated service system according to claim 7, wherein: the step S3 further includes the steps of:

step S31: n to-be-recognized characters extracted from character image preprocessing moduleFour-dimensional character vector X ═ X₁，x₂，x₃，x₄)^TThe sample is automatically identified to a class template W matched with the reference template by using a clustering algorithm_iThe clustering algorithm is to compare and analyze the recognized character feature vectors of the same category with the character templates of the category in the template library;

In the formula, x₁，x₂，x₃，x₄Feature vector y representing four directions of horizontal, vertical, left-falling and right-falling of character actually measured₁，y₂，y₃，y₄Represents W_iThe standard horizontal, vertical, left-falling and right-falling characteristic vectors of the characters in the class reference template, and the larger the difference of the dimensional values of the two characteristic vectors is, the larger the distance is;

9. The artificial intelligence based language and image understanding integrated service system according to claim 6, wherein: the step S4 further includes the steps of:

10. The artificial intelligence based language and image understanding integrated service system according to claim 6, wherein: the step S5 further includes the steps of: