CN114580429A - Artificial intelligence-based language and image understanding integrated service system - Google Patents
Artificial intelligence-based language and image understanding integrated service system Download PDFInfo
- Publication number
- CN114580429A CN114580429A CN202210092447.0A CN202210092447A CN114580429A CN 114580429 A CN114580429 A CN 114580429A CN 202210092447 A CN202210092447 A CN 202210092447A CN 114580429 A CN114580429 A CN 114580429A
- Authority
- CN
- China
- Prior art keywords
- character
- module
- distance
- calculation
- characters
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 238000013473 artificial intelligence Methods 0.000 title claims abstract description 16
- 238000004364 calculation method Methods 0.000 claims abstract description 96
- 238000004458 analytical method Methods 0.000 claims abstract description 90
- 239000013598 vector Substances 0.000 claims abstract description 83
- 238000007781 pre-processing Methods 0.000 claims abstract description 49
- 238000000034 method Methods 0.000 claims abstract description 14
- 230000010354 integration Effects 0.000 claims description 8
- 238000012790 confirmation Methods 0.000 claims description 6
- 238000000605 extraction Methods 0.000 claims description 6
- 238000006243 chemical reaction Methods 0.000 claims description 3
- 239000000284 extract Substances 0.000 claims description 3
- 230000006870 function Effects 0.000 claims description 3
- 230000000877 morphologic effect Effects 0.000 claims description 3
- 238000010606 normalization Methods 0.000 claims description 3
- 230000011218 segmentation Effects 0.000 claims description 3
- 230000008569 process Effects 0.000 description 7
- 230000009471 action Effects 0.000 description 3
- 238000005516 engineering process Methods 0.000 description 2
- 230000009286 beneficial effect Effects 0.000 description 1
- 238000010586 diagram Methods 0.000 description 1
- 230000006872 improvement Effects 0.000 description 1
- 239000000203 mixture Substances 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/30—Semantic analysis
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/20—Natural language analysis
- G06F40/205—Parsing
- G06F40/211—Syntactic parsing, e.g. based on context-free grammar [CFG] or unification grammars
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/20—Natural language analysis
- G06F40/205—Parsing
- G06F40/216—Parsing using statistical methods
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y02—TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
- Y02D—CLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
- Y02D10/00—Energy efficient computing, e.g. low power processors, power management or thermal management
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- Health & Medical Sciences (AREA)
- Artificial Intelligence (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Computational Linguistics (AREA)
- General Health & Medical Sciences (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Probability & Statistics with Applications (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
The invention discloses a language and image understanding integrated service system based on artificial intelligence, which comprises a preprocessing module, a calculation and analysis module and a judgment and display module, wherein the preprocessing module is used for analyzing and preprocessing the content of characters input by scanning, the calculation and analysis module is used for identifying semantic information of the scanned characters and calculating the actual distance of characteristic vectors of the scanned characters, the judgment and display module is used for judging and confirming the comprehensive calculation result of the characters and displaying the comprehensive calculation result, the preprocessing module is electrically connected with the calculation and analysis module, the calculation and analysis module is electrically connected with the judgment and display module, firstly, character texts identified by users are preprocessed, a similar character template library is synchronously established, the distance of the characteristic vectors of the character texts is calculated through the characteristic vectors obtained in the preprocessing, and finally, the character semantic values are comprehensively calculated according to the distance of the characteristic vectors and the character currency values related to word frequency, the method has the characteristics of accurate identification and strong practicability.
Description
Technical Field
The invention relates to the technical field of character and image recognition, in particular to a language and image understanding integrated service system based on artificial intelligence.
Background
With the popularization of electronic office work, the application of picture character recognition software is also popularized, and a plurality of units and individuals regard the picture character recognition software as necessary software, so that a large number of picture characters can be rapidly recognized into editable text characters, and the characters can be conveniently processed.
The recognition accuracy in the picture character recognition process is always a key concern of people, but the recognition accuracy of the existing character recognition technology in some application scenes is relatively low, and the identification technology is mainly embodied in the recognition application of multiple texts and long texts with a certain standard legend in a unit, and some minor errors which are troublesome to process often occur in the recognition process, for example, one section of the character which should be recognized originally is recognized as two or even multiple sections of split characters, or a square legend in the character is recognized as a similar character, a Chinese character 'one' is recognized as a horizontal bar or is recognized reversely, and a continuous writing number 13 is recognized as an uppercase English letter 'B', and the like, so that the occurrence of the problems causes that the recognized text still needs a lot of time and energy for removing kernel and modifying, and even can be directly printed and used after being recognized in some scenes, and the error information can cause errors and misunderstanding in the later use process, so that it is necessary to design an artificial intelligence-based language and image understanding integrated service system with strong practicability and high accuracy.
Disclosure of Invention
The invention aims to provide an artificial intelligence-based language and image understanding integrated service system to solve the problems in the background art.
In order to solve the technical problems, the invention provides the following technical scheme: the utility model provides a language and image understanding integration service system based on artificial intelligence, includes preprocessing module, calculates analysis module and judges display module, its characterized in that: the device comprises a preprocessing module, a calculation and analysis module, a judgment display module, a calculation and analysis module and a display module, wherein the preprocessing module is used for analyzing and preprocessing the scanned and input character content, the calculation and analysis module is used for identifying semantic information of scanned characters and calculating the actual distance of characteristic vectors of the scanned characters, the judgment display module is used for judging and confirming the comprehensive calculation result of the characters and displaying the comprehensive calculation result, the preprocessing module is electrically connected with the calculation and analysis module, and the calculation and analysis module is electrically connected with the judgment display module.
According to the technical scheme, the preprocessing module comprises a scanning input module, a character image preprocessing module and a similar character template establishing module, the scanning input module is used for scanning and extracting characters on the image and the file to be recognized, the character image preprocessing module is used for carrying out preprocessing operations such as segmentation, binaryzation, smooth denoising, thinning and normalization on the character text image recorded by scanning, and the similar character template establishing module is used for establishing a sample template with multiple text dimensions such as similar characters, symbols, numbers and letters.
According to the technical scheme, the calculation and analysis module comprises a feature vector distance module, a semantic analysis module and a character comprehensive calculation module, the feature vector distance module is used for calculating the feature vector distance of a single divided character, the analysis and judgment module is used for judging whether the character belongs to the character needing to be added for semantic understanding to carry out comprehensive calculation, the semantic analysis module is used for analyzing the meaning of the character in combination with the meanings of the front character and the rear character, the character comprehensive calculation module is used for carrying out comprehensive calculation according to the semantics and the vector distance to confirm the character, the feature vector distance module is electrically connected with the analysis and judgment module, and the analysis and judgment module is electrically connected with the character comprehensive calculation module.
According to the technical scheme, the feature vector distance module comprises an actual character distance calculation submodule and a similar character distance calculation submodule, the actual character distance calculation submodule is used for calculating a standard distance value between a feature vector of a current character and the character in a template database, the similar character distance calculation submodule is used for calculating the distance between the character and the similar character in the template database to carry out distance matching calculation, the semantic analysis module comprises a single-character semantic analysis submodule and a front-and-back character semantic analysis submodule, the single-character semantic analysis submodule is used for analyzing the analysis meaning of the character in the database template corresponding to the current recognized character, and the front-and-back character semantic analysis submodule is used for analyzing semantic information of front-and-back adjacent characters of the current character in the database.
According to the technical scheme, the judgment display module comprises a judgment confirmation module and a character presentation module, the judgment confirmation module is used for confirming the accurate characters in the database corresponding to the current characters according to the calculated comprehensive character information, and the character presentation module is used for presenting and displaying the accurate characters.
According to the technical scheme, the operation method of the language and image understanding integrated service system mainly comprises the following steps:
step S1: a user puts a picture or a file to be identified into an identification position to click an identification function, a preprocessing module analyzes character and character scanning and inputting, and preprocessing steps such as image area division, characteristic vector extraction and the like are carried out;
step S2: synchronously establishing a similar character template module in an existing reference character template, adjusting the character position in the character template according to the font similarity of the preprocessed character, and establishing a character morphological characteristic multi-template library when the character with larger similarity is at an adjacent position;
step S3: carrying out feature vector distance calculation on the preprocessed characters, comparing and judging the calculated distance with a character form feature template library, and judging the character accuracy and fuzziness;
step S4: the semantic analysis module performs single character semantic analysis and front and back character semantic analysis on the characters with ambiguity, and the character comprehensive calculation module performs comprehensive calculation on the characters with ambiguity according to the similarity value and the semantic analysis value;
step S5: and judging the final character corresponding to the comprehensive calculation result, and displaying.
According to the above technical solution, the step S2 further includes the following steps:
step S21: the character image preprocessing module extracts feature vectors of each character to be recognized in the horizontal direction, the vertical direction, the left falling direction and the right falling direction by using a skeleton extraction technology to obtain a four-dimensional feature vector X (X) of the character to be recognized1,x2,x3,x4)T;
Step S22: according to the set existing m-class template mode W of the reference character1,W2,W3,WmEstablishing a four-dimensional feature vector Yi=(yi1,yi2,yi3,yi4) Class i schema W of the representationiThe reference template of (1).
According to the above technical solution, the step S3 further includes the following steps:
step S31: four-dimensional characteristic vector X ═ X (X) of N characters to be recognized extracted from character image preprocessing module1,x2,x3,x4)TThe sample is automatically identified to a class template W matched with the reference template by using a clustering algorithmiThe clustering algorithm is to compare and analyze the recognized character feature vectors of the same category with the character templates of the category in the template library;
step S32: when the clustering algorithm automatically identifies more than one type of matchable reference template, the best matched first type of template W is removediAnd other templates W with higher similaritym;
Step S33: the similar character distance calculation submodule calculates the distance between the similar character and the W by utilizing a four-dimensional characteristic vector calculation formulaiDistance d (X, Y) between quasi-reference templatesi) And similarity, and other kinds of templates WmD (X, Y) of the twom) And similarity;
step S34: the actual character distance calculation submodule calculates the distance between the actual character distance and the W according to a four-dimensional feature vector calculation formulaiDistance d (X, Y) between quasi-reference templatesi),:
In the formula, x1,x2,x3,x4Feature vector y representing four directions of horizontal, vertical, left-falling and right-falling of character actually measured1,y2,y3,y4Represents WiThe standard horizontal, vertical, left-falling and right-falling characteristic vectors of the characters in the class reference template, and the larger the difference of the dimensional values of the two characteristic vectors is, the larger the distance is;
step S35: calculating the similarity between the character and the standard character by distance conversion of the calculated feature vectorSimilarity S (X, Y)i) In the range of [0,1]The closer the distance, the greater the similarity.
According to the above technical solution, the step S4 further includes the following steps:
step S41: when the clustering algorithm automatically identifies and matches a unique reference template, the analysis and judgment module judges the identification as accurate identification, and directly displays the identification without semantic analysis;
step S42: when the clustering algorithm automatically identifies and matches a plurality of types of reference templates, analyzing and judging the calculated similarity, when the similarity is large, using the template with the large similarity as the accurate identification and not performing semantic analysis, and when the calculated similarity is small, judging the character as fuzzy identification;
step S43: the semantic analysis module carries out semantic analysis on the judged fuzzy recognized characters, and the matched characters in the multi-class templates are respectively entered into a standard character semantic database to carry out single character semantic analysis;
step S44: combining the analyzed single characters with front and rear characters to form word frequency, analyzing and judging the use frequency of the word frequency by using a word frequency database, wherein the higher the use frequency of the word frequency is, the larger the sentence currency value in the sentence pattern is, so that the sentence currency value K of the sentence pattern is analyzed by placing the word frequency in a complete sentence pattern, and the range of the sentence currency value K is [0,1 ];
step S45: the character comprehensive calculation module calculates the comprehensive degree of character semantics according to the calculated character similarity and sentence currency value, and the calculation formula is as follows:
in the formula, Q is the semantic integration degree of the character, S is the similarity of the character, and K is the sentence currency value in the complete sentence pattern formed by the characters before and after the character is put in, i.e. when the similarity S of the character is close to or unchanged, the sentence currency value K is larger, the semantic integration degree Q is also larger.
According to the above technical solution, the step S5 further includes the following steps:
step S51: and the judgment confirming module judges the standard character in the reference module corresponding to the confirmed character according to the comprehensive degree calculated by the character comprehensive calculating module.
Compared with the prior art, the invention has the following beneficial effects: the invention can synchronously establish a similar character template library besides the basic preprocessing work of the character text recognized by a user by arranging a preprocessing module, a calculation and analysis module and a judgment and display module, carry out the calculation and judgment of the characteristic vector distance of the character characteristic vector obtained in the preprocessing, judge whether the character belongs to the accurate recognition or the fuzzy recognition in the character template library, carry out the word frequency semantic value calculation of the character recognized in a fuzzy manner, and finally comprehensively calculate the accurate semantic value of the character according to the characteristic vector distance and the character currency value related to the word frequency, on the basis of original character recognition, fuzzy characters are added to combine into word frequency, the currency value of the sentence is obtained through the use frequency of the word frequency in a word frequency database and the complete sentence pattern formed by combining front and rear characters, and finally comprehensive calculation and judgment are carried out according to the eigenvector distance and the sentence currency value.
Drawings
The accompanying drawings, which are included to provide a further understanding of the invention and are incorporated in and constitute a part of this specification, illustrate embodiments of the invention and together with the description serve to explain the principles of the invention and not to limit the invention. In the drawings:
FIG. 1 is a schematic diagram of the system module composition of the present invention.
Detailed Description
The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are only a part of the embodiments of the present invention, and not all of the embodiments. All other embodiments, which can be obtained by a person skilled in the art without making any creative effort based on the embodiments in the present invention, belong to the protection scope of the present invention.
Referring to fig. 1, the present invention provides a technical solution: an artificial intelligence-based language and image understanding integrated service system comprises a preprocessing module, a calculation and analysis module and a judgment and display module, wherein the preprocessing module is used for analyzing and preprocessing scanned and input character contents, the calculation and analysis module is used for identifying semantic information of scanned characters and calculating actual distances of characteristic vectors of the scanned characters, the judgment and display module is used for judging and confirming comprehensive calculation results of characters and displaying the comprehensive calculation results, the preprocessing module is electrically connected with the calculation and analysis module, the calculation and analysis module is electrically connected with the judgment and display module, preprocessing work is carried out on character texts identified by users, a similar character template library is synchronously established, the characteristic vector distances of the character texts are calculated through character characteristic vectors obtained in preprocessing, and finally, character semantic values are comprehensively calculated according to the characteristic vector distances and character currency values related to word frequency.
The preprocessing module comprises a scanning input module, a character image preprocessing module and a similar character template establishing module, wherein the scanning input module is used for scanning and extracting characters on pictures and files to be recognized, the character image preprocessing module is used for carrying out preprocessing work such as segmentation, binaryzation, smooth denoising, thinning and normalization on character text images recorded by scanning, the similar character template establishing module is used for establishing sample templates with multiple text dimensions such as similar characters, symbols, numbers and letters, the preprocessing work is used for preparing subsequent calculation of feature vector distances and is the basis of all calculation recognition, a template base is established by dividing similar character dimensions into a class of templates and establishing the templates in an adjacent mode, and when fuzzy characters are recognized in the later stage, the similar characters can be accurately calculated and judged according to feature vectors and word frequencies.
The calculation and analysis module comprises a characteristic vector distance module, a semantic analysis module and a character comprehensive calculation module, wherein the characteristic vector distance module is used for calculating the characteristic vector distance of a single divided character, the analysis and judgment module is used for judging whether the character belongs to the condition that semantic comprehension needs to be added for comprehensive calculation, the semantic analysis module is used for analyzing the meaning of the character in combination with the meanings of front and back characters, the character comprehensive calculation module is used for comprehensively calculating and confirming the character according to the semantics and the vector distance, the characteristic vector distance module is electrically connected with the analysis and judgment module, the analysis and judgment module is electrically connected with the character comprehensive calculation module, the character which can be accurately identified through the characteristic vector distance of the character is directly output, the character with ambiguity is added into the semantic comprehension for comprehensive calculation, the semantic comprehension is that the single character and the front and back characters are combined into word frequency, the use frequency of the word frequency database is analyzed, and the currency value of a sentence is further brought into a complete sentence formula, the higher the frequency of use the greater the compliance value.
The characteristic vector distance module comprises an actual character distance calculation submodule and a similar character distance calculation submodule, the actual character distance calculation submodule is used for calculating a standard distance value between a characteristic vector of a current character and the character in a template database, the similar character distance calculation submodule is used for calculating the distance matching between the character and the similar character in the template database, the semantic analysis module comprises a single-character semantic analysis submodule and a front-and-back character semantic analysis submodule, the single-character semantic analysis submodule is used for analyzing the analysis meaning of the character in the database template corresponding to the current recognized character, and the front-and-back character semantic analysis submodule is used for analyzing semantic information of front-and-back adjacent characters of the current character in the database.
The judgment and display module comprises a judgment and confirmation module and a character presentation module, the judgment and confirmation module is used for confirming the accurate characters in the database corresponding to the current characters according to the calculated comprehensive character information, the character presentation module is used for presenting and displaying the correct characters, finally, the characters accurately identified through the characteristic vector distance are presented and displayed, and finally, the characters accurately identified by the fuzzy characters are presented and displayed by utilizing semantic analysis.
The operation method of the language and image understanding integrated service system mainly comprises the following steps:
step S1: the method comprises the steps that a user puts a picture or a file to be recognized into a recognition position to click a recognition function, a preprocessing module analyzes character scanning input, and preprocessing steps such as image area division and characteristic vector extraction are further carried out;
step S2: synchronously establishing a similar character template module in an existing reference character template, adjusting the character position in the character template according to the font similarity of the preprocessed characters, establishing a character morphological characteristic multi-template library when the characters with larger similarity are in the adjacent position, establishing a large number of character template libraries in the recognition system, subsequently adjusting and dividing the positions of the similar characters recognized in the recognition process through unsupervised learning, and establishing a similar character template;
step S3: carrying out feature vector distance calculation on the preprocessed characters, carrying out comparison judgment according to the calculated distance and a character form feature template library, judging character accuracy and ambiguity, carrying out comparison judgment on the calculated distance, wherein most characters can be accurately identified and matched, but uncertainty exists in the case of aiming at partial extremely similar characters, namely the identified characters have certain ambiguity;
step S4: the semantic analysis module performs single-character semantic analysis and front-and-back character semantic analysis on the characters with ambiguity, and the character comprehensive calculation module performs comprehensive calculation on the characters with ambiguity according to the similarity value and the semantic analysis value;
step S5: and judging the final character corresponding to the comprehensive calculation result, and displaying the final character.
Step S2 further includes the steps of:
step S21: the character image preprocessing module extracts feature vectors of each character to be recognized in the horizontal direction, the vertical direction, the left falling direction and the right falling direction by using a skeleton extraction technology to obtain a four-dimensional feature vector X (X) of the character to be recognized1,x2,x3,x4)TThe character feature vector can be selected in multiple dimensions, and the feature vectors in the horizontal, vertical, left-falling and right-falling directions of the most basic character are selected for subsequent vector distance calculation;
step S22: according to the set existing m-class template mode W of the reference character1,W2,W3,WmEstablishing a four-dimensional feature vector Yi=(yi1,yi2,yi3,yi4) Class i schema W of the representationiThe reference template of (1).
Step S3 further includes the steps of:
step S31: four-dimensional characteristic vector X ═ X (X) of N characters to be recognized extracted from character image preprocessing module1,x2,x3,x4)TAutomatically identifying class template W matched into reference template by using clustering algorithmiThe clustering algorithm automatically divides the recognized character texts, specifically belongs to a character class, a number class, a letter class and the like, and finds character templates corresponding to different base classes in the database;
step S32: when the clustering algorithm automatically identifies more than one type of matchable reference template, the best matched first type of template W is removediAnd other templates W with higher similaritymMost of characters with ambiguity are uncertainty of similar base classes, and the other part has ambiguity of crossing base classes;
step S33: the similar character distance calculation submodule calculates the distance between the similar character and the W by utilizing a four-dimensional characteristic vector calculation formulaiDistance d (X, Y) between quasi-reference templatesi) And similarity, and other kinds of templates WmD (X, Y) of the twom) And similarity;
step S34: the actual character distance calculation submodule calculates the distance between the actual character distance and the W according to a four-dimensional feature vector calculation formulaiDistance d (X, Y) between quasi-reference templatesi),:
In the formula, x1,x2,x3,x4Feature vector y representing four directions of horizontal, vertical, left-falling and right-falling of character actually measured1,y2,y3,y4Represents WiThe standard horizontal, vertical, left-falling and right-falling characteristic vectors of the characters in the class reference template, and the larger the difference of the dimension values of the two characteristic vectors is, the larger the difference is, the characteristic vectors areThe greater the distance;
step S35: calculating the similarity between the character and the standard character by distance conversion of the calculated feature vectorSimilarity S (X, Y)i) In the range of [0,1]The closer the distance, the greater the similarity.
Step S4 further includes the steps of:
step S41: when the clustering algorithm automatically identifies and matches a unique reference template, the analysis and judgment module judges the identification as accurate identification, and directly displays the identification without semantic analysis;
step S42: when the clustering algorithm automatically identifies and matches a plurality of types of reference templates, analyzing and judging the calculated similarity, when the similarity is large, using a type of template with large similarity as accurate identification to not perform semantic analysis, when the calculated similarity is small, judging the character to be recognized as fuzzy identification, after the distance of a feature vector is obtained, obtaining a plurality of types of reference templates according to distance matching, and in order to further screen and calculate the similarity of the character matched with the template, wherein the similarity in the plurality of types of templates is different, selecting a type with high similarity on the calculated distance of the feature vector, and performing further character calculation when the similarity is similar;
step S43: the semantic analysis module carries out semantic analysis on the judged fuzzy recognized characters, and the matched characters in the multi-class templates are respectively entered into a standard character semantic database to carry out single character semantic analysis;
step S44: combining the analyzed single characters with front and rear characters to form word frequency, analyzing and judging the use frequency of the word frequency by using a word frequency database, wherein the higher the use frequency of the word frequency is, the larger the sentence currency value in a sentence pattern is, so that the word frequency is put into the complete sentence pattern to analyze the sentence currency value K of the sentence pattern, the range of the sentence currency value K is [0,1], the word frequency is formed by combining the characters and the characters, the correct word frequency frequently used by people is stored in the database, and the word frequency is higher and the currency value is larger when the sentence is put into the sentence according to the use rate of the word frequency;
step S45: the character comprehensive calculation module calculates the character semantic comprehensive degree according to the calculated character similarity and the sentence currency value, and the calculation formula is as follows:
in the formula, Q is the semantic integration degree of the character, S is the similarity of the character, and K is the sentence currency value in the complete sentence pattern formed by the characters before and after the character is put in, i.e. when the similarity S of the character is close to or unchanged, the sentence currency value K is larger, the semantic integration degree Q is also larger.
Step S5 further includes the steps of:
step S51: and the judgment confirming module judges the standard character in the reference module corresponding to the confirmed character according to the comprehensive degree calculated by the character comprehensive calculating module.
It is noted that, herein, relational terms such as first and second, and the like may be used solely to distinguish one entity or action from another entity or action without necessarily requiring or implying any actual such relationship or order between such entities or actions. Also, the terms "comprises," "comprising," or any other variation thereof, are intended to cover a non-exclusive inclusion, such that a process, method, article, or apparatus that comprises a list of elements does not include only those elements but may include other elements not expressly listed or inherent to such process, method, article, or apparatus.
Finally, it should be noted that: although the present invention has been described in detail with reference to the foregoing embodiments, it will be apparent to those skilled in the art that changes may be made in the embodiments and/or equivalents thereof without departing from the spirit and scope of the invention. Any modification, equivalent replacement, or improvement made within the spirit and principle of the present invention should be included in the protection scope of the present invention.
Claims (10)
1. The utility model provides a language and image understanding integration service system based on artificial intelligence, includes preprocessing module, calculates analysis module and judges display module, its characterized in that: the device comprises a preprocessing module, a calculation and analysis module, a judgment display module, a calculation and analysis module and a display module, wherein the preprocessing module is used for analyzing and preprocessing the scanned and input character content, the calculation and analysis module is used for identifying semantic information of scanned characters and calculating the actual distance of characteristic vectors of the scanned characters, the judgment display module is used for judging and confirming the comprehensive calculation result of the characters and displaying the comprehensive calculation result, the preprocessing module is electrically connected with the calculation and analysis module, and the calculation and analysis module is electrically connected with the judgment display module.
2. The system of claim 1, wherein the service system comprises: the preprocessing module comprises a scanning input module, a character image preprocessing module and a similar character template establishing module, wherein the scanning input module is used for scanning and extracting characters on pictures and files to be recognized, the character image preprocessing module is used for carrying out preprocessing work such as segmentation, binaryzation, smooth denoising, thinning and normalization on character text images recorded by scanning, and the similar character template establishing module is used for establishing sample templates with multiple text dimensions such as similar characters, symbols, numbers, letters and the like.
3. The system of claim 1, wherein the service system comprises: the calculation and analysis module comprises a feature vector distance module, a semantic analysis module and a character comprehensive calculation module, the feature vector distance module is used for calculating the feature vector distance of a single divided character, the analysis and judgment module is used for judging whether the character belongs to the character needing to be added with semantic understanding to carry out comprehensive calculation, the semantic analysis module is used for analyzing the meaning of the character in combination with the meaning of the front character and the meaning of the rear character, the character comprehensive calculation module is used for carrying out comprehensive calculation according to the semantics and the vector distance to confirm the character, the feature vector distance module is electrically connected with the analysis and judgment module, and the analysis and judgment module is electrically connected with the character comprehensive calculation module.
4. The artificial intelligence based language and image understanding integrated service system according to claim 3, wherein: the system comprises a characteristic vector distance module, a template database, a character distance module, a semantic analysis module and a front and back character semantic analysis module, wherein the characteristic vector distance module comprises an actual character distance calculation submodule and a similar character distance calculation submodule, the actual character distance calculation submodule is used for calculating a standard distance value between a characteristic vector of a current character and the character in the template database, the similar character distance calculation submodule is used for calculating the distance matching calculation between the character and the similar character in the template database, the semantic analysis module comprises a single-character semantic analysis submodule and a front and back character semantic analysis submodule, the single-character semantic analysis submodule is used for analyzing the analysis meaning of the character in the database template corresponding to the current recognized character, and the front and back character semantic analysis submodule is used for analyzing semantic information of front and back adjacent characters of the current character in the database.
5. The integrated service system for language and image understanding based on artificial intelligence of claim 4, wherein: the judgment display module comprises a judgment confirmation module and a character presentation module, the judgment confirmation module is used for confirming the accurate characters in the database corresponding to the current characters according to the calculated comprehensive character information, and the character presentation module is used for presenting and displaying the correct characters.
6. The artificial intelligence based language and image understanding integrated service system according to claim 5, wherein: the operation method of the language and image understanding integrated service system mainly comprises the following steps:
step S1: a user puts a picture or a file to be identified into an identification position to click an identification function, a preprocessing module analyzes character and character scanning and inputting, and preprocessing steps such as image area division, characteristic vector extraction and the like are carried out;
step S2: synchronously establishing a similar character template module in an existing reference character template, adjusting the character position in the character template according to the font similarity of the preprocessed character, and establishing a character morphological characteristic multi-template library when the character with larger similarity is at an adjacent position;
step S3: carrying out feature vector distance calculation on the preprocessed characters, comparing and judging the calculated distance with a character form feature template library, and judging character accuracy and fuzziness;
step S4: the semantic analysis module performs single-character semantic analysis and front-and-back character semantic analysis on the characters with ambiguity, and the character comprehensive calculation module performs comprehensive calculation on the characters with ambiguity according to the similarity value and the semantic analysis value;
step S5: and judging the final character corresponding to the comprehensive calculation result, and displaying.
7. The integrated service system for speech and image understanding based on artificial intelligence of claim 6, wherein: the step S2 further includes the steps of:
step S21: the character image preprocessing module extracts feature vectors of each character to be recognized in the horizontal direction, the vertical direction, the left falling direction and the right falling direction by using a skeleton extraction technology to obtain a four-dimensional feature vector X (X) of the character to be recognized1,x2,x3,x4)T;
Step S22: according to the set existing m-class template mode W of the reference character1,W2,W3,WmEstablishing a four-dimensional feature vector Yi=(yi1,yi2,yi3,yi4) Class i schema W of the representationiThe reference template of (1).
8. The artificial intelligence based language and image understanding integrated service system according to claim 7, wherein: the step S3 further includes the steps of:
step S31: n to-be-recognized characters extracted from character image preprocessing moduleFour-dimensional character vector X ═ X1,x2,x3,x4)TThe sample is automatically identified to a class template W matched with the reference template by using a clustering algorithmiThe clustering algorithm is to compare and analyze the recognized character feature vectors of the same category with the character templates of the category in the template library;
step S32: when the clustering algorithm automatically identifies more than one type of matchable reference template, the best matched first type of template W is removediAnd other templates W with higher similaritym;
Step S33: the similar character distance calculation submodule calculates the distance between the similar character and the W by utilizing a four-dimensional characteristic vector calculation formulaiDistance d (X, Y) between quasi-reference templatesi) And similarity, and other kinds of templates WmD (X, Y) of the twom) And similarity;
step S34: the actual character distance calculation submodule calculates the distance between the actual character distance and the W according to a four-dimensional feature vector calculation formulaiDistance d (X, Y) between quasi-reference templatesi),:
In the formula, x1,x2,x3,x4Feature vector y representing four directions of horizontal, vertical, left-falling and right-falling of character actually measured1,y2,y3,y4Represents WiThe standard horizontal, vertical, left-falling and right-falling characteristic vectors of the characters in the class reference template, and the larger the difference of the dimensional values of the two characteristic vectors is, the larger the distance is;
9. The artificial intelligence based language and image understanding integrated service system according to claim 6, wherein: the step S4 further includes the steps of:
step S41: when the clustering algorithm automatically identifies and matches a unique reference template, the analysis and judgment module judges the identification as accurate identification, and directly displays the identification without semantic analysis;
step S42: when the clustering algorithm automatically identifies and matches a plurality of types of reference templates, analyzing and judging the calculated similarity, when the similarity is large, using the template with the large similarity as the accurate identification and not performing semantic analysis, and when the calculated similarity is small, judging the character as fuzzy identification;
step S43: the semantic analysis module carries out semantic analysis on the judged fuzzy recognized characters, and the matched characters in the multi-class templates are respectively entered into a standard character semantic database to carry out single character semantic analysis;
step S44: combining the analyzed single characters with front and rear characters to form word frequency, analyzing and judging the use frequency of the word frequency by using a word frequency database, wherein the higher the use frequency of the word frequency is, the larger the sentence currency value in the sentence pattern is, so that the sentence currency value K of the sentence pattern is analyzed by placing the word frequency in a complete sentence pattern, and the range of the sentence currency value K is [0,1 ];
step S45: the character comprehensive calculation module calculates the character semantic comprehensive degree according to the calculated character similarity and the sentence currency value, and the calculation formula is as follows:
in the formula, Q is the semantic integration degree of the character, S is the similarity of the character, and K is the sentence currency value in the complete sentence pattern formed by the characters before and after the character is put in, i.e. when the similarity S of the character is close to or unchanged, the sentence currency value K is larger, the semantic integration degree Q is also larger.
10. The artificial intelligence based language and image understanding integrated service system according to claim 6, wherein: the step S5 further includes the steps of:
step S51: and the judgment confirming module judges the standard character in the reference module corresponding to the confirmed character according to the comprehensive degree calculated by the character comprehensive calculating module.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202210092447.0A CN114580429A (en) | 2022-01-26 | 2022-01-26 | Artificial intelligence-based language and image understanding integrated service system |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202210092447.0A CN114580429A (en) | 2022-01-26 | 2022-01-26 | Artificial intelligence-based language and image understanding integrated service system |
Publications (1)
Publication Number | Publication Date |
---|---|
CN114580429A true CN114580429A (en) | 2022-06-03 |
Family
ID=81771989
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202210092447.0A Pending CN114580429A (en) | 2022-01-26 | 2022-01-26 | Artificial intelligence-based language and image understanding integrated service system |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN114580429A (en) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN117037184A (en) * | 2023-10-10 | 2023-11-10 | 深圳牛图科技有限公司 | OCR fuzzy recognition system and method based on cloud matching |
Citations (20)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101697201A (en) * | 2009-09-25 | 2010-04-21 | 华南理工大学 | Handwritten Chinese character identification method irrelevant to rotation direction |
CN104809451A (en) * | 2015-05-15 | 2015-07-29 | 河海大学常州校区 | Handwriting authentication system based on stroke curvature detection |
CN107545391A (en) * | 2017-09-07 | 2018-01-05 | 安徽共生物流科技有限公司 | A kind of logistics document intellectual analysis and automatic storage method based on image recognition |
CN108564064A (en) * | 2018-04-28 | 2018-09-21 | 北京宙心科技有限公司 | A kind of efficient OCR recognizers of view-based access control model |
CN108615058A (en) * | 2018-05-10 | 2018-10-02 | 苏州大学 | A kind of method, apparatus of character recognition, equipment and readable storage medium storing program for executing |
CN109508712A (en) * | 2018-11-14 | 2019-03-22 | 衡阳师范学院 | A kind of Chinese written language recognition methods based on image |
CN109800762A (en) * | 2019-01-28 | 2019-05-24 | 安阳工学院 | A kind of fuzzy license plate recognizer based on the Dynamic Matching factor |
CN109840519A (en) * | 2019-01-25 | 2019-06-04 | 青岛盈智科技有限公司 | A kind of adaptive intelligent form recognition input device and its application method |
CN110533020A (en) * | 2018-05-25 | 2019-12-03 | 腾讯科技(深圳)有限公司 | A kind of recognition methods of text information, device and storage medium |
CN111401099A (en) * | 2018-12-28 | 2020-07-10 | 中国电信股份有限公司 | Text recognition method, device and storage medium |
CN111523532A (en) * | 2020-04-14 | 2020-08-11 | 广东小天才科技有限公司 | Method for correcting OCR character recognition error and terminal equipment |
CN111539414A (en) * | 2020-04-26 | 2020-08-14 | 梁华智能科技(上海)有限公司 | OCR image character recognition and character correction method and system |
CN111814485A (en) * | 2020-07-09 | 2020-10-23 | 倪亚晖 | Semantic analysis method and device based on massive standard document data |
CN112085011A (en) * | 2020-09-27 | 2020-12-15 | 中国建设银行股份有限公司 | OCR recognition result error correction method, device and storage medium |
CN112434691A (en) * | 2020-12-02 | 2021-03-02 | 上海三稻智能科技有限公司 | HS code matching and displaying method and system based on intelligent analysis and identification and storage medium |
CN112487774A (en) * | 2020-11-20 | 2021-03-12 | 深圳市鹰硕教育服务有限公司 | Writing form electronization method and device and electronic equipment |
CN112507806A (en) * | 2020-11-19 | 2021-03-16 | 深圳市鹰硕教育服务有限公司 | Intelligent classroom information interaction method and device and electronic equipment |
CN112562669A (en) * | 2020-12-01 | 2021-03-26 | 浙江方正印务有限公司 | Intelligent digital newspaper automatic summarization and voice interaction news chat method and system |
CN112613500A (en) * | 2020-12-21 | 2021-04-06 | 安徽科迅教育装备集团有限公司 | Campus dynamic scoring system based on deep learning |
CN112836650A (en) * | 2021-02-05 | 2021-05-25 | 广东电网有限责任公司广州供电局 | Semantic analysis method and system for quality inspection report scanning image table |
-
2022
- 2022-01-26 CN CN202210092447.0A patent/CN114580429A/en active Pending
Patent Citations (20)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101697201A (en) * | 2009-09-25 | 2010-04-21 | 华南理工大学 | Handwritten Chinese character identification method irrelevant to rotation direction |
CN104809451A (en) * | 2015-05-15 | 2015-07-29 | 河海大学常州校区 | Handwriting authentication system based on stroke curvature detection |
CN107545391A (en) * | 2017-09-07 | 2018-01-05 | 安徽共生物流科技有限公司 | A kind of logistics document intellectual analysis and automatic storage method based on image recognition |
CN108564064A (en) * | 2018-04-28 | 2018-09-21 | 北京宙心科技有限公司 | A kind of efficient OCR recognizers of view-based access control model |
CN108615058A (en) * | 2018-05-10 | 2018-10-02 | 苏州大学 | A kind of method, apparatus of character recognition, equipment and readable storage medium storing program for executing |
CN110533020A (en) * | 2018-05-25 | 2019-12-03 | 腾讯科技(深圳)有限公司 | A kind of recognition methods of text information, device and storage medium |
CN109508712A (en) * | 2018-11-14 | 2019-03-22 | 衡阳师范学院 | A kind of Chinese written language recognition methods based on image |
CN111401099A (en) * | 2018-12-28 | 2020-07-10 | 中国电信股份有限公司 | Text recognition method, device and storage medium |
CN109840519A (en) * | 2019-01-25 | 2019-06-04 | 青岛盈智科技有限公司 | A kind of adaptive intelligent form recognition input device and its application method |
CN109800762A (en) * | 2019-01-28 | 2019-05-24 | 安阳工学院 | A kind of fuzzy license plate recognizer based on the Dynamic Matching factor |
CN111523532A (en) * | 2020-04-14 | 2020-08-11 | 广东小天才科技有限公司 | Method for correcting OCR character recognition error and terminal equipment |
CN111539414A (en) * | 2020-04-26 | 2020-08-14 | 梁华智能科技(上海)有限公司 | OCR image character recognition and character correction method and system |
CN111814485A (en) * | 2020-07-09 | 2020-10-23 | 倪亚晖 | Semantic analysis method and device based on massive standard document data |
CN112085011A (en) * | 2020-09-27 | 2020-12-15 | 中国建设银行股份有限公司 | OCR recognition result error correction method, device and storage medium |
CN112507806A (en) * | 2020-11-19 | 2021-03-16 | 深圳市鹰硕教育服务有限公司 | Intelligent classroom information interaction method and device and electronic equipment |
CN112487774A (en) * | 2020-11-20 | 2021-03-12 | 深圳市鹰硕教育服务有限公司 | Writing form electronization method and device and electronic equipment |
CN112562669A (en) * | 2020-12-01 | 2021-03-26 | 浙江方正印务有限公司 | Intelligent digital newspaper automatic summarization and voice interaction news chat method and system |
CN112434691A (en) * | 2020-12-02 | 2021-03-02 | 上海三稻智能科技有限公司 | HS code matching and displaying method and system based on intelligent analysis and identification and storage medium |
CN112613500A (en) * | 2020-12-21 | 2021-04-06 | 安徽科迅教育装备集团有限公司 | Campus dynamic scoring system based on deep learning |
CN112836650A (en) * | 2021-02-05 | 2021-05-25 | 广东电网有限责任公司广州供电局 | Semantic analysis method and system for quality inspection report scanning image table |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN117037184A (en) * | 2023-10-10 | 2023-11-10 | 深圳牛图科技有限公司 | OCR fuzzy recognition system and method based on cloud matching |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US8731300B2 (en) | Handwritten word spotter system using synthesized typed queries | |
JP3292388B2 (en) | Method and apparatus for summarizing a document without decoding the document image | |
US9152883B2 (en) | System and method for increasing the accuracy of optical character recognition (OCR) | |
Park et al. | Automatic detection and recognition of Korean text in outdoor signboard images | |
Zagoris et al. | A document image retrieval system | |
US20080222095A1 (en) | Document management system | |
CN111523622B (en) | Method for simulating handwriting by mechanical arm based on characteristic image self-learning | |
CN111626292A (en) | Character recognition method of building indication mark based on deep learning technology | |
CN114937278A (en) | Text content extraction and identification method based on line text box word segmentation algorithm | |
CN114580429A (en) | Artificial intelligence-based language and image understanding integrated service system | |
Mohammad et al. | Contour-based character segmentation for printed Arabic text with diacritics | |
Rahiman et al. | An efficient character recognition system for handwritten Malayalam characters based on intensity variations | |
Liang et al. | An optimization methodology for document structure extraction on latin character documents | |
CN112036330A (en) | Text recognition method, text recognition device and readable storage medium | |
CN115311666A (en) | Image-text recognition method and device, computer equipment and storage medium | |
Wahlberg et al. | Data mining medieval documents by word spotting | |
CN112560849A (en) | Neural network algorithm-based grammar segmentation method and system | |
Behera et al. | Visual signature based identification of low-resolution document images | |
Li | An implementation of ocr system based on skeleton matching | |
Kumar et al. | A comparative Analysis of Feature Extraction Algorithms and Deep Learning Techniques for Detection from Natural Images | |
Sousa et al. | Ancient document recognition using fuzzy methods | |
CN113989822B (en) | Picture table content extraction method based on computer vision and natural language processing | |
JPH08287188A (en) | Character string recognition device | |
Kawtrakul et al. | Multifeature extraction for printed thai character recognition | |
Hewavitharana et al. | A statistical approach to sinhala handwriting recognition |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
RJ01 | Rejection of invention patent application after publication | ||
RJ01 | Rejection of invention patent application after publication |
Application publication date: 20220603 |