CN114187600A - Auxiliary system for intelligent management of metering assets - Google Patents
Auxiliary system for intelligent management of metering assets Download PDFInfo
- Publication number
- CN114187600A CN114187600A CN202111465318.3A CN202111465318A CN114187600A CN 114187600 A CN114187600 A CN 114187600A CN 202111465318 A CN202111465318 A CN 202111465318A CN 114187600 A CN114187600 A CN 114187600A
- Authority
- CN
- China
- Prior art keywords
- characters
- character
- recognition
- classifier
- image
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/21—Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
- G06F18/214—Generating training patterns; Bootstrap methods, e.g. bagging or boosting
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/10—Text processing
- G06F40/103—Formatting, i.e. changing of presentation of documents
- G06F40/109—Font handling; Temporal or kinetic typography
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/20—Natural language analysis
- G06F40/232—Orthographic correction, e.g. spell checking or vowelisation
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Artificial Intelligence (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- Data Mining & Analysis (AREA)
- General Health & Medical Sciences (AREA)
- Computational Linguistics (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Health & Medical Sciences (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Evolutionary Computation (AREA)
- Evolutionary Biology (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Bioinformatics & Computational Biology (AREA)
- Life Sciences & Earth Sciences (AREA)
- Character Discrimination (AREA)
Abstract
An auxiliary system for intelligent management of metering assets relates to the technical field of OCR recognition and comprises the following steps: the operation steps are start → login interface → input account password → check information → correct access to the system → upload photo → OCR character recognition → recognition preprocessing → feature extraction and dimension reduction → classifier design, training and actual recognition → OCR recognition post-processing → output result. The invention has the beneficial effects that: OCR technology has let everybody reduce equipment configuration, has reduced the human cost, has improved work efficiency.
Description
Technical Field
The invention relates to the technical field of OCR (optical character recognition), in particular to an auxiliary system for intelligent management of metering assets.
Background
The weight of the transformer, the position of the nameplate and the installation space limit the way of looking over the nameplate information, and in the prior working mode, when the transformer of a user is received and handled in the early stage, when the staff need to make a consignment checking order, transact the procedures of warehousing and ex-warehouse, record and maintain the paper, manually input the information of the electronic version of the mutual inductor, and check, accept, meter installation and power supply at the later stage, in order to match the field mutual inductor with the early-stage transformer for inspection, the mutual inductor information is required to be checked and recorded, and the system can be communicated with a worker handling the procedures of receiving and sending the mutual inductor by telephone to confirm whether the mutual inductor is matched or not and whether a certificate is verified or not, the work is complicated, the office must be supported by the information of the worker, and the electricity stealing prevention is realized during the work of checking the power mutual inductor of the electricity consumer, the paper data which are difficult to store and query are required to be checked for many years ago; the tool for intelligently managing and metering assets solves the problems of difficulty in checking mutual inductor information and data in work, long waiting time of a client in the process of checking mutual inductor information, acceptance and power supply user mutual inductor information and the like of a user.
Disclosure of Invention
The invention provides an auxiliary system for intelligent management of metering assets, which is characterized in that: the operation steps are beginning → login interface → input account password → check information → correctly enter the system → upload photos → OCR character recognition → recognition preprocessing → feature extraction and dimension reduction → classifier design, training and actual recognition → OCR recognition post-processing → output result; and if the input account password is incorrect, returning to the login interface.
The identification pretreatment is graying, and the color image is subjected to the substeps of noise reduction, binarization, character segmentation and normalization; after binarization, the image only has two colors, namely black and white, wherein one color is an image background, and the other color is a character to be identified; the noise reduction algorithm influences the feature extraction, characters in an image are divided into single characters by character segmentation, the characters are recognized word by word during recognition, if lines of the characters are inclined, inclination correction is usually carried out, normalization is carried out to regulate single character images to the same size, and a unified algorithm is applied under the same specification.
The characteristic extraction and dimension reduction: the characters are information for identifying characters, and each different character can be distinguished from other characters through the characteristics; the feature extraction of the numbers and the English letters is easy, because the numbers are only 10, the English letters are only 52, and all the letters are small character sets, the feature extraction is difficult for Chinese characters, the Chinese characters are large character sets, and 3755 first-level Chinese characters which are most commonly used in the national standard and Chinese light are used; the Chinese character has a complex structure and many characters with similar shapes, after determining which kind of characteristics are used, the characteristics are subjected to characteristic dimension reduction, the characteristics are represented by a vector, the dimension is the component number of the vector, if the dimension of the characteristics is too high, the efficiency of the classifier can be greatly influenced, the dimension reduction needs to reduce the dimension, and the characteristic vector after the dimension reduction also reserves enough information content to distinguish different characters.
The classifier design, training and actual recognition: the classifier is used for identification, such as feature extraction and dimension reduction, for a character image, features are extracted and sent to the classifier, the classifier classifies the character image, which character the features are identified into is determined, and before actual identification, the classifier is trained to supervise learning cases.
The OCR recognition post-processing: the classification result is optimized by post-processing, the classification of the classifier occasionally makes mistakes, such as the recognition of Chinese characters, because of the existence of the shape-close characters in the Chinese characters, a character is easily recognized as the shape-close character, and the problem can be solved in the post-processing, such as the correction is carried out through a language model, if the classifier recognizes the 'where' to store ', the' where 'to store' is found to be wrong through the language model, and then the correction is carried out; the OCR recognition image has a large amount of characters which have complex conditions of typesetting, font size and the like, and the recognition result is formatted in the post-processing and output according to the typesetting arrangement in the image.
The uploading photo is an uploading mutual inductor nameplate photo.
The core of the invention is to use artificial intelligence OCR (character recognition) technology, which refers to the process of analyzing and recognizing the image file of the text data to obtain characters and layout information, and also recognizes the characters in the image and returns the characters in the form of text.
Through the accurate mutual-inductor data plate information of different models of different producers of discernment of using this technique for the mutual-inductor data plate information direct conversion who shoots the scanning can edit the characters type and record in this instrument, exports the form of selected data with the Excel form simultaneously, has not only solved the problem of the information that the operational environment caused under the complicacy and check the difficulty, record check consuming time of a specified duration through the scanning of shooing.
The invention has the beneficial effects that: through the development of the invention, the papery record and the secondary manual electronic record when the former customer inspects the power transformer are changed into the electronic record which is changed along with the scanning and the checking, and the field acceptance inspection of the power transformer, the power supply operation inspection of the power transformer, the anti-electricity-stealing inspection of the power transformer and the later maintenance of the power transformer are changed into the working mode that the user information can be mastered along with the scanning and the checking in order to take out the mobile phone which is carried along. From slowly reading to grasp at any time, from original manual work to present AI intelligence, what improve is not only work efficiency, operating mass, has changed the working method from the root, has also saved the time of handling procedures, waiting to complete the acceptance, power supply for the customer simultaneously, faster, better for user service.
OCR technology has let everybody reduce equipment configuration, has reduced the human cost, has improved work efficiency.
Description of the drawings:
FIG. 1 is a block diagram of the process of the present invention.
Detailed Description
Embodiment 1, an auxiliary system for intelligent management of a metered asset, characterized in that: the operation steps are beginning → login interface → input account password → check information → correctly enter the system → upload photos → OCR character recognition → recognition preprocessing → feature extraction and dimension reduction → classifier design, training and actual recognition → OCR recognition post-processing → output result; and if the input account password is incorrect, returning to the login interface.
The identification pretreatment is graying, and the color image is subjected to the substeps of noise reduction, binarization, character segmentation and normalization; after binarization, the image only has two colors, namely black and white, wherein one color is an image background, and the other color is a character to be identified; the noise reduction algorithm influences the feature extraction, characters in an image are divided into single characters by character segmentation, the characters are recognized word by word during recognition, if lines of the characters are inclined, inclination correction is usually carried out, normalization is carried out to regulate single character images to the same size, and a unified algorithm is applied under the same specification.
The characteristic extraction and dimension reduction: the characters are information for identifying characters, and each different character can be distinguished from other characters through the characteristics; the feature extraction of the numbers and the English letters is easy, because the numbers are only 10, the English letters are only 52, and all the letters are small character sets, the feature extraction is difficult for Chinese characters, the Chinese characters are large character sets, and 3755 first-level Chinese characters which are most commonly used in the national standard and Chinese light are used; the Chinese character has a complex structure and many characters with similar shapes, after determining which kind of characteristics are used, the characteristics are subjected to characteristic dimension reduction, the characteristics are represented by a vector, the dimension is the component number of the vector, if the dimension of the characteristics is too high, the efficiency of the classifier can be greatly influenced, the dimension reduction needs to reduce the dimension, and the characteristic vector after the dimension reduction also reserves enough information content to distinguish different characters.
The classifier design, training and actual recognition: the classifier is used for identification, such as feature extraction and dimension reduction, for a character image, features are extracted and sent to the classifier, the classifier classifies the character image, which character the features are identified into is determined, and before actual identification, the classifier is trained to supervise learning cases.
The OCR recognition post-processing: the classification result is optimized by post-processing, the classification of the classifier occasionally makes mistakes, such as the recognition of Chinese characters, because of the existence of the shape-close characters in the Chinese characters, a character is easily recognized as the shape-close character, and the problem can be solved in the post-processing, such as the correction is carried out through a language model, if the classifier recognizes the 'where' to store ', the' where 'to store' is found to be wrong through the language model, and then the correction is carried out; the OCR recognition image has a large amount of characters which have complex conditions of typesetting, font size and the like, and the recognition result is formatted in the post-processing and output according to the typesetting arrangement in the image.
Example 2; the utility model provides an auxiliary system of intelligent management of measurement mutual-inductor which characterized in that: the operation steps are beginning → login interface → input account password → check information → correct entering system → upload mutual inductor information card photo → OCR character recognition → recognition preprocessing → feature extraction and dimension reduction → classifier design, training and actual recognition → OCR recognition post-processing → output result; and if the input account password is incorrect, returning to the login interface.
Claims (6)
1. An auxiliary system for intelligent management of metering assets is characterized in that: the operation steps are beginning → login interface → input account password → check information → correctly enter the system → upload photos → OCR character recognition → recognition preprocessing → feature extraction and dimension reduction → classifier design, training and actual recognition → OCR recognition post-processing → output result; and if the input account password is incorrect, returning to the login interface.
2. The support system for the intelligent management of a metered asset as claimed in claim 1, wherein: the identification pretreatment is graying, and the color image is subjected to the substeps of noise reduction, binarization, character segmentation and normalization; after binarization, the image only has two colors, namely black and white, wherein one color is an image background, and the other color is a character to be identified; the noise reduction algorithm influences the feature extraction, characters in an image are divided into single characters by character segmentation, the characters are recognized word by word during recognition, if lines of the characters are inclined, inclination correction is usually carried out, normalization is carried out to regulate single character images to the same size, and a unified algorithm is applied under the same specification.
3. The support system for the intelligent management of a metered asset as claimed in claim 1, wherein: the characteristic extraction and dimension reduction: the characters are information for identifying characters, and each different character can be distinguished from other characters through the characteristics; the feature extraction of the numbers and the English letters is easy, because the numbers are only 10, the English letters are only 52, and all the letters are small character sets, the feature extraction is difficult for Chinese characters, the Chinese characters are large character sets, and 3755 first-level Chinese characters which are most commonly used in the national standard and Chinese light are used; the Chinese character has a complex structure and many characters with similar shapes, after determining which kind of characteristics are used, the characteristics are subjected to characteristic dimension reduction, the characteristics are represented by a vector, the dimension is the component number of the vector, if the dimension of the characteristics is too high, the efficiency of the classifier can be greatly influenced, the dimension reduction needs to reduce the dimension, and the characteristic vector after the dimension reduction also reserves enough information content to distinguish different characters.
4. The support system for the intelligent management of a metered asset as claimed in claim 1, wherein: the classifier design, training and actual recognition: the classifier is used for identification, such as feature extraction and dimension reduction, for a character image, features are extracted and sent to the classifier, the classifier classifies the character image, which character the features are identified into is determined, and before actual identification, the classifier is trained to supervise learning cases.
5. The support system for the intelligent management of a metered asset as claimed in claim 1, wherein: the OCR recognition post-processing: the classification result is optimized by post-processing, the classification of the classifier occasionally makes mistakes, such as the recognition of Chinese characters, because of the existence of the shape-close characters in the Chinese characters, a character is easily recognized as the shape-close character, and the problem can be solved in the post-processing, such as the correction is carried out through a language model, if the classifier recognizes the 'where' to store ', the' where 'to store' is found to be wrong through the language model, and then the correction is carried out; the OCR recognition image has a large amount of characters which have complex conditions of typesetting, font size and the like, and the recognition result is formatted in the post-processing and output according to the typesetting arrangement in the image.
6. The support system for the intelligent management of a metered asset as claimed in claim 1, wherein: the uploading photo is an uploading mutual inductor nameplate photo.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202111465318.3A CN114187600A (en) | 2021-12-03 | 2021-12-03 | Auxiliary system for intelligent management of metering assets |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202111465318.3A CN114187600A (en) | 2021-12-03 | 2021-12-03 | Auxiliary system for intelligent management of metering assets |
Publications (1)
Publication Number | Publication Date |
---|---|
CN114187600A true CN114187600A (en) | 2022-03-15 |
Family
ID=80542101
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202111465318.3A Pending CN114187600A (en) | 2021-12-03 | 2021-12-03 | Auxiliary system for intelligent management of metering assets |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN114187600A (en) |
-
2021
- 2021-12-03 CN CN202111465318.3A patent/CN114187600A/en active Pending
Similar Documents
Publication | Publication Date | Title |
---|---|---|
EP3432197B1 (en) | Method and device for identifying characters of claim settlement bill, server and storage medium | |
CN109887153B (en) | Finance and tax processing method and system | |
US8233751B2 (en) | Method and system for simplified recordkeeping including transcription and voting based verification | |
CN112508011A (en) | OCR (optical character recognition) method and device based on neural network | |
CN108960223A (en) | The method for automatically generating voucher based on bill intelligent recognition | |
CN112800848A (en) | Structured extraction method, device and equipment of information after bill identification | |
CN109598517B (en) | Commodity clearance processing, object processing and category prediction method and device thereof | |
CN112528041B (en) | Scheduling term specification verification method based on knowledge graph | |
CN109190594A (en) | Optical Character Recognition system and information extracting method | |
EP3588376A1 (en) | System and method for enrichment of ocr-extracted data | |
CN113569863B (en) | Document checking method, system, electronic equipment and storage medium | |
CN109446345A (en) | Nuclear power file verification processing method and system | |
CN111462388A (en) | Bill inspection method and device, terminal equipment and storage medium | |
CN116052186A (en) | Multi-mode invoice automatic classification and identification method, verification method and system | |
CN114612905A (en) | Invoice processing method, device, equipment and medium based on RPA and AI | |
CN117807967A (en) | Financial account reporting method and device based on OCR intelligent form filling and electronic equipment | |
CN114187600A (en) | Auxiliary system for intelligent management of metering assets | |
CN116110066A (en) | Information extraction method, device and equipment of bill text and storage medium | |
CN115984885A (en) | Work order management method and system for marketing field operation | |
CN113935296A (en) | Method for extracting paper bank flow information by using sliding template technology | |
CN115482075A (en) | Financial data anomaly analysis method and device, electronic equipment and storage medium | |
TW202316312A (en) | Accounting management system for recognizes accounting voucher image to automatically obtain accounting related information | |
CN114549177A (en) | Insurance letter examination method, device, system and computer readable storage medium | |
Yuan et al. | Intelligent Work Order Recognition System Based on End-to-End Deep Neural Network | |
CN112348022A (en) | Free-form document identification method based on deep learning |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination |