CN114187600A - Auxiliary system for intelligent management of metering assets - Google Patents

Auxiliary system for intelligent management of metering assets Download PDF

Info

Publication number
CN114187600A
CN114187600A CN202111465318.3A CN202111465318A CN114187600A CN 114187600 A CN114187600 A CN 114187600A CN 202111465318 A CN202111465318 A CN 202111465318A CN 114187600 A CN114187600 A CN 114187600A
Authority
CN
China
Prior art keywords
characters
character
recognition
classifier
image
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202111465318.3A
Other languages
Chinese (zh)
Inventor
郭琰
王瑾
吕甜甜
任建民
王浩
甘源
穆萍
杨媛
星玥
李永国
周洪煜
潘沙沙
乔阳波
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Haidong Power Supply Company State Grid Qinghai Electric Power Co ltd
State Grid Corp of China SGCC
State Grid Qinghai Electric Power Co Ltd
Original Assignee
Haidong Power Supply Company State Grid Qinghai Electric Power Co ltd
State Grid Corp of China SGCC
State Grid Qinghai Electric Power Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Haidong Power Supply Company State Grid Qinghai Electric Power Co ltd, State Grid Corp of China SGCC, State Grid Qinghai Electric Power Co Ltd filed Critical Haidong Power Supply Company State Grid Qinghai Electric Power Co ltd
Priority to CN202111465318.3A priority Critical patent/CN114187600A/en
Publication of CN114187600A publication Critical patent/CN114187600A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/21Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
    • G06F18/214Generating training patterns; Bootstrap methods, e.g. bagging or boosting
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/10Text processing
    • G06F40/103Formatting, i.e. changing of presentation of documents
    • G06F40/109Font handling; Temporal or kinetic typography
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/232Orthographic correction, e.g. spell checking or vowelisation

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Artificial Intelligence (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • General Health & Medical Sciences (AREA)
  • Computational Linguistics (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Health & Medical Sciences (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Evolutionary Computation (AREA)
  • Evolutionary Biology (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Character Discrimination (AREA)

Abstract

An auxiliary system for intelligent management of metering assets relates to the technical field of OCR recognition and comprises the following steps: the operation steps are start → login interface → input account password → check information → correct access to the system → upload photo → OCR character recognition → recognition preprocessing → feature extraction and dimension reduction → classifier design, training and actual recognition → OCR recognition post-processing → output result. The invention has the beneficial effects that: OCR technology has let everybody reduce equipment configuration, has reduced the human cost, has improved work efficiency.

Description

Auxiliary system for intelligent management of metering assets
Technical Field
The invention relates to the technical field of OCR (optical character recognition), in particular to an auxiliary system for intelligent management of metering assets.
Background
The weight of the transformer, the position of the nameplate and the installation space limit the way of looking over the nameplate information, and in the prior working mode, when the transformer of a user is received and handled in the early stage, when the staff need to make a consignment checking order, transact the procedures of warehousing and ex-warehouse, record and maintain the paper, manually input the information of the electronic version of the mutual inductor, and check, accept, meter installation and power supply at the later stage, in order to match the field mutual inductor with the early-stage transformer for inspection, the mutual inductor information is required to be checked and recorded, and the system can be communicated with a worker handling the procedures of receiving and sending the mutual inductor by telephone to confirm whether the mutual inductor is matched or not and whether a certificate is verified or not, the work is complicated, the office must be supported by the information of the worker, and the electricity stealing prevention is realized during the work of checking the power mutual inductor of the electricity consumer, the paper data which are difficult to store and query are required to be checked for many years ago; the tool for intelligently managing and metering assets solves the problems of difficulty in checking mutual inductor information and data in work, long waiting time of a client in the process of checking mutual inductor information, acceptance and power supply user mutual inductor information and the like of a user.
Disclosure of Invention
The invention provides an auxiliary system for intelligent management of metering assets, which is characterized in that: the operation steps are beginning → login interface → input account password → check information → correctly enter the system → upload photos → OCR character recognition → recognition preprocessing → feature extraction and dimension reduction → classifier design, training and actual recognition → OCR recognition post-processing → output result; and if the input account password is incorrect, returning to the login interface.
The identification pretreatment is graying, and the color image is subjected to the substeps of noise reduction, binarization, character segmentation and normalization; after binarization, the image only has two colors, namely black and white, wherein one color is an image background, and the other color is a character to be identified; the noise reduction algorithm influences the feature extraction, characters in an image are divided into single characters by character segmentation, the characters are recognized word by word during recognition, if lines of the characters are inclined, inclination correction is usually carried out, normalization is carried out to regulate single character images to the same size, and a unified algorithm is applied under the same specification.
The characteristic extraction and dimension reduction: the characters are information for identifying characters, and each different character can be distinguished from other characters through the characteristics; the feature extraction of the numbers and the English letters is easy, because the numbers are only 10, the English letters are only 52, and all the letters are small character sets, the feature extraction is difficult for Chinese characters, the Chinese characters are large character sets, and 3755 first-level Chinese characters which are most commonly used in the national standard and Chinese light are used; the Chinese character has a complex structure and many characters with similar shapes, after determining which kind of characteristics are used, the characteristics are subjected to characteristic dimension reduction, the characteristics are represented by a vector, the dimension is the component number of the vector, if the dimension of the characteristics is too high, the efficiency of the classifier can be greatly influenced, the dimension reduction needs to reduce the dimension, and the characteristic vector after the dimension reduction also reserves enough information content to distinguish different characters.
The classifier design, training and actual recognition: the classifier is used for identification, such as feature extraction and dimension reduction, for a character image, features are extracted and sent to the classifier, the classifier classifies the character image, which character the features are identified into is determined, and before actual identification, the classifier is trained to supervise learning cases.
The OCR recognition post-processing: the classification result is optimized by post-processing, the classification of the classifier occasionally makes mistakes, such as the recognition of Chinese characters, because of the existence of the shape-close characters in the Chinese characters, a character is easily recognized as the shape-close character, and the problem can be solved in the post-processing, such as the correction is carried out through a language model, if the classifier recognizes the 'where' to store ', the' where 'to store' is found to be wrong through the language model, and then the correction is carried out; the OCR recognition image has a large amount of characters which have complex conditions of typesetting, font size and the like, and the recognition result is formatted in the post-processing and output according to the typesetting arrangement in the image.
The uploading photo is an uploading mutual inductor nameplate photo.
The core of the invention is to use artificial intelligence OCR (character recognition) technology, which refers to the process of analyzing and recognizing the image file of the text data to obtain characters and layout information, and also recognizes the characters in the image and returns the characters in the form of text.
Through the accurate mutual-inductor data plate information of different models of different producers of discernment of using this technique for the mutual-inductor data plate information direct conversion who shoots the scanning can edit the characters type and record in this instrument, exports the form of selected data with the Excel form simultaneously, has not only solved the problem of the information that the operational environment caused under the complicacy and check the difficulty, record check consuming time of a specified duration through the scanning of shooing.
The invention has the beneficial effects that: through the development of the invention, the papery record and the secondary manual electronic record when the former customer inspects the power transformer are changed into the electronic record which is changed along with the scanning and the checking, and the field acceptance inspection of the power transformer, the power supply operation inspection of the power transformer, the anti-electricity-stealing inspection of the power transformer and the later maintenance of the power transformer are changed into the working mode that the user information can be mastered along with the scanning and the checking in order to take out the mobile phone which is carried along. From slowly reading to grasp at any time, from original manual work to present AI intelligence, what improve is not only work efficiency, operating mass, has changed the working method from the root, has also saved the time of handling procedures, waiting to complete the acceptance, power supply for the customer simultaneously, faster, better for user service.
OCR technology has let everybody reduce equipment configuration, has reduced the human cost, has improved work efficiency.
Description of the drawings:
FIG. 1 is a block diagram of the process of the present invention.
Detailed Description
Embodiment 1, an auxiliary system for intelligent management of a metered asset, characterized in that: the operation steps are beginning → login interface → input account password → check information → correctly enter the system → upload photos → OCR character recognition → recognition preprocessing → feature extraction and dimension reduction → classifier design, training and actual recognition → OCR recognition post-processing → output result; and if the input account password is incorrect, returning to the login interface.
The identification pretreatment is graying, and the color image is subjected to the substeps of noise reduction, binarization, character segmentation and normalization; after binarization, the image only has two colors, namely black and white, wherein one color is an image background, and the other color is a character to be identified; the noise reduction algorithm influences the feature extraction, characters in an image are divided into single characters by character segmentation, the characters are recognized word by word during recognition, if lines of the characters are inclined, inclination correction is usually carried out, normalization is carried out to regulate single character images to the same size, and a unified algorithm is applied under the same specification.
The characteristic extraction and dimension reduction: the characters are information for identifying characters, and each different character can be distinguished from other characters through the characteristics; the feature extraction of the numbers and the English letters is easy, because the numbers are only 10, the English letters are only 52, and all the letters are small character sets, the feature extraction is difficult for Chinese characters, the Chinese characters are large character sets, and 3755 first-level Chinese characters which are most commonly used in the national standard and Chinese light are used; the Chinese character has a complex structure and many characters with similar shapes, after determining which kind of characteristics are used, the characteristics are subjected to characteristic dimension reduction, the characteristics are represented by a vector, the dimension is the component number of the vector, if the dimension of the characteristics is too high, the efficiency of the classifier can be greatly influenced, the dimension reduction needs to reduce the dimension, and the characteristic vector after the dimension reduction also reserves enough information content to distinguish different characters.
The classifier design, training and actual recognition: the classifier is used for identification, such as feature extraction and dimension reduction, for a character image, features are extracted and sent to the classifier, the classifier classifies the character image, which character the features are identified into is determined, and before actual identification, the classifier is trained to supervise learning cases.
The OCR recognition post-processing: the classification result is optimized by post-processing, the classification of the classifier occasionally makes mistakes, such as the recognition of Chinese characters, because of the existence of the shape-close characters in the Chinese characters, a character is easily recognized as the shape-close character, and the problem can be solved in the post-processing, such as the correction is carried out through a language model, if the classifier recognizes the 'where' to store ', the' where 'to store' is found to be wrong through the language model, and then the correction is carried out; the OCR recognition image has a large amount of characters which have complex conditions of typesetting, font size and the like, and the recognition result is formatted in the post-processing and output according to the typesetting arrangement in the image.
Example 2; the utility model provides an auxiliary system of intelligent management of measurement mutual-inductor which characterized in that: the operation steps are beginning → login interface → input account password → check information → correct entering system → upload mutual inductor information card photo → OCR character recognition → recognition preprocessing → feature extraction and dimension reduction → classifier design, training and actual recognition → OCR recognition post-processing → output result; and if the input account password is incorrect, returning to the login interface.

Claims (6)

1. An auxiliary system for intelligent management of metering assets is characterized in that: the operation steps are beginning → login interface → input account password → check information → correctly enter the system → upload photos → OCR character recognition → recognition preprocessing → feature extraction and dimension reduction → classifier design, training and actual recognition → OCR recognition post-processing → output result; and if the input account password is incorrect, returning to the login interface.
2. The support system for the intelligent management of a metered asset as claimed in claim 1, wherein: the identification pretreatment is graying, and the color image is subjected to the substeps of noise reduction, binarization, character segmentation and normalization; after binarization, the image only has two colors, namely black and white, wherein one color is an image background, and the other color is a character to be identified; the noise reduction algorithm influences the feature extraction, characters in an image are divided into single characters by character segmentation, the characters are recognized word by word during recognition, if lines of the characters are inclined, inclination correction is usually carried out, normalization is carried out to regulate single character images to the same size, and a unified algorithm is applied under the same specification.
3. The support system for the intelligent management of a metered asset as claimed in claim 1, wherein: the characteristic extraction and dimension reduction: the characters are information for identifying characters, and each different character can be distinguished from other characters through the characteristics; the feature extraction of the numbers and the English letters is easy, because the numbers are only 10, the English letters are only 52, and all the letters are small character sets, the feature extraction is difficult for Chinese characters, the Chinese characters are large character sets, and 3755 first-level Chinese characters which are most commonly used in the national standard and Chinese light are used; the Chinese character has a complex structure and many characters with similar shapes, after determining which kind of characteristics are used, the characteristics are subjected to characteristic dimension reduction, the characteristics are represented by a vector, the dimension is the component number of the vector, if the dimension of the characteristics is too high, the efficiency of the classifier can be greatly influenced, the dimension reduction needs to reduce the dimension, and the characteristic vector after the dimension reduction also reserves enough information content to distinguish different characters.
4. The support system for the intelligent management of a metered asset as claimed in claim 1, wherein: the classifier design, training and actual recognition: the classifier is used for identification, such as feature extraction and dimension reduction, for a character image, features are extracted and sent to the classifier, the classifier classifies the character image, which character the features are identified into is determined, and before actual identification, the classifier is trained to supervise learning cases.
5. The support system for the intelligent management of a metered asset as claimed in claim 1, wherein: the OCR recognition post-processing: the classification result is optimized by post-processing, the classification of the classifier occasionally makes mistakes, such as the recognition of Chinese characters, because of the existence of the shape-close characters in the Chinese characters, a character is easily recognized as the shape-close character, and the problem can be solved in the post-processing, such as the correction is carried out through a language model, if the classifier recognizes the 'where' to store ', the' where 'to store' is found to be wrong through the language model, and then the correction is carried out; the OCR recognition image has a large amount of characters which have complex conditions of typesetting, font size and the like, and the recognition result is formatted in the post-processing and output according to the typesetting arrangement in the image.
6. The support system for the intelligent management of a metered asset as claimed in claim 1, wherein: the uploading photo is an uploading mutual inductor nameplate photo.
CN202111465318.3A 2021-12-03 2021-12-03 Auxiliary system for intelligent management of metering assets Pending CN114187600A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202111465318.3A CN114187600A (en) 2021-12-03 2021-12-03 Auxiliary system for intelligent management of metering assets

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202111465318.3A CN114187600A (en) 2021-12-03 2021-12-03 Auxiliary system for intelligent management of metering assets

Publications (1)

Publication Number Publication Date
CN114187600A true CN114187600A (en) 2022-03-15

Family

ID=80542101

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202111465318.3A Pending CN114187600A (en) 2021-12-03 2021-12-03 Auxiliary system for intelligent management of metering assets

Country Status (1)

Country Link
CN (1) CN114187600A (en)

Similar Documents

Publication Publication Date Title
EP3432197B1 (en) Method and device for identifying characters of claim settlement bill, server and storage medium
CN109887153B (en) Finance and tax processing method and system
US8233751B2 (en) Method and system for simplified recordkeeping including transcription and voting based verification
CN112508011A (en) OCR (optical character recognition) method and device based on neural network
CN108960223A (en) The method for automatically generating voucher based on bill intelligent recognition
CN112800848A (en) Structured extraction method, device and equipment of information after bill identification
CN109598517B (en) Commodity clearance processing, object processing and category prediction method and device thereof
CN112528041B (en) Scheduling term specification verification method based on knowledge graph
CN109190594A (en) Optical Character Recognition system and information extracting method
EP3588376A1 (en) System and method for enrichment of ocr-extracted data
CN113569863B (en) Document checking method, system, electronic equipment and storage medium
CN109446345A (en) Nuclear power file verification processing method and system
CN111462388A (en) Bill inspection method and device, terminal equipment and storage medium
CN116052186A (en) Multi-mode invoice automatic classification and identification method, verification method and system
CN114612905A (en) Invoice processing method, device, equipment and medium based on RPA and AI
CN117807967A (en) Financial account reporting method and device based on OCR intelligent form filling and electronic equipment
CN114187600A (en) Auxiliary system for intelligent management of metering assets
CN116110066A (en) Information extraction method, device and equipment of bill text and storage medium
CN115984885A (en) Work order management method and system for marketing field operation
CN113935296A (en) Method for extracting paper bank flow information by using sliding template technology
CN115482075A (en) Financial data anomaly analysis method and device, electronic equipment and storage medium
TW202316312A (en) Accounting management system for recognizes accounting voucher image to automatically obtain accounting related information
CN114549177A (en) Insurance letter examination method, device, system and computer readable storage medium
Yuan et al. Intelligent Work Order Recognition System Based on End-to-End Deep Neural Network
CN112348022A (en) Free-form document identification method based on deep learning

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination