CN118247095A - Automatic auditing method, system, equipment and medium for academic information - Google Patents

Automatic auditing method, system, equipment and medium for academic information Download PDF

Info

Publication number
CN118247095A
CN118247095A CN202311467324.1A CN202311467324A CN118247095A CN 118247095 A CN118247095 A CN 118247095A CN 202311467324 A CN202311467324 A CN 202311467324A CN 118247095 A CN118247095 A CN 118247095A
Authority
CN
China
Prior art keywords
academic
certificate
information
learning
auditing method
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202311467324.1A
Other languages
Chinese (zh)
Inventor
姚志峰
吉永栋
何柳
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
America Online Beijing Technology Co ltd
Original Assignee
America Online Beijing Technology Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by America Online Beijing Technology Co ltd filed Critical America Online Beijing Technology Co ltd
Priority to CN202311467324.1A priority Critical patent/CN118247095A/en
Publication of CN118247095A publication Critical patent/CN118247095A/en
Pending legal-status Critical Current

Links

Landscapes

  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

The invention discloses an automatic auditing method, system, equipment and medium for learning information, which relate to the technical field of information auditing, and are characterized in that first learning information filled by an examinee and an uploaded learning certificate are firstly obtained, then the learning certificate is identified to obtain second learning information contained in the learning certificate, and finally the first learning information and the second learning information are compared to audit the first learning information, so that the learning information can be automatically audited, and the problems of low auditing efficiency and easiness in error are solved.

Description

Automatic auditing method, system, equipment and medium for academic information
Technical Field
The present invention relates to the field of information auditing technologies, and in particular, to an automatic auditing method, system, device, and medium for learning information.
Background
At present, when an examination is entered through an online registration system, an examinee is usually required to input the academic information, after the online registration system collects the academic information of the examinee, the academic information is audited manually, a great deal of manpower is required to audit the academic information manually, the workload is high, the auditing efficiency is low, meanwhile, the manual auditing is easy to make mistakes, whether the academic information of the examinee is true or not cannot be judged correctly, and the acquired academic information of the examination is inaccurate.
Based on this, a scheme capable of automatically auditing the learning information is needed.
Disclosure of Invention
The invention aims to provide an automatic auditing method, system, equipment and medium for academic information, which can automatically audit the academic information and solve the problems of low auditing efficiency and easy error.
In order to achieve the above object, the present invention provides the following solutions:
an automatic auditing method for academic information, the automatic auditing method comprising:
acquiring first academic information filled in by an examinee and an uploaded academic certificate;
identifying the academic certificate to obtain second academic information included in the academic certificate;
And comparing the first academic information with the second academic information to audit the first academic information.
In some embodiments, the identifying the learning certificate, and obtaining the second learning information included in the learning certificate specifically includes:
performing word recognition on the academic certificate to obtain text information included in the academic certificate;
And extracting the characteristics of the text information to obtain second learning information contained in the learning certificate.
In some embodiments, the automatic auditing method further includes, prior to text recognition of the academic credentials:
Judging whether the format of the academic certificate is a picture or not;
if not, carrying out format conversion on the academic certificate, and converting the format of the academic certificate into a picture.
In some embodiments, the text information included in the learning certificate obtained by performing text recognition on the learning certificate specifically includes:
And performing character recognition on the academic certificate by utilizing OCR to obtain text information included in the academic certificate.
In some embodiments, after converting the format of the academic certificate to a picture, and before text recognition of the academic certificate, the automatic auditing method further includes:
preprocessing the academic certificate to obtain a preprocessed academic certificate, and taking the preprocessed academic certificate as a new academic certificate; the preprocessing includes denoising, graying, binarizing, and image enhancement.
In some embodiments, the extracting the features of the text information to obtain the second learning information included in the learning certificate specifically includes:
And extracting the characteristics of the text information by using an Aho-Corasick algorithm to obtain second learning information contained in the learning certificate.
In some embodiments, the comparing the first and second learning information specifically includes:
And comparing the first learning information with the second learning information by using a naive algorithm.
An automatic audit system of academic information, the automatic audit system comprising:
the data acquisition module is used for acquiring the first academic information filled in by the examinee and the uploaded academic certificate;
the certificate identification module is used for identifying the academic certificate to obtain second academic information included in the academic certificate;
And the information comparison module is used for comparing the first academic information with the second academic information so as to audit the first academic information.
An apparatus for automatically auditing learning information, comprising:
A processor; and
A memory in which computer-readable program instructions are stored,
Wherein the above-described automatic auditing method is performed when the computer readable program instructions are executed by the processor.
A computer readable storage medium having stored thereon a computer program which when executed by a processor performs the steps of the above-described automatic auditing method.
According to the specific embodiment provided by the invention, the invention discloses the following technical effects:
The invention aims to provide an automatic auditing method, system, equipment and medium for learning information, which are characterized in that first learning information filled by an examinee and an uploaded learning certificate are firstly obtained, then the learning certificate is identified to obtain second learning information contained in the learning certificate, and finally the first learning information and the second learning information are compared to audit the first learning information, so that the learning information can be automatically audited, and the problems of low auditing efficiency and easiness in error are solved.
Drawings
In order to more clearly illustrate the embodiments of the present invention or the technical solutions in the prior art, the drawings that are needed in the embodiments will be briefly described below, and it is obvious that the drawings in the following description are only some embodiments of the present invention, and other drawings may be obtained according to these drawings without inventive effort for a person skilled in the art.
FIG. 1 is a flow chart of the method for automatic auditing method according to embodiment 1 of the present invention;
FIG. 2 is a detailed flowchart of the automatic auditing method according to embodiment 1 of the present invention;
FIG. 3 is a schematic diagram of an electronic registration record table of education institution learning certificate according to the embodiment 1 of the present invention;
FIG. 4 is a schematic diagram of the comparison result provided in example 1 of the present invention;
Fig. 5 is a system block diagram of an automatic auditing system according to embodiment 2 of the present invention.
Detailed Description
The following description of the embodiments of the present invention will be made clearly and completely with reference to the accompanying drawings, in which it is apparent that the embodiments described are only some embodiments of the present invention, but not all embodiments. All other embodiments, which can be made by those skilled in the art based on the embodiments of the invention without making any inventive effort, are intended to be within the scope of the invention.
The invention aims to provide an automatic auditing method, system, equipment and medium for academic information, which can automatically audit the academic information and solve the problems of low auditing efficiency and easy error.
In order that the above-recited objects, features and advantages of the present invention will become more readily apparent, a more particular description of the invention will be rendered by reference to the appended drawings and appended detailed description.
Example 1:
the embodiment is used for providing an automatic auditing method for academic information, as shown in fig. 1 and fig. 2, the automatic auditing method includes:
S1: acquiring first academic information filled in by an examinee and an uploaded academic certificate;
When an examinee registers through the online registration system, the first academic information of the corresponding examinee is filled in according to the requirements of the online registration system, and the first academic information is correspondingly changed according to the different requirements of different online registration systems. The first learning information may be part or all of information on the learning certificate, and may include, for example, professions, learning categories, certificate numbers, and the like.
Meanwhile, when the examinee registers through the online registration system, the examinee needs to upload the personal academic certificate for subsequent automatic auditing of the first academic information, namely, the examinee uploads the academic certificate at the same time, and clicks and submits after uploading is finished.
Optionally, the learning certificate of the embodiment may be replaced by "education department learning certificate electronic registration record table", that is, the examinee uploads "education department learning certificate electronic registration record table" while filling the first learning information, and the examinee does not need to upload the learning certificate any more, and clicks and submits after uploading is completed, as shown in fig. 3, which provides a schematic diagram of "education department learning certificate electronic registration record table". The academic information included in the "education institution's academic certificate electronic registration record form" is the same as the academic information included in the academic certificate, all include name, gender, date of birth, date of admission, date of study, category of school, hierarchy, name of school, academic, specialty, form of study, number of certificate, date of study, long name of school, type of certificate and date of issue.
S2: identifying the academic certificate to obtain second academic information included in the academic certificate;
Specifically, S2 may include:
(1) And performing text recognition on the academic certificate to obtain text information included in the academic certificate.
(2) And extracting the characteristics of the text information to obtain second learning information contained in the learning certificate.
It should be noted that, the first learning information in this embodiment refers to learning information filled by the examinee, and the second learning information refers to learning information obtained by identifying the learning certificate, so that the first learning information and the second learning information are named only to distinguish two learning information of different sources.
In order to simplify the recognition process of the learning certificate, before performing text recognition on the learning certificate, the automatic auditing method of the embodiment further includes: judging whether the format of the academic certificate is a picture, if not, converting the format of the academic certificate into the picture, namely converting the format of the academic certificate into the academic certificate in the PDF format or other formats into the academic certificate in the picture format through file format conversion, acquiring the picture of the academic certificate, and then processing the picture to simplify the processing. It should be noted that the conversion of the file format may be performed by any existing format converter, which is not described herein.
Preferably, after converting the format of the learning certificate into the picture and before performing text recognition on the learning certificate, the automatic auditing method of this embodiment may further include: preprocessing the academic certificate in the picture format to obtain a preprocessed academic certificate, and executing a subsequent character recognition process by taking the preprocessed academic certificate as a new academic certificate. Wherein, the preprocessing comprises denoising, graying, binarization, image enhancement and the like so as to improve the accuracy and efficiency of the subsequent processing. The denoising, graying, binarization, and image enhancement can be achieved by any one of the existing methods, and this embodiment is not limited in any way.
In this embodiment, when performing text recognition on the learning certificate, any existing text recognition method may be selected, so long as the text recognition process can be completed, and this embodiment does not limit the text recognition process. Specifically, in this embodiment, performing text recognition on the learning certificate, and obtaining text information included in the learning certificate may include: and performing character recognition on the academic certificate by utilizing OCR (Optical Character Recognition and optical character recognition) to obtain text information contained in the academic certificate, and specifically, calling an OCR image-text recognition service to extract the text information contained in the academic certificate. OCR graphic recognition is a technology of converting various texts such as printed matter, handwriting, etc. into a computer-readable digital form, and its basic idea is to process scanned pictures, find out characters, words, lines and paragraphs therein, and convert them into a digital form recognizable by a computer, where the digital form may be text, marks, codes, etc. Through the character recognition process, all text information included in the academic certificate can be extracted.
The embodiment can use a feature extraction algorithm to perform feature extraction on the text information included in the extracted academic certificate, extract key information on the academic certificate from the text information, and obtain second academic information included in the academic certificate, and the feature extraction algorithm can adopt any existing algorithm as long as the feature extraction process can be completed, which is not limited in any way. Specifically, extracting features of the text information to obtain second learning information included in the learning certificate may include: the method comprises the steps of carrying out feature extraction on text information by using an Aho-Corasick algorithm to obtain second academic information contained in an academic certificate, specifically, obtaining matching result data (namely the second academic information) by using feature value matching based on the extracted text information, wherein the feature value comprises the specific contents of name, gender, date of birth, date of admission, category of academic, hierarchy, school name, academic, specialty, learning form, certificate number, date of graduation and school name, determining the positions of all feature values in the text information by using the Aho-Corasick algorithm, sequentially obtaining character strings from each feature value to the next feature value according to the sequence of the feature values, and finally obtaining the required matching result data, namely the second academic information comprises the specific contents of name, gender, date of birth, date of admission, date of graduation, category of graduation, hierarchy, school name, academic, specialty, learning form, certificate number and school name.
After the second learning information is identified, the embodiment may further store and subsequently compare the second learning information.
S3: and comparing the first academic information with the second academic information to audit the first academic information.
The present embodiment may use any conventional text comparison algorithm to compare the first learning information and the second learning information, so long as the comparison process can be completed, and the present embodiment does not limit the comparison process. Specifically, S3 may include: and comparing the first and second learning information by using a naive algorithm, namely comparing the characteristic data (the second learning information) obtained by identification from the learning certificate with input data (namely the first learning information) filled by the examinee person by using the naive algorithm, judging whether the first learning information filled by the examinee person has content inconsistent with the second learning information, and if so, representing that the first learning information has errors. The naive algorithm is an algorithm based on character-by-character comparison, when comparing whether two character strings are equal, the lengths of the two character strings are compared first, if the lengths are equal, each character of the two character strings is compared one by one according to the characters until the unequal characters are found or the comparison is finished, if the unequal characters exist, the first learning information is represented to be wrong, and then a test taker is prompted to indicate which content is wrong when the comparison result is fed back, and the fed back comparison result is shown in fig. 4.
The automatic checking method of the embodiment can be integrated in any online checking system, such as an ATA on-line test platform, at this time, the online checking system adds an academic on-line authentication technology, after the test taker completes the filling of the checking information in the online checking system integrated with the automatic checking method, the checking information comprises first academic information, the online checking system completes the checking information acquisition, at this time, the online checking system can start the academic authentication technology to complete the academic authentication, namely, the steps of the automatic checking method are completed, so as to automatically check the first academic information filled by the test taker.
In order to complete the steps of the automatic auditing method in the online registering system, corresponding hardware equipment (or called components) needs to be added in the online registering system, and the hardware equipment mainly comprises:
And (3) a server: program code and corresponding data for the academic database and validator are stored.
The academic database: and the second learning information and the comparison result are used for storing the identified second learning information.
The verifier: the method is used for identifying and obtaining second academic information, comparing the first academic information filled in by the test taker with the identified second academic information to verify whether the first academic information filled in by the test taker is correct or not, and completing an automatic auditing process to obtain a comparison result.
Further, the hardware device of this embodiment may further include:
User interface: and after the test taker inquires the comparison result, if the first learning information is wrong, the first learning information can be modified according to the comparison result.
Data analysis tool: the data statistics method is used for analyzing and counting the data in the academic database, and is mainly used for the related data statistics work after the second academic information acquisition, such as the number of people in each academic hierarchy, the number of professionals and the like.
The automatic auditing method provided by the embodiment can also be called an automatic auditing method based on the academic authentication comparison, adopts an academic online authentication technology to automatically identify the uploaded academic certificate to obtain second academic information, further automatically compares the second academic information with the first academic information filled by an examinee, automatically audits the first academic information, greatly improves auditing efficiency of the examinee's academic information compared with a manual auditing mode, audits the first academic information according to the true and reliable second academic information, is not easy to make mistakes compared with the manual auditing mode, can correctly judge whether the examinee's academic information is true, and can modify the first academic information according to the comparison result of the second academic information and the first academic information, so that the academic information acquired by the examinee is more accurate, the acquisition accuracy of the examinee's academic information can be greatly improved, and the problem of inaccurate auditing efficiency is solved.
Example 2:
The embodiment is used for providing an automatic auditing system for academic information, as shown in fig. 5, the automatic auditing system includes:
the data acquisition module M1 is used for acquiring first learning information filled in by an examinee and an uploaded learning certificate;
The certificate identification module M2 is used for identifying the academic certificate to obtain second academic information included in the academic certificate;
and the information comparison module M3 is used for comparing the first learning information with the second learning information so as to audit the first learning information.
Example 3:
The embodiment is used for providing an automatic auditing device for academic information, comprising:
A processor; and
A memory in which computer-readable program instructions are stored,
Wherein the automatic auditing method of embodiment 1 is performed when the computer readable program instructions are executed by the processor.
Example 4:
This embodiment is directed to a computer readable storage medium having stored thereon a computer program which, when executed by a processor, performs the steps of the automatic auditing method of embodiment 1.
In this specification, each embodiment is mainly described in the specification as a difference from other embodiments, and the same similar parts between the embodiments are referred to each other. For the system disclosed in the embodiment, since it corresponds to the method disclosed in the embodiment, the description is relatively simple, and the relevant points refer to the description of the method section.
The principles and embodiments of the present invention have been described herein with reference to specific examples, the description of which is intended only to assist in understanding the methods of the present invention and the core ideas thereof; also, it is within the scope of the present invention to be modified by those of ordinary skill in the art in light of the present teachings. In view of the foregoing, this description should not be construed as limiting the invention.

Claims (10)

1. An automatic auditing method for academic information is characterized by comprising the following steps:
acquiring first academic information filled in by an examinee and an uploaded academic certificate;
identifying the academic certificate to obtain second academic information included in the academic certificate;
And comparing the first academic information with the second academic information to audit the first academic information.
2. The automatic auditing method according to claim 1, wherein the identifying the academic certificate to obtain the second academic information included in the academic certificate specifically includes:
performing word recognition on the academic certificate to obtain text information included in the academic certificate;
And extracting the characteristics of the text information to obtain second learning information contained in the learning certificate.
3. The automatic auditing method of claim 2, further comprising, prior to text recognition of the academic credentials:
Judging whether the format of the academic certificate is a picture or not;
if not, carrying out format conversion on the academic certificate, and converting the format of the academic certificate into a picture.
4. The automatic auditing method according to claim 3, wherein the text recognition of the academic certificate to obtain text information included in the academic certificate specifically includes:
And performing character recognition on the academic certificate by utilizing OCR to obtain text information included in the academic certificate.
5. The automatic auditing method according to claim 3, characterized in that after converting the format of the academic certificate into a picture, and before text recognition of the academic certificate, the automatic auditing method further comprises:
preprocessing the academic certificate to obtain a preprocessed academic certificate, and taking the preprocessed academic certificate as a new academic certificate; the preprocessing includes denoising, graying, binarizing, and image enhancement.
6. The automatic auditing method according to claim 2, wherein the feature extraction of the text information to obtain the second learning information included in the learning certificate specifically includes:
And extracting the characteristics of the text information by using an Aho-Corasick algorithm to obtain second learning information contained in the learning certificate.
7. The automatic auditing method according to claim 1, wherein the comparing the first and second academic information specifically comprises:
And comparing the first learning information with the second learning information by using a naive algorithm.
8. An automatic review system for learning information, the automatic review system comprising:
the data acquisition module is used for acquiring the first academic information filled in by the examinee and the uploaded academic certificate;
the certificate identification module is used for identifying the academic certificate to obtain second academic information included in the academic certificate;
And the information comparison module is used for comparing the first academic information with the second academic information so as to audit the first academic information.
9. An automatic review device for learning information, comprising:
A processor; and
A memory in which computer-readable program instructions are stored,
Wherein the computer readable program instructions, when executed by the processor, perform the automatic auditing method of any of claims 1-7.
10. A computer readable storage medium having stored thereon a computer program, wherein the computer program when executed by a processor implements the steps of the automatic auditing method of any of claims 1-7.
CN202311467324.1A 2023-11-07 2023-11-07 Automatic auditing method, system, equipment and medium for academic information Pending CN118247095A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202311467324.1A CN118247095A (en) 2023-11-07 2023-11-07 Automatic auditing method, system, equipment and medium for academic information

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202311467324.1A CN118247095A (en) 2023-11-07 2023-11-07 Automatic auditing method, system, equipment and medium for academic information

Publications (1)

Publication Number Publication Date
CN118247095A true CN118247095A (en) 2024-06-25

Family

ID=91557207

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202311467324.1A Pending CN118247095A (en) 2023-11-07 2023-11-07 Automatic auditing method, system, equipment and medium for academic information

Country Status (1)

Country Link
CN (1) CN118247095A (en)

Similar Documents

Publication Publication Date Title
US11232300B2 (en) System and method for automatic detection and verification of optical character recognition data
US10489645B2 (en) System and method for automatic detection and verification of optical character recognition data
TWI621077B (en) Character recognition method and server for claim documents
CN110751143A (en) Electronic invoice information extraction method and electronic equipment
EP3588376A1 (en) System and method for enrichment of ocr-extracted data
US20140207631A1 (en) Systems and Method for Analyzing and Validating Invoices
CN112418812A (en) Distributed full-link automatic intelligent clearance system, method and storage medium
CN112381099A (en) Question recording system based on digital education resources
CN111144210A (en) Image structuring processing method and device, storage medium and electronic equipment
CN109684957A (en) A kind of method and system showing system data according to paper form automatically
CN113935710A (en) Contract auditing method and device, electronic equipment and storage medium
RU2597163C2 (en) Comparing documents using reliable source
CN109960707B (en) College recruitment data acquisition method and system based on artificial intelligence
CN112418813A (en) AEO qualification intelligent rating management system and method based on intelligent analysis and identification and storage medium
CN111241329A (en) Image retrieval-based ancient character interpretation method and device
CN118247095A (en) Automatic auditing method, system, equipment and medium for academic information
CN110991352A (en) File data examination method and device
CN114638597A (en) Intelligent government affair handling application system, method, terminal and medium
CN114065762A (en) Text information processing method, device, medium and equipment
CN110852713A (en) Unified credit code certificate recognition system and algorithm
CN114565044B (en) Seal identification method and system
CN115640952B (en) Method and system for importing and uploading data
CN113963367B (en) Model-based financial transaction file and money extraction method
CN116862692A (en) Intelligent reimbursement method and system based on visual question and answer
TWM655760U (en) System for processing invoice data

Legal Events

Date Code Title Description
PB01 Publication