CN115620317A - Method and system for verifying authenticity of electronic engineering document - Google Patents

Method and system for verifying authenticity of electronic engineering document Download PDF

Info

Publication number
CN115620317A
CN115620317A CN202211316941.7A CN202211316941A CN115620317A CN 115620317 A CN115620317 A CN 115620317A CN 202211316941 A CN202211316941 A CN 202211316941A CN 115620317 A CN115620317 A CN 115620317A
Authority
CN
China
Prior art keywords
document
electronic engineering
information
preset
engineering document
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Withdrawn
Application number
CN202211316941.7A
Other languages
Chinese (zh)
Inventor
王蒙娟
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Individual
Original Assignee
Individual
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Individual filed Critical Individual
Priority to CN202211316941.7A priority Critical patent/CN115620317A/en
Publication of CN115620317A publication Critical patent/CN115620317A/en
Withdrawn legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/40Document-oriented image-based pattern recognition
    • G06V30/41Analysis of document content
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33Querying
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/10Text processing
    • G06F40/103Formatting, i.e. changing of presentation of documents
    • G06F40/117Tagging; Marking up; Designating a block; Setting of attributes
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/10Text processing
    • G06F40/166Editing, e.g. inserting or deleting
    • G06F40/186Templates
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/10Character recognition
    • G06V30/18Extraction of features or characteristics of the image
    • G06V30/1801Detecting partial patterns, e.g. edges or contours, or configurations, e.g. loops, corners, strokes or intersections
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/10Character recognition
    • G06V30/19Recognition using electronic means
    • G06V30/191Design or setup of recognition systems or techniques; Extraction of features in feature space; Clustering techniques; Blind source separation
    • G06V30/19147Obtaining sets of training patterns; Bootstrap methods, e.g. bagging or boosting

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Multimedia (AREA)
  • Artificial Intelligence (AREA)
  • Computational Linguistics (AREA)
  • General Engineering & Computer Science (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • General Health & Medical Sciences (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • Machine Translation (AREA)

Abstract

The invention relates to the technical field of document detection, and particularly discloses a method and a system for verifying the authenticity of an electronic engineering document, wherein the method comprises the steps of acquiring the electronic engineering document containing label information according to a preset document interface, and inquiring a reference document in a preset template library according to the label information; determining a query mode of each position in the electronic engineering document according to the reference file, and extracting information in the electronic engineering document based on the query mode; the query mode is preset; identifying all information, counting identification results and generating the truth; in the execution process of the content, the fluctuation interference is generated in real time according to a preset fluctuation generation rule. The invention generates the interference item in real time in the process of identifying the electronic document, ensures the identification performance as much as possible and improves the safety.

Description

Method and system for verifying authenticity of electronic engineering document
Technical Field
The invention relates to the technical field of document detection, in particular to a method and a system for verifying the authenticity of an electronic engineering document.
Background
Electronic document (Electronic document) refers to a word material which is formed in social activities by people and takes chemical magnetic materials such as computer discs, magnetic discs, optical discs and the like as carriers. Dependent on the computer system access and may be transmitted over a communication network. It mainly includes electronic document, electronic mail, electronic report form and electronic drawing, etc..
The convenience of the electronic document is the convenience, namely filling speed or transmission speed; on the other hand, the method is safe, the examination of the electronic document can be carried out by means of the existing intelligent identification technology, compared with the traditional manual identification technology, the method is not tired, and the detection process of the electronic document can be more comprehensive by combining manual detection.
However, most of the existing electronic document detection methods are based on preset algorithms, and in the working process of the algorithms, the positions and relevant execution characteristics of the algorithms are easily determined according to the process running conditions of a computer background, so that the electronic document detection methods have the possibility of being decoded, and therefore how to further improve the safety is the technical problem to be solved by the technical scheme of the invention.
Disclosure of Invention
The invention aims to provide a method and a system for verifying the authenticity of an electronic engineering document, so as to solve the problems in the background technology.
In order to achieve the purpose, the invention provides the following technical scheme:
an electronic engineering document authenticity verification method, the method comprising:
acquiring an electronic engineering document containing label information according to a preset document interface, and inquiring a reference document in a preset template library according to the label information;
determining a query mode of each position in the electronic engineering document according to the reference file, and extracting information in the electronic engineering document based on the query mode; the query mode is preset;
identifying all information, counting identification results and generating the truth;
in the execution process of the content, the fluctuation interference is generated in real time according to a preset fluctuation generation rule.
As a further scheme of the invention: the steps of acquiring the electronic engineering document containing the label information according to the preset document interface and inquiring the reference document in the preset template library according to the label information comprise:
establishing a cache region corresponding to each document interface;
inputting the electronic engineering document containing the tag information acquired by the document interface into the cache region, and recording the acquisition time;
extracting the electronic engineering document in the cache region according to the preset traversal thread of the document interface and the acquisition time;
traversing a preset template base according to the label information, and inquiring a corresponding reference file; the template library comprises tag information items and reference document items.
As a further scheme of the invention: the step of determining the query mode of each position in the electronic engineering document according to the reference file and extracting the information in the electronic engineering document based on the query mode comprises the following steps:
sequentially reading the region segmentation grids of the reference file; the region segmentation grid is a rectangular grid, and the position information of the region segmentation grid comprises a page number and a coordinate of a vertex in the page number;
sequentially extracting the contents in the region segmentation grid, and judging the content type; the content type comprises a text type and an image type;
reading a query mode in a recorded mode library based on the content type;
and extracting information in the electronic engineering document based on the query mode.
As a further scheme of the invention: the step of extracting the information in the electronic engineering document based on the query mode comprises the following steps:
when the content type is a text type, marking a to-be-detected grid corresponding to the region segmentation grid in the electronic engineering document;
extracting text contents in the to-be-detected grid according to the trained text recognition model, and inputting the text contents into a preset word recognition model to obtain the type of each word in the text contents; the types comprise fictional words, nouns and descriptors;
extracting and arranging the obtained types according to the sequence of the text content to obtain a text framework of the text content;
judging the accuracy of the text content according to the text architecture;
and scaling the network to be detected according to the accuracy, and repeating the above contents until the accuracy reaches a preset accuracy threshold.
As a further scheme of the invention: the step of extracting the information in the electronic engineering document based on the query mode comprises the following steps:
when the content type is an image type, marking a to-be-detected grid corresponding to the region segmentation grid in the electronic engineering document;
positioning a target image according to a trained image recognition algorithm, and extracting contour features and color value features in the target image; the contour feature is used for representing a boundary position in the target image, and the color value feature is used for representing color value parameters of the target image in a preset traversal direction;
and inputting the contour feature and the color value feature into a trained simplified statistical model, and outputting the image feature.
As a further scheme of the invention: the steps of identifying all the information, counting the identification results and generating the truth degree comprise:
reading the extracted information and the position information of the region segmentation grid;
inquiring reference information according to the position information, comparing the information with the reference information, and determining a first truth degree;
when the first truth is smaller than a preset truth threshold, sending information to the artificial end;
and receiving a second truth degree fed back by the artificial end.
As a further scheme of the invention: the step of generating the fluctuation interference in real time according to a preset fluctuation generation rule comprises the following steps:
monitoring the number of processes and the CPU occupancy rate thereof in the execution process of the steps in real time;
randomly generating a virtual process according to the CPU occupancy rate;
the ratio of the number of the processes of the virtual process to the number of the processes in the execution process is a preset value; only one CPU occupancy rate of the virtual processes reaches the maximum CPU occupancy rate of each process in the execution process, and the virtual processes are randomly selected; the execution time window of the virtual process is a random value.
The technical scheme of the invention also provides an electronic engineering document authenticity verification system, which comprises:
the reference document query module is used for acquiring the electronic engineering document containing the label information according to a preset document interface and querying the reference document in a preset template library according to the label information;
the information extraction module is used for determining the query mode of each position in the electronic engineering document according to the reference file and extracting the information in the electronic engineering document based on the query mode; the query mode is preset;
the truth generation module is used for identifying all information, counting identification results and generating truth;
in the execution process of the content, the fluctuation interference is generated in real time according to a preset fluctuation generation rule.
As a further scheme of the invention: the reference document query module comprises:
a buffer area establishing unit for establishing a buffer area corresponding to each document interface;
the data acquisition unit is used for inputting the electronic engineering document containing the tag information acquired by the document interface into the cache region and recording acquisition time;
the data extraction unit is used for extracting the electronic engineering documents in the cache region according to the preset traversal thread of the document interface and the acquisition time;
the traversing query unit is used for traversing a preset template base according to the label information and querying a corresponding reference file; the template library comprises tag information items and reference document items.
As a further scheme of the invention: the information extraction module comprises:
the network reading unit is used for sequentially reading the region segmentation grids of the reference file; the region segmentation grid is a rectangular grid, and the position information of the region segmentation grid comprises a page number and a coordinate of a vertex in the page number;
the type judging unit is used for sequentially extracting the contents in the region segmentation grid and judging the content type; the content type comprises a text type and an image type;
a mode reading unit, configured to read a query mode in a documented mode library based on the content type;
and the execution unit is used for extracting the information in the electronic engineering document based on the query mode.
Compared with the prior art, the invention has the beneficial effects that: the method comprises the steps of receiving electronic documents uploaded by uploaders with different space-time distances in real time according to a preset document interface, determining region segmentation information of the electronic documents according to a preset template, further performing text recognition and graphic recognition on each region to obtain information of each region in the electronic documents, and judging authenticity according to the information; in the process, the interference item is generated in real time, so that the safety is improved while the identification performance is ensured as much as possible.
Drawings
In order to more clearly illustrate the technical solutions in the embodiments of the present invention, the drawings used in the description of the embodiments or the prior art will be briefly described below, and it is obvious that the drawings in the following description are only some embodiments of the present invention.
FIG. 1 is a flow chart of a method for verifying authenticity of an electronic engineering document.
Fig. 2 is a first sub-flow block diagram of an electronic engineering document authenticity verification method.
Fig. 3 is a second sub-flow diagram of the method for verifying the authenticity of an electronic engineering document.
Fig. 4 is a third sub-flow block diagram of the method for verifying the authenticity of the electronic engineering document.
Fig. 5 is a block diagram of the composition structure of the electronic engineering document authenticity verification system.
Detailed Description
In order to make the technical problems, technical solutions and advantageous effects to be solved by the present invention more clearly apparent, the present invention is further described in detail below with reference to the accompanying drawings and embodiments. It should be understood that the specific embodiments described herein are merely illustrative of the invention and are not intended to limit the invention.
Example 1
Fig. 1 is a flow chart of a method for verifying authenticity of an electronic engineering document, in an embodiment of the present invention, the method includes:
step S100: acquiring an electronic engineering document containing label information according to a preset document interface, and inquiring a reference document in a preset template library according to the label information;
the electronic engineering documents are uploaded by a plurality of main bodies, space distances exist among different main bodies, and the authorities among different main bodies are different, so that different document interfaces are required to obtain the electronic engineering documents; wherein, the label information is an index of the electronic engineering document, and it can be an integrated information, including the name, the creation time and the summary information of the electronic engineering document; the label information can inquire a reference document, the reference document is a pre-stored template, and the received electronic engineering document is an engineering document generated based on the template.
Step S200: determining a query mode of each position in the electronic engineering document according to the reference file, and extracting information in the electronic engineering document based on the query mode; the query mode is preset;
since the filling information of each position in the reference file is predefined, when the electronic engineering document is identified, a corresponding identification mode is determined according to the position, and the information in the electronic engineering document can be extracted by executing identification.
Step S300: identifying all information, counting identification results and generating the truth;
the extracted information is processed by means of the existing recognition model to obtain a recognition result and the recognition result is counted, and the truth degree can be obtained according to the counted recognition result.
It is worth mentioning that in the execution process of the content, fluctuation interference is generated in real time according to a preset fluctuation generation rule; in the authenticity verification process of the electronic engineering document, the background of the computer equipment monitors related behaviors, the behaviors are analyzed, the position and the execution characteristic of the recognition algorithm can be positioned, and certain hidden dangers exist.
Fig. 2 is a first sub-flow block diagram of a method for verifying authenticity of an electronic engineering document, where the step of obtaining the electronic engineering document containing tag information according to a preset document interface and querying a reference document in a preset template library according to the tag information includes:
step S101: establishing a cache region corresponding to each document interface;
step S102: inputting the electronic engineering document containing tag information acquired by a document interface into the cache region, and recording acquisition time;
the process of acquiring the electronic engineering document by the document interface requires time, the process of sending the electronic engineering document to the data processing end by the document interface also requires time, and the time length is related to the data transmission speed of the document interface; the purpose of setting the cache region is to enable the process of acquiring the electronic engineering document by the document interface to be continuously carried out, and after the electronic engineering document is received, the electronic engineering document is sent to the data processing terminal by the cache region; some processing algorithms can be added in the cache region, such as recording the acquisition time, sequencing the electronic engineering documents according to the acquisition time, and the like.
Specifically, after the electronic engineering document is received, the document interface is closed, and the data extraction process of the cache region can be performed slowly.
Step S103: extracting the electronic engineering document in the cache region according to the preset traversal thread of the document interface and the acquisition time;
the method comprises the steps that a document interface corresponds to a cache region, the data reading sequence of the cache region is determined by the traversing thread and the obtaining time of the document interface, and the traversing thread of the document interface is the traversing sequence of the document interface.
Step S104: traversing a preset template base according to the label information, and inquiring a corresponding reference file; the template library comprises tag information items and reference document items;
when an electronic engineering document is extracted, a reference file can be inquired according to the label information of the electronic engineering document.
Fig. 3 is a second sub-flow block diagram of the method for verifying authenticity of an electronic engineering document, where the step of determining an inquiry manner of each position in the electronic engineering document according to the reference file and extracting information in the electronic engineering document based on the inquiry manner includes:
step S201: sequentially reading the region segmentation grids of the reference file; the region segmentation grid is a rectangular grid, and the position information of the region segmentation grid comprises a page number and a coordinate of a vertex in the page number;
each area in the reference file is used for filling in which contents are specified in advance; performing segmentation between different regions by adopting a region segmentation grid;
step S202: sequentially extracting contents in the region segmentation grid, and judging the content type; the content type comprises a text type and an image type;
the content in the region segmentation grid is the content in a small region, and the content type can be judged by extracting the content, wherein the content types are only two, one is a text and the other is an image.
Step S203: reading a query mode in a recorded mode library based on the content type;
step S204: extracting information in the electronic engineering document based on the query mode;
steps S203 and S204 are execution steps, and are specifically set forth below.
In an example of the technical solution of the present invention, the step of extracting information in the electronic engineering document based on the query mode includes:
when the content type is a text type, marking a to-be-detected grid corresponding to the region segmentation grid in the electronic engineering document;
extracting text contents in the to-be-detected grid according to the trained text recognition model, and inputting the text contents into a preset word recognition model to obtain the type of each word in the text contents; the types comprise fictitious words, nouns and descriptors;
extracting and arranging the obtained types according to the sequence of the text contents to obtain a text framework of the text contents;
judging the accuracy of the text content according to the text architecture;
and scaling the network to be detected according to the accuracy, and repeating the above contents until the accuracy reaches a preset accuracy threshold.
For the content of the text type, firstly, a text area in the electronic engineering document is positioned according to a reference file, text recognition is carried out on the text area, and the text content can be obtained by adopting the existing text recognition technology in a text recognition mode; then, according to the existing text content analysis technology, the part of speech of each word can be judged, the part of speech is arranged, and whether the text content is obviously discontent can be judged; if the text is obviously not smooth, zooming and adjusting the corresponding to-be-detected grid, and adjusting the size of the text recognition area; and finally, repeating the contents until the text contents are smooth.
It is worth mentioning that the scaling process is typically an expansion, with little or no reduction.
In an example of the technical solution of the present invention, the step of extracting information in the electronic engineering document based on the query mode includes:
when the content type is an image type, marking a to-be-detected grid corresponding to the region segmentation grid in the electronic engineering document;
positioning a target image according to a trained image recognition algorithm, and extracting contour features and color value features in the target image; the contour feature is used for representing a boundary position in the target image, and the color value feature is used for representing color value parameters of the target image in a preset traversal direction;
and inputting the contour feature and the color value feature into a trained simplified statistical model, and outputting the image feature.
When the content type is an image type, the corresponding content is some signature information and the like, the signature information is identified by means of the existing image identification technology, the outline characteristic and the color value characteristic can be determined, the outline characteristic and the color value characteristic are converted into the image characteristic according to the preset conversion rule, and the image characteristic can be used as information extracted from the signature information.
Fig. 4 is a third sub-flow block diagram of the method for verifying the authenticity of the electronic engineering document, where the steps of identifying all information, counting the identification result, and generating the authenticity include:
step S301: reading the extracted information and the position information of the region segmentation grid;
step S302: inquiring reference information according to the position information, comparing the information with the reference information, and determining a first truth degree;
step S303: when the first truth is smaller than a preset truth threshold, sending information to the artificial end;
step S304: and receiving a second truth degree fed back by the artificial end.
The first truth and the second truth can be used as final truths, the second truths are more accurate, and the first truths and the second truths can be generated only under the condition that the first truths are abnormal.
As a preferred embodiment of the technical solution of the present invention, the step of generating the fluctuation interference in real time according to a preset fluctuation generation rule includes:
monitoring the number of processes and the CPU occupancy rate thereof in the execution process of the steps in real time;
randomly generating a virtual process according to the CPU occupancy rate;
the ratio of the number of the processes of the virtual process to the number of the processes in the execution process is a preset value; only one CPU occupancy rate of the virtual processes reaches the maximum CPU occupancy rate of each process in the execution process, and the virtual processes are randomly selected; the execution time window of the virtual process is a random value.
In one example of the technical scheme of the invention, the process number and the CPU occupancy rate of the process number in the computer equipment are monitored, and the running number and the running degree of the virtual process are determined by the process number and the CPU occupancy rate; specifically, the number of processes of the virtual process should be limited, which is a preset proportion of the number of processes generated in the authenticity verification process; in the virtual processes, the CPU occupancy rate of one virtual process is the same as the maximum CPU occupancy rate of a normal process at random and lasts for random time. Therefore, the method can ensure that the normal process can normally run as much as possible and has stronger interference capability.
Example 2
Fig. 5 is a block diagram of a component structure of an electronic engineering document authenticity verification system, in an embodiment of the present invention, an electronic engineering document authenticity verification system includes:
the reference document query module 11 is configured to obtain an electronic engineering document containing tag information according to a preset document interface, and query a reference document in a preset template library according to the tag information;
the information extraction module 12 is configured to determine a query mode of each position in the electronic engineering document according to the reference file, and extract information in the electronic engineering document based on the query mode; the query mode is preset;
the truth generation module 13 is used for identifying all information, counting identification results and generating truth;
in the execution process of the content, the fluctuation interference is generated in real time according to a preset fluctuation generation rule.
The reference document query module 11 includes:
a buffer area establishing unit for establishing a buffer area corresponding to each document interface;
the data acquisition unit is used for inputting the electronic engineering document containing the tag information acquired by the document interface into the cache region and recording the acquisition time;
the data extraction unit is used for extracting the electronic engineering documents in the cache region according to the preset traversal thread of the document interface and the acquisition time;
the traversing query unit is used for traversing a preset template base according to the label information and querying a corresponding reference file; the template library comprises tag information items and reference document items.
The information extraction module 12 includes:
the network reading unit is used for sequentially reading the area segmentation grids of the reference file; the region segmentation grid is a rectangular grid, and the position information of the region segmentation grid comprises a page number and a coordinate of a vertex in the page number;
the type judging unit is used for sequentially extracting the contents in the region segmentation grid and judging the content type; the content type comprises a text type and an image type;
a mode reading unit, configured to read a query mode in a documented mode library based on the content type;
and the execution unit is used for extracting the information in the electronic engineering document based on the query mode.
The functions which can be realized by the electronic engineering document authenticity verification method are all completed by computer equipment, the computer equipment comprises one or more processors and one or more memories, and at least one program code is stored in the one or more memories and is loaded and executed by the one or more processors to realize the functions of the electronic engineering document authenticity verification method.
The processor fetches instructions and analyzes the instructions one by one from the memory, then completes corresponding operations according to the instruction requirements, generates a series of control commands, enables all parts of the computer to automatically, continuously and coordinately act to form an organic whole, realizes the input of programs, the input of data, the operation and the output of results, and the arithmetic operation or the logic operation generated in the process is completed by the arithmetic unit; the Memory comprises a Read-Only Memory (ROM) for storing a computer program, and a protection device is arranged outside the Memory.
Illustratively, a computer program can be partitioned into one or more modules, which are stored in memory and executed by a processor to implement the present invention. One or more of the modules may be a series of computer program instruction segments capable of performing certain functions, which are used to describe the execution of the computer program in the terminal device.
Those skilled in the art will appreciate that the above description of the service device is merely exemplary and not limiting of the terminal device, and may include more or less components than those described, or combine certain components, or different components, such as may include input output devices, network access devices, buses, etc.
The Processor may be a Central Processing Unit (CPU), other general purpose Processor, a Digital Signal Processor (DSP), an Application Specific Integrated Circuit (ASIC), an off-the-shelf Programmable Gate Array (FPGA) or other Programmable logic device, discrete Gate or transistor logic, discrete hardware components, etc. The general-purpose processor may be a microprocessor or the processor may be any conventional processor or the like, which is the control center of the terminal equipment and connects the various parts of the entire user terminal using various interfaces and lines.
The memory may be used to store computer programs and/or modules, and the processor may implement various functions of the terminal device by operating or executing the computer programs and/or modules stored in the memory and calling data stored in the memory. The memory mainly comprises a storage program area and a storage data area, wherein the storage program area can store an operating system, application programs required by at least one function (such as an information acquisition template display function, a product information publishing function and the like) and the like; the storage data area may store data created according to the use of the berth-state display system (e.g., product information acquisition templates corresponding to different product types, product information that needs to be issued by different product providers, etc.), and the like. In addition, the memory may include high speed random access memory, and may also include non-volatile memory, such as a hard disk, a memory, a plug-in hard disk, a Smart Media Card (SMC), a Secure Digital (SD) Card, a Flash memory Card (Flash Card), at least one magnetic disk storage device, a Flash memory device, or other volatile solid state storage device.
The terminal device integrated modules/units, if implemented in the form of software functional units and sold or used as separate products, may be stored in a computer readable storage medium. Based on such understanding, all or part of the modules/units in the system according to the above embodiment may be implemented by a computer program, which may be stored in a computer-readable storage medium and used by a processor to implement the functions of the embodiments of the system. Wherein the computer program comprises computer program code, which may be in the form of source code, object code, an executable file or some intermediate form, etc. The computer readable medium may include: any entity or device capable of carrying computer program code, recording medium, U.S. disk, removable hard disk, magnetic disk, optical disk, computer Memory, read-Only Memory (ROM), random Access Memory (RAM), electrical carrier wave signals, telecommunications signals, software distribution media, and the like.
It should be noted that, in this document, the terms "comprises," "comprising," or any other variation thereof, are intended to cover a non-exclusive inclusion, such that a process, method, article, or apparatus that comprises a list of elements does not include only those elements but may include other elements not expressly listed or inherent to such process, method, article, or apparatus. Without further limitation, an element defined by the phrase "comprising a … …" does not exclude the presence of another identical element in a process, method, article, or apparatus that comprises the element.
The above description is only a preferred embodiment of the present invention, and is not intended to limit the scope of the present invention, and all equivalent structures or equivalent processes performed by the present invention or directly or indirectly applied to other related technical fields are also included in the scope of the present invention.

Claims (10)

1. An electronic engineering document authenticity verification method, characterized by comprising:
acquiring an electronic engineering document containing label information according to a preset document interface, and inquiring a reference document in a preset template library according to the label information;
determining a query mode of each position in the electronic engineering document according to the reference file, and extracting information in the electronic engineering document based on the query mode; the query mode is preset;
identifying all information, counting identification results and generating the truth;
in the execution process of the content, the fluctuation interference is generated in real time according to a preset fluctuation generation rule.
2. The method for verifying the authenticity of an electronic engineering document according to claim 1, wherein the step of obtaining the electronic engineering document containing the tag information according to a preset document interface, and the step of querying the reference document in a preset template library according to the tag information comprises:
establishing a cache region corresponding to each document interface;
inputting the electronic engineering document containing the tag information acquired by the document interface into the cache region, and recording the acquisition time;
extracting the electronic engineering document in the cache region according to the preset traversal thread of the document interface and the acquisition time;
traversing a preset template base according to the label information, and inquiring a corresponding reference file; the template library comprises tag information items and reference document items.
3. The method for verifying the authenticity of the electronic engineering document according to claim 1, wherein the step of determining the query mode of each position in the electronic engineering document according to the reference file, and the step of extracting the information in the electronic engineering document based on the query mode comprises the following steps:
sequentially reading the area segmentation grids of the reference file; the region segmentation grid is a rectangular grid, and the position information of the region segmentation grid comprises a page number and a coordinate of a vertex in the page number;
sequentially extracting contents in the region segmentation grid, and judging the content type; the content type comprises a text type and an image type;
reading a query mode in a recorded mode library based on the content type;
and extracting information in the electronic engineering document based on the query mode.
4. The method for verifying the authenticity of an electronic engineering document according to claim 3, wherein the step of extracting the information in the electronic engineering document based on the query mode comprises:
when the content type is a text type, marking a to-be-detected grid corresponding to the region segmentation grid in the electronic engineering document;
extracting text contents in the to-be-detected grid according to the trained text recognition model, and inputting the text contents into a preset word recognition model to obtain the type of each word in the text contents; the types comprise fictitious words, nouns and descriptors;
extracting and arranging the obtained types according to the sequence of the text content to obtain a text framework of the text content;
judging the accuracy of the text content according to the text architecture;
and scaling the network to be detected according to the accuracy, and repeating the above contents until the accuracy reaches a preset accuracy threshold.
5. The method for verifying the authenticity of an electronic engineering document according to claim 4, wherein the step of extracting the information in the electronic engineering document based on the query mode comprises the steps of:
when the content type is an image type, marking a to-be-detected grid corresponding to the region segmentation grid in the electronic engineering document;
positioning a target image according to a trained image recognition algorithm, and extracting contour features and color value features in the target image; the contour feature is used for representing a boundary position in a target image, and the color value feature is used for representing color value parameters of the target image in a preset traversal direction;
and inputting the contour feature and the color value feature into a trained simplified statistical model, and outputting the image feature.
6. The method for verifying the authenticity of an electronic engineering document according to claim 1, wherein the steps of identifying all information, counting the identification results and generating the degree of authenticity include:
reading the extracted information and the position information of the area segmentation grid thereof;
inquiring reference information according to the position information, comparing the information with the reference information, and determining a first truth degree;
when the first truth is smaller than a preset truth threshold, sending information to the artificial end;
and receiving a second truth degree fed back by the artificial end.
7. The method for verifying the authenticity of an electronic engineering document according to any one of claims 1 to 6, wherein the step of generating the fluctuation interference in real time according to a preset fluctuation generation rule comprises:
monitoring the number of processes and the CPU occupancy rate thereof in the execution process of the steps in real time;
randomly generating a virtual process according to the CPU occupancy rate;
the ratio of the number of the processes of the virtual process to the number of the processes in the execution process is a preset value; only one CPU occupancy rate of the virtual processes reaches the maximum CPU occupancy rate of each process in the execution process, and the virtual processes are randomly selected; the execution time window of the virtual process is a random value.
8. An electronic engineering document authenticity verification system, the system comprising:
the reference document query module is used for acquiring the electronic engineering document containing the label information according to a preset document interface and querying the reference document in a preset template library according to the label information;
the information extraction module is used for determining the query mode of each position in the electronic engineering document according to the reference file and extracting the information in the electronic engineering document based on the query mode; the query mode is preset;
the truth generation module is used for identifying all information, counting identification results and generating truth;
in the execution process of the content, the fluctuation interference is generated in real time according to a preset fluctuation generation rule.
9. The system of claim 8, wherein the reference document query module comprises:
a buffer area establishing unit for establishing a buffer area corresponding to each document interface;
the data acquisition unit is used for inputting the electronic engineering document containing the tag information acquired by the document interface into the cache region and recording acquisition time;
the data extraction unit is used for extracting the electronic engineering documents in the cache region according to the preset traversal thread of the document interface and the acquisition time;
the traversing query unit is used for traversing a preset template library according to the label information and querying a corresponding reference file; the template library comprises tag information items and reference document items.
10. The system of verifying authenticity of an electronic engineering document according to claim 8, wherein the information extraction module comprises:
the network reading unit is used for sequentially reading the region segmentation grids of the reference file; the region segmentation grid is a rectangular grid, and the position information of the region segmentation grid comprises a page number and a coordinate of a vertex in the page number;
the type judging unit is used for sequentially extracting the contents in the region segmentation grid and judging the content type; the content type comprises a text type and an image type;
a mode reading unit, configured to read a query mode in a documented mode library based on the content type;
and the execution unit is used for extracting the information in the electronic engineering document based on the query mode.
CN202211316941.7A 2022-10-26 2022-10-26 Method and system for verifying authenticity of electronic engineering document Withdrawn CN115620317A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202211316941.7A CN115620317A (en) 2022-10-26 2022-10-26 Method and system for verifying authenticity of electronic engineering document

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202211316941.7A CN115620317A (en) 2022-10-26 2022-10-26 Method and system for verifying authenticity of electronic engineering document

Publications (1)

Publication Number Publication Date
CN115620317A true CN115620317A (en) 2023-01-17

Family

ID=84864431

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202211316941.7A Withdrawn CN115620317A (en) 2022-10-26 2022-10-26 Method and system for verifying authenticity of electronic engineering document

Country Status (1)

Country Link
CN (1) CN115620317A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN117149716A (en) * 2023-10-27 2023-12-01 北京美络克思科技有限公司 Method, device and equipment for detecting authenticity of electronic document

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN117149716A (en) * 2023-10-27 2023-12-01 北京美络克思科技有限公司 Method, device and equipment for detecting authenticity of electronic document

Similar Documents

Publication Publication Date Title
CN109886928B (en) Target cell marking method, device, storage medium and terminal equipment
CN113746758B (en) Method and terminal for dynamically identifying flow protocol
CN114414935A (en) Automatic positioning method and system for feeder fault area of power distribution network based on big data
CN111858977B (en) Bill information acquisition method, device, computer equipment and storage medium
CN115620317A (en) Method and system for verifying authenticity of electronic engineering document
CN114581442A (en) Product detection method and device for MES system
CN111797396B (en) Malicious code visualization and variant detection method, device, equipment and storage medium
CN116052848B (en) Data coding method and system for medical imaging quality control
CN112839047A (en) Asset vulnerability scanning method, device, equipment and medium on cloud platform
CN114491134B (en) Trademark registration success rate analysis method and system
CN116844006A (en) Target identification method and device, electronic equipment and readable storage medium
CN110970100A (en) Method, device and equipment for detecting item coding and computer readable storage medium
CN114040012B (en) Information query pushing method and device and computer equipment
CN115734072A (en) Internet of things centralized monitoring method and device for industrial automation equipment
CN113283677B (en) Index data processing method, device, equipment and storage medium
CN112200004B (en) Training method and device for image detection model and terminal equipment
CN114443834A (en) Method and device for extracting license information and storage medium
CN112949305A (en) Negative feedback information acquisition method, device, equipment and storage medium
CN115909345B (en) Touch and talk pen information interaction method and system
CN117555428B (en) Artificial intelligent interaction method, system, computer equipment and storage medium thereof
CN115720148B (en) Industrial Internet of things information visualization method, server and storage medium
CN114327615B (en) Interface document generation method and system based on big data
CN117112846B (en) Multi-information source license information management method, system and medium
CN109885491B (en) Method for detecting existence of data overflow expression and terminal equipment
CN117668298B (en) Artificial intelligence method and system for application data analysis

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
WW01 Invention patent application withdrawn after publication
WW01 Invention patent application withdrawn after publication

Application publication date: 20230117