CN111832264A - PDF file based signature position determination method, device and equipment - Google Patents

PDF file based signature position determination method, device and equipment Download PDF

Info

Publication number
CN111832264A
CN111832264A CN202010489992.4A CN202010489992A CN111832264A CN 111832264 A CN111832264 A CN 111832264A CN 202010489992 A CN202010489992 A CN 202010489992A CN 111832264 A CN111832264 A CN 111832264A
Authority
CN
China
Prior art keywords
signature
block data
keyword
determining
target
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202010489992.4A
Other languages
Chinese (zh)
Other versions
CN111832264B (en
Inventor
苏梅
朱继刚
王瑗
林俊杰
王欣悦
张佳明
许留中
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shenzhen Valueonline Technology Co ltd
Original Assignee
Shenzhen Valueonline Technology Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shenzhen Valueonline Technology Co ltd filed Critical Shenzhen Valueonline Technology Co ltd
Priority to CN202010489992.4A priority Critical patent/CN111832264B/en
Publication of CN111832264A publication Critical patent/CN111832264A/en
Application granted granted Critical
Publication of CN111832264B publication Critical patent/CN111832264B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/10Text processing
    • G06F40/166Editing, e.g. inserting or deleting
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/205Parsing
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02DCLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
    • Y02D10/00Energy efficient computing, e.g. low power processors, power management or thermal management

Abstract

The application is applicable to the technical field of computers, and provides a signature position determining method, a signature position determining device and signature position determining equipment based on a PDF (portable document format) file. The method comprises the steps of obtaining the file attribute of a PDF file to be signed, and determining keywords for representing a signer according to the file attribute; determining a target signature format of the keyword according to a preset keyword configuration table; performing text analysis processing on a PDF file to be signed to obtain a plurality of block data set based on a context sequence and position information of each block data; and determining target block data which contains the keywords and is matched with the target signature format of the keywords according to the context information of the plurality of block data, and determining the position represented by the position information of the target block data as the signature position of the PDF file to be signed. The embodiment of the application provides a PDF file-based signature position determining method, which realizes automatic positioning of the signature position of a PDF file.

Description

PDF file based signature position determination method, device and equipment
Technical Field
The application belongs to the technical field of computers, and particularly relates to a signature position determining method, device and equipment based on a PDF file.
Background
With the development of internet technology, the signing demand of documents in commercial activities is increasing.
In order to solve the problem of low efficiency in the conventional signing mode, a block chain-based electronic signature method is usually adopted to sign the PDF file, that is, multi-party fast signing of the electronic file is realized based on distributed data of the block chain and a consensus mechanism.
The block chain-based electronic signature method can only realize automatic signature of the PDF electronic file, cannot automatically identify the signature position, and needs to manually pre-designate the signature position of each file to be signed; due to the fact that the files to be signed have diversification, the signature positions of different files to be signed are different, even a plurality of signature positions possibly exist in one signed file, and the signing efficiency of the PDF electronic file is greatly reduced.
Disclosure of Invention
In view of this, embodiments of the present application provide a method, an apparatus, and a device for determining a signature position based on a PDF file, so as to solve the technical problem that signing efficiency of a PDF electronic file is greatly reduced due to an inability to automatically locate a signature position in the prior art.
In a first aspect, an embodiment of the present application provides a method for determining a signature position based on a PDF file, including:
acquiring the file attribute of a PDF file to be signed, and determining a keyword for representing a signer according to the file attribute; the file attributes comprise company names, conference types and file names;
determining a target signature format of the keyword according to a preset keyword configuration table; the keyword configuration table comprises corresponding relations among the file attributes, the signers and the signature formats, and the corresponding relations are used for indicating the signature formats which can be used by the signers in the files corresponding to the file attributes;
performing text analysis processing on a PDF file to be signed to obtain a plurality of block data set based on a context sequence and position information of each block data;
and determining target block data which contains the keywords and is matched with the target signature format of the keywords according to the context information of the plurality of block data, and determining the position represented by the position information of the target block data as the signature position of the PDF file to be signed.
In a possible implementation manner of the first aspect, performing text parsing on a PDF file to be signed to obtain a plurality of block data set based on a context order and location information of each block data includes:
acquiring text information of a PDF file to be signed;
dividing text information into a plurality of natural sentences, and determining position information of each natural sentence;
and taking each natural sentence as block data, and setting a plurality of block data based on the context sequence.
In a possible implementation manner of the first aspect, each signer in the keyword configuration table corresponds to multiple signature formats, and the multiple signature formats corresponding to each signer have different priorities;
determining a target signature format of the keyword according to a preset keyword configuration table, wherein the target signature format comprises the following steps:
searching a plurality of candidate signature formats corresponding to the keywords from the keyword configuration table;
and sorting the candidate signature formats according to the priorities respectively corresponding to the candidate signature formats, and taking the sorted candidate signature formats as target signature formats of the keywords.
In a possible implementation manner of the first aspect, determining, according to context information of a plurality of block data, target block data that includes a keyword and matches a target signature format of the keyword, includes:
searching at least two first block data containing keywords from the plurality of block data;
combining the keywords in each first partition data with the context information of the keywords to generate signature blocks, and analyzing to obtain the signature format of each signature block;
and performing similarity analysis on the signature format of each signature block in the plurality of signature blocks and the target signature format of the keyword, and determining N target block data with the highest similarity to the target signature format of the keyword, wherein N is an integer greater than or equal to 2.
In a possible implementation manner of the first aspect, the location information of each block data includes page number information;
searching at least two first block data containing keywords from a plurality of block data, comprising:
combining a plurality of block data corresponding to the same page number to generate a character string according to the page number information of each block data to obtain a plurality of character strings to be compared;
and aiming at each character string to be compared, comparing each block data in the character string to be compared with the keyword one by one under the condition that the character string to be compared contains the keyword, and storing the block data matched with the keyword as first block data.
In a possible implementation manner of the first aspect, performing similarity analysis on a signature format of each signature block in the plurality of signature blocks and a target signature format of the keyword, and determining N target block data with the highest similarity to the target signature format of the keyword, includes:
aiming at each candidate signature format in the target signature formats, performing matching analysis on the signature format of each signature block in the plurality of signature blocks and the candidate signature format, and saving the signature block matched with the candidate signature format as a candidate signature block;
sorting the candidate signature blocks according to the priority order of the candidate signature formats in the target signature format;
and determining the block data respectively corresponding to the first N candidate signature blocks in sequence as N target block data with the highest target signature format similarity with the keyword.
In one possible implementation form of the first aspect, the keyword configuration table further comprises a signature style for characterizing the signer;
after the position represented by the position information of the target block data is determined as the signature position of the PDF file to be signed, the method further comprises the following steps:
determining the signature style of a signer according to the keyword configuration table, and respectively combining and storing the signature style and a plurality of candidate signature formats to generate a plurality of signature effects;
for each target block data in the N target block data, responding to the received signature effect checking operation information, sequentially adding a plurality of signature effects to the corresponding positions of the target block data and displaying the signature effects;
and responding to the received signature to determine operation information, and generating a signed PDF file according to the currently displayed signature effect.
In a second aspect, an embodiment of the present application provides an apparatus for determining a signature location based on a PDF file, including:
the acquisition module is used for acquiring the file attribute of the PDF file to be signed and determining a keyword for representing a signer according to the file attribute; the file attributes comprise company names, conference types and file names;
the format determining module is used for determining a target signature format of the keyword according to a preset keyword configuration table; the keyword configuration table comprises corresponding relations among the file attributes, the signers and the signature formats, and the corresponding relations are used for indicating the signature formats which can be used by the signers in the files corresponding to the file attributes;
the analysis module is used for carrying out text analysis processing on the PDF file to be signed to obtain a plurality of block data set based on the context sequence and the position information of each block data;
and the position determining module is used for determining target block data which contains the keywords and is matched with the target signature format of the keywords according to the context information of the plurality of block data, and determining the position represented by the position information of the target block data as the signature position of the PDF file to be signed.
In a third aspect, an embodiment of the present application provides a PDF file-based signature location determining apparatus, including a memory, a processor, and a computer program stored in the memory and executable on the processor, where the processor implements the steps of any one of the above-mentioned methods in the first aspect when executing the computer program.
In a fourth aspect, the present application provides a computer-readable storage medium, where a computer program is stored, and when executed by a processor, the computer program implements the steps of any one of the methods in the first aspect.
In a fifth aspect, the present application provides a computer program product, which when run on a terminal device, causes the terminal device to execute the method of any one of the above first aspects.
According to the signature position determining method based on the PDF file, the key words representing the signer are determined according to the file attributes of the PDF file to be signed, the target signature format of the key words is determined according to the preset key word configuration table, then the PDF file to be signed is analyzed to obtain the plurality of block data and the position information of each block data, the block data which contain the key words and are matched with the target signature format of the key words are used as the target block data, the position represented by the position information of the target block data is determined as the signature position of the PDF file to be signed, the automatic positioning of the signature position of the PDF file to be signed is achieved, the signature position of each PDF file to be signed does not need to be manually specified, and the signature efficiency of the electronic file is greatly improved.
It is understood that the beneficial effects of the second aspect to the fifth aspect can be referred to the related description of the first aspect, and are not described herein again.
Drawings
In order to more clearly illustrate the technical solutions in the embodiments of the present application, the drawings needed to be used in the embodiments or the prior art descriptions will be briefly described below, and it is obvious that the drawings in the following description are only some embodiments of the present application, and it is obvious for those skilled in the art to obtain other drawings without creative efforts.
Fig. 1 is a schematic flowchart of a PDF file based signature location determination method according to an embodiment of the present application;
FIG. 2 is a schematic flow chart illustrating a process for determining a target signature format of a keyword according to an embodiment of the present application;
FIG. 3 is a flowchart illustrating a method for determining block data according to an embodiment of the present disclosure;
FIG. 4 is a flowchart illustrating a process of determining target tile data for a keyword according to an embodiment of the present application;
FIG. 5 is a schematic flow chart diagram illustrating the determination of target block data according to another embodiment of the present application;
fig. 6 is a schematic flowchart of a method for generating a signed PDF file according to an embodiment of the present application;
FIG. 7 is a schematic diagram illustrating a signature effect of a signed PDF document according to an embodiment of the present application;
fig. 8 is a schematic structural diagram of a PDF file based signature location determining apparatus according to an embodiment of the present application;
fig. 9 is a schematic structural diagram of a PDF file based signature location determining device according to an embodiment of the present application.
Detailed Description
In the following description, for purposes of explanation and not limitation, specific details are set forth, such as particular system structures, techniques, etc. in order to provide a thorough understanding of the embodiments of the present application. It will be apparent, however, to one skilled in the art that the present application may be practiced in other embodiments that depart from these specific details. In other instances, detailed descriptions of well-known systems, devices, circuits, and methods are omitted so as not to obscure the description of the present application with unnecessary detail.
Reference throughout this specification to "one embodiment" or "some embodiments," or the like, means that a particular feature, structure, or characteristic described in connection with the embodiment is included in one or more embodiments of the present application. Thus, appearances of the phrases "in one embodiment," "in some embodiments," "in other embodiments," or the like, in various places throughout this specification are not necessarily all referring to the same embodiment, but rather "one or more but not all embodiments" unless specifically stated otherwise. The terms "comprising," "including," "having," and variations thereof mean "including, but not limited to," unless expressly specified otherwise.
The following describes the technical solutions of the present application and how to solve the above technical problems with specific embodiments. It is worth mentioning that the specific embodiments listed below may be combined with each other, and the same or similar concepts or processes may not be described in detail in some embodiments.
Fig. 1 is a schematic flowchart of a method for determining a signature position based on a PDF file according to an embodiment of the present application, where an execution subject of the embodiment is a PDF-based signature position determining device, and the PDF-based signature position determining device may include, but is not limited to, a mobile terminal such as a smartphone, a tablet computer, and a wearable device, and may also be a desktop computer, a robot, a server, and the like. As shown in fig. 1, the PDF file-based signature location determination method includes:
and S10, acquiring the file attribute of the PDF file to be signed, and determining the keyword for representing the signer according to the file attribute.
In this embodiment, the file attribute of the PDF file to be signed may include one or more of a company name, a conference type, a file type, and a file name.
The PDF file to be signed may be any conference file that three parties need to sign, such as a voting table to be voted, a resolution file, an authorization entrusting file, and the like. It should be understood that three meetings refer to stockholder's colleagues, board of directors, and prisoners.
For example, the PDF file to be signed is a voting table of the first and third conferences of company a, and specifically a voting ticket of "adjust the proposal about the company organization architecture". The company name of the PDF file to be signed is company a, the conference type is a temporary conference in the board of directors, the file type is a voting ticket, and the file name is "adjustment proposal for company organization".
In this embodiment, the keyword for characterizing the signer may refer to the name of the signer.
In this embodiment, determining the keyword for characterizing the signer according to the file attribute may refer to searching and determining a keyword corresponding to the file attribute in a preset keyword configuration table.
The preset keyword configuration table (hereinafter referred to as the keyword configuration table) may include signers needing to sign in the files corresponding to different file attributes.
In this embodiment, there may be a plurality of signers in the PDF file to be signed. For example, if the PDF file to be signed is a vote for "adjustment of proposal for corporate organization architecture", then the signer in the PDF file to be signed may include multiple ones of the board of directors.
It should be understood that the process of determining the signing location of each signer is the same, and the following embodiments only describe the exemplary embodiment process of determining the signing location of one signer, but do not limit the number of signers.
And S20, determining the target signature format of the keyword according to a preset keyword configuration table.
The keyword configuration table in this step is the same as the keyword configuration table in step 10, and in addition to the signers needing to sign in the files corresponding to different file attributes, the keyword configuration table may further include file attributes and a corresponding relationship between the signers and the file attributes, the signers and the signature formats, where the corresponding relationship is used to indicate signature formats that can be used by the signers in the files corresponding to the file attributes.
In this embodiment, the keyword configuration table is predetermined according to the file attributes, signers, and signature formats of a large number of three signature files. Alternatively, each company may have a corresponding keyword configuration table.
For example, please refer to table 1 together, where table 1 is a keyword configuration table of company Y provided in the embodiment of the present application. The key word configuration table for company Y includes the meeting type, document name, signer, and the various signature formats available to the signer. The conference types in the keyword configuration table can be classified into various types, such as a board conference, a shareholder conference, and a proctor conference. There may be more than one document name for each meeting type, more than one signer for each document, and more than one signature format available to each signer.
Illustratively, as shown in table 1, the signer of the document a in the board of the director conference has two bits of zhang san and lie san, in the document a, signature formats usable by zhang san are a colon sign, a cross-line up-down sign, a cross-line left-right sign and a bracket sign, and signature formats usable by lie san are a colon sign and a bracket sign.
The signer of the file B in the board of the; in document B, the signature formats that lie four can use are colon signature and bracket signature, and the signature formats that wang five can use are colon signature and cross line up-down signature.
TABLE 1 keyword configuration Table for Y corporation
Figure BDA0002520706790000081
It should be understood that the number of the keyword configuration tables of the plurality of companies may be plural, and table 1 is only an exemplary description and does not limit the number of the keyword configuration tables.
In this embodiment, the target signature format of the keyword may refer to at least one of signature formats corresponding to signers represented by the keyword.
The multiple signature formats corresponding to each signer can have different priorities, and the target signature format of the keyword can be one or more signature formats with higher priority.
And S30, performing text analysis processing on the PDF file to be signed, and acquiring a plurality of block data and position information of each block data which are arranged based on the context sequence.
In this embodiment, determining the signature position requires determining the position of the keyword first, and in order to quickly obtain the position of the keyword in the PDF file to be signed, the PDF file to be signed may be subjected to blocking processing, and text information of each block data and position information of each block data are determined.
S40, according to the context information of the plurality of block data, determining target block data which contains keywords and is matched with the target signature format of the keywords, and determining the position represented by the position information of the target block data as the signature position of the PDF file to be signed.
In this embodiment, a plurality of block data in the PDF file to be signed all include keywords, and in different block data, the keywords may be in a single row, may also be combined with a signature indicator, and may also be combined with a text string of a context.
The signature indicators may include colon, parentheses, horizontal lines, etc., among others.
Where the keyword alone in a line and in combination with a signature indicator may characterize a signature position, the keyword in combination with a string of context characterizes a semantic position other than a signature position.
In this embodiment, determining target block data that includes a keyword and matches a target signature format of the keyword according to context information of the plurality of block data may be determining to search first block data that includes the keyword, then determining a format of the keyword in each first block data according to context information of each first block data, and finally determining first block data corresponding to the format of the keyword that matches the target signature format as the target block data.
There may be more than one target signature format, and there may also be more than one target block data.
Illustratively, the signer represented by the keyword is zhang san, and the signature formats usable in the document a in the board of director conference are a colon signature, a horizontal line upper and lower signature, a horizontal line left and right signature and a bracket signature in sequence from high to low according to the priority, so that the four sorted signature formats are used as the target signature formats of the keyword.
Finding a plurality of first block data containing Zhang III in a PDF file to be signed, determining the format of Zhang III in each first block data according to the context information of Zhang III in each first block data, for example, if the context of Zhang III is blank, the format of Zhang III can be known as a single line, if the context of Zhang III is colon, the Zhang III and the colon of a signature indicator can be known to form a signature position, and if the upper and lower questions of Zhang III are character strings, the Zhang III can be known to form semantic information.
And for each signature format in the target signature formats, sequentially comparing the similarity of the signature format with the format of Zhang III in the first block data, and determining the first block data corresponding to the format of the keyword matched with the signature format as target block data.
The matching may mean that the signature indicators in the signature format are the same, and the relative position relationship between the signature indicators and the keywords is the same.
The related position relationship between the signature indicator and the keyword can include: the signature indicators are located to the left, right, below, above, outside, etc. of the keyword.
For example, where the signature indicator is a horizontal line, the horizontal line may be above or below the keyword; when the signature indicators are brackets, the signature indicators may be located outside the keywords, i.e., the keywords are located inside the brackets.
In this embodiment, after determining the target block data that includes the keyword and matches the target signature format of the keyword, the position represented by the position information of the target block data may be determined as the signature position of the PDF file to be signed.
The position represented by the position information of the target block data can be determined according to the signature indicator in the target block data and the relative position relationship between the keyword and the signature indicator.
In one example, the signature indicator is a colon, and the signature indicator is located at the right side of the keyword, then the starting position of the position represented by the position information of the target block data is the position of the m-th character at the right side of the colon in the target block data, where m is a preset value, and may be 2, for example.
In another example, the signature indicator is a horizontal line, and the horizontal line is located below the keyword, then the location of the location information representation of the target block data is located below the horizontal line.
It should be understood that if there are multiple target block data, there are multiple signature locations characterizing the PDF file to be signed.
According to the signature position determining method based on the PDF file, the key words representing the signer are determined according to the file attributes of the PDF file to be signed, the target signature format of the key words is determined according to the preset key word configuration table, then the PDF file to be signed is analyzed to obtain a plurality of block data and the position information of each block data, the block data which contain the key words and are matched with the target signature format of the key words are used as the target block data, the position represented by the position information of the target block data is determined as the signature position of the PDF file to be signed, and the automatic positioning of the signature position of the PDF file to be signed is achieved.
Fig. 2 is a schematic flowchart of a process for determining a target signature format of a keyword according to an embodiment of the present application, and describes one possible implementation manner of determining the target signature format of the keyword according to a preset keyword configuration table in step 20 in fig. 1. As shown in fig. 2, determining the target signature format of the keyword according to the preset keyword configuration table includes:
s201, searching a plurality of candidate signature formats corresponding to the keywords from the keyword configuration table.
In this embodiment, each signer in the keyword configuration table may correspond to a plurality of signature formats. The candidate signature format corresponding to the keyword may be a signature format corresponding to a signer represented by the keyword.
Alternatively, the corresponding keyword configuration table may be determined according to the company name in the document attribute, and a plurality of signature formats that can be used by the corresponding signer may be looked up in the document word configuration table.
Since the signatures of the signers represented by the keywords can have different available signature formats in different documents, finding at least one candidate signature format corresponding to the signer may refer to determining a target document according to the meeting type and the document name in the document attribute, and finding and determining the available signature format of the corresponding signer in the target document.
S202, sorting the candidate signature formats according to the priorities respectively corresponding to the candidate signature formats, and taking the sorted candidate signature formats as target signature formats of the keywords.
In this embodiment, the plurality of signature formats corresponding to each signer may have different priorities.
Wherein the priority represents the number of times the signer uses the signature format in the history signed document, and the higher the priority, the more the signature is used.
For example, referring to table 1 together, the signer represented by the keyword is zhang san, and the signature formats usable in the document a in the board of the.
Fig. 3 is a flowchart illustrating a method for determining multiple block data according to an embodiment of the present application, and describes one possible implementation manner of step 30 in the embodiment of fig. 1. As shown in fig. 3, performing text parsing on the PDF file to be signed to obtain a plurality of block data set based on the context order and the location information of each block data may refer to:
s301, acquiring text information of the PDF file to be signed.
In this step, the text information includes a plurality of text sequences, wherein the text sequences are character strings in which more than one character is sequentially formed.
In this step, obtaining the text information of the PDF file to be signed may refer to converting the PDF file to be signed into text information through the information analysis script.
S302, dividing the text information into a plurality of natural sentences, and determining the position information of each natural sentence.
In the PDF file, the boundaries between sentences or paragraphs of text information are generally formed by delimiters, and the text information can be divided into a plurality of natural sentences by identifying the delimiters in the text information.
The delimiters may be punctuation marks, and the text information may be divided into a plurality of natural sentences by identifying punctuation marks of the text information.
In this step, determining the position information of each natural sentence may refer to determining the page number and the row number of each natural sentence in the PDF file and the number of columns of the starting position and the ending position of each natural sentence.
And S303, taking each natural sentence as block data, and setting the block data based on the context sequence.
In this step, the plurality of block data are sorted based on the context semantic information.
Fig. 4 is a flowchart illustrating a process of determining target block data of a keyword according to an embodiment of the present application, and describes one possible implementation manner of determining the target block data of the keyword in step 40 in the embodiment of fig. 1. As shown in fig. 4, determining target block data containing a keyword and matching a target signature format of the keyword according to context information of a plurality of block data includes:
s401, at least two first block data containing keywords are searched from the plurality of block data.
In this embodiment, searching the first block data containing the keyword from the plurality of block data may be to compare the keyword with each block data in sequence, and determine the first block data containing the keyword; the method can also comprise the following steps: combining at least two block data to generate combined data, judging whether the combined data contains a keyword, and determining the block data to which the keyword belongs as first block data under the condition that the combined data contains the keyword.
For example, searching a plurality of first block data containing a keyword from a plurality of block data may include:
step 1, combining a plurality of block data corresponding to the same page number to generate a character string according to the page number information of each block data, and obtaining a plurality of character strings to be compared.
The plurality of block data corresponding to the same page number may be a plurality of block data on the same page number in the PDF file to be signed.
The generating of the character string by combining the multiple block data corresponding to the same page number may be performed by splicing text sequences of the multiple block data on the same page number in the PDF file to be signed according to semantic information of context to obtain the character string to be compared corresponding to each page number.
And 2, aiming at each character string to be compared, comparing each block data in the character string to be compared with the keyword one by one under the condition that the character string to be compared contains the keyword, and storing the block data matched with the keyword as first block data.
In this step, whether each character string to be compared contains a keyword or not can be sequentially judged according to the page number information corresponding to each character string to be compared and the sequence of page numbers from small to large.
For example, if the character string to be compared corresponding to the page number 1 contains the keyword, comparing each block data in the character string to be compared corresponding to the page number 1 with the keyword one by one, saving the block data matched with the keyword as first block data, saving the first block data to a preset file list, and clearing the character string to be compared corresponding to the page number 1.
And if the character string to be compared corresponding to the page number 1 does not contain the keyword, directly clearing the character string to be compared corresponding to the page number 1. And then, the character strings to be compared corresponding to the page number 2 are judged until the character strings to be compared corresponding to each page number are judged one by one, so that a final file list is obtained, and the text information and the position information of the first block data are stored in the file list.
S402, combining the keywords in each first partition data with the context information of the keywords to generate signature blocks, and analyzing to obtain the signature format of each signature block.
In this embodiment, in the different first block data, the keyword may be a single line, may be combined with the signature indicator, or may be combined with the string of the context.
The signature indicators may include colon, parentheses, horizontal lines, etc., among others.
In this embodiment, the signature block includes text information of the keyword and a signature format of the keyword.
In this embodiment, the signature format of each signature block includes a signature format of a keyword.
The analyzing to obtain the signature format of each signature block may refer to determining context information of the keyword in each signature block and a relative position of the keyword with respect to the context information.
For example, if the keywords are in rows alone, the context information of the keywords is blank; combining the keyword with the signature indicators, determining the relative position of the keyword relative to the signature indicators if the context information of the keyword is one of the signature indicators; if the keyword is combined with the character string of the context, the context information of the keyword is the character string.
S403, carrying out similarity analysis on the signature format of each signature block in the plurality of signature blocks and the target signature format of the keyword, and determining N target block data with the highest similarity to the target signature format of the keyword, wherein N is an integer greater than or equal to 2.
In this embodiment, the target signature formats of the keywords may include a plurality of candidate signature formats, and the priorities of the candidate signature formats are different.
Determining N target block data with the highest similarity to the target signature formats of the keywords may refer to determining the block data matched with each candidate signature format as one target block data; the method also can be used for determining the block data matched with each candidate signature format as candidate block data, sorting the candidate block data according to the priority of various candidate signature formats from high to low, and determining N candidate block data with the top ranking as N target block data.
The step of determining the block data matching with the candidate signature format may be to, for each candidate signature format, search for a signature block determined to match with the candidate signature format, and determine the block data corresponding to the signature block as the target block data.
In this embodiment, on one hand, a character string is generated by combining a plurality of block data corresponding to the same page number, a plurality of character strings to be compared are obtained, and when the character strings to be compared include a keyword, the block data to which the keyword belongs is determined as the first block data, so that the efficiency of determining the first block data is improved.
On the other hand, a PDF file to be signed usually needs to have a plurality of signature positions, and the signature format of the middle keyword in each signature position may be different, in this embodiment, similarity analysis is performed on the signature format of the keyword in each first block data and the target signature format of the keyword, so as to determine N target block data with the highest similarity to the target signature format of the keyword, and positions represented by position information of the N target block data are used as signature positions of the PDF file to be signed, so as to achieve determination of a plurality of signature positions in the PDF file to be signed.
Fig. 5 is a flowchart illustrating a process of determining target block data according to another embodiment of the present application, and describes one possible implementation manner of determining N target block data with the highest target signature format similarity to the keyword in the embodiment of fig. 4. As shown in fig. 5, performing similarity analysis between the signature format of each signature block in the plurality of signature blocks and the target signature format of the keyword to determine N target block data with the highest similarity to the target signature format of the keyword, includes:
s411, aiming at each candidate signature format in the target signature formats, matching and analyzing the signature format of each signature block in the plurality of signature blocks and the candidate signature format, and under the condition that the signature block matched with the candidate signature format exists, saving the signature block matched with the candidate signature format as the candidate signature block.
In this embodiment, the number of signatures of each signature block and the candidate signature formats may be sequentially matched and analyzed according to the page numbers of the signature blocks in the PDF file to be signed and the order of the page numbers from small to large.
In this embodiment, the matching analysis of the signature format of the signature block and the candidate signature format may refer to first determining whether the context information of the keyword in the signature block is consistent with the signature indicator in the candidate signature format, and determining whether the relative position of the keyword with respect to the signature indicator in the two signature formats is consistent under the condition that the context information of the keyword in the signature block is consistent with the signature indicator in the candidate signature format, and if the relative position is consistent, determining that the signature format of the signature block is matched with the candidate signature format.
It will be appreciated that this step may result in a plurality of candidate signature blocks having a one-to-one correspondence with candidate signature formats.
S412, sorting the candidate signature blocks according to the priority order of the candidate signature formats of the keywords.
In this embodiment, the candidate signature blocks are sorted in the order of priority from high to low of the multiple candidate signature formats of the keyword.
And S413, determining the block data respectively corresponding to the first N candidate signature blocks in sequence as N target block data with the highest target signature format similarity with the keyword.
In this embodiment, the number N of the target block data may be set according to a file attribute of the PDF file to be signed, and is pre-stored in the keyword configuration table, that is, the number of the target block data with different file attributes may be different.
Optionally, the number N of the target block data may also be a set value, and the target block data of each PDF file to be signed is a set value, for example, 3.
Fig. 6 is a flowchart of a method for generating a signed PDF file according to an embodiment of the present application, which describes a possible implementation manner of generating a signed PDF file after determining a signature position of a PDF to be signed based on the method in the foregoing embodiment. As shown in fig. 6, after determining the location of the location information representation of the target block data as the signature location of the PDF file to be signed, the method further comprises:
s501, according to the keyword configuration table, determining signature styles of signers, and respectively combining and storing the signature styles and various candidate signature formats to generate a plurality of signature effects.
In this embodiment, the keyword configuration table further includes a signature style for characterizing the signer. For example, the signature style of "Zhang III" of the signer is Zhang III of a regular script.
The signature style of the signer in the keyword configuration table can be determined according to the historical signature style of the signer.
In order to obtain the signature effect which accords with the signing habit of the signer, the signature style of the signer is determined according to the keyword configuration table, and then the signature style and a plurality of candidate signature formats are combined and stored respectively to generate a plurality of signature effects, so that the signer can select from the plurality of signature effects.
Wherein the plurality of candidate signature formats may be a plurality of higher priority candidate signature formats of the signer.
Illustratively, the signature represented by the keyword is zhang san, the signature formats usable by zhang san in the document a in the board of the director meeting are a colon signature, a horizontal line upper and lower signature, a horizontal line left and right signature and a bracket signature according to the priority from high to low, and the signature pattern of zhang san is zhang san of a regular script, and then the signature pattern and the four signature formats are combined in sequence to generate and store four signature effects.
S502, aiming at each target block data in the N target block data, responding to the received signature effect checking operation information, and sequentially adding and displaying a plurality of signature effects to the corresponding positions of the target block data.
The purpose of this step is to sequentially preview the signature effect in the displaying step 501 at the signature position corresponding to each target block data, so that the signer can view the display effect of the signed PDF file.
Since there may be more target block data, i.e. there may be more signature positions determined by automatic positioning, it is necessary to determine the best signature effect of each target block data.
In this embodiment, the PDF file based signature position determining device sequentially selects each target block data according to the position information of the N target block data. And aiming at the currently selected target block data, responding to the received signature effect viewing operation by the signature position determining equipment based on the PDF file, sequentially adding a plurality of signature effects to the corresponding positions of the currently selected target block data and displaying the signature effects.
And the plurality of signature effects are sequentially displayed according to the priorities of the corresponding signature formats.
The step of sequentially selecting the target block data according to the position information of the target block data may be to select the target block data from small to large according to the size of the page number of the target block data.
For example, please refer to fig. 7 together, fig. 7 is a schematic diagram illustrating a signature effect of a signed PDF file according to an embodiment of the present application. There are two target block data in the PDF file to be signed, and the signature positions represented by the two target block data are line 8, column 25 of the first page and line 15, column 25 of the 2 nd page, respectively, it should be understood that "zhang san" shown in fig. 5: "is the target block data of page 2.
As shown in fig. 7, four buttons of cancel, preview, next group and confirm are arranged below the signed PDF file, wherein the signature effect viewing operation may refer to the signer selecting the "preview" or "next group" button.
For example, the currently selected target block data is line 15, column 25 of page 2, and in response to a received signer's "preview" operation (i.e., the signer selects the preview button), the signature effect corresponding to the signature format with the highest priority is displayed at the position indicated by the target block data based on the signature position of the PDF file; if the signer is satisfied with the signature effect, the 'confirm' button can be directly selected, if the signer is not satisfied with the signature effect, the 'next group' button can be selected, the operation of selecting the 'next group' button by the signer is responded based on the signature position of the PDF file, and the operation is sequentially displayed according to the priority of the signature effect from high to low until the signer selects the 'confirm' button or the 'cancel' button.
Optionally, if all four signature effects are displayed, the four signature effects are displayed again according to the priority order of the plurality of signature effects from high to low.
Alternatively, if the signer selects the "cancel" button, it returns to the signature effect preview effect graph of the previous target tile data. In this example, the target block data of the 25 th column of line 8 with the signature position as the first page is returned, and the selected signature effect of the target block data of the position is preview-displayed to replace the signature effect according to the signer's selection. It should be understood that if the currently selected target block data is the smallest target block data among the page data, the "cancel" button is in the unselected state.
In this embodiment, the signature effect of the corresponding position of each target block data may be different.
S503, responding to the received signature to determine operation information, and generating a signed PDF file according to the currently displayed signature effect.
In this embodiment, the signature determination operation may mean that the signer selects the "determine" button in the embodiment of fig. 7.
In this embodiment, the currently displayed signature effect may refer to a signature effect selected by a signer selecting a "ok" button in each target block data.
Referring to fig. 7, if the signer selects the "ok" button, the process jumps to the next target block data, and if the current target block data is the last target block data, the signed PDF file is generated according to the signature effect determined by each target block data.
The method for generating the signed PDF file provided by the embodiment of the application can respond to the operation of checking the signature effect of a signer, sequentially display a plurality of signature effects of the signer at the signature position specified by each target block data, and determine the operation to generate the signed PDF file according to the signature of the signer, so that the signed PDF file can meet the requirement of the signer.
It should be understood that, the sequence numbers of the steps in the foregoing embodiments do not imply an execution sequence, and the execution sequence of each process should be determined by its function and inherent logic, and should not constitute any limitation to the implementation process of the embodiments of the present application.
Based on the method for determining the signature position based on the PDF file provided by the above embodiment, an embodiment of an apparatus implementing the above method embodiment is further provided in the embodiment of the present invention.
Fig. 8 is a schematic structural diagram of a PDF file based signature location determining apparatus according to an embodiment of the present application. As shown in fig. 8, the PDF file based signature location determining apparatus 60 includes an obtaining module 601, a format determining module 602, a parsing module 603, and a location determining module 604; wherein the content of the first and second substances,
the acquisition module 601 is configured to acquire a file attribute of a PDF file to be signed, and determine a keyword for representing a signer according to the file attribute; the file attributes comprise company names, conference types and file names;
a format determining module 602, configured to determine a target signature format of the keyword according to a preset keyword configuration table; the keyword configuration table comprises corresponding relations among the file attributes, the signers and the signature formats, and the corresponding relations are used for indicating the signature formats which can be used by the signers in the files corresponding to the file attributes;
the parsing module 603 is configured to perform text parsing on the PDF file to be signed, and obtain a plurality of block data set based on the context sequence and location information of each block data;
the position determining module 604 is configured to determine, according to the context information of the multiple block data, target block data that includes a keyword and matches a target signature format of the keyword, and determine a position represented by position information of the target block data as a signature position of the PDF file to be signed.
Optionally, the parsing module 603 performs text parsing on the PDF file to be signed to obtain a plurality of block data set based on the context order and location information of each block data, and the obtaining may include:
acquiring text information of a PDF file to be signed;
dividing text information into a plurality of natural sentences, and determining position information of each natural sentence;
and taking each natural sentence as block data, and setting a plurality of block data based on the context sequence.
Optionally, each signer in the keyword configuration table corresponds to a plurality of signature formats, and the plurality of signature formats corresponding to each signer have different priorities.
The format determining module 602 determines a target signature format of the keyword according to a preset keyword configuration table, including:
searching a plurality of candidate signature formats corresponding to the keywords from the keyword configuration table;
and sorting the candidate signature formats according to the priorities respectively corresponding to the candidate signature formats, and taking the sorted candidate signature formats as target signature formats of the keywords.
Optionally, the determining module 604 determines the target block data containing the keyword and matching with the target signature format of the keyword according to the context information of the plurality of block data, including:
searching at least two first block data containing keywords from the plurality of block data;
combining the keywords in each first partition data with the context information of the keywords to generate signature blocks, and analyzing to obtain the signature format of each signature block;
and performing similarity analysis on the signature format of each signature block in the plurality of signature blocks and the target signature format of the keyword, and determining N target block data with the highest similarity to the target signature format of the keyword, wherein N is an integer greater than or equal to 2.
Optionally, the position information of each block data includes page number information; the position determining module 604 searches for at least two first tile data containing the keyword from the plurality of tile data, including:
combining a plurality of block data corresponding to the same page number to generate a character string according to the page number information of each block data to obtain a plurality of character strings to be compared;
and aiming at each character string to be compared, comparing each block data in the character string to be compared with the keyword one by one under the condition that the character string to be compared contains the keyword, and storing the block data matched with the keyword as first block data.
Optionally, the position determining module 604 performs similarity analysis on the signature format of each signature block in the plurality of signature blocks and the target signature format of the keyword, and determines N target block data with the highest similarity to the target signature format of the keyword, including:
aiming at each candidate signature format in the target signature formats, performing matching analysis on the signature format of each signature block in the plurality of signature blocks and the candidate signature format, and saving the signature block matched with the candidate signature format as a candidate signature block;
sorting the candidate signature blocks according to the priority order of the candidate signature formats in the target signature format;
and determining the block data respectively corresponding to the first N candidate signature blocks in sequence as N target block data with the highest target signature format similarity with the keyword.
Optionally, the keyword configuration table further comprises a signature style for characterizing the signer; the PDF file signature location-based determining apparatus 600 further includes a display module.
Optionally, the display module is configured to: determining the signature style of a signer according to the keyword configuration table, and respectively combining and storing the signature style and a plurality of candidate signature formats to generate a plurality of signature effects; for each target block data in the N target block data, responding to the received signature effect checking operation information, and sequentially adding and displaying a plurality of signature effects to the corresponding positions of the target block data; and responding to the received signature to determine operation information, and generating a signed PDF file according to the currently displayed signature effect.
The signature position determining apparatus based on a PDF file provided in the embodiment shown in fig. 8 may be used to implement the technical solution in the foregoing method embodiment, and the implementation principle and technical effect are similar, which is not described herein again.
Fig. 9 is a schematic diagram of a PDF file based signature location determining device according to an embodiment of the present application. As shown in fig. 9, the PDF file-based signature position determining device 70 of this embodiment includes: at least one processor 701, a memory 702, and a computer program stored in said memory 702 and executable on said processor 701. The apparatus for determining a signature position based on a PDF file further includes a communication section 703, wherein the processor 701, the memory 702, and the communication section 703 are connected by a bus 704.
The processor 701, when executing the computer program, implements the steps in each embodiment of the above-described PDF file based signature location determination method, such as steps S10 to S40 in the embodiment shown in fig. 1. Alternatively, the processor 701, when executing the computer program, implements the functions of each module/unit in each device embodiment described above, for example, the functions of the modules 601 to 604 shown in fig. 8.
Illustratively, a computer program may be partitioned into one or more modules/units that are stored in the memory 702 and executed by the processor 701 to accomplish the present application. One or more modules/units may be a series of computer program instruction segments capable of performing specific functions, which are used to describe the execution process of the computer program in the PDF file based signature location determining device 70.
Those skilled in the art will appreciate that fig. 9 is merely an example of a PDF file based signature location determining device and does not constitute a limitation of PDF file based signature location determining devices and may include more or fewer components than shown, or some components in combination, or different components such as input output devices, network access devices, buses, etc.
The Processor 701 may be a Central Processing Unit (CPU), other general purpose Processor, a Digital Signal Processor (DSP), an Application Specific Integrated Circuit (ASIC), an off-the-shelf Programmable Gate Array (FPGA) or other Programmable logic device, discrete Gate or transistor logic, discrete hardware components, or the like. A general purpose processor may be a microprocessor or the processor may be any conventional processor or the like.
The memory 702 may be an internal storage unit of the device for determining the signature position of the PDF file, or an external storage device of the device for determining the signature position of the PDF file, such as a plug-in hard disk, a Smart Memory Card (SMC), a Secure Digital (SD) Card, a Flash memory Card (Flash Card), and the like. The memory 702 is used to store the computer program and other programs and data needed to determine the device based on the signature location of the PDF file. The memory 702 may also be used to temporarily store data that has been output or is to be output.
The bus may be an Industry Standard Architecture (ISA) bus, a Peripheral Component Interconnect (PCI) bus, an Extended ISA (EISA) bus, or the like. The bus may be divided into an address bus, a data bus, a control bus, etc. For ease of illustration, the buses in the figures of the present application are not limited to only one bus or one type of bus.
The embodiments of the present application also provide a computer-readable storage medium, where a computer program is stored, and when the computer program is executed by a processor, the computer program implements the steps in the above-mentioned method embodiments.
The embodiments of the present application provide a computer program product, which when running on a mobile terminal, enables the mobile terminal to implement the steps in the above method embodiments when executed.
The integrated unit, if implemented in the form of a software functional unit and sold or used as a stand-alone product, may be stored in a computer readable storage medium. Based on such understanding, all or part of the processes in the methods of the embodiments described above can be implemented by a computer program, which can be stored in a computer-readable storage medium and can implement the steps of the embodiments of the methods described above when the computer program is executed by a processor. Wherein the computer program comprises computer program code, which may be in the form of source code, object code, an executable file or some intermediate form, etc. The computer readable medium may include at least: any entity or device capable of carrying computer program code to a photographing apparatus/terminal apparatus, a recording medium, computer Memory, Read-Only Memory (ROM), Random Access Memory (RAM), an electrical carrier signal, a telecommunications signal, and a software distribution medium. Such as a usb-disk, a removable hard disk, a magnetic or optical disk, etc. In certain jurisdictions, computer-readable media may not be an electrical carrier signal or a telecommunications signal in accordance with legislative and patent practice.
In the above embodiments, the descriptions of the respective embodiments have respective emphasis, and reference may be made to the related descriptions of other embodiments for parts that are not described or illustrated in a certain embodiment.
Those of ordinary skill in the art will appreciate that the various illustrative elements and algorithm steps described in connection with the embodiments disclosed herein may be implemented as electronic hardware or combinations of computer software and electronic hardware. Whether such functionality is implemented as hardware or software depends upon the particular application and design constraints imposed on the implementation. Skilled artisans may implement the described functionality in varying ways for each particular application, but such implementation decisions should not be interpreted as causing a departure from the scope of the present application.
In the embodiments provided in the present application, it should be understood that the disclosed apparatus/network device and method may be implemented in other ways. For example, the above-described apparatus/network device embodiments are merely illustrative, and for example, a module or a unit may be divided into only one logical function, and may be implemented in other ways, for example, a plurality of units or components may be combined or integrated into another system, or some features may be omitted or not executed. In addition, the shown or discussed mutual coupling or direct coupling or communication connection may be an indirect coupling or communication connection through some interfaces, devices or units, and may be in an electrical, mechanical or other form.
Units described as separate parts may or may not be physically separate, and parts displayed as units may or may not be physical units, may be located in one place, or may be distributed on a plurality of network units. Some or all of the units can be selected according to actual needs to achieve the purpose of the solution of the embodiment.
The above embodiments are only used to illustrate the technical solutions of the present application, and not to limit the same; although the present application has been described in detail with reference to the foregoing embodiments, it should be understood by those of ordinary skill in the art that: the technical solutions described in the foregoing embodiments may still be modified, or some technical features may be equivalently replaced; such modifications and substitutions do not substantially depart from the spirit and scope of the embodiments of the present application and are intended to be included within the scope of the present application.

Claims (10)

1. A signature position determining method based on a PDF file is characterized by comprising the following steps:
acquiring the file attribute of a PDF file to be signed, and determining a keyword for representing a signer according to the file attribute; wherein the file attributes comprise a company name, a conference type and a file name;
determining a target signature format of the keyword according to a preset keyword configuration table; the keyword configuration table comprises corresponding relations among file attributes, signers and signature formats, and the corresponding relations are used for indicating the signature formats which can be used by the signers in the files corresponding to the file attributes;
performing text analysis processing on the PDF file to be signed to obtain a plurality of block data set based on a context sequence and position information of each block data;
and determining target block data which contains the keywords and is matched with the target signature format of the keywords according to the context information of the plurality of block data, and determining the position represented by the position information of the target block data as the signature position of the PDF file to be signed.
2. The method for determining the signature position based on the PDF file according to claim 1, wherein the performing text parsing on the PDF file to be signed to obtain a plurality of block data set based on a context order and position information of each of the block data comprises:
acquiring text information of the PDF file to be signed;
dividing the text information into a plurality of natural sentences, and determining the position information of each natural sentence;
and taking each natural sentence as block data, and setting the plurality of block data based on the context sequence.
3. The PDF file based signature location determining method of claim 1, wherein each signer in the keyword configuration table corresponds to a plurality of signature formats, and the plurality of signature formats corresponding to each signer have different priorities;
the determining of the target signature format of the keyword according to a preset keyword configuration table comprises the following steps:
searching a plurality of candidate signature formats corresponding to the keywords from the keyword configuration table;
and sorting the candidate signature formats according to the priorities respectively corresponding to the candidate signature formats, and taking the sorted candidate signature formats as the target signature formats of the keywords.
4. The method as claimed in claim 3, wherein the determining the target block data containing the keyword and matching with the target signature format of the keyword according to the context information of the plurality of block data comprises:
searching at least two first block data containing the key words from the plurality of block data;
combining the keywords in each first partition data with the context information of the keywords to generate signature blocks, and analyzing to obtain the signature format of each signature block;
and performing similarity analysis on the signature format of each signature block in the plurality of signature blocks and the target signature format of the keyword, and determining N target block data with the highest similarity to the target signature format of the keyword, wherein N is an integer greater than or equal to 2.
5. The PDF file based signature location determining method according to claim 4, wherein the location information of each of the block data includes page number information;
the searching at least two first block data containing the keyword from the plurality of block data comprises:
combining a plurality of block data corresponding to the same page number to generate a character string according to the page number information of each block data to obtain a plurality of character strings to be compared;
aiming at each character string to be compared, under the condition that the character string to be compared contains the key words, comparing each block data in the character string to be compared with the key words one by one, and saving the block data matched with the key words as the first block data.
6. The PDF file based signature location determining method as recited in claim 4, wherein said analyzing the similarity between the signature format of each of said plurality of signature blocks and the target signature format of said keyword to determine N target block data with highest similarity to the target signature format of said keyword, comprises:
for each candidate signature format in the target signature formats, performing matching analysis on the signature format of each signature block in the plurality of signature blocks and the candidate signature format, and saving the signature block matched with the candidate signature format as a candidate signature block;
sorting the candidate signature blocks according to the priority order of multiple candidate signature formats in the target signature format;
and determining the block data respectively corresponding to the first N candidate signature blocks in sequence as N target block data with the highest target signature format similarity with the keyword.
7. The PDF file based signature location determining method of any one of claims 4 to 6, wherein said keyword configuration table further comprises a signature pattern for characterizing a signer;
after the position represented by the position information of the target block data is determined as the signature position of the PDF file to be signed, the method further includes:
determining the signature style of the signer according to the keyword configuration table, and respectively combining and storing the signature style and a plurality of candidate signature formats to generate a plurality of signature effects;
for each target block data in the N target block data, responding to the received signature effect checking operation information, and sequentially adding and displaying the plurality of signature effects to the corresponding positions of the target block data;
and responding to the received signature to determine operation information, and generating a signed PDF file according to the currently displayed signature effect.
8. A PDF file based signature location determination apparatus, comprising:
the acquisition module is used for acquiring the file attribute of the PDF file to be signed and determining a keyword for representing a signer according to the file attribute; wherein the file attributes comprise a company name, a conference type and a file name;
the format determining module is used for determining the target signature format of the keywords according to a preset keyword configuration table; the keyword configuration table comprises corresponding relations among file attributes, signers and signature formats, and the corresponding relations are used for indicating the signature formats which can be used by the signers in the files corresponding to the file attributes;
the analysis module is used for carrying out text analysis processing on the PDF file to be signed to obtain a plurality of block data set based on a context sequence and position information of each block data;
and the position determining module is used for determining target block data which contains the keywords and is matched with the target signature format of the keywords according to the context information of the plurality of block data, and determining the position represented by the position information of the target block data as the signature position of the PDF file to be signed.
9. A PDF file based signature location determination device comprising a memory, a processor and a computer program stored in said memory and executable on said processor, wherein said processor implements the steps of the method according to any one of claims 1 to 7 when executing said computer program.
10. A computer-readable storage medium, in which a computer program is stored which, when being executed by a processor, carries out the steps of the method according to any one of claims 1 to 7.
CN202010489992.4A 2020-06-02 2020-06-02 Signature position determining method, device and equipment based on PDF (portable document format) file Active CN111832264B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202010489992.4A CN111832264B (en) 2020-06-02 2020-06-02 Signature position determining method, device and equipment based on PDF (portable document format) file

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202010489992.4A CN111832264B (en) 2020-06-02 2020-06-02 Signature position determining method, device and equipment based on PDF (portable document format) file

Publications (2)

Publication Number Publication Date
CN111832264A true CN111832264A (en) 2020-10-27
CN111832264B CN111832264B (en) 2023-09-15

Family

ID=72897559

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202010489992.4A Active CN111832264B (en) 2020-06-02 2020-06-02 Signature position determining method, device and equipment based on PDF (portable document format) file

Country Status (1)

Country Link
CN (1) CN111832264B (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112612815A (en) * 2020-12-29 2021-04-06 鲁班(北京)电子商务科技有限公司 Method and device for positioning bid evaluation file and electronic equipment
CN113112222A (en) * 2021-04-02 2021-07-13 北京点聚信息技术有限公司 Automatic signature method and system for OFD format file based on big data analysis
CN113407665A (en) * 2021-05-25 2021-09-17 北京有竹居网络技术有限公司 Text comparison method, device, medium and electronic equipment

Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101751612A (en) * 2008-12-18 2010-06-23 鸿富锦精密工业(深圳)有限公司 System for approving electronic contract and method therefor
US20100235727A1 (en) * 2009-03-14 2010-09-16 Ashton Brian G Systems and Methods for Dynamic Electronic Signature Placement
CN101903901A (en) * 2007-12-14 2010-12-01 鲁特宛有限公司 The system and method that is used for the electronic signature capture of e-contracting transactions
US20180039828A1 (en) * 2015-02-18 2018-02-08 Fuji Xerox Australia Pty Limited Generating a signed electronic document
CN109543444A (en) * 2018-10-25 2019-03-29 深圳壹账通智能科技有限公司 A kind of file signature method, apparatus, storage medium and server
CN109672536A (en) * 2018-12-24 2019-04-23 航天信息股份有限公司 A kind of batch PDF file digital signature method and system
CN109960785A (en) * 2017-12-25 2019-07-02 顺丰科技有限公司 File on-line signing method, apparatus, storage medium and equipment
CN110245220A (en) * 2019-05-05 2019-09-17 深圳法大大网络科技有限公司 Electronic document signs method, apparatus and server, storage medium
CN110750964A (en) * 2019-09-18 2020-02-04 平安科技(深圳)有限公司 Information adding method and related device

Patent Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101903901A (en) * 2007-12-14 2010-12-01 鲁特宛有限公司 The system and method that is used for the electronic signature capture of e-contracting transactions
CN101751612A (en) * 2008-12-18 2010-06-23 鸿富锦精密工业(深圳)有限公司 System for approving electronic contract and method therefor
US20100235727A1 (en) * 2009-03-14 2010-09-16 Ashton Brian G Systems and Methods for Dynamic Electronic Signature Placement
US20180039828A1 (en) * 2015-02-18 2018-02-08 Fuji Xerox Australia Pty Limited Generating a signed electronic document
CN109960785A (en) * 2017-12-25 2019-07-02 顺丰科技有限公司 File on-line signing method, apparatus, storage medium and equipment
CN109543444A (en) * 2018-10-25 2019-03-29 深圳壹账通智能科技有限公司 A kind of file signature method, apparatus, storage medium and server
CN109672536A (en) * 2018-12-24 2019-04-23 航天信息股份有限公司 A kind of batch PDF file digital signature method and system
CN110245220A (en) * 2019-05-05 2019-09-17 深圳法大大网络科技有限公司 Electronic document signs method, apparatus and server, storage medium
CN110750964A (en) * 2019-09-18 2020-02-04 平安科技(深圳)有限公司 Information adding method and related device

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
孟晓冬;: "浅谈电子签名系统在工程设计中的开发与应用", 锅炉制造, no. 4, pages 62 - 64 *
方伟: "基于PDM的异地协同设计平台的研究与应用", 中国优秀硕士学位论文全文数据库 信息科技辑, pages 138 - 2276 *
杨威: "细粒度安全中XML数字签名原始语义保护研究", 中国优秀硕士学位论文全文数据库 信息科技辑, pages 138 - 173 *

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112612815A (en) * 2020-12-29 2021-04-06 鲁班(北京)电子商务科技有限公司 Method and device for positioning bid evaluation file and electronic equipment
CN112612815B (en) * 2020-12-29 2024-04-19 鲁班(北京)电子商务科技有限公司 Method and device for positioning evaluation mark file and electronic equipment
CN113112222A (en) * 2021-04-02 2021-07-13 北京点聚信息技术有限公司 Automatic signature method and system for OFD format file based on big data analysis
CN113112222B (en) * 2021-04-02 2024-02-09 北京点聚信息技术有限公司 Automatic signature method and system for OFD format file based on big data analysis
CN113407665A (en) * 2021-05-25 2021-09-17 北京有竹居网络技术有限公司 Text comparison method, device, medium and electronic equipment

Also Published As

Publication number Publication date
CN111832264B (en) 2023-09-15

Similar Documents

Publication Publication Date Title
CN111832264B (en) Signature position determining method, device and equipment based on PDF (portable document format) file
US8577882B2 (en) Method and system for searching multilingual documents
CN108701121B (en) Dispatching user input to multiple input fields in a user interface
CN107526846B (en) Method, device, server and medium for generating and sorting channel sorting model
CN109948122B (en) Error correction method and device for input text and electronic equipment
CN109033385A (en) Picture retrieval method, device, server and storage medium
CN110929125A (en) Search recall method, apparatus, device and storage medium thereof
JP2019520662A (en) Content-based search and retrieval of trademark images
CN111198936B (en) Voice search method and device, electronic equipment and storage medium
CN107832035B (en) Voice input method of intelligent terminal
US9977766B2 (en) Keyboard input corresponding to multiple languages
CN111506608A (en) Method and device for comparing structured texts
CN109545223B (en) Voice recognition method applied to user terminal and terminal equipment
US20160196256A1 (en) Method to input content in a structured manner with real-time assistance and validation
CN112329409B (en) Cell color conversion method and device and electronic equipment
CN111602129B (en) Smart search for notes and ink
CN106202423A (en) A kind of file ordering method and apparatus
CN104090875A (en) Information retrieval system and information retrieval method
CN112883218A (en) Image-text combined representation searching method, system, server and storage medium
US20160314130A1 (en) Computing device with spell-check feature
CN105893614A (en) Information recommendation method and device and electronic equipment
CN112989011B (en) Data query method, data query device and electronic equipment
US9600480B2 (en) Systems and methods for indexing and linking electronic documents
CN111050194B (en) Video sequence processing method, video sequence processing device, electronic equipment and computer readable storage medium
KR102254329B1 (en) Method and Apparatus for Providing User Customized Search Result

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant