CN111914548A - Intelligent document filling method and system and related equipment - Google Patents

Publication number
CN111914548A
Authority
CN
China
Prior art keywords
word segmentation
target
preset
text
sequence
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202010744800.XA
Other languages
Chinese (zh)
Inventor
陈卓基
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shenzhen Kingdee Tianyanyun Computing Co ltd
Original Assignee
Shenzhen Kingdee Tianyanyun Computing Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shenzhen Kingdee Tianyanyun Computing Co ltd
Priority to CN202010744800.XA
Publication of CN111914548A

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00 Handling natural language data
    • G06F40/20 Natural language analysis
    • G06F40/279 Recognition of textual entities
    • G06F40/284 Lexical analysis, e.g. tokenisation or collocates

Abstract

The embodiment of the invention provides a method, a system and related equipment for intelligently filling a document, which are used for realizing automatic filling of the document and improving the document filling efficiency. The method provided by the embodiment of the invention comprises the following steps: receiving a text input by a user, performing word segmentation processing on the text by adopting a preset word segmentation component, and outputting a word segmentation sequence; matching the word segmentation sequence with preset business logic, and if a successfully matched target business logic exists, creating a target document corresponding to the target business logic; and automatically filling the segmented words in the word segmentation sequence into the item fields of the target document according to the business rules of the target business logic.

Description

Intelligent document filling method and system and related equipment
Technical Field
The invention relates to the technical field of intelligent filling of documents, in particular to an intelligent filling method, an intelligent filling system and related equipment of documents.
Background
The enterprise management cloud platform is an enterprise management software cloud platform leading the development trend of the IT industry in China, not only meets the increasing information management requirements of medium and small enterprises, but also provides efficient, safe and stable one-stop professional software application service for large group enterprises containing branch companies.
When opening a document and filling in its data on an existing enterprise management cloud platform, the user must manually locate and enter the corresponding application, find the corresponding menu item and click it to open the document. Once the document is open, the user must click the items to be filled in one by one to obtain the input focus and enter each piece of information by hand. For example, a business trip application form may require items such as companions, business category and reason for application; each item must be clicked to obtain the input focus, and its information must be typed into the item field manually.
In the existing enterprise management cloud platform, the items to be filled in need to be manually clicked one by one to obtain an input focus, and all items of information are manually filled in, so that time and labor are wasted, and the efficiency is low.
Disclosure of Invention
The embodiment of the invention provides a method, a system and related equipment for intelligently filling a document, which are used for realizing automatic filling of the document and improving the document filling efficiency.
The first aspect of the embodiment of the invention provides an intelligent filling method for a document, which can comprise the following steps:
receiving a text input by a user, performing word segmentation processing on the text by adopting a preset word segmentation component, and outputting a word segmentation sequence;
matching the word segmentation sequence with preset business logic, and if a successfully matched target business logic exists, creating a target document corresponding to the target business logic;
and automatically filling the segmented words in the word segmentation sequence into the item fields of the target document according to the business rules of the target business logic.
Optionally, as a possible implementation manner, in the method for intelligently filling a document in the embodiment of the present invention, matching the word segmentation sequence with a preset service logic may include:
extracting a target word segmentation corresponding to preset service logic, judging whether the word segmentation sequence contains the target word segmentation, and if so, judging that the word segmentation sequence is successfully matched with the target service logic corresponding to the target word segmentation.
Optionally, as a possible implementation manner, in the method for intelligently filling a document in the embodiment of the present invention, matching the word segmentation sequence with a preset service logic may include:
extracting context characteristics corresponding to preset business logic, wherein the context characteristics are used for indicating the parts of speech of a plurality of participles in the context and the combination relation of the participles;
and judging whether the word segmentation sequence meets the context characteristics, if so, judging that the word segmentation sequence is successfully matched with the target service logic corresponding to the context characteristics.
Optionally, as a possible implementation manner, in the embodiment of the present invention, the performing word segmentation processing on the text by using a preset word segmentation component may include:
and sending the acquired text data input by the user to a back-end server through a plug-in bound on the interactive interface, calling a word segmentation component by the back-end server to segment words, and then returning a word segmentation result.
Optionally, as a possible implementation manner, in the method for intelligently filling a document in the embodiment of the present invention, the preset word segmentation component is a jieba word segmentation component, the obtained text data input by the user is sent to a back-end server through a plug-in bound to the interactive interface, and the back-end server calls the word segmentation component to perform word segmentation, which may include:
the text is sent to a server where a jieba word segmentation component is located by adopting an Apache Httpclient client programming tool kit;
loading a custom dictionary corresponding to a preset service in the jieba word segmentation component;
and performing accurate word segmentation processing with part-of-speech tagging on the text by adopting a jieba word segmentation component.
Optionally, as a possible implementation manner, the intelligent document filling method in the embodiment of the present invention may further include:
and extracting preset fields in the text, and processing the formats of the field values of the preset fields into a uniform format.
A second aspect of an embodiment of the present invention provides an intelligent document filling system, which may include:
the processing module is used for receiving a text input by a user, performing word segmentation processing on the text by adopting a preset word segmentation component and outputting a word segmentation sequence;
the matching module is used for matching the word segmentation sequence with preset service logic, and if the target service logic which is successfully matched exists, a target receipt corresponding to the target service logic is created;
and the filling module is used for automatically filling the segmented words in the word segmentation sequence into the item fields of the target document according to the business rules of the target business logic.
Optionally, as a possible implementation manner, in an embodiment of the present invention, the matching module may include:
and the matching unit is used for extracting a target word segmentation corresponding to preset service logic, judging whether the word segmentation sequence contains the target word segmentation, and if so, judging that the word segmentation sequence is successfully matched with the target service logic corresponding to the target word segmentation.
Optionally, as a possible implementation manner, in the embodiment of the present invention, the matching module may include:
the extraction unit is used for extracting context characteristics corresponding to preset business logic, and the context characteristics are used for indicating the parts of speech of a plurality of participles in the context and the combination relation of the participles;
and the judging unit is used for judging whether the word segmentation sequence meets the context characteristics, and if so, judging that the word segmentation sequence is successfully matched with the target service logic corresponding to the context characteristics.
Optionally, as a possible implementation manner, in an embodiment of the present invention, the processing module may include:
and the processing unit sends the acquired text data input by the user to a back-end server through a plug-in bound on the interactive interface, and the back-end server calls the word segmentation component to segment words and then returns word segmentation results.
Optionally, as a possible implementation manner, in an embodiment of the present invention, the preset word segmentation component is a jieba word segmentation component, and the processing unit may include:
the sending subunit sends the text to a server where the jieba word segmentation component is located by adopting an Apache Httpclient client programming tool package;
the loading subunit loads a custom dictionary corresponding to the preset service in the jieba word segmentation component;
and the word segmentation subunit is used for performing accurate word segmentation processing with part-of-speech tagging on the text by adopting a jieba word segmentation component.
Optionally, as a possible implementation manner, the intelligent document filling system in the embodiment of the present invention may further include:
and the preprocessing module is used for extracting preset fields in the text and processing the formats of the field values of the preset fields into a uniform format.
A third aspect of embodiments of the present invention provides a computer apparatus, which includes a processor, and the processor is configured to implement the steps of the first aspect or of any one of the possible implementation manners of the first aspect when executing a computer program stored in a memory.
A fourth aspect of the embodiments of the present invention provides a computer-readable storage medium, on which a computer program is stored, where the computer program, when executed by a processor, implements the steps of the first aspect or of any one of the possible implementation manners of the first aspect.
According to the technical scheme, the embodiment of the invention has the following advantages:
in the embodiment of the invention, the document intelligent filling system can receive a text input by a user, perform word segmentation processing on the text by adopting a preset word segmentation component, output a word segmentation sequence, match the word segmentation sequence with preset business logic, create a target document corresponding to the target business logic if the target business logic which is successfully matched exists, and automatically fill the words in the word segmentation sequence into a project column of the target document according to the business rules of the target business logic. Compared with the prior art, the user does not need to click item columns in the document one by one for inputting, the user operation time is saved, and the document filling efficiency is improved.
Drawings
FIG. 1 is a schematic diagram of an embodiment of an intelligent document filling method according to an embodiment of the present invention;
FIG. 2 is a schematic diagram of a specific application embodiment of the intelligent document filling method according to an embodiment of the present invention;
FIG. 3 is a schematic diagram of an embodiment of an intelligent document filling system according to an embodiment of the present invention;
FIG. 4 is a diagram of a computer device according to an embodiment of the present invention.
Detailed Description
The embodiment of the invention provides a method, a system and related equipment for intelligently filling a document, which are used for realizing automatic filling of the document and improving the document filling efficiency.
In order to make the technical solutions of the present invention better understood, the technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are only a part of the embodiments of the present invention, and not all of the embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.
The terms "first," "second," "third," "fourth," and the like in the description and in the claims, as well as in the drawings, are used for distinguishing between similar elements and not necessarily for describing a particular sequential or chronological order. It will be appreciated that the data so used may be interchanged under appropriate circumstances such that the embodiments described herein may be practiced otherwise than as specifically illustrated or described herein. Furthermore, the terms "comprises," "comprising," and "having," and any variations thereof, are intended to cover a non-exclusive inclusion, such that a process, method, system, article, or apparatus that comprises a list of steps or elements is not necessarily limited to those steps or elements expressly listed, but may include other steps or elements not expressly listed or inherent to such process, method, article, or apparatus.
For convenience of understanding, a specific flow in the embodiment of the present invention is described below, and referring to fig. 1, an embodiment of an intelligent document filling method in the embodiment of the present invention may include:
S101, receiving a text input by a user, performing word segmentation processing on the text by adopting a preset word segmentation component, and outputting a word segmentation sequence;
In the existing enterprise management cloud platform, after a document is opened, the items to be filled in must be clicked one by one to obtain the input focus, and each piece of information must be entered manually.
In order to improve the document filling efficiency, the intelligent document filling system in the embodiment of the invention can provide an interactive interface for receiving the text input by the user. The system can adopt a preset word segmentation component to perform word segmentation processing on the received text and output a word segmentation sequence, wherein the word segmentation sequence can contain the part of speech of each segmented word. The system can then fill in the document automatically based on the word segmentation sequence.
S102, matching the word segmentation sequence with a preset service logic, and if a target service logic which is successfully matched exists, creating a target document corresponding to the target service logic;
In order to realize automatic filling of documents, the business logic corresponding to each type of document needs to be preset. The obtained word segmentation sequence is then matched against the preset business logic, and if a successfully matched target business logic exists, a target document corresponding to the target business logic is created.
Optionally, as a possible implementation manner, the specific process of matching the word segmentation sequence with the preset business logic may include: extracting the target words corresponding to the preset business logic, judging whether the word segmentation sequence contains a target word, and if so, judging that the word segmentation sequence is successfully matched with the target business logic corresponding to that target word.
For example, target words such as "travel", "business trip" and "going out" may be set for the business trip application form. When the text input by the user is "Generate a business trip application form for an off-site exchange lasting more than three months, from March 5 to March 11, travelling by plane with Zhang San and Li Si from Foshan, Guangdong, China to Beijing", the successfully matched business logic can be determined to be the business trip application form service, because the word segmentation sequence contains the target word "business trip".
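The target-word matching described above can be sketched as follows. This is a minimal illustration, not the patented implementation: the `TARGET_WORDS` table, the `travel_application` key and the function name are hypothetical, and the word segmentation sequence is assumed to be a list of `(word, part_of_speech)` pairs (jieba-style tags such as `ns` for place names and `nr` for person names are assumed, with English stand-ins for the segmented words).

```python
# Hypothetical table: business logic name -> target words that trigger it.
TARGET_WORDS = {
    "travel_application": {"travel", "business trip", "going out"},
}


def match_by_target_word(segments, target_words=TARGET_WORDS):
    """Return the first business logic whose target word occurs in the
    word segmentation sequence, or None when nothing matches."""
    words = {word for word, _pos in segments}
    for logic, triggers in target_words.items():
        if words & triggers:  # at least one target word is present
            return logic
    return None


# Assumed segmentation result for the example text (illustrative only).
segments = [
    ("generate", "v"), ("Zhang San", "nr"), ("Li Si", "nr"),
    ("Foshan", "ns"), ("Beijing", "ns"),
    ("business trip", "n"), ("application form", "n"),
]
matched = match_by_target_word(segments)
```

A set intersection keeps the check independent of word order, which fits the "contains the target word" condition stated in the text.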
Optionally, as a possible implementation manner, the specific process of matching the word segmentation sequence with the preset business logic may include: extracting the context features corresponding to the preset business logic, wherein the context features indicate the parts of speech of a plurality of segmented words in the context and the combination relation of those words; and judging whether the word segmentation sequence satisfies the context features, and if so, judging that the word segmentation sequence is successfully matched with the target business logic corresponding to the context features.
For example, for the overseas travel application form, the context feature can be set as follows: the sequence contains one or more segmented words whose part of speech is a place name, and a contained place name exists in a preset white list of foreign place names. When the text input by the user is "Application form for an off-site exchange lasting more than three months, from March 5 to March 11, going with Zhang San and Li Si from Busan, South Korea to New York", the successfully matched business logic can be determined to be the overseas travel application form service, because the word segmentation sequence contains "Busan, South Korea".
It is understood that the business logic of the overseas travel application form service and the business trip application form service in the above examples is only exemplary; in practical applications the business logic may be set reasonably according to the specific requirements of the user, and is not limited herein.
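The context-feature check for the overseas example can be sketched in the same style. The white list contents, the function name and the `ns` place-name tag are assumptions made for illustration; a real deployment would define its own features per business logic.

```python
# Hypothetical white list of foreign place names.
FOREIGN_PLACES = {"Busan", "New York"}


def matches_overseas_context(segments, whitelist=FOREIGN_PLACES):
    """True when at least one segmented word is tagged as a place name
    ('ns' assumed) and that place name is on the foreign white list."""
    return any(pos == "ns" and word in whitelist for word, pos in segments)


overseas = [("Zhang San", "nr"), ("Busan", "ns"), ("New York", "ns")]
domestic = [("Zhang San", "nr"), ("Foshan", "ns"), ("Beijing", "ns")]
is_overseas = matches_overseas_context(overseas)
is_domestic_overseas = matches_overseas_context(domestic)
```

Checking the part of speech before the white list avoids a false match when a white-listed string appears as something other than a place name, for example as part of a person name.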
S103, automatically filling the participles in the participle sequence into the item column of the target document according to the business rule of the target business logic.
After the correct business logic is matched, the intelligent document filling system can automatically fill the segmented words in the word segmentation sequence into the item fields of the target document according to the business rules of the target business logic. For example, when the text input by the user is "Generate a business trip application form for an off-site exchange lasting more than three months, from March 5 to March 11, travelling by plane with Zhang San and Li Si from Foshan, Guangdong, China to Beijing" and the business trip application form has been matched, segmented words such as the departure place "Foshan", the destination "Beijing", the departure date "March 5" and the end date "March 11" can be extracted from the word segmentation sequence according to the business rules of the business trip application form and automatically filled into the corresponding item fields, generating a complete business trip application form.
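One possible shape for such a business rule is sketched below. It is a deliberate simplification under stated assumptions: the field names are hypothetical, the tags `ns`, `nr` and `t` (place, person, time) are assumed, and the rule simply takes the first and second place names as departure place and destination. A production rule would also have to handle extra place names such as a province or country.

```python
def fill_travel_form(segments):
    """Map tagged segmented words onto hypothetical form fields by
    part of speech and order of appearance."""
    places = [w for w, pos in segments if pos == "ns"]
    dates = [w for w, pos in segments if pos == "t"]
    people = [w for w, pos in segments if pos == "nr"]
    return {
        "departure_place": places[0] if len(places) > 0 else None,
        "destination": places[1] if len(places) > 1 else None,
        "departure_date": dates[0] if len(dates) > 0 else None,
        "end_date": dates[1] if len(dates) > 1 else None,
        "companions": people,
    }


# Assumed segmentation result (illustrative values only).
segments = [
    ("March 5", "t"), ("March 11", "t"),
    ("Zhang San", "nr"), ("Li Si", "nr"),
    ("Foshan", "ns"), ("Beijing", "ns"),
]
form = fill_travel_form(segments)
```

Fields with no matching word stay `None`, so the user can still complete them by hand.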
In the embodiment of the invention, the document intelligent filling system can receive a text input by a user, perform word segmentation processing on the text by adopting a preset word segmentation component, output a word segmentation sequence, match the word segmentation sequence with preset business logic, create a target document corresponding to the target business logic if the target business logic which is successfully matched exists, and automatically fill the words in the word segmentation sequence into a project column of the target document according to the business rules of the target business logic. Compared with the prior art, the user does not need to click item columns in the document one by one for inputting, the user operation time is saved, and the document filling efficiency is improved.
In practical applications, the word segmentation component may be the Yaha word segmentation component, the Tsinghua THULAC component or the jieba word segmentation component. Optionally, as a possible implementation manner, when the preset word segmentation component is the jieba word segmentation component, performing word segmentation processing on the text by using the preset word segmentation component may include: sending the text to the server where the jieba word segmentation component is located by using the Apache HttpClient client programming toolkit (or another common HTTP tool); loading a custom dictionary corresponding to the preset business in the jieba word segmentation component; and performing precise-mode word segmentation with part-of-speech tagging on the text by using the jieba word segmentation component.
In practical applications, when a text containing dates is segmented directly, the dates expressed in the text cannot be determined from the resulting word segmentation sequence. The date formats in the text therefore need to be processed. Optionally, as a possible implementation manner, on the basis of the foregoing embodiment, before the text is segmented by the preset word segmentation component, preset fields in the text may be extracted and the field values of the preset fields may be converted into a uniform format; for example, when the preset field is a date, different date formats may be converted into one uniform format.
For example, for the text "I want to stay from tomorrow until the 28th", the complete dates expressed in the text cannot be determined unless the text is first subjected to date processing. Scripts that handle different date formats can be called to convert the date formats in the text into a uniform format before word segmentation, so that subsequent services can conveniently obtain date data in the uniform format. For example, if the test date is April 26, 2020, the date data in the uniform format obtained from the text "I want to stay from tomorrow until the 28th" includes "2020-04-27" and "2020-04-28".
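The date preprocessing in this example can be sketched as below. The source does not describe the actual scripts, so this is only an assumed approach: it recognises the word "tomorrow" and ordinal days such as "28th" in English text, resolves ordinal days within the month of the reference date, and returns ISO-formatted dates. The function name is hypothetical.

```python
import re
from datetime import date, timedelta


def normalize_dates(text, today):
    """Return ISO dates for 'tomorrow' and 'Nth' mentions, in order of
    appearance. Ordinal days are assumed to fall in today's month."""
    pattern = r"\btomorrow\b|\b(\d{1,2})(?:st|nd|rd|th)\b"
    found = []
    for match in re.finditer(pattern, text):
        if match.group(1) is None:       # matched the word 'tomorrow'
            found.append(today + timedelta(days=1))
        else:                            # matched an ordinal day
            found.append(today.replace(day=int(match.group(1))))
    return [d.isoformat() for d in found]


result = normalize_dates(
    "I want to stay from tomorrow until the 28th", date(2020, 4, 26)
)
# result == ["2020-04-27", "2020-04-28"]
```

The sketch does not roll over to the next month when the ordinal day has already passed, and `date.replace` raises `ValueError` for a day that does not exist in the month; both cases would need explicit handling in real use.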
For convenience of understanding, the intelligent document filling method in the embodiment of the present invention will be described below with reference to specific application embodiments.
The intelligent document filling method in this embodiment is implemented with a server based on the Python Django framework, the jieba word segmentation component, a cloud platform plug-in and Apache HttpClient. In the plug-in bound to the interactive interface, Apache HttpClient is used to send the text data input by the user over HTTP to a back-end service built with the Python Django framework. The server calls the jieba word segmentation component to segment the text and returns the word segmentation result. The result is matched against the preset business logic using the parts of speech of the individual words and the part-of-speech relations among multiple words, and the successfully matched business logic then processes the result to generate the corresponding document. Compared with the traditional JDK URLConnection, Apache HttpClient is easier to use and more flexible: it makes it easy for the client to send HTTP requests and convenient for developers to test HTTP-based interfaces, which improves development efficiency and helps make the code more robust.
Referring to fig. 2, the method specifically includes the following steps:
a) Text acquisition: in the development platform, a PC-side interactive page is created and a text-field control is added to the page to receive the text to be segmented from the user. A preset plug-in is then bound to the page, and the input text data is obtained through a function in the plug-in.
b) Text transmission and reception: the cloud platform plug-in initiates an HTTP request through Apache HttpClient to send the acquired text to the back-end service built with the Python Django framework.
c) Word segmentation of the text: the jieba word segmentation component is introduced into the back-end service of the server, and a custom dictionary is added to the component so that the special vocabulary of different business scenarios can be recognised. Finally, the text is segmented in precise mode with part-of-speech tagging, and the word segmentation result is returned in JSON format.
d) Document generation: after the cloud platform plug-in has sent the text using Apache HttpClient, it receives the JSON word segmentation data returned by the server and determines the document type from specific words in the returned JSON data. For example, once the specific words indicate a business trip application, the plug-in can check whether the returned result contains words whose part of speech is a place name and whether those place names match the domestic travel reimbursement standard of the business trip application form, and then generate a business trip application form carrying the travel information. The text may also contain other information, such as the business trip dates and companions, which can be identified by the date and person-name parts of speech and filled into the business trip application form automatically.
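The JSON exchange between steps c) and d) can be pictured with the sketch below. The source does not specify the JSON layout, so the `segments`, `word` and `pos` keys and both function names are assumptions; the HTTP transport (Apache HttpClient on the plug-in side, Django on the server side) is left out so that the example stays self-contained.

```python
import json


def build_segment_response(tagged):
    """Server side: serialise (word, pos) pairs into a JSON string
    using an assumed layout."""
    payload = {"segments": [{"word": w, "pos": p} for w, p in tagged]}
    return json.dumps(payload, ensure_ascii=False)


def parse_segment_response(body):
    """Plug-in side: turn the JSON body back into (word, pos) tuples."""
    return [(item["word"], item["pos"]) for item in json.loads(body)["segments"]]


body = build_segment_response([("Beijing", "ns"), ("business trip", "n")])
pairs = parse_segment_response(body)
```

`ensure_ascii=False` keeps Chinese words readable in the response body instead of escaping them, which is convenient when inspecting segmentation results by hand.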
Illustratively, example text 1 is entered: "Generate a business trip application form for an off-site exchange lasting more than three months, from March 5 to March 11, travelling by plane with Zhang San and Li Si from Foshan, Guangdong to Beijing." A business trip application form is generated, and after word segmentation its fields are assigned automatically from the key data processed by the business code, such as the companions, business trip category, departure place, destination, departure date and end date.
In addition, different document types can be generated according to various business logic processes such as the parts of speech of different words in the word segmentation result, the front and back collocation among different parts of speech, the part of speech of self-defined words in a special business scene and the like.
Illustratively, example text 2 is entered: "From March 5 to March 11, travelling by plane with Zhang San and Li Si from Busan, South Korea to New York, application form for an off-site exchange lasting more than three months." (The text does not state which document to generate. Instead, the words tagged as place names are filtered out after word segmentation and identified as foreign place names; the foreign place names are then checked against the overseas (cross-border) reimbursement standard, and if a matching place name exists, the overseas (cross-border) application form is identified.) After word segmentation, the fields of the overseas travel application form are assigned automatically from the key data processed by the business code, such as the companions, business trip category, departure place, destination, departure date and end date.
The jieba word segmentation component used in the invention does not recognise different date formats well. For example, for the text "I want to stay from tomorrow until the 28th", direct word segmentation without date processing cannot determine the complete dates expressed in the text. Scripts that can filter and recognise different date formats can be called to convert the date formats in the text into a uniform format before word segmentation, so that subsequent services can conveniently obtain date data in the uniform format. For example, if the test date is April 26, 2020, the date data in the uniform format obtained from the text "I want to stay from tomorrow until the 28th" includes "2020-04-27" and "2020-04-28".
Referring to fig. 3, an embodiment of the present invention further provides an intelligent document filling system, which includes:
the processing module 301, configured to receive a text input by a user, perform word segmentation processing on the text by using a preset word segmentation component, and output a word segmentation sequence;
the matching module 302, configured to match the word segmentation sequence with preset business logic, and if a successfully matched target business logic exists, create a target document corresponding to the target business logic;
and the filling module 303, configured to automatically fill the segmented words in the word segmentation sequence into the item fields of the target document according to the business rules of the target business logic.
Optionally, as a possible implementation manner, in the embodiment of the present invention, the matching module may include:
and the matching unit is used for extracting the target word segmentation corresponding to the preset service logic, judging whether the word segmentation sequence contains the target word segmentation, and if so, judging that the word segmentation sequence is successfully matched with the target service logic corresponding to the target word segmentation.
Optionally, as a possible implementation manner, in the embodiment of the present invention, the matching module may include:
the extraction unit is used for extracting context characteristics corresponding to preset business logic, and the context characteristics are used for indicating the parts of speech of a plurality of participles and the combination relation of the plurality of participles in the context;
and the judging unit is used for judging whether the word segmentation sequence meets the context characteristics, and if so, judging that the word segmentation sequence is successfully matched with the target service logic corresponding to the context characteristics.
Optionally, as a possible implementation manner, in an embodiment of the present invention, the processing module may include:
the processing unit is configured to send the acquired text data input by the user to a back-end server through a plug-in bound to the interactive interface, where the back-end server calls the word segmentation component to segment the text and then returns the word segmentation result.
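The round trip to the back-end server might be sketched as below. The example uses Python's standard `urllib` purely for illustration, not the plug-in mechanism of the embodiment; the endpoint URL and the JSON request and response shapes are assumptions that the source does not specify.

```python
import json
import urllib.request

SEGMENT_URL = "http://localhost:8000/segment"  # hypothetical endpoint


def build_segment_request(text, url=SEGMENT_URL):
    """Build a POST request carrying the user's text as JSON (assumed format)."""
    body = json.dumps({"text": text}, ensure_ascii=False).encode("utf-8")
    return urllib.request.Request(
        url,
        data=body,
        headers={"Content-Type": "application/json; charset=utf-8"},
        method="POST",
    )


def request_segmentation(text, url=SEGMENT_URL, timeout=5.0):
    """Send the text to the back end and return its decoded JSON reply."""
    request = build_segment_request(text, url)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))
```

Separating request construction from sending keeps the payload format checkable without a running server.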
Optionally, as a possible implementation manner, in an embodiment of the present invention, the preset word segmentation component is a jieba word segmentation component, and the processing unit may include:
the sending subunit is configured to send the text to the server where the jieba word segmentation component is located by using the Apache HttpClient client-side programming toolkit;
the loading subunit is configured to load, in the jieba word segmentation component, a custom dictionary corresponding to the preset service;
and the word segmentation subunit is configured to perform accurate word segmentation processing with part-of-speech tagging on the text by using the jieba word segmentation component.
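On the server side, loading a custom dictionary and segmenting with part-of-speech tags could look roughly like the following. `jieba.load_userdict` and `jieba.posseg.cut` are existing jieba calls; the wrapper `segment_with_pos`, its parameters, and the injectable `cut` argument (included so the wrapper can be exercised without jieba installed) are hypothetical.

```python
def segment_with_pos(text, user_dict_path=None, cut=None):
    """Return [(word, pos_tag), ...] for text.

    cut is any callable yielding (word, tag) pairs; when omitted, jieba's
    part-of-speech segmenter is imported and used.
    """
    if cut is None:
        import jieba
        import jieba.posseg as pseg

        if user_dict_path:
            # Custom dictionary file: one "word [freq] [tag]" entry per line.
            jieba.load_userdict(user_dict_path)
        cut = pseg.cut
    return [(word, flag) for word, flag in cut(text)]
```

With jieba installed, `segment_with_pos("我要报销打车费50元", user_dict_path="business_dict.txt")` would return the tagged word segments; the dictionary file name is a placeholder.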
Optionally, as a possible implementation manner, the intelligent document filling system in the embodiment of the present invention may further include:
the preprocessing module is configured to extract preset fields from the text and convert the field values of the preset fields into a uniform format.
It is clear to those skilled in the art that, for convenience and brevity of description, the specific working processes of the above-described systems, apparatuses and units may refer to the corresponding processes in the foregoing method embodiments, and are not described herein again.
The intelligent document filling system in the embodiment of the present invention has been described above from the perspective of modular functional entities. Referring to fig. 4, the computer apparatus in the embodiment of the present invention is described below from the perspective of hardware processing:
the computer device 1 may include a memory 11, a processor 12 and an input/output bus 13. The processor 12, when executing the computer program, implements the steps in the intelligent document filling method embodiment shown in fig. 1, such as steps 101 to 103. Alternatively, the processor, when executing the computer program, implements the functions of each module or unit in the above-described device embodiments.
In some embodiments of the present invention, the processor is specifically configured to implement the following steps:
receiving a text input by a user, performing word segmentation processing on the text by adopting a preset word segmentation component, and outputting a word segmentation sequence;
matching the word segmentation sequence with a preset service logic, and if a target service logic which is successfully matched exists, creating a target document corresponding to the target service logic;
and automatically filling the word segments in the word segmentation sequence into the item columns of the target document according to the service rule of the target service logic.
Optionally, as a possible implementation manner, the processor may be further configured to implement the following steps:
extracting the target word segment corresponding to the preset service logic, judging whether the word segmentation sequence contains the target word segment, and if so, judging that the word segmentation sequence is successfully matched with the target service logic corresponding to the target word segment.
Optionally, as a possible implementation manner, the processor may be further configured to implement the following steps:
extracting context features corresponding to the preset service logic, wherein the context features indicate the parts of speech of a plurality of word segments and the combination relation of those word segments in the context;
and judging whether the word segmentation sequence satisfies the context features, and if so, judging that the word segmentation sequence is successfully matched with the target service logic corresponding to the context features.
Optionally, as a possible implementation manner, the preset word segmentation component is a jieba word segmentation component, and the processor may be further configured to implement the following steps:
sending the text to the server where the jieba word segmentation component is located by using the Apache HttpClient client-side programming toolkit;
loading a custom dictionary corresponding to a preset service in the jieba word segmentation component;
and performing accurate word segmentation processing with part-of-speech tagging on the text by adopting a jieba word segmentation component.
Optionally, as a possible implementation manner, the processor may be further configured to implement the following steps:
and extracting preset fields in the text, and processing the formats of the field values of the preset fields into a uniform format.
The memory 11 includes at least one type of readable storage medium, and the readable storage medium includes a flash memory, a hard disk, a multimedia card, a card type memory (e.g., SD or DX memory, etc.), a magnetic memory, a magnetic disk, an optical disk, and the like. The memory 11 may in some embodiments be an internal storage unit of the computer device 1, for example a hard disk of the computer device 1. The memory 11 may also be an external storage device of the computer apparatus 1 in other embodiments, such as a plug-in hard disk provided on the computer apparatus 1, a Smart Media Card (SMC), a Secure Digital (SD) Card, a Flash memory Card (Flash Card), and the like. Further, the memory 11 may also include both an internal storage unit and an external storage device of the computer apparatus 1. The memory 11 may be used not only to store application software installed in the computer apparatus 1 and various types of data, such as codes of the computer program 01, but also to temporarily store data that has been output or is to be output.
The processor 12 may be a Central Processing Unit (CPU), controller, microcontroller, microprocessor or other data Processing chip in some embodiments, and is used for executing program codes stored in the memory 11 or Processing data, such as executing the computer program 01.
The input/output bus 13 may be a Peripheral Component Interconnect (PCI) bus, an Extended Industry Standard Architecture (EISA) bus, or the like. The bus may be divided into an address bus, a data bus, a control bus, etc.
Further, the computer apparatus may further include a wired or wireless network interface 14, and the network interface 14 may optionally include a wired interface and/or a wireless interface (such as a WI-FI interface, a bluetooth interface, etc.), which are generally used for establishing a communication connection between the computer apparatus 1 and other electronic devices.
Optionally, the computer device 1 may further include a user interface, the user interface may include a Display (Display), an input unit such as a Keyboard (Keyboard), and optionally, the user interface may further include a standard wired interface and a wireless interface. Alternatively, in some embodiments, the display may be an LED display, a liquid crystal display, a touch-sensitive liquid crystal display, an OLED (Organic Light-Emitting Diode) touch device, or the like. The display, which may also be referred to as a display screen or display unit, is suitable for displaying information processed in the computer device 1 and for displaying a visualized user interface.
Fig. 4 shows only the computer arrangement 1 with the components 11-14 and the computer program 01, it being understood by a person skilled in the art that the structure shown in fig. 4 does not constitute a limitation of the computer arrangement 1, but may comprise fewer or more components than shown, or a combination of certain components, or a different arrangement of components.
The present invention also provides a computer-readable storage medium having a computer program stored thereon, which when executed by a processor, performs the steps of:
receiving a text input by a user, performing word segmentation processing on the text by adopting a preset word segmentation component, and outputting a word segmentation sequence;
matching the word segmentation sequence with a preset service logic, and if a target service logic which is successfully matched exists, creating a target document corresponding to the target service logic;
and automatically filling the word segments in the word segmentation sequence into the item columns of the target document according to the service rule of the target service logic.
Optionally, as a possible implementation manner, the processor may be further configured to implement the following steps:
extracting the target word segment corresponding to the preset service logic, judging whether the word segmentation sequence contains the target word segment, and if so, judging that the word segmentation sequence is successfully matched with the target service logic corresponding to the target word segment.
Optionally, as a possible implementation manner, the processor may be further configured to implement the following steps:
extracting context features corresponding to the preset service logic, wherein the context features indicate the parts of speech of a plurality of word segments and the combination relation of those word segments in the context;
and judging whether the word segmentation sequence satisfies the context features, and if so, judging that the word segmentation sequence is successfully matched with the target service logic corresponding to the context features.
Optionally, as a possible implementation manner, the preset word segmentation component is a jieba word segmentation component, and the processor may be further configured to implement the following steps:
sending the text to the server where the jieba word segmentation component is located by using the Apache HttpClient client-side programming toolkit;
loading a custom dictionary corresponding to a preset service in the jieba word segmentation component;
and performing accurate word segmentation processing with part-of-speech tagging on the text by adopting a jieba word segmentation component.
Optionally, as a possible implementation manner, the processor may be further configured to implement the following steps:
and extracting preset fields in the text, and processing field values of the preset fields into a uniform format.
In the several embodiments provided in the present application, it should be understood that the disclosed system, apparatus and method may be implemented in other manners. For example, the above-described apparatus embodiments are merely illustrative, and for example, the division of the units is only one logical division, and other divisions may be realized in practice, for example, a plurality of units or components may be combined or integrated into another system, or some features may be omitted, or not executed. In addition, the shown or discussed mutual coupling or direct coupling or communication connection may be an indirect coupling or communication connection through some interfaces, devices or units, and may be in an electrical, mechanical or other form.
The units described as separate parts may or may not be physically separate, and parts displayed as units may or may not be physical units, may be located in one place, or may be distributed on a plurality of network units. Some or all of the units can be selected according to actual needs to achieve the purpose of the solution of the embodiment.
In addition, functional units in the embodiments of the present invention may be integrated into one processing unit, or each unit may exist alone physically, or two or more units are integrated into one unit. The integrated unit can be realized in a form of hardware, and can also be realized in a form of a software functional unit.
The integrated unit, if implemented in the form of a software functional unit and sold or used as a stand-alone product, may be stored in a computer readable storage medium. Based on such understanding, the technical solution of the present invention may be embodied in the form of a software product, which is stored in a storage medium and includes instructions for causing a computer device (which may be a personal computer, a server, or a network device) to execute all or part of the steps of the method according to the embodiments of the present invention. And the aforementioned storage medium includes: a U-disk, a removable hard disk, a Read-Only Memory (ROM), a Random Access Memory (RAM), a magnetic disk or an optical disk, and other various media capable of storing program codes.
The above-mentioned embodiments are only used for illustrating the technical solutions of the present invention, and not for limiting the same; although the present invention has been described in detail with reference to the foregoing embodiments, it will be understood by those of ordinary skill in the art that: the technical solutions described in the foregoing embodiments may still be modified, or some technical features may be equivalently replaced; and such modifications or substitutions do not depart from the spirit and scope of the corresponding technical solutions of the embodiments of the present invention.

Claims (10)

1. An intelligent filling method for documents is characterized by comprising the following steps:
receiving a text input by a user, performing word segmentation processing on the text by adopting a preset word segmentation component, and outputting a word segmentation sequence;
matching the word segmentation sequence with preset service logic, and if successfully matched target service logic exists, creating a target document corresponding to the target service logic;
and automatically filling the word segments in the word segmentation sequence into the item columns of the target document according to the service rule of the target service logic.
2. The method of claim 1, wherein matching the sequence of word segments to a pre-set business logic comprises:
extracting a target word segment corresponding to the preset service logic, judging whether the word segmentation sequence contains the target word segment, and if so, judging that the word segmentation sequence is successfully matched with the target service logic corresponding to the target word segment.
3. The method of claim 1, wherein matching the sequence of word segments to a pre-set business logic comprises:
extracting context features corresponding to the preset service logic, wherein the context features indicate the parts of speech of a plurality of word segments in the context and the combination relation of those word segments;
and judging whether the word segmentation sequence meets the context characteristics, if so, judging that the word segmentation sequence is successfully matched with the target service logic corresponding to the context characteristics.
4. The method according to any one of claims 1 to 3, wherein the performing word segmentation processing on the text by using a preset word segmentation component comprises:
and sending the acquired text data input by the user to a back-end server through a plug-in bound on the interactive interface, calling a word segmentation component by the back-end server to segment words, and then returning a word segmentation result.
5. The method according to claim 4, wherein the preset word segmentation component is a jieba word segmentation component, and the step of sending the acquired text data input by the user to a back-end server through a plug-in bound to an interactive interface, with the back-end server invoking the word segmentation component to perform word segmentation, comprises:
sending the text to the server where the jieba word segmentation component is located by using the Apache HttpClient client-side programming toolkit;
loading a custom dictionary corresponding to a preset service in the jieba word segmentation component;
and performing accurate word segmentation processing with part-of-speech tagging on the text by adopting a jieba word segmentation component.
6. The method according to any one of claims 1 to 3, wherein before the tokenizing the text with a preset tokenizing component, the method further comprises:
and extracting preset fields in the text, and processing the formats of the field values of the preset fields into a uniform format.
7. An intelligent document filling system, comprising:
the processing module is used for receiving a text input by a user, performing word segmentation processing on the text by adopting a preset word segmentation component and outputting a word segmentation sequence;
the matching module is used for matching the word segmentation sequence with preset service logic, and if successfully matched target service logic exists, creating a target document corresponding to the target service logic;
and the filling module is used for automatically filling the word segments in the word segmentation sequence into the item columns of the target document according to the service rule of the target service logic.
8. The system of claim 7, wherein the matching module comprises:
a matching unit, used for extracting a target word segment corresponding to the preset service logic, judging whether the word segmentation sequence contains the target word segment, and if so, judging that the word segmentation sequence is successfully matched with the target service logic corresponding to the target word segment.
9. A computer arrangement, characterized in that the computer arrangement comprises a processor for implementing the steps of the method according to any one of claims 1 to 6 when executing a computer program stored in a memory.
10. A computer-readable storage medium having stored thereon a computer program, characterized in that: the computer program, when being executed by a processor, realizes the steps of the method according to any one of claims 1 to 6.
CN202010744800.XA 2020-07-29 2020-07-29 Intelligent document filling method and system and related equipment Pending CN111914548A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202010744800.XA CN111914548A (en) 2020-07-29 2020-07-29 Intelligent document filling method and system and related equipment

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202010744800.XA CN111914548A (en) 2020-07-29 2020-07-29 Intelligent document filling method and system and related equipment

Publications (1)

Publication Number Publication Date
CN111914548A true CN111914548A (en) 2020-11-10

Family

ID=73286859

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202010744800.XA Pending CN111914548A (en) 2020-07-29 2020-07-29 Intelligent document filling method and system and related equipment

Country Status (1)

Country Link
CN (1) CN111914548A (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112835860A (en) * 2021-02-19 2021-05-25 腾讯科技(深圳)有限公司 Shared document processing method, device, equipment and computer readable storage medium
CN113407599A (en) * 2021-06-30 2021-09-17 上海万物新生环保科技集团有限公司 Text data based standardized processing method and equipment

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107066432A (en) * 2016-10-24 2017-08-18 禾盈软件科技有限公司 Method and its system that document is automatically generated
CN107274889A (en) * 2017-06-19 2017-10-20 北京紫博光彦信息技术有限公司 A kind of method and device according to speech production business paper
CN111062195A (en) * 2019-12-13 2020-04-24 深圳我家云网络科技有限公司 File processing method, file processing device, equipment and storage medium
CN111126029A (en) * 2019-12-31 2020-05-08 广州市昊链信息科技股份有限公司 Electronic document generation method and device, computer equipment and storage medium
CN111352987A (en) * 2020-02-28 2020-06-30 汤学民 Electronic medical record structuring method, system and related equipment

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
朱频频 等: "《智能客户服务技术与应用》", vol. 2019, 中国铁道出版社有限公司, pages: 132 *

Similar Documents

Publication Publication Date Title
US20150039707A1 (en) Document processing
WO2022134583A1 (en) Insurance data information generation method, apparatus, server, and storage medium
CN111914548A (en) Intelligent document filling method and system and related equipment
US10067631B2 (en) Communication terminal and method of providing unified interface to the same
CN110795697A (en) Logic expression obtaining method and device, storage medium and electronic device
CN111143505A (en) Document processing method, device, medium and electronic equipment
CN104484486A (en) Information recommending method and electronic device
CN112434501A (en) Work order intelligent generation method and device, electronic equipment and medium
CN110633331A (en) Method, system and related equipment for extracting data in relational database
CN109214362B (en) Document processing method and related equipment
CN109862399A (en) It shows the method for rich media information, handle method, computer installation and the computer readable storage medium of rich media information
CN112732893A (en) Text information extraction method and device, storage medium and electronic equipment
CN105554254A (en) Notification message processing method and device
CN102929845B (en) A kind of server and method for realizing network reading
CN111753210A (en) Resource pushing method and device, computer equipment and computer readable storage medium
CN109740130B (en) Method and device for generating file
CN116167881A (en) Automatic generation method, device, equipment and storage medium for payment account record
CN116069407A (en) Method and system for analyzing SWIFT message and automatically mapping SWIFT message to business transaction column
CN107909382A (en) User grouping management method, server and storage medium
CN109214846A (en) Information storage means and device
CN111368522A (en) Information generation method and device, computer equipment and computer readable storage medium
CN113344674A (en) Product recommendation method, device, equipment and storage medium based on user purchasing power
CN115114280A (en) Report generation processing method, device and equipment
CN105404671A (en) Implementation method and apparatus for mobile terminal electronic billing
CN112631675A (en) Workflow configuration method, device, equipment and computer readable storage medium

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination