CN111914548A - Intelligent document filling method and system and related equipment - Google Patents
Intelligent document filling method and system and related equipment Download PDFInfo
- Publication number
- CN111914548A CN111914548A CN202010744800.XA CN202010744800A CN111914548A CN 111914548 A CN111914548 A CN 111914548A CN 202010744800 A CN202010744800 A CN 202010744800A CN 111914548 A CN111914548 A CN 111914548A
- Authority
- CN
- China
- Prior art keywords
- word segmentation
- target
- preset
- text
- sequence
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 238000000034 method Methods 0.000 title claims abstract description 46
- 230000011218 segmentation Effects 0.000 claims abstract description 187
- 238000012545 processing Methods 0.000 claims abstract description 48
- 238000004590 computer program Methods 0.000 claims description 12
- 230000002452 interceptive effect Effects 0.000 claims description 10
- 230000008569 process Effects 0.000 description 9
- 238000004891 communication Methods 0.000 description 5
- 238000011161 development Methods 0.000 description 4
- 230000018109 developmental process Effects 0.000 description 4
- 238000010586 diagram Methods 0.000 description 4
- 230000008878 coupling Effects 0.000 description 3
- 238000010168 coupling process Methods 0.000 description 3
- 238000005859 coupling reaction Methods 0.000 description 3
- 238000012360 testing method Methods 0.000 description 3
- 230000008901 benefit Effects 0.000 description 2
- 238000000605 extraction Methods 0.000 description 2
- 230000006870 function Effects 0.000 description 2
- 239000004973 liquid crystal related substance Substances 0.000 description 2
- 230000003287 optical effect Effects 0.000 description 2
- 238000007781 pre-processing Methods 0.000 description 2
- 238000013515 script Methods 0.000 description 2
- 230000005540 biological transmission Effects 0.000 description 1
- 238000001914 filtration Methods 0.000 description 1
- 230000000977 initiatory effect Effects 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 230000002093 peripheral effect Effects 0.000 description 1
- 238000006467 substitution reaction Methods 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/20—Natural language analysis
- G06F40/279—Recognition of textual entities
- G06F40/284—Lexical analysis, e.g. tokenisation or collocates
Abstract
The embodiment of the invention provides a method, a system and related equipment for intelligently filling a document, which are used for realizing automatic filling of the document and improving the document filling efficiency. The method provided by the embodiment of the invention comprises the following steps: receiving a text input by a user, performing word segmentation processing on the text by adopting a preset word segmentation component, and outputting a word segmentation sequence; matching the word segmentation sequence with preset service logic, and if the successfully matched target service logic exists, creating a target receipt corresponding to the target service logic; and automatically filling the participles in the participle sequence into the item column of the target document according to the service rule of the target service logic.
Description
Technical Field
The invention relates to the technical field of intelligent filling of documents, in particular to an intelligent filling method, an intelligent filling system and related equipment of documents.
Background
The enterprise management cloud platform is an enterprise management software cloud platform leading the development trend of the IT industry in China, not only meets the increasing information management requirements of medium and small enterprises, but also provides efficient, safe and stable one-stop professional software application service for large group enterprises containing branch companies.
When a document is opened and relevant data is filled in, an existing enterprise management cloud platform needs to manually find a corresponding application into an application, find a corresponding menu item and click to open the document, and after the document is opened, items to be filled in need to be clicked one by one to obtain an input focus and manually fill in various information. For example, a business travel ticket may need to fill in items such as data of collaborators, business categories, application reasons, etc., click on the items to be filled one by one to obtain an input focus, and manually fill in information of each item in a project bar.
In the existing enterprise management cloud platform, the items to be filled in need to be manually clicked one by one to obtain an input focus, and all items of information are manually filled in, so that time and labor are wasted, and the efficiency is low.
Disclosure of Invention
The embodiment of the invention provides a method, a system and related equipment for intelligently filling a document, which are used for realizing automatic filling of the document and improving the document filling efficiency.
The first aspect of the embodiment of the invention provides an intelligent filling method for a receipt, which can comprise the following steps:
receiving a text input by a user, performing word segmentation processing on the text by adopting a preset word segmentation component, and outputting a word segmentation sequence;
matching the word segmentation sequence with preset service logic, and if the successfully matched target service logic exists, creating a target receipt corresponding to the target service logic;
and automatically filling the participles in the participle sequence into the item column of the target document according to the service rule of the target service logic.
Optionally, as a possible implementation manner, in the method for intelligently filling a document in the embodiment of the present invention, matching the word segmentation sequence with a preset service logic may include:
extracting a target word segmentation corresponding to preset service logic, judging whether the word segmentation sequence contains the target word segmentation, and if so, judging that the word segmentation sequence is successfully matched with the target service logic corresponding to the target word segmentation.
Optionally, as a possible implementation manner, in the method for intelligently filling a document in the embodiment of the present invention, matching the word segmentation sequence with a preset service logic may include:
extracting context characteristics corresponding to preset business logic, wherein the context characteristics are used for indicating the parts of speech of a plurality of participles in the context and the combination relation of the participles;
and judging whether the word segmentation sequence meets the context characteristics, if so, judging that the word segmentation sequence is successfully matched with the target service logic corresponding to the context characteristics.
Optionally, as a possible implementation manner, in the embodiment of the present invention, the performing word segmentation processing on the text by using a preset word segmentation component may include:
and sending the acquired text data input by the user to a back-end server through a plug-in bound on the interactive interface, calling a word segmentation component by the back-end server to segment words, and then returning a word segmentation result.
Optionally, as a possible implementation manner, in the method for intelligently filling a document in the embodiment of the present invention, the preset word segmentation component is a jieba word segmentation component, the obtained text data input by the user is sent to a back-end server through a plug-in bound to the interactive interface, and the back-end server calls the word segmentation component to perform word segmentation, which may include:
the text is sent to a server where a jieba word segmentation component is located by adopting an Apache Httpclient client programming tool kit;
loading a custom dictionary corresponding to a preset service in the jieba word segmentation component;
and performing accurate word segmentation processing with part-of-speech tagging on the text by adopting a jieba word segmentation component.
Optionally, as a possible implementation manner, the intelligent document filling method in the embodiment of the present invention may further include:
and extracting preset fields in the text, and processing the formats of the field values of the preset fields into a uniform format.
A second aspect of an embodiment of the present invention provides an intelligent document filling system, which may include:
the processing module is used for receiving a text input by a user, performing word segmentation processing on the text by adopting a preset word segmentation component and outputting a word segmentation sequence;
the matching module is used for matching the word segmentation sequence with preset service logic, and if the target service logic which is successfully matched exists, a target receipt corresponding to the target service logic is created;
and the filling module is used for automatically filling the participles in the participle sequence into the item column of the target document according to the service rule of the target service logic.
Optionally, as a possible implementation manner, in an embodiment of the present invention, the matching module may include:
and the matching unit is used for extracting a target word segmentation corresponding to preset service logic, judging whether the word segmentation sequence contains the target word segmentation, and if so, judging that the word segmentation sequence is successfully matched with the target service logic corresponding to the target word segmentation.
Optionally, as a possible implementation manner, in the embodiment of the present invention, the matching module may include:
the extraction unit is used for extracting context characteristics corresponding to preset business logic, and the context characteristics are used for indicating the parts of speech of a plurality of participles in the context and the combination relation of the participles;
and the judging unit is used for judging whether the word segmentation sequence meets the context characteristics, and if so, judging that the word segmentation sequence is successfully matched with the target service logic corresponding to the context characteristics.
Optionally, as a possible implementation manner, in an embodiment of the present invention, the processing module may include:
and the processing unit sends the acquired text data input by the user to a back-end server through a plug-in bound on the interactive interface, and the back-end server calls the word segmentation component to segment words and then returns word segmentation results.
Optionally, as a possible implementation manner, in an embodiment of the present invention, the preset word segmentation component is a jieba word segmentation component, and the processing unit may include:
the sending subunit sends the text to a server where the jieba word segmentation component is located by adopting an Apache Httpclient client programming tool package;
the loading subunit loads a custom dictionary corresponding to the preset service in the jieba word segmentation component;
and the word segmentation subunit is used for performing accurate word segmentation processing with part-of-speech tagging on the text by adopting a jieba word segmentation component.
Optionally, as a possible implementation manner, the intelligent document filling system in the embodiment of the present invention may further include:
and the preprocessing module is used for extracting preset fields in the text and processing the formats of the field values of the preset fields into a uniform format.
A third aspect of embodiments of the present invention provides a computer apparatus, which includes a processor, and the processor is configured to implement the steps in any one of the possible implementation manners of the first aspect and the first aspect when executing a computer program stored in a memory.
A fourth aspect of the embodiments of the present invention provides a computer-readable storage medium, on which a computer program is stored, where the computer program, when executed by a processor, implements the steps in any one of the possible implementations of the first aspect and the first aspect.
According to the technical scheme, the embodiment of the invention has the following advantages:
in the embodiment of the invention, the document intelligent filling system can receive a text input by a user, perform word segmentation processing on the text by adopting a preset word segmentation component, output a word segmentation sequence, match the word segmentation sequence with preset business logic, create a target document corresponding to the target business logic if the target business logic which is successfully matched exists, and automatically fill the words in the word segmentation sequence into a project column of the target document according to the business rules of the target business logic. Compared with the prior art, the user does not need to click item columns in the document one by one for inputting, the user operation time is saved, and the document filling efficiency is improved.
Drawings
FIG. 1 is a schematic diagram of an embodiment of an intelligent document filling method according to an embodiment of the present invention;
FIG. 2 is a schematic diagram of an embodiment of the present invention, in which a document intelligent filling method is applied;
FIG. 3 is a schematic diagram of an embodiment of an intelligent document filling system according to an embodiment of the present invention;
FIG. 4 is a diagram of a computer device according to an embodiment of the present invention.
Detailed Description
The embodiment of the invention provides a method, a system and related equipment for intelligently filling a document, which are used for realizing automatic filling of the document and improving the document filling efficiency.
In order to make the technical solutions of the present invention better understood, the technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are only a part of the embodiments of the present invention, and not all of the embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.
The terms "first," "second," "third," "fourth," and the like in the description and in the claims, as well as in the drawings, are used for distinguishing between similar elements and not necessarily for describing a particular sequential or chronological order. It will be appreciated that the data so used may be interchanged under appropriate circumstances such that the embodiments described herein may be practiced otherwise than as specifically illustrated or described herein. Furthermore, the terms "comprises," "comprising," and "having," and any variations thereof, are intended to cover a non-exclusive inclusion, such that a process, method, system, article, or apparatus that comprises a list of steps or elements is not necessarily limited to those steps or elements expressly listed, but may include other steps or elements not expressly listed or inherent to such process, method, article, or apparatus.
For convenience of understanding, a specific flow in the embodiment of the present invention is described below, and referring to fig. 1, an embodiment of an intelligent document filling method in the embodiment of the present invention may include:
s101, receiving a text input by a user, performing word segmentation processing on the text by adopting a preset word segmentation component, and outputting a word segmentation sequence;
in the existing enterprise management cloud platform, after a document is opened, items required to be filled in need to be clicked one by one to obtain an input focus, and various information needs to be manually filled in.
In order to improve the bill filling efficiency, the intelligent bill filling system in the embodiment of the invention can be provided with an interactive interface for receiving the text input by the user. The intelligent document filling system can adopt a preset word segmentation component to perform word segmentation processing on the received text and output a word segmentation sequence, wherein the word segmentation sequence can contain the part of speech of each word segmentation. And finally, the intelligent document filling system can realize automatic document filling based on the word segmentation sequence.
S102, matching the word segmentation sequence with a preset service logic, and if a target service logic which is successfully matched exists, creating a target document corresponding to the target service logic;
in order to realize automatic filling of documents, business logics corresponding to various types of documents need to be preset, then the obtained word segmentation sequence is matched with the preset business logics, and if a target business logic which is successfully matched exists, a target document corresponding to the target business logic is created.
Optionally, as a possible implementation manner, the specific process of matching the word segmentation sequence with the preset service logic may include: and extracting target participles corresponding to preset service logics, judging whether the participle sequence contains the target participles, and if so, judging that the participle sequence is successfully matched with the target service logics corresponding to the target participles.
For example, target participles such as "travel", "business trip", and "going out" may be set in the travel request form. When the text input by the user generates a travel application form with the communication between No. 3 and No. 11, Zhang III and Li IV from the airplane sitting in the Guangdong Fushan in China to Beijing for more than three months, the service logic of successful matching can be determined to be the travel application form service according to the word segmentation sequence containing the travel.
Optionally, as a possible implementation manner, the specific process of matching the word segmentation sequence with the preset service logic may include: extracting context characteristics corresponding to preset business logic, wherein the context characteristics are used for indicating the parts of speech of a plurality of participles and the combination relation of the plurality of participles in the context; and judging whether the word segmentation sequence meets the context characteristics, if so, judging that the target service logic corresponding to the word segmentation sequence and the context characteristics is successfully matched.
For example, for the application form of the foreign country, the context characteristic can be set as that the part of speech containing one or more participles is a place name, and the contained place name exists in a preset foreign place name white list. When the text "No. 3 to No. 11, zhang san and lie si input by the user together goes from the korean kettle mountain to the application form communicated in other places for more than three months in new york, the service logic of successful matching can be determined to be the international application form service according to the word segmentation sequence including" korean kettle mountain ".
It is understood that the service logic of the international application form service and the travel application form service in the above examples is only exemplary, and the service logic may be reasonably set according to the specific requirements of the user in practical application, and is not limited herein.
S103, automatically filling the participles in the participle sequence into the item column of the target document according to the business rule of the target business logic.
After the correct business logic is matched, the document intelligent filling system can automatically fill the participles in the participle sequence into the item column of the target document according to the business rules of the target business logic. For example, when a text input by a user is "generate a travel request form with a distance of 3 months to 11 months, three months and four months communicating with each other from an airplane sitting on the Guangdong Fushan in China to Beijing for more than three months", after the matched travel request form is matched, the participles such as departure place "Fushan", destination "Beijing", departure time "3 months to 5 months" and deadline "3 months to 11 months" in the participle sequence can be extracted according to the business rules of the travel request form, and automatically filled into the corresponding project columns of the travel request form to generate a complete travel request form.
In the embodiment of the invention, the document intelligent filling system can receive a text input by a user, perform word segmentation processing on the text by adopting a preset word segmentation component, output a word segmentation sequence, match the word segmentation sequence with preset business logic, create a target document corresponding to the target business logic if the target business logic which is successfully matched exists, and automatically fill the words in the word segmentation sequence into a project column of the target document according to the business rules of the target business logic. Compared with the prior art, the user does not need to click item columns in the document one by one for inputting, the user operation time is saved, and the document filling efficiency is improved.
The word segmentation components in practical application can be Yaha word segmentation components, Qinghua THULAC components and jieba word segmentation components. Optionally, as a possible implementation manner, when the preset word segmentation component is a jieba word segmentation component, performing word segmentation processing on the text by using the preset word segmentation component may include: adopting an Apache http client programming toolkit (or common http tool) to send the text to a server where the jieba word segmentation component is located; loading a custom dictionary corresponding to a preset service in the jieba word segmentation component; and performing accurate word segmentation processing with part-of-speech tagging on the text by adopting a jieba word segmentation component.
In practical application, the word segmentation sequence obtained by directly segmenting words of the text with the date cannot judge the date set in the text. In view of this, it is necessary to process the date format in the text. Optionally, as a possible implementation manner, on the basis of the foregoing embodiment, before performing word segmentation processing on the text by using a preset word segmentation component, a preset field in the text may be extracted, and a format of a field value of the preset field may be processed into a uniform format, for example, when the preset field is a date, different date formats may be processed into the uniform format.
For example, the text: "i want to live from tomorrow to 28", if the text is not subjected to date processing, the complete date set in the text cannot be judged. Scripts with different date formats can be called, and different date formats in the text before word segmentation are processed into a uniform format, so that the date data with the uniform format can be conveniently obtained by subsequent services. For example, the test date is 26/4/2020, and the corresponding date data in the unified format after the text "i want to live from tomorrow to number 28" processing includes: "2020-04-27" and "2020-04-28".
For convenience of understanding, the intelligent document filling method in the embodiment of the present invention will be described below with reference to specific application embodiments.
The intelligent document filling method in the embodiment is realized based on a server of a Python Django framework, a jieba word segmentation component, a cloud platform plug-in and an Apache Httpclient. The method comprises the steps that Apache Http is used in a plug-in bound with an interactive interface to send an acquired text data input by a user to a back-end service built by a Python Django framework in an Http mode, the server calls a jieba word segmentation component to perform word segmentation, then word segmentation results are returned, the word segmentation results are matched with preset service logic through the parts of speech of different words of the word segmentation results and the part of speech logic of a plurality of words, and the word segmentation results are processed by adopting the service logic which is successfully matched to generate different documents. Compared with the URLConnection of the traditional JDK, the Apache HttpClient increases the usability and flexibility, not only is the HTTP request sent by the client side easy, but also the development personnel can conveniently test the interface (based on the HTTP protocol), the development efficiency is improved, and the code robustness is also conveniently improved.
Referring to fig. 2, the method specifically includes the following steps:
a) acquiring a text: in a development platform, a PC terminal interactive page is created, a control of a text field type is added in the interactive page, the control is used for receiving a text to be participled input by a user, then a preset plug-in is bound, and input text data is obtained through a function method in the plug-in.
b) Text transmission and reception: and initiating an Http request in a plug-in of the cloud platform through the Apache Http client to send the acquired text to a back-end service built by a Python Django framework.
c) Word segmentation processing of the text: introducing a jieba word segmentation component into the back-end service of the server, and adding a self-defined dictionary into the jieba word segmentation component to identify the special words of different service scenes. And finally, performing a precise word segmentation mode with part-of-speech tagging on the text, and returning a word segmentation result in a json format.
d) And (3) generating a document: after a plug-in of the cloud platform initiates a request to send a text by using Apache http, json data of the participle returned from the server is received, and then the type of the document is judged according to a specific vocabulary in the returned json data. For example, after the travel application document is generated according to a specific vocabulary, a travel application form with travel information can be generated according to whether a word with a part of speech as a place name exists in a returned result and whether the domestic travel reimbursement standard in the travel application form is matched according to the place name or not. Of course, the text may also contain other information, such as business trip date, collaborators, etc., which can be determined according to the parts of speech of the date, the parts of speech of the names, etc., and the data is automatically filled in the business trip application form.
Illustratively, the example 1 text is entered: and generating a travel application form of communication between No. 3 and No. 5 and No. 3 and No. 11 with Zhang III and Li IV from the airplane sitting in the Guangdong Fushan to Beijing for more than three months. And generating a travel application bill, and automatically assigning values to the travel application bill according to key data such as a sponsor, a travel category, a departure place, a destination, a departure date, an ending date and the like processed by the service codes after word segmentation.
In addition, different document types can be generated according to various business logic processes such as the parts of speech of different words in the word segmentation result, the front and back collocation among different parts of speech, the part of speech of self-defined words in a special business scene and the like.
Illustratively, the example 2 text is entered: no. 3 and No. 5 to No. 3 and No. 11, together with Zhang III and Li IV, take the advantage of the airplane from Korea kettle mountain to communicate with the application form in remote places for more than three months in New York. (the text does not indicate which document is generated, but filters out words which are place names according to the part of speech after word segmentation and then judges that the documents are foreign place names according to the place names, matches whether the place names exist according to the reimbursement standard of the foreign countries (frontiers) or not according to the foreign place names, and identifies the application form of the foreign countries (frontiers) if the place names exist). After word segmentation, the value is automatically assigned on the application document of the country of interest according to key data such as the collaborator, the business trip category, the departure place, the destination, the departure date, the ending date and the like processed by the service code.
The word segmentation component jieba in the invention can not identify the word segmentation to different date formats well. For example, the text: i want to live from tomorrow to No. 28, if the text is not subjected to date processing, the direct word segmentation cannot judge the complete date set in the text. The scripts capable of filtering and identifying different date formats can be called to process different date formats in the text before word segmentation into a uniform format, so that the date data in the uniform format can be conveniently obtained by subsequent services. For example, the test date is 26/4/2020, and the corresponding date data in the unified format after the text "i want to live from tomorrow to number 28" processing includes: "2020-04-27" and "2020-04-28".
Referring to fig. 3, an embodiment of the present invention further provides an intelligent document filling system, which includes:
the processing module 301 is configured to receive a text input by a user, perform word segmentation processing on the text by using a preset word segmentation component, and output a word segmentation sequence;
the matching module 302 is configured to match the word segmentation sequence with a preset service logic, and if a successfully matched target service logic exists, create a target document corresponding to the target service logic;
and the filling module 303 is configured to automatically fill the participles in the participle sequence into the item column of the target document according to the service rule of the target service logic.
Optionally, as a possible implementation manner, in the embodiment of the present invention, the matching module may include:
and the matching unit is used for extracting the target word segmentation corresponding to the preset service logic, judging whether the word segmentation sequence contains the target word segmentation, and if so, judging that the word segmentation sequence is successfully matched with the target service logic corresponding to the target word segmentation.
Optionally, as a possible implementation manner, in the embodiment of the present invention, the matching module may include:
the extraction unit is used for extracting context characteristics corresponding to preset business logic, and the context characteristics are used for indicating the parts of speech of a plurality of participles and the combination relation of the plurality of participles in the context;
and the judging unit is used for judging whether the word segmentation sequence meets the context characteristics, and if so, judging that the word segmentation sequence is successfully matched with the target service logic corresponding to the context characteristics.
Optionally, as a possible implementation manner, in an embodiment of the present invention, the processing module may include:
and the processing unit sends the acquired text data input by the user to a back-end server through a plug-in bound on the interactive interface, and the back-end server calls the word segmentation component to segment words and then returns word segmentation results.
Optionally, as a possible implementation manner, in an embodiment of the present invention, the preset word segmentation component is a jieba word segmentation component, and the processing unit may include:
the sending subunit sends the text to a server where the jieba word segmentation component is located by adopting an Apache Httpclient client programming tool package;
the loading subunit loads a custom dictionary corresponding to the preset service in the jieba word segmentation component;
and the word segmentation subunit adopts a jieba word segmentation component to perform accurate word segmentation processing with part-of-speech tagging on the text.
Optionally, as a possible implementation manner, the intelligent document filling system in the embodiment of the present invention may further include:
and the preprocessing module is used for extracting preset fields in the text and processing the formats of the field values of the preset fields into a uniform format.
It is clear to those skilled in the art that, for convenience and brevity of description, the specific working processes of the above-described systems, apparatuses and units may refer to the corresponding processes in the foregoing method embodiments, and are not described herein again.
In the above description of the intelligent document filling system in the embodiment of the present invention from the perspective of the modular functional entity, please refer to fig. 4, and the following description describes the computer apparatus in the embodiment of the present invention from the perspective of hardware processing:
the computer device 1 may include a memory 11, a processor 12 and an input output bus 13. The processor 11, when executing the computer program, implements the steps in the above-described intelligent document filling method embodiment shown in fig. 1, such as steps 101 to 103 shown in fig. 1. Alternatively, the processor, when executing the computer program, implements the functions of each module or unit in the above-described device embodiments.
In some embodiments of the present invention, the processor is specifically configured to implement the following steps:
receiving a text input by a user, performing word segmentation processing on the text by adopting a preset word segmentation component, and outputting a word segmentation sequence;
matching the word segmentation sequence with a preset service logic, and if a target service logic which is successfully matched exists, creating a target document corresponding to the target service logic;
and automatically filling the participles in the participle sequence into the item column of the target document according to the service rule of the target service logic.
Optionally, as a possible implementation manner, the processor may be further configured to implement the following steps:
and extracting target participles corresponding to preset service logics, judging whether the participle sequence contains the target participles, and if so, judging that the participle sequence is successfully matched with the target service logics corresponding to the target participles.
Optionally, as a possible implementation manner, the processor may be further configured to implement the following steps:
extracting context characteristics corresponding to preset business logic, wherein the context characteristics are used for indicating the parts of speech of a plurality of participles and the combination relation of the plurality of participles in the context;
and judging whether the word segmentation sequence meets the context characteristics, if so, judging that the target service logic corresponding to the word segmentation sequence and the context characteristics is successfully matched.
Optionally, as a possible implementation manner, the preset word segmentation component is a jieba word segmentation component, and the processor may be further configured to implement the following steps:
the method comprises the steps that an Apache Httpclient client programming tool kit is adopted to send a text to a server where a jieba word segmentation component is located;
loading a custom dictionary corresponding to a preset service in the jieba word segmentation component;
and performing accurate word segmentation processing with part-of-speech tagging on the text by adopting a jieba word segmentation component.
Optionally, as a possible implementation manner, the processor may be further configured to implement the following steps:
and extracting preset fields in the text, and processing the formats of the field values of the preset fields into a uniform format.
The memory 11 includes at least one type of readable storage medium, and the readable storage medium includes a flash memory, a hard disk, a multimedia card, a card type memory (e.g., SD or DX memory, etc.), a magnetic memory, a magnetic disk, an optical disk, and the like. The memory 11 may in some embodiments be an internal storage unit of the computer device 1, for example a hard disk of the computer device 1. The memory 11 may also be an external storage device of the computer apparatus 1 in other embodiments, such as a plug-in hard disk provided on the computer apparatus 1, a Smart Media Card (SMC), a Secure Digital (SD) Card, a Flash memory Card (Flash Card), and the like. Further, the memory 11 may also include both an internal storage unit and an external storage device of the computer apparatus 1. The memory 11 may be used not only to store application software installed in the computer apparatus 1 and various types of data, such as codes of the computer program 01, but also to temporarily store data that has been output or is to be output.
The processor 12 may be a Central Processing Unit (CPU), controller, microcontroller, microprocessor or other data Processing chip in some embodiments, and is used for executing program codes stored in the memory 11 or Processing data, such as executing the computer program 01.
The input/output bus 13 may be a Peripheral Component Interconnect (PCI) bus, an Extended Industry Standard Architecture (EISA) bus, or the like. The bus may be divided into an address bus, a data bus, a control bus, etc.
Further, the computer apparatus may further include a wired or wireless network interface 14, and the network interface 14 may optionally include a wired interface and/or a wireless interface (such as a WI-FI interface, a bluetooth interface, etc.), which are generally used for establishing a communication connection between the computer apparatus 1 and other electronic devices.
Optionally, the computer device 1 may further include a user interface, the user interface may include a Display (Display), an input unit such as a Keyboard (Keyboard), and optionally, the user interface may further include a standard wired interface and a wireless interface. Alternatively, in some embodiments, the display may be an LED display, a liquid crystal display, a touch-sensitive liquid crystal display, an OLED (Organic Light-Emitting Diode) touch device, or the like. The display, which may also be referred to as a display screen or display unit, is suitable for displaying information processed in the computer device 1 and for displaying a visualized user interface.
Fig. 4 shows only the computer arrangement 1 with the components 11-14 and the computer program 01, it being understood by a person skilled in the art that the structure shown in fig. 4 does not constitute a limitation of the computer arrangement 1, but may comprise fewer or more components than shown, or a combination of certain components, or a different arrangement of components.
The present invention also provides a computer-readable storage medium having a computer program stored thereon, which when executed by a processor, performs the steps of:
receiving a text input by a user, performing word segmentation processing on the text by adopting a preset word segmentation component, and outputting a word segmentation sequence;
matching the word segmentation sequence with a preset service logic, and if a target service logic which is successfully matched exists, creating a target document corresponding to the target service logic;
and automatically filling the participles in the participle sequence into the item column of the target document according to the service rule of the target service logic.
Optionally, as a possible implementation manner, the processor may be further configured to implement the following steps:
and extracting target participles corresponding to preset service logics, judging whether the participle sequence contains the target participles, and if so, judging that the participle sequence is successfully matched with the target service logics corresponding to the target participles.
Optionally, as a possible implementation manner, the processor may be further configured to implement the following steps:
extracting context characteristics corresponding to preset business logic, wherein the context characteristics are used for indicating the parts of speech of a plurality of participles and the combination relation of the plurality of participles in the context;
and judging whether the word segmentation sequence meets the context characteristics, if so, judging that the target service logic corresponding to the word segmentation sequence and the context characteristics is successfully matched.
Optionally, as a possible implementation manner, the preset word segmentation component is a jieba word segmentation component, and the processor may be further configured to implement the following steps:
the method comprises the steps that an Apache Httpclient client programming tool kit is adopted to send a text to a server where a jieba word segmentation component is located;
loading a custom dictionary corresponding to a preset service in the jieba word segmentation component;
and performing accurate word segmentation processing with part-of-speech tagging on the text by adopting a jieba word segmentation component.
Optionally, as a possible implementation manner, the processor may be further configured to implement the following steps:
and extracting preset fields in the text, and processing field values of the preset fields into a uniform format.
In the several embodiments provided in the present application, it should be understood that the disclosed system, apparatus and method may be implemented in other manners. For example, the above-described apparatus embodiments are merely illustrative, and for example, the division of the units is only one logical division, and other divisions may be realized in practice, for example, a plurality of units or components may be combined or integrated into another system, or some features may be omitted, or not executed. In addition, the shown or discussed mutual coupling or direct coupling or communication connection may be an indirect coupling or communication connection through some interfaces, devices or units, and may be in an electrical, mechanical or other form.
The units described as separate parts may or may not be physically separate, and parts displayed as units may or may not be physical units, may be located in one place, or may be distributed on a plurality of network units. Some or all of the units can be selected according to actual needs to achieve the purpose of the solution of the embodiment.
In addition, functional units in the embodiments of the present invention may be integrated into one processing unit, or each unit may exist alone physically, or two or more units are integrated into one unit. The integrated unit can be realized in a form of hardware, and can also be realized in a form of a software functional unit.
The integrated unit, if implemented in the form of a software functional unit and sold or used as a stand-alone product, may be stored in a computer readable storage medium. Based on such understanding, the technical solution of the present invention may be embodied in the form of a software product, which is stored in a storage medium and includes instructions for causing a computer device (which may be a personal computer, a server, or a network device) to execute all or part of the steps of the method according to the embodiments of the present invention. And the aforementioned storage medium includes: a U-disk, a removable hard disk, a Read-Only Memory (ROM), a Random Access Memory (RAM), a magnetic disk or an optical disk, and other various media capable of storing program codes.
The above-mentioned embodiments are only used for illustrating the technical solutions of the present invention, and not for limiting the same; although the present invention has been described in detail with reference to the foregoing embodiments, it will be understood by those of ordinary skill in the art that: the technical solutions described in the foregoing embodiments may still be modified, or some technical features may be equivalently replaced; and such modifications or substitutions do not depart from the spirit and scope of the corresponding technical solutions of the embodiments of the present invention.
Claims (10)
1. An intelligent filling method for documents is characterized by comprising the following steps:
receiving a text input by a user, performing word segmentation processing on the text by adopting a preset word segmentation component, and outputting a word segmentation sequence;
matching the word segmentation sequence with preset service logic, and if the successfully matched target service logic exists, creating a target receipt corresponding to the target service logic;
and automatically filling the participles in the participle sequence into the item column of the target document according to the service rule of the target service logic.
2. The method of claim 1, wherein matching the sequence of word segments to a pre-set business logic comprises:
extracting a target word segmentation corresponding to preset service logic, judging whether the word segmentation sequence contains the target word segmentation, and if so, judging that the word segmentation sequence is successfully matched with the target service logic corresponding to the target word segmentation.
3. The method of claim 1, wherein matching the sequence of word segments to a pre-set business logic comprises:
extracting context characteristics corresponding to preset business logic, wherein the context characteristics are used for indicating the parts of speech of a plurality of participles in the context and the combination relation of the participles;
and judging whether the word segmentation sequence meets the context characteristics, if so, judging that the word segmentation sequence is successfully matched with the target service logic corresponding to the context characteristics.
4. The method according to any one of claims 1 to 3, wherein the performing word segmentation processing on the text by using a preset word segmentation component comprises:
and sending the acquired text data input by the user to a back-end server through a plug-in bound on the interactive interface, calling a word segmentation component by the back-end server to segment words, and then returning a word segmentation result.
5. The method according to any one of claims 4, wherein the preset word segmentation component is a jieba word segmentation component, and the step of sending the acquired text data input by the user to a back-end server through a plug-in bound to an interactive interface, and the back-end server invoking the word segmentation component to perform word segmentation comprises:
the text is sent to a server where a jieba word segmentation component is located by adopting an Apache Httpclient client programming tool kit;
loading a custom dictionary corresponding to a preset service in the jieba word segmentation component;
and performing accurate word segmentation processing with part-of-speech tagging on the text by adopting a jieba word segmentation component.
6. The method according to any one of claims 1 to 3, wherein before the tokenizing the text with a preset tokenizing component, the method further comprises:
and extracting preset fields in the text, and processing the formats of the field values of the preset fields into a uniform format.
7. An intelligent document filling system, comprising:
the processing module is used for receiving a text input by a user, performing word segmentation processing on the text by adopting a preset word segmentation component and outputting a word segmentation sequence;
the matching module is used for matching the word segmentation sequence with preset service logic, and if the target service logic which is successfully matched exists, a target receipt corresponding to the target service logic is created;
and the filling module is used for automatically filling the participles in the participle sequence into the item column of the target document according to the service rule of the target service logic.
8. The system of claim 7, wherein the matching module comprises:
and the matching unit is used for extracting a target word segmentation corresponding to preset service logic, judging whether the word segmentation sequence contains the target word segmentation, and if so, judging that the word segmentation sequence is successfully matched with the target service logic corresponding to the target word segmentation.
9. A computer arrangement, characterized in that the computer arrangement comprises a processor for implementing the steps of the method according to any one of claims 1 to 6 when executing a computer program stored in a memory.
10. A computer-readable storage medium having stored thereon a computer program, characterized in that: the computer program, when being executed by a processor, realizes the steps of the method according to any one of claims 1 to 6.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202010744800.XA CN111914548A (en) | 2020-07-29 | 2020-07-29 | Intelligent document filling method and system and related equipment |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202010744800.XA CN111914548A (en) | 2020-07-29 | 2020-07-29 | Intelligent document filling method and system and related equipment |
Publications (1)
Publication Number | Publication Date |
---|---|
CN111914548A true CN111914548A (en) | 2020-11-10 |
Family
ID=73286859
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202010744800.XA Pending CN111914548A (en) | 2020-07-29 | 2020-07-29 | Intelligent document filling method and system and related equipment |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN111914548A (en) |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN112835860A (en) * | 2021-02-19 | 2021-05-25 | 腾讯科技(深圳)有限公司 | Shared document processing method, device, equipment and computer readable storage medium |
CN113407599A (en) * | 2021-06-30 | 2021-09-17 | 上海万物新生环保科技集团有限公司 | Text data based standardized processing method and equipment |
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN107066432A (en) * | 2016-10-24 | 2017-08-18 | 禾盈软件科技有限公司 | Method and its system that document is automatically generated |
CN107274889A (en) * | 2017-06-19 | 2017-10-20 | 北京紫博光彦信息技术有限公司 | A kind of method and device according to speech production business paper |
CN111062195A (en) * | 2019-12-13 | 2020-04-24 | 深圳我家云网络科技有限公司 | File processing method, file processing device, equipment and storage medium |
CN111126029A (en) * | 2019-12-31 | 2020-05-08 | 广州市昊链信息科技股份有限公司 | Electronic document generation method and device, computer equipment and storage medium |
CN111352987A (en) * | 2020-02-28 | 2020-06-30 | 汤学民 | Electronic medical record structuring method, system and related equipment |
-
2020
- 2020-07-29 CN CN202010744800.XA patent/CN111914548A/en active Pending
Patent Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN107066432A (en) * | 2016-10-24 | 2017-08-18 | 禾盈软件科技有限公司 | Method and its system that document is automatically generated |
CN107274889A (en) * | 2017-06-19 | 2017-10-20 | 北京紫博光彦信息技术有限公司 | A kind of method and device according to speech production business paper |
CN111062195A (en) * | 2019-12-13 | 2020-04-24 | 深圳我家云网络科技有限公司 | File processing method, file processing device, equipment and storage medium |
CN111126029A (en) * | 2019-12-31 | 2020-05-08 | 广州市昊链信息科技股份有限公司 | Electronic document generation method and device, computer equipment and storage medium |
CN111352987A (en) * | 2020-02-28 | 2020-06-30 | 汤学民 | Electronic medical record structuring method, system and related equipment |
Non-Patent Citations (1)
Title |
---|
朱频频 等: "《智能客户服务技术与应用》", vol. 2019, 中国铁道出版社有限公司, pages: 132 * |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN112835860A (en) * | 2021-02-19 | 2021-05-25 | 腾讯科技(深圳)有限公司 | Shared document processing method, device, equipment and computer readable storage medium |
CN113407599A (en) * | 2021-06-30 | 2021-09-17 | 上海万物新生环保科技集团有限公司 | Text data based standardized processing method and equipment |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US20150039707A1 (en) | Document processing | |
WO2022134583A1 (en) | Insurance data information generation method, apparatus, server, and storage medium | |
CN111914548A (en) | Intelligent document filling method and system and related equipment | |
US10067631B2 (en) | Communication terminal and method of providing unified interface to the same | |
CN110795697A (en) | Logic expression obtaining method and device, storage medium and electronic device | |
CN111143505A (en) | Document processing method, device, medium and electronic equipment | |
CN104484486A (en) | Information recommending method and electronic device | |
CN112434501A (en) | Work order intelligent generation method and device, electronic equipment and medium | |
CN110633331A (en) | Method, system and related equipment for extracting data in relational database | |
CN109214362B (en) | Document processing method and related equipment | |
CN109862399A (en) | It shows the method for rich media information, handle method, computer installation and the computer readable storage medium of rich media information | |
CN112732893A (en) | Text information extraction method and device, storage medium and electronic equipment | |
CN105554254A (en) | Notification message processing method and device | |
CN102929845B (en) | A kind of server and method for realizing network reading | |
CN111753210A (en) | Resource pushing method and device, computer equipment and computer readable storage medium | |
CN109740130B (en) | Method and device for generating file | |
CN116167881A (en) | Automatic generation method, device, equipment and storage medium for payment account record | |
CN116069407A (en) | Method and system for analyzing SWIFT message and automatically mapping SWIFT message to business transaction column | |
CN107909382A (en) | User grouping management method, server and storage medium | |
CN109214846A (en) | Information storage means and device | |
CN111368522A (en) | Information generation method and device, computer equipment and computer readable storage medium | |
CN113344674A (en) | Product recommendation method, device, equipment and storage medium based on user purchasing power | |
CN115114280A (en) | Report generation processing method, device and equipment | |
CN105404671A (en) | Implementation method and apparatus for mobile terminal electronic billing | |
CN112631675A (en) | Workflow configuration method, device, equipment and computer readable storage medium |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination |