CN110532350B - Text information online identification processing method and device based on jurisdiction - Google Patents

Text information online identification processing method and device based on jurisdiction Download PDF

Info

Publication number
CN110532350B
CN110532350B CN201910645485.2A CN201910645485A CN110532350B CN 110532350 B CN110532350 B CN 110532350B CN 201910645485 A CN201910645485 A CN 201910645485A CN 110532350 B CN110532350 B CN 110532350B
Authority
CN
China
Prior art keywords
arbitration
arbitration mechanism
contract
appointed
similarity
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201910645485.2A
Other languages
Chinese (zh)
Other versions
CN110532350A (en
Inventor
叶素兰
窦文伟
罗钰林
潘诗韵
张学
乔佳
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Ping An Technology Shenzhen Co Ltd
Original Assignee
Ping An Technology Shenzhen Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Ping An Technology Shenzhen Co Ltd filed Critical Ping An Technology Shenzhen Co Ltd
Publication of CN110532350A publication Critical patent/CN110532350A/en
Application granted granted Critical
Publication of CN110532350B publication Critical patent/CN110532350B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33Querying
    • G06F16/3331Query processing
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q50/00Information and communication technology [ICT] specially adapted for implementation of business processes of specific business sectors, e.g. utilities or tourism
    • G06Q50/10Services
    • G06Q50/18Legal services
    • G06Q50/182Alternative dispute resolution

Landscapes

  • Engineering & Computer Science (AREA)
  • Business, Economics & Management (AREA)
  • Theoretical Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Tourism & Hospitality (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Health & Medical Sciences (AREA)
  • Technology Law (AREA)
  • Data Mining & Analysis (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Economics (AREA)
  • Databases & Information Systems (AREA)
  • Human Resources & Organizations (AREA)
  • Marketing (AREA)
  • Primary Health Care (AREA)
  • Strategic Management (AREA)
  • General Business, Economics & Management (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

The application relates to a text information online identification processing method and device based on jurisdiction in the field of artificial intelligence. The method comprises the following steps: receiving an arbitration application request uploaded by a terminal; returning an arbitration application page to the terminal according to the arbitration application request; receiving arbitration application information uploaded by a terminal through the arbitration application page, wherein the arbitration application information comprises a contract and a current appointed arbitration mechanism; inputting a text format contract to a case-setting auditing model, identifying dispute resolution terms from the text format contract through the case-setting auditing model, and auditing text information in the dispute resolution terms by using preset rules; when the text information audit in the dispute resolution clause passes, determining that the currently specified arbitration mechanism has a corresponding jurisdiction. By adopting the method, whether the current appointed arbitration mechanism has corresponding jurisdiction or not can be rapidly and accurately identified before the on-line arbitration case is subjected to case establishment.

Description

Text information online identification processing method and device based on jurisdiction
Technical Field
The present application relates to the field of computer technologies, and in particular, to a jurisdiction-based text information online identification processing method and apparatus, a computer device, and a storage medium.
Background
Arbitration is a non-prosecution dispute resolution, as opposed to litigation. Arbitration is the resolution of disputes in which parties voluntarily submit complaints to neutral third party referees. The jurisdiction problem is the first problem that the arbitration program must solve, whether there is jurisdiction or not, is critical to both the arbitration and the parties. If the arbitration mechanism has no jurisdiction, it may be revoked or denied execution by the court even if an arbitration is made. In a conventional manner, jurisdiction auditing of an arbitration mechanism is typically accomplished manually.
With the development of internet technology, online arbitration becomes possible. How to quickly and accurately identify whether the current appointed arbitration mechanism has corresponding jurisdiction before the on-line arbitration case is found is a technical problem to be solved first.
Disclosure of Invention
Based on the above, it is necessary to provide a jurisdiction-based text information online identification processing method, apparatus, computer device and storage medium capable of accurately identifying whether a currently specified arbitration mechanism has a corresponding jurisdiction for an online arbitration case before a case is set up.
An jurisdiction-based text information online identification processing method, comprising the following steps:
Receiving an arbitration application request uploaded by a terminal; returning an arbitration application page to the terminal according to the arbitration application request;
receiving arbitration application information uploaded by a terminal through the arbitration application page, wherein the arbitration application information comprises a contract and a current appointed arbitration mechanism;
inputting a text format contract to a case-setting auditing model, identifying dispute resolution terms from the text format contract through the case-setting auditing model, and auditing text information in the dispute resolution terms by using preset rules;
when the text information audit in the dispute resolution clause passes, determining that the currently specified arbitration mechanism has a corresponding jurisdiction.
In one embodiment, the auditing the text information in the dispute resolution terms using preset rules includes:
extracting a contract arbitration mechanism name from text information of the dispute resolution clause through the case-setting audit model;
when the contract arbitration mechanism is extracted, matching the contract arbitration mechanism name with the current appointed arbitration mechanism name, and if the matching is successful, checking the dispute resolution clause;
When the contract arbitration mechanism name is not extracted, determining that the dispute resolution terms audit is not passed.
In one embodiment, said matching said contract arbitration mechanism name with said currently specified arbitration mechanism name comprises:
respectively calculating the similarity corresponding to the appointed arbitration mechanism name and the current appointed arbitration mechanism name by using the pronunciation similarity and the font similarity through the case-standing audit model;
Comparing the similarity corresponding to the appointed arbitration mechanism name with the similarity corresponding to the current appointed arbitration mechanism name;
and when the difference value of the two similarities is within a preset range, determining that the matching is successful.
In one embodiment, said matching said contract arbitration mechanism name with said currently specified arbitration mechanism name comprises:
reading a preset word list through the case-setting auditing model; the preset word list comprises a plurality of alternative names of the currently appointed arbitration mechanism names;
Respectively matching the appointed arbitration mechanism name with the current appointed arbitration mechanism name and a plurality of alternative names through the case-standing audit model;
And when the matching degree of the appointed arbitration mechanism name and the currently appointed arbitration mechanism name or one of the alternative names reaches a threshold value, the successful matching is indicated.
In one embodiment, before the invoking the proposal audit model extracts dispute resolution terms in the contract, the method further comprises:
during the process of converting the contract into a text format, identifying whether specific characters exist in the contract;
When a specific character exists, converting the specific character into a preset character string;
The identifying dispute resolution terms from the text-formatted contract by the case-setting audit model comprises: positioning the dispute resolution terms in the text format contract by using the preset keywords through the case-setting auditing model; the dispute resolution terms include a plurality of dispute resolution means; reading the meaning corresponding to the preset character string; and identifying the agreed dispute resolution modes in the plurality of dispute resolution modes according to the meaning corresponding to the preset character string.
An jurisdiction-based text information online identification processing device, the device comprising:
The communication module is used for receiving an arbitration application request uploaded by the terminal; returning an arbitration application page to the terminal according to the arbitration application request; receiving arbitration application information uploaded by a terminal through the arbitration application page, wherein the arbitration application information comprises a contract and a current appointed arbitration mechanism;
The auditing module is used for inputting the contract in the text format into a case-setting auditing model, identifying the dispute resolution terms from the contract in the text format through the case-setting auditing model, and auditing the text information in the dispute resolution terms by utilizing a preset rule; when the text information audit in the dispute resolution clause passes, determining that the currently specified arbitration mechanism has a corresponding jurisdiction.
In one embodiment, the auditing module is further configured to extract, by using the case-setting auditing model, a contract arbitration mechanism name from text information of the dispute resolution terms; when the contract arbitration mechanism is extracted, matching the contract arbitration mechanism name with the current appointed arbitration mechanism name, and if the matching is successful, checking the dispute resolution clause; when the contract arbitration mechanism name is not extracted, determining that the dispute resolution terms audit is not passed.
In one embodiment, the auditing module is further configured to calculate, by using the pronunciation similarity and the font similarity through the case-setting auditing model, a similarity corresponding to each of the contracted arbitration mechanism name and the current designated arbitration mechanism name; comparing the similarity corresponding to the appointed arbitration mechanism name with the similarity corresponding to the current appointed arbitration mechanism name; and when the difference value of the two similarities is within a preset range, determining that the matching is successful.
A computer device comprising a memory storing a computer program and a processor implementing the steps of the method embodiments described above when the processor executes the computer program.
A computer readable storage medium having stored thereon a computer program which when executed by a processor performs the steps of the various method embodiments described above.
According to the jurisdiction-based text information online identification processing method, the jurisdiction-based text information online identification processing device, the computer equipment and the storage medium, after arbitration application information uploaded by the terminal is received, the text-format contract is input into the case-setting auditing model, the case-setting auditing model extracts dispute resolution terms in the contract of the arbitration application information, and the text information in the dispute resolution terms is audited by utilizing preset rules. When the text information audit in the dispute resolution clause passes, it is determined that the currently specified arbitration mechanism has the corresponding jurisdiction. The whole process can quickly and accurately identify whether the current appointed arbitration mechanism has corresponding jurisdiction or not before the on-line arbitration case is set up without manual participation.
Drawings
FIG. 1 is an application scenario diagram of a jurisdiction-based text information online recognition processing method in one embodiment;
FIG. 2 is a flow diagram of a method for jurisdiction-based text information online identification processing in one embodiment;
FIG. 3 is a flow diagram of a process for matching a contract arbitration mechanism name with a currently specified arbitration mechanism name, in one embodiment;
FIG. 4 is a block diagram of an exemplary device for processing the online identification of jurisdictional-based text messages;
Fig. 5 is an internal structural diagram of a computer device in one embodiment.
Detailed Description
The present application will be described in further detail with reference to the drawings and examples, in order to make the objects, technical solutions and advantages of the present application more apparent. It should be understood that the specific embodiments described herein are for purposes of illustration only and are not intended to limit the scope of the application.
The text information online identification processing method based on jurisdiction provided by the application can be applied to an application environment shown in figure 1. Wherein the terminal 102 communicates with the server 104 via a network. The terminal 102 may be, but not limited to, various personal computers, notebook computers, smartphones, tablet computers, and portable wearable devices, and the server 104 may be implemented by a stand-alone server or a server cluster composed of a plurality of servers.
In one embodiment, as shown in fig. 2, a method for online recognition processing of text information based on jurisdiction is provided, and the method is applied to the server in fig. 1 for illustration, and includes the following steps:
step 202, receiving an arbitration application request uploaded by a terminal; and returning an arbitration application page to the terminal according to the arbitration application request.
Step 204, the receiving terminal uploads the arbitration application information through the arbitration application page, wherein the arbitration application information comprises the contract and the current appointed arbitration mechanism.
An on-line arbitration system is run on the server. The user can register an account of the online arbitration system through the terminal, and log in the online arbitration system by using the account. The terminal may upload an arbitration application request to the on-line arbitration system. And the server returns an arbitration application page to the terminal according to the arbitration application request. The arbitration application page may include a variety of pages including, for example, an arbitration mechanism selection page, an information registration page, and the like. The terminal can select an arbitration mechanism to perform online arbitration for the current case in the arbitration application page. The arbitration mechanism for the amount selected by the terminal may also be referred to as the currently specified arbitration mechanism. The terminal may also input in the information registration page applicant information, contracts required for arbitration, arbitration requests, arbitration facts, arbitration reasons, and other arbitration application information. And the terminal uploads the arbitration application information to the server.
And 206, inputting the contract in the text format into a case-setting auditing model, identifying the dispute resolution terms from the contract in the text format through the case-setting auditing model, and auditing the text information in the dispute resolution terms by utilizing preset rules.
Step 208, when the text message audit in the dispute resolution clause passes, determining that the currently specified arbitration mechanism has a corresponding jurisdiction.
The server establishes a case-setting auditing model in advance, and the case-setting auditing model can audit the jurisdiction of the on-line arbitration. In order to facilitate the operation of the case-setting audit model, the server identifies the contract format in the arbitration application information after receiving the arbitration information. If the contract format is not a text format, the contract needs to be formatted to a text format. For example, a contract in PDF format or image format is converted into text format. The server invokes a standing case audit model, inputs a text format contract to the standing case audit model, and the standing case audit model uses keywords to identify text information in the dispute resolution terms from the text format contract, wherein the keywords include: contract arbitration institutions, contract usage arbitration rules, arbitration clauses, and the like. The case-setting auditing model utilizes preset rules to match the dispute resolution clauses with the current appointed arbitration mechanism, if the match is successful, the dispute resolution clauses are audited to pass, and the current appointed arbitration mechanism is determined to have the corresponding jurisdiction. If the matching fails, the dispute resolution clause audit is not passed, and the fact that the current appointed arbitration mechanism does not have the corresponding jurisdiction is determined.
In one embodiment, auditing text information in a dispute resolution clause using preset rules includes: extracting the names of the contract arbitration mechanisms from the text information of the dispute resolution clauses through a case-setting auditing model; when the name of the contract arbitration mechanism is extracted, the name of the contract arbitration mechanism is matched with the name of the currently appointed arbitration mechanism, and if the matching is successful, the dispute resolution clause is checked and passed; when the contract arbitration mechanism name is not extracted, then it is determined that the dispute resolution terms audit is not passed.
The proposal audit model extracts the contract arbitration mechanism name from the text information of the dispute resolution clause, and matches the contract arbitration mechanism name (hereinafter abbreviated as contract name) with the current appointed arbitration mechanism name (hereinafter abbreviated as appointed name). The matching may be an exact matching or a fuzzy matching. For example, the contract arbitration mechanism name in the dispute resolution clause is the same as the currently specified arbitration mechanism name, i.e., the contract name is exactly the same as the specified name. Or the contract arbitration mechanism name in the dispute resolution clause is considered the same as the currently specified arbitration mechanism name, i.e., the contract name is equivalent to the specified name. And when the contract name of the arbitration mechanism is the same as or equal to the appointed name, determining that the dispute resolution clause is audited to pass. Indicating that the currently designated arbitration mechanism has the corresponding jurisdiction. Otherwise, the audit is not passed, which means that the currently designated arbitration mechanism does not have the corresponding jurisdiction. For example, the agreed name of the arbitration mechanism is different from the specified name; the arbitration jurisdiction is agreed, but the arbitration mechanism is unknown in agreement; clearly contracted court jurisdiction; the dispute resolution means is contracted unequivocally; dispute resolution is not contracted, etc. All are not passed.
In this embodiment, after receiving the arbitration application information uploaded by the terminal, the standing auditing model extracts the dispute resolution terms from the contract of the arbitration application information by inputting the contract in the text format to the standing auditing model, and the text information in the dispute resolution terms is audited by using the preset rule. When the text information audit in the dispute resolution clause passes, it is determined that the currently specified arbitration mechanism has the corresponding jurisdiction. The whole process can quickly and accurately identify whether the current appointed arbitration mechanism has corresponding jurisdiction or not before the on-line arbitration case is set up without manual participation.
In one embodiment, as shown in FIG. 3, the step of matching the contracted arbitration mechanism name with the currently assigned arbitration mechanism name includes:
step 302, respectively calculating the similarity corresponding to the appointed arbitration mechanism name and the current appointed arbitration mechanism name by using the pronunciation similarity and the font similarity through the case-standing audit model.
Step 304, the similarity corresponding to the contracted arbitration mechanism name is compared with the similarity corresponding to the currently appointed arbitration mechanism name.
In step 306, when the difference between the two similarities is within the preset range, it is determined that the matching is successful.
When matching, because Chinese description is adopted in the contract, the case-standing audit model needs to match the contracted arbitration mechanism name of Chinese characters and the currently appointed arbitration mechanism name. In order to effectively and highly match the accuracy, the case-based audit model can utilize the names of the agreed arbitration mechanisms to compare the similarity corresponding to the names of the currently appointed arbitration mechanisms.
Specifically, the similarity includes pronunciation similarity and font similarity. In order to effectively improve accuracy of similarity comparison, the pronunciation similarity and the font similarity can be respectively provided with corresponding weights. For example, d=ap+bs, where d is the similarity between two words, P is the pronunciation similarity, and S is the glyph similarity. a is the weight of pronunciation similarity, and b is the weight of font similarity. The case-standing auditing model can encode according to the pinyin and the tone of the word to generate a pronunciation character string, and perform hash calculation on the pronunciation character string to convert the pronunciation character string into a pronunciation hash value so as to obtain the pronunciation similarity. The case-standing auditing model can encode the fonts according to the font similarity, generate font character strings, hash the font character strings, and convert the font character strings into font hash values, thereby obtaining the font similarity. The encoding method of the quadrangle number can be adopted for encoding the fonts. And accumulating the pronunciation hash value, the font hash value and the weights corresponding to the pronunciation hash value and the font hash value respectively by the case-standing auditing model to obtain a total hash value, and obtaining a similarity value.
The proposal auditing model calculates the similarity corresponding to the appointed arbitration mechanism name and the current appointed arbitration mechanism name respectively according to the mode, compares the similarity of the two arbitration mechanism names, namely compares the similarity of the appointed name with the similarity of the appointed name, determines that the two arbitration mechanism names are matched if the difference value of the two is within a preset range, and the auditing is passed, so that the current appointed arbitration mechanism has corresponding jurisdiction.
In this embodiment, by using the pronunciation similarity and the font similarity, the similarity of the contracted arbitration mechanism name is compared with the similarity of the currently specified arbitration mechanism name, so that the two arbitration mechanism names of the Chinese character can be effectively matched, and whether the currently specified arbitration mechanism has the corresponding jurisdiction can be accurately identified.
In one embodiment, matching the contracted arbitration mechanism name with the currently specified arbitration mechanism name includes: reading a preset word list through a case-setting auditing model; the preset word list comprises a plurality of alternative names of the currently appointed arbitration mechanism name; respectively matching the appointed arbitration mechanism name with the current appointed arbitration mechanism name and a plurality of alternative names through a case-setting audit model; and when the matching degree of the appointed arbitration mechanism name and the current appointed arbitration mechanism name or one of the alternative names reaches a threshold value, the successful matching is indicated.
Considering that there may be some contract non-norms in the description of the arbitration mechanism, for example, short names, others of the arbitration mechanism are adopted. The result returned by the case-filing audit model may be a null value where it cannot be determined whether the currently designated arbitration mechanism has the corresponding jurisdiction. In order to solve the problem, the accuracy of jurisdiction identification is further improved, and the proposal audit model can audit the dispute resolution terms by adopting a fuzzy matching algorithm.
Specifically, a corresponding vocabulary may be pre-established in the server, and a plurality of alternative names corresponding to the currently specified arbitration mechanism may be recorded in the vocabulary, including short names, unique names, great names, and the like. When the proposal audit model audits the dispute resolution clauses, a plurality of alternative names corresponding to the names (namely the appointed names) of the current appointed arbitration mechanisms can be extracted from the word list, fuzzy matching is carried out by utilizing the alternative names and the appointed names, if the matching is successful, the audit is passed, and the current appointed arbitration mechanisms are determined to have corresponding jurisdiction rights. Otherwise, the auditing is not passed, and the fact that the current appointed arbitration mechanism does not have corresponding jurisdiction is determined.
Further, the case-setting audit model may calculate a similarity name corresponding to each alternative name according to the manner provided in the above embodiment, and when performing fuzzy matching, the case-setting audit model may respectively compare the similarity of the name of the contract arbitration mechanism (i.e. the contract name) with the similarity of the designated name, and compare the similarity of the contract name with the similarity of the alternative name, thereby obtaining a plurality of comparison results. When a difference value exists in the comparison result within a preset range, namely the matching degree reaches a threshold value, the names of the two arbitration mechanisms are determined to be matched, and the verification is passed, so that the fact that the current appointed arbitration mechanism has corresponding jurisdiction can be known.
In one embodiment, before identifying the dispute resolution terms from the text-formatted contract by the criminal audit model, the method further comprises: during the process of converting the contract, identifying whether specific characters exist in the contract; when the specific character exists, converting the specific character into a preset character string; identifying dispute resolution terms from a text formatted contract by a case-setting audit model includes: positioning the dispute resolution terms in the text format contract by using a preset keyword through a case-setting auditing model; the terms of dispute resolution include a variety of dispute resolution means; reading the meaning corresponding to the preset character string; and identifying the agreed dispute resolution modes in the plurality of dispute resolution modes according to the meaning corresponding to the preset character string.
In identifying the dispute resolution terms of the contract, the server needs to convert the contract to a text format. When the server performs format conversion on the combination, some special characters may not be correctly recognized, so that the case-setting audit model cannot recognize corresponding arbitration clauses. For example, various dispute resolution means may be contracted in the contract, including courts, arbitration agencies, and the like. If the contract is described in the following format:
The dispute resolution is as follows:
shenzhen city middle-grade court
Shenzhen international arbitration hospital
From this, it can be known that the resolution of the contract in collaboration is arbitrated and that a specific arbitration mechanism is contracted, but becauseThis special character cannot be recognized, resulting in the arbitration clause being unrecognizable. To solve this problem, the server may convert the special character into a preset character string when format conversion is performed on the combination. The server may obtain a comparison table, where a plurality of character strings corresponding to the special characters are recorded, and when the server reads the special characters in the contract, the server converts the special characters into corresponding preset character strings. Different feature characters may be converted into different preset character strings.
The corresponding meaning of the preset character string can also be recorded in the comparison table. For example, when the preset string is 000, it indicates that it is not selected; when the preset character string is 111, the selection is indicated. The case-setting audit model can locate preset dispute terms in the converted-format contract by using preset keywords. And the case-setting auditing model identifies the dispute resolution mode appointed in the dispute resolution clause according to the meaning corresponding to the preset character string. When the dispute resolution terms of the contract are identified as the arbitration mode, the standing auditing model audits the dispute resolution terms so as to identify whether the appointed arbitration mechanism has corresponding jurisdiction.
Further, the case-standing auditing model can audit the dispute resolution terms in the manner of the embodiment, so as to effectively identify whether the current arbitration mechanism has corresponding jurisdiction.
It should be understood that, although the steps in the flowcharts of fig. 2-3 are shown in order as indicated by the arrows, these steps are not necessarily performed in order as indicated by the arrows. The steps are not strictly limited to the order of execution unless explicitly recited herein, and the steps may be executed in other orders. Moreover, at least some of the steps in fig. 2-3 may include multiple sub-steps or stages that are not necessarily performed at the same time, but may be performed at different times, nor do the order in which the sub-steps or stages are performed necessarily occur sequentially, but may be performed alternately or alternately with at least a portion of the sub-steps or stages of other steps or steps.
In one embodiment, as shown in fig. 4, there is provided an apparatus for processing text information online identification based on jurisdiction, including: a communication module 402 and an auditing module 404, wherein:
A communication module 402, configured to receive an arbitration application request uploaded by a terminal; returning an arbitration application page to the terminal according to the arbitration application request; and the receiving terminal uploads arbitration application information through an arbitration application page, wherein the arbitration application information comprises a contract and a current appointed arbitration mechanism.
An auditing module 404, configured to input a text format contract to a case-standing auditing model, identify a dispute resolution term from the text format contract through the case-standing auditing model, and audit text information in the dispute resolution term by using a preset rule; determining that the currently designated arbitration mechanism has a corresponding jurisdiction when the text message audit in the dispute resolution clause passes
In one embodiment, the auditing module is further configured to extract, by means of a case-setting auditing model, a contract arbitration mechanism name from text information of the dispute resolution terms; when the contract arbitration mechanism is extracted, the contract arbitration mechanism name is matched with the currently appointed arbitration mechanism name, and if the matching is successful, the dispute resolution clause is checked and passed; when the contract arbitration mechanism name is not extracted, then it is determined that the dispute resolution terms audit is not passed.
In one embodiment, the auditing module is further used for calculating the similarity corresponding to the appointed arbitration mechanism name and the current appointed arbitration mechanism name respectively by using the pronunciation similarity and the font similarity through the case-setting auditing model; comparing the similarity corresponding to the appointed arbitration mechanism name with the similarity corresponding to the current appointed arbitration mechanism name; and when the difference value of the two similarities is within a preset range, determining that the matching is successful.
In one embodiment, the auditing module is further configured to read a preset vocabulary through a case-setting auditing model; the preset word list comprises a plurality of alternative names of the currently appointed arbitration mechanism name; respectively matching the appointed arbitration mechanism name with the current appointed arbitration mechanism name and a plurality of alternative names through a case-setting audit model; and when the matching degree of the appointed arbitration mechanism name and the current appointed arbitration mechanism name or one of the alternative names reaches a threshold value, the successful matching is indicated.
In one embodiment, the apparatus further comprises: the format conversion module is used for identifying whether specific characters exist in the contract or not in the process of converting the contract into the text format; when the specific character exists, converting the specific character into a preset character string; the auditing module is also used for positioning the dispute resolution terms in the text format contract by utilizing the preset keywords through the case-setting auditing model; the terms of dispute resolution include a variety of dispute resolution means; reading the meaning corresponding to the preset character string; and identifying the agreed dispute resolution modes in the plurality of dispute resolution modes according to the meaning corresponding to the preset character string.
For specific limitations on the jurisdiction-based text information online identification processing device, reference may be made to the above limitation on the jurisdiction-based text information online identification processing method, and the description thereof will not be repeated here. The above-mentioned various modules in the jurisdiction-based text information online identification processing device may be implemented in whole or in part by software, hardware, and combinations thereof. The above modules may be embedded in hardware or may be independent of a processor in the computer device, or may be stored in software in a memory in the computer device, so that the processor may call and execute operations corresponding to the above modules.
In one embodiment, a computer device is provided, which may be a server, the internal structure of which may be as shown in fig. 5. The computer device includes a processor, a memory, a network interface, and a database connected by a system bus. Wherein the processor of the computer device is configured to provide computing and control capabilities. The memory of the computer device includes a non-volatile storage medium and an internal memory. The non-volatile storage medium stores an operating system, computer programs, and a database. The internal memory provides an environment for the operation of the operating system and computer programs in the non-volatile storage media. The database of the computer device is used for storing arbitration application information and the like. The network interface of the computer device is used for communicating with an external terminal through a network connection. The computer program, when executed by a processor, implements a jurisdiction-based text information online recognition processing method.
It will be appreciated by those skilled in the art that the structure shown in FIG. 5 is merely a block diagram of some of the structures associated with the present inventive arrangements and is not limiting of the computer device to which the present inventive arrangements may be applied, and that a particular computer device may include more or fewer components than shown, or may combine some of the components, or have a different arrangement of components.
In one embodiment, a computer device is provided comprising a memory storing a computer program and a processor that when executing the computer program performs the steps of: receiving an arbitration application request uploaded by a terminal; returning an arbitration application page to the terminal according to the arbitration application request; receiving arbitration application information uploaded by a terminal through an arbitration application page, wherein the arbitration application information comprises a contract and a current appointed arbitration mechanism; inputting the contract in the text format into a case-setting auditing model, identifying the dispute resolution terms from the contract in the text format through the case-setting auditing model, and auditing the text information in the dispute resolution terms by utilizing a preset rule; when the text information audit in the dispute resolution clause passes, it is determined that the currently specified arbitration mechanism has the corresponding jurisdiction.
In one embodiment, the processor when executing the computer program further implements: extracting the names of the contract arbitration mechanisms from the text information of the dispute resolution clauses through a case-setting auditing model; when the contract arbitration mechanism is extracted, the contract arbitration mechanism name is matched with the currently appointed arbitration mechanism name, and if the matching is successful, the dispute resolution clause is checked and passed; when the contract arbitration mechanism name is not extracted, then it is determined that the dispute resolution terms audit is not passed.
In one embodiment, the processor when executing the computer program further implements: respectively calculating the similarity corresponding to the appointed arbitration mechanism name and the current appointed arbitration mechanism name by using the pronunciation similarity and the font similarity through a case-standing audit model; comparing the similarity corresponding to the appointed arbitration mechanism name with the similarity corresponding to the current appointed arbitration mechanism name; and when the difference value of the two similarities is within a preset range, determining that the matching is successful.
In one embodiment, the processor when executing the computer program further implements: reading a preset word list through a case-setting auditing model; the preset word list comprises a plurality of alternative names of the currently appointed arbitration mechanism name; respectively matching the appointed arbitration mechanism name with the current appointed arbitration mechanism name and a plurality of alternative names through a case-setting audit model; and when the matching degree of the appointed arbitration mechanism name and the current appointed arbitration mechanism name or one of the alternative names reaches a threshold value, the successful matching is indicated.
In one embodiment, the processor when executing the computer program further implements: during the process of converting the contract into the text format, identifying whether specific characters exist in the contract; when the specific character exists, converting the specific character into a preset character string; positioning the dispute resolution terms in the text format contract by using a preset keyword through a case-setting auditing model; the terms of dispute resolution include a variety of dispute resolution means; reading the meaning corresponding to the preset character string; and identifying the agreed dispute resolution modes in the plurality of dispute resolution modes according to the meaning corresponding to the preset character string.
In one embodiment, a computer readable storage medium is provided having a computer program stored thereon, the computer program being executed by a processor to: receiving an arbitration application request uploaded by a terminal; returning an arbitration application page to the terminal according to the arbitration application request; receiving arbitration application information uploaded by a terminal through an arbitration application page, wherein the arbitration application information comprises a contract and a current appointed arbitration mechanism; inputting the contract in the text format into a case-setting auditing model, identifying the dispute resolution terms from the contract in the text format through the case-setting auditing model, and auditing the text information in the dispute resolution terms by utilizing a preset rule; when the text information audit in the dispute resolution clause passes, it is determined that the currently specified arbitration mechanism has the corresponding jurisdiction.
In one embodiment, the computer program is further executable by the processor to: extracting the names of the contract arbitration mechanisms from the text information of the dispute resolution clauses through a case-setting auditing model; when the contract arbitration mechanism is extracted, the contract arbitration mechanism name is matched with the currently appointed arbitration mechanism name, and if the matching is successful, the dispute resolution clause is checked and passed; when the contract arbitration mechanism name is not extracted, then it is determined that the dispute resolution terms audit is not passed.
In one embodiment, the computer program is further executable by the processor to: respectively calculating the similarity corresponding to the appointed arbitration mechanism name and the current appointed arbitration mechanism name by using the pronunciation similarity and the font similarity through a case-standing audit model; comparing the similarity corresponding to the appointed arbitration mechanism name with the similarity corresponding to the current appointed arbitration mechanism name; and when the difference value of the two similarities is within a preset range, determining that the matching is successful.
In one embodiment, the computer program is further executable by the processor to: reading a preset word list through a case-setting auditing model; the preset word list comprises a plurality of alternative names of the currently appointed arbitration mechanism name; respectively matching the appointed arbitration mechanism name with the current appointed arbitration mechanism name and a plurality of alternative names through a case-setting audit model; and when the matching degree of the appointed arbitration mechanism name and the current appointed arbitration mechanism name or one of the alternative names reaches a threshold value, the successful matching is indicated.
In one embodiment, the computer program is further executable by the processor to: during the process of converting the contract into the text format, identifying whether specific characters exist in the contract; when the specific character exists, converting the specific character into a preset character string; positioning the dispute resolution terms in the text format contract by using a preset keyword through a case-setting auditing model; the terms of dispute resolution include a variety of dispute resolution means; reading the meaning corresponding to the preset character string; and identifying the agreed dispute resolution modes in the plurality of dispute resolution modes according to the meaning corresponding to the preset character string.
Those skilled in the art will appreciate that implementing all or part of the above described methods may be accomplished by way of a computer program stored on a non-transitory computer readable storage medium, which when executed, may comprise the steps of the embodiments of the methods described above. Any reference to memory, storage, database, or other medium used in embodiments provided herein may include non-volatile and/or volatile memory. The nonvolatile memory can include Read Only Memory (ROM), programmable ROM (PROM), electrically Programmable ROM (EPROM), electrically Erasable Programmable ROM (EEPROM), or flash memory. Volatile memory can include Random Access Memory (RAM) or external cache memory. By way of illustration and not limitation, RAM is available in a variety of forms such as Static RAM (SRAM), dynamic RAM (DRAM), synchronous DRAM (SDRAM), double Data Rate SDRAM (DDRSDRAM), enhanced SDRAM (ESDRAM), synchronous link (SYNCHLINK) DRAM (SLDRAM), memory bus (Rambus) direct RAM (RDRAM), direct memory bus dynamic RAM (DRDRAM), and memory bus dynamic RAM (RDRAM), among others.
The technical features of the above embodiments may be arbitrarily combined, and all possible combinations of the technical features in the above embodiments are not described for brevity of description, however, as long as there is no contradiction between the combinations of the technical features, they should be considered as the scope of the description.
The above examples illustrate only a few embodiments of the application, which are described in detail and are not to be construed as limiting the scope of the application. It should be noted that it will be apparent to those skilled in the art that several variations and modifications can be made without departing from the spirit of the application, which are all within the scope of the application. Accordingly, the scope of protection of the present application is to be determined by the appended claims.

Claims (10)

1. An jurisdiction-based text information online identification processing method, comprising the following steps:
Receiving an arbitration application request uploaded by a terminal; returning an arbitration application page to the terminal according to the arbitration application request;
receiving arbitration application information uploaded by a terminal through the arbitration application page, wherein the arbitration application information comprises a contract and a current appointed arbitration mechanism;
Identifying the contract format of the contract, carrying out format conversion on the contract when the contract format is PDF format or image format, and identifying whether specific characters exist in the contract in the process of converting the contract into text format; when a specific character exists, converting the specific character into a preset character string to obtain a contract containing the text format of the preset character string;
Inputting the text format contract to a standing auditing model, and positioning the dispute resolution terms in the text format contract by using preset keywords; the dispute resolution terms include a plurality of dispute resolution means; reading the meaning corresponding to the preset character string; identifying the agreed dispute resolution modes in a plurality of dispute resolution modes according to the meaning corresponding to the preset character string; the dispute resolution of the contract includes contract arbitration mechanism names; matching the appointed arbitration mechanism name with the current appointed arbitration mechanism name, if the matching is successful, checking the dispute resolution terms, and determining that the current appointed arbitration mechanism has corresponding jurisdiction;
different special characters can be converted into different preset character strings;
the matching the contract arbitration mechanism name with the currently specified arbitration mechanism name includes:
Encoding according to the pinyin and the tone of the words through the case-setting auditing model to generate pronunciation character strings, carrying out hash calculation on the pronunciation character strings, converting the pronunciation character strings into pronunciation hash values, and respectively calculating pronunciation similarity corresponding to the appointed arbitration mechanism name and the current appointed arbitration mechanism name based on the pronunciation hash values;
Encoding according to word fonts through the case-setting auditing model, generating a font character string, carrying out hash calculation on the font character string, converting the font character string into font hash values, and respectively calculating font similarity corresponding to the appointed arbitration mechanism name and the current appointed arbitration mechanism name based on the font hash values;
Based on the pronunciation similarity and the font similarity, determining the similarity corresponding to the appointed arbitration mechanism name and the current appointed arbitration mechanism name;
Comparing the similarity corresponding to the appointed arbitration mechanism name with the similarity corresponding to the current appointed arbitration mechanism name;
and when the difference value of the two similarities is within a preset range, determining that the matching is successful.
2. The method of claim 1, wherein the preset key comprises a contract arbitration mechanism.
3. The method of claim 1, wherein determining the similarity to which the contracted arbitration name and the currently specified arbitration name each correspond based on the pronunciation similarity and the font similarity comprises:
And determining the similarity corresponding to each of the appointed arbitration mechanism name and the current appointed arbitration mechanism name based on the pronunciation similarity, the font similarity, the weight corresponding to the pronunciation similarity and the weight corresponding to the font similarity.
4. The method of claim 1, wherein said matching the contract arbitration mechanism name with the currently specified arbitration mechanism name comprises:
reading a preset word list through the case-setting auditing model; the preset word list comprises a plurality of alternative names of the currently appointed arbitration mechanism names;
Respectively matching the appointed arbitration mechanism name with the current appointed arbitration mechanism name and a plurality of alternative names through the case-standing audit model;
And when the matching degree of the appointed arbitration mechanism name and the currently appointed arbitration mechanism name or one of the alternative names reaches a threshold value, the successful matching is indicated.
5. An jurisdiction-based text information online identification processing device, comprising:
The communication module is used for receiving an arbitration application request uploaded by the terminal; returning an arbitration application page to the terminal according to the arbitration application request; receiving arbitration application information uploaded by a terminal through the arbitration application page, wherein the arbitration application information comprises a contract and a current appointed arbitration mechanism;
The format conversion module is used for identifying the contract format of the contract, converting the format of the contract when the contract format is PDF format or image format, and identifying whether specific characters exist in the contract in the process of converting the contract into text format; when a specific character exists, converting the specific character into a preset character string to obtain a contract containing the text format of the preset character string;
The auditing module is used for inputting the text format contract to a case-standing auditing model and positioning the dispute resolution terms in the text format contract by utilizing preset keywords; the dispute resolution terms include a plurality of dispute resolution means; reading the meaning corresponding to the preset character string; identifying the agreed dispute resolution modes in a plurality of dispute resolution modes according to the meaning corresponding to the preset character string; the dispute resolution of the contract includes contract arbitration mechanism names; matching the appointed arbitration mechanism name with the current appointed arbitration mechanism name, if the matching is successful, checking the dispute resolution terms, and determining that the current appointed arbitration mechanism has corresponding jurisdiction;
The auditing module is also used for encoding according to the pinyin and the tone of the words through the case-standing auditing model to generate pronunciation character strings, carrying out hash calculation on the pronunciation character strings, converting the pronunciation character strings into pronunciation hash values, and respectively calculating the pronunciation similarity corresponding to the appointed arbitration mechanism name and the current appointed arbitration mechanism name based on the pronunciation hash values; encoding according to word fonts through the case-setting auditing model, generating a font character string, carrying out hash calculation on the font character string, converting the font character string into font hash values, and respectively calculating font similarity corresponding to the appointed arbitration mechanism name and the current appointed arbitration mechanism name based on the font hash values; based on the pronunciation similarity and the font similarity, determining the similarity corresponding to the appointed arbitration mechanism name and the current appointed arbitration mechanism name; comparing the similarity corresponding to the appointed arbitration mechanism name with the similarity corresponding to the current appointed arbitration mechanism name; and when the difference value of the two similarities is within a preset range, determining that the matching is successful.
6. The apparatus of claim 5, wherein the key comprises a contract arbitration mechanism.
7. The apparatus of claim 5, wherein the auditing module is further configured to determine a similarity for each of the contracted arbitration mechanism name and the currently specified arbitration mechanism name based on the pronunciation similarity, the font similarity, the weight for the pronunciation similarity, and the weight for the font similarity.
8. The apparatus of claim 5, wherein the auditing module is further configured to read a preset vocabulary via the case-setting auditing model; the preset word list comprises a plurality of alternative names of the currently appointed arbitration mechanism names; respectively matching the appointed arbitration mechanism name with the current appointed arbitration mechanism name and a plurality of alternative names through the case-standing audit model; and when the matching degree of the appointed arbitration mechanism name and the currently appointed arbitration mechanism name or one of the alternative names reaches a threshold value, the successful matching is indicated.
9. A computer device comprising a memory and a processor, the memory storing a computer program, characterized in that the processor implements the steps of the method of any of claims 1 to 4 when the computer program is executed.
10. A computer readable storage medium, on which a computer program is stored, characterized in that the computer program, when being executed by a processor, implements the steps of the method of any of claims 1 to 4.
CN201910645485.2A 2019-06-14 2019-07-17 Text information online identification processing method and device based on jurisdiction Active CN110532350B (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201910514672 2019-06-14
CN2019105146727 2019-06-14

Publications (2)

Publication Number Publication Date
CN110532350A CN110532350A (en) 2019-12-03
CN110532350B true CN110532350B (en) 2024-05-03

Family

ID=68660647

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910645485.2A Active CN110532350B (en) 2019-06-14 2019-07-17 Text information online identification processing method and device based on jurisdiction

Country Status (1)

Country Link
CN (1) CN110532350B (en)

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7028259B1 (en) * 2000-02-01 2006-04-11 Jacobson Robert L Interactive legal citation checker
CN102880647A (en) * 2012-08-24 2013-01-16 北京百度网讯科技有限公司 Method and device for acquiring another name of organization
CN108182252A (en) * 2017-12-29 2018-06-19 方正璞华软件(武汉)股份有限公司 A kind of Personnel disputes arbitration information storage system and method
CN108509458A (en) * 2017-02-28 2018-09-07 阿里巴巴集团控股有限公司 A kind of business object recognition methods and device
CN109447105A (en) * 2018-09-10 2019-03-08 平安科技(深圳)有限公司 Contract audit method, apparatus, computer equipment and storage medium
CN109741168A (en) * 2018-12-07 2019-05-10 上海指旺信息科技有限公司 Online arbitration information exchange method and system
CN109753653A (en) * 2018-12-25 2019-05-14 金蝶软件(中国)有限公司 Entity name recognition methods, device, computer equipment and storage medium

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US11416956B2 (en) * 2017-03-15 2022-08-16 Coupa Software Incorporated Machine evaluation of contract terms

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7028259B1 (en) * 2000-02-01 2006-04-11 Jacobson Robert L Interactive legal citation checker
CN102880647A (en) * 2012-08-24 2013-01-16 北京百度网讯科技有限公司 Method and device for acquiring another name of organization
CN108509458A (en) * 2017-02-28 2018-09-07 阿里巴巴集团控股有限公司 A kind of business object recognition methods and device
CN108182252A (en) * 2017-12-29 2018-06-19 方正璞华软件(武汉)股份有限公司 A kind of Personnel disputes arbitration information storage system and method
CN109447105A (en) * 2018-09-10 2019-03-08 平安科技(深圳)有限公司 Contract audit method, apparatus, computer equipment and storage medium
CN109741168A (en) * 2018-12-07 2019-05-10 上海指旺信息科技有限公司 Online arbitration information exchange method and system
CN109753653A (en) * 2018-12-25 2019-05-14 金蝶软件(中国)有限公司 Entity name recognition methods, device, computer equipment and storage medium

Also Published As

Publication number Publication date
CN110532350A (en) 2019-12-03

Similar Documents

Publication Publication Date Title
CN110765763B (en) Error correction method and device for voice recognition text, computer equipment and storage medium
WO2020077895A1 (en) Signing intention determining method and apparatus, computer device, and storage medium
CN109992664B (en) Dispute focus label classification method and device, computer equipment and storage medium
CN111324743A (en) Text relation extraction method and device, computer equipment and storage medium
CN111814466A (en) Information extraction method based on machine reading understanding and related equipment thereof
CN110362799B (en) On-line arbitration-based method and device for generating and processing resolution book and computer equipment
CN110008250B (en) Social security data processing method and device based on data mining and computer equipment
CN111651992A (en) Named entity labeling method and device, computer equipment and storage medium
CN111680634B (en) Document file processing method, device, computer equipment and storage medium
CN112380837B (en) Similar sentence matching method, device, equipment and medium based on translation model
CN111859916B (en) Method, device, equipment and medium for extracting key words of ancient poems and generating poems
CN112417887B (en) Sensitive word and sentence recognition model processing method and related equipment thereof
CN110362798B (en) Method, apparatus, computer device and storage medium for judging information retrieval analysis
CN112231224A (en) Business system testing method, device, equipment and medium based on artificial intelligence
WO2021139278A1 (en) Intelligent interview method and apparatus, and terminal device
CN113159013B (en) Paragraph identification method, device, computer equipment and medium based on machine learning
CN110517007A (en) Request for arbitration, which is put on record, audits processing method, device and computer equipment
CN112766319A (en) Dialogue intention recognition model training method and device, computer equipment and medium
CN114429635A (en) Book management method
CN110956195B (en) Image matching method, device, computer equipment and storage medium
CN110532543A (en) Analysis and processing method, device, computer equipment and the storage medium of evidence material
CN110533381B (en) Case jurisdiction auditing method, device, computer equipment and storage medium
CN111723870B (en) Artificial intelligence-based data set acquisition method, apparatus, device and medium
CN114281931A (en) Text matching method, device, equipment, medium and computer program product
CN110390083B (en) Method and device for pushing approximate cases, computer equipment and storage medium

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant