CN112883687B - Law contract interactive labeling method based on contract text markup language - Google Patents

Law contract interactive labeling method based on contract text markup language Download PDF

Info

Publication number
CN112883687B
CN112883687B CN202110162638.5A CN202110162638A CN112883687B CN 112883687 B CN112883687 B CN 112883687B CN 202110162638 A CN202110162638 A CN 202110162638A CN 112883687 B CN112883687 B CN 112883687B
Authority
CN
China
Prior art keywords
legal
text
contract
user
asset
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202110162638.5A
Other languages
Chinese (zh)
Other versions
CN112883687A (en
Inventor
朱岩
王迪
范雨晴
童晓梅
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Tianjin Jiaotu Information Technology Co ltd
University of Science and Technology Beijing USTB
Original Assignee
Tianjin Jiaotu Information Technology Co ltd
University of Science and Technology Beijing USTB
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Tianjin Jiaotu Information Technology Co ltd, University of Science and Technology Beijing USTB filed Critical Tianjin Jiaotu Information Technology Co ltd
Priority to CN202110162638.5A priority Critical patent/CN112883687B/en
Publication of CN112883687A publication Critical patent/CN112883687A/en
Application granted granted Critical
Publication of CN112883687B publication Critical patent/CN112883687B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/10Text processing
    • G06F40/103Formatting, i.e. changing of presentation of documents
    • G06F40/117Tagging; Marking up; Designating a block; Setting of attributes
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/10Text processing
    • G06F40/12Use of codes for handling textual entities
    • G06F40/151Transformation
    • G06F40/157Transformation using dictionaries or tables
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q50/00Information and communication technology [ICT] specially adapted for implementation of business processes of specific business sectors, e.g. utilities or tourism
    • G06Q50/10Services
    • G06Q50/18Legal services

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • General Health & Medical Sciences (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Health & Medical Sciences (AREA)
  • Business, Economics & Management (AREA)
  • Tourism & Hospitality (AREA)
  • Artificial Intelligence (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Computational Linguistics (AREA)
  • General Engineering & Computer Science (AREA)
  • Technology Law (AREA)
  • Economics (AREA)
  • Human Resources & Organizations (AREA)
  • Marketing (AREA)
  • Primary Health Care (AREA)
  • Strategic Management (AREA)
  • General Business, Economics & Management (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

The invention discloses a contract text markup language-based law contract interactive labeling method, which comprises the following steps: 1) the user selects the legal constituent elements in the legal contract text, and prompts the user to appoint or revise the legal attributes, legal components and characteristics of the selected legal constituent elements in an interactive mode to generate a marking frame of the legal contract text; 2) establishing a Contract Text Markup Language (CTML), wherein the Contract Text Markup Language (CTML) comprises semantic marks and source marks according to mark types; 3) and adding the semantic mark and the user-oriented customized data source mark into a mark frame of the legal contract text to realize the generation of the contract text with legal element marks. The invention provides automatic generation of labels meeting the grammatical rules in the language for processing legal contracts by users, and is beneficial to understanding contract semantics and standardized processing by adopting a digital means, block chain intelligent contract general development and style personalized display.

Description

Law contract interactive labeling method based on contract text markup language
Technical Field
The invention relates to the technical field of information, in particular to a contract text markup language-based interactive labeling method for legal contracts, which is used for development of information services and applications related to the legal contracts.
Background
The contract is an agreement for establishing, changing and terminating civil legal relationships among civil subjects, is an important means for maintaining the market economic order and guaranteeing the legal rights and interests of the parties, and has an important position in the market economy. With the development of information, computer and network technologies in recent years, especially the emergence of technologies such as electronic signatures, block chains, intelligent contracts, etc., the trend from the electronization and automatic processing of contracts, to the implementation of automatic execution of contract terms by computer, and even to the development of law laws in the future has become a real or future trend. However, since the text of the legal contract is written in natural language, it still cannot be processed directly by computer, which directly results in that the legal contract processing still relies on legal person to interpret the legal meaning, and the deep application of the legal contract is seriously affected. Particularly for the blockchain intelligent contracts, an effective method for accurately understanding contract contents and meaning expressions is lacking, so that legal contract texts to intelligent contract codes still need to be realized by cooperation of legal persons and programmers, and the written intelligent contract programs also lack legal recognition and corresponding legal effectiveness.
The reason can be summarized as the accuracy problem expressed by the meaning of the legal contract and the computer mechanism solution problem expressed by the meaning of the legal contract. Although the contract text is legal, the natural language used by the contract text has expressive ambiguity objectively, which easily causes ambiguity in understanding by different legal subjects, and the program represented by the computer language does not allow ambiguity, so that the practical problems of ambiguity elimination in the computer processing process of legal documents and how to represent contract contents in a more effective form exist.
One solution to this problem is: a method for standardizing the content and meaning expression in the legal contract text is established, and the content of grammar, structure and vocabulary in the document is labeled, so that various personnel can understand the legal contract text more accurately and consistently. Although there are some works for extracting information from the contract text, there is no effective method for extracting legal elements and their features from the contract according to the current legal requirements, and there is no method for standardizing the extracted information for the legal elements and their features which are easy to be processed by computer.
Disclosure of Invention
In order to solve the technical problems, the invention provides a Contract Text Markup Language (CTML), which is a normative computer processing language for creating or expressing the meaning of a legal contract and conforms to the basic document specifications of relevant laws such as the national common and national codex (hereinafter referred to as the national codex). On the basis, the invention provides a legal contract interactive labeling method based on contract text markup language, which provides automatic generation of a label conforming to grammatical specifications in Contract Text Markup Language (CTML) for a user to process a legal contract text in an interactive mode, and realizes more standardized contract text design and related legal contract application development by supporting an exchange label data table (EMD, exchange table for short).
The invention is realized by the following technical scheme:
the markup language of the present invention is a computer-processable word code that combines text and text-related information to reveal information about the structure, characteristics, lexical meaning and meaning of documents, and details of data processing.
The legal contract in the invention refers to the agreement of setting up, changing and terminating the legal relation among the civil subjects, the explanation of the relevant legal vocabulary in the contract text refers to the relevant legal laws and regulations of the national standard of law, the electronic signature law, etc., and the data message refers to the information generated, sent, received or stored by electronic, optical, magnetic or similar means. The contract terms are the manifestation and immobilization of the contract conditions and are the basis for determining the rights and obligations of the parties to the contract, i.e., in the case of legal documents, the content of the contract refers to the terms of the contract. The legal elements in the invention are the essential elements or elements necessary for the existence of legal regulations, and are composed of legal rules, principles and concepts. The meaning explanation of the legal element refers to the explanation of the meaning of the legal element expressed in the legal property, the legal component, the characteristic and the like in the legal contract.
The symbols used and the description of the invention are given below:
@ identification prefix
I same level element or
[] Keyword optional
Optional feature indicator
% type indicator
+ zero or more terms
Hierarchy relationship connector
< > </semantic tags
< { } > number source tag
The present invention assumes that the legal contract text does not contain the same markup symbols as the semantic tags and the source tags. The mark information in the semantic mark is arranged inside the double-pointed brackets, and the front and back semantic marks are arranged outside the marked text, namely < < mark information > > text < < mark information > >.
A method for interactive annotation of legal contracts based on a contract text markup language, as shown in fig. 1, the method comprising:
(1) selecting a legal constituent factor in a legal contract text by a user, prompting the user to specify or revise the legal property, the legal component and the characteristic attribute of the selected legal constituent factor in an interactive mode, and extracting the meaning explanation of the legal constituent factor in the legal contract text selected by the user;
(2) establishing a Contract Text Markup Language (CTML), interpreting meaning of the legal elements to generate a mark for expressing the Contract Text Markup Language (CTML), wherein the Contract Text Markup Language (CTML) is a computer-processable language which combines normative expression of the legal elements with a contract text, marks and explains the legal elements by marking, shows the legal meaning of document structures, characteristics and words and describes a data processing method, and comprises semantic marks and source marks according to mark types;
the semantic mark is also called a nestable mark or a complex mark and is used for marking legal components, legal attributes and legal components in the document; has indicative function and can provide legal meaning.
The digital source mark is used for identifying basic and non-separable words in the document, the basic words are called interactive data exchangedData, the basic words are commonly used for declaring, filling or selecting by contract parties, and the digital source mark can be used anywhere in the document;
the nested structure may be represented by a hierarchical relationship connector and the numerical source tag may be used anywhere in the text.
And forming a hierarchical semantic annotation according to a formal grammar in a Contract Text Markup Language (CTML). The semantic mark adopts a hierarchical semantic marking method, the hierarchical semantic marking comprises a legal element marking LFM, a legal attribute marking LPM, a legal component marking LCM and an auxiliary mark, and the auxiliary mark comprises a domain mark;
the semantic mark adopts the following format:
< element | Attribute | component | Domain parameter List > > text < < element | Attribute | Domain > >
Corresponding to English expression as
<<factor|property|component|field parameterList>>text<</factor|property|component|field>>
The factor, property and field are reserved words for representing elements, attributes and domains, the component is a specific legal component, the parameter list represents a parameter list, has different definitions according to the mark type, and the text represents a marked text.
The nested structure of the invention refers to the inclusion relationship between elements represented by identifiers or names composed of element identifiers, attributes and components through a hierarchical relationship connector.
The source tag of the invention adopts the following format:
number source flag [ { [ element identification ] @ interactive data identification [% type ] (# selection mode ═ alternative data) + } >
Corresponding to English expression as
metadataExpression::=<{[factorID]@exchangedDataID[%type](#option=value)+}>
Wherein, factoriD represents the element identification of the outmost layer in the hierarchy to which the interactive data belongs, exchangedDataID represents the unique identification of the interactive data, option represents the selection mode of the interactive data, which can include single selection single option, multiple selection multiple option, external input import, trigger, allocation allocate and the like, and the collection of the selection mode represents that option set is
optionSet::={singleOption|multiOption,import,trigger,allocate}
Wherein, value represents alternative data or value of the contract text, type represents type of the interactive data, and can be divided into dataType and ownership type, external input import represents that user incoming data can be received, trigger represents that external event can be received, and allocate represents that user-defined complex type data can be received.
(3) And adding the semantic mark and the user-oriented customized data source mark into the contract text to realize the generation of the contract text with legal element marks.
Further, the method further comprises the step (4): and constructing an exchange tag data table EMD, namely extracting, recording and setting necessary attributes of the labeled interactive data in a Contract Text Markup Language (CTML) by adopting an exchangeable table structure, and constraining and limiting the user customizable data in the contract text by setting the necessary attributes, so that the separation of contract semantics and user data is realized, and a basis is provided for contract negotiation between parties and executed data interaction.
Further, in step (2), the legal element tag LFM belongs to a primary tagging function, and is used for providing extraction, identification and element formalization representation factor expression for a contract legal constituent element (element factor) specified by law, and the tagging process is described as follows:
after a user selects a text needing to be labeled with a legal component factor and specifies a factor set of a factor category to which the text belongs, the user obtains an automatically generated factor identification (factor ID) for use or revision;
prompting a user to select or input the feature attribute and the feature value of the element to be labeled from the text of the legal constituent element to be labeled; the element can have a plurality of characteristics according to different element categories;
adding the element type factorSet, the element identification factoriD feature attribute and the feature value of the text to the outer side of the selected marked element text according to element formal expression factor expression, and realizing the normalization of formal expression and meaning expression of legal elements;
wherein the element class factorSet includes: title, party, asset, term, contract, additional information addition; the parties are divided into individual party (called party for short) and group party (called group or organization for short), and the terms are divided into general term genTerm, default term breTerm and arbitration term arbiTerm.
Contractual law components (component factor) refer to textual information such as party name or name and address, bid amount, quantity, quality, price or consideration, term of fulfillment, place and mode, liability to default, method of dispute resolution, etc.
Contractual legal property (property) refers to a basic property in a legal component, and includes text information such as property and right of an asset, behavior of contract terms, preconditions, accompanying conditions, postconditions, violations, disputes, institutions, and signatures in contract establishment.
The contract law component (component) refers to the definition information related to the legal attribute expression, and includes the action time, the time predicate and the boundary predicate in the time expression, and the text information such as the storage action, the retrieval action, the transfer action and the asset expression in the asset operation.
The feature attribute in the invention is used for describing the characteristics of the text content different from other meaning expressions, and is the minimum element expressed by legal elements, attributes and components.
The element formalization representation of the legal element labeling LFM defines factorExpression as:
the element formalization indicates that the English format corresponds to an English format of ═ element category @ element identification (# characteristic ═ characteristic value) + > > text </element >:
factorExpression::=<<factor factorSet@factorID(#attribute=value)+>>text<</factor>>
wherein, the factor is a reserved word, the factorID represents an element identifier, the attribute represents legal characteristics of the element, the value represents a characteristic value, the factorSet represents a specific element type of the element, and the element type is represented in a set form as follows:
element type: { title, party | group, asset, general term | default term | arbitration term, contract establishment, additional information }
Corresponding to English expression as
factorSet::={title,party|group,asset,genTerm|breTerm|arbiTerm,conclusion,addition}
The symbol | is used to divide different types of the same element.
Further, in the step (2), the law attribute labeling LPM belongs to a secondary labeling function, and is configured to extract and identify a law attribute (attribute for short) under a law component factor, and formalize the law attribute to represent a propertyExpression, where the labeling process is as follows:
after a user selects a text to be labeled with legal property from the text labeled with the LFM and having the factor identification factor ID and specifies the property type property set to which the text belongs, the user obtains the automatically generated property identification property ID for use or revision;
prompting a user to select or input the feature attribute and the feature value of the legal property to be labeled from the text; the system can have a plurality of characteristics according to different attribute categories;
adding element identification factor, attribute type set, attribute identification propertyID, feature attribute and feature value thereof to the outer side of the selected labeling attribute text according to legal attribute formalization representation propertyExpression;
wherein the property class propertySet includes: information info, ownership right, action, preCondition, adjoint condition, postCondition, violation against, dispute controlverse, organization institution, signature.
The legal attribute formalization representation propertyExpression in the legal attribute labeling LPM is defined as:
the legal attribute formalization represents that ═ attribute [ element identification ] attribute category [ @ attribute identification ] (# feature ═ feature value) + > > text </attribute > >
Corresponding to English format:
propertyExpression::=<<property[factorID.]propertySet[@propertyID]
(#attribute=value)+>>text<</property>>
wherein, property is a reserved word, property set represents a specific legal property of the property, and property id represents a property identifier.
If the LPM is nested into a primary annotation, the element identification (i.e., factoriD.) may be omitted. When the number of times the element identification corresponds to the property is one and only one, the @ property identification (i.e., @ propertyID) may be omitted.
The legal attribute labeling LPM can adopt a collective form to represent the legal attributes as follows:
attribute type: { information, rights, behavior, preconditions, adjoint conditions, postconditions, violation, dispute, mechanism }
Corresponding to English expression as
propertySet::={info,right,action,preCondition,adjCondition,postcondition,against,controversy,institution,signature}
Further, in the step (2), the legal component labeling LCM belongs to a three-level labeling function, and is used for providing a nested extraction and identification function for legal components under specific legal constituents, legal attributes and legal components (component for short), wherein other legal component labels can be nested in the legal component labels, so as to form a legal component formal representation component expression; the labeling process is as follows:
after a user selects a text needing to be labeled with a legal component from a text with a property identification propertyID after labeling a legal property LPM, the user obtains the type selection of a component labeling method to which the legal component belongs, and the selectable types of the component labeling method to which the legal component belongs comprise a time representation labeling function, an asset operation labeling function and an asset representation labeling function;
prompting a user to select or input a feature attribute and a feature value which are to be labeled in a required labeling legal component; wherein, according to different legal components, the user can be allowed to label different characteristics;
and adding the legal component, the characteristic attribute and the characteristic value thereof to which the text belongs to the outer side of the selected marking attribute text according to the legal component formalization representation component.
The legal component formalization in the legal component labeling LCM represents that the componentExpression is defined as:
legal composition formalization means that ═ composition (# characteristic ═ characteristic value) + > > text </composition > >
Corresponding to English format:
componentExpression::=<<component(#attribute=value)+>>text<</component>>
where a component represents a specific legal component specified from the aforementioned set of components.
The legal component labeling LCM can adopt a collective form to represent the legal components as follows:
composition: { action time, time predicate, boundary predicate, store action | retrieve action | transfer action, asset expression }
Corresponding to English expression as
component::={actionTime,timePredicate,rangePredicate,
deposit|withdraw|transfer,assetExpression}
Further, in the step (2), the domain mark belongs to an auxiliary labeling function, and is used for providing an extracting and identifying function for describing a characteristic field and a characteristic value possessed under a legal component or a legal attribute, and forming a domain information formalized representation field expression; the labeling process is as follows:
the user selects a text needing to be labeled with domain information from the text with the element identification factoriD after the legal element is labeled with the LFM or the text with the attribute identification propertyID after the legal attribute is labeled with the LPM, and the user obtains the automatically generated domain identification fieldID for use or revision;
prompting a user to select or input the type, the threshold value quality and the threshold value of the characteristic to be labeled of the domain information to be labeled, the element identification factor and the attribute type propertySet to which the characteristic belongs; and adding the type, the quality and the value of the domain value of the characteristic to be labeled of the domain information to be labeled, the factor ID and the property type propertySet of the characteristic to be labeled to the outer side of the selected labeling text according to the domain information formalization representation field expression.
The formal representation of the domain information in the domain tag defines fieldexpress as:
the domain information formalized representation is ═ domain [ element identification. attribute set ] @ domain identification [% type ]
The [ # domain value is a value ] > > text < > > > >, and
corresponding to English format:
fieldExpression::=<<field[factorID.propertySet]@fieldID[%type]
[#quantity=value]>>text<</field>>
wherein the threshold value qualification is a reserved word.
The dataType in the invention represents the data type of the attribute, the data type is divided into character string, currency, date, integer, floating point number, Boolean value and time, and the dataType is represented by English set as follows:
dataType::={String,Money,Date,Integer,Float,Boolean,Time}。
the contract text markup language CTML does not limit the language type, so only English is shown on the following formula, and the corresponding Chinese references will be explained in the explanation.
The invention can realize the labeling of legal constituent elements in a legal contract text by adopting the Contract Text Markup Language (CTML) according to a hierarchical labeling method, and a labeling element relation diagram is shown as a figure 2 and comprises the following steps:
-a contract title (title) for describing the related information of the legal contract document and its label, comprising: the specific format of the contract identifier (titleID) and the contract serial number (serial number) is as follows:
<<factor title@titleID[#serialNumber=value]>>text<</factor>>
the title is a title reserved word, the title ID represents a contract identifier, the serial number represents a characteristic keyword, the value represents a contract serial number, and the serial number can be defaulted.
The party tag is used for declaring party related information, including identity information, contact information and the like unique to the party. The parties are divided into individual party and group party groups, and the labels distinguish all the declared parties through party identifiers, and the format is as follows:
● Individual principal (party):
<<factor party@partyID>>text<</factor>>
● group party (group):
<<factor group@groupID>>text<</factor>>
wherein, party reserves words for individual parties, part ID identifies for individual parties, group reserves words for group parties, and group ID identifies for group parties.
Object (asset) tags are used to describe objects to which party rights and obligations point together and their ownership, objects being called assets, generally divided into things, behaviors, intellectual achievements, etc. The annotation distinguishes the declared asset by an asset identifier, in the format:
<<factor asset@assetID>>text<</factor>>
wherein, asset is an asset reserved word, and asset ID is an asset identifier.
The marking information when declaring the assets comprises asset attribute information (info) and asset attribute information (right) related to the contracts, wherein the asset attribute information and the asset attribute information both belong to the legal attributes under the asset elements, and the format is as follows:
● asset attribute information (info):
<<property[assetID.]info>>text<</property>>
● asset ownership information (right):
<<property[assetID.]right>>text<</property>>
wherein info is an asset attribute information reserved word, right is an asset attribute information reserved word, and assetID is a designated asset identifier.
In the mark texts of the asset attribute information info and the asset right information right, the domain mark is used to mark the owned characteristics and values. The property Right information should use a Right type (rightType) when marking the type, and the Right type is divided into a use Right (useRight), an ownership (owershipr Right), a possessiveness (possessRight), an earning Right (usufructRight), a distribution Right (dispeseright), and a proprietary Right (Right) allowing a user to customize under a specific attribute, and is expressed as follows by using an english set:
rightType::={Right,useRight,ownershipRight,possessRight,usufructRight,disposeRight}。
if the subject matter attributes are described in a non-centralized manner, the following form may be used:
<<field assetID.info@fieldID%dataType>>text<</field>>
if the subject matter rights are described in a non-centralized manner, the following form may be used:
<<field assetID.right@fieldID%rightType>>text<</field>>
term flags the declaration of rights and obligations for a party, which can be subdivided into general terms (genTerm), default terms (breTerm) and arbitration terms (arbiTerm), belongs to the legal essentials.
General terms (geneterm) are postconditions for parties to agree on rights and obligations, terms execution conditions, asset transfers, and terms execution should be satisfied after execution. If the general clauses have a hierarchical structure of upper and lower levels logically, the corresponding lower clauses are called sub clauses, the sub clauses are declared in the feature terms, the sub clause identifiers need to be distinguished by a hierarchical naming method, and the format is as follows:
<<factor genTerm@termID[#terms={(termID)+}]>>text<</factor>>
wherein, genTerm is a general term reserved word, termID is a term identifier, terms is a characteristic keyword, a sub term identifier set is arranged behind an equal number, and a plurality of sub terms can be provided.
In the contract text markup language CTML, a general term genTerm marks a term composition with four types of attributes of action (action), preCondition (preCondition), adjoint (adjCondition), and postCondition (postCondition):
● action (action): the specific action of marking the terms, under the general term genTerm, needs to mark the corresponding behavior attributes of the terms
<<property[termID.]action@actionID#party=(partyID|groupID)
#duty=Duty>>text<</property>>
Wherein termID is corresponding clause identification, action is behavior reserved word, action ID is behavior identification, party characteristic is used for identifying clause party, duty characteristic is used for identifying rights and obligations of the party, and the two must be clearly specified in the mark.
The rights and obligations of the invention include rights restriction (can), due restriction (must) and forbidden restriction (cannot), expressed in aggregate form Duty:
Duty::={can,must,cannot}
● preCondition: marking requirements to be met before clause execution, i.e. clause execution conditions
<<property[termID.]preCondition[@preConditionID]>>text<</property>>
Wherein termID is corresponding clause identification, preCondition is a preCondition reservation word, and preCondition ID is used as preCondition identification when the preCondition mark corresponding to termID clause exceeds one position.
● accompanying condition (adjCondition): asset manipulation attendant during execution of flagged clauses
<<property[termID.]adjCondition[@adjConditionID]>>text<</property>>
Wherein, termID is corresponding clause identification, adjCondition is an accompanying condition reservation word, and adjConditionID is used as an accompanying condition identification when the accompanying condition mark corresponding to termID clause exceeds one point.
● postCondition (postCondition): marking conditions that should be satisfied after execution of a clause
<<property[termID.]postCondition[@postConditionID]>>text<</property>>
Wherein termID is corresponding clause identification, postCondition is a postCondition reserved word, and postConditionID is used as postCondition identification when postCondition marks corresponding to termID clauses exceed one place.
The default terms (breTerm) refer to the legal obligations that both parties agree to assume when they do not fulfill the obligations specified in the contract or when the fulfillment obligations do not comply with the contract. When sub-terms exist in the default terms, the statements are still made in the features terms, and the format is as follows:
<<factor breTerm@breTermID[#terms={(termID)+}]>>text<</factor>>
wherein breTerm reserves words for default terms, and breTermID is default term identification.
In the legal contract text, the default terms need to indicate which term or terms this clause is directed to, it is marked with a violation (against) attribute in the contract text markup language, and the term is declared in the features terms of the attribute:
● violation (against):
<<property[breTermID.]against@againstID#terms={(termID)+}>>text<</property>>
wherein breTermID is default clause identification, against is violation reservation word, againstID is violation identification, terms is feature keyword, and the equal sign is followed by the corresponding clause identification set, and the feature can not be defaulted.
Meanwhile, the default clause elements also have four types of attributes including action, preCondition, accompanying condition adjCondition and postCondition, and the action must be labeled.
Arbitration terms (arbiTerm) are used to specify the method of resolving disputes and may specify an arbitration mechanism, the format being defined as follows:
<<factor arbiTerm@termID>>text<</factor>>
wherein arbiTerm reserves words for arbitration terms and termID identifies terms. Under the arbitration terms element, its defined disputes (controlsys) and organizations (institutions) are marked by the fact that the contract states:
● dispute (controlversy):
<<property[termID.]controversy[@controveryID]>>text<</property>>
wherein, termID is corresponding element identification, controlverse is dispute reserved word, controlverseID is dispute identification when dispute mark corresponding to termID clause exceeds one.
● mechanism (organization):
<<property[termID.]institution[@institutionID]>>text<</property>>
wherein, termID is corresponding element identification, the institute is a dispute reserved word, and the institute ID is used as organization identification when the organization mark corresponding to the termID clause exceeds one place.
The additional information (addition) flag may give a definition of other supplemental conditions, the flag format being as follows:
<<factor addition@additionID>>text<</factor>>
wherein, the addition is an additional information reserved word, and the addition ID is an additional information identifier. The additional information should be marked with the property and value of the domain mark.
The contract (contract) mark refers to the contract summary of all parties, which means that contracting parties mutually mean and reach a consensus to form a contract, and the mark format is as follows:
<<factor conclusion@conclusionID>>text<</factor>>
wherein, the conclusion is a contract reservation word, and the conclusionID is a contract reservation identification.
The signature (signature) of the party is also included under the contract-making element, and represents that the two parties agree with the statement related to the contract and sign the contract, and the format is as follows:
<<property[conclusionID.]signature[@signatureID]
#party=(partyID|groupID)>>text<</property>>
wherein, the connusidentid is a corresponding subscription identifier, the signature is a signature reserved word, and the signature id is a signature identifier. party is a feature keyword, and the feature value is the identifier of the party corresponding to the signature.
Further, in the establishment of the clauses, the extraction and identification of time and asset operation information are the basis of the computer processing of the clauses, which are classified as legal components in the present invention, and the legal component labeling LCM includes labeling functions of time representation, asset operation and asset representation;
the time representation marking function is the capability of extracting and representing time information in a contract and is divided into three marking functions of behavior time actionTime, time predicate timesredicate and boundary predicate rangePredicate;
the asset operation marking function is the capability of providing extraction and representation for the transfer action of the object to be marked between different parties in the contract and is divided into three marking functions of storing action default, retrieving action withdraw and transfer action;
the asset representation annotation function is used for providing the capability of extracting and representing the asset information related to the asset operation in the contract.
Further, the action time actionTime annotation process includes: the method comprises the steps that a user selects a behavior time expression text of a party for completing a specific action in pre-condition preCondition and post-condition postCondition labeled texts, embeds the behavior time expression text into the pre-condition preCondition and post-condition postCondition labeled texts, prompts the user to select or input a specific action to be labeled and an action party for executing the action from the behavior time expression texts, and obtains a selection prompt of a time limit, and after the user determines that any one of three options, namely all, a presence of a game and the current this in the party performs the action time article, the specific action, the action party for executing the action and the time limit specified by the user are added to the outer side of the behavior time expression text according to the behavior time expression;
the time predicate timesredicate labeling process comprises the following steps: selecting a time predicate expression text representing the relation between a certain target time and a certain reference time from pre-condition preCondition and post-condition postCondition annotation texts in a nested mode by a user, prompting the user to select or input a reference time baseTime, a time direction prepose, a target time targetTime and a judgment word judge to be labeled from the time predicate expression text, and adding the reference time baseTime, the time direction prepose, the target time targetTime and the judgment word judge specified by the user to the outer side of the time predicate expression text according to time predicate expression;
the boundary predicate rangePredicate labeling flow comprises the following steps: the method comprises the steps that a user selects boundary predicate expression texts which represent a certain range before and after a certain reference time from pre-condition preCondition and post-condition postCondition annotation texts in a nested mode, the user is prompted to select or input a reference time baseTime, a time direction prepos, a boundary judgment word within and a time range boundary which are to be annotated from the boundary predicate expression texts, and the reference time baseTime, the time direction prepos, the boundary judgment word within and the time range boundary which are specified by the user are added to the outer side of the boundary predicate expression texts according to boundary predicate expression.
The storage action position marking process comprises the following steps: selecting a storage behavior expression text of an asset actively stored by a party from an accompanying condition adjCondition marking text by a user in a nesting mode, calling an asset representation marking function to generate an asset expression assetExpression of the stored asset, prompting the user to select or input a value requirement condition to be marked from the storage behavior expression text, and adding the asset expression assetExpression and the marked value requirement condition specified by the user to the outer side of the selected storage behavior expression text according to a storage action representation depsitExpression;
the retrieving action withdraw labeling process comprises the following steps: the method comprises the steps that a user selects a retrieval behavior expression text for retrieving assets by a principal in an accompanying condition adjCondition annotation text in a nested mode, an asset representation annotation function is called to generate an asset expression AssetExpression for retrieving the assets, and the asset expression AssetExpression specified by the user is added to the outer side of the selected retrieval behavior expression text according to a retrieval action representation within a world;
the transfer labeling process of the transfer action comprises the following steps: the user selects a transfer behavior expression text for transferring the asset to a specific target party in the accompanying condition adjCondition annotation text in a nested mode, invokes an asset representation annotation function to generate an asset expression AssetExpression for transferring the asset, prompts the user to select or input a target party target to be annotated from the transfer behavior expression text, and adds the asset expression AssetExpression and the target party target specified by the user to the outer side of the selected transfer behavior expression text according to the transfer action representation transferExpression.
The annotation process of the asset representation annotation function comprises the following steps: selecting an expression text of asset information related to asset operation in a behavior expression text in an asset operation labeling function by a user in a nested mode, and prompting the user to select or input an asset identification (assetID) to be labeled, a ownership type assetRight of a transferred asset and a transfer asset quantity (amount); and adding the asset identification assetID, the ownership type assetRight and the transferred asset number amount, which are specified by the user, to the outer side of the selected expression text according to an asset expression assetExpression, wherein the selectable ownership types assetRight comprise ownership ownershirpRight, usage right, occupational right, income right, useFructRight and division right disposeRight.
The time expression used by the above-mentioned time representation annotation function is described as follows:
● action time (action): behavioral time representation of a principal completing an action
actionTimeExpression::=<<actionTime[#limit=(all|some|this)]#party=(partyID|groupID)
#action=actionID>>text<</actionTime>>
Wherein actionTime is a behavior time reserved word; the limit feature is a limiting article, and any (all), existing (some) and current (this) limiting information is identified after the equal sign; the party feature is used to specify a party identification; action features are used to specify behavior identification.
● time predicate (timePredicate): relation between a target time and a reference time
timePredicateExpression::=<<timePredicate[#targetTime=timeID][#judge=(is|isn't)]
[#prep=(before|after)]#baseTime=timePoint>>text<</timePredicate>>
Wherein, the timePage is a time predicate reserved word; the targetTime is a target time feature keyword, and the timeID is a designated time identifier; judge represents judging the word characteristic, appoint whether (is) is (isn't) after waiting for a sign; the prep feature is used for specifying a preposition in the time direction, and the feature value can be selected from before time (before) and after time (after); baseTime is a reference time key, and its characteristic values may include time type (Date), current time (now) and action time (action), which are represented by the set timePoint, that is:
timepoint::={Date,now,actionTime}
● boundary predicate (rangePredicate): a certain range before and after a certain reference time
rangePredicateExpress::=<<rangePredicate[#within=(true|false)]#boundary=TIME
[#prep=(before|after)]#baseTime=timePoint>>text<</rangePredicate>>
Wherein, the rangePredicate is a reserved word of the boundary predicate; the within feature represents a boundary decision, the feature value is specified as a boolean value, the boundary feature represents a TIME range, and the feature value is specified as a TIME Type (TIME).
The time expression used by the asset operation tagging function is described as follows:
● deposit action (dispose): principal logging asset
depositExpression::=<<deposit[#condition=STRING]>>text<</deposit>>
The default is a stored action reserved word, the condition feature represents a value requirement, and the feature value is specified as a character STRING Type (STRING).
● retrieve action (withdraw): principal retrieval asset
withdrawExpression::=<<withdraw>>text<</withdraw>>
Wherein the withdraw reserves words for retrieving actions.
● transfer action (transfer): transferring assets to a particular target party
transferExpression::=<<transfer#target=partyID>>text<</transfer>>
Wherein transfer is a transfer action reserved word, target is a feature keyword, and the feature value partyID is a party identifier.
The asset expression (asseteexpress) is used for calling the asset in the clause, and the features include a specific number (around), a specific weight (assetRight), and a specified asset (assetTarget), and the format is as follows:
assetExpression::=<<assetExpression[#amount=(FLOAT|INT)][#assetRight=fieldID]
#assetTarget=assetID>>text<</assetExpression>>
the asset expression is an asset expression reserved word, the amount, the assetRight and the assetTarget are feature keywords, the corresponding feature value of the amount is a floating point type (FLOAT) or an integer type (INT), the feature value is 1 under the default condition, the corresponding feature value of the assetRight is a target object weight, the weight of the contract handling object is an asset weight under the default condition, the corresponding feature value of the assetTarget is an asset identifier, and the default is not allowed when the attribute is declared.
Further, constructing an exchange tag data table EMD refers to a user interaction process for extracting and recording a number source tag metadataExpression and number source tag feature information from a Contract Text Markup Language (CTML) annotation contract text according to a structured table form, and includes two processes of number source tag generation and exchange tag data table EMD generation:
the number source tag metadataExpression generation process refers to: selecting the position of interactive data to be marked in a text or selecting the text where the interactive data to be marked is located by a user, and specifying the type to which the interactive data belongs; a user obtains an interactive data identifier exchangedDataID which automatically generates an element identifier factorID containing an element where a text is located for use or revision, and then prompts the user to select, confirm and set a selection mode option of the interactive data exchangedData according to a type to which the interactive data belongs, wherein the selection mode for selection comprises single selection, multiple option, external input import, trigger and allocate allocates, and the interactive data identifier exchangedDataID, the type to which the interactive data belongs and the selection mode option specified by the user are filled into a specified position of the user or replace the text where the selected interactive data to be labeled is located according to a number source label metadataExpression;
the exchange label data table EMD generation process refers to: extracting the number source marks from the marked contract text one by one in a traversal mode, generating a record named by the interactive data identifier exchangedDataID in the number source mark in an exchange mark data table EMD for each number source mark, prompting a user to confirm, refer or limit the necessary attributes of the interactive data in the record in a selection or editing mode, and writing the necessary attributes of the interactive data specified by the user into the corresponding record of the exchange mark data table EMD; the necessary attributes of the interactive data that can be selected include: the method comprises the steps of using a method use, a source identifier ctmlID, a target conversion contract corresponding identifier targetID after a legal contract text is converted into a target intelligent contract, a type, a default value, a definite value condition and an interactive data value filled in an interactive process; the necessary attributes of the interaction data refer to specific properties or relations which constrain and define the interaction data of the parties in the contract negotiation and execution process.
The necessary attributes of the interactive data include, but are not limited to, the above attributes, which are represented by a set
The necessary attribute of the interactive data is { using method, source identification, identification corresponding to the target conversion contract, type, default value, definite value condition, interactive data value filled in the interactive process }
The corresponding English format is:
dataEssentialAttribute::={usage,ctmlID,targetID,type,defaultValue,condition,value}
in the necessary attributes of the interactive data, the usage method use attribute is used for constraining the operation state of the interactive data, the constraint time period of the constraint is divided into three stages before contract signing, during contract signing and after contract signing, the usage state authority of each stage is divided into two types of readable and writable, and the constraint of the writable authority is divided into two types of necessary filling and optional filling.
The usage method use can be represented by six characters, and each two characters are a group and respectively sign state authority constraints on variables for the front stage, the middle stage and the rear stage of contract. The user use state authority is divided into two types of R reading and W writing, and two types of constraints on the writing authority, namely C filling and U filling, are not required, and the logical relationship is as follows:
1) r indicates that the variable is only readable;
2) w indicates that this variable is writable;
3) c and W are used together to form WC, and the variable must be filled;
4) u and W are used together to form WU, which means that the variable is not required to be filled in;
5) when the variable is in a readable state, the permission constraint representation is denoted by "_".
The value-determining condition is used for limiting the value domain range of the interactive data and constraining the constraint conditions of the filling person, wherein if the number source mark contains a single-choice single-option characteristic, the corresponding interactive data is indicated to be a single-choice, and the value range is recorded; if the multi-option characteristics are contained, the corresponding interactive data are indicated to be multi-options, and the value range is recorded at the same time; if the corresponding interaction data needs to be filled in by the specified party, the corresponding party identification partyID is recorded.
The target conversion contract corresponding identification targetID after the legal contract text is converted into the target intelligent contract refers to a variable name used in the target contract, wherein the target contract refers to a high-level intelligent contract or an intelligent contract converted by a marked contract and is usually written by a computer executable language.
The invention has the beneficial technical effects
In the method provided by the invention, a Contract Text Markup Language (CTML) and an exchange markup data table (EMD) are adopted, wherein the Contract Text Markup Language (CTML) clarifies the basic legal elements and semantics in the legal contract text through a labeling form, and formally represents the legal elements, the legal attributes and the legal components from coarse to fine through a hierarchical structure, so that the ambiguity caused by the natural language used by the contract text can be effectively avoided, the accurate understanding of the legal contract text by personnel in different fields is facilitated, and the further analysis and application of the legal contract document are facilitated.
The method is characterized in that semantic marks and source marks are distinguished from text elements in a Common Text Mark Language (CTML), the semantic information of a legal contract is represented by the semantic marks and a contract example is formed, the semantic information is further converted into an intelligent legal contract program, a universal intelligent contract program and codes, the text elements (called interactive data) which bring about contract personalization due to different contract parties or target objects are marked by the text elements, and the label information is extracted to form an exchange mark data table EMD, so that the code and data separation principle provided by the invention is realized, and the universal development and personalized application of the intelligent contract and the efficient processing and style personalized display (such as single-choice and multi-choice style customization of the interactive data) of the computer are facilitated.
The invention provides a method for accurately describing and explaining legal elements and meaning expressions in a legal contract by adopting a formal language labeling mode, which removes the ambiguity of natural language expression and achieves the aims of effectively identifying and understanding legal contents without deviation by people in different fields.
The Contract Text Markup Language (CTML) provided by the invention adopts a structured representation form based on hierarchical annotation, and extracts legal key information such as legal constituent elements, attributes, components, characteristics and the like in a document according to the current legal requirements, thereby being beneficial to understanding contract semantics and normalized processing by adopting a computer and other digital means.
The formatted legal contract file generated by the contract text markup language CTML provided by the invention is convenient to be converted into other formatted files, for example, the formatted legal contract file is converted into an intelligent legal contract SLC program and a hypertext markup language HTML file, and the formatted legal contract file is rendered into a visual webpage; and the document is converted into a PDF document which is conveniently printed as a written paper contract.
The method of the invention allows programmers to embed scripting languages, such as JavaScript, SPESC, ASP, JSP, PHP, Perl languages, in the Contract Text Markup Language (CTML), thereby guiding computers to automatically fulfill contract clauses and also being used for creating interactive forms.
The method of the invention allows the client as a contract party to confirm, refer in particular or limit the interactive data according to the limiting information by introducing the limiting information such as the type, the constraint, the permission and the like of the interactive data into the exchange mark data table EMD;
the method of the invention supports the use of the exchange mark data table EMD to generate the 'contract instance' customized by the user in an interactive mode, and further realizes the customized development of the intelligent contract program, thereby supporting the contract negotiation, subscription and execution between the client and the server based on the agreement and commitment, and finally realizing the high-efficiency, automatic and customizable intelligent contract generation method and system.
Drawings
FIG. 1 is a schematic diagram of an interactive labeling method for legal contracts based on a contract text markup language according to an embodiment of the present invention;
FIG. 2 is a diagram of element relationships in the contract text markup language in an embodiment of the present invention.
Detailed Description
In order to make the objects, technical solutions and advantages of the present invention more apparent, the present invention is described in further detail below with reference to the accompanying drawings and embodiments. It should be understood that the specific embodiments described herein are merely illustrative of the invention and are not intended to limit the invention.
On the contrary, the invention is intended to cover alternatives, modifications, equivalents and alternatives which may be included within the spirit and scope of the invention as defined by the appended claims. Furthermore, in the following detailed description of the present invention, certain specific details are set forth in order to provide a better understanding of the present invention. It will be apparent to one skilled in the art that the present invention may be practiced without these specific details.
In this embodiment, a simple house buying and selling contract is taken as an example to demonstrate how to use the contract text markup language to realize the legal contract markup and the effect thereof, so as to realize the intelligent contract generating method, namely from the legal contract text → the legal contract text labeled by the contract text markup language (CTML → exchange markup data table → the intelligent legal contract program generated by the labeled legal contract text.
In this embodiment, a legal contract text is marked, and a corresponding exchange mark data table is extracted and translated into a corresponding intelligent legal contract: the buyer and seller of the contract party conduct purchase transaction to the house, and the specific contract content, the marking result, the exchange marking data table and the intelligent legal contract are introduced one by one later.
First, legal contract text
Contract for buying and selling houses
A seller: __________________ (hereinafter referred to as Jiafang)
The buyer: ___________________ (hereinafter referred to as second square)
One), for the relevant matters of house buying and selling, the contract is made through the negotiation of both parties as follows: party A volunteers all the following houses to Party B:
1. the house condition is as follows: (please fill in according to the title of the House)
Use of total floor building area (square) of building structure of number room number set (room) number of house setting
2. The land use right of the house is obtained in the manner of "√": let () transfer ()
Second), the trade price of both parties is ________ yuan (capital) for Renminbi (_____ Bai _____ Jie _____ Wan _____ Qian _____ Bai _____ Yuan.
The second party pays off _____ times before _____, _____ in __________ years in a payment mode:
third), party a delivers the house to party b on the basis of _____ months _____ in __________. The land use rights are simultaneously transferred within the occupation range of the house.
Fourthly), if the party B can not pay by date, 5 percent of late payment is given to the party A according to the stipulation, and the late payment is obtained according to the stipulation of the contractual law.
Fifthly), the contract is signed by both parties and examined and identified by the real estate transaction administration institute in Jiaxing city, and then takes effect, and has constraint force on both parties and should be strictly fulfilled. If the default is found, the default party carries out the default responsibility, compensates the loss and pays the default cost.
Sixthly), both parties pay taxes and fees according to the national regulations and transact related procedures. Both parties are willing to handle according to the relevant national regulations. If dispute occurs, the two parties negotiate to resolve; the two parties, who did not agree, would like to apply (___________) for arbitration to the arbitration committee.
Seventhly), the contract is in quadruplicate, one copy is respectively used by the first party, the second party and the tax department, and one copy is used by the house management department.
Eight), other matters agreed by both parties: ___________________________________
Party A (signature or stamp) _______________
Signing date: ____ months ____ days ___________
Party b (signature or seal) _______________
Signing date: ____ months ____ days ___________
Second, Contract Text Markup Language (CTML) annotation description
1) Title: the house deal contract, statement identification printerDeal, is defined as follows:
house purchase and sale contract < < factor title @ printerDeal > >
2) The party: including buyers and sellers
The buyer and the seller can declare the contract text by using the party element party at the beginning, and because the name needs to be filled in the example and the place needs to be occupied by the source mark, the corresponding marking result is as follows:
< < factor party @ Seller > > Seller: < { sender @ name% String } > (hereinafter referred to as "A square >) </factor >
< factor party @ layer > Buyer: < { Buyer @ name% String } > (hereinafter referred to as "B-side > < < factor >)
3) The target is as follows: a house
The property information of the house thereof includes "house seating", "building number", "room number", "set (room) number", "building structure", "total floor number", "building area (square)" and "use", which are respectively defined as characteristics under the property information info in the example, identified using the domain flag. Since the above information needs to be filled in by contract parties, all the information needs to be occupied by using a number source mark. The corresponding labeling results are as follows:
Figure BDA0002936077580000181
the house ownership information comprises the land use right of the house, meanwhile, contract parties need to select 'giving a give-out' or 'transferring' for a land use right obtaining mode, a selection result is obtained in a variable mode, and a marking result is as follows:
Figure BDA0002936077580000182
in the above asset tag, the property information info of the House and the property information right of the House are both tagged in a nested form within the asset element asset, indicating all tags to the House asset House.
4) Contract terms:
(1) general clause notation: take the actual contract clause two as an example
The entire clause is first marked with a general clause element. "day … …" refers to the restrictions on the precondition of the terms, including the time direction prepositions as before and the reference time, the numerical value of which determines the need to use the source tag for both parties' interaction and declares it using the domain tag. Meanwhile, the real contract expects the number of times of the period filled by the party as a parameter for executing the action, so the marking result is as follows:
Figure BDA0002936077580000191
the action of the clause is 'pay off', the corresponding clause party is 'second party', and the right obligation related to the clause is judged to be limited from the semantics. Meanwhile, the actual contract requires the party to specify a payment mode and occupy the place with the digital source mark, so the corresponding mark result is as follows:
the utility model discloses a method for removing the impurity from the waste of the utility action, afford # duration, bus # party and property,
the payment method comprises the following steps: < { term1@ paymentMethod } >)
<</factor>>
The preconditions and behavior action attributes of the general term are all nested within the element genTerm.
In this clause, the action "pay-off" relates to the manipulation of the asset where the statement accompanying the condition is located differently from the statement of the other elements of the clause, so the accompanying action of the clause needs to be marked in a hierarchical notation manner. The position is that the price of the commodity is ________ RMB (capital) from _____ Bai _____ to _____ Wan _____ Qian _____ Bai _____ to get yuan. "the asset operation is transfer, the target object is the first party, the concerned asset is the RMB, the concrete amount needs to be negotiated between the parties, so the occupation is carried out by the data source mark, the lower set upper-case amount semantic meaning is the same as the meaning of the element indicated by the data source mark, the contract text mark language CTML is not distinguished, the data source mark is used for replacing uniformly, so the mark result is as follows:
Figure BDA0002936077580000192
Figure BDA0002936077580000201
(2) marking default clauses: take the real contract clause five as an example
First, the entire terms are marked with a default term element. The "not enough to pay by date" pointer marks the result for real contract clause two (term1) as follows:
Figure BDA0002936077580000202
in the above mentioned default clause marking, the clause attributes are all marked in a nested form in the default clause element breTerm, representing all marks to the default clause term 3.
(3) Arbitration clause labeling:
in a real contract, the contracting party is required to specify an arbitration committee as an arbitration mechanism, the input place of the party is occupied by a digital source mark in the mark, the terms do not limit the dispute, the mark can be defaulted, and the mark result is as follows:
Figure BDA0002936077580000211
5) additional information:
in this contract, an additional information statement needs to be made about the "late fund" that it appears in the legal contract clause five, with the following labeling results:
Figure BDA0002936077580000212
6) contract establishment:
the contract making needs to mark the contract making of two parties appearing in the contract, and simultaneously needs to mark the signature position of the parties, the signature position occupies the space through the source mark for later data table extraction, and the corresponding marking result is as follows:
Figure BDA0002936077580000213
third, legal contract text marked by contract text mark language CTML
The complete labeling results for a real contract are as follows:
Figure BDA0002936077580000221
Figure BDA0002936077580000231
Figure BDA0002936077580000241
Figure BDA0002936077580000251
four, exchange tag data table EMD
The contract example written by the contract text markup language CTML comprises two parts of document type description and document content, wherein the document content refers to the contract text marked according to the above hierarchical marking method, and the document type description comprises a language type, a specific document type and a corresponding exchange markup data table EMD.
Specific examples are as follows:
Figure BDA0002936077580000252
the exchange tag data table EMD expressed by the JSON format adopts a hierarchical structure to describe each source tag in the original legal contract.
Examples are as follows:
Figure BDA0002936077580000253
and extracting interaction variables to be filled by a user according to the text contract marked by the Contract Text Markup Language (CTML) to generate a formatted interaction marking data table (EMD). In this embodiment, a Json formatting format is used to label the hierarchical structure of each interactive variable and the corresponding parametric constraints, and the specific rule is as follows:
the single variable adopts the 'index': { parameter bond: the object flag { } includes a plurality of parameter key value pairs defined for the object flag, use ", and is divided by using the unique identifier as an index. The hierarchical structure of the variables is represented in a multi-layer index form, the parent level adopts the unique identifier of the layer as an index and can be nested, and if the sub-layer is a single variable, the sub-layer is put into an object mark { }; if the variables are multiple siblings, they are put in array flag [ ], and used, and "split".
1) The name source to be filled by the Seller party is labeled < { sender @ name% String } >, this variable must be filled before the contract is signed, can be modified during the signing process, but cannot be modified afterwards, and must be filled by the Seller party, and in conclusion, the rule of using the exchange label data table EMD is expressed as:
Figure BDA0002936077580000261
2) the property of the house has a plurality of variables to be interacted, the sitting and the number of the house in the property of the house are extracted, the use right in the property of the house is taken as an example for conversion, and the expression form of a Contract Text Markup Language (CTML) is as follows:
Figure BDA0002936077580000262
the interactive variables are three variables belonging to the same legal element House, and include attribute values < { House @ location } >, < { House @ blockNumber } > and an accessMethod with a feature value of single choice, which need to be determined before signing, the corresponding number sources are marked as < { @ accessMethod% Right # transmitted } and < { @ accessMethod% Right # locked } and { allowable } in an interactive data table EMD, and the interactive variables are expressed as follows:
Figure BDA0002936077580000263
Figure BDA0002936077580000271
■ if value equals transferred, it means yield true (selected state), transfer false;
■ if value equals allocation, it means yield false and transfer true (selected state).
3) In clause 2, the delivery deadline is < { term2@ house Deadline% Date } >, which variable can be filled before contract making, must be filled in during contract making, but is not modifiable thereafter. In summary, the EMD rule is expressed as:
Figure BDA0002936077580000272
for other interaction variables in the document, the rules need to be represented one by one, and the other interaction variables are combined to form an exchange mark data table EMD.
While the foregoing is directed to the preferred embodiment of the present invention, it will be understood by those skilled in the art that various changes and modifications may be made without departing from the spirit and scope of the invention as defined in the appended claims.

Claims (9)

1. A method for interactive annotation of legal contracts based on contract text markup language, the method comprising:
1) selecting a legal constituent factor in a legal contract text by a user, prompting the user to specify or revise the legal property, the legal component and the characteristic attribute of the selected legal constituent factor in an interactive mode, and extracting the meaning explanation of the legal constituent factor in the legal contract text selected by the user;
2) establishing a Contract Text Markup Language (CTML), interpreting the meaning of the legal elements to generate a mark expressed by the Contract Text Markup Language (CTML), wherein the Contract Text Markup Language (CTML) comprises a semantic mark and a source mark according to the mark type;
the semantic mark is also called a nestable mark or a complex mark and is used for marking legal components, legal attributes and legal components in the document;
the digital source mark is used for identifying basic and non-separable words in the document, the basic and non-separable words are called interactive data exchangedData, are used for declaring, filling or selecting by contract parties, and can be used anywhere in the document;
the semantic mark adopts a hierarchical semantic marking method, the hierarchical semantic marking comprises a legal element marking LFM, a legal attribute marking LPM, a legal component marking LCM and an auxiliary mark, and the auxiliary mark comprises a domain mark;
3) adding the semantic mark and the user-oriented customized data-oriented data source mark into a contract text to realize the generation of the contract text with legal element marks;
4) and constructing an exchange tag data table EMD, namely extracting, recording and setting necessary attributes of the labeled interactive data in a Contract Text Markup Language (CTML) by adopting an exchangeable table structure, and constraining and limiting the user customizable data in the contract text by setting the necessary attributes, so that the separation of contract semantics and user data is realized, and a basis is provided for contract negotiation between parties and executed data interaction.
2. The interactive annotation method for legal contracts based on contract text markup language (TEXT) of claim 1, wherein in step 2), the LFM belongs to a primary annotation function for providing extraction, identification and element formalization representation factor expression for legal constituents of contracts stipulated by law, and the annotation process is described as follows:
after a user selects a text needing to be labeled with a legal component factor and specifies an element type factor to which the text belongs, a server generates an element identifier factor ID of the legal component factor and feeds the element identifier factor ID back to the user for use or revision;
the server prompts a user to select or input the characteristic attribute and the characteristic value of the element to be labeled from the text of the legal component element factor to be labeled;
the server adds the element type factorSet, the element identification factoriD feature attribute and the feature value of the text to the outer side of the selected marked element text according to element formal expression factor, so as to realize the normalization of formal expression and meaning expression of the legal elements;
wherein the element class factorSet includes: title, party, asset, term, contract, additional information addition; the parties are divided into individual party and group party, and the terms are divided into general terms genTerm, default terms breTerm and arbitration terms arbiTerm.
3. The interactive labeling method for legal contracts based on contract text markup language (TEXT) as claimed in claim 1, wherein in step 2), the LPM belongs to a secondary labeling function for extracting, identifying and formalizing the legal property under the legal component factor to represent the PropertyExpression, and the labeling process is as follows:
after a user selects a text to be labeled with legal property in the text with the element identification factor ID after the legal element is labeled with the LFM and specifies the property type property set to which the text belongs, the server generates the property identification property ID of the legal property and feeds the property identification property ID back to the user for use or revision;
the server prompts a user to select or input a characteristic attribute and a characteristic value thereof of the legal property to be labeled from the text;
the server adds element identification factor, attribute type set, attribute identification propertyID, feature attribute and feature value thereof to the outer side of the selected labeling attribute text according to legal attribute formalization representation propertyExpression;
wherein the property class propertySet includes: information info, ownership right, action, preCondition, adjoint condition, postCondition, violation against, dispute controlverse, organization institution, signature.
4. The interactive labeling method for legal contracts based on contract text markup language (TEXT) as claimed in claim 1, wherein in step 2), the legal component labeling LCM belongs to a three-level labeling function, which is used to provide nested extraction and recognition function for target legal constituents, legal attributes and legal components under the legal components and form a formal representation of legal components; the labeling process is as follows:
after a user selects a text needing to be labeled with a legal component from a text with a property identification propertyID after labeling a legal property LPM, a server feeds back the type of a component labeling method to which the legal component belongs to the user and prompts the user to select or input a feature attribute and a feature value which are to be labeled from the legal component to be labeled; the selectable categories of the component marking methods to which the legal components belong comprise a time representation marking function, an asset operation marking function and an asset representation marking function;
the server adds the legal component, the feature attribute and the feature value of the legal component to which the text belongs to the outside of the selected annotation attribute text according to the legal component formalization representation component expression.
5. The interactive annotation method for legal contracts based on contract text markup language as claimed in claim 1, wherein in step 2), the domain tag belongs to an auxiliary annotation function, which is used to provide extraction and recognition functions for describing the property field and the property value owned by the legal constituent or the legal attribute, and form a domain information formalized representation field expression; the labeling process is as follows:
the user selects the text of the information of the domain to be labeled from the text with the element identification factor ID after the legal element labeling LFM or the text with the attribute identification propertyID after the legal attribute labeling LPM, and the server generates the domain identification fieldID of the information of the domain to be labeled and feeds the domain identification fieldID back to the user for use or revision;
the server prompts a user to select or input the type, the quality and the value of the domain value of the characteristic to be labeled of the domain information to be labeled, the factor ID to which the characteristic belongs and the property type propertySet; and adding the type, the quality and the value of the domain value of the characteristic to be labeled of the domain information to be labeled, the factor ID and the property type propertySet of the characteristic to be labeled to the outer side of the selected labeling text according to the domain information formalization representation field expression.
6. The interactive labeling method for legal contracts based on contract text markup language (TEXT) as claimed in claim 1 or 4, wherein the legal component labeling LCM comprises labeling functions of time representation, asset operation and asset representation;
the time representation marking function is the capability of extracting and representing time information in a contract and is divided into three marking functions of behavior time actionTime, time predicate timesredicate and boundary predicate rangePredicate;
the asset operation marking function is the capability of providing extraction and representation for the transfer action of the object to be marked between different parties in the contract and is divided into three marking functions of storing action default, retrieving action withdraw and transfer action;
the asset representation annotation function is used for providing the capability of extracting and representing the asset information related to the asset operation in the contract.
7. The interactive annotation method of legal contract based on contract text markup language according to claim 6,
the action time actionTime annotation process comprises the following steps: the method comprises the steps that a user selects an action time expression text of a party completing a target action in pre-condition preCondition and post-condition postCondition labeled texts and embeds the action time expression text into the pre-condition preCondition and post-condition postCondition labeled texts, a server prompts the user to select or input a target action to be labeled and an action party to be executed from the action time expression texts, the server feeds back a selection prompt of a user time limit, and after receiving the party executing the target action and a time article, the server adds the target action, the action party to be executed and the time limit specified by the user to the outer side of the action time expression text according to action time representation TimeExpression;
the time predicate timesredicate labeling process comprises the following steps: selecting a time predicate expression text representing the relation between the target time and the reference time from a preposition annotation text and a postCondition annotation text in a nesting mode by a user, prompting the user to select or input a reference time baseTime, a time direction preposition, a target time targetTime and a judgment word judge to be labeled from the time predicate expression text by a server, and adding the reference time baseTime, the time direction preposition, the target time targetTime and the judgment word judge specified by the user to the outer side of the time predicate expression text according to time predicate expression timepredicateexpress;
the boundary predicate rangePredicate labeling flow comprises the following steps: a user selects boundary predicate expression texts representing a preset range before and after a reference time from a preposition annotation text and a postCondition annotation text in a nesting mode, a server prompts the user to select or input a reference time baseTime, a time direction preposition, a boundary judgment word within and a time range boundary from the boundary predicate expression texts, and adds the reference time baseTime, the time direction preposition, the boundary judgment word within and the time range boundary specified by the user to the outer side of the boundary predicate expression text according to boundary predicate expression;
the storage action position marking process comprises the following steps: the method comprises the steps that a user selects a storage behavior expression text of an asset actively stored by a party from an accompanying condition adjCondition marking text in a nesting mode, a server calls an asset representation marking function to generate an asset expression AssetExpression of the stored asset, then prompts the user to select or input a value requirement condition to be marked from the storage behavior expression text, and adds the asset expression AssetExpression and the marked value requirement condition specified by the user to the outer side of the selected storage behavior expression text according to a storage action representation depsitExpression;
the retrieving action withdraw labeling process comprises the following steps: the method comprises the steps that a user selects a retrieval behavior expression text of an asset retrieval of a principal in an accompanying condition adjCondition annotation text in a nested mode, a server calls an asset representation annotation function to generate an asset expression AssetExpression of the asset retrieval, and the asset expression AssetExpression specified by the user is added to the outer side of the selected retrieval behavior expression text according to a retrieval action representation within a world;
the transfer labeling process of the transfer action comprises the following steps: the method comprises the steps that a user selects a transfer behavior expression text for transferring assets to a target party in an accompanying condition adjCondition annotation text in a nested mode, a server calls an asset representation annotation function to generate an asset expression AssetExpression for transferring the assets, then prompts the user to select or input a target party target to be annotated from the transfer behavior expression text, and adds the asset expression AssetExpression and the target party target specified by the user to the outer side of the selected transfer behavior expression text according to a transfer action representation transfer Expression;
the annotation process of the asset representation annotation function comprises the following steps: selecting an expression text of asset information related to asset operation in a behavior expression text in an asset operation labeling function by a user in a nested mode, and prompting the user to select or input an asset identification (assetID) to be labeled, a ownership type assetRight of a transferred asset and a transfer asset number amount by a server; and adding the asset identification assetID, the ownership type assetRight and the transferred asset number amount, which are specified by the user, to the outer side of the selected expression text according to an asset expression assetExpression, wherein the selectable ownership types assetRight comprise ownership ownershirpRight, usage right, occupational right, income right, useFructRight and division right disposeRight.
8. The method for interactive annotation of legal contracts based on Contract Text Markup Language (CTML) as claimed in claim 1, wherein the step of constructing the EMD is a user interaction process for extracting and recording the metadataExpression and the feature information of the metadatamarkup from the contract text of CTML in a structured table form, and the process comprises two processes of generating the metadatamarkup and generating the EMD:
the number source tag metadataExpression generation process refers to: selecting the position of interactive data to be marked in a text or selecting the text where the interactive data to be marked is located by a user, and specifying the type to which the interactive data belongs; the server generates an interactive data identifier exchangedDataID containing an element identifier factoriD of an element where the text is located, feeds the interactive data identifier exchangedDataID back to a user for use or revision, prompts the user to select, confirm and set a selection mode option of the interactive data exchangedData according to a type to which the interactive data belongs, and fills the interactive data identifier exchangedDataID, the type to which the interactive data belongs and the selection mode option specified by the user into a user-specified position or replaces a text where the selected interactive data to be labeled is located according to a number source label metadataExpression;
the exchange label data table EMD generation process refers to: extracting the number source marks from the marked contract text one by one in a traversal mode, generating a record named by the interactive data identifier exchangedDataID in the number source mark in an exchange mark data table EMD for each number source mark, prompting a user to confirm, refer or limit the necessary attributes of the interactive data in the record in a selection or editing mode, and writing the necessary attributes of the interactive data specified by the user into the corresponding record of the exchange mark data table EMD; the necessary attributes of the interactive data that can be selected include: the method comprises the steps of using the method use, the source identification ctmlID, the target conversion contract corresponding identification targetID after the legal contract text is converted into the target intelligent contract, type, definite value condition and interactive data value filled in the interactive process.
9. The interactive annotation method for legal contracts based on contract text markup language as claimed in claim 8, wherein the source markup essential attributes are:
the use method use attribute is used for constraining the operation state of interactive data, the constraint time period of the constraint is divided into three stages before contract signing, during contract signing and after contract signing, the use state authority of each stage is divided into two types of readable and writable, and the constraint of the writable authority is divided into two types of necessary filling and optional filling;
the value-determining condition is used for limiting the value domain range of the interactive data and constraining the constraint conditions of the filling person, wherein if the number source mark contains a single-choice single-option characteristic, the corresponding interactive data is indicated to be a single-choice, and the value range is recorded; if the multi-option characteristics are contained, the corresponding interactive data are indicated to be multi-options, and the value range is recorded at the same time; if the corresponding interactive data needs to be filled by the appointed party, recording the identifier partyID of the corresponding party;
the corresponding identification targetID refers to a variable name used in a target contract, where the target contract refers to a premium smart contract or a smart contract that is translated by a annotated contract.
CN202110162638.5A 2021-02-05 2021-02-05 Law contract interactive labeling method based on contract text markup language Active CN112883687B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202110162638.5A CN112883687B (en) 2021-02-05 2021-02-05 Law contract interactive labeling method based on contract text markup language

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202110162638.5A CN112883687B (en) 2021-02-05 2021-02-05 Law contract interactive labeling method based on contract text markup language

Publications (2)

Publication Number Publication Date
CN112883687A CN112883687A (en) 2021-06-01
CN112883687B true CN112883687B (en) 2022-03-15

Family

ID=76055875

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202110162638.5A Active CN112883687B (en) 2021-02-05 2021-02-05 Law contract interactive labeling method based on contract text markup language

Country Status (1)

Country Link
CN (1) CN112883687B (en)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113255306B (en) * 2021-07-07 2021-11-19 明品云(北京)数据科技有限公司 Insurance contract auxiliary reading method, insurance contract auxiliary reading system, electronic equipment and medium
CN113761044A (en) * 2021-08-30 2021-12-07 上海快确信息科技有限公司 Labeling system method for labeling text into table

Family Cites Families (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR100671073B1 (en) * 2002-04-29 2007-01-17 콘텐트가드 홀딩즈 인코포레이티드 Rights management system using legality expression language
CN103136317A (en) * 2011-11-29 2013-06-05 北京建龙重工集团有限公司 Implement method of on-line examination and approval informatization of engineering contracts in engineering management system
US20180365201A1 (en) * 2017-06-14 2018-12-20 Clause, Inc. System and method for compound data-driven contracts and documentation
US11474696B2 (en) * 2017-08-29 2022-10-18 Lexisnexis, A Division Of Reed Elsevier Inc. Systems and methods for providing automatic document filling functionality
CN110020414A (en) * 2018-01-08 2019-07-16 优信拍(北京)信息科技有限公司 A kind of electronic contract document creation method, system
JP7203554B2 (en) * 2018-10-04 2023-01-13 Tis株式会社 Information processing equipment
CN109801192A (en) * 2018-12-15 2019-05-24 深圳壹账通智能科技有限公司 Electron contract method, apparatus, computer equipment and storage medium
CN110020424B (en) * 2019-01-04 2023-10-31 创新先进技术有限公司 Contract information extraction method and device and text information extraction method
TWI696970B (en) * 2019-03-19 2020-06-21 彰化商業銀行股份有限公司 Interbank transaction failure noticing system based on blockchain and method thereof
CN110688847A (en) * 2019-08-23 2020-01-14 上海市研发公共服务平台管理中心 Technical contract determination method, device, computer equipment and storage medium
CN111752574B (en) * 2020-05-08 2021-03-30 北京科技大学 Intelligent executable contract construction and execution method and system of legal contract

Also Published As

Publication number Publication date
CN112883687A (en) 2021-06-01

Similar Documents

Publication Publication Date Title
Prasad et al. The Penn Discourse Treebank: An annotated corpus of discourse relations
Chitchyan et al. Semantics-based composition for aspect-oriented requirements engineering
US20160283228A1 (en) Integrated cloud platform translation system
CN112883687B (en) Law contract interactive labeling method based on contract text markup language
EP2367132A2 (en) Automated rules-based rights resolution
US20030046093A1 (en) Rights management
Du et al. Softcite dataset: A dataset of software mentions in biomedical and economic research publications
US10216733B2 (en) Smart commenting software
Steen et al. ODP enterprise viewpoint specification
EP3759656A1 (en) System and method for a thing machine to perform models
Roegiest et al. A dataset and an examination of identifying passages for due diligence
Candaş et al. Automated identification of vagueness in the FIDIC Silver Book conditions of contract
Funaki et al. A contract corpus for recognizing rights and obligations
CN113448592B (en) Intelligent legal contract generation method based on contract text markup language
An et al. Translation strategy of legal terms with Chinese characteristics in Civil Code of the People’s Republic of China based on Skopos theory
Frank et al. Contingent level classes: motivation, conceptualization, modeling guidelines, and implications for model management
Loddo et al. Intersemiotic translation of contracts into digital environments
Montrieux et al. Challenges in model-based evolution and merging of access control policies
Qamar et al. Unambiguous data modeling to ensure higher accuracy term binding to clinical terminologies
CN110796573A (en) Intellectual property life cycle management method, system and storage medium
Molina et al. An MDE modeling framework for measurable goal‐oriented requirements
Zhou et al. Auto-generation of class diagram from free-text functional specifications and domain ontology
Meloche et al. Towards legal contract formalization with controlled natural language templates
Alam et al. Blockchain domain-specific languages: survey, classification, and comparison
Lee et al. AI and global EDI

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant