CN116227487A - Legal text risk point intelligent auditing system - Google Patents

Legal text risk point intelligent auditing system Download PDF

Info

Publication number
CN116227487A
CN116227487A CN202310031069.XA CN202310031069A CN116227487A CN 116227487 A CN116227487 A CN 116227487A CN 202310031069 A CN202310031069 A CN 202310031069A CN 116227487 A CN116227487 A CN 116227487A
Authority
CN
China
Prior art keywords
risk
identification code
detection data
module
risk point
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202310031069.XA
Other languages
Chinese (zh)
Other versions
CN116227487B (en
Inventor
华涛
郝锦程
周志明
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Zhejiang Fazhidao Information Technology Co ltd
Original Assignee
Zhejiang Fazhidao Information Technology Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Zhejiang Fazhidao Information Technology Co ltd filed Critical Zhejiang Fazhidao Information Technology Co ltd
Priority to CN202310031069.XA priority Critical patent/CN116227487B/en
Publication of CN116227487A publication Critical patent/CN116227487A/en
Application granted granted Critical
Publication of CN116227487B publication Critical patent/CN116227487B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/279Recognition of textual entities
    • G06F40/289Phrasal analysis, e.g. finite state techniques or chunking
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33Querying
    • G06F16/3331Query processing
    • G06F16/334Query execution
    • G06F16/3346Query execution using probabilistic model
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/205Parsing
    • G06F40/211Syntactic parsing, e.g. based on context-free grammar [CFG] or unification grammars
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/268Morphological analysis
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00Administration; Management
    • G06Q10/10Office automation; Time management
    • G06Q10/103Workflow collaboration or project management
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q50/00Information and communication technology [ICT] specially adapted for implementation of business processes of specific business sectors, e.g. utilities or tourism
    • G06Q50/10Services
    • G06Q50/18Legal services
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02DCLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
    • Y02D10/00Energy efficient computing, e.g. low power processors, power management or thermal management

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Business, Economics & Management (AREA)
  • General Physics & Mathematics (AREA)
  • Computational Linguistics (AREA)
  • General Health & Medical Sciences (AREA)
  • Human Resources & Organizations (AREA)
  • General Engineering & Computer Science (AREA)
  • Health & Medical Sciences (AREA)
  • Strategic Management (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Artificial Intelligence (AREA)
  • Tourism & Hospitality (AREA)
  • Marketing (AREA)
  • Entrepreneurship & Innovation (AREA)
  • Economics (AREA)
  • Data Mining & Analysis (AREA)
  • General Business, Economics & Management (AREA)
  • Databases & Information Systems (AREA)
  • Operations Research (AREA)
  • Quality & Reliability (AREA)
  • Probability & Statistics with Applications (AREA)
  • Technology Law (AREA)
  • Primary Health Care (AREA)
  • Machine Translation (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

The invention relates to the technical field of text data analysis, and discloses an intelligent auditing system for legal text risk points, which comprises an operation terminal, a server and a service terminal: the operation terminal is used for a user to upload the document to be checked; the server comprises a grabbing module, a word segmentation module, an analysis module, a detection data generation module and a risk point library, and the service terminal is used for providing the important sentences related to the risk points for the user. According to the method and the device, important vocabularies in the document to be audited are grabbed according to the preset grabbing rules, the important sentences are relocated, the important sentences are segmented and split, and then corresponding detection data in a picture format are generated, so that risk of the detection data is conveniently judged by a risk point library based on an AI technology, risk points are obtained, suspicious, risk or error positions in legal texts can be efficiently checked, the work efficiency of legal matters is improved, and the work load of the legal matters is reduced.

Description

Legal text risk point intelligent auditing system
Technical Field
The invention relates to the technical field of text data analysis, in particular to an intelligent auditing system for legal text risk points.
Background
The risk point auditing is an indispensable process of law affairs when writing law documents, and aims to ensure that the law documents are real, legal and effective and avoid possible problems in advance.
The existing risk point auditing is mainly completed by manpower, has low efficiency and more limitation, and firstly requires deep lawyers, and the quantity of the deep lawyers is small, so that the manual auditing cost is high, the auditing time is long, the risk cannot be completely identified, the identification rate is not high, and the accuracy is not high.
In order to solve the problem, the legal text risk point intelligent auditing system fuses a machine learning algorithm, a text analysis and natural language processing technology, a deep learning technology, a big data technology and the like, a professional sets specific risk point auditing principles and auditing standards, a machine automatically learns the principles and standards by means of an artificial intelligence technology, content of a legal document to be audited is compared with the principles and standards, missing data of the text to be audited and possible risk points are found, and risk prompts are made.
Disclosure of Invention
The invention aims to provide an intelligent auditing system for legal text risk points, which solves the following technical problems:
how to provide a legal text risk point intelligent auditing system capable of improving the working efficiency of legal workers.
The aim of the invention can be achieved by the following technical scheme:
the legal text risk point intelligent auditing system comprises an operation terminal, a server and a service terminal:
the operation terminal is used for uploading the document to be checked by a user;
the server includes:
the grabbing module is connected with the operation terminal and used for grabbing important words of the document to be checked according to preset grabbing rules;
the word segmentation module is connected with the grabbing module and used for splitting important sentences where the important vocabularies are located according to a preset word segmentation rule to obtain segmented words;
the analysis module is connected with the word segmentation module and used for marking the part of speech of the word segmentation and carrying out syntax analysis on the word segmentation according to the dependency analysis model;
the detection data generation module is connected with the word segmentation module and the analysis module and is used for generating corresponding detection data according to the syntactic analysis result of the important sentence;
the risk point library is connected with the detection data generation module and is used for obtaining risk points according to the detection data comparison;
the service terminal is connected with the risk point library and used for providing the important sentences related to the risk points for the user.
According to the technical scheme, important vocabularies in the document to be audited are grabbed according to the preset grabbing rules, the important sentences are relocated, the important sentences are split, and then the corresponding image format detection data are generated, so that risk of the detection data is conveniently judged by a risk point library based on an AI technology, risk points are obtained, suspicious, risk or error positions in legal texts can be efficiently checked, the work efficiency of legal matters is improved, and the work load of the legal matters is reduced.
As a further scheme of the invention: the syntactic analysis result comprises word sense identification codes corresponding to the segmented words and logic identification codes representing logic relations among the segmented words;
the method for generating corresponding detection data comprises the following steps:
and loading the word sense identification code and the logic identification code of the word corresponding to the important sentence in sequence to generate a blank picture, so as to obtain the detection data.
In the invention, the word sense identification code and the logic identification code are generated according to the preset corresponding rule, so that a word segmentation or a logic relationship is respectively in one-to-one correspondence with the word sense identification code and the logic identification code.
As a further scheme of the invention: the word sense identification code is a rectangular matrix punctuation code with the size specification of 50 x 50 pixels, and the logic identification code is a rectangular matrix punctuation code with the size specification of 10 x 2 pixels;
and the word sense identification code, black pixels and white pixels in the logic identification code and the arrangement sequence among different pixels respectively represent the logical relationship of word sense and front and rear word segmentation.
Thus, in this embodiment of the present invention, the specific display form in the blank picture is that word sense identification codes and logic identification codes are alternately arranged in sequence from left to right, and auxiliary words such as "have", "ground" and the like are deleted.
As a further scheme of the invention: the risk point library comprises a rechecking adjustment unit, when the abnormal probability is lower than the preset abnormal probability but higher than the preset normal probability, the rechecking adjustment unit drives the detection data generation module to regenerate the detection data according to a preset adjustment rule, so as to obtain the abnormal probability of the second detection, and comprehensively calculate the final risk probability;
P=αP1+βP2
wherein P is the final risk probability, alpha and beta are weight coefficients respectively, alpha < beta, P1 is the anomaly probability obtained by the first detection, and P2 is the anomaly probability obtained by the second detection. In the present embodiment of the present invention, α+β=1.
As a further scheme of the invention: the preset adjustment rule comprises:
the word sense identification code is adjusted to be a rectangular matrix punctuation code with the size specification of 60 x 60 pixels, and the logic identification code is a rectangular matrix punctuation code with the size specification of 10 x 3 pixels;
respectively adjusting the word sense identification code and the logic identification code into rectangular matrix punctuation codes filled by color pixels;
the word sense identification code, the color pixels in the logic identification code and the arrangement sequence among different pixels represent the logical relationship of word sense and front and back word segmentation respectively.
As a further scheme of the invention: the risk reminding system further comprises a risk reminding module, wherein the risk reminding module is used for integrating and centralizing the paragraphs where the key sentences exist, establishing indexes and facilitating checking.
The invention has the beneficial effects that: according to the method and the device, important vocabularies in the document to be audited are grabbed according to the preset grabbing rules, the important sentences are relocated, the important sentences are segmented and split, and then corresponding detection data in a picture format are generated, so that risk of the detection data is conveniently judged by a risk point library based on an AI technology, risk points are obtained, suspicious, risk or error positions in legal texts can be efficiently checked, the work efficiency of legal matters is improved, and the work load of the legal matters is reduced.
Drawings
The invention is further described below with reference to the accompanying drawings.
FIG. 1 is a schematic diagram of the principle of operation of the present invention;
FIG. 2 is a schematic diagram of audit logic in the present invention;
FIG. 3 is a machine learning training method for capturing important vocabularies in the present invention.
Detailed Description
The following description of the embodiments of the present invention will be made clearly and completely with reference to the accompanying drawings, in which it is apparent that the embodiments described are only some embodiments of the present invention, but not all embodiments. All other embodiments, which can be made by those skilled in the art based on the embodiments of the invention without making any inventive effort, are intended to be within the scope of the invention.
Referring to fig. 1-3, the invention discloses an intelligent auditing system for legal text risk points, which comprises an operation terminal, a server and a service terminal:
the operation terminal is used for uploading the document to be checked by a user;
the server includes:
the grabbing module is connected with the operation terminal and used for grabbing important words of the document to be checked according to preset grabbing rules;
the word segmentation module is connected with the grabbing module and used for splitting important sentences where the important vocabularies are located according to a preset word segmentation rule to obtain segmented words;
the analysis module is connected with the word segmentation module and used for marking the part of speech of the word segmentation and carrying out syntax analysis on the word segmentation according to the dependency analysis model;
the detection data generation module is connected with the word segmentation module and the analysis module and is used for generating corresponding detection data according to the syntactic analysis result of the important sentence;
the risk point library is connected with the detection data generation module and is used for obtaining risk points according to the detection data comparison;
the service terminal is connected with the risk point library and used for providing the important sentences related to the risk points for the user.
According to the technical scheme, important vocabularies in the document to be audited are grabbed according to the preset grabbing rules, the important sentences are relocated, the important sentences are split, and then the corresponding image format detection data are generated, so that risk of the detection data is conveniently judged by a risk point library based on an AI technology, risk points are obtained, suspicious, risk or error positions in legal texts can be efficiently checked, the work efficiency of legal matters is improved, and the work load of the legal matters is reduced.
In the specific embodiment of the invention, when the user uploads the document to be audited through the operation terminal, the document to be audited can be audited, and the text content can be modified.
After the operation terminal sends out, the server receives the information:
different documents are treated as word document processing, and the complex words possibly appearing in the text and the radical words are identified.
After receiving the information, the server can utilize a machine learning algorithm to perform data extraction operation, and grasp important words and data in the text. Then, the important sentences where important vocabularies are located are segmented, and the grabbed sentences are split into a plurality of words. For example, "the legal rights of the original notice has been seriously damaged by the behavior to be notified", "behavior", "serious", "damage", "original notice", "legal" and "rights" are then removed meaningless auxiliary words, the "already", "right" is removed corresponding to the above example, then Pos tag (part of speech tagging) is made for the divided words, and according to the risk recognition unit included in the risk point library, the abnormal probability corresponding to the important sentence is outputted according to the detection data, and when the abnormal probability is higher than the preset abnormal probability, the important sentence corresponding to the detection data output is marked as a risk point.
And comparing and judging the risk points related in the text to be checked with a legal network protection risk point library, and prompting to check the legal service. In addition, the correlation calculation can be carried out on the correlation and the numerical correlation of the money possibly related in the text to be checked, and after the analysis is finished, the relation of important information and the context of the whole things in the text to be checked are mastered. The text to be checked can be generated into legal documents with legal benefits without errors.
As a further scheme of the invention: the syntactic analysis result comprises word sense identification codes corresponding to the segmented words and logic identification codes representing logic relations among the segmented words;
the method for generating corresponding detection data comprises the following steps:
and loading the word sense identification code and the logic identification code of the word corresponding to the important sentence in sequence to generate a blank picture, so as to obtain the detection data.
In the invention, the word sense identification code and the logic identification code are generated according to the preset corresponding rule, so that a word segmentation or a logic relationship is respectively in one-to-one correspondence with the word sense identification code and the logic identification code.
As a further scheme of the invention: the word sense identification code is a rectangular matrix punctuation code with the size specification of 50 x 50 pixels, and the logic identification code is a rectangular matrix punctuation code with the size specification of 10 x 2 pixels;
and the word sense identification code, black pixels and white pixels in the logic identification code and the arrangement sequence among different pixels respectively represent the logical relationship of word sense and front and rear word segmentation.
Thus, in this embodiment of the present invention, the specific display form in the blank picture is that word sense identification codes and logic identification codes are alternately arranged in sequence from left to right, and auxiliary words such as "have", "ground" and the like are deleted.
As a further scheme of the invention: the risk point library comprises a risk identification unit, and is used for outputting abnormal probability corresponding to the important sentences according to the detection data, and marking the important sentences corresponding to the detection data output as risk points when the abnormal probability is higher than a preset abnormal probability.
As a further scheme of the invention: the risk point library comprises a rechecking adjustment unit, when the abnormal probability is lower than the preset abnormal probability but higher than the preset normal probability, the rechecking adjustment unit drives the detection data generation module to regenerate the detection data according to a preset adjustment rule, so as to obtain the abnormal probability of the second detection, and comprehensively calculate the final risk probability;
P=αP1+βP2
wherein P is the final risk probability, alpha and beta are weight coefficients respectively, alpha < beta, P1 is the anomaly probability obtained by the first detection, and P2 is the anomaly probability obtained by the second detection. In the present embodiment of the present invention, α+β=1.
As a further scheme of the invention: the preset adjustment rule comprises:
the word sense identification code is adjusted to be a rectangular matrix punctuation code with the size specification of 60 x 60 pixels, and the logic identification code is a rectangular matrix punctuation code with the size specification of 10 x 3 pixels;
respectively adjusting the word sense identification code and the logic identification code into rectangular matrix punctuation codes filled by color pixels;
the word sense identification code, the color pixels in the logic identification code and the arrangement sequence among different pixels represent the logical relationship of word sense and front and back word segmentation respectively.
As a further scheme of the invention: the risk reminding system further comprises a risk reminding module, wherein the risk reminding module is used for integrating and centralizing the paragraphs where the key sentences exist, establishing indexes and facilitating checking.
The foregoing describes one embodiment of the present invention in detail, but the description is only a preferred embodiment of the present invention and should not be construed as limiting the scope of the invention. All equivalent changes and modifications within the scope of the present invention are intended to be covered by the present invention.

Claims (7)

1. The legal text risk point intelligent auditing system is characterized by comprising an operation terminal, a server and a service terminal:
the operation terminal is used for uploading the document to be checked by a user;
the server includes:
the grabbing module is connected with the operation terminal and used for grabbing important words of the document to be checked according to preset grabbing rules;
the word segmentation module is connected with the grabbing module and used for splitting important sentences where the important vocabularies are located according to a preset word segmentation rule to obtain segmented words;
the analysis module is connected with the word segmentation module and used for marking the part of speech of the word segmentation and carrying out syntax analysis on the word segmentation according to the dependency analysis model;
the detection data generation module is connected with the word segmentation module and the analysis module and is used for generating corresponding detection data according to the syntactic analysis result of the important sentence;
the risk point library is connected with the detection data generation module and is used for obtaining risk points according to the detection data comparison;
the service terminal is connected with the risk point library and used for providing the important sentences related to the risk points for the user.
2. The legal text risk point intelligent auditing system according to claim 1, wherein the syntactic analysis result comprises a word sense identification code corresponding to the segmentation and a logic identification code representing a logic relationship between the segmentation;
the method for generating corresponding detection data comprises the following steps:
and loading the word sense identification code and the logic identification code of the word corresponding to the important sentence in sequence to generate a blank picture, so as to obtain the detection data.
3. The legal text risk point intelligent auditing system according to claim 2, wherein the word sense identification code is a rectangular matrix punctuation code of 50 x 50 pixel size specification, and the logic identification code is a rectangular matrix punctuation code of 10 x 2 pixel size specification;
and the word sense identification code, black pixels and white pixels in the logic identification code and the arrangement sequence among different pixels respectively represent the logical relationship of word sense and front and rear word segmentation.
4. The legal text risk point intelligent auditing system according to claim 2, wherein the risk point library includes a risk recognition unit for outputting an abnormal probability corresponding to the important sentence according to the detection data, and marking the important sentence corresponding to the detection data output as a risk point when the abnormal probability is higher than a preset abnormal probability.
5. The legal text risk point intelligent auditing system according to claim 2, wherein the risk point library comprises a rechecking adjustment unit, when the abnormal probability is lower than the preset abnormal probability but higher than the preset normal probability, the rechecking adjustment unit drives the detection data generation module to regenerate the detection data according to a preset adjustment rule, so as to obtain the abnormal probability of the second detection, and comprehensively calculate the final risk probability;
P=αP1+βP2
wherein P is the final risk probability, alpha and beta are weight coefficients respectively, alpha < beta, P1 is the anomaly probability obtained by the first detection, and P2 is the anomaly probability obtained by the second detection.
6. The legal text risk point intelligent auditing system of claim 5, wherein the preset adjustment rules comprise:
the word sense identification code is adjusted to be a rectangular matrix punctuation code with the size specification of 60 x 60 pixels, and the logic identification code is a rectangular matrix punctuation code with the size specification of 10 x 3 pixels;
respectively adjusting the word sense identification code and the logic identification code into rectangular matrix punctuation codes filled by color pixels;
the word sense identification code, the color pixels in the logic identification code and the arrangement sequence among different pixels represent the logical relationship of word sense and front and back word segmentation respectively.
7. The legal text risk point intelligent auditing system according to claim 1, further comprising a risk reminding module for integrating and centralizing the paragraphs of the key sentences with risks and establishing an index for convenient viewing.
CN202310031069.XA 2023-01-10 2023-01-10 Legal text risk point intelligent auditing system Active CN116227487B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202310031069.XA CN116227487B (en) 2023-01-10 2023-01-10 Legal text risk point intelligent auditing system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202310031069.XA CN116227487B (en) 2023-01-10 2023-01-10 Legal text risk point intelligent auditing system

Publications (2)

Publication Number Publication Date
CN116227487A true CN116227487A (en) 2023-06-06
CN116227487B CN116227487B (en) 2023-11-10

Family

ID=86581734

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202310031069.XA Active CN116227487B (en) 2023-01-10 2023-01-10 Legal text risk point intelligent auditing system

Country Status (1)

Country Link
CN (1) CN116227487B (en)

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102279875A (en) * 2011-06-24 2011-12-14 成都市华为赛门铁克科技有限公司 Method and device for identifying phishing website
CN107943793A (en) * 2018-01-10 2018-04-20 威盛电子股份有限公司 The semantic analytic method of natural language
CN111311451A (en) * 2020-02-29 2020-06-19 重庆百事得大牛机器人有限公司 Remote interaction management system for corporate counselor services
CN112017019A (en) * 2020-07-10 2020-12-01 苏宁云计算有限公司 Automatic reimbursement method and device based on PDF semantic extraction analysis, computer equipment and storage medium
CN113836928A (en) * 2021-09-28 2021-12-24 平安科技(深圳)有限公司 Text entity generation method, device, equipment and storage medium
CN114841160A (en) * 2022-05-16 2022-08-02 中国工商银行股份有限公司 Text auditing method and related device thereof

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102279875A (en) * 2011-06-24 2011-12-14 成都市华为赛门铁克科技有限公司 Method and device for identifying phishing website
CN107943793A (en) * 2018-01-10 2018-04-20 威盛电子股份有限公司 The semantic analytic method of natural language
CN111311451A (en) * 2020-02-29 2020-06-19 重庆百事得大牛机器人有限公司 Remote interaction management system for corporate counselor services
CN112017019A (en) * 2020-07-10 2020-12-01 苏宁云计算有限公司 Automatic reimbursement method and device based on PDF semantic extraction analysis, computer equipment and storage medium
CN113836928A (en) * 2021-09-28 2021-12-24 平安科技(深圳)有限公司 Text entity generation method, device, equipment and storage medium
CN114841160A (en) * 2022-05-16 2022-08-02 中国工商银行股份有限公司 Text auditing method and related device thereof

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
RAINER KOSCHKE 等: "Clone Detection Using Abstract Syntax suffix Trees", 《2006 13TH WORKING CONFERENCE ON REVERSE ENGINEERING》, pages 1 - 5 *
李洪波;于建平;: "英语情态动词语义排歧中的知识发现研究", 电脑知识与技术, no. 12, pages 1 - 3 *

Also Published As

Publication number Publication date
CN116227487B (en) 2023-11-10

Similar Documents

Publication Publication Date Title
CN110580308B (en) Information auditing method and device, electronic equipment and storage medium
CN114611494B (en) Text error correction method, device, equipment and storage medium
CN112883693B (en) Method and terminal for automatically generating electric power work ticket
CN111930792A (en) Data resource labeling method and device, storage medium and electronic equipment
CN115438650B (en) Contract text error correction method, system, equipment and medium fusing multi-source characteristics
CN115759119B (en) Financial text emotion analysis method, system, medium and equipment
CN112257425A (en) Power data analysis method and system based on data classification model
CN111027553A (en) Character recognition method for circular seal
CN115834935A (en) Multimedia information auditing method, advertisement auditing method, equipment and storage medium
CN111126056B (en) Method and device for identifying trigger words
CN114970502A (en) Text error correction method applied to digital government
CN116842021B (en) Data dictionary standardization method, equipment and medium based on AI generation technology
CN116227487B (en) Legal text risk point intelligent auditing system
CN118170907A (en) Document intelligent label system based on deep neural network and implementation method thereof
CN117932009A (en) ChatGLM model-based insurance customer service dialogue generation method, chatGLM model-based insurance customer service dialogue generation device, chatGLM model-based insurance customer service dialogue generation equipment and ChatGLM model-based insurance customer service dialogue generation medium
CN116956944A (en) Endangered language translation model method integrating syntactic information
CN115482075A (en) Financial data anomaly analysis method and device, electronic equipment and storage medium
CN112488593B (en) Auxiliary bid evaluation system and method for bidding
CN113111869B (en) Method and system for extracting text picture and description thereof
CN111402012B (en) E-commerce defective product identification method based on transfer learning
CN113553052B (en) Method for automatically recognizing security-related code submissions using an Attention-coded representation
US20220121818A1 (en) Dependency graph-based word embeddings model generation and utilization
Schneider et al. Nautilus: An End-To-End METS/ALTO OCR Enhancement Pipeline
CN116432635A (en) Text error correction method, system, electronic equipment and storage medium
Shree et al. Generation of Syntax Parser on South Indian Language using Bottom-Up Parsing Technique and PCFG

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant