WO2021031383A1 - Intelligent auxiliary judgment method and apparatus, and computer device and storage medium - Google Patents

Intelligent auxiliary judgment method and apparatus, and computer device and storage medium Download PDF

Info

Publication number
WO2021031383A1
WO2021031383A1 PCT/CN2019/116487 CN2019116487W WO2021031383A1 WO 2021031383 A1 WO2021031383 A1 WO 2021031383A1 CN 2019116487 W CN2019116487 W CN 2019116487W WO 2021031383 A1 WO2021031383 A1 WO 2021031383A1
Authority
WO
WIPO (PCT)
Prior art keywords
text
trial
original
target
data
Prior art date
Application number
PCT/CN2019/116487
Other languages
French (fr)
Chinese (zh)
Inventor
胡文成
赵付利
Original Assignee
平安科技(深圳)有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 平安科技(深圳)有限公司 filed Critical 平安科技(深圳)有限公司
Publication of WO2021031383A1 publication Critical patent/WO2021031383A1/en

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q50/00Information and communication technology [ICT] specially adapted for implementation of business processes of specific business sectors, e.g. utilities or tourism
    • G06Q50/10Services
    • G06Q50/18Legal services
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/36Creation of semantic tools, e.g. ontology or thesauri
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02DCLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
    • Y02D10/00Energy efficient computing, e.g. low power processors, power management or thermal management

Definitions

  • This application relates to the field of artificial intelligence technology, in particular to an intelligent auxiliary trial method, device, computer equipment and storage medium.
  • the inventor proposes an intelligent auxiliary trial system that can help to reduce the workload of judicial staff (including clerks and judges) in the trial process of cases and is applied to the intelligent auxiliary trial system Intelligent auxiliary trial method.
  • the embodiments of the present application provide an intelligent auxiliary trial method, device, computer equipment, and storage medium to solve the current problem of large workload of judicial staff in the judgment process.
  • An intelligent auxiliary trial method including:
  • An intelligent auxiliary trial device including:
  • the case description information acquisition module is used to obtain the court trial transcript file corresponding to the case to be judged from the database, and extract the case description information from the court trial transcript file;
  • the target word segmentation result acquisition module is configured to use a word segmentation tool to segment the case description information to obtain a target word segmentation result, where the target word segmentation result includes multiple target word segmentation;
  • the target keyword determining module is configured to query the keyword database based on each target word segmentation, and determine the original keywords stored in the keyword database that match the target word segmentation as target keywords;
  • the target case type determination module is configured to query the case type information database according to the target keyword, and obtain at least one target case type that matches the target keyword;
  • the target push law article acquisition module is used to acquire prior knowledge matching at least one of the target case types from the prior knowledge base, and use the intelligent law article push model to process the prior knowledge to obtain the target push method Article;
  • the recommended trial viewpoint acquisition module is configured to use a semantic analysis model to perform semantic analysis on the case description information, obtain standardized information, query the trial viewpoint database based on the standardized information, and obtain corresponding recommended trial viewpoints;
  • the trial suggestion file acquisition module is used to push the law and the recommended trial viewpoint according to the target, and acquire and display the trial suggestion file.
  • a computer device includes a memory, a processor, and computer-readable instructions stored in the memory and capable of running on the processor, and the processor implements the following steps when the processor executes the computer-readable instructions:
  • One or more non-volatile readable storage media storing computer readable instructions, the computer readable storage medium storing computer readable instructions, and when the computer readable instructions are executed by one or more processors, Make the one or more processors execute the following steps:
  • Fig. 1 is a schematic diagram of an application environment of an intelligent assisted trial method in an embodiment of the present application
  • Fig. 2 is a flow chart of the intelligent assisted trial method in an embodiment of the present application
  • FIG. 3 is another flowchart of the intelligent assisted trial method in an embodiment of the present application.
  • FIG. 4 is another flowchart of the intelligent assisted trial method in an embodiment of the present application.
  • Fig. 5 is a functional block diagram of the intelligent auxiliary trial device in an embodiment of the present application.
  • Fig. 6 is a schematic diagram of a computer device in an embodiment of the present application.
  • the intelligent auxiliary trial method provided by the embodiment of the present application can be applied to the application environment shown in FIG. 1.
  • the intelligent assisted trial method is applied in an intelligent assisted trial system.
  • the intelligent assisted trial system includes a client, a microphone, and a server as shown in FIG. 1. Both the client and the microphone communicate with the server through a network, where the client
  • the client is also called the client, which refers to the program that corresponds to the server and provides local services to the client.
  • the client can be installed on, but not limited to, various personal computers, laptops, smart phones, tablet computers, and portable wearable devices.
  • the client is a terminal used to implement human-computer interaction with the parties in the court trial, and the microphone is used to collect voice data of the parties in the court trial.
  • clients and microphones are installed on the judge’s bench, the plaintiff’s seat, the court’s seat, and the third party’s seat, while the clerk’s seat has a client.
  • the server can be implemented as an independent server or a server cluster composed of multiple servers.
  • the client and the server in the intelligent auxiliary trial system communicate through the network to realize the processing of the court trial transcripts to obtain the result of the judgment proposal, so that the judge can make the judgment based on the result of the judgment proposal It is recommended that the results be judged accordingly, so as to reduce the workload of judges and improve their work efficiency.
  • an intelligent assisted trial method is provided. The method is applied to the server in Figure 1 as an example for description, including the following steps:
  • S201 Obtain court trial transcript files corresponding to the case to be judged from the database, and extract case description information from the court trial transcript files.
  • the pending cases refer to cases that need to be tried and handled.
  • the court hearing transcript is a document that records the entire litigation activities carried out by all the parties to the court hearing during the court hearing.
  • the court trial transcript file in this embodiment is the court trial transcript file formed in the foregoing embodiment and stored in the database.
  • Case description information refers to the description information extracted from the court trial transcripts that is related to the pending case and will affect the trial result.
  • This court trial transcript is a document that collects information such as facts and evidence of the case to be judged by the parties in the court trial based on the standard court trial transcript template during the court trial.
  • the standard court trial transcript template is a pre-set transcript template for recording case information corresponding to each process in the court trial process.
  • the trial transcript file is a transcript file corresponding to the pending case collected by the courts at all levels according to the standard court trial transcript template
  • the court trial transcript file has a standard format, which can help quickly extract case description information.
  • the transcript of a court hearing records the following content: the facts that the two parties have no dispute are..., the focus of the dispute between the two parties is...; in the process of obtaining case description information from the court hearing transcript, a text matching algorithm can be used to match the original
  • the courts have undisputed facts" and "the focus of disputes between the two parties have” fields.
  • the content after these fields are determined as the facts ascertained by the court and the focus of the dispute between the parties.
  • These content are the descriptive information that affects the outcome of the trial. It serves as case description information.
  • the case description information can also be filled into the corresponding position of the judgment document template, and the final judgment recommendation result determined according to this embodiment is obtained.
  • the case basis that is, the specific applicable legal provisions
  • the case basis and the judgment point of view are filled into the corresponding position of the judgment document template to quickly obtain the corresponding target judgment document.
  • the target judgment document is the final judgment document corresponding to the pending case, such as a civil judgment and a criminal judgment.
  • S202 Use a word segmentation tool to segment the case description information to obtain a target segmentation result, and the target segmentation result includes multiple target segmentation.
  • the word segmentation tool is a tool used to implement Chinese word segmentation of the text.
  • Chinese Word Segmentation refers to the segmentation of a sequence of Chinese characters into multiple individual words. Word segmentation refers to the process of recombining consecutive word sequences into word sequences according to certain specifications.
  • the target segmentation result refers to the result of multiple segmentation formed by the case description information.
  • the target word segmentation refers to the word segmentation finally formed according to the case description information.
  • the word segmentation tool may be an open source tool commonly used in the market for word segmentation of Chinese text-stuttering word segmentation.
  • Stuttering Chinese word segmentation is used to segment the case description information.
  • the segmentation process can support three word segmentation modes including: (1) Accurate mode: Try to cut the sentence most accurately, suitable for text analysis, but low efficiency. (2) Full mode: Scan all the words in the sentence that can be formed into words. The word segmentation speed is fast, but it cannot solve the ambiguity problem.
  • step S202 specifically includes the following steps:
  • S2011 Use the search engine mode of the stuttering word segmentation tool to perform text segmentation on the case description information, and obtain the text segmentation result.
  • the text segmentation result includes N first-level word segmentation.
  • the server uses the search engine mode of the stammering word segmentation tool to perform text segmentation of the case description information to quickly obtain the text segmentation result.
  • the text segmentation result can be understood as the result of the word segmentation using the conventional search engine mode, which is before optimization Word segmentation result.
  • the first-level word segmentation is the word segmentation that constitutes the text segmentation result, and N is the number of first-level word segmentation in the text word segmentation result.
  • the obtained text segmentation results include the following 5 first-level word segmentation: 1. Defendant, 2. .None, 3. Behavior, 4. Ability, 5. Behavior Ability. From the above text segmentation results, it can be seen that in three consecutive first-level participles (from the third first-level participle to the fifth first-level participle), the third first-level participle "behavior" and the fourth first-level participle "ability" After the combination, it repeats the meaning of the fifth first-level word segmentation "behavior", and the word segmentation result is inaccurate. If the subsequent semantic analysis is directly based on the text segmentation result, the efficiency and accuracy of the subsequent analysis may be affected.
  • Superposition means that in two adjacent first-level participles, at least one Chinese character at the end of the previous first-level participle overlaps with at least one Chinese character at the front end of the next first-level participle, that is, two adjacent first-level participles.
  • Grade participles can be spliced based on overlapping Chinese characters, leaving only one overlapping character or word to form a concatenated word, for example: the three consecutive first-level participles are: “Management”, “Technology” and “Work” in " The overlapping Chinese character in the two first-level participles of "Management” and “Technology” is " ⁇ ", and the overlapping Chinese character of "Technology” and “ ⁇ ” is “ ⁇ ”.
  • the three first-level participles “management”, “science and engineering” and “work” can be superimposed to form a new word: "management work”.
  • “Combine” refers to the direct merging of two first-level participles without removing the merged form of the participles of repeated Chinese characters.
  • the two first-level participles are "management” and "work”, and the compound word “management work” can be obtained by merging the aforementioned two first-level participles.
  • the case description information is: the court masters trade secrets during the management work; the search engine mode of the stuttering word segmentation tool is used to segment the case description information, and the text segmentation results obtained are: 1. The court, 2. In, 3. Management, 4. Science and engineering, 5. Work, 6. Management work, 7. Process, 8. In, 9. Master, 10. Business, 11. Secret, 12. Trade secret.
  • the superposition of "management", “science and engineering” and “work” equals “management work”
  • the combination of "management” and “work” equals “management work”. Therefore, only “management” and “work” are retained.
  • Two first-level participles are used as optimized target participles, and the two first-level participles of "science and engineering” and “management work” are deleted.
  • the case description information is: this clause is only for candidates with industry experience; after text segmentation is performed on the case description information using the search engine mode of the stuttering word segmentation tool, the text segmentation results obtained are: 1. This, 2. . Clause, 3. Only, 4, Targeting, 5. Yes, 6, Same industry, 7. Experience, 8. Same industry experience, 9, of, 10, Candidate, 11, Pick, 11, Candidate. Among them, the superposition of "candidate” and “chosen” is equal to “candidate”, and there is no situation that the combination is equal to “candidate”. At this time, delete "candidate” and “choose person” and only keep the “candidate” The first level participle.
  • the case description information is: the court masters trade secrets during the management work; the search engine mode of the stuttering word segmentation tool is used to segment the case description information, and the text segmentation results obtained are: 1. The court, 2. In, 3. Management, 4. Science and Engineering, 5. Work, 6. Management Work, 7. Process, 8. In, 9. Master, 10. Business, 11. Secret, 12. Trade Secret.
  • the combination of "commercial” and “secret” is equal to "commercial secret", and there is no overlap. Therefore, only the two first-level participles of "commercial” and “secret” are retained as the optimized target participles, and " "Trade secret” is a first-level participle.
  • first-level segmentation words with semantic repetition are analyzed to determine whether there is a combination or superposition of consecutive first-level segmentation words that can constitute a continuous
  • the long words immediately following the first-level participles of, and the first-level participles or the final long words are processed according to the different combinations or superpositions of the continuous first-level participles to obtain more accurate word segmentation results, reducing repeated words or uselessness Word, in order to realize the optimization of the word segmentation results under the premise of ensuring the efficiency of word segmentation, and improve the accuracy of word segmentation.
  • S203 Query the keyword database based on each target word segmentation, and determine the original keyword stored in the keyword database that matches the target word segmentation as the target keyword.
  • the keyword library is a preset database for storing original keywords.
  • the original keywords are pre-set keywords that may affect the identification of the case type.
  • the corresponding relationship between the original keyword and at least one synonym is stored in the keyword library.
  • the server uses a string matching algorithm or other matching algorithms to query whether the target word segment exists in the keyword database (it can be the original keyword or its corresponding synonym), and if the target word segment exists, it The original keyword corresponding to the target word segmentation is determined as the target keyword. For example, there are synonym groups A1, A2, and A3 in the original keyword database, and A1 is the original keyword. If the target word segmentation is the same as any of A1, A2, and A3, then A1 is determined as the target word segmentation corresponding The target keywords.
  • step S203 specifically includes the following steps:
  • S2031 Query the thesaurus according to the target word segmentation, and obtain the target synonyms corresponding to the target word segmentation.
  • the thesaurus is used to store a pre-set database for storing synonym relationships.
  • the target synonyms are the synonyms recorded in the thesaurus that have a synonym relationship with the target segmentation.
  • a synonym group with a synonym relationship is pre-stored in the thesaurus, and the server can use a string matching algorithm to match the target word segmentation, and then obtain a synonym having a synonym relationship with the target word segmentation and determine it as the target synonym.
  • S2032 Query the keyword database according to the target word segmentation and target synonyms, determine whether there is an original keyword matching the target word segmentation or the target synonym, and if the original keyword exists, determine the original keyword as the target keyword.
  • the criminal Law stipulates that “for the purpose of committing a crime, tools and conditions are prepared for crime. For prepared criminals, they can be given a lighter, mitigated punishment or exempt from punishment in the same way as the completed offender.”
  • the keywords in the preparation tools and manufacturing conditions are whether to determine whether It is the key word for the situation of "criminal preparation", and during the court hearing, the parties in the court hearing can use other synonyms to express this meaning. Therefore, the server needs to query the keyword database based on the target word segmentation and target synonyms to determine whether there is an original keyword matching the target word segmentation or target synonym in the keyword database. If the original keyword exists, it will be determined as the target keyword. In order to expand the search scope of the target word segmentation, as many target keywords as possible are determined from the case description information to improve the accuracy of the subsequent judgment and suggestion results obtained.
  • S204 Query the case type information database according to the target keyword, and obtain at least one target case type that matches the target keyword.
  • the case type information database is an information database used to store different case types and their corresponding keywords. Since different case types (such as different types of crimes in the criminal law) may correspond to the same keyword during the trial process, each keyword and its corresponding case type will be stored in the form of key-value pairs in the case type information database , So that after obtaining the target keyword, the server can query the case type information table according to the target keyword to determine all case types containing the target keyword as the target case type.
  • S205 Obtain prior knowledge that matches at least one target case type from the prior knowledge base, and obtain the target push rule based on the prior knowledge.
  • the priori knowledge base is a database constructed based on priori knowledge extracted from historical judgment data, and each historical judgment data is specific to a specific case type.
  • the server queries the prior knowledge base according to at least one case type, and queries the prior knowledge matching the at least one target case type from the prior knowledge base, thereby performing prior knowledge screening.
  • the prior knowledge includes case description information in historical judgment data corresponding to at least one target case type and corresponding case judgment results.
  • the a priori knowledge base stores all case knowledge with too much content, if all the data in the prior knowledge base is vectorized for each case, there may be a problem of low execution efficiency. Therefore, it can be determined by the target keywords At least one target case type, and then use at least one target case type to filter the prior knowledge in the prior knowledge base to determine its corresponding prior knowledge, so as to perform the subsequent steps S206 and S207 to avoid the prior knowledge All prior knowledge in the library is processed to improve execution efficiency.
  • the target push law specifically refers to extracting from all prior knowledge the judgment basis used in the judgment result of the case (ie the specific application of the law), and the application of all judgment basis The number of times is counted and sorted, and finally a target push rule based on the number of applications from more to less is formed, so that the judge can understand the basis of the judgment of relevant historical judgment data, so as to save the time of consulting relevant materials in the judgment process and reduce the workload.
  • S206 Use the semantic analysis model to perform semantic analysis on the case description information, obtain standardized information, and query the judgment opinion database corresponding to the target case type based on the standardized information, and obtain corresponding recommended judgment opinions.
  • the semantic analysis model is a model constructed based on the NLP (Natural Language Processing) technology for analyzing the semantics of text.
  • Standardized information refers to the information extracted from the case description information that may affect the trial of the case.
  • the judgment viewpoint database is a database used to record the historical description information extracted from the historical judgment data and the association relationship between the corresponding historical judgment viewpoints.
  • the server uses a semantic analysis model to perform semantic analysis on case description information to extract standardized information therein.
  • a semantic analysis model to describe the case.
  • information that matches the age of the court will be screened out as standardized information.
  • the similarity is sorted according to the text similarity, and the top N (the number of N can be set independently) historical judgment viewpoints corresponding to the historical description information with the greater text similarity are used as recommended judgment viewpoints so that the judge can judge In the course of the case, timely understand the historical trial viewpoints of similar cases, and combine the actual situation of the pending case to make a judgment in a timely manner, reducing the workload of the judge to consult relevant materials. Understandably, querying the trial viewpoint database corresponding to the target case type according to the standardized information can effectively reduce the amount of data comparison between the standardized information and the historical description information in the trial viewpoint database, and help improve the efficiency of data processing.
  • the server pushes the acquired at least one target law and the acquired at least one recommended trial viewpoint as a trial suggestion file corresponding to the case to be judged.
  • the trial suggestion file is a case description extracted by the system through the court trial transcript.
  • the information is analyzed, and the targets determined in the historical judgment data related to the description of the case are pushed to the legal articles and recommended trial viewpoints, so as to provide a reference for the judge to judge, reduce the judge's process of consulting related cases and reduce their workload.
  • the case description information after the case description information is quickly extracted from the court hearing transcript, the case description information can be segmented and keyword matching processing can quickly obtain its corresponding target keywords, and use the The target keyword determines the corresponding at least one target case type, so that prior knowledge matching at least one target case type can be selected from the prior knowledge base, and the corresponding target push law can be quickly obtained based on the prior knowledge , To speed up the efficiency of obtaining the judgment proposal documents.
  • the intelligent auxiliary trial method before step S201, that is, before obtaining the court trial transcript file corresponding to the case to be judged from the database, the intelligent auxiliary trial method further includes:
  • the data update task includes the original case type and task update time.
  • the data update task is a task used to update the prior knowledge base and target judgment model.
  • the original case type refers to the case type targeted by the data update task.
  • the task update time refers to the time when the data update task was executed last time.
  • the judgment basis corresponding to the target case type specifically refers to the legal basis corresponding to the target case type, including legal provisions, regulations, and judicial interpretations.
  • the judgment basis corresponding to the original case type is changed after the task update time, it means that the judgment basis corresponding to the original case type has changed after the task update time, and the subsequent case trial process New laws, regulations, and judicial interpretations will be applied to the judgment basis. Therefore, the prior knowledge base and target judgment model need to be updated.
  • the execution time of the judgment basis is determined as the time of change, and the time of change and system are obtained
  • the historical judgment data corresponding to the original case type between the current time and the data to be processed are determined based on the historical judgment data, which helps to improve the timeliness and accuracy of the target judgment model trained subsequently.
  • the to-be-processed data is specifically training data used to update the a priori knowledge base and the trial opinion base.
  • the preset period is a preset period for collecting data, which can be set to three months, half a year, or one year.
  • the judgment basis corresponding to the original case type does not change after the task update time, it means that the old trial basis is still applied in the subsequent case trial process, but as time changes, the application of these trial basis
  • the standard may change, such as the amount of compensation.
  • the historical judgment data corresponding to the original case type in the preset period before the current time of the system is used to determine the data to be processed to ensure the timeliness of the data to be processed. This helps to improve the timeliness of the subsequent training of the target judgment model.
  • the data to be processed includes a target area, which can be understood as the administrative area where the trial court corresponding to the historical judgment data is located, such as Guangdong province or Shenzhen.
  • a target area which can be understood as the administrative area where the trial court corresponding to the historical judgment data is located, such as Guangdong province or Shenzhen.
  • the historical judgment data corresponding to the same target area and the original case type is determined as the data to be processed.
  • the preset number threshold is a preset number threshold. If the target data volume of the to-be-processed data corresponding to any target area is greater than the preset number threshold, it means that there are more cases in the target area that are tried by courts at all levels corresponding to the original case type. Therefore, the same target area
  • the historical judgment data corresponding to the original case type is determined as the data to be processed, so that a priori knowledge base corresponding to the target area and the original case type is constructed based on the data to be processed, and a priori knowledge base corresponding to the target area and the original case type is constructed. In order to make the a priori knowledge base and the trial viewpoint database have greater reference significance and more pertinence for the pending cases corresponding to the target case type in the target area.
  • the historical judgment data corresponding to the same original case type is determined as the data to be processed.
  • the target data volume of the to-be-processed data corresponding to any target area is not greater than the preset number threshold, it means that the number of cases corresponding to the original case type in the courts at all levels in the target area is small, and the same original case type is corresponding
  • the historical judgment data is determined as the data to be processed, the a priori knowledge base corresponding to the original case type is constructed, and the trial viewpoint database corresponding to the original case type is trained, so that the The data to be processed is not limited to the same target area, but is only divided according to the original case type.
  • S304 Extract prior information from the data to be processed, and construct a priori knowledge base corresponding to the target case type based on the prior information.
  • the data to be processed is historical judgment data corresponding to the original case type.
  • the server extracts a priori information from the data to be processed. Specifically, it can be understood as extracting the description of the case and the result of the case from the data to be processed, and then extracting keywords from the description of the case and the result of the case, using keys specifically
  • the key-value form defines these keywords; then, the extracted keywords are used to construct a priori knowledge base corresponding to the original case type, specifically the key-value pairs extracted from all the data to be processed Stored in the database to construct a priori knowledge base corresponding to the original case type.
  • Key can specifically be a keyword in the case description that affects the judgment result, and Value is the judgment result in the case judgment result.
  • S305 Extract historical description information and historical judgment viewpoints from the data to be processed, and construct a judgment viewpoint database corresponding to the original case type based on the historical description information and historical judgment viewpoints.
  • the historical description information is the information extracted from the data to be processed that may affect the trial of the case. Specifically, it is extracted from the case description information of the data to be processed using a semantic analysis model.
  • the historical trial viewpoint is to extract the judge's trial viewpoint of the historical case from the data to be processed.
  • the server uses the historical description information and historical judgment opinions as a set of training data and inputs them to common CNN (Convolutional Neural Network) or RNN (Recurrent Neural Network) Carry out model training to update the model parameters, so as to obtain the trial opinion library corresponding to the original case type.
  • CNN Convolutional Neural Network
  • RNN Recurrent Neural Network
  • the intelligent auxiliary trial method according to whether the judgment basis corresponding to the original case type has changed after the task update time, the historical judgment data of different time periods is collected and determined to be used to construct prior knowledge To ensure the timeliness of the data to be processed, the a priori knowledge base and the trial view database are updated to ensure that the target push method is obtained separately based on the prior knowledge base and the trial view database. The timeliness and accuracy of the article and recommended trial opinions.
  • the intelligent assisted trial method further includes the following steps:
  • S401 Display the trial prompt text corresponding to the current trial link in the standard court trial transcript template on the client, and obtain the original voice data collected by the microphone and corresponding to the trial prompt text.
  • the standard court trial transcript template generally includes pre-trial preparation, announcement of the opening, court investigation and court debate, etc.
  • the court investigation stage it specifically includes the statement of the parties, the summary of the focus of the dispute, the cross-examination of evidence around the focus, the witness testimony in court, the reading of expert opinions and Corresponding instructions are provided for each link of the inspection transcript and application for appraisal.
  • This guiding speech is generally the speech corresponding to the guiding question that the judge guides other parties (such as the plaintiff, lawyer or witness) to respond, such as "the plaintiff first presents to the court around your litigation request".
  • Trial prompt text refers to the guiding words corresponding to the current trial link in the standard court trial transcript template.
  • the current trial link refers to the ongoing link in the court hearing the case, such as the statement of the parties or other links.
  • the original voice data is the voice data collected in real time when the parties to the trial respond to the trial prompt text.
  • the judge will guide or ask questions to the parties (such as the plaintiff, lawyer or witness) in the court trial based on the trial prompt information.
  • the parties in the court trial need to respond.
  • the voice data collected by the microphone is Raw voice data.
  • the intelligent auxiliary trial system includes at least one microphone connected to the server, and each microphone corresponds to a microphone identifier, and the microphone identifier is an identifier for uniquely identifying different microphones. When the microphone collects raw voice data in real time, its raw voice data is associated with the corresponding microphone identifier.
  • S402 Perform voiceprint recognition on the original voice data, obtain a voiceprint recognition result, and determine an object identifier corresponding to the original voice data according to the voiceprint recognition result.
  • the voiceprint recognition is performed on the original voice data to obtain the voiceprint recognition result.
  • the server uses the preset voiceprint feature extraction algorithm to extract the voiceprint feature of the original voice data, and then performs the voiceprint feature extraction based on the extracted voiceprint feature. Pattern recognition to determine the identity of the speaker corresponding to the original voice data.
  • the object identifier is used to uniquely identify different parties in the court hearing.
  • the server uses a voiceprint feature extraction algorithm to extract voiceprint features from the original voice data, obtains the target voiceprint feature, and determines whether there is a standard voiceprint feature corresponding to the target voiceprint feature to obtain the corresponding voiceprint recognition result .
  • the voiceprint recognition result includes two results of the existence of a standard voiceprint feature and the absence of a standard voiceprint feature.
  • step S402 specifically includes the following steps:
  • S4011 Use the voiceprint feature extraction algorithm to extract the voiceprint feature of the original voice data, obtain the target voiceprint feature, and determine whether there is a standard voiceprint feature corresponding to the target voiceprint feature.
  • the voiceprint feature extraction algorithm is an algorithm used to extract voiceprint features from voice data to determine the voiceprint features corresponding to the original voice data.
  • the voiceprint feature extraction algorithm includes but is not limited to the MFCC extraction algorithm, and the extracted target voiceprint feature is the MFCC feature.
  • MFCC Mel-scale Frequency Cepstral Coefficients
  • the Mel scale describes the non-linear characteristics of the human ear frequency.
  • the target voiceprint feature is the voiceprint feature extracted from the original voice data.
  • the standard voiceprint feature is the voiceprint feature extracted based on the voice data of the parties to the trial collected during the pre-trial preparation process.
  • the standard voiceprint feature is also the MFCC feature extracted by the MFCC extraction algorithm.
  • the standard voiceprint features collected by the parties in the court trial and their corresponding identification are stored in a database in association with each other for subsequent identification processing.
  • the server uses the cosine similarity algorithm or other similarity algorithms to calculate the similarity between the target voiceprint feature and each standard voiceprint feature pre-stored in the server to obtain the target similarity; if the target similarity is greater than the preset If the similarity threshold is the same, it is determined that there is a standard voiceprint feature; if the target similarity is not greater than the preset similarity threshold, it is determined that there is no standard voiceprint feature.
  • the preset similarity threshold is a threshold used to evaluate whether the similarity reaches the criteria for identifying the same speaker.
  • the object identifier corresponding to the original voice data is determined according to the identity tag corresponding to the standard voiceprint feature, so as to quickly determine the original voiceprint feature.
  • Object ID corresponding to the voice data The identity mark is used to distinguish the identity of the parties in the court hearing, such as the plaintiff, the court, and the witness.
  • the object identifier is used to uniquely identify different parties in the court hearing.
  • the object identification can be a serial number identification added to the identity identification. For example, in the case of multiple plaintiffs, the object identification in the form of plaintiff 01 and plaintiff 02 can be used to distinguish.
  • the serial number identification can be determined according to the sequence of collecting standard identity features during the pre-trial preparation process, or according to the sequence of their speeches during the court hearing, so that each party to the court hearing has an object identifier that uniquely identifies its identity.
  • the server does not store the standard voiceprint feature corresponding to the target voiceprint feature, it means that the speaker has not collected the standard voiceprint feature in advance during the pre-court preparation process. At this time, it can be based on the original voice data.
  • the carried microphone ID queries the microphone information table, obtains the ID ID corresponding to the microphone ID, and generates the corresponding object ID based on the ID ID.
  • the microphone information table is an information comparison table used to determine the identity identifier corresponding to the speaker according to the placement position of the microphone, and the microphone information table associates the microphone identifier with its corresponding identity identifier.
  • the identity corresponding to the microphone identity is the witness, and the corresponding object identity is generated based on the identity. Specifically, it refers to the serial number identity formed by adding the identity of the witness and the order of speaking. Obtain the corresponding object ID, such as witness 01, witness 02, etc.
  • the target voiceprint feature extracted from the original voice data is used to determine whether there is a voiceprint recognition result corresponding to the standard voiceprint feature to determine whether it is based on the standard voiceprint feature or the microphone
  • the identifier determines the corresponding object identifier to ensure the uniqueness of the determined identity identifier.
  • S403 Perform text translation on the original voice data, obtain the original text data corresponding to the object identifier, and store the object identifier and the original text data in a corresponding position in the standard court trial transcript template.
  • the text translation of the original voice data refers to the process of translating the original voice data into data in text form.
  • the original text data refers to the text data translated from the original voice data.
  • the server can use, but is not limited to, a static decoding network to perform text translation on the original voice data. Since the static decoding network has fully expanded the search space, the decoding speed is very fast when performing text translation, which can be fast Obtain the original text data corresponding to the object ID. Understandably, the server receives the original voice data collected by the microphone, and then uses the static decoding network to translate the original voice data to quickly obtain the corresponding original text data without manual input by the clerk, thereby speeding up the original text data Input efficiency.
  • the server performs text translation on the original voice data to obtain the corresponding original text data
  • the original text data and its corresponding object identifier are associated and stored in the corresponding position of the standard court trial transcript template, that is, the original text data is filled in The position corresponding to the object identifier in the current trial link of the standard court trial transcript template.
  • the original voice data is the voice data that responds to the trial prompt text "Does the plaintiff have any supplements to your prosecution?" and the corresponding object is identified as the plaintiff, the original voice data can be translated into the original
  • the text data is filled in the position corresponding to the trial prompt text in the standard court trial transcript template to improve the input efficiency of the original text data and reduce the work burden of the clerks.
  • S404 Query the prior text database based on the original text data, and determine whether there is prior text data corresponding to the original text data.
  • the previous text data refers to the text data that has been formed and recorded in the corresponding position of the standard court trial transcript template before the original voice data is collected by the microphone. Since the court trial is a process in which the parties to the trial play a game against the same event, during the trial, the parties to the trial will discuss the same event from different perspectives. The content of the discussion may be relevant.
  • the previous text data can be understood as The text content corresponding to the content discussed in the original voice data before the original voice data is collected. Take time as an example. In a criminal case, the time node of the case is a key factor that affects the evidence chain of the case formation or the severity of the sentence. The plaintiff, lawyer, and witnesses may publish different original voice data based on these time nodes.
  • various time nodes related to the pending case are the key factors affecting the determination of liability for breach of contract.
  • the plaintiff, the lawyer and the witnesseses may publish different original voice data based on these time nodes, and determine the previous text data and the original text data according to the order of their formation time.
  • step S404 specifically includes the following steps:
  • S4041 Use a keyword extraction algorithm to extract keywords from the original text data to obtain text keywords.
  • text keywords are keywords extracted from the original text data.
  • Keyword extraction algorithm is an algorithm used to extract keywords from text data.
  • keyword extraction algorithms such as TextRank, LDA, TPR-TextRank, etc. are used, but not limited to, to perform keyword extraction on the original text data to obtain text keywords corresponding to the original text data.
  • S4042 Query the thesaurus based on the text keywords, and obtain text synonyms corresponding to the text keywords.
  • the thesaurus is used to store a pre-set database for storing synonym relationships.
  • Text synonyms are synonyms recorded in the thesaurus that have a synonym relationship with text keywords.
  • synonym groups with synonym relationships are pre-stored in the thesaurus. These synonym groups may specifically be synonym groups involved in the trial process of the case, so that the server can query the text keywords extracted from the original text data. Corresponding text keywords to help expand the scope of the query in the subsequent query process.
  • S4043 Query the prior text database according to the text keywords and text synonyms, and determine whether there is prior text data containing the text keywords or text synonyms.
  • the previous text database is a database used to store all previous text data formed before the original voice data is collected.
  • the prior text database is queried according to text keywords and text synonyms, and it is determined whether there is prior text data corresponding to the text keyword in the prior text database, or whether there is a prior text data corresponding to the text synonym. First text data to expand the search range of previous text data.
  • the speech data is translated into prior text data and stored in the prior text database for subsequent semantic analysis based on the original text data and prior text data To determine whether the two expressions have the same meaning, that is, the step S405 is subsequently executed.
  • S405 If prior text data exists, perform semantic analysis on the original text data and prior text data to determine the semantic analysis result, highlight the original text data based on the semantic analysis result, and display the trial corresponding to the semantic analysis result Prompt text, repeated execution to obtain the original voice data collected by the microphone and corresponding to the trial prompt text.
  • semantic analysis is performed on the original text data and the prior text data to determine the semantic analysis result, which specifically includes: using a semantic analysis tool to analyze the original document data and the previous text data. Perform semantic analysis on the first text data to determine whether the semantics of the original document data and the previous text data are the same or different, and obtain the corresponding semantic analysis results.
  • the speech analysis results include the same semantic analysis results and different semantic analysis results.
  • the semantic analysis tool may adopt, but is not limited to, an analysis tool created by NLP (Natural Language Processing) technology.
  • the prior text data corresponding to party A in the court trial records "I bought a batch of products worth 100,000 from B on March 10", while the original text data corresponding to party B in the court trial records "I was on March 10 Sell a batch of products with a value of 100,000 to A”.
  • the parties involved, time, subject matter, and price are all the same.
  • semantic analysis tools to analyze the original document data and the previous text
  • the previous text data corresponding to party A in the court trial recorded "I purchased a batch of products worth 100,000 from B on March 10”
  • the original text data corresponding to party B in the court trial recorded "I was on March 8. No. Selling a batch of products with a value of 150,000 to A”.
  • the parties involved and the subject matter are the same, the time and price are not the same, and the opinions described by the two are determined Different, obtain different semantic analysis results.
  • the original text data is highlighted according to the semantic analysis result, and the trial prompt text corresponding to the semantic analysis result is displayed, which specifically refers to whether the original text data and the previous text data have the same semantics or different semantics according to the semantic analysis result , So as to determine whether the original text data corresponding to the current trial link is highlighted in different ways in response to the facts determined by the two parties, the focus of the dispute and whether the description is inconsistent, so that the judge can highlight the processing results during the trial Understanding the above situation will help reduce the workload of the judge in the trial process, thereby reducing the workload, and display the trial prompt information corresponding to the semantic analysis result, which will help speed up the trial progress of the court trial.
  • step S405 specifically includes the following steps:
  • the semantic analysis result is the same, and the object identifier corresponding to the original text data and the previous text data is the same identifier, it means that the speaker who spoke the original text data and the previous text data is the same person, and the original text data and The semantics of the previous text data is the same, and there is no conflict between the previous and the next. It can be understood as a discussion without special circumstances. Therefore, the original text data is not highlighted, and the next trial link corresponding to the standard court trial transcript template is displayed. For the corresponding trial prompt text, repeat the steps of acquiring the original voice data corresponding to the trial prompt text collected by the microphone in step S401 and the subsequent steps.
  • the first highlighting mode is a preset mode for highlighting the inconsistent content discussed by the same speaker.
  • the first highlighting mode can adopt font color, background color, bold, slanted or underlined, etc. The form is highlighted.
  • the semantic analysis results are different, and the object identifiers corresponding to the original text data and the previous text data are the same, indicating that the speaker who spoke the original text data and the previous text data is the same person, and the original text data and The semantics of the previous text data are different, and there is a problem of contradictory statements. It is very likely that the product was produced because of the special situation of the speaker lying. Therefore, the first highlighting mode needs to be used to highlight the original text data.
  • the second highlighting mode is used to highlight the original text data, and display the information including the non-dispute prompt information Trial reminder text.
  • the second highlighting mode is a preset mode for highlighting content that is not contradictory to different speakers’ discussions. Understandably, the second highlighting mode is a mode different from the first highlighting mode. It can be highlighted in the form of font color, background color, bold, slanted, or underlined.
  • the second highlighting mode is required to highlight the original text data. Displaying the trial reminder text that includes non-controversial reminder information can help determine the undisputed facts in the court trial process, and better control the guiding issues in the court trial process, helping to reduce the workload of the judge in the trial process, thereby Reduce the workload.
  • the third highlighting mode is a preset mode for highlighting conflicting content discussed by different speakers. Understandably, this third highlighting mode is the same as the previous first highlighting mode and second highlighting mode. Modes with different display modes can also be highlighted in the form of font color, background color, bold, oblique, or underlined.
  • the third highlight mode can be used to highlight the original text data.
  • Display the trial prompt text containing the prompt information of the dispute focus which helps to determine the focus of the dispute in the court hearing process, and then better control the guiding problems in the court hearing process, which helps to reduce the workload of the judge during the trial process, thereby reducing Work load.
  • the server can also use the voice polygraph model preset on the server to process the original voice data to obtain the probability of lying. If the probability of lying is greater than With the preset probability threshold, the lie display mode is used to highlight the original text data, so that the judge knows whether the parties in the trial are lying in a timely manner during the trial, so as to ensure a fair and just trial of the case.
  • the voice test model may be a model used on a voice tester currently publicly available on the market, so as to determine the probability that the original voice data spoken by the speaker is a lie based on the voice frequency or voice tone contained in the original voice data .
  • the preset probability threshold is a preset threshold used to evaluate whether the probability of determining a lie is reached.
  • the lie display mode is a preset mode for highlighting the original text data with a higher probability of lying.
  • the standard court trial record template can be used to determine whether there is a trial prompt text corresponding to the next trial link; if there is a trial corresponding to the next trial link
  • prompting text repeat the steps to obtain the original voice data corresponding to the trial prompt text collected by the microphone and the subsequent steps (ie steps S402, S403); if there is no trial prompt text corresponding to the next trial session, the court is deemed to be heard
  • a court trial record file is formed based on all the original text data filled in the corresponding position in the standard court trial record template, and the court trial record file is stored in the database so that the judge can make a judgment document based on the court trial record file.
  • the intelligent assisted trial method provided in this embodiment, after the original voice data corresponding to the trial prompt text corresponding to the current trial session is collected through a microphone in real time, it is determined according to the voiceprint recognition result of the voiceprint recognition of the original voice data Object identification to determine the identity of the speaker corresponding to the original voice data; the original text data obtained by the text translation of the original voice data and the object identification are stored in the corresponding position of the standard court trial transcript template, thereby improving the input efficiency of the original text data , No need for clerks to enter verbatim, reducing the work burden of clerks.
  • the original text data is highlighted, and the trial prompt text corresponding to the semantic analysis result is displayed , which enables judges to understand the special situations corresponding to different semantic analysis results according to the highlighted processing results during the court trial, which helps to reduce the workload of the judges during the court trial, thereby reducing the workload and displaying the trial corresponding to the semantic analysis result
  • the prompt information helps to speed up the trial progress of the court trial and improve the efficiency of the trial.
  • an intelligent auxiliary trial device is provided, and the intelligent auxiliary trial device corresponds one-to-one with the intelligent auxiliary trial method in the foregoing embodiment.
  • the detailed description of each functional module of the intelligent auxiliary trial device is as follows:
  • the case description information obtaining module 501 is used to obtain the court trial transcript file corresponding to the case to be judged from the database, and extract the case description information from the court trial transcript file;
  • the target word segmentation result acquisition module 502 is configured to use word segmentation tools to segment the case description information to obtain a target word segmentation result, and the target word segmentation result includes multiple target word segmentation;
  • the target keyword determining module 503 is configured to query a keyword database based on each target word segmentation, and determine the original keywords stored in the keyword database that match the target word segmentation as the target keyword;
  • the target case type determination module 504 is configured to query the case type information database according to the target keyword, and obtain at least one target case type that matches the target keyword;
  • the target push law acquisition module 505 acquires prior knowledge matching at least one target case type from the prior knowledge base, and obtains the target push law based on the prior knowledge;
  • the recommended trial viewpoint acquisition module 506 is used to perform semantic analysis on case description information using a semantic analysis model to obtain standardized information, query the trial viewpoint database based on the standardized information, and obtain corresponding recommended trial viewpoints;
  • the trial suggestion file acquisition module 507 is used to push the law and recommended trial opinions according to the target, and acquire and display the trial suggestion file.
  • the intelligent auxiliary trial device further includes:
  • the original voice data acquisition module is used to display the trial prompt text corresponding to the current trial link in the standard court trial transcript template on the client, and obtain the original voice data collected by the microphone and corresponding to the trial prompt text;
  • the object identification acquisition module is used to perform voiceprint recognition on the original voice data, obtain the voiceprint recognition result, and determine the object identifier corresponding to the original voice data according to the voiceprint recognition result;
  • the original text data acquisition module is used to translate the original speech data, obtain the original text data corresponding to the object identification, and store the object identification and the original text data in the corresponding position of the standard court trial transcript template;
  • the prior text data judgment module is used to query the prior text database based on the original text data to determine whether there is prior text data corresponding to the original text data;
  • the highlight processing module is used to perform semantic analysis on the original text data and previous text data if there is prior text data, determine the semantic analysis result, and perform highlight processing on the original text data according to the semantic analysis result, display and semantic analysis
  • the trial prompt text corresponding to the result is repeatedly executed to obtain the original voice data corresponding to the trial prompt text collected by the microphone;
  • the transcript file acquisition module is used to repeatedly execute the trial prompt text corresponding to the next trial link in the standard court trial transcript template if there is no prior text data, and obtain the corresponding trial prompt text collected by the microphone The original voice data, until there is no trial prompt text corresponding to the next trial link, obtain the court trial transcript file and store the court trial transcript file in the database.
  • the object identification acquisition module includes:
  • the voiceprint feature extraction and judgment unit is used to perform voiceprint feature extraction on the original voice data by using the voiceprint feature extraction algorithm, obtain the target voiceprint feature, and determine whether there is a standard voiceprint feature corresponding to the target voiceprint feature;
  • the first object identification determining unit is configured to determine the object identification corresponding to the original voice data according to the identity identification corresponding to the standard voiceprint feature if there is a standard voiceprint feature;
  • the second object identifier determining unit is configured to determine the object identifier corresponding to the original voice data according to the microphone identifier corresponding to the original voice data if there is no standard voiceprint feature.
  • the prior text data judgment module includes:
  • the text keyword acquisition unit is used to extract keywords from the original text data using a keyword extraction algorithm to obtain text keywords;
  • the text synonym acquisition unit is used to query the thesaurus based on text keywords to obtain text synonyms corresponding to the text keywords;
  • the prior text query judgment unit is used to query the prior text database based on text keywords and text synonyms, and determine whether there is prior text data containing text keywords or text synonyms;
  • the first judgment processing unit is configured to determine that there is prior text data corresponding to the original text data if there is prior text data containing text keywords or text synonyms;
  • the second judgment processing unit is configured to determine that there is no prior text data corresponding to the original text data if there is no prior text data containing text keywords or text synonyms.
  • the highlight processing module includes:
  • the first display processing unit is used for if the semantic analysis result is the same, and the original text data and the object identifier corresponding to the previous text data are the same identifier, the original text data is not highlighted, and the standard court trial record template is displayed. Trial prompt text corresponding to the next trial session;
  • the second display processing unit is used to perform highlight processing on the original text data using the first highlight mode if the semantic analysis results are different, and the original text data and the object identifier corresponding to the previous text data are the same identifier, and display Trial reminder text including contradictory reminder information;
  • the third display processing unit is configured to use the second highlighting mode to highlight the original text data if the semantic analysis results are the same, and the original text data and the object identifier corresponding to the previous text data are not the same identifier, and display Trial reminder text including no dispute reminder information;
  • the fourth display processing unit is used to perform highlight processing on the original text data using the third highlighting mode if the semantic analysis results are different and the object identifiers corresponding to the original text data and the previous text data are not the same identifier, Display the trial reminder text including the reminder of the focus of the dispute.
  • the target word segmentation result acquisition module includes:
  • the text segmentation processing unit is used to use the search engine mode of the stuttering word segmentation tool to segment the case description information to obtain the text segmentation result.
  • the text segmentation result includes N first-level word segmentation;
  • the first optimization processing unit is configured to: if the superposition of consecutive k-1 first-level participles in any continuous k first-level participles is equal to the k-th first-level participle, and there are at least two consecutive k-1 first-level participles The combination of graded participles is equal to the k-th first-level participle, and only at least two first-level participles that are combined and equal to the k-th first-level participle are retained as target participles to obtain the target segmentation result;
  • the second optimization processing unit is configured to: if the superposition of k-1 consecutive first-level participles in any continuous k first-level participles is equal to the k-th first-level participle, and there are no at least two one in any continuous k first-level participles The combination of level participle is equal to the k-th first-level participle, then delete the first k-1 first-level participles, keep the k-th first-level participle as the target participle, and obtain the target participle result;
  • the third optimization processing unit is used to delete the k-th first-level participle and keep the first k-1 if the combination of k-1 first-level participles in any continuous k first-level participles is equal to the k-th first-level participle
  • the first-level word segmentation is used as the target word segmentation to obtain the target word segmentation result.
  • the target keyword determination module includes:
  • the target synonym obtaining unit is used to query the thesaurus according to the target word segmentation, and obtain the target synonym corresponding to the target word segmentation;
  • the target keyword determining unit is used to query the keyword database according to the target word segmentation and target synonyms, and determine whether there is an original keyword that matches the target word segmentation or target synonym. If the original keyword exists, the original keyword is determined as the target key word.
  • the intelligent auxiliary trial device further includes:
  • the update task acquisition module is used to acquire the data update task, which includes the original case type and task update time;
  • the first data acquisition module is used to determine the change time if the judgment basis corresponding to the original case type is changed after the task update time, and obtain the history corresponding to the original case type between the change time and the current system time Judgment data, determine the data to be processed based on historical judgment data;
  • the second data acquisition module is used to obtain historical judgment data corresponding to the original case type in the preset period before the current time of the system if the judgment basis corresponding to the original case type does not change after the task update time , Determine the data to be processed based on historical judgment data;
  • a priori knowledge base building module used to extract prior information from the data to be processed, and build a priori knowledge base corresponding to the original case type based on the prior information
  • the trial viewpoint database building module is used to extract historical description information and historical trial viewpoints from the data to be processed, and construct a trial viewpoint database corresponding to the original case type based on the historical description information and historical trial viewpoints.
  • the data to be processed includes the target area;
  • the historical judgment data corresponding to the same target area and the original case type is determined as the data to be processed
  • the historical judgment data corresponding to the same original case type is determined as the data to be processed.
  • Each module in the above-mentioned intelligent auxiliary trial device can be implemented in whole or in part by software, hardware and a combination thereof.
  • the foregoing modules may be embedded in the form of hardware or independent of the processor in the computer device, or may be stored in the memory of the computer device in the form of software, so that the processor can call and execute the operations corresponding to the foregoing modules.
  • a computer device is provided.
  • the computer device may be a server, and its internal structure diagram may be as shown in FIG. 6.
  • the computer equipment includes a processor, a memory, a network interface and a database connected through a system bus. Among them, the processor of the computer device is used to provide calculation and control capabilities.
  • the memory of the computer device includes a non-volatile storage medium and an internal memory.
  • the non-volatile storage medium stores an operating system, computer readable instructions, and a database.
  • the internal memory provides an environment for the operation of the operating system and computer-readable instructions in the non-volatile storage medium.
  • the database of the computer equipment is used to store data used or generated during the execution of the intelligent auxiliary trial method.
  • the network interface of the computer device is used to communicate with an external terminal through a network connection.
  • the computer-readable instruction is executed by the processor to realize an intelligent auxiliary trial method.
  • a computer device including a memory, a processor, and computer-readable instructions stored in the memory and capable of running on the processor.
  • the processor executes the computer-readable instructions to realize the intelligence in the above-mentioned embodiments.
  • the auxiliary trial method such as shown in Figure 2- Figure 4, in order to avoid repetition, will not be repeated here.
  • the processor executes the computer-readable instructions, the functions of the modules/units of the above-mentioned intelligent auxiliary trial device are realized.
  • the modules are shown in FIG.
  • one or more non-volatile readable storage media storing computer readable instructions are provided.
  • the computer readable storage medium stores computer readable instructions, and the computer readable instructions are stored by one or more
  • the one or more processors are executed to implement the intelligent auxiliary judgment method in the above-mentioned embodiment, for example, as shown in Figs. 2 to 4, in order to avoid repetition, details are not repeated here.
  • the functions of the modules/units of the above-mentioned intelligent auxiliary trial device when the computer-readable instructions are executed by the processor are the modules shown in FIG. 5, and to avoid repetition, the description will be different here.
  • Non-volatile memory may include read only memory (ROM), programmable ROM (PROM), electrically programmable ROM (EPROM), electrically erasable programmable ROM (EEPROM), or flash memory.
  • ROM read only memory
  • PROM programmable ROM
  • EPROM electrically programmable ROM
  • EEPROM electrically erasable programmable ROM
  • Volatile memory may include random access memory (RAM) or external cache memory.
  • RAM is available in many forms, such as static RAM (SRAM), dynamic RAM (DRAM), synchronous DRAM (SDRAM), double data rate SDRAM (DDRSDRAM), enhanced SDRAM (ESDRAM), synchronous chain Channel (Synchlink) DRAM (SLDRAM), memory bus (Rambus) direct RAM (RDRAM), direct memory bus dynamic RAM (DRDRAM), and memory bus dynamic RAM (RDRAM), etc.

Landscapes

  • Engineering & Computer Science (AREA)
  • Business, Economics & Management (AREA)
  • Theoretical Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Tourism & Hospitality (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Health & Medical Sciences (AREA)
  • Technology Law (AREA)
  • Data Mining & Analysis (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Economics (AREA)
  • Databases & Information Systems (AREA)
  • Human Resources & Organizations (AREA)
  • Marketing (AREA)
  • Primary Health Care (AREA)
  • Strategic Management (AREA)
  • General Business, Economics & Management (AREA)
  • Machine Translation (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

An intelligent auxiliary judgment method and apparatus, and a computer device and a storage medium. The method comprises: acquiring a trial transcript file corresponding to a case to be judged, and extracting case description information from the trial transcript file (S201); performing word segmentation on the case description information by means of a word segmentation tool to acquire a target word segmentation result (S202); querying a keyword base on the basis of each target segmented word, and determining a target keyword (S203); querying a case type information base according to the target keyword, and acquiring at least one matching target case type (S204); acquiring, from a priori knowledge base, priori knowledge matching the at least one target case type, and processing the priori knowledge by means of an intelligent legal provision pushing model to acquire a target pushed legal provision (S205); performing semantic analysis on the case description information by means of a semantic analysis model to acquire corresponding recommended judgment viewpoints (S206); and acquiring and displaying, according to the target pushed legal provision and the recommended judgment viewpoints, a judgment suggestion file (S207), so as to reduce the workload of judges during a judgment process, thereby reducing the burden of work.

Description

智能辅助审判方法、装置、计算机设备及存储介质Intelligent auxiliary trial method, device, computer equipment and storage medium
本申请以2019年8月16日提交的申请号为201910756628.7,名称为“智能辅助审判方法、装置、计算机设备及存储介质”的中国申请为基础,并要求其优先权。This application is based on the Chinese application filed on August 16, 2019 with the application number 201910756628.7 and titled "Intelligent Aided Trial Method, Device, Computer Equipment and Storage Medium", and claims its priority.
技术领域Technical field
本申请涉及人工智能技术领域,尤其涉及一种智能辅助审判方法、装置、计算机设备及存储介质。This application relates to the field of artificial intelligence technology, in particular to an intelligent auxiliary trial method, device, computer equipment and storage medium.
背景技术Background technique
随着社会经济的发展与法治日趋完善,人民群众的司法需求日益增长,法院立案的案件越来越多。法官审理案件的案件量逐渐增加,存在工作量超负荷的问题,而且作为判案依据的法律法规不断更新,也使得法官判案的难度增大,工作量过大或者审判难度增大均容易导致法官判案出错率提升。随着法官审理案件的案件量逐渐增加,书记员参与庭审的案件量也逐渐增加,书记员手动录入庭审信息的过程中,录入效率低,使得书记员的工作负荷大。With the development of society and economy and the improvement of the rule of law, the people's judicial needs are increasing, and more and more cases are registered by the courts. The number of cases handled by judges is gradually increasing, and there is a problem of workload overload, and the continuous updating of laws and regulations as the basis for judgments has also made it more difficult for judges to decide cases. Excessive workload or increased difficulty of trial can easily lead to The rate of judges making mistakes has increased. As the number of cases handled by judges has gradually increased, the number of cases involving clerks in court hearings has gradually increased. In the process of manually entering court information by clerks, the input efficiency is low, which makes the work load of clerks heavy.
有鉴于此,发明人经过深入研究,提出一种可有助于减轻司法工作人员(包括书记员和法官)在案件审判过程中的工作量的智能辅助审判系统和应用在该智能辅助审判系统上的智能辅助审判方法。In view of this, after in-depth research, the inventor proposes an intelligent auxiliary trial system that can help to reduce the workload of judicial staff (including clerks and judges) in the trial process of cases and is applied to the intelligent auxiliary trial system Intelligent auxiliary trial method.
发明内容Summary of the invention
本申请实施例提供一种智能辅助审判方法、装置、计算机设备及存储介质,以解决当前司法工作人员判案过程中工作量较大的问题。The embodiments of the present application provide an intelligent auxiliary trial method, device, computer equipment, and storage medium to solve the current problem of large workload of judicial staff in the judgment process.
一种智能辅助审判方法,包括:An intelligent auxiliary trial method, including:
从数据库中获取待判案件对应的庭审笔录文件,从所述庭审笔录文件中提取案件描述信息;Obtain the court trial transcript file corresponding to the case to be judged from the database, and extract the case description information from the court trial transcript file;
采用分词工具对所述案件描述信息进行分词,获取目标分词结果,所述目标分词结果包括多个目标分词;Use a word segmentation tool to segment the case description information to obtain a target segmentation result, where the target segmentation result includes multiple target segmentation;
基于每一所述目标分词查询所述关键词库,将所述关键词库中存储的与所述目标分词相匹配的原始关键词确定为目标关键词;Query the keyword database based on each target word segmentation, and determine the original keyword stored in the keyword database that matches the target word segmentation as the target keyword;
根据所述目标关键词查询所述案件类型信息库,获取与所述目标关键词相匹配的至少一个目标案件类型;Query the case type information database according to the target keyword, and obtain at least one target case type that matches the target keyword;
从先验知识库中获取与至少一个所述目标案件类型相匹配的先验知识,根据所述先验知识,获取目标推送法条;Acquire prior knowledge that matches at least one of the target case types from the prior knowledge base, and obtain the target push law according to the prior knowledge;
采用语义分析模型对所述案件描述信息进行语义分析,获取标准化信息,基于所述标准化信息查询审判观点库,获取对应的推荐审判观点;Use a semantic analysis model to perform semantic analysis on the case description information, obtain standardized information, query the trial opinion database based on the standardized information, and obtain corresponding recommended trial opinions;
根据所述目标推送法条和所述推荐审判观点,获取并显示审判建议文件。According to the target push law and the recommended trial viewpoint, obtain and display trial suggestion documents.
一种智能辅助审判装置,包括:An intelligent auxiliary trial device, including:
案件描述信息获取模块,用于从数据库中获取待判案件对应的庭审笔录文件,从所述庭审笔录文件中提取案件描述信息;The case description information acquisition module is used to obtain the court trial transcript file corresponding to the case to be judged from the database, and extract the case description information from the court trial transcript file;
目标分词结果获取模块,用于采用分词工具对所述案件描述信息进行分词,获取目标分词结果,所述目标分词结果包括多个目标分词;The target word segmentation result acquisition module is configured to use a word segmentation tool to segment the case description information to obtain a target word segmentation result, where the target word segmentation result includes multiple target word segmentation;
目标关键词确定模块,用于基于每一所述目标分词查询所述关键词库,将所述关键词库中存储的与所述目标分词相匹配的原始关键词确定为目标关键词;The target keyword determining module is configured to query the keyword database based on each target word segmentation, and determine the original keywords stored in the keyword database that match the target word segmentation as target keywords;
目标案件类型确定模块,用于根据所述目标关键词查询所述案件类型信息库,获取与所述目标关键词相匹配的至少一个目标案件类型;The target case type determination module is configured to query the case type information database according to the target keyword, and obtain at least one target case type that matches the target keyword;
目标推送法条获取模块,用于从先验知识库中获取与至少一个所述目标案件类型相匹配的先验知识,采用智能法条推送模型对所述先验知识进行处理,获取目标推送法条;The target push law article acquisition module is used to acquire prior knowledge matching at least one of the target case types from the prior knowledge base, and use the intelligent law article push model to process the prior knowledge to obtain the target push method Article;
推荐审判观点获取模块,用于采用语义分析模型对所述案件描述信息进行语义分析,获取标准化信息,基于所述标准化信息查询审判观点库,获取对应的推荐审判观点;The recommended trial viewpoint acquisition module is configured to use a semantic analysis model to perform semantic analysis on the case description information, obtain standardized information, query the trial viewpoint database based on the standardized information, and obtain corresponding recommended trial viewpoints;
审判建议文件获取模块,用于根据所述目标推送法条和所述推荐审判观点,获取并显示审判建议文件。The trial suggestion file acquisition module is used to push the law and the recommended trial viewpoint according to the target, and acquire and display the trial suggestion file.
一种计算机设备,包括存储器、处理器以及存储在所述存储器中并可在所述处理器上运行的计算机可读指令,所述处理器执行所述计算机可读指令时实现如下步骤:A computer device includes a memory, a processor, and computer-readable instructions stored in the memory and capable of running on the processor, and the processor implements the following steps when the processor executes the computer-readable instructions:
从数据库中获取待判案件对应的庭审笔录文件,从所述庭审笔录文件中提取案件描述信息;Obtain the court trial transcript file corresponding to the case to be judged from the database, and extract the case description information from the court trial transcript file;
采用分词工具对所述案件描述信息进行分词,获取目标分词结果,所述目标分词结果包括多个目标分词;Use a word segmentation tool to segment the case description information to obtain a target segmentation result, where the target segmentation result includes multiple target segmentation;
基于每一所述目标分词查询所述关键词库,将所述关键词库中存储的与所述目标分词相匹配的原始关键词确定为目标关键词;Query the keyword database based on each target word segmentation, and determine the original keyword stored in the keyword database that matches the target word segmentation as the target keyword;
根据所述目标关键词查询所述案件类型信息库,获取与所述目标关键词相匹配的至少一个目标案件类型;Query the case type information database according to the target keyword, and obtain at least one target case type that matches the target keyword;
从先验知识库中获取与至少一个所述目标案件类型相匹配的先验知识,根据所述先验知识,获取目标推送法条;Acquire prior knowledge that matches at least one of the target case types from the prior knowledge base, and obtain the target push law according to the prior knowledge;
采用语义分析模型对所述案件描述信息进行语义分析,获取标准化信息,基于所述标准化信息查询审判观点库,获取对应的推荐审判观点;Use a semantic analysis model to perform semantic analysis on the case description information, obtain standardized information, query the trial opinion database based on the standardized information, and obtain corresponding recommended trial opinions;
根据所述目标推送法条和所述推荐审判观点,获取并显示审判建议文件。According to the target push law and the recommended trial viewpoint, obtain and display trial suggestion documents.
一个或多个存储有计算机可读指令的非易失性可读存储介质,所述计算机可读存储介质存储有计算机可读指令,所述计算机可读指令被一个或多个处理器执行时,使得所述一个或多个处理器执行如下步骤:One or more non-volatile readable storage media storing computer readable instructions, the computer readable storage medium storing computer readable instructions, and when the computer readable instructions are executed by one or more processors, Make the one or more processors execute the following steps:
从数据库中获取待判案件对应的庭审笔录文件,从所述庭审笔录文件中提取案件描述信息;Obtain the court trial transcript file corresponding to the case to be judged from the database, and extract the case description information from the court trial transcript file;
采用分词工具对所述案件描述信息进行分词,获取目标分词结果,所述目标分词结果包括多个目标分词;Use a word segmentation tool to segment the case description information to obtain a target segmentation result, where the target segmentation result includes multiple target segmentation;
基于每一所述目标分词查询所述关键词库,将所述关键词库中存储的与所述目标分词相匹配的原始关键词确定为目标关键词;Query the keyword database based on each target word segmentation, and determine the original keyword stored in the keyword database that matches the target word segmentation as the target keyword;
根据所述目标关键词查询所述案件类型信息库,获取与所述目标关键词相匹配的至少一个目标案件类型;Query the case type information database according to the target keyword, and obtain at least one target case type that matches the target keyword;
从先验知识库中获取与至少一个所述目标案件类型相匹配的先验知识,根据所述先验知识,获取目标推送法条;Acquire prior knowledge that matches at least one of the target case types from the prior knowledge base, and obtain the target push law according to the prior knowledge;
采用语义分析模型对所述案件描述信息进行语义分析,获取标准化信息,基于所述标准化信息查询审判观点库,获取对应的推荐审判观点;Use a semantic analysis model to perform semantic analysis on the case description information, obtain standardized information, query the trial opinion database based on the standardized information, and obtain corresponding recommended trial opinions;
根据所述目标推送法条和所述推荐审判观点,获取并显示审判建议文件。According to the target push law and the recommended trial viewpoint, obtain and display trial suggestion documents.
本申请的一个或多个实施例的细节在下面的附图及描述中提出。本申请的其他特征和优点将从说明书、附图以及权利要求书变得明显。The details of one or more embodiments of the present application are set forth in the following drawings and description. Other features and advantages of this application will become apparent from the description, drawings and claims.
附图说明Description of the drawings
为了更清楚地说明本申请实施例的技术方案,下面将对本申请实施例的描述中所需要使用的附图作简单地介绍,显而易见地,下面描述中的附图仅仅是本申请的一些实施例,对于本领域普通技术人员来讲,在不付出创造性劳动性的前提下,还可以根据这些附图获得其他的附图。In order to explain the technical solutions of the embodiments of the present application more clearly, the following will briefly introduce the drawings that need to be used in the description of the embodiments of the present application. Obviously, the drawings in the following description are only some embodiments of the present application. For those of ordinary skill in the art, other drawings can be obtained based on these drawings without creative labor.
图1是本申请一实施例中智能辅助审判方法的一应用环境示意图;Fig. 1 is a schematic diagram of an application environment of an intelligent assisted trial method in an embodiment of the present application;
图2是本申请一实施例中智能辅助审判方法的一流程图;Fig. 2 is a flow chart of the intelligent assisted trial method in an embodiment of the present application;
图3是本申请一实施例中智能辅助审判方法的另一流程图;FIG. 3 is another flowchart of the intelligent assisted trial method in an embodiment of the present application;
图4是本申请一实施例中智能辅助审判方法的另一流程图;FIG. 4 is another flowchart of the intelligent assisted trial method in an embodiment of the present application;
图5是本申请一实施例中智能辅助审判装置的一原理框图;Fig. 5 is a functional block diagram of the intelligent auxiliary trial device in an embodiment of the present application;
图6是本申请一实施例中计算机设备的一示意图。Fig. 6 is a schematic diagram of a computer device in an embodiment of the present application.
具体实施方式detailed description
下面将结合本申请实施例中的附图,对本申请实施例中的技术方案进行清楚、完整地描述,显然,所描述的实施例是本申请一部分实施例,而不是全部的实施例。基于本申请中的实施例,本领域普通技术人员在没有作出创造性劳动前提下所获得的所有其他实施例,都属于本申请保护的范围。The technical solutions in the embodiments of the present application will be described clearly and completely in conjunction with the accompanying drawings in the embodiments of the present application. Obviously, the described embodiments are part of the embodiments of the present application, rather than all of them. Based on the embodiments in this application, all other embodiments obtained by those of ordinary skill in the art without creative work fall within the protection scope of this application.
本申请实施例提供的智能辅助审判方法,该智能辅助审判方法可应用如图1所示的应用环境中。具体地,该智能辅助审判方法应用在智能辅助审判系统中,该智能辅助审判系统包括如图1所示的客户端、麦克风和服务器,客户端和麦克风均与服务器通过网络进行通信,其中,客户端又称为用户端,是指与服务器相对应,为客户提供本地服务的程序。客户端可安装在但不限于各种个人计算机、笔记本电脑、智能手机、平板电脑和便携式可穿戴设备上。该客户端是用于实现与庭审当事人进行人机交互的终端,该麦克风用于采集庭审当事人的语音数据的设备。本实施例中,法官所在的审判席、原告所在的原告席、被告所在的被告席和第三人所在的坐席上均设有客户端和麦克风,而书记员所在的坐席上设有客户端,证人所在的证人席上设有麦克风。服务器可以用独立的服务器或者是多个服务器组成的服务器集群来实现。The intelligent auxiliary trial method provided by the embodiment of the present application can be applied to the application environment shown in FIG. 1. Specifically, the intelligent assisted trial method is applied in an intelligent assisted trial system. The intelligent assisted trial system includes a client, a microphone, and a server as shown in FIG. 1. Both the client and the microphone communicate with the server through a network, where the client The client is also called the client, which refers to the program that corresponds to the server and provides local services to the client. The client can be installed on, but not limited to, various personal computers, laptops, smart phones, tablet computers, and portable wearable devices. The client is a terminal used to implement human-computer interaction with the parties in the court trial, and the microphone is used to collect voice data of the parties in the court trial. In this embodiment, clients and microphones are installed on the judge’s bench, the plaintiff’s seat, the defendant’s seat, and the third party’s seat, while the clerk’s seat has a client. There is a microphone on the witness stand where the witness is. The server can be implemented as an independent server or a server cluster composed of multiple servers.
在一实施例中,该智能辅助审判系统中的客户端与服务器通过网络进行通信,用于实现对庭审笔录文件进行处理,以获取判案建议结果,以使法官根据判案建议结果进行判案建议结果进行相应的判案处理,从而减轻法官的工作负荷,提高其工作效率。如图2所示,提供一种智能辅助审判方法,以该方法应用在图1中的服务器为例进行说明,包括如下步骤:In one embodiment, the client and the server in the intelligent auxiliary trial system communicate through the network to realize the processing of the court trial transcripts to obtain the result of the judgment proposal, so that the judge can make the judgment based on the result of the judgment proposal It is recommended that the results be judged accordingly, so as to reduce the workload of judges and improve their work efficiency. As shown in Figure 2, an intelligent assisted trial method is provided. The method is applied to the server in Figure 1 as an example for description, including the following steps:
S201:从数据库中获取待判案件对应的庭审笔录文件,从庭审笔录文件中提取案件描述信息。S201: Obtain court trial transcript files corresponding to the case to be judged from the database, and extract case description information from the court trial transcript files.
其中,待判案件是指需要进行审判处理的案件。庭审笔录文件是庭审中一切到庭的庭审当事人所进行的诉讼活动的全程记录的文件。本实施例中的庭审笔录文件是上述实施例中形成并存储在数据库中的庭审笔录文件。案件描述信息是指从庭审笔录文件中提取出来的与待判案件相关的会影响审判结果的描述信息。该庭审笔录文件是在庭审过程中基于标准法庭审理笔录模板采集庭审当事人对待判案件的事实和证据等信息的文件。标准法庭审理笔录模板是预先设置的与庭审过程中各个流程相对应的用于记录案件信息的笔录模板。Among them, the pending cases refer to cases that need to be tried and handled. The court hearing transcript is a document that records the entire litigation activities carried out by all the parties to the court hearing during the court hearing. The court trial transcript file in this embodiment is the court trial transcript file formed in the foregoing embodiment and stored in the database. Case description information refers to the description information extracted from the court trial transcripts that is related to the pending case and will affect the trial result. This court trial transcript is a document that collects information such as facts and evidence of the case to be judged by the parties in the court trial based on the standard court trial transcript template during the court trial. The standard court trial transcript template is a pre-set transcript template for recording case information corresponding to each process in the court trial process.
由于庭审笔录文件是各级法院依据标准法庭审理笔录模板采集到的与待判案件相对应的笔录文件,因此,该庭审笔录文件具有标准的格式,可有助于快速提取案件描述信息。例如,庭审笔录文件中记录有如下内容:双方无争议的事实有……,双方争议的焦点有……;在从庭审笔录文件中获取案件描述信息过程中,可采用文字匹配算法匹配到“原被告双方无争议的事实有”和“双方争议的焦点有”这些字段,将这些字段之后的内容分别确定为法院认定的事实和双方争议的焦点,这些内容即为影响审判结果的描述信息,将其作为案件描述信息。Since the trial transcript file is a transcript file corresponding to the pending case collected by the courts at all levels according to the standard court trial transcript template, the court trial transcript file has a standard format, which can help quickly extract case description information. For example, the transcript of a court hearing records the following content: the facts that the two parties have no dispute are..., the focus of the dispute between the two parties is...; in the process of obtaining case description information from the court hearing transcript, a text matching algorithm can be used to match the original The defendants have undisputed facts" and "the focus of disputes between the two parties have" fields. The content after these fields are determined as the facts ascertained by the court and the focus of the dispute between the parties. These content are the descriptive information that affects the outcome of the trial. It serves as case description information.
可以理解地,在从庭审笔录文件中提取出案件描述信息之后,还可将这些案件描述信息填充到判决文书模板的相应位置,在依据本实施例最终确定的判案建议结果获取最终依据的判案依据(即具体适用法条)和判案观点后,将该判案依据和判案观点一并填充到判决文书模板的相应位置,以快速获取对应的目标判决文件。该目标判决文件是最终形成的待判案件对应的判决文件,如民事判决书和刑事判决书等。Understandably, after extracting the case description information from the court hearing transcript, the case description information can also be filled into the corresponding position of the judgment document template, and the final judgment recommendation result determined according to this embodiment is obtained. After the case basis (that is, the specific applicable legal provisions) and the judgment point of view, the case basis and the judgment point of view are filled into the corresponding position of the judgment document template to quickly obtain the corresponding target judgment document. The target judgment document is the final judgment document corresponding to the pending case, such as a civil judgment and a criminal judgment.
S202:采用分词工具对案件描述信息进行分词,获取目标分词结果,目标分词结果包括多个目标分词。S202: Use a word segmentation tool to segment the case description information to obtain a target segmentation result, and the target segmentation result includes multiple target segmentation.
其中,分词工具是用于实现对文本进行中文分词的工具。中文分词(Chinese Word Segmentation)指的是将一个汉字序列切分成多个单独的词。分词是指将连续的字序列按照一定的规范重新组合成词序列的过程。目标分词结果是指案件描述信息最终形成的多个分词的结果。目标分词是指根据案件描述信息最终形成的分词。Among them, the word segmentation tool is a tool used to implement Chinese word segmentation of the text. Chinese Word Segmentation refers to the segmentation of a sequence of Chinese characters into multiple individual words. Word segmentation refers to the process of recombining consecutive word sequences into word sequences according to certain specifications. The target segmentation result refers to the result of multiple segmentation formed by the case description information. The target word segmentation refers to the word segmentation finally formed according to the case description information.
本实施例中,分词工具可以为市面常用的用于实现对中文文本进行分词的开源工具-结巴分词。采用结巴中文分词对案件描述信息进行分词,其分词过程可支持三种分词模式包括:(1)精确模式:试图 将句子最精确地切开,适合文本分析,但效率较低。(2)全模式:把句子中所有的可以成词的词语都扫描出来,分词速度快,但是不能解决歧义问题。(3)搜索引擎模式:在精确模式的基础上,对长词再次切分,将长分词切分后的短分词放在长分词之前,效率较快,但这种分词模式可能出现前面至少两个短分词与后续一个长分词之间存在语义重复,导致分词准确性不高。为保证分词效率,可通过对搜索引擎模式的分词结果进行优化,以获取分词准确率较高的目标分词结果。因此,步骤S202具体包括如下步骤:In this embodiment, the word segmentation tool may be an open source tool commonly used in the market for word segmentation of Chinese text-stuttering word segmentation. Stuttering Chinese word segmentation is used to segment the case description information. The segmentation process can support three word segmentation modes including: (1) Accurate mode: Try to cut the sentence most accurately, suitable for text analysis, but low efficiency. (2) Full mode: Scan all the words in the sentence that can be formed into words. The word segmentation speed is fast, but it cannot solve the ambiguity problem. (3) Search engine mode: On the basis of the precise mode, the long words are segmented again, and the short participles after the long word segmentation are placed before the long participles, which is more efficient, but this word segmentation mode may appear at least two first There are semantic repetitions between a short participle and a subsequent long participle, resulting in low accuracy of word segmentation. In order to ensure the efficiency of word segmentation, the word segmentation results of the search engine mode can be optimized to obtain the target word segmentation results with higher word segmentation accuracy. Therefore, step S202 specifically includes the following steps:
S2011:采用结巴分词工具的搜索引擎模式对案件描述信息进行文本分词,获取文本分词结果,文本分词结果包括N个一级分词。S2011: Use the search engine mode of the stuttering word segmentation tool to perform text segmentation on the case description information, and obtain the text segmentation result. The text segmentation result includes N first-level word segmentation.
具体地,服务器采用结巴分词工具的搜索引擎模式对案件描述信息进行文本分词,以快速获取文本分词结果,该文本分词结果可以理解为采用常规的搜索引擎模式进行分词后的结果,是优化之前的分词结果。一级分词是构成文本分词结果的分词,N为文本分词结果中一级分词的数量。Specifically, the server uses the search engine mode of the stammering word segmentation tool to perform text segmentation of the case description information to quickly obtain the text segmentation result. The text segmentation result can be understood as the result of the word segmentation using the conventional search engine mode, which is before optimization Word segmentation result. The first-level word segmentation is the word segmentation that constitutes the text segmentation result, and N is the number of first-level word segmentation in the text word segmentation result.
例如,若案件描述信息为:被告无行为能力;则采用结合分词工具的搜索引擎模式对该案件描述信息进行文本分词后,获取的文本分词结果包括如下5个一级分词:1.被告,2.无,3.行为,4.能力,5.行为能力。由上述文本分词结果可知,连续3个一级分词(从第3个一级分词到第5个一级分词)中的第3个一级分词“行为”和第4个一级分词“能力”进行结合后与第5个一级分词“行为能力”意义重复,分词结果不准确若直接基于这一文本分词结果进行后续的语义分析,可能会影响后续分析的效率和准确率。For example, if the case description information is: the defendant is incapacitated; after text segmentation is performed on the case description information using the search engine mode combined with word segmentation tools, the obtained text segmentation results include the following 5 first-level word segmentation: 1. Defendant, 2. .None, 3. Behavior, 4. Ability, 5. Behavior Ability. From the above text segmentation results, it can be seen that in three consecutive first-level participles (from the third first-level participle to the fifth first-level participle), the third first-level participle "behavior" and the fourth first-level participle "ability" After the combination, it repeats the meaning of the fifth first-level word segmentation "behavior", and the word segmentation result is inaccurate. If the subsequent semantic analysis is directly based on the text segmentation result, the efficiency and accuracy of the subsequent analysis may be affected.
S2012:若任意连续k个一级分词中连续k-1个一级分词的叠加等于第k个一级分词,且连续k-1个一级分词中存在至少两个一级分词的结合等于第k个一级分词,则仅保留结合等于第k个一级分词的至少两个一级分词作为目标分词,获取目标分词结果。S2012: If the superposition of continuous k-1 first-level participles in any continuous k first-level participles is equal to the k-th first-level participle, and the combination of at least two first-level participles in the continuous k-1 first-level participles is equal to the first For k first-level participles, only at least two first-level participles that are combined and equal to the k-th first-level participle are retained as target participles, and the target participle result is obtained.
“叠加”是指相邻的两个一级分词中,前一个一级分词的尾部的至少一个汉字,和后一个一级分词前端的至少一个汉字重合,也即两个相邻的两个一级分词可以依据重合的汉字进行拼接,仅保留一个重叠的字或词形成拼接词的过程,举例如下:连续三个一级分词分别为:“管理”、“理工”和“工作”中,“管理”和“理工”两个一级分词中重叠的汉字为“理”,“理工”和“工作”重叠的汉字为“工”。将前述三个一级分词“管理”、“理工”和“工作”进行叠加后可形成新的拼接词:“管理工作”。"Superposition" means that in two adjacent first-level participles, at least one Chinese character at the end of the previous first-level participle overlaps with at least one Chinese character at the front end of the next first-level participle, that is, two adjacent first-level participles. Grade participles can be spliced based on overlapping Chinese characters, leaving only one overlapping character or word to form a concatenated word, for example: the three consecutive first-level participles are: "Management", "Technology" and "Work" in " The overlapping Chinese character in the two first-level participles of "Management" and "Technology" is "理", and the overlapping Chinese character of "Technology" and "工作" is "工". The three first-level participles "management", "science and engineering" and "work" can be superimposed to form a new word: "management work".
“结合”是指将两个一级分词直接进行合并,无需去除重复汉字的分词合并形式。比如,两个一级分词分别为“管理”和“工作”,将前述两个一级分词进行合并后可得“管理工作”这一合成词。"Combine" refers to the direct merging of two first-level participles without removing the merged form of the participles of repeated Chinese characters. For example, the two first-level participles are "management" and "work", and the compound word "management work" can be obtained by merging the aforementioned two first-level participles.
例如,若案件描述信息为:被告在开展管理工作过程中,掌握商业秘密;则采用结巴分词工具的搜索引擎模式对该案件描述信息进行文本分词后,获取的文本分词结果为:1.被告,2.在,3.管理,4.理工,5.工作,6.管理工作,7.过程,8.中,9.掌握,10.商业,11.秘密,12.商业秘密。其中,“管理”、“理工”和“工作”的叠加等于“管理工作”,而且,“管理”和“工作”的结合等于“管理工作”,因此,仅保留“管理”和“工作”这两个一级分词作为优化后的目标分词,删除“理工”和“管理工作”这两个一级分词。For example, if the case description information is: the defendant masters trade secrets during the management work; the search engine mode of the stuttering word segmentation tool is used to segment the case description information, and the text segmentation results obtained are: 1. The defendant, 2. In, 3. Management, 4. Science and engineering, 5. Work, 6. Management work, 7. Process, 8. In, 9. Master, 10. Business, 11. Secret, 12. Trade secret. Among them, the superposition of "management", "science and engineering" and "work" equals "management work", and the combination of "management" and "work" equals "management work". Therefore, only "management" and "work" are retained. Two first-level participles are used as optimized target participles, and the two first-level participles of "science and engineering" and "management work" are deleted.
S2013:若任意连续k个一级分词中连续k-1个一级分词的叠加等于第k个一级分词,且任意连续k个一级分词中不存在至少两个一级分词的结合等于第k个一级分词,则删除前k-1个一级分词,保留第k个一级分词作为目标分词,获取目标分词结果。S2013: If the superposition of continuous k-1 first-level participles in any continuous k first-level participles is equal to the kth first-level participle, and there is no combination of at least two first-level participles in any continuous k first-level participles, it is equal to the first For k first-level participles, delete the first k-1 first-level participles, keep the k-th first-level participle as the target participle, and obtain the target participle result.
例如,若案件描述信息为:本条款仅针对有同业经验的候选人;则采用结巴分词工具的搜索引擎模式对该案件描述信息进行文本分词后,获取的文本分词结果为:1.本,2.条款,3.仅,4,针对,5.有,6,同业,7.经验,8.同业经验,9,的,10,候选,11,选人,11,候选人。其中,“候选”与“选人”的叠加等于“候选人”,且不存在结合等于“候选人”的情况,此时,删除“候选”与“选人”,仅保留“候选人”这个一级分词。For example, if the case description information is: this clause is only for candidates with industry experience; after text segmentation is performed on the case description information using the search engine mode of the stuttering word segmentation tool, the text segmentation results obtained are: 1. This, 2. . Clause, 3. Only, 4, Targeting, 5. Yes, 6, Same industry, 7. Experience, 8. Same industry experience, 9, of, 10, Candidate, 11, Pick, 11, Candidate. Among them, the superposition of "candidate" and "chosen" is equal to "candidate", and there is no situation that the combination is equal to "candidate". At this time, delete "candidate" and "choose person" and only keep the "candidate" The first level participle.
S2014:若任意连续k个一级分词中连续k-1个一级分词的结合等于第k个一级分词,则删除第k个一级分词,保留前k-1个一级分词作为目标分词,获取目标分词结果。S2014: If the combination of k-1 first-level participles in any continuous k first-level participles is equal to the k-th first-level participle, delete the k-th first-level participle and keep the first k-1 first-level participles as the target participle To get the target segmentation result.
例如,若案件描述信息为:被告在开展管理工作过程中,掌握商业秘密;则采用结巴分词工具的搜索引擎模式对该案件描述信息进行文本分词后,获取的文本分词结果为:1.被告,2.在,3.管理,4. 理工,5.工作,6.管理工作,7.过程,8.中,9.掌握,10.商业,11.秘密,12.商业秘密。其中,“商业”和“秘密”的结合等于“商业秘密”,而不存在叠加的情况,因此,仅保留“商业”和“秘密”这两个一级分词作为优化后的目标分词,删除“商业秘密”这一个一级分词。For example, if the case description information is: the defendant masters trade secrets during the management work; the search engine mode of the stuttering word segmentation tool is used to segment the case description information, and the text segmentation results obtained are: 1. The defendant, 2. In, 3. Management, 4. Science and Engineering, 5. Work, 6. Management Work, 7. Process, 8. In, 9. Master, 10. Business, 11. Secret, 12. Trade Secret. Among them, the combination of "commercial" and "secret" is equal to "commercial secret", and there is no overlap. Therefore, only the two first-level participles of "commercial" and "secret" are retained as the optimized target participles, and " "Trade secret" is a first-level participle.
本实施例中,采用分词工具的搜索引擎分词模式将案件描述信息进行分词后,对存在语义重复的连续k个一级分词进行分析,判定是否存在由连续一级分词的结合或叠加可构成连续的一级分词后紧跟的长词,并依据连续的一级分词不同的结合或叠加情况对一级分词或最后的长词进行处理,可获得更为精准的分词结果,减少重复词或无用词,以实现在保证分词效率的前提下,对分词结果进行优化,提高分词准确性。In this embodiment, after the case description information is segmented using the search engine word segmentation mode of the word segmentation tool, k consecutive first-level segmentation words with semantic repetition are analyzed to determine whether there is a combination or superposition of consecutive first-level segmentation words that can constitute a continuous The long words immediately following the first-level participles of, and the first-level participles or the final long words are processed according to the different combinations or superpositions of the continuous first-level participles to obtain more accurate word segmentation results, reducing repeated words or uselessness Word, in order to realize the optimization of the word segmentation results under the premise of ensuring the efficiency of word segmentation, and improve the accuracy of word segmentation.
S203:基于每一目标分词查询关键词库,将关键词库中存储的与目标分词相匹配的原始关键词确定为目标关键词。S203: Query the keyword database based on each target word segmentation, and determine the original keyword stored in the keyword database that matches the target word segmentation as the target keyword.
其中,关键词库是预先设置的用于存储原始关键词的数据库。该原始关键词是预先设置的用于可能影响案件类型认定的关键词。该关键词库中存储原始关键词与至少一个同义词的对应关系。本实施例中,服务器采用字符串匹配算法或者其他匹配算法,查询关键词库中是否存在该目标分词(可以为原始关键词也可以是其对应的同义词),若存在该目标分词,则将该目标分词对应的原始关键词确定为目标关键词。例如,在原始关键词库中存在A1、A2和A3这一组同义词组,A1为原始关键词,若目标分词与A1、A2和A3中的任一个相同,则将A1确定为该目标分词对应的目标关键词。Among them, the keyword library is a preset database for storing original keywords. The original keywords are pre-set keywords that may affect the identification of the case type. The corresponding relationship between the original keyword and at least one synonym is stored in the keyword library. In this embodiment, the server uses a string matching algorithm or other matching algorithms to query whether the target word segment exists in the keyword database (it can be the original keyword or its corresponding synonym), and if the target word segment exists, it The original keyword corresponding to the target word segmentation is determined as the target keyword. For example, there are synonym groups A1, A2, and A3 in the original keyword database, and A1 is the original keyword. If the target word segmentation is the same as any of A1, A2, and A3, then A1 is determined as the target word segmentation corresponding The target keywords.
在一实施例中,步骤S203具体包括如下步骤:In an embodiment, step S203 specifically includes the following steps:
S2031:根据目标分词查询同义词库,获取与目标分词相对应的目标同义词。S2031: Query the thesaurus according to the target word segmentation, and obtain the target synonyms corresponding to the target word segmentation.
其中,同义词库是用于存储预先设置的用于存储同义词关系的数据库。目标同义词是记录在同义词库中的与目标分词具有同义词关系的同义词。本实施例中,同义词库中预先存储具有同义词关系的同义词组,服务器可采用字符串匹配算法匹配到该目标分词,然后,获取与该目标分词具有同义词关系的同义词确定为目标同义词。Among them, the thesaurus is used to store a pre-set database for storing synonym relationships. The target synonyms are the synonyms recorded in the thesaurus that have a synonym relationship with the target segmentation. In this embodiment, a synonym group with a synonym relationship is pre-stored in the thesaurus, and the server can use a string matching algorithm to match the target word segmentation, and then obtain a synonym having a synonym relationship with the target word segmentation and determine it as the target synonym.
S2032:根据目标分词和目标同义词查询关键词库,判断是否存在与目标分词或者目标同义词相匹配的原始关键词,若存在原始关键词,则将原始关键词确定为目标关键词。S2032: Query the keyword database according to the target word segmentation and target synonyms, determine whether there is an original keyword matching the target word segmentation or the target synonym, and if the original keyword exists, determine the original keyword as the target keyword.
例如刑法规定“为了犯罪,准备工具、制造条件的,是犯罪预备。对于预备犯,可以比照既遂犯从轻、减轻处罚或者免除处罚”,则其中的准备工具、制造条件等关键词是认定是否为“犯罪预备”这一情形的关键词,而法庭庭审过程中,庭审当事人在口头论述时,可以采用其他同义词表述这一意思。因此,服务器需根据目标分词和目标同义词查询关键词库,判断关键词库中是否存在与目标分词或者目标同义词相匹配的原始关键词,若存在原始关键词,则将其确定为目标关键词,以扩大目标分词的搜索范围,从案件描述信息中尽可能确定更多的目标关键词,提高后续获取的判案建议结果的准确性。For example, the Criminal Law stipulates that “for the purpose of committing a crime, tools and conditions are prepared for crime. For prepared criminals, they can be given a lighter, mitigated punishment or exempt from punishment in the same way as the completed offender.” The keywords in the preparation tools and manufacturing conditions are whether to determine whether It is the key word for the situation of "criminal preparation", and during the court hearing, the parties in the court hearing can use other synonyms to express this meaning. Therefore, the server needs to query the keyword database based on the target word segmentation and target synonyms to determine whether there is an original keyword matching the target word segmentation or target synonym in the keyword database. If the original keyword exists, it will be determined as the target keyword. In order to expand the search scope of the target word segmentation, as many target keywords as possible are determined from the case description information to improve the accuracy of the subsequent judgment and suggestion results obtained.
S204:根据目标关键词查询案件类型信息库,获取与目标关键词相匹配的至少一个目标案件类型。S204: Query the case type information database according to the target keyword, and obtain at least one target case type that matches the target keyword.
其中,案件类型信息库是用于存储不同案件类型及其对应的关键词的信息库。由于案件审判过程中,不同案件类型(如刑法中的不同类型的犯罪)可能对应同一关键词,在案件类型信息库中会将每一关键词及其对应的案件类型以键值对的形式存储,以使服务器在获取目标关键词之后,可根据该目标关键词查询案件类型信息表,以将所有包含该目标关键词的案件类型确定为目标案件类型。Among them, the case type information database is an information database used to store different case types and their corresponding keywords. Since different case types (such as different types of crimes in the criminal law) may correspond to the same keyword during the trial process, each keyword and its corresponding case type will be stored in the form of key-value pairs in the case type information database , So that after obtaining the target keyword, the server can query the case type information table according to the target keyword to determine all case types containing the target keyword as the target case type.
S205:从先验知识库中获取与至少一个目标案件类型相匹配的先验知识,根据先验知识,获取目标推送法条。S205: Obtain prior knowledge that matches at least one target case type from the prior knowledge base, and obtain the target push rule based on the prior knowledge.
其中,先验知识库是基于历史判案数据提取出的先验知识构建的数据库,每一历史判案数据具体一具体案件类型。本实施例中,服务器根据至少一个案件类型查询先验知识库,从先验知识库中查询与至少一个目标案件类型相匹配的先验知识,从而进行先验知识的筛选。该先验知识包含与至少一个目标案件类型相对应的历史判案数据中的案件描述信息及相应的案件判案结果。Among them, the priori knowledge base is a database constructed based on priori knowledge extracted from historical judgment data, and each historical judgment data is specific to a specific case type. In this embodiment, the server queries the prior knowledge base according to at least one case type, and queries the prior knowledge matching the at least one target case type from the prior knowledge base, thereby performing prior knowledge screening. The prior knowledge includes case description information in historical judgment data corresponding to at least one target case type and corresponding case judgment results.
由于先验知识库存储了所有的案例知识,内容太多,若每次判案均全部向量化先验知识库中的数据,可能存在执行效率较低的问题,因此,可通过目标关键词确定的至少一个目标案件类型,再利用至少一个目标案件类型对先验知识库中先验知识进行筛选后,以确定其对应的先验知识,以执行后续的步骤S206和S207,避免对先验知识库中所有的先验知识进行处理,提高执行效率。Since the a priori knowledge base stores all case knowledge with too much content, if all the data in the prior knowledge base is vectorized for each case, there may be a problem of low execution efficiency. Therefore, it can be determined by the target keywords At least one target case type, and then use at least one target case type to filter the prior knowledge in the prior knowledge base to determine its corresponding prior knowledge, so as to perform the subsequent steps S206 and S207 to avoid the prior knowledge All prior knowledge in the library is processed to improve execution efficiency.
其中,根据先验知识,获取目标推送法条,具体是指从所有先验知识中,提取其案件判案结果中采用的判案依据(即具体应用法条),对所有判案依据的应用次数进行统计并排序,最终形成依据应用次数由多到少的目标推送法条,以便法官了解相关历史判案数据的判案依据,从而节省判案过程中查阅相关资料的时间,减轻工作量。Among them, according to prior knowledge, to obtain the target push law, specifically refers to extracting from all prior knowledge the judgment basis used in the judgment result of the case (ie the specific application of the law), and the application of all judgment basis The number of times is counted and sorted, and finally a target push rule based on the number of applications from more to less is formed, so that the judge can understand the basis of the judgment of relevant historical judgment data, so as to save the time of consulting relevant materials in the judgment process and reduce the workload.
S206:采用语义分析模型对案件描述信息进行语义分析,获取标准化信息,基于标准化信息查询与目标案件类型相对应的审判观点库,获取对应的推荐审判观点。S206: Use the semantic analysis model to perform semantic analysis on the case description information, obtain standardized information, and query the judgment opinion database corresponding to the target case type based on the standardized information, and obtain corresponding recommended judgment opinions.
其中,语义分析模型是基于NLP(Natural Language Processing,自然语义处理)技术构建的用于分析文本语义的模型。标准化信息是指从案件描述信息中提取出来的可能影响案件审判的信息。审判观点库是用于记录历史判案数据中提取出的历史描述信息及其对应的历史审判观点的关联关系的数据库。Among them, the semantic analysis model is a model constructed based on the NLP (Natural Language Processing) technology for analyzing the semantics of text. Standardized information refers to the information extracted from the case description information that may affect the trial of the case. The judgment viewpoint database is a database used to record the historical description information extracted from the historical judgment data and the association relationship between the corresponding historical judgment viewpoints.
具体地,服务器采用语义分析模型对案件描述信息进行语义分析,以提取其中的标准化信息。例如,在刑事案件审判过程中,被告的年龄是影响判断其是否具备刑事责任能力的关键因素,而是否准备刑事责任能力是影响量刑的关键因素,因此,服务器在采用语义分析模型对案件描述信息进行语义分析时,会筛选出与被告年龄相匹配的信息作为标准化信息。在从案件描述信息中提取出所有标准化信息之后,基于这些标准化信息查询与目标案件类型相对应的审判观点库,采用文本相似度算法对比标准化信息与目标案件类型相对应的所有历史描述信息的文本相似度,依据文本相似度的大小进行排序,将文本相似度较大的前N个(N的个数可自主设置)历史描述信息对应的历史审判观点,作为推荐审判观点,以使法官在判案过程中,及时了解相似案件的历史审判观点,并结合待判案件的实际情况,及时作出判决,减轻法官查阅相关资料的工作量。可以理解地,根据标准化信息查询与目标案件类型相对应的审判观点库,可以有效减少标准化信息与审判观点库中的历史描述信息的数据对比量,有助于提高数据处理效率。Specifically, the server uses a semantic analysis model to perform semantic analysis on case description information to extract standardized information therein. For example, in the trial process of a criminal case, the age of the defendant is a key factor in determining whether he is capable of criminal responsibility, and whether he is prepared for criminal responsibility is a key factor affecting sentencing. Therefore, the server is using a semantic analysis model to describe the case. When conducting semantic analysis, information that matches the age of the defendant will be screened out as standardized information. After extracting all standardized information from the case description information, based on the standardized information, query the judgment opinion database corresponding to the target case type, and use the text similarity algorithm to compare the text of all the historical description information corresponding to the standardized information and the target case type. The similarity is sorted according to the text similarity, and the top N (the number of N can be set independently) historical judgment viewpoints corresponding to the historical description information with the greater text similarity are used as recommended judgment viewpoints so that the judge can judge In the course of the case, timely understand the historical trial viewpoints of similar cases, and combine the actual situation of the pending case to make a judgment in a timely manner, reducing the workload of the judge to consult relevant materials. Understandably, querying the trial viewpoint database corresponding to the target case type according to the standardized information can effectively reduce the amount of data comparison between the standardized information and the historical description information in the trial viewpoint database, and help improve the efficiency of data processing.
S207:根据目标推送法条和推荐审判观点,获取并显示审判建议文件。S207: According to the target push laws and recommended trial opinions, obtain and display trial suggestion documents.
具体地,服务器将获取到的至少一条目标推送法条和获取到的至少一个推荐审判观点,作为待判案件对应的审判建议文件,该审判建议文件是系统通过对庭审笔录文件中提取的案件描述信息进行分析,从与该案件描述信息相关的历史判案数据中确定的目标推送法条和推荐审判观点,为法官审判提供参考,减轻法官查阅相关案件的过程,减少其工作量。Specifically, the server pushes the acquired at least one target law and the acquired at least one recommended trial viewpoint as a trial suggestion file corresponding to the case to be judged. The trial suggestion file is a case description extracted by the system through the court trial transcript. The information is analyzed, and the targets determined in the historical judgment data related to the description of the case are pushed to the legal articles and recommended trial viewpoints, so as to provide a reference for the judge to judge, reduce the judge's process of consulting related cases and reduce their workload.
本实施例所提供的智能辅助方法中,在从庭审笔录文件中快速提取出案件描述信息之后,通过对案件描述信息进行分词和关键词匹配处理,可快速获取其对应的目标关键词,利用该目标关键词确定相对应的至少一个目标案件类型,从而可实现从先验知识库中筛选与至少一个目标案件类型相匹配的先验知识,可根据先验知识快速获取其对应的目标推送法条,加快判案建议文件的获取效率。先采用语义分析模型对案件描述信息进行语义分析,以提取影响案件审判的标准化信息,有助于减少后续数据处理的工作量,提高推荐审判观点的获取效率;通过标准化信息查询审判观点库,获取推荐审判观点,从而根据目标推送法条和推荐审判观点,可快速获取对应的判案建议文件,以有助于减轻法官审判过程中查询相关资料的工作量,从而加快审判效率。In the intelligent assistance method provided in this embodiment, after the case description information is quickly extracted from the court hearing transcript, the case description information can be segmented and keyword matching processing can quickly obtain its corresponding target keywords, and use the The target keyword determines the corresponding at least one target case type, so that prior knowledge matching at least one target case type can be selected from the prior knowledge base, and the corresponding target push law can be quickly obtained based on the prior knowledge , To speed up the efficiency of obtaining the judgment proposal documents. First, use the semantic analysis model to perform semantic analysis on the case description information to extract standardized information that affects the trial of the case, which helps to reduce the workload of subsequent data processing and improve the efficiency of obtaining recommended judgment opinions; query the judgment opinion database through standardized information to obtain Recommended trial viewpoints, so as to push the legal articles and recommended trial viewpoints according to the target, and quickly obtain the corresponding judgment proposal documents, which helps to reduce the workload of the judges in the process of searching for relevant materials, thereby speeding up the efficiency of the trial.
在一实施例中,如图3所示,在步骤S201之前,即在从数据库中获取待判案件对应的庭审笔录文件之前,智能辅助审判方法还包括:In one embodiment, as shown in FIG. 3, before step S201, that is, before obtaining the court trial transcript file corresponding to the case to be judged from the database, the intelligent auxiliary trial method further includes:
S301:获取数据更新任务,数据更新任务包括原始案件类型和任务更新时间。S301: Obtain a data update task. The data update task includes the original case type and task update time.
其中,数据更新任务是用于更新先验知识库和目标判案模型的任务。原始案件类型是指该数据更新任务所针对的案件类型。任务更新时间是指上一次执行数据更新任务的时间。Among them, the data update task is a task used to update the prior knowledge base and target judgment model. The original case type refers to the case type targeted by the data update task. The task update time refers to the time when the data update task was executed last time.
S302:若与原始案件类型相对应的判案依据在任务更新时间以后发生变更,则确定变更时间,获取变更时间和系统当前时间之间的与原始案件类型相对应的历史判案数据,根据历史判案数据确定待处理数据。S302: If the judgment basis corresponding to the original case type is changed after the task update time, the change time is determined, and the historical judgment data corresponding to the original case type between the change time and the current time of the system is obtained. The judgment data determines the data to be processed.
其中,与目标案件类型相对应的判案依据具体是指与目标案件类型相对应的法律依据,包括法条、法规和司法解释等内容。本实施例中,若与原始案件类型相对应的判案依据在任务更新时间以后发生变更,则说明在任务更新时间之后,该原始案件类型对应的判案依据发生变更,后续的案件审判过程中会适用新的法条、法规和司法解释等判案依据,因此,需更新先验知识库及目标判案模型,此时,将判案依据的执行时间确定为变更时间,获取变更时间和系统当前时间之间的与原始案件类型相对应的历史判 案数据,根据历史判案数据确定待处理数据,从而有助于提高后续训练出的目标判案模型的时效性和准确性。该待处理数据具体为用于更新先验知识库和审判观点库的训练数据。Among them, the judgment basis corresponding to the target case type specifically refers to the legal basis corresponding to the target case type, including legal provisions, regulations, and judicial interpretations. In this embodiment, if the judgment basis corresponding to the original case type is changed after the task update time, it means that the judgment basis corresponding to the original case type has changed after the task update time, and the subsequent case trial process New laws, regulations, and judicial interpretations will be applied to the judgment basis. Therefore, the prior knowledge base and target judgment model need to be updated. At this time, the execution time of the judgment basis is determined as the time of change, and the time of change and system are obtained The historical judgment data corresponding to the original case type between the current time and the data to be processed are determined based on the historical judgment data, which helps to improve the timeliness and accuracy of the target judgment model trained subsequently. The to-be-processed data is specifically training data used to update the a priori knowledge base and the trial opinion base.
S303:若与原始案件类型相对应的判案依据在任务更新时间以后没有发生变更,则获取系统当前时间之前预设周期内的与原始案件类型相对应的历史判案数据,根据历史判案数据确定待处理数据。S303: If the judgment basis corresponding to the original case type does not change after the task update time, obtain the historical judgment data corresponding to the original case type in the preset period before the current time of the system, based on the historical judgment data Determine the data to be processed.
其中,预设周期是预先设置的用于采集数据的周期,可以设置为三个月、半年或者1年。本实施例中,若与原始案件类型相对应的判案依据在任务更新时间以后没有发生变更,说明后续案件审判过程中仍适用旧的审判依据,但随着时间的变化,这些审判依据的适用标准可能会发生变化,如赔付金额等情况,此时,将系统当前时间之前预设周期内的与原始案件类型相对应的历史判案数据确定待处理数据,以保证待处理数据的时效性,从而有助于提高后续训练出的目标判案模型的时效性。Among them, the preset period is a preset period for collecting data, which can be set to three months, half a year, or one year. In this embodiment, if the judgment basis corresponding to the original case type does not change after the task update time, it means that the old trial basis is still applied in the subsequent case trial process, but as time changes, the application of these trial basis The standard may change, such as the amount of compensation. At this time, the historical judgment data corresponding to the original case type in the preset period before the current time of the system is used to determine the data to be processed to ensure the timeliness of the data to be processed. This helps to improve the timeliness of the subsequent training of the target judgment model.
进一步地,待处理数据包括目标区域,该目标区域可以理解为历史判案数据对应的审判法院所在的行政区域,如广东省或者深圳市等。此时,步骤S302和S303中的根据历史判案数据确定待处理数据,具体包括如下步骤:Further, the data to be processed includes a target area, which can be understood as the administrative area where the trial court corresponding to the historical judgment data is located, such as Guangdong Province or Shenzhen. At this time, the determination of the data to be processed based on historical judgment data in steps S302 and S303 specifically includes the following steps:
(1)确定任一目标区域对应的历史判案数据的目标数据量。(1) Determine the target data volume of historical judgment data corresponding to any target area.
若目标数据量大于预设数量阈值,则将同一目标区域和原始案件类型对应的历史判案数据,确定为待处理数据。If the target data amount is greater than the preset number threshold, the historical judgment data corresponding to the same target area and the original case type is determined as the data to be processed.
其中,预设数量阈值是预先设置的数量阈值。若任一目标区域对应的待处理数据的目标数据量大于预设数量阈值,则说明该目标区域内各级法院审判与原始案件类型相对应的案件的数量较多,因此,可将同一目标区域和原始案件类型对应的历史判案数据确定为待处理数据,以便后续基于该待处理数据构建与目标区域和原始案件类型相对应的先验知识库,并构建与目标区域和原始案件类型相对应的审判观点库,以使该先验知识库和审判观点库对该目标区域内的与目标案件类型相对应的待判案件的借鉴意义更大,更具有针对性。The preset number threshold is a preset number threshold. If the target data volume of the to-be-processed data corresponding to any target area is greater than the preset number threshold, it means that there are more cases in the target area that are tried by courts at all levels corresponding to the original case type. Therefore, the same target area The historical judgment data corresponding to the original case type is determined as the data to be processed, so that a priori knowledge base corresponding to the target area and the original case type is constructed based on the data to be processed, and a priori knowledge base corresponding to the target area and the original case type is constructed. In order to make the a priori knowledge base and the trial viewpoint database have greater reference significance and more pertinence for the pending cases corresponding to the target case type in the target area.
(3)若目标数据量不大于预设数量阈值,则将同一原始案件类型对应的历史判案数据,确定为待处理数据。(3) If the target data volume is not greater than the preset number threshold, the historical judgment data corresponding to the same original case type is determined as the data to be processed.
若任一目标区域对应的待处理数据的目标数据量不大于预设数量阈值,则说明该目标区域内各级法院审判与原始案件类型相对应的案件的数量较少,将同一原始案件类型对应的历史判案数据确定为待处理数据,构建与原始案件类型相对应的先验知识库,并训练与原始案件类型相对应的审判观点库,使得用于构建先验知识库和审判观点库的待处理数据不局限于同一目标区域,而仅依据原始案件类型进行划分。If the target data volume of the to-be-processed data corresponding to any target area is not greater than the preset number threshold, it means that the number of cases corresponding to the original case type in the courts at all levels in the target area is small, and the same original case type is corresponding The historical judgment data is determined as the data to be processed, the a priori knowledge base corresponding to the original case type is constructed, and the trial viewpoint database corresponding to the original case type is trained, so that the The data to be processed is not limited to the same target area, but is only divided according to the original case type.
S304:从待处理数据中提取先验信息,基于先验信息构建与目标案件类型相对应的先验知识库。S304: Extract prior information from the data to be processed, and construct a priori knowledge base corresponding to the target case type based on the prior information.
该待处理数据为与原始案件类型相对应的历史判案数据。服务器从待处理数据中提取先验信息,具体可以理解为从待处理数据中提取其案件描述信息和案件判案结果,再从这些案件描述信息和案件判案结果中提取关键词,具体采用键值对(Key-Value)的形式将这些关键词进行限定;然后,利用所抽取的关键词构建与原始案件类型相对应的先验知识库,具体是将所有待处理数据所提取的键值对存储在数据库中,构建与原始案件类型相对应的先验知识库。其中,Key具体可以是案件描述信息中影响判案结果的关键词,而Value是案件判案结果中的判案结果。The data to be processed is historical judgment data corresponding to the original case type. The server extracts a priori information from the data to be processed. Specifically, it can be understood as extracting the description of the case and the result of the case from the data to be processed, and then extracting keywords from the description of the case and the result of the case, using keys specifically The key-value form defines these keywords; then, the extracted keywords are used to construct a priori knowledge base corresponding to the original case type, specifically the key-value pairs extracted from all the data to be processed Stored in the database to construct a priori knowledge base corresponding to the original case type. Among them, Key can specifically be a keyword in the case description that affects the judgment result, and Value is the judgment result in the case judgment result.
S305:从待处理数据提取历史描述信息和历史审判观点,基于历史描述信息和历史审判观点,构建与原始案件类型相对应的审判观点库。S305: Extract historical description information and historical judgment viewpoints from the data to be processed, and construct a judgment viewpoint database corresponding to the original case type based on the historical description information and historical judgment viewpoints.
其中,历史描述信息是从待处理数据中提取出来的可能影响案件审判的信息,具体是采用语义分析模型从待处理数据的案件描述信息中提取出来的。历史审判观点是从待处理数据中提取法官对该历史案件的审判观点。服务器在从待处理数据中提取出历史描述信息和历史审判观点之后,将历史描述信息和历史审判观点作为一组训练数据,输入到常见的CNN(卷积神经网络)或者RNN(循环神经网络)进行模型训练,以更新模型参数,从而获取与原始案件类型相对应的审判观点库。Among them, the historical description information is the information extracted from the data to be processed that may affect the trial of the case. Specifically, it is extracted from the case description information of the data to be processed using a semantic analysis model. The historical trial viewpoint is to extract the judge's trial viewpoint of the historical case from the data to be processed. After the server extracts historical description information and historical judgment opinions from the data to be processed, it uses the historical description information and historical judgment opinions as a set of training data and inputs them to common CNN (Convolutional Neural Network) or RNN (Recurrent Neural Network) Carry out model training to update the model parameters, so as to obtain the trial opinion library corresponding to the original case type.
本实施例所提供的智能辅助审判方法中,根据与原始案件类型相对应的判案依据在任务更新时间之后是否发生变更,以采集不同时间段的历史判案数据,确定用于构建先验知识库和审判观点库的待处理数据,以保证待处理数据的时效性,从而实现对先验知识库和审判观点库的更新,从而保障后续基于先 验知识库和审判观点库分别获取目标推送法条和推荐审判观点的时效性和准确性。In the intelligent auxiliary trial method provided in this embodiment, according to whether the judgment basis corresponding to the original case type has changed after the task update time, the historical judgment data of different time periods is collected and determined to be used to construct prior knowledge To ensure the timeliness of the data to be processed, the a priori knowledge base and the trial view database are updated to ensure that the target push method is obtained separately based on the prior knowledge base and the trial view database. The timeliness and accuracy of the article and recommended trial opinions.
在一实施例中,如图4所示,在步骤S201之前,该智能辅助审判方法还包括如下步骤:In one embodiment, as shown in FIG. 4, before step S201, the intelligent assisted trial method further includes the following steps:
S401:在客户端上显示与标准法庭审理笔录模板中当前审判环节对应的审判提示文字,获取麦克风采集的与审判提示文字相对应的原始语音数据。S401: Display the trial prompt text corresponding to the current trial link in the standard court trial transcript template on the client, and obtain the original voice data collected by the microphone and corresponding to the trial prompt text.
其中,标准法庭审理笔录模板一般包括开庭前准备、宣布开庭、法庭调查和法庭辩论等阶段,在法庭调查阶段具体包括当事人陈述、归纳争议焦点、围绕焦点举证质证、证人出庭作证、宣读鉴定意见和勘验笔录、申请鉴定等环节,每一环节均设有相应的引导话术。该引导话术一般为法官引导其他庭审当事人(如原告、被告或者证人)进行回复的引导问题对应的话术,如“首先由原告围绕你的诉讼请求向法庭陈述”等。审判提示文字是指该标准法庭审理笔录模板中与当前审判环节相对应的引导话术。其中,当前审判环节是指法庭审理案件过程中正在进行的环节,如当事人陈述或者其他环节。Among them, the standard court trial transcript template generally includes pre-trial preparation, announcement of the opening, court investigation and court debate, etc. In the court investigation stage, it specifically includes the statement of the parties, the summary of the focus of the dispute, the cross-examination of evidence around the focus, the witness testimony in court, the reading of expert opinions and Corresponding instructions are provided for each link of the inspection transcript and application for appraisal. This guiding speech is generally the speech corresponding to the guiding question that the judge guides other parties (such as the plaintiff, defendant or witness) to respond, such as "the plaintiff first presents to the court around your litigation request". Trial prompt text refers to the guiding words corresponding to the current trial link in the standard court trial transcript template. Among them, the current trial link refers to the ongoing link in the court hearing the case, such as the statement of the parties or other links.
原始语音数据是实时采集到的庭审当事人针对审判提示文字进行回复时采集到的语音数据。一般来说,法官在庭审过程中,会基于审判提示信息对庭审当事人(如原告、被告或者证人)进行引导或者提问,此时,庭审当事人需要进行回复,此时,麦克风采集到的语音数据为原始语音数据。本实施例中,该智能辅助审判系统包括与服务器相连的至少一个麦克风,每一麦克风对应的麦克风标识,该麦克风标识是用于唯一识别不同麦克风的标识。在麦克风实时采集原始语音数据时,其原始语音数据与相应的麦克风标识关联。The original voice data is the voice data collected in real time when the parties to the trial respond to the trial prompt text. Generally speaking, during the court hearing, the judge will guide or ask questions to the parties (such as the plaintiff, defendant or witness) in the court trial based on the trial prompt information. At this time, the parties in the court trial need to respond. At this time, the voice data collected by the microphone is Raw voice data. In this embodiment, the intelligent auxiliary trial system includes at least one microphone connected to the server, and each microphone corresponds to a microphone identifier, and the microphone identifier is an identifier for uniquely identifying different microphones. When the microphone collects raw voice data in real time, its raw voice data is associated with the corresponding microphone identifier.
S402:对原始语音数据进行声纹识别,获取声纹识别结果,根据声纹识别结果确定原始语音数据对应的对象标识。S402: Perform voiceprint recognition on the original voice data, obtain a voiceprint recognition result, and determine an object identifier corresponding to the original voice data according to the voiceprint recognition result.
其中,对原始语音数据进行声纹识别,获取声纹识别结果,具体是指服务器采用预先设置的声纹特征提取算法对原始语音数据进行声纹特征提取,再根据提取出的声纹特征进行声纹识别,以确定原始语音数据对应的说话人的身份的过程。该对象标识是用于唯一识别庭审过程不同庭审当事人的标识。具体地,服务器采用声纹特征提取算法对原始语音数据进行声纹特征提取,获取目标声纹特征,判断是否存在与目标声纹特征相对应的标准声纹特征,以获取相应的声纹识别结果。该声纹识别结果包括存在标准声纹特征和不存在标准声纹特征这两个结果。Among them, the voiceprint recognition is performed on the original voice data to obtain the voiceprint recognition result. Specifically, the server uses the preset voiceprint feature extraction algorithm to extract the voiceprint feature of the original voice data, and then performs the voiceprint feature extraction based on the extracted voiceprint feature. Pattern recognition to determine the identity of the speaker corresponding to the original voice data. The object identifier is used to uniquely identify different parties in the court hearing. Specifically, the server uses a voiceprint feature extraction algorithm to extract voiceprint features from the original voice data, obtains the target voiceprint feature, and determines whether there is a standard voiceprint feature corresponding to the target voiceprint feature to obtain the corresponding voiceprint recognition result . The voiceprint recognition result includes two results of the existence of a standard voiceprint feature and the absence of a standard voiceprint feature.
在一实施例中,步骤S402具体包括如下步骤:In an embodiment, step S402 specifically includes the following steps:
S4011:采用声纹特征提取算法对原始语音数据进行声纹特征提取,获取目标声纹特征,判断是否存在与目标声纹特征相对应的标准声纹特征。S4011: Use the voiceprint feature extraction algorithm to extract the voiceprint feature of the original voice data, obtain the target voiceprint feature, and determine whether there is a standard voiceprint feature corresponding to the target voiceprint feature.
其中,声纹特征提取算法是用于对语音数据进行声纹特征提取,以确定原始语音数据对应的声纹特征的算法。该声纹特征提取算法包括但不限于MFCC提取算法,所提取的目标声纹特征为MFCC特征。MFCC(Mel-scale Frequency Cepstral Coefficients,梅尔倒谱系数)是在Mel标度频率域提取出来的倒谱参数,Mel标度描述了人耳频率的非线性特性。目标声纹特征是从原始语音数据中提取出的声纹特征。Among them, the voiceprint feature extraction algorithm is an algorithm used to extract voiceprint features from voice data to determine the voiceprint features corresponding to the original voice data. The voiceprint feature extraction algorithm includes but is not limited to the MFCC extraction algorithm, and the extracted target voiceprint feature is the MFCC feature. MFCC (Mel-scale Frequency Cepstral Coefficients) is a cepstral parameter extracted in the frequency domain of the Mel scale. The Mel scale describes the non-linear characteristics of the human ear frequency. The target voiceprint feature is the voiceprint feature extracted from the original voice data.
标准声纹特征是根据开庭前准备过程中采集到的庭审当事人的语音数据所提取出来的声纹特征。该标准声纹特征也是采用MFCC提取算法提取出来的MFCC特征。一般来说,在开庭前准备过程中,会将庭审当事人所采集的标准声纹特征与其对应的身份标识关联存储在数据库中,以使后续识别处理。The standard voiceprint feature is the voiceprint feature extracted based on the voice data of the parties to the trial collected during the pre-trial preparation process. The standard voiceprint feature is also the MFCC feature extracted by the MFCC extraction algorithm. Generally speaking, in the pre-trial preparation process, the standard voiceprint features collected by the parties in the court trial and their corresponding identification are stored in a database in association with each other for subsequent identification processing.
本实施例中,服务器采用余弦相似度算法或者其他相似度算法对目标声纹特征和服务器中预先存储的每一标准声纹特征进行相似度计算,获取目标相似度;若目标相似度大于预设相似度阈值,则认定存在标准声纹特征;若目标相似度不大于预设相似度阈值,则认定不存在标准声纹特征。其中,预设相似度阈值是用于评估相似度是否达到认定为同一说话人的标准的阈值。In this embodiment, the server uses the cosine similarity algorithm or other similarity algorithms to calculate the similarity between the target voiceprint feature and each standard voiceprint feature pre-stored in the server to obtain the target similarity; if the target similarity is greater than the preset If the similarity threshold is the same, it is determined that there is a standard voiceprint feature; if the target similarity is not greater than the preset similarity threshold, it is determined that there is no standard voiceprint feature. Among them, the preset similarity threshold is a threshold used to evaluate whether the similarity reaches the criteria for identifying the same speaker.
S4012:若存在标准声纹特征,则根据标准声纹特征对应的身份标识,确定原始语音数据对应的对象标识。S4012: If there are standard voiceprint features, determine the object identifier corresponding to the original voice data according to the identity identifier corresponding to the standard voiceprint feature.
具体地,若服务器中存储有与目标声纹特征相对应的标准声纹特征,则根据与该标准声纹特征相对应的身份标识,确定该原始语音数据对应的对象标识,从而快速确定该原始语音数据对应的对象标识。该身份标识是用于区分庭审当事人在庭审过程中的身份的标识,如原告、被告和证人等。该对象标识是用于唯一识别庭审过程不同庭审当事人的标识。该对象标识可以是在身份标识的基础上增加序号标识,如在多个原告的情况下,可以采用原告01和原告02这种形式的对象标识进行区分。该序号标识可以根 据开庭前准备过程中采集标准身份特征的先后顺序确定,也可以根据其在法庭庭审过程中发言的先后顺序确定,以使每一庭审当事人均有唯一识别其身份的对象标识。Specifically, if the standard voiceprint feature corresponding to the target voiceprint feature is stored in the server, the object identifier corresponding to the original voice data is determined according to the identity tag corresponding to the standard voiceprint feature, so as to quickly determine the original voiceprint feature. Object ID corresponding to the voice data. The identity mark is used to distinguish the identity of the parties in the court hearing, such as the plaintiff, the defendant, and the witness. The object identifier is used to uniquely identify different parties in the court hearing. The object identification can be a serial number identification added to the identity identification. For example, in the case of multiple plaintiffs, the object identification in the form of plaintiff 01 and plaintiff 02 can be used to distinguish. The serial number identification can be determined according to the sequence of collecting standard identity features during the pre-trial preparation process, or according to the sequence of their speeches during the court hearing, so that each party to the court hearing has an object identifier that uniquely identifies its identity.
S4013:若不存在标准声纹特征,则根据原始语音数据对应的麦克风标识,确定原始语音数据对应的对象标识。S4013: If there is no standard voiceprint feature, determine the object identifier corresponding to the original voice data according to the microphone identifier corresponding to the original voice data.
具体地,若服务器不存储有与目标声纹特征相对应的标准声纹特征,则说明该说话人在开庭前准备过程中没有预先采集到标准声纹特征,此时,可根据原始语音数据所携带的麦克风标识查询麦克风信息表,获取麦克风标识对应的身份标识,基于身份标识生成相应的对象标识。其中,麦克风信息表是用于根据麦克风的摆放位置确定其说话人对应的身份标识的信息对照表,该麦克风信息表将麦克风标识与其对应的身份标识关联存储。以摆放在证人席上的麦克风为例,其麦克风标识对应的身份标识为证人,基于身份标识生成相应的对象标识,具体是指根据证人这一身份标识加上发言先后顺序形成的序号标识,获取其对应的对象标识,如证人01,证人02等。Specifically, if the server does not store the standard voiceprint feature corresponding to the target voiceprint feature, it means that the speaker has not collected the standard voiceprint feature in advance during the pre-court preparation process. At this time, it can be based on the original voice data. The carried microphone ID queries the microphone information table, obtains the ID ID corresponding to the microphone ID, and generates the corresponding object ID based on the ID ID. The microphone information table is an information comparison table used to determine the identity identifier corresponding to the speaker according to the placement position of the microphone, and the microphone information table associates the microphone identifier with its corresponding identity identifier. Taking the microphone placed on the witness stand as an example, the identity corresponding to the microphone identity is the witness, and the corresponding object identity is generated based on the identity. Specifically, it refers to the serial number identity formed by adding the identity of the witness and the order of speaking. Obtain the corresponding object ID, such as witness 01, witness 02, etc.
本实施例所提供的智能辅助审判方法中,根据原始语音数据提取出的目标声纹特征,判断是否存在相对应的标准声纹特征的声纹识别结果,以决定是根据标准声纹特征还是麦克风标识确定相对应的对象标识,以保证确定的身份标识的唯一性。In the intelligent auxiliary trial method provided in this embodiment, the target voiceprint feature extracted from the original voice data is used to determine whether there is a voiceprint recognition result corresponding to the standard voiceprint feature to determine whether it is based on the standard voiceprint feature or the microphone The identifier determines the corresponding object identifier to ensure the uniqueness of the determined identity identifier.
S403:对原始语音数据进行文本翻译,获取与对象标识相对应的原始文本数据,将对象标识和原始文本数据关联存储在标准法庭审理笔录模板的相应位置。S403: Perform text translation on the original voice data, obtain the original text data corresponding to the object identifier, and store the object identifier and the original text data in a corresponding position in the standard court trial transcript template.
其中,对原始语音数据进行文本翻译,是指将原始语音数据翻译成文本形式的数据的过程。原始文本数据是指原始语音数据翻译成的文本数据。在一实施例中,服务器可采用但不限于静态解码网络对原始语音数据进行文本翻译,由于静态解码网络已经把搜索空间全部展开,因此其在进行文本翻译时,解码速度非常快,从而可快速获取与对象标识相对应的原始文本数据。可以理解地,服务器接收麦克风采集到的原始语音数据,再采用静态解码网络对原始语音数据进行文本翻译,以快速获取其对应的原始文本数据,而无需书记员进行手动输入,从而加快原始文本数据的录入效率。Among them, the text translation of the original voice data refers to the process of translating the original voice data into data in text form. The original text data refers to the text data translated from the original voice data. In one embodiment, the server can use, but is not limited to, a static decoding network to perform text translation on the original voice data. Since the static decoding network has fully expanded the search space, the decoding speed is very fast when performing text translation, which can be fast Obtain the original text data corresponding to the object ID. Understandably, the server receives the original voice data collected by the microphone, and then uses the static decoding network to translate the original voice data to quickly obtain the corresponding original text data without manual input by the clerk, thereby speeding up the original text data Input efficiency.
具体地,服务器在对原始语音数据进行文本翻译,以获取对应的原始文本数据之后,将该原始文本数据与其对应的对象标识关联存储在标准法庭审理笔录模板的相应位置,即将原始文本数据填充在标准法庭审理笔录模板的当前审判环节中与对象标识相对应的位置。例如,原始语音数据是针对“原告对你的起诉还有补充吗”这一审判提示文字进行回复的语音数据,其对应的对象标识为原告,则可以将这一原始语音数据所翻译形成的原始文本数据填充在标准法庭审理笔录模板中与审判提示文字相对应的位置,以提高原始文本数据的录入效率,减轻书记员的工作负担。Specifically, after the server performs text translation on the original voice data to obtain the corresponding original text data, the original text data and its corresponding object identifier are associated and stored in the corresponding position of the standard court trial transcript template, that is, the original text data is filled in The position corresponding to the object identifier in the current trial link of the standard court trial transcript template. For example, the original voice data is the voice data that responds to the trial prompt text "Does the plaintiff have any supplements to your prosecution?" and the corresponding object is identified as the plaintiff, the original voice data can be translated into the original The text data is filled in the position corresponding to the trial prompt text in the standard court trial transcript template to improve the input efficiency of the original text data and reduce the work burden of the clerks.
S404:基于原始文本数据查询在先文本数据库,判断是否存在与原始文本数据相对应的在先文本数据。S404: Query the prior text database based on the original text data, and determine whether there is prior text data corresponding to the original text data.
其中,在先文本数据是指在麦克风采集到该原始语音数据之前已经形成并记录在标准法庭审理笔录模板相应位置的文本数据。由于法庭庭审是庭审当事人针对同一事件进行博弈的过程,在庭审过程中,庭审当事人针对同一事件会从不同角度进行论述,其论述内容可能有相关性,此时,在先文本数据可以理解为在采集到该原始语音数据之前与原始语音数据所论述内容相对应的文本内容。以时间为例,在刑事案件中,案发过程的时间节点是影响案件形成的证据链或者量刑轻重的关键因素,原告、被告和证人可能会基于这些时间节点发表不同的原始语音数据,依据其形成时间的先后顺序,从而确定在先文本数据和原始文本数据。或者,在民事案件中,与待判案件相关的各个时间节点(如承诺的撤回时间、承诺的撤销时间、合同成立时间和合同生成时间)是影响违约责任的认定的关键因素,原告、被告和证人可能会基于这些时间节点发表不同的原始语音数据,依据其形成时间的先后顺序,从而确定在先文本数据和原始文本数据。Among them, the previous text data refers to the text data that has been formed and recorded in the corresponding position of the standard court trial transcript template before the original voice data is collected by the microphone. Since the court trial is a process in which the parties to the trial play a game against the same event, during the trial, the parties to the trial will discuss the same event from different perspectives. The content of the discussion may be relevant. At this time, the previous text data can be understood as The text content corresponding to the content discussed in the original voice data before the original voice data is collected. Take time as an example. In a criminal case, the time node of the case is a key factor that affects the evidence chain of the case formation or the severity of the sentence. The plaintiff, defendant, and witnesses may publish different original voice data based on these time nodes. Form the time sequence, thereby determining the previous text data and the original text data. Or, in a civil case, various time nodes related to the pending case (such as the withdrawal time of the promise, the cancellation time of the promise, the time when the contract is established, and the time when the contract is generated) are the key factors affecting the determination of liability for breach of contract. The plaintiff, the defendant and the Witnesses may publish different original voice data based on these time nodes, and determine the previous text data and the original text data according to the order of their formation time.
在一实施例中,步骤S404具体包括如下步骤:In an embodiment, step S404 specifically includes the following steps:
S4041:采用关键词提取算法对原始文本数据进行关键词提取,获取文本关键词。S4041: Use a keyword extraction algorithm to extract keywords from the original text data to obtain text keywords.
其中,文本关键词是从原始文本数据中提取出来的关键词。关键词提取算法是用于实现从文本数据中提取关键词的算法。本实施例中,采用但不限于TextRank、LDA、TPR-TextRank等关键词提取算法对原始文本数据进行关键词提取,以获取该原始文本数据对应的文本关键词。Among them, text keywords are keywords extracted from the original text data. Keyword extraction algorithm is an algorithm used to extract keywords from text data. In this embodiment, keyword extraction algorithms such as TextRank, LDA, TPR-TextRank, etc. are used, but not limited to, to perform keyword extraction on the original text data to obtain text keywords corresponding to the original text data.
S4042:基于文本关键词查询同义词库,获取与文本关键词相对应的文本同义词。S4042: Query the thesaurus based on the text keywords, and obtain text synonyms corresponding to the text keywords.
其中,同义词库是用于存储预先设置的用于存储同义词关系的数据库。文本同义词是记录在同义词库中的与文本关键词具有同义词关系的同义词。本实施例中,同义词库中预先存储具有同义词关系的同义词组,这些同义词组具体可以为与案件审判过程中涉及到的同义词组,以便服务器可根据原始文本数据中提取出的文本关键词查询到相应的文本关键词,从而有助于后续查询过程扩大查询范围。Among them, the thesaurus is used to store a pre-set database for storing synonym relationships. Text synonyms are synonyms recorded in the thesaurus that have a synonym relationship with text keywords. In this embodiment, synonym groups with synonym relationships are pre-stored in the thesaurus. These synonym groups may specifically be synonym groups involved in the trial process of the case, so that the server can query the text keywords extracted from the original text data. Corresponding text keywords to help expand the scope of the query in the subsequent query process.
S4043:根据文本关键词和文本同义词查询在先文本数据库,判断是否存在包含文本关键词或者文本同义词的在先文本数据。S4043: Query the prior text database according to the text keywords and text synonyms, and determine whether there is prior text data containing the text keywords or text synonyms.
其中,在先文本数据库是用于存储在在采集到该原始语音数据之前所形成的所有在先文本数据的数据库。本实施例中,根据文本关键词和文本同义词查询在先文本数据库,判断在先文本数据库中是否存在与该文本关键词相对应的在先文本数据,或者是否存在与该文本同义词相对应的在先文本数据,以扩大在先文本数据的查找范围。Among them, the previous text database is a database used to store all previous text data formed before the original voice data is collected. In this embodiment, the prior text database is queried according to text keywords and text synonyms, and it is determined whether there is prior text data corresponding to the text keyword in the prior text database, or whether there is a prior text data corresponding to the text synonym. First text data to expand the search range of previous text data.
S4044:若存在包含文本关键词或者文本同义词的在先文本数据,则认定存在与原始文本数据相对应的在先文本数据。S4044: If there is prior text data containing text keywords or text synonyms, it is determined that there is prior text data corresponding to the original text data.
具体地,若在先文本数据库中存在包含文本关键词或者文本同义词的在先文本数据,则认定存在与原始文本数据相对应的在先文本数据,说明在庭审当事人说出原始语音数据时,已经有人在先提及过包含文本关键词或者文本同义词的语音数据,该语音数据经文本翻译后形成在先文本数据存储在先文本数据库中,以便后续基于原始文本数据和在先文本数据进行语义分析,以确定两者表述的意思是否相同,即后续执行步骤S405的步骤。Specifically, if there is prior text data containing text keywords or text synonyms in the prior text database, it is determined that there is prior text data corresponding to the original text data, indicating that the original voice data was already Someone mentioned the speech data containing text keywords or text synonyms. The speech data is translated into prior text data and stored in the prior text database for subsequent semantic analysis based on the original text data and prior text data To determine whether the two expressions have the same meaning, that is, the step S405 is subsequently executed.
S4045:若不存在包含文本关键词或者文本同义词的在先文本数据,则认定不存在与原始文本数据相对应的在先文本数据。S4045: If there is no prior text data containing text keywords or text synonyms, it is determined that there is no prior text data corresponding to the original text data.
具体地,若在先文本数据库中不存在包含文本关键词或者文本同义词的在先文本数据,则认定不存在与原始文本数据相对应的在先文本数据,说明在庭审当事人说出原始语音数据时,没有其他庭审当事人在先提及包含文本关键词或者文本同义词的语音数据,此时需执行后续的步骤S405。Specifically, if there is no prior text data containing text keywords or text synonyms in the prior text database, it is determined that there is no prior text data corresponding to the original text data, indicating that the original voice data is spoken by the parties in the court hearing , No other party in the court trial mentioned the voice data containing text keywords or text synonyms first, and the subsequent step S405 needs to be executed at this time.
可以理解地,先根据原始文本数据所提取出的文本关键词查询同义词库,以确定其对应的文本同义词,再基于文本关键词和文本同义词查询在先文本数据库,以确定其包含文本关键词或者文本同义词在先文本数据,从而扩大在先文本数据的确定范围,避免出现遗漏。Understandably, first query the thesaurus based on the text keywords extracted from the original text data to determine the corresponding text synonyms, and then query the previous text database based on the text keywords and text synonyms to determine whether it contains text keywords or The text synonym precedes the text data, thereby expanding the scope of the prior text data and avoiding omissions.
S405:若存在在先文本数据,则对原始文本数据和在先文本数据进行语义分析,确定语义分析结果,根据语义分析结果对原始文本数据进行突出显示处理,显示与语义分析结果相对应的审判提示文字,重复执行获取麦克风采集的与审判提示文字相对应的原始语音数据。S405: If prior text data exists, perform semantic analysis on the original text data and prior text data to determine the semantic analysis result, highlight the original text data based on the semantic analysis result, and display the trial corresponding to the semantic analysis result Prompt text, repeated execution to obtain the original voice data collected by the microphone and corresponding to the trial prompt text.
具体地,在确定存在与原始文本数据相对应的在先文本数据时,对原始文本数据和在先文本数据进行语义分析,确定语义分析结果,具体包括:采用语义分析工具对原始文件数据和在先文本数据进行语义分析,以确定原始文件数据和在先文本数据的语义是否相同或者相异,获取相应的语义分析结果。该语音分析结果包括相同的语义分析结果和相异的语义分析结果。该语义分析工具可以采用但不限于NLP(Natural Language Processing,自然语义处理)技术创建的分析工具。Specifically, when it is determined that there is prior text data corresponding to the original text data, semantic analysis is performed on the original text data and the prior text data to determine the semantic analysis result, which specifically includes: using a semantic analysis tool to analyze the original document data and the previous text data. Perform semantic analysis on the first text data to determine whether the semantics of the original document data and the previous text data are the same or different, and obtain the corresponding semantic analysis results. The speech analysis results include the same semantic analysis results and different semantic analysis results. The semantic analysis tool may adopt, but is not limited to, an analysis tool created by NLP (Natural Language Processing) technology.
例如,庭审当事人A对应的在先文本数据中记录“我在3月10日向B购买一批价值为10万的产品”,而庭审当事人B对应的原始文本数据中记录“我在3月10号将一批价值为10万的产品卖给A”,虽然这句话表述不相同,但涉及的当事人、时间、标的和价格等因素均相同,在采用语义分析工具对原始文件数据和在先文本数据进行语义分析时,认定两者描述的意思相同,则获取相同的语义分析结果。又例如,庭审当事人A对应的在先文本数据中记录“我在3月10日向B购买一批价值为10万的产品”,而庭审当事人B对应的原始文本数据中记录“我在3月8号将一批价值为15万的产品卖给A”,这两句话中,虽然涉及的当事人和标的这两个因素相同,但时间和价格这两个因素不相同,认定两者描述的意见不相同,获取相异的语义分析结果。For example, the prior text data corresponding to party A in the court trial records "I bought a batch of products worth 100,000 from B on March 10", while the original text data corresponding to party B in the court trial records "I was on March 10 Sell a batch of products with a value of 100,000 to A". Although this sentence is different, the parties involved, time, subject matter, and price are all the same. Using semantic analysis tools to analyze the original document data and the previous text When data is analyzed for semantics, it is determined that the two descriptions have the same meaning, and the same semantic analysis result is obtained. For another example, the previous text data corresponding to party A in the court trial recorded "I purchased a batch of products worth 100,000 from B on March 10", while the original text data corresponding to party B in the court trial recorded "I was on March 8. No. Selling a batch of products with a value of 150,000 to A". In these two sentences, although the parties involved and the subject matter are the same, the time and price are not the same, and the opinions described by the two are determined Different, obtain different semantic analysis results.
具体地,根据语义分析结果对原始文本数据进行突出显示处理,显示与语义分析结果相对应的审判提示文字,具体是指根据语义分析结果原始文本数据与在先文本数据中语义是相同还是相异,从而确定针对双方认定的事实、双方争议的焦点和描述是否前后矛盾等特殊情况,对当前审判环节对应的原始文 本数据进行不同的突出显示处理,以使法官在庭审过程中根据突出显示处理结果了解上述情况,有助于减少法官庭审过程中的工作量,从而降低工作负担,并显示与该语义分析结果相对应的审判提示信息,有助于加快法庭庭审的庭审进度。Specifically, the original text data is highlighted according to the semantic analysis result, and the trial prompt text corresponding to the semantic analysis result is displayed, which specifically refers to whether the original text data and the previous text data have the same semantics or different semantics according to the semantic analysis result , So as to determine whether the original text data corresponding to the current trial link is highlighted in different ways in response to the facts determined by the two parties, the focus of the dispute and whether the description is inconsistent, so that the judge can highlight the processing results during the trial Understanding the above situation will help reduce the workload of the judge in the trial process, thereby reducing the workload, and display the trial prompt information corresponding to the semantic analysis result, which will help speed up the trial progress of the court trial.
在一实施例中,步骤S405具体包括如下步骤:In an embodiment, step S405 specifically includes the following steps:
S4051:若语义分析结果为相同,且原始文本数据与在先文本数据对应的对象标识为同一标识,则不对原始文本数据进行突出显示处理,显示标准法庭审理笔录模板对应的下一审判环节对应的审判提示文字。S4051: If the semantic analysis result is the same, and the original text data and the object identifier corresponding to the previous text data are the same identifier, the original text data is not highlighted, and the next trial link corresponding to the standard court trial transcript template is displayed. Trial reminder text.
具体地,若语义分析结果为相同,且原始文本数据与在先文本数据对应的对象标识为同一标识,则说明说出原始文本数据和在先文本数据的说话人为同一人,而且原始文本数据和在先文本数据表述的语义相同,不存在前后表述相互矛盾的问题,可以理解为无特殊情况的论述,因此,不对原始文本数据进行突出显示处理,显示标准法庭审理笔录模板对应的下一审判环节对应的审判提示文字,重复执行步骤S401中的获取麦克风采集的与审判提示文字相对应的原始语音数据及其以后的步骤。Specifically, if the semantic analysis result is the same, and the object identifier corresponding to the original text data and the previous text data is the same identifier, it means that the speaker who spoke the original text data and the previous text data is the same person, and the original text data and The semantics of the previous text data is the same, and there is no conflict between the previous and the next. It can be understood as a discussion without special circumstances. Therefore, the original text data is not highlighted, and the next trial link corresponding to the standard court trial transcript template is displayed. For the corresponding trial prompt text, repeat the steps of acquiring the original voice data corresponding to the trial prompt text collected by the microphone in step S401 and the subsequent steps.
S4052:若语义分析结果为相异,且原始文本数据与在先文本数据对应的对象标识为同一标识,则采用第一突出显示模式对原始文本数据进行突出显示处理,显示包括矛盾提示信息的审判提示文字。S4052: If the semantic analysis results are different, and the original text data and the object identifier corresponding to the previous text data are the same identifier, the first highlighting mode is used to highlight the original text data, and display the trial including contradictory prompt information Prompt text.
其中,第一突出显示模式是预先设置的用于对同一说话人论述前后矛盾的内容进行突出显示的模式,该第一突出显示模式可以采用字体颜色、背景颜色、加粗、倾斜或者加下划线等形式进行突出显示。Among them, the first highlighting mode is a preset mode for highlighting the inconsistent content discussed by the same speaker. The first highlighting mode can adopt font color, background color, bold, slanted or underlined, etc. The form is highlighted.
具体地,语义分析结果为相异,且原始文本数据与在先文本数据对应的对象标识为同一标识,则说明说出原始文本数据和在先文本数据的说话人为同一人,而且原始文本数据和在先文本数据表述的语义相异,存在前后表述相互矛盾的问题,极有可能是因为说话人说谎这一特殊情况而产品的,因此,需采用第一突出显示模式对原始文本数据进行突出显示处理,以显示包含矛盾提示信息的审判提示文字,以使法官在庭审过程中了解到说话人论述存在前后矛盾的地方,进而更好地把控庭审过程中的引导问题,保障庭审过程中的公平公正,有助于减少法官庭审过程中的工作量,从而降低工作负担。Specifically, the semantic analysis results are different, and the object identifiers corresponding to the original text data and the previous text data are the same, indicating that the speaker who spoke the original text data and the previous text data is the same person, and the original text data and The semantics of the previous text data are different, and there is a problem of contradictory statements. It is very likely that the product was produced because of the special situation of the speaker lying. Therefore, the first highlighting mode needs to be used to highlight the original text data. Processing to display the trial prompt text containing contradictory prompt information, so that the judge can understand the inconsistencies in the speaker's statement during the trial, and then better control the guiding issues in the trial process and ensure fairness in the trial process Fairness helps to reduce the workload of judges during the trial process, thereby reducing the workload.
S4053:若语义分析结果为相同,且原始文本数据与在先文本数据对应的对象标识不为同一标识,则采用第二突出显示模式对原始文本数据进行突出显示处理,显示包括无争议提示信息的审判提示文字。S4053: If the semantic analysis result is the same, and the object identifier corresponding to the original text data and the previous text data is not the same identifier, the second highlighting mode is used to highlight the original text data, and display the information including the non-dispute prompt information Trial reminder text.
其中,第二突出显示模式是预先设置的用于对不同说话人论述无矛盾的内容进行突出显示的模式,可以理解地,该第二突出显示模式是区别于第一突出显示模式的模式,同样可以采用字体颜色、背景颜色、加粗、倾斜或者加下划线等形式进行突出显示。Among them, the second highlighting mode is a preset mode for highlighting content that is not contradictory to different speakers’ discussions. Understandably, the second highlighting mode is a mode different from the first highlighting mode. It can be highlighted in the form of font color, background color, bold, slanted, or underlined.
具体地,若语义分析结果为相同,且原始文本数据与在先文本数据对应的对象标识不为同一标识,则说明说出原始文本数据和在先文本数据的说话人不为同一人,而且原始文本数据和在先文本数据的语义相同,即这两个说话人对所论述内容无异义,即不存在争议的内容,因此,需采用第二突出显示模式对原始文本数据进行突出显示处理,显示包括无争议提示信息的审判提示文字,有助确定法庭庭审过程中的无争议的事实,进而更好地把控庭审过程中的引导问题,有助于减少法官庭审过程中的工作量,从而降低工作负担。Specifically, if the semantic analysis result is the same, and the object identifier corresponding to the original text data and the previous text data is not the same identifier, it means that the speaker who spoke the original text data and the previous text data is not the same person, and the original The semantics of the text data and the previous text data are the same, that is, the two speakers have no different meanings to the content discussed, that is, there is no content in dispute. Therefore, the second highlighting mode is required to highlight the original text data. Displaying the trial reminder text that includes non-controversial reminder information can help determine the undisputed facts in the court trial process, and better control the guiding issues in the court trial process, helping to reduce the workload of the judge in the trial process, thereby Reduce the workload.
S4054:若语义分析结果为相异,且原始文本数据与在先文本数据对应的对象标识不为同一标识,则采用第三突出显示模式对原始文本数据进行突出显示处理,显示包括争议焦点提示信息的审判提示文字。S4054: If the semantic analysis results are different, and the object identifiers corresponding to the original text data and the previous text data are not the same identifier, the third highlighting mode is used to highlight the original text data, and display prompt information including the focus of the dispute The trial reminder text.
其中,第三突出显示模式是预先设置的用于对不同说话人论述有矛盾的内容进行突出显示的模式,可以理解地,该第三突出显示模式与前面的第一突出显示模式和第二突出显示模式不相同的模式,同样可以采用字体颜色、背景颜色、加粗、倾斜或者加下划线等形式进行突出显示。Among them, the third highlighting mode is a preset mode for highlighting conflicting content discussed by different speakers. Understandably, this third highlighting mode is the same as the previous first highlighting mode and second highlighting mode. Modes with different display modes can also be highlighted in the form of font color, background color, bold, oblique, or underlined.
具体地,若语义分析结果为相异,且原始文本数据与在先文本数据对应的对象标识不为同一标识,则说明说出原始文本数据和在先文本数据的说话人不为同一人,而且原始文本数据和在先文本数据的语义相异,即这两个说话人对所论述内容有争议,一般为双方争议焦点所在,因此,可采用第三突出显示模式对原始文本数据进行突出显示处理,显示包含争议焦点提示信息的审判提示文字,有助确定法庭庭审过程中的争议焦点,进而更好地把控庭审过程中的引导问题,有助于减少法官庭审过程中的工作量, 从而降低工作负担。Specifically, if the semantic analysis results are different, and the object identifiers corresponding to the original text data and the previous text data are not the same, it means that the speaker who spoke the original text data and the previous text data is not the same person, and The semantics of the original text data and the previous text data are different, that is, the two speakers have disputes about the content discussed, which is generally the focus of the dispute between the two parties. Therefore, the third highlight mode can be used to highlight the original text data. , Display the trial prompt text containing the prompt information of the dispute focus, which helps to determine the focus of the dispute in the court hearing process, and then better control the guiding problems in the court hearing process, which helps to reduce the workload of the judge during the trial process, thereby reducing Work load.
进一步地,服务器在获取麦克风采集到的与审判提示文字相对应的原始语音数据之后,还可以采用服务器上预先设置的语音测谎模型对原始语音数据进行处理,获取说谎概率,若该说谎概率大于预设概率阈值,则采用谎言显示模式对原始文本数据进行突出显示处理,以使法官庭审过程中及时了解各方庭审当事人是否说谎,以保证案件公平公正地审理。其中,该语音测试模型可以是应用在当前市面公开的语音测试仪上使用的模型,以便根据原始语音数据中包含的语音频率或者语音基调等信息确定说话人说出的原始语音数据是谎言的概率。预设概率阈值是预先设置的用于评估是否达到判定为谎言的概率的阈值。谎言显示模式是预先设置的用于对较大概率为谎言的原始文本数据进行突出显示的模式。Further, after the server obtains the original voice data corresponding to the trial prompt text collected by the microphone, it can also use the voice polygraph model preset on the server to process the original voice data to obtain the probability of lying. If the probability of lying is greater than With the preset probability threshold, the lie display mode is used to highlight the original text data, so that the judge knows whether the parties in the trial are lying in a timely manner during the trial, so as to ensure a fair and just trial of the case. The voice test model may be a model used on a voice tester currently publicly available on the market, so as to determine the probability that the original voice data spoken by the speaker is a lie based on the voice frequency or voice tone contained in the original voice data . The preset probability threshold is a preset threshold used to evaluate whether the probability of determining a lie is reached. The lie display mode is a preset mode for highlighting the original text data with a higher probability of lying.
S406:若不存在在先文本数据,则重复执行在客户端上显示与标准法庭审理笔录模板中下一审判环节对应的审判提示文字,获取麦克风采集的与审判提示文字相对应的原始语音数据,直至不存在下一审判环节对应的审判提示文字时,获取庭审笔录文件,并将庭审笔录文件存储在数据库中。S406: If there is no prior text data, repeat the execution to display the trial prompt text corresponding to the next trial link in the standard court trial transcript template on the client, and obtain the original voice data collected by the microphone and corresponding to the trial prompt text. Until there is no trial prompt text corresponding to the next trial session, the court trial transcript file is obtained and the court trial transcript file is stored in the database.
具体地,在确定不存在与原始文本数据相对应的在先文本数据时,可依据标准法庭审理笔录模板,判断是否存在下一审判环节对应的审判提示文字;若存在下一审判环节对应的审判提示文字时,重复执行获取麦克风采集的与审判提示文字相对应的原始语音数据及其之后的步骤(即步骤S402、S403);若不存在下一审判环节对应的审判提示文字时,认定法庭审理过程结束,则根据标准法庭审理笔录模板中相应位置填充的所有原始文本数据,形成庭审笔录文件,并将该庭审笔录文件存储在数据库中,以便法官基于该庭审笔录文件制作裁判文书。Specifically, when it is determined that there is no prior text data corresponding to the original text data, the standard court trial record template can be used to determine whether there is a trial prompt text corresponding to the next trial link; if there is a trial corresponding to the next trial link When prompting text, repeat the steps to obtain the original voice data corresponding to the trial prompt text collected by the microphone and the subsequent steps (ie steps S402, S403); if there is no trial prompt text corresponding to the next trial session, the court is deemed to be heard At the end of the process, a court trial record file is formed based on all the original text data filled in the corresponding position in the standard court trial record template, and the court trial record file is stored in the database so that the judge can make a judgment document based on the court trial record file.
本实施例所提供的智能辅助审判方法中,通过麦克风实时采集与当前审判环节对应的审判提示文字相对应的原始语音数据之后,根据对原始语音数据进行声纹识别的声纹识别结果,确定其对象标识,从而确定该原始语音数据对应的说话人身份;将原始语音数据文本翻译所获取的原始文本数据与对象标识关联存储在标准法庭审理笔录模板的相应位置,从而提高原始文本数据的录入效率,无需书记员逐字录入,减轻书记员的工作负担。在存在与原始文本数据相对应的在先文本数据时,根据原始文本数据与在先文本数据的语义分析结果,对原始文本数据进行突出显示处理,并显示与语义分析结果相对应的审判提示文字,可使法官在庭审过程中根据突出显示处理结果了解不同语义分析结果对应的特殊情况,有助于减少法官庭审过程中的工作量,从而降低工作负担,显示与该语义分析结果相对应的审判提示信息,有助于加快法庭庭审的庭审进度,提高庭审效率。In the intelligent assisted trial method provided in this embodiment, after the original voice data corresponding to the trial prompt text corresponding to the current trial session is collected through a microphone in real time, it is determined according to the voiceprint recognition result of the voiceprint recognition of the original voice data Object identification to determine the identity of the speaker corresponding to the original voice data; the original text data obtained by the text translation of the original voice data and the object identification are stored in the corresponding position of the standard court trial transcript template, thereby improving the input efficiency of the original text data , No need for clerks to enter verbatim, reducing the work burden of clerks. When there is prior text data corresponding to the original text data, based on the semantic analysis results of the original text data and the prior text data, the original text data is highlighted, and the trial prompt text corresponding to the semantic analysis result is displayed , Which enables judges to understand the special situations corresponding to different semantic analysis results according to the highlighted processing results during the court trial, which helps to reduce the workload of the judges during the court trial, thereby reducing the workload and displaying the trial corresponding to the semantic analysis result The prompt information helps to speed up the trial progress of the court trial and improve the efficiency of the trial.
应理解,上述实施例中各步骤的序号的大小并不意味着执行顺序的先后,各过程的执行顺序应以其功能和内在逻辑确定,而不应对本申请实施例的实施过程构成任何限定。It should be understood that the size of the sequence number of each step in the foregoing embodiment does not mean the order of execution. The execution sequence of each process should be determined by its function and internal logic, and should not constitute any limitation to the implementation process of the embodiment of the present application.
在一实施例中,如图5所示,提供一种智能辅助审判装置,该智能辅助审判装置与上述实施例中智能辅助审判方法一一对应。该智能辅助审判装置的各功能模块详细说明如下:In one embodiment, as shown in FIG. 5, an intelligent auxiliary trial device is provided, and the intelligent auxiliary trial device corresponds one-to-one with the intelligent auxiliary trial method in the foregoing embodiment. The detailed description of each functional module of the intelligent auxiliary trial device is as follows:
案件描述信息获取模块501,用于从数据库中获取待判案件对应的庭审笔录文件,从庭审笔录文件中提取案件描述信息;The case description information obtaining module 501 is used to obtain the court trial transcript file corresponding to the case to be judged from the database, and extract the case description information from the court trial transcript file;
目标分词结果获取模块502,用于采用分词工具对案件描述信息进行分词,获取目标分词结果,目标分词结果包括多个目标分词;The target word segmentation result acquisition module 502 is configured to use word segmentation tools to segment the case description information to obtain a target word segmentation result, and the target word segmentation result includes multiple target word segmentation;
目标关键词确定模块503,用于基于每一目标分词查询关键词库,将关键词库中存储的与目标分词相匹配的原始关键词确定为目标关键词;The target keyword determining module 503 is configured to query a keyword database based on each target word segmentation, and determine the original keywords stored in the keyword database that match the target word segmentation as the target keyword;
目标案件类型确定模块504,用于根据目标关键词查询案件类型信息库,获取与目标关键词相匹配的至少一个目标案件类型;The target case type determination module 504 is configured to query the case type information database according to the target keyword, and obtain at least one target case type that matches the target keyword;
目标推送法条获取模块505,从先验知识库中获取与至少一个目标案件类型相匹配的先验知识,根据先验知识,获取目标推送法条;The target push law acquisition module 505 acquires prior knowledge matching at least one target case type from the prior knowledge base, and obtains the target push law based on the prior knowledge;
推荐审判观点获取模块506,用于采用语义分析模型对案件描述信息进行语义分析,获取标准化信息,基于标准化信息查询审判观点库,获取对应的推荐审判观点;The recommended trial viewpoint acquisition module 506 is used to perform semantic analysis on case description information using a semantic analysis model to obtain standardized information, query the trial viewpoint database based on the standardized information, and obtain corresponding recommended trial viewpoints;
审判建议文件获取模块507,用于根据目标推送法条和推荐审判观点,获取并显示审判建议文件。The trial suggestion file acquisition module 507 is used to push the law and recommended trial opinions according to the target, and acquire and display the trial suggestion file.
优选地,在案件描述信息获取模块之前,智能辅助审判装置还包括:Preferably, before the case description information acquisition module, the intelligent auxiliary trial device further includes:
原始语音数据获取模块,用于在客户端上显示与标准法庭审理笔录模板中当前审判环节对应的审判 提示文字,获取麦克风采集的与审判提示文字相对应的原始语音数据;The original voice data acquisition module is used to display the trial prompt text corresponding to the current trial link in the standard court trial transcript template on the client, and obtain the original voice data collected by the microphone and corresponding to the trial prompt text;
对象标识获取模块,用于对原始语音数据进行声纹识别,获取声纹识别结果,根据声纹识别结果确定原始语音数据对应的对象标识;The object identification acquisition module is used to perform voiceprint recognition on the original voice data, obtain the voiceprint recognition result, and determine the object identifier corresponding to the original voice data according to the voiceprint recognition result;
原始文本数据获取模块,用于对原始语音数据进行文本翻译,获取与对象标识相对应的原始文本数据,将对象标识和原始文本数据关联存储在标准法庭审理笔录模板的相应位置;The original text data acquisition module is used to translate the original speech data, obtain the original text data corresponding to the object identification, and store the object identification and the original text data in the corresponding position of the standard court trial transcript template;
在先文本数据判断模块,用于基于原始文本数据查询在先文本数据库,判断是否存在与原始文本数据相对应的在先文本数据;The prior text data judgment module is used to query the prior text database based on the original text data to determine whether there is prior text data corresponding to the original text data;
突出显示处理模块,用于若存在在先文本数据,则对原始文本数据和在先文本数据进行语义分析,确定语义分析结果,根据语义分析结果对原始文本数据进行突出显示处理,显示与语义分析结果相对应的审判提示文字,重复执行获取麦克风采集的与审判提示文字相对应的原始语音数据;The highlight processing module is used to perform semantic analysis on the original text data and previous text data if there is prior text data, determine the semantic analysis result, and perform highlight processing on the original text data according to the semantic analysis result, display and semantic analysis The trial prompt text corresponding to the result is repeatedly executed to obtain the original voice data corresponding to the trial prompt text collected by the microphone;
笔录文件获取模块,用于若不存在在先文本数据,则重复执行在客户端上显示与标准法庭审理笔录模板中下一审判环节对应的审判提示文字,获取麦克风采集的与审判提示文字相对应的原始语音数据,直至不存在下一审判环节对应的审判提示文字时,获取庭审笔录文件,并将庭审笔录文件存储在数据库中。The transcript file acquisition module is used to repeatedly execute the trial prompt text corresponding to the next trial link in the standard court trial transcript template if there is no prior text data, and obtain the corresponding trial prompt text collected by the microphone The original voice data, until there is no trial prompt text corresponding to the next trial link, obtain the court trial transcript file and store the court trial transcript file in the database.
优选地,对象标识获取模块,包括:Preferably, the object identification acquisition module includes:
声纹特征提取判断单元,用于采用声纹特征提取算法对原始语音数据进行声纹特征提取,获取目标声纹特征,判断是否存在与目标声纹特征相对应的标准声纹特征;The voiceprint feature extraction and judgment unit is used to perform voiceprint feature extraction on the original voice data by using the voiceprint feature extraction algorithm, obtain the target voiceprint feature, and determine whether there is a standard voiceprint feature corresponding to the target voiceprint feature;
第一对象标识确定单元,用于若存在标准声纹特征,则根据标准声纹特征对应的身份标识,确定原始语音数据对应的对象标识;The first object identification determining unit is configured to determine the object identification corresponding to the original voice data according to the identity identification corresponding to the standard voiceprint feature if there is a standard voiceprint feature;
第二对象标识确定单元,用于若不存在标准声纹特征,则根据原始语音数据对应的麦克风标识,确定原始语音数据对应的对象标识。The second object identifier determining unit is configured to determine the object identifier corresponding to the original voice data according to the microphone identifier corresponding to the original voice data if there is no standard voiceprint feature.
优选地,在先文本数据判断模块,包括:Preferably, the prior text data judgment module includes:
文本关键词获取单元,用于采用关键词提取算法对原始文本数据进行关键词提取,获取文本关键词;The text keyword acquisition unit is used to extract keywords from the original text data using a keyword extraction algorithm to obtain text keywords;
文本同义词获取单元,用于基于文本关键词查询同义词库,获取与文本关键词相对应的文本同义词;The text synonym acquisition unit is used to query the thesaurus based on text keywords to obtain text synonyms corresponding to the text keywords;
在先文本查询判断单元,用于根据文本关键词和文本同义词查询在先文本数据库,判断是否存在包含文本关键词或者文本同义词的在先文本数据;The prior text query judgment unit is used to query the prior text database based on text keywords and text synonyms, and determine whether there is prior text data containing text keywords or text synonyms;
第一判断处理单元,用于若存在包含文本关键词或者文本同义词的在先文本数据,则认定存在与原始文本数据相对应的在先文本数据;The first judgment processing unit is configured to determine that there is prior text data corresponding to the original text data if there is prior text data containing text keywords or text synonyms;
第二判断处理单元,用于若不存在包含文本关键词或者文本同义词的在先文本数据,则认定不存在与原始文本数据相对应的在先文本数据。The second judgment processing unit is configured to determine that there is no prior text data corresponding to the original text data if there is no prior text data containing text keywords or text synonyms.
优选地,突出显示处理模块,包括:Preferably, the highlight processing module includes:
第一显示处理单元,用于若语义分析结果为相同,且原始文本数据与在先文本数据对应的对象标识为同一标识,则不对原始文本数据进行突出显示处理,显示标准法庭审理笔录模板对应的下一审判环节对应的审判提示文字;The first display processing unit is used for if the semantic analysis result is the same, and the original text data and the object identifier corresponding to the previous text data are the same identifier, the original text data is not highlighted, and the standard court trial record template is displayed. Trial prompt text corresponding to the next trial session;
第二显示处理单元,用于若语义分析结果为相异,且原始文本数据与在先文本数据对应的对象标识为同一标识,则采用第一突出显示模式对原始文本数据进行突出显示处理,显示包括矛盾提示信息的审判提示文字;The second display processing unit is used to perform highlight processing on the original text data using the first highlight mode if the semantic analysis results are different, and the original text data and the object identifier corresponding to the previous text data are the same identifier, and display Trial reminder text including contradictory reminder information;
第三显示处理单元,用于若语义分析结果为相同,且原始文本数据与在先文本数据对应的对象标识不为同一标识,则采用第二突出显示模式对原始文本数据进行突出显示处理,显示包括无争议提示信息的审判提示文字;The third display processing unit is configured to use the second highlighting mode to highlight the original text data if the semantic analysis results are the same, and the original text data and the object identifier corresponding to the previous text data are not the same identifier, and display Trial reminder text including no dispute reminder information;
第四显示处理单元,用于若语义分析结果为相异,且原始文本数据与在先文本数据对应的对象标识不为同一标识,则采用第三突出显示模式对原始文本数据进行突出显示处理,显示包括争议焦点提示信息的审判提示文字。The fourth display processing unit is used to perform highlight processing on the original text data using the third highlighting mode if the semantic analysis results are different and the object identifiers corresponding to the original text data and the previous text data are not the same identifier, Display the trial reminder text including the reminder of the focus of the dispute.
优选地,目标分词结果获取模块,包括:Preferably, the target word segmentation result acquisition module includes:
文本分词处理单元,用于采用结巴分词工具的搜索引擎模式对案件描述信息进行文本分词,获取文 本分词结果,文本分词结果包括N个一级分词;The text segmentation processing unit is used to use the search engine mode of the stuttering word segmentation tool to segment the case description information to obtain the text segmentation result. The text segmentation result includes N first-level word segmentation;
第一优化处理单元,用于若任意连续k个一级分词中连续k-1个一级分词的叠加等于第k个一级分词,且连续k-1个一级分词中存在至少两个一级分词的结合等于第k个一级分词,则仅保留结合等于第k个一级分词的至少两个一级分词作为目标分词,获取目标分词结果;The first optimization processing unit is configured to: if the superposition of consecutive k-1 first-level participles in any continuous k first-level participles is equal to the k-th first-level participle, and there are at least two consecutive k-1 first-level participles The combination of graded participles is equal to the k-th first-level participle, and only at least two first-level participles that are combined and equal to the k-th first-level participle are retained as target participles to obtain the target segmentation result;
第二优化处理单元,用于若任意连续k个一级分词中连续k-1个一级分词的叠加等于第k个一级分词,且任意连续k个一级分词中不存在至少两个一级分词的结合等于第k个一级分词,则删除前k-1个一级分词,保留第k个一级分词作为目标分词,获取目标分词结果;The second optimization processing unit is configured to: if the superposition of k-1 consecutive first-level participles in any continuous k first-level participles is equal to the k-th first-level participle, and there are no at least two one in any continuous k first-level participles The combination of level participle is equal to the k-th first-level participle, then delete the first k-1 first-level participles, keep the k-th first-level participle as the target participle, and obtain the target participle result;
第三优化处理单元,用于若任意连续k个一级分词中连续k-1个一级分词的结合等于第k个一级分词,则删除第k个一级分词,保留前k-1个一级分词作为目标分词,获取目标分词结果。The third optimization processing unit is used to delete the k-th first-level participle and keep the first k-1 if the combination of k-1 first-level participles in any continuous k first-level participles is equal to the k-th first-level participle The first-level word segmentation is used as the target word segmentation to obtain the target word segmentation result.
优选地,目标关键词确定模块,包括:Preferably, the target keyword determination module includes:
目标同义词获取单元,用于根据目标分词查询同义词库,获取与目标分词相对应的目标同义词;The target synonym obtaining unit is used to query the thesaurus according to the target word segmentation, and obtain the target synonym corresponding to the target word segmentation;
目标关键词确定单元,用于根据目标分词和目标同义词查询关键词库,判断是否存在与目标分词或者目标同义词相匹配的原始关键词,若存在原始关键词,则将原始关键词确定为目标关键词。The target keyword determining unit is used to query the keyword database according to the target word segmentation and target synonyms, and determine whether there is an original keyword that matches the target word segmentation or target synonym. If the original keyword exists, the original keyword is determined as the target key word.
优选地,在案件描述信息获取模块之前,智能辅助审判装置还包括:Preferably, before the case description information acquisition module, the intelligent auxiliary trial device further includes:
更新任务获取模块,用于获取数据更新任务,数据更新任务包括原始案件类型和任务更新时间;The update task acquisition module is used to acquire the data update task, which includes the original case type and task update time;
第一数据获取模块,用于若与原始案件类型相对应的判案依据在任务更新时间以后发生变更,则确定变更时间,获取变更时间和系统当前时间之间的与原始案件类型相对应的历史判案数据,根据历史判案数据确定待处理数据;The first data acquisition module is used to determine the change time if the judgment basis corresponding to the original case type is changed after the task update time, and obtain the history corresponding to the original case type between the change time and the current system time Judgment data, determine the data to be processed based on historical judgment data;
第二数据获取模块,用于若与原始案件类型相对应的判案依据在任务更新时间以后没有发生变更,则获取系统当前时间之前预设周期内的与原始案件类型相对应的历史判案数据,根据历史判案数据确定待处理数据;The second data acquisition module is used to obtain historical judgment data corresponding to the original case type in the preset period before the current time of the system if the judgment basis corresponding to the original case type does not change after the task update time , Determine the data to be processed based on historical judgment data;
先验知识库构建模块,用于从待处理数据中提取先验信息,基于先验信息构建与原始案件类型相对应的先验知识库;A priori knowledge base building module, used to extract prior information from the data to be processed, and build a priori knowledge base corresponding to the original case type based on the prior information;
审判观点库构建模块,用于从待处理数据提取历史描述信息和历史审判观点,基于历史描述信息和历史审判观点,构建与原始案件类型相对应的审判观点库。The trial viewpoint database building module is used to extract historical description information and historical trial viewpoints from the data to be processed, and construct a trial viewpoint database corresponding to the original case type based on the historical description information and historical trial viewpoints.
优选地,待处理数据包括目标区域;Preferably, the data to be processed includes the target area;
根据历史判案数据确定待处理数据,包括:Determine the data to be processed based on historical judgment data, including:
确定任一目标区域对应的历史判案数据的目标数据量;Determine the target data volume of historical judgment data corresponding to any target area;
若目标数据量大于预设数量阈值,则将同一目标区域和原始案件类型对应的历史判案数据,确定为待处理数据;If the target data volume is greater than the preset number threshold, the historical judgment data corresponding to the same target area and the original case type is determined as the data to be processed;
若目标数据量不大于预设数量阈值,则将同一原始案件类型对应的历史判案数据,确定为待处理数据。If the target data volume is not greater than the preset number threshold, the historical judgment data corresponding to the same original case type is determined as the data to be processed.
关于智能辅助审判装置的具体限定可以参见上文中对于智能辅助审判方法的限定,在此不再赘述。上述智能辅助审判装置中的各个模块可全部或部分通过软件、硬件及其组合来实现。上述各模块可以硬件形式内嵌于或独立于计算机设备中的处理器中,也可以以软件形式存储于计算机设备中的存储器中,以便于处理器调用执行以上各个模块对应的操作。For the specific limitation of the intelligent auxiliary trial device, please refer to the above definition of the intelligent auxiliary trial method, which will not be repeated here. Each module in the above-mentioned intelligent auxiliary trial device can be implemented in whole or in part by software, hardware and a combination thereof. The foregoing modules may be embedded in the form of hardware or independent of the processor in the computer device, or may be stored in the memory of the computer device in the form of software, so that the processor can call and execute the operations corresponding to the foregoing modules.
在一个实施例中,提供了一种计算机设备,该计算机设备可以是服务器,其内部结构图可以如图6所示。该计算机设备包括通过系统总线连接的处理器、存储器、网络接口和数据库。其中,该计算机设备的处理器用于提供计算和控制能力。该计算机设备的存储器包括非易失性存储介质、内存储器。该非易失性存储介质存储有操作系统、计算机可读指令和数据库。该内存储器为非易失性存储介质中的操作系统和计算机可读指令的运行提供环境。该计算机设备的数据库用于存储执行智能辅助审判方法过程中使用或者生成的数据。该计算机设备的网络接口用于与外部的终端通过网络连接通信。该计算机可读指令被处理器执行时以实现一种智能辅助审判方法。In one embodiment, a computer device is provided. The computer device may be a server, and its internal structure diagram may be as shown in FIG. 6. The computer equipment includes a processor, a memory, a network interface and a database connected through a system bus. Among them, the processor of the computer device is used to provide calculation and control capabilities. The memory of the computer device includes a non-volatile storage medium and an internal memory. The non-volatile storage medium stores an operating system, computer readable instructions, and a database. The internal memory provides an environment for the operation of the operating system and computer-readable instructions in the non-volatile storage medium. The database of the computer equipment is used to store data used or generated during the execution of the intelligent auxiliary trial method. The network interface of the computer device is used to communicate with an external terminal through a network connection. The computer-readable instruction is executed by the processor to realize an intelligent auxiliary trial method.
在一个实施例中,提供了一种计算机设备,包括存储器、处理器及存储在存储器上并可在处理器上运行的计算机可读指令,处理器执行计算机可读指令时实现上述实施例中智能辅助审判方法,例如图 2-图4所示,为避免重复,这里不再赘述。或者,或者,处理器执行计算机可读指令时实现上述智能辅助审判装置的各模块/单元的功能,如图5所示的各模块,为避免重复,这里不同赘述。In one embodiment, a computer device is provided, including a memory, a processor, and computer-readable instructions stored in the memory and capable of running on the processor. The processor executes the computer-readable instructions to realize the intelligence in the above-mentioned embodiments. The auxiliary trial method, such as shown in Figure 2-Figure 4, in order to avoid repetition, will not be repeated here. Or, alternatively, when the processor executes the computer-readable instructions, the functions of the modules/units of the above-mentioned intelligent auxiliary trial device are realized. The modules are shown in FIG.
在一个实施例中,提供了一个或多个存储有计算机可读指令的非易失性可读存储介质,计算机可读存储介质存储有计算机可读指令,所述计算机可读指令被一个或多个处理器执行时,使得所述一个或多个处理器执行时实现上述实施例中智能辅助审判方法,例如图2-图4所示,为避免重复,这里不再赘述。或者,该计算机可读指令被处理器执行时上述智能辅助审判装置的各模块/单元的功能,如图5所示的各模块,为避免重复,这里不同赘述。In one embodiment, one or more non-volatile readable storage media storing computer readable instructions are provided. The computer readable storage medium stores computer readable instructions, and the computer readable instructions are stored by one or more When executed by two processors, the one or more processors are executed to implement the intelligent auxiliary judgment method in the above-mentioned embodiment, for example, as shown in Figs. 2 to 4, in order to avoid repetition, details are not repeated here. Alternatively, the functions of the modules/units of the above-mentioned intelligent auxiliary trial device when the computer-readable instructions are executed by the processor are the modules shown in FIG. 5, and to avoid repetition, the description will be different here.
本领域普通技术人员可以理解实现上述实施例方法中的全部或部分流程,是可以通过计算机可读指令来指令相关的硬件来完成,所述的计算机可读指令可存储于一非易失性计算机可读取存储介质中,该计算机可读指令在执行时,可包括如上述各方法的实施例的流程。其中,本申请所提供的各实施例中所使用的对存储器、存储、数据库或其它介质的任何引用,均可包括非易失性和/或易失性存储器。非易失性存储器可包括只读存储器(ROM)、可编程ROM(PROM)、电可编程ROM(EPROM)、电可擦除可编程ROM(EEPROM)或闪存。易失性存储器可包括随机存取存储器(RAM)或者外部高速缓冲存储器。作为说明而非局限,RAM以多种形式可得,诸如静态RAM(SRAM)、动态RAM(DRAM)、同步DRAM(SDRAM)、双数据率SDRAM(DDRSDRAM)、增强型SDRAM(ESDRAM)、同步链路(Synchlink)DRAM(SLDRAM)、存储器总线(Rambus)直接RAM(RDRAM)、直接存储器总线动态RAM(DRDRAM)、以及存储器总线动态RAM(RDRAM)等。A person of ordinary skill in the art can understand that all or part of the processes in the above-mentioned embodiment methods can be implemented by instructing relevant hardware through computer-readable instructions, which can be stored in a non-volatile computer. In a readable storage medium, when the computer-readable instructions are executed, they may include the processes of the above-mentioned method embodiments. Wherein, any reference to memory, storage, database or other media used in the embodiments provided in this application may include non-volatile and/or volatile memory. Non-volatile memory may include read only memory (ROM), programmable ROM (PROM), electrically programmable ROM (EPROM), electrically erasable programmable ROM (EEPROM), or flash memory. Volatile memory may include random access memory (RAM) or external cache memory. As an illustration and not a limitation, RAM is available in many forms, such as static RAM (SRAM), dynamic RAM (DRAM), synchronous DRAM (SDRAM), double data rate SDRAM (DDRSDRAM), enhanced SDRAM (ESDRAM), synchronous chain Channel (Synchlink) DRAM (SLDRAM), memory bus (Rambus) direct RAM (RDRAM), direct memory bus dynamic RAM (DRDRAM), and memory bus dynamic RAM (RDRAM), etc.
所属领域的技术人员可以清楚地了解到,为了描述的方便和简洁,仅以上述各功能单元、模块的划分进行举例说明,实际应用中,可以根据需要而将上述功能分配由不同的功能单元、模块完成,即将所述装置的内部结构划分成不同的功能单元或模块,以完成以上描述的全部或者部分功能。Those skilled in the art can clearly understand that for the convenience and conciseness of description, only the division of the above-mentioned functional units and modules is used as an example. In practical applications, the above-mentioned functions can be allocated to different functional units and modules as required. Module completion means dividing the internal structure of the device into different functional units or modules to complete all or part of the functions described above.
以上所述实施例仅用以说明本申请的技术方案,而非对其限制;尽管参照前述实施例对本申请进行了详细的说明,本领域的普通技术人员应当理解:其依然可以对前述各实施例所记载的技术方案进行修改,或者对其中部分技术特征进行等同替换;而这些修改或者替换,并不使相应技术方案的本质脱离本申请各实施例技术方案的精神和范围,均应包含在本申请的保护范围之内。The above-mentioned embodiments are only used to illustrate the technical solutions of the present application, not to limit them; although the present application has been described in detail with reference to the foregoing embodiments, those of ordinary skill in the art should understand that it can still implement the foregoing The technical solutions recorded in the examples are modified, or some of the technical features are equivalently replaced; these modifications or replacements do not make the essence of the corresponding technical solutions deviate from the spirit and scope of the technical solutions of the embodiments of the application, and should be included in Within the scope of protection of this application.

Claims (20)

  1. 一种智能辅助审判方法,其特征在于,包括:An intelligent auxiliary trial method, characterized in that it includes:
    从数据库中获取待判案件对应的庭审笔录文件,从所述庭审笔录文件中提取案件描述信息;Obtain the court trial transcript file corresponding to the case to be judged from the database, and extract the case description information from the court trial transcript file;
    采用分词工具对所述案件描述信息进行分词,获取目标分词结果,所述目标分词结果包括多个目标分词;Use a word segmentation tool to segment the case description information to obtain a target segmentation result, where the target segmentation result includes multiple target segmentation;
    基于每一所述目标分词查询所述关键词库,将所述关键词库中存储的与所述目标分词相匹配的原始关键词确定为目标关键词;Query the keyword database based on each target word segmentation, and determine the original keyword stored in the keyword database that matches the target word segmentation as the target keyword;
    根据所述目标关键词查询所述案件类型信息库,获取与所述目标关键词相匹配的至少一个目标案件类型;Query the case type information database according to the target keyword, and obtain at least one target case type that matches the target keyword;
    从先验知识库中获取与至少一个所述目标案件类型相匹配的先验知识,根据所述先验知识,获取目标推送法条;Acquire prior knowledge that matches at least one of the target case types from the prior knowledge base, and obtain the target push law according to the prior knowledge;
    采用语义分析模型对所述案件描述信息进行语义分析,获取标准化信息,基于所述标准化信息查询审判观点库,获取对应的推荐审判观点;Use a semantic analysis model to perform semantic analysis on the case description information, obtain standardized information, query the trial opinion database based on the standardized information, and obtain corresponding recommended trial opinions;
    根据所述目标推送法条和所述推荐审判观点,获取并显示审判建议文件。According to the target push law and the recommended trial viewpoint, obtain and display trial suggestion documents.
  2. 如权利要求1所述的智能辅助审判方法,其特征在于,在所述从数据库中获取待判案件对应的庭审笔录文件之前,所述智能辅助审判方法还包括:5. The intelligent auxiliary trial method according to claim 1, characterized in that, before said obtaining the court trial transcript corresponding to the case to be judged from the database, the intelligent auxiliary trial method further comprises:
    在客户端上显示与标准法庭审理笔录模板中当前审判环节对应的审判提示文字,获取麦克风采集的与所述审判提示文字相对应的原始语音数据;Display the trial prompt text corresponding to the current trial link in the standard court trial transcript template on the client, and obtain the original voice data collected by the microphone and corresponding to the trial prompt text;
    对所述原始语音数据进行声纹识别,获取声纹识别结果,根据所述声纹识别结果确定所述原始语音数据对应的对象标识;Perform voiceprint recognition on the original voice data, obtain a voiceprint recognition result, and determine an object identifier corresponding to the original voice data according to the voiceprint recognition result;
    对所述原始语音数据进行文本翻译,获取与所述对象标识相对应的原始文本数据,将所述对象标识和所述原始文本数据关联存储在所述标准法庭审理笔录模板的相应位置;Performing text translation on the original voice data, obtaining original text data corresponding to the object identifier, and storing the object identifier and the original text data in a corresponding position in the standard court trial transcript template;
    基于所述原始文本数据查询在先文本数据库,判断是否存在与所述原始文本数据相对应的在先文本数据;Query a prior text database based on the original text data, and determine whether there is prior text data corresponding to the original text data;
    若存在所述在先文本数据,则对所述原始文本数据和所述在先文本数据进行语义分析,确定语义分析结果,根据所述语义分析结果对所述原始文本数据进行突出显示处理,显示与所述语义分析结果相对应的审判提示文字,重复执行所述获取麦克风采集的与所述审判提示文字相对应的原始语音数据;If the prior text data exists, perform semantic analysis on the original text data and the prior text data, determine the semantic analysis result, perform highlight processing on the original text data according to the semantic analysis result, and display The trial prompt text corresponding to the semantic analysis result is repeatedly executed for obtaining the original voice data corresponding to the trial prompt text collected by the microphone;
    若不存在所述在先文本数据,则重复执行在客户端上显示与标准法庭审理笔录模板中下一审判环节对应的审判提示文字,获取麦克风采集的与所述审判提示文字相对应的原始语音数据,直至不存在下一审判环节对应的审判提示文字时,获取庭审笔录文件,并将所述庭审笔录文件存储在数据库中。If the prior text data does not exist, repeat the execution and display the trial prompt text corresponding to the next trial link in the standard court trial transcript template on the client, and obtain the original voice collected by the microphone and corresponding to the trial prompt text Data, until there is no trial prompt text corresponding to the next trial link, obtain the court trial transcript file, and store the court trial transcript file in the database.
  3. 如权利要求2所述的智能辅助审判方法,其特征在于,所述基于所述原始文本数据查询在先文本数据库,判断是否存在与所述原始文本数据相对应的在先文本数据,包括:The intelligent assisted trial method according to claim 2, wherein the querying a prior text database based on the original text data to determine whether there is prior text data corresponding to the original text data comprises:
    采用关键词提取算法对原始文本数据进行关键词提取,获取文本关键词;Use keyword extraction algorithm to extract keywords from the original text data to obtain text keywords;
    基于所述文本关键词查询同义词库,获取与所述文本关键词相对应的文本同义词;Query the thesaurus based on the text keywords, and obtain text synonyms corresponding to the text keywords;
    根据所述文本关键词和所述文本同义词查询在先文本数据库,判断是否存在包含所述文本关键词或者所述文本同义词的在先文本数据;Query a prior text database according to the text keywords and the text synonyms, and determine whether there is prior text data containing the text keywords or the text synonyms;
    若存在包含所述文本关键词或者所述文本同义词的在先文本数据,则认定存在与所述原始文本数据相对应的在先文本数据;If there is prior text data containing the text keywords or the text synonyms, it is determined that there is prior text data corresponding to the original text data;
    若不存在包含所述文本关键词或者所述文本同义词的在先文本数据,则认定不存在与所述原始文本数据相对应的在先文本数据。If there is no prior text data containing the text keywords or the text synonyms, it is determined that there is no prior text data corresponding to the original text data.
  4. 如权利要求2所述的智能辅助审判方法,其特征在于,所述根据所述语义分析结果对所述原始文本数据进行突出显示处理,显示与所述语义分析结果相对应的审判提示文字,包括:The intelligent auxiliary trial method according to claim 2, wherein the process of highlighting the original text data according to the semantic analysis result, and displaying the trial prompt text corresponding to the semantic analysis result, comprises :
    若所述语义分析结果为相同,且所述原始文本数据与所述在先文本数据对应的对象标识为同一标识,则不对所述原始文本数据进行突出显示处理,显示所述标准法庭审理笔录模板对应的下一审判环节 对应的审判提示文字;If the semantic analysis result is the same, and the object identifier corresponding to the original text data and the previous text data is the same identifier, the original text data is not highlighted, and the standard court trial transcript template is displayed Trial prompt text corresponding to the corresponding next trial session;
    若所述语义分析结果为相异,且所述原始文本数据与所述在先文本数据对应的对象标识为同一标识,则采用第一突出显示模式对所述原始文本数据进行突出显示处理,显示包括矛盾提示信息的审判提示文字;If the semantic analysis results are different, and the original text data and the object identifier corresponding to the previous text data are the same identifier, the first highlight mode is used to perform highlight processing on the original text data, and display Trial reminder text including contradictory reminder information;
    若所述语义分析结果为相同,且所述原始文本数据与所述在先文本数据对应的对象标识不为同一标识,则采用第二突出显示模式对所述原始文本数据进行突出显示处理,显示包括无争议提示信息的审判提示文字;If the semantic analysis result is the same, and the object identifier corresponding to the original text data and the previous text data is not the same identifier, the second highlighting mode is adopted to perform highlight processing on the original text data, and display Trial reminder text including no dispute reminder information;
    若所述语义分析结果为相异,且所述原始文本数据与所述在先文本数据对应的对象标识不为同一标识,则采用第三突出显示模式对所述原始文本数据进行突出显示处理,显示包括争议焦点提示信息的审判提示文字。If the semantic analysis results are different, and the object identifiers corresponding to the original text data and the previous text data are not the same identifier, the third highlighting mode is adopted to perform highlighting processing on the original text data, Display the trial reminder text including the reminder of the dispute focus.
  5. 如权利要求1所述的智能辅助审判方法,其特征在于,所述采用分词工具对所述案件描述信息进行分词,获取目标分词结果,包括:The intelligent auxiliary trial method according to claim 1, wherein said using a word segmentation tool to segment said case description information to obtain a target segmentation result comprises:
    采用结巴分词工具的搜索引擎模式对所述案件描述信息进行文本分词,获取文本分词结果,所述文本分词结果包括N个一级分词;Use the search engine mode of the stuttering word segmentation tool to perform text segmentation on the case description information to obtain a text segmentation result, and the text segmentation result includes N first-level word segmentation;
    若任意连续k个所述一级分词中连续k-1个所述一级分词的叠加等于所述第k个一级分词,且连续k-1个所述一级分词中存在至少两个一级分词的结合等于第k个一级分词,则仅保留结合等于第k个一级分词的至少两个所述一级分词作为所述目标分词,获取目标分词结果;If the superposition of consecutive k-1 first-level participles in any continuous k first-level participles is equal to the k-th first-level participle, and there are at least two consecutive k-1 first-level participles If the combination of graded participles is equal to the k-th first-level participle, only at least two of the first-level participles whose combination is equal to the k-th first-level participle are retained as the target participles, and the target segmentation result is obtained;
    若任意连续k个所述一级分词中连续k-1个所述一级分词的叠加等于所述第k个一级分词,且任意连续k个所述一级分词中不存在至少两个一级分词的结合等于第k个一级分词,则删除所述前k-1个所述一级分词,保留所述第k个一级分词作为所述目标分词,获取目标分词结果;If the superposition of k-1 consecutive first-level participles in any continuous k first-level participles is equal to the k-th first-level participle, and there are no at least two one in any continuous k first-level participles The combination of the graded participle is equal to the k-th first-level participle, then delete the first k-1 first-level participles, keep the k-th first-level participle as the target participle, and obtain the target participle result;
    若任意连续k个所述一级分词中连续k-1个一级分词的结合等于所述第k个一级分词,则删除所述第k个一级分词,保留前k-1个所述一级分词作为所述目标分词,获取目标分词结果。If the combination of k-1 first-level participles in any continuous k first-level participles is equal to the k-th first-level participle, delete the k-th first-level participle and keep the first k-1 The first-level word segmentation is used as the target word segmentation to obtain the target word segmentation result.
  6. 如权利要求1所述的智能辅助审判方法,其特征在于,在所述从数据库中获取待判案件对应的庭审笔录文件之前,所述智能辅助审判方法还包括:5. The intelligent auxiliary trial method according to claim 1, characterized in that, before said obtaining the court trial transcript corresponding to the case to be judged from the database, the intelligent auxiliary trial method further comprises:
    获取数据更新任务,所述数据更新任务包括原始案件类型和任务更新时间;Acquiring a data update task, the data update task including the original case type and task update time;
    若与所述原始案件类型相对应的判案依据在所述任务更新时间以后发生变更,则确定变更时间,获取所述变更时间和系统当前时间之间的与原始案件类型相对应的历史判案数据,根据所述历史判案数据确定待处理数据;If the judgment basis corresponding to the original case type is changed after the task update time, the change time is determined, and the historical judgments corresponding to the original case type between the change time and the current system time are obtained Data, the data to be processed is determined based on the historical judgment data;
    若与所述原始案件类型相对应的判案依据在所述任务更新时间以后没有发生变更,则获取系统当前时间之前预设周期内的与原始案件类型相对应的历史判案数据,根据所述历史判案数据确定待处理数据;If the judgment basis corresponding to the original case type does not change after the task update time, then the historical judgment data corresponding to the original case type in the preset period before the current time of the system is obtained, and according to the Historical judgment data to determine the data to be processed;
    从所述待处理数据中提取先验信息,基于所述先验信息构建与所述原始案件类型相对应的先验知识库;Extracting prior information from the to-be-processed data, and constructing a priori knowledge base corresponding to the original case type based on the prior information;
    从所述待处理数据提取历史描述信息和历史审判观点,基于所述历史描述信息和所述历史审判观点,构建与所述原始案件类型相对应的审判观点库。Extracting historical description information and historical judgment opinions from the to-be-processed data, and constructing a judgment opinion database corresponding to the original case type based on the historical description information and the historical judgment opinions.
  7. 如权利要求6所述的智能辅助审判方法,其特征在于,所述待处理数据包括目标区域;8. The intelligent auxiliary trial method of claim 6, wherein the data to be processed includes a target area;
    所述根据所述历史判案数据确定待处理数据,包括:The determining the data to be processed according to the historical judgment data includes:
    确定任一所述目标区域对应的历史判案数据的目标数据量;Determining the target data volume of historical judgment data corresponding to any of the target areas;
    若所述目标数据量大于预设数量阈值,则将同一所述目标区域和所述原始案件类型对应的历史判案数据,确定为待处理数据;If the target data amount is greater than a preset number threshold, determining the historical judgment data corresponding to the same target area and the original case type as the data to be processed;
    若所述目标数据量不大于预设数量阈值,则将同一所述原始案件类型对应的历史判案数据,确定为待处理数据。If the target data volume is not greater than the preset number threshold, the historical judgment data corresponding to the same original case type is determined as the data to be processed.
  8. 一种智能辅助审判装置,其特征在于,包括:An intelligent auxiliary trial device, characterized by comprising:
    案件描述信息获取模块,用于从数据库中获取待判案件对应的庭审笔录文件,从所述庭审笔录文件中提取案件描述信息;The case description information acquisition module is used to obtain the court trial transcript file corresponding to the case to be judged from the database, and extract the case description information from the court trial transcript file;
    目标分词结果获取模块,用于采用分词工具对所述案件描述信息进行分词,获取目标分词结果,所述目标分词结果包括多个目标分词;The target word segmentation result acquisition module is configured to use a word segmentation tool to segment the case description information to obtain a target word segmentation result, where the target word segmentation result includes multiple target word segmentation;
    目标关键词确定模块,用于基于每一所述目标分词查询所述关键词库,将所述关键词库中存储的与所述目标分词相匹配的原始关键词确定为目标关键词;The target keyword determining module is configured to query the keyword database based on each target word segmentation, and determine the original keywords stored in the keyword database that match the target word segmentation as target keywords;
    目标案件类型确定模块,用于根据所述目标关键词查询所述案件类型信息库,获取与所述目标关键词相匹配的至少一个目标案件类型;The target case type determination module is configured to query the case type information database according to the target keyword, and obtain at least one target case type that matches the target keyword;
    目标推送法条获取模块,从先验知识库中获取与至少一个所述目标案件类型相匹配的先验知识,根据所述先验知识,获取目标推送法条;The target push law acquisition module acquires prior knowledge matching at least one of the target case types from the prior knowledge base, and obtains the target push law according to the prior knowledge;
    推荐审判观点获取模块,用于采用语义分析模型对所述案件描述信息进行语义分析,获取标准化信息,基于所述标准化信息查询审判观点库,获取对应的推荐审判观点;The recommended trial viewpoint acquisition module is configured to use a semantic analysis model to perform semantic analysis on the case description information, obtain standardized information, query the trial viewpoint database based on the standardized information, and obtain corresponding recommended trial viewpoints;
    审判建议文件获取模块,用于根据所述目标推送法条和所述推荐审判观点,获取并显示审判建议文件。The trial suggestion file acquisition module is used to push the law and the recommended trial viewpoint according to the target, and acquire and display the trial suggestion file.
  9. 一种计算机设备,包括存储器、处理器以及存储在所述存储器中并可在所述处理器上运行的计算机可读指令,其特征在于,所述处理器执行所述计算机可读指令时实现如下步骤:A computer device includes a memory, a processor, and computer-readable instructions stored in the memory and capable of running on the processor, wherein the processor executes the computer-readable instructions as follows step:
    从数据库中获取待判案件对应的庭审笔录文件,从所述庭审笔录文件中提取案件描述信息;Obtain the court trial transcript file corresponding to the case to be judged from the database, and extract the case description information from the court trial transcript file;
    采用分词工具对所述案件描述信息进行分词,获取目标分词结果,所述目标分词结果包括多个目标分词;Use a word segmentation tool to segment the case description information to obtain a target segmentation result, where the target segmentation result includes multiple target segmentation;
    基于每一所述目标分词查询所述关键词库,将所述关键词库中存储的与所述目标分词相匹配的原始关键词确定为目标关键词;Query the keyword database based on each target word segmentation, and determine the original keyword stored in the keyword database that matches the target word segmentation as the target keyword;
    根据所述目标关键词查询所述案件类型信息库,获取与所述目标关键词相匹配的至少一个目标案件类型;Query the case type information database according to the target keyword, and obtain at least one target case type that matches the target keyword;
    从先验知识库中获取与至少一个所述目标案件类型相匹配的先验知识,根据所述先验知识,获取目标推送法条;Acquire prior knowledge that matches at least one of the target case types from the prior knowledge base, and obtain the target push law according to the prior knowledge;
    采用语义分析模型对所述案件描述信息进行语义分析,获取标准化信息,基于所述标准化信息查询审判观点库,获取对应的推荐审判观点;Use a semantic analysis model to perform semantic analysis on the case description information, obtain standardized information, query the trial opinion database based on the standardized information, and obtain corresponding recommended trial opinions;
    根据所述目标推送法条和所述推荐审判观点,获取并显示审判建议文件。According to the target push law and the recommended trial viewpoint, obtain and display trial suggestion documents.
  10. 如权利要求9所述的计算机设备,其特征在于,在所述从数据库中获取待判案件对应的庭审笔录文件之前,所述处理器执行所述计算机可读指令时还实现如下步骤:9. The computer device according to claim 9, wherein before the obtaining the court trial transcript file corresponding to the case to be judged from the database, the processor further implements the following steps when executing the computer-readable instruction:
    在客户端上显示与标准法庭审理笔录模板中当前审判环节对应的审判提示文字,获取麦克风采集的与所述审判提示文字相对应的原始语音数据;Display the trial prompt text corresponding to the current trial link in the standard court trial transcript template on the client, and obtain the original voice data collected by the microphone and corresponding to the trial prompt text;
    对所述原始语音数据进行声纹识别,获取声纹识别结果,根据所述声纹识别结果确定所述原始语音数据对应的对象标识;Perform voiceprint recognition on the original voice data, obtain a voiceprint recognition result, and determine an object identifier corresponding to the original voice data according to the voiceprint recognition result;
    对所述原始语音数据进行文本翻译,获取与所述对象标识相对应的原始文本数据,将所述对象标识和所述原始文本数据关联存储在所述标准法庭审理笔录模板的相应位置;Performing text translation on the original voice data, obtaining original text data corresponding to the object identifier, and storing the object identifier and the original text data in a corresponding position in the standard court trial transcript template;
    基于所述原始文本数据查询在先文本数据库,判断是否存在与所述原始文本数据相对应的在先文本数据;Query a prior text database based on the original text data, and determine whether there is prior text data corresponding to the original text data;
    若存在所述在先文本数据,则对所述原始文本数据和所述在先文本数据进行语义分析,确定语义分析结果,根据所述语义分析结果对所述原始文本数据进行突出显示处理,显示与所述语义分析结果相对应的审判提示文字,重复执行所述获取麦克风采集的与所述审判提示文字相对应的原始语音数据;If the prior text data exists, perform semantic analysis on the original text data and the prior text data, determine the semantic analysis result, perform highlight processing on the original text data according to the semantic analysis result, and display The trial prompt text corresponding to the semantic analysis result is repeatedly executed for obtaining the original voice data corresponding to the trial prompt text collected by the microphone;
    若不存在所述在先文本数据,则重复执行在客户端上显示与标准法庭审理笔录模板中下一审判环节对应的审判提示文字,获取麦克风采集的与所述审判提示文字相对应的原始语音数据,直至不存在下一审判环节对应的审判提示文字时,获取庭审笔录文件,并将所述庭审笔录文件存储在数据库中。If the prior text data does not exist, repeat the execution and display the trial prompt text corresponding to the next trial link in the standard court trial transcript template on the client, and obtain the original voice collected by the microphone and corresponding to the trial prompt text Data, until there is no trial prompt text corresponding to the next trial link, obtain the court trial transcript file, and store the court trial transcript file in the database.
  11. 如权利要求10所述的计算机设备,其特征在于,所述基于所述原始文本数据查询在先文本数据库,判断是否存在与所述原始文本数据相对应的在先文本数据,包括:10. The computer device of claim 10, wherein the querying a prior text database based on the original text data to determine whether there is prior text data corresponding to the original text data comprises:
    采用关键词提取算法对原始文本数据进行关键词提取,获取文本关键词;Use keyword extraction algorithm to extract keywords from the original text data to obtain text keywords;
    基于所述文本关键词查询同义词库,获取与所述文本关键词相对应的文本同义词;Query the thesaurus based on the text keywords, and obtain text synonyms corresponding to the text keywords;
    根据所述文本关键词和所述文本同义词查询在先文本数据库,判断是否存在包含所述文本关键词或者所述文本同义词的在先文本数据;Query a prior text database according to the text keywords and the text synonyms, and determine whether there is prior text data containing the text keywords or the text synonyms;
    若存在包含所述文本关键词或者所述文本同义词的在先文本数据,则认定存在与所述原始文本数据相对应的在先文本数据;If there is prior text data containing the text keywords or the text synonyms, it is determined that there is prior text data corresponding to the original text data;
    若不存在包含所述文本关键词或者所述文本同义词的在先文本数据,则认定不存在与所述原始文本数据相对应的在先文本数据。If there is no prior text data containing the text keywords or the text synonyms, it is determined that there is no prior text data corresponding to the original text data.
  12. 如权利要求10所述的计算机设备,其特征在于,所述根据所述语义分析结果对所述原始文本数据进行突出显示处理,显示与所述语义分析结果相对应的审判提示文字,包括:10. The computer device according to claim 10, wherein said performing highlight processing on said original text data according to said semantic analysis result, and displaying trial prompt text corresponding to said semantic analysis result, comprises:
    若所述语义分析结果为相同,且所述原始文本数据与所述在先文本数据对应的对象标识为同一标识,则不对所述原始文本数据进行突出显示处理,显示所述标准法庭审理笔录模板对应的下一审判环节对应的审判提示文字;If the semantic analysis result is the same, and the object identifier corresponding to the original text data and the previous text data is the same identifier, the original text data is not highlighted, and the standard court trial transcript template is displayed Trial prompt text corresponding to the corresponding next trial session;
    若所述语义分析结果为相异,且所述原始文本数据与所述在先文本数据对应的对象标识为同一标识,则采用第一突出显示模式对所述原始文本数据进行突出显示处理,显示包括矛盾提示信息的审判提示文字;If the semantic analysis results are different, and the original text data and the object identifier corresponding to the previous text data are the same identifier, the first highlight mode is used to perform highlight processing on the original text data, and display Trial reminder text including contradictory reminder information;
    若所述语义分析结果为相同,且所述原始文本数据与所述在先文本数据对应的对象标识不为同一标识,则采用第二突出显示模式对所述原始文本数据进行突出显示处理,显示包括无争议提示信息的审判提示文字;If the semantic analysis result is the same, and the object identifier corresponding to the original text data and the previous text data is not the same identifier, the second highlighting mode is adopted to perform highlight processing on the original text data, and display Trial reminder text including no dispute reminder information;
    若所述语义分析结果为相异,且所述原始文本数据与所述在先文本数据对应的对象标识不为同一标识,则采用第三突出显示模式对所述原始文本数据进行突出显示处理,显示包括争议焦点提示信息的审判提示文字。If the semantic analysis results are different, and the object identifiers corresponding to the original text data and the previous text data are not the same identifier, the third highlighting mode is adopted to perform highlighting processing on the original text data, Display the trial reminder text including the reminder of the dispute focus.
  13. 如权利要求9所述的计算机设备,其特征在于,所述采用分词工具对所述案件描述信息进行分词,获取目标分词结果,包括:9. The computer device according to claim 9, wherein said using a word segmentation tool to segment said case description information to obtain a target segmentation result comprises:
    采用结巴分词工具的搜索引擎模式对所述案件描述信息进行文本分词,获取文本分词结果,所述文本分词结果包括N个一级分词;Use the search engine mode of the stuttering word segmentation tool to perform text segmentation on the case description information to obtain a text segmentation result, and the text segmentation result includes N first-level word segmentation;
    若任意连续k个所述一级分词中连续k-1个所述一级分词的叠加等于所述第k个一级分词,且连续k-1个所述一级分词中存在至少两个一级分词的结合等于第k个一级分词,则仅保留结合等于第k个一级分词的至少两个所述一级分词作为所述目标分词,获取目标分词结果;If the superposition of consecutive k-1 first-level participles in any continuous k first-level participles is equal to the k-th first-level participle, and there are at least two consecutive k-1 first-level participles If the combination of graded participles is equal to the k-th first-level participle, only at least two of the first-level participles whose combination is equal to the k-th first-level participle are retained as the target participles, and the target segmentation result is obtained;
    若任意连续k个所述一级分词中连续k-1个所述一级分词的叠加等于所述第k个一级分词,且任意连续k个所述一级分词中不存在至少两个一级分词的结合等于第k个一级分词,则删除所述前k-1个所述一级分词,保留所述第k个一级分词作为所述目标分词,获取目标分词结果;If the superposition of k-1 consecutive first-level participles in any continuous k first-level participles is equal to the k-th first-level participle, and there are no at least two one in any continuous k first-level participles The combination of the graded participle is equal to the k-th first-level participle, then delete the first k-1 first-level participles, keep the k-th first-level participle as the target participle, and obtain the target participle result;
    若任意连续k个所述一级分词中连续k-1个一级分词的结合等于所述第k个一级分词,则删除所述第k个一级分词,保留前k-1个所述一级分词作为所述目标分词,获取目标分词结果。If the combination of k-1 first-level participles in any continuous k first-level participles is equal to the k-th first-level participle, delete the k-th first-level participle and keep the first k-1 The first-level word segmentation is used as the target word segmentation to obtain the target word segmentation result.
  14. 如权利要求9所述的计算机设备,其特征在于,在所述从数据库中获取待判案件对应的庭审笔录文件之前,所述处理器执行所述计算机可读指令时还实现如下步骤:9. The computer device according to claim 9, wherein before the obtaining the court trial transcript file corresponding to the case to be judged from the database, the processor further implements the following steps when executing the computer-readable instruction:
    获取数据更新任务,所述数据更新任务包括原始案件类型和任务更新时间;Acquiring a data update task, the data update task including the original case type and task update time;
    若与所述原始案件类型相对应的判案依据在所述任务更新时间以后发生变更,则确定变更时间,获取所述变更时间和系统当前时间之间的与原始案件类型相对应的历史判案数据,根据所述历史判案数据确定待处理数据;If the judgment basis corresponding to the original case type is changed after the task update time, the change time is determined, and the historical judgments corresponding to the original case type between the change time and the current system time are obtained Data, the data to be processed is determined based on the historical judgment data;
    若与所述原始案件类型相对应的判案依据在所述任务更新时间以后没有发生变更,则获取系统当前时间之前预设周期内的与原始案件类型相对应的历史判案数据,根据所述历史判案数据确定待处理数据;If the judgment basis corresponding to the original case type does not change after the task update time, then the historical judgment data corresponding to the original case type in the preset period before the current time of the system is obtained, and according to the Historical judgment data to determine the data to be processed;
    从所述待处理数据中提取先验信息,基于所述先验信息构建与所述原始案件类型相对应的先验知识库;Extracting prior information from the to-be-processed data, and constructing a priori knowledge base corresponding to the original case type based on the prior information;
    从所述待处理数据提取历史描述信息和历史审判观点,基于所述历史描述信息和所述历史审判观 点,构建与所述原始案件类型相对应的审判观点库。Extract historical description information and historical judgment viewpoints from the data to be processed, and construct a judgment viewpoint database corresponding to the original case type based on the historical description information and the historical judgment viewpoints.
  15. 一个或多个存储有计算机可读指令的非易失性可读存储介质,所述计算机可读存储介质存储有计算机可读指令,其特征在于,所述计算机可读指令被一个或多个处理器执行时,使得所述一个或多个处理器执行如下步骤:One or more non-volatile readable storage media storing computer readable instructions, the computer readable storage medium storing computer readable instructions, characterized in that the computer readable instructions are processed by one or more When the processor executes, the one or more processors execute the following steps:
    从数据库中获取待判案件对应的庭审笔录文件,从所述庭审笔录文件中提取案件描述信息;Obtain the court trial transcript file corresponding to the case to be judged from the database, and extract the case description information from the court trial transcript file;
    采用分词工具对所述案件描述信息进行分词,获取目标分词结果,所述目标分词结果包括多个目标分词;Use a word segmentation tool to segment the case description information to obtain a target segmentation result, where the target segmentation result includes multiple target segmentation;
    基于每一所述目标分词查询所述关键词库,将所述关键词库中存储的与所述目标分词相匹配的原始关键词确定为目标关键词;Query the keyword database based on each target word segmentation, and determine the original keyword stored in the keyword database that matches the target word segmentation as the target keyword;
    根据所述目标关键词查询所述案件类型信息库,获取与所述目标关键词相匹配的至少一个目标案件类型;Query the case type information database according to the target keyword, and obtain at least one target case type that matches the target keyword;
    从先验知识库中获取与至少一个所述目标案件类型相匹配的先验知识,根据所述先验知识,获取目标推送法条;Acquire prior knowledge that matches at least one of the target case types from the prior knowledge base, and obtain the target push law according to the prior knowledge;
    采用语义分析模型对所述案件描述信息进行语义分析,获取标准化信息,基于所述标准化信息查询审判观点库,获取对应的推荐审判观点;Use a semantic analysis model to perform semantic analysis on the case description information, obtain standardized information, query the trial opinion database based on the standardized information, and obtain corresponding recommended trial opinions;
    根据所述目标推送法条和所述推荐审判观点,获取并显示审判建议文件。According to the target push law and the recommended trial viewpoint, obtain and display trial suggestion documents.
  16. 如权利要求15所述的非易失性可读存储介质,其特征在于,在所述从数据库中获取待判案件对应的庭审笔录文件之前,所述计算机可读指令被一个或多个处理器执行时,使得所述一个或多个处理器还执行如下步骤:The non-volatile readable storage medium of claim 15, wherein the computer-readable instructions are executed by one or more processors before the trial transcript file corresponding to the case to be judged is obtained from the database. During execution, the one or more processors are caused to further execute the following steps:
    在客户端上显示与标准法庭审理笔录模板中当前审判环节对应的审判提示文字,获取麦克风采集的与所述审判提示文字相对应的原始语音数据;Display the trial prompt text corresponding to the current trial link in the standard court trial transcript template on the client, and obtain the original voice data collected by the microphone and corresponding to the trial prompt text;
    对所述原始语音数据进行声纹识别,获取声纹识别结果,根据所述声纹识别结果确定所述原始语音数据对应的对象标识;Perform voiceprint recognition on the original voice data, obtain a voiceprint recognition result, and determine an object identifier corresponding to the original voice data according to the voiceprint recognition result;
    对所述原始语音数据进行文本翻译,获取与所述对象标识相对应的原始文本数据,将所述对象标识和所述原始文本数据关联存储在所述标准法庭审理笔录模板的相应位置;Performing text translation on the original voice data, obtaining original text data corresponding to the object identifier, and storing the object identifier and the original text data in a corresponding position in the standard court trial transcript template;
    基于所述原始文本数据查询在先文本数据库,判断是否存在与所述原始文本数据相对应的在先文本数据;Query a prior text database based on the original text data, and determine whether there is prior text data corresponding to the original text data;
    若存在所述在先文本数据,则对所述原始文本数据和所述在先文本数据进行语义分析,确定语义分析结果,根据所述语义分析结果对所述原始文本数据进行突出显示处理,显示与所述语义分析结果相对应的审判提示文字,重复执行所述获取麦克风采集的与所述审判提示文字相对应的原始语音数据;If the prior text data exists, perform semantic analysis on the original text data and the prior text data, determine the semantic analysis result, perform highlight processing on the original text data according to the semantic analysis result, and display The trial prompt text corresponding to the semantic analysis result is repeatedly executed for obtaining the original voice data corresponding to the trial prompt text collected by the microphone;
    若不存在所述在先文本数据,则重复执行在客户端上显示与标准法庭审理笔录模板中下一审判环节对应的审判提示文字,获取麦克风采集的与所述审判提示文字相对应的原始语音数据,直至不存在下一审判环节对应的审判提示文字时,获取庭审笔录文件,并将所述庭审笔录文件存储在数据库中。If the prior text data does not exist, repeat the execution and display the trial prompt text corresponding to the next trial link in the standard court trial transcript template on the client, and obtain the original voice collected by the microphone and corresponding to the trial prompt text Data, until there is no trial prompt text corresponding to the next trial link, obtain the court trial transcript file, and store the court trial transcript file in the database.
  17. 如权利要求16所述的非易失性可读存储介质,其特征在于,所述基于所述原始文本数据查询在先文本数据库,判断是否存在与所述原始文本数据相对应的在先文本数据,包括:The non-volatile readable storage medium according to claim 16, wherein the query a prior text database based on the original text data to determine whether there is prior text data corresponding to the original text data ,include:
    采用关键词提取算法对原始文本数据进行关键词提取,获取文本关键词;Use keyword extraction algorithm to extract keywords from the original text data to obtain text keywords;
    基于所述文本关键词查询同义词库,获取与所述文本关键词相对应的文本同义词;Query the thesaurus based on the text keywords, and obtain text synonyms corresponding to the text keywords;
    根据所述文本关键词和所述文本同义词查询在先文本数据库,判断是否存在包含所述文本关键词或者所述文本同义词的在先文本数据;Query a prior text database according to the text keywords and the text synonyms, and determine whether there is prior text data containing the text keywords or the text synonyms;
    若存在包含所述文本关键词或者所述文本同义词的在先文本数据,则认定存在与所述原始文本数据相对应的在先文本数据;If there is prior text data containing the text keywords or the text synonyms, it is determined that there is prior text data corresponding to the original text data;
    若不存在包含所述文本关键词或者所述文本同义词的在先文本数据,则认定不存在与所述原始文本数据相对应的在先文本数据。If there is no prior text data containing the text keywords or the text synonyms, it is determined that there is no prior text data corresponding to the original text data.
  18. 如权利要求16所述的非易失性可读存储介质,其特征在于,所述根据所述语义分析结果对所述原始文本数据进行突出显示处理,显示与所述语义分析结果相对应的审判提示文字,包括:The non-volatile readable storage medium of claim 16, wherein the original text data is highlighted according to the semantic analysis result, and the trial corresponding to the semantic analysis result is displayed Prompt text, including:
    若所述语义分析结果为相同,且所述原始文本数据与所述在先文本数据对应的对象标识为同一标识,则不对所述原始文本数据进行突出显示处理,显示所述标准法庭审理笔录模板对应的下一审判环节对应的审判提示文字;If the semantic analysis result is the same, and the object identifier corresponding to the original text data and the previous text data is the same identifier, the original text data is not highlighted, and the standard court trial transcript template is displayed Trial prompt text corresponding to the corresponding next trial session;
    若所述语义分析结果为相异,且所述原始文本数据与所述在先文本数据对应的对象标识为同一标识,则采用第一突出显示模式对所述原始文本数据进行突出显示处理,显示包括矛盾提示信息的审判提示文字;If the semantic analysis results are different, and the original text data and the object identifier corresponding to the previous text data are the same identifier, the first highlight mode is used to perform highlight processing on the original text data, and display Trial reminder text including contradictory reminder information;
    若所述语义分析结果为相同,且所述原始文本数据与所述在先文本数据对应的对象标识不为同一标识,则采用第二突出显示模式对所述原始文本数据进行突出显示处理,显示包括无争议提示信息的审判提示文字;If the semantic analysis result is the same, and the object identifier corresponding to the original text data and the previous text data is not the same identifier, the second highlighting mode is adopted to perform highlight processing on the original text data, and display Trial reminder text including no dispute reminder information;
    若所述语义分析结果为相异,且所述原始文本数据与所述在先文本数据对应的对象标识不为同一标识,则采用第三突出显示模式对所述原始文本数据进行突出显示处理,显示包括争议焦点提示信息的审判提示文字。If the semantic analysis results are different, and the object identifiers corresponding to the original text data and the previous text data are not the same identifier, the third highlighting mode is adopted to perform highlighting processing on the original text data, Display the trial reminder text including the reminder of the dispute focus.
  19. 如权利要求15所述的非易失性可读存储介质,其特征在于,所述采用分词工具对所述案件描述信息进行分词,获取目标分词结果,包括:15. The non-volatile readable storage medium according to claim 15, wherein said using a word segmentation tool to segment said case description information to obtain a target segmentation result comprises:
    采用结巴分词工具的搜索引擎模式对所述案件描述信息进行文本分词,获取文本分词结果,所述文本分词结果包括N个一级分词;Use the search engine mode of the stuttering word segmentation tool to perform text segmentation on the case description information to obtain a text segmentation result, and the text segmentation result includes N first-level word segmentation;
    若任意连续k个所述一级分词中连续k-1个所述一级分词的叠加等于所述第k个一级分词,且连续k-1个所述一级分词中存在至少两个一级分词的结合等于第k个一级分词,则仅保留结合等于第k个一级分词的至少两个所述一级分词作为所述目标分词,获取目标分词结果;If the superposition of consecutive k-1 first-level participles in any continuous k first-level participles is equal to the k-th first-level participle, and there are at least two consecutive k-1 first-level participles If the combination of graded participles is equal to the k-th first-level participle, only at least two of the first-level participles whose combination is equal to the k-th first-level participle are retained as the target participles, and the target segmentation result is obtained;
    若任意连续k个所述一级分词中连续k-1个所述一级分词的叠加等于所述第k个一级分词,且任意连续k个所述一级分词中不存在至少两个一级分词的结合等于第k个一级分词,则删除所述前k-1个所述一级分词,保留所述第k个一级分词作为所述目标分词,获取目标分词结果;If the superposition of k-1 consecutive first-level participles in any continuous k first-level participles is equal to the k-th first-level participle, and there are no at least two one in any continuous k first-level participles The combination of the graded participle is equal to the k-th first-level participle, then delete the first k-1 first-level participles, keep the k-th first-level participle as the target participle, and obtain the target participle result;
    若任意连续k个所述一级分词中连续k-1个一级分词的结合等于所述第k个一级分词,则删除所述第k个一级分词,保留前k-1个所述一级分词作为所述目标分词,获取目标分词结果。If the combination of k-1 first-level participles in any continuous k first-level participles is equal to the k-th first-level participle, delete the k-th first-level participle and keep the first k-1 The first-level word segmentation is used as the target word segmentation to obtain the target word segmentation result.
  20. 如权利要求15所述的非易失性可读存储介质,其特征在于,在所述从数据库中获取待判案件对应的庭审笔录文件之前,所述计算机可读指令被一个或多个处理器执行时,使得所述一个或多个处理器还执行如下步骤:The non-volatile readable storage medium of claim 15, wherein the computer-readable instructions are executed by one or more processors before the trial transcript file corresponding to the case to be judged is obtained from the database. During execution, the one or more processors are caused to further execute the following steps:
    获取数据更新任务,所述数据更新任务包括原始案件类型和任务更新时间;Acquiring a data update task, the data update task including the original case type and task update time;
    若与所述原始案件类型相对应的判案依据在所述任务更新时间以后发生变更,则确定变更时间,获取所述变更时间和系统当前时间之间的与原始案件类型相对应的历史判案数据,根据所述历史判案数据确定待处理数据;If the judgment basis corresponding to the original case type is changed after the task update time, the change time is determined, and the historical judgments corresponding to the original case type between the change time and the current system time are obtained Data, the data to be processed is determined based on the historical judgment data;
    若与所述原始案件类型相对应的判案依据在所述任务更新时间以后没有发生变更,则获取系统当前时间之前预设周期内的与原始案件类型相对应的历史判案数据,根据所述历史判案数据确定待处理数据;If the judgment basis corresponding to the original case type does not change after the task update time, then the historical judgment data corresponding to the original case type in the preset period before the current time of the system is obtained, and according to the Historical judgment data to determine the data to be processed;
    从所述待处理数据中提取先验信息,基于所述先验信息构建与所述原始案件类型相对应的先验知识库;Extracting prior information from the to-be-processed data, and constructing a priori knowledge base corresponding to the original case type based on the prior information;
    从所述待处理数据提取历史描述信息和历史审判观点,基于所述历史描述信息和所述历史审判观点,构建与所述原始案件类型相对应的审判观点库。Extracting historical description information and historical judgment opinions from the to-be-processed data, and constructing a judgment opinion database corresponding to the original case type based on the historical description information and the historical judgment opinions.
PCT/CN2019/116487 2019-08-16 2019-11-08 Intelligent auxiliary judgment method and apparatus, and computer device and storage medium WO2021031383A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201910756628.7 2019-08-16
CN201910756628.7A CN110675288B (en) 2019-08-16 2019-08-16 Intelligent auxiliary judgment method, device, computer equipment and storage medium

Publications (1)

Publication Number Publication Date
WO2021031383A1 true WO2021031383A1 (en) 2021-02-25

Family

ID=69075331

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2019/116487 WO2021031383A1 (en) 2019-08-16 2019-11-08 Intelligent auxiliary judgment method and apparatus, and computer device and storage medium

Country Status (2)

Country Link
CN (1) CN110675288B (en)
WO (1) WO2021031383A1 (en)

Cited By (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112884440A (en) * 2021-03-02 2021-06-01 岭东核电有限公司 Test procedure execution method and device in nuclear power test and computer equipment
CN113570269A (en) * 2021-08-03 2021-10-29 工银科技有限公司 Operation and maintenance project management method, device, equipment, medium and program product
CN114416988A (en) * 2022-01-17 2022-04-29 国网福建省电力有限公司 Defect automatic rating and disposal suggestion pushing method based on natural language processing
CN116596709A (en) * 2023-07-19 2023-08-15 北京分音塔科技有限公司 Auxiliary judging method, device, equipment and storage medium
CN116758947A (en) * 2023-08-14 2023-09-15 北京分音塔科技有限公司 Auxiliary judgment method, device, equipment and storage medium based on audio emotion
CN117009605A (en) * 2023-08-08 2023-11-07 四川大学 Strategic innovation design problem solving method and system
CN117035406A (en) * 2023-07-31 2023-11-10 北京华夏电通科技股份有限公司 Intelligent control method, device and equipment for judging flow
CN117371916A (en) * 2023-12-05 2024-01-09 智粤铁路设备有限公司 Data processing method based on digital maintenance and intelligent management system for measuring tool
CN117473074A (en) * 2023-11-01 2024-01-30 中国通信建设集团有限公司数智科创分公司 Judicial case intelligent information matching system and method based on artificial intelligence
CN117609487A (en) * 2024-01-19 2024-02-27 武汉百智诚远科技有限公司 Legal provision quick retrieval method and system based on artificial intelligence

Families Citing this family (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111563107A (en) * 2020-05-25 2020-08-21 泰康保险集团股份有限公司 Information recommendation method and device, electronic equipment and storage medium
CN111930933A (en) * 2020-05-29 2020-11-13 深圳壹账通智能科技有限公司 Detection case processing method and device based on artificial intelligence
CN111709236B (en) * 2020-05-29 2024-01-09 中山大学 Judgment risk early warning method based on case similarity matching
CN111859888B (en) * 2020-07-22 2024-04-02 北京致医健康信息技术有限公司 Diagnosis assisting method, diagnosis assisting device, electronic equipment and storage medium
CN112559748A (en) * 2020-12-18 2021-03-26 厦门市法度信息科技有限公司 Method for classifying stroke record data records, terminal equipment and storage medium
CN112765974B (en) * 2021-01-19 2023-11-24 卡奥斯数字科技(北京)有限公司 Service assistance method, electronic equipment and readable storage medium
CN113157722B (en) * 2021-04-01 2023-12-26 北京达佳互联信息技术有限公司 Data processing method, device, server, system and storage medium
CN113283765A (en) * 2021-05-31 2021-08-20 浙江环玛信息科技有限公司 Intelligent court case data processing method and system
CN113609256B (en) * 2021-08-05 2022-03-15 郑州银丰电子科技有限公司 Smart court management system and method based on big data
CN114492436A (en) * 2022-02-11 2022-05-13 国家电网有限公司华东分部 Processing method, device and system for auditing interview information
CN116244315B (en) * 2022-12-08 2023-11-10 南京擎盾信息科技有限公司 Method and system for dynamically updating timeliness of legal and regulatory database

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20090070099A1 (en) * 2006-10-10 2009-03-12 Konstantin Anisimovich Method for translating documents from one language into another using a database of translations, a terminology dictionary, a translation dictionary, and a machine translation system
CN107633465A (en) * 2017-08-21 2018-01-26 厦门能见易判信息科技有限公司 Intelligence aids in method of deciding a case
CN107818138A (en) * 2017-09-28 2018-03-20 银江股份有限公司 A kind of case legal regulation recommends method and system
CN108009299A (en) * 2017-12-28 2018-05-08 北京市律典通科技有限公司 Law tries method and device for business processing
CN108595532A (en) * 2018-04-02 2018-09-28 三峡大学 A kind of quantum clustering system and method for Law Text
CN109002431A (en) * 2018-07-20 2018-12-14 吴怡 Legal opinion generates system and production method

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
AU2009201864A1 (en) * 2009-05-11 2010-11-25 Fiona MacPhee Semi-automated court document production
CN109192213B (en) * 2018-08-21 2023-10-20 平安科技(深圳)有限公司 Method and device for real-time transcription of court trial voice, computer equipment and storage medium

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20090070099A1 (en) * 2006-10-10 2009-03-12 Konstantin Anisimovich Method for translating documents from one language into another using a database of translations, a terminology dictionary, a translation dictionary, and a machine translation system
CN107633465A (en) * 2017-08-21 2018-01-26 厦门能见易判信息科技有限公司 Intelligence aids in method of deciding a case
CN107818138A (en) * 2017-09-28 2018-03-20 银江股份有限公司 A kind of case legal regulation recommends method and system
CN108009299A (en) * 2017-12-28 2018-05-08 北京市律典通科技有限公司 Law tries method and device for business processing
CN108595532A (en) * 2018-04-02 2018-09-28 三峡大学 A kind of quantum clustering system and method for Law Text
CN109002431A (en) * 2018-07-20 2018-12-14 吴怡 Legal opinion generates system and production method

Cited By (16)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112884440A (en) * 2021-03-02 2021-06-01 岭东核电有限公司 Test procedure execution method and device in nuclear power test and computer equipment
CN112884440B (en) * 2021-03-02 2024-05-24 岭东核电有限公司 Test procedure execution method and device in nuclear power test and computer equipment
CN113570269A (en) * 2021-08-03 2021-10-29 工银科技有限公司 Operation and maintenance project management method, device, equipment, medium and program product
CN114416988A (en) * 2022-01-17 2022-04-29 国网福建省电力有限公司 Defect automatic rating and disposal suggestion pushing method based on natural language processing
CN116596709A (en) * 2023-07-19 2023-08-15 北京分音塔科技有限公司 Auxiliary judging method, device, equipment and storage medium
CN116596709B (en) * 2023-07-19 2024-02-06 北京分音塔科技有限公司 Auxiliary judging method, device, equipment and storage medium
CN117035406A (en) * 2023-07-31 2023-11-10 北京华夏电通科技股份有限公司 Intelligent control method, device and equipment for judging flow
CN117009605B (en) * 2023-08-08 2024-04-02 四川大学 Strategic innovation design problem solving method and system
CN117009605A (en) * 2023-08-08 2023-11-07 四川大学 Strategic innovation design problem solving method and system
CN116758947A (en) * 2023-08-14 2023-09-15 北京分音塔科技有限公司 Auxiliary judgment method, device, equipment and storage medium based on audio emotion
CN116758947B (en) * 2023-08-14 2023-10-20 北京分音塔科技有限公司 Auxiliary judgment method, device, equipment and storage medium based on audio emotion
CN117473074A (en) * 2023-11-01 2024-01-30 中国通信建设集团有限公司数智科创分公司 Judicial case intelligent information matching system and method based on artificial intelligence
CN117371916A (en) * 2023-12-05 2024-01-09 智粤铁路设备有限公司 Data processing method based on digital maintenance and intelligent management system for measuring tool
CN117371916B (en) * 2023-12-05 2024-02-23 智粤铁路设备有限公司 Data processing method based on digital maintenance and intelligent management system for measuring tool
CN117609487A (en) * 2024-01-19 2024-02-27 武汉百智诚远科技有限公司 Legal provision quick retrieval method and system based on artificial intelligence
CN117609487B (en) * 2024-01-19 2024-04-09 武汉百智诚远科技有限公司 Legal provision quick retrieval method and system based on artificial intelligence

Also Published As

Publication number Publication date
CN110675288A (en) 2020-01-10
CN110675288B (en) 2023-07-14

Similar Documents

Publication Publication Date Title
WO2021031383A1 (en) Intelligent auxiliary judgment method and apparatus, and computer device and storage medium
CN110704571B (en) Court trial auxiliary processing method, trial auxiliary processing device, equipment and medium
CN110874531B (en) Topic analysis method and device and storage medium
Hassan et al. The quest to automate fact-checking
CN115238101A (en) Multi-engine intelligent question-answering system oriented to multi-type knowledge base
WO2021000497A1 (en) Retrieval method and apparatus, and computer device and storage medium
TWI650719B (en) System and method for evaluating customer service quality from text content
WO2020181808A1 (en) Text punctuation prediction method and apparatus, and computer device and storage medium
WO2013080406A1 (en) Dialog system, redundant message removal method and redundant message removal program
KR102216768B1 (en) System and Method for Analyzing Emotion in Text using Psychological Counseling data
CN106601237A (en) Interactive voice response system and voice recognition method thereof
WO2020010834A1 (en) Faq question and answer library generalization method, apparatus, and device
WO2019232893A1 (en) Method and device for text emotion analysis, computer apparatus and storage medium
CN105912629A (en) Intelligent question and answer method and device
CN105787134B (en) Intelligent answer method, apparatus and system
CN113094578A (en) Deep learning-based content recommendation method, device, equipment and storage medium
CN110309504B (en) Text processing method, device, equipment and storage medium based on word segmentation
CN113569023A (en) Chinese medicine question-answering system and method based on knowledge graph
Bockhorst et al. Predicting self-reported customer satisfaction of interactions with a corporate call center
CN110347812A (en) A kind of search ordering method and system towards judicial style
CN113157887A (en) Knowledge question-answering intention identification method and device and computer equipment
KR20130068624A (en) Apparatus and method for recognizing speech based on speaker group
Wang et al. Automatic dialogue system of marriage law based on the parallel C4. 5 decision tree
CN111145053A (en) Enterprise law consultant management system and method based on artificial intelligence
CN112668284B (en) Legal document segmentation method and system

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 19942209

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 19942209

Country of ref document: EP

Kind code of ref document: A1

32PN Ep: public notification in the ep bulletin as address of the adressee cannot be established

Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM 1205A DATED 16/08/2022)

122 Ep: pct application non-entry in european phase

Ref document number: 19942209

Country of ref document: EP

Kind code of ref document: A1