WO2017092555A1 - Method and device for parsing monetary amounts in a judgment document - Google Patents

Method and device for parsing monetary amounts in a judgment document

Info

Publication number
WO2017092555A1
Authority
WO
WIPO (PCT)
Prior art keywords
amount
paragraph
judgment
court
clause
Prior art date
Application number
PCT/CN2016/105272
Other languages
English (en)
Chinese (zh)
Inventor
胡斌
崔维福
Original Assignee
北京国双科技有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 北京国双科技有限公司
Publication of WO2017092555A1

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00 Handling natural language data
    • G06F40/20 Natural language analysis
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00 Handling natural language data
    • G06F40/20 Natural language analysis
    • G06F40/205 Parsing
    • G06F40/216 Parsing using statistical methods

Definitions

  • the invention relates to the field of amount analysis, in particular to a method and a device for analyzing an amount in a judgment document.
  • A judgment document is a legally binding judicial document that a people's court, exercising the state's judicial power, issues after trying a case. Based on the established facts of the parties' dispute and on the applicable laws, regulations, and judicial interpretations, it rules on the litigation procedure of the case and on the substantive rights and obligations of the parties.
  • The present invention is made to provide a method and a device for parsing amounts in a judgment document that overcome, or at least partially solve, the above problems. They save manpower, automate the extraction of amounts from judgment documents, and improve the correctness of the extracted amounts.
  • the present invention provides a method for parsing an amount in a judgment document, comprising:
  • the present invention provides a device for parsing an amount in a judgment document, comprising:
  • an extracting unit configured to extract, in accordance with a predetermined rule, the amount in each clause of the appeal paragraph and the judgment paragraph;
  • a first summing unit configured to sum the amounts extracted from the clauses of the appeal paragraph and of the judgment paragraph respectively, obtaining the plaintiff's appeal amount and the court's supported amount in the judgment document.
  • The method and device provided by the present invention first segment the judgment document to obtain the plaintiff's appeal paragraph and the court's judgment paragraph, and then split the appeal paragraph and the judgment paragraph into clauses. The amounts appearing in each clause are unified into one form, which simplifies the subsequent calculation, and the amount in each clause of the appeal paragraph and the judgment paragraph is extracted according to the predetermined rule. During extraction, duplicate amounts are eliminated in successive passes, which further verifies the correctness of the extracted amounts.
  • Summing the amounts parsed from the clauses of the appeal paragraph and of the judgment paragraph then yields an accurate parse of the amounts in the judgment document.
  • Compared with the prior art, in which judgment documents express amounts in many different forms, the present invention unifies those forms, which saves manpower, automates the extraction of amounts from judgment documents, and improves the correctness of the extracted amounts.
  • FIG. 1 is a schematic flowchart of a method for parsing amounts in a judgment document according to an embodiment of the present invention;
  • FIG. 2 is a schematic flowchart of another method for parsing amounts in a judgment document according to an embodiment of the present invention;
  • FIG. 3 is a schematic structural diagram of a device for parsing amounts in a judgment document according to an embodiment of the present invention;
  • FIG. 4 is a schematic structural diagram of another device for parsing amounts in a judgment document according to an embodiment of the present invention.
  • An embodiment of the invention provides a method for parsing amounts in a judgment document. As shown in FIG. 1, the method includes:
  • A judgment document records the trial process and the outcome reached by a people's court. It is the carrier of the outcome of the lawsuit and the only evidence by which the people's court determines and assigns the rights and obligations of the parties.
  • The judgment document is segmented.
  • The plaintiff's appeal paragraph is intercepted with identifier one as its head and identifier two as its tail, where identifier one is "the original" or a variant of it and identifier two is a line-break symbol. The court's judgment paragraph is intercepted with identifier three as its head and identifier four as its tail, where identifier three is "the judgment is as follows" or a variant of it and identifier four is "the case" or a variant of it.
  • The plaintiff's appeal paragraph records the amount the plaintiff claims, and the court's judgment paragraph records the amount the court supports.
  • In this way the content of the judgment document is divided into two parts: the plaintiff's and the court's.
  • The amounts in these two parts are then extracted to obtain the relevant amounts in each part.
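The segmentation step above can be sketched as follows. This is a minimal illustration rather than the patent's implementation: the helper name `extract_between`, the sample text, and the English marker strings are hypothetical stand-ins for identifiers one to four, whose exact wording the embodiment leaves open.

```python
def extract_between(text, head, tail):
    """Return the text from `head` up to (not including) the next `tail`.

    Returns None when `head` is absent, and runs to the end of the text
    when `tail` does not occur after `head`.
    """
    start = text.find(head)
    if start == -1:
        return None
    end = text.find(tail, start + len(head))
    return text[start:] if end == -1 else text[start:end]


# Hypothetical markers standing in for identifiers one to four.
APPEAL_HEAD, APPEAL_TAIL = "The plaintiff claims", "\n"
JUDGMENT_HEAD, JUDGMENT_TAIL = "The judgment is as follows", "This case"

document = (
    "The plaintiff claims: compensation of 3000 yuan.\n"
    "The court finds the facts established.\n"
    "The judgment is as follows: the defendant pays 2100 yuan. "
    "This case acceptance fee is 50 yuan.\n"
)

appeal_paragraph = extract_between(document, APPEAL_HEAD, APPEAL_TAIL)
judgment_paragraph = extract_between(document, JUDGMENT_HEAD, JUDGMENT_TAIL)
```

Searching for the tail only after the head keeps an earlier occurrence of the tail marker (for example, a line break before the appeal paragraph) from cutting the paragraph short.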
  • The appeal paragraph and the judgment paragraph may be split into clauses at line breaks, periods, or semicolons.
  • The appeal paragraph and the judgment paragraph are first split at line breaks. If a piece obtained this way contains a period, it is further split at the period; if a piece obtained by splitting at periods contains a semicolon, it is further split at the semicolon. This continues until the appeal paragraph and the judgment paragraph are divided into multiple clauses.
  • Specifically, the amount in each clause is first normalized according to a predetermined rule into a preset standard form. The embodiment does not restrict the preset standard form, which may be chosen as required; for example, if Arabic numerals are chosen as the standard form, every normalized amount is expressed in Arabic numerals. The normalized amounts in each clause are then deduplicated, and the deduplicated amounts are extracted.
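A minimal sketch of this three-level split is shown below. The function name `split_clauses` is hypothetical, and two details are assumptions not stated in the text: both ASCII and full-width punctuation are treated as separators, and a period followed by a digit is treated as a decimal point rather than a sentence end.

```python
import re


def split_clauses(paragraph):
    """Split a paragraph at line breaks, then periods, then semicolons."""
    clauses = []
    for line in paragraph.splitlines():
        # Assumption: a period directly followed by a digit is a decimal point.
        for sentence in re.split(r"。|\.(?!\d)", line):
            for clause in re.split(r"[;；]", sentence):
                clause = clause.strip()
                if clause:  # drop the empty pieces left by trailing punctuation
                    clauses.append(clause)
    return clauses


clauses = split_clauses("A pays 1000 yuan; B pays 20.5 yuan.\nTotal 1020.5 yuan.")
```

Nesting the three splits mirrors the order given in the description, so a clause never spans a line break or a sentence boundary.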
  • Deduplicating the normalized amounts in each clause and extracting the result specifically means removing the duplicate amounts among the amounts normalized in that clause. For example, in "the compensation for the victim Zhang San's medical expenses is 1,000 yuan, the labor insurance fee is 2,000 yuan, and the total compensation is 3,000 yuan", the first two amounts, 1,000 yuan and 2,000 yuan, add up to 3,000 yuan. They therefore duplicate the third amount, and the first two amounts need to be excluded.
  • The amounts extracted in step 103 from the appeal paragraph and from the judgment paragraph are summed separately, which gives the plaintiff's appeal amount and the court's supported amount in the judgment document; both amounts are then recorded.
  • The method provided by this embodiment first segments the judgment document to obtain the plaintiff's appeal paragraph and the court's judgment paragraph, and then splits the appeal paragraph and the judgment paragraph into clauses.
  • Extracting the amount in each clause of the appeal paragraph and the judgment paragraph according to the predetermined rule allows the amounts in the judgment document to be parsed accurately.
  • Compared with the prior art, in which judgment documents express amounts in many different forms, the present invention unifies those forms, which saves manpower, automates the extraction of amounts from judgment documents, and improves the correctness of the extracted amounts.
  • An embodiment of the present invention provides another method for parsing amounts in a judgment document. As shown in FIG. 2, the method includes:
  • The plaintiff's appeal paragraph is intercepted with identifier one as its head and identifier two as its tail.
  • Identifier one is "the original" or a variant of it, and identifier two is a line-break symbol.
  • Identifiers one and two are keywords that identify the paragraph containing the plaintiff's claim.
  • The embodiment of the present invention does not restrict these keywords; depending on the actual judgment documents, any keyword that marks the plaintiff's claim may be used as the identifier.
  • The court's judgment paragraph is intercepted with identifier three as its head and identifier four as its tail.
  • Identifier three is "the judgment is as follows" or a variant of it.
  • Identifier four is "the case" or a variant of it.
  • Identifiers three and four are keywords that identify the paragraph containing the court's decision.
  • The embodiment of the present invention does not restrict these keywords; depending on the actual judgment documents, any keyword that marks the court's decision may be used as the identifier.
  • Specifically, this includes:
  • Amounts written in Chinese capital numerals in each clause are normalized into the preset standard form.
  • This step may include:
  • Each clause is processed with a word segmentation technique to obtain a plurality of words.
  • The amount string is divided into several amount segments at its unit words. In the example above, "thousand" and "ten" are the unit words, so the amount string is divided into two segments: "one thousand" and the remainder that ends in "ten".
  • The Arabic values corresponding to the amount segments are summed to obtain the amount involved in each clause.
  • Clauses may also contain other unit words, such as billions, ten thousands, hundreds, ones, and cents; these can be processed in the same way, and this application does not limit them.
  • When the amount string is divided at its unit words, the method further checks whether the string contains several consecutive unit words. If it does not, processing proceeds as described above. If it does, the string is divided at the last unit word of each run of consecutive unit words. For an amount segment that contains several consecutive unit words, the Arabic value is then calculated from the amount value and the amount units.
  • The calculation described above is applied recursively to the consecutive unit words, in order from left to right.
  • The Arabic value corresponding to each unit word is calculated in turn until the value for the last unit word is obtained, and that value is taken as the final Arabic value of the amount segment.
  • In this way, amounts written in Chinese capital numerals can be accurately normalized into amounts in standard Arabic numerals, which satisfies the requirements of diversity and accuracy in amount extraction.
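The normalization of Chinese numerals can be illustrated with the sketch below. It is not the patent's segment-by-segment procedure: it uses a common single-pass accumulator that gives the same result for well-formed amounts, including consecutive unit words such as 千万. The function name `chinese_to_int`, the character tables, and the sample inputs are assumptions made for illustration, and currency words such as 元 are assumed to have been stripped beforehand.

```python
DIGITS = {
    "零": 0, "〇": 0, "一": 1, "壹": 1, "二": 2, "贰": 2, "两": 2,
    "三": 3, "叁": 3, "四": 4, "肆": 4, "五": 5, "伍": 5,
    "六": 6, "陆": 6, "七": 7, "柒": 7, "八": 8, "捌": 8, "九": 9, "玖": 9,
}
SMALL_UNITS = {"十": 10, "拾": 10, "百": 100, "佰": 100, "千": 1000, "仟": 1000}
WAN, YI = 10**4, 10**8
BIG_UNITS = {"万": WAN, "萬": WAN, "亿": YI, "億": YI}


def chinese_to_int(text):
    """Convert a Chinese numeral string (ordinary or capital form) to an int."""
    total = 0    # value already closed off by 万 or 亿
    section = 0  # value accumulated since the last 万 or 亿
    digit = 0    # most recent digit not yet multiplied by a unit
    for ch in text:
        if ch in DIGITS:
            digit = DIGITS[ch]
        elif ch in SMALL_UNITS:
            # A bare unit such as 十 in 十五 counts as one times the unit.
            section += (digit or 1) * SMALL_UNITS[ch]
            digit = 0
        elif ch in BIG_UNITS:
            section += digit
            digit = 0
            if BIG_UNITS[ch] == YI:
                # 亿 scales everything read so far (handles 万亿).
                total = (total + section) * YI
            else:
                # 万 scales only the current section (handles 千万).
                total += section * WAN
            section = 0
        else:
            raise ValueError(f"unsupported character: {ch!r}")
    return total + section + digit
```

For example, `chinese_to_int("壹仟零壹拾")` evaluates the two parts 壹仟 and 零壹拾 as 1000 and 10 and returns their sum, 1010.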
  • Clauses that involve a proportional amount can be identified by keyword recognition. For example, in the clause "the damage amounts to 3,000 yuan, of which A should bear 70%", recognizing the keyword "bear" marks the clause as involving a proportional amount, and the amount 2,100 is then computed from the values 3,000 and 70%.
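The proportional case can be sketched as below, assuming the amount has already been normalized to Arabic numerals followed by "yuan" and the share is written with a percent sign. The function name `proportional_amount`, the regular expressions, and the English keyword are illustrative assumptions.

```python
import re
from decimal import Decimal

AMOUNT_RE = re.compile(r"(\d[\d,]*(?:\.\d+)?)\s*yuan")
PERCENT_RE = re.compile(r"(\d+(?:\.\d+)?)\s*%")


def proportional_amount(clause, keyword="bear"):
    """Return amount x percentage for a proportional clause, else None."""
    if keyword not in clause:
        return None
    amount = AMOUNT_RE.search(clause)
    percent = PERCENT_RE.search(clause)
    if amount is None or percent is None:
        return None
    base = Decimal(amount.group(1).replace(",", ""))
    # Decimal arithmetic avoids binary floating-point rounding on money.
    return base * Decimal(percent.group(1)) / Decimal(100)


share = proportional_amount("the damage is 3,000 yuan, A should bear 70%")
```

Here `share` equals `Decimal("2100")`, matching the 3,000 and 70% example in the text.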
  • Clauses that involve a deducted amount can be identified by keyword recognition. For example, in the clause "deduct the 1,000 yuan previously paid", recognizing the keyword "deduct" marks the clause as involving a deduction, and the value 1000 is then normalized into a negative value, that is, -1000.
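A minimal sketch of the deduction rule follows, under the same assumptions as before: amounts are already in Arabic numerals followed by "yuan", and the keyword is the English stand-in "deduct". The function name `signed_amount` is hypothetical.

```python
import re
from decimal import Decimal

AMOUNT_RE = re.compile(r"(\d[\d,]*(?:\.\d+)?)\s*yuan")


def signed_amount(clause, keyword="deduct"):
    """Return the clause's first amount, negated when the clause is a deduction."""
    match = AMOUNT_RE.search(clause)
    if match is None:
        return None
    value = Decimal(match.group(1).replace(",", ""))
    return -value if keyword in clause else value


deducted = signed_amount("deduct the 1,000 yuan previously paid")
```

Storing the deduction as a negative value means the later summation step needs no special case for it.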
  • The amounts normalized in each clause are added up cumulatively starting from the first amount, and each running sum is compared with the next amount. If the sum of the first two amounts equals the third amount, the first two amounts are cleared and the third amount is retained. The same is then done starting from the second amount, adding the second and third amounts and comparing, and so on, until all amounts in the clause have been processed; the amounts that remain are kept.
  • In other words, the total of the amounts in each clause is retained and the individual amounts that make up that total are cleared to zero, so that parsing yields a single total amount per clause.
  • This avoids counting the same amount twice and ensures the accuracy of the extracted amount.
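The deduplication pass can be sketched as follows. This is an interpretation of the description, not a confirmed implementation: instead of clearing itemized amounts to zero it records their positions and drops them, and the function name `drop_itemized_amounts` is hypothetical.

```python
def drop_itemized_amounts(amounts):
    """Remove runs of amounts whose sum equals the amount that follows them."""
    dropped = set()
    for start in range(len(amounts)):
        running = amounts[start]
        # Extend the run one amount at a time; a run has at least two items.
        for end in range(start + 1, len(amounts) - 1):
            running += amounts[end]
            if running == amounts[end + 1]:
                # The run start..end is itemized detail of the next amount.
                dropped.update(range(start, end + 1))
                break
    return [a for i, a in enumerate(amounts) if i not in dropped]


kept = drop_itemized_amounts([1000, 2000, 3000])
clause_total = sum(kept)
```

With the medical-expense example, `kept` is `[3000]`, so the clause contributes 3,000 rather than 6,000 to the paragraph total. The rule is a heuristic: an amount that happens to equal the sum of the amounts before it is treated as a subtotal even if it is an independent item.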
  • A group of documents in this step refers to a group of judgment documents of the same type. For each judgment document in the group, the plaintiff's appeal amount and the court's supported amount are obtained through the above steps.
  • These amounts are then summed across the group to obtain the total plaintiff's appeal amount and the total court's supported amount.
  • The court support ratio equals the court's supported amount divided by the plaintiff's appeal amount.
  • With the total court's supported amount and the total plaintiff's appeal amount obtained in step 207, the total court's supported amount is divided by the total plaintiff's appeal amount.
  • The result is the court support ratio for the group of judgment documents.
  • Alternatively, the court support ratio of each judgment document can be calculated, and the ratios of all judgment documents in the group can be averaged to obtain the average court support ratio of the group, which provides another summary indicator.
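Both summary indicators can be sketched as below. The function name `support_ratios` and the sample figures are made up for illustration, and each document is assumed to have a non-zero appeal amount (a zero amount would raise `ZeroDivisionError`).

```python
from fractions import Fraction


def support_ratios(documents):
    """Compute group-level support ratios.

    `documents` is a list of (appeal_amount, supported_amount) pairs, one
    per judgment document. Returns (overall_ratio, average_ratio).
    """
    total_appeal = sum(appeal for appeal, _ in documents)
    total_supported = sum(supported for _, supported in documents)
    # Ratio of the totals across the whole group.
    overall = Fraction(total_supported, total_appeal)
    # Mean of the per-document ratios.
    per_document = [Fraction(supported, appeal) for appeal, supported in documents]
    average = sum(per_document) / len(per_document)
    return overall, average


overall, average = support_ratios([(3000, 2100), (1000, 1000)])
```

The two indicators generally differ: the ratio of totals weights each document by its claimed amount, while the average of ratios weights every document equally. In this sample they are 31/40 and 17/20.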
  • The amounts appearing in each clause of the judgment document are unified into one form, which simplifies the subsequent calculation, and duplicate amounts are eliminated in successive passes during extraction. This further verifies the correctness of the extracted amounts and yields an accurate parse of the plaintiff's appeal amount and the court's supported amount in the judgment document.
  • An embodiment of the present invention provides a device for parsing amounts in a judgment document. The device embodiment corresponds to the foregoing method embodiment; for ease of reading, it does not repeat the details of the method embodiment one by one.
  • It should be understood, however, that the device in this embodiment can implement everything in the foregoing method embodiment.
  • The device includes: an obtaining unit 31, a clause unit 32, an extracting unit 33, and a first summing unit 34.
  • The obtaining unit 31 is configured to segment the judgment document to obtain the plaintiff's appeal paragraph and the court's judgment paragraph;
  • the clause unit 32 is configured to split the appeal paragraph and the judgment paragraph into clauses;
  • the extracting unit 33 is configured to extract the amount in each clause of the appeal paragraph and the judgment paragraph according to a predetermined rule;
  • the first summing unit 34 is configured to sum the amounts extracted from the clauses of the appeal paragraph and of the judgment paragraph respectively, to obtain the plaintiff's appeal amount and the court's supported amount in the judgment document.
  • The device provided by this embodiment first segments the judgment document to obtain the plaintiff's appeal paragraph and the court's judgment paragraph, and then splits the appeal paragraph and the judgment paragraph into clauses.
  • Extracting the amount in each clause of the appeal paragraph and the judgment paragraph according to the predetermined rule allows the amounts in the judgment document to be parsed accurately.
  • Compared with the prior art, in which judgment documents express amounts in many different forms, the present invention unifies those forms, which saves manpower, automates the extraction of amounts from judgment documents, and improves the correctness of the extracted amounts.
  • An embodiment of the present invention provides another device for parsing amounts in a judgment document. The device embodiment corresponds to the foregoing method embodiment; for ease of reading, it does not repeat the details of the method embodiment one by one.
  • It should be understood, however, that the device in this embodiment can implement everything in the foregoing method embodiment.
  • The device further includes: a second summing unit 35 and a ratio calculation unit 36.
  • The second summing unit 35 is configured to traverse each judgment document in a group of documents and to sum the plaintiff's appeal amounts and the court's supported amounts of the judgment documents respectively, obtaining the total plaintiff's appeal amount and the total court's supported amount;
  • the ratio calculation unit 36 is configured to divide the total court's supported amount by the total plaintiff's appeal amount to obtain the court support ratio.
  • The obtaining unit 31 includes:
  • a first intercepting module configured to intercept the plaintiff's appeal paragraph with identifier one as its head and identifier two as its tail, where identifier one is "the original" or a variant of it and identifier two is a line-break symbol;
  • a second intercepting module configured to intercept the court's judgment paragraph with identifier three as its head and identifier four as its tail,
  • where identifier three is "the judgment is as follows" or a variant of it and identifier four is "the case" or a variant of it.
  • clause unit 32 includes:
  • the extracting unit 33 includes:
  • a normalization module configured to normalize the amount in each clause according to a predetermined rule into a preset standard form;
  • an extraction module configured to deduplicate the normalized amounts in each clause and to extract the deduplicated amounts.
  • The normalization module is specifically configured as follows:
  • Each clause is processed with a word segmentation technique to obtain a plurality of words.
  • For example, word segmentation of the clause "the court compensates the plaintiff for a thousand yuan and a whole yuan" yields the following words: the court, compensates, the plaintiff, one thousand and zero, yuan, whole;
  • The amount string is divided into several amount segments at its unit words. In the example above, "thousand" and "ten" are the unit words, so the amount string is divided into two segments: "one thousand" and the remainder that ends in "ten".
  • Clauses may also contain other unit words, such as billions, ten thousands, hundreds, ones, and cents; these can be processed in the same way, and this application does not limit them.
  • The amount string may also contain amounts with two consecutive unit words, such as "trillion", "billion", and "ten million". Therefore, building on the above, when the amount string is divided at its unit words, the module further checks whether the string contains several consecutive unit words. If it does not, processing proceeds as described above. If it does, the string is divided at the last unit word of each run of consecutive unit words.
  • For an amount segment that contains several consecutive unit words, the Arabic value is calculated from the amount value and the amount units by applying the calculation described above recursively to the consecutive unit words, in order from left to right.
  • The Arabic value corresponding to each unit word is calculated in turn until the value for the last unit word is obtained, and that value is taken as the final Arabic value of the amount segment.
  • In this way, amounts written in Chinese capital numerals can be accurately normalized into amounts in standard Arabic numerals, which satisfies the requirements of diversity and accuracy in amount extraction.
  • Clauses that involve a proportional amount can be identified by keyword recognition. For example, in the clause "the damage amounts to 3,000 yuan, of which A should bear 70%", recognizing the keyword "bear" marks the clause as involving a proportional amount, and the amount 2,100 is then computed from the values 3,000 and 70%.
  • An amount that is deducted under a deduction relationship in a clause is normalized into a negative value in the preset standard form.
  • Clauses that involve a deducted amount can be identified by keyword recognition. For example, in the clause "deduct the 1,000 yuan previously paid", recognizing the keyword "deduct" marks the clause as involving a deduction, and the value 1000 is then normalized into a negative value, that is, -1000.
  • The amounts appearing in each clause of the judgment document are unified into one form, which simplifies the subsequent calculation, and duplicate amounts are eliminated in successive passes during extraction. This further verifies the correctness of the extracted amounts and yields an accurate parse of the plaintiff's appeal amount and the court's supported amount in the judgment document.
  • The device for parsing amounts in a judgment document includes a processor and a memory. The obtaining unit 31, the clause unit 32, the extracting unit 33, the first summing unit 34, and the other units described above are all stored in the memory as program units, and the processor executes these program units stored in the memory to implement the corresponding functions.
  • The processor contains a kernel, and the kernel retrieves the corresponding program unit from the memory.
  • One or more kernels may be provided.
  • The memory may include non-persistent memory, random access memory (RAM), and/or non-volatile memory in a computer readable medium, such as read only memory (ROM) or flash memory (flash RAM); the memory includes at least one memory chip.
  • The present application also provides a computer program product which, when executed on a data processing device, is adapted to execute program code initialized with the following method steps: segmenting a judgment document to obtain the plaintiff's appeal paragraph and the court's judgment paragraph; splitting the appeal paragraph and the judgment paragraph into clauses; extracting the amount in each clause of the appeal paragraph and the judgment paragraph according to a predetermined rule; and summing the amounts parsed from the clauses of the appeal paragraph and of the judgment paragraph respectively, to obtain the plaintiff's appeal amount and the court's supported amount.
  • embodiments of the present application can be provided as a method, system, or computer program product.
  • the present application can take the form of an entirely hardware embodiment, an entirely software embodiment, or an embodiment in combination of software and hardware.
  • the application can take the form of a computer program product embodied on one or more computer-usable storage media (including but not limited to disk storage, CD-ROM, optical storage, etc.) including computer usable program code.
  • The computer program instructions can also be stored in a computer readable memory that can direct a computer or other programmable data processing device to operate in a particular manner, such that the instructions stored in the computer readable memory produce an article of manufacture comprising an instruction device. The instruction device implements the functions specified in one or more flows of the flowchart and/or one or more blocks of the block diagram.
  • These computer program instructions can also be loaded onto a computer or other programmable data processing device, such that a series of operational steps is performed on the computer or other programmable device to produce computer-implemented processing. The instructions executed on the computer or other programmable device thus provide steps for implementing the functions specified in one or more flows of the flowchart and/or one or more blocks of the block diagram.
  • a computing device includes one or more processors (CPUs), input/output interfaces, network interfaces, and memory.
  • the memory may include non-persistent memory, random access memory (RAM), and/or non-volatile memory in a computer readable medium, such as read only memory (ROM) or flash memory.
  • Memory is an example of a computer readable medium.
  • Computer readable media includes both permanent and non-persistent, removable and non-removable media.
  • Information storage can be implemented by any method or technology.
  • the information can be computer readable instructions, data structures, modules of programs, or other data.
  • Examples of computer storage media include, but are not limited to, phase change memory (PRAM), static random access memory (SRAM), dynamic random access memory (DRAM), other types of random access memory (RAM), read only memory (ROM), electrically erasable programmable read only memory (EEPROM), flash memory or other memory technology, compact disk read only memory (CD-ROM), digital versatile disk (DVD) or other optical storage, magnetic tape cartridges, magnetic tape storage or other magnetic storage devices, or any other non-transmission media that can be used to store information accessible by a computing device.
  • As defined herein, computer readable media does not include transitory computer readable media, such as modulated data signals and carrier waves.

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Health & Medical Sciences (AREA)
  • Artificial Intelligence (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Computational Linguistics (AREA)
  • General Health & Medical Sciences (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Probability & Statistics with Applications (AREA)
  • Machine Translation (AREA)

Abstract

The present invention relates to a method and a device for parsing monetary amounts in a judgment document, in the field of amount analysis. The method comprises: first segmenting a judgment document to obtain the plaintiff's appeal paragraph and the court's judgment paragraph (101); then splitting the appeal paragraph and the judgment paragraph into clauses (102); extracting the amount in each clause of the appeal paragraph and the judgment paragraph according to a predetermined rule (103); and summing the amounts extracted from the clauses of the appeal paragraph and of the judgment paragraph respectively, to obtain the plaintiff's appeal amount and the court's supported amount in the judgment document (104). The method is mainly used to extract amounts from judgment documents.
PCT/CN2016/105272 2015-12-01 2016-11-10 Method and device for parsing monetary amounts in a judgment document WO2017092555A1 (fr)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201510867476.XA CN106815203B (zh) 2015-12-01 2015-12-01 一种裁判文书中的金额解析方法及装置
CN201510867476.X 2015-12-01

Publications (1)

Publication Number Publication Date
WO2017092555A1 (fr) 2017-06-08

Family

ID=58796238

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2016/105272 WO2017092555A1 (fr) 2015-12-01 2016-11-10 Method and device for parsing monetary amounts in a judgment document

Country Status (2)

Country Link
CN (1) CN106815203B (fr)
WO (1) WO2017092555A1 (fr)

Cited By (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110046345A (zh) * 2019-03-12 2019-07-23 同盾控股有限公司 一种数据提取方法和装置
CN110765889A (zh) * 2019-09-29 2020-02-07 平安直通咨询有限公司上海分公司 法律文书的特征提取方法、相关装置及存储介质
CN111144095A (zh) * 2019-11-26 2020-05-12 方正璞华软件(武汉)股份有限公司 一种工伤案件裁决书的生成方法及装置
CN111507095A (zh) * 2019-01-29 2020-08-07 阿里巴巴集团控股有限公司 裁判文书的生成方法、装置、存储介质和处理器
CN111798344A (zh) * 2020-07-01 2020-10-20 北京金堤科技有限公司 主体名称确定方法和装置、电子设备和存储介质
CN112307726A (zh) * 2020-11-09 2021-02-02 浙江大学 因果去偏差模型指导的法院观点自动生成方法
CN112632941A (zh) * 2019-09-23 2021-04-09 北京国双科技有限公司 生成pdf格式公安文书的方法、装置、设备及存储介质
CN113010684A (zh) * 2020-12-31 2021-06-22 北京法意科技有限公司 民事诉判图谱的构建方法及系统
CN113343661A (zh) * 2021-06-28 2021-09-03 福建师范大学 一种减刑假释文书自动生成方法及装置
CN113901125A (zh) * 2021-09-17 2022-01-07 盐城金堤科技有限公司 数据展示方法和装置、及存储介质和电子设备
CN114239561A (zh) * 2021-12-10 2022-03-25 北京天眼查科技有限公司 供应关系获取方法、装置、存储介质及电子设备

Families Citing this family (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108197099A (zh) * 2017-12-01 2018-06-22 厦门快商通信息技术有限公司 一种文本信息提取方法及计算机可读存储介质
CN108287818A (zh) * 2018-01-03 2018-07-17 小草数语(北京)科技有限公司 裁判文书中金额的提取方法、装置和电子设备
CN108984500B (zh) * 2018-06-19 2022-04-29 平安科技(深圳)有限公司 金额信息的提取方法、终端设备及介质
CN110633458A (zh) * 2018-06-25 2019-12-31 阿里巴巴集团控股有限公司 裁判文书的生成方法和生成装置
CN109446511B (zh) * 2018-09-10 2022-07-08 平安科技(深圳)有限公司 裁判文书处理方法、装置、计算机设备和存储介质
CN110378784A (zh) * 2019-07-24 2019-10-25 中国工商银行股份有限公司 金额输入方法及装置
CN110851591A (zh) * 2019-09-17 2020-02-28 河北省讯飞人工智能研究院 一种裁判文书的质量评估方法、装置、设备及存储介质
CN112541344B (zh) * 2019-09-23 2024-07-26 北京国双科技有限公司 目标段落的确定方法、装置、存储介质及设备
CN111008523A (zh) * 2019-11-21 2020-04-14 中科鼎富(北京)科技发展有限公司 一种信息提取方法、装置及服务器
CN111177332B (zh) * 2019-11-27 2023-11-24 中证信用增进股份有限公司 自动提取裁判文书涉案标的和裁判结果的方法及装置
CN112651853A (zh) * 2020-11-17 2021-04-13 四川大学 一种基于裁判文书的论辩观点挖掘方法及系统

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20120084228A1 (en) * 2003-10-15 2012-04-05 Rao Srinivasan N System and method for processing partially unstructured data
CN103077164A (zh) * 2012-12-27 2013-05-01 新浪网技术(中国)有限公司 文本分析方法及文本分析器

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
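CN101312559A (en) * 2007-05-23 2008-11-26 乐金电子(中国)研究开发中心有限公司 Method for managing consumption short messages based on a mobile communication terminal, and mobile communication terminal
CN102682109B (en) * 2012-05-09 2014-07-16 北京彼速信息技术有限公司 Patent information parsing method and device
CN102866990B (en) * 2012-08-20 2016-08-03 北京搜狗信息服务有限公司 Topic dialogue method and device
CN103778200B (en) * 2014-01-09 2017-08-08 中国科学院计算技术研究所 Method and system for extracting message information sources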

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
GAO, XIAOYUN ET AL.: "Chinese Time Words and Numerals Automatic Segmentation Method Based on Rules", NEW TECHNOLOGY OF LIBRARY AND INFORMATION SERVICE, vol. 3, 25 March 2007 (2007-03-25), pages 46 - 50 *
WU, FEI: "Study on Extraction Method of Value Information", ELECTRONIC TECHNOLOGY & INFORMATION SCIENCE, CHINA MASTER'S THESES FULL-TEXT DATABASE, 15 March 2011 (2011-03-15), pages 1138 - 1541, ISSN: 1674-0246 *

Cited By (16)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
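CN111507095B (en) * 2019-01-29 2023-05-02 阿里云计算有限公司 Method and device for generating a judgment document, storage medium, and processor
CN111507095A (en) * 2019-01-29 2020-08-07 阿里巴巴集团控股有限公司 Method and device for generating a judgment document, storage medium, and processor
CN110046345A (en) * 2019-03-12 2019-07-23 同盾控股有限公司 Data extraction method and device
CN112632941A (en) * 2019-09-23 2021-04-09 北京国双科技有限公司 Method, device, equipment, and storage medium for generating public security documents in PDF format
CN110765889A (en) * 2019-09-29 2020-02-07 平安直通咨询有限公司上海分公司 Feature extraction method for legal documents, related device, and storage medium
CN111144095A (en) * 2019-11-26 2020-05-12 方正璞华软件(武汉)股份有限公司 Method and device for generating a ruling document for work-injury cases
CN111144095B (en) * 2019-11-26 2024-04-05 方正璞华软件(武汉)股份有限公司 Method and device for generating a ruling document for work-injury cases
CN111798344A (en) * 2020-07-01 2020-10-20 北京金堤科技有限公司 Method and device for determining a subject name, electronic device, and storage medium
CN111798344B (en) * 2020-07-01 2023-09-22 北京金堤科技有限公司 Method and device for determining a subject name, electronic device, and storage medium
CN112307726B (en) * 2020-11-09 2023-08-04 浙江大学 Method for automatically generating court views guided by a causal debiasing model
CN112307726A (en) * 2020-11-09 2021-02-02 浙江大学 Method for automatically generating court views guided by a causal debiasing model
CN113010684A (en) * 2020-12-31 2021-06-22 北京法意科技有限公司 Method and system for constructing a civil claim-and-judgment graph
CN113010684B (en) * 2020-12-31 2024-02-09 北京法意科技有限公司 Method and system for constructing a civil claim-and-judgment graph
CN113343661A (en) * 2021-06-28 2021-09-03 福建师范大学 Method and device for automatically generating commutation and parole documents
CN113901125A (en) * 2021-09-17 2022-01-07 盐城金堤科技有限公司 Data display method and device, storage medium, and electronic device
CN114239561A (en) * 2021-12-10 2022-03-25 北京天眼查科技有限公司 Supply relationship acquisition method and device, storage medium, and electronic device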

Also Published As

Publication number Publication date
CN106815203A (zh) 2017-06-09
CN106815203B (zh) 2021-03-30

Similar Documents

Publication Publication Date Title
WO2017092555A1 (en) Method and device for analyzing an amount of money in a judgment document
CN111291570B Method and device for identifying elements in judicial documents
WO2019223103A1 (en) Text similarity acquisition method and apparatus, terminal device, and medium
WO2020052184A1 (en) Judgment document processing method and apparatus, computer device, and storage medium
WO2017167067A1 (en) Method and device for web page text classification, and method and device for web page text recognition
WO2019237540A1 (en) Financial data acquisition method and device, terminal device, and medium
WO2017092337A1 (en) Comment tag extraction method and apparatus
TWI689825B Method and device for obtaining a document quality index
CN110738039B Method and device for prompting case auxiliary information, storage medium, and server
WO2019080402A1 (en) Text information extraction method for structured text, storage medium, and server
WO2019242124A1 (en) Amount information extraction method and apparatus, terminal device, and medium
CN109471933A Text summary generation method, storage medium, and server
CN109684476B Text classification method, text classification device, and terminal device
CN110321466B Method and system for duplicate checking of securities information based on semantic analysis
CN109101489B Automatic text summarization method and device, and electronic device
CN110019820B Method for detecting temporal consistency of symptoms between the chief complaint and the history of present illness in medical records
CN111291177A Information processing method and device, and computer storage medium
CN108733816A Method for detecting burst events on microblogs
CN112527958A User behavior tendency identification method, device, equipment, and storage medium
CN105631018A Article feature extraction method based on a topic model
CN112329460A Text topic clustering method, device, equipment, and storage medium
CN102880631A Chinese author identification method and device based on a two-layer classification model
CN106598997B Method and device for calculating the degree of topic membership of a text
WO2022105178A1 (en) Keyword extraction method and related device
CN104615728B Web page body text extraction method and device

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application Ref document number: 16869865; Country of ref document: EP; Kind code of ref document: A1
NENP Non-entry into the national phase Ref country code: DE
122 Ep: pct application non-entry in european phase Ref document number: 16869865; Country of ref document: EP; Kind code of ref document: A1