CN110289066A - A kind of checking method and system of forensic identification report - Google Patents

A kind of checking method and system of forensic identification report Download PDF

Info

Publication number
CN110289066A
CN110289066A CN201910689439.2A CN201910689439A CN110289066A CN 110289066 A CN110289066 A CN 110289066A CN 201910689439 A CN201910689439 A CN 201910689439A CN 110289066 A CN110289066 A CN 110289066A
Authority
CN
China
Prior art keywords
examination
characteristic
report
model
case
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201910689439.2A
Other languages
Chinese (zh)
Inventor
张银柱
王红专
郭军红
吴益真
王广峰
李翠翠
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Henan Kaihe Software Technology Co Ltd
Original Assignee
Henan Kaihe Software Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Henan Kaihe Software Technology Co Ltd filed Critical Henan Kaihe Software Technology Co Ltd
Priority to CN201910689439.2A priority Critical patent/CN110289066A/en
Publication of CN110289066A publication Critical patent/CN110289066A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/279Recognition of textual entities
    • G06F40/284Lexical analysis, e.g. tokenisation or collocates
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/279Recognition of textual entities
    • G06F40/289Phrasal analysis, e.g. finite state techniques or chunking
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16HHEALTHCARE INFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR THE HANDLING OR PROCESSING OF MEDICAL OR HEALTHCARE DATA
    • G16H15/00ICT specially adapted for medical reports, e.g. generation or transmission thereof

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Health & Medical Sciences (AREA)
  • General Health & Medical Sciences (AREA)
  • Computational Linguistics (AREA)
  • Artificial Intelligence (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Primary Health Care (AREA)
  • Public Health (AREA)
  • Medical Informatics (AREA)
  • Epidemiology (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

The invention discloses the checking methods and system of a kind of report of forensic identification, are related to extraction of semantics and analysis field;Its method includes step 1: being extracted in database after the multidimensional characteristic of case legal medical expert's probation report using NLP, multidimensional characteristic segmentation and fitting are carried out using the HMM model that the multistage Markov based on word frequency distribution and synonym analysis is assumed, obtains and stores characteristic model;Step 2: after inputting pending report, assignment options being examined based on interactive unit selection, characteristic model progress formality examination and damage location is called to examine;Step 3: the examination result examined according to formality examination and damage location generates and shows that legal medical expert's Identification and investigation is reported.The present invention utilizes the HMM model assumed in medical jurisprudence the multistage Markov of generic term synonym and context-sensitive characteristic building by building, characteristic model is established, figure in the law circle is helped quickly and accurately to examine flaw present in probation report and mistake.

Description

A kind of checking method and system of forensic identification report
Technical field
The present invention relates to extraction of semantics and analysis field, the especially a kind of checking method and system of forensic identification report.
Background technique
Under the background of current reform of the judicial system iterative method, a large amount of survey reports, expert opinion, assessment report and specially Professional, scientific and objectivity the requirement of family's examination reports is higher and higher, and procurator, lawyer and judge are to such The examination consciousness and depth of evidence are constantly reinforced.But such opinion or report still rely on investigation appraiser with this profession art Language is write using all kinds of text edit softwares, exists and searches and manage bring time-consuming by artificial because of cited data source Long, the problem of process is many and diverse, low efficiency.For example, the figure in the law circle that Yunnan Province handles drug-related case is mostly to be related to the life of drugs Object chemistry expert can identify case sample of being involved in drug traffic, and write probation report, but interior ground province method using softwares such as Word Rule personage does not have related drugs identification technical ability, so needing to rely on social appraisal organization provides probation report.Law attainment, prison Superintending and directing many reasons such as management causes the conclusion of social appraisal organization there is certain flaw and mistake, for example identification item is not complete The relevance in face, identification and case is not strong etc.;Meanwhile there are following characteristics for forensic identification report: term is that clause is special in content Sign, Similar content is more, and length is longer, causes semantic analysis accuracy rate low;Therefore a kind of checking method and system, accurate language are needed Justice is analyzed, the mistake in examination report, completes the report identification of high accuracy.
Summary of the invention
It is an object of the invention to: the present invention provides the checking methods and system of a kind of report of forensic identification, help method Rule personage quickly and accurately examines flaw present in probation report and mistake.
The technical solution adopted by the invention is as follows:
A kind of checking method of forensic identification report, includes the following steps:
Step 1: after the multidimensional characteristic using case legal medical expert's probation report in NLP extraction database, using word-based frequency division The HMM model that cloth and the multistage Markov of synonym analysis are assumed carries out multidimensional characteristic segmentation and fitting, obtains and stores spy Levy model;
Step 2: after inputting pending report, assignment options being examined based on interactive unit selection, characteristic model is called to carry out lattice Formula examines and damage location examines;
Step 3: the examination result examined according to formality examination and damage location generates and shows legal medical expert's Identification and investigation report It accuses.
Preferably, the step 1 includes the following steps:
Step 1.1: the multidimensional characteristic of body matter in case legal medical expert probation report, the multidimensional characteristic are extracted using NLP Including case description, medicolegal examination, analytic explanation and expert's conclusion;
Step 1.2: case property, case plot, case time of origin and the case duration data in case description are extracted, The HMM model assumed using the multistage Markov based on word frequency distribution is formed to that can be split with term in case description Serialize feature, that is, word frequency distribution and relevance data;
Step 1.3: the term in medicolegal examination is extracted, using based on the more of the word frequency distribution after synonym analysis corrections The HMM model that rank Markov is assumed, is identified and is marked to the term in medicolegal examination;Using OCR technique to legal medical expert The image data learned in examining is identified, is marked, and checks image data format, and analytical analysis illustrates to obtain a plurality of term Table, composition sequence feature;
Step 1.4: after expert's conclusion is carried out token segmentation, the high template library of matching similarity, and it is stored in the mould Plate library;
Step 1.5: the resulting serializing feature of fit procedure 1.2-1.4 and data composition characteristic model.
Preferably, formality examination includes the following steps: in the step 2
Step a1: acquiring the body matter and attachment content of pending report, and body matter includes title, introduction, medical jurisprudence Inspection, analytic explanation, expert's conclusion, surveyor and appraisal organization;Attachment content includes physical examination, presentation content, operation note Record, medicolegal examination and medical jurisprudence image data;
Step a2: body matter and attachment content are serialized, and memory headroom is stored in the form of characteristic, are called Characteristic model is compared and analyzes.
Preferably, damage location examination includes the following steps: in the step 2
Step b1: acquiring the attachment content in pending report, and the attachment content includes physical examination, presentation content, hand Art record, medicolegal examination, medical jurisprudence image data;
Step b2: characteristic model is called, damage location is examined using knowledge mapping therein.
A kind of auditing system of forensic identification report, including
Database, for storing case legal medical expert probation report, characteristic model and knowledge mapping;
Model foundation unit makes for extracting the corresponding multidimensional characteristic of case legal medical expert's probation report in database using NLP Multidimensional characteristic segmentation and fitting are carried out with the HMM model that the multistage Markov based on word frequency distribution and synonym analysis is assumed, Construction feature model;
Interactive unit examines assignment options for inputting pending report and selection, and shows that legal medical expert's Identification and investigation is reported;
It examines unit, after carrying out data processing to pending report, characteristic model is called to carry out formality examination and damage Position examines.
Preferably, the multidimensional characteristic in body matter that the model foundation unit extracts includes case description, medical jurisprudence Inspection, analytic explanation and expert's conclusion.
Preferably, the model foundation unit includes
Case Expressive Features extraction module, the HMM model for using the multistage Markov based on word frequency distribution to assume, To that can be split with term in case description, forms word frequency distribution and relevance data serializes feature;
Medicolegal examination characteristic extracting module, for using the multistage horse based on the word frequency distribution after synonym analysis corrections The HMM model that Er Kefu assumes, is identified and is marked to the term in medicolegal examination;Medical jurisprudence is examined using OCR technique Image data in testing is identified, is marked, and checks image data format, and analytical analysis illustrates to obtain a plurality of nomenclature, group At serializing feature;
Expert's conclusion characteristic extracting module, after carrying out token segmentation, the high template library of matching similarity, and store In the template library;
Feature fitting module, the feature extracted for being fitted all characteristic extracting modules;
The case Expressive Features extraction module, medicolegal examination characteristic extracting module, expert's conclusion characteristic extracting module It is connect respectively with feature fitting module.
Preferably, the examination unit includes that formality examination module and damage location examine module.
Preferably, the formality examination module include the pending report for that will acquire body matter and attachment content into The data processing module of row serializing.
Preferably, the damage location examines that module includes the read module for examining attachment content in pending report.
In conclusion by adopting the above-described technical solution, the beneficial effects of the present invention are:
1. the present invention is examined by characteristic model unified standard, auxiliary personnel in charge of the case carries out electronic authentication, reduces report It accuses because of identification side's bring mistake and error, injury of human degree Identification and investigation is standardized, modularization, personnel in charge of the case can Independent operation completes the judgement to professional accreditation conclusion;
2. assisted review experience is incorporated characteristic model assisted review by the present invention, it is conducive to newly fast into legal medical expert personnel and ordinary people Speed grasps checking method, improves and examines efficiency;
3. the present invention constructs generic term synonym and context-sensitive characteristic using in medical jurisprudence by building Multistage Markov assume HMM model, using based on word frequency distribution multistage Markov assume HMM model to case It can be split with term in description, be assumed using the multistage Markov based on the word frequency distribution after synonym analysis corrections HMM model is identified and is marked to the term in medicolegal examination, and feature extraction precision is greatly improved, to improve legal medical expert's mirror Surely the accuracy of examination reports is reported.
Detailed description of the invention
In order to illustrate the technical solution of the embodiments of the present invention more clearly, below will be to needed in the embodiment attached Figure is briefly described, it should be understood that the following drawings illustrates only certain embodiments of the present invention, therefore is not construed as pair The restriction of range for those of ordinary skill in the art without creative efforts, can also be according to this A little attached drawings obtain other relevant attached drawings.
Fig. 1 is flow chart of the method for the present invention;
Fig. 2 is the block diagram of system of the invention;
Fig. 3 is characteristic model schematic diagram of the invention;
Fig. 4 is examination report schematic diagram of the invention.
Specific embodiment
In order to make the objectives, technical solutions, and advantages of the present invention clearer, with reference to the accompanying drawings and embodiments, right The present invention is further elaborated.It should be appreciated that described herein, specific examples are only used to explain the present invention, not For limiting the present invention, i.e., described embodiments are only a part of the embodiments of the present invention, instead of all the embodiments.It is logical The component for the embodiment of the present invention being often described and illustrated herein in the accompanying drawings can be arranged and be designed with a variety of different configurations.
Therefore, the detailed description of the embodiment of the present invention provided in the accompanying drawings is not intended to limit below claimed The scope of the present invention, but be merely representative of selected embodiment of the invention.Based on the embodiment of the present invention, those skilled in the art Member's every other embodiment obtained without making creative work, shall fall within the protection scope of the present invention.
It should be noted that the relational terms of term " first " and " second " or the like be used merely to an entity or Operation is distinguished with another entity or operation, and without necessarily requiring or implying between these entities or operation, there are any This actual relationship or sequence.Moreover, the terms "include", "comprise" or its any other variant be intended to it is non-exclusive Property include so that include a series of elements process, method, article or equipment not only include those elements, but also Further include other elements that are not explicitly listed, or further include for this process, method, article or equipment it is intrinsic Element.In the absence of more restrictions, the element limited by sentence "including a ...", it is not excluded that including described There is also other identical elements in the process, method, article or equipment of element.
Feature and performance of the invention are described in further detail with reference to embodiments.
Embodiment 1
There is certain flaw and mistakes for the conclusion of existing society's appraisal organization, for example identification item is not comprehensive, identification It is not strong etc. with the relevance of case;Meanwhile there are following characteristics for forensic identification report: term is sentence features in content, similar Content is more, and length is longer, causes semantic analysis accuracy rate low;Therefore the application proposes a kind of checking method and system, accurate language Justice is analyzed, the mistake in examination report, completes the report identification of high accuracy, and details is as follows:
As shown in Figure 1, a kind of checking method of forensic identification report, characterized by the following steps:
Step 1: the multidimensional characteristic of the case information stored in database is extracted using NLP, fitting multidimensional characteristic obtains special Levy model;
Step 2: after inputting pending report, assignment options being examined based on interactive unit selection, call the feature in database Model carries out formality examination and damage location examines;
Step 3: obtaining examination result, generate and show that legal medical expert's Identification and investigation is reported.
Because there are following characteristics for forensic identification report: similar item is more, cause Similar content can not duplicate removal, to the heavier condition of the injury, The part such as length longer expert opinion can judge to be not allowed when extracting to condition of the injury position, cause to judge by accident;Term is that clause is special It levies, there are relevances between the extraction process of sample, the number of sample are read and identified the contextual feature for having all kinds of samples.Cause This application building utilizes the multistage Ma Erke constructed in medical jurisprudence to generic term synonym and context-sensitive characteristic Husband assume HMM model, using based on word frequency distribution multistage Markov assume HMM model to case description in can use art Language is split, and the HMM model assumed using the multistage Markov based on the word frequency distribution after synonym analysis corrections is to method Term in medical test is identified and is marked, and feature extraction precision is greatly improved, and is examined to improve forensic identification report The accuracy of opinion.
Wherein, hidden Markov model (Hidden Markov Model, HMM) is the probabilistic model about timing, description It generates unobservable state random sequence at random by a hiding Markov chain, then an observation is generated by each state And generate the process of observation random sequence.The sequence for the state that hiding Markov chain generates at random, referred to as status switch (state sequence);Each state generates one and observes, and the random sequence of resulting observation, referred to as observation sequence (observation sequence).Each position of sequence can be regarded as a moment again.
When construction feature model, the multidimensional characteristic of body matter in case legal medical expert probation report is extracted using NLP, it is described more Dimensional feature includes case description, medicolegal examination, analytic explanation and expert's conclusion;
Using based on word frequency distribution multistage Markov assume HMM model, to case description in can be carried out with term Segmentation, word frequency distribution and relevance data serialize feature.
The HMM model assumed using the multistage Markov based on the word frequency distribution after synonym analysis corrections, to legal medical expert The term learned in examining is identified and is marked.
Image data is identified using OCR technique, marks and check image data format, content is with the presence or absence of mistake Accidentally, the content that will identify that is compared with the term that previous step identifies, is as a result stored with array form.For existing Inconsistent situation is prompted.
To expert's conclusion, the keyword of standard qualification result template is used to carry out semantization segmentation as token, according to wound Feelings position is divided into a plurality of.Structural data, that is, characteristic model after the above serializing.Using OCR technique to the text in attachment content Word identification, together with body matter word segment using token segmentation after, carry out synonym serializing, by obtained data with The form of Multidimensional numerical is stored in memory, when a plurality of preliminary conclusion is fitted, needs to utilize point based on vector space Analyse model.
Formality examination includes the following steps:
Step a1: acquiring the body matter and attachment content of pending report, and body matter includes title, introduction, medical jurisprudence Inspection, analytic explanation, expert's conclusion, surveyor and appraisal organization;Attachment content includes physical examination, presentation content, operation note Record, medicolegal examination and medical jurisprudence image data;
Step a2: body matter and attachment content are serialized, and memory headroom is stored in the form of characteristic, are called Characteristic model is compared and analyzes.
Damage location examination includes the following steps:
Step b1: acquiring the attachment content in pending report, and the attachment content includes physical examination, presentation content, hand Art record, medicolegal examination, medical jurisprudence image data;
Step b2: characteristic model is called, damage location is examined using knowledge mapping therein.Wherein, according to " people Bulk damage standard of perfection " classification of partes corporis humani's division methods, knowledge mapping is generated using 190 remainder of totally 12 class damage item as node.
As shown in figure 3, imaging data copy be shown as " rib cage multiple fracture " then sequence turn to keyword " rib cage ", " at two or more than fracture " are matched to the characteristic model of regular 5.6.4b, so analyze be " slight wound second level " preliminary conclusion.Root The examination report obtained according to the application is as shown in figure 4, include formality examination opinion and damage location opinion, the application reflects to legal medical expert Fixed technical examination result improves 17% or so accuracy.
Embodiment 2
Based on the method for embodiment 1, the present embodiment provides a kind of corresponding system, details is as follows:
As shown in Fig. 2, a kind of auditing system of forensic identification report, including
Database, for storing case legal medical expert probation report, characteristic model and knowledge mapping;
Model foundation unit makes for extracting the corresponding multidimensional characteristic of case legal medical expert's probation report in database using NLP Multidimensional characteristic segmentation and fitting are carried out with the HMM model that the multistage Markov based on word frequency distribution and synonym analysis is assumed, Construction feature model;
Interactive unit examines assignment options for inputting pending report and selection, and shows that legal medical expert's Identification and investigation is reported;
It examines unit, after carrying out data processing to pending report, characteristic model is called to carry out formality examination and damage Position examines.
The multidimensional characteristic in body matter that the model foundation unit extracts includes case description, medicolegal examination, divides Analysis explanation and expert's conclusion.
The model foundation unit includes
Case Expressive Features extraction module, the HMM model for using the multistage Markov based on word frequency distribution to assume, To that can be split with term in case description, forms word frequency distribution and relevance data serializes feature;
Medicolegal examination characteristic extracting module, for using the multistage horse based on the word frequency distribution after synonym analysis corrections The HMM model that Er Kefu assumes, is identified and is marked to the term in medicolegal examination;Medical jurisprudence is examined using OCR technique Image data in testing is identified, is marked, and checks image data format, and analytical analysis illustrates to obtain a plurality of nomenclature, group At serializing feature;
Expert's conclusion characteristic extracting module, after carrying out token segmentation, the high template library of matching similarity, and store In the template library;
Feature fitting module, the feature extracted for being fitted all characteristic extracting modules;
The case Expressive Features extraction module, medicolegal examination characteristic extracting module, expert's conclusion characteristic extracting module It is connect respectively with feature fitting module.
The examination unit includes that formality examination module and damage location examine module.
The formality examination module includes that the body matter of the pending report for that will acquire and attachment content carry out sequence The data processing module of change.
The damage location examines that module includes the read module for examining attachment content in pending report.
The system comprises processor, memory and store the calculating that can be run in the memory and on a processor Machine program, such as " step 1: the multidimensional characteristic of the case information stored in database is extracted using NLP, fitting multidimensional characteristic obtains Take characteristic model;Step 2: after inputting pending report, assignment options being examined based on interactive unit selection, call the spy in database It levies model and carries out formality examination and damage location examination;Step 3: obtaining examination result, generate and show legal medical expert's Identification and investigation report Announcement " program, computer program can be divided into one or more module/units, one or more of modules/unit quilt Storage in the memory, and is executed by the processor, to complete the present invention.One or more of module/units can To be the series of computation machine program instruction section that can complete specific function, which exists for describing the computer program Implementation procedure in system.For example, the computer program can be divided into model foundation unit, interactive unit, examine list Member and database, database, for storing case legal medical expert probation report, characteristic model and knowledge mapping;Model foundation unit is used In extracting the corresponding multidimensional characteristic of case legal medical expert's probation report in database using NLP, using based on word frequency distribution and synonym The HMM model that the multistage Markov of analysis is assumed carries out multidimensional characteristic segmentation and fitting, construction feature model;Interactive unit, Assignment options are examined for inputting pending report and selection, and show that legal medical expert's Identification and investigation is reported;Unit is examined, for pending After report carries out data processing, characteristic model is called to carry out formality examination and damage location examination.The model foundation unit mentions The multidimensional characteristic in body matter taken includes case description, medicolegal examination, analytic explanation and expert's conclusion.Model foundation list Member includes case Expressive Features extraction module, and the HMM model for using the multistage Markov based on word frequency distribution to assume is right It can be split with term in case description, form word frequency distribution and relevance data serializes feature;Medicolegal examination feature Extraction module, the HMM model for using the multistage Markov based on the word frequency distribution after synonym analysis corrections to assume are right Term in medicolegal examination is identified and is marked;The image data in medicolegal examination is identified using OCR technique, Label, and check image data format, analytical analysis illustrates to obtain a plurality of nomenclature, composition sequence feature;Expert's conclusion is special Extraction module is levied, after carrying out token segmentation, the high template library of matching similarity, and it is stored in the template library;Feature is quasi- Block is molded, the feature extracted for being fitted all characteristic extracting modules;The case Expressive Features extraction module, medicolegal examination Characteristic extracting module, expert's conclusion characteristic extracting module are connect with feature fitting module respectively.The examination unit includes format Examine that module and damage location examine module.The system can be desktop PC, notebook, palm PC and cloud clothes Business device etc. calculates equipment.The system may include, but be not limited only to, processor, memory.Those skilled in the art can manage Solution, the schematic diagram is only the example of the system, does not constitute the restriction to the system, may include more than illustrating Or less component, certain components or different components are perhaps combined, such as the system equipment can also include that input is defeated Equipment, network access equipment, bus etc. out.
Alleged processor can be central processing unit (Central Processing Unit, CPU), can also be it His general processor, digital signal processor (Digital Signal Processor, DSP), specific integrated circuit (Application Specific Integrated Circuit, ASIC), ready-made programmable gate array (Field- Programmable Gate Array, FPGA) either other programmable logic device, discrete gate or transistor logic, Discrete hardware components etc..General processor can be microprocessor or the processor is also possible to any conventional processor Deng the processor is the control centre of the increase resolution system, utilizes various interfaces and the entire resolution ratio of connection The various pieces of lifting system.
The memory can be used for storing the computer program and/or module, and the processor is by operation or executes Computer program in the memory and/or module are stored, and calls the data being stored in memory, described in realization The various functions of increase resolution system.The memory can mainly include storing program area and storage data area, wherein storage It program area can application program needed for storage program area, at least one function (such as sound-playing function, image player function Deng) etc.;Storage data area, which can be stored, uses created data (such as audio data, phone directory etc.) etc. according to mobile phone.This Outside, memory may include high-speed random access memory, can also include nonvolatile memory, such as hard disk, memory, insert Connect formula hard disk, intelligent memory card (Smart Media Card, SMC), secure digital (Secure Digital, SD) card, flash memory Block (Flash Card), at least one disk memory, flush memory device or other volatile solid-state parts.
The application building is multistage using constructing in medical jurisprudence to generic term synonym and context-sensitive characteristic The HMM model that Markov is assumed, the HMM model assumed using the multistage Markov based on word frequency distribution is in case description It can be split with term, the HMM mould assumed using the multistage Markov based on the word frequency distribution after synonym analysis corrections Type is identified and is marked to the term in medicolegal examination, and feature extraction precision is greatly improved, to improve forensic identification report Assisted review experience is incorporated characteristic model assisted review by the accuracy for accusing examination reports, is conducive to newly into legal medical expert personnel and common People quickly grasps checking method, improves and examines efficiency.
The foregoing is merely illustrative of the preferred embodiments of the present invention, is not intended to limit the invention, all in essence of the invention Made any modifications, equivalent replacements, and improvements etc., should all be included in the protection scope of the present invention within mind and principle.

Claims (10)

1. a kind of checking method of forensic identification report, characterized by the following steps:
Step 1: extracted in database after the multidimensional characteristic of case legal medical expert's probation report using NLP, using based on word frequency distribution and The HMM model that the multistage Markov of synonym analysis is assumed carries out multidimensional characteristic segmentation and fitting, obtains and stores character modules Type;
Step 2: after inputting pending report, assignment options being examined based on interactive unit selection, calls characteristic model to carry out format and examines It looks into and is examined with damage location;
Step 3: the examination result examined according to formality examination and damage location generates and shows that legal medical expert's Identification and investigation is reported.
2. a kind of checking method of forensic identification report according to claim 1, it is characterised in that: the step 1 includes Following steps:
Step 1.1: the multidimensional characteristic of body matter in case legal medical expert probation report is extracted using NLP, the multidimensional characteristic includes Case description, medicolegal examination, analytic explanation and expert's conclusion;
Step 1.2: extracting case property, case plot, case time of origin and the case duration data in case description, use The HMM model that multistage Markov based on word frequency distribution is assumed, to can be split with term in case description, formation sequence Change feature, that is, word frequency distribution and relevance data;
Step 1.3: extracting the term in medicolegal examination, use the multistage horse based on the word frequency distribution after synonym analysis corrections The HMM model that Er Kefu assumes, is identified and is marked to the term in medicolegal examination;Medical jurisprudence is examined using OCR technique Image data in testing is identified, is marked, and checks image data format, and analytical analysis illustrates to obtain a plurality of nomenclature, group At serializing feature;
Step 1.4: after expert's conclusion is carried out token segmentation, the high template library of matching similarity, and it is stored in the template Library;
Step 1.5: the resulting serializing feature of fit procedure 1.2-1.4 and data composition characteristic model.
3. a kind of checking method of forensic identification report according to claim 1, it is characterised in that: lattice in the step 2 Formula examination includes the following steps:
Step a1: acquiring the body matter and attachment content of pending report, body matter include title, introduction, medicolegal examination, Analytic explanation, expert's conclusion, surveyor and appraisal organization;Attachment content includes physical examination, presentation content, operation record, method Medical inspection and medical jurisprudence image data;
Step a2: body matter and attachment content are serialized, and memory headroom is stored in the form of characteristic, call feature Model is compared and analyzes.
4. a kind of checking method of forensic identification report according to claim 1, it is characterised in that: damaged in the step 2 Traumatic part position, which examines, to be included the following steps:
Step b1: acquiring the attachment content in pending report, and the attachment content includes physical examination, presentation content, operation note Record, medicolegal examination, medical jurisprudence image data;
Step b2: characteristic model is called, damage location is examined using knowledge mapping therein.
5. a kind of auditing system of forensic identification report, it is characterised in that: including
Database, for storing case legal medical expert probation report, characteristic model and knowledge mapping;
Model foundation unit uses base for extracting the corresponding multidimensional characteristic of case legal medical expert's probation report in database using NLP Multidimensional characteristic segmentation and fitting, building are carried out in the HMM model that the multistage Markov that word frequency distribution and synonym are analyzed is assumed Characteristic model;
Interactive unit examines assignment options for inputting pending report and selection, and shows that legal medical expert's Identification and investigation is reported;
It examines unit, after carrying out data processing to pending report, characteristic model is called to carry out formality examination and damage location It examines.
6. a kind of auditing system of forensic identification report according to claim 5, it is characterised in that: the model foundation list The multidimensional characteristic in body matter that member is extracted includes case description, medicolegal examination, analytic explanation and expert's conclusion.
7. a kind of auditing system of forensic identification report according to claim 6, it is characterised in that: the model foundation list Member includes
Case Expressive Features extraction module, the HMM model for using the multistage Markov based on word frequency distribution to assume, to case It can be split with term in part description, form word frequency distribution and relevance data serializes feature;
Medicolegal examination characteristic extracting module, for using the multistage Ma Erke based on the word frequency distribution after synonym analysis corrections The HMM model that husband assumes, is identified and is marked to the term in medicolegal examination;Using OCR technique in medicolegal examination Image data identified, marked, and check image data format, analytical analysis illustrates to obtain a plurality of nomenclature, forms sequence Columnization feature;
Expert's conclusion characteristic extracting module, after carrying out token segmentation, the high template library of matching similarity, and it is stored in institute State template library;
Feature fitting module, the feature extracted for being fitted all characteristic extracting modules;
The case Expressive Features extraction module, medicolegal examination characteristic extracting module, expert's conclusion characteristic extracting module difference It is connect with feature fitting module.
8. a kind of auditing system of forensic identification report according to claim 5, it is characterised in that: the examination unit packet It includes formality examination module and damage location examines module.
9. a kind of auditing system of forensic identification report according to claim 8, it is characterised in that: the formality examination mould Block includes the data processing module for serializing the body matter of the pending report of acquisition and attachment content.
10. a kind of auditing system of forensic identification report according to claim 8, it is characterised in that: the damage location Examine that module includes the read module for examining attachment content in pending report.
CN201910689439.2A 2019-07-29 2019-07-29 A kind of checking method and system of forensic identification report Pending CN110289066A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910689439.2A CN110289066A (en) 2019-07-29 2019-07-29 A kind of checking method and system of forensic identification report

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201910689439.2A CN110289066A (en) 2019-07-29 2019-07-29 A kind of checking method and system of forensic identification report

Publications (1)

Publication Number Publication Date
CN110289066A true CN110289066A (en) 2019-09-27

Family

ID=68022775

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910689439.2A Pending CN110289066A (en) 2019-07-29 2019-07-29 A kind of checking method and system of forensic identification report

Country Status (1)

Country Link
CN (1) CN110289066A (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110941612A (en) * 2019-11-19 2020-03-31 上海交通大学 Autonomous data lake construction system and method based on associated data
CN115062165A (en) * 2022-08-18 2022-09-16 合肥综合性国家科学中心人工智能研究院(安徽省人工智能实验室) Medical image diagnosis method and device based on film reading knowledge graph

Citations (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20050153269A1 (en) * 1997-03-27 2005-07-14 Driscoll Gary F. System and method for computer based creation of tests formatted to facilitate computer based testing
US20140089301A1 (en) * 2011-05-23 2014-03-27 Lgc Limited And relating to the matching of forensic results
CN104572616A (en) * 2014-12-23 2015-04-29 北京锐安科技有限公司 Method and device for identifying text orientation
CN106682397A (en) * 2016-12-09 2017-05-17 江西中科九峰智慧医疗科技有限公司 Knowledge-based electronic medical record quality control method
CN107168946A (en) * 2017-04-14 2017-09-15 北京化工大学 A kind of name entity recognition method of medical text data
CN108447534A (en) * 2018-05-18 2018-08-24 灵玖中科软件(北京)有限公司 A kind of electronic health record data quality management method based on NLP
CN108595432A (en) * 2018-04-28 2018-09-28 江苏医像信息技术有限公司 Medical document error correction method
CN108763483A (en) * 2018-05-25 2018-11-06 南京大学 A kind of Text Information Extraction method towards judgement document
CN109492203A (en) * 2018-11-21 2019-03-19 深圳中广核工程设计有限公司 A kind of nuclear power large-scale synthesis reporting format method of calibration and system
CN109636564A (en) * 2018-10-16 2019-04-16 平安科技(深圳)有限公司 Information verification mechanism, device, equipment and storage medium for air control
CN109658071A (en) * 2018-12-27 2019-04-19 四川西南交大铁路发展股份有限公司 Railway sedimentation automation assessment management method
CN109919585A (en) * 2019-05-14 2019-06-21 上海市浦东新区行政服务中心(上海市浦东新区市民中心) Artificial intelligence auxiliary administrative examination and approval method, system and the terminal of knowledge based map

Patent Citations (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20050153269A1 (en) * 1997-03-27 2005-07-14 Driscoll Gary F. System and method for computer based creation of tests formatted to facilitate computer based testing
US20140089301A1 (en) * 2011-05-23 2014-03-27 Lgc Limited And relating to the matching of forensic results
CN104572616A (en) * 2014-12-23 2015-04-29 北京锐安科技有限公司 Method and device for identifying text orientation
CN106682397A (en) * 2016-12-09 2017-05-17 江西中科九峰智慧医疗科技有限公司 Knowledge-based electronic medical record quality control method
CN107168946A (en) * 2017-04-14 2017-09-15 北京化工大学 A kind of name entity recognition method of medical text data
CN108595432A (en) * 2018-04-28 2018-09-28 江苏医像信息技术有限公司 Medical document error correction method
CN108447534A (en) * 2018-05-18 2018-08-24 灵玖中科软件(北京)有限公司 A kind of electronic health record data quality management method based on NLP
CN108763483A (en) * 2018-05-25 2018-11-06 南京大学 A kind of Text Information Extraction method towards judgement document
CN109636564A (en) * 2018-10-16 2019-04-16 平安科技(深圳)有限公司 Information verification mechanism, device, equipment and storage medium for air control
CN109492203A (en) * 2018-11-21 2019-03-19 深圳中广核工程设计有限公司 A kind of nuclear power large-scale synthesis reporting format method of calibration and system
CN109658071A (en) * 2018-12-27 2019-04-19 四川西南交大铁路发展股份有限公司 Railway sedimentation automation assessment management method
CN109919585A (en) * 2019-05-14 2019-06-21 上海市浦东新区行政服务中心(上海市浦东新区市民中心) Artificial intelligence auxiliary administrative examination and approval method, system and the terminal of knowledge based map

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
APEZIT: "法医技术证据审查软件介绍", 《HTTPS://TIEBA.BAIDU.COM/P/6101794510》 *

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110941612A (en) * 2019-11-19 2020-03-31 上海交通大学 Autonomous data lake construction system and method based on associated data
CN115062165A (en) * 2022-08-18 2022-09-16 合肥综合性国家科学中心人工智能研究院(安徽省人工智能实验室) Medical image diagnosis method and device based on film reading knowledge graph

Similar Documents

Publication Publication Date Title
US9418144B2 (en) Similar document detection and electronic discovery
US11308143B2 (en) Discrepancy curator for documents in a corpus of a cognitive computing system
US9189748B2 (en) Information extraction system, method, and program
US10146858B2 (en) Discrepancy handler for document ingestion into a corpus for a cognitive computing system
US20080201131A1 (en) Method and apparatus for automatically discovering features in free form heterogeneous data
US8626737B1 (en) Method and apparatus for processing electronically stored information for electronic discovery
CN109840255A (en) Reply document creation method, device, equipment and storage medium
CN110289066A (en) A kind of checking method and system of forensic identification report
KR102280490B1 (en) Training data construction method for automatically generating training data for artificial intelligence model for counseling intention classification
Jeon et al. Making a graph database from unstructured text
Garrido et al. GEO-NASS: A semantic tagging experience from geographical data on the media
CN108021595B (en) Method and device for checking knowledge base triples
CN112380848A (en) Text generation method, device, equipment and storage medium
CN111881695A (en) Audit knowledge retrieval method and device
CN113779983B (en) Text data processing method and device, storage medium and electronic device
CN105786929A (en) Information monitoring method and device
CN115017256A (en) Power data processing method and device, electronic equipment and storage medium
CN113505117A (en) Data quality evaluation method, device, equipment and medium based on data indexes
CN113722421A (en) Contract auditing method and system and computer readable storage medium
KR20150134645A (en) Author clearly confirm device and method.
CN116976683B (en) Automatic auditing method, system, storage medium and device for contract clauses
TWI703453B (en) Suggestion creating device, computer readable recording medium with suggestion creating program recorded thereon, and suggestion creating method
Kim et al. Hierarchical image classification in the bioscience literature
CN117609509A (en) Automatic device, method and storage medium for knowledge carding and conflict resolution
CN116910054A (en) Data processing method, device, electronic equipment and computer readable storage medium

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20190927

RJ01 Rejection of invention patent application after publication