CN110289066A - A kind of checking method and system of forensic identification report - Google Patents
A kind of checking method and system of forensic identification report Download PDFInfo
- Publication number
- CN110289066A CN110289066A CN201910689439.2A CN201910689439A CN110289066A CN 110289066 A CN110289066 A CN 110289066A CN 201910689439 A CN201910689439 A CN 201910689439A CN 110289066 A CN110289066 A CN 110289066A
- Authority
- CN
- China
- Prior art keywords
- examination
- characteristic
- report
- model
- case
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/20—Natural language analysis
- G06F40/279—Recognition of textual entities
- G06F40/284—Lexical analysis, e.g. tokenisation or collocates
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/20—Natural language analysis
- G06F40/279—Recognition of textual entities
- G06F40/289—Phrasal analysis, e.g. finite state techniques or chunking
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16H—HEALTHCARE INFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR THE HANDLING OR PROCESSING OF MEDICAL OR HEALTHCARE DATA
- G16H15/00—ICT specially adapted for medical reports, e.g. generation or transmission thereof
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Health & Medical Sciences (AREA)
- General Health & Medical Sciences (AREA)
- Computational Linguistics (AREA)
- Artificial Intelligence (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Primary Health Care (AREA)
- Public Health (AREA)
- Medical Informatics (AREA)
- Epidemiology (AREA)
- Management, Administration, Business Operations System, And Electronic Commerce (AREA)
Abstract
The invention discloses the checking methods and system of a kind of report of forensic identification, are related to extraction of semantics and analysis field;Its method includes step 1: being extracted in database after the multidimensional characteristic of case legal medical expert's probation report using NLP, multidimensional characteristic segmentation and fitting are carried out using the HMM model that the multistage Markov based on word frequency distribution and synonym analysis is assumed, obtains and stores characteristic model;Step 2: after inputting pending report, assignment options being examined based on interactive unit selection, characteristic model progress formality examination and damage location is called to examine;Step 3: the examination result examined according to formality examination and damage location generates and shows that legal medical expert's Identification and investigation is reported.The present invention utilizes the HMM model assumed in medical jurisprudence the multistage Markov of generic term synonym and context-sensitive characteristic building by building, characteristic model is established, figure in the law circle is helped quickly and accurately to examine flaw present in probation report and mistake.
Description
Technical field
The present invention relates to extraction of semantics and analysis field, the especially a kind of checking method and system of forensic identification report.
Background technique
Under the background of current reform of the judicial system iterative method, a large amount of survey reports, expert opinion, assessment report and specially
Professional, scientific and objectivity the requirement of family's examination reports is higher and higher, and procurator, lawyer and judge are to such
The examination consciousness and depth of evidence are constantly reinforced.But such opinion or report still rely on investigation appraiser with this profession art
Language is write using all kinds of text edit softwares, exists and searches and manage bring time-consuming by artificial because of cited data source
Long, the problem of process is many and diverse, low efficiency.For example, the figure in the law circle that Yunnan Province handles drug-related case is mostly to be related to the life of drugs
Object chemistry expert can identify case sample of being involved in drug traffic, and write probation report, but interior ground province method using softwares such as Word
Rule personage does not have related drugs identification technical ability, so needing to rely on social appraisal organization provides probation report.Law attainment, prison
Superintending and directing many reasons such as management causes the conclusion of social appraisal organization there is certain flaw and mistake, for example identification item is not complete
The relevance in face, identification and case is not strong etc.;Meanwhile there are following characteristics for forensic identification report: term is that clause is special in content
Sign, Similar content is more, and length is longer, causes semantic analysis accuracy rate low;Therefore a kind of checking method and system, accurate language are needed
Justice is analyzed, the mistake in examination report, completes the report identification of high accuracy.
Summary of the invention
It is an object of the invention to: the present invention provides the checking methods and system of a kind of report of forensic identification, help method
Rule personage quickly and accurately examines flaw present in probation report and mistake.
The technical solution adopted by the invention is as follows:
A kind of checking method of forensic identification report, includes the following steps:
Step 1: after the multidimensional characteristic using case legal medical expert's probation report in NLP extraction database, using word-based frequency division
The HMM model that cloth and the multistage Markov of synonym analysis are assumed carries out multidimensional characteristic segmentation and fitting, obtains and stores spy
Levy model;
Step 2: after inputting pending report, assignment options being examined based on interactive unit selection, characteristic model is called to carry out lattice
Formula examines and damage location examines;
Step 3: the examination result examined according to formality examination and damage location generates and shows legal medical expert's Identification and investigation report
It accuses.
Preferably, the step 1 includes the following steps:
Step 1.1: the multidimensional characteristic of body matter in case legal medical expert probation report, the multidimensional characteristic are extracted using NLP
Including case description, medicolegal examination, analytic explanation and expert's conclusion;
Step 1.2: case property, case plot, case time of origin and the case duration data in case description are extracted,
The HMM model assumed using the multistage Markov based on word frequency distribution is formed to that can be split with term in case description
Serialize feature, that is, word frequency distribution and relevance data;
Step 1.3: the term in medicolegal examination is extracted, using based on the more of the word frequency distribution after synonym analysis corrections
The HMM model that rank Markov is assumed, is identified and is marked to the term in medicolegal examination;Using OCR technique to legal medical expert
The image data learned in examining is identified, is marked, and checks image data format, and analytical analysis illustrates to obtain a plurality of term
Table, composition sequence feature;
Step 1.4: after expert's conclusion is carried out token segmentation, the high template library of matching similarity, and it is stored in the mould
Plate library;
Step 1.5: the resulting serializing feature of fit procedure 1.2-1.4 and data composition characteristic model.
Preferably, formality examination includes the following steps: in the step 2
Step a1: acquiring the body matter and attachment content of pending report, and body matter includes title, introduction, medical jurisprudence
Inspection, analytic explanation, expert's conclusion, surveyor and appraisal organization;Attachment content includes physical examination, presentation content, operation note
Record, medicolegal examination and medical jurisprudence image data;
Step a2: body matter and attachment content are serialized, and memory headroom is stored in the form of characteristic, are called
Characteristic model is compared and analyzes.
Preferably, damage location examination includes the following steps: in the step 2
Step b1: acquiring the attachment content in pending report, and the attachment content includes physical examination, presentation content, hand
Art record, medicolegal examination, medical jurisprudence image data;
Step b2: characteristic model is called, damage location is examined using knowledge mapping therein.
A kind of auditing system of forensic identification report, including
Database, for storing case legal medical expert probation report, characteristic model and knowledge mapping;
Model foundation unit makes for extracting the corresponding multidimensional characteristic of case legal medical expert's probation report in database using NLP
Multidimensional characteristic segmentation and fitting are carried out with the HMM model that the multistage Markov based on word frequency distribution and synonym analysis is assumed,
Construction feature model;
Interactive unit examines assignment options for inputting pending report and selection, and shows that legal medical expert's Identification and investigation is reported;
It examines unit, after carrying out data processing to pending report, characteristic model is called to carry out formality examination and damage
Position examines.
Preferably, the multidimensional characteristic in body matter that the model foundation unit extracts includes case description, medical jurisprudence
Inspection, analytic explanation and expert's conclusion.
Preferably, the model foundation unit includes
Case Expressive Features extraction module, the HMM model for using the multistage Markov based on word frequency distribution to assume,
To that can be split with term in case description, forms word frequency distribution and relevance data serializes feature;
Medicolegal examination characteristic extracting module, for using the multistage horse based on the word frequency distribution after synonym analysis corrections
The HMM model that Er Kefu assumes, is identified and is marked to the term in medicolegal examination;Medical jurisprudence is examined using OCR technique
Image data in testing is identified, is marked, and checks image data format, and analytical analysis illustrates to obtain a plurality of nomenclature, group
At serializing feature;
Expert's conclusion characteristic extracting module, after carrying out token segmentation, the high template library of matching similarity, and store
In the template library;
Feature fitting module, the feature extracted for being fitted all characteristic extracting modules;
The case Expressive Features extraction module, medicolegal examination characteristic extracting module, expert's conclusion characteristic extracting module
It is connect respectively with feature fitting module.
Preferably, the examination unit includes that formality examination module and damage location examine module.
Preferably, the formality examination module include the pending report for that will acquire body matter and attachment content into
The data processing module of row serializing.
Preferably, the damage location examines that module includes the read module for examining attachment content in pending report.
In conclusion by adopting the above-described technical solution, the beneficial effects of the present invention are:
1. the present invention is examined by characteristic model unified standard, auxiliary personnel in charge of the case carries out electronic authentication, reduces report
It accuses because of identification side's bring mistake and error, injury of human degree Identification and investigation is standardized, modularization, personnel in charge of the case can
Independent operation completes the judgement to professional accreditation conclusion;
2. assisted review experience is incorporated characteristic model assisted review by the present invention, it is conducive to newly fast into legal medical expert personnel and ordinary people
Speed grasps checking method, improves and examines efficiency;
3. the present invention constructs generic term synonym and context-sensitive characteristic using in medical jurisprudence by building
Multistage Markov assume HMM model, using based on word frequency distribution multistage Markov assume HMM model to case
It can be split with term in description, be assumed using the multistage Markov based on the word frequency distribution after synonym analysis corrections
HMM model is identified and is marked to the term in medicolegal examination, and feature extraction precision is greatly improved, to improve legal medical expert's mirror
Surely the accuracy of examination reports is reported.
Detailed description of the invention
In order to illustrate the technical solution of the embodiments of the present invention more clearly, below will be to needed in the embodiment attached
Figure is briefly described, it should be understood that the following drawings illustrates only certain embodiments of the present invention, therefore is not construed as pair
The restriction of range for those of ordinary skill in the art without creative efforts, can also be according to this
A little attached drawings obtain other relevant attached drawings.
Fig. 1 is flow chart of the method for the present invention;
Fig. 2 is the block diagram of system of the invention;
Fig. 3 is characteristic model schematic diagram of the invention;
Fig. 4 is examination report schematic diagram of the invention.
Specific embodiment
In order to make the objectives, technical solutions, and advantages of the present invention clearer, with reference to the accompanying drawings and embodiments, right
The present invention is further elaborated.It should be appreciated that described herein, specific examples are only used to explain the present invention, not
For limiting the present invention, i.e., described embodiments are only a part of the embodiments of the present invention, instead of all the embodiments.It is logical
The component for the embodiment of the present invention being often described and illustrated herein in the accompanying drawings can be arranged and be designed with a variety of different configurations.
Therefore, the detailed description of the embodiment of the present invention provided in the accompanying drawings is not intended to limit below claimed
The scope of the present invention, but be merely representative of selected embodiment of the invention.Based on the embodiment of the present invention, those skilled in the art
Member's every other embodiment obtained without making creative work, shall fall within the protection scope of the present invention.
It should be noted that the relational terms of term " first " and " second " or the like be used merely to an entity or
Operation is distinguished with another entity or operation, and without necessarily requiring or implying between these entities or operation, there are any
This actual relationship or sequence.Moreover, the terms "include", "comprise" or its any other variant be intended to it is non-exclusive
Property include so that include a series of elements process, method, article or equipment not only include those elements, but also
Further include other elements that are not explicitly listed, or further include for this process, method, article or equipment it is intrinsic
Element.In the absence of more restrictions, the element limited by sentence "including a ...", it is not excluded that including described
There is also other identical elements in the process, method, article or equipment of element.
Feature and performance of the invention are described in further detail with reference to embodiments.
Embodiment 1
There is certain flaw and mistakes for the conclusion of existing society's appraisal organization, for example identification item is not comprehensive, identification
It is not strong etc. with the relevance of case;Meanwhile there are following characteristics for forensic identification report: term is sentence features in content, similar
Content is more, and length is longer, causes semantic analysis accuracy rate low;Therefore the application proposes a kind of checking method and system, accurate language
Justice is analyzed, the mistake in examination report, completes the report identification of high accuracy, and details is as follows:
As shown in Figure 1, a kind of checking method of forensic identification report, characterized by the following steps:
Step 1: the multidimensional characteristic of the case information stored in database is extracted using NLP, fitting multidimensional characteristic obtains special
Levy model;
Step 2: after inputting pending report, assignment options being examined based on interactive unit selection, call the feature in database
Model carries out formality examination and damage location examines;
Step 3: obtaining examination result, generate and show that legal medical expert's Identification and investigation is reported.
Because there are following characteristics for forensic identification report: similar item is more, cause Similar content can not duplicate removal, to the heavier condition of the injury,
The part such as length longer expert opinion can judge to be not allowed when extracting to condition of the injury position, cause to judge by accident;Term is that clause is special
It levies, there are relevances between the extraction process of sample, the number of sample are read and identified the contextual feature for having all kinds of samples.Cause
This application building utilizes the multistage Ma Erke constructed in medical jurisprudence to generic term synonym and context-sensitive characteristic
Husband assume HMM model, using based on word frequency distribution multistage Markov assume HMM model to case description in can use art
Language is split, and the HMM model assumed using the multistage Markov based on the word frequency distribution after synonym analysis corrections is to method
Term in medical test is identified and is marked, and feature extraction precision is greatly improved, and is examined to improve forensic identification report
The accuracy of opinion.
Wherein, hidden Markov model (Hidden Markov Model, HMM) is the probabilistic model about timing, description
It generates unobservable state random sequence at random by a hiding Markov chain, then an observation is generated by each state
And generate the process of observation random sequence.The sequence for the state that hiding Markov chain generates at random, referred to as status switch
(state sequence);Each state generates one and observes, and the random sequence of resulting observation, referred to as observation sequence
(observation sequence).Each position of sequence can be regarded as a moment again.
When construction feature model, the multidimensional characteristic of body matter in case legal medical expert probation report is extracted using NLP, it is described more
Dimensional feature includes case description, medicolegal examination, analytic explanation and expert's conclusion;
Using based on word frequency distribution multistage Markov assume HMM model, to case description in can be carried out with term
Segmentation, word frequency distribution and relevance data serialize feature.
The HMM model assumed using the multistage Markov based on the word frequency distribution after synonym analysis corrections, to legal medical expert
The term learned in examining is identified and is marked.
Image data is identified using OCR technique, marks and check image data format, content is with the presence or absence of mistake
Accidentally, the content that will identify that is compared with the term that previous step identifies, is as a result stored with array form.For existing
Inconsistent situation is prompted.
To expert's conclusion, the keyword of standard qualification result template is used to carry out semantization segmentation as token, according to wound
Feelings position is divided into a plurality of.Structural data, that is, characteristic model after the above serializing.Using OCR technique to the text in attachment content
Word identification, together with body matter word segment using token segmentation after, carry out synonym serializing, by obtained data with
The form of Multidimensional numerical is stored in memory, when a plurality of preliminary conclusion is fitted, needs to utilize point based on vector space
Analyse model.
Formality examination includes the following steps:
Step a1: acquiring the body matter and attachment content of pending report, and body matter includes title, introduction, medical jurisprudence
Inspection, analytic explanation, expert's conclusion, surveyor and appraisal organization;Attachment content includes physical examination, presentation content, operation note
Record, medicolegal examination and medical jurisprudence image data;
Step a2: body matter and attachment content are serialized, and memory headroom is stored in the form of characteristic, are called
Characteristic model is compared and analyzes.
Damage location examination includes the following steps:
Step b1: acquiring the attachment content in pending report, and the attachment content includes physical examination, presentation content, hand
Art record, medicolegal examination, medical jurisprudence image data;
Step b2: characteristic model is called, damage location is examined using knowledge mapping therein.Wherein, according to " people
Bulk damage standard of perfection " classification of partes corporis humani's division methods, knowledge mapping is generated using 190 remainder of totally 12 class damage item as node.
As shown in figure 3, imaging data copy be shown as " rib cage multiple fracture " then sequence turn to keyword " rib cage ",
" at two or more than fracture " are matched to the characteristic model of regular 5.6.4b, so analyze be " slight wound second level " preliminary conclusion.Root
The examination report obtained according to the application is as shown in figure 4, include formality examination opinion and damage location opinion, the application reflects to legal medical expert
Fixed technical examination result improves 17% or so accuracy.
Embodiment 2
Based on the method for embodiment 1, the present embodiment provides a kind of corresponding system, details is as follows:
As shown in Fig. 2, a kind of auditing system of forensic identification report, including
Database, for storing case legal medical expert probation report, characteristic model and knowledge mapping;
Model foundation unit makes for extracting the corresponding multidimensional characteristic of case legal medical expert's probation report in database using NLP
Multidimensional characteristic segmentation and fitting are carried out with the HMM model that the multistage Markov based on word frequency distribution and synonym analysis is assumed,
Construction feature model;
Interactive unit examines assignment options for inputting pending report and selection, and shows that legal medical expert's Identification and investigation is reported;
It examines unit, after carrying out data processing to pending report, characteristic model is called to carry out formality examination and damage
Position examines.
The multidimensional characteristic in body matter that the model foundation unit extracts includes case description, medicolegal examination, divides
Analysis explanation and expert's conclusion.
The model foundation unit includes
Case Expressive Features extraction module, the HMM model for using the multistage Markov based on word frequency distribution to assume,
To that can be split with term in case description, forms word frequency distribution and relevance data serializes feature;
Medicolegal examination characteristic extracting module, for using the multistage horse based on the word frequency distribution after synonym analysis corrections
The HMM model that Er Kefu assumes, is identified and is marked to the term in medicolegal examination;Medical jurisprudence is examined using OCR technique
Image data in testing is identified, is marked, and checks image data format, and analytical analysis illustrates to obtain a plurality of nomenclature, group
At serializing feature;
Expert's conclusion characteristic extracting module, after carrying out token segmentation, the high template library of matching similarity, and store
In the template library;
Feature fitting module, the feature extracted for being fitted all characteristic extracting modules;
The case Expressive Features extraction module, medicolegal examination characteristic extracting module, expert's conclusion characteristic extracting module
It is connect respectively with feature fitting module.
The examination unit includes that formality examination module and damage location examine module.
The formality examination module includes that the body matter of the pending report for that will acquire and attachment content carry out sequence
The data processing module of change.
The damage location examines that module includes the read module for examining attachment content in pending report.
The system comprises processor, memory and store the calculating that can be run in the memory and on a processor
Machine program, such as " step 1: the multidimensional characteristic of the case information stored in database is extracted using NLP, fitting multidimensional characteristic obtains
Take characteristic model;Step 2: after inputting pending report, assignment options being examined based on interactive unit selection, call the spy in database
It levies model and carries out formality examination and damage location examination;Step 3: obtaining examination result, generate and show legal medical expert's Identification and investigation report
Announcement " program, computer program can be divided into one or more module/units, one or more of modules/unit quilt
Storage in the memory, and is executed by the processor, to complete the present invention.One or more of module/units can
To be the series of computation machine program instruction section that can complete specific function, which exists for describing the computer program
Implementation procedure in system.For example, the computer program can be divided into model foundation unit, interactive unit, examine list
Member and database, database, for storing case legal medical expert probation report, characteristic model and knowledge mapping;Model foundation unit is used
In extracting the corresponding multidimensional characteristic of case legal medical expert's probation report in database using NLP, using based on word frequency distribution and synonym
The HMM model that the multistage Markov of analysis is assumed carries out multidimensional characteristic segmentation and fitting, construction feature model;Interactive unit,
Assignment options are examined for inputting pending report and selection, and show that legal medical expert's Identification and investigation is reported;Unit is examined, for pending
After report carries out data processing, characteristic model is called to carry out formality examination and damage location examination.The model foundation unit mentions
The multidimensional characteristic in body matter taken includes case description, medicolegal examination, analytic explanation and expert's conclusion.Model foundation list
Member includes case Expressive Features extraction module, and the HMM model for using the multistage Markov based on word frequency distribution to assume is right
It can be split with term in case description, form word frequency distribution and relevance data serializes feature;Medicolegal examination feature
Extraction module, the HMM model for using the multistage Markov based on the word frequency distribution after synonym analysis corrections to assume are right
Term in medicolegal examination is identified and is marked;The image data in medicolegal examination is identified using OCR technique,
Label, and check image data format, analytical analysis illustrates to obtain a plurality of nomenclature, composition sequence feature;Expert's conclusion is special
Extraction module is levied, after carrying out token segmentation, the high template library of matching similarity, and it is stored in the template library;Feature is quasi-
Block is molded, the feature extracted for being fitted all characteristic extracting modules;The case Expressive Features extraction module, medicolegal examination
Characteristic extracting module, expert's conclusion characteristic extracting module are connect with feature fitting module respectively.The examination unit includes format
Examine that module and damage location examine module.The system can be desktop PC, notebook, palm PC and cloud clothes
Business device etc. calculates equipment.The system may include, but be not limited only to, processor, memory.Those skilled in the art can manage
Solution, the schematic diagram is only the example of the system, does not constitute the restriction to the system, may include more than illustrating
Or less component, certain components or different components are perhaps combined, such as the system equipment can also include that input is defeated
Equipment, network access equipment, bus etc. out.
Alleged processor can be central processing unit (Central Processing Unit, CPU), can also be it
His general processor, digital signal processor (Digital Signal Processor, DSP), specific integrated circuit
(Application Specific Integrated Circuit, ASIC), ready-made programmable gate array (Field-
Programmable Gate Array, FPGA) either other programmable logic device, discrete gate or transistor logic,
Discrete hardware components etc..General processor can be microprocessor or the processor is also possible to any conventional processor
Deng the processor is the control centre of the increase resolution system, utilizes various interfaces and the entire resolution ratio of connection
The various pieces of lifting system.
The memory can be used for storing the computer program and/or module, and the processor is by operation or executes
Computer program in the memory and/or module are stored, and calls the data being stored in memory, described in realization
The various functions of increase resolution system.The memory can mainly include storing program area and storage data area, wherein storage
It program area can application program needed for storage program area, at least one function (such as sound-playing function, image player function
Deng) etc.;Storage data area, which can be stored, uses created data (such as audio data, phone directory etc.) etc. according to mobile phone.This
Outside, memory may include high-speed random access memory, can also include nonvolatile memory, such as hard disk, memory, insert
Connect formula hard disk, intelligent memory card (Smart Media Card, SMC), secure digital (Secure Digital, SD) card, flash memory
Block (Flash Card), at least one disk memory, flush memory device or other volatile solid-state parts.
The application building is multistage using constructing in medical jurisprudence to generic term synonym and context-sensitive characteristic
The HMM model that Markov is assumed, the HMM model assumed using the multistage Markov based on word frequency distribution is in case description
It can be split with term, the HMM mould assumed using the multistage Markov based on the word frequency distribution after synonym analysis corrections
Type is identified and is marked to the term in medicolegal examination, and feature extraction precision is greatly improved, to improve forensic identification report
Assisted review experience is incorporated characteristic model assisted review by the accuracy for accusing examination reports, is conducive to newly into legal medical expert personnel and common
People quickly grasps checking method, improves and examines efficiency.
The foregoing is merely illustrative of the preferred embodiments of the present invention, is not intended to limit the invention, all in essence of the invention
Made any modifications, equivalent replacements, and improvements etc., should all be included in the protection scope of the present invention within mind and principle.
Claims (10)
1. a kind of checking method of forensic identification report, characterized by the following steps:
Step 1: extracted in database after the multidimensional characteristic of case legal medical expert's probation report using NLP, using based on word frequency distribution and
The HMM model that the multistage Markov of synonym analysis is assumed carries out multidimensional characteristic segmentation and fitting, obtains and stores character modules
Type;
Step 2: after inputting pending report, assignment options being examined based on interactive unit selection, calls characteristic model to carry out format and examines
It looks into and is examined with damage location;
Step 3: the examination result examined according to formality examination and damage location generates and shows that legal medical expert's Identification and investigation is reported.
2. a kind of checking method of forensic identification report according to claim 1, it is characterised in that: the step 1 includes
Following steps:
Step 1.1: the multidimensional characteristic of body matter in case legal medical expert probation report is extracted using NLP, the multidimensional characteristic includes
Case description, medicolegal examination, analytic explanation and expert's conclusion;
Step 1.2: extracting case property, case plot, case time of origin and the case duration data in case description, use
The HMM model that multistage Markov based on word frequency distribution is assumed, to can be split with term in case description, formation sequence
Change feature, that is, word frequency distribution and relevance data;
Step 1.3: extracting the term in medicolegal examination, use the multistage horse based on the word frequency distribution after synonym analysis corrections
The HMM model that Er Kefu assumes, is identified and is marked to the term in medicolegal examination;Medical jurisprudence is examined using OCR technique
Image data in testing is identified, is marked, and checks image data format, and analytical analysis illustrates to obtain a plurality of nomenclature, group
At serializing feature;
Step 1.4: after expert's conclusion is carried out token segmentation, the high template library of matching similarity, and it is stored in the template
Library;
Step 1.5: the resulting serializing feature of fit procedure 1.2-1.4 and data composition characteristic model.
3. a kind of checking method of forensic identification report according to claim 1, it is characterised in that: lattice in the step 2
Formula examination includes the following steps:
Step a1: acquiring the body matter and attachment content of pending report, body matter include title, introduction, medicolegal examination,
Analytic explanation, expert's conclusion, surveyor and appraisal organization;Attachment content includes physical examination, presentation content, operation record, method
Medical inspection and medical jurisprudence image data;
Step a2: body matter and attachment content are serialized, and memory headroom is stored in the form of characteristic, call feature
Model is compared and analyzes.
4. a kind of checking method of forensic identification report according to claim 1, it is characterised in that: damaged in the step 2
Traumatic part position, which examines, to be included the following steps:
Step b1: acquiring the attachment content in pending report, and the attachment content includes physical examination, presentation content, operation note
Record, medicolegal examination, medical jurisprudence image data;
Step b2: characteristic model is called, damage location is examined using knowledge mapping therein.
5. a kind of auditing system of forensic identification report, it is characterised in that: including
Database, for storing case legal medical expert probation report, characteristic model and knowledge mapping;
Model foundation unit uses base for extracting the corresponding multidimensional characteristic of case legal medical expert's probation report in database using NLP
Multidimensional characteristic segmentation and fitting, building are carried out in the HMM model that the multistage Markov that word frequency distribution and synonym are analyzed is assumed
Characteristic model;
Interactive unit examines assignment options for inputting pending report and selection, and shows that legal medical expert's Identification and investigation is reported;
It examines unit, after carrying out data processing to pending report, characteristic model is called to carry out formality examination and damage location
It examines.
6. a kind of auditing system of forensic identification report according to claim 5, it is characterised in that: the model foundation list
The multidimensional characteristic in body matter that member is extracted includes case description, medicolegal examination, analytic explanation and expert's conclusion.
7. a kind of auditing system of forensic identification report according to claim 6, it is characterised in that: the model foundation list
Member includes
Case Expressive Features extraction module, the HMM model for using the multistage Markov based on word frequency distribution to assume, to case
It can be split with term in part description, form word frequency distribution and relevance data serializes feature;
Medicolegal examination characteristic extracting module, for using the multistage Ma Erke based on the word frequency distribution after synonym analysis corrections
The HMM model that husband assumes, is identified and is marked to the term in medicolegal examination;Using OCR technique in medicolegal examination
Image data identified, marked, and check image data format, analytical analysis illustrates to obtain a plurality of nomenclature, forms sequence
Columnization feature;
Expert's conclusion characteristic extracting module, after carrying out token segmentation, the high template library of matching similarity, and it is stored in institute
State template library;
Feature fitting module, the feature extracted for being fitted all characteristic extracting modules;
The case Expressive Features extraction module, medicolegal examination characteristic extracting module, expert's conclusion characteristic extracting module difference
It is connect with feature fitting module.
8. a kind of auditing system of forensic identification report according to claim 5, it is characterised in that: the examination unit packet
It includes formality examination module and damage location examines module.
9. a kind of auditing system of forensic identification report according to claim 8, it is characterised in that: the formality examination mould
Block includes the data processing module for serializing the body matter of the pending report of acquisition and attachment content.
10. a kind of auditing system of forensic identification report according to claim 8, it is characterised in that: the damage location
Examine that module includes the read module for examining attachment content in pending report.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201910689439.2A CN110289066A (en) | 2019-07-29 | 2019-07-29 | A kind of checking method and system of forensic identification report |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201910689439.2A CN110289066A (en) | 2019-07-29 | 2019-07-29 | A kind of checking method and system of forensic identification report |
Publications (1)
Publication Number | Publication Date |
---|---|
CN110289066A true CN110289066A (en) | 2019-09-27 |
Family
ID=68022775
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201910689439.2A Pending CN110289066A (en) | 2019-07-29 | 2019-07-29 | A kind of checking method and system of forensic identification report |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN110289066A (en) |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110941612A (en) * | 2019-11-19 | 2020-03-31 | 上海交通大学 | Autonomous data lake construction system and method based on associated data |
CN115062165A (en) * | 2022-08-18 | 2022-09-16 | 合肥综合性国家科学中心人工智能研究院(安徽省人工智能实验室) | Medical image diagnosis method and device based on film reading knowledge graph |
Citations (12)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20050153269A1 (en) * | 1997-03-27 | 2005-07-14 | Driscoll Gary F. | System and method for computer based creation of tests formatted to facilitate computer based testing |
US20140089301A1 (en) * | 2011-05-23 | 2014-03-27 | Lgc Limited | And relating to the matching of forensic results |
CN104572616A (en) * | 2014-12-23 | 2015-04-29 | 北京锐安科技有限公司 | Method and device for identifying text orientation |
CN106682397A (en) * | 2016-12-09 | 2017-05-17 | 江西中科九峰智慧医疗科技有限公司 | Knowledge-based electronic medical record quality control method |
CN107168946A (en) * | 2017-04-14 | 2017-09-15 | 北京化工大学 | A kind of name entity recognition method of medical text data |
CN108447534A (en) * | 2018-05-18 | 2018-08-24 | 灵玖中科软件(北京)有限公司 | A kind of electronic health record data quality management method based on NLP |
CN108595432A (en) * | 2018-04-28 | 2018-09-28 | 江苏医像信息技术有限公司 | Medical document error correction method |
CN108763483A (en) * | 2018-05-25 | 2018-11-06 | 南京大学 | A kind of Text Information Extraction method towards judgement document |
CN109492203A (en) * | 2018-11-21 | 2019-03-19 | 深圳中广核工程设计有限公司 | A kind of nuclear power large-scale synthesis reporting format method of calibration and system |
CN109636564A (en) * | 2018-10-16 | 2019-04-16 | 平安科技(深圳)有限公司 | Information verification mechanism, device, equipment and storage medium for air control |
CN109658071A (en) * | 2018-12-27 | 2019-04-19 | 四川西南交大铁路发展股份有限公司 | Railway sedimentation automation assessment management method |
CN109919585A (en) * | 2019-05-14 | 2019-06-21 | 上海市浦东新区行政服务中心(上海市浦东新区市民中心) | Artificial intelligence auxiliary administrative examination and approval method, system and the terminal of knowledge based map |
-
2019
- 2019-07-29 CN CN201910689439.2A patent/CN110289066A/en active Pending
Patent Citations (12)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20050153269A1 (en) * | 1997-03-27 | 2005-07-14 | Driscoll Gary F. | System and method for computer based creation of tests formatted to facilitate computer based testing |
US20140089301A1 (en) * | 2011-05-23 | 2014-03-27 | Lgc Limited | And relating to the matching of forensic results |
CN104572616A (en) * | 2014-12-23 | 2015-04-29 | 北京锐安科技有限公司 | Method and device for identifying text orientation |
CN106682397A (en) * | 2016-12-09 | 2017-05-17 | 江西中科九峰智慧医疗科技有限公司 | Knowledge-based electronic medical record quality control method |
CN107168946A (en) * | 2017-04-14 | 2017-09-15 | 北京化工大学 | A kind of name entity recognition method of medical text data |
CN108595432A (en) * | 2018-04-28 | 2018-09-28 | 江苏医像信息技术有限公司 | Medical document error correction method |
CN108447534A (en) * | 2018-05-18 | 2018-08-24 | 灵玖中科软件(北京)有限公司 | A kind of electronic health record data quality management method based on NLP |
CN108763483A (en) * | 2018-05-25 | 2018-11-06 | 南京大学 | A kind of Text Information Extraction method towards judgement document |
CN109636564A (en) * | 2018-10-16 | 2019-04-16 | 平安科技(深圳)有限公司 | Information verification mechanism, device, equipment and storage medium for air control |
CN109492203A (en) * | 2018-11-21 | 2019-03-19 | 深圳中广核工程设计有限公司 | A kind of nuclear power large-scale synthesis reporting format method of calibration and system |
CN109658071A (en) * | 2018-12-27 | 2019-04-19 | 四川西南交大铁路发展股份有限公司 | Railway sedimentation automation assessment management method |
CN109919585A (en) * | 2019-05-14 | 2019-06-21 | 上海市浦东新区行政服务中心(上海市浦东新区市民中心) | Artificial intelligence auxiliary administrative examination and approval method, system and the terminal of knowledge based map |
Non-Patent Citations (1)
Title |
---|
APEZIT: "法医技术证据审查软件介绍", 《HTTPS://TIEBA.BAIDU.COM/P/6101794510》 * |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110941612A (en) * | 2019-11-19 | 2020-03-31 | 上海交通大学 | Autonomous data lake construction system and method based on associated data |
CN115062165A (en) * | 2022-08-18 | 2022-09-16 | 合肥综合性国家科学中心人工智能研究院(安徽省人工智能实验室) | Medical image diagnosis method and device based on film reading knowledge graph |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US9418144B2 (en) | Similar document detection and electronic discovery | |
US11308143B2 (en) | Discrepancy curator for documents in a corpus of a cognitive computing system | |
US9189748B2 (en) | Information extraction system, method, and program | |
US10146858B2 (en) | Discrepancy handler for document ingestion into a corpus for a cognitive computing system | |
US20080201131A1 (en) | Method and apparatus for automatically discovering features in free form heterogeneous data | |
US8626737B1 (en) | Method and apparatus for processing electronically stored information for electronic discovery | |
CN109840255A (en) | Reply document creation method, device, equipment and storage medium | |
CN110289066A (en) | A kind of checking method and system of forensic identification report | |
KR102280490B1 (en) | Training data construction method for automatically generating training data for artificial intelligence model for counseling intention classification | |
Jeon et al. | Making a graph database from unstructured text | |
Garrido et al. | GEO-NASS: A semantic tagging experience from geographical data on the media | |
CN108021595B (en) | Method and device for checking knowledge base triples | |
CN112380848A (en) | Text generation method, device, equipment and storage medium | |
CN111881695A (en) | Audit knowledge retrieval method and device | |
CN113779983B (en) | Text data processing method and device, storage medium and electronic device | |
CN105786929A (en) | Information monitoring method and device | |
CN115017256A (en) | Power data processing method and device, electronic equipment and storage medium | |
CN113505117A (en) | Data quality evaluation method, device, equipment and medium based on data indexes | |
CN113722421A (en) | Contract auditing method and system and computer readable storage medium | |
KR20150134645A (en) | Author clearly confirm device and method. | |
CN116976683B (en) | Automatic auditing method, system, storage medium and device for contract clauses | |
TWI703453B (en) | Suggestion creating device, computer readable recording medium with suggestion creating program recorded thereon, and suggestion creating method | |
Kim et al. | Hierarchical image classification in the bioscience literature | |
CN117609509A (en) | Automatic device, method and storage medium for knowledge carding and conflict resolution | |
CN116910054A (en) | Data processing method, device, electronic equipment and computer readable storage medium |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
RJ01 | Rejection of invention patent application after publication |
Application publication date: 20190927 |
|
RJ01 | Rejection of invention patent application after publication |