CN105069123B - A kind of automatic coding and system of Chinese surgical procedure information - Google Patents
A kind of automatic coding and system of Chinese surgical procedure information Download PDFInfo
- Publication number
- CN105069123B CN105069123B CN201510496500.3A CN201510496500A CN105069123B CN 105069123 B CN105069123 B CN 105069123B CN 201510496500 A CN201510496500 A CN 201510496500A CN 105069123 B CN105069123 B CN 105069123B
- Authority
- CN
- China
- Prior art keywords
- ontology
- surgical procedure
- type
- term
- character
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/20—Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
- G06F16/24—Querying
- G06F16/245—Query processing
- G06F16/2452—Query translation
- G06F16/24522—Translation of natural language queries to structured queries
Abstract
Embodiments of the present invention provide a kind of automatic coding and system of Chinese surgical procedure information, and this method includes:Natural language processing is carried out to the Chinese surgical procedure information of input, obtains title to be encoded;It searches the standard terminology to match with title to be encoded or expands term, and by the standard terminology of successful match or expand the coding of term, be determined as the coding of title to be encoded;Standard terminology is surgical procedure title specified in International Classification of Diseases ICD, and the coding of standard terminology is the coding of corresponding surgical procedure title specified in International Classification of Diseases ICD;Expanding term is and word of the standard terminology with synonymy or the word with relation of genus and species;It is consistent to expand term coding corresponding with the standard terminology with synonymy or relation of genus and species.The present invention can automatically, surgical procedure title is fast and accurately identified and it is encoded, whole process has that coding rate is fast, at low cost, high accuracy for examination without manually participating in.
Description
Technical field
Embodiments of the present invention are related to medical information field, more specifically, embodiments of the present invention are related to one kind
The automatic coding and system of Chinese surgical procedure information.
Background technology
Background that this section is intended to provide an explanation of the embodiments of the present invention set forth in the claims or context.Herein
Description recognizes it is the prior art not because not being included in this part.
At present in medicine and hygiene fields, the execution of surgical procedure is generally managed using operation differentiated control system, is being divided
In grade management, the writing of surgical procedure title and coding are particularly significant.Under normal circumstances, surgical procedure name is write by surgical doctor
Claim, then it is encoded by medical record administrator, it is that surgical procedure is suitable to write correct surgical procedure title and accurate coding
The basis that profit carries out is conducive to improve the normalization of surgical procedure, reduces medical-risk.
The Ministry of Public Health of China regulation medical and health industry is unified to perform surgical procedure coding according to ICD-9-CM-3.ICD-9-
CM-3 refers to《International Classification of Diseases 9th edition clinical modification volume 3》, it is one and is used to that medical operating and disease classify to refer to
The professional books led.ICD-9-CM-3 is according to the disease of needle pair, the complexity of operating process and for technology
It is required that operation is classified and encoded.
Invention content
Majority medical and health organization at present, is still the coding work that surgical procedure is accomplished manually by medical record administrator,
There are efficiency it is low, of high cost the shortcomings that.Moreover, because the surgical procedure information that surgical doctor is write in medical record belongs to nature language
Speech, form complexity is various, and ununified standard is (for example, using multilingual mixing expression, use grammer lack of standardization, typing
Have false information, using abbreviation or be commonly called as replacing standard terminology, be mingled with gibberish such as symbol etc. in word), medical record management
Personnel generally require finally determine surgical procedure title with reference to the detailed content of medical record and complete to encode, further reduced
The efficiency of coding also tends to have higher error rate.
For this purpose, the present invention provides a kind of autocoding mechanism of surgical procedure title, automatically, quickly and accurately to know
Do not go out to perform the operation and action name and it be encoded.
In the present context, embodiments of the present invention are intended to provide a kind of autocoding side of Chinese surgical procedure information
Method and system.
In the first aspect of embodiment of the present invention, a kind of autocoding side of Chinese surgical procedure information is provided
Method, including:
Step 1, Chinese surgical procedure information is inputted;
Step 2, natural language processing is carried out to the Chinese surgical procedure information, obtains one or more names to be encoded
Claim;
Step 3, matched based on the standard terminology library and expansion terminology bank, lookup pre-established with the title to be encoded
Standard terminology or expand term, and by the standard terminology of successful match or expand term coding, be determined as described to be encoded
The coding of title;
Wherein, the standard terminology library includes several standard terminologies and its coding, and the standard terminology is international disease point
Surgical procedure title specified in class ICD, the coding of the standard terminology are accordingly performed the operation specified in International Classification of Diseases ICD
The coding of action name;
The expansion terminology bank includes several expansion terms and its coding, and the expansion term is that have with the standard terminology
There are the word of synonymy or the word with relation of genus and species;
It is described to expand term coding corresponding with the standard terminology with synonymy or relation of genus and species unanimously.
In the second aspect of embodiment of the present invention, a kind of autocoding system of Chinese surgical procedure information is provided
System, including:
Import modul, for inputting Chinese surgical procedure information;
Natural language processing module for carrying out natural language processing to the Chinese surgical procedure information, obtains one
Or multiple titles to be encoded;
Endowed module is matched, for waiting to compile with described based on the standard terminology library and expansion terminology bank, lookup that pre-establish
The standard terminology that matches of code title expands term, and by the standard terminology of successful match or expands the coding of term, determines
Coding for the title to be encoded;
Wherein, the standard terminology library includes several standard terminologies and its coding, and the standard terminology is international disease point
Surgical procedure title specified in class ICD, the coding of the standard terminology are accordingly performed the operation specified in International Classification of Diseases ICD
The coding of action name;
The expansion terminology bank includes several expansion terms and its coding, and the expansion term is that have with the standard terminology
There are the word of synonymy or the word with relation of genus and species;
It is described to expand term coding corresponding with the standard terminology with synonymy or relation of genus and species unanimously.
By means of above-mentioned technical proposal, the present invention has fully considered that the Chinese surgical procedure information of surgical doctor input belongs to
The features such as natural language, form complexity are various, without unified standard is established a variety of using advance foundation ICD-9-CM-3
Dictionary matches Chinese surgical procedure information character string, so as to it is automatic, surgical procedure title is fast and accurately identified simultaneously
It is encoded, whole process improves coding rate, reduce coding cost, and ensure that coding is correct without manually participating in
Rate.
Description of the drawings
Detailed description below, above-mentioned and other mesh of exemplary embodiment of the invention are read by reference to attached drawing
, feature and advantage will become prone to understand.In the accompanying drawings, if showing the present invention's by way of example rather than limitation
Dry embodiment, wherein:
Fig. 1 schematically shows the application scenarios that embodiments of the present invention can be implemented within;
Fig. 2 schematically shows the autocoding flows of the Chinese surgical procedure information of illustrative embodiment of the present invention
Figure;
Fig. 3 schematically shows the autocoding flow chart of the Chinese surgical procedure information of the embodiment of the present invention one;
Fig. 4 schematically shows the autocoding flow charts of the Chinese surgical procedure information of the embodiment of the present invention two;
Fig. 5 schematically shows the autocoding flow chart of the Chinese surgical procedure information of the embodiment of the present invention three;
Fig. 6 schematically shows the autocoding flow chart of the Chinese surgical procedure information of the embodiment of the present invention four;
Fig. 7 schematically shows the autocoding flow chart of the Chinese surgical procedure information of the embodiment of the present invention five;
Fig. 8 schematically shows the natural language processing flow chart of the embodiment of the present invention six;
Fig. 9 schematically shows the cutting first kind substring of the embodiment of the present invention six and the sub- character of Second Type
The flow chart of string;
Standard terminology that the lookup that Figure 10 schematically shows the embodiment of the present invention seven matches with title to be encoded or
Expand the flow chart of term;
Figure 11 schematically shows the autocoding system of the Chinese surgical procedure information of illustrative embodiment of the present invention
System block diagram.
In the accompanying drawings, identical or corresponding label represents identical or corresponding part.
Specific embodiment
The principle and spirit of the invention are described below with reference to several illustrative embodiments.It should be appreciated that provide this
A little embodiments are not with any just for the sake of better understood when those skilled in the art and then realize the present invention
Mode limits the scope of the invention.On the contrary, these embodiments are provided so that the disclosure is more thorough and complete, and energy
It is enough that the scope of the present disclosure is completely communicated to those skilled in the art.
Art technology technical staff knows, embodiments of the present invention can be implemented as a kind of system, device, equipment,
Method or computer program product.Therefore, the disclosure can be with specific implementation is as follows, i.e.,:It is complete hardware, complete soft
The form that part (including firmware, resident software, microcode etc.) or hardware and software combine.
According to the embodiment of the present invention, it is proposed that a kind of method and apparatus.
Herein, any number of elements in attached drawing is used to example and unrestricted and any name is only used for
It distinguishes, without any restrictions meaning.
Below with reference to several representative embodiments of the present invention, the principle and spirit of the invention are illustrated in detail.
Application scenarios overview
Referring initially to Fig. 1, it illustrates the application scenarios that embodiments of the present invention can be implemented within.
Scene shown in Fig. 1 includes medical information platform 100 and Chinese surgical procedure information automatic coding system
200.Medical information platform 100 can be loaded into desktop computer used in doctor, laptop, tablet computer, individual
Software in the equipment such as digital assistants.Chinese surgical procedure information automatic coding system 200 can be operate in information for hospital clothes
Software being engaged in device etc..It for example can be between medical information platform 100 and Chinese surgical procedure information automatic coding system 200
It is communicatively coupled by hospital lan etc..
Surgical doctor is inputted in medical information platform 100 after Chinese surgical procedure information, Chinese surgical procedure letter
Breath is transferred to Chinese surgical procedure information automatic coding system 200, right by Chinese surgical procedure information automatic coding system 200
It carries out natural language processing and autocoding, last exports coding result.
Illustrative methods
With reference to the application scenarios of Fig. 1, it is described with reference to Figure 2 and is performed the operation according to the Chinese of exemplary embodiment of the invention
The automatic coding of operation information.It should be noted that above application scene is for only for ease of the spirit for understanding the present invention
It is shown with principle, embodiments of the present invention are unrestricted in this regard.On the contrary, embodiments of the present invention can answer
For applicable any scene.
As shown in Fig. 2, the automatic coding of Chinese surgical procedure information, including:
Step S101 inputs Chinese surgical procedure information.
Step S102 carries out natural language processing to Chinese surgical procedure information, obtains one or more titles to be encoded.
The step can be based on the characteristics of surgical procedure information, carry out the processing such as mechanical Chinese word segmentation to operation operation information, obtain
To title to be encoded.It will introduce how this illustrative methods carries out nature to Chinese surgical procedure information by embodiment six below
A kind of specific embodiment of Language Processing.
Step S103 based on the standard terminology library pre-established and expands terminology bank, and lookup matches with title to be encoded
Standard terminology or expand term, and by the standard terminology of successful match or expand term coding, be determined as title to be encoded
Coding.
In illustrative methods of the present invention, standard terminology library includes several standard terminologies and its coding, and standard terminology is international
Surgical procedure title specified in classification of diseases ICD, the coding of standard terminology is corresponding hand specified in International Classification of Diseases ICD
The coding of art action name.
Expand terminology bank and include several expansion terms and its coding, it is to have synonymy with standard terminology to expand term
Word or the word with relation of genus and species.Wherein, can be standard terminology when expand term has synonymy with standard terminology
Be commonly called as, nickname or abbreviation etc., when expand term has relation of genus and species with standard terminology, can be conceptive or apply upper packet
Include standard terminology (the high rank of surgical procedure type represented relative to standard terminology) or included by standard terminology (relative to
The low rank of surgical procedure type that standard terminology represents).In order to encode needs, according to clinical experience, enable and expand term and tool
There is the corresponding coding of the standard terminology of synonymy or relation of genus and species unanimously.
In this illustrative methods, acceptable revised standard terminology bank in real time expands terminology bank, for example, increasing new expansion
Term deletes existing expansion term, so that standard terminology library, expansion terminology bank more meet the need of ICD-9-CM-3 codings
It will.
In this illustrative methods, standard terminology library and expansion terminology bank form an ontology dictionary, standard terminology and expansion art
Language is the ontology in the ontology dictionary, is as shown in table 1 the part of standards term that ontology dictionary includes and expansion term and its volume
Code.
Table 1
It will introduce how this illustrative methods searches the standard art to match with title to be encoded by embodiment seven below
Language or a kind of specific embodiment for expanding term.
Embodiment one
As shown in figure 3, for a kind of automatic coding of specific Chinese surgical procedure information, including:
Step S201 inputs Chinese surgical procedure information.
Step S202 carries out natural language processing to Chinese surgical procedure information, obtains one or more titles to be encoded.
Step S203 based on the standard terminology library pre-established, expands terminology bank and Hypothetical classification terminology bank, searches and treat
Standard terminology that encoding name matches expands term or Hypothetical classification term, and by the standard terminology of successful match, expand art
The coding of language or Hypothetical classification term is determined as the coding of title to be encoded.
The present embodiment is that Hypothetical classification terminology bank is increased in illustrative methods, which includes several
Hypothetical classification term and its coding.
Hypothetical classification term represents ad hoc type treatment means, and ad hoc type treatment means correspond to a variety of resection operation types,
This variety of resection operation type is standard terminology.
Hypothetical classification term has one-to-one standard terminology, it is assumed that term of classifying is encoded to its corresponding standard art
The coding of language.According to the regulation of ICD-9-CM-3, if the disease that the ad hoc type treatment means are directed to is the non-malignant of site of pathological change
Tumour, then the corresponding standard terminology of Hypothetical classification term is the disease damage resection of site of pathological change;If the ad hoc type treatment means needle
To disease for site of pathological change malignant tumour and do not need to do organ transplant, then the corresponding standard terminology of Hypothetical classification term is
The total resection of site of pathological change;If the disease that the ad hoc type treatment means are directed to is the malignant tumour of site of pathological change and does not do organ
Transplanting is not suitable for cutting entirely, then the corresponding standard terminology of Hypothetical classification term is the ablation of site of pathological change.
In the present embodiment, Hypothetical classification terminology bank can also be revised in real time, for example, increase new Hypothetical classification term or
Existing Hypothetical classification term is deleted, so that Hypothetical classification terminology bank more meets the needs of ICD-9-CM-3 codings.
For example, table 2 show part Hypothetical classification term and its corresponding standard terminology that Hypothetical classification terminology bank includes
And coding.
Table 2
Hypothetical classification term (ad hoc type treatment means) | Standard terminology | Coding |
Liver Cancer under Radical Operation | Partial hepatectomy | 50.22011 |
Hepatic cyst resection | Hepatopathy damages resection | 50.29009 |
Diverticulectomy of stomach | Excision of lesion of stomach | 43.42004 |
As shown in table 2, the disease that ad hoc type treatment means " Liver Cancer under Radical Operation " are directed to is the malignant tumour and discomfort of liver
Organ transplant is done in conjunction, then corresponding standard terminology is " partial hepatectomy ".
Again as shown in table 2, the disease that ad hoc type treatment means " hepatic cyst resection " are directed to is the non-malignant tumors of liver,
Then corresponding standard terminology is " hepatopathy damage resection ".
Again as shown in table 2, the disease that ad hoc type treatment means " resection of gastric carcinoma " are directed to is the malignant tumour of stomach and needs
Organ transplant is done, then corresponding standard terminology is " Radical Gastrectomy ".
Embodiment two
As shown in figure 4, for a kind of automatic coding of specific Chinese surgical procedure information, including:
Step S301 inputs Chinese surgical procedure information.
Step S302 carries out natural language processing to Chinese surgical procedure information, obtains one or more titles to be encoded.
Step S303 based on the standard terminology library pre-established, expands terminology bank and odd encoder terminology bank, searches and wait to compile
Standard terminology that code title matches expands term or odd encoder term, and by the standard terminology of successful match, expand term or
The coding of odd encoder term is determined as the coding of title to be encoded.
The present embodiment is that odd encoder terminology bank is increased in illustrative methods, which includes several more volumes
Code term and its coding.
Odd encoder term is ad hoc type surgical procedure type;The premise that ad hoc type surgical procedure type performs is another hand
Art action type;Ad hoc type surgical procedure type and another surgical procedure type for standard terminology or expand term;
The coding of the coding for being encoded to ad hoc type surgical procedure type of odd encoder term and another surgical procedure type
Combination.
In actual clinical, if doctor has write a surgical procedure title, the premise which performs is another
A surgical procedure, then the surgical procedure title just belongs to odd encoder term.
In the present embodiment, odd encoder terminology bank can also be revised in real time, for example, increasing new odd encoder term or deleting
Existing odd encoder term, so that odd encoder terminology bank more meets the needs of ICD-9-CM-3 codings.
For example, table 3 show the part odd encoder term that odd encoder terminology bank includes and its corresponding standard terminology and volume
Code.
Table 3
Embodiment three
As shown in figure 5, for a kind of automatic coding of specific Chinese surgical procedure information, including:
Step S401 inputs Chinese surgical procedure information.
Step S402 carries out natural language processing to Chinese surgical procedure information, obtains one or more titles to be encoded.
Step S403 based on the merging terminology bank pre-established, treats encoding name and is pre-processed.
Merge terminology bank and include several merging terms and its coding;Wherein, merge term to provide for International Classification of Diseases ICD
Can substitute at least two and meanwhile occur other standards term single standard terminology;At least two simultaneously occur other
Standard terminology is the combining objects of the merging term;Merge terminology bank and further include each whole combining objects for merging term.
Wherein, merge term and be different from its any one corresponding combining objects.
Clinically doctor may write multiple surgical procedure titles in a medical record, according to the rule of ICD-9-CM-3
Fixed, these surgical procedure titles can be classified as a surgical procedure title, i.e., practical above multiple surgical procedure titles are one
Multiple steps of surgical procedure title.
In the present embodiment, merging terminology bank can also be revised in real time, had for example, increasing new merging term or deleting
Merging term or modification combining objects so that merge terminology bank more meet ICD-9-CM-3 coding needs.
Table 4 is to merge one that terminology bank includes to merge term and its coding and whole combining objects.
Table 4
Step S403 is specially:Judge in one or more titles to be encoded, if include any one or more merging
Whole combining objects of term, if comprising any one or more whole combining objects for merging term are substituted for correspondence
Merging term.
Step S404 based on the standard terminology library pre-established and expands terminology bank, and lookup matches with title to be encoded
Standard terminology or expand term, and by the standard terminology of successful match or expand term coding, be determined as title to be encoded
Coding.
Example IV
As shown in fig. 6, for a kind of automatic coding of specific Chinese surgical procedure information, including:
Step S501 inputs Chinese surgical procedure information.
Step S502 carries out natural language processing to Chinese surgical procedure information, obtains one or more titles to be encoded.
Step S503 based on the omission terminology bank pre-established, treats encoding name and is pre-processed.
It omits terminology bank and includes several omission terms and its coding;Wherein, omitting term can as defined in ICD-9-CM-3
To substitute the single standard terminology of at least two standard terminologies occurred simultaneously;It is at least two while the mark occurred to omit term
One in quasi- term;At least two standard terminologies occurred simultaneously are the omission object of the omission term;Omit terminology bank also
Object is omitted altogether including each omission term;
In medical record when certain surgical procedure titles occur simultaneously, surgical procedure be other surgical procedures leading hand
Art, according to the regulation of ICD-9-CM-3, some of corresponding surgical procedure titles are without coding.
In the present embodiment, omission terminology bank can also be revised in real time, had for example, increasing new omission term or deleting
Omission term or modification omit object, so as to omit the needs that term Kuku more meets ICD-9-CM-3 codings.
Table 5 is to omit one that terminology bank includes to omit term and its coding and be omitted altogether object.
Table 5
Step S503 is specially:Judge in one or more titles to be encoded, if include any one or more omissions
Term is omitted altogether object, if comprising any one or more objects that are omitted altogether for omitting term are substituted for correspondence
Omission term.
Step S504 based on the standard terminology library pre-established and expands terminology bank, and lookup matches with title to be encoded
Standard terminology or expand term, and by the standard terminology of successful match or expand term coding, be determined as title to be encoded
Coding.
Embodiment five
As shown in fig. 7, for a kind of automatic coding of specific Chinese surgical procedure information, including:
Step S601 inputs Chinese surgical procedure information.
Step S602 carries out natural language processing to Chinese surgical procedure information, obtains one or more titles to be encoded.
Step S603 based on the standard terminology library pre-established and expands terminology bank, and lookup matches with title to be encoded
Standard terminology or expand term, and by the standard terminology of successful match or expand term coding, be determined as title to be encoded
Coding.
This, which is completed, searches the standard terminology to match with title to be encoded or expands term this process, should during have
The standard terminology to match less than title to be encoded or expansion term may be searched.This is because the ontology in ontology dictionary is (no
By being standard terminology or expand term) it is the relevant word of surgical procedure title, however practical Chinese surgical procedure letter
Often be related to a variety of concepts of medical field, not only surgical procedure title in breath, it is also possible to be related to disease name (such as
" fracture of sternum Flail chest "), nomenclature of drug (such as " cetirizine "), medical treatment consumptive materials title (such as " pseudoxanthoma elasticum gum ") etc.,
But the present invention is only to the coding for action name of performing the operation, therefore, if there is disease name, drug in Chinese surgical procedure information
Title, medical treatment consumptive materials title etc., the present invention can select not to encode it.In addition, practical Chinese surgical procedure information is also
Although it is to represent surgical procedure information that may include some, it not can determine which kind of surgical procedure title corresponded specifically to
Word, such as some do not meet ICD-9-CM-3 taxonomic hierarchies, it is impossible to determine its specific corresponding surgical procedure title.Such as
" attrition ", although representing surgical procedure title, its concept is too general, can not determine specifically art is worn down at what position, be face
Portion's attrition, cheekbone attrition or Laser final guidance shell;For another example, " denaturation art " is specific although representing surgical procedure title
It is that man becomes female urethra displacement plasty or man becomes vagina reconstruction and can not also determine.
In view of problem above, illustrative methods of the present invention also preset one without encryption description library, this is without encryption description
Library includes several no encryption descriptions.These include without encryption description:It is preset to be used to represent surgical procedure information but determine
The word of surgical procedure title;Preset disease name;Preset nomenclature of drug;And preset medical treatment consumptive materials title.
For example, table 6 show part that no encryption description dictionary includes without encryption description.
Table 6
Step S604, will do not determine coding title to be encoded in no encryption description library without encryption description progress
Match, if successful match, perform preset processing step to represent not determining this title to be encoded of coding encodes,
If it fails to match, this not determined to, the title to be encoded of coding is sent to artificial treatment platform and carries out artificial treatment.
Wherein, for not finding the standard terminology to match or the title to be encoded for expanding term, if phase can be found
Matched no encryption description then illustrates that it belongs to expression surgical procedure information but can not determine the word of surgical procedure title, disease
Name of disease claims, one kind in nomenclature of drug, medical treatment consumptive materials title, not encodes, and for cannot find match without coding
The title to be encoded of term, illustrates that it is not belonging to the above-mentioned type, and for this kind of title to be encoded, the present embodiment sends it to people
Work processing platform, by manually continuing with, concrete processing procedure, the present invention is not construed as limiting it.
Embodiment six
As shown in figure 8, nature language is carried out to Chinese surgical procedure information to be suitable for a kind of of illustrative methods of the present invention
Speech is handled to obtain the specific embodiment of title to be encoded, including:
Step S71 pre-processes Chinese surgical procedure information character string, obtains pretreated Chinese surgical procedure
Information character string.
The purpose of the step is that the character in Chinese surgical procedure information character string is converted into unified coded format, with
Just subsequent processing.
Optionally, which can implement according to following concrete mode:To non-in Chinese surgical procedure information character string
Chinese character is into row format normalized (for example, the symbol in Chinese surgical procedure information character string is all converted to half-angle lattice
Formula is all converted to full-shape form, and English alphabet therein is all converted to uppercase format or lower case format);And in deleting
Non-medical term in literary surgical procedure information character string.The non-medical term dictionary that wherein non-medical term is pre-established by one
There is provided, and non-medical term be the word of remarks effect, phrase or descriptive sentence (such as " opening inspection ", " benefit emergency treatment is remembered
Account ", " bed expense is exceeded at one's own expense ", " being added more than one month, monthly received less than one month ", " paediatrics is added " etc.).
Based on the ontology dictionary, orientation dictionary, grade dictionary pre-established, pretreated Chinese is performed the operation by step S72
Operation information character string is cut into several first kind substrings and/or Second Type substring.
Wherein, first kind substring can directly be matched with the ontology in ontology dictionary, Second Type substring
It can not directly be matched with the ontology in ontology dictionary.First kind substring and Second Type the substring tool being syncopated as
There is independent semanteme, i.e., represented surgical procedure project information is not influenced by the character before or after it.
Ontology dictionary include aforesaid standards terminology bank and expand terminology bank, as shown in table 1, specifically include several ontologies and
Ontology encodes correspondingly, standard terminology or expands term and is considered ontology in ontology dictionary.
It should be noted that before use has been arrived in the automatic coding of Chinese surgical procedure information provided by the invention
When the Hypothetical classification terminology bank and/or odd encoder terminology bank stated, ontology dictionary should also be as consisting of assuming that classification terminology bank and/or more
Encryption description library is (at this point, Hypothetical classification term and/or odd encoder term, omission term are also considered the sheet in ontology dictionary
Body) so that can be with vacation when the first kind substring or Second Type substring that are syncopated as are as title to be encoded
Surely classification term or odd encoder term or omission term match.
Orientation dictionary includes several directional terminologies, and directional terminology is for describing the targeted orientation of surgical procedure project
Word.For example, directional terminology can be:Unilateral side, bilateral, left side, right side, both sides, side etc..
Grade dictionary includes several grade terms, and grade term is for describing the rank of surgical procedure project, type
Word.For example, grade term can be:A grades, B grades, C grades, superfine etc..
The purpose of step S72 is that Chinese surgical procedure information is cut into the independent semantic substring (first kind
Type substring or Second Type substring), effectively to avoid multiple characters with incidence relation being identified respectively
The problem of so as to cause identification mistake.
After the first kind substring being syncopated as and Second Type substring are determined as title to be encoded, rear
It is continuous when treating encoding name using the merging terminology bank in embodiment three or the omission terminology bank in example IV and being pre-processed,
Since first kind substring and the corresponding ontology of Second Type substring may be expansion term, and merge in terminology bank
Combining objects and the omission object that omits in terminology bank be standard terminology, therefore, need to by first kind substring and
Expansion term corresponding to Second Type substring is converted to corresponding standard terminology, then recycles and merges terminology bank or province
Slightly terminology bank is pre-processed.
As shown in figure 9, step S72 is specifically included:
Whether step S80 judges pretreated Chinese surgical procedure information character string comprising symbol;If include symbol
Number, then perform step S81;If step S82 is not performed comprising symbol.
Step S81, by the character between every adjacent two symbols in pretreated Chinese surgical procedure information character string
It is matched as a whole with the ontology in ontology dictionary;If successful match, step S811 is performed;If it fails to match, hold
Row step S812.
Step S811, using the character cutting between the adjacent two symbols out as first kind substring.
Step S812, the adjacent two symbols and its between character be determined as wouldn't cutting character string, then perform step
Rapid S83.
Step S81, step S811, step S812 foundations processing rule be:Alphabet between adjacent-symbol is made
It is matched for entirety with ontology, ability cutting when only matching, otherwise temporarily not cutting.
Such as the cutting shown in table 7 to " cardiac output monitors, and consumes technology, ventricular puncture, implanted conduit with oxygen ",
In, " cardiac output monitors, and consumes technology with oxygen ", " ventricular puncture, implanted conduit " they are the alphabet between symbol, and
And the ontology to match can be found, therefore, it is split out respectively.
Table 7
Step S82, will be in pretreated Chinese surgical procedure information character string and ontology dictionary using mechanical Chinese word segmentation method
Ontology matched;If all characters in pretreated Chinese surgical procedure information character string can be with ontology
Match, then perform step S821;If there is the list failed with Ontology Matching in pretreated Chinese surgical procedure information character string
A character or multiple continuous characters, then perform step S822.
Step S821 cuts the character in pretreated Chinese surgical procedure information character string according to the matched ontology of institute
It branches away as first kind substring.
Step S822, judgement fail with the single character of Ontology Matching or multiple continuous characters whether be directional terminology or
Grade term;If directional terminology or grade term, then perform step S8221;If not directional terminology or grade term,
Then perform step S8222.
Step S82, step S821, step S822 foundations processing rule be:It will be pretreated using mechanical Chinese word segmentation method
Character in Chinese surgical procedure information character string is matched with ontology, and only alphabet can find the sheet to match
Ability cutting during body, otherwise temporarily not cutting.
Such as table 8 show the cutting to " 24 hours monitoring of blood pressure of electroencephalogram ", can be searched respectively using mechanical Chinese word segmentation method
The ontology to match to " electroencephalogram " and " 24 hours monitoring of blood pressure ", therefore, is split out respectively.
Table 8
The mechanical Chinese word segmentation method that step S82 is used can be Forward Maximum Method type, reverse maximum matching type or minimum cutting
Type.Specific dicing process, the present embodiment repeat no more.
Step S8221, according to fail with the single character of Ontology Matching or multiple continuous characters after the pre-treatment in
Position in literary surgical procedure information character string, will fail with the single character of Ontology Matching or multiple continuous characters and its it
It is preceding or can merge with the single character of Ontology Matching or multiple continuous characters cut out as the sub- word of Second Type later
Symbol string, and can be with the single character of Ontology Matching or multiple continuous character cuttings out as first kind using remaining
Character string.
Step S8222 integrally cuts out pretreated Chinese surgical procedure information character string as Second Type
Substring.
Step S8221, the processing rule of step S8222 foundations is:Failing to the single character of Ontology Matching or more
A continuous character is directional terminology or grade term, then performs cutting, and during cutting be by its with before or after it
Character merging is cut out.
Such as table 9 show the cutting to " lung volume reduction surgery right lung neoplasty ", can be looked into respectively using mechanical Chinese word segmentation method
Find " lung volume reduction surgery ", the ontology that " lung neoplasty " matches, " right side " therein is directional terminology, therefore, by " right side "
Merge with " lung neoplasty " and cut out, " lung volume reduction surgery " is individually cut out.
Table 9
Whether step S83, judgement wouldn't include preset additional character in cutting character string;It if wouldn't cutting character string
In comprising additional character, then perform step S831;If additional character wouldn't not be included in cutting character string, step is performed
S833。
Step S831, search wouldn't be belonging to cutting character string character model, and the character model according to belonging to this corresponds to
Segmentation rules to wouldn't cutting character string carry out cutting;Wherein, the character model library that character model is pre-established by one provides,
And character model has one-to-one segmentation rules.
Step S832 matches the character cut out with the ontology in ontology dictionary, should if successful match
The character cut out is determined as first kind substring, if it fails to match, the character that this cuts out is determined as
Two type substrings;
Step S833, wouldn't cutting character string be determined directly as Second Type substring.
Step S83, step S831, step S832, step S833 foundations processing rule be:When wouldn't be in cutting character string
During comprising preset additional character, cutting is carried out according to character model that wouldn't be belonging to cutting character string, is otherwise directly syncopated as
Come;And match the character being syncopated as based on character model with ontology again, it wherein can will directly be matched with ontology
Conduct first kind substring, it is impossible to directly it is matched be used as Second Type substring.
Such as preset additional character can include but is not limited to fullstop, colon, plus sige, branch, slash line etc..
Such as following partial character model and its segmentation rules in character model library:
(1) character model:XAY types, A is plus sige, colon;
Segmentation rules:XAY is cut out as a whole;
(2) character model:CDE types, and one of C, E are Chinese character, D is fullstop, branch;
Segmentation rules:Chinese character segmentation in C, E is come out;
(3) character model:STU types, and S and/or U is individual Chinese character, T is slash line;
Segmentation rules:STU is cut out as a whole.
Such as to " blood fat (P).Renal function detects (P) " cutting is carried out, it understands to belong to CDE through searching character model library
Type then individually cuts out " blood fat (P) ", " renal function detects (P) ".
Such as cutting is carried out to " thoracoscope lower lung neoplasty+pulmonary belb resection ", it understands to belong to through searching character model library
In XAY types, then " thoracoscope lower lung neoplasty+pulmonary belb resection " is integrally cut out.
Such as cutting is carried out to " 3/4 laryngectomy and laryngeal reconstruction ", it understands to belong to STU through searching character model library
Type then integrally cuts out " 3/4 laryngectomy and laryngeal reconstruction ".
The first kind substring being syncopated as and Second Type substring are determined as title to be encoded by step S73.
The present embodiment has fully considered operation doctor during natural language processing is carried out to Chinese surgical procedure information
The Chinese surgical procedure information of teacher's input belongs to the features such as natural language, form complexity are various, without unified standard, using pre-
First a variety of dictionaries for being established carry out cutting and matching to Chinese surgical procedure information character string, with this by surgical procedure entry name
Title is identified as title to be encoded.
Embodiment seven
As shown in Figure 10, the standard to match to be suitable for a kind of lookup of illustrative methods of the present invention with title to be encoded
Term or the specific embodiment for expanding term, including:
Step S90, if entitled first kind substring to be encoded, which is matched
Ontology is determined as the standard terminology to match with the title to be encoded or expands term, if entitled Second Type to be encoded
Character string then carries out each ontology in Second Type substring and ontology dictionary the parsing of the first dimension, obtains second
Several first dimension analysis results of several first dimension analysis results of type substring and each ontology;
The step optionally, carries out analysis object using Second Type substring and ontology as analysis object
The parsing of first dimension can include but is not limited to:
(1) directional terminology included in analysis object is determined, if not including directional terminology, this analysis result wherein
For sky;
(2) the grade term included in analysis object is determined, if not including grade term, this analysis result wherein
For sky;
(3) character in analysis object bracket is determined, if not including bracket wherein, this analysis result is sky;
(4) character after dash in analysis object is determined, if not including dash wherein, this analysis result is
It is empty;And
(5) it determines in analysis object in addition to the character in directional terminology, grade term, bracket, the character after dash
Character (the remaining character hereinafter referred to as in ontology), the generally core stem of analysis object.
When analysis object is Second Type substring, each first dimension analysis result can include but unlimited
In:The grade term in directional terminology, Second Type substring, Second Type substring in Second Type substring
Character in character, Second Type substring in bracket after dash, the remaining character in Second Type substring.
When analysis object is ontology, each first dimension analysis result can include but is not limited to:Side in ontology
Position term, the grade term in ontology, the character in ontology bracket, the character after dash, the residue in ontology in ontology
Character.
Step S91, by ontology each in each first dimension analysis result of Second Type substring and ontology dictionary
The analysis result of each first dimension is matched, and is searched whether there are each first dimension analysis result of some ontology with second
Each first dimension analysis result of type substring matches;If there is such ontology, then step S92 is performed, if
There is no such ontologies, then perform step S93.
The ontology found is determined as the ontology that Second Type substring matches by step S92.
Step S93 chooses part the first dimension solution in all the first dimension analysis results of Second Type substring
Result is analysed to carry out with part the first dimension analysis result in all the first dimension analysis results of ontology each in ontology dictionary
Matching, and search whether the part there are this of some ontology part the first dimension analysis result and Second Type substring
First dimension analysis result matches;If there is such ontology, then step S931 is performed;If there is no such sheet
Body then performs step S932.
The ontology found is determined as the ontology that Second Type substring matches by step S931.
The directional terminology included in Second Type substring is matched with the directional terminology included in ontology respectively,
The grade term included in Second Type substring is matched with the grade term included in ontology, by Second Type
Character in character string bracket is matched with the character in ontology bracket, after dash in Second Type substring
Character matched with the character after dash in ontology bracket, by the remaining character in Second Type substring with this
Remaining character in body is matched.
If the first whole dimension analysis results match, which is determined as Second Type substring phase
The ontology matched.
If certain first dimension analysis results mismatch, the first dimension of selected part analysis result carries out respectively
Match.
It is often the core information of Second Type substring in view of the remaining character in Second Type substring, because
This, in specific implementation, preferably, selected part the first dimension analysis result is included at least in Second Type substring
Remaining character and ontology in remaining character.For example, only choose the character after the remaining character and dash of analysis object
It is matched respectively, alternatively, the remaining character for only choosing analysis object is matched, alternatively, analysis object can also be chosen
Remaining character is matched respectively with the character in directional terminology or grade term or bracket or directional terminology or grade term.
Such as a certain Second Type substring is " left mastostomy (big) ", and the solution of the first dimension is carried out to it
Analysis, obtained analysis result is as shown in table 10, as shown in table 11 for the ontology that matches with the Second Type substring and its
Each first dimension analysis result.
Table 10
The first dimension parsing knot of the ontology " mastostomy " to match with " left mastostomy (big) "
Fruit is as shown in table 11:
Table 11
Step S932 carries out each ontology in Second Type substring and ontology dictionary the parsing of the second dimension,
Obtain each second dimension solution of each ontology in each second dimension analysis result of Second Type substring and ontology dictionary
Analyse result.
The step optionally, carries out analysis object using Second Type substring and ontology as analysis object
The parsing of default dimension can include but is not limited to:
(1) each Chinese character in analysis object is determined;
(2) initial consonant of each Chinese character in analysis object is determined;
(3) simple or compound vowel of a Chinese syllable of each Chinese character in analysis object is determined;
(4) initial character of analysis object is determined;
(5) phonetic of the initial character of analysis object is determined;And
(6) non-chinese character in analysis object is determined, if not including non-chinese character, this analysis result wherein
For sky.
When analysis object is Second Type substring, the analysis result of each dimension can include but is not limited to:
The sub- character of initial consonant, Second Type of each Chinese character in each Chinese character, Second Type substring in Second Type substring
Each simple or compound vowel of a Chinese syllable of Chinese character in string, the initial character of Second Type substring, Second Type substring initial character phonetic,
Non-chinese character in two type substrings.
When analysis object is entry, analysis result can include but is not limited to:It is every in each Chinese character, entry in entry
Each simple or compound vowel of a Chinese syllable of Chinese character in the initial consonant of a Chinese character, entry, the initial character of entry, the phonetic of initial character of entry, entry the non-Chinese
Word character.
For example, table 12 is each second dimension analysis result of Second Type substring " deciduous teeth arrachement ".
Table 12
Step S933, several of several second dimension analysis results and ontology based on Second Type substring
Two-dimensions analysis result calculates the matching degree of Second Type substring and each ontology.
Specifically, which can calculate the similarity of Second Type substring and each ontology, can also calculate
Total confidence level of two type substrings and each ontology.Wherein, compared to similarity, total confidence level can more embody Second Type
The matching degree of substring and each ontology, but the calculating process of total confidence level compared to similarity calculating process also more
It is complicated.When step S933 is embodied, if desired faster processing speed, then can select to calculate the process of similarity, if
More accurately matching result is needed, then can select to calculate the process of total confidence level.
A kind of embodiment of step S933 is to calculate the similarity of Second Type substring and each ontology, specifically such as
Under:
The similarity of Second Type substring and each ontology is calculated according to equation below, and similar by what is be calculated
Degree is determined as the matching degree of Second Type substring and each ontology:
Wherein, M represents similarity;
T represents each second dimension analysis result of Second Type substring;
Q represents Second Type substring;
T in q represent each second dimension of Second Type substring;
D represents ontology;
Tf (t in d) represent in the second identical dimension, the second dimension analysis result of Second Type substring with
The frequency that second dimension analysis result of ontology matches;
Wherein, T represents the sum of ontology in ontology dictionary, and T (t) represents each second dimension parsing
As a result the sum of ontology to match with each second dimension analysis result of Second Type substring;
T.getBoost () represents the preset weights of each second dimension;
Norm (t, d) represents the length normalization method factor of ontology.
A kind of embodiment of step S933 is to calculate total confidence level of Second Type substring and each ontology, specifically
It is as follows:
Total confidence level of Second Type substring and each ontology is calculated as follows, and total by what is be calculated
Confidence level is determined as the matching degree of Second Type substring and each ontology:
1) each Chinese character in Second Type substring is determined.
2) the cosine confidence level of the matched each ontology of Second Type substring is calculated according to equation below:
Wherein, N represents cosine confidence level;
V represents the Chinese character sum that Second Type substring and its ontology to match are included;
Q represents Second Type substring;
D' represents the ontology to match with Second Type substring;
wQ,jRepresent the frequency that each Chinese character occurs in Second Type substring;
wd',jRepresent the frequency occurred in the ontology that each Chinese character matches in Second Type substring;
J represents the serial number of Chinese character that Second Type substring and its ontology to match are included.
3) total confidence level of the matched each ontology of Second Type substring is calculated according to equation below:
S=M × a+N × b
Wherein, S represents total confidence level;
M represents similarity;
A represents the corresponding preset weights of similarity M;
B represents the corresponding preset weights of cosine confidence level N;
Also, similarity M is calculated according to equation below:
Wherein, t represents each second dimension analysis result of Second Type substring;
Q represents Second Type substring;
T in q represent each second dimension of Second Type substring;
D represents ontology;
Tf (t in d) represent in the second identical dimension, the second dimension analysis result of Second Type substring with
The frequency that second dimension analysis result of ontology matches;
Wherein, T represents the sum of ontology in ontology dictionary, and T (t) represents each second dimension parsing
As a result the sum of ontology to match with each second dimension analysis result of Second Type substring;
T.getBoost () represents the preset weights of each second dimension;
Norm (t, d) represents the length normalization method factor of ontology.
Step S934 according to the matching degree of Second Type substring and each ontology, determines one or more ontology
The ontology to match as Second Type substring.
Optionally, which can have following specific embodiment:According to the matching journey with Second Type substring
The size of degree sorts to whole ontologies, and the ontology of the forward preset quantity that wherein sorts (such as forward 2 that sort) is true
It is set to the ontology that Second Type substring matches;Alternatively, by reaching default with the matching degree of Second Type substring
One or more ontologies of threshold value are determined as the ontology that Second Type substring matches.
During the specific implementation present invention, for the matching journey for the ontology that clear and definite Second Type substring matches with each
It spends and it is used, can also can also match in the result of final output including Second Type substring with it
Each ontology matching degree.For example, the matching degree of output Second Type substring and each ontology to match, so
It can therefrom select one again by manual type according to the size of matching degree and be used as Second Type substring and match afterwards
Ontology.
Step S94 reaches default by ontology that Second Type substring matches or with Second Type substring
One or more ontologies with condition are determined as standard terminology or expansion term that title to be encoded matches.
The present embodiment has fully considered operation doctor during natural language processing is carried out to Chinese surgical procedure information
The Chinese surgical procedure information of teacher's input belongs to the features such as natural language, form complexity are various, without unified standard, using pre-
The a variety of dictionaries first established carry out cutting and matching to Chinese surgical procedure information character string, and title phase to be encoded is searched with this
Matched standard terminology expands term.
Exemplary system
After the method for exemplary embodiment of the invention is described, next, exemplary to the present invention with reference to figure 11
The automatic coding system of the Chinese surgical procedure information of embodiment is introduced.
The implementation of the automatic coding system of Chinese surgical procedure information may refer to the implementation of the above method, repeat part not
It repeats again.Term " module " used below can be the combination of the software and/or hardware of realizing predetermined function.Although with
The lower described system of embodiment is preferably realized with software, but the realization of the combination of hardware or software and hardware
It may and be contemplated.
As shown in figure 11, the automatic coding system of Chinese surgical procedure information can include:Import modul 111, natural language
Say processing module 112, the endowed module 113 of matching.
Import modul 111, for inputting Chinese surgical procedure information.
Natural language processing module 112, for Chinese surgical procedure information carry out natural language processing, obtain one or
Multiple titles to be encoded.
Match endowed module 113, for based on the standard terminology library that pre-establishes and expanding terminology bank, search with it is to be encoded
Standard terminology that title matches expands term, and by the standard terminology of successful match or expands the coding of term, is determined as
The coding of title to be encoded.
Optionally, as shown in figure 11, the automatic coding system of the Chinese surgical procedure information can also include:Merging treatment
Module 114 omits processing module 115.
Wherein, merging treatment module 114 is for judging in one or more titles to be encoded, if comprising any one or
Multiple whole combining objects for merging term, if comprising any one or more whole combining objects for merging term are replaced
Change corresponding merging term into.
The step of processing module 115 is used to pre-process one or more titles to be encoded is omitted, including:Judge one
In a or multiple titles to be encoded, if object is omitted altogether comprising any one or more omission terms, if comprising will
Any one or more objects that are omitted altogether for omitting term are substituted for corresponding omission term.
In this exemplary system, wherein, it is standard terminology library, the expansion terminology bank, the Hypothetical classification terminology bank, described
Odd encoder terminology bank, the specifying information for merging terminology bank are with reference to the automatic coding of above-mentioned Chinese surgical procedure information
It introduces, overlaps will not be repeated.
Particular embodiments described above has carried out the purpose of the present invention, technical solution and advantageous effect further in detail
Describe in detail it is bright, it should be understood that the above is only a specific embodiment of the present invention, the guarantor being not intended to limit the present invention
Range is protected, all within the spirits and principles of the present invention, any modification, equivalent substitution, improvement and etc. done should be included in this
Within the protection domain of invention.
Those skilled in the art will also be appreciated that the various illustrative components, blocks that the embodiment of the present invention is listed
(illustrative logical block), unit and step can pass through the knot of electronic hardware, computer software, or both
Conjunction is realized.To clearly show that the replaceability (interchangeability) of hardware and software, above-mentioned various explanations
Property component (illustrative components), unit and step universally describe their function.Such work(
Can be that specific application and the design requirement of whole system are depended on to realize by hardware or software.Those skilled in the art
Can be for each specific function of applying, the realization of various methods can be used described, but this realization is understood not to
Beyond the range of protection of the embodiment of the present invention.
Various illustrative logical blocks or unit or device described in the embodiment of the present invention can be by general
Processor, digital signal processor, application-specific integrated circuit (ASIC), field programmable gate array or other programmable logic dress
It puts, discrete gate or transistor logic, described work(is realized or operated in the design of discrete hardware components or any of the above described combination
Energy.General processor can be microprocessor, and optionally, which may be any traditional processor, control
Device, microcontroller or state machine.Processor can also be realized by the combination of computing device, for example, digital signal processor and
Microprocessor, multi-microprocessor, one or more microprocessors combine a digital signal processor core or any other class
As configuration realize.
The step of method or algorithm described in the embodiment of the present invention can be directly embedded into hardware, processor perform it is soft
The combination of part module or the two.Software module can be stored in RAM memory, flash memory, ROM memory, EPROM storages
Other any form of storaging mediums in device, eeprom memory, register, hard disk, moveable magnetic disc, CD-ROM or this field
In.Illustratively, storaging medium can be connect with processor, so that processor can read information from storaging medium, and
It can be to storaging medium stored and written information.Optionally, storaging medium can also be integrated into processor.Processor and storaging medium can
To be set in ASIC, ASIC can be set in user terminal.Optionally, processor and storaging medium can also be set to use
In different components in the terminal of family.
In one or more illustrative designs, the described above-mentioned function of the embodiment of the present invention can be in hardware, soft
Part, firmware or the arbitrary of this three combine to realize.If realized in software, these functions can store and computer-readable
It is transmitted on the medium of computer-readable on medium or with one or more instruction or code form.Computer readable medium includes electricity
Brain storaging medium and convenient for allow computer program to be transferred to from a place telecommunication media in other places.Storaging medium can be with
It is that any general or special computer can be with the useable medium of access.For example, such computer readable media can include but
It is not limited to RAM, ROM, EEPROM, CD-ROM or other optical disc storage, disk storage or other magnetic storage devices or other
What can be used for carrying or store with instruct or data structure and it is other can be by general or special computer or general or specially treated
The medium of the program code of device reading form.In addition, any connection can be properly termed computer readable medium, example
Such as, if software is to pass through a coaxial cable, fiber optic cables, double from a web-site, server or other remote resources
Twisted wire, Digital Subscriber Line (DSL) are defined with being also contained in for the wireless way for transmitting such as example infrared, wireless and microwave
In computer readable medium.The disk (disk) and disk (disc) includes compress disk, radium-shine disk, CD, DVD, floppy disk
And Blu-ray Disc, disk is usually with magnetic duplication data, and disk usually carries out optical reproduction data with laser.Combinations of the above
It can also be included in computer readable medium.
Claims (17)
1. a kind of automatic coding of Chinese surgical procedure information, including:
Step 1, Chinese surgical procedure information is inputted;
Step 2, natural language processing is carried out to the Chinese surgical procedure information, obtains one or more titles to be encoded;
Step 3, based on the standard terminology library and expansion terminology bank pre-established, the mark to match with the title to be encoded is searched
Quasi- term or expand term, and by the standard terminology of successful match or expand term coding, be determined as the title to be encoded
Coding;
Wherein, the standard terminology library includes several standard terminologies and its coding, and the standard terminology is International Classification of Diseases ICD
Specified in surgical procedure title, the coding of the standard terminology is corresponding surgical procedure specified in International Classification of Diseases ICD
The coding of title;
The expansion terminology bank includes several expansion terms and its coding, and the expansion term is with same with the standard terminology
The word of adopted relationship or the word with relation of genus and species;
It is described to expand term coding corresponding with the standard terminology with synonymy or relation of genus and species unanimously;
The step 2 includes:
Step 21, the Chinese surgical procedure information character string is pre-processed, obtains pretreated Chinese surgical procedure
Information character string;
Step 22, based on the ontology dictionary, orientation dictionary, grade dictionary pre-established, the pretreated Chinese is performed the operation
Operation information character string is cut into several first kind substrings and/or Second Type substring;
Wherein, the ontology dictionary includes the standard terminology library and expands terminology bank, the standard terminology and the expansion art
Language is ontology;The orientation dictionary includes several directional terminologies, and the directional terminology is targeted for describing surgical procedure
Orientation word;The grade dictionary includes several grade terms, and the grade term is the grade for describing surgical procedure
Not, the word of type;
The first kind substring can directly be matched with the ontology in the ontology dictionary, the sub- character of Second Type
String can not directly be matched with the ontology in the ontology dictionary;
Step 23, the first kind substring being syncopated as and Second Type substring are determined as title to be encoded;
The step 21 includes:
To the non-Chinese character in the Chinese surgical procedure information character string into row format normalized, and delete the Chinese hand
Non-medical term in art operation information character string obtains pretreated Chinese surgical procedure information character string, wherein described
The non-medical term dictionary that non-medical term is pre-established by one provides, and the word that the non-medical term has been remarks effect
Language, phrase or sentence;
The step 22 includes:
Judge the pretreated Chinese surgical procedure information character string whether comprising symbol;
If the pretreated Chinese surgical procedure information character string includes symbol, by the pretreated Chinese hand
Character in art operation information character string between every adjacent two symbols is matched as a whole with the ontology in ontology dictionary;
If successful match, using the character cutting between the adjacent two symbols out as first kind substring;If matching is lost
Lose, then by the adjacent two symbols and its between character be determined as wouldn't cutting character string, and judge described in wouldn't cutting word
Whether preset additional character is included in symbol string;
If it is described wouldn't in cutting character string comprising additional character, search described in wouldn't be belonging to cutting character string character mould
Type, and the corresponding segmentation rules of character model according to belonging to this to it is described wouldn't cutting character string carry out cutting, will be syncopated as
The character come is matched with the ontology in ontology dictionary, if successful match, using the character cut out as the first kind
Type substring, if it fails to match, using the character cut out as Second Type substring;Wherein, the character
The character model library that model is pre-established by one provides, and the character model has one-to-one segmentation rules;
If described wouldn't not include additional character in cutting character string, by it is described wouldn't cutting character string be determined directly as second
Type substring;
If the pretreated Chinese surgical procedure information character string is not comprising symbol, using mechanical Chinese word segmentation method by described in
In single character or multiple continuous characters and the ontology dictionary in pretreated Chinese surgical procedure information character string
Ontology matched;
If all characters in the pretreated Chinese surgical procedure information character string can be with Ontology Matching, foundation
Matched ontology by the single character in the pretreated Chinese surgical procedure information character string or multiple continuous words
Symbol is cut out as first kind substring;
Fail and the single character of Ontology Matching or more if existing in the pretreated Chinese surgical procedure information character string
Whether a continuous character then fails with the single character of Ontology Matching or multiple continuous characters to be directional terminology described in judgement
Or grade term;
When described fail with the single character of Ontology Matching or multiple continuous characters as directional terminology or grade term, according to
It is described to fail with the single character of Ontology Matching or multiple continuous characters in the pretreated Chinese surgical procedure information
Position in character string fails described and the single character of Ontology Matching or multiple continuous characters and energy before or after it
It is enough merge with the single character of Ontology Matching or multiple continuous characters cut out as Second Type substring, and by institute
Stating remaining in pretreated Chinese surgical procedure information character string can be with the single character of Ontology Matching or multiple continuous
Character cutting out as first kind substring;
It, will when described fail with the single character of Ontology Matching or multiple continuous characters for directional terminology or grade term
The pretreated Chinese surgical procedure information character string is integrally cut out as Second Type substring.
2. the automatic coding of Chinese surgical procedure information according to claim 1, wherein,
The step 3 further includes:Based on the Hypothetical classification terminology bank pre-established, search what is matched with the title to be encoded
Hypothetical classification term;And by the coding of the Hypothetical classification term of successful match, it is determined as the coding of the title to be encoded;
The Hypothetical classification terminology bank includes several Hypothetical classification terms and its coding;
The Hypothetical classification term represents ad hoc type treatment means, and the ad hoc type treatment means correspond to a variety of resection operation classes
Type, a variety of resection operation types are the standard terminology;
The coding of the Hypothetical classification term and the full resection operation type of organ in a variety of resection operation types or part
The coding of resection operation type is consistent.
3. the automatic coding of Chinese surgical procedure information according to claim 1, wherein,
The step 3 further includes:Based on the odd encoder terminology bank pre-established, lookup matches more with the title to be encoded
Encryption description;And by the coding of the odd encoder term of successful match, it is determined as the coding of the title to be encoded;
The odd encoder terminology bank includes several odd encoder terms and its coding;
The odd encoder term is ad hoc type surgical procedure type;The premise that the ad hoc type surgical procedure type performs is another
Kind surgical procedure type;The ad hoc type surgical procedure type and another surgical procedure type for the standard terminology or
The expansion term;
The coding for being encoded to the ad hoc type surgical procedure type of the odd encoder term and another surgical procedure class
The combination of the coding of type.
4. the automatic coding of Chinese surgical procedure information according to claim 1, wherein,
Before the step 3, further include:Based on the merging terminology bank pre-established, to one or more of names to be encoded
Title is pre-processed;
The merging terminology bank includes several merging terms and its coding;Wherein, the merging term is International Classification of Diseases ICD
The defined single standard terminology that can substitute at least two while the other standards term occurred;Described at least two go out simultaneously
Existing other standards term is the combining objects of the merging term;The terminology bank that merges further includes the complete of each merging term
Portion's combining objects;
It is described based on merging terminology bank, the step of pretreatment to one or more of titles to be encoded, including:Judge
In one or more of titles to be encoded, if comprising any one or more whole combining objects for merging term, if packet
Contain, then any one or more whole combining objects for merging term are substituted for corresponding merging term.
5. the automatic coding of Chinese surgical procedure information according to claim 1, wherein,
Before the step 3, further include:Based on the omission terminology bank pre-established, to one or more of names to be encoded
Title is pre-processed;
The omission terminology bank includes several omission terms and its coding;Wherein, the omission term is International Classification of Diseases ICD
The defined single standard terminology that can substitute at least two while the standard terminology occurred;It is described omission term be it is described at least
One in two standard terminologies occurred simultaneously;Described at least two standard terminologies occurred simultaneously are the province of the omission term
Slightly object;It is described omit terminology bank and further include each and omit term be omitted altogether object;
It is described based on omitting terminology bank, the step of pretreatment to one or more of titles to be encoded, including:Judge
In one or more of titles to be encoded, if object is omitted altogether comprising any one or more omission terms, if packet
Contain, then any one or more objects that are omitted altogether for omitting term are substituted for corresponding omission term.
6. according to the automatic coding of any Chinese surgical procedure information of Claims 1 to 5, wherein, the step 3
Later, it further includes:
Step 4, by the title to be encoded for not determining coding and being matched without encryption description in no encryption description library, if matching
Success then performs preset processing step to represent not determining this title to be encoded of coding encodes, if matching is lost
It loses, then this not being determined to, the title to be encoded of coding is sent to artificial treatment platform and carries out artificial treatment;
Wherein, the no encryption description dictionary includes several no encryption descriptions;
Several no encryption descriptions include:
The preset word for representing surgical procedure information but can not determine surgical procedure title;
Preset disease name;
Preset nomenclature of drug;And
Preset medical treatment consumptive materials title.
7. the automatic coding of Chinese surgical procedure information according to claim 1, wherein, it is searched in the step 3
The step of standard terminology or expansion term for matching with the title to be encoded, including:
If the entitled first kind substring to be encoded, by the ontology that the first kind substring matches, really
It is set to the standard terminology to match with the title to be encoded or expands term;
If the entitled Second Type substring to be encoded,:
The parsing of the first dimension is carried out to each ontology in Second Type substring and ontology dictionary, obtains Second Type
Several first dimension analysis results of several first dimension analysis results of character string and each ontology;
By each the of ontology each in each first dimension analysis result of the Second Type substring and the ontology dictionary
Dimension analysis result is matched, judge whether each first dimension analysis result with the Second Type substring
The ontology that matches of each first dimension analysis result;
If there is each first dimension analysis result with each first dimension analysis result phase of the Second Type substring
The ontology is then determined as the ontology that the Second Type substring matches by matched ontology;
If there is no each first dimension analysis result with each first dimension analysis result of the Second Type substring
The ontology to match then chooses the first dimension of part in all the first dimension analysis results of the Second Type substring
Analysis result is tied with part the first dimension parsing in all the first dimension analysis results of ontology each in the ontology dictionary
Fruit is matched, and judges whether the described of part the first dimension analysis result and the Second Type substring
The ontology that part the first dimension analysis result matches;
If there are the part the first dimension solutions of part the first dimension analysis result and the Second Type substring
The ontology is then determined as the ontology that the Second Type substring matches by the ontology that matches of analysis result;
If there is no the first dimensions of the part of part the first dimension analysis result and the Second Type substring
The ontology that analysis result matches then carries out the to each ontology in the Second Type substring and the ontology dictionary
The parsing of two-dimensions obtains several second dimension analysis results of the Second Type substring and the ontology dictionary
In each ontology several second dimension analysis results;
Several second dimensions of several second dimension analysis results and the ontology based on the Second Type substring
Analysis result calculates the matching degree of the Second Type substring and each ontology;
According to the matching degree of the Second Type substring and each ontology, determine one or more ontologies as described the
The ontology that two type substrings match;
By the ontology that the Second Type substring matches, be determined as standard terminology that the title to be encoded matches or
Expand term.
8. the automatic coding of Chinese surgical procedure information according to claim 7, wherein, the sub- word of Second Type
Symbol string described in each first dimension analysis result of ontology be respectively:
The Second Type substring described in directional terminology in ontology;
The Second Type substring described in grade term in ontology;
The Second Type substring described in character in ontology bracket;
The Second Type substring described in character in ontology after dash;And
The Second Type substring described in ontology except directional terminology, grade term, the character in bracket, after dash
Character other than character;
The Second Type substring described in ontology all part the first dimension parsing knots in the first dimension analysis results
Fruit includes:In the two types substring described in ontology except directional terminology, grade term, the character in bracket, dash
The character other than character afterwards;And one or more of the following items:
The Second Type substring described in directional terminology in ontology, grade term;
The Second Type substring described in character in ontology bracket;
The Second Type substring described in character in ontology after dash.
9. the automatic coding of Chinese surgical procedure information according to claim 7, wherein, the sub- word of Second Type
Symbol string described in each second dimension analysis result of ontology be respectively:
The Second Type substring described in ontology each Chinese character;
The Second Type substring described in ontology each Chinese character initial consonant;
The Second Type substring described in ontology each Chinese character simple or compound vowel of a Chinese syllable;
The Second Type substring described in ontology initial character;
The Second Type substring described in ontology initial character phonetic;And
The Second Type substring described in non-chinese character in ontology.
10. the automatic coding of Chinese surgical procedure information according to claim 7, wherein, it is described based on described the
Several second dimension analysis results of two type substrings and several second dimension analysis results of the ontology calculate
The step of matching degree of the Second Type substring and each ontology, includes:
The similarity of the Second Type substring and each ontology is calculated according to equation below:
Wherein, M represents similarity;
T represents each second dimension analysis result of Second Type substring;
Q represents Second Type substring;
T in q represent each second dimension of Second Type substring;
D represents ontology;
Tf (t in d) expressions are in the second identical dimension, the second dimension analysis result and ontology of Second Type substring
The frequency that matches of the second dimension analysis result;
Wherein, T represents the sum of ontology in ontology dictionary, and T (t) represents each second dimension analysis result
The sum of ontology to match with each second dimension analysis result of Second Type substring;
T.getBoost () represents the preset weights of each second dimension;
Norm (t, d) represents the length normalization method factor of ontology;
The similarity being calculated is determined as to the matching degree of the Second Type substring and each ontology.
11. the automatic coding of Chinese surgical procedure information according to claim 7, wherein, it is described based on described the
Several second dimension analysis results of two type substrings and several second dimension analysis results of the ontology calculate
The step of matching degree of the Second Type substring and each ontology, includes:
Determine each Chinese character in the Second Type substring;
The cosine confidence level of the matched each ontology of Second Type substring is calculated according to equation below:
Total confidence level of the matched each ontology of Second Type substring is calculated according to equation below:
S=M × a+N × b
Wherein, N represents cosine confidence level;
V represents the Chinese character sum that Second Type substring and its ontology to match are included;
Q represents Second Type substring;
D' represents the ontology to match with Second Type substring;
wQ,jRepresent the frequency that each Chinese character occurs in Second Type substring;
wd',jRepresent the frequency occurred in the ontology that each Chinese character matches in Second Type substring;
J represents the serial number of Chinese character that Second Type substring and its ontology to match are included;
S represents total confidence level;
M represents similarity;
A represents the corresponding preset weights of similarity M;
B represents the corresponding preset weights of cosine confidence level N;
Also, similarity M is calculated according to equation below:
Wherein, t represents each second dimension analysis result of Second Type substring;
Q represents Second Type substring;
T in q represent each second dimension of Second Type substring;
D represents ontology;
Tf (t in d) expressions are in the second identical dimension, the second dimension analysis result and ontology of Second Type substring
The frequency that matches of the second dimension analysis result;
Wherein, T represents the sum of ontology in ontology dictionary, and T (t) represents each second dimension analysis result
The sum of ontology to match with each second dimension analysis result of Second Type substring;
T.getBoost () represents the preset weights of each second dimension;
Norm (t, d) represents the length normalization method factor of ontology;
The total confidence level being calculated is determined as to the matching degree of the Second Type substring and each ontology.
12. the automatic coding of Chinese surgical procedure information according to claim 7, wherein, described in the basis
The matching degree of Second Type substring and each ontology determines one or more ontology as the sub- character of the Second Type
The step of ontology that string matches, including:
Size according to the matching degree with the Second Type substring sorts to whole ontologies, and it is forward wherein to sort
The ontology of preset quantity be determined as the ontology that the Second Type substring matches;
Alternatively,
One or more ontologies of predetermined threshold value will be reached with the matching degree of the Second Type substring, be determined as described
The ontology that Second Type substring matches.
13. a kind of automatic coding system of Chinese surgical procedure information, including:
Import modul, for inputting Chinese surgical procedure information;
Natural language processing module for carrying out natural language processing to the Chinese surgical procedure information, obtains one or more
A title to be encoded;
Endowed module is matched, for based on the standard terminology library and expansion terminology bank pre-established, searching and the name to be encoded
Claim the standard terminology that matches or expand term, and by the standard terminology of successful match or expand the coding of term, be determined as institute
State the coding of title to be encoded;
Wherein, the standard terminology library includes several standard terminologies and its coding, and the standard terminology is International Classification of Diseases ICD
Specified in surgical procedure title, the coding of the standard terminology is corresponding surgical procedure specified in International Classification of Diseases ICD
The coding of title;
The expansion terminology bank includes several expansion terms and its coding, and the expansion term is with same with the standard terminology
The word of adopted relationship or the word with relation of genus and species;
It is described to expand term coding corresponding with the standard terminology with synonymy or relation of genus and species unanimously;
In the natural language processing module, natural language processing is carried out to the Chinese surgical procedure information, obtain one or
Multiple titles to be encoded are in the following way:
Step 21, the Chinese surgical procedure information character string is pre-processed, obtains pretreated Chinese surgical procedure
Information character string;
Step 22, based on the ontology dictionary, orientation dictionary, grade dictionary pre-established, the pretreated Chinese is performed the operation
Operation information character string is cut into several first kind substrings and/or Second Type substring;
Wherein, the ontology dictionary includes the standard terminology library and expands terminology bank, the standard terminology and the expansion art
Language is ontology;The orientation dictionary includes several directional terminologies, and the directional terminology is targeted for describing surgical procedure
Orientation word;The grade dictionary includes several grade terms, and the grade term is the grade for describing surgical procedure
Not, the word of type;
The first kind substring can directly be matched with the ontology in the ontology dictionary, the sub- character of Second Type
String can not directly be matched with the ontology in the ontology dictionary;
Step 23, the first kind substring being syncopated as and Second Type substring are determined as title to be encoded;
The step 21 includes:
To the non-Chinese character in the Chinese surgical procedure information character string into row format normalized, and delete the Chinese hand
Non-medical term in art operation information character string obtains pretreated Chinese surgical procedure information character string, wherein described
The non-medical term dictionary that non-medical term is pre-established by one provides, and the word that the non-medical term has been remarks effect
Language, phrase or sentence;
The step 22 includes:
Judge the pretreated Chinese surgical procedure information character string whether comprising symbol;
If the pretreated Chinese surgical procedure information character string includes symbol, by the pretreated Chinese hand
Character in art operation information character string between every adjacent two symbols is matched as a whole with the ontology in ontology dictionary;
If successful match, using the character cutting between the adjacent two symbols out as first kind substring;If matching is lost
Lose, then by the adjacent two symbols and its between character be determined as wouldn't cutting character string, and judge described in wouldn't cutting word
Whether preset additional character is included in symbol string;
If it is described wouldn't in cutting character string comprising additional character, search described in wouldn't be belonging to cutting character string character mould
Type, and the corresponding segmentation rules of character model according to belonging to this to it is described wouldn't cutting character string carry out cutting, will be syncopated as
The character come is matched with the ontology in ontology dictionary, if successful match, using the character cut out as the first kind
Type substring, if it fails to match, using the character cut out as Second Type substring;Wherein, the character
The character model library that model is pre-established by one provides, and the character model has one-to-one segmentation rules;
If described wouldn't not include additional character in cutting character string, by it is described wouldn't cutting character string be determined directly as second
Type substring;
If the pretreated Chinese surgical procedure information character string is not comprising symbol, using mechanical Chinese word segmentation method by described in
In single character or multiple continuous characters and the ontology dictionary in pretreated Chinese surgical procedure information character string
Ontology matched;
If all characters in the pretreated Chinese surgical procedure information character string can be with Ontology Matching, foundation
Matched ontology by the single character in the pretreated Chinese surgical procedure information character string or multiple continuous words
Symbol is cut out as first kind substring;
Fail and the single character of Ontology Matching or more if existing in the pretreated Chinese surgical procedure information character string
Whether a continuous character then fails with the single character of Ontology Matching or multiple continuous characters to be directional terminology described in judgement
Or grade term;
When described fail with the single character of Ontology Matching or multiple continuous characters as directional terminology or grade term, according to
It is described to fail with the single character of Ontology Matching or multiple continuous characters in the pretreated Chinese surgical procedure information
Position in character string fails described and the single character of Ontology Matching or multiple continuous characters and energy before or after it
It is enough merge with the single character of Ontology Matching or multiple continuous characters cut out as Second Type substring, and by institute
Stating remaining in pretreated Chinese surgical procedure information character string can be with the single character of Ontology Matching or multiple continuous
Character cutting out as first kind substring;
It, will when described fail with the single character of Ontology Matching or multiple continuous characters for directional terminology or grade term
The pretreated Chinese surgical procedure information character string is integrally cut out as Second Type substring.
14. the automatic coding system of Chinese surgical procedure information according to claim 13, wherein,
The endowed module of matching is additionally operable to, based on the Hypothetical classification terminology bank pre-established, search and the title to be encoded
The Hypothetical classification term to match;And by the coding of the Hypothetical classification term of successful match, it is determined as the title to be encoded
Coding;
The Hypothetical classification terminology bank includes several Hypothetical classification terms and its coding;
The Hypothetical classification term represents ad hoc type treatment means, and the ad hoc type treatment means correspond to a variety of resection operation classes
Type, a variety of resection operation types are the standard terminology;
The coding of the Hypothetical classification term and the full resection operation type of organ in a variety of resection operation types or part
The coding of resection operation type is consistent.
15. the automatic coding system of Chinese surgical procedure information according to claim 13, wherein,
The endowed module of matching is additionally operable to, based on the odd encoder terminology bank pre-established, search and the title phase to be encoded
Matched odd encoder term;And by the coding of the odd encoder term of successful match, it is determined as the coding of the title to be encoded;
The odd encoder terminology bank includes several odd encoder terms and its coding;
The odd encoder term is ad hoc type surgical procedure type;The premise that the ad hoc type surgical procedure type performs is another
Kind surgical procedure type;The ad hoc type surgical procedure type and another surgical procedure type for the standard terminology or
The expansion term;
The coding for being encoded to the ad hoc type surgical procedure type of the odd encoder term and another surgical procedure class
The combination of the coding of type.
16. the automatic coding system of Chinese surgical procedure information according to claim 13, further includes:
Merging treatment module, for based on the merging terminology bank pre-established, being carried out to one or more of titles to be encoded
Pretreatment;
The merging terminology bank includes several merging terms and its coding;Wherein, the merging term is International Classification of Diseases ICD
The defined single standard terminology that can substitute at least two while the other standards term occurred;Described at least two go out simultaneously
Existing other standards term is the combining objects of the merging term;The terminology bank that merges further includes the complete of each merging term
Portion's combining objects;
The merging treatment module, specifically for judging in one or more of titles to be encoded, if comprising any one
Or multiple whole combining objects for merging term, if comprising by any one or more whole merging for merging term
Object is substituted for corresponding merging term.
17. the automatic coding system of Chinese surgical procedure information according to claim 13, further includes:
Processing module is omitted, for based on the omission terminology bank pre-established, being carried out to one or more of titles to be encoded
Pretreatment;
The omission terminology bank includes several omission terms and its coding;Wherein, the omission term is International Classification of Diseases ICD
The defined single standard terminology that can substitute at least two while the standard terminology occurred;It is described omission term be it is described at least
One in two standard terminologies occurred simultaneously;Described at least two standard terminologies occurred simultaneously are the province of the omission term
Slightly object;It is described omit terminology bank and further include each and omit term be omitted altogether object;
The omission processing module, the step of specifically for being pre-processed to one or more of titles to be encoded, including:
Judge in one or more of titles to be encoded, if object is omitted altogether comprising any one or more omission terms,
If comprising any one or more objects that are omitted altogether for omitting term are substituted for corresponding omission term.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201510496500.3A CN105069123B (en) | 2015-08-13 | 2015-08-13 | A kind of automatic coding and system of Chinese surgical procedure information |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201510496500.3A CN105069123B (en) | 2015-08-13 | 2015-08-13 | A kind of automatic coding and system of Chinese surgical procedure information |
Publications (2)
Publication Number | Publication Date |
---|---|
CN105069123A CN105069123A (en) | 2015-11-18 |
CN105069123B true CN105069123B (en) | 2018-06-26 |
Family
ID=54498493
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201510496500.3A Active CN105069123B (en) | 2015-08-13 | 2015-08-13 | A kind of automatic coding and system of Chinese surgical procedure information |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN105069123B (en) |
Families Citing this family (20)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN105630873B (en) * | 2015-12-18 | 2018-12-25 | 河南思维自动化设备股份有限公司 | The graphical assist edit method disclosed in yard |
CN105963022B (en) * | 2016-04-19 | 2018-08-14 | 中国中医科学院中医临床基础医学研究所 | Treat encoder |
CN106874643B (en) * | 2016-12-27 | 2020-02-28 | 中国科学院自动化研究所 | Method and system for automatically constructing knowledge base to realize auxiliary diagnosis and treatment based on word vectors |
CN108257667A (en) * | 2016-12-28 | 2018-07-06 | 中国科学院深圳先进技术研究院 | A kind of data processing method and terminal device |
CN108320778A (en) * | 2017-01-16 | 2018-07-24 | 医渡云(北京)技术有限公司 | Medical record ICD coding methods and system |
CN106844308B (en) * | 2017-01-20 | 2020-04-03 | 天津艾登科技有限公司 | Method for automatic disease code conversion using semantic recognition |
CN107577826B (en) * | 2017-10-25 | 2018-05-15 | 山东众阳软件有限公司 | Classification of diseases coding method and system based on raw diagnostic data |
CN107705839B (en) * | 2017-10-25 | 2020-06-26 | 山东众阳软件有限公司 | Disease automatic coding method and system |
CN108182207B (en) * | 2017-12-15 | 2020-11-13 | 中电科软件信息服务有限公司 | Intelligent coding method and system for Chinese surgical operation based on word segmentation network |
CN108182977A (en) * | 2018-02-05 | 2018-06-19 | 南方医科大学顺德医院(佛山市顺德区第人民医院) | Patient diagnosis coding method and system |
CN108831522A (en) * | 2018-05-28 | 2018-11-16 | 陈丽璇 | A kind of the medical insurance disease score value charging system and its construction method of autocoding |
CN109273062A (en) * | 2018-08-09 | 2019-01-25 | 北京爱医声科技有限公司 | ICD intelligence Auxiliary Encoder System |
CN109256216B (en) * | 2018-08-14 | 2023-06-27 | 平安医疗健康管理股份有限公司 | Medical data processing method, medical data processing device, computer equipment and storage medium |
CN109918655B (en) * | 2019-02-27 | 2023-11-14 | 浙江数链科技有限公司 | Logistics term library generation method and device |
CN110442844B (en) * | 2019-07-03 | 2023-09-26 | 北京达佳互联信息技术有限公司 | Data processing method, device, electronic equipment and storage medium |
CN111128388B (en) * | 2019-12-03 | 2024-02-27 | 东软集团股份有限公司 | Value range data matching method and device and related products |
CN112131868A (en) * | 2020-09-22 | 2020-12-25 | 上海亿普医药科技有限公司 | Clinical trial medical coding method |
CN112131867A (en) * | 2020-09-22 | 2020-12-25 | 上海亿普医药科技有限公司 | Clinical trial medical coding system |
CN112749307B (en) * | 2020-12-30 | 2022-11-08 | 杭州依图医疗技术有限公司 | Medical data processing method and device and storage medium |
CN115017326B (en) * | 2022-05-12 | 2023-08-18 | 青岛普瑞盛医药科技有限公司 | Medical coding method and device |
Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102456100A (en) * | 2010-11-03 | 2012-05-16 | 通用电气公司 | Systems, methods, and apparatus for computer-assisted full medical code scheme to code scheme mapping |
CN104156415A (en) * | 2014-07-31 | 2014-11-19 | 沈阳锐易特软件技术有限公司 | Mapping processing system and method for solving problem of standard code control of medical data |
-
2015
- 2015-08-13 CN CN201510496500.3A patent/CN105069123B/en active Active
Patent Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102456100A (en) * | 2010-11-03 | 2012-05-16 | 通用电气公司 | Systems, methods, and apparatus for computer-assisted full medical code scheme to code scheme mapping |
CN104156415A (en) * | 2014-07-31 | 2014-11-19 | 沈阳锐易特软件技术有限公司 | Mapping processing system and method for solving problem of standard code control of medical data |
Non-Patent Citations (1)
Title |
---|
中文分词算法的研究与实现;林冬盛;《中国优秀硕士学位论文全文数据库信息科技辑》;20110815(第08期);第24页 * |
Also Published As
Publication number | Publication date |
---|---|
CN105069123A (en) | 2015-11-18 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN105069123B (en) | A kind of automatic coding and system of Chinese surgical procedure information | |
CN105184053B (en) | A kind of automatic coding and system of Chinese medical service item information | |
CN105069124B (en) | A kind of International Classification of Diseases coding method of automation and system | |
CN105095665B (en) | A kind of natural language processing method and system of Chinese medical diagnosis on disease information | |
CN105138829B (en) | A kind of natural language processing method and system of Chinese medical information | |
Zhang et al. | MIE: A medical information extractor towards medical dialogues | |
CN108549639A (en) | Based on the modified Chinese medicine case name recognition methods of multiple features template and system | |
CN106407443A (en) | Structured medical data generation method and device | |
CN106844351B (en) | Medical institution organization entity identification method and device oriented to multiple data sources | |
CN108647203B (en) | Method for calculating text similarity of traditional Chinese medicine disease conditions | |
US11042712B2 (en) | Simplifying and/or paraphrasing complex textual content by jointly learning semantic alignment and simplicity | |
CN106934220A (en) | Towards the disease class entity recognition method and device of multi-data source | |
Khin et al. | A deep learning architecture for de-identification of patient notes: Implementation and evaluation | |
CN111651991B (en) | Medical named entity identification method utilizing multi-model fusion strategy | |
CN109192255A (en) | Case history structural method | |
US20170193197A1 (en) | System and method for automatic unstructured data analysis from medical records | |
Ji et al. | A BILSTM-CRF method to Chinese electronic medical record named entity recognition | |
CN108804423A (en) | Medical Text character extraction and automatic matching method and system | |
WO2020211250A1 (en) | Entity recognition method and apparatus for chinese medical record, device and storage medium | |
Polignano et al. | Comparing Transformer-based NER approaches for analysing textual medical diagnoses. | |
Yu et al. | Bios: An algorithmically generated biomedical knowledge graph | |
CN113658720A (en) | Method, apparatus, electronic device and storage medium for matching diagnostic name and ICD code | |
Costumero et al. | Text analysis and information extraction from Spanish written documents | |
CN106776535A (en) | Scientific and technical literature fine granularity relation excavation method based on two-stage syntax parsing | |
Jain | Supervised Named Entity Recognition for Clinical Data. |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |