CN105069123B - Automatic coding method and system for Chinese surgical procedure information - Google Patents

Automatic coding method and system for Chinese surgical procedure information

Publication number
CN105069123B
Authority
CN
China
Prior art keywords
ontology
surgical procedure
type
term
character
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201510496500.3A
Other languages
Chinese (zh)
Other versions
CN105069123A (en)
Inventor
金以东
陈志永
朱华玲
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Ebaotech Internet Medical Information Technology (beijing) Co Ltd
Original Assignee
Ebaotech Internet Medical Information Technology (beijing) Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Ebaotech Internet Medical Information Technology (beijing) Co Ltd
Priority to CN201510496500.3A
Publication of CN105069123A
Application granted
Publication of CN105069123B

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20 Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24 Querying
    • G06F16/245 Query processing
    • G06F16/2452 Query translation
    • G06F16/24522 Translation of natural language queries to structured queries

Abstract

Embodiments of the present invention provide an automatic coding method and system for Chinese surgical procedure information. The method includes: performing natural language processing on the input Chinese surgical procedure information to obtain a name to be encoded; searching for a standard term or expansion term that matches the name to be encoded; and determining the code of the successfully matched standard term or expansion term as the code of the name to be encoded. A standard term is a surgical procedure name specified in the International Classification of Diseases (ICD), and its code is the code that the ICD specifies for that surgical procedure name. An expansion term is a word that has a synonym relationship or a genus-species relationship with a standard term, and its code is the same as the code of that standard term. The invention identifies and codes surgical procedure names automatically, quickly, and accurately; the whole process requires no manual involvement, which gives fast coding, low cost, and high accuracy.

Description

Automatic coding method and system for Chinese surgical procedure information
Technical field
Embodiments of the present invention relate to the field of medical information, and more specifically to an automatic coding method and system for Chinese surgical procedure information.
Background
This section is intended to provide background or context for the embodiments of the invention recited in the claims. The description here is not admitted to be prior art merely because it is included in this section.
At present, in the medical and health field, the performance of surgical procedures is generally managed through a surgical grading management system, in which the writing and coding of surgical procedure names are very important. Normally the surgeon writes the surgical procedure name and a medical record administrator then codes it. A correctly written surgical procedure name and an accurate code are the basis for carrying out the procedure smoothly; they help standardize surgical procedures and reduce medical risk.
The Ministry of Health of China requires the medical and health industry to code surgical procedures uniformly according to ICD-9-CM-3. ICD-9-CM-3 refers to the "International Classification of Diseases, 9th Revision, Clinical Modification, Volume 3", a professional reference used to guide the classification of medical operations and diseases. ICD-9-CM-3 classifies and codes operations according to the disease addressed, the complexity of the operation, and the technical requirements.
Summary of the invention
At present, in most medical and health institutions the coding of surgical procedures is still done manually by medical record administrators, which is inefficient and costly. Moreover, the surgical procedure information that a surgeon writes in a medical record is natural language: its form is complex and varied and follows no unified standard (for example, it may mix several languages, use non-standard grammar, contain entry errors, use abbreviations or colloquial names in place of standard terms, or include meaningless characters such as symbols). Medical record administrators often have to consult the detailed content of the medical record before they can determine the surgical procedure name and complete the coding, which further lowers coding efficiency and tends to produce a high error rate.
To this end, the present invention provides an automatic coding mechanism for surgical procedure names that identifies surgical procedure names and codes them automatically, quickly, and accurately.
In this context, embodiments of the present invention are intended to provide an automatic coding method and system for Chinese surgical procedure information.
In a first aspect of embodiments of the present invention, an automatic coding method for Chinese surgical procedure information is provided, including:
Step 1: inputting Chinese surgical procedure information;
Step 2: performing natural language processing on the Chinese surgical procedure information to obtain one or more names to be encoded;
Step 3: based on a pre-established standard term library and expansion term library, searching for a standard term or expansion term that matches the name to be encoded, and determining the code of the successfully matched standard term or expansion term as the code of the name to be encoded;
wherein the standard term library includes a number of standard terms and their codes, each standard term is a surgical procedure name specified in the International Classification of Diseases (ICD), and the code of a standard term is the code that the ICD specifies for the corresponding surgical procedure name;
the expansion term library includes a number of expansion terms and their codes, and each expansion term is a word that has a synonym relationship or a genus-species relationship with a standard term;
the code of an expansion term is the same as the code of the standard term with which it has the synonym or genus-species relationship.
In a second aspect of embodiments of the present invention, an automatic coding system for Chinese surgical procedure information is provided, including:
an input module, for inputting Chinese surgical procedure information;
a natural language processing module, for performing natural language processing on the Chinese surgical procedure information to obtain one or more names to be encoded;
a matching and code-assignment module, for searching, based on a pre-established standard term library and expansion term library, for a standard term or expansion term that matches the name to be encoded, and determining the code of the successfully matched standard term or expansion term as the code of the name to be encoded;
wherein the standard term library includes a number of standard terms and their codes, each standard term is a surgical procedure name specified in the International Classification of Diseases (ICD), and the code of a standard term is the code that the ICD specifies for the corresponding surgical procedure name;
the expansion term library includes a number of expansion terms and their codes, and each expansion term is a word that has a synonym relationship or a genus-species relationship with a standard term;
the code of an expansion term is the same as the code of the standard term with which it has the synonym or genus-species relationship.
By means of the above technical solution, the present invention takes full account of the fact that the Chinese surgical procedure information entered by a surgeon is natural language, complex and varied in form, and without a unified standard. It matches the Chinese surgical procedure information character string against several dictionaries established in advance on the basis of ICD-9-CM-3, so that surgical procedure names are identified and coded automatically, quickly, and accurately. The whole process requires no manual involvement, which increases coding speed, reduces coding cost, and safeguards coding accuracy.
Description of the drawings
The above and other objects, features, and advantages of the exemplary embodiments of the present invention will become easy to understand by reading the following detailed description with reference to the accompanying drawings. The drawings show several embodiments of the invention by way of example and not limitation, in which:
Fig. 1 schematically shows an application scenario in which embodiments of the present invention can be implemented;
Fig. 2 schematically shows a flowchart of the automatic coding of Chinese surgical procedure information according to an exemplary embodiment of the present invention;
Fig. 3 schematically shows a flowchart of the automatic coding of Chinese surgical procedure information according to Embodiment One of the present invention;
Fig. 4 schematically shows a flowchart of the automatic coding of Chinese surgical procedure information according to Embodiment Two of the present invention;
Fig. 5 schematically shows a flowchart of the automatic coding of Chinese surgical procedure information according to Embodiment Three of the present invention;
Fig. 6 schematically shows a flowchart of the automatic coding of Chinese surgical procedure information according to Embodiment Four of the present invention;
Fig. 7 schematically shows a flowchart of the automatic coding of Chinese surgical procedure information according to Embodiment Five of the present invention;
Fig. 8 schematically shows a flowchart of the natural language processing of Embodiment Six of the present invention;
Fig. 9 schematically shows a flowchart of splitting first-type substrings and second-type substrings according to Embodiment Six of the present invention;
Fig. 10 schematically shows a flowchart of searching for a standard term or expansion term that matches the name to be encoded according to Embodiment Seven of the present invention;
Fig. 11 schematically shows a block diagram of the automatic coding system for Chinese surgical procedure information according to an exemplary embodiment of the present invention.
In the drawings, identical or corresponding reference numerals denote identical or corresponding parts.
Detailed description of embodiments
The principle and spirit of the present invention are described below with reference to several exemplary embodiments. It should be understood that these embodiments are given only to enable those skilled in the art to better understand and then implement the invention, and not to limit the scope of the invention in any way. Rather, these embodiments are provided so that this disclosure is thorough and complete and fully conveys the scope of the disclosure to those skilled in the art.
Those skilled in the art know that embodiments of the present invention can be implemented as a system, apparatus, device, method, or computer program product. Accordingly, the present disclosure may be embodied in the following forms: entirely hardware, entirely software (including firmware, resident software, microcode, and the like), or a combination of hardware and software.
According to embodiments of the present invention, a method and an apparatus are proposed.
Herein, any number of elements in the drawings is given by way of example and not limitation, and any naming is used only for distinction and carries no limiting meaning.
The principle and spirit of the present invention are explained in detail below with reference to several representative embodiments.
Overview of the application scenario
Reference is first made to Fig. 1, which shows an application scenario in which embodiments of the present invention can be implemented.
The scenario shown in Fig. 1 includes a medical information platform 100 and a Chinese surgical procedure information automatic coding system 200. The medical information platform 100 may be software loaded on a device used by a doctor, such as a desktop computer, laptop, tablet computer, or personal digital assistant. The Chinese surgical procedure information automatic coding system 200 may be software running on a hospital information server or the like. The medical information platform 100 and the Chinese surgical procedure information automatic coding system 200 may be communicatively connected, for example, through a hospital local area network.
After the surgeon enters Chinese surgical procedure information into the medical information platform 100, the information is transmitted to the Chinese surgical procedure information automatic coding system 200, which performs natural language processing and automatic coding on it and finally outputs the coding result.
Exemplary method
With reference to the application scenario of Fig. 1, an automatic coding method for Chinese surgical procedure information according to an exemplary embodiment of the invention is described with reference to Fig. 2. It should be noted that the above application scenario is shown only to make the spirit and principle of the invention easier to understand, and embodiments of the invention are not limited in this respect. Rather, embodiments of the invention can be applied to any applicable scenario.
As shown in Fig. 2, the automatic coding method for Chinese surgical procedure information includes:
Step S101: input Chinese surgical procedure information.
Step S102: perform natural language processing on the Chinese surgical procedure information to obtain one or more names to be encoded.
Based on the characteristics of surgical procedure information, this step may apply processing such as mechanical Chinese word segmentation to the surgical procedure information to obtain the names to be encoded. Embodiment Six below describes one specific implementation of how this exemplary method performs natural language processing on Chinese surgical procedure information.
Step S103: based on the pre-established standard term library and expansion term library, search for a standard term or expansion term that matches the name to be encoded, and determine the code of the successfully matched standard term or expansion term as the code of the name to be encoded.
In this exemplary method, the standard term library includes a number of standard terms and their codes. A standard term is a surgical procedure name specified in the International Classification of Diseases (ICD), and its code is the code that the ICD specifies for the corresponding surgical procedure name.
The expansion term library includes a number of expansion terms and their codes. An expansion term is a word that has a synonym relationship or a genus-species relationship with a standard term. An expansion term with a synonym relationship may be a colloquial name, alias, or abbreviation of the standard term. An expansion term with a genus-species relationship may, in concept or in application, either include the standard term (a higher level than the surgical procedure type that the standard term represents) or be included by the standard term (a lower level than the surgical procedure type that the standard term represents). To meet coding needs, and based on clinical experience, the code of an expansion term is made the same as the code of the standard term with which it has the synonym or genus-species relationship.
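The lookup in step S103 can be sketched in Python as a plain dictionary lookup. This is a minimal sketch under stated assumptions, not the patented implementation: the names `STANDARD_TERMS`, `EXPANSION_TERMS`, `build_ontology`, and `assign_code` are hypothetical, the expansion term "Partial liver resection" is an invented synonym used only for illustration, and exact string equality stands in for whatever matching procedure an implementation actually uses. The code 50.22011 for "Partial hepatectomy" and 43.42004 for "Excision of lesion of stomach" are the ones listed in Table 2 of Embodiment One.

```python
# Standard terms and their codes (codes taken from Table 2 of the source).
STANDARD_TERMS = {
    "Partial hepatectomy": "50.22011",
    "Excision of lesion of stomach": "43.42004",
}

# Each expansion term points at the standard term it is related to.
# "Partial liver resection" is an assumed synonym, for illustration only.
EXPANSION_TERMS = {
    "Partial liver resection": "Partial hepatectomy",
}


def build_ontology(standard, expansion):
    """Merge both libraries into one term -> code dictionary.

    An expansion term inherits the code of its related standard term.
    """
    ontology = dict(standard)
    for term, standard_term in expansion.items():
        ontology[term] = standard[standard_term]
    return ontology


def assign_code(name, ontology):
    """Return the code of the matching term, or None when nothing matches."""
    return ontology.get(name)


ONTOLOGY = build_ontology(STANDARD_TERMS, EXPANSION_TERMS)
```

Storing the related standard term rather than a copied code for each expansion term keeps the two codes identical by construction, which is the property the text requires.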
In this exemplary method, the standard term library and the expansion term library can also be revised in real time, for example by adding new expansion terms or deleting existing ones, so that the libraries better meet the needs of ICD-9-CM-3 coding.
In this exemplary method, the standard term library and the expansion term library form an ontology dictionary, and the standard terms and expansion terms are the ontologies in that dictionary. Table 1 shows some of the standard terms and expansion terms in the ontology dictionary together with their codes.
Table 1
Embodiment Seven below describes one specific implementation of how this exemplary method searches for a standard term or expansion term that matches the name to be encoded.
Embodiment One
As shown in Fig. 3, a specific automatic coding method for Chinese surgical procedure information includes:
Step S201: input Chinese surgical procedure information.
Step S202: perform natural language processing on the Chinese surgical procedure information to obtain one or more names to be encoded.
Step S203: based on the pre-established standard term library, expansion term library, and hypothetical classification term library, search for a standard term, expansion term, or hypothetical classification term that matches the name to be encoded, and determine the code of the successfully matched term as the code of the name to be encoded.
This embodiment adds a hypothetical classification term library to the exemplary method. The library includes a number of hypothetical classification terms and their codes.
A hypothetical classification term represents a specific type of treatment. That type of treatment corresponds to several types of resection, each of which is a standard term.
Each hypothetical classification term has exactly one corresponding standard term, and its code is the code of that standard term. According to ICD-9-CM-3: if the disease addressed by the specific type of treatment is a non-malignant tumor of the lesion site, the corresponding standard term is the lesion resection of that site; if the disease is a malignant tumor of the lesion site and organ transplantation is required, the corresponding standard term is the total resection of that site; if the disease is a malignant tumor of the lesion site and either organ transplantation is not performed or total resection is unsuitable, the corresponding standard term is the partial resection of that site.
In this embodiment, the hypothetical classification term library can also be revised in real time, for example by adding new hypothetical classification terms or deleting existing ones, so that the library better meets the needs of ICD-9-CM-3 coding.
For example, Table 2 shows some of the hypothetical classification terms in the hypothetical classification term library together with their corresponding standard terms and codes.
Table 2
Hypothetical classification term (specific type of treatment) | Standard term                 | Code
Liver Cancer under Radical Operation                          | Partial hepatectomy           | 50.22011
Hepatic cyst resection                                        | Hepatopathy damages resection | 50.29009
Diverticulectomy of stomach                                   | Excision of lesion of stomach | 43.42004
As shown in Table 2, the disease addressed by the specific type of treatment "Liver Cancer under Radical Operation" is a malignant tumor of the liver for which organ transplantation is unsuitable, so the corresponding standard term is "Partial hepatectomy".
Also as shown in Table 2, the disease addressed by the specific type of treatment "Hepatic cyst resection" is a non-malignant tumor of the liver, so the corresponding standard term is "Hepatopathy damages resection".
Again as shown in Table 2, the disease addressed by the specific type of treatment "resection of gastric carcinoma" is a malignant tumor of the stomach that requires organ transplantation, so the corresponding standard term is "Radical Gastrectomy".
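The way a hypothetical classification term inherits the code of its single corresponding standard term can be sketched as a two-stage lookup. This is an illustrative sketch only: the names `HYPOTHETICAL_TERMS`, `STANDARD_CODES`, and `code_for_hypothetical` are hypothetical, and the entries are limited to the three rows of Table 2, reproduced with the term names exactly as they appear there.

```python
# Hypothetical classification term -> its one corresponding standard term
# (rows of Table 2).
HYPOTHETICAL_TERMS = {
    "Liver Cancer under Radical Operation": "Partial hepatectomy",
    "Hepatic cyst resection": "Hepatopathy damages resection",
    "Diverticulectomy of stomach": "Excision of lesion of stomach",
}

# Standard term -> code (codes as listed in Table 2).
STANDARD_CODES = {
    "Partial hepatectomy": "50.22011",
    "Hepatopathy damages resection": "50.29009",
    "Excision of lesion of stomach": "43.42004",
}


def code_for_hypothetical(term):
    """Resolve a hypothetical classification term to (standard term, code).

    Returns None when the term is not in the library.
    """
    standard_term = HYPOTHETICAL_TERMS.get(term)
    if standard_term is None:
        return None
    return standard_term, STANDARD_CODES[standard_term]
```

In this sketch the choice between lesion resection, total resection, and partial resection is assumed to have been made when the library was built, so the lookup itself needs no clinical logic.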
Embodiment Two
As shown in Fig. 4, a specific automatic coding method for Chinese surgical procedure information includes:
Step S301: input Chinese surgical procedure information.
Step S302: perform natural language processing on the Chinese surgical procedure information to obtain one or more names to be encoded.
Step S303: based on the pre-established standard term library, expansion term library, and multi-code term library, search for a standard term, expansion term, or multi-code term that matches the name to be encoded, and determine the code of the successfully matched term as the code of the name to be encoded.
This embodiment adds a multi-code term library to the exemplary method. The library includes a number of multi-code terms and their codes.
A multi-code term is a specific type of surgical procedure whose performance presupposes another type of surgical procedure. Both the specific type of surgical procedure and the other type of surgical procedure are standard terms or expansion terms.
The code of a multi-code term is the combination of the code of the specific type of surgical procedure and the code of the other type of surgical procedure.
In clinical practice, if a doctor writes a surgical procedure name and performing that procedure presupposes another surgical procedure, then that surgical procedure name is a multi-code term.
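The combined code of such a term can be sketched as a pair of codes stored per entry. Everything named here is an assumption for illustration: `MultiCodeEntry`, `MULTI_CODE_TERMS`, and `codes_for_multi_code_term` are hypothetical names, and the term and the two code strings are placeholders rather than real ICD-9-CM-3 data. The order in which the two codes are combined is also an assumption, since the text only says that they are combined.

```python
from typing import NamedTuple, Optional, Tuple


class MultiCodeEntry(NamedTuple):
    own_code: str            # code of the specific procedure itself
    prerequisite_code: str   # code of the procedure it presupposes


# Placeholder entry; neither the term nor the codes are real ICD data.
MULTI_CODE_TERMS = {
    "procedure B (requires procedure A)": MultiCodeEntry("CODE_B", "CODE_A"),
}


def codes_for_multi_code_term(term: str) -> Optional[Tuple[str, str]]:
    """Return (own code, prerequisite code), or None if the term is unknown."""
    entry = MULTI_CODE_TERMS.get(term)
    if entry is None:
        return None
    return entry.own_code, entry.prerequisite_code
```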
In this embodiment, the multi-code term library can also be revised in real time, for example by adding new multi-code terms or deleting existing ones, so that the library better meets the needs of ICD-9-CM-3 coding.
For example, Table 3 shows some of the multi-code terms in the multi-code term library together with their corresponding standard terms and codes.
Table 3
Embodiment Three
As shown in Fig. 5, a specific automatic coding method for Chinese surgical procedure information includes:
Step S401: input Chinese surgical procedure information.
Step S402: perform natural language processing on the Chinese surgical procedure information to obtain one or more names to be encoded.
Step S403: based on a pre-established merge term library, pre-process the names to be encoded.
The merge term library includes a number of merge terms and their codes. A merge term is a single standard term that, under the International Classification of Diseases (ICD), can replace at least two other standard terms that occur together; those co-occurring standard terms are the merge objects of the merge term. The merge term library also includes the full set of merge objects of each merge term. A merge term is different from every one of its merge objects.
Clinically, a doctor may write several surgical procedure names in one medical record that, under ICD-9-CM-3, can be classified as a single surgical procedure name. In other words, those several surgical procedure names are actually steps of one surgical procedure.
In this embodiment, the merge term library can also be revised in real time, for example by adding new merge terms, deleting existing merge terms, or modifying merge objects, so that the library better meets the needs of ICD-9-CM-3 coding.
Table 4 shows one merge term in the merge term library together with its code and its full set of merge objects.
Table 4
Step S403 is specifically: determine whether the one or more names to be encoded contain the full set of merge objects of any one or more merge terms; if so, replace the full set of merge objects of each such merge term with the corresponding merge term.
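The merge pre-processing of step S403 can be sketched as a subset test followed by a replacement. This is a minimal sketch, assuming the names to be encoded arrive as a list of strings; `MERGE_TERMS` and `apply_merge_terms` are hypothetical names, and the procedure names in the sample entry are placeholders, not ICD terms. Appending the merge term at the end of the list is a design choice of this sketch, since the text does not say where the replacement should be placed.

```python
# Merge term -> full set of its merge objects (placeholder names).
MERGE_TERMS = {
    "merged procedure M": frozenset({"step procedure X", "step procedure Y"}),
}


def apply_merge_terms(names, merge_terms):
    """Replace every complete group of merge objects with its merge term.

    A group is replaced only when ALL of its merge objects are present.
    """
    result = list(names)
    for merge_term, merge_objects in merge_terms.items():
        if merge_objects.issubset(result):
            # Drop the individual steps, then add the single merged name.
            result = [n for n in result if n not in merge_objects]
            result.append(merge_term)
    return result
```

For instance, a list holding both placeholder steps plus an unrelated name comes back with the unrelated name followed by "merged procedure M", while a list holding only one of the two steps is returned unchanged.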
Step S404: based on the pre-established standard term library and expansion term library, search for a standard term or expansion term that matches the name to be encoded, and determine the code of the successfully matched standard term or expansion term as the code of the name to be encoded.
Embodiment Four
As shown in Fig. 6, a specific automatic coding method for Chinese surgical procedure information includes:
Step S501: input Chinese surgical procedure information.
Step S502: perform natural language processing on the Chinese surgical procedure information to obtain one or more names to be encoded.
Step S503: based on a pre-established omission term library, pre-process the names to be encoded.
The omission term library includes a number of omission terms and their codes. An omission term is a single standard term that, under ICD-9-CM-3, can replace at least two standard terms that occur together, and it is itself one of those co-occurring standard terms. The co-occurring standard terms are the omission objects of the omission term. The omission term library also includes the full set of omission objects of each omission term.
When certain surgical procedure names occur together in a medical record and one procedure is a preliminary procedure for the others, ICD-9-CM-3 provides that some of the corresponding surgical procedure names need not be coded.
In this embodiment, the omission term library can also be revised in real time, for example by adding new omission terms, deleting existing omission terms, or modifying omission objects, so that the library better meets the needs of ICD-9-CM-3 coding.
Table 5 shows one omission term in the omission term library together with its code and its full set of omission objects.
Table 5
Step S503 is specifically: determine whether the one or more names to be encoded contain the full set of omission objects of any one or more omission terms; if so, replace the full set of omission objects of each such omission term with the corresponding omission term.
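The omission pre-processing of step S503 differs from merging in that the surviving name is itself one of the co-occurring names. A minimal sketch, assuming the names to be encoded arrive as a list of strings; `OMISSION_TERMS` and `apply_omission_terms` are hypothetical names, and the procedure names in the sample entry are placeholders, not ICD terms.

```python
# Omission term -> full set of its omission objects. The omission term is
# itself a member of the set (placeholder names).
OMISSION_TERMS = {
    "main procedure P": frozenset({"main procedure P", "preliminary procedure Q"}),
}


def apply_omission_terms(names, omission_terms):
    """Keep only the omission term when its whole group is present.

    The other members of the group are dropped because they need no code.
    """
    result = list(names)
    for kept_term, omission_objects in omission_terms.items():
        if omission_objects.issubset(result):
            result = [
                n for n in result
                if n not in omission_objects or n == kept_term
            ]
    return result
```

Because the kept name is already in the list, filtering in place preserves the original order of the remaining names.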
Step S504 based on the standard terminology library pre-established and expands terminology bank, and lookup matches with title to be encoded Standard terminology or expand term, and by the standard terminology of successful match or expand term coding, be determined as title to be encoded Coding.
Embodiment five
As shown in fig. 7, for a kind of automatic coding of specific Chinese surgical procedure information, including:
Step S601 inputs Chinese surgical procedure information.
Step S602 carries out natural language processing to Chinese surgical procedure information, obtains one or more titles to be encoded.
Step S603 based on the standard terminology library pre-established and expands terminology bank, and lookup matches with title to be encoded Standard terminology or expand term, and by the standard terminology of successful match or expand term coding, be determined as title to be encoded Coding.
While searching for a standard term or expanded term that matches a name to be encoded, the search may fail to find any match. This is because every ontology in the ontology dictionary (whether a standard term or an expanded term) is a word related to surgical procedure names, whereas actual Chinese surgical procedure information often involves many kinds of medical concepts: not only surgical procedure names, but also disease names (such as "fracture of sternum flail chest"), drug names (such as "cetirizine"), medical consumable names (such as "pseudoxanthoma elasticum gum"), and so on. The present invention encodes only surgical procedure names; therefore, if disease names, drug names, medical consumable names, or the like appear in the Chinese surgical procedure information, the present invention may choose not to encode them. In addition, actual Chinese surgical procedure information may include words that denote a surgical procedure but from which the specific surgical procedure name cannot be determined, for example words that do not conform to the ICD-9-CM-3 classification system. For instance, "attrition" denotes a surgical procedure, but the concept is too general to determine which site is treated: facial attrition, cheekbone attrition, or Laser final guidance shell. Likewise, "denaturation art" denotes a surgical procedure, but it cannot be determined whether it refers specifically to male-to-female urethral displacement plasty or male-to-female vaginal reconstruction.
In view of the above, the exemplary method of the present invention also presets a no-code term library (a library of terms that are not to be encoded) containing several no-code terms. The no-code terms include: preset words that denote surgical procedure information but from which no surgical procedure name can be determined; preset disease names; preset drug names; and preset medical consumable names.
For example, Table 6 shows some of the no-code terms included in the no-code term library.
Table 6
Step S604: match each name to be encoded whose code has not been determined against the no-code terms in the no-code term library. If the match succeeds, perform a preset processing step indicating that this name is not to be encoded; if the match fails, send the name to a manual processing platform for manual processing.
That is, for a name to be encoded for which no matching standard term or expanded term has been found, if a matching no-code term can be found, the name is one of the following and is not encoded: a word that denotes surgical procedure information but from which no surgical procedure name can be determined, a disease name, a drug name, or a medical consumable name. A name for which no matching no-code term can be found does not belong to any of these types; in this embodiment it is sent to the manual processing platform for further manual handling. The present invention does not limit the specific manual processing procedure.
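The decision in step S604 can be sketched as follows. This is a minimal illustration only: the function name, the `Outcome` enum, and the use of exact set membership as the matching rule are assumptions, not part of the patent text.

```python
from enum import Enum


class Outcome(Enum):
    NOT_ENCODED = "not_encoded"      # matched a no-code term: deliberately left uncoded
    MANUAL_REVIEW = "manual_review"  # no match anywhere: route to the manual platform


def handle_unmatched_name(name, no_code_terms):
    """Decide what to do with a name for which no standard or expanded term was found.

    `no_code_terms` is a set of strings standing in for the no-code term library.
    Exact membership is an assumed matching rule.
    """
    if name in no_code_terms:
        return Outcome.NOT_ENCODED
    return Outcome.MANUAL_REVIEW
```

A caller would typically log the `NOT_ENCODED` outcome and queue `MANUAL_REVIEW` names for a human coder.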
Embodiment six
As shown in Figure 8, a specific embodiment, suitable for the exemplary method of the present invention, of performing natural language processing on Chinese surgical procedure information to obtain names to be encoded includes:
Step S71: preprocess the Chinese surgical procedure information string to obtain a preprocessed Chinese surgical procedure information string.
The purpose of this step is to convert the characters in the Chinese surgical procedure information string into a unified format to facilitate subsequent processing.
Optionally, this step may be implemented as follows: normalize the format of the non-Chinese characters in the Chinese surgical procedure information string (for example, convert all symbols in the string to half-width form or all to full-width form, and convert all English letters to upper case or all to lower case); and delete the non-medical terms from the string. The non-medical terms are provided by a pre-established non-medical term dictionary; a non-medical term is a word, phrase, or descriptive sentence that serves as a remark (such as "opening inspection", "benefit emergency treatment is remembered Account", "bed expense is exceeded at one's own expense", "being added more than one month, monthly received less than one month", "paediatrics is added", etc.).
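The preprocessing of step S71 can be sketched as below. The function name and parameters are illustrative assumptions. Unicode NFKC normalization is used here as one convenient way to fold full-width letters, digits, and punctuation to half-width; it also applies other compatibility mappings, so a production system might prefer an explicit mapping table.

```python
import unicodedata


def preprocess(text, non_medical_terms):
    """Normalize width and letter case, then delete remark-like non-medical terms.

    `non_medical_terms` stands in for the non-medical term dictionary.
    """
    def normalize(s):
        # NFKC folds full-width forms to half-width; upper() unifies letter case.
        return unicodedata.normalize("NFKC", s).upper()

    result = normalize(text)
    # Remove longer terms first so a short term never splits a longer one.
    for term in sorted(non_medical_terms, key=len, reverse=True):
        result = result.replace(normalize(term), "")
    return result
```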
Step S72: based on the pre-established ontology dictionary, direction dictionary, and grade dictionary, segment the preprocessed Chinese surgical procedure information string into several first-type substrings and/or second-type substrings.
A first-type substring can be matched directly with an ontology in the ontology dictionary; a second-type substring cannot. Each first-type or second-type substring produced by segmentation has independent semantics, i.e. the surgical procedure item information it represents is not affected by the characters before or after it.
The ontology dictionary includes the aforementioned standard term library and expanded term library. As shown in Table 1, it contains several ontologies and their one-to-one corresponding ontology codes; every standard term or expanded term is regarded as an ontology in the ontology dictionary.
It should be noted that when the automatic coding method for Chinese surgical procedure information provided by the present invention uses the aforementioned hypothetical classification term library and/or multi-code term library, the ontology dictionary should also include the hypothetical classification term library and/or multi-code term library (in which case hypothetical classification terms, multi-code terms, and omission terms are also regarded as ontologies in the ontology dictionary), so that a first-type or second-type substring obtained by segmentation, when used as a name to be encoded, can be matched with a hypothetical classification term, multi-code term, or omission term.
The direction dictionary contains several direction terms; a direction term is a word describing the side or orientation targeted by a surgical procedure item. For example, direction terms may be: unilateral, bilateral, left side, right side, both sides, one side, etc.
The grade dictionary contains several grade terms; a grade term is a word describing the level or type of a surgical procedure item. For example, grade terms may be: grade A, grade B, grade C, special grade, etc.
The purpose of step S72 is to segment the Chinese surgical procedure information into substrings with independent semantics (first-type or second-type substrings), so as to avoid the recognition errors that arise when several related characters are recognized separately.
After the first-type and second-type substrings obtained by segmentation are determined as names to be encoded, they may subsequently be pre-processed using the merging term library of Embodiment Three or the omission term library of Embodiment Four. Because the ontology corresponding to a first-type or second-type substring may be an expanded term, whereas the merge objects in the merging term library and the omitted objects in the omission term library are all standard terms, the expanded term corresponding to the substring must first be converted to its corresponding standard term; only then is the pre-processing with the merging term library or omission term library carried out.
As shown in Figure 9, step S72 specifically includes:
Step S80: determine whether the preprocessed Chinese surgical procedure information string contains a symbol; if it does, perform step S81; if it does not, perform step S82.
Step S81: match the characters between every two adjacent symbols in the preprocessed Chinese surgical procedure information string, taken as a whole, against the ontologies in the ontology dictionary; if the match succeeds, perform step S811; if the match fails, perform step S812.
Step S811: segment out the characters between the two adjacent symbols as a first-type substring.
Step S812: determine the two adjacent symbols and the characters between them as a not-yet-segmented string, then perform step S83.
The processing rule underlying steps S81, S811, and S812 is: all characters between adjacent symbols are matched against the ontologies as a whole; segmentation is performed only when a match is found, and is otherwise deferred.
For example, Table 7 shows the segmentation of "cardiac output monitors, and consumes technology, ventricular puncture, implanted conduit with oxygen": "cardiac output monitors, and consumes technology with oxygen" and "ventricular puncture, implanted conduit" are each the complete run of characters between symbols, and a matching ontology can be found for each, so each is segmented out separately.
Table 7
Step S82: using a mechanical word segmentation method, match the preprocessed Chinese surgical procedure information string against the ontologies in the ontology dictionary. If all characters in the string can be matched with ontologies, perform step S821; if the string contains a single character or multiple consecutive characters that fail to match any ontology, perform step S822.
Step S821: segment out the characters in the preprocessed Chinese surgical procedure information string according to the matched ontologies, as first-type substrings.
Step S822: determine whether the single character or multiple consecutive characters that failed to match an ontology are a direction term or a grade term; if so, perform step S8221; if not, perform step S8222.
The processing rule underlying steps S82, S821, and S822 is: the characters in the preprocessed Chinese surgical procedure information string are matched against the ontologies using a mechanical word segmentation method; segmentation is performed only when matching ontologies can be found for all characters, and is otherwise deferred.
For example, Table 8 shows the segmentation of "24 hours monitoring of blood pressure of electroencephalogram": using the mechanical word segmentation method, ontologies matching "electroencephalogram" and "24 hours monitoring of blood pressure" can each be found, so each is segmented out separately.
Table 8
The mechanical word segmentation method used in step S82 may be forward maximum matching, reverse maximum matching, or minimum segmentation. The specific segmentation process is not described further in this embodiment.
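Since the embodiment leaves the segmentation procedure unspecified, the following is only a generic sketch of forward maximum matching, one of the three options named above. The function name, the return shape, and the Chinese example strings in the usage note (back-translations of the Table 8 example) are assumptions.

```python
def forward_max_match(text, ontology, max_len=None):
    """Greedy left-to-right longest-match segmentation against a set of ontology strings.

    Returns a list of (segment, matched) pairs. Consecutive characters that match
    no ontology are grouped into a single unmatched segment.
    """
    if max_len is None:
        max_len = max((len(entry) for entry in ontology), default=1)
    segments = []
    i = 0
    while i < len(text):
        # Try the longest candidate first and shrink until an ontology matches.
        for size in range(min(max_len, len(text) - i), 0, -1):
            candidate = text[i:i + size]
            if candidate in ontology:
                segments.append((candidate, True))
                i += size
                break
        else:
            # No ontology starts at position i: extend or open an unmatched run.
            if segments and not segments[-1][1]:
                segments[-1] = (segments[-1][0] + text[i], False)
            else:
                segments.append((text[i], False))
            i += 1
    return segments
```

With an ontology set containing "脑电图" and "24小时血压监测", the string "脑电图24小时血压监测" splits into those two matched segments, which corresponds to the all-characters-matched case of step S821.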
Step S8221: according to the position, within the preprocessed Chinese surgical procedure information string, of the single character or multiple consecutive characters that failed to match an ontology, merge them with the ontology-matched character or characters immediately before or after them and segment the merged text out as a second-type substring; segment the remaining ontology-matched characters out as first-type substrings.
Step S8222: segment the entire preprocessed Chinese surgical procedure information string out as a single second-type substring.
The processing rule underlying steps S8221 and S8222 is: if the single character or multiple consecutive characters that failed to match an ontology are a direction term or a grade term, segmentation is performed, and during segmentation they are merged with the characters before or after them.
For example, Table 9 shows the segmentation of "lung volume reduction surgery right lung neoplasty": using the mechanical word segmentation method, ontologies matching "lung volume reduction surgery" and "lung neoplasty" can each be found, and "right" is a direction term. Therefore "right" is merged with "lung neoplasty" and segmented out, and "lung volume reduction surgery" is segmented out on its own.
Table 9
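Steps S8221 and S8222 can be sketched as follows, taking as input a list of (segment, matched) pairs such as a mechanical segmenter might produce. The function name is hypothetical, and the preference for merging a direction or grade term with the following matched segment (falling back to the preceding one) is an assumption; the text only says "before or after". The Chinese strings in the test are back-translations of the Table 9 example.

```python
def split_with_modifiers(segments, modifiers):
    """Merge unmatched direction/grade terms into a neighboring matched segment.

    Returns (first_type, second_type) lists of substrings, or None when some
    unmatched segment is not a modifier, in which case the caller treats the
    whole string as one second-type substring (step S8222).
    """
    pieces = []   # each item: [text, is_second_type]
    pending = ""  # modifier text waiting for the next matched segment
    for text, matched in segments:
        if matched:
            if pending:
                pieces.append([pending + text, True])
                pending = ""
            else:
                pieces.append([text, False])
        elif text in modifiers:
            pending += text
        else:
            return None
    if pending:
        if not pieces:
            return None
        # Trailing modifier: attach it to the preceding segment instead.
        pieces[-1][0] += pending
        pieces[-1][1] = True
    first_type = [t for t, second in pieces if not second]
    second_type = [t for t, second in pieces if second]
    return first_type, second_type
```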
Step S83: determine whether the not-yet-segmented string contains a preset special symbol; if it does, perform step S831; if it does not, perform step S833.
Step S831: look up the character model to which the not-yet-segmented string belongs, and segment the string according to the segmentation rule corresponding to that character model. Character models are provided by a pre-established character model library, and each character model has a one-to-one corresponding segmentation rule.
Step S832: match the segmented-out characters against the ontologies in the ontology dictionary; if the match succeeds, determine the segmented-out characters as a first-type substring; if the match fails, determine them as a second-type substring.
Step S833: determine the not-yet-segmented string directly as a second-type substring.
The processing rule underlying steps S83, S831, S832, and S833 is: when the not-yet-segmented string contains a preset special symbol, it is segmented according to the character model to which it belongs; otherwise it is segmented out directly. The characters segmented out on the basis of a character model are then matched against the ontologies again: those that match an ontology directly become first-type substrings, and those that do not become second-type substrings.
For example, the preset special symbols may include, but are not limited to, the period, colon, plus sign, semicolon, and slash.
For example, the character model library may contain the following character models and segmentation rules:
(1) Character model: XAY type, where A is a plus sign or colon;
Segmentation rule: segment XAY out as a whole;
(2) Character model: CDE type, where one of C and E is Chinese characters and D is a period or semicolon;
Segmentation rule: segment the Chinese characters in C and E out separately;
(3) Character model: STU type, where S and/or U is a single Chinese character and T is a slash;
Segmentation rule: segment STU out as a whole.
For example, to segment "blood fat (P).Renal function detects (P)", a lookup in the character model library shows that it belongs to the CDE type, so "blood fat (P)" and "renal function detects (P)" are segmented out separately.
For example, to segment "thoracoscope lower lung neoplasty+pulmonary belb resection", a lookup in the character model library shows that it belongs to the XAY type, so "thoracoscope lower lung neoplasty+pulmonary belb resection" is segmented out as a whole.
For example, to segment "3/4 laryngectomy and laryngeal reconstruction", a lookup in the character model library shows that it belongs to the STU type, so "3/4 laryngectomy and laryngeal reconstruction" is segmented out as a whole.
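The three example character models can be sketched as a small rule table. This is a simplified illustration under stated assumptions: it classifies only by which special symbol is present, ignores the conditions on the operands (for example whether C and E are Chinese characters), and checks the models in a fixed order (XAY, then STU, then CDE) that the text does not prescribe. The Chinese test strings are back-translations of the examples above.

```python
import re

WHOLE_SYMBOLS_XAY = "+:＋："   # plus sign or colon: keep the string whole
WHOLE_SYMBOLS_STU = "/／"      # slash: keep the string whole
SPLIT_PATTERN_CDE = r"[.;。；]"  # period or semicolon: split on it


def split_by_character_model(text):
    """Return (model_name, pieces) for a not-yet-segmented string.

    model_name is None when no special symbol is present, in which case the
    whole string is returned unchanged (step S833).
    """
    if any(symbol in text for symbol in WHOLE_SYMBOLS_XAY):
        return "XAY", [text]
    if any(symbol in text for symbol in WHOLE_SYMBOLS_STU):
        return "STU", [text]
    if re.search(SPLIT_PATTERN_CDE, text):
        pieces = [piece for piece in re.split(SPLIT_PATTERN_CDE, text) if piece]
        return "CDE", pieces
    return None, [text]
```

Each returned piece would then be matched against the ontology dictionary as in step S832 to decide whether it is a first-type or second-type substring.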
Step S73: determine the first-type substrings and second-type substrings obtained by segmentation as names to be encoded.
In performing natural language processing on Chinese surgical procedure information, this embodiment takes full account of the fact that the Chinese surgical procedure information entered by surgeons is natural language, is complex and varied in form, and follows no unified standard. It uses several pre-established dictionaries to segment and match the Chinese surgical procedure information string, and thereby identifies surgical procedure item names as names to be encoded.
Embodiment seven
As shown in Figure 10, a specific embodiment, suitable for the exemplary method of the present invention, of searching for the standard term or expanded term that matches a name to be encoded includes:
Step S90: if the name to be encoded is a first-type substring, determine the ontology matched by that first-type substring as the standard term or expanded term matching the name to be encoded; if the name to be encoded is a second-type substring, perform first-dimension parsing on the second-type substring and on each ontology in the ontology dictionary, obtaining several first-dimension parsing results for the second-type substring and several first-dimension parsing results for each ontology.
Optionally, in this step, taking the second-type substring and each ontology as the analysis object, the first-dimension parsing of the analysis object may include, but is not limited to:
(1) determining the direction term contained in the analysis object; if it contains no direction term, this parsing result is empty;
(2) determining the grade term contained in the analysis object; if it contains no grade term, this parsing result is empty;
(3) determining the characters inside brackets in the analysis object; if it contains no brackets, this parsing result is empty;
(4) determining the characters after the dash in the analysis object; if it contains no dash, this parsing result is empty; and
(5) determining the characters in the analysis object other than the direction term, the grade term, the characters in brackets, and the characters after the dash (hereinafter, the remaining characters), which are generally the core stem of the analysis object.
When the analysis object is a second-type substring, the first-dimension parsing results may include, but are not limited to: the direction term in the second-type substring, the grade term in the second-type substring, the characters in brackets in the second-type substring, the characters after the dash in the second-type substring, and the remaining characters in the second-type substring.
When the analysis object is an ontology, the first-dimension parsing results may include, but are not limited to: the direction term in the ontology, the grade term in the ontology, the characters in brackets in the ontology, the characters after the dash in the ontology, and the remaining characters in the ontology.
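The five first-dimension parsing results listed above can be sketched as a small parser. The function name, the dictionary keys, the order in which the parts are extracted (brackets, then dash, then direction and grade terms), and the Chinese example string (a back-translation of "left mastostomy (big)") are all assumptions made for illustration.

```python
import re


def parse_first_dimension(text, direction_terms, grade_terms):
    """Split an analysis object into direction, grade, bracket, after-dash and remaining parts."""
    result = {"direction": "", "grade": "", "bracket": "", "after_dash": "", "remaining": ""}
    rest = text

    # (3) characters inside half-width or full-width brackets
    match = re.search(r"[(（]([^)）]*)[)）]", rest)
    if match:
        result["bracket"] = match.group(1)
        rest = rest[:match.start()] + rest[match.end():]

    # (4) characters after a dash (em dash U+2014 or hyphen)
    match = re.search(r"(?:\u2014+|-+)(.*)$", rest)
    if match:
        result["after_dash"] = match.group(1)
        rest = rest[:match.start()]

    # (1) and (2): first direction term and first grade term found, longest terms first
    for key, terms in (("direction", direction_terms), ("grade", grade_terms)):
        for term in sorted(terms, key=len, reverse=True):
            if term in rest:
                result[key] = term
                rest = rest.replace(term, "", 1)
                break

    # (5) whatever is left is treated as the core stem
    result["remaining"] = rest.strip()
    return result
```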
Step S91: match each first-dimension parsing result of the second-type substring against the corresponding first-dimension parsing result of each ontology in the ontology dictionary, and check whether there is an ontology all of whose first-dimension parsing results match those of the second-type substring. If such an ontology exists, perform step S92; if not, perform step S93.
Step S92: determine the ontology found as the ontology matching the second-type substring.
Step S93: select some of the first-dimension parsing results of the second-type substring and match them against the corresponding subset of first-dimension parsing results of each ontology in the ontology dictionary, and check whether there is an ontology whose selected first-dimension parsing results match those of the second-type substring. If such an ontology exists, perform step S931; if not, perform step S932.
Step S931: determine the ontology found as the ontology matching the second-type substring.
Specifically, the direction term contained in the second-type substring is matched with the direction term contained in the ontology; the grade term contained in the second-type substring is matched with the grade term contained in the ontology; the characters in brackets in the second-type substring are matched with the characters in brackets in the ontology; the characters after the dash in the second-type substring are matched with the characters after the dash in the ontology; and the remaining characters in the second-type substring are matched with the remaining characters in the ontology.
If all first-dimension parsing results match, the ontology is determined as the ontology matching the second-type substring.
If some first-dimension parsing results do not match, a subset of the first-dimension parsing results is selected and matched respectively.
Because the remaining characters of a second-type substring are usually its core information, in a specific implementation the selected subset of first-dimension parsing results preferably includes at least the remaining characters of the second-type substring and the remaining characters of the ontology. For example, only the remaining characters and the characters after the dash of the analysis objects are selected and matched respectively; or only the remaining characters are selected and matched; or the remaining characters together with the direction term, the grade term, or the characters in brackets are selected and matched respectively.
For example, suppose a second-type substring is "left mastostomy (big)". First-dimension parsing of it yields the results shown in Table 10; the ontology that matches this second-type substring, together with its first-dimension parsing results, is shown in Table 11.
Table 10
The first-dimension parsing results of the ontology "mastostomy", which matches "left mastostomy (big)", are shown in Table 11:
Table 11
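The two-stage lookup of steps S91 to S931 (full match on every first-dimension result, then a partial match that at least includes the remaining characters) can be sketched as below. The function name, the dictionary layout, and the default choice of matching only on the remaining characters in the partial stage are assumptions; the Chinese strings in the test are back-translations of the example above.

```python
FIELDS = ("direction", "grade", "bracket", "after_dash", "remaining")


def find_by_first_dimension(query, ontology_parses, partial_fields=("remaining",)):
    """Look for an ontology whose first-dimension results match the query's.

    `query` and each value of `ontology_parses` are dicts keyed by FIELDS.
    Returns (ontology_name, "full" or "partial"), or (None, None) when neither
    stage finds a match and second-dimension parsing would be needed.
    """
    # Stage 1 (S91/S92): every first-dimension result must agree.
    for name, parsed in ontology_parses.items():
        if all(query[field] == parsed[field] for field in FIELDS):
            return name, "full"
    # Stage 2 (S93/S931): only the selected subset must agree.
    for name, parsed in ontology_parses.items():
        if all(query[field] == parsed[field] for field in partial_fields):
            return name, "partial"
    return None, None
```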
Step S932: perform second-dimension parsing on the second-type substring and on each ontology in the ontology dictionary, obtaining the second-dimension parsing results of the second-type substring and the second-dimension parsing results of each ontology in the ontology dictionary.
Optionally, in this step, taking the second-type substring and each ontology as the analysis object, the second-dimension parsing of the analysis object may include, but is not limited to:
(1) determining each Chinese character in the analysis object;
(2) determining the pinyin initial of each Chinese character in the analysis object;
(3) determining the pinyin final of each Chinese character in the analysis object;
(4) determining the first character of the analysis object;
(5) determining the pinyin of the first character of the analysis object; and
(6) determining the non-Chinese characters in the analysis object; if it contains no non-Chinese characters, this parsing result is empty.
When the analysis object is a second-type substring, the parsing results of the dimensions may include, but are not limited to: each Chinese character in the second-type substring, the pinyin initial of each Chinese character in the second-type substring, the pinyin final of each Chinese character in the second-type substring, the first character of the second-type substring, the pinyin of the first character of the second-type substring, and the non-Chinese characters in the second-type substring.
When the analysis object is an ontology, the parsing results may include, but are not limited to: each Chinese character in the ontology, the pinyin initial of each Chinese character in the ontology, the pinyin final of each Chinese character in the ontology, the first character of the ontology, the pinyin of the first character of the ontology, and the non-Chinese characters in the ontology.
For example, Table 12 shows the second-dimension parsing results of the second-type substring "deciduous teeth arrachement".
Table 12
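The six second-dimension parsing results can be sketched as follows. The tiny `PINYIN` table is a hypothetical stand-in for a full pinyin dictionary and covers only the characters of the example string "乳牙拔除术" (an assumed back-translation of "deciduous teeth arrachement"); treating "y" as an initial is likewise a simplification. The CJK range check covers only the basic Unified Ideographs block.

```python
# Hypothetical lookup: character -> (initial, final). A real system would use
# a complete pinyin dictionary.
PINYIN = {
    "乳": ("r", "u"),
    "牙": ("y", "a"),
    "拔": ("b", "a"),
    "除": ("ch", "u"),
    "术": ("sh", "u"),
}


def is_chinese(ch):
    """True for characters in the basic CJK Unified Ideographs block."""
    return "\u4e00" <= ch <= "\u9fff"


def parse_second_dimension(text, pinyin_table=PINYIN):
    """Return the six second-dimension parsing results for an analysis object."""
    hanzi = [ch for ch in text if is_chinese(ch)]
    first = text[0] if text else ""
    return {
        "characters": hanzi,
        "initials": [pinyin_table.get(ch, ("", ""))[0] for ch in hanzi],
        "finals": [pinyin_table.get(ch, ("", ""))[1] for ch in hanzi],
        "first_char": first,
        "first_char_pinyin": "".join(pinyin_table.get(first, ("", ""))),
        "non_chinese": [ch for ch in text if not is_chinese(ch)],
    }
```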
Step S933: based on the several second-dimension parsing results of the second-type substring and the several second-dimension parsing results of each ontology, calculate the matching degree between the second-type substring and each ontology.
Specifically, this step may calculate the similarity between the second-type substring and each ontology, or may calculate the total confidence between the second-type substring and each ontology. Compared with the similarity, the total confidence better reflects the matching degree between the second-type substring and each ontology, but its calculation is also more complex. In a specific implementation of step S933, if faster processing is required, the similarity calculation may be chosen; if a more accurate matching result is required, the total confidence calculation may be chosen.
One implementation of step S933 is to calculate the similarity between the second-type substring and each ontology, as follows:
The similarity between the second-type substring and each ontology is calculated according to the following formula, and the calculated similarity is determined as the matching degree between the second-type substring and each ontology:
where M denotes the similarity;
t denotes each second-dimension parsing result of the second-type substring;
q denotes the second-type substring;
t in q denotes each second dimension of the second-type substring;
d denotes the ontology;
tf(t in d) denotes, within the same second dimension, the frequency with which the second-dimension parsing result of the second-type substring matches the second-dimension parsing result of the ontology;
where T denotes the total number of ontologies in the ontology dictionary, and T(t) denotes the number of ontologies whose second-dimension parsing results match the second-dimension parsing results of the second-type substring;
t.getBoost() denotes the preset weight of each second dimension;
norm(t, d) denotes the length normalization factor of the ontology.
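The similarity formula itself is not reproduced in this text. As a hedged reading only, the symbols defined above (a match frequency tf, a quantity built from T and T(t), a per-dimension boost, and a length normalization factor) resemble a TF-IDF style score. One form consistent with these definitions, assuming that T and T(t) enter through an inverse document frequency idf(t) and that the factors are multiplied and summed over the second dimensions, would be:

```latex
M(q, d) = \sum_{t \in q} \mathrm{tf}(t \in d) \cdot \mathrm{idf}(t) \cdot t.\mathrm{getBoost}() \cdot \mathrm{norm}(t, d),
\qquad
\mathrm{idf}(t) = 1 + \log\frac{T}{T(t) + 1}
```

The exact form of idf(t), any exponent applied to it, and any additional query-level normalization are assumptions and may differ from the original formula.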
Another implementation of step S933 is to calculate the total confidence between the second-type substring and each ontology, as follows:
The total confidence between the second-type substring and each ontology is calculated as follows, and the calculated total confidence is determined as the matching degree between the second-type substring and each ontology:
1) Determine each Chinese character in the second-type substring.
2) Calculate the cosine confidence between the second-type substring and each matched ontology according to the following formula:
where N denotes the cosine confidence;
V denotes the total number of Chinese characters contained in the second-type substring and its matched ontology;
Q denotes the second-type substring;
d' denotes the ontology matched with the second-type substring;
w_{Q,j} denotes the frequency with which each Chinese character occurs in the second-type substring;
w_{d',j} denotes the frequency with which each Chinese character occurs in the ontology matched with the second-type substring;
j denotes the index of a Chinese character contained in the second-type substring and its matched ontology.
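The cosine confidence formula is not reproduced in this text. The sketch below assumes it is the standard cosine similarity between the two character-frequency vectors w_{Q,j} and w_{d',j}, taken over the Chinese characters of both strings; the function names and the restriction to the basic CJK block are illustrative assumptions.

```python
import math
from collections import Counter


def chinese_char_counts(text):
    """Count occurrences of each Chinese character (basic CJK block only)."""
    return Counter(ch for ch in text if "\u4e00" <= ch <= "\u9fff")


def cosine_confidence(substring, ontology):
    """Cosine similarity between the character-frequency vectors of two strings."""
    q = chinese_char_counts(substring)
    d = chinese_char_counts(ontology)
    # Counter returns 0 for characters absent from d, so this is the dot product.
    dot = sum(count * d[ch] for ch, count in q.items())
    norm_q = math.sqrt(sum(count * count for count in q.values()))
    norm_d = math.sqrt(sum(count * count for count in d.values()))
    if norm_q == 0 or norm_d == 0:
        return 0.0
    return dot / (norm_q * norm_d)
```

Identical strings score (up to rounding) 1, and strings sharing no Chinese characters score 0.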
3) Calculate the total confidence between the second-type substring and each matched ontology according to the following formula:
S = M × a + N × b
where S denotes the total confidence;
M denotes the similarity;
a denotes the preset weight corresponding to the similarity M;
b denotes the preset weight corresponding to the cosine confidence N;
and the similarity M is calculated according to the following formula:
where t denotes each second-dimension parsing result of the second-type substring;
q denotes the second-type substring;
t in q denotes each second dimension of the second-type substring;
d denotes the ontology;
tf(t in d) denotes, within the same second dimension, the frequency with which the second-dimension parsing result of the second-type substring matches the second-dimension parsing result of the ontology;
where T denotes the total number of ontologies in the ontology dictionary, and T(t) denotes the number of ontologies whose second-dimension parsing results match the second-dimension parsing results of the second-type substring;
t.getBoost() denotes the preset weight of each second dimension;
norm(t, d) denotes the length normalization factor of the ontology.
Step S934: according to the matching degree between the second-type substring and each ontology, determine one or more ontologies as the ontology or ontologies matching the second-type substring.
Optionally, this step may be implemented as follows: sort all ontologies by their matching degree with the second-type substring, and determine a preset number of the top-ranked ontologies (for example, the top 2) as the ontologies matching the second-type substring; or determine the one or more ontologies whose matching degree with the second-type substring reaches a preset threshold as the ontologies matching the second-type substring.
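The total confidence S = M × a + N × b and the two selection strategies of step S934 can be sketched together. The function names are hypothetical, and the weights, `top_k`, and `threshold` values used by a caller are presets that the text does not specify.

```python
def total_confidence(similarity, cosine, a, b):
    """Weighted sum S = M * a + N * b of similarity M and cosine confidence N."""
    return similarity * a + cosine * b


def select_matches(scores, top_k=None, threshold=None):
    """Pick matching ontologies from a {ontology: matching_degree} mapping.

    Either keep the `top_k` highest-scoring ontologies, or keep those whose
    score reaches `threshold`; supplying both applies the threshold first.
    """
    ranked = sorted(scores.items(), key=lambda item: item[1], reverse=True)
    if threshold is not None:
        ranked = [(name, score) for name, score in ranked if score >= threshold]
    if top_k is not None:
        ranked = ranked[:top_k]
    return ranked
```

Returning the scores alongside the names also supports the manual selection described for the final output, where a person chooses among the candidates by matching degree.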
In implementing the present invention, in order to make clear, and make use of, the matching degree between the second-type substring and each ontology matched with it, the final output may also include these matching degrees. For example, the matching degree between the second-type substring and each matched ontology is output, and one ontology is then selected manually, according to the matching degrees, as the ontology matching the second-type substring.
Step S94: determine the ontology matching the second-type substring, or the one or more ontologies that satisfy a preset matching condition with the second-type substring, as the standard term or expanded term matching the name to be encoded.
In performing natural language processing on Chinese surgical procedure information, this embodiment takes full account of the fact that the Chinese surgical procedure information entered by surgeons is natural language, is complex and varied in form, and follows no unified standard. It uses several pre-established dictionaries to segment and match the Chinese surgical procedure information string, and thereby finds the standard term or expanded term that matches the name to be encoded.
Exemplary system
Having described the method of the exemplary embodiment of the present invention, the automatic coding system for Chinese surgical procedure information of the exemplary embodiment of the present invention is introduced next with reference to Figure 11.
For the implementation of the automatic coding system for Chinese surgical procedure information, reference may be made to the implementation of the method above; repeated content is not described again. The term "module" as used below may be a combination of software and/or hardware that realizes a predetermined function. Although the system described in the following embodiment is preferably implemented in software, implementation in hardware, or in a combination of software and hardware, is also possible and contemplated.
As shown in Figure 11, the automatic coding system for Chinese surgical procedure information may include: an import module 111, a natural language processing module 112, and a matching and code-assignment module 113.
The import module 111 is used to input Chinese surgical procedure information.
The natural language processing module 112 is used to perform natural language processing on the Chinese surgical procedure information to obtain one or more names to be encoded.
The matching and code-assignment module 113 is used to search, based on the pre-established standard term library and expanded term library, for the standard term or expanded term that matches a name to be encoded, and to determine the code of the successfully matched standard term or expanded term as the code of the name to be encoded.
Optionally, as shown in Figure 11, the automatic coding system for Chinese surgical procedure information may also include: a merging processing module 114 and an omission processing module 115.
The merging processing module 114 is used to determine whether the one or more names to be encoded contain all the merge objects of any one or more merging terms and, if so, to replace all the merge objects of those merging terms with the corresponding merging terms.
The omission processing module 115 is used to pre-process the one or more names to be encoded, including: determining whether the one or more names to be encoded contain all the omitted objects of any one or more omission terms and, if so, replacing all the omitted objects of those omission terms with the corresponding omission terms.
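The way modules 111 to 113 fit together can be sketched as a small pipeline. The class name, the constructor parameters, the use of a plain dictionary for the combined standard and expanded term libraries, and the code value in the usage note are all hypothetical; the merging and omission modules are omitted for brevity.

```python
class AutoCodingSystem:
    """Minimal pipeline: input text -> names to be encoded -> codes."""

    def __init__(self, segmenter, term_codes):
        # Stands in for natural language processing module 112:
        # a callable mapping a string to a list of names to be encoded.
        self.segmenter = segmenter
        # Stands in for the term libraries used by module 113:
        # a mapping from standard or expanded term to its code.
        self.term_codes = term_codes

    def encode(self, text):
        """Return {name: code or None} for the text supplied by the import module 111."""
        names = self.segmenter(text)
        # A name with no matching term gets None and would go on to the
        # no-code term check or manual processing.
        return {name: self.term_codes.get(name) for name in names}
```

For instance, `AutoCodingSystem(lambda s: s.split(","), {"a": "01.0"}).encode("a,b")` yields a code for "a" and `None` for "b".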
In this exemplary system, wherein, it is standard terminology library, the expansion terminology bank, the Hypothetical classification terminology bank, described Odd encoder terminology bank, the specifying information for merging terminology bank are with reference to the automatic coding of above-mentioned Chinese surgical procedure information It introduces, overlaps will not be repeated.
Particular embodiments described above has carried out the purpose of the present invention, technical solution and advantageous effect further in detail Describe in detail it is bright, it should be understood that the above is only a specific embodiment of the present invention, the guarantor being not intended to limit the present invention Range is protected, all within the spirits and principles of the present invention, any modification, equivalent substitution, improvement and etc. done should be included in this Within the protection domain of invention.
Those skilled in the art will also be appreciated that the various illustrative components, blocks that the embodiment of the present invention is listed (illustrative logical block), unit and step can pass through the knot of electronic hardware, computer software, or both Conjunction is realized.To clearly show that the replaceability (interchangeability) of hardware and software, above-mentioned various explanations Property component (illustrative components), unit and step universally describe their function.Such work( Can be that specific application and the design requirement of whole system are depended on to realize by hardware or software.Those skilled in the art Can be for each specific function of applying, the realization of various methods can be used described, but this realization is understood not to Beyond the range of protection of the embodiment of the present invention.
The various illustrative logical blocks, units, or devices described in the embodiments of the present invention can implement or perform the described functions by means of a general-purpose processor, a digital signal processor, an application-specific integrated circuit (ASIC), a field-programmable gate array or other programmable logic device, discrete gate or transistor logic, discrete hardware components, or any combination of these. The general-purpose processor can be a microprocessor; alternatively, it can be any conventional processor, controller, microcontroller, or state machine. A processor can also be implemented as a combination of computing devices, for example a digital signal processor and a microprocessor, multiple microprocessors, one or more microprocessors combined with a digital signal processor core, or any other similar configuration.
The steps of the method or algorithm described in the embodiments of the present invention can be embedded directly in hardware, in a software module executed by a processor, or in a combination of the two. The software module can be stored in RAM, flash memory, ROM, EPROM, EEPROM, a register, a hard disk, a removable disk, a CD-ROM, or any other form of storage medium known in the art. For example, the storage medium can be connected to the processor so that the processor can read information from, and write information to, the storage medium. Alternatively, the storage medium can be integrated into the processor. The processor and the storage medium can be located in an ASIC, and the ASIC can be located in a user terminal. Alternatively, the processor and the storage medium can be located in different components of the user terminal.
In one or more exemplary designs, the functions described in the embodiments of the present invention can be implemented in hardware, software, firmware, or any combination of the three. If implemented in software, these functions can be stored on a computer-readable medium, or transmitted over it, in the form of one or more instructions or code. Computer-readable media include computer storage media and communication media that facilitate transferring a computer program from one place to another. A storage medium can be any available medium that a general-purpose or special-purpose computer can access. For example, such computer-readable media can include, but are not limited to, RAM, ROM, EEPROM, CD-ROM or other optical disc storage, magnetic disk storage or other magnetic storage devices, or any other medium that can carry or store program code in the form of instructions or data structures readable by a general-purpose or special-purpose computer or processor. In addition, any connection can properly be termed a computer-readable medium; for example, if software is transmitted from a website, server, or other remote source over a coaxial cable, fiber-optic cable, twisted pair, or digital subscriber line (DSL), or wirelessly by infrared, radio, or microwave, those media are also included in the definition of computer-readable medium. Disk and disc, as used here, include compact disc, laser disc, optical disc, DVD, floppy disk, and Blu-ray disc; disks usually reproduce data magnetically, while discs usually reproduce data optically with lasers. Combinations of the above can also be included in computer-readable media.

Claims (17)

1. An automatic coding method for Chinese surgical procedure information, comprising:
Step 1: inputting Chinese surgical procedure information;
Step 2: performing natural language processing on the Chinese surgical procedure information to obtain one or more names to be coded;
Step 3: based on a pre-established standard term library and expanded term library, searching for the standard term or expanded term that matches the name to be coded, and determining the code of the successfully matched standard term or expanded term as the code of the name to be coded;
wherein the standard term library includes a number of standard terms and their codes, each standard term being a surgical procedure name specified in the International Classification of Diseases (ICD), and the code of the standard term being the code of the corresponding surgical procedure name specified in the ICD;
the expanded term library includes a number of expanded terms and their codes, each expanded term being a word that has a synonym relationship or a genus-species relationship with a standard term;
the code of each expanded term is identical to the code of the standard term with which it has the synonym or genus-species relationship;
Step 2 includes:
Step 21: preprocessing the Chinese surgical procedure information string to obtain a preprocessed Chinese surgical procedure information string;
Step 22: based on a pre-established ontology dictionary, direction dictionary, and grade dictionary, segmenting the preprocessed Chinese surgical procedure information string into a number of first-type substrings and/or second-type substrings;
wherein the ontology dictionary includes the standard term library and the expanded term library, and the standard terms and expanded terms are ontology terms; the direction dictionary includes a number of directional terms, each being a word that describes the location targeted by a surgical procedure; the grade dictionary includes a number of grade terms, each being a word that describes the level or type of a surgical procedure;
a first-type substring can be matched directly with an ontology term in the ontology dictionary, whereas a second-type substring cannot;
Step 23: determining the segmented first-type substrings and second-type substrings as the names to be coded;
Step 21 includes:
normalizing the format of the non-Chinese characters in the Chinese surgical procedure information string, and deleting the non-medical terms in the string, to obtain the preprocessed Chinese surgical procedure information string, wherein the non-medical terms are provided by a pre-established non-medical term dictionary and are words, phrases, or sentences that serve as remarks;
Step 22 includes:
determining whether the preprocessed Chinese surgical procedure information string contains symbols;
if the preprocessed string contains symbols, matching the characters between every two adjacent symbols, taken as a whole, against the ontology terms in the ontology dictionary; if the match succeeds, cutting out the characters between the two adjacent symbols as a first-type substring; if the match fails, determining the two adjacent symbols and the characters between them as a deferred-segmentation string, and determining whether the deferred-segmentation string contains a preset special character;
if the deferred-segmentation string contains a special character, looking up the character model to which the deferred-segmentation string belongs, segmenting the string according to the segmentation rule corresponding to that character model, and matching the segmented characters against the ontology terms in the ontology dictionary; if the match succeeds, taking the segmented characters as a first-type substring; if the match fails, taking them as a second-type substring; wherein the character models are provided by a pre-established character model library, and each character model has a one-to-one corresponding segmentation rule;
if the deferred-segmentation string does not contain a special character, determining it directly as a second-type substring;
if the preprocessed string contains no symbols, using mechanical word segmentation to match single characters or runs of consecutive characters in the preprocessed string against the ontology terms in the ontology dictionary;
if all characters in the preprocessed string can be matched with ontology terms, cutting out the single characters or runs of consecutive characters as first-type substrings according to the matched ontology terms;
if the preprocessed string contains a single character or run of consecutive characters that cannot be matched with an ontology term, determining whether the unmatched character or run is a directional term or a grade term;
when the unmatched character or run is a directional term or a grade term, merging it, according to its position in the preprocessed string, with the matchable single character or run of consecutive characters immediately before or after it, cutting out the merged characters as a second-type substring, and cutting out the remaining matchable single characters or runs in the preprocessed string as first-type substrings;
when the unmatched character or run is neither a directional term nor a grade term, cutting out the preprocessed string as a whole as a second-type substring.
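The symbol-free branch of Step 22 can be illustrated with a minimal sketch. It assumes that "mechanical word segmentation" is realized as greedy forward maximum matching, which is only one common dictionary-based method; the claim does not name a specific algorithm. The function names, the sample ontology terms, and the rule "merge a modifier with the following term, or with the preceding term when it is last" are hypothetical choices made for illustration.

```python
def forward_max_match(text, ontology):
    """Greedy longest-prefix matching; returns (piece, matched) pairs."""
    max_len = max((len(term) for term in ontology), default=1)
    pieces, i = [], 0
    while i < len(text):
        # Try the longest candidate first and shrink until a term matches.
        for size in range(min(max_len, len(text) - i), 0, -1):
            candidate = text[i:i + size]
            if candidate in ontology:
                pieces.append((candidate, True))
                i += size
                break
        else:
            # No ontology term starts here: emit one unmatched character.
            pieces.append((text[i], False))
            i += 1
    return pieces


def segment_no_symbols(text, ontology, modifiers):
    """Return (first_type, second_type) lists for a symbol-free string.

    `modifiers` is the union of directional terms and grade terms.
    """
    runs = []
    for piece, matched in forward_max_match(text, ontology):
        # Join consecutive unmatched characters into one run.
        if runs and not matched and not runs[-1][1]:
            runs[-1] = (runs[-1][0] + piece, False)
        else:
            runs.append((piece, matched))
    if all(matched for _, matched in runs):
        return [piece for piece, _ in runs], []
    if any(not matched and piece not in modifiers for piece, matched in runs):
        # Unmatched text that is not a modifier: keep the whole string.
        return [], [text]
    first_type, second_type, i = [], [], 0
    while i < len(runs):
        piece, matched = runs[i]
        if matched:
            first_type.append(piece)
            i += 1
        elif i + 1 < len(runs):
            # Assumed rule: a modifier merges with the term that follows it.
            second_type.append(piece + runs[i + 1][0])
            i += 2
        else:
            # Trailing modifier: merge with the preceding term, if any.
            previous = first_type.pop() if first_type else ""
            second_type.append(previous + piece)
            i += 1
    return first_type, second_type
```

With a hypothetical ontology containing 肾切除术 and 胆囊切除术 and the modifier 左, the string 左肾切除术胆囊切除术 yields one second-type substring (左肾切除术) and one first-type substring (胆囊切除术).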
2. The automatic coding method for Chinese surgical procedure information according to claim 1, wherein
Step 3 further includes: based on a pre-established hypothetical classification term library, searching for the hypothetical classification term that matches the name to be coded, and determining the code of the successfully matched hypothetical classification term as the code of the name to be coded;
the hypothetical classification term library includes a number of hypothetical classification terms and their codes;
each hypothetical classification term denotes a specific type of treatment, the specific type of treatment corresponds to multiple excision procedure types, and the multiple excision procedure types are standard terms;
the code of the hypothetical classification term is identical to the code of the total organ excision procedure type or of the partial excision procedure type among the multiple excision procedure types.
3. The automatic coding method for Chinese surgical procedure information according to claim 1, wherein
Step 3 further includes: based on a pre-established multi-code term library, searching for the multi-code term that matches the name to be coded, and determining the code of the successfully matched multi-code term as the code of the name to be coded;
the multi-code term library includes a number of multi-code terms and their codes;
each multi-code term is a specific surgical procedure type whose performance presupposes another surgical procedure type; the specific surgical procedure type and the other surgical procedure type are standard terms or expanded terms;
the code of the multi-code term is the combination of the code of the specific surgical procedure type and the code of the other surgical procedure type.
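The lookup in Step 3, extended with the multi-code case, might be sketched as follows. The library contents, procedure names, and codes below are placeholders invented for illustration (they are not real ICD entries), and checking the multi-code library before the single-code libraries is an assumption; the claims do not fix an order.

```python
# Hypothetical libraries: names and codes are placeholders, not ICD data.
STANDARD = {"procedure_a": "00.01", "procedure_b": "00.02"}
EXPANDED = {"procedure_a_synonym": "00.01"}  # shares the code of procedure_a
# Maps a specific procedure to the procedure it presupposes.
MULTI_CODE = {"procedure_b": "procedure_a"}


def lookup_single(name):
    """Return the code of a standard or expanded term, or None."""
    for library in (STANDARD, EXPANDED):
        if name in library:
            return library[name]
    return None


def assign_codes(name):
    """Return the list of codes for a name to be coded (empty if unknown)."""
    if name in MULTI_CODE:
        # Combined code: the specific procedure plus its prerequisite.
        return [lookup_single(name), lookup_single(MULTI_CODE[name])]
    code = lookup_single(name)
    return [code] if code is not None else []
```

Under these assumptions, `assign_codes("procedure_b")` returns both codes, while a synonym resolves to the single code of its standard term.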
4. The automatic coding method for Chinese surgical procedure information according to claim 1, wherein
before Step 3, the method further includes: preprocessing the one or more names to be coded based on a pre-established merged term library;
the merged term library includes a number of merged terms and their codes, wherein each merged term is a single standard term specified in the International Classification of Diseases (ICD) that can replace at least two other standard terms occurring together; the at least two other standard terms occurring together are the merge objects of the merged term; the merged term library further includes all merge objects of each merged term;
the step of preprocessing the one or more names to be coded based on the merged term library includes: determining whether the one or more names to be coded contain all merge objects of any one or more merged terms and, if so, replacing all merge objects of each such merged term with the corresponding merged term.
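The merge preprocessing described above can be sketched as a list rewrite. The function name, the data layout (a mapping from each merged term to the set of its merge objects), and the choice to place the merged term at the position of the first replaced name are assumptions made for illustration.

```python
def apply_merged_terms(names, merged_library):
    """Replace complete groups of merge objects with their merged term.

    names: list of names to be coded, in order.
    merged_library: dict mapping a merged term to the set of its merge objects.
    """
    result = list(names)
    for merged_term, objects in merged_library.items():
        # Replace only when every merge object of the term is present.
        if objects and all(obj in result for obj in objects):
            first = min(result.index(obj) for obj in objects)
            result = [name for name in result if name not in objects]
            # Everything before `first` was kept, so the index is still valid.
            result.insert(first, merged_term)
    return result
```

Because an omission term also replaces a complete group of co-occurring standard terms, the same sketch would cover the omission preprocessing of claim 5 if the omission term library is stored in the same layout.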
5. The automatic coding method for Chinese surgical procedure information according to claim 1, wherein
before Step 3, the method further includes: preprocessing the one or more names to be coded based on a pre-established omission term library;
the omission term library includes a number of omission terms and their codes, wherein each omission term is a single standard term specified in the International Classification of Diseases (ICD) that can replace at least two standard terms occurring together; the omission term is one of the at least two standard terms occurring together; the at least two standard terms occurring together are the omission objects of the omission term; the omission term library further includes all omission objects of each omission term;
the step of preprocessing the one or more names to be coded based on the omission term library includes: determining whether the one or more names to be coded contain all omission objects of any one or more omission terms and, if so, replacing all omission objects of each such omission term with the corresponding omission term.
6. The automatic coding method for Chinese surgical procedure information according to any one of claims 1 to 5, wherein after Step 3 the method further includes:
Step 4: matching each name to be coded for which no code has been determined against the no-code terms in a no-code term library; if the match succeeds, performing a preset processing step to indicate that the name to be coded cannot be coded; if the match fails, sending the name to be coded to a manual processing platform for manual handling;
wherein the no-code term library includes a number of no-code terms;
the no-code terms include:
preset words that indicate surgical procedure information but from which no surgical procedure name can be determined;
preset disease names;
preset drug names; and
preset medical consumable names.
7. The automatic coding method for Chinese surgical procedure information according to claim 1, wherein the step in Step 3 of searching for the standard term or expanded term that matches the name to be coded includes:
if the name to be coded is a first-type substring, determining the ontology term matched by the first-type substring as the standard term or expanded term that matches the name to be coded;
if the name to be coded is a second-type substring:
performing first-dimension parsing on the second-type substring and on each ontology term in the ontology dictionary, to obtain a number of first-dimension parsing results for the second-type substring and for each ontology term;
matching each first-dimension parsing result of the second-type substring against the corresponding first-dimension parsing result of each ontology term in the ontology dictionary, and determining whether there is an ontology term all of whose first-dimension parsing results match those of the second-type substring;
if such an ontology term exists, determining it as the ontology term matched by the second-type substring;
if no such ontology term exists, selecting a subset of the first-dimension parsing results of the second-type substring, matching it against the same subset of the first-dimension parsing results of each ontology term in the ontology dictionary, and determining whether there is an ontology term whose subset of first-dimension parsing results matches that of the second-type substring;
if such an ontology term exists, determining it as the ontology term matched by the second-type substring;
if no such ontology term exists, performing second-dimension parsing on the second-type substring and on each ontology term in the ontology dictionary, to obtain a number of second-dimension parsing results for the second-type substring and for each ontology term;
calculating the matching degree between the second-type substring and each ontology term based on the second-dimension parsing results of the second-type substring and of the ontology term;
determining, according to the matching degree between the second-type substring and each ontology term, one or more ontology terms as the ontology terms matched by the second-type substring;
determining the ontology term matched by the second-type substring as the standard term or expanded term that matches the name to be coded.
8. The automatic coding method for Chinese surgical procedure information according to claim 7, wherein the first-dimension parsing results of the second-type substring or of the ontology term are, respectively:
the directional term in the second-type substring or the ontology term;
the grade term in the second-type substring or the ontology term;
the characters inside brackets in the second-type substring or the ontology term;
the characters after a dash in the second-type substring or the ontology term; and
the characters in the second-type substring or the ontology term other than the directional term, the grade term, the characters inside brackets, and the characters after a dash;
the subset of the first-dimension parsing results of the second-type substring or of the ontology term includes: the characters in the second-type substring or the ontology term other than the directional term, the grade term, the characters inside brackets, and the characters after a dash; and one or more of the following:
the directional term and the grade term in the second-type substring or the ontology term;
the characters inside brackets in the second-type substring or the ontology term;
the characters after a dash in the second-type substring or the ontology term.
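The five first-dimension parsing results listed above could be extracted along the following lines. This is a sketch under stated assumptions: the dictionary keys, the helper names, the sample terms, and the parsing order (dash first, then brackets, then directional and grade terms) are hypothetical, and only the first occurrence of each element is extracted.

```python
import re


def _take_longest(text, terms):
    """Remove and return the longest term from `terms` that occurs in `text`."""
    for term in sorted(terms, key=len, reverse=True):
        if term in text:
            return term, text.replace(term, "", 1)
    return "", text


def parse_first_dimension(text, directional_terms, grade_terms):
    """Split a string into the five first-dimension parsing results."""
    # Characters after the first dash (ASCII, em dash, or full-width hyphen).
    parts = re.split(r"[-\u2014\uFF0D]+", text, maxsplit=1)
    head = parts[0]
    after_dash = parts[1] if len(parts) > 1 else ""
    # Characters inside the first pair of ASCII or full-width brackets.
    bracket = ""
    match = re.search(r"[(\uFF08]([^()\uFF08\uFF09]*)[)\uFF09]", head)
    if match:
        bracket = match.group(1)
        head = head[:match.start()] + head[match.end():]
    direction, head = _take_longest(head, directional_terms)
    grade, head = _take_longest(head, grade_terms)
    return {"direction": direction, "grade": grade, "bracket": bracket,
            "after_dash": after_dash, "core": head}


ALL_KEYS = ("direction", "grade", "bracket", "after_dash", "core")


def dimensions_match(a, b, keys=ALL_KEYS):
    """Compare two parsing results on all dimensions or on a chosen subset."""
    return all(a[key] == b[key] for key in keys)
```

Calling `dimensions_match` with the default keys corresponds to the full first-dimension match of claim 7, while passing a subset such as `("core", "bracket")` corresponds to the partial match that always includes the remaining core characters.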
9. The automatic coding method for Chinese surgical procedure information according to claim 7, wherein the second-dimension parsing results of the second-type substring or of the ontology term are, respectively:
each Chinese character in the second-type substring or the ontology term;
the pinyin initial of each Chinese character in the second-type substring or the ontology term;
the pinyin final of each Chinese character in the second-type substring or the ontology term;
the first character of the second-type substring or the ontology term;
the pinyin of the first character of the second-type substring or the ontology term; and
the non-Chinese characters in the second-type substring or the ontology term.
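A minimal sketch of the second-dimension parsing follows. The pinyin table is a tiny hypothetical stand-in that covers only the demo characters (a real system would need a complete character-to-pinyin resource), treating y and w as initials is a simplification, and the dictionary keys are illustrative names.

```python
# Hypothetical pinyin table covering only the demo characters.
PINYIN = {"肾": "shen", "切": "qie", "除": "chu", "术": "shu"}
# Two-letter initials come first so that "sh" wins over "s".
INITIALS = ("zh", "ch", "sh", "b", "p", "m", "f", "d", "t", "n", "l",
            "g", "k", "h", "j", "q", "x", "r", "z", "c", "s", "y", "w")


def is_chinese(ch):
    """True for characters in the basic CJK Unified Ideographs block."""
    return "\u4e00" <= ch <= "\u9fff"


def split_pinyin(syllable):
    """Split a toneless pinyin syllable into (initial, final)."""
    for initial in INITIALS:
        if syllable.startswith(initial):
            return initial, syllable[len(initial):]
    return "", syllable


def parse_second_dimension(text):
    """Return the six second-dimension parsing results of a string."""
    hanzi = [ch for ch in text if is_chinese(ch)]
    parts = [split_pinyin(PINYIN.get(ch, "")) for ch in hanzi]
    first = text[:1]
    return {
        "chars": hanzi,
        "initials": [initial for initial, _ in parts],
        "finals": [final for _, final in parts],
        "first_char": first,
        "first_pinyin": PINYIN.get(first, ""),
        "non_chinese": "".join(ch for ch in text if not is_chinese(ch)),
    }
```

Parsing the same fields for every ontology term lets a misspelled or homophonic input still share initials, finals, or first-character pinyin with the intended term, which is presumably why these dimensions feed the similarity calculation.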
10. The automatic coding method for Chinese surgical procedure information according to claim 7, wherein the step of calculating the matching degree between the second-type substring and each ontology term, based on the second-dimension parsing results of the second-type substring and of the ontology term, includes:
calculating the similarity between the second-type substring and each ontology term according to the following formula:
wherein M denotes the similarity;
t denotes each second-dimension parsing result of the second-type substring;
q denotes the second-type substring;
t in q denotes each second dimension of the second-type substring;
d denotes the ontology term;
tf(t in d) denotes the frequency with which, within the same second dimension, the second-dimension parsing result of the second-type substring matches the second-dimension parsing result of the ontology term;
wherein T denotes the total number of ontology terms in the ontology dictionary, and T(t) denotes the total number of ontology terms whose second-dimension parsing results match the second-dimension parsing results of the second-type substring;
t.getBoost() denotes the preset weight of each second dimension;
norm(t, d) denotes the length normalization factor of the ontology term;
the calculated similarity is determined as the matching degree between the second-type substring and each ontology term.
11. The automatic coding method for Chinese surgical procedure information according to claim 7, wherein the step of calculating the matching degree between the second-type substring and each ontology term, based on the second-dimension parsing results of the second-type substring and of the ontology term, includes:
determining each Chinese character in the second-type substring;
calculating the cosine confidence of each ontology term matched by the second-type substring according to the following formula:
calculating the total confidence of each ontology term matched by the second-type substring according to the following formula:
S = M × a + N × b
wherein N denotes the cosine confidence;
V denotes the total number of Chinese characters contained in the second-type substring and its matched ontology term;
Q denotes the second-type substring;
d' denotes the ontology term matched by the second-type substring;
w_{Q,j} denotes the frequency with which each Chinese character occurs in the second-type substring;
w_{d',j} denotes the frequency with which each Chinese character occurs in the ontology term matched by the second-type substring;
j denotes the index of a Chinese character contained in the second-type substring and its matched ontology term;
S denotes the total confidence;
M denotes the similarity;
a denotes the preset weight corresponding to the similarity M;
b denotes the preset weight corresponding to the cosine confidence N;
and the similarity M is calculated according to the following formula:
wherein t denotes each second-dimension parsing result of the second-type substring;
q denotes the second-type substring;
t in q denotes each second dimension of the second-type substring;
d denotes the ontology term;
tf(t in d) denotes the frequency with which, within the same second dimension, the second-dimension parsing result of the second-type substring matches the second-dimension parsing result of the ontology term;
wherein T denotes the total number of ontology terms in the ontology dictionary, and T(t) denotes the total number of ontology terms whose second-dimension parsing results match the second-dimension parsing results of the second-type substring;
t.getBoost() denotes the preset weight of each second dimension;
norm(t, d) denotes the length normalization factor of the ontology term;
the calculated total confidence is determined as the matching degree between the second-type substring and each ontology term.
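The cosine formula itself is not reproduced in this text, so the following is only a sketch under a stated assumption: that N is the standard cosine similarity between the character-frequency vectors of the second-type substring Q and the ontology term d', which is what the definitions of V, j, w_{Q,j}, and w_{d',j} suggest. The similarity M is taken as a precomputed input, and the weights a and b in the example are arbitrary placeholders.

```python
import math
from collections import Counter


def is_chinese(ch):
    """True for characters in the basic CJK Unified Ideographs block."""
    return "\u4e00" <= ch <= "\u9fff"


def cosine_confidence(query, ontology_term):
    """Cosine similarity of Chinese-character frequency vectors (assumed N)."""
    w_q = Counter(ch for ch in query if is_chinese(ch))
    w_d = Counter(ch for ch in ontology_term if is_chinese(ch))
    # Dot product over the shared vocabulary; missing characters count as 0.
    dot = sum(w_q[ch] * w_d[ch] for ch in set(w_q) | set(w_d))
    norm_q = math.sqrt(sum(v * v for v in w_q.values()))
    norm_d = math.sqrt(sum(v * v for v in w_d.values()))
    if norm_q == 0 or norm_d == 0:
        return 0.0
    return dot / (norm_q * norm_d)


def total_confidence(m, n, a, b):
    """S = M x a + N x b, with preset weights a and b."""
    return m * a + n * b
```

For example, with a hypothetical precomputed similarity M = 0.5 and weights a = 0.6, b = 0.4, an ontology term identical to the query (N = 1) would receive a total confidence of 0.7.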
12. The automatic coding method for Chinese surgical procedure information according to claim 7, wherein the step of determining, according to the matching degree between the second-type substring and each ontology term, one or more ontology terms as the ontology terms matched by the second-type substring includes:
sorting all ontology terms by their matching degree with the second-type substring, and determining a preset number of the top-ranked ontology terms as the ontology terms matched by the second-type substring;
or,
determining the one or more ontology terms whose matching degree with the second-type substring reaches a preset threshold as the ontology terms matched by the second-type substring.
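The two selection strategies in this claim can be sketched as follows, assuming the matching degrees have already been computed and stored in a mapping from ontology term to score. The function names and the sample scores are hypothetical.

```python
def select_top_k(scores, k):
    """Return the k ontology terms with the highest matching degree."""
    ranked = sorted(scores.items(), key=lambda item: item[1], reverse=True)
    return [term for term, _ in ranked[:k]]


def select_by_threshold(scores, threshold):
    """Return every ontology term whose matching degree reaches the threshold."""
    return [term for term, score in scores.items() if score >= threshold]
```

The top-k variant always returns candidates (useful when a reviewer picks from a short list), whereas the threshold variant can return nothing, which would leave the name without a determined code.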
13. An automatic coding system for Chinese surgical procedure information, comprising:
an input module, for inputting Chinese surgical procedure information;
a natural language processing module, for performing natural language processing on the Chinese surgical procedure information to obtain one or more names to be coded;
a matching and code assignment module, for searching, based on a pre-established standard term library and expanded term library, for the standard term or expanded term that matches the name to be coded, and determining the code of the successfully matched standard term or expanded term as the code of the name to be coded;
wherein the standard term library includes a number of standard terms and their codes, each standard term being a surgical procedure name specified in the International Classification of Diseases (ICD), and the code of the standard term being the code of the corresponding surgical procedure name specified in the ICD;
the expanded term library includes a number of expanded terms and their codes, each expanded term being a word that has a synonym relationship or a genus-species relationship with a standard term;
the code of each expanded term is identical to the code of the standard term with which it has the synonym or genus-species relationship;
In the natural language processing module, natural language processing is carried out to the Chinese surgical procedure information, obtain one or Multiple titles to be encoded are in the following way:
Step 21, the Chinese surgical procedure information character string is pre-processed, obtains pretreated Chinese surgical procedure Information character string;
Step 22, based on the ontology dictionary, orientation dictionary, grade dictionary pre-established, the pretreated Chinese is performed the operation Operation information character string is cut into several first kind substrings and/or Second Type substring;
Wherein, the ontology dictionary includes the standard terminology library and expands terminology bank, the standard terminology and the expansion art Language is ontology;The orientation dictionary includes several directional terminologies, and the directional terminology is targeted for describing surgical procedure Orientation word;The grade dictionary includes several grade terms, and the grade term is the grade for describing surgical procedure Not, the word of type;
The first kind substring can directly be matched with the ontology in the ontology dictionary, the sub- character of Second Type String can not directly be matched with the ontology in the ontology dictionary;
Step 23, the first kind substring being syncopated as and Second Type substring are determined as title to be encoded;
The step 21 includes:
To the non-Chinese character in the Chinese surgical procedure information character string into row format normalized, and delete the Chinese hand Non-medical term in art operation information character string obtains pretreated Chinese surgical procedure information character string, wherein described The non-medical term dictionary that non-medical term is pre-established by one provides, and the word that the non-medical term has been remarks effect Language, phrase or sentence;
The step 22 includes:
Judge the pretreated Chinese surgical procedure information character string whether comprising symbol;
If the pretreated Chinese surgical procedure information character string includes symbol, by the pretreated Chinese hand Character in art operation information character string between every adjacent two symbols is matched as a whole with the ontology in ontology dictionary; If successful match, using the character cutting between the adjacent two symbols out as first kind substring;If matching is lost Lose, then by the adjacent two symbols and its between character be determined as wouldn't cutting character string, and judge described in wouldn't cutting word Whether preset additional character is included in symbol string;
If it is described wouldn't in cutting character string comprising additional character, search described in wouldn't be belonging to cutting character string character mould Type, and the corresponding segmentation rules of character model according to belonging to this to it is described wouldn't cutting character string carry out cutting, will be syncopated as The character come is matched with the ontology in ontology dictionary, if successful match, using the character cut out as the first kind Type substring, if it fails to match, using the character cut out as Second Type substring;Wherein, the character The character model library that model is pre-established by one provides, and the character model has one-to-one segmentation rules;
If described wouldn't not include additional character in cutting character string, by it is described wouldn't cutting character string be determined directly as second Type substring;
If the pretreated Chinese surgical procedure information character string is not comprising symbol, using mechanical Chinese word segmentation method by described in In single character or multiple continuous characters and the ontology dictionary in pretreated Chinese surgical procedure information character string Ontology matched;
If all characters in the pretreated Chinese surgical procedure information character string can be with Ontology Matching, foundation Matched ontology by the single character in the pretreated Chinese surgical procedure information character string or multiple continuous words Symbol is cut out as first kind substring;
Fail and the single character of Ontology Matching or more if existing in the pretreated Chinese surgical procedure information character string Whether a continuous character then fails with the single character of Ontology Matching or multiple continuous characters to be directional terminology described in judgement Or grade term;
When described fail with the single character of Ontology Matching or multiple continuous characters as directional terminology or grade term, according to It is described to fail with the single character of Ontology Matching or multiple continuous characters in the pretreated Chinese surgical procedure information Position in character string fails described and the single character of Ontology Matching or multiple continuous characters and energy before or after it It is enough merge with the single character of Ontology Matching or multiple continuous characters cut out as Second Type substring, and by institute Stating remaining in pretreated Chinese surgical procedure information character string can be with the single character of Ontology Matching or multiple continuous Character cutting out as first kind substring;
When the single character or multiple consecutive characters that fail to match any ontology are not a directional term or a grade term, the preprocessed Chinese surgical procedure information character string is cut out as a whole as a second-type substring.
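The segmentation steps for a symbol-free string can be sketched as follows. This is only an illustrative sketch, not the patented implementation: it assumes forward maximum matching as the mechanical word segmentation method, assumes that a directional or grade term is merged with the following matched piece when one exists (otherwise with the preceding one), and reads the final step as the case where the unmatched characters are neither a directional term nor a grade term. The dictionaries `ONTOLOGY` and `MODIFIERS` and the function names are hypothetical.

```python
ONTOLOGY = {"肾", "切除术", "阑尾"}        # hypothetical ontology dictionary
MODIFIERS = {"左", "右", "双侧", "Ⅰ级"}     # hypothetical directional / grade terms


def max_match(text, vocab):
    """Forward maximum matching: return a list of (piece, matched) pairs."""
    longest = max(map(len, vocab))
    pieces, i = [], 0
    while i < len(text):
        for size in range(min(longest, len(text) - i), 0, -1):
            if text[i:i + size] in vocab:
                pieces.append((text[i:i + size], True))
                i += size
                break
        else:
            # No ontology starts here: extend the current unmatched run.
            if pieces and not pieces[-1][1]:
                pieces[-1] = (pieces[-1][0] + text[i], False)
            else:
                pieces.append((text[i], False))
            i += 1
    return pieces


def segment(text):
    """Return (first_type_substrings, second_type_substrings)."""
    pieces = max_match(text, ONTOLOGY)
    unmatched = [p for p, ok in pieces if not ok]
    if not unmatched:
        # Every character matched an ontology: all pieces are first-type.
        return [p for p, _ in pieces], []
    if any(p not in MODIFIERS for p in unmatched):
        # Some unmatched run is not a directional/grade term: keep the whole string.
        return [], [text]
    first, second, k = [], [], 0
    while k < len(pieces):
        piece, ok = pieces[k]
        if ok:
            first.append(piece)
            k += 1
        elif k + 1 < len(pieces):
            # Assumption: merge the modifier with the matched piece that follows it.
            second.append(piece + pieces[k + 1][0])
            k += 2
        else:
            # Trailing modifier: merge with the preceding matched piece if still free.
            if first and (k < 2 or pieces[k - 2][1]):
                second.append(first.pop() + piece)
            elif second:
                second[-1] += piece
            else:
                second.append(piece)
            k += 1
    return first, second
```

With these toy dictionaries, `segment("左肾切除术")` returns `(["切除术"], ["左肾"])`, while a string containing an unknown run such as `"肾探查术"` is returned whole as a single second-type substring.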
14. The automatic coding system for Chinese surgical procedure information according to claim 13, wherein
The matching and coding module is further configured to search, based on a pre-established hypothetical classification terminology bank, for a hypothetical classification term that matches the title to be encoded, and to determine the code of the successfully matched hypothetical classification term as the code of the title to be encoded;
The hypothetical classification terminology bank includes several hypothetical classification terms and their codes;
Each hypothetical classification term denotes a specific type of treatment means that corresponds to multiple resection operation types, each of which is a standard term;
The code of the hypothetical classification term is the same as the code of either the total organ resection operation type or the partial resection operation type among the multiple resection operation types.
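As a rough illustration of this lookup, the sketch below assumes that "matching" means exact string equality and that each hypothetical classification term simply borrows the code of one resection type. The term names, the placeholder codes (which are not real ICD codes), and the function name are all invented for the example.

```python
# Placeholder codes for two standard resection terms (not real ICD codes).
STANDARD_CODES = {
    "total organ resection": "STD-TOTAL",
    "partial organ resection": "STD-PARTIAL",
}

# Hypothetical classification terms, each sharing the code of one resection type.
HYPOTHETICAL_TERMS = {
    "lesion excision": STANDARD_CODES["partial organ resection"],
}


def code_from_hypothetical_term(title):
    """Return the code of the hypothetical term equal to `title`, or None."""
    return HYPOTHETICAL_TERMS.get(title)
```

A title with no entry in the bank yields `None`, so a caller could fall through to another matching strategy.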
15. The automatic coding system for Chinese surgical procedure information according to claim 13, wherein
The matching and coding module is further configured to search, based on a pre-established multi-code terminology bank, for a multi-code term that matches the title to be encoded, and to determine the code of the successfully matched multi-code term as the code of the title to be encoded;
The multi-code terminology bank includes several multi-code terms and their codes;
Each multi-code term is a specific surgical procedure type whose performance presupposes another surgical procedure type; both the specific surgical procedure type and the other surgical procedure type are standard terms or expansion terms;
The code of the multi-code term is the combination of the code of the specific surgical procedure type and the code of the other surgical procedure type.
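The combined code of a multi-code term could be modelled as below. This is a sketch under stated assumptions: the claim does not say how the two codes are combined, so an ordered pair is assumed here, and the procedure names, placeholder codes, and function name are hypothetical.

```python
# Placeholder codes for individual procedure types (not real ICD codes).
PROCEDURE_CODES = {
    "procedure A": "A01",
    "procedure B": "B02",
}

# Each multi-code term names (specific procedure, prerequisite procedure).
MULTI_CODE_TERMS = {
    "procedure B following A": ("procedure B", "procedure A"),
}


def code_from_multi_code_term(title):
    """Return the pair of codes for a matching multi-code term, or None."""
    parts = MULTI_CODE_TERMS.get(title)
    if parts is None:
        return None
    # Assumption: the "combination" is the ordered pair of both codes.
    return tuple(PROCEDURE_CODES[name] for name in parts)
```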
16. The automatic coding system for Chinese surgical procedure information according to claim 13, further comprising:
A merging processing module, configured to preprocess the one or more titles to be encoded based on a pre-established merging terminology bank;
The merging terminology bank includes several merging terms and their codes; a merging term is a single standard term, as defined by the International Classification of Diseases (ICD), that can replace at least two other standard terms occurring together; those at least two co-occurring other standard terms are the merge objects of the merging term; the merging terminology bank further includes all merge objects of each merging term;
The merging processing module is specifically configured to judge whether the one or more titles to be encoded contain all merge objects of any one or more merging terms and, if so, to replace all merge objects of each such merging term with the corresponding merging term.
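The merging step can be pictured with the following sketch, which assumes the titles to be encoded arrive as a list of strings and that the merging term is placed where the first merge object stood. The bank contents and the function name are hypothetical.

```python
# Hypothetical merging bank: merging term -> all of its merge objects.
MERGE_TERMS = {
    "combined procedure AB": frozenset({"procedure A", "procedure B"}),
}


def merge_titles(titles):
    """Replace every complete set of merge objects with its merging term."""
    result = list(titles)
    for term, objects in MERGE_TERMS.items():
        if objects <= set(result):
            # Remember where the first merge object stood, then swap in the term.
            pos = min(result.index(obj) for obj in objects)
            result = [t for t in result if t not in objects]
            result.insert(pos, term)
    return result
```

Titles are left untouched unless every merge object of a term is present, which mirrors the "all merge objects" condition in the claim.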
17. The automatic coding system for Chinese surgical procedure information according to claim 13, further comprising:
An omission processing module, configured to preprocess the one or more titles to be encoded based on a pre-established omission terminology bank;
The omission terminology bank includes several omission terms and their codes; an omission term is a single standard term, as defined by the International Classification of Diseases (ICD), that can replace at least two standard terms occurring together; the omission term is itself one of those at least two co-occurring standard terms; the at least two co-occurring standard terms are the omission objects of the omission term; the omission terminology bank further includes all omission objects of each omission term;
The omission processing module is specifically configured to preprocess the one or more titles to be encoded by judging whether they contain all omission objects of any one or more omission terms and, if so, replacing all omission objects of each such omission term with the corresponding omission term.
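Because an omission term is itself one of its omission objects, replacing the objects with the term amounts to keeping that one title and dropping the others. A minimal sketch, assuming list-of-strings input and using a hypothetical bank and function name:

```python
# Hypothetical omission bank: omission term -> all of its omission objects
# (the term itself is always one of the objects).
OMISSION_TERMS = {
    "procedure A": frozenset({"procedure A", "procedure D"}),
}


def apply_omissions(titles):
    """Keep only the omission term when all of its omission objects are present."""
    result = list(titles)
    for term, objects in OMISSION_TERMS.items():
        if objects <= set(result):
            result = [t for t in result if t == term or t not in objects]
    return result
```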

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201510496500.3A CN105069123B (en) 2015-08-13 2015-08-13 Automatic coding method and system for Chinese surgical procedure information

Publications (2)

Publication Number Publication Date
CN105069123A CN105069123A (en) 2015-11-18
CN105069123B true CN105069123B (en) 2018-06-26

Family

ID=54498493

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201510496500.3A Active CN105069123B (en) 2015-08-13 2015-08-13 Automatic coding method and system for Chinese surgical procedure information

Country Status (1)

Country Link
CN (1) CN105069123B (en)

Families Citing this family (20)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105630873B (en) * 2015-12-18 2018-12-25 河南思维自动化设备股份有限公司 The graphical assist edit method disclosed in yard
CN105963022B (en) * 2016-04-19 2018-08-14 中国中医科学院中医临床基础医学研究所 Treat encoder
CN106874643B (en) * 2016-12-27 2020-02-28 中国科学院自动化研究所 Method and system for automatically constructing knowledge base to realize auxiliary diagnosis and treatment based on word vectors
CN108257667A (en) * 2016-12-28 2018-07-06 中国科学院深圳先进技术研究院 A kind of data processing method and terminal device
CN108320778A (en) * 2017-01-16 2018-07-24 医渡云(北京)技术有限公司 Medical record ICD coding methods and system
CN106844308B (en) * 2017-01-20 2020-04-03 天津艾登科技有限公司 Method for automatic disease code conversion using semantic recognition
CN107577826B (en) * 2017-10-25 2018-05-15 山东众阳软件有限公司 Classification of diseases coding method and system based on raw diagnostic data
CN107705839B (en) * 2017-10-25 2020-06-26 山东众阳软件有限公司 Disease automatic coding method and system
CN108182207B (en) * 2017-12-15 2020-11-13 中电科软件信息服务有限公司 Intelligent coding method and system for Chinese surgical operation based on word segmentation network
CN108182977A (en) * 2018-02-05 2018-06-19 南方医科大学顺德医院(佛山市顺德区第人民医院) Patient diagnosis coding method and system
CN108831522A (en) * 2018-05-28 2018-11-16 陈丽璇 A kind of the medical insurance disease score value charging system and its construction method of autocoding
CN109273062A (en) * 2018-08-09 2019-01-25 北京爱医声科技有限公司 ICD intelligence Auxiliary Encoder System
CN109256216B (en) * 2018-08-14 2023-06-27 平安医疗健康管理股份有限公司 Medical data processing method, medical data processing device, computer equipment and storage medium
CN109918655B (en) * 2019-02-27 2023-11-14 浙江数链科技有限公司 Logistics term library generation method and device
CN110442844B (en) * 2019-07-03 2023-09-26 北京达佳互联信息技术有限公司 Data processing method, device, electronic equipment and storage medium
CN111128388B (en) * 2019-12-03 2024-02-27 东软集团股份有限公司 Value range data matching method and device and related products
CN112131868A (en) * 2020-09-22 2020-12-25 上海亿普医药科技有限公司 Clinical trial medical coding method
CN112131867A (en) * 2020-09-22 2020-12-25 上海亿普医药科技有限公司 Clinical trial medical coding system
CN112749307B (en) * 2020-12-30 2022-11-08 杭州依图医疗技术有限公司 Medical data processing method and device and storage medium
CN115017326B (en) * 2022-05-12 2023-08-18 青岛普瑞盛医药科技有限公司 Medical coding method and device

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102456100A (en) * 2010-11-03 2012-05-16 通用电气公司 Systems, methods, and apparatus for computer-assisted full medical code scheme to code scheme mapping
CN104156415A (en) * 2014-07-31 2014-11-19 沈阳锐易特软件技术有限公司 Mapping processing system and method for solving problem of standard code control of medical data

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
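Research and Implementation of Chinese Word Segmentation Algorithms; Lin Dongsheng (林冬盛); China Master's Theses Full-text Database, Information Science and Technology; 2011-08-15 (No. 08); p. 24 *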

Also Published As

Publication number Publication date
CN105069123A (en) 2015-11-18

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant