CN104992173A - Symbol recognition method and system used for medical report - Google Patents

Symbol recognition method and system used for medical report Download PDF

Info

Publication number
CN104992173A
CN104992173A CN201510300821.1A CN201510300821A CN104992173A CN 104992173 A CN104992173 A CN 104992173A CN 201510300821 A CN201510300821 A CN 201510300821A CN 104992173 A CN104992173 A CN 104992173A
Authority
CN
China
Prior art keywords
foreground image
vertical line
symbol
medical report
template
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201510300821.1A
Other languages
Chinese (zh)
Other versions
CN104992173B (en
Inventor
刘立
温成超
吴诗展
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Paiyipai Intelligent Technology Co ltd
Original Assignee
Beijing Haoyundao Information Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Haoyundao Information Technology Co Ltd filed Critical Beijing Haoyundao Information Technology Co Ltd
Priority to CN201510300821.1A priority Critical patent/CN104992173B/en
Publication of CN104992173A publication Critical patent/CN104992173A/en
Application granted granted Critical
Publication of CN104992173B publication Critical patent/CN104992173B/en
Expired - Fee Related legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/20Image preprocessing
    • G06V10/22Image preprocessing by selection of a specific region containing or referencing a pattern; Locating or processing of specific regions to guide the detection or recognition
    • G06V10/23Image preprocessing by selection of a specific region containing or referencing a pattern; Locating or processing of specific regions to guide the detection or recognition based on positionally close patterns or neighbourhood relationships
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/24Classification techniques
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/10Character recognition
    • G06V30/19Recognition using electronic means
    • G06V30/192Recognition using electronic means using simultaneous comparisons or correlations of the image signals with a plurality of references
    • G06V30/194References adjustable by an adaptive method, e.g. learning
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/10Character recognition

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • Multimedia (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Evolutionary Computation (AREA)
  • Evolutionary Biology (AREA)
  • General Engineering & Computer Science (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Artificial Intelligence (AREA)
  • Databases & Information Systems (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Image Analysis (AREA)

Abstract

The invention relates to a symbol recognition method and system used for medical reports. The method includes configuring a training sample training classifier according to features of different symbols; collecting the symbols from foreground images of pictures in the medical reports and obtaining features of the different symbols for configuring a symbol template; and by utilizing the trained classifier for recognizing and detecting the symbol template so that abnormal indicators in the medical reports can be obtained and the areas with the abnormal indicators can be determined. The method and system provided by the invention can be used for analyzing special symbols in the medical reports and the recognition and analyzing efficiency can be improved.

Description

For Symbol Recognition and the system of medical report list
Technical field
The present invention relates to image identification technical field, particularly relate to a kind of Symbol Recognition for medical report list and system.
Background technology
Along with the special character variation occurred in the complicated of text typesetting format and text, make the text in process image more and more difficult.At present, OCR (Optical CharacterRecognition, optical character identification) system has higher discrimination to most of text, is widely used.
Laboratory test report in medical domain is the important evidence for the treatment of patient, also may be the legal basis of the situations such as medical insurance Claims Resolution, the qualification of disability accident and medical tangle.Therefore, to resolving automatically and efficiently of laboratory test report, there is higher realistic meaning.Such as, but conventional special symbol identifies abnormal index, plus sige, minus sign, the number of being more than or equal to, the number of being less than or equal to, arrow etc. in medical report list.Now, the recognition accuracy of OCR system is not fully up to expectations, have impact on the correct parsing of laboratory test report to a certain extent.
Summary of the invention
One of them object of the present invention is to provide a kind of Symbol Recognition for medical report list and system, for providing the recognition accuracy of OCR system, improves medical report list analyzing efficiency.
First aspect, embodiments provides a kind of Symbol Recognition for medical report list, comprising:
According to the latent structure training sample training classifier of distinct symbols;
From medical report free hand drawing sheet, foreground image gathers distinct symbols, obtains the feature of distinct symbols to construct symbol guide;
Utilize described sorter identification and detected symbol template, for obtaining abnormal index in medical report list and position.
Alternatively, described structure training sample training classifier adopts logistic regression method, comprises the following steps:
Structure training sample;
Described training sample is carried out size normalization, obtains the characteristics of image of same dimension;
The image feature value of calculation training sample;
Train described sorter according to described image feature value, obtain described classifier parameters.
Alternatively, before from medical report free hand drawing sheet, foreground image gathers distinct symbols, the method also comprises pre-treatment step, specifically comprises:
Utilize Hough transformation method to obtain the apex coordinate of foreground image, determine the dimensional information of described foreground image;
Perspective transform method is utilized to carry out slant correction to described foreground image, to obtain the described foreground image of orthogonal projection;
Utilize local thresholding method that described foreground image is divided into some regions, binaryzation is carried out to each described region.
Alternatively, described from medical report free hand drawing sheet foreground image gather distinct symbols, also comprise this step of height of determining often to compose a piece of writing, comprising:
That reads in described foreground image is text filed, carries out dilation and erosion obtain connected domain to described text;
When the horizontal projection energy of described connected domain is greater than energy preset value, the maximum difference of the position of described horizontal projection is height originally of often composing a piece of writing.
Alternatively, when the symbol that foreground image gathers is arrow, comprise the following steps:
According to the latent structure vertical line template of vertical line;
All separable vertical lines in foreground image according to described vertical line Template Location;
According to described vertical line structure of transvers plate arrow template;
Construct training sample respectively, training study obtains classifier parameters;
Arrow locations is detected in the position of separable vertical line.
Alternatively, the described latent structure vertical line template according to vertical line, comprising:
Pixel value when the position up and down of continuous line segment is 0, and when being greater than predetermined threshold value with the matching degree of symbol guide correspondence position, marking continuous line segment is separable vertical line;
Travel through described foreground image, separable vertical lines all in the foreground image of location.
Alternatively, according to described vertical line structure of transvers plate arrow template, comprising:
In the position at each separable vertical line place, in the region identical with vertical line template size, carry out horizontal projection;
Calculated level projection energy value is greater than the maximum difference of the position of energy preset value, obtains the live width of vertical line.
Alternatively, the function expression of described sorter is:
P ( t ) = 1 1 + e - t
Wherein, P (t) is classification results, and t is the weighted sum of proper vector;
t = Σ i = 1 N w i × x i
N is the dimension of feature, w ibe the weight coefficient of the i-th dimensional feature, x iit is the eigenwert of the i-th dimensional feature.
Second aspect, the embodiment of the present invention additionally provides a kind of symbol recognition system for medical report list, comprising:
Sorter generation module, for the latent structure training sample training classifier according to distinct symbols;
Symbol guide constructing module, gathers distinct symbols for foreground image from medical report free hand drawing sheet, obtains the feature of distinct symbols to construct symbol guide;
Template matches module, utilizes described sorter identification and detected symbol template, for obtaining abnormal index in medical report list and position.
Compared with prior art, the present invention not only compensate for traditional OCR deficiency low to special symbol discrimination, and the typesetting format also having adapted to text is complicated and diversified; This recognition methods is applied in the parsing of medical report list of medical domain simultaneously, utilize upwards or a certain index that describes in medicalization verification certificate of downward arrow there is situation higher or on the low side, abnormal index is characterized with this, fast resolving medical report list can be realized, there is high using value and vast potential for future development.
Accompanying drawing explanation
Can understanding the features and advantages of the present invention clearly by reference to accompanying drawing, accompanying drawing is schematic and should not be construed as and carry out any restriction to the present invention, in the accompanying drawings:
For the Symbol Recognition schematic flow sheet of medical report list in Fig. 1 embodiment of the present invention;
Fig. 2 is the medical report list schematic diagram of a kind of video camera shooting in the embodiment of the present invention;
Fig. 3 is the normal picture obtained after utilizing Hough transformation and transitting probability to correct the list of medical report shown in Fig. 2 in the embodiment of the present invention;
Fig. 4 is the medical report list schematic diagram demarcating symbol in the embodiment of the present invention;
Fig. 5 is the abnormal index schematic diagram that the arrow extracted in the embodiment of the present invention is expert at;
Fig. 6 is a kind of symbol recognition system block diagram for medical report list in the embodiment of the present invention.
Embodiment
In order to more clearly understand above-mentioned purpose of the present invention, feature and advantage, below in conjunction with the drawings and specific embodiments, the present invention is further described in detail.It should be noted that, when not conflicting, the feature in the embodiment of the application and embodiment can combine mutually.
Set forth a lot of detail in the following description so that fully understand the present invention; but; the present invention can also adopt other to be different from other modes described here and implement, and therefore, protection scope of the present invention is not by the restriction of following public specific embodiment.
On the one hand, the present invention proposes a kind of Symbol Recognition for medical report list, as shown in Figure 1, comprising:
S10, latent structure training sample training classifier according to distinct symbols;
S20, from medical report free hand drawing sheet, foreground image gathers distinct symbols, obtains the feature of distinct symbols to construct symbol guide;
S30, utilize described sorter identification and detected symbol template, for obtaining abnormal index in medical report list and position.
Will be understood that, traditional Chinese medicine report picture of the present invention refers to picture medical report list being placed on somewhere shooting; Foreground image then refers to the imaging of medical report singly in picture.
For the problem that OCR system in prior art is low to special symbol discrimination in text, the Symbol Recognition that the embodiment of the present invention provides, train different sorters according to distinct symbols, then gather symbol from medical report list and set up template, template matches is carried out to sorter.Not only compensate for the deficiency that traditional OCR is low to special symbol discrimination, can be applied in the parsing of medical report list simultaneously, thus carry out fast resolving medical report list, be conducive to improving analyzing efficiency.
Generally, the image obtained by scanner is all orthogonal projection, and image does not have angular deviation, is conducive to the Text region in image.And during camera acquisition image, being subject to restriction and the interference of various condition, the figure sector-meeting of captured object deforms (such as near big and far smaller), needs could use through certain distortion correction process, to ensure not occur error to during image recognition.As shown in Figure 1, the picture of the medical report list of video camera shooting, there is significantly distortion in this picture, therefore needs to carry out pre-service to image.
Alternatively, before from medical report free hand drawing sheet, foreground image gathers distinct symbols, the method also comprises pre-treatment step, specifically comprises:
Utilize Hough transformation method to obtain the apex coordinate of foreground image, determine the dimensional information of described foreground image;
Utilize local thresholding method that described foreground image is divided into some regions, binaryzation is carried out to each described region.
First, introduce the apex coordinate utilizing Hough transformation method to obtain foreground image, determine the step of the dimensional information of described foreground image.
The present invention utilizes Hough transformation method to detect the foreground image edge of medical report free hand drawing sheet, determines the size of foreground image.According to the duality of point with line, curve representation form given for input picture space is become the point of parameter space, thus the test problems of given curve in input picture is converted into the spike problem found in parameter space.Be about to detect global feature and be converted into detection local characteristics, like this by obtaining the edge line equation of foreground image, and the intersection point of straight line, the apex coordinate of foreground image and the dimensional information of foreground image can be obtained.Then, the width value then after correcting using width and maximal value highly as foreground image respectively and height value.
Secondly, introduction utilizes perspective transform method to carry out slant correction to described foreground image, to obtain the step of the described foreground image of orthogonal projection.
After determining the size of foreground image, slant correction is carried out to foreground image.In the present invention, foreground image is mapped in shot object plane, is equivalent to by video camera perpendicular to medical report list, thus obtain desirable picture shape, and do not lose the information that foreground image comprises.
In practical application, those skilled in the art realize the correction to tilted image, and other preprocess methods also can be adopted to solve the problems of the technologies described above, and realize basic effect, the present invention is not construed as limiting.
Finally, introduce and utilize local thresholding method that described foreground image is divided into some regions, binaryzation is carried out to each described region.
Because foreground image comprises 256 brightness degrees, for reducing the complexity calculated, improve the recognition efficiency of special symbol.The present invention carries out binary conversion treatment to this foreground image.
When image binaryzation process, the present invention adopts local thresholding method.This foreground image is divided into several regions, a threshold value is arranged to each region and carries out binaryzation, thus obtain the foreground image of binaryzation, in binaryzation foreground image, better can distinguish target and background.
After pre-service is carried out to Fig. 2, the foreground image after correcting can be obtained, see Fig. 3.
The medicalization verification certificate image that the present invention utilizes video camera to take is example, identifies the arrow wherein occurred, and obtains its positional information, more accurately and fast to determine the abnormal index item of patient.
1) template of vertical line is constructed, all separable vertical lines in the foreground image of location.According to the feature of vertical line, the pixel value namely above vertical line, below, in the preset range on the left side and the right side is 0, structure vertical line template.
According to the vertical line template constructed, the horizontal and vertical direction of foreground image adopts different step-lengths to travel through, orient all separable vertical lines in foreground image.Such as, in the embodiment of the present invention, height and the width of initialization vertical line template are respectively 40,3, and the horizontal direction moving step length of window is 4, vertical direction moving step length is 2, and the size of moving window is the width of vertical line template, adds moving step length on horizontal and vertical direction respectively highly again.Then, utilize vertical line template and moving step length scanning foreground image, according to the feature of separable vertical line, (namely line segment is continuous print, and the pixel value of vertical line up and down in certain limit is 0), when the vertical line matching degree of correspondence position is greater than predetermined threshold value in the line segment in foreground image and this vertical line template, then this line segment is labeled as vertical line.In practical application, the size of predetermined threshold value can be determined by test of many times.After having traveled through this foreground image, the coordinate of the position of all separable vertical lines can be obtained, and by it stored in being used for follow-up identification in text.
2) the row height of text and the live width of vertical line is determined.Carry out base conditioning to foreground image Chinese version region, such as dilation and corrosion, obtains connected domain, determines according to the width of connected domain height originally of often composing a piece of writing.And using this size of height originally as initialization arrow of often composing a piece of writing, i.e. the height of arrow.Constantly upgraded and revised in testing process.
To all separable vertical lines oriented, the embodiment of the present invention adopts difference method to calculate the live width of vertical line.
First structure detects the template of arrow.Arrow is made up of vertical line and symmetrical oblique line, and therefore on vertical line template basis, construct arrow template, namely the horizontal ordinate of vertical line gets 3,6 step-lengths respectively to left and right in the horizontal direction, and the ordinate of vertical line upwards gets 2 step-lengths.Then, get in the region identical with vertical line template size, carry out horizontal projection in the position at each vertical line place.The energy finding out horizontal projection is greater than position during energy preset value, then calculates the difference of this position.When obtaining the maximum difference of this position, it is exactly the live width of arrow vertical line.Certainly, the height of the acquisition vertical line that uses the same method can also be adopted.
Such as: horizontal projection energy is greater than position during energy preset value, is set to 1, otherwise be 0, the array of a multiline text can be obtained:
A=[0,0,0,1,1,1,1,1,0,0];
To the data step-by-step negate in this array A, obtain array B:
B=[1,1,1,0,0,0,0,0,1,1];
Namely the position having 1 in array B is respectively los=1,2,3,8,9;
And ask expectation difference in position:
c=diff(los)=(1,1,5,1)
d=max(c)=5。
The live width that can obtain this vertical line is 5.
3) arrow template is trained.According to the feature of arrow, embodiment of the present invention structure training sample.By continuous training study, obtain the parameter of sorter, for detecting the arrow in foreground image.The feature continuing to utilize sorter to judge whether to meet arrow to have in the position at vertical line place, if be identified as arrow, write down position now, and it is abnormal to think that the index of medical report list this journey occurs.
After determining arrow live width and line height, according to the feature of arrow, detection arrow is divided into upper and lower two parts.Wherein, the first half comprises symmetrical oblique line, and the pixel value around the latter half vertical line of arrow simultaneously outside predeterminable range is 0 entirely.Such as, in the first half, whether comprise in the scope detecting the live width of each one times of the arranged on left and right sides of vertical line in the embodiment of the present invention and comprise oblique line; In the latter half, within the scope of the live width of each one times of the both sides of oblique line, pixel value is 0.
Therefore, the present invention is by training and determining that the method for sorter distinguishes separable vertical line and arrow.According to the component characteristic of arrow, construct training sample respectively, by continuous training study, try to achieve classifier parameters, for the detection identification division of arrow.
Alternatively, utilize logistic regression method to obtain sorter, comprising:
Structure training sample;
Described training sample is carried out size normalization, obtains the characteristics of image of same dimension;
The image feature value of calculation training sample;
Train described sorter according to image feature value, obtain described classifier parameters.
The present invention's preferred Logistic logistic regression sorter trains vertical line both sides whether containing oblique line and blank, and Logistic logistic regression sorter has the advantages such as calculated amount is very little, speed is very fast, storage resources is low when realizing simple, classification.Logistic is a kind of linear classifier, and function expression is:
P ( t ) = 1 1 + e - t
Wherein, P (t) is classification results, and t is the weighted sum of proper vector;
t = Σ i = 1 N w i × x i
N is the dimension of feature, w ibe the weight coefficient of the i-th dimensional feature, x iit is the eigenwert of the i-th dimensional feature.
In the embodiment of the present invention, adopt logistic regression method (Logistic) to obtain sorter, comprising:
(1) training sample set is constructed.Collect a large amount of arrow sample images and be used for training, in training process, to select on the left of arrow vertical line using the line width values of certain multiple as wide and high structure square template in the first half, due to symmetrical, only need flip horizontal can obtain on the right side of the first half template.Same method, structure arrow the latter half, the template of arranged on left and right sides, extracts above tetrameric characteristics of image and is used for training classifier.
(2) image of all samples of training sample set is carried out size normalization, now can obtain the characteristics of image of same dimension, for the training of sorter.
(3) eigenwert of calculation training sample.Using the pixel value after the binaryzation of foreground image and conversion thereof as characteristics of image, representation is simple, and computation complexity is low.
(4) train Logistic sorter, obtain classifier parameters.
(5) classifier parameters trained is utilized, for carrying out the position of marker arrow according to vertical line coordinate.As shown in Figure 4, the position of arrow has been marked with dotted line frame 2 in the enlarged drawing of dotted line frame 1.The abnormal index in this medical report list can be obtained by the position of arrow.As shown in Figure 5, arrow has marked an abnormal index: the 20th index monocyte absolute value in medical report list is higher.
The embodiment of the present invention is only illustrated Symbol Recognition in conjunction with arrow.In practical application, Symbol Recognition provided by the invention can also be applied in the identification of other symbols, such as plus sige, minus sign, the number of being more than or equal to, the number of being less than or equal to, fullstop, exclamation, percentage sign, Roman number and asterisk etc.According to the feature of distinct symbols, two symmetrical semicircles can be seen as in such as fullstop, exclamation can be divided into vertical line and put two parts, percentage sign can be divided into oblique line and the circle being positioned at these oblique line both sides etc., the Symbol Recognition that the embodiment of the present invention can be utilized to provide carries out identifying and other symbols detected in medical report list, does not repeat them here.
On the other hand, the embodiment of the present invention additionally provides a kind of symbol recognition system for medical report list, as shown in Figure 6, comprising:
Sorter generation module, obtains sorter for the latent structure training sample according to distinct symbols;
Symbol guide constructing module, from medical report free hand drawing sheet, foreground image gathers distinct symbols, obtains the feature of distinct symbols to construct symbol guide;
Template matches module, utilizes described sorter identification and detected symbol template, for obtaining abnormal index in medical report list and position.
Based on same inventive concept, a kind of symbol recognition system for medical report list that the embodiment of the present invention provides, this symbol recognition system realizes owing to adopting above-mentioned Symbol Recognition, thus can solve same technical matters, and obtain identical technique effect, no longer detailed at this.
It should be noted that in describing the invention, term " on ", the orientation of the instruction such as D score or position relationship be based on orientation shown in the drawings or position relationship, only the present invention for convenience of description and simplified characterization, instead of indicate or imply that the device of indication or element must have specific orientation, with specific azimuth configuration and operation, therefore can not be interpreted as limitation of the present invention.
Although describe embodiments of the present invention by reference to the accompanying drawings, but those skilled in the art can make various modifications and variations without departing from the spirit and scope of the present invention, such amendment and modification all fall into by within claims limited range.

Claims (9)

1. for a Symbol Recognition for medical report list, it is characterized in that, comprising:
According to the latent structure training sample training classifier of distinct symbols;
From medical report free hand drawing sheet, foreground image gathers distinct symbols, obtains the feature of distinct symbols to construct symbol guide;
Utilize described sorter identification and detected symbol template, for obtaining abnormal index in medical report list and position.
2. Symbol Recognition as claimed in claim 1, is characterized in that, described structure training sample training classifier adopts logistic regression method, comprises the following steps:
Structure training sample;
Described training sample is carried out size normalization, obtains the characteristics of image of same dimension;
The image feature value of calculation training sample;
Train described sorter according to described image feature value, obtain described classifier parameters.
3. Symbol Recognition according to claim 1, is characterized in that, before foreground image gathers distinct symbols from medical report free hand drawing sheet, also comprises pre-treatment step, specifically comprises:
Utilize Hough transformation method to obtain the apex coordinate of foreground image, determine the dimensional information of described foreground image;
Perspective transform method is utilized to carry out slant correction to described foreground image, to obtain the described foreground image of orthogonal projection;
Utilize local thresholding method that described foreground image is divided into some regions, binaryzation is carried out to each described region.
4. Symbol Recognition according to claim 1, is characterized in that,
Described from medical report free hand drawing sheet foreground image gather distinct symbols, also comprise this step of height of determining often to compose a piece of writing, comprising:
That reads in described foreground image is text filed, obtains connected domain to the described text filed dilation and erosion that carries out;
When the energy of the horizontal projection of described connected domain is greater than energy preset value, the maximum difference of the position of described horizontal projection is height originally of often composing a piece of writing.
5. Symbol Recognition according to claim 1, is characterized in that,
When the symbol that foreground image gathers is arrow, comprise the following steps:
According to the latent structure vertical line template of vertical line;
All separable vertical lines in foreground image according to described vertical line Template Location;
According to described vertical line structure of transvers plate arrow template;
Construct training sample respectively, training study obtains classifier parameters;
Arrow locations is detected in the position of separable vertical line.
6. Symbol Recognition according to claim 5, is characterized in that,
The described latent structure vertical line template according to vertical line, comprising:
Pixel value when the position up and down of continuous line segment is 0, and when being greater than predetermined threshold value with the matching degree of symbol guide correspondence position, marking continuous line segment is separable vertical line;
Travel through described foreground image, separable vertical lines all in the foreground image of location.
7. Symbol Recognition according to claim 6, is characterized in that,
According to described vertical line structure of transvers plate arrow template, comprising:
In each separable vertical line position, in the region identical with vertical line template size, carry out horizontal projection;
Calculated level projection energy value is greater than the maximum difference of the position of energy preset value, obtains the live width of vertical line.
8. Symbol Recognition according to claim 6, is characterized in that,
The function expression of described sorter is:
P ( t ) = 1 1 + e - t
Wherein, P (t) is classification results, and t is the weighted sum of proper vector;
t = Σ i = 1 N w i × x i
N is the dimension of feature, w ibe the weight coefficient of the i-th dimensional feature, x iit is the eigenwert of the i-th dimensional feature.
9. for a symbol recognition system for medical report list, it is characterized in that, comprising:
Sorter generation module, for the latent structure training sample training classifier according to distinct symbols;
Symbol guide constructing module, gathers distinct symbols for foreground image from medical report free hand drawing sheet, obtains the feature of distinct symbols to construct symbol guide;
Template matches module, utilizes described sorter identification and detected symbol template, for obtaining abnormal index in medical report list and position.
CN201510300821.1A 2015-06-03 2015-06-03 Symbol Recognition and system for medical report list Expired - Fee Related CN104992173B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201510300821.1A CN104992173B (en) 2015-06-03 2015-06-03 Symbol Recognition and system for medical report list

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201510300821.1A CN104992173B (en) 2015-06-03 2015-06-03 Symbol Recognition and system for medical report list

Publications (2)

Publication Number Publication Date
CN104992173A true CN104992173A (en) 2015-10-21
CN104992173B CN104992173B (en) 2018-08-17

Family

ID=54303986

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201510300821.1A Expired - Fee Related CN104992173B (en) 2015-06-03 2015-06-03 Symbol Recognition and system for medical report list

Country Status (1)

Country Link
CN (1) CN104992173B (en)

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105260997A (en) * 2015-09-22 2016-01-20 北京好运到信息科技有限公司 Method for automatically obtaining target image
CN105447177A (en) * 2015-12-11 2016-03-30 浪潮电子信息产业股份有限公司 Method for archiving OQC (open ended quality control) inspection report
CN105760891A (en) * 2016-03-02 2016-07-13 上海源庐加佳信息科技有限公司 Chinese character verification code recognition method
CN106355177A (en) * 2016-08-17 2017-01-25 天津德闻数据科技有限公司 Method for automatically extracting item information from medical examination reports
CN106874823A (en) * 2015-12-10 2017-06-20 中国移动通信集团公司 A kind of information identifying method and device
CN109478419A (en) * 2016-07-15 2019-03-15 皇家飞利浦有限公司 The automatic identification of significant discovery code in structuring and narrative report
CN111985574A (en) * 2020-08-31 2020-11-24 平安医疗健康管理股份有限公司 Medical image recognition method, device, equipment and storage medium

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20070133883A1 (en) * 2005-12-12 2007-06-14 Microsoft Corporation Logical structure and layout based offline character recognition
CN102663423A (en) * 2012-03-28 2012-09-12 北京航空航天大学 Method for automatic recognition and playing of numbered musical notation image
CN103235945A (en) * 2013-03-27 2013-08-07 重庆市科学技术研究院 Method for recognizing handwritten mathematical formulas and generating MathML (mathematical makeup language) based on Android system
CN104392205A (en) * 2014-10-24 2015-03-04 浙江力石科技股份有限公司 Abnormal vehicle license plate recognition method and system

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20070133883A1 (en) * 2005-12-12 2007-06-14 Microsoft Corporation Logical structure and layout based offline character recognition
CN102663423A (en) * 2012-03-28 2012-09-12 北京航空航天大学 Method for automatic recognition and playing of numbered musical notation image
CN103235945A (en) * 2013-03-27 2013-08-07 重庆市科学技术研究院 Method for recognizing handwritten mathematical formulas and generating MathML (mathematical makeup language) based on Android system
CN104392205A (en) * 2014-10-24 2015-03-04 浙江力石科技股份有限公司 Abnormal vehicle license plate recognition method and system

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
卢中亮: "数据公式字符识别及BP并行算法分类器", 《中国优秀硕士学位论文全文数据库 信息科技辑》 *

Cited By (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105260997A (en) * 2015-09-22 2016-01-20 北京好运到信息科技有限公司 Method for automatically obtaining target image
CN106874823A (en) * 2015-12-10 2017-06-20 中国移动通信集团公司 A kind of information identifying method and device
CN105447177A (en) * 2015-12-11 2016-03-30 浪潮电子信息产业股份有限公司 Method for archiving OQC (open ended quality control) inspection report
CN105760891A (en) * 2016-03-02 2016-07-13 上海源庐加佳信息科技有限公司 Chinese character verification code recognition method
CN109478419A (en) * 2016-07-15 2019-03-15 皇家飞利浦有限公司 The automatic identification of significant discovery code in structuring and narrative report
CN109478419B (en) * 2016-07-15 2023-06-27 皇家飞利浦有限公司 Automatic identification of salient discovery codes in structured and narrative reports
CN106355177A (en) * 2016-08-17 2017-01-25 天津德闻数据科技有限公司 Method for automatically extracting item information from medical examination reports
CN106355177B (en) * 2016-08-17 2019-05-10 天津德闻数据科技有限公司 A kind of inspection item information automation extraction method of medical inspection list
CN111985574A (en) * 2020-08-31 2020-11-24 平安医疗健康管理股份有限公司 Medical image recognition method, device, equipment and storage medium

Also Published As

Publication number Publication date
CN104992173B (en) 2018-08-17

Similar Documents

Publication Publication Date Title
CN104992173A (en) Symbol recognition method and system used for medical report
US10817741B2 (en) Word segmentation system, method and device
CN102208019B (en) Method for detecting lane change of vehicle based on vehicle-mounted camera
WO2019169532A1 (en) License plate recognition method and cloud system
CN102509091B (en) Airplane tail number recognition method
US20170262738A1 (en) Intelligent scoring method and system for text objective question
CN110503054B (en) Text image processing method and device
CN110619326B (en) English test paper composition detection and identification system and method based on scanning
CN103310211B (en) A kind ofly fill in mark recognition method based on image procossing
CN106033535B (en) Electronic paper marking method
CN103559490A (en) Answer sheet automatic grading method based on binary image connected domain statistics
CN104239850A (en) Answer sheet filling information automatic recognition method based on comprehensive features of images
CN105303153A (en) Vehicle license plate identification method and apparatus
US20170351925A1 (en) Analysis method of lane stripe images, image analysis device, and non-transitory computer readable medium thereof
CN113762274B (en) Answer sheet target area detection method, system, storage medium and equipment
CN103544504A (en) Scene character recognition method based on multi-scale map matching core
CN106033534B (en) Electronic paper marking method based on straight line detection
CN103699876A (en) Method and device for identifying vehicle number based on linear array CCD (Charge Coupled Device) images
CN111008594A (en) Error correction evaluation method, related equipment and readable storage medium
CN111008635A (en) OCR-based multi-bill automatic identification method and system
CN105975955A (en) Detection method of text area in image
CN116824333B (en) Nasopharyngeal carcinoma detecting system based on deep learning model
CN112926571A (en) Formula picture recognition and question judgment system based on deep learning
CN110298236B (en) Automatic Braille image identification method and system based on deep learning
CN110276347B (en) Text information detection and identification method and equipment

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
CB02 Change of applicant information

Address after: 100085 room 3, building 8, Chuang Chuang Road, Haidian District, Beijing (five story), room 3-7, -839.

Applicant after: BEIJING MEDP.AI INTELLIGENT TECHNOLOGY CO.,LTD.

Address before: 100085 room 3, building 8, Chuang Chuang Road, Haidian District, Beijing (five story), room 3-7, -839.

Applicant before: BEIJING HAOYUNDAO INFORMATION TECHNOLOGY Co.,Ltd.

CB02 Change of applicant information
TA01 Transfer of patent application right

Effective date of registration: 20180706

Address after: 100195 Beijing Haidian District west Shan creative park four district three building 3 floor 159.

Applicant after: BEIJING PAIYIPAI INTELLIGENT TECHNOLOGY Co.,Ltd.

Address before: 100085 room 3, building 8, Chuang Chuang Road, Haidian District, Beijing (five story), room 3-7, -839.

Applicant before: BEIJING MEDP.AI INTELLIGENT TECHNOLOGY CO.,LTD.

TA01 Transfer of patent application right
GR01 Patent grant
GR01 Patent grant
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20180817

Termination date: 20210603

CF01 Termination of patent right due to non-payment of annual fee