JP2005275510A

JP2005275510A - Character recognition processing method, and method and device for recognition processing of content of medical fee bill

Info

Publication number: JP2005275510A
Application number: JP2004084083A
Authority: JP
Inventors: Yoshimi Yamada; 義美山田
Original assignee: Oki Electric Industry Co Ltd
Current assignee: Oki Electric Industry Co Ltd
Priority date: 2004-03-23
Filing date: 2004-03-23
Publication date: 2005-10-06

Abstract

<P>PROBLEM TO BE SOLVED: To recognize and collate an abbreviated name by limiting characters subject to recognition and collation in a standard name. <P>SOLUTION: A content recognition processing device for optically reading an image of a medical fee bill, recognizing each character segmented from the image to generate each character code, and collating the character code string with registered words to determine a matching word comprises a master part 3 for registering standard names of the registered words, a word extraction processing part 4 for limiting characters subject to collation in component characters of the registered words, and a character recognition part 1 for collating the character code string with the component character strings limited in the registered words. For a character code string read from an abstract field section 2, collation is performed by using the master part 3. <P>COPYRIGHT: (C)2006,JPO&NCIPI

Description

本発明は、光学式文字読取装置を用いた文字認識処理方法、診療報酬明細書の内容認識処理方法及び装置に関し、特に、診療報酬明細書の摘要欄の医薬品名等を認識し処理する方法及び装置に関するものである。 The present invention relates to a character recognition processing method using an optical character reader, a content recognition processing method and apparatus for a medical fee description, and in particular, a method for recognizing and processing a medicine name in a summary column of a medical fee description, and the like. It relates to the device.

診療報酬明細書とは、医療機関がその診療行為に対する報酬を保険機関等に対して要求する際の書類をいう。
従来の診療報酬明細書（以下、「レセプト」という。）を読み取る光学式文字読取装置（以下、「ＯＣＲ」という。）における摘要欄の読み取りの認識装置を図１２に示す。 The medical remuneration statement is a document when a medical institution requests a remuneration for the medical practice from an insurance institution or the like.
FIG. 12 shows a recognition device for recognizing a summary column in an optical character reader (hereinafter referred to as “OCR”) that reads a conventional medical fee description (hereinafter referred to as “recepto”).

図１２において、文字認識部１１は、レセプトの摘要欄部２を光学的に読み取る。摘要欄部２は、図６に示すように、レセプトの中央の右部に設けられており、図１１の文字認識部１１は、この摘要欄部２に印字された医薬品、診療行為等を含んだ文字列を１行単位に認識する。 In FIG. 12, the character recognition unit 11 optically reads the summary column 2 of the receipt. As shown in FIG. 6, the summary column 2 is provided at the center right part of the receipt, and the character recognition unit 11 in FIG. 11 includes the medicines, medical practices, etc. printed on the summary column 2. The character string is recognized on a line-by-line basis.

一方、マスタ部１３はデータの記憶装置であり、医薬品、診療行為等の具体的な内容を表す単語、例えば、図２に示すように、「ァロテック錠１０ｍｇ」などを記憶し登録している。そして、文字認識部１１は、摘要欄部２から認識した行単位の医薬品、診療行為等の文字列でマスタ部１３を検索する。 On the other hand, the master unit 13 is a data storage device, and stores and registers words representing specific contents such as medicines and medical treatments, for example, “Alotec lock 10 mg” as shown in FIG. Then, the character recognizing unit 11 searches the master unit 13 with a character string such as a line-by-line medicine or medical practice recognized from the summary column unit 2.

ここで、検索の結果、一致する単語が存在する場合は、医薬品、診療行為等の単語は、正しく読まれたものとして、その読まれた行を認識結果とする。 Here, if there is a matching word as a result of the search, it is assumed that words such as medicines and medical treatments are read correctly, and the read line is set as a recognition result.

一方、一致する単語が存在しない場合は、対応する医薬品、診療行為等の単語がない、あるいは、文字認識に失敗したとして、行ＮＧフラグというフラグを立てていた。そして、行ＮＧフラグとともに、その行を認識結果として図示しない上位装置に送っていた。 On the other hand, if there is no matching word, a flag of row NG flag is set on the assumption that there is no word for the corresponding medicine or medical treatment or the character recognition has failed. Then, along with the row NG flag, the row is sent as a recognition result to a host device (not shown).

そして、このような認識結果を受け取った上位装置では、行ＮＧフラグの立っている行について、オペレータが修正を行っていた。
その結果、図５に示すような様式で摘要欄部２（図６及び図１２）の医薬品名を印字していた。 Then, in the host device that has received such a recognition result, the operator has corrected the row in which the row NG flag is set.
As a result, the name of the medicine in the summary column 2 (FIGS. 6 and 12) was printed in the manner shown in FIG.

しかしながら、上述した従来技術には次のような課題があった。
上述した医薬品名は、それを特定できる範囲での省略が許容されている。そして、各医療機関（病院）では、都合に応じて独自の書式で上述した診療報酬明細書を作成しており、医療機関によって医薬品名の省略を行う場合と省略を行わずに正式名称を使用する場合とがある。診療行為の名称についても同様の事情がある。 However, the above-described prior art has the following problems.
The above-mentioned pharmaceutical names are allowed to be omitted as long as they can be specified. Each medical institution (hospital) creates the above-mentioned medical remuneration statement in its own format according to circumstances, and uses the official name without omitting the drug name by the medical institution There is a case to do. The same situation applies to the names of medical practices.

このため、診療報酬明細書に記載されている医薬品や診療行為等の単語が、マスタ部１３に登録されている、厚生労働省が提供している医薬品や診療行為等の名称と異なっている場合がある。 For this reason, the words such as medicines and medical treatments described in the medical remuneration statement may be different from the names of medicines and medical treatments provided by the Ministry of Health, Labor and Welfare registered in the master unit 13. is there.

特に、医薬品においては、例えば、「ネオフィリン注２．５％１０ｍＬ」を「ネオフィリン注」、「ァロテック錠１０ｍｇ」を「ァロテック錠」等というように、「％」や「ｍＬ」、「ｍｇ」を省略してレセプトに印字されることがある。 In particular, in medicines, for example, “Neophyrin Injection 2.5% 10 mL” is “Neophyrin Injection”, “Arotech Tablets 10 mg” is “Arotech Tablets”, etc. “%”, “mL”, “mg” May be omitted and printed on the receipt.

この場合、「ネオフィリン注」や「ァロテック錠」のみの単語はマスタ部１３に登録されていない。このため、一致する医薬品単語が存在せず、認識が不成功となる場合があるという不具合があった。 In this case, words including only “Neophyllin Note” and “Arrotech Lock” are not registered in the master unit 13. For this reason, there is a problem that there is no matching drug word and recognition may be unsuccessful.

また、医薬品や診療行為等のうち、特に医薬品については、例えば、「スペロン注射液２５％１ｍＬ」を、各医療機関では、例えば、空白のない「スペロン注射液２５％１ｍＬ」や空白の多い「スペロン注射液２５％１ｍＬ」と印字する場合がある。 In addition, among medicines and medical treatments, especially for medicines, for example, “Speron injection solution 25% 1 mL” is used in each medical institution, for example, “Speron injection solution 25% 1 mL” without blanks or “ "Speron Injection 25% 1mL" may be printed.

そこで、マスタ部１３では、空白のない「スペロン注射液２５％１ｍＬ」とか、「イソジンゲル１０％」という形式で登録している。 Therefore, in the master unit 13, registration is made in the form of “speron injection solution 25% 1 mL” or “isodine gel 10%” without a blank.

このため、各医療機関で空白のある「スペロン注射液２５％１ｍＬ」とか、「イソジンゲル１０％」を記載した場合には、空白の部分を詰める手順が必要となり、処理が煩雑化していた。 For this reason, when “Speron injection solution 25% 1 mL” or “Isodine gel 10%” with a blank is described in each medical institution, a procedure for filling the blank portion is necessary, and the processing becomes complicated.

本発明は、以下の方法及び装置を最も主要な特徴とする。
第１は、書類の画像を光学的に読み取り、当該画像から切り出した各文字を認識して各文字コードを生成し、当該文字コード列を登録単語と照合して一致する単語を決定する場合において、前記登録単語の構成文字のうち、照合の対象とすべき文字を限定し、前記文字コード列を前記登録単語の限定された構成文字列と照合することを特徴とする文字認識処理方法である。 The present invention is mainly characterized by the following method and apparatus.
First, when an image of a document is optically read, each character cut out from the image is recognized to generate each character code, and the character code string is compared with a registered word to determine a matching word. The character recognition processing method is characterized in that, among the constituent characters of the registered word, characters to be collated are limited, and the character code string is collated with the limited constituent character string of the registered word. .

第２は、診療報酬明細書の画像を光学的に読み取り、当該画像から切り出した各文字を認識して各文字コードを生成し、当該文字コード列を登録単語と照合して一致する単語を決定する場合において、前記登録単語の構成文字のうち、照合の対象とすべき文字を限定し、前記文字コード列を前記登録単語の限定された構成文字列と照合することを特徴とする診療報酬明細書の内容認識処理方法である。 Second, optically read an image of the medical remuneration statement, recognize each character cut out from the image, generate each character code, and collate the character code string with a registered word to determine a matching word In this case, the medical remuneration specification is characterized in that among the constituent characters of the registered word, the character to be collated is limited, and the character code string is collated with the limited constituent character string of the registered word. This is a content recognition processing method of a book.

第３は、診療報酬明細書の画像を光学的に読み取り、当該画像から切り出した各文字を認識して各文字コードを生成し、当該文字コード列を登録単語と照合して一致する単語を決定する内容認識処理装置において、前記登録単語の正式名称を登録するマスタ部と、前記登録単語の構成文字のうち、照合の対象とすべき文字を限定する単語抽出処理部と、前記文字コード列を前記登録単語の限定された構成文字列と照合する文字認識部とから成ることを特徴とする診療報酬明細書の内容認識処理装置である。 Third, optically read the image of the medical fee description, recognize each character cut out from the image, generate each character code, and collate the character code string with the registered word to determine a matching word In the content recognition processing device, the master unit for registering the formal name of the registered word, the word extraction processing unit for limiting the characters to be collated among the constituent characters of the registered word, and the character code string It is a content recognition processing device for a medical fee remuneration statement, characterized by comprising a character recognition unit that collates with a limited constituent character string of the registered word.

本発明の文字認識処理方法、診療報酬明細書の内容認識処理方法及び装置によれば、簡略化された表現で記載された医薬品名が診療報酬明細書に印字されている場合でも、正式名称を登録単語とするマスタファイルを使用して文字認識後の単語照合を行うことができ、診療報酬明細書の内容を認識する際に好適な方法及び装置を提供できる。 According to the character recognition processing method and the content recognition processing method and apparatus of the medical fee remuneration specification of the present invention, even if the medicine name described in a simplified expression is printed on the medical fee remuneration statement, Word verification after character recognition can be performed using a master file as a registered word, and a method and apparatus suitable for recognizing the contents of a medical fee remuneration statement can be provided.

照合の対象とすべき文字を限定するという目的を、正式名称から抽出する文字の数を指定する抽出文字数を登録することで、実現した。 The purpose of limiting the characters that should be matched is achieved by registering the number of extracted characters that specify the number of characters extracted from the official name.

図１は、本発明の文字認識方法を適用したレセプト（診療報酬明細書）読取装置の実施例１を示すブロック図である。
図示の装置は、文字認識部１、マスタ部３、単語抽出処理部４から成る。 FIG. 1 is a block diagram showing a first embodiment of a receipt (medical fee description) reading apparatus to which the character recognition method of the present invention is applied.
The illustrated apparatus includes a character recognition unit 1, a master unit 3, and a word extraction processing unit 4.

文字認識部１は、レセプトの摘要欄部２を光学的に読み取る。摘要欄部２は、図６に示すように、レセプトの中央の右部に設けられており、この摘要欄部２には、医薬品、診療行為等を含んだ文字列が印字されている。文字認識部１は、この摘要欄部２に印字された医薬品、診療行為等を含んだ文字列を１行内の一連の文字列単位に認識する。 The character recognition unit 1 optically reads the summary section 2 of the receipt. As shown in FIG. 6, the summary column part 2 is provided at the center right part of the receipt, and the summary column part 2 is printed with a character string including a medicine, a medical practice and the like. The character recognizing unit 1 recognizes a character string including a medicine, a medical practice, and the like printed in the summary column unit 2 in a series of character strings in one line.

マスタ部３は、データ記憶装置であり、医薬品、診療行為等の具体的な内容を表す単語を記憶し登録している。また、このマスタ部３には、医薬品名の単語の文字列から所定数の文字列を抽出すべく、抽出文字数が登録されている。 The master unit 3 is a data storage device, and stores and registers words representing specific contents such as medicines and medical treatments. In addition, the number of extracted characters is registered in the master unit 3 so as to extract a predetermined number of character strings from the character string of the drug name word.

図２に、抽出文字数が登録してあるマスタ部３の例を示す。
図２に示すように、抽出文字数は、「％」や「ｍL」、「ｍｇ」等の部分を省略しても同名の医薬品が生じない単語すなわち一種類しか存在しない医薬品単語に対して登録されている。例えば、「フェニタレン坐剤」の「ｍｇ」部のように、「ｍｇ」、「ｍｌ」、「％」等の部分を省略すると、他の単語と同じになってしまう単語には、抽出文字数は登録されない。 FIG. 2 shows an example of the master unit 3 in which the number of extracted characters is registered.
As shown in FIG. 2, the number of extracted characters is registered for a word that does not produce a drug with the same name even if the parts such as “%”, “mL”, and “mg” are omitted, that is, a drug word that has only one type. ing. For example, if you omit “mg”, “ml”, “%”, etc., like the “mg” part of “phenitalene suppository”, the number of extracted characters will be the same as other words. Not registered.

図１の単語抽出処理部４は、マスタ部３から登録単語を取り出す際に、登録単語の全文字を取り出すとともに、各登録単語に抽出文字数が登録されているときは、その抽出文字数分の文字列を抽出する。 When extracting the registered words from the master unit 3, the word extraction processing unit 4 in FIG. 1 extracts all characters of the registered words, and when the number of extracted characters is registered in each registered word, the number of characters for the extracted characters Extract columns.

そして、文字認識部１は、摘要欄部２から認識した行単位の医薬品、診療行為等の文字列についてマスタ部３に登録されている単語を検索する。この際、摘要欄部２から認識した文字数と同じ文字数の登録単語について検索を行う。 Then, the character recognition unit 1 searches for a word registered in the master unit 3 for a character string such as a line-by-line medicine or medical practice recognized from the summary column unit 2. At this time, a search is performed for registered words having the same number of characters as the number of characters recognized from the summary column part 2.

次に、上述した装置の動作を図３のフローチャートを参照しつつ説明する。
まず、レセプトの摘要欄部２に記載された文字について行切り出し（ステップS1）、文字切り出し（ステップS2）を行った後、文字認識（ステップS3）を行い、各文字を文字コード（JISコード）にする。そして、各行の最初の空白までの文字列についてマスタ部３に一致する医薬品単語が存在するかどうか検索する（ステップS4,S5）。 Next, the operation of the above-described apparatus will be described with reference to the flowchart of FIG.
First, line extraction (step S1) and character extraction (step S2) are performed for the characters listed in the summary section 2 of the receipt, then character recognition (step S3) is performed, and each character is converted into a character code (JIS code). To. Then, a search is made as to whether or not there is a medicine word that matches the master part 3 for the character string up to the first blank in each line (steps S4 and S5).

その際、マスタ部３における抽出文字数が登録されている場合は、単語抽出処理部４は、その登録されている単語から、抽出文字数に従い、その抽出文字数分の単語を抽出する。そして、その抽出した単語と一致するかどうか検索を行う（ステップS4）。一致する単語が存在する場合は（ステップS5）、その一致した医薬品単語が認識結果となる（ステップS8）。 At this time, if the number of extracted characters in the master unit 3 is registered, the word extraction processing unit 4 extracts words corresponding to the number of extracted characters from the registered words according to the number of extracted characters. Then, a search is performed as to whether or not the extracted word matches (step S4). If there is a matching word (step S5), the matching drug word is the recognition result (step S8).

ここで、ステップS4でのマスタ部３における医薬品名の取り出し手順を図４のフローチャートを参照しつつ説明する。 Here, the procedure for extracting the drug name in the master unit 3 in step S4 will be described with reference to the flowchart of FIG.

まず、ステップS11で、摘要欄部２の先頭文字から空白までの文字数をカウントする。そして、ステップS12で、マスタ部３（図２）において上から順に医薬品名を検索する。次のステップS13では、マスタ部３（図２）に登録された医薬品名の登録文字数（先頭から空白までの文字数）と摘要欄部２に記載された医薬品名の文字数とが一致するか否かを判別する。その結果、ステップS13で、摘要欄部２に記載された文字数がマスタ部３において検索された文字数と一致するときは、ステップS14で、医薬品名（正式名称）の全文字を取り出し、ステップS15で、全文字の比較照合を行う。その結果、ステップS16で、正式名称としての医薬品名が一致するときは、処理を終了し、図３のステップS5に戻る。
一方、ステップS13で、摘要欄部２に記載された文字数がマスタ部３の文字数と一致しないときは、ステップS17で、マスタ部３の医薬品名に対応した抽出文字数が登録されているか否かを判別する。その結果、抽出文字数が登録されていないときは、次の登録医薬品名を検索すべく、ステップS12に進む。 First, in step S11, the number of characters from the first character of the summary column part 2 to a blank is counted. Then, in step S12, the drug names are searched in order from the top in the master unit 3 (FIG. 2). In the next step S13, whether or not the number of registered characters (number of characters from the beginning to the blank) of the drug name registered in the master unit 3 (FIG. 2) matches the number of characters of the drug name described in the summary column unit 2 Is determined. As a result, in step S13, when the number of characters described in the summary column portion 2 matches the number of characters searched in the master portion 3, in step S14, all characters of the medicine name (official name) are extracted, and in step S15 Compare all characters. As a result, when the drug name as the official name matches in step S16, the process ends and the process returns to step S5 in FIG.
On the other hand, when the number of characters described in the summary field 2 is not the same as the number of characters in the master unit 3 in step S13, it is determined in step S17 whether the number of extracted characters corresponding to the drug name in the master unit 3 is registered. Determine. As a result, when the number of extracted characters is not registered, the process proceeds to step S12 to search for the next registered medicine name.

ステップS17で、抽出文字数が登録されているときは、ステップS18で、抽出文字数が摘要欄部２の医薬品名の文字数と一致するか否かを判別する。その結果、両者の文字数が一致しないときは、次の登録医薬品名を検索すべく、ステップS12に進む。
一方、ステップS18で、マスタ部３の抽出文字数と摘要欄部２の医薬品名の文字数とが一致するときは、次のステップS19で、先頭文字から抽出文字数分の文字列（JISコード）を取り出す。
このようにして、正式名称の医薬品名を使って簡略名称の医薬品名を作り出している。
そして、ステップS15に進み、簡略名称としての医薬品名の照合を行う。その結果、ステップS16で簡略名称としての医薬品名が一致するときは、処理を終了し、図３のステップS5に戻る。一致しないときは、次の登録医薬品名を検索すべく、ステップS12に進む。 If the number of extracted characters is registered in step S17, it is determined in step S18 whether or not the number of extracted characters matches the number of characters in the medicine name in the summary column section 2. If the number of characters does not match, the process proceeds to step S12 to search for the next registered medicine name.
On the other hand, if the number of extracted characters in the master unit 3 matches the number of characters in the medicine name in the summary column unit 2 in step S18, a character string (JIS code) corresponding to the number of extracted characters is extracted from the first character in the next step S19. .
In this way, a simple drug name is created using the official drug name.
Then, the process proceeds to step S15, where the drug name as a simplified name is collated. As a result, when the medicine name as the simplified name matches in step S16, the process is terminated and the process returns to step S5 in FIG. If not, the process proceeds to step S12 to search for the next registered medicine name.

ところで、本発明は、図４に示す処理手順に限らず、他の方法を使用することも可能である。例えば、抽出文字数の欄をマスタ部３に設けずに、簡略名称を正式名称と同等に扱い、マスタ部に登録する方法である。 By the way, the present invention is not limited to the processing procedure shown in FIG. 4, and other methods can be used. For example, the column of the number of extracted characters is not provided in the master unit 3, but the simplified name is handled in the same way as the official name and registered in the master unit.

しかしながら、そのような方法では、従来のマスタ部１３と比較して、マスタ部３は２倍近い記憶容量を必要としてしまう。また、同じ医薬品を示す名称が２種類存在することになり、検索時間がかかることになる。 However, in such a method, the master unit 3 requires nearly twice the storage capacity as compared with the conventional master unit 13. In addition, there are two types of names indicating the same medicine, which takes a long search time.

この点、図２のマスタ部を用いた図４の方法によれば、医薬品名数×１バイト程度の記憶容量の増加で済み、検索処理時間もさほどかからない。 In this regard, according to the method of FIG. 4 using the master unit of FIG. 2, the storage capacity is increased by the number of drug names × about 1 byte, and the search processing time is not so much.

図３のステップS5で、一致する単語が存在しない場合は、空白の次に文字列がある場合（ステップS6）、次の空白までの文字列と空白をのぞいて結合し（ステップS7）、検索する。それでも、一致する単語が存在しない場合は、更に次の空白までの文字列と空白をのぞいて結合し、検索する。以後、一致する単語が存在しない場合に、順次、最後の文字列まで結合しながら、マスタ部３を検索する。 If there is no matching word in step S5 in FIG. 3, if there is a character string next to a blank (step S6), the character string up to the next blank is combined with the blank (step S7), and the search is performed. To do. If there is still no matching word, the character string up to the next blank and the blank are combined and searched. Thereafter, when there is no matching word, the master unit 3 is searched while sequentially joining up to the last character string.

その際、一致する医薬品単語が存在した場合は、その行の医薬品単語は、正しく読み取っていることになる。よって、その行全体をそのまま認識結果とする（ステップS8）。その行の最後の文字列まで変換して一致する医薬品単語が存在しない場合、その行の医薬品単語は誤読している、あるいは、存在しない医薬品の可能性がある。よって、NGフラグを立て（ステップS9）、上位装置に認識結果とともに送出する。 At this time, if there is a matching drug word, the drug word in that line is read correctly. Therefore, the entire line is used as a recognition result as it is (step S8). If there is no matching drug word after conversion to the last character string on the line, the drug word on the line may be misread or may not exist. Therefore, an NG flag is set (step S9) and sent to the higher-level device together with the recognition result.

ここで、単語抽出処理部４の作用について、図５に示す医薬品の印字例を用いて説明する。
まず、文字認識部１では、１行目の最初の空白までの「ポピヨード液１０％」という文字コード列についてマスタ部３に一致する登録単語があるかどうか検索する。 Here, the effect | action of the word extraction process part 4 is demonstrated using the printing example of the pharmaceutical shown in FIG.
First, the character recognition unit 1 searches for a registered word that matches the master unit 3 for a character code string of “popiod liquid 10%” up to the first blank in the first line.

その際、「ポピヨード液１０％」の文字数は、９文字であるが、これはマスタ部３における登録文字数と一致する。よって、単語抽出処理部４は、マスタ部３の登録単語をそのまま抽出する。そして、一致する登録単語であるかどうかを判定する。 At this time, the number of characters of “popiod liquid 10%” is nine characters, which matches the number of registered characters in the master unit 3. Therefore, the word extraction processing unit 4 extracts the registered words of the master unit 3 as they are. Then, it is determined whether the registered word matches.

今説明している例では、「ポピヨード液１０％」は、マスタ部３に登録されており、一致する医薬品単語が存在することになる。これにより、その行の認識結果が上位装置に送出される。 In the example now described, “popiodine solution 10%” is registered in the master unit 3 and there is a matching drug word. As a result, the recognition result of the row is sent to the host device.

次の行の「ポラザG軟膏２．４ｇ」については、最初の空白までの文字列、すなわち、「ポラザG軟膏」という文字コード列と一致する登録単語は、マスタ部３に登録されていない。また最初の空白を除いて結合した文字列、すなわち、「ポラザG軟膏２．４ｇ」についても、一致する登録単語は、マスタ部３に登録されていない。このため、認識結果はNGとなり、NGフラグが立てられる。 For “Polaza G ointment 2.4 g” in the next line, the registered character string that matches the character string up to the first blank, that is, the character code string “Polaza G ointment” is not registered in the master unit 3. In addition, for the character string combined with the first blank removed, that is, “Polaza G ointment 2.4 g”, no matching registered word is registered in the master unit 3. For this reason, the recognition result is NG and an NG flag is set.

３行目の「イソジンゲル４ｇ」については、文字認識部１は、最初の空白までの「イソジンゲル」についてマスタ部３に一致する登録単語があるかどうか検索する。その際、「イソジンゲル」の文字数は、６文字であり、正式名称の「イソジンゲル10％」とは一致しないが、単語抽出処理部４は、マスタ部３の抽出文字数が６文字となっているので、その登録単語の頭文字から６文字を抽出して照合する。 For “Isodin gel 4g” on the third line, the character recognition unit 1 searches for “Isodingel” up to the first blank to see if there is a registered word that matches the master unit 3. At that time, the number of characters of “Isodingel” is 6 and does not match the official name “Isodingel 10%”. However, the word extraction processing unit 4 has 6 extracted characters in the master unit 3. , 6 characters are extracted from the initial of the registered word and collated.

このように、マスタ部３には「イソジンゲル１０％」があり、抽出文字数は６文字であるので、単語抽出処理部４は「イソジンゲル」を抽出し、一致する医薬品単語が存在することになり、その行の認識結果が上位装置に送出される。 In this way, the master unit 3 has “Isodine gel 10%” and the number of extracted characters is 6, so the word extraction processing unit 4 extracts “Isodine gel” and there is a matching drug word, The recognition result of the row is sent to the host device.

４行目の「フェニタレン坐剤５０ｍｇ１個」については、文字認識部１は、最初の空白までの「フェニタレン坐剤」についてマスタ部３に一致する登録単語があるかどうか検索する。その際、「フェニタレン坐剤」の文字数は８文字であり、単語抽出処理部４は、マスタ部３の抽出文字数が８文字の登録単語について、その登録単語の頭文字から８文字を抽出して一致単語があるかどうか検索する。 For the “phenitalene suppository 50 mg 1 piece” in the fourth line, the character recognition unit 1 searches for “phenitalene suppository” up to the first blank to see if there is a registered word that matches the master unit 3. At that time, the number of characters of “phenitalene suppository” is 8 characters, and the word extraction processing unit 4 extracts 8 characters from the initials of the registered words for the registered words whose number of extracted characters of the master unit 3 is 8 characters. Search for a matching word.

この場合、「フェニタレン坐剤」では一致する登録単語は存在せず、文字認識部１では、次の空白までの「５０ｍｇ」と結合して「フェニタレン坐剤５０ｍｇ」でマスタ部３を検索する。「フェニタレン坐剤５０ｍｇ」では一致するので、その行の認識結果が上位装置に送出される。 In this case, there is no matching registered word in “phenitalene suppository”, and the character recognition unit 1 searches for the master unit 3 by combining “50 mg” up to the next blank and “phenitalene suppository 50 mg”. Since “phenitalene suppository 50 mg” matches, the recognition result of that line is sent to the host device.

最後の「スペロン注射液１A」については、文字認識部１は、最初の空白までの「スペロン注射液」についてマスタ部３に一致する登録単語があるかどうか検索する。 For the last “speron injection solution 1A”, the character recognition unit 1 searches for a registered word that matches the master unit 3 for the “speron injection solution” up to the first blank.

その際、「スペロン注射液」の文字数は７文字であり、単語抽出処理部４は、マスタ部３の抽出文字数が７文字の登録単語について、その登録単語の頭文字から７文字を抽出して一致する登録単語があるかどうか検索する。 At that time, the number of characters of the “speron injection solution” is 7 characters, and the word extraction processing unit 4 extracts 7 characters from the initials of the registered words for the registered words whose number of extracted characters in the master unit 3 is 7 characters. Search for a matching registered word.

「スペロン注射液」の医薬品は２種類存在するが、「スペロン注射液」では一致せず、文字認識部１では、「１A」と結合して「スペロン注射液１A」でマスタ部３を検索する。「スペロン注射液１A」でも一致しないので、一致する医薬品単語が存在しないとして、行NGフラグを立て、その行の認識結果を上位装置に送出する。 There are two types of medicines of “Speron injection solution”, but “Speron injection solution” does not match, and character recognition unit 1 searches for master unit 3 by combining “1A” with “Speron injection solution 1A”. . Since “Speron injection solution 1A” does not match, a line NG flag is set and a recognition result of the line is sent to the host device, assuming that there is no matching drug word.

上記のようにして、マスタ部における各単語に対応して抽出文字数を登録し、抽出文字数診療報酬明細書に簡略化された表現で記載された医薬品名がレセプトに印字された場合でも、医薬品として特定でき、正式名称の医薬品名を上位装置に出力できる。 As described above, the number of extracted characters corresponding to each word in the master part is registered, and even when the name of the medicine written in a simplified expression on the number of extracted characters medical fee remuneration is printed on the receipt, It is possible to specify and output the official drug name to the host device.

図７は、本発明の実施例２に係る認識装置の構成図であり、図８は、そのマスタ部３’の構成図である。
図８に示すように、図７のマスタ部３’には、医薬品名とともに医薬品コードが登録されている。 FIG. 7 is a configuration diagram of the recognition apparatus according to the second embodiment of the present invention, and FIG. 8 is a configuration diagram of the master unit 3 ′.
As shown in FIG. 8, a medicine code is registered together with the medicine name in the master part 3 ′ of FIG.

これにより、マスタ部３’の検索の結果、一致する単語が存在する場合は、文字認識の結果が医薬品単語となり、上位装置にその文字認識の結果を送出するとともに、一致した単語に対応して登録されている医薬品コードを上位装置に送出する。 As a result, if there is a matching word as a result of the search of the master unit 3 ′, the character recognition result becomes a pharmaceutical word, and the character recognition result is sent to the host device, and the matching word is The registered drug code is sent to the host device.

一方、一致する単語が存在しない場合は、NGフラグを立て、文字認識の結果を上位装置に医薬品コードなしで送出する。 On the other hand, if there is no matching word, an NG flag is set and the result of character recognition is sent to the host device without a pharmaceutical code.

以上のように実施例２によれば、正式名称の医薬品名に付与される医薬品コードを、簡略名称の医薬品名にも付与することができる。その結果、レセプトの集計処理がより円滑に行える。 As described above, according to the second embodiment, the drug code given to the official drug name can also be given to the simple drug name. As a result, the receipt counting process can be performed more smoothly.

図９は、本発明の実施例３に係る認識装置の構成図であり、図１０は、そのマスタ部３”の構成図である。
図１０に示すように、図９のマスタ部３”には、医薬品名とともに医薬品コード及び検索ヒット数が登録されている。この検索ヒット数には、文字認識の結果として得られた単語がマスタ部３”で一致した過去の件数が登録される。 FIG. 9 is a configuration diagram of a recognition apparatus according to Embodiment 3 of the present invention, and FIG. 10 is a configuration diagram of the master unit 3 ″.
As shown in FIG. 10, a drug code and a search hit number are registered together with a drug name in the master unit 3 ″ of FIG. 9. In this search hit number, a word obtained as a result of character recognition is a master. The past number of records that coincided with part 3 ″ is registered.

また、検索ヒット数加算処理部５が設けられており、これにより、今回の文字認識の結果得られた単語がマスタ部３”で一致した場合にその一致した登録単語に対応する検索ヒット数の更新すなわち数値“１”の加算がなされる。そして、検索ヒット数出力部６が設けられており、これにより、更新後の検索ヒット数が上位装置に出力される。 In addition, a search hit number addition processing unit 5 is provided. When the word obtained as a result of the character recognition this time is matched in the master unit 3 ″, the number of search hits corresponding to the matched registered word is set. Update, that is, addition of the numerical value “1” is performed, and a search hit number output unit 6 is provided, whereby the updated search hit number is output to the host device.

これにより、マスタ部３”の検索の結果、一致する単語が存在する場合は、文字認識の結果が医薬品単語となり、上位装置にその文字認識の結果を送出するとともに、一致した単語に対応して登録されている医薬品コード及び更新した検索ヒット数を上位装置に送出する。 As a result, when there is a matching word as a result of the search of the master unit 3 ″, the result of character recognition becomes a pharmaceutical word, and the result of character recognition is sent to the host device, and the corresponding word is matched. The registered drug code and the updated number of search hits are sent to the host device.

一方、一致する単語が存在しない場合は、NGフラグを立て、文字認識の結果を上位装置に医薬品コードなしで更新しない検索ヒット数とともに送出する。 On the other hand, if there is no matching word, an NG flag is set and the result of character recognition is sent to the host device together with the number of search hits that are not updated without a pharmaceutical code.

以上のように実施例３によれば、正式名称の医薬品名であるか、簡略名称の医薬品名であるかにかかわらず、医薬品ごとに検索ヒット数を集計することができる。その結果、レセプトの集計処理がより円滑に行える。 As described above, according to the third embodiment, the number of search hits can be tabulated for each drug regardless of whether the drug name is an official name or a simple drug name. As a result, the receipt counting process can be performed more smoothly.

図１１は、本発明の実施例４に係る認識装置の構成図である。
図１１の装置は、図１０に示す実施例３のものに、テキストファイル出力部７を設けており、その他の構成は、図１０と同様である。 FIG. 11 is a configuration diagram of a recognition apparatus according to Embodiment 4 of the present invention.
The apparatus of FIG. 11 is provided with a text file output unit 7 in the third embodiment shown in FIG. 10, and the other configuration is the same as that of FIG.

テキストファイル出力部７は、認識結果、コード、及び検索ヒット数、すなわち文字認識の結果として得られた漢字コード(JISコード、医薬品コード、及びマスタ部３”における検索ヒット数を、テキスト形式(CSV形式等)で出力する。 The text file output unit 7 displays the recognition result, code, and search hit number, that is, the search hit number in the kanji code (JIS code, pharmaceutical code, and master unit 3 ″ obtained as a result of character recognition in text format (CSV Format).

以上のように実施例４によれば、医薬品名、医薬品コード、及び検索ヒット数を、テキスト形式で出力するため、上述した実施例１から実施例３までの効果に加えて、他のアプリケーションへのデータの受け渡しを容易に行うことができるという効果を有する。 As described above, according to the fourth embodiment, the drug name, the drug code, and the number of search hits are output in a text format. Therefore, in addition to the effects of the first to third embodiments described above, to other applications. The data can be easily exchanged.

本発明の実施例１に係る認識装置の構成を示すブロック図である。It is a block diagram which shows the structure of the recognition apparatus which concerns on Example 1 of this invention. 抽出文字数が登録してあるマスタ部の例の説明図である。It is explanatory drawing of the example of the master part in which the number of extracted characters is registered. 本発明に係る文字認識部の処理手順を説明するフローチャートである。It is a flowchart explaining the process sequence of the character recognition part which concerns on this invention. 図２のマスタ部の検索処理手順を説明するフローチャートである。It is a flowchart explaining the search processing procedure of the master part of FIG. 医薬品の印字例の説明図である。It is explanatory drawing of the example of printing of a pharmaceutical. レセプト（診療報酬明細書）の概略図（医科入院外）の説明図である。It is explanatory drawing of the schematic (not medical hospitalization) of a receipt (medical fee detailed statement). 本発明の実施例２に係る認識装置の構成を示すブロック図である。It is a block diagram which shows the structure of the recognition apparatus which concerns on Example 2 of this invention. 抽出文字数が登録してあるマスタ部の例で単語名対応コードの追加を行った場合の説明図である。It is explanatory drawing at the time of adding the word name corresponding | compatible code in the example of the master part in which the number of extracted characters is registered. 本発明の実施例３に係る認識装置の構成を示すブロック図である。It is a block diagram which shows the structure of the recognition apparatus which concerns on Example 3 of this invention. 抽出文字数が登録してあるマスタ部の例で検索ヒット数を追加した場合の説明図である。It is explanatory drawing at the time of adding the number of search hits in the example of the master part in which the number of extracted characters is registered. 本発明の実施例４に係る認識装置の構成を示すブロック図である。It is a block diagram which shows the structure of the recognition apparatus which concerns on Example 4 of this invention. 従来の認識装置の構成を示すブロック図である。It is a block diagram which shows the structure of the conventional recognition apparatus.

Explanation of symbols

１文字認識部
２摘要欄部
３マスタ部
４単語抽出処理部
1 Character recognition part 2 Summary column part 3 Master part 4 Word extraction processing part

Claims

In the case of optically reading an image of a document, recognizing each character cut out from the image to generate each character code, checking the character code string against a registered word, and determining a matching word,
Of the constituent characters of the registered word, limit the characters to be collated,
A character recognition processing method comprising: collating the character code string with a limited constituent character string of the registered word.

The character recognition processing method according to claim 1, wherein the character to be collated is limited by the number of extracted characters registered corresponding to the registered word.

In the case of optically reading the image of the medical remuneration statement, recognizing each character cut out from the image and generating each character code, checking the character code string against a registered word and determining a matching word,
Of the constituent characters of the registered word, limit the characters to be collated,
A method for recognizing the contents of a medical remuneration statement, wherein the character code string is collated with a limited constituent character string of the registered word.

The content recognition processing method of the medical remuneration statement according to claim 3, wherein the character to be collated is limited by the number of extracted characters registered corresponding to the registered word.

The registered word is a drug name, a drug code is defined for each drug, the drug code corresponding to the drug name is registered, and a drug that matches the character code string as a result of the verification 4. The method for recognizing the contents of a medical fee remuneration statement according to claim 3, wherein when a name is searched, a drug code corresponding to the drug name is output.

The registered word is a drug name, the search hit number for the drug name corresponding to the drug name is registered, and when the drug name matching the character code string is searched as a result of the collation, the drug 4. The method for recognizing the contents of a medical remuneration statement according to claim 3, wherein the number of search hits corresponding to the name is updated.

A content recognition process for optically reading an image of a medical remuneration statement, generating each character code by recognizing each character cut out from the image, and comparing the character code string with a registered word to determine a matching word In the device
A master unit for registering the formal name of the registered word;
Of the constituent characters of the registered word, a word extraction processing unit that limits characters to be collated;
A content recognition processing device for medical remuneration statement, comprising: a character recognition unit that collates the character code string with a limited constituent character string of the registered word.